Miyakogusa Predicted Gene

Lj0g3v0313719.3
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0313719.3 Non Chatacterized Hit- tr|I1KWS2|I1KWS2_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.33907
PE,72.43,0,Cse1,Exportin/Importin, Cse1-like; ARM
repeat,Armadillo-type fold; no description,Armadillo-like
hel,CUFF.21186.3
         (581 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G59020.1 | Symbols:  | ARM repeat superfamily protein | chr3:...   761   0.0  
AT3G59020.2 | Symbols:  | ARM repeat superfamily protein | chr3:...   760   0.0  
AT2G31660.1 | Symbols: SAD2, URM9 | ARM repeat superfamily prote...   751   0.0  
AT3G17340.2 | Symbols:  | ARM repeat superfamily protein | chr3:...    64   3e-10
AT3G17340.1 | Symbols:  | ARM repeat superfamily protein | chr3:...    64   3e-10

>AT3G59020.1 | Symbols:  | ARM repeat superfamily protein |
           chr3:21810973-21817418 REVERSE LENGTH=1029
          Length = 1029

 Score =  761 bits (1964), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/583 (63%), Positives = 454/583 (77%), Gaps = 6/583 (1%)

Query: 4   QHRMLQSDEDMVRDNILIFITQVPPLLRIQLGECLKTIINSDYPVKWPHPLEWVKHNLRC 63
           Q+ +L SD+++VR+ IL+F++QVPP+LR+Q+GECLKTII +DYP +WP  L+WVK NL+ 
Sbjct: 78  QNIILPSDKNVVRNQILVFVSQVPPILRVQMGECLKTIIYADYPEQWPELLDWVKQNLQ- 136

Query: 64  EEEVYCALFVLQILSKRFMLISTKKGAPGYHIVEETFPHLRYIFNRLVQIVNPSLEAADL 123
           + +VY ALFVL+ILS ++   S +  AP + +VEETFPHL  IFN LV + NPSLE AD 
Sbjct: 137 KPQVYGALFVLRILSSKYEFKSDEDRAPIHRVVEETFPHLLNIFNNLVHVENPSLEVADH 196

Query: 124 IKFICKIFWYSIHVDIPQHLFDQDIFDVWMVSFLNLLERPVPSEGQPVDPELRKSWGWWK 183
           IK ICKIFW  I++++P+ LFD + F+ WM  FLN+LERPVP EGQP DPELRKSWGWWK
Sbjct: 197 IKLICKIFWSCIYLELPRPLFDPNFFNAWMGLFLNILERPVPVEGQPEDPELRKSWGWWK 256

Query: 184 VKKWIANILNRLFTRFGQFEMVFSQRTVFT-MFHKHYAGKILECHLNLLNVVRFGDYLPD 242
            KKWIA+ILNRL+TRFG  ++       F  MF  +YA KILECHL LLN +R G YLPD
Sbjct: 257 AKKWIAHILNRLYTRFGDLKLQNPDNKAFAQMFQINYAAKILECHLKLLNAIRIGGYLPD 316

Query: 243 RVINLILQYLTISISRRSLYALIQPRLDVLLFEIIFPLICFNDNDQKLWDEDPQEYARKG 302
           RVINLILQYL+ SIS+ S+Y L+QP L+ LLFEI+FPL+CFNDNDQ LWDEDP EY RKG
Sbjct: 317 RVINLILQYLSNSISKSSMYNLLQPHLNTLLFEIVFPLMCFNDNDQMLWDEDPHEYVRKG 376

Query: 303 YDIFEDMHSPSTAAMDFVSELVRKRGKENLQKCIQFIVETFKRYDEASIEYKPYRQKDGA 362
           YDI ED++SP TA+MDFV+ELVRKRGKEN  K IQF+V+ FKRY+EAS+E KPYR KDGA
Sbjct: 377 YDIIEDLYSPRTASMDFVTELVRKRGKENFPKFIQFVVDIFKRYNEASLENKPYRLKDGA 436

Query: 363 LRVFGTLHEKFKEIEPYKSELEHMLVHHVFPELNSPVGHLRAKAAWVAGQYAHISFSYQN 422
           L   GTL +K ++ EPYKSELE+MLV HVFPE +SP GHLRAKAAWVAGQYA+I FS Q+
Sbjct: 437 LLAVGTLCDKLRQTEPYKSELENMLVQHVFPEFSSPAGHLRAKAAWVAGQYANIDFSDQS 496

Query: 423 NFQRALQCIVLRLQDSELPVRVDSFVALRSFIEACKDMNEIFPILPRLLDEFSKLMNEVE 482
           NF +AL C++  + D ELPVRVDS  ALRSFIEACKD++EI P+LP+LLDEF KLM EVE
Sbjct: 497 NFSKALHCVISGMCDLELPVRVDSVFALRSFIEACKDLDEIRPVLPQLLDEFFKLMKEVE 556

Query: 483 NEALVFTLETMLDKFGED---FALALYYNLAGVFWRRMNTIKDDDEASRNRTTGS-GCLR 538
           NE L FTLET++ KFGE+   +AL L  NLA  FWR ++T   DDE        + GCLR
Sbjct: 557 NEDLAFTLETIVYKFGEEISPYALGLCQNLASAFWRCIDTDNGDDETDDAGALAAVGCLR 616

Query: 539 AISIILESVSCLPDLFVQIEPTFVLIMRRMLTNNDQESFEKVL 581
           AIS ILES+S LP L+ QIEP  + IMR+MLT + Q+ FE+VL
Sbjct: 617 AISTILESISSLPHLYGQIEPQLLPIMRKMLTTDGQDVFEEVL 659


>AT3G59020.2 | Symbols:  | ARM repeat superfamily protein |
           chr3:21810973-21817418 REVERSE LENGTH=1030
          Length = 1030

 Score =  760 bits (1963), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/583 (63%), Positives = 454/583 (77%), Gaps = 6/583 (1%)

Query: 4   QHRMLQSDEDMVRDNILIFITQVPPLLRIQLGECLKTIINSDYPVKWPHPLEWVKHNLRC 63
           Q+ +L SD+++VR+ IL+F++QVPP+LR+Q+GECLKTII +DYP +WP  L+WVK NL+ 
Sbjct: 78  QNIILPSDKNVVRNQILVFVSQVPPILRVQMGECLKTIIYADYPEQWPELLDWVKQNLQ- 136

Query: 64  EEEVYCALFVLQILSKRFMLISTKKGAPGYHIVEETFPHLRYIFNRLVQIVNPSLEAADL 123
           + +VY ALFVL+ILS ++   S +  AP + +VEETFPHL  IFN LV + NPSLE AD 
Sbjct: 137 KPQVYGALFVLRILSSKYEFKSDEDRAPIHRVVEETFPHLLNIFNNLVHVENPSLEVADH 196

Query: 124 IKFICKIFWYSIHVDIPQHLFDQDIFDVWMVSFLNLLERPVPSEGQPVDPELRKSWGWWK 183
           IK ICKIFW  I++++P+ LFD + F+ WM  FLN+LERPVP EGQP DPELRKSWGWWK
Sbjct: 197 IKLICKIFWSCIYLELPRPLFDPNFFNAWMGLFLNILERPVPVEGQPEDPELRKSWGWWK 256

Query: 184 VKKWIANILNRLFTRFGQFEMVFSQRTVFT-MFHKHYAGKILECHLNLLNVVRFGDYLPD 242
            KKWIA+ILNRL+TRFG  ++       F  MF  +YA KILECHL LLN +R G YLPD
Sbjct: 257 AKKWIAHILNRLYTRFGDLKLQNPDNKAFAQMFQINYAAKILECHLKLLNAIRIGGYLPD 316

Query: 243 RVINLILQYLTISISRRSLYALIQPRLDVLLFEIIFPLICFNDNDQKLWDEDPQEYARKG 302
           RVINLILQYL+ SIS+ S+Y L+QP L+ LLFEI+FPL+CFNDNDQ LWDEDP EY RKG
Sbjct: 317 RVINLILQYLSNSISKSSMYNLLQPHLNTLLFEIVFPLMCFNDNDQMLWDEDPHEYVRKG 376

Query: 303 YDIFEDMHSPSTAAMDFVSELVRKRGKENLQKCIQFIVETFKRYDEASIEYKPYRQKDGA 362
           YDI ED++SP TA+MDFV+ELVRKRGKEN  K IQF+V+ FKRY+EAS+E KPYR KDGA
Sbjct: 377 YDIIEDLYSPRTASMDFVTELVRKRGKENFPKFIQFVVDIFKRYNEASLENKPYRLKDGA 436

Query: 363 LRVFGTLHEKFKEIEPYKSELEHMLVHHVFPELNSPVGHLRAKAAWVAGQYAHISFSYQN 422
           L   GTL +K ++ EPYKSELE+MLV HVFPE +SP GHLRAKAAWVAGQYA+I FS Q+
Sbjct: 437 LLAVGTLCDKLRQTEPYKSELENMLVQHVFPEFSSPAGHLRAKAAWVAGQYANIDFSDQS 496

Query: 423 NFQRALQCIVLRLQDSELPVRVDSFVALRSFIEACKDMNEIFPILPRLLDEFSKLMNEVE 482
           NF +AL C++  + D ELPVRVDS  ALRSFIEACKD++EI P+LP+LLDEF KLM EVE
Sbjct: 497 NFSKALHCVISGMCDLELPVRVDSVFALRSFIEACKDLDEIRPVLPQLLDEFFKLMKEVE 556

Query: 483 NEALVFTLETMLDKFGED---FALALYYNLAGVFWRRMNTIKDDDEASRNRTTGS-GCLR 538
           NE L FTLET++ KFGE+   +AL L  NLA  FWR ++T   DDE        + GCLR
Sbjct: 557 NEDLAFTLETIVYKFGEEISPYALGLCQNLASAFWRCIDTDNGDDETDDAGALAAVGCLR 616

Query: 539 AISIILESVSCLPDLFVQIEPTFVLIMRRMLTNNDQESFEKVL 581
           AIS ILES+S LP L+ QIEP  + IMR+MLT + Q+ FE+VL
Sbjct: 617 AISTILESISSLPHLYGQIEPQLLPIMRKMLTTDGQDVFEEVL 659


>AT2G31660.1 | Symbols: SAD2, URM9 | ARM repeat superfamily protein
           | chr2:13464519-13471353 FORWARD LENGTH=1040
          Length = 1040

 Score =  751 bits (1939), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/583 (63%), Positives = 465/583 (79%), Gaps = 6/583 (1%)

Query: 4   QHRMLQSDEDMVRDNILIFITQVPPLLRIQLGECLKTIINSDYPVKWPHPLEWVKHNLRC 63
           Q ++ +SD+++VRDNIL+++TQVP LLR QLGE LKTII +DYP +WP  L+WVK+NL+ 
Sbjct: 82  QQQIFESDKELVRDNILVYVTQVPTLLRSQLGESLKTIIYADYPEQWPRLLDWVKYNLQ- 140

Query: 64  EEEVYCALFVLQILSKRFMLISTKKGAPGYHIVEETFPHLRYIFNRLVQIVNPSLEAADL 123
            +++Y ALFVL+ILS+++   S ++  P   IVEETFP L  IFN L+QI NPSLE A+L
Sbjct: 141 NQQIYGALFVLRILSRKYEFKSDEERTPVSRIVEETFPQLLTIFNGLIQIPNPSLEIAEL 200

Query: 124 IKFICKIFWYSIHVDIPQHLFDQDIFDVWMVSFLNLLERPVPSEGQPVDPELRKSWGWWK 183
           +K ICKIFW SI++++P+ LFD ++F+ WMV FL++ ERPVP EGQP+DPELRKSWGWWK
Sbjct: 201 MKLICKIFWSSIYLELPRQLFDLNVFNAWMVLFLSVSERPVPVEGQPMDPELRKSWGWWK 260

Query: 184 VKKWIANILNRLFTRFGQFEMVFSQRTVFT-MFHKHYAGKILECHLNLLNVVRFGDYLPD 242
           VKKW  +ILNRL++RFG  ++   +   F  MF K+YAG+ILE HLN LN +R G YLPD
Sbjct: 261 VKKWTVHILNRLYSRFGDPKLQSPENKPFAQMFQKNYAGRILEGHLNFLNTIRVGGYLPD 320

Query: 243 RVINLILQYLTISISRRSLYALIQPRLDVLLFEIIFPLICFNDNDQKLWDEDPQEYARKG 302
           RVINL+LQYL+ SIS+ S+Y L+ PRLDVLLFEI+FPL+CFNDNDQKLW+EDP EY RKG
Sbjct: 321 RVINLLLQYLSNSISKNSMYKLLLPRLDVLLFEIVFPLMCFNDNDQKLWEEDPHEYVRKG 380

Query: 303 YDIFEDMHSPSTAAMDFVSELVRKRGKENLQKCIQFIVETFKRYDEASIEYKPYRQKDGA 362
           Y+I ED++SP TA+MDFV+ELVRKRGKENL K ++F+VE F  Y++A++E KPYRQKDGA
Sbjct: 381 YNIIEDLYSPRTASMDFVNELVRKRGKENLPKFVKFVVEIFLSYEKATVEEKPYRQKDGA 440

Query: 363 LRVFGTLHEKFKEIEPYKSELEHMLVHHVFPELNSPVGHLRAKAAWVAGQYAHISFSYQN 422
           +   G L +K K+ +PYKS+LE MLV H+FP+ NSPVGHLRAKAAWVAGQYAHI+FS QN
Sbjct: 441 MLAVGALCDKLKQTDPYKSQLELMLVQHIFPDFNSPVGHLRAKAAWVAGQYAHINFSDQN 500

Query: 423 NFQRALQCIVLRLQDSELPVRVDSFVALRSFIEACKDMNEIFPILPRLLDEFSKLMNEVE 482
           NF++AL  +V  L+D +LPVRVDS  ALRSF+EACKD+NEI PILP+LLDEF KLMNEVE
Sbjct: 501 NFRKALHSVVSGLRDPDLPVRVDSVFALRSFVEACKDLNEIRPILPQLLDEFFKLMNEVE 560

Query: 483 NEALVFTLETMLDKFGED---FALALYYNLAGVFWRRMNTIK-DDDEASRNRTTGSGCLR 538
           NE LVFTLET++DKFGE+   FA  L  NLA  FWR +NT + +DD          GCLR
Sbjct: 561 NEDLVFTLETIVDKFGEEMAPFAFGLCQNLAAAFWRCLNTSEANDDSDDMGALAAVGCLR 620

Query: 539 AISIILESVSCLPDLFVQIEPTFVLIMRRMLTNNDQESFEKVL 581
           AIS ILESVS LP LFV+IEPT + IM++MLT + QE FE+VL
Sbjct: 621 AISTILESVSSLPQLFVEIEPTILPIMQKMLTTDGQEVFEEVL 663


>AT3G17340.2 | Symbols:  | ARM repeat superfamily protein |
           chr3:5920613-5926846 REVERSE LENGTH=1093
          Length = 1093

 Score = 63.9 bits (154), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 76/343 (22%), Positives = 146/343 (42%), Gaps = 57/343 (16%)

Query: 216 HKHYAGK----ILECHLNLL----NVVRFGDYLPDRVINLILQYLTISISRRSLYALIQP 267
           H+ Y+ K    I+ C + ++    N+ + G  L +R+I+L    ++  +     + L+ P
Sbjct: 290 HRKYSDKLVPEIINCSMKIVKHSSNIGKLG-CLTERIISLAFDVISRVMEIGPGWRLLSP 348

Query: 268 RLDVLLFEIIFPLICFNDNDQKLWDEDPQEYARKGYDI--------FEDMHSPSTAAMDF 319
               LL   IFP +  N+ D   W+ED  E+ RK             +D+ +   +AM+ 
Sbjct: 349 HFSFLLDSAIFPALVLNERDISEWEEDADEFIRKNLPSELEEISGWRDDLFTARKSAMNL 408

Query: 320 VSELV-------------------RKRGKENL---QKCIQ--FIVETFKRYDEASIEYKP 355
           +  L                    RK+G++N    Q+C+    ++    ++   S  YK 
Sbjct: 409 LCVLAMSKGPPVSTTNTASPAACKRKKGEKNRGNNQRCMGDLLVLPFLSKFPVPSKSYKL 468

Query: 356 YRQKD----GALRVFGTLHEKFKEIEPYKSELEHMLVHHVFPELNSP--VGHLRAKAAWV 409
                    G L  +G+L E  +E  P    +   +   V P  ++P    +L A A WV
Sbjct: 469 DASTSAAYFGVLMAYGSLQEFIQEQNP--EYVASFVRTRVLPIYSTPDCSPYLVASANWV 526

Query: 410 AGQYAHISFSYQNN--FQRALQCIVL--RLQDSELPVRVDSFVALRSFIEACKDMNEIFP 465
            G+ A       N   F   L+ + +  +++ S  PVR  +   + S +E      E+ P
Sbjct: 527 LGELASCLPEEMNADVFSSLLKALAMPDQVEISCYPVRFSAAGGIGSLLENEYQPPELLP 586

Query: 466 ILPRLLDEFSKLMNEVENEALVFT-LETMLDKFGEDFALALYY 507
           +L  +     K+ NE + ++++F  L+++++   +D A+ + Y
Sbjct: 587 LLQFIT---GKIGNEEDEDSMLFQLLKSVVESGNQDIAMHIPY 626


>AT3G17340.1 | Symbols:  | ARM repeat superfamily protein |
           chr3:5920613-5926846 REVERSE LENGTH=1090
          Length = 1090

 Score = 63.5 bits (153), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 76/343 (22%), Positives = 146/343 (42%), Gaps = 57/343 (16%)

Query: 216 HKHYAGK----ILECHLNLL----NVVRFGDYLPDRVINLILQYLTISISRRSLYALIQP 267
           H+ Y+ K    I+ C + ++    N+ + G  L +R+I+L    ++  +     + L+ P
Sbjct: 290 HRKYSDKLVPEIINCSMKIVKHSSNIGKLG-CLTERIISLAFDVISRVMEIGPGWRLLSP 348

Query: 268 RLDVLLFEIIFPLICFNDNDQKLWDEDPQEYARKGYDI--------FEDMHSPSTAAMDF 319
               LL   IFP +  N+ D   W+ED  E+ RK             +D+ +   +AM+ 
Sbjct: 349 HFSFLLDSAIFPALVLNERDISEWEEDADEFIRKNLPSELEEISGWRDDLFTARKSAMNL 408

Query: 320 VSELV-------------------RKRGKENL---QKCIQ--FIVETFKRYDEASIEYKP 355
           +  L                    RK+G++N    Q+C+    ++    ++   S  YK 
Sbjct: 409 LCVLAMSKGPPVSTTNTASPAACKRKKGEKNRGNNQRCMGDLLVLPFLSKFPVPSKSYKL 468

Query: 356 YRQKD----GALRVFGTLHEKFKEIEPYKSELEHMLVHHVFPELNSP--VGHLRAKAAWV 409
                    G L  +G+L E  +E  P    +   +   V P  ++P    +L A A WV
Sbjct: 469 DASTSAAYFGVLMAYGSLQEFIQEQNP--EYVASFVRTRVLPIYSTPDCSPYLVASANWV 526

Query: 410 AGQYAHISFSYQNN--FQRALQCIVL--RLQDSELPVRVDSFVALRSFIEACKDMNEIFP 465
            G+ A       N   F   L+ + +  +++ S  PVR  +   + S +E      E+ P
Sbjct: 527 LGELASCLPEEMNADVFSSLLKALAMPDQVEISCYPVRFSAAGGIGSLLENEYQPPELLP 586

Query: 466 ILPRLLDEFSKLMNEVENEALVFT-LETMLDKFGEDFALALYY 507
           +L  +     K+ NE + ++++F  L+++++   +D A+ + Y
Sbjct: 587 LLQFIT---GKIGNEEDEDSMLFQLLKSVVESGNQDIAMHIPY 626