Miyakogusa Predicted Gene

Lj1g3v3834170.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v3834170.1 Non Characterized Hit- tr|I1MLK7|I1MLK7_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.4215
PE=,71.26,0,seg,NULL; DILUTE,Dilute; NT-C2,EEIG1/EHBP1 N-terminal
domain; coiled-coil,NULL; FAMILY NOT NAMED,NUL,CUFF.31256.1
         (1048 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G01810.3 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   565   e-161
AT3G01810.1 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   565   e-161
AT3G01810.2 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   503   e-142
AT2G42320.2 | Symbols:  | nucleolar protein gar2-related | chr2:...   452   e-127
AT2G42320.1 | Symbols:  | nucleolar protein gar2-related | chr2:...   452   e-127
AT3G57780.1 | Symbols:  | BEST Arabidopsis thaliana protein matc...   451   e-126
AT5G43230.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   438   e-123
AT5G06930.1 | Symbols:  | LOCATED IN: chloroplast; EXPRESSED IN:...   360   3e-99

>AT3G01810.3 | Symbols:  | FUNCTIONS IN: molecular_function unknown;
            INVOLVED IN: biological_process unknown; EXPRESSED IN: 21
            plant structures; EXPRESSED DURING: 13 growth stages;
            BEST Arabidopsis thaliana protein match is: nucleolar
            protein gar2-related (TAIR:AT2G42320.2). |
            chr3:289218-292557 FORWARD LENGTH=921
          Length = 921

 Score =  565 bits (1457), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 304/604 (50%), Positives = 378/604 (62%), Gaps = 63/604 (10%)

Query: 465  EKLKYVKSVRSSADIARSISLGSNHHAEVKENGFNGDAQNSGGNIRSSDKREAKIYPREA 524
            EK + VKSVRSS DI RS S  S                        S+++EAK+YP   
Sbjct: 361  EKSRKVKSVRSSLDINRSNSRLSLF----------------------SERKEAKVYPNST 398

Query: 525  RNNILDSKVEHLESKIKMLEGELREAAAIEASLYSVVAEHGSSMGKVHAPARRLSRLYLH 584
             +  L+SK+++LES++K LEGEL EAAAIEA+LYSVVAEHGSS  KVHAPARRL RLYLH
Sbjct: 399  HDTTLESKIKNLESRVKKLEGELCEAAAIEAALYSVVAEHGSSSSKVHAPARRLLRLYLH 458

Query: 585  ACKENIQARRSGAAKSAISGLVLVAKACGNDVPRLTFWLSNSIVLRTIISKTTKDVAPSN 644
            AC+E   +RR+ AA+SA+SGLVLVAKACGNDVPRLTFWLSN+IVLRTIIS T+ +     
Sbjct: 459  ACRETHLSRRANAAESAVSGLVLVAKACGNDVPRLTFWLSNTIVLRTIISDTSAEEELPV 518

Query: 645  PAVSSTRRKSGEGNGKIAQSLIWKGYSHKKSENTAIEFGGIGNWDDPNVFTSALEKVEAW 704
             A    R++  E   +   SL WK     K      +    G WDDP  F +ALEKVEAW
Sbjct: 519  SAGPGPRKQKAERETEKRSSLKWKDSPLSKK-----DIKSFGAWDDPVTFITALEKVEAW 573

Query: 705  LFSRIVESIWWQSLTPHMQ------------------KSYTKMSGTCDQDLGNLSLDIWK 746
            +FSR+VESIWWQ+LTP MQ                  K++ +   + +Q+LG+ SL++WK
Sbjct: 574  IFSRVVESIWWQTLTPRMQSSAASTREFDKGNGSASKKTFGRTPSSTNQELGDFSLELWK 633

Query: 747  NAFREACERICPLRAGRHECGCLSVLPRLIMEQCIARLDVAMFNAILRESVXXXXXXXXX 806
             AFREA ER+CPLR   HECGCL +  RLIMEQC+ARLDVAMFNAILR+S          
Sbjct: 634  KAFREAHERLCPLRGSGHECGCLPIPARLIMEQCVARLDVAMFNAILRDSDDNFPTDPVS 693

Query: 807  XXXXXXKVLPIPPGKSSFGAGAQLKTAIGNWSRWLTDLFGIXXXXXXXXXXXXXXXXXXG 866
                  +VLPIP   SSFG+GAQLK +IGNWSRWLTDLFGI                   
Sbjct: 694  DPIADLRVLPIPSRTSSFGSGAQLKNSIGNWSRWLTDLFGIDDEDDDSSDENSYV----- 748

Query: 867  RQNTSFKPFHLLNALSDLLMLPKDMLLSESIRKEVCPMFNASQIKKILDNFVPDEFCPDP 926
                SFK F+LL ALSDL+MLPKDMLL+ S+RKEVCPMF A  IK++L+NFVPDEFCPDP
Sbjct: 749  --EKSFKTFNLLKALSDLMMLPKDMLLNSSVRKEVCPMFGAPLIKRVLNNFVPDEFCPDP 806

Query: 927  IPTDVFEALDSKDDLEDGKDSVNNFPCIAAPIVYSPPPATTIASITGDIGSES--QLXXX 984
            +P  V ++L+S+++ E  K  + ++PC A   VY PP  T+I++I G+ G     QL   
Sbjct: 807  VPDAVLKSLESEEEAE--KSIITSYPCTAPSPVYCPPSRTSISTIIGNFGQPQAPQLSRI 864

Query: 985  XXXXXXXXYTSDDELDELNSPLSSILFSGSPSPVSTKPNWKKKESRTESAVRYELLRNVW 1044
                    YTSDDELDEL+SPL+ ++   + S        K      +  +RY+LLR  W
Sbjct: 865  RSSITRKAYTSDDELDELSSPLAVVVLQQAGSK-------KINNGDADETIRYQLLRECW 917

Query: 1045 MNSE 1048
            MN E
Sbjct: 918  MNGE 921



 Score =  243 bits (620), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 132/232 (56%), Positives = 173/232 (74%), Gaps = 4/232 (1%)

Query: 1   MKGKNRRSGGIVQLDYLIHIEEIKPWPPSQSLRSCRSVLIQWENGERSSGSTNIASPSLG 60
           +  KNRR    VQ+DYLIHI +IKPWPPSQSLRS RSV+IQWENG+R+SG+T++ +PSLG
Sbjct: 5   LSSKNRRCSS-VQVDYLIHIHDIKPWPPSQSLRSLRSVVIQWENGDRNSGTTSVVAPSLG 63

Query: 61  SIVGEGRIEFNESFRLPVALLRDMSVKNSDAVVFQRNCLEFNLYEPRRDKIVKGQLLATA 120
           S++GEG+IEFNESF+LP+ LL+D+S +     VF +N LE NLYEPRR+K    QLLATA
Sbjct: 64  SVIGEGKIEFNESFKLPLTLLKDVSARGKGGDVFFKNVLELNLYEPRREKT--HQLLATA 121

Query: 121 IVDLADCGILRETLSISAPLNCKRSYRNTDQSFLFIKIEPVEKNRARPSLKDRLSKD-NN 179
            +DLA  G+++E+ S++A +N KRSYRN  Q  L++ I+PV + RA  S  + L  +  N
Sbjct: 122 TIDLAVYGVVKESFSLTAQMNSKRSYRNATQPVLYLTIQPVSRRRASSSSMNSLKDEAKN 181

Query: 180 GSDSVSALMNGEYAEEAEIASFTDDDVSSHSSLAAITTSPESSGCMPREHEE 231
           G +SVSALMN EY +EAEIAS TDDD+SSHSSL   +++ ES+G      EE
Sbjct: 182 GGESVSALMNEEYYKEAEIASITDDDISSHSSLTVSSSTLESNGGFSVRTEE 233


>AT3G01810.1 | Symbols:  | FUNCTIONS IN: molecular_function unknown;
            INVOLVED IN: biological_process unknown; LOCATED IN:
            plasma membrane; EXPRESSED IN: 21 plant structures;
            EXPRESSED DURING: 13 growth stages; BEST Arabidopsis
            thaliana protein match is: nucleolar protein gar2-related
            (TAIR:AT2G42320.2); Has 1327 Blast hits to 470 proteins
            in 132 species: Archae - 2; Bacteria - 131; Metazoa -
            139; Fungi - 114; Plants - 114; Viruses - 0; Other
            Eukaryotes - 827 (source: NCBI BLink). |
            chr3:289218-292557 FORWARD LENGTH=921
          Length = 921

 Score =  565 bits (1457), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 304/604 (50%), Positives = 378/604 (62%), Gaps = 63/604 (10%)

Query: 465  EKLKYVKSVRSSADIARSISLGSNHHAEVKENGFNGDAQNSGGNIRSSDKREAKIYPREA 524
            EK + VKSVRSS DI RS S  S                        S+++EAK+YP   
Sbjct: 361  EKSRKVKSVRSSLDINRSNSRLSLF----------------------SERKEAKVYPNST 398

Query: 525  RNNILDSKVEHLESKIKMLEGELREAAAIEASLYSVVAEHGSSMGKVHAPARRLSRLYLH 584
             +  L+SK+++LES++K LEGEL EAAAIEA+LYSVVAEHGSS  KVHAPARRL RLYLH
Sbjct: 399  HDTTLESKIKNLESRVKKLEGELCEAAAIEAALYSVVAEHGSSSSKVHAPARRLLRLYLH 458

Query: 585  ACKENIQARRSGAAKSAISGLVLVAKACGNDVPRLTFWLSNSIVLRTIISKTTKDVAPSN 644
            AC+E   +RR+ AA+SA+SGLVLVAKACGNDVPRLTFWLSN+IVLRTIIS T+ +     
Sbjct: 459  ACRETHLSRRANAAESAVSGLVLVAKACGNDVPRLTFWLSNTIVLRTIISDTSAEEELPV 518

Query: 645  PAVSSTRRKSGEGNGKIAQSLIWKGYSHKKSENTAIEFGGIGNWDDPNVFTSALEKVEAW 704
             A    R++  E   +   SL WK     K      +    G WDDP  F +ALEKVEAW
Sbjct: 519  SAGPGPRKQKAERETEKRSSLKWKDSPLSKK-----DIKSFGAWDDPVTFITALEKVEAW 573

Query: 705  LFSRIVESIWWQSLTPHMQ------------------KSYTKMSGTCDQDLGNLSLDIWK 746
            +FSR+VESIWWQ+LTP MQ                  K++ +   + +Q+LG+ SL++WK
Sbjct: 574  IFSRVVESIWWQTLTPRMQSSAASTREFDKGNGSASKKTFGRTPSSTNQELGDFSLELWK 633

Query: 747  NAFREACERICPLRAGRHECGCLSVLPRLIMEQCIARLDVAMFNAILRESVXXXXXXXXX 806
             AFREA ER+CPLR   HECGCL +  RLIMEQC+ARLDVAMFNAILR+S          
Sbjct: 634  KAFREAHERLCPLRGSGHECGCLPIPARLIMEQCVARLDVAMFNAILRDSDDNFPTDPVS 693

Query: 807  XXXXXXKVLPIPPGKSSFGAGAQLKTAIGNWSRWLTDLFGIXXXXXXXXXXXXXXXXXXG 866
                  +VLPIP   SSFG+GAQLK +IGNWSRWLTDLFGI                   
Sbjct: 694  DPIADLRVLPIPSRTSSFGSGAQLKNSIGNWSRWLTDLFGIDDEDDDSSDENSYV----- 748

Query: 867  RQNTSFKPFHLLNALSDLLMLPKDMLLSESIRKEVCPMFNASQIKKILDNFVPDEFCPDP 926
                SFK F+LL ALSDL+MLPKDMLL+ S+RKEVCPMF A  IK++L+NFVPDEFCPDP
Sbjct: 749  --EKSFKTFNLLKALSDLMMLPKDMLLNSSVRKEVCPMFGAPLIKRVLNNFVPDEFCPDP 806

Query: 927  IPTDVFEALDSKDDLEDGKDSVNNFPCIAAPIVYSPPPATTIASITGDIGSES--QLXXX 984
            +P  V ++L+S+++ E  K  + ++PC A   VY PP  T+I++I G+ G     QL   
Sbjct: 807  VPDAVLKSLESEEEAE--KSIITSYPCTAPSPVYCPPSRTSISTIIGNFGQPQAPQLSRI 864

Query: 985  XXXXXXXXYTSDDELDELNSPLSSILFSGSPSPVSTKPNWKKKESRTESAVRYELLRNVW 1044
                    YTSDDELDEL+SPL+ ++   + S        K      +  +RY+LLR  W
Sbjct: 865  RSSITRKAYTSDDELDELSSPLAVVVLQQAGSK-------KINNGDADETIRYQLLRECW 917

Query: 1045 MNSE 1048
            MN E
Sbjct: 918  MNGE 921



 Score =  243 bits (620), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 132/232 (56%), Positives = 173/232 (74%), Gaps = 4/232 (1%)

Query: 1   MKGKNRRSGGIVQLDYLIHIEEIKPWPPSQSLRSCRSVLIQWENGERSSGSTNIASPSLG 60
           +  KNRR    VQ+DYLIHI +IKPWPPSQSLRS RSV+IQWENG+R+SG+T++ +PSLG
Sbjct: 5   LSSKNRRCSS-VQVDYLIHIHDIKPWPPSQSLRSLRSVVIQWENGDRNSGTTSVVAPSLG 63

Query: 61  SIVGEGRIEFNESFRLPVALLRDMSVKNSDAVVFQRNCLEFNLYEPRRDKIVKGQLLATA 120
           S++GEG+IEFNESF+LP+ LL+D+S +     VF +N LE NLYEPRR+K    QLLATA
Sbjct: 64  SVIGEGKIEFNESFKLPLTLLKDVSARGKGGDVFFKNVLELNLYEPRREKT--HQLLATA 121

Query: 121 IVDLADCGILRETLSISAPLNCKRSYRNTDQSFLFIKIEPVEKNRARPSLKDRLSKD-NN 179
            +DLA  G+++E+ S++A +N KRSYRN  Q  L++ I+PV + RA  S  + L  +  N
Sbjct: 122 TIDLAVYGVVKESFSLTAQMNSKRSYRNATQPVLYLTIQPVSRRRASSSSMNSLKDEAKN 181

Query: 180 GSDSVSALMNGEYAEEAEIASFTDDDVSSHSSLAAITTSPESSGCMPREHEE 231
           G +SVSALMN EY +EAEIAS TDDD+SSHSSL   +++ ES+G      EE
Sbjct: 182 GGESVSALMNEEYYKEAEIASITDDDISSHSSLTVSSSTLESNGGFSVRTEE 233


>AT3G01810.2 | Symbols:  | FUNCTIONS IN: molecular_function unknown;
           INVOLVED IN: biological_process unknown; LOCATED IN:
           plasma membrane; EXPRESSED IN: 21 plant structures;
           EXPRESSED DURING: 13 growth stages; BEST Arabidopsis
           thaliana protein match is: nucleolar protein
           gar2-related (TAIR:AT2G42320.2); Has 1232 Blast hits to
           443 proteins in 120 species: Archae - 2; Bacteria - 119;
           Metazoa - 136; Fungi - 117; Plants - 114; Viruses - 0;
           Other Eukaryotes - 744 (source: NCBI BLink). |
           chr3:289218-292375 FORWARD LENGTH=859
          Length = 859

 Score =  503 bits (1295), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 269/529 (50%), Positives = 334/529 (63%), Gaps = 56/529 (10%)

Query: 465 EKLKYVKSVRSSADIARSISLGSNHHAEVKENGFNGDAQNSGGNIRSSDKREAKIYPREA 524
           EK + VKSVRSS DI RS S  S                        S+++EAK+YP   
Sbjct: 361 EKSRKVKSVRSSLDINRSNSRLSLF----------------------SERKEAKVYPNST 398

Query: 525 RNNILDSKVEHLESKIKMLEGELREAAAIEASLYSVVAEHGSSMGKVHAPARRLSRLYLH 584
            +  L+SK+++LES++K LEGEL EAAAIEA+LYSVVAEHGSS  KVHAPARRL RLYLH
Sbjct: 399 HDTTLESKIKNLESRVKKLEGELCEAAAIEAALYSVVAEHGSSSSKVHAPARRLLRLYLH 458

Query: 585 ACKENIQARRSGAAKSAISGLVLVAKACGNDVPRLTFWLSNSIVLRTIISKTTKDVAPSN 644
           AC+E   +RR+ AA+SA+SGLVLVAKACGNDVPRLTFWLSN+IVLRTIIS T+ +     
Sbjct: 459 ACRETHLSRRANAAESAVSGLVLVAKACGNDVPRLTFWLSNTIVLRTIISDTSAEEELPV 518

Query: 645 PAVSSTRRKSGEGNGKIAQSLIWKGYSHKKSENTAIEFGGIGNWDDPNVFTSALEKVEAW 704
            A    R++  E   +   SL WK     K      +    G WDDP  F +ALEKVEAW
Sbjct: 519 SAGPGPRKQKAERETEKRSSLKWKDSPLSKK-----DIKSFGAWDDPVTFITALEKVEAW 573

Query: 705 LFSRIVESIWWQSLTPHMQ------------------KSYTKMSGTCDQDLGNLSLDIWK 746
           +FSR+VESIWWQ+LTP MQ                  K++ +   + +Q+LG+ SL++WK
Sbjct: 574 IFSRVVESIWWQTLTPRMQSSAASTREFDKGNGSASKKTFGRTPSSTNQELGDFSLELWK 633

Query: 747 NAFREACERICPLRAGRHECGCLSVLPRLIMEQCIARLDVAMFNAILRESVXXXXXXXXX 806
            AFREA ER+CPLR   HECGCL +  RLIMEQC+ARLDVAMFNAILR+S          
Sbjct: 634 KAFREAHERLCPLRGSGHECGCLPIPARLIMEQCVARLDVAMFNAILRDSDDNFPTDPVS 693

Query: 807 XXXXXXKVLPIPPGKSSFGAGAQLKTAIGNWSRWLTDLFGIXXXXXXXXXXXXXXXXXXG 866
                 +VLPIP   SSFG+GAQLK +IGNWSRWLTDLFGI                   
Sbjct: 694 DPIADLRVLPIPSRTSSFGSGAQLKNSIGNWSRWLTDLFGIDDEDDDSSDENSYV----- 748

Query: 867 RQNTSFKPFHLLNALSDLLMLPKDMLLSESIRKEVCPMFNASQIKKILDNFVPDEFCPDP 926
               SFK F+LL ALSDL+MLPKDMLL+ S+RKEVCPMF A  IK++L+NFVPDEFCPDP
Sbjct: 749 --EKSFKTFNLLKALSDLMMLPKDMLLNSSVRKEVCPMFGAPLIKRVLNNFVPDEFCPDP 806

Query: 927 IPTDVFEALDSKDDLEDGKDSVNNFPCIAAPI----VYSPPPATTIASI 971
           +P  V ++L+S+        +++    +   +    + SPP   T+ S+
Sbjct: 807 VPDAVLKSLESEKLRRVSSQAIHALHLLQCTVHLHGLQSPPSLETLDSL 855



 Score =  244 bits (622), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 132/232 (56%), Positives = 173/232 (74%), Gaps = 4/232 (1%)

Query: 1   MKGKNRRSGGIVQLDYLIHIEEIKPWPPSQSLRSCRSVLIQWENGERSSGSTNIASPSLG 60
           +  KNRR    VQ+DYLIHI +IKPWPPSQSLRS RSV+IQWENG+R+SG+T++ +PSLG
Sbjct: 5   LSSKNRRCSS-VQVDYLIHIHDIKPWPPSQSLRSLRSVVIQWENGDRNSGTTSVVAPSLG 63

Query: 61  SIVGEGRIEFNESFRLPVALLRDMSVKNSDAVVFQRNCLEFNLYEPRRDKIVKGQLLATA 120
           S++GEG+IEFNESF+LP+ LL+D+S +     VF +N LE NLYEPRR+K    QLLATA
Sbjct: 64  SVIGEGKIEFNESFKLPLTLLKDVSARGKGGDVFFKNVLELNLYEPRREKT--HQLLATA 121

Query: 121 IVDLADCGILRETLSISAPLNCKRSYRNTDQSFLFIKIEPVEKNRARPSLKDRLSKD-NN 179
            +DLA  G+++E+ S++A +N KRSYRN  Q  L++ I+PV + RA  S  + L  +  N
Sbjct: 122 TIDLAVYGVVKESFSLTAQMNSKRSYRNATQPVLYLTIQPVSRRRASSSSMNSLKDEAKN 181

Query: 180 GSDSVSALMNGEYAEEAEIASFTDDDVSSHSSLAAITTSPESSGCMPREHEE 231
           G +SVSALMN EY +EAEIAS TDDD+SSHSSL   +++ ES+G      EE
Sbjct: 182 GGESVSALMNEEYYKEAEIASITDDDISSHSSLTVSSSTLESNGGFSVRTEE 233


>AT2G42320.2 | Symbols:  | nucleolar protein gar2-related |
            chr2:17628102-17630657 FORWARD LENGTH=669
          Length = 669

 Score =  452 bits (1162), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 250/560 (44%), Positives = 323/560 (57%), Gaps = 69/560 (12%)

Query: 503  QNSGGNIRSSDKREAKIYPREARNNILDSKVEHLESKIKMLEGELREAAAIEASLYSVVA 562
            +N+GGN       E               K+E LE++I+ LE ELRE AA+E SLYSVV 
Sbjct: 162  ENNGGNFEDGSSEE---------------KIERLETRIEKLEEELREVAALEISLYSVVP 206

Query: 563  EHGSSMGKVHAPARRLSRLYLHACKENIQARRSGAAKSAISGLVLVAKACGNDVPRLTFW 622
            +H SS  K+H PARR+SR+Y+HACK   Q +R+  A++++SGLVLVAK+CGNDV RLTFW
Sbjct: 207  DHCSSAHKLHTPARRISRIYIHACKHFTQGKRATIARNSVSGLVLVAKSCGNDVSRLTFW 266

Query: 623  LSNSIVLRTIISKTTKDVAPSNPAVSSTRRKSGEGNGKIAQSLIWK-GYSHKKSENTAIE 681
            LSN I LR IIS+       S     S   +SG  +     +L WK G+           
Sbjct: 267  LSNIIALRQIISQA---FGRSRITQISEPNESGNSDSGKKTNLRWKNGFQQL-------- 315

Query: 682  FGGIGNWDDPNVFTSALEKVEAWLFSRIVESIWWQSLTPHMQ---------KSYTKMSGT 732
               + +W +   FT+ALEK+E W+FSRIVES+WWQ  TPHMQ         KS  K+ G 
Sbjct: 316  ---LEDWQETETFTTALEKIEFWVFSRIVESVWWQVFTPHMQSPEDDSSASKSNGKLMGP 372

Query: 733  C--DQDLGNLSLDIWKNAFREACERICPLRAGRHECGCLSVLPRLIMEQCIARLDVAMFN 790
               DQ+ G  S+ +WKNAFR+A +RICP+R   HECGCL VL R++M++CI R DVAMFN
Sbjct: 373  SLGDQNQGTFSISLWKNAFRDALQRICPMRGAGHECGCLPVLARMVMDKCIGRFDVAMFN 432

Query: 791  AILRESVXXXXXXXXXXXXXXXKVLPIPPGKSSFGAGAQLKTAIGNWSRWLTDLFGIXXX 850
            AILRES                KVLPIP G  SFG+GAQLK AIGNWSR LT++FG+   
Sbjct: 433  AILRESEHQIPTDPVSDPILDSKVLPIPAGDLSFGSGAQLKNAIGNWSRCLTEMFGMNSD 492

Query: 851  XXXXXXXXXXXXXXXGRQNTSFKPFHLLNALSDLLMLPKDMLLSESIRKEVCPMFNASQI 910
                              +   K F LLN LSDLLMLPKDML+  SIR+E+CP  +   I
Sbjct: 493  DSSAKEKRNSED-----DHVESKAFVLLNELSDLLMLPKDMLMEISIREEICPSISLPLI 547

Query: 911  KKILDNFVPDEFCPDPIPTDVFEALDSKDDLEDGKDSVNNFPCIAAPIVYSPPPATTIAS 970
            K+IL NF PDEFCPD +P  V E L++ + + D K S  +FP  A+ + Y PP    IA 
Sbjct: 548  KRILCNFTPDEFCPDQVPGAVLEELNAAESIGDRKLSEASFPYAASSVSYMPPSTMDIAE 607

Query: 971  ITGDIGSESQLXXXXXXXXXXXYTSDDELDELNSPLSSIL-----FSGSPSPVSTKPNWK 1025
               +  + ++L           YTSD+EL+EL+SPL+SI+     F+GS +         
Sbjct: 608  KVAE--ASAKLSRNVSMIQRKGYTSDEELEELDSPLTSIVDKASDFTGSAT--------- 656

Query: 1026 KKESRTESAVRYELLRNVWM 1045
                   S  RY+LLR VW+
Sbjct: 657  -------SNARYKLLRQVWV 669


>AT2G42320.1 | Symbols:  | nucleolar protein gar2-related |
            chr2:17628102-17630657 FORWARD LENGTH=669
          Length = 669

 Score =  452 bits (1162), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 250/560 (44%), Positives = 323/560 (57%), Gaps = 69/560 (12%)

Query: 503  QNSGGNIRSSDKREAKIYPREARNNILDSKVEHLESKIKMLEGELREAAAIEASLYSVVA 562
            +N+GGN       E               K+E LE++I+ LE ELRE AA+E SLYSVV 
Sbjct: 162  ENNGGNFEDGSSEE---------------KIERLETRIEKLEEELREVAALEISLYSVVP 206

Query: 563  EHGSSMGKVHAPARRLSRLYLHACKENIQARRSGAAKSAISGLVLVAKACGNDVPRLTFW 622
            +H SS  K+H PARR+SR+Y+HACK   Q +R+  A++++SGLVLVAK+CGNDV RLTFW
Sbjct: 207  DHCSSAHKLHTPARRISRIYIHACKHFTQGKRATIARNSVSGLVLVAKSCGNDVSRLTFW 266

Query: 623  LSNSIVLRTIISKTTKDVAPSNPAVSSTRRKSGEGNGKIAQSLIWK-GYSHKKSENTAIE 681
            LSN I LR IIS+       S     S   +SG  +     +L WK G+           
Sbjct: 267  LSNIIALRQIISQA---FGRSRITQISEPNESGNSDSGKKTNLRWKNGFQQL-------- 315

Query: 682  FGGIGNWDDPNVFTSALEKVEAWLFSRIVESIWWQSLTPHMQ---------KSYTKMSGT 732
               + +W +   FT+ALEK+E W+FSRIVES+WWQ  TPHMQ         KS  K+ G 
Sbjct: 316  ---LEDWQETETFTTALEKIEFWVFSRIVESVWWQVFTPHMQSPEDDSSASKSNGKLMGP 372

Query: 733  C--DQDLGNLSLDIWKNAFREACERICPLRAGRHECGCLSVLPRLIMEQCIARLDVAMFN 790
               DQ+ G  S+ +WKNAFR+A +RICP+R   HECGCL VL R++M++CI R DVAMFN
Sbjct: 373  SLGDQNQGTFSISLWKNAFRDALQRICPMRGAGHECGCLPVLARMVMDKCIGRFDVAMFN 432

Query: 791  AILRESVXXXXXXXXXXXXXXXKVLPIPPGKSSFGAGAQLKTAIGNWSRWLTDLFGIXXX 850
            AILRES                KVLPIP G  SFG+GAQLK AIGNWSR LT++FG+   
Sbjct: 433  AILRESEHQIPTDPVSDPILDSKVLPIPAGDLSFGSGAQLKNAIGNWSRCLTEMFGMNSD 492

Query: 851  XXXXXXXXXXXXXXXGRQNTSFKPFHLLNALSDLLMLPKDMLLSESIRKEVCPMFNASQI 910
                              +   K F LLN LSDLLMLPKDML+  SIR+E+CP  +   I
Sbjct: 493  DSSAKEKRNSED-----DHVESKAFVLLNELSDLLMLPKDMLMEISIREEICPSISLPLI 547

Query: 911  KKILDNFVPDEFCPDPIPTDVFEALDSKDDLEDGKDSVNNFPCIAAPIVYSPPPATTIAS 970
            K+IL NF PDEFCPD +P  V E L++ + + D K S  +FP  A+ + Y PP    IA 
Sbjct: 548  KRILCNFTPDEFCPDQVPGAVLEELNAAESIGDRKLSEASFPYAASSVSYMPPSTMDIAE 607

Query: 971  ITGDIGSESQLXXXXXXXXXXXYTSDDELDELNSPLSSIL-----FSGSPSPVSTKPNWK 1025
               +  + ++L           YTSD+EL+EL+SPL+SI+     F+GS +         
Sbjct: 608  KVAE--ASAKLSRNVSMIQRKGYTSDEELEELDSPLTSIVDKASDFTGSAT--------- 656

Query: 1026 KKESRTESAVRYELLRNVWM 1045
                   S  RY+LLR VW+
Sbjct: 657  -------SNARYKLLRQVWV 669


>AT3G57780.1 | Symbols:  | BEST Arabidopsis thaliana protein match is:
            nucleolar protein gar2-related (TAIR:AT2G42320.2); Has
            3163 Blast hits to 2460 proteins in 357 species: Archae -
            16; Bacteria - 291; Metazoa - 841; Fungi - 335; Plants -
            248; Viruses - 72; Other Eukaryotes - 1360 (source: NCBI
            BLink). | chr3:21399766-21402329 REVERSE LENGTH=671
          Length = 671

 Score =  451 bits (1159), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 251/557 (45%), Positives = 337/557 (60%), Gaps = 35/557 (6%)

Query: 497  GFNGDAQNSGGNIRSSDKREAKIYPREARNNILDSKVEHLESKIKMLEGELREAAAIEAS 556
            G +G ++N  G+++  ++        E    +L   VE LE++++ LE ELRE AA+E S
Sbjct: 139  GLSGGSENEAGDVKEKNEN------FEEDEEMLKQMVETLETRVEKLEEELREVAALEIS 192

Query: 557  LYSVVAEHGSSMGKVHAPARRLSRLYLHACKENIQARRSGAAKSAISGLVLVAKACGNDV 616
            LYSVV +H SS  K+H PARR+SR+Y+HACK   Q +R+  A++++SGL+L AK+CGNDV
Sbjct: 193  LYSVVPDHSSSAHKLHTPARRISRIYIHACKHWSQGKRATVARNSVSGLILAAKSCGNDV 252

Query: 617  PRLTFWLSNSIVLRTIISKT-TKDVAPSNPAVSSTRRKSGEGNGKIAQSLIWKGYSHKKS 675
             RLTFWLSN I LR II +   K   PS+   + T   +G  +  + +    K    K+S
Sbjct: 253  SRLTFWLSNIISLREIILQAFGKTSVPSH--FTETSASNGSEHNVLGKVRRKKNQWTKQS 310

Query: 676  ENTAIEFGGIGNWDDPNVFTSALEKVEAWLFSRIVESIWWQSLTPHMQK----SYTKMSG 731
                  F    +W +   FT+ALEKVE W+FSRIVES+WWQ  TPHMQ       TK   
Sbjct: 311  NGFKQVF---EDWQESQTFTAALEKVEFWIFSRIVESVWWQVFTPHMQSPENGGKTKEHI 367

Query: 732  TCDQDLGNLSLDIWKNAFREACERICPLRAGRHECGCLSVLPRLIMEQCIARLDVAMFNA 791
              D + G+ S+ +WKNAF+    R+CP+R  RHECGCL +L +++ME+CIAR+DVAMFNA
Sbjct: 368  LGDIEQGSFSISLWKNAFKVTLSRLCPMRGARHECGCLPILAKMVMEKCIARIDVAMFNA 427

Query: 792  ILRESVXXXXXXXXXXXXXXXKVLPIPPGKSSFGAGAQLKTAIGNWSRWLTDLFGIXXXX 851
            ILRES                KVLPI  G  SFG+GAQLK AIGNWSR L ++F I    
Sbjct: 428  ILRESEHQIPTDPVSDPILDSKVLPILSGNLSFGSGAQLKNAIGNWSRCLAEMFSINTRD 487

Query: 852  XXXXXXXXXXXXXXGRQNTSFKPFHLLNALSDLLMLPKDMLLSESIRKEVCPMFNASQIK 911
                               S K F LLN LSDLLMLPKDML+  S R+EVCP  + + IK
Sbjct: 488  SVEENDPIE----------SEKSFSLLNELSDLLMLPKDMLMDRSTREEVCPSISLALIK 537

Query: 912  KILDNFVPDEFCPDPIPTDVFEALDSKDDLEDGKDSVNNFPCIAAPIVYSPPPATTIASI 971
            +IL NF PDEFCPD +P  V E L++ + + + K S  +FP  A+P+ Y+PP +T +A +
Sbjct: 538  RILCNFTPDEFCPDDVPGAVLEELNN-ESISEQKLSGVSFPYAASPVSYTPPSSTNVAEV 596

Query: 972  TGDIGSESQLXXXXXXXXXXXYTSDDELDELNSPLSSILFSGSPSPVSTKP-NWKKKESR 1030
             GDI   S++           YTSDDEL+EL+SPL+SI+ + S SP+S +  N K++  +
Sbjct: 597  -GDI---SRMSRNVSMIQRKGYTSDDELEELDSPLTSIIENVSLSPISAQGRNVKQEAEK 652

Query: 1031 TESAV---RYELLRNVW 1044
                V   RYELLR VW
Sbjct: 653  IGPGVTISRYELLREVW 669


>AT5G43230.1 | Symbols:  | unknown protein; BEST Arabidopsis thaliana
            protein match is: unknown protein (TAIR:AT3G01810.3); Has
            30201 Blast hits to 17322 proteins in 780 species: Archae
            - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422;
            Plants - 5037; Viruses - 0; Other Eukaryotes - 2996
            (source: NCBI BLink). | chr5:17349125-17352747 FORWARD
            LENGTH=848
          Length = 848

 Score =  438 bits (1127), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 240/519 (46%), Positives = 307/519 (59%), Gaps = 52/519 (10%)

Query: 537  ESKIKMLEGELREAAAIEASLYSVVAEHGSSMGKVHAPARRLSRLYLHACKEN--IQARR 594
            +SK + LE EL+EAA +EA++YSVVAEH SSM KVHAPARRL+R YLHACK N    ++R
Sbjct: 365  DSKTETLEDELKEAAVLEAAIYSVVAEHTSSMSKVHAPARRLARFYLHACKGNGSDHSKR 424

Query: 595  SGAAKSAISGLVLVAKACGNDVPRLTFWLSNSIVLRTIISKTTKDVAPSNPAVSSTRRKS 654
            + AA++A+SGL+LV+KACGNDVPRLTFWLSNSIVLR I+S+                   
Sbjct: 425  ATAARAAVSGLILVSKACGNDVPRLTFWLSNSIVLRAILSR------------------- 465

Query: 655  GEGNGKIAQSLIWKGYSHKKSENTAIEFGGIGNWDDPNVFTSALEKVEAWLFSRIVESIW 714
            G    KI                   E  G   W+DP  F +ALEK E+W+FSR+V+S+W
Sbjct: 466  GMEKMKIVP-----------------EKAGSDEWEDPRAFLAALEKFESWIFSRVVKSVW 508

Query: 715  WQSLTPHMQKSYTK------MSGT---CDQDLGNLSLDIWKNAFREACERICPLRAGRHE 765
            WQS+TPHMQ    K      +SG      ++ G  ++++WKNAFR ACER+CPLR  R E
Sbjct: 509  WQSMTPHMQSPAVKGSIARKVSGKRRLGHRNQGLYAIELWKNAFRAACERLCPLRGSRQE 568

Query: 766  CGCLSVLPRLIMEQCIARLDVAMFNAILRESVXXXXXXXXXXXXXXXKVLPIPPGKSSFG 825
            CGCL +L +L+MEQ I+RLDVAMFNAILRES                 VLPIP GK+SFG
Sbjct: 569  CGCLPMLAKLVMEQLISRLDVAMFNAILRESAGEMPTDPVSDPISDINVLPIPAGKASFG 628

Query: 826  AGAQLKTAIGNWSRWLTDLFGIXXXXXXXXXXXXXXXXXXGRQNTSFKPFHLLNALSDLL 885
            AGAQLK AIG WSRWL D F                      +   F+ FHLLN+L DL+
Sbjct: 629  AGAQLKNAIGTWSRWLEDQFE-QKEDKSGRNKDEDNNDKEKPECEHFRLFHLLNSLGDLM 687

Query: 886  MLPKDMLLSESIRKEVCPMFNASQIKKILDNFVPDEFCPDPIPTDVFEALDSKDDLEDGK 945
            MLP  ML  +S RKEVCP      IK++L NFVPDEF P  IP  +F+ L+S+   E+  
Sbjct: 688  MLPFKMLADKSTRKEVCPTLGPPIIKRVLRNFVPDEFNPHRIPRRLFDVLNSEGLTEEDN 747

Query: 946  DSVNNFPCIAAPIVYSPPPATTIASITGDIGSESQLXXXXXXXXXXXYTSDDELDELNSP 1005
              +  FP  A+P VY  P   +I    G++ + S +           YTSDDELD+L++ 
Sbjct: 748  GCITVFPSAASPTVYLMPSTDSIKRFIGELNNPS-ISETGSSVFKKQYTSDDELDDLDTS 806

Query: 1006 LSSILFSGSPSPVSTKPNWKKKESRTESAVRYELLRNVW 1044
            ++SI FS   +  S++  W  K       VRY+LLR +W
Sbjct: 807  INSI-FSAPGTTNSSE--WMPKGYGRRKTVRYQLLREIW 842



 Score =  105 bits (263), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 67/182 (36%), Positives = 103/182 (56%), Gaps = 21/182 (11%)

Query: 1   MKGKNRRSGGIVQLDYLIHIEEIKPWPPSQSLRSCRSVLIQWENGERSSGSTNIASPSLG 60
           ++ K+RR  G+  ++YLI I+E+KPWP SQ    C  VL++WENGE +SGS         
Sbjct: 5   LRTKSRRDNGVF-VEYLISIKELKPWPTSQVPAQC--VLLKWENGENNSGS-------FI 54

Query: 61  SIVGEGRIEFNESFRLPVALLRDMSVKNSDAVVFQRNCLEFNLYEPRRDKIVKGQ-LLAT 119
           ++VG+  I FNESFRL + L   +   N     F +N LE ++Y+ ++        LL T
Sbjct: 55  AVVGKDTIMFNESFRLTLTLEPKVGSDNK----FHKNLLELHVYDAKKKDKGVKNKLLGT 110

Query: 120 AIVDLADCGILRETLSISAPLNCKRSYRNTDQSFLFIKIEPV------EKNRARPSLKDR 173
           A V+LAD G+L  ++ + AP   K+S RN   S +++ +EP       E NR+  S + +
Sbjct: 111 ASVNLADFGLLTNSVPVGAPFTFKKSSRNDASSEIYLTVEPAGEEDYDEGNRSSGSSQPK 170

Query: 174 LS 175
           +S
Sbjct: 171 MS 172


>AT5G06930.1 | Symbols:  | LOCATED IN: chloroplast; EXPRESSED IN: 15
            plant structures; EXPRESSED DURING: 7 growth stages; BEST
            Arabidopsis thaliana protein match is: nucleolar protein
            gar2-related (TAIR:AT2G42320.2); Has 3369 Blast hits to
            1526 proteins in 313 species: Archae - 2; Bacteria - 910;
            Metazoa - 754; Fungi - 336; Plants - 137; Viruses - 11;
            Other Eukaryotes - 1219 (source: NCBI BLink). |
            chr5:2145139-2147849 FORWARD LENGTH=723
          Length = 723

 Score =  360 bits (924), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 219/571 (38%), Positives = 303/571 (53%), Gaps = 90/571 (15%)

Query: 471  KSVRSSADIARSISLGSNHHA-------------------EVKENGFNGDAQNSGGNIRS 511
            K+VRSS   A+++S  S++ +                   E KE+    DA NS  N  S
Sbjct: 177  KTVRSSKSQAKALSDFSSYRSSENNKAFSSASPVDSTPFEEGKEDDEFEDALNSVHNNES 236

Query: 512  SDKREAKIYPREARNNI---LDSKVEHLESKIKMLEGELREAAAIEASLYSVVAEHGSSM 568
             +  E  +Y  + R+++   L  K+E +E++I+ LE ELRE AA+E SLYSV  EHGSS 
Sbjct: 237  DN--ETLVYKEKKRSDVEKVLAQKIETMEARIEKLEEELREVAALEMSLYSVFPEHGSSS 294

Query: 569  GKVHAPARRLSRLYLHACKENIQARRSGAAKSAISGLVLVAKACGNDVPRLTFWLSNSIV 628
             K+H PAR LSRLY  A K   + +     K+ +SGL L+ K+CG+DV RLT+WLSN+++
Sbjct: 295  HKLHKPARNLSRLYALARKNQSENKIISVTKNIVSGLSLLLKSCGSDVSRLTYWLSNTVM 354

Query: 629  LRTIISKTTKDVAPSNPAVSSTRRKSGEGNGKIAQSLIWKGYSHKKSENTAIEFGGIGNW 688
            LR IIS                               +  G S     N+  E     +W
Sbjct: 355  LREIIS-------------------------------LDFGSSKLNGLNSLKE-----DW 378

Query: 689  DDPNVFTSALEKVEAWLFSRIVESIWWQSLTPHM-----QKSYTKMSG------TCDQDL 737
             D     +AL +VE+  F++ VESIW Q +  HM       +  +M G      TCD+  
Sbjct: 379  GDVRTLIAALRRVESCFFTQAVESIWSQVMMVHMIPQGVDSTMGEMIGNFSEPATCDRLQ 438

Query: 738  GNLSLDIWKNAFREACERICPLRAGRHECGCLSVLPRLIMEQCIARLDVAMFNAILRESV 797
             + S+++WK AF EA +R+CP++A R +CGCL VL R++MEQCI RLDVAMFNAILRES 
Sbjct: 439  ESFSVNLWKEAFEEALQRLCPVQATRRQCGCLHVLTRMVMEQCIVRLDVAMFNAILRESA 498

Query: 798  XXXXXXXXXXXXXXXKVLPIPPGKSSFGAGAQLKTAIGNWSRWLTDLFGIXXXXXXXXXX 857
                           +VLPIP G  SF +G +LK  +  WSR LTD+FGI          
Sbjct: 499  HHIPTDSASDPIADSRVLPIPAGVLSFESGVKLKNTVSYWSRLLTDIFGIDVEQKMQ--- 555

Query: 858  XXXXXXXXGRQNTSFKPFHLLNALSDLLMLPKDMLLSESIRKEVCPMFNASQIKKILDNF 917
                     R + +FKPFHLLN LSDLLMLPK+M +  S R EVCP    S IK+I+ NF
Sbjct: 556  ---------RGDETFKPFHLLNELSDLLMLPKEMFVDSSTRDEVCPSIGLSLIKRIVCNF 606

Query: 918  VPDEFCPDPIPTDVFEALDSKDDLED---GKDSVNNFPCIAAPIVYSPPPATTIASITGD 974
             PDEFCP P+P  V E L+++  LE+    +D+   FP    P+ YSPP  + +     D
Sbjct: 607  TPDEFCPYPVPGTVLEELNAQSILENRSLSRDTARGFPRQVNPVSYSPPSCSHLT----D 662

Query: 975  IGSESQLXXXXXXXXXXXYTSDDELDELNSP 1005
            I +E  +           Y+S+++++   SP
Sbjct: 663  IVAEFSVKLKLSMTHKNGYSSNEKVETPRSP 693