Miyakogusa Predicted Gene

Lj5g3v0843440.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v0843440.1 Non Characterized Hit- tr|D7U577|D7U577_VITVI
Putative uncharacterized protein OS=Vitis vinifera
GN=,35.34,1e-18,Myb_DNA-bind_3,Myb/SANT-like domain,CUFF.54062.1
         (441 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G24960.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   262   4e-70
AT2G24960.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   261   6e-70
AT4G02210.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   227   1e-59
AT4G02210.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   227   1e-59
AT5G05800.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    97   3e-20
AT5G05800.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    97   3e-20
AT3G11290.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    82   8e-16
AT4G02550.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    71   1e-12
AT4G02550.3 | Symbols:  | unknown protein; BEST Arabidopsis thal...    71   1e-12
AT2G19220.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    68   1e-11
AT4G02550.4 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    68   1e-11
AT4G02550.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...    68   1e-11
AT3G11310.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    66   6e-11
AT1G30140.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    56   4e-08
AT5G27260.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    55   1e-07
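
The summary table above can be read programmatically. The following is a minimal sketch, not part of the BLAST output itself: it assumes each summary line ends with a bit score and an E-value separated by whitespace, as in the lines shown here. The names `SUMMARY`, `LINE_RE`, and `parse_summary` are hypothetical and chosen only for illustration.

```python
import re

# Three lines copied verbatim from the summary table above.
SUMMARY = """\
AT2G24960.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   262   4e-70
AT5G05800.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    97   3e-20
AT5G27260.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    55   1e-07
"""

# Assumed layout: identifier, free-text description, bit score, E-value.
LINE_RE = re.compile(r"^(\S+)\s+(.*?)\s+(\d+(?:\.\d+)?)\s+(\S+)\s*$")


def parse_summary(text):
    """Return a list of (identifier, bit_score, e_value) tuples."""
    hits = []
    for line in text.splitlines():
        match = LINE_RE.match(line)
        if match is None:
            continue  # skip blank lines and anything that is not a hit line
        ident, _description, bits, evalue = match.groups()
        # Some BLAST versions print very small E-values without a mantissa
        # (for example "e-100"); prefix a "1" so float() accepts them.
        if evalue.startswith("e"):
            evalue = "1" + evalue
        hits.append((ident, float(bits), float(evalue)))
    return hits


hits = parse_summary(SUMMARY)
for ident, bits, evalue in hits:
    print(f"{ident}\t{bits:g}\t{evalue:g}")
```

For anything beyond a quick look, a dedicated BLAST parser or a tabular output format is likely a more robust choice than a regular expression over the text report.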

>AT2G24960.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 21 plant
           structures; EXPRESSED DURING: 12 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr2:10617263-10620034 FORWARD LENGTH=774
          Length = 774

 Score =  262 bits (669), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 122/304 (40%), Positives = 191/304 (62%), Gaps = 15/304 (4%)

Query: 10  DNFRANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKN 69
           D  R  WTP+ +++F++LML H+HRGN+TG  F+++AW +M+  FN+ FG +YD DVLK+
Sbjct: 9   DRTRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQYDKDVLKS 68

Query: 70  RFKRFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYND 129
           R+    KQY ++K ++   GF WD     ++  +  W  Y+K HP AR ++T+ V  ++D
Sbjct: 69  RYTNLWKQYNDVKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTKPVLNFSD 128

Query: 130 MCIIYGHAVADGRYSLSCFDVDFEYEDIASKELDDQTTPSKGVDDQTPPTVINQSKIDWS 189
           +C+IYG+ VADGRYS+S  D++ E       E++ ++    G +          SK +W+
Sbjct: 129 LCLIYGYTVADGRYSMSSHDLEIE------DEINGESVVLSGKE---------SSKTEWT 173

Query: 190 PMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRLNVLIRH 249
             MD +FVE+MVDQ+ +GNK G +F K+AW+DM   FN RF   Y K VL++R N L+++
Sbjct: 174 LEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRHRYNKLLKY 233

Query: 250 YCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCIICRDE 309
           Y  + A+L ++GFSWD+ +  + ADD VW   I+ +   R YR+KS+P Y+ +  I   +
Sbjct: 234 YKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLPSYNDLDTIFACQ 293

Query: 310 ATAG 313
           A  G
Sbjct: 294 AEQG 297



 Score =  228 bits (580), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 115/308 (37%), Positives = 175/308 (56%), Gaps = 17/308 (5%)

Query: 13  RANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFK 72
           +  WT   DQYF+E+M+  + RGNKTG  FS++AW DM+  FN  F  +Y   VL++R+ 
Sbjct: 169 KTEWTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRHRYN 228

Query: 73  RFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCI 132
           +  K Y +++ I+ + GF WD    MI A +  WD YIKDHP AR +R + +P YND+  
Sbjct: 229 KLLKYYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLPSYNDLDT 288

Query: 133 IYGHAVADGRYSLSCFDVDFEYEDIASKELDDQTTPSKGVDDQTPPTVINQSKIDWSPMM 192
           I+      G           ++ D  S     QT+ +K   +Q      ++++I W+P M
Sbjct: 289 IFACQAEQGT----------DHRDDGSAA---QTSETKASQEQNS----DRTRIFWTPPM 331

Query: 193 DHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRLNVLIRHYCS 252
           D+  ++L+V+QV  GN++G++F   AW +M  +FN +F S + K VLKNR   L R Y  
Sbjct: 332 DYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFGSQHNKDVLKNRYKHLRRLYND 391

Query: 253 INALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCIICRDEATA 312
           I  LL + GFSWD R+  V+ADD +W   I+ +   R YR+K++P Y  +C I   E + 
Sbjct: 392 IKFLLEQNGFSWDARRDMVIADDDIWNTYIQAHPEARSYRVKTIPSYPNLCFIFGKETSD 451

Query: 313 GCRSNLEK 320
           G  + L +
Sbjct: 452 GRYTRLAQ 459



 Score =  221 bits (562), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 107/306 (34%), Positives = 178/306 (58%), Gaps = 12/306 (3%)

Query: 10  DNFRANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKN 69
           D  R  WTP  D + ++L++  V+ GN+ G+ F   AW +M+  FN  FG +++ DVLKN
Sbjct: 321 DRTRIFWTPPMDYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFGSQHNKDVLKN 380

Query: 70  RFKRFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYND 129
           R+K  R+ Y +IK ++ Q GF WD   +M++A +  W+ YI+ HP AR++R + +P Y +
Sbjct: 381 RYKHLRRLYNDIKFLLEQNGFSWDARRDMVIADDDIWNTYIQAHPEARSYRVKTIPSYPN 440

Query: 130 MCIIYGHAVADGRYS--LSCFDVDFEY-----EDIASKELDDQTTPSKGV-----DDQTP 177
           +C I+G   +DGRY+     FD          E  ++    D  +  K V      +  P
Sbjct: 441 LCFIFGKETSDGRYTRLAQAFDPSPAETVRMNESGSTDGFKDTRSFQKVVYTSNEKNDYP 500

Query: 178 PTVINQSKIDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKV 237
            + I    I+W+ +MDH  ++LM++QV +GNKIG +F ++AW DM ESFN +F       
Sbjct: 501 CSNIGPPCIEWTRVMDHCLIDLMLEQVSRGNKIGETFTEQAWADMAESFNAKFGLQTDMF 560

Query: 238 VLKNRLNVLIRHYCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMP 297
           +L+NR  +L++    IN +L  +GF+WD  +Q +VA+D+ W+  I+ + +  +Y+ K++ 
Sbjct: 561 MLENRYILLMKERDDINNILNLDGFTWDVEKQTIVAEDEYWEAYIKEHPDATIYKGKTLD 620

Query: 298 FYSGMC 303
            Y  +C
Sbjct: 621 SYGNLC 626



 Score =  112 bits (279), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 60/172 (34%), Positives = 97/172 (56%), Gaps = 9/172 (5%)

Query: 16  WTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFKRFR 75
           WT   D   ++LML  V RGNK G+ F+ +AWADM E FN  FGL+ D+ +L+NR+    
Sbjct: 511 WTRVMDHCLIDLMLEQVSRGNKIGETFTEQAWADMAESFNAKFGLQTDMFMLENRYILLM 570

Query: 76  KQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCIIYG 135
           K+  +I  I++  GF WD     IVA ++ W+ YIK+HP A  ++ + +  Y ++C +  
Sbjct: 571 KERDDINNILNLDGFTWDVEKQTIVAEDEYWEAYIKEHPDATIYKGKTLDSYGNLCKLNE 630

Query: 136 HAVADGRYSLSCFDVDFEYEDIASKE--LDDQTTPSKGVDDQ----TPPTVI 181
           H   +   S +C ++  E E+  ++   +DD ++P K  + +    TPP  I
Sbjct: 631 HLSQE---SFNCENLMIELENYGNEMEIVDDFSSPHKQQNKRPNPITPPLGI 679



 Score =  106 bits (264), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 53/155 (34%), Positives = 88/155 (56%), Gaps = 4/155 (2%)

Query: 179 TVINQSKIDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVV 238
           T  ++++  W+P M+ FF++LM++ + +GN+ G +F+K+AW +M   FN +F S Y K V
Sbjct: 6   TCNDRTRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQYDKDV 65

Query: 239 LKNRLNVLIRHYCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPF 298
           LK+R   L + Y  +  LL   GF WD+  Q V+ DD +W   ++ +   R+Y+ K +  
Sbjct: 66  LKSRYTNLWKQYNDVKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTKPVLN 125

Query: 299 YSGMCIICRDEATAGCRS----NLEKESPIGEKSV 329
           +S +C+I       G  S    +LE E  I  +SV
Sbjct: 126 FSDLCLIYGYTVADGRYSMSSHDLEIEDEINGESV 160


>AT2G24960.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 1453 Blast hits to 509 proteins
           in 26 species: Archae - 0; Bacteria - 0; Metazoa - 1;
           Fungi - 39; Plants - 1363; Viruses - 0; Other Eukaryotes
           - 50 (source: NCBI BLink). | chr2:10617263-10620034
           FORWARD LENGTH=797
          Length = 797

 Score =  261 bits (667), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 122/304 (40%), Positives = 191/304 (62%), Gaps = 15/304 (4%)

Query: 10  DNFRANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKN 69
           D  R  WTP+ +++F++LML H+HRGN+TG  F+++AW +M+  FN+ FG +YD DVLK+
Sbjct: 9   DRTRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQYDKDVLKS 68

Query: 70  RFKRFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYND 129
           R+    KQY ++K ++   GF WD     ++  +  W  Y+K HP AR ++T+ V  ++D
Sbjct: 69  RYTNLWKQYNDVKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTKPVLNFSD 128

Query: 130 MCIIYGHAVADGRYSLSCFDVDFEYEDIASKELDDQTTPSKGVDDQTPPTVINQSKIDWS 189
           +C+IYG+ VADGRYS+S  D++ E       E++ ++    G +          SK +W+
Sbjct: 129 LCLIYGYTVADGRYSMSSHDLEIE------DEINGESVVLSGKE---------SSKTEWT 173

Query: 190 PMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRLNVLIRH 249
             MD +FVE+MVDQ+ +GNK G +F K+AW+DM   FN RF   Y K VL++R N L+++
Sbjct: 174 LEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRHRYNKLLKY 233

Query: 250 YCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCIICRDE 309
           Y  + A+L ++GFSWD+ +  + ADD VW   I+ +   R YR+KS+P Y+ +  I   +
Sbjct: 234 YKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLPSYNDLDTIFACQ 293

Query: 310 ATAG 313
           A  G
Sbjct: 294 AEQG 297



 Score =  217 bits (553), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 116/331 (35%), Positives = 177/331 (53%), Gaps = 40/331 (12%)

Query: 13  RANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFK 72
           +  WT   DQYF+E+M+  + RGNKTG  FS++AW DM+  FN  F  +Y   VL++R+ 
Sbjct: 169 KTEWTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRHRYN 228

Query: 73  RFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCI 132
           +  K Y +++ I+ + GF WD    MI A +  WD YIKDHP AR +R + +P YND+  
Sbjct: 229 KLLKYYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLPSYNDLDT 288

Query: 133 IYGHAVADGRYSLSCFDVDFEYEDIASKELDDQTTPSKGVDDQTPPTVINQSKIDWSPMM 192
           I+      G           ++ D  S     QT+ +K   +Q      ++++I W+P M
Sbjct: 289 IFACQAEQGT----------DHRDDGSAA---QTSETKASQEQNS----DRTRIFWTPPM 331

Query: 193 DHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRLNVLIRHYCS 252
           D+  ++L+V+QV  GN++G++F   AW +M  +FN +F S + K VLKNR   L R Y  
Sbjct: 332 DYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFGSQHNKDVLKNRYKHLRRLYND 391

Query: 253 INALLGKEGFSWDKRQQKVVADDQVWQK------------------CIRVNH-----NFR 289
           I  LL + GFSWD R+  V+ADD +W                    C+++ H       R
Sbjct: 392 IKFLLEQNGFSWDARRDMVIADDDIWNTYIQACHILFLFKISVICLCLQMKHVQAHPEAR 451

Query: 290 LYRIKSMPFYSGMCIICRDEATAGCRSNLEK 320
            YR+K++P Y  +C I   E + G  + L +
Sbjct: 452 SYRVKTIPSYPNLCFIFGKETSDGRYTRLAQ 482



 Score =  207 bits (527), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 107/329 (32%), Positives = 178/329 (54%), Gaps = 35/329 (10%)

Query: 10  DNFRANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKN 69
           D  R  WTP  D + ++L++  V+ GN+ G+ F   AW +M+  FN  FG +++ DVLKN
Sbjct: 321 DRTRIFWTPPMDYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFGSQHNKDVLKN 380

Query: 70  RFKRFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKD----------------- 112
           R+K  R+ Y +IK ++ Q GF WD   +M++A +  W+ YI+                  
Sbjct: 381 RYKHLRRLYNDIKFLLEQNGFSWDARRDMVIADDDIWNTYIQACHILFLFKISVICLCLQ 440

Query: 113 ------HPSARAFRTRVVPYYNDMCIIYGHAVADGRYS--LSCFDVDFEY-----EDIAS 159
                 HP AR++R + +P Y ++C I+G   +DGRY+     FD          E  ++
Sbjct: 441 MKHVQAHPEARSYRVKTIPSYPNLCFIFGKETSDGRYTRLAQAFDPSPAETVRMNESGST 500

Query: 160 KELDDQTTPSKGV-----DDQTPPTVINQSKIDWSPMMDHFFVELMVDQVRKGNKIGRSF 214
               D  +  K V      +  P + I    I+W+ +MDH  ++LM++QV +GNKIG +F
Sbjct: 501 DGFKDTRSFQKVVYTSNEKNDYPCSNIGPPCIEWTRVMDHCLIDLMLEQVSRGNKIGETF 560

Query: 215 DKKAWVDMTESFNDRFESHYCKVVLKNRLNVLIRHYCSINALLGKEGFSWDKRQQKVVAD 274
            ++AW DM ESFN +F       +L+NR  +L++    IN +L  +GF+WD  +Q +VA+
Sbjct: 561 TEQAWADMAESFNAKFGLQTDMFMLENRYILLMKERDDINNILNLDGFTWDVEKQTIVAE 620

Query: 275 DQVWQKCIRVNHNFRLYRIKSMPFYSGMC 303
           D+ W+  I+ + +  +Y+ K++  Y  +C
Sbjct: 621 DEYWEAYIKEHPDATIYKGKTLDSYGNLC 649



 Score =  112 bits (279), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 60/172 (34%), Positives = 97/172 (56%), Gaps = 9/172 (5%)

Query: 16  WTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFKRFR 75
           WT   D   ++LML  V RGNK G+ F+ +AWADM E FN  FGL+ D+ +L+NR+    
Sbjct: 534 WTRVMDHCLIDLMLEQVSRGNKIGETFTEQAWADMAESFNAKFGLQTDMFMLENRYILLM 593

Query: 76  KQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCIIYG 135
           K+  +I  I++  GF WD     IVA ++ W+ YIK+HP A  ++ + +  Y ++C +  
Sbjct: 594 KERDDINNILNLDGFTWDVEKQTIVAEDEYWEAYIKEHPDATIYKGKTLDSYGNLCKLNE 653

Query: 136 HAVADGRYSLSCFDVDFEYEDIASKE--LDDQTTPSKGVDDQ----TPPTVI 181
           H   +   S +C ++  E E+  ++   +DD ++P K  + +    TPP  I
Sbjct: 654 HLSQE---SFNCENLMIELENYGNEMEIVDDFSSPHKQQNKRPNPITPPLGI 702



 Score =  105 bits (263), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 53/155 (34%), Positives = 88/155 (56%), Gaps = 4/155 (2%)

Query: 179 TVINQSKIDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVV 238
           T  ++++  W+P M+ FF++LM++ + +GN+ G +F+K+AW +M   FN +F S Y K V
Sbjct: 6   TCNDRTRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQYDKDV 65

Query: 239 LKNRLNVLIRHYCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPF 298
           LK+R   L + Y  +  LL   GF WD+  Q V+ DD +W   ++ +   R+Y+ K +  
Sbjct: 66  LKSRYTNLWKQYNDVKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTKPVLN 125

Query: 299 YSGMCIICRDEATAGCRS----NLEKESPIGEKSV 329
           +S +C+I       G  S    +LE E  I  +SV
Sbjct: 126 FSDLCLIYGYTVADGRYSMSSHDLEIEDEINGESV 160


>AT4G02210.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 22 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT2G24960.2). | chr4:974320-975917 REVERSE
           LENGTH=439
          Length = 439

 Score =  227 bits (579), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 110/303 (36%), Positives = 172/303 (56%), Gaps = 8/303 (2%)

Query: 10  DNFRANWTPSQDQYFLELMLSHVHRGNK-TGKVFSRRAWADMIEQFNTTFGLKYDIDVLK 68
           +  R  WTP  DQYF+ELM+  V +GN+    +FS+RAW  M   F   F   Y  DVLK
Sbjct: 8   ERLRTVWTPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKFLYGKDVLK 67

Query: 69  NRFKRFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYN 128
           NR K  R  +  +  ++ + GF WD+   M+VA    WDEY+K HP +R+FR + +P Y 
Sbjct: 68  NRHKTLRNLFKSVNNLLIEDGFSWDDTRQMVVADNCVWDEYLKIHPDSRSFRIKSIPCYK 127

Query: 129 DMCIIYGHAVADGRYSLSCFDVDFEYEDIASKELDD---QTTPSKGVDDQTPPTVINQSK 185
           D+C++Y   +++ +   S      E E     + DD   +   S  V   +  + + + +
Sbjct: 128 DLCLVYSDGMSEHKAEESIS----EGESKTLIQEDDGYNRICESSTVRSNSKGSSVTRCR 183

Query: 186 IDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRLNV 245
             W P MD +F++LM+DQ R+GN+I   F K+AW +M   FN +FES++   VLKNR   
Sbjct: 184 TTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNRYKS 243

Query: 246 LIRHYCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCII 305
           L R + +I ++L  +GF+WD  +Q V AD+ VWQ  I+ + + R +  + +P+Y  +C++
Sbjct: 244 LRRQFNAIKSILRSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKDLCVL 303

Query: 306 CRD 308
           C D
Sbjct: 304 CGD 306



 Score =  136 bits (343), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 63/169 (37%), Positives = 96/169 (56%), Gaps = 3/169 (1%)

Query: 8   SLDNFRANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVL 67
           S+   R  W P  D+YF++LML    RGN+   VF ++AW +M+  FN  F   +D+DVL
Sbjct: 178 SVTRCRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVL 237

Query: 68  KNRFKRFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYY 127
           KNR+K  R+Q+  IK+I+   GF WDN   M+ A    W +YIK H  AR F TR +PYY
Sbjct: 238 KNRYKSLRRQFNAIKSILRSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYY 297

Query: 128 NDMCIIYGHAVADGR---YSLSCFDVDFEYEDIASKELDDQTTPSKGVD 173
            D+C++ G +  +      ++  FD + E+++  S    D +  ++  D
Sbjct: 298 KDLCVLCGDSGIEENECFVAMDWFDPETEFQEFKSSGTTDLSISAEEED 346


>AT4G02210.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G24960.2); Has 791 Blast hits to 465 proteins
           in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 17; Plants - 748; Viruses - 0; Other Eukaryotes
           - 26 (source: NCBI BLink). | chr4:974320-975917 REVERSE
           LENGTH=439
          Length = 439

 Score =  227 bits (579), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 110/303 (36%), Positives = 172/303 (56%), Gaps = 8/303 (2%)

Query: 10  DNFRANWTPSQDQYFLELMLSHVHRGNK-TGKVFSRRAWADMIEQFNTTFGLKYDIDVLK 68
           +  R  WTP  DQYF+ELM+  V +GN+    +FS+RAW  M   F   F   Y  DVLK
Sbjct: 8   ERLRTVWTPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKFLYGKDVLK 67

Query: 69  NRFKRFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYN 128
           NR K  R  +  +  ++ + GF WD+   M+VA    WDEY+K HP +R+FR + +P Y 
Sbjct: 68  NRHKTLRNLFKSVNNLLIEDGFSWDDTRQMVVADNCVWDEYLKIHPDSRSFRIKSIPCYK 127

Query: 129 DMCIIYGHAVADGRYSLSCFDVDFEYEDIASKELDD---QTTPSKGVDDQTPPTVINQSK 185
           D+C++Y   +++ +   S      E E     + DD   +   S  V   +  + + + +
Sbjct: 128 DLCLVYSDGMSEHKAEESIS----EGESKTLIQEDDGYNRICESSTVRSNSKGSSVTRCR 183

Query: 186 IDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRLNV 245
             W P MD +F++LM+DQ R+GN+I   F K+AW +M   FN +FES++   VLKNR   
Sbjct: 184 TTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNRYKS 243

Query: 246 LIRHYCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCII 305
           L R + +I ++L  +GF+WD  +Q V AD+ VWQ  I+ + + R +  + +P+Y  +C++
Sbjct: 244 LRRQFNAIKSILRSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKDLCVL 303

Query: 306 CRD 308
           C D
Sbjct: 304 CGD 306



 Score =  136 bits (343), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 63/169 (37%), Positives = 96/169 (56%), Gaps = 3/169 (1%)

Query: 8   SLDNFRANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVL 67
           S+   R  W P  D+YF++LML    RGN+   VF ++AW +M+  FN  F   +D+DVL
Sbjct: 178 SVTRCRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVL 237

Query: 68  KNRFKRFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYY 127
           KNR+K  R+Q+  IK+I+   GF WDN   M+ A    W +YIK H  AR F TR +PYY
Sbjct: 238 KNRYKSLRRQFNAIKSILRSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYY 297

Query: 128 NDMCIIYGHAVADGR---YSLSCFDVDFEYEDIASKELDDQTTPSKGVD 173
            D+C++ G +  +      ++  FD + E+++  S    D +  ++  D
Sbjct: 298 KDLCVLCGDSGIEENECFVAMDWFDPETEFQEFKSSGTTDLSISAEEED 346


>AT5G05800.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 24 plant
           structures; EXPRESSED DURING: 15 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT3G11290.1); Has 881 Blast hits to 512 proteins
           in 30 species: Archae - 0; Bacteria - 2; Metazoa - 0;
           Fungi - 38; Plants - 833; Viruses - 0; Other Eukaryotes
           - 8 (source: NCBI BLink). | chr5:1743234-1744751 REVERSE
           LENGTH=449
          Length = 449

 Score = 96.7 bits (239), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 69/302 (22%), Positives = 124/302 (41%), Gaps = 27/302 (8%)

Query: 13  RANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFK 72
           +A W P   + F++L +     GNK G  FS+  W +++  F    G  YD   LKN + 
Sbjct: 4   KAVWEPEYHRVFVDLCVEQTMLGNKPGTHFSKEGWRNILISFQEQTGAMYDRMQLKNHWD 63

Query: 73  RFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCI 132
              +Q+   + ++      W+   N   A +  W  Y++++P A  +R  V      + I
Sbjct: 64  TMSRQWKIWRRLVETSFMNWNPESNRFRATDDDWANYLQENPDAGQYRLSVPHDLKKLEI 123

Query: 133 IYGHAVAD---------GRYSLSCFDVDFEYEDIASKELDDQTTPSKGVDDQTPPTVINQ 183
           ++     +          +   SC    +E ED  ++ +   + P              Q
Sbjct: 124 LFAGCNVEVKNDEVSGVRKRRRSC----YEEEDEDNQSMCSSSNP--------------Q 165

Query: 184 SKIDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRL 243
           +K  WSP     F++L+V +  KGN+    F+K+ W  +  + N+     Y +  LKN  
Sbjct: 166 TKGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGYTRPQLKNHW 225

Query: 244 NVLIRHYCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPFYSGMC 303
           +   + +     L+G     WD   +   A ++ W+  IR N     +R K +P    + 
Sbjct: 226 DCTRKAWKIWCQLVGASSMKWDPESRSFGATEEEWRIYIRENPRAGQFRHKEVPHADQLA 285

Query: 304 II 305
           II
Sbjct: 286 II 287



 Score = 71.6 bits (174), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 35/129 (27%), Positives = 62/129 (48%)

Query: 13  RANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFK 72
           +  W+PS  + FL+L++    +GN+    F++  W  ++   N   GL Y    LKN + 
Sbjct: 167 KGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGYTRPQLKNHWD 226

Query: 73  RFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCI 132
             RK +     ++     +WD       A E+ W  YI+++P A  FR + VP+ + + I
Sbjct: 227 CTRKAWKIWCQLVGASSMKWDPESRSFGATEEEWRIYIRENPRAGQFRHKEVPHADQLAI 286

Query: 133 IYGHAVADG 141
           I+   +  G
Sbjct: 287 IFNGVIEPG 295


>AT5G05800.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11290.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:1743234-1744751
           REVERSE LENGTH=449
          Length = 449

 Score = 96.7 bits (239), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 69/302 (22%), Positives = 124/302 (41%), Gaps = 27/302 (8%)

Query: 13  RANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFK 72
           +A W P   + F++L +     GNK G  FS+  W +++  F    G  YD   LKN + 
Sbjct: 4   KAVWEPEYHRVFVDLCVEQTMLGNKPGTHFSKEGWRNILISFQEQTGAMYDRMQLKNHWD 63

Query: 73  RFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCI 132
              +Q+   + ++      W+   N   A +  W  Y++++P A  +R  V      + I
Sbjct: 64  TMSRQWKIWRRLVETSFMNWNPESNRFRATDDDWANYLQENPDAGQYRLSVPHDLKKLEI 123

Query: 133 IYGHAVAD---------GRYSLSCFDVDFEYEDIASKELDDQTTPSKGVDDQTPPTVINQ 183
           ++     +          +   SC    +E ED  ++ +   + P              Q
Sbjct: 124 LFAGCNVEVKNDEVSGVRKRRRSC----YEEEDEDNQSMCSSSNP--------------Q 165

Query: 184 SKIDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRL 243
           +K  WSP     F++L+V +  KGN+    F+K+ W  +  + N+     Y +  LKN  
Sbjct: 166 TKGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGYTRPQLKNHW 225

Query: 244 NVLIRHYCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPFYSGMC 303
           +   + +     L+G     WD   +   A ++ W+  IR N     +R K +P    + 
Sbjct: 226 DCTRKAWKIWCQLVGASSMKWDPESRSFGATEEEWRIYIRENPRAGQFRHKEVPHADQLA 285

Query: 304 II 305
           II
Sbjct: 286 II 287



 Score = 71.6 bits (174), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 35/129 (27%), Positives = 62/129 (48%)

Query: 13  RANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFK 72
           +  W+PS  + FL+L++    +GN+    F++  W  ++   N   GL Y    LKN + 
Sbjct: 167 KGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGYTRPQLKNHWD 226

Query: 73  RFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCI 132
             RK +     ++     +WD       A E+ W  YI+++P A  FR + VP+ + + I
Sbjct: 227 CTRKAWKIWCQLVGASSMKWDPESRSFGATEEEWRIYIRENPRAGQFRHKEVPHADQLAI 286

Query: 133 IYGHAVADG 141
           I+   +  G
Sbjct: 287 IFNGVIEPG 295


>AT3G11290.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11310.1); Has 720 Blast hits to 435 proteins
           in 28 species: Archae - 0; Bacteria - 2; Metazoa - 0;
           Fungi - 32; Plants - 682; Viruses - 0; Other Eukaryotes
           - 4 (source: NCBI BLink). | chr3:3535766-3537295 REVERSE
           LENGTH=460
          Length = 460

 Score = 82.0 bits (201), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 64/302 (21%), Positives = 120/302 (39%), Gaps = 13/302 (4%)

Query: 13  RANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFK 72
           +A W P   + F++L +     GN+ G          +++ F    G ++  + LKN + 
Sbjct: 4   KAAWEPEYHRVFVDLCVEQKMLGNQPGT-------QHILKPFLQRTGARFTRNQLKNHWD 56

Query: 73  RFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCI 132
              KQ+     ++     QWD   N   A ++ W  Y+  +P A  +R     +   + +
Sbjct: 57  TMIKQWKIWCRLVQCSDMQWDPQTNTFGANDQDWANYLHVNPEAGQYRLNPPSFLEKLEL 116

Query: 133 IYGHAVADGRYSLSCFDVDFEYEDIASKELDDQTTPSKGVDDQTPPTVIN-QSKIDWSPM 191
           I+  +  D   +        + + IA    +D        D Q+     + QSK  WSP 
Sbjct: 117 IFEDSNLDDEGTSGS-----KRKRIAKHRDEDNDNTGDEEDTQSASNFSSPQSKGYWSPS 171

Query: 192 MDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRLNVLIRHYC 251
               FV+L+  +  KGN+    + K+ W  + E+ N      + +  LKN  +   + + 
Sbjct: 172 SHELFVDLLFQEALKGNRPDSHYPKETWKMILETINQNTGKSFTRPQLKNHWDCTRKSWK 231

Query: 252 SINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCIICRDEAT 311
               ++G     WD   +   A D+ W+  ++ NH    +R K +P    +  I +    
Sbjct: 232 IWCQVIGAPVMKWDATSRTFGATDEDWKNYLKENHRAAPFRRKQLPHADKLATIFKGLIE 291

Query: 312 AG 313
            G
Sbjct: 292 PG 293



 Score = 63.9 bits (154), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 33/145 (22%), Positives = 66/145 (45%), Gaps = 5/145 (3%)

Query: 3   DDDSVSLDNF-----RANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTT 57
           ++D+ S  NF     +  W+PS  + F++L+     +GN+    + +  W  ++E  N  
Sbjct: 150 EEDTQSASNFSSPQSKGYWSPSSHELFVDLLFQEALKGNRPDSHYPKETWKMILETINQN 209

Query: 58  FGLKYDIDVLKNRFKRFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSAR 117
            G  +    LKN +   RK +     +I     +WD       A ++ W  Y+K++  A 
Sbjct: 210 TGKSFTRPQLKNHWDCTRKSWKIWCQVIGAPVMKWDATSRTFGATDEDWKNYLKENHRAA 269

Query: 118 AFRTRVVPYYNDMCIIYGHAVADGR 142
            FR + +P+ + +  I+   +  G+
Sbjct: 270 PFRRKQLPHADKLATIFKGLIEPGK 294


>AT4G02550.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 370 Blast hits to 300 proteins
           in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 10; Plants - 354; Viruses - 0; Other Eukaryotes
           - 6 (source: NCBI BLink). | chr4:1120622-1121629 REVERSE
           LENGTH=307
          Length = 307

 Score = 71.2 bits (173), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 41/154 (26%), Positives = 71/154 (46%), Gaps = 1/154 (0%)

Query: 16  WTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFKRFR 75
           W+   D+  +E +      GNK  K F+ +A+       NT F L        NR K  +
Sbjct: 22  WSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIK 81

Query: 76  KQYIEIKTIISQKGFQWDNALNMI-VAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCIIY 134
           K+Y  ++ I+S+ GF W+++  MI    ++ W  YI  +P A+AFR + +  Y ++  + 
Sbjct: 82  KRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVC 141

Query: 135 GHAVADGRYSLSCFDVDFEYEDIASKELDDQTTP 168
           G     G+Y+    +      D+   E D  + P
Sbjct: 142 GDYQTPGKYNKVKKESSHHLNDVKQFEEDSVSFP 175



 Score = 71.2 bits (173), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 38/138 (27%), Positives = 70/138 (50%), Gaps = 1/138 (0%)

Query: 186 IDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRLNV 245
           + WS  MD   +E +  Q + GNK+ + F+ KA+     + N RF  +       NRL  
Sbjct: 20  VIWSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKT 79

Query: 246 LIRHYCSINALLGKEGFSWDKRQQKV-VADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCI 304
           + + Y  +  +L ++GF W+   + +    D++W++ I VN + + +R K +  Y  +  
Sbjct: 80  IKKRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRT 139

Query: 305 ICRDEATAGCRSNLEKES 322
           +C D  T G  + ++KES
Sbjct: 140 VCGDYQTPGKYNKVKKES 157


>AT4G02550.3 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr4:1120622-1121674 REVERSE LENGTH=322
          Length = 322

 Score = 71.2 bits (173), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 41/154 (26%), Positives = 71/154 (46%), Gaps = 1/154 (0%)

Query: 16  WTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFKRFR 75
           W+   D+  +E +      GNK  K F+ +A+       NT F L        NR K  +
Sbjct: 37  WSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIK 96

Query: 76  KQYIEIKTIISQKGFQWDNALNMI-VAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCIIY 134
           K+Y  ++ I+S+ GF W+++  MI    ++ W  YI  +P A+AFR + +  Y ++  + 
Sbjct: 97  KRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVC 156

Query: 135 GHAVADGRYSLSCFDVDFEYEDIASKELDDQTTP 168
           G     G+Y+    +      D+   E D  + P
Sbjct: 157 GDYQTPGKYNKVKKESSHHLNDVKQFEEDSVSFP 190



 Score = 70.9 bits (172), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 38/138 (27%), Positives = 70/138 (50%), Gaps = 1/138 (0%)

Query: 186 IDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRLNV 245
           + WS  MD   +E +  Q + GNK+ + F+ KA+     + N RF  +       NRL  
Sbjct: 35  VIWSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKT 94

Query: 246 LIRHYCSINALLGKEGFSWDKRQQKV-VADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCI 304
           + + Y  +  +L ++GF W+   + +    D++W++ I VN + + +R K +  Y  +  
Sbjct: 95  IKKRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRT 154

Query: 305 ICRDEATAGCRSNLEKES 322
           +C D  T G  + ++KES
Sbjct: 155 VCGDYQTPGKYNKVKKES 172


>AT2G19220.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11290.1); Has 443 Blast hits to 267 proteins
           in 21 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 17; Plants - 426; Viruses - 0; Other Eukaryotes
           - 0 (source: NCBI BLink). | chr2:8340678-8342161 REVERSE
           LENGTH=439
          Length = 439

 Score = 68.2 bits (165), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 66/307 (21%), Positives = 123/307 (40%), Gaps = 23/307 (7%)

Query: 13  RANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFK 72
           +A W P  D+ F++L +     GN+        A+ +M        G+++ ID L N + 
Sbjct: 4   KAAWEPEHDEVFVDLCVEQKMLGNQPEMQHILEAFQEM--------GVRFTIDQLINHWD 55

Query: 73  RFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCI 132
              KQ+     ++  K  +WD+  N   A ++ W  Y++ +P A  +R     +   + I
Sbjct: 56  TMIKQWKIWCRLVQCKDIKWDSLTNTFGATDQEWANYLEVNPEAGQYRCNPPLFLEKLEI 115

Query: 133 IYGHAVADGRYSLSCFDVDFEYEDIASKELDDQTTPSKGVDDQTPPTVINQSKIDWSPMM 192
           I+     DG  + S      + + I   E  D+   +  V   +   +  +    WSP  
Sbjct: 116 IFAGMNLDGEGTSS----GSKMKQIC--EHRDEENVTGYVPRLSASDIATRRHYKWSPSS 169

Query: 193 DHFFVELMVDQVRKGNK-IGRS--FDKKAWVDMTESFNDRFESHYCKVVLKN---RLNVL 246
               V+    +  KG + I R+  F K++W  + E  N      Y    L+N   R    
Sbjct: 170 HAIVVDTCFQESLKGIRPIKRNHLFTKESWKMILEKINRITGLGYTHKQLENHFTRTRTS 229

Query: 247 IRHYCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCIIC 306
            +H+C     +      WD   +K  A ++ W K + +N   R+++ + +P    +  I 
Sbjct: 230 WKHWCET---IASPIMKWDANTRKFGATEEDWDKYLMINKRARVFKRRHIPHADKLATIF 286

Query: 307 RDEATAG 313
           +     G
Sbjct: 287 KGRIEPG 293



 Score = 61.2 bits (147), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 43/172 (25%), Positives = 72/172 (41%), Gaps = 12/172 (6%)

Query: 16  WTPSQDQYFLELMLSHVHRGNKTGK---VFSRRAWADMIEQFNTTFGLKYDIDVLKNRFK 72
           W+PS     ++       +G +  K   +F++ +W  ++E+ N   GL Y    L+N F 
Sbjct: 165 WSPSSHAIVVDTCFQESLKGIRPIKRNHLFTKESWKMILEKINRITGLGYTHKQLENHFT 224

Query: 73  RFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCI 132
           R R  +      I+    +WD       A E+ WD+Y+  +  AR F+ R +P+ + +  
Sbjct: 225 RTRTSWKHWCETIASPIMKWDANTRKFGATEEDWDKYLMINKRARVFKRRHIPHADKLAT 284

Query: 133 IYGHAVADG-----RYSLSCFDVDFEYEDIASKELDDQTTPSKGVDDQTPPT 179
           I+   +  G     RY     D   E   +     D Q TPS  V +   P 
Sbjct: 285 IFKGRIEPGKTKTRRYRKRVIDHHSESPQLH----DHQPTPSSVVVNTNEPV 332


>AT4G02550.4 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 18 plant
           structures; EXPRESSED DURING: 7 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2). | chr4:1120622-1121629 REVERSE
           LENGTH=278
          Length = 278

 Score = 67.8 bits (164), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 60/121 (49%), Gaps = 1/121 (0%)

Query: 16  WTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFKRFR 75
           W+   D+  +E +      GNK  K F+ +A+       NT F L        NR K  +
Sbjct: 22  WSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIK 81

Query: 76  KQYIEIKTIISQKGFQWDNALNMI-VAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCIIY 134
           K+Y  ++ I+S+ GF W+++  MI    ++ W  YI  +P A+AFR + +  Y ++  + 
Sbjct: 82  KRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVC 141

Query: 135 G 135
           G
Sbjct: 142 G 142



 Score = 67.4 bits (163), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 35/129 (27%), Positives = 64/129 (49%), Gaps = 1/129 (0%)

Query: 186 IDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRLNV 245
           + WS  MD   +E +  Q + GNK+ + F+ KA+     + N RF  +       NRL  
Sbjct: 20  VIWSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKT 79

Query: 246 LIRHYCSINALLGKEGFSWDKRQQKV-VADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCI 304
           + + Y  +  +L ++GF W+   + +    D++W++ I VN + + +R K +  Y  +  
Sbjct: 80  IKKRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRT 139

Query: 305 ICRDEATAG 313
           +C D  T G
Sbjct: 140 VCGDYQTPG 148


>AT4G02550.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 350 Blast hits to 284 proteins
           in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 13; Plants - 331; Viruses - 0; Other Eukaryotes
           - 6 (source: NCBI BLink). | chr4:1120622-1121629 REVERSE
           LENGTH=278
          Length = 278

 Score = 67.8 bits (164), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 60/121 (49%), Gaps = 1/121 (0%)

Query: 16  WTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFKRFR 75
           W+   D+  +E +      GNK  K F+ +A+       NT F L        NR K  +
Sbjct: 22  WSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIK 81

Query: 76  KQYIEIKTIISQKGFQWDNALNMI-VAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCIIY 134
           K+Y  ++ I+S+ GF W+++  MI    ++ W  YI  +P A+AFR + +  Y ++  + 
Sbjct: 82  KRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVC 141

Query: 135 G 135
           G
Sbjct: 142 G 142



 Score = 67.4 bits (163), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 35/129 (27%), Positives = 64/129 (49%), Gaps = 1/129 (0%)

Query: 186 IDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRLNV 245
           + WS  MD   +E +  Q + GNK+ + F+ KA+     + N RF  +       NRL  
Sbjct: 20  VIWSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKT 79

Query: 246 LIRHYCSINALLGKEGFSWDKRQQKV-VADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCI 304
           + + Y  +  +L ++GF W+   + +    D++W++ I VN + + +R K +  Y  +  
Sbjct: 80  IKKRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRT 139

Query: 305 ICRDEATAG 313
           +C D  T G
Sbjct: 140 VCGDYQTPG 148


>AT3G11310.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11290.1); Has 575 Blast hits to 342 proteins
           in 22 species: Archae - 0; Bacteria - 2; Metazoa - 0;
           Fungi - 10; Plants - 559; Viruses - 0; Other Eukaryotes
           - 4 (source: NCBI BLink). | chr3:3542536-3544333 REVERSE
           LENGTH=539
          Length = 539

 Score = 65.9 bits (159), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 42/196 (21%), Positives = 86/196 (43%), Gaps = 10/196 (5%)

Query: 7   VSLDNFRANWTPSQDQYFLELMLSHVHRGNKTGKV-----FSRRAWADMIEQFNTTFGLK 61
           +++  ++A W+ S  + F++L+ +   + N+         +++  W  M+E FN   GL+
Sbjct: 167 ITIPRYKAYWSSSSHEIFVDLLFTESLKENRPKPARRNGYYAKETWNMMVESFNQKTGLR 226

Query: 62  YDIDVLKNRFKRFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRT 121
           Y    LKN +   R  +      +     +WD       A  + W+ Y K++  A  FR 
Sbjct: 227 YTRKQLKNHWNITRDAWRRWCQAVGSPLLKWDANTKTFGATSEDWENYSKENKRAEQFRL 286

Query: 122 RVVPYYNDMCIIYGHAVADGRYSLSCFDVDFEYEDIASKELDDQTTPSKGVD-DQTPPTV 180
           + +P+ + + II+   V  G+ +L  +     +   A +       PS  ++ +++ P  
Sbjct: 287 KHIPHADKLAIIFKGHVEPGKTALRPYRKRVNHHSEAPQ----HPAPSSALNINESVPGS 342

Query: 181 INQSKIDWSPMMDHFF 196
              +  D   +MDH F
Sbjct: 343 EGGADDDHHIVMDHHF 358



 Score = 65.5 bits (158), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 65/308 (21%), Positives = 112/308 (36%), Gaps = 19/308 (6%)

Query: 16  WTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFKRFR 75
           W P   + F++L +     G +   +   R W    E F    G ++  D LKN +    
Sbjct: 8   WEPELHKVFVDLCVEQKMLGFRLPGL--NRIW----ESFVQNTGARFTRDQLKNHWDTML 61

Query: 76  KQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARA--FRTRVVPYYNDMCII 133
           + +     ++     +WD       A  + W  Y + +P A+   FR+   P+  D+ +I
Sbjct: 62  RLWRAWCRLVECSEMKWDPQTKKFGASTEVWTNYFRVNPKAKQYRFRSSPPPFLKDLKMI 121

Query: 134 YGHAVADGRYSLSCFDVDFEYEDIASKELDDQTTPSKGVDDQTPPTVINQSKIDWSPMMD 193
           +           SC       +       +D  T             I + K  WS    
Sbjct: 122 FEGTDLGDEEGTSCGKRKRIPDADNDTGDEDNDTGDDDNYTGDDDITIPRYKAYWSSSSH 181

Query: 194 HFFVELMVDQVRKGNKIGRS-----FDKKAWVDMTESFNDRFESHYCKVVLKNRLNVL-- 246
             FV+L+  +  K N+   +     + K+ W  M ESFN +    Y +  LKN  N+   
Sbjct: 182 EIFVDLLFTESLKENRPKPARRNGYYAKETWNMMVESFNQKTGLRYTRKQLKNHWNITRD 241

Query: 247 -IRHYCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCII 305
             R +C     +G     WD   +   A  + W+   + N     +R+K +P    + II
Sbjct: 242 AWRRWCQA---VGSPLLKWDANTKTFGATSEDWENYSKENKRAEQFRLKHIPHADKLAII 298

Query: 306 CRDEATAG 313
            +     G
Sbjct: 299 FKGHVEPG 306



 Score = 49.3 bits (116), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 31/117 (26%), Positives = 54/117 (46%), Gaps = 6/117 (5%)

Query: 181 INQSKIDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLK 240
           + + K+ W P +   FV+L V+Q   G ++      + W    ESF     + + +  LK
Sbjct: 1   MTREKVMWEPELHKVFVDLCVEQKMLGFRLPGL--NRIW----ESFVQNTGARFTRDQLK 54

Query: 241 NRLNVLIRHYCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMP 297
           N  + ++R + +   L+      WD + +K  A  +VW    RVN   + YR +S P
Sbjct: 55  NHWDTMLRLWRAWCRLVECSEMKWDPQTKKFGASTEVWTNYFRVNPKAKQYRFRSSP 111


>AT1G30140.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G27260.1); Has 313 Blast hits to 256 proteins
           in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 8; Plants - 295; Viruses - 0; Other Eukaryotes -
           10 (source: NCBI BLink). | chr1:10598764-10599527
           FORWARD LENGTH=222
          Length = 222

 Score = 56.2 bits (134), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 32/137 (23%), Positives = 64/137 (46%), Gaps = 7/137 (5%)

Query: 16  WTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFKRFR 75
           WTP +    +EL+  +    + +G +      + ++   N   G   +     +R K  +
Sbjct: 17  WTPDETDVLIELIRQNWR--DSSGIIGKLTVESKLLPALNKRLGCNKNHKNYMSRLKFLK 74

Query: 76  ---KQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCI 132
              + Y+++K   S  GF WD       A ++ W +Y+K HP+ +  +T  + ++ D+ I
Sbjct: 75  NLYQSYLDLKRFSS--GFGWDPETKKFTAPDEVWRDYLKAHPNHKHMQTESIDHFEDLQI 132

Query: 133 IYGHAVADGRYSLSCFD 149
           I+G  VA G +++   D
Sbjct: 133 IFGDVVATGSFAVGMSD 149


>AT5G27260.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G29880.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:9603943-9604930
           FORWARD LENGTH=303
          Length = 303

 Score = 55.1 bits (131), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 29/139 (20%), Positives = 65/139 (46%), Gaps = 7/139 (5%)

Query: 16  WTPSQDQYFLELMLSHVHRG--NKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFKR 73
           W+P + +  ++L++  ++    +  G +         + + N  F    + +   +R K 
Sbjct: 17  WSPEETKLLVQLLVEGINNNWRDSNGTISKLTVETKFMPEINKEFCRSKNYNHYLSRMKY 76

Query: 74  FRKQY---IEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDM 130
            + QY   ++++   S  GF WD       A ++ W +Y+K HP+ +  R     +++++
Sbjct: 77  LKIQYQSCLDLQRFSS--GFGWDPLTKRFTASDEVWSDYLKAHPNNKQLRYDTFEFFDEL 134

Query: 131 CIIYGHAVADGRYSLSCFD 149
            II+G  VA G+ ++   D
Sbjct: 135 QIIFGEGVATGKNAIGLCD 153