Miyakogusa Predicted Gene

Lj5g3v0658240.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v0658240.1 Non Characterized Hit- tr|K4ALK8|K4ALK8_SETIT
Uncharacterized protein OS=Setaria italica
GN=Si039789,38.58,1e-18,Myb_DNA-bind_3,Myb/SANT-like domain;
seg,NULL,CUFF.53634.1
         (279 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G05800.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    90   1e-18
AT5G05800.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    90   1e-18
AT3G11290.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    82   4e-16
AT2G24960.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    67   1e-11
AT2G24960.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    67   1e-11
AT3G11310.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    64   9e-11
AT4G02210.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    54   1e-07
AT4G02210.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    54   1e-07
AT2G19220.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    54   1e-07
AT1G30140.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    49   5e-06
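
Note: each summary line above ends with two whitespace-separated fields, the bit score and the E value, preceded by the subject identifier and a truncated description. The following is a minimal Python sketch for reading such lines, assuming that layout holds for every hit. The function name parse_summary_line, the SAMPLE_LINES list, and the 1e-10 cutoff are illustrative choices and are not part of the BLAST output or of any BLAST library. The handling of E values written without a leading digit (for example "e-100") is included as a precaution for legacy text output; no such value occurs in this report.

```python
def parse_summary_line(line):
    """Return (subject_id, bit_score, e_value) from one hit-summary line.

    Assumes the first token is the subject identifier and the last two
    tokens are the bit score and the E value.
    """
    fields = line.split()
    subject_id = fields[0]
    bit_score = float(fields[-2])
    e_text = fields[-1]
    # Precaution: some legacy text reports print "e-100" instead of "1e-100".
    if e_text.startswith("e"):
        e_text = "1" + e_text
    return subject_id, bit_score, float(e_text)


# Three lines copied from the summary table of this report.
SAMPLE_LINES = [
    "AT5G05800.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    90   1e-18",
    "AT3G11310.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    64   9e-11",
    "AT1G30140.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    49   5e-06",
]

hits = [parse_summary_line(line) for line in SAMPLE_LINES]

# Keep only hits at or below an illustrative E-value cutoff.
E_VALUE_CUTOFF = 1e-10
strong_hits = [hit for hit in hits if hit[2] <= E_VALUE_CUTOFF]

for subject_id, bit_score, e_value in strong_hits:
    print(subject_id, bit_score, e_value)
```

Splitting on whitespace rather than on fixed columns keeps the sketch independent of how the description is truncated, at the cost of ignoring the description text entirely.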

>AT5G05800.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 24 plant
           structures; EXPRESSED DURING: 15 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT3G11290.1); Has 881 Blast hits to 512 proteins
           in 30 species: Archae - 0; Bacteria - 2; Metazoa - 0;
           Fungi - 38; Plants - 833; Viruses - 0; Other Eukaryotes
           - 8 (source: NCBI BLink). | chr5:1743234-1744751 REVERSE
           LENGTH=449
          Length = 449

 Score = 90.1 bits (222), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 46/138 (33%), Positives = 73/138 (52%), Gaps = 1/138 (0%)

Query: 14  SKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKNKW 73
           +K  W     KL++ L + E  KG R  + F K+GW +IL   N +TG  Y +P+LKN W
Sbjct: 166 TKGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGYTRPQLKNHW 225

Query: 74  DNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGEEL 133
           D  R+ W+ W +L    + + WD    +  A +E W     ENP  G++R K +   ++L
Sbjct: 226 DCTRKAWKIWCQLV-GASSMKWDPESRSFGATEEEWRIYIRENPRAGQFRHKEVPHADQL 284

Query: 134 TIMFKDVVATGKSAWAPT 151
            I+F  V+  G++   P+
Sbjct: 285 AIIFNGVIEPGETYTPPS 302



 Score = 85.1 bits (209), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 43/123 (34%), Positives = 68/123 (55%), Gaps = 1/123 (0%)

Query: 15  KATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKNKWD 74
           KA W+ +  +++V LC+ +   G + G+ F+K+GW +IL  F   TG  YD+ +LKN WD
Sbjct: 4   KAVWEPEYHRVFVDLCVEQTMLGNKPGTHFSKEGWRNILISFQEQTGAMYDRMQLKNHWD 63

Query: 75  NFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGEELT 134
              R+W+ W +L E  + + W+   N   A D+ W     ENP  G+YR       ++L 
Sbjct: 64  TMSRQWKIWRRLVET-SFMNWNPESNRFRATDDDWANYLQENPDAGQYRLSVPHDLKKLE 122

Query: 135 IMF 137
           I+F
Sbjct: 123 ILF 125


>AT5G05800.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11290.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:1743234-1744751
           REVERSE LENGTH=449
          Length = 449

 Score = 90.1 bits (222), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 46/138 (33%), Positives = 73/138 (52%), Gaps = 1/138 (0%)

Query: 14  SKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKNKW 73
           +K  W     KL++ L + E  KG R  + F K+GW +IL   N +TG  Y +P+LKN W
Sbjct: 166 TKGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGYTRPQLKNHW 225

Query: 74  DNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGEEL 133
           D  R+ W+ W +L    + + WD    +  A +E W     ENP  G++R K +   ++L
Sbjct: 226 DCTRKAWKIWCQLV-GASSMKWDPESRSFGATEEEWRIYIRENPRAGQFRHKEVPHADQL 284

Query: 134 TIMFKDVVATGKSAWAPT 151
            I+F  V+  G++   P+
Sbjct: 285 AIIFNGVIEPGETYTPPS 302



 Score = 85.1 bits (209), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 43/123 (34%), Positives = 68/123 (55%), Gaps = 1/123 (0%)

Query: 15  KATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKNKWD 74
           KA W+ +  +++V LC+ +   G + G+ F+K+GW +IL  F   TG  YD+ +LKN WD
Sbjct: 4   KAVWEPEYHRVFVDLCVEQTMLGNKPGTHFSKEGWRNILISFQEQTGAMYDRMQLKNHWD 63

Query: 75  NFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGEELT 134
              R+W+ W +L E  + + W+   N   A D+ W     ENP  G+YR       ++L 
Sbjct: 64  TMSRQWKIWRRLVET-SFMNWNPESNRFRATDDDWANYLQENPDAGQYRLSVPHDLKKLE 122

Query: 135 IMF 137
           I+F
Sbjct: 123 ILF 125


>AT3G11290.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11310.1); Has 720 Blast hits to 435 proteins
           in 28 species: Archae - 0; Bacteria - 2; Metazoa - 0;
           Fungi - 32; Plants - 682; Viruses - 0; Other Eukaryotes
           - 4 (source: NCBI BLink). | chr3:3535766-3537295 REVERSE
           LENGTH=460
          Length = 460

 Score = 82.0 bits (201), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 44/133 (33%), Positives = 68/133 (51%), Gaps = 1/133 (0%)

Query: 14  SKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKNKW 73
           SK  W   + +L+V L   E  KG R  S + K+ W  IL   N +TG+++ +P+LKN W
Sbjct: 164 SKGYWSPSSHELFVDLLFQEALKGNRPDSHYPKETWKMILETINQNTGKSFTRPQLKNHW 223

Query: 74  DNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGEEL 133
           D  R+ W+ W ++      + WD    T  A DE W+    EN     +R K L   ++L
Sbjct: 224 DCTRKSWKIWCQVIGAPV-MKWDATSRTFGATDEDWKNYLKENHRAAPFRRKQLPHADKL 282

Query: 134 TIMFKDVVATGKS 146
             +FK ++  GK+
Sbjct: 283 ATIFKGLIEPGKA 295



 Score = 69.3 bits (168), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 39/125 (31%), Positives = 64/125 (51%), Gaps = 8/125 (6%)

Query: 15  KATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKNKWD 74
           KA W+ +  +++V LC+ +   G + G+         IL  F   TG  + + +LKN WD
Sbjct: 4   KAAWEPEYHRVFVDLCVEQKMLGNQPGTQ-------HILKPFLQRTGARFTRNQLKNHWD 56

Query: 75  NFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGEELT 134
              ++W+ W +L +  + + WD   NT  A D+ W      NP  G+YR    SF E+L 
Sbjct: 57  TMIKQWKIWCRLVQC-SDMQWDPQTNTFGANDQDWANYLHVNPEAGQYRLNPPSFLEKLE 115

Query: 135 IMFKD 139
           ++F+D
Sbjct: 116 LIFED 120


>AT2G24960.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 1453 Blast hits to 509 proteins
           in 26 species: Archae - 0; Bacteria - 0; Metazoa - 1;
           Fungi - 39; Plants - 1363; Viruses - 0; Other Eukaryotes
           - 50 (source: NCBI BLink). | chr2:10617263-10620034
           FORWARD LENGTH=797
          Length = 797

 Score = 67.4 bits (163), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 37/134 (27%), Positives = 62/134 (46%), Gaps = 1/134 (0%)

Query: 12  DNSKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKN 71
           D ++  W     + ++ L L   H+G R G +F K+ W  +LT FN+  G  YDK  LK+
Sbjct: 9   DRTRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQYDKDVLKS 68

Query: 72  KWDNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGE 131
           ++ N  +++    K      G  WD+   TV   D  W      +P    Y+ K +    
Sbjct: 69  RYTNLWKQYND-VKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTKPVLNFS 127

Query: 132 ELTIMFKDVVATGK 145
           +L +++   VA G+
Sbjct: 128 DLCLIYGYTVADGR 141



 Score = 65.9 bits (159), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 36/129 (27%), Positives = 67/129 (51%), Gaps = 7/129 (5%)

Query: 12  DNSKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKN 71
           ++SK  W  +  + +V++ + +  +G + G++F+K+ WI +L  FNA     Y K  L++
Sbjct: 166 ESSKTEWTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRH 225

Query: 72  KWDNFRREWQAWYKLFE---KETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLS 128
           +++   +    +YK  E   KE G  WD+ +  + A D  W+    ++PL   YR K L 
Sbjct: 226 RYNKLLK----YYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLP 281

Query: 129 FGEELTIMF 137
              +L  +F
Sbjct: 282 SYNDLDTIF 290



 Score = 53.9 bits (128), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 27/98 (27%), Positives = 48/98 (48%), Gaps = 1/98 (1%)

Query: 12  DNSKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKN 71
           D ++  W        + L + + + G R+G +F    W  ++T FNA  G  ++K  LKN
Sbjct: 321 DRTRIFWTPPMDYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFGSQHNKDVLKN 380

Query: 72  KWDNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWW 109
           ++ + RR +    K   ++ G  WD  ++ V A D+ W
Sbjct: 381 RYKHLRRLYND-IKFLLEQNGFSWDARRDMVIADDDIW 417



 Score = 52.0 bits (123), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 52/105 (49%), Gaps = 2/105 (1%)

Query: 27  VKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKNKWDNFRREWQAWYKL 86
           + L L +  +G ++G +FT++ W  +   FNA  G   D   L+N++    +E      +
Sbjct: 543 IDLMLEQVSRGNKIGETFTEQAWADMAESFNAKFGLQTDMFMLENRYILLMKERDDINNI 602

Query: 87  FEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGL-SFG 130
              + G  WD  K T+ A DE+WE    E+P    Y+ K L S+G
Sbjct: 603 LNLD-GFTWDVEKQTIVAEDEYWEAYIKEHPDATIYKGKTLDSYG 646


>AT2G24960.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 21 plant
           structures; EXPRESSED DURING: 12 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr2:10617263-10620034 FORWARD LENGTH=774
          Length = 774

 Score = 67.4 bits (163), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 37/134 (27%), Positives = 62/134 (46%), Gaps = 1/134 (0%)

Query: 12  DNSKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKN 71
           D ++  W     + ++ L L   H+G R G +F K+ W  +LT FN+  G  YDK  LK+
Sbjct: 9   DRTRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQYDKDVLKS 68

Query: 72  KWDNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGE 131
           ++ N  +++    K      G  WD+   TV   D  W      +P    Y+ K +    
Sbjct: 69  RYTNLWKQYND-VKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTKPVLNFS 127

Query: 132 ELTIMFKDVVATGK 145
           +L +++   VA G+
Sbjct: 128 DLCLIYGYTVADGR 141



 Score = 65.9 bits (159), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 36/129 (27%), Positives = 67/129 (51%), Gaps = 7/129 (5%)

Query: 12  DNSKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKN 71
           ++SK  W  +  + +V++ + +  +G + G++F+K+ WI +L  FNA     Y K  L++
Sbjct: 166 ESSKTEWTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRH 225

Query: 72  KWDNFRREWQAWYKLFE---KETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLS 128
           +++   +    +YK  E   KE G  WD+ +  + A D  W+    ++PL   YR K L 
Sbjct: 226 RYNKLLK----YYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLP 281

Query: 129 FGEELTIMF 137
              +L  +F
Sbjct: 282 SYNDLDTIF 290



 Score = 59.3 bits (142), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 34/134 (25%), Positives = 60/134 (44%), Gaps = 1/134 (0%)

Query: 12  DNSKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKN 71
           D ++  W        + L + + + G R+G +F    W  ++T FNA  G  ++K  LKN
Sbjct: 321 DRTRIFWTPPMDYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFGSQHNKDVLKN 380

Query: 72  KWDNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGE 131
           ++ + RR +    K   ++ G  WD  ++ V A D+ W      +P    YR K +    
Sbjct: 381 RYKHLRRLYND-IKFLLEQNGFSWDARRDMVIADDDIWNTYIQAHPEARSYRVKTIPSYP 439

Query: 132 ELTIMFKDVVATGK 145
            L  +F    + G+
Sbjct: 440 NLCFIFGKETSDGR 453



 Score = 52.0 bits (123), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 52/105 (49%), Gaps = 2/105 (1%)

Query: 27  VKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKNKWDNFRREWQAWYKL 86
           + L L +  +G ++G +FT++ W  +   FNA  G   D   L+N++    +E      +
Sbjct: 520 IDLMLEQVSRGNKIGETFTEQAWADMAESFNAKFGLQTDMFMLENRYILLMKERDDINNI 579

Query: 87  FEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGL-SFG 130
              + G  WD  K T+ A DE+WE    E+P    Y+ K L S+G
Sbjct: 580 LNLD-GFTWDVEKQTIVAEDEYWEAYIKEHPDATIYKGKTLDSYG 623


>AT3G11310.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11290.1); Has 575 Blast hits to 342 proteins
           in 22 species: Archae - 0; Bacteria - 2; Metazoa - 0;
           Fungi - 10; Plants - 559; Viruses - 0; Other Eukaryotes
           - 4 (source: NCBI BLink). | chr3:3542536-3544333 REVERSE
           LENGTH=539
          Length = 539

 Score = 64.3 bits (155), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 43/141 (30%), Positives = 63/141 (44%), Gaps = 6/141 (4%)

Query: 15  KATWDFQATKLYVKLCLAE-----HHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKL 69
           KA W   + +++V L   E       K  R    + K+ W  ++  FN  TG  Y + +L
Sbjct: 173 KAYWSSSSHEIFVDLLFTESLKENRPKPARRNGYYAKETWNMMVESFNQKTGLRYTRKQL 232

Query: 70  KNKWDNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSF 129
           KN W+  R  W+ W +       L WD    T  A  E WE    EN    ++R K +  
Sbjct: 233 KNHWNITRDAWRRWCQAVGSPL-LKWDANTKTFGATSEDWENYSKENKRAEQFRLKHIPH 291

Query: 130 GEELTIMFKDVVATGKSAWAP 150
            ++L I+FK  V  GK+A  P
Sbjct: 292 ADKLAIIFKGHVEPGKTALRP 312



 Score = 55.1 bits (131), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 38/130 (29%), Positives = 59/130 (45%), Gaps = 9/130 (6%)

Query: 11  MDNSKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLK 70
           M   K  W+ +  K++V LC+ +   G RL       G   I   F  +TG  + + +LK
Sbjct: 1   MTREKVMWEPELHKVFVDLCVEQKMLGFRL------PGLNRIWESFVQNTGARFTRDQLK 54

Query: 71  NKWDNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKG--LS 128
           N WD   R W+AW +L E  + + WD       A  E W      NP   +YR +     
Sbjct: 55  NHWDTMLRLWRAWCRLVEC-SEMKWDPQTKKFGASTEVWTNYFRVNPKAKQYRFRSSPPP 113

Query: 129 FGEELTIMFK 138
           F ++L ++F+
Sbjct: 114 FLKDLKMIFE 123


>AT4G02210.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 22 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT2G24960.2). | chr4:974320-975917 REVERSE
           LENGTH=439
          Length = 439

 Score = 53.9 bits (128), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 27/126 (21%), Positives = 59/126 (46%), Gaps = 1/126 (0%)

Query: 14  SKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKNKW 73
            + TW     + ++ L L +  +G ++   F K+ W  ++  FNA    N+D   LKN++
Sbjct: 182 CRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNRY 241

Query: 74  DNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGEEL 133
            + RR++ A   +  +  G  WD  +  V A +  W+     +    ++  + + + ++L
Sbjct: 242 KSLRRQFNAIKSIL-RSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKDL 300

Query: 134 TIMFKD 139
            ++  D
Sbjct: 301 CVLCGD 306


>AT4G02210.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G24960.2); Has 791 Blast hits to 465 proteins
           in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 17; Plants - 748; Viruses - 0; Other Eukaryotes
           - 26 (source: NCBI BLink). | chr4:974320-975917 REVERSE
           LENGTH=439
          Length = 439

 Score = 53.9 bits (128), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 27/126 (21%), Positives = 59/126 (46%), Gaps = 1/126 (0%)

Query: 14  SKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKNKW 73
            + TW     + ++ L L +  +G ++   F K+ W  ++  FNA    N+D   LKN++
Sbjct: 182 CRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNRY 241

Query: 74  DNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGEEL 133
            + RR++ A   +  +  G  WD  +  V A +  W+     +    ++  + + + ++L
Sbjct: 242 KSLRRQFNAIKSIL-RSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKDL 300

Query: 134 TIMFKD 139
            ++  D
Sbjct: 301 CVLCGD 306


>AT2G19220.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11290.1); Has 443 Blast hits to 267 proteins
           in 21 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 17; Plants - 426; Viruses - 0; Other Eukaryotes
           - 0 (source: NCBI BLink). | chr2:8340678-8342161 REVERSE
           LENGTH=439
          Length = 439

 Score = 53.9 bits (128), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 57/124 (45%), Gaps = 9/124 (7%)

Query: 14  SKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKNKW 73
           SKA W+ +  +++V LC+ +   G +            IL  F    G  +   +L N W
Sbjct: 3   SKAAWEPEHDEVFVDLCVEQKMLGNQPEMQ-------HILEAFQ-EMGVRFTIDQLINHW 54

Query: 74  DNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGEEL 133
           D   ++W+ W +L + +  + WD   NT  A D+ W      NP  G+YR     F E+L
Sbjct: 55  DTMIKQWKIWCRLVQCK-DIKWDSLTNTFGATDQEWANYLEVNPEAGQYRCNPPLFLEKL 113

Query: 134 TIMF 137
            I+F
Sbjct: 114 EIIF 117



 Score = 52.0 bits (123), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 35/132 (26%), Positives = 56/132 (42%), Gaps = 4/132 (3%)

Query: 18  WDFQATKLYVKLCLAEHHKGER---LGSSFTKKGWISILTKFNASTGRNYDKPKLKNKWD 74
           W   +  + V  C  E  KG R       FTK+ W  IL K N  TG  Y   +L+N + 
Sbjct: 165 WSPSSHAIVVDTCFQESLKGIRPIKRNHLFTKESWKMILEKINRITGLGYTHKQLENHFT 224

Query: 75  NFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGEELT 134
             R  W+ W +       + WD       A +E W+K  + N     ++ + +   ++L 
Sbjct: 225 RTRTSWKHWCETIASPI-MKWDANTRKFGATEEDWDKYLMINKRARVFKRRHIPHADKLA 283

Query: 135 IMFKDVVATGKS 146
            +FK  +  GK+
Sbjct: 284 TIFKGRIEPGKT 295


>AT1G30140.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G27260.1); Has 313 Blast hits to 256 proteins
           in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 8; Plants - 295; Viruses - 0; Other Eukaryotes -
           10 (source: NCBI BLink). | chr1:10598764-10599527
           FORWARD LENGTH=222
          Length = 222

 Score = 48.5 bits (114), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 45/105 (42%)

Query: 52  ILTKFNASTGRNYDKPKLKNKWDNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEK 111
           +L   N   G N +     ++    +  +Q++  L    +G GWD       APDE W  
Sbjct: 49  LLPALNKRLGCNKNHKNYMSRLKFLKNLYQSYLDLKRFSSGFGWDPETKKFTAPDEVWRD 108

Query: 112 KQLENPLYGKYREKGLSFGEELTIMFKDVVATGKSAWAPTSGILP 156
               +P +   + + +   E+L I+F DVVATG  A   +    P
Sbjct: 109 YLKAHPNHKHMQTESIDHFEDLQIIFGDVVATGSFAVGMSDSTCP 153