Miyakogusa Predicted Gene
- Lj5g3v0658240.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v0658240.1 Non Chatacterized Hit- tr|K4ALK8|K4ALK8_SETIT
Uncharacterized protein OS=Setaria italica
GN=Si039789,38.58,1e-18,Myb_DNA-bind_3,Myb/SANT-like domain;
seg,NULL,CUFF.53634.1
(279 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G05800.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 90 1e-18
AT5G05800.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 90 1e-18
AT3G11290.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 82 4e-16
AT2G24960.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 67 1e-11
AT2G24960.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 67 1e-11
AT3G11310.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 64 9e-11
AT4G02210.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 54 1e-07
AT4G02210.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 54 1e-07
AT2G19220.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 54 1e-07
AT1G30140.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 49 5e-06
>AT5G05800.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 24 plant
structures; EXPRESSED DURING: 15 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT3G11290.1); Has 881 Blast hits to 512 proteins
in 30 species: Archae - 0; Bacteria - 2; Metazoa - 0;
Fungi - 38; Plants - 833; Viruses - 0; Other Eukaryotes
- 8 (source: NCBI BLink). | chr5:1743234-1744751 REVERSE
LENGTH=449
Length = 449
Score = 90.1 bits (222), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 46/138 (33%), Positives = 73/138 (52%), Gaps = 1/138 (0%)
Query: 14 SKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKNKW 73
+K W KL++ L + E KG R + F K+GW +IL N +TG Y +P+LKN W
Sbjct: 166 TKGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGYTRPQLKNHW 225
Query: 74 DNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGEEL 133
D R+ W+ W +L + + WD + A +E W ENP G++R K + ++L
Sbjct: 226 DCTRKAWKIWCQLV-GASSMKWDPESRSFGATEEEWRIYIRENPRAGQFRHKEVPHADQL 284
Query: 134 TIMFKDVVATGKSAWAPT 151
I+F V+ G++ P+
Sbjct: 285 AIIFNGVIEPGETYTPPS 302
Score = 85.1 bits (209), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 68/123 (55%), Gaps = 1/123 (0%)
Query: 15 KATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKNKWD 74
KA W+ + +++V LC+ + G + G+ F+K+GW +IL F TG YD+ +LKN WD
Sbjct: 4 KAVWEPEYHRVFVDLCVEQTMLGNKPGTHFSKEGWRNILISFQEQTGAMYDRMQLKNHWD 63
Query: 75 NFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGEELT 134
R+W+ W +L E + + W+ N A D+ W ENP G+YR ++L
Sbjct: 64 TMSRQWKIWRRLVET-SFMNWNPESNRFRATDDDWANYLQENPDAGQYRLSVPHDLKKLE 122
Query: 135 IMF 137
I+F
Sbjct: 123 ILF 125
>AT5G05800.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11290.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:1743234-1744751
REVERSE LENGTH=449
Length = 449
Score = 90.1 bits (222), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 46/138 (33%), Positives = 73/138 (52%), Gaps = 1/138 (0%)
Query: 14 SKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKNKW 73
+K W KL++ L + E KG R + F K+GW +IL N +TG Y +P+LKN W
Sbjct: 166 TKGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGYTRPQLKNHW 225
Query: 74 DNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGEEL 133
D R+ W+ W +L + + WD + A +E W ENP G++R K + ++L
Sbjct: 226 DCTRKAWKIWCQLV-GASSMKWDPESRSFGATEEEWRIYIRENPRAGQFRHKEVPHADQL 284
Query: 134 TIMFKDVVATGKSAWAPT 151
I+F V+ G++ P+
Sbjct: 285 AIIFNGVIEPGETYTPPS 302
Score = 85.1 bits (209), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 68/123 (55%), Gaps = 1/123 (0%)
Query: 15 KATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKNKWD 74
KA W+ + +++V LC+ + G + G+ F+K+GW +IL F TG YD+ +LKN WD
Sbjct: 4 KAVWEPEYHRVFVDLCVEQTMLGNKPGTHFSKEGWRNILISFQEQTGAMYDRMQLKNHWD 63
Query: 75 NFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGEELT 134
R+W+ W +L E + + W+ N A D+ W ENP G+YR ++L
Sbjct: 64 TMSRQWKIWRRLVET-SFMNWNPESNRFRATDDDWANYLQENPDAGQYRLSVPHDLKKLE 122
Query: 135 IMF 137
I+F
Sbjct: 123 ILF 125
>AT3G11290.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11310.1); Has 720 Blast hits to 435 proteins
in 28 species: Archae - 0; Bacteria - 2; Metazoa - 0;
Fungi - 32; Plants - 682; Viruses - 0; Other Eukaryotes
- 4 (source: NCBI BLink). | chr3:3535766-3537295 REVERSE
LENGTH=460
Length = 460
Score = 82.0 bits (201), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 68/133 (51%), Gaps = 1/133 (0%)
Query: 14 SKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKNKW 73
SK W + +L+V L E KG R S + K+ W IL N +TG+++ +P+LKN W
Sbjct: 164 SKGYWSPSSHELFVDLLFQEALKGNRPDSHYPKETWKMILETINQNTGKSFTRPQLKNHW 223
Query: 74 DNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGEEL 133
D R+ W+ W ++ + WD T A DE W+ EN +R K L ++L
Sbjct: 224 DCTRKSWKIWCQVIGAPV-MKWDATSRTFGATDEDWKNYLKENHRAAPFRRKQLPHADKL 282
Query: 134 TIMFKDVVATGKS 146
+FK ++ GK+
Sbjct: 283 ATIFKGLIEPGKA 295
Score = 69.3 bits (168), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 39/125 (31%), Positives = 64/125 (51%), Gaps = 8/125 (6%)
Query: 15 KATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKNKWD 74
KA W+ + +++V LC+ + G + G+ IL F TG + + +LKN WD
Sbjct: 4 KAAWEPEYHRVFVDLCVEQKMLGNQPGTQ-------HILKPFLQRTGARFTRNQLKNHWD 56
Query: 75 NFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGEELT 134
++W+ W +L + + + WD NT A D+ W NP G+YR SF E+L
Sbjct: 57 TMIKQWKIWCRLVQC-SDMQWDPQTNTFGANDQDWANYLHVNPEAGQYRLNPPSFLEKLE 115
Query: 135 IMFKD 139
++F+D
Sbjct: 116 LIFED 120
>AT2G24960.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 1453 Blast hits to 509 proteins
in 26 species: Archae - 0; Bacteria - 0; Metazoa - 1;
Fungi - 39; Plants - 1363; Viruses - 0; Other Eukaryotes
- 50 (source: NCBI BLink). | chr2:10617263-10620034
FORWARD LENGTH=797
Length = 797
Score = 67.4 bits (163), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 37/134 (27%), Positives = 62/134 (46%), Gaps = 1/134 (0%)
Query: 12 DNSKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKN 71
D ++ W + ++ L L H+G R G +F K+ W +LT FN+ G YDK LK+
Sbjct: 9 DRTRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQYDKDVLKS 68
Query: 72 KWDNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGE 131
++ N +++ K G WD+ TV D W +P Y+ K +
Sbjct: 69 RYTNLWKQYND-VKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTKPVLNFS 127
Query: 132 ELTIMFKDVVATGK 145
+L +++ VA G+
Sbjct: 128 DLCLIYGYTVADGR 141
Score = 65.9 bits (159), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 67/129 (51%), Gaps = 7/129 (5%)
Query: 12 DNSKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKN 71
++SK W + + +V++ + + +G + G++F+K+ WI +L FNA Y K L++
Sbjct: 166 ESSKTEWTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRH 225
Query: 72 KWDNFRREWQAWYKLFE---KETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLS 128
+++ + +YK E KE G WD+ + + A D W+ ++PL YR K L
Sbjct: 226 RYNKLLK----YYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLP 281
Query: 129 FGEELTIMF 137
+L +F
Sbjct: 282 SYNDLDTIF 290
Score = 53.9 bits (128), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 27/98 (27%), Positives = 48/98 (48%), Gaps = 1/98 (1%)
Query: 12 DNSKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKN 71
D ++ W + L + + + G R+G +F W ++T FNA G ++K LKN
Sbjct: 321 DRTRIFWTPPMDYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFGSQHNKDVLKN 380
Query: 72 KWDNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWW 109
++ + RR + K ++ G WD ++ V A D+ W
Sbjct: 381 RYKHLRRLYND-IKFLLEQNGFSWDARRDMVIADDDIW 417
Score = 52.0 bits (123), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 52/105 (49%), Gaps = 2/105 (1%)
Query: 27 VKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKNKWDNFRREWQAWYKL 86
+ L L + +G ++G +FT++ W + FNA G D L+N++ +E +
Sbjct: 543 IDLMLEQVSRGNKIGETFTEQAWADMAESFNAKFGLQTDMFMLENRYILLMKERDDINNI 602
Query: 87 FEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGL-SFG 130
+ G WD K T+ A DE+WE E+P Y+ K L S+G
Sbjct: 603 LNLD-GFTWDVEKQTIVAEDEYWEAYIKEHPDATIYKGKTLDSYG 646
>AT2G24960.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 21 plant
structures; EXPRESSED DURING: 12 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr2:10617263-10620034 FORWARD LENGTH=774
Length = 774
Score = 67.4 bits (163), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 37/134 (27%), Positives = 62/134 (46%), Gaps = 1/134 (0%)
Query: 12 DNSKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKN 71
D ++ W + ++ L L H+G R G +F K+ W +LT FN+ G YDK LK+
Sbjct: 9 DRTRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQYDKDVLKS 68
Query: 72 KWDNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGE 131
++ N +++ K G WD+ TV D W +P Y+ K +
Sbjct: 69 RYTNLWKQYND-VKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTKPVLNFS 127
Query: 132 ELTIMFKDVVATGK 145
+L +++ VA G+
Sbjct: 128 DLCLIYGYTVADGR 141
Score = 65.9 bits (159), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 67/129 (51%), Gaps = 7/129 (5%)
Query: 12 DNSKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKN 71
++SK W + + +V++ + + +G + G++F+K+ WI +L FNA Y K L++
Sbjct: 166 ESSKTEWTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRH 225
Query: 72 KWDNFRREWQAWYKLFE---KETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLS 128
+++ + +YK E KE G WD+ + + A D W+ ++PL YR K L
Sbjct: 226 RYNKLLK----YYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLP 281
Query: 129 FGEELTIMF 137
+L +F
Sbjct: 282 SYNDLDTIF 290
Score = 59.3 bits (142), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 34/134 (25%), Positives = 60/134 (44%), Gaps = 1/134 (0%)
Query: 12 DNSKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKN 71
D ++ W + L + + + G R+G +F W ++T FNA G ++K LKN
Sbjct: 321 DRTRIFWTPPMDYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFGSQHNKDVLKN 380
Query: 72 KWDNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGE 131
++ + RR + K ++ G WD ++ V A D+ W +P YR K +
Sbjct: 381 RYKHLRRLYND-IKFLLEQNGFSWDARRDMVIADDDIWNTYIQAHPEARSYRVKTIPSYP 439
Query: 132 ELTIMFKDVVATGK 145
L +F + G+
Sbjct: 440 NLCFIFGKETSDGR 453
Score = 52.0 bits (123), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 52/105 (49%), Gaps = 2/105 (1%)
Query: 27 VKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKNKWDNFRREWQAWYKL 86
+ L L + +G ++G +FT++ W + FNA G D L+N++ +E +
Sbjct: 520 IDLMLEQVSRGNKIGETFTEQAWADMAESFNAKFGLQTDMFMLENRYILLMKERDDINNI 579
Query: 87 FEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGL-SFG 130
+ G WD K T+ A DE+WE E+P Y+ K L S+G
Sbjct: 580 LNLD-GFTWDVEKQTIVAEDEYWEAYIKEHPDATIYKGKTLDSYG 623
>AT3G11310.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11290.1); Has 575 Blast hits to 342 proteins
in 22 species: Archae - 0; Bacteria - 2; Metazoa - 0;
Fungi - 10; Plants - 559; Viruses - 0; Other Eukaryotes
- 4 (source: NCBI BLink). | chr3:3542536-3544333 REVERSE
LENGTH=539
Length = 539
Score = 64.3 bits (155), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 63/141 (44%), Gaps = 6/141 (4%)
Query: 15 KATWDFQATKLYVKLCLAE-----HHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKL 69
KA W + +++V L E K R + K+ W ++ FN TG Y + +L
Sbjct: 173 KAYWSSSSHEIFVDLLFTESLKENRPKPARRNGYYAKETWNMMVESFNQKTGLRYTRKQL 232
Query: 70 KNKWDNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSF 129
KN W+ R W+ W + L WD T A E WE EN ++R K +
Sbjct: 233 KNHWNITRDAWRRWCQAVGSPL-LKWDANTKTFGATSEDWENYSKENKRAEQFRLKHIPH 291
Query: 130 GEELTIMFKDVVATGKSAWAP 150
++L I+FK V GK+A P
Sbjct: 292 ADKLAIIFKGHVEPGKTALRP 312
Score = 55.1 bits (131), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 59/130 (45%), Gaps = 9/130 (6%)
Query: 11 MDNSKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLK 70
M K W+ + K++V LC+ + G RL G I F +TG + + +LK
Sbjct: 1 MTREKVMWEPELHKVFVDLCVEQKMLGFRL------PGLNRIWESFVQNTGARFTRDQLK 54
Query: 71 NKWDNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKG--LS 128
N WD R W+AW +L E + + WD A E W NP +YR +
Sbjct: 55 NHWDTMLRLWRAWCRLVEC-SEMKWDPQTKKFGASTEVWTNYFRVNPKAKQYRFRSSPPP 113
Query: 129 FGEELTIMFK 138
F ++L ++F+
Sbjct: 114 FLKDLKMIFE 123
>AT4G02210.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 22 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G24960.2). | chr4:974320-975917 REVERSE
LENGTH=439
Length = 439
Score = 53.9 bits (128), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 27/126 (21%), Positives = 59/126 (46%), Gaps = 1/126 (0%)
Query: 14 SKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKNKW 73
+ TW + ++ L L + +G ++ F K+ W ++ FNA N+D LKN++
Sbjct: 182 CRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNRY 241
Query: 74 DNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGEEL 133
+ RR++ A + + G WD + V A + W+ + ++ + + + ++L
Sbjct: 242 KSLRRQFNAIKSIL-RSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKDL 300
Query: 134 TIMFKD 139
++ D
Sbjct: 301 CVLCGD 306
>AT4G02210.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G24960.2); Has 791 Blast hits to 465 proteins
in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 17; Plants - 748; Viruses - 0; Other Eukaryotes
- 26 (source: NCBI BLink). | chr4:974320-975917 REVERSE
LENGTH=439
Length = 439
Score = 53.9 bits (128), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 27/126 (21%), Positives = 59/126 (46%), Gaps = 1/126 (0%)
Query: 14 SKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKNKW 73
+ TW + ++ L L + +G ++ F K+ W ++ FNA N+D LKN++
Sbjct: 182 CRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNRY 241
Query: 74 DNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGEEL 133
+ RR++ A + + G WD + V A + W+ + ++ + + + ++L
Sbjct: 242 KSLRRQFNAIKSIL-RSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKDL 300
Query: 134 TIMFKD 139
++ D
Sbjct: 301 CVLCGD 306
>AT2G19220.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11290.1); Has 443 Blast hits to 267 proteins
in 21 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 17; Plants - 426; Viruses - 0; Other Eukaryotes
- 0 (source: NCBI BLink). | chr2:8340678-8342161 REVERSE
LENGTH=439
Length = 439
Score = 53.9 bits (128), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 57/124 (45%), Gaps = 9/124 (7%)
Query: 14 SKATWDFQATKLYVKLCLAEHHKGERLGSSFTKKGWISILTKFNASTGRNYDKPKLKNKW 73
SKA W+ + +++V LC+ + G + IL F G + +L N W
Sbjct: 3 SKAAWEPEHDEVFVDLCVEQKMLGNQPEMQ-------HILEAFQ-EMGVRFTIDQLINHW 54
Query: 74 DNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGEEL 133
D ++W+ W +L + + + WD NT A D+ W NP G+YR F E+L
Sbjct: 55 DTMIKQWKIWCRLVQCK-DIKWDSLTNTFGATDQEWANYLEVNPEAGQYRCNPPLFLEKL 113
Query: 134 TIMF 137
I+F
Sbjct: 114 EIIF 117
Score = 52.0 bits (123), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 35/132 (26%), Positives = 56/132 (42%), Gaps = 4/132 (3%)
Query: 18 WDFQATKLYVKLCLAEHHKGER---LGSSFTKKGWISILTKFNASTGRNYDKPKLKNKWD 74
W + + V C E KG R FTK+ W IL K N TG Y +L+N +
Sbjct: 165 WSPSSHAIVVDTCFQESLKGIRPIKRNHLFTKESWKMILEKINRITGLGYTHKQLENHFT 224
Query: 75 NFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEKKQLENPLYGKYREKGLSFGEELT 134
R W+ W + + WD A +E W+K + N ++ + + ++L
Sbjct: 225 RTRTSWKHWCETIASPI-MKWDANTRKFGATEEDWDKYLMINKRARVFKRRHIPHADKLA 283
Query: 135 IMFKDVVATGKS 146
+FK + GK+
Sbjct: 284 TIFKGRIEPGKT 295
>AT1G30140.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G27260.1); Has 313 Blast hits to 256 proteins
in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 8; Plants - 295; Viruses - 0; Other Eukaryotes -
10 (source: NCBI BLink). | chr1:10598764-10599527
FORWARD LENGTH=222
Length = 222
Score = 48.5 bits (114), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 45/105 (42%)
Query: 52 ILTKFNASTGRNYDKPKLKNKWDNFRREWQAWYKLFEKETGLGWDKAKNTVDAPDEWWEK 111
+L N G N + ++ + +Q++ L +G GWD APDE W
Sbjct: 49 LLPALNKRLGCNKNHKNYMSRLKFLKNLYQSYLDLKRFSSGFGWDPETKKFTAPDEVWRD 108
Query: 112 KQLENPLYGKYREKGLSFGEELTIMFKDVVATGKSAWAPTSGILP 156
+P + + + + E+L I+F DVVATG A + P
Sbjct: 109 YLKAHPNHKHMQTESIDHFEDLQIIFGDVVATGSFAVGMSDSTCP 153