Miyakogusa Predicted Gene
- Lj5g3v0843440.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v0843440.1 Non Characterized Hit- tr|D7U577|D7U577_VITVI
Putative uncharacterized protein OS=Vitis vinifera
GN=,35.34,1e-18,Myb_DNA-bind_3,Myb/SANT-like domain,CUFF.54062.1
(441 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                     Score  E
Sequences producing significant alignments:                         (bits)  Value
AT2G24960.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...     262  4e-70
AT2G24960.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     261  6e-70
AT4G02210.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...     227  1e-59
AT4G02210.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     227  1e-59
AT5G05800.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...      97  3e-20
AT5G05800.1 | Symbols: | unknown protein; BEST Arabidopsis thal...      97  3e-20
AT3G11290.1 | Symbols: | unknown protein; BEST Arabidopsis thal...      82  8e-16
AT4G02550.1 | Symbols: | unknown protein; BEST Arabidopsis thal...      71  1e-12
AT4G02550.3 | Symbols: | unknown protein; BEST Arabidopsis thal...      71  1e-12
AT2G19220.1 | Symbols: | unknown protein; BEST Arabidopsis thal...      68  1e-11
AT4G02550.4 | Symbols: | unknown protein; FUNCTIONS IN: molecul...      68  1e-11
AT4G02550.2 | Symbols: | unknown protein; BEST Arabidopsis thal...      68  1e-11
AT3G11310.1 | Symbols: | unknown protein; BEST Arabidopsis thal...      66  6e-11
AT1G30140.1 | Symbols: | unknown protein; BEST Arabidopsis thal...      56  4e-08
AT5G27260.1 | Symbols: | unknown protein; BEST Arabidopsis thal...      55  1e-07
>AT2G24960.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 21 plant
structures; EXPRESSED DURING: 12 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr2:10617263-10620034 FORWARD LENGTH=774
Length = 774
Score = 262 bits (669), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 122/304 (40%), Positives = 191/304 (62%), Gaps = 15/304 (4%)
Query: 10 DNFRANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKN 69
D R WTP+ +++F++LML H+HRGN+TG F+++AW +M+ FN+ FG +YD DVLK+
Sbjct: 9 DRTRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQYDKDVLKS 68
Query: 70 RFKRFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYND 129
R+ KQY ++K ++ GF WD ++ + W Y+K HP AR ++T+ V ++D
Sbjct: 69 RYTNLWKQYNDVKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTKPVLNFSD 128
Query: 130 MCIIYGHAVADGRYSLSCFDVDFEYEDIASKELDDQTTPSKGVDDQTPPTVINQSKIDWS 189
+C+IYG+ VADGRYS+S D++ E E++ ++ G + SK +W+
Sbjct: 129 LCLIYGYTVADGRYSMSSHDLEIE------DEINGESVVLSGKE---------SSKTEWT 173
Query: 190 PMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRLNVLIRH 249
MD +FVE+MVDQ+ +GNK G +F K+AW+DM FN RF Y K VL++R N L+++
Sbjct: 174 LEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRHRYNKLLKY 233
Query: 250 YCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCIICRDE 309
Y + A+L ++GFSWD+ + + ADD VW I+ + R YR+KS+P Y+ + I +
Sbjct: 234 YKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLPSYNDLDTIFACQ 293
Query: 310 ATAG 313
A G
Sbjct: 294 AEQG 297
Score = 228 bits (580), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 115/308 (37%), Positives = 175/308 (56%), Gaps = 17/308 (5%)
Query: 13 RANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFK 72
+ WT DQYF+E+M+ + RGNKTG FS++AW DM+ FN F +Y VL++R+
Sbjct: 169 KTEWTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRHRYN 228
Query: 73 RFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCI 132
+ K Y +++ I+ + GF WD MI A + WD YIKDHP AR +R + +P YND+
Sbjct: 229 KLLKYYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLPSYNDLDT 288
Query: 133 IYGHAVADGRYSLSCFDVDFEYEDIASKELDDQTTPSKGVDDQTPPTVINQSKIDWSPMM 192
I+ G ++ D S QT+ +K +Q ++++I W+P M
Sbjct: 289 IFACQAEQGT----------DHRDDGSAA---QTSETKASQEQNS----DRTRIFWTPPM 331
Query: 193 DHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRLNVLIRHYCS 252
D+ ++L+V+QV GN++G++F AW +M +FN +F S + K VLKNR L R Y
Sbjct: 332 DYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFGSQHNKDVLKNRYKHLRRLYND 391
Query: 253 INALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCIICRDEATA 312
I LL + GFSWD R+ V+ADD +W I+ + R YR+K++P Y +C I E +
Sbjct: 392 IKFLLEQNGFSWDARRDMVIADDDIWNTYIQAHPEARSYRVKTIPSYPNLCFIFGKETSD 451
Query: 313 GCRSNLEK 320
G + L +
Sbjct: 452 GRYTRLAQ 459
Score = 221 bits (562), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 107/306 (34%), Positives = 178/306 (58%), Gaps = 12/306 (3%)
Query: 10 DNFRANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKN 69
D R WTP D + ++L++ V+ GN+ G+ F AW +M+ FN FG +++ DVLKN
Sbjct: 321 DRTRIFWTPPMDYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFGSQHNKDVLKN 380
Query: 70 RFKRFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYND 129
R+K R+ Y +IK ++ Q GF WD +M++A + W+ YI+ HP AR++R + +P Y +
Sbjct: 381 RYKHLRRLYNDIKFLLEQNGFSWDARRDMVIADDDIWNTYIQAHPEARSYRVKTIPSYPN 440
Query: 130 MCIIYGHAVADGRYS--LSCFDVDFEY-----EDIASKELDDQTTPSKGV-----DDQTP 177
+C I+G +DGRY+ FD E ++ D + K V + P
Sbjct: 441 LCFIFGKETSDGRYTRLAQAFDPSPAETVRMNESGSTDGFKDTRSFQKVVYTSNEKNDYP 500
Query: 178 PTVINQSKIDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKV 237
+ I I+W+ +MDH ++LM++QV +GNKIG +F ++AW DM ESFN +F
Sbjct: 501 CSNIGPPCIEWTRVMDHCLIDLMLEQVSRGNKIGETFTEQAWADMAESFNAKFGLQTDMF 560
Query: 238 VLKNRLNVLIRHYCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMP 297
+L+NR +L++ IN +L +GF+WD +Q +VA+D+ W+ I+ + + +Y+ K++
Sbjct: 561 MLENRYILLMKERDDINNILNLDGFTWDVEKQTIVAEDEYWEAYIKEHPDATIYKGKTLD 620
Query: 298 FYSGMC 303
Y +C
Sbjct: 621 SYGNLC 626
Score = 112 bits (279), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 60/172 (34%), Positives = 97/172 (56%), Gaps = 9/172 (5%)
Query: 16 WTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFKRFR 75
WT D ++LML V RGNK G+ F+ +AWADM E FN FGL+ D+ +L+NR+
Sbjct: 511 WTRVMDHCLIDLMLEQVSRGNKIGETFTEQAWADMAESFNAKFGLQTDMFMLENRYILLM 570
Query: 76 KQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCIIYG 135
K+ +I I++ GF WD IVA ++ W+ YIK+HP A ++ + + Y ++C +
Sbjct: 571 KERDDINNILNLDGFTWDVEKQTIVAEDEYWEAYIKEHPDATIYKGKTLDSYGNLCKLNE 630
Query: 136 HAVADGRYSLSCFDVDFEYEDIASKE--LDDQTTPSKGVDDQ----TPPTVI 181
H + S +C ++ E E+ ++ +DD ++P K + + TPP I
Sbjct: 631 HLSQE---SFNCENLMIELENYGNEMEIVDDFSSPHKQQNKRPNPITPPLGI 679
Score = 106 bits (264), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 53/155 (34%), Positives = 88/155 (56%), Gaps = 4/155 (2%)
Query: 179 TVINQSKIDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVV 238
T ++++ W+P M+ FF++LM++ + +GN+ G +F+K+AW +M FN +F S Y K V
Sbjct: 6 TCNDRTRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQYDKDV 65
Query: 239 LKNRLNVLIRHYCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPF 298
LK+R L + Y + LL GF WD+ Q V+ DD +W ++ + R+Y+ K +
Sbjct: 66 LKSRYTNLWKQYNDVKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTKPVLN 125
Query: 299 YSGMCIICRDEATAGCRS----NLEKESPIGEKSV 329
+S +C+I G S +LE E I +SV
Sbjct: 126 FSDLCLIYGYTVADGRYSMSSHDLEIEDEINGESV 160
>AT2G24960.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 1453 Blast hits to 509 proteins
in 26 species: Archae - 0; Bacteria - 0; Metazoa - 1;
Fungi - 39; Plants - 1363; Viruses - 0; Other Eukaryotes
- 50 (source: NCBI BLink). | chr2:10617263-10620034
FORWARD LENGTH=797
Length = 797
Score = 261 bits (667), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 122/304 (40%), Positives = 191/304 (62%), Gaps = 15/304 (4%)
Query: 10 DNFRANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKN 69
D R WTP+ +++F++LML H+HRGN+TG F+++AW +M+ FN+ FG +YD DVLK+
Sbjct: 9 DRTRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQYDKDVLKS 68
Query: 70 RFKRFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYND 129
R+ KQY ++K ++ GF WD ++ + W Y+K HP AR ++T+ V ++D
Sbjct: 69 RYTNLWKQYNDVKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTKPVLNFSD 128
Query: 130 MCIIYGHAVADGRYSLSCFDVDFEYEDIASKELDDQTTPSKGVDDQTPPTVINQSKIDWS 189
+C+IYG+ VADGRYS+S D++ E E++ ++ G + SK +W+
Sbjct: 129 LCLIYGYTVADGRYSMSSHDLEIE------DEINGESVVLSGKE---------SSKTEWT 173
Query: 190 PMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRLNVLIRH 249
MD +FVE+MVDQ+ +GNK G +F K+AW+DM FN RF Y K VL++R N L+++
Sbjct: 174 LEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRHRYNKLLKY 233
Query: 250 YCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCIICRDE 309
Y + A+L ++GFSWD+ + + ADD VW I+ + R YR+KS+P Y+ + I +
Sbjct: 234 YKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLPSYNDLDTIFACQ 293
Query: 310 ATAG 313
A G
Sbjct: 294 AEQG 297
Score = 217 bits (553), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 116/331 (35%), Positives = 177/331 (53%), Gaps = 40/331 (12%)
Query: 13 RANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFK 72
+ WT DQYF+E+M+ + RGNKTG FS++AW DM+ FN F +Y VL++R+
Sbjct: 169 KTEWTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRHRYN 228
Query: 73 RFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCI 132
+ K Y +++ I+ + GF WD MI A + WD YIKDHP AR +R + +P YND+
Sbjct: 229 KLLKYYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLPSYNDLDT 288
Query: 133 IYGHAVADGRYSLSCFDVDFEYEDIASKELDDQTTPSKGVDDQTPPTVINQSKIDWSPMM 192
I+ G ++ D S QT+ +K +Q ++++I W+P M
Sbjct: 289 IFACQAEQGT----------DHRDDGSAA---QTSETKASQEQNS----DRTRIFWTPPM 331
Query: 193 DHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRLNVLIRHYCS 252
D+ ++L+V+QV GN++G++F AW +M +FN +F S + K VLKNR L R Y
Sbjct: 332 DYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFGSQHNKDVLKNRYKHLRRLYND 391
Query: 253 INALLGKEGFSWDKRQQKVVADDQVWQK------------------CIRVNH-----NFR 289
I LL + GFSWD R+ V+ADD +W C+++ H R
Sbjct: 392 IKFLLEQNGFSWDARRDMVIADDDIWNTYIQACHILFLFKISVICLCLQMKHVQAHPEAR 451
Query: 290 LYRIKSMPFYSGMCIICRDEATAGCRSNLEK 320
YR+K++P Y +C I E + G + L +
Sbjct: 452 SYRVKTIPSYPNLCFIFGKETSDGRYTRLAQ 482
Score = 207 bits (527), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 107/329 (32%), Positives = 178/329 (54%), Gaps = 35/329 (10%)
Query: 10 DNFRANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKN 69
D R WTP D + ++L++ V+ GN+ G+ F AW +M+ FN FG +++ DVLKN
Sbjct: 321 DRTRIFWTPPMDYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFGSQHNKDVLKN 380
Query: 70 RFKRFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKD----------------- 112
R+K R+ Y +IK ++ Q GF WD +M++A + W+ YI+
Sbjct: 381 RYKHLRRLYNDIKFLLEQNGFSWDARRDMVIADDDIWNTYIQACHILFLFKISVICLCLQ 440
Query: 113 ------HPSARAFRTRVVPYYNDMCIIYGHAVADGRYS--LSCFDVDFEY-----EDIAS 159
HP AR++R + +P Y ++C I+G +DGRY+ FD E ++
Sbjct: 441 MKHVQAHPEARSYRVKTIPSYPNLCFIFGKETSDGRYTRLAQAFDPSPAETVRMNESGST 500
Query: 160 KELDDQTTPSKGV-----DDQTPPTVINQSKIDWSPMMDHFFVELMVDQVRKGNKIGRSF 214
D + K V + P + I I+W+ +MDH ++LM++QV +GNKIG +F
Sbjct: 501 DGFKDTRSFQKVVYTSNEKNDYPCSNIGPPCIEWTRVMDHCLIDLMLEQVSRGNKIGETF 560
Query: 215 DKKAWVDMTESFNDRFESHYCKVVLKNRLNVLIRHYCSINALLGKEGFSWDKRQQKVVAD 274
++AW DM ESFN +F +L+NR +L++ IN +L +GF+WD +Q +VA+
Sbjct: 561 TEQAWADMAESFNAKFGLQTDMFMLENRYILLMKERDDINNILNLDGFTWDVEKQTIVAE 620
Query: 275 DQVWQKCIRVNHNFRLYRIKSMPFYSGMC 303
D+ W+ I+ + + +Y+ K++ Y +C
Sbjct: 621 DEYWEAYIKEHPDATIYKGKTLDSYGNLC 649
Score = 112 bits (279), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 60/172 (34%), Positives = 97/172 (56%), Gaps = 9/172 (5%)
Query: 16 WTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFKRFR 75
WT D ++LML V RGNK G+ F+ +AWADM E FN FGL+ D+ +L+NR+
Sbjct: 534 WTRVMDHCLIDLMLEQVSRGNKIGETFTEQAWADMAESFNAKFGLQTDMFMLENRYILLM 593
Query: 76 KQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCIIYG 135
K+ +I I++ GF WD IVA ++ W+ YIK+HP A ++ + + Y ++C +
Sbjct: 594 KERDDINNILNLDGFTWDVEKQTIVAEDEYWEAYIKEHPDATIYKGKTLDSYGNLCKLNE 653
Query: 136 HAVADGRYSLSCFDVDFEYEDIASKE--LDDQTTPSKGVDDQ----TPPTVI 181
H + S +C ++ E E+ ++ +DD ++P K + + TPP I
Sbjct: 654 HLSQE---SFNCENLMIELENYGNEMEIVDDFSSPHKQQNKRPNPITPPLGI 702
Score = 105 bits (263), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 53/155 (34%), Positives = 88/155 (56%), Gaps = 4/155 (2%)
Query: 179 TVINQSKIDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVV 238
T ++++ W+P M+ FF++LM++ + +GN+ G +F+K+AW +M FN +F S Y K V
Sbjct: 6 TCNDRTRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQYDKDV 65
Query: 239 LKNRLNVLIRHYCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPF 298
LK+R L + Y + LL GF WD+ Q V+ DD +W ++ + R+Y+ K +
Sbjct: 66 LKSRYTNLWKQYNDVKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTKPVLN 125
Query: 299 YSGMCIICRDEATAGCRS----NLEKESPIGEKSV 329
+S +C+I G S +LE E I +SV
Sbjct: 126 FSDLCLIYGYTVADGRYSMSSHDLEIEDEINGESV 160
>AT4G02210.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 22 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G24960.2). | chr4:974320-975917 REVERSE
LENGTH=439
Length = 439
Score = 227 bits (579), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 110/303 (36%), Positives = 172/303 (56%), Gaps = 8/303 (2%)
Query: 10 DNFRANWTPSQDQYFLELMLSHVHRGNK-TGKVFSRRAWADMIEQFNTTFGLKYDIDVLK 68
+ R WTP DQYF+ELM+ V +GN+ +FS+RAW M F F Y DVLK
Sbjct: 8 ERLRTVWTPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKFLYGKDVLK 67
Query: 69 NRFKRFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYN 128
NR K R + + ++ + GF WD+ M+VA WDEY+K HP +R+FR + +P Y
Sbjct: 68 NRHKTLRNLFKSVNNLLIEDGFSWDDTRQMVVADNCVWDEYLKIHPDSRSFRIKSIPCYK 127
Query: 129 DMCIIYGHAVADGRYSLSCFDVDFEYEDIASKELDD---QTTPSKGVDDQTPPTVINQSK 185
D+C++Y +++ + S E E + DD + S V + + + + +
Sbjct: 128 DLCLVYSDGMSEHKAEESIS----EGESKTLIQEDDGYNRICESSTVRSNSKGSSVTRCR 183
Query: 186 IDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRLNV 245
W P MD +F++LM+DQ R+GN+I F K+AW +M FN +FES++ VLKNR
Sbjct: 184 TTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNRYKS 243
Query: 246 LIRHYCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCII 305
L R + +I ++L +GF+WD +Q V AD+ VWQ I+ + + R + + +P+Y +C++
Sbjct: 244 LRRQFNAIKSILRSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKDLCVL 303
Query: 306 CRD 308
C D
Sbjct: 304 CGD 306
Score = 136 bits (343), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 63/169 (37%), Positives = 96/169 (56%), Gaps = 3/169 (1%)
Query: 8 SLDNFRANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVL 67
S+ R W P D+YF++LML RGN+ VF ++AW +M+ FN F +D+DVL
Sbjct: 178 SVTRCRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVL 237
Query: 68 KNRFKRFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYY 127
KNR+K R+Q+ IK+I+ GF WDN M+ A W +YIK H AR F TR +PYY
Sbjct: 238 KNRYKSLRRQFNAIKSILRSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYY 297
Query: 128 NDMCIIYGHAVADGR---YSLSCFDVDFEYEDIASKELDDQTTPSKGVD 173
D+C++ G + + ++ FD + E+++ S D + ++ D
Sbjct: 298 KDLCVLCGDSGIEENECFVAMDWFDPETEFQEFKSSGTTDLSISAEEED 346
>AT4G02210.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G24960.2); Has 791 Blast hits to 465 proteins
in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 17; Plants - 748; Viruses - 0; Other Eukaryotes
- 26 (source: NCBI BLink). | chr4:974320-975917 REVERSE
LENGTH=439
Length = 439
Score = 227 bits (579), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 110/303 (36%), Positives = 172/303 (56%), Gaps = 8/303 (2%)
Query: 10 DNFRANWTPSQDQYFLELMLSHVHRGNK-TGKVFSRRAWADMIEQFNTTFGLKYDIDVLK 68
+ R WTP DQYF+ELM+ V +GN+ +FS+RAW M F F Y DVLK
Sbjct: 8 ERLRTVWTPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKFLYGKDVLK 67
Query: 69 NRFKRFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYN 128
NR K R + + ++ + GF WD+ M+VA WDEY+K HP +R+FR + +P Y
Sbjct: 68 NRHKTLRNLFKSVNNLLIEDGFSWDDTRQMVVADNCVWDEYLKIHPDSRSFRIKSIPCYK 127
Query: 129 DMCIIYGHAVADGRYSLSCFDVDFEYEDIASKELDD---QTTPSKGVDDQTPPTVINQSK 185
D+C++Y +++ + S E E + DD + S V + + + + +
Sbjct: 128 DLCLVYSDGMSEHKAEESIS----EGESKTLIQEDDGYNRICESSTVRSNSKGSSVTRCR 183
Query: 186 IDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRLNV 245
W P MD +F++LM+DQ R+GN+I F K+AW +M FN +FES++ VLKNR
Sbjct: 184 TTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNRYKS 243
Query: 246 LIRHYCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCII 305
L R + +I ++L +GF+WD +Q V AD+ VWQ I+ + + R + + +P+Y +C++
Sbjct: 244 LRRQFNAIKSILRSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKDLCVL 303
Query: 306 CRD 308
C D
Sbjct: 304 CGD 306
Score = 136 bits (343), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 63/169 (37%), Positives = 96/169 (56%), Gaps = 3/169 (1%)
Query: 8 SLDNFRANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVL 67
S+ R W P D+YF++LML RGN+ VF ++AW +M+ FN F +D+DVL
Sbjct: 178 SVTRCRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVL 237
Query: 68 KNRFKRFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYY 127
KNR+K R+Q+ IK+I+ GF WDN M+ A W +YIK H AR F TR +PYY
Sbjct: 238 KNRYKSLRRQFNAIKSILRSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYY 297
Query: 128 NDMCIIYGHAVADGR---YSLSCFDVDFEYEDIASKELDDQTTPSKGVD 173
D+C++ G + + ++ FD + E+++ S D + ++ D
Sbjct: 298 KDLCVLCGDSGIEENECFVAMDWFDPETEFQEFKSSGTTDLSISAEEED 346
>AT5G05800.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 24 plant
structures; EXPRESSED DURING: 15 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT3G11290.1); Has 881 Blast hits to 512 proteins
in 30 species: Archae - 0; Bacteria - 2; Metazoa - 0;
Fungi - 38; Plants - 833; Viruses - 0; Other Eukaryotes
- 8 (source: NCBI BLink). | chr5:1743234-1744751 REVERSE
LENGTH=449
Length = 449
Score = 96.7 bits (239), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 69/302 (22%), Positives = 124/302 (41%), Gaps = 27/302 (8%)
Query: 13 RANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFK 72
+A W P + F++L + GNK G FS+ W +++ F G YD LKN +
Sbjct: 4 KAVWEPEYHRVFVDLCVEQTMLGNKPGTHFSKEGWRNILISFQEQTGAMYDRMQLKNHWD 63
Query: 73 RFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCI 132
+Q+ + ++ W+ N A + W Y++++P A +R V + I
Sbjct: 64 TMSRQWKIWRRLVETSFMNWNPESNRFRATDDDWANYLQENPDAGQYRLSVPHDLKKLEI 123
Query: 133 IYGHAVAD---------GRYSLSCFDVDFEYEDIASKELDDQTTPSKGVDDQTPPTVINQ 183
++ + + SC +E ED ++ + + P Q
Sbjct: 124 LFAGCNVEVKNDEVSGVRKRRRSC----YEEEDEDNQSMCSSSNP--------------Q 165
Query: 184 SKIDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRL 243
+K WSP F++L+V + KGN+ F+K+ W + + N+ Y + LKN
Sbjct: 166 TKGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGYTRPQLKNHW 225
Query: 244 NVLIRHYCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPFYSGMC 303
+ + + L+G WD + A ++ W+ IR N +R K +P +
Sbjct: 226 DCTRKAWKIWCQLVGASSMKWDPESRSFGATEEEWRIYIRENPRAGQFRHKEVPHADQLA 285
Query: 304 II 305
II
Sbjct: 286 II 287
Score = 71.6 bits (174), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 62/129 (48%)
Query: 13 RANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFK 72
+ W+PS + FL+L++ +GN+ F++ W ++ N GL Y LKN +
Sbjct: 167 KGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGYTRPQLKNHWD 226
Query: 73 RFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCI 132
RK + ++ +WD A E+ W YI+++P A FR + VP+ + + I
Sbjct: 227 CTRKAWKIWCQLVGASSMKWDPESRSFGATEEEWRIYIRENPRAGQFRHKEVPHADQLAI 286
Query: 133 IYGHAVADG 141
I+ + G
Sbjct: 287 IFNGVIEPG 295
>AT5G05800.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11290.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:1743234-1744751
REVERSE LENGTH=449
Length = 449
Score = 96.7 bits (239), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 69/302 (22%), Positives = 124/302 (41%), Gaps = 27/302 (8%)
Query: 13 RANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFK 72
+A W P + F++L + GNK G FS+ W +++ F G YD LKN +
Sbjct: 4 KAVWEPEYHRVFVDLCVEQTMLGNKPGTHFSKEGWRNILISFQEQTGAMYDRMQLKNHWD 63
Query: 73 RFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCI 132
+Q+ + ++ W+ N A + W Y++++P A +R V + I
Sbjct: 64 TMSRQWKIWRRLVETSFMNWNPESNRFRATDDDWANYLQENPDAGQYRLSVPHDLKKLEI 123
Query: 133 IYGHAVAD---------GRYSLSCFDVDFEYEDIASKELDDQTTPSKGVDDQTPPTVINQ 183
++ + + SC +E ED ++ + + P Q
Sbjct: 124 LFAGCNVEVKNDEVSGVRKRRRSC----YEEEDEDNQSMCSSSNP--------------Q 165
Query: 184 SKIDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRL 243
+K WSP F++L+V + KGN+ F+K+ W + + N+ Y + LKN
Sbjct: 166 TKGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGYTRPQLKNHW 225
Query: 244 NVLIRHYCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPFYSGMC 303
+ + + L+G WD + A ++ W+ IR N +R K +P +
Sbjct: 226 DCTRKAWKIWCQLVGASSMKWDPESRSFGATEEEWRIYIRENPRAGQFRHKEVPHADQLA 285
Query: 304 II 305
II
Sbjct: 286 II 287
Score = 71.6 bits (174), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 62/129 (48%)
Query: 13 RANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFK 72
+ W+PS + FL+L++ +GN+ F++ W ++ N GL Y LKN +
Sbjct: 167 KGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGYTRPQLKNHWD 226
Query: 73 RFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCI 132
RK + ++ +WD A E+ W YI+++P A FR + VP+ + + I
Sbjct: 227 CTRKAWKIWCQLVGASSMKWDPESRSFGATEEEWRIYIRENPRAGQFRHKEVPHADQLAI 286
Query: 133 IYGHAVADG 141
I+ + G
Sbjct: 287 IFNGVIEPG 295
>AT3G11290.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11310.1); Has 720 Blast hits to 435 proteins
in 28 species: Archae - 0; Bacteria - 2; Metazoa - 0;
Fungi - 32; Plants - 682; Viruses - 0; Other Eukaryotes
- 4 (source: NCBI BLink). | chr3:3535766-3537295 REVERSE
LENGTH=460
Length = 460
Score = 82.0 bits (201), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 64/302 (21%), Positives = 120/302 (39%), Gaps = 13/302 (4%)
Query: 13 RANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFK 72
+A W P + F++L + GN+ G +++ F G ++ + LKN +
Sbjct: 4 KAAWEPEYHRVFVDLCVEQKMLGNQPGT-------QHILKPFLQRTGARFTRNQLKNHWD 56
Query: 73 RFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCI 132
KQ+ ++ QWD N A ++ W Y+ +P A +R + + +
Sbjct: 57 TMIKQWKIWCRLVQCSDMQWDPQTNTFGANDQDWANYLHVNPEAGQYRLNPPSFLEKLEL 116
Query: 133 IYGHAVADGRYSLSCFDVDFEYEDIASKELDDQTTPSKGVDDQTPPTVIN-QSKIDWSPM 191
I+ + D + + + IA +D D Q+ + QSK WSP
Sbjct: 117 IFEDSNLDDEGTSGS-----KRKRIAKHRDEDNDNTGDEEDTQSASNFSSPQSKGYWSPS 171
Query: 192 MDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRLNVLIRHYC 251
FV+L+ + KGN+ + K+ W + E+ N + + LKN + + +
Sbjct: 172 SHELFVDLLFQEALKGNRPDSHYPKETWKMILETINQNTGKSFTRPQLKNHWDCTRKSWK 231
Query: 252 SINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCIICRDEAT 311
++G WD + A D+ W+ ++ NH +R K +P + I +
Sbjct: 232 IWCQVIGAPVMKWDATSRTFGATDEDWKNYLKENHRAAPFRRKQLPHADKLATIFKGLIE 291
Query: 312 AG 313
G
Sbjct: 292 PG 293
Score = 63.9 bits (154), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 33/145 (22%), Positives = 66/145 (45%), Gaps = 5/145 (3%)
Query: 3 DDDSVSLDNF-----RANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTT 57
++D+ S NF + W+PS + F++L+ +GN+ + + W ++E N
Sbjct: 150 EEDTQSASNFSSPQSKGYWSPSSHELFVDLLFQEALKGNRPDSHYPKETWKMILETINQN 209
Query: 58 FGLKYDIDVLKNRFKRFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSAR 117
G + LKN + RK + +I +WD A ++ W Y+K++ A
Sbjct: 210 TGKSFTRPQLKNHWDCTRKSWKIWCQVIGAPVMKWDATSRTFGATDEDWKNYLKENHRAA 269
Query: 118 AFRTRVVPYYNDMCIIYGHAVADGR 142
FR + +P+ + + I+ + G+
Sbjct: 270 PFRRKQLPHADKLATIFKGLIEPGK 294
>AT4G02550.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 370 Blast hits to 300 proteins
in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 10; Plants - 354; Viruses - 0; Other Eukaryotes
- 6 (source: NCBI BLink). | chr4:1120622-1121629 REVERSE
LENGTH=307
Length = 307
Score = 71.2 bits (173), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 71/154 (46%), Gaps = 1/154 (0%)
Query: 16 WTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFKRFR 75
W+ D+ +E + GNK K F+ +A+ NT F L NR K +
Sbjct: 22 WSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIK 81
Query: 76 KQYIEIKTIISQKGFQWDNALNMI-VAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCIIY 134
K+Y ++ I+S+ GF W+++ MI ++ W YI +P A+AFR + + Y ++ +
Sbjct: 82 KRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVC 141
Query: 135 GHAVADGRYSLSCFDVDFEYEDIASKELDDQTTP 168
G G+Y+ + D+ E D + P
Sbjct: 142 GDYQTPGKYNKVKKESSHHLNDVKQFEEDSVSFP 175
Score = 71.2 bits (173), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 38/138 (27%), Positives = 70/138 (50%), Gaps = 1/138 (0%)
Query: 186 IDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRLNV 245
+ WS MD +E + Q + GNK+ + F+ KA+ + N RF + NRL
Sbjct: 20 VIWSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKT 79
Query: 246 LIRHYCSINALLGKEGFSWDKRQQKV-VADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCI 304
+ + Y + +L ++GF W+ + + D++W++ I VN + + +R K + Y +
Sbjct: 80 IKKRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRT 139
Query: 305 ICRDEATAGCRSNLEKES 322
+C D T G + ++KES
Sbjct: 140 VCGDYQTPGKYNKVKKES 157
>AT4G02550.3 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr4:1120622-1121674 REVERSE LENGTH=322
Length = 322
Score = 71.2 bits (173), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 71/154 (46%), Gaps = 1/154 (0%)
Query: 16 WTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFKRFR 75
W+ D+ +E + GNK K F+ +A+ NT F L NR K +
Sbjct: 37 WSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIK 96
Query: 76 KQYIEIKTIISQKGFQWDNALNMI-VAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCIIY 134
K+Y ++ I+S+ GF W+++ MI ++ W YI +P A+AFR + + Y ++ +
Sbjct: 97 KRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVC 156
Query: 135 GHAVADGRYSLSCFDVDFEYEDIASKELDDQTTP 168
G G+Y+ + D+ E D + P
Sbjct: 157 GDYQTPGKYNKVKKESSHHLNDVKQFEEDSVSFP 190
Score = 70.9 bits (172), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 38/138 (27%), Positives = 70/138 (50%), Gaps = 1/138 (0%)
Query: 186 IDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRLNV 245
+ WS MD +E + Q + GNK+ + F+ KA+ + N RF + NRL
Sbjct: 35 VIWSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKT 94
Query: 246 LIRHYCSINALLGKEGFSWDKRQQKV-VADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCI 304
+ + Y + +L ++GF W+ + + D++W++ I VN + + +R K + Y +
Sbjct: 95 IKKRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRT 154
Query: 305 ICRDEATAGCRSNLEKES 322
+C D T G + ++KES
Sbjct: 155 VCGDYQTPGKYNKVKKES 172
>AT2G19220.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11290.1); Has 443 Blast hits to 267 proteins
in 21 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 17; Plants - 426; Viruses - 0; Other Eukaryotes
- 0 (source: NCBI BLink). | chr2:8340678-8342161 REVERSE
LENGTH=439
Length = 439
Score = 68.2 bits (165), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 66/307 (21%), Positives = 123/307 (40%), Gaps = 23/307 (7%)
Query: 13 RANWTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFK 72
+A W P D+ F++L + GN+ A+ +M G+++ ID L N +
Sbjct: 4 KAAWEPEHDEVFVDLCVEQKMLGNQPEMQHILEAFQEM--------GVRFTIDQLINHWD 55
Query: 73 RFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCI 132
KQ+ ++ K +WD+ N A ++ W Y++ +P A +R + + I
Sbjct: 56 TMIKQWKIWCRLVQCKDIKWDSLTNTFGATDQEWANYLEVNPEAGQYRCNPPLFLEKLEI 115
Query: 133 IYGHAVADGRYSLSCFDVDFEYEDIASKELDDQTTPSKGVDDQTPPTVINQSKIDWSPMM 192
I+ DG + S + + I E D+ + V + + + WSP
Sbjct: 116 IFAGMNLDGEGTSS----GSKMKQIC--EHRDEENVTGYVPRLSASDIATRRHYKWSPSS 169
Query: 193 DHFFVELMVDQVRKGNK-IGRS--FDKKAWVDMTESFNDRFESHYCKVVLKN---RLNVL 246
V+ + KG + I R+ F K++W + E N Y L+N R
Sbjct: 170 HAIVVDTCFQESLKGIRPIKRNHLFTKESWKMILEKINRITGLGYTHKQLENHFTRTRTS 229
Query: 247 IRHYCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCIIC 306
+H+C + WD +K A ++ W K + +N R+++ + +P + I
Sbjct: 230 WKHWCET---IASPIMKWDANTRKFGATEEDWDKYLMINKRARVFKRRHIPHADKLATIF 286
Query: 307 RDEATAG 313
+ G
Sbjct: 287 KGRIEPG 293
Score = 61.2 bits (147), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 43/172 (25%), Positives = 72/172 (41%), Gaps = 12/172 (6%)
Query: 16 WTPSQDQYFLELMLSHVHRGNKTGK---VFSRRAWADMIEQFNTTFGLKYDIDVLKNRFK 72
W+PS ++ +G + K +F++ +W ++E+ N GL Y L+N F
Sbjct: 165 WSPSSHAIVVDTCFQESLKGIRPIKRNHLFTKESWKMILEKINRITGLGYTHKQLENHFT 224
Query: 73 RFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCI 132
R R + I+ +WD A E+ WD+Y+ + AR F+ R +P+ + +
Sbjct: 225 RTRTSWKHWCETIASPIMKWDANTRKFGATEEDWDKYLMINKRARVFKRRHIPHADKLAT 284
Query: 133 IYGHAVADG-----RYSLSCFDVDFEYEDIASKELDDQTTPSKGVDDQTPPT 179
I+ + G RY D E + D Q TPS V + P
Sbjct: 285 IFKGRIEPGKTKTRRYRKRVIDHHSESPQLH----DHQPTPSSVVVNTNEPV 332
>AT4G02550.4 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 18 plant
structures; EXPRESSED DURING: 7 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G02210.2). | chr4:1120622-1121629 REVERSE
LENGTH=278
Length = 278
Score = 67.8 bits (164), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 60/121 (49%), Gaps = 1/121 (0%)
Query: 16 WTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFKRFR 75
W+ D+ +E + GNK K F+ +A+ NT F L NR K +
Sbjct: 22 WSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIK 81
Query: 76 KQYIEIKTIISQKGFQWDNALNMI-VAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCIIY 134
K+Y ++ I+S+ GF W+++ MI ++ W YI +P A+AFR + + Y ++ +
Sbjct: 82 KRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVC 141
Query: 135 G 135
G
Sbjct: 142 G 142
Score = 67.4 bits (163), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 64/129 (49%), Gaps = 1/129 (0%)
Query: 186 IDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRLNV 245
+ WS MD +E + Q + GNK+ + F+ KA+ + N RF + NRL
Sbjct: 20 VIWSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKT 79
Query: 246 LIRHYCSINALLGKEGFSWDKRQQKV-VADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCI 304
+ + Y + +L ++GF W+ + + D++W++ I VN + + +R K + Y +
Sbjct: 80 IKKRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRT 139
Query: 305 ICRDEATAG 313
+C D T G
Sbjct: 140 VCGDYQTPG 148
>AT4G02550.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 350 Blast hits to 284 proteins
in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 13; Plants - 331; Viruses - 0; Other Eukaryotes
- 6 (source: NCBI BLink). | chr4:1120622-1121629 REVERSE
LENGTH=278
Length = 278
Score = 67.8 bits (164), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 60/121 (49%), Gaps = 1/121 (0%)
Query: 16 WTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFKRFR 75
W+ D+ +E + GNK K F+ +A+ NT F L NR K +
Sbjct: 22 WSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIK 81
Query: 76 KQYIEIKTIISQKGFQWDNALNMI-VAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCIIY 134
K+Y ++ I+S+ GF W+++ MI ++ W YI +P A+AFR + + Y ++ +
Sbjct: 82 KRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVC 141
Query: 135 G 135
G
Sbjct: 142 G 142
Score = 67.4 bits (163), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 64/129 (49%), Gaps = 1/129 (0%)
Query: 186 IDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLKNRLNV 245
+ WS MD +E + Q + GNK+ + F+ KA+ + N RF + NRL
Sbjct: 20 VIWSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKT 79
Query: 246 LIRHYCSINALLGKEGFSWDKRQQKV-VADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCI 304
+ + Y + +L ++GF W+ + + D++W++ I VN + + +R K + Y +
Sbjct: 80 IKKRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRT 139
Query: 305 ICRDEATAG 313
+C D T G
Sbjct: 140 VCGDYQTPG 148
>AT3G11310.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11290.1); Has 575 Blast hits to 342 proteins
in 22 species: Archae - 0; Bacteria - 2; Metazoa - 0;
Fungi - 10; Plants - 559; Viruses - 0; Other Eukaryotes
- 4 (source: NCBI BLink). | chr3:3542536-3544333 REVERSE
LENGTH=539
Length = 539
Score = 65.9 bits (159), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 42/196 (21%), Positives = 86/196 (43%), Gaps = 10/196 (5%)
Query: 7 VSLDNFRANWTPSQDQYFLELMLSHVHRGNKTGKV-----FSRRAWADMIEQFNTTFGLK 61
+++ ++A W+ S + F++L+ + + N+ +++ W M+E FN GL+
Sbjct: 167 ITIPRYKAYWSSSSHEIFVDLLFTESLKENRPKPARRNGYYAKETWNMMVESFNQKTGLR 226
Query: 62 YDIDVLKNRFKRFRKQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRT 121
Y LKN + R + + +WD A + W+ Y K++ A FR
Sbjct: 227 YTRKQLKNHWNITRDAWRRWCQAVGSPLLKWDANTKTFGATSEDWENYSKENKRAEQFRL 286
Query: 122 RVVPYYNDMCIIYGHAVADGRYSLSCFDVDFEYEDIASKELDDQTTPSKGVD-DQTPPTV 180
+ +P+ + + II+ V G+ +L + + A + PS ++ +++ P
Sbjct: 287 KHIPHADKLAIIFKGHVEPGKTALRPYRKRVNHHSEAPQ----HPAPSSALNINESVPGS 342
Query: 181 INQSKIDWSPMMDHFF 196
+ D +MDH F
Sbjct: 343 EGGADDDHHIVMDHHF 358
Score = 65.5 bits (158), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 65/308 (21%), Positives = 112/308 (36%), Gaps = 19/308 (6%)
Query: 16 WTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFKRFR 75
W P + F++L + G + + R W E F G ++ D LKN +
Sbjct: 8 WEPELHKVFVDLCVEQKMLGFRLPGL--NRIW----ESFVQNTGARFTRDQLKNHWDTML 61
Query: 76 KQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARA--FRTRVVPYYNDMCII 133
+ + ++ +WD A + W Y + +P A+ FR+ P+ D+ +I
Sbjct: 62 RLWRAWCRLVECSEMKWDPQTKKFGASTEVWTNYFRVNPKAKQYRFRSSPPPFLKDLKMI 121
Query: 134 YGHAVADGRYSLSCFDVDFEYEDIASKELDDQTTPSKGVDDQTPPTVINQSKIDWSPMMD 193
+ SC + +D T I + K WS
Sbjct: 122 FEGTDLGDEEGTSCGKRKRIPDADNDTGDEDNDTGDDDNYTGDDDITIPRYKAYWSSSSH 181
Query: 194 HFFVELMVDQVRKGNKIGRS-----FDKKAWVDMTESFNDRFESHYCKVVLKNRLNVL-- 246
FV+L+ + K N+ + + K+ W M ESFN + Y + LKN N+
Sbjct: 182 EIFVDLLFTESLKENRPKPARRNGYYAKETWNMMVESFNQKTGLRYTRKQLKNHWNITRD 241
Query: 247 -IRHYCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMPFYSGMCII 305
R +C +G WD + A + W+ + N +R+K +P + II
Sbjct: 242 AWRRWCQA---VGSPLLKWDANTKTFGATSEDWENYSKENKRAEQFRLKHIPHADKLAII 298
Query: 306 CRDEATAG 313
+ G
Sbjct: 299 FKGHVEPG 306
Score = 49.3 bits (116), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 31/117 (26%), Positives = 54/117 (46%), Gaps = 6/117 (5%)
Query: 181 INQSKIDWSPMMDHFFVELMVDQVRKGNKIGRSFDKKAWVDMTESFNDRFESHYCKVVLK 240
+ + K+ W P + FV+L V+Q G ++ + W ESF + + + LK
Sbjct: 1 MTREKVMWEPELHKVFVDLCVEQKMLGFRLPGL--NRIW----ESFVQNTGARFTRDQLK 54
Query: 241 NRLNVLIRHYCSINALLGKEGFSWDKRQQKVVADDQVWQKCIRVNHNFRLYRIKSMP 297
N + ++R + + L+ WD + +K A +VW RVN + YR +S P
Sbjct: 55 NHWDTMLRLWRAWCRLVECSEMKWDPQTKKFGASTEVWTNYFRVNPKAKQYRFRSSP 111
>AT1G30140.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G27260.1); Has 313 Blast hits to 256 proteins
in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 8; Plants - 295; Viruses - 0; Other Eukaryotes -
10 (source: NCBI BLink). | chr1:10598764-10599527
FORWARD LENGTH=222
Length = 222
Score = 56.2 bits (134), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/137 (23%), Positives = 64/137 (46%), Gaps = 7/137 (5%)
Query: 16 WTPSQDQYFLELMLSHVHRGNKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFKRFR 75
WTP + +EL+ + + +G + + ++ N G + +R K +
Sbjct: 17 WTPDETDVLIELIRQNWR--DSSGIIGKLTVESKLLPALNKRLGCNKNHKNYMSRLKFLK 74
Query: 76 ---KQYIEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDMCI 132
+ Y+++K S GF WD A ++ W +Y+K HP+ + +T + ++ D+ I
Sbjct: 75 NLYQSYLDLKRFSS--GFGWDPETKKFTAPDEVWRDYLKAHPNHKHMQTESIDHFEDLQI 132
Query: 133 IYGHAVADGRYSLSCFD 149
I+G VA G +++ D
Sbjct: 133 IFGDVVATGSFAVGMSD 149
>AT5G27260.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G29880.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:9603943-9604930
FORWARD LENGTH=303
Length = 303
Score = 55.1 bits (131), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/139 (20%), Positives = 65/139 (46%), Gaps = 7/139 (5%)
Query: 16 WTPSQDQYFLELMLSHVHRG--NKTGKVFSRRAWADMIEQFNTTFGLKYDIDVLKNRFKR 73
W+P + + ++L++ ++ + G + + + N F + + +R K
Sbjct: 17 WSPEETKLLVQLLVEGINNNWRDSNGTISKLTVETKFMPEINKEFCRSKNYNHYLSRMKY 76
Query: 74 FRKQY---IEIKTIISQKGFQWDNALNMIVAGEKTWDEYIKDHPSARAFRTRVVPYYNDM 130
+ QY ++++ S GF WD A ++ W +Y+K HP+ + R +++++
Sbjct: 77 LKIQYQSCLDLQRFSS--GFGWDPLTKRFTASDEVWSDYLKAHPNNKQLRYDTFEFFDEL 134
Query: 131 CIIYGHAVADGRYSLSCFD 149
II+G VA G+ ++ D
Sbjct: 135 QIIFGEGVATGKNAIGLCD 153