Miyakogusa Predicted Gene
- Lj5g3v1473460.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v1473460.1 Non Characterized Hit- tr|G7JAN8|G7JAN8_MEDTR
Putative uncharacterized protein OS=Medicago
truncatul,23.2,7e-18,Myb_DNA-bind_3,Myb/SANT-like domain;
seg,NULL,CUFF.55289.1
(473 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                        (bits)  Value
AT2G24960.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    265  5e-71
AT2G24960.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    252  5e-67
AT4G02210.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    175  6e-44
AT4G02210.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    175  6e-44
AT5G05800.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...     79  7e-15
AT5G05800.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     79  7e-15
AT3G11290.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     63  3e-10
AT4G02550.4 | Symbols: | unknown protein; FUNCTIONS IN: molecul...     62  1e-09
AT4G02550.2 | Symbols: | unknown protein; BEST Arabidopsis thal...     62  1e-09
AT4G02550.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     62  1e-09
AT4G02550.3 | Symbols: | unknown protein; BEST Arabidopsis thal...     62  1e-09
AT3G11310.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     56  7e-08
AT2G19220.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     53  6e-07
>AT2G24960.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 21 plant
structures; EXPRESSED DURING: 12 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr2:10617263-10620034 FORWARD LENGTH=774
Length = 774
Score = 265 bits (677), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 167/534 (31%), Positives = 256/534 (47%), Gaps = 98/534 (18%)
Query: 16 TNWTPAMENYFIGLLLDQVHKGNK------------------------------------ 39
T WTP ME +FI L+L+ +H+GN+
Sbjct: 13 TYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQYDKDVLKSRYTN 72
Query: 40 ----FNDIKNLLDRNGFSWDETSRMVVASDRVWDAYIKVHPHAQAYRGKELVDIKDLCLV 95
+ND+K LLD GF WD+T + V+ D +W Y+K HP A+ Y+ K +++ DLCL+
Sbjct: 73 LWKQYNDVKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTKPVLNFSDLCLI 132
Query: 96 YAHERTDGRYSLSSHDVDFGDD---EQVVNTGSGREGVYHSSDGDEYVRGSWXXXXXXXX 152
Y + DGRYS+SSHD++ D+ E VV +G E + W
Sbjct: 133 YGYTVADGRYSMSSHDLEIEDEINGESVVLSGK------------ESSKTEWTLEMDQYF 180
Query: 153 XXXXXNQALKVNNSSHDFTFEAWCDIVTSFCVKFGSHYTKEDLKNRQKYLEKHFDDLKVL 212
+Q + N + + F+ +AW D++ F +F Y K L++R L K++ D++ +
Sbjct: 181 VEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRHRYNKLLKYYKDMEAI 240
Query: 213 TKQSGFAWDGKQEMVMAEDEVWNSYTKVHPDALLYRNKFVPIYHKLSLIYGGEFSEERXX 272
K+ GF+WD + M+ A+D VW+SY K HP A YR K +P Y+ L I+ + +
Sbjct: 241 LKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLPSYNDLDTIFACQAEQ---- 296
Query: 273 XXXXXXXXXXNGPISTIGVDEDIQDCAIDYFSRVDGTPYMDRYLIDLMVEEVRRRNKIDY 332
+G + + Q+ D +R+ TP MD +LIDL+VE+V N++
Sbjct: 297 ----GTDHRDDGSAAQTSETKASQEQNSDR-TRIFWTPPMDYHLIDLLVEQVNNGNRVGQ 351
Query: 333 VRNDQACLDMVVMFKERFGIQFDKNYLKHCCKGLEKLYHKMRSLLEERGFSWDETRQMIT 392
A +MV F +FG Q +K+ LK+ K L +LY+ ++ LLE+ GFSWD R M+
Sbjct: 352 TFITSAWNEMVTAFNAKFGSQHNKDVLKNRYKHLRRLYNDIKFLLEQNGFSWDARRDMVI 411
Query: 393 ACNGVWDAYIKEHPDANSYRNHQKPNYNDLCLIYG--SSDTELT---------------C 435
A + +W+ YI+ HP+A SYR P+Y +LC I+G +SD T
Sbjct: 412 ADDDIWNTYIQAHPEARSYRVKTIPSYPNLCFIFGKETSDGRYTRLAQAFDPSPAETVRM 471
Query: 436 NPANQNVGYNDCSIICQKLHWRSN----------------WTPPMDRYFMDLML 473
N + G+ D QK+ + SN WT MD +DLML
Sbjct: 472 NESGSTDGFKDTRSF-QKVVYTSNEKNDYPCSNIGPPCIEWTRVMDHCLIDLML 524
Score = 232 bits (592), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 151/470 (32%), Positives = 218/470 (46%), Gaps = 71/470 (15%)
Query: 14 SGTNWTPAMENYFIGLLLDQVHKGNK---------------------------------- 39
S T WT M+ YF+ +++DQ+ +GNK
Sbjct: 168 SKTEWTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRHRY 227
Query: 40 ------FNDIKNLLDRNGFSWDETSRMVVASDRVWDAYIKVHPHAQAYRGKELVDIKDLC 93
+ D++ +L +GFSWDET M+ A D VWD+YIK HP A+ YR K L DL
Sbjct: 228 NKLLKYYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLPSYNDLD 287
Query: 94 LVYAHERTDGRYSLSSHDVDFGDDEQVVNTGSGREGVYHSSDGDEYVRGSWXXXXXXXXX 153
++A + G D DD T + +SD R W
Sbjct: 288 TIFACQAEQG--------TDHRDDGSAAQTSETKASQEQNSD---RTRIFWTPPMDYHLI 336
Query: 154 XXXXNQALKVNNSSHDFTFEAWCDIVTSFCVKFGSHYTKEDLKNRQKYLEKHFDDLKVLT 213
Q N F AW ++VT+F KFGS + K+ LKNR K+L + ++D+K L
Sbjct: 337 DLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFGSQHNKDVLKNRYKHLRRLYNDIKFLL 396
Query: 214 KQSGFAWDGKQEMVMAEDEVWNSYTKVHPDALLYRNKFVPIYHKLSLIYGGEFSEERXXX 273
+Q+GF+WD +++MV+A+D++WN+Y + HP+A YR K +P Y L I+G E S+ R
Sbjct: 397 EQNGFSWDARRDMVIADDDIWNTYIQAHPEARSYRVKTIPSYPNLCFIFGKETSDGRYTR 456
Query: 274 XXXX------XXXXXNGPISTIGVDEDIQDCAIDYFSR--------------VDGTPYMD 313
N ST G + + Y S ++ T MD
Sbjct: 457 LAQAFDPSPAETVRMNESGSTDGFKDTRSFQKVVYTSNEKNDYPCSNIGPPCIEWTRVMD 516
Query: 314 RYLIDLMVEEVRRRNKIDYVRNDQACLDMVVMFKERFGIQFDKNYLKHCCKGLEKLYHKM 373
LIDLM+E+V R NKI +QA DM F +FG+Q D L++ L K +
Sbjct: 517 HCLIDLMLEQVSRGNKIGETFTEQAWADMAESFNAKFGLQTDMFMLENRYILLMKERDDI 576
Query: 374 RSLLEERGFSWDETRQMITACNGVWDAYIKEHPDANSYRNHQKPNYNDLC 423
++L GF+WD +Q I A + W+AYIKEHPDA Y+ +Y +LC
Sbjct: 577 NNILNLDGFTWDVEKQTIVAEDEYWEAYIKEHPDATIYKGKTLDSYGNLC 626
Score = 116 bits (291), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 59/175 (33%), Positives = 101/175 (57%), Gaps = 5/175 (2%)
Query: 304 SRVDGTPYMDRYLIDLMVEEVRRRNKIDYVRNDQACLDMVVMFKERFGIQFDKNYLKHCC 363
+R TP M+R+ IDLM+E + R N+ + N QA +M+ +F +FG Q+DK+ LK
Sbjct: 11 TRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQYDKDVLKSRY 70
Query: 364 KGLEKLYHKMRSLLEERGFSWDETRQMITACNGVWDAYIKEHPDANSYRNHQKPNYNDLC 423
L K Y+ ++ LL+ GF WD+T Q + + +W Y+K HP+A Y+ N++DLC
Sbjct: 71 TNLWKQYNDVKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTKPVLNFSDLC 130
Query: 424 LIYGSSDTELTCNPANQNVGYND-----CSIICQKLHWRSNWTPPMDRYFMDLML 473
LIYG + + + ++ ++ D ++ K ++ WT MD+YF+++M+
Sbjct: 131 LIYGYTVADGRYSMSSHDLEIEDEINGESVVLSGKESSKTEWTLEMDQYFVEIMV 185
Score = 65.1 bits (157), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 40/133 (30%), Positives = 57/133 (42%), Gaps = 41/133 (30%)
Query: 6 PRGNVNVPSGTNWTPAMENYFIGLLLDQVHKGNKF------------------------- 40
P N+ P WT M++ I L+L+QV +GNK
Sbjct: 500 PCSNIG-PPCIEWTRVMDHCLIDLMLEQVSRGNKIGETFTEQAWADMAESFNAKFGLQTD 558
Query: 41 ---------------NDIKNLLDRNGFSWDETSRMVVASDRVWDAYIKVHPHAQAYRGKE 85
+DI N+L+ +GF+WD + +VA D W+AYIK HP A Y+GK
Sbjct: 559 MFMLENRYILLMKERDDINNILNLDGFTWDVEKQTIVAEDEYWEAYIKEHPDATIYKGKT 618
Query: 86 LVDIKDLCLVYAH 98
L +LC + H
Sbjct: 619 LDSYGNLCKLNEH 631
>AT2G24960.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 1453 Blast hits to 509 proteins
in 26 species: Archae - 0; Bacteria - 0; Metazoa - 1;
Fungi - 39; Plants - 1363; Viruses - 0; Other Eukaryotes
- 50 (source: NCBI BLink). | chr2:10617263-10620034
FORWARD LENGTH=797
Length = 797
Score = 252 bits (643), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 167/557 (29%), Positives = 256/557 (45%), Gaps = 121/557 (21%)
Query: 16 TNWTPAMENYFIGLLLDQVHKGNK------------------------------------ 39
T WTP ME +FI L+L+ +H+GN+
Sbjct: 13 TYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQYDKDVLKSRYTN 72
Query: 40 ----FNDIKNLLDRNGFSWDETSRMVVASDRVWDAYIKVHPHAQAYRGKELVDIKDLCLV 95
+ND+K LLD GF WD+T + V+ D +W Y+K HP A+ Y+ K +++ DLCL+
Sbjct: 73 LWKQYNDVKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTKPVLNFSDLCLI 132
Query: 96 YAHERTDGRYSLSSHDVDFGDD---EQVVNTGSGREGVYHSSDGDEYVRGSWXXXXXXXX 152
Y + DGRYS+SSHD++ D+ E VV +G E + W
Sbjct: 133 YGYTVADGRYSMSSHDLEIEDEINGESVVLSGK------------ESSKTEWTLEMDQYF 180
Query: 153 XXXXXNQALKVNNSSHDFTFEAWCDIVTSFCVKFGSHYTKEDLKNRQKYLEKHFDDLKVL 212
+Q + N + + F+ +AW D++ F +F Y K L++R L K++ D++ +
Sbjct: 181 VEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRHRYNKLLKYYKDMEAI 240
Query: 213 TKQSGFAWDGKQEMVMAEDEVWNSYTKVHPDALLYRNKFVPIYHKLSLIYGGEFSEERXX 272
K+ GF+WD + M+ A+D VW+SY K HP A YR K +P Y+ L I+ + +
Sbjct: 241 LKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLPSYNDLDTIFACQAEQ---- 296
Query: 273 XXXXXXXXXXNGPISTIGVDEDIQDCAIDYFSRVDGTPYMDRYLIDLMVEEVRRRNKIDY 332
+G + + Q+ D +R+ TP MD +LIDL+VE+V N++
Sbjct: 297 ----GTDHRDDGSAAQTSETKASQEQNSDR-TRIFWTPPMDYHLIDLLVEQVNNGNRVGQ 351
Query: 333 VRNDQACLDMVVMFKERFGIQFDKNYLKHCCKGLEKLYHKMRSLLEERGFSWDETRQMIT 392
A +MV F +FG Q +K+ LK+ K L +LY+ ++ LLE+ GFSWD R M+
Sbjct: 352 TFITSAWNEMVTAFNAKFGSQHNKDVLKNRYKHLRRLYNDIKFLLEQNGFSWDARRDMVI 411
Query: 393 ACNGVWDAYI-----------------------KEHPDANSYRNHQKPNYNDLCLIYG-- 427
A + +W+ YI + HP+A SYR P+Y +LC I+G
Sbjct: 412 ADDDIWNTYIQACHILFLFKISVICLCLQMKHVQAHPEARSYRVKTIPSYPNLCFIFGKE 471
Query: 428 SSDTELT---------------CNPANQNVGYNDCSIICQKLHWRSN------------- 459
+SD T N + G+ D QK+ + SN
Sbjct: 472 TSDGRYTRLAQAFDPSPAETVRMNESGSTDGFKDTRSF-QKVVYTSNEKNDYPCSNIGPP 530
Query: 460 ---WTPPMDRYFMDLML 473
WT MD +DLML
Sbjct: 531 CIEWTRVMDHCLIDLML 547
Score = 219 bits (558), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 151/493 (30%), Positives = 218/493 (44%), Gaps = 94/493 (19%)
Query: 14 SGTNWTPAMENYFIGLLLDQVHKGNK---------------------------------- 39
S T WT M+ YF+ +++DQ+ +GNK
Sbjct: 168 SKTEWTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRHRY 227
Query: 40 ------FNDIKNLLDRNGFSWDETSRMVVASDRVWDAYIKVHPHAQAYRGKELVDIKDLC 93
+ D++ +L +GFSWDET M+ A D VWD+YIK HP A+ YR K L DL
Sbjct: 228 NKLLKYYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLPSYNDLD 287
Query: 94 LVYAHERTDGRYSLSSHDVDFGDDEQVVNTGSGREGVYHSSDGDEYVRGSWXXXXXXXXX 153
++A + G D DD T + +SD R W
Sbjct: 288 TIFACQAEQG--------TDHRDDGSAAQTSETKASQEQNSD---RTRIFWTPPMDYHLI 336
Query: 154 XXXXNQALKVNNSSHDFTFEAWCDIVTSFCVKFGSHYTKEDLKNRQKYLEKHFDDLKVLT 213
Q N F AW ++VT+F KFGS + K+ LKNR K+L + ++D+K L
Sbjct: 337 DLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFGSQHNKDVLKNRYKHLRRLYNDIKFLL 396
Query: 214 KQSGFAWDGKQEMVMAEDEVWNSY-----------------------TKVHPDALLYRNK 250
+Q+GF+WD +++MV+A+D++WN+Y + HP+A YR K
Sbjct: 397 EQNGFSWDARRDMVIADDDIWNTYIQACHILFLFKISVICLCLQMKHVQAHPEARSYRVK 456
Query: 251 FVPIYHKLSLIYGGEFSEERXXXXXXX------XXXXXNGPISTIGVDEDIQDCAIDYFS 304
+P Y L I+G E S+ R N ST G + + Y S
Sbjct: 457 TIPSYPNLCFIFGKETSDGRYTRLAQAFDPSPAETVRMNESGSTDGFKDTRSFQKVVYTS 516
Query: 305 R--------------VDGTPYMDRYLIDLMVEEVRRRNKIDYVRNDQACLDMVVMFKERF 350
++ T MD LIDLM+E+V R NKI +QA DM F +F
Sbjct: 517 NEKNDYPCSNIGPPCIEWTRVMDHCLIDLMLEQVSRGNKIGETFTEQAWADMAESFNAKF 576
Query: 351 GIQFDKNYLKHCCKGLEKLYHKMRSLLEERGFSWDETRQMITACNGVWDAYIKEHPDANS 410
G+Q D L++ L K + ++L GF+WD +Q I A + W+AYIKEHPDA
Sbjct: 577 GLQTDMFMLENRYILLMKERDDINNILNLDGFTWDVEKQTIVAEDEYWEAYIKEHPDATI 636
Query: 411 YRNHQKPNYNDLC 423
Y+ +Y +LC
Sbjct: 637 YKGKTLDSYGNLC 649
Score = 116 bits (291), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 59/175 (33%), Positives = 101/175 (57%), Gaps = 5/175 (2%)
Query: 304 SRVDGTPYMDRYLIDLMVEEVRRRNKIDYVRNDQACLDMVVMFKERFGIQFDKNYLKHCC 363
+R TP M+R+ IDLM+E + R N+ + N QA +M+ +F +FG Q+DK+ LK
Sbjct: 11 TRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQYDKDVLKSRY 70
Query: 364 KGLEKLYHKMRSLLEERGFSWDETRQMITACNGVWDAYIKEHPDANSYRNHQKPNYNDLC 423
L K Y+ ++ LL+ GF WD+T Q + + +W Y+K HP+A Y+ N++DLC
Sbjct: 71 TNLWKQYNDVKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTKPVLNFSDLC 130
Query: 424 LIYGSSDTELTCNPANQNVGYND-----CSIICQKLHWRSNWTPPMDRYFMDLML 473
LIYG + + + ++ ++ D ++ K ++ WT MD+YF+++M+
Sbjct: 131 LIYGYTVADGRYSMSSHDLEIEDEINGESVVLSGKESSKTEWTLEMDQYFVEIMV 185
Score = 65.5 bits (158), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 40/133 (30%), Positives = 57/133 (42%), Gaps = 41/133 (30%)
Query: 6 PRGNVNVPSGTNWTPAMENYFIGLLLDQVHKGNKF------------------------- 40
P N+ P WT M++ I L+L+QV +GNK
Sbjct: 523 PCSNIG-PPCIEWTRVMDHCLIDLMLEQVSRGNKIGETFTEQAWADMAESFNAKFGLQTD 581
Query: 41 ---------------NDIKNLLDRNGFSWDETSRMVVASDRVWDAYIKVHPHAQAYRGKE 85
+DI N+L+ +GF+WD + +VA D W+AYIK HP A Y+GK
Sbjct: 582 MFMLENRYILLMKERDDINNILNLDGFTWDVEKQTIVAEDEYWEAYIKEHPDATIYKGKT 641
Query: 86 LVDIKDLCLVYAH 98
L +LC + H
Sbjct: 642 LDSYGNLCKLNEH 654
>AT4G02210.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 22 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G24960.2). | chr4:974320-975917 REVERSE
LENGTH=439
Length = 439
Score = 175 bits (444), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 106/309 (34%), Positives = 153/309 (49%), Gaps = 10/309 (3%)
Query: 133 SSDGDEYVRGSWXXXXXXXXXXXXXNQALKVNN-SSHDFTFEAWCDIVTSFCVKFGSHYT 191
S +G+E +R W Q K N H F+ AW + SF KF Y
Sbjct: 3 SRNGNERLRTVWTPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKFLYG 62
Query: 192 KEDLKNRQKYLEKHFDDLKVLTKQSGFAWDGKQEMVMAEDEVWNSYTKVHPDALLYRNKF 251
K+ LKNR K L F + L + GF+WD ++MV+A++ VW+ Y K+HPD+ +R K
Sbjct: 63 KDVLKNRHKTLRNLFKSVNNLLIEDGFSWDDTRQMVVADNCVWDEYLKIHPDSRSFRIKS 122
Query: 252 VPIYHKLSLIYGGEFSEERXXXXXXXXXXXX--------NGPISTIGVDEDIQDCAIDYF 303
+P Y L L+Y SE + N + V + + ++
Sbjct: 123 IPCYKDLCLVYSDGMSEHKAEESISEGESKTLIQEDDGYNRICESSTVRSNSKGSSVTR- 181
Query: 304 SRVDGTPYMDRYLIDLMVEEVRRRNKIDYVRNDQACLDMVVMFKERFGIQFDKNYLKHCC 363
R P MDRY IDLM+++ RR N+I+ V QA +MV +F +F FD + LK+
Sbjct: 182 CRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNRY 241
Query: 364 KGLEKLYHKMRSLLEERGFSWDETRQMITACNGVWDAYIKEHPDANSYRNHQKPNYNDLC 423
K L + ++ ++S+L GF+WD RQM+TA N VW YIK H DA + P Y DLC
Sbjct: 242 KSLRRQFNAIKSILRSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKDLC 301
Query: 424 LIYGSSDTE 432
++ G S E
Sbjct: 302 VLCGDSGIE 310
Score = 152 bits (385), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 92/303 (30%), Positives = 144/303 (47%), Gaps = 52/303 (17%)
Query: 16 TNWTPAMENYFIGLLLDQVHKGNKFND--------------------------------- 42
T WTP M+ YFI L+++QV KGN+F D
Sbjct: 12 TVWTPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKFLYGKDVLKNRHK 71
Query: 43 --------IKNLLDRNGFSWDETSRMVVASDRVWDAYIKVHPHAQAYRGKELVDIKDLCL 94
+ NLL +GFSWD+T +MVVA + VWD Y+K+HP ++++R K + KDLCL
Sbjct: 72 TLRNLFKSVNNLLIEDGFSWDDTRQMVVADNCVWDEYLKIHPDSRSFRIKSIPCYKDLCL 131
Query: 95 VYAHERTDGRYSLSSHDVDFGDDEQVVNTGSGREGVYHSSDGDEYVRGS--------WXX 146
VY+ ++ + + + G+ + ++ G + SS +GS W
Sbjct: 132 VYSDGMSEHK---AEESISEGESKTLIQEDDGYNRICESSTVRSNSKGSSVTRCRTTWHP 188
Query: 147 XXXXXXXXXXXNQALKVNNSSHDFTFEAWCDIVTSFCVKFGSHYTKEDLKNRQKYLEKHF 206
+QA + N F +AW ++V F KF S++ + LKNR K L + F
Sbjct: 189 PMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNRYKSLRRQF 248
Query: 207 DDLKVLTKQSGFAWDGKQEMVMAEDEVWNSYTKVHPDALLYRNKFVPIYHKLSLIYGGEF 266
+ +K + + GFAWD +++MV A++ VW Y K H DA + + +P Y L ++ G
Sbjct: 249 NAIKSILRSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKDLCVLCGDSG 308
Query: 267 SEE 269
EE
Sbjct: 309 IEE 311
Score = 128 bits (321), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/188 (40%), Positives = 104/188 (55%), Gaps = 26/188 (13%)
Query: 309 TPYMDRYLIDLMVEEVRRRNKI-DYVRNDQACLDMVVMFKERFGIQFDKNYLKHCCKGLE 367
TP MD+Y I+LMVE+VR+ N+ D++ + +A M F +F + K+ LK+ K L
Sbjct: 15 TPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKFLYGKDVLKNRHKTLR 74
Query: 368 KLYHKMRSLLEERGFSWDETRQMITACNGVWDAYIKEHPDANSYRNHQKPNYNDLCLIYG 427
L+ + +LL E GFSWD+TRQM+ A N VWD Y+K HPD+ S+R P Y DLCL+Y
Sbjct: 75 NLFKSVNNLLIEDGFSWDDTRQMVVADNCVWDEYLKIHPDSRSFRIKSIPCYKDLCLVYS 134
Query: 428 SSDTELTCNPA----------NQNVGYNDCSIICQKLHWRSN------------WTPPMD 465
+E + ++ GYN IC+ RSN W PPMD
Sbjct: 135 DGMSEHKAEESISEGESKTLIQEDDGYNR---ICESSTVRSNSKGSSVTRCRTTWHPPMD 191
Query: 466 RYFMDLML 473
RYF+DLML
Sbjct: 192 RYFIDLML 199
>AT4G02210.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G24960.2); Has 791 Blast hits to 465 proteins
in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 17; Plants - 748; Viruses - 0; Other Eukaryotes
- 26 (source: NCBI BLink). | chr4:974320-975917 REVERSE
LENGTH=439
Length = 439
Score = 175 bits (444), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 106/309 (34%), Positives = 153/309 (49%), Gaps = 10/309 (3%)
Query: 133 SSDGDEYVRGSWXXXXXXXXXXXXXNQALKVNN-SSHDFTFEAWCDIVTSFCVKFGSHYT 191
S +G+E +R W Q K N H F+ AW + SF KF Y
Sbjct: 3 SRNGNERLRTVWTPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKFLYG 62
Query: 192 KEDLKNRQKYLEKHFDDLKVLTKQSGFAWDGKQEMVMAEDEVWNSYTKVHPDALLYRNKF 251
K+ LKNR K L F + L + GF+WD ++MV+A++ VW+ Y K+HPD+ +R K
Sbjct: 63 KDVLKNRHKTLRNLFKSVNNLLIEDGFSWDDTRQMVVADNCVWDEYLKIHPDSRSFRIKS 122
Query: 252 VPIYHKLSLIYGGEFSEERXXXXXXXXXXXX--------NGPISTIGVDEDIQDCAIDYF 303
+P Y L L+Y SE + N + V + + ++
Sbjct: 123 IPCYKDLCLVYSDGMSEHKAEESISEGESKTLIQEDDGYNRICESSTVRSNSKGSSVTR- 181
Query: 304 SRVDGTPYMDRYLIDLMVEEVRRRNKIDYVRNDQACLDMVVMFKERFGIQFDKNYLKHCC 363
R P MDRY IDLM+++ RR N+I+ V QA +MV +F +F FD + LK+
Sbjct: 182 CRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNRY 241
Query: 364 KGLEKLYHKMRSLLEERGFSWDETRQMITACNGVWDAYIKEHPDANSYRNHQKPNYNDLC 423
K L + ++ ++S+L GF+WD RQM+TA N VW YIK H DA + P Y DLC
Sbjct: 242 KSLRRQFNAIKSILRSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKDLC 301
Query: 424 LIYGSSDTE 432
++ G S E
Sbjct: 302 VLCGDSGIE 310
Score = 152 bits (385), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 92/303 (30%), Positives = 144/303 (47%), Gaps = 52/303 (17%)
Query: 16 TNWTPAMENYFIGLLLDQVHKGNKFND--------------------------------- 42
T WTP M+ YFI L+++QV KGN+F D
Sbjct: 12 TVWTPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKFLYGKDVLKNRHK 71
Query: 43 --------IKNLLDRNGFSWDETSRMVVASDRVWDAYIKVHPHAQAYRGKELVDIKDLCL 94
+ NLL +GFSWD+T +MVVA + VWD Y+K+HP ++++R K + KDLCL
Sbjct: 72 TLRNLFKSVNNLLIEDGFSWDDTRQMVVADNCVWDEYLKIHPDSRSFRIKSIPCYKDLCL 131
Query: 95 VYAHERTDGRYSLSSHDVDFGDDEQVVNTGSGREGVYHSSDGDEYVRGS--------WXX 146
VY+ ++ + + + G+ + ++ G + SS +GS W
Sbjct: 132 VYSDGMSEHK---AEESISEGESKTLIQEDDGYNRICESSTVRSNSKGSSVTRCRTTWHP 188
Query: 147 XXXXXXXXXXXNQALKVNNSSHDFTFEAWCDIVTSFCVKFGSHYTKEDLKNRQKYLEKHF 206
+QA + N F +AW ++V F KF S++ + LKNR K L + F
Sbjct: 189 PMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNRYKSLRRQF 248
Query: 207 DDLKVLTKQSGFAWDGKQEMVMAEDEVWNSYTKVHPDALLYRNKFVPIYHKLSLIYGGEF 266
+ +K + + GFAWD +++MV A++ VW Y K H DA + + +P Y L ++ G
Sbjct: 249 NAIKSILRSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKDLCVLCGDSG 308
Query: 267 SEE 269
EE
Sbjct: 309 IEE 311
Score = 128 bits (321), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/188 (40%), Positives = 104/188 (55%), Gaps = 26/188 (13%)
Query: 309 TPYMDRYLIDLMVEEVRRRNKI-DYVRNDQACLDMVVMFKERFGIQFDKNYLKHCCKGLE 367
TP MD+Y I+LMVE+VR+ N+ D++ + +A M F +F + K+ LK+ K L
Sbjct: 15 TPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKFLYGKDVLKNRHKTLR 74
Query: 368 KLYHKMRSLLEERGFSWDETRQMITACNGVWDAYIKEHPDANSYRNHQKPNYNDLCLIYG 427
L+ + +LL E GFSWD+TRQM+ A N VWD Y+K HPD+ S+R P Y DLCL+Y
Sbjct: 75 NLFKSVNNLLIEDGFSWDDTRQMVVADNCVWDEYLKIHPDSRSFRIKSIPCYKDLCLVYS 134
Query: 428 SSDTELTCNPA----------NQNVGYNDCSIICQKLHWRSN------------WTPPMD 465
+E + ++ GYN IC+ RSN W PPMD
Sbjct: 135 DGMSEHKAEESISEGESKTLIQEDDGYNR---ICESSTVRSNSKGSSVTRCRTTWHPPMD 191
Query: 466 RYFMDLML 473
RYF+DLML
Sbjct: 192 RYFIDLML 199
>AT5G05800.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 24 plant
structures; EXPRESSED DURING: 15 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT3G11290.1); Has 881 Blast hits to 512 proteins
in 30 species: Archae - 0; Bacteria - 2; Metazoa - 0;
Fungi - 38; Plants - 833; Viruses - 0; Other Eukaryotes
- 8 (source: NCBI BLink). | chr5:1743234-1744751 REVERSE
LENGTH=449
Length = 449
Score = 79.0 bits (193), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 61/270 (22%), Positives = 111/270 (41%), Gaps = 5/270 (1%)
Query: 159 QALKVNNSSHDFTFEAWCDIVTSFCVKFGSHYTKEDLKNRQKYLEKHFDDLKVLTKQSGF 218
Q + N F+ E W +I+ SF + G+ Y + LKN + + + + L + S
Sbjct: 22 QTMLGNKPGTHFSKEGWRNILISFQEQTGAMYDRMQLKNHWDTMSRQWKIWRRLVETSFM 81
Query: 219 AWDGKQEMVMAEDEVWNSYTKVHPDALLYRNKFVPIYHKLSLIYGGEFSEERXXXXXXXX 278
W+ + A D+ W +Y + +PDA YR KL +++ G E +
Sbjct: 82 NWNPESNRFRATDDDWANYLQENPDAGQYRLSVPHDLKKLEILFAGCNVEVKNDEVSGVR 141
Query: 279 XXXXNGPISTIGVDEDIQDCAIDYFSRVDG--TPYMDRYLIDLMVEEVRRRNKIDYVRND 336
+ DED Q + G +P + +DL+V+E + N+ D N
Sbjct: 142 KRRRS---CYEEEDEDNQSMCSSSNPQTKGYWSPSTHKLFLDLLVQETLKGNRPDTHFNK 198
Query: 337 QACLDMVVMFKERFGIQFDKNYLKHCCKGLEKLYHKMRSLLEERGFSWDETRQMITACNG 396
+ ++ E G+ + + LK+ K + L+ WD + A
Sbjct: 199 EGWKTILGTINENTGLGYTRPQLKNHWDCTRKAWKIWCQLVGASSMKWDPESRSFGATEE 258
Query: 397 VWDAYIKEHPDANSYRNHQKPNYNDLCLIY 426
W YI+E+P A +R+ + P+ + L +I+
Sbjct: 259 EWRIYIRENPRAGQFRHKEVPHADQLAIIF 288
Score = 66.2 bits (160), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 56/230 (24%), Positives = 89/230 (38%), Gaps = 21/230 (9%)
Query: 44 KNLLDRNGFSWDETSRMVVASDRVWDAYIKVHPHAQAYRGKELVDIKDLCLVYAHERTDG 103
+ L++ + +W+ S A+D W Y++ +P A YR D+K L +++A
Sbjct: 73 RRLVETSFMNWNPESNRFRATDDDWANYLQENPDAGQYRLSVPHDLKKLEILFA------ 126
Query: 104 RYSLSSHDVDFGDDEQVVNTGSGREGVYHSSDGD---------EYVRGSWXXXXXXXXXX 154
+V+ +DE V R Y D D +G W
Sbjct: 127 -----GCNVEVKNDE-VSGVRKRRRSCYEEEDEDNQSMCSSSNPQTKGYWSPSTHKLFLD 180
Query: 155 XXXNQALKVNNSSHDFTFEAWCDIVTSFCVKFGSHYTKEDLKNRQKYLEKHFDDLKVLTK 214
+ LK N F E W I+ + G YT+ LKN K + L
Sbjct: 181 LLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGYTRPQLKNHWDCTRKAWKIWCQLVG 240
Query: 215 QSGFAWDGKQEMVMAEDEVWNSYTKVHPDALLYRNKFVPIYHKLSLIYGG 264
S WD + A +E W Y + +P A +R+K VP +L++I+ G
Sbjct: 241 ASSMKWDPESRSFGATEEEWRIYIRENPRAGQFRHKEVPHADQLAIIFNG 290
Score = 52.0 bits (123), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/177 (20%), Positives = 74/177 (41%), Gaps = 15/177 (8%)
Query: 310 PYMDRYLIDLMVEEVRRRNKIDYVRNDQACLDMVVMFKERFGIQFDKNYLKHCCKGLEKL 369
P R +DL VE+ NK + + ++++ F+E+ G +D+ LK+ + +
Sbjct: 9 PEYHRVFVDLCVEQTMLGNKPGTHFSKEGWRNILISFQEQTGAMYDRMQLKNHWDTMSRQ 68
Query: 370 YHKMRSLLEERGFSWDETRQMITACNGVWDAYIKEHPDANSYRNHQKPNYNDLCLIYGSS 429
+ R L+E +W+ A + W Y++E+PDA YR + L +++
Sbjct: 69 WKIWRRLVETSFMNWNPESNRFRATDDDWANYLQENPDAGQYRLSVPHDLKKLEILFAGC 128
Query: 430 DTEL-------------TCNPANQNVGYNDCSIICQKLHWRSNWTPPMDRYFMDLML 473
+ E+ +C + CS + W+P + F+DL++
Sbjct: 129 NVEVKNDEVSGVRKRRRSCYEEEDEDNQSMCS--SSNPQTKGYWSPSTHKLFLDLLV 183
>AT5G05800.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11290.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:1743234-1744751
REVERSE LENGTH=449
Length = 449
Score = 79.0 bits (193), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 61/270 (22%), Positives = 111/270 (41%), Gaps = 5/270 (1%)
Query: 159 QALKVNNSSHDFTFEAWCDIVTSFCVKFGSHYTKEDLKNRQKYLEKHFDDLKVLTKQSGF 218
Q + N F+ E W +I+ SF + G+ Y + LKN + + + + L + S
Sbjct: 22 QTMLGNKPGTHFSKEGWRNILISFQEQTGAMYDRMQLKNHWDTMSRQWKIWRRLVETSFM 81
Query: 219 AWDGKQEMVMAEDEVWNSYTKVHPDALLYRNKFVPIYHKLSLIYGGEFSEERXXXXXXXX 278
W+ + A D+ W +Y + +PDA YR KL +++ G E +
Sbjct: 82 NWNPESNRFRATDDDWANYLQENPDAGQYRLSVPHDLKKLEILFAGCNVEVKNDEVSGVR 141
Query: 279 XXXXNGPISTIGVDEDIQDCAIDYFSRVDG--TPYMDRYLIDLMVEEVRRRNKIDYVRND 336
+ DED Q + G +P + +DL+V+E + N+ D N
Sbjct: 142 KRRRS---CYEEEDEDNQSMCSSSNPQTKGYWSPSTHKLFLDLLVQETLKGNRPDTHFNK 198
Query: 337 QACLDMVVMFKERFGIQFDKNYLKHCCKGLEKLYHKMRSLLEERGFSWDETRQMITACNG 396
+ ++ E G+ + + LK+ K + L+ WD + A
Sbjct: 199 EGWKTILGTINENTGLGYTRPQLKNHWDCTRKAWKIWCQLVGASSMKWDPESRSFGATEE 258
Query: 397 VWDAYIKEHPDANSYRNHQKPNYNDLCLIY 426
W YI+E+P A +R+ + P+ + L +I+
Sbjct: 259 EWRIYIRENPRAGQFRHKEVPHADQLAIIF 288
Score = 66.2 bits (160), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 56/230 (24%), Positives = 89/230 (38%), Gaps = 21/230 (9%)
Query: 44 KNLLDRNGFSWDETSRMVVASDRVWDAYIKVHPHAQAYRGKELVDIKDLCLVYAHERTDG 103
+ L++ + +W+ S A+D W Y++ +P A YR D+K L +++A
Sbjct: 73 RRLVETSFMNWNPESNRFRATDDDWANYLQENPDAGQYRLSVPHDLKKLEILFA------ 126
Query: 104 RYSLSSHDVDFGDDEQVVNTGSGREGVYHSSDGD---------EYVRGSWXXXXXXXXXX 154
+V+ +DE V R Y D D +G W
Sbjct: 127 -----GCNVEVKNDE-VSGVRKRRRSCYEEEDEDNQSMCSSSNPQTKGYWSPSTHKLFLD 180
Query: 155 XXXNQALKVNNSSHDFTFEAWCDIVTSFCVKFGSHYTKEDLKNRQKYLEKHFDDLKVLTK 214
+ LK N F E W I+ + G YT+ LKN K + L
Sbjct: 181 LLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGYTRPQLKNHWDCTRKAWKIWCQLVG 240
Query: 215 QSGFAWDGKQEMVMAEDEVWNSYTKVHPDALLYRNKFVPIYHKLSLIYGG 264
S WD + A +E W Y + +P A +R+K VP +L++I+ G
Sbjct: 241 ASSMKWDPESRSFGATEEEWRIYIRENPRAGQFRHKEVPHADQLAIIFNG 290
Score = 52.0 bits (123), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/177 (20%), Positives = 74/177 (41%), Gaps = 15/177 (8%)
Query: 310 PYMDRYLIDLMVEEVRRRNKIDYVRNDQACLDMVVMFKERFGIQFDKNYLKHCCKGLEKL 369
P R +DL VE+ NK + + ++++ F+E+ G +D+ LK+ + +
Sbjct: 9 PEYHRVFVDLCVEQTMLGNKPGTHFSKEGWRNILISFQEQTGAMYDRMQLKNHWDTMSRQ 68
Query: 370 YHKMRSLLEERGFSWDETRQMITACNGVWDAYIKEHPDANSYRNHQKPNYNDLCLIYGSS 429
+ R L+E +W+ A + W Y++E+PDA YR + L +++
Sbjct: 69 WKIWRRLVETSFMNWNPESNRFRATDDDWANYLQENPDAGQYRLSVPHDLKKLEILFAGC 128
Query: 430 DTEL-------------TCNPANQNVGYNDCSIICQKLHWRSNWTPPMDRYFMDLML 473
+ E+ +C + CS + W+P + F+DL++
Sbjct: 129 NVEVKNDEVSGVRKRRRSCYEEEDEDNQSMCS--SSNPQTKGYWSPSTHKLFLDLLV 183
>AT3G11290.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11310.1); Has 720 Blast hits to 435 proteins
in 28 species: Archae - 0; Bacteria - 2; Metazoa - 0;
Fungi - 32; Plants - 682; Viruses - 0; Other Eukaryotes
- 4 (source: NCBI BLink). | chr3:3535766-3537295 REVERSE
LENGTH=460
Length = 460
Score = 63.2 bits (152), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 63/282 (22%), Positives = 95/282 (33%), Gaps = 35/282 (12%)
Query: 18 WTPAMENYFIGLLLDQVHKGNKFND---IKNLLDRNG----------------------- 51
W P F+ L ++Q GN+ +K L R G
Sbjct: 7 WEPEYHRVFVDLCVEQKMLGNQPGTQHILKPFLQRTGARFTRNQLKNHWDTMIKQWKIWC 66
Query: 52 -------FSWDETSRMVVASDRVWDAYIKVHPHAQAYRGKELVDIKDLCLVYAHERTD-- 102
WD + A+D+ W Y+ V+P A YR ++ L L++ D
Sbjct: 67 RLVQCSDMQWDPQTNTFGANDQDWANYLHVNPEAGQYRLNPPSFLEKLELIFEDSNLDDE 126
Query: 103 GRYSLSSHDVDFGDDEQVVNTGSGREGVYHSSDGDEYVRGSWXXXXXXXXXXXXXNQALK 162
G + DE NTG + S+ +G W +ALK
Sbjct: 127 GTSGSKRKRIAKHRDEDNDNTGDEEDTQSASNFSSPQSKGYWSPSSHELFVDLLFQEALK 186
Query: 163 VNNSSHDFTFEAWCDIVTSFCVKFGSHYTKEDLKNRQKYLEKHFDDLKVLTKQSGFAWDG 222
N + E W I+ + G +T+ LKN K + + WD
Sbjct: 187 GNRPDSHYPKETWKMILETINQNTGKSFTRPQLKNHWDCTRKSWKIWCQVIGAPVMKWDA 246
Query: 223 KQEMVMAEDEVWNSYTKVHPDALLYRNKFVPIYHKLSLIYGG 264
A DE W +Y K + A +R K +P KL+ I+ G
Sbjct: 247 TSRTFGATDEDWKNYLKENHRAAPFRRKQLPHADKLATIFKG 288
Score = 63.2 bits (152), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 58/258 (22%), Positives = 99/258 (38%), Gaps = 14/258 (5%)
Query: 178 IVTSFCVKFGSHYTKEDLKNRQKYLEKHFDDLKVLTKQSGFAWDGKQEMVMAEDEVWNSY 237
I+ F + G+ +T+ LKN + K + L + S WD + A D+ W +Y
Sbjct: 34 ILKPFLQRTGARFTRNQLKNHWDTMIKQWKIWCRLVQCSDMQWDPQTNTFGANDQDWANY 93
Query: 238 TKVHPDALLYRNKFVPIYHKLSLIY-------GGEFSEERXXXXXXXXXXXXNGPISTIG 290
V+P+A YR KL LI+ G +R N G
Sbjct: 94 LHVNPEAGQYRLNPPSFLEKLELIFEDSNLDDEGTSGSKRKRIAKHRDEDNDN-----TG 148
Query: 291 VDEDIQDCAIDYFSRVDG--TPYMDRYLIDLMVEEVRRRNKIDYVRNDQACLDMVVMFKE 348
+ED Q + + G +P +DL+ +E + N+ D + ++ +
Sbjct: 149 DEEDTQSASNFSSPQSKGYWSPSSHELFVDLLFQEALKGNRPDSHYPKETWKMILETINQ 208
Query: 349 RFGIQFDKNYLKHCCKGLEKLYHKMRSLLEERGFSWDETRQMITACNGVWDAYIKEHPDA 408
G F + LK+ K + ++ WD T + A + W Y+KE+ A
Sbjct: 209 NTGKSFTRPQLKNHWDCTRKSWKIWCQVIGAPVMKWDATSRTFGATDEDWKNYLKENHRA 268
Query: 409 NSYRNHQKPNYNDLCLIY 426
+R Q P+ + L I+
Sbjct: 269 APFRRKQLPHADKLATIF 286
>AT4G02550.4 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 18 plant
structures; EXPRESSED DURING: 7 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G02210.2). | chr4:1120622-1121629 REVERSE
LENGTH=278
Length = 278
Score = 62.0 bits (149), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 58/121 (47%), Gaps = 1/121 (0%)
Query: 312 MDRYLIDLMVEEVRRRNKIDYVRNDQACLDMVVMFKERFGIQFDKNYLKHCCKGLEKLYH 371
MD+ LI+ + + + NK+D ND+A V RF + + K ++K Y
Sbjct: 26 MDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIKKRYR 85
Query: 372 KMRSLLEERGFSWDETRQMI-TACNGVWDAYIKEHPDANSYRNHQKPNYNDLCLIYGSSD 430
MR +L GF W+ + +MI + +W YI +PDA ++R Q Y +L + G
Sbjct: 86 VMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVCGDYQ 145
Query: 431 T 431
T
Sbjct: 146 T 146
Score = 53.9 bits (128), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 54/103 (52%), Gaps = 4/103 (3%)
Query: 162 KVNNSSHDFTFEAWCDIVTSFCVKFGSHYTKEDLKNRQKYLEKHFDDLKVLTKQSGFAWD 221
KV+ +D + A C V + +F + T + NR K ++K + ++ + + GF W+
Sbjct: 43 KVDKCFNDKAYTAACVAVNT---RFNLNLTSQKAINRLKTIKKRYRVMRDILSRDGFWWN 99
Query: 222 GKQEMVMAE-DEVWNSYTKVHPDALLYRNKFVPIYHKLSLIYG 263
+M+ E DE+W Y V+PDA +R K + +Y +L + G
Sbjct: 100 SSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVCG 142
>AT4G02550.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 350 Blast hits to 284 proteins
in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 13; Plants - 331; Viruses - 0; Other Eukaryotes
- 6 (source: NCBI BLink). | chr4:1120622-1121629 REVERSE
LENGTH=278
Length = 278
Score = 62.0 bits (149), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 58/121 (47%), Gaps = 1/121 (0%)
Query: 312 MDRYLIDLMVEEVRRRNKIDYVRNDQACLDMVVMFKERFGIQFDKNYLKHCCKGLEKLYH 371
MD+ LI+ + + + NK+D ND+A V RF + + K ++K Y
Sbjct: 26 MDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIKKRYR 85
Query: 372 KMRSLLEERGFSWDETRQMI-TACNGVWDAYIKEHPDANSYRNHQKPNYNDLCLIYGSSD 430
MR +L GF W+ + +MI + +W YI +PDA ++R Q Y +L + G
Sbjct: 86 VMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVCGDYQ 145
Query: 431 T 431
T
Sbjct: 146 T 146
Score = 53.9 bits (128), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 54/103 (52%), Gaps = 4/103 (3%)
Query: 162 KVNNSSHDFTFEAWCDIVTSFCVKFGSHYTKEDLKNRQKYLEKHFDDLKVLTKQSGFAWD 221
KV+ +D + A C V + +F + T + NR K ++K + ++ + + GF W+
Sbjct: 43 KVDKCFNDKAYTAACVAVNT---RFNLNLTSQKAINRLKTIKKRYRVMRDILSRDGFWWN 99
Query: 222 GKQEMVMAE-DEVWNSYTKVHPDALLYRNKFVPIYHKLSLIYG 263
+M+ E DE+W Y V+PDA +R K + +Y +L + G
Sbjct: 100 SSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVCG 142
>AT4G02550.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 370 Blast hits to 300 proteins
in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 10; Plants - 354; Viruses - 0; Other Eukaryotes
- 6 (source: NCBI BLink). | chr4:1120622-1121629 REVERSE
LENGTH=307
Length = 307
Score = 61.6 bits (148), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 58/121 (47%), Gaps = 1/121 (0%)
Query: 312 MDRYLIDLMVEEVRRRNKIDYVRNDQACLDMVVMFKERFGIQFDKNYLKHCCKGLEKLYH 371
MD+ LI+ + + + NK+D ND+A V RF + + K ++K Y
Sbjct: 26 MDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIKKRYR 85
Query: 372 KMRSLLEERGFSWDETRQMI-TACNGVWDAYIKEHPDANSYRNHQKPNYNDLCLIYGSSD 430
MR +L GF W+ + +MI + +W YI +PDA ++R Q Y +L + G
Sbjct: 86 VMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVCGDYQ 145
Query: 431 T 431
T
Sbjct: 146 T 146
Score = 55.1 bits (131), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 64/114 (56%), Gaps = 11/114 (9%)
Query: 39 KFNDIKNLLDRNGFSWDETSRMV-VASDRVWDAYIKVHPHAQAYRGKELVDIKDLCLVYA 97
++ ++++L R+GF W+ +++M+ SD +W YI V+P A+A+RGK++ ++L V
Sbjct: 83 RYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVCG 142
Query: 98 HERTDGRYSL----SSHDVD----FGDDEQVVNTGSGREGVYHSSDGDEYVRGS 143
+T G+Y+ SSH ++ F +D GS E + +DG E G+
Sbjct: 143 DYQTPGKYNKVKKESSHHLNDVKQFEEDSVSFPLGSSEE--HSDTDGTESYAGA 194
Score = 53.5 bits (127), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 54/103 (52%), Gaps = 4/103 (3%)
Query: 162 KVNNSSHDFTFEAWCDIVTSFCVKFGSHYTKEDLKNRQKYLEKHFDDLKVLTKQSGFAWD 221
KV+ +D + A C V + +F + T + NR K ++K + ++ + + GF W+
Sbjct: 43 KVDKCFNDKAYTAACVAVNT---RFNLNLTSQKAINRLKTIKKRYRVMRDILSRDGFWWN 99
Query: 222 GKQEMVMAE-DEVWNSYTKVHPDALLYRNKFVPIYHKLSLIYG 263
+M+ E DE+W Y V+PDA +R K + +Y +L + G
Sbjct: 100 SSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVCG 142
>AT4G02550.3 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr4:1120622-1121674 REVERSE LENGTH=322
Length = 322
Score = 61.6 bits (148), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 58/121 (47%), Gaps = 1/121 (0%)
Query: 312 MDRYLIDLMVEEVRRRNKIDYVRNDQACLDMVVMFKERFGIQFDKNYLKHCCKGLEKLYH 371
MD+ LI+ + + + NK+D ND+A V RF + + K ++K Y
Sbjct: 41 MDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIKKRYR 100
Query: 372 KMRSLLEERGFSWDETRQMI-TACNGVWDAYIKEHPDANSYRNHQKPNYNDLCLIYGSSD 430
MR +L GF W+ + +MI + +W YI +PDA ++R Q Y +L + G
Sbjct: 101 VMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVCGDYQ 160
Query: 431 T 431
T
Sbjct: 161 T 161
Score = 55.1 bits (131), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 64/114 (56%), Gaps = 11/114 (9%)
Query: 39 KFNDIKNLLDRNGFSWDETSRMV-VASDRVWDAYIKVHPHAQAYRGKELVDIKDLCLVYA 97
++ ++++L R+GF W+ +++M+ SD +W YI V+P A+A+RGK++ ++L V
Sbjct: 98 RYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVCG 157
Query: 98 HERTDGRYSL----SSHDVD----FGDDEQVVNTGSGREGVYHSSDGDEYVRGS 143
+T G+Y+ SSH ++ F +D GS E + +DG E G+
Sbjct: 158 DYQTPGKYNKVKKESSHHLNDVKQFEEDSVSFPLGSSEE--HSDTDGTESYAGA 209
Score = 53.5 bits (127), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 54/103 (52%), Gaps = 4/103 (3%)
Query: 162 KVNNSSHDFTFEAWCDIVTSFCVKFGSHYTKEDLKNRQKYLEKHFDDLKVLTKQSGFAWD 221
KV+ +D + A C V + +F + T + NR K ++K + ++ + + GF W+
Sbjct: 58 KVDKCFNDKAYTAACVAVNT---RFNLNLTSQKAINRLKTIKKRYRVMRDILSRDGFWWN 114
Query: 222 GKQEMVMAE-DEVWNSYTKVHPDALLYRNKFVPIYHKLSLIYG 263
+M+ E DE+W Y V+PDA +R K + +Y +L + G
Sbjct: 115 SSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVCG 157
>AT3G11310.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11290.1); Has 575 Blast hits to 342 proteins
in 22 species: Archae - 0; Bacteria - 2; Metazoa - 0;
Fungi - 10; Plants - 559; Viruses - 0; Other Eukaryotes
- 4 (source: NCBI BLink). | chr3:3542536-3544333 REVERSE
LENGTH=539
Length = 539
Score = 55.8 bits (133), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 55/248 (22%), Positives = 88/248 (35%), Gaps = 45/248 (18%)
Query: 46 LLDRNGFSWDETSRMVVASDRVWDAYIKVHPHAQAYRGKELVD--IKDLCLVYAHERTDG 103
L++ + WD ++ AS VW Y +V+P A+ YR + +KDL +++
Sbjct: 70 LVECSEMKWDPQTKKFGASTEVWTNYFRVNPKAKQYRFRSSPPPFLKDLKMIF------- 122
Query: 104 RYSLSSHDVDFGDDEQVVNTGSGREGVYHSSDGDEYV----------------------R 141
D GD+E T G+ +D D +
Sbjct: 123 ------EGTDLGDEE---GTSCGKRKRIPDADNDTGDEDNDTGDDDNYTGDDDITIPRYK 173
Query: 142 GSWXXXXXXXXXXXXXNQALKVNNSSHD-----FTFEAWCDIVTSFCVKFGSHYTKEDLK 196
W ++LK N + E W +V SF K G YT++ LK
Sbjct: 174 AYWSSSSHEIFVDLLFTESLKENRPKPARRNGYYAKETWNMMVESFNQKTGLRYTRKQLK 233
Query: 197 NRQKYLEKHFDDLKVLTKQSGFAWDGKQEMVMAEDEVWNSYTKVHPDALLYRNKFVPIYH 256
N + WD + A E W +Y+K + A +R K +P
Sbjct: 234 NHWNITRDAWRRWCQAVGSPLLKWDANTKTFGATSEDWENYSKENKRAEQFRLKHIPHAD 293
Query: 257 KLSLIYGG 264
KL++I+ G
Sbjct: 294 KLAIIFKG 301
>AT2G19220.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11290.1); Has 443 Blast hits to 267 proteins
in 21 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 17; Plants - 426; Viruses - 0; Other Eukaryotes
- 0 (source: NCBI BLink). | chr2:8340678-8342161 REVERSE
LENGTH=439
Length = 439
Score = 52.8 bits (125), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 52/231 (22%), Positives = 92/231 (39%), Gaps = 17/231 (7%)
Query: 46 LLDRNGFSWDETSRMVVASDRVWDAYIKVHPHAQAYRGKELVDIKDLCLVYAHERTDGRY 105
L+ WD + A+D+ W Y++V+P A YR + ++ L +++A DG
Sbjct: 67 LVQCKDIKWDSLTNTFGATDQEWANYLEVNPEAGQYRCNPPLFLEKLEIIFAGMNLDGEG 126
Query: 106 SLSSHDV----DFGDDEQVVNTGSGREGVYHSSDGDEYVRGSWXXXXXXXXXXXXXNQAL 161
+ S + + D+E V +G +SD W ++L
Sbjct: 127 TSSGSKMKQICEHRDEENV----TGYVPRLSASDIATRRHYKWSPSSHAIVVDTCFQESL 182
Query: 162 K---VNNSSHDFTFEAWCDIVTSFCVKFGSHYTKEDLKN---RQKYLEKHFDDLKVLTKQ 215
K +H FT E+W I+ G YT + L+N R + KH+ +
Sbjct: 183 KGIRPIKRNHLFTKESWKMILEKINRITGLGYTHKQLENHFTRTRTSWKHWCE---TIAS 239
Query: 216 SGFAWDGKQEMVMAEDEVWNSYTKVHPDALLYRNKFVPIYHKLSLIYGGEF 266
WD A +E W+ Y ++ A +++ + +P KL+ I+ G
Sbjct: 240 PIMKWDANTRKFGATEEDWDKYLMINKRARVFKRRHIPHADKLATIFKGRI 290