Miyakogusa Predicted Gene
- Lj1g3v4863150.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v4863150.1 Non Characterized Hit- tr|I1K8L6|I1K8L6_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.4811
PE=,37.8,3e-18,coiled-coil,NULL; Myb_DNA-bind_3,Myb/SANT-like domain;
seg,NULL,CUFF.33488.1
(475 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT4G02210.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 389 e-108
AT4G02210.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 389 e-108
AT2G24960.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 244 8e-65
AT2G24960.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 232 5e-61
AT5G05800.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 115 7e-26
AT5G05800.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 115 7e-26
AT3G11290.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 105 7e-23
AT3G11310.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 87 2e-17
AT2G19220.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 80 2e-15
AT4G02550.4 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 74 2e-13
AT4G02550.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 74 2e-13
AT4G02550.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 74 3e-13
AT4G02550.3 | Symbols: | unknown protein; BEST Arabidopsis thal... 73 4e-13
AT5G27260.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 54 2e-07
AT1G30140.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 53 5e-07
>AT4G02210.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 22 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G24960.2). | chr4:974320-975917 REVERSE
LENGTH=439
Length = 439
Score = 389 bits (1000), Expect = e-108, Method: Compositional matrix adjust.
Identities = 214/461 (46%), Positives = 284/461 (61%), Gaps = 49/461 (10%)
Query: 4 ERLRTIWTPEMDRYFIDLLLEQVGDDGNRFHDHLSRQGAWKHISSLFNAKFNFQYEKDVI 63
ERLRT+WTPEMD+YFI+L++EQV GNRF DHL + AWK +S F AKF F Y KDV+
Sbjct: 8 ERLRTVWTPEMDQYFIELMVEQVRK-GNRFEDHLFSKRAWKFMSCSFTAKFKFLYGKDVL 66
Query: 64 KNRYKTLRNLHRDVSYILAQPGFSWDEKRNMVSADNHVWDEYLKVHPSARSCRVKSIPYF 123
KNR+KTLRNL + V+ +L + GFSWD+ R MV ADN VWDEYLK+HP +RS R+KSIP +
Sbjct: 67 KNRHKTLRNLFKSVNNLLIEDGFSWDDTRQMVVADNCVWDEYLKIHPDSRSFRIKSIPCY 126
Query: 124 KALCTIYGNAVTQEKGDNGQVGSSNVGEVADEVFHDIKVDEDCGISTLENGTGDSEQGAP 183
K LC +Y + +++ K + S + GE + D + C ST+ + + S
Sbjct: 127 KDLCLVYSDGMSEHKAEE----SISEGESKTLIQEDDGYNRICESSTVRSNSKGSSV--- 179
Query: 184 KATATSFRTRNRTCWQPPMDRYFINLMLAHVHKGNHVDGVFSRQAWMEMISSFNEKFGFD 243
TR RT W PPMDRYFI+LML +GN ++GVF +QAW EM++ FN KF +
Sbjct: 180 --------TRCRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESN 231
Query: 244 YSLENLKNRYKTLRRQYNLIRSLLDLDGFVWDETRQMVTADDCVWQDYIKVYSDARQFMT 303
+ ++ LKNRYK+LRRQ+N I+S+L DGF WD RQMVTAD+ VWQDYIK + DARQFMT
Sbjct: 232 FDVDVLKNRYKSLRRQFNAIKSILRSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMT 291
Query: 304 RPLPYYKALCVIY-DPNFDGKESYLAQYLELQNVADVTTESPWTSKTGQSP-NTSNSNED 361
RP+PYYK LCV+ D + E ++A + D TE +G + + S ED
Sbjct: 292 RPIPYYKDLCVLCGDSGIEENECFVA-----MDWFDPETEFQEFKSSGTTDLSISAEEED 346
Query: 362 QRQLAHIGQKQKRQLEKCPDS-TSPKKSKDDE-QGMAIALHEMATVXXXXXXXXXXXXXX 419
L + ++ QL S +PKK + DE Q M+I
Sbjct: 347 SNSLLFDPKNKRDQLANTDTSPINPKKPRVDETQTMSIE--------------------- 385
Query: 420 XXXXVIKEVQALPDMDEDLVLDACDFLEDEKKAKTFLALNA 460
++ +QALPDMD++L+LDACD LED+ KAKTFLAL+
Sbjct: 386 ---DTVEAIQALPDMDDELILDACDLLEDKLKAKTFLALDV 423
>AT4G02210.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G24960.2); Has 791 Blast hits to 465 proteins
in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 17; Plants - 748; Viruses - 0; Other Eukaryotes
- 26 (source: NCBI BLink). | chr4:974320-975917 REVERSE
LENGTH=439
Length = 439
Score = 389 bits (1000), Expect = e-108, Method: Compositional matrix adjust.
Identities = 214/461 (46%), Positives = 284/461 (61%), Gaps = 49/461 (10%)
Query: 4 ERLRTIWTPEMDRYFIDLLLEQVGDDGNRFHDHLSRQGAWKHISSLFNAKFNFQYEKDVI 63
ERLRT+WTPEMD+YFI+L++EQV GNRF DHL + AWK +S F AKF F Y KDV+
Sbjct: 8 ERLRTVWTPEMDQYFIELMVEQVRK-GNRFEDHLFSKRAWKFMSCSFTAKFKFLYGKDVL 66
Query: 64 KNRYKTLRNLHRDVSYILAQPGFSWDEKRNMVSADNHVWDEYLKVHPSARSCRVKSIPYF 123
KNR+KTLRNL + V+ +L + GFSWD+ R MV ADN VWDEYLK+HP +RS R+KSIP +
Sbjct: 67 KNRHKTLRNLFKSVNNLLIEDGFSWDDTRQMVVADNCVWDEYLKIHPDSRSFRIKSIPCY 126
Query: 124 KALCTIYGNAVTQEKGDNGQVGSSNVGEVADEVFHDIKVDEDCGISTLENGTGDSEQGAP 183
K LC +Y + +++ K + S + GE + D + C ST+ + + S
Sbjct: 127 KDLCLVYSDGMSEHKAEE----SISEGESKTLIQEDDGYNRICESSTVRSNSKGSSV--- 179
Query: 184 KATATSFRTRNRTCWQPPMDRYFINLMLAHVHKGNHVDGVFSRQAWMEMISSFNEKFGFD 243
TR RT W PPMDRYFI+LML +GN ++GVF +QAW EM++ FN KF +
Sbjct: 180 --------TRCRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESN 231
Query: 244 YSLENLKNRYKTLRRQYNLIRSLLDLDGFVWDETRQMVTADDCVWQDYIKVYSDARQFMT 303
+ ++ LKNRYK+LRRQ+N I+S+L DGF WD RQMVTAD+ VWQDYIK + DARQFMT
Sbjct: 232 FDVDVLKNRYKSLRRQFNAIKSILRSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMT 291
Query: 304 RPLPYYKALCVIY-DPNFDGKESYLAQYLELQNVADVTTESPWTSKTGQSP-NTSNSNED 361
RP+PYYK LCV+ D + E ++A + D TE +G + + S ED
Sbjct: 292 RPIPYYKDLCVLCGDSGIEENECFVA-----MDWFDPETEFQEFKSSGTTDLSISAEEED 346
Query: 362 QRQLAHIGQKQKRQLEKCPDS-TSPKKSKDDE-QGMAIALHEMATVXXXXXXXXXXXXXX 419
L + ++ QL S +PKK + DE Q M+I
Sbjct: 347 SNSLLFDPKNKRDQLANTDTSPINPKKPRVDETQTMSIE--------------------- 385
Query: 420 XXXXVIKEVQALPDMDEDLVLDACDFLEDEKKAKTFLALNA 460
++ +QALPDMD++L+LDACD LED+ KAKTFLAL+
Sbjct: 386 ---DTVEAIQALPDMDDELILDACDLLEDKLKAKTFLALDV 423
>AT2G24960.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 21 plant
structures; EXPRESSED DURING: 12 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr2:10617263-10620034 FORWARD LENGTH=774
Length = 774
Score = 244 bits (624), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 163/472 (34%), Positives = 243/472 (51%), Gaps = 50/472 (10%)
Query: 4 ERLRTIWTPEMDRYFIDLLLEQVGDDGNRFHDHLSRQGAWKHISSLFNAKFNFQYEKDVI 63
+R R WTP MD + IDLL+EQV ++GNR AW + + FNAKF Q+ KDV+
Sbjct: 321 DRTRIFWTPPMDYHLIDLLVEQV-NNGNRVGQTFI-TSAWNEMVTAFNAKFGSQHNKDVL 378
Query: 64 KNRYKTLRNLHRDVSYILAQPGFSWDEKRNMVSADNHVWDEYLKVHPSARSCRVKSIPYF 123
KNRYK LR L+ D+ ++L Q GFSWD +R+MV AD+ +W+ Y++ HP ARS RVK+IP +
Sbjct: 379 KNRYKHLRRLYNDIKFLLEQNGFSWDARRDMVIADDDIWNTYIQAHPEARSYRVKTIPSY 438
Query: 124 KALCTIYGNAVTQEKGDNGQVGSSNVGEVADEVFHDIKVDEDCGISTLENGTGDSEQGAP 183
LC I+G + G ++ + A+ V +++E S +G D+ +
Sbjct: 439 PNLCFIFGKETSD--GRYTRLAQAFDPSPAETV----RMNE----SGSTDGFKDT-RSFQ 487
Query: 184 KATATSFRTRNRTC---------WQPPMDRYFINLMLAHVHKGNHVDGVFSRQAWMEMIS 234
K TS + C W MD I+LML V +GN + F+ QAW +M
Sbjct: 488 KVVYTSNEKNDYPCSNIGPPCIEWTRVMDHCLIDLMLEQVSRGNKIGETFTEQAWADMAE 547
Query: 235 SFNEKFGFDYSLENLKNRYKTLRRQYNLIRSLLDLDGFVWDETRQMVTADDCVWQDYIKV 294
SFN KFG + L+NRY L ++ + I ++L+LDGF WD +Q + A+D W+ YIK
Sbjct: 548 SFNAKFGLQTDMFMLENRYILLMKERDDINNILNLDGFTWDVEKQTIVAEDEYWEAYIKE 607
Query: 295 YSDARQFMTRPLPYYKALCVIYDPNFDGKESYLAQYL--ELQNVAD----VTTESPWTSK 348
+ DA + + L Y LC + + +ES+ + L EL+N + V S +
Sbjct: 608 HPDATIYKGKTLDSYGNLCKLNE--HLSQESFNCENLMIELENYGNEMEIVDDFSSPHKQ 665
Query: 349 TGQSPNTSNSNEDQRQLAHIGQKQKRQLEKCPDSTSPKKSKDDEQGMAIALHEMATVXXX 408
+ PN L + K ++ + + DD+ + E+ +
Sbjct: 666 QNKRPNPITP-----PLGIVVCKAQKTGVETRKPLCETEGDDDDCTKPMPQIEIYS---- 716
Query: 409 XXXXXXXXXXXXXXXVIKEVQALPDMDEDLVLDACDFLEDEKKAKTFLALNA 460
+ +QALPDMD++L+LDACD LEDE+KAKTFLAL+
Sbjct: 717 -----------RIGNALDALQALPDMDDELLLDACDLLEDERKAKTFLALDV 757
Score = 232 bits (591), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 127/335 (37%), Positives = 187/335 (55%), Gaps = 38/335 (11%)
Query: 4 ERLRTIWTPEMDRYFIDLLLEQVGDDGNRFHDHLSRQGAWKHISSLFNAKFNFQYEKDVI 63
E +T WT EMD+YF++++++Q+G GN+ + S+Q AW + LFNA+F+ QY K V+
Sbjct: 166 ESSKTEWTLEMDQYFVEIMVDQIGR-GNKTGNAFSKQ-AWIDMLVLFNARFSGQYGKRVL 223
Query: 64 KNRYKTLRNLHRDVSYILAQPGFSWDEKRNMVSADNHVWDEYLKVHPSARSCRVKSIPYF 123
++RY L ++D+ IL + GFSWDE R M+SAD+ VWD Y+K HP AR+ R+KS+P +
Sbjct: 224 RHRYNKLLKYYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLPSY 283
Query: 124 KALCTIYGNAVTQEKGDNGQVGSSNVGEVADEVFHDIKVDEDCGISTLENGTGDSEQGAP 183
L TI+ Q G ++G+ ++
Sbjct: 284 NDLDTIFACQAEQ------------------------------GTDHRDDGSA-AQTSET 312
Query: 184 KATATSFRTRNRTCWQPPMDRYFINLMLAHVHKGNHVDGVFSRQAWMEMISSFNEKFGFD 243
KA+ R R W PPMD + I+L++ V+ GN V F AW EM+++FN KFG
Sbjct: 313 KASQEQNSDRTRIFWTPPMDYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFGSQ 372
Query: 244 YSLENLKNRYKTLRRQYNLIRSLLDLDGFVWDETRQMVTADDCVWQDYIKVYSDARQFMT 303
++ + LKNRYK LRR YN I+ LL+ +GF WD R MV ADD +W YI+ + +AR +
Sbjct: 373 HNKDVLKNRYKHLRRLYNDIKFLLEQNGFSWDARRDMVIADDDIWNTYIQAHPEARSYRV 432
Query: 304 RPLPYYKALCVIYDPNFDGKESYLAQYLELQNVAD 338
+ +P Y LC I+ GKE+ +Y L D
Sbjct: 433 KTIPSYPNLCFIF-----GKETSDGRYTRLAQAFD 462
Score = 210 bits (534), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 114/313 (36%), Positives = 172/313 (54%), Gaps = 31/313 (9%)
Query: 4 ERLRTIWTPEMDRYFIDLLLEQVGDDGNRFHDHLSRQGAWKHISSLFNAKFNFQYEKDVI 63
+R RT WTP M+R+FIDL+LE + GNR ++Q AW + ++FN+KF QY+KDV+
Sbjct: 9 DRTRTYWTPTMERFFIDLMLEHL-HRGNRTGHTFNKQ-AWNEMLTVFNSKFGSQYDKDVL 66
Query: 64 KNRYKTLRNLHRDVSYILAQPGFSWDEKRNMVSADNHVWDEYLKVHPSARSCRVKSIPYF 123
K+RY L + DV +L GF WD+ V D+ +W YLK HP AR + K + F
Sbjct: 67 KSRYTNLWKQYNDVKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTKPVLNF 126
Query: 124 KALCTIYGNAVTQEKGDNGQVGSSNVGEVADEVFHDIKVDEDCGISTLENGTGDSEQGAP 183
LC IYG V +G+ S+ HD++++++ NG
Sbjct: 127 SDLCLIYGYTVA-----DGRYSMSS---------HDLEIEDEI------NG--------- 157
Query: 184 KATATSFRTRNRTCWQPPMDRYFINLMLAHVHKGNHVDGVFSRQAWMEMISSFNEKFGFD 243
++ S + ++T W MD+YF+ +M+ + +GN FS+QAW++M+ FN +F
Sbjct: 158 ESVVLSGKESSKTEWTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQ 217
Query: 244 YSLENLKNRYKTLRRQYNLIRSLLDLDGFVWDETRQMVTADDCVWQDYIKVYSDARQFMT 303
Y L++RY L + Y + ++L DGF WDETR M++ADD VW YIK + AR +
Sbjct: 218 YGKRVLRHRYNKLLKYYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRM 277
Query: 304 RPLPYYKALCVIY 316
+ LP Y L I+
Sbjct: 278 KSLPSYNDLDTIF 290
Score = 140 bits (354), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 66/152 (43%), Positives = 97/152 (63%), Gaps = 1/152 (0%)
Query: 185 ATATSFRTRNRTCWQPPMDRYFINLMLAHVHKGNHVDGVFSRQAWMEMISSFNEKFGFDY 244
+ T+ R RT W P M+R+FI+LML H+H+GN F++QAW EM++ FN KFG Y
Sbjct: 2 SNQTTCNDRTRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQY 61
Query: 245 SLENLKNRYKTLRRQYNLIRSLLDLDGFVWDETRQMVTADDCVWQDYIKVYSDARQFMTR 304
+ LK+RY L +QYN ++ LLD GFVWD+T Q V DD +W Y+K + +AR + T+
Sbjct: 62 DKDVLKSRYTNLWKQYNDVKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTK 121
Query: 305 PLPYYKALCVIYDPNF-DGKESYLAQYLELQN 335
P+ + LC+IY DG+ S + LE+++
Sbjct: 122 PVLNFSDLCLIYGYTVADGRYSMSSHDLEIED 153
>AT2G24960.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 1453 Blast hits to 509 proteins
in 26 species: Archae - 0; Bacteria - 0; Metazoa - 1;
Fungi - 39; Plants - 1363; Viruses - 0; Other Eukaryotes
- 50 (source: NCBI BLink). | chr2:10617263-10620034
FORWARD LENGTH=797
Length = 797
Score = 232 bits (591), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 163/495 (32%), Positives = 243/495 (49%), Gaps = 73/495 (14%)
Query: 4 ERLRTIWTPEMDRYFIDLLLEQVGDDGNRFHDHLSRQGAWKHISSLFNAKFNFQYEKDVI 63
+R R WTP MD + IDLL+EQV ++GNR AW + + FNAKF Q+ KDV+
Sbjct: 321 DRTRIFWTPPMDYHLIDLLVEQV-NNGNRVGQTFI-TSAWNEMVTAFNAKFGSQHNKDVL 378
Query: 64 KNRYKTLRNLHRDVSYILAQPGFSWDEKRNMVSADNHVWDEYL----------------- 106
KNRYK LR L+ D+ ++L Q GFSWD +R+MV AD+ +W+ Y+
Sbjct: 379 KNRYKHLRRLYNDIKFLLEQNGFSWDARRDMVIADDDIWNTYIQACHILFLFKISVICLC 438
Query: 107 ------KVHPSARSCRVKSIPYFKALCTIYGNAVTQEKGDNGQVGSSNVGEVADEVFHDI 160
+ HP ARS RVK+IP + LC I+G + G ++ + A+ V
Sbjct: 439 LQMKHVQAHPEARSYRVKTIPSYPNLCFIFGKETSD--GRYTRLAQAFDPSPAETV---- 492
Query: 161 KVDEDCGISTLENGTGDSEQGAPKATATSFRTRNRTC---------WQPPMDRYFINLML 211
+++E S +G D+ + K TS + C W MD I+LML
Sbjct: 493 RMNE----SGSTDGFKDT-RSFQKVVYTSNEKNDYPCSNIGPPCIEWTRVMDHCLIDLML 547
Query: 212 AHVHKGNHVDGVFSRQAWMEMISSFNEKFGFDYSLENLKNRYKTLRRQYNLIRSLLDLDG 271
V +GN + F+ QAW +M SFN KFG + L+NRY L ++ + I ++L+LDG
Sbjct: 548 EQVSRGNKIGETFTEQAWADMAESFNAKFGLQTDMFMLENRYILLMKERDDINNILNLDG 607
Query: 272 FVWDETRQMVTADDCVWQDYIKVYSDARQFMTRPLPYYKALCVIYDPNFDGKESYLAQYL 331
F WD +Q + A+D W+ YIK + DA + + L Y LC + + +ES+ + L
Sbjct: 608 FTWDVEKQTIVAEDEYWEAYIKEHPDATIYKGKTLDSYGNLCKLNE--HLSQESFNCENL 665
Query: 332 --ELQNVAD----VTTESPWTSKTGQSPNTSNSNEDQRQLAHIGQKQKRQLEKCPDSTSP 385
EL+N + V S + + PN L + K ++ +
Sbjct: 666 MIELENYGNEMEIVDDFSSPHKQQNKRPNPITP-----PLGIVVCKAQKTGVETRKPLCE 720
Query: 386 KKSKDDEQGMAIALHEMATVXXXXXXXXXXXXXXXXXXVIKEVQALPDMDEDLVLDACDF 445
+ DD+ + E+ + + +QALPDMD++L+LDACD
Sbjct: 721 TEGDDDDCTKPMPQIEIYS---------------RIGNALDALQALPDMDDELLLDACDL 765
Query: 446 LEDEKKAKTFLALNA 460
LEDE+KAKTFLAL+
Sbjct: 766 LEDERKAKTFLALDV 780
Score = 219 bits (557), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 127/358 (35%), Positives = 187/358 (52%), Gaps = 61/358 (17%)
Query: 4 ERLRTIWTPEMDRYFIDLLLEQVGDDGNRFHDHLSRQGAWKHISSLFNAKFNFQYEKDVI 63
E +T WT EMD+YF++++++Q+G GN+ + S+Q AW + LFNA+F+ QY K V+
Sbjct: 166 ESSKTEWTLEMDQYFVEIMVDQIGR-GNKTGNAFSKQ-AWIDMLVLFNARFSGQYGKRVL 223
Query: 64 KNRYKTLRNLHRDVSYILAQPGFSWDEKRNMVSADNHVWDEYLKVHPSARSCRVKSIPYF 123
++RY L ++D+ IL + GFSWDE R M+SAD+ VWD Y+K HP AR+ R+KS+P +
Sbjct: 224 RHRYNKLLKYYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLPSY 283
Query: 124 KALCTIYGNAVTQEKGDNGQVGSSNVGEVADEVFHDIKVDEDCGISTLENGTGDSEQGAP 183
L TI+ Q G ++G+ ++
Sbjct: 284 NDLDTIFACQAEQ------------------------------GTDHRDDGSA-AQTSET 312
Query: 184 KATATSFRTRNRTCWQPPMDRYFINLMLAHVHKGNHVDGVFSRQAWMEMISSFNEKFGFD 243
KA+ R R W PPMD + I+L++ V+ GN V F AW EM+++FN KFG
Sbjct: 313 KASQEQNSDRTRIFWTPPMDYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFGSQ 372
Query: 244 YSLENLKNRYKTLRRQYNLIRSLLDLDGFVWDETRQMVTADDCVWQDYI----------- 292
++ + LKNRYK LRR YN I+ LL+ +GF WD R MV ADD +W YI
Sbjct: 373 HNKDVLKNRYKHLRRLYNDIKFLLEQNGFSWDARRDMVIADDDIWNTYIQACHILFLFKI 432
Query: 293 ------------KVYSDARQFMTRPLPYYKALCVIYDPNFDGKESYLAQYLELQNVAD 338
+ + +AR + + +P Y LC I+ GKE+ +Y L D
Sbjct: 433 SVICLCLQMKHVQAHPEARSYRVKTIPSYPNLCFIF-----GKETSDGRYTRLAQAFD 485
Score = 210 bits (534), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 116/313 (37%), Positives = 168/313 (53%), Gaps = 31/313 (9%)
Query: 4 ERLRTIWTPEMDRYFIDLLLEQVGDDGNRFHDHLSRQGAWKHISSLFNAKFNFQYEKDVI 63
+R RT WTP M+R+FIDL+LE + GNR ++Q AW + ++FN+KF QY+KDV+
Sbjct: 9 DRTRTYWTPTMERFFIDLMLEHL-HRGNRTGHTFNKQ-AWNEMLTVFNSKFGSQYDKDVL 66
Query: 64 KNRYKTLRNLHRDVSYILAQPGFSWDEKRNMVSADNHVWDEYLKVHPSARSCRVKSIPYF 123
K+RY L + DV +L GF WD+ V D+ +W YLK HP AR + K + F
Sbjct: 67 KSRYTNLWKQYNDVKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTKPVLNF 126
Query: 124 KALCTIYGNAVTQEKGDNGQVGSSNVGEVADEVFHDIKVDEDCGISTLENGTGDSEQGAP 183
LC IYG V D SS+ E+ DE+ NG
Sbjct: 127 SDLCLIYGYTV----ADGRYSMSSHDLEIEDEI----------------NG--------- 157
Query: 184 KATATSFRTRNRTCWQPPMDRYFINLMLAHVHKGNHVDGVFSRQAWMEMISSFNEKFGFD 243
++ S + ++T W MD+YF+ +M+ + +GN FS+QAW++M+ FN +F
Sbjct: 158 ESVVLSGKESSKTEWTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQ 217
Query: 244 YSLENLKNRYKTLRRQYNLIRSLLDLDGFVWDETRQMVTADDCVWQDYIKVYSDARQFMT 303
Y L++RY L + Y + ++L DGF WDETR M++ADD VW YIK + AR +
Sbjct: 218 YGKRVLRHRYNKLLKYYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRM 277
Query: 304 RPLPYYKALCVIY 316
+ LP Y L I+
Sbjct: 278 KSLPSYNDLDTIF 290
Score = 140 bits (354), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 66/152 (43%), Positives = 97/152 (63%), Gaps = 1/152 (0%)
Query: 185 ATATSFRTRNRTCWQPPMDRYFINLMLAHVHKGNHVDGVFSRQAWMEMISSFNEKFGFDY 244
+ T+ R RT W P M+R+FI+LML H+H+GN F++QAW EM++ FN KFG Y
Sbjct: 2 SNQTTCNDRTRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQY 61
Query: 245 SLENLKNRYKTLRRQYNLIRSLLDLDGFVWDETRQMVTADDCVWQDYIKVYSDARQFMTR 304
+ LK+RY L +QYN ++ LLD GFVWD+T Q V DD +W Y+K + +AR + T+
Sbjct: 62 DKDVLKSRYTNLWKQYNDVKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTK 121
Query: 305 PLPYYKALCVIYDPNF-DGKESYLAQYLELQN 335
P+ + LC+IY DG+ S + LE+++
Sbjct: 122 PVLNFSDLCLIYGYTVADGRYSMSSHDLEIED 153
>AT5G05800.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 24 plant
structures; EXPRESSED DURING: 15 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT3G11290.1); Has 881 Blast hits to 512 proteins
in 30 species: Archae - 0; Bacteria - 2; Metazoa - 0;
Fungi - 38; Plants - 833; Viruses - 0; Other Eukaryotes
- 8 (source: NCBI BLink). | chr5:1743234-1744751 REVERSE
LENGTH=449
Length = 449
Score = 115 bits (288), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 86/375 (22%), Positives = 162/375 (43%), Gaps = 28/375 (7%)
Query: 5 RLRTIWTPEMDRYFIDLLLEQVGDDGNRFHDHLSRQGAWKHISSLFNAKFNFQYEKDVIK 64
R + +W PE R F+DL +EQ GN+ H S++G W++I F + Y++ +K
Sbjct: 2 RPKAVWEPEYHRVFVDLCVEQTML-GNKPGTHFSKEG-WRNILISFQEQTGAMYDRMQLK 59
Query: 65 NRYKTLRNLHRDVSYILAQPGFSWDEKRNMVSADNHVWDEYLKVHPSARSCRVKSIPY-F 123
N + T+ + ++ +W+ + N A + W YL+ +P A R+ S+P+
Sbjct: 60 NHWDTMSRQWKIWRRLVETSFMNWNPESNRFRATDDDWANYLQENPDAGQYRL-SVPHDL 118
Query: 124 KALCTIYGNAVTQEKGDNGQVGSSNVGEVADEVFHDIKVDEDCGISTLENGTGDSEQGAP 183
K L ++ NV ++K DE G+ + E
Sbjct: 119 KKLEILFAGC--------------NV---------EVKNDEVSGVRKRRRSCYEEEDEDN 155
Query: 184 KATATSFRTRNRTCWQPPMDRYFINLMLAHVHKGNHVDGVFSRQAWMEMISSFNEKFGFD 243
++ +S + + W P + F++L++ KGN D F+++ W ++ + NE G
Sbjct: 156 QSMCSSSNPQTKGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLG 215
Query: 244 YSLENLKNRYKTLRRQYNLIRSLLDLDGFVWDETRQMVTADDCVWQDYIKVYSDARQFMT 303
Y+ LKN + R+ + + L+ WD + A + W+ YI+ A QF
Sbjct: 216 YTRPQLKNHWDCTRKAWKIWCQLVGASSMKWDPESRSFGATEEEWRIYIRENPRAGQFRH 275
Query: 304 RPLPYYKALCVIYDPNFDGKESYLAQYLELQNVADVTTESP-WTSKTGQSPNTSNSNEDQ 362
+ +P+ L +I++ + E+Y + + +ESP W T S + E
Sbjct: 276 KEVPHADQLAIIFNGVIEPGETYTPPSRSRKKLLHNRSESPQWRDTTPLSKMHVDEAETS 335
Query: 363 RQLAHIGQKQKRQLE 377
RQ + Q+ +++
Sbjct: 336 RQNGCYAESQEDRID 350
>AT5G05800.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11290.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:1743234-1744751
REVERSE LENGTH=449
Length = 449
Score = 115 bits (288), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 86/375 (22%), Positives = 162/375 (43%), Gaps = 28/375 (7%)
Query: 5 RLRTIWTPEMDRYFIDLLLEQVGDDGNRFHDHLSRQGAWKHISSLFNAKFNFQYEKDVIK 64
R + +W PE R F+DL +EQ GN+ H S++G W++I F + Y++ +K
Sbjct: 2 RPKAVWEPEYHRVFVDLCVEQTML-GNKPGTHFSKEG-WRNILISFQEQTGAMYDRMQLK 59
Query: 65 NRYKTLRNLHRDVSYILAQPGFSWDEKRNMVSADNHVWDEYLKVHPSARSCRVKSIPY-F 123
N + T+ + ++ +W+ + N A + W YL+ +P A R+ S+P+
Sbjct: 60 NHWDTMSRQWKIWRRLVETSFMNWNPESNRFRATDDDWANYLQENPDAGQYRL-SVPHDL 118
Query: 124 KALCTIYGNAVTQEKGDNGQVGSSNVGEVADEVFHDIKVDEDCGISTLENGTGDSEQGAP 183
K L ++ NV ++K DE G+ + E
Sbjct: 119 KKLEILFAGC--------------NV---------EVKNDEVSGVRKRRRSCYEEEDEDN 155
Query: 184 KATATSFRTRNRTCWQPPMDRYFINLMLAHVHKGNHVDGVFSRQAWMEMISSFNEKFGFD 243
++ +S + + W P + F++L++ KGN D F+++ W ++ + NE G
Sbjct: 156 QSMCSSSNPQTKGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLG 215
Query: 244 YSLENLKNRYKTLRRQYNLIRSLLDLDGFVWDETRQMVTADDCVWQDYIKVYSDARQFMT 303
Y+ LKN + R+ + + L+ WD + A + W+ YI+ A QF
Sbjct: 216 YTRPQLKNHWDCTRKAWKIWCQLVGASSMKWDPESRSFGATEEEWRIYIRENPRAGQFRH 275
Query: 304 RPLPYYKALCVIYDPNFDGKESYLAQYLELQNVADVTTESP-WTSKTGQSPNTSNSNEDQ 362
+ +P+ L +I++ + E+Y + + +ESP W T S + E
Sbjct: 276 KEVPHADQLAIIFNGVIEPGETYTPPSRSRKKLLHNRSESPQWRDTTPLSKMHVDEAETS 335
Query: 363 RQLAHIGQKQKRQLE 377
RQ + Q+ +++
Sbjct: 336 RQNGCYAESQEDRID 350
>AT3G11290.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11310.1); Has 720 Blast hits to 435 proteins
in 28 species: Archae - 0; Bacteria - 2; Metazoa - 0;
Fungi - 32; Plants - 682; Viruses - 0; Other Eukaryotes
- 4 (source: NCBI BLink). | chr3:3535766-3537295 REVERSE
LENGTH=460
Length = 460
Score = 105 bits (262), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 79/339 (23%), Positives = 146/339 (43%), Gaps = 31/339 (9%)
Query: 7 RTIWTPEMDRYFIDLLLEQVGDDGNRFHDHLSRQGAWKHISSLFNAKFNFQYEKDVIKNR 66
+ W PE R F+DL +EQ L Q +HI F + ++ ++ +KN
Sbjct: 4 KAAWEPEYHRVFVDLCVEQ---------KMLGNQPGTQHILKPFLQRTGARFTRNQLKNH 54
Query: 67 YKTLRNLHRDVSYILAQPGFSWDEKRNMVSADNHVWDEYLKVHPSARSCRVKSIPYFKAL 126
+ T+ + ++ WD + N A++ W YL V+P A R+ + + L
Sbjct: 55 WDTMIKQWKIWCRLVQCSDMQWDPQTNTFGANDQDWANYLHVNPEAGQYRLNPPSFLEKL 114
Query: 127 CTIYGNAVTQEKGDNGQVGSSNVGEVADEVFHDIKVDEDCGISTLENGTGDSEQGAPKAT 186
I+ ++ ++G +G S +A DED + TGD E +
Sbjct: 115 ELIFEDSNLDDEGTSG----SKRKRIAKHR------DED------NDNTGDEED---TQS 155
Query: 187 ATSFRT-RNRTCWQPPMDRYFINLMLAHVHKGNHVDGVFSRQAWMEMISSFNEKFGFDYS 245
A++F + +++ W P F++L+ KGN D + ++ W ++ + N+ G ++
Sbjct: 156 ASNFSSPQSKGYWSPSSHELFVDLLFQEALKGNRPDSHYPKETWKMILETINQNTGKSFT 215
Query: 246 LENLKNRYKTLRRQYNLIRSLLDLDGFVWDETRQMVTADDCVWQDYIKVYSDARQFMTRP 305
LKN + R+ + + ++ WD T + A D W++Y+K A F +
Sbjct: 216 RPQLKNHWDCTRKSWKIWCQVIGAPVMKWDATSRTFGATDEDWKNYLKENHRAAPFRRKQ 275
Query: 306 LPYYKALCVIYDPNFDGKESYLAQYLELQNVADVTTESP 344
LP+ L I+ + ++Y Y + V D +ESP
Sbjct: 276 LPHADKLATIFKGLIEPGKAYFRSY--RRRVLDHHSESP 312
>AT3G11310.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11290.1); Has 575 Blast hits to 342 proteins
in 22 species: Archae - 0; Bacteria - 2; Metazoa - 0;
Fungi - 10; Plants - 559; Viruses - 0; Other Eukaryotes
- 4 (source: NCBI BLink). | chr3:3542536-3544333 REVERSE
LENGTH=539
Length = 539
Score = 87.0 bits (214), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 78/337 (23%), Positives = 132/337 (39%), Gaps = 33/337 (9%)
Query: 3 MERLRTIWTPEMDRYFIDLLLEQ--VGDDGNRFHDHLSRQGAWKHISSLFNAKFNFQYEK 60
M R + +W PE+ + F+DL +EQ +G R I F ++ +
Sbjct: 1 MTREKVMWEPELHKVFVDLCVEQKMLG----------FRLPGLNRIWESFVQNTGARFTR 50
Query: 61 DVIKNRYKTLRNLHRDVSYILAQPGFSWDEKRNMVSADNHVWDEYLKVHPSARSCRVKS- 119
D +KN + T+ L R ++ WD + A VW Y +V+P A+ R +S
Sbjct: 51 DQLKNHWDTMLRLWRAWCRLVECSEMKWDPQTKKFGASTEVWTNYFRVNPKAKQYRFRSS 110
Query: 120 -IPYFKALCTIYGNAVTQEKGDNGQVGSSNVGEVADEVFHDIKVDEDCGISTLENGTGDS 178
P+ K L I+ + GD + D D D G + G D+
Sbjct: 111 PPPFLKDLKMIFEGT---DLGDEEGTSCGKRKRIPD-------ADNDTGDEDNDTGDDDN 160
Query: 179 EQGAPKATATSFRTRNRTCWQPPMDRYFINLMLAHVHKGNHV-----DGVFSRQAWMEMI 233
G T R + W F++L+ K N +G ++++ W M+
Sbjct: 161 YTGDDDITI----PRYKAYWSSSSHEIFVDLLFTESLKENRPKPARRNGYYAKETWNMMV 216
Query: 234 SSFNEKFGFDYSLENLKNRYKTLRRQYNLIRSLLDLDGFVWDETRQMVTADDCVWQDYIK 293
SFN+K G Y+ + LKN + R + + WD + A W++Y K
Sbjct: 217 ESFNQKTGLRYTRKQLKNHWNITRDAWRRWCQAVGSPLLKWDANTKTFGATSEDWENYSK 276
Query: 294 VYSDARQFMTRPLPYYKALCVIYDPNFDGKESYLAQY 330
A QF + +P+ L +I+ + + ++ L Y
Sbjct: 277 ENKRAEQFRLKHIPHADKLAIIFKGHVEPGKTALRPY 313
Score = 50.4 bits (119), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 33/142 (23%), Positives = 58/142 (40%), Gaps = 5/142 (3%)
Query: 1 MVMERLRTIWTPEMDRYFIDLL----LEQVGDDGNRFHDHLSRQGAWKHISSLFNAKFNF 56
+ + R + W+ F+DLL L++ R + + +++ W + FN K
Sbjct: 167 ITIPRYKAYWSSSSHEIFVDLLFTESLKENRPKPARRNGYYAKE-TWNMMVESFNQKTGL 225
Query: 57 QYEKDVIKNRYKTLRNLHRDVSYILAQPGFSWDEKRNMVSADNHVWDEYLKVHPSARSCR 116
+Y + +KN + R+ R + P WD A + W+ Y K + A R
Sbjct: 226 RYTRKQLKNHWNITRDAWRRWCQAVGSPLLKWDANTKTFGATSEDWENYSKENKRAEQFR 285
Query: 117 VKSIPYFKALCTIYGNAVTQEK 138
+K IP+ L I+ V K
Sbjct: 286 LKHIPHADKLAIIFKGHVEPGK 307
>AT2G19220.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11290.1); Has 443 Blast hits to 267 proteins
in 21 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 17; Plants - 426; Viruses - 0; Other Eukaryotes
- 0 (source: NCBI BLink). | chr2:8340678-8342161 REVERSE
LENGTH=439
Length = 439
Score = 80.5 bits (197), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 78/362 (21%), Positives = 135/362 (37%), Gaps = 43/362 (11%)
Query: 7 RTIWTPEMDRYFIDLLLEQVGDDGNRFHDHLSRQGAWKHISSLFNAKFNFQYEKDVIKNR 66
+ W PE D F+DL +EQ L Q +HI F + ++ D + N
Sbjct: 4 KAAWEPEHDEVFVDLCVEQ---------KMLGNQPEMQHILEAFQ-EMGVRFTIDQLINH 53
Query: 67 YKTLRNLHRDVSYILAQPGFSWDEKRNMVSADNHVWDEYLKVHPSARSCRVKSIPYFKAL 126
+ T+ + ++ WD N A + W YL+V+P A R + + L
Sbjct: 54 WDTMIKQWKIWCRLVQCKDIKWDSLTNTFGATDQEWANYLEVNPEAGQYRCNPPLFLEKL 113
Query: 127 CTIYGNAVTQEKGDNGQVGSSNVGEVADEVFHDIKVDEDCGISTLENGTGDSEQGAPKAT 186
I+ +G + + E DE EN TG P+ +
Sbjct: 114 EIIFAGMNLDGEGTSSGSKMKQICEHRDE----------------ENVTG----YVPRLS 153
Query: 187 ATSFRTRNRTCWQPPMDRYFINLMLAHVHKG------NHVDGVFSRQAWMEMISSFNEKF 240
A+ TR W P ++ KG NH +F++++W ++ N
Sbjct: 154 ASDIATRRHYKWSPSSHAIVVDTCFQESLKGIRPIKRNH---LFTKESWKMILEKINRIT 210
Query: 241 GFDYSLENLKNRYKTLRRQYNLIRSLLDLDGFVWDETRQMVTADDCVWQDYIKVYSDARQ 300
G Y+ + L+N + R + + WD + A + W Y+ + AR
Sbjct: 211 GLGYTHKQLENHFTRTRTSWKHWCETIASPIMKWDANTRKFGATEEDWDKYLMINKRARV 270
Query: 301 FMTRPLPYYKALCVIYDPNFDGKESYLAQYLELQNVADVTTESPWTSKTGQSPNT--SNS 358
F R +P+ L I+ + ++ +Y + V D +ESP +P++ N+
Sbjct: 271 FKRRHIPHADKLATIFKGRIEPGKTKTRRY--RKRVIDHHSESPQLHDHQPTPSSVVVNT 328
Query: 359 NE 360
NE
Sbjct: 329 NE 330
Score = 52.4 bits (124), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 37/156 (23%), Positives = 70/156 (44%), Gaps = 13/156 (8%)
Query: 194 NRTCWQPPMDRYFINLMLAHVHKGNHVDGVFSRQAWMEMISSFNEKFGFDYSLENLKNRY 253
++ W+P D F++L + GN + +A+ EM G ++++ L N +
Sbjct: 3 SKAAWEPEHDEVFVDLCVEQKMLGNQPEMQHILEAFQEM--------GVRFTIDQLINHW 54
Query: 254 KTLRRQYNLIRSLLDLDGFVWDETRQMVTADDCVWQDYIKVYSDARQFMTRPLPYYKALC 313
T+ +Q+ + L+ WD A D W +Y++V +A Q+ P + + L
Sbjct: 55 DTMIKQWKIWCRLVQCKDIKWDSLTNTFGATDQEWANYLEVNPEAGQYRCNPPLFLEKLE 114
Query: 314 VIY-DPNFDGK----ESYLAQYLELQNVADVTTESP 344
+I+ N DG+ S + Q E ++ +VT P
Sbjct: 115 IIFAGMNLDGEGTSSGSKMKQICEHRDEENVTGYVP 150
>AT4G02550.4 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 18 plant
structures; EXPRESSED DURING: 7 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G02210.2). | chr4:1120622-1121629 REVERSE
LENGTH=278
Length = 278
Score = 73.9 bits (180), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 70/141 (49%), Gaps = 12/141 (8%)
Query: 198 WQPPMDRYFINLMLAHVHKGNHVDGVFSRQAWMEMISSFNEKFGFDYSLENLKNRYKTLR 257
W MD+ I + GN VD F+ +A+ + N +F + + + NR KT++
Sbjct: 22 WSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIK 81
Query: 258 RQYNLIRSLLDLDGFVWDETRQMVTAD-DCVWQDYIKVYSDARQFMTRPLPYYKAL---C 313
++Y ++R +L DGF W+ + +M+ + D +W+ YI V DA+ F + + Y+ L C
Sbjct: 82 KRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVC 141
Query: 314 VIYD--------PNFDGKESY 326
Y + DG ESY
Sbjct: 142 GDYQTPGSSEEHSDTDGTESY 162
Score = 60.5 bits (145), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 63/129 (48%), Gaps = 3/129 (2%)
Query: 8 TIWTPEMDRYFIDLLLEQVGDDGNRFHDHLSRQGAWKHISSLFNAKFNFQYEKDVIKNRY 67
IW+ MD+ I+ L Q +GN+ D A+ N +FN NR
Sbjct: 20 VIWSVGMDKCLIEALAVQ-AKNGNKV-DKCFNDKAYTAACVAVNTRFNLNLTSQKAINRL 77
Query: 68 KTLRNLHRDVSYILAQPGFSWDEKRNMVSAD-NHVWDEYLKVHPSARSCRVKSIPYFKAL 126
KT++ +R + IL++ GF W+ M+ + + +W Y+ V+P A++ R K I ++ L
Sbjct: 78 KTIKKRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEEL 137
Query: 127 CTIYGNAVT 135
T+ G+ T
Sbjct: 138 RTVCGDYQT 146
>AT4G02550.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 350 Blast hits to 284 proteins
in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 13; Plants - 331; Viruses - 0; Other Eukaryotes
- 6 (source: NCBI BLink). | chr4:1120622-1121629 REVERSE
LENGTH=278
Length = 278
Score = 73.9 bits (180), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 70/141 (49%), Gaps = 12/141 (8%)
Query: 198 WQPPMDRYFINLMLAHVHKGNHVDGVFSRQAWMEMISSFNEKFGFDYSLENLKNRYKTLR 257
W MD+ I + GN VD F+ +A+ + N +F + + + NR KT++
Sbjct: 22 WSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIK 81
Query: 258 RQYNLIRSLLDLDGFVWDETRQMVTAD-DCVWQDYIKVYSDARQFMTRPLPYYKAL---C 313
++Y ++R +L DGF W+ + +M+ + D +W+ YI V DA+ F + + Y+ L C
Sbjct: 82 KRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVC 141
Query: 314 VIYD--------PNFDGKESY 326
Y + DG ESY
Sbjct: 142 GDYQTPGSSEEHSDTDGTESY 162
Score = 60.5 bits (145), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 63/129 (48%), Gaps = 3/129 (2%)
Query: 8 TIWTPEMDRYFIDLLLEQVGDDGNRFHDHLSRQGAWKHISSLFNAKFNFQYEKDVIKNRY 67
IW+ MD+ I+ L Q +GN+ D A+ N +FN NR
Sbjct: 20 VIWSVGMDKCLIEALAVQ-AKNGNKV-DKCFNDKAYTAACVAVNTRFNLNLTSQKAINRL 77
Query: 68 KTLRNLHRDVSYILAQPGFSWDEKRNMVSAD-NHVWDEYLKVHPSARSCRVKSIPYFKAL 126
KT++ +R + IL++ GF W+ M+ + + +W Y+ V+P A++ R K I ++ L
Sbjct: 78 KTIKKRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEEL 137
Query: 127 CTIYGNAVT 135
T+ G+ T
Sbjct: 138 RTVCGDYQT 146
>AT4G02550.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 370 Blast hits to 300 proteins
in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 10; Plants - 354; Viruses - 0; Other Eukaryotes
- 6 (source: NCBI BLink). | chr4:1120622-1121629 REVERSE
LENGTH=307
Length = 307
Score = 73.6 bits (179), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 62/116 (53%), Gaps = 1/116 (0%)
Query: 198 WQPPMDRYFINLMLAHVHKGNHVDGVFSRQAWMEMISSFNEKFGFDYSLENLKNRYKTLR 257
W MD+ I + GN VD F+ +A+ + N +F + + + NR KT++
Sbjct: 22 WSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIK 81
Query: 258 RQYNLIRSLLDLDGFVWDETRQMVTAD-DCVWQDYIKVYSDARQFMTRPLPYYKAL 312
++Y ++R +L DGF W+ + +M+ + D +W+ YI V DA+ F + + Y+ L
Sbjct: 82 KRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEEL 137
Score = 60.1 bits (144), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 69/142 (48%), Gaps = 3/142 (2%)
Query: 8 TIWTPEMDRYFIDLLLEQVGDDGNRFHDHLSRQGAWKHISSLFNAKFNFQYEKDVIKNRY 67
IW+ MD+ I+ L Q +GN+ D A+ N +FN NR
Sbjct: 20 VIWSVGMDKCLIEALAVQ-AKNGNKV-DKCFNDKAYTAACVAVNTRFNLNLTSQKAINRL 77
Query: 68 KTLRNLHRDVSYILAQPGFSWDEKRNMVSAD-NHVWDEYLKVHPSARSCRVKSIPYFKAL 126
KT++ +R + IL++ GF W+ M+ + + +W Y+ V+P A++ R K I ++ L
Sbjct: 78 KTIKKRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEEL 137
Query: 127 CTIYGNAVTQEKGDNGQVGSSN 148
T+ G+ T K + + SS+
Sbjct: 138 RTVCGDYQTPGKYNKVKKESSH 159
>AT4G02550.3 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr4:1120622-1121674 REVERSE LENGTH=322
Length = 322
Score = 72.8 bits (177), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 63/119 (52%), Gaps = 1/119 (0%)
Query: 198 WQPPMDRYFINLMLAHVHKGNHVDGVFSRQAWMEMISSFNEKFGFDYSLENLKNRYKTLR 257
W MD+ I + GN VD F+ +A+ + N +F + + + NR KT++
Sbjct: 37 WSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIK 96
Query: 258 RQYNLIRSLLDLDGFVWDETRQMVTAD-DCVWQDYIKVYSDARQFMTRPLPYYKALCVI 315
++Y ++R +L DGF W+ + +M+ + D +W+ YI V DA+ F + + Y+ L +
Sbjct: 97 KRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTV 155
Score = 60.1 bits (144), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 69/142 (48%), Gaps = 3/142 (2%)
Query: 8 TIWTPEMDRYFIDLLLEQVGDDGNRFHDHLSRQGAWKHISSLFNAKFNFQYEKDVIKNRY 67
IW+ MD+ I+ L Q +GN+ D A+ N +FN NR
Sbjct: 35 VIWSVGMDKCLIEALAVQ-AKNGNKV-DKCFNDKAYTAACVAVNTRFNLNLTSQKAINRL 92
Query: 68 KTLRNLHRDVSYILAQPGFSWDEKRNMVSAD-NHVWDEYLKVHPSARSCRVKSIPYFKAL 126
KT++ +R + IL++ GF W+ M+ + + +W Y+ V+P A++ R K I ++ L
Sbjct: 93 KTIKKRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEEL 152
Query: 127 CTIYGNAVTQEKGDNGQVGSSN 148
T+ G+ T K + + SS+
Sbjct: 153 RTVCGDYQTPGKYNKVKKESSH 174
>AT5G27260.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G29880.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:9603943-9604930
FORWARD LENGTH=303
Length = 303
Score = 53.9 bits (128), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/146 (23%), Positives = 63/146 (43%), Gaps = 13/146 (8%)
Query: 10 WTPEMDRYFIDLLLEQVGDDGNRFHDHLSRQGAWKHISSLFNAKF-------NFQYEKDV 62
W+PE + + LL+E + ++ + +S+ N +F ++
Sbjct: 17 WSPEETKLLVQLLVEGINNNWRDSNGTISKLTVETKFMPEINKEFCRSKNYNHYLSRMKY 76
Query: 63 IKNRYKTLRNLHRDVSYILAQPGFSWDEKRNMVSADNHVWDEYLKVHPSARSCRVKSIPY 122
+K +Y++ +L R S GF WD +A + VW +YLK HP+ + R + +
Sbjct: 77 LKIQYQSCLDLQRFSS------GFGWDPLTKRFTASDEVWSDYLKAHPNNKQLRYDTFEF 130
Query: 123 FKALCTIYGNAVTQEKGDNGQVGSSN 148
F L I+G V K G S++
Sbjct: 131 FDELQIIFGEGVATGKNAIGLCDSTD 156
>AT1G30140.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G27260.1); Has 313 Blast hits to 256 proteins
in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 8; Plants - 295; Viruses - 0; Other Eukaryotes -
10 (source: NCBI BLink). | chr1:10598764-10599527
FORWARD LENGTH=222
Length = 222
Score = 52.8 bits (125), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 63/123 (51%), Gaps = 9/123 (7%)
Query: 198 WQPPMDRYFINLMLAHVHKGNHVDGVFSRQAWMEMISSFNEKFGFDYSLENLKNRYKTLR 257
W P I L+ + + + G + ++ +++ + N++ G + + +N +R K L+
Sbjct: 17 WTPDETDVLIELIRQNWRDSSGIIGKLTVES--KLLPALNKRLGCNKNHKNYMSRLKFLK 74
Query: 258 RQYNLIRSLLDL----DGFVWDETRQMVTADDCVWQDYIKVYSDARQFMTRPLPYYKALC 313
NL +S LDL GF WD + TA D VW+DY+K + + + T + +++ L
Sbjct: 75 ---NLYQSYLDLKRFSSGFGWDPETKKFTAPDEVWRDYLKAHPNHKHMQTESIDHFEDLQ 131
Query: 314 VIY 316
+I+
Sbjct: 132 IIF 134
Score = 50.4 bits (119), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 43/80 (53%), Gaps = 6/80 (7%)
Query: 55 NFQYEKDVIKNRYKTLRNLHRDVSYILAQPGFSWDEKRNMVSADNHVWDEYLKVHPSARS 114
N+ +KN Y++ +L R S GF WD + +A + VW +YLK HP+ +
Sbjct: 65 NYMSRLKFLKNLYQSYLDLKRFSS------GFGWDPETKKFTAPDEVWRDYLKAHPNHKH 118
Query: 115 CRVKSIPYFKALCTIYGNAV 134
+ +SI +F+ L I+G+ V
Sbjct: 119 MQTESIDHFEDLQIIFGDVV 138