Miyakogusa Predicted Gene
- Lj4g3v2799400.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v2799400.2 Non Chatacterized Hit- tr|A5C188|A5C188_VITVI
Putative uncharacterized protein OS=Vitis vinifera
GN=,51.58,1e-16,seg,NULL; coiled-coil,NULL,CUFF.51669.2
(352 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G10010.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 402 e-112
AT5G64910.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 278 3e-75
AT5G64910.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 272 3e-73
>AT5G10010.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: nucleolus;
EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G64910.1); Has 33260 Blast
hits to 16857 proteins in 1270 species: Archae - 88;
Bacteria - 3040; Metazoa - 11915; Fungi - 3137; Plants -
1371; Viruses - 424; Other Eukaryotes - 13285 (source:
NCBI BLink). | chr5:3128098-3131452 FORWARD LENGTH=434
Length = 434
Score = 402 bits (1034), Expect = e-112, Method: Compositional matrix adjust.
Identities = 194/321 (60%), Positives = 228/321 (71%), Gaps = 1/321 (0%)
Query: 32 EINDQXXXXXXXXXXXXXXXESEPEYFEDKRNLEDLWRQTFPVGTEWDQLDSVYQIKWDF 91
EI D+ + EP YFE+KR+LEDLW+ FPVGTEWDQLD++Y+ WDF
Sbjct: 115 EIKDEKKPVPKAKKPRAAKVKEEPVYFEEKRSLEDLWKVAFPVGTEWDQLDALYEFNWDF 174
Query: 92 SNLENAFEEGGKLYDKKVYLFGCTEPQLVPFRGEHKMXXXXXXXXXXXXXXXSDKIGINS 151
NLE A EEGGKLY KKVY+FGCTEPQLVP++G +K+ SDKIGI S
Sbjct: 175 QNLEEALEEGGKLYGKKVYVFGCTEPQLVPYKGANKIVHVPAVVVIESPFPPSDKIGITS 234
Query: 152 VQREAEEIIPMKQMKMDWVPYIPLENRSSMVDRLNSQIFILSCTQRRSALKHLKVDRLKK 211
VQRE EEIIPMK+MKMDW+PYIP+E R VD++NSQIF L CTQRRSAL+H+K D+LKK
Sbjct: 235 VQREVEEIIPMKKMKMDWLPYIPIEKRDRQVDKMNSQIFTLGCTQRRSALRHMKEDQLKK 294
Query: 212 YEYCLPYFYQPFKEDELEQSTEVQIIFPSEQKPIFCEFDWEMDELEEFTDNLIQAEELSE 271
+EYCLPYFYQPFKEDELEQSTEVQI+FPSE P+ CEFDWE DEL+EF D L++ E L
Sbjct: 295 FEYCLPYFYQPFKEDELEQSTEVQIMFPSE-PPVVCEFDWEFDELQEFVDKLVEEEALPA 353
Query: 272 DEKGAFKEFVXXXXXXXXXXXXXXXXXXXXXXXXMSEETKAAFESMKFYKFYPVQSPDTP 331
++ FKE+V MSE+TK AF+ MKFYKFYP SPDTP
Sbjct: 354 EQADEFKEYVKEQVRAAKKANREAKDARKKAIEEMSEDTKQAFQKMKFYKFYPQPSPDTP 413
Query: 332 DLSEVKSPFINRYYGKAHEVL 352
D+S V+SPFINRYYGKAHEVL
Sbjct: 414 DVSGVQSPFINRYYGKAHEVL 434
>AT5G64910.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G10010.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:25940900-25944114
FORWARD LENGTH=487
Length = 487
Score = 278 bits (712), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 150/284 (52%), Positives = 185/284 (65%), Gaps = 1/284 (0%)
Query: 53 SEPEYFEDKRNLEDLWRQTFPVGTEWDQLDSVYQIKWDFSNLENAFEEGGKLYDKKVYLF 112
SEPEYFE+KRNLEDLW+ TF VGTEWDQ D++ + WDF+NLE A EEGG+LY K+VY+F
Sbjct: 175 SEPEYFEEKRNLEDLWKATFSVGTEWDQQDALNEFNWDFTNLEEALEEGGELYGKQVYVF 234
Query: 113 GCTEPQLVPFRGEHKMXXXXXXXXXXXXXXXSDKIGINSVQREAEEIIPMKQMKMDWVPY 172
GCTE V ++ E+K SD+IG+ SVQ E EII MK MKM WVPY
Sbjct: 235 GCTESHSVTYKDENKDVLVPVVVCIDSPIPPSDEIGVASVQGEVGEIIAMKTMKMAWVPY 294
Query: 173 IPLENRSSMVDRLNSQIFILSCTQRRSALKHLKVDRLKKYEYCLPYFYQPFKEDELEQST 232
IPLE R VD N IFIL CTQRRSALKHL DR+KK+ YCLPY P+K D+ E+ST
Sbjct: 295 IPLEQRDRQVDNKNFPIFILGCTQRRSALKHLPDDRVKKFNYCLPYINNPYKVDDSEKST 354
Query: 233 EVQIIFPSEQKPIFCEFDWEMDELEEFTDNLIQAEELSEDEKGAFKEFVXXXXXXXXXXX 292
V+I+FPSE P+ CE+DW +EEFTD+LI E L ++K AF+EFV
Sbjct: 355 VVKIMFPSEP-PVECEYDWVKSVIEEFTDSLINEEVLLPEQKVAFEEFVKEKSDKAMAAY 413
Query: 293 XXXXXXXXXXXXXMSEETKAAFESMKFYKFYPVQSPDTPDLSEV 336
+SEETK A++ M+ YKFYP+ SPDTP + +
Sbjct: 414 DTAQEALEKAKEGLSEETKKAYQEMRLYKFYPLPSPDTPHTAGI 457
>AT5G64910.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G10010.1). |
chr5:25940900-25944114 FORWARD LENGTH=484
Length = 484
Score = 272 bits (695), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 149/284 (52%), Positives = 184/284 (64%), Gaps = 4/284 (1%)
Query: 53 SEPEYFEDKRNLEDLWRQTFPVGTEWDQLDSVYQIKWDFSNLENAFEEGGKLYDKKVYLF 112
SEPEYFE+KRNLEDLW+ TF VGTEWDQ D++ + WDF+NLE A EEGG+LY K+VY+F
Sbjct: 175 SEPEYFEEKRNLEDLWKATFSVGTEWDQQDALNEFNWDFTNLEEALEEGGELYGKQVYVF 234
Query: 113 GCTEPQLVPFRGEHKMXXXXXXXXXXXXXXXSDKIGINSVQREAEEIIPMKQMKMDWVPY 172
GCTE ++ E+K SD+IG+ SVQ E EII MK MKM WVPY
Sbjct: 235 GCTE---FTYKDENKDVLVPVVVCIDSPIPPSDEIGVASVQGEVGEIIAMKTMKMAWVPY 291
Query: 173 IPLENRSSMVDRLNSQIFILSCTQRRSALKHLKVDRLKKYEYCLPYFYQPFKEDELEQST 232
IPLE R VD N IFIL CTQRRSALKHL DR+KK+ YCLPY P+K D+ E+ST
Sbjct: 292 IPLEQRDRQVDNKNFPIFILGCTQRRSALKHLPDDRVKKFNYCLPYINNPYKVDDSEKST 351
Query: 233 EVQIIFPSEQKPIFCEFDWEMDELEEFTDNLIQAEELSEDEKGAFKEFVXXXXXXXXXXX 292
V+I+FPSE P+ CE+DW +EEFTD+LI E L ++K AF+EFV
Sbjct: 352 VVKIMFPSEP-PVECEYDWVKSVIEEFTDSLINEEVLLPEQKVAFEEFVKEKSDKAMAAY 410
Query: 293 XXXXXXXXXXXXXMSEETKAAFESMKFYKFYPVQSPDTPDLSEV 336
+SEETK A++ M+ YKFYP+ SPDTP + +
Sbjct: 411 DTAQEALEKAKEGLSEETKKAYQEMRLYKFYPLPSPDTPHTAGI 454