Miyakogusa Predicted Gene
- Lj1g3v4226650.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v4226650.1 Non Characterized Hit- tr|I1JN93|I1JN93_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.48387
PE,89.36,7e-18,seg,NULL; Glyco_tranf_2_4,NULL; SUBFAMILY NOT
NAMED,NULL; EXTENDED SYNAPTOTAGMIN-RELATED,NULL,CUFF.32071.1
(264 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value
AT3G57200.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    348   3e-96
AT2G41451.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    325   3e-89
AT3G08550.1 | Symbols: ELD1, ABI8, KOB1 | elongation defective 1...   323   5e-89
>AT3G57200.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G41451.1); Has 94 Blast hits to 94 proteins in
31 species: Archae - 0; Bacteria - 12; Metazoa - 0;
Fungi - 0; Plants - 77; Viruses - 0; Other Eukaryotes -
5 (source: NCBI BLink). | chr3:21169413-21172103 FORWARD
LENGTH=514
Length = 514
Score = 348 bits (892), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 164/227 (72%), Positives = 189/227 (83%), Gaps = 8/227 (3%)
Query: 37 QWRGGVTDDPLTRWSPDHHQFPGMSLSNXXXXXXXXXXXXHHSHPDCASLIANSHSPSFP 96
QWRGG+ DDP+T WS DHH+FPGM + S C L+ S SPSFP
Sbjct: 46 QWRGGL-DDPVTHWSIDHHEFPGMVTTQEKRSLRRSV-----SDSGCVDLLGQSRSPSFP 99
Query: 97 YFRDWKPDYSSDLVLSPKICITTSTSAGLEQTLPWIFYHKVMGVSSFFLFVEGKAASPNV 156
YFR+WK DY SDL P+ICITTSTSAGLEQTLPWI++HKV+GVS+F+LFVEGKAASPNV
Sbjct: 100 YFRNWKFDYHSDL--KPRICITTSTSAGLEQTLPWIYFHKVIGVSTFYLFVEGKAASPNV 157
Query: 157 SRVLESIPGVKVIYRTRELEEQQAKSRIWNETWLASFFYKPCNYELFVKQSLNMEMAIVM 216
SRVLE+IPGVKVIYRT+ELEE+QAKSRIWNETWL+SFFYKPCNYELFVKQSLNMEMAI M
Sbjct: 158 SRVLETIPGVKVIYRTKELEEKQAKSRIWNETWLSSFFYKPCNYELFVKQSLNMEMAITM 217
Query: 217 ARDSGMDWILHLDTDELIHPAGTQEYSLRQLLSDVPGDVDMVIFPNY 263
A+D+GM+WI+HLDTDELIHP+GT EYSLR+LL ++ DVD+VIFPNY
Sbjct: 218 AQDAGMEWIIHLDTDELIHPSGTHEYSLRKLLGNISADVDVVIFPNY 264
>AT2G41451.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT3G57200.1); Has 30201 Blast hits
to 17322 proteins in 780 species: Archae - 12; Bacteria
- 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037;
Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink). | chr2:17282957-17285264 FORWARD LENGTH=451
Length = 451
Score = 325 bits (832), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 156/227 (68%), Positives = 178/227 (78%), Gaps = 11/227 (4%)
Query: 37 QWRGGVTDDPLTRWSPDHHQFPGMSLSNXXXXXXXXXXXXHHSHPDCASLIANSHSPSFP 96
QWR GV +D +T+W D++ FPGM+ + S C SL+ S + +FP
Sbjct: 30 QWRSGV-NDSVTQWFDDNYPFPGMATVSEKRSL--------RSDSSCVSLLGQSRTQAFP 80
Query: 97 YFRDWKPDYSSDLVLSPKICITTSTSAGLEQTLPWIFYHKVMGVSSFFLFVEGKAASPNV 156
Y RD K D+ DL P+ICITTSTSAGLEQTLPWIFYHKV+GV +F+LFVEG AASPNV
Sbjct: 81 YLRDLKLDHKPDL--KPRICITTSTSAGLEQTLPWIFYHKVIGVETFYLFVEGTAASPNV 138
Query: 157 SRVLESIPGVKVIYRTRELEEQQAKSRIWNETWLASFFYKPCNYELFVKQSLNMEMAIVM 216
SRVLE+IPGV VIYRTRELEE+QAKSRIWNETWL FFYKPCNYELFVKQ+LNMEMAI M
Sbjct: 139 SRVLETIPGVNVIYRTRELEEEQAKSRIWNETWLEKFFYKPCNYELFVKQNLNMEMAITM 198
Query: 217 ARDSGMDWILHLDTDELIHPAGTQEYSLRQLLSDVPGDVDMVIFPNY 263
ARD+GMDWILHLDTDEL+HP+GT+EYSLR LL DVP DVD VIF NY
Sbjct: 199 ARDAGMDWILHLDTDELVHPSGTREYSLRNLLRDVPADVDEVIFTNY 245
>AT3G08550.1 | Symbols: ELD1, ABI8, KOB1 | elongation defective 1
protein / ELD1 protein | chr3:2596513-2599515 FORWARD
LENGTH=533
Length = 533
Score = 323 bits (829), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 163/238 (68%), Positives = 185/238 (77%), Gaps = 20/238 (8%)
Query: 37 QWRGGVTDDPLT---RWSP--------DHHQFPGMSLSNXXXXXXXXXXXXHHSHPDCAS 85
QWRGG DP + R S +H FPGM H S DC++
Sbjct: 50 QWRGGGLADPASASVRSSTSVPGGSDLNHEVFPGME------TVSSVSPKSHQSSSDCSN 103
Query: 86 LIANSHSPSFPYFRDWKPDYSSDLVLSPKICITTSTSAGLEQTLPWIFYHKVMGVSSFFL 145
L A S SPSFPY+ DWK + D L PKICITTSTSAGL+Q LPW+FYHKV+GVS+FFL
Sbjct: 104 L-ARSSSPSFPYYADWK--FGVDTSLKPKICITTSTSAGLDQILPWMFYHKVLGVSTFFL 160
Query: 146 FVEGKAASPNVSRVLESIPGVKVIYRTRELEEQQAKSRIWNETWLASFFYKPCNYELFVK 205
FVEGKAA+P++S+VLESIPGVKVIYRT+ELEE+QAKSRIWNETWL+SFFYKPCNYELFVK
Sbjct: 161 FVEGKAATPSISKVLESIPGVKVIYRTKELEEKQAKSRIWNETWLSSFFYKPCNYELFVK 220
Query: 206 QSLNMEMAIVMARDSGMDWILHLDTDELIHPAGTQEYSLRQLLSDVPGDVDMVIFPNY 263
QSLNMEMAIVMARD+GMDWILHLDTDELI+PAG +EYSLR+LL DVP +VDMVIFPNY
Sbjct: 221 QSLNMEMAIVMARDAGMDWILHLDTDELIYPAGAREYSLRRLLLDVPPNVDMVIFPNY 278
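
The "Identities / Positives / Gaps" lines in the alignments above can be read mechanically. The following is a minimal sketch, not part of the BLAST output: the names `STATS`, `parse_stats`, and `floor_percent` are hypothetical helpers introduced here for illustration. It assumes the statistics line has exactly the layout shown in this report, and that the percentages are truncated rather than rounded, which is consistent with the three hits listed here (for example 156/227 is reported as 68%).

```python
import re

# Pattern for the per-alignment statistics line as it appears in this report.
# Assumption: the layout is exactly "Identities = a/b (p%), Positives = ..., Gaps = ...".
STATS = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), "
    r"Positives = (\d+)/(\d+) \((\d+)%\), "
    r"Gaps = (\d+)/(\d+) \((\d+)%\)"
)


def floor_percent(count, total):
    """Integer percentage, truncated (assumed to match how the report prints it)."""
    return count * 100 // total


def parse_stats(line):
    """Return the counts from one statistics line, or None if it does not match."""
    m = STATS.search(line)
    if m is None:
        return None
    ident, length, _, pos, _, _, gaps, _, _ = (int(g) for g in m.groups())
    return {
        "identities": ident,
        "positives": pos,
        "gaps": gaps,
        "alignment_length": length,
    }


# Statistics line copied from the first hit (AT3G57200.1) in this report.
line = "Identities = 164/227 (72%), Positives = 189/227 (83%), Gaps = 8/227 (3%)"
stats = parse_stats(line)
identity_pct = floor_percent(stats["identities"], stats["alignment_length"])
```

Recomputing the percentage from the raw counts, rather than trusting the printed integer, keeps more precision when hits are compared or filtered downstream.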