Miyakogusa Predicted Gene
- Lj3g3v2666510.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v2666510.1 Non Chatacterized Hit- tr|K4BMX8|K4BMX8_SOLLC
Uncharacterized protein OS=Solanum lycopersicum GN=Sol,42.37,7e-19,
,CUFF.44362.1
(125 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G13898.1 | Symbols: | unknown protein; LOCATED IN: endomembr... 75 1e-14
AT2G30370.1 | Symbols: CHAL, EPFL6 | allergen-related | chr2:129... 53 4e-08
AT5G10310.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 52 1e-07
AT2G30370.2 | Symbols: CHAL, EPFL6 | allergen-related | chr2:129... 52 1e-07
AT4G37810.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 49 1e-06
>AT3G13898.1 | Symbols: | unknown protein; LOCATED IN: endomembrane
system; BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT5G10310.1). |
chr3:4584011-4584334 FORWARD LENGTH=107
Length = 107
Score = 75.1 bits (183), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 37/72 (51%), Positives = 47/72 (65%), Gaps = 3/72 (4%)
Query: 51 SKKGLMGMSEKREEAYDRGMSKIGSSPPGCEHKCYGCVPCEAIQVPSPSS--HLGIQYAN 108
S++ ++ +E +EE R +IGS PP CE KCYGC PCEAIQ P+ SS HL YAN
Sbjct: 36 SRRRILNPNENKEEIVKR-RRRIGSKPPSCEKKCYGCEPCEAIQFPTISSIPHLSPHYAN 94
Query: 109 YELESWKCKCGP 120
Y+ E W+C C P
Sbjct: 95 YQPEGWRCHCPP 106
>AT2G30370.1 | Symbols: CHAL, EPFL6 | allergen-related |
chr2:12940577-12942167 REVERSE LENGTH=230
Length = 230
Score = 53.1 bits (126), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 41/87 (47%), Gaps = 3/87 (3%)
Query: 39 EALESSKPYTIESKKGLMGMSEKREEAYDRGMSKIGSSPPGCEHKCYGCVPCEAIQVPSP 98
+ +E K + K +G +E R + +GSSPP C KC C PC+ + VP P
Sbjct: 147 DRVEEGKSTVVIKKTRKIG-DRSKEAELRRILRGLGSSPPRCSSKCGRCTPCKPVHVPVP 205
Query: 99 SSHLGIQYANYELESWKCKCGPSFYSP 125
A Y E+W+CKCG Y P
Sbjct: 206 PGT--PVTAEYYPEAWRCKCGNKLYMP 230
>AT5G10310.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 17 plant structures; EXPRESSED
DURING: 10 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT3G13898.1);
Has 1807 Blast hits to 1807 proteins in 277 species:
Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347;
Plants - 385; Viruses - 0; Other Eukaryotes - 339
(source: NCBI BLink). | chr5:3241666-3242127 REVERSE
LENGTH=122
Length = 122
Score = 52.0 bits (123), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 41/85 (48%), Gaps = 22/85 (25%)
Query: 63 EEAYDRGMSKIGSSPPGCEHKCYGCVPCEAIQVPS--------------------PSSHL 102
+ A +++GS+PP C ++C C PC AIQVP+ PSS
Sbjct: 38 QVALIEDKARLGSTPPSCHNRCNNCHPCMAIQVPTLPTRSRFTRVNPFSGGFVRPPSSLT 97
Query: 103 GI--QYANYELESWKCKCGPSFYSP 125
+ QY+NY+ WKC C FY+P
Sbjct: 98 TVLDQYSNYKPMGWKCHCNGHFYNP 122
>AT2G30370.2 | Symbols: CHAL, EPFL6 | allergen-related |
chr2:12940577-12942167 REVERSE LENGTH=156
Length = 156
Score = 52.0 bits (123), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 37/78 (47%), Gaps = 2/78 (2%)
Query: 48 TIESKKGLMGMSEKREEAYDRGMSKIGSSPPGCEHKCYGCVPCEAIQVPSPSSHLGIQYA 107
T+ KK +E R + +GSSPP C KC C PC+ + VP P A
Sbjct: 81 TVVIKKTRKIGDRSKEAELRRILRGLGSSPPRCSSKCGRCTPCKPVHVPVPPGTP--VTA 138
Query: 108 NYELESWKCKCGPSFYSP 125
Y E+W+CKCG Y P
Sbjct: 139 EYYPEAWRCKCGNKLYMP 156
>AT4G37810.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT5G10310.1); Has 149 Blast hits
to 149 proteins in 15 species: Archae - 0; Bacteria - 0;
Metazoa - 0; Fungi - 0; Plants - 149; Viruses - 0; Other
Eukaryotes - 0 (source: NCBI BLink). |
chr4:17780970-17781544 FORWARD LENGTH=128
Length = 128
Score = 48.5 bits (114), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 39/81 (48%), Gaps = 25/81 (30%)
Query: 68 RGMSKIGSSPPGCEH-KCYGCVPCEAIQVPS-PSSHL---------------------GI 104
RG+ IGS PP CE +C C CEAIQVP+ P + L G
Sbjct: 50 RGL--IGSRPPRCERVRCRSCGHCEAIQVPTNPQTKLHSPLTTSSSSSSETIHLDYTRGD 107
Query: 105 QYANYELESWKCKCGPSFYSP 125
NY+ SWKCKCG S Y+P
Sbjct: 108 DSTNYKPMSWKCKCGNSIYNP 128