Miyakogusa Predicted Gene
- Lj3g3v1855430.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v1855430.1 Non Characterized Hit- tr|I1KML0|I1KML0_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.41941 PE,25.65,0.000002,
,CUFF.43205.1
(189 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value
AT4G02550.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     196  7e-51
AT4G02550.3 | Symbols: | unknown protein; BEST Arabidopsis thal...     196  1e-50
AT4G02550.4 | Symbols: | unknown protein; FUNCTIONS IN: molecul...     170  6e-43
AT4G02550.2 | Symbols: | unknown protein; BEST Arabidopsis thal...     170  6e-43
AT4G02210.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...      49  3e-06
AT4G02210.1 | Symbols: | unknown protein; BEST Arabidopsis thal...      49  3e-06
>AT4G02550.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 370 Blast hits to 300 proteins
in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 10; Plants - 354; Viruses - 0; Other Eukaryotes
- 6 (source: NCBI BLink). | chr4:1120622-1121629 REVERSE
LENGTH=307
Length = 307
Score = 196 bits (499), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 103/205 (50%), Positives = 139/205 (67%), Gaps = 17/205 (8%)
Query: 1 MIECDRVELWKRYVAAHPDAKGFCGKQIEMYDQLKIVCGNYQAPGRWAKVKDD--NHLVN 58
MI+C+ ELW+RY+A +PDAK F GKQIEMY++L+ VCG+YQ PG++ KVK + +HL +
Sbjct: 104 MIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVCGDYQTPGKYNKVKKESSHHLND 163
Query: 59 MKNCDDESASFASPVSENTSETDGTESYSEPPEYEQMPNGYQEAPVVHPLRQLPKRPRSS 118
+K +++S SF SE S+TDGTESY+ EY P PLR+ KR R+S
Sbjct: 164 VKQFEEDSVSFPLGSSEEHSDTDGTESYAGASEYMH-EESQDLPPPRDPLRRPSKRSRNS 222
Query: 119 DELQEALMTVASSIRRLADSMERSKCSIDAAELLQA--------------PFEYLNADPI 164
D QEA++ VASSIRRLAD++ +SK I+ ELL+A FEYLN DP+
Sbjct: 223 DPCQEAMLVVASSIRRLADAVVQSKTLINTEELLKAVMEIDELEEAKQMYAFEYLNGDPV 282
Query: 165 KARAFLTYNTRMRKIYMFKQFWWWR 189
KARAF+ YN RMRK+++F+QFWWW+
Sbjct: 283 KARAFMAYNNRMRKMFLFRQFWWWK 307
>AT4G02550.3 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr4:1120622-1121674 REVERSE LENGTH=322
Length = 322
Score = 196 bits (497), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 103/205 (50%), Positives = 139/205 (67%), Gaps = 17/205 (8%)
Query: 1 MIECDRVELWKRYVAAHPDAKGFCGKQIEMYDQLKIVCGNYQAPGRWAKVKDD--NHLVN 58
MI+C+ ELW+RY+A +PDAK F GKQIEMY++L+ VCG+YQ PG++ KVK + +HL +
Sbjct: 119 MIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVCGDYQTPGKYNKVKKESSHHLND 178
Query: 59 MKNCDDESASFASPVSENTSETDGTESYSEPPEYEQMPNGYQEAPVVHPLRQLPKRPRSS 118
+K +++S SF SE S+TDGTESY+ EY P PLR+ KR R+S
Sbjct: 179 VKQFEEDSVSFPLGSSEEHSDTDGTESYAGASEYMH-EESQDLPPPRDPLRRPSKRSRNS 237
Query: 119 DELQEALMTVASSIRRLADSMERSKCSIDAAELLQA--------------PFEYLNADPI 164
D QEA++ VASSIRRLAD++ +SK I+ ELL+A FEYLN DP+
Sbjct: 238 DPCQEAMLVVASSIRRLADAVVQSKTLINTEELLKAVMEIDELEEAKQMYAFEYLNGDPV 297
Query: 165 KARAFLTYNTRMRKIYMFKQFWWWR 189
KARAF+ YN RMRK+++F+QFWWW+
Sbjct: 298 KARAFMAYNNRMRKMFLFRQFWWWK 322
>AT4G02550.4 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 18 plant
structures; EXPRESSED DURING: 7 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G02210.2). | chr4:1120622-1121629 REVERSE
LENGTH=278
Length = 278
Score = 170 bits (430), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 94/203 (46%), Positives = 121/203 (59%), Gaps = 42/203 (20%)
Query: 1 MIECDRVELWKRYVAAHPDAKGFCGKQIEMYDQLKIVCGNYQAPGRWAKVKDDNHLVNMK 60
MI+C+ ELW+RY+A +PDAK F GKQIEMY++L+ VCG+YQ PG
Sbjct: 104 MIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVCGDYQTPGS-------------- 149
Query: 61 NCDDESASFASPVSENTSETDGTESYSEPPEYEQMPNGYQEAPVVHPLRQLPKRPRSSDE 120
SE S+TDGTESY+ EY P PLR+ KR R+SD
Sbjct: 150 -------------SEEHSDTDGTESYAGASEYMH-EESQDLPPPRDPLRRPSKRSRNSDP 195
Query: 121 LQEALMTVASSIRRLADSMERSKCSIDAAELLQA--------------PFEYLNADPIKA 166
QEA++ VASSIRRLAD++ +SK I+ ELL+A FEYLN DP+KA
Sbjct: 196 CQEAMLVVASSIRRLADAVVQSKTLINTEELLKAVMEIDELEEAKQMYAFEYLNGDPVKA 255
Query: 167 RAFLTYNTRMRKIYMFKQFWWWR 189
RAF+ YN RMRK+++F+QFWWW+
Sbjct: 256 RAFMAYNNRMRKMFLFRQFWWWK 278
>AT4G02550.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 350 Blast hits to 284 proteins
in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 13; Plants - 331; Viruses - 0; Other Eukaryotes
- 6 (source: NCBI BLink). | chr4:1120622-1121629 REVERSE
LENGTH=278
Length = 278
Score = 170 bits (430), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 94/203 (46%), Positives = 121/203 (59%), Gaps = 42/203 (20%)
Query: 1 MIECDRVELWKRYVAAHPDAKGFCGKQIEMYDQLKIVCGNYQAPGRWAKVKDDNHLVNMK 60
MI+C+ ELW+RY+A +PDAK F GKQIEMY++L+ VCG+YQ PG
Sbjct: 104 MIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRTVCGDYQTPGS-------------- 149
Query: 61 NCDDESASFASPVSENTSETDGTESYSEPPEYEQMPNGYQEAPVVHPLRQLPKRPRSSDE 120
SE S+TDGTESY+ EY P PLR+ KR R+SD
Sbjct: 150 -------------SEEHSDTDGTESYAGASEYMH-EESQDLPPPRDPLRRPSKRSRNSDP 195
Query: 121 LQEALMTVASSIRRLADSMERSKCSIDAAELLQA--------------PFEYLNADPIKA 166
QEA++ VASSIRRLAD++ +SK I+ ELL+A FEYLN DP+KA
Sbjct: 196 CQEAMLVVASSIRRLADAVVQSKTLINTEELLKAVMEIDELEEAKQMYAFEYLNGDPVKA 255
Query: 167 RAFLTYNTRMRKIYMFKQFWWWR 189
RAF+ YN RMRK+++F+QFWWW+
Sbjct: 256 RAFMAYNNRMRKMFLFRQFWWWK 278
>AT4G02210.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 22 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G24960.2). | chr4:974320-975917 REVERSE
LENGTH=439
Length = 439
Score = 48.5 bits (114), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/183 (21%), Positives = 81/183 (44%), Gaps = 29/183 (15%)
Query: 9 LWKRYVAAHPDAKGFCGKQIEMYDQLKIVCGNYQAPGRWAKVKDDNHLVNMKNCDDES-- 66
+W+ Y+ AH DA+ F + I Y L ++CG+ + ++++ V M D E+
Sbjct: 275 VWQDYIKAHRDARQFMTRPIPYYKDLCVLCGD-------SGIEENECFVAMDWFDPETEF 327
Query: 67 ----ASFASPVSENTSETDGTESYSEPPEYEQMPNGYQEAPVVHPLRQLPKRPRSSDELQ 122
+S + +S + E D +P +P+ PK+PR +
Sbjct: 328 QEFKSSGTTDLSISAEEEDSNSLLFDPKNKRDQLANTDTSPIN------PKKPRVDETQT 381
Query: 123 EALMTVASSIRRLADSMERSKCSIDAAELLQAPFEYLNADPIKARAFLTYNTRMRKIYMF 182
++ +I+ L D + + +DA +LL+ D +KA+ FL + ++RK ++
Sbjct: 382 MSIEDTVEAIQALPDMDD--ELILDACDLLE--------DKLKAKTFLALDVKLRKKWLL 431
Query: 183 KQF 185
++
Sbjct: 432 RKL 434
>AT4G02210.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G24960.2); Has 791 Blast hits to 465 proteins
in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 17; Plants - 748; Viruses - 0; Other Eukaryotes
- 26 (source: NCBI BLink). | chr4:974320-975917 REVERSE
LENGTH=439
Length = 439
Score = 48.5 bits (114), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/183 (21%), Positives = 81/183 (44%), Gaps = 29/183 (15%)
Query: 9 LWKRYVAAHPDAKGFCGKQIEMYDQLKIVCGNYQAPGRWAKVKDDNHLVNMKNCDDES-- 66
+W+ Y+ AH DA+ F + I Y L ++CG+ + ++++ V M D E+
Sbjct: 275 VWQDYIKAHRDARQFMTRPIPYYKDLCVLCGD-------SGIEENECFVAMDWFDPETEF 327
Query: 67 ----ASFASPVSENTSETDGTESYSEPPEYEQMPNGYQEAPVVHPLRQLPKRPRSSDELQ 122
+S + +S + E D +P +P+ PK+PR +
Sbjct: 328 QEFKSSGTTDLSISAEEEDSNSLLFDPKNKRDQLANTDTSPIN------PKKPRVDETQT 381
Query: 123 EALMTVASSIRRLADSMERSKCSIDAAELLQAPFEYLNADPIKARAFLTYNTRMRKIYMF 182
++ +I+ L D + + +DA +LL+ D +KA+ FL + ++RK ++
Sbjct: 382 MSIEDTVEAIQALPDMDD--ELILDACDLLE--------DKLKAKTFLALDVKLRKKWLL 431
Query: 183 KQF 185
++
Sbjct: 432 RKL 434