Miyakogusa Predicted Gene
- Lj0g3v0309019.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0309019.1 CUFF.20878.1
(525 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G51150.1 | Symbols: | Mitochondrial import inner membrane tr... 708 0.0
AT1G34630.1 | Symbols: | BEST Arabidopsis thaliana protein matc... 119 5e-27
AT1G34630.2 | Symbols: | FUNCTIONS IN: molecular_function unkno... 95 1e-19
>AT5G51150.1 | Symbols: | Mitochondrial import inner membrane
translocase subunit Tim17/Tim22/Tim23 family protein |
chr5:20789034-20791495 FORWARD LENGTH=531
Length = 531
Score = 708 bits (1827), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/470 (73%), Positives = 388/470 (82%), Gaps = 6/470 (1%)
Query: 61 CDHGPDESCVAHAVGNLCQTFLLSYGVRVGIGILLRAFKLARGQSYSSLLDLKQLVSEKD 120
CDH D SCVA+A+GNLCQ+FLLSYGVRVGIGILLRAFKLARGQSYSSLLDLKQLVSEKD
Sbjct: 63 CDHA-DVSCVANAIGNLCQSFLLSYGVRVGIGILLRAFKLARGQSYSSLLDLKQLVSEKD 121
Query: 121 LIVREEACRIGLLFGGFTGSYHALRCLLRKWRKKETPLNAILAGSVAGFSILALNDSNXX 180
LIVREEACRIGLLFGGFTGSYHALRC LRKWRKKETPLN++LAGSVAG SILAL+DSN
Sbjct: 122 LIVREEACRIGLLFGGFTGSYHALRCCLRKWRKKETPLNSVLAGSVAGLSILALDDSNQR 181
Query: 181 XXXXXXXXXXXXQSAYNSAKSKNKFHFWGSHWRHGDSLLFALACAQVMYAFVMRPESLPK 240
Q+AYNSAKSKNKFH WGSHWRHGDSLLF+LACAQVMY+F+MRPE+LPK
Sbjct: 182 RTLALYLLARLGQAAYNSAKSKNKFHLWGSHWRHGDSLLFSLACAQVMYSFIMRPETLPK 241
Query: 241 SYQDFIQKTGPVAAPVYKAVRDSCRGHPVDVASLHTYLSHRGRSEYVKLEEFPSIIPCSI 300
SY++FIQKTGPVA PVY+AVR+ CRG P+DVASL Y+S + + VK+EEF SIIPC+
Sbjct: 242 SYREFIQKTGPVARPVYQAVRECCRGGPIDVASLSAYISSKNEASDVKVEEFASIIPCAA 301
Query: 301 IHPGTKSCLAHEVNATSATFKKTFPLYFSLTFVPFVVLRQQKFTDAPFRTLWFAIKGSVR 360
IHP T SCLA NA SATFKKTFPLYFSLTFVP+VVL QKF +P+RT W AI+ SVR
Sbjct: 302 IHPNTNSCLAQNANAMSATFKKTFPLYFSLTFVPYVVLHLQKFMASPYRTSWLAIRDSVR 361
Query: 361 STAFLSAFVGIFQGVICLHRKLASRDHKFVYWIAGGISALSVLLEKKARRSELALYVLPR 420
ST+FLSAFVGIFQ IC HRK+A++DHK VYW AGG +ALSV+LEKK RRSELALYVLPR
Sbjct: 362 STSFLSAFVGIFQAFICAHRKVATKDHKLVYWFAGGAAALSVMLEKKPRRSELALYVLPR 421
Query: 421 AVDSLWYILVNRHLLPIVKNAEVFLFSLCMGGIMYYLEYEPETMAPFLRSLIRRFLASRI 480
A DSLW ILVNRHLLP +KNAEV LF CMGGIMYYLEYEP+TMAPFLR LIRRFLAS+I
Sbjct: 422 AGDSLWEILVNRHLLPDIKNAEVALFCGCMGGIMYYLEYEPDTMAPFLRGLIRRFLASQI 481
Query: 481 XXXXXXXXXTA--SYLQALDGTTKPKLPR-RDSES--SSEQYNLESIPGL 525
++ SYLQ LD KPK R+ E+ + E+YNLE+IPGL
Sbjct: 482 SNPSSKYPHSSSYSYLQTLDALKKPKTQESREGETPKAEEKYNLEAIPGL 531
>AT1G34630.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: Mitochondrial import inner membrane translocase
subunit Tim17/Tim22/Tim23 family protein
(TAIR:AT5G51150.1); Has 323 Blast hits to 315 proteins
in 124 species: Archae - 0; Bacteria - 0; Metazoa - 95;
Fungi - 110; Plants - 73; Viruses - 0; Other Eukaryotes
- 45 (source: NCBI BLink). | chr1:12685317-12687435
FORWARD LENGTH=481
Length = 481
Score = 119 bits (298), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 92/350 (26%), Positives = 168/350 (48%), Gaps = 24/350 (6%)
Query: 125 EEACRIGLLFGGFTGSY----HALRCLLRKWRKKETPLNAILAGSVAGFSILALNDSNXX 180
+E R GL G F G++ A+ L K+ A+ AG VAG S+L L N
Sbjct: 112 KETLRYGLFLGTFAGTFVSVDEAIAALAGD--KRTAKWRALFAGLVAGPSML-LTGPNTQ 168
Query: 181 XXXXXXXXXXXXQSAYNSAKSKNKFHFWGS-----HWRHGDSLLFALACAQVMYAFVMRP 235
++A +++ K +G+ W+HGD L L+ +Q++ A++++
Sbjct: 169 HTSLAVYILM--RAAVLASRCGIKSKRFGTICKPLTWKHGDLFLMCLSSSQILSAYILKQ 226
Query: 236 ESLPKSYQDFIQKTGPVAAPVYKAVRDSCRGHP-VDVASLHTYLSHRGRSEYVKLEEFPS 294
ESLP SY+ F+ K G + + V+D P ++ ++ Y G V ++ P+
Sbjct: 227 ESLPSSYKSFLNKQGGKDLSILQGVKDIATAQPFTNLRAIEKYYKSVG----VDIKLDPT 282
Query: 295 I-IPCSIIHPGTKSCLAHEVNATSATFKKTFPLYFSLTFVPFVVLRQQKFTDAPFRTLWF 353
+ +PC+IIH G +SC+ H V +K+ P+Y + +P +++ +Q + L
Sbjct: 283 MKVPCTIIH-GNESCVKHGVTFFLQAYKRALPVYVPVYLIPALIVHRQDLLKKQYSILGK 341
Query: 354 AIKGSVRSTAFLSAFVGIFQGVICLHRKLASRDHKFVYWIAGGISALSVLLEKKARRSEL 413
+ G+ RS+ FL+ + CL + + + IA + L++ +EKK+RR E+
Sbjct: 342 GLLGTARSSLFLATYCSSAWAWTCLLFRTFETCNIPLVAIATFPTGLALAIEKKSRRIEI 401
Query: 414 ALYVLPRAVDSLWYILVNRHLL---PIVKNAEVFLFSLCMGGIMYYLEYE 460
+LY L RA++S + + + ++ A+V +FS+ IM+ E
Sbjct: 402 SLYCLARAIESFFTCMTEAGYIRPPKSLRRADVVVFSVSTAIIMHCYAQE 451
>AT1G34630.2 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 25 plant
structures; EXPRESSED DURING: 15 growth stages; BEST
Arabidopsis thaliana protein match is: Mitochondrial
import inner membrane translocase subunit
Tim17/Tim22/Tim23 family protein (TAIR:AT5G51150.1); Has
30201 Blast hits to 17322 proteins in 780 species:
Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
2996 (source: NCBI BLink). | chr1:12685317-12687107
FORWARD LENGTH=395
Length = 395
Score = 94.7 bits (234), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 126/270 (46%), Gaps = 31/270 (11%)
Query: 125 EEACRIGLLFGGFTGSY----HALRCL-----LRKWRKKETPLNAILAGSVAGFSILALN 175
+E R GL G F G++ A+ L KWR A+ AG VAG S+L L
Sbjct: 112 KETLRYGLFLGTFAGTFVSVDEAIAALAGDKRTAKWR-------ALFAGLVAGPSML-LT 163
Query: 176 DSNXXXXXXXXXXXXXXQSAYNSAKSKNKFHFWGS-----HWRHGDSLLFALACAQVMYA 230
N ++A +++ K +G+ W+HGD L L+ +Q++ A
Sbjct: 164 GPNTQHTSLAVYILM--RAAVLASRCGIKSKRFGTICKPLTWKHGDLFLMCLSSSQILSA 221
Query: 231 FVMRPESLPKSYQDFIQKTGPVAAPVYKAVRDSCRGHP-VDVASLHTYLSHRGRSEYVKL 289
++++ ESLP SY+ F+ K G + + V+D P ++ ++ Y G V +
Sbjct: 222 YILKQESLPSSYKSFLNKQGGKDLSILQGVKDIATAQPFTNLRAIEKYYKSVG----VDI 277
Query: 290 EEFPSI-IPCSIIHPGTKSCLAHEVNATSATFKKTFPLYFSLTFVPFVVLRQQKFTDAPF 348
+ P++ +PC+IIH G +SC+ H V +K+ P+Y + +P +++ +Q +
Sbjct: 278 KLDPTMKVPCTIIH-GNESCVKHGVTFFLQAYKRALPVYVPVYLIPALIVHRQDLLKKQY 336
Query: 349 RTLWFAIKGSVRSTAFLSAFVGIFQGVICL 378
L + G+ RS+ FL+ + CL
Sbjct: 337 SILGKGLLGTARSSLFLATYCSSAWAWTCL 366