Miyakogusa Predicted Gene
- chr5.CM0345.750.nc
BLASTP 2.2.18 [Mar-02-2008]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= chr5.CM0345.750.nc + phase: 0
(1200 letters)
Database: TAIR8_pep
32,825 sequences; 13,166,001 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G11040.1 | Symbols: | similar to hypothetical protein OsI_01... 1810 0.0
>AT5G11040.1 | Symbols: | similar to hypothetical protein OsI_015984
[Oryza sativa (indica cultivar-group)] (GB:EAY94751.1);
similar to H0311C03.11 [Oryza sativa (indica
cultivar-group)] (GB:CAH67557.1); similar to unnamed
protein product [Vitis vinifera] (GB:CAO44298.1);
contains InterPro domain Transport protein Trs120
(InterPro:IPR013935) | chr5:3495333-3500611 FORWARD
Length = 1186
Score = 1810 bits (4689), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 891/1204 (74%), Positives = 1015/1204 (84%), Gaps = 23/1204 (1%)
Query: 1 MEPEVSIEGSSVIQVAVVPIGTVPPNMLRDYYSMLLPLHSIPLSAISSFYTEHQKSPFAN 60
MEP+VSIE S+I++AV+PIGT+PP +LRDY+SMLL H+I LSAISSFYTEHQKSPF N
Sbjct: 1 MEPDVSIETLSIIRIAVLPIGTIPPTLLRDYHSMLLRHHTIALSAISSFYTEHQKSPFTN 60
Query: 61 QPWDTGSLSFKFVLGGAPPSPWEDFQSYRKTLAVVGIVHCPSSPDLDAVVDQFASACKSY 120
QPWD+GSL FKFVLGG+PPSPWEDFQS RK LAV+G+ HCPSSPDLD+V ++F ACKSY
Sbjct: 61 QPWDSGSLRFKFVLGGSPPSPWEDFQSNRKMLAVIGLCHCPSSPDLDSVTEKFNVACKSY 120
Query: 121 PSSLVDRCFAFCPNDSQLDDGSKREGNLRLFPPADRPTLEFHLNTMMQEVAASLLMEFEK 180
S+LV RCFAF P DSQL+DG K+ NL LFPP+D+ T EFHL TMMQ++AASLLMEFEK
Sbjct: 121 SSALVRRCFAFSPGDSQLEDGDKKGENLILFPPSDKQTQEFHLQTMMQDIAASLLMEFEK 180
Query: 181 WVLQAESSGTILKTPLDSQASLSSEEVIKAKKRRLGRAQKTIGDYCLLAGSPVDANAHYS 240
WVLQAES+GTILKTPLDSQASL+SEEVIKAKKRRLGRAQKTIGDY LLAGSPVDANAHYS
Sbjct: 181 WVLQAESAGTILKTPLDSQASLNSEEVIKAKKRRLGRAQKTIGDYSLLAGSPVDANAHYS 240
Query: 241 TALELARLTGDYFWYAGALEGSVCALLIDRMGQKDSILEDEVRYRYNSVILNYKKS--QD 298
TALELARLTGDYFWYAGALEGSVCALL+DRMGQ+D LEDEVRYRY +VIL+Y+KS Q+
Sbjct: 241 TALELARLTGDYFWYAGALEGSVCALLVDRMGQRDVALEDEVRYRYTNVILHYRKSFIQE 300
Query: 299 NAQRVSPITFELEATLKLARFLCRRELAKEVVELLTTAADGAKCLIDASDRLVLYIEIAR 358
AQRVSP++FELEATLKLARFLCRRELAKEVVELLT AADGAK LIDASDRL+LY+E+AR
Sbjct: 301 IAQRVSPLSFELEATLKLARFLCRRELAKEVVELLTNAADGAKSLIDASDRLILYVEVAR 360
Query: 359 LYGSLGYQRKAAFFSRQVAQLYLQQDNRLAAISAMQVLAMTTKAYHVQSRSSISDHSMHN 418
L+G+LGYQRKAAFF RQVAQLYLQQDNRLAAISAMQVL+MTT AY +QSR+S+S S++N
Sbjct: 361 LFGALGYQRKAAFFCRQVAQLYLQQDNRLAAISAMQVLSMTTDAYRIQSRASMSKVSVNN 420
Query: 419 KGIGSNNADSGKMYHQSVVSLFESQWSTLQMVVLREILLSAVRAGDPLTAWSAAARLLRS 478
+ D+GKM+H S+VSLFES WSTLQMVVLREILLSAVRAGDPL AWSAAARLLR
Sbjct: 421 ET--GRLPDAGKMHHHSIVSLFESHWSTLQMVVLREILLSAVRAGDPLAAWSAAARLLRW 478
Query: 479 YYPLIPPAGQHGLANALLNSAERLPPGTRCADPALPFIRLHSFPLHPIQMDIVKRNPARE 538
+YPLI P+GQ+GLAN+L NSA+RLP GTRCADPALPF+RL SFPLH Q+DIVKRNPARE
Sbjct: 479 HYPLITPSGQNGLANSLANSADRLPSGTRCADPALPFVRLFSFPLHSSQVDIVKRNPARE 538
Query: 539 DWWAGSAPSGPFIYTPFSKGEPNNMKKQELIWVVGEPVQVLVELANPCGFDLRVDSIYLS 598
DWW GSAPSGPFIYTPFSKG+ N KQELIWVVGEPVQVLVELANPC FDLR+DSIYLS
Sbjct: 539 DWWTGSAPSGPFIYTPFSKGDANESSKQELIWVVGEPVQVLVELANPCCFDLRIDSIYLS 598
Query: 599 VHSGNFDAFPVSVSLLPNSSKVITLSGIPTSVGTVTIPGCIVHCFGVITEHLFREVDNLL 658
HS NFDAFPVSV + PNS+KVITLSGIPT+VG VTIPGC VHCFGVITEH+FR+VDNLL
Sbjct: 599 AHSSNFDAFPVSVDIPPNSAKVITLSGIPTAVGPVTIPGCTVHCFGVITEHVFRDVDNLL 658
Query: 659 LGAAQGLVLSDPFRCCGSPKLKNVSVPNISVVQPLPLLISHVVGGDGAIILYEGEIRDVW 718
LGAAQGLV SDPFR CGS KL++V VPNISV PLPLL+++VVGGDGAIILYEGEIR+V
Sbjct: 659 LGAAQGLVFSDPFRSCGSAKLRHVFVPNISVAPPLPLLVANVVGGDGAIILYEGEIREVC 718
Query: 719 ISLANAGTVQIEQAHISLSGKNQDSVISYSSETLKSCLPLKPGAEVTFPVTLRAWQVGSV 778
I+ ANAGTV I QAH+SLSGKNQD+VIS + E L+S LPLKPGA+VT PVTL+AW VG
Sbjct: 719 INFANAGTVPIVQAHVSLSGKNQDAVISIADEALQSALPLKPGAQVTLPVTLKAWHVGPT 778
Query: 779 DTDTGV--GRTVSGSNMRHSKDGSDPSLLIHYAGPLKTSEEPLTNGSTVPPGRRLIVPLQ 836
D+D + GR +G+ R KDG+ PSLLIHYAGPL + + S VPPGRRL+VPLQ
Sbjct: 779 DSDNTMSSGRNAAGNTGR-PKDGTSPSLLIHYAGPLSNNGDSQEKESIVPPGRRLVVPLQ 837
Query: 837 ICVLQGLSFVKAQLLSMEFPAHVGENLPKLDDMDNKSPGEHVKSETKMDRLVKIDPFRGS 896
ICVLQGLSFVKA+LLSME PAHV +NL E ++ E+ D LVKI+PFRGS
Sbjct: 838 ICVLQGLSFVKARLLSMEIPAHVSDNLRD----------EDIERESNADSLVKINPFRGS 887
Query: 897 WGLRFLELELSNPTDVAFEINVSVKLENSSNEDNHLADQGATEYGYPKTRIDRDCSARVL 956
WGLRFLELELSNPTDV FEI+V V+LENS+ ED+ Q + EY YPKTRIDRD SARVL
Sbjct: 888 WGLRFLELELSNPTDVVFEISVFVQLENSAKEDDSSPVQDSPEYEYPKTRIDRDYSARVL 947
Query: 957 VPLEHFKLPVLDDSFFMKXXXXXXXXXXXXXSFSEKSTKAELNACIKNLISRIKVQWHSG 1016
+PLEHFKLPVLD SFF K SFSEK+TKAE+N IKNLIS+IKV+W SG
Sbjct: 948 IPLEHFKLPVLDGSFFTKDPPPGSPSSSRNPSFSEKNTKAEINTLIKNLISKIKVRWQSG 1007
Query: 1017 RNSSGELNIRDAIQAALQTSVMDVLLPDPLTFGFRLVRDGFESEKLDSDKESDIIESPAS 1076
RNSSGEL+I+DAIQ ALQT+VMDVLLPDPLTFGFRLVR+G E D E+ ESP S
Sbjct: 1008 RNSSGELDIKDAIQTALQTTVMDVLLPDPLTFGFRLVRNGLE-----KDPETK-AESPFS 1061
Query: 1077 KDSVLAHEMTPMEVVVRNNTKDMIKMSLNITCRDVAGENCVDGIKATVLWTGVLSDITME 1136
K SVL+HE+TPMEV+VRNNT + IK++L++TCRDVAG+NC +G ATVLW G LS I+ME
Sbjct: 1062 KGSVLSHEVTPMEVLVRNNTSEAIKLNLSVTCRDVAGQNCTEGADATVLWAGALSGISME 1121
Query: 1137 IPPLQQIKHSFCLHFFVPGEYTLLAAAVIEDASDILRARAKTTSAAEPIFCRGPPYHVRV 1196
+ PLQ+ +H F L F VPGEYT++AAAVIEDA+++LRARA T S EPIFCRGPP+HV V
Sbjct: 1122 VAPLQEARHCFSLFFLVPGEYTMVAAAVIEDANNVLRARAGTASPNEPIFCRGPPFHVCV 1181
Query: 1197 VGTA 1200
G A
Sbjct: 1182 AGGA 1185