Miyakogusa Predicted Gene
- Lj2g3v1803090.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj2g3v1803090.1 Non Chatacterized Hit- tr|I1J8Y5|I1J8Y5_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.20348
PE,90.36,0,SUBFAMILY NOT NAMED,NULL; MAJOR FACILITATOR SUPERFAMILY
DOMAIN-CONTAINING PROTEIN-RELATED,NULL; seg,,CUFF.37889.1
(467 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT4G36790.1 | Symbols: | Major facilitator superfamily protein ... 590 e-169
AT2G18590.1 | Symbols: | Major facilitator superfamily protein ... 459 e-129
AT5G10190.1 | Symbols: | Major facilitator superfamily protein ... 313 2e-85
AT1G78130.1 | Symbols: UNE2 | Major facilitator superfamily prot... 287 1e-77
>AT4G36790.1 | Symbols: | Major facilitator superfamily protein |
chr4:17336360-17338304 FORWARD LENGTH=489
Length = 489
Score = 590 bits (1520), Expect = e-169, Method: Compositional matrix adjust.
Identities = 299/469 (63%), Positives = 349/469 (74%), Gaps = 14/469 (2%)
Query: 1 MKTKQIFGVXXXXXXXXXXXXMERADENLLPSVYKEVSEAFNAGPSDLGYLTFIRNFVQG 60
+KT GV MERADENLLPSVYKEVSEAFNAGPSDLGYLTF+RNFVQG
Sbjct: 33 IKTGTFLGVSISLILINLAAIMERADENLLPSVYKEVSEAFNAGPSDLGYLTFVRNFVQG 92
Query: 61 LSSPLAGILVINYDRPTILAMGTFCWALSTAAVGVCRDFMQVAFWRAINGFGLAIVIPAL 120
L+SPLAG+LVI YDRP +LA+GTFCWALSTAAVG F+QVA WRA+NGFGLAIVIPAL
Sbjct: 93 LASPLAGVLVITYDRPIVLAIGTFCWALSTAAVGASSYFIQVALWRAVNGFGLAIVIPAL 152
Query: 121 QSFIADSYMDGVRXXXXXXXXXXXXXXXXXXXXXXXXXXXQQFWGIQGWRCAFILMATLS 180
QSFIADSY DG R +FWGI GWRCAFI+MA LS
Sbjct: 153 QSFIADSYKDGARGAGFGMLNLIGTIGGIGGGVVATVMAGSEFWGIPGWRCAFIMMAALS 212
Query: 181 ALIGTLVLLYVVEPKKRFTTNVDASQSSDREDPI-YKGNASVASIWMTSWAAMKAVIKVK 239
A+IG LV L+VV+P+K +RE+ + +K N++ S+W S AA K+V+KV
Sbjct: 213 AVIGLLVFLFVVDPRKNI----------EREELMAHKMNSN--SVWNDSLAAAKSVVKVS 260
Query: 240 TFQIIVLQGIIGSLPWTAMVFFTMWFELIGFDNNTSATLLSLFAIGCAMGSFFGGSIADQ 299
TFQIIV QGIIGS PWTAMVFFTMWFELIGFD+N +A LL +FA G A+G+ GG IAD+
Sbjct: 261 TFQIIVAQGIIGSFPWTAMVFFTMWFELIGFDHNQTAALLGVFATGGAIGTLMGGIIADK 320
Query: 300 LSRVYPHSGRIMCAQFSAFMGIPFSWFLLRVIPQSVSSFLTFSVTLFLMGLTISWNATAA 359
+SR+YP+SGR+MCAQFSAFMGIPFS LL+VIPQS SS+ FS+TLFLMGLTI+W +A
Sbjct: 321 MSRIYPNSGRVMCAQFSAFMGIPFSIILLKVIPQSTSSYSIFSITLFLMGLTITWCGSAV 380
Query: 360 NGPMFAEVVPVKHRTMIYAFDRAFEGSFSSVAAPLVGILSEKIFGYNAKSVDPIKGSS-P 418
N PMFAEVVP +HRTMIYAFDRAFEGSFSS AAPLVGILSEK+FGY+++ +DP+KGSS
Sbjct: 381 NAPMFAEVVPPRHRTMIYAFDRAFEGSFSSFAAPLVGILSEKLFGYDSRGIDPLKGSSVR 440
Query: 419 EALALSKGLLSMMVIPFGXXXXXXXXXXXXFKKDRENARIVALKEEEMM 467
EA ALSKGLLSMM +PFG F+KDRENA+I + KE EM+
Sbjct: 441 EADALSKGLLSMMAVPFGLCCLCYTPLHFVFQKDRENAKIASSKETEMI 489
>AT2G18590.1 | Symbols: | Major facilitator superfamily protein |
chr2:8069988-8072866 FORWARD LENGTH=473
Length = 473
Score = 459 bits (1182), Expect = e-129, Method: Compositional matrix adjust.
Identities = 234/449 (52%), Positives = 298/449 (66%), Gaps = 4/449 (0%)
Query: 22 MERADENLLPSVYKEVSEAFNAGPSDLGYLTFIRNFVQGLSSPLAGILVINYDRPTILAM 81
M+RADE L+PS KE+ EAF+A SD+G L+FIRN VQGL+SPLAG+ I+YDRPT+ A
Sbjct: 24 MQRADEKLIPSTAKELKEAFHAKLSDIGLLSFIRNIVQGLASPLAGLFAISYDRPTVFAF 83
Query: 82 GTFCWALSTAAVGVCRDFMQVAFWRAINGFGLAIVIPALQSFIADSYMDGVRXXXXXXXX 141
G+F W ST A GV R F+QV A NG G AIV P LQS IADS+ + R
Sbjct: 84 GSFFWVSSTVATGVSRYFIQVTLGVAFNGVGHAIVYPVLQSIIADSFKESSRGFGFGLWN 143
Query: 142 XXXXXXXXXXXXXXXXXXXQQFWGIQGWRCAFILMATLSALIGTLVLLYVVEPKKRFTTN 201
F+GI GWRCAFIL ATLS ++G LV +V +P+++ T++
Sbjct: 144 LIGTVGGIGGTVVPTVMAGHDFFGISGWRCAFILSATLSTIVGILVFFFVSDPREKKTSS 203
Query: 202 VDASQSSDREDPIYKGNASV----ASIWMTSWAAMKAVIKVKTFQIIVLQGIIGSLPWTA 257
V E G + +S+W SW A+K V K++TFQIIVLQGI+GS+PW A
Sbjct: 204 VIVHHDDQHERDENNGGTMMESPSSSVWKESWVAIKDVTKLRTFQIIVLQGIVGSVPWNA 263
Query: 258 MVFFTMWFELIGFDNNTSATLLSLFAIGCAMGSFFGGSIADQLSRVYPHSGRIMCAQFSA 317
M+F+TMWFELIGFD+N +A L +FA G A+GS GG IAD++SRV+P+SGR++CAQFS
Sbjct: 264 MLFWTMWFELIGFDHNQAALLNGIFATGQAIGSLVGGIIADKMSRVFPNSGRLICAQFSV 323
Query: 318 FMGIPFSWFLLRVIPQSVSSFLTFSVTLFLMGLTISWNATAANGPMFAEVVPVKHRTMIY 377
FMG FS LLR+IPQSV+SF F VTLFLMGLTI+W A N P+ AE+VP KHRTM+Y
Sbjct: 324 FMGAMFSIVLLRMIPQSVNSFYIFLVTLFLMGLTITWCGPAINSPILAEIVPAKHRTMVY 383
Query: 378 AFDRAFEGSFSSVAAPLVGILSEKIFGYNAKSVDPIKGSSPEALALSKGLLSMMVIPFGX 437
AFDRA E +FSS APLVGI+SEK+FG++AK +D + S EA AL KG++ MM +PFG
Sbjct: 384 AFDRALEVTFSSFGAPLVGIMSEKLFGFDAKGIDHVNDSGREAEALGKGIMWMMALPFGL 443
Query: 438 XXXXXXXXXXXFKKDRENARIVALKEEEM 466
F+KDR+ R + +E EM
Sbjct: 444 CCLCYTPLHFLFRKDRKIDRTTSSREVEM 472
>AT5G10190.1 | Symbols: | Major facilitator superfamily protein |
chr5:3199205-3201140 FORWARD LENGTH=488
Length = 488
Score = 313 bits (801), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 176/454 (38%), Positives = 249/454 (54%), Gaps = 24/454 (5%)
Query: 22 MERADENLLPSVYKEVSEAFNAGPSDLGYLTFIRNFVQGLSSPLAGILVINYDRPTILAM 81
MERADE+LLP VYKEV +A + P+ LG LT R+ VQ PLA L ++R ++A+
Sbjct: 17 MERADESLLPGVYKEVGDALHVDPTALGTLTLFRSIVQSSCYPLAAYLSSRHNRAHVIAL 76
Query: 82 GTFCWALSTAAVGVCRDFMQVAFWRAINGFGLAIVIPALQSFIADSYMDGVRXXXXXXXX 141
G F WA +T V V F QVA R +NG GLAIV PA+QS +ADS D R
Sbjct: 77 GAFLWATATFLVAVSTTFFQVAVSRGLNGIGLAIVTPAIQSLVADSTDDYNRGMAFGWLG 136
Query: 142 XXXXXXXXXXXXXXXXXXXQQFWGIQGWRCAFILMATLSALIGTLVLLYVVEPK---KRF 198
+ F G+ GWR AF+L+A +S ++G LV L+ +P ++
Sbjct: 137 FTSNIGSILGYVCSILFASKSFNGVAGWRIAFLLVAVVSVIVGILVRLFATDPHYSDRKI 196
Query: 199 TTNV-DASQSSDREDPIYKGNASVASIWMTSWAAMKAVIKVKTFQIIVLQGIIGSLPWTA 257
T +V D SD D + + K VIK+ +FQI V QG+ GS PW+A
Sbjct: 197 TKHVKDKPFWSDIRDLLKEA---------------KMVIKIPSFQIFVAQGVSGSFPWSA 241
Query: 258 MVFFTMWFELIGFDNNTSATLLSLFAIGCAMGSFFGGSIADQLSRVYPHSGRIMCAQFSA 317
+ F +W ELIGF + T+A L++LF I C++G FGG + D L++ +P+ GRI +Q S+
Sbjct: 242 LAFAPLWLELIGFSHKTTAVLVTLFTISCSLGGLFGGYMGDTLAKKFPNGGRIFLSQVSS 301
Query: 318 FMGIPFSWFLLRVIPQSVSSFLTFSVTLFLMGLTISWNATAANGPMFAEVVPVKHRTMIY 377
IP + LL +P S+ + + L +MGL ISWN A NGP+FAE+VP + RT IY
Sbjct: 302 GSAIPLAAILLIGLPDDPSTAFSHGLVLVIMGLCISWNGAATNGPIFAEIVPERARTSIY 361
Query: 378 AFDRAFEGSFSSVAAPLVGILSEKIFGYN-----AKSVDPIKGSSPEALALSKGLLSMMV 432
A DR+FE +S A P+VG+L++ I+GY + S I A +L+K L + +
Sbjct: 362 ALDRSFESILASFAPPIVGMLAQNIYGYKPIPEGSTSSVKIDTDRANAASLAKALYTSIG 421
Query: 433 IPFGXXXXXXXXXXXXFKKDRENARIVALKEEEM 466
IP + +DR+ A++ AL E EM
Sbjct: 422 IPMVICCTIYSFLYCTYPRDRDRAKMQALIESEM 455
>AT1G78130.1 | Symbols: UNE2 | Major facilitator superfamily protein
| chr1:29400171-29401814 FORWARD LENGTH=490
Length = 490
Score = 287 bits (734), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 170/452 (37%), Positives = 241/452 (53%), Gaps = 17/452 (3%)
Query: 22 MERADENLLPSVYKEVSEAFNAGPSDLGYLTFIRNFVQGLSSPLAGILVINYDRPTILAM 81
MERADE+LLP VYKEV A + P+ LG LT +R+ VQ PLA + I ++R ++A+
Sbjct: 17 MERADESLLPGVYKEVGLALHTDPTGLGSLTLLRSMVQAACYPLAAYMAIRHNRAHVIAL 76
Query: 82 GTFCWALSTAAVGVCRDFMQVAFWRAINGFGLAIVIPALQSFIADSYMDGVRXXXXXXXX 141
G F W+ +T V F QVA RA+NG GLA+V PA+QS +ADS D R
Sbjct: 77 GAFLWSAATFLVAFSSTFFQVAVSRALNGIGLALVAPAIQSLVADSTDDANRGTAFGWLQ 136
Query: 142 XXXXXXXXXXXXXXXXXXXQQFWGIQGWRCAFILMATLSALIGTLVLLYVVEPKKRFTTN 201
F GI GWR AF ++ +S ++G LV ++ +P F +
Sbjct: 137 LTANIGSILGGLCSVLIAPLTFMGIPGWRVAFHIVGVISVIVGVLVRVFANDP--HFVKD 194
Query: 202 -VDASQSSDREDPIYKGNASVASIWMTSWAAMKAVIKVKTFQIIVLQGIIGSLPWTAMVF 260
VD S P + VIK+++FQIIV QG+ GS PW+A+ F
Sbjct: 195 GVDVSNQPGSRKPFCTEVKDLVR-------EADTVIKIRSFQIIVAQGVTGSFPWSALSF 247
Query: 261 FTMWFELIGFDNNTSATLLSLFAIGCAMGSFFGGSIADQLSRVYPHSGRIMCAQFSAFMG 320
MW ELIGF + +A L+ LF ++G FGG + D LS P+SGRI+ AQ S+
Sbjct: 248 APMWLELIGFSHGKTAFLMGLFVAASSLGGLFGGKMGDFLSTRLPNSGRIILAQISSASA 307
Query: 321 IPFSWFLLRVIPQSVSSFLTFSVTLFLMGLTISWNATAANGPMFAEVVPVKHRTMIYAFD 380
IP + LL V+P S+ + L L+GL +SWNA A N P+FAE+VP K RT +YA D
Sbjct: 308 IPLAAILLLVLPDDPSTAAIHGLILVLLGLFVSWNAPATNNPIFAEIVPEKSRTSVYALD 367
Query: 381 RAFEGSFSSVAAPLVGILSEKIFGY------NAKSVDPIKGSSPEALALSKGLLSMMVIP 434
++FE SS A P+VGIL++ ++GY +++S + I A +L+K L + + +P
Sbjct: 368 KSFESILSSFAPPIVGILAQHVYGYKPIPEGSSRSTE-IATDRENAASLAKALYTSIGLP 426
Query: 435 FGXXXXXXXXXXXXFKKDRENARIVALKEEEM 466
+ DR+ AR+ A + EM
Sbjct: 427 MAACCFIYSFLYRSYPLDRDRARMEAFIDSEM 458