Miyakogusa Predicted Gene
- Lj4g3v3093890.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v3093890.1 Non Chatacterized Hit- tr|I1KQ35|I1KQ35_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.54097
PE,85.53,0,UNCHARACTERIZED,NULL; SERINE INCORPORATOR,TMS membrane
protein/tumour differentially expressed prote,CUFF.52243.1
(388 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G24460.1 | Symbols: | Serinc-domain containing serine and sp... 546 e-155
AT4G13345.1 | Symbols: MEE55 | Serinc-domain containing serine a... 523 e-148
AT4G13345.2 | Symbols: MEE55 | Serinc-domain containing serine a... 520 e-148
AT2G33205.1 | Symbols: | Serinc-domain containing serine and sp... 333 1e-91
AT3G06170.1 | Symbols: | Serinc-domain containing serine and sp... 162 3e-40
AT1G16180.2 | Symbols: | Serinc-domain containing serine and sp... 147 1e-35
AT1G16180.1 | Symbols: | Serinc-domain containing serine and sp... 147 1e-35
>AT3G24460.1 | Symbols: | Serinc-domain containing serine and
sphingolipid biosynthesis protein | chr3:8886160-8889717
REVERSE LENGTH=409
Length = 409
Score = 546 bits (1406), Expect = e-155, Method: Compositional matrix adjust.
Identities = 259/397 (65%), Positives = 309/397 (77%), Gaps = 9/397 (2%)
Query: 1 METG--VSNNG----NEACTISKDTSWWGQFRNASNPGMARYVYALMFLVANLLAWAARD 54
METG VSNN N++ K+ SW+ QFRN NP MARYVY L+FL+ANLLAWAARD
Sbjct: 1 METGTSVSNNNQSIRNDSYEAIKNGSWFNQFRNGCNPWMARYVYGLIFLIANLLAWAARD 60
Query: 55 YGRGALTEMERLKGCNGGKDCLGAEGVLRVSLGCFIFYFIMFLTTARTSKLNEVRDTWHS 114
YGRGAL ++ R K C GG++CLG +GVLRVSLGCF+FYF+MFL+T TSK + RD WHS
Sbjct: 61 YGRGALRKVTRFKNCKGGENCLGTDGVLRVSLGCFLFYFVMFLSTLGTSKTHSSRDRWHS 120
Query: 115 GWWSVKIVLWVGMTVIPFLLPSEFIQIYGEVAHFGAGVXXXXXXXXXXXXXTWLNDCCES 174
GWW VK+++W +T+IPFLLPS I +YGE+AHFGAGV WLN+C +S
Sbjct: 121 GWWFVKLIMWPALTIIPFLLPSSIIHLYGEIAHFGAGVFLLIQLISVISFIQWLNECYQS 180
Query: 175 EKYAAKCQIHVMLFATTAYVVCLVGIILMFIWYAPKPSCLLNIFFIAWTLVLLQLMTSVS 234
+K A +C+++VML +TT+Y VC+VG+ILM+IWYAP SCLLNIFFI WTL L+QLMTS++
Sbjct: 181 QKDAERCRVYVMLLSTTSYTVCIVGVILMYIWYAPDSSCLLNIFFITWTLFLIQLMTSIA 240
Query: 235 LHPKVNAGILTPGLMGLYVVFLCWNAIRSEPDGGSCIRKSDSSTKTDWXXXXXXXXXXXX 294
LHPKVNAG LTP LMGLYVVF+CW AIRSEP G SC RK+ +S +TDW
Sbjct: 241 LHPKVNAGYLTPALMGLYVVFICWCAIRSEPVGESCNRKAAASNRTDWLTIISFVVALLA 300
Query: 295 XXXXTFSTGIDSKCFQFRKDD---DIPAEDDVPYGYGFFHFVFATGAMYFAMLLIGWNSH 351
TFSTGIDS+CFQF+KD+ + AEDDVPYGYGFFHFVFATGAMYFAMLLIGWN+H
Sbjct: 301 MVIATFSTGIDSQCFQFKKDENDQEEEAEDDVPYGYGFFHFVFATGAMYFAMLLIGWNTH 360
Query: 352 HSMRKWTIDVGWTSTWVRIVNEWLAVCVYLWMLVAPI 388
H M+KWTIDVGWTSTWVR+VNEWLAVCVY+WMLVAP+
Sbjct: 361 HPMKKWTIDVGWTSTWVRVVNEWLAVCVYIWMLVAPL 397
>AT4G13345.1 | Symbols: MEE55 | Serinc-domain containing serine and
sphingolipid biosynthesis protein | chr4:7767292-7769426
FORWARD LENGTH=394
Length = 394
Score = 523 bits (1346), Expect = e-148, Method: Compositional matrix adjust.
Identities = 257/390 (65%), Positives = 300/390 (76%), Gaps = 6/390 (1%)
Query: 1 METGVS--NNGNEACTISKDTSWWGQFRNASNPGMARYVYALMFLVANLLAWAARDYGRG 58
METG S N G E K+ SW+ QFRN NP MARYVY L+FL+ANLLAWA RDYGRG
Sbjct: 1 METGTSIDNTGYEGI---KNGSWFIQFRNGCNPWMARYVYGLIFLLANLLAWALRDYGRG 57
Query: 59 ALTEMERLKGCNGGKDCLGAEGVLRVSLGCFIFYFIMFLTTARTSKLNEVRDTWHSGWWS 118
ALTEM + K C G DCLG EGVLRVS GCF+FYFIMFL+T TSK + RD WHSGWW
Sbjct: 58 ALTEMRKFKNCKEGGDCLGTEGVLRVSFGCFLFYFIMFLSTVGTSKTHSSRDKWHSGWWF 117
Query: 119 VKIVLWVGMTVIPFLLPSEFIQIYGEVAHFGAGVXXXXXXXXXXXXXTWLNDCCESEKYA 178
K+ + +G+T+ PFLLPS IQ YGE+AHFGAGV TWLN+C +++K A
Sbjct: 118 AKLFMLLGLTIFPFLLPSSIIQFYGEIAHFGAGVFLLIQLISIISFITWLNECFQAQKDA 177
Query: 179 AKCQIHVMLFATTAYVVCLVGIILMFIWYAPKPSCLLNIFFIAWTLVLLQLMTSVSLHPK 238
+C +HVML ATTAY VC++G+ILM+IWY P+PSCLLNIFFI WTL L+QLMTS+SLHPK
Sbjct: 178 ERCHVHVMLLATTAYTVCILGVILMYIWYVPEPSCLLNIFFITWTLFLIQLMTSISLHPK 237
Query: 239 VNAGILTPGLMGLYVVFLCWNAIRSEPDGGSCIRKSDSSTKTDWXXXXXXXXXXXXXXXX 298
+NAG LTP LMGLYVVF+CW AIRSEP G +C RK++ S++TDW
Sbjct: 238 INAGFLTPALMGLYVVFICWCAIRSEPVGETCNRKAEGSSRTDWLTIISFVVALLAMVIA 297
Query: 299 TFSTGIDSKCFQFRKDDDIPAEDDVPYGYGFFHFVFATGAMYFAMLLIGWNSHHSMRKWT 358
TFSTG+DS+CFQFRKD++ ED +PYGYGFFHFVFATGAMYFAMLL+GWN HHSM+KWT
Sbjct: 298 TFSTGVDSQCFQFRKDEN-HEEDAIPYGYGFFHFVFATGAMYFAMLLVGWNIHHSMKKWT 356
Query: 359 IDVGWTSTWVRIVNEWLAVCVYLWMLVAPI 388
IDVGWTSTWVRIVNEWLAV VY+WMLVAP+
Sbjct: 357 IDVGWTSTWVRIVNEWLAVGVYIWMLVAPM 386
>AT4G13345.2 | Symbols: MEE55 | Serinc-domain containing serine and
sphingolipid biosynthesis protein | chr4:7767292-7769426
FORWARD LENGTH=394
Length = 394
Score = 520 bits (1338), Expect = e-148, Method: Compositional matrix adjust.
Identities = 255/390 (65%), Positives = 299/390 (76%), Gaps = 6/390 (1%)
Query: 1 METGVS--NNGNEACTISKDTSWWGQFRNASNPGMARYVYALMFLVANLLAWAARDYGRG 58
METG S N G E K+ SW+ QFRN NP MARYVY L+FL+ANLLAWA RDYGRG
Sbjct: 1 METGTSIDNTGYEGI---KNGSWFIQFRNGCNPWMARYVYGLIFLLANLLAWALRDYGRG 57
Query: 59 ALTEMERLKGCNGGKDCLGAEGVLRVSLGCFIFYFIMFLTTARTSKLNEVRDTWHSGWWS 118
ALTEM + K C G DCLG EGVLRVS GCF+FYFIMFL+T TSK + RD WHSGWW
Sbjct: 58 ALTEMRKFKNCKEGGDCLGTEGVLRVSFGCFLFYFIMFLSTVGTSKTHSSRDKWHSGWWF 117
Query: 119 VKIVLWVGMTVIPFLLPSEFIQIYGEVAHFGAGVXXXXXXXXXXXXXTWLNDCCESEKYA 178
K+ + +G+T+ PFLLPS IQ YGE+AHFGAGV TWLN+C +++K A
Sbjct: 118 AKLFMLLGLTIFPFLLPSSIIQFYGEIAHFGAGVFLLIQLISIISFITWLNECFQAQKDA 177
Query: 179 AKCQIHVMLFATTAYVVCLVGIILMFIWYAPKPSCLLNIFFIAWTLVLLQLMTSVSLHPK 238
+C +HVML ATTAY VC++G+ILM+IWY P+PSCLLNIFFI WTL L+QLMTS+SLHPK
Sbjct: 178 ERCHVHVMLLATTAYTVCILGVILMYIWYVPEPSCLLNIFFITWTLFLIQLMTSISLHPK 237
Query: 239 VNAGILTPGLMGLYVVFLCWNAIRSEPDGGSCIRKSDSSTKTDWXXXXXXXXXXXXXXXX 298
+NAG LTP LMGLYVVF+CW AIR +P G +C RK++ S++TDW
Sbjct: 238 INAGFLTPALMGLYVVFICWCAIRRQPVGETCNRKAEGSSRTDWLTIISFVVALLAMVIA 297
Query: 299 TFSTGIDSKCFQFRKDDDIPAEDDVPYGYGFFHFVFATGAMYFAMLLIGWNSHHSMRKWT 358
TFSTG+DS+CFQFRKD++ ED +PYGYGFFHFVFATGAMYFAMLL+GWN HHSM+KWT
Sbjct: 298 TFSTGVDSQCFQFRKDEN-HEEDAIPYGYGFFHFVFATGAMYFAMLLVGWNIHHSMKKWT 356
Query: 359 IDVGWTSTWVRIVNEWLAVCVYLWMLVAPI 388
IDVGWTSTWVRIVNEWLAV VY+WMLVAP+
Sbjct: 357 IDVGWTSTWVRIVNEWLAVGVYIWMLVAPM 386
>AT2G33205.1 | Symbols: | Serinc-domain containing serine and
sphingolipid biosynthesis protein |
chr2:14070987-14073211 REVERSE LENGTH=422
Length = 422
Score = 333 bits (853), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 174/376 (46%), Positives = 229/376 (60%), Gaps = 6/376 (1%)
Query: 15 ISKDTSWWGQFRNASNPGMARYVYALMFLVANLLAWAARDYGRGALTEMERLKGCN-GGK 73
I + + ++ Q +N S ARY Y +FL+ NL AW RDY + AL + + C G
Sbjct: 32 IEQRSLYYYQEKNKS--LRARYTYGTIFLIINLCAWFIRDYAQKALALLPYVSSCGPEGS 89
Query: 74 DCLGAEGVLRVSLGCFIFYFIMFLTTARTSKLNEVRDTWHSGWWSVKIVLWVGMTVIPFL 133
C GVLRVSLGCFIFYF+MFL+T T KL+E +++WHS W K L V + V F
Sbjct: 90 RCFHTLGVLRVSLGCFIFYFVMFLSTWNTMKLHEAQNSWHSDNWIFKFFLLVIVMVASFF 149
Query: 134 LPSEFIQIYGEVAHFGAGVXXXXXXXXXXXXXTWLNDCCESEKYAAKCQIHVMLFATTAY 193
+P +IQIYGE+A GAG+ TW N+ + + + ++ + Y
Sbjct: 150 IPQLYIQIYGEIARVGAGIFLGLQLVSVIEFITWWNNYWMPQNQSKQSCSFGLVMSIVFY 209
Query: 194 VVCLVGIILMFIWYAPKPSCLLNIFFIAWTLVLLQLMTSVSLHPKV-NAGILTPGLMGLY 252
+ + GI +M+ +Y +C LNIFFI+WT++LL +M +SLH KV N G+L+ G+M Y
Sbjct: 210 IGSVCGIAVMYYFYGASTACGLNIFFISWTVILLIVMMVISLHSKVKNRGLLSSGIMASY 269
Query: 253 VVFLCWNAIRSEPDGGSCIRKSDSSTKTDWXXXXXXXXXXXXXXXXTFSTGIDSKCFQFR 312
+VFLCW+AIRSEP C + +S TDW TFSTGIDS+ F+FR
Sbjct: 270 IVFLCWSAIRSEPSHTKCNAHTQNS-HTDWTTILSFLIAIGAIVMATFSTGIDSESFRFR 328
Query: 313 KDDDIPAEDDVPYGYGFFHFVFATGAMYFAMLLIGWNSHHSMRKWTIDVGWTSTWVRIVN 372
KD+ EDD+PY YGFFH VF+ GAMYFAML I WN HS KW+IDVGWTSTWV+IVN
Sbjct: 329 KDEA-KEEDDIPYSYGFFHLVFSLGAMYFAMLFISWNLSHSTEKWSIDVGWTSTWVKIVN 387
Query: 373 EWLAVCVYLWMLVAPI 388
EW A +YLW L+API
Sbjct: 388 EWFAAAIYLWKLIAPI 403
>AT3G06170.1 | Symbols: | Serinc-domain containing serine and
sphingolipid biosynthesis protein | chr3:1867520-1869738
FORWARD LENGTH=409
Length = 409
Score = 162 bits (411), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 105/375 (28%), Positives = 173/375 (46%), Gaps = 22/375 (5%)
Query: 34 ARYVYALMFLVANLLAWAARDYGRGALTEMERLKGCNG-GKDCLGAEGVLRVSLGCFIFY 92
AR Y +F + +++W R+ G L ++ + + K+ + VLRVS G F+F+
Sbjct: 29 ARIAYCGLFGASLVVSWILRETGAPLLEKLPWINTSDSYTKEWYQQQAVLRVSFGNFLFF 88
Query: 93 FIMFLTTARTSKLNEVRDTWHSGWWSVKIVLWVGMTVIPFLLPSEFIQIYGEVAHFGAGV 152
I L N+ RD+WH G W +K+++W + V+ F +P+ + +YG ++ FGAG
Sbjct: 89 AIYALIMIGVKDQNDRRDSWHHGGWGLKMIVWFLLVVLMFFVPNVIVSLYGTLSKFGAGA 148
Query: 153 XXXXXXXXXXXXXTWLNDCCESEKYAAKCQIHVMLFATTAYVVCLVGIILMFIWYAPK-P 211
ND EK K I +++ + Y+ ++FIW+ P
Sbjct: 149 FLLVQVVLLLDATHNWND-SWVEKDEKKWYIALLVISIVCYIATYTFSGILFIWFNPSGQ 207
Query: 212 SCLLNIFFIAWTLVLLQLMTSVSLHPKVNAGILTPGLMGLYVVFLCWNAIRSEPDGGSC- 270
C LN+FFI ++L + ++LHP VN +L ++ +Y ++C+ + SEP C
Sbjct: 208 DCGLNVFFIVMPMILAFVFAIIALHPAVNGSLLPASVISVYCAYVCYTGLSSEPHDYVCN 267
Query: 271 -IRKSDSSTKTD-------------WXXXXXXXXXXXXXXXXTFSTGIDSKCFQFRKDDD 316
+ KS + + + + +G+ +D
Sbjct: 268 GLNKSKAVNASTLILGMLTTVLSVLYSALRAGSSTTFLSPPSSPRSGVKDALLGDPEDGK 327
Query: 317 IPAEDD---VPYGYGFFHFVFATGAMYFAMLLIGWNSHHSMRKWTIDVGWTSTWVRIVNE 373
E + V Y Y FFH +FA +MY AMLL GW + S IDVGWTS WV+I
Sbjct: 328 KSGEAEARPVSYSYSFFHIIFALASMYAAMLLSGW-TDSSESATLIDVGWTSVWVKICTG 386
Query: 374 WLAVCVYLWMLVAPI 388
W+ +Y+W L+AP+
Sbjct: 387 WVTAGLYIWTLIAPL 401
>AT1G16180.2 | Symbols: | Serinc-domain containing serine and
sphingolipid biosynthesis protein | chr1:5540905-5542670
FORWARD LENGTH=412
Length = 412
Score = 147 bits (371), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 111/383 (28%), Positives = 166/383 (43%), Gaps = 36/383 (9%)
Query: 34 ARYVYALMFLVANLLAWAARDYGRGALTEMERLKGCN-----GGKDCLGAEGVLRVSLGC 88
AR Y +F ++ +++W R+ A ME+L N ++ + VLRVSLG
Sbjct: 31 ARIAYCGLFALSLIVSWILREV---AAPLMEKLPWINHFHKTPDREWFETDAVLRVSLGN 87
Query: 89 FIFYFIMFLTTARTSKLNEVRDTWHSGWWSVKIVLWVGMTVIPFLLPSEFIQIYGEVAHF 148
F+F+ I+ + + RD H G W +KI+ W + + F LP+E I Y ++ F
Sbjct: 88 FLFFSILSVMMIGVKNQKDPRDGIHHGGWMMKIICWCILVIFMFFLPNEIISFYESMSKF 147
Query: 149 GAGVXXXXXXXXXXXXXTWLNDC----CESEKYAAKCQIHVMLFATTAYVVCLVGIILMF 204
GAG ND E YAA +++ + Y+ V +F
Sbjct: 148 GAGFFLLVQVVLLLDFVHGWNDTWVGYDEQFWYAA-----LLVVSLVCYLATFVFSGFLF 202
Query: 205 IWYAPK-PSCLLNIFFIAWTLVLLQLMTSVSLHPKVNAGILTPGLMGLYVVFLCWNAIRS 263
W+ P C LN FFI TL+ + + V LHP V IL ++ LY ++LC++ + S
Sbjct: 203 HWFTPSGHDCGLNTFFIIMTLIFVFVFAIVVLHPTVGGSILPASVISLYCMYLCYSGLAS 262
Query: 264 EPDGGSC-----IRKSDSSTKTDWXXXXXXXXXXXXXXXXTFSTGIDSKCFQFRKDDDIP 318
EP C K+ S+ ST + S R + +
Sbjct: 263 EPRDYECNGLHNHSKAVSTGTMTIGLLTTVLSVVYSAVRAGSSTTLLSPPDSPRAEKPLL 322
Query: 319 AED-------------DVPYGYGFFHFVFATGAMYFAMLLIGWNSHHSMRKWTIDVGWTS 365
D V Y Y FFH +F+ +MY AMLL GW++ +DVGW S
Sbjct: 323 PIDGKAEEKEEKENKKPVSYSYAFFHIIFSLASMYSAMLLTGWSTSVGESGKLVDVGWPS 382
Query: 366 TWVRIVNEWLAVCVYLWMLVAPI 388
WVR+V W +++W LVAPI
Sbjct: 383 VWVRVVTSWATAGLFIWSLVAPI 405
>AT1G16180.1 | Symbols: | Serinc-domain containing serine and
sphingolipid biosynthesis protein | chr1:5540905-5542670
FORWARD LENGTH=412
Length = 412
Score = 147 bits (371), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 111/383 (28%), Positives = 166/383 (43%), Gaps = 36/383 (9%)
Query: 34 ARYVYALMFLVANLLAWAARDYGRGALTEMERLKGCN-----GGKDCLGAEGVLRVSLGC 88
AR Y +F ++ +++W R+ A ME+L N ++ + VLRVSLG
Sbjct: 31 ARIAYCGLFALSLIVSWILREV---AAPLMEKLPWINHFHKTPDREWFETDAVLRVSLGN 87
Query: 89 FIFYFIMFLTTARTSKLNEVRDTWHSGWWSVKIVLWVGMTVIPFLLPSEFIQIYGEVAHF 148
F+F+ I+ + + RD H G W +KI+ W + + F LP+E I Y ++ F
Sbjct: 88 FLFFSILSVMMIGVKNQKDPRDGIHHGGWMMKIICWCILVIFMFFLPNEIISFYESMSKF 147
Query: 149 GAGVXXXXXXXXXXXXXTWLNDC----CESEKYAAKCQIHVMLFATTAYVVCLVGIILMF 204
GAG ND E YAA +++ + Y+ V +F
Sbjct: 148 GAGFFLLVQVVLLLDFVHGWNDTWVGYDEQFWYAA-----LLVVSLVCYLATFVFSGFLF 202
Query: 205 IWYAPK-PSCLLNIFFIAWTLVLLQLMTSVSLHPKVNAGILTPGLMGLYVVFLCWNAIRS 263
W+ P C LN FFI TL+ + + V LHP V IL ++ LY ++LC++ + S
Sbjct: 203 HWFTPSGHDCGLNTFFIIMTLIFVFVFAIVVLHPTVGGSILPASVISLYCMYLCYSGLAS 262
Query: 264 EPDGGSC-----IRKSDSSTKTDWXXXXXXXXXXXXXXXXTFSTGIDSKCFQFRKDDDIP 318
EP C K+ S+ ST + S R + +
Sbjct: 263 EPRDYECNGLHNHSKAVSTGTMTIGLLTTVLSVVYSAVRAGSSTTLLSPPDSPRAEKPLL 322
Query: 319 AED-------------DVPYGYGFFHFVFATGAMYFAMLLIGWNSHHSMRKWTIDVGWTS 365
D V Y Y FFH +F+ +MY AMLL GW++ +DVGW S
Sbjct: 323 PIDGKAEEKEEKENKKPVSYSYAFFHIIFSLASMYSAMLLTGWSTSVGESGKLVDVGWPS 382
Query: 366 TWVRIVNEWLAVCVYLWMLVAPI 388
WVR+V W +++W LVAPI
Sbjct: 383 VWVRVVTSWATAGLFIWSLVAPI 405