Miyakogusa Predicted Gene
- Lj0g3v0101219.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0101219.1 Non Characterized Hit- tr|G7J850|G7J850_MEDTR
Membrane protein, putative OS=Medicago truncatula
GN=M,81.71,0,seg,NULL; FAMILY NOT NAMED,NULL,CUFF.5675.1
(430 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value
AT4G12680.1 | Symbols: | unknown protein; INVOLVED IN: vegetati...     525   e-149
AT5G40640.1 | Symbols: | unknown protein; LOCATED IN: plasma me...     281   8e-76
AT3G27390.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...     277   1e-74
AT4G37030.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...     250   2e-66
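The summary rows above can be read programmatically. The following is a minimal sketch, not part of the BLAST output: it assumes each row ends with the bit score and the E-value as its last two whitespace-separated fields, and it handles the legacy shorthand `e-149` (no leading digit) by prefixing a `1` before conversion. The names `HIT_LINES`, `ROW`, and `parse_hit` are hypothetical.

```python
import re

# Rows copied from the hit table in this report.
HIT_LINES = [
    "AT4G12680.1 | Symbols: | unknown protein; INVOLVED IN: vegetati... 525 e-149",
    "AT5G40640.1 | Symbols: | unknown protein; LOCATED IN: plasma me... 281 8e-76",
    "AT3G27390.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 277 1e-74",
    "AT4G37030.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 250 2e-66",
]

# Assumed row layout: sequence ID first, then free text,
# then the bit score and the E-value as the final two fields.
ROW = re.compile(r"^(\S+)\s.*\s(\d+(?:\.\d+)?)\s+(\S+)$")


def parse_hit(line):
    """Return (sequence_id, bit_score, e_value) for one summary row."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized hit line: {line!r}")
    seq_id, bits, evalue = match.groups()
    # Legacy BLAST writes very small E-values as "e-149"; float() needs "1e-149".
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return seq_id, float(bits), float(evalue)


hits = [parse_hit(line) for line in HIT_LINES]
for seq_id, bits, evalue in hits:
    print(seq_id, bits, evalue)
```

The pattern tolerates any amount of padding between columns, so it should work whether or not the column alignment of the table has been preserved.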
>AT4G12680.1 | Symbols: | unknown protein; INVOLVED IN: vegetative
to reproductive phase transition of meristem; LOCATED
IN: endomembrane system; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G40640.1);
Has 103 Blast hits to 103 proteins in 14 species: Archae
- 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 103;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
| chr4:7475104-7478174 FORWARD LENGTH=575
Length = 575
Score = 525 bits (1352), Expect = e-149, Method: Compositional matrix adjust.
Identities = 258/431 (59%), Positives = 318/431 (73%), Gaps = 3/431 (0%)
Query: 1 MVQDVTDFCYYSYFSYTDELRENLPPNEKPIDIRXXXXXXXXXXXXXXXXFAMAIIISIA 60
+V D TDFC++SYFSY DELRE + + +P++I+ + +I ++A
Sbjct: 145 VVTDFTDFCFHSYFSYMDELREMVSADVEPLEIKLSRLPSCLLASLIGVMVDVLLITAVA 204
Query: 61 IWKSPYMLFRGWKRLLEDLIGRKGPFLETECVPFAGLAIILWPLAVVGAVLAASIISFFL 120
++KSPYML +GWKRLLEDL+GR+GPFLE+ CVPFAGLAI+LWPLAV GAV+A+ + SFFL
Sbjct: 205 VYKSPYMLLKGWKRLLEDLVGREGPFLESVCVPFAGLAILLWPLAVAGAVIASVLSSFFL 264
Query: 121 GLYSGVVVHQEDSMQMGFAYIVSVVSLFDEYVNDLLYLREGSCIPRPIYRRKMTHALESK 180
GLYSGV+VHQEDS +MG YI++ VSLFDEYVNDLLYLREG+ +PRP YR K +
Sbjct: 265 GLYSGVIVHQEDSFRMGLNYIIAAVSLFDEYVNDLLYLREGTSLPRPCYRTKTETVHGKR 324
Query: 181 SLGGS-NHNLKIRRDSSQNSKHILQQTRSLKWKIQQYKPVQVWDWLFKSCEVNGRIVLRD 239
LG S N +LK +R SS SK + +Q+R+LK I YKPVQVW+WLFKSCEVNGRI+LRD
Sbjct: 325 ILGESKNVDLKSKRSSSLGSKLVSEQSRTLKKAITLYKPVQVWEWLFKSCEVNGRILLRD 384
Query: 240 GLISVKEIEECILKGNCKKLGIKLPAWSLLQCLLTSAKSNSDGLVISDEVELTRMNGPKD 299
GLI VK++EEC++KGN KKL IKLPAW++LQCLL SAKSNS GLVI+D VELT +N P+D
Sbjct: 385 GLIDVKDVEECLVKGNSKKLYIKLPAWTVLQCLLASAKSNSSGLVITDGVELTELNSPRD 444
Query: 300 KVFEWFIGPLLIMXXXXXXXXXXXXXXXXXXXXVMRCKNDIPEEWDSTGFPSNDNVRRAQ 359
KVF W +GPLLIM VM CKN+ E+WD+TGFPS+D VR+AQ
Sbjct: 445 KVFVWLVGPLLIMKEQIKNLKLTEDEEFCLRKLVMVCKNERTEDWDNTGFPSSDTVRKAQ 504
Query: 360 LQAIIRRLQGIVASMSRIPTFRRRFRNLVKVLYMEALQASASASHIGANAIPKHREKGSL 419
LQAIIRRLQG+VASMSRIPTFRRRF NLVKVLY+EAL+ AS + G P + G+L
Sbjct: 505 LQAIIRRLQGMVASMSRIPTFRRRFMNLVKVLYIEALEMGASGNRAGGILKPNSDQTGNL 564
Query: 420 QRKE--DNNVV 428
R E D +VV
Sbjct: 565 DRTETPDMDVV 575
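The Score and Expect values in the alignment header are linked by the standard bit-score relation E ≈ m · n · 2^(−S'), where S' is the bit score, m the query length, and n the database size. The sketch below is a rough check only: it assumes the raw lengths printed in this report (430 letters, 14,482,855 total letters), whereas BLAST itself uses edge-corrected effective lengths and unrounded bit scores, so the estimates are expected to match the reported values only to within about an order of magnitude. The function name `approx_evalue` is hypothetical.

```python
QUERY_LENGTH = 430              # "(430 letters)" in the query header
DATABASE_LETTERS = 14_482_855   # "14,482,855 total letters" in the database header

# Bit scores and E-values as printed in the hit table of this report.
REPORTED = {
    "AT4G12680.1": (525, 1e-149),
    "AT5G40640.1": (281, 8e-76),
    "AT3G27390.1": (277, 1e-74),
    "AT4G37030.1": (250, 2e-66),
}


def approx_evalue(bit_score, m=QUERY_LENGTH, n=DATABASE_LETTERS):
    """Estimate E = m * n * 2**(-S') using raw (not effective) lengths."""
    return m * n * 2.0 ** (-bit_score)


for seq_id, (bits, reported) in REPORTED.items():
    print(f"{seq_id}: estimated {approx_evalue(bits):.1e}, reported {reported:.0e}")
```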
>AT5G40640.1 | Symbols: | unknown protein; LOCATED IN: plasma
membrane; EXPRESSED IN: 19 plant structures; EXPRESSED
DURING: 7 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT3G27390.1);
Has 104 Blast hits to 102 proteins in 14 species: Archae
- 0; Bacteria - 0; Metazoa - 2; Fungi - 0; Plants - 101;
Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink).
| chr5:16277345-16280258 FORWARD LENGTH=586
Length = 586
Score = 281 bits (718), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 167/398 (41%), Positives = 218/398 (54%), Gaps = 21/398 (5%)
Query: 1 MVQDVTDFCYYSYFSYTDELRENLPPNEKPIDIRXXXXXXXXXXXXXXXXFAMAIIISIA 60
+V D D C++SYFS+ D+LR + N +IR +I +A
Sbjct: 145 VVCDFKDVCFHSYFSFMDDLRTS-TANRHYYEIRLLQIPGAVIVAVLGILVDFPVISLLA 203
Query: 61 IWKSPYMLFRGWKRLLEDLIGRKGPFLETECVPFAGLAIILWPLAVVGAVLAASIISFFL 120
+ KSPYMLF+GW RL DLIGR+GPFLET CVP AGL I+LWPLAVVGAVL + + S FL
Sbjct: 204 LCKSPYMLFKGWHRLFHDLIGREGPFLETMCVPIAGLVILLWPLAVVGAVLGSVVSSVFL 263
Query: 121 GLYSGVVVHQEDSMQMGFAYIVSVVSLFDEYVNDLLYLREGSCIPRPIYRRKMTHALESK 180
G Y GVV +QE S G Y+V+ VS++DEY ND+L + EGSC PRPIYRR A +
Sbjct: 264 GAYGGVVSYQESSFFFGLCYVVASVSIYDEYSNDVLDMPEGSCFPRPIYRRNEEGASTAF 323
Query: 181 SLGGSNHNLKIRRDSSQNSKHILQQTRSLKWKIQQYKPVQVWDWLFKSCEVNGRIVLRDG 240
S G S N + K + S K + KP+ + + LF C +G I++ G
Sbjct: 324 SGGLSRPN---------SFKTTPSRGGSNKGPMIDLKPLDLLEALFVECRRHGEIMVTKG 374
Query: 241 LISVKEIEECILKGNCKKLGIKLPAWSLLQCLLTSAKSNSDGLVISDEV-ELTRMNGPKD 299
+I+ K+IEE + + LPA+SLL LL S KSNS GL++ D V E+T N PKD
Sbjct: 375 IINSKDIEEAKSSKGSQVISFGLPAYSLLHELLRSIKSNSTGLLLGDGVTEITTRNRPKD 434
Query: 300 KVFEWFIGPLLIMXXXXXXXXXXXXXXXXXXXXVM------RCKNDIPEEWDSTGFPSND 353
F+WF+ P LI+ V+ R K+ I E P
Sbjct: 435 AFFDWFLNPFLILKDQIEAANLSEEEEEYLGKLVLLFGDSERLKSSIVESES----PPLT 490
Query: 354 NVRRAQLQAIIRRLQGIVASMSRIPTFRRRFRNLVKVL 391
+R+A+L A RRLQG+ S+SR PTFRR F LVK L
Sbjct: 491 ELRKAELDAFARRLQGLTKSVSRYPTFRRHFVELVKKL 528
>AT3G27390.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane;
EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G40640.1); Has 101 Blast
hits to 99 proteins in 12 species: Archae - 0; Bacteria
- 0; Metazoa - 0; Fungi - 0; Plants - 101; Viruses - 0;
Other Eukaryotes - 0 (source: NCBI BLink). |
chr3:10133372-10136111 REVERSE LENGTH=588
Length = 588
Score = 277 bits (708), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 163/396 (41%), Positives = 220/396 (55%), Gaps = 17/396 (4%)
Query: 1 MVQDVTDFCYYSYFSYTDELRENLPPNEKPIDIRXXXXXXXXXXXXXXXXFAMAIIISIA 60
+V+D D C++SYFS DEL+++ P + K +IR +I +A
Sbjct: 145 VVRDFKDVCFHSYFSLMDELKQSCP-DRKYYEIRLLQLPGALVVSVLGILVDPPVISLVA 203
Query: 61 IWKSPYMLFRGWKRLLEDLIGRKGPFLETECVPFAGLAIILWPLAVVGAVLAASIISFFL 120
I KSPYMLF+GW RL DLIGR+GPFLET CVP AGLAI+LWPLAV GAV+ + I S FL
Sbjct: 204 ICKSPYMLFKGWHRLFHDLIGREGPFLETMCVPIAGLAILLWPLAVTGAVIGSVISSIFL 263
Query: 121 GLYSGVVVHQEDSMQMGFAYIVSVVSLFDEYVNDLLYLREGSCIPRPIYRRKMTHALESK 180
G Y+GVV +QE S G YIV+ VS++DEY D+L L EGSC PRP YRRK E
Sbjct: 264 GAYAGVVSYQESSFYYGLCYIVASVSIYDEYSTDILDLPEGSCFPRPKYRRKDE---EPT 320
Query: 181 SLGGSNHNLKIRRDSSQNSKHILQQTRSLKWKIQQYKPVQVWDWLFKSCEVNGRIVLRDG 240
G L +++S + S++ + KP+ + + LF C G ++ G
Sbjct: 321 PFSGPVPRLGSVKNASS------MRGGSVRVPMIDIKPLDLLNELFVECRRYGEVLATKG 374
Query: 241 LISVKEIEECILKGNCKKLGIKLPAWSLLQCLLTSAKSNSDGLVISDEV-ELTRMNGPKD 299
LI+ K+IEE + + + LPA+ LL +L S K+NS GL++SD V E+T MN PKD
Sbjct: 375 LINSKDIEEARSSKGSQVISVGLPAYGLLYEILRSVKANSSGLLLSDGVTEITTMNRPKD 434
Query: 300 KVFEWFIGPLLIMXXXXXXXXXXXXXXXXXXXXVMRCKNDIPEEWDS----TGFPSNDNV 355
F+WF+ P LI+ V+ + PE S + P
Sbjct: 435 VFFDWFLNPFLILKEQMKATNLSEEEEEYLGRLVLLFGD--PERLKSSNAISASPPLTER 492
Query: 356 RRAQLQAIIRRLQGIVASMSRIPTFRRRFRNLVKVL 391
+RA+L A RR+QG+ ++SR PTFRR F LVK L
Sbjct: 493 KRAELDAFARRMQGLTKTVSRYPTFRRHFVALVKKL 528
>AT4G37030.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT4G12680.1); Has 101 Blast hits
to 99 proteins in 12 species: Archae - 0; Bacteria - 0;
Metazoa - 0; Fungi - 0; Plants - 101; Viruses - 0; Other
Eukaryotes - 0 (source: NCBI BLink). |
chr4:17452150-17454629 FORWARD LENGTH=569
Length = 569
Score = 250 bits (638), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 141/403 (34%), Positives = 221/403 (54%), Gaps = 16/403 (3%)
Query: 1 MVQDVTDFCYYSYFSYTDELRENLPPNEKPIDIRXXXXXXXXXXXXXXXXFAMAIIISIA 60
+V D DFCY+SY Y ELRE+ P +++ +R + + +IA
Sbjct: 146 VVTDFADFCYHSYPLYLKELRES-PVSDELQTLRLIHVPGCIIVGILGLVIDIPLFTAIA 204
Query: 61 IWKSPYMLFRGWKRLLEDLIGRKGPFLETECVPFAGLAIILWPLAVVGAVLAASIISFFL 120
+ KSPY+L +GW RL +D I R+GPFLE C+P AGL ++LWP+ V+G +L S F+
Sbjct: 205 VIKSPYLLLKGWYRLAQDAINREGPFLEIACIPVAGLTVLLWPIVVIGFILVTIFSSIFV 264
Query: 121 GLYSGVVVHQEDSMQMGFAYIVSVVSLFDEYVNDLLYLREGSCIPRPIYRRKMTHALESK 180
GLY VVV QE S + G +Y+++VV FDEY ND LYLREG+ P+P YR M S
Sbjct: 265 GLYGAVVVFQERSFRRGVSYVIAVVGEFDEYTNDWLYLREGTIFPKPRYR--MGRGSFSS 322
Query: 181 SLGGSNHNLKIRRDSSQNSKHI-------LQQTRSLKWKIQQYKPVQVWDWLFKSCEVNG 233
+ H + R +S S L + S++ IQ+ + VQ+W+ + E+ G
Sbjct: 323 EVSVIVHPSDVTRVNSSGSVDAPAMLVPSLVHSVSVREAIQEVRMVQIWEHMMGWFEMQG 382
Query: 234 RIVLRDGLISVKEIEECILKG----NCKKLGIKLPAWSLLQCLLTSAKSNSDGLVISDEV 289
+ +L +++ ++ E LKG + + LP+++LL LL+S K+ G+++ D
Sbjct: 383 KELLDAEVLTPTDLYES-LKGRHGNESSIINVGLPSYALLHTLLSSIKAGVHGVLLLDGS 441
Query: 290 ELTRMNGPKDKVFEWFIGPLLIMXXXXXXXXXXXXXXXXXXXXVMRCKNDI-PEEWDSTG 348
E+T +N P+DK +W P++++ V+ ++ E WD+
Sbjct: 442 EVTHLNRPQDKFLDWVFNPIMVLKDQIRALKLGESEVKYLEKVVLFGNHEQRMEAWDNHS 501
Query: 349 FPSNDNVRRAQLQAIIRRLQGIVASMSRIPTFRRRFRNLVKVL 391
P +N+R AQ+Q I RR+ G+V S+S++PT+RRRFR +VK L
Sbjct: 502 NPPQENLRTAQIQGISRRMMGMVRSVSKLPTYRRRFRQVVKAL 544
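The percentages on the Identities / Positives / Gaps lines follow from the counts printed next to them. Judging from the four alignments in this report, the values appear to be truncated rather than rounded (258/431 is about 59.9% but is shown as 59%). The sketch below reproduces the printed percentages under that assumption; `ALIGNMENTS`, `REPORTED_PERCENT`, and `truncated_percent` are hypothetical names.

```python
# (identities, positives, gaps, alignment length) copied from the alignment headers.
ALIGNMENTS = {
    "AT4G12680.1": (258, 318, 3, 431),
    "AT5G40640.1": (167, 218, 21, 398),
    "AT3G27390.1": (163, 220, 17, 396),
    "AT4G37030.1": (141, 221, 16, 403),
}

# Percentages as printed in the report, in the same order.
REPORTED_PERCENT = {
    "AT4G12680.1": (59, 73, 0),
    "AT5G40640.1": (41, 54, 5),
    "AT3G27390.1": (41, 55, 4),
    "AT4G37030.1": (34, 54, 3),
}


def truncated_percent(count, length):
    """Integer percentage, rounded down (assumed to match the report's convention)."""
    return (100 * count) // length


for seq_id, (ident, pos, gaps, length) in ALIGNMENTS.items():
    computed = tuple(truncated_percent(c, length) for c in (ident, pos, gaps))
    print(seq_id, computed, computed == REPORTED_PERCENT[seq_id])
```

Because of the truncation, values read from these lines can understate the true percentage by up to one point; recomputing from the counts gives the exact figure.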