Miyakogusa Predicted Gene
- Lj0g3v0305589.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0305589.1 Non Chatacterized Hit- tr|D8SUN4|D8SUN4_SELML
Putative uncharacterized protein OS=Selaginella
moelle,28.32,3e-17,seg,NULL; FAMILY NOT NAMED,NULL,CUFF.20567.1
(438 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G27390.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 483 e-136
AT5G40640.1 | Symbols: | unknown protein; LOCATED IN: plasma me... 467 e-132
AT4G12680.1 | Symbols: | unknown protein; INVOLVED IN: vegetati... 320 1e-87
AT4G37030.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 249 2e-66
>AT3G27390.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane;
EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G40640.1); Has 101 Blast
hits to 99 proteins in 12 species: Archae - 0; Bacteria
- 0; Metazoa - 0; Fungi - 0; Plants - 101; Viruses - 0;
Other Eukaryotes - 0 (source: NCBI BLink). |
chr3:10133372-10136111 REVERSE LENGTH=588
Length = 588
Score = 483 bits (1242), Expect = e-136, Method: Compositional matrix adjust.
Identities = 240/422 (56%), Positives = 309/422 (73%), Gaps = 3/422 (0%)
Query: 1 MEPPVEFWASLWSFICFLPYFIGLWLLGHIKGVIFCPLICLIMTIGNSAIILGLWVIHCI 60
MEPP+ F ASL+ F+ FLPYFIGL LG IKG++ CPL+CL++TIGNSA+IL L +H +
Sbjct: 1 MEPPIGFRASLFQFLLFLPYFIGLLFLGFIKGIVLCPLVCLVVTIGNSAVILSLLPVHIV 60
Query: 61 WTYYCVVRSKQLGPLFKLVMCTCLLPALLILWPXXXXXXXXXXXAAYGFFSPIVATFEAV 120
WT+Y +V +KQ+GP+ K+ +C CL PA +ILWP A YGFFSPI ATF+AV
Sbjct: 61 WTFYSIVSAKQVGPILKIFLCLCL-PAAIILWPIVGILGSVLGGALYGFFSPIFATFDAV 119
Query: 121 EGGKENKIVHCFIDGTWSTVLKTFDIVKDVGNACFHTYFSVMDDLKEE-GNGRYYEIRPL 179
GK + HCF DGTWST+ ++F +V+D + CFH+YFS+MD+LK+ + +YYEIR L
Sbjct: 120 GEGKPYQFFHCFYDGTWSTMQRSFTVVRDFKDVCFHSYFSLMDELKQSCPDRKYYEIRLL 179
Query: 180 YVPGAVVASVLGIIIDVPVISFVAIYKSPYMLFKGWNRLLHDLIGREGPFLETICVPLAG 239
+PGA+V SVLGI++D PVIS VAI KSPYMLFKGW+RL HDLIGREGPFLET+CVP+AG
Sbjct: 180 QLPGALVVSVLGILVDPPVISLVAICKSPYMLFKGWHRLFHDLIGREGPFLETMCVPIAG 239
Query: 240 LAILLWPLAVVGAVLASVIASIILGVRAGVVAYEETSVFYGLRYIVAALSLYDEYSNDVL 299
LAILLWPLAV GAV+ SVI+SI LG AGVV+Y+E+S +YGL YIVA++S+YDEYS D+L
Sbjct: 240 LAILLWPLAVTGAVIGSVISSIFLGAYAGVVSYQESSFYYGLCYIVASVSIYDEYSTDIL 299
Query: 300 DMPQRSCFPRPPFRKKDEMPXXXXXXXXXXXXXXXXXXXXXXXXXKNSIAELKPFELLDG 359
D+P+ SCFPRP +R+KDE P + + ++KP +LL+
Sbjct: 300 DLPEGSCFPRPKYRRKDEEP-TPFSGPVPRLGSVKNASSMRGGSVRVPMIDIKPLDLLNE 358
Query: 360 LCKECLQMGERLVSEGLITSEDIQETWFGKESRVISVGLPAYCLLQALLRSAKANSPGIL 419
L EC + GE L ++GLI S+DI+E K S+VISVGLPAY LL +LRS KANS G+L
Sbjct: 359 LFVECRRYGEVLATKGLINSKDIEEARSSKGSQVISVGLPAYGLLYEILRSVKANSSGLL 418
Query: 420 IS 421
+S
Sbjct: 419 LS 420
>AT5G40640.1 | Symbols: | unknown protein; LOCATED IN: plasma
membrane; EXPRESSED IN: 19 plant structures; EXPRESSED
DURING: 7 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT3G27390.1);
Has 104 Blast hits to 102 proteins in 14 species: Archae
- 0; Bacteria - 0; Metazoa - 2; Fungi - 0; Plants - 101;
Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink).
| chr5:16277345-16280258 FORWARD LENGTH=586
Length = 586
Score = 467 bits (1201), Expect = e-132, Method: Compositional matrix adjust.
Identities = 235/421 (55%), Positives = 293/421 (69%), Gaps = 3/421 (0%)
Query: 1 MEPPVEFWASLWSFICFLPYFIGLWLLGHIKGVIFCPLICLIMTIGNSAIILGLWVIHCI 60
MEPP ASLW FI F+PYF GL LLG IKG++ CPLICL + IGNSAIILGL +H I
Sbjct: 1 MEPPTGILASLWQFILFIPYFTGLLLLGVIKGIVLCPLICLTVAIGNSAIILGLLPVHAI 60
Query: 61 WTYYCVVRSKQLGPLFKLVMCTCLLPALLILWPXXXXXXXXXXXAAYGFFSPIVATFEAV 120
WT Y + +KQLGP+ K+ +C C+ P +ILW A YGF SPI ATF+AV
Sbjct: 61 WTLYSIASAKQLGPILKIFLCLCV-PLGVILWLVVSILGSVLGGAIYGFLSPIFATFDAV 119
Query: 121 EGGKENKIVHCFIDGTWSTVLKTFDIVKDVGNACFHTYFSVMDDLKEE-GNGRYYEIRPL 179
GK N HCF DGTWSTV +F +V D + CFH+YFS MDDL+ N YYEIR L
Sbjct: 120 GEGKSNPFFHCFYDGTWSTVQGSFTVVCDFKDVCFHSYFSFMDDLRTSTANRHYYEIRLL 179
Query: 180 YVPGAVVASVLGIIIDVPVISFVAIYKSPYMLFKGWNRLLHDLIGREGPFLETICVPLAG 239
+PGAV+ +VLGI++D PVIS +A+ KSPYMLFKGW+RL HDLIGREGPFLET+CVP+AG
Sbjct: 180 QIPGAVIVAVLGILVDFPVISLLALCKSPYMLFKGWHRLFHDLIGREGPFLETMCVPIAG 239
Query: 240 LAILLWPLAVVGAVLASVIASIILGVRAGVVAYEETSVFYGLRYIVAALSLYDEYSNDVL 299
L ILLWPLAVVGAVL SV++S+ LG GVV+Y+E+S F+GL Y+VA++S+YDEYSNDVL
Sbjct: 240 LVILLWPLAVVGAVLGSVVSSVFLGAYGGVVSYQESSFFFGLCYVVASVSIYDEYSNDVL 299
Query: 300 DMPQRSCFPRPPFRKKDEMPXXXXXXXXXXXXXXXXXXXXXXXXXKNSIAELKPFELLDG 359
DMP+ SCFPRP +R+ +E K + +LKP +LL+
Sbjct: 300 DMPEGSCFPRPIYRRNEE-GASTAFSGGLSRPNSFKTTPSRGGSNKGPMIDLKPLDLLEA 358
Query: 360 LCKECLQMGERLVSEGLITSEDIQETWFGKESRVISVGLPAYCLLQALLRSAKANSPGIL 419
L EC + GE +V++G+I S+DI+E K S+VIS GLPAY LL LLRS K+NS G+L
Sbjct: 359 LFVECRRHGEIMVTKGIINSKDIEEAKSSKGSQVISFGLPAYSLLHELLRSIKSNSTGLL 418
Query: 420 I 420
+
Sbjct: 419 L 419
>AT4G12680.1 | Symbols: | unknown protein; INVOLVED IN: vegetative
to reproductive phase transition of meristem; LOCATED
IN: endomembrane system; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G40640.1);
Has 103 Blast hits to 103 proteins in 14 species: Archae
- 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 103;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
| chr4:7475104-7478174 FORWARD LENGTH=575
Length = 575
Score = 320 bits (821), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 188/432 (43%), Positives = 267/432 (61%), Gaps = 12/432 (2%)
Query: 1 MEPPVEFWASLWSFICFLPYFIGLWLLGHIKGVIFCPLICLIMTIGNSAIILGLWVIHCI 60
ME P F+ LWSF+ FLPYF L LLG K +I P+ I+ +GNS +I+GLW H I
Sbjct: 1 MEVPKGFFEKLWSFVSFLPYFFLLLLLGVTKALIIGPISSAIILVGNSCVIIGLWPAHFI 60
Query: 61 WTYYCVVRSKQLGPLFKLVMCTCLLPALLILWPXXXXXXXXXXXAAYGFFSPIVATFEAV 120
WTYYC+ R+K++G + K + L P L+LWP AYGFF+P++ATFEAV
Sbjct: 61 WTYYCLARTKRIGLVLK-TLALVLFPLPLLLWPVAGIVGSLFGGIAYGFFTPLMATFEAV 119
Query: 121 EGGKENKIVHCFIDGTWSTVLKTFDIVKDVGNACFHTYFSVMDDLKE--EGNGRYYEIRP 178
+K HCF+DG++ST+ + +V D + CFH+YFS MD+L+E + EI+
Sbjct: 120 GESVTSKCYHCFVDGSFSTIKGSCTVVTDFTDFCFHSYFSYMDELREMVSADVEPLEIKL 179
Query: 179 LYVPGAVVASVLGIIIDVPVISFVAIYKSPYMLFKGWNRLLHDLIGREGPFLETICVPLA 238
+P ++AS++G+++DV +I+ VA+YKSPYML KGW RLL DL+GREGPFLE++CVP A
Sbjct: 180 SRLPSCLLASLIGVMVDVLLITAVAVYKSPYMLLKGWKRLLEDLVGREGPFLESVCVPFA 239
Query: 239 GLAILLWPLAVVGAVLASVIASIILGVRAGVVAYEETSVFYGLRYIVAALSLYDEYSNDV 298
GLAILLWPLAV GAV+ASV++S LG+ +GV+ ++E S GL YI+AA+SL+DEY ND+
Sbjct: 240 GLAILLWPLAVAGAVIASVLSSFFLGLYSGVIVHQEDSFRMGLNYIIAAVSLFDEYVNDL 299
Query: 299 LDMPQRSCFPRPPFRKKDEM---------PXXXXXXXXXXXXXXXXXXXXXXXXXKNSIA 349
L + + + PRP +R K E K +I
Sbjct: 300 LYLREGTSLPRPCYRTKTETVHGKRILGESKNVDLKSKRSSSLGSKLVSEQSRTLKKAIT 359
Query: 350 ELKPFELLDGLCKECLQMGERLVSEGLITSEDIQETWFGKESRVISVGLPAYCLLQALLR 409
KP ++ + L K C G L+ +GLI +D++E S+ + + LPA+ +LQ LL
Sbjct: 360 LYKPVQVWEWLFKSCEVNGRILLRDGLIDVKDVEECLVKGNSKKLYIKLPAWTVLQCLLA 419
Query: 410 SAKANSPGILIS 421
SAK+NS G++I+
Sbjct: 420 SAKSNSSGLVIT 431
>AT4G37030.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT4G12680.1); Has 101 Blast hits
to 99 proteins in 12 species: Archae - 0; Bacteria - 0;
Metazoa - 0; Fungi - 0; Plants - 101; Viruses - 0; Other
Eukaryotes - 0 (source: NCBI BLink). |
chr4:17452150-17454629 FORWARD LENGTH=569
Length = 569
Score = 249 bits (637), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 139/426 (32%), Positives = 226/426 (53%), Gaps = 21/426 (4%)
Query: 13 SFICFLPYFIGLWLLGHIKGVIFCPLICLIMTIGNSAIILGLWVIHCIWTYYCVVRSKQL 72
S++ F F + LG IKG+I P+ L + +GN +IL L+ H WT Y V ++ +
Sbjct: 15 SYVIFA--FCSAFFLGAIKGLIVGPIAGLTLIVGNVGVILCLFPAHVTWTIYAVAKTNRF 72
Query: 73 GPLFKLVMCTCLLPALLILWPXXXXXXXXXXXAAYGFFSPIVATFEAVEGGKE-NKIVHC 131
K+ + L PAL +W YGFF+P ++ FEA E NK HC
Sbjct: 73 DIPLKVAILVAL-PALFGIWLGLSLAISVLVGVGYGFFTPWISAFEAFRQDTESNKFFHC 131
Query: 132 FIDGTWSTVLKTFDIVKDVGNACFHTYFSVMDDLKEEG-NGRYYEIRPLYVPGAVVASVL 190
+DGTW T+ + +V D + C+H+Y + +L+E + +R ++VPG ++ +L
Sbjct: 132 LVDGTWGTIKGSCIVVTDFADFCYHSYPLYLKELRESPVSDELQTLRLIHVPGCIIVGIL 191
Query: 191 GIIIDVPVISFVAIYKSPYMLFKGWNRLLHDLIGREGPFLETICVPLAGLAILLWPLAVV 250
G++ID+P+ + +A+ KSPY+L KGW RL D I REGPFLE C+P+AGL +LLWP+ V+
Sbjct: 192 GLVIDIPLFTAIAVIKSPYLLLKGWYRLAQDAINREGPFLEIACIPVAGLTVLLWPIVVI 251
Query: 251 GAVLASVIASIILGVRAGVVAYEETSVFYGLRYIVAALSLYDEYSNDVLDMPQRSCFPRP 310
G +L ++ +SI +G+ VV ++E S G+ Y++A + +DEY+ND L + + + FP+P
Sbjct: 252 GFILVTIFSSIFVGLYGAVVVFQERSFRRGVSYVIAVVGEFDEYTNDWLYLREGTIFPKP 311
Query: 311 PFRK-----KDEMPXXXXXXXXXXXXXXXXX--------XXXXXXXXKNSIAELKPFELL 357
+R E+ + +I E++ ++
Sbjct: 312 RYRMGRGSFSSEVSVIVHPSDVTRVNSSGSVDAPAMLVPSLVHSVSVREAIQEVRMVQIW 371
Query: 358 DGLCKECLQMGERLVSEGLITSEDIQETW---FGKESRVISVGLPAYCLLQALLRSAKAN 414
+ + G+ L+ ++T D+ E+ G ES +I+VGLP+Y LL LL S KA
Sbjct: 372 EHMMGWFEMQGKELLDAEVLTPTDLYESLKGRHGNESSIINVGLPSYALLHTLLSSIKAG 431
Query: 415 SPGILI 420
G+L+
Sbjct: 432 VHGVLL 437