Miyakogusa Predicted Gene
- Lj0g3v0305589.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0305589.2 Non Characterized Hit- tr|I1N6X4|I1N6X4_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,74.57,0,seg,NULL;
FAMILY NOT NAMED,NULL,CUFF.20567.2
(518 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value
AT3G27390.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    581   e-166
AT5G40640.1 | Symbols: | unknown protein; LOCATED IN: plasma me...    558   e-159
AT4G12680.1 | Symbols: | unknown protein; INVOLVED IN: vegetati...    380   e-105
AT4G37030.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    319   2e-87
>AT3G27390.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane;
EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G40640.1); Has 101 Blast
hits to 99 proteins in 12 species: Archae - 0; Bacteria
- 0; Metazoa - 0; Fungi - 0; Plants - 101; Viruses - 0;
Other Eukaryotes - 0 (source: NCBI BLink). |
chr3:10133372-10136111 REVERSE LENGTH=588
Length = 588
Score = 581 bits (1497), Expect = e-166, Method: Compositional matrix adjust.
Identities = 289/522 (55%), Positives = 373/522 (71%), Gaps = 6/522 (1%)
Query: 1 MEPPVEFWASLWSFICFLPYFIGLWLLGHIKGVIFCPLICLIMTIGNSAIILGLWVIHCI 60
MEPP+ F ASL+ F+ FLPYFIGL LG IKG++ CPL+CL++TIGNSA+IL L +H +
Sbjct: 1 MEPPIGFRASLFQFLLFLPYFIGLLFLGFIKGIVLCPLVCLVVTIGNSAVILSLLPVHIV 60
Query: 61 WTYYCVVRSKQLGPLFKLVMCTCLLPALLILWPXXXXXXXXXXXAAYGFFSPIVATFEAV 120
WT+Y +V +KQ+GP+ K+ +C CL PA +ILWP A YGFFSPI ATF+AV
Sbjct: 61 WTFYSIVSAKQVGPILKIFLCLCL-PAAIILWPIVGILGSVLGGALYGFFSPIFATFDAV 119
Query: 121 EGGKENKIVHCFIDGTWSTVLKTFDIVKDVGNACFHTYFSVMDDLKEE-GNGRYYEIRPL 179
GK + HCF DGTWST+ ++F +V+D + CFH+YFS+MD+LK+ + +YYEIR L
Sbjct: 120 GEGKPYQFFHCFYDGTWSTMQRSFTVVRDFKDVCFHSYFSLMDELKQSCPDRKYYEIRLL 179
Query: 180 YVPGAVVASVLGIIIDVPVISFVAIYKSPYMLFKGWNRLLHDLIGREGPFLETICVPLAG 239
+PGA+V SVLGI++D PVIS VAI KSPYMLFKGW+RL HDLIGREGPFLET+CVP+AG
Sbjct: 180 QLPGALVVSVLGILVDPPVISLVAICKSPYMLFKGWHRLFHDLIGREGPFLETMCVPIAG 239
Query: 240 LAILLWPLAVVGAVLASVIASIILGVRAGVVAYEETSVFYGLRYIVAALSLYDEYSNDVL 299
LAILLWPLAV GAV+ SVI+SI LG AGVV+Y+E+S +YGL YIVA++S+YDEYS D+L
Sbjct: 240 LAILLWPLAVTGAVIGSVISSIFLGAYAGVVSYQESSFYYGLCYIVASVSIYDEYSTDIL 299
Query: 300 DMPQRSCFPRPPFRKKDEMPXXXXXXXXXXXXXXXXXXXXXXXXXKNSIAELKPFELLDG 359
D+P+ SCFPRP +R+KDE P + + ++KP +LL+
Sbjct: 300 DLPEGSCFPRPKYRRKDEEP-TPFSGPVPRLGSVKNASSMRGGSVRVPMIDIKPLDLLNE 358
Query: 360 LCKECLQMGERLVSEGLITSEDIQETWFGKESRVISVGLPAYCLLQALLRSAKANSPGXX 419
L EC + GE L ++GLI S+DI+E K S+VISVGLPAY LL +LRS KANS G
Sbjct: 359 LFVECRRYGEVLATKGLINSKDIEEARSSKGSQVISVGLPAYGLLYEILRSVKANSSGLL 418
Query: 420 XXX-XXXXXXXXRPKEKFFEWFLNPLLVMKEQIKAENFSVSEEDYLCKLVLFNCDPNRVK 478
RPK+ FF+WFLNP L++KEQ+KA N S EE+YL +LVL DP R+K
Sbjct: 419 LSDGVTEITTMNRPKDVFFDWFLNPFLILKEQMKATNLSEEEEEYLGRLVLLFGDPERLK 478
Query: 479 --NSTFTPPPECDRKRAELDALARRLQGITKFITRFPTYKRR 518
N+ PP +RKRAELDA ARR+QG+TK ++R+PT++R
Sbjct: 479 SSNAISASPPLTERKRAELDAFARRMQGLTKTVSRYPTFRRH 520
>AT5G40640.1 | Symbols: | unknown protein; LOCATED IN: plasma
membrane; EXPRESSED IN: 19 plant structures; EXPRESSED
DURING: 7 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT3G27390.1);
Has 104 Blast hits to 102 proteins in 14 species: Archae
- 0; Bacteria - 0; Metazoa - 2; Fungi - 0; Plants - 101;
Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink).
| chr5:16277345-16280258 FORWARD LENGTH=586
Length = 586
Score = 558 bits (1437), Expect = e-159, Method: Compositional matrix adjust.
Identities = 282/522 (54%), Positives = 356/522 (68%), Gaps = 6/522 (1%)
Query: 1 MEPPVEFWASLWSFICFLPYFIGLWLLGHIKGVIFCPLICLIMTIGNSAIILGLWVIHCI 60
MEPP ASLW FI F+PYF GL LLG IKG++ CPLICL + IGNSAIILGL +H I
Sbjct: 1 MEPPTGILASLWQFILFIPYFTGLLLLGVIKGIVLCPLICLTVAIGNSAIILGLLPVHAI 60
Query: 61 WTYYCVVRSKQLGPLFKLVMCTCLLPALLILWPXXXXXXXXXXXAAYGFFSPIVATFEAV 120
WT Y + +KQLGP+ K+ +C C+ P +ILW A YGF SPI ATF+AV
Sbjct: 61 WTLYSIASAKQLGPILKIFLCLCV-PLGVILWLVVSILGSVLGGAIYGFLSPIFATFDAV 119
Query: 121 EGGKENKIVHCFIDGTWSTVLKTFDIVKDVGNACFHTYFSVMDDLKEE-GNGRYYEIRPL 179
GK N HCF DGTWSTV +F +V D + CFH+YFS MDDL+ N YYEIR L
Sbjct: 120 GEGKSNPFFHCFYDGTWSTVQGSFTVVCDFKDVCFHSYFSFMDDLRTSTANRHYYEIRLL 179
Query: 180 YVPGAVVASVLGIIIDVPVISFVAIYKSPYMLFKGWNRLLHDLIGREGPFLETICVPLAG 239
+PGAV+ +VLGI++D PVIS +A+ KSPYMLFKGW+RL HDLIGREGPFLET+CVP+AG
Sbjct: 180 QIPGAVIVAVLGILVDFPVISLLALCKSPYMLFKGWHRLFHDLIGREGPFLETMCVPIAG 239
Query: 240 LAILLWPLAVVGAVLASVIASIILGVRAGVVAYEETSVFYGLRYIVAALSLYDEYSNDVL 299
L ILLWPLAVVGAVL SV++S+ LG GVV+Y+E+S F+GL Y+VA++S+YDEYSNDVL
Sbjct: 240 LVILLWPLAVVGAVLGSVVSSVFLGAYGGVVSYQESSFFFGLCYVVASVSIYDEYSNDVL 299
Query: 300 DMPQRSCFPRPPFRKKDEMPXXXXXXXXXXXXXXXXXXXXXXXXXKNSIAELKPFELLDG 359
DMP+ SCFPRP +R+ +E K + +LKP +LL+
Sbjct: 300 DMPEGSCFPRPIYRRNEE-GASTAFSGGLSRPNSFKTTPSRGGSNKGPMIDLKPLDLLEA 358
Query: 360 LCKECLQMGERLVSEGLITSEDIQETWFGKESRVISVGLPAYCLLQALLRSAKANSPGXX 419
L EC + GE +V++G+I S+DI+E K S+VIS GLPAY LL LLRS K+NS G
Sbjct: 359 LFVECRRHGEIMVTKGIINSKDIEEAKSSKGSQVISFGLPAYSLLHELLRSIKSNSTGLL 418
Query: 420 X-XXXXXXXXXXRPKEKFFEWFLNPLLVMKEQIKAENFSVSEEDYLCKLVLFNCDPNRVK 478
RPK+ FF+WFLNP L++K+QI+A N S EE+YL KLVL D R+K
Sbjct: 419 LGDGVTEITTRNRPKDAFFDWFLNPFLILKDQIEAANLSEEEEEYLGKLVLLFGDSERLK 478
Query: 479 NSTF--TPPPECDRKRAELDALARRLQGITKFITRFPTYKRR 518
+S PP + ++AELDA ARRLQG+TK ++R+PT++R
Sbjct: 479 SSIVESESPPLTELRKAELDAFARRLQGLTKSVSRYPTFRRH 520
>AT4G12680.1 | Symbols: | unknown protein; INVOLVED IN: vegetative
to reproductive phase transition of meristem; LOCATED
IN: endomembrane system; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G40640.1);
Has 103 Blast hits to 103 proteins in 14 species: Archae
- 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 103;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
| chr4:7475104-7478174 FORWARD LENGTH=575
Length = 575
Score = 380 bits (975), Expect = e-105, Method: Compositional matrix adjust.
Identities = 224/530 (42%), Positives = 317/530 (59%), Gaps = 14/530 (2%)
Query: 1 MEPPVEFWASLWSFICFLPYFIGLWLLGHIKGVIFCPLICLIMTIGNSAIILGLWVIHCI 60
ME P F+ LWSF+ FLPYF L LLG K +I P+ I+ +GNS +I+GLW H I
Sbjct: 1 MEVPKGFFEKLWSFVSFLPYFFLLLLLGVTKALIIGPISSAIILVGNSCVIIGLWPAHFI 60
Query: 61 WTYYCVVRSKQLGPLFKLVMCTCLLPALLILWPXXXXXXXXXXXAAYGFFSPIVATFEAV 120
WTYYC+ R+K++G + K + L P L+LWP AYGFF+P++ATFEAV
Sbjct: 61 WTYYCLARTKRIGLVLK-TLALVLFPLPLLLWPVAGIVGSLFGGIAYGFFTPLMATFEAV 119
Query: 121 EGGKENKIVHCFIDGTWSTVLKTFDIVKDVGNACFHTYFSVMDDLKE--EGNGRYYEIRP 178
+K HCF+DG++ST+ + +V D + CFH+YFS MD+L+E + EI+
Sbjct: 120 GESVTSKCYHCFVDGSFSTIKGSCTVVTDFTDFCFHSYFSYMDELREMVSADVEPLEIKL 179
Query: 179 LYVPGAVVASVLGIIIDVPVISFVAIYKSPYMLFKGWNRLLHDLIGREGPFLETICVPLA 238
+P ++AS++G+++DV +I+ VA+YKSPYML KGW RLL DL+GREGPFLE++CVP A
Sbjct: 180 SRLPSCLLASLIGVMVDVLLITAVAVYKSPYMLLKGWKRLLEDLVGREGPFLESVCVPFA 239
Query: 239 GLAILLWPLAVVGAVLASVIASIILGVRAGVVAYEETSVFYGLRYIVAALSLYDEYSNDV 298
GLAILLWPLAV GAV+ASV++S LG+ +GV+ ++E S GL YI+AA+SL+DEY ND+
Sbjct: 240 GLAILLWPLAVAGAVIASVLSSFFLGLYSGVIVHQEDSFRMGLNYIIAAVSLFDEYVNDL 299
Query: 299 LDMPQRSCFPRPPFRKKDEM---------PXXXXXXXXXXXXXXXXXXXXXXXXXKNSIA 349
L + + + PRP +R K E K +I
Sbjct: 300 LYLREGTSLPRPCYRTKTETVHGKRILGESKNVDLKSKRSSSLGSKLVSEQSRTLKKAIT 359
Query: 350 ELKPFELLDGLCKECLQMGERLVSEGLITSEDIQETWFGKESRVISVGLPAYCLLQALLR 409
KP ++ + L K C G L+ +GLI +D++E S+ + + LPA+ +LQ LL
Sbjct: 360 LYKPVQVWEWLFKSCEVNGRILLRDGLIDVKDVEECLVKGNSKKLYIKLPAWTVLQCLLA 419
Query: 410 SAKANSPGXXXXXXXXXXXXXRPKEKFFEWFLNPLLVMKEQIKAENFSVSEEDYLCKLVL 469
SAK+NS G P++K F W + PLL+MKEQIK + EE L KLV+
Sbjct: 420 SAKSNSSGLVITDGVELTELNSPRDKVFVWLVGPLLIMKEQIKNLKLTEDEEFCLRKLVM 479
Query: 470 FNCDPNRVKNSTFTPPPECDR-KRAELDALARRLQGITKFITRFPTYKRR 518
C R ++ T P D ++A+L A+ RRLQG+ ++R PT++RR
Sbjct: 480 V-CKNERTEDWDNTGFPSSDTVRKAQLQAIIRRLQGMVASMSRIPTFRRR 528
>AT4G37030.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT4G12680.1); Has 101 Blast hits
to 99 proteins in 12 species: Archae - 0; Bacteria - 0;
Metazoa - 0; Fungi - 0; Plants - 101; Viruses - 0; Other
Eukaryotes - 0 (source: NCBI BLink). |
chr4:17452150-17454629 FORWARD LENGTH=569
Length = 569
Score = 319 bits (818), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 170/525 (32%), Positives = 281/525 (53%), Gaps = 22/525 (4%)
Query: 13 SFICFLPYFIGLWLLGHIKGVIFCPLICLIMTIGNSAIILGLWVIHCIWTYYCVVRSKQL 72
S++ F F + LG IKG+I P+ L + +GN +IL L+ H WT Y V ++ +
Sbjct: 15 SYVIFA--FCSAFFLGAIKGLIVGPIAGLTLIVGNVGVILCLFPAHVTWTIYAVAKTNRF 72
Query: 73 GPLFKLVMCTCLLPALLILWPXXXXXXXXXXXAAYGFFSPIVATFEAVEGGKE-NKIVHC 131
K+ + L PAL +W YGFF+P ++ FEA E NK HC
Sbjct: 73 DIPLKVAILVAL-PALFGIWLGLSLAISVLVGVGYGFFTPWISAFEAFRQDTESNKFFHC 131
Query: 132 FIDGTWSTVLKTFDIVKDVGNACFHTYFSVMDDLKEEG-NGRYYEIRPLYVPGAVVASVL 190
+DGTW T+ + +V D + C+H+Y + +L+E + +R ++VPG ++ +L
Sbjct: 132 LVDGTWGTIKGSCIVVTDFADFCYHSYPLYLKELRESPVSDELQTLRLIHVPGCIIVGIL 191
Query: 191 GIIIDVPVISFVAIYKSPYMLFKGWNRLLHDLIGREGPFLETICVPLAGLAILLWPLAVV 250
G++ID+P+ + +A+ KSPY+L KGW RL D I REGPFLE C+P+AGL +LLWP+ V+
Sbjct: 192 GLVIDIPLFTAIAVIKSPYLLLKGWYRLAQDAINREGPFLEIACIPVAGLTVLLWPIVVI 251
Query: 251 GAVLASVIASIILGVRAGVVAYEETSVFYGLRYIVAALSLYDEYSNDVLDMPQRSCFPRP 310
G +L ++ +SI +G+ VV ++E S G+ Y++A + +DEY+ND L + + + FP+P
Sbjct: 252 GFILVTIFSSIFVGLYGAVVVFQERSFRRGVSYVIAVVGEFDEYTNDWLYLREGTIFPKP 311
Query: 311 PFRK-----KDEMPXXXXXXXXXXXXXXXXX--------XXXXXXXXKNSIAELKPFELL 357
+R E+ + +I E++ ++
Sbjct: 312 RYRMGRGSFSSEVSVIVHPSDVTRVNSSGSVDAPAMLVPSLVHSVSVREAIQEVRMVQIW 371
Query: 358 DGLCKECLQMGERLVSEGLITSEDIQETW---FGKESRVISVGLPAYCLLQALLRSAKAN 414
+ + G+ L+ ++T D+ E+ G ES +I+VGLP+Y LL LL S KA
Sbjct: 372 EHMMGWFEMQGKELLDAEVLTPTDLYESLKGRHGNESSIINVGLPSYALLHTLLSSIKAG 431
Query: 415 SPGXXXXXXXXXXXXXRPKEKFFEWFLNPLLVMKEQIKAENFSVSEEDYLCKLVLFNCDP 474
G RP++KF +W NP++V+K+QI+A SE YL K+VLF
Sbjct: 432 VHGVLLLDGSEVTHLNRPQDKFLDWVFNPIMVLKDQIRALKLGESEVKYLEKVVLFGNHE 491
Query: 475 NRVKN-STFTPPPECDRKRAELDALARRLQGITKFITRFPTYKRR 518
R++ + PP+ + + A++ ++RR+ G+ + +++ PTY+RR
Sbjct: 492 QRMEAWDNHSNPPQENLRTAQIQGISRRMMGMVRSVSKLPTYRRR 536
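
Note on reading the per-hit statistics: the "Score = ..." and "Identities = ..." lines above can be pulled out programmatically. The following is a minimal sketch in Python, not part of the BLAST output. The function names (parse_expect, parse_hsp_header) and the regular expressions are illustrative assumptions written against the line layout seen in this report only. One detail worth handling is that this BLAST version prints E-values such as "e-166" without a leading "1", which Python's float() rejects as written.

```python
import re

# Patterns written against the layout of this report (an assumption;
# other BLAST versions or output formats may differ).
SCORE_RE = re.compile(r"Score = +([\d.]+) bits \((\d+)\), Expect = (\S+?),")
IDENT_RE = re.compile(r"Identities = (\d+)/(\d+) \((\d+)%\)")


def parse_expect(text):
    """Convert an E-value string such as 'e-166' or '2e-87' to a float."""
    # Values like 'e-166' lack the leading mantissa, so prepend '1'.
    return float("1" + text if text.startswith("e") else text)


def parse_hsp_header(score_line, ident_line):
    """Extract score, E-value and identity counts from the two header lines."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(ident_line)
    if s is None or i is None:
        raise ValueError("not a recognised Score/Identities line pair")
    matches, length = int(i.group(1)), int(i.group(2))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "expect": parse_expect(s.group(3)),
        "identities": matches,
        "align_len": length,
        # Unrounded percentage; the report prints a whole number.
        "pct_identity": 100.0 * matches / length,
    }


if __name__ == "__main__":
    hsp = parse_hsp_header(
        "Score = 581 bits (1497), Expect = e-166, Method: Compositional matrix adjust.",
        "Identities = 289/522 (55%), Positives = 373/522 (71%), Gaps = 6/522 (1%)",
    )
    print(hsp)
```

The whole-number percentages printed in this report are consistent with truncation rather than rounding (for example, Positives = 317/530 is shown as 59%), so comparing int(pct_identity) against the printed value is a reasonable sanity check.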