Miyakogusa Predicted Gene

chr1.CM0955.30.nd
Show Alignment: 
BLASTP 2.2.18 [Mar-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= chr1.CM0955.30.nd + phase: 1 /partial
         (297 letters)

Database: trembl 
           6,964,485 sequences; 2,268,126,488 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

A7PKP0_VITVI (tr|A7PKP0) Chromosome chr7 scaffold_20, whole geno...   151   5e-35
B6SH76_MAIZE (tr|B6SH76) Putative uncharacterized protein OS=Zea...   123   2e-26
Q6YYY1_ORYSJ (tr|Q6YYY1) Putative uncharacterized protein P0604E...   120   8e-26
A3BVR7_ORYSJ (tr|A3BVR7) Putative uncharacterized protein OS=Ory...   119   2e-25
A2Z492_ORYSI (tr|A2Z492) Putative uncharacterized protein OS=Ory...   114   7e-24
A3C1Q1_ORYSJ (tr|A3C1Q1) Putative uncharacterized protein OS=Ory...   114   7e-24
Q7Y216_ARATH (tr|Q7Y216) Putative uncharacterized protein At3g45...   112   2e-23
Q9M168_ARATH (tr|Q9M168) Putative uncharacterized protein T6D9_8...   111   5e-23
A2YY46_ORYSI (tr|A2YY46) Putative uncharacterized protein OS=Ory...   110   2e-22
Q652N4_ORYSJ (tr|Q652N4) Putative uncharacterized protein OJ1003...    72   6e-11
A9T2T7_PHYPA (tr|A9T2T7) Predicted protein (Fragment) OS=Physcom...    63   2e-08
A9SX13_PHYPA (tr|A9SX13) Predicted protein OS=Physcomitrella pat...    58   1e-06
A4S2T1_OSTLU (tr|A4S2T1) Predicted protein OS=Ostreococcus lucim...    52   5e-05

>A7PKP0_VITVI (tr|A7PKP0) Chromosome chr7 scaffold_20, whole genome shotgun
           sequence OS=Vitis vinifera GN=GSVIVT00019256001 PE=4
           SV=1
          Length = 314

 Score =  151 bits (382), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 73/165 (44%), Positives = 104/165 (63%), Gaps = 11/165 (6%)

Query: 6   SVTKNVETYFNYGKQNEESLTKLFITLLVKLASVESLWQQGICASVYEGSWILKTWNRPY 65
           +VTKNV  +  YG+ N+ESL +LF+TLL+KL S+E+LW +G+CAS+Y+GSWI KTW+   
Sbjct: 3   TVTKNVINFLGYGEVNKESLAELFVTLLLKLQSIETLWSKGLCASIYDGSWIYKTWDSGV 62

Query: 66  SMRQIEDFMDRSQNVARAVGAAKAEIIYRCIHKSLSYLSQFLNDQIQGIELMDLLFGTHT 125
               +EDF DRSQNVARAV   +   IY+CIH SL ++S F+N +++G +L   LFG   
Sbjct: 63  GCINVEDFTDRSQNVARAVATKQVTKIYKCIHHSLHWISVFMNGRMEGPKLRRRLFGQDH 122

Query: 126 VSTP-----------GAGGTSNINGNNLPSPENLCPQKKPRLMEG 159
           V  P             G + + N  +  + ++    KKPRLM+G
Sbjct: 123 VPKPLSGPGQLPVPEDGGKSLDENAASALTQDDFIQTKKPRLMDG 167


>B6SH76_MAIZE (tr|B6SH76) Putative uncharacterized protein OS=Zea mays PE=2 SV=1
          Length = 690

 Score =  123 bits (308), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 56/121 (46%), Positives = 80/121 (66%)

Query: 1   GADLVSVTKNVETYFNYGKQNEESLTKLFITLLVKLASVESLWQQGICASVYEGSWILKT 60
           G D  S+ KNV  +  +G  N+ES+ +LF++L+ KL SVE LW+QG+CAS +EG+WI KT
Sbjct: 270 GTDFASIEKNVSLFQGFGHSNKESIAELFVSLMSKLVSVEGLWEQGLCASNFEGTWISKT 329

Query: 61  WNRPYSMRQIEDFMDRSQNVARAVGAAKAEIIYRCIHKSLSYLSQFLNDQIQGIELMDLL 120
           W +      +EDF+DRSQN AR+VG  + + I  C+  S+S LS+F   +I   +L  LL
Sbjct: 330 WAKGVGNLNVEDFLDRSQNFARSVGVKEMQKICECLRASVSDLSKFSKGEIAAPKLKALL 389

Query: 121 F 121
           F
Sbjct: 390 F 390


>Q6YYY1_ORYSJ (tr|Q6YYY1) Putative uncharacterized protein P0604E01.3
           (Os08g0559900 protein) (Putative uncharacterized protein
           P0562A06.26) OS=Oryza sativa subsp. japonica
           GN=P0604E01.3 PE=4 SV=1
          Length = 581

 Score =  120 bits (302), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 56/121 (46%), Positives = 79/121 (65%)

Query: 1   GADLVSVTKNVETYFNYGKQNEESLTKLFITLLVKLASVESLWQQGICASVYEGSWILKT 60
           G D  S+ +NV     +G +N+ES+ +LF++L+ KL SVE LW+QG+CAS +EGSWI KT
Sbjct: 269 GPDFPSIQRNVSLVEGFGSRNKESVAELFVSLMSKLLSVEGLWEQGLCASNFEGSWIFKT 328

Query: 61  WNRPYSMRQIEDFMDRSQNVARAVGAAKAEIIYRCIHKSLSYLSQFLNDQIQGIELMDLL 120
           W R      +EDF+DRSQN ARAVG  + + I  CI  ++  L+ F   +I   +L +LL
Sbjct: 329 WERGVGNLSVEDFLDRSQNFARAVGKEEMQKISECIRVAVLNLNNFFRGKIDAPKLKNLL 388

Query: 121 F 121
           F
Sbjct: 389 F 389


>A3BVR7_ORYSJ (tr|A3BVR7) Putative uncharacterized protein OS=Oryza sativa subsp.
           japonica GN=OsJ_027139 PE=4 SV=1
          Length = 548

 Score =  119 bits (299), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 56/121 (46%), Positives = 79/121 (65%)

Query: 1   GADLVSVTKNVETYFNYGKQNEESLTKLFITLLVKLASVESLWQQGICASVYEGSWILKT 60
           G D  S+ +NV     +G +N+ES+ +LF++L+ KL SVE LW+QG+CAS +EGSWI KT
Sbjct: 251 GPDFPSIQRNVSLVEGFGSRNKESVAELFVSLMSKLLSVEGLWEQGLCASNFEGSWIFKT 310

Query: 61  WNRPYSMRQIEDFMDRSQNVARAVGAAKAEIIYRCIHKSLSYLSQFLNDQIQGIELMDLL 120
           W R      +EDF+DRSQN ARAVG  + + I  CI  ++  L+ F   +I   +L +LL
Sbjct: 311 WERGVGNLSVEDFLDRSQNFARAVGKEEMQKISECIRVAVLNLNNFFRGKIDAPKLKNLL 370

Query: 121 F 121
           F
Sbjct: 371 F 371


>A2Z492_ORYSI (tr|A2Z492) Putative uncharacterized protein OS=Oryza sativa subsp.
            indica GN=OsI_031385 PE=4 SV=1
          Length = 1170

 Score =  114 bits (285), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 51/122 (41%), Positives = 77/122 (63%)

Query: 1    GADLVSVTKNVETYFNYGKQNEESLTKLFITLLVKLASVESLWQQGICASVYEGSWILKT 60
            G+D  SV +N   +  +G+ N+E++ +LF++L+ KL S ESLW+ G+CAS +E SWI KT
Sbjct: 892  GSDFESVERNTLAFKGFGRTNKETVAELFVSLISKLLSAESLWEHGLCASNFEASWISKT 951

Query: 61   WNRPYSMRQIEDFMDRSQNVARAVGAAKAEIIYRCIHKSLSYLSQFLNDQIQGIELMDLL 120
            W +      +EDF+DRSQN AR+VG  + + I RC+      L  F+  ++   +L  LL
Sbjct: 952  WKKGIGNLNVEDFLDRSQNFARSVGKKEMQKICRCLRDCALNLLDFMRGKLDTSKLKTLL 1011

Query: 121  FG 122
            FG
Sbjct: 1012 FG 1013


>A3C1Q1_ORYSJ (tr|A3C1Q1) Putative uncharacterized protein OS=Oryza sativa subsp.
            japonica GN=OsJ_029223 PE=4 SV=1
          Length = 1170

 Score =  114 bits (285), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 51/122 (41%), Positives = 77/122 (63%)

Query: 1    GADLVSVTKNVETYFNYGKQNEESLTKLFITLLVKLASVESLWQQGICASVYEGSWILKT 60
            G+D  SV +N   +  +G+ N+E++ +LF++L+ KL S ESLW+ G+CAS +E SWI KT
Sbjct: 892  GSDFESVERNTLAFKGFGRTNKETVAELFVSLISKLLSAESLWEHGLCASNFEASWISKT 951

Query: 61   WNRPYSMRQIEDFMDRSQNVARAVGAAKAEIIYRCIHKSLSYLSQFLNDQIQGIELMDLL 120
            W +      +EDF+DRSQN AR+VG  + + I RC+      L  F+  ++   +L  LL
Sbjct: 952  WKKGIGNLNVEDFLDRSQNFARSVGKKEMQKICRCLRDCALNLLDFMRGKLDTSKLKTLL 1011

Query: 121  FG 122
            FG
Sbjct: 1012 FG 1013


>Q7Y216_ARATH (tr|Q7Y216) Putative uncharacterized protein At3g45750
           OS=Arabidopsis thaliana GN=At3g45750 PE=2 SV=1
          Length = 682

 Score =  112 bits (281), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 55/127 (43%), Positives = 80/127 (62%), Gaps = 1/127 (0%)

Query: 1   GADLVSVTKNVETYFNYGKQNEESLTKLFITLLVKLASVESLWQQGICASVYEGSWILKT 60
           G D  +V K  + + N+G++N+ESL +LF T  +KL SVE LW+QG+C SV  G WI K 
Sbjct: 250 GMDPPNVEKRAQKFLNWGQRNQESLGRLFATFFIKLQSVEFLWRQGLCVSVLNGLWISKK 309

Query: 61  WNRP-YSMRQIEDFMDRSQNVARAVGAAKAEIIYRCIHKSLSYLSQFLNDQIQGIELMDL 119
           W +       +EDF + SQNVAR V  A A+ IY  I++++  + +FLND++ G +L   
Sbjct: 310 WKKVGVGSISVEDFTNISQNVARRVNGAGAKKIYSSINRTVEDIFEFLNDKVAGTDLRHR 369

Query: 120 LFGTHTV 126
           LFG  +V
Sbjct: 370 LFGKGSV 376


>Q9M168_ARATH (tr|Q9M168) Putative uncharacterized protein T6D9_80 OS=Arabidopsis
           thaliana GN=T6D9_80 PE=4 SV=1
          Length = 690

 Score =  111 bits (278), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 55/135 (40%), Positives = 81/135 (60%), Gaps = 9/135 (6%)

Query: 1   GADLVSVTKNVETYFNYGKQNEESLTKLFITLLVKLASVESLWQQGICASVYEGSWILKT 60
           G D  +V K  + + N+G++N+ESL +LF T  +KL SVE LW+QG+C SV  G WI K 
Sbjct: 250 GMDPPNVEKRAQKFLNWGQRNQESLGRLFATFFIKLQSVEFLWRQGLCVSVLNGLWISKK 309

Query: 61  WNRP---------YSMRQIEDFMDRSQNVARAVGAAKAEIIYRCIHKSLSYLSQFLNDQI 111
           W +            +  +EDF + SQNVAR V  A A+ IY  I++++  + +FLND++
Sbjct: 310 WKKVGVGSISVSYKKLYSVEDFTNISQNVARRVNGAGAKKIYSSINRTVEDIFEFLNDKV 369

Query: 112 QGIELMDLLFGTHTV 126
            G +L   LFG  +V
Sbjct: 370 AGTDLRHRLFGKGSV 384


>A2YY46_ORYSI (tr|A2YY46) Putative uncharacterized protein OS=Oryza sativa subsp.
           indica GN=OsI_029239 PE=4 SV=1
          Length = 565

 Score =  110 bits (274), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 56/136 (41%), Positives = 79/136 (58%), Gaps = 15/136 (11%)

Query: 1   GADLVSVTKNVETYFNYGKQNEESLTKLFITLLVK---------------LASVESLWQQ 45
           G D  S+ +NV     +G +N+ES+ +LF++L+ K               L SVE LW+Q
Sbjct: 251 GPDFPSIQRNVSLVEGFGSRNKESVAELFVSLMSKAKLFAYFGVVYTYEKLLSVEGLWEQ 310

Query: 46  GICASVYEGSWILKTWNRPYSMRQIEDFMDRSQNVARAVGAAKAEIIYRCIHKSLSYLSQ 105
           G+CAS +EGSWI KTW R      +EDF+DRSQN ARAVG  + + I  CI  ++  L+ 
Sbjct: 311 GLCASNFEGSWIFKTWERGVGNLSVEDFLDRSQNFARAVGKEEMQKISECIRVAVLNLNN 370

Query: 106 FLNDQIQGIELMDLLF 121
           F   +I   +L +LLF
Sbjct: 371 FFRGKIDAPKLKNLLF 386


>Q652N4_ORYSJ (tr|Q652N4) Putative uncharacterized protein OJ1003_C09.37
           (Os09g0570600 protein) OS=Oryza sativa subsp. japonica
           GN=OJ1003_C09.37 PE=4 SV=1
          Length = 310

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 29/63 (46%), Positives = 45/63 (71%)

Query: 1   GADLVSVTKNVETYFNYGKQNEESLTKLFITLLVKLASVESLWQQGICASVYEGSWILKT 60
           G+D  SV +N   +  +G+ N+E++ +LF++L+ KL S ESLW+ G+CAS +E SWI KT
Sbjct: 215 GSDFESVERNTLAFKGFGRTNKETVAELFVSLISKLLSAESLWEHGLCASNFEASWISKT 274

Query: 61  WNR 63
           W +
Sbjct: 275 WKK 277


>A9T2T7_PHYPA (tr|A9T2T7) Predicted protein (Fragment) OS=Physcomitrella patens
           subsp. patens GN=PHYPADRAFT_46237 PE=4 SV=1
          Length = 172

 Score = 63.2 bits (152), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 25/47 (53%), Positives = 35/47 (74%)

Query: 17  YGKQNEESLTKLFITLLVKLASVESLWQQGICASVYEGSWILKTWNR 63
           +G+ N+E+L +LF +   K  +VESLW+QG+CASVYEG WI K W +
Sbjct: 117 FGRDNKETLGQLFGSFFTKFLAVESLWEQGLCASVYEGKWISKVWAK 163


>A9SX13_PHYPA (tr|A9SX13) Predicted protein OS=Physcomitrella patens subsp.
           patens GN=PHYPADRAFT_166963 PE=4 SV=1
          Length = 1171

 Score = 57.8 bits (138), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/111 (36%), Positives = 57/111 (51%), Gaps = 8/111 (7%)

Query: 17  YGKQNEESLTKLFITLLVKLASVESLWQQGICASVYEGSWILKT-----WNRP-YSMRQI 70
           +G+ N+ S+ +LF++   + ASV+SLW  G+  S + G W   T     WNR  Y+MR +
Sbjct: 680 FGQDNKCSIGQLFLSFFGQFASVKSLWVNGLAVSPFWGEWGDSTTTNPAWNRKQYAMR-V 738

Query: 71  EDFMDRSQNVARAVGAAKAEIIYRCIHKSLSYLSQFLNDQIQGIELMDLLF 121
           ED  DR  N AR++  A   II      +   L Q   D  Q + L  LLF
Sbjct: 739 EDPFDRMDNCARSIQDAGLPIICNSFAAAFESLLQ-PPDWDQLLSLRQLLF 788


>A4S2T1_OSTLU (tr|A4S2T1) Predicted protein OS=Ostreococcus lucimarinus (strain
           CCE9901) GN=OSTLU_94724 PE=4 SV=1
          Length = 633

 Score = 52.0 bits (123), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 47/201 (23%), Positives = 85/201 (42%), Gaps = 26/201 (12%)

Query: 16  NYGKQNEESLTKLFITLLVKLASVESLWQQGICASVYEGSWIL-KTWNRPYSMRQIEDFM 74
           +   +N E+L +LF++    L +++ L++  + AS Y G++I+  +W        +ED  
Sbjct: 230 DIAAENTETLAELFVSFFAHLCAIKDLFRNAVNASTYHGTFIVGSSWQAFKYPLGVEDPF 289

Query: 75  DRSQNVARAVGAAKAEIIYRCIHKSLSYLSQFLN--DQIQGIELMDLLFGTHTV------ 126
               NVARAV     + +      + + +S+ L+  D +Q +  +  L G  +V      
Sbjct: 290 AAGDNVARAVQMRTRDYVLNAFPAACADISKMLHATDNVQFMRSLLCLLGDKSVPSEVLA 349

Query: 127 ----STPGAGGTSNINGNNLP-------SPENLCPQKKPRLMEGIVENLAKRHSQG---- 171
               + PG GG     G  LP        P  +  Q    L E  ++ L ++ + G    
Sbjct: 350 RLRPTLPGMGGAPQPPG--LPGAPRPPQGPPVMLQQPAKSLNEHTLDMLGRQVAPGASAE 407

Query: 172 NILAQYLQGKEFQGMGLMQQS 192
            ILA   + ++ Q      QS
Sbjct: 408 EILAMLTRQRQVQAEAQRDQS 428