KMC010089A_c01

Fasta Sequence
>KMC010089A_C01 KMC010089A_c01
ttgacgaaatgtcatctatcattttatggtcacttattccattttagtattctataaaaa
aacaaaaaatAATGATAACTGTCTGTTGGTGAAAGAGGGAGCACTGAGTATTCGGTTTTA
CATTGGGGTGGGGGAGGGTAGACCTTTATCTTTCTATGTAAGTTCTTGCCTTTGGATTTT
CATGTCTAATATATTGAAATCAGACATGAAAAATCAAAGGAAATATGTAGATGAAGGCAA
TATCAGATGGGAAGACATAAGAGGGCAGCATCATCCCCTTACAGTTTTTCCATTATTCAA
GGTCAAGGATCCTGCACCCAAACCCTCACAGTTCTATCACCATCAAGGCCAGCTGAAGCA
ATTTTATTCTCTGTTGGGTGACAGGTAACGGATATTACAGTGTCTGTATGGCCTTCAAGT
TTCTGAATCATATTTTTCTGCTGAAGATCCCACAAATAAACACATCGATCTTCTGAACCG
CTAACAATGTATCTCCCATTTGTAACAGAAT
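
The BLASTX alignments in this record are all reported in frame -3, with the best one starting at query position 509. As a minimal sketch of what that means, the Python below reverse-complements the contig and translates it from the third base of the reverse strand using the standard genetic code. The names `CONTIG`, `reverse_complement` and `translate_minus_frame` are illustrative assumptions, not part of the original record or of BLAST itself.

```python
# Contig sequence copied from the FASTA record above (case is ignored).
CONTIG = (
    "ttgacgaaatgtcatctatcattttatggtcacttattccattttagtattctataaaaa"
    "aacaaaaaatAATGATAACTGTCTGTTGGTGAAAGAGGGAGCACTGAGTATTCGGTTTTA"
    "CATTGGGGTGGGGGAGGGTAGACCTTTATCTTTCTATGTAAGTTCTTGCCTTTGGATTTT"
    "CATGTCTAATATATTGAAATCAGACATGAAAAATCAAAGGAAATATGTAGATGAAGGCAA"
    "TATCAGATGGGAAGACATAAGAGGGCAGCATCATCCCCTTACAGTTTTTCCATTATTCAA"
    "GGTCAAGGATCCTGCACCCAAACCCTCACAGTTCTATCACCATCAAGGCCAGCTGAAGCA"
    "ATTTTATTCTCTGTTGGGTGACAGGTAACGGATATTACAGTGTCTGTATGGCCTTCAAGT"
    "TTCTGAATCATATTTTTCTGCTGAAGATCCCACAAATAAACACATCGATCTTCTGAACCG"
    "CTAACAATGTATCTCCCATTTGTAACAGAAT"
)

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (upper-cased)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_minus_frame(seq, frame):
    """Translate the reverse strand; frame is 1, 2 or 3 for BLAST frames -1, -2, -3."""
    rc = reverse_complement(seq)
    codons = (rc[i:i + 3] for i in range(frame - 1, len(rc) - 2, 3))
    # Codons containing anything other than A, C, G, T are shown as X.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


protein = translate_minus_frame(CONTIG, 3)
# The first 67 residues correspond to query positions 509 down to 309.
print(protein[:67])
```

For a 511-base query, frame -3 skips the last two bases, so the first codon ends at position 509, which matches the start coordinate of the top-scoring alignments.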


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC010089A_C01 KMC010089A_c01
         (511 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_192182.1| G-protein beta family; protein id: At4g02730.1,...   108  4e-23
gb|AAH08547.1| Unknown (protein for IMAGE:3591165) [Mus musculus]      98  5e-20
ref|NP_060058.1| WD repeat domain 5 protein; WD-repeat protein 5...    98  5e-20
emb|CAB66159.1| hypothetical protein [Homo sapiens]                    98  5e-20
ref|NP_061942.2| WD repeat domain 5B [Homo sapiens] gi|27695090|...    98  6e-20

>ref|NP_192182.1| G-protein beta family; protein id: At4g02730.1, supported by cDNA:
           41490., supported by cDNA: gi_16612251 [Arabidopsis
           thaliana] gi|25387430|pir||G85034 probable WD-repeat
           protein [imported] - Arabidopsis thaliana
           gi|4263521|gb|AAD15347.1| putative WD-repeat protein
           [Arabidopsis thaliana] gi|7269758|emb|CAB77758.1|
           putative WD-repeat protein [Arabidopsis thaliana]
           gi|16612252|gb|AAL27497.1|AF439825_1 AT4g02730/T5J8_2
           [Arabidopsis thaliana] gi|21593699|gb|AAM65666.1|
           putative WD-repeat protein [Arabidopsis thaliana]
           gi|21928079|gb|AAM78068.1| AT4g02730/T5J8_2 [Arabidopsis
           thaliana]
          Length = 333

 Score =  108 bits (270), Expect = 4e-23
 Identities = 46/67 (68%), Positives = 59/67 (87%)
 Frame = -3

Query: 509 SVTNGRYIVSGSEDRCVYLWDLQQKNMIQKLEGHTDTVISVTCHPTENKIASAGLDGDRT 330
           SVTNG+YIVSGSED CVYLWDLQ +N++Q+LEGHTD VISV+CHP +N+I+S+G   D+T
Sbjct: 266 SVTNGKYIVSGSEDNCVYLWDLQARNILQRLEGHTDAVISVSCHPVQNEISSSGNHLDKT 325

Query: 329 VRVWVQD 309
           +R+W QD
Sbjct: 326 IRIWKQD 332

 Score = 33.5 bits (75), Expect = 1.5
 Identities = 16/61 (26%), Positives = 29/61 (47%)
 Frame = -3

Query: 500 NGRYIVSGSEDRCVYLWDLQQKNMIQKLEGHTDTVISVTCHPTENKIASAGLDGDRTVRV 321
           +G  + S S D+ + LW     ++I + EGH+  +  +      +   SA    D T+R+
Sbjct: 54  DGNLLASASVDKTMILWSATNYSLIHRYEGHSSGISDLAWSSDSHYTCSA--SDDCTLRI 111

Query: 320 W 318
           W
Sbjct: 112 W 112

 Score = 32.7 bits (73), Expect = 2.5
 Identities = 16/57 (28%), Positives = 32/57 (56%)
 Frame = -3

Query: 488 IVSGSEDRCVYLWDLQQKNMIQKLEGHTDTVISVTCHPTENKIASAGLDGDRTVRVW 318
           IVSGS D  + +W+++    ++ ++ H+  + SV  +   + I SA  DG  + ++W
Sbjct: 143 IVSGSFDETIRIWEVKTGKCVRMIKAHSMPISSVHFNRDGSLIVSASHDG--SCKIW 197

>gb|AAH08547.1| Unknown (protein for IMAGE:3591165) [Mus musculus]
          Length = 199

 Score = 98.2 bits (243), Expect = 5e-20
 Identities = 42/67 (62%), Positives = 54/67 (79%)
 Frame = -3

Query: 509 SVTNGRYIVSGSEDRCVYLWDLQQKNMIQKLEGHTDTVISVTCHPTENKIASAGLDGDRT 330
           SVT G++IVSGSED  VY+W+LQ K ++QKL+GHTD VIS  CHPTEN IASA L+ D+T
Sbjct: 132 SVTGGKWIVSGSEDNLVYIWNLQTKEIVQKLQGHTDVVISTACHPTENIIASAALENDKT 191

Query: 329 VRVWVQD 309
           +++W  D
Sbjct: 192 IKLWKSD 198

 Score = 37.0 bits (84), Expect = 0.13
 Identities = 20/57 (35%), Positives = 31/57 (54%)
 Frame = -3

Query: 488 IVSGSEDRCVYLWDLQQKNMIQKLEGHTDTVISVTCHPTENKIASAGLDGDRTVRVW 318
           IVSGS D  V +WD++    ++ L  H+D V +V  +   + I S+  DG    R+W
Sbjct: 9   IVSGSFDESVRIWDVKTGKCLKTLPAHSDPVSAVHFNRDGSLIVSSSYDG--LCRIW 63

>ref|NP_060058.1| WD repeat domain 5 protein; WD-repeat protein 5 [Homo sapiens]
           gi|16554629|ref|NP_438172.1| WD repeat domain 5 protein;
           WD-repeat protein 5; likely ortholog of mouse WD repeat
           protein BIG-3 (BMP-2-induced gene 3 kb) [Homo sapiens]
           gi|18252790|ref|NP_543124.1| WD repeat domain 5;
           Bmp2-induced gene [Mus musculus]
           gi|20141972|sp|Q9UGP9|WDR5_HUMAN WD-repeat protein 5 (WD
           repeat protein BIG-3) gi|7020724|dbj|BAA91248.1| unnamed
           protein product [Homo sapiens]
           gi|12804457|gb|AAH01635.1|AAH01635 Similar to
           hypothetical protein [Homo sapiens]
           gi|16359284|gb|AAH16103.1| Similar to hypothetical
           protein [Mus musculus]
           gi|16589079|gb|AAL27006.1|AF416510_1 WD repeat protein
           BIG-3 [Mus musculus] gi|19388008|gb|AAH25801.1| Similar
           to WD repeat domain 5 [Mus musculus]
           gi|26344836|dbj|BAC36067.1| unnamed protein product [Mus
           musculus]
          Length = 334

 Score = 98.2 bits (243), Expect = 5e-20
 Identities = 42/67 (62%), Positives = 54/67 (79%)
 Frame = -3

Query: 509 SVTNGRYIVSGSEDRCVYLWDLQQKNMIQKLEGHTDTVISVTCHPTENKIASAGLDGDRT 330
           SVT G++IVSGSED  VY+W+LQ K ++QKL+GHTD VIS  CHPTEN IASA L+ D+T
Sbjct: 267 SVTGGKWIVSGSEDNLVYIWNLQTKEIVQKLQGHTDVVISTACHPTENIIASAALENDKT 326

Query: 329 VRVWVQD 309
           +++W  D
Sbjct: 327 IKLWKSD 333

 Score = 42.0 bits (97), Expect = 0.004
 Identities = 18/62 (29%), Positives = 34/62 (54%)
 Frame = -3

Query: 503 TNGRYIVSGSEDRCVYLWDLQQKNMIQKLEGHTDTVISVTCHPTENKIASAGLDGDRTVR 324
           ++   +VS S+D+ + +WD+     ++ L+GH++ V     +P  N I S     D +VR
Sbjct: 97  SDSNLLVSASDDKTLKIWDVSSGKCLKTLKGHSNYVFCCNFNPQSNLIVSGSF--DESVR 154

Query: 323 VW 318
           +W
Sbjct: 155 IW 156

 Score = 37.0 bits (84), Expect = 0.13
 Identities = 20/57 (35%), Positives = 31/57 (54%)
 Frame = -3

Query: 488 IVSGSEDRCVYLWDLQQKNMIQKLEGHTDTVISVTCHPTENKIASAGLDGDRTVRVW 318
           IVSGS D  V +WD++    ++ L  H+D V +V  +   + I S+  DG    R+W
Sbjct: 144 IVSGSFDESVRIWDVKTGKCLKTLPAHSDPVSAVHFNRDGSLIVSSSYDG--LCRIW 198

 Score = 33.5 bits (75), Expect = 1.5
 Identities = 15/61 (24%), Positives = 28/61 (45%)
 Frame = -3

Query: 500 NGRYIVSGSEDRCVYLWDLQQKNMIQKLEGHTDTVISVTCHPTENKIASAGLDGDRTVRV 321
           NG ++ S S D+ + +W        + + GH   +  V      N + SA    D+T+++
Sbjct: 56  NGEWLASSSADKLIKIWGAYDGKFEKTISGHKLGISDVAWSSDSNLLVSA--SDDKTLKI 113

Query: 320 W 318
           W
Sbjct: 114 W 114

>emb|CAB66159.1| hypothetical protein [Homo sapiens]
          Length = 362

 Score = 98.2 bits (243), Expect = 5e-20
 Identities = 42/67 (62%), Positives = 54/67 (79%)
 Frame = -3

Query: 509 SVTNGRYIVSGSEDRCVYLWDLQQKNMIQKLEGHTDTVISVTCHPTENKIASAGLDGDRT 330
           SVT G++IVSGSED  VY+W+LQ K ++QKL+GHTD VIS  CHPTEN IASA L+ D+T
Sbjct: 295 SVTGGKWIVSGSEDNLVYIWNLQTKEIVQKLQGHTDVVISTACHPTENIIASAALENDKT 354

Query: 329 VRVWVQD 309
           +++W  D
Sbjct: 355 IKLWKSD 361

 Score = 42.0 bits (97), Expect = 0.004
 Identities = 18/62 (29%), Positives = 34/62 (54%)
 Frame = -3

Query: 503 TNGRYIVSGSEDRCVYLWDLQQKNMIQKLEGHTDTVISVTCHPTENKIASAGLDGDRTVR 324
           ++   +VS S+D+ + +WD+     ++ L+GH++ V     +P  N I S     D +VR
Sbjct: 125 SDSNLLVSASDDKTLKIWDVSSGKCLKTLKGHSNYVFCCNFNPQSNLIVSGSF--DESVR 182

Query: 323 VW 318
           +W
Sbjct: 183 IW 184

 Score = 37.0 bits (84), Expect = 0.13
 Identities = 20/57 (35%), Positives = 31/57 (54%)
 Frame = -3

Query: 488 IVSGSEDRCVYLWDLQQKNMIQKLEGHTDTVISVTCHPTENKIASAGLDGDRTVRVW 318
           IVSGS D  V +WD++    ++ L  H+D V +V  +   + I S+  DG    R+W
Sbjct: 172 IVSGSFDESVRIWDVKTGKCLKTLPAHSDPVSAVHFNRDGSLIVSSSYDG--LCRIW 226

 Score = 33.5 bits (75), Expect = 1.5
 Identities = 15/61 (24%), Positives = 28/61 (45%)
 Frame = -3

Query: 500 NGRYIVSGSEDRCVYLWDLQQKNMIQKLEGHTDTVISVTCHPTENKIASAGLDGDRTVRV 321
           NG ++ S S D+ + +W        + + GH   +  V      N + SA    D+T+++
Sbjct: 84  NGEWLASSSADKLIKIWGAYDGKFEKTISGHKLGISDVAWSSDSNLLVSA--SDDKTLKI 141

Query: 320 W 318
           W
Sbjct: 142 W 142

>ref|NP_061942.2| WD repeat domain 5B [Homo sapiens] gi|27695090|gb|AAH43494.1| WD
           repeat domain 5B [Homo sapiens]
          Length = 330

 Score = 97.8 bits (242), Expect = 6e-20
 Identities = 41/67 (61%), Positives = 55/67 (81%)
 Frame = -3

Query: 509 SVTNGRYIVSGSEDRCVYLWDLQQKNMIQKLEGHTDTVISVTCHPTENKIASAGLDGDRT 330
           SVT G++IVSGSED  VY+W+LQ K ++QKL+GHTD VIS  CHPTEN IASA L+ D+T
Sbjct: 263 SVTGGKWIVSGSEDNLVYIWNLQTKEIVQKLQGHTDVVISAACHPTENLIASAALENDKT 322

Query: 329 VRVWVQD 309
           +++W+ +
Sbjct: 323 IKLWMSN 329

 Score = 42.7 bits (99), Expect = 0.002
 Identities = 19/57 (33%), Positives = 33/57 (57%)
 Frame = -3

Query: 488 IVSGSEDRCVYLWDLQQKNMIQKLEGHTDTVISVTCHPTENKIASAGLDGDRTVRVW 318
           +VS S+D+ + LWD++    ++ L+GH++ V     +P  N I S     D TV++W
Sbjct: 98  LVSASDDKTLKLWDVRSGKCLKTLKGHSNYVFCCNFNPPSNLIISGSF--DETVKIW 152

 Score = 35.4 bits (80), Expect = 0.38
 Identities = 18/57 (31%), Positives = 31/57 (53%)
 Frame = -3

Query: 488 IVSGSEDRCVYLWDLQQKNMIQKLEGHTDTVISVTCHPTENKIASAGLDGDRTVRVW 318
           I+SGS D  V +W+++    ++ L  H+D V +V  + + + I S   DG    R+W
Sbjct: 140 IISGSFDETVKIWEVKTGKCLKTLSAHSDPVSAVHFNCSGSLIVSGSYDG--LCRIW 194

 Score = 34.3 bits (77), Expect = 0.86
 Identities = 16/61 (26%), Positives = 29/61 (47%)
 Frame = -3

Query: 500 NGRYIVSGSEDRCVYLWDLQQKNMIQKLEGHTDTVISVTCHPTENKIASAGLDGDRTVRV 321
           NG ++ S S DR + +W        + L GH   +  V      +++ SA    D+T+++
Sbjct: 52  NGEWLASSSADRLIIIWGAYDGKYEKTLYGHNLEISDVAWSSDSSRLVSA--SDDKTLKL 109

Query: 320 W 318
           W
Sbjct: 110 W 110
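
Each alignment above is introduced by the same three header lines (Score/Expect, Identities/Positives, Frame). The following is a rough Python sketch of how those headers could be pulled out of the plain-text report for tabulation. The regular expression and the function name `parse_hsps` are assumptions written for this specific layout; they are not an official BLAST parser and may not cover other BLAST versions.

```python
import re

# Matches the three-line HSP header used in this report; \s also spans newlines.
HSP_HEADER = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<expect>[\d.eE+-]+)\s*"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\)\s*"
    r"Frame\s*=\s*(?P<frame>[+-]?\d)"
)


def parse_hsps(report_text):
    """Return one dict per HSP header found in a BLASTX text report."""
    hsps = []
    for m in HSP_HEADER.finditer(report_text):
        expect = m.group("expect")
        if expect.startswith("e"):  # some reports abbreviate 1e-100 as e-100
            expect = "1" + expect
        hsps.append({
            "bits": float(m.group("bits")),
            "raw_score": int(m.group("raw")),
            "expect": float(expect),
            "identities": int(m.group("ident")),
            "positives": int(m.group("pos")),
            "align_length": int(m.group("length")),
            "frame": int(m.group("frame")),
        })
    return hsps


# Two headers copied from the first hit in this report.
sample = """
 Score =  108 bits (270), Expect = 4e-23
 Identities = 46/67 (68%), Positives = 59/67 (87%)
 Frame = -3

 Score = 33.5 bits (75), Expect = 1.5
 Identities = 16/61 (26%), Positives = 29/61 (47%)
 Frame = -3
"""
for hsp in parse_hsps(sample):
    print(hsp)
```

The sketch keeps the counts as reported and does not recompute the percentages, since the rounding used by the report is not stated in the record.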

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 460,482,393
Number of Sequences: 1393205
Number of extensions: 10336800
Number of successful extensions: 28989
Number of sequences better than 10.0: 1723
Number of HSP's better than 10.0 without gapping: 23745
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 28658
length of database: 448,689,247
effective HSP length: 114
effective length of database: 289,863,877
effective search space used: 15942513235
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
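
The gapped Lambda and K values listed above link the raw scores shown in parentheses to the bit scores, through the usual Karlin-Altschul relation S' = (lambda * S - ln K) / ln 2. The Python below is a small sketch of that conversion, plus an approximate E-value computed as search space times 2^(-S'). The function names are illustrative assumptions, and the E-value is only expected to land in the same order of magnitude as the reported one, because Lambda and K are rounded in the report and BLAST's internal adjustments are not reproduced here.

```python
import math

# Values copied from the statistics block of this report.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
SEARCH_SPACE = 15942513235  # "effective search space used"


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def e_value(raw_score, search_space=SEARCH_SPACE):
    """Approximate E-value: search space times 2 to the minus bit score."""
    return search_space * 2.0 ** (-bit_score(raw_score))


# Raw scores taken from the alignments in this report.
for raw in (270, 243, 97, 84, 75):
    print(raw, round(bit_score(raw), 1), f"{e_value(raw):.1e}")
```

With these constants, raw scores of 243, 97, 84 and 75 convert to 98.2, 42.0, 37.0 and 33.5 bits, which agrees with the values printed for the corresponding alignments.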


EST assemble image


no.  clone       accession  position
1    MF012a06_f  BP028840   1 511
2    MR052c04_f  BP080000   71 453




Lotus japonicus
Kazusa DNA Research Institute