KMC002175A_c01

Fasta Sequence
>KMC002175A_C01 KMC002175A_c01
ataaataaaACTAGGATATGCATAAATATTTTATTAAATTTATAACCGAGCCAGGTTACA
TGGTAATCGGCCAAAAGAAGAATGAATTAACAAACTTGAATAAGAGACAAATACTACTAG
CTTAATTACTCTGTTTTTTCTCTTACAATTCTCTCCCATTTTCTGTATGTAACATTTTTT
TAAACAACTTTAACTTATATTTTTGTGTAATTAGAGTCCTTTACTGTACAAGGATGAAAT
TAAAATCCAAAAGCGACGTCGTTTGGGTTCAATCTTCAAGCCCAATAGGTCACTAACCGA
AAGAGCAAAGTGAAAACAACAGCAAAAGAAACAAGCATAACACTCATATCCAACGCTTTC
TCTCTCTTCAATCTCCTCTCTCTCCTCTCTCTCTTCACCAAACAAAATCTCTCAAAACTC
TTCTTCGCTTCCTCCGCCACCACCTTCCTCCTCTTCCCACCCTCCATCGCCGCCTGCAAA
ACCATCATACACTCCCCCCCT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC002175A_C01 KMC002175A_c01
         (501 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAC33489.1| unnamed protein product [Mus musculus]                 42  0.005
ref|NP_663433.1| similar to CG8726 gene product [Mus musculus] g...    42  0.005
emb|CAA46515.1| proline-rich antigen [Mycobacterium leprae]            41  0.007
ref|NP_302551.1| proline rich antigenic protein [Mycobacterium l...    41  0.007
ref|XP_206076.1| hypothetical protein XP_206076 [Mus musculus]         40  0.011

>dbj|BAC33489.1| unnamed protein product [Mus musculus]
          Length = 581

 Score = 41.6 bits (96), Expect = 0.005
 Identities = 37/85 (43%), Positives = 46/85 (53%), Gaps = 3/85 (3%)
 Frame = +2

Query: 248 QKRRRLGSIFKPNRSLTERAK*KQQQKKQA*HSYPTLSLSSISS-LSSLSSPNKISQN-- 418
           +KRR++ +  K  RS  E ++  +Q  K   HS    S S  SS L+S SSP   S    
Sbjct: 456 RKRRKILARKKSKRSAVENSE--EQPVK---HSNSNNSGSGASSPLTSPSSPTPPSTAGL 510

Query: 419 SSSLPPPPPSSSSHPPSPPAKPSYT 493
           SS+LPPPPP     PP PPA PS T
Sbjct: 511 SSALPPPPPPPP--PPPPPAGPSPT 533

 Score = 33.1 bits (74), Expect = 1.8
 Identities = 32/102 (31%), Positives = 46/102 (44%), Gaps = 10/102 (9%)
 Frame = +2

Query: 224 LYKDEIKIQKRRRLGSIFKPNRSLTERAK*KQQQKKQA*HS-------YPTLSLSSISSL 382
           L +++ +I + RRL      + S  ER + K   +K++  S        P    +S +S 
Sbjct: 430 LTEEQKQIHQHRRLTRAQSHHGSEEERKRRKILARKKSKRSAVENSEEQPVKHSNSNNSG 489

Query: 383 SSLSSPNKISQNSSSLPPPPPS---SSSHPPSPPAKPSYTPP 499
           S  SSP      S S P PP +   SS+ PP PP  P   PP
Sbjct: 490 SGASSP----LTSPSSPTPPSTAGLSSALPPPPPPPPPPPPP 527

>ref|NP_663433.1| similar to CG8726 gene product [Mus musculus]
           gi|16359349|gb|AAH16131.1| Similar to hypothetical
           protein FLJ20335 [Mus musculus]
          Length = 582

 Score = 41.6 bits (96), Expect = 0.005
 Identities = 27/82 (32%), Positives = 45/82 (53%)
 Frame = +2

Query: 248 QKRRRLGSIFKPNRSLTERAK*KQQQKKQA*HSYPTLSLSSISSLSSLSSPNKISQNSSS 427
           +KRR++ +  K  RS  E ++ +  +   + +S  + + S ++S SS + P+    +S+ 
Sbjct: 456 RKRRKILARKKSKRSAVENSEEQPVKHSNSNNSAGSGASSPLTSPSSPTPPSTAGLSSAL 515

Query: 428 LPPPPPSSSSHPPSPPAKPSYT 493
            PPPPP     PP PPA PS T
Sbjct: 516 PPPPPPPP---PPPPPAGPSPT 534

 Score = 32.0 bits (71), Expect = 4.0
 Identities = 28/99 (28%), Positives = 44/99 (44%), Gaps = 7/99 (7%)
 Frame = +2

Query: 224 LYKDEIKIQKRRRLGSIFKPNRSLTERAK*K----QQQKKQA*HSYPTLSLSSISSLSSL 391
           L +++ +I + RRL      + S  ER + K    ++ K+ A  +     +   +S +S 
Sbjct: 430 LTEEQKQIHQHRRLTRAQSHHGSEEERKRRKILARKKSKRSAVENSEEQPVKHSNSNNSA 489

Query: 392 SSPNKISQNSSSLPPPPPS---SSSHPPSPPAKPSYTPP 499
            S       S S P PP +   SS+ PP PP  P   PP
Sbjct: 490 GSGASSPLTSPSSPTPPSTAGLSSALPPPPPPPPPPPPP 528

>emb|CAA46515.1| proline-rich antigen [Mycobacterium leprae]
          Length = 249

 Score = 41.2 bits (95), Expect = 0.007
 Identities = 19/42 (45%), Positives = 20/42 (47%)
 Frame = +2

Query: 374 SSLSSLSSPNKISQNSSSLPPPPPSSSSHPPSPPAKPSYTPP 499
           S L S   P        S PPPPP   S+PP PP   SY PP
Sbjct: 32  SELGSAYPPPTAPPVGGSYPPPPPPGGSYPPPPPPGGSYPPP 73

 Score = 39.3 bits (90), Expect = 0.025
 Identities = 14/25 (56%), Positives = 17/25 (68%)
 Frame = +2

Query: 425 SLPPPPPSSSSHPPSPPAKPSYTPP 499
           S PPPPP   S+PP PP+  +Y PP
Sbjct: 59  SYPPPPPPGGSYPPPPPSTGAYAPP 83

>ref|NP_302551.1| proline rich antigenic protein [Mycobacterium leprae]
           gi|13432206|sp|P41484|PRA_MYCLE Proline-rich antigen (36
           kDa antigen) gi|80644|pir||A41497 36K antigen pra -
           Mycobacterium leprae gi|699272|gb|AAA63035.1| ag36
           [Mycobacterium leprae] gi|13093981|emb|CAC31911.1|
           proline rich antigenic protein [Mycobacterium leprae]
          Length = 249

 Score = 41.2 bits (95), Expect = 0.007
 Identities = 19/42 (45%), Positives = 20/42 (47%)
 Frame = +2

Query: 374 SSLSSLSSPNKISQNSSSLPPPPPSSSSHPPSPPAKPSYTPP 499
           S L S   P        S PPPPP   S+PP PP   SY PP
Sbjct: 32  SELGSAYPPPTAPPVGGSYPPPPPPGGSYPPPPPPGGSYPPP 73

 Score = 39.3 bits (90), Expect = 0.025
 Identities = 14/25 (56%), Positives = 17/25 (68%)
 Frame = +2

Query: 425 SLPPPPPSSSSHPPSPPAKPSYTPP 499
           S PPPPP   S+PP PP+  +Y PP
Sbjct: 59  SYPPPPPPGGSYPPPPPSTGAYAPP 83

>ref|XP_206076.1| hypothetical protein XP_206076 [Mus musculus]
          Length = 241

 Score = 40.4 bits (93), Expect = 0.011
 Identities = 23/43 (53%), Positives = 25/43 (57%)
 Frame = -1

Query: 486 DGFAGGDGGWEEEEGGGGGSEEEF*EILFGEEREEREEIEERE 358
           +G  GG GG     GGGGG EEE  E    EE EE EE EE+E
Sbjct: 143 EGGRGGGGGGGGGGGGGGGEEEEEEEEEEEEEEEEEEEEEEKE 185

 Score = 37.7 bits (86), Expect = 0.073
 Identities = 24/47 (51%), Positives = 25/47 (53%)
 Frame = -1

Query: 483 GFAGGDGGWEEEEGGGGGSEEEF*EILFGEEREEREEIEERESVGYE 343
           G  GG GG     GGGGG EEE  E    EE EE EE EE +  G E
Sbjct: 148 GGGGGGGG-----GGGGGGEEEEEEEEEEEEEEEEEEEEEEKERGKE 189

 Score = 32.3 bits (72), Expect = 3.1
 Identities = 22/47 (46%), Positives = 25/47 (52%)
 Frame = -1

Query: 498 GGVYDGFAGGDGGWEEEEGGGGGSEEEF*EILFGEEREEREEIEERE 358
           GG   G  GG GG EEEE      EEE  E    EE EE+E  +E+E
Sbjct: 149 GGGGGGGGGGGGGEEEEEEEEEEEEEEEEE----EEEEEKERGKEKE 191

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 432,847,052
Number of Sequences: 1393205
Number of extensions: 9904867
Number of successful extensions: 166925
Number of sequences better than 10.0: 1616
Number of HSP's better than 10.0 without gapping: 70827
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 135627
length of database: 448,689,247
effective HSP length: 114
effective length of database: 289,863,877
effective search space used: 15072921604
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
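
The bit scores and E-values reported for each alignment can be cross-checked against the parameters listed above. The following is a minimal sketch, assuming the standard Karlin-Altschul relations S' = (Lambda*S - ln K) / ln 2 and E = (effective search space) * 2^(-S'), and assuming the gapped Lambda and K values are the ones that apply to these gapped alignments. The function names are illustrative and are not part of BLAST.

```python
import math

# Values copied from the report above (gapped statistics block).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 15072921604


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw score S to a bit score S' = (lam*S - ln k) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=EFFECTIVE_SEARCH_SPACE):
    """Expected number of chance alignments scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    # Raw scores appear in parentheses after each bit score, e.g. "41.6 bits (96)".
    for raw in (96, 95, 93):
        bits = bit_score(raw)
        print(f"raw={raw}  bits={bits:.1f}  E={expect_value(bits):.3f}")
```

For the top hit (raw score 96) this reproduces the reported 41.6 bits and an E-value that rounds to 0.005, which suggests the assumptions hold for this report.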


EST assemble image


No.  clone        accession  position
1    MR039f08_f   BP079034     1  332
2    SPD023b02_f  BP045786     6  450
3    GNf028b01    BP069363    10  448
4    MR054g01_f   BP080187    20  336
5    MR020d07_f   BP077517    65  479
6    GNf064a03    BP072086    95  501
7    MR026g11_f   BP078025   103  524
8    GENf042d08   BP060132   116  430
9    MR094f09_f   BP083244   184  355
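
The table above can be read as a set of intervals on the assembled contig. The following is a minimal sketch, assuming the two position columns are 1-based inclusive start and end coordinates (the record does not state this). It computes how many ESTs cover each position. The names `ESTS` and `coverage_depth` are hypothetical and are used only for illustration.

```python
# (clone, accession, start, end) copied from the table above.
ESTS = [
    ("MR039f08_f", "BP079034", 1, 332),
    ("SPD023b02_f", "BP045786", 6, 450),
    ("GNf028b01", "BP069363", 10, 448),
    ("MR054g01_f", "BP080187", 20, 336),
    ("MR020d07_f", "BP077517", 65, 479),
    ("GNf064a03", "BP072086", 95, 501),
    ("MR026g11_f", "BP078025", 103, 524),
    ("GENf042d08", "BP060132", 116, 430),
    ("MR094f09_f", "BP083244", 184, 355),
]


def coverage_depth(ests):
    """Return a list where depth[p] is the number of ESTs covering position p.

    Positions are treated as 1-based and inclusive (an assumption);
    index 0 is unused and stays 0.
    """
    last = max(end for _, _, _, end in ests)
    # Difference array: +1 where an EST starts, -1 just past where it ends.
    diff = [0] * (last + 2)
    for _, _, start, end in ests:
        diff[start] += 1
        diff[end + 1] -= 1
    depth = [0] * (last + 1)
    running = 0
    for pos in range(1, last + 1):
        running += diff[pos]
        depth[pos] = running
    return depth


if __name__ == "__main__":
    depth = coverage_depth(ESTS)
    print("max depth:", max(depth))
    print("positions covered by a single EST:",
          sum(1 for d in depth[1:] if d == 1))
```

A difference array keeps the computation linear in the contig length plus the number of ESTs, which is more than enough for a nine-clone assembly but scales to larger ones unchanged.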




Lotus japonicus
Kazusa DNA Research Institute