KMC003806A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC003806A_C01 KMC003806A_c01
ATGGAGCATTAGTTGGAAGGCTGGTACGTTCCCCAGCTTAGGAAACAATAGCAACACAAA
AAAACACAAACATGATTTTATCAAACAAAATGAGGGAGCTATATTCTGTCACTTTGTCAT
GATGCATTAAACGATACAACATTTTGATTTATGTTTCTCTTCAGAAGCTAAAGTCAATCA
AGAAACTTCGCCTCATAACTCCTCTACATGCTGCTCAAGGGAAGATAAGTACCTGTCTAC
AGTGTTTAGAGTATCTTGATCAACTTCAGCAGTTACAACCAATTTCGGTGAAGAATAATC
CAATTCTTCTGCTGAAGTGGATTCGCTTTTAGCAATTTCATTCTCAGGTGTTTCATGGTC
TAGAATTTCTATTGAGCCAGAGTCCACTTCCAAATTGAAGAGTTGGGACTTCATCTCCAA
GCAAGAGTTCAAATCCTCCTCACTCCTGGCTTTGTTGATTTTATCAATAAGATCACTTAT
AGCAGATAACCTTTCATCACTACAACTTTTTGGCCTTTTAGAAACTTCACTAGGTTCTTC
ATCAATTGGTTTATTGATGTCATCATCCACTGCAAACTTCTCAAGCCTGTGTAATCCCTT
ATCTACAAATGGTTGTATTCTCATTCT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC003806A_C01 KMC003806A_c01
         (627 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_564407.1| expressed protein; protein id: At1g32730.1, sup...    55  6e-07
ref|NP_758307.1| hypothetical protein [Mycoplasma penetrans] gi|...    39  0.056
ref|XP_235683.1| similar to keratin protein K6irs [Homo sapiens]...    39  0.073
ref|NP_082518.1| RIKEN cDNA 2600017A12 [Mus musculus] gi|2505532...    38  0.12
ref|XP_236420.1| similar to Protein C20orf129 [Rattus norvegicus]      38  0.12

>ref|NP_564407.1| expressed protein; protein id: At1g32730.1, supported by cDNA:
           116730. [Arabidopsis thaliana] gi|25403412|pir||C86452
           protein F6N18.11 [imported] - Arabidopsis thaliana
           gi|6714274|gb|AAF25970.1|AC017118_7 F6N18.11
           [Arabidopsis thaliana] gi|21536980|gb|AAM61321.1|
           unknown [Arabidopsis thaliana]
          Length = 327

 Score = 55.5 bits (132), Expect = 6e-07
 Identities = 39/139 (28%), Positives = 67/139 (48%)
 Frame = -1

Query: 627 RMRIQPFVDKGLHRLEKFAVDDDINKPIDEEPSEVSKRPKSCSDERLSAISDLIDKINKA 448
           + RI   V  GL +L++     D+    D++     +  +   +E+ SA++++IDK+NKA
Sbjct: 210 KKRIAETVKAGLVKLKRL----DLGSSSDDQDDIKRRVKRKKWEEKGSALNEIIDKLNKA 265

Query: 447 RSEEDLNSCLEMKSQLFNLEVDSGSIEILDHETPENEIAKSESTSAEELDYSSPKLVVTA 268
           R+EEDL SCLEMKS+L                       +   T+A E +   P +V   
Sbjct: 266 RTEEDLKSCLEMKSKL---------------------CGQVSPTAASEKNKIFPGVVRKV 304

Query: 267 EVDQDTLNTVDRYLSSLEQ 211
           E+ ++ L  +   L S ++
Sbjct: 305 EMSEEALQKIAENLQSFDK 323

>ref|NP_758307.1| hypothetical protein [Mycoplasma penetrans]
            gi|26454383|dbj|BAC44711.1| hypothetical protein
            [Mycoplasma penetrans]
          Length = 915

 Score = 38.9 bits (89), Expect = 0.056
 Identities = 39/156 (25%), Positives = 69/156 (44%), Gaps = 16/156 (10%)
 Frame = -1

Query: 615  QPFVDK--GLHRLEKFAVDDDINKPIDEEPSEVSKRPKSCSDERLSAISDLIDKI-NKAR 445
            + FVD    +  L  F   DD +KP  E PS + +  K    +  + +S++ D + NK  
Sbjct: 731  EKFVDTEFNVEELYSFQTSDDDSKPTVESPSSLEEEVKEFFSDLSAELSNIPDVVENKKE 790

Query: 444  SEED-------LNSCLE----MKSQLFNLEVDSGSIEILDHETPENEIAKSEST-SAEEL 301
            + +D       +NS L+      SQLF  E     +++ +    +     S ST S++E 
Sbjct: 791  NNDDIVIEDISINSDLDSLDLSSSQLFETEFLPNELKLDEEMNNQFSTLVSPSTESSKEA 850

Query: 300  DYSSPKLVVTAEV-DQDTLNTVDRYLSSLEQHVEEL 196
            D   P +     V D+   + + ++L  L+   E+L
Sbjct: 851  DVYEPNISADKIVSDKQITSDLQKFLEDLKVEKEKL 886

>ref|XP_235683.1| similar to keratin protein K6irs [Homo sapiens] [Rattus norvegicus]
          Length = 1675

 Score = 38.5 bits (88), Expect = 0.073
 Identities = 28/85 (32%), Positives = 43/85 (49%)
 Frame = -1

Query: 435 DLNSCLEMKSQLFNLEVDSGSIEILDHETPENEIAKSESTSAEELDYSSPKLVVTAEVDQ 256
           D N  L M +   NL++DS   E+   ++    IA      +EEL +S  KL VTA    
Sbjct: 349 DTNVILSMDNNR-NLDLDSIIAEV---QSQYEIIAHKSKAESEELYHSKAKLQVTAVKHG 404

Query: 255 DTLNTVDRYLSSLEQHVEEL*GEVS 181
           D+L  +   +S L + ++ L GE+S
Sbjct: 405 DSLKEIKMEISELNRTIQRLQGEIS 429

>ref|NP_082518.1| RIKEN cDNA 2600017A12 [Mus musculus] gi|25055327|ref|XP_135878.2|
           RIKEN cDNA 2600017A12 [Mus musculus]
           gi|22902415|gb|AAH37711.1| Similar to HIV TAT specific
           factor 1 [Mus musculus] gi|26340228|dbj|BAC33777.1|
           unnamed protein product [Mus musculus]
          Length = 757

 Score = 37.7 bits (86), Expect = 0.12
 Identities = 29/118 (24%), Positives = 52/118 (43%), Gaps = 1/118 (0%)
 Frame = -1

Query: 600 KGLHRLEKFAVDDDINKPIDEEPSEVSKRPKSCSDERLSAISDLIDK-INKARSEEDLNS 424
           +G   L+K + DDD  +  +E+ SE   +  S  +   + +   +D+ ++     ED+  
Sbjct: 529 EGEDSLKKESEDDDSEEESEEDSSEKQSQDGSDKEIEENGVKKDVDQDVSDKEFPEDVEK 588

Query: 423 CLEMKSQLFNLEVDSGSIEILDHETPENEIAKSESTSAEELDYSSPKLVVTAEVDQDT 250
             E +++    E D GS  +LD E  E E  +      EE D    ++V     D D+
Sbjct: 589 ESE-ENETDKSEFDEGSERVLDEEGSEREFEEDSDEKEEEGDDDEEEVVYERVFDDDS 645

>ref|XP_236420.1| similar to Protein C20orf129 [Rattus norvegicus]
          Length = 1413

 Score = 37.7 bits (86), Expect = 0.12
 Identities = 30/104 (28%), Positives = 52/104 (49%), Gaps = 2/104 (1%)
 Frame = -1

Query: 561  DINKPIDEEPSEVSKRPKSCSDERLSAISDLIDKINKARSEEDLNSCLEMKSQLFNLEVD 382
            ++    DE+  EV+KR    +  +  +I+ L+D +NK    ++LNS  E K+    L+  
Sbjct: 953  NVTHSTDEDDDEVTKRDPPSASAKSISIAALLD-VNKEEPNKELNSKKEGKASPSFLKKG 1011

Query: 381  SGSIEILDHETPE--NEIAKSESTSAEELDYSSPKLVVTAEVDQ 256
            S  +  L   TPE    +AK+++ +   +  SS  LV   E +Q
Sbjct: 1012 SQKLRSLLSLTPEKRENLAKNKAPAFYRMCSSSDTLVSEGEENQ 1055

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 487,752,968
Number of Sequences: 1393205
Number of extensions: 9528049
Number of successful extensions: 31204
Number of sequences better than 10.0: 125
Number of HSP's better than 10.0 without gapping: 29471
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 31010
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 25586195130
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 SPD065a02_f BP049150 1 523
2 MPD026c09_f AV771769 117 632
3 MFB094g03_f BP040872 146 697
4 GNf074h08 BP072875 154 423
5 SPDL015d06_f BP052930 154 571
6 MFB040b03_f BP036903 168 719




Lotus japonicus
Kazusa DNA Research Institute