KMC015163A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC015163A_C01 KMC015163A_c01
ctaacaaaaaatagtgttgaatagagaagaggtttagacacaaaaaAAACAGAACAGGTA
AGAGGTGAGCTAGATTCTTAAATTCAGGTGGCACTAACAAGCAAAGCATAATCGATGTTA
CAAAGTTGCATGTTATCGATATATCCATCAACAGTGAATCATACATATAGATACAAACAA
ACTGCAAATTAATAAGTTTCTTCAACTGGGGTTTCAGAGACAAGGTTCATTTCATGTATT
TGTGCAAGAATTTCCTGTGGAAATCATTCCTGATGGTATCATTAATGAATCGCTTCGGTG
CCTTGGCTTCACCACGGGCTTGGTTCTTAAACACGACATCATCATCCCACCTTCTTTTCA
CATTGAAAGATGCTGGATTATTTAGCAATGGGTTCCCACGCAGAAGCTCTGCCTCCTTTA
CTTTAAGCTCTTCCTCTTGCTCTAGTCGTTCCTTCCGTAGCTTTTCCTCTGCTCTTTCCT
TCTTTATTTGTTCAAGCTCAGCCAGCAAAGCTTCAGTGTCATCCTCATCATCGTCATCAT
CATCACTCTCTTCATCACTTTTGACTTCAACATCAGAATCATCTGCATCAACACTTCGTG
CAACAATATGGTCTTCAACATCTCTCTTTGTTCCTTCCAAAAACAGATGGCTACTCTTTC
CATGATCCCTGTCATCACTATAAGATTTATTTTTTGATGAGAAATGCCTTCGCTCACGCT
CTTCCAGTTCATCACGCAGGTTCCTTCTCTTTAACTCATCCTGTGTGTCCTGTCCGTCTC
TCCTTGGCTTGAGAGTGGTGTGAGATGCGATGTCCCTGGAGGAGTATTTCTGAGAGGGGC
CGAAAATTCGAGTACCACCTTGTTCGTTGCCACCTTTAGCCGGTGCCCATGTGGGTCGAG
CAGcggttgtcatcttcaatcgccgccggtaatggaggaggaggaagggtttcgtcgaaa
ggaggtgcgatgcgatgaggatc


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC015163A_C01 KMC015163A_c01
         (983 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAM64310.1| unknown [Arabidopsis thaliana]                         382  e-105
gb|AAK96864.1| Unknown protein [Arabidopsis thaliana] gi|1837747...   380  e-104
ref|NP_566447.1| expressed protein; protein id: At3g13200.1, sup...   379  e-104
dbj|BAA90629.1| ESTs AU032852(S15362),AU070591(S5037) correspond...   374  e-103
ref|NP_075642.1| RIKEN cDNA 0610040D20 [Mus musculus] gi|8926741...   206  6e-52

>gb|AAM64310.1| unknown [Arabidopsis thaliana]
          Length = 230

 Score =  382 bits (980), Expect = e-105
 Identities = 184/230 (80%), Positives = 208/230 (90%), Gaps = 2/230 (0%)
 Frame = -2

Query: 913 MTTAARPTWAPAKGGNEQGGTRIFGPSQKYSSRDIASHTTLKPRRDGQDTQDELKRRNLR 734
           MTTAARPTWAPAKGGNEQGG RIFGPSQKYSSRD+A+HTTLKPRR+GQ TQ+EL++ NLR
Sbjct: 1   MTTAARPTWAPAKGGNEQGGARIFGPSQKYSSRDVAAHTTLKPRREGQHTQEELQKINLR 60

Query: 733 DELEERERRHFSSKNKSYSDDRDHGKSSHLFLEGTKRDVEDHIVARSVDADDSDVEVKSD 554
           DELEERERRHFSSK+KSY+DDRD  + S L LEG+KRD E+ I+ RSVDADDSDV++KSD
Sbjct: 61  DELEERERRHFSSKDKSYNDDRDRRRGSQLLLEGSKRDPEERIIPRSVDADDSDVDIKSD 120

Query: 553 EESDD--DDDDEDDTEALLAELEQIKKERAEEKLRKERLEQEEELKVKEAELLRGNPLLN 380
           ++SDD  DDDDEDDTEAL+AEL+QIKKER EE+LRKE+ +Q EEL  KE ELL+GNPLLN
Sbjct: 121 DDSDDESDDDDEDDTEALMAELDQIKKERVEERLRKEKEQQMEELNAKEEELLKGNPLLN 180

Query: 379 NPASFNVKRRWDDDVVFKNQARGEAKAPKRFINDTIRNDFHRKFLHKYMK 230
            P SF+VKRRWDDDVVFKNQARGE KAPKRFINDTIRNDFHRKFLH+YMK
Sbjct: 181 TPTSFSVKRRWDDDVVFKNQARGEMKAPKRFINDTIRNDFHRKFLHRYMK 230

>gb|AAK96864.1| Unknown protein [Arabidopsis thaliana] gi|18377472|gb|AAL66902.1|
           unknown protein [Arabidopsis thaliana]
          Length = 230

 Score =  380 bits (977), Expect = e-104
 Identities = 184/230 (80%), Positives = 208/230 (90%), Gaps = 2/230 (0%)
 Frame = -2

Query: 913 MTTAARPTWAPAKGGNEQGGTRIFGPSQKYSSRDIASHTTLKPRRDGQDTQDELKRRNLR 734
           MTTAARPTWAPAKGGNEQGG RIFGPSQKYSSRD+A+HTTLKPRR+GQ TQ+EL++ NLR
Sbjct: 1   MTTAARPTWAPAKGGNEQGGARIFGPSQKYSSRDVAAHTTLKPRREGQHTQEELQKINLR 60

Query: 733 DELEERERRHFSSKNKSYSDDRDHGKSSHLFLEGTKRDVEDHIVARSVDADDSDVEVKSD 554
           DELEERERRHFSSK+KSY+DDRD  + S L LE +KRD E+ I+ RSVDADDSDV++KSD
Sbjct: 61  DELEERERRHFSSKDKSYNDDRDRRRGSQLLLEDSKRDPEERIIPRSVDADDSDVDIKSD 120

Query: 553 EESDD--DDDDEDDTEALLAELEQIKKERAEEKLRKERLEQEEELKVKEAELLRGNPLLN 380
           ++SDD  DDDDEDDTEAL+AEL+QIKKER EE+LRKE+ +Q EEL  KE ELL+GNPLLN
Sbjct: 121 DDSDDESDDDDEDDTEALMAELDQIKKERVEERLRKEKEQQMEELNAKEEELLKGNPLLN 180

Query: 379 NPASFNVKRRWDDDVVFKNQARGEAKAPKRFINDTIRNDFHRKFLHKYMK 230
            PASF+VKRRWDDDVVFKNQARGE KAPKRFINDTIRNDFHRKFLH+YMK
Sbjct: 181 TPASFSVKRRWDDDVVFKNQARGEMKAPKRFINDTIRNDFHRKFLHRYMK 230

>ref|NP_566447.1| expressed protein; protein id: At3g13200.1, supported by cDNA:
           22807., supported by cDNA: gi_15451185, supported by
           cDNA: gi_18377471 [Arabidopsis thaliana]
           gi|10172608|dbj|BAB01412.1|
           dbj|BAA90629.1~gene_id:MJG19.16~similar to unknown
           protein [Arabidopsis thaliana]
          Length = 230

 Score =  379 bits (973), Expect = e-104
 Identities = 183/230 (79%), Positives = 207/230 (89%), Gaps = 2/230 (0%)
 Frame = -2

Query: 913 MTTAARPTWAPAKGGNEQGGTRIFGPSQKYSSRDIASHTTLKPRRDGQDTQDELKRRNLR 734
           MTTAARPTWAPAKGGNEQGG RIFGPSQKYSSRD+A+HTTLKPRR+GQ TQ+EL++ NLR
Sbjct: 1   MTTAARPTWAPAKGGNEQGGARIFGPSQKYSSRDVAAHTTLKPRREGQHTQEELQKINLR 60

Query: 733 DELEERERRHFSSKNKSYSDDRDHGKSSHLFLEGTKRDVEDHIVARSVDADDSDVEVKSD 554
           DELEERERRHFSSK+KSY+DDRD  + S L LE +KRD E+ I+ RSVDADDSDV++KSD
Sbjct: 61  DELEERERRHFSSKDKSYNDDRDRRRGSQLLLEDSKRDPEERIIPRSVDADDSDVDIKSD 120

Query: 553 EESDD--DDDDEDDTEALLAELEQIKKERAEEKLRKERLEQEEELKVKEAELLRGNPLLN 380
           ++SDD  DDDDEDDTEAL+AEL+QIKKER EE+LRKE+ +Q EEL  KE ELL+GNPLLN
Sbjct: 121 DDSDDESDDDDEDDTEALMAELDQIKKERVEERLRKEKEQQMEELNAKEEELLKGNPLLN 180

Query: 379 NPASFNVKRRWDDDVVFKNQARGEAKAPKRFINDTIRNDFHRKFLHKYMK 230
            P SF+VKRRWDDDVVFKNQARGE KAPKRFINDTIRNDFHRKFLH+YMK
Sbjct: 181 TPTSFSVKRRWDDDVVFKNQARGEMKAPKRFINDTIRNDFHRKFLHRYMK 230

>dbj|BAA90629.1| ESTs AU032852(S15362),AU070591(S5037) correspond to a region of the
           predicted gene.~Similar to Caenorhabditis elegans cosmid
           T10C6. (Z93388) [Oryza sativa (japonica cultivar-group)]
          Length = 230

 Score =  374 bits (961), Expect = e-103
 Identities = 179/230 (77%), Positives = 209/230 (90%), Gaps = 2/230 (0%)
 Frame = -2

Query: 913 MTTAARPTWAPAKGGNEQGGTRIFGPSQKYSSRDIASHTTLKPRRDGQDTQDELKRRNLR 734
           MTTAARPTWAPAKGGNEQGGTRIFGPSQKYSSRD+A+HTTLKPR++GQ TQ+EL++RNLR
Sbjct: 1   MTTAARPTWAPAKGGNEQGGTRIFGPSQKYSSRDLAAHTTLKPRKEGQHTQEELQKRNLR 60

Query: 733 DELEERERRHFSSKNKSYSDDRDHGKSSHLFLEGTKRDVEDHIVARSVDADDSDVEVKSD 554
           DELEERER+H+SSK+KSY+++RD  KS+ L LEG++R+ ED IV R +DADDSDVE +SD
Sbjct: 61  DELEERERKHYSSKDKSYAEERDRRKSTSLLLEGSRREAEDKIVPREIDADDSDVEPRSD 120

Query: 553 EESDDDDDDEDDTEALLAELEQIKKERAEEKLRKERLEQEEELKVKEAELLRGNPL--LN 380
           +ESD+DDDD+DDTEAL+AELE+IKKERAEE LRKER + EEE K+KEAEL+RGNPL  +N
Sbjct: 121 DESDEDDDDDDDTEALMAELERIKKERAEENLRKERQQAEEEAKMKEAELMRGNPLININ 180

Query: 379 NPASFNVKRRWDDDVVFKNQARGEAKAPKRFINDTIRNDFHRKFLHKYMK 230
           N  SFNVKRRWDDDVVFKNQARGE K PKRFINDTIR+DFHRKFL +YMK
Sbjct: 181 NAGSFNVKRRWDDDVVFKNQARGETKTPKRFINDTIRSDFHRKFLQRYMK 230

>ref|NP_075642.1| RIKEN cDNA 0610040D20 [Mus musculus] gi|8926741|emb|CAB96547.1|
           hypothetical protein [Mus musculus]
           gi|12833160|dbj|BAB22414.1| unnamed protein product [Mus
           musculus] gi|12833722|dbj|BAB22638.1| unnamed protein
           product [Mus musculus] gi|12834742|dbj|BAB23026.1|
           unnamed protein product [Mus musculus]
           gi|13435729|gb|AAH04726.1| RIKEN cDNA 0610040D20 gene
           [Mus musculus]
          Length = 229

 Score =  206 bits (523), Expect = 6e-52
 Identities = 117/238 (49%), Positives = 161/238 (67%), Gaps = 10/238 (4%)
 Frame = -2

Query: 913 MTTAARPTWAPAKGGNEQGGTRIFGPSQKYSSRDIASHTTLKPRRDGQDTQDELKRRNLR 734
           MTTAARPT+ PA+GG  +G   +   S++YSSRD+ SHT +K R+  QD  +E++ R+ R
Sbjct: 1   MTTAARPTFEPARGGRGKGEGDLSQLSKQYSSRDLPSHTKIKYRQTTQDAPEEVRNRDFR 60

Query: 733 DELEERERRHFSSKNKSYSDDRDHGKSSHLFLEGTKRDVEDHIVARSVDADDSDVEVKSD 554
            ELEERER     KN+     R+H  SS +    +K+   D I A ++DADD      +D
Sbjct: 61  RELEERERAAARDKNRD-RPTREHTTSSSV----SKKPRLDQIPAANLDADDP----LTD 111

Query: 553 EESDD--DDDDEDDTEALLAELEQIKKERAEEKLRKERLEQEEELKVKEAELLRGNPLLN 380
           EE +D  ++ D+DDT ALLAELE+IKKERAEE+ RKE+ ++ EE +++   +L GNPLLN
Sbjct: 112 EEDEDFEEESDDDDTAALLAELEKIKKERAEEQARKEQEQKAEEERIRMENILSGNPLLN 171

Query: 379 ------NPASFNVKRRWDDDVVFKNQARG--EAKAPKRFINDTIRNDFHRKFLHKYMK 230
                   A+F VKRRWDDDVVFKN A+G  + K  KRF+NDT+R++FH+KF+ KY+K
Sbjct: 172 LTGPSQPQANFKVKRRWDDDVVFKNCAKGIDDQKKDKRFVNDTLRSEFHKKFMEKYIK 229

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 857,407,650
Number of Sequences: 1393205
Number of extensions: 20722814
Number of successful extensions: 304246
Number of sequences better than 10.0: 4775
Number of HSP's better than 10.0 without gapping: 110480
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 195922
length of database: 448,689,247
effective HSP length: 123
effective length of database: 277,325,032
effective search space used: 56574306528
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MFB069g09_f BP039040 1 504
2 MWM051c02_f AV765490 47 384
3 SPD002c07_f BP044147 68 567
4 MFB042b08_f BP037046 75 649
5 MFB100a11_f BP041238 404 983




Lotus japonicus
Kazusa DNA Research Institute