KMC002722A_c01

Fasta Sequence
>KMC002722A_C01 KMC002722A_c01
gtgtgcaaaaaagcgtaaaagctttttttaaactagttgttgttcttcacaaaattatat
atatattacaaacttttaaaAACATTCTTTTCAGACTTAAATTATTTGAAAAACAATTGG
GCCCTGGCTATTTTTAAATTTAAAAATGAAAATTTAATCCTTCACTTACCTCCACACAGA
CACTCCTCTAAGCAAACACACCTTGAAAGCATATGGTAAACTATGCTAATTTGCTTAAAA
TAAAGCACTCCTATGCCGATAAACATGTTACTAAACAACACACTCAATACAAATTGCATG
AACAAAATTTCTAGCTCTAATAAAATACATATATCTATCTGAAAATAGAAGCTTCGAATG
TAGCCCTTCGTGCAGACATGTCCTCTGAGGAGTCTGTCCCAAGTGCAATTTCCAAATCTG
AAACACTGACTGGCCAGGAATCCAGTAACAAATACTTGGTTTCAAAGAAGGACTGAGAAA
GAACTCATATGGTTTCTTCTTCCTTTGAGTGAGTTTCTAATGCATACACCAAATCGCGCT
CTTGTTCTTCGCTAAAAAGAAGATTGCCAGGATCTTCTTCCTCCTCCAAAAGATGCAAAG
GTGTTTGCGGAGGGTATTTGTTCAAAATGAACTCTAGTTCTTCACCACTGATTTCCTTCT
GATTGAGAAGAACCTTTACAGCTTTGAGCAAAGCTGCATGGTGCCTCCTAAGGAGAGACA
CCGTCTTTCCATACATGTCGCAAAT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC002722A_C01 KMC002722A_c01
         (745 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_567691.1| FtsH protease, putative; protein id: At4g23940....    89  5e-17
gb|EAA38694.1| GLP_516_32988_35969 [Giardia lamblia ATCC 50803]        35  1.5
gb|AAD52701.1|AF091540_1 cysteine dioxygenase [Schistosoma japon...    34  1.9
ref|XP_242331.1| hypothetical protein XP_242331 [Rattus norvegicus]    33  4.3
ref|NP_781876.1| chromosome segregation protein smc2 [Clostridiu...    33  4.3

>ref|NP_567691.1| FtsH protease, putative; protein id: At4g23940.1 [Arabidopsis
            thaliana] gi|7484842|pir||T08913 cell division protein
            FtsH homolog T32A16.110 - Arabidopsis thaliana
            gi|2262118|gb|AAB63626.1| cell division protein isolog
            [Arabidopsis thaliana] gi|4972098|emb|CAB43894.1| cell
            division protein-like [Arabidopsis thaliana]
            gi|7269243|emb|CAB81312.1| cell division protein-like
            [Arabidopsis thaliana]
          Length = 946

 Score = 89.4 bits (220), Expect = 5e-17
 Identities = 45/70 (64%), Positives = 55/70 (78%)
 Frame = -1

Query: 745  ICDMYGKTVSLLRRHHAALLKAVKVLLNQKEISGEELEFILNKYPPQTPLHLLEEEEDPG 566
            I  MY KTVSLLR++  ALLK VKVLLNQKEISGE ++FIL+ YPPQTPL+ L +E++PG
Sbjct: 858  ISQMYNKTVSLLRQNQTALLKTVKVLLNQKEISGEAIDFILDHYPPQTPLNSLLQEQNPG 917

Query: 565  NLLFSEEQER 536
            +L F  E  R
Sbjct: 918  SLPFVPEHLR 927
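"Frame = -1" in the alignment above means the query was translated from its reverse-complement strand, starting at the last base (position 745). The sketch below is a minimal illustration of that, not part of the BLAST output: it assumes the standard genetic code, and the helper names are hypothetical. It translates only the final 25 bases of the contig (the last line of the FASTA record) and reproduces the first residues of the Query line.

```python
# Minimal sketch: translate reading frame -1 of a nucleotide sequence.
# Assumes the standard genetic code; function names are illustrative only.

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon table in TCAG order (first, second, third base).
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame_minus1(seq):
    """Translate frame -1: codons read from the last base backwards."""
    rc = reverse_complement(seq)
    # Unknown or ambiguous codons are reported as 'X'.
    return "".join(
        CODON_TABLE.get(rc[i:i + 3], "X") for i in range(0, len(rc) - 2, 3)
    )


# Last 25 bases of KMC002722A_c01 (positions 721-745 of the FASTA record).
tail = "CCGTCTTTCCATACATGTCGCAAAT"
print(translate_frame_minus1(tail))  # ICDMYGKT
```

The printed peptide matches the start of the Query line (ICDMYGKT...), which begins at position 745 and runs toward lower coordinates.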

>gb|EAA38694.1| GLP_516_32988_35969 [Giardia lamblia ATCC 50803]
          Length = 993

 Score = 34.7 bits (78), Expect = 1.5
 Identities = 31/106 (29%), Positives = 50/106 (46%), Gaps = 3/106 (2%)
 Frame = +3

Query: 342 KIEASNVALRADMSSEESVPSAISKSETLTGQESSNKYLVSKKD*ERTHMVSSSFE*VSN 521
           ++  S V   A  +   S+ S I  + +  G   S  +  S++    +H  +S+   VS+
Sbjct: 565 QVSQSGVRGGATHTLASSIISGIGSAGSTNGLRKS--HASSRRGDSNSHRHTSTHLSVSS 622

Query: 522 AYTK---SRSCSSLKRRLPGSSSSSKRCKGVCGGYLFKMNSSSSPL 650
           ++TK   S+S  SLK+   G  SSSK+     G +   M  S SPL
Sbjct: 623 SFTKLNASKSGKSLKKSTTGGKSSSKK---RAGSHRASMGKSKSPL 665

>gb|AAD52701.1|AF091540_1 cysteine dioxygenase [Schistosoma japonicum]
          Length = 212

 Score = 34.3 bits (77), Expect = 1.9
 Identities = 36/142 (25%), Positives = 59/142 (41%), Gaps = 18/142 (12%)
 Frame = -1

Query: 727 KTVSLLRRHHAALLKAVKVLLNQKEISGEELEFILN----------KYPPQTPLHLLEEE 578
           KTVS L      L+K ++++ NQKEI+  E+  ILN          KY      H     
Sbjct: 16  KTVSTLND----LIKTIRIIFNQKEINVNEIHKILNDFQCDFTEWQKYIYFNKTHYTRNL 71

Query: 577 EDPGN-------LLFSEEQERDLVYALETHSKEEETI*VLSQSFFE-TKYLLLDSWPVSV 422
            D GN       L +SE+Q   +      H   +     + ++ FE  KY  ++    S+
Sbjct: 72  IDEGNGRYNLFLLCWSEDQGTRIHDHSGAHCFVKLIKGCIKETIFEWPKYFTVEKSNYSI 131

Query: 421 SDLEIALGTDSSEDMSARRATF 356
           + +++ L   S  +M     T+
Sbjct: 132 NQIDLPLTVKSVSEMRPGDVTY 153

>ref|XP_242331.1| hypothetical protein XP_242331 [Rattus norvegicus]
          Length = 781

 Score = 33.1 bits (74), Expect = 4.3
 Identities = 28/112 (25%), Positives = 55/112 (49%), Gaps = 2/112 (1%)
 Frame = -1

Query: 649 SGEELEFILNKYPPQTPLH-LLEEEEDPGNLLFSEEQERDLVYALETHSKEEETI*VLSQ 473
           SG+EL+++++    ++    LLE   D  +L+FS    +  + ALET  +E+      ++
Sbjct: 294 SGDELQYVISSKNRRSKKQILLESSRDLYSLVFSSPLYKPSILALETGQREDVHDSDKAK 353

Query: 472 SFFETK-YLLLDSWPVSVSDLEIALGTDSSEDMSARRATFEASIFR*IYVFY 320
                + ++ +D W V V  L++     S+  + +  A  EASI   + V +
Sbjct: 354 HLGAVQGWMFMDLWFVEVCTLQVIHILSSAAHVGSPGAHGEASIAHRLIVLW 405

>ref|NP_781876.1| chromosome segregation protein smc2 [Clostridium tetani E88]
           gi|28203371|gb|AAO35813.1| chromosome segregation
           protein smc2 [Clostridium tetani E88]
          Length = 1186

 Score = 33.1 bits (74), Expect = 4.3
 Identities = 22/76 (28%), Positives = 39/76 (50%), Gaps = 3/76 (3%)
 Frame = -1

Query: 712 LRRHHAALLKAVKVLLNQKEISGEELEFILNKYPPQTPLHLLEEEEDPGNLLFSEEQERD 533
           + + +  L   +K+L N+KE   +ELE            +  +EEE+ GNLL+ E+ + D
Sbjct: 745 IEKENLKLKDRMKILKNEKETLNKELED-----------YGKDEEENKGNLLYLEKTQND 793

Query: 532 ---LVYALETHSKEEE 494
               ++ LE   K++E
Sbjct: 794 NEKKIHILENELKDKE 809

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 608,594,420
Number of Sequences: 1393205
Number of extensions: 13151381
Number of successful extensions: 35199
Number of sequences better than 10.0: 18
Number of HSP's better than 10.0 without gapping: 32517
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 34730
length of database: 448,689,247
effective HSP length: 120
effective length of database: 281,504,647
effective search space used: 35751090169
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
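
The statistics above are enough to relate the raw scores, bit scores, and E-values in this report. The sketch below is illustrative and not part of the BLAST output: it assumes the usual Karlin-Altschul relations, bit score = (Lambda * S - ln K) / ln 2 and E = search space * 2^(-bit score), with the gapped Lambda and K and the effective search space taken from the report. The function names are hypothetical. The E-values it gives should be close to the reported ones but need not match exactly.

```python
import math

# Values copied from the report above (gapped Lambda and K, search space).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
EFFECTIVE_SEARCH_SPACE = 35751090169


def bit_score(raw_score):
    """Convert a raw alignment score to a bit score."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect_value(bits):
    """Approximate E-value for a bit score in this search space."""
    return EFFECTIVE_SEARCH_SPACE * 2.0 ** (-bits)


def effective_db_length(total_letters, n_sequences, hsp_length):
    """Database length after subtracting the effective HSP length per sequence."""
    return total_letters - n_sequences * hsp_length


# Raw scores of the reported hits (the numbers in parentheses after "bits").
for raw in (220, 78, 77, 74):
    bits = bit_score(raw)
    print(raw, round(bits, 1), f"{expect_value(bits):.1e}")

print(effective_db_length(448689247, 1393205, 120))
```

With these inputs the rounded bit scores agree with the reported 89.4, 34.7, 34.3, and 33.1, and the last line reproduces the reported effective database length of 281,504,647.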


EST assemble image


no.  clone         accession  position
1    MPDL082g03_f  AV780778     1  598
2    SPDL014a01_f  BP052830   108  596
3    GNLf004h12    BP075050   137  550
4    SPDL087e03_f  BP057463   216  595
5    SPDL042g02_f  BP054664   236  751




Lotus japonicus
Kazusa DNA Research Institute