KMC007167A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC007167A_C01 KMC007167A_c01
tgagcacccccccactggttattggatgaatcattcaaacgggcccagtcaagGAAATAG
AATCATCTACAACTTATGATCTGAAATGATCTATTCTTCTATTCCCAGGAAAGGCACATT
GATCCGGGAACACATGAATTACATACAAGCAGACAACAACCGTCCACGAGTCCTACACAA
AGGCAAGGAGATAAAACAAATAGAGCTAAAACACCTACTCAGTAATCTTTACACGGGCTC
TACCAGGATTCTTCATTAAAACAACTTCTCCAAATCCCAAGGGTCCCCAAAGTAAAACAA
CTTGGCACATGTCTACAAAGACATAACATATTCCCTATGCAGCTAGACTCTGCTAGGAAG
CTCCATACTTGCCTATTTTGAAATAACTATCTGGCTTGACTGTCACCATCCTCTTCTGAT
ATGTTAAACAAGAACTTGGGAAGAGATGTGATTCGAGGAAGATGGAGAAGCAAATCGCTA
AACGAGTCTTTCCTGGACATGCCTAGTCCTTGCTTGCCACCAGAAGTGTCCTTTGAGTCC
TCCTCAGAAGGCTTGACATCTTTTTCAGCAACAGGTCCATCGACAGTACTATCTT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC007167A_C01 KMC007167A_c01
         (595 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_194807.1| putative protein; protein id: At4g30780.1 [Arab...    71  1e-11
pir||F84632 hypothetical protein At2g24100 [imported] - Arabidop...    67  2e-10
ref|NP_565562.1| expressed protein; protein id: At2g24100.1 [Ara...    67  2e-10
sp|Q07162|PRTG_ERWCH Secreted protease G precursor (ProG) gi|107...    34  1.6
ref|XP_051005.1| KIAA1297 protein [Homo sapiens]                       33  3.6

>ref|NP_194807.1| putative protein; protein id: At4g30780.1 [Arabidopsis thaliana]
           gi|25407680|pir||C85360 hypothetical protein AT4g30780
           [imported] - Arabidopsis thaliana
           gi|5725442|emb|CAB52451.1| putative protein [Arabidopsis
           thaliana] gi|7269979|emb|CAB79796.1| putative protein
           [Arabidopsis thaliana] gi|28393759|gb|AAO42289.1|
           unknown protein [Arabidopsis thaliana]
           gi|28973227|gb|AAO63938.1| unknown protein [Arabidopsis
           thaliana]
          Length = 589

 Score = 70.9 bits (172), Expect = 1e-11
 Identities = 35/47 (74%), Positives = 38/47 (80%)
 Frame = -3

Query: 530 DTSGGKQGLGMSRKDSFSDLLLHLPRITSLPKFLFNISEEDGDSQAR 390
           DT+   +  GM RKDSFSDLLLHLPRITSLPKFL NISEEDGD+  R
Sbjct: 543 DTASSSKPQGMLRKDSFSDLLLHLPRITSLPKFLSNISEEDGDAYNR 589

>pir||F84632 hypothetical protein At2g24100 [imported] - Arabidopsis thaliana
           gi|14596217|gb|AAK68836.1| Unknown protein [Arabidopsis
           thaliana] gi|15809913|gb|AAL06884.1| At2g24100/F27D4.1
           [Arabidopsis thaliana]
          Length = 466

 Score = 66.6 bits (161), Expect = 2e-10
 Identities = 33/39 (84%), Positives = 34/39 (86%)
 Frame = -3

Query: 524 SGGKQGLGMSRKDSFSDLLLHLPRITSLPKFLFNISEED 408
           S  K   GMSRKDSFSDLL+HLPRITSLPKFLFNISEED
Sbjct: 428 SSSKPLQGMSRKDSFSDLLVHLPRITSLPKFLFNISEED 466

>ref|NP_565562.1| expressed protein; protein id: At2g24100.1 [Arabidopsis thaliana]
           gi|20197500|gb|AAD03372.2| expressed protein
           [Arabidopsis thaliana]
          Length = 463

 Score = 66.6 bits (161), Expect = 2e-10
 Identities = 33/39 (84%), Positives = 34/39 (86%)
 Frame = -3

Query: 524 SGGKQGLGMSRKDSFSDLLLHLPRITSLPKFLFNISEED 408
           S  K   GMSRKDSFSDLL+HLPRITSLPKFLFNISEED
Sbjct: 425 SSSKPLQGMSRKDSFSDLLVHLPRITSLPKFLFNISEED 463

>sp|Q07162|PRTG_ERWCH Secreted protease G precursor (ProG) gi|1073289|pir||S48132
           metalloproteinase G (EC 3.4.24.-) - Erwinia chrysanthemi
           gi|297861|emb|CAA50501.1| protease G [Erwinia
           chrysanthemi]
          Length = 475

 Score = 33.9 bits (76), Expect = 1.6
 Identities = 21/66 (31%), Positives = 30/66 (44%)
 Frame = +2

Query: 137 NYIQADNNRPRVLHKGKEIKQIELKHLLSNLYTGSTRILH*NNFSKSQGSPK*NNLAHVY 316
           NY QADN RP +   G+     E+ H L         + H  ++  S G+P        Y
Sbjct: 165 NYNQADNQRPDINEFGRNTLTHEIGHTLG--------LYHPGDYDASDGNPG-------Y 209

Query: 317 KDITYS 334
           KD+TY+
Sbjct: 210 KDVTYA 215

>ref|XP_051005.1| KIAA1297 protein [Homo sapiens]
          Length = 281

 Score = 32.7 bits (73), Expect = 3.6
 Identities = 15/31 (48%), Positives = 19/31 (60%)
 Frame = +1

Query: 124 PGTHELHTSRQQPSTSPTQRQGDKTNRAKTP 216
           PG HEL  +RQ+P+TSP  R G     A +P
Sbjct: 146 PGWHELPPARQRPTTSPHSRTGACKRAAASP 176

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 531,131,686
Number of Sequences: 1393205
Number of extensions: 11410485
Number of successful extensions: 33210
Number of sequences better than 10.0: 20
Number of HSP's better than 10.0 without gapping: 32067
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 33181
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 22854740960
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 SPDL038f04_f BP054403 1 357
2 GENf036h11 BP059914 54 593
3 SPD091h06_f BP051305 81 558
4 SPDL032e07_f BP053999 92 601




Lotus japonicus
Kazusa DNA Research Institute