KMC000007A_c02
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC000007A_C02 KMC000007A_c02
GGGAGACAACAAGTTAGGCTGATTGGCATGATGGCATCAATTTCAGAATCTACAACTTCA
AAATATTAAAAAAAACTACAATAATTTATTAACACTGTAAAATAATTATAGCAACAACGG
AAAATTGTGTTACGCAAGAATACAATTTCCTATCCTTGGGGAATAGATCCCGAATCTAAA
GGGCTTTCTTTACTTGAAAGAAGTTGGAACGCTCTTTGTTTGCCGCATCTACATCCTCTG
GAGATGCCACTGCAACCACAATCCCTTTATCCTTAACTGCTTCCATTGCATCAATAAAAT
TTCTGAAGTCCTTCACACTAGTAGATAATATCTCTTCACGTCTTCTTTGCCTTTCTTCCT
CTGTGATACCCAGTAAGTGCCGCAACAAACTACTATAACCTTTGGCATCAGGAAGTTGAT
ATGAGTCTACATCCCCAATGGTTCCAATTATGGCTTTTGTTAGAGTATCATCATCTATTT
CCAATTCTCTTAAGAAATCCCCAGTTCCATCATATACATCGAGCGTCTTCAGTAAATTGG
GATCACGATAAGATAAGAAGGAGAACACTCCTGAATGTGTATCAAAATCACAGAAACCTC
CATAAG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000007A_C02 KMC000007A_c02
         (606 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAB02957.1| zinc metalloprotease (insulinase family) [Arabid...   246  2e-64
ref|NP_188548.2| metalloprotease, putative; protein id: At3g1917...   246  2e-64
pir||A96533 probable zinc metalloproteinase [imported] - Arabido...   241  4e-63
gb|AAG13049.1|AC011807_8 Putative zinc metalloprotease [Arabidop...   241  4e-63
ref|NP_175386.2| hydrogenase protein, putative; protein id: At1g...   241  4e-63

>dbj|BAB02957.1| zinc metalloprotease (insulinase family) [Arabidopsis thaliana]
          Length = 1052

 Score =  246 bits (628), Expect = 2e-64
 Identities = 115/142 (80%), Positives = 134/142 (93%)
 Frame = -3

Query: 604  YGGFCDFDTHSGVFSFLSYRDPNLLKTLDVYDGTGDFLRELEIDDDTLTKAIIGTIGDVD 425
            YGGFCDFD+HSGVFS+LSYRDPNLLKTLD+YDGTGDFLR L++D +TLTKAIIGTIGDVD
Sbjct: 911  YGGFCDFDSHSGVFSYLSYRDPNLLKTLDIYDGTGDFLRGLDVDQETLTKAIIGTIGDVD 970

Query: 424  SYQLPDAKGYSSLLRHLLGITEEERQRRREEILSTSVKDFRNFIDAMEAVKDKGIVVAVA 245
            SYQLPDAKGYSSLLRHLLG+T+EERQR+REEIL+TS+KDF++F  A++ V+DKG+ VAVA
Sbjct: 971  SYQLPDAKGYSSLLRHLLGVTDEERQRKREEILTTSLKDFKDFAQAIDVVRDKGVAVAVA 1030

Query: 244  SPEDVDAANKERSNFFQVKKAL 179
            S ED+DAAN ERSNFF+VKKAL
Sbjct: 1031 SAEDIDAANNERSNFFEVKKAL 1052

>ref|NP_188548.2| metalloprotease, putative; protein id: At3g19170.1, supported by
            cDNA: gi_19699072, supported by cDNA: gi_20259503
            [Arabidopsis thaliana] gi|19699073|gb|AAL90904.1|
            AT3g19170/MVI11_8 [Arabidopsis thaliana]
            gi|20259504|gb|AAM13872.1| putative metalloprotease
            [Arabidopsis thaliana] gi|26983906|gb|AAN86205.1|
            putative metalloprotease [Arabidopsis thaliana]
          Length = 1080

 Score =  246 bits (628), Expect = 2e-64
 Identities = 115/142 (80%), Positives = 134/142 (93%)
 Frame = -3

Query: 604  YGGFCDFDTHSGVFSFLSYRDPNLLKTLDVYDGTGDFLRELEIDDDTLTKAIIGTIGDVD 425
            YGGFCDFD+HSGVFS+LSYRDPNLLKTLD+YDGTGDFLR L++D +TLTKAIIGTIGDVD
Sbjct: 939  YGGFCDFDSHSGVFSYLSYRDPNLLKTLDIYDGTGDFLRGLDVDQETLTKAIIGTIGDVD 998

Query: 424  SYQLPDAKGYSSLLRHLLGITEEERQRRREEILSTSVKDFRNFIDAMEAVKDKGIVVAVA 245
            SYQLPDAKGYSSLLRHLLG+T+EERQR+REEIL+TS+KDF++F  A++ V+DKG+ VAVA
Sbjct: 999  SYQLPDAKGYSSLLRHLLGVTDEERQRKREEILTTSLKDFKDFAQAIDVVRDKGVAVAVA 1058

Query: 244  SPEDVDAANKERSNFFQVKKAL 179
            S ED+DAAN ERSNFF+VKKAL
Sbjct: 1059 SAEDIDAANNERSNFFEVKKAL 1080

>pir||A96533 probable zinc metalloproteinase [imported] - Arabidopsis thaliana
          Length = 1077

 Score =  241 bits (616), Expect = 4e-63
 Identities = 114/141 (80%), Positives = 132/141 (92%)
 Frame = -3

Query: 604  YGGFCDFDTHSGVFSFLSYRDPNLLKTLDVYDGTGDFLRELEIDDDTLTKAIIGTIGDVD 425
            YGG CDFD+HSGVFSFLSYRDPNLLKTLD+YDGTGDFLR L++D+DTLTKAIIGTIGDVD
Sbjct: 935  YGGSCDFDSHSGVFSFLSYRDPNLLKTLDIYDGTGDFLRGLDVDEDTLTKAIIGTIGDVD 994

Query: 424  SYQLPDAKGYSSLLRHLLGITEEERQRRREEILSTSVKDFRNFIDAMEAVKDKGIVVAVA 245
            SYQLPDAKGY+SLLRHLL +T+EERQ RREEILSTS+KDF+ F +A+++V DKG+ VAVA
Sbjct: 995  SYQLPDAKGYTSLLRHLLNVTDEERQIRREEILSTSLKDFKEFAEAIDSVSDKGVAVAVA 1054

Query: 244  SPEDVDAANKERSNFFQVKKA 182
            S ED+DAAN+ERSNFF+VKKA
Sbjct: 1055 SQEDIDAANRERSNFFEVKKA 1075

>gb|AAG13049.1|AC011807_8 Putative zinc metalloprotease [Arabidopsis thaliana]
          Length = 1077

 Score =  241 bits (616), Expect = 4e-63
 Identities = 114/141 (80%), Positives = 132/141 (92%)
 Frame = -3

Query: 604  YGGFCDFDTHSGVFSFLSYRDPNLLKTLDVYDGTGDFLRELEIDDDTLTKAIIGTIGDVD 425
            YGG CDFD+HSGVFSFLSYRDPNLLKTLD+YDGTGDFLR L++D+DTLTKAIIGTIGDVD
Sbjct: 935  YGGSCDFDSHSGVFSFLSYRDPNLLKTLDIYDGTGDFLRGLDVDEDTLTKAIIGTIGDVD 994

Query: 424  SYQLPDAKGYSSLLRHLLGITEEERQRRREEILSTSVKDFRNFIDAMEAVKDKGIVVAVA 245
            SYQLPDAKGY+SLLRHLL +T+EERQ RREEILSTS+KDF+ F +A+++V DKG+ VAVA
Sbjct: 995  SYQLPDAKGYTSLLRHLLNVTDEERQIRREEILSTSLKDFKEFAEAIDSVSDKGVAVAVA 1054

Query: 244  SPEDVDAANKERSNFFQVKKA 182
            S ED+DAAN+ERSNFF+VKKA
Sbjct: 1055 SQEDIDAANRERSNFFEVKKA 1075

>ref|NP_175386.2| hydrogenase protein, putative; protein id: At1g49630.1, supported by
            cDNA: gi_18377703 [Arabidopsis thaliana]
            gi|18377704|gb|AAL67002.1| putative hydrogenase protein
            [Arabidopsis thaliana] gi|28393925|gb|AAO42370.1|
            putative hydrogenase [Arabidopsis thaliana]
          Length = 1080

 Score =  241 bits (616), Expect = 4e-63
 Identities = 114/141 (80%), Positives = 132/141 (92%)
 Frame = -3

Query: 604  YGGFCDFDTHSGVFSFLSYRDPNLLKTLDVYDGTGDFLRELEIDDDTLTKAIIGTIGDVD 425
            YGG CDFD+HSGVFSFLSYRDPNLLKTLD+YDGTGDFLR L++D+DTLTKAIIGTIGDVD
Sbjct: 938  YGGSCDFDSHSGVFSFLSYRDPNLLKTLDIYDGTGDFLRGLDVDEDTLTKAIIGTIGDVD 997

Query: 424  SYQLPDAKGYSSLLRHLLGITEEERQRRREEILSTSVKDFRNFIDAMEAVKDKGIVVAVA 245
            SYQLPDAKGY+SLLRHLL +T+EERQ RREEILSTS+KDF+ F +A+++V DKG+ VAVA
Sbjct: 998  SYQLPDAKGYTSLLRHLLNVTDEERQIRREEILSTSLKDFKEFAEAIDSVSDKGVAVAVA 1057

Query: 244  SPEDVDAANKERSNFFQVKKA 182
            S ED+DAAN+ERSNFF+VKKA
Sbjct: 1058 SQEDIDAANRERSNFFEVKKA 1078

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 497,947,488
Number of Sequences: 1393205
Number of extensions: 10707148
Number of successful extensions: 32021
Number of sequences better than 10.0: 65
Number of HSP's better than 10.0 without gapping: 30393
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 31939
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 23997478008
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MPDL073a12_f AV780234 1 548
2 MFBL048g05_f BP043729 8 476
3 SPDL067h08_f BP056190 8 512
4 SPDL013e07_f BP052801 8 502
5 SPDL051d09_f BP055218 8 356
6 MFBL013g07_f BP041926 9 471
7 MFBL024g11_f BP042480 9 504
8 MRL004g08_f BP083921 11 370
9 SPDL005f01_f BP052298 12 535
10 MRL037f05_f BP085553 14 539
11 MPDL055a03_f AV779268 16 538
12 SPDL003b03_f BP052154 19 504
13 MRL018f04_f BP084648 23 369
14 SPDL093a08_f BP057810 24 563
15 MRL003g04_f BP083867 43 466
16 GENLf001c06 BP062416 44 515
17 MFBL008d12_f BP041670 51 539
18 MRL005f08_f BP083969 77 445
19 SPDL028h02_f BP053762 78 539
20 GENLf028a06 BP063805 78 567
21 SPDL014a12_f BP052840 82 616
22 MWL031f09_f AV769088 98 557




Lotus japonicus
Kazusa DNA Research Institute