KMC005302A_c01

Fasta Sequence
>KMC005302A_C01 KMC005302A_c01
tgggtcgggccccCATTTCAAGCTTCCTGCCCACATTAAACATCATATAAAAAATGACTA
TCCTCGATGAGCAACAAGGAGGTGACAATGCCACAAGGTGAAGAGAACAAAGAGCATTCT
CAACAAATCACCAATGACCTGGTCCTTGGATGGTGGCTTTCCATTGCCCAAACCATTCTC
AAGTGCTGGATTCATAGCCCCTGGATATCAATTCATATGGCAAATCTTTCAGAGATTATT
ATGGTGAAAGCGAAAGGCAAAAGTCTGTGGAGGAATTATACCGACTACAACACATCAACC
AAACATATGAATTTGCCAAGACAATGAGGGAGGAGTATGGGAAATTGAATAAAGGAGAAA
TGGGTATATGGGAATGTTGTGAGCTACTTGATGAAATTGTGGATGCTAGTGATCCTGACT
TGGAAGAATCTCAAATTAAGCATGCTTTGCAGTCAGCTGAAGCTATTAGGAAGGACTATC
CTAATGAAGATTGGTTACATTTAACAGCTCTTATTCATGAT
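
The BLASTX alignments reported for this sequence use query coordinates in nucleotides and reading frames +2 and +3. As a minimal sketch (not part of the original record), the following Python translates two of the aligned query ranges with the standard genetic code so they can be compared with the "Query:" lines of the report. The names `FRAGMENT`, `FRAGMENT_START` and `translate` are hypothetical helpers introduced only for this illustration; `FRAGMENT` is nucleotides 146-263 copied from the sequence above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Nucleotides 146-263 of KMC005302A_c01 (copied from the FASTA record).
FRAGMENT_START = 146
FRAGMENT = (
    "TTGGATGGTGGCTTTCCATTGCCCAAACCATTCTC"                            # 146-180
    "AAGTGCTGGATTCATAGCCCCTGGATATCAATTCATATGGCAAATCTTTCAGAGATTATT"   # 181-240
    "ATGGTGAAAGCGAAAGGCAAAAG"                                        # 241-263
)


def translate(start, end):
    """Translate query nucleotides start..end (1-based, inclusive)."""
    region = FRAGMENT[start - FRAGMENT_START:end - FRAGMENT_START + 1].upper()
    # Unknown or incomplete codons are reported as "X".
    return "".join(
        CODON_TABLE.get(region[i:i + 3], "X")
        for i in range(0, len(region) - 2, 3)
    )


frame2 = translate(146, 202)   # 146 % 3 == 2, i.e. frame +2
frame3 = translate(204, 263)   # 204 % 3 == 0, i.e. frame +3
print(frame2)  # LDGGFPLPKPFSSAGFIAP
print(frame3)  # DINSYGKSFRDYYGESERQK
```

The frame +2 range ends at nucleotide 202 and the frame +3 range starts at 204, so the two aligned segments lie in different reading frames of the same strand.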


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC005302A_C01 KMC005302A_c01
         (521 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_194356.1| putative protein; protein id: At4g26260.1 [Arab...   173  2e-42
gb|AAN13052.1| unknown protein [Arabidopsis thaliana]                 173  2e-42
ref|NP_565459.1| expressed protein; protein id: At2g19800.1, sup...   169  1e-41
gb|AAM63498.1| unknown [Arabidopsis thaliana]                         167  7e-41
gb|AAF43953.1|AC012188_30 Strong similarity to an unknown protei...   162  2e-39

>ref|NP_194356.1| putative protein; protein id: At4g26260.1 [Arabidopsis thaliana]
           gi|7487428|pir||T06010 hypothetical protein T25K17.70 -
           Arabidopsis thaliana gi|4539422|emb|CAB38955.1| putative
           protein [Arabidopsis thaliana]
           gi|7269477|emb|CAB79481.1| putative protein [Arabidopsis
           thaliana]
          Length = 318

 Score =  173 bits (438), Expect(2) = 2e-42
 Identities = 80/106 (75%), Positives = 91/106 (85%)
 Frame = +3

Query: 204 DINSYGKSFRDYYGESERQKSVEELYRLQHINQTYEFAKTMREEYGKLNKGEMGIWECCE 383
           ++N++G+ FRDY  ESERQK VEE YRLQHINQT +F K MR EYGKL+K  M IWECCE
Sbjct: 50  EMNAFGRQFRDYDVESERQKGVEEFYRLQHINQTVDFVKKMRAEYGKLDKMVMSIWECCE 109

Query: 384 LLDEIVDASDPDLEESQIKHALQSAEAIRKDYPNEDWLHLTALIHD 521
           LL+E+VD SDPDL+E QI+H LQSAEAIRKDYPNEDWLHLTALIHD
Sbjct: 110 LLNEVVDESDPDLDEPQIQHLLQSAEAIRKDYPNEDWLHLTALIHD 155

 Score = 21.2 bits (43), Expect(2) = 2e-42
 Identities = 10/21 (47%), Positives = 14/21 (66%), Gaps = 2/21 (9%)
 Frame = +2

Query: 146 LDGGFPLPKPFSS--AGFIAP 202
           LDGGF +PK  ++    F+AP
Sbjct: 29  LDGGFSMPKMDTNDDEAFLAP 49

>gb|AAN13052.1| unknown protein [Arabidopsis thaliana]
          Length = 317

 Score =  173 bits (438), Expect(2) = 2e-42
 Identities = 80/106 (75%), Positives = 91/106 (85%)
 Frame = +3

Query: 204 DINSYGKSFRDYYGESERQKSVEELYRLQHINQTYEFAKTMREEYGKLNKGEMGIWECCE 383
           ++N++G+ FRDY  ESERQK VEE YRLQHINQT +F K MR EYGKL+K  M IWECCE
Sbjct: 49  EMNAFGRQFRDYDVESERQKGVEEFYRLQHINQTVDFVKKMRAEYGKLDKMVMSIWECCE 108

Query: 384 LLDEIVDASDPDLEESQIKHALQSAEAIRKDYPNEDWLHLTALIHD 521
           LL+E+VD SDPDL+E QI+H LQSAEAIRKDYPNEDWLHLTALIHD
Sbjct: 109 LLNEVVDESDPDLDEPQIQHLLQSAEAIRKDYPNEDWLHLTALIHD 154

 Score = 21.2 bits (43), Expect(2) = 2e-42
 Identities = 10/21 (47%), Positives = 14/21 (66%), Gaps = 2/21 (9%)
 Frame = +2

Query: 146 LDGGFPLPKPFSS--AGFIAP 202
           LDGGF +PK  ++    F+AP
Sbjct: 28  LDGGFSMPKMDTNDDEAFLAP 48

>ref|NP_565459.1| expressed protein; protein id: At2g19800.1, supported by cDNA:
           254633. [Arabidopsis thaliana]
           gi|20197290|gb|AAC62136.2| expressed protein
           [Arabidopsis thaliana]
          Length = 317

 Score =  169 bits (429), Expect = 1e-41
 Identities = 79/107 (73%), Positives = 92/107 (85%), Gaps = 1/107 (0%)
 Frame = +3

Query: 204 DINSYGKSFRDYY-GESERQKSVEELYRLQHINQTYEFAKTMREEYGKLNKGEMGIWECC 380
           D+N  G SFRDY  GESERQ+ VEE YR+QHI+QTY+F K MR+EYGKLNK EM IWECC
Sbjct: 48  DMNFLGHSFRDYENGESERQQGVEEFYRMQHIHQTYDFVKKMRKEYGKLNKMEMSIWECC 107

Query: 381 ELLDEIVDASDPDLEESQIKHALQSAEAIRKDYPNEDWLHLTALIHD 521
           ELL+ +VD SDPDL+E QI+H LQ+AEAIR+DYP+EDWLHLTALIHD
Sbjct: 108 ELLNNVVDESDPDLDEPQIQHLLQTAEAIRRDYPDEDWLHLTALIHD 154

>gb|AAM63498.1| unknown [Arabidopsis thaliana]
          Length = 317

 Score =  167 bits (423), Expect = 7e-41
 Identities = 78/107 (72%), Positives = 91/107 (84%), Gaps = 1/107 (0%)
 Frame = +3

Query: 204 DINSYGKSFRDYYG-ESERQKSVEELYRLQHINQTYEFAKTMREEYGKLNKGEMGIWECC 380
           D+N  G SFRDY   ESERQ+ VEE YR+QHI+QTY+F K MR+EYGKLNK EM IWECC
Sbjct: 48  DMNFLGHSFRDYENDESERQQGVEEFYRMQHIHQTYDFVKKMRKEYGKLNKMEMSIWECC 107

Query: 381 ELLDEIVDASDPDLEESQIKHALQSAEAIRKDYPNEDWLHLTALIHD 521
           ELL+ +VD SDPDL+E QI+H LQ+AEAIR+DYP+EDWLHLTALIHD
Sbjct: 108 ELLNNVVDESDPDLDEPQIQHLLQTAEAIRRDYPDEDWLHLTALIHD 154

>gb|AAF43953.1|AC012188_30 Strong similarity to an unknown protein from Arabidopsis thaliana
           gb|AL049171.1
          Length = 422

 Score =  162 bits (410), Expect = 2e-39
 Identities = 72/104 (69%), Positives = 87/104 (83%)
 Frame = +3

Query: 210 NSYGKSFRDYYGESERQKSVEELYRLQHINQTYEFAKTMREEYGKLNKGEMGIWECCELL 389
           NS+G++FRDY  ESER++ VEE YR+ HI QT +F + MREEY KLN+ EM IWECCELL
Sbjct: 43  NSFGRTFRDYDAESERRRGVEEFYRVNHIGQTVDFVRKMREEYEKLNRTEMSIWECCELL 102

Query: 390 DEIVDASDPDLEESQIKHALQSAEAIRKDYPNEDWLHLTALIHD 521
           +E +D SDPDL+E QI+H LQ+AEAIRKDYP+EDWLHLT LIHD
Sbjct: 103 NEFIDESDPDLDEPQIEHLLQTAEAIRKDYPDEDWLHLTGLIHD 146

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 487,817,737
Number of Sequences: 1393205
Number of extensions: 11254940
Number of successful extensions: 28694
Number of sequences better than 10.0: 44
Number of HSP's better than 10.0 without gapping: 27791
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 28663
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 16731298976
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
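
The search-space figures in the statistics block can be reproduced from the other values printed in the report. The sketch below is an illustration only: it assumes the usual BLAST length correction (the effective HSP length is subtracted from the translated query length and once per database sequence) and the usual raw-to-bit score conversion with the gapped Lambda and K. The function names are hypothetical; all numbers are copied from the report.

```python
import math

# Values copied from the BLASTX report.
QUERY_NT = 521                 # query length in nucleotides
DB_LETTERS = 448_689_247       # length of database
DB_SEQS = 1_393_205            # number of sequences in database
EFF_HSP_LEN = 115              # effective HSP length
LAMBDA_GAPPED = 0.267          # gapped Lambda (rounded as printed)
K_GAPPED = 0.0410              # gapped K (rounded as printed)


def effective_db_length(db_letters, db_seqs, hsp_len):
    """Database length minus one effective HSP length per sequence."""
    return db_letters - db_seqs * hsp_len


def effective_query_length(query_nt, hsp_len):
    """Translated query length (nt // 3) minus the effective HSP length."""
    return query_nt // 3 - hsp_len


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


eff_db = effective_db_length(DB_LETTERS, DB_SEQS, EFF_HSP_LEN)
eff_query = effective_query_length(QUERY_NT, EFF_HSP_LEN)
search_space = eff_query * eff_db

print(eff_db)        # 288470672
print(eff_query)     # 58
print(search_space)  # 16731298976

# Raw scores from the report (in parentheses after the bit scores).
for raw in (438, 429, 423, 410, 43):
    print(raw, bit_score(raw, LAMBDA_GAPPED, K_GAPPED))
```

The first three values match "effective length of database" and "effective search space used" exactly. The computed bit scores should land within about one bit of the reported ones; small differences are expected because Lambda and K are printed in rounded form.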


EST assemble image


no.  clone        accession  start  end
1    MPD051b04_f  AV773445       1  530
2    MPD097b09_f  AV776332       2  475
3    MPD086f07_f  AV775661      16  382
4    MPD088d01_f  AV775778      30  491
5    MPD053h10_f  AV773597      45  147




Lotus japonicus
Kazusa DNA Research Institute