KMC004339A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC004339A_C01 KMC004339A_c01
AGCGATTGGGAATTGGGAACACCAAATCCATTTAATTTTATTTCACCAAAACACAGGTGA
GTTGCAAGATACAAATTCTGTAGAAGTGATGGATTATTAATTCCTTATGGGACAAGGTAC
AAAAATAGTGTGCAAAAGGAAAAGGGTAAACAAAAGATAGAGAACAAAGACCCACACACA
TGTTGTAGTGATACTCAATCAATCACAGGCAGCAATCCAGGGCACAGCAACAACATAATC
CAGCACAACATCCTTTCCAGAACCCATCACCCCGGGCTAGGGGGTTTTCACGTGGAACAC
TTGGGTGGGGGACCATCGGTGGGGTGGGAATAGCCCACAGGTGGTGGTATAGTATTAAGA
TAAGAAGATGGAGGATTGCTCTGACCCACA


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004339A_C01 KMC004339A_c01
         (390 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAB01942.1| gb|AAD15386.1~gene_id:MMP21.2~similar to unknown...    56  7e-08
ref|NP_181673.1| unknown protein; protein id: At2g41420.1 [Arabi...    44  1e-05
ref|NP_565739.1| expressed protein; protein id: At2g32190.1, sup...    48  3e-05
ref|NP_563734.1| expressed protein; protein id: At1g05340.1, sup...    48  3e-05
ref|NP_565740.1| expressed protein; protein id: At2g32210.1, sup...    47  4e-05

>dbj|BAB01942.1| gb|AAD15386.1~gene_id:MMP21.2~similar to unknown protein
           [Arabidopsis thaliana] gi|21593174|gb|AAM65123.1|
           unknown [Arabidopsis thaliana]
          Length = 72

 Score = 55.8 bits (133), Expect(2) = 7e-08
 Identities = 20/36 (55%), Positives = 23/36 (63%)
 Frame = -3

Query: 316 MVPHPSVPRENPLARGDGFWKGCCAGLCCCCALDCC 209
           MV  P        ++GDGFWKGCCA +CCCC LD C
Sbjct: 36  MVGDPPAAAVETKSKGDGFWKGCCAAICCCCVLDAC 71

 Score = 20.8 bits (42), Expect(2) = 7e-08
 Identities = 14/33 (42%), Positives = 17/33 (51%), Gaps = 4/33 (12%)
 Frame = -2

Query: 389 VGQSNPPSSYLNTIPPPVGYSHPTD----GPPP 303
           V + +  SS   T PPP+GY  PT     G PP
Sbjct: 11  VEKPSQTSSGPYTSPPPIGY--PTRDAMVGDPP 41

>ref|NP_181673.1| unknown protein; protein id: At2g41420.1 [Arabidopsis thaliana]
           gi|7487447|pir||T02437 hypothetical protein At2g41420
           [imported] - Arabidopsis thaliana
           gi|20197423|gb|AAM15069.1| unknown protein [Arabidopsis
           thaliana] gi|27808506|gb|AAO24533.1| At2g41420
           [Arabidopsis thaliana]
          Length = 98

 Score = 44.3 bits (103), Expect(2) = 1e-05
 Identities = 18/39 (46%), Positives = 21/39 (53%), Gaps = 1/39 (2%)
 Frame = -3

Query: 322 PPMVP-HPSVPRENPLARGDGFWKGCCAGLCCCCALDCC 209
           PP  P +P  P+        GF +GC A LCCCC LD C
Sbjct: 59  PPYAPQYPPPPQHQQQQSSPGFLEGCLAALCCCCLLDAC 97

 Score = 25.0 bits (53), Expect(2) = 1e-05
 Identities = 12/31 (38%), Positives = 14/31 (44%), Gaps = 3/31 (9%)
 Frame = -2

Query: 383 QSNPPSSYLNTIPPPVGY---SHPTDGPPPK 300
           Q  PP  Y     PP GY    +P  G PP+
Sbjct: 15  QGYPPEGYPKDAYPPQGYPPQGYPQQGYPPQ 45

>ref|NP_565739.1| expressed protein; protein id: At2g32190.1, supported by cDNA:
           40344. [Arabidopsis thaliana] gi|25408217|pir||A84730
           hypothetical protein At2g32190 [imported] - Arabidopsis
           thaliana gi|4263700|gb|AAD15386.1| expressed protein
           [Arabidopsis thaliana] gi|21593591|gb|AAM65558.1|
           unknown [Arabidopsis thaliana]
           gi|26451539|dbj|BAC42867.1| unknown protein [Arabidopsis
           thaliana] gi|28827664|gb|AAO50676.1| unknown protein
           [Arabidopsis thaliana]
          Length = 71

 Score = 48.1 bits (113), Expect = 3e-05
 Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 17/59 (28%)
 Frame = -3

Query: 334 AIPTPPM-----------------VPHPSVPRENPLARGDGFWKGCCAGLCCCCALDCC 209
           A PTPP+                 + H +V      ++GDGF KGC A +CCCC LD C
Sbjct: 12  AYPTPPVSTGPYMTPPPLGYPTSDISHATVAPVETKSKGDGFLKGCLAAMCCCCVLDAC 70

>ref|NP_563734.1| expressed protein; protein id: At1g05340.1, supported by cDNA:
           20380., supported by cDNA: gi_13430463, supported by
           cDNA: gi_15810660 [Arabidopsis thaliana]
           gi|25406839|pir||C86188 hypothetical protein [imported]
           - Arabidopsis thaliana gi|2388562|gb|AAB71443.1| EST
           gb|ATTS0295  comes from this gene. [Arabidopsis
           thaliana] gi|13430464|gb|AAK25854.1|AF360144_1 unknown
           protein [Arabidopsis thaliana]
           gi|15810661|gb|AAL07255.1| unknown protein [Arabidopsis
           thaliana] gi|21554110|gb|AAM63190.1| unknown
           [Arabidopsis thaliana]
          Length = 72

 Score = 47.8 bits (112), Expect = 3e-05
 Identities = 21/48 (43%), Positives = 27/48 (55%), Gaps = 8/48 (16%)
 Frame = -3

Query: 328 PTPPMVPHPSVPRENPLA--------RGDGFWKGCCAGLCCCCALDCC 209
           P PP+    + P    +A        +GDGF+KGC A +CCCCALD C
Sbjct: 24  PPPPIGYPTNQPSHGSVAQGKVETKSKGDGFFKGCLAAMCCCCALDIC 71

>ref|NP_565740.1| expressed protein; protein id: At2g32210.1, supported by cDNA:
           31665., supported by cDNA: gi_13272416, supported by
           cDNA: gi_20465358 [Arabidopsis thaliana]
           gi|25408220|pir||C84730 hypothetical protein At2g32210
           [imported] - Arabidopsis thaliana
           gi|4263698|gb|AAD15384.1| expressed protein [Arabidopsis
           thaliana] gi|13272417|gb|AAK17147.1|AF325079_1 unknown
           protein [Arabidopsis thaliana]
           gi|18389222|gb|AAL67054.1| unknown protein [Arabidopsis
           thaliana] gi|20465359|gb|AAM20083.1| unknown protein
           [Arabidopsis thaliana] gi|21618272|gb|AAM67322.1|
           unknown [Arabidopsis thaliana]
          Length = 71

 Score = 47.4 bits (111), Expect = 4e-05
 Identities = 22/59 (37%), Positives = 27/59 (45%), Gaps = 17/59 (28%)
 Frame = -3

Query: 334 AIPTPPM-----------------VPHPSVPRENPLARGDGFWKGCCAGLCCCCALDCC 209
           A PTPP+                   H +V      ++GDGF KGC A +CCCC LD C
Sbjct: 12  AYPTPPVSTGPYVAPPPLGYPTNDTSHATVATVETKSKGDGFLKGCLAAMCCCCVLDAC 70

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 434,251,286
Number of Sequences: 1393205
Number of extensions: 11374194
Number of successful extensions: 47500
Number of sequences better than 10.0: 173
Number of HSP's better than 10.0 without gapping: 36833
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 45929
length of database: 448,689,247
effective HSP length: 105
effective length of database: 302,402,722
effective search space used: 7257665328
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MR028e04_f BP078164 1 349
2 MR033a01_f BP078504 1 390




Lotus japonicus
Kazusa DNA Research Institute