KMC002397A_c01

Fasta Sequence
>KMC002397A_C01 KMC002397A_c01
gggtacaatcacgtgacagcaacaacagggaacttctaagctaattttaaacccacaaat
ttcattttcaatttaaggtcGCAGCACCTAAAACTGGAGACAAAACTAAATGGATCAGAG
TCTAATAAAACACCAAATACATACACTAATCTATATTTTCTTCACTAAATTTGACAAAGG
CATTGGGGACAGGCAAAAGTGTGGAGAACGCATATAGGCTCTAGAAATCATTCGATCCAG
ATAAGAATGCCAACCCTATCAAAGTGGAAGGAAGTAGAAGGTTTCTCCTTTGCTGGTCAT
GGTTGTCCTCTCCATTTGATATCATTAGAGCTCTGAACAATGATTTGACTTTGATCTCGG
GAAGAGCAGGTCGAGGACTCTCAAAATCCATGTTCTGCAGATTCTCCGGCGAAGGTCTTG
CCGGAGAAGACGGTGGTGCAAGGCGGCTGGTTGCGGTTTCCGTCGAAGGTTGTCGGAATG
ACAAATTGATTGAGGGGTTTTGTTCCGGAAGGTGAGGGGAGTCCAACGGTGGTGTTAGTT
TCTCCAGTAGATGAGCGGTTCGCCGGAGATCGAAGGTTGAAGGTGGCGGTGCTAGGGTTT
CTCGGTGGTGGCTATGGTGGCGTCGTTCACGCGAGGATGAGGGCGTGGAGATCGTGCTCG
CGTCTGGGATCGTCAGAGAAGAGGAGAAAGTttgtggtgctagcggtttggcacgtggtg
ctggcggtttacccatggtggtgctgtggcgtgccggccactgaaggagag


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC002397A_C01 KMC002397A_c01
         (771 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAA78424.1| polyprotein [Arabidopsis thaliana]                     48  1e-04
ref|NP_192205.1| putative polyprotein of LTR transposon; protein...    48  1e-04
dbj|BAA78426.1| polyprotein [Arabidopsis thaliana]                     47  2e-04
gb|AAN41263.1| collagen XXVII proalpha 1 chain precursor; prepro...    47  4e-04
dbj|BAA78427.1| polyprotein [Arabidopsis thaliana]                     46  5e-04

>dbj|BAA78424.1| polyprotein [Arabidopsis thaliana]
          Length = 1330

 Score = 48.1 bits (113), Expect = 1e-04
 Identities = 35/131 (26%), Positives = 58/131 (43%), Gaps = 13/131 (9%)
 Frame = -3

Query: 748  HSTTMGKPPAPRAKPLAPQTFSSSLTIPDASTISTPSSSRERRHHSHHRETLAPPPSTFD 569
            H  T  +PP+  +     Q  SS+L    +S+IS+PSSS       +  +  A P  T +
Sbjct: 647  HLDTSPRPPSSPSPLCTTQVSSSNLP---SSSISSPSSSEPTAPSHNGPQPTAQPHQTQN 703

Query: 568  LRRTAHLLEKLTPPLDSPHLPEQN-------------PSINLSFRQPSTETATSRLAPPS 428
                + +L    P   SP+ P QN             P+ + S  +P++ +++S   PP 
Sbjct: 704  SNSNSPILNNPNPNSPSPNSPNQNSPLPQSPISSPHIPTPSTSISEPNSPSSSSTSTPPL 763

Query: 427  SPARPSPENLQ 395
             P  P+P  +Q
Sbjct: 764  PPVLPAPPIIQ 774

>ref|NP_192205.1| putative polyprotein of LTR transposon; protein id: At4g02960.1
            [Arabidopsis thaliana] gi|7444420|pir||T01397 LTR gag/pol
            polyprotein homolog T4I9.16 - Arabidopsis thaliana
            gi|3924609|gb|AAC79110.1| putative polyprotein of LTR
            transposon [Arabidopsis thaliana]
            gi|7269781|emb|CAB77781.1| putative polyprotein of LTR
            transposon [Arabidopsis thaliana]
          Length = 1456

 Score = 48.1 bits (113), Expect = 1e-04
 Identities = 35/131 (26%), Positives = 58/131 (43%), Gaps = 13/131 (9%)
 Frame = -3

Query: 748  HSTTMGKPPAPRAKPLAPQTFSSSLTIPDASTISTPSSSRERRHHSHHRETLAPPPSTFD 569
            H  T  +PP+  +     Q  SS+L    +S+IS+PSSS       +  +  A P  T +
Sbjct: 773  HLDTSPRPPSSPSPLCTTQVSSSNLP---SSSISSPSSSEPTAPSHNGPQPTAQPHQTQN 829

Query: 568  LRRTAHLLEKLTPPLDSPHLPEQN-------------PSINLSFRQPSTETATSRLAPPS 428
                + +L    P   SP+ P QN             P+ + S  +P++ +++S   PP 
Sbjct: 830  SNSNSPILNNPNPNSPSPNSPNQNSPLPQSPISSPHIPTPSTSISEPNSPSSSSTSTPPL 889

Query: 427  SPARPSPENLQ 395
             P  P+P  +Q
Sbjct: 890  PPVLPAPPIIQ 900

>dbj|BAA78426.1| polyprotein [Arabidopsis thaliana]
          Length = 1475

 Score = 47.4 bits (111), Expect = 2e-04
 Identities = 35/131 (26%), Positives = 58/131 (43%), Gaps = 13/131 (9%)
 Frame = -3

Query: 748  HSTTMGKPPAPRAKPLAPQTFSSSLTIPDASTISTPSSSRERRHHSHHRETLAPPPSTFD 569
            H  T  +PP+  +     Q  SS+L    +S+IS+PSSS       +  +  A P  T +
Sbjct: 792  HLDTSPRPPSLPSPLCTTQVSSSNLP---SSSISSPSSSEPTAPSHNGPQPTAQPHQTQN 848

Query: 568  LRRTAHLLEKLTPPLDSPHLPEQN-------------PSINLSFRQPSTETATSRLAPPS 428
                + +L    P   SP+ P QN             P+ + S  +P++ +++S   PP 
Sbjct: 849  SNSNSPILNNPNPNSPSPNSPNQNSPLPQSPISSPHIPTPSTSISEPNSPSSSSTSTPPL 908

Query: 427  SPARPSPENLQ 395
             P  P+P  +Q
Sbjct: 909  PPVLPAPPIIQ 919

>gb|AAN41263.1| collagen XXVII proalpha 1 chain precursor; preproprotein [Homo
           sapiens]
          Length = 1860

 Score = 46.6 bits (109), Expect = 4e-04
 Identities = 33/132 (25%), Positives = 49/132 (37%), Gaps = 15/132 (11%)
 Frame = -3

Query: 757 PARHSTTMGKPPAPRAKPLAPQTFSSSLTIPDASTISTPSSSRERRHHSHHRETLAPPPS 578
           P + +  M +PP P  +PL P T SS   IP  +      +S   +  S    T  PPP 
Sbjct: 425 PIQRNPGMPRPPPPSTRPLPPTTSSSKKPIPTLARTEAKITSHASKPASARTSTHKPPPF 484

Query: 577 TF---------DLRRTAHLLEKLTPPLDSPHLPEQNPSI------NLSFRQPSTETATSR 443
           T             R+      + PP      P   P++          ++P    A+ +
Sbjct: 485 TALSSSPAPTPGSTRSTRPPATMVPPTSGTSTPRTAPAVPTPGSAPTGSKKPIGSEASKK 544

Query: 442 LAPPSSPARPSP 407
             P SSP +P P
Sbjct: 545 AGPKSSPRKPVP 556

>dbj|BAA78427.1| polyprotein [Arabidopsis thaliana]
          Length = 1421

 Score = 46.2 bits (108), Expect = 5e-04
 Identities = 34/131 (25%), Positives = 57/131 (42%), Gaps = 13/131 (9%)
 Frame = -3

Query: 748  HSTTMGKPPAPRAKPLAPQTFSSSLTIPDASTISTPSSSRERRHHSHHRETLAPPPSTFD 569
            H  T  +PP+  +     Q  SS+L    +S+IS+PSSS       +  +    P  T +
Sbjct: 792  HLDTSPRPPSSPSPLCTTQVSSSNLP---SSSISSPSSSEPTAPSYNGPQPTTQPHQTQN 848

Query: 568  LRRTAHLLEKLTPPLDSPHLPEQN-------------PSINLSFRQPSTETATSRLAPPS 428
                + +L    P   SP+ P QN             P+ + S  +P++ +++S   PP 
Sbjct: 849  SNSNSPILNNPNPNSPSPNSPNQNSPLPQSPISSPHIPTPSTSISEPNSPSSSSTSTPPL 908

Query: 427  SPARPSPENLQ 395
             P  P+P  +Q
Sbjct: 909  PPVLPAPPIIQ 919

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 725,372,335
Number of Sequences: 1393205
Number of extensions: 18150458
Number of successful extensions: 81968
Number of sequences better than 10.0: 513
Number of HSP's better than 10.0 without gapping: 67827
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 79410
length of database: 448,689,247
effective HSP length: 121
effective length of database: 280,111,442
effective search space used: 37815044670
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


No.  Clone        Accession  Position (start, end)
1    GENf062g05   BP061007     1  262
2    GNf092b12    BP074150   100  503
3    SPD075e12_f  BP050007   204  775




Lotus japonicus
Kazusa DNA Research Institute