KMC019900A_c01

Fasta Sequence
>KMC019900A_C01 KMC019900A_c01
ccccccctttttttttttACAAAAAGAAAACACACTTTTATTGCTTGAAACTGGAACATT
ACACAAAGACAAAGTTTGCTTGAGAACATCATGATGGGGATGACCCATTCTATTATGCCA
CAACTCATAAATGGAAATATTACTACTACTATTACAAGGTGCAGAACTTGAAACACTAGA
ATTCACTAATGAAGAAGGCAAATCAATAGCAGAAGATGCTTTGTTTCCTACAGGCTGAGA
AGCTAAGCCAGCCTGTGCCAGTGGCTGAGAATGCAAAGAAGACTGAAACAAAGAAGCTTT
AACAGGAGAGTGAGACACAGCTTGTGAGGGTGGTCCATCAAAGTAATAAAGACCATCATC
ACCAACACTTCCAGTGAGAAGAAGTTCAGAGGTGTCCTATGGACTGAATTGGTAATCCTT
GACCATTGCCCACCAGAAGTTGATCTTGTGTTGCTAGAGGCACATTATCCACAAAAATGC
CAGGATTGTTTGTCACATGATGTGTGGCCCCAGAGTCAGGACACCAATTCACTTGAGGTG
CAGCAGTGAATTGTGTAGAAGATGCAGGAGCAGTAG
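
The BLASTX alignments reported for this contig all lie on the reverse strand (Frame = -1 or -3). As a minimal sketch, assuming the standard genetic code and the usual reading of the query coordinates (1-based, with start greater than end on the minus strand), the translated query segments can be reproduced from the sequence above. The function names `reverse_complement`, `minus_frame`, and `translate_minus_strand` are illustrative and are not part of any BLAST tool.

```python
# Contig copied from the FASTA record above; lower-case (masked) bases are upper-cased.
CONTIG = (
    "ccccccctttttttttttACAAAAAGAAAACACACTTTTATTGCTTGAAACTGGAACATT"
    "ACACAAAGACAAAGTTTGCTTGAGAACATCATGATGGGGATGACCCATTCTATTATGCCA"
    "CAACTCATAAATGGAAATATTACTACTACTATTACAAGGTGCAGAACTTGAAACACTAGA"
    "ATTCACTAATGAAGAAGGCAAATCAATAGCAGAAGATGCTTTGTTTCCTACAGGCTGAGA"
    "AGCTAAGCCAGCCTGTGCCAGTGGCTGAGAATGCAAAGAAGACTGAAACAAAGAAGCTTT"
    "AACAGGAGAGTGAGACACAGCTTGTGAGGGTGGTCCATCAAAGTAATAAAGACCATCATC"
    "ACCAACACTTCCAGTGAGAAGAAGTTCAGAGGTGTCCTATGGACTGAATTGGTAATCCTT"
    "GACCATTGCCCACCAGAAGTTGATCTTGTGTTGCTAGAGGCACATTATCCACAAAAATGC"
    "CAGGATTGTTTGTCACATGATGTGTGGCCCCAGAGTCAGGACACCAATTCACTTGAGGTG"
    "CAGCAGTGAATTGTGTAGAAGATGCAGGAGCAGTAG"
).upper()

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def minus_frame(query_length, start):
    """Reverse-strand frame (-1, -2 or -3) for a codon whose first base is `start`.

    Assumed convention: frame -1 begins at the last base of the query.
    """
    return -((query_length - start) % 3 + 1)


def translate_minus_strand(dna, start, end):
    """Translate query positions start..end (1-based, start > end) on the minus strand."""
    if start <= end:
        raise ValueError("minus-strand coordinates must have start > end")
    segment = dna[end - 1:start]
    if len(segment) % 3 != 0:
        raise ValueError("segment length must be a multiple of 3")
    rc = reverse_complement(segment)
    # Codons containing anything other than A, C, G, T are reported as X.
    return "".join(CODON_TABLE.get(rc[i:i + 3], "X") for i in range(0, len(rc), 3))
```

For example, `translate_minus_strand(CONTIG, 529, 392)` yields the 46-residue query line of the first alignment to AAN04167.1, and `minus_frame(576, 529)` gives -3, in agreement with the frame printed for that alignment.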


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC019900A_C01 KMC019900A_c01
         (576 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAN04167.1| Putative copia-like retrotransposon polyprotein [...    34  1e-04
ref|NP_192350.1| putative polyprotein [Arabidopsis thaliana] gi|...    44  0.002
ref|NP_194619.1| putative protein; protein id: At4g28900.1 [Arab...    42  0.004
ref|NP_192807.1| retrotransposon like protein; protein id: At4g1...    42  0.007
pir||T02087 gag/pol polyprotein - maize retrotransposon Hopscotc...    42  0.007

>gb|AAN04167.1| Putative copia-like retrotransposon polyprotein [Oryza sativa
           (japonica cultivar-group)]
          Length = 1042

 Score = 33.9 bits (76), Expect(2) = 1e-04
 Identities = 16/46 (34%), Positives = 23/46 (49%)
 Frame = -3

Query: 529 NWCPDSGATHHVTNNPGIFVDNVPLATQDQLLVGNGQGLPIQSIGH 392
           NW  D+GAT H+T+              DQ+   +G G+ I+ IGH
Sbjct: 218 NWYVDTGATDHITSQLEKLNTREVYKGHDQIHTASGAGMKIKHIGH 263

 Score = 33.1 bits (74), Expect(2) = 1e-04
 Identities = 26/128 (20%), Positives = 52/128 (40%), Gaps = 23/128 (17%)
 Frame = -1

Query: 321 AVSHSPVKASLFQSSLHSQPLAQAGLASQPVGNKASSAIDLPSS--LVNSSVSSSAPCNS 148
           A+ H+P +     + LH    A+  +++  + +  S  +++ S   L+    + S     
Sbjct: 264 AIVHTPTRPLHLNNVLHVPQAAKNLISATKLASDNSVFVEIHSKYFLIKDRTTRSTVLKG 323

Query: 147 SSNISIYEL------------------WHNRMGHPHHDVLKQTLSLCNVPV---SSNKSV 31
                +Y L                  WH+R+GHP   ++ + +S   +P    S+ +SV
Sbjct: 324 PRRHGLYPLPSTSSTKQAFAVAPSLERWHSRLGHPSIPIVMKVISSNKLPCLRESNKESV 383

Query: 30  FSFCKKKK 7
              C+K K
Sbjct: 384 CDACQKAK 391

>ref|NP_192350.1| putative polyprotein [Arabidopsis thaliana] gi|25407268|pir||G85055
           probable polyprotein [imported] - Arabidopsis thaliana
           gi|4773895|gb|AAD29768.1|AF076243_15 putative
           polyprotein [Arabidopsis thaliana]
           gi|7267198|emb|CAB77909.1| putative polyprotein
           [Arabidopsis thaliana]
          Length = 1017

 Score = 43.5 bits (101), Expect = 0.002
 Identities = 21/54 (38%), Positives = 33/54 (60%)
 Frame = -3

Query: 550 FTAAPQVNWCPDSGATHHVTNNPGIFVDNVPLATQDQLLVGNGQGLPIQSIGHL 389
           FT+ P      DSGA+HH+ NNP + +DN+  A    +++ NG  +P++ IG L
Sbjct: 321 FTSEPSKTLVIDSGASHHMINNPSL-IDNIKPAL-GNVVIANGDKVPVKEIGEL 372

>ref|NP_194619.1| putative protein; protein id: At4g28900.1 [Arabidopsis thaliana]
           gi|7444467|pir||T08945 hypothetical protein F25O24.20 -
           Arabidopsis thaliana gi|4972079|emb|CAB43904.1| putative
           protein [Arabidopsis thaliana]
           gi|7269745|emb|CAB81478.1| putative protein [Arabidopsis
           thaliana]
          Length = 1415

 Score = 42.4 bits (98), Expect = 0.004
 Identities = 19/44 (43%), Positives = 25/44 (56%)
 Frame = -3

Query: 526 WCPDSGATHHVTNNPGIFVDNVPLATQDQLLVGNGQGLPIQSIG 395
           W  DSGAT H+TN+        P + +D ++VGN   LPI  IG
Sbjct: 293 WVTDSGATSHITNSTSQLQSAQPYSGEDSVIVGNSDFLPITHIG 336

 Score = 35.4 bits (80), Expect = 0.52
 Identities = 16/36 (44%), Positives = 23/36 (63%)
 Frame = -1

Query: 126 ELWHNRMGHPHHDVLKQTLSLCNVPVSSNKSVFSFC 19
           E+WH R+GHP+ DVL+Q L   N  +  +K+  S C
Sbjct: 425 EVWHMRLGHPNQDVLQQLLR--NKAIVISKTSHSLC 458

>ref|NP_192807.1| retrotransposon like protein; protein id: At4g10690.1 [Arabidopsis
           thaliana] gi|7444419|pir||T04204 hypothetical protein
           T4F9.150 - Arabidopsis thaliana
           gi|4539447|emb|CAB40035.1| retrotransposon like protein
           [Arabidopsis thaliana] gi|7267767|emb|CAB81170.1|
           retrotransposon like protein [Arabidopsis thaliana]
          Length = 1515

 Score = 41.6 bits (96), Expect = 0.007
 Identities = 19/44 (43%), Positives = 25/44 (56%)
 Frame = -3

Query: 526 WCPDSGATHHVTNNPGIFVDNVPLATQDQLLVGNGQGLPIQSIG 395
           W PDS AT H+TN      ++   +  D ++VGNG  LPI  IG
Sbjct: 324 WLPDSAATAHITNTTDGLQNSQTYSGDDSVIVGNGDFLPITHIG 367

 Score = 31.6 bits (70), Expect = 7.5
 Identities = 13/37 (35%), Positives = 23/37 (62%)
 Frame = -1

Query: 126 ELWHNRMGHPHHDVLKQTLSLCNVPVSSNKSVFSFCK 16
           E+WH R+GHP+ +VL+  +    + V  NK+  + C+
Sbjct: 456 EVWHQRLGHPNKEVLQHLIKTKAIVV--NKTSSNMCE 490

>pir||T02087 gag/pol polyprotein - maize retrotransposon Hopscotch
           gi|531389|gb|AAA57005.1| copia-like retrotransposon
           Hopscotch polyprotein
          Length = 1439

 Score = 41.6 bits (96), Expect = 0.007
 Identities = 19/45 (42%), Positives = 25/45 (55%)
 Frame = -1

Query: 153 NSSSNISIYELWHNRMGHPHHDVLKQTLSLCNVPVSSNKSVFSFC 19
           N SS     E WH R+GHP  D++ + +S  N+P  SN S  S C
Sbjct: 447 NFSSTRVPLEHWHKRLGHPSRDIVHRVISNNNLPCLSNNSTTSVC 491

 Score = 36.6 bits (83), Expect = 0.23
 Identities = 18/58 (31%), Positives = 28/58 (48%)
 Frame = -3

Query: 565 ASSTQFTAAPQVNWCPDSGATHHVTNNPGIFVDNVPLATQDQLLVGNGQGLPIQSIGH 392
           A+S        V W  D+GAT H+T +      +      DQ++  NG G+ I +IG+
Sbjct: 310 ANSAAHQNGSNVPWYTDTGATDHITGDLDRLTMHDKYTGTDQIIAANGTGMTISNIGN 367

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 507,856,996
Number of Sequences: 1393205
Number of extensions: 10942540
Number of successful extensions: 42841
Number of sequences better than 10.0: 162
Number of HSP's better than 10.0 without gapping: 37509
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 41131
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 21530810025
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
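
The figures in the statistics block above can be cross-checked against each other. The sketch below assumes the standard Karlin-Altschul conversion from raw score S to bit score, S' = (Lambda * S - ln K) / ln 2, using the gapped Lambda and K printed in this report. The function names are illustrative. The E-value estimate is the textbook form m * n * 2^(-S') and should only land in the same range as the printed values, since BLAST applies further adjustments that are not modelled here.

```python
import math

# Values copied from the report above (gapped Lambda and K, database summary).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 116
EFFECTIVE_SEARCH_SPACE = 21_530_810_025


def bit_score(raw_score):
    """Convert a raw alignment score to bits with the gapped Lambda and K."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def effective_db_length():
    """Database letters minus one effective HSP length per database sequence."""
    return DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH


def expect_estimate(raw_score, search_space=EFFECTIVE_SEARCH_SPACE):
    """Textbook E-value estimate; an approximation, not BLAST's exact figure."""
    return search_space * 2.0 ** (-bit_score(raw_score))
```

With these inputs, `bit_score(101)` rounds to 43.5, the score reported for NP_192350.1, and `effective_db_length()` returns 287,077,467, the effective length of database printed above. Dividing the effective search space by that length gives exactly 75, which would be the effective query length used in the search.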


EST assembly


No.  Clone         Accession  Start  End
1    MFBL024c01_f  BP042445       1  576
2    MFB056a02_f   BP038024      19  570
3    MFL017b03_f   BP033820      22  559




Lotus japonicus
Kazusa DNA Research Institute