KMC000509A_c01

Fasta Sequence
>KMC000509A_C01 KMC000509A_c01
acaaatgTATATCATATCATACAATCTTGCATGCTTAGCTTTCTTACATAAGATCAGACA
AATATGAGGATTGATGAGGAATTAAAGATAAAGAGATTCTAGGAAAAATATATATCTGGA
ATATAATTGAATGGGATAAAATATGTTGATAAGAATACAGGGGAAGATGTAGCCTTTTTA
TACTACTAAGCTAACAATATGTTAGTAGTACAGATGTAGTACAAGCACAGTACGTACATC
CTTTGCTATCTCACTAAGTCAAATCATAACGGTAAAATGCATGTATCCCCAGCATTATTT
CCTGATTTGTCTCCTAATTCATCTCCTGATTTGCCTCCTAATTTATCTCCTAATACCCCC
CCACAAGCTGAGGAGTATATATTGAGAACTCCCAGCTTGGAAACTAAGTGTGAAAAAGGT
CCAGATTCAAGGGGCTTGGTCAAAATGTCGGCTGTTTGATCAACACTTGAAATAGGAAGT
AGATGAAAGAGCTTGGCTTGCAACTTTTCCCGAACAACATGGCAATCAATGTCGAGATGT
TTAGTACGTTCATGAAACACAGCATTAGTTGCAATATGTCTAGCAGACTGACTATCGCAG
TAAAGCAATGAAGGAGAGATGAAAGGAACACGAAGATCTTGAAGGATGTAGGTAAGCCAC
TAAAGTTCACA


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000509A_C01 KMC000509A_c01
         (671 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_177159.1| hypothetical protein; protein id: At1g70010.1 [...   110  2e-23
ref|NP_174020.1| polyprotein, putative; protein id: At1g26990.1 ...   109  3e-23
gb|AAF79879.1|AC000348_32 T7N9.5 [Arabidopsis thaliana]               109  3e-23
pir||T01956 hypothetical protein T2L5.9 - Arabidopsis thaliana g...   106  3e-22
pir||T01879 hypothetical protein F8M12.17 - Arabidopsis thaliana...   106  3e-22

>ref|NP_177159.1| hypothetical protein; protein id: At1g70010.1 [Arabidopsis thaliana]
            gi|25301690|pir||G96722 hypothetical protein F20P5.25
            [imported] - Arabidopsis thaliana
            gi|2194136|gb|AAB61111.1| Strong similarity to Zea mays
            retrotransposon Hopscotch polyprotein (gb|U12626).
            [Arabidopsis thaliana]
          Length = 1315

 Score =  110 bits (275), Expect = 2e-23
 Identities = 54/101 (53%), Positives = 72/101 (70%)
 Frame = -1

Query: 668  EL*WLTYILQDLRVPFISPSLLYCDSQSARHIATNAVFHERTKHLDIDCHVVREKLQAKL 489
            EL WLT  L++L+VP   P+LL+CD+++A HIA N VFHERTKH++ DCH VRE+L   L
Sbjct: 1215 ELVWLTNFLKELQVPLSKPTLLFCDNEAAIHIANNHVFHERTKHIESDCHSVRERLLKGL 1274

Query: 488  FHLLPISSVDQTADILTKPLESGPFSHLVSKLGVLNIYSSA 366
            F L  I++  Q AD  TKPL    F  L+SK+G+LNI+ S+
Sbjct: 1275 FELYHINTELQIADPFTKPLYPSHFHRLISKMGLLNIFVSS 1315

>ref|NP_174020.1| polyprotein, putative; protein id: At1g26990.1 [Arabidopsis thaliana]
          Length = 1425

 Score =  109 bits (273), Expect = 3e-23
 Identities = 52/100 (52%), Positives = 70/100 (70%)
 Frame = -1

Query: 668  EL*WLTYILQDLRVPFISPSLLYCDSQSARHIATNAVFHERTKHLDIDCHVVREKLQAKL 489
            EL WL YIL   ++PF  P+ LYCD+++A HIA N+VFHERTKH++ DCH VRE ++A +
Sbjct: 1324 ELIWLGYILTAFKIPFTHPAYLYCDNEAALHIANNSVFHERTKHIENDCHKVRECIEAGI 1383

Query: 488  FHLLPISSVDQTADILTKPLESGPFSHLVSKLGVLNIYSS 369
               + + + +Q AD LTKPL   PF    SKLG+LNIY +
Sbjct: 1384 LKTIFVRTDNQLADTLTKPLYPKPFRENNSKLGLLNIYEA 1423

>gb|AAF79879.1|AC000348_32 T7N9.5 [Arabidopsis thaliana]
          Length = 1436

 Score =  109 bits (273), Expect = 3e-23
 Identities = 52/100 (52%), Positives = 70/100 (70%)
 Frame = -1

Query: 668  EL*WLTYILQDLRVPFISPSLLYCDSQSARHIATNAVFHERTKHLDIDCHVVREKLQAKL 489
            EL WL YIL   ++PF  P+ LYCD+++A HIA N+VFHERTKH++ DCH VRE ++A +
Sbjct: 1335 ELIWLGYILTAFKIPFTHPAYLYCDNEAALHIANNSVFHERTKHIENDCHKVRECIEAGI 1394

Query: 488  FHLLPISSVDQTADILTKPLESGPFSHLVSKLGVLNIYSS 369
               + + + +Q AD LTKPL   PF    SKLG+LNIY +
Sbjct: 1395 LKTIFVRTDNQLADTLTKPLYPKPFRENNSKLGLLNIYEA 1434

>pir||T01956 hypothetical protein T2L5.9 - Arabidopsis thaliana
            gi|3695393|gb|AAC62795.1| contains similarity to
            retroviral aspartyl proteases (Pfam: rvp.hmm, score:
            11.80) [Arabidopsis thaliana]
          Length = 1244

 Score =  106 bits (264), Expect = 3e-22
 Identities = 51/102 (50%), Positives = 74/102 (72%)
 Frame = -1

Query: 671  CEL*WLTYILQDLRVPFISPSLLYCDSQSARHIATNAVFHERTKHLDIDCHVVREKLQAK 492
            CE+ WL  +L DL++   S  +++ DS +A +IATN VFHERTKH++IDCH+VRE+L   
Sbjct: 1143 CEMVWLASLLLDLKIITGSVPIVFSDSTAAIYIATNPVFHERTKHIEIDCHLVRERLDKG 1202

Query: 491  LFHLLPISSVDQTADILTKPLESGPFSHLVSKLGVLNIYSSA 366
            L  +L + + DQ ADILTKPL    FS+L+SK+ + NI++S+
Sbjct: 1203 LIRMLHVRTEDQVADILTKPLFPHQFSYLMSKMSLHNIFASS 1244

>pir||T01879 hypothetical protein F8M12.17 - Arabidopsis thaliana
            gi|3513747|gb|AAC33963.1| contains similarity to reverse
            transcriptases (Pfam; rvt.hmm, score: 11.19) [Arabidopsis
            thaliana]
          Length = 1633

 Score =  106 bits (264), Expect = 3e-22
 Identities = 47/93 (50%), Positives = 67/93 (71%)
 Frame = -1

Query: 671  CEL*WLTYILQDLRVPFISPSLLYCDSQSARHIATNAVFHERTKHLDIDCHVVREKLQAK 492
            CE+ WL  +L+DL V    P+ L+CD++SA H+ATN VFHERTKH++IDCH VR++++A 
Sbjct: 1336 CEIIWLQQLLKDLHVTMTCPAKLFCDNKSALHLATNPVFHERTKHIEIDCHTVRDQIKAG 1395

Query: 491  LFHLLPISSVDQTADILTKPLESGPFSHLVSKL 393
                L + + +Q ADILTKPL  GPF  L+ ++
Sbjct: 1396 KLKTLHVPTGNQLADILTKPLHPGPFHSLLKRI 1428
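
Every alignment above is reported in Frame = -1, meaning the query protein is read from the reverse complement of the contig, starting at its last nucleotide. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1). The names `reverse_complement` and `translate_frame_minus1` are illustrative and are not part of BLAST.

```python
# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def translate_frame_minus1(seq: str) -> str:
    """Translate the reverse strand starting at the last base (frame -1).

    Codons containing characters other than A/C/G/T are written as 'X';
    stop codons are written as '*', as in the BLASTX alignment lines.
    """
    rc = reverse_complement(seq).upper()
    return "".join(
        CODON_TABLE.get(rc[i:i + 3], "X") for i in range(0, len(rc) - 2, 3)
    )


if __name__ == "__main__":
    # Last 21 nucleotides of the contig (positions 651-671).
    tail = "GGTAAGCCACTAAAGTTCACA"
    print(translate_frame_minus1(tail))
```

Applied to the last 21 nucleotides of the contig, this yields `CEL*WLT`, which is how the query lines that start at position 671 begin.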

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 529,377,649
Number of Sequences: 1393205
Number of extensions: 11357165
Number of successful extensions: 42910
Number of sequences better than 10.0: 454
Number of HSP's better than 10.0 without gapping: 35593
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 42125
length of database: 448,689,247
effective HSP length: 119
effective length of database: 282,897,852
effective search space used: 29421376608
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
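
The figures in this summary can be cross-checked against the hit list. The sketch below uses the usual Karlin-Altschul relations, bit score S' = (lambda * S - ln K) / ln 2 and E = (effective search space) * 2^(-S'), with the gapped lambda and K printed above. It assumes that the query length is counted in whole codons (671 // 3) before the effective HSP length is subtracted, which reproduces the reported search space. Function names are illustrative, and because lambda and K are rounded in the report, the results agree with the printed values only approximately.

```python
import math

# Values copied from the BLASTX report above.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
QUERY_NT = 671                # query length in nucleotides
DB_LETTERS = 448_689_247      # total letters in nr
DB_SEQUENCES = 1_393_205      # total sequences in nr
EFFECTIVE_HSP_LENGTH = 119


def effective_db_length() -> int:
    """Database length after subtracting one HSP length per sequence."""
    return DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH


def effective_search_space() -> int:
    """Effective query length (in residues) times effective database length."""
    effective_query = QUERY_NT // 3 - EFFECTIVE_HSP_LENGTH  # assumption: whole codons
    return effective_query * effective_db_length()


def bit_score(raw_score: int) -> float:
    """Convert a raw alignment score to bits."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect(raw_score: int) -> float:
    """Expected number of chance hits scoring at least raw_score."""
    return effective_search_space() * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    print("effective database length:", effective_db_length())
    print("effective search space:   ", effective_search_space())
    for raw in (275, 273, 264):  # raw scores shown in parentheses in the report
        print(raw, f"{bit_score(raw):.1f} bits", f"E = {expect(raw):.1e}")
```

For the raw scores 275, 273 and 264 this gives roughly 110.5, 109.8 and 106.3 bits, in line with the 110, 109 and 106 bits reported, and E-values of the same order as the listed 2e-23, 3e-23 and 3e-22.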


EST assembly


No.  Clone         Accession  Start  End
1    GENLf062h02   BP065694       1  354
2    MRL030d08_f   BP085213       8  116
3    MPD091d12_f   AV775978      99  452
4    GENLf020e08   BP063415      99  589
5    GENLf043f03   BP064630      99  589
6    MWM041g08_f   AV765329     102  468
7    MRL011b11_f   BP084247     114  491
8    SPDL017h11_f  BP053098     168  674




Lotus japonicus
Kazusa DNA Research Institute