KMC000365A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC000365A_C01 KMC000365A_c01
gatcaTTTCTCACGATAGCACAAATCAGATTATCAGAAGACGACGAGTGCAAAAATAGGT
CTAAGTACTCGTCTAGGTTAAAATTTCGATTACCAACCCAATACACTCGATCATGAACAG
AAAATCTTGGTGGGCATGTTCCTCAAAAGGACCGAGGTCTCCACGAGTATTGGAAGATAA
TAAATTCAATAACTGAGTATCAAAAAGAATTCTAAAACTCAAACAGGGGGAGGAGAGTAC
AGAGAGAACTAGACCAAAAGTCTGAAGAACCAGCAACATATATCATGCAATGATCCCCCA
AAGAGCCGCAGTAACAAACCAGGGCAAAGAATGATCAACGATCACGATTCCTGGAAGATG
ACCATGTGGTGGATGACTTTGGAGCACCAGAACCAAAGGAGGTTCTGCGATCCTCACTGC
GCCACCTGTCACCATCCGGCCGGCTGCTACTGTTATTCCAACGATCTGTTTCCGGAGGAG
GAGGAGCAGCAGCGGCAGCGGCTGCAGGTTCACCTCCTTGGCGTCGGTGTCTGGGAACAT
ATCTCCCTGGGTTTGCAACGGCAGCTGCTGCAGCAGGTGCAGATTCCAACGGTCGAGCAG
GAGGCTCAGCGGGCTTAAGAACTGGTTCGGTTGGTCTTGCCAACAATGCTTCCTTCCTCT
Gnctctctttttcttcaagctccc


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000365A_C01 KMC000365A_c01
         (684 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

sp|Q40554|IF3A_TOBAC Eukaryotic translation initiation factor 3 ...    88  1e-16
dbj|BAB07943.1| putative eukaryotic translation initiation facto...    65  1e-09
ref|NP_192881.1| putative protein; protein id: At4g11420.1, supp...    64  1e-09
sp|Q9XHR2|IF3A_MAIZE Eukaryotic translation initiation factor 3 ...    64  3e-09
pir||S22697 extensin - Volvox carteri (fragment) gi|21992|emb|CA...    42  0.006

>sp|Q40554|IF3A_TOBAC Eukaryotic translation initiation factor 3 subunit 10 (eIF-3 theta)
            (Eukaryotic translation initiation factor 3 large
            subunit) (eIF3a) (PNLA-35) gi|629692|pir||S47179
            hypothetical protein - common tobacco
            gi|506471|emb|CAA56189.1| unnamed protein product
            [Nicotiana tabacum]
          Length = 958

 Score = 88.2 bits (217), Expect = 1e-16
 Identities = 53/120 (44%), Positives = 65/120 (54%), Gaps = 9/120 (7%)
 Frame = -3

Query: 682  ELEEKEXQRKEALLARPTEPVLKPAEPPA--RPLE-------SAPAAAAAVANPGRYVPR 530
            ELEEKE + +E +L + T  + KPAEPP   RP E        A AA A    PG+YVP+
Sbjct: 845  ELEEKEKREREEILRKSTAVLPKPAEPPTLGRPAELGGAAPIPAAAATAPTPGPGKYVPK 904

Query: 529  HRRQGGEPAAAAAAAPPPPETDRWNNSSSRPDGDRWRSEDRRTSFGSGAPKSSTTWSSSR 350
            H R   + A  A    PPPETD+W   S   D   WR E +  SFGSG   S T+W +SR
Sbjct: 905  HLRTKMDGAGQA----PPPETDKWGGGSKPDDRPSWRDERKPPSFGSG---SRTSWPASR 957

>dbj|BAB07943.1| putative eukaryotic translation initiation factor 3 large subunit
            [Oryza sativa (japonica cultivar-group)]
          Length = 984

 Score = 64.7 bits (156), Expect = 1e-09
 Identities = 55/142 (38%), Positives = 67/142 (46%), Gaps = 30/142 (21%)
 Frame = -3

Query: 682  ELEEK-EXQRKEALLAR---PTEPVLKP-AEPPARPLE--SAPAAAAAVANP--GRYVPR 530
            ELEEK E QR EAL+ R     EP   P A P A+P +  +APAAAAA A P  G+YVP+
Sbjct: 844  ELEEKKEKQRMEALMGRGAGAAEPARTPDAAPVAQPAQPVAAPAAAAAAAAPAAGKYVPK 903

Query: 529  HRRQGGEPAAAAAAAPP--PPETDRWNNSSSRPDGDRWRSEDRRTSFGSGAP-------- 380
             +R GG+  ++A    P   PE DRW +   RP  D              AP        
Sbjct: 904  FKR-GGDGGSSAGGQRPAVAPEQDRWGSRDDRPRPDMRPLRQEAPPARDAAPPARQDGPP 962

Query: 379  -----------KSSTTWSSSRN 347
                        SS+TWSS RN
Sbjct: 963  GTWRPSRYSSSSSSSTWSSRRN 984

>ref|NP_192881.1| putative protein; protein id: At4g11420.1, supported by cDNA:
            gi_12407748 [Arabidopsis thaliana]
            gi|23396624|sp|Q9LD55|IF3A_ARATH Eukaryotic translation
            initiation factor 3 subunit 10 (eIF-3 theta) (Eukaryotic
            translation initiation factor 3 large subunit) (eIF3a)
            (p114) gi|7486118|pir||T10562 hypothetical protein
            F25E4.40 - Arabidopsis thaliana
            gi|7267841|emb|CAB81243.1| putative protein [Arabidopsis
            thaliana] gi|7321039|emb|CAB82147.1| putative protein
            [Arabidopsis thaliana]
            gi|12407749|gb|AAG53635.1|AF291711_1 initiation factor 3a
            [Arabidopsis thaliana]
          Length = 987

 Score = 64.3 bits (155), Expect = 1e-09
 Identities = 44/116 (37%), Positives = 59/116 (49%), Gaps = 3/116 (2%)
 Frame = -3

Query: 682  ELEEKEXQRKEALLARPTEPVLKPAEPPARPL-ESAPAAAAAVAN--PGRYVPRHRRQGG 512
            ELEEK  + +E LL     P  + AEP   P+  +APAAAAA A      YVP+ +RQ  
Sbjct: 846  ELEEKSRREREELLRGTNAPPARLAEPTVTPVGTTAPAAAAAAAGAPAAPYVPKWKRQTT 905

Query: 511  EPAAAAAAAPPPPETDRWNNSSSRPDGDRWRSEDRRTSFGSGAPKSSTTWSSSRNR 344
            E   +  +AP   ETDR +N    P  D W S         GA +++  W+S+R R
Sbjct: 906  E--VSGPSAPTSSETDRRSNRGPPPGDDHWGS-------NRGAAQNTDRWTSNRER 952

>sp|Q9XHR2|IF3A_MAIZE Eukaryotic translation initiation factor 3 subunit 10 (eIF-3 theta)
            (Eukaryotic translation initiation factor 3 large
            subunit) (eIF3a) gi|5106764|gb|AAD39834.1|AF073329_1
            eukaryotic translation initiation factor 3 large subunit
            [Zea mays]
          Length = 962

 Score = 63.5 bits (153), Expect = 3e-09
 Identities = 47/132 (35%), Positives = 66/132 (49%), Gaps = 20/132 (15%)
 Frame = -3

Query: 682  ELEEKEXQRKEALLARPTEPVLKP-----AEPPARPLESAPAAAAAVANPGRYVPRHRRQ 518
            ELEEK    +E LL + +E V  P     A+PP     +A AAAAA   P +Y+P+ +R 
Sbjct: 840  ELEEKAKATREKLL-KGSEAVRAPDSAPVAQPPRESAAAAAAAAAAAPAPSKYIPKFKR- 897

Query: 517  GGEPAAAAAAAPPPPETDRWNN---------------SSSRPDGDRWRSEDRRTSFGSGA 383
            GG+ ++  + +    + DRW +                SSR D DRWR     + F S +
Sbjct: 898  GGDSSSIPSGS---RDEDRWGSRGPLRQDGPPARLDAPSSRQDTDRWRG----SRFPSNS 950

Query: 382  PKSSTTWSSSRN 347
              SS+TWS SRN
Sbjct: 951  TSSSSTWSRSRN 962

>pir||S22697 extensin - Volvox carteri (fragment) gi|21992|emb|CAA46283.1|
           extensin [Volvox carteri]
          Length = 464

 Score = 42.4 bits (98), Expect = 0.006
 Identities = 23/56 (41%), Positives = 24/56 (42%)
 Frame = -3

Query: 631 TEPVLKPAEPPARPLESAPAAAAAVANPGRYVPRHRRQGGEPAAAAAAAPPPPETD 464
           + P   P  PP  P  S P  A A ANP    P   R GG P       PPPPE D
Sbjct: 381 SSPPPPPRPPPPSPPPSPPPPATAAANPPSPAPSRSRAGG-PPLGTRPPPPPPEDD 435

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 650,635,508
Number of Sequences: 1393205
Number of extensions: 16756689
Number of successful extensions: 101623
Number of sequences better than 10.0: 749
Number of HSP's better than 10.0 without gapping: 74404
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 95794
length of database: 448,689,247
effective HSP length: 119
effective length of database: 282,897,852
effective search space used: 30552968016
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 GNLf011d08 BP075459 1 444
2 MFL016f09_f BP033817 6 377
3 GENLf013h11 BP063057 50 538
4 MFB011d01_f BP034707 52 595
5 MWL063a09_f AV769678 59 308
6 GENLf046h11 BP064806 63 594
7 MRL026g08_f BP085070 68 515
8 GENLf077e05 BP066535 70 556
9 MRL020h07_f BP084763 76 450
10 GENLf043h05 BP064644 77 594
11 GENLf085a02 BP066953 78 626
12 GNLf011c03 BP075446 111 584
13 MRL009d12_f BP084160 116 489
14 MRL002a02_f BP083767 116 486
15 GENLf021a03 BP063435 143 622
16 GENLf061f08 BP065626 171 690




Lotus japonicus
Kazusa DNA Research Institute