KMC003004A_c02
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC003004A_C02 KMC003004A_c02
gtgtggcaaaagatcattcagactataccgttttagatagaaaatactAGGTACTAAACA
ACCATTACTCATTTGCAGAACAATCCCATTTCACGAGCAAGGGTGGAACTTGCTACCAAG
CAGACTAGAAACGAAATATAAACTAGATAATGGAAAGATTCCTTCATGGCATGAGAATAA
AAGTTGTATGGTTCAAAAGTCAAAATTTAACAGAAAATCTTGAGACAAAACAAAGGACAA
TCCAAAAAACTGAATATAACTACCAAGCACTCAAGTCTATTGAATACCCCATTAAACAAA
GATTCAAGCAGCAGTTGCTTGTTGGGCAGCAATCCTCTTCCTAGCTTCCTCAATGATTGA
TAGCTTAATACCCTTACCCTTGGGAAGAGATATCCATGGTTTGCTCCCCTTGCCAATGGT
GAACACATTGCCCAGACGGGTAGCAAACTCATGACCGGTTGCATCCTGAACATGGATTGT
GTCAAAGCTTCCCTTCTGCCTGTCCCTGCTCTTGATCACTCCAACTCTGCCTCTGTTCCT
TCCACCAGTGACCATGACAACATTCCCAACGTCAAACTTGATGAAATCAACAATCTTGTT
CTCCTGAAGATCCAGCTTGATGGTGTCATTTGCCCTGATGACTGGGTCCGGGTAGCGGAT
GGTGCGGCCATCATAGGTGTTCAGGTATGGGATACCCTTTTGTCCAAACTGCACAGACCT
CACCTTGCATAGCTTAAACTTAGCCTCATCATCCCTGACTGAGTGGAGACGGAATCGGCC
CTTAGTGTCATAAAGCAGGCGGAAGTTCTCATTGGTTTTTGGGATTGAGACAACATCCAT
GAAACCAGAGGGGTAAGTCTTGTCAGTCCTGACTTTGCCATCAACAAGGATATGACGCTG
CATCAGAATAGCGATAACCTCACGGTAAGTCAGAGCATACTTCAGCCTGTTTCGCAAGAT
CAGGATGAGTGGGAGACACTCCCTCGACTTATGTGGTCCAGATGAGGGTTTAGGAGCAAA
AGCACCACCCAGTTTGTCAAGCATCCAATGCTTCGGCGCATTGAGCCTCTTCAAATGCTT
CTTCAACCCTCTCGCCATTTTGGAGTTCAGAAACAGAGaaacgcttacggggggggcccg
g


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC003004A_C02 KMC003004A_c02
         (1141 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAM93434.1| 40S ribosomal S4 protein [Glycine max]                 517  e-145
ref|NP_565414.1| putative ribosomal protein S4; protein id: At2g...   496  e-139
ref|NP_568179.1| 40S ribosomal protein S4; protein id: At5g07090...   496  e-139
dbj|BAB11167.1| 40S ribosomal protein S4 [Arabidopsis thaliana]       494  e-138
ref|NP_200650.1| ribosomal protein S4 - like; protein id: At5g58...   493  e-138

>gb|AAM93434.1| 40S ribosomal S4 protein [Glycine max]
          Length = 264

 Score =  517 bits (1332), Expect = e-145
 Identities = 252/264 (95%), Positives = 263/264 (99%)
 Frame = -2

Query: 1098 MARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYRE 919
            MARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYRE
Sbjct: 1    MARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYRE 60

Query: 918  VIAILMQRHILVDGKVRTDKTYPSGFMDVVSIPKTNENFRLLYDTKGRFRLHSVRDDEAK 739
            VIAILMQRH+LVDGKVRTDKTYP+GFMDVVSIPKTNENFRLLYDTKGRFRLHSVRDDEAK
Sbjct: 61   VIAILMQRHVLVDGKVRTDKTYPAGFMDVVSIPKTNENFRLLYDTKGRFRLHSVRDDEAK 120

Query: 738  FKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPVIRANDTIKLDLQENKIVDFIKFDVGNV 559
            FKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDP+IRANDTIKLDL+ENKIVDFIKFDVGNV
Sbjct: 121  FKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIRANDTIKLDLEENKIVDFIKFDVGNV 180

Query: 558  VMVTGGRNRGRVGVIKSRDRQKGSFDTIHVQDATGHEFATRLGNVFTIGKGSKPWISLPK 379
            VMVTGGRNRGRVGVIK+R++ KGSF+TIHVQD+TGHEFATR+GNVFTIGKG+KPWISLPK
Sbjct: 181  VMVTGGRNRGRVGVIKNREKHKGSFETIHVQDSTGHEFATRMGNVFTIGKGTKPWISLPK 240

Query: 378  GKGIKLSIIEEARKRIAAQQATAA 307
            GKGIKLSIIEEARKRIAAQQATAA
Sbjct: 241  GKGIKLSIIEEARKRIAAQQATAA 264

>ref|NP_565414.1| putative ribosomal protein S4; protein id: At2g17360.1, supported by
            cDNA: 10042., supported by cDNA: gi_13877536, supported
            by cDNA: gi_14334915 [Arabidopsis thaliana]
            gi|13877537|gb|AAK43846.1|AF370469_1 Unknown protein
            [Arabidopsis thaliana] gi|14334916|gb|AAK59636.1|
            putative ribosomal protein S4 [Arabidopsis thaliana]
            gi|17104537|gb|AAL34157.1| putative ribosomal protein S4
            [Arabidopsis thaliana] gi|17978689|gb|AAL47338.1| unknown
            protein [Arabidopsis thaliana] gi|21536498|gb|AAM60830.1|
            putative ribosomal protein S4 [Arabidopsis thaliana]
          Length = 261

 Score =  496 bits (1276), Expect = e-139
 Identities = 234/261 (89%), Positives = 258/261 (98%)
 Frame = -2

Query: 1098 MARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYRE 919
            MARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPL+LI+RNRLKYALTYRE
Sbjct: 1    MARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLVLIIRNRLKYALTYRE 60

Query: 918  VIAILMQRHILVDGKVRTDKTYPSGFMDVVSIPKTNENFRLLYDTKGRFRLHSVRDDEAK 739
            VI+ILMQRHI VDGKVRTDKTYP+GFMDVVSIPKTNENFRLLYDTKGRFRLHS++D+EAK
Sbjct: 61   VISILMQRHIQVDGKVRTDKTYPAGFMDVVSIPKTNENFRLLYDTKGRFRLHSIKDEEAK 120

Query: 738  FKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPVIRANDTIKLDLQENKIVDFIKFDVGNV 559
            FKLCKVRS+QFGQKGIPYLNTYDGRTIRYPDP+I+ NDTIKLDL+ENKIV+FIKFDVGNV
Sbjct: 121  FKLCKVRSIQFGQKGIPYLNTYDGRTIRYPDPLIKPNDTIKLDLEENKIVEFIKFDVGNV 180

Query: 558  VMVTGGRNRGRVGVIKSRDRQKGSFDTIHVQDATGHEFATRLGNVFTIGKGSKPWISLPK 379
            VMVTGGRNRGRVGVIK+R++ KGSF+TIH+QD+TGHEFATRLGNV+TIGKG+KPW+SLPK
Sbjct: 181  VMVTGGRNRGRVGVIKNREKHKGSFETIHIQDSTGHEFATRLGNVYTIGKGTKPWVSLPK 240

Query: 378  GKGIKLSIIEEARKRIAAQQA 316
            GKGIKL+IIEEARKR++AQQA
Sbjct: 241  GKGIKLTIIEEARKRLSAQQA 261

>ref|NP_568179.1| 40S ribosomal protein S4; protein id: At5g07090.1, supported by cDNA:
            13813., supported by cDNA: gi_15292998, supported by
            cDNA: gi_17979542 [Arabidopsis thaliana]
            gi|20143904|sp|P49204|RS4_ARATH 40S ribosomal protein S4
            gi|11276521|pir||T48480 ribosomal protein S4 -
            Arabidopsis thaliana gi|7546687|emb|CAB87265.1| ribosomal
            protein S4 [Arabidopsis thaliana]
            gi|15292999|gb|AAK93610.1| putative 40S ribosomal protein
            S4 [Arabidopsis thaliana] gi|17979543|gb|AAL50106.1|
            AT5g07090/T28J14_30 [Arabidopsis thaliana]
            gi|19310835|gb|AAL85148.1| putative 40S ribosomal protein
            S4 [Arabidopsis thaliana] gi|20147251|gb|AAM10339.1|
            AT5g07090/T28J14_30 [Arabidopsis thaliana]
            gi|21537414|gb|AAM61755.1| 40S ribosomal protein S4
            [Arabidopsis thaliana]
          Length = 262

 Score =  496 bits (1276), Expect = e-139
 Identities = 234/261 (89%), Positives = 258/261 (98%)
 Frame = -2

Query: 1098 MARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYRE 919
            MARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPL+LI+RNRLKYALTYRE
Sbjct: 1    MARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLVLIIRNRLKYALTYRE 60

Query: 918  VIAILMQRHILVDGKVRTDKTYPSGFMDVVSIPKTNENFRLLYDTKGRFRLHSVRDDEAK 739
            VI+ILMQRHI VDGKVRTDKTYP+GFMDVVSIPKTNENFRLLYDTKGRFRLHS++D+EAK
Sbjct: 61   VISILMQRHIQVDGKVRTDKTYPAGFMDVVSIPKTNENFRLLYDTKGRFRLHSIKDEEAK 120

Query: 738  FKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPVIRANDTIKLDLQENKIVDFIKFDVGNV 559
            FKLCKVRS+QFGQKGIPYLNTYDGRTIRYPDP+I+ NDTIKLDL+ENKIV+FIKFDVGNV
Sbjct: 121  FKLCKVRSIQFGQKGIPYLNTYDGRTIRYPDPLIKPNDTIKLDLEENKIVEFIKFDVGNV 180

Query: 558  VMVTGGRNRGRVGVIKSRDRQKGSFDTIHVQDATGHEFATRLGNVFTIGKGSKPWISLPK 379
            VMVTGGRNRGRVGVIK+R++ KGSF+TIH+QD+TGHEFATRLGNV+TIGKG+KPW+SLPK
Sbjct: 181  VMVTGGRNRGRVGVIKNREKHKGSFETIHIQDSTGHEFATRLGNVYTIGKGTKPWVSLPK 240

Query: 378  GKGIKLSIIEEARKRIAAQQA 316
            GKGIKL+IIEEARKR+A+QQA
Sbjct: 241  GKGIKLTIIEEARKRLASQQA 261

>dbj|BAB11167.1| 40S ribosomal protein S4 [Arabidopsis thaliana]
          Length = 263

 Score =  494 bits (1271), Expect = e-138
 Identities = 233/260 (89%), Positives = 257/260 (98%)
 Frame = -2

Query: 1095 ARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREV 916
            ARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPL+LI+RNRLKYALTYREV
Sbjct: 3    ARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLVLIIRNRLKYALTYREV 62

Query: 915  IAILMQRHILVDGKVRTDKTYPSGFMDVVSIPKTNENFRLLYDTKGRFRLHSVRDDEAKF 736
            I+ILMQRHI VDGKVRTDKTYP+GFMDVVSIPKTNENFRLLYDTKGRFRLHS++D+EAKF
Sbjct: 63   ISILMQRHIQVDGKVRTDKTYPAGFMDVVSIPKTNENFRLLYDTKGRFRLHSIKDEEAKF 122

Query: 735  KLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPVIRANDTIKLDLQENKIVDFIKFDVGNVV 556
            KLCKVRS+QFGQKGIPYLNTYDGRTIRYPDP+I+ NDTIKLDL+ENKIV+FIKFDVGNVV
Sbjct: 123  KLCKVRSIQFGQKGIPYLNTYDGRTIRYPDPLIKPNDTIKLDLEENKIVEFIKFDVGNVV 182

Query: 555  MVTGGRNRGRVGVIKSRDRQKGSFDTIHVQDATGHEFATRLGNVFTIGKGSKPWISLPKG 376
            MVTGGRNRGRVGVIK+R++ KGSF+TIH+QD+TGHEFATRLGNV+TIGKG+KPW+SLPKG
Sbjct: 183  MVTGGRNRGRVGVIKNREKHKGSFETIHIQDSTGHEFATRLGNVYTIGKGTKPWVSLPKG 242

Query: 375  KGIKLSIIEEARKRIAAQQA 316
            KGIKL+IIEEARKR+A+QQA
Sbjct: 243  KGIKLTIIEEARKRLASQQA 262

>ref|NP_200650.1| ribosomal protein S4 - like; protein id: At5g58420.1, supported by
            cDNA: 22434., supported by cDNA: gi_16226258, supported
            by cDNA: gi_17979232 [Arabidopsis thaliana]
            gi|16226259|gb|AAL16117.1|AF428285_1 AT5g58420/mqj2_10
            [Arabidopsis thaliana] gi|21592333|gb|AAM64284.1|
            ribosomal protein S4-like [Arabidopsis thaliana]
          Length = 262

 Score =  493 bits (1270), Expect = e-138
 Identities = 233/261 (89%), Positives = 257/261 (98%)
 Frame = -2

Query: 1098 MARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYRE 919
            MARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPL+LI+RNRLKYALTYRE
Sbjct: 1    MARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLVLIIRNRLKYALTYRE 60

Query: 918  VIAILMQRHILVDGKVRTDKTYPSGFMDVVSIPKTNENFRLLYDTKGRFRLHSVRDDEAK 739
            VI+ILMQRHI VDGKVRTDKTYP+GFMDVVSIPKTNENFRLLYDTKGRFRLHS++D+EAK
Sbjct: 61   VISILMQRHIQVDGKVRTDKTYPAGFMDVVSIPKTNENFRLLYDTKGRFRLHSIKDEEAK 120

Query: 738  FKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPVIRANDTIKLDLQENKIVDFIKFDVGNV 559
            FKLCKVRS+QFGQKGIPYLNTYDGRTIRYPDP+I+ NDTIKLDL+ NKIV+FIKFDVGNV
Sbjct: 121  FKLCKVRSIQFGQKGIPYLNTYDGRTIRYPDPLIKPNDTIKLDLEANKIVEFIKFDVGNV 180

Query: 558  VMVTGGRNRGRVGVIKSRDRQKGSFDTIHVQDATGHEFATRLGNVFTIGKGSKPWISLPK 379
            VMVTGGRNRGRVGVIK+R++ KGSF+TIH+QD+TGHEFATRLGNV+TIGKG+KPW+SLPK
Sbjct: 181  VMVTGGRNRGRVGVIKNREKHKGSFETIHIQDSTGHEFATRLGNVYTIGKGTKPWVSLPK 240

Query: 378  GKGIKLSIIEEARKRIAAQQA 316
            GKGIKL+IIEEARKR+A+QQA
Sbjct: 241  GKGIKLTIIEEARKRLASQQA 261

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,008,497,030
Number of Sequences: 1393205
Number of extensions: 23252663
Number of successful extensions: 64782
Number of sequences better than 10.0: 153
Number of HSP's better than 10.0 without gapping: 61079
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 64644
length of database: 448,689,247
effective HSP length: 125
effective length of database: 274,538,622
effective search space used: 69732809988
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MPD067a04_f AV774430 1 400
2 MFB030b10_f BP036174 49 586
3 GNf011e03 BP068167 139 570
4 SPD076g10_f BP050107 140 632
5 MR016e03_f BP077190 140 541
6 MF032f11_f BP029989 142 262
7 SPD083f02_f BP050641 149 623
8 SPD087d01_f BP050940 165 718
9 SPD019b12_f BP045475 171 632
10 MPD040c09_f AV772726 186 250
11 MR057a02_f BP080345 202 590
12 SPD037b05_f BP046917 231 637
13 SPD066b10_f BP049249 559 1130
14 SPD032h02_f BP046578 651 1129
15 MFBL053b08_f BP043957 654 1133
16 MWM060f03_f AV765649 750 1130
17 MWM069a10_f AV765808 772 1156




Lotus japonicus
Kazusa DNA Research Institute