KMC020463A_c01

Fasta Sequence
>KMC020463A_C01 KMC020463A_c01
atggcatctgtggctcagcaattgagtgctgttcggtggtctcCAATGCTATGGAGGCGG
AGCCAGAAGCAGAGGCGCGGTGCTGGTACCATCGTCTGTTCGGTTGCAATCTCGAACGCA
CAGAACAAAGAGAGAGCCAAGCTCAAACAGCTCTTTGAAGATGCTTACGAGAGGTGCCGC
ACTGCTCCTACAGATGGTGTTTCCTTCACCCTCGAGCAGTTCACTACCGCTCTCGAGAAG
TATGACTTCGATGCTGAGATTGGGACCAAGGTTAAGGGCACTGTGTTTGGTACTGATGCC
AGTGGAGCTTATGTTGATATTACTGCAAAGTCTACGGCATACTTGCCCCTCCAAGAGGCA
TGCATCCACAAAATTAAGCATGTTGAAGAAGCAGGTTTAGTTCCAGGCGTGAGAGACGAA
TTTGTAATCATTGGTGAAAATGAATCTGATGATACCTTGTTCTTGAGTTTAAAGTCTATT
CAGTTTGGGCTTGCATGGGAACGCTGTAGACAGCTTCAGGCTGAAGATGCTGTTGTCAAG
GGTAAAATTGTTAA
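
The BLASTX hits in the next section are all reported in Frame = +1. As a quick check on that frame, the first line of the contig can be translated with the standard genetic code. This is a minimal sketch and not part of the original record; the helper name `translate` and the way the codon table is built are illustrative assumptions.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna, frame=1):
    """Translate a DNA string in forward frame 1, 2 or 3 (hypothetical helper).

    Codons containing characters other than A, C, G, T are rendered as 'X';
    a trailing partial codon is ignored.
    """
    seq = dna.upper()[frame - 1:]
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X")
        for i in range(0, len(seq) - 2, 3)
    )

# First 60 letters of the contig, copied from the FASTA record above.
first_line = "atggcatctgtggctcagcaattgagtgctgttcggtggtctcCAATGCTATGGAGGCGG"
print(translate(first_line))
```

The translation starts with MASVAQQLSA, which agrees with the start of the Query lines in the alignments (the alignments additionally contain gap characters).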


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC020463A_C01 KMC020463A_c01
         (554 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

sp|P29344|RR1_SPIOL 30S ribosomal protein S1, chloroplast precur...   243  9e-64
ref|NP_198266.1| ribosomal protein S1; protein id: At5g29771.1, ...   243  1e-63
gb|ZP_00113919.1| hypothetical protein [Prochlorococcus marinus ...   104  6e-22
ref|ZP_00116164.1| hypothetical protein [Synechococcus sp. WH 8102]   103  2e-21
ref|NP_440890.1| 30S ribosomal protein S1 [Synechocystis sp. PCC...   102  4e-21

>sp|P29344|RR1_SPIOL 30S ribosomal protein S1, chloroplast precursor (CS1)
           gi|322404|pir||A44121 ribosomal protein S1 precursor,
           chloroplast - spinach gi|18060|emb|CAA46927.1| ribosomal
           protein S1 [Spinacia oleracea] gi|170143|gb|AAA34045.1|
           chloroplast ribosomal protein S1
          Length = 411

 Score =  243 bits (621), Expect = 9e-64
 Identities = 127/192 (66%), Positives = 155/192 (80%), Gaps = 8/192 (4%)
 Frame = +1

Query: 1   MASVAQQLSA-VRWSPMLWRRSQKQRRGAGT-------IVCSVAISNAQNKERAKLKQLF 156
           MAS+AQQL+  +R  P+      K      T       IV +VA+SNAQ +ER KLKQLF
Sbjct: 1   MASLAQQLAGGLRCPPLSNSNLSKPFSPKHTLKPRFSPIVSAVAVSNAQTRERQKLKQLF 60

Query: 157 EDAYERCRTAPTDGVSFTLEQFTTALEKYDFDAEIGTKVKGTVFGTDASGAYVDITAKST 336
           EDAYERCR AP +GVSFT++ F TAL+KYDF++E+G++VKGTVF TDA+GA VDITAKS+
Sbjct: 61  EDAYERCRNAPMEGVSFTIDDFHTALDKYDFNSEMGSRVKGTVFCTDANGALVDITAKSS 120

Query: 337 AYLPLQEACIHKIKHVEEAGLVPGVRDEFVIIGENESDDTLFLSLKSIQFGLAWERCRQL 516
           AYLPL EACI++IK+VEEAG++PGVR+EFVIIGENE+DD+L LSL+ IQ+ LAWERCRQL
Sbjct: 121 AYLPLAEACIYRIKNVEEAGIIPGVREEFVIIGENEADDSLILSLRQIQYELAWERCRQL 180

Query: 517 QAEDAVVKGKIV 552
           QAED VVKGKIV
Sbjct: 181 QAEDVVVKGKIV 192

>ref|NP_198266.1| ribosomal protein S1; protein id: At5g29771.1, supported by cDNA:
           4565., supported by cDNA: gi_13877938, supported by
           cDNA: gi_16649088 [Arabidopsis thaliana]
           gi|13877939|gb|AAK44047.1|AF370232_1 putative ribosomal
           protein S1 [Arabidopsis thaliana]
           gi|16649089|gb|AAL24396.1| Unknown protein [Arabidopsis
           thaliana] gi|21593804|gb|AAM65771.1| ribosomal protein
           S1 [Arabidopsis thaliana] gi|23296539|gb|AAN13122.1|
           putative ribosomal protein S1 [Arabidopsis thaliana]
          Length = 416

 Score =  243 bits (619), Expect = 1e-63
 Identities = 122/195 (62%), Positives = 160/195 (81%), Gaps = 11/195 (5%)
 Frame = +1

Query: 1   MASVAQQLSAVRWSPM-----LWRRSQK---QRRGAG---TIVCSVAISNAQNKERAKLK 147
           MAS+AQQ S +R SP+     L RR+ K   Q + A    TIV +VA+S+ Q KER +LK
Sbjct: 1   MASLAQQFSGLRCSPLSSSSRLSRRASKNFPQNKSASVSPTIVAAVAMSSGQTKERLELK 60

Query: 148 QLFEDAYERCRTAPTDGVSFTLEQFTTALEKYDFDAEIGTKVKGTVFGTDASGAYVDITA 327
           ++FEDAYERCRT+P +GV+FT++ F  A+E+YDF++EIGT+VKGTVF TDA+GA VDI+A
Sbjct: 61  KMFEDAYERCRTSPMEGVAFTVDDFAAAIEQYDFNSEIGTRVKGTVFKTDANGALVDISA 120

Query: 328 KSTAYLPLQEACIHKIKHVEEAGLVPGVRDEFVIIGENESDDTLFLSLKSIQFGLAWERC 507
           KS+AYL +++ACIH+IKHVEEAG+VPG+ +EFVIIGENESDD+L LSL++IQ+ LAWERC
Sbjct: 121 KSSAYLSVEQACIHRIKHVEEAGIVPGMVEEFVIIGENESDDSLLLSLRNIQYELAWERC 180

Query: 508 RQLQAEDAVVKGKIV 552
           RQLQAED +VK K++
Sbjct: 181 RQLQAEDVIVKAKVI 195

>gb|ZP_00113919.1| hypothetical protein [Prochlorococcus marinus str. MIT 9313]
          Length = 367

 Score =  104 bits (260), Expect = 6e-22
 Identities = 52/131 (39%), Positives = 80/131 (60%)
 Frame = +1

Query: 157 EDAYERCRTAPTDGVSFTLEQFTTALEKYDFDAEIGTKVKGTVFGTDASGAYVDITAKST 336
           +D   R       G  FTL++F + L KYD++ + G  V GTVF  ++ GA +DI AK+ 
Sbjct: 52  DDPSSRAAKNDLSGAGFTLDEFASLLSKYDYNFKPGDIVNGTVFALESKGAMIDIGAKTA 111

Query: 337 AYLPLQEACIHKIKHVEEAGLVPGVRDEFVIIGENESDDTLFLSLKSIQFGLAWERCRQL 516
           A++PLQE  I++++ + +  L+PG   EF I+ E   D  L LS++ I++  AWER RQL
Sbjct: 112 AFMPLQEVSINRVEGLSDV-LLPGEIREFFIMSEENEDGQLSLSIRRIEYQRAWERVRQL 170

Query: 517 QAEDAVVKGKI 549
           Q EDA +  ++
Sbjct: 171 QKEDATIYSEV 181

>ref|ZP_00116164.1| hypothetical protein [Synechococcus sp. WH 8102]
          Length = 367

 Score =  103 bits (256), Expect = 2e-21
 Identities = 52/131 (39%), Positives = 79/131 (59%)
 Frame = +1

Query: 157 EDAYERCRTAPTDGVSFTLEQFTTALEKYDFDAEIGTKVKGTVFGTDASGAYVDITAKST 336
           +D   R  +   D   FT+++F   L KYD++ + G  V GTVF  +A GA +DI AK+ 
Sbjct: 52  DDPGSRASSRNLDDAGFTIDEFAALLSKYDYNFKPGDIVNGTVFALEAKGAMIDIGAKTA 111

Query: 337 AYLPLQEACIHKIKHVEEAGLVPGVRDEFVIIGENESDDTLFLSLKSIQFGLAWERCRQL 516
           A++PLQE  I++++ + +  L PG   EF I+ E   D  L LS++ I++  AWER RQL
Sbjct: 112 AFMPLQEVSINRVEGLSDV-LQPGEIREFFIMSEENEDGQLALSVRRIEYQRAWERVRQL 170

Query: 517 QAEDAVVKGKI 549
           Q EDA +  ++
Sbjct: 171 QKEDATIYSEV 181

>ref|NP_440890.1| 30S ribosomal protein S1 [Synechocystis sp. PCC 6803]
           gi|2500385|sp|P73530|RS1A_SYNY3 30S ribosomal protein S1
           homolog A gi|7447089|pir||S77236 ribosomal protein S1 -
           Synechocystis sp. (strain PCC 6803)
           gi|1652650|dbj|BAA17570.1| 30S ribosomal protein S1
           [Synechocystis sp. PCC 6803]
          Length = 328

 Score =  102 bits (253), Expect = 4e-21
 Identities = 53/120 (44%), Positives = 73/120 (60%)
 Frame = +1

Query: 190 TDGVSFTLEQFTTALEKYDFDAEIGTKVKGTVFGTDASGAYVDITAKSTAYLPLQEACIH 369
           T  + FTLE F   L+KYD+    G  V GTVF  ++ GA +DI AK+ AY+P+QE  I+
Sbjct: 7   TATIGFTLEDFAALLDKYDYHFSPGDIVAGTVFSMESRGALIDIGAKTAAYIPIQEMSIN 66

Query: 370 KIKHVEEAGLVPGVRDEFVIIGENESDDTLFLSLKSIQFGLAWERCRQLQAEDAVVKGKI 549
           ++   EE  L P    EF I+ +   D  L LS++ I++  AWER RQLQAEDA V+  +
Sbjct: 67  RVDDPEEV-LQPNETREFFILTDENEDGQLTLSIRRIEYMRAWERVRQLQAEDATVRSNV 125

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 468,392,961
Number of Sequences: 1393205
Number of extensions: 9625805
Number of successful extensions: 33639
Number of sequences better than 10.0: 79
Number of HSP's better than 10.0 without gapping: 32387
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 33596
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 19521267756
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
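
The bit scores and Expect values in the alignments above can be roughly reproduced from the raw scores (the numbers in parentheses) with the gapped Lambda and K and the effective search space listed in this block. The sketch below uses the usual Karlin-Altschul relations, S' = (Lambda*S - ln K) / ln 2 and E = search_space * 2^(-S'). The function names are illustrative, and because the report prints rounded parameters, the results are expected to agree with the report only approximately.

```python
import math

# Gapped statistical parameters and search space copied from the report above.
LAMBDA = 0.267
K = 0.0410
SEARCH_SPACE = 19521267756  # "effective search space used"

def bit_score(raw_score):
    """Convert a raw alignment score S to bits: (LAMBDA*S - ln K) / ln 2."""
    return (LAMBDA * raw_score - math.log(K)) / math.log(2)

def expect_value(raw_score):
    """Expected number of chance alignments scoring at least raw_score."""
    return SEARCH_SPACE * 2.0 ** (-bit_score(raw_score))

# Raw scores of the five alignments shown above.
for raw in (621, 619, 260, 256, 253):
    print(raw, round(bit_score(raw), 1), f"{expect_value(raw):.1e}")
```

For the top hit (raw score 621) this gives a bit score just under 244 and an Expect value within a small factor of the reported 9e-64; the remaining difference is attributable to rounding of the printed parameters.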


EST assemble image


no.  clone         accession  start  end
1    MFBL017g02_f  BP042124   1      554
2    MF002d01_f    BP028327   44     517
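
To see how the two ESTs support the contig, the placements listed above can be turned into a per-position coverage count. This is a minimal sketch that assumes the two numbers after each accession are 1-based, inclusive start and end coordinates on the 554-letter contig; the function name `coverage_depth` is hypothetical.

```python
# Clone placements copied from the table above:
# (clone, accession, start, end), assumed 1-based and inclusive.
CLONES = [
    ("MFBL017g02_f", "BP042124", 1, 554),
    ("MF002d01_f", "BP028327", 44, 517),
]
CONTIG_LENGTH = 554  # query length reported in the BLASTX section

def coverage_depth(clones, length):
    """Return a list giving the number of clones covering each contig position."""
    depth = [0] * length
    for _clone, _accession, start, end in clones:
        for pos in range(start - 1, end):
            depth[pos] += 1
    return depth

depth = coverage_depth(CLONES, CONTIG_LENGTH)
print("positions covered by both ESTs:", depth.count(2))
print("positions covered by one EST:", depth.count(1))
```

Under that assumption, positions 44 to 517 are supported by both clones, and the two ends of the contig rest on MFBL017g02_f alone.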




Lotus japonicus
Kazusa DNA Research Institute