KMC003626A_c01

Fasta Sequence
>KMC003626A_C01 KMC003626A_c01
gaatcggcacgaggcTAGTCCAAATCTGAAAACCCTAGTTTCAACCCTCACTCACTCGCT
TCACTCCAAGAATCACCAAACGAAGCCATGGTTAACGAATCCATCACCAAGAAGCGAAAG
CTCGTCCCCAAATCCTCTCAATCGGCGGAGCTACCGTTCAAGCTGCAGAACAAGTTGGAG
GCTGAGGTTCAAGAGTACGAAGAAGTCGAAGAAGAAGTAGAGGAAGAAGTGGAGGAGGAA
GAAGAAGAGGAGGAAGAGGAAGAGGAGGGTGAAGAAGAAGAAGAAGAAGAAGAAGAAGAA
GAGGAAGTGGAAGCGGAAGCGGAGGAGGAAGACGATGAACCGATCCAGAAGCTTCTCGAA
CCCATGGGGAAGGAGCAGATCATTAGCCTCCTTGCCGACGCGGCGTCTAACCACCGCGAT
GTAGCAGATCGGATCCGTAAGGTGGCCGACGAGGACGTTTCTCATCGGAAAATCTTCGTT
CACGGCCTCGGGTGGGATACCACCGCCACGACGCTGGTTTACGCGTTCCAGCAGTACGGC
GCGATTGAGGATTGCAAGGCGGTT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC003626A_C01 KMC003626A_c01
         (564 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAM73765.1|AF403292_1 RNA-binding protein AKIP1 [Vicia faba]       182  2e-45
gb|AAK68825.1| putative protein [Arabidopsis thaliana] gi|201483...   163  2e-39
ref|NP_567042.1| RRM-containing RNA-binding protein; protein id:...   163  2e-39
ref|NP_181639.1| RRM-containing RNA-binding protein; protein id:...   144  8e-34
dbj|BAA90354.1| unnamed protein product [Oryza sativa (japonica ...   113  1e-24

>gb|AAM73765.1|AF403292_1 RNA-binding protein AKIP1 [Vicia faba]
          Length = 515

 Score =  182 bits (462), Expect = 2e-45
 Identities = 102/165 (61%), Positives = 119/165 (71%), Gaps = 9/165 (5%)
 Frame = +1

Query: 97  ESITKKRKLVPKSSQSAELPFKL---QNKLEAEVQEY---EEVEEE---VEEEVEEEEEE 249
           +++ +KRKLV KSSQ  E P K    Q + + E + Y   EEVEEE   VEEE EE EE 
Sbjct: 19  QTMVRKRKLVAKSSQPEEPPLKQHHSQPQPDPEPEPYIPTEEVEEEYEEVEEEYEEVEEI 78

Query: 250 EEEEEEGEEEEEEEEEEEEVEAEAEEEDDEPIQKLLEPMGKEQIISLLADAASNHRDVAD 429
           E EEEE EEEEEE+  E+    E  EEDDEPI+ L+EP  KEQI +LL +AA+ HRDVAD
Sbjct: 79  EVEEEEEEEEEEEDGGEQAQGGEYLEEDDEPIKDLVEPFTKEQIATLLCEAAAKHRDVAD 138

Query: 430 RIRKVADEDVSHRKIFVHGLGWDTTATTLVYAFQQYGAIEDCKAV 564
           RIRK+AD D SHRKIFVHGLGWDTT+ TL+ AF QYG IEDCKAV
Sbjct: 139 RIRKIADGDASHRKIFVHGLGWDTTSATLINAFSQYGEIEDCKAV 183

>gb|AAK68825.1| putative protein [Arabidopsis thaliana] gi|20148397|gb|AAM10089.1|
           putative protein [Arabidopsis thaliana]
          Length = 478

 Score =  163 bits (412), Expect = 2e-39
 Identities = 94/173 (54%), Positives = 118/173 (67%), Gaps = 19/173 (10%)
 Frame = +1

Query: 103 ITKKRKLVPKSSQSAELPF-KLQNKLEAEVQ----------EYEEVE-EEVEEEVEEEEE 246
           +TKKRKL  + S  AE P  KL+   E E Q          + EEVE EEVEEE EEE E
Sbjct: 1   MTKKRKLEGEESNEAEEPSQKLKQTPEEEQQLVIKNQDNQGDVEEVEYEEVEEEQEEEVE 60

Query: 247 EEEEEEEGEEEEEEEEEEEEVEAEA-------EEEDDEPIQKLLEPMGKEQIISLLADAA 405
           ++++E++G+E E++ +    +EA A       E++DDEPIQ LLEP  KEQ++SLL +AA
Sbjct: 61  DDDDEDDGDENEDQTDGNR-IEAAATSGSGDQEDDDDEPIQDLLEPFSKEQVLSLLKEAA 119

Query: 406 SNHRDVADRIRKVADEDVSHRKIFVHGLGWDTTATTLVYAFQQYGAIEDCKAV 564
             H DVA+RIR+VADED  HRKIFVHGLGWDT   TL+ AF+QYG IEDCKAV
Sbjct: 120 EKHVDVANRIREVADEDPVHRKIFVHGLGWDTKTETLIEAFKQYGEIEDCKAV 172

>ref|NP_567042.1| RRM-containing RNA-binding protein; protein id: At3g56860.1,
           supported by cDNA: gi_14194148, supported by cDNA:
           gi_14335131, supported by cDNA: gi_14596194, supported
           by cDNA: gi_20148396, supported by cDNA: gi_20259481
           [Arabidopsis thaliana] gi|11358431|pir||T51274
           hypothetical protein T8M16_190 - Arabidopsis thaliana
           gi|9663005|emb|CAC00749.1| putative protein [Arabidopsis
           thaliana] gi|14194149|gb|AAK56269.1|AF367280_1
           AT3g56860/T8M16_190 [Arabidopsis thaliana]
           gi|14335132|gb|AAK59846.1| AT3g56860/T8M16_190
           [Arabidopsis thaliana] gi|19682816|emb|CAD28672.1| UBP1
           interacting protein 2a [Arabidopsis thaliana]
           gi|20259482|gb|AAM13861.1| unknown protein [Arabidopsis
           thaliana] gi|21436451|gb|AAM51426.1| unknown protein
           [Arabidopsis thaliana] gi|22137068|gb|AAM91379.1|
           At3g56860/T8M16_190 [Arabidopsis thaliana]
          Length = 478

 Score =  163 bits (412), Expect = 2e-39
 Identities = 94/173 (54%), Positives = 118/173 (67%), Gaps = 19/173 (10%)
 Frame = +1

Query: 103 ITKKRKLVPKSSQSAELPF-KLQNKLEAEVQ----------EYEEVE-EEVEEEVEEEEE 246
           +TKKRKL  + S  AE P  KL+   E E Q          + EEVE EEVEEE EEE E
Sbjct: 1   MTKKRKLEGEESNEAEEPSQKLKQTPEEEQQLVIKNQDNQGDVEEVEYEEVEEEQEEEVE 60

Query: 247 EEEEEEEGEEEEEEEEEEEEVEAEA-------EEEDDEPIQKLLEPMGKEQIISLLADAA 405
           ++++E++G+E E++ +    +EA A       E++DDEPIQ LLEP  KEQ++SLL +AA
Sbjct: 61  DDDDEDDGDENEDQTDGNR-IEAAATSGSGNQEDDDDEPIQDLLEPFSKEQVLSLLKEAA 119

Query: 406 SNHRDVADRIRKVADEDVSHRKIFVHGLGWDTTATTLVYAFQQYGAIEDCKAV 564
             H DVA+RIR+VADED  HRKIFVHGLGWDT   TL+ AF+QYG IEDCKAV
Sbjct: 120 EKHVDVANRIREVADEDPVHRKIFVHGLGWDTKTETLIEAFKQYGEIEDCKAV 172

>ref|NP_181639.1| RRM-containing RNA-binding protein; protein id: At2g41060.1,
           supported by cDNA: gi_16612301 [Arabidopsis thaliana]
           gi|7487621|pir||T02113 probable RNA-binding protein
           At2g41060 [imported] - Arabidopsis thaliana
           gi|3402711|gb|AAD12005.1| putative RNA-binding protein
           [Arabidopsis thaliana]
           gi|16612302|gb|AAL27512.1|AF439844_1 At2g41060/T3K9.17
           [Arabidopsis thaliana] gi|22137136|gb|AAM91413.1|
           At2g41060/T3K9.17 [Arabidopsis thaliana]
          Length = 451

 Score =  144 bits (363), Expect = 8e-34
 Identities = 79/160 (49%), Positives = 105/160 (65%), Gaps = 6/160 (3%)
 Frame = +1

Query: 103 ITKKRKLVPKSSQSAELPFKLQNKLEAEVQEYEEVE---EEVEEEVEEEEEEEEEEEEGE 273
           +TKKRKL  +S++++E   K Q + E E  E   V+   ++ E+ VE++  +E  EEE +
Sbjct: 1   MTKKRKLESESNETSEPTEKQQQQCEKEDPEIRNVDNQRDDDEQVVEQDTLKEMHEEEAK 60

Query: 274 EEEEEEEEEEEVEAEAEEEDD---EPIQKLLEPMGKEQIISLLADAASNHRDVADRIRKV 444
            E+  E E          EDD   EPI+ LLEP  K+Q++ LL +AA  HRDVA+RIR V
Sbjct: 61  GEDNIEAETSSGSGNQGNEDDDEEEPIEDLLEPFSKDQLLILLKEAAERHRDVANRIRIV 120

Query: 445 ADEDVSHRKIFVHGLGWDTTATTLVYAFQQYGAIEDCKAV 564
           ADED+ HRKIFVHGLGWDT A +L+ AF+QYG IEDCK V
Sbjct: 121 ADEDLVHRKIFVHGLGWDTKADSLIDAFKQYGEIEDCKCV 160

>dbj|BAA90354.1| unnamed protein product [Oryza sativa (japonica cultivar-group)]
          Length = 490

 Score =  113 bits (283), Expect = 1e-24
 Identities = 62/119 (52%), Positives = 75/119 (62%)
 Frame = +1

Query: 208 EEEVEEEVEEEEEEEEEEEEGEEEEEEEEEEEEVEAEAEEEDDEPIQKLLEPMGKEQIIS 387
           EEEVEE   EEE E +E+E+GE E EEEEE       A E D + IQ LL    K+Q++ 
Sbjct: 77  EEEVEEVEVEEEVEVDEDEDGEGEGEEEEE-------AAERDADSIQALLNSFPKDQLVE 129

Query: 388 LLADAASNHRDVADRIRKVADEDVSHRKIFVHGLGWDTTATTLVYAFQQYGAIEDCKAV 564
           LL+ AA +H DV   + + AD D + RKIFVHGLGWD TA TL  AF  YG IED + V
Sbjct: 130 LLSAAALSHEDVLTAVHRAADADPALRKIFVHGLGWDATAETLTEAFSAYGEIEDLRVV 188

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.302    0.124    0.320 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 537,493,028
Number of Sequences: 1393205
Number of extensions: 16359622
Number of successful extensions: 1191613
Number of sequences better than 10.0: 25268
Number of HSP's better than 10.0 without gapping: 140276
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 505768
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 20382500157
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 17 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.7 bits)
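
The statistics above can be tied back to the scores in the hit list. The following is a minimal sketch, assuming the standard Karlin-Altschul relations (bit score = (lambda * S - ln K) / ln 2 and E = K * search_space * exp(-lambda * S)) and the gapped Lambda and K values printed in this report. The function and constant names are illustrative and are not part of BLAST. Because the report prints Lambda and K rounded, the results only approximate the reported figures.

```python
import math

# Values copied from the report above (gapped statistics).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
SEARCH_SPACE = 20382500157        # "effective search space used"
DB_LETTERS = 448689247            # "length of database"
DB_SEQUENCES = 1393205            # "Number of Sequences"
EFFECTIVE_HSP_LENGTH = 116        # "effective HSP length"


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, lam, k, search_space):
    """Expected number of chance HSPs scoring at least raw_score."""
    return k * search_space * math.exp(-lam * raw_score)


# Each database sequence is shortened by the effective HSP length.
effective_db_length = DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH

# Top hit (Vicia faba AKIP1): raw score 462 in the report.
top_hit_bits = bit_score(462, LAMBDA_GAPPED, K_GAPPED)
top_hit_expect = expect_value(462, LAMBDA_GAPPED, K_GAPPED, SEARCH_SPACE)

print(effective_db_length)
print(f"{top_hit_bits:.1f} bits, E = {top_hit_expect:.1e}")
```

With the rounded parameters, the top hit comes out between 182 and 183 bits with an E-value on the order of 2e-45, in line with the reported 182 bits and 2e-45. The effective database length reproduces the reported 287,077,467 exactly.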


EST assemble image


   clone        accession  position
1  GNf056c02    BP071529     1  481
2  MFB050b07_f  BP037599    16  567
3  MPD098a05_f  AV776381    30  462

Lotus japonicus
Kazusa DNA Research Institute