KMC004086A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC004086A_C01 KMC004086A_c01
aaattaatGCCCTAAATGATGACATGCCTTGATTATTTTCAGACAAGTGACACTTACACC
ACCCCAACTCCTTACAACAGATTATAATCTTAAAAATCACATCACATGAATGGTCATAGT
CATAGGTTCATCAGTGTCAGCATACGAATTAAGAAAAAGCAAAAGATATTAGCCATTCTC
ACCTAATGACCCATATAGGGAGCTCCAGAACCAGGGTGAGGTCTAACACCACTGTTCTGT
CCCATCTGAGGGATTCTGGGTATCCCGGTTGCATCCCTGGCTGGTTCCCATAGTTCATGG
GAGCCTGATTACCATAAGGAGCAGCAGGCAGGTGGTCCAGCCTGGTTCACTGGTGCACCA
CTAAAAGTTCCAAGCAGGTTCCCAAGCCCCAACCCAGCTCCTTGATTAGCAAGCAAAGCT
AAAGCCTGCCCTAGAGCAGGGTTCAACCCCTGACCCTGCACACCAGGGTTATAACCTCCC
ATGCCGGGATTCGATGGCGCCATCAGGTGACCACCACCACCATGTGATGGCCCACCACCA
GAGTTGTACTTGTGCTTCTTATGCGGCTGATAGTGAGGCTGATGATGGTGAGAATGGTGT
TGGTGCTGGTGCTGGTGCTGGTTCTGATGATACCCTTTGCTCCCCTTAGGACCATCCACA
GCCTTCTGACAATACAGAGTGTGCCCTTCATATTCCTTGTTTGGCTCTTCTAAAGCCTTC
TTGGCACTCTCCACCGACCTGTAAACAAAAAGCGCAAACCCCTTCGGCCTCCCGGTGTTC
TTATCAAGCCCCAAGGGACCATCTTCAACCTCCCCAAACTGCTTGAAGAACTCAAGCAGC
TTCAATGGTTCAATATCCGAGCTCACATTGCTCACAAAGATCTTCCTCTGGGTGTACTCA
GAAACCGGAGGAGCATTCGGCGGCGTTGCCGGCACCGGTCCAGCGGATGCCAACTGAGAC
GAggtcgttcggttcccaatcttcttctgaggattcttcagagccctgcgacaaccgtcc
ctatgcttgaacaggggggccc


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004086A_C01 KMC004086A_c01
         (1042 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_181639.1| RRM-containing RNA-binding protein; protein id:...   236  3e-61
gb|AAM73765.1|AF403292_1 RNA-binding protein AKIP1 [Vicia faba]       236  3e-61
gb|AAK68825.1| putative protein [Arabidopsis thaliana] gi|201483...   233  5e-60
ref|NP_567042.1| RRM-containing RNA-binding protein; protein id:...   233  5e-60
dbj|BAA90354.1| unnamed protein product [Oryza sativa (japonica ...   227  3e-58

>ref|NP_181639.1| RRM-containing RNA-binding protein; protein id: At2g41060.1,
            supported by cDNA: gi_16612301 [Arabidopsis thaliana]
            gi|7487621|pir||T02113 probable RNA-binding protein
            At2g41060 [imported] - Arabidopsis thaliana
            gi|3402711|gb|AAD12005.1| putative RNA-binding protein
            [Arabidopsis thaliana]
            gi|16612302|gb|AAL27512.1|AF439844_1 At2g41060/T3K9.17
            [Arabidopsis thaliana] gi|22137136|gb|AAM91413.1|
            At2g41060/T3K9.17 [Arabidopsis thaliana]
          Length = 451

 Score =  236 bits (603), Expect = 3e-61
 Identities = 150/297 (50%), Positives = 178/297 (59%), Gaps = 15/297 (5%)
 Frame = -3

Query: 1034 LFKHRDGCRRALKNPQKKIGNRTTSSQLASAGPVPATPPNAPPVS---EYTQRKIFVSNV 864
            LFK R G R ALK PQKKIG R T+ QLAS GPV   P  AP      E  QRKI+VSNV
Sbjct: 175  LFKSRSGARNALKQPQKKIGTRMTACQLASIGPVQGNPVVAPAQHFNPENVQRKIYVSNV 234

Query: 863  SSDIEPLKLLEFFKQFGEVEDGPLGLDKNTGRPKGFALFVYRSVESAKKALEEPNKEYEG 684
            S+DI+P KLLEFF +FGE+E+GPLGLDK TGRPKGFALFVYRS+ESAKKALEEP+K +EG
Sbjct: 235  SADIDPQKLLEFFSRFGEIEEGPLGLDKATGRPKGFALFVYRSLESAKKALEEPHKTFEG 294

Query: 683  HTLYCQKAVDGPKGSKGYHQNQHQHQHQHHSHHHQPHYQPHKKHKYNSGGGPSHGGGGHL 504
            H L+C KA DGPK  K       QHQH H+SH+    YQ +  + Y   G P  GG GH 
Sbjct: 295  HVLHCHKANDGPKQVK-------QHQHNHNSHNQNSRYQRNDNNGY---GAP--GGHGHF 342

Query: 503  MAPSNPGMGGYNPGVQGQGLNPALGQAL-ALLANQGAGLGLGN-----LLGTF-SGAPVN 345
            +A +N  +         Q  NPA+GQAL ALLA+QGAGLGL       LLGT  + +P  
Sbjct: 343  IAGNNQAV---------QAFNPAIGQALTALLASQGAGLGLNQAFGQALLGTLGTASPGA 393

Query: 344  QAGPPACCSLW*S---GSHELWEPARDATGIPRIPQMGQNSGVRPHPGS--GAPYMG 189
              G P+      +   G +  +       G  +  Q GQ    R   G+  G PYMG
Sbjct: 394  VGGMPSGYGTQANISPGVYPGYGAQAGYQGGYQTQQPGQGGAGRGQHGAGYGGPYMG 450

 Score = 61.2 bits (147), Expect = 3e-08
 Identities = 30/84 (35%), Positives = 48/84 (56%)
 Frame = -3

Query: 899 EYTQRKIFVSNVSSDIEPLKLLEFFKQFGEVEDGPLGLDKNTGRPKGFALFVYRSVESAK 720
           +   RKIFV  +  D +   L++ FKQ+GE+ED    +DK +G+ KG+   +++S   A+
Sbjct: 124 DLVHRKIFVHGLGWDTKADSLIDAFKQYGEIEDCKCVVDKVSGQSKGYGFILFKSRSGAR 183

Query: 719 KALEEPNKEYEGHTLYCQKAVDGP 648
            AL++P K+       CQ A  GP
Sbjct: 184 NALKQPQKKIGTRMTACQLASIGP 207

>gb|AAM73765.1|AF403292_1 RNA-binding protein AKIP1 [Vicia faba]
          Length = 515

 Score =  236 bits (603), Expect = 3e-61
 Identities = 147/319 (46%), Positives = 184/319 (57%), Gaps = 36/319 (11%)
 Frame = -3

Query: 1034 LFKHRDGCRRALKNPQKKIGNRTTSSQLASAGPVPATPPNA----PPVSEYTQRKIFVSN 867
            LFK R G R ALK PQKKIGNR T+ QLAS GPV  TP  A     P SEYTQRKI++SN
Sbjct: 198  LFKKRSGARNALKEPQKKIGNRMTACQLASIGPVAQTPQPAVQLVQPGSEYTQRKIYISN 257

Query: 866  VSSDIEPLKLLEFFKQFGEVEDGPLGLDKNTGRPKGFALFVYRSVESAKKALEEPNKEYE 687
            V  +++P KL  +F +FGE+E+GPLGLDK TG+PKGF LFVY+S ESA+++LEEP+KE+E
Sbjct: 258  VGPELDPHKLFAYFSRFGEIEEGPLGLDKATGKPKGFCLFVYKSAESARRSLEEPHKEFE 317

Query: 686  GHTLYCQKAVDGPKGSKGYHQNQHQHQH-----------QHHSHHHQPHYQPHKKHKYNS 540
            GH L+CQ+A+DGPK  K  H    Q QH           Q +    +  +Q +    Y  
Sbjct: 318  GHILHCQRAIDGPKAGKTQHPQPQQLQHVNSLNQFNQLNQLNPATQRTQFQRNDNVGYVG 377

Query: 539  GGGPSHGGGGHLMAPSNPGMGGYNPGV--QGQGLN----PALGQAL-ALLANQGAGLGLG 381
            G   +    GHLMAP+ P + GYN       QGLN    PALGQAL ALLA+QGA LGL 
Sbjct: 378  GSSVAVSQPGHLMAPAGPTI-GYNQAAASAAQGLNPVLTPALGQALTALLASQGATLGLS 436

Query: 380  NLLGTF-SGAPVNQAGPPACCSLW*SGSHELWEPARDATG-----IPRI--------PQM 243
            +LLG+  +   +N  G PA      SG       +    G     +P++         Q+
Sbjct: 437  DLLGSLGTSNALNHGGVPAAGHGVQSGYSAQPTISPSVMGAYGNSVPQVGLQVQYPNQQI 496

Query: 242  GQNSGVRPHPGSGAPYMGH 186
            GQ    R   G  A YMGH
Sbjct: 497  GQGGSGRGQYGGAASYMGH 515

 Score = 60.1 bits (144), Expect = 6e-08
 Identities = 38/132 (28%), Positives = 56/132 (41%), Gaps = 6/132 (4%)
 Frame = -3

Query: 893 TQRKIFVSNVSSDIEPLKLLEFFKQFGEVEDGPLGLDKNTGRPKGFALFVYRSVESAKKA 714
           + RKIFV  +  D     L+  F Q+GE+ED     DK +G+ KG+   +++    A+ A
Sbjct: 149 SHRKIFVHGLGWDTTSATLINAFSQYGEIEDCKAVTDKASGKSKGYGFILFKKRSGARNA 208

Query: 713 LEEPNKEYEGHTLYCQKAVDGPKGS------KGYHQNQHQHQHQHHSHHHQPHYQPHKKH 552
           L+EP K+       CQ A  GP         +         Q + +  +  P   PHK  
Sbjct: 209 LKEPQKKIGNRMTACQLASIGPVAQTPQPAVQLVQPGSEYTQRKIYISNVGPELDPHKLF 268

Query: 551 KYNSGGGPSHGG 516
            Y S  G    G
Sbjct: 269 AYFSRFGEIEEG 280

>gb|AAK68825.1| putative protein [Arabidopsis thaliana] gi|20148397|gb|AAM10089.1|
            putative protein [Arabidopsis thaliana]
          Length = 478

 Score =  233 bits (593), Expect = 5e-60
 Identities = 150/307 (48%), Positives = 183/307 (58%), Gaps = 24/307 (7%)
 Frame = -3

Query: 1034 LFKHRDGCRRALKNPQKKIGNRTTSSQLASAGPVPATPPNAPPV---------SEYTQRK 882
            L+K R G R ALK PQKKIG+R T+ QLAS GPV    P A            SE+TQ+K
Sbjct: 187  LYKSRSGARNALKQPQKKIGSRMTACQLASKGPVFGGAPIAAAAVSAPAQHSNSEHTQKK 246

Query: 881  IFVSNVSSDIEPLKLLEFFKQFGEVEDGPLGLDKNTGRPKGFALFVYRSVESAKKALEEP 702
            I+VSNV ++++P KLL FF +FGE+E+GPLGLDK TGRPKGF LFVY+S ESAK+ALEEP
Sbjct: 247  IYVSNVGAELDPQKLLMFFSKFGEIEEGPLGLDKYTGRPKGFCLFVYKSSESAKRALEEP 306

Query: 701  NKEYEGHTLYCQKAVDGPKGSKGYHQNQHQHQHQHHSHHHQPHYQPHKKHKYNSGGGPSH 522
            +K +EGH L+CQKA+DGPK  K     Q QH H  H++++ P YQ +     N+G GP  
Sbjct: 307  HKTFEGHILHCQKAIDGPKPGK-----QQQHHHNPHAYNN-PRYQRND----NNGYGPP- 355

Query: 521  GGGGHLMAPSNPGMGGYNPGVQGQGLNPALGQAL-ALLANQGAGL--------GLGNLLG 369
            GG GHLMA +  GMG    G   Q +NPA+GQAL ALLA+QGAGL         L   LG
Sbjct: 356  GGHGHLMAGNPAGMG----GPTAQVINPAIGQALTALLASQGAGLAFNPAIGQALLGSLG 411

Query: 368  TFSGA-PVNQAGPPA--CCSLW*SGSHELWEPARDATGIPRIPQMGQNSGVRPHPG---S 207
            T +G  P N  G P          G+   +       G  + PQ GQ    R   G    
Sbjct: 412  TAAGVNPGNGVGMPTGYGTQAMAPGTMPGYGTQPGLQGGYQTPQPGQGGTSRGQHGVGPY 471

Query: 206  GAPYMGH 186
            G PYMGH
Sbjct: 472  GTPYMGH 478

 Score = 65.1 bits (157), Expect = 2e-09
 Identities = 32/80 (40%), Positives = 46/80 (57%)
 Frame = -3

Query: 887 RKIFVSNVSSDIEPLKLLEFFKQFGEVEDGPLGLDKNTGRPKGFALFVYRSVESAKKALE 708
           RKIFV  +  D +   L+E FKQ+GE+ED     DK +G+ KG+   +Y+S   A+ AL+
Sbjct: 140 RKIFVHGLGWDTKTETLIEAFKQYGEIEDCKAVFDKISGKSKGYGFILYKSRSGARNALK 199

Query: 707 EPNKEYEGHTLYCQKAVDGP 648
           +P K+       CQ A  GP
Sbjct: 200 QPQKKIGSRMTACQLASKGP 219

>ref|NP_567042.1| RRM-containing RNA-binding protein; protein id: At3g56860.1,
            supported by cDNA: gi_14194148, supported by cDNA:
            gi_14335131, supported by cDNA: gi_14596194, supported by
            cDNA: gi_20148396, supported by cDNA: gi_20259481
            [Arabidopsis thaliana] gi|11358431|pir||T51274
            hypothetical protein T8M16_190 - Arabidopsis thaliana
            gi|9663005|emb|CAC00749.1| putative protein [Arabidopsis
            thaliana] gi|14194149|gb|AAK56269.1|AF367280_1
            AT3g56860/T8M16_190 [Arabidopsis thaliana]
            gi|14335132|gb|AAK59846.1| AT3g56860/T8M16_190
            [Arabidopsis thaliana] gi|19682816|emb|CAD28672.1| UBP1
            interacting protein 2a [Arabidopsis thaliana]
            gi|20259482|gb|AAM13861.1| unknown protein [Arabidopsis
            thaliana] gi|21436451|gb|AAM51426.1| unknown protein
            [Arabidopsis thaliana] gi|22137068|gb|AAM91379.1|
            At3g56860/T8M16_190 [Arabidopsis thaliana]
          Length = 478

 Score =  233 bits (593), Expect = 5e-60
 Identities = 150/307 (48%), Positives = 183/307 (58%), Gaps = 24/307 (7%)
 Frame = -3

Query: 1034 LFKHRDGCRRALKNPQKKIGNRTTSSQLASAGPVPATPPNAPPV---------SEYTQRK 882
            L+K R G R ALK PQKKIG+R T+ QLAS GPV    P A            SE+TQ+K
Sbjct: 187  LYKSRSGARNALKQPQKKIGSRMTACQLASKGPVFGGAPIAAAAVSAPAQHSNSEHTQKK 246

Query: 881  IFVSNVSSDIEPLKLLEFFKQFGEVEDGPLGLDKNTGRPKGFALFVYRSVESAKKALEEP 702
            I+VSNV ++++P KLL FF +FGE+E+GPLGLDK TGRPKGF LFVY+S ESAK+ALEEP
Sbjct: 247  IYVSNVGAELDPQKLLMFFSKFGEIEEGPLGLDKYTGRPKGFCLFVYKSSESAKRALEEP 306

Query: 701  NKEYEGHTLYCQKAVDGPKGSKGYHQNQHQHQHQHHSHHHQPHYQPHKKHKYNSGGGPSH 522
            +K +EGH L+CQKA+DGPK  K     Q QH H  H++++ P YQ +     N+G GP  
Sbjct: 307  HKTFEGHILHCQKAIDGPKPGK-----QQQHHHNPHAYNN-PRYQRND----NNGYGPP- 355

Query: 521  GGGGHLMAPSNPGMGGYNPGVQGQGLNPALGQAL-ALLANQGAGL--------GLGNLLG 369
            GG GHLMA +  GMG    G   Q +NPA+GQAL ALLA+QGAGL         L   LG
Sbjct: 356  GGHGHLMAGNPAGMG----GPTAQVINPAIGQALTALLASQGAGLAFNPAIGQALLGSLG 411

Query: 368  TFSGA-PVNQAGPPA--CCSLW*SGSHELWEPARDATGIPRIPQMGQNSGVRPHPG---S 207
            T +G  P N  G P          G+   +       G  + PQ GQ    R   G    
Sbjct: 412  TAAGVNPGNGVGMPTGYGTQAMAPGTMPGYGTQPGLQGGYQTPQPGQGGTSRGQHGVGPY 471

Query: 206  GAPYMGH 186
            G PYMGH
Sbjct: 472  GTPYMGH 478

 Score = 65.1 bits (157), Expect = 2e-09
 Identities = 32/80 (40%), Positives = 46/80 (57%)
 Frame = -3

Query: 887 RKIFVSNVSSDIEPLKLLEFFKQFGEVEDGPLGLDKNTGRPKGFALFVYRSVESAKKALE 708
           RKIFV  +  D +   L+E FKQ+GE+ED     DK +G+ KG+   +Y+S   A+ AL+
Sbjct: 140 RKIFVHGLGWDTKTETLIEAFKQYGEIEDCKAVFDKISGKSKGYGFILYKSRSGARNALK 199

Query: 707 EPNKEYEGHTLYCQKAVDGP 648
           +P K+       CQ A  GP
Sbjct: 200 QPQKKIGSRMTACQLASKGP 219

>dbj|BAA90354.1| unnamed protein product [Oryza sativa (japonica cultivar-group)]
          Length = 490

 Score =  227 bits (578), Expect = 3e-58
 Identities = 148/312 (47%), Positives = 178/312 (56%), Gaps = 29/312 (9%)
 Frame = -3

Query: 1034 LFKHRDGCRRALKNPQKKIGNRTTSSQLASAGPVP----ATPPNA-----------PPVS 900
            LF  R G R AL+ PQKKIGNRTT+ QLAS GPVP    AT P             PPVS
Sbjct: 203  LFSRRSGARAALREPQKKIGNRTTACQLASVGPVPPGGMATNPAPAVAPAPAQLALPPVS 262

Query: 899  EYTQRKIFVSNVSSDIEPLKLLEFFKQFGEVEDGPLGLDKNTGRPKGFALFVYRSVESAK 720
            EYTQRKIFVSNV +DI+P KLL+FF ++GE+E+GPLGLDK TG+PKGFALFVY++++SAK
Sbjct: 263  EYTQRKIFVSNVGADIDPQKLLQFFSKYGEIEEGPLGLDKVTGKPKGFALFVYKTLDSAK 322

Query: 719  KALEEPNKEYEGHTLYCQKAVDGPKGSKGYHQNQHQHQHQHHSHHHQPHYQPHKKHKYNS 540
            KAL+EP+K++EG  L+CQKA+DGPK +KG                    Y  H      S
Sbjct: 323  KALQEPHKQFEGVVLHCQKAIDGPKPNKGGGLGGLYGAGTSGGRKGAGGYGAH------S 376

Query: 539  GGGPSHGGGGHLMAPSNPGMGGYNPGVQ-GQGLNPALGQAL-ALLANQGAGLGLGNLLG- 369
               P    GGH+M PS        PGV  G G+NPALGQAL A+LA+QG GLGL N+LG 
Sbjct: 377  HSLPGAAVGGHVM-PSPVSSLTSLPGVAGGPGVNPALGQALTAILASQGGGLGLNNILGV 435

Query: 368  --TFSGAPVNQAGPPACCSLW*SGSHELWEPARDATGIPRIPQM---------GQNSGVR 222
                SG P     P A   L              ++G+P +P           G   G  
Sbjct: 436  GANGSGLP----NPGASAGL-------------GSSGLPGMPGAGGYLGGYGGGGGYGST 478

Query: 221  PHPGSGAPYMGH 186
            P  G G  YMGH
Sbjct: 479  PPGGPGRNYMGH 490

 Score = 56.2 bits (134), Expect = 8e-07
 Identities = 32/89 (35%), Positives = 43/89 (47%)
 Frame = -3

Query: 887 RKIFVSNVSSDIEPLKLLEFFKQFGEVEDGPLGLDKNTGRPKGFALFVYRSVESAKKALE 708
           RKIFV  +  D     L E F  +GE+ED  +  D+ TG+ KG+   ++     A+ AL 
Sbjct: 156 RKIFVHGLGWDATAETLTEAFSAYGEIEDLRVVTDRATGKCKGYGFILFSRRSGARAALR 215

Query: 707 EPNKEYEGHTLYCQKAVDGPKGSKGYHQN 621
           EP K+    T  CQ A  GP    G   N
Sbjct: 216 EPQKKIGNRTTACQLASVGPVPPGGMATN 244

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,050,404,319
Number of Sequences: 1393205
Number of extensions: 28862891
Number of successful extensions: 215955
Number of sequences better than 10.0: 3022
Number of HSP's better than 10.0 without gapping: 110146
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 179063
length of database: 448,689,247
effective HSP length: 124
effective length of database: 275,931,827
effective search space used: 61256865594
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MFB011b06_f BP034691 1 523
2 MFB036h08_f BP036664 9 587
3 MFB011c05_f BP034701 10 572
4 MFB100e07_f BP041269 13 429
5 MFB083f05_f BP040084 481 1058




Lotus japonicus
Kazusa DNA Research Institute