KCC001537A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KCC001537A_C01 KCC001537A_c01
cgcatactgtagaatacagaAATACTGTAGAATACAGAAATCACTGTGCCAAGAATCGGG
CCAACTCCTGACCAAGCTTCTGCGTTACTGCTTGACTTTAGCCCATCGTAACATATTGAT
GACAAGCGTCAGAATTCTTTCCCAGAGTTGAAGCTTAGCGAGGAGGGCCCGGAGTCTTCT
GCGCTGCTTTTGTCACTGCTTGGGCTCGATTCTTGAGAAGAGAGCGCCAACTCTAGCTGC
GGAGCGAGGCGGGTGCAGTAGCTGCGACCTAGTCACCCACGACTGGCGTGTGAACCCGCC
TAGCGGCTCCCGTGCCACCGTTGCCACCGTTGCCACCAGCCGTCGGCGCCATGAGCAAGA
TAACGGGGGGCAAGAAGGCGTCGGGCGTGGATAACACGGCCCGTCGCACGTGGGACCGTG
AGGAGTACCGGGCGATAGCGGAGGAGAAGGAGAAGGAGAAGAAGGAAGCGGCCAAGGCCC
GGGGGGCCAAGGACGGTGACGATGACGACCACGAGGAGACGGCGGCGGATATCCGCCGCC
GCAAGCGCCAGGAGCGCGACCCCCTGCATCAGGGCCTGATCGTGGAGCGCTCGCTGCTCA
AGCAGCGCGACTATGCCATCGACCTCACCTCGCGACTTGGCAAGACGCAGGTGGTGGGAT
TCAACACGCCGCTCAACCAGCAGGCGGGCTGGTTCTGCAACGTGTGCAACTGCGTGCTGC
GCGACTCGCAGAGCTACCTCGACCACATCAACGGCAAGTGGCACAACCGCGCTCTGGGCA
TGAACATGAAAGTGGAGAAGTCCACACTGGAGCAGGTGAAGAACAAGTTCGAGGGAGCTC
AATCCCGCAAGAGCCCGCCGnCGGACGAGTACGTGCCTGACGGCTTTGACGGCGCGGCGT
CGGCGGGAGCCAAGAGCGGG


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC001537A_C01 KCC001537A_c01
         (920 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAO39853.1| unknown protein [Oryza sativa (japonica cultivar-...   145  7e-34
ref|NP_566257.1| expressed protein [Arabidopsis thaliana] gi|145...   142  6e-33
gb|AAF26078.1|AC012393_4 hypothetical protein [Arabidopsis thali...   134  2e-30
ref|XP_226007.2| similar to histidyl-tRNA synthetase-like [Rattu...   122  8e-27
ref|XP_316308.1| ENSANGP00000020597 [Anopheles gambiae] gi|21298...   121  1e-26

>gb|AAO39853.1| unknown protein [Oryza sativa (japonica cultivar-group)]
           gi|31249762|gb|AAP46254.1| unknown protein [Oryza sativa
           (japonica cultivar-group)]
          Length = 202

 Score =  145 bits (367), Expect = 7e-34
 Identities = 78/158 (49%), Positives = 102/158 (64%)
 Frame = +3

Query: 384 GVDNTARRTWDREEYRAIAEEKEKEKKEAAKARGAKDGDDDDHEETAADIRRRKRQERDP 563
           GVDNT RR +D+EEY   A ++E+E+KE A                      RK +E+ P
Sbjct: 7   GVDNTFRRKFDKEEYLERARQREREEKEEA----------------------RKGKEKGP 44

Query: 564 LHQGLIVERSLLKQRDYAIDLTSRLGKTQVVGFNTPLNQQAGWFCNVCNCVLRDSQSYLD 743
                 V+R  LK RDY +DL SRLGKTQVV    PL+QQAG++C VC CV++DS +YLD
Sbjct: 45  -----PVQRQPLKHRDYEVDLESRLGKTQVVTPIAPLSQQAGYYCKVCECVVKDSANYLD 99

Query: 744 HINGKWHNRALGMNMKVEKSTLEQVKNKFEGAQSRKSP 857
           HINGK H RALGM+M+VE+++LEQV+ +FE  + RK P
Sbjct: 100 HINGKKHQRALGMSMRVERASLEQVQKRFESLKKRKDP 137

>ref|NP_566257.1| expressed protein [Arabidopsis thaliana] gi|14517383|gb|AAK62582.1|
           AT3g05760/F10A16_5 [Arabidopsis thaliana]
           gi|15450543|gb|AAK96449.1| AT3g05760/F10A16_5
           [Arabidopsis thaliana]
          Length = 202

 Score =  142 bits (359), Expect = 6e-33
 Identities = 75/159 (47%), Positives = 104/159 (65%)
 Frame = +3

Query: 381 SGVDNTARRTWDREEYRAIAEEKEKEKKEAAKARGAKDGDDDDHEETAADIRRRKRQERD 560
           +GVDNT R+ +D EE++  A E+EK++ + +K+R                          
Sbjct: 8   TGVDNTFRKKFDVEEFKERAREREKKESDRSKSRS------------------------- 42

Query: 561 PLHQGLIVERSLLKQRDYAIDLTSRLGKTQVVGFNTPLNQQAGWFCNVCNCVLRDSQSYL 740
              +G  V+R+ LK RDY +DL SRLGKTQVV    PL+QQAG+FC VC+CV++DS +YL
Sbjct: 43  ---KGPPVQRAPLKHRDYHVDLESRLGKTQVVTPVAPLSQQAGYFCRVCDCVVKDSANYL 99

Query: 741 DHINGKWHNRALGMNMKVEKSTLEQVKNKFEGAQSRKSP 857
           DHINGK H RALGM+M+VE+S+LEQV+ +FE  + RK+P
Sbjct: 100 DHINGKKHQRALGMSMRVERSSLEQVQERFEVLKKRKAP 138

>gb|AAF26078.1|AC012393_4 hypothetical protein [Arabidopsis thaliana]
          Length = 180

 Score =  134 bits (337), Expect = 2e-30
 Identities = 74/159 (46%), Positives = 99/159 (61%)
 Frame = +3

Query: 381 SGVDNTARRTWDREEYRAIAEEKEKEKKEAAKARGAKDGDDDDHEETAADIRRRKRQERD 560
           +GVDNT R+ +D EE++  A E+EK+    A                             
Sbjct: 8   TGVDNTFRKKFDVEEFKERAREREKKGVVFAA---------------------------- 39

Query: 561 PLHQGLIVERSLLKQRDYAIDLTSRLGKTQVVGFNTPLNQQAGWFCNVCNCVLRDSQSYL 740
              +G  V+R+ LK RDY +DL SRLGKTQVV    PL+QQAG+FC VC+CV++DS +YL
Sbjct: 40  ---KGPPVQRAPLKHRDYHVDLESRLGKTQVVTPVAPLSQQAGYFCRVCDCVVKDSANYL 96

Query: 741 DHINGKWHNRALGMNMKVEKSTLEQVKNKFEGAQSRKSP 857
           DHINGK H RALGM+M+VE+S+LEQV+ +FE  + RK+P
Sbjct: 97  DHINGKKHQRALGMSMRVERSSLEQVQERFEVLKKRKAP 135

>ref|XP_226007.2| similar to histidyl-tRNA synthetase-like [Rattus norvegicus]
          Length = 727

 Score =  122 bits (306), Expect = 8e-27
 Identities = 71/192 (36%), Positives = 106/192 (54%), Gaps = 9/192 (4%)
 Frame = +3

Query: 324 PPLPPAVGAMSKITGGKKASGVDN---------TARRTWDREEYRAIAEEKEKEKKEAAK 476
           PP  P  G +S   G   A+ +++           RR WD++EY  +AE++  E++E   
Sbjct: 506 PPRWPLGGFLSFSRGAGTATDLEHFFPLQTKNLDFRRKWDKDEYEKLAEKRLTEEREK-- 563

Query: 477 ARGAKDGDDDDHEETAADIRRRKRQERDPLHQGLIVERSLLKQRDYAIDLTSRLGKTQVV 656
               KDG                     P+     V+R LL+ RDY +DL S+LGKT V+
Sbjct: 564 ----KDGK--------------------PVQP---VKRELLRHRDYKVDLESKLGKTIVI 596

Query: 657 GFNTPLNQQAGWFCNVCNCVLRDSQSYLDHINGKWHNRALGMNMKVEKSTLEQVKNKFEG 836
              TP ++  G++CNVC+CV++DS ++LDHINGK H R LGM+M+VE+STL+QVK +FE 
Sbjct: 597 TKTTPQSEMGGYYCNVCDCVVKDSINFLDHINGKKHQRNLGMSMRVERSTLDQVKKRFEV 656

Query: 837 AQSRKSPPXDEY 872
            + +      +Y
Sbjct: 657 NKKKMEEKQKDY 668

>ref|XP_316308.1| ENSANGP00000020597 [Anopheles gambiae] gi|21298623|gb|EAA10768.1|
           ENSANGP00000020597 [Anopheles gambiae str. PEST]
          Length = 949

 Score =  121 bits (304), Expect = 1e-26
 Identities = 69/187 (36%), Positives = 105/187 (55%)
 Frame = +3

Query: 312 VPPLPPLPPAVGAMSKITGGKKASGVDNTARRTWDREEYRAIAEEKEKEKKEAAKARGAK 491
           +P  P   P +GA+      + +S   +  RR WDR+EY  +A E+   K +        
Sbjct: 9   LPLGPQNSPLLGAI------RMSSMRPDDHRRKWDRKEYERLAHERILAKDK-------- 54

Query: 492 DGDDDDHEETAADIRRRKRQERDPLHQGLIVERSLLKQRDYAIDLTSRLGKTQVVGFNTP 671
            G++DD E                      V + LLKQR+Y +DL S+LGK+ V+  +TP
Sbjct: 55  -GNEDDGEP---------------------VTKELLKQREYKVDLDSKLGKSMVINKSTP 92

Query: 672 LNQQAGWFCNVCNCVLRDSQSYLDHINGKWHNRALGMNMKVEKSTLEQVKNKFEGAQSRK 851
            +Q  G++CNVC+CV++DS ++LDHINGK H R LGM+MKVE+S+L+QVK +F+  + + 
Sbjct: 93  SSQSGGYYCNVCDCVVKDSINFLDHINGKKHQRNLGMSMKVERSSLDQVKERFKINKKKT 152

Query: 852 SPPXDEY 872
                +Y
Sbjct: 153 EEKKKDY 159



EST assemble image


clone accession position
1 HC018h07_r AV633280 1 470
2 LC072a11_r AV624036 21 505
3 MX068h01_r BP088777 35 413
4 CM060b10_r AV390750 363 975




Chlamydomonas reinhardtii
Kazusa DNA Research Institute