KCC001501A_c01

Fasta Sequence
>KCC001501A_C01 KCC001501A_c01
gtaaaaaccctcAAGAGATCTAGCTAGCACGCACTGCAACGCAAACTGAGTTTGGCAGGA
CTGCGATTTGTGCTTCTTCTAGCCATCTCGTTACGCCCAGTGCACTTAAGACAGCTATCT
AGCCCTTCCAATAGACCCAGTTGAAAAGATGCAGCGCACTCTGCCTTCTGGTCGCGTCCA
TCAGCAGCAGCGGGCTGGCCCTGCTCGCCGCGCCGTGCCTTTCACCACCGCTCGCCCTCT
GTCGAGCGTTGCGTGTAACGCGGCTCCTGCGGCCAACGGCAACGGTGTGCATGCCAATGG
AAATGCCGCTTCGCACGGAAAGTGCCCGACGCCCGCCCAGACCGTCAGGACGCTCATAGA
CATTGTGAATGAGGGCACACTCTGCACCGTTGGCCCGAACGGCCTCCCGGTCGGCCTGCC
CGTCACATTCAGCATGGACAAGTCCGGAAAGCTGCAGCTCCAGATGGATGCTGCGGCCGT
AGAGATGTCCAACCTCAAGTCCGGAGTGAACTCCTGCAGnCTGATGGTGCAGGCGGCCAC
GCAGGCCGCGCGCGCGGTCGGCGCGGTGTCGCTTCATGGGCC


Nr Search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC001501A_C01 KCC001501A_c01
         (582 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_862421.1| putative hydroxyproline-rich protein [Micrococc...    45  7e-04
ref|ZP_00094438.1| COG4638: Phenylpropionate dioxygenase and rel...    44  0.001
ref|ZP_00139981.1| COG2010: Cytochrome c, mono- and diheme varia...    44  0.002
dbj|BAC57318.1| P0700D12.12 [Oryza sativa (japonica cultivar-gro...    41  0.014
ref|NP_566678.1| expressed protein [Arabidopsis thaliana] gi|928...    40  0.031

>ref|NP_862421.1| putative hydroxyproline-rich protein [Micrococcus sp. 28]
           gi|18025409|gb|AAK62517.1| putative hydroxyproline-rich
           protein [Micrococcus sp. 28]
          Length = 406

 Score = 45.1 bits (105), Expect = 7e-04
 Identities = 34/119 (28%), Positives = 49/119 (40%), Gaps = 5/119 (4%)
 Frame = +3

Query: 228 PLALCRALRVTRLLRPTATVCMPMEMPLRTESARRPPRPSGRS*TL*MRAHSAPLARTAS 407
           P+A+    R TR         +    PLR  + R P  P   S T  + A   PL+R++ 
Sbjct: 20  PVAMNSTTRTTRGTMAMLCKSVKPHRPLRPGTVRFPNTPCWGSRTAALTADPPPLSRSSG 79

Query: 408 RSAC----PSHSAWTSPESCSSR-WMLRP*RCPTSSPE*TPAX*WCRRPRRPRARSARC 569
              C    P+H   + P   S+R W  RP   P + P  +    W R  +RP    +RC
Sbjct: 80  PDQCRPSSPAHQQRSGPHGPSARPWPARPDDAPCAVPAWSSKHPWWRPRQRPEHPCSRC 138

>ref|ZP_00094438.1| COG4638: Phenylpropionate dioxygenase and related
           ring-hydroxylating dioxygenases, large terminal subunit
           [Novosphingobium aromaticivorans]
          Length = 715

 Score = 44.3 bits (103), Expect = 0.001
 Identities = 55/171 (32%), Positives = 67/171 (39%), Gaps = 18/171 (10%)
 Frame = +1

Query: 97  PVHLRQLSSPSNRPS*K-----DAAHSAFWSRPSAAAGWPCSPRRAFHHR----SPSVER 249
           P H R+ S+P  R   +      +A S   +RPS   G PC      HH     SP+  R
Sbjct: 57  PCHARRGSAPGGRVCRRRCGVDGSARSPCAARPSRNGGVPCR-----HHGPLGLSPASAR 111

Query: 250 CV*RGSCGQRQRCACQWKCRFARKVPDARPDRQDAHRHCE*GHTLHRWPERPPGRPARHI 429
            + R  CG R     +W  R  R  P     R  A R    G        RP  R  RH 
Sbjct: 112 GL-RPRCGHRWHGGARWYRRADRGRPAGGFPRFPAGRGNRVGLR-----HRPCAR--RHR 163

Query: 430 QHGQVRKAAAPDGCCGRRDVQPQV---------RSELLQXDGAGGHAGRAR 555
             G  R+   P    GRR V+P+V         R    + DG GGHAGR R
Sbjct: 164 AAG--RRCRYPADAAGRRAVRPRVDLRAAIPDARRARRRPDGPGGHAGRNR 212

>ref|ZP_00139981.1| COG2010: Cytochrome c, mono- and diheme variants [Pseudomonas
           aeruginosa UCBPP-PA14]
          Length = 639

 Score = 43.5 bits (101), Expect = 0.002
 Identities = 30/87 (34%), Positives = 35/87 (39%)
 Frame = +1

Query: 316 RKVPDARPDRQDAHRHCE*GHTLHRWPERPPGRPARHIQHGQVRKAAAPDGCCGRRDVQP 495
           R++P  RPD Q   R     H L    +R P  P RH  H Q R+   P+G  G R    
Sbjct: 51  RQLPRPRPDLQGRLRPATAAHDLRLEGQRDPHEPLRHRAHAQDRRGDEPEGHLGER---- 106

Query: 496 QVRSELLQXDGAGGHAGRARGRRGVAS 576
           Q     LQ  G   H    RG  G  S
Sbjct: 107 QELRRPLQYTGLPDHPPARRGDHGQRS 133

>dbj|BAC57318.1| P0700D12.12 [Oryza sativa (japonica cultivar-group)]
          Length = 320

 Score = 40.8 bits (94), Expect = 0.014
 Identities = 46/159 (28%), Positives = 64/159 (39%), Gaps = 18/159 (11%)
 Frame = +2

Query: 158 TLPSGRVHQ-QQRAGPARRAVPF--TTARPLSSVACNAAPAANGN---GVHANGNAA--- 310
           +LP+   HQ + R  P   A+P   +   PL + A      A  +   GV  NG+     
Sbjct: 11  SLPALPSHQPRSRLAPRSLALPGGRSCCGPLRAAAAGGGGGAKDDAQAGVTPNGSPVIKS 70

Query: 311 ---SHGKCPTPAQTVRTLIDIVNEGTLCTV------GPNGLPVGLPVTFSMDKSGKLQLQ 463
              +HG  P PA  VR L++      LCTV         G P G  V FS D  G     
Sbjct: 71  ATFAHG-LPPPALAVRNLMEQARFAHLCTVMSGMHHRRTGYPFGSLVDFSNDSMGHPIFS 129

Query: 464 MDAAAVEMSNLKSGVNSCXLMVQAATQAARAVGAVSLHG 580
           +   A+   NL S    C L+VQ    +  +   V++ G
Sbjct: 130 LSPLAIHTRNLLSDPR-CTLVVQVPGWSGLSNARVTIFG 167

>ref|NP_566678.1| expressed protein [Arabidopsis thaliana] gi|9280221|dbj|BAB01711.1|
           gene_id:MXL8.5~unknown protein [Arabidopsis thaliana]
           gi|17065156|gb|AAL32732.1| Unknown protein [Arabidopsis
           thaliana] gi|27311937|gb|AAO00934.1| Unknown protein
           [Arabidopsis thaliana]
          Length = 317

 Score = 39.7 bits (91), Expect = 0.031
 Identities = 23/74 (31%), Positives = 39/74 (52%), Gaps = 2/74 (2%)
 Frame = +2

Query: 326 PTPAQTVRTLIDIVNEGTLCTVGPNGLPVGLPVTFSMDKSGK--LQLQMDAAAVEMSNLK 499
           P PA+  R+++++ + GTL T+  +G P+G+ V F++DK G   L L    +  + S L 
Sbjct: 59  PFPAEVSRSIMELSSVGTLSTLTHDGWPLGVGVRFAVDKDGTPVLCLNRSVSPDKRSALH 118

Query: 500 SGVNSCXLMVQAAT 541
             +  C L     T
Sbjct: 119 VQLEQCGLRTPQCT 132
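
The Frame values reported above can be cross-checked against the query coordinates. The following is a minimal sketch, not part of the BLAST output: it assumes the usual BLASTX convention that, for forward-strand hits, a 1-based nucleotide start position s lies in frame ((s - 1) mod 3) + 1. The function names and the HSPS list are hypothetical helpers; the numbers in HSPS are copied from the first Query line and the Frame line of each alignment above.

```python
def forward_frame(query_start: int) -> int:
    """Return the forward reading frame (1, 2 or 3) for a 1-based start."""
    if query_start < 1:
        raise ValueError("query_start must be a 1-based position")
    return (query_start - 1) % 3 + 1


def translated_length(query_start: int, query_end: int) -> int:
    """Number of complete codons covered by an inclusive nucleotide span."""
    return (query_end - query_start + 1) // 3


# (hit accession, query start of the first alignment row, reported frame)
# Values are transcribed from the alignments in this record.
HSPS = [
    ("NP_862421.1", 228, 3),
    ("ZP_00094438.1", 97, 1),
    ("ZP_00139981.1", 316, 1),
    ("BAC57318.1", 158, 2),
    ("NP_566678.1", 326, 2),
]


def check_frames(hsps):
    """Compare each reported frame with the frame implied by the start."""
    return [(name, forward_frame(start) == frame) for name, start, frame in hsps]


if __name__ == "__main__":
    for name, ok in check_frames(HSPS):
        print(name, "consistent" if ok else "MISMATCH")
    # First row of the NP_862421.1 alignment spans query positions 228..407.
    print(translated_length(228, 407), "residues in that row")
```

The same span arithmetic explains the row width of the alignments: a row covering 180 query nucleotides corresponds to 60 translated residues.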



EST assemble image


    clone        accession  position
1   LCL098d05_r  AV631698   1    388
2   CM055h10_r   AV390220   6    585




Chlamydomonas reinhardtii
Kazusa DNA Research Institute