KCC001871A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KCC001871A_C01 KCC001871A_c01
cacTTACTCACCTCGAACTCCTCACAAGGGCTGCCAAGCTACCGACGACTGTTTCGGGAC
TACAATTAGGGTTTGCTCGATTGCGCGCTGCGCAGCTTAACCCGCCGCTACCAGGCTGCT
AAGCCTCAGTGACGTCGACCGTGACAAAATATGATGAGCCTCAGCGCAAGAGCGGCTTTT
CGCGCGCCCCTCAGCCATCATCGGCCGCGCCCCCAAGCCTACCCCCGGGCTGTTGTCACG
CCGTGCGCAAGGATGCATATCCCTGCGGATTCTTTTTCGGGGGCTTCGCCAGAGCGTAAA
GCTGCTGTAGCCCTGCGGTCGCTGTTCACGTTTGTTGCAGCTCGGGTGGTGCTGGAGCAG
CTGCAGGGCCCCGGCGGCCCTGAGACCACCTACAACCAGCAGGCATACCTAGACCTGATG
GACTTCCTGGGCACGCCCATGAAGGGCGATGGCGGCGACGAGTGGATGGCCGCTGTCATG
AGGAAGAACCACGCTTTGGCCCTGCGCCTGATGGAGGTGCGCGAGGCCTACCTGGACGAG
TTTGAGTGGGGAAAGACCATGGAGATGGCCAGCCGCGAGACGCGCGAGTGCCAACACACG
CCTCATGCGCGCGGCGGCCATGGCCAGCCTGCAGGCGTCTCTGACGGAGCCGGTGGGCGG
CGGTGCCGGCGCCGGCTGCATGTCTATGGAGGACCTGGACGGCCCCGGCAAGGGTGCCGC
GTGATGGGCGGCAACGGTGCTGCGAGCCCACGTGTCCGTAGGGACAGACCACGCGCATGC
GTGTGGCATGCGGCT


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC001871A_C01 KCC001871A_c01
         (795 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_567263.1| expressed protein [Arabidopsis thaliana] gi|150...    74  3e-12
dbj|BAC98560.1| unknown protein [Oryza sativa (japonica cultivar...    68  1e-10
ref|NP_767161.1| blr0521 [Bradyrhizobium japonicum] gi|27348769|...    43  8e-06
ref|NP_629843.1| conserved hypothetical protein SC3C3.03c [Strep...    37  8e-05
ref|ZP_00092697.1| COG1020: Non-ribosomal peptide synthetase mod...    46  6e-04

>ref|NP_567263.1| expressed protein [Arabidopsis thaliana] gi|15027851|gb|AAK76456.1|
           unknown protein [Arabidopsis thaliana]
           gi|23296672|gb|AAN13142.1| unknown protein [Arabidopsis
           thaliana]
          Length = 174

 Score = 73.9 bits (180), Expect = 3e-12
 Identities = 46/111 (41%), Positives = 67/111 (59%), Gaps = 4/111 (3%)
 Frame = +1

Query: 250 RMHIPADSFSGASPERKAAVALRSLFTFVAARVVLEQLQGPGGPETTYNQQAYLDLMDFL 429
           +M++P   F  ASPE KAA  L   FT+VA R+V  QL+       +YN +AY++L +FL
Sbjct: 46  KMYVPG--FGEASPEAKAAKHLHDFFTYVAVRIVSAQLE-------SYNPEAYMELREFL 96

Query: 430 GTPMKGDGGDEWMAAVMRKNHA---LALRLMEVREAYL-DEFEWGKTMEMA 570
            T    D GD++ A +MR++     LALR++EVR AY  ++FEW     +A
Sbjct: 97  DTNSVSD-GDKFCATLMRRSSRHMNLALRILEVRSAYCKNDFEWDNMKRLA 146

>dbj|BAC98560.1| unknown protein [Oryza sativa (japonica cultivar-group)]
          Length = 186

 Score = 68.2 bits (165), Expect = 1e-10
 Identities = 42/119 (35%), Positives = 69/119 (57%), Gaps = 4/119 (3%)
 Frame = +1

Query: 244 CARMHIPADSFSGASPERKAAVALRSLFTFVAARVVLEQLQGPGGPETTYNQQAYLDLMD 423
           C++M++P   F   SPE+KAA  L+  F ++A RVVL QL+       +YN++AY +LMD
Sbjct: 58  CSKMYVPG--FGEGSPEKKAARNLQHFFNYIAVRVVLTQLE-------SYNREAYGELMD 108

Query: 424 FLGTPMKGDGGDEWMAAVMR---KNHALALRLMEVREAYL-DEFEWGKTMEMASRETRE 588
           F+      D  D +   ++R   ++  LA+R++EVR AY+  +FEW     ++ +   E
Sbjct: 109 FVNRNSLND-ADTFCKKLIRESPRHKQLAMRILEVRSAYVKHDFEWDNLKRLSFKMVDE 166

>ref|NP_767161.1| blr0521 [Bradyrhizobium japonicum] gi|27348769|dbj|BAC45786.1|
           blr0521 [Bradyrhizobium japonicum USDA 110]
          Length = 745

 Score = 42.7 bits (99), Expect(2) = 8e-06
 Identities = 56/191 (29%), Positives = 66/191 (34%)
 Frame = -3

Query: 601 ACVGTRASRGWPSPWSFPTQTRPGRPRAPPSGAGPKRGSSS*QRPSTRRRHRPSWACPGS 422
           A   T A    P+P S P     GRP APP G  P    ++   P+      P+     +
Sbjct: 227 APTATPAPTATPAPGSTPGAPPAGRPGAPPPGVRPGSPPAAGSPPAPGATPAPT--TTPA 284

Query: 421 PSGLGMPAGCRWSQGRRGPAAAPAPPELQQT*TATAGLQQLYALAKPPKKNPQGYASLRT 242
           P G   P       GR GPA+ PAP       TAT       AL  PP +   G     T
Sbjct: 285 PGGTATP-----PSGRPGPASTPAPGAATPAPTATPAPGG--ALTPPPGRPGAG----PT 333

Query: 241 A*QQPGGRLGGAADDG*GAREKPLLR*GSSYFVTVDVTEA*QPGSGGLSCAARNRANPNC 62
              Q G    GA   G                     T A  P +GGL       A    
Sbjct: 334 PGPQGGTPPAGAPAAG---------------------TPAAPPQAGGLPARPAAPAGA-A 371

Query: 61  SPETVVGSLAA 29
           +P TV GS AA
Sbjct: 372 APSTVPGSAAA 382

 Score = 32.7 bits (73), Expect(2) = 0.061
 Identities = 37/114 (32%), Positives = 45/114 (39%), Gaps = 5/114 (4%)
 Frame = -3

Query: 589 TRASRGWPSPWSFPTQTRPGRPRAPPSGAGPKRGSSS*QRPSTRRRHRPSWACPGSPSGL 410
           T A    P+P    T   PGRP A P+  GP+ G+     P+      P  A  G P+  
Sbjct: 308 TPAPTATPAPGGALTPP-PGRPGAGPT-PGPQGGTPPAGAPAAGTPAAPPQA-GGLPARP 364

Query: 409 GMPAGCRWSQGRRGPAAAPAPPELQQ--T*TATAGLQ---QLYALAKPPKKNPQ 263
             PAG        G AAA  PP   Q    T T   Q    + A   PP + PQ
Sbjct: 365 AAPAGAAAPSTVPGSAAATPPPNRAQFAPPTVTPAFQAAPTVVAPLPPPPRPPQ 418

 Score = 29.3 bits (64), Expect(2) = 8e-06
 Identities = 15/41 (36%), Positives = 16/41 (38%)
 Frame = -2

Query: 737 PLPPITRHPCRGRPGPP*TCSRRRHRRPPAPSETPAGWPWP 615
           P PP    P    P PP   +R     PP P   PA  P P
Sbjct: 151 PAPPPAAAPQHAPPPPPPPAARPTPTPPPPPPAGPAARPTP 191

 Score = 28.9 bits (63), Expect(2) = 0.17
 Identities = 35/145 (24%), Positives = 41/145 (28%), Gaps = 13/145 (8%)
 Frame = -3

Query: 568 PSPWSFPTQTRPGRPRAPPSGAGPKRGSSS*QRPSTRRRHRPSWACPGSPSGLGMPAGCR 389
           P P + P       P APP  A P+        P+ R    P    P  P+    PA   
Sbjct: 136 PPPPAPPAARPAPTPPAPPPAAAPQHAPPPPPPPAARPTPTPPPPPPAGPAARPTPAPTA 195

Query: 388 W-------------SQGRRGPAAAPAPPELQQT*TATAGLQQLYALAKPPKKNPQGYASL 248
                           G   PAA PAP       TAT       A    P   P G    
Sbjct: 196 TPTPVAPPPAAPTARPGSPAPAATPAPTPTPAP-TATPAPTATPAPGSTPGAPPAGRPGA 254

Query: 247 RTA*QQPGGRLGGAADDG*GAREKP 173
                +PG      +    GA   P
Sbjct: 255 PPPGVRPGSPPAAGSPPAPGATPAP 279

 Score = 28.1 bits (61), Expect(2) = 0.17
 Identities = 17/48 (35%), Positives = 18/48 (37%), Gaps = 4/48 (8%)
 Frame = -2

Query: 743 AAPLPPITRHPCRGRP----GPP*TCSRRRHRRPPAPSETPAGWPWPP 612
           AAP  P    P    P     PP   +  R   PP P   PA  P PP
Sbjct: 62  AAPARPAAPPPAAAPPHPPAAPPPAAAPPRPAAPPPPPPPPAARPAPP 109

 Score = 25.8 bits (55), Expect(2) = 0.061
 Identities = 16/36 (44%), Positives = 18/36 (49%)
 Frame = -2

Query: 737 PLPPITRHPCRGRPGPP*TCSRRRHRRPPAPSETPA 630
           P P  T  P  GRPGP  T +       PAP+ TPA
Sbjct: 283 PAPGGTATPPSGRPGPASTPA--PGAATPAPTATPA 316

>ref|NP_629843.1| conserved hypothetical protein SC3C3.03c [Streptomyces coelicolor
           A3(2)] gi|3413391|emb|CAA20252.1| conserved hypothetical
           protein SC3C3.03c [Streptomyces coelicolor A3(2)]
          Length = 1083

 Score = 36.6 bits (83), Expect(2) = 8e-05
 Identities = 42/144 (29%), Positives = 51/144 (35%), Gaps = 21/144 (14%)
 Frame = -3

Query: 574 GWPSPWSFPTQTRPGRPRAPPSGAG----------------PKRGSSS*QRPSTRRRHRP 443
           G+P P +   Q +P  P  PP G G                P  G    Q P  +   + 
Sbjct: 493 GFPQPGAQAPQPQPHSPAQPPGGYGFPQAPQAPHGPSPQQQPYPGVPQQQAPQAQGGGQA 552

Query: 442 SWACPGSPSGLGMP--AGCRWSQGRRG--PAAAPAPPELQQT*TATAGLQQLYALAKPPK 275
             A PG P+  G P   G     G+ G  P+A  A P   Q   A     Q  A    P+
Sbjct: 553 PPAQPGQPAQPGQPMQPGQPGQSGQPGQAPSAPQAAPHPPQAPPAQQPEYQPQAQQPQPQ 612

Query: 274 KNPQGYASLRTA*QQPGG-RLGGA 206
             PQ Y       QQP   RLG A
Sbjct: 613 PQPQPY-------QQPADPRLGAA 629

 Score = 32.0 bits (71), Expect(2) = 8e-05
 Identities = 15/44 (34%), Positives = 19/44 (43%)
 Frame = -2

Query: 743 AAPLPPITRHPCRGRPGPP*TCSRRRHRRPPAPSETPAGWPWPP 612
           A P PP         P  P   +++   +P AP   PAGW  PP
Sbjct: 443 AQPQPPTQPQTAPAPPEQPGVAAQQPPFQPQAPQPAPAGWDAPP 486

>ref|ZP_00092697.1| COG1020: Non-ribosomal peptide synthetase modules and related
            proteins [Azotobacter vinelandii]
          Length = 4332

 Score = 46.2 bits (108), Expect = 6e-04
 Identities = 64/214 (29%), Positives = 74/214 (33%), Gaps = 26/214 (12%)
 Frame = -3

Query: 736  RCRPSRGTLAGAVQVLHRHAAGAGTAAHRLRQRRLQAGHGRRA-HEACVGTRASRGW--- 569
            R R  R     A +   R  AG      R R RRL+A   RRA   A +G RA R     
Sbjct: 1073 RLRTVRARQRRASRRAFRAPAGTDRREPRARHRRLRAARCRRAPATARLGARAGRTGRRP 1132

Query: 568  -----------------PSPW-SFPTQTRPGRPR----APPSGAGPKRGSSS*QRPSTRR 455
                             P  W   P   RPG PR    APP+ AG + G     RP+ R 
Sbjct: 1133 AARIADRPGTGDAAGHRPGQWRGDPRLHRPGAPRQPPGAPPARAGRRPGGEG--RPAGRT 1190

Query: 454  RHRPSWACPGSPSGLGMPAGCRWSQGRRGPAAAPAPPELQQT*TATAGLQQLYALAKPPK 275
            R R      G  +G G       + GRR PA  P             GL       +P  
Sbjct: 1191 RRRTD----GGSAGGGQGRRRLRADGRRLPARTP-------------GLDDRRQRPEPAA 1233

Query: 274  KNPQGYASLRTA*QQPGGRLGGAADDG*GAREKP 173
              P G     +A     GR G AA  G G R  P
Sbjct: 1234 GPPAGAGRAGSA-----GRTGDAASGGAGRRGLP 1262

 Score = 38.1 bits (87), Expect = 0.16
 Identities = 64/213 (30%), Positives = 79/213 (37%), Gaps = 38/213 (17%)
 Frame = -3

Query: 673  GAGTAAHRLR------------QRRLQAGHGRRA-----HEACVGTRASRGWPSPWS--- 554
            G  TAA R R            +R L+AG GRRA         +  R   G P P     
Sbjct: 1468 GGRTAARRQRPGARLSGASGADRRALRAGRGRRAAVPQRRPGALAGRRGAGIPRPRRRAG 1527

Query: 553  ----FPTQTRPGRPRAPP----------SGAGPKRGSSS*QRPSTRRRHRPS--WACPGS 422
                FP +T  G PRAP           +GA  +RG ++ + P  RRR R S     P +
Sbjct: 1528 ESARFPHRTG-GNPRAPAVAAGGAPGRGAGARGRRGCATGRLPDQRRRTRRSGPGRTPQA 1586

Query: 421  PSGLGMPAGCRWSQGRRGPAAAPAPPELQQT*TATAGLQQLYALAKPPKKNPQGYASLRT 242
              G   PAG   +   R P  A A  + Q      AG        +          S R 
Sbjct: 1587 RPG-RQPAGVHGAGAVRPPRRAAADADRQAGSQGAAGAGLAGGRIR---------RSARR 1636

Query: 241  A*QQPGGRLGGAADDG*GARE--KPLLR*GSSY 149
            A   PGG L G A  G  AR   + LLR G  +
Sbjct: 1637 ARTAPGGDLAGGA--GLAARRPGRRLLRPGRPF 1667

 Score = 33.5 bits (75), Expect = 4.0
 Identities = 47/165 (28%), Positives = 51/165 (30%), Gaps = 16/165 (9%)
 Frame = +3

Query: 348  GAGAAAGPRRP*DHLQPAGIP--RPDGLPGHAHEGRWRRRVDG------------RCHEE 485
            GA  A   RRP    +PAG    R DG      +GR R R DG            R    
Sbjct: 1170 GAPPARAGRRPGGEGRPAGRTRRRTDGGSAGGGQGRRRLRADGRRLPARTPGLDDRRQRP 1229

Query: 486  EPRFGPAPDGGARGLPGRV*VGKDHGDGQPRDARVPTHAS--CARRPWPACRRL*RSRWA 659
            EP  GP    G  G  GR     D   G      +P HA+   A R  P  R L      
Sbjct: 1230 EPAAGPPAGAGRAGSAGRT---GDAASGGAGRRGLPGHAARPGAGRRQPGLRDLHLRLHR 1286

Query: 660  AVPAPAACLWRTWTAPARVPRDGRQRCCEPTCP*GQTTRMRVACG 794
                     WR    PA      R  C       G     RV  G
Sbjct: 1287 PAQGRRRQPWRAERTPALDAPRVRPGCFRRAAAEGAARLRRVGLG 1331



EST assemble image


clone accession position
1 LC034g07_r AV621347 1 583
2 HC013d04_r AV632846 455 872




Chlamydomonas reinhardtii
Kazusa DNA Research Institute