KCC001092A_c01

FASTA sequence
>KCC001092A_C01 KCC001092A_c01
AAGGCAAACTGTCAACAGTCGCAAACCCAAGTACCAAATTCTCAAGAAAAATCTGTTGCA
ATTGAAACGCCAGCAAAATTCGCACTTCTCGGCTGACAAAGCAAGCCCTAACCAGCCAAG
CTTCGCGTGCCGAAGCATCCTCGCTCGCTTGTCCAAAACGTCAGAATGACTTCCGAGGTT
CCCACCACCAGCCGGCAAGGCAGCGTGGTGCCCTTCACTCGCATTGAGGGCCCCGACGCC
TGGGTTGCGGCCGACTTCCCGAACCTTGAGAAGGAAATGTTTCACCTGACGCCCGAGCAC
ATCGCCGAGCTTGATGCAGCCGTGGACAAGGTCATTGCAAGCGGCAAGCCCCTGCAGGAG
GTGTCCCTGGCTGATGTCCACCTGCCCACGCTATCGCTGCCGCTGATTGATGTGGGTCAG
CAGGCTCAGCACGGCCGCGGCTGGTCGCTGCTGCGCGGCGTGCCCGTGCAGCGCTACAGC
CGCCAGCAGCAGCTGACGGCCTGGTGGATCCTGGGGCTGCACTGGGGCCGCGCCGTGCCC
CAGAACGCCAAGGGCCACCTGATCGGACACATCAAGGACCTGGGTCGCGACCCCGCTGAC
CCCAACACTCGCCTCTACGCCACCAACGCCGCACAGCCCTGGCACAACGACGGCCCGGCA
GACCTCGTGGGCCTGCTGTGCCTGTCTGACGGCGCTGAGGGCGGCGAGAGCGGCTGGTCG
TCTTCAATCTCCGTGCACAACGAGATCCTGCGGACCGCGCCGCACCTGGCGCGCGTGCTG
GCTGACTCGTGGTTCTTTGACCGCAAGGGCGAGGTGCCTGCGGGCAAGAAGCCCTTCTTC
GAGATCCCCGTGTTCAACTACCACAAGGGCTACCTGTCCGTCAACTACAGCGACAACTAC
TACCACCTCAGCCAGCGCCACGCCGAGGTGCCGCGCCTGGGACCCGACCACCACGCCGCC
ATGGAGCTGTTCAACTCGCTGGCGTGCTCGCAGCAGCTGTCGCTGCGCCACATCCTGCAG
CCGGGGGACGTGCAGCTGCTCAGCAACCACACCTGCCTGCACTACCGCGGCGCGTTCAGG
GACAGCCCCGAGCACACGCGGCACCTGCTGCGGCTGTGGGTGTCGCCGCCCGACGACCGG
CCGCTTGCCGAGGTGTACAGCGAAATCATGGGCCGGCAGTGTGGTGCCGCGGCAAGCGCG
GAGGCATCTTCATCCAGAACGGAGCCGACCACAACCCCATCCCGCTGGAGGCCGAGTAAG
GCCGGAGTACGGAATGACAGAACCCCAGAGGCCGCAGCATGAGGGTATGCCCACGGGGTA
TGCCAGCTAAACTTATTTGCTAACTTATCCAGAGTTTGTGCAACGTTGATTGATTATTTT
GAGGGGCTGGTGCGCAATTGTGCCGCACGCTTGGGCAACCTGCCAGATGGCCAAAATGGT
ATTGAGAGGTGACATTTATTGTTCCGTATGGCCGTTCATCAATCAAGCGGCTGCTTGGTA
TTCATGTAC
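A quick way to sanity-check a record like the one above is to join the sequence lines and count the letters; the BLASTX report for this query lists 1509 letters, so that is the figure to compare against. The following is a minimal sketch, not part of the original record: the helper names `read_fasta` and `gc_fraction` are hypothetical, and the demo input is a toy record rather than the real sequence.

```python
def read_fasta(text):
    """Return {header: sequence} for FASTA-formatted text."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            header = line[1:]          # drop the leading '>'
            records[header] = []
        elif header is not None:
            records[header].append(line)
    # Join the wrapped sequence lines of each record into one string.
    return {h: "".join(parts) for h, parts in records.items()}


def gc_fraction(seq):
    """Fraction of G and C letters in a nucleotide sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0


# Toy input (hypothetical), standing in for the full record above.
demo = read_fasta(">demo example\nACGT\nAC\n")
for name, seq in demo.items():
    print(name, len(seq), gc_fraction(seq))
```

To check the real record, pass the header line and all sequence lines above to `read_fasta` and compare `len(seq)` with the query length given in the BLASTX report.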


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC001092A_C01 KCC001092A_c01
         (1509 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_879728.1| conserved hypothetical protein [Bordetella pert...   214  3e-54
ref|NP_885326.1| conserved hypothetical protein [Bordetella para...   214  4e-54
ref|NP_890089.1| conserved hypothetical protein [Bordetella bron...   214  4e-54
ref|XP_326550.1| predicted protein [Neurospora crassa] gi|183761...   177  3e-43
ref|NP_929451.1| hypothetical protein [Photorhabdus luminescens ...   129  1e-28

>ref|NP_879728.1| conserved hypothetical protein [Bordetella pertussis]
            gi|33571728|emb|CAE41228.1| conserved hypothetical
            protein [Bordetella pertussis]
          Length = 357

 Score =  214 bits (546), Expect = 3e-54
 Identities = 126/306 (41%), Positives = 170/306 (55%), Gaps = 10/306 (3%)
 Frame = +1

Query: 259  PNLEKE----MFHLTPEHIAELDAAVDKVIASGKPLQEVSLADVHLPTLSLPLIDVGQQA 426
            P L+K     + HLT   + +LD AV +  A GK + E+S  D  L  L   L  V  + 
Sbjct: 24   PQLDKHPEYWVHHLTGPELEQLDRAVRRADAGGKDITELSQDDFELGELGQRLQHVKHEV 83

Query: 427  QHGRGWSLLRGVPVQRYSRQQQLTAWWILGLHWGRAVPQNAKGHLIGHIKDLGRDPADPN 606
             HGRG  L+RGVPV++Y+ +Q   A+W LG + G  V QN KGH++GH+ +LG D AD  
Sbjct: 84   LHGRGLYLIRGVPVEQYTMRQSAIAFWALGTNLGLPVSQNGKGHVLGHVANLGLDYADAA 143

Query: 607  TRLYATNAAQPWHNDGPADLVGLLCLSDGAEGGESGWSSSISVHNEILRTAPHLARVLAD 786
             R Y T+   P+H D  +D+VGLLC+     GG S   SS +V NE+    P  AR L D
Sbjct: 144  VRGYQTSNRLPYHTDS-SDIVGLLCVRPAKAGGLSSVVSSTTVWNELAARHPEHARTLLD 202

Query: 787  SWFFDRKGEVPAGKKPFFEIPVFNYHKGYLSVNYSDNYYHLSQRHAEVPRLGPDHHAAME 966
            S+   R GE+P G+KP+   PVF  ++G +  NY  +    +Q    VPRL    + A++
Sbjct: 203  SFHRTRWGEIPEGQKPYSSSPVFAPYQGRMYANYVRSAIRKAQALPSVPRLSAQQNEALD 262

Query: 967  LFNSLACSQQLSLRHILQPGDVQLLSNHTCLHYRGAFRDSP--EHTRHLLRLWVS----P 1128
              ++L C   L L    +PGDVQLLSN T  H R A+ D P  E  RHLLRLW++    P
Sbjct: 263  CLDALTCDPALYLDMDFKPGDVQLLSNFTIFHSRTAYEDWPETERRRHLLRLWLACEGGP 322

Query: 1129 PDDRPL 1146
            P   PL
Sbjct: 323  PIPEPL 328

>ref|NP_885326.1| conserved hypothetical protein [Bordetella parapertussis]
            gi|33574111|emb|CAE38438.1| conserved hypothetical
            protein [Bordetella parapertussis]
          Length = 357

 Score =  214 bits (544), Expect = 4e-54
 Identities = 126/306 (41%), Positives = 169/306 (55%), Gaps = 10/306 (3%)
 Frame = +1

Query: 259  PNLEKE----MFHLTPEHIAELDAAVDKVIASGKPLQEVSLADVHLPTLSLPLIDVGQQA 426
            P L+K     + HLT   + +LD AV    A GK + E+S  D  L  L   L  V  + 
Sbjct: 24   PQLDKHPEYWVHHLTGPELEQLDRAVRHADAGGKDITELSQDDFELGELGQRLQQVKHEV 83

Query: 427  QHGRGWSLLRGVPVQRYSRQQQLTAWWILGLHWGRAVPQNAKGHLIGHIKDLGRDPADPN 606
             HGRG  L+RGVPV++Y+ +Q   A+W LG + G  V QN KGH++GH+ +LG D AD  
Sbjct: 84   LHGRGLYLIRGVPVEQYTMRQSAIAFWALGTNLGLPVSQNGKGHVLGHVANLGLDYADAA 143

Query: 607  TRLYATNAAQPWHNDGPADLVGLLCLSDGAEGGESGWSSSISVHNEILRTAPHLARVLAD 786
             R Y T+   P+H D  +D+VGLLC+     GG S   SS +V NE+    P  AR L D
Sbjct: 144  VRGYQTSNRLPYHTDS-SDIVGLLCVRPAKAGGLSSVVSSTTVWNELAARHPEHARTLLD 202

Query: 787  SWFFDRKGEVPAGKKPFFEIPVFNYHKGYLSVNYSDNYYHLSQRHAEVPRLGPDHHAAME 966
            S+   R GE+P G+KP+   PVF  ++G +  NY  +    +Q    VPRL    + A++
Sbjct: 203  SFHRTRWGEIPEGQKPYSSSPVFAPYQGRMYANYVRSAIRKAQALPSVPRLSAQQNEALD 262

Query: 967  LFNSLACSQQLSLRHILQPGDVQLLSNHTCLHYRGAFRDSP--EHTRHLLRLWVS----P 1128
              ++L C   L L    +PGDVQLLSN T  H R A+ D P  E  RHLLRLW++    P
Sbjct: 263  CLDALTCDPALYLDMDFKPGDVQLLSNFTIFHSRTAYEDWPETERRRHLLRLWLACEGGP 322

Query: 1129 PDDRPL 1146
            P   PL
Sbjct: 323  PIPEPL 328

>ref|NP_890089.1| conserved hypothetical protein [Bordetella bronchiseptica]
            gi|33576968|emb|CAE34048.1| conserved hypothetical
            protein [Bordetella bronchiseptica]
          Length = 357

 Score =  214 bits (544), Expect = 4e-54
 Identities = 126/306 (41%), Positives = 169/306 (55%), Gaps = 10/306 (3%)
 Frame = +1

Query: 259  PNLEKE----MFHLTPEHIAELDAAVDKVIASGKPLQEVSLADVHLPTLSLPLIDVGQQA 426
            P L+K     + HLT   + +LD AV    A GK + E+S  D  L  L   L  V  + 
Sbjct: 24   PQLDKHPEYWVHHLTGPELEQLDRAVRHADAGGKDITELSQDDFELGELGQRLQQVKHEV 83

Query: 427  QHGRGWSLLRGVPVQRYSRQQQLTAWWILGLHWGRAVPQNAKGHLIGHIKDLGRDPADPN 606
             HGRG  L+RGVPV++Y+ +Q   A+W LG + G  V QN KGH++GH+ +LG D AD  
Sbjct: 84   LHGRGLYLIRGVPVEQYTMRQSAIAFWALGTNLGLPVSQNGKGHVLGHVANLGLDYADAA 143

Query: 607  TRLYATNAAQPWHNDGPADLVGLLCLSDGAEGGESGWSSSISVHNEILRTAPHLARVLAD 786
             R Y T+   P+H D  +D+VGLLC+     GG S   SS +V NE+    P  AR L D
Sbjct: 144  VRGYQTSNRLPYHTDS-SDIVGLLCVRPAKAGGLSSVVSSTTVWNELTARHPEHARTLLD 202

Query: 787  SWFFDRKGEVPAGKKPFFEIPVFNYHKGYLSVNYSDNYYHLSQRHAEVPRLGPDHHAAME 966
            S+   R GE+P G+KP+   PVF  ++G +  NY  +    +Q    VPRL    + A++
Sbjct: 203  SFHRTRWGEIPEGQKPYSSSPVFAPYQGRMYANYVRSAIRKAQALPSVPRLSAQQNEALD 262

Query: 967  LFNSLACSQQLSLRHILQPGDVQLLSNHTCLHYRGAFRDSP--EHTRHLLRLWVS----P 1128
              ++L C   L L    +PGDVQLLSN T  H R A+ D P  E  RHLLRLW++    P
Sbjct: 263  CLDALTCDPALYLDMDFKPGDVQLLSNFTIFHSRTAYEDWPETERRRHLLRLWLACEGGP 322

Query: 1129 PDDRPL 1146
            P   PL
Sbjct: 323  PIPEPL 328

>ref|XP_326550.1| predicted protein [Neurospora crassa] gi|18376172|emb|CAD21289.1|
            conserved hypothetical protein [Neurospora crassa]
            gi|28923217|gb|EAA32433.1| predicted protein [Neurospora
            crassa]
          Length = 409

 Score =  177 bits (450), Expect = 3e-43
 Identities = 114/315 (36%), Positives = 164/315 (51%), Gaps = 11/315 (3%)
 Frame = +1

Query: 223  IEGPDAWVAADFPNLEKEMFH-LTPEHIAELDAAVDKVIASGKPLQEVSLADVHLPTLSL 399
            I GP  W   DF N  +   H  T E + EL    D  IASG PL  +S  +  LP L  
Sbjct: 64   IAGPTVWKREDFVNNPERWVHPFTDEEVQELSDTADAFIASGTPLTGISQENFPLPKLGT 123

Query: 400  PLIDVGQQAQHGRGWSLLRGVPVQRYSRQQQLTAWWILGLHWGRAVPQNAKGHLIGHIKD 579
             L ++     +G+G+ L +  P   +  ++   A+  LG + G  + QN +GH++GH+KD
Sbjct: 124  VLTNLRDDLLNGKGFILFKRFPADVWGAEKNAVAYMGLGTYLGYFLSQNGRGHVLGHVKD 183

Query: 580  LGRDPADPNT-RLYATNAAQPWHNDGPADLVGLLCLSDGAEGGESGWSSSISVHNEILRT 756
            +G DP   +T R+Y T A Q +H D   D+VGLLC+    EGGES   S   V N + + 
Sbjct: 184  VGDDPTQIHTVRIYRTTARQFFHAD-DGDIVGLLCVHRAQEGGESDIVSVHHVWNTLQQE 242

Query: 757  APHLARVLADS-WFFDRKGEVPAGKKPFFEIPVF---NYHKGYLSVNYSDNYYHLSQRHA 924
             P +A +L    W+FDRKGEV  G++ +   PV    N  KG L   +   Y     R +
Sbjct: 243  HPDVAELLTKPIWYFDRKGEVSEGQQEWVRQPVVYLENGGKGRLYCKWDPYYVKSLTRFS 302

Query: 925  E---VPRLGPDHHAAMELFNSLACSQQLSLRHILQPGDVQLLSNHTCLHYRGAFRD--SP 1089
            +   +P L  +   AM++       Q+L+L  IL+ GD+Q LSN   LH R A++D   P
Sbjct: 303  DKGLIPALSEEQLRAMQILEETC--QRLALHMILEVGDIQFLSNAHLLHARTAYKDFAPP 360

Query: 1090 EHTRHLLRLWVSPPD 1134
               RHLLRLW++ P+
Sbjct: 361  APRRHLLRLWLATPE 375

>ref|NP_929451.1| hypothetical protein [Photorhabdus luminescens subsp. laumondii TTO1]
            gi|36785537|emb|CAE14486.1| unnamed protein product
            [Photorhabdus luminescens subsp. laumondii TTO1]
          Length = 335

 Score =  129 bits (324), Expect = 1e-28
 Identities = 84/311 (27%), Positives = 157/311 (50%), Gaps = 4/311 (1%)
 Frame = +1

Query: 226  EGPDAWVAADFPNLEKEMFHLTPEHIAELDAAVDKVIASGKPLQEVSLADVHLPTLSLPL 405
            + P  W +A   + +  +  ++ E I      +  +    +P +  + +D     +++  
Sbjct: 10   DDPAVWCSAQLESKKDVLLPVSDEQIEAFRHHLSAM--EDRPSEAFNASDFSFEEITILQ 67

Query: 406  IDVGQQAQHGRGWSLLRGVPVQRYSRQQQLTAWWILGLHWGRAVPQNAKGHLIGHIKDLG 585
              + Q+   GRG  ++ G+P + ++       +W +G   GR V QN++GH IGH+++  
Sbjct: 68   ERIHQRLTEGRGVVVVSGIPREMFTDSILSHLFWGIGTGLGRPVVQNSQGHRIGHVRN-- 125

Query: 586  RDPADPNTRLYATNAAQPWHNDGPADLVGLLCLSDGAEGGESGWSSSISVHNEILRTAPH 765
             +  +PN R Y +N    +H+D   ++VGL+CL + A GG +   S ++++N++LR  P 
Sbjct: 126  -EKNNPNNRGYMSNRELGFHSDA-FEIVGLMCLREAASGGLTQIVSGLAIYNQMLREKPE 183

Query: 766  LARVLADSWFFDRKGEVPAGKKPF--FEIPVFNYHKGYLSVNYSDNYYHLSQRHAEVPRL 939
            L   L + + +    E  + K P+  ++IP+F+   G +S      Y   +   A++  L
Sbjct: 184  LLDALFEGYHY-ATAERSSSKLPYTSYKIPIFSKMSGRVSSMCLGAYMRAA---AKLQGL 239

Query: 940  G-PDH-HAAMELFNSLACSQQLSLRHILQPGDVQLLSNHTCLHYRGAFRDSPEHTRHLLR 1113
              PD   A +  F  +    +  L  +L+PG++  L+N+T LH R  F+D   + RHLLR
Sbjct: 240  ALPDALDAGLHAFYEICSRPEFRLEFMLEPGEILFLNNYTTLHSRTEFQDDALNQRHLLR 299

Query: 1114 LWVSPPDDRPL 1146
            LW+   D RP+
Sbjct: 300  LWIELSDGRPV 310



EST assembly


no.  clone        accession  start   end
  1  LC016b05_r   AV620006       1   503
  2  LCL015g08_r  AV626807     122   593
  3  MX057h07_r   BP088336     124   486
  4  LC090d12_r   AV625288     142   633
  5  MX254h09_r   BP092495     142   631
  6  MX232c02_r   BP090995     170   485
  7  MX249d11_r   BP092076     189   632
  8  HCL054e03_r  AV642598     191   642
  9  LC036f06_r   AV621475     390   928
 10  MX005c10_r   BP086264     402   645
 11  HC040b08_r   AV634962     637  1158
 12  CM016h01_r   AV387445     728  1338
 13  CM028a10_r   AV387726     728  1320
 14  HCL021d05_r  AV640728    1109  1605
14 HCL021d05_r AV640728 1109 1605




Chlamydomonas reinhardtii
Kazusa DNA Research Institute