KCC001598A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KCC001598A_C01 KCC001598A_c01
caaccacggtccccacgagactaacgctaatgattgtcaataatcttgattataacttaa
tcggtcgcaacctgaaacgtGCAGAGTCACTACAGTTAAAGCGGTACCCAATTACCCAAG
TAAATAGTTGGTTGGATCGGGCCTGGGGGCCTGCGGCACCATTATTCGCATGTGTGTGGA
TGCGATTGGTCACATGGGAGTGCTTAAACGCGCGGGGCGCTCTCACGTTTGAGGCGGCGC
CGGGCTGTGTCTCGTGGACGCCCGCGCGATGTTGAGCGGGGCCACGGCGCCCGCTGCGCA
TCATCTGAGTACGCCCTTCTGCTTCATGCGTGGGTGGGTGGGCGGGGTTCCCTGCCGGGG
TTGTGAGCCGTGCCGAATGCACACGATGCGTCAAGGGGCTGCAATCTGAGCAGCACAGCA
TACAGGGGTGGAGTTGAACAGCTGTACTCGACAGAAAGCGCTGTTGCGCCAGGCATGCCC
ACACGGCGAGGTAGGCTGCGGGCGGGTTTGTAGGTCCGCACGCCGGGGCGCCGATGTTGA
CACTGCAGCAGCGCTGTGTTGCTAGCACAGGCTCTAGTCGAGCGTATACGTACACTATGC
TCATCCCCTACATTATTCTTGGGGGTCGGTGGTGGTGCAGCCGAGTGCCAACGGAATATG
GAGCTGCCAGTGGGAAGATCGCTATCGCGGCAAACGCGTGTTGCGGGGACAGATGTGGCA
CCCGGGTGGGGAAGGATGCCTGTGCGGCTGATGAGGGGTCGTGATTGTGTGGGTGTCCCC
ATCTTGTCTTTTTGTGATGTGGTGTCTCTGGTAGGCGGCTAGGGTAAGGTGTGAACCGGG
TTTTCATGGTGTGAATCGGnCAACAGTGCACCAGGCAGACGAGTAAG


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC001598A_C01 KCC001598A_c01
         (887 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||T43481 probable mucin DKFZp434C196.1 - human (fragment) gi|...    40  0.040
ref|NP_276495.1| conserved protein (contains ferredoxin domain) ...    40  0.052
dbj|BAA91361.1| unnamed protein product [Homo sapiens] gi|126529...    39  0.089
ref|XP_318717.1| ENSANGP00000004655 [Anopheles gambiae] gi|30174...    39  0.15
ref|NP_490918.2| putative protein family member of eukaryotic or...    38  0.26

>pir||T43481 probable mucin DKFZp434C196.1 - human (fragment)
           gi|6599134|emb|CAB63715.1| hypothetical protein [Homo
           sapiens]
          Length = 580

 Score = 40.4 bits (93), Expect = 0.040
 Identities = 62/237 (26%), Positives = 83/237 (34%), Gaps = 8/237 (3%)
 Frame = -1

Query: 881 RLPGAL-LXDSHHENPVHTLP*PPTRDTTSQKDKMGTP-----TQSRPLISRTGILPHPG 720
           RLP A  +   H  +P+ T P      T S     GTP     T + P  S TG  P   
Sbjct: 107 RLPRASPMGSPHRASPMRTPPRASPTGTPSTASPTGTPSSASPTGTPPRASPTGTPPRAW 166

Query: 719 ATSVPATRVCRDSDLPTGSSIFRWHSAAPPPTPKNNVGDEHSVRIRSTRACASNTALLQC 540
           AT  P+T     +  P+ +S+ RW    PP           S R+ S RA  + T     
Sbjct: 167 ATRSPSTASL--TRTPSRASLTRW----PPRASPTRTPPRESPRM-SHRASPTRTPPRAS 219

Query: 539 QHRRPGVRTYKPARSLPRRVGMPGATALSVEYSCSTPPLYAVLLRLQPLDASCAFGTAHN 360
             RR       P R+ P R     +   S   S +  P  A   R  P  +        +
Sbjct: 220 PTRR-------PPRASPTRTPPRESLRTSHRASPTRMPPRASPTRRPPRASPTGSPPRAS 272

Query: 359 PGREPRPPTHA*SRRAYSDDAQRAPWPRSTSRGRPRDTARRR--LKRESAPRV*ALP 195
           P   PR       R + +    RA   R+ S   P  T  R   +K ES   +   P
Sbjct: 273 PMTPPRASPRTPPRASPTTTPSRASLTRTPSWASPTTTPSRASLMKMESTVSITRTP 329

>ref|NP_276495.1| conserved protein (contains ferredoxin domain) [Methanothermobacter
           thermautotrophicus] gi|7446609|pir||D69050 conserved
           hypothetical protein MTH1379 - Methanobacterium
           thermoautotrophicum (strain Delta H)
           gi|2622489|gb|AAB85856.1| conserved protein (contains
           ferredoxin domain) [Methanothermobacter
           thermautotrophicus str. Delta H]
          Length = 236

 Score = 40.0 bits (92), Expect = 0.052
 Identities = 28/70 (40%), Positives = 32/70 (45%), Gaps = 2/70 (2%)
 Frame = +2

Query: 353 CRGCEPCRMHTMRQGAAI*AAQHTGVELN--SCTRQKALLRQACPHGEVGCGRVCRSARR 526
           CRGCEPC    +   A    A   GVE+   SC R     R ACPHG V  GR+     R
Sbjct: 148 CRGCEPC----LAAAACPEDAIVPGVEIRLLSC-RGCGACRTACPHGAVSGGRIITIHMR 202

Query: 527 GADVDTAAAL 556
             D+   A L
Sbjct: 203 EVDIRNTARL 212

>dbj|BAA91361.1| unnamed protein product [Homo sapiens] gi|12652975|gb|AAH00248.1|
           MGC25062 protein [Homo sapiens]
          Length = 147

 Score = 39.3 bits (90), Expect = 0.089
 Identities = 31/105 (29%), Positives = 40/105 (37%), Gaps = 3/105 (2%)
 Frame = -2

Query: 481 WACLAQQRFLSSTAVQLHPCMLCCSDCSPLTHRVHSARLTTPAGNPAHPPTHEAEGRTQM 302
           W C            +  PCMLC   C PL  R    R   P G  AH  TH     T+ 
Sbjct: 42  WGCQGSTLCSLGRGKEAPPCMLCRGPCRPLCLR---GRRRGPLGKCAHTHTH-THTHTRT 97

Query: 301 MRSGRRGPAQH-RAGVH--ETQPGAASNVRAPRAFKHSHVTNRIH 176
               R     H  AGV     +PG A+ +  P ++    ++ RIH
Sbjct: 98  CTHARMHTHTHICAGVQPKSVEPGLAAQLGLPHSWSPQGMSLRIH 142

>ref|XP_318717.1| ENSANGP00000004655 [Anopheles gambiae] gi|30174516|gb|EAA13969.2|
            ENSANGP00000004655 [Anopheles gambiae str. PEST]
          Length = 3150

 Score = 38.5 bits (88), Expect = 0.15
 Identities = 52/241 (21%), Positives = 78/241 (31%), Gaps = 13/241 (5%)
 Frame = -1

Query: 830  TLP*PPTRDTTSQKDKMGTPTQSRPLISRTGILPHPGATSVPATRVCRDSDLPTGSSIFR 651
            T P  PT   T+    M + +   P  +       PG T    TR       PT S++  
Sbjct: 2782 TTPTRPTPTDTTMSSSMSSASTPEPSTT-------PGTTRTTPTR-----PTPTDSTMSS 2829

Query: 650  WHSAAPPPTPKNNVGDEHSVRIRSTRACASNTALLQCQHRRPGVRTYKPARSLPRRVGMP 471
              S+A  P P    G   +   R T   ++ ++ +      PG     P R  P    M 
Sbjct: 2830 SMSSASTPEPSTTPGTTRTTPTRPTPTDSTMSSSMSSVSTTPGTTRTTPTRPTPTDSTMS 2889

Query: 470  GATALSVEYSCSTPPLYAVLLRLQPLDASCAFGTAHNPGREPRP-------------PTH 330
             + + +     ST P        +P        ++ +    P P             PT 
Sbjct: 2890 SSMSSASTPEPSTTPGTTRTTPTRPTPTDSTMSSSMSSASTPEPSTTPGTTRTTPTRPTP 2949

Query: 329  A*SRRAYSDDAQRAPWPRSTSRGRPRDTARRRLKRESAPRV*ALPCDQSHPHTCE*WCRR 150
              S  + S  +   P P +T+ G  R T  R    +S     + P   S P T      R
Sbjct: 2950 TDSTMSSSMSSASTPEP-TTTPGTTRTTPTRPTPTDSTMSSASTPKPSSTPGTTRTTPTR 3008

Query: 149  P 147
            P
Sbjct: 3009 P 3009

>ref|NP_490918.2| putative protein family member of eukaryotic origin (149.5 kD)
           (1C602C) [Caenorhabditis elegans]
           gi|18652629|gb|AAL00866.2|AC093703_6 Hypothetical
           protein Y20F4.4 [Caenorhabditis elegans]
          Length = 1319

 Score = 37.7 bits (86), Expect = 0.26
 Identities = 59/214 (27%), Positives = 77/214 (35%), Gaps = 14/214 (6%)
 Frame = -1

Query: 773 PTQSRPLISRTGILPHPGATSVPATRVCRDSDL---PTGS------SIFRWHSAAPPPTP 621
           P Q R   SR G     G    P+    ++ +L    TGS      + F+  + A PP  
Sbjct: 284 PVQDR---SRNGRATPMGTRGAPSPASSQNGNLFRNGTGSQHRGSPTDFKSKAPAKPPAQ 340

Query: 620 KNNVG-DEHSVRIRSTRACASNTALL----QCQHRRPGVRTYKPARSLPRRVGMPGATAL 456
             N   +EH  R RS RA    +A      Q +H R G  ++  +RS        GA+  
Sbjct: 341 NGNASHNEHRSRSRS-RATVPRSAQYRSEHQTEHARTGSSSHHESRSRAAAAAPVGAS-- 397

Query: 455 SVEYSCSTPPLYAVLLRLQPLDASCAFGTAHNPGREPRPPTHA*SRRAYSDDAQRAPWPR 276
               S S PP  +   R    +          P R   PP              +  W R
Sbjct: 398 ----SQSKPPAPSPTSR----ETDSIPEAPAEPIRNISPP--------------QPRWTR 435

Query: 275 STSRGRPRDTARRRLKRESAPRV*ALPCDQSHPH 174
           S SRGR   T R   K  SAP    LP   S P+
Sbjct: 436 SQSRGRSEATRRSPPKGPSAPPAYTLPFGGSTPN 469



EST assemble image


clone accession position
1 CM067e09_r AV390891 1 597
2 HC006h01_r AV632331 372 887




Chlamydomonas reinhardtii
Kazusa DNA Research Institute