KCC000364A_c01

Fasta Sequence
>KCC000364A_C01 KCC000364A_c01
cacggcgaggaggacgagcgctggctgtacggcctgatggggctgacggtggcggggtac
gcgggcacactcacactggcGGGCCTGATGTACGCCTGGTTCAAGCCGGCGGGCGCGGGC
AGCTGCAGCCTCAACATCGGCGCCATCACGCTCACGCTGCTGCTGGTGGTGGCCTTCTCA
GTGCTCAGCCTGGCGCCACTGGCGCGGCAGGGCTCCATCTTCCCCTCCGCGGCCATCGGG
CTGTACGCGGCCTACCTGTGCTTCAGCGCGCTGCAGTCCGAGCCCAAGGAGTACGCCTGC
AACGGACTGGGCCGCTCGCTCACGGCCGCATCGGGTGGCACGCTGGCTCTGGGCATGCTG
GTGACGCTGGCCTCAGTGGTGTACGCCGCCTTCCGCGCCGGCAGCAACACAGCGCTGTTC
ACACTGGACGGAAGCGAGGACGGCGAGGGAGGAGCCGGAGGCGGTGCGGGGCAGCGGCAG
GCGCTGCTGGCGGATGTGGAGGGCACCAGCGCGGGTCTGGACGGAGTGCCGGATGTGGCG
GAGGCCACACGCGAGGCCGTCACAGGCGGCGCGCCCAAGCCTGACGCCGCCGCGGTGGCT
CGTGCCGAGGCGCTGACGCCCGTGTCCTACAACTACAGTTTCTTCCACCTCATCTTCGCG
CTGGCCTCCATGTACATTGCCATGCTCATGACCGGATGGGGCAGCGTGGCGCAGGACAAG
GACCGCATCGACGTGGGCTGGGCCAGCGTGTGGGTCAAGCTGGGCGCGCAGTGGGTGACG
GGGCTGCTGTACATGTGGACGCTGTTGGCGCCGGCGCTGTTCCCGGACCGCGACTTCTCC
TAGAGGACGGGGGGGGGGCGGGAGGCATGCGAGGAGGGCTGGAGGAGCTCTGTGTG
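
As a quick sanity check on the record above, the contig can be translated in reading frame +1, which is the frame BLASTX reports for the hits in this record. The following is a minimal Python sketch, assuming the standard genetic code. The `translate` helper and the `EXCERPT` constant (the first 60 nt of the contig) are illustrative names and are not part of the original record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). Assumption: the standard nuclear code applies.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, frame=1):
    """Translate a forward reading frame (1, 2 or 3).

    Codons containing anything other than A, C, G, T become 'X';
    stop codons are written as '*'.
    """
    if frame not in (1, 2, 3):
        raise ValueError("frame must be 1, 2 or 3")
    seq = dna.upper()
    start = frame - 1
    codons = (seq[i:i + 3] for i in range(start, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First 60 nt of KCC000364A_c01, copied from the FASTA record above.
EXCERPT = "cacggcgaggaggacgagcgctggctgtacggcctgatggggctgacggtggcggggtac"

if __name__ == "__main__":
    protein = translate(EXCERPT, frame=1)
    print(protein)      # frame +1 translation of the excerpt
    print(protein[2:])  # residues from nucleotide 7 onward
```

BLASTX reports query coordinates in nucleotides, so query position 7 corresponds to the third codon of frame +1. That is why `protein[2:]` begins with the same residues (`EEDERWLYGL...`) as the `Query: 7` lines in the alignments.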


Nr Search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC000364A_C01 KCC000364A_c01
         (896 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAO73245.1| putative membrane protein [Oryza sativa (japonica...   219  6e-56
ref|NP_187268.1| expressed protein [Arabidopsis thaliana] gi|686...   209  4e-53
ref|NP_173069.1| expressed protein [Arabidopsis thaliana] gi|253...   195  8e-49
ref|NP_506611.1| tumor differentially expressed 1 like family me...   117  3e-25
gb|AAH11295.1| Tde1 protein [Mus musculus] gi|18606124|gb|AAH229...   114  3e-24

>gb|AAO73245.1| putative membrane protein [Oryza sativa (japonica cultivar-group)]
          Length = 417

 Score =  219 bits (557), Expect = 6e-56
 Identities = 120/279 (43%), Positives = 166/279 (59%), Gaps = 1/279 (0%)
 Frame = +1

Query: 7   EEDERWLYGLMGLTVAGYAGTLTLAGLMYAWFKPAGAGSCSLNIGAITLTLLLVVAFSVL 186
           +++++W   L+ +TV  Y  T   +GL++ WF P+G   C LN+  IT+T++L  AF+++
Sbjct: 174 KDEQKWEIALLVVTVVCYLSTFAFSGLLFTWFNPSGH-DCGLNVFFITMTIILAFAFAII 232

Query: 187 SLAPLARQGSIFPSAAIGLYAAYLCFSALQSEPKEYACNGLGRSLTAASGGTLALGMLVT 366
           +L P    GS+ P++ I +Y AYLC+++L SEP +YACNGL R     S   L LGML T
Sbjct: 233 ALHPQVN-GSVMPASVISVYCAYLCYTSLSSEPDDYACNGLHRHSKQVSMSALILGMLTT 291

Query: 367 LASVVYAAFRAGSNTALFTLDGSEDGEGGAGGGAGQRQALLADVEGTSAGLDGVPDVAEA 546
           + SVVY+A RAGS+T   +   S          +G +  LL D        D V      
Sbjct: 292 VLSVVYSAVRAGSSTTFLSPPSSP--------RSGIKNPLLGD--------DNV------ 329

Query: 547 TREAVTGGAPKPDAAAVARAEALTPVSYNYSFFHLIFALASMYIAMLMTGWGSVAQDKDR 726
             EA    + + DA          PVSY+Y+FFH+IFALASMY AML+TGW S A D   
Sbjct: 330 --EAGKSNSKEIDA---------RPVSYSYTFFHVIFALASMYSAMLLTGWTSAASDSSE 378

Query: 727 I-DVGWASVWVKLGAQWVTGLLYMWTLLAPALFPDRDFS 840
           + DVGW +VWV++  +W T  LY+WTL+AP LFPDRDFS
Sbjct: 379 LMDVGWTTVWVRICTEWATAALYIWTLVAPLLFPDRDFS 417

>ref|NP_187268.1| expressed protein [Arabidopsis thaliana]
           gi|6862921|gb|AAF30310.1|AC018907_10 hypothetical
           protein [Arabidopsis thaliana]
          Length = 315

 Score =  209 bits (533), Expect = 4e-53
 Identities = 113/277 (40%), Positives = 163/277 (58%)
 Frame = +1

Query: 7   EEDERWLYGLMGLTVAGYAGTLTLAGLMYAWFKPAGAGSCSLNIGAITLTLLLVVAFSVL 186
           +++++W   L+ +++  Y  T T +G+++ WF P+G   C LN+  I + ++L   F+++
Sbjct: 77  KDEKKWYIALLVISIVCYIATYTFSGILFIWFNPSGQ-DCGLNVFFIVMPMILAFVFAII 135

Query: 187 SLAPLARQGSIFPSAAIGLYAAYLCFSALQSEPKEYACNGLGRSLTAASGGTLALGMLVT 366
           +L P A  GS+ P++ I +Y AY+C++ L SEP +Y CNGL +S  A +  TL LGML T
Sbjct: 136 ALHP-AVNGSLLPASVISVYCAYVCYTGLSSEPHDYVCNGLNKS-KAVNASTLILGMLTT 193

Query: 367 LASVVYAAFRAGSNTALFTLDGSEDGEGGAGGGAGQRQALLADVEGTSAGLDGVPDVAEA 546
           + SV+Y+A RAGS+T   +   S          +G + ALL D E      DG       
Sbjct: 194 VLSVLYSALRAGSSTTFLSPPSSPR--------SGVKDALLGDPE------DG------- 232

Query: 547 TREAVTGGAPKPDAAAVARAEALTPVSYNYSFFHLIFALASMYIAMLMTGWGSVAQDKDR 726
                     K    A AR     PVSY+YSFFH+IFALASMY AML++GW   ++    
Sbjct: 233 ----------KKSGEAEAR-----PVSYSYSFFHIIFALASMYAAMLLSGWTDSSESATL 277

Query: 727 IDVGWASVWVKLGAQWVTGLLYMWTLLAPALFPDRDF 837
           IDVGW SVWVK+   WVT  LY+WTL+AP + PDR+F
Sbjct: 278 IDVGWTSVWVKICTGWVTAGLYIWTLIAPLILPDREF 314

>ref|NP_173069.1| expressed protein [Arabidopsis thaliana] gi|25354033|pir||F86296
           hypothetical protein T24D18.26 - Arabidopsis thaliana
           gi|6587821|gb|AAF18512.1|AC010924_25 Contains similarity
           to gb|AF181686 membrane protein TMS1d from Drosophila
           melanogaster.  ESTs gb|R64994, gb|AI994832, gb|Z47674
           come from this gene. [Arabidopsis thaliana]
          Length = 412

 Score =  195 bits (496), Expect = 8e-49
 Identities = 110/279 (39%), Positives = 155/279 (55%), Gaps = 1/279 (0%)
 Frame = +1

Query: 4   GEEDERWLYGLMGLTVAGYAGTLTLAGLMYAWFKPAGAGSCSLNIGAITLTLLLVVAFSV 183
           G +++ W   L+ +++  Y  T   +G ++ WF P+G   C LN   I +TL+ V  F++
Sbjct: 173 GYDEQFWYAALLVVSLVCYLATFVFSGFLFHWFTPSGH-DCGLNTFFIIMTLIFVFVFAI 231

Query: 184 LSLAPLARQGSIFPSAAIGLYAAYLCFSALQSEPKEYACNGLGRSLTAASGGTLALGMLV 363
           + L P    GSI P++ I LY  YLC+S L SEP++Y CNGL     A S GT+ +G+L 
Sbjct: 232 VVLHPTVG-GSILPASVISLYCMYLCYSGLASEPRDYECNGLHNHSKAVSTGTMTIGLLT 290

Query: 364 TLASVVYAAFRAGSNTALFTLDGSEDGEGGAGGGAGQRQALLADVEGTSAGLDGVPDVAE 543
           T+ SVVY+A RAGS+T L +   S   E          + LL         +DG  +  E
Sbjct: 291 TVLSVVYSAVRAGSSTTLLSPPDSPRAE----------KPLLP--------IDGKAEEKE 332

Query: 544 ATREAVTGGAPKPDAAAVARAEALTPVSYNYSFFHLIFALASMYIAMLMTGWG-SVAQDK 720
                                E   PVSY+Y+FFH+IF+LASMY AML+TGW  SV +  
Sbjct: 333 -------------------EKENKKPVSYSYAFFHIIFSLASMYSAMLLTGWSTSVGESG 373

Query: 721 DRIDVGWASVWVKLGAQWVTGLLYMWTLLAPALFPDRDF 837
             +DVGW SVWV++   W T  L++W+L+AP LFPDR+F
Sbjct: 374 KLVDVGWPSVWVRVVTSWATAGLFIWSLVAPILFPDREF 412

>ref|NP_506611.1| tumor differentially expressed 1 like family member, possibly
           N-myristoylated (48.2 kD) (5P79) [Caenorhabditis
           elegans] gi|7506652|pir||T24196 hypothetical protein
           R11H6.2 - Caenorhabditis elegans
           gi|3879145|emb|CAB07645.1| Hypothetical protein R11H6.2
           [Caenorhabditis elegans]
          Length = 442

 Score =  117 bits (292), Expect = 3e-25
 Identities = 88/298 (29%), Positives = 131/298 (43%), Gaps = 21/298 (7%)
 Frame = +1

Query: 7   EEDERWLYGLMGLTVAGYAGTLT-LAGLMYAWFKPAGAGSCSLNIGAITLTLLLVVAFSV 183
           + D R  Y   GL +  + G L  L   +Y +   A    C L    +   +L+ VA S+
Sbjct: 188 DNDSRACYA--GLLITTFGGFLVCLIAAVYVFINYAIGDGCGLPKFFVIFNVLICVAISL 245

Query: 184 LSLAPLARQ----GSIFPSAAIGLYAAYLCFSALQSEPKEYACNGLGRSLTAAS---GGT 342
           LS++P+ ++      +     I  Y  YL +SAL S P E +CN    ++T ++   GG 
Sbjct: 246 LSVSPMVQEVNPRSGLLQPVVISAYIIYLTWSALLSNPNE-SCNPTLANVTQSAIPTGGV 304

Query: 343 LA-------------LGMLVTLASVVYAAFRAGSNTALFTLDGSEDGEGGAGGGAGQRQA 483
                          + +L+ L  +VYA+ R  SNT+L  + G  +              
Sbjct: 305 TKDDSFVTPLPVHSLISLLIWLICLVYASIRNSSNTSLGKITGDNE-----------EHV 353

Query: 484 LLADVEGTSAGLDGVPDVAEATREAVTGGAPKPDAAAVARAEALTPVSYNYSFFHLIFAL 663
            L DVEG  A  +    VA                             Y+YSFFH +F L
Sbjct: 354 QLNDVEGGKAWDNEEEGVA-----------------------------YSYSFFHFMFCL 384

Query: 664 ASMYIAMLMTGWGSVAQDKDRIDVGWASVWVKLGAQWVTGLLYMWTLLAPALFPDRDF 837
           AS+Y+ M +T W     D   ++   ASVWVK+ + W+ G LY WTL+AP +FPDR+F
Sbjct: 385 ASLYVMMTLTSWYHPDSDLAHLNSNMASVWVKMFSSWICGGLYAWTLVAPIIFPDREF 442

>gb|AAH11295.1| Tde1 protein [Mus musculus] gi|18606124|gb|AAH22901.1| Tde1 protein
            [Mus musculus] gi|20809425|gb|AAH29026.1| Tde1 protein
            [Mus musculus]
          Length = 472

 Score =  114 bits (284), Expect = 3e-24
 Identities = 91/308 (29%), Positives = 139/308 (44%), Gaps = 35/308 (11%)
 Frame = +1

Query: 22   WLYGLMGLTVAGYAGTLTLAGLMYAWF-KPAGAGSCSLNIGAITLTLLLVVAFSVLSLAP 198
            W   L+  T   Y  ++  A L+Y ++ KP     C+ N   I+L L+  VA S++S+ P
Sbjct: 202  WYAALLSFTSLFYILSIVFAALLYVFYTKP---DDCTENKVFISLNLIFCVAVSIVSILP 258

Query: 199  LARQ----GSIFPSAAIGLYAAYLCFSALQSEPKEYACNGLGRSL---------TAASGG 339
              ++      +  S+ I LY  YL +SA+ +EP E +CN    S+         + A+  
Sbjct: 259  KVQEHQPRSGLLQSSIITLYTLYLTWSAMTNEP-ERSCNPSLMSIITHLTSPTVSPANST 317

Query: 340  TLA------------------LGMLVTLASVVYAAFRAGSNTAL--FTLDGSEDGEGGAG 459
            TLA                   G+++ +  ++Y++FR  SN+ +   TL GS+       
Sbjct: 318  TLAPAYAPPSQSGHFMNLDDIWGLIIFVFCLIYSSFRTSSNSQVNKLTLSGSDS------ 371

Query: 460  GGAGQRQALLADV-EGTSAGLDGVPDVAEATREAVTGGAPKPDAAAVARAEALTPVSYNY 636
                    +L D   G +   DG P      R AV                    V YNY
Sbjct: 372  -------VILGDTTNGANDEEDGQP------RRAVDNEKEG--------------VQYNY 404

Query: 637  SFFHLIFALASMYIAMLMTGWGSVAQDKDRIDVGWASVWVKLGAQWVTGLLYMWTLLAPA 816
            SFFHL+   AS+YI M +T W S      ++   W +VW K+G+ W+  LLY+WTL+AP 
Sbjct: 405  SFFHLMLCCASLYIMMTITSWYSPDAKFQKVSSKWLAVWFKMGSSWLCLLLYLWTLVAPL 464

Query: 817  LFPDRDFS 840
            +   RDFS
Sbjct: 465  VLTGRDFS 472



EST assemble image


No.  Clone        Accession  Start  End
1    LC021g11_r   AV620412       1  521
2    LCL004g06_r  AV626230     269  530
3    CM011e01_r   AV386884     313  817
4    LC065d07_r   AV623544     467  900
5    CL22c03_r    AV394220     711  896
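
The clone table above can be used to check how the five ESTs tile the contig. The sketch below is a hedged illustration in Python: it assumes the two position values are 1-based inclusive start and end coordinates in the assembly, which the record does not state explicitly. The names `CLONES`, `merge_intervals`, and `max_depth` are hypothetical helpers introduced for this example.

```python
# Clone positions copied from the table above. Assumption: each pair is a
# 1-based, inclusive (start, end) coordinate in the assembly.
CLONES = {
    "LC021g11_r": (1, 521),
    "LCL004g06_r": (269, 530),
    "CM011e01_r": (313, 817),
    "LC065d07_r": (467, 900),
    "CL22c03_r": (711, 896),
}


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous block: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def max_depth(intervals):
    """Return the largest number of intervals covering any one position."""
    events = []
    for start, end in intervals:
        events.append((start, 1))     # coverage begins at start
        events.append((end + 1, -1))  # coverage stops after end
    depth = best = 0
    # Tuples sort by position first, so a -1 at a position is applied
    # before a +1 at the same position.
    for _, delta in sorted(events):
        depth += delta
        best = max(best, depth)
    return best


if __name__ == "__main__":
    spans = list(CLONES.values())
    print(merge_intervals(spans))  # contiguous blocks covered by the ESTs
    print(max_depth(spans))        # deepest EST coverage at any position
```

Under that coordinate assumption the five clones merge into a single uninterrupted block, so no region of the assembly depends on a gap between ESTs. Note that clone 4 ends at position 900 while the BLASTX query is reported as 896 letters; the record does not explain the difference, so the sketch leaves the values as given.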




Chlamydomonas reinhardtii
Kazusa DNA Research Institute