KCC001653A_c01

Fasta Sequence
>KCC001653A_C01 KCC001653A_c01
GCACCAGATTCCTCCGATCCTTGCCCAGCCGGGGACTTGCACAACCACAGAAACCATGCA
GCTCGCGCAGAAGGCTTCGGGCGTTCGCCCCGCTCAGAAGTCGGGAGCTAGGGCTGCGCG
TCCCAGCGTGTGCCGCAAGGCCGTGGTGTGCAAGGCGCAGTCGTCGCTGGGCCAGAAGCT
TGCTTCTGTCGGTGCCGCGGCTATGCTGTCGCTGGGTGCCCTGGGCGCCCCTGCCATTGC
CTCGGAGTTCGATATCCTGGGCGAGCCCACCCCCACCTCCAACTACTTCATTGATGATGC
CAGCGTGCTGAGCAAGGCCACTCGCCAGGACATCAACAAGCGCCTCAAGCTGCTGGAGAT
CCAGACTGGCTACCGTGTCGAGGTGGTGACCGTTCGGCGCCTGGAGTTCGAGACTGACGC
CTTCGCGTTCGCTGATAAGGTGCTGGAGAACTGGTACCCCACCGCCGAGGCGGGCAAGGA
TAAGGGCCTGCTGCTGGTGGTGACCGCCTCCAAGGAGGGCGCCGTTACCGGCGGCGCTGG
CTTCACCGGCGCCGTGGGTGACGACCTGATCGACTCCATCATCTCCACCAACATCCCCAT
CTTCACTGAGGAGGAGAAGTACAACCAGACGGTGGTGTCGGCGGTGGAGCGCCTGGAGGC
CAAGCTGCTGGGCAAACCCGTGCCCGAGGCGCCGGTCCGCAACGAGCAGAATCGTGAGCG
CACCTACCGCACGAAGGAGGAGACTGAGAAGAGCCGGAACGTTACCAGCACCGTTGTGGG
CACCCTGCTGCTGATTGCCGTTGTGGTCCCCATGCTCCAATACTACGGCTACACTGCCCG
CGACTAAGCGAGCTTGGAGCTGTGTTGGAGCTTTAGAGCCTAGGCGGTTGTCATGAGCAG
CTGG


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC001653A_C01 KCC001653A_c01
         (904 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_564667.1| thylakoid lumen 18.3 kDa protein [Arabidopsis t...   204  2e-51
ref|ZP_00072906.1| COG1512: Beta-propeller domains of methanol d...   106  5e-22
ref|NP_441552.1| hypothetical protein [Synechocystis sp. PCC 680...   105  1e-21
ref|NP_681194.1| ORF_ID:tll0404~hypothetical protein [Thermosyne...   103  4e-21
ref|NP_488140.1| hypothetical protein [Nostoc sp. PCC 7120] gi|2...    99  7e-20

>ref|NP_564667.1| thylakoid lumen 18.3 kDa protein [Arabidopsis thaliana]
           gi|25405770|pir||H96589 hypothetical protein T22H22.19
           [imported] - Arabidopsis thaliana
           gi|3776572|gb|AAC64889.1| ESTs gb|R65052, gb|AA712146,
           gb|H76533, gb|H76282, gb|AA650771, gb|H76287,
           gb|AA650887, gb|N37383, gb|Z29721 and gb|Z29722 come
           from this gene. [Arabidopsis thaliana]
           gi|14030683|gb|AAK53016.1|AF375432_1 At1g54780/T22H22_19
           [Arabidopsis thaliana] gi|17064782|gb|AAL32545.1|
           Unknown protein [Arabidopsis thaliana]
           gi|19698897|gb|AAL91184.1| unknown protein [Arabidopsis
           thaliana] gi|20259868|gb|AAM13281.1| unknown protein
           [Arabidopsis thaliana] gi|21593390|gb|AAM65339.1|
           unknown [Arabidopsis thaliana]
           gi|23198362|gb|AAN15708.1| unknown protein [Arabidopsis
           thaliana] gi|23505937|gb|AAN28828.1| At1g54780/T22H22_19
           [Arabidopsis thaliana]
          Length = 285

 Score =  204 bits (519), Expect = 2e-51
 Identities = 99/223 (44%), Positives = 149/223 (66%)
 Frame = +2

Query: 173 QKLASVGAAAMLSLGALGAPAIASEFDILGEPTPTSNYFIDDASVLSKATRQDINKRLKL 352
           Q LA++  +  L+   +G  A+ASEF+IL +  P   Y +DDA VLS+ T+ D+ K L  
Sbjct: 63  QGLAALALSLTLTFSPVGT-ALASEFNILNDGPPKETYVVDDAGVLSRVTKSDLKKLLSD 121

Query: 353 LEIQTGYRVEVVTVRRLEFETDAFAFADKVLENWYPTAEAGKDKGLLLVVTASKEGAVTG 532
           LE +   R+  +TVR+L  + DAF +AD+VLE WYP+ E G +KG+++++T+ KEGA+TG
Sbjct: 122 LEYRKKLRLNFITVRKLTSKADAFEYADQVLEKWYPSIEEGNNKGIVVLITSQKEGAITG 181

Query: 533 GAGFTGAVGDDLIDSIISTNIPIFTEEEKYNQTVVSAVERLEAKLLGKPVPEAPVRNEQN 712
           G  F  AVG++++D+ +S N+P+   +EKYN+ V S+ +RL A + G+P P  P   +  
Sbjct: 182 GPAFIEAVGENILDATVSENLPVLATDEKYNEAVYSSAKRLVAAIDGQPDPGGPTVKDSK 241

Query: 713 RERTYRTKEETEKSRNVTSTVVGTLLLIAVVVPMLQYYGYTAR 841
           RE  ++TKEET++ R   S VVG LL+IA VVPM QY+ Y +R
Sbjct: 242 RESNFKTKEETDEKRGQFSLVVGGLLVIAFVVPMAQYFAYVSR 284
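
The "Frame = +2" line and the query coordinates in the alignment above can be checked by hand. The following is a minimal Python sketch, assuming the standard genetic code (NCBI translation table 1) and 1-based nucleotide coordinates. The names SEQ_121_240, forward_frame and translate_from are illustrative only and are not part of this record or of BLAST. It derives the reading frame from the query start coordinate and translates the first codons of the aligned region.

```python
# Nucleotides 121-240 of KCC001653A_c01, copied from the FASTA record above.
SEQ_121_240 = (
    "TCCCAGCGTGTGCCGCAAGGCCGTGGTGTGCAAGGCGCAGTCGTCGCTGGGCCAGAAGCT"
    "TGCTTCTGTCGGTGCCGCGGCTATGCTGTCGCTGGGTGCCCTGGGCGCCCCTGCCATTGC"
)
FIRST_POS = 121  # 1-based position of SEQ_121_240[0] in the full contig

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the standard code applies to this nuclear transcript).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def forward_frame(start):
    """Forward reading frame (1, 2 or 3) for a 1-based start coordinate."""
    return (start - 1) % 3 + 1


def translate_from(start, n_codons):
    """Translate n_codons codons beginning at 1-based contig position start."""
    offset = start - FIRST_POS
    chunk = SEQ_121_240[offset:offset + 3 * n_codons]
    usable = len(chunk) - len(chunk) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[chunk[i:i + 3]] for i in range(0, usable, 3))


print(forward_frame(173))      # 2
print(translate_from(173, 8))  # QKLASVGA
```

The translated residues agree with the start of the first Query line (position 173), and the 180 nucleotides from 173 to 352 correspond to the 60 residues shown on that line.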

>ref|ZP_00072906.1| COG1512: Beta-propeller domains of methanol dehydrogenase type
           [Trichodesmium erythraeum IMS101]
          Length = 242

 Score =  106 bits (265), Expect = 5e-22
 Identities = 61/190 (32%), Positives = 106/190 (55%), Gaps = 2/190 (1%)
 Frame = +2

Query: 269 TPTSNYFIDDASVLSKATRQDINKRLKLLEIQTGYRVEVVTVRRLEFETDAFAFADKVLE 448
           T    + +DDA VLS+ T+  +N  L+ L   TG  V  VT+RRL++   A +F +K+ +
Sbjct: 48  TGNRTFIVDDADVLSRVTKNKLNNTLENLANLTGNEVRFVTIRRLDYGETADSFTEKLFD 107

Query: 449 NWYPTAEAGKDKGLLLVVTASKEGAVTGGAGFTGAVGDDLIDSIISTNIPI-FTEEEKYN 625
            W+PT EA  ++ L+++ T +   A+  G      + +D+  S+++  I +   +  KYN
Sbjct: 108 KWFPTLEAKANQTLVVLDTLTNNDAIRIGDAVKIFMSNDITQSLVNETIQVPIRDGNKYN 167

Query: 626 QTVVSAVERLEAKLLGKPVPEAPVRNEQNRERTYRTKEETEKSRNVTSTV-VGTLLLIAV 802
           +  ++A +RL A L G+P P  P   ++   +   T +  E++ + ++TV V  LL+IA 
Sbjct: 168 EAFLAASDRLTAVLSGEPDPGPPDIKDELSAQVAATFKSAEETNDQSATVLVVVLLVIAT 227

Query: 803 VVPMLQYYGY 832
           VVPM  Y+ Y
Sbjct: 228 VVPMATYFWY 237

>ref|NP_441552.1| hypothetical protein [Synechocystis sp. PCC 6803]
           gi|7470069|pir||S75671 hypothetical protein sll1390 -
           Synechocystis sp. (strain PCC 6803)
           gi|1653317|dbj|BAA18232.1| ORF_ID:sll1390~hypothetical
           protein [Synechocystis sp. PCC 6803]
          Length = 249

 Score =  105 bits (262), Expect = 1e-21
 Identities = 67/222 (30%), Positives = 113/222 (50%), Gaps = 2/222 (0%)
 Frame = +2

Query: 173 QKLASVGAAAMLSLGALGAPAIASEFDILGEPTPTSNYF-IDDASVLSKATRQDINKRLK 349
           ++L S     ++ LG   AP++A+    L   +P S  F +D A  +S A    +N  LK
Sbjct: 24  KRLLSFLFLTLVLLGLSPAPSLATGVYDLPILSPGSKTFLVDQAEAISLANENRLNSDLK 83

Query: 350 LLEIQTGYRVEVVTVRRLEFETDAFAFADKVLENWYPTAEAGKDKGLLLVVTASKEGAVT 529
            L   +G     V +RRL+F+     F + + E WYP   +  ++ LL++ T +   A+ 
Sbjct: 84  KLAQSSGQEARFVVIRRLDFDATIDGFVNDLFERWYPDEASQSNQTLLVLDTLTNSTALR 143

Query: 530 GGAGFTGAVGDDLIDSIISTNIPI-FTEEEKYNQTVVSAVERLEAKLLGKPVPEAPVRNE 706
            G      + D+++DS++   + +   +  KYNQ ++ A +RL A L G+P P  P   E
Sbjct: 144 RGETAESLLTDEMVDSLLRETLAVPLKDGAKYNQALIEADKRLGAILAGQPDPGPPALEE 203

Query: 707 QNRERTYRTKEETEKSRNVTSTVVGTLLLIAVVVPMLQYYGY 832
            + E T+ T EET+ +      VV  LL +A ++PM+ Y+ Y
Sbjct: 204 ISLEGTFTTAEETDDTSATVWVVV--LLALATLIPMVTYFWY 243

>ref|NP_681194.1| ORF_ID:tll0404~hypothetical protein [Thermosynechococcus elongatus
           BP-1] gi|22294125|dbj|BAC07956.1|
           ORF_ID:tll0404~hypothetical protein [Thermosynechococcus
           elongatus BP-1]
          Length = 228

 Score =  103 bits (257), Expect = 4e-21
 Identities = 61/186 (32%), Positives = 100/186 (52%)
 Frame = +2

Query: 275 TSNYFIDDASVLSKATRQDINKRLKLLEIQTGYRVEVVTVRRLEFETDAFAFADKVLENW 454
           T+   ID+ +VLS  T+  + + L+ L   TG  V VVT+ RL++     +F D +   W
Sbjct: 41  TATGVIDEGNVLSAVTQGSVGRSLQDLSEATGINVHVVTLHRLDYGETPQSFVDDLFSQW 100

Query: 455 YPTAEAGKDKGLLLVVTASKEGAVTGGAGFTGAVGDDLIDSIISTNIPIFTEEEKYNQTV 634
           +P  E+  ++ ++ + T +   A+  G      +  +  +SI+   + +   E  YNQ V
Sbjct: 101 FPDPESQANQVIIALDTVTNGTAIHYGDAVAERLNPETAESIVQETMRVPLREGNYNQAV 160

Query: 635 VSAVERLEAKLLGKPVPEAPVRNEQNRERTYRTKEETEKSRNVTSTVVGTLLLIAVVVPM 814
           +  V+RL   L G+P P  PV  E   E+TY++KEET+  R+ T  VV  LL+ A V+PM
Sbjct: 161 LDTVDRLGKVLKGEPDPGPPVVREVVVEKTYKSKEETD-DRSATIIVV-ALLIAATVIPM 218

Query: 815 LQYYGY 832
           + Y+ Y
Sbjct: 219 VTYFMY 224

>ref|NP_488140.1| hypothetical protein [Nostoc sp. PCC 7120] gi|25359462|pir||AE2318
           hypothetical protein alr4100 [imported] - Nostoc sp.
           (strain PCC 7120) gi|17133235|dbj|BAB75799.1|
           ORF_ID:alr4100~hypothetical protein [Nostoc sp. PCC
           7120]
          Length = 245

 Score = 99.4 bits (246), Expect = 7e-20
 Identities = 69/217 (31%), Positives = 112/217 (50%), Gaps = 3/217 (1%)
 Frame = +2

Query: 197 AAMLSLGALGAPAIASE-FDILGEPTPTSNYFIDDASVLSKATRQDINKRLKLLEIQTGY 373
           A +L+     APA+AS  + I       S + +D   V+S+     I+  L+ L  +TG 
Sbjct: 24  AIILAASLSSAPALASGVYQIPNLTAGDSTWVLDQGDVISRINEGAISSSLEDLAKETGK 83

Query: 374 RVEVVTVRRLEFETDAFAFADKVLENWYPTAEAGKDKGLLLVVTASKEGAVTGGAGFTGA 553
            V  VT+ RL++     +FA  + E W+P+ EA  ++ LL++ T +   A+  G      
Sbjct: 84  EVRFVTIHRLDYGETPESFAQALFEKWFPSKEAQANQILLVLDTVTNGTAIITGDEVKPL 143

Query: 554 VGDDLIDSIISTNIPI-FTEEEKYNQTVVSAVERLEAKLLGKPVPEAP-VRNEQNRERTY 727
           + D + +S+    +     +  KYNQ  + A +RL A L G+P P  P + ++   E T+
Sbjct: 144 LTDTIANSVAEETLAAPLRDGNKYNQAFLDASDRLVAVLSGQPDPGPPQIVDKVQVEGTF 203

Query: 728 RTKEETEKSRNVTSTVVGTLLLIAVVVPMLQYYGYTA 838
           +  EET+K  N T+ VVG LL+ A ++PM  YY Y A
Sbjct: 204 KKAEETDKG-NATAWVVG-LLIAATIIPMATYYIYLA 238



EST assemble image


no.  clone        accession  position
 1   CM100c02_r   AV392859     1   542
 2   LC089h06_r   AV625247    79   543
 3   MX252a11_r   BP092286    92   326
 4   MXL099c02_r  BP098792    95   504
 5   MX063g08_r   BP088570    97   451
 6   LC064a12_r   AV623458    97   598
 7   CM089a11_r   AV392238    99   615
 8   MX059c05_r   BP088391   100   446
 9   HCL100h04_r  AV645305   100   497
10   HC081c06_r   AV638055   101   567
11   MX043g05_r   BP087794   103   546
12   MX248c08_r   BP092008   108   500
13   HC010h01_r   AV632649   112   601
14   HC041g01_r   AV635087   112   592
15   MX023b06_r   BP087027   122   555
16   HC055f11_r   AV636141   122   541
17   HC003g02_r   AV632086   166   693
18   CM075d09_r   AV391596   174   843
19   HC007b12_r   AV632361   314   815
20   MXL019g02_r  BP094219   507  1010
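
The table above can be summarised programmatically. The following is a rough Python sketch, assuming that the two numbers in the position column are 1-based, inclusive start and end coordinates of each EST on the assembly (the record itself does not state this). The helper names parse_rows and depth_at are hypothetical.

```python
# Rows copied from the EST assembly table above: index, clone, accession,
# then two position values (assumed here to be start and end, 1-based inclusive).
EST_TABLE = """\
1 CM100c02_r AV392859 1 542
2 LC089h06_r AV625247 79 543
3 MX252a11_r BP092286 92 326
4 MXL099c02_r BP098792 95 504
5 MX063g08_r BP088570 97 451
6 LC064a12_r AV623458 97 598
7 CM089a11_r AV392238 99 615
8 MX059c05_r BP088391 100 446
9 HCL100h04_r AV645305 100 497
10 HC081c06_r AV638055 101 567
11 MX043g05_r BP087794 103 546
12 MX248c08_r BP092008 108 500
13 HC010h01_r AV632649 112 601
14 HC041g01_r AV635087 112 592
15 MX023b06_r BP087027 122 555
16 HC055f11_r AV636141 122 541
17 HC003g02_r AV632086 166 693
18 CM075d09_r AV391596 174 843
19 HC007b12_r AV632361 314 815
20 MXL019g02_r BP094219 507 1010
"""


def parse_rows(text):
    """Return (clone, accession, start, end) tuples, skipping malformed lines."""
    rows = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 5:
            continue
        _, clone, accession, start, end = fields
        rows.append((clone, accession, int(start), int(end)))
    return rows


def depth_at(rows, pos):
    """Number of ESTs whose [start, end] interval contains position pos."""
    return sum(1 for _, _, start, end in rows if start <= pos <= end)


rows = parse_rows(EST_TABLE)
span = (min(r[2] for r in rows), max(r[3] for r in rows))
print(len(rows), span)      # 20 (1, 1010)
print(depth_at(rows, 120))  # 14
```

Splitting on whitespace keeps the parser indifferent to column alignment, so it works whether the table is stored space-separated or padded.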




Chlamydomonas reinhardtii
Kazusa DNA Research Institute