KCC002123A_c01

Fasta Sequence
>KCC002123A_C01 KCC002123A_c01
gtaggtaggaagtttCTCGATAAGGGCTGAGCATTCCCCGTAGCTGTCTCCAACAGCTCG
CTCATGTTTCTCGCCCCTCGCCCGCCAGCTGATGAAAATGCTGGAGTCGGTGGTGCTATT
AAGCTTAGGTCAGGGCTTGGGGGTGGTCTCCGTGGGATCGGGAGCGCAAGCGGTCCGCTG
AAGCCGCTGGCGTCGGCCAACAACAAGGCGGCTCCAGGCGGACCCTCAAAGCTTGGGCAG
ACATATTCTGCTCAGCCACGTGCAGTCTTGGGGAACTTGACCAATGTCAACGCGAAGCTG
GGCTCCTCGTCGGCCGCATTGACGGGGAAGGCGAAGCCGGCTCAACCGCTGCAGTATCAG
GCAGCTTCCATCGAGAATGCTGCCGGCAAGGGCTGGAAAGCACAGGAGAGCGACCGCACG
GTGCAGGAGGCGAACGCTGTGACGATCCGTGTTAACCGGACCATTGCGGCCGTCAGCACG
TGGAGGACAGCCCCCTTCTACATCCTGGCAGACGAGGACGAGAGCGACGCCAGCGAGGCC
GATGAGCCCGCGCGCGA


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC002123A_C01 KCC002123A_c01
         (557 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||T10741 extensin-like protein PRP5 - Persian tobacco gi|2653...    36  4e-04
ref|NP_501120.1| hect domain containing protein 1 (4H900) [Caeno...    42  0.004
ref|XP_233556.2| similar to Ser/Arg-related nuclear matrix prote...    42  0.007
ref|ZP_00033604.1| COG4625: Uncharacterized protein with a C-ter...    40  0.021
ref|XP_294174.2| similar to mucin [Homo sapiens]                       40  0.028
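The summary rows above can be read programmatically. The following is a minimal Python sketch, not part of the BLAST distribution: the helper name `parse_hit_line` is hypothetical, and it assumes each row ends with two whitespace-separated fields (bit score, then E-value), as in this report. Descriptions truncated with "..." are kept as they appear.

```python
def parse_hit_line(line):
    """Split one summary row into (description, bit score, E-value).

    Assumes the last two whitespace-separated fields are the score and
    the E-value; everything before them is the (possibly truncated)
    description.
    """
    description, score, evalue = line.rsplit(None, 2)
    return description.strip(), float(score), float(evalue)


# Rows copied verbatim from the hit summary of this report.
rows = [
    "pir||T10741 extensin-like protein PRP5 - Persian tobacco gi|2653...    36  4e-04",
    "ref|NP_501120.1| hect domain containing protein 1 (4H900) [Caeno...    42  0.004",
    "ref|XP_233556.2| similar to Ser/Arg-related nuclear matrix prote...    42  0.007",
    "ref|ZP_00033604.1| COG4625: Uncharacterized protein with a C-ter...    40  0.021",
    "ref|XP_294174.2| similar to mucin [Homo sapiens]                       40  0.028",
]

hits = [parse_hit_line(row) for row in rows]

# The rows are ordered by E-value, not by bit score.
for description, score, evalue in hits:
    identifier = description.split()[0]
    print(f"{identifier:<20} score={score:>4.0f}  E={evalue:g}")
```

Note that the first hit has the lowest bit score of the five but the smallest E-value; in the alignment section its E-value is reported as Expect(2), i.e. it is attached to a pair of segments rather than a single one.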

>pir||T10741 extensin-like protein PRP5 - Persian tobacco
           gi|2653671|gb|AAC15893.1| 120 kDa style glycoprotein
           [Nicotiana alata]
          Length = 461

 Score = 36.2 bits (82), Expect(2) = 4e-04
 Identities = 18/48 (37%), Positives = 24/48 (49%)
 Frame = -2

Query: 436 RSPPAPCGRSPVLSSPCRQHSRWKLPDTAAVEPASPSPSMRPTRSPAS 293
           +SPP P  +SP L  P  Q  +   P + A +P  P PS +P   P S
Sbjct: 91  KSPPPPPAKSPPLPPPPVQPPKQSPPPSPAKQPPPPPPSAKPPVKPPS 138

 Score = 30.4 bits (67), Expect(2) = 1.1
 Identities = 12/27 (44%), Positives = 17/27 (62%)
 Frame = -1

Query: 173 PLALPIPRRPPPSPDLSLIAPPTPAFS 93
           P  LPI + PPP+  L +  PP PA++
Sbjct: 197 PAQLPIRQPPPPATQLPIRKPPPPAYT 223

 Score = 28.9 bits (63), Expect(2) = 4e-04
 Identities = 12/25 (48%), Positives = 15/25 (60%)
 Frame = -1

Query: 173 PLALPIPRRPPPSPDLSLIAPPTPA 99
           P  LPI + PPP+  L +  PP PA
Sbjct: 174 PAQLPIRQPPPPATQLPIRKPPPPA 198

 Score = 22.7 bits (47), Expect(2) = 1.1
 Identities = 20/90 (22%), Positives = 32/90 (35%), Gaps = 2/90 (2%)
 Frame = -2

Query: 469 PQWSG*HGSSQRSPPAPCGRSPVLSSPCRQHSRWKLPDTAAVEPASPSPSMR--PTRSPA 296
           P+ S     +++ PP P    P +  P    +        A  P+ P P  R  P + P 
Sbjct: 111 PKQSPPPSPAKQPPPPPPSAKPPVKPPSPSPAAQPPATQRATPPSQPPPMQRAPPPKLPL 170

Query: 295 SR*HWSSSPRLHVAEQNMSAQALRVRLEPP 206
                    +L + +    A  L +R  PP
Sbjct: 171 P----PPPAQLPIRQPPPPATQLPIRKPPP 196

>ref|NP_501120.1| hect domain containing protein 1 (4H900) [Caenorhabditis elegans]
            gi|7497007|pir||T29285 hypothetical protein C34D4.14 -
            Caenorhabditis elegans gi|1330345|gb|AAB00699.1|
            Hypothetical protein C34D4.14 [Caenorhabditis elegans]
          Length = 2761

 Score = 42.4 bits (98), Expect = 0.004
 Identities = 35/108 (32%), Positives = 47/108 (43%), Gaps = 6/108 (5%)
 Frame = +1

Query: 73   APRPPADENAGVGGAIKLRSGLGGGLRGIGSASGP----LKPLASANNKAAPGGPSKLGQ 240
            +P PP   ++       L SGLG GL      + P    L   AS  N    G PS  G 
Sbjct: 1645 SPPPPPPSSSTFSS---LASGLGFGLNRHKQHNKPAASALSRFASVKNTTPAGTPSSGGS 1701

Query: 241  TYSAQPRAVLG--NLTNVNAKLGSSSAALTGKAKPAQPLQYQAASIEN 378
            +  A  +  +   NL +   K    S A TG+A  A+ LQ+Q  S+EN
Sbjct: 1702 SGGAIGKKSMSTTNLVDERQKTSGPSVASTGQAASAESLQHQTPSLEN 1749

>ref|XP_233556.2| similar to Ser/Arg-related nuclear matrix protein;
           plenty-of-prolines-101; serine/arginine repetitive
           matrix protein 1 [Rattus norvegicus]
          Length = 958

 Score = 41.6 bits (96), Expect = 0.007
 Identities = 32/90 (35%), Positives = 42/90 (46%)
 Frame = -2

Query: 538 PRWRRSRPRLPGCRRGLSSTC*RPQWSG*HGSSQRSPPAPCGRSPVLSSPCRQHSRWKLP 359
           P+ R + P  P  RR   S   + + S      QRSPP    RSP LSS   +H +   P
Sbjct: 664 PKRRTASPPPPPKRRASPSPPPKRRVSHSPPPKQRSPPVTKRRSPSLSS---KHRKGSSP 720

Query: 358 DTAAVEPASPSPSMRPTRSPASR*HWSSSP 269
             +  E  SP P+ R + SP  R   +SSP
Sbjct: 721 GRSTREARSPQPNKRHSPSPRPRAPQTSSP 750

>ref|ZP_00033604.1| COG4625: Uncharacterized protein with a C-terminal OMP (outer
           membrane protein) domain [Burkholderia fungorum]
          Length = 484

 Score = 40.0 bits (92), Expect = 0.021
 Identities = 29/84 (34%), Positives = 38/84 (44%)
 Frame = +1

Query: 136 LGGGLRGIGSASGPLKPLASANNKAAPGGPSKLGQTYSAQPRAVLGNLTNVNAKLGSSSA 315
           L GGL G   A GPL PL S  + +  G    LG T S  P A L  +  V++  G+ S 
Sbjct: 351 LSGGLTGGSGAGGPLAPLTSVVS-SLTGSLGGLGGTGSGSPLAPLTGV--VSSVTGALSG 407

Query: 316 ALTGKAKPAQPLQYQAASIENAAG 387
           A      P  P+    +S+  A G
Sbjct: 408 ATNSSGNPLAPVTSAVSSLTGALG 431

>ref|XP_294174.2| similar to mucin [Homo sapiens]
          Length = 765

 Score = 39.7 bits (91), Expect = 0.028
 Identities = 39/131 (29%), Positives = 56/131 (41%), Gaps = 7/131 (5%)
 Frame = -2

Query: 550 RAHRPRWRRSRPRLPGCRRGLSSTC*RPQWSG*HGSSQRSPPAPCGRSPVLSSP---CRQ 380
           RAH  R+ R RP  P       +   +P      G++++SPP    ++P  +SP    RQ
Sbjct: 492 RAHPRRYARRRPEEPTTDATSGARVEQPTPDATPGAARKSPPQTLHQAPAPNSPPQTLRQ 551

Query: 379 HSRWKL-PDTAAVEPASPSPSMRPTRSPASR*HWSSSPR-LHVAEQNMS--AQALRVRLE 212
              W   P T    PA  SP     ++PA    W+S P+ L  A    +   +  R R E
Sbjct: 552 APAWNSPPQTLRQAPAWNSPPQTLRQAPA----WNSPPQTLRQAPPGTAHHRRYARRRPE 607

Query: 211 PPCCWPTPAAS 179
            P    TP A+
Sbjct: 608 QPTTDATPGAA 618
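The Frame values in the alignments above can be cross-checked against the query coordinates. The Python sketch below is illustrative only: the function names are hypothetical, and it assumes the usual convention that forward frames are counted from query position 1 and reverse frames from the last base of the 557-letter query. Under that assumption it reproduces every Frame value listed in this report.

```python
QUERY_LENGTH = 557  # from "(557 letters)" in the report header


def blastx_frame(q_from, q_to, query_length=QUERY_LENGTH):
    """Infer the translation frame from the first query coordinate.

    Forward hits (q_from <= q_to) are counted from position 1;
    reverse hits (q_from > q_to) are counted from the last base.
    """
    if q_from <= q_to:
        return (q_from - 1) % 3 + 1
    return -((query_length - q_from) % 3 + 1)


def query_codons(q_from, q_to):
    """Number of query codons covered by the nucleotide range."""
    return (abs(q_to - q_from) + 1) // 3


# (query start, query end, Frame as reported) for each segment above.
hsps = [
    (436, 293, -2),  # pir||T10741, 36.2 bits
    (173, 93, -1),   # pir||T10741, 30.4 bits
    (173, 99, -1),   # pir||T10741, 28.9 bits
    (469, 206, -2),  # pir||T10741, 22.7 bits
    (73, 378, +1),   # NP_501120.1
    (538, 269, -2),  # XP_233556.2
    (136, 387, +1),  # ZP_00033604.1
    (550, 179, -2),  # XP_294174.2
]

for q_from, q_to, reported in hsps:
    inferred = blastx_frame(q_from, q_to)
    print(q_from, q_to, reported, inferred, query_codons(q_from, q_to))
```

For the ungapped first segment, query positions 436 to 293 span 144 nucleotides, or 48 codons, which matches the alignment length in "Identities = 18/48".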



EST assemble image


No.  Clone       Accession  Start  End
1    LC003a05_r  AV619043   1      557
2    HC079b11_r  AV637908   16     508
3    MX053a06_r  BP088159   23     347
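Since the assembly image itself is not reproduced here, the clone table can be turned into a simple coverage summary. This is a minimal Python sketch under one assumption: the two numbers in the position column are 1-based, inclusive start and end coordinates on the 557-base contig. The variable and function names are illustrative.

```python
CONTIG_LENGTH = 557  # length of KCC002123A_c01 as reported by BLASTX

# (clone, accession, start, end) copied from the table above.
clones = [
    ("LC003a05_r", "AV619043", 1, 557),
    ("HC079b11_r", "AV637908", 16, 508),
    ("MX053a06_r", "BP088159", 23, 347),
]


def span_length(start, end):
    """Number of contig positions covered by one EST (inclusive ends)."""
    return end - start + 1


def depth_at(position):
    """How many ESTs cover a given 1-based contig position."""
    return sum(1 for _, _, start, end in clones if start <= position <= end)


lengths = [span_length(start, end) for _, _, start, end in clones]
depth = [depth_at(p) for p in range(1, CONTIG_LENGTH + 1)]

for (name, accession, start, end), length in zip(clones, lengths):
    print(f"{name} ({accession}): {start}-{end}, {length} bases")
print("positions covered by all three ESTs:", depth.count(3))
```

Under this reading, LC003a05_r spans the whole contig, so every position is supported by at least one EST, and positions 23 to 347 are covered by all three.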




Chlamydomonas reinhardtii
Kazusa DNA Research Institute