KCC002953A_c01

Fasta Sequence
>KCC002953A_C01 KCC002953A_c01
ctacataCGATATTATACACATACAGTAAATCCTCTTATCTGCCGTCATCCCCCAGTGGC
CCAGCGAGGAGGCCAGCCAAAAAAATGCCGCGCTCGTCTCGCTACGCAGTTGTGGTGGAC
AACATCTCCTCCAACACGCCCGTTCGCGACATCGAGCGCGAGTTCGCCTTCTTTGGCCGC
ATCCGCGACTGCGTCAAGGATGGAAAGCACCGCCTGGCGCTGATTGAGTTCGAGAAGTCC
CAGGATGCCACTGCGGCCTGGCGCAAGATGGATGGGTTCCGCATGGACGGCCGGCAGTGG
CGTGTGGAGTACGCCACGCGGGAGGACTTCCGGTTCTTCGGCTGGAAATGGTTTGAGCAC
TCGCCCTCGCCCCCGCGGTACCGGTCGCGCTCCCGCTCCCCGCGCCGCTCGCGCTCGCCG
TCCCGGTCGCCTGTGCGCCGCTCCCGCTCGCCCGACACCCCTGGTGGTAACGATGGCCGG
ACACGGAATGCATCCAAGTCGACCTCGCCGCCCCCGCGTCGTGAGGA
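
The BLASTX alignments reported for this contig are all in frame +1, so the query residues shown there should correspond to a direct forward translation of the sequence above. As a minimal sketch, assuming the standard genetic code applies, the first sequence line can be translated as follows. The helper name `translate_forward` is hypothetical and is not part of BLAST.

```python
# Standard genetic code (an assumption), with codons ordered T, C, A, G
# at each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_forward(seq, frame=1):
    """Translate a nucleotide string in forward frame 1, 2, or 3.

    Lowercase bases are treated like uppercase; any codon that is not
    in the table (for example one containing N) becomes 'X'.
    """
    seq = seq.upper()
    start = frame - 1
    # Drop trailing bases that do not form a complete codon.
    end = start + (len(seq) - start) // 3 * 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(start, end, 3)
    )


# First line of the FASTA record above (60 bases, 20 codons).
first_line = "ctacataCGATATTATACACATACAGTAAATCCTCTTATCTGCCGTCATCCCCCAGTGGC"
protein_start = translate_forward(first_line, frame=1)
print(protein_start)
```

The printed residues can be compared with the start of the first `Query:` line in the BLASTX report. Query coordinates in that report count nucleotides, which is why a 56-residue row spans positions 1 to 168.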


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC002953A_C01 KCC002953A_c01
         (527 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_317163.1| ENSANGP00000018287 [Anopheles gambiae] gi|21300...    70  1e-11
ref|NP_495013.1| serine/aRginine rich pre-mRNA SPlicing factor, ...    69  4e-11
gb|AAK93651.1| unknown protein [Arabidopsis thaliana]                  64  1e-09
pir||T50647 serine/arginine-rich protein [imported] - Arabidopsi...    64  1e-09
gb|AAH46668.1| Similar to splicing factor, arginine/serine-rich ...    62  3e-09

>ref|XP_317163.1| ENSANGP00000018287 [Anopheles gambiae] gi|21300086|gb|EAA12231.1|
           ENSANGP00000018287 [Anopheles gambiae str. PEST]
          Length = 264

 Score = 70.5 bits (171), Expect = 1e-11
 Identities = 70/221 (31%), Positives = 94/221 (41%), Gaps = 47/221 (21%)
 Frame = +1

Query: 1   LHTILYTYSKSSYLPSSPSGPARRPAKKMPRSSRYA----VVVDNISSNTPVRDIEREFA 168
           LHT   TY+  +  P S   P+ R   +  + SRY     V V  + +N   +DIE  F 
Sbjct: 8   LHTRTTTYNTRA--PRSVRTPSHRAVNEQSKMSRYPHDAKVYVGELGNNASKQDIEEAFG 65

Query: 169 FFGRIRDC-VKDGKHRLALIEFEKSQDATAAWRKMDGFRMDGRQWRVEYAT--------- 318
           ++G +R+  V       A +EFE ++DA  A R +DG  + GR+ RVE +T         
Sbjct: 66  YYGPLRNVWVARNPPGFAFVEFEDARDAEDAVRGLDGRTISGRRARVELSTGRGGRGGGG 125

Query: 319 ---------------REDFRFFGWKWFEH-----SPSPPRYRSRSRSPRR---------- 408
                          + D R +      H     + S  R R RSRSPR           
Sbjct: 126 GRGGPPRGGGKGGRFQSDDRCYECGGRGHFARDCARSGRRGRKRSRSPRSRSRELRTRSR 185

Query: 409 --SRSPSRSPVRRSRSPDTP-GGNDGRTRNASKSTSPPPRR 522
             SRS SRS  RRSRS  TP  G     R+ S+  S  PRR
Sbjct: 186 SYSRSRSRSRDRRSRSKATPKRGAAANNRSLSRDVSKTPRR 226

>ref|NP_495013.1| serine/aRginine rich pre-mRNA SPlicing factor, Serpin (22.6 kD)
           (rsp-4) [Caenorhabditis elegans]
           gi|3929375|sp|Q09511|SFR2_CAEEL Putative splicing
           factor, arginine/serine-rich 2 (Splicing factor SC35)
           (SC-35) (Splicing component, 35 kDa)
           gi|7439997|pir||T15917 hypothetical protein EEED8.7 -
           Caenorhabditis elegans gi|733604|gb|AAC46767.1| Sr
           protein (splicing factor) protein 4, isoform a
           [Caenorhabditis elegans]
          Length = 196

 Score = 68.9 bits (167), Expect = 4e-11
 Identities = 59/154 (38%), Positives = 75/154 (48%), Gaps = 17/154 (11%)
 Frame = +1

Query: 115 VDNISSNTPVRDIEREFAFFGRI------RDCVKDGKHRLALIEFEKSQDATAAWRKMDG 276
           +DN+S  T   D+ R F  +G I      RD           + F + +DA  A  + DG
Sbjct: 23  IDNLSYQTTPNDLRRTFERYGDIGDVHIPRDKYSRQSKGFGFVRFYERRDAEHALDRTDG 82

Query: 277 FRMDGRQWRV---EYATREDFRFF--GWKWFEHSPSP------PRYRSRSRSPRRSRSPS 423
             +DGR+ RV   +Y    D R    G      S SP      PRY SRSRSPRRSRS +
Sbjct: 83  KLVDGRELRVTLAKYDRPSDERGGRGGGGGRRRSRSPRRRSRSPRY-SRSRSPRRSRSRT 141

Query: 424 RSPVRRSRSPDTPGGNDGRTRNASKSTSPPPRRE 525
           RSP  R R  D+P   D R  + S+S SPPPR +
Sbjct: 142 RSPPSRDRR-DSP---DRRDNSRSRSRSPPPRED 171

>gb|AAK93651.1| unknown protein [Arabidopsis thaliana]
          Length = 263

 Score = 63.9 bits (154), Expect = 1e-09
 Identities = 67/217 (30%), Positives = 89/217 (40%), Gaps = 54/217 (24%)
 Frame = +1

Query: 34  SYLPSSPSGPARRPAKKMPRSSRY---------AVVVDNISSNTPVRDIEREFAFFGRI- 183
           SY PS P G  RR     PR  RY         +++V N+  +    D+ + F  FG + 
Sbjct: 5   SYTPSPPRGYGRRGRSPSPRG-RYGGRSRDLPTSLLVRNLRHDCRQEDLRKSFEQFGPVK 63

Query: 184 -----RDCVKDGKHRLALIEFEKSQDATAAWRKMDGFRMDGRQWRVEYATREDFRFF--- 339
                RD           ++F    DA  A   MDG+ + GR+  V +A     +     
Sbjct: 64  DIYLPRDYYTGDPRGFGFVQFMDPADAADAKHHMDGYLLLGRELTVVFAEENRKKPTEMR 123

Query: 340 -----GWKWFEHSPSPPRYRSRSRS--PRRSRSPSRS------PVRR------------- 441
                G ++ +   +PPRY SRSRS  PRR RS SRS      P RR             
Sbjct: 124 ARERGGGRFRDRRRTPPRYYSRSRSPPPRRGRSRSRSGDYYSPPPRRHHPRSISPREERY 183

Query: 442 ------SRSPDTPGGNDGRT----RNASKSTSPPPRR 522
                 SRSP + G   GR+    R  S+S SP PRR
Sbjct: 184 DGRRSYSRSPASDGSR-GRSLTPVRGKSRSLSPSPRR 219

 Score = 35.8 bits (81), Expect = 0.35
 Identities = 26/52 (50%), Positives = 28/52 (53%), Gaps = 6/52 (11%)
 Frame = +1

Query: 361 SPSPPRYRSRSRSPRRSRSPSRS------PVRRSRSPDTPGGNDGRTRNASK 498
           SPSP R  S SRSPRRSRSP RS         RSRS    GG     R+ S+
Sbjct: 214 SPSPRR--SISRSPRRSRSPRRSRRSYTPEPARSRSQSPHGGQYDEDRSPSQ 263

 Score = 35.0 bits (79), Expect = 0.59
 Identities = 25/54 (46%), Positives = 30/54 (55%)
 Frame = +1

Query: 361 SPSPPRYRSRSRSPRRSRSPSRSPVRRSRSPDTPGGNDGRTRNASKSTSPPPRR 522
           S +P R +SRS SP   RS SRSP RRSRSP          R + +S +P P R
Sbjct: 202 SLTPVRGKSRSLSPSPRRSISRSP-RRSRSP----------RRSRRSYTPEPAR 244

>pir||T50647 serine/arginine-rich protein [imported] - Arabidopsis thaliana
           gi|6572475|gb|AAF17288.1|AF099940_1 Serine/arginine-rich
           protein [Arabidopsis thaliana]
           gi|9843659|emb|CAC03603.1| SC35-like splicing factor
           SCL33, 33 kD [Arabidopsis thaliana]
          Length = 287

 Score = 63.9 bits (154), Expect = 1e-09
 Identities = 67/217 (30%), Positives = 89/217 (40%), Gaps = 54/217 (24%)
 Frame = +1

Query: 34  SYLPSSPSGPARRPAKKMPRSSRY---------AVVVDNISSNTPVRDIEREFAFFGRI- 183
           SY PS P G  RR     PR  RY         +++V N+  +    D+ + F  FG + 
Sbjct: 5   SYTPSPPRGYGRRGRSPSPRG-RYGGRSRDLPTSLLVRNLRHDCRQEDLRKSFEQFGPVK 63

Query: 184 -----RDCVKDGKHRLALIEFEKSQDATAAWRKMDGFRMDGRQWRVEYATREDFRFF--- 339
                RD           ++F    DA  A   MDG+ + GR+  V +A     +     
Sbjct: 64  DIYLPRDYYTGDPRGFGFVQFMDPADAADAKHHMDGYLLLGRELTVVFAEENRKKPTEMR 123

Query: 340 -----GWKWFEHSPSPPRYRSRSRS--PRRSRSPSRS------PVRR------------- 441
                G ++ +   +PPRY SRSRS  PRR RS SRS      P RR             
Sbjct: 124 ARERGGGRFRDRRRTPPRYYSRSRSPPPRRGRSRSRSGDYYSPPPRRHHPRSISPREERY 183

Query: 442 ------SRSPDTPGGNDGRT----RNASKSTSPPPRR 522
                 SRSP + G   GR+    R  S+S SP PRR
Sbjct: 184 DGRRSYSRSPASDGSR-GRSLTPVRGKSRSLSPSPRR 219

 Score = 36.2 bits (82), Expect = 0.27
 Identities = 31/68 (45%), Positives = 34/68 (49%), Gaps = 14/68 (20%)
 Frame = +1

Query: 361 SPSPPRYRSRSRSPRRSRSP--------------SRSPVRRSRSPDTPGGNDGRTRNASK 498
           SPSP R  S SRSPRRSRSP              SRSP RRSRSP          R + +
Sbjct: 214 SPSPRR--SISRSPRRSRSPSPKRNRSVSPRRSISRSP-RRSRSP----------RRSRR 260

Query: 499 STSPPPRR 522
           S +P P R
Sbjct: 261 SYTPEPAR 268

 Score = 33.5 bits (75), Expect = 1.7
 Identities = 24/58 (41%), Positives = 26/58 (44%), Gaps = 12/58 (20%)
 Frame = +1

Query: 361 SPSPPRYRS------------RSRSPRRSRSPSRSPVRRSRSPDTPGGNDGRTRNASK 498
           SPSP R RS            RSRSPRRSR        RSRS    GG     R+ S+
Sbjct: 230 SPSPKRNRSVSPRRSISRSPRRSRSPRRSRRSYTPEPARSRSQSPHGGQYDEDRSPSQ 287

>gb|AAH46668.1| Similar to splicing factor, arginine/serine-rich 6 [Xenopus laevis]
          Length = 667

 Score = 62.4 bits (150), Expect = 3e-09
 Identities = 52/151 (34%), Positives = 76/151 (49%), Gaps = 13/151 (8%)
 Frame = +1

Query: 88  PRSSRYAVVVDNISSNTPVRDIEREFAFFGRIR--DCVKDGKHRLALIEFEKSQDATAAW 261
           P  + + +VV+N+SS    +D++      G +   D  K+  +   +IEF    D   A 
Sbjct: 101 PVRTEFRLVVENLSSRCSWQDLKDFMRQAGEVTYADAHKERANE-GVIEFRSYSDMKRAV 159

Query: 262 RKMDGFRMDGRQWR-VEYATREDFRFFGWKWFEHSPSPPRYRSRSRSPR--RSRSPSRSP 432
            K+DG  ++GR+ R VE  TR    + G      S S  R RSRSR P   RSRS SRSP
Sbjct: 160 EKLDGTEINGRRIRLVEGKTRHRRPYSGSHSRSRSRSRRRSRSRSRHPSHSRSRSQSRSP 219

Query: 433 VRRSRS--------PDTPGGNDGRTRNASKS 501
            ++SRS          +PG +  R+R+ S+S
Sbjct: 220 AKKSRSRSLAKSSHSQSPGKSQSRSRSRSRS 250



EST assemble image


No.  Clone       Accession  Start  End
1    LC100c12_r  AV625958       1  463
2    MX045g06_r  BP087887       8  468
3    MX066e04_r  BP088687       8  344
4    MX011h04_r  BP086575       8  439
5    LC058d10_r  AV623063      22  527
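
The clone placements listed above can be turned into a per-base coverage depth for the 527-letter contig. This is a rough sketch that assumes the two position values are 1-based, inclusive start and end coordinates on the contig. That reading is not stated in the record, and the names `PLACEMENTS` and `coverage_depth` are hypothetical.

```python
CONTIG_LENGTH = 527  # query length reported in the BLASTX header

# (clone, accession, start, end), copied from the assembly table.
# Assumption: start and end are 1-based, inclusive contig coordinates.
PLACEMENTS = [
    ("LC100c12_r", "AV625958", 1, 463),
    ("MX045g06_r", "BP087887", 8, 468),
    ("MX066e04_r", "BP088687", 8, 344),
    ("MX011h04_r", "BP086575", 8, 439),
    ("LC058d10_r", "AV623063", 22, 527),
]


def coverage_depth(placements, length):
    """Return a list where depth[pos] is the number of ESTs covering pos.

    Index 0 is unused so that list indices match 1-based coordinates.
    """
    depth = [0] * (length + 1)
    for _clone, _accession, start, end in placements:
        for pos in range(start, end + 1):
            depth[pos] += 1
    return depth


depth = coverage_depth(PLACEMENTS, CONTIG_LENGTH)
print("minimum depth:", min(depth[1:]))
print("maximum depth:", max(depth[1:]))
```

Under the stated assumption, every contig position is covered by at least one EST, and the ends of the contig are supported by a single clone each.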




Chlamydomonas reinhardtii
Kazusa DNA Research Institute