KCC001971A_c01
Fasta Sequence
>KCC001971A_C01 KCC001971A_c01
CGACATTAAAGCTGTCTAGGAAGGGCCTGAGCGCCCGGTAGCTTTGCACTCAATCCTTTG
CTCGCTCTTTCCTAGCTCGGGCCGCATCATGGCGCTACGCGCGTTGGGCCGTCAGCTGCG
TAGCCTGAACCTGCTGCCAGCTGTGCAGCCAGCTCGCTTCTTCGGCGCGGGAGCCCACCA
CGATGACGAGCATGAGGAGGAGGAGCACGGCCCCGCTCAGACGCCCACGGTCTTCGATAA
GCTGGTGGAGGTGACCGTGGTCGACATGAACGGCATTCGCCACAGGGTCCGCGGCCTGCA
GGGCCAGAGCCTGGCGCAGGCGCTGGTCGAGTATGGCTTCCCGGACACCTATTTCTTCCC
CAACATGGGCTTCTACACGCAACACATTGTGGATGCGCATGTGTTCGTGCCCAAGGAGTT
CTGGGGCAAGGTGCAGAACGTGGACCCGGAGAGCGACGACGGCCTCGCGGTGAAGCGCAT
GTTCCGCGACATCGTGCAGGATTACCAGCGCGACACCTCGTTCTTCGCCTCCTACATCAC
GCTGGGGGCAGAGCACAACGGTATGCCGTGGGCATCGG
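As a quick consistency check, the record above can be parsed and its length compared with the 578 letters reported for this query in the BLASTX header. The following is a minimal Python sketch; the helper names `parse_fasta` and `gc_content` are illustrative assumptions and not part of the database or of any tool.

```python
# Illustrative sketch: parse the single FASTA record shown above.
# parse_fasta and gc_content are hypothetical helper names.

FASTA_TEXT = """\
>KCC001971A_C01 KCC001971A_c01
CGACATTAAAGCTGTCTAGGAAGGGCCTGAGCGCCCGGTAGCTTTGCACTCAATCCTTTG
CTCGCTCTTTCCTAGCTCGGGCCGCATCATGGCGCTACGCGCGTTGGGCCGTCAGCTGCG
TAGCCTGAACCTGCTGCCAGCTGTGCAGCCAGCTCGCTTCTTCGGCGCGGGAGCCCACCA
CGATGACGAGCATGAGGAGGAGGAGCACGGCCCCGCTCAGACGCCCACGGTCTTCGATAA
GCTGGTGGAGGTGACCGTGGTCGACATGAACGGCATTCGCCACAGGGTCCGCGGCCTGCA
GGGCCAGAGCCTGGCGCAGGCGCTGGTCGAGTATGGCTTCCCGGACACCTATTTCTTCCC
CAACATGGGCTTCTACACGCAACACATTGTGGATGCGCATGTGTTCGTGCCCAAGGAGTT
CTGGGGCAAGGTGCAGAACGTGGACCCGGAGAGCGACGACGGCCTCGCGGTGAAGCGCAT
GTTCCGCGACATCGTGCAGGATTACCAGCGCGACACCTCGTTCTTCGCCTCCTACATCAC
GCTGGGGGCAGAGCACAACGGTATGCCGTGGGCATCGG
"""


def parse_fasta(text):
    """Return (header, sequence) for a single-record FASTA string."""
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("expected a FASTA header line starting with '>'")
    header = lines[0][1:]                   # drop the leading '>'
    sequence = "".join(lines[1:]).upper()   # join the wrapped sequence lines
    return header, sequence


def gc_content(sequence):
    """Fraction of G and C bases in the sequence."""
    return (sequence.count("G") + sequence.count("C")) / len(sequence)


header, sequence = parse_fasta(FASTA_TEXT)
print(header.split()[0], len(sequence))
print(f"GC content: {gc_content(sequence):.3f}")
```

The printed length should agree with the "(578 letters)" line of the search report; a mismatch would suggest the sequence was truncated or altered in copying.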
Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KCC001971A_C01 KCC001971A_c01
(578 letters)
Database: nr
1,537,769 sequences; 498,525,298 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|AAH57590.1| 3110043L15Rik protein [Mus musculus]                   58   1e-07
sp|P03181|YHL1_EBV HYPOTHETICAL BHLF1 PROTEIN gi|73912|pir||QQBE...   56   4e-07
ref|NP_492875.2| pre-mRNA splicing SR protein related (68.2 kD) ...   55   7e-07
ref|NP_055541.1| ProSAPiP2 protein [Homo sapiens] gi|3882271|dbj...   55   9e-07
gb|AAA66445.1| unknown protein                                        52   8e-06
>gb|AAH57590.1| 3110043L15Rik protein [Mus musculus]
Length = 596
Score = 57.8 bits (138), Expect = 1e-07
Identities = 51/143 (35%), Positives = 62/143 (42%), Gaps = 12/143 (8%)
Frame = -1
Query: 497 ARCRGTCASPRGRRRSPGPRSAPCPRTPWARTHAHPQCVACRSPCWGRNRCPGSHTRPAP 318
AR G SP +R SP P + P P P AR P C C SP + R P S + P+P
Sbjct: 327 ARNAGQRHSPLSQRHSPAP-ACPSPSPP-ARP---PPCAPCPSP--QQRRSPASPSCPSP 379
Query: 317 AP-----------GSGPAGRGPCG-ECRSCRPRSPPPAYRRPWASERGRAPPPHARHRGG 174
P P R P C + +PR PPP R A ER A PP + G
Sbjct: 380 VPQRRSPVPPSCQSPSPQRRSPVPPSCPAPQPRPPPPPGERTLA-ERVYAKPPSHHAKAG 438
Query: 173 LPRRRSELAAQLAAGSGYAADGP 105
RRS ++LA G+ YA P
Sbjct: 439 FQGRRS--YSELAEGAAYAGASP 459
>sp|P03181|YHL1_EBV HYPOTHETICAL BHLF1 PROTEIN gi|73912|pir||QQBE3 BHLF1 protein -
human herpesvirus 4 (strain B95-8)
gi|23893591|emb|CAD53473.1| BHLF1 early reading frame
[Human herpesvirus 4]
Length = 660
Score = 55.8 bits (133), Expect = 4e-07
Identities = 58/144 (40%), Positives = 63/144 (43%), Gaps = 18/144 (12%)
Frame = -1
Query: 518 RCRAGNPARCRGTCASPRGRRRSPG-PRSAP---CPRTPWAR----THAHPQCVACR--- 372
RC AG P R A+ R RR PG PRSA CPRT W R HP A +
Sbjct: 372 RCPAGPPPT-RSGAAAQRTHRRPPGCPRSARNPGCPRT-WRRRSGAQRGHPPPGAGQRPS 429
Query: 371 SPCWGRNRCPGSHTRP-APAPGSG---PAGRGPCGECRSCRPRSPPPAYRRPWASERGRA 204
P GR PG+ P AP PG G P+G P E R P PP A R P + R
Sbjct: 430 GPTGGRPAAPGAPGTPAAPGPGGGAAVPSGATPHPE-RGSGPADPPAAARLPPERQEPRL 488
Query: 203 PPPHA---RHRGGLPRRRSELAAQ 141
P A R G P RS AAQ
Sbjct: 489 PQDLAAAQRCPAGPPPTRSGAAAQ 512
Score = 55.8 bits (133), Expect = 4e-07
Identities = 58/144 (40%), Positives = 63/144 (43%), Gaps = 18/144 (12%)
Frame = -1
Query: 518 RCRAGNPARCRGTCASPRGRRRSPG-PRSAP---CPRTPWAR----THAHPQCVACR--- 372
RC AG P R A+ R RR PG PRSA CPRT W R HP A +
Sbjct: 497 RCPAGPPPT-RSGAAAQRTHRRPPGCPRSARNPGCPRT-WRRRSGAQRGHPPPGAGQRPS 554
Query: 371 SPCWGRNRCPGSHTRP-APAPGSG---PAGRGPCGECRSCRPRSPPPAYRRPWASERGRA 204
P GR PG+ P AP PG G P+G P E R P PP A R P + R
Sbjct: 555 GPTGGRPAAPGAPGTPAAPGPGGGAAVPSGATPHPE-RGSGPADPPAAARLPPERQEPRL 613
Query: 203 PPPHA---RHRGGLPRRRSELAAQ 141
P A R G P RS AAQ
Sbjct: 614 PQDLAAAQRCPAGPPPTRSGAAAQ 637
Score = 55.8 bits (133), Expect = 4e-07
Identities = 58/144 (40%), Positives = 63/144 (43%), Gaps = 18/144 (12%)
Frame = -1
Query: 518 RCRAGNPARCRGTCASPRGRRRSPG-PRSAP---CPRTPWAR----THAHPQCVACR--- 372
RC AG P R A+ R RR PG PRSA CPRT W R HP A +
Sbjct: 247 RCPAGPPPT-RSGAAAQRTHRRPPGCPRSARNPGCPRT-WRRRSGAQRGHPPPGAGQRPS 304
Query: 371 SPCWGRNRCPGSHTRP-APAPGSG---PAGRGPCGECRSCRPRSPPPAYRRPWASERGRA 204
P GR PG+ P AP PG G P+G P E R P PP A R P + R
Sbjct: 305 GPTGGRPAAPGAPGTPAAPGPGGGAAVPSGATPHPE-RGSGPADPPAAARLPPERQEPRL 363
Query: 203 PPPHA---RHRGGLPRRRSELAAQ 141
P A R G P RS AAQ
Sbjct: 364 PQDLAAAQRCPAGPPPTRSGAAAQ 387
Score = 39.7 bits (91), Expect = 0.031
Identities = 46/164 (28%), Positives = 56/164 (34%), Gaps = 7/164 (4%)
Frame = -1
Query: 485 GTCASPRGRRRSPGPRSAPCPRTPWARTHAHPQCVACRSP--CWGRNRCPGSHTRPAPAP 312
G A P G +P P P P A P+ R P RCP
Sbjct: 202 GGAAVPSGA--TPHPERGSGPADPPAAARLPPERQEPRLPQDLAAAQRCPAGPPPTRSGA 259
Query: 311 GSGPAGRGPCGECRSCRPRSPPPAYRRPWASERGRAPPPHARHR-----GGLPRRRSELA 147
+ R P G RS R P +RR ++RG PPP A R GG P
Sbjct: 260 AAQRTHRRPPGCPRSARNPGCPRTWRRRSGAQRGH-PPPGAGQRPSGPTGGRPAAPGAPG 318
Query: 146 AQLAAGSGYAADGPTRVAP*CGPS*ERASKGLSAKLPGAQALPR 15
A G G A P+ P A +A+LP + PR
Sbjct: 319 TPAAPGPGGGAAVPSGATPHPERGSGPADPPAAARLPPERQEPR 362
Score = 33.1 bits (74), Expect = 2.9
Identities = 51/170 (30%), Positives = 63/170 (37%), Gaps = 19/170 (11%)
Frame = -1
Query: 467 RGRRRSPGP-----RSAPCPRTPWARTHAHPQCVACRSPCWGRNRCPGSHTRPAP-APGS 306
RGR +P P R+ P + A H++P C P R P TR A A G
Sbjct: 78 RGRPGTPAPSRQSRRTGPAEQADHA--HSNPTG-GCSDP----QRSP--RTRQAGYALGE 128
Query: 305 GPAGRGPCGECR--------SCRPRSPPPAYRRPWASERGRAPPPHARHR-----GGLPR 165
G AG G G S R P +RR ++RG PPP A R GG P
Sbjct: 129 GSAGLGSRGPRPHPAFQVQWSARNPGCPRTWRRRSGAQRGH-PPPGAGQRPSGPTGGRPA 187
Query: 164 RRSELAAQLAAGSGYAADGPTRVAP*CGPS*ERASKGLSAKLPGAQALPR 15
A G G A P+ P A +A+LP + PR
Sbjct: 188 APGAPGTPAAPGPGGGAAVPSGATPHPERGSGPADPPAAARLPPERQEPR 237
>ref|NP_492875.2| pre-mRNA splicing SR protein related (68.2 kD) (rsr-1)
[Caenorhabditis elegans] gi|19571645|emb|CAB04214.3| C.
elegans RSR-1 protein (corresponding sequence F28D9.1)
[Caenorhabditis elegans]
Length = 601
Score = 55.1 bits (131), Expect = 7e-07
Identities = 52/159 (32%), Positives = 71/159 (43%), Gaps = 18/159 (11%)
Frame = -1
Query: 548 PPA*CRRRRTRCRAGNPARCRGTCASPRGRRRSPGP-RSAPCPRTPWARTHAHPQCVACR 372
PPA RRRR+ ++ +PA R P RRRSP +S P P+ +R+ + P R
Sbjct: 385 PPA-PRRRRSPSKSRSPAPKREI--PPARRRRSPSASKSPPAPKRAKSRSKSPPAPRRRR 441
Query: 371 SPCWGRNRCPGSHTRPAPAPGSGPAGRGPCG-ECRSCRPRSPPPA--------------- 240
SP ++ P P+ +P + R P G + RS R R P A
Sbjct: 442 SPSQSKSPAPRRRRSPSKSPQAPRRRRSPSGSKSRSPRRRRSPAAAPRRRQSPQRRRSPR 501
Query: 239 -YRRPWASERGRAPPPHARHRGGLPRRRSELAAQLAAGS 126
R P +S R R+PPP R PR+ SE A +A S
Sbjct: 502 RRRSPSSSSRSRSPPPPPRR----PRQDSEQQAPVAVKS 536
Score = 49.3 bits (116), Expect = 4e-05
Identities = 41/132 (31%), Positives = 56/132 (42%), Gaps = 6/132 (4%)
Frame = -1
Query: 518 RCRAGNPARCRGTCAS----PRGRRRSPGPRSAPCPRTPWARTHAHPQCVACRSPCWGRN 351
R ++G+P R R AS P RRRSP +P P+ +R+ + P R
Sbjct: 296 RAKSGSPRRRRSPSASKSPPPARRRRSPSQSKSPAPKRAKSRSKSPPAPAR-------RR 348
Query: 350 RCPGSHTRPAPAP--GSGPAGRGPCGECRSCRPRSPPPAYRRPWASERGRAPPPHARHRG 177
R P + P PAP + P RS PPA RR + + R+P P R
Sbjct: 349 RSPSASKSPPPAPKRAKSRSKSPPARRRRSPSASKSPPAPRRRRSPSKSRSPAP-KREIP 407
Query: 176 GLPRRRSELAAQ 141
RRRS A++
Sbjct: 408 PARRRRSPSASK 419
>ref|NP_055541.1| ProSAPiP2 protein [Homo sapiens] gi|3882271|dbj|BAA34495.1|
KIAA0775 protein [Homo sapiens]
Length = 615
Score = 54.7 bits (130), Expect = 9e-07
Identities = 48/129 (37%), Positives = 58/129 (44%), Gaps = 4/129 (3%)
Frame = -1
Query: 479 CASPRGRRRSPGPRSAPCPRTPWARTHAHPQCVAC----RSPCWGRNRCPGSHTRPAPAP 312
C SP +RRSP P PCP R+ A P C + RSP + P S R +P P
Sbjct: 363 CQSPVPQRRSPVP---PCPSPQQRRSPASPSCPSPVPQRRSPVPPSCQSP-SPQRRSPVP 418
Query: 311 GSGPAGRGPCGECRSCRPRSPPPAYRRPWASERGRAPPPHARHRGGLPRRRSELAAQLAA 132
S PA + RP PPP R A ER A PP + G RRS ++LA
Sbjct: 419 PSCPAPQP--------RPPPPPPPGERTLA-ERAYAKPPSHHVKAGFQGRRS--YSELAE 467
Query: 131 GSGYAADGP 105
G+ YA P
Sbjct: 468 GAAYAGASP 476
>gb|AAA66445.1| unknown protein
Length = 296
Score = 51.6 bits (122), Expect = 8e-06
Identities = 49/121 (40%), Positives = 54/121 (44%), Gaps = 15/121 (12%)
Frame = -1
Query: 518 RCRAGNPARCRGTCASPRGRRRSPG-PRSAP---CPRTPWAR----THAHPQCVACR--- 372
RC AG P R A+ R RR PG PRSA CPRT W R HP A +
Sbjct: 87 RCPAGPPPT-RSGAAAQRTHRRPPGCPRSARNPGCPRT-WRRRSGAQRGHPPPGAGQRPS 144
Query: 371 SPCWGRNRCPGSHTRP-APAPGSG---PAGRGPCGECRSCRPRSPPPAYRRPWASERGRA 204
P GR PG+ P AP PG G P+G P E R P PP A R P + R
Sbjct: 145 GPTGGRPAAPGAPGTPAAPGPGGGAAVPSGATPHPE-RGSGPADPPAAARLPPERQEPRL 203
Query: 203 P 201
P
Sbjct: 204 P 204
Score = 35.0 bits (79), Expect = 0.75
Identities = 34/117 (29%), Positives = 42/117 (35%), Gaps = 5/117 (4%)
Frame = -1
Query: 350 RCPGSHTRPAPAPGSGPAGRGPCGECRSCRPRSPPPAYRRPWASERGRAPPPHARHR--- 180
RCP + R P G RS R P +RR ++RG PPP A R
Sbjct: 87 RCPAGPPPTRSGAAAQRTHRRPPGCPRSARNPGCPRTWRRRSGAQRGH-PPPGAGQRPSG 145
Query: 179 --GGLPRRRSELAAQLAAGSGYAADGPTRVAP*CGPS*ERASKGLSAKLPGAQALPR 15
GG P A G G A P+ P A +A+LP + PR
Sbjct: 146 PTGGRPAAPGAPGTPAAPGPGGGAAVPSGATPHPERGSGPADPPAAARLPPERQEPR 202
EST assemble image
No.  clone       accession  position (start, end)
1    MX223b08_r  BP090380     1  418
2    HC087h10_r  AV638560   100  568
3    HC066b04_r  AV636922   100  607
4    LC084g07_r  AV624936   104  623
5    LC019g08_r  AV620264   106  639
6    LC027b04_r  AV620808   107  587
7    LC051a02_r  AV622538   107  608
8    MX201g09_r  BP089157   110  459
9    LC059d01_r  AV623133   116  653
10   HC100g04_r  AV639541   117  523
11   HC036g04_r  AV634704   125  607
12   MX055h02_r  BP088270   138  445
13   LC075c09_r  AV624266   177  684
Chlamydomonas reinhardtii
Kazusa DNA Research Institute