KCC001151A_c01
Fasta Sequence
>KCC001151A_C01 KCC001151A_c01
gcacgagAAGTGCTTGGACACAGCTCTAAAGCTCGCTACTACGGCCGGCTTGAGACTTTC
AGTGACCGGGTCTACTACGCTTTGCGGTCGCGACGAGTTTGCCAAATGGCCCTCGCCTTC
ACCTCTCGCGTCGCCTCTCGGCCCCAGCTGCACGCACGGAGCGCAGCAAGCCATGCGGCG
GCTGTTCCCGCGGTGGCTCGCCCGGCGACCCCCGCAACCGTAAGCGCCGCTGCCCCCAGC
AGCAGCACCCGCAGCAGCGGCGTGGCCTGCCGGGCGGTGCGTATGGATGCGGAGCTGCAG
GCGCTGCTGGAGGGAGCCATGGCTCGGTCGTCATGCACCGTGGGCGACCGCGTGGGCATC
ATCGGGCTGAAGTTTGCGGAGGAGGGCCAGAAGGAGGGCTGCCCGGTCATGGAGTCGGCC
ATCCGCCTGAGCATCTACCTGCGTGACTACAAGGCCAGCGACGTACGCGGCCACAGCATC
CTGGAGCTCGGCACCGGCATCGGTGTGGCGGGTCTGACTCTGGCGGCGTACGGCGCGCAC
GTCCTGCTGACCGAACTT
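The record above is in FASTA format: a header line starting with `>` followed by wrapped sequence lines. As a minimal sketch, assuming the record is available as a plain string, the following Python shows one way to read it and check the sequence length against the 558 letters given on the BLASTX query line. The function name `read_fasta` and the shortened demo record are illustrative assumptions, not part of the database.

```python
def read_fasta(text):
    """Parse FASTA text into a dict mapping record ID -> sequence string."""
    records = {}
    current_id = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # The ID is the first whitespace-delimited token after '>'
            current_id = line[1:].split()[0]
            records[current_id] = []
        elif current_id is not None:
            records[current_id].append(line)
    # Join the wrapped sequence lines of each record
    return {rid: "".join(parts) for rid, parts in records.items()}


# Hypothetical shortened record (only the first and last few bases of the entry)
demo = """>KCC001151A_C01 KCC001151A_c01
gcacgagAAGTGCTTGGACACAGC
GTCCTGCTGACCGAACTT
"""

seqs = read_fasta(demo)
for rid, seq in seqs.items():
    print(rid, len(seq))
```

Run on the full record, the reported length should equal the letter count on the query line if the sequence was copied intact.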
Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KCC001151A_C01 KCC001151A_c01
(558 letters)
Database: nr
1,537,769 sequences; 498,525,298 total letters
Searching..................................................done
Sequences producing significant alignments:    Score (bits)    E Value
ref|XP_315989.1| ENSANGP00000010869 [Anopheles gambiae] gi|21298... 52 7e-06
ref|XP_315988.1| ENSANGP00000005715 [Anopheles gambiae] gi|30175... 52 7e-06
ref|NP_499936.1| putative nuclear protein (4B256) [Caenorhabditi... 51 1e-05
ref|ZP_00137694.1| COG1535: Isochorismate hydrolase [Pseudomonas... 50 3e-05
gb|AAK61382.1| basic proline-rich protein [Sus scrofa] 49 3e-05
>ref|XP_315989.1| ENSANGP00000010869 [Anopheles gambiae] gi|21298800|gb|EAA10945.1|
ENSANGP00000010869 [Anopheles gambiae str. PEST]
Length = 260
Score = 51.6 bits (122), Expect = 7e-06
Identities = 45/144 (31%), Positives = 53/144 (36%), Gaps = 26/144 (18%)
Frame = -3
Query: 529 TPPESDPPHRCRCRAPGC-CGRVRRWPCSHAGRCSGGWPTP*PGSPPSGPPPQTSAR--* 359
TPP + PP R P C G V + C++ G + P P PP+ PPP R
Sbjct: 79 TPPPTRPPPPPPTRPPSCPAGGVPPYCCTNGGSGPNCYVPPPPTPPPTRPPPPPPTRPPS 138
Query: 358 CPRG--RPRCMTT-----------EPWLPPAAPAAPHPYAPP-----GRPRRCC-----G 248
CP G P C T P PP P P P PP G P CC G
Sbjct: 139 CPAGGVPPYCCTNGGSGPNCYVPPPPTPPPTRPPPPPPTRPPSCPAGGVPPYCCTNGGSG 198
Query: 247 CCCWGQRRLRLRGSPGEPPREQPP 176
C+ + P PP PP
Sbjct: 199 PNCY------VPPPPTPPPTRPPP 216
Score = 50.1 bits (118), Expect = 2e-05
Identities = 38/114 (33%), Positives = 44/114 (38%), Gaps = 21/114 (18%)
Frame = -3
Query: 529 TPPESDPPHRCRCRAPGC-CGRVRRWPCSHAGRCSGGWPTP*PGSPPSGPPPQTSAR*-- 359
TPP + PP R P C G V + C++ G + P P PP+ PPP R
Sbjct: 122 TPPPTRPPPPPPTRPPSCPAGGVPPYCCTNGGSGPNCYVPPPPTPPPTRPPPPPPTRPPS 181
Query: 358 CPRGR--PRCMTT-----------EPWLPPAAPAAPHPYAPP-----GRPRRCC 251
CP G P C T P PP P P P PP G P CC
Sbjct: 182 CPAGGVPPYCCTNGGSGPNCYVPPPPTPPPTRPPPPPPTRPPSCPAGGVPPYCC 235
Score = 43.5 bits (101), Expect = 0.002
Identities = 40/129 (31%), Positives = 47/129 (36%), Gaps = 24/129 (18%)
Frame = -3
Query: 490 RAPGC-CGRVRRWPCSHAGRCSGGWPTP*PGSPPSGPPPQTSAR*CPRG--RPRCMTT-- 326
R P C G V + C++ G + P P PP+ PPP T CP G P C T
Sbjct: 9 RPPSCPAGGVPPYCCTNGGTGPNCYVPPPPPPPPTRPPP-TRPPSCPAGGVPPYCCTNGG 67
Query: 325 ---------EPWLPPAAPAAPHPYAPP-----GRPRRCC-----GCCCWGQRRLRLRGSP 203
P PP P P P PP G P CC G C+ + P
Sbjct: 68 TGPNCYVPPPPTPPPTRPPPPPPTRPPSCPAGGVPPYCCTNGGSGPNCY------VPPPP 121
Query: 202 GEPPREQPP 176
PP PP
Sbjct: 122 TPPPTRPPP 130
Score = 39.3 bits (90), Expect = 0.036
Identities = 26/72 (36%), Positives = 32/72 (44%), Gaps = 5/72 (6%)
Frame = -3
Query: 529 TPPESDPPHRCRCRAPGC-CGRVRRWPCSHAGRCSGGWPTP*PGSPPSGPPPQTSAR--* 359
TPP + PP R P C G V + C++ G + P P PP+ PPP R
Sbjct: 165 TPPPTRPPPPPPTRPPSCPAGGVPPYCCTNGGSGPNCYVPPPPTPPPTRPPPPPPTRPPS 224
Query: 358 CPRG--RPRCMT 329
CP G P C T
Sbjct: 225 CPAGGVPPYCCT 236
Score = 34.7 bits (78), Expect = 0.89
Identities = 30/93 (32%), Positives = 32/93 (34%), Gaps = 12/93 (12%)
Frame = -3
Query: 418 PTP*PGSPPSGPPPQTSAR*CPRGR--PRCMTTEPWLPPAAPAAPHPYAPP-----GRPR 260
PT P P G PP C G P C P PP P P P PP G P
Sbjct: 7 PTRPPSCPAGGVPPYC----CTNGGTGPNCYVPPP--PPPPPTRPPPTRPPSCPAGGVPP 60
Query: 259 RCC-----GCCCWGQRRLRLRGSPGEPPREQPP 176
CC G C+ + P PP PP
Sbjct: 61 YCCTNGGTGPNCY------VPPPPTPPPTRPPP 87
>ref|XP_315988.1| ENSANGP00000005715 [Anopheles gambiae] gi|30175861|gb|EAA11805.2|
ENSANGP00000005715 [Anopheles gambiae str. PEST]
Length = 245
Score = 51.6 bits (122), Expect = 7e-06
Identities = 45/144 (31%), Positives = 53/144 (36%), Gaps = 26/144 (18%)
Frame = -3
Query: 529 TPPESDPPHRCRCRAPGC-CGRVRRWPCSHAGRCSGGWPTP*PGSPPSGPPPQTSAR--* 359
TPP + PP R P C G V + C++ G + P P PP+ PPP R
Sbjct: 36 TPPPTRPPPPPPTRPPSCPAGGVPPYCCTNGGSGPNCYVPPPPTPPPTRPPPPPPTRPPS 95
Query: 358 CPRG--RPRCMTT-----------EPWLPPAAPAAPHPYAPP-----GRPRRCC-----G 248
CP G P C T P PP P P P PP G P CC G
Sbjct: 96 CPAGGVPPYCCTNGGSGPNCYVPPPPTPPPTRPPPPPPTRPPSCPAGGVPPYCCTNGGSG 155
Query: 247 CCCWGQRRLRLRGSPGEPPREQPP 176
C+ + P PP PP
Sbjct: 156 PNCY------VPPPPTPPPTRPPP 173
Score = 51.6 bits (122), Expect = 7e-06
Identities = 45/144 (31%), Positives = 53/144 (36%), Gaps = 26/144 (18%)
Frame = -3
Query: 529 TPPESDPPHRCRCRAPGC-CGRVRRWPCSHAGRCSGGWPTP*PGSPPSGPPPQTSAR--* 359
TPP + PP R P C G V + C++ G + P P PP+ PPP R
Sbjct: 79 TPPPTRPPPPPPTRPPSCPAGGVPPYCCTNGGSGPNCYVPPPPTPPPTRPPPPPPTRPPS 138
Query: 358 CPRG--RPRCMTT-----------EPWLPPAAPAAPHPYAPP-----GRPRRCC-----G 248
CP G P C T P PP P P P PP G P CC G
Sbjct: 139 CPAGGVPPYCCTNGGSGPNCYVPPPPTPPPTRPPPPPPTRPPSCPAGGVPPYCCTNGGSG 198
Query: 247 CCCWGQRRLRLRGSPGEPPREQPP 176
C+ + P PP PP
Sbjct: 199 PNCY------VPPPPTPPPTRPPP 216
Score = 45.8 bits (107), Expect = 4e-04
Identities = 35/104 (33%), Positives = 41/104 (38%), Gaps = 15/104 (14%)
Frame = -3
Query: 529 TPPESDPPHRCRCRAPGC-CGRVRRWPCSHAGRCSGGWPTP*PGSPPSGPPPQTSAR*-- 359
TPP + PP R P C G V + C++ G + P P PP+ PPP R
Sbjct: 122 TPPPTRPPPPPPTRPPSCPAGGVPPYCCTNGGSGPNCYVPPPPTPPPTRPPPPPPTRPPS 181
Query: 358 CPRGR------------PRCMTTEPWLPPAAPAAPHPYAPPGRP 263
CP G P C P PP P P P PP RP
Sbjct: 182 CPAGGVPPYCCTNGGSGPNCYVPPPPTPP--PTRPPP-PPPTRP 222
Score = 42.4 bits (98), Expect = 0.004
Identities = 42/142 (29%), Positives = 49/142 (33%), Gaps = 26/142 (18%)
Frame = -3
Query: 523 PESDPPHRCRCRAPGC-CGRVRRWPCSHAGRCSGGWPTP*PGSPPSGPPPQTSAR--*CP 353
P PP P C G V + C++ G + P P PP+ PPP R CP
Sbjct: 1 PRPPPP------PPSCPAGGVPPYCCTNGGSGPNCYVPPPPTPPPTRPPPPPPTRPPSCP 54
Query: 352 RG--RPRCMTT-----------EPWLPPAAPAAPHPYAPP-----GRPRRCC-----GCC 242
G P C T P PP P P P PP G P CC G
Sbjct: 55 AGGVPPYCCTNGGSGPNCYVPPPPTPPPTRPPPPPPTRPPSCPAGGVPPYCCTNGGSGPN 114
Query: 241 CWGQRRLRLRGSPGEPPREQPP 176
C+ + P PP PP
Sbjct: 115 CY------VPPPPTPPPTRPPP 130
Score = 34.3 bits (77), Expect = 1.2
Identities = 28/81 (34%), Positives = 32/81 (38%), Gaps = 1/81 (1%)
Frame = -3
Query: 529 TPPESDPPHRCRCRAPGC-CGRVRRWPCSHAGRCSGGWPTP*PGSPPSGPPPQTSAR*CP 353
TPP + PP R P C G V + C++ G SG P PP PP T P
Sbjct: 165 TPPPTRPPPPPPTRPPSCPAGGVPPYCCTNGG--SG----PNCYVPPPPTPPPTRPPPPP 218
Query: 352 RGRPRCMTTEPWLPPAAPAAP 290
RP LPP P P
Sbjct: 219 PTRPPSSLPPSALPPGPPQGP 239
>ref|NP_499936.1| putative nuclear protein (4B256) [Caenorhabditis elegans]
gi|7508913|pir||T33997 hypothetical protein W03G1.5 -
Caenorhabditis elegans gi|4262637|gb|AAD14753.1|
Hypothetical protein W03G1.5 [Caenorhabditis elegans]
Length = 471
Score = 50.8 bits (120), Expect = 1e-05
Identities = 44/123 (35%), Positives = 46/123 (36%), Gaps = 10/123 (8%)
Frame = +3
Query: 189 RGGSPGDPRNRKRRCPQQQHPQQRRGLPGGAYGCGAAGAAGG---------SHGSVVMHR 341
R GSPG R R R G PGG +G G G G HG HR
Sbjct: 228 RSGSPGGRRGHGGR------HGSRSGSPGGRHGHGGHGGHGSRSGSPGGRHGHGGSGHHR 281
Query: 342 G-RPRGHHRAEVCGGGPEGGLPGHGVGHPPEHLPA*LQGQRRTRPQHPGARHRHRCGGSD 518
G R GH R C G P G GHG GH G +R PG RH H G
Sbjct: 282 GGRHGGHGRHGSCSGSPR-GRHGHG-GH----------GGHGSRSGSPGGRHGHGGSGHH 329
Query: 519 SGG 527
GG
Sbjct: 330 RGG 332
Score = 47.4 bits (111), Expect = 1e-04
Identities = 47/155 (30%), Positives = 52/155 (33%), Gaps = 24/155 (15%)
Frame = +3
Query: 102 PNGPRLHLSR---RLSAPAARTERSKPCGGCSRGGSP-------GDPRNRKRRCPQQQHP 251
P G R H R R +P R G SR GSP G +R R
Sbjct: 232 PGGRRGHGGRHGSRSGSPGGRHGHGGHGGHGSRSGSPGGRHGHGGSGHHRGGRHGGHGRH 291
Query: 252 QQRRGLPGGAYGCGAAGAAGG---------SHGSVVMHRGRPRGHHRAEVCGGGPEGGLP 404
G P G +G G G G HG HRG G H G GG
Sbjct: 292 GSCSGSPRGRHGHGGHGGHGSRSGSPGGRHGHGGSGHHRGGRHGGHGRHGSRSGSPGGRH 351
Query: 405 GHGVGHPPEHLPA*LQGQRRTRPQHP-----GARH 494
GHG H P H P G+ +R P G RH
Sbjct: 352 GHGGRHGPPHCPG-RHGRHGSRSHSPRGHGHGGRH 385
Score = 37.7 bits (86), Expect = 0.11
Identities = 40/142 (28%), Positives = 48/142 (33%), Gaps = 3/142 (2%)
Frame = +3
Query: 138 SAPAARTERSKPCGGCSRGGSPGDPRNRKRRCPQQQHPQQRRGLPGGAYGCGAAGAAGGS 317
S+ ++ + S+ G G K+ Q+ + RG G G GG
Sbjct: 185 SSSSSSSSESRSPGRHEEPEKQGKKEELKQLKKQKSEGVEHRGRSGSP---GGRRGHGGR 241
Query: 318 HGSVVMHRGRPRGHHRAEVCGG-GPEGGLPG--HGVGHPPEHLPA*LQGQRRTRPQHPGA 488
HGS G P G H GG G G PG HG G H G R
Sbjct: 242 HGS---RSGSPGGRHGHGGHGGHGSRSGSPGGRHGHGGSGHHRGGRHGGHGRHGSCSGSP 298
Query: 489 RHRHRCGGSDSGGVRRARPADR 554
R RH GG G R P R
Sbjct: 299 RGRHGHGGHGGHGSRSGSPGGR 320
>ref|ZP_00137694.1| COG1535: Isochorismate hydrolase [Pseudomonas aeruginosa
UCBPP-PA14]
Length = 399
Score = 49.7 bits (117), Expect = 3e-05
Identities = 47/150 (31%), Positives = 53/150 (35%), Gaps = 7/150 (4%)
Frame = +3
Query: 123 LSRRLSAPAARTERSKPCGGCSRGGSPGDPRNRKRRCPQQQHPQQRRGLPGGAYGCGAAG 302
+ RR P R +PCG RG PG + R P+ R P G A
Sbjct: 1 MDRRAHPPGRR----RPCGAAGRGAQPGGLQGRSGDRPRPVAGALRASRPAPRAGAPDAD 56
Query: 303 AAGGSHGSVVMHRGRPRGHHRAEVCGGGPEGGLPGHGVGHPPEHLPA*LQGQRRTRPQHP 482
A G R G GGG G G G E A +R R Q P
Sbjct: 57 RADG----------RAEGRRAPAAAGGG---GARGRAPGDLAERPDARQHHRRALRQQDP 103
Query: 483 -GARHRHRCGGSDSGGV------RRARPAD 551
GA+HR GG GGV RRA P D
Sbjct: 104 PGAQHRRGGGGVPPGGVRLRRRGRRAAPGD 133
>gb|AAK61382.1| basic proline-rich protein [Sus scrofa]
Length = 511
Score = 49.3 bits (116), Expect = 3e-05
Identities = 42/119 (35%), Positives = 42/119 (35%)
Frame = -3
Query: 526 PPESDPPHRCRCRAPGCCGRVRRWPCSHAGRCSGGWPTP*PGSPPSGPPPQTSAR*CPRG 347
PP PP P G P H R P PG PP GPPP A P
Sbjct: 269 PPGPAPPGARPPPGPPPPGPPPPGPAPHGAR-------PPPGPPPPGPPPPGPAP--PGA 319
Query: 346 RPRCMTTEPWLPPAAPAAPHPYAPPGRPRRCCGCCCWGQRRLRLRGSPGEPPREQPPHG 170
RP P PP PA P PPG P G G R PG PP PP G
Sbjct: 320 RPPPGPPPPGPPPPGPAPPGARPPPGPPPP--GPPPPGPAPPGARPPPGPPPPGPPPPG 376
Score = 47.0 bits (110), Expect = 2e-04
Identities = 33/81 (40%), Positives = 33/81 (40%)
Frame = -3
Query: 412 P*PGSPPSGPPPQTSAR*CPRGRPRCMTTEPWLPPAAPAAPHPYAPPGRPRRCCGCCCWG 233
P PG PP GPPP A P RP P PP PA P PPG P G G
Sbjct: 342 PPPGPPPPGPPPPGPAP--PGARPPPGPPPPGPPPPGPAPPGARPPPGPPPP--GPPPPG 397
Query: 232 QRRLRLRGSPGEPPREQPPHG 170
R PG PP PP G
Sbjct: 398 PAPPGARPPPGPPPPGPPPPG 418
Score = 46.6 bits (109), Expect = 2e-04
Identities = 33/81 (40%), Positives = 33/81 (40%)
Frame = -3
Query: 412 P*PGSPPSGPPPQTSAR*CPRGRPRCMTTEPWLPPAAPAAPHPYAPPGRPRRCCGCCCWG 233
P PG PP GPPP A P RP P PP PA P PPG P G G
Sbjct: 363 PPPGPPPPGPPPPGPAP--PGARPPPGPPPPGPPPPGPAPPGARPPPGPPPP--GPPPPG 418
Query: 232 QRRLRLRGSPGEPPREQPPHG 170
R PG PP PP G
Sbjct: 419 PAPPGARPLPGPPPPGPPPPG 439
Score = 45.8 bits (107), Expect = 4e-04
Identities = 47/157 (29%), Positives = 52/157 (32%), Gaps = 32/157 (20%)
Frame = -3
Query: 550 SAGRARRTPPESDPP------------HRCRCRAPGCCGRVRRWPCSHAGRCSGGWPTP* 407
SA + R PP PP H+ R R PG + P R G P P
Sbjct: 30 SAEKFLRPPPGGGPPRPPPPEESQGEGHQKRPRPPG--DGPEQGPAPPGARPPPGPPPPG 87
Query: 406 P--------------GSPPSGPPPQTSAR*CPRGRPRCMTTEPWLPPAAPAAPHPYAPPG 269
P G PP GPPP A P RP P PP PA P PPG
Sbjct: 88 PPPPGPAPPGARPPPGPPPPGPPPPGPAP--PGARPPPGPPPPGPPPPGPAPPGARPPPG 145
Query: 268 RPRRCCGC------CCWGQRRLRLRGSPGEPPREQPP 176
P G G ++ G PPR PP
Sbjct: 146 PPPPAGGLQQGPAPSHVGPKKKPPPPGAGHPPRPPPP 182
Score = 45.4 bits (106), Expect = 5e-04
Identities = 41/121 (33%), Positives = 47/121 (37%), Gaps = 1/121 (0%)
Frame = -3
Query: 529 TPPESDPPHRCRCRAPGCCG-RVRRWPCSHAGRCSGGWPTP*PGSPPSGPPPQTSAR*CP 353
+PP +D P + P G + ++ P AG P P PG PP GP P P
Sbjct: 208 SPPSADGPQQ----GPAPSGDKPKKKPPPPAGPPPP--PPPPPGPPPPGPAP-------P 254
Query: 352 RGRPRCMTTEPWLPPAAPAAPHPYAPPGRPRRCCGCCCWGQRRLRLRGSPGEPPREQPPH 173
RP P PP PA P PPG P G G R PG PP PP
Sbjct: 255 GARPPPGPPPPGPPPPGPAPPGARPPPGPPPP--GPPPPGPAPHGARPPPGPPPPGPPPP 312
Query: 172 G 170
G
Sbjct: 313 G 313
Score = 40.4 bits (93), Expect = 0.016
Identities = 31/87 (35%), Positives = 34/87 (38%), Gaps = 5/87 (5%)
Frame = -3
Query: 412 P*PGSPPSGPPPQTSAR*CPRGRPRCMTTEPWLPPAAPAAPHPYAPPGRPRRC----CGC 245
P PG PP GPPP A P RP P PP PA P PP P G
Sbjct: 405 PPPGPPPPGPPPPGPAP--PGARPLPGPPPPGPPPPGPAPPGARPPPPPPPPADEPQQGP 462
Query: 244 CCWGQRRLRLRGSP-GEPPREQPPHGL 167
G + + P G PP PP G+
Sbjct: 463 APSGDKPKKKPPPPAGPPPPPPPPPGI 489
Score = 37.7 bits (86), Expect = 0.11
Identities = 30/83 (36%), Positives = 32/83 (38%), Gaps = 2/83 (2%)
Frame = -3
Query: 412 P*PGSPPSGPPPQTSAR*CPRGRPRCMTTEPWLPPAAPAAPHPYA--PPGRPRRCCGCCC 239
P G PP PPP+ S + RPR P PA P A P PPG P
Sbjct: 38 PPGGGPPRPPPPEESQGEGHQKRPRPPGDGPEQGPAPPGARPPPGPPPPGPPPP------ 91
Query: 238 WGQRRLRLRGSPGEPPREQPPHG 170
G R PG PP PP G
Sbjct: 92 -GPAPPGARPPPGPPPPGPPPPG 113
Score = 33.5 bits (75), Expect = 2.0
Identities = 41/140 (29%), Positives = 50/140 (35%), Gaps = 21/140 (15%)
Frame = -3
Query: 526 PPESDPPHRCRCRAPGCCGRVRRWPC-SHAGRCSGGWPTP*PGSPPSGPPPQTSAR*CPR 350
PP + PP P G +++ P SH G P P G PP PPP ++ PR
Sbjct: 137 PPGARPPPG----PPPPAGGLQQGPAPSHVGPKKKP-PPPGAGHPPRPPPPANESQPGPR 191
Query: 349 -----GRPRCMTTEPWLPPAA------PA---------APHPYAPPGRPRRCCGCCCWGQ 230
P ++ PP+A PA P P PP P G G
Sbjct: 192 PPPGPPSPPANDSQEGSPPSADGPQQGPAPSGDKPKKKPPPPAGPPPPPPPPPGPPPPGP 251
Query: 229 RRLRLRGSPGEPPREQPPHG 170
R PG PP PP G
Sbjct: 252 APPGARPPPGPPPPGPPPPG 271
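Each alignment block in the report above opens with a `Score = ... bits (...), Expect = ...` line. As a minimal sketch, assuming the plain-text layout shown here, the following Python collects the bit score, raw score, and E-value of every block. The function name `parse_hsp_scores` and the two-line demo string are illustrative assumptions. For real work a maintained BLAST parser is likely a better choice than a hand-written regular expression.

```python
import re

# Matches lines such as: "Score = 51.6 bits (122), Expect = 7e-06"
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)


def parse_hsp_scores(report):
    """Return one dict per 'Score = ...' line found in a BLAST text report."""
    hsps = []
    for match in SCORE_RE.finditer(report):
        expect_text = match.group(3)
        # Some BLAST versions print very small values as "e-100"; prefix a 1
        if expect_text.lower().startswith("e"):
            expect_text = "1" + expect_text
        hsps.append(
            {
                "bits": float(match.group(1)),
                "raw": int(match.group(2)),
                "expect": float(expect_text),
            }
        )
    return hsps


# Two score lines copied from the report above
demo = """Score = 51.6 bits (122), Expect = 7e-06
Score = 34.7 bits (78), Expect = 0.89
"""

for hsp in parse_hsp_scores(demo):
    print(hsp)
```

Filtering the returned list by `expect` would separate the stronger matches from weak ones such as the block with an E-value of 0.89.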
EST assemble image
| clone | accession | position |
1 | CM022a05_r | AV387887 | 1 | 558 |
2 | LCL077c04_r | AV630323 | 1 | 468 |
Chlamydomonas reinhardtii
Kazusa DNA Research Institute