FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE5549, 400 aa
1>>>pF1KE5549 400 - 400 aa - 400 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.8740+/-0.00083; mu= 14.8171+/- 0.050
mean_var=77.3028+/-15.378, 0's: 0 Z-trim(108.0): 30 B-trim: 43 in 1/51
Lambda= 0.145874
statistics sampled from 9915 (9934) to 9915 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.682), E-opt: 0.2 (0.305), width: 16
Scan time: 2.370
The best scores are: opt bits E(32554)
CCDS47204.1 PAIP1 gene_id:10605|Hs108|chr5 ( 400) 2666 570.4 1e-162
CCDS3947.1 PAIP1 gene_id:10605|Hs108|chr5 ( 479) 2610 558.6 4.3e-159
CCDS3948.1 PAIP1 gene_id:10605|Hs108|chr5 ( 367) 2440 522.8 2e-148
>>CCDS47204.1 PAIP1 gene_id:10605|Hs108|chr5 (400 aa)
initn: 2666 init1: 2666 opt: 2666 Z-score: 3034.6 bits: 570.4 E(32554): 1e-162
Smith-Waterman score: 2666; 100.0% identity (100.0% similar) in 400 aa overlap (1-400:1-400)
10 20 30 40 50 60
pF1KE5 MSDGFDRAPEQTRPLRAPPSSQDKIPQQNSESAMAKPQVVVAPVLMSKLSVNAPEFYPSG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 MSDGFDRAPEQTRPLRAPPSSQDKIPQQNSESAMAKPQVVVAPVLMSKLSVNAPEFYPSG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE5 YSSSYTESYEDGCEDYPTLSEYVQDFLNHLTEQPGSFETEIEQFAETLNGCVTTDDALQE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 YSSSYTESYEDGCEDYPTLSEYVQDFLNHLTEQPGSFETEIEQFAETLNGCVTTDDALQE
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE5 LVELIYQQATSIPNFSYMGARLCNYLSHHLTISPQSGNFRQLLLQRCRTEYEVKDQAAKG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 LVELIYQQATSIPNFSYMGARLCNYLSHHLTISPQSGNFRQLLLQRCRTEYEVKDQAAKG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE5 DEVTRKRFHAFVLFLGELYLNLEIKGTNGQVTRADILQVGLRELLNALFSNPMDDNLICA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 DEVTRKRFHAFVLFLGELYLNLEIKGTNGQVTRADILQVGLRELLNALFSNPMDDNLICA
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE5 VKLLKLTGSVLEDAWKEKGKMDMEEIIQRIENVVLDANCSRDVKQMLLKLVELRSSNWGR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 VKLLKLTGSVLEDAWKEKGKMDMEEIIQRIENVVLDANCSRDVKQMLLKLVELRSSNWGR
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE5 VHATSTYREATPENDPNYFMNEPTFYTSDGVPFTAADPDYQEKYQELLEREDFFPDYEEN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 VHATSTYREATPENDPNYFMNEPTFYTSDGVPFTAADPDYQEKYQELLEREDFFPDYEEN
310 320 330 340 350 360
370 380 390 400
pF1KE5 GTDLSGAGDPYLDDIDDEMDPEIEEAYEKFCLESERKRKQ
::::::::::::::::::::::::::::::::::::::::
CCDS47 GTDLSGAGDPYLDDIDDEMDPEIEEAYEKFCLESERKRKQ
370 380 390 400
>>CCDS3947.1 PAIP1 gene_id:10605|Hs108|chr5 (479 aa)
initn: 2610 init1: 2610 opt: 2610 Z-score: 2969.7 bits: 558.6 E(32554): 4.3e-159
Smith-Waterman score: 2610; 100.0% identity (100.0% similar) in 392 aa overlap (9-400:88-479)
10 20 30
pF1KE5 MSDGFDRAPEQTRPLRAPPSSQDKIPQQNSESAMAKPQ
::::::::::::::::::::::::::::::
CCDS39 PLRQPRTTPPPGAQCEVPASPQRPSRPGALPEQTRPLRAPPSSQDKIPQQNSESAMAKPQ
60 70 80 90 100 110
40 50 60 70 80 90
pF1KE5 VVVAPVLMSKLSVNAPEFYPSGYSSSYTESYEDGCEDYPTLSEYVQDFLNHLTEQPGSFE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS39 VVVAPVLMSKLSVNAPEFYPSGYSSSYTESYEDGCEDYPTLSEYVQDFLNHLTEQPGSFE
120 130 140 150 160 170
100 110 120 130 140 150
pF1KE5 TEIEQFAETLNGCVTTDDALQELVELIYQQATSIPNFSYMGARLCNYLSHHLTISPQSGN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS39 TEIEQFAETLNGCVTTDDALQELVELIYQQATSIPNFSYMGARLCNYLSHHLTISPQSGN
180 190 200 210 220 230
160 170 180 190 200 210
pF1KE5 FRQLLLQRCRTEYEVKDQAAKGDEVTRKRFHAFVLFLGELYLNLEIKGTNGQVTRADILQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS39 FRQLLLQRCRTEYEVKDQAAKGDEVTRKRFHAFVLFLGELYLNLEIKGTNGQVTRADILQ
240 250 260 270 280 290
220 230 240 250 260 270
pF1KE5 VGLRELLNALFSNPMDDNLICAVKLLKLTGSVLEDAWKEKGKMDMEEIIQRIENVVLDAN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS39 VGLRELLNALFSNPMDDNLICAVKLLKLTGSVLEDAWKEKGKMDMEEIIQRIENVVLDAN
300 310 320 330 340 350
280 290 300 310 320 330
pF1KE5 CSRDVKQMLLKLVELRSSNWGRVHATSTYREATPENDPNYFMNEPTFYTSDGVPFTAADP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS39 CSRDVKQMLLKLVELRSSNWGRVHATSTYREATPENDPNYFMNEPTFYTSDGVPFTAADP
360 370 380 390 400 410
340 350 360 370 380 390
pF1KE5 DYQEKYQELLEREDFFPDYEENGTDLSGAGDPYLDDIDDEMDPEIEEAYEKFCLESERKR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS39 DYQEKYQELLEREDFFPDYEENGTDLSGAGDPYLDDIDDEMDPEIEEAYEKFCLESERKR
420 430 440 450 460 470
400
pF1KE5 KQ
::
CCDS39 KQ
>>CCDS3948.1 PAIP1 gene_id:10605|Hs108|chr5 (367 aa)
initn: 2440 init1: 2440 opt: 2440 Z-score: 2778.1 bits: 522.8 E(32554): 2e-148
Smith-Waterman score: 2440; 100.0% identity (100.0% similar) in 367 aa overlap (34-400:1-367)
10 20 30 40 50 60
pF1KE5 GFDRAPEQTRPLRAPPSSQDKIPQQNSESAMAKPQVVVAPVLMSKLSVNAPEFYPSGYSS
::::::::::::::::::::::::::::::
CCDS39 MAKPQVVVAPVLMSKLSVNAPEFYPSGYSS
10 20 30
70 80 90 100 110 120
pF1KE5 SYTESYEDGCEDYPTLSEYVQDFLNHLTEQPGSFETEIEQFAETLNGCVTTDDALQELVE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS39 SYTESYEDGCEDYPTLSEYVQDFLNHLTEQPGSFETEIEQFAETLNGCVTTDDALQELVE
40 50 60 70 80 90
130 140 150 160 170 180
pF1KE5 LIYQQATSIPNFSYMGARLCNYLSHHLTISPQSGNFRQLLLQRCRTEYEVKDQAAKGDEV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS39 LIYQQATSIPNFSYMGARLCNYLSHHLTISPQSGNFRQLLLQRCRTEYEVKDQAAKGDEV
100 110 120 130 140 150
190 200 210 220 230 240
pF1KE5 TRKRFHAFVLFLGELYLNLEIKGTNGQVTRADILQVGLRELLNALFSNPMDDNLICAVKL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS39 TRKRFHAFVLFLGELYLNLEIKGTNGQVTRADILQVGLRELLNALFSNPMDDNLICAVKL
160 170 180 190 200 210
250 260 270 280 290 300
pF1KE5 LKLTGSVLEDAWKEKGKMDMEEIIQRIENVVLDANCSRDVKQMLLKLVELRSSNWGRVHA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS39 LKLTGSVLEDAWKEKGKMDMEEIIQRIENVVLDANCSRDVKQMLLKLVELRSSNWGRVHA
220 230 240 250 260 270
310 320 330 340 350 360
pF1KE5 TSTYREATPENDPNYFMNEPTFYTSDGVPFTAADPDYQEKYQELLEREDFFPDYEENGTD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS39 TSTYREATPENDPNYFMNEPTFYTSDGVPFTAADPDYQEKYQELLEREDFFPDYEENGTD
280 290 300 310 320 330
370 380 390 400
pF1KE5 LSGAGDPYLDDIDDEMDPEIEEAYEKFCLESERKRKQ
:::::::::::::::::::::::::::::::::::::
CCDS39 LSGAGDPYLDDIDDEMDPEIEEAYEKFCLESERKRKQ
340 350 360
400 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 01:43:14 2016 done: Tue Nov 8 01:43:15 2016
Total Scan time: 2.370 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]