FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5549, 400 aa 1>>>pF1KE5549 400 - 400 aa - 400 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.8740+/-0.00083; mu= 14.8171+/- 0.050 mean_var=77.3028+/-15.378, 0's: 0 Z-trim(108.0): 30 B-trim: 43 in 1/51 Lambda= 0.145874 statistics sampled from 9915 (9934) to 9915 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.682), E-opt: 0.2 (0.305), width: 16 Scan time: 2.370 The best scores are: opt bits E(32554) CCDS47204.1 PAIP1 gene_id:10605|Hs108|chr5 ( 400) 2666 570.4 1e-162 CCDS3947.1 PAIP1 gene_id:10605|Hs108|chr5 ( 479) 2610 558.6 4.3e-159 CCDS3948.1 PAIP1 gene_id:10605|Hs108|chr5 ( 367) 2440 522.8 2e-148 >>CCDS47204.1 PAIP1 gene_id:10605|Hs108|chr5 (400 aa) initn: 2666 init1: 2666 opt: 2666 Z-score: 3034.6 bits: 570.4 E(32554): 1e-162 Smith-Waterman score: 2666; 100.0% identity (100.0% similar) in 400 aa overlap (1-400:1-400) 10 20 30 40 50 60 pF1KE5 MSDGFDRAPEQTRPLRAPPSSQDKIPQQNSESAMAKPQVVVAPVLMSKLSVNAPEFYPSG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 MSDGFDRAPEQTRPLRAPPSSQDKIPQQNSESAMAKPQVVVAPVLMSKLSVNAPEFYPSG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 YSSSYTESYEDGCEDYPTLSEYVQDFLNHLTEQPGSFETEIEQFAETLNGCVTTDDALQE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 YSSSYTESYEDGCEDYPTLSEYVQDFLNHLTEQPGSFETEIEQFAETLNGCVTTDDALQE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 LVELIYQQATSIPNFSYMGARLCNYLSHHLTISPQSGNFRQLLLQRCRTEYEVKDQAAKG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 LVELIYQQATSIPNFSYMGARLCNYLSHHLTISPQSGNFRQLLLQRCRTEYEVKDQAAKG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 DEVTRKRFHAFVLFLGELYLNLEIKGTNGQVTRADILQVGLRELLNALFSNPMDDNLICA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 DEVTRKRFHAFVLFLGELYLNLEIKGTNGQVTRADILQVGLRELLNALFSNPMDDNLICA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 VKLLKLTGSVLEDAWKEKGKMDMEEIIQRIENVVLDANCSRDVKQMLLKLVELRSSNWGR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 VKLLKLTGSVLEDAWKEKGKMDMEEIIQRIENVVLDANCSRDVKQMLLKLVELRSSNWGR 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE5 VHATSTYREATPENDPNYFMNEPTFYTSDGVPFTAADPDYQEKYQELLEREDFFPDYEEN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 VHATSTYREATPENDPNYFMNEPTFYTSDGVPFTAADPDYQEKYQELLEREDFFPDYEEN 310 320 330 340 350 360 370 380 390 400 pF1KE5 GTDLSGAGDPYLDDIDDEMDPEIEEAYEKFCLESERKRKQ :::::::::::::::::::::::::::::::::::::::: CCDS47 GTDLSGAGDPYLDDIDDEMDPEIEEAYEKFCLESERKRKQ 370 380 390 400 >>CCDS3947.1 PAIP1 gene_id:10605|Hs108|chr5 (479 aa) initn: 2610 init1: 2610 opt: 2610 Z-score: 2969.7 bits: 558.6 E(32554): 4.3e-159 Smith-Waterman score: 2610; 100.0% identity (100.0% similar) in 392 aa overlap (9-400:88-479) 10 20 30 pF1KE5 MSDGFDRAPEQTRPLRAPPSSQDKIPQQNSESAMAKPQ :::::::::::::::::::::::::::::: CCDS39 PLRQPRTTPPPGAQCEVPASPQRPSRPGALPEQTRPLRAPPSSQDKIPQQNSESAMAKPQ 60 70 80 90 100 110 40 50 60 70 80 90 pF1KE5 VVVAPVLMSKLSVNAPEFYPSGYSSSYTESYEDGCEDYPTLSEYVQDFLNHLTEQPGSFE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS39 VVVAPVLMSKLSVNAPEFYPSGYSSSYTESYEDGCEDYPTLSEYVQDFLNHLTEQPGSFE 120 130 140 150 160 170 100 110 120 130 140 150 pF1KE5 TEIEQFAETLNGCVTTDDALQELVELIYQQATSIPNFSYMGARLCNYLSHHLTISPQSGN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS39 TEIEQFAETLNGCVTTDDALQELVELIYQQATSIPNFSYMGARLCNYLSHHLTISPQSGN 180 190 200 210 220 230 160 170 180 190 200 210 pF1KE5 FRQLLLQRCRTEYEVKDQAAKGDEVTRKRFHAFVLFLGELYLNLEIKGTNGQVTRADILQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS39 FRQLLLQRCRTEYEVKDQAAKGDEVTRKRFHAFVLFLGELYLNLEIKGTNGQVTRADILQ 240 250 260 270 280 290 220 230 240 250 260 270 pF1KE5 VGLRELLNALFSNPMDDNLICAVKLLKLTGSVLEDAWKEKGKMDMEEIIQRIENVVLDAN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS39 VGLRELLNALFSNPMDDNLICAVKLLKLTGSVLEDAWKEKGKMDMEEIIQRIENVVLDAN 300 310 320 330 340 350 280 290 300 310 320 330 pF1KE5 CSRDVKQMLLKLVELRSSNWGRVHATSTYREATPENDPNYFMNEPTFYTSDGVPFTAADP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS39 CSRDVKQMLLKLVELRSSNWGRVHATSTYREATPENDPNYFMNEPTFYTSDGVPFTAADP 360 370 380 390 400 410 340 350 360 370 380 390 pF1KE5 DYQEKYQELLEREDFFPDYEENGTDLSGAGDPYLDDIDDEMDPEIEEAYEKFCLESERKR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS39 DYQEKYQELLEREDFFPDYEENGTDLSGAGDPYLDDIDDEMDPEIEEAYEKFCLESERKR 420 430 440 450 460 470 400 pF1KE5 KQ :: CCDS39 KQ >>CCDS3948.1 PAIP1 gene_id:10605|Hs108|chr5 (367 aa) initn: 2440 init1: 2440 opt: 2440 Z-score: 2778.1 bits: 522.8 E(32554): 2e-148 Smith-Waterman score: 2440; 100.0% identity (100.0% similar) in 367 aa overlap (34-400:1-367) 10 20 30 40 50 60 pF1KE5 GFDRAPEQTRPLRAPPSSQDKIPQQNSESAMAKPQVVVAPVLMSKLSVNAPEFYPSGYSS :::::::::::::::::::::::::::::: CCDS39 MAKPQVVVAPVLMSKLSVNAPEFYPSGYSS 10 20 30 70 80 90 100 110 120 pF1KE5 SYTESYEDGCEDYPTLSEYVQDFLNHLTEQPGSFETEIEQFAETLNGCVTTDDALQELVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS39 SYTESYEDGCEDYPTLSEYVQDFLNHLTEQPGSFETEIEQFAETLNGCVTTDDALQELVE 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE5 LIYQQATSIPNFSYMGARLCNYLSHHLTISPQSGNFRQLLLQRCRTEYEVKDQAAKGDEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS39 LIYQQATSIPNFSYMGARLCNYLSHHLTISPQSGNFRQLLLQRCRTEYEVKDQAAKGDEV 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE5 TRKRFHAFVLFLGELYLNLEIKGTNGQVTRADILQVGLRELLNALFSNPMDDNLICAVKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS39 TRKRFHAFVLFLGELYLNLEIKGTNGQVTRADILQVGLRELLNALFSNPMDDNLICAVKL 160 170 180 190 200 210 250 260 270 280 290 300 pF1KE5 LKLTGSVLEDAWKEKGKMDMEEIIQRIENVVLDANCSRDVKQMLLKLVELRSSNWGRVHA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS39 LKLTGSVLEDAWKEKGKMDMEEIIQRIENVVLDANCSRDVKQMLLKLVELRSSNWGRVHA 220 230 240 250 260 270 310 320 330 340 350 360 pF1KE5 TSTYREATPENDPNYFMNEPTFYTSDGVPFTAADPDYQEKYQELLEREDFFPDYEENGTD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS39 TSTYREATPENDPNYFMNEPTFYTSDGVPFTAADPDYQEKYQELLEREDFFPDYEENGTD 280 290 300 310 320 330 370 380 390 400 pF1KE5 LSGAGDPYLDDIDDEMDPEIEEAYEKFCLESERKRKQ ::::::::::::::::::::::::::::::::::::: CCDS39 LSGAGDPYLDDIDDEMDPEIEEAYEKFCLESERKRKQ 340 350 360 400 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 01:43:14 2016 done: Tue Nov 8 01:43:15 2016 Total Scan time: 2.370 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]