FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE5761, 811 aa
  1>>>pF1KE5761 811 - 811 aa - 811 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 9.2620+/-0.000948; mu= 3.2655+/- 0.057
 mean_var=234.9963+/-48.189, 0's: 0 Z-trim(112.3): 38  B-trim: 135 in 1/53
 Lambda= 0.083665
 statistics sampled from 13048 (13066) to 13048 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.739), E-opt: 0.2 (0.401), width:  16
 Scan time:  3.890

The best scores are:                                      opt bits E(32554)
CCDS54520.1 EIF4ENIF1 gene_id:56478|Hs108|chr22    ( 811) 5397 665.0 1.4e-190
CCDS13898.1 EIF4ENIF1 gene_id:56478|Hs108|chr22    ( 985) 3220 402.3  2e-111

>>CCDS54520.1 EIF4ENIF1 gene_id:56478|Hs108|chr22        (811 aa)
 initn: 5397 init1: 5397 opt: 5397  Z-score: 3535.2  bits: 665.0 E(32554): 1.4e-190
Smith-Waterman score: 5397; 100.0% identity (100.0% similar) in 811 aa overlap (1-811:1-811)

               10        20        30        40        50        60
pF1KE5 MDRRSMGETESGDAFLDLKKPPASKCPHRYTKEELLDIKELPHSKQRPSCLSEKYDSDGV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MDRRSMGETESGDAFLDLKKPPASKCPHRYTKEELLDIKELPHSKQRPSCLSEKYDSDGV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 WDPEKWHASLYPASGRSSPVESLKKELDTDRPSLVRRIVGIVECNGGVAEEDEVEVILAQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 WDPEKWHASLYPASGRSSPVESLKKELDTDRPSLVRRIVGIVECNGGVAEEDEVEVILAQ
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE5 EPAADQEVPRDAVLPEQSPGDFDFNEFFNLDKVPCLASMIEDVLGEGSVSASRFSRWFSN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 EPAADQEVPRDAVLPEQSPGDFDFNEFFNLDKVPCLASMIEDVLGEGSVSASRFSRWFSN
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE5 PSRSGSRSSSLGSTPHEELERLAGLEQAILSPGQNSGNYFAPIPLEDHAENKVDILEMLQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PSRSGSRSSSLGSTPHEELERLAGLEQAILSPGQNSGNYFAPIPLEDHAENKVDILEMLQ
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE5 KAKVDLKPLLSSLSANKEKLKESSHSGVVLSVEEVEAGLKGLKVDQQVKNSTPFMAEHLE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 KAKVDLKPLLSSLSANKEKLKESSHSGVVLSVEEVEAGLKGLKVDQQVKNSTPFMAEHLE
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE5 ETLSAVTNNRQLKKDGDMTAFNKLVSTMKRNLESHLMSPAEIPGQPVPKNILQELLGQPV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 ETLSAVTNNRQLKKDGDMTAFNKLVSTMKRNLESHLMSPAEIPGQPVPKNILQELLGQPV
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE5 QRPASSNLLSGLMGSLEPTTSLLGQRAPSPPLSQVFQTRAASADYLRPRIPSPIGFTPGP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 QRPASSNLLSGLMGSLEPTTSLLGQRAPSPPLSQVFQTRAASADYLRPRIPSPIGFTPGP
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KE5 QQLLGDPFQGMRKPMSPITAQQMSQLELQQAALEGLALPHDLAVQAANFYQPGFGKPQVD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 QQLLGDPFQGMRKPMSPITAQQMSQLELQQAALEGLALPHDLAVQAANFYQPGFGKPQVD
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KE5 RTRDGFRNRQQRVTKSPAPVHRGNSSSPAPAASITSMLSPSFTPTSVIRKMYESKEKSKE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 RTRDGFRNRQQRVTKSPAPVHRGNSSSPAPAASITSMLSPSFTPTSVIRKMYESKEKSKE
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KE5 EPASGKAALGDSKEDTQKASEENLLSSSSVPSADRDSSPTTNSKLSALQRSSCSTPLSQA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 EPASGKAALGDSKEDTQKASEENLLSSSSVPSADRDSSPTTNSKLSALQRSSCSTPLSQA
              550       560       570       580       590       600

              610       620       630       640       650       660
pF1KE5 NRYTKEQDYRPKATGRKTPTLASPVPTTPFLRPVHQVPLVPHVPMVRPAHQLHPGLVQRM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 NRYTKEQDYRPKATGRKTPTLASPVPTTPFLRPVHQVPLVPHVPMVRPAHQLHPGLVQRM
              610       620       630       640       650       660

              670       680       690       700       710       720
pF1KE5 LAQGVHPQHLPSLLQTGVLPPGMDLSHLQGISGPILGQPFYPLPAASHPLLNPRPGTPLH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 
LAQGVHPQHLPSLLQTGVLPPGMDLSHLQGISGPILGQPFYPLPAASHPLLNPRPGTPLH 670 680 690 700 710 720 730 740 750 760 770 780 pF1KE5 LAMVQQQLQRSVLHPPGSGSHAAAVSVQTTPQNVPSRSGLPHMHSQLEHRPSQRSSSPVG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 LAMVQQQLQRSVLHPPGSGSHAAAVSVQTTPQNVPSRSGLPHMHSQLEHRPSQRSSSPVG 730 740 750 760 770 780 790 800 810 pF1KE5 LAKWFGSDVLQQPLPSMPAKVISVDELEYRQ ::::::::::::::::::::::::::::::: CCDS54 LAKWFGSDVLQQPLPSMPAKVISVDELEYRQ 790 800 810 >>CCDS13898.1 EIF4ENIF1 gene_id:56478|Hs108|chr22 (985 aa) initn: 4613 init1: 2481 opt: 3220 Z-score: 2113.9 bits: 402.3 E(32554): 2e-111 Smith-Waterman score: 4665; 91.6% identity (93.5% similar) in 796 aa overlap (39-811:196-985) 10 20 30 40 50 60 pF1KE5 TESGDAFLDLKKPPASKCPHRYTKEELLDIKELPHSKQRPSCLSEKYDSDGVWDPE-KWH .:. ::. ..:. .:. . : .: CCDS13 FEKDHRLSDKDLRDLRDRDRERDFKDKRFRREFGDSKR---VFGERRRNDSYTEEEPEWF 170 180 190 200 210 220 70 80 90 100 110 pF1KE5 ASLYPASGRSSPVESL---KKELDTD---RPSLVRRIV----GIVECNGGVAEEDEVEVI : :.: .: .: : :. : : :: . 
:::::::::::::::::: CCDS13 -SAGPTS-QSETIELTGFDDKILEEDHKGRKRTRRRTASVKEGIVECNGGVAEEDEVEVI 230 240 250 260 270 280 120 130 140 150 160 170 pF1KE5 LAQEPAADQEVPRDAVLPEQSPGDFDFNEFFNLDKVPCLASMIEDVLGEGSVSASRFSRW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 LAQEPAADQEVPRDAVLPEQSPGDFDFNEFFNLDKVPCLASMIEDVLGEGSVSASRFSRW 290 300 310 320 330 340 180 190 200 210 220 230 pF1KE5 FSNPSRSGSRSSSLGSTPHEELERLAGLEQAILSPGQNSGNYFAPIPLEDHAENKVDILE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 FSNPSRSGSRSSSLGSTPHEELERLAGLEQAILSPGQNSGNYFAPIPLEDHAENKVDILE 350 360 370 380 390 400 240 250 260 270 280 290 pF1KE5 MLQKAKVDLKPLLSSLSANKEKLKESSHSGVVLSVEEVEAGLKGLKVDQQVKNSTPFMAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 MLQKAKVDLKPLLSSLSANKEKLKESSHSGVVLSVEEVEAGLKGLKVDQQVKNSTPFMAE 410 420 430 440 450 460 300 310 320 330 340 pF1KE5 HLEETLSAVTNNRQLKKDGDMTAFNKLVSTMK------------RNLESHLMSPAEIPGQ :::::::::::::::::::::::::::::::: :::::::::::::::: CCDS13 HLEETLSAVTNNRQLKKDGDMTAFNKLVSTMKASGTLPSQPKVSRNLESHLMSPAEIPGQ 470 480 490 500 510 520 350 360 370 380 390 400 pF1KE5 PVPKNILQELLGQPVQRPASSNLLSGLMGSLEPTTSLLGQRAPSPPLSQVFQTRAASADY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 PVPKNILQELLGQPVQRPASSNLLSGLMGSLEPTTSLLGQRAPSPPLSQVFQTRAASADY 530 540 550 560 570 580 410 420 430 440 450 460 pF1KE5 LRPRIPSPIGFTPGPQQLLGDPFQGMRKPMSPITAQQMSQLELQQAALEGLALPHDLAVQ :::::::::::::::::::::::::::::::::::: ::::::::::::::::::::::: CCDS13 LRPRIPSPIGFTPGPQQLLGDPFQGMRKPMSPITAQ-MSQLELQQAALEGLALPHDLAVQ 590 600 610 620 630 470 480 490 500 510 520 pF1KE5 AANFYQPGFGKPQVDRTRDGFRNRQQRVTKSPAPVHRGNSSSPAPAASITSMLSPSFTPT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 AANFYQPGFGKPQVDRTRDGFRNRQQRVTKSPAPVHRGNSSSPAPAASITSMLSPSFTPT 640 650 660 670 680 690 530 540 550 560 570 580 pF1KE5 SVIRKMYESKEKSKEEPASGKAALGDSKEDTQKASEENLLSSSSVPSADRDSSPTTNSKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 
SVIRKMYESKEKSKEEPASGKAALGDSKEDTQKASEENLLSSSSVPSADRDSSPTTNSKL 700 710 720 730 740 750 590 600 610 620 630 640 pF1KE5 SALQRSSCSTPLSQANRYTKEQDYRPKATGRKTPTLASPVPTTPFLRPVHQVPLVPHVPM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 SALQRSSCSTPLSQANRYTKEQDYRPKATGRKTPTLASPVPTTPFLRPVHQVPLVPHVPM 760 770 780 790 800 810 650 660 670 680 690 700 pF1KE5 VRPAHQLHPGLVQRMLAQGVHPQHLPSLLQTGVLPPGMDLSHLQGISGPILGQPFYPLPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 VRPAHQLHPGLVQRMLAQGVHPQHLPSLLQTGVLPPGMDLSHLQGISGPILGQPFYPLPA 820 830 840 850 860 870 710 720 730 740 750 760 pF1KE5 ASHPLLNPRPGTPLHLAMVQQQLQRSVLHPPGSGSHAAAVSVQTTPQNVPSRSGLPHMHS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 ASHPLLNPRPGTPLHLAMVQQQLQRSVLHPPGSGSHAAAVSVQTTPQNVPSRSGLPHMHS 880 890 900 910 920 930 770 780 790 800 810 pF1KE5 QLEHRPSQRSSSPVGLAKWFGSDVLQQPLPSMPAKVISVDELEYRQ :::::::::::::::::::::::::::::::::::::::::::::: CCDS13 QLEHRPSQRSSSPVGLAKWFGSDVLQQPLPSMPAKVISVDELEYRQ 940 950 960 970 980 >-- initn: 708 init1: 683 opt: 697 Z-score: 468.0 bits: 97.8 E(32554): 9.5e-20 Smith-Waterman score: 697; 86.0% identity (92.6% similar) in 121 aa overlap (1-121:1-117) 10 20 30 40 50 60 pF1KE5 MDRRSMGETESGDAFLDLKKPPASKCPHRYTKEELLDIKELPHSKQRPSCLSEKYDSDGV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 MDRRSMGETESGDAFLDLKKPPASKCPHRYTKEELLDIKELPHSKQRPSCLSEKYDSDGV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 WDPEKWHASLYPASGRSSPVESLKKELDTDRPSLVRRIVGIVECNGGVAEEDEVEVILAQ ::::::::::::::::::::::::::::::::::::::: : ..::...:.:. CCDS13 WDPEKWHASLYPASGRSSPVESLKKELDTDRPSLVRRIVDPRE----RVKEDDLDVVLSP 70 80 90 100 110 130 140 150 160 170 180 pF1KE5 EPAADQEVPRDAVLPEQSPGDFDFNEFFNLDKVPCLASMIEDVLGEGSVSASRFSRWFSN . 
CCDS13 QRRSFGGGCHVTAAVSSRRSGSPLEKDSDGLRLLGGRRIGSGRIISARTFEKDHRLSDKD
        120       130       140       150       160       170


811 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov  8 06:25:15 2016 done: Tue Nov  8 06:25:16 2016
 Total Scan time:  3.890 Total Display time:  0.030

Function used was FASTA [36.3.4 Apr, 2011]