FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3687, 485 aa 1>>>pF1KE3687 485 - 485 aa - 485 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 9.3582+/-0.00159; mu= -0.5982+/- 0.091 mean_var=427.6470+/-104.810, 0's: 0 Z-trim(107.1): 553 B-trim: 438 in 1/51 Lambda= 0.062020 statistics sampled from 8725 (9399) to 8725 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.635), E-opt: 0.2 (0.289), width: 16 Scan time: 2.650 The best scores are: opt bits E(32554) CCDS3227.1 ZNF639 gene_id:51193|Hs108|chr3 ( 485) 3386 318.3 1.2e-86 CCDS55390.1 ZFX gene_id:7543|Hs108|chrX ( 576) 586 67.9 3.3e-11 CCDS14211.1 ZFX gene_id:7543|Hs108|chrX ( 805) 586 68.1 4.1e-11 CCDS83461.1 ZFX gene_id:7543|Hs108|chrX ( 844) 586 68.1 4.2e-11 CCDS48200.1 ZFY gene_id:7544|Hs108|chrY ( 610) 582 67.6 4.4e-11 CCDS48201.1 ZFY gene_id:7544|Hs108|chrY ( 724) 582 67.7 4.9e-11 CCDS14774.1 ZFY gene_id:7544|Hs108|chrY ( 801) 582 67.7 5.2e-11 >>CCDS3227.1 ZNF639 gene_id:51193|Hs108|chr3 (485 aa) initn: 3386 init1: 3386 opt: 3386 Z-score: 1669.4 bits: 318.3 E(32554): 1.2e-86 Smith-Waterman score: 3386; 100.0% identity (100.0% similar) in 485 aa overlap (1-485:1-485) 10 20 30 40 50 60 pF1KE3 MNEYPKKRKRKTLHPSRYSDSSGISRIADGFNGIFSDHCYSVCSMRQPDLKYFDNKDDDS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 MNEYPKKRKRKTLHPSRYSDSSGISRIADGFNGIFSDHCYSVCSMRQPDLKYFDNKDDDS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 DTETSNDLPKFADGIKARNRNQNYLVPSPVLRILDHTAFSTEKSADIVICDEECDSPESV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 DTETSNDLPKFADGIKARNRNQNYLVPSPVLRILDHTAFSTEKSADIVICDEECDSPESV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 NQQTQEESPIEVHTAEDVPIAVEVHAISEDYDIETENNSSESLQDQTDEEPPAKLCKILD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 NQQTQEESPIEVHTAEDVPIAVEVHAISEDYDIETENNSSESLQDQTDEEPPAKLCKILD 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 KSQALNVTAQQKWPLLRANSSGLYKCELCEFNSKYFSDLKQHMILKHKRTDSNVCRVCKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 KSQALNVTAQQKWPLLRANSSGLYKCELCEFNSKYFSDLKQHMILKHKRTDSNVCRVCKE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 SFSTNMLLIEHAKLHEEDPYICKYCDYKTVIFENLSQHIADTHFSDHLYWCEQCDVQFSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 SFSTNMLLIEHAKLHEEDPYICKYCDYKTVIFENLSQHIADTHFSDHLYWCEQCDVQFSS 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 SSELYLHFQEHSCDEQYLCQFCEHETNDPEDLHSHVVNEHACKLIELSDKYNNGEHGQYS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 SSELYLHFQEHSCDEQYLCQFCEHETNDPEDLHSHVVNEHACKLIELSDKYNNGEHGQYS 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE3 LLSKITFDKCKNFFVCQVCGFRSRLHTNVNRHVAIEHTKIFPHVCDDCGKGFSSMLEYCK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 LLSKITFDKCKNFFVCQVCGFRSRLHTNVNRHVAIEHTKIFPHVCDDCGKGFSSMLEYCK 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE3 HLNSHLSEGIYLCQYCEYSTGQIEDLKIHLDFKHSADLPHKCSDCLMRFGNERELISHLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 HLNSHLSEGIYLCQYCEYSTGQIEDLKIHLDFKHSADLPHKCSDCLMRFGNERELISHLP 430 440 450 460 470 480 pF1KE3 VHETT ::::: CCDS32 VHETT >>CCDS55390.1 ZFX gene_id:7543|Hs108|chrX (576 aa) initn: 1152 init1: 472 opt: 586 Z-score: 314.6 bits: 67.9 E(32554): 3.3e-11 Smith-Waterman score: 586; 29.2% identity (58.7% similar) in 281 aa overlap (203-482:289-569) 180 190 200 210 220 230 pF1KE3 AKLCKILDKSQALNVTAQQKWPLLRANSSGLYKCELCEFNSKYFSDLKQHMILKHKRTDS ..::..::... . :..:.. :... CCDS55 IECDECGKHFSHAGALFTHKMVHKEKGANKMHKCKFCEYETAEQGLLNRHLLAVHSKNFP 260 270 280 290 300 310 240 250 260 270 280 290 pF1KE3 NVCRVCKESFSTNMLLIEHAKLHE-EDPYICKYCDYKTVIFENLSQHIADTHFSDHLYWC ..: : ..: : .: ..: : :: :.::.:... ::. :. : .. . : CCDS55 HICVECGKGFRHPSELKKHMRIHTGEKPYQCQYCEYRSADSSNLKTHVKTKHSKEMPFKC 320 330 340 350 360 370 300 310 320 330 340 350 pF1KE3 EQCDVQFSSSSELYLHFQEHSCDEQYLCQFCEHETNDPEDLHSHVVNEHACKLIELSDKY . : . ::...:. : :. .. . : :.:.... ::. :... :. . : CCDS55 DICLLTFSDTKEVQQHALIHQESKTHQCLHCDHKSSNSSDLKRHIISVHTKDYPHKCDMC 380 390 400 410 420 430 360 370 380 390 400 410 pF1KE3 NNGEHGQYSLLSKITFDKCKNFFVCQVCGFRSRLHTNVNRHVAIEHTKIFPHVCDDCGKG ..: : : .... : :.. :. : :. ..::. ::: .: : : :: CCDS55 DKGFHRPSELKKHVAAHKGKKMHQCRHCDFKIADPFVLSRHILSVHTKDLPFRCKRCRKG 440 450 460 470 480 490 420 430 440 450 460 470 pF1KE3 FSSMLEYCKHLNSHLSEGIYLCQYCEYSTGQIEDLKIHLDFKHSADLPHKCSDCLMRFGN : .. : ::...: .. .: :.:::::: . .: :. :. : ::.: : : CCDS55 FRQQSELKKHMKTHSGRKVYQCEYCEYSTTDASGFKRHVISIHTKDYPHRCEYCKKGFRR 500 510 520 530 540 550 480 pF1KE3 ERELISHLPVHETT : .:. : CCDS55 PSEKNQHIMRHHKEVGLP 560 570 >>CCDS14211.1 ZFX gene_id:7543|Hs108|chrX (805 aa) initn: 1152 init1: 472 opt: 586 Z-score: 313.1 bits: 68.1 E(32554): 4.1e-11 Smith-Waterman score: 586; 29.2% identity (58.7% similar) in 281 aa overlap (203-482:518-798) 180 190 200 210 220 230 pF1KE3 AKLCKILDKSQALNVTAQQKWPLLRANSSGLYKCELCEFNSKYFSDLKQHMILKHKRTDS ..::..::... . :..:.. :... CCDS14 IECDECGKHFSHAGALFTHKMVHKEKGANKMHKCKFCEYETAEQGLLNRHLLAVHSKNFP 490 500 510 520 530 540 240 250 260 270 280 290 pF1KE3 NVCRVCKESFSTNMLLIEHAKLHE-EDPYICKYCDYKTVIFENLSQHIADTHFSDHLYWC ..: : ..: : .: ..: : :: :.::.:... ::. :. : .. . : CCDS14 HICVECGKGFRHPSELKKHMRIHTGEKPYQCQYCEYRSADSSNLKTHVKTKHSKEMPFKC 550 560 570 580 590 600 300 310 320 330 340 350 pF1KE3 EQCDVQFSSSSELYLHFQEHSCDEQYLCQFCEHETNDPEDLHSHVVNEHACKLIELSDKY . : . ::...:. : :. .. . : :.:.... ::. :... :. . : CCDS14 DICLLTFSDTKEVQQHALIHQESKTHQCLHCDHKSSNSSDLKRHIISVHTKDYPHKCDMC 610 620 630 640 650 660 360 370 380 390 400 410 pF1KE3 NNGEHGQYSLLSKITFDKCKNFFVCQVCGFRSRLHTNVNRHVAIEHTKIFPHVCDDCGKG ..: : : .... : :.. :. : :. ..::. ::: .: : : :: CCDS14 DKGFHRPSELKKHVAAHKGKKMHQCRHCDFKIADPFVLSRHILSVHTKDLPFRCKRCRKG 670 680 690 700 710 720 420 430 440 450 460 470 pF1KE3 FSSMLEYCKHLNSHLSEGIYLCQYCEYSTGQIEDLKIHLDFKHSADLPHKCSDCLMRFGN : .. : ::...: .. .: :.:::::: . .: :. :. : ::.: : : CCDS14 FRQQSELKKHMKTHSGRKVYQCEYCEYSTTDASGFKRHVISIHTKDYPHRCEYCKKGFRR 730 740 750 760 770 780 480 pF1KE3 ERELISHLPVHETT : .:. : CCDS14 PSEKNQHIMRHHKEVGLP 790 800 >>CCDS83461.1 ZFX gene_id:7543|Hs108|chrX (844 aa) initn: 1152 init1: 472 opt: 586 Z-score: 312.9 bits: 68.1 E(32554): 4.2e-11 Smith-Waterman score: 586; 29.2% identity (58.7% similar) in 281 aa overlap (203-482:557-837) 180 190 200 210 220 230 pF1KE3 AKLCKILDKSQALNVTAQQKWPLLRANSSGLYKCELCEFNSKYFSDLKQHMILKHKRTDS ..::..::... . :..:.. :... CCDS83 IECDECGKHFSHAGALFTHKMVHKEKGANKMHKCKFCEYETAEQGLLNRHLLAVHSKNFP 530 540 550 560 570 580 240 250 260 270 280 290 pF1KE3 NVCRVCKESFSTNMLLIEHAKLHE-EDPYICKYCDYKTVIFENLSQHIADTHFSDHLYWC ..: : ..: : .: ..: : :: :.::.:... ::. :. : .. . : CCDS83 HICVECGKGFRHPSELKKHMRIHTGEKPYQCQYCEYRSADSSNLKTHVKTKHSKEMPFKC 590 600 610 620 630 640 300 310 320 330 340 350 pF1KE3 EQCDVQFSSSSELYLHFQEHSCDEQYLCQFCEHETNDPEDLHSHVVNEHACKLIELSDKY . : . ::...:. : :. .. . : :.:.... ::. :... :. . : CCDS83 DICLLTFSDTKEVQQHALIHQESKTHQCLHCDHKSSNSSDLKRHIISVHTKDYPHKCDMC 650 660 670 680 690 700 360 370 380 390 400 410 pF1KE3 NNGEHGQYSLLSKITFDKCKNFFVCQVCGFRSRLHTNVNRHVAIEHTKIFPHVCDDCGKG ..: : : .... : :.. :. : :. ..::. ::: .: : : :: CCDS83 DKGFHRPSELKKHVAAHKGKKMHQCRHCDFKIADPFVLSRHILSVHTKDLPFRCKRCRKG 710 720 730 740 750 760 420 430 440 450 460 470 pF1KE3 FSSMLEYCKHLNSHLSEGIYLCQYCEYSTGQIEDLKIHLDFKHSADLPHKCSDCLMRFGN : .. : ::...: .. .: :.:::::: . .: :. :. : ::.: : : CCDS83 FRQQSELKKHMKTHSGRKVYQCEYCEYSTTDASGFKRHVISIHTKDYPHRCEYCKKGFRR 770 780 790 800 810 820 480 pF1KE3 ERELISHLPVHETT : .:. : CCDS83 PSEKNQHIMRHHKEVGLP 830 840 >>CCDS48200.1 ZFY gene_id:7544|Hs108|chrY (610 aa) initn: 759 init1: 469 opt: 582 Z-score: 312.4 bits: 67.6 E(32554): 4.4e-11 Smith-Waterman score: 582; 29.5% identity (58.7% similar) in 281 aa overlap (203-482:323-603) 180 190 200 210 220 230 pF1KE3 AKLCKILDKSQALNVTAQQKWPLLRANSSGLYKCELCEFNSKYFSDLKQHMILKHKRTDS ..::..::... . :..:.. :... CCDS48 IECDECGKHFSHAGALFTHKMVHKEKGANKMHKCKFCEYETAEQGLLNRHLLAVHSKNFP 300 310 320 330 340 350 240 250 260 270 280 290 pF1KE3 NVCRVCKESFSTNMLLIEHAKLHE-EDPYICKYCDYKTVIFENLSQHIADTHFSDHLYWC ..: : ..: : .: ..: : :: :.::.:... ::. :: : .. . : CCDS48 HICVECGKGFRHPSELRKHMRIHTGEKPYQCQYCEYRSADSSNLKTHIKTKHSKEMPFKC 360 370 380 390 400 410 300 310 320 330 340 350 pF1KE3 EQCDVQFSSSSELYLHFQEHSCDEQYLCQFCEHETNDPEDLHSHVVNEHACKLIELSDKY . : . ::...:. : :. .. . : :.:.... ::. ::.. :. . . CCDS48 DICLLTFSDTKEVQQHTLVHQESKTHQCLHCDHKSSNSSDLKRHVISVHTKDYPHKCEMC 420 430 440 450 460 470 360 370 380 390 400 410 pF1KE3 NNGEHGQYSLLSKITFDKCKNFFVCQVCGFRSRLHTNVNRHVAIEHTKIFPHVCDDCGKG ..: : : .... : :.. :. : :. ..::. ::: .: : : :: CCDS48 EKGFHRPSELKKHVAVHKGKKMHQCRHCDFKIADPFVLSRHILSVHTKDLPFRCKRCRKG 480 490 500 510 520 530 420 430 440 450 460 470 pF1KE3 FSSMLEYCKHLNSHLSEGIYLCQYCEYSTGQIEDLKIHLDFKHSADLPHKCSDCLMRFGN : .. : ::...: .. .: :.:::::: . .: :. :. : ::.: : : CCDS48 FRQQNELKKHMKTHSGRKVYQCEYCEYSTTDASGFKRHVISIHTKDYPHRCEYCKKGFRR 540 550 560 570 580 590 480 pF1KE3 ERELISHLPVHETT : .:. : CCDS48 PSEKNQHIMRHHKEVGLP 600 610 >>CCDS48201.1 ZFY gene_id:7544|Hs108|chrY (724 aa) initn: 759 init1: 469 opt: 582 Z-score: 311.6 bits: 67.7 E(32554): 4.9e-11 Smith-Waterman score: 582; 29.5% identity (58.7% similar) in 281 aa overlap (203-482:437-717) 180 190 200 210 220 230 pF1KE3 AKLCKILDKSQALNVTAQQKWPLLRANSSGLYKCELCEFNSKYFSDLKQHMILKHKRTDS ..::..::... . :..:.. :... CCDS48 IECDECGKHFSHAGALFTHKMVHKEKGANKMHKCKFCEYETAEQGLLNRHLLAVHSKNFP 410 420 430 440 450 460 240 250 260 270 280 290 pF1KE3 NVCRVCKESFSTNMLLIEHAKLHE-EDPYICKYCDYKTVIFENLSQHIADTHFSDHLYWC ..: : ..: : .: ..: : :: :.::.:... ::. :: : .. . : CCDS48 HICVECGKGFRHPSELRKHMRIHTGEKPYQCQYCEYRSADSSNLKTHIKTKHSKEMPFKC 470 480 490 500 510 520 300 310 320 330 340 350 pF1KE3 EQCDVQFSSSSELYLHFQEHSCDEQYLCQFCEHETNDPEDLHSHVVNEHACKLIELSDKY . : . ::...:. : :. .. . : :.:.... ::. ::.. :. . . CCDS48 DICLLTFSDTKEVQQHTLVHQESKTHQCLHCDHKSSNSSDLKRHVISVHTKDYPHKCEMC 530 540 550 560 570 580 360 370 380 390 400 410 pF1KE3 NNGEHGQYSLLSKITFDKCKNFFVCQVCGFRSRLHTNVNRHVAIEHTKIFPHVCDDCGKG ..: : : .... : :.. :. : :. ..::. ::: .: : : :: CCDS48 EKGFHRPSELKKHVAVHKGKKMHQCRHCDFKIADPFVLSRHILSVHTKDLPFRCKRCRKG 590 600 610 620 630 640 420 430 440 450 460 470 pF1KE3 FSSMLEYCKHLNSHLSEGIYLCQYCEYSTGQIEDLKIHLDFKHSADLPHKCSDCLMRFGN : .. : ::...: .. .: :.:::::: . .: :. :. : ::.: : : CCDS48 FRQQNELKKHMKTHSGRKVYQCEYCEYSTTDASGFKRHVISIHTKDYPHRCEYCKKGFRR 650 660 670 680 690 700 480 pF1KE3 ERELISHLPVHETT : .:. : CCDS48 PSEKNQHIMRHHKEVGLP 710 720 >>CCDS14774.1 ZFY gene_id:7544|Hs108|chrY (801 aa) initn: 759 init1: 469 opt: 582 Z-score: 311.2 bits: 67.7 E(32554): 5.2e-11 Smith-Waterman score: 582; 29.5% identity (58.7% similar) in 281 aa overlap (203-482:514-794) 180 190 200 210 220 230 pF1KE3 AKLCKILDKSQALNVTAQQKWPLLRANSSGLYKCELCEFNSKYFSDLKQHMILKHKRTDS ..::..::... . :..:.. :... CCDS14 IECDECGKHFSHAGALFTHKMVHKEKGANKMHKCKFCEYETAEQGLLNRHLLAVHSKNFP 490 500 510 520 530 540 240 250 260 270 280 290 pF1KE3 NVCRVCKESFSTNMLLIEHAKLHE-EDPYICKYCDYKTVIFENLSQHIADTHFSDHLYWC ..: : ..: : .: ..: : :: :.::.:... ::. :: : .. . : CCDS14 HICVECGKGFRHPSELRKHMRIHTGEKPYQCQYCEYRSADSSNLKTHIKTKHSKEMPFKC 550 560 570 580 590 600 300 310 320 330 340 350 pF1KE3 EQCDVQFSSSSELYLHFQEHSCDEQYLCQFCEHETNDPEDLHSHVVNEHACKLIELSDKY . : . ::...:. : :. .. . : :.:.... ::. ::.. :. . . CCDS14 DICLLTFSDTKEVQQHTLVHQESKTHQCLHCDHKSSNSSDLKRHVISVHTKDYPHKCEMC 610 620 630 640 650 660 360 370 380 390 400 410 pF1KE3 NNGEHGQYSLLSKITFDKCKNFFVCQVCGFRSRLHTNVNRHVAIEHTKIFPHVCDDCGKG ..: : : .... : :.. :. : :. ..::. ::: .: : : :: CCDS14 EKGFHRPSELKKHVAVHKGKKMHQCRHCDFKIADPFVLSRHILSVHTKDLPFRCKRCRKG 670 680 690 700 710 720 420 430 440 450 460 470 pF1KE3 FSSMLEYCKHLNSHLSEGIYLCQYCEYSTGQIEDLKIHLDFKHSADLPHKCSDCLMRFGN : .. : ::...: .. .: :.:::::: . .: :. :. : ::.: : : CCDS14 FRQQNELKKHMKTHSGRKVYQCEYCEYSTTDASGFKRHVISIHTKDYPHRCEYCKKGFRR 730 740 750 760 770 780 480 pF1KE3 ERELISHLPVHETT : .:. : CCDS14 PSEKNQHIMRHHKEVGLP 790 800 485 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 17:38:39 2016 done: Sun Nov 6 17:38:39 2016 Total Scan time: 2.650 Total Display time: 0.050 Function used was FASTA [36.3.4 Apr, 2011]