FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE4080, 425 aa
  1>>>pF1KE4080 425 - 425 aa - 425 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 9.8784+/-0.000915; mu= -1.8703+/- 0.055
 mean_var=269.7348+/-55.553, 0's: 0 Z-trim(115.5): 11 B-trim: 115 in 2/51
 Lambda= 0.078092
 statistics sampled from 16060 (16069) to 16060 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.795), E-opt: 0.2 (0.494), width: 16
 Scan time: 3.410

The best scores are:                                      opt bits E(32554)
CCDS14205.1 ZNF645 gene_id:158506|Hs108|chrX      ( 425) 3037 354.9 8.3e-98
CCDS64754.1 CBLL1 gene_id:79872|Hs108|chr7        ( 490) 1319 161.4 1.7e-39
CCDS5747.1 CBLL1 gene_id:79872|Hs108|chr7         ( 491) 1316 161.1 2.2e-39

>>CCDS14205.1 ZNF645 gene_id:158506|Hs108|chrX (425 aa)
 initn: 3037 init1: 3037 opt: 3037 Z-score: 1869.4 bits: 354.9 E(32554): 8.3e-98
Smith-Waterman score: 3037; 99.8% identity (100.0% similar) in 425 aa overlap (1-425:1-425)

               10        20        30        40        50        60
pF1KE4 MNKMPAGEQECEYNKEGKYYSKGVKLVRKKKKIPGYRWGDIKINIIGEKDDLPIHFCDKC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 MNKMPAGEQECEYNKEGKYYSKGVKLVRKKKKIPGYRWGDIKINIIGEKDDLPIHFCDKC
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE4 DLPIKIYGRIIPCKHAFCYHCANLYDKVGYKVCPRCRYPVLRIEAHKRGSVFMCSIVQQC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 DLPIKIYGRIIPCKHAFCYHCANLYDKVGYKVCPRCRYPVLRIEAHKRGSVFMCSIVQQC
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE4 KRTYLSQKSLQAHIKRRHKRARKQVTSASLEKVRPHIAPPQTEISDIPKRLQDRDHLSYI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 KRTYLSQKSLQAHIKRRHKRARKQVTSASLEKVRPHIAPPQTEISDIPKRLQDRDHLSYI
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE4 PPEQHTMVSLPSVQHMLQEQHNQPHKDIQAPPPELSLSLPFPIQWETVSIFTRKHGNLTV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 PPEQHTMVSLPSVQHMLQEQHNQPHKDIQAPPPELSLSLPFPIQWETVSIFTRKHGNLTV 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 DHIQNNSDSGVKKPTPPDYYPECQSQPAVSSPHHIIPQKQHYAPPPSPSSPVNHQMPYPP ::::::::::.::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 DHIQNNSDSGAKKPTPPDYYPECQSQPAVSSPHHIIPQKQHYAPPPSPSSPVNHQMPYPP 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 QDVVTPNSVRSQVPALTTTYDPSSGYIIVKVPPDMNSPPLRAPQSQNGNPSASEFASHHY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 QDVVTPNSVRSQVPALTTTYDPSSGYIIVKVPPDMNSPPLRAPQSQNGNPSASEFASHHY 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 NLNILPQFTENQETLSPQFTQTDAMDHRRWPAWKRLSPCPPTRSPPPSTLHGRSHHSHQR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 NLNILPQFTENQETLSPQFTQTDAMDHRRWPAWKRLSPCPPTRSPPPSTLHGRSHHSHQR 370 380 390 400 410 420 pF1KE4 RHRRY ::::: CCDS14 RHRRY >>CCDS64754.1 CBLL1 gene_id:79872|Hs108|chr7 (490 aa) initn: 1571 init1: 917 opt: 1319 Z-score: 822.5 bits: 161.4 E(32554): 1.7e-39 Smith-Waterman score: 1562; 52.7% identity (71.7% similar) in 446 aa overlap (1-425:48-488) 10 20 pF1KE4 MNKMPA----GEQECEYNKEGKYYSKGVKL .:.::: :.. .::.: .: :: .: CCDS64 GGLDVRRRIPIKLISKQANKAKPAPRTQRTINRMPAKAPPGDEGFDYNEEERYDCKGGEL 20 30 40 50 60 70 30 40 50 60 70 80 pF1KE4 VRKKKKIPGYRWGDIKINIIGEKDDLPIHFCDKCDLPIKIYGRIIPCKHAFCYHCANLYD .....::. . :..:::.::::: :.:::::: ::::::::.:::::.::: :: :.. CCDS64 FANQRRFPGHLFWDFQINILGEKDDTPVHFCDKCGLPIKIYGRMIPCKHVFCYDCAILHE 80 90 100 110 120 130 90 100 110 120 130 140 pF1KE4 KVGYKVCPRCRYPVLRIEAHKRGSVFMCSIVQQCKRTYLSQKSLQAHIKRRHKRARKQVT : : :.:: : :: ::: :::.::::::: ::::::::..:::::..:: :: : :: CCDS64 KKGDKMCPGCSDPVQRIEQCTRGSLFMCSIVQGCKRTYLSQRDLQAHINHRHMRAGKPVT 140 150 160 170 180 190 150 160 170 180 190 200 pF1KE4 SASLEKVRPHIAPPQTEISDIPKRL---QDRDHLSYIPPEQHTMVSLPSVQHMLQEQHNQ ::::.:.: :::: ::: :.:. :. :.:.:::.:: :. : .::. 
.:..:: CCDS64 RASLENVHPPIAPPPTEI---PERFIMPPDKHHMSHIPPKQHIMMPPPPLQHVPHEHYNQ 200 210 220 230 240 250 210 220 230 240 250 pF1KE4 PHKDIQAPPPELSLSLPFP--IQWETVSIFTRKHGNLTVDHIQNNSDSGVKKPTPP---- ::.::.::: :::.. : : .. :: : ::::.:: . ::..:.::...: :: CCDS64 PHEDIRAPPAELSMAPPPPRSVSQETFRISTRKHSNLITVPIQDDSNSGAREPPPPAPAP 260 270 280 290 300 310 260 270 280 290 300 310 pF1KE4 -DYYPECQSQPAVSSPHHIIPQKQHYAPPPSPSSPVNHQMPYPPQDVVTPNSVRSQVPA- ..:: :.::.:: ::::.: .::::::: : :..: ::.::: . ::. : ::.: CCDS64 AHHHPEYQGQPVVSHPHHIMPPQQHYAPPPPPPPPISHPMPHPPQAAGTPHLVYSQAPPP 320 330 340 350 360 370 320 330 340 350 360 370 pF1KE4 -LTTT---YDPSSGYIIVKVPPDMNSPPLRAPQSQNGNPSASEFASHHYNLNILPQFTEN .:.. : :.::...:: :: :: : :.:.: .. :::: : ::::::. CCDS64 PMTSAPPPITPPPGHIIAQMPPYMNHPPPGPPPPQHGGPPVTAPPPHHYNPNSLPQFTED 380 390 400 410 420 430 380 390 400 410 420 pF1KE4 QETLSPQFTQTDAMDHRRWPAWKRLSPCPPTRSPPPST--LHGRSHHSHQRRHRRY : :::: ::: .:. ::: : : :: . ::: : : :: : :.: : CCDS64 QGTLSPPFTQPGGMSPGIWPA-PRGPPPPPRLQGPPSQTPLPG-PHHPDQTRYRPYYQ 440 450 460 470 480 490 >>CCDS5747.1 CBLL1 gene_id:79872|Hs108|chr7 (491 aa) initn: 1571 init1: 917 opt: 1316 Z-score: 820.7 bits: 161.1 E(32554): 2.2e-39 Smith-Waterman score: 1559; 52.8% identity (71.8% similar) in 447 aa overlap (1-425:48-489) 10 20 pF1KE4 MNKMPA----GEQE-CEYNKEGKYYSKGVK .:.::: :..: .::.: .: :: . CCDS57 GGLDVRRRIPIKLISKQANKAKPAPRTQRTINRMPAKAPPGDEEGFDYNEEERYDCKGGE 20 30 40 50 60 70 30 40 50 60 70 80 pF1KE4 LVRKKKKIPGYRWGDIKINIIGEKDDLPIHFCDKCDLPIKIYGRIIPCKHAFCYHCANLY : .....::. . :..:::.::::: :.:::::: ::::::::.:::::.::: :: :. CCDS57 LFANQRRFPGHLFWDFQINILGEKDDTPVHFCDKCGLPIKIYGRMIPCKHVFCYDCAILH 80 90 100 110 120 130 90 100 110 120 130 140 pF1KE4 DKVGYKVCPRCRYPVLRIEAHKRGSVFMCSIVQQCKRTYLSQKSLQAHIKRRHKRARKQV .: : :.:: : :: ::: :::.::::::: ::::::::..:::::..:: :: : : CCDS57 EKKGDKMCPGCSDPVQRIEQCTRGSLFMCSIVQGCKRTYLSQRDLQAHINHRHMRAGKPV 140 150 160 170 180 190 150 160 170 180 190 200 pF1KE4 TSASLEKVRPHIAPPQTEISDIPKRL---QDRDHLSYIPPEQHTMVSLPSVQHMLQEQHN : ::::.:.: :::: ::: :.:. 
:. :.:.:::.:: :. : .::. .:..: CCDS57 TRASLENVHPPIAPPPTEI---PERFIMPPDKHHMSHIPPKQHIMMPPPPLQHVPHEHYN 200 210 220 230 240 250 210 220 230 240 250 pF1KE4 QPHKDIQAPPPELSLSLPFP--IQWETVSIFTRKHGNLTVDHIQNNSDSGVKKPTPP--- :::.::.::: :::.. : : .. :: : ::::.:: . ::..:.::...: :: CCDS57 QPHEDIRAPPAELSMAPPPPRSVSQETFRISTRKHSNLITVPIQDDSNSGAREPPPPAPA 260 270 280 290 300 310 260 270 280 290 300 310 pF1KE4 --DYYPECQSQPAVSSPHHIIPQKQHYAPPPSPSSPVNHQMPYPPQDVVTPNSVRSQVPA ..:: :.::.:: ::::.: .::::::: : :..: ::.::: . ::. : ::.: CCDS57 PAHHHPEYQGQPVVSHPHHIMPPQQHYAPPPPPPPPISHPMPHPPQAAGTPHLVYSQAPP 320 330 340 350 360 370 320 330 340 350 360 370 pF1KE4 --LTTT---YDPSSGYIIVKVPPDMNSPPLRAPQSQNGNPSASEFASHHYNLNILPQFTE .:.. : :.::...:: :: :: : :.:.: .. :::: : :::::: CCDS57 PPMTSAPPPITPPPGHIIAQMPPYMNHPPPGPPPPQHGGPPVTAPPPHHYNPNSLPQFTE 380 390 400 410 420 430 380 390 400 410 420 pF1KE4 NQETLSPQFTQTDAMDHRRWPAWKRLSPCPPTRSPPPST--LHGRSHHSHQRRHRRY .: :::: ::: .:. ::: : : :: . ::: : : :: : :.: : CCDS57 DQGTLSPPFTQPGGMSPGIWPA-PRGPPPPPRLQGPPSQTPLPG-PHHPDQTRYRPYYQ 440 450 460 470 480 490 425 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 16:23:21 2016 done: Mon Nov 7 16:23:22 2016 Total Scan time: 3.410 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]