FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5567, 433 aa 1>>>pF1KE5567 433 - 433 aa - 433 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.1772+/-0.000732; mu= 13.8562+/- 0.044 mean_var=96.9139+/-19.216, 0's: 0 Z-trim(111.2): 84 B-trim: 0 in 0/51 Lambda= 0.130281 statistics sampled from 12103 (12191) to 12103 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.737), E-opt: 0.2 (0.374), width: 16 Scan time: 2.960 The best scores are: opt bits E(32554) CCDS41300.1 ZSCAN20 gene_id:7579|Hs108|chr1 (1043) 1531 298.1 2.7e-80 CCDS66921.1 ZSCAN32 gene_id:54925|Hs108|chr16 ( 697) 850 170.0 6.5e-42 CCDS10095.2 ZSCAN29 gene_id:146050|Hs108|chr15 ( 852) 725 146.6 9.1e-35 CCDS10495.1 ZNF213 gene_id:7760|Hs108|chr16 ( 459) 562 115.8 9.2e-26 CCDS32410.1 ZKSCAN2 gene_id:342357|Hs108|chr16 ( 967) 542 112.2 2.3e-24 CCDS10499.1 ZNF263 gene_id:10127|Hs108|chr16 ( 683) 511 106.3 9.7e-23 CCDS76814.1 ZNF500 gene_id:26048|Hs108|chr16 ( 441) 507 105.4 1.2e-22 CCDS32383.1 ZNF500 gene_id:26048|Hs108|chr16 ( 480) 507 105.4 1.2e-22 CCDS4649.1 ZSCAN31 gene_id:64288|Hs108|chr6 ( 406) 481 100.5 3.2e-21 CCDS5667.1 ZKSCAN5 gene_id:23660|Hs108|chr7 ( 839) 477 100.0 9.6e-21 CCDS14649.1 ZNF449 gene_id:203523|Hs108|chrX ( 518) 462 97.0 4.6e-20 CCDS32814.1 ZNF397 gene_id:84307|Hs108|chr18 ( 275) 458 96.1 4.7e-20 CCDS82246.1 ZNF396 gene_id:252884|Hs108|chr18 ( 210) 456 95.6 4.9e-20 CCDS11913.1 ZNF396 gene_id:252884|Hs108|chr18 ( 333) 456 95.7 7.1e-20 CCDS82247.1 ZNF396 gene_id:252884|Hs108|chr18 ( 335) 456 95.7 7.1e-20 CCDS34355.1 ZBED9 gene_id:114821|Hs108|chr6 (1325) 462 97.3 9.8e-20 CCDS11912.1 ZNF24 gene_id:7572|Hs108|chr18 ( 368) 453 95.2 1.1e-19 CCDS4643.1 ZNF165 gene_id:7718|Hs108|chr6 ( 485) 453 95.3 1.4e-19 CCDS4644.1 ZSCAN16 gene_id:80345|Hs108|chr6 ( 348) 449 94.4 1.8e-19 CCDS77175.1 ZNF24 gene_id:7572|Hs108|chr18 ( 193) 445 93.5 1.9e-19 CCDS45852.1 ZNF397 gene_id:84307|Hs108|chr18 ( 534) 451 94.9 2e-19 CCDS11068.1 ZNF232 gene_id:7775|Hs108|chr17 ( 444) 446 93.9 3.3e-19 CCDS82043.1 ZNF232 gene_id:7775|Hs108|chr17 ( 435) 441 93.0 6.2e-19 CCDS8443.1 ZNF202 gene_id:7753|Hs108|chr11 ( 648) 440 92.9 9.7e-19 CCDS5666.1 ZNF394 gene_id:84124|Hs108|chr7 ( 561) 437 92.3 1.3e-18 CCDS34698.1 ZKSCAN1 gene_id:7586|Hs108|chr7 ( 563) 429 90.8 3.6e-18 CCDS5681.1 ZSCAN21 gene_id:7589|Hs108|chr7 ( 473) 427 90.4 4.1e-18 CCDS42427.1 ZSCAN30 gene_id:100101467|Hs108|chr18 ( 494) 424 89.8 6.3e-18 CCDS5671.2 ZSCAN25 gene_id:221785|Hs108|chr7 ( 544) 424 89.9 6.8e-18 CCDS12975.1 ZSCAN22 gene_id:342945|Hs108|chr19 ( 491) 423 89.7 7.1e-18 CCDS1631.1 ZNF496 gene_id:84838|Hs108|chr1 ( 587) 424 89.9 7.2e-18 CCDS59427.1 MZF1 gene_id:7593|Hs108|chr19 ( 290) 418 88.6 8.9e-18 CCDS2714.1 ZKSCAN7 gene_id:55888|Hs108|chr3 ( 276) 414 87.8 1.4e-17 CCDS12988.1 MZF1 gene_id:7593|Hs108|chr19 ( 734) 418 88.8 1.9e-17 CCDS4645.1 ZKSCAN8 gene_id:7745|Hs108|chr6 ( 578) 415 88.2 2.3e-17 CCDS4647.1 ZKSCAN4 gene_id:387032|Hs108|chr6 ( 545) 413 87.8 2.8e-17 CCDS47393.1 ZSCAN23 gene_id:222696|Hs108|chr6 ( 389) 410 87.1 3.2e-17 CCDS2717.1 ZNF197 gene_id:10168|Hs108|chr3 (1029) 416 88.5 3.2e-17 CCDS2715.1 ZKSCAN7 gene_id:55888|Hs108|chr3 ( 754) 414 88.1 3.2e-17 CCDS35105.1 ZNF483 gene_id:158399|Hs108|chr9 ( 256) 405 86.1 4.4e-17 CCDS33743.1 ZNF197 gene_id:10168|Hs108|chr3 ( 267) 405 86.1 4.5e-17 CCDS4650.1 ZKSCAN3 gene_id:80317|Hs108|chr6 ( 538) 405 86.3 8e-17 CCDS69349.1 ZKSCAN1 gene_id:7586|Hs108|chr7 ( 527) 403 85.9 1e-16 CCDS35106.1 ZNF483 gene_id:158399|Hs108|chr9 ( 744) 405 86.4 1e-16 CCDS76957.1 ZNF18 gene_id:7566|Hs108|chr17 ( 548) 403 85.9 1.1e-16 CCDS32568.1 ZNF18 gene_id:7566|Hs108|chr17 ( 549) 403 85.9 1.1e-16 CCDS10504.1 ZNF174 gene_id:7727|Hs108|chr16 ( 407) 397 84.7 1.8e-16 CCDS32380.1 ZNF174 gene_id:7727|Hs108|chr16 ( 234) 391 83.4 2.5e-16 CCDS14648.1 ZNF75D gene_id:7626|Hs108|chrX ( 510) 395 84.4 2.8e-16 CCDS56407.1 ZSCAN9 gene_id:7746|Hs108|chr6 ( 445) 394 84.2 2.9e-16 >>CCDS41300.1 ZSCAN20 gene_id:7579|Hs108|chr1 (1043 aa) initn: 2976 init1: 1530 opt: 1531 Z-score: 1555.2 bits: 298.1 E(32554): 2.7e-80 Smith-Waterman score: 2653; 87.3% identity (87.5% similar) in 465 aa overlap (23-432:23-487) 10 20 30 40 50 60 pF1KE5 MAMALELQAQASPQPEPEELLIVKLEEDSWGSESKLWEKDRGSVSGPEASRQRFRQFQYR :::::::::::::::::::::::::::::::::::::: CCDS41 MAMALELQAQASPQPEPEELLIVKLEEDSWGSESKLWEKDRGSVSGPEASRQRFRQFQYR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 DAAGPHEAFSQLWALCCRWLRPEIRLKEQILELLVLEQFLTILPREVQTWVQARHPESGE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 DAAGPHEAFSQLWALCCRWLRPEIRLKEQILELLVLEQFLTILPREVQTWVQARHPESGE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 EAVALVEDWHRETRTAGQSGLELHTEETRPLKTGEEAQSFQLQPVDPWPEGQSQKKGVKN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 EAVALVEDWHRETRTAGQSGLELHTEETRPLKTGEEAQSFQLQPVDPWPEGQSQKKGVKN 130 140 150 160 170 180 190 200 pF1KE5 TCPDLPNHLNAEVAPQPLKES--------------------------------------- ::::::::::::::::::::: CCDS41 TCPDLPNHLNAEVAPQPLKESAVLTPRVPTLPKMGSVGDWEVTAESQEALGPGKHAEKEL 190 200 210 220 230 240 210 220 230 240 pF1KE5 ---------------GVPVSKPSNTSEKEQGPEFWGLSLINSGKRSTADYSLDNEPAQAL ::::::::::::::::::::::::::::::::::::::::::::: CCDS41 CKDPPGDDCGNSVCLGVPVSKPSNTSEKEQGPEFWGLSLINSGKRSTADYSLDNEPAQAL 250 260 270 280 290 300 250 260 270 280 290 300 pF1KE5 TWRDSRAWEEQYQWDVEDMKVSGVHWGYEETKTFLAILSESPFSEKLRTCHQNRQVYRAI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 TWRDSRAWEEQYQWDVEDMKVSGVHWGYEETKTFLAILSESPFSEKLRTCHQNRQVYRAI 310 320 330 340 350 360 310 320 330 340 350 360 pF1KE5 AEQLRARGFLRTLEQCRYRVKNLLRNYRKAKSSHPPGTCPFYEELEALVRARTAIRATDG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 AEQLRARGFLRTLEQCRYRVKNLLRNYRKAKSSHPPGTCPFYEELEALVRARTAIRATDG 370 380 390 400 410 420 370 380 390 400 410 420 pF1KE5 PGEAVALPRLGDSDAEMDEQEEGGWDPEEMAEDCNGAGLVNVESTQGPRIAGAPALFQSR ::::::::::: :::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 PGEAVALPRLGYSDAEMDEQEEGGWDPEEMAEDCNGAGLVNVESTQGPRIAGAPALFQSR 430 440 450 460 470 480 430 pF1KE5 I-GKNMGV : : . : CCDS41 IAGVHWGYEETKAFLAILSESPFSEKLRTCHQNSQVYRAIAERLCALGFLRTLEQCRYRF 490 500 510 520 530 540 >-- initn: 616 init1: 589 opt: 589 Z-score: 598.3 bits: 121.1 E(32554): 5.3e-27 Smith-Waterman score: 589; 77.8% identity (88.0% similar) in 117 aa overlap (274-390:488-604) 250 260 270 280 290 300 pF1KE5 QALTWRDSRAWEEQYQWDVEDMKVSGVHWGYEETKTFLAILSESPFSEKLRTCHQNRQVY :::::.:::::::::::::::::::: ::: CCDS41 GLVNVESTQGPRIAGAPALFQSRIAGVHWGYEETKAFLAILSESPFSEKLRTCHQNSQVY 460 470 480 490 500 510 310 320 330 340 350 360 pF1KE5 RAIAEQLRARGFLRTLEQCRYRVKNLLRNYRKAKSSHPPGTCPFYEELEALVRARTAIRA :::::.: : :::::::::::: :::::.:::::::::::::::::::..:.:::.:.:: CCDS41 RAIAERLCALGFLRTLEQCRYRFKNLLRSYRKAKSSHPPGTCPFYEELDSLMRARAAVRA 520 530 540 550 560 570 370 380 390 400 410 420 pF1KE5 TDGPGEAVALPRLGDSDAEMDEQEEGGWDPEEMAEDCNGAGLVNVESTQGPRIAGAPALF ::..::: :.:.:: : :: : CCDS41 MGTVREAAGLPRCGQSSAETDAQEAWGEVANEDAVKPSTLCPKAPDMGFEMRHEDEDQIS 580 590 600 610 620 630 >>CCDS66921.1 ZSCAN32 gene_id:54925|Hs108|chr16 (697 aa) initn: 774 init1: 449 opt: 850 Z-score: 865.9 bits: 170.0 E(32554): 6.5e-42 Smith-Waterman score: 850; 39.7% identity (62.9% similar) in 418 aa overlap (26-428:16-412) 10 20 30 40 50 60 pF1KE5 MAMALELQAQASPQPEPEELLIVKLEEDSWGSESKLWEKDRGSVSGPEASRQRFRQFQYR .. . :..: : .:. :::::::::: :. CCDS66 MMAAVKSTEAHPSSNKDPTQGQKSAL----QGNSPDSEASRQRFRQFCYQ 10 20 30 40 70 80 90 100 110 120 pF1KE5 DAAGPHEAFSQLWALCCRWLRPEIRLKEQILELLVLEQFLTILPREVQTWVQARHPESGE ...:::::::.:: :::.::::. . ::.:::::::::::::::.:.::::. .:::.:: CCDS66 EVTGPHEAFSKLWELCCQWLRPKTHSKEEILELLVLEQFLTILPEEIQTWVREQHPENGE 50 60 70 80 90 100 130 140 150 160 170 pF1KE5 EAVALVEDWHR---ETRTAGQSGLELHTEETRPLKTGEEAQSFQLQPVDPWPEGQSQKKG :::::::: .: . ... :.. .: :: . .:. : . . :: . :: CCDS66 EAVALVEDVQRAPGQQVLDSEKDLKVLMKEMAPLGATRESLRSQWKQ-EVQPE-EPTFKG 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE5 VKNTCPDLPNHLNAEVAPQPLKESGVPVSKPSNTS--EKEQGPEFWGLSLINSGKRSTAD ... .. .: .::: .: . :.::. ..: : : . . . :.. CCDS66 SQSSHQRPGEQSEAWLAPQ------APRNLPQNTGLHDQETGAVVWTAGSQGPAMRDNRA 170 180 190 200 210 240 250 260 270 280 pF1KE5 YSLDNE------PAQALTWRDSRAWEEQYQWDVEDMKVSGVHWGYEETKTFLAILSESPF :: .. ::: .: . : . : . ..:: ::::::::.::::: : : CCDS66 VSLCQQEWMCPGPAQRALYRGAT----QRK-DSHVSLATGVPWGYEETKTLLAILSSSQF 220 230 240 250 260 270 290 300 310 320 330 340 pF1KE5 SEKLRTCHQNRQVYRAIAEQLRARGFLRTLEQCRYRVKNLLRNYRKAKSSHPPGTCPFYE ::.::.:: :.:::.:: : .::::: :::: . :.: .:::.. .. : : ::: CCDS66 YGKLQTCQQNSQIYRAMAEGLWEQGFLRTPEQCRTKFKSLQLSYRKVRRGRVPEPCIFYE 280 290 300 310 320 330 350 360 370 380 390 400 pF1KE5 ELEALVRARTAIRATDGPGEAVALPRLGDSDAEMDEQEEGGWDPEEMAEDCNGAGLVNVE :..:: .. :. : . :.: :: : : .. . .: :. . .. . . CCDS66 EMNAL----SGSWASAPPMASDAVPGQEGSDIEAGELNHQNGEPTEVEDGTVDGADRDEK 340 350 360 370 380 410 420 430 pF1KE5 STQGP----RIAGAPALFQSRIGKNMGV . ..: : :.:: .:.: CCDS66 DFRNPGQEVRKLDLPVLFPNRLGFEFKNEIKKENLKWDDSEEVEINKALQRKSRGVYWHS 390 400 410 420 430 440 >>CCDS10095.2 ZSCAN29 gene_id:146050|Hs108|chr15 (852 aa) initn: 1038 init1: 574 opt: 725 Z-score: 737.7 bits: 146.6 E(32554): 9.1e-35 Smith-Waterman score: 1144; 46.7% identity (72.1% similar) in 409 aa overlap (41-432:8-412) 20 30 40 50 60 70 pF1KE5 ASPQPEPEELLIVKLEEDSWGSESKLWEKDRGSVSGPEASRQRFRQFQYRDAAGPHEAFS : . .. :. :::::.:.:...:::.:::: CCDS10 MMAKSALRENGTNSETFRQRFRRFHYQEVAGPREAFS 10 20 30 80 90 100 110 120 130 pF1KE5 QLWALCCRWLRPEIRLKEQILELLVLEQFLTILPREVQTWVQARHPESGEEAVALVEDWH ::: :::::::::.: ::::.::::::::::.:: :.:.::: . ::.:::::.:::: . CCDS10 QLWELCCRWLRPEVRTKEQIVELLVLEQFLTVLPGEIQNWVQEQCPENGEEAVTLVEDLE 40 50 60 70 80 90 140 150 160 170 180 pF1KE5 RE------TRTAGQSGLELHTEETRPLKTGEEAQSFQLQPVDPWPEGQSQKKGVKNTCPD :: . :.. .: :.. :. : :...: : . . :.: :.: .:. ... :: CCDS10 REPGRPRSSVTVSVKGQEVRLEKMTPPKSSQELLSVRQESVEPQPRGVPKKERARS--PD 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE5 L-PN-HLNAEVAPQPLKESGVPVSKPSNTSEKEQGPEFWGLSLINSGKRSTADYS-LDNE : :. ..: . .:...::.: : . .:. ::: : : .:..: .. . : . .. CCDS10 LGPQEQMNPKEKLKPFQRSGLPFPKSGVVSRLEQG-EPWIPDLLGSKEKELPSGSHIGDR 160 170 180 190 200 210 250 260 270 280 290 pF1KE5 PAQAL---TWRDSRAWEEQYQWDVEDMKVSGVHWGYEETKTFLAILSESPFSEKLRTCHQ ..: . .: :.: :: .:. :: ::.:::::::::.:.:::::.. : : ::.::. CCDS10 RVHADLLPSKKDRRSWVEQDHWSFEDEKVAGVHWGYEETRTLLAILSQTEFYEALRNCHR 220 230 240 250 260 270 300 310 320 330 340 350 pF1KE5 NRQVYRAIAEQLRARGFLRTLEQCRYRVKNLLRNYRKAKSSHPPGTCPFYEELEALVRAR : ::: :.::.:: :::::::::: . :.: ..:::.::.::: ::::.::.:::. :. CCDS10 NSQVYGAVAERLREYGFLRTLEQCRTKFKGLQKSYRKVKSGHPPETCPFFEEMEALMSAQ 280 290 300 310 320 330 360 370 380 390 400 410 pF1KE5 TAIRATDGPGEAVALPRLGDSDAEMDEQEEGGWDPEEMAEDC----NGAGLVNVEST-QG . ..: :.. : :::: .: . ::. :: ::. . . ...:.: : CCDS10 VIALPSNGLEAAASHSGLVGSDAETEEPGQRGWQHEEGAEEAVAQESDSDDMDLEATPQD 340 350 360 370 380 390 420 430 pF1KE5 PRIAGAPALFQSRIGKNMGV : : ::..:.: : . : CCDS10 PNSA-APVVFRSPGGVHWGYEETKTYLAILSETQFYEALRNCHRNSQLYGAVAERLWEYG 400 410 420 430 440 450 >-- initn: 471 init1: 403 opt: 443 Z-score: 451.2 bits: 93.6 E(32554): 8.2e-19 Smith-Waterman score: 443; 47.2% identity (69.8% similar) in 159 aa overlap (274-432:413-568) 250 260 270 280 290 300 pF1KE5 QALTWRDSRAWEEQYQWDVEDMKVSGVHWGYEETKTFLAILSESPFSEKLRTCHQNRQVY ::::::.::::::. : : ::.::.: :.: CCDS10 SDDMDLEATPQDPNSAAPVVFRSPGGVHWGYEETKTYLAILSETQFYEALRNCHRNSQLY 390 400 410 420 430 440 310 320 330 340 350 360 pF1KE5 RAIAEQLRARGFLRTLEQCRYRVKNLLRNYRKAKSSHPPGTCPFYEELEALVRARTAIRA :.::.: ::::: :::: . :.: .:::.:... : ::::.::..::: .:.: CCDS10 GAVAERLWEYGFLRTPEQCRTKFKSLQTSYRKVKNGQAPETCPFFEEMDALVSVRVAAPP 450 460 470 480 490 500 370 380 390 400 410 420 pF1KE5 TDGPGEAVALPRLGDSDAEMDEQEEGGWDPEEMAEDCNGAGLVNVESTQGPRIAGAPALF .:: :... : : :.:: ..: : . .: .:. . ..: : :. ::.:: CCDS10 NDGQEETASCPVQGTSEAEAQKQAE---EADEATEEDSDDDEEDTEIPPGAVITRAPVLF 510 520 530 540 550 430 pF1KE5 QSRIGKNMGV :: : . : CCDS10 QSPRGFEAGFENEDNSKRDISEEVQLHRTLLARSERKIPRYLHQGKGNESDCRSGRQWAK 560 570 580 590 600 610 >>CCDS10495.1 ZNF213 gene_id:7760|Hs108|chr16 (459 aa) initn: 576 init1: 459 opt: 562 Z-score: 576.0 bits: 115.8 E(32554): 9.2e-26 Smith-Waterman score: 602; 37.0% identity (57.1% similar) in 378 aa overlap (3-346:1-348) 10 20 30 40 50 60 pF1KE5 MAMALELQAQASPQPEPEELLIVKLEEDSWGSESKLWEKDRGSVSGPEASRQRFRQFQYR :: :.:: . : : :::::.:..:: .:: : : : :: ::::::: : CCDS10 MAAPLEAQDQAPGEGEGLLIVKVEDSSWEQESAQHEDGRDS----EACRQRFRQFCYG 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 DAAGPHEAFSQLWALCCRWLRPEIRLKEQILELLVLEQFLTILPREVQTWVQARHPESGE :. :::::::::: :::::::::.: :::::::::::::::.:: :.: ::. .:: ::: CCDS10 DVHGPHEAFSQLWELCCRWLRPELRTKEQILELLVLEQFLTVLPGEIQGWVREQHPGSGE 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE5 EAVALVEDWHRETRTAGQSGLELHTEETRPLKTGEEAQSFQLQPVDPWPEGQSQKKGVKN :::::::: ... : .. .. .::..: .:. .:. . : : .... CCDS10 EAVALVEDLQKQPVKAWRQ--DVPSEEAEPEAAGRGSQA-----TGPPPTVGARRR---- 120 130 140 150 160 190 200 210 pF1KE5 TCPDLPNHLNAEVAPQP--LKE------------SGV--PVSK---PSNTSEKEQG---P :..:.. ... : : ::: ::: ::. : :..: : : CCDS10 --PSVPQEQHSHSAQPPALLKEGRPGETTDTCFVSGVHGPVALGDIPFYFSREEWGTLDP 170 180 190 200 210 220 220 230 240 250 260 270 pF1KE5 E----FWGLSLINSGKRSTADYSLDNEPAQALTWRDSRAWEEQYQWDVEDMKVSGVHWGY :: .. :: . .: ..: .. ..: . . : :: : :. CCDS10 AQRDLFWDIKRENS-RNTTLGFGLKGQSEKSLLQEMVPVVPGQTGSDVT------VSWSP 230 240 250 260 270 280 290 300 310 320 pF1KE5 EETKTFL------AILSESPFSEKLRTCHQNRQVYRAIAEQLRARGFLRTLEQC--RYRV ::.... : :. ... : . :: ::. .. : :: :.: CCDS10 EEAEAWESENRPRAALGPVVGARRGRPPTRRRQFRDLAAEKPHSCG------QCGKRFRW 280 290 300 310 320 330 340 350 360 370 380 pF1KE5 KNLLRNYRKAKSSHPPGTCPFYEELEALVRARTAIRATDGPGEAVALPRLGDSDAEMDEQ . : ........ : :: CCDS10 GSDLARHQRTHTGEKPHKCPECDKSFRSSSDLVRHQGVHTGEKPFSCSECGKSFSRSAYL 330 340 350 360 370 380 >>CCDS32410.1 ZKSCAN2 gene_id:342357|Hs108|chr16 (967 aa) initn: 1264 init1: 452 opt: 542 Z-score: 551.0 bits: 112.2 E(32554): 2.3e-24 Smith-Waterman score: 878; 38.4% identity (59.6% similar) in 463 aa overlap (48-432:42-501) 20 30 40 50 60 70 pF1KE5 EELLIVKLEEDSWGSESKLWEKDRGSVSGPEASRQRFRQFQYRDAAGPHEAFSQLWALCC :. :. :::: :.:..:::::::.:: ::: CCDS32 PLEVEGCLIMKVEKDPEWASEPILEGSDSSETFRKCFRQFCYEDVTGPHEAFSKLWELCC 20 30 40 50 60 70 80 90 100 110 120 130 pF1KE5 RWLRPEIRLKEQILELLVLEQFLTILPREVQTWVQARHPESGEEAVALVEDWHRET-RTA :::.::.: :::::::::.::::::::...:.:.: . :.::::::::: ..:: : CCDS32 RWLKPEMRSKEQILELLVIEQFLTILPEKIQAWAQKQCPQSGEEAVALVVHLEKETGRLR 80 90 100 110 120 130 140 150 160 170 pF1KE5 GQSGLELHTEETRPLKTGEEAQSFQLQPVDPWPEGQSQKK-GV----------------- : . .: :. :: .. :. .:: . :. :.. :... : CCDS32 QQVSSPVHREKHSPLGAAWEVADFQPEQVETQPRAVSREEPGSLHSGHQEQLNRKRERRP 140 150 160 170 180 190 180 190 200 pF1KE5 --KNTCPD--LP------NHLNAEVA----P----QPLKES------------------- ::. :. .: : :. ::. : .:.:. CCDS32 LPKNARPSPWVPALADEWNTLDQEVTTTRLPAGSQEPVKDVHVARGFSYRKSVHQIPAQR 200 210 220 230 240 250 210 220 230 240 pF1KE5 -----------------GVPVSKPSNTSEKEQGPEFWGLSLINSGKRSTADYSLDNEPA- : :: .. .. :: : : :.: .:.::: . .: . CCDS32 DLYRDFRKENVGNVVSLGSAVSTSNKITRLEQRKEPWTLGLHSSNKRSILRSNYVKEKSV 260 270 280 290 300 310 250 260 270 280 290 300 pF1KE5 ---QALTWRDSRAWEEQYQWDVEDMKVSGVHWGYEETKTFLAILSESPFSEKLRTCHQNR :. . ...:.:: :: .:: :..::::.:::::::::::.:: : : :..: .: CCDS32 HAIQVPARSAGKTWREQQQWGLEDEKIAGVHWSYEETKTFLAILKESRFYETLQACPRNS 320 330 340 350 360 370 310 320 330 340 350 360 pF1KE5 QVYRAIAEQLRARGFLRTLEQCRYRVKNLLRNYRKAKSSHPPGTCPFYEELEALVRARTA ::: :.:: :: ::::: :::: . :.: ..:::....: : :.:...::. . CCDS32 QVYGAVAEWLRECGFLRTPEQCRTKFKSLQKSYRKVRNGHMLEPCAFFEDMDALLNPAAR 380 390 400 410 420 430 370 380 390 400 410 pF1KE5 IRATDGPGEAVALPRLGDSDAEMDEQEEGGW-DPEEMAEDCNGAGLVNVESTQGPRIAGA .:: : : . .::: . .. .:. . . :: ::: . ...: . .: :: CCDS32 APSTDKPKEMIPVPRL--KRIAISAKEHISLVEEEEAAEDSDD-DEIGIEFIRKSEIHGA 440 450 460 470 480 420 430 pF1KE5 PALFQSRIGKNMGV :.:::. : . : CCDS32 PVLFQNLSGVHWGYEETKTFLDILRETRFYEALQACHRKSKLYGAVAEQLRECGFLRTPE 490 500 510 520 530 540 >-- initn: 403 init1: 378 opt: 410 Z-score: 416.9 bits: 87.4 E(32554): 6.7e-17 Smith-Waterman score: 410; 42.7% identity (68.2% similar) in 157 aa overlap (274-425:502-658) 250 260 270 280 290 300 pF1KE5 QALTWRDSRAWEEQYQWDVEDMKVSGVHWGYEETKTFLAILSESPFSEKLRTCHQNRQVY :::::::: :: :. : : :..::.. ..: CCDS32 DDEIGIEFIRKSEIHGAPVLFQNLSGVHWGYEETKTFLDILRETRFYEALQACHRKSKLY 480 490 500 510 520 530 310 320 330 340 350 360 pF1KE5 RAIAEQLRARGFLRTLEQCRYRVKNLLRNYRKAKSSHPPGTCPFYEELEALVRARTAIRA :.::::: ::::: :::: . :.: ..:::.:..: .: ::.:..::. .:.. . CCDS32 GAVAEQLRECGFLRTPEQCRTKFKSLQKSYRKVKNGHVLESCAFYKEMDALINSRASAPS 540 550 560 570 580 590 370 380 390 400 410 pF1KE5 TDGPGEAVALPRLGDSDAEMDEQEEGGWDPEEMA-----EDCNGAGLVNVESTQGPRIAG . : :. . : . :.. :: ::.::: . :: . . . : .: :.. : CCDS32 PSTPEEVPSPSRQERGGIEVEPQEPTGWEPEETSQEAVIEDSCSERMSEEEIVQEPEFQG 600 610 620 630 640 650 420 430 pF1KE5 APALFQSRIGKNMGV :.:.:: CCDS32 PPGLLQSPNDFEIGSSIKEDPTQIVYKDMEQHRALIEKSKRVVSQSTDPSKYRKRECISG 660 670 680 690 700 710 >>CCDS10499.1 ZNF263 gene_id:10127|Hs108|chr16 (683 aa) initn: 504 init1: 404 opt: 511 Z-score: 521.7 bits: 106.3 E(32554): 9.7e-23 Smith-Waterman score: 512; 53.7% identity (72.6% similar) in 175 aa overlap (9-172:2-172) 10 20 30 40 50 pF1KE5 MAMALELQAQASPQPEPEELLIVKLEED-SWGSESKLWEKDRGSVSGPEASRQRFRQFQY :.. . : : ::::::::: .:..: : : : .::::. :::.:.. CCDS10 MASGPGSQEREGLLIVKLEEDCAWSQE--LPPPDPGP--SPEASHLRFRRFRF 10 20 30 40 60 70 80 90 100 110 pF1KE5 RDAAGPHEAFSQLWALCCRWLRPEIRLKEQILELLVLEQFLTILPREVQTWVQARHPESG ..::::.::.:.: :: :::::.: ::::::::::::::::::.:.:. :: ::::: CCDS10 QEAAGPREALSRLQELCHGWLRPEMRTKEQILELLVLEQFLTILPQEIQSRVQELHPESG 50 60 70 80 90 100 120 130 140 150 160 pF1KE5 EEAVALVEDWHRETRTAGQS------GLELHTEETRPLKTGEEAQSFQLQPVD----PWP ::::.:::: .:: :. : :. :: ::.:..:. ::.:.:.. : : CCDS10 EEAVTLVEDMQRELGRLRQQVTNHGRGTEVLLEEPLPLETARESPSFKLEPMETERSPGP 110 120 130 140 150 160 170 180 190 200 210 220 pF1KE5 EGQSQKKGVKNTCPDLPNHLNAEVAPQPLKESGVPVSKPSNTSEKEQGPEFWGLSLINSG . : CCDS10 RLQELLGPSPQRDPQAVKERALSAPWLSLFPPEGNMEDKEMTGPQLPESLEDVAMYISQE 170 180 190 200 210 220 >>CCDS76814.1 ZNF500 gene_id:26048|Hs108|chr16 (441 aa) initn: 486 init1: 439 opt: 507 Z-score: 520.4 bits: 105.4 E(32554): 1.2e-22 Smith-Waterman score: 507; 42.4% identity (62.9% similar) in 229 aa overlap (16-242:18-238) 10 20 30 40 50 pF1KE5 MAMALELQAQASPQPEPEELLIVKLEEDSWGSESKLWEKDRGSVSGPEASRQRFRQFQ : ::.::::.::: : : . : ::. :: :: : CCDS76 MATVPGLQPLPTLEQDLEQEEILIVKVEEDFCLEEEPSVETEDPS---PETFRQLFRLFC 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 YRDAAGPHEAFSQLWALCCRWLRPEIRLKEQILELLVLEQFLTILPREVQTWVQARHPES :...:::.::.:.:: :::::::::.: :::::::::::::::.:: :.:. :. ..::: CCDS76 YQEVAGPREALSRLWELCCRWLRPELRTKEQILELLVLEQFLTVLPGEIQARVREQQPES 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 GEEAVALVEDWHRETRTAGQSGLELHTEETRPLKTGEEAQSFQLQPVDPWPEGQSQKKGV :::::.::: .:. : : : :: ... :: : . : . .. :: : .. . CCDS76 GEEAVVLVEGLQRKPRKHRQRGSELLSDDEVPLGIGGQ---FLKHQAEAQPEDLSLEEEA 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE5 KNTCPDLPNHLNAEVAPQPL--KESGVPVSKPSNTSEKEQGPEFWGLSLINSGKRSTADY . . . : .:. . :: : : :. : . .: . . : . . ...: : CCDS76 RFSSQQPPAQLSHRPQRGPLLWPERGPPA--PRHQEMASASPFLSAWSQVPVNLEDVAVY 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE5 SLDNEPAQALTWRDSRAWEEQYQWDVEDMKVSGVHWGYEETKTFLAILSESPFSEKLRTC .:: CCDS76 LSGEEPRCMDPAQRDAPLENEGPGIQLEDGGDGREDAPLRMEWYRVLSARCQGPGHPLPG 240 250 260 270 280 290 >>CCDS32383.1 ZNF500 gene_id:26048|Hs108|chr16 (480 aa) initn: 486 init1: 439 opt: 507 Z-score: 519.9 bits: 105.4 E(32554): 1.2e-22 Smith-Waterman score: 507; 42.4% identity (62.9% similar) in 229 aa overlap (16-242:18-238) 10 20 30 40 50 pF1KE5 MAMALELQAQASPQPEPEELLIVKLEEDSWGSESKLWEKDRGSVSGPEASRQRFRQFQ : ::.::::.::: : : . : ::. :: :: : CCDS32 MATVPGLQPLPTLEQDLEQEEILIVKVEEDFCLEEEPSVETEDPS---PETFRQLFRLFC 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 YRDAAGPHEAFSQLWALCCRWLRPEIRLKEQILELLVLEQFLTILPREVQTWVQARHPES :...:::.::.:.:: :::::::::.: :::::::::::::::.:: :.:. :. ..::: CCDS32 YQEVAGPREALSRLWELCCRWLRPELRTKEQILELLVLEQFLTVLPGEIQARVREQQPES 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 GEEAVALVEDWHRETRTAGQSGLELHTEETRPLKTGEEAQSFQLQPVDPWPEGQSQKKGV :::::.::: .:. : : : :: ... :: : . : . .. :: : .. . CCDS32 GEEAVVLVEGLQRKPRKHRQRGSELLSDDEVPLGIGGQ---FLKHQAEAQPEDLSLEEEA 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE5 KNTCPDLPNHLNAEVAPQPL--KESGVPVSKPSNTSEKEQGPEFWGLSLINSGKRSTADY . . . : .:. . :: : : :. : . .: . . : . . ...: : CCDS32 RFSSQQPPAQLSHRPQRGPLLWPERGPPA--PRHQEMASASPFLSAWSQVPVNLEDVAVY 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE5 SLDNEPAQALTWRDSRAWEEQYQWDVEDMKVSGVHWGYEETKTFLAILSESPFSEKLRTC .:: CCDS32 LSGEEPRCMDPAQRDAPLENEGPGIQLEDGGDGREDAPLRMEWYRVLSARCQGPGHPLPG 240 250 260 270 280 290 >>CCDS4649.1 ZSCAN31 gene_id:64288|Hs108|chr6 (406 aa) initn: 472 init1: 411 opt: 481 Z-score: 494.5 bits: 100.5 E(32554): 3.2e-21 Smith-Waterman score: 487; 35.4% identity (59.1% similar) in 347 aa overlap (16-345:6-325) 10 20 30 40 50 pF1KE5 MAMALELQAQASPQPEPEELLIVKLEEDS-WGSESKLWEKDRGS-VSGPEASRQRFRQFQ : .: :::.::: : .:..: ::. :: ::::: :::: CCDS46 MASTEEQYDLKIVKVEEDPIWDQETHL----RGNNFSGQEASRQLFRQFC 10 20 30 40 60 70 80 90 100 110 pF1KE5 YRDAAGPHEAFSQLWALCCRWLRPEIRLKEQILELLVLEQFLTILPREVQTWVQARHPES :... ::.::.:.: :: .::::::. ::::::::::::::::::.:.:.::. .:::: CCDS46 YQETPGPREALSRLRELCHQWLRPEIHTKEQILELLVLEQFLTILPEELQAWVREHHPES 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE5 GEEAVALVEDWHRETRTAGQSG-------LELHTEETRPLKTGEEAQSFQLQPVDPWPEG ::::::.::: ..: :... :. :... ::. .: ..::::. CCDS46 GEEAVAVVEDLEQELSEPGNQAPDHEHGHSEVLLEDVEHLKVKQEPTDIQLQPM------ 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE5 QSQKKGVKNTCPDLPNHLNAEVAPQPLKESGVPVSKPSNTSEKEQGPEFWGLSLINSGKR .: . .. : .. ..: : :. .:: .: :. : : . .: CCDS46 VTQLR-YESFCLHQFQEQDGESIP----ENQELASKQEILKEMEH----LGDSKL---QR 170 180 190 200 240 250 260 270 280 290 pF1KE5 STADYSLDNEPAQALTWRDSRAWEEQYQWDVEDMKVSGVHWGYEETKTFLAILSESPFS- : :::.. .. :::.: :.: .. . . . : ::. . : . . CCDS46 ---DVSLDSKYRETCK-RDSKA-EKQQAHSTGERRHRCNECGKSFTKSSVLIEHQRIHTG 210 220 230 240 250 260 300 310 320 330 340 pF1KE5 EKLRTCHQNRQVY--RAIAEQLRARGFLRTLEQCR-----YRVKNLLRNYRKAKSSHPPG :: :.. ... :. .. : . ::. . ..: : .:. .... : CCDS46 EKPYECEECGKAFSRRSSLNEHRRSHTGEKPYQCKECGKAFSASNGLTRHRRIHTGEKPY 270 280 290 300 310 320 350 360 370 380 390 400 pF1KE5 TCPFYEELEALVRARTAIRATDGPGEAVALPRLGDSDAEMDEQEEGGWDPEEMAEDCNGA : CCDS46 ECKVCGKAFLLSSCLVQHQRIHTGEKRYQCRECGKAFIQNAGLFQHLRVHTGEKPYQCSQ 330 340 350 360 370 380 >>CCDS5667.1 ZKSCAN5 gene_id:23660|Hs108|chr7 (839 aa) initn: 409 init1: 409 opt: 477 Z-score: 485.9 bits: 100.0 E(32554): 9.6e-21 Smith-Waterman score: 477; 45.0% identity (65.8% similar) in 202 aa overlap (5-201:10-199) 10 20 30 40 50 pF1KE5 MAMALELQAQASPQPEPEELLIVKLEEDSWGSESKLWEKDRGSVSGPEASRQRFR ..:. : . : :.:.:::.:: :. : .. . . :. :::: CCDS56 MIMTESREVIDLDPPAETSQEQEDLFIVKVEE-----EDCTWMQEYNPPTF-ETFYQRFR 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 QFQYRDAAGPHEAFSQLWALCCRWLRPEIRLKEQILELLVLEQFLTILPREVQTWVQARH .:::..:.::.::.::: .:::.:::::.. ::::::::::::::::::.: : ::. .: CCDS56 HFQYHEASGPREALSQLRVLCCEWLRPELHTKEQILELLVLEQFLTILPEEFQPWVREHH 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 PESGEEAVALVEDWHRETRTAGQSGLELHTEETRPLKT-GEEAQSFQLQP--VDPWPEGQ :::::::::..:. .:: . :. . : . : : .: . .: :: :: CCDS56 PESGEEAVAVIENIQRELEERRQQIVACPDVLPRKMATPGAVQESCSPHPLTVDTQPEQA 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE5 SQKKGV--KNTCPDLPNHLNAEVAPQPLKESGVPVSKPSNTSEKEQGPEFWGLSLINSGK :: . .:. : : .: :::.: CCDS56 PQKPRLLEENALPVL------QVPSLPLKDSQELTASLLSTGSQKLVKIEEVADVAVSFI 180 190 200 210 220 240 250 260 270 280 290 pF1KE5 RSTADYSLDNEPAQALTWRDSRAWEEQYQWDVEDMKVSGVHWGYEETKTFLAILSESPFS CCDS56 LEEWGHLDQSQKSLYRDDRKENYGSITSMGYESRDNMELIVKQISDDSESHWVAPEHTER 230 240 250 260 270 280 433 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 01:54:29 2016 done: Tue Nov 8 01:54:29 2016 Total Scan time: 2.960 Total Display time: 0.060 Function used was FASTA [36.3.4 Apr, 2011]