FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5293, 178 aa 1>>>pF1KE5293 178 - 178 aa - 178 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.4206+/-0.000591; mu= 10.6557+/- 0.036 mean_var=119.2741+/-23.486, 0's: 0 Z-trim(117.2): 78 B-trim: 6 in 1/50 Lambda= 0.117436 statistics sampled from 17785 (17863) to 17785 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.846), E-opt: 0.2 (0.549), width: 16 Scan time: 2.140 The best scores are: opt bits E(32554) CCDS12969.1 ZSCAN1 gene_id:284312|Hs108|chr19 ( 408) 862 155.5 3.5e-38 CCDS12971.1 ZSCAN18 gene_id:65982|Hs108|chr19 ( 510) 430 82.4 4.5e-16 CCDS46214.1 ZSCAN18 gene_id:65982|Hs108|chr19 ( 566) 430 82.5 4.9e-16 CCDS74476.1 ZNF274 gene_id:10782|Hs108|chr19 ( 548) 419 80.6 1.7e-15 CCDS74475.1 ZNF274 gene_id:10782|Hs108|chr19 ( 611) 419 80.6 1.9e-15 CCDS74474.1 ZNF274 gene_id:10782|Hs108|chr19 ( 621) 419 80.6 1.9e-15 CCDS74473.1 ZNF274 gene_id:10782|Hs108|chr19 ( 653) 419 80.7 2e-15 CCDS59427.1 MZF1 gene_id:7593|Hs108|chr19 ( 290) 409 78.7 3.5e-15 CCDS12988.1 MZF1 gene_id:7593|Hs108|chr19 ( 734) 409 79.0 7e-15 CCDS12975.1 ZSCAN22 gene_id:342945|Hs108|chr19 ( 491) 396 76.7 2.4e-14 CCDS35105.1 ZNF483 gene_id:158399|Hs108|chr9 ( 256) 386 74.7 4.7e-14 CCDS35106.1 ZNF483 gene_id:158399|Hs108|chr9 ( 744) 386 75.1 1.1e-13 CCDS66921.1 ZSCAN32 gene_id:54925|Hs108|chr16 ( 697) 376 73.4 3.3e-13 CCDS7775.1 ZNF215 gene_id:7762|Hs108|chr11 ( 517) 372 72.6 4.2e-13 CCDS4644.1 ZSCAN16 gene_id:80345|Hs108|chr6 ( 348) 359 70.3 1.4e-12 CCDS41300.1 ZSCAN20 gene_id:7579|Hs108|chr1 (1043) 366 71.8 1.4e-12 CCDS10495.1 ZNF213 gene_id:7760|Hs108|chr16 ( 459) 359 70.4 1.7e-12 CCDS10499.1 ZNF263 gene_id:10127|Hs108|chr16 ( 683) 361 70.8 1.9e-12 CCDS5681.1 ZSCAN21 gene_id:7589|Hs108|chr7 ( 473) 358 70.2 2e-12 CCDS11179.2 ZNF287 gene_id:57336|Hs108|chr17 ( 761) 360 70.7 2.3e-12 CCDS14649.1 ZNF449 gene_id:203523|Hs108|chrX ( 518) 354 69.6 3.4e-12 CCDS5671.2 ZSCAN25 gene_id:221785|Hs108|chr7 ( 544) 344 67.9 1.2e-11 CCDS1631.1 ZNF496 gene_id:84838|Hs108|chr1 ( 587) 344 67.9 1.2e-11 CCDS2713.1 ZNF445 gene_id:353274|Hs108|chr3 (1031) 345 68.3 1.7e-11 CCDS33743.1 ZNF197 gene_id:10168|Hs108|chr3 ( 267) 336 66.3 1.7e-11 CCDS2714.1 ZKSCAN7 gene_id:55888|Hs108|chr3 ( 276) 334 66.0 2.2e-11 CCDS82246.1 ZNF396 gene_id:252884|Hs108|chr18 ( 210) 332 65.5 2.3e-11 CCDS10095.2 ZSCAN29 gene_id:146050|Hs108|chr15 ( 852) 340 67.4 2.6e-11 CCDS11913.1 ZNF396 gene_id:252884|Hs108|chr18 ( 333) 332 65.7 3.3e-11 CCDS82247.1 ZNF396 gene_id:252884|Hs108|chr18 ( 335) 332 65.7 3.3e-11 CCDS77175.1 ZNF24 gene_id:7572|Hs108|chr18 ( 193) 326 64.5 4.4e-11 CCDS2717.1 ZNF197 gene_id:10168|Hs108|chr3 (1029) 336 66.8 4.8e-11 CCDS2715.1 ZKSCAN7 gene_id:55888|Hs108|chr3 ( 754) 334 66.3 4.8e-11 CCDS59426.1 ZNF444 gene_id:55311|Hs108|chr19 ( 326) 327 64.8 5.8e-11 CCDS12982.1 ZNF446 gene_id:55663|Hs108|chr19 ( 450) 329 65.3 5.8e-11 CCDS69349.1 ZKSCAN1 gene_id:7586|Hs108|chr7 ( 527) 330 65.5 5.8e-11 CCDS34698.1 ZKSCAN1 gene_id:7586|Hs108|chr7 ( 563) 330 65.5 6.1e-11 CCDS11068.1 ZNF232 gene_id:7775|Hs108|chr17 ( 444) 328 65.1 6.5e-11 CCDS5667.1 ZKSCAN5 gene_id:23660|Hs108|chr7 ( 839) 332 66.0 6.6e-11 CCDS11912.1 ZNF24 gene_id:7572|Hs108|chr18 ( 368) 326 64.7 7.1e-11 CCDS82043.1 ZNF232 gene_id:7775|Hs108|chr17 ( 435) 327 64.9 7.2e-11 CCDS5666.1 ZNF394 gene_id:84124|Hs108|chr7 ( 561) 328 65.2 7.7e-11 >>CCDS12969.1 ZSCAN1 gene_id:284312|Hs108|chr19 (408 aa) initn: 862 init1: 862 opt: 862 Z-score: 798.9 bits: 155.5 E(32554): 3.5e-38 Smith-Waterman score: 865; 77.0% identity (82.2% similar) in 174 aa overlap (1-159:1-171) 10 20 30 40 50 60 pF1KE5 MLPRPKAPASPRRPQTPTPSEQDADPGPASPRDTEAQRLRFRQFQYHVASGPHLALGQLW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 MLPRPKAPASPRRPQTPTPSEQDADPGPASPRDTEAQRLRFRQFQYHVASGPHLALGQLW 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 TLCRQWLRPEARSKEQMLELLVLEQFLGALPSKMRTWVQSQGPRSCREAASLVEDLTQMC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 TLCRQWLRPEARSKEQMLELLVLEQFLGALPSKMRTWVQSQGPRSCREAASLVEDLTQMC 70 80 90 100 110 120 130 140 150 160 pF1KE5 QQE---------------GERRRLRAPGRARGPDVSEDRTALLPAHIAPTPARPLPTSKP ::: ::.. ..: . : :. .: : .: .:: : CCDS12 QQEVLVSLDSVEPQDWSFGEEEDGKSPRSQKEP--SQASELILDA-VAAAPALPEESEWL 130 140 150 160 170 170 pF1KE5 LLPRGCVFNYFYI CCDS12 ETTQLQQSLHTRAEAEAPRAPGLLGSRARLPLKPSIWDEPEDLLAGPSSDLRAEGTVISS 180 190 200 210 220 230 >>CCDS12971.1 ZSCAN18 gene_id:65982|Hs108|chr19 (510 aa) initn: 431 init1: 386 opt: 430 Z-score: 402.0 bits: 82.4 E(32554): 4.5e-16 Smith-Waterman score: 458; 47.6% identity (64.2% similar) in 187 aa overlap (1-167:1-187) 10 20 30 40 pF1KE5 MLPRPKAPASPRR----PQTPTPSE----QDADPG--PA-SPRDTEAQRLRFRQFQYHVA ::: :: :::: :. :::. :. .: : .: : : .:::::.: :. : CCDS12 MLPLEKAFASPRSSPAPPDLPTPGSAAGVQQEEPETIPERTPADLEFSRLRFREFVYQEA 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE5 SGPHLALGQLWTLCRQWLRPEARSKEQMLELLVLEQFLGALPSKMRTWVQSQGPRSCREA .::: .:..: :::::: :::::::::::::::::::: ::.:.: :: .: :.::..: CCDS12 AGPHQTLARLHELCRQWLMPEARSKEQMLELLVLEQFLGILPDKVRPWVVAQYPESCKKA 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE5 ASLVEDLTQMCQQEGERRRLRAPGRARGPDVSEDRTA---LLPAHIA-PTPAR-----PL ::::: :... .. : : . . : .: :::...: :. : : CCDS12 ASLVEGLADVLEEPGMLLGSPAGSSSILSDGVYERHMDPLLLPGELASPSQALGAGEIPA 130 140 150 160 170 180 170 pF1KE5 PTSKPLLPRGCVFNYFYI :. : : CCDS12 PSETPWLSPDPLFLEQRRVREAKTEEDGPANTEQKLKSFPEDPQHLGEWGHLDPAEENLK 190 200 210 220 230 240 >>CCDS46214.1 ZSCAN18 gene_id:65982|Hs108|chr19 (566 aa) initn: 431 init1: 386 opt: 430 Z-score: 401.4 bits: 82.5 E(32554): 4.9e-16 Smith-Waterman score: 458; 47.6% identity (64.2% similar) in 187 aa overlap (1-167:57-243) 10 20 pF1KE5 MLPRPKAPASPRR----PQTPTPSE----Q ::: :: :::: :. :::. : CCDS46 TGHLVIPGTSPLEPPCPWLDSHIFQCRFGKMLPLEKAFASPRSSPAPPDLPTPGSAAGVQ 30 40 50 60 70 80 30 40 50 60 70 pF1KE5 DADPG--PA-SPRDTEAQRLRFRQFQYHVASGPHLALGQLWTLCRQWLRPEARSKEQMLE . .: : .: : : .:::::.: :. :.::: .:..: :::::: ::::::::::: CCDS46 QEEPETIPERTPADLEFSRLRFREFVYQEAAGPHQTLARLHELCRQWLMPEARSKEQMLE 90 100 110 120 130 140 80 90 100 110 120 130 pF1KE5 LLVLEQFLGALPSKMRTWVQSQGPRSCREAASLVEDLTQMCQQEGERRRLRAPGRARGPD ::::::::: ::.:.: :: .: :.::..:::::: :... .. : : . . : CCDS46 LLVLEQFLGILPDKVRPWVVAQYPESCKKAASLVEGLADVLEEPGMLLGSPAGSSSILSD 150 160 170 180 190 200 140 150 160 170 pF1KE5 VSEDRTA---LLPAHIA-PTPAR-----PLPTSKPLLPRGCVFNYFYI .: :::...: :. : : :. : : CCDS46 GVYERHMDPLLLPGELASPSQALGAGEIPAPSETPWLSPDPLFLEQRRVREAKTEEDGPA 210 220 230 240 250 260 CCDS46 NTEQKLKSFPEDPQHLGEWGHLDPAEENLKSYRKLLLWGYQLSQPDAASRLDTEELRLVE 270 280 290 300 310 320 >>CCDS74476.1 ZNF274 gene_id:10782|Hs108|chr19 (548 aa) initn: 411 init1: 389 opt: 419 Z-score: 391.5 bits: 80.6 E(32554): 1.7e-15 Smith-Waterman score: 419; 61.0% identity (80.0% similar) in 105 aa overlap (19-123:38-141) 10 20 30 40 pF1KE5 MLPRPKAPASPRRPQTPTPSEQDADPGPASPRDTEAQRLRFRQFQYHV :::: . : : : :: : :::::.:. CCDS74 VLTLNQEVAGPRNAQIQALYAEDGSLSADAPSEQVQQQGK-HPGDPEAARQRFRQFRYKD 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE5 ASGPHLALGQLWTLCRQWLRPEARSKEQMLELLVLEQFLGALPSKMRTWVQSQGPRSCRE .::. :: :: ::.:::.:.::::::.:::::::::::::: :.::::.:: :..:.: CCDS74 MTGPREALDQLRELCHQWLQPKARSKEQILELLVLEQFLGALPVKLRTWVESQHPENCQE 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE5 AASLVEDLTQMCQQEGERRRLRAPGRARGPDVSEDRTALLPAHIAPTPARPLPTSKPLLP ...::: .: : ..: CCDS74 VVALVEGVTWMSEEEVLPAGQPAEGTTCCLEVTAQQEEKQEDAAICPVTVLPEEPVTFQD 130 140 150 160 170 180 >>CCDS74475.1 ZNF274 gene_id:10782|Hs108|chr19 (611 aa) initn: 389 init1: 389 opt: 419 Z-score: 390.9 bits: 80.6 E(32554): 1.9e-15 Smith-Waterman score: 419; 61.0% identity (80.0% similar) in 105 aa overlap (19-123:101-204) 10 20 30 40 pF1KE5 MLPRPKAPASPRRPQTPTPSEQDADPGPASPRDTEAQRLRFRQFQYHV :::: . : : : :: : :::::.:. CCDS74 VLTLNQEVAGPRNAQIQALYAEDGSLSADAPSEQVQQQGK-HPGDPEAARQRFRQFRYKD 80 90 100 110 120 50 60 70 80 90 100 pF1KE5 ASGPHLALGQLWTLCRQWLRPEARSKEQMLELLVLEQFLGALPSKMRTWVQSQGPRSCRE .::. :: :: ::.:::.:.::::::.:::::::::::::: :.::::.:: :..:.: CCDS74 MTGPREALDQLRELCHQWLQPKARSKEQILELLVLEQFLGALPVKLRTWVESQHPENCQE 130 140 150 160 170 180 110 120 130 140 150 160 pF1KE5 AASLVEDLTQMCQQEGERRRLRAPGRARGPDVSEDRTALLPAHIAPTPARPLPTSKPLLP ...::: .: : ..: CCDS74 VVALVEGVTWMSEEEVLPAGQPAEGTTCCLEVTAQQEEKQEDAAICPVTVLPEEPVTFQD 190 200 210 220 230 240 >>CCDS74474.1 ZNF274 gene_id:10782|Hs108|chr19 (621 aa) initn: 389 init1: 389 opt: 419 Z-score: 390.8 bits: 80.6 E(32554): 1.9e-15 Smith-Waterman score: 419; 61.0% identity (80.0% similar) in 105 aa overlap (19-123:111-214) 10 20 30 40 pF1KE5 MLPRPKAPASPRRPQTPTPSEQDADPGPASPRDTEAQRLRFRQFQYHV :::: . : : : :: : :::::.:. CCDS74 VLTLNQEVAGPRNAQIQALYAEDGSLSADAPSEQVQQQGK-HPGDPEAARQRFRQFRYKD 90 100 110 120 130 50 60 70 80 90 100 pF1KE5 ASGPHLALGQLWTLCRQWLRPEARSKEQMLELLVLEQFLGALPSKMRTWVQSQGPRSCRE .::. :: :: ::.:::.:.::::::.:::::::::::::: :.::::.:: :..:.: CCDS74 MTGPREALDQLRELCHQWLQPKARSKEQILELLVLEQFLGALPVKLRTWVESQHPENCQE 140 150 160 170 180 190 110 120 130 140 150 160 pF1KE5 AASLVEDLTQMCQQEGERRRLRAPGRARGPDVSEDRTALLPAHIAPTPARPLPTSKPLLP ...::: .: : ..: CCDS74 VVALVEGVTWMSEEEVLPAGQPAEGTTCCLEVTAQQEEKQEDAAICPVTVLPEEPVTFQD 200 210 220 230 240 250 >>CCDS74473.1 ZNF274 gene_id:10782|Hs108|chr19 (653 aa) initn: 389 init1: 389 opt: 419 Z-score: 390.5 bits: 80.7 E(32554): 2e-15 Smith-Waterman score: 419; 61.0% identity (80.0% similar) in 105 aa overlap (19-123:143-246) 10 20 30 40 pF1KE5 MLPRPKAPASPRRPQTPTPSEQDADPGPASPRDTEAQRLRFRQFQYHV :::: . : : : :: : :::::.:. CCDS74 VLTLNQEVAGPRNAQIQALYAEDGSLSADAPSEQVQQQGK-HPGDPEAARQRFRQFRYKD 120 130 140 150 160 170 50 60 70 80 90 100 pF1KE5 ASGPHLALGQLWTLCRQWLRPEARSKEQMLELLVLEQFLGALPSKMRTWVQSQGPRSCRE .::. :: :: ::.:::.:.::::::.:::::::::::::: :.::::.:: :..:.: CCDS74 MTGPREALDQLRELCHQWLQPKARSKEQILELLVLEQFLGALPVKLRTWVESQHPENCQE 180 190 200 210 220 230 110 120 130 140 150 160 pF1KE5 AASLVEDLTQMCQQEGERRRLRAPGRARGPDVSEDRTALLPAHIAPTPARPLPTSKPLLP ...::: .: : ..: CCDS74 VVALVEGVTWMSEEEVLPAGQPAEGTTCCLEVTAQQEEKQEDAAICPVTVLPEEPVTFQD 240 250 260 270 280 290 >>CCDS59427.1 MZF1 gene_id:7593|Hs108|chr19 (290 aa) initn: 415 init1: 351 opt: 409 Z-score: 386.1 bits: 78.7 E(32554): 3.5e-15 Smith-Waterman score: 409; 45.9% identity (66.5% similar) in 170 aa overlap (4-165:2-165) 10 20 30 40 50 pF1KE5 MLPRPKAPASPRR--PQTPTPS----EQDADPGPASPRDT--EAQRLRFRQFQYHVASGP :: . .:: : :. : :.. . : :. : :: ::::: :.:. :.:: CCDS59 MRPAVLGSPDRAPPEDEGPVMVKLEDSEEEGEAALWDPGPEAARLRFRCFRYEEATGP 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 HLALGQLWTLCRQWLRPEARSKEQMLELLVLEQFLGALPSKMRTWVQSQGPRSCREAASL . ::.:: :::::::::.:::::::::::::::::::: .... ::.: : : .:::.: CCDS59 QEALAQLRELCRQWLRPEVRSKEQMLELLVLEQFLGALPPEIQARVQGQRPGSPEEAAAL 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 VEDLTQMCQQEGERRRLRAPGRARGPDVSEDRTALLPAHIAPTPARPLPTSKPLLPRGCV :. : . . : :: . . ...: .: .. . :. . : : :: .: CCDS59 VDGLRR--EPGGPRRWVTV--QVQGQEVLSEK--MEPSSFQPLPETEPPTPEPGPKTPPR 120 130 140 150 160 170 pF1KE5 FNYFYI CCDS59 TMQESPLGLQVKEESEVTEDSDFLESGPLAATQESVPTLLPEEAQRCGTVLDQIFPHSKT 180 190 200 210 220 230 >>CCDS12988.1 MZF1 gene_id:7593|Hs108|chr19 (734 aa) initn: 379 init1: 351 opt: 409 Z-score: 380.6 bits: 79.0 E(32554): 7e-15 Smith-Waterman score: 409; 45.9% identity (66.5% similar) in 170 aa overlap (4-165:2-165) 10 20 30 40 50 pF1KE5 MLPRPKAPASPRR--PQTPTPS----EQDADPGPASPRDT--EAQRLRFRQFQYHVASGP :: . .:: : :. : :.. . : :. : :: ::::: :.:. :.:: CCDS12 MRPAVLGSPDRAPPEDEGPVMVKLEDSEEEGEAALWDPGPEAARLRFRCFRYEEATGP 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 HLALGQLWTLCRQWLRPEARSKEQMLELLVLEQFLGALPSKMRTWVQSQGPRSCREAASL . ::.:: :::::::::.:::::::::::::::::::: .... ::.: : : .:::.: CCDS12 QEALAQLRELCRQWLRPEVRSKEQMLELLVLEQFLGALPPEIQARVQGQRPGSPEEAAAL 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 VEDLTQMCQQEGERRRLRAPGRARGPDVSEDRTALLPAHIAPTPARPLPTSKPLLPRGCV :. : . . : :: . . ...: .: .. . :. . : : :: .: CCDS12 VDGLRR--EPGGPRRWVTV--QVQGQEVLSEK--MEPSSFQPLPETEPPTPEPGPKTPPR 120 130 140 150 160 170 pF1KE5 FNYFYI CCDS12 TMQESPLGLQVKEESEVTEDSDFLESGPLAATQESVPTLLPEEAQRCGTVLDQIFPHSKT 180 190 200 210 220 230 >>CCDS12975.1 ZSCAN22 gene_id:342945|Hs108|chr19 (491 aa) initn: 389 init1: 389 opt: 396 Z-score: 371.1 bits: 76.7 E(32554): 2.4e-14 Smith-Waterman score: 398; 50.7% identity (73.1% similar) in 134 aa overlap (9-124:2-135) 10 20 30 40 pF1KE5 MLPRPKAPASPRRPQTPTPSEQDA--------------DPGPASPRD----TEAQRLRFR : :.. .:.: :.:. . : : .: .:: ::::: CCDS12 MAIPKHSLSPVPWEEDSFLQVKVEEEEEASLSQGGESSHDHIAHSEAARLRFR 10 20 30 40 50 50 60 70 80 90 100 pF1KE5 QFQYHVASGPHLALGQLWTLCRQWLRPEARSKEQMLELLVLEQFLGALPSKMRTWVQSQG .:.:. ::::: ::..: .:: :::.:::.::::.:::::::::::::: ....:: .:. CCDS12 HFRYEEASGPHEALAHLRALCCQWLQPEAHSKEQILELLVLEQFLGALPPEIQAWVGAQS 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE5 PRSCREAASLVEDLTQMCQQEGERRRLRAPGRARGPDVSEDRTALLPAHIAPTPARPLPT :.: .::: :::::::. ...: CCDS12 PKSGEEAAVLVEDLTQVLDKRGWDPGAEPTEASCKQSDLGESEPSNVTETLMGGVSLGPA 120 130 140 150 160 170 178 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 23:28:45 2016 done: Mon Nov 7 23:28:46 2016 Total Scan time: 2.140 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]