FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB8906, 239 aa 1>>>pF1KB8906 239 - 239 aa - 239 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.1309+/-0.000644; mu= 10.0987+/- 0.040 mean_var=179.9250+/-37.633, 0's: 0 Z-trim(117.7): 152 B-trim: 944 in 1/54 Lambda= 0.095616 statistics sampled from 18381 (18553) to 18381 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.847), E-opt: 0.2 (0.57), width: 16 Scan time: 2.860 The best scores are: opt bits E(32554) CCDS9660.1 8 gene_id:26257|Hs108|chr14 ( 239) 1647 237.9 4.6e-63 CCDS42855.1 4 gene_id:644524|Hs108|chr20 ( 354) 450 72.9 3.1e-13 CCDS13145.1 2 gene_id:4821|Hs108|chr20 ( 273) 436 70.9 9.8e-13 CCDS4387.1 5 gene_id:1482|Hs108|chr5 ( 324) 433 70.5 1.5e-12 CCDS41558.1 3 gene_id:159296|Hs108|chr10 ( 364) 424 69.3 3.8e-12 >>CCDS9660.1 8 gene_id:26257|Hs108|chr14 (239 aa) initn: 1647 init1: 1647 opt: 1647 Z-score: 1245.6 bits: 237.9 E(32554): 4.6e-63 Smith-Waterman score: 1647; 100.0% identity (100.0% similar) in 239 aa overlap (1-239:1-239) 10 20 30 40 50 60 pF1KB8 MATSGRLSFTVRSLLDLPEQDAQHLPRREPEPRAPQPDPCAAWLDSERGHYPSSDESSLE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS96 MATSGRLSFTVRSLLDLPEQDAQHLPRREPEPRAPQPDPCAAWLDSERGHYPSSDESSLE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 TSPPDSSQRPSARPASPGSDAEKRKKRRVLFSKAQTLELERRFRQQRYLSAPEREQLASL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS96 TSPPDSSQRPSARPASPGSDAEKRKKRRVLFSKAQTLELERRFRQQRYLSAPEREQLASL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 LRLTPTQVKIWFQNHRYKLKRARAPGAAESPDLAASAELHAAPGLLRRVVVPVLVRDGQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS96 LRLTPTQVKIWFQNHRYKLKRARAPGAAESPDLAASAELHAAPGLLRRVVVPVLVRDGQP 130 140 150 160 170 180 190 200 210 220 230 pF1KB8 CGGGGGGEVGTAAAQEKCGAPPAAACPLPGYPAFGPGSALGLFPAYQHLASPALVSWNW ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS96 CGGGGGGEVGTAAAQEKCGAPPAAACPLPGYPAFGPGSALGLFPAYQHLASPALVSWNW 190 200 210 220 230 >>CCDS42855.1 4 gene_id:644524|Hs108|chr20 (354 aa) initn: 449 init1: 338 opt: 450 Z-score: 351.1 bits: 72.9 E(32554): 3.1e-13 Smith-Waterman score: 450; 51.7% identity (73.2% similar) in 149 aa overlap (81-225:186-334) 60 70 80 90 100 110 pF1KB8 YPSSDESSLETSPPDSSQRPSARPASPGSDAEKRKKRRVLFSKAQTLELERRFRQQRYLS : :.:::::::.::. ::::::.::.::: CCDS42 AGVNVAGMGSLTGIADAAKSLGPLHAAAAAAAPRRKRRVLFSQAQVYELERRFKQQKYLS 160 170 180 190 200 210 120 130 140 150 160 pF1KB8 APEREQLASLLRLTPTQVKIWFQNHRYKLKR-ARAPGAAESPDLAASAELHAAPGLLRRV :::::.:::...::::::::::::::::.:: :. .: . . .. . : ::: CCDS42 APEREHLASMIHLTPTQVKIWFQNHRYKMKRQAKDKAAQQLQQEGGLGPPPPPPPSPRRV 220 230 240 250 260 270 170 180 190 200 210 220 pF1KB8 VVPVLVRDGQPCGGGGGGEV-GTAAAQEKCGAPPAAACPL-PGYPAF-GPGSALGLFPAY .:::::.::.:: .:.. . : :. : .: : :. ::. :::..:. . : CCDS42 AVPVLVKDGKPCQNGASTPTPGQAGPQPPAPTPAPELEELSPSPPALHGPGGGLAALDAA 280 290 300 310 320 330 230 pF1KB8 QHLASPALVSWNW CCDS42 AGEYSGGVLGANLLYGRTW 340 350 >>CCDS13145.1 2 gene_id:4821|Hs108|chr20 (273 aa) initn: 557 init1: 388 opt: 436 Z-score: 342.1 bits: 70.9 E(32554): 9.8e-13 Smith-Waterman score: 526; 45.8% identity (63.6% similar) in 236 aa overlap (21-239:54-273) 10 20 30 40 pF1KB8 MATSGRLSFTVRSLLDLPEQDA-QHLPRREPEPRAPQPDPCAAWLDSERG :: : :: ..: . .: . :: : .: CCDS13 EEGSVAEGPEEENEGPEPAKRAGPLGQGALDAVQSLPLKNPF-YDSSDNPYTRWLASTEG 30 40 50 60 70 80 50 60 70 80 90 pF1KB8 -HY---------PSSDESSLETSPPDSSQRPSARPASPGS--DAEKRKKRRVLFSKAQTL .: : .: :: .. :.... :. .::. :: :..::::::::::: CCDS13 LQYSLHGLAAGAPPQDSSS-KSPEPSADESPDNDKETPGGGGDAGKKRKRRVLFSKAQTY 90 100 110 120 130 140 100 110 120 130 140 150 pF1KB8 ELERRFRQQRYLSAPEREQLASLLRLTPTQVKIWFQNHRYKLKRARAPGAAESPDLAASA ::::::::::::::::::.::::.:::::::::::::::::.::::: . : : . CCDS13 ELERRFRQQRYLSAPEREHLASLIRLTPTQVKIWFQNHRYKMKRARAEKGMEVTPLPS-- 150 160 170 180 190 160 170 180 190 200 210 pF1KB8 ELHAAPGLLRRVVVPVLVRDGQPCGGGGGGEVGTAAAQEKCGAPPAA--ACPLPG--YPA : :::.::::::::.:: . . .. ::: . : : .: : : : : CCDS13 -----P---RRVAVPVLVRDGKPCHALKAQDL--AAATFQAGIPFSAYSAQSLQHMQYNA 200 210 220 230 240 220 230 pF1KB8 FGPGSALGLFPAYQHLASPALVSWNW ... .:. . :.. .:.: CCDS13 QYSSASTPQYPTAHPLVQAQ--QWTW 250 260 270 >>CCDS4387.1 5 gene_id:1482|Hs108|chr5 (324 aa) initn: 470 init1: 318 opt: 433 Z-score: 338.9 bits: 70.5 E(32554): 1.5e-12 Smith-Waterman score: 480; 41.0% identity (55.4% similar) in 249 aa overlap (18-238:59-289) 10 20 30 40 pF1KB8 MATSGRLSFTVRSLLDLPEQDAQHLPRREPE-PRAPQPDPCAAWLDS :: : ::. . : :::.: ::. . . CCDS43 AAGELSARLEATLAPSSCMLAAFKPEAYAGPEAAAPGLPELRAELGRAPSPAKCASAFPA 30 40 50 60 70 80 50 60 70 80 pF1KB8 ERGHYPS--SDESS-------------------LETSPPDSSQRPSARPASPGSDAEKRK . :: :: . :: . :...:: :: .:. CCDS43 APAFYPRAYSDPDPAKDPRAEKKELCALQKAVELEKTEADNAERPRAR---------RRR 90 100 110 120 130 90 100 110 120 130 140 pF1KB8 KRRVLFSKAQTLELERRFRQQRYLSAPEREQLASLLRLTPTQVKIWFQNHRYKLKRARAP : :::::.::. ::::::.::::::::::.::::.:.:: :::::::::.::: :: : CCDS43 KPRVLFSQAQVYELERRFKQQRYLSAPERDQLASVLKLTSTQVKIWFQNRRYKCKRQRQD 140 150 160 170 180 190 150 160 170 180 190 200 pF1KB8 GAAESPDLAASAELHAAPGLLRRVVVPVLVRDGQPCGGGGGGEVGTAAAQEKCGAPPAAA . : : : ::..::::::::.:: : . . : : : . CCDS43 QTLELVGLPPP-----PPPPARRIAVPVLVRDGKPCLG----DSAPYAPAYGVGLNPYGY 200 210 220 230 240 250 210 220 230 pF1KB8 CPLPGYPAFG-----PG-SALGLFPAYQHLASPALVSWNW :.::..: :: : . .:: :.:: .. : CCDS43 NAYPAYPGYGGAACSPGYSCTAAYPAGPSPAQPATAAANNNFVNFGVGDLNAVQSPGIPQ 260 270 280 290 300 310 CCDS43 SNSGVSTLHGIRAW 320 >>CCDS41558.1 3 gene_id:159296|Hs108|chr10 (364 aa) initn: 502 init1: 307 opt: 424 Z-score: 331.6 bits: 69.3 E(32554): 3.8e-12 Smith-Waterman score: 443; 47.4% identity (66.1% similar) in 192 aa overlap (31-219:103-270) 10 20 30 40 50 60 pF1KB8 MATSGRLSFTVRSLLDLPEQDAQHLPRREPEPRAPQPDPCAAWLDSERGHYPSSDESSLE ::. . .: .. .:.. . ..::: CCDS41 LNSLAAADGHGDSGLCPQGYVHTVLRDSCSEPKEHEEEPEVV---RDRSQKSCQLKKSLE 80 90 100 110 120 70 80 90 100 110 120 pF1KB8 TSPPDSSQRPSARPASPGSDAEKRKKRRVLFSKAQTLELERRFRQQRYLSAPEREQLASL :. .. . : :: .: : :.: :::::.::..::::::.:::::::::::.::: CCDS41 TAGDCKAAEESERP-KPRS----RRKPRVLFSQAQVFELERRFKQQRYLSAPEREHLASS 130 140 150 160 170 180 130 140 150 160 170 180 pF1KB8 LRLTPTQVKIWFQNHRYKLKRARAPGAAESPDLAASAELHAAPGLLRRVVVPVLVRDGQP :.:: :::::::::.::: :: : .: .:.: :: : :::.::::::::.: CCDS41 LKLTSTQVKIWFQNRRYKCKRQRQD---KSLELGA----HAPPPPPRRVAVPVLVRDGKP 190 200 210 220 230 190 200 210 220 230 pF1KB8 CGGGGGGEVGTAAAQEKCGAP---PAAACPLPGYPAFGPGSALGLFPAYQHLASPALVSW : : .:: ::: :.: ..::.: :.. CCDS41 CV--------TPSAQAY-GAPYSVGASAYSYNSFPAYGYGNSAAAAAAAAAAAAAAAAYS 240 250 260 270 280 pF1KB8 NW CCDS41 SSYGCAYPAGGGGGGGGTSAATTAMQPACSAAGGGPFVNVSNLGGFGSGGSAQPLHQGTA 290 300 310 320 330 340 239 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 04:37:18 2016 done: Tue Nov 8 04:37:18 2016 Total Scan time: 2.860 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]