FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE3520, 293 aa
  1>>>pF1KE3520 293 - 293 aa - 293 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.9712+/-0.000686; mu= 10.6928+/- 0.042
 mean_var=163.9746+/-34.347, 0's: 0 Z-trim(116.5): 171 B-trim: 1781 in 2/51
 Lambda= 0.100158
 statistics sampled from 16959 (17148) to 16959 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.832), E-opt: 0.2 (0.527), width: 16
 Scan time: 2.600

The best scores are:                                opt  bits E(32554)
CCDS41558.1 3 gene_id:159296|Hs108|chr10    ( 364) 1860 279.7 2.1e-75
CCDS4387.1 5 gene_id:1482|Hs108|chr5        ( 324)  724 115.5 5.1e-26
CCDS13145.1 2 gene_id:4821|Hs108|chr20      ( 273)  491  81.8 6.2e-16
CCDS42855.1 4 gene_id:644524|Hs108|chr20    ( 354)  474  79.5 4.1e-15
CCDS9660.1 8 gene_id:26257|Hs108|chr14      ( 239)  428  72.6 3.1e-13
CCDS3410.1 2 gene_id:579|Hs108|chr4         ( 333)  374  65.0 8.7e-11

>>CCDS41558.1 3 gene_id:159296|Hs108|chr10 (364 aa)
 initn: 1860 init1: 1860 opt: 1860 Z-score: 1467.1 bits: 279.7 E(32554): 2.1e-75
Smith-Waterman score: 1860; 100.0% identity (100.0% similar) in 272 aa overlap (1-272:1-272)

               10        20        30        40        50        60
pF1KE3 MMLPSPVTSTPFSVKDILNLEQQHQHFHGAHLQADLEHHFHSAPCMLAAAEGTQFSDGGE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 MMLPSPVTSTPFSVKDILNLEQQHQHFHGAHLQADLEHHFHSAPCMLAAAEGTQFSDGGE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE3 EDEEDEGEKLSYLNSLAAADGHGDSGLCPQGYVHTVLRDSCSEPKEHEEEPEVVRDRSQK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 EDEEDEGEKLSYLNSLAAADGHGDSGLCPQGYVHTVLRDSCSEPKEHEEEPEVVRDRSQK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE3 SCQLKKSLETAGDCKAAEESERPKPRSRRKPRVLFSQAQVFELERRFKQQRYLSAPEREH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 SCQLKKSLETAGDCKAAEESERPKPRSRRKPRVLFSQAQVFELERRFKQQRYLSAPEREH
              130
140 150 160 170 180 190 200 210 220 230 240 pF1KE3 LASSLKLTSTQVKIWFQNRRYKCKRQRQDKSLELGAHAPPPPPRRVAVPVLVRDGKPCVT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 LASSLKLTSTQVKIWFQNRRYKCKRQRQDKSLELGAHAPPPPPRRVAVPVLVRDGKPCVT 190 200 210 220 230 240 250 260 270 280 290 pF1KE3 PSAQAYGAPYSVGASAYSYNSFPAYGYGNSAAGPPRRPLPCSPPAARPEAAPL :::::::::::::::::::::::::::::::: CCDS41 PSAQAYGAPYSVGASAYSYNSFPAYGYGNSAAAAAAAAAAAAAAAAYSSSYGCAYPAGGG 250 260 270 280 290 300 CCDS41 GGGGGTSAATTAMQPACSAAGGGPFVNVSNLGGFGSGGSAQPLHQGTAAGAACAQGTLQG 310 320 330 340 350 360 >>CCDS4387.1 5 gene_id:1482|Hs108|chr5 (324 aa) initn: 611 init1: 491 opt: 724 Z-score: 580.6 bits: 115.5 E(32554): 5.1e-26 Smith-Waterman score: 837; 51.5% identity (69.1% similar) in 301 aa overlap (2-291:1-286) 10 20 30 40 50 pF1KE3 MMLPSP-VTSTPFSVKDILNLEQQHQHFHGA-HLQADLEHHFHSAPCMLAAAEGTQFSDG :.::: .: ::::::::::::::.. . .: .:.: :: . . ::::: . .. CCDS43 MFPSPALTPTPFSVKDILNLEQQQRSLAAAGELSARLEATLAPSSCMLAAFKPEAYA-- 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 GEEDEEDEGEKLSYLNSLAAADGHGDSGLCPQGYVHTVLRDSCSEPKEHEEEPEVVRD-R : : : : : : :.. : : . . :. . . :. ..: : CCDS43 GPEAAAP-G-----LPELRAELGRAPS---PAKCASAFPAAPAFYPRAYSD-PDPAKDPR 60 70 80 90 100 120 130 140 150 160 170 pF1KE3 SQKS--CQLKKSLETAGDCKAAEESERPKPRSRRKPRVLFSQAQVFELERRFKQQRYLSA ..:. : :.:..: . :...:::. : :::::::::::::.:::::::::::::: CCDS43 AEKKELCALQKAVEL--EKTEADNAERPRARRRRKPRVLFSQAQVYELERRFKQQRYLSA 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE3 PEREHLASSLKLTSTQVKIWFQNRRYKCKRQRQDKSLEL-GAHAPPPPP-RRVAVPVLVR :::..::: :::::::::::::::::::::::::..::: : ::::: ::.::::::: CCDS43 PERDQLASVLKLTSTQVKIWFQNRRYKCKRQRQDQTLELVGLPPPPPPPARRIAVPVLVR 170 180 190 200 210 220 240 250 260 270 280 pF1KE3 DGKPCVTPSAQAYGAPYSVGASAYSYNSFPAY-GYGNSAAGPPRR---PLPCSPPAARPE :::::. :: :. :.:: . 
:.::..::: :::..: .: : .: :.: CCDS43 DGKPCLGDSAP-YAPAYGVGLNPYGYNAYPAYPGYGGAACSPGYSCTAAYPAGPSPAQPA 230 240 250 260 270 280 290 pF1KE3 AAPL .: CCDS43 TAAANNNFVNFGVGDLNAVQSPGIPQSNSGVSTLHGIRAW 290 300 310 320 >>CCDS13145.1 2 gene_id:4821|Hs108|chr20 (273 aa) initn: 477 init1: 334 opt: 491 Z-score: 399.5 bits: 81.8 E(32554): 6.2e-16 Smith-Waterman score: 495; 40.5% identity (60.9% similar) in 284 aa overlap (8-278:6-260) 10 20 30 40 50 60 pF1KE3 MMLPSPVTSTPFSVKDILNLEQQHQHFHGAHLQADLEHHFHSAPCMLAAAEGTQFSDGGE :.: :::::::.: . . : : ..::: . . : CCDS13 MSLTNTKTGFSVKDILDLPDTN----------DEEG---------SVAEGPEEENEGP 10 20 30 70 80 90 100 110 pF1KE3 EDEEDEGE----KLSYLNSLAAADGHGDSGLCPQGYVHTVLRDSCSEPKEHEEEPEVVRD : . : :. ..:: . ::. : :.. . . .: .. . .. CCDS13 EPAKRAGPLGQGALDAVQSLPLKNPFYDSSDNP--YTRWL---ASTEGLQYSLHGLAAGA 40 50 60 70 80 90 120 130 140 150 160 170 pF1KE3 RSQKSCQLKKSLETAGDCKAAEESERP----KPRSRRKPRVLFSQAQVFELERRFKQQRY : : . :: : ..: . ...: : ..:: :::::.::..::::::.:::: CCDS13 PPQDSSS--KSPEPSADESPDNDKETPGGGGDAGKKRKRRVLFSKAQTYELERRFRQQRY 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE3 LSAPEREHLASSLKLTSTQVKIWFQNRRYKCKRQRQDKSLELGAHAPPPPPRRVAVPVLV ::::::::::: ..:: :::::::::.::: :: : .:..:. .: : :::::::::: CCDS13 LSAPEREHLASLIRLTPTQVKIWFQNHRYKMKRARAEKGMEV---TPLPSPRRVAVPVLV 160 170 180 190 200 240 250 260 270 280 pF1KE3 RDGKPCVTPSAQAYGAP-YSVGA--SAYSYNSFPAYGYGN--SAAGPPRRPLPCSPPAAR :::::: . .:: .: ...: :::: .:. . :. :.:. :. : CCDS13 RDGKPCHALKAQDLAAATFQAGIPFSAYSAQSLQHMQYNAQYSSASTPQYPTAHPLVQAQ 210 220 230 240 250 260 290 pF1KE3 PEAAPL CCDS13 QWTW 270 >>CCDS42855.1 4 gene_id:644524|Hs108|chr20 (354 aa) initn: 519 init1: 342 opt: 474 Z-score: 384.9 bits: 79.5 E(32554): 4.1e-15 Smith-Waterman score: 474; 51.5% identity (66.7% similar) in 171 aa overlap (129-291:174-336) 100 110 120 130 140 150 pF1KE3 DSCSEPKEHEEEPEVVRDRSQKSCQLKKSLETAGDCKAAEESERPKPRSRRKPRVLFSQA .. : .:: . 
: ::: ::::::: CCDS42 RYSSISRFMGPSAGVNVAGMGSLTGIADAAKSLGPLHAAAAAAAP----RRKRRVLFSQA 150 160 170 180 190 160 170 180 190 200 210 pF1KE3 QVFELERRFKQQRYLSAPEREHLASSLKLTSTQVKIWFQNRRYKCKRQRQDKSLE----- ::.:::::::::.:::::::::::: ..:: :::::::::.::: ::: .::. . CCDS42 QVYELERRFKQQKYLSAPEREHLASMIHLTPTQVKIWFQNHRYKMKRQAKDKAAQQLQQE 200 210 220 230 240 250 220 230 240 250 260 270 pF1KE3 --LGAHAPPPP-PRRVAVPVLVRDGKPCVTPSAQAYGAPYSVGASAYSYNSFPAYGYGNS :: :::: ::::::::::.::::: .. . .: ..: . . . :: . CCDS42 GGLGPPPPPPPSPRRVAVPVLVKDGKPC--QNGASTPTPGQAGPQPPAPT--PAPELEEL 260 270 280 290 300 310 280 290 pF1KE3 AAGPPRRPLPCSPPAARPEAAPL . .:: : . :: :: CCDS42 SPSPPALHGPGGGLAALDAAAGEYSGGVLGANLLYGRTW 320 330 340 350 >>CCDS9660.1 8 gene_id:26257|Hs108|chr14 (239 aa) initn: 390 init1: 293 opt: 428 Z-score: 351.1 bits: 72.6 E(32554): 3.1e-13 Smith-Waterman score: 446; 45.9% identity (63.9% similar) in 205 aa overlap (103-283:31-232) 80 90 100 110 120 pF1KE3 LNSLAAADGHGDSGLCPQGYVHTVLRDSCSEPKEHEEEPEVV---RDRSQKSCQLKKSLE ::. . .: .. .:.. . ..::: CCDS96 MATSGRLSFTVRSLLDLPEQDAQHLPRREPEPRAPQPDPCAAWLDSERGHYPSSDESSLE 10 20 30 40 50 60 130 140 150 160 170 180 pF1KE3 TAGDCKAAEESERP-KPRS----RRKPRVLFSQAQVFELERRFKQQRYLSAPEREHLASS :. .. . : :: .: : :.: :::::.::..::::::.:::::::::::.::: CCDS96 TSPPDSSQRPSARPASPGSDAEKRKKRRVLFSKAQTLELERRFRQQRYLSAPEREQLASL 70 80 90 100 110 120 190 200 210 220 230 pF1KE3 LKLTSTQVKIWFQNRRYKCKRQRQD---KSLELGA----HAPPPPPRRVAVPVLVRDGKP :.:: :::::::::.::: :: : .: .:.: :: : :::.::::::::.: CCDS96 LRLTPTQVKIWFQNHRYKLKRARAPGAAESPDLAASAELHAAPGLLRRVVVPVLVRDGQP 130 140 150 160 170 180 240 250 260 270 280 pF1KE3 CV--------TPSAQAY-GAPYSVGASAYSYNSFPAYGYGNSAAGPPRRPLPCSPPAARP : : .:: ::: :.: ..::.: :.. . 
: :: CCDS96 CGGGGGGEVGTAAAQEKCGAP---PAAACPLPGYPAFGPGSALGLFPAYQHLASPALVSW 190 200 210 220 230 290 pF1KE3 EAAPL CCDS96 NW >>CCDS3410.1 2 gene_id:579|Hs108|chr4 (333 aa) initn: 348 init1: 295 opt: 374 Z-score: 307.1 bits: 65.0 E(32554): 8.7e-11 Smith-Waterman score: 374; 48.3% identity (67.6% similar) in 145 aa overlap (131-272:190-328) 110 120 130 140 150 160 pF1KE3 CSEPKEHEEEPEVVRDRSQKSCQLKKSLETAGDCKAAEESERPKPRSRRKPRVLFSQAQV :: . :: ::::..:. :. ::.::: CCDS34 PRTEDDGVGPRGAHVSALCSGAGGGGGSGPAGVAEEEEEPAAPKPRKKRS-RAAFSHAQV 160 170 180 190 200 210 170 180 190 200 210 220 pF1KE3 FELERRFKQQRYLSAPEREHLASSLKLTSTQVKIWFQNRRYKCKRQRQDKSLELGAHAPP :::::::..:::::.::: ::.::::: ::::::::::::: ::... .: : CCDS34 FELERRFNHQRYLSGPERADLAASLKLTETQVKIWFQNRRYKTKRRQMAADLL----ASA 220 230 240 250 260 270 230 240 250 260 270 pF1KE3 PPPRRVAVPVLVRDGKPCVTPSAQAYGAPYSVGASA---YSYNSFPAYGYGNSAAGPPRR : ..::: ::::: . :. .. : . . : : .:... .. :: CCDS34 PAAKKVAVKVLVRDDQRQYLPG-EVLRPPSLLPLQPSYYYPYYCLPGWALSTCAAAAGTQ 280 290 300 310 320 330 280 290 pF1KE3 PLPCSPPAARPEAAPL 293 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 22:33:24 2016 done: Sat Nov 5 22:33:25 2016 Total Scan time: 2.600 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]