FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1541, 475 aa
  1>>>pF1KE1541 475 - 475 aa - 475 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 9.8637+/-0.000992; mu= 1.2593+/- 0.061
 mean_var=290.5501+/-57.699, 0's: 0 Z-trim(114.8): 5 B-trim: 0 in 0/53
 Lambda= 0.075243
 statistics sampled from 15330 (15334) to 15330 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.754), E-opt: 0.2 (0.471), width: 16
 Scan time: 3.080

The best scores are:                                 opt  bits  E(32554)
CCDS41309.1 CAP1 gene_id:10487|Hs108|chr1   ( 475)  3070 346.2  4.4e-95
CCDS81304.1 CAP1 gene_id:10487|Hs108|chr1   ( 474)  3053 344.4  1.6e-94
CCDS4539.1 CAP2 gene_id:10486|Hs108|chr6    ( 477)  2013 231.5  1.5e-60

>>CCDS41309.1 CAP1 gene_id:10487|Hs108|chr1 (475 aa)
 initn: 3070 init1: 3070 opt: 3070 Z-score: 1820.5 bits: 346.2 E(32554): 4.4e-95
Smith-Waterman score: 3070; 98.7% identity (98.9% similar) in 475 aa overlap (1-475:1-475)

       10 20 30 40 50 60
pF1KE1 MADMQNLVERLERAVGRLEAVSHTSDMHRGYADSPSKAGAAPYVQAFDSLLAGPVAEYLK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 MADMQNLVERLERAVGRLEAVSHTSDMHRGYADSPSKAGAAPYVQAFDSLLAGPVAEYLK
       10 20 30 40 50 60

       70 80 90 100 110 120
pF1KE1 ISKEIGGDVQKHAEMVHTGLKLERALLVTASQCQQPAENKLSDLLAPISEQIKEVITFRE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 ISKEIGGDVQKHAEMVHTGLKLERALLVTASQCQQPAENKLSDLLAPISEQIKEVITFRE
       70 80 90 100 110 120

       130 140 150 160 170 180
pF1KE1 KNRGSKLFNHLSAVSESIQALGWVAMAPKPGPYVKEMNDAAMFYTNRVLKEYKDVDKKHV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 KNRGSKLFNHLSAVSESIQALGWVAMAPKPGPYVKEMNDAAMFYTNRVLKEYKDVDKKHV
       130 140 150 160 170 180

       190 200 210 220 230 240
pF1KE1 DWVKAYLSIWTELQAYIKEFHTTGLAWSKTGPVAKELSGLPSGPSAGSGPPPPPPGPPPP
       :::::::::::::::::::::::::::::::::::::::::::::::: :::::: ::::
CCDS41 DWVKAYLSIWTELQAYIKEFHTTGLAWSKTGPVAKELSGLPSGPSAGSCPPPPPPCPPPP
       190 200 210 220 230 240

       250 260 270 280 290 300
pF1KE1 PVSTSSGSDESASRSALFAQINQGESITHALKHVSDDMKTHKNPALKAQSGPVRSGPKPF
       :::: : : ::::::.::::::::::::::::::::::::::::::::::::::::::::
CCDS41 PVSTISCSYESASRSSLFAQINQGESITHALKHVSDDMKTHKNPALKAQSGPVRSGPKPF
       250 260 270 280 290 300

       310 320 330 340 350 360
pF1KE1 SAPKPQTSPSPKRATKKEPAVLELEGKKWRVENQENVSNLVIEDTELKQVAYIYKCVNTT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 SAPKPQTSPSPKRATKKEPAVLELEGKKWRVENQENVSNLVIEDTELKQVAYIYKCVNTT
       310 320 330 340 350 360

       370 380 390 400 410 420
pF1KE1 LQIKGKINSITVDNCKKLGLVFDDVVGIVEIINSKDVKVQVMGKVPTISINKTDGCHAYL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 LQIKGKINSITVDNCKKLGLVFDDVVGIVEIINSKDVKVQVMGKVPTISINKTDGCHAYL
       370 380 390 400 410 420

       430 440 450 460 470
pF1KE1 SKNSLDCEIVSAKSSEMNVLIPTEGGDFNEFPVPEQFKTLWNGQKLVTTVTEIAG
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 SKNSLDCEIVSAKSSEMNVLIPTEGGDFNEFPVPEQFKTLWNGQKLVTTVTEIAG
       430 440 450 460 470

>>CCDS81304.1 CAP1 gene_id:10487|Hs108|chr1 (474 aa)
 initn: 2825 init1: 2825 opt: 3053 Z-score: 1810.6 bits: 344.4 E(32554): 1.6e-94
Smith-Waterman score: 3053; 98.5% identity (98.7% similar) in 475 aa overlap (1-475:1-474)

       10 20 30 40 50 60
pF1KE1 MADMQNLVERLERAVGRLEAVSHTSDMHRGYADSPSKAGAAPYVQAFDSLLAGPVAEYLK
       ::::::::::::::::::::::::::::::::::::: ::::::::::::::::::::::
CCDS81 MADMQNLVERLERAVGRLEAVSHTSDMHRGYADSPSK-GAAPYVQAFDSLLAGPVAEYLK
       10 20 30 40 50

       70 80 90 100 110 120
pF1KE1 ISKEIGGDVQKHAEMVHTGLKLERALLVTASQCQQPAENKLSDLLAPISEQIKEVITFRE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 ISKEIGGDVQKHAEMVHTGLKLERALLVTASQCQQPAENKLSDLLAPISEQIKEVITFRE
       60 70 80 90 100 110

       130 140 150 160 170 180
pF1KE1 KNRGSKLFNHLSAVSESIQALGWVAMAPKPGPYVKEMNDAAMFYTNRVLKEYKDVDKKHV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 KNRGSKLFNHLSAVSESIQALGWVAMAPKPGPYVKEMNDAAMFYTNRVLKEYKDVDKKHV
       120 130 140 150 160 170

       190 200 210 220 230 240
pF1KE1 DWVKAYLSIWTELQAYIKEFHTTGLAWSKTGPVAKELSGLPSGPSAGSGPPPPPPGPPPP
       :::::::::::::::::::::::::::::::::::::::::::::::: :::::: ::::
CCDS81 DWVKAYLSIWTELQAYIKEFHTTGLAWSKTGPVAKELSGLPSGPSAGSCPPPPPPCPPPP
       180 190 200 210 220 230

       250 260 270 280 290 300
pF1KE1 PVSTSSGSDESASRSALFAQINQGESITHALKHVSDDMKTHKNPALKAQSGPVRSGPKPF
       :::: : : ::::::.::::::::::::::::::::::::::::::::::::::::::::
CCDS81 PVSTISCSYESASRSSLFAQINQGESITHALKHVSDDMKTHKNPALKAQSGPVRSGPKPF
       240 250 260 270 280 290

       310 320 330 340 350 360
pF1KE1 SAPKPQTSPSPKRATKKEPAVLELEGKKWRVENQENVSNLVIEDTELKQVAYIYKCVNTT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 SAPKPQTSPSPKRATKKEPAVLELEGKKWRVENQENVSNLVIEDTELKQVAYIYKCVNTT
       300 310 320 330 340 350

       370 380 390 400 410 420
pF1KE1 LQIKGKINSITVDNCKKLGLVFDDVVGIVEIINSKDVKVQVMGKVPTISINKTDGCHAYL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 LQIKGKINSITVDNCKKLGLVFDDVVGIVEIINSKDVKVQVMGKVPTISINKTDGCHAYL
       360 370 380 390 400 410

       430 440 450 460 470
pF1KE1 SKNSLDCEIVSAKSSEMNVLIPTEGGDFNEFPVPEQFKTLWNGQKLVTTVTEIAG
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 SKNSLDCEIVSAKSSEMNVLIPTEGGDFNEFPVPEQFKTLWNGQKLVTTVTEIAG
       420 430 440 450 460 470

>>CCDS4539.1 CAP2 gene_id:10486|Hs108|chr6 (477 aa)
 initn: 2084 init1: 934 opt: 2013 Z-score: 1200.4 bits: 231.5 E(32554): 1.5e-60
Smith-Waterman score: 2013; 64.6% identity (84.4% similar) in 480 aa overlap (1-473:1-475)

       10 20 30 40 50
pF1KE1 MADMQNLVERLERAVGRLEAVSHTSDMHRGYADSPSK--AGAAPYVQAFDSLLAGPVAEY
       ::.::.:::::::::.:::..: : : . ::.:: :.:::.:. . :::.
CCDS45 MANMQGLVERLERAVSRLESLSAESHRPPGNCGEVNGVIAGVAPSVEAFDKLMDSMVAEF
       10 20 30 40 50 60

       60 70 80 90 100 110
pF1KE1 LKISKEIGGDVQKHAEMVHTGLKLERALLVTASQCQQPAENKLSDLLAPISEQIKEVITF
       :: :. ..:::. ::::::.... .::.:. ::: ::: :: .. :: ::::.:.:. ::
CCDS45 LKNSRILAGDVETHAEMVHSAFQAQRAFLLMASQYQQPHENDVAALLKPISEKIQEIQTF
       70 80 90 100 110 120

       120 130 140 150 160 170
pF1KE1 REKNRGSKLFNHLSAVSESIQALGWVAMAPKPGPYVKEMNDAAMFYTNRVLKEYKDVDKK
       ::.::::..::::::::::: ::::.:..:::::::::::::: ::::::::.:: : .
CCDS45 RERNRGSNMFNHLSAVSESIPALGWIAVSPKPGPYVKEMNDAATFYTNRVLKDYKHSDLR
       130 140 150 160 170 180

       180 190 200 210 220 230
pF1KE1 HVDWVKAYLSIWTELQAYIKEFHTTGLAWSKTGPVAKELSGLPSGPSAGSG-PPPPPPGP
       ::::::.::.::.:::::::: :::::.::::::::. .:.. : :.: : :::::: :
CCDS45 HVDWVKSYLNIWSELQAYIKEHHTTGLTWSKTGPVASTVSAF-SVLSSGPGLPPPPPPLP
       190 200 210 220 230

       240 250 260 270 280 290
pF1KE1 PP--PPVSTSSGSDE--SASRSALFAQINQGESITHALKHVSDDMKTHKNPALKAQSGPV
       :: ::. . :. : : ::::::::.::::.::..:.::.::.::.:::.:.::.: .
CCDS45 PPGPPPLFENEGKKEESSPSRSALFAQLNQGEAITKGLRHVTDDQKTYKNPSLRAQGGQT
       240 250 260 270 280 290

       300 310 320 330 340 350
pF1KE1 RSGPKPFSAPKPQTSPSPKRATKKEPAVLELEGKKWRVENQENVSNLVIEDTELKQVAYI
       .: : .:.: :::. . :. : :::::::::::: ::. ..::: .:::::::::
CCDS45 QS-PTKSHTPSP-TSPKSYPSQKHAP-VLELEGKKWRVEYQEDRNDLVISETELKQVAYI
       300 310 320 330 340 350

       360 370 380 390 400 410
pF1KE1 YKCVNTTLQIKGKINSITVDNCKKLGLVFDDVVGIVEIINSKDVKVQVMGKVPTISINKT
       .:: ..:.:::::.::: .:::::::::::.::::::.:::.:...::::.:::::::::
CCDS45 FKCEKSTIQIKGKVNSIIIDNCKKLGLVFDNVVGIVEVINSQDIQIQVMGRVPTISINKT
       360 370 380 390 400 410

       420 430 440 450 460 470
pF1KE1 DGCHAYLSKNSLDCEIVSAKSSEMNVLIPTEGGDFNEFPVPEQFKTLWNGQKLVTTVTEI
       .::: :::...::::::::::::::.::: . ::. :::.:::::: :.:.::.: .::
CCDS45 EGCHIYLSEDALDCEIVSAKSSEMNILIP-QDGDYREFPIPEQFKTAWDGSKLITEPAEI
       420 430 440 450 460 470

pF1KE1 AG
CCDS45 MA

475 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 23:00:05 2016 done: Sun Nov 6 23:00:05 2016
 Total Scan time: 3.080 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]