FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1718, 224 aa 1>>>pF1KE1718 224 - 224 aa - 224 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4401+/-0.000908; mu= 12.2397+/- 0.054 mean_var=54.6415+/-11.305, 0's: 0 Z-trim(103.8): 21 B-trim: 0 in 0/48 Lambda= 0.173505 statistics sampled from 7588 (7602) to 7588 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.62), E-opt: 0.2 (0.234), width: 16 Scan time: 2.010 The best scores are: opt bits E(32554) CCDS30911.1 CRP gene_id:1401|Hs108|chr1 ( 224) 1502 384.1 3.9e-107 CCDS1186.1 APCS gene_id:325|Hs108|chr1 ( 223) 777 202.6 1.7e-52 CCDS32762.1 NPTX1 gene_id:4884|Hs108|chr17 ( 432) 304 84.3 1.4e-16 CCDS32362.1 PTX4 gene_id:390667|Hs108|chr16 ( 473) 289 80.5 2e-15 CCDS33647.1 NPTXR gene_id:23467|Hs108|chr22 ( 500) 281 78.5 8.4e-15 CCDS5657.1 NPTX2 gene_id:4885|Hs108|chr7 ( 431) 277 77.5 1.5e-14 CCDS48004.1 SVEP1 gene_id:79987|Hs108|chr9 (3571) 266 74.9 7.3e-13 CCDS3180.1 PTX3 gene_id:5806|Hs108|chr3 ( 381) 250 70.7 1.4e-12 >>CCDS30911.1 CRP gene_id:1401|Hs108|chr1 (224 aa) initn: 1502 init1: 1502 opt: 1502 Z-score: 2036.9 bits: 384.1 E(32554): 3.9e-107 Smith-Waterman score: 1502; 100.0% identity (100.0% similar) in 224 aa overlap (1-224:1-224) 10 20 30 40 50 60 pF1KE1 MEKLLCFLVLTSLSHAFGQTDMSRKAFVFPKESDTSYVSLKAPLTKPLKAFTVCLHFYTE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 MEKLLCFLVLTSLSHAFGQTDMSRKAFVFPKESDTSYVSLKAPLTKPLKAFTVCLHFYTE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 LSSTRGYSIFSYATKRQDNEILIFWSKDIGYSFTVGGSEILFEVPEVTVAPVHICTSWES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 LSSTRGYSIFSYATKRQDNEILIFWSKDIGYSFTVGGSEILFEVPEVTVAPVHICTSWES 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 ASGIVEFWVDGKPRVRKSLKKGYTVGAEASIILGQEQDSFGGNFEGSQSLVGDIGNVNMW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 ASGIVEFWVDGKPRVRKSLKKGYTVGAEASIILGQEQDSFGGNFEGSQSLVGDIGNVNMW 130 140 150 160 170 180 190 200 210 220 pF1KE1 DFVLSPDEINTIYLGGPFSPNVLNWRALKYEVQGEVFTKPQLWP :::::::::::::::::::::::::::::::::::::::::::: CCDS30 DFVLSPDEINTIYLGGPFSPNVLNWRALKYEVQGEVFTKPQLWP 190 200 210 220 >>CCDS1186.1 APCS gene_id:325|Hs108|chr1 (223 aa) initn: 769 init1: 582 opt: 777 Z-score: 1056.1 bits: 202.6 E(32554): 1.7e-52 Smith-Waterman score: 777; 51.3% identity (79.5% similar) in 224 aa overlap (1-223:1-222) 10 20 30 40 50 pF1KE1 MEKLLCFL-VLTSLSHAFGQTDMSRKAFVFPKESDTSYVSLKAPLTKPLKAFTVCLHFYT :.: : .. ::::: .::..::.: :.::::.:: :..:.: .:: :::. ::.:.. :. CCDS11 MNKPLLWISVLTSLLEAFAHTDLSGKVFVFPRESVTDHVNLITPLEKPLQNFTLCFRAYS 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 ELSSTRGYSIFSYATKRQDNEILIFWSKDIGYSFTVGGSEILFEVPEVTVAPVHICTSWE .:: :.::.::: :. .:::.:.. . ::. .: .. .: : ::::::.::: CCDS11 DLS--RAYSLFSYNTQGRDNELLVYKERVGEYSLYIGRHKVTSKVIEKFPAPVHICVSWE 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 SASGIVEFWVDGKPRVRKSLKKGYTVGAEASIILGQEQDSFGGNFEGSQSLVGDIGNVNM :.:::.:::..: : :.:.:..:: : :. .:.:::::::.::.:. :::.::.::.. : CCDS11 SSSGIAEFWINGTPLVKKGLRQGYFVEAQPKIVLGQEQDSYGGKFDRSQSFVGEIGDLYM 120 130 140 150 160 170 180 190 200 210 220 pF1KE1 WDFVLSPDEINTIYLGGPFSPNVLNWRALKYEVQGEVFTKPQLWP :: :: :..: . : : :. :.:.:.::.::..: :. :: .: CCDS11 WDSVLPPENILSAYQGTPLPANILDWQALNYEIRGYVIIKPLVWV 180 190 200 210 220 >>CCDS32762.1 NPTX1 gene_id:4884|Hs108|chr17 (432 aa) initn: 223 init1: 138 opt: 304 Z-score: 411.4 bits: 84.3 E(32554): 1.4e-16 Smith-Waterman score: 304; 29.5% identity (61.4% similar) in 207 aa overlap (18-214:218-418) 10 20 30 40 pF1KE1 MEKLLCFLVLTSLSHAFGQTDM---SRKAFVFPKESDTSYVSLKAPL :: : .. ..:: ... :...: : CCDS32 KGGPRNDTEERVKIETALTSLHQRISELEKGQKDNRPGDKFQLTFPLRTNYMYAKVKKSL 190 200 210 220 230 240 50 60 70 80 90 100 pF1KE1 TKPLKAFTVCLHFYTELSSTRGYSI-FSYATKRQDNE-ILIFWSKDIGYSFTVGGSEILF . . :::::. . . :.: : . ::::. : :: .:: :... . . .. . CCDS32 PE-MYAFTVCM--WLKSSATPGVGTPFSYAVPGQANELVLIEWGNN---PMEILINDKVA 250 260 270 280 290 300 110 120 130 140 150 pF1KE1 EVPEVTVAPV--HICTSWESASGIVEFWVDG-KPRVRKSLKKGYTVGAEASIILGQEQDS ..: : :::..: . .:. : . :: . ..: . . .. ..::::::. CCDS32 KLPFVINDGKWHHICVTWTTRDGVWEAYQDGTQGGSGENLAPYHPIKPQGVLVLGQEQDT 310 320 330 340 350 360 160 170 180 190 200 210 pF1KE1 FGGNFEGSQSLVGDIGNVNMWDFVLSPDEINTIYLGGP--FSPNVLNWRALKYEVQGEVF .::.:...:..::.... :.:: :.: :. .. . .: ::. : . :. : CCDS32 LGGGFDATQAFVGELAHFNIWDRKLTPGEVYNLATCSTKALSGNVIAWAESHIEIYGGAT 370 380 390 400 410 420 220 pF1KE1 TKPQLWP CCDS32 KWTFEACRQIN 430 >>CCDS32362.1 PTX4 gene_id:390667|Hs108|chr16 (473 aa) initn: 247 init1: 177 opt: 289 Z-score: 390.4 bits: 80.5 E(32554): 2e-15 Smith-Waterman score: 289; 30.4% identity (63.0% similar) in 181 aa overlap (27-198:268-443) 10 20 30 40 50 pF1KE1 MEKLLCFLVLTSLSHAFGQTDMSRKAFVFPKESDTSYVSLKAPLTKPLKAFTVCLH .:::. : . : :. .. :.:.. : CCDS32 RVLSGTAPKDPRQQAWSPQVPGEICGVGPTLVFPNASTRNVVFLSPGFVTALRALSFCS- 240 250 260 270 280 290 60 70 80 90 100 110 pF1KE1 FYTELSSTRGYSIFSYATKRQDNEILIFWSKDI---GYSFTVGGSEILFEVPEVTVAPV- ... .: : ...::::. .::. :.. ..: : : :. . :.: . CCDS32 -WVRTASGRLGTLLSYATEDNDNK-LVLHGRDSLLPGSIHFVIGDPAFRELPLQLLLDGQ 300 310 320 330 340 350 120 130 140 150 160 pF1KE1 --HICTSWESASGIVEFWVDGKPRVRKS---LKKGYTVGAEASIILGQEQDSFGGNFEGS :::. : :..: ..:. :. . ...:: . .:..::::::: ::.:..: CCDS32 WHHICVIWTSTQG--RYWLHVDRRLVATGSRFREGYEIPPGGSLVLGQEQDSVGGGFDSS 360 370 380 390 400 410 170 180 190 200 210 220 pF1KE1 QSLVGDIGNVNMWDFVLSPDEINTIYLGGPFSPNVLNWRALKYEVQGEVFTKPQLWP ...::..... .:: .: : :. .. .: : CCDS32 EAFVGSMSGLAIWDRALVPGEVANLAIGKEFPTGAILTLANAALAGGFVQGANCTCLERC 420 430 440 450 460 470 CCDS32 P >>CCDS33647.1 NPTXR gene_id:23467|Hs108|chr22 (500 aa) initn: 255 init1: 165 opt: 281 Z-score: 379.2 bits: 78.5 E(32554): 8.4e-15 Smith-Waterman score: 281; 27.6% identity (59.6% similar) in 203 aa overlap (14-214:285-484) 10 20 30 40 pF1KE1 MEKLLCFLVLTSLSHAFGQTDMSRKAFVFPKESDTSYVSLKAP : :.. : . . .: ... :. .. CCDS33 ALSHSSRRQRQEVEKELDVLQGRVAELEHGSSAYSPPDAFKIS--IPIRNNYMYARVRKA 260 270 280 290 300 310 50 60 70 80 90 100 pF1KE1 LTKPLKAFTVCLHFYTELSSTRGYSIFSYATKRQDNEILIFWSKDIGYSFTVGGSEILFE : . : :::.:. . .. :.: . :::.. : :::... . . . .. . . CCDS33 LPE-LYAFTACMWLRSRSSGTGQGTPFSYSVPGQANEIVLLEAGHEPMELLINDKVAQLP 320 330 340 350 360 370 110 120 130 140 150 160 pF1KE1 VPEVTVAPVHICTSWESASGIVEFWVDGKPRVR-KSLKKGYTVGAEASIILGQEQDSFGG . . ::: .: . .:. . ::. . ..: . . .. .:::::::..:: CCDS33 LSLKDNGWHHICIAWTTRDGLWSAYQDGELQGSGENLAAWHPIKPHGILILGQEQDTLGG 380 390 400 410 420 430 170 180 190 200 210 220 pF1KE1 NFEGSQSLVGDIGNVNMWDFVLSPDEINTIY-LGGPFSPNVLNWRALKYEVQGEVFTKPQ :...:..::::.. :.:: .:.: .. : .:. ::: :. :. : CCDS33 RFDATQAFVGDIAQFNLWDHALTPAQVLGIANCTAPLLGNVLPWEDKLVEAFGGATKAAF 440 450 460 470 480 490 pF1KE1 LWP CCDS33 DVCKGRAKA 500 >>CCDS5657.1 NPTX2 gene_id:4885|Hs108|chr7 (431 aa) initn: 244 init1: 149 opt: 277 Z-score: 374.9 bits: 77.5 E(32554): 1.5e-14 Smith-Waterman score: 277; 30.4% identity (57.9% similar) in 214 aa overlap (16-224:218-420) 10 20 30 40 pF1KE1 MEKLLCFLVLTSLSHAFGQTDMSRKAFVFPKESDTSYVSLKAPLT :: . : . . .: ... : ..: : CCDS56 HNETSAHRQKTESTLNALLQRVTELERGNSAFKSPDAFKVS--LPLRTNYLYGKIKKTLP 190 200 210 220 230 240 50 60 70 80 90 100 pF1KE1 KPLKAFTVCLHFYTELSSTRGYSIFSYATKRQDNEI-LIFWSKDIGYSFTVGGSEILFEV . : :::.:: . . : : . ::::. : ::: :: :... . . .. . .. CCDS56 E-LYAFTICLWLRSSASPGIG-TPFSYAVPGQANEIVLIEWGNN---PIELLINDKVAQL 250 260 270 280 290 300 110 120 130 140 150 160 pF1KE1 PEVTVAPV--HICTSWESASGIVEFWVDG-KPRVRKSLKKGYTVGAEASIILGQEQDSFG : . :::..: . .:. : . :: : . ..: . . . .:::::::. : CCDS56 PLFVSDGKWHHICVTWTTRDGMWEAFQDGEKLGTGENLAPWHPIKPGGVLILGQEQDTVG 310 320 330 340 350 360 170 180 190 200 210 220 pF1KE1 GNFEGSQSLVGDIGNVNMWDFVLSPDEINTIYLGGPFSP-NVLNWRALKYEVQGEVFTKP : :...:..::.... :.:: :: .:: .: . : :.. : . . .:: CCDS56 GRFDATQAFVGELSQFNIWDRVLRAQEIVNIANCSTNMPGNIIPW----VDNNVDVFGGA 370 380 390 400 410 pF1KE1 QLWP . :: CCDS56 SKWPVETCEERLLDL 420 430 >>CCDS48004.1 SVEP1 gene_id:79987|Hs108|chr9 (3571 aa) initn: 211 init1: 147 opt: 266 Z-score: 344.4 bits: 74.9 E(32554): 7.3e-13 Smith-Waterman score: 266; 31.7% identity (64.0% similar) in 186 aa overlap (37-216:1438-1619) 10 20 30 40 50 60 pF1KE1 FLVLTSLSHAFGQTDMSRKAFVFPKESDTSYVSLKAPLTKPLKAFTVCLHFYTELSSTRG :: : . : . :.:.: : :. . :. . CCDS48 KCQPGFSGKRCETEQSTGFNLDFEVSGIYGYVMLDGMLPS-LHALT-CT-FWMKSSDDMN 1410 1420 1430 1440 1450 1460 70 80 90 100 110 120 pF1KE1 YSI-FSYATKRQDNEILIFWSKDIGYSFTVGGSEILFEVPEVTVAPVH-ICTSWESASGI :. .:::. ... :.. . . :. . :.: : . . : :. . : : .: ::.:: CCDS48 YGTPISYAVDNGSDNTLLLTDYN-GWVLYVNGREKITNCPSVNDGRWHHIAITWTSANGI 1470 1480 1490 1500 1510 1520 130 140 150 160 170 180 pF1KE1 VEFWVDGK-PRVRKSLKKGYTVGAEASIILGQEQDSFGGNFEGSQSLVGDIGNVNMWDFV . ..::: .:. : . . ....::::::. : .: ..:.::.:...:.::.: CCDS48 WKVYIDGKLSDGGAGLSVGLPIPGGGALVLGQEQDKKGEGFSPAESFVGSISQLNLWDYV 1530 1540 1550 1560 1570 1580 190 200 210 220 pF1KE1 LSPDEINTIYLGGPF---SPNVLNWRALKYEVQGEVFTKPQLWP :::...... . : . ::: : . . :.: CCDS48 LSPQQVKSLATSCPEELSKGNVLAWPDFLSGIVGKVKIDSKSIFCSDCPRLGGSVPHLRT 1590 1600 1610 1620 1630 1640 CCDS48 ASEDLKPGSKVNLFCDPGFQLVGNPVQYCLNQGQWTQPLPHCERISCGVPPPLENGFHSA 1650 1660 1670 1680 1690 1700 >>CCDS3180.1 PTX3 gene_id:5806|Hs108|chr3 (381 aa) initn: 125 init1: 83 opt: 250 Z-score: 339.3 bits: 70.7 E(32554): 1.4e-12 Smith-Waterman score: 250; 28.5% identity (60.0% similar) in 200 aa overlap (26-214:182-375) 10 20 30 40 50 pF1KE1 MEKLLCFLVLTSLSHAFGQTDMSRKAFVFPKESDTSYVSLKAPLTKPLKAFTVCL :..:: .: . :.. :..:..:. CCDS31 VLEELRQTRADLHAVQGWAARSWLPAGCETAILFPMRSKKIFGSVHPVRPMRLESFSACI 160 170 180 190 200 210 60 70 80 90 100 110 pF1KE1 HFYTELSSTRGYSI-FSYATKRQDNEILIFWSKDIGYSFTVGGSEILFEVPEVTVAP--- ... ... . .: :::.:::. :: .. : . . :.::: : . : :. :. CCDS31 --WVKATDVLNKTILFSYGTKRNPYEIQLYLSYQ-SIVFVVGGEENKL-VAEAMVSLGRW 220 230 240 250 260 120 130 140 150 160 pF1KE1 VHICTSWESASGIVEFWVDGK-PRVRKSLKKGYTVGAEASIILGQEQDS--FGGNFEGSQ .:.: .:.: :.. .::.:. . . :. : . . .:::... ::.:. . CCDS31 THLCGTWNSEEGLTSLWVNGELAATTVEMATGHIVPEGGILQIGQEKNGCCVGGGFDETL 270 280 290 300 310 320 170 180 190 200 210 220 pF1KE1 SLVGDIGNVNMWDFVLSPDEINTIYLGGPFS----PNVLNWRALKYEVQGEVFTKPQLWP .. : . . :.:: ::: .:: :: : :...: . . . .: CCDS31 AFSGRLTGFNIWDSVLSNEEIRE--TGGAESCHIRGNIVGWGVTEIQPHGGAQYVS 330 340 350 360 370 380 224 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 23:25:22 2016 done: Mon Nov 7 23:25:22 2016 Total Scan time: 2.010 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]