FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1855, 373 aa 1>>>pF1KE1855 373 - 373 aa - 373 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.6465+/-0.000869; mu= 14.3941+/- 0.052 mean_var=62.7630+/-12.517, 0's: 0 Z-trim(106.1): 11 B-trim: 15 in 1/50 Lambda= 0.161891 statistics sampled from 8804 (8809) to 8804 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.65), E-opt: 0.2 (0.271), width: 16 Scan time: 2.790 The best scores are: opt bits E(32554) CCDS30871.1 PGLYRP4 gene_id:57115|Hs108|chr1 ( 373) 2547 603.5 9.6e-173 CCDS1035.1 PGLYRP3 gene_id:114771|Hs108|chr1 ( 341) 1581 377.9 7.3e-105 CCDS12680.1 PGLYRP1 gene_id:8993|Hs108|chr19 ( 196) 552 137.5 9.8e-33 CCDS12330.2 PGLYRP2 gene_id:114770|Hs108|chr19 ( 576) 288 76.0 9.6e-14 >>CCDS30871.1 PGLYRP4 gene_id:57115|Hs108|chr1 (373 aa) initn: 2547 init1: 2547 opt: 2547 Z-score: 3214.8 bits: 603.5 E(32554): 9.6e-173 Smith-Waterman score: 2547; 99.5% identity (99.7% similar) in 373 aa overlap (1-373:1-373) 10 20 30 40 50 60 pF1KE1 MLPWLLVFSALGLQAWGDSSWNKTQAKQVSEGLQYLFENISQLTEKGLPTDVSTTVSRKA ::::::::::::.::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 MLPWLLVFSALGIQAWGDSSWNKTQAKQVSEGLQYLFENISQLTEKGLPTDVSTTVSRKA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 WGAEAVGCSIQLTTPVNVLVIHHVPGLECHDQTVCSQRLRELQAHHVHNNSGCDVAYNFL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 WGAEAVGCSIQLTTPVNVLVIHHVPGLECHDQTVCSQRLRELQAHHVHNNSGCDVAYNFL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 VGDDGRVYEGVGWNIQGVHTQGYNNISLGFAFFGTKKGHSPSPAALSAMENLITYAVQKG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 VGDDGRVYEGVGWNIQGVHTQGYNNISLGFAFFGTKKGHSPSPAALSAMENLITYAVQKG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 HLSSSYVQPLLVKGENCLAPRQKTSLKKACPGVVPRSVWGARETHCPRMTLPAKYGIIIH ::::::::::: :::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 HLSSSYVQPLLGKGENCLAPRQKTSLKKACPGVVPRSVWGARETHCPRMTLPAKYGIIIH 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 TAGRTCNISDECRLLVRDIQSFYIDRLKSCDIGYNFLVGQDGAIYEGVGWNVQGSSTPGY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 TAGRTCNISDECRLLVRDIQSFYIDRLKSCDIGYNFLVGQDGAIYEGVGWNVQGSSTPGY 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 DDIALGITFMGTFTGIPPNAAALEAAQDLIQCAMVKGYLTPNYLLVGHSDVARTLSPGQA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 DDIALGITFMGTFTGIPPNAAALEAAQDLIQCAMVKGYLTPNYLLVGHSDVARTLSPGQA 310 320 330 340 350 360 370 pF1KE1 LYNIISTWPHFKH ::::::::::::: CCDS30 LYNIISTWPHFKH 370 >>CCDS1035.1 PGLYRP3 gene_id:114771|Hs108|chr1 (341 aa) initn: 1656 init1: 1581 opt: 1581 Z-score: 1996.1 bits: 377.9 E(32554): 7.3e-105 Smith-Waterman score: 1592; 62.1% identity (78.8% similar) in 372 aa overlap (2-373:4-341) 10 20 30 40 50 pF1KE1 MLPWLLVFSALGLQAWGDSSWNKTQAKQVSEGLQYLFENISQLTEKGLPTDVSTTVSR :::::.: :::::: :. : ::: CCDS10 MGTLPWLLAFFILGLQAW----------------------------------DTPTIVSR 10 20 60 70 80 90 100 110 pF1KE1 KAWGAEAVGCSIQLTTPVNVLVIHHVPGLECHDQTVCSQRLRELQAHHVHNNSGCDVAYN : :::. ..: :: :: .. ..::..:..:.:::: :: ::.: :.. . :::::: CCDS10 KEWGARPLACRALLTLPVAYIITDQLPGMQCQQQSVCSQMLRGLQSHSVYTIGWCDVAYN 30 40 50 60 70 80 120 130 140 150 160 170 pF1KE1 FLVGDDGRVYEGVGWNIQGVHTQGYNNISLGFAFFGTKKGHSPSPAALSAMENLITYAVQ :::::::::::::::::::.:::::::::::.::::.: : ::::::::: :.::.::.: CCDS10 FLVGDDGRVYEGVGWNIQGLHTQGYNNISLGIAFFGNKIGSSPSPAALSAAEGLISYAIQ 90 100 110 120 130 140 180 190 200 210 220 230 pF1KE1 KGHLSSSYVQPLLVKGENCLAPRQKTSLKKACPGVVPRSVWGARETHCPRMTLPAKYGII ::::: :.::::.: :.:: :.. . .:.::... ::.: :::::::.:.::::: :: CCDS10 KGHLSPRYIQPLLLKEETCLDPQHPVMPRKVCPNIIKRSAWEARETHCPKMNLPAKYVII 150 160 170 180 190 200 240 250 260 270 280 290 pF1KE1 IHTAGRTCNISDECRLLVRDIQSFYIDRLKSCDIGYNFLVGQDGAIYEGVGWNVQGSSTP ::::: .:..: .:. .::.::::..: . :::::.:::::::..::::::..::: : CCDS10 IHTAGTSCTVSTDCQTVVRNIQSFHMDTRNFCDIGYHFLVGQDGGVYEGVGWHIQGSHTY 210 220 230 240 250 260 300 310 320 330 340 350 pF1KE1 GYDDIALGITFMGTFTGIPPNAAALEAAQDLIQCAMVKGYLTPNYLLVGHSDVARTLSPG :..::::::.:.: :. :::::::::::::::::.:.:::::::::.:::::. :::: CCDS10 GFNDIALGIAFIGYFVEKPPNAAALEAAQDLIQCAVVEGYLTPNYLLMGHSDVVNILSPG 270 280 290 300 310 320 360 370 pF1KE1 QALYNIISTWPHFKH ::::::::::::::: CCDS10 QALYNIISTWPHFKH 330 340 >>CCDS12680.1 PGLYRP1 gene_id:8993|Hs108|chr19 (196 aa) initn: 483 init1: 253 opt: 552 Z-score: 701.2 bits: 137.5 E(32554): 9.8e-33 Smith-Waterman score: 552; 41.2% identity (69.5% similar) in 177 aa overlap (198-372:18-194) 170 180 190 200 210 220 pF1KE1 AMENLITYAVQKGHLSSSYVQPLLVKGENCLAPRQKTSLKKACPGVVPRSVWGARETHCP :. :.: : .:::. : : ..: CCDS12 MSRRSMLLAWALPSLLRLGAAQETEDPACCSPIVPRNEWKALASECA 10 20 30 40 230 240 250 260 270 280 pF1KE1 R-MTLPAKYGIIIHTAGRTCNISDECRLLVRDIQSFYIDRLKSCDIGYNFLVGQDGAIYE . ..:: .: .. :::: .:: :. .:..: ... : ::.:::::.:.:: .:: CCDS12 QHLSLPLRYVVVSHTAGSSCNTPASCQQQARNVQHYHMKTLGWCDVGYNFLIGEDGLVYE 50 60 70 80 90 100 290 300 310 320 330 340 pF1KE1 GVGWNVQGSSTPG-YDDIALGITFMGTFTGIPPNAAALEAAQDLIQCAMVKGYLTPNYLL : ::: :. . .. ...::.:::.. :. :..::: :. :....: : ::.: CCDS12 GRGWNFTGAHSGHLWNPMSIGISFMGNYMDRVPTPQAIRAAQGLLACGVAQGALRSNYVL 110 120 130 140 150 160 350 360 370 pF1KE1 VGHSDVARTLSPGQALYNIISTWPHFKH :: :: ::::::. ::..:..:::.. CCDS12 KGHRDVQRTLSPGNQLYHLIQNWPHYRSP 170 180 190 >>CCDS12330.2 PGLYRP2 gene_id:114770|Hs108|chr19 (576 aa) initn: 436 init1: 236 opt: 288 Z-score: 360.3 bits: 76.0 E(32554): 9.6e-14 Smith-Waterman score: 411; 37.7% identity (67.7% similar) in 167 aa overlap (210-371:379-545) 180 190 200 210 220 230 pF1KE1 GHLSSSYVQPLLVKGENCLAPRQKTSLKKACPGVVPRSVWGARETHC-PRMT-LPAKYGI ::.. :: ::: . :.. :: . CCDS12 VHLQLQCMSQEQLAQVAANATKEFTEAFLGCPAIHPRCRWGAAPYRGRPKLLQLPLGFLY 350 360 370 380 390 400 240 250 260 270 280 290 pF1KE1 IIHT--AGRTCNISDECRLLVRDIQSFYIDRLKSCDIGYNFLVGQDGAIYEGVGWNVQGS . :: . :. .: .:..: .. : ::::.:.::.:: .::: ::. :. CCDS12 VHHTYVPAPPCTDFTRCAANMRSMQRYHQDTQGWGDIGYSFVVGSDGYVYEGRGWHWVGA 410 420 430 440 450 460 300 310 320 330 340 350 pF1KE1 STPGYDDIALGITFMGTFTGIPPNAAALEAAQD-LIQCAMVKGYLTPNYLLVGHSDVART : :... ..:....:..:. :. :::....: : .::. : : :.: :.:: ...:: CCDS12 HTLGHNSRGFGVAIVGNYTAALPTEAALRTVRDTLPSCAVRAGLLRPDYALLGHRQLVRT 470 480 490 500 510 520 360 370 pF1KE1 LSPGQALYNIISTWPHFKH ::.::.... ::::: CCDS12 DCPGDALFDLLRTWPHFTATVKPRPARSVSKRSRREPPPRTLPATDLQ 530 540 550 560 570 373 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 12:41:10 2016 done: Sun Nov 6 12:41:11 2016 Total Scan time: 2.790 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]