FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1682, 196 aa 1>>>pF1KE1682 196 - 196 aa - 196 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.8931+/-0.000693; mu= 15.8326+/- 0.042 mean_var=67.3948+/-13.647, 0's: 0 Z-trim(110.0): 11 B-trim: 600 in 1/51 Lambda= 0.156229 statistics sampled from 11262 (11267) to 11262 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.73), E-opt: 0.2 (0.346), width: 16 Scan time: 2.040 The best scores are: opt bits E(32554) CCDS12680.1 PGLYRP1 gene_id:8993|Hs108|chr19 ( 196) 1391 321.7 1.8e-88 CCDS30871.1 PGLYRP4 gene_id:57115|Hs108|chr1 ( 373) 552 132.8 2.4e-31 CCDS12330.2 PGLYRP2 gene_id:114770|Hs108|chr19 ( 576) 470 114.5 1.2e-25 CCDS1035.1 PGLYRP3 gene_id:114771|Hs108|chr1 ( 341) 420 103.1 2.1e-22 >>CCDS12680.1 PGLYRP1 gene_id:8993|Hs108|chr19 (196 aa) initn: 1391 init1: 1391 opt: 1391 Z-score: 1701.9 bits: 321.7 E(32554): 1.8e-88 Smith-Waterman score: 1391; 100.0% identity (100.0% similar) in 196 aa overlap (1-196:1-196) 10 20 30 40 50 60 pF1KE1 MSRRSMLLAWALPSLLRLGAAQETEDPACCSPIVPRNEWKALASECAQHLSLPLRYVVVS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 MSRRSMLLAWALPSLLRLGAAQETEDPACCSPIVPRNEWKALASECAQHLSLPLRYVVVS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 HTAGSSCNTPASCQQQARNVQHYHMKTLGWCDVGYNFLIGEDGLVYEGRGWNFTGAHSGH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 HTAGSSCNTPASCQQQARNVQHYHMKTLGWCDVGYNFLIGEDGLVYEGRGWNFTGAHSGH 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 LWNPMSIGISFMGNYMDRVPTPQAIRAAQGLLACGVAQGALRSNYVLKGHRDVQRTLSPG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 LWNPMSIGISFMGNYMDRVPTPQAIRAAQGLLACGVAQGALRSNYVLKGHRDVQRTLSPG 130 140 150 160 170 180 190 pF1KE1 NQLYHLIQNWPHYRSP :::::::::::::::: CCDS12 NQLYHLIQNWPHYRSP 190 >>CCDS30871.1 PGLYRP4 gene_id:57115|Hs108|chr1 (373 aa) initn: 483 init1: 253 opt: 552 Z-score: 676.1 bits: 132.8 E(32554): 2.4e-31 Smith-Waterman score: 552; 41.8% identity (69.5% similar) in 177 aa overlap (18-194:198-372) 10 20 30 40 pF1KE1 MSRRSMLLAWALPSLLRLGAAQETEDPACCSPIVPRNEWKALASECA :. :.: : .:::. : : ..: CCDS30 AMENLITYAVQKGHLSSSYVQPLLGKGENCLAPRQKTSLKKACPGVVPRSVWGARETHCP 170 180 190 200 210 220 50 60 70 80 90 100 pF1KE1 QHLSLPLRYVVVSHTAGSSCNTPASCQQQARNVQHYHMKTLGWCDVGYNFLIGEDGLVYE . ..:: .: .. :::: .:: :. .:..: ... : ::.:::::.:.:: .:: CCDS30 R-MTLPAKYGIIIHTAGRTCNISDECRLLVRDIQSFYIDRLKSCDIGYNFLVGQDGAIYE 230 240 250 260 270 280 110 120 130 140 150 160 pF1KE1 GRGWNFTGAHSGHLWNPMSIGISFMGNYMDRVPTPQAIRAAQGLLACGVAQGALRSNYVL : ::: :. : .. ...::.:::.. :. :..::: :. :....: : ::.: CCDS30 GVGWNVQGS-STPGYDDIALGITFMGTFTGIPPNAAALEAAQDLIQCAMVKGYLTPNYLL 290 300 310 320 330 340 170 180 190 pF1KE1 KGHRDVQRTLSPGNQLYHLIQNWPHYRSP :: :: ::::::. ::..:..:::.. CCDS30 VGHSDVARTLSPGQALYNIISTWPHFKH 350 360 370 >-- initn: 190 init1: 154 opt: 376 Z-score: 461.7 bits: 93.2 E(32554): 2.1e-19 Smith-Waterman score: 376; 38.5% identity (68.9% similar) in 148 aa overlap (21-166:41-187) 10 20 30 40 pF1KE1 MSRRSMLLAWALPSLLRLGAAQETED--PACCSPIVPRNEWKALASECAQ .: :: :. : : :. : : : :. CCDS30 LGIQAWGDSSWNKTQAKQVSEGLQYLFENISQLTEKGLPTDVSTTVSRKAWGAEAVGCSI 20 30 40 50 60 70 50 60 70 80 90 100 pF1KE1 HLSLPLRYVVVSHTAGSSCNTPASCQQQARNVQHYHMKTLGWCDVGYNFLIGEDGLVYEG .:. :. .:. :. : :. . :.:. :..: .:... . :::.::::.:.:: :::: CCDS30 QLTTPVNVLVIHHVPGLECHDQTVCSQRLRELQAHHVHNNSGCDVAYNFLVGDDGRVYEG 80 90 100 110 120 130 110 120 130 140 150 160 pF1KE1 RGWNFTGAHSGHLWNPMSIGISFMGNYMDRVPTPQAIRAAQGLLACGVAQGALRSNYVLK :::. :.:. . .: .:.:..:.:. . :.: :. : ..:.. .: .: : :.:: CCDS30 VGWNIQGVHT-QGYNNISLGFAFFGTKKGHSPSPAALSAMENLITYAVQKGHLSSSYVQP 140 150 160 170 180 170 180 190 pF1KE1 GHRDVQRTLSPGNQLYHLIQNWPHYRSP CCDS30 LLGKGENCLAPRQKTSLKKACPGVVPRSVWGARETHCPRMTLPAKYGIIIHTAGRTCNIS 190 200 210 220 230 240 >>CCDS12330.2 PGLYRP2 gene_id:114770|Hs108|chr19 (576 aa) initn: 379 init1: 312 opt: 470 Z-score: 573.6 bits: 114.5 E(32554): 1.2e-25 Smith-Waterman score: 470; 41.1% identity (67.2% similar) in 180 aa overlap (20-193:368-545) 10 20 30 40 pF1KE1 MSRRSMLLAWALPSLLRLGAAQE-TEDPACCSPIVPRNEWKALASEC-A :..: :: : : :: .: : . CCDS12 GTLVLLQRLEPVHLQLQCMSQEQLAQVAANATKEFTEAFLGCPAIHPRCRWGAAPYRGRP 340 350 360 370 380 390 50 60 70 80 90 100 pF1KE1 QHLSLPLRYVVVSHT--AGSSCNTPASCQQQARNVQHYHMKTLGWCDVGYNFLIGEDGLV . :.::: .. : :: . :. . : . :..:.::. : :: :.::.:..: :: : CCDS12 KLLQLPLGFLYVHHTYVPAPPCTDFTRCAANMRSMQRYHQDTQGWGDIGYSFVVGSDGYV 400 410 420 430 440 450 110 120 130 140 150 160 pF1KE1 YEGRGWNFTGAHS-GHLWNPMSIGISFMGNYMDRVPTPQAIRAAQGLL-ACGVAQGALRS ::::::...:::. :: : ..:....::: .:: :.:... : .:.: : :: CCDS12 YEGRGWHWVGAHTLGH--NSRGFGVAIVGNYTAALPTEAALRTVRDTLPSCAVRAGLLRP 460 470 480 490 500 510 170 180 190 pF1KE1 NYVLKGHRDVQRTLSPGNQLYHLIQNWPHYRSP .:.: :::.. :: ::. :. :...:::. CCDS12 DYALLGHRQLVRTDCPGDALFDLLRTWPHFTATVKPRPARSVSKRSRREPPPRTLPATDL 520 530 540 550 560 570 >>CCDS1035.1 PGLYRP3 gene_id:114771|Hs108|chr1 (341 aa) initn: 624 init1: 305 opt: 420 Z-score: 515.8 bits: 103.1 E(32554): 2.1e-22 Smith-Waterman score: 582; 41.4% identity (73.7% similar) in 186 aa overlap (13-194:157-340) 10 20 30 pF1KE1 MSRRSMLLAWALPSLLRLGAAQETEDPA----CCSPIVPRNE : ::. . . . :. : :. :. CCDS10 SSPSPAALSAAEGLISYAIQKGHLSPRYIQPLLLKEETCLDPQHPVMPRKVCPNIIKRSA 130 140 150 160 170 180 40 50 60 70 80 90 pF1KE1 WKALASECAQHLSLPLRYVVVSHTAGSSCNTPASCQQQARNVQHYHMKTLGWCDVGYNFL :.: ..: . ..:: .::.. ::::.::.. ..:: .::.: .:: : ..::.::.:: CCDS10 WEARETHCPK-MNLPAKYVIIIHTAGTSCTVSTDCQTVVRNIQSFHMDTRNFCDIGYHFL 190 200 210 220 230 240 100 110 120 130 140 150 pF1KE1 IGEDGLVYEGRGWNFTGAHSGHLWNPMSIGISFMGNYMDRVPTPQAIRAAQGLLACGVAQ .:.:: :::: ::.. :.:. . .: ...::.:.: .... :. :..::: :. :.:.. CCDS10 VGQDGGVYEGVGWHIQGSHT-YGFNDIALGIAFIGYFVEKPPNAAALEAAQDLIQCAVVE 250 260 270 280 290 300 160 170 180 190 pF1KE1 GALRSNYVLKGHRDVQRTLSPGNQLYHLIQNWPHYRSP : : ::.: :: :: ::::. ::..:..:::.. CCDS10 GYLTPNYLLMGHSDVVNILSPGQALYNIISTWPHFKH 310 320 330 340 >-- initn: 631 init1: 305 opt: 420 Z-score: 515.8 bits: 103.1 E(32554): 2.1e-22 Smith-Waterman score: 420; 39.6% identity (66.7% similar) in 159 aa overlap (8-166:4-155) 10 20 30 40 50 60 pF1KE1 MSRRSMLLAWALPSLLRLGAAQETEDPACCSPIVPRNEWKALASECAQHLSLPLRYVVVS : : : .. :: : . :. :: :.:: : : :.::. :.... CCDS10 MGTLPWLLAFFI-LGL-QAWDTPT----IVSRKEWGARPLACRALLTLPVAYIITD 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 HTAGSSCNTPASCQQQARNVQHYHMKTLGWCDVGYNFLIGEDGLVYEGRGWNFTGAHSGH . : .:. . :.:. :..: . . :.:::::.::::.:.:: :::: :::. : :. . CCDS10 QLPGMQCQQQSVCSQMLRGLQSHSVYTIGWCDVAYNFLVGDDGRVYEGVGWNIQGLHT-Q 60 70 80 90 100 130 140 150 160 170 180 pF1KE1 LWNPMSIGISFMGNYMDRVPTPQAIRAAQGLLACGVAQGALRSNYVLKGHRDVQRTLSPG .: .:.::.:.:: . :.: :. ::.::.. .. .: : :. CCDS10 GYNNISLGIAFFGNKIGSSPSPAALSAAEGLISYAIQKGHLSPRYIQPLLLKEETCLDPQ 110 120 130 140 150 160 190 pF1KE1 NQLYHLIQNWPHYRSP CCDS10 HPVMPRKVCPNIIKRSAWEARETHCPKMNLPAKYVIIIHTAGTSCTVSTDCQTVVRNIQS 170 180 190 200 210 220 196 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 13:20:42 2016 done: Sun Nov 6 13:20:43 2016 Total Scan time: 2.040 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]