FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6611, 267 aa
  1>>>pF1KE6611 267 - 267 aa - 267 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.1942+/-0.00084; mu= 11.5840+/- 0.051
 mean_var=87.9733+/-17.735, 0's: 0 Z-trim(109.1): 35 B-trim: 0 in 0/50
 Lambda= 0.136741
 statistics sampled from 10592 (10626) to 10592 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.683), E-opt: 0.2 (0.326), width: 16
 Scan time: 1.660

The best scores are:                                  opt  bits E(32554)
CCDS7988.1 MS4A12 gene_id:54860|Hs108|chr11   ( 267) 1750 354.7 3.9e-98
CCDS53638.1 MS4A12 gene_id:54860|Hs108|chr11  ( 221)  831 173.4 1.3e-43
CCDS7990.1 MS4A8 gene_id:83661|Hs108|chr11    ( 250)  506 109.3 2.8e-24
CCDS44617.1 MS4A15 gene_id:219995|Hs108|chr11 ( 240)  431  94.5 7.7e-20
CCDS7991.1 MS4A15 gene_id:219995|Hs108|chr11  ( 147)  337  75.8 1.9e-14
CCDS7982.1 MS4A4A gene_id:51338|Hs108|chr11   ( 239)  289  66.4 2.1e-11
CCDS60802.1 MS4A15 gene_id:219995|Hs108|chr11 ( 199)  281  64.8 5.3e-11

>>CCDS7988.1 MS4A12 gene_id:54860|Hs108|chr11 (267 aa)
 initn: 1750 init1: 1750 opt: 1750 Z-score: 1875.3 bits: 354.7 E(32554): 3.9e-98
Smith-Waterman score: 1750; 100.0% identity (100.0% similar) in 267 aa overlap (1-267:1-267)

               10        20        30        40        50        60
pF1KE6 MMSSKPTSHAEVNETIPNPYPPSSFMAPGFQQPLGSINLENQAQGAQRAQPYGITSPGIF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS79 MMSSKPTSHAEVNETIPNPYPPSSFMAPGFQQPLGSINLENQAQGAQRAQPYGITSPGIF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 ASSQPGQGNIQMINPSVGTAVMNFKEEAKALGVIQIMVGLMHIGFGIVLCLISFSFREVL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS79 ASSQPGQGNIQMINPSVGTAVMNFKEEAKALGVIQIMVGLMHIGFGIVLCLISFSFREVL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 GFASTAVIGGYPFWGGLSFIISGSLSVSASKELSRCLVKGSLGMNIVSSILAFIGVILLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS79 GFASTAVIGGYPFWGGLSFIISGSLSVSASKELSRCLVKGSLGMNIVSSILAFIGVILLL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 VDMCINGVAGQDYWAVLSGKGISATLMIFSLLEFFVACATAHFANQANTTTNMSVLVIPN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS79 VDMCINGVAGQDYWAVLSGKGISATLMIFSLLEFFVACATAHFANQANTTTNMSVLVIPN
              190       200       210       220       230       240

              250       260
pF1KE6 MYESNPVTPASSSAPPRCNNYSANAPK
       :::::::::::::::::::::::::::
CCDS79 MYESNPVTPASSSAPPRCNNYSANAPK
              250       260

>>CCDS53638.1 MS4A12 gene_id:54860|Hs108|chr11 (221 aa)
 initn: 831 init1: 831 opt: 831 Z-score: 896.7 bits: 173.4 E(32554): 1.3e-43
Smith-Waterman score: 1339; 82.8% identity (82.8% similar) in 267 aa overlap (1-267:1-221)

               10        20        30        40        50        60
pF1KE6 MMSSKPTSHAEVNETIPNPYPPSSFMAPGFQQPLGSINLENQAQGAQRAQPYGITSPGIF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 MMSSKPTSHAEVNETIPNPYPPSSFMAPGFQQPLGSINLENQAQGAQRAQPYGITSPGIF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 ASSQPGQGNIQMINPSVGTAVMNFKEEAKALGVIQIMVGLMHIGFGIVLCLISFSFREVL
       ::::::::::::::::::::::::::::::::
CCDS53 ASSQPGQGNIQMINPSVGTAVMNFKEEAKALG----------------------------
               70        80        90

              130       140       150       160       170       180
pF1KE6 GFASTAVIGGYPFWGGLSFIISGSLSVSASKELSRCLVKGSLGMNIVSSILAFIGVILLL
                         ::::::::::::::::::::::::::::::::::::::::::
CCDS53 ------------------FIISGSLSVSASKELSRCLVKGSLGMNIVSSILAFIGVILLL
                              100       110       120       130

              190       200       210       220       230       240
pF1KE6 VDMCINGVAGQDYWAVLSGKGISATLMIFSLLEFFVACATAHFANQANTTTNMSVLVIPN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 VDMCINGVAGQDYWAVLSGKGISATLMIFSLLEFFVACATAHFANQANTTTNMSVLVIPN
          140       150       160       170       180       190

              250       260
pF1KE6 MYESNPVTPASSSAPPRCNNYSANAPK
       :::::::::::::::::::::::::::
CCDS53 MYESNPVTPASSSAPPRCNNYSANAPK
          200       210       220

>>CCDS7990.1 MS4A8 gene_id:83661|Hs108|chr11 (250 aa)
 initn: 327 init1: 133 opt: 506 Z-score: 549.4 bits: 109.3 E(32554): 2.8e-24
Smith-Waterman score: 506; 42.5% identity (66.4% similar) in 226 aa overlap (52-256:23-240)

              30        40        50              60        70
pF1KE6 
PSSFMAPGFQQPLGSINLENQAQGAQRAQPYGITSPGI------FASSQPGQGNIQMINP : .: ::: . .::: . : CCDS79 MNSMTSAVPVANSVLVVAPHNGYPVT-PGIMSHVPLYPNSQPQVHLVPGNPP 10 20 30 40 50 80 90 100 110 120 pF1KE6 SV-----GTAVMNFKEEAKALGVIQIMVGLMHIGFGIVLCLISFSFREVLG-FASTAVIG :. : :.. .:.:.::.:::..:: :::.: .. . ..: . : . : CCDS79 SLVSNVNGQPVQKALKEGKTLGAIQIIIGLAHIGLGSIMATV------LVGEYLSISFYG 60 70 80 90 100 130 140 150 160 170 180 pF1KE6 GYPFWGGLSFIISGSLSVSA-SKELSRCLVKGSLGMNIVSSILAFIGVILLLVDMCI-NG :.:::::: :::::::::.: .. : ::..::::.::::.: . .::::...:. : . CCDS79 GFPFWGGLWFIISGSLSVAAENQPYSYCLLSGSLGLNIVSAICSAVGVILFITDLSIPHP 110 120 130 140 150 160 190 200 210 220 230 240 pF1KE6 VAGQDY----WAVLSGKGISATLMIFSLLEFFVACATAHFANQ--ANTTTNMSVLVIPNM : :: :.: : .::..:..: :::: .:::..::. : ..:.:: . ::. CCDS79 YAYPDYYPYAWGVNPGMAISGVLLVFCLLEFGIACASSHFGCQLVCCQSSNVSV-IYPNI 170 180 190 200 210 220 250 260 pF1KE6 YESNPV-TPASSSAPPRCNNYSANAPK : .::: :: ..:: CCDS79 YAANPVITPEPVTSPPSYSSEIQANK 230 240 250 >>CCDS44617.1 MS4A15 gene_id:219995|Hs108|chr11 (240 aa) initn: 447 init1: 221 opt: 431 Z-score: 469.7 bits: 94.5 E(32554): 7.7e-20 Smith-Waterman score: 446; 35.5% identity (60.4% similar) in 265 aa overlap (2-260:1-232) 10 20 30 40 50 pF1KE6 MMSSKPTSHAEVNETIPNPY----PPSSFMAPGFQQPLGSINLENQAQGAQRAQPYGITS ::. :.:.. :: :: ... .. :: : ...:. ::: . CCDS44 MSAAPASNGVFVVIPPNNASGLCPPPAILPTSMCQPPGIMQFEEPPLGAQ--------T 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 PGIFASSQPGQGNIQMINPSVGTAVMNFKEEAKALGVIQIMVGLMHIGFGIVLCLISFSF : ..:: :.. . . : :.::..::..::.:.::: :: .. CCDS44 P---RATQP---------PDLRPVETFLTGEPKVLGTVQILIGLIHLGFGSVLLMVR--- 60 70 80 90 120 130 140 150 160 170 pF1KE6 REVLGFASTAVIGGYPFWGGLSFIISGSLSVSASKELSRCLVKGSLGMNIVSSILAFIGV : .:. . :: ::::: :::::::::.: :. . :::..::: ::.: . :: :. CCDS44 RGHVGI--FFIEGGVPFWGGACFIISGSLSVAAEKNHTSCLVRSSLGTNILSVMAAFAGT 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE6 ILLLVDMCINGVAGQDYWAVLSGKGISATLMIFSLLEFFVACATAHFANQA-NTTTNMSV .::.:. ::...: .: :.: ::..::::.: . ::. :: .. .. 
: CCDS44 AILLMDF---GVTNRDV-----DRGYLAVLTIFTVLEFFTAVIAMHFGCQAIHAQASAPV 160 170 180 190 200 240 250 260 pF1KE6 LVIPNMYESNPVTPA-SSSAPPRCNNYSANAPK . .:: . .. :. ..:::: .: CCDS44 IFLPNAFSADFNIPSPAASAPPAYDNVAYAQGVV 210 220 230 240 >>CCDS7991.1 MS4A15 gene_id:219995|Hs108|chr11 (147 aa) initn: 330 init1: 221 opt: 337 Z-score: 372.7 bits: 75.8 E(32554): 1.9e-14 Smith-Waterman score: 337; 44.0% identity (69.4% similar) in 134 aa overlap (129-260:14-139) 100 110 120 130 140 150 pF1KE6 GLMHIGFGIVLCLISFSFREVLGFASTAVIGGYPFWGGLSFIISGSLSVSASKELSRCLV :: ::::: :::::::::.: :. . ::: CCDS79 MVRRGHVGIFFIEGGVPFWGGACFIISGSLSVAAEKNHTSCLV 10 20 30 40 160 170 180 190 200 210 pF1KE6 KGSLGMNIVSSILAFIGVILLLVDMCINGVAGQDYWAVLSGKGISATLMIFSLLEFFVAC ..::: ::.: . :: :. .::.:. ::...: .: :.: ::..::::.: CCDS79 RSSLGTNILSVMAAFAGTAILLMDF---GVTNRDV-----DRGYLAVLTIFTVLEFFTAV 50 60 70 80 90 220 230 240 250 260 pF1KE6 ATAHFANQA-NTTTNMSVLVIPNMYESNPVTPA-SSSAPPRCNNYSANAPK . ::. :: .. .. :. .:: . .. :. ..:::: .: CCDS79 IAMHFGCQAIHAQASAPVIFLPNAFSADFNIPSPAASAPPAYDNVAYAQGVV 100 110 120 130 140 >>CCDS7982.1 MS4A4A gene_id:51338|Hs108|chr11 (239 aa) initn: 246 init1: 173 opt: 289 Z-score: 318.3 bits: 66.4 E(32554): 2.1e-11 Smith-Waterman score: 318; 31.8% identity (63.7% similar) in 223 aa overlap (44-249:24-235) 20 30 40 50 60 70 pF1KE6 ETIPNPYPPSSFMAPGFQQPLGSINLENQAQGAQRAQPYGITSPGIFASSQPGQGNIQMI :: ..:.: : ..::. : ::. .: CCDS79 MHQTYSRHCRPEESTFSAAMTTMQGMEQAMP-G-AGPGV-----PQLGNMAVI 10 20 30 40 80 90 100 110 120 130 pF1KE6 NPSVGTAVMN--FKEEAKALGVIQIMVGLMHIGFGI-VLCLISFSFREVLGFASTAVIGG . . .... .: : :.:::.::...:: ...:: ..:. : .. : .: : CCDS79 HSHLWKGLQEKFLKGEPKVLGVVQILTALMSLSMGITMMCMASNTY----GSNPISVYIG 50 60 70 80 90 100 140 150 160 170 180 pF1KE6 YPFWGGLSFIISGSLSVSASKELSRCLVKGSLGMNIVSSILAFIGVILLLVDM------- : .::.. ::::::::..:. . .. ::.:::::::.::.:: :... .. 
CCDS79 YTIWGSVMFIISGSLSIAAGIRTTKGLVRGSLGMNITSSVLAASGILINTFSLAFYSFHH 110 120 130 140 150 160 190 200 210 220 230 pF1KE6 --C-INGVAGQDYWAVLSGKGISATLMIFSLLEFFVACATAHFANQA-NTTTNMSVLVIP : : ... . .. :... ....:.::: .: . . :. .. : . ::..: CCDS79 PYCNYYGNSNNCHGTMSILMGLDGMVLLLSVLEFCIAVSLSAFGCKVLCCTPGGVVLILP 170 180 190 200 210 220 240 250 260 pF1KE6 N---MYESNPVTPASSSAPPRCNNYSANAPK . : :. :: CCDS79 SHSHMAETASPTPLNEV 230 >>CCDS60802.1 MS4A15 gene_id:219995|Hs108|chr11 (199 aa) initn: 301 init1: 160 opt: 281 Z-score: 311.0 bits: 64.8 E(32554): 5.3e-11 Smith-Waterman score: 281; 42.1% identity (69.8% similar) in 126 aa overlap (137-260:74-191) 110 120 130 140 150 160 pF1KE6 IVLCLISFSFREVLGFASTAVIGGYPFWGGLSFIISGSLSVSASKELSRCLVKGSLGMNI :.:::::::::.: :. . :::..::: :: CCDS60 EPPLGAQTPRATQPPDLRPVETFLTGEPKVLGFIISGSLSVAAEKNHTSCLVRSSLGTNI 50 60 70 80 90 100 170 180 190 200 210 220 pF1KE6 VSSILAFIGVILLLVDMCINGVAGQDYWAVLSGKGISATLMIFSLLEFFVACATAHFANQ .: . :: :. .::.:. ::...: .: :.: ::..::::.: . ::. : CCDS60 LSVMAAFAGTAILLMDF---GVTNRDV-----DRGYLAVLTIFTVLEFFTAVIAMHFGCQ 110 120 130 140 150 230 240 250 260 pF1KE6 A-NTTTNMSVLVIPNMYESNPVTPA-SSSAPPRCNNYSANAPK : .. .. :. .:: . .. :. ..:::: .: CCDS60 AIHAQASAPVIFLPNAFSADFNIPSPAASAPPAYDNVAYAQGVV 160 170 180 190 267 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 14:45:43 2016 done: Tue Nov 8 14:45:43 2016 Total Scan time: 1.660 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]