FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6253, 214 aa
  1>>>pF1KE6253 214 - 214 aa - 214 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 4.7186+/-0.000717; mu= 17.2735+/- 0.043
 mean_var=62.4192+/-12.491, 0's: 0 Z-trim(108.3): 31 B-trim: 0 in 0/52
 Lambda= 0.162336
 statistics sampled from 10073 (10100) to 10073 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.689), E-opt: 0.2 (0.31), width: 16
 Scan time: 2.200

The best scores are:                                opt  bits E(32554)
CCDS31567.1 MS4A3 gene_id:932|Hs108|chr11   ( 214) 1418 340.1 6.1e-94
CCDS31568.1 MS4A3 gene_id:932|Hs108|chr11   ( 168)  757 185.2 2e-47
CCDS41651.1 MS4A3 gene_id:932|Hs108|chr11   (  91)  587 145.2 1.2e-35
CCDS7982.1 MS4A4A gene_id:51338|Hs108|chr11 ( 239)  315  81.8 3.8e-16
CCDS7987.1 MS4A5 gene_id:64232|Hs108|chr11  ( 200)  260  68.9 2.5e-12
CCDS7980.1 MS4A2 gene_id:2206|Hs108|chr11   ( 244)  250  66.6 1.5e-11
CCDS7990.1 MS4A8 gene_id:83661|Hs108|chr11  ( 250)  245  65.5 3.4e-11

>>CCDS31567.1 MS4A3 gene_id:932|Hs108|chr11 (214 aa)
 initn: 1418 init1: 1418 opt: 1418 Z-score: 1800.0 bits: 340.1 E(32554): 6.1e-94
Smith-Waterman score: 1418; 100.0% identity (100.0% similar) in 214 aa overlap (1-214:1-214)

               10        20        30        40        50        60
pF1KE6 MASHEVDNAELGSASAHGTPGSEAGPEELNTSVYQPIDGSPDYQKAKLQVLGAIQILNAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 MASHEVDNAELGSASAHGTPGSEAGPEELNTSVYQPIDGSPDYQKAKLQVLGAIQILNAA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 MILALGVFLGSLQYPYHFQKHFFFFTFYTGYPIWGAVFFCSSGTLSVVAGIKPTRTWIQN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 MILALGVFLGSLQYPYHFQKHFFFFTFYTGYPIWGAVFFCSSGTLSVVAGIKPTRTWIQN
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 SFGMNIASATIALVGTAFLSLNIAVNIQSLRSCHSSSESPDLCNYMGSISNGMVSLLLIL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 SFGMNIASATIALVGTAFLSLNIAVNIQSLRSCHSSSESPDLCNYMGSISNGMVSLLLIL
              130       140       150       160       170       180

              190       200       210
pF1KE6 TLLELCVTISTIAMWCNANCCNSREEISSPPNSV
       ::::::::::::::::::::::::::::::::::
CCDS31 TLLELCVTISTIAMWCNANCCNSREEISSPPNSV
              190       200       210

>>CCDS31568.1 MS4A3 gene_id:932|Hs108|chr11 (168 aa)
 initn: 1085 init1: 757 opt: 757 Z-score: 964.8 bits: 185.2 E(32554): 2e-47
Smith-Waterman score: 997; 78.5% identity (78.5% similar) in 214 aa overlap (1-214:1-168)

               10        20        30        40        50        60
pF1KE6 MASHEVDNAELGSASAHGTPGSEAGPEELNTSVYQPIDGSPDYQKAKLQVLGAIQILNAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 MASHEVDNAELGSASAHGTPGSEAGPEELNTSVYQPIDGSPDYQKAKLQVLG--------
               10        20        30        40        50

               70        80        90       100       110       120
pF1KE6 MILALGVFLGSLQYPYHFQKHFFFFTFYTGYPIWGAVFFCSSGTLSVVAGIKPTRTWIQN
                                             ::::::::::::::::::::::
CCDS31 --------------------------------------FCSSGTLSVVAGIKPTRTWIQN
                                                   60        70

              130       140       150       160       170       180
pF1KE6 SFGMNIASATIALVGTAFLSLNIAVNIQSLRSCHSSSESPDLCNYMGSISNGMVSLLLIL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 SFGMNIASATIALVGTAFLSLNIAVNIQSLRSCHSSSESPDLCNYMGSISNGMVSLLLIL
           80        90       100       110       120       130

              190       200       210
pF1KE6 TLLELCVTISTIAMWCNANCCNSREEISSPPNSV
       ::::::::::::::::::::::::::::::::::
CCDS31 TLLELCVTISTIAMWCNANCCNSREEISSPPNSV
          140       150       160

>>CCDS41651.1 MS4A3 gene_id:932|Hs108|chr11 (91 aa)
 initn: 587 init1: 587 opt: 587 Z-score: 753.3 bits: 145.2 E(32554): 1.2e-35
Smith-Waterman score: 587; 100.0% identity (100.0% similar) in 91 aa overlap (124-214:1-91)

           100       110       120       130       140       150
pF1KE6 WGAVFFCSSGTLSVVAGIKPTRTWIQNSFGMNIASATIALVGTAFLSLNIAVNIQSLRSC
                                     ::::::::::::::::::::::::::::::
CCDS41                               MNIASATIALVGTAFLSLNIAVNIQSLRSC
                                             10        20        30

           160       170       180       190       200       210
pF1KE6 HSSSESPDLCNYMGSISNGMVSLLLILTLLELCVTISTIAMWCNANCCNSREEISSPPNS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 HSSSESPDLCNYMGSISNGMVSLLLILTLLELCVTISTIAMWCNANCCNSREEISSPPNS
               40        50        60        70        80        90

pF1KE6 V
       :
CCDS41 V

>>CCDS7982.1 MS4A4A
gene_id:51338|Hs108|chr11 (239 aa) initn: 349 init1: 172 opt: 315 Z-score: 403.3 bits: 81.8 E(32554): 3.8e-16 Smith-Waterman score: 315; 31.7% identity (64.5% similar) in 186 aa overlap (20-201:31-212) 10 20 30 40 pF1KE6 MASHEVDNAELGSASAHGTPGSEAG-PEELNTSVYQP--IDGSPD-YQK ::. : :. : .: . : . . : CCDS79 MHQTYSRHCRPEESTFSAAMTTMQGMEQAMPGAGPGVPQLGNMAVIHSHLWKGLQEKFLK 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE6 AKLQVLGAIQILNAAMILALGVFLGSLQYPYHFQKHFFFFTFYTGYPIWGAVFFCSSGTL .. .:::..:::.: : :..:. . . . .. .. : :: :::.:.: ::.: CCDS79 GEPKVLGVVQILTALMSLSMGITMMCMASNTYGSNP---ISVYIGYTIWGSVMFIISGSL 70 80 90 100 110 110 120 130 140 150 160 pF1KE6 SVVAGIKPTRTWIQNSFGMNIASATIALVGTAFLSLNIAVNIQSLRSCHSSSESPDLCNY :..:::. :. ...:.::::.:...: : . ....: :. ..: . :. CCDS79 SIAAGIRTTKGLVRGSLGMNITSSVLAASGILINTFSLAFYSFHHPYCNYYGNSNN-CHG 120 130 140 150 160 170 170 180 190 200 210 pF1KE6 MGSISNGMVSLLLILTLLELCVTISTIAMWCNANCCNSREEISSPPNSV :: :. ...:.:..::.:...: :. :.. :: CCDS79 TMSILMGLDGMVLLLSVLEFCIAVSLSAFGCKVLCCTPGGVVLILPSHSHMAETASPTPL 180 190 200 210 220 230 CCDS79 NEV >>CCDS7987.1 MS4A5 gene_id:64232|Hs108|chr11 (200 aa) initn: 198 init1: 123 opt: 260 Z-score: 334.7 bits: 68.9 E(32554): 2.5e-12 Smith-Waterman score: 260; 28.6% identity (60.2% similar) in 206 aa overlap (11-205:1-198) 10 20 30 40 50 pF1KE6 MASHEVDNAELGSASAHGTPGSEAGPEELNTSVYQPIDGSPD-------YQKA---KLQV . :..:: .: . : :...: :. . : :: :... CCDS79 MDSSTAH-SPVFLVFPPEITASEYESTELSATTFSTQSPLQKLFARKMKI 10 20 30 40 60 70 80 90 100 pF1KE6 LGAIQILNAAMILALGV-FLGSLQYPYHFQKHFFFFTFYTGYPIWGAVFFCSSGTLSVVA ::.:::: . : ...:: :: .: :: : : : .:::.::.:.: .::.. ... CCDS79 LGTIQILFGIMTFSFGVIFLFTLLKPYPR----FPFIFLSGYPFWGSVLFINSGAFLIAV 50 60 70 80 90 100 110 120 130 140 150 160 pF1KE6 GIKPTRTWIQNSFGMNIASATIALVGTAFLSLNIAVNIQSLRSCHSSSESPDLCNYMGSI : :.: : : ::. :: :..: .:.... .. . . : : .. . :. . . CCDS79 KRKTTETLIILSRIMNFLSALGAIAGIILLTFGFILDQNYI--CGYSHQNSQ-CKAVTVL 110 120 130 140 150 160 170 180 190 200 210 pF1KE6 SNGMVSLLLILTLLELCVTISTIAMWCNANCCNSREEISSPPNSV :.. 
:. ....:: ... . :... :. .. CCDS79 FLGILITLMTFSIIELFISLPFSILGCHSEDCDCEQCC 170 180 190 200 >>CCDS7980.1 MS4A2 gene_id:2206|Hs108|chr11 (244 aa) initn: 187 init1: 152 opt: 250 Z-score: 320.9 bits: 66.6 E(32554): 1.5e-11 Smith-Waterman score: 250; 29.3% identity (61.4% similar) in 184 aa overlap (23-198:27-204) 10 20 30 40 50 pF1KE6 MASHEVDNAELGSASAHGTPGSEAGPEELNTSVYQPIDGSPDYQ------KAKLQV : .:.:.... .:: . : . . CCDS79 MDTESNRRANLALPQEPSSVPAFEVLEISPQEVSSGRLLKSASSPPLHTWLTVLKKEQEF 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 LGAIQILNAAMILALGVFLGSLQYPYHFQKHFFFFTFYTGYPIWGAVFFCSSGTLSVVAG ::. :::.: . : .:. . :. :.. .: .: .:::.:::.:: :: ::... CCDS79 LGVTQILTAMICLCFGTVVCSVLDISHIEGDIFS-SFKAGYPFWGAIFFSISGMLSIISE 70 80 90 100 110 120 130 140 150 160 pF1KE6 IKPTRTWIQNSFGMNIASATIALVGTAFLSLNIAVNIQ--SLRSCHSSSESPDLCNYMGS . . ...:.: : ::. . .: ..: .:. .. ..::.. :. : .:.: CCDS79 RRNATYLVRGSLGANTASSIAGGTGITILIINLKKSLAYIHIHSCQKFFETK--C-FMAS 120 130 140 150 160 170 170 180 190 200 210 pF1KE6 ISNGMVSLLLILTLLELCVTISTIAMWCNANCCNSREEISSPPNSV .:. .: ..:.::.: : ..: :.: CCDS79 FSTEIVVMMLFLTILGLGSAVSLTI--CGAGEELKGNKVPEDRVYEELNIYSATYSELED 180 190 200 210 220 230 >>CCDS7990.1 MS4A8 gene_id:83661|Hs108|chr11 (250 aa) initn: 205 init1: 109 opt: 245 Z-score: 314.4 bits: 65.5 E(32554): 3.4e-11 Smith-Waterman score: 247; 28.9% identity (61.2% similar) in 201 aa overlap (17-212:44-223) 10 20 30 40 pF1KE6 MASHEVDNAELGSASAHGTPGSEAGPEELNTSVYQPIDGSPDYQKA : .::. : : ..: .:.: ::: CCDS79 VLVVAPHNGYPVTPGIMSHVPLYPNSQPQVHLVPGN---PPSLVSNV----NGQP-VQKA 20 30 40 50 60 50 60 70 80 90 100 pF1KE6 --KLQVLGAIQILNAAMILALGVFLGSLQYPYHFQKHFFFFTFYTGYPIWGAVFFCSSGT . ..::::::. . ..:: ..... . ... ..:: :.:.::...: ::. CCDS79 LKEGKTLGAIQIIIGLAHIGLGSIMATV-----LVGEYLSISFYGGFPFWGGLWFIISGS 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE6 LSVVAGIKP-TRTWIQNSFGMNIASATIALVGTAFLSLNIAVNIQSLRSCHSSSESPDLC :::.: .: . ...:.:.::.:: . ::. .. .... : . 
:: CCDS79 LSVAAENQPYSYCLLSGSLGLNIVSAICSAVGVILFITDLSIP-------HPYAY-PDYY 130 140 150 160 170 170 180 190 200 210 pF1KE6 NYMGSISNGMV--SLLLILTLLELCVTISTIAMWCNANCCNSREEISSPPNSV : ... ::. ..::.. :::. .. .. . :. ::.: . :: CCDS79 PYAWGVNPGMAISGVLLVFCLLEFGIACASSHFGCQLVCCQSSNVSVIYPNIYAANPVIT 180 190 200 210 220 230 CCDS79 PEPVTSPPSYSSEIQANK 240 250

214 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 11:29:32 2016 done: Tue Nov 8 11:29:33 2016
 Total Scan time: 2.200 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]
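
Note on reading the "best scores" rows programmatically: the sketch below is not part of the FASTA output. It assumes each summary row has the shape "accession gene annotation ( length) opt bits E", as the rows in this report do. The names Hit, HIT_ROW and parse_hit are hypothetical helpers introduced only for illustration, and other FASTA output modes may need a different pattern.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'The best scores are:' summary (hypothetical container)."""
    accession: str
    gene: str
    length: int
    opt: int
    bits: float
    evalue: float


# Assumed row shape: "<accession> <gene> <annotation> ( <len>) <opt> <bits> <E>"
HIT_ROW = re.compile(
    r"^(?P<acc>\S+)\s+(?P<gene>\S+)\s+.*?"
    r"\(\s*(?P<len>\d+)\)\s+(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<e>[\d.eE+-]+)\s*$"
)


def parse_hit(row: str) -> Optional[Hit]:
    """Return a Hit for a summary row, or None if the row does not match."""
    m = HIT_ROW.match(row.strip())
    if m is None:
        return None
    return Hit(
        accession=m.group("acc"),
        gene=m.group("gene"),
        length=int(m.group("len")),
        opt=int(m.group("opt")),
        bits=float(m.group("bits")),
        evalue=float(m.group("e")),
    )


if __name__ == "__main__":
    rows = [
        "The best scores are: opt bits E(32554)",
        "CCDS31567.1 MS4A3 gene_id:932|Hs108|chr11 ( 214) 1418 340.1 6.1e-94",
        "CCDS7982.1 MS4A4A gene_id:51338|Hs108|chr11 ( 239) 315 81.8 3.8e-16",
    ]
    for row in rows:
        hit = parse_hit(row)
        if hit is not None:  # the column-header row is skipped
            print(hit.accession, hit.gene, hit.opt, hit.evalue)
```

Rows that do not fit the assumed shape, such as the column-header row, return None, so the same function can be mapped over every line of the summary section.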