FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1738, 244 aa
  1>>>pF1KE1738 244 - 244 aa - 244 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 4.7230+/-0.000729; mu= 17.2828+/- 0.044
 mean_var=55.3055+/-11.201, 0's: 0 Z-trim(108.0): 28 B-trim: 556 in 1/46
 Lambda= 0.172461
 statistics sampled from 9886 (9913) to 9886 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.688), E-opt: 0.2 (0.305), width: 16
 Scan time: 2.070

The best scores are:                                       opt bits E(32554)
CCDS7980.1 MS4A2 gene_id:2206|Hs108|chr11          ( 244) 1573 399.0 1.5e-111
CCDS73292.1 MS4A2 gene_id:2206|Hs108|chr11         ( 199)  864 222.6 1.6e-58
CCDS7982.1 MS4A4A gene_id:51338|Hs108|chr11        ( 239)  257  71.6 5.3e-13
CCDS31567.1 MS4A3 gene_id:932|Hs108|chr11          ( 214)  250  69.8 1.6e-12
CCDS7988.1 MS4A12 gene_id:54860|Hs108|chr11        ( 267)  239  67.1 1.3e-11
CCDS44617.1 MS4A15 gene_id:219995|Hs108|chr11      ( 240)  237  66.6 1.7e-11
CCDS58136.1 MS4A14 gene_id:84689|Hs108|chr11       ( 712)  242  68.1 1.7e-11

>>CCDS7980.1 MS4A2 gene_id:2206|Hs108|chr11 (244 aa)
 initn: 1573 init1: 1573 opt: 1573 Z-score: 2116.3 bits: 399.0 E(32554): 1.5e-111
Smith-Waterman score: 1573; 100.0% identity (100.0% similar) in 244 aa overlap (1-244:1-244)

               10        20        30        40        50        60
pF1KE1 MDTESNRRANLALPQEPSSVPAFEVLEISPQEVSSGRLLKSASSPPLHTWLTVLKKEQEF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS79 MDTESNRRANLALPQEPSSVPAFEVLEISPQEVSSGRLLKSASSPPLHTWLTVLKKEQEF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 LGVTQILTAMICLCFGTVVCSVLDISHIEGDIFSSFKAGYPFWGAIFFSISGMLSIISER
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS79 LGVTQILTAMICLCFGTVVCSVLDISHIEGDIFSSFKAGYPFWGAIFFSISGMLSIISER
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 RNATYLVRGSLGANTASSIAGGTGITILIINLKKSLAYIHIHSCQKFFETKCFMASFSTE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS79 RNATYLVRGSLGANTASSIAGGTGITILIINLKKSLAYIHIHSCQKFFETKCFMASFSTE
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE1 IVVMMLFLTILGLGSAVSLTICGAGEELKGNKVPEDRVYEELNIYSATYSELEDPGEMSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS79 IVVMMLFLTILGLGSAVSLTICGAGEELKGNKVPEDRVYEELNIYSATYSELEDPGEMSP
              190       200       210       220       230       240

pF1KE1 PIDL
       ::::
CCDS79 PIDL

>>CCDS73292.1 MS4A2 gene_id:2206|Hs108|chr11 (199 aa)
 initn: 880 init1: 864 opt: 864 Z-score: 1164.2 bits: 222.6 E(32554): 1.6e-58
Smith-Waterman score: 1162; 81.6% identity (81.6% similar) in 244 aa overlap (1-244:1-199)

               10        20        30        40        50        60
pF1KE1 MDTESNRRANLALPQEPSSVPAFEVLEISPQEVSSGRLLKSASSPPLHTWLTVLKKEQEF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 MDTESNRRANLALPQEPSSVPAFEVLEISPQEVSSGRLLKSASSPPLHTWLTVLKKEQEF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 LGVTQILTAMICLCFGTVVCSVLDISHIEGDIFSSFKAGYPFWGAIFFSISGMLSIISER
       ::                                             :::::::::::::
CCDS73 LG---------------------------------------------FSISGMLSIISER
                                                            70

              130       140       150       160       170       180
pF1KE1 RNATYLVRGSLGANTASSIAGGTGITILIINLKKSLAYIHIHSCQKFFETKCFMASFSTE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 RNATYLVRGSLGANTASSIAGGTGITILIINLKKSLAYIHIHSCQKFFETKCFMASFSTE
          80        90       100       110       120       130

              190       200       210       220       230       240
pF1KE1 IVVMMLFLTILGLGSAVSLTICGAGEELKGNKVPEDRVYEELNIYSATYSELEDPGEMSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 IVVMMLFLTILGLGSAVSLTICGAGEELKGNKVPEDRVYEELNIYSATYSELEDPGEMSP
         140       150       160       170       180       190

pF1KE1 PIDL
       ::::
CCDS73 PIDL

>>CCDS7982.1 MS4A4A gene_id:51338|Hs108|chr11 (239 aa)
 initn: 217 init1: 156 opt: 257 Z-score: 346.8 bits: 71.6 E(32554): 5.3e-13
Smith-Waterman score: 257; 37.0% identity (62.4% similar) in 165 aa overlap (48-203:49-206)

        20        30        40        50            60        70
pF1KE1 SSVPAFEVLEISPQEVSSGRLLKSASSPPLHTWLTV----LKKEQEFLGVTQILTAMICL
                                     : :  .    :: : . :::.:::::.. :
CCDS79 AMTTMQGMEQAMPGAGPGVPQLGNMAVIHSHLWKGLQEKFLKGEPKVLGVVQILTALMSL
       20        30        40        50        60        70

             80        90       100       110       120       130
pF1KE1 CFG-TVVCSVLDISHIEGDIFSSFKAGYPFWGAIFFSISGMLSIISERRNATYLVRGSLG
        .: :..: .   :.  :.   :   :: .::...: ::: ::: .  :..  :::::::
CCDS79 SMGITMMCMA---SNTYGSNPISVYIGYTIWGSVMFIISGSLSIAAGIRTTKGLVRGSLG
       80           90       100       110       120       130

            140       150       160         170       180
pF1KE1 ANTASSIAGGTGITILIINLKKSLAYIHIHS--CQKFFETKCFMASFSTEIVV--MMLFL
        : .::. ...::   .::   :::.  .:   :. . ...   ...:  . .  :.:.:
CCDS79 MNITSSVLAASGI---LINTF-SLAFYSFHHPYCNYYGNSNNCHGTMSILMGLDGMVLLL
         140          150        160       170       180       190

      190       200       210       220       230       240
pF1KE1 TILGLGSAVSLTICGAGEELKGNKVPEDRVYEELNIYSATYSELEDPGEMSPPIDL
       ..: .  ::::.  :
CCDS79 SVLEFCIAVSLSAFGCKVLCCTPGGVVLILPSHSHMAETASPTPLNEV
             200       210       220       230

>>CCDS31567.1 MS4A3 gene_id:932|Hs108|chr11 (214 aa)
 initn: 187 init1: 152 opt: 250 Z-score: 338.1 bits: 69.8 E(32554): 1.6e-12
Smith-Waterman score: 250; 29.3% identity (61.4% similar) in 184 aa overlap (27-204:23-198)

               10        20        30        40        50        60
pF1KE1 MDTESNRRANLALPQEPSSVPAFEVLEISPQEVSSGRLLKSASSPPLHTWLTVLKKEQEF
                                 : .:.:....      .::  .      : . .
CCDS31     MASHEVDNAELGSASAHGTPGSEAGPEELNTSVYQPIDGSPDYQ------KAKLQV
                   10        20        30        40              50

               70        80        90        100       110
pF1KE1 LGVTQILTAMICLCFGTVVCSVLDISHIEGDIFS-SFKAGYPFWGAIFFSISGMLSIISE
       ::. :::.: . : .:. . :.    :..  .:  .: .:::.:::.::  :: ::...
CCDS31 LGAIQILNAAMILALGVFLGSLQYPYHFQKHFFFFTFYTGYPIWGAVFFCSSGTLSVVAG
               60        70        80        90       100       110

     120       130       140       150       160       170
pF1KE1 RRNATYLVRGSLGANTASSIAGGTGITILIINLKKSLAYIHIHSCQKFFETK--C-FMAS
        . .   ...:.: : ::.  . .: ..: .:.  ..    ..::..  :.   : .:.:
CCDS31 IKPTRTWIQNSFGMNIASATIALVGTAFLSLNIAVNIQ--SLRSCHSSSESPDLCNYMGS
              120       130       140         150       160

        180       190       200         210       220       230
pF1KE1 FSTEIVVMMLFLTILGLGSAVSLTI--CGAGEELKGNKVPEDRVYEELNIYSATYSELED
       .:. .: ..:.::.: :  ..:     :.:
CCDS31 ISNGMVSLLLILTLLELCVTISTIAMWCNANCCNSREEISSPPNSV
      170       180       190       200       210

>>CCDS7988.1 MS4A12 gene_id:54860|Hs108|chr11 (267 aa)
 initn: 197 init1: 143 opt: 239 Z-score: 321.9 bits: 67.1 E(32554): 1.3e-11
Smith-Waterman score: 239; 34.5% identity (69.1% similar) in 110 aa overlap (44-152:74-183)

            20        30        40        50        60        70
pF1KE1 PQEPSSVPAFEVLEISPQEVSSGRLLKSASSPPLHTWLTVLKKEQEFLGVTQILTAMICL
                                     .: . : .  .:.: . ::: ::..... .
CCDS79 QGAQRAQPYGITSPGIFASSQPGQGNIQMINPSVGTAVMNFKEEAKALGVIQIMVGLMHI
            50        60        70        80        90       100

            80         90       100       110       120       130
pF1KE1 CFGTVVCSV-LDISHIEGDIFSSFKAGYPFWGAIFFSISGMLSIISERRNATYLVRGSLG
        :: :.: . ... .. :   ..  .::::::.. : ::: ::. . .. .  ::.::::
CCDS79 GFGIVLCLISFSFREVLGFASTAVIGGYPFWGGLSFIISGSLSVSASKELSRCLVKGSLG
           110       120       130       140       150       160

            140       150       160       170       180       190
pF1KE1 ANTASSIAGGTGITILIINLKKSLAYIHIHSCQKFFETKCFMASFSTEIVVMMLFLTILG
        : .::: .  :. .:....
CCDS79 MNIVSSILAFIGVILLLVDMCINGVAGQDYWAVLSGKGISATLMIFSLLEFFVACATAHF
           170       180       190       200       210       220

>>CCDS44617.1 MS4A15 gene_id:219995|Hs108|chr11 (240 aa)
 initn: 205 init1: 158 opt: 237 Z-score: 319.9 bits: 66.6 E(32554): 1.7e-11
Smith-Waterman score: 237; 31.3% identity (57.9% similar) in 195 aa overlap (13-198:29-214)

                               10        20        30        40
pF1KE1                 MDTESNRRANLALPQEPSSVPAFEVLEISPQEVSSGRLLKSASS
                                   ::    . :..  .:  :  ... :    :..
CCDS44 MSAAPASNGVFVVIPPNNASGLCPPPAILPTSMCQPPGIMQFEEPPLGAQTPR----ATQ
               10        20        30        40        50

            50        60        70        80        90       100
pF1KE1 PP-LHTWLTVLKKEQEFLGVTQILTAMICLCFGTVVCSVLDISHIEGDIFSSFKAGYPFW
       :: :.   : :  : . ::..::: ..: : ::.:.  :   .:. : .:  ...: :::
CCDS44 PPDLRPVETFLTGEPKVLGTVQILIGLIHLGFGSVLLMVRR-GHV-GIFF--IEGGVPFW
         60        70        80        90        100          110

           110       120       130       140       150
pF1KE1 GAIFFSISGMLSIISERRNATYLVRGSLGANTASSIAGGTGITILIINL--------KKS
       :.  : ::: ::. .:. ... :::.:::.:  : .:. .: .::....        .
CCDS44 GGACFIISGSLSVAAEKNHTSCLVRSSLGTNILSVMAAFAGTAILLMDFGVTNRDVDRGY
            120       130       140       150       160       170

         160       170       180       190       200       210
pF1KE1 LAYIHIHSCQKFFETKCFMASFSTEIVVMMLFLTILGLGSAVSLTICGAGEELKGNKVPE
       :: . : .  .:: :  .   :. . .  .    .. : .: :
CCDS44 LAVLTIFTVLEFF-TAVIAMHFGCQAIHAQASAPVIFLPNAFSADFNIPSPAASAPPAYD
            180       190        200       210       220       230

         220       230       240
pF1KE1 DRVYEELNIYSATYSELEDPGEMSPPIDL

CCDS44 NVAYAQGVV
             240

>>CCDS58136.1 MS4A14 gene_id:84689|Hs108|chr11 (712 aa)
 initn: 188 init1: 95 opt: 242 Z-score: 319.7 bits: 68.1 E(32554): 1.7e-11
Smith-Waterman score: 242; 30.7% identity (61.5% similar) in 179 aa overlap (22-199:10-176)

               10        20        30        40        50        60
pF1KE1 MDTESNRRANLALPQEPSSVPAFEVLEISPQEVSSGRLLKSASSPPLHTWLTVLKKEQEF
                            : .:. :.:.:.    .: .    :  . :  :: : .
CCDS58             MESTSQDRRATHVITIKPNET----VLTAFPYRPHSSLLDFLKGEPRV
                           10        20            30        40

               70        80        90       100       110       120
pF1KE1 LGVTQILTAMICLCFGTVVCSVLDISHIEGDIFSSFKAGYPFWGAIFFSISGMLSIISER
       ::.:::: :.: . :::.   .:.   .   .     .:::::::..: ..:.:..  ..
CCDS58 LGATQILLALIIVGFGTIF--ALNYIGFSQRLPLVVLTGYPFWGALIFILTGYLTVTDKK
           50        60          70        80        90       100

              130       140       150       160       170
pF1KE1 RNATYLVRGSLGANTASSIAGGTGITILIINLKKSLAYIHIHSCQKFFETKC-FMASFST
         .  : .:  : :. ::... ::::. :..  ...  : .. :    ::  : :  ..
CCDS58 --SKLLGQGVTGMNVISSLVAITGITFTILSYRHQDKYCQMPS----FEEICVFSRTLFI
              110       120       130       140           150

     180       190       200       210       220       230
pF1KE1 EIVVMMLFLTILGLGSAVSLTICGAGEELKGNKVPEDRVYEELNIYSATYSELEDPGEMS
        :....:...:  :. .:..
CCDS58 GILLILLIISIAELSISVTIASFRSKCWTQSDEVLFFLPSDVTQNSEQPAPEENDQLQFV
        160       170       180       190       200       210

244 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 21:33:10 2016 done: Sun Nov 6 21:33:10 2016
 Total Scan time: 2.070 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]