FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1738, 244 aa
1>>>pF1KE1738 244 - 244 aa - 244 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.7230+/-0.000729; mu= 17.2828+/- 0.044
mean_var=55.3055+/-11.201, 0's: 0 Z-trim(108.0): 28 B-trim: 556 in 1/46
Lambda= 0.172461
statistics sampled from 9886 (9913) to 9886 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.688), E-opt: 0.2 (0.305), width: 16
Scan time: 2.070
The best scores are:                                   opt  bits E(32554)
CCDS7980.1 MS4A2 gene_id:2206|Hs108|chr11     ( 244) 1573 399.0 1.5e-111
CCDS73292.1 MS4A2 gene_id:2206|Hs108|chr11    ( 199)  864 222.6 1.6e-58
CCDS7982.1 MS4A4A gene_id:51338|Hs108|chr11   ( 239)  257  71.6 5.3e-13
CCDS31567.1 MS4A3 gene_id:932|Hs108|chr11     ( 214)  250  69.8 1.6e-12
CCDS7988.1 MS4A12 gene_id:54860|Hs108|chr11   ( 267)  239  67.1 1.3e-11
CCDS44617.1 MS4A15 gene_id:219995|Hs108|chr11 ( 240)  237  66.6 1.7e-11
CCDS58136.1 MS4A14 gene_id:84689|Hs108|chr11  ( 712)  242  68.1 1.7e-11
>>CCDS7980.1 MS4A2 gene_id:2206|Hs108|chr11 (244 aa)
initn: 1573 init1: 1573 opt: 1573 Z-score: 2116.3 bits: 399.0 E(32554): 1.5e-111
Smith-Waterman score: 1573; 100.0% identity (100.0% similar) in 244 aa overlap (1-244:1-244)
               10        20        30        40        50        60
pF1KE1 MDTESNRRANLALPQEPSSVPAFEVLEISPQEVSSGRLLKSASSPPLHTWLTVLKKEQEF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS79 MDTESNRRANLALPQEPSSVPAFEVLEISPQEVSSGRLLKSASSPPLHTWLTVLKKEQEF
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE1 LGVTQILTAMICLCFGTVVCSVLDISHIEGDIFSSFKAGYPFWGAIFFSISGMLSIISER
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS79 LGVTQILTAMICLCFGTVVCSVLDISHIEGDIFSSFKAGYPFWGAIFFSISGMLSIISER
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE1 RNATYLVRGSLGANTASSIAGGTGITILIINLKKSLAYIHIHSCQKFFETKCFMASFSTE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS79 RNATYLVRGSLGANTASSIAGGTGITILIINLKKSLAYIHIHSCQKFFETKCFMASFSTE
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KE1 IVVMMLFLTILGLGSAVSLTICGAGEELKGNKVPEDRVYEELNIYSATYSELEDPGEMSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS79 IVVMMLFLTILGLGSAVSLTICGAGEELKGNKVPEDRVYEELNIYSATYSELEDPGEMSP
              190       200       210       220       230       240
pF1KE1 PIDL
       ::::
CCDS79 PIDL
>>CCDS73292.1 MS4A2 gene_id:2206|Hs108|chr11 (199 aa)
initn: 880 init1: 864 opt: 864 Z-score: 1164.2 bits: 222.6 E(32554): 1.6e-58
Smith-Waterman score: 1162; 81.6% identity (81.6% similar) in 244 aa overlap (1-244:1-199)
               10        20        30        40        50        60
pF1KE1 MDTESNRRANLALPQEPSSVPAFEVLEISPQEVSSGRLLKSASSPPLHTWLTVLKKEQEF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 MDTESNRRANLALPQEPSSVPAFEVLEISPQEVSSGRLLKSASSPPLHTWLTVLKKEQEF
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE1 LGVTQILTAMICLCFGTVVCSVLDISHIEGDIFSSFKAGYPFWGAIFFSISGMLSIISER
       ::                                             :::::::::::::
CCDS73 LG---------------------------------------------FSISGMLSIISER
                                                            70
              130       140       150       160       170       180
pF1KE1 RNATYLVRGSLGANTASSIAGGTGITILIINLKKSLAYIHIHSCQKFFETKCFMASFSTE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 RNATYLVRGSLGANTASSIAGGTGITILIINLKKSLAYIHIHSCQKFFETKCFMASFSTE
          80        90       100       110       120       130
              190       200       210       220       230       240
pF1KE1 IVVMMLFLTILGLGSAVSLTICGAGEELKGNKVPEDRVYEELNIYSATYSELEDPGEMSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 IVVMMLFLTILGLGSAVSLTICGAGEELKGNKVPEDRVYEELNIYSATYSELEDPGEMSP
         140       150       160       170       180       190
pF1KE1 PIDL
       ::::
CCDS73 PIDL
>>CCDS7982.1 MS4A4A gene_id:51338|Hs108|chr11 (239 aa)
initn: 217 init1: 156 opt: 257 Z-score: 346.8 bits: 71.6 E(32554): 5.3e-13
Smith-Waterman score: 257; 37.0% identity (62.4% similar) in 165 aa overlap (48-203:49-206)
20 30 40 50 60 70
pF1KE1 SSVPAFEVLEISPQEVSSGRLLKSASSPPLHTWLTV----LKKEQEFLGVTQILTAMICL
: : . :: : . :::.:::::.. :
CCDS79 AMTTMQGMEQAMPGAGPGVPQLGNMAVIHSHLWKGLQEKFLKGEPKVLGVVQILTALMSL
20 30 40 50 60 70
80 90 100 110 120 130
pF1KE1 CFG-TVVCSVLDISHIEGDIFSSFKAGYPFWGAIFFSISGMLSIISERRNATYLVRGSLG
.: :..: . :. :. : :: .::...: ::: ::: . :.. :::::::
CCDS79 SMGITMMCMA---SNTYGSNPISVYIGYTIWGSVMFIISGSLSIAAGIRTTKGLVRGSLG
80 90 100 110 120 130
140 150 160 170 180
pF1KE1 ANTASSIAGGTGITILIINLKKSLAYIHIHS--CQKFFETKCFMASFSTEIVV--MMLFL
: .::. ...:: .:: :::. .: :. . ... ...: . . :.:.:
CCDS79 MNITSSVLAASGI---LINTF-SLAFYSFHHPYCNYYGNSNNCHGTMSILMGLDGMVLLL
140 150 160 170 180 190
190 200 210 220 230 240
pF1KE1 TILGLGSAVSLTICGAGEELKGNKVPEDRVYEELNIYSATYSELEDPGEMSPPIDL
..: . ::::. :
CCDS79 SVLEFCIAVSLSAFGCKVLCCTPGGVVLILPSHSHMAETASPTPLNEV
200 210 220 230
>>CCDS31567.1 MS4A3 gene_id:932|Hs108|chr11 (214 aa)
initn: 187 init1: 152 opt: 250 Z-score: 338.1 bits: 69.8 E(32554): 1.6e-12
Smith-Waterman score: 250; 29.3% identity (61.4% similar) in 184 aa overlap (27-204:23-198)
10 20 30 40 50 60
pF1KE1 MDTESNRRANLALPQEPSSVPAFEVLEISPQEVSSGRLLKSASSPPLHTWLTVLKKEQEF
: .:.:.... .:: . : . .
CCDS31 MASHEVDNAELGSASAHGTPGSEAGPEELNTSVYQPIDGSPDYQ------KAKLQV
10 20 30 40 50
70 80 90 100 110
pF1KE1 LGVTQILTAMICLCFGTVVCSVLDISHIEGDIFS-SFKAGYPFWGAIFFSISGMLSIISE
::. :::.: . : .:. . :. :.. .: .: .:::.:::.:: :: ::...
CCDS31 LGAIQILNAAMILALGVFLGSLQYPYHFQKHFFFFTFYTGYPIWGAVFFCSSGTLSVVAG
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE1 RRNATYLVRGSLGANTASSIAGGTGITILIINLKKSLAYIHIHSCQKFFETK--C-FMAS
. . ...:.: : ::. . .: ..: .:. .. ..::.. :. : .:.:
CCDS31 IKPTRTWIQNSFGMNIASATIALVGTAFLSLNIAVNIQ--SLRSCHSSSESPDLCNYMGS
120 130 140 150 160
180 190 200 210 220 230
pF1KE1 FSTEIVVMMLFLTILGLGSAVSLTI--CGAGEELKGNKVPEDRVYEELNIYSATYSELED
.:. .: ..:.::.: : ..: :.:
CCDS31 ISNGMVSLLLILTLLELCVTISTIAMWCNANCCNSREEISSPPNSV
170 180 190 200 210
>>CCDS7988.1 MS4A12 gene_id:54860|Hs108|chr11 (267 aa)
initn: 197 init1: 143 opt: 239 Z-score: 321.9 bits: 67.1 E(32554): 1.3e-11
Smith-Waterman score: 239; 34.5% identity (69.1% similar) in 110 aa overlap (44-152:74-183)
20 30 40 50 60 70
pF1KE1 PQEPSSVPAFEVLEISPQEVSSGRLLKSASSPPLHTWLTVLKKEQEFLGVTQILTAMICL
.: . : . .:.: . ::: ::..... .
CCDS79 QGAQRAQPYGITSPGIFASSQPGQGNIQMINPSVGTAVMNFKEEAKALGVIQIMVGLMHI
50 60 70 80 90 100
80 90 100 110 120 130
pF1KE1 CFGTVVCSV-LDISHIEGDIFSSFKAGYPFWGAIFFSISGMLSIISERRNATYLVRGSLG
:: :.: . ... .. : .. .::::::.. : ::: ::. . .. . ::.::::
CCDS79 GFGIVLCLISFSFREVLGFASTAVIGGYPFWGGLSFIISGSLSVSASKELSRCLVKGSLG
110 120 130 140 150 160
140 150 160 170 180 190
pF1KE1 ANTASSIAGGTGITILIINLKKSLAYIHIHSCQKFFETKCFMASFSTEIVVMMLFLTILG
: .::: . :. .:....
CCDS79 MNIVSSILAFIGVILLLVDMCINGVAGQDYWAVLSGKGISATLMIFSLLEFFVACATAHF
170 180 190 200 210 220
>>CCDS44617.1 MS4A15 gene_id:219995|Hs108|chr11 (240 aa)
initn: 205 init1: 158 opt: 237 Z-score: 319.9 bits: 66.6 E(32554): 1.7e-11
Smith-Waterman score: 237; 31.3% identity (57.9% similar) in 195 aa overlap (13-198:29-214)
10 20 30 40
pF1KE1 MDTESNRRANLALPQEPSSVPAFEVLEISPQEVSSGRLLKSASS
:: . :.. .: : ... : :..
CCDS44 MSAAPASNGVFVVIPPNNASGLCPPPAILPTSMCQPPGIMQFEEPPLGAQTPR----ATQ
10 20 30 40 50
50 60 70 80 90 100
pF1KE1 PP-LHTWLTVLKKEQEFLGVTQILTAMICLCFGTVVCSVLDISHIEGDIFSSFKAGYPFW
:: :. : : : . ::..::: ..: : ::.:. : .:. : .: ...: :::
CCDS44 PPDLRPVETFLTGEPKVLGTVQILIGLIHLGFGSVLLMVRR-GHV-GIFF--IEGGVPFW
60 70 80 90 100 110
110 120 130 140 150
pF1KE1 GAIFFSISGMLSIISERRNATYLVRGSLGANTASSIAGGTGITILIINL--------KKS
:. : ::: ::. .:. ... :::.:::.: : .:. .: .::.... .
CCDS44 GGACFIISGSLSVAAEKNHTSCLVRSSLGTNILSVMAAFAGTAILLMDFGVTNRDVDRGY
120 130 140 150 160 170
160 170 180 190 200 210
pF1KE1 LAYIHIHSCQKFFETKCFMASFSTEIVVMMLFLTILGLGSAVSLTICGAGEELKGNKVPE
:: . : . .:: : . :. . . . .. : .: :
CCDS44 LAVLTIFTVLEFF-TAVIAMHFGCQAIHAQASAPVIFLPNAFSADFNIPSPAASAPPAYD
180 190 200 210 220 230
220 230 240
pF1KE1 DRVYEELNIYSATYSELEDPGEMSPPIDL
CCDS44 NVAYAQGVV
240
>>CCDS58136.1 MS4A14 gene_id:84689|Hs108|chr11 (712 aa)
initn: 188 init1: 95 opt: 242 Z-score: 319.7 bits: 68.1 E(32554): 1.7e-11
Smith-Waterman score: 242; 30.7% identity (61.5% similar) in 179 aa overlap (22-199:10-176)
10 20 30 40 50 60
pF1KE1 MDTESNRRANLALPQEPSSVPAFEVLEISPQEVSSGRLLKSASSPPLHTWLTVLKKEQEF
: .:. :.:.:. .: . : . : :: : .
CCDS58 MESTSQDRRATHVITIKPNET----VLTAFPYRPHSSLLDFLKGEPRV
10 20 30 40
70 80 90 100 110 120
pF1KE1 LGVTQILTAMICLCFGTVVCSVLDISHIEGDIFSSFKAGYPFWGAIFFSISGMLSIISER
::.:::: :.: . :::. .:. . . .:::::::..: ..:.:.. ...
CCDS58 LGATQILLALIIVGFGTIF--ALNYIGFSQRLPLVVLTGYPFWGALIFILTGYLTVTDKK
50 60 70 80 90 100
130 140 150 160 170
pF1KE1 RNATYLVRGSLGANTASSIAGGTGITILIINLKKSLAYIHIHSCQKFFETKC-FMASFST
. : .: : :. ::... ::::. :.. ... : .. : :: : : ..
CCDS58 --SKLLGQGVTGMNVISSLVAITGITFTILSYRHQDKYCQMPS----FEEICVFSRTLFI
110 120 130 140 150
180 190 200 210 220 230
pF1KE1 EIVVMMLFLTILGLGSAVSLTICGAGEELKGNKVPEDRVYEELNIYSATYSELEDPGEMS
:....:...: :. .:..
CCDS58 GILLILLIISIAELSISVTIASFRSKCWTQSDEVLFFLPSDVTQNSEQPAPEENDQLQFV
160 170 180 190 200 210
244 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 21:33:10 2016 done: Sun Nov 6 21:33:10 2016
Total Scan time: 2.070 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]