FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE6253, 214 aa
1>>>pF1KE6253 214 - 214 aa - 214 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.7186+/-0.000717; mu= 17.2735+/- 0.043
mean_var=62.4192+/-12.491, 0's: 0 Z-trim(108.3): 31 B-trim: 0 in 0/52
Lambda= 0.162336
statistics sampled from 10073 (10100) to 10073 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.689), E-opt: 0.2 (0.31), width: 16
Scan time: 2.200
The best scores are: opt bits E(32554)
CCDS31567.1 MS4A3 gene_id:932|Hs108|chr11 ( 214) 1418 340.1 6.1e-94
CCDS31568.1 MS4A3 gene_id:932|Hs108|chr11 ( 168) 757 185.2 2e-47
CCDS41651.1 MS4A3 gene_id:932|Hs108|chr11 ( 91) 587 145.2 1.2e-35
CCDS7982.1 MS4A4A gene_id:51338|Hs108|chr11 ( 239) 315 81.8 3.8e-16
CCDS7987.1 MS4A5 gene_id:64232|Hs108|chr11 ( 200) 260 68.9 2.5e-12
CCDS7980.1 MS4A2 gene_id:2206|Hs108|chr11 ( 244) 250 66.6 1.5e-11
CCDS7990.1 MS4A8 gene_id:83661|Hs108|chr11 ( 250) 245 65.5 3.4e-11
>>CCDS31567.1 MS4A3 gene_id:932|Hs108|chr11 (214 aa)
initn: 1418 init1: 1418 opt: 1418 Z-score: 1800.0 bits: 340.1 E(32554): 6.1e-94
Smith-Waterman score: 1418; 100.0% identity (100.0% similar) in 214 aa overlap (1-214:1-214)
               10        20        30        40        50        60
pF1KE6 MASHEVDNAELGSASAHGTPGSEAGPEELNTSVYQPIDGSPDYQKAKLQVLGAIQILNAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 MASHEVDNAELGSASAHGTPGSEAGPEELNTSVYQPIDGSPDYQKAKLQVLGAIQILNAA
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE6 MILALGVFLGSLQYPYHFQKHFFFFTFYTGYPIWGAVFFCSSGTLSVVAGIKPTRTWIQN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 MILALGVFLGSLQYPYHFQKHFFFFTFYTGYPIWGAVFFCSSGTLSVVAGIKPTRTWIQN
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE6 SFGMNIASATIALVGTAFLSLNIAVNIQSLRSCHSSSESPDLCNYMGSISNGMVSLLLIL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 SFGMNIASATIALVGTAFLSLNIAVNIQSLRSCHSSSESPDLCNYMGSISNGMVSLLLIL
              130       140       150       160       170       180
              190       200       210
pF1KE6 TLLELCVTISTIAMWCNANCCNSREEISSPPNSV
       ::::::::::::::::::::::::::::::::::
CCDS31 TLLELCVTISTIAMWCNANCCNSREEISSPPNSV
              190       200       210
>>CCDS31568.1 MS4A3 gene_id:932|Hs108|chr11 (168 aa)
initn: 1085 init1: 757 opt: 757 Z-score: 964.8 bits: 185.2 E(32554): 2e-47
Smith-Waterman score: 997; 78.5% identity (78.5% similar) in 214 aa overlap (1-214:1-168)
               10        20        30        40        50        60
pF1KE6 MASHEVDNAELGSASAHGTPGSEAGPEELNTSVYQPIDGSPDYQKAKLQVLGAIQILNAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 MASHEVDNAELGSASAHGTPGSEAGPEELNTSVYQPIDGSPDYQKAKLQVLG--------
               10        20        30        40        50
               70        80        90       100       110       120
pF1KE6 MILALGVFLGSLQYPYHFQKHFFFFTFYTGYPIWGAVFFCSSGTLSVVAGIKPTRTWIQN
                                             ::::::::::::::::::::::
CCDS31 --------------------------------------FCSSGTLSVVAGIKPTRTWIQN
                                                   60        70
              130       140       150       160       170       180
pF1KE6 SFGMNIASATIALVGTAFLSLNIAVNIQSLRSCHSSSESPDLCNYMGSISNGMVSLLLIL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 SFGMNIASATIALVGTAFLSLNIAVNIQSLRSCHSSSESPDLCNYMGSISNGMVSLLLIL
           80        90       100       110       120       130
              190       200       210
pF1KE6 TLLELCVTISTIAMWCNANCCNSREEISSPPNSV
       ::::::::::::::::::::::::::::::::::
CCDS31 TLLELCVTISTIAMWCNANCCNSREEISSPPNSV
          140       150       160
>>CCDS41651.1 MS4A3 gene_id:932|Hs108|chr11 (91 aa)
initn: 587 init1: 587 opt: 587 Z-score: 753.3 bits: 145.2 E(32554): 1.2e-35
Smith-Waterman score: 587; 100.0% identity (100.0% similar) in 91 aa overlap (124-214:1-91)
           100       110       120       130       140       150
pF1KE6 WGAVFFCSSGTLSVVAGIKPTRTWIQNSFGMNIASATIALVGTAFLSLNIAVNIQSLRSC
                                     ::::::::::::::::::::::::::::::
CCDS41                               MNIASATIALVGTAFLSLNIAVNIQSLRSC
                                             10        20        30
           160       170       180       190       200       210
pF1KE6 HSSSESPDLCNYMGSISNGMVSLLLILTLLELCVTISTIAMWCNANCCNSREEISSPPNS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 HSSSESPDLCNYMGSISNGMVSLLLILTLLELCVTISTIAMWCNANCCNSREEISSPPNS
               40        50        60        70        80        90
pF1KE6 V
       :
CCDS41 V
>>CCDS7982.1 MS4A4A gene_id:51338|Hs108|chr11 (239 aa)
initn: 349 init1: 172 opt: 315 Z-score: 403.3 bits: 81.8 E(32554): 3.8e-16
Smith-Waterman score: 315; 31.7% identity (64.5% similar) in 186 aa overlap (20-201:31-212)
10 20 30 40
pF1KE6 MASHEVDNAELGSASAHGTPGSEAG-PEELNTSVYQP--IDGSPD-YQK
::. : :. : .: . : . . :
CCDS79 MHQTYSRHCRPEESTFSAAMTTMQGMEQAMPGAGPGVPQLGNMAVIHSHLWKGLQEKFLK
10 20 30 40 50 60
50 60 70 80 90 100
pF1KE6 AKLQVLGAIQILNAAMILALGVFLGSLQYPYHFQKHFFFFTFYTGYPIWGAVFFCSSGTL
.. .:::..:::.: : :..:. . . . .. .. : :: :::.:.: ::.:
CCDS79 GEPKVLGVVQILTALMSLSMGITMMCMASNTYGSNP---ISVYIGYTIWGSVMFIISGSL
70 80 90 100 110
110 120 130 140 150 160
pF1KE6 SVVAGIKPTRTWIQNSFGMNIASATIALVGTAFLSLNIAVNIQSLRSCHSSSESPDLCNY
:..:::. :. ...:.::::.:...: : . ....: :. ..: . :.
CCDS79 SIAAGIRTTKGLVRGSLGMNITSSVLAASGILINTFSLAFYSFHHPYCNYYGNSNN-CHG
120 130 140 150 160 170
170 180 190 200 210
pF1KE6 MGSISNGMVSLLLILTLLELCVTISTIAMWCNANCCNSREEISSPPNSV
:: :. ...:.:..::.:...: :. :.. ::
CCDS79 TMSILMGLDGMVLLLSVLEFCIAVSLSAFGCKVLCCTPGGVVLILPSHSHMAETASPTPL
180 190 200 210 220 230
CCDS79 NEV
>>CCDS7987.1 MS4A5 gene_id:64232|Hs108|chr11 (200 aa)
initn: 198 init1: 123 opt: 260 Z-score: 334.7 bits: 68.9 E(32554): 2.5e-12
Smith-Waterman score: 260; 28.6% identity (60.2% similar) in 206 aa overlap (11-205:1-198)
10 20 30 40 50
pF1KE6 MASHEVDNAELGSASAHGTPGSEAGPEELNTSVYQPIDGSPD-------YQKA---KLQV
. :..:: .: . : :...: :. . : :: :...
CCDS79 MDSSTAH-SPVFLVFPPEITASEYESTELSATTFSTQSPLQKLFARKMKI
10 20 30 40
60 70 80 90 100
pF1KE6 LGAIQILNAAMILALGV-FLGSLQYPYHFQKHFFFFTFYTGYPIWGAVFFCSSGTLSVVA
::.:::: . : ...:: :: .: :: : : : .:::.::.:.: .::.. ...
CCDS79 LGTIQILFGIMTFSFGVIFLFTLLKPYPR----FPFIFLSGYPFWGSVLFINSGAFLIAV
50 60 70 80 90 100
110 120 130 140 150 160
pF1KE6 GIKPTRTWIQNSFGMNIASATIALVGTAFLSLNIAVNIQSLRSCHSSSESPDLCNYMGSI
: :.: : : ::. :: :..: .:.... .. . . : : .. . :. . .
CCDS79 KRKTTETLIILSRIMNFLSALGAIAGIILLTFGFILDQNYI--CGYSHQNSQ-CKAVTVL
110 120 130 140 150 160
170 180 190 200 210
pF1KE6 SNGMVSLLLILTLLELCVTISTIAMWCNANCCNSREEISSPPNSV
:.. :. ....:: ... . :... :. ..
CCDS79 FLGILITLMTFSIIELFISLPFSILGCHSEDCDCEQCC
170 180 190 200
>>CCDS7980.1 MS4A2 gene_id:2206|Hs108|chr11 (244 aa)
initn: 187 init1: 152 opt: 250 Z-score: 320.9 bits: 66.6 E(32554): 1.5e-11
Smith-Waterman score: 250; 29.3% identity (61.4% similar) in 184 aa overlap (23-198:27-204)
10 20 30 40 50
pF1KE6 MASHEVDNAELGSASAHGTPGSEAGPEELNTSVYQPIDGSPDYQ------KAKLQV
: .:.:.... .:: . : . .
CCDS79 MDTESNRRANLALPQEPSSVPAFEVLEISPQEVSSGRLLKSASSPPLHTWLTVLKKEQEF
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE6 LGAIQILNAAMILALGVFLGSLQYPYHFQKHFFFFTFYTGYPIWGAVFFCSSGTLSVVAG
::. :::.: . : .:. . :. :.. .: .: .:::.:::.:: :: ::...
CCDS79 LGVTQILTAMICLCFGTVVCSVLDISHIEGDIFS-SFKAGYPFWGAIFFSISGMLSIISE
70 80 90 100 110
120 130 140 150 160
pF1KE6 IKPTRTWIQNSFGMNIASATIALVGTAFLSLNIAVNIQ--SLRSCHSSSESPDLCNYMGS
. . ...:.: : ::. . .: ..: .:. .. ..::.. :. : .:.:
CCDS79 RRNATYLVRGSLGANTASSIAGGTGITILIINLKKSLAYIHIHSCQKFFETK--C-FMAS
120 130 140 150 160 170
170 180 190 200 210
pF1KE6 ISNGMVSLLLILTLLELCVTISTIAMWCNANCCNSREEISSPPNSV
.:. .: ..:.::.: : ..: :.:
CCDS79 FSTEIVVMMLFLTILGLGSAVSLTI--CGAGEELKGNKVPEDRVYEELNIYSATYSELED
180 190 200 210 220 230
>>CCDS7990.1 MS4A8 gene_id:83661|Hs108|chr11 (250 aa)
initn: 205 init1: 109 opt: 245 Z-score: 314.4 bits: 65.5 E(32554): 3.4e-11
Smith-Waterman score: 247; 28.9% identity (61.2% similar) in 201 aa overlap (17-212:44-223)
10 20 30 40
pF1KE6 MASHEVDNAELGSASAHGTPGSEAGPEELNTSVYQPIDGSPDYQKA
: .::. : : ..: .:.: :::
CCDS79 VLVVAPHNGYPVTPGIMSHVPLYPNSQPQVHLVPGN---PPSLVSNV----NGQP-VQKA
20 30 40 50 60
50 60 70 80 90 100
pF1KE6 --KLQVLGAIQILNAAMILALGVFLGSLQYPYHFQKHFFFFTFYTGYPIWGAVFFCSSGT
. ..::::::. . ..:: ..... . ... ..:: :.:.::...: ::.
CCDS79 LKEGKTLGAIQIIIGLAHIGLGSIMATV-----LVGEYLSISFYGGFPFWGGLWFIISGS
70 80 90 100 110 120
110 120 130 140 150 160
pF1KE6 LSVVAGIKP-TRTWIQNSFGMNIASATIALVGTAFLSLNIAVNIQSLRSCHSSSESPDLC
:::.: .: . ...:.:.::.:: . ::. .. .... : . ::
CCDS79 LSVAAENQPYSYCLLSGSLGLNIVSAICSAVGVILFITDLSIP-------HPYAY-PDYY
130 140 150 160 170
170 180 190 200 210
pF1KE6 NYMGSISNGMV--SLLLILTLLELCVTISTIAMWCNANCCNSREEISSPPNSV
: ... ::. ..::.. :::. .. .. . :. ::.: . ::
CCDS79 PYAWGVNPGMAISGVLLVFCLLEFGIACASSHFGCQLVCCQSSNVSVIYPNIYAANPVIT
180 190 200 210 220 230
CCDS79 PEPVTSPPSYSSEIQANK
240 250
214 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 11:29:32 2016 done: Tue Nov 8 11:29:33 2016
Total Scan time: 2.200 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]