FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE2150, 237 aa
1>>>pF1KE2150 237 - 237 aa - 237 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.0434+/-0.000775; mu= 16.4970+/- 0.047
mean_var=72.9923+/-14.512, 0's: 0 Z-trim(108.6): 128 B-trim: 6 in 1/49
Lambda= 0.150119
statistics sampled from 10216 (10349) to 10216 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.688), E-opt: 0.2 (0.318), width: 16
Scan time: 2.200
The best scores are: opt bits E(32554)
CCDS8590.1 CLEC4A gene_id:50856|Hs108|chr12 ( 237) 1641 364.1 4.5e-101
CCDS41745.1 CLEC4A gene_id:50856|Hs108|chr12 ( 198) 1233 275.7 1.6e-74
CCDS8591.1 CLEC4A gene_id:50856|Hs108|chr12 ( 165) 1024 230.4 5.7e-61
CCDS8592.1 CLEC4A gene_id:50856|Hs108|chr12 ( 204) 1019 229.4 1.4e-60
CCDS8583.1 CLEC4C gene_id:170482|Hs108|chr12 ( 213) 720 164.6 4.6e-41
CCDS8584.1 CLEC4C gene_id:170482|Hs108|chr12 ( 182) 700 160.2 8.2e-40
CCDS31739.1 CLEC6A gene_id:93978|Hs108|chr12 ( 209) 613 141.4 4.3e-34
CCDS8593.1 CLEC4D gene_id:338339|Hs108|chr12 ( 215) 539 125.4 2.9e-29
CCDS8594.1 CLEC4E gene_id:26253|Hs108|chr12 ( 219) 453 106.8 1.2e-23
CCDS45597.1 CLEC10A gene_id:10462|Hs108|chr17 ( 292) 340 82.4 3.5e-16
CCDS82049.1 CLEC10A gene_id:10462|Hs108|chr17 ( 289) 334 81.1 8.5e-16
CCDS11087.1 CLEC10A gene_id:10462|Hs108|chr17 ( 316) 334 81.2 9.1e-16
CCDS59347.1 CLEC4M gene_id:10332|Hs108|chr19 ( 263) 318 77.6 8.7e-15
CCDS12187.1 CLEC4M gene_id:10332|Hs108|chr19 ( 399) 320 78.2 8.8e-15
CCDS56087.1 CLEC17A gene_id:388512|Hs108|chr19 ( 378) 300 73.9 1.7e-13
CCDS56017.1 ASGR1 gene_id:432|Hs108|chr17 ( 252) 297 73.1 2e-13
CCDS11088.1 ASGR2 gene_id:433|Hs108|chr17 ( 287) 297 73.1 2.2e-13
CCDS11089.1 ASGR1 gene_id:432|Hs108|chr17 ( 291) 297 73.1 2.2e-13
CCDS45598.1 ASGR2 gene_id:433|Hs108|chr17 ( 292) 297 73.1 2.2e-13
CCDS32544.1 ASGR2 gene_id:433|Hs108|chr17 ( 311) 297 73.1 2.3e-13
CCDS59345.1 CD209 gene_id:30835|Hs108|chr19 ( 243) 291 71.8 4.7e-13
CCDS45949.1 CD209 gene_id:30835|Hs108|chr19 ( 360) 283 70.2 2.1e-12
CCDS45950.1 CD209 gene_id:30835|Hs108|chr19 ( 380) 283 70.2 2.2e-12
CCDS59344.1 CD209 gene_id:30835|Hs108|chr19 ( 268) 281 69.6 2.3e-12
CCDS12186.1 CD209 gene_id:30835|Hs108|chr19 ( 404) 283 70.2 2.3e-12
CCDS45952.1 CD209 gene_id:30835|Hs108|chr19 ( 312) 281 69.7 2.6e-12
CCDS74520.1 CD207 gene_id:50489|Hs108|chr2 ( 328) 277 68.8 4.8e-12
CCDS32782.1 COLEC12 gene_id:81035|Hs108|chr18 ( 742) 281 70.0 4.9e-12
CCDS11634.1 MRC2 gene_id:9902|Hs108|chr17 (1479) 268 67.4 5.9e-11
>>CCDS8590.1 CLEC4A gene_id:50856|Hs108|chr12 (237 aa)
initn: 1641 init1: 1641 opt: 1641 Z-score: 1928.1 bits: 364.1 E(32554): 4.5e-101
Smith-Waterman score: 1641; 100.0% identity (100.0% similar) in 237 aa overlap (1-237:1-237)
10 20 30 40 50 60
pF1KE2 MTSEITYAEVRFKNEFKSSGINTASSAASKERTAPHKSNTGFPKLLCASLLIFFLLLAIS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 MTSEITYAEVRFKNEFKSSGINTASSAASKERTAPHKSNTGFPKLLCASLLIFFLLLAIS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE2 FFIAFVIFFQKYSQLLEKKTTKELVHTTLECVKKNMPVEETAWSCCPKNWKSFSSNCYFI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 FFIAFVIFFQKYSQLLEKKTTKELVHTTLECVKKNMPVEETAWSCCPKNWKSFSSNCYFI
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE2 STESASWQDSEKDCARMEAHLLVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 STESASWQDSEKDCARMEAHLLVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVD
130 140 150 160 170 180
190 200 210 220 230
pF1KE2 QTPYNESSTFWHPREPSDPNERCVVLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 QTPYNESSTFWHPREPSDPNERCVVLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL
190 200 210 220 230
>>CCDS41745.1 CLEC4A gene_id:50856|Hs108|chr12 (198 aa)
initn: 1379 init1: 1233 opt: 1233 Z-score: 1451.6 bits: 275.7 E(32554): 1.6e-74
Smith-Waterman score: 1305; 83.1% identity (83.5% similar) in 237 aa overlap (1-237:1-198)
10 20 30 40 50 60
pF1KE2 MTSEITYAEVRFKNEFKSSGINTASSAASKERTAPHKSNTGFPKLLCASLLIFFLLLAIS
:::::::::::::::::::::::::::
CCDS41 MTSEITYAEVRFKNEFKSSGINTASSA---------------------------------
10 20
70 80 90 100 110 120
pF1KE2 FFIAFVIFFQKYSQLLEKKTTKELVHTTLECVKKNMPVEETAWSCCPKNWKSFSSNCYFI
.:::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 ------VFFQKYSQLLEKKTTKELVHTTLECVKKNMPVEETAWSCCPKNWKSFSSNCYFI
30 40 50 60 70 80
130 140 150 160 170 180
pF1KE2 STESASWQDSEKDCARMEAHLLVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 STESASWQDSEKDCARMEAHLLVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVD
90 100 110 120 130 140
190 200 210 220 230
pF1KE2 QTPYNESSTFWHPREPSDPNERCVVLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 QTPYNESSTFWHPREPSDPNERCVVLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL
150 160 170 180 190
>>CCDS8591.1 CLEC4A gene_id:50856|Hs108|chr12 (165 aa)
initn: 1024 init1: 1024 opt: 1024 Z-score: 1208.0 bits: 230.4 E(32554): 5.7e-61
Smith-Waterman score: 1030; 69.6% identity (69.6% similar) in 237 aa overlap (1-237:1-165)
10 20 30 40 50 60
pF1KE2 MTSEITYAEVRFKNEFKSSGINTASSAASKERTAPHKSNTGFPKLLCASLLIFFLLLAIS
:::::::::::::::::::::::::::
CCDS85 MTSEITYAEVRFKNEFKSSGINTASSA---------------------------------
10 20
70 80 90 100 110 120
pF1KE2 FFIAFVIFFQKYSQLLEKKTTKELVHTTLECVKKNMPVEETAWSCCPKNWKSFSSNCYFI
:::::::::::::::::::::
CCDS85 ---------------------------------------ETAWSCCPKNWKSFSSNCYFI
30 40
130 140 150 160 170 180
pF1KE2 STESASWQDSEKDCARMEAHLLVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 STESASWQDSEKDCARMEAHLLVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVD
50 60 70 80 90 100
190 200 210 220 230
pF1KE2 QTPYNESSTFWHPREPSDPNERCVVLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 QTPYNESSTFWHPREPSDPNERCVVLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL
110 120 130 140 150 160
>>CCDS8592.1 CLEC4A gene_id:50856|Hs108|chr12 (204 aa)
initn: 1412 init1: 1019 opt: 1019 Z-score: 1200.9 bits: 229.4 E(32554): 1.4e-60
Smith-Waterman score: 1350; 85.7% identity (86.1% similar) in 237 aa overlap (1-237:1-204)
10 20 30 40 50 60
pF1KE2 MTSEITYAEVRFKNEFKSSGINTASSAASKERTAPHKSNTGFPKLLCASLLIFFLLLAIS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 MTSEITYAEVRFKNEFKSSGINTASSAASKERTAPHKSNTGFPKLLCASLLIFFLLLAIS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE2 FFIAFVIFFQKYSQLLEKKTTKELVHTTLECVKKNMPVEETAWSCCPKNWKSFSSNCYFI
:::::: .::::::::::::::::::::
CCDS85 FFIAFV---------------------------------KTAWSCCPKNWKSFSSNCYFI
70 80
130 140 150 160 170 180
pF1KE2 STESASWQDSEKDCARMEAHLLVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 STESASWQDSEKDCARMEAHLLVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVD
90 100 110 120 130 140
190 200 210 220 230
pF1KE2 QTPYNESSTFWHPREPSDPNERCVVLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 QTPYNESSTFWHPREPSDPNERCVVLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL
150 160 170 180 190 200
>>CCDS8583.1 CLEC4C gene_id:170482|Hs108|chr12 (213 aa)
initn: 663 init1: 550 opt: 720 Z-score: 850.7 bits: 164.6 E(32554): 4.6e-41
Smith-Waterman score: 720; 50.8% identity (76.4% similar) in 191 aa overlap (51-237:26-213)
30 40 50 60 70
pF1KE2 INTASSAASKERTAPHKSNTGFPKLLCASLLIFFLLLAISFFIAFVI----FFQKYSQLL
.. .:::.. : .. :. ...: . :
CCDS85 MVPEEEPQDREKGLWWFQLKVWSMAVVSILLLSVCFTVSSVVPHNFMYSKTVKRL
10 20 30 40 50
80 90 100 110 120 130
pF1KE2 EKKTTKELVHTTLECVKKNMPVEETAWSCCPKNWKSFSSNCYFISTESASWQDSEKDCAR
: . : .: :: .. .:. ::::: : ::.:.:::::: :: :.:.:.
CCDS85 SKLREYQQYHPSLTCVMEGKDIED--WSCCPTPWTSFQSSCYFISTGMQSWTKSQKNCSV
60 70 80 90 100 110
140 150 160 170 180 190
pF1KE2 MEAHLLVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVDQTPYNESSTFWHPREP
: : :.::::.::::::.:::...:.::.::::: :.:::::::::::::. :::: ::
CCDS85 MGADLVVINTREEQDFIIQNLKRNSSYFLGLSDPGGRRHWQWVDQTPYNENVTFWHSGEP
120 130 140 150 160 170
200 210 220 230
pF1KE2 SDPNERCVVLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL
.. .:::...:::.: ..:::::..: ::.:.:.: ::..
CCDS85 NNLDERCAIINFRSS-EEWGWNDIHCHVPQKSICKMKKIYI
180 190 200 210
>>CCDS8584.1 CLEC4C gene_id:170482|Hs108|chr12 (182 aa)
initn: 663 init1: 550 opt: 700 Z-score: 828.2 bits: 160.2 E(32554): 8.2e-40
Smith-Waterman score: 700; 54.4% identity (78.1% similar) in 169 aa overlap (69-237:17-182)
40 50 60 70 80 90
pF1KE2 NTGFPKLLCASLLIFFLLLAISFFIAFVIFFQKYSQLLEKKTTKELVHTTLECVKKNMPV
..: . : : . : .: :: .. .
CCDS85 MVPEEEPQDRVPHNFMYSKTVKRLSKLREYQQYHPSLTCVMEGKDI
10 20 30 40
100 110 120 130 140 150
pF1KE2 EETAWSCCPKNWKSFSSNCYFISTESASWQDSEKDCARMEAHLLVINTQEEQDFIFQNLQ
:. ::::: : ::.:.:::::: :: :.:.:. : : :.::::.::::::.:::.
CCDS85 ED--WSCCPTPWTSFQSSCYFISTGMQSWTKSQKNCSVMGADLVVINTREEQDFIIQNLK
50 60 70 80 90 100
160 170 180 190 200 210
pF1KE2 EESAYFVGLSDPEGQRHWQWVDQTPYNESSTFWHPREPSDPNERCVVLNFRKSPKRWGWN
..:.::.::::: :.:::::::::::::. :::: ::.. .:::...:::.: ..::::
CCDS85 RNSSYFLGLSDPGGRRHWQWVDQTPYNENVTFWHSGEPNNLDERCAIINFRSS-EEWGWN
110 120 130 140 150 160
220 230
pF1KE2 DVNCLGPQRSVCEMMKIHL
:..: ::.:.:.: ::..
CCDS85 DIHCHVPQKSICKMKKIYI
170 180
>>CCDS31739.1 CLEC6A gene_id:93978|Hs108|chr12 (209 aa)
initn: 590 init1: 470 opt: 613 Z-score: 725.6 bits: 141.4 E(32554): 4.3e-34
Smith-Waterman score: 613; 44.8% identity (72.7% similar) in 183 aa overlap (56-237:30-209)
30 40 50 60 70 80
pF1KE2 SAASKERTAPHKSNTGFPKLLCASLLIFFLLLAISFFIAFVIFFQ-KYSQLLEKKTTKEL
::. :... :. .. :.. .. . .
CCDS31 MMQEQQPQSTEKRGWLSLRLWSVAGISIALLSACFIVSCVVTYHFTYGETGKRLSELHS
10 20 30 40 50
90 100 110 120 130 140
pF1KE2 VHTTLECVKKNMPVEETAWSCCPKNWKSFSSNCYFISTESASWQDSEKDCARMEAHLLVI
:..: : ... : ::.::: .::::.:.:::::.: :. ::..:..: :::.:.
CCDS31 YHSSLTCFSEGTKV--PAWGCCPASWKSFGSSCYFISSEEKVWSKSEQNCVEMGAHLVVF
60 70 80 90 100 110
150 160 170 180 190 200
pF1KE2 NTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVDQTPYNESSTFWHPREPSDPNERCV
::. ::.:: :.:.: .::.:::::.:. .:::.:.:::... ::: ::. :.:.
CCDS31 NTEAEQNFIVQQLNESFSYFLGLSDPQGNNNWQWIDKTPYEKNVRFWHLGEPNHSAEQCA
120 130 140 150 160 170
210 220 230
pF1KE2 VLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL
. : : : :::::: : . :.::: ::.:
CCDS31 SIVFWK-PTGWGWNDVICETRRNSICEMNKIYL
180 190 200
>>CCDS8593.1 CLEC4D gene_id:338339|Hs108|chr12 (215 aa)
initn: 447 init1: 381 opt: 539 Z-score: 638.8 bits: 125.4 E(32554): 2.9e-29
Smith-Waterman score: 539; 36.7% identity (72.4% similar) in 196 aa overlap (43-232:16-209)
20 30 40 50 60 70
pF1KE2 KNEFKSSGINTASSAASKERTAPHKSNTGFPKLLCASL-LIFFLLLAISFFIAFVIFFQK
:.:. . . ..:.:::.. :. . .. ..
CCDS85 MGLEKPQSKLEGGMHPQLIPSVIAVVFILLLSVCFIASCLVTHHN
10 20 30 40
80 90 100 110 120
pF1KE2 YSQLLEKKTTKELVH-TTLECVKKNMPV---EETAWSCCPKNWKSFSSNCYFISTESASW
.:. . ...: : . :.:.:.. . : ..:.::: .:..:.::::: :.. .:
CCDS85 FSRCKRGTGVHKLEHHAKLKCIKEKSELKSAEGSTWNCCPIDWRAFQSNCYFPLTDNKTW
50 60 70 80 90 100
130 140 150 160 170 180
pF1KE2 QDSEKDCARMEAHLLVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVDQTPYNES
.::..:. : :::..:.:. ::.::.: :... .::.:: : ... .:.::::::.:
CCDS85 AESERNCSGMGAHLMTISTEAEQNFIIQFLDRRLSYFLGLRDENAKGQWRWVDQTPFNPR
110 120 130 140 150 160
190 200 210 220 230
pF1KE2 STFWHPREPSDPN-ERCVVLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL
.::: ::.. . : :::: . .. .:.:::: : .:..
CCDS85 RVFWHKNEPDNSQGENCVVLVYNQD--KWAWNDVPCNFEASRICKIPGTTLN
170 180 190 200 210
>>CCDS8594.1 CLEC4E gene_id:26253|Hs108|chr12 (219 aa)
initn: 413 init1: 354 opt: 453 Z-score: 538.0 bits: 106.8 E(32554): 1.2e-23
Smith-Waterman score: 453; 38.0% identity (63.1% similar) in 187 aa overlap (52-236:29-211)
30 40 50 60 70 80
pF1KE2 NTASSAASKERTAPHKSNTGFPKLLCASLLIFFLLLAISFFIAFVIFFQKYSQLLEKKTT
: .:.:. :. :. :. .. :::
CCDS85 MNSSKSSETQCTERGCFSSQMFLWTVAGIPILFLSACFITRCVVTFRIFQTCDEKKFQ
10 20 30 40 50
90 100 110 120 130 140
pF1KE2 KELVHTTLECVKKNMPVEETAWSCCPKNWKSFSSNCYFISTESASWQDSEKDCARMEAHL
: : : . . .. .::: ::. :.:.:::.::.. :: : :.:. : :::
CCDS85 LPENFTELSCYNYGSG---SVKNCCPLNWEYFQSSCYFFSTDTISWALSLKNCSAMGAHL
60 70 80 90 100 110
150 160 170 180 190 200
pF1KE2 LVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVDQTPYNESSTFWHPREPSDPN-
.:::.::::.:. . . .:.:::: . .::::: :: ..: .:: ::..
CCDS85 VVINSQEEQEFLSYKKPKMREFFIGLSDQVVEGQWQWVDGTPLTKSLSFWDVGEPNNIAT
120 130 140 150 160 170
210 220 230
pF1KE2 -ERCVVLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL
: :... ..: : .::::.:. .:::. :.
CCDS85 LEDCATMRDSSNP-RQNWNDVTCFLNYFRICEMVGINPLNKGKSL
180 190 200 210
>>CCDS45597.1 CLEC10A gene_id:10462|Hs108|chr17 (292 aa)
initn: 263 init1: 190 opt: 340 Z-score: 404.1 bits: 82.4 E(32554): 3.5e-16
Smith-Waterman score: 340; 36.6% identity (60.5% similar) in 172 aa overlap (72-231:119-281)
50 60 70 80 90
pF1KE2 FPKLLCASLLIFFLLLAISFFIAFVIFFQKYSQLLEK--KTTKELVHTTLECVKKNMPVE
.:..: . . ...: . : . . : :
CCDS45 ALTSQGSSLEETIASLKAEVEGFKQERQAVHSEMLLRVQQLVQDLKKLTCQVATLNNNGE
90 100 110 120 130 140
100 110 120 130 140 150
pF1KE2 E--TAWSCCPKNWKSFSSNCYFISTESASWQDSEKDCARMEAHLLVINTQEEQDFIFQNL
: : .::: :: ...::..: . :: ..:: : .:::.:::..:::.:. . :
CCDS45 EASTEGTCCPVNWVEHQDSCYWFSHSGMSWAEAEKYCQLKNAHLVVINSREEQNFVQKYL
150 160 170 180 190 200
160 170 180 190 200
pF1KE2 QEESAY-FVGLSDPEGQRHWQWVDQTPYNESSTFWHPREPSD-------PNERCVVLNFR
::: ..::::::: :.::: : : . :.: .:.: .: :. .:.
CCDS45 G--SAYTWMGLSDPEGA--WKWVDGTDYATGFQNWKPGQPDDWQGHGLGGGEDCA--HFH
210 220 230 240 250 260
210 220 230
pF1KE2 KSPKRWGWNDVNCLGPQRSVCEMMKIHL
. . ::: : : . :::
CCDS45 PDGR---WNDDVCQRPYHWVCEAGLGQTSQESH
270 280 290
237 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 15:47:31 2016 done: Mon Nov 7 15:47:32 2016
Total Scan time: 2.200 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]