FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE2150, 237 aa
  1>>>pF1KE2150 237 - 237 aa - 237 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.0434+/-0.000775; mu= 16.4970+/- 0.047
 mean_var=72.9923+/-14.512, 0's: 0 Z-trim(108.6): 128 B-trim: 6 in 1/49
 Lambda= 0.150119
 statistics sampled from 10216 (10349) to 10216 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.688), E-opt: 0.2 (0.318), width: 16
 Scan time: 2.200

The best scores are:                                   opt  bits E(32554)
CCDS8590.1 CLEC4A gene_id:50856|Hs108|chr12    ( 237) 1641 364.1 4.5e-101
CCDS41745.1 CLEC4A gene_id:50856|Hs108|chr12   ( 198) 1233 275.7 1.6e-74
CCDS8591.1 CLEC4A gene_id:50856|Hs108|chr12    ( 165) 1024 230.4 5.7e-61
CCDS8592.1 CLEC4A gene_id:50856|Hs108|chr12    ( 204) 1019 229.4 1.4e-60
CCDS8583.1 CLEC4C gene_id:170482|Hs108|chr12   ( 213)  720 164.6 4.6e-41
CCDS8584.1 CLEC4C gene_id:170482|Hs108|chr12   ( 182)  700 160.2 8.2e-40
CCDS31739.1 CLEC6A gene_id:93978|Hs108|chr12   ( 209)  613 141.4 4.3e-34
CCDS8593.1 CLEC4D gene_id:338339|Hs108|chr12   ( 215)  539 125.4 2.9e-29
CCDS8594.1 CLEC4E gene_id:26253|Hs108|chr12    ( 219)  453 106.8 1.2e-23
CCDS45597.1 CLEC10A gene_id:10462|Hs108|chr17  ( 292)  340  82.4 3.5e-16
CCDS82049.1 CLEC10A gene_id:10462|Hs108|chr17  ( 289)  334  81.1 8.5e-16
CCDS11087.1 CLEC10A gene_id:10462|Hs108|chr17  ( 316)  334  81.2 9.1e-16
CCDS59347.1 CLEC4M gene_id:10332|Hs108|chr19   ( 263)  318  77.6 8.7e-15
CCDS12187.1 CLEC4M gene_id:10332|Hs108|chr19   ( 399)  320  78.2 8.8e-15
CCDS56087.1 CLEC17A gene_id:388512|Hs108|chr19 ( 378)  300  73.9 1.7e-13
CCDS56017.1 ASGR1 gene_id:432|Hs108|chr17      ( 252)  297  73.1 2e-13
CCDS11088.1 ASGR2 gene_id:433|Hs108|chr17      ( 287)  297  73.1 2.2e-13
CCDS11089.1 ASGR1 gene_id:432|Hs108|chr17      ( 291)  297  73.1 2.2e-13
CCDS45598.1 ASGR2 gene_id:433|Hs108|chr17      ( 292)  297  73.1 2.2e-13
CCDS32544.1 ASGR2 gene_id:433|Hs108|chr17      ( 311)  297  73.1 2.3e-13
CCDS59345.1 CD209 gene_id:30835|Hs108|chr19    ( 243)  291  71.8 4.7e-13
CCDS45949.1 CD209 gene_id:30835|Hs108|chr19    ( 360)  283  70.2 2.1e-12
CCDS45950.1 CD209 gene_id:30835|Hs108|chr19    ( 380)  283  70.2 2.2e-12
CCDS59344.1 CD209 gene_id:30835|Hs108|chr19    ( 268)  281  69.6 2.3e-12
CCDS12186.1 CD209 gene_id:30835|Hs108|chr19    ( 404)  283  70.2 2.3e-12
CCDS45952.1 CD209 gene_id:30835|Hs108|chr19    ( 312)  281  69.7 2.6e-12
CCDS74520.1 CD207 gene_id:50489|Hs108|chr2     ( 328)  277  68.8 4.8e-12
CCDS32782.1 COLEC12 gene_id:81035|Hs108|chr18  ( 742)  281  70.0 4.9e-12
CCDS11634.1 MRC2 gene_id:9902|Hs108|chr17      (1479)  268  67.4 5.9e-11

>>CCDS8590.1 CLEC4A gene_id:50856|Hs108|chr12 (237 aa)
 initn: 1641 init1: 1641 opt: 1641 Z-score: 1928.1 bits: 364.1 E(32554): 4.5e-101
Smith-Waterman score: 1641; 100.0% identity (100.0% similar) in 237 aa overlap (1-237:1-237)

               10        20        30        40        50        60
pF1KE2 MTSEITYAEVRFKNEFKSSGINTASSAASKERTAPHKSNTGFPKLLCASLLIFFLLLAIS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 MTSEITYAEVRFKNEFKSSGINTASSAASKERTAPHKSNTGFPKLLCASLLIFFLLLAIS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE2 FFIAFVIFFQKYSQLLEKKTTKELVHTTLECVKKNMPVEETAWSCCPKNWKSFSSNCYFI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 FFIAFVIFFQKYSQLLEKKTTKELVHTTLECVKKNMPVEETAWSCCPKNWKSFSSNCYFI
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE2 STESASWQDSEKDCARMEAHLLVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 STESASWQDSEKDCARMEAHLLVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVD
              130       140       150       160       170       180

              190       200       210       220       230
pF1KE2 QTPYNESSTFWHPREPSDPNERCVVLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 QTPYNESSTFWHPREPSDPNERCVVLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL
              190       200       210       220       230

>>CCDS41745.1 CLEC4A gene_id:50856|Hs108|chr12 (198 aa)
 initn: 1379 init1: 1233 opt: 1233 Z-score: 1451.6 bits: 275.7 E(32554): 1.6e-74
Smith-Waterman score: 1305; 83.1% identity (83.5% similar) in 237 aa overlap (1-237:1-198)

               10        20        30        40        50        60
pF1KE2 MTSEITYAEVRFKNEFKSSGINTASSAASKERTAPHKSNTGFPKLLCASLLIFFLLLAIS
       :::::::::::::::::::::::::::
CCDS41 MTSEITYAEVRFKNEFKSSGINTASSA---------------------------------
               10        20

               70        80        90       100       110       120
pF1KE2 FFIAFVIFFQKYSQLLEKKTTKELVHTTLECVKKNMPVEETAWSCCPKNWKSFSSNCYFI
             .:::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 ------VFFQKYSQLLEKKTTKELVHTTLECVKKNMPVEETAWSCCPKNWKSFSSNCYFI
              30        40        50        60        70        80

              130       140       150       160       170       180
pF1KE2 STESASWQDSEKDCARMEAHLLVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 STESASWQDSEKDCARMEAHLLVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVD
              90       100       110       120       130       140

              190       200       210       220       230
pF1KE2 QTPYNESSTFWHPREPSDPNERCVVLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 QTPYNESSTFWHPREPSDPNERCVVLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL
             150       160       170       180       190

>>CCDS8591.1 CLEC4A gene_id:50856|Hs108|chr12 (165 aa)
 initn: 1024 init1: 1024 opt: 1024 Z-score: 1208.0 bits: 230.4 E(32554): 5.7e-61
Smith-Waterman score: 1030; 69.6% identity (69.6% similar) in 237 aa overlap (1-237:1-165)

               10        20        30        40        50        60
pF1KE2 MTSEITYAEVRFKNEFKSSGINTASSAASKERTAPHKSNTGFPKLLCASLLIFFLLLAIS
       :::::::::::::::::::::::::::
CCDS85 MTSEITYAEVRFKNEFKSSGINTASSA---------------------------------
               10        20

               70        80        90       100       110       120
pF1KE2 FFIAFVIFFQKYSQLLEKKTTKELVHTTLECVKKNMPVEETAWSCCPKNWKSFSSNCYFI
                                              :::::::::::::::::::::
CCDS85 ---------------------------------------ETAWSCCPKNWKSFSSNCYFI
                                               30        40

              130       140       150       160       170       180
pF1KE2 STESASWQDSEKDCARMEAHLLVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 STESASWQDSEKDCARMEAHLLVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVD
       50        60        70        80        90       100

              190       200       210       220       230
pF1KE2 QTPYNESSTFWHPREPSDPNERCVVLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85
QTPYNESSTFWHPREPSDPNERCVVLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL 110 120 130 140 150 160 >>CCDS8592.1 CLEC4A gene_id:50856|Hs108|chr12 (204 aa) initn: 1412 init1: 1019 opt: 1019 Z-score: 1200.9 bits: 229.4 E(32554): 1.4e-60 Smith-Waterman score: 1350; 85.7% identity (86.1% similar) in 237 aa overlap (1-237:1-204) 10 20 30 40 50 60 pF1KE2 MTSEITYAEVRFKNEFKSSGINTASSAASKERTAPHKSNTGFPKLLCASLLIFFLLLAIS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS85 MTSEITYAEVRFKNEFKSSGINTASSAASKERTAPHKSNTGFPKLLCASLLIFFLLLAIS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 FFIAFVIFFQKYSQLLEKKTTKELVHTTLECVKKNMPVEETAWSCCPKNWKSFSSNCYFI :::::: .:::::::::::::::::::: CCDS85 FFIAFV---------------------------------KTAWSCCPKNWKSFSSNCYFI 70 80 130 140 150 160 170 180 pF1KE2 STESASWQDSEKDCARMEAHLLVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS85 STESASWQDSEKDCARMEAHLLVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVD 90 100 110 120 130 140 190 200 210 220 230 pF1KE2 QTPYNESSTFWHPREPSDPNERCVVLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS85 QTPYNESSTFWHPREPSDPNERCVVLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL 150 160 170 180 190 200 >>CCDS8583.1 CLEC4C gene_id:170482|Hs108|chr12 (213 aa) initn: 663 init1: 550 opt: 720 Z-score: 850.7 bits: 164.6 E(32554): 4.6e-41 Smith-Waterman score: 720; 50.8% identity (76.4% similar) in 191 aa overlap (51-237:26-213) 30 40 50 60 70 pF1KE2 INTASSAASKERTAPHKSNTGFPKLLCASLLIFFLLLAISFFIAFVI----FFQKYSQLL .. .:::.. : .. :. ...: . : CCDS85 MVPEEEPQDREKGLWWFQLKVWSMAVVSILLLSVCFTVSSVVPHNFMYSKTVKRL 10 20 30 40 50 80 90 100 110 120 130 pF1KE2 EKKTTKELVHTTLECVKKNMPVEETAWSCCPKNWKSFSSNCYFISTESASWQDSEKDCAR : . : .: :: .. .:. ::::: : ::.:.:::::: :: :.:.:. CCDS85 SKLREYQQYHPSLTCVMEGKDIED--WSCCPTPWTSFQSSCYFISTGMQSWTKSQKNCSV 60 70 80 90 100 110 140 150 160 170 180 190 pF1KE2 MEAHLLVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVDQTPYNESSTFWHPREP : : :.::::.::::::.:::...:.::.::::: :.:::::::::::::. 
:::: :: CCDS85 MGADLVVINTREEQDFIIQNLKRNSSYFLGLSDPGGRRHWQWVDQTPYNENVTFWHSGEP 120 130 140 150 160 170 200 210 220 230 pF1KE2 SDPNERCVVLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL .. .:::...:::.: ..:::::..: ::.:.:.: ::.. CCDS85 NNLDERCAIINFRSS-EEWGWNDIHCHVPQKSICKMKKIYI 180 190 200 210 >>CCDS8584.1 CLEC4C gene_id:170482|Hs108|chr12 (182 aa) initn: 663 init1: 550 opt: 700 Z-score: 828.2 bits: 160.2 E(32554): 8.2e-40 Smith-Waterman score: 700; 54.4% identity (78.1% similar) in 169 aa overlap (69-237:17-182) 40 50 60 70 80 90 pF1KE2 NTGFPKLLCASLLIFFLLLAISFFIAFVIFFQKYSQLLEKKTTKELVHTTLECVKKNMPV ..: . : : . : .: :: .. . CCDS85 MVPEEEPQDRVPHNFMYSKTVKRLSKLREYQQYHPSLTCVMEGKDI 10 20 30 40 100 110 120 130 140 150 pF1KE2 EETAWSCCPKNWKSFSSNCYFISTESASWQDSEKDCARMEAHLLVINTQEEQDFIFQNLQ :. ::::: : ::.:.:::::: :: :.:.:. : : :.::::.::::::.:::. CCDS85 ED--WSCCPTPWTSFQSSCYFISTGMQSWTKSQKNCSVMGADLVVINTREEQDFIIQNLK 50 60 70 80 90 100 160 170 180 190 200 210 pF1KE2 EESAYFVGLSDPEGQRHWQWVDQTPYNESSTFWHPREPSDPNERCVVLNFRKSPKRWGWN ..:.::.::::: :.:::::::::::::. :::: ::.. .:::...:::.: ..:::: CCDS85 RNSSYFLGLSDPGGRRHWQWVDQTPYNENVTFWHSGEPNNLDERCAIINFRSS-EEWGWN 110 120 130 140 150 160 220 230 pF1KE2 DVNCLGPQRSVCEMMKIHL :..: ::.:.:.: ::.. CCDS85 DIHCHVPQKSICKMKKIYI 170 180 >>CCDS31739.1 CLEC6A gene_id:93978|Hs108|chr12 (209 aa) initn: 590 init1: 470 opt: 613 Z-score: 725.6 bits: 141.4 E(32554): 4.3e-34 Smith-Waterman score: 613; 44.8% identity (72.7% similar) in 183 aa overlap (56-237:30-209) 30 40 50 60 70 80 pF1KE2 SAASKERTAPHKSNTGFPKLLCASLLIFFLLLAISFFIAFVIFFQ-KYSQLLEKKTTKEL ::. :... :. .. :.. .. . . CCDS31 MMQEQQPQSTEKRGWLSLRLWSVAGISIALLSACFIVSCVVTYHFTYGETGKRLSELHS 10 20 30 40 50 90 100 110 120 130 140 pF1KE2 VHTTLECVKKNMPVEETAWSCCPKNWKSFSSNCYFISTESASWQDSEKDCARMEAHLLVI :..: : ... : ::.::: .::::.:.:::::.: :. ::..:..: :::.:. CCDS31 YHSSLTCFSEGTKV--PAWGCCPASWKSFGSSCYFISSEEKVWSKSEQNCVEMGAHLVVF 60 70 80 90 100 110 150 160 170 180 190 200 pF1KE2 NTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVDQTPYNESSTFWHPREPSDPNERCV ::. 
::.:: :.:.: .::.:::::.:. .:::.:.:::... ::: ::. :.:. CCDS31 NTEAEQNFIVQQLNESFSYFLGLSDPQGNNNWQWIDKTPYEKNVRFWHLGEPNHSAEQCA 120 130 140 150 160 170 210 220 230 pF1KE2 VLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL . : : : :::::: : . :.::: ::.: CCDS31 SIVFWK-PTGWGWNDVICETRRNSICEMNKIYL 180 190 200 >>CCDS8593.1 CLEC4D gene_id:338339|Hs108|chr12 (215 aa) initn: 447 init1: 381 opt: 539 Z-score: 638.8 bits: 125.4 E(32554): 2.9e-29 Smith-Waterman score: 539; 36.7% identity (72.4% similar) in 196 aa overlap (43-232:16-209) 20 30 40 50 60 70 pF1KE2 KNEFKSSGINTASSAASKERTAPHKSNTGFPKLLCASL-LIFFLLLAISFFIAFVIFFQK :.:. . . ..:.:::.. :. . .. .. CCDS85 MGLEKPQSKLEGGMHPQLIPSVIAVVFILLLSVCFIASCLVTHHN 10 20 30 40 80 90 100 110 120 pF1KE2 YSQLLEKKTTKELVH-TTLECVKKNMPV---EETAWSCCPKNWKSFSSNCYFISTESASW .:. . ...: : . :.:.:.. . : ..:.::: .:..:.::::: :.. .: CCDS85 FSRCKRGTGVHKLEHHAKLKCIKEKSELKSAEGSTWNCCPIDWRAFQSNCYFPLTDNKTW 50 60 70 80 90 100 130 140 150 160 170 180 pF1KE2 QDSEKDCARMEAHLLVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVDQTPYNES .::..:. : :::..:.:. ::.::.: :... .::.:: : ... .:.::::::.: CCDS85 AESERNCSGMGAHLMTISTEAEQNFIIQFLDRRLSYFLGLRDENAKGQWRWVDQTPFNPR 110 120 130 140 150 160 190 200 210 220 230 pF1KE2 STFWHPREPSDPN-ERCVVLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL .::: ::.. . : :::: . .. .:.:::: : .:.. CCDS85 RVFWHKNEPDNSQGENCVVLVYNQD--KWAWNDVPCNFEASRICKIPGTTLN 170 180 190 200 210 >>CCDS8594.1 CLEC4E gene_id:26253|Hs108|chr12 (219 aa) initn: 413 init1: 354 opt: 453 Z-score: 538.0 bits: 106.8 E(32554): 1.2e-23 Smith-Waterman score: 453; 38.0% identity (63.1% similar) in 187 aa overlap (52-236:29-211) 30 40 50 60 70 80 pF1KE2 NTASSAASKERTAPHKSNTGFPKLLCASLLIFFLLLAISFFIAFVIFFQKYSQLLEKKTT : .:.:. :. :. :. .. ::: CCDS85 MNSSKSSETQCTERGCFSSQMFLWTVAGIPILFLSACFITRCVVTFRIFQTCDEKKFQ 10 20 30 40 50 90 100 110 120 130 140 pF1KE2 KELVHTTLECVKKNMPVEETAWSCCPKNWKSFSSNCYFISTESASWQDSEKDCARMEAHL : : : . . .. .::: ::. :.:.:::.::.. :: : :.:. 
: ::: CCDS85 LPENFTELSCYNYGSG---SVKNCCPLNWEYFQSSCYFFSTDTISWALSLKNCSAMGAHL 60 70 80 90 100 110 150 160 170 180 190 200 pF1KE2 LVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVDQTPYNESSTFWHPREPSDPN- .:::.::::.:. . . .:.:::: . .::::: :: ..: .:: ::.. CCDS85 VVINSQEEQEFLSYKKPKMREFFIGLSDQVVEGQWQWVDGTPLTKSLSFWDVGEPNNIAT 120 130 140 150 160 170 210 220 230 pF1KE2 -ERCVVLNFRKSPKRWGWNDVNCLGPQRSVCEMMKIHL : :... ..: : .::::.:. .:::. :. CCDS85 LEDCATMRDSSNP-RQNWNDVTCFLNYFRICEMVGINPLNKGKSL 180 190 200 210 >>CCDS45597.1 CLEC10A gene_id:10462|Hs108|chr17 (292 aa) initn: 263 init1: 190 opt: 340 Z-score: 404.1 bits: 82.4 E(32554): 3.5e-16 Smith-Waterman score: 340; 36.6% identity (60.5% similar) in 172 aa overlap (72-231:119-281) 50 60 70 80 90 pF1KE2 FPKLLCASLLIFFLLLAISFFIAFVIFFQKYSQLLEK--KTTKELVHTTLECVKKNMPVE .:..: . . ...: . : . . : : CCDS45 ALTSQGSSLEETIASLKAEVEGFKQERQAVHSEMLLRVQQLVQDLKKLTCQVATLNNNGE 90 100 110 120 130 140 100 110 120 130 140 150 pF1KE2 E--TAWSCCPKNWKSFSSNCYFISTESASWQDSEKDCARMEAHLLVINTQEEQDFIFQNL : : .::: :: ...::..: . :: ..:: : .:::.:::..:::.:. . : CCDS45 EASTEGTCCPVNWVEHQDSCYWFSHSGMSWAEAEKYCQLKNAHLVVINSREEQNFVQKYL 150 160 170 180 190 200 160 170 180 190 200 pF1KE2 QEESAY-FVGLSDPEGQRHWQWVDQTPYNESSTFWHPREPSD-------PNERCVVLNFR ::: ..::::::: :.::: : : . :.: .:.: .: :. .:. CCDS45 G--SAYTWMGLSDPEGA--WKWVDGTDYATGFQNWKPGQPDDWQGHGLGGGEDCA--HFH 210 220 230 240 250 260 210 220 230 pF1KE2 KSPKRWGWNDVNCLGPQRSVCEMMKIHL . . ::: : : . ::: CCDS45 PDGR---WNDDVCQRPYHWVCEAGLGQTSQESH 270 280 290 237 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 15:47:31 2016 done: Mon Nov 7 15:47:32 2016 Total Scan time: 2.200 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]