FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1712, 216 aa 1>>>pF1KE1712 216 - 216 aa - 216 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.7518+/-0.000809; mu= 16.9845+/- 0.049 mean_var=63.8683+/-12.503, 0's: 0 Z-trim(107.4): 97 B-trim: 50 in 1/49 Lambda= 0.160484 statistics sampled from 9483 (9583) to 9483 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.675), E-opt: 0.2 (0.294), width: 16 Scan time: 1.850 The best scores are: opt bits E(32554) CCDS8623.1 KLRK1 gene_id:22914|Hs108|chr12 ( 216) 1544 365.7 1.2e-101 CCDS44830.1 CLEC12B gene_id:387837|Hs108|chr12 ( 276) 327 84.1 9.7e-17 CCDS8622.1 KLRD1 gene_id:3824|Hs108|chr12 ( 148) 305 78.7 2.1e-15 CCDS8621.1 KLRD1 gene_id:3824|Hs108|chr12 ( 179) 305 78.8 2.4e-15 CCDS8610.1 CLEC12B gene_id:387837|Hs108|chr12 ( 232) 290 75.4 3.2e-14 CCDS41751.1 CLEC1B gene_id:51266|Hs108|chr12 ( 196) 285 74.2 6.3e-14 CCDS41752.1 CLEC1B gene_id:51266|Hs108|chr12 ( 229) 285 74.3 7.1e-14 CCDS8618.1 OLR1 gene_id:4973|Hs108|chr12 ( 273) 284 74.1 9.5e-14 CCDS8611.1 CLEC9A gene_id:283420|Hs108|chr12 ( 241) 276 72.2 3.1e-13 CCDS59347.1 CLEC4M gene_id:10332|Hs108|chr19 ( 263) 265 69.7 2e-12 CCDS12187.1 CLEC4M gene_id:10332|Hs108|chr19 ( 399) 265 69.8 2.7e-12 CCDS59345.1 CD209 gene_id:30835|Hs108|chr19 ( 243) 261 68.7 3.5e-12 CCDS59344.1 CD209 gene_id:30835|Hs108|chr19 ( 268) 261 68.8 3.8e-12 CCDS45952.1 CD209 gene_id:30835|Hs108|chr19 ( 312) 261 68.8 4.2e-12 CCDS45949.1 CD209 gene_id:30835|Hs108|chr19 ( 360) 261 68.9 4.7e-12 CCDS45950.1 CD209 gene_id:30835|Hs108|chr19 ( 380) 261 68.9 4.9e-12 CCDS12186.1 CD209 gene_id:30835|Hs108|chr19 ( 404) 261 68.9 5.1e-12 CCDS8599.1 KLRG1 gene_id:10219|Hs108|chr12 ( 189) 257 67.7 5.5e-12 CCDS53743.1 KLRF2 gene_id:100431172|Hs108|chr12 ( 207) 253 66.8 1.1e-11 CCDS76528.1 CLEC1A gene_id:51267|Hs108|chr12 ( 188) 251 66.3 1.4e-11 CCDS45951.1 CD209 gene_id:30835|Hs108|chr19 ( 398) 253 67.0 1.8e-11 CCDS8625.1 KLRC1 gene_id:3821|Hs108|chr12 ( 233) 250 66.2 2e-11 CCDS73443.1 CLEC1A gene_id:51267|Hs108|chr12 ( 247) 249 66.0 2.4e-11 CCDS8612.1 CLEC1A gene_id:51267|Hs108|chr12 ( 280) 249 66.0 2.7e-11 CCDS76531.1 KLRC1 gene_id:3821|Hs108|chr12 ( 228) 243 64.5 6e-11 >>CCDS8623.1 KLRK1 gene_id:22914|Hs108|chr12 (216 aa) initn: 1544 init1: 1544 opt: 1544 Z-score: 1938.3 bits: 365.7 E(32554): 1.2e-101 Smith-Waterman score: 1544; 99.5% identity (100.0% similar) in 216 aa overlap (1-216:1-216) 10 20 30 40 50 60 pF1KE1 MGWIRGRRSRHSWEMSEFHNYNLDLKKSDFSTRWQKQRCPVVKSKCRENASPFFFCCFIA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS86 MGWIRGRRSRHSWEMSEFHNYNLDLKKSDFSTRWQKQRCPVVKSKCRENASPFFFCCFIA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 VAMGIRFIIMVAIWSAVFLNSLFNQEVQIPLTESYCGPCPKNWICYKNNCYQFFDESKNW :::::::::::.:::::::::::::::::::::::::::::::::::::::::::::::: CCDS86 VAMGIRFIIMVTIWSAVFLNSLFNQEVQIPLTESYCGPCPKNWICYKNNCYQFFDESKNW 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 YESQASCMSQNASLLKVYSKEDQDLLKLVKSYHWMGLVHIPTNGSWQWEDGSILSPNLLT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS86 YESQASCMSQNASLLKVYSKEDQDLLKLVKSYHWMGLVHIPTNGSWQWEDGSILSPNLLT 130 140 150 160 170 180 190 200 210 pF1KE1 IIEMQKGDCALYASSFKGYIENCSTPNTYICMQRTV :::::::::::::::::::::::::::::::::::: CCDS86 IIEMQKGDCALYASSFKGYIENCSTPNTYICMQRTV 190 200 210 >>CCDS44830.1 CLEC12B gene_id:387837|Hs108|chr12 (276 aa) initn: 224 init1: 118 opt: 327 Z-score: 414.0 bits: 84.1 E(32554): 9.7e-17 Smith-Waterman score: 327; 38.0% identity (65.7% similar) in 137 aa overlap (85-211:129-263) 60 70 80 90 100 110 pF1KE1 FCCFIAVAMGIRFIIMVAIWSAVFLNSLFNQEVQIPLTESYCGPCPKNWICYKNNCYQFF ::. : .. :.:::: : :.:.:: : CCDS44 SNNLSMEEEFLKSQISSVLKRQEQMAIKLCQELIIHTSDHRCNPCPKMWQWYQNSCYYFT 100 110 120 130 140 150 120 130 140 150 160 pF1KE1 -DESKNWYESQASCMSQNASLLKVYSKEDQDLLK----LVKSYHWMGLVHIPTNGSWQWE .: :.: .:. .:...:..:.:. : :..:.: :. :. :.:: .. :: :: CCDS44 TNEEKTWANSRKDCIDKNSTLVKIDSLEEKDFLMSQPLLMFSFFWLGLSWDSSGRSWFWE 160 170 180 190 200 210 170 180 190 200 210 pF1KE1 DGSILSPNLLTIIEMQ-----KGDCALYASSFKGYIENCSTPNTYICMQRTV :::. ::.:.. :.. :: :: . .. . :: ::. .:: CCDS44 DGSVPSPSLFSTKELDQINGSKG-CAYFQKG-NIYISRCSAEIFWICEKTAAPVKTEDLD 220 230 240 250 260 270 >>CCDS8622.1 KLRD1 gene_id:3824|Hs108|chr12 (148 aa) initn: 221 init1: 200 opt: 305 Z-score: 390.2 bits: 78.7 E(32554): 2.1e-15 Smith-Waterman score: 305; 32.8% identity (63.4% similar) in 134 aa overlap (84-216:18-148) 60 70 80 90 100 110 pF1KE1 FFCCFIAVAMGIRFIIMVAIWSAVFLNSLFNQEVQIPLTESYCGPCPKNWICYKNNCYQF : :.: .: : : ..:. :. ::: . CCDS86 MAAFTKLSIEPAFTPGPNIELQ---KDSDCCSCQEKWVGYRCNCYFI 10 20 30 40 120 130 140 150 160 170 pF1KE1 FDESKNWYESQASCMSQNASLLKVYSKEDQDLLKLVKSYHWMGLVHIPTNGSWQWEDGSI .:.:.: ::. : ::..:::.. . .. :... ....:.:: . . .: ::.:: CCDS86 SSEQKTWNESRHLCASQKSSLLQLQNTDELDFMSSSQQFYWIGLSYSEEHTAWLWENGSA 50 60 70 80 90 100 180 190 200 210 pF1KE1 LSPNLLTIIE-MQKGDCALYASSFKGYIENCSTPNTYICMQRTV :: :. .: .. .: : . .. :.: : ::: :. . CCDS86 LSQYLFPSFETFNTKNCIAYNPNGNALDESCEDKNRYICKQQLI 110 120 130 140 >>CCDS8621.1 KLRD1 gene_id:3824|Hs108|chr12 (179 aa) initn: 221 init1: 200 opt: 305 Z-score: 389.1 bits: 78.8 E(32554): 2.4e-15 Smith-Waterman score: 305; 32.8% identity (63.4% similar) in 134 aa overlap (84-216:49-179) 60 70 80 90 100 110 pF1KE1 FFCCFIAVAMGIRFIIMVAIWSAVFLNSLFNQEVQIPLTESYCGPCPKNWICYKNNCYQF : :.: .: : : ..:. :. ::: . CCDS86 ICLSLMSTLGILLKNSFTKLSIEPAFTPGPNIELQ---KDSDCCSCQEKWVGYRCNCYFI 20 30 40 50 60 70 120 130 140 150 160 170 pF1KE1 FDESKNWYESQASCMSQNASLLKVYSKEDQDLLKLVKSYHWMGLVHIPTNGSWQWEDGSI .:.:.: ::. : ::..:::.. . .. :... ....:.:: . . .: ::.:: CCDS86 SSEQKTWNESRHLCASQKSSLLQLQNTDELDFMSSSQQFYWIGLSYSEEHTAWLWENGSA 80 90 100 110 120 130 180 190 200 210 pF1KE1 LSPNLLTIIE-MQKGDCALYASSFKGYIENCSTPNTYICMQRTV :: :. .: .. .: : . .. :.: : ::: :. . CCDS86 LSQYLFPSFETFNTKNCIAYNPNGNALDESCEDKNRYICKQQLI 140 150 160 170 >>CCDS8610.1 CLEC12B gene_id:387837|Hs108|chr12 (232 aa) initn: 185 init1: 118 opt: 290 Z-score: 368.8 bits: 75.4 E(32554): 3.2e-14 Smith-Waterman score: 290; 41.4% identity (69.7% similar) in 99 aa overlap (85-178:129-227) 60 70 80 90 100 110 pF1KE1 FCCFIAVAMGIRFIIMVAIWSAVFLNSLFNQEVQIPLTESYCGPCPKNWICYKNNCYQFF ::. : .. :.:::: : :.:.:: : CCDS86 SNNLSMEEEFLKSQISSVLKRQEQMAIKLCQELIIHTSDHRCNPCPKMWQWYQNSCYYFT 100 110 120 130 140 150 120 130 140 150 160 pF1KE1 -DESKNWYESQASCMSQNASLLKVYSKEDQDLLK----LVKSYHWMGLVHIPTNGSWQWE .: :.: .:. .:...:..:.:. : :..:.: :. :. :.:: .. :: :: CCDS86 TNEEKTWANSRKDCIDKNSTLVKIDSLEEKDFLMSQPLLMFSFFWLGLSWDSSGRSWFWE 160 170 180 190 200 210 170 180 190 200 210 pF1KE1 DGSILSPNLLTIIEMQKGDCALYASSFKGYIENCSTPNTYICMQRTV :::. ::.: CCDS86 DGSVPSPSLYVSNY 220 230 >>CCDS41751.1 CLEC1B gene_id:51266|Hs108|chr12 (196 aa) initn: 257 init1: 146 opt: 285 Z-score: 363.5 bits: 74.2 E(32554): 6.3e-14 Smith-Waterman score: 285; 32.8% identity (64.8% similar) in 125 aa overlap (96-214:66-186) 70 80 90 100 110 120 pF1KE1 RFIIMVAIWSAVFLNSLFNQEVQIPLTESYCGPCPKNWICYKNNCYQFFDESKNWYESQA :.:: :: : ..:: :: .. .: ::. CCDS41 RTGTLQQLAKRFCQYVVKQSELKGTFKGHKCSPCDTNWRYYGDSCYGFFRHNLTWEESKQ 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE1 SCMSQNASLLKVYSKEDQDLLKLVKSY-H---WMGLVHIPTNGSWQWEDGSILSPNLLTI : ..::.:::. ....... .:. : :.:: . .: :.:::::..: :.. . CCDS41 YCTDMNATLLKI---DNRNIVEYIKARTHLIRWVGLSRQKSNEVWKWEDGSVISENMFEF 100 110 120 130 140 150 190 200 210 pF1KE1 IEMQKGD--CALYASSFKGYIENCSTPNTYICMQRTV .: ::. :: . .. : . : . . .: .. CCDS41 LEDGKGNMNCAYFHNG-KMHPTFCENKHYLMCERKAGMTKVDQLP 160 170 180 190 >>CCDS41752.1 CLEC1B gene_id:51266|Hs108|chr12 (229 aa) initn: 257 init1: 146 opt: 285 Z-score: 362.6 bits: 74.3 E(32554): 7.1e-14 Smith-Waterman score: 285; 32.8% identity (64.8% similar) in 125 aa overlap (96-214:99-219) 70 80 90 100 110 120 pF1KE1 RFIIMVAIWSAVFLNSLFNQEVQIPLTESYCGPCPKNWICYKNNCYQFFDESKNWYESQA :.:: :: : ..:: :: .. .: ::. CCDS41 RTGTLQQLAKRFCQYVVKQSELKGTFKGHKCSPCDTNWRYYGDSCYGFFRHNLTWEESKQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 SCMSQNASLLKVYSKEDQDLLKLVKSY-H---WMGLVHIPTNGSWQWEDGSILSPNLLTI : ..::.:::. ....... .:. : :.:: . .: :.:::::..: :.. . CCDS41 YCTDMNATLLKI---DNRNIVEYIKARTHLIRWVGLSRQKSNEVWKWEDGSVISENMFEF 130 140 150 160 170 180 190 200 210 pF1KE1 IEMQKGD--CALYASSFKGYIENCSTPNTYICMQRTV .: ::. :: . .. : . : . . .: .. CCDS41 LEDGKGNMNCAYFHNG-KMHPTFCENKHYLMCERKAGMTKVDQLP 190 200 210 220 >>CCDS8618.1 OLR1 gene_id:4973|Hs108|chr12 (273 aa) initn: 296 init1: 187 opt: 284 Z-score: 360.3 bits: 74.1 E(32554): 9.5e-14 Smith-Waterman score: 298; 38.0% identity (58.9% similar) in 129 aa overlap (96-214:140-267) 70 80 90 100 110 120 pF1KE1 RFIIMVAIWSAVFLNSLFNQEVQIPLTESYCG-PCPKNWICYKNNCYQFFDESKNWYESQ :. :::..:: . .::: : . : :: .:: CCDS86 ARKLNEKSKEQMELHHQNLNLQETLKRVANCSAPCPQDWIWHGENCYLFSSGSFNWEKSQ 110 120 130 140 150 160 130 140 150 160 170 180 pF1KE1 ASCMSQNASLLKVYSKEDQDLLKLVKSYH----WMGLVHIPTNGSWQWEDGSILSPNLLT .:.: .:.:::. : : :... . :: :::: . . : ::::: : :.:. CCDS86 EKCLSLDAKLLKINSTADLDFIQQAISYSSFPFWMGLSRRNPSYPWLWEDGSPLMPHLFR 170 180 190 200 210 220 190 200 210 pF1KE1 I-----IEMQKGDCALYASSFKGYIENCSTPNTYICMQRTV . . .: :: : . : ::: ::... CCDS86 VRGAVSQTYPSGTCA-YIQRGAVYAENCILAAFSICQKKANLRAQ 230 240 250 260 270 >>CCDS8611.1 CLEC9A gene_id:283420|Hs108|chr12 (241 aa) initn: 298 init1: 159 opt: 276 Z-score: 351.0 bits: 72.2 E(32554): 3.1e-13 Smith-Waterman score: 276; 36.1% identity (66.4% similar) in 122 aa overlap (98-211:112-232) 70 80 90 100 110 120 pF1KE1 IIMVAIWSAVFLNSLFNQEVQIPLTESYCGPCPKNWICYKNNCYQFFDESKNWYESQASC :::.::: ...:: . . :. :: .: CCDS86 FTEWKRSCALQMKYCQAFMQNSLSSAHNSSPCPNNWIQNRESCYYVSEIWSIWHTSQENC 90 100 110 120 130 140 130 140 150 160 170 180 pF1KE1 MSQNASLLKVYSKEDQDLL-----KLVKSY-HWMGLVHIPTNGSWQWEDGSILSPNLLTI ......::.. :::..:.. :. :: .:.:: . .: : :.::: ::.:: CCDS86 LKEGSTLLQIESKEEMDFITGSLRKIKGSYDYWVGLSQDGHSGRWLWQDGSSPSPGLLPA 150 160 170 180 190 200 190 200 210 pF1KE1 IEMQKGD--CALYASSFKGYIENCSTPNTYICMQRTV . :... :. :..: . :::: . .:: CCDS86 ERSQSANQVCG-YVKSNSLLSSNCSTWKYFICEKYALRSSV 210 220 230 240 >>CCDS59347.1 CLEC4M gene_id:10332|Hs108|chr19 (263 aa) initn: 236 init1: 116 opt: 265 Z-score: 336.7 bits: 69.7 E(32554): 2e-12 Smith-Waterman score: 292; 34.6% identity (63.8% similar) in 130 aa overlap (93-211:126-253) 70 80 90 100 110 120 pF1KE1 MGIRFIIMVAIWSAVFLNSLFNQEVQIPLTESYCGPCPKNWICYKNNCYQFFDESKNWYE : : :::.: ...::: . . ..::.. CCDS59 TQLKAAVGELPEKSKLQEIYQELTRLKAAVERLCRHCPKDWTFFQGNCYFMSNSQRNWHD 100 110 120 130 140 150 130 140 150 160 170 pF1KE1 SQASCMSQNASLLKVYSKEDQDLLKLVKS----YHWMGLVHIPTNGSWQWEDGSILSPNL : ..:. :.:. . . :.:..:.: : . :::: . .:.::: ::: :::.. CCDS59 SVTACQEVRAQLVVIKTAEEQNFLQLQTSRSNRFSWMGLSDLNQEGTWQWVDGSPLSPSF 160 170 180 190 200 210 180 190 200 210 pF1KE1 LTIIEM----QKG--DCALYASSFKGYIEN-CSTPNTYICMQRTV . ..: ::: ...: :. .: :.. : .:: CCDS59 QRYWNSGEPNNSGNEDCAEFSGS--GWNDNRCDVDNYWICKKPAACFRDE 220 230 240 250 260 216 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 14:56:55 2016 done: Sun Nov 6 14:56:55 2016 Total Scan time: 1.850 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]