FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1554, 273 aa
1>>>pF1KE1554 273 - 273 aa - 273 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.0837+/-0.000893; mu= 7.2293+/- 0.053
mean_var=121.0320+/-24.310, 0's: 0 Z-trim(110.3): 93 B-trim: 451 in 1/52
Lambda= 0.116580
statistics sampled from 11407 (11500) to 11407 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.725), E-opt: 0.2 (0.353), width: 16
Scan time: 2.410
The best scores are:                                    opt  bits E(32554)
CCDS8618.1 OLR1 gene_id:4973|Hs108|chr12        ( 273) 1852 322.1 2.6e-88
CCDS53746.1 OLR1 gene_id:4973|Hs108|chr12       ( 189) 1249 220.6 6.6e-58
CCDS53745.1 OLR1 gene_id:4973|Hs108|chr12       ( 181)  907 163.1 1.3e-40
CCDS44830.1 CLEC12B gene_id:387837|Hs108|chr12  ( 276)  450  86.3 2.6e-17
CCDS8614.1 CLEC7A gene_id:64581|Hs108|chr12     ( 168)  412  79.8 1.4e-15
CCDS41753.1 CLEC7A gene_id:64581|Hs108|chr12    ( 247)  411  79.7 2.2e-15
CCDS8613.1 CLEC7A gene_id:64581|Hs108|chr12     ( 201)  404  78.5 4.2e-15
CCDS76528.1 CLEC1A gene_id:51267|Hs108|chr12    ( 188)  364  71.7 4.2e-13
CCDS8610.1 CLEC12B gene_id:387837|Hs108|chr12   ( 232)  361  71.3 7.1e-13
CCDS73443.1 CLEC1A gene_id:51267|Hs108|chr12    ( 247)  361  71.3 7.5e-13
CCDS8612.1 CLEC1A gene_id:51267|Hs108|chr12     ( 280)  358  70.8 1.2e-12
CCDS59344.1 CD209 gene_id:30835|Hs108|chr19     ( 268)  332  66.5 2.4e-11
CCDS45949.1 CD209 gene_id:30835|Hs108|chr19     ( 360)  334  66.9 2.4e-11
CCDS45950.1 CD209 gene_id:30835|Hs108|chr19     ( 380)  334  66.9 2.5e-11
CCDS12186.1 CD209 gene_id:30835|Hs108|chr19     ( 404)  334  66.9 2.6e-11
CCDS45952.1 CD209 gene_id:30835|Hs108|chr19     ( 312)  332  66.5 2.7e-11
CCDS59347.1 CLEC4M gene_id:10332|Hs108|chr19    ( 263)  326  65.4 4.7e-11
CCDS8611.1 CLEC9A gene_id:283420|Hs108|chr12    ( 241)  321  64.6 7.8e-11
>>CCDS8618.1 OLR1 gene_id:4973|Hs108|chr12 (273 aa)
initn: 1852 init1: 1852 opt: 1852 Z-score: 1698.8 bits: 322.1 E(32554): 2.6e-88
Smith-Waterman score: 1852; 100.0% identity (100.0% similar) in 273 aa overlap (1-273:1-273)
               10        20        30        40        50        60
pF1KE1 MTFDDLKIQTVKDQPDEKSNGKKAKGLQFLYSPWWCLAAATLGVLCLGLVVTIMVLGMQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS86 MTFDDLKIQTVKDQPDEKSNGKKAKGLQFLYSPWWCLAAATLGVLCLGLVVTIMVLGMQL
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE1 SQVSDLLTQEQANLTHQKKKLEGQISARQQAEEASQESENELKEMIETLARKLNEKSKEQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS86 SQVSDLLTQEQANLTHQKKKLEGQISARQQAEEASQESENELKEMIETLARKLNEKSKEQ
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE1 MELHHQNLNLQETLKRVANCSAPCPQDWIWHGENCYLFSSGSFNWEKSQEKCLSLDAKLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS86 MELHHQNLNLQETLKRVANCSAPCPQDWIWHGENCYLFSSGSFNWEKSQEKCLSLDAKLL
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KE1 KINSTADLDFIQQAISYSSFPFWMGLSRRNPSYPWLWEDGSPLMPHLFRVRGAVSQTYPS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS86 KINSTADLDFIQQAISYSSFPFWMGLSRRNPSYPWLWEDGSPLMPHLFRVRGAVSQTYPS
              190       200       210       220       230       240
              250       260       270
pF1KE1 GTCAYIQRGAVYAENCILAAFSICQKKANLRAQ
       :::::::::::::::::::::::::::::::::
CCDS86 GTCAYIQRGAVYAENCILAAFSICQKKANLRAQ
              250       260       270
>>CCDS53746.1 OLR1 gene_id:4973|Hs108|chr12 (189 aa)
initn: 1249 init1: 1249 opt: 1249 Z-score: 1153.1 bits: 220.6 E(32554): 6.6e-58
Smith-Waterman score: 1249; 100.0% identity (100.0% similar) in 188 aa overlap (1-188:1-188)
               10        20        30        40        50        60
pF1KE1 MTFDDLKIQTVKDQPDEKSNGKKAKGLQFLYSPWWCLAAATLGVLCLGLVVTIMVLGMQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 MTFDDLKIQTVKDQPDEKSNGKKAKGLQFLYSPWWCLAAATLGVLCLGLVVTIMVLGMQL
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE1 SQVSDLLTQEQANLTHQKKKLEGQISARQQAEEASQESENELKEMIETLARKLNEKSKEQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 SQVSDLLTQEQANLTHQKKKLEGQISARQQAEEASQESENELKEMIETLARKLNEKSKEQ
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE1 MELHHQNLNLQETLKRVANCSAPCPQDWIWHGENCYLFSSGSFNWEKSQEKCLSLDAKLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 MELHHQNLNLQETLKRVANCSAPCPQDWIWHGENCYLFSSGSFNWEKSQEKCLSLDAKLL
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KE1 KINSTADLDFIQQAISYSSFPFWMGLSRRNPSYPWLWEDGSPLMPHLFRVRGAVSQTYPS
       ::::::::
CCDS53 KINSTADLI
>>CCDS53745.1 OLR1 gene_id:4973|Hs108|chr12 (181 aa)
initn: 904 init1: 904 opt: 907 Z-score: 842.5 bits: 163.1 E(32554): 1.3e-40
Smith-Waterman score: 907; 97.9% identity (98.6% similar) in 145 aa overlap (1-145:1-145)
               10        20        30        40        50        60
pF1KE1 MTFDDLKIQTVKDQPDEKSNGKKAKGLQFLYSPWWCLAAATLGVLCLGLVVTIMVLGMQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 MTFDDLKIQTVKDQPDEKSNGKKAKGLQFLYSPWWCLAAATLGVLCLGLVVTIMVLGMQL
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE1 SQVSDLLTQEQANLTHQKKKLEGQISARQQAEEASQESENELKEMIETLARKLNEKSKEQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 SQVSDLLTQEQANLTHQKKKLEGQISARQQAEEASQESENELKEMIETLARKLNEKSKEQ
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE1 MELHHQNLNLQETLKRVANCSAPCPQDWIWHGENCYLFSSGSFNWEKSQEKCLSLDAKLL
       :::::::::::::::::::::.  :
CCDS53 MELHHQNLNLQETLKRVANCSGLHPASNFLFQFSILDGAVSEEPQLPMALGGRFSFDAPL
              130       140       150       160       170       180
>>CCDS44830.1 CLEC12B gene_id:387837|Hs108|chr12 (276 aa)
initn: 409 init1: 123 opt: 450 Z-score: 424.4 bits: 86.3 E(32554): 2.6e-17
Smith-Waterman score: 451; 31.5% identity (61.2% similar) in 276 aa overlap (1-268:10-267)
10 20 30 40 50
pF1KE1 MTFDDLK-IQTVKDQPDEKSNGKKAKGLQFLYSPWWCLAAATLGVLCLGLV
.::.: .. .: . .. :. : :: : :: : .::: :.
CCDS44 MSEEVTYATLTFQDSAGARNNRDGNNLRKRGHPAP------SPIWRHAALGLVTLCLMLL
10 20 30 40 50
60 70 80 90 100
pF1KE1 VTIMVLGMQLSQVSDLLTQEQANLTHQKKKLEGQISARQQAEEASQE--SENELKEMIET
. ...:::.. :.:. ..... .:.. .: .. :: .. ::. . :.:. :
CCDS44 IGLVTLGMMFLQISNDINSDSEKLSQLQKTIQ------QQQDNLSQQLGNSNNLSMEEEF
60 70 80 90 100
110 120 130 140 150 160
pF1KE1 LARKLNE--KSKEQMELHH-QNLNLQETLKRVANCSAPCPQDWIWHGENCYLFSSGSFN-
: ... : .::: .. :.: .. . .: :. :::. : :. ..:: :... .
CCDS44 LKSQISSVLKRQEQMAIKLCQELIIHTSDHR---CN-PCPKMWQWYQNSCYYFTTNEEKT
110 120 130 140 150 160
170 180 190 200 210 220
pF1KE1 WEKSQEKCLSLDAKLLKINSTADLDFIQ-QAISYSSFPFWMGLSRRNPSYPWLWEDGSPL
: .:.. :.. .. :.::.: . ::.. : . . :: ::.::: . . :.:::::
CCDS44 WANSRKDCIDKNSTLVKIDSLEEKDFLMSQPLLMFSF-FWLGLSWDSSGRSWFWEDGSVP
170 180 190 200 210 220
230 240 250 260 270
pF1KE1 MPHLFRVRGAVSQTYPSGTCAYIQRGAVYAENCILAAFSICQKKANLRAQ
: :: .. ..: : :::.:.: .: : : ::.: :
CCDS44 SPSLFSTK-ELDQINGSKGCAYFQKGNIYISRCSAEIFWICEKTAAPVKTEDLD
230 240 250 260 270
>>CCDS8614.1 CLEC7A gene_id:64581|Hs108|chr12 (168 aa)
initn: 416 init1: 235 opt: 412 Z-score: 393.0 bits: 79.8 E(32554): 1.4e-15
Smith-Waterman score: 412; 36.5% identity (71.1% similar) in 159 aa overlap (113-270:10-168)
90 100 110 120 130 140
pF1KE1 GQISARQQAEEASQESENELKEMIETLARKLNEKSKEQMELHHQNLNLQETLKRVANCSA
:.: . :... :. . .... . :.
CCDS86 MEYHPDLENLDEDGYTQLHFDSQSNTRIAVVSEKGVLSS
10 20 30
150 160 170 180 190 200
pF1KE1 PCPQDWIWHGENCYLFSSGSFNWEKSQEKCLSLDAKLLKINSTADLDFI-QQAISYSSFP
::: .:: . ..::::: . .:. :...: .: ..::::.:. .: :: .:. : .
CCDS86 PCPPNWIIYEKSCYLFSMSLNSWDGSKRQCWQLGSNLLKIDSSNELGFIVKQVSSQPDNS
40 50 60 70 80 90
210 220 230 240 250 260
pF1KE1 FWMGLSRRNPSYPWLWEDGSPLMPHLFRVRGAVSQTYPSGTCAYIQRGAVYAENCILAAF
::.:::: . :::::::: . .::..: ...: :: .:..:. ...: . : . ..
CCDS86 FWIGLSRPQTEVPWLWEDGSTFSSNLFQIRTTATQENPSPNCVWIHVSVIYDQLCSVPSY
100 110 120 130 140 150
270
pF1KE1 SICQKKANLRAQ
:::.:: ..
CCDS86 SICEKKFSM
160
>>CCDS41753.1 CLEC7A gene_id:64581|Hs108|chr12 (247 aa)
initn: 476 init1: 235 opt: 411 Z-score: 389.7 bits: 79.7 E(32554): 2.2e-15
Smith-Waterman score: 440; 31.9% identity (59.2% similar) in 260 aa overlap (16-270:21-247)
10 20 30 40 50
pF1KE1 MTFDDLKIQTVKDQPDEKSNGKKA----KGLQFLYSPWWCLAAATLGVLCLGLVV
: .:: . : :: . :: : : :. ::.::: ..:
CCDS41 MEYHPDLENLDEDGYTQLHFDSQSNTRIAVVSEKG-SCAASPPWRLIAVILGILCLVILV
10 20 30 40 50
60 70 80 90 100 110
pF1KE1 TIMVLGMQLSQVSDLLTQEQANLTHQKKKLEGQISARQQAEEASQESENELKEMIETLAR
.::: . :. .. : : . .:.. :. :: ... :.. .
CCDS41 IAVVLGTMAIWRSNSGSNTLEN---------GYFLSRNK-ENHSQPTQSSLEDSVTPT--
60 70 80 90 100
120 130 140 150 160 170
pF1KE1 KLNEKSKEQMELHHQNLNLQETLKRVANCSAPCPQDWIWHGENCYLFSSGSFNWEKSQEK
...: .. :.::: .:: . ..::::: . .:. :...
CCDS41 --------------------KAVKTTGVLSSPCPPNWIIYEKSCYLFSMSLNSWDGSKRQ
110 120 130 140
180 190 200 210 220 230
pF1KE1 CLSLDAKLLKINSTADLDFI-QQAISYSSFPFWMGLSRRNPSYPWLWEDGSPLMPHLFRV
: .: ..::::.:. .: :: .:. : . ::.:::: . :::::::: . .::..
CCDS41 CWQLGSNLLKIDSSNELGFIVKQVSSQPDNSFWIGLSRPQTEVPWLWEDGSTFSSNLFQI
150 160 170 180 190 200
240 250 260 270
pF1KE1 RGAVSQTYPSGTCAYIQRGAVYAENCILAAFSICQKKANLRAQ
: ...: :: .:..:. ...: . : . ..:::.:: ..
CCDS41 RTTATQENPSPNCVWIHVSVIYDQLCSVPSYSICEKKFSM
210 220 230 240
>>CCDS8613.1 CLEC7A gene_id:64581|Hs108|chr12 (201 aa)
initn: 476 init1: 235 opt: 404 Z-score: 384.6 bits: 78.5 E(32554): 4.2e-15
Smith-Waterman score: 404; 41.2% identity (74.0% similar) in 131 aa overlap (141-270:71-201)
120 130 140 150 160 170
pF1KE1 RKLNEKSKEQMELHHQNLNLQETLKRVANCSAPCPQDWIWHGENCYLFSSGSFNWEKSQE
:.::: .:: . ..::::: . .:. :..
CCDS86 PPWRLIAVILGILCLVILVIAVVLGTMGVLSSPCPPNWIIYEKSCYLFSMSLNSWDGSKR
50 60 70 80 90 100
180 190 200 210 220
pF1KE1 KCLSLDAKLLKINSTADLDFI-QQAISYSSFPFWMGLSRRNPSYPWLWEDGSPLMPHLFR
.: .: ..::::.:. .: :: .:. : . ::.:::: . :::::::: . .::.
CCDS86 QCWQLGSNLLKIDSSNELGFIVKQVSSQPDNSFWIGLSRPQTEVPWLWEDGSTFSSNLFQ
110 120 130 140 150 160
230 240 250 260 270
pF1KE1 VRGAVSQTYPSGTCAYIQRGAVYAENCILAAFSICQKKANLRAQ
.: ...: :: .:..:. ...: . : . ..:::.:: ..
CCDS86 IRTTATQENPSPNCVWIHVSVIYDQLCSVPSYSICEKKFSM
170 180 190 200
>>CCDS76528.1 CLEC1A gene_id:51267|Hs108|chr12 (188 aa)
initn: 263 init1: 188 opt: 364 Z-score: 348.7 bits: 71.7 E(32554): 4.2e-13
Smith-Waterman score: 364; 33.7% identity (60.7% similar) in 163 aa overlap (113-270:12-171)
90 100 110 120 130 140
pF1KE1 GQISARQQAEEASQESENELKEMIETLARKLNEKSKEQMELHHQN--LNLQETLKRVANC
:.. . : :: :. . . .:.:.
CCDS76 MQAKYSSTRDMLDDDGDTTMSLHSQGSATTRHPEPRRTAHR
10 20 30 40
150 160 170 180 190 200
pF1KE1 SAPCPQDWIWHGENCYLFSSGSFNWEKSQEKCLSLDAKLLKINSTADLDFIQQAISYSSF
.:: ..: :::.::: : . : .:: . ::: .. .::::. ::.: . ::: :
CCDS76 CSPCTEQWKWHGDNCYQFYKDSKSWEDCKYFCLSENSTMLKINKQEDLEFAASQ-SYSEF
50 60 70 80 90 100
210 220 230 240 250
pF1KE1 --PFWMGLSRRNPSYPWLWEDGSPLMPHLFRVRGAVSQTYP-SGTCAYIQRGAVYAENCI
.: :: : . . ::: ::.:. .::.. .. : : : :. : : .....:
CCDS76 FYSYWTGLLRPDSGKAWLWMDGTPFTSELFHI--IIDVTSPRSRDCVAILNGMIFSKDCK
110 120 130 140 150
260 270
pF1KE1 LAAFSICQKKANLRAQ
.:...:..
CCDS76 ELKRCVCERRAGMVKPESLHVPPETLGEGD
160 170 180
>>CCDS8610.1 CLEC12B gene_id:387837|Hs108|chr12 (232 aa)
initn: 404 init1: 148 opt: 361 Z-score: 344.6 bits: 71.3 E(32554): 7.1e-13
Smith-Waterman score: 362; 30.5% identity (62.3% similar) in 236 aa overlap (1-228:10-228)
10 20 30 40 50
pF1KE1 MTFDDLK-IQTVKDQPDEKSNGKKAKGLQFLYSPWWCLAAATLGVLCLGLV
.::.: .. .: . .. :. : :: : :: : .::: :.
CCDS86 MSEEVTYATLTFQDSAGARNNRDGNNLRKRGHPAP------SPIWRHAALGLVTLCLMLL
10 20 30 40 50
60 70 80 90 100
pF1KE1 VTIMVLGMQLSQVSDLLTQEQANLTHQKKKLEGQISARQQAEEASQE--SENELKEMIET
. ...:::.. :.:. ..... .:.. .: .. :: .. ::. . :.:. :
CCDS86 IGLVTLGMMFLQISNDINSDSEKLSQLQKTIQ------QQQDNLSQQLGNSNNLSMEEEF
60 70 80 90 100
110 120 130 140 150 160
pF1KE1 LARKLNE--KSKEQMELHH-QNLNLQETLKRVANCSAPCPQDWIWHGENCYLFSSGSFN-
: ... : .::: .. :.: .. . .: :. :::. : :. ..:: :... .
CCDS86 LKSQISSVLKRQEQMAIKLCQELIIHTSDHR---CN-PCPKMWQWYQNSCYYFTTNEEKT
110 120 130 140 150 160
170 180 190 200 210 220
pF1KE1 WEKSQEKCLSLDAKLLKINSTADLDFIQ-QAISYSSFPFWMGLSRRNPSYPWLWEDGSPL
: .:.. :.. .. :.::.: . ::.. : . . :: ::.::: . . :.:::::
CCDS86 WANSRKDCIDKNSTLVKIDSLEEKDFLMSQPLLMFSF-FWLGLSWDSSGRSWFWEDGSVP
170 180 190 200 210 220
230 240 250 260 270
pF1KE1 MPHLFRVRGAVSQTYPSGTCAYIQRGAVYAENCILAAFSICQKKANLRAQ
: :.
CCDS86 SPSLYVSNY
230
>>CCDS73443.1 CLEC1A gene_id:51267|Hs108|chr12 (247 aa)
initn: 263 init1: 188 opt: 361 Z-score: 344.2 bits: 71.3 E(32554): 7.5e-13
Smith-Waterman score: 377; 30.2% identity (56.9% similar) in 232 aa overlap (59-270:4-230)
30 40 50 60 70 80
pF1KE1 FLYSPWWCLAAATLGVLCLGLVVTIMVLGMQLSQVSDLLTQE---QANLTHQKKKLEGQI
. :.. :.: .. .: : . .
CCDS73 MQAKYSSTRDMLDDDGDTTMSLHSQGSATTRHP
10 20 30
90 100 110 120 130
pF1KE1 SARQQAEEASQESENELKEMIETLARKLNEKSKEQMELHHQNLNLQETLKRVAN------
:. . . : : : .. : . ..:.. :.: . :. ::..: .:..::.
CCDS73 EPRRTVFQYYQLS-NTGQDTISQMEERLGNTSQELQSLQVQNIKLAGSLQHVAEKLCREL
40 50 60 70 80 90
140 150 160 170 180 190
pF1KE1 --------CSAPCPQDWIWHGENCYLFSSGSFNWEKSQEKCLSLDAKLLKINSTADLDFI
:: :: ..: :::.::: : . : .:: . ::: .. .::::. ::.:
CCDS73 YNKAGAHRCS-PCTEQWKWHGDNCYQFYKDSKSWEDCKYFCLSENSTMLKINKQEDLEFA
100 110 120 130 140 150
200 210 220 230 240
pF1KE1 QQAISYSSF--PFWMGLSRRNPSYPWLWEDGSPLMPHLFRVRGAVSQTYP-SGTCAYIQR
. ::: : .: :: : . . ::: ::.:. .::.. .. : : : :. :
CCDS73 ASQ-SYSEFFYSYWTGLLRPDSGKAWLWMDGTPFTSELFHI--IIDVTSPRSRDCVAILN
160 170 180 190 200
250 260 270
pF1KE1 GAVYAENCILAAFSICQKKANLRAQ
: .....: .:...:..
CCDS73 GMIFSKDCKELKRCVCERRAGMVKPESLHVPPETLGEGD
210 220 230 240
273 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 22:43:34 2016 done: Sun Nov 6 22:43:35 2016
Total Scan time: 2.410 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]