FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6556, 154 aa 1>>>pF1KE6556 154 - 154 aa - 154 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.8751+/-0.000649; mu= 14.1400+/- 0.039 mean_var=63.2303+/-12.262, 0's: 0 Z-trim(110.4): 99 B-trim: 6 in 1/49 Lambda= 0.161291 statistics sampled from 11504 (11613) to 11504 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.746), E-opt: 0.2 (0.357), width: 16 Scan time: 1.090 The best scores are: opt bits E(32554) CCDS8602.1 CLEC2D gene_id:29121|Hs108|chr12 ( 191) 1100 263.9 3.5e-71 CCDS31741.1 CLEC2D gene_id:29121|Hs108|chr12 ( 194) 1100 263.9 3.6e-71 CCDS55800.1 CLEC2D gene_id:29121|Hs108|chr12 ( 132) 829 200.7 2.5e-52 CCDS55801.1 CLEC2D gene_id:29121|Hs108|chr12 ( 154) 722 175.9 8.9e-45 CCDS55802.1 CLEC2D gene_id:29121|Hs108|chr12 ( 95) 451 112.6 5.9e-26 CCDS8606.1 CLEC2A gene_id:387836|Hs108|chr12 ( 160) 379 96.1 9.8e-21 CCDS44829.1 CLEC2A gene_id:387836|Hs108|chr12 ( 174) 379 96.1 1e-20 CCDS8604.1 CD69 gene_id:969|Hs108|chr12 ( 199) 358 91.2 3.4e-19 CCDS8605.1 CLEC2B gene_id:9976|Hs108|chr12 ( 149) 314 80.9 3.3e-16 CCDS56087.1 CLEC17A gene_id:388512|Hs108|chr19 ( 378) 259 68.4 4.9e-12 CCDS56017.1 ASGR1 gene_id:432|Hs108|chr17 ( 252) 253 66.9 9.4e-12 CCDS11089.1 ASGR1 gene_id:432|Hs108|chr17 ( 291) 253 66.9 1e-11 CCDS45597.1 CLEC10A gene_id:10462|Hs108|chr17 ( 292) 253 66.9 1e-11 CCDS47724.1 CLEC2L gene_id:154790|Hs108|chr7 ( 214) 244 64.7 3.5e-11 CCDS11088.1 ASGR2 gene_id:433|Hs108|chr17 ( 287) 241 64.1 7.2e-11 CCDS45598.1 ASGR2 gene_id:433|Hs108|chr17 ( 292) 241 64.1 7.3e-11 CCDS32544.1 ASGR2 gene_id:433|Hs108|chr17 ( 311) 241 64.1 7.6e-11 >>CCDS8602.1 CLEC2D gene_id:29121|Hs108|chr12 (191 aa) initn: 1100 init1: 1100 opt: 1100 Z-score: 1391.4 bits: 263.9 E(32554): 3.5e-71 Smith-Waterman score: 1100; 99.4% identity (100.0% similar) in 154 aa overlap (1-154:1-154) 10 20 30 40 50 60 pF1KE6 MHDSNNVEKDITPSELPANPGCVHSKEHSIKATLIWRLFFLIMFLTIIVCGMVAALSAIR ::::::::::::::::::::::.::::::::::::::::::::::::::::::::::::: CCDS86 MHDSNNVEKDITPSELPANPGCLHSKEHSIKATLIWRLFFLIMFLTIIVCGMVAALSAIR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 ANCHQEPSVCLQAACPESWIGFQRKCFYFSDDTKNWTSSQRFCDSQDADLAQVESFQELN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS86 ANCHQEPSVCLQAACPESWIGFQRKCFYFSDDTKNWTSSQRFCDSQDADLAQVESFQELN 70 80 90 100 110 120 130 140 150 pF1KE6 FLLRYKGPSDHWIGLSREQGQPWKWINGTEWTRQ :::::::::::::::::::::::::::::::::: CCDS86 FLLRYKGPSDHWIGLSREQGQPWKWINGTEWTRQFPILGAGECAYLNDKGASSARHYTER 130 140 150 160 170 180 CCDS86 KWICSKSDIHV 190 >>CCDS31741.1 CLEC2D gene_id:29121|Hs108|chr12 (194 aa) initn: 1100 init1: 1100 opt: 1100 Z-score: 1391.3 bits: 263.9 E(32554): 3.6e-71 Smith-Waterman score: 1100; 99.4% identity (100.0% similar) in 154 aa overlap (1-154:1-154) 10 20 30 40 50 60 pF1KE6 MHDSNNVEKDITPSELPANPGCVHSKEHSIKATLIWRLFFLIMFLTIIVCGMVAALSAIR ::::::::::::::::::::::.::::::::::::::::::::::::::::::::::::: CCDS31 MHDSNNVEKDITPSELPANPGCLHSKEHSIKATLIWRLFFLIMFLTIIVCGMVAALSAIR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 ANCHQEPSVCLQAACPESWIGFQRKCFYFSDDTKNWTSSQRFCDSQDADLAQVESFQELN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 ANCHQEPSVCLQAACPESWIGFQRKCFYFSDDTKNWTSSQRFCDSQDADLAQVESFQELN 70 80 90 100 110 120 130 140 150 pF1KE6 FLLRYKGPSDHWIGLSREQGQPWKWINGTEWTRQ :::::::::::::::::::::::::::::::::: CCDS31 FLLRYKGPSDHWIGLSREQGQPWKWINGTEWTRQLVMKEDGANLYVAKVSQVPRMNPRPV 130 140 150 160 170 180 CCDS31 MVSYPGSRRVCLFE 190 >>CCDS55800.1 CLEC2D gene_id:29121|Hs108|chr12 (132 aa) initn: 829 init1: 829 opt: 829 Z-score: 1052.9 bits: 200.7 E(32554): 2.5e-52 Smith-Waterman score: 829; 99.2% identity (100.0% similar) in 119 aa overlap (1-119:1-119) 10 20 30 40 50 60 pF1KE6 MHDSNNVEKDITPSELPANPGCVHSKEHSIKATLIWRLFFLIMFLTIIVCGMVAALSAIR ::::::::::::::::::::::.::::::::::::::::::::::::::::::::::::: CCDS55 MHDSNNVEKDITPSELPANPGCLHSKEHSIKATLIWRLFFLIMFLTIIVCGMVAALSAIR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 ANCHQEPSVCLQAACPESWIGFQRKCFYFSDDTKNWTSSQRFCDSQDADLAQVESFQELN ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 ANCHQEPSVCLQAACPESWIGFQRKCFYFSDDTKNWTSSQRFCDSQDADLAQVESFQELV 70 80 90 100 110 120 130 140 150 pF1KE6 FLLRYKGPSDHWIGLSREQGQPWKWINGTEWTRQ CCDS55 SYPGSRRVCLFE 130 >>CCDS55801.1 CLEC2D gene_id:29121|Hs108|chr12 (154 aa) initn: 845 init1: 722 opt: 722 Z-score: 917.4 bits: 175.9 E(32554): 8.9e-45 Smith-Waterman score: 775; 76.0% identity (76.0% similar) in 154 aa overlap (1-154:1-117) 10 20 30 40 50 60 pF1KE6 MHDSNNVEKDITPSELPANPGCVHSKEHSIKATLIWRLFFLIMFLTIIVCGMVAALSAIR :::::::::::::::::::: ::: CCDS55 MHDSNNVEKDITPSELPANP-------------------------------------AIR 10 20 70 80 90 100 110 120 pF1KE6 ANCHQEPSVCLQAACPESWIGFQRKCFYFSDDTKNWTSSQRFCDSQDADLAQVESFQELN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 ANCHQEPSVCLQAACPESWIGFQRKCFYFSDDTKNWTSSQRFCDSQDADLAQVESFQELN 30 40 50 60 70 80 130 140 150 pF1KE6 FLLRYKGPSDHWIGLSREQGQPWKWINGTEWTRQ :::::::::::::::::::::::::::::::::: CCDS55 FLLRYKGPSDHWIGLSREQGQPWKWINGTEWTRQFPILGAGECAYLNDKGASSARHYTER 90 100 110 120 130 140 CCDS55 KWICSKSDIHV 150 >>CCDS55802.1 CLEC2D gene_id:29121|Hs108|chr12 (95 aa) initn: 574 init1: 451 opt: 451 Z-score: 579.5 bits: 112.6 E(32554): 5.9e-26 Smith-Waterman score: 504; 68.9% identity (68.9% similar) in 119 aa overlap (1-119:1-82) 10 20 30 40 50 60 pF1KE6 MHDSNNVEKDITPSELPANPGCVHSKEHSIKATLIWRLFFLIMFLTIIVCGMVAALSAIR :::::::::::::::::::: ::: CCDS55 MHDSNNVEKDITPSELPANP-------------------------------------AIR 10 20 70 80 90 100 110 120 pF1KE6 ANCHQEPSVCLQAACPESWIGFQRKCFYFSDDTKNWTSSQRFCDSQDADLAQVESFQELN ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 ANCHQEPSVCLQAACPESWIGFQRKCFYFSDDTKNWTSSQRFCDSQDADLAQVESFQELV 30 40 50 60 70 80 130 140 150 pF1KE6 FLLRYKGPSDHWIGLSREQGQPWKWINGTEWTRQ CCDS55 SYPGSRRVCLFE 90 >>CCDS8606.1 CLEC2A gene_id:387836|Hs108|chr12 (160 aa) initn: 366 init1: 351 opt: 379 Z-score: 485.8 bits: 96.1 E(32554): 9.8e-21 Smith-Waterman score: 379; 46.0% identity (69.4% similar) in 124 aa overlap (28-149:16-132) 10 20 30 40 50 pF1KE6 MHDSNNVEKDITPSELPANPGCVHSKEHSIKATLI--WRLFFLIMFLTIIVCGMVAALSA : : :: :.. :. ::.::. . . : CCDS86 MINPELRDGRADGFIHRIVPKLIQNWKIG-LMCFLSIIITTVCIIMIA 10 20 30 40 60 70 80 90 100 110 pF1KE6 IRANCHQEPSVCLQAACPESWIGFQRKCFYFSDDTKNWTSSQRFCDSQDADLAQVESFQE .. : .: .:: .:.: . :::::::::.:::.:. ::. : :.:::... .. CCDS86 TWSK-HAKP-----VACSGDWLGVRDKCFYFSDDTRNWTASKIFCSLQKAELAQIDTQED 50 60 70 80 90 100 120 130 140 150 pF1KE6 LNFLLRYKGPSDHWIGLSREQGQPWKWINGTEWTRQ ..:: :: : . :::::::.::. ::: ::: CCDS86 MEFLKRYAGTDMHWIGLSRKQGDSWKWTNGTTFNGWPSNSKWSCNWSLRQWLLLLGPLR 110 120 130 140 150 160 >>CCDS44829.1 CLEC2A gene_id:387836|Hs108|chr12 (174 aa) initn: 366 init1: 351 opt: 379 Z-score: 485.3 bits: 96.1 E(32554): 1e-20 Smith-Waterman score: 379; 46.0% identity (69.4% similar) in 124 aa overlap (28-149:16-132) 10 20 30 40 50 pF1KE6 MHDSNNVEKDITPSELPANPGCVHSKEHSIKATLI--WRLFFLIMFLTIIVCGMVAALSA : : :: :.. :. ::.::. . . : CCDS44 MINPELRDGRADGFIHRIVPKLIQNWKIG-LMCFLSIIITTVCIIMIA 10 20 30 40 60 70 80 90 100 110 pF1KE6 IRANCHQEPSVCLQAACPESWIGFQRKCFYFSDDTKNWTSSQRFCDSQDADLAQVESFQE .. : .: .:: .:.: . :::::::::.:::.:. ::. : :.:::... .. CCDS44 TWSK-HAKP-----VACSGDWLGVRDKCFYFSDDTRNWTASKIFCSLQKAELAQIDTQED 50 60 70 80 90 100 120 130 140 150 pF1KE6 LNFLLRYKGPSDHWIGLSREQGQPWKWINGTEWTRQ ..:: :: : . :::::::.::. ::: ::: CCDS44 MEFLKRYAGTDMHWIGLSRKQGDSWKWTNGTTFNGWFEIIGNGSFAFLSADGVHSSRGFI 110 120 130 140 150 160 CCDS44 DIKWICSKPKYFL 170 >>CCDS8604.1 CD69 gene_id:969|Hs108|chr12 (199 aa) initn: 375 init1: 326 opt: 358 Z-score: 458.0 bits: 91.2 E(32554): 3.4e-19 Smith-Waterman score: 358; 40.9% identity (71.3% similar) in 115 aa overlap (42-151:48-161) 20 30 40 50 60 pF1KE6 TPSELPANPGCVHSKEHSIKATLIWRLFFLIMFLTIIVCGMVAALSAIRANCHQE----- ..:.::.. ...: ::. . :: . CCDS86 SGQENDATSPHFSTRHEGSFQVPVLCAVMNVVFITILIIALIA-LSVGQYNCPGQYTFSM 20 30 40 50 60 70 70 80 90 100 110 120 pF1KE6 PSVCLQAACPESWIGFQRKCFYFSDDTKNWTSSQRFCDSQDADLAQVESFQELNFLLRYK :: ..: :.:.:.::::...: ..:::.: :. . : :: ..: ...::: :: CCDS86 PSDSHVSSCSEDWVGYQRKCYFISTVKRSWTSAQNACSEHGATLAVIDSEKDMNFLKRYA 80 90 100 110 120 130 130 140 150 pF1KE6 GPSDHWIGLSREQGQPWKWINGTEWTRQ : .::.::..: :.:::: :: :. CCDS86 GREEHWVGLKKEPGHPWKWSNGKEFNNWFNVTGSDKCVFLKNTEVSSMECEKNLYWICNK 140 150 160 170 180 190 >>CCDS8605.1 CLEC2B gene_id:9976|Hs108|chr12 (149 aa) initn: 352 init1: 308 opt: 314 Z-score: 404.5 bits: 80.9 E(32554): 3.3e-16 Smith-Waterman score: 314; 46.3% identity (78.0% similar) in 82 aa overlap (72-153:32-113) 50 60 70 80 90 100 pF1KE6 IMFLTIIVCGMVAALSAIRANCHQEPSVCLQAACPESWIGFQRKCFYFSDDTKNWTSSQR :. :: .::::: ::.::: . .:.::. CCDS86 MTKHKKCFIIVGVLITTNIITLIVKLTRDSQSLCPYDWIGFQNKCYYFSKEEGDWNSSKY 10 20 30 40 50 60 110 120 130 140 150 pF1KE6 FCDSQDADLAQVESFQELNFLLRYKGPSDHWIGLSREQGQPWKWINGTEWTRQ :..: :::. .....:.::: ::: :::::::. ... .:..:. .:. CCDS86 NCSTQHADLTIIDNIEEMNFLRRYKCSSDHWIGLKMAKNRTGQWVDGATFTKSFGMRGSE 70 80 90 100 110 120 CCDS86 GCAYLSDDGAATARCYTERKWICRKRIH 130 140 >>CCDS56087.1 CLEC17A gene_id:388512|Hs108|chr19 (378 aa) initn: 228 init1: 180 opt: 259 Z-score: 329.6 bits: 68.4 E(32554): 4.9e-12 Smith-Waterman score: 259; 38.8% identity (69.4% similar) in 85 aa overlap (70-152:249-333) 40 50 60 70 80 90 pF1KE6 FLIMFLTIIVCGMVAALSAIRANCHQEPSVCLQAACPESWIGFQRKCFYFSDDTKNWTSS : . .:::.:. :. ::.::: .::.: . CCDS56 MAGLAGLKHDIARVRADTNQSLVELWGLLDCRRITCPEGWLPFEGKCYYFSPSTKSWDEA 220 230 240 250 260 270 100 110 120 130 140 150 pF1KE6 QRFCDSQDADLAQVESFQELNFLLRYKG-PSDHWIGLS-REQGQPWKWINGTEWTRQ . ::. . . :. ..:: : ::. . .: : .:.::. : : :.:..:. : CCDS56 RMFCQENYSHLVIINSFAEHNFVAKAHGSPRVYWLGLNDRAQEGDWRWLDGSPVTLSFWE 280 290 300 310 320 330 CCDS56 PEEPNNIHDEDCATMNKGGTWNDLSCYKTTYWICERKCSC 340 350 360 370 154 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 14:23:01 2016 done: Tue Nov 8 14:23:02 2016 Total Scan time: 1.090 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]