FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1425, 364 aa 1>>>pF1KE1425 364 - 364 aa - 364 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.6313+/-0.000797; mu= 15.5992+/- 0.048 mean_var=86.5913+/-17.743, 0's: 0 Z-trim(109.3): 68 B-trim: 438 in 2/52 Lambda= 0.137828 statistics sampled from 10723 (10793) to 10723 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.701), E-opt: 0.2 (0.332), width: 16 Scan time: 2.910 The best scores are: opt bits E(32554) CCDS33084.1 CD33 gene_id:945|Hs108|chr19 ( 364) 2467 500.3 1.1e-141 CCDS54299.1 CD33 gene_id:945|Hs108|chr19 ( 310) 2095 426.3 1.8e-119 CCDS46157.1 CD33 gene_id:945|Hs108|chr19 ( 237) 1516 311.0 6.5e-85 CCDS12836.3 SIGLEC6 gene_id:946|Hs108|chr19 ( 353) 1071 222.7 3.9e-58 CCDS12834.3 SIGLEC6 gene_id:946|Hs108|chr19 ( 453) 1071 222.8 4.7e-58 CCDS54308.1 SIGLEC6 gene_id:946|Hs108|chr19 ( 389) 1036 215.8 5.2e-56 CCDS12835.3 SIGLEC6 gene_id:946|Hs108|chr19 ( 437) 1028 214.2 1.7e-55 CCDS12825.1 SIGLEC9 gene_id:27180|Hs108|chr19 ( 463) 998 208.3 1.1e-53 CCDS12826.1 SIGLEC7 gene_id:27036|Hs108|chr19 ( 467) 998 208.3 1.1e-53 CCDS56100.1 SIGLEC9 gene_id:27180|Hs108|chr19 ( 479) 998 208.3 1.1e-53 CCDS33086.1 SIGLEC8 gene_id:27181|Hs108|chr19 ( 499) 989 206.5 4.1e-53 CCDS59416.1 SIGLEC12 gene_id:89858|Hs108|chr19 ( 477) 960 200.7 2.1e-51 CCDS12833.1 SIGLEC12 gene_id:89858|Hs108|chr19 ( 595) 960 200.8 2.5e-51 CCDS42604.1 SIGLEC14 gene_id:100049587|Hs108|chr19 ( 396) 899 188.5 8.3e-48 CCDS33088.1 SIGLEC5 gene_id:8778|Hs108|chr19 ( 551) 869 182.7 6.7e-46 CCDS54307.1 SIGLEC6 gene_id:946|Hs108|chr19 ( 401) 863 181.4 1.2e-45 CCDS82384.1 SIGLEC10 gene_id:89790|Hs108|chr19 ( 512) 785 165.9 6.8e-41 CCDS54305.1 SIGLEC10 gene_id:89790|Hs108|chr19 ( 602) 785 166.0 7.7e-41 CCDS12832.1 SIGLEC10 gene_id:89790|Hs108|chr19 ( 697) 785 166.0 8.6e-41 CCDS46150.1 SIGLEC11 gene_id:114132|Hs108|chr19 ( 602) 732 155.5 1.1e-37 CCDS12790.2 SIGLEC11 gene_id:114132|Hs108|chr19 ( 698) 732 155.5 1.3e-37 CCDS42601.1 SIGLEC7 gene_id:27036|Hs108|chr19 ( 374) 711 151.1 1.4e-36 CCDS59417.1 SIGLEC6 gene_id:946|Hs108|chr19 ( 342) 534 115.9 5.2e-26 CCDS62771.1 SIGLEC7 gene_id:27036|Hs108|chr19 ( 145) 504 109.7 1.7e-24 CCDS54302.1 SIGLEC10 gene_id:89790|Hs108|chr19 ( 544) 484 106.1 7.4e-23 CCDS54303.1 SIGLEC10 gene_id:89790|Hs108|chr19 ( 639) 484 106.2 8.4e-23 CCDS54304.1 SIGLEC10 gene_id:89790|Hs108|chr19 ( 554) 471 103.5 4.5e-22 CCDS54301.1 SIGLEC10 gene_id:89790|Hs108|chr19 ( 454) 469 103.1 5.1e-22 CCDS12456.1 MAG gene_id:4099|Hs108|chr19 ( 582) 353 80.1 5.4e-15 CCDS12455.1 MAG gene_id:4099|Hs108|chr19 ( 626) 353 80.1 5.7e-15 CCDS56090.1 MAG gene_id:4099|Hs108|chr19 ( 601) 321 73.7 4.6e-13 >>CCDS33084.1 CD33 gene_id:945|Hs108|chr19 (364 aa) initn: 2467 init1: 2467 opt: 2467 Z-score: 2657.3 bits: 500.3 E(32554): 1.1e-141 Smith-Waterman score: 2467; 99.7% identity (99.7% similar) in 364 aa overlap (1-364:1-364) 10 20 30 40 50 60 pF1KE1 MPLLLLLPLLWAGALAMDPNFWLQVQESVTVQEGLCVLVPCTFFHPIPYYDKNSPVHGYW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 MPLLLLLPLLWAGALAMDPNFWLQVQESVTVQEGLCVLVPCTFFHPIPYYDKNSPVHGYW 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 FREGAIISGDSPVATNKLDQEVQEETQGRFRLLGDPSRNNCSLSIVDARRRDNGSYFFRM :::::::: ::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 FREGAIISRDSPVATNKLDQEVQEETQGRFRLLGDPSRNNCSLSIVDARRRDNGSYFFRM 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 ERGSTKYSYKSPQLSVHVTDLTHRPKILIPGTLEPGHSKNLTCSVSWACEQGTPPIFSWL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 ERGSTKYSYKSPQLSVHVTDLTHRPKILIPGTLEPGHSKNLTCSVSWACEQGTPPIFSWL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 SAAPTSLGPRTTHSSVLIITPRPQDHGTNLTCQVKFAGAGVTTERTIQLNVTYVPQNPTT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 SAAPTSLGPRTTHSSVLIITPRPQDHGTNLTCQVKFAGAGVTTERTIQLNVTYVPQNPTT 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 GIFPGDGSGKQETRAGVVHGAIGGAGVTALLALCLCLIFFIVKTHRRKAARTAVGRNDTH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 GIFPGDGSGKQETRAGVVHGAIGGAGVTALLALCLCLIFFIVKTHRRKAARTAVGRNDTH 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 PTTGSASPKHQKKSKLHGPTETSSCSGAAPTVEMDEELHYASLNFHGMNPSKDTSTEYSE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 PTTGSASPKHQKKSKLHGPTETSSCSGAAPTVEMDEELHYASLNFHGMNPSKDTSTEYSE 310 320 330 340 350 360 pF1KE1 VRTQ :::: CCDS33 VRTQ >>CCDS54299.1 CD33 gene_id:945|Hs108|chr19 (310 aa) initn: 2095 init1: 2095 opt: 2095 Z-score: 2258.5 bits: 426.3 E(32554): 1.8e-119 Smith-Waterman score: 2095; 99.7% identity (99.7% similar) in 308 aa overlap (1-308:1-308) 10 20 30 40 50 60 pF1KE1 MPLLLLLPLLWAGALAMDPNFWLQVQESVTVQEGLCVLVPCTFFHPIPYYDKNSPVHGYW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 MPLLLLLPLLWAGALAMDPNFWLQVQESVTVQEGLCVLVPCTFFHPIPYYDKNSPVHGYW 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 FREGAIISGDSPVATNKLDQEVQEETQGRFRLLGDPSRNNCSLSIVDARRRDNGSYFFRM :::::::: ::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 FREGAIISRDSPVATNKLDQEVQEETQGRFRLLGDPSRNNCSLSIVDARRRDNGSYFFRM 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 ERGSTKYSYKSPQLSVHVTDLTHRPKILIPGTLEPGHSKNLTCSVSWACEQGTPPIFSWL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 ERGSTKYSYKSPQLSVHVTDLTHRPKILIPGTLEPGHSKNLTCSVSWACEQGTPPIFSWL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 SAAPTSLGPRTTHSSVLIITPRPQDHGTNLTCQVKFAGAGVTTERTIQLNVTYVPQNPTT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 SAAPTSLGPRTTHSSVLIITPRPQDHGTNLTCQVKFAGAGVTTERTIQLNVTYVPQNPTT 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 GIFPGDGSGKQETRAGVVHGAIGGAGVTALLALCLCLIFFIVKTHRRKAARTAVGRNDTH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 GIFPGDGSGKQETRAGVVHGAIGGAGVTALLALCLCLIFFIVKTHRRKAARTAVGRNDTH 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 PTTGSASPKHQKKSKLHGPTETSSCSGAAPTVEMDEELHYASLNFHGMNPSKDTSTEYSE :::::::: CCDS54 PTTGSASPVR 310 >>CCDS46157.1 CD33 gene_id:945|Hs108|chr19 (237 aa) initn: 1584 init1: 1516 opt: 1516 Z-score: 1637.9 bits: 311.0 E(32554): 6.5e-85 Smith-Waterman score: 1516; 100.0% identity (100.0% similar) in 225 aa overlap (140-364:13-237) 110 120 130 140 150 160 pF1KE1 RRDNGSYFFRMERGSTKYSYKSPQLSVHVTDLTHRPKILIPGTLEPGHSKNLTCSVSWAC :::::::::::::::::::::::::::::: CCDS46 MPLLLLLPLLWADLTHRPKILIPGTLEPGHSKNLTCSVSWAC 10 20 30 40 170 180 190 200 210 220 pF1KE1 EQGTPPIFSWLSAAPTSLGPRTTHSSVLIITPRPQDHGTNLTCQVKFAGAGVTTERTIQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 EQGTPPIFSWLSAAPTSLGPRTTHSSVLIITPRPQDHGTNLTCQVKFAGAGVTTERTIQL 50 60 70 80 90 100 230 240 250 260 270 280 pF1KE1 NVTYVPQNPTTGIFPGDGSGKQETRAGVVHGAIGGAGVTALLALCLCLIFFIVKTHRRKA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 NVTYVPQNPTTGIFPGDGSGKQETRAGVVHGAIGGAGVTALLALCLCLIFFIVKTHRRKA 110 120 130 140 150 160 290 300 310 320 330 340 pF1KE1 ARTAVGRNDTHPTTGSASPKHQKKSKLHGPTETSSCSGAAPTVEMDEELHYASLNFHGMN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 ARTAVGRNDTHPTTGSASPKHQKKSKLHGPTETSSCSGAAPTVEMDEELHYASLNFHGMN 170 180 190 200 210 220 350 360 pF1KE1 PSKDTSTEYSEVRTQ ::::::::::::::: CCDS46 PSKDTSTEYSEVRTQ 230 >>CCDS12836.3 SIGLEC6 gene_id:946|Hs108|chr19 (353 aa) initn: 1113 init1: 861 opt: 1071 Z-score: 1157.3 bits: 222.7 E(32554): 3.9e-58 Smith-Waterman score: 1077; 67.7% identity (80.6% similar) in 248 aa overlap (3-248:13-251) 10 20 30 40 pF1KE1 MPLLLLLPLLWAGALAMDPNFWLQVQESVTVQEGLCVLVPCTFFHPIP-- : ::::::::::::.. : :. ::.:::::::::::: . .: CCDS12 MQGAQEASASEMLPLLLPLLWAGALAQERRFQLEGPESLTVQEGLCVLVPCRLPTTLPAS 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE1 YYDKNSPVHGYWFREGAIISGDSPVATNKLDQEVQEETQGRFRLLGDPSRNNCSLSIVDA :: .:::: ::: : ::::: :.::::::.:::.:: :: :.:::::: :: CCDS12 YYG-----YGYWFLEGA----DVPVATNDPDEEVQEETRGRFHLLWDPRRKNCSLSIRDA 70 80 90 100 110 110 120 130 140 150 160 pF1KE1 RRRDNGSYFFRMERGSTKYSYKSPQLSVHVTDLTHRPKILIPGTLEPGHSKNLTCSVSWA :::::..::::.. ::.: : .:::.: :::::.: :::::: :: .:::::: :. CCDS12 RRRDNAAYFFRLKSKWMKYGYTSSKLSVRVMALTHRPNISIPGTLESGHPSNLTCSVPWV 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE1 CEQGTPPIFSWLSAAPTSLGPRTTHSSVLIITPRPQDHGTNLTCQVKFAGAGVTTERTIQ :::::::::::.::::::::::::.:::: ::::::::.::::::: : ::::: ::::: CCDS12 CEQGTPPIFSWMSAAPTSLGPRTTQSSVLTITPRPQDHSTNLTCQVTFPGAGVTMERTIQ 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE1 LNVTYVPQNPTTGIFPGDGSGKQETRAGVVHGAIGGAGVTALLALCLCLIFFIVKTHRRK :::.:.::. . .:: :... CCDS12 LNVSYAPQKVAISIFQGNSAAFKILQNTSSLPVLEGQALRLLCDADGNPPAHLSWFQGFP 240 250 260 270 280 290 >>CCDS12834.3 SIGLEC6 gene_id:946|Hs108|chr19 (453 aa) initn: 1372 init1: 861 opt: 1071 Z-score: 1155.8 bits: 222.8 E(32554): 4.7e-58 Smith-Waterman score: 1077; 67.7% identity (80.6% similar) in 248 aa overlap (3-248:13-251) 10 20 30 40 pF1KE1 MPLLLLLPLLWAGALAMDPNFWLQVQESVTVQEGLCVLVPCTFFHPIP-- : ::::::::::::.. : :. ::.:::::::::::: . .: CCDS12 MQGAQEASASEMLPLLLPLLWAGALAQERRFQLEGPESLTVQEGLCVLVPCRLPTTLPAS 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE1 YYDKNSPVHGYWFREGAIISGDSPVATNKLDQEVQEETQGRFRLLGDPSRNNCSLSIVDA :: .:::: ::: : ::::: :.::::::.:::.:: :: :.:::::: :: CCDS12 YYG-----YGYWFLEGA----DVPVATNDPDEEVQEETRGRFHLLWDPRRKNCSLSIRDA 70 80 90 100 110 110 120 130 140 150 160 pF1KE1 RRRDNGSYFFRMERGSTKYSYKSPQLSVHVTDLTHRPKILIPGTLEPGHSKNLTCSVSWA :::::..::::.. ::.: : .:::.: :::::.: :::::: :: .:::::: :. CCDS12 RRRDNAAYFFRLKSKWMKYGYTSSKLSVRVMALTHRPNISIPGTLESGHPSNLTCSVPWV 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE1 CEQGTPPIFSWLSAAPTSLGPRTTHSSVLIITPRPQDHGTNLTCQVKFAGAGVTTERTIQ :::::::::::.::::::::::::.:::: ::::::::.::::::: : ::::: ::::: CCDS12 CEQGTPPIFSWMSAAPTSLGPRTTQSSVLTITPRPQDHSTNLTCQVTFPGAGVTMERTIQ 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE1 LNVTYVPQNPTTGIFPGDGSGKQETRAGVVHGAIGGAGVTALLALCLCLIFFIVKTHRRK :::.:.::. . .:: :... CCDS12 LNVSYAPQKVAISIFQGNSAAFKILQNTSSLPVLEGQALRLLCDADGNPPAHLSWFQGFP 240 250 260 270 280 290 >>CCDS54308.1 SIGLEC6 gene_id:946|Hs108|chr19 (389 aa) initn: 1189 init1: 826 opt: 1036 Z-score: 1119.1 bits: 215.8 E(32554): 5.2e-56 Smith-Waterman score: 1042; 68.5% identity (80.3% similar) in 238 aa overlap (3-238:13-241) 10 20 30 40 pF1KE1 MPLLLLLPLLWAGALAMDPNFWLQVQESVTVQEGLCVLVPCTFFHPIP-- : ::::::::::::.. : :. ::.:::::::::::: . .: CCDS54 MQGAQEASASEMLPLLLPLLWAGALAQERRFQLEGPESLTVQEGLCVLVPCRLPTTLPAS 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE1 YYDKNSPVHGYWFREGAIISGDSPVATNKLDQEVQEETQGRFRLLGDPSRNNCSLSIVDA :: .:::: ::: : ::::: :.::::::.:::.:: :: :.:::::: :: CCDS54 YYG-----YGYWFLEGA----DVPVATNDPDEEVQEETRGRFHLLWDPRRKNCSLSIRDA 70 80 90 100 110 110 120 130 140 150 160 pF1KE1 RRRDNGSYFFRMERGSTKYSYKSPQLSVHVTDLTHRPKILIPGTLEPGHSKNLTCSVSWA :::::..::::.. ::.: : .:::.: :::::.: :::::: :: .:::::: :. CCDS54 RRRDNAAYFFRLKSKWMKYGYTSSKLSVRVMALTHRPNISIPGTLESGHPSNLTCSVPWV 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE1 CEQGTPPIFSWLSAAPTSLGPRTTHSSVLIITPRPQDHGTNLTCQVKFAGAGVTTERTIQ :::::::::::.::::::::::::.:::: ::::::::.::::::: : ::::: ::::: CCDS54 CEQGTPPIFSWMSAAPTSLGPRTTQSSVLTITPRPQDHSTNLTCQVTFPGAGVTMERTIQ 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE1 LNVTYVPQNPTTGIFPGDGSGKQETRAGVVHGAIGGAGVTALLALCLCLIFFIVKTHRRK :::... . : CCDS54 LNVSWMLRRPPLSTPDAPQKVAISIFQGNSAAFKILQNTSSLPVLEGQALRLLCDADGNP 240 250 260 270 280 290 >>CCDS12835.3 SIGLEC6 gene_id:946|Hs108|chr19 (437 aa) initn: 1333 init1: 817 opt: 1028 Z-score: 1109.8 bits: 214.2 E(32554): 1.7e-55 Smith-Waterman score: 1077; 52.2% identity (65.0% similar) in 383 aa overlap (3-313:13-385) 10 20 30 40 pF1KE1 MPLLLLLPLLWAGALAMDPNFWLQVQESVTVQEGLCVLVPCTFFHPIP-- : ::::::::::::.. : :. ::.:::::::::::: . .: CCDS12 MQGAQEASASEMLPLLLPLLWAGALAQERRFQLEGPESLTVQEGLCVLVPCRLPTTLPAS 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE1 YYDKNSPVHGYWFREGAIISGDSPVATNKLDQEVQEETQGRFRLLGDPSRNNCSLSIVDA :: .:::: ::: : ::::: :.::::::.:::.:: :: :.:::::: :: CCDS12 YYG-----YGYWFLEGA----DVPVATNDPDEEVQEETRGRFHLLWDPRRKNCSLSIRDA 70 80 90 100 110 110 120 130 140 150 160 pF1KE1 RRRDNGSYFFRMERGSTKYSYKSPQLSVHVTDLTHRPKILIPGTLEPGHSKNLTCSVSWA :::::..::::.. ::.: : .:::.: :::::.: :::::: :: .:::::: :. CCDS12 RRRDNAAYFFRLKSKWMKYGYTSSKLSVRVMALTHRPNISIPGTLESGHPSNLTCSVPWV 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE1 CEQGTPPIFSWLSAAPTSLGPRTTHSSVLIITPRPQDHGTNLTCQVKFAGAGVTTERTIQ :::::::::::.::::::::::::.:::: ::::::::.::::::: : ::::: ::::: CCDS12 CEQGTPPIFSWMSAAPTSLGPRTTQSSVLTITPRPQDHSTNLTCQVTFPGAGVTMERTIQ 180 190 200 210 220 230 230 240 pF1KE1 LNVTY---------VP--------------QNP-------------------TTGIF--P :::. .: :: .::.. : CCDS12 LNVSSFKILQNTSSLPVLEGQALRLLCDADGNPPAHLSWFQGFPALNATPISNTGVLELP 240 250 260 270 280 290 250 260 270 pF1KE1 GDGSG--------------------------KQETRAGVVHGAIGGAGVTALLALCLCLI ::. : : ::: : ::. ::..:.:. ::.:.: CCDS12 QVGSAEEGDFTCRAQHPLGSLQISLSLFVHWKPEGRAGGVLGAVWGASITTLVFLCVCFI 300 310 320 330 340 350 280 290 300 310 320 330 pF1KE1 FFIVKTHRRKAARTAVGRNDTHPTTGSASPKHQKKSKLHGPTETSSCSGAAPTVEMDEEL : :::.:.:::. . . .:..:. :.: ::.. CCDS12 FR-VKTRRKKAAQPVQNTDDVNPVMVSGSRGHQHQFQTGIVSDHPAEAGPISEDEQELHY 360 370 380 390 400 410 340 350 360 pF1KE1 HYASLNFHGMNPSKDTSTEYSEVRTQ CCDS12 AVLHFHKVQPQEPKVTDTEYSEIKIHK 420 430 >>CCDS12825.1 SIGLEC9 gene_id:27180|Hs108|chr19 (463 aa) initn: 1248 init1: 857 opt: 998 Z-score: 1077.2 bits: 208.3 E(32554): 1.1e-53 Smith-Waterman score: 998; 61.3% identity (78.2% similar) in 248 aa overlap (3-248:2-249) 10 20 30 40 50 pF1KE1 MPLLLLLPLLWAGALAM-DPNFWLQVQESVTVQEGLCVLVPCTFFHPIPYYDKNSPV-HG :::::::::. : . . : .: :::::::::: :::.: .: . .:: :: CCDS12 MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPVVHG 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 YWFREGAIISGDSPVATNKLDQEVQEETQGRFRLLGDPSRNNCSLSIVDARRRDNGSYFF ::::::: . :.:::::. . : :::. ::.::::: .::.::: :::: : : ::: CCDS12 YWFREGANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLSIRDARRSDAGRYFF 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 RMERGSTKYSYKSPQLSVHVTDLTHRPKILIPGTLEPGHSKNLTCSVSWACEQGTPPIFS :::.:: :..:: .:::.:: :::::.:::::::: : .:::::: :::::::::..: CCDS12 RMEKGSIKWNYKHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMIS 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 WLSAAPTSLGPRTTHSSVLIITPRPQDHGTNLTCQVKFAGAGVTTERTIQLNVTYVPQNP :.... . : : ::.:::: . :.::::::.::::: : ::.:::..:..:::.: ::: CCDS12 WIGTSVSPLDPSTTRSSVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNL 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE1 TTGIFPGDGSGKQETRAGVVHGAIGGAGVTALLALCLCLIFFIVKTHRRKAARTAVGRND : .: :::. CCDS12 TMTVFQGDGTVSTVLGNGSSLSLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQP 240 250 260 270 280 290 >>CCDS12826.1 SIGLEC7 gene_id:27036|Hs108|chr19 (467 aa) initn: 1200 init1: 942 opt: 998 Z-score: 1077.1 bits: 208.3 E(32554): 1.1e-53 Smith-Waterman score: 998; 59.7% identity (76.7% similar) in 253 aa overlap (1-248:1-253) 10 20 30 40 50 pF1KE1 MPLLLLLPLLWA-----GALAMDPNFWLQVQESVTVQEGLCVLVPCTFFHPIPYYDKNSP : :::::::::. : . .. : .: :::::::.:: : :.: .:. ..: CCDS12 MLLLLLLPLLWGRERVEGQKSNRKDYSLTMQSSVTVQEGMCVHVRCSFSYPVDSQTDSDP 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 VHGYWFREGAIISGDSPVATNKLDQEVQEETQGRFRLLGDPSRNNCSLSIVDARRRDNGS ::::::: : :: .:::::. :::::. ::.:::::. .::.::: ::: : : CCDS12 VHGYWFRAGNDISWKAPVATNNPAWAVQEETRDRFHLLGDPQTKNCTLSIRDARMSDAGR 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE1 YFFRMERGSTKYSYKSPQLSVHVTDLTHRPKILIPGTLEPGHSKNLTCSVSWACEQGTPP ::::::.:. :..:: ::::.:: :::::.:::::::: : .:::::: ::::::::: CCDS12 YFFRMEKGNIKWNYKYDQLSVNVTALTHRPNILIPGTLESGCFQNLTCSVPWACEQGTPP 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE1 IFSWLSAAPTSLGPRTTHSSVLIITPRPQDHGTNLTCQVKFAGAGVTTERTIQLNVTYVP ..::.... . : : ::.:::: . :.:: :::.::::: . ::::::.:::::::.: : CCDS12 MISWMGTSVSPLHPSTTRSSVLTLIPQPQHHGTSLTCQVTLPGAGVTTNRTIQLNVSYPP 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE1 QNPTTGIFPGDGSGKQETRAGVVHGAIGGAGVTALLALCLCLIFFIVKTHRRKAARTAVG :: :. .: :.:. CCDS12 QNLTVTVFQGEGTASTALGNSSSLSVLEGQSLRLVCAVDSNPPARLSWTWRSLTLYPSQP 250 260 270 280 290 300 >>CCDS56100.1 SIGLEC9 gene_id:27180|Hs108|chr19 (479 aa) initn: 1186 init1: 857 opt: 998 Z-score: 1077.0 bits: 208.3 E(32554): 1.1e-53 Smith-Waterman score: 998; 61.3% identity (78.2% similar) in 248 aa overlap (3-248:2-249) 10 20 30 40 50 pF1KE1 MPLLLLLPLLWAGALAM-DPNFWLQVQESVTVQEGLCVLVPCTFFHPIPYYDKNSPV-HG :::::::::. : . . : .: :::::::::: :::.: .: . .:: :: CCDS56 MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPVVHG 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 YWFREGAIISGDSPVATNKLDQEVQEETQGRFRLLGDPSRNNCSLSIVDARRRDNGSYFF ::::::: . :.:::::. . : :::. ::.::::: .::.::: :::: : : ::: CCDS56 YWFREGANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLSIRDARRSDAGRYFF 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 RMERGSTKYSYKSPQLSVHVTDLTHRPKILIPGTLEPGHSKNLTCSVSWACEQGTPPIFS :::.:: :..:: .:::.:: :::::.:::::::: : .:::::: :::::::::..: CCDS56 RMEKGSIKWNYKHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMIS 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 WLSAAPTSLGPRTTHSSVLIITPRPQDHGTNLTCQVKFAGAGVTTERTIQLNVTYVPQNP :.... . : : ::.:::: . :.::::::.::::: : ::.:::..:..:::.: ::: CCDS56 WIGTSVSPLDPSTTRSSVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNL 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE1 TTGIFPGDGSGKQETRAGVVHGAIGGAGVTALLALCLCLIFFIVKTHRRKAARTAVGRND : .: :::. CCDS56 TMTVFQGDGTVSTVLGNGSSLSLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQP 240 250 260 270 280 290 364 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 00:20:39 2016 done: Mon Nov 7 00:20:40 2016 Total Scan time: 2.910 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]