FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4505, 529 aa 1>>>pF1KE4505 529 - 529 aa - 529 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.0940+/-0.000889; mu= 13.6372+/- 0.053 mean_var=67.9567+/-13.904, 0's: 0 Z-trim(105.9): 14 B-trim: 149 in 2/50 Lambda= 0.155582 statistics sampled from 8702 (8706) to 8702 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.641), E-opt: 0.2 (0.267), width: 16 Scan time: 3.280 The best scores are: opt bits E(32554) CCDS10243.1 HEXA gene_id:3073|Hs108|chr15 ( 529) 3650 828.5 0 CCDS81905.1 HEXA gene_id:3073|Hs108|chr15 ( 540) 3075 699.4 2.7e-201 CCDS4022.1 HEXB gene_id:3074|Hs108|chr5 ( 556) 2038 466.6 3.2e-131 CCDS78021.1 HEXB gene_id:3074|Hs108|chr5 ( 331) 1552 357.5 1.4e-98 >>CCDS10243.1 HEXA gene_id:3073|Hs108|chr15 (529 aa) initn: 3650 init1: 3650 opt: 3650 Z-score: 4425.1 bits: 828.5 E(32554): 0 Smith-Waterman score: 3650; 99.8% identity (100.0% similar) in 529 aa overlap (1-529:1-529) 10 20 30 40 50 60 pF1KE4 MTSSRLWFSLLLAAAFAGRATALWPWPQNFQTSDQRYVLYPNNFQFQYDVSSAAQPGCSV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MTSSRLWFSLLLAAAFAGRATALWPWPQNFQTSDQRYVLYPNNFQFQYDVSSAAQPGCSV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 LDEAFQRYRDLLFGSGSWPRPYLTGKRHTLEKNVLVVSVVTPGCNQLPTLESVENYTLTI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LDEAFQRYRDLLFGSGSWPRPYLTGKRHTLEKNVLVVSVVTPGCNQLPTLESVENYTLTI 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 NDDQCLLLSETVWGALRGLETFSQLVWKSAEGTFFINKTEIEDFPRFPHRGLLLDTSRHY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 NDDQCLLLSETVWGALRGLETFSQLVWKSAEGTFFINKTEIEDFPRFPHRGLLLDTSRHY 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 LPLSSILDTLDVMAYNKLNVFHWHLVDDPSFPYESFTFPELMRKGSYNPVTHIYTAQDVK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LPLSSILDTLDVMAYNKLNVFHWHLVDDPSFPYESFTFPELMRKGSYNPVTHIYTAQDVK 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 EVIEYARLRGIRVLAEFDTPGHTLSWGPGIPGLLTPCYSGSEPSGTFGPVNPSLNNTYEF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 EVIEYARLRGIRVLAEFDTPGHTLSWGPGIPGLLTPCYSGSEPSGTFGPVNPSLNNTYEF 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 MSTFFLEVSSVFPDFYLHLGGDEVDFTCWKSNPEIQDFMRKKGFGEDFKQLESFYIQTLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MSTFFLEVSSVFPDFYLHLGGDEVDFTCWKSNPEIQDFMRKKGFGEDFKQLESFYIQTLL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 DIVSSYGKGYVVWQEVFDNKVKIQPDTIIQVWREDIPVNYMKELELVTKAGFRALLSAPW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 DIVSSYGKGYVVWQEVFDNKVKIQPDTIIQVWREDIPVNYMKELELVTKAGFRALLSAPW 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE4 YLNRISYGPDWKDFYVVEPLAFEGTPEQKALVIGGEACMWGEYVDNTNLVPRLWPRAGAV :::::::::::::::.:::::::::::::::::::::::::::::::::::::::::::: CCDS10 YLNRISYGPDWKDFYIVEPLAFEGTPEQKALVIGGEACMWGEYVDNTNLVPRLWPRAGAV 430 440 450 460 470 480 490 500 510 520 pF1KE4 AERLWSNKLTSDLTFAYERLSHFRCELLRRGVQAQPLNVGFCEQEFEQT ::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 AERLWSNKLTSDLTFAYERLSHFRCELLRRGVQAQPLNVGFCEQEFEQT 490 500 510 520 >>CCDS81905.1 HEXA gene_id:3073|Hs108|chr15 (540 aa) initn: 3066 init1: 3066 opt: 3075 Z-score: 3727.4 bits: 699.4 E(32554): 2.7e-201 Smith-Waterman score: 3618; 97.8% identity (98.0% similar) in 540 aa overlap (1-529:1-540) 10 20 30 40 50 60 pF1KE4 MTSSRLWFSLLLAAAFAGRATALWPWPQNFQTSDQRYVLYPNNFQFQYDVSSAAQPGCSV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 MTSSRLWFSLLLAAAFAGRATALWPWPQNFQTSDQRYVLYPNNFQFQYDVSSAAQPGCSV 10 20 30 40 50 60 70 80 90 100 pF1KE4 LDEAFQRYRDLLFGSGSWPRPYLTG-----------KRHTLEKNVLVVSVVTPGCNQLPT ::::::::::::::::::::::::: :::::::::::::::::::::::: CCDS81 LDEAFQRYRDLLFGSGSWPRPYLTGWPHQAYPVFLGKRHTLEKNVLVVSVVTPGCNQLPT 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE4 LESVENYTLTINDDQCLLLSETVWGALRGLETFSQLVWKSAEGTFFINKTEIEDFPRFPH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 LESVENYTLTINDDQCLLLSETVWGALRGLETFSQLVWKSAEGTFFINKTEIEDFPRFPH 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE4 RGLLLDTSRHYLPLSSILDTLDVMAYNKLNVFHWHLVDDPSFPYESFTFPELMRKGSYNP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 RGLLLDTSRHYLPLSSILDTLDVMAYNKLNVFHWHLVDDPSFPYESFTFPELMRKGSYNP 190 200 210 220 230 240 230 240 250 260 270 280 pF1KE4 VTHIYTAQDVKEVIEYARLRGIRVLAEFDTPGHTLSWGPGIPGLLTPCYSGSEPSGTFGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 VTHIYTAQDVKEVIEYARLRGIRVLAEFDTPGHTLSWGPGIPGLLTPCYSGSEPSGTFGP 250 260 270 280 290 300 290 300 310 320 330 340 pF1KE4 VNPSLNNTYEFMSTFFLEVSSVFPDFYLHLGGDEVDFTCWKSNPEIQDFMRKKGFGEDFK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 VNPSLNNTYEFMSTFFLEVSSVFPDFYLHLGGDEVDFTCWKSNPEIQDFMRKKGFGEDFK 310 320 330 340 350 360 350 360 370 380 390 400 pF1KE4 QLESFYIQTLLDIVSSYGKGYVVWQEVFDNKVKIQPDTIIQVWREDIPVNYMKELELVTK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 QLESFYIQTLLDIVSSYGKGYVVWQEVFDNKVKIQPDTIIQVWREDIPVNYMKELELVTK 370 380 390 400 410 420 410 420 430 440 450 460 pF1KE4 AGFRALLSAPWYLNRISYGPDWKDFYVVEPLAFEGTPEQKALVIGGEACMWGEYVDNTNL ::::::::::::::::::::::::::.::::::::::::::::::::::::::::::::: CCDS81 AGFRALLSAPWYLNRISYGPDWKDFYIVEPLAFEGTPEQKALVIGGEACMWGEYVDNTNL 430 440 450 460 470 480 470 480 490 500 510 520 pF1KE4 VPRLWPRAGAVAERLWSNKLTSDLTFAYERLSHFRCELLRRGVQAQPLNVGFCEQEFEQT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 VPRLWPRAGAVAERLWSNKLTSDLTFAYERLSHFRCELLRRGVQAQPLNVGFCEQEFEQT 490 500 510 520 530 540 >>CCDS4022.1 HEXB gene_id:3074|Hs108|chr5 (556 aa) initn: 2016 init1: 805 opt: 2038 Z-score: 2469.2 bits: 466.6 E(32554): 3.2e-131 Smith-Waterman score: 2038; 56.9% identity (80.2% similar) in 504 aa overlap (22-525:55-554) 10 20 30 40 50 pF1KE4 MTSSRLWFSLLLAAAFAGRATALWPWPQNFQTSDQRYVLYPNNFQFQYDVS :::: : . . . : :.:: .... . CCDS40 AMLALLTQVALVVQVAEAARAPSVSAKPGPALWPLPLLVKMTPNLLHLAPENFYISHSPN 30 40 50 60 70 80 60 70 80 90 100 110 pF1KE4 SAAQPGCSVLDEAFQRYRDLLFGSGSWPRPYLTGKRHTLEKNVLVVSVVTPGCNQLPTLE :.: :.:..:.:::.::. .:: .: . . .: ...:: .. :. .:.. CCDS40 STAGPSCTLLEEAFRRYHGYIFGFYKWHHEPAEFQAKTQVQQLLVSITLQSECDAFPNIS 90 100 110 120 130 140 120 130 140 150 160 170 pF1KE4 SVENYTLTINDDQCLLLSETVWGALRGLETFSQLVWKSAEGTFFINKTEIEDFPRFPHRG : :.::: ... .: .. :::::::::::::::.... ::: ::.. : : ::: ::: CCDS40 SDESYTLLVKEPVAVLKANRVWGALRGLETFSQLVYQDSYGTFTINESTIIDSPRFSHRG 150 160 170 180 190 200 180 190 200 210 220 230 pF1KE4 LLLDTSRHYLPLSSILDTLDVMAYNKLNVFHWHLVDDPSFPYESFTFPELMRKGSYNPVT .:.::::::::.. :: :::.::.::.::.:::.::: ::::.:.::::: ::::. .. CCDS40 ILIDTSRHYLPVKIILKTLDAMAFNKFNVLHWHIVDDQSFPYQSITFPELSNKGSYS-LS 210 220 230 240 250 260 240 250 260 270 280 290 pF1KE4 HIYTAQDVKEVIEYARLRGIRVLAEFDTPGHTLSWGPGIPGLLTPCYSGSEPSGTFGPVN :.:: .::. ::::::::::::: :::::::::::: : ::::::: .. .:::.: CCDS40 HVYTPNDVRMVIEYARLRGIRVLPEFDTPGHTLSWGKGQKDLLTPCYSRQNKLDSFGPIN 270 280 290 300 310 320 300 310 320 330 340 350 pF1KE4 PSLNNTYEFMSTFFLEVSSVFPDFYLHLGGDEVDFTCWKSNPEIQDFMRKKGFGEDFKQL :.::.:: :..::: :.: :::: ..:::::::.: ::.:::.::::::.:::: :::.: CCDS40 PTLNTTYSFLTTFFKEISEVFPDQFIHLGGDEVEFKCWESNPKIQDFMRQKGFGTDFKKL 330 340 350 360 370 380 360 370 380 390 400 410 pF1KE4 ESFYIQTLLDIVSSYGKGYVVWQEVFDNKVKIQPDTIIQVWREDIPVNYMKELELVTKAG :::::: .:::... .:: .:::::::.:.:. : ::..::... : .:: :: .: CCDS40 ESFYIQKVLDIIATINKGSIVWQEVFDDKAKLAPGTIVEVWKDSA---YPEELSRVTASG 390 400 410 420 430 440 420 430 440 450 460 470 pF1KE4 FRALLSAPWYLNRISYGPDWKDFYVVEPLAFEGTPEQKALVIGGEACMWGEYVDNTNLVP : ..:::::::. :::: ::. .: :::: : :: .:: : ::::::.:::::: :::.: CCDS40 FPVILSAPWYLDLISYGQDWRKYYKVEPLDFGGTQKQKQLFIGGEACLWGEYVDATNLTP 450 460 470 480 490 500 480 490 500 510 520 pF1KE4 RLWPRAGAVAERLWSNKLTSDLTFAYERLSHFRCELLRRGVQAQPLNVGFCEQEFEQT ::::::.::.:::::.: . :. ::.::.. ::....::. :::: .:.:..: CCDS40 RLWPRASAVGERLWSSKDVRDMDDAYDRLTRHRCRMVERGIAAQPLYAGYCNHENM 510 520 530 540 550 >>CCDS78021.1 HEXB gene_id:3074|Hs108|chr5 (331 aa) initn: 1530 init1: 805 opt: 1552 Z-score: 1883.5 bits: 357.5 E(32554): 1.4e-98 Smith-Waterman score: 1552; 63.7% identity (84.4% similar) in 333 aa overlap (193-525:1-329) 170 180 190 200 210 220 pF1KE4 DFPRFPHRGLLLDTSRHYLPLSSILDTLDVMAYNKLNVFHWHLVDDPSFPYESFTFPELM ::.::.::.:::.::: ::::.:.::::: CCDS78 MAFNKFNVLHWHIVDDQSFPYQSITFPELS 10 20 30 230 240 250 260 270 280 pF1KE4 RKGSYNPVTHIYTAQDVKEVIEYARLRGIRVLAEFDTPGHTLSWGPGIPGLLTPCYSGSE ::::. ..:.:: .::. ::::::::::::: :::::::::::: : ::::::: .. CCDS78 NKGSYS-LSHVYTPNDVRMVIEYARLRGIRVLPEFDTPGHTLSWGKGQKDLLTPCYSRQN 40 50 60 70 80 290 300 310 320 330 340 pF1KE4 PSGTFGPVNPSLNNTYEFMSTFFLEVSSVFPDFYLHLGGDEVDFTCWKSNPEIQDFMRKK .:::.::.::.:: :..::: :.: :::: ..:::::::.: ::.:::.::::::.: CCDS78 KLDSFGPINPTLNTTYSFLTTFFKEISEVFPDQFIHLGGDEVEFKCWESNPKIQDFMRQK 90 100 110 120 130 140 350 360 370 380 390 400 pF1KE4 GFGEDFKQLESFYIQTLLDIVSSYGKGYVVWQEVFDNKVKIQPDTIIQVWREDIPVNYMK ::: :::.::::::: .:::... .:: .:::::::.:.:. : ::..::... : . CCDS78 GFGTDFKKLESFYIQKVLDIIATINKGSIVWQEVFDDKAKLAPGTIVEVWKDSA---YPE 150 160 170 180 190 200 410 420 430 440 450 460 pF1KE4 ELELVTKAGFRALLSAPWYLNRISYGPDWKDFYVVEPLAFEGTPEQKALVIGGEACMWGE :: :: .:: ..:::::::. :::: ::. .: :::: : :: .:: : ::::::.::: CCDS78 ELSRVTASGFPVILSAPWYLDLISYGQDWRKYYKVEPLDFGGTQKQKQLFIGGEACLWGE 210 220 230 240 250 260 470 480 490 500 510 520 pF1KE4 YVDNTNLVPRLWPRAGAVAERLWSNKLTSDLTFAYERLSHFRCELLRRGVQAQPLNVGFC ::: :::.:::::::.::.:::::.: . :. ::.::.. ::....::. :::: .:.: CCDS78 YVDATNLTPRLWPRASAVGERLWSSKDVRDMDDAYDRLTRHRCRMVERGIAAQPLYAGYC 270 280 290 300 310 320 pF1KE4 EQEFEQT ..: CCDS78 NHENM 330 529 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 15:15:28 2016 done: Mon Nov 7 15:15:29 2016 Total Scan time: 3.280 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]