FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE4505, 529 aa
1>>>pF1KE4505 529 - 529 aa - 529 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.0940+/-0.000889; mu= 13.6372+/- 0.053
mean_var=67.9567+/-13.904, 0's: 0 Z-trim(105.9): 14 B-trim: 149 in 2/50
Lambda= 0.155582
statistics sampled from 8702 (8706) to 8702 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.641), E-opt: 0.2 (0.267), width: 16
Scan time: 3.280
The best scores are: opt bits E(32554)
CCDS10243.1 HEXA gene_id:3073|Hs108|chr15 ( 529) 3650 828.5 0
CCDS81905.1 HEXA gene_id:3073|Hs108|chr15 ( 540) 3075 699.4 2.7e-201
CCDS4022.1 HEXB gene_id:3074|Hs108|chr5 ( 556) 2038 466.6 3.2e-131
CCDS78021.1 HEXB gene_id:3074|Hs108|chr5 ( 331) 1552 357.5 1.4e-98
>>CCDS10243.1 HEXA gene_id:3073|Hs108|chr15 (529 aa)
initn: 3650 init1: 3650 opt: 3650 Z-score: 4425.1 bits: 828.5 E(32554): 0
Smith-Waterman score: 3650; 99.8% identity (100.0% similar) in 529 aa overlap (1-529:1-529)
10 20 30 40 50 60
pF1KE4 MTSSRLWFSLLLAAAFAGRATALWPWPQNFQTSDQRYVLYPNNFQFQYDVSSAAQPGCSV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MTSSRLWFSLLLAAAFAGRATALWPWPQNFQTSDQRYVLYPNNFQFQYDVSSAAQPGCSV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 LDEAFQRYRDLLFGSGSWPRPYLTGKRHTLEKNVLVVSVVTPGCNQLPTLESVENYTLTI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 LDEAFQRYRDLLFGSGSWPRPYLTGKRHTLEKNVLVVSVVTPGCNQLPTLESVENYTLTI
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 NDDQCLLLSETVWGALRGLETFSQLVWKSAEGTFFINKTEIEDFPRFPHRGLLLDTSRHY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 NDDQCLLLSETVWGALRGLETFSQLVWKSAEGTFFINKTEIEDFPRFPHRGLLLDTSRHY
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 LPLSSILDTLDVMAYNKLNVFHWHLVDDPSFPYESFTFPELMRKGSYNPVTHIYTAQDVK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 LPLSSILDTLDVMAYNKLNVFHWHLVDDPSFPYESFTFPELMRKGSYNPVTHIYTAQDVK
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 EVIEYARLRGIRVLAEFDTPGHTLSWGPGIPGLLTPCYSGSEPSGTFGPVNPSLNNTYEF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 EVIEYARLRGIRVLAEFDTPGHTLSWGPGIPGLLTPCYSGSEPSGTFGPVNPSLNNTYEF
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 MSTFFLEVSSVFPDFYLHLGGDEVDFTCWKSNPEIQDFMRKKGFGEDFKQLESFYIQTLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MSTFFLEVSSVFPDFYLHLGGDEVDFTCWKSNPEIQDFMRKKGFGEDFKQLESFYIQTLL
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE4 DIVSSYGKGYVVWQEVFDNKVKIQPDTIIQVWREDIPVNYMKELELVTKAGFRALLSAPW
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 DIVSSYGKGYVVWQEVFDNKVKIQPDTIIQVWREDIPVNYMKELELVTKAGFRALLSAPW
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE4 YLNRISYGPDWKDFYVVEPLAFEGTPEQKALVIGGEACMWGEYVDNTNLVPRLWPRAGAV
:::::::::::::::.::::::::::::::::::::::::::::::::::::::::::::
CCDS10 YLNRISYGPDWKDFYIVEPLAFEGTPEQKALVIGGEACMWGEYVDNTNLVPRLWPRAGAV
430 440 450 460 470 480
490 500 510 520
pF1KE4 AERLWSNKLTSDLTFAYERLSHFRCELLRRGVQAQPLNVGFCEQEFEQT
:::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 AERLWSNKLTSDLTFAYERLSHFRCELLRRGVQAQPLNVGFCEQEFEQT
490 500 510 520
>>CCDS81905.1 HEXA gene_id:3073|Hs108|chr15 (540 aa)
initn: 3066 init1: 3066 opt: 3075 Z-score: 3727.4 bits: 699.4 E(32554): 2.7e-201
Smith-Waterman score: 3618; 97.8% identity (98.0% similar) in 540 aa overlap (1-529:1-540)
10 20 30 40 50 60
pF1KE4 MTSSRLWFSLLLAAAFAGRATALWPWPQNFQTSDQRYVLYPNNFQFQYDVSSAAQPGCSV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 MTSSRLWFSLLLAAAFAGRATALWPWPQNFQTSDQRYVLYPNNFQFQYDVSSAAQPGCSV
10 20 30 40 50 60
70 80 90 100
pF1KE4 LDEAFQRYRDLLFGSGSWPRPYLTG-----------KRHTLEKNVLVVSVVTPGCNQLPT
::::::::::::::::::::::::: ::::::::::::::::::::::::
CCDS81 LDEAFQRYRDLLFGSGSWPRPYLTGWPHQAYPVFLGKRHTLEKNVLVVSVVTPGCNQLPT
70 80 90 100 110 120
110 120 130 140 150 160
pF1KE4 LESVENYTLTINDDQCLLLSETVWGALRGLETFSQLVWKSAEGTFFINKTEIEDFPRFPH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 LESVENYTLTINDDQCLLLSETVWGALRGLETFSQLVWKSAEGTFFINKTEIEDFPRFPH
130 140 150 160 170 180
170 180 190 200 210 220
pF1KE4 RGLLLDTSRHYLPLSSILDTLDVMAYNKLNVFHWHLVDDPSFPYESFTFPELMRKGSYNP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 RGLLLDTSRHYLPLSSILDTLDVMAYNKLNVFHWHLVDDPSFPYESFTFPELMRKGSYNP
190 200 210 220 230 240
230 240 250 260 270 280
pF1KE4 VTHIYTAQDVKEVIEYARLRGIRVLAEFDTPGHTLSWGPGIPGLLTPCYSGSEPSGTFGP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 VTHIYTAQDVKEVIEYARLRGIRVLAEFDTPGHTLSWGPGIPGLLTPCYSGSEPSGTFGP
250 260 270 280 290 300
290 300 310 320 330 340
pF1KE4 VNPSLNNTYEFMSTFFLEVSSVFPDFYLHLGGDEVDFTCWKSNPEIQDFMRKKGFGEDFK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 VNPSLNNTYEFMSTFFLEVSSVFPDFYLHLGGDEVDFTCWKSNPEIQDFMRKKGFGEDFK
310 320 330 340 350 360
350 360 370 380 390 400
pF1KE4 QLESFYIQTLLDIVSSYGKGYVVWQEVFDNKVKIQPDTIIQVWREDIPVNYMKELELVTK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 QLESFYIQTLLDIVSSYGKGYVVWQEVFDNKVKIQPDTIIQVWREDIPVNYMKELELVTK
370 380 390 400 410 420
410 420 430 440 450 460
pF1KE4 AGFRALLSAPWYLNRISYGPDWKDFYVVEPLAFEGTPEQKALVIGGEACMWGEYVDNTNL
::::::::::::::::::::::::::.:::::::::::::::::::::::::::::::::
CCDS81 AGFRALLSAPWYLNRISYGPDWKDFYIVEPLAFEGTPEQKALVIGGEACMWGEYVDNTNL
430 440 450 460 470 480
470 480 490 500 510 520
pF1KE4 VPRLWPRAGAVAERLWSNKLTSDLTFAYERLSHFRCELLRRGVQAQPLNVGFCEQEFEQT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 VPRLWPRAGAVAERLWSNKLTSDLTFAYERLSHFRCELLRRGVQAQPLNVGFCEQEFEQT
490 500 510 520 530 540
>>CCDS4022.1 HEXB gene_id:3074|Hs108|chr5 (556 aa)
initn: 2016 init1: 805 opt: 2038 Z-score: 2469.2 bits: 466.6 E(32554): 3.2e-131
Smith-Waterman score: 2038; 56.9% identity (80.2% similar) in 504 aa overlap (22-525:55-554)
10 20 30 40 50
pF1KE4 MTSSRLWFSLLLAAAFAGRATALWPWPQNFQTSDQRYVLYPNNFQFQYDVS
:::: : . . . : :.:: .... .
CCDS40 AMLALLTQVALVVQVAEAARAPSVSAKPGPALWPLPLLVKMTPNLLHLAPENFYISHSPN
30 40 50 60 70 80
60 70 80 90 100 110
pF1KE4 SAAQPGCSVLDEAFQRYRDLLFGSGSWPRPYLTGKRHTLEKNVLVVSVVTPGCNQLPTLE
:.: :.:..:.:::.::. .:: .: . . .: ...:: .. :. .:..
CCDS40 STAGPSCTLLEEAFRRYHGYIFGFYKWHHEPAEFQAKTQVQQLLVSITLQSECDAFPNIS
90 100 110 120 130 140
120 130 140 150 160 170
pF1KE4 SVENYTLTINDDQCLLLSETVWGALRGLETFSQLVWKSAEGTFFINKTEIEDFPRFPHRG
: :.::: ... .: .. :::::::::::::::.... ::: ::.. : : ::: :::
CCDS40 SDESYTLLVKEPVAVLKANRVWGALRGLETFSQLVYQDSYGTFTINESTIIDSPRFSHRG
150 160 170 180 190 200
180 190 200 210 220 230
pF1KE4 LLLDTSRHYLPLSSILDTLDVMAYNKLNVFHWHLVDDPSFPYESFTFPELMRKGSYNPVT
.:.::::::::.. :: :::.::.::.::.:::.::: ::::.:.::::: ::::. ..
CCDS40 ILIDTSRHYLPVKIILKTLDAMAFNKFNVLHWHIVDDQSFPYQSITFPELSNKGSYS-LS
210 220 230 240 250 260
240 250 260 270 280 290
pF1KE4 HIYTAQDVKEVIEYARLRGIRVLAEFDTPGHTLSWGPGIPGLLTPCYSGSEPSGTFGPVN
:.:: .::. ::::::::::::: :::::::::::: : ::::::: .. .:::.:
CCDS40 HVYTPNDVRMVIEYARLRGIRVLPEFDTPGHTLSWGKGQKDLLTPCYSRQNKLDSFGPIN
270 280 290 300 310 320
300 310 320 330 340 350
pF1KE4 PSLNNTYEFMSTFFLEVSSVFPDFYLHLGGDEVDFTCWKSNPEIQDFMRKKGFGEDFKQL
:.::.:: :..::: :.: :::: ..:::::::.: ::.:::.::::::.:::: :::.:
CCDS40 PTLNTTYSFLTTFFKEISEVFPDQFIHLGGDEVEFKCWESNPKIQDFMRQKGFGTDFKKL
330 340 350 360 370 380
360 370 380 390 400 410
pF1KE4 ESFYIQTLLDIVSSYGKGYVVWQEVFDNKVKIQPDTIIQVWREDIPVNYMKELELVTKAG
:::::: .:::... .:: .:::::::.:.:. : ::..::... : .:: :: .:
CCDS40 ESFYIQKVLDIIATINKGSIVWQEVFDDKAKLAPGTIVEVWKDSA---YPEELSRVTASG
390 400 410 420 430 440
420 430 440 450 460 470
pF1KE4 FRALLSAPWYLNRISYGPDWKDFYVVEPLAFEGTPEQKALVIGGEACMWGEYVDNTNLVP
: ..:::::::. :::: ::. .: :::: : :: .:: : ::::::.:::::: :::.:
CCDS40 FPVILSAPWYLDLISYGQDWRKYYKVEPLDFGGTQKQKQLFIGGEACLWGEYVDATNLTP
450 460 470 480 490 500
480 490 500 510 520
pF1KE4 RLWPRAGAVAERLWSNKLTSDLTFAYERLSHFRCELLRRGVQAQPLNVGFCEQEFEQT
::::::.::.:::::.: . :. ::.::.. ::....::. :::: .:.:..:
CCDS40 RLWPRASAVGERLWSSKDVRDMDDAYDRLTRHRCRMVERGIAAQPLYAGYCNHENM
510 520 530 540 550
>>CCDS78021.1 HEXB gene_id:3074|Hs108|chr5 (331 aa)
initn: 1530 init1: 805 opt: 1552 Z-score: 1883.5 bits: 357.5 E(32554): 1.4e-98
Smith-Waterman score: 1552; 63.7% identity (84.4% similar) in 333 aa overlap (193-525:1-329)
170 180 190 200 210 220
pF1KE4 DFPRFPHRGLLLDTSRHYLPLSSILDTLDVMAYNKLNVFHWHLVDDPSFPYESFTFPELM
::.::.::.:::.::: ::::.:.:::::
CCDS78 MAFNKFNVLHWHIVDDQSFPYQSITFPELS
10 20 30
230 240 250 260 270 280
pF1KE4 RKGSYNPVTHIYTAQDVKEVIEYARLRGIRVLAEFDTPGHTLSWGPGIPGLLTPCYSGSE
::::. ..:.:: .::. ::::::::::::: :::::::::::: : ::::::: ..
CCDS78 NKGSYS-LSHVYTPNDVRMVIEYARLRGIRVLPEFDTPGHTLSWGKGQKDLLTPCYSRQN
40 50 60 70 80
290 300 310 320 330 340
pF1KE4 PSGTFGPVNPSLNNTYEFMSTFFLEVSSVFPDFYLHLGGDEVDFTCWKSNPEIQDFMRKK
.:::.::.::.:: :..::: :.: :::: ..:::::::.: ::.:::.::::::.:
CCDS78 KLDSFGPINPTLNTTYSFLTTFFKEISEVFPDQFIHLGGDEVEFKCWESNPKIQDFMRQK
90 100 110 120 130 140
350 360 370 380 390 400
pF1KE4 GFGEDFKQLESFYIQTLLDIVSSYGKGYVVWQEVFDNKVKIQPDTIIQVWREDIPVNYMK
::: :::.::::::: .:::... .:: .:::::::.:.:. : ::..::... : .
CCDS78 GFGTDFKKLESFYIQKVLDIIATINKGSIVWQEVFDDKAKLAPGTIVEVWKDSA---YPE
150 160 170 180 190 200
410 420 430 440 450 460
pF1KE4 ELELVTKAGFRALLSAPWYLNRISYGPDWKDFYVVEPLAFEGTPEQKALVIGGEACMWGE
:: :: .:: ..:::::::. :::: ::. .: :::: : :: .:: : ::::::.:::
CCDS78 ELSRVTASGFPVILSAPWYLDLISYGQDWRKYYKVEPLDFGGTQKQKQLFIGGEACLWGE
210 220 230 240 250 260
470 480 490 500 510 520
pF1KE4 YVDNTNLVPRLWPRAGAVAERLWSNKLTSDLTFAYERLSHFRCELLRRGVQAQPLNVGFC
::: :::.:::::::.::.:::::.: . :. ::.::.. ::....::. :::: .:.:
CCDS78 YVDATNLTPRLWPRASAVGERLWSSKDVRDMDDAYDRLTRHRCRMVERGIAAQPLYAGYC
270 280 290 300 310 320
pF1KE4 EQEFEQT
..:
CCDS78 NHENM
330
529 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 15:15:28 2016 done: Mon Nov 7 15:15:29 2016
Total Scan time: 3.280 Total Display time: 0.030
Function used was FASTA [36.3.4 Apr, 2011]