FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE5354, 266 aa
1>>>pF1KE5354 266 - 266 aa - 266 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.3792+/-0.00105; mu= 14.1164+/- 0.063
mean_var=63.0325+/-12.473, 0's: 0 Z-trim(102.7): 81 B-trim: 37 in 1/48
Lambda= 0.161544
statistics sampled from 7005 (7091) to 7005 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.588), E-opt: 0.2 (0.218), width: 16
Scan time: 1.610
The best scores are:                                   opt bits E(32554)
CCDS3821.1 HPGD gene_id:3248|Hs108|chr4        ( 266) 1734 413.0 1.1e-115
CCDS54821.1 HPGD gene_id:3248|Hs108|chr4       ( 178) 1085 261.6 2.7e-70
CCDS58933.1 HPGD gene_id:3248|Hs108|chr4       ( 145)  945 229.0 1.5e-60
CCDS58935.1 HPGD gene_id:3248|Hs108|chr4       ( 143)  908 220.4 5.8e-58
CCDS58934.1 HPGD gene_id:3248|Hs108|chr4       ( 198)  834 203.2 1.2e-52
CCDS3812.1 CBR4 gene_id:84869|Hs108|chr4       ( 237)  362  93.2 1.8e-19
CCDS3619.1 HSD17B11 gene_id:51170|Hs108|chr4   ( 300)  297  78.1 8.1e-15
CCDS33375.1 PECR gene_id:55825|Hs108|chr2      ( 303)  288  76.0 3.5e-14
CCDS12736.1 HSD17B14 gene_id:51171|Hs108|chr19 ( 270)  277  73.4 1.9e-13
CCDS3663.1 BDH2 gene_id:56898|Hs108|chr4       ( 245)  275  72.9 2.4e-13
CCDS9604.1 DHRS2 gene_id:10202|Hs108|chr14     ( 280)  268  71.3 8.2e-13
CCDS41927.1 DHRS2 gene_id:10202|Hs108|chr14    ( 300)  259  69.2 3.7e-12
CCDS9605.1 DHRS4 gene_id:10901|Hs108|chr14     ( 278)  250  67.1 1.5e-11
CCDS6167.1 SDR16C5 gene_id:195814|Hs108|chr8   ( 309)  248  66.7 2.3e-11
CCDS83296.1 SDR16C5 gene_id:195814|Hs108|chr8  ( 318)  248  66.7 2.3e-11
CCDS81810.1 DHRS7 gene_id:51635|Hs108|chr14    ( 289)  242  65.3 5.6e-11
CCDS9743.1 DHRS7 gene_id:51635|Hs108|chr14     ( 339)  242  65.3 6.5e-11
>>CCDS3821.1 HPGD gene_id:3248|Hs108|chr4 (266 aa)
initn: 1734 init1: 1734 opt: 1734 Z-score: 2190.3 bits: 413.0 E(32554): 1.1e-115
Smith-Waterman score: 1734; 100.0% identity (100.0% similar) in 266 aa overlap (1-266:1-266)
10 20 30 40 50 60
pF1KE5 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQKTLF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQKTLF
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE5 IQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVNNEKNWEKTLQINLVSVISGTYLGLD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 IQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVNNEKNWEKTLQINLVSVISGTYLGLD
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE5 YMSKQNGGEGGIIINMSSLAGLMPVAQQPVYCASKHGIVGFTRSAALAANLMNSGVRLNA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 YMSKQNGGEGGIIINMSSLAGLMPVAQQPVYCASKHGIVGFTRSAALAANLMNSGVRLNA
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE5 ICPGFVNTAILESIEKEENMGQYIEYKDHIKDMIKYYGILDPPLIANGLITLIEDDALNG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 ICPGFVNTAILESIEKEENMGQYIEYKDHIKDMIKYYGILDPPLIANGLITLIEDDALNG
190 200 210 220 230 240
250 260
pF1KE5 AIMKITTSKGIHFQDYDTTPFQAKTQ
::::::::::::::::::::::::::
CCDS38 AIMKITTSKGIHFQDYDTTPFQAKTQ
250 260
>>CCDS54821.1 HPGD gene_id:3248|Hs108|chr4 (178 aa)
initn: 1085 init1: 1085 opt: 1085 Z-score: 1375.6 bits: 261.6 E(32554): 2.7e-70
Smith-Waterman score: 1085; 100.0% identity (100.0% similar) in 166 aa overlap (1-166:1-166)
10 20 30 40 50 60
pF1KE5 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQKTLF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQKTLF
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE5 IQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVNNEKNWEKTLQINLVSVISGTYLGLD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 IQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVNNEKNWEKTLQINLVSVISGTYLGLD
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE5 YMSKQNGGEGGIIINMSSLAGLMPVAQQPVYCASKHGIVGFTRSAALAANLMNSGVRLNA
::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 YMSKQNGGEGGIIINMSSLAGLMPVAQQPVYCASKHGIVGFTRSAAPTIDCQWIDNTH
130 140 150 160 170
190 200 210 220 230 240
pF1KE5 ICPGFVNTAILESIEKEENMGQYIEYKDHIKDMIKYYGILDPPLIANGLITLIEDDALNG
>>CCDS58933.1 HPGD gene_id:3248|Hs108|chr4 (145 aa)
initn: 945 init1: 945 opt: 945 Z-score: 1200.7 bits: 229.0 E(32554): 1.5e-60
Smith-Waterman score: 945; 100.0% identity (100.0% similar) in 145 aa overlap (122-266:1-145)
100 110 120 130 140 150
pF1KE5 AGVNNEKNWEKTLQINLVSVISGTYLGLDYMSKQNGGEGGIIINMSSLAGLMPVAQQPVY
::::::::::::::::::::::::::::::
CCDS58 MSKQNGGEGGIIINMSSLAGLMPVAQQPVY
10 20 30
160 170 180 190 200 210
pF1KE5 CASKHGIVGFTRSAALAANLMNSGVRLNAICPGFVNTAILESIEKEENMGQYIEYKDHIK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 CASKHGIVGFTRSAALAANLMNSGVRLNAICPGFVNTAILESIEKEENMGQYIEYKDHIK
40 50 60 70 80 90
220 230 240 250 260
pF1KE5 DMIKYYGILDPPLIANGLITLIEDDALNGAIMKITTSKGIHFQDYDTTPFQAKTQ
:::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 DMIKYYGILDPPLIANGLITLIEDDALNGAIMKITTSKGIHFQDYDTTPFQAKTQ
100 110 120 130 140
>>CCDS58935.1 HPGD gene_id:3248|Hs108|chr4 (143 aa)
initn: 908 init1: 908 opt: 908 Z-score: 1154.2 bits: 220.4 E(32554): 5.8e-58
Smith-Waterman score: 908; 100.0% identity (100.0% similar) in 140 aa overlap (1-140:1-140)
10 20 30 40 50 60
pF1KE5 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQKTLF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQKTLF
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE5 IQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVNNEKNWEKTLQINLVSVISGTYLGLD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 IQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVNNEKNWEKTLQINLVSVISGTYLGLD
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE5 YMSKQNGGEGGIIINMSSLAGLMPVAQQPVYCASKHGIVGFTRSAALAANLMNSGVRLNA
::::::::::::::::::::
CCDS58 YMSKQNGGEGGIIINMSSLAAHH
130 140
>>CCDS58934.1 HPGD gene_id:3248|Hs108|chr4 (198 aa)
initn: 834 init1: 834 opt: 834 Z-score: 1058.7 bits: 203.2 E(32554): 1.2e-52
Smith-Waterman score: 1150; 74.4% identity (74.4% similar) in 266 aa overlap (1-266:1-198)
10 20 30 40 50 60
pF1KE5 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQKTLF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQKTLF
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE5 IQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVNNEKNWEKTLQINLVSVISGTYLGLD
::::::::::::
CCDS58 IQCDVADQQQLR------------------------------------------------
70
130 140 150 160 170 180
pF1KE5 YMSKQNGGEGGIIINMSSLAGLMPVAQQPVYCASKHGIVGFTRSAALAANLMNSGVRLNA
::::::::::::::::::::::::::::::::::::::::
CCDS58 --------------------GLMPVAQQPVYCASKHGIVGFTRSAALAANLMNSGVRLNA
80 90 100 110
190 200 210 220 230 240
pF1KE5 ICPGFVNTAILESIEKEENMGQYIEYKDHIKDMIKYYGILDPPLIANGLITLIEDDALNG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 ICPGFVNTAILESIEKEENMGQYIEYKDHIKDMIKYYGILDPPLIANGLITLIEDDALNG
120 130 140 150 160 170
250 260
pF1KE5 AIMKITTSKGIHFQDYDTTPFQAKTQ
::::::::::::::::::::::::::
CCDS58 AIMKITTSKGIHFQDYDTTPFQAKTQ
180 190
>>CCDS3812.1 CBR4 gene_id:84869|Hs108|chr4 (237 aa)
initn: 320 init1: 136 opt: 362 Z-score: 463.0 bits: 93.2 E(32554): 1.8e-19
Smith-Waterman score: 366; 32.2% identity (61.2% similar) in 245 aa overlap (6-245:3-229)
10 20 30 40 50 60
pF1KE5 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQKTLF
:: : :...::::: :. . :: ..:.. :::.. ::: . :
CCDS38 MDKVCAVFGGSRGIGRAVAQLMARKGYRLAVIARNLEGA---KAAAGDL--GGDHLA
10 20 30 40 50
70 80 90 100 110
pF1KE5 IQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVNNEKNWEKTLQINLVSVISGTYLGL-
..:::: ......::... :.::...::: ::.: . .: ..:: . . ::
CCDS38 FSCDVAKEHDVQNTFEELEKHLGRVNFLVNAAGINRDGLLVRTKTEDMVSQLHTNLLGSM
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE5 ----DYMSKQNGGEGGIIINMSSLAGLMPVAQQPVYCASKHGIVGFTRSAALAANLMNSG
: . .:: :.:..:..:: . : :: ::: :.:::.: ::: .. .
CCDS38 LTCKAAMRTMIQQQGGSIVNVGSIVGLKGNSGQSVYSASKGGLVGFSR--ALAKEVARKK
120 130 140 150 160 170
180 190 200 210 220 230
pF1KE5 VRLNAICPGFVNTAILESIEKEENMGQYIEYKDHIKDMIKYYGILDPPLIANGLITLIED
.:.:.. ::::.: . ... ::: :.: : . . .:.... :.:.
CCDS38 IRVNVVAPGFVHTDMTKDL-KEE----------HLKKNIPLGRFGETIEVAHAVVFLLES
180 190 200 210
240 250 260
pF1KE5 DALNGAIMKITTSKGIHFQDYDTTPFQAKTQ
..: .. .
CCDS38 PYITGHVLVVDGGLQLIL
220 230
>>CCDS3619.1 HSD17B11 gene_id:51170|Hs108|chr4 (300 aa)
initn: 246 init1: 85 opt: 297 Z-score: 379.5 bits: 78.1 E(32554): 8.1e-15
Smith-Waterman score: 297; 30.7% identity (64.5% similar) in 228 aa overlap (3-220:34-251)
10 20 30
pF1KE5 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKV
:.:...:.:::..:::: : . .:.
CCDS36 LLDILLLLPLLIVCSLESFVKLFIPKRRKSVTGEIVLITGAGHGIGRLTAYEFAKLKSKL
10 20 30 40 50 60
40 50 60 70 80 90
pF1KE5 ALVDWNLEAGVQCKAALDEQFEPQKTLFIQCDVADQQQLRDTFRKVVDHFGRLDILVNNA
.: : : . :.. :: . . . :. : ...... .. .:: ..: ..::::::
CCDS36 VLWDIN-KHGLEETAAKCKGLGAKVHTFV-VDCSNREDIYSSAKKVKAEIGDVSILVNNA
70 80 90 100 110 120
100 110 120 130 140
pF1KE5 GV--------NNEKNWEKTLQINLVSVISGTYLGLDYMSKQNGGEGGIIINMSSLAGLMP
:: ... . :::...:... . : : :.:.: :. :....: :: .
CCDS36 GVVYTSDLFATQDPQIEKTFEVNVLAHFWTTKAFLPAMTKNNHGH---IVTVASAAGHVS
130 140 150 160 170
150 160 170 180 190 200
pF1KE5 VAQQPVYCASKHGIVGFTRSAA--LAANLMNSGVRLNAICPGFVNTAILESIEKEENMGQ
: .::.:: . ::: .. . ::: :. .::. . .::.::::..... ..:
CCDS36 VPFLLAYCSSKFAAVGFHKTLTDELAA-LQITGVKTTCLCPNFVNTGFIKN--PSTSLGP
180 190 200 210 220 230
210 220 230 240 250 260
pF1KE5 YIEYKDHIKDMIKYYGILDPPLIANGLITLIEDDALNGAIMKITTSKGIHFQDYDTTPFQ
.: .. .. .. .:::
CCDS36 TLEPEEVVNRLM--HGILTEQKMIFIPSSIAFLTTLERILPERFLAVLKRKISVKFDAVI
240 250 260 270 280 290
>>CCDS33375.1 PECR gene_id:55825|Hs108|chr2 (303 aa)
initn: 238 init1: 105 opt: 288 Z-score: 368.1 bits: 76.0 E(32554): 3.5e-14
Smith-Waterman score: 288; 27.0% identity (60.6% similar) in 274 aa overlap (3-257:16-273)
10 20 30 40
pF1KE5 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKA
..:.::.:::.: :::.:... :: :..:.... .:: . :.
CCDS33 MASWAKGRSYLAPGLLQGQVAIVTGGATGIGKAIVKELLELGSNVVIASRKLE---RLKS
10 20 30 40 50
50 60 70 80 90
pF1KE5 ALDE---QFEPQK---TLFIQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVN------
: :: .. : : .. :::.. ..... . ....: ::....::::.: .
CCDS33 AADELQANLPPTKQARVIPIQCNIRNEEEVNNLVKSTLDTFGKINFLVNNGGGQFLSPAE
60 70 80 90 100 110
100 110 120 130 140 150
pF1KE5 --NEKNWEKTLQINLVSVISGT-YLGLDYMSKQNGGEGGIIINM--SSLAGLMPVAQQPV
. :.:. .:. :: .:: :. .:. .:: :.:. . ::. :.: .
CCDS33 HISSKGWHAVLETNL----TGTFYMCKAVYSSWMKEHGGSIVNIIVPTKAGF-PLAVHS-
120 130 140 150 160 170
160 170 180 190 200 210
pF1KE5 YCASKHGIVGFTRSAALAANLMNSGVRLNAICPGFVNTAILESIEKEENMGQYIEYKDHI
:.. :. ..:.: :: . ::.:.: . :: . . ..:. . :: . .
CCDS33 -GAARAGVYNLTKSLAL--EWACSGIRINCVAPGVIYSQ--TAVENYGSWGQSFFEGSFQ
180 190 200 210 220
220 230 240 250 260
pF1KE5 KDMIKYYGILDPPLIANGLITLIEDDA--LNGAIMKITTSKGIHFQDYDTTPFQAKTQ
: : :. : ... . :. : ..: . . ..... ..:.
CCDS33 KIPAKRIGV--PEEVSSVVCFLLSPAASFITGQSVDVDGGRSLYTHSYEVPDHDNWPKGA
230 240 250 260 270 280
CCDS33 GDLSVVKKMKETFKEKAKL
290 300
>>CCDS12736.1 HSD17B14 gene_id:51171|Hs108|chr19 (270 aa)
initn: 309 init1: 110 opt: 277 Z-score: 355.1 bits: 73.4 E(32554): 1.9e-13
Smith-Waterman score: 314; 34.7% identity (67.8% similar) in 199 aa overlap (5-194:9-195)
10 20 30 40 50
pF1KE5 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQ
:::..:::...::: ....:.. .::.:.. : . :.: . :: :: :
CCDS12 MATGTRYAGKVVVVTGGGRGIGAGIVRAFVNSGARVVICDKD-ESGGR---AL-EQELPG
10 20 30 40 50
60 70 80 90 100
pF1KE5 KTLFIQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVN---------NEKNWEKTLQIN
..:: :::....... ... .::::: .::::: . . ..... :..:
CCDS12 -AVFILCDVTQEDDVKTLVSETIRRFGRLDCVVNNAGHHPPPQRPEETSAQGFRQLLELN
60 70 80 90 100 110
110 120 130 140 150 160
pF1KE5 LVSVISGTYLGLDYMSKQNGGEGGIIINMSSLAGLMPVAQQPVYCASKHGIVGFTRSAAL
:... . : :.: :. :..:. .::.:::.: . :: : :.: .....:. ::
CCDS12 LLGTYTLTKLALPYLRKSQGN----VINISSLVGAIGQAQAVPYVATKGAVTAMTK--AL
120 130 140 150 160
170 180 190 200 210 220
pF1KE5 AANLMNSGVRLNAICPGFVNTAILESIEKEENMGQYIEYKDHIKDMIKYYGILDPPLIAN
: . :::.: : :: . : . : .
CCDS12 ALDESPYGVRVNCISPGNIWTPLWEELAALMPDPRATIREGMLAQPLGRMGQPAEVGAAA
170 180 190 200 210 220
>>CCDS3663.1 BDH2 gene_id:56898|Hs108|chr4 (245 aa)
initn: 233 init1: 104 opt: 275 Z-score: 353.2 bits: 72.9 E(32554): 2.4e-13
Smith-Waterman score: 298; 35.6% identity (65.4% similar) in 208 aa overlap (3-199:4-194)
10 20 30 40 50
pF1KE5 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQKTL
..::: ..:.::::::.: : :. .:::: .: : :. .: : :.. .:
CCDS36 MGRLDGKVIILTAAAQGIGQAAALAFAREGAKVIATDIN-ESKLQ---EL-EKYPGIQTR
10 20 30 40 50
60 70 80 90 100 110
pF1KE5 FIQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVNN--------EKNWEKTLQINLVSV
. ::. ..:. : : . :. :::.: : :: . ::.:. ....:. :.
CCDS36 VL--DVTKKKQI-DQFANEVE---RLDVLFNVAGFVHHGTVLDCEEKDWDFSMNLNVRSM
60 70 80 90 100
120 130 140 150 160
pF1KE5 ISGTYLGLD-YMSKQNGGEGGIIINMSSLAGLMP-VAQQPVYCASKHGIVGFTRSAALAA
:: . .. :. . ..: ::::::.:. . :... :: ..: ...:.:.: .::
CCDS36 ----YLMIKAFLPKMLAQKSGNIINMSSVASSVKGVVNRCVYSTTKAAVIGLTKS--VAA
110 120 130 140 150 160
170 180 190 200 210 220
pF1KE5 NLMNSGVRLNAICPGFVNTAIL-ESIEKEENMGQYIEYKDHIKDMIKYYGILDPPLIANG
.....:.: : .::: :.: : : :. . :
CCDS36 DFIQQGIRCNCVCPGTVDTPSLQERIQARGNPEEARNDFLKRQKTGRFATAEEIAMLCVY
170 180 190 200 210 220
230 240 250 260
pF1KE5 LITLIEDDALNGAIMKITTSKGIHFQDYDTTPFQAKTQ
CCDS36 LASDESAYVTGNPVIIDGGWSL
230 240
266 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 00:03:04 2016 done: Tue Nov 8 00:03:04 2016
Total Scan time: 1.610 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]