FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5354, 266 aa 1>>>pF1KE5354 266 - 266 aa - 266 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.3792+/-0.00105; mu= 14.1164+/- 0.063 mean_var=63.0325+/-12.473, 0's: 0 Z-trim(102.7): 81 B-trim: 37 in 1/48 Lambda= 0.161544 statistics sampled from 7005 (7091) to 7005 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.588), E-opt: 0.2 (0.218), width: 16 Scan time: 1.610 The best scores are: opt bits E(32554) CCDS3821.1 HPGD gene_id:3248|Hs108|chr4 ( 266) 1734 413.0 1.1e-115 CCDS54821.1 HPGD gene_id:3248|Hs108|chr4 ( 178) 1085 261.6 2.7e-70 CCDS58933.1 HPGD gene_id:3248|Hs108|chr4 ( 145) 945 229.0 1.5e-60 CCDS58935.1 HPGD gene_id:3248|Hs108|chr4 ( 143) 908 220.4 5.8e-58 CCDS58934.1 HPGD gene_id:3248|Hs108|chr4 ( 198) 834 203.2 1.2e-52 CCDS3812.1 CBR4 gene_id:84869|Hs108|chr4 ( 237) 362 93.2 1.8e-19 CCDS3619.1 HSD17B11 gene_id:51170|Hs108|chr4 ( 300) 297 78.1 8.1e-15 CCDS33375.1 PECR gene_id:55825|Hs108|chr2 ( 303) 288 76.0 3.5e-14 CCDS12736.1 HSD17B14 gene_id:51171|Hs108|chr19 ( 270) 277 73.4 1.9e-13 CCDS3663.1 BDH2 gene_id:56898|Hs108|chr4 ( 245) 275 72.9 2.4e-13 CCDS9604.1 DHRS2 gene_id:10202|Hs108|chr14 ( 280) 268 71.3 8.2e-13 CCDS41927.1 DHRS2 gene_id:10202|Hs108|chr14 ( 300) 259 69.2 3.7e-12 CCDS9605.1 DHRS4 gene_id:10901|Hs108|chr14 ( 278) 250 67.1 1.5e-11 CCDS6167.1 SDR16C5 gene_id:195814|Hs108|chr8 ( 309) 248 66.7 2.3e-11 CCDS83296.1 SDR16C5 gene_id:195814|Hs108|chr8 ( 318) 248 66.7 2.3e-11 CCDS81810.1 DHRS7 gene_id:51635|Hs108|chr14 ( 289) 242 65.3 5.6e-11 CCDS9743.1 DHRS7 gene_id:51635|Hs108|chr14 ( 339) 242 65.3 6.5e-11 >>CCDS3821.1 HPGD gene_id:3248|Hs108|chr4 (266 aa) initn: 1734 init1: 1734 opt: 1734 Z-score: 2190.3 bits: 413.0 E(32554): 1.1e-115 Smith-Waterman score: 1734; 100.0% identity (100.0% similar) in 266 aa overlap (1-266:1-266) 10 20 30 40 50 60 pF1KE5 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQKTLF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS38 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQKTLF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 IQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVNNEKNWEKTLQINLVSVISGTYLGLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS38 IQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVNNEKNWEKTLQINLVSVISGTYLGLD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 YMSKQNGGEGGIIINMSSLAGLMPVAQQPVYCASKHGIVGFTRSAALAANLMNSGVRLNA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS38 YMSKQNGGEGGIIINMSSLAGLMPVAQQPVYCASKHGIVGFTRSAALAANLMNSGVRLNA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 ICPGFVNTAILESIEKEENMGQYIEYKDHIKDMIKYYGILDPPLIANGLITLIEDDALNG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS38 ICPGFVNTAILESIEKEENMGQYIEYKDHIKDMIKYYGILDPPLIANGLITLIEDDALNG 190 200 210 220 230 240 250 260 pF1KE5 AIMKITTSKGIHFQDYDTTPFQAKTQ :::::::::::::::::::::::::: CCDS38 AIMKITTSKGIHFQDYDTTPFQAKTQ 250 260 >>CCDS54821.1 HPGD gene_id:3248|Hs108|chr4 (178 aa) initn: 1085 init1: 1085 opt: 1085 Z-score: 1375.6 bits: 261.6 E(32554): 2.7e-70 Smith-Waterman score: 1085; 100.0% identity (100.0% similar) in 166 aa overlap (1-166:1-166) 10 20 30 40 50 60 pF1KE5 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQKTLF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQKTLF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 IQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVNNEKNWEKTLQINLVSVISGTYLGLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 IQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVNNEKNWEKTLQINLVSVISGTYLGLD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 YMSKQNGGEGGIIINMSSLAGLMPVAQQPVYCASKHGIVGFTRSAALAANLMNSGVRLNA :::::::::::::::::::::::::::::::::::::::::::::: CCDS54 YMSKQNGGEGGIIINMSSLAGLMPVAQQPVYCASKHGIVGFTRSAAPTIDCQWIDNTH 130 140 150 160 170 190 200 210 220 230 240 pF1KE5 ICPGFVNTAILESIEKEENMGQYIEYKDHIKDMIKYYGILDPPLIANGLITLIEDDALNG >>CCDS58933.1 HPGD gene_id:3248|Hs108|chr4 (145 aa) initn: 945 init1: 945 opt: 945 Z-score: 1200.7 bits: 229.0 E(32554): 1.5e-60 Smith-Waterman score: 945; 100.0% identity (100.0% similar) in 145 aa overlap (122-266:1-145) 100 110 120 130 140 150 pF1KE5 AGVNNEKNWEKTLQINLVSVISGTYLGLDYMSKQNGGEGGIIINMSSLAGLMPVAQQPVY :::::::::::::::::::::::::::::: CCDS58 MSKQNGGEGGIIINMSSLAGLMPVAQQPVY 10 20 30 160 170 180 190 200 210 pF1KE5 CASKHGIVGFTRSAALAANLMNSGVRLNAICPGFVNTAILESIEKEENMGQYIEYKDHIK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 CASKHGIVGFTRSAALAANLMNSGVRLNAICPGFVNTAILESIEKEENMGQYIEYKDHIK 40 50 60 70 80 90 220 230 240 250 260 pF1KE5 DMIKYYGILDPPLIANGLITLIEDDALNGAIMKITTSKGIHFQDYDTTPFQAKTQ ::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 DMIKYYGILDPPLIANGLITLIEDDALNGAIMKITTSKGIHFQDYDTTPFQAKTQ 100 110 120 130 140 >>CCDS58935.1 HPGD gene_id:3248|Hs108|chr4 (143 aa) initn: 908 init1: 908 opt: 908 Z-score: 1154.2 bits: 220.4 E(32554): 5.8e-58 Smith-Waterman score: 908; 100.0% identity (100.0% similar) in 140 aa overlap (1-140:1-140) 10 20 30 40 50 60 pF1KE5 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQKTLF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQKTLF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 IQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVNNEKNWEKTLQINLVSVISGTYLGLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 IQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVNNEKNWEKTLQINLVSVISGTYLGLD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 YMSKQNGGEGGIIINMSSLAGLMPVAQQPVYCASKHGIVGFTRSAALAANLMNSGVRLNA :::::::::::::::::::: CCDS58 YMSKQNGGEGGIIINMSSLAAHH 130 140 >>CCDS58934.1 HPGD gene_id:3248|Hs108|chr4 (198 aa) initn: 834 init1: 834 opt: 834 Z-score: 1058.7 bits: 203.2 E(32554): 1.2e-52 Smith-Waterman score: 1150; 74.4% identity (74.4% similar) in 266 aa overlap (1-266:1-198) 10 20 30 40 50 60 pF1KE5 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQKTLF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQKTLF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 IQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVNNEKNWEKTLQINLVSVISGTYLGLD :::::::::::: CCDS58 IQCDVADQQQLR------------------------------------------------ 70 130 140 150 160 170 180 pF1KE5 YMSKQNGGEGGIIINMSSLAGLMPVAQQPVYCASKHGIVGFTRSAALAANLMNSGVRLNA :::::::::::::::::::::::::::::::::::::::: CCDS58 --------------------GLMPVAQQPVYCASKHGIVGFTRSAALAANLMNSGVRLNA 80 90 100 110 190 200 210 220 230 240 pF1KE5 ICPGFVNTAILESIEKEENMGQYIEYKDHIKDMIKYYGILDPPLIANGLITLIEDDALNG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 ICPGFVNTAILESIEKEENMGQYIEYKDHIKDMIKYYGILDPPLIANGLITLIEDDALNG 120 130 140 150 160 170 250 260 pF1KE5 AIMKITTSKGIHFQDYDTTPFQAKTQ :::::::::::::::::::::::::: CCDS58 AIMKITTSKGIHFQDYDTTPFQAKTQ 180 190 >>CCDS3812.1 CBR4 gene_id:84869|Hs108|chr4 (237 aa) initn: 320 init1: 136 opt: 362 Z-score: 463.0 bits: 93.2 E(32554): 1.8e-19 Smith-Waterman score: 366; 32.2% identity (61.2% similar) in 245 aa overlap (6-245:3-229) 10 20 30 40 50 60 pF1KE5 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQKTLF :: : :...::::: :. . :: ..:.. :::.. ::: . : CCDS38 MDKVCAVFGGSRGIGRAVAQLMARKGYRLAVIARNLEGA---KAAAGDL--GGDHLA 10 20 30 40 50 70 80 90 100 110 pF1KE5 IQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVNNEKNWEKTLQINLVSVISGTYLGL- ..:::: ......::... :.::...::: ::.: . .: ..:: . . :: CCDS38 FSCDVAKEHDVQNTFEELEKHLGRVNFLVNAAGINRDGLLVRTKTEDMVSQLHTNLLGSM 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 ----DYMSKQNGGEGGIIINMSSLAGLMPVAQQPVYCASKHGIVGFTRSAALAANLMNSG : . .:: :.:..:..:: . : :: ::: :.:::.: ::: .. . CCDS38 LTCKAAMRTMIQQQGGSIVNVGSIVGLKGNSGQSVYSASKGGLVGFSR--ALAKEVARKK 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE5 VRLNAICPGFVNTAILESIEKEENMGQYIEYKDHIKDMIKYYGILDPPLIANGLITLIED .:.:.. ::::.: . ... ::: :.: : . . .:.... :.:. CCDS38 IRVNVVAPGFVHTDMTKDL-KEE----------HLKKNIPLGRFGETIEVAHAVVFLLES 180 190 200 210 240 250 260 pF1KE5 DALNGAIMKITTSKGIHFQDYDTTPFQAKTQ ..: .. . CCDS38 PYITGHVLVVDGGLQLIL 220 230 >>CCDS3619.1 HSD17B11 gene_id:51170|Hs108|chr4 (300 aa) initn: 246 init1: 85 opt: 297 Z-score: 379.5 bits: 78.1 E(32554): 8.1e-15 Smith-Waterman score: 297; 30.7% identity (64.5% similar) in 228 aa overlap (3-220:34-251) 10 20 30 pF1KE5 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKV :.:...:.:::..:::: : . .:. CCDS36 LLDILLLLPLLIVCSLESFVKLFIPKRRKSVTGEIVLITGAGHGIGRLTAYEFAKLKSKL 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE5 ALVDWNLEAGVQCKAALDEQFEPQKTLFIQCDVADQQQLRDTFRKVVDHFGRLDILVNNA .: : : . :.. :: . . . :. : ...... .. .:: ..: ..:::::: CCDS36 VLWDIN-KHGLEETAAKCKGLGAKVHTFV-VDCSNREDIYSSAKKVKAEIGDVSILVNNA 70 80 90 100 110 120 100 110 120 130 140 pF1KE5 GV--------NNEKNWEKTLQINLVSVISGTYLGLDYMSKQNGGEGGIIINMSSLAGLMP :: ... . :::...:... . : : :.:.: :. :....: :: . CCDS36 GVVYTSDLFATQDPQIEKTFEVNVLAHFWTTKAFLPAMTKNNHGH---IVTVASAAGHVS 130 140 150 160 170 150 160 170 180 190 200 pF1KE5 VAQQPVYCASKHGIVGFTRSAA--LAANLMNSGVRLNAICPGFVNTAILESIEKEENMGQ : .::.:: . ::: .. . ::: :. .::. . .::.::::..... ..: CCDS36 VPFLLAYCSSKFAAVGFHKTLTDELAA-LQITGVKTTCLCPNFVNTGFIKN--PSTSLGP 180 190 200 210 220 230 210 220 230 240 250 260 pF1KE5 YIEYKDHIKDMIKYYGILDPPLIANGLITLIEDDALNGAIMKITTSKGIHFQDYDTTPFQ .: .. .. .. .::: CCDS36 TLEPEEVVNRLM--HGILTEQKMIFIPSSIAFLTTLERILPERFLAVLKRKISVKFDAVI 240 250 260 270 280 290 >>CCDS33375.1 PECR gene_id:55825|Hs108|chr2 (303 aa) initn: 238 init1: 105 opt: 288 Z-score: 368.1 bits: 76.0 E(32554): 3.5e-14 Smith-Waterman score: 288; 27.0% identity (60.6% similar) in 274 aa overlap (3-257:16-273) 10 20 30 40 pF1KE5 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKA ..:.::.:::.: :::.:... :: :..:.... .:: . :. CCDS33 MASWAKGRSYLAPGLLQGQVAIVTGGATGIGKAIVKELLELGSNVVIASRKLE---RLKS 10 20 30 40 50 50 60 70 80 90 pF1KE5 ALDE---QFEPQK---TLFIQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVN------ : :: .. : : .. :::.. ..... . ....: ::....::::.: . CCDS33 AADELQANLPPTKQARVIPIQCNIRNEEEVNNLVKSTLDTFGKINFLVNNGGGQFLSPAE 60 70 80 90 100 110 100 110 120 130 140 150 pF1KE5 --NEKNWEKTLQINLVSVISGT-YLGLDYMSKQNGGEGGIIINM--SSLAGLMPVAQQPV . :.:. .:. :: .:: :. .:. .:: :.:. . ::. :.: . CCDS33 HISSKGWHAVLETNL----TGTFYMCKAVYSSWMKEHGGSIVNIIVPTKAGF-PLAVHS- 120 130 140 150 160 170 160 170 180 190 200 210 pF1KE5 YCASKHGIVGFTRSAALAANLMNSGVRLNAICPGFVNTAILESIEKEENMGQYIEYKDHI :.. :. ..:.: :: . ::.:.: . :: . . ..:. . :: . . CCDS33 -GAARAGVYNLTKSLAL--EWACSGIRINCVAPGVIYSQ--TAVENYGSWGQSFFEGSFQ 180 190 200 210 220 220 230 240 250 260 pF1KE5 KDMIKYYGILDPPLIANGLITLIEDDA--LNGAIMKITTSKGIHFQDYDTTPFQAKTQ : : :. : ... . :. : ..: . . ..... ..:. CCDS33 KIPAKRIGV--PEEVSSVVCFLLSPAASFITGQSVDVDGGRSLYTHSYEVPDHDNWPKGA 230 240 250 260 270 280 CCDS33 GDLSVVKKMKETFKEKAKL 290 300 >>CCDS12736.1 HSD17B14 gene_id:51171|Hs108|chr19 (270 aa) initn: 309 init1: 110 opt: 277 Z-score: 355.1 bits: 73.4 E(32554): 1.9e-13 Smith-Waterman score: 314; 34.7% identity (67.8% similar) in 199 aa overlap (5-194:9-195) 10 20 30 40 50 pF1KE5 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQ :::..:::...::: ....:.. .::.:.. : . :.: . :: :: : CCDS12 MATGTRYAGKVVVVTGGGRGIGAGIVRAFVNSGARVVICDKD-ESGGR---AL-EQELPG 10 20 30 40 50 60 70 80 90 100 pF1KE5 KTLFIQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVN---------NEKNWEKTLQIN ..:: :::....... ... .::::: .::::: . . ..... :..: CCDS12 -AVFILCDVTQEDDVKTLVSETIRRFGRLDCVVNNAGHHPPPQRPEETSAQGFRQLLELN 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE5 LVSVISGTYLGLDYMSKQNGGEGGIIINMSSLAGLMPVAQQPVYCASKHGIVGFTRSAAL :... . : :.: :. :..:. .::.:::.: . :: : :.: .....:. :: CCDS12 LLGTYTLTKLALPYLRKSQGN----VINISSLVGAIGQAQAVPYVATKGAVTAMTK--AL 120 130 140 150 160 170 180 190 200 210 220 pF1KE5 AANLMNSGVRLNAICPGFVNTAILESIEKEENMGQYIEYKDHIKDMIKYYGILDPPLIAN : . :::.: : :: . : . : . CCDS12 ALDESPYGVRVNCISPGNIWTPLWEELAALMPDPRATIREGMLAQPLGRMGQPAEVGAAA 170 180 190 200 210 220 >>CCDS3663.1 BDH2 gene_id:56898|Hs108|chr4 (245 aa) initn: 233 init1: 104 opt: 275 Z-score: 353.2 bits: 72.9 E(32554): 2.4e-13 Smith-Waterman score: 298; 35.6% identity (65.4% similar) in 208 aa overlap (3-199:4-194) 10 20 30 40 50 pF1KE5 MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQKTL ..::: ..:.::::::.: : :. .:::: .: : :. .: : :.. .: CCDS36 MGRLDGKVIILTAAAQGIGQAAALAFAREGAKVIATDIN-ESKLQ---EL-EKYPGIQTR 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 FIQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVNN--------EKNWEKTLQINLVSV . ::. ..:. : : . :. :::.: : :: . ::.:. ....:. :. CCDS36 VL--DVTKKKQI-DQFANEVE---RLDVLFNVAGFVHHGTVLDCEEKDWDFSMNLNVRSM 60 70 80 90 100 120 130 140 150 160 pF1KE5 ISGTYLGLD-YMSKQNGGEGGIIINMSSLAGLMP-VAQQPVYCASKHGIVGFTRSAALAA :: . .. :. . ..: ::::::.:. . :... :: ..: ...:.:.: .:: CCDS36 ----YLMIKAFLPKMLAQKSGNIINMSSVASSVKGVVNRCVYSTTKAAVIGLTKS--VAA 110 120 130 140 150 160 170 180 190 200 210 220 pF1KE5 NLMNSGVRLNAICPGFVNTAIL-ESIEKEENMGQYIEYKDHIKDMIKYYGILDPPLIANG .....:.: : .::: :.: : : :. . : CCDS36 DFIQQGIRCNCVCPGTVDTPSLQERIQARGNPEEARNDFLKRQKTGRFATAEEIAMLCVY 170 180 190 200 210 220 230 240 250 260 pF1KE5 LITLIEDDALNGAIMKITTSKGIHFQDYDTTPFQAKTQ CCDS36 LASDESAYVTGNPVIIDGGWSL 230 240 266 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 00:03:04 2016 done: Tue Nov 8 00:03:04 2016 Total Scan time: 1.610 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]