FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6654, 244 aa
  1>>>pF1KE6654 244 - 244 aa - 244 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.2206+/-0.000925; mu= 14.6214+/- 0.055
 mean_var=69.4494+/-14.571, 0's: 0 Z-trim(104.9): 69 B-trim: 377 in 1/50
 Lambda= 0.153901
 statistics sampled from 8056 (8126) to 8056 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.626), E-opt: 0.2 (0.25), width: 16
 Scan time: 1.900

The best scores are:                                    opt  bits E(32554)
CCDS11799.1 DCXR gene_id:51181|Hs108|chr17      ( 244) 1580 359.9 8.9e-100
CCDS9605.1 DHRS4 gene_id:10901|Hs108|chr14      ( 278)  383  94.2    1e-19
CCDS3663.1 BDH2 gene_id:56898|Hs108|chr4        ( 245)  320  80.1  1.5e-15
CCDS12736.1 HSD17B14 gene_id:51171|Hs108|chr19  ( 270)  320  80.2  1.6e-15
CCDS10409.1 DECR2 gene_id:26063|Hs108|chr16     ( 292)  309  77.7  9.2e-15
CCDS4769.1 HSD17B8 gene_id:7923|Hs108|chr6      ( 261)  299  75.5  3.9e-14
CCDS9604.1 DHRS2 gene_id:10202|Hs108|chr14      ( 280)  299  75.5  4.1e-14
CCDS3812.1 CBR4 gene_id:84869|Hs108|chr4        ( 237)  292  73.9  1.1e-13
CCDS41927.1 DHRS2 gene_id:10202|Hs108|chr14     ( 300)  264  67.8  9.6e-12
CCDS9606.2 DHRS4L2 gene_id:317749|Hs108|chr14   ( 232)  256  65.9  2.7e-11

>>CCDS11799.1 DCXR gene_id:51181|Hs108|chr17 (244 aa)
 initn: 1580 init1: 1580 opt: 1580 Z-score: 1904.8 bits: 359.9 E(32554): 8.9e-100
Smith-Waterman score: 1580; 100.0% identity (100.0% similar) in 244 aa overlap (1-244:1-244)

               10        20        30        40        50        60
pF1KE6 MELFLAGRRVLVTGAGKGIGRGTVQALHATGARVVAVSRTQADLDSLVRECPGIEPVCVD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MELFLAGRRVLVTGAGKGIGRGTVQALHATGARVVAVSRTQADLDSLVRECPGIEPVCVD
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 LGDWEATERALGSVGPVDLLVNNAAVALLQPFLEVTKEAFDRSFEVNLRAVIQVSQIVAR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 
LGDWEATERALGSVGPVDLLVNNAAVALLQPFLEVTKEAFDRSFEVNLRAVIQVSQIVAR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 GLIARGVPGAIVNVSSQCSQRAVTNHSVYCSTKGALDMLTKVMALELGPHKIRVNAVNPT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 GLIARGVPGAIVNVSSQCSQRAVTNHSVYCSTKGALDMLTKVMALELGPHKIRVNAVNPT 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 VVMTSMGQATWSDPHKAKTMLNRIPLGKFAEVEHVVNAILFLLSDRSGMTTGSTLPVEGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 VVMTSMGQATWSDPHKAKTMLNRIPLGKFAEVEHVVNAILFLLSDRSGMTTGSTLPVEGG 190 200 210 220 230 240 pF1KE6 FWAC :::: CCDS11 FWAC >>CCDS9605.1 DHRS4 gene_id:10901|Hs108|chr14 (278 aa) initn: 289 init1: 211 opt: 383 Z-score: 467.6 bits: 94.2 E(32554): 1e-19 Smith-Waterman score: 383; 32.2% identity (63.7% similar) in 245 aa overlap (5-240:30-273) 10 20 30 pF1KE6 MELFLAGRRVLVTGAGKGIGRGTVQALHATGARVV ::.. .:::.. ::: . .. : ::.:: CCDS96 MHKAGLLGLCARAWNSVRMASSGMTRRDPLANKVALVTASTDGIGFAIARRLAQDGAHVV 10 20 30 40 50 60 40 50 60 70 80 pF1KE6 AVSRTQADLDSLVRECPG----IEPVCVDLGDWEATERALGSV----GPVDLLVNNAAVA . :: : ..:. : : . . .: : :: .... : .:.::.:::: CCDS96 VSSRKQQNVDQAVATLQGEGLSVTGTVCHVGKAEDRERLVATAVKLHGGIDILVSNAAVN 70 80 90 100 110 120 90 100 110 120 130 140 pF1KE6 -LLQPFLEVTKEAFDRSFEVNLRAVIQVSQIVARGLIARGVPGAIVNVSSQCSQRAVTNH .. ...::.:..:.....:..: ... :. . :: :..: ::: . . CCDS96 PFFGSIMDVTEEVWDKTLDINVKAPALMTKAVVPEMEKRG-GGSVVIVSSIAAFSPSPGF 130 140 150 160 170 150 160 170 180 190 200 pF1KE6 SVYCSTKGALDMLTKVMALELGPHKIRVNAVNPTVVMTSMGQATWSDPHKAKTMLNRIPL : : .: :: :::..:.::.:..:::: . : .. ::... : : .: ..: . . . CCDS96 SPYNVSKTALLGLTKTLAIELAPRNIRVNCLAPGLIKTSFSRMLWMDKEKEESMKETLRI 180 190 200 210 220 230 210 220 230 240 pF1KE6 GKFAEVEHVVNAILFLLSDRSGMTTGSTLPVEGGFWAC ...: : .. . :: :. ... :: :. 
: :: CCDS96 RRLGEPEDCAGIVSFLCSEDASYITGETVVVGGGTPSRL 240 250 260 270 >>CCDS3663.1 BDH2 gene_id:56898|Hs108|chr4 (245 aa) initn: 226 init1: 97 opt: 320 Z-score: 392.8 bits: 80.1 E(32554): 1.5e-15 Smith-Waterman score: 320; 29.0% identity (66.5% similar) in 245 aa overlap (5-243:4-244) 10 20 30 40 50 60 pF1KE6 MELFLAGRRVLVTGAGKGIGRGTVQALHATGARVVAVSRTQADLDSLVRECPGIEPVCVD : :. ...:.:..:::.... :. ::.:.:.. ... :. : .. :::. .: CCDS36 MGRLDGKVIILTAAAQGIGQAAALAFAREGAKVIATDINESKLQEL-EKYPGIQTRVLD 10 20 30 40 50 70 80 90 100 110 pF1KE6 LGDWEATERALGSVGPVDLLVNNAAVALLQPFLEVTKEAFDRSFEVNLRAV-IQVSQIVA . . .. . : .:.: : :. . :. .. .: :...:.:.. .... .. CCDS36 VTKKKQIDQFANEVERLDVLFNVAGFVHHGTVLDCEEKDWDFSMNLNVRSMYLMIKAFLP 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 RGLIARGVPGAIVNVSSQCSQ-RAVTNHSVYCSTKGALDMLTKVMALELGPHKIRVNAVN . : .. : :.:.:: :. ..:.:. :: .::.:. ::: .: .. . :: : : CCDS36 KMLAQKS--GNIINMSSVASSVKGVVNRCVYSTTKAAVIGLTKSVAADFIQQGIRCNCVC 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 PTVVMTSMGQA---TWSDPHKAKT-MLNRIPLGKFAEVEHVVNAILFLLSDRSGMTTGST : .: : : . ..:..:.. .:.: :.:: .:... ..: ::.:...::. CCDS36 PGTVDTPSLQERIQARGNPEEARNDFLKRQKTGRFATAEEIAMLCVYLASDESAYVTGNP 180 190 200 210 220 230 240 pF1KE6 LPVEGGFWAC . ..:: :. CCDS36 VIIDGG-WSL 240 >>CCDS12736.1 HSD17B14 gene_id:51171|Hs108|chr19 (270 aa) initn: 283 init1: 137 opt: 320 Z-score: 392.2 bits: 80.2 E(32554): 1.6e-15 Smith-Waterman score: 320; 30.5% identity (61.0% similar) in 246 aa overlap (6-240:8-248) 10 20 30 40 50 pF1KE6 MELFLAGRRVLVTGAGKGIGRGTVQALHATGARVVAVSRTQADLDSLVRECPGIEPVC ::. :.:::.:.::: : :.:. .::::: .. .. .: .: :: . CCDS12 MATGTRYAGKVVVVTGGGRGIGAGIVRAFVNSGARVVICDKDESGGRALEQELPGAVFIL 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 VDLGDWEATERALGSV----GPVDLLVNNAA--VALLQPFLEVTKEAFDRSFEVNLRAVI :. . . .. .. . : .: .::::. .: :.. ..: . .:.:: .. 
CCDS12 CDVTQEDDVKTLVSETIRRFGRLDCVVNNAGHHPPPQRPE-ETSAQGFRQLLELNLLGTY 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 QVSQIVARGLIARGVPGAIVNVSSQCSQRAVTNHSVYCSTKGALDMLTKVMALELGPHKI ..... : : : ..:.:: . . .. : .::::. .::..::. .:. . CCDS12 TLTKLALPYL--RKSQGNVINISSLVGAIGQAQAVPYVATKGAVTAMTKALALDESPYGV 120 130 140 150 160 170 180 190 200 210 220 pF1KE6 RVNAVNPTVVMTSMGQ---ATWSDPHKA--KTMLNRIPLGKFAEVEHVVNAILFLLSDRS ::: ..: . : . . : ::. . . :: . :::.... .: : .:: :. . CCDS12 RVNCISPGNIWTPLWEELAALMPDPRATIREGMLAQ-PLGRMGQPAEVGAAAVFLASE-A 180 190 200 210 220 230 230 240 pF1KE6 GMTTGSTLPVEGGFWAC .. :: : : :: CCDS12 NFCTGIELLVTGGAELGYGCKASRSTPVDAPDIPS 240 250 260 270 >>CCDS10409.1 DECR2 gene_id:26063|Hs108|chr16 (292 aa) initn: 239 init1: 109 opt: 309 Z-score: 378.5 bits: 77.7 E(32554): 9.2e-15 Smith-Waterman score: 309; 25.7% identity (63.1% similar) in 249 aa overlap (4-242:25-272) 10 20 30 pF1KE6 MELFLAGRRVLVTGAGKGIGRGTVQALHATGARVVAVSR .: . ...::.:.::: .. . : ..: .:: CCDS10 MAQPPPDVEGDDCLPAYRHLFCPDLLRDKVAFITGGGSGIGFRIAEIFMRHGCHTVIASR 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE6 TQADLDSLVRECPGIE-----PVCVDL----GDWEATERALGSVGPVDLLVNNAAVALLQ . . . .:. : :. .:. . :...:: : .:.:.: :: .: CCDS10 SLPRVLTAARKLAGATGRRCLPLSMDVRAPPAVMAAVDQALKEFGRIDILINCAAGNFLC 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE6 PFLEVTKEAFDRSFEVNLRAVIQVSQIVARGLIARGVPGAIVNVSSQCSQRAVTNHSVYC : .. .:: .... ....::... . .. : :.:::... ..:. . . CCDS10 PAGALSFNAFKTVMDIDTSGTFNVSRVLYEKFF-RDHGGVIVNITATLGNRGQALQVHAG 130 140 150 160 170 160 170 180 190 200 pF1KE6 STKGALDMLTKVMALELGPHKIRVNAVNPTVVMTSMGQATWSDPHKA-KTMLNRIPLGKF :.:.:.: .:. .:.: ::..::::.. : . . : . :. . .: .. :: .. CCDS10 SAKAAVDAMTRHLAVEWGPQNIRVNSLAPGPISGTEGLRRLGGPQASLSTKVTASPLQRL 180 190 200 210 220 230 210 220 230 240 pF1KE6 AEVEHVVNAILFLLSDRSGMTTGSTLPVEGGFWAC .. 
......:.: : ....::..: ..:: : CCDS10 GNKTEIAHSVLYLASPLASYVTGAVLVADGGAWLTFPNGVKGLPDFASFSAKL 240 250 260 270 280 290 >>CCDS4769.1 HSD17B8 gene_id:7923|Hs108|chr6 (261 aa) initn: 341 init1: 174 opt: 299 Z-score: 367.2 bits: 75.5 E(32554): 3.9e-14 Smith-Waterman score: 344; 30.9% identity (64.7% similar) in 249 aa overlap (11-242:15-260) 10 20 30 40 50 pF1KE6 MELFLAGRRVLVTGAGKGIGRGTVQALHATGARVVAVSRTQADLDSLVREC--PGI ::::::.::::.. : . :: :.: . .: . :: :: CCDS47 MASQLQNRLRSALALVTGAGSGIGRAVSVRLAGEGATVAACDLDRAAAQETVRLLGGPGS 10 20 30 40 50 60 60 70 80 90 100 pF1KE6 E--P-------VCVDLGDWEATERALGSVG-----PVDLLVNNAAVALLQPFLEVTKEAF . : .:... .:.. : .: : ...:. :... . .:..... . CCDS47 KEGPPRGNHAAFQADVSEARAARCLLEQVQACFSRPPSVVVSCAGITQDEFLLHMSEDDW 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE6 DRSFEVNLRAVIQVSQIVARGLIARGVPGAIVNVSSQCSQRAVTNHSVYCSTKGALDMLT :. . :::.... :.: .:..:.. : :.:.:.:: .. . .... : ..:... :: CCDS47 DKVIAVNLKGTFLVTQAAAQALVSNGCRGSIINISSIVGKVGNVGQTNYAASKAGVIGLT 130 140 150 160 170 180 170 180 190 200 210 pF1KE6 KVMALELGPHKIRVNAVNPTVVMTSMGQATWSDPHKA-KTMLNRIPLGKFAEVEHVVNAI .. : ::: : :: :.: : . : : : . :.:. . . ::.:.... : :.... CCDS47 QTAARELGRHGIRCNSVLPGFIATPMTQKV---PQKVVDKITEMIPMGHLGDPEDVADVV 190 200 210 220 230 220 230 240 pF1KE6 LFLLSDRSGMTTGSTLPVEGGFWAC :: :. ::. ::... : ::.. CCDS47 AFLASEDSGYITGTSVEVTGGLFM 240 250 260 >>CCDS9604.1 DHRS2 gene_id:10202|Hs108|chr14 (280 aa) initn: 303 init1: 172 opt: 299 Z-score: 366.8 bits: 75.5 E(32554): 4.1e-14 Smith-Waterman score: 299; 30.2% identity (60.0% similar) in 245 aa overlap (5-239:34-275) 10 20 30 pF1KE6 MELFLAGRRVLVTGAGKGIGRGTVQALHATGARV ::.: ..:::. .::: . .. : ::.: CCDS96 AVARGYQGWFHPCARLSVRMSSTGIDRKGVLANRVAVVTGSTSGIGFAIARRLARDGAHV 10 20 30 40 50 60 40 50 60 70 80 pF1KE6 VAVSRTQADLDSLVRECPG----IEPVCVDLGDWEATE----RALGSVGPVDLLVNNAAV : :: : ..: . . : . . 
.: : : .:: : ::.:: .:.: CCDS96 VISSRKQQNVDRAMAKLQGEGLSVAGIVCHVGKAEDREQLVAKALEHCGGVDFLVCSAGV 70 80 90 100 110 120 90 100 110 120 130 140 pF1KE6 A-LLQPFLEVTKEAFDRSFEVNLRA-VIQVSQIVARGLIARGVPGAIVNVSSQCSQRAVT :. : .... .:. . ::... .. .::.. : ::.. ::: . :. CCDS96 NPLVGSTLGTSEQIWDKILSVNVKSPALLLSQLLPYMENRR---GAVILVSSIAAYNPVV 130 140 150 160 170 180 150 160 170 180 190 200 pF1KE6 NHSVYCSTKGALDMLTKVMALELGPHKIRVNAVNPTVVMTSMGQATWSDPHKAKTMLNRI .:: .: :: ::...::::.:. :::: : : .. :..... .. :.. .. CCDS96 ALGVYNVSKTALLGLTRTLALELAPKDIRVNCVVPGIIKTDFSKVFHGNESLWKNFKEHH 190 200 210 220 230 240 210 220 230 240 pF1KE6 PLGKFAEVEHVVNAILFLLSDRSGMTTGSTLPVEGGFWAC : ...: : .. . :: : .....: .. : : CCDS96 QLQRIGESEDCAGIVSFLCSPDASYVNGENIAVAGYSTRL 250 260 270 280 >>CCDS3812.1 CBR4 gene_id:84869|Hs108|chr4 (237 aa) initn: 264 init1: 137 opt: 292 Z-score: 359.4 bits: 73.9 E(32554): 1.1e-13 Smith-Waterman score: 292; 30.5% identity (63.2% similar) in 239 aa overlap (12-241:7-233) 10 20 30 40 50 pF1KE6 MELFLAGRRVLVTGAGKGIGRGTVQALHATGARVVAVSRT-------QADL--DSLVREC : :...::::...: . : :.....:. .:: : :. : CCDS38 MDKVCAVFGGSRGIGRAVAQLMARKGYRLAVIARNLEGAKAAAGDLGGDHLAFSC 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 PGIEPVCVDLGDWEATERALGSVGPVDLLVNNAAVALLQPFLEVTKEAFDRSFEVNLRAV . :. . .: :. :: : ..::: :.. .... : . ....:: . CCDS38 DVAKEHDVQ-NTFEELEKHLGRV---NFLVNAAGINRDGLLVRTKTEDMVSQLHTNLLGS 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 IQVSQIVARGLIARGVPGAIVNVSSQCSQRAVTNHSVYCSTKGALDMLTKVMALELGPHK . . . . : .: . :.::::.: . .. ...::: ..::.: .....: :.. .: CCDS38 MLTCKAAMRTMIQQQ-GGSIVNVGSIVGLKGNSGQSVYSASKGGLVGFSRALAKEVARKK 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 IRVNAVNPTVVMTSMGQATWSDPHKAKTMLNRIPLGKFAEVEHVVNAILFLLSDRSGMTT ::::.: : : :.: . : . . . ::::.:.:. .:..:..::: .: . : CCDS38 IRVNVVAPGFVHTDMTKDL-----KEEHLKKNIPLGRFGETIEVAHAVVFLL--ESPYIT 180 190 200 210 220 240 pF1KE6 GSTLPVEGGFWAC : .: :.::. 
CCDS38 GHVLVVDGGLQLIL 230 >>CCDS41927.1 DHRS2 gene_id:10202|Hs108|chr14 (300 aa) initn: 268 init1: 137 opt: 264 Z-score: 324.3 bits: 67.8 E(32554): 9.6e-12 Smith-Waterman score: 264; 33.0% identity (60.3% similar) in 194 aa overlap (5-188:34-224) 10 20 30 pF1KE6 MELFLAGRRVLVTGAGKGIGRGTVQALHATGARV ::.: ..:::. .::: . .. : ::.: CCDS41 AVARGYQGWFHPCARLSVRMSSTGIDRKGVLANRVAVVTGSTSGIGFAIARRLARDGAHV 10 20 30 40 50 60 40 50 60 70 80 pF1KE6 VAVSRTQADLDSLVRECPG----IEPVCVDLGDWEATE----RALGSVGPVDLLVNNAAV : :: : ..: . . : . . .: : : .:: : ::.:: .:.: CCDS41 VISSRKQQNVDRAMAKLQGEGLSVAGIVCHVGKAEDREQLVAKALEHCGGVDFLVCSAGV 70 80 90 100 110 120 90 100 110 120 130 140 pF1KE6 A-LLQPFLEVTKEAFDRSFEVNLRA-VIQVSQIVARGLIARGVPGAIVNVSSQCSQRAVT :. : .... .:. . ::... .. .::.. :: :.. ::: . :. CCDS41 NPLVGSTLGTSEQIWDKILSVNVKSPALLLSQLLPYMENRRG---AVILVSSIAAYNPVV 130 140 150 160 170 180 150 160 170 180 190 200 pF1KE6 NHSVYCSTKGALDMLTKVMALELGPHKIRVNAVNPTVVMTSMGQATWSDPHKAKTMLNRI .:: .: :: ::...::::.:. :::: : : .. :.... CCDS41 ALGVYNVSKTALLGLTRTLALELAPKDIRVNCVVPGIIKTDFSKVVRIGFMGMSLSGRTS 190 200 210 220 230 240 210 220 230 240 pF1KE6 PLGKFAEVEHVVNAILFLLSDRSGMTTGSTLPVEGGFWAC CCDS41 RNIISCRGLGSQRTVQESCPSCALQMPATSTGRTLRWQATPLGSERSGGGCVAVVPGPGA 250 260 270 280 290 300 >>CCDS9606.2 DHRS4L2 gene_id:317749|Hs108|chr14 (232 aa) initn: 150 init1: 93 opt: 256 Z-score: 316.3 bits: 65.9 E(32554): 2.7e-11 Smith-Waterman score: 256; 30.3% identity (62.1% similar) in 198 aa overlap (5-192:30-226) 10 20 30 pF1KE6 MELFLAGRRVLVTGAGKGIGRGTVQALHATGARVV :... .:::.. ::: . .. : :.:: CCDS96 MQMARLLGLCAWARKSVRMASSRMTRRDPLTNKVALVTASTDGIGFAIARRLAQDRAHVV 10 20 30 40 50 60 40 50 60 70 80 pF1KE6 AVSRTQADLDSLVRECPG----IEPVCVDLGDWEATERALGSV----GPVDLLVNNAAVA . :: : ..:. : : . . .: : :: .. . : .:.::.:::: CCDS96 VSSRKQQNVDQAVATLQGEGLSVTGTVCHVGKAEDRERLVAMAVKLHGGIDILVSNAAVN 70 80 90 100 110 120 90 100 110 120 130 140 pF1KE6 -LLQPFLEVTKEAFDRSFEVNLRAVIQVSQIVARGLIARGVPGAIVNVSSQCSQRAVTNH .. ...::.:..:.....:..: ... :. . :: :..: ::: . . 
CCDS96 PFFGSLMDVTEEVWDKTLDINVKAPALMTKAVVPEMEKRG-GGSVVIVSSIAAFSPSPGF 130 140 150 160 170 150 160 170 180 190 200 pF1KE6 SVYCSTKGALDMLTKVMALELGPHKIRVNAVNPTVV-MTSMGQATWSDPHKAKTMLNRIP : : .: :: :....:.::.:..:::: .. . ..: : . :. CCDS96 SPYNVSKTALLGLNNTLAIELAPRNIRVNCLHLDLSRLASAGCSGWTRKKRKA 180 190 200 210 220 230 210 220 230 240 pF1KE6 LGKFAEVEHVVNAILFLLSDRSGMTTGSTLPVEGGFWAC

244 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 15:09:32 2016 done: Tue Nov 8 15:09:33 2016
 Total Scan time: 1.900 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]