FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE5212, 185 aa
  1>>>pF1KE5212 185 - 185 aa - 185 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 5.9523+/-0.000894; mu= 8.8855+/- 0.053
 mean_var=63.8080+/-12.749, 0's: 0 Z-trim(105.5): 63  B-trim: 88 in 1/49
 Lambda= 0.160560
 statistics sampled from 8373 (8436) to 8373 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.654), E-opt: 0.2 (0.259), width:  16
 Scan time:  1.850

The best scores are:                                   opt  bits E(32554)
CCDS11469.1 DUSP3 gene_id:1845|Hs108|chr17    ( 185) 1217 290.5    4e-79
CCDS53542.1 DUSP13 gene_id:51207|Hs108|chr10  ( 188)  465 116.3  1.1e-26
CCDS31223.1 DUPD1 gene_id:338599|Hs108|chr10  ( 220)  452 113.3    1e-25
CCDS7346.1 DUSP13 gene_id:51207|Hs108|chr10   ( 198)  430 108.2  3.2e-24
CCDS31224.1 DUSP13 gene_id:51207|Hs108|chr10  ( 248)  430 108.2    4e-24
CCDS6092.1 DUSP26 gene_id:78986|Hs108|chr8    ( 211)  408 103.1  1.2e-22
CCDS30932.1 DUSP27 gene_id:92235|Hs108|chr1   (1158)  411 103.9  3.7e-22
CCDS14724.1 DUSP9 gene_id:1852|Hs108|chrX     ( 384)  285  74.7  7.8e-14
CCDS1528.1 DUSP10 gene_id:11221|Hs108|chr1    ( 482)  283  74.2  1.3e-13
CCDS33766.2 DUSP7 gene_id:1849|Hs108|chr3     ( 419)  278  73.0  2.6e-13
CCDS9033.1 DUSP6 gene_id:1848|Hs108|chr12     ( 381)  268  70.7  1.2e-12
CCDS7566.1 DUSP5 gene_id:1847|Hs108|chr10     ( 384)  251  66.8  1.8e-11
CCDS9711.1 STYX gene_id:6815|Hs108|chr14      ( 223)  240  64.2  6.4e-11

>>CCDS11469.1 DUSP3 gene_id:1845|Hs108|chr17              (185 aa)
 initn: 1217 init1: 1217 opt: 1217  Z-score: 1534.0  bits: 290.5 E(32554): 4e-79
Smith-Waterman score: 1217; 100.0% identity (100.0% similar) in 185 aa overlap (1-185:1-185)

               10        20        30        40        50        60
pF1KE5 MSGSFELSVQDLNDLLSDGSGCYSLPSQPCNEVTPRIYVGNASVAQDIPKLQKLGITHVL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 
MSGSFELSVQDLNDLLSDGSGCYSLPSQPCNEVTPRIYVGNASVAQDIPKLQKLGITHVL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 NAAEGRSFMHVNTNANFYKDSGITYLGIKANDTQEFNLSAYFERAADFIDQALAQKNGRV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 NAAEGRSFMHVNTNANFYKDSGITYLGIKANDTQEFNLSAYFERAADFIDQALAQKNGRV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 LVHCREGYSRSPTLVIAYLMMRQKMDVKSALSIVRQNREIGPNDGFLAQLCQLNDRLAKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 LVHCREGYSRSPTLVIAYLMMRQKMDVKSALSIVRQNREIGPNDGFLAQLCQLNDRLAKE 130 140 150 160 170 180 pF1KE5 GKLKP ::::: CCDS11 GKLKP >>CCDS53542.1 DUSP13 gene_id:51207|Hs108|chr10 (188 aa) initn: 432 init1: 331 opt: 465 Z-score: 592.5 bits: 116.3 E(32554): 1.1e-26 Smith-Waterman score: 465; 44.3% identity (73.9% similar) in 176 aa overlap (8-182:20-187) 10 20 30 40 pF1KE5 MSGSFELSVQDLNDLLSDG-SGCYSLPSQPCNEVTPRIYVGNASVAQD :. .:..:: : :.: . .:: : ...:.:..:.. CCDS53 MAETSLPELGGEDKATPCPSILELEELLRAGKSSCSRV-----DEVWPNLFIGDAATANN 10 20 30 40 50 50 60 70 80 90 100 pF1KE5 IPKLQKLGITHVLNAAEGRSFMHVNTNANFYKDSGITYLGIKANDTQEFNLSAYFERAAD .: :::::::::::. . .. . . .:: :...:::. :.: .:..:::: ::: CCDS53 RFELWKLGITHVLNAAH--KGLYCQGGPDFY-GSSVSYLGVPAHDLPDFDISAYFSSAAD 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE5 FIDQALAQKNGRVLVHCREGYSRSPTLVIAYLMMRQKMDVKSALSIVRQNREIGPNDGFL :: .:: ...::::: : ::: :::.::::..:......:. :::.: . :: ::: CCDS53 FIHRALNTPGAKVLVHCVVGVSRSATLVLAYLMLHQRLSLRQAVITVRQHRWVFPNRGFL 120 130 140 150 160 170 170 180 pF1KE5 AQLCQLNDRLAKEGKLKP :::.:...: :. CCDS53 HQLCRLDQQLRGAGQS 180 >>CCDS31223.1 DUPD1 gene_id:338599|Hs108|chr10 (220 aa) initn: 372 init1: 246 opt: 452 Z-score: 575.1 bits: 113.3 E(32554): 1e-25 Smith-Waterman score: 453; 43.8% identity (70.8% similar) in 178 aa overlap (3-180:37-203) 10 20 30 pF1KE5 MSGSFELSVQDLNDLLSDGSGCYSLPSQPCNE :.::: . :. :: :. 
:: CCDS31 KTSLKNAYSSAKRLSPKMEEEGEEEDYCTPGAFEL-----ERLFWKGSPQYT----HVNE 10 20 30 40 50 40 50 60 70 80 90 pF1KE5 VTPRIYVGNASVAQDIPKLQKLGITHVLNAAEGRSFMHVNTNANFYKDSGITYLGIKAND : :..:.:. ..: : .::: :.:::::::.:: .:.:. ..:.: : : :..:.: CCDS31 VWPKLYIGDEATALDRYRLQKAGFTHVLNAAHGR--WNVDTGPDYYRDMDIQYHGVEADD 60 70 80 90 100 110 100 110 120 130 140 150 pF1KE5 TQEFNLSAYFERAADFIDQALAQKNGRVLVHCREGYSRSPTLVIAYLMMRQKMDVKSALS :.::..: :: :::.::.. ....:::: : ::: :::.::::... : . .:.. CCDS31 LPTFDLSVFFYPAAAFIDRALSDDHSKILVHCVMGRSRSATLVLAYLMIHKDMTLVDAIQ 120 130 140 150 160 170 160 170 180 pF1KE5 IVRQNREIGPNDGFLAQLCQLNDRLAKEGKLKP : .:: . :: ::: :: .:. .:... CCDS31 QVAKNRCVLPNRGFLKQLRELDKQLVQQRRRSQRQDGEEEDGREL 180 190 200 210 220 >>CCDS7346.1 DUSP13 gene_id:51207|Hs108|chr10 (198 aa) initn: 410 init1: 301 opt: 430 Z-score: 548.3 bits: 108.2 E(32554): 3.2e-24 Smith-Waterman score: 430; 47.3% identity (76.0% similar) in 150 aa overlap (31-180:47-194) 10 20 30 40 50 60 pF1KE5 MSGSFELSVQDLNDLLSDGSGCYSLPSQPCNEVTPRIYVGNASVAQDIPKLQKLGITHVL .:: : ...:.: .:.: :: .::::::. CCDS73 AVQASPYQPPTLASLQRLLWVRQAATLNHIDEVWPSLFLGDAYAARDKSKLIQLGITHVV 20 30 40 50 60 70 70 80 90 100 110 120 pF1KE5 NAAEGRSFMHVNTNANFYKDSGITYLGIKANDTQEFNLSAYFERAADFIDQALAQKNGRV ::: :. ..:.:.:.::. .. : ::.:.:. :.::.:: .: .: ::. .::: CCDS73 NAAAGK--FQVDTGAKFYRGMSLEYYGIEADDNPFFDLSVYFLPVARYIRAALSVPQGRV 80 90 100 110 120 130 130 140 150 160 170 180 pF1KE5 LVHCREGYSRSPTLVIAYLMMRQKMDVKSALSIVRQNREIGPNDGFLAQLCQLNDRLAKE :::: : ::: :::.:.::. ..: . :.. :. .:.: ::.::: :: :..::..: CCDS73 LVHCAMGVSRSATLVLAFLMICENMTLVEAIQTVQAHRNICPNSGFLRQLQVLDNRLGRE 140 150 160 170 180 190 pF1KE5 GKLKP CCDS73 TGRF >>CCDS31224.1 DUSP13 gene_id:51207|Hs108|chr10 (248 aa) initn: 410 init1: 301 opt: 430 Z-score: 546.6 bits: 108.2 E(32554): 4e-24 Smith-Waterman score: 430; 47.3% identity (76.0% similar) in 150 aa overlap (31-180:97-244) 10 20 30 40 50 60 pF1KE5 MSGSFELSVQDLNDLLSDGSGCYSLPSQPCNEVTPRIYVGNASVAQDIPKLQKLGITHVL .:: : ...:.: .:.: :: .::::::. 
CCDS31 AVQASPYQPPTLASLQRLLWVRQAATLNHIDEVWPSLFLGDAYAARDKSKLIQLGITHVV 70 80 90 100 110 120 70 80 90 100 110 120 pF1KE5 NAAEGRSFMHVNTNANFYKDSGITYLGIKANDTQEFNLSAYFERAADFIDQALAQKNGRV ::: :. ..:.:.:.::. .. : ::.:.:. :.::.:: .: .: ::. .::: CCDS31 NAAAGK--FQVDTGAKFYRGMSLEYYGIEADDNPFFDLSVYFLPVARYIRAALSVPQGRV 130 140 150 160 170 180 130 140 150 160 170 180 pF1KE5 LVHCREGYSRSPTLVIAYLMMRQKMDVKSALSIVRQNREIGPNDGFLAQLCQLNDRLAKE :::: : ::: :::.:.::. ..: . :.. :. .:.: ::.::: :: :..::..: CCDS31 LVHCAMGVSRSATLVLAFLMICENMTLVEAIQTVQAHRNICPNSGFLRQLQVLDNRLGRE 190 200 210 220 230 240 pF1KE5 GKLKP CCDS31 TGRF >>CCDS6092.1 DUSP26 gene_id:78986|Hs108|chr8 (211 aa) initn: 416 init1: 321 opt: 408 Z-score: 520.3 bits: 103.1 E(32554): 1.2e-22 Smith-Waterman score: 412; 42.0% identity (70.5% similar) in 176 aa overlap (7-181:43-208) 10 20 30 pF1KE5 MSGSFELSVQDLNDLLSDG-SGCYSLPSQPCNEVTP :.: .:. :: : ..: . .:: : CCDS60 FMARFSRSSSRSPVRTRGTLEEMPTVQHPFLNVFELERLLYTGKTAC-----NHADEVWP 20 30 40 50 60 40 50 60 70 80 90 pF1KE5 RIYVGNASVAQDIPKLQKLGITHVLNAAEGRSFMHVNTNANFYKDSGITYLGIKANDTQE .:.:. ..:.. .:..:::::::::...: . . :. :: :::..:.:. CCDS60 GLYLGDQDMANNRRELRRLGITHVLNASHSR----WRGTPEAYEGLGIRYLGVEAHDSPA 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE5 FNLSAYFERAADFIDQALAQKNGRVLVHCREGYSRSPTLVIAYLMMRQKMDVKSALSIVR :..: .:. ::::: .::.: .:..:::: : ::: :::.::::. ... . :.. :. CCDS60 FDMSIHFQTAADFIHRALSQPGGKILVHCAVGVSRSATLVLAYLMLYHHLTLVEAIKKVK 130 140 150 160 170 180 160 170 180 pF1KE5 QNREIGPNDGFLAQLCQLNDRLAKEGKLKP ..: : :: ::: :: :. :: ..: CCDS60 DHRGIIPNRGFLRQLLALDRRL-RQGLEA 190 200 210 >>CCDS30932.1 DUSP27 gene_id:92235|Hs108|chr1 (1158 aa) initn: 429 init1: 298 opt: 411 Z-score: 511.4 bits: 103.9 E(32554): 3.7e-22 Smith-Waterman score: 411; 43.2% identity (71.6% similar) in 155 aa overlap (26-180:130-281) 10 20 30 40 50 pF1KE5 MSGSFELSVQDLNDLLSDGSGCYSLPSQPCNEVTPRIYVGNASVAQDIPKLQKLG : . .:: : ..... ::: . 
.:..:: CCDS30 VREKMDDTSLYNTPCVLDLQRALVQDRQEAPWNEVDEVWPNVFIAEKSVAVNKGRLKRLG 100 110 120 130 140 150 60 70 80 90 100 110 pF1KE5 ITHVLNAAEGRSFMHVNTNANFYKDSGITYLGIKANDTQEFNLSAYFERAADFIDQALAQ :::.::::.: . : :. .:: : :::....: : ..: .:..:..:.:.:: CCDS30 ITHILNAAHGTG---VYTGPEFYTGLEIQYLGVEVDDFPEVDISQHFRKASEFLDEALLT 160 170 180 190 200 210 120 130 140 150 160 170 pF1KE5 KNGRVLVHCREGYSRSPTLVIAYLMMRQKMDVKSALSIVRQNREIGPNDGFLAQLCQLND :.::: . : ::: .::.::::. ..: . :: ::..: : ::.::: :: .::. CCDS30 YRGKVLVSSEMGISRSAVLVVAYLMIFHNMAILEALMTVRKKRAIYPNEGFLKQLRELNE 220 230 240 250 260 270 180 pF1KE5 RLAKEGKLKP .: .: CCDS30 KLMEEREEDYGREGGSAEAEEGEGTGSMLGARVHALTVEEEDDSASHLSGSSLGKATQAS 280 290 300 310 320 330 >>CCDS14724.1 DUSP9 gene_id:1852|Hs108|chrX (384 aa) initn: 216 init1: 113 opt: 285 Z-score: 361.8 bits: 74.7 E(32554): 7.8e-14 Smith-Waterman score: 286; 36.1% identity (65.2% similar) in 158 aa overlap (24-180:202-347) 10 20 30 40 50 pF1KE5 MSGSFELSVQDLNDLLSDGSGCYSLPSQPCNEVTPRIYVGNASVAQDIPKLQK :.: : . : .:.:.: . .. .: : CCDS14 SDAESEADRDSMSCGLDSEGATPPPVGLRASFPVQ----ILPNLYLGSARDSANLESLAK 180 190 200 210 220 60 70 80 90 100 110 pF1KE5 LGITHVLNAAEGRSFMHVNTNANFYKDSGITYLGIKANDTQEFNLSAYFERAADFIDQAL ::: ..::.. : : :.. . : : .: ::: .: .: .:::.:: CCDS14 LGIRYILNVTP-------NLPNFFEKNGDFHYKQIPISDHWSQNLSRFFPEAIEFIDEAL 230 240 250 260 270 280 120 130 140 150 160 170 pF1KE5 AQKNGRVLVHCREGYSRSPTLVIAYLMMRQKMDVKSALSIV-RQNREIGPNDGFLAQLCQ .:. : ::::: : ::: :...::::.. ......: ..: :.. .:.:: .:..:: . CCDS14 SQNCG-VLVHCLAGVSRSVTVTVAYLMQKLHLSLNDAYDLVKRKKSNISPNFNFMGQLLD 290 300 310 320 330 180 pF1KE5 LNDRLAKEGKLKP .. : : CCDS14 FERSLRLEERHSQEQGSGGQASAASNPPSFFTTPTSDGAFELAPT 340 350 360 370 380 >>CCDS1528.1 DUSP10 gene_id:11221|Hs108|chr1 (482 aa) initn: 248 init1: 115 opt: 283 Z-score: 357.7 bits: 74.2 E(32554): 1.3e-13 Smith-Waterman score: 283; 34.9% identity (68.5% similar) in 146 aa overlap (33-177:325-462) 10 20 30 40 50 60 pF1KE5 GSFELSVQDLNDLLSDGSGCYSLPSQPCNEVTPRIYVGNASVAQDIPKLQKLGITHVLNA . : ...:: . :::. 
.:.:.: .:.:. CCDS15 EVGGGASAASSLLPQPIPTTPDIENAELTPILPFLFLGNEQDAQDLDTMQRLNIGYVINV 300 310 320 330 340 350 70 80 90 100 110 120 pF1KE5 AEGRSFMHVNTNANFYKDSGITYLGIKANDTQEFNLSAYFERAADFIDQALAQKNGRVLV . ..: :. . ..: . :.:... :: :::.: .::..: .: .:. CCDS15 TTHLPLYH-------YEKGLFNYKRLPATDSNKQNLRQYFEEAFEFIEEAHQCGKG-LLI 360 370 380 390 400 130 140 150 160 170 180 pF1KE5 HCREGYSRSPTLVIAYLMMRQKMDVKSALSIVRQNREI-GPNDGFLAQLCQLNDRLAKEG ::. : ::: :.:::::: . .: . .: ..:. .: : .:: .:..:: .... : CCDS15 HCQAGVSRSATIVIAYLMKHTRMTMTDAYKFVKGKRPIISPNLNFMGQLLEFEEDLNNGV 410 420 430 440 450 460 pF1KE5 KLKP CCDS15 TPRILTPKLMGVETVV 470 480

>>CCDS33766.2 DUSP7 gene_id:1849|Hs108|chr3               (419 aa)
 initn: 268 init1: 113 opt: 278  Z-score: 352.4  bits: 73.0 E(32554): 2.6e-13
Smith-Waterman score: 279; 35.6% identity (64.4% similar) in 163 aa overlap (17-177:232-385)

10 20 30 40 pF1KE5 MSGSFELSVQDLNDLLSDGSGC-YSLPSQPCNEVTPRIYVGNASVA :::: : :. : .. : .:.: :. . CCDS33 TSVLGLGGLRISSDCSDGESDRELPSSATESDGSPVPSSQPAFPV-QILPYLYLGCAKDS 210 220 230 240 250 260 50 60 70 80 90 100 pF1KE5 QDIPKLQKLGITHVLNAAEGRSFMHVNTNANFYKDSGITYLGIKANDTQEFNLSAYFERA .. : : :: ..::.. : : . . .:: : .: ::: .: .: CCDS33 TNLDVLGKYGIKYILNVTP-------NLPNAFEHGGEFTYKQIPISDHWSQNLSQFFPEA 270 280 290 300 310 110 120 130 140 150 160 pF1KE5 ADFIDQALAQKNGRVLVHCREGYSRSPTLVIAYLMMRQKMDVKSALSIV-RQNREIGPND .:::.: ..: : ::::: : ::: :...::::.........: ..: :.. .:.:: CCDS33 ISFIDEARSKKCG-VLVHCLAGISRSVTVTVAYLMQKMNLSLNDAYDFVKRKKSNISPNF 320 330 340 350 360 370 170 180 pF1KE5 GFLAQLCQLNDRLAKEGKLKP .:..:: ... : CCDS33 NFMGQLLDFERTLGLSSPCDNHASSEQLYFSTPTNHNLFPLNTLEST 380 390 400 410

185 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Mon Nov  7 22:33:18 2016 done: Mon Nov  7 22:33:18 2016
 Total Scan time:  1.850 Total Display time:  0.000

Function used was FASTA [36.3.4 Apr, 2011]