FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5234, 211 aa 1>>>pF1KE5234 211 - 211 aa - 211 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.5580+/-0.000843; mu= 11.4870+/- 0.050 mean_var=68.0375+/-13.326, 0's: 0 Z-trim(106.8): 67 B-trim: 90 in 1/48 Lambda= 0.155489 statistics sampled from 9112 (9189) to 9112 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.668), E-opt: 0.2 (0.282), width: 16 Scan time: 1.400 The best scores are: opt bits E(32554) CCDS6092.1 DUSP26 gene_id:78986|Hs108|chr8 ( 211) 1431 329.9 7.3e-91 CCDS53542.1 DUSP13 gene_id:51207|Hs108|chr10 ( 188) 656 156.0 1.4e-38 CCDS31223.1 DUPD1 gene_id:338599|Hs108|chr10 ( 220) 584 139.9 1.2e-33 CCDS7346.1 DUSP13 gene_id:51207|Hs108|chr10 ( 198) 567 136.0 1.5e-32 CCDS31224.1 DUSP13 gene_id:51207|Hs108|chr10 ( 248) 568 136.3 1.6e-32 CCDS30932.1 DUSP27 gene_id:92235|Hs108|chr1 (1158) 486 118.2 2.1e-26 CCDS11469.1 DUSP3 gene_id:1845|Hs108|chr17 ( 185) 408 100.3 7.9e-22 CCDS11320.1 DUSP14 gene_id:11072|Hs108|chr17 ( 198) 307 77.7 5.5e-15 CCDS2016.1 DUSP2 gene_id:1844|Hs108|chr2 ( 314) 308 78.0 7e-15 CCDS7566.1 DUSP5 gene_id:1847|Hs108|chr10 ( 384) 296 75.4 5.4e-14 CCDS4380.1 DUSP1 gene_id:1843|Hs108|chr5 ( 367) 293 74.7 8.3e-14 CCDS4468.1 DUSP22 gene_id:56940|Hs108|chr6 ( 184) 282 72.1 2.5e-13 CCDS69035.1 DUSP22 gene_id:56940|Hs108|chr6 ( 205) 282 72.1 2.8e-13 CCDS6073.1 DUSP4 gene_id:1846|Hs108|chr8 ( 303) 284 72.6 2.8e-13 CCDS6072.1 DUSP4 gene_id:1846|Hs108|chr8 ( 394) 284 72.7 3.6e-13 CCDS14724.1 DUSP9 gene_id:1852|Hs108|chrX ( 384) 283 72.5 4.1e-13 CCDS8157.1 SSH3 gene_id:54961|Hs108|chr11 ( 659) 273 70.3 3.1e-12 CCDS1528.1 DUSP10 gene_id:11221|Hs108|chr1 ( 482) 270 69.6 3.8e-12 CCDS13883.1 DUSP18 gene_id:150290|Hs108|chr22 ( 188) 263 67.8 4.9e-12 CCDS13193.1 DUSP15 gene_id:128853|Hs108|chr20 ( 235) 255 66.1 2.1e-11 CCDS82607.1 DUSP15 gene_id:128853|Hs108|chr20 ( 232) 251 65.2 3.8e-11 CCDS55882.1 SSH1 gene_id:54434|Hs108|chr12 ( 692) 257 66.7 3.9e-11 CCDS53825.1 SSH1 gene_id:54434|Hs108|chr12 ( 703) 257 66.7 3.9e-11 CCDS8650.1 DUSP16 gene_id:80824|Hs108|chr12 ( 665) 256 66.5 4.4e-11 CCDS9121.1 SSH1 gene_id:54434|Hs108|chr12 (1049) 257 66.8 5.5e-11 CCDS14264.1 DUSP21 gene_id:63904|Hs108|chrX ( 190) 244 63.6 9.6e-11 >>CCDS6092.1 DUSP26 gene_id:78986|Hs108|chr8 (211 aa) initn: 1431 init1: 1431 opt: 1431 Z-score: 1744.7 bits: 329.9 E(32554): 7.3e-91 Smith-Waterman score: 1431; 100.0% identity (100.0% similar) in 211 aa overlap (1-211:1-211) 10 20 30 40 50 60 pF1KE5 MCPGNWLWASMTFMARFSRSSSRSPVRTRGTLEEMPTVQHPFLNVFELERLLYTGKTACN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 MCPGNWLWASMTFMARFSRSSSRSPVRTRGTLEEMPTVQHPFLNVFELERLLYTGKTACN 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 HADEVWPGLYLGDQDMANNRRELRRLGITHVLNASHSRWRGTPEAYEGLGIRYLGVEAHD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 HADEVWPGLYLGDQDMANNRRELRRLGITHVLNASHSRWRGTPEAYEGLGIRYLGVEAHD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 SPAFDMSIHFQTAADFIHRALSQPGGKILVHCAVGVSRSATLVLAYLMLYHHLTLVEAIK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 SPAFDMSIHFQTAADFIHRALSQPGGKILVHCAVGVSRSATLVLAYLMLYHHLTLVEAIK 130 140 150 160 170 180 190 200 210 pF1KE5 KVKDHRGIIPNRGFLRQLLALDRRLRQGLEA ::::::::::::::::::::::::::::::: CCDS60 KVKDHRGIIPNRGFLRQLLALDRRLRQGLEA 190 200 210 >>CCDS53542.1 DUSP13 gene_id:51207|Hs108|chr10 (188 aa) initn: 641 init1: 431 opt: 656 Z-score: 805.9 bits: 156.0 E(32554): 1.4e-38 Smith-Waterman score: 656; 58.9% identity (83.3% similar) in 168 aa overlap (41-206:17-183) 20 30 40 50 60 70 pF1KE5 MTFMARFSRSSSRSPVRTRGTLEEMPTVQHPFLNVFELERLLYTGKTACNHADEVWPGLY : ...:::.:: .::..:...:::::.:. CCDS53 MAETSLPELGGEDKATPCPSILELEELLRAGKSSCSRVDEVWPNLF 10 20 30 40 80 90 100 110 120 pF1KE5 LGDQDMANNRRELRRLGITHVLNASHSRW--RGTPEAYEGLGIRYLGVEAHDSPAFDMSI .:: :::: :: .:::::::::.:. .: :. : : .. :::: ::: : ::.: CCDS53 IGDAATANNRFELWKLGITHVLNAAHKGLYCQGGPDFY-GSSVSYLGVPAHDLPDFDISA 50 60 70 80 90 100 130 140 150 160 170 180 pF1KE5 HFQTAADFIHRALSQPGGKILVHCAVGVSRSATLVLAYLMLYHHLTLVEAIKKVKDHRGI .:..:::::::::. ::.:.::::.::::::::::::::::...:.: .:. :..:: . CCDS53 YFSSAADFIHRALNTPGAKVLVHCVVGVSRSATLVLAYLMLHQRLSLRQAVITVRQHRWV 110 120 130 140 150 160 190 200 210 pF1KE5 IPNRGFLRQLLALDRRLRQGLEA .::::::.:: ::..:: CCDS53 FPNRGFLHQLCRLDQQLRGAGQS 170 180 >>CCDS31223.1 DUPD1 gene_id:338599|Hs108|chr10 (220 aa) initn: 574 init1: 360 opt: 584 Z-score: 717.6 bits: 139.9 E(32554): 1.2e-33 Smith-Waterman score: 584; 49.5% identity (74.7% similar) in 194 aa overlap (17-207:14-202) 10 20 30 40 50 pF1KE5 MCPGNWLWASMTFMARFSRSSSRSP-VRTRGTLEEMPTVQHPFLNVFELERLLYTGKTAC .: .. :: .. .: :.. : : ..::::::.. :. CCDS31 MTSGEVKTSLKNAYSSAKRLSPKMEEEGEEEDYCT---P--GAFELERLFWKGSPQY 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 NHADEVWPGLYLGDQDMANNRRELRRLGITHVLNASHSRWR--GTPEAYEGLGIRYLGVE .:..:::: ::.::. : .: .:.. :.::::::.:.:: :. :. . :.: ::: CCDS31 THVNEVWPKLYIGDEATALDRYRLQKAGFTHVLNAAHGRWNVDTGPDYYRDMDIQYHGVE 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 AHDSPAFDMSIHFQTAADFIHRALSQPGGKILVHCAVGVSRSATLVLAYLMLYHHLTLVE : : :.::.:. : :: :: ::::. .::::::..: ::::::::::::... .:::. CCDS31 ADDLPTFDLSVFFYPAAAFIDRALSDDHSKILVHCVMGRSRSATLVLAYLMIHKDMTLVD 120 130 140 150 160 170 180 190 200 210 pF1KE5 AIKKVKDHRGIIPNRGFLRQLLALDRRLRQGLEA ::..: .: ..::::::.:: ::..: : CCDS31 AIQQVAKNRCVLPNRGFLKQLRELDKQLVQQRRRSQRQDGEEEDGREL 180 190 200 210 220 >>CCDS7346.1 DUSP13 gene_id:51207|Hs108|chr10 (198 aa) initn: 553 init1: 399 opt: 567 Z-score: 697.7 bits: 136.0 E(32554): 1.5e-32 Smith-Waterman score: 567; 48.2% identity (74.9% similar) in 195 aa overlap (14-205:1-191) 10 20 30 40 50 pF1KE5 MCPGNWLWASMTFMARFSRSSSRSPVRTRGTLEEMPTVQHPFLNVFELERLLYTGKTAC- : ..... : : . .:... : : : : :.:::.. ..: CCDS73 MDSLQKQDLRRP-KIHGAVQASP-YQPPTLA--SLQRLLWVRQAATL 10 20 30 40 60 70 80 90 100 110 pF1KE5 NHADEVWPGLYLGDQDMANNRRELRRLGITHVLNASHSRWRGTPEA--YEGLGIRYLGVE :: :::::.:.::: : .. .: .::::::.::. .... : :.:....: :.: CCDS73 NHIDEVWPSLFLGDAYAARDKSKLIQLGITHVVNAAAGKFQVDTGAKFYRGMSLEYYGIE 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE5 AHDSPAFDMSIHFQTAADFIHRALSQPGGKILVHCAVGVSRSATLVLAYLMLYHHLTLVE : :.: ::.:..: .: .:. ::: : :..:::::.:::::::::::.::. ...:::: CCDS73 ADDNPFFDLSVYFLPVARYIRAALSVPQGRVLVHCAMGVSRSATLVLAFLMICENMTLVE 110 120 130 140 150 160 180 190 200 210 pF1KE5 AIKKVKDHRGIIPNRGFLRQLLALDRRLRQGLEA ::. :. ::.: :: :::::: .:: :: CCDS73 AIQTVQAHRNICPNSGFLRQLQVLDNRLGRETGRF 170 180 190 >>CCDS31224.1 DUSP13 gene_id:51207|Hs108|chr10 (248 aa) initn: 675 init1: 399 opt: 568 Z-score: 697.4 bits: 136.3 E(32554): 1.6e-32 Smith-Waterman score: 568; 46.3% identity (73.7% similar) in 205 aa overlap (7-205:41-241) 10 20 30 pF1KE5 MCPGNWLWASMTF---MARFSRSSSRSPVRTRGTLE .: .. . : ..... : : . .:... CCDS31 GEDKATPCPSILELEELLRAGKSSCSRVDEVWPNLFIGDAMDSLQKQDLRRP-KIHGAVQ 20 30 40 50 60 40 50 60 70 80 90 pF1KE5 EMPTVQHPFLNVFELERLLYTGKTAC-NHADEVWPGLYLGDQDMANNRRELRRLGITHVL : : : : :.:::.. ..: :: :::::.:.::: : .. .: .::::::. CCDS31 ASP-YQPPTLA--SLQRLLWVRQAATLNHIDEVWPSLFLGDAYAARDKSKLIQLGITHVV 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE5 NASHSRWRGTPEA--YEGLGIRYLGVEAHDSPAFDMSIHFQTAADFIHRALSQPGGKILV ::. .... : :.:....: :.:: :.: ::.:..: .: .:. ::: : :..:: CCDS31 NAAAGKFQVDTGAKFYRGMSLEYYGIEADDNPFFDLSVYFLPVARYIRAALSVPQGRVLV 130 140 150 160 170 180 160 170 180 190 200 210 pF1KE5 HCAVGVSRSATLVLAYLMLYHHLTLVEAIKKVKDHRGIIPNRGFLRQLLALDRRLRQGLE :::.:::::::::::.::. ...::::::. :. ::.: :: :::::: .:: :: CCDS31 HCAMGVSRSATLVLAFLMICENMTLVEAIQTVQAHRNICPNSGFLRQLQVLDNRLGRETG 190 200 210 220 230 240 pF1KE5 A CCDS31 RF >>CCDS30932.1 DUSP27 gene_id:92235|Hs108|chr1 (1158 aa) initn: 469 init1: 338 opt: 486 Z-score: 587.6 bits: 118.2 E(32554): 2.1e-26 Smith-Waterman score: 486; 42.9% identity (73.1% similar) in 182 aa overlap (27-205:99-278) 10 20 30 40 50 pF1KE5 MCPGNWLWASMTFMARFSRSSSRSPVRTRGTLEEMPTVQHPFLNVFELERLLYTGK :.: ... . : :..:.: : . CCDS30 ELKPPGVRADAECPGMLESAEQLLVEDLYNRVREKMDDTSLYNTPC--VLDLQRALVQDR 70 80 90 100 110 120 60 70 80 90 100 110 pF1KE5 TAC--NHADEVWPGLYLGDQDMANNRRELRRLGITHVLNASHSRWRGT-PEAYEGLGIRY :..:::::.........: :. .:.::::::.:::.:. : :: : :: :.: CCDS30 QEAPWNEVDEVWPNVFIAEKSVAVNKGRLKRLGITHILNAAHGTGVYTGPEFYTGLEIQY 130 140 150 160 170 180 120 130 140 150 160 170 pF1KE5 LGVEAHDSPAFDMSIHFQTAADFIHRALSQPGGKILVHCAVGVSRSATLVLAYLMLYHHL ::::. : : :.: ::. :..:. .:: ::.:: .:.::::.::.::::..:.. CCDS30 LGVEVDDFPEVDISQHFRKASEFLDEALLTYRGKVLVSSEMGISRSAVLVVAYLMIFHNM 190 200 210 220 230 240 180 190 200 210 pF1KE5 TLVEAIKKVKDHRGIIPNRGFLRQLLALDRRLRQGLEA ...::. :. .:.: ::.:::.:: :...: CCDS30 AILEALMTVRKKRAIYPNEGFLKQLRELNEKLMEEREEDYGREGGSAEAEEGEGTGSMLG 250 260 270 280 290 300 CCDS30 ARVHALTVEEEDDSASHLSGSSLGKATQASKPLTLIDEEEEEKLYEQWKKGQGLLSDKVP 310 320 330 340 350 360 >>CCDS11469.1 DUSP3 gene_id:1845|Hs108|chr17 (185 aa) initn: 416 init1: 321 opt: 408 Z-score: 505.4 bits: 100.3 E(32554): 7.9e-22 Smith-Waterman score: 412; 42.0% identity (70.5% similar) in 176 aa overlap (43-208:7-181) 20 30 40 50 60 pF1KE5 FMARFSRSSSRSPVRTRGTLEEMPTVQHPFLNVFELERLLYTGKTAC-----NHADEVWP :.: .:. :: : ..: . .:: : CCDS11 MSGSFELSVQDLNDLLSDG-SGCYSLPSQPCNEVTP 10 20 30 70 80 90 100 110 120 pF1KE5 GLYLGDQDMANNRRELRRLGITHVLNASHSR----WRGTPEAYEGLGIRYLGVEAHDSPA .:.:. ..:.. .:..:::::::::...: . . :. :: :::..:.:. CCDS11 RIYVGNASVAQDIPKLQKLGITHVLNAAEGRSFMHVNTNANFYKDSGITYLGIKANDTQE 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE5 FDMSIHFQTAADFIHRALSQPGGKILVHCAVGVSRSATLVLAYLMLYHHLTLVEAIKKVK :..: .:. ::::: .::.: .:..:::: : ::: :::.::::. ... . :.. :. CCDS11 FNLSAYFERAADFIDQALAQKNGRVLVHCREGYSRSPTLVIAYLMMRQKMDVKSALSIVR 100 110 120 130 140 150 190 200 210 pF1KE5 DHRGIIPNRGFLRQLLALDRRL-RQGLEA ..: : :: ::: :: :. :: ..: CCDS11 QNREIGPNDGFLAQLCQLNDRLAKEGKLKP 160 170 180 >>CCDS11320.1 DUSP14 gene_id:11072|Hs108|chr17 (198 aa) initn: 253 init1: 157 opt: 307 Z-score: 382.5 bits: 77.7 E(32554): 5.5e-15 Smith-Waterman score: 307; 41.3% identity (68.5% similar) in 143 aa overlap (64-205:29-165) 40 50 60 70 80 90 pF1KE5 EMPTVQHPFLNVFELERLLYTGKTACNHADEVWPGLYLGDQDMANNRRELRRLGITHVLN .. .:.:: ..:.::. :. ::: ..: CCDS11 MSSRGHSTLPRTLMAPRMISEGDIGGIAQITSSLFLGRGSVASNRHLLQARGITCIVN 10 20 30 40 50 100 110 120 130 140 150 pF1KE5 ASHSRWRGTPEAYEGLGIRYLGVEAHDSPAFDMSIHFQTAADFIHRALSQPGGKILVHCA :. :. .. ..:. : : : ....:.:.:: :: ..:. : ::::: CCDS11 ATIE----IPN-FNWPQFEYVKVPLADMPHAPIGLYFDTVADKIH-SVSRKHGATLVHCA 60 70 80 90 100 110 160 170 180 190 200 210 pF1KE5 VGVSRSATLVLAYLMLYHHLTLVEAIKKVKDHRGII-PNRGFLRQLLALDRRLRQGLEA .:::::::: .:::: .:.. :.:: . :: .: .: :: :: :::. .:.: CCDS11 AGVSRSATLCIAYLMKFHNVCLLEAYNWVKARRPVIRPNVGFWRQLIDYERQLFGKSTVK 120 130 140 150 160 170 CCDS11 MVQTPYGIVPDVYEKESRHLMPYWGI 180 190 >>CCDS2016.1 DUSP2 gene_id:1844|Hs108|chr2 (314 aa) initn: 280 init1: 163 opt: 308 Z-score: 380.6 bits: 78.0 E(32554): 7e-15 Smith-Waterman score: 308; 40.6% identity (69.2% similar) in 143 aa overlap (64-205:175-311) 40 50 60 70 80 90 pF1KE5 EMPTVQHPFLNVFELERLLYTGKTACNHADEVWPGLYLGDQDMANNRRELRRLGITHVLN :. : :.::. . ... . :. ::: ::: CCDS20 EAPAPALPPTGDKTSRSDSRAPVYDQGGPVEILPYLFLGSCSHSSDLQGLQACGITAVLN 150 160 170 180 190 200 100 110 120 130 140 150 pF1KE5 ASHSRWRGTPEAYEGLGIRYLGVEAHDSPAFDMSIHFQTAADFIHRALSQPGGKILVHCA .: : :. .::: .:: .. ..:. ..: :: : :: ... ::..:::: CCDS20 VSAS----CPNHFEGL-FRYKSIPVEDNQMVEISAWFQEAIGFIDW-VKNSGGRVLVHCQ 210 220 230 240 250 160 170 180 190 200 210 pF1KE5 VGVSRSATLVLAYLMLYHHLTLVEAIKKVKDHRGII-PNRGFLRQLLALDRRLRQGLEA .:.:::::. ::::: ... : ::. ::..::.: :: .:. ::: .. .. CCDS20 AGISRSATICLAYLMQSRRVRLDEAFDFVKQRRGVISPNFSFMGQLLQFETQVLCH 260 270 280 290 300 310 >>CCDS7566.1 DUSP5 gene_id:1847|Hs108|chr10 (384 aa) initn: 244 init1: 149 opt: 296 Z-score: 364.7 bits: 75.4 E(32554): 5.4e-14 Smith-Waterman score: 296; 40.6% identity (65.7% similar) in 143 aa overlap (64-205:181-317) 40 50 60 70 80 90 pF1KE5 EMPTVQHPFLNVFELERLLYTGKTACNHADEVWPGLYLGDQDMANNRRELRRLGITHVLN :. : ::::. :.. . : : :: .:: CCDS75 IESERALISQCGKPVVNVSYRPAYDQGGPVEILPFLYLGSAYHASKCEFLANLHITALLN 160 170 180 190 200 210 100 110 120 130 140 150 pF1KE5 ASHSRWRGTPEAYEGLGIRYLGVEAHDSPAFDMSIHFQTAADFIHRALSQPGGKILVHCA .: : : :: . ..: . ..:: . :.: ::: : ::: . . :::.:::: CCDS75 VS----RRTSEAC-ATHLHYKWIPVEDSHTADISSHFQEAIDFID-CVREKGGKVLVHCE 220 230 240 250 260 160 170 180 190 200 210 pF1KE5 VGVSRSATLVLAYLMLYHHLTLVEAIKKVKDHRGII-PNRGFLRQLLALDRRLRQGLEA .:.::: :. .:::: ... : ::. .:..:... :: ::. ::: . .. CCDS75 AGISRSPTICMAYLMKTKQFRLKEAFDYIKQRRSMVSPNFGFMGQLLQYESEILPSTPNP 270 280 290 300 310 320 CCDS75 QPPSCQGEAAGSSLIGHLQTLSPDMQGAYCTFPASVLAPVPTHSTVSELSRSPVATATSC 330 340 350 360 370 380 211 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 22:45:11 2016 done: Mon Nov 7 22:45:11 2016 Total Scan time: 1.400 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]