FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5278, 235 aa 1>>>pF1KE5278 235 - 235 aa - 235 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.5630+/-0.000653; mu= 13.9876+/- 0.039 mean_var=85.9513+/-16.743, 0's: 0 Z-trim(112.5): 52 B-trim: 91 in 1/52 Lambda= 0.138340 statistics sampled from 13162 (13214) to 13162 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.786), E-opt: 0.2 (0.406), width: 16 Scan time: 2.450 The best scores are: opt bits E(32554) CCDS13193.1 DUSP15 gene_id:128853|Hs108|chr20 ( 235) 1604 329.2 1.5e-90 CCDS82607.1 DUSP15 gene_id:128853|Hs108|chr20 ( 232) 1557 319.8 9.7e-88 CCDS82606.1 DUSP15 gene_id:128853|Hs108|chr20 ( 295) 953 199.3 2.3e-51 CCDS42862.1 DUSP15 gene_id:128853|Hs108|chr20 ( 132) 895 187.5 3.7e-48 CCDS69035.1 DUSP22 gene_id:56940|Hs108|chr6 ( 205) 687 146.1 1.6e-35 CCDS4468.1 DUSP22 gene_id:56940|Hs108|chr6 ( 184) 686 145.9 1.7e-35 CCDS13883.1 DUSP18 gene_id:150290|Hs108|chr22 ( 188) 310 70.8 6.9e-13 CCDS1528.1 DUSP10 gene_id:11221|Hs108|chr1 ( 482) 311 71.3 1.2e-12 CCDS2289.1 DUSP19 gene_id:142679|Hs108|chr2 ( 217) 306 70.1 1.3e-12 CCDS7724.1 DUSP8 gene_id:1850|Hs108|chr11 ( 625) 306 70.4 3e-12 CCDS33418.1 DUSP28 gene_id:285193|Hs108|chr2 ( 176) 298 68.4 3.4e-12 CCDS7566.1 DUSP5 gene_id:1847|Hs108|chr10 ( 384) 300 69.1 4.7e-12 CCDS8157.1 SSH3 gene_id:54961|Hs108|chr11 ( 659) 297 68.7 1.1e-11 CCDS1234.1 DUSP12 gene_id:11266|Hs108|chr1 ( 340) 292 67.4 1.3e-11 CCDS9711.1 STYX gene_id:6815|Hs108|chr14 ( 223) 287 66.3 1.9e-11 CCDS8650.1 DUSP16 gene_id:80824|Hs108|chr12 ( 665) 288 66.9 3.8e-11 >>CCDS13193.1 DUSP15 gene_id:128853|Hs108|chr20 (235 aa) initn: 1604 init1: 1604 opt: 1604 Z-score: 1739.3 bits: 329.2 E(32554): 1.5e-90 Smith-Waterman score: 1604; 99.6% identity (100.0% similar) in 235 aa overlap (1-235:1-235) 10 20 30 40 50 60 pF1KE5 MGNGMTKVLPGLYLGNFIDAKDLDQLGRNKITHIISIHESPQPLLQDITYLRIPVADTPE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 MGNGMTKVLPGLYLGNFIDAKDLDQLGRNKITHIISIHESPQPLLQDITYLRIPVADTPE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 VPIKKHFKECINFIHCCRLNGGNCLVHCFAGISRSTTIVTAYVMTVTGLGWRDVLEAIKA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 VPIKKHFKECINFIHCCRLNGGNCLVHCFAGISRSTTIVTAYVMTVTGLGWRDVLEAIKA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 TRPIANPNPGFRQQLEEFGWASSQKLRRQLEERFGESPFRDEEELRALLPLCKRCRQGSA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 TRPIANPNPGFRQQLEEFGWASSQKLRRQLEERFGESPFRDEEELRALLPLCKRCRQGSA 130 140 150 160 170 180 190 200 210 220 230 pF1KE5 TSASSAGPHSAASEGTLQRLVPRTPREAHRPLPLLARVKQTFSCLPRCLSRKGGK ::::::::::::::::.:::::::::::::::::::::::::::::::::::::: CCDS13 TSASSAGPHSAASEGTVQRLVPRTPREAHRPLPLLARVKQTFSCLPRCLSRKGGK 190 200 210 220 230 >>CCDS82607.1 DUSP15 gene_id:128853|Hs108|chr20 (232 aa) initn: 1556 init1: 1556 opt: 1557 Z-score: 1688.7 bits: 319.8 E(32554): 9.7e-88 Smith-Waterman score: 1557; 98.7% identity (99.6% similar) in 232 aa overlap (5-235:1-232) 10 20 30 40 50 pF1KE5 MGNGMTK-VLPGLYLGNFIDAKDLDQLGRNKITHIISIHESPQPLLQDITYLRIPVADTP ::. :::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 MTEGVLPGLYLGNFIDAKDLDQLGRNKITHIISIHESPQPLLQDITYLRIPVADTP 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 EVPIKKHFKECINFIHCCRLNGGNCLVHCFAGISRSTTIVTAYVMTVTGLGWRDVLEAIK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 EVPIKKHFKECINFIHCCRLNGGNCLVHCFAGISRSTTIVTAYVMTVTGLGWRDVLEAIK 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 ATRPIANPNPGFRQQLEEFGWASSQKLRRQLEERFGESPFRDEEELRALLPLCKRCRQGS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 ATRPIANPNPGFRQQLEEFGWASSQKLRRQLEERFGESPFRDEEELRALLPLCKRCRQGS 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE5 ATSASSAGPHSAASEGTLQRLVPRTPREAHRPLPLLARVKQTFSCLPRCLSRKGGK :::::::::::::::::.:::::::::::::::::::::::::::::::::::::: CCDS82 ATSASSAGPHSAASEGTVQRLVPRTPREAHRPLPLLARVKQTFSCLPRCLSRKGGK 180 190 200 210 220 230 >>CCDS82606.1 DUSP15 gene_id:128853|Hs108|chr20 (295 aa) initn: 952 init1: 952 opt: 953 Z-score: 1035.8 bits: 199.3 E(32554): 2.3e-51 Smith-Waterman score: 953; 96.6% identity (97.9% similar) in 146 aa overlap (5-149:1-146) 10 20 30 40 50 pF1KE5 MGNGMTK-VLPGLYLGNFIDAKDLDQLGRNKITHIISIHESPQPLLQDITYLRIPVADTP ::. :::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 MTEGVLPGLYLGNFIDAKDLDQLGRNKITHIISIHESPQPLLQDITYLRIPVADTP 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 EVPIKKHFKECINFIHCCRLNGGNCLVHCFAGISRSTTIVTAYVMTVTGLGWRDVLEAIK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 EVPIKKHFKECINFIHCCRLNGGNCLVHCFAGISRSTTIVTAYVMTVTGLGWRDVLEAIK 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 ATRPIANPNPGFRQQLEEFGWASSQKLRRQLEERFGESPFRDEEELRALLPLCKRCRQGS :::::::::::::::::::::::::: :. CCDS82 ATRPIANPNPGFRQQLEEFGWASSQKGARHRTSKTSGAQCPPMTSATCLLAARVALLSAA 120 130 140 150 160 170 >>CCDS42862.1 DUSP15 gene_id:128853|Hs108|chr20 (132 aa) initn: 895 init1: 895 opt: 895 Z-score: 978.0 bits: 187.5 E(32554): 3.7e-48 Smith-Waterman score: 895; 99.2% identity (100.0% similar) in 132 aa overlap (104-235:1-132) 80 90 100 110 120 130 pF1KE5 IHCCRLNGGNCLVHCFAGISRSTTIVTAYVMTVTGLGWRDVLEAIKATRPIANPNPGFRQ :::::::::::::::::::::::::::::: CCDS42 MTVTGLGWRDVLEAIKATRPIANPNPGFRQ 10 20 30 140 150 160 170 180 190 pF1KE5 QLEEFGWASSQKLRRQLEERFGESPFRDEEELRALLPLCKRCRQGSATSASSAGPHSAAS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 QLEEFGWASSQKLRRQLEERFGESPFRDEEELRALLPLCKRCRQGSATSASSAGPHSAAS 40 50 60 70 80 90 200 210 220 230 pF1KE5 EGTLQRLVPRTPREAHRPLPLLARVKQTFSCLPRCLSRKGGK :::.:::::::::::::::::::::::::::::::::::::: CCDS42 EGTVQRLVPRTPREAHRPLPLLARVKQTFSCLPRCLSRKGGK 100 110 120 130 >>CCDS69035.1 DUSP22 gene_id:56940|Hs108|chr6 (205 aa) initn: 686 init1: 686 opt: 687 Z-score: 751.0 bits: 146.1 E(32554): 1.6e-35 Smith-Waterman score: 687; 53.9% identity (80.3% similar) in 178 aa overlap (1-178:1-176) 10 20 30 40 50 60 pF1KE5 MGNGMTKVLPGLYLGNFIDAKDLDQLGRNKITHIISIHESPQPLLQDITYLRIPVADTPE :::::.:.:::::.::: ::.: .::..::.:::.:.:.: .:.:. . :: ::.::.: CCDS69 MGNGMNKILPGLYIGNFKDARDAEQLSKNKVTHILSVHDSARPMLEGVKYLCIPAADSPS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 VPIKKHFKECINFIHCCRLNGGNCLVHCFAGISRSTTIVTAYVMTVTGLGWRDVLEAIKA . .:::: :.::: ::: : .:::::.::.:::.:.: ::.:::: .::.:.:....: CCDS69 QNLTRHFKESIKFIHECRLRGESCLVHCLAGVSRSVTLVIAYIMTVTDFGWEDALHTVRA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 TRPIANPNPGFRQQLEEFGWASSQKLRRQLEERFGESPFRDEEELRALLPLCKRCRQGSA : :::: ::..::.:: .. :. :.:..::::..: :: . .: : .:: CCDS69 GRSCANPNVGFQRQLQEFEKHEVHQYRQWLKEEYGESPLQDAEEAKNILG--KYKEQGRT 130 140 150 160 170 190 200 210 220 230 pF1KE5 TSASSAGPHSAASEGTLQRLVPRTPREAHRPLPLLARVKQTFSCLPRCLSRKGGK CCDS69 EPQPGARRWSSFPALAPLTYDNYTTET 180 190 200 >>CCDS4468.1 DUSP22 gene_id:56940|Hs108|chr6 (184 aa) initn: 686 init1: 686 opt: 686 Z-score: 750.6 bits: 145.9 E(32554): 1.7e-35 Smith-Waterman score: 686; 55.0% identity (82.2% similar) in 169 aa overlap (1-169:1-169) 10 20 30 40 50 60 pF1KE5 MGNGMTKVLPGLYLGNFIDAKDLDQLGRNKITHIISIHESPQPLLQDITYLRIPVADTPE :::::.:.:::::.::: ::.: .::..::.:::.:.:.: .:.:. . :: ::.::.: CCDS44 MGNGMNKILPGLYIGNFKDARDAEQLSKNKVTHILSVHDSARPMLEGVKYLCIPAADSPS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 VPIKKHFKECINFIHCCRLNGGNCLVHCFAGISRSTTIVTAYVMTVTGLGWRDVLEAIKA . .:::: :.::: ::: : .:::::.::.:::.:.: ::.:::: .::.:.:....: CCDS44 QNLTRHFKESIKFIHECRLRGESCLVHCLAGVSRSVTLVIAYIMTVTDFGWEDALHTVRA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 TRPIANPNPGFRQQLEEFGWASSQKLRRQLEERFGESPFRDEEELRALLPLCKRCRQGSA : :::: ::..::.:: .. :. :.:..::::..: :: . .: CCDS44 GRSCANPNVGFQRQLQEFEKHEVHQYRQWLKEEYGESPLQDAEEAKNILAAPGILKFWAF 130 140 150 160 170 180 190 200 210 220 230 pF1KE5 TSASSAGPHSAASEGTLQRLVPRTPREAHRPLPLLARVKQTFSCLPRCLSRKGGK CCDS44 LRRL >>CCDS13883.1 DUSP18 gene_id:150290|Hs108|chr22 (188 aa) initn: 234 init1: 234 opt: 310 Z-score: 344.9 bits: 70.8 E(32554): 6.9e-13 Smith-Waterman score: 310; 31.0% identity (62.6% similar) in 171 aa overlap (3-171:18-188) 10 20 30 40 pF1KE5 MGNGMTKVLPGLYLGNFIDAKDLDQLGRNKITHIISIH-ESPQPL .:.... .::..: . :.. .:. :.:: .:.. : . : CCDS13 MTAPSCAFPVQFRQPSVSGLSQITKSLYISNGVAANNKLMLSSNQITMVINVSVEVVNTL 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE5 LQDITYLRIPVADTPEVPIKKHFKECINFIHCCRLNGGNCLVHCFAGISRSTTIVTAYVM .:: :...::::.:. . : . :: ... : :.:: ::.:::... ::.: CCDS13 YEDIQYMQVPVADSPNSRLCDFFDPIADHIHSVEMKQGRTLLHCAAGVSRSAALCLAYLM 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE5 TVTGLGWRDVLEAIKATRPIANPNPGFRQQLEEFGWAS-SQKLRRQLEERFGESPFRDEE ... :. :. ::: :: :: .:: .. . ... ... : : :. CCDS13 KYHAMSLLDAHTWTKSCRPIIRPNSGFWEQLIHYEFQLFGKNTVHMVSSPVGMIPDIYEK 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE5 ELRALLPLCKRCRQGSATSASSAGPHSAASEGTLQRLVPRTPREAHRPLPLLARVKQTFS :.: ..:: CCDS13 EVRLMIPL >>CCDS1528.1 DUSP10 gene_id:11221|Hs108|chr1 (482 aa) initn: 279 init1: 201 opt: 311 Z-score: 340.3 bits: 71.3 E(32554): 1.2e-12 Smith-Waterman score: 311; 38.4% identity (67.4% similar) in 138 aa overlap (5-138:322-458) 10 20 30 pF1KE5 MGNGMTKVLPGLYLGNFIDAKDLDQLGRNKITHI .: .:: :.::: ::.::: . : .: .. CCDS15 ECREVGGGASAASSLLPQPIPTTPDIENAELTPILPFLFLGNEQDAQDLDTMQRLNIGYV 300 310 320 330 340 350 40 50 60 70 80 90 pF1KE5 ISIHESPQPLLQD----ITYLRIPVADTPEVPIKKHFKECINFIHCCRLNGGNCLVHCFA :.. . :: . ..: :.:..:. . ....:.: ..::. . : . :.:: : CCDS15 INV-TTHLPLYHYEKGLFNYKRLPATDSNKQNLRQYFEEAFEFIEEAHQCGKGLLIHCQA 360 370 380 390 400 410 100 110 120 130 140 150 pF1KE5 GISRSTTIVTAYVMTVTGLGWRDVLEAIKATRPIANPNPGFRQQLEEFGWASSQKLRRQL :.:::.::: ::.: : . :. . .:. ::: .:: .: :: :: CCDS15 GVSRSATIVIAYLMKHTRMTMTDAYKFVKGKRPIISPNLNFMGQLLEFEEDLNNGVTPRI 420 430 440 450 460 470 160 170 180 190 200 210 pF1KE5 EERFGESPFRDEEELRALLPLCKRCRQGSATSASSAGPHSAASEGTLQRLVPRTPREAHR CCDS15 LTPKLMGVETVV 480 >>CCDS2289.1 DUSP19 gene_id:142679|Hs108|chr2 (217 aa) initn: 302 init1: 217 opt: 306 Z-score: 339.7 bits: 70.1 E(32554): 1.3e-12 Smith-Waterman score: 306; 36.6% identity (65.5% similar) in 145 aa overlap (8-149:69-213) 10 20 30 pF1KE5 MGNGMTKVLPGLYLGNFIDAKDLDQLGRNKITHIISI . : : ::. :.::: : .::.:::... CCDS22 RIHVVEEVEPSSGGGCGYVQDLSSDLQVGVIKPWLLLGSQDAAHDLDTLKKNKVTHILNV 40 50 60 70 80 90 40 50 60 70 80 90 pF1KE5 -HESPQPLLQDITYLRIPVADTPEVPIKKHFKECINFIHCCRLNGGNCLVHCFAGISRST . . .:.:.:: : . : ::. : ..: ::..::. . . : :::: ::.::.. CCDS22 AYGVENAFLSDFTYKSISILDLPETNILSYFPECFEFIEEAKRKDGVVLVHCNAGVSRAA 100 110 120 130 140 150 100 110 120 130 140 150 pF1KE5 TIVTAYVMTVTGLGWRDVLEAIKATRPIANPNPGFRQQLEEF--GWASSQKLRRQLEERF .:: ...:. .. ... .: .:: :: :: .::. . : :.. : : CCDS22 AIVIGFLMNSEQTSFTSAFSLVKNARPSICPNSGFMEQLRTYQEGKESNKCDRIQENSS 160 170 180 190 200 210 160 170 180 190 200 210 pF1KE5 GESPFRDEEELRALLPLCKRCRQGSATSASSAGPHSAASEGTLQRLVPRTPREAHRPLPL >>CCDS7724.1 DUSP8 gene_id:1850|Hs108|chr11 (625 aa) initn: 260 init1: 180 opt: 306 Z-score: 333.4 bits: 70.4 E(32554): 3e-12 Smith-Waterman score: 306; 32.7% identity (58.7% similar) in 196 aa overlap (4-197:160-352) 10 20 30 pF1KE5 MGNGMTKVLPGLYLGNFIDAKDLDQLGRNKITH :.:..:: ::::. :. . : . .: :.. CCDS77 SSCFPGLCEGKPAALLPMSLSQPCLPVPSVGLTRILPHLYLGSQKDVLNKDLMTQNGISY 130 140 150 160 170 180 40 50 60 70 80 90 pF1KE5 IISIHES-PQP-LLQDITYLRIPVADTPEVPIKKHFKECINFIHCCRLNGGNCLVHCFAG ... .: :.: .. . ..:.:. :. . . . :.:: .:.. . .:::.:: CCDS77 VLNASNSCPKPDFICESRFMRVPINDNYCEKLLPWLDKSIEFIDKAKLSSCQVIVHCLAG 190 200 210 220 230 240 100 110 120 130 140 150 pF1KE5 ISRSTTIVTAYVMTVTGLGWRDVLEAIKATRPIANPNPGFRQQLEEFGWASSQKLRRQLE ::::.::. ::.: . :.. :. . .: :: .:: .: :: :. : :: :. CCDS77 ISRSATIAIAYIMKTMGMSSDDAYRFVKDRRPSISPNFNFLGQLLEY--ERSLKLLAALQ 250 260 270 280 290 300 160 170 180 190 200 210 pF1KE5 ERFGESPFRDEEELRALLPLCKRCRQGSATSASSAGPHSAASEGTLQRLVPRTPREAHRP : .: : . : :: :.: ..:: :: : CCDS77 GDPG-TPSGTPEPPPSPAAGAPLPRLPPPTSESAATGNAAAREGGLSAGGEPPAPPTPPA 310 320 330 340 350 360 220 230 pF1KE5 LPLLARVKQTFSCLPRCLSRKGGK CCDS77 TSALQQGLRGLHLSSDRLQDTNRLKRSFSLDIKSAYAPSRRPDGPGPPDPGEAPKLCKLD 370 380 390 400 410 420 235 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 23:21:29 2016 done: Mon Nov 7 23:21:30 2016 Total Scan time: 2.450 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]