FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE5212, 185 aa
1>>>pF1KE5212 185 - 185 aa - 185 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.9523+/-0.000894; mu= 8.8855+/- 0.053
mean_var=63.8080+/-12.749, 0's: 0 Z-trim(105.5): 63 B-trim: 88 in 1/49
Lambda= 0.160560
statistics sampled from 8373 (8436) to 8373 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.654), E-opt: 0.2 (0.259), width: 16
Scan time: 1.850
The best scores are: opt bits E(32554)
CCDS11469.1 DUSP3 gene_id:1845|Hs108|chr17 ( 185) 1217 290.5 4e-79
CCDS53542.1 DUSP13 gene_id:51207|Hs108|chr10 ( 188) 465 116.3 1.1e-26
CCDS31223.1 DUPD1 gene_id:338599|Hs108|chr10 ( 220) 452 113.3 1e-25
CCDS7346.1 DUSP13 gene_id:51207|Hs108|chr10 ( 198) 430 108.2 3.2e-24
CCDS31224.1 DUSP13 gene_id:51207|Hs108|chr10 ( 248) 430 108.2 4e-24
CCDS6092.1 DUSP26 gene_id:78986|Hs108|chr8 ( 211) 408 103.1 1.2e-22
CCDS30932.1 DUSP27 gene_id:92235|Hs108|chr1 (1158) 411 103.9 3.7e-22
CCDS14724.1 DUSP9 gene_id:1852|Hs108|chrX ( 384) 285 74.7 7.8e-14
CCDS1528.1 DUSP10 gene_id:11221|Hs108|chr1 ( 482) 283 74.2 1.3e-13
CCDS33766.2 DUSP7 gene_id:1849|Hs108|chr3 ( 419) 278 73.0 2.6e-13
CCDS9033.1 DUSP6 gene_id:1848|Hs108|chr12 ( 381) 268 70.7 1.2e-12
CCDS7566.1 DUSP5 gene_id:1847|Hs108|chr10 ( 384) 251 66.8 1.8e-11
CCDS9711.1 STYX gene_id:6815|Hs108|chr14 ( 223) 240 64.2 6.4e-11
>>CCDS11469.1 DUSP3 gene_id:1845|Hs108|chr17 (185 aa)
initn: 1217 init1: 1217 opt: 1217 Z-score: 1534.0 bits: 290.5 E(32554): 4e-79
Smith-Waterman score: 1217; 100.0% identity (100.0% similar) in 185 aa overlap (1-185:1-185)
10 20 30 40 50 60
pF1KE5 MSGSFELSVQDLNDLLSDGSGCYSLPSQPCNEVTPRIYVGNASVAQDIPKLQKLGITHVL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MSGSFELSVQDLNDLLSDGSGCYSLPSQPCNEVTPRIYVGNASVAQDIPKLQKLGITHVL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE5 NAAEGRSFMHVNTNANFYKDSGITYLGIKANDTQEFNLSAYFERAADFIDQALAQKNGRV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 NAAEGRSFMHVNTNANFYKDSGITYLGIKANDTQEFNLSAYFERAADFIDQALAQKNGRV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE5 LVHCREGYSRSPTLVIAYLMMRQKMDVKSALSIVRQNREIGPNDGFLAQLCQLNDRLAKE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 LVHCREGYSRSPTLVIAYLMMRQKMDVKSALSIVRQNREIGPNDGFLAQLCQLNDRLAKE
130 140 150 160 170 180
pF1KE5 GKLKP
:::::
CCDS11 GKLKP
>>CCDS53542.1 DUSP13 gene_id:51207|Hs108|chr10 (188 aa)
initn: 432 init1: 331 opt: 465 Z-score: 592.5 bits: 116.3 E(32554): 1.1e-26
Smith-Waterman score: 465; 44.3% identity (73.9% similar) in 176 aa overlap (8-182:20-187)
10 20 30 40
pF1KE5 MSGSFELSVQDLNDLLSDG-SGCYSLPSQPCNEVTPRIYVGNASVAQD
:. .:..:: : :.: . .:: : ...:.:..:..
CCDS53 MAETSLPELGGEDKATPCPSILELEELLRAGKSSCSRV-----DEVWPNLFIGDAATANN
10 20 30 40 50
50 60 70 80 90 100
pF1KE5 IPKLQKLGITHVLNAAEGRSFMHVNTNANFYKDSGITYLGIKANDTQEFNLSAYFERAAD
.: :::::::::::. . .. . . .:: :...:::. :.: .:..:::: :::
CCDS53 RFELWKLGITHVLNAAH--KGLYCQGGPDFY-GSSVSYLGVPAHDLPDFDISAYFSSAAD
60 70 80 90 100 110
110 120 130 140 150 160
pF1KE5 FIDQALAQKNGRVLVHCREGYSRSPTLVIAYLMMRQKMDVKSALSIVRQNREIGPNDGFL
:: .:: ...::::: : ::: :::.::::..:......:. :::.: . :: :::
CCDS53 FIHRALNTPGAKVLVHCVVGVSRSATLVLAYLMLHQRLSLRQAVITVRQHRWVFPNRGFL
120 130 140 150 160 170
170 180
pF1KE5 AQLCQLNDRLAKEGKLKP
:::.:...: :.
CCDS53 HQLCRLDQQLRGAGQS
180
>>CCDS31223.1 DUPD1 gene_id:338599|Hs108|chr10 (220 aa)
initn: 372 init1: 246 opt: 452 Z-score: 575.1 bits: 113.3 E(32554): 1e-25
Smith-Waterman score: 453; 43.8% identity (70.8% similar) in 178 aa overlap (3-180:37-203)
10 20 30
pF1KE5 MSGSFELSVQDLNDLLSDGSGCYSLPSQPCNE
:.::: . :. :: :. ::
CCDS31 KTSLKNAYSSAKRLSPKMEEEGEEEDYCTPGAFEL-----ERLFWKGSPQYT----HVNE
10 20 30 40 50
40 50 60 70 80 90
pF1KE5 VTPRIYVGNASVAQDIPKLQKLGITHVLNAAEGRSFMHVNTNANFYKDSGITYLGIKAND
: :..:.:. ..: : .::: :.:::::::.:: .:.:. ..:.: : : :..:.:
CCDS31 VWPKLYIGDEATALDRYRLQKAGFTHVLNAAHGR--WNVDTGPDYYRDMDIQYHGVEADD
60 70 80 90 100 110
100 110 120 130 140 150
pF1KE5 TQEFNLSAYFERAADFIDQALAQKNGRVLVHCREGYSRSPTLVIAYLMMRQKMDVKSALS
:.::..: :: :::.::.. ....:::: : ::: :::.::::... : . .:..
CCDS31 LPTFDLSVFFYPAAAFIDRALSDDHSKILVHCVMGRSRSATLVLAYLMIHKDMTLVDAIQ
120 130 140 150 160 170
160 170 180
pF1KE5 IVRQNREIGPNDGFLAQLCQLNDRLAKEGKLKP
: .:: . :: ::: :: .:. .:...
CCDS31 QVAKNRCVLPNRGFLKQLRELDKQLVQQRRRSQRQDGEEEDGREL
180 190 200 210 220
>>CCDS7346.1 DUSP13 gene_id:51207|Hs108|chr10 (198 aa)
initn: 410 init1: 301 opt: 430 Z-score: 548.3 bits: 108.2 E(32554): 3.2e-24
Smith-Waterman score: 430; 47.3% identity (76.0% similar) in 150 aa overlap (31-180:47-194)
10 20 30 40 50 60
pF1KE5 MSGSFELSVQDLNDLLSDGSGCYSLPSQPCNEVTPRIYVGNASVAQDIPKLQKLGITHVL
.:: : ...:.: .:.: :: .::::::.
CCDS73 AVQASPYQPPTLASLQRLLWVRQAATLNHIDEVWPSLFLGDAYAARDKSKLIQLGITHVV
20 30 40 50 60 70
70 80 90 100 110 120
pF1KE5 NAAEGRSFMHVNTNANFYKDSGITYLGIKANDTQEFNLSAYFERAADFIDQALAQKNGRV
::: :. ..:.:.:.::. .. : ::.:.:. :.::.:: .: .: ::. .:::
CCDS73 NAAAGK--FQVDTGAKFYRGMSLEYYGIEADDNPFFDLSVYFLPVARYIRAALSVPQGRV
80 90 100 110 120 130
130 140 150 160 170 180
pF1KE5 LVHCREGYSRSPTLVIAYLMMRQKMDVKSALSIVRQNREIGPNDGFLAQLCQLNDRLAKE
:::: : ::: :::.:.::. ..: . :.. :. .:.: ::.::: :: :..::..:
CCDS73 LVHCAMGVSRSATLVLAFLMICENMTLVEAIQTVQAHRNICPNSGFLRQLQVLDNRLGRE
140 150 160 170 180 190
pF1KE5 GKLKP
CCDS73 TGRF
>>CCDS31224.1 DUSP13 gene_id:51207|Hs108|chr10 (248 aa)
initn: 410 init1: 301 opt: 430 Z-score: 546.6 bits: 108.2 E(32554): 4e-24
Smith-Waterman score: 430; 47.3% identity (76.0% similar) in 150 aa overlap (31-180:97-244)
10 20 30 40 50 60
pF1KE5 MSGSFELSVQDLNDLLSDGSGCYSLPSQPCNEVTPRIYVGNASVAQDIPKLQKLGITHVL
.:: : ...:.: .:.: :: .::::::.
CCDS31 AVQASPYQPPTLASLQRLLWVRQAATLNHIDEVWPSLFLGDAYAARDKSKLIQLGITHVV
70 80 90 100 110 120
70 80 90 100 110 120
pF1KE5 NAAEGRSFMHVNTNANFYKDSGITYLGIKANDTQEFNLSAYFERAADFIDQALAQKNGRV
::: :. ..:.:.:.::. .. : ::.:.:. :.::.:: .: .: ::. .:::
CCDS31 NAAAGK--FQVDTGAKFYRGMSLEYYGIEADDNPFFDLSVYFLPVARYIRAALSVPQGRV
130 140 150 160 170 180
130 140 150 160 170 180
pF1KE5 LVHCREGYSRSPTLVIAYLMMRQKMDVKSALSIVRQNREIGPNDGFLAQLCQLNDRLAKE
:::: : ::: :::.:.::. ..: . :.. :. .:.: ::.::: :: :..::..:
CCDS31 LVHCAMGVSRSATLVLAFLMICENMTLVEAIQTVQAHRNICPNSGFLRQLQVLDNRLGRE
190 200 210 220 230 240
pF1KE5 GKLKP
CCDS31 TGRF
>>CCDS6092.1 DUSP26 gene_id:78986|Hs108|chr8 (211 aa)
initn: 416 init1: 321 opt: 408 Z-score: 520.3 bits: 103.1 E(32554): 1.2e-22
Smith-Waterman score: 412; 42.0% identity (70.5% similar) in 176 aa overlap (7-181:43-208)
10 20 30
pF1KE5 MSGSFELSVQDLNDLLSDG-SGCYSLPSQPCNEVTP
:.: .:. :: : ..: . .:: :
CCDS60 FMARFSRSSSRSPVRTRGTLEEMPTVQHPFLNVFELERLLYTGKTAC-----NHADEVWP
20 30 40 50 60
40 50 60 70 80 90
pF1KE5 RIYVGNASVAQDIPKLQKLGITHVLNAAEGRSFMHVNTNANFYKDSGITYLGIKANDTQE
.:.:. ..:.. .:..:::::::::...: . . :. :: :::..:.:.
CCDS60 GLYLGDQDMANNRRELRRLGITHVLNASHSR----WRGTPEAYEGLGIRYLGVEAHDSPA
70 80 90 100 110 120
100 110 120 130 140 150
pF1KE5 FNLSAYFERAADFIDQALAQKNGRVLVHCREGYSRSPTLVIAYLMMRQKMDVKSALSIVR
:..: .:. ::::: .::.: .:..:::: : ::: :::.::::. ... . :.. :.
CCDS60 FDMSIHFQTAADFIHRALSQPGGKILVHCAVGVSRSATLVLAYLMLYHHLTLVEAIKKVK
130 140 150 160 170 180
160 170 180
pF1KE5 QNREIGPNDGFLAQLCQLNDRLAKEGKLKP
..: : :: ::: :: :. :: ..:
CCDS60 DHRGIIPNRGFLRQLLALDRRL-RQGLEA
190 200 210
>>CCDS30932.1 DUSP27 gene_id:92235|Hs108|chr1 (1158 aa)
initn: 429 init1: 298 opt: 411 Z-score: 511.4 bits: 103.9 E(32554): 3.7e-22
Smith-Waterman score: 411; 43.2% identity (71.6% similar) in 155 aa overlap (26-180:130-281)
10 20 30 40 50
pF1KE5 MSGSFELSVQDLNDLLSDGSGCYSLPSQPCNEVTPRIYVGNASVAQDIPKLQKLG
: . .:: : ..... ::: . .:..::
CCDS30 VREKMDDTSLYNTPCVLDLQRALVQDRQEAPWNEVDEVWPNVFIAEKSVAVNKGRLKRLG
100 110 120 130 140 150
60 70 80 90 100 110
pF1KE5 ITHVLNAAEGRSFMHVNTNANFYKDSGITYLGIKANDTQEFNLSAYFERAADFIDQALAQ
:::.::::.: . : :. .:: : :::....: : ..: .:..:..:.:.::
CCDS30 ITHILNAAHGTG---VYTGPEFYTGLEIQYLGVEVDDFPEVDISQHFRKASEFLDEALLT
160 170 180 190 200 210
120 130 140 150 160 170
pF1KE5 KNGRVLVHCREGYSRSPTLVIAYLMMRQKMDVKSALSIVRQNREIGPNDGFLAQLCQLND
:.::: . : ::: .::.::::. ..: . :: ::..: : ::.::: :: .::.
CCDS30 YRGKVLVSSEMGISRSAVLVVAYLMIFHNMAILEALMTVRKKRAIYPNEGFLKQLRELNE
220 230 240 250 260 270
180
pF1KE5 RLAKEGKLKP
.: .:
CCDS30 KLMEEREEDYGREGGSAEAEEGEGTGSMLGARVHALTVEEEDDSASHLSGSSLGKATQAS
280 290 300 310 320 330
>>CCDS14724.1 DUSP9 gene_id:1852|Hs108|chrX (384 aa)
initn: 216 init1: 113 opt: 285 Z-score: 361.8 bits: 74.7 E(32554): 7.8e-14
Smith-Waterman score: 286; 36.1% identity (65.2% similar) in 158 aa overlap (24-180:202-347)
10 20 30 40 50
pF1KE5 MSGSFELSVQDLNDLLSDGSGCYSLPSQPCNEVTPRIYVGNASVAQDIPKLQK
:.: : . : .:.:.: . .. .: :
CCDS14 SDAESEADRDSMSCGLDSEGATPPPVGLRASFPVQ----ILPNLYLGSARDSANLESLAK
180 190 200 210 220
60 70 80 90 100 110
pF1KE5 LGITHVLNAAEGRSFMHVNTNANFYKDSGITYLGIKANDTQEFNLSAYFERAADFIDQAL
::: ..::.. : : :.. . : : .: ::: .: .: .:::.::
CCDS14 LGIRYILNVTP-------NLPNFFEKNGDFHYKQIPISDHWSQNLSRFFPEAIEFIDEAL
230 240 250 260 270 280
120 130 140 150 160 170
pF1KE5 AQKNGRVLVHCREGYSRSPTLVIAYLMMRQKMDVKSALSIV-RQNREIGPNDGFLAQLCQ
.:. : ::::: : ::: :...::::.. ......: ..: :.. .:.:: .:..:: .
CCDS14 SQNCG-VLVHCLAGVSRSVTVTVAYLMQKLHLSLNDAYDLVKRKKSNISPNFNFMGQLLD
290 300 310 320 330
180
pF1KE5 LNDRLAKEGKLKP
.. : :
CCDS14 FERSLRLEERHSQEQGSGGQASAASNPPSFFTTPTSDGAFELAPT
340 350 360 370 380
>>CCDS1528.1 DUSP10 gene_id:11221|Hs108|chr1 (482 aa)
initn: 248 init1: 115 opt: 283 Z-score: 357.7 bits: 74.2 E(32554): 1.3e-13
Smith-Waterman score: 283; 34.9% identity (68.5% similar) in 146 aa overlap (33-177:325-462)
10 20 30 40 50 60
pF1KE5 GSFELSVQDLNDLLSDGSGCYSLPSQPCNEVTPRIYVGNASVAQDIPKLQKLGITHVLNA
. : ...:: . :::. .:.:.: .:.:.
CCDS15 EVGGGASAASSLLPQPIPTTPDIENAELTPILPFLFLGNEQDAQDLDTMQRLNIGYVINV
300 310 320 330 340 350
70 80 90 100 110 120
pF1KE5 AEGRSFMHVNTNANFYKDSGITYLGIKANDTQEFNLSAYFERAADFIDQALAQKNGRVLV
. ..: :. . ..: . :.:... :: :::.: .::..: .: .:.
CCDS15 TTHLPLYH-------YEKGLFNYKRLPATDSNKQNLRQYFEEAFEFIEEAHQCGKG-LLI
360 370 380 390 400
130 140 150 160 170 180
pF1KE5 HCREGYSRSPTLVIAYLMMRQKMDVKSALSIVRQNREI-GPNDGFLAQLCQLNDRLAKEG
::. : ::: :.:::::: . .: . .: ..:. .: : .:: .:..:: .... :
CCDS15 HCQAGVSRSATIVIAYLMKHTRMTMTDAYKFVKGKRPIISPNLNFMGQLLEFEEDLNNGV
410 420 430 440 450 460
pF1KE5 KLKP
CCDS15 TPRILTPKLMGVETVV
470 480
>>CCDS33766.2 DUSP7 gene_id:1849|Hs108|chr3 (419 aa)
initn: 268 init1: 113 opt: 278 Z-score: 352.4 bits: 73.0 E(32554): 2.6e-13
Smith-Waterman score: 279; 35.6% identity (64.4% similar) in 163 aa overlap (17-177:232-385)
10 20 30 40
pF1KE5 MSGSFELSVQDLNDLLSDGSGC-YSLPSQPCNEVTPRIYVGNASVA
:::: : :. : .. : .:.: :. .
CCDS33 TSVLGLGGLRISSDCSDGESDRELPSSATESDGSPVPSSQPAFPV-QILPYLYLGCAKDS
210 220 230 240 250 260
50 60 70 80 90 100
pF1KE5 QDIPKLQKLGITHVLNAAEGRSFMHVNTNANFYKDSGITYLGIKANDTQEFNLSAYFERA
.. : : :: ..::.. : : . . .:: : .: ::: .: .:
CCDS33 TNLDVLGKYGIKYILNVTP-------NLPNAFEHGGEFTYKQIPISDHWSQNLSQFFPEA
270 280 290 300 310
110 120 130 140 150 160
pF1KE5 ADFIDQALAQKNGRVLVHCREGYSRSPTLVIAYLMMRQKMDVKSALSIV-RQNREIGPND
.:::.: ..: : ::::: : ::: :...::::.........: ..: :.. .:.::
CCDS33 ISFIDEARSKKCG-VLVHCLAGISRSVTVTVAYLMQKMNLSLNDAYDFVKRKKSNISPNF
320 330 340 350 360 370
170 180
pF1KE5 GFLAQLCQLNDRLAKEGKLKP
.:..:: ... :
CCDS33 NFMGQLLDFERTLGLSSPCDNHASSEQLYFSTPTNHNLFPLNTLEST
380 390 400 410
185 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 22:33:18 2016 done: Mon Nov 7 22:33:18 2016
Total Scan time: 1.850 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]