FASTA searches a protein or DNA sequence data bank
 version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7951, 255 aa
1>>>pF1KB7951 255 - 255 aa - 255 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.2039+/-0.000724; mu= 7.6684+/- 0.044
mean_var=142.1283+/-29.954, 0's: 0 Z-trim(114.5): 183 B-trim: 776 in 2/50
Lambda= 0.107581
statistics sampled from 14817 (15027) to 14817 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.799), E-opt: 0.2 (0.462), width: 16
Scan time: 2.750
The best scores are:                              opt  bits E(32554)
CCDS2247.2 DLX1 gene_id:1745|Hs108|chr2    ( 255) 1712 276.3 1.5e-74
CCDS47647.2 DLX6 gene_id:1750|Hs108|chr7   ( 293)  769 129.9 1.9e-30
CCDS33328.1 DLX1 gene_id:1745|Hs108|chr2   ( 129)  733 124.1 4.7e-29
CCDS11555.1 DLX4 gene_id:1748|Hs108|chr17  ( 240)  534  93.4 1.5e-19
CCDS45728.1 DLX4 gene_id:1748|Hs108|chr17  ( 168)  481  85.1 3.4e-17
CCDS5647.1 DLX5 gene_id:1749|Hs108|chr7    ( 289)  476  84.5 9e-17
CCDS2248.1 DLX2 gene_id:1746|Hs108|chr2    ( 328)  471  83.7 1.7e-16
CCDS11556.1 DLX3 gene_id:1747|Hs108|chr17  ( 287)  466  82.9 2.6e-16
>>CCDS2247.2 DLX1 gene_id:1745|Hs108|chr2 (255 aa)
initn: 1712 init1: 1712 opt: 1712 Z-score: 1452.1 bits: 276.3 E(32554): 1.5e-74
Smith-Waterman score: 1712; 100.0% identity (100.0% similar) in 255 aa overlap (1-255:1-255)
               10        20        30        40        50        60
pF1KB7 MTMTTMPESLNSPVSGKAVFMEFGPPNQQMSPSPMSHGHYSMHCLHSAGHSQPDGAYSSA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 MTMTTMPESLNSPVSGKAVFMEFGPPNQQMSPSPMSHGHYSMHCLHSAGHSQPDGAYSSA
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB7 SSFSRPLGYPYVNSVSSHASSPYISSVQSYPGSASLAQSRLEDPGADSEKSTVVEGGEVR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 SSFSRPLGYPYVNSVSSHASSPYISSVQSYPGSASLAQSRLEDPGADSEKSTVVEGGEVR
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB7 FNGKGKKIRKPRTIYSSLQLQALNRRFQQTQYLALPERAELAASLGLTQTQVKIWFQNKR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 FNGKGKKIRKPRTIYSSLQLQALNRRFQQTQYLALPERAELAASLGLTQTQVKIWFQNKR
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB7 SKFKKLMKQGGAALEGSALANGRALSAGSPPVPPGWNPNSSSGKGSGGNAGSYIPSYTSW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 SKFKKLMKQGGAALEGSALANGRALSAGSPPVPPGWNPNSSSGKGSGGNAGSYIPSYTSW
              190       200       210       220       230       240
              250
pF1KB7 YPSAHQEAMQQPQLM
       :::::::::::::::
CCDS22 YPSAHQEAMQQPQLM
              250
>>CCDS47647.2 DLX6 gene_id:1750|Hs108|chr7 (293 aa)
initn: 816 init1: 550 opt: 769 Z-score: 660.3 bits: 129.9 E(32554): 1.9e-30
Smith-Waterman score: 824; 56.4% identity (75.7% similar) in 243 aa overlap (25-255:55-293)
10 20 30 40 50
pF1KB7 MTMTTMPESLNSPVSGKAVFMEFGPPNQQMSPSPMSHGHYSMHCLHSAGHSQPD
: .:: ::. :. .:: .::::::. .
CCDS47 GQQQQQQQQQQQQQQQQQQQPPPPPPPPPQPHSQQSSPA-MAGAHYPLHCLHSAAAAAAA
30 40 50 60 70 80
60 70 80 90 100
pF1KB7 GAYSSASSFSRPLGYPYV----NSVS--SHASSPYISS------VQSYPGSASLAQSRLE
:.. . : ::. :: . : :. ::.: .::: .:.. ::.: .
CCDS47 GSHHHHHHQHHHHGSPYASGGGNSYNHRSLAAYPYMSHSQHSPYLQSYHNSSAAAQTRGD
90 100 110 120 130 140
110 120 130 140 150 160
pF1KB7 DPGADSEKSTVVEGGEVRFNGKGKKIRKPRTIYSSLQLQALNRRFQQTQYLALPERAELA
: .:..:.::.:.::.:::::::::::::::::::::::::.:::::::::::::::::
CCDS47 D--TDQQKTTVIENGEIRFNGKGKKIRKPRTIYSSLQLQALNHRFQQTQYLALPERAELA
150 160 170 180 190 200
170 180 190 200 210 220
pF1KB7 ASLGLTQTQVKIWFQNKRSKFKKLMKQGGAALEGSALANGRALSAGSPPVPPGWNPNSSS
::::::::::::::::::::::::.:::. :.. : .. ::: :: .:: :. :.:
CCDS47 ASLGLTQTQVKIWFQNKRSKFKKLLKQGSNPHESDPLQGSAALSPRSPALPPVWDV-SAS
210 220 230 240 250 260
230 240 250
pF1KB7 GKGSGGNAGSYIPSYTSWYPSAHQEAMQQPQLM
.:: . .::.:.:. :: : ::..::.::.:
CCDS47 AKGVSMPPNSYMPGYSHWYSSPHQDTMQRPQMM
270 280 290
>>CCDS33328.1 DLX1 gene_id:1745|Hs108|chr2 (129 aa)
initn: 770 init1: 725 opt: 733 Z-score: 635.1 bits: 124.1 E(32554): 4.7e-29
Smith-Waterman score: 733; 86.7% identity (92.2% similar) in 128 aa overlap (1-125:1-128)
10 20 30 40 50 60
pF1KB7 MTMTTMPESLNSPVSGKAVFMEFGPPNQQMSPSPMSHGHYSMHCLHSAGHSQPDGAYSSA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 MTMTTMPESLNSPVSGKAVFMEFGPPNQQMSPSPMSHGHYSMHCLHSAGHSQPDGAYSSA
10 20 30 40 50 60
70 80 90 100 110
pF1KB7 SSFSRPLGYPYVNSVSSHASSPYISSVQSYPGSASLAQSRLEDPGAD---SEKSTVVEGG
::::::::::::::::::::::::::::::::::::::::::::: : .. : :.
CCDS33 SSFSRPLGYPYVNSVSSHASSPYISSVQSYPGSASLAQSRLEDPGQDLVPKQAIQVQEAD
70 80 90 100 110 120
120 130 140 150 160 170
pF1KB7 EVRFNGKGKKIRKPRTIYSSLQLQALNRRFQQTQYLALPERAELAASLGLTQTQVKIWFQ
:. ..:.:
CCDS33 EAGWGGSGG
>>CCDS11555.1 DLX4 gene_id:1748|Hs108|chr17 (240 aa)
initn: 479 init1: 404 opt: 534 Z-score: 464.4 bits: 93.4 E(32554): 1.5e-19
Smith-Waterman score: 547; 43.9% identity (63.0% similar) in 262 aa overlap (3-255:1-240)
10 20 30 40 50 60
pF1KB7 MTMTTMPESLNSPVSGKAVFMEFGPPNQQMSPSPMSHGHYSMHCLHSAGHSQPDGAYSSA
::..: : . ..:::: :. ..: : . : . : : : : :
CCDS11 MTSLPCPLPGRDASKAVF-----PD--LAPVPSVAAAYPL------GLS-PTTAASPN
10 20 30 40
70 80 90 100 110
pF1KB7 SSFSRPLG----YPYVNSVSSHASSPYISSVQSYPGSASLAQSRLEDPG---ADSEKSTV
:.::: : :::.. .. : :.: : : : : : ::::: .
CCDS11 LSYSRPYGHLLSYPYTEPANPGDS--YLSCQQPAALSQPLCGPA-EHPQELEADSEKPRL
50 60 70 80 90 100
120 130 140 150 160 170
pF1KB7 V-EGGEVRFNGKGKKIRKPRTIYSSLQLQALNRRFQQTQYLALPERAELAASLGLTQTQV
: .: : .. .::.::::::::::::: ::.:::.::::::::::.:::.::::::::
CCDS11 SPEPSERRPQAPAKKLRKPRTIYSSLQLQHLNQRFQHTQYLALPERAQLAAQLGLTQTQV
110 120 130 140 150 160
180 190 200 210 220 230
pF1KB7 KIWFQNKRSKFKKLMKQGGAALEGSALANGRALSAGSPPVPPGWN-PNSSSGKGSGGNAG
::::::::::.:::.::.... ::. . ..: :::.: :. :.... ::
CCDS11 KIWFQNKRSKYKKLLKQNSGGQEGDFPGRTFSVSPCSPPLPSLWDLPKAGTLPTSG----
170 180 190 200 210
240 250
pF1KB7 SYIPSYTSWYPSAHQEAMQQPQLM
: :. .:: .... .::.:
CCDS11 -YGNSFGAWYQHHSSDVLASPQMM
220 230 240
>>CCDS45728.1 DLX4 gene_id:1748|Hs108|chr17 (168 aa)
initn: 454 init1: 404 opt: 481 Z-score: 422.1 bits: 85.1 E(32554): 3.4e-17
Smith-Waterman score: 481; 53.6% identity (76.2% similar) in 151 aa overlap (107-255:23-168)
80 90 100 110 120 130
pF1KB7 SHASSPYISSVQSYPGSASLAQSRLEDPGADSEKSTVV-EGGEVRFNGKGKKIRKPRTIY
:::: . : .: : .. .::.:::::::
CCDS45 MKLSVLPPRSLLAPYTVLCCPPDSEKPRLSPEPSERRPQAPAKKLRKPRTIY
10 20 30 40 50
140 150 160 170 180 190
pF1KB7 SSLQLQALNRRFQQTQYLALPERAELAASLGLTQTQVKIWFQNKRSKFKKLMKQGGAALE
:::::: ::.:::.::::::::::.:::.::::::::::::::::::.:::.::.... :
CCDS45 SSLQLQHLNQRFQHTQYLALPERAQLAAQLGLTQTQVKIWFQNKRSKYKKLLKQNSGGQE
60 70 80 90 100 110
200 210 220 230 240 250
pF1KB7 GSALANGRALSAGSPPVPPGWN-PNSSSGKGSGGNAGSYIPSYTSWYPSAHQEAMQQPQL
:. . ..: :::.: :. :.... :: : :. .:: .... .::.
CCDS45 GDFPGRTFSVSPCSPPLPSLWDLPKAGTLPTSG-----YGNSFGAWYQHHSSDVLASPQM
120 130 140 150 160
pF1KB7 M
:
CCDS45 M
>>CCDS5647.1 DLX5 gene_id:1749|Hs108|chr7 (289 aa)
initn: 430 init1: 389 opt: 476 Z-score: 414.6 bits: 84.5 E(32554): 9e-17
Smith-Waterman score: 501; 44.0% identity (63.1% similar) in 252 aa overlap (5-244:36-263)
10 20 30
pF1KB7 MTMTTMPESLNSPVSGKAVFMEFGPPNQQMSPSP
:.::: : ... : :. ::.
CCDS56 DRRVPSIRSGDFQAPFQTSAAMHHPSQESPTLPES--SATDSDYYSPTGGAPHGYCSPTS
10 20 30 40 50 60
40 50 60 70 80 90
pF1KB7 MSHGHYSMHCLHSAGHSQPDGAYSSASSFSRPLGYPYVNSVSSHASSPYISSVQSYPGSA
:.:. :. . : :. .::.: :: ....:. : :: ..: :
CCDS56 ASYGK----ALNPYQY-QYHGVNGSAGS------YP----AKAYADYSYASSYHQYGG--
70 80 90 100
100 110 120 130 140 150
pF1KB7 SLAQSRLEDPGADSEKSTVVEGGEVRF-NGKGKKIRKPRTIYSSLQLQALNRRFQQTQYL
: .:. :.: .. : :::. ::: ::.:::::::::.:: ::.::::.::::
CCDS56 --AYNRV--PSATNQPEKEVTEPEVRMVNGKPKKVRKPRTIYSSFQLAALQRRFQKTQYL
110 120 130 140 150 160
160 170 180 190 200 210
pF1KB7 ALPERAELAASLGLTQTQVKIWFQNKRSKFKKLMKQGGAALEGSALANGRALSAGSPPVP
:::::::::::::::::::::::::::::.::.::.: : : ... .. .:: :
CCDS56 ALPERAELAASLGLTQTQVKIWFQNKRSKIKKIMKNGEMPPEHSP-SSSDPMACNSPQSP
170 180 190 200 210 220
220 230 240 250
pF1KB7 PGWNPNSSSGKGSG-----------GNAGSYIPSYTSWYPSAHQEAMQQPQLM
:.:..:: . : . :.::. . .::: ::
CCDS56 AVWEPQGSSRSLSHHPHAHPPTSNQSPASSYLENSASWYTSAASSINSHLPPPGSLQHPL
230 240 250 260 270 280
CCDS56 ALASGTLY
>>CCDS2248.1 DLX2 gene_id:1746|Hs108|chr2 (328 aa)
initn: 426 init1: 394 opt: 471 Z-score: 409.7 bits: 83.7 E(32554): 1.7e-16
Smith-Waterman score: 471; 44.4% identity (63.8% similar) in 232 aa overlap (29-248:51-275)
10 20 30 40 50
pF1KB7 MTMTTMPESLNSPVSGKAVFMEFGPPNQQMSPS-PMS---HGHYSMHCLHSAGHSQPD
: ::. :.: . : . : :: .
CCDS22 STYHQHQQPPSGGGAGPGGNSSSSSSLHKPQESPTLPVSTATDSSYYTNQQHPAGGGGGG
30 40 50 60 70 80
60 70 80 90 100 110
pF1KB7 GA-YSSASSFSRPLGYPYVNSVSSHASSPY-ISSVQSYPGSASLAQSRLEDPGADSEKST
:. :. .:.. . .:.: :.: : .. . .: . : . : .:. . ..
CCDS22 GSPYAHMGSYQYQASG--LNNVPYSAKSSYDLGYTAAYTSYAPYGTS--SSPANNEPEKE
90 100 110 120 130
120 130 140 150 160 170
pF1KB7 VVEGGEVRF-NGKGKKIRKPRTIYSSLQLQALNRRFQQTQYLALPERAELAASLGLTQTQ
.: :.:. ::: ::.:::::::::.:: ::.::::.::::::::::::::::::::::
CCDS22 DLEP-EIRIVNGKPKKVRKPRTIYSSFQLAALQRRFQKTQYLALPERAELAASLGLTQTQ
140 150 160 170 180 190
180 190 200 210 220
pF1KB7 VKIWFQNKRSKFKKLMKQGGAALEGSALANGRALSAGSPPV--PPGWN---PNSSSGKGS
:::::::.::::::. :.: : :.. : :::: : .:. :. .: :.
CCDS22 VKIWFQNRRSKFKKMWKSGEIPSEQHPGASASPPCA-SPPVSAPASWDFGVPQRMAGGGG
200 210 220 230 240 250
230 240 250
pF1KB7 GGNAGSYIPSYTSWYPSAHQEAMQQPQLM
:..:: : : ::. :
CCDS22 PGSGGSGAGSSGS-SPSSAASAFLGNYPWYHQTSGSASHLQATAPLLHPTQTPQPHHHHH
260 270 280 290 300 310
>>CCDS11556.1 DLX3 gene_id:1747|Hs108|chr17 (287 aa)
initn: 457 init1: 388 opt: 466 Z-score: 406.3 bits: 82.9 E(32554): 2.6e-16
Smith-Waterman score: 477; 41.0% identity (60.5% similar) in 266 aa overlap (24-254:3-262)
10 20 30 40 50
pF1KB7 MTMTTMPESLNSPVSGKAVFMEFGPPNQQMSPSPMSHGHYSMHCLHSAGHSQP-------
: ....: : .. :. : :......:
CCDS11 MSGSFDRKLS-SILTDISSSLSC-HAGSKDSPTLPESSV
10 20 30
60 70 80 90
pF1KB7 -DGAYSSASSFSRPLGYPY---VNSVSSH--------ASSPYISSVQSYPGSASLAQ--S
: .: :: . . : :: :: . : :.. : . : .:: : .
CCDS11 TDLGYYSAPQHDYYSGQPYGQTVNPYTYHHQFNLNGLAGTGAYSPKSEYTYGASYRQYGA
40 50 60 70 80 90
100 110 120 130 140 150
pF1KB7 RLEDPGADSEKSTVVEG--GEVRF-NGKGKKIRKPRTIYSSLQLQALNRRFQQTQYLALP
:.: .. .: : .:::. ::: ::.::::::::: :: ::.::::..::::::
CCDS11 YREQPLPAQDPVSVKEEPEAEVRMVNGKPKKVRKPRTIYSSYQLAALQRRFQKAQYLALP
100 110 120 130 140 150
160 170 180 190 200 210
pF1KB7 ERAELAASLGLTQTQVKIWFQNKRSKFKKLMKQGGAALEGSALANGRALSAGSPPVPPGW
:::::::.::::::::::::::.:::::::.:.: . :: : :. ... .::: : :
CCDS11 ERAELAAQLGLTQTQVKIWFQNRRSKFKKLYKNGEVPLEHSP-NNSDSMACNSPPSPALW
160 170 180 190 200 210
220 230 240 250
pF1KB7 NPNSSSGKGSGGNA------GSYIPSY-----TSWYPSAHQEAMQQPQLM
. .: : . . . : ::: .::: : . .. :.:
CCDS11 DTSSHSTPAPARSQLPPPLPYSASPSYLDDPTNSWY---HAQNLSGPHLQQQPPQPATLH
220 230 240 250 260 270
CCDS11 HASPGPPPNPGAVY
280
255 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 10:15:12 2016 done: Sat Nov 5 10:15:13 2016
Total Scan time: 2.750 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]
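
The "best scores" summary lines in this report can be read programmatically. The following is a minimal Python sketch, not part of the FASTA package. It assumes each summary line has the layout seen above: accession, free-text description, library sequence length in parentheses, then the opt score, bit score and E-value. The names HIT_LINE and parse_hit are hypothetical helpers introduced here for illustration.

```python
import re

# Assumed layout of one summary line, inferred from the table in this report:
#   <accession> <description> (<length>) <opt> <bits> <E-value>
HIT_LINE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<length>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>[0-9.eE+-]+)\s*$"
)


def parse_hit(line):
    """Return a dict for one summary line, or None if the line does not match."""
    m = HIT_LINE.match(line.strip())
    if m is None:
        return None
    return {
        "acc": m.group("acc"),
        "desc": m.group("desc"),
        "length": int(m.group("length")),   # library sequence length (aa)
        "opt": int(m.group("opt")),         # optimized score
        "bits": float(m.group("bits")),     # bit score
        "evalue": float(m.group("evalue")), # expectation value
    }


if __name__ == "__main__":
    sample = "CCDS5647.1 DLX5 gene_id:1749|Hs108|chr7 ( 289) 476 84.5 9e-17"
    print(parse_hit(sample))
```

Lines that do not fit this layout, such as the table header or the ">>" alignment headers, yield None, so the function can be applied to every line of the report and the results filtered.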