FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8934, 284 aa
1>>>pF1KB8934 284 - 284 aa - 284 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.4940+/-0.000823; mu= 8.3448+/- 0.050
mean_var=225.5429+/-47.007, 0's: 0 Z-trim(115.0): 114 B-trim: 810 in 1/52
Lambda= 0.085400
statistics sampled from 15409 (15533) to 15409 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.802), E-opt: 0.2 (0.477), width: 16
Scan time: 2.760
The best scores are: opt bits E(32554)
CCDS11536.1 HOXB13 gene_id:10481|Hs108|chr17 ( 284) 1992 257.4 8.7e-69
CCDS8865.1 HOXC13 gene_id:3229|Hs108|chr12 ( 330) 848 116.5 2.6e-26
CCDS5412.1 HOXA13 gene_id:3209|Hs108|chr7 ( 388) 714 100.1 2.7e-21
CCDS2264.2 HOXD13 gene_id:3239|Hs108|chr2 ( 343) 669 94.5 1.1e-19
>>CCDS11536.1 HOXB13 gene_id:10481|Hs108|chr17 (284 aa)
initn: 1992 init1: 1992 opt: 1992 Z-score: 1348.4 bits: 257.4 E(32554): 8.7e-69
Smith-Waterman score: 1992; 100.0% identity (100.0% similar) in 284 aa overlap (1-284:1-284)
10 20 30 40 50 60
pF1KB8 MEPGNYATLDGAKDIEGLLGAGGGRNLVAHSPLTSHPAAPTLMPAVNYAPLDLPGSAEPP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MEPGNYATLDGAKDIEGLLGAGGGRNLVAHSPLTSHPAAPTLMPAVNYAPLDLPGSAEPP
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB8 KQCHPCPGVPQGTSPAPVPYGYFGGGYYSCRVSRSSLKPCAQAATLAAYPAETPTAGEEY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 KQCHPCPGVPQGTSPAPVPYGYFGGGYYSCRVSRSSLKPCAQAATLAAYPAETPTAGEEY
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB8 PSRPTEFAFYPGYPGTYQPMASYLDVSVVQTLGAPGEPRHDSLLPVDSYQSWALAGGWNS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 PSRPTEFAFYPGYPGTYQPMASYLDVSVVQTLGAPGEPRHDSLLPVDSYQSWALAGGWNS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB8 QMCCQGEQNPPGPFWKAAFADSSGQHPPDACAFRRGRKKRIPYSKGQLRELEREYAANKF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 QMCCQGEQNPPGPFWKAAFADSSGQHPPDACAFRRGRKKRIPYSKGQLRELEREYAANKF
190 200 210 220 230 240
250 260 270 280
pF1KB8 ITKDKRRKISAATSLSERQITIWFQNRRVKEKKVLAKVKNSATP
::::::::::::::::::::::::::::::::::::::::::::
CCDS11 ITKDKRRKISAATSLSERQITIWFQNRRVKEKKVLAKVKNSATP
250 260 270 280
>>CCDS8865.1 HOXC13 gene_id:3229|Hs108|chr12 (330 aa)
initn: 787 init1: 351 opt: 848 Z-score: 585.9 bits: 116.5 E(32554): 2.6e-26
Smith-Waterman score: 848; 48.2% identity (72.0% similar) in 282 aa overlap (3-279:51-323)
10 20 30
pF1KB8 MEPGNYATLDGAKDIEGLLGAGGGRNLVAHSP
::. ..:: . . :. :.:. : :
CCDS88 DSAAESGIGGGGGGGGGGTGGAGGGCSGASPGKAPSMDG---LGSSCPASHCRDLLPH-P
30 40 50 60 70
40 50 60 70 80 90
pF1KB8 LTSHPAAPTLMPAVNYAPLDLPGSAEPPKQCHPCPGVPQGTSPAPVPYGY-FGGGYYSCR
. ..: :: : . . :.: . : .:: : : .: .: : . ::: :::.::.::
CCDS88 VLGRPPAPLGAPQ-GAVYTDIP-APEAARQCAP-PPAPPTSSSATLGYGYPFGGSYYGCR
80 90 100 110 120 130
100 110 120 130 140
pF1KB8 VSRS---SLKPCAQAATLAAYPAETPT-AGEEYPSRPTEFAFYPGYPGTYQPMASYLDVS
.:.. . :::: :: . . :.. :: ::::::.. ..:: : .:::::
CCDS88 LSHNVNLQQKPCAYHPG-DKYPEPSGALPGDDLSSRAKEFAFYPSFASSYQAMPGYLDVS
140 150 160 170 180 190
150 160 170 180 190 200
pF1KB8 VVQTLGAPGEPRHDSLLPVDSYQSWALAGGWNSQMCCQGEQNPPGPFWKAAFADSSGQHP
:: ... :::::.:.::..:: :::..::.::. :. ::. . .::. : : .:
CCDS88 VVPGISGHPEPRHDALIPVEGYQHWALSNGWDSQVYCSKEQSQSAHLWKSPFPDVVPLQP
200 210 220 230 240 250
210 220 230 240 250 260
pF1KB8 PDACAFRRGRKKRIPYSKGQLRELEREYAANKFITKDKRRKISAATSLSERQITIWFQNR
.. ..:::::::.::.: ::.:::.::::.:::::.:::.:::.:.:::::.:::::::
CCDS88 -EVSSYRRGRKKRVPYTKVQLKELEKEYAASKFITKEKRRRISATTNLSERQVTIWFQNR
260 270 280 290 300 310
270 280
pF1KB8 RVKEKKVLAKVKNSATP
:::::::..: :
CCDS88 RVKEKKVVSKSKAPHLHST
320 330
>>CCDS5412.1 HOXA13 gene_id:3209|Hs108|chr7 (388 aa)
initn: 868 init1: 457 opt: 714 Z-score: 495.9 bits: 100.1 E(32554): 2.7e-21
Smith-Waterman score: 871; 57.3% identity (75.2% similar) in 246 aa overlap (54-282:144-388)
30 40 50 60 70 80
pF1KB8 GRNLVAHSPLTSHPAAPTLMPAVNYAPLDLPGSAEPPKQCHPCPGVPQGTS-PAPVPYGY
:..:: ::: :: .. :..: :: .::::
CCDS54 PSAAAAAAAAAAAAAAAAAASSSGGPGPAGPAGAEAAKQCSPCSAAAQSSSGPAALPYGY
120 130 140 150 160 170
90 100 110 120 130
pF1KB8 FGGGYYSC-RVSR--SSLKPCAQAATLAAYPAETP----TAG---EEYPSRPTEFAFY-P
::.::: : :.. ...: ::: :. :: : . ::: ::. :: :::::
CCDS54 FGSGYYPCARMGPHPNAIKSCAQPASAAAAAAFADKYMDTAGPAAEEFSSRAKEFAFYHQ
180 190 200 210 220 230
140 150 160 170 180
pF1KB8 GYP-GTY---QPMASYLDVSVVQTLGAPGEPRHDSL-LPVDSYQSWALAGGWNSQMCCQG
:: : : ::: .:::. :: ::.::: ::. : ::..::: ::: .:::.:: :
CCDS54 GYAAGPYHHHQPMPGYLDMPVVPGLGGPGESRHEPLGLPMESYQPWALPNGWNGQMYCPK
240 250 260 270 280 290
190 200 210 220 230 240
pF1KB8 EQNPPGPFWKAAFADSSGQHPPDACAFRRGRKKRIPYSKGQLRELEREYAANKFITKDKR
:: : .::... : . :: :: ..:::::::.::.: ::.:::::::.:::::::::
CCDS54 EQAQPPHLWKSTLPDVVS-HPSDASSYRRGRKKRVPYTKVQLKELEREYATNKFITKDKR
300 310 320 330 340 350
250 260 270 280
pF1KB8 RKISAATSLSERQITIWFQNRRVKEKKVLAKVKNSATP
:.:::.:.:::::.::::::::::::::. :.:...
CCDS54 RRISATTNLSERQVTIWFQNRRVKEKKVINKLKTTS
360 370 380
>>CCDS2264.2 HOXD13 gene_id:3239|Hs108|chr2 (343 aa)
initn: 740 init1: 568 opt: 669 Z-score: 466.5 bits: 94.5 E(32554): 1.1e-19
Smith-Waterman score: 749; 47.8% identity (74.1% similar) in 247 aa overlap (57-283:101-343)
30 40 50 60 70 80
pF1KB8 LVAHSPLTSHPAAPTLMPAVNYAPLDLPGSAEPPKQCH-PCPGVPQGTSP-AP-VPYGY-
: : :.: : :.. .. : :: . :::
CCDS22 ASGFAYPGTSERTGSSSSSSSSAVVAARPEAPPAKECPAPTPAAAAAAPPSAPALGYGYH
80 90 100 110 120 130
90 100 110 120
pF1KB8 FGGGYYSCRVS------RSSLKPCAQAATLAAYPAE----------TPTAGEEYPSRPTE
::.::::::.: ...:: .: .:...:.: . . ..: :.: :
CCDS22 FGNGYYSCRMSHGVGLQQNALKSSPHA-SLGGFPVEKYMDVSGLASSSVPANEVPARAKE
140 150 160 170 180
130 140 150 160 170 180
pF1KB8 FAFYPGYPGTYQPMASYLDVSVVQTLGAPGEPRHDSLLPVDSYQSWALAGGWNSQMCCQG
.:: :: . :: . .:.: .:.:.:. :::::.. . ...::::.::.:::::. :
CCDS22 VSFYQGYTSPYQHVPGYID--MVSTFGS-GEPRHEAYISMEGYQSWTLANGWNSQVYCTK
190 200 210 220 230 240
190 200 210 220 230 240
pF1KB8 EQNPPGPFWKAAFADSSGQHPPDACAFRRGRKKRIPYSKGQLRELEREYAANKFITKDKR
.: . :::..: . . . :: :..:::::::.::.: ::.::: ::: ::::.::::
CCDS22 DQPQGSHFWKSSFPGDVALNQPDMCVYRRGRKKRVPYTKLQLKELENEYAINKFINKDKR
250 260 270 280 290 300
250 260 270 280
pF1KB8 RKISAATSLSERQITIWFQNRRVKEKKVLAKVKNSATP
:.:::::.:::::.::::::::::.::...:.:....
CCDS22 RRISAATNLSERQVTIWFQNRRVKDKKIVSKLKDTVS
310 320 330 340
284 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 16:34:32 2016 done: Fri Nov 4 16:34:33 2016
Total Scan time: 2.760 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]