FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9701, 320 aa
1>>>pF1KB9701 320 - 320 aa - 320 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 10.7634+/-0.00104; mu= -3.8367+/- 0.063
mean_var=492.2584+/-101.129, 0's: 0 Z-trim(118.4): 105 B-trim: 186 in 1/51
Lambda= 0.057807
statistics sampled from 19244 (19353) to 19244 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.846), E-opt: 0.2 (0.594), width: 16
Scan time: 3.300
The best scores are: opt bits E(32554)
CCDS5405.1 HOXA4 gene_id:3201|Hs108|chr7 ( 320) 2264 202.2 4.6e-52
CCDS11529.1 HOXB4 gene_id:3214|Hs108|chr17 ( 251) 756 76.3 2.8e-14
CCDS2269.1 HOXD4 gene_id:3233|Hs108|chr2 ( 255) 747 75.5 4.8e-14
CCDS8873.1 HOXC4 gene_id:3221|Hs108|chr12 ( 264) 705 72.1 5.6e-13
>>CCDS5405.1 HOXA4 gene_id:3201|Hs108|chr7 (320 aa)
initn: 2264 init1: 2264 opt: 2264 Z-score: 1048.2 bits: 202.2 E(32554): 4.6e-52
Smith-Waterman score: 2264; 99.4% identity (99.4% similar) in 320 aa overlap (1-320:1-320)
10 20 30 40 50 60
pF1KB9 MTMSSFLINSNYIEPKFPPFEEYAQHSGSGGADGGPGGGPGYQQPPAPPTQHLPLQQPQL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MTMSSFLINSNYIEPKFPPFEEYAQHSGSGGADGGPGGGPGYQQPPAPPTQHLPLQQPQL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 PHAGGGREPPASYYAPRTAREPAYPAAALYPAHGAADTAYPYGYRGGASPGRPPQPEQPP
::::::::: ::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PHAGGGREPTASYYAPRTAREPAYPAAALYPAHGAADTAYPYGYRGGASPGRPPQPEQPP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 AQAKGPAHGLHASHVLQPQPPPPLQPRAVPPAAPRRCEAAPATPGVPAGGSAPACPLLLA
::::::::::::::::::: ::::::::::::::::::::::::::::::::::::::::
CCDS54 AQAKGPAHGLHASHVLQPQLPPPLQPRAVPPAAPRRCEAAPATPGVPAGGSAPACPLLLA
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB9 DKSPLGLKGKEPVVYPWMKKIHVSAVNPSYNGGEPKRSRTAYTRQQVLELEKEFHFNRYL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 DKSPLGLKGKEPVVYPWMKKIHVSAVNPSYNGGEPKRSRTAYTRQQVLELEKEFHFNRYL
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB9 TRRRRIEIAHTLCLSERQVKIWFQNRRMKWKKDHKLPNTKMRSSNSASASAGPPGKAQTQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 TRRRRIEIAHTLCLSERQVKIWFQNRRMKWKKDHKLPNTKMRSSNSASASAGPPGKAQTQ
250 260 270 280 290 300
310 320
pF1KB9 SPHLHPHPHPSTSTPVPSSI
::::::::::::::::::::
CCDS54 SPHLHPHPHPSTSTPVPSSI
310 320
>>CCDS11529.1 HOXB4 gene_id:3214|Hs108|chr17 (251 aa)
initn: 904 init1: 691 opt: 756 Z-score: 369.7 bits: 76.3 E(32554): 2.8e-14
Smith-Waterman score: 868; 49.7% identity (63.9% similar) in 296 aa overlap (1-296:1-243)
10 20 30 40 50 60
pF1KB9 MTMSSFLINSNYIEPKFPPFEEYAQHSGSGGADGGPGGGPGYQQPPAPPTQHLPLQQPQL
:.::::::::::..::::: :::.: : .: .:: : :. .. .:
CCDS11 MAMSSFLINSNYVDPKFPPCEEYSQ-SDYLPSDHSPGYYAGGQR------RESSFQ----
10 20 30 40
70 80 90 100 110 120
pF1KB9 PHAGGGREPPASYYAPRTAREPAYPAAALYPAHGAADTAYPYGYRGGASPGRPPQPEQPP
:.:: ::. . . :.:. : : :: : ::
CCDS11 PEAGFGRRAACTVQRYAACRDPGPP------------------------PPPPPPPPPPP
50 60 70 80
130 140 150 160 170 180
pF1KB9 AQAKGPAHGLHASHVLQPQPPPPLQPRAVPPAAPRRCEAAPATPGVPAGGSAPACPLLLA
. .: : ::: :. : .::::. ..: : .. : :
CCDS11 PPGLSPR---------APAPPPA---GALLPEPGQRCEAVSSSPPPPPCAQNPLHP----
90 100 110 120
190 200 210 220 230 240
pF1KB9 DKSPLGLKGKEPVVYPWMKKIHVSAVNPSYNGGEPKRSRTAYTRQQVLELEKEFHFNRYL
:: :::::::::.:.:::.:::.: ::::::::::::::::::::::::.::::
CCDS11 --SPSHSACKEPVVYPWMRKVHVSTVNPNYAGGEPKRSRTAYTRQQVLELEKEFHYNRYL
130 140 150 160 170 180
250 260 270 280 290 300
pF1KB9 TRRRRIEIAHTLCLSERQVKIWFQNRRMKWKKDHKLPNTKMRSSNSASASAGPPGKAQTQ
:::::.::::.:::::::.:::::::::::::::::::::.::...:....::::.
CCDS11 TRRRRVEIAHALCLSERQIKIWFQNRRMKWKKDHKLPNTKIRSGGAAGSAGGPPGRPNGG
190 200 210 220 230 240
310 320
pF1KB9 SPHLHPHPHPSTSTPVPSSI
CCDS11 PRAL
250
>>CCDS2269.1 HOXD4 gene_id:3233|Hs108|chr2 (255 aa)
initn: 815 init1: 622 opt: 747 Z-score: 365.6 bits: 75.5 E(32554): 4.8e-14
Smith-Waterman score: 803; 49.4% identity (61.7% similar) in 308 aa overlap (1-306:1-243)
10 20 30 40 50 60
pF1KB9 MTMSSFLINSNYIEPKFPPFEEYAQHSGSGGADGGPGGGPGYQQPPAPPTQHLPLQQPQL
:.:::...::.:..::::: ::: : .: : .:. : : :
CCDS22 MVMSSYMVNSKYVDPKFPPCEEYLQ-GGYLGEQGADYYGGGAQ-----------------
10 20 30 40
70 80 90 100 110 120
pF1KB9 PHAGGGREPPASYYAPRTAREPAYPAAALYPAHGAADTAYPYGYRGGASPGRPPQPEQPP
:. .::. : :: : : :.: :..:: : :
CCDS22 ---GADFQPPGLY--PR-------------PDFGE----QPFG---GSGPG--PGSALP-
50 60 70
130 140 150 160 170
pF1KB9 AQAKGPAHGLHASHVLQPQPPPPLQPRAVPPAAPRRCEAAPATPGVPA-GGSAPACPLLL
:...: : ..: : : : : .:: :: ::. : . : : :
CCDS22 ARGHGQEPGGPGGHYAAPGEPCPAPP--APPPAP--------LPGARAYSQSDPKQP---
80 90 100 110 120
180 190 200 210 220 230
pF1KB9 ADKSPLGLKGKEP-VVYPWMKKIHVSAVNPSYNGGEPKRSRTAYTRQQVLELEKEFHFNR
: : :.: ::::::::.::..:::.:.:::::::::::::::::::::::::::
CCDS22 ----PSGTALKQPAVVYPWMKKVHVNSVNPNYTGGEPKRSRTAYTRQQVLELEKEFHFNR
130 140 150 160 170
240 250 260 270 280 290
pF1KB9 YLTRRRRIEIAHTLCLSERQVKIWFQNRRMKWKKDHKLPNTKMRSSNSASASAGPPGKAQ
::::::::::::::::::::.::::::::::::::::::::: :::.:.:.:. . :
CCDS22 YLTRRRRIEIAHTLCLSERQIKIWFQNRRMKWKKDHKLPNTKGRSSSSSSSSSCSSSVAP
180 190 200 210 220 230
300 310 320
pF1KB9 TQSPHLHPHPHPSTSTPVPSSI
.: ::.:
CCDS22 SQ--HLQPMAKDHHTDLTTL
240 250
>>CCDS8873.1 HOXC4 gene_id:3221|Hs108|chr12 (264 aa)
initn: 853 init1: 659 opt: 705 Z-score: 346.5 bits: 72.1 E(32554): 5.6e-13
Smith-Waterman score: 758; 46.0% identity (59.8% similar) in 311 aa overlap (1-306:1-252)
10 20 30 40 50 60
pF1KB9 MTMSSFLINSNYIEPKFPPFEEYAQHSGSGGADGGPGGGPGYQQPPAPPTQHLPLQQPQL
: :::.:..::::.::::: :::.:.: : : . .. .
CCDS88 MIMSSYLMDSNYIDPKFPPCEEYSQNS---------------YIPEHSPEYYGRTRESGF
10 20 30 40
70 80 90 100 110 120
pF1KB9 PHAGGGREPPASYYAPRTAREPAYPAAALYPAHGAADTAYPYGYRGGASPGRPPQPEQPP
: :: :: :.:: .. .. : . :: . :
CCDS88 QHHHQELYPPPP---PR----PSYPERQ----YSCTSLQGPGNSRG-----------HGP
50 60 70 80
130 140 150 160 170 180
pF1KB9 AQAKGPAHGLHASHVLQPQPPPPLQPRAVPPAAPRRCEAAPATPGVPAGGSAPACPLLLA
::: : : ... . .: : .: :.:. :: :::
CCDS88 AQA-GHHHPEKSQSLCEPAP----------------LSGASASPS-PA---PPACSQPAP
90 100 110 120
190 200 210 220 230 240
pF1KB9 DKSPLGLKGKEPVVYPWMKKIHVSAVNPSYNGGEPKRSRTAYTRQQVLELEKEFHFNRYL
:. : . .:.:.:::::::::::.:::.::::::::::::::::::::::::::.::::
CCDS88 DH-PSSAASKQPIVYPWMKKIHVSTVNPNYNGGEPKRSRTAYTRQQVLELEKEFHYNRYL
130 140 150 160 170 180
250 260 270 280 290
pF1KB9 TRRRRIEIAHTLCLSERQVKIWFQNRRMKWKKDHKLPNTKMRSSNSASA-----SAGPPG
::::::::::.:::::::.:::::::::::::::.:::::.::. :.: ::. ::
CCDS88 TRRRRIEIAHSLCLSERQIKIWFQNRRMKWKKDHRLPNTKVRSAPPAGAAPSTLSAATPG
190 200 210 220 230 240
300 310 320
pF1KB9 KAQTQSPHLHPHPHPSTSTPVPSSI
.. .: :
CCDS88 TSEDHSQSATPPEQQRAEDITRL
250 260
320 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 23:25:43 2016 done: Fri Nov 4 23:25:43 2016
Total Scan time: 3.300 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]