FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8897, 224 aa
1>>>pF1KB8897 224 - 224 aa - 224 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.7534+/-0.000747; mu= 9.7909+/- 0.046
mean_var=155.9547+/-31.794, 0's: 0 Z-trim(115.1): 156 B-trim: 0 in 0/52
Lambda= 0.102701
statistics sampled from 15465 (15635) to 15465 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.814), E-opt: 0.2 (0.48), width: 16
Scan time: 2.510
The best scores are:                                opt  bits E(32554)
CCDS11531.1 HOXB6 gene_id:3216|Hs108|chr17  ( 224) 1567 242.8 1.3e-64
CCDS8871.1 HOXC6 gene_id:3223|Hs108|chr12   ( 235)  632 104.3 6.9e-23
CCDS5407.1 HOXA6 gene_id:3203|Hs108|chr7    ( 233)  589  97.9 5.6e-21
CCDS41792.1 HOXC6 gene_id:3223|Hs108|chr12  ( 153)  497  84.1 5.3e-17
CCDS2269.1 HOXD4 gene_id:3233|Hs108|chr2    ( 255)  484  82.4 2.9e-16
CCDS5406.1 HOXA5 gene_id:3202|Hs108|chr7    ( 270)  480  81.8 4.5e-16
CCDS8873.1 HOXC4 gene_id:3221|Hs108|chr12   ( 264)  474  80.9 8.3e-16
CCDS11532.1 HOXB7 gene_id:3217|Hs108|chr17  ( 217)  472  80.6 8.9e-16
CCDS11530.1 HOXB5 gene_id:3215|Hs108|chr17  ( 269)  460  78.9 3.5e-15
CCDS11529.1 HOXB4 gene_id:3214|Hs108|chr17  ( 251)  457  78.4 4.6e-15
CCDS8870.1 HOXC8 gene_id:3224|Hs108|chr12   ( 242)  452  77.6 7.5e-15
CCDS11533.1 HOXB8 gene_id:3218|Hs108|chr17  ( 243)  439  75.7 2.8e-14
CCDS5408.1 HOXA7 gene_id:3204|Hs108|chr7    ( 230)  436  75.3 3.7e-14
CCDS5405.1 HOXA4 gene_id:3201|Hs108|chr7    ( 320)  437  75.5 4.2e-14
CCDS8872.1 HOXC5 gene_id:3222|Hs108|chr12   ( 222)  422  73.2 1.5e-13
CCDS2268.1 HOXD8 gene_id:3234|Hs108|chr2    ( 290)  396  69.4 2.7e-12
CCDS56148.1 HOXD8 gene_id:3234|Hs108|chr2   ( 289)  394  69.1 3.3e-12
CCDS56149.1 HOXD8 gene_id:3234|Hs108|chr2   ( 106)  365  64.4 3.2e-11
>>CCDS11531.1 HOXB6 gene_id:3216|Hs108|chr17 (224 aa)
initn: 1567 init1: 1567 opt: 1567 Z-score: 1273.4 bits: 242.8 E(32554): 1.3e-64
Smith-Waterman score: 1567; 100.0% identity (100.0% similar) in 224 aa overlap (1-224:1-224)
               10        20        30        40        50        60
pF1KB8 MSSYFVNSTFPVTLASGQESFLGQLPLYSSGYADPLRHYPAPYGPGPGQDKGFATSSYYP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MSSYFVNSTFPVTLASGQESFLGQLPLYSSGYADPLRHYPAPYGPGPGQDKGFATSSYYP
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB8 PAGGGYGRAAPCDYGPAPAFYREKESACALSGADEQPPFHPEPRKSDCAQDKSVFGETEE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 PAGGGYGRAAPCDYGPAPAFYREKESACALSGADEQPPFHPEPRKSDCAQDKSVFGETEE
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB8 QKCSTPVYPWMQRMNSCNSSSFGPSGRRGRQTYTRYQTLELEKEFHYNRYLTRRRRIEIA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 QKCSTPVYPWMQRMNSCNSSSFGPSGRRGRQTYTRYQTLELEKEFHYNRYLTRRRRIEIA
              130       140       150       160       170       180
              190       200       210       220
pF1KB8 HALCLTERQIKIWFQNRRMKWKKESKLLSASQLSAEEEEEKQAE
       ::::::::::::::::::::::::::::::::::::::::::::
CCDS11 HALCLTERQIKIWFQNRRMKWKKESKLLSASQLSAEEEEEKQAE
              190       200       210       220
>>CCDS8871.1 HOXC6 gene_id:3223|Hs108|chr12 (235 aa)
initn: 607 init1: 486 opt: 632 Z-score: 524.4 bits: 104.3 E(32554): 6.9e-23
Smith-Waterman score: 635; 48.3% identity (72.7% similar) in 238 aa overlap (1-224:1-229)
10 20 30 40 50 60
pF1KB8 MSSYFVNSTFPVTLASGQESFLGQLPLYSSGYADPLRHYPAPYGPGPGQDKGFATSSYYP
:.:::.: .. ::.::. : .. : :..: ::.::. . :: . .:.. ..: : :
CCDS88 MNSYFTNPSLSCHLAGGQD-VLPNVALNSTAY-DPVRHF-STYGAAVAQNRIYSTPFYSP
10 20 30 40 50
70 80 90 100 110
pF1KB8 PAGGGYGRA-APCDYGPAPAFYREKESACALSGADEQPPFHPEPRKSDCAQD-KSVFGET
. .. . .: ::: . .::.::. ::. .. : ... ::: .: :.:
CCDS88 QENVVFSSSRGPYDYG-SNSFYQEKD---MLSNCRQNTLGHN--TQTSIAQDFSSEQGRT
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB8 --EEQKCSTPVYPWMQRMNSCNSSSFGPSGRRGRQTYTRYQTLELEKEFHYNRYLTRRRR
..:: : .::::::::: .. ..: . ::::: :.::::::::::::.:::::::::
CCDS88 APQDQKASIQIYPWMQRMNSHSGVGYGADRRRGRQIYSRYQTLELEKEFHFNRYLTRRRR
120 130 140 150 160 170
180 190 200 210 220
pF1KB8 IEIAHALCLTERQIKIWFQNRRMKWKKESKLLS----------ASQLSAEEEEEKQAE
::::.::::::::::::::::::::::::.: : :..:...::.....:
CCDS88 IEIANALCLTERQIKIWFQNRRMKWKKESNLTSTLSGGGGGATADSLGGKEEKREETEEE
180 190 200 210 220 230
CCDS88 KQKE
>>CCDS5407.1 HOXA6 gene_id:3203|Hs108|chr7 (233 aa)
initn: 780 init1: 551 opt: 589 Z-score: 490.0 bits: 97.9 E(32554): 5.6e-21
Smith-Waterman score: 815; 57.2% identity (75.8% similar) in 236 aa overlap (1-224:1-233)
10 20 30 40 50 60
pF1KB8 MSSYFVNSTFPVTLASGQESFLGQLPLYSSGYADPLRHYPAPYGPGPGQDKGFATSSYYP
::::::: ::: .: :::.:::::::::..:: : :: .:: :: . :: ... .:
CCDS54 MSSYFVNPTFPGSLPSGQDSFLGQLPLYQAGY-DALRPFPASYGASSLPDKTYTSPCFYQ
10 20 30 40 50
70 80 90 100
pF1KB8 PAGG--GYGRAAPCDYGPAPAFYREKE-SACALSGADEQ-PP-----FHPEPR-KSDCA-
... . .::. .:: : :: .:. :. . ::. .: : : :: . : : .
CCDS54 QSNSVLACNRASY-EYG-ASCFYSDKDLSGASPSGSGKQRGPGDYLHFSPEQQYKPDSSS
60 70 80 90 100 110
110 120 130 140 150 160
pF1KB8 -QDKSVFGETEEQKCSTPVYPWMQRMNSCNSSSFGPSGRRGRQTYTRYQTLELEKEFHYN
: :.. : ..: ..:::::::::::: .. .: :::::::::::::::::::::.:
CCDS54 GQGKALHDEGADRKYTSPVYPWMQRMNSCAGAVYGSHGRRGRQTYTRYQTLELEKEFHFN
120 130 140 150 160 170
170 180 190 200 210 220
pF1KB8 RYLTRRRRIEIAHALCLTERQIKIWFQNRRMKWKKESKLLSASQLSAEEEEEKQAE
::::::::::::.:::::::::::::::::::::::.::....: :.:. : : .:
CCDS54 RYLTRRRRIEIANALCLTERQIKIWFQNRRMKWKKENKLINSTQPSGEDSEAKAGE
180 190 200 210 220 230
>>CCDS41792.1 HOXC6 gene_id:3223|Hs108|chr12 (153 aa)
initn: 531 init1: 486 opt: 497 Z-score: 418.7 bits: 84.1 E(32554): 5.3e-17
Smith-Waterman score: 500; 60.9% identity (81.2% similar) in 133 aa overlap (105-224:15-147)
80 90 100 110 120 130
pF1KB8 GPAPAFYREKESACALSGADEQPPFHPEPRKSDCAQD-KSVFGET--EEQKCSTPVYPWM
... ::: .: :.: ..:: : .::::
CCDS41 MLSNCRQNTLGHNTQTSIAQDFSSEQGRTAPQDQKASIQIYPWM
10 20 30 40
140 150 160 170 180 190
pF1KB8 QRMNSCNSSSFGPSGRRGRQTYTRYQTLELEKEFHYNRYLTRRRRIEIAHALCLTERQIK
::::: .. ..: . ::::: :.::::::::::::.:::::::::::::.::::::::::
CCDS41 QRMNSHSGVGYGADRRRGRQIYSRYQTLELEKEFHFNRYLTRRRRIEIANALCLTERQIK
50 60 70 80 90 100
200 210 220
pF1KB8 IWFQNRRMKWKKESKLLS----------ASQLSAEEEEEKQAE
::::::::::::::.: : :..:...::.....:
CCDS41 IWFQNRRMKWKKESNLTSTLSGGGGGATADSLGGKEEKREETEEEKQKE
110 120 130 140 150
>>CCDS2269.1 HOXD4 gene_id:3233|Hs108|chr2 (255 aa)
initn: 440 init1: 371 opt: 484 Z-score: 405.5 bits: 82.4 E(32554): 2.9e-16
Smith-Waterman score: 484; 44.7% identity (58.0% similar) in 226 aa overlap (1-207:3-215)
10 20 30 40
pF1KB8 MSSYFVNST-----FPVTLASGQESFLGQL--PLYSSGY--AD--PLRHYPAP-YGPG
::::.::: :: : ..::. :..: :: : :: : .:
CCDS22 MVMSSYMVNSKYVDPKFPPCEEYLQGGYLGEQGADYYGGGAQGADFQPPGLYPRPDFGEQ
10 20 30 40 50 60
50 60 70 80 90 100
pF1KB8 PGQDKGFATSSYYPPAGGGYGRAAPCDYGPAPAFYREKESACALSGADEQPPFHPEP---
: .: . .: : : : ..: . ::. : : :: : :
CCDS22 PFGGSGPGPGSALPARGHGQEPGGPGGHYAAPG------EPCP---APPAPPPAPLPGAR
70 80 90 100 110
110 120 130 140 150
pF1KB8 --RKSDCAQDKSVFGETEEQKCSTPVYPWMQRM--NSCNSSSFGPSGRRGRQTYTRYQTL
.:: : : : . .: . :::::... :: : . : .:.: .::: :.:
CCDS22 AYSQSDPKQPPS--GTALKQP--AVVYPWMKKVHVNSVNPNYTGGEPKRSRTAYTRQQVL
120 130 140 150 160
160 170 180 190 200 210
pF1KB8 ELEKEFHYNRYLTRRRRIEIAHALCLTERQIKIWFQNRRMKWKKESKLLSASQLSAEEEE
:::::::.::::::::::::::.:::.:::::::::::::::::. ::
CCDS22 ELEKEFHFNRYLTRRRRIEIAHTLCLSERQIKIWFQNRRMKWKKDHKLPNTKGRSSSSSS
170 180 190 200 210 220
220
pF1KB8 EKQAE
CCDS22 SSSCSSSVAPSQHLQPMAKDHHTDLTTL
230 240 250
>>CCDS5406.1 HOXA5 gene_id:3202|Hs108|chr7 (270 aa)
initn: 528 init1: 448 opt: 480 Z-score: 402.0 bits: 81.8 E(32554): 4.5e-16
Smith-Waterman score: 482; 45.5% identity (67.4% similar) in 187 aa overlap (30-215:95-264)
10 20 30 40 50
pF1KB8 MSSYFVNSTFPVTLASGQESFLGQLPLYSSGYADPLRHYP-APYGPGPGQDKGFATSSY
: ::: : . .:.::.:. . ..
CCDS54 GSGERARSYAASASAAPAEPRYSQPATSTHSPQPDPL---PCSAVAPSPGSDSHHGGKNS
70 80 90 100 110 120
60 70 80 90 100 110
pF1KB8 YPPAGGGYGRAAPCDYGPAPAFYREKESACALSGADEQPPFHPEPRKSDCAQDKSVFGET
..: : : : . .:.. . :::.:. : : ... .. : .
CCDS54 LSNSSG-----ASADAGST--HISSREGVGTASGAEEDAPASSE--QASAQSEPSPAPPA
130 140 150 160 170
120 130 140 150 160 170
pF1KB8 EEQKCSTPVYPWMQRMNSCNSSSFGPSGRRGRQTYTRYQTLELEKEFHYNRYLTRRRRIE
. : .::::.... ... :: :.:.: .::::::::::::::.:::::::::::
CCDS54 QPQ-----IYPWMRKLHISHDNIGGPEGKRARTAYTRYQTLELEKEFHFNRYLTRRRRIE
180 190 200 210 220
180 190 200 210 220
pF1KB8 IAHALCLTERQIKIWFQNRRMKWKKESKLLSASQLSAEEEEEKQAE
:::::::.:::::::::::::::::..:: : :. .:
CCDS54 IAHALCLSERQIKIWFQNRRMKWKKDNKLKSMSMAAAGGAFRP
230 240 250 260 270
>>CCDS8873.1 HOXC4 gene_id:3221|Hs108|chr12 (264 aa)
initn: 388 init1: 356 opt: 474 Z-score: 397.3 bits: 80.9 E(32554): 8.3e-16
Smith-Waterman score: 474; 41.1% identity (61.5% similar) in 231 aa overlap (1-215:8-225)
10 20 30 40
pF1KB8 MSSYFVNSTFPVTLASGQESFLGQL-PLY-----SSGYADPLRH-YPAPYGPG
:.: ... :: .:.:.. . : : ::. .. :: : :
CCDS88 MIMSSYLMDSNYIDPKFPPCEEYSQNSYIPEHSPEYYGRTRESGFQHHHQELYPPP-PPR
10 20 30 40 50
50 60 70 80 90
pF1KB8 PGQ-DKGFATSSYYPPAGGGYGRAAPCDYGPAPAFYREKE---SACA---LSGADEQPPF
:. .. .. .: : :.. : .::: : ... : : : ::::. .:
CCDS88 PSYPERQYSCTSLQGP-GNSRG------HGPAQAGHHHPEKSQSLCEPAPLSGASASPS-
60 70 80 90 100 110
100 110 120 130 140 150
pF1KB8 HPEPRKSDCAQDKSVFGETEEQKCSTPVYPWMQRMN--SCNSSSFGPSGRRGRQTYTRYQ
: : :.: . .: . :::::.... . : . : .:.: .::: :
CCDS88 -PAP--PACSQPAPDHPSSAASK-QPIVYPWMKKIHVSTVNPNYNGGEPKRSRTAYTRQQ
120 130 140 150 160
160 170 180 190 200 210
pF1KB8 TLELEKEFHYNRYLTRRRRIEIAHALCLTERQIKIWFQNRRMKWKKESKLLSASQLSAEE
.:::::::::::::::::::::::.:::.:::::::::::::::::. .: ... ::
CCDS88 VLELEKEFHYNRYLTRRRRIEIAHSLCLSERQIKIWFQNRRMKWKKDHRLPNTKVRSAPP
170 180 190 200 210 220
220
pF1KB8 EEEKQAE
CCDS88 AGAAPSTLSAATPGTSEDHSQSATPPEQQRAEDITRL
230 240 250 260
>>CCDS11532.1 HOXB7 gene_id:3217|Hs108|chr17 (217 aa)
initn: 498 init1: 393 opt: 472 Z-score: 396.7 bits: 80.6 E(32554): 8.9e-16
Smith-Waterman score: 506; 45.1% identity (63.4% similar) in 235 aa overlap (1-222:1-216)
10 20 30 40 50
pF1KB8 MSS-YFVNSTFPVTLASGQESFLGQLPLYSS-GYA-DPLRHYPAPYGPGPGQDKGFATSS
::: :..:. : ::.. : .: .: ..: .: : :. :: : : . . . ..
CCDS11 MSSLYYANTLFSKYPASSSVFATGAFPEQTSCAFASNPQR--PG-YGAGSGASFAASMQG
10 20 30 40 50
60 70 80 90 100 110
pF1KB8 YYPPAGGGYGRAAPCDYGPAPAFYREKESACALSGADEQPPFHPEPRKSDCAQDKSVFGE
:: .:: :..: : : : . :. . : ::. . .. : :.. .
CCDS11 LYPGGGGMAGQSAA---GVYAAGYGLEPSSFNMHCA----PFE-QNLSGVCPGDSAKAAG
60 70 80 90 100
120 130 140 150 160 170
pF1KB8 TEEQKCST-------PVYPWMQRMNSCNSSSFGPSGRRGRQTYTRYQTLELEKEFHYNRY
..::. : .::::. : : . .:::::::::::::::::::::::
CCDS11 AKEQRDSDLAAESNFRIYPWMR--------SSGTDRKRGRQTYTRYQTLELEKEFHYNRY
110 120 130 140 150 160
180 190 200 210 220
pF1KB8 LTRRRRIEIAHALCLTERQIKIWFQNRRMKWKKESKLL---SASQLSAEEEEEKQAE
:::::::::::.::::::::::::::::::::::.: ...: :: :::..
CCDS11 LTRRRRIEIAHTLCLTERQIKIWFQNRRMKWKKENKTAGPGTTGQDRAEAEEEEEE
170 180 190 200 210
>>CCDS11530.1 HOXB5 gene_id:3215|Hs108|chr17 (269 aa)
initn: 507 init1: 411 opt: 460 Z-score: 386.0 bits: 78.9 E(32554): 3.5e-15
Smith-Waterman score: 461; 45.5% identity (63.5% similar) in 189 aa overlap (45-215:77-263)
20 30 40 50 60
pF1KB8 ASGQESFLGQLPLYSSGYADPLRHYPAPYGPGPGQDKGF--ATSS-------YYPPAGGG
:.:.:. : :.:: : ..:
CCDS11 YNYNGMDLSVNRSSASSSHFGAVGESSRAFPAPAQEPRFRQAASSCSLSSPESLPCTNGD
50 60 70 80 90 100
70 80 90 100 110
pF1KB8 YGRAAPCDYGPAP--------AFYREKESACALSGADEQPPFHPEPRKSDCAQDKSVFGE
: : .:. : . : . : : : .: : . :: . .
CCDS11 SHGAKPSASSPSDQATSASSSANFTEIDEASASSEPEEAASQLSSPSLAR-AQPEPMATS
110 120 130 140 150 160
120 130 140 150 160 170
pF1KB8 TEEQKCSTP-VYPWMQRMNSCNSSSFGPSGRRGRQTYTRYQTLELEKEFHYNRYLTRRRR
: . .:: ..:::.... .. . ::.:.:.: .::::::::::::::.:::::::::
CCDS11 TAAPEGQTPQIFPWMRKLHISHDMT-GPDGKRARTAYTRYQTLELEKEFHFNRYLTRRRR
170 180 190 200 210 220
180 190 200 210 220
pF1KB8 IEIAHALCLTERQIKIWFQNRRMKWKKESKLLSASQLSAEEEEEKQAE
:::::::::.:::::::::::::::::..:: : : .:
CCDS11 IEIAHALCLSERQIKIWFQNRRMKWKKDNKLKSMSLATAGSAFQP
230 240 250 260
>>CCDS11529.1 HOXB4 gene_id:3214|Hs108|chr17 (251 aa)
initn: 417 init1: 361 opt: 457 Z-score: 383.9 bits: 78.4 E(32554): 4.6e-15
Smith-Waterman score: 475; 43.1% identity (60.2% similar) in 211 aa overlap (2-207:26-223)
10 20 30
pF1KB8 MSSYFVNSTFPVTLASGQESFLGQLPLYSSGY--AD
:.:. .. : :.::. . : . : :
CCDS11 MAMSSFLINSNYVDPKFPPCEEYSQSDYLPSDHSPGYYAGGQRRESSFQPEAGFGRRAAC
10 20 30 40 50 60
40 50 60 70 80 90
pF1KB8 PLRHYPAPYGPGPGQDKGFATSSYYPPAGGGYGRAAPCDYGPAPAFYREKESACALSGAD
...: : ::: :: : . :: :: :. : . : ...
CCDS11 TVQRYAACRDPGPPPPPPPPPP---PPPPPGLSPRAPAP-PPAGALLPEPGQRC--EAVS
70 80 90 100 110
100 110 120 130 140 150
pF1KB8 EQPPFHPEPRKSDCAQDKSVFGETEEQKCSTPV-YPWMQRMN--SCNSSSFGPSGRRGRQ
.:: : : :::. . .. :. :: ::::.... . : . : .:.:
CCDS11 SSPP--PPP----CAQNP-LHPSPSHSACKEPVVYPWMRKVHVSTVNPNYAGGEPKRSRT
120 130 140 150 160
160 170 180 190 200 210
pF1KB8 TYTRYQTLELEKEFHYNRYLTRRRRIEIAHALCLTERQIKIWFQNRRMKWKKESKLLSAS
.::: :.::::::::::::::::::.::::::::.:::::::::::::::::. ::
CCDS11 AYTRQQVLELEKEFHYNRYLTRRRRVEIAHALCLSERQIKIWFQNRRMKWKKDHKLPNTK
170 180 190 200 210 220
220
pF1KB8 QLSAEEEEEKQAE
CCDS11 IRSGGAAGSAGGPPGRPNGGPRAL
230 240 250
224 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 16:24:38 2016 done: Fri Nov 4 16:24:39 2016
Total Scan time: 2.510 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]