FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8960, 340 aa
1>>>pF1KB8960 340 - 340 aa - 340 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.9468+/-0.000862; mu= 9.8542+/- 0.052
mean_var=143.1211+/-29.685, 0's: 0 Z-trim(111.2): 158 B-trim: 686 in 2/51
Lambda= 0.107207
statistics sampled from 12042 (12211) to 12042 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.743), E-opt: 0.2 (0.375), width: 16
Scan time: 2.480
The best scores are:                                opt  bits E(32554)
CCDS2266.1 HOXD10 gene_id:3236|Hs108|chr2   ( 340) 2271 362.5 2.8e-100
CCDS8868.1 HOXC10 gene_id:3226|Hs108|chr12  ( 342)  958 159.5 3.8e-39
CCDS5410.2 HOXA10 gene_id:3206|Hs108|chr7   ( 410)  604 104.8 1.3e-22
CCDS2267.2 HOXD9 gene_id:3235|Hs108|chr2    ( 352)  439  79.2 5.6e-15
CCDS11534.1 HOXB9 gene_id:3219|Hs108|chr17  ( 250)  432  78.0 9.3e-15
CCDS8869.1 HOXC9 gene_id:3225|Hs108|chr12   ( 260)  431  77.8 1.1e-14
CCDS5409.1 HOXA9 gene_id:3205|Hs108|chr7    ( 272)  419  76.0 4e-14
>>CCDS2266.1 HOXD10 gene_id:3236|Hs108|chr2 (340 aa)
initn: 2271 init1: 2271 opt: 2271 Z-score: 1913.9 bits: 362.5 E(32554): 2.8e-100
Smith-Waterman score: 2271; 100.0% identity (100.0% similar) in 340 aa overlap (1-340:1-340)
               10        20        30        40        50        60
pF1KB8 MSFPNSSPAANTFLVDSLISACRSDSFYSSSASMYMPPPSADMGTYGMQTCGLLPSLAKR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 MSFPNSSPAANTFLVDSLISACRSDSFYSSSASMYMPPPSADMGTYGMQTCGLLPSLAKR
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB8 EVNHQNMGMNVHPYIPQVDSWTDPNRSCRIEQPVTQQVPTCSFTTNIKEESNCCMYSDKR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 EVNHQNMGMNVHPYIPQVDSWTDPNRSCRIEQPVTQQVPTCSFTTNIKEESNCCMYSDKR
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB8 NKLISAEVPSYQRLVPESCPVENPEVPVPGYFRLSQTYATGKTQEYNNSPEGSSTVMLQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 NKLISAEVPSYQRLVPESCPVENPEVPVPGYFRLSQTYATGKTQEYNNSPEGSSTVMLQL
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB8 NPRGAAKPQLSAAQLQMEKKMNEPVSGQEPTKVSQVESPEAKGGLPEERSCLAEVSVSSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 NPRGAAKPQLSAAQLQMEKKMNEPVSGQEPTKVSQVESPEAKGGLPEERSCLAEVSVSSP
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB8 EVQEKESKEEIKSDTPTSNWLTAKSGRKKRCPYTKHQTLELEKEFLFNMYLTRERRLEIS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 EVQEKESKEEIKSDTPTSNWLTAKSGRKKRCPYTKHQTLELEKEFLFNMYLTRERRLEIS
              250       260       270       280       290       300
              310       320       330       340
pF1KB8 KSVNLTDRQVKIWFQNRRMKLKKMSRENRIRELTANLTFS
       ::::::::::::::::::::::::::::::::::::::::
CCDS22 KSVNLTDRQVKIWFQNRRMKLKKMSRENRIRELTANLTFS
              310       320       330       340
>>CCDS8868.1 HOXC10 gene_id:3226|Hs108|chr12 (342 aa)
initn: 947 init1: 589 opt: 958 Z-score: 816.3 bits: 159.5 E(32554): 3.8e-39
Smith-Waterman score: 958; 50.5% identity (75.1% similar) in 321 aa overlap (28-340:25-342)
10 20 30 40 50 60
pF1KB8 MSFPNSSPAANTFLVDSLISACRSDSFYSSSASMYMPPPSADMGTYGMQTCGLLPSLAKR
:: ::.::: : :.. :. ::: :::.::
CCDS88 MTCPRNVTPNSYAEPLAAPGGGERYSRSAGMYMQSGS-DFNCGVMRGCGLAPSLSKR
10 20 30 40 50
70 80 90 100 110
pF1KB8 -EVNHQNMGMNVHP-YIPQVDSWTDPNRSCRIEQPVTQQVPTCSFTTNIKEESNCCMYSD
: . ....:..: :. :.::: ::. . :.:::: . . .::. ..:::. :::::
CCDS88 DEGSSPSLALNTYPSYLSQLDSWGDPKAAYRLEQPVGRPLSSCSYPPSVKEENVCCMYSA
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB8 KRNKLISAEVPSYQRLVPESCPVENPEVPVPGYFRLSQTY-ATGKTQEYN--NSPEGSST
.. . :. :.. .:::: :. :::::.:.: : .: : :: . . :. :.
CCDS88 EKRAKSGPEAALYSHPLPESCLGEH-EVPVPSYYRASPSYSALDKTPHCSGANDFEAPFE
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB8 VMLQLNPRGA--AKPQLSAAQLQMEKKMNEPVSGQEPTKVSQVESPEAKGGLPEE-RSCL
.::::. .:::.. .... . . . :.... .: . : : : ..
CCDS88 QRASLNPRAEHLESPQLGG-KVSFPETPKSDSQTPSPNEIKTEQSLAGPKGSPSESEKER
180 190 200 210 220 230
240 250 260 270 280 290
pF1KB8 AEVSVSSPEVQEKESKEEIKSDTPTSNWLTAKSGRKKRCPYTKHQTLELEKEFLFNMYLT
:... :::.....:.:::::... :.::::::::::::::::::::::::::::::::::
CCDS88 AKAADSSPDTSDNEAKEEIKAENTTGNWLTAKSGRKKRCPYTKHQTLELEKEFLFNMYLT
240 250 260 270 280 290
300 310 320 330 340
pF1KB8 RERRLEISKSVNLTDRQVKIWFQNRRMKLKKMSRENRIRELTANLTFS
:::::::::..:::::::::::::::::::::.:::::::::.:..:.
CCDS88 RERRLEISKTINLTDRQVKIWFQNRRMKLKKMNRENRIRELTSNFNFT
300 310 320 330 340
>>CCDS5410.2 HOXA10 gene_id:3206|Hs108|chr7 (410 aa)
initn: 864 init1: 514 opt: 604 Z-score: 519.4 bits: 104.8 E(32554): 1.3e-22
Smith-Waterman score: 802; 46.1% identity (65.7% similar) in 362 aa overlap (28-340:57-410)
10 20 30 40 50
pF1KB8 MSFPNSSPAANTFLVDSLISACRSDSFYSSSASMYMPPPSADMGTYGMQTCGLLPSL
: . ...:.:: .::. ::.:.:::.:.:
CCDS54 NSFLVDSLISSGRGEAGGGGGGAGGGGGGGYYAHGGVYLPP-AADL-PYGLQSCGLFPTL
30 40 50 60 70 80
60 70 80 90
pF1KB8 A-KREVNHQ--------NMGMNVHPYIPQ-VDSWTDPNRSCRIEQP-----VTQQVP---
. ::. . ..: ..: : :. .: : : ::::.: : :: :
CCDS54 GGKRNEAASPGSGGGGGGLGPGAHGYGPSPIDLWLDAPRSCRMEPPDGPPPPPQQQPPPP
90 100 110 120 130 140
100 110 120 130 140
pF1KB8 -----------TCSFTTNIKEESNCCMY--SDKRNKL--ISAEVPSYQRLVP-ESCPV-E
.:::. ::::::. :.: .:: :. .::. . : : ..: .
CCDS54 PQPPQPAPQATSCSFAQNIKEESSYCLYDSADKCPKVSATAAELAPFPRGPPPDGCALGT
150 160 170 180 190 200
150 160 170 180 190
pF1KB8 NPEVPVPGYFRLSQTYATGKTQEYNNSPEGSSTVM---LQLNPRGAA---KPQLSAAQLQ
. :::::::::::.:.:.: :... :.. . . .: : . : :.... .
CCDS54 SSGVPVPGYFRLSQAYGTAKG--YGSGGGGAQQLGAGPFPAQPPGRGFDLPPALASGSAD
210 220 230 240 250 260
200 210 220 230 240
pF1KB8 MEKKMNEPVSGQEPTKV------SQV-ESPEAKGGLPEERS-CLAEVSVSSPEVQEKESK
.: : :: . :: : .:... :: : .: : .::: :.:
CCDS54 AARKERALDSPPPPTLACGSGGGSQGDEEAHASSSAAEELSPAPSESSKASPE---KDSL
270 280 290 300 310
250 260 270 280 290 300
pF1KB8 EEIKSDTPTSNWLTAKSGRKKRCPYTKHQTLELEKEFLFNMYLTRERRLEISKSVNLTDR
. :... ..::::::::::::::::::::::::::::::::::::::::::.::.::::
CCDS54 GNSKGEN-AANWLTAKSGRKKRCPYTKHQTLELEKEFLFNMYLTRERRLEISRSVHLTDR
320 330 340 350 360 370
310 320 330 340
pF1KB8 QVKIWFQNRRMKLKKMSRENRIRELTANLTFS
::::::::::::::::.:::::::::::..::
CCDS54 QVKIWFQNRRMKLKKMNRENRIRELTANFNFS
380 390 400 410
>>CCDS2267.2 HOXD9 gene_id:3235|Hs108|chr2 (352 aa)
initn: 468 init1: 385 opt: 439 Z-score: 382.3 bits: 79.2 E(32554): 5.6e-15
Smith-Waterman score: 443; 30.6% identity (56.9% similar) in 353 aa overlap (5-327:12-346)
10 20 30 40
pF1KB8 MSFPNSSPAANTFLVDSLISACRSDSFYSSSASMYMPP-PSADMGTYGMQ---
.:: . ... :::::. .. : :. . :: :.:. :.
CCDS22 MLGGSAGRLKMSSSGTLSNYYVDSLIGHEGDEVF----AARFGPPGPGAQGRPAGVADGP
10 20 30 40 50
50 60 70 80
pF1KB8 --------TCGLLP-----SLAKREVNHQ-----NMGMNVHPYIPQ---VDSWTDPNRSC
.:.. : : . : : :. :::.: . : ..:.:
CCDS22 AATAAEFASCSFAPRSAVFSASWSAVPSQPPAAAAMSGLYHPYVPPPPLAASASEPGRYV
60 70 80 90 100 110
90 100 110 120 130 140
pF1KB8 RIEQPVTQQVPTCSFTTNIKEESNCCMYSDKRNKLISAEVPSYQRLVPESCPVENPEVPV
: . . : .: . .. . :. . :. : . :. .:.
CCDS22 R-----SWMEPLPGFPGGAGGGGGGGGGGPGRGPSPGPSGPANGRHY--GIKPETRAAPA
120 130 140 150 160
150 160 170 180 190 200
pF1KB8 PGYFRLSQTYATGKTQEYNNSPEGSSTVMLQLNPRGAAKPQLSAAQLQMEKKMNEPVSGQ
:. ..: ....:. ..: . .: . .:.. :..: .. ...: ..:
CCDS22 PA--TAASTTSSSSTSLSSSSKRTECSVARE--SQGSSGPEFSCNSF-LQEKAAAATGGT
170 180 190 200 210 220
210 220 230 240 250 260
pF1KB8 EPTKVSQVESPEAKGGLPEERSC----LAEVSVSSPEVQEKE-SKEEIKSDTPTSNWLTA
: . . . . :: : .: . :.. : :... ..... ..:..::. :
CCDS22 GPG--AGIGAATGTGGSSEPSACSDHPIPGCSLKEEEKQHSQPQQQQLDPNNPAANWIHA
230 240 250 260 270 280
270 280 290 300 310 320
pF1KB8 KSGRKKRCPYTKHQTLELEKEFLFNMYLTRERRLEISKSVNLTDRQVKIWFQNRRMKLKK
.: :::::::::.:::::::::::::::::.:: :... .:::.:::::::::::::.::
CCDS22 RSTRKKRCPYTKYQTLELEKEFLFNMYLTRDRRYEVARILNLTERQVKIWFQNRRMKMKK
290 300 310 320 330 340
330 340
pF1KB8 MSRENRIRELTANLTFS
::.:
CCDS22 MSKEKCPKGD
350
>>CCDS11534.1 HOXB9 gene_id:3219|Hs108|chr17 (250 aa)
initn: 478 init1: 401 opt: 432 Z-score: 378.5 bits: 78.0 E(32554): 9.3e-15
Smith-Waterman score: 436; 39.5% identity (65.0% similar) in 220 aa overlap (125-327:35-246)
100 110 120 130 140 150
pF1KB8 TQQVPTCSFTTNIKEESNCCMYSDKRNKLISAEVPSY-QRLVPESCPVENPEVPVPG--Y
:.. :.. ..: :: . :..:: : .
CCDS11 GTLSSYYVDSIISHESEDAPPAKFPSGQYASSRQPGHAEHLEFPSCSFQ-PKAPVFGASW
10 20 30 40 50 60
160 170 180 190 200
pF1KB8 FRLSQTYATGKTQEYNN---SPEGSSTV-------MLQLNPRGAAKPQLSAAQLQMEKKM
:: .:.:. . .:.: . :. ::: : : . : .. : .
CCDS11 APLSP-HASGSLPSVYHPYIQPQGVPPAESRYLRTWLEPAPRGEAAPGQGQAAVKAEPLL
70 80 90 100 110 120
210 220 230 240 250
pF1KB8 NEPVSGQEPTKVSQVESPEAKGG----LPEERSCLAEVSVSSPEVQEKESKEEIKSDTPT
. : :. . . : :...: : ..: .. .. . .:.::. . .:.
CCDS11 GAP--GELLKQGTPEYSLETSAGREAVLSNQRPGYGDNKIC----EGSEDKERPDQTNPS
130 140 150 160 170
260 270 280 290 300 310
pF1KB8 SNWLTAKSGRKKRCPYTKHQTLELEKEFLFNMYLTRERRLEISKSVNLTDRQVKIWFQNR
.::: :.:.:::::::::.:::::::::::::::::.:: :... .::..::::::::::
CCDS11 ANWLHARSSRKKRCPYTKYQTLELEKEFLFNMYLTRDRRHEVARLLNLSERQVKIWFQNR
180 190 200 210 220 230
320 330 340
pF1KB8 RMKLKKMSRENRIRELTANLTFS
:::.:::..:
CCDS11 RMKMKKMNKEQGKE
240 250
>>CCDS8869.1 HOXC9 gene_id:3225|Hs108|chr12 (260 aa)
initn: 466 init1: 411 opt: 431 Z-score: 377.4 bits: 77.8 E(32554): 1.1e-14
Smith-Waterman score: 435; 39.4% identity (66.2% similar) in 216 aa overlap (134-332:43-258)
110 120 130 140 150 160
pF1KB8 TTNIKEESNCCMYSDKRNKLISAEVPSYQRLVPESCPVEN-PEVPVPGYFRLSQTYATGK
:::. . .: :. : : . . ..
CCDS88 DSLISHDNEDLLASRFPATGAHPAAARPSGLVPDCSDFPSCSFAPKPAVFSTSWAPVPSQ
20 30 40 50 60 70
170 180 190 200 210
pF1KB8 T----QEYNNSPE-GSSTVMLQ--LNP-RGAAK-PQLSAAQLQMEKKMNEPVSGQEPTKV
. . :. .:. :..: ... :.: ::.. :.. :. .. : . . .
CCDS88 SSVVYHPYGPQPHLGADTRYMRTWLEPLSGAVSFPSFPAGGRHYALKPDAYPGRRADCGP
80 90 100 110 120 130
220 230 240 250 260
pF1KB8 SQVES-PEAKGGLPEERSCLAEVSVSSPEV------QEKESKEEIKSDTPTSNWLTAKSG
.. .: :. : : : : .. :::. ..:: : .. ..:..::. :.:
CCDS88 GEGRSYPDYMYGSPGELRDRAPQTLPSPEADALAGSKHKEEKADLDPSNPVANWIHARST
140 150 160 170 180 190
270 280 290 300 310 320
pF1KB8 RKKRCPYTKHQTLELEKEFLFNMYLTRERRLEISKSVNLTDRQVKIWFQNRRMKLKKMSR
:::::::::.:::::::::::::::::.:: :... .:::.:::::::::::::.:::..
CCDS88 RKKRCPYTKYQTLELEKEFLFNMYLTRDRRYEVARVLNLTERQVKIWFQNRRMKMKKMNK
200 210 220 230 240 250
330 340
pF1KB8 ENRIRELTANLTFS
:. .:
CCDS88 EKTDKEQS
260
>>CCDS5409.1 HOXA9 gene_id:3205|Hs108|chr7 (272 aa)
initn: 431 init1: 383 opt: 419 Z-score: 367.1 bits: 76.0 E(32554): 4e-14
Smith-Waterman score: 419; 53.4% identity (73.7% similar) in 133 aa overlap (203-327:135-267)
180 190 200 210 220
pF1KB8 SSTVMLQLNPRGAAKPQLSAAQLQMEKKMNEPVS---GQEPTKVSQVES--PEAKGGLPE
::.: :. :: ... : : :. :
CCDS54 GRYMRSWLEPTPGALSFAGLPSSRPYGIKPEPLSARRGDCPTLDTHTLSLTDYACGSPPV
110 120 130 140 150 160
230 240 250 260 270 280
pF1KB8 ERSCLAEVSVSSPEVQEKES---KEEIKSDTPTSNWLTAKSGRKKRCPYTKHQTLELEKE
.: .. : . :.:: : : ..:..::: :.: ::::::::::::::::::
CCDS54 DREKQPSEGAFSENNAENESGGDKPPIDPNNPAANWLHARSTRKKRCPYTKHQTLELEKE
170 180 190 200 210 220
290 300 310 320 330 340
pF1KB8 FLFNMYLTRERRLEISKSVNLTDRQVKIWFQNRRMKLKKMSRENRIRELTANLTFS
:::::::::.:: :... .:::.:::::::::::::.::....
CCDS54 FLFNMYLTRDRRYEVARLLNLTERQVKIWFQNRRMKMKKINKDRAKDE
230 240 250 260 270
340 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 16:41:59 2016 done: Fri Nov 4 16:41:59 2016
Total Scan time: 2.480 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]
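
The per-hit header lines in the report above (the `>>` line, the `initn:` statistics line, and the `Smith-Waterman score:` line) follow a regular pattern, so they can be pulled into records with a few regular expressions. The following is a minimal sketch in Python, not part of FASTA itself: the function name `parse_hits`, the dictionary keys, and the regular expressions are assumptions made for illustration, written against the line shapes seen in this report only. Other FASTA versions or output modes may format these lines differently.

```python
import re

# Patterns are assumptions based on the line shapes in this report.
HIT_RE = re.compile(r"^>>(?!>)(\S+)\s+(.*?)\s+\((\d+) aa\)\s*$")
STATS_RE = re.compile(
    r"initn:\s*(\d+)\s+init1:\s*(\d+)\s+opt:\s*(\d+)\s+"
    r"Z-score:\s*([\d.]+)\s+bits:\s*([\d.]+)\s+E\(\d+\):\s*(\S+)"
)
SW_RE = re.compile(
    r"Smith-Waterman score:\s*(\d+);\s*([\d.]+)% identity "
    r"\(([\d.]+)% similar\) in (\d+) aa overlap "
    r"\((\d+)-(\d+):(\d+)-(\d+)\)"
)


def parse_hits(text):
    """Return one dict per '>>' hit block found in a FASTA text report."""
    hits = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        m = HIT_RE.match(line)
        if m:
            # Start of a new hit block.
            current = {
                "id": m.group(1),
                "description": m.group(2),
                "length": int(m.group(3)),
            }
            hits.append(current)
            continue
        if current is None:
            continue  # still in the report preamble
        m = STATS_RE.search(line)
        if m:
            current["opt"] = int(m.group(3))
            current["z_score"] = float(m.group(4))
            current["bits"] = float(m.group(5))
            current["evalue"] = float(m.group(6))
            continue
        m = SW_RE.search(line)
        if m:
            current["sw_score"] = int(m.group(1))
            current["identity_pct"] = float(m.group(2))
            current["similar_pct"] = float(m.group(3))
            current["overlap"] = int(m.group(4))
            # (query_start, query_end, subject_start, subject_end)
            current["coords"] = tuple(int(m.group(i)) for i in range(5, 9))
    return hits


# Two hit headers copied from the report above.
SAMPLE = """\
>>CCDS2266.1 HOXD10 gene_id:3236|Hs108|chr2 (340 aa)
initn: 2271 init1: 2271 opt: 2271 Z-score: 1913.9 bits: 362.5 E(32554): 2.8e-100
Smith-Waterman score: 2271; 100.0% identity (100.0% similar) in 340 aa overlap (1-340:1-340)
>>CCDS8868.1 HOXC10 gene_id:3226|Hs108|chr12 (342 aa)
initn: 947 init1: 589 opt: 958 Z-score: 816.3 bits: 159.5 E(32554): 3.8e-39
Smith-Waterman score: 958; 50.5% identity (75.1% similar) in 321 aa overlap (28-340:25-342)
"""

hits = parse_hits(SAMPLE)
for h in hits:
    print(h["id"], h["opt"], h["bits"], h["evalue"], h["identity_pct"])
```

The sketch reads only the header lines and ignores the alignment rows, so it does not depend on the column spacing of the alignment blocks.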