FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8941, 301 aa
1>>>pF1KB8941 301 - 301 aa - 301 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.7084+/-0.000849; mu= 3.6585+/- 0.052
mean_var=303.6330+/-62.267, 0's: 0 Z-trim(117.1): 131 B-trim: 0 in 0/53
Lambda= 0.073604
statistics sampled from 17702 (17845) to 17702 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.83), E-opt: 0.2 (0.548), width: 16
Scan time: 2.710
The best scores are: opt bits E(32554)
CCDS32675.1 HOXB1 gene_id:3211|Hs108|chr17 ( 301) 2090 234.4 7.9e-62
CCDS5401.1 HOXA1 gene_id:3198|Hs108|chr7 ( 335) 627 79.1 4.9e-15
CCDS2271.1 HOXD1 gene_id:3231|Hs108|chr2 ( 328) 538 69.7 3.4e-12
>>CCDS32675.1 HOXB1 gene_id:3211|Hs108|chr17 (301 aa)
initn: 2090 init1: 2090 opt: 2090 Z-score: 1223.5 bits: 234.4 E(32554): 7.9e-62
Smith-Waterman score: 2090; 99.7% identity (99.7% similar) in 301 aa overlap (1-301:1-301)
10 20 30 40 50 60
pF1KB8 MDYNRMNSFLEYPLCNRGPSAYSAHSAPTSFPPSSAQAVDSYASEGRYGGGLSSPAFQQN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 MDYNRMNSFLEYPLCNRGPSAYSAHSAPTSFPPSSAQAVDSYASEGRYGGGLSSPAFQQN
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB8 SGYPAQQPPSTLGVPFPSSAPSGYAPAACSPSYGPSQYYPLGQSEGDGGYFHPSSYGAQL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 SGYPAQQPPSTLGVPFPSSAPSGYAPAACSPSYGPSQYYPLGQSEGDGGYFHPSSYGAQL
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB8 GGLSDGYGAGGAGPGPYPPQHPPYGNEQTASFAPAYADLLSEDKETPCPSEPNTPTARTF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 GGLSDGYGAGGAGPGPYPPQHPPYGNEQTASFAPAYADLLSEDKETPCPSEPNTPTARTF
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB8 DWMKVKRNPPKTAKVSEPGLGSPSGLRTNFTTRQLTELEKEFHFNKYLSRARRVEIAATL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 DWMKVKRNPPKTAKVSEPGLGSPSGLRTNFTTRQLTELEKEFHFNKYLSRARRVEIAATL
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB8 ELNETQVKIWFQNRRMKQKKREREGGRVPPAPPGCPKEAAGDASDQSTCTSPEASPSSVT
:::::::::::::::::::::::: :::::::::::::::::::::::::::::::::::
CCDS32 ELNETQVKIWFQNRRMKQKKREREEGRVPPAPPGCPKEAAGDASDQSTCTSPEASPSSVT
250 260 270 280 290 300
pF1KB8 S
:
CCDS32 S
>>CCDS5401.1 HOXA1 gene_id:3198|Hs108|chr7 (335 aa)
initn: 623 init1: 386 opt: 627 Z-score: 383.4 bits: 79.1 E(32554): 4.9e-15
Smith-Waterman score: 732; 44.5% identity (66.2% similar) in 337 aa overlap (1-301:1-328)
10 20 30 40 50
pF1KB8 MDYNRMNSFLEYPLCNRGPSAY-SAHSAP-----TSFPPSSAQAVDSYASEGRY--GGG-
:: :::::::::. . : :. ::.. : :.: : : ...: ... :. : :
CCDS54 MDNARMNSFLEYPILSSGDSGTCSARAYPSDHRITTFQ-SCAVSANSCGGDDRFLVGRGV
10 20 30 40 50
60 70 80 90 100
pF1KB8 -LSSPAFQQNSGYPAQQPPSTLGVPFPSSAPSG--YAPAACSPSYG------PSQYYPLG
..:: ... . :: . . .:. : :. ..:.:::: : . : :.
CCDS54 QIGSPHHHHHHHHRHPQPAT-----YQTSGNLGVSYSHSSCGPSYGSQNFSAPYSPYALN
60 70 80 90 100 110
110 120 130 140 150
pF1KB8 QSEGD--GGYFH--PSSYGAQLGG-------LSDGYGAGGAGPGPYPPQHPPYGNEQTAS
: :.: ::: . :. :...:.. .::..:..: : : ::.:. .
CCDS54 Q-EADVSGGYPQCAPAVYSGNLSSPMVQHHHHHQGYAGGAVGSPQYI--HHSYGQEHQSL
120 130 140 150 160 170
160 170 180 190 200
pF1KB8 FAPAYADLLSE---DKETPC--PSEPNTPTARTFDWMKVKRNPPKTAKVSEPG-LGSPSG
.: . :: ... : :. .. :.::::::::::::::.::.: : ::.:..
CCDS54 ALATYNNSLSPLHASHQEACRSPASETSSPAQTFDWMKVKRNPPKTGKVGEYGYLGQPNA
180 190 200 210 220 230
210 220 230 240 250 260
pF1KB8 LRTNFTTRQLTELEKEFHFNKYLSRARRVEIAATLELNETQVKIWFQNRRMKQKKREREG
.::::::.:::::::::::::::.:::::::::.:.:::::::::::::::::::::.::
CCDS54 VRTNFTTKQLTELEKEFHFNKYLTRARRVEIAASLQLNETQVKIWFQNRRMKQKKREKEG
240 250 260 270 280 290
270 280 290 300
pF1KB8 GR-VPPAPPGCPKEAAGDASDQSTCTSPEASPSSVTS
. :: : : : ..:..:. . ::.: ::
CCDS54 LLPISPATPPGNDEKAEESSEKSSSSPCVPSPGSSTSDTLTTSH
300 310 320 330
>>CCDS2271.1 HOXD1 gene_id:3231|Hs108|chr2 (328 aa)
initn: 427 init1: 352 opt: 538 Z-score: 332.4 bits: 69.7 E(32554): 3.4e-12
Smith-Waterman score: 560; 41.9% identity (58.7% similar) in 315 aa overlap (6-276:1-305)
10 20 30 40 50
pF1KB8 MDYNRMNSFLEYPLCNR-GPSAYSAHSAPTSFPPSSAQAVD-SYASEGRYGGGLSSPAFQ
:.:.::: :. : . .. : .: :.:. : . : : : .
CCDS22 MSSYLEYVSCSSSGGVGGDVLSLAPKFCRSDARPVALQPAFPLGNGDGAFVSCLP
10 20 30 40 50
60 70 80 90
pF1KB8 QNSGYPAQQPPSTLGVPF--PSSAPS------------GYAPAACS--PSYG-----PSQ
.. :. .::.. . : : .::. : :::: . .:: :.
CCDS22 LAAARPSPSPPAAPARPSVPPPAAPQYAQCTLEGAYEPGAAPAAAAGGADYGFLGSGPAY
60 70 80 90 100 110
100 110 120 130 140
pF1KB8 YYP--LGQSEGDGG-YFHPSSYGAQLGG---LSDG---YGAGGAGPGPYPP-------QH
.: ::.. ::: . : .. .. :: : .: :.: : :::.: :
CCDS22 DFPGVLGRAADDGGSHVHYATSAVFSGGGSFLLSGQVDYAAFGE-PGPFPACLKASADGH
120 130 140 150 160 170
150 160 170 180 190 200
pF1KB8 PPYGNEQTASFAPA-YADLLSEDKETPCPSEPNTPTARTFDWMKVKRNPPKTAKVSEPGL
: : :::: ::. : .: : . : . . ::.::::::: : .:..: :
CCDS22 P--GAFQTASPAPGTYPKSVS-----PASGLPAAFS--TFEWMKVKRNASKKGKLAEYGA
180 190 200 210 220
210 220 230 240 250
pF1KB8 GSPSG-LRTNFTTRQLTELEKEFHFNKYLSRARRVEIAATLELNETQVKIWFQNRRMKQK
.:::. .::::.:.:::::::::::::::.::::.::: :.::.:::::::::::::::
CCDS22 ASPSSAIRTNFSTKQLTELEKEFHFNKYLTRARRIEIANCLHLNDTQVKIWFQNRRMKQK
230 240 250 260 270 280
260 270 280 290 300
pF1KB8 KREREG---GRVPPAPPGCPKEAAGDASDQSTCTSPEASPSSVTS
:::::: .: :: :
CCDS22 KREREGLLATAIPVAPLQLPLSGTTPTKFIKNPGSPSQSQEPS
290 300 310 320
301 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 04:22:10 2016 done: Tue Nov 8 04:22:10 2016
Total Scan time: 2.710 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]