FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8980, 376 aa
1>>>pF1KB8980 376 - 376 aa - 376 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.9068+/-0.000802; mu= 1.9726+/- 0.049
mean_var=202.6803+/-41.877, 0's: 0 Z-trim(115.4): 150 B-trim: 871 in 2/51
Lambda= 0.090088
statistics sampled from 15790 (15956) to 15790 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.799), E-opt: 0.2 (0.49), width: 16
Scan time: 3.000
The best scores are:                                opt  bits E(32554)
CCDS5403.1 HOXA2 gene_id:3199|Hs108|chr7    ( 376) 2513 338.5   6e-93
CCDS11527.1 HOXB2 gene_id:3212|Hs108|chr17  ( 356)  866 124.4 1.6e-28
>>CCDS5403.1 HOXA2 gene_id:3199|Hs108|chr7 (376 aa)
initn: 2513 init1: 2513 opt: 2513 Z-score: 1782.2 bits: 338.5 E(32554): 6e-93
Smith-Waterman score: 2513; 100.0% identity (100.0% similar) in 376 aa overlap (1-376:1-376)
10 20 30 40 50 60
pF1KB8 MNYEFEREIGFINSQPSLAECLTSFPPVADTFQSSSIKTSTLSHSTLIPPPFEQTIPSLN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MNYEFEREIGFINSQPSLAECLTSFPPVADTFQSSSIKTSTLSHSTLIPPPFEQTIPSLN
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB8 PGSHPRHGAGGRPKPSPAGSRGSPVPAGALQPPEYPWMKEKKAAKKTALLPAAAAAATAA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PGSHPRHGAGGRPKPSPAGSRGSPVPAGALQPPEYPWMKEKKAAKKTALLPAAAAAATAA
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB8 ATGPACLSHKESLEIADGSGGGSRRLRTAYTNTQLLELEKEFHFNKYLCRPRRVEIAALL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 ATGPACLSHKESLEIADGSGGGSRRLRTAYTNTQLLELEKEFHFNKYLCRPRRVEIAALL
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB8 DLTERQVKVWFQNRRMKHKRQTQCKENQNSEGKCKSLEDSEKVEEDEEEKTLFEQALSVS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 DLTERQVKVWFQNRRMKHKRQTQCKENQNSEGKCKSLEDSEKVEEDEEEKTLFEQALSVS
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB8 GALLEREGYTFQQNALSQQQAPNGHNGDSQSFPVSPLTSNEKNLKHFQHQSPTVPNCLST
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 GALLEREGYTFQQNALSQQQAPNGHNGDSQSFPVSPLTSNEKNLKHFQHQSPTVPNCLST
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB8 MGQNCGAGLNNDSPEALEVPSLQDFSVFSTDSCLQLSDAVSPSLPGSLDSPVDISADSLD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MGQNCGAGLNNDSPEALEVPSLQDFSVFSTDSCLQLSDAVSPSLPGSLDSPVDISADSLD
310 320 330 340 350 360
370
pF1KB8 FFTDTLTTIDLQHLNY
::::::::::::::::
CCDS54 FFTDTLTTIDLQHLNY
370
>>CCDS11527.1 HOXB2 gene_id:3212|Hs108|chr17 (356 aa)
initn: 921 init1: 716 opt: 866 Z-score: 625.7 bits: 124.4 E(32554): 1.6e-28
Smith-Waterman score: 1037; 50.3% identity (67.5% similar) in 378 aa overlap (1-372:1-354)
10 20 30 40 50 60
pF1KB8 MNYEFEREIGFINSQPSLAECLTSFPPVADTFQSSSIKTSTLSHSTLIPPPFEQTIPSLN
::.::::::::::::::::::::::: : .:::.:::: ::: :::::::.:::.
CCDS11 MNFEFEREIGFINSQPSLAECLTSFPAVLETFQTSSIKESTLIPP---PPPFEQTFPSLQ
10 20 30 40 50
70 80 90 100 110
pF1KB8 PGSHP--RHGAGGRPKPSPAGSRGSPVPAGALQP-PEYPWMKEKKAAKKTALLPAAAAAA
::. : . : . .:: : : : : ::.:::::::.::: . .. . :
CCDS11 PGASTLQRPRSQKRAEDGPALPPPPPPPLPAAPPAPEFPWMKEKKSAKKPSQSATSPSPA
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB8 TAAATGPACLSHKESLEIADGSGGGSRRLRTAYTNTQLLELEKEFHFNKYLCRPRRVEIA
..:. . . : ..: . ...:::.::::::::::::::::::::::::::::::::::
CCDS11 ASAVPASGVGSPADGLGLPEAGGGGARRLRTAYTNTQLLELEKEFHFNKYLCRPRRVEIA
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB8 ALLDLTERQVKVWFQNRRMKHKRQTQCKENQNSEGKCK-SLEDSEKVEEDEEEKTLFEQA
:::::::::::::::::::::::::: .: ..: : .::: . . :: :
CCDS11 ALLDLTERQVKVWFQNRRMKHKRQTQHREPPDGEPACPGALED---ICDPAEEP-----A
180 190 200 210 220
240 250 260 270 280 290
pF1KB8 LSVSGALLEREGYTFQQNALSQQQAPNGHNGDSQSFPVSPLTSNEKNLKHFQHQSPTVPN
: .: : .. . . .:.. ..: . . : ... . . :.
CCDS11 ASPGGPSASRAAW--EACCHPPEVVPGALSADPRPLAV-----------RLEGAGASSPG
230 240 250 260 270
300 310 320 330 340 350
pF1KB8 C-LSTMGQNCGAGLNNDSPEA-LEVPSLQDFSVFSTDSCLQLSDAVSPSLPGSLDSPVDI
: : : . : .: . . : : :.. :..::::::: ..:::: ::::::: .
CCDS11 CALRGAGGLEPGPLPEDVFSGRQDSPFLPDLNFFAADSCLQLSGGLSPSLQGSLDSPVPF
280 290 300 310 320 330
360 370
pF1KB8 SADSLDFFTDTLTTIDLQHLNY
: . :::::.:: .::::
CCDS11 SEEELDFFTSTLCAIDLQFP
340 350
376 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 16:49:30 2016 done: Fri Nov 4 16:49:30 2016
Total Scan time: 3.000 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]