FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9838, 352 aa
1>>>pF1KB9838 352 - 352 aa - 352 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 9.0696+/-0.000933; mu= 3.4497+/- 0.057
mean_var=317.7144+/-65.414, 0's: 0 Z-trim(116.9): 20 B-trim: 280 in 2/50
Lambda= 0.071954
statistics sampled from 17583 (17601) to 17583 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.817), E-opt: 0.2 (0.541), width: 16
Scan time: 3.440
The best scores are:                                      opt  bits E(32554)
CCDS34955.1 MAFA gene_id:389692|Hs108|chr8        ( 353) 2473 269.5 3.1e-72
CCDS42198.1 MAF gene_id:4094|Hs108|chr16          ( 373)  616  76.7 3.4e-14
CCDS10928.1 MAF gene_id:4094|Hs108|chr16          ( 403)  616  76.8 3.6e-14
CCDS13311.1 MAFB gene_id:9935|Hs108|chr20         ( 323)  576  72.5 5.5e-13
>>CCDS34955.1 MAFA gene_id:389692|Hs108|chr8 (353 aa)
initn: 1873 init1: 1533 opt: 2473 Z-score: 1410.4 bits: 269.5 E(32554): 3.1e-72
Smith-Waterman score: 2473; 99.7% identity (99.7% similar) in 353 aa overlap (1-352:1-353)
10 20 30 40 50 60
pF1KB9 MAAELAMGAELPSSPLAIEYVNDFDLMKFEVKKEPPEAERFCHRLPPGSLSSTPLSTPCS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MAAELAMGAELPSSPLAIEYVNDFDLMKFEVKKEPPEAERFCHRLPPGSLSSTPLSTPCS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 SVPSSPSFCAPSPGTGGGGGAGGGGGSSQAGGAPGPPSGGPGAVGGTSGKPALEDLYWMS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 SVPSSPSFCAPSPGTGGGGGAGGGGGSSQAGGAPGPPSGGPGAVGGTSGKPALEDLYWMS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 GYQHHLNPEALNLTPEDAVEALIGSGHHGAHHGAHHPAAAAAYEAFRGPGFAGGGGADDM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 GYQHHLNPEALNLTPEDAVEALIGSGHHGAHHGAHHPAAAAAYEAFRGPGFAGGGGADDM
130 140 150 160 170 180
190 200 210 220 230
pF1KB9 GAGHHHGAHHAAHHHHAAHHHHHHHHH-GGAGHGGGAGHHVRLEERFSDDQLVSMSVREL
::::::::::::::::::::::::::: ::::::::::::::::::::::::::::::::
CCDS34 GAGHHHGAHHAAHHHHAAHHHHHHHHHHGGAGHGGGAGHHVRLEERFSDDQLVSMSVREL
190 200 210 220 230 240
240 250 260 270 280 290
pF1KB9 NRQLRGFSKEEVIRLKQKRRTLKNRGYAQSCRFKRVQQRHILESEKCQLQSQVEQLKLEV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 NRQLRGFSKEEVIRLKQKRRTLKNRGYAQSCRFKRVQQRHILESEKCQLQSQVEQLKLEV
250 260 270 280 290 300
300 310 320 330 340 350
pF1KB9 GRLAKERDLYKEKYEKLAGRGGPGSAGGAGFPREPSPPQAGPGGAKGTADFFL
:::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 GRLAKERDLYKEKYEKLAGRGGPGSAGGAGFPREPSPPQAGPGGAKGTADFFL
310 320 330 340 350
>>CCDS42198.1 MAF gene_id:4094|Hs108|chr16 (373 aa)
initn: 767 init1: 547 opt: 616 Z-score: 368.3 bits: 76.7 E(32554): 3.4e-14
Smith-Waterman score: 1036; 50.0% identity (63.6% similar) in 396 aa overlap (4-334:4-369)
10 20 30 40 50
pF1KB9 MAAELAMG-AELPSSPLAIEYVNDFDLMKFEVKKEPPEAERF---CHRL-PPGSLSSTPL
::::. ..::.::::.::::::::::::::::: :..:. : :: :::::::.
CCDS42 MASELAMSNSDLPTSPLAMEYVNDFDLMKFEVKKEPVETDRIISQCGRLIAGGSLSSTPM
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB9 STPCSSVPSSPSFCAPSPGTGGGGGAGGGGGSSQAGGAPGPPSGGPGAVGGTSGKPALED
:::::::: :::: :::::. :: : : :::
CCDS42 STPCSSVPPSPSFSAPSPGS----------GSEQ--------------------KAHLED
70 80 90
120 130 140 150 160 170
pF1KB9 LYWMSGYQHHLNPEALNLTPEDAVEALIGSGHH--GAHHGAHHPAAAAAYEAFRGPGFAG
:::.:: ..::::::...:::::::::...:. :. : . : : : : : .
CCDS42 YYWMTGYPQQLNPEALGFSPEDAVEALISNSHQLQGGFDGYARGAQQLAAAAGAGAGASL
100 110 120 130 140 150
180 190 200
pF1KB9 GGGADDMG------------AGHHHGA--HHAAHHHHAAHHHHHHH--------------
::....:: :. . :: :. :::::: ::::
CCDS42 GGSGEEMGPAAAVVSAVIAAAAAQSGAGPHYHHHHHHAAGHHHHPTAGAPGAAGSAAASA
160 170 180 190 200 210
210 220 230
pF1KB9 ------------HHGGAGHGGGAG--------------HH----VRLEERFSDDQLVSMS
::.: :::.: :: .....::::.:::.::
CCDS42 GGAGGAGGGGPASAGGGGGGGGGGGGGGAAGAGGALHPHHAAGGLHFDDRFSDEQLVTMS
220 230 240 250 260 270
240 250 260 270 280 290
pF1KB9 VRELNRQLRGFSKEEVIRLKQKRRTLKNRGYAQSCRFKRVQQRHILESEKCQLQSQVEQL
:::::::::: :::::::::::::::::::::::::::::::::.::::: :: .::..:
CCDS42 VRELNRQLRGVSKEEVIRLKQKRRTLKNRGYAQSCRFKRVQQRHVLESEKNQLLQQVDHL
280 290 300 310 320 330
300 310 320 330 340 350
pF1KB9 KLEVGRLAKERDLYKEKYEKLAGRGGPGSAGGAGFPREPSPPQAGPGGAKGTADFFL
: :..::..::: ::::::::.. : ..... : :
CCDS42 KQEISRLVRERDAYKEKYEKLVSSGFRENGSSSDNPSSPEFFM
340 350 360 370
>>CCDS10928.1 MAF gene_id:4094|Hs108|chr16 (403 aa)
initn: 767 init1: 547 opt: 616 Z-score: 367.9 bits: 76.8 E(32554): 3.6e-14
Smith-Waterman score: 1036; 50.0% identity (63.6% similar) in 396 aa overlap (4-334:4-369)
10 20 30 40 50
pF1KB9 MAAELAMG-AELPSSPLAIEYVNDFDLMKFEVKKEPPEAERF---CHRL-PPGSLSSTPL
::::. ..::.::::.::::::::::::::::: :..:. : :: :::::::.
CCDS10 MASELAMSNSDLPTSPLAMEYVNDFDLMKFEVKKEPVETDRIISQCGRLIAGGSLSSTPM
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB9 STPCSSVPSSPSFCAPSPGTGGGGGAGGGGGSSQAGGAPGPPSGGPGAVGGTSGKPALED
:::::::: :::: :::::. :: : : :::
CCDS10 STPCSSVPPSPSFSAPSPGS----------GSEQ--------------------KAHLED
70 80 90
120 130 140 150 160 170
pF1KB9 LYWMSGYQHHLNPEALNLTPEDAVEALIGSGHH--GAHHGAHHPAAAAAYEAFRGPGFAG
:::.:: ..::::::...:::::::::...:. :. : . : : : : : .
CCDS10 YYWMTGYPQQLNPEALGFSPEDAVEALISNSHQLQGGFDGYARGAQQLAAAAGAGAGASL
100 110 120 130 140 150
180 190 200
pF1KB9 GGGADDMG------------AGHHHGA--HHAAHHHHAAHHHHHHH--------------
::....:: :. . :: :. :::::: ::::
CCDS10 GGSGEEMGPAAAVVSAVIAAAAAQSGAGPHYHHHHHHAAGHHHHPTAGAPGAAGSAAASA
160 170 180 190 200 210
210 220 230
pF1KB9 ------------HHGGAGHGGGAG--------------HH----VRLEERFSDDQLVSMS
::.: :::.: :: .....::::.:::.::
CCDS10 GGAGGAGGGGPASAGGGGGGGGGGGGGGAAGAGGALHPHHAAGGLHFDDRFSDEQLVTMS
220 230 240 250 260 270
240 250 260 270 280 290
pF1KB9 VRELNRQLRGFSKEEVIRLKQKRRTLKNRGYAQSCRFKRVQQRHILESEKCQLQSQVEQL
:::::::::: :::::::::::::::::::::::::::::::::.::::: :: .::..:
CCDS10 VRELNRQLRGVSKEEVIRLKQKRRTLKNRGYAQSCRFKRVQQRHVLESEKNQLLQQVDHL
280 290 300 310 320 330
300 310 320 330 340 350
pF1KB9 KLEVGRLAKERDLYKEKYEKLAGRGGPGSAGGAGFPREPSPPQAGPGGAKGTADFFL
: :..::..::: ::::::::.. : ..... : :
CCDS10 KQEISRLVRERDAYKEKYEKLVSSGFRENGSSSDNPSSPEFFITEPTRKLEPSVGYATFW
340 350 360 370 380 390
CCDS10 KPQHRVLTSVFTK
400
>>CCDS13311.1 MAFB gene_id:9935|Hs108|chr20 (323 aa)
initn: 931 init1: 510 opt: 576 Z-score: 346.6 bits: 72.5 E(32554): 5.5e-13
Smith-Waterman score: 991; 51.7% identity (61.8% similar) in 377 aa overlap (1-334:1-319)
10 20 30 40 50
pF1KB9 MAAELAMGAELPSSPLAIEYVNDFDLMKFEVKKEP-PEAERF---CHRLPP-GSLSSTPL
:::::.:: :::.::::.::::::::.::.::::: .::: : :: : ::.:::::
CCDS13 MAAELSMGPELPTSPLAMEYVNDFDLLKFDVKKEPLGRAERPGRPCTRLQPAGSVSSTPL
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB9 STPCSSVPSSPSFCAPSPGTGGGGGAGGGGGSSQAGGAPGPPSGGPGAVGGTSGKPALED
::::::::::::: :: : : :::
CCDS13 STPCSSVPSSPSF---SP---------------------------------TEQKTHLED
70 80
120 130 140 150 160 170
pF1KB9 LYWMSGYQHHLNPEALNLTPEDAVEALIGSGHHGAHHGAHHPAAAAAYEAFRGPGFAGGG
::::.. ...::::::::::::::::::: : . .: . ...:::
CCDS13 LYWMASNYQQMNPEALNLTPEDAVEALIGS------HPVPQPLQS--FDSFRG-------
90 100 110 120
180 190 200
pF1KB9 GADDMGAGHHHGAHHAAHHHHA--------------AHHHHHHHH---------------
.::: :: : ::: :: ::::::
CCDS13 -------AHHHHHHHHPHPHHAYPGAGVAHDELGPHAHPHHHHHHQASPPPSSAASPAQQ
130 140 150 160 170 180
210 220 230 240 250
pF1KB9 ----HGGAG-HGGG----AGHHVRLEERFSDDQLVSMSVRELNRQLRGFSKEEVIRLKQK
: : : :. . :: . .:.:::::::::::::::::.::::.:.::::::::
CCDS13 LPTSHPGPGPHATASATAAGGNGSVEDRFSDDQLVSMSVRELNRHLRGFTKDEVIRLKQK
190 200 210 220 230 240
260 270 280 290 300 310
pF1KB9 RRTLKNRGYAQSCRFKRVQQRHILESEKCQLQSQVEQLKLEVGRLAKERDLYKEKYEKLA
::::::::::::::.:::::.: ::.:: :: .:::::: ::.:::.::: :: : ::::
CCDS13 RRTLKNRGYAQSCRYKRVQQKHHLENEKTQLIQQVEQLKQEVSRLARERDAYKVKCEKLA
250 260 270 280 290 300
320 330 340 350
pF1KB9 GRGGPGSAGGAGFPREPSPPQAGPGGAKGTADFFL
. : ... . : :
CCDS13 NSGFREAGSTSDSPSSPEFFL
310 320
352 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 09:31:39 2016 done: Sun Nov 6 09:31:40 2016
Total Scan time: 3.440 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]