FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB9838, 352 aa 1>>>pF1KB9838 352 - 352 aa - 352 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 9.0696+/-0.000933; mu= 3.4497+/- 0.057 mean_var=317.7144+/-65.414, 0's: 0 Z-trim(116.9): 20 B-trim: 280 in 2/50 Lambda= 0.071954 statistics sampled from 17583 (17601) to 17583 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.817), E-opt: 0.2 (0.541), width: 16 Scan time: 3.440 The best scores are: opt bits E(32554) CCDS34955.1 MAFA gene_id:389692|Hs108|chr8 ( 353) 2473 269.5 3.1e-72 CCDS42198.1 MAF gene_id:4094|Hs108|chr16 ( 373) 616 76.7 3.4e-14 CCDS10928.1 MAF gene_id:4094|Hs108|chr16 ( 403) 616 76.8 3.6e-14 CCDS13311.1 MAFB gene_id:9935|Hs108|chr20 ( 323) 576 72.5 5.5e-13 >>CCDS34955.1 MAFA gene_id:389692|Hs108|chr8 (353 aa) initn: 1873 init1: 1533 opt: 2473 Z-score: 1410.4 bits: 269.5 E(32554): 3.1e-72 Smith-Waterman score: 2473; 99.7% identity (99.7% similar) in 353 aa overlap (1-352:1-353) 10 20 30 40 50 60 pF1KB9 MAAELAMGAELPSSPLAIEYVNDFDLMKFEVKKEPPEAERFCHRLPPGSLSSTPLSTPCS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 MAAELAMGAELPSSPLAIEYVNDFDLMKFEVKKEPPEAERFCHRLPPGSLSSTPLSTPCS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 SVPSSPSFCAPSPGTGGGGGAGGGGGSSQAGGAPGPPSGGPGAVGGTSGKPALEDLYWMS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 SVPSSPSFCAPSPGTGGGGGAGGGGGSSQAGGAPGPPSGGPGAVGGTSGKPALEDLYWMS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 GYQHHLNPEALNLTPEDAVEALIGSGHHGAHHGAHHPAAAAAYEAFRGPGFAGGGGADDM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 GYQHHLNPEALNLTPEDAVEALIGSGHHGAHHGAHHPAAAAAYEAFRGPGFAGGGGADDM 130 140 150 160 170 180 190 200 210 220 230 pF1KB9 GAGHHHGAHHAAHHHHAAHHHHHHHHH-GGAGHGGGAGHHVRLEERFSDDQLVSMSVREL ::::::::::::::::::::::::::: :::::::::::::::::::::::::::::::: CCDS34 GAGHHHGAHHAAHHHHAAHHHHHHHHHHGGAGHGGGAGHHVRLEERFSDDQLVSMSVREL 190 200 210 220 230 240 240 250 260 270 280 290 pF1KB9 NRQLRGFSKEEVIRLKQKRRTLKNRGYAQSCRFKRVQQRHILESEKCQLQSQVEQLKLEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 NRQLRGFSKEEVIRLKQKRRTLKNRGYAQSCRFKRVQQRHILESEKCQLQSQVEQLKLEV 250 260 270 280 290 300 300 310 320 330 340 350 pF1KB9 GRLAKERDLYKEKYEKLAGRGGPGSAGGAGFPREPSPPQAGPGGAKGTADFFL ::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 GRLAKERDLYKEKYEKLAGRGGPGSAGGAGFPREPSPPQAGPGGAKGTADFFL 310 320 330 340 350 >>CCDS42198.1 MAF gene_id:4094|Hs108|chr16 (373 aa) initn: 767 init1: 547 opt: 616 Z-score: 368.3 bits: 76.7 E(32554): 3.4e-14 Smith-Waterman score: 1036; 50.0% identity (63.6% similar) in 396 aa overlap (4-334:4-369) 10 20 30 40 50 pF1KB9 MAAELAMG-AELPSSPLAIEYVNDFDLMKFEVKKEPPEAERF---CHRL-PPGSLSSTPL ::::. ..::.::::.::::::::::::::::: :..:. : :: :::::::. CCDS42 MASELAMSNSDLPTSPLAMEYVNDFDLMKFEVKKEPVETDRIISQCGRLIAGGSLSSTPM 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB9 STPCSSVPSSPSFCAPSPGTGGGGGAGGGGGSSQAGGAPGPPSGGPGAVGGTSGKPALED :::::::: :::: :::::. :: : : ::: CCDS42 STPCSSVPPSPSFSAPSPGS----------GSEQ--------------------KAHLED 70 80 90 120 130 140 150 160 170 pF1KB9 LYWMSGYQHHLNPEALNLTPEDAVEALIGSGHH--GAHHGAHHPAAAAAYEAFRGPGFAG :::.:: ..::::::...:::::::::...:. :. : . : : : : : . CCDS42 YYWMTGYPQQLNPEALGFSPEDAVEALISNSHQLQGGFDGYARGAQQLAAAAGAGAGASL 100 110 120 130 140 150 180 190 200 pF1KB9 GGGADDMG------------AGHHHGA--HHAAHHHHAAHHHHHHH-------------- ::....:: :. . :: :. :::::: :::: CCDS42 GGSGEEMGPAAAVVSAVIAAAAAQSGAGPHYHHHHHHAAGHHHHPTAGAPGAAGSAAASA 160 170 180 190 200 210 210 220 230 pF1KB9 ------------HHGGAGHGGGAG--------------HH----VRLEERFSDDQLVSMS ::.: :::.: :: .....::::.:::.:: CCDS42 GGAGGAGGGGPASAGGGGGGGGGGGGGGAAGAGGALHPHHAAGGLHFDDRFSDEQLVTMS 220 230 240 250 260 270 240 250 260 270 280 290 pF1KB9 VRELNRQLRGFSKEEVIRLKQKRRTLKNRGYAQSCRFKRVQQRHILESEKCQLQSQVEQL :::::::::: :::::::::::::::::::::::::::::::::.::::: :: .::..: CCDS42 VRELNRQLRGVSKEEVIRLKQKRRTLKNRGYAQSCRFKRVQQRHVLESEKNQLLQQVDHL 280 290 300 310 320 330 300 310 320 330 340 350 pF1KB9 KLEVGRLAKERDLYKEKYEKLAGRGGPGSAGGAGFPREPSPPQAGPGGAKGTADFFL : :..::..::: ::::::::.. : ..... : : CCDS42 KQEISRLVRERDAYKEKYEKLVSSGFRENGSSSDNPSSPEFFM 340 350 360 370 >>CCDS10928.1 MAF gene_id:4094|Hs108|chr16 (403 aa) initn: 767 init1: 547 opt: 616 Z-score: 367.9 bits: 76.8 E(32554): 3.6e-14 Smith-Waterman score: 1036; 50.0% identity (63.6% similar) in 396 aa overlap (4-334:4-369) 10 20 30 40 50 pF1KB9 MAAELAMG-AELPSSPLAIEYVNDFDLMKFEVKKEPPEAERF---CHRL-PPGSLSSTPL ::::. ..::.::::.::::::::::::::::: :..:. : :: :::::::. CCDS10 MASELAMSNSDLPTSPLAMEYVNDFDLMKFEVKKEPVETDRIISQCGRLIAGGSLSSTPM 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB9 STPCSSVPSSPSFCAPSPGTGGGGGAGGGGGSSQAGGAPGPPSGGPGAVGGTSGKPALED :::::::: :::: :::::. :: : : ::: CCDS10 STPCSSVPPSPSFSAPSPGS----------GSEQ--------------------KAHLED 70 80 90 120 130 140 150 160 170 pF1KB9 LYWMSGYQHHLNPEALNLTPEDAVEALIGSGHH--GAHHGAHHPAAAAAYEAFRGPGFAG :::.:: ..::::::...:::::::::...:. :. : . : : : : : . CCDS10 YYWMTGYPQQLNPEALGFSPEDAVEALISNSHQLQGGFDGYARGAQQLAAAAGAGAGASL 100 110 120 130 140 150 180 190 200 pF1KB9 GGGADDMG------------AGHHHGA--HHAAHHHHAAHHHHHHH-------------- ::....:: :. . :: :. :::::: :::: CCDS10 GGSGEEMGPAAAVVSAVIAAAAAQSGAGPHYHHHHHHAAGHHHHPTAGAPGAAGSAAASA 160 170 180 190 200 210 210 220 230 pF1KB9 ------------HHGGAGHGGGAG--------------HH----VRLEERFSDDQLVSMS ::.: :::.: :: .....::::.:::.:: CCDS10 GGAGGAGGGGPASAGGGGGGGGGGGGGGAAGAGGALHPHHAAGGLHFDDRFSDEQLVTMS 220 230 240 250 260 270 240 250 260 270 280 290 pF1KB9 VRELNRQLRGFSKEEVIRLKQKRRTLKNRGYAQSCRFKRVQQRHILESEKCQLQSQVEQL :::::::::: :::::::::::::::::::::::::::::::::.::::: :: .::..: CCDS10 VRELNRQLRGVSKEEVIRLKQKRRTLKNRGYAQSCRFKRVQQRHVLESEKNQLLQQVDHL 280 290 300 310 320 330 300 310 320 330 340 350 pF1KB9 KLEVGRLAKERDLYKEKYEKLAGRGGPGSAGGAGFPREPSPPQAGPGGAKGTADFFL : :..::..::: ::::::::.. : ..... : : CCDS10 KQEISRLVRERDAYKEKYEKLVSSGFRENGSSSDNPSSPEFFITEPTRKLEPSVGYATFW 340 350 360 370 380 390 CCDS10 KPQHRVLTSVFTK 400 >>CCDS13311.1 MAFB gene_id:9935|Hs108|chr20 (323 aa) initn: 931 init1: 510 opt: 576 Z-score: 346.6 bits: 72.5 E(32554): 5.5e-13 Smith-Waterman score: 991; 51.7% identity (61.8% similar) in 377 aa overlap (1-334:1-319) 10 20 30 40 50 pF1KB9 MAAELAMGAELPSSPLAIEYVNDFDLMKFEVKKEP-PEAERF---CHRLPP-GSLSSTPL :::::.:: :::.::::.::::::::.::.::::: .::: : :: : ::.::::: CCDS13 MAAELSMGPELPTSPLAMEYVNDFDLLKFDVKKEPLGRAERPGRPCTRLQPAGSVSSTPL 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB9 STPCSSVPSSPSFCAPSPGTGGGGGAGGGGGSSQAGGAPGPPSGGPGAVGGTSGKPALED ::::::::::::: :: : : ::: CCDS13 STPCSSVPSSPSF---SP---------------------------------TEQKTHLED 70 80 120 130 140 150 160 170 pF1KB9 LYWMSGYQHHLNPEALNLTPEDAVEALIGSGHHGAHHGAHHPAAAAAYEAFRGPGFAGGG ::::.. ...::::::::::::::::::: : . .: . ...::: CCDS13 LYWMASNYQQMNPEALNLTPEDAVEALIGS------HPVPQPLQS--FDSFRG------- 90 100 110 120 180 190 200 pF1KB9 GADDMGAGHHHGAHHAAHHHHA--------------AHHHHHHHH--------------- .::: :: : ::: :: :::::: CCDS13 -------AHHHHHHHHPHPHHAYPGAGVAHDELGPHAHPHHHHHHQASPPPSSAASPAQQ 130 140 150 160 170 180 210 220 230 240 250 pF1KB9 ----HGGAG-HGGG----AGHHVRLEERFSDDQLVSMSVRELNRQLRGFSKEEVIRLKQK : : : :. . :: . .:.:::::::::::::::::.::::.:.:::::::: CCDS13 LPTSHPGPGPHATASATAAGGNGSVEDRFSDDQLVSMSVRELNRHLRGFTKDEVIRLKQK 190 200 210 220 230 240 260 270 280 290 300 310 pF1KB9 RRTLKNRGYAQSCRFKRVQQRHILESEKCQLQSQVEQLKLEVGRLAKERDLYKEKYEKLA ::::::::::::::.:::::.: ::.:: :: .:::::: ::.:::.::: :: : :::: CCDS13 RRTLKNRGYAQSCRYKRVQQKHHLENEKTQLIQQVEQLKQEVSRLARERDAYKVKCEKLA 250 260 270 280 290 300 320 330 340 350 pF1KB9 GRGGPGSAGGAGFPREPSPPQAGPGGAKGTADFFL . : ... . : : CCDS13 NSGFREAGSTSDSPSSPEFFL 310 320 352 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 09:31:39 2016 done: Sun Nov 6 09:31:40 2016 Total Scan time: 3.440 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]