FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE2725, 403 aa
  1>>>pF1KE2725 403 - 403 aa - 403 aa
Library: human.CCDS.faa
  18921897 residues in 33420 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.5111+/-0.000984; mu= 7.3532+/- 0.060
 mean_var=343.4482+/-70.840, 0's: 0 Z-trim(115.5): 51 B-trim: 0 in 0/53
 Lambda= 0.069206
 statistics sampled from 16226 (16266) to 16226 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.785), E-opt: 0.2 (0.487), width: 16
 Scan time: 1.600

The best scores are:                               opt  bits E(33420)
CCDS10928.1 MAF gene_id:4094|Hs109|chr16   ( 403) 2712 284.1 1.6e-76
CCDS42198.1 MAF gene_id:4094|Hs109|chr16   ( 373) 2506 263.5 2.4e-70
CCDS13311.1 MAFB gene_id:9935|Hs109|chr20  ( 323)  706  83.7 2.8e-16
CCDS34955.1 MAFA gene_id:389692|Hs109|chr8 ( 353)  615  74.7 1.6e-13

>>CCDS10928.1 MAF gene_id:4094|Hs109|chr16 (403 aa)
 initn: 2712 init1: 2712 opt: 2712 Z-score: 1487.5 bits: 284.1 E(33420): 1.6e-76
Smith-Waterman score: 2712; 100.0% identity (100.0% similar) in 403 aa overlap (1-403:1-403)

               10        20        30        40        50        60
pF1KE2 MASELAMSNSDLPTSPLAMEYVNDFDLMKFEVKKEPVETDRIISQCGRLIAGGSLSSTPM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MASELAMSNSDLPTSPLAMEYVNDFDLMKFEVKKEPVETDRIISQCGRLIAGGSLSSTPM
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE2 STPCSSVPPSPSFSAPSPGSGSEQKAHLEDYYWMTGYPQQLNPEALGFSPEDAVEALISN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 STPCSSVPPSPSFSAPSPGSGSEQKAHLEDYYWMTGYPQQLNPEALGFSPEDAVEALISN
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE2 SHQLQGGFDGYARGAQQLAAAAGAGAGASLGGSGEEMGPAAAVVSAVIAAAAAQSGAGPH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 SHQLQGGFDGYARGAQQLAAAAGAGAGASLGGSGEEMGPAAAVVSAVIAAAAAQSGAGPH
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE2 YHHHHHHAAGHHHHPTAGAPGAAGSAAASAGGAGGAGGGGPASAGGGGGGGGGGGGGGAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 YHHHHHHAAGHHHHPTAGAPGAAGSAAASAGGAGGAGGGGPASAGGGGGGGGGGGGGGAA
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE2 GAGGALHPHHAAGGLHFDDRFSDEQLVTMSVRELNRQLRGVSKEEVIRLKQKRRTLKNRG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 GAGGALHPHHAAGGLHFDDRFSDEQLVTMSVRELNRQLRGVSKEEVIRLKQKRRTLKNRG
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE2 YAQSCRFKRVQQRHVLESEKNQLLQQVDHLKQEISRLVRERDAYKEKYEKLVSSGFRENG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 YAQSCRFKRVQQRHVLESEKNQLLQQVDHLKQEISRLVRERDAYKEKYEKLVSSGFRENG
              310       320       330       340       350       360

              370       380       390       400
pF1KE2 SSSDNPSSPEFFITEPTRKLEPSVGYATFWKPQHRVLTSVFTK
       :::::::::::::::::::::::::::::::::::::::::::
CCDS10 SSSDNPSSPEFFITEPTRKLEPSVGYATFWKPQHRVLTSVFTK
              370       380       390       400

>>CCDS42198.1 MAF gene_id:4094|Hs109|chr16 (373 aa)
 initn: 2804 init1: 2506 opt: 2506 Z-score: 1376.7 bits: 263.5 E(33420): 2.4e-70
Smith-Waterman score: 2506; 99.7% identity (100.0% similar) in 373 aa overlap (1-373:1-373)

               10        20        30        40        50        60
pF1KE2 MASELAMSNSDLPTSPLAMEYVNDFDLMKFEVKKEPVETDRIISQCGRLIAGGSLSSTPM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 MASELAMSNSDLPTSPLAMEYVNDFDLMKFEVKKEPVETDRIISQCGRLIAGGSLSSTPM
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE2 STPCSSVPPSPSFSAPSPGSGSEQKAHLEDYYWMTGYPQQLNPEALGFSPEDAVEALISN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 STPCSSVPPSPSFSAPSPGSGSEQKAHLEDYYWMTGYPQQLNPEALGFSPEDAVEALISN
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE2 SHQLQGGFDGYARGAQQLAAAAGAGAGASLGGSGEEMGPAAAVVSAVIAAAAAQSGAGPH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 SHQLQGGFDGYARGAQQLAAAAGAGAGASLGGSGEEMGPAAAVVSAVIAAAAAQSGAGPH
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE2 YHHHHHHAAGHHHHPTAGAPGAAGSAAASAGGAGGAGGGGPASAGGGGGGGGGGGGGGAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 YHHHHHHAAGHHHHPTAGAPGAAGSAAASAGGAGGAGGGGPASAGGGGGGGGGGGGGGAA
              190       200       210       220
230 240 250 260 270 280 290 300 pF1KE2 GAGGALHPHHAAGGLHFDDRFSDEQLVTMSVRELNRQLRGVSKEEVIRLKQKRRTLKNRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 GAGGALHPHHAAGGLHFDDRFSDEQLVTMSVRELNRQLRGVSKEEVIRLKQKRRTLKNRG 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE2 YAQSCRFKRVQQRHVLESEKNQLLQQVDHLKQEISRLVRERDAYKEKYEKLVSSGFRENG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 YAQSCRFKRVQQRHVLESEKNQLLQQVDHLKQEISRLVRERDAYKEKYEKLVSSGFRENG 310 320 330 340 350 360 370 380 390 400 pF1KE2 SSSDNPSSPEFFITEPTRKLEPSVGYATFWKPQHRVLTSVFTK ::::::::::::. CCDS42 SSSDNPSSPEFFM 370 >>CCDS13311.1 MAFB gene_id:9935|Hs109|chr20 (323 aa) initn: 1071 init1: 641 opt: 706 Z-score: 406.1 bits: 83.7 E(33420): 2.8e-16 Smith-Waterman score: 1125; 53.3% identity (71.1% similar) in 377 aa overlap (1-373:1-323) 10 20 30 40 50 pF1KE2 MASELAMSNSDLPTSPLAMEYVNDFDLMKFEVKKEPV-ETDRIISQCGRLIAGGSLSSTP ::.::.:. .::::::::::::::::.::.:::::. ...: : :: .::.:::: CCDS13 MAAELSMG-PELPTSPLAMEYVNDFDLLKFDVKKEPLGRAERPGRPCTRLQPAGSVSSTP 10 20 30 40 50 60 70 80 90 100 110 pF1KE2 MSTPCSSVPPSPSFSAPSPGSGSEQKAHLEDYYWMTGYPQQLNPEALGFSPEDAVEALIS .:::::::: ::::: : .:::.:::: :::.. ::.:::::...:::::::::. CCDS13 LSTPCSSVPSSPSFS-P-----TEQKTHLEDLYWMASNYQQMNPEALNLTPEDAVEALIG 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE2 NSH---QLQGGFDGYARGAQQLAAAAGAGAGASLGGSGEEMGPAAAVVSAVIAAAAAQSG :: : .::.. :::.. . :.: .:.. CCDS13 -SHPVPQPLQSFDSF-RGAHHHHHHHHPHPHHAYPGAG-----------------VAHDE 120 130 140 150 180 190 200 210 220 230 pF1KE2 AGPHYHHHHHHAAGHHHHPTAGAPGAAGSAAASAGGAGGAGGGGPASAGGGGGGGGGGGG ::: : ::: ::: .. :..:.: : . .. : : :.:.. ..::.:. 
CCDS13 LGPHAHPHHH-----HHHQASPPPSSAASPAQQLP-TSHPGPGPHATASATAAGGNGS-- 160 170 180 190 200 240 250 260 270 280 290 pF1KE2 GGAAGAGGALHPHHAAGGLHFDDRFSDEQLVTMSVRELNRQLRGVSKEEVIRLKQKRRTL .:::::.:::.::::::::.::: .:.:::::::::::: CCDS13 --------------------VEDRFSDDQLVSMSVRELNRHLRGFTKDEVIRLKQKRRTL 210 220 230 240 300 310 320 330 340 350 pF1KE2 KNRGYAQSCRFKRVQQRHVLESEKNQLLQQVDHLKQEISRLVRERDAYKEKYEKLVSSGF ::::::::::.:::::.: ::.::.::.:::..::::.:::.::::::: : :::..::: CCDS13 KNRGYAQSCRYKRVQQKHHLENEKTQLIQQVEQLKQEVSRLARERDAYKVKCEKLANSGF 250 260 270 280 290 300 360 370 380 390 400 pF1KE2 RENGSSSDNPSSPEFFITEPTRKLEPSVGYATFWKPQHRVLTSVFTK :: ::.::.:::::::. CCDS13 REAGSTSDSPSSPEFFL 310 320 >>CCDS34955.1 MAFA gene_id:389692|Hs109|chr8 (353 aa) initn: 909 init1: 536 opt: 615 Z-score: 356.6 bits: 74.7 E(33420): 1.6e-13 Smith-Waterman score: 1049; 50.1% identity (63.9% similar) in 399 aa overlap (1-369:1-335) 10 20 30 40 50 60 pF1KE2 MASELAMSNSDLPTSPLAMEYVNDFDLMKFEVKKEPVETDRIISQCGRLIAGGSLSSTPM ::.::::. ..::.::::.::::::::::::::::: :..:. : :: :::::::. CCDS34 MAAELAMG-AELPSSPLAIEYVNDFDLMKFEVKKEPPEAERF---CHRL-PPGSLSSTPL 10 20 30 40 50 70 80 90 pF1KE2 STPCSSVPPSPSFSAPSPGSG----------SEQ--------------------KAHLED :::::::: :::: :::::.: : : : ::: CCDS34 STPCSSVPSSPSFCAPSPGTGGGGGAGGGGGSSQAGGAPGPPSGGPGAVGGTSGKPALED 60 70 80 90 100 110 100 110 120 130 140 150 pF1KE2 YYWMTGYPQQLNPEALGFSPEDAVEALISNSHQLQGGFDGYARGAQQLAAAAGAGAGASL :::.:: ..::::::...:::::::::...:. :. : . : : : : : . CCDS34 LYWMSGYQHHLNPEALNLTPEDAVEALIGSGHH--GAHHGAHHPAAAAAYEAFRGPGFAG 120 130 140 150 160 170 160 170 180 190 200 210 pF1KE2 GGSGEEMGPAAAVVSAVIAAAAAQSGAGPHYHHHHHHAAGHHHHPTAGAPGAAGSAAASA ::....:: :. . :: :. 
:::::: :::: CCDS34 GGGADDMG------------AGHHHGA--HHAAHHHHAAHHHHH---------------- 180 190 200 220 230 240 250 260 270 pF1KE2 GGAGGAGGGGPASAGGGGGGGGGGGGGGAAGAGGALHPHHAAGGLHFDDRFSDEQLVTMS ::.: :::.: :: .....::::.:::.:: CCDS34 ------------HHHHHGGAGHGGGAG-----------HH----VRLEERFSDDQLVSMS 210 220 230 280 290 300 310 320 330 pF1KE2 VRELNRQLRGVSKEEVIRLKQKRRTLKNRGYAQSCRFKRVQQRHVLESEKNQLLQQVDHL :::::::::: :::::::::::::::::::::::::::::::::.::::: :: .::..: CCDS34 VRELNRQLRGFSKEEVIRLKQKRRTLKNRGYAQSCRFKRVQQRHILESEKCQLQSQVEQL 240 250 260 270 280 290 340 350 360 370 380 390 pF1KE2 KQEISRLVRERDAYKEKYEKLVSSGFRENGSSSDNPSSPEFFITEPTRKLEPSVGYATFW : :..::..::: ::::::::.. : ..... : : CCDS34 KLEVGRLAKERDLYKEKYEKLAGRGGPGSAGGAGFPREPSPPQAGPGGAKGTADFFL 300 310 320 330 340 350 400 pF1KE2 KPQHRVLTSVFTK 403 residues in 1 query sequences 18921897 residues in 33420 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Oct 2 18:30:41 2018 done: Tue Oct 2 18:30:41 2018 Total Scan time: 1.600 Total Display time: 0.050 Function used was FASTA [36.3.4 Apr, 2011]