FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB3015, 736 aa
1>>>pF1KB3015 736 - 736 aa - 736 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.9746+/-0.00102; mu= 13.4626+/- 0.062
mean_var=135.9511+/-27.424, 0's: 0 Z-trim(108.7): 151 B-trim: 0 in 0/51
Lambda= 0.109998
statistics sampled from 10252 (10408) to 10252 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.678), E-opt: 0.2 (0.32), width: 16
Scan time: 4.210
The best scores are: opt bits E(32554)
CCDS13585.1 BACH1 gene_id:571|Hs108|chr21 ( 736) 4929 794.3 0
CCDS5026.1 BACH2 gene_id:60468|Hs108|chr6 ( 841) 612 109.2 2.6e-23
>>CCDS13585.1 BACH1 gene_id:571|Hs108|chr21 (736 aa)
initn: 4929 init1: 4929 opt: 4929 Z-score: 4235.2 bits: 794.3 E(32554): 0
Smith-Waterman score: 4929; 100.0% identity (100.0% similar) in 736 aa overlap (1-736:1-736)
10 20 30 40 50 60
pF1KB3 MSLSENSVFAYESSVHSTNVLLSLNDQRKKDVLCDVTIFVEGQRFRAHRSVLAACSSYFH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MSLSENSVFAYESSVHSTNVLLSLNDQRKKDVLCDVTIFVEGQRFRAHRSVLAACSSYFH
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB3 SRIVGQADGELNITLPEEVTVKGFEPLIQFAYTAKLILSKENVDEVCKCVEFLSVHNIEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 SRIVGQADGELNITLPEEVTVKGFEPLIQFAYTAKLILSKENVDEVCKCVEFLSVHNIEE
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB3 SCFQFLKFKFLDSTADQQECPRKKCFSSHCQKTDLKLSLLDQRDLETDEVEEFLENKNVQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 SCFQFLKFKFLDSTADQQECPRKKCFSSHCQKTDLKLSLLDQRDLETDEVEEFLENKNVQ
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB3 TPQCKLRRYQGNAKASPPLQDSASQTYESMCLEKDAALALPSLCPKYRKFQKAFGTDRVR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 TPQCKLRRYQGNAKASPPLQDSASQTYESMCLEKDAALALPSLCPKYRKFQKAFGTDRVR
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB3 TGESSVKDIHASVQPNERSENECLGGVPECRDLQVMLKCDESKLAMEPEETKKDPASQCP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 TGESSVKDIHASVQPNERSENECLGGVPECRDLQVMLKCDESKLAMEPEETKKDPASQCP
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB3 TEKSEVTPFPHNSSIDPHGLYSLSLLHTYDQYGDLNFAGMQNTTVLTEKPLSGTDVQEKT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 TEKSEVTPFPHNSSIDPHGLYSLSLLHTYDQYGDLNFAGMQNTTVLTEKPLSGTDVQEKT
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB3 FGESQDLPLKSDLGTREDSSVASSDRSSVEREVAEHLAKGFWSDICSTDTPCQMQLSPAV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 FGESQDLPLKSDLGTREDSSVASSDRSSVEREVAEHLAKGFWSDICSTDTPCQMQLSPAV
370 380 390 400 410 420
430 440 450 460 470 480
pF1KB3 AKDGSEQISQKRSECPWLGIRISESPEPGQRTFTTLSSVNCPFISTLSTEGCSSNLEIGN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 AKDGSEQISQKRSECPWLGIRISESPEPGQRTFTTLSSVNCPFISTLSTEGCSSNLEIGN
430 440 450 460 470 480
490 500 510 520 530 540
pF1KB3 DDYVSEPQQEPCPYACVISLGDDSETDTEGDSESCSAREQECEVKLPFNAQRIISLSRND
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 DDYVSEPQQEPCPYACVISLGDDSETDTEGDSESCSAREQECEVKLPFNAQRIISLSRND
490 500 510 520 530 540
550 560 570 580 590 600
pF1KB3 FQSLLKMHKLTPEQLDCIHDIRRRSKNRIAAQRCRKRKLDCIQNLESEIEKLQSEKESLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 FQSLLKMHKLTPEQLDCIHDIRRRSKNRIAAQRCRKRKLDCIQNLESEIEKLQSEKESLL
550 560 570 580 590 600
610 620 630 640 650 660
pF1KB3 KERDHILSTLGETKQNLTGLCQKVCKEAALSQEQIQILAKYSAADCPLSFLISEKDKSTP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 KERDHILSTLGETKQNLTGLCQKVCKEAALSQEQIQILAKYSAADCPLSFLISEKDKSTP
610 620 630 640 650 660
670 680 690 700 710 720
pF1KB3 DGELALPSIFSLSDRPPAVLPPCARGNSEPGYARGQESQQMSTATSEQAGPAEQCRQSGG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 DGELALPSIFSLSDRPPAVLPPCARGNSEPGYARGQESQQMSTATSEQAGPAEQCRQSGG
670 680 690 700 710 720
730
pF1KB3 ISDFCQQMTDKCTTDE
::::::::::::::::
CCDS13 ISDFCQQMTDKCTTDE
730
>>CCDS5026.1 BACH2 gene_id:60468|Hs108|chr6 (841 aa)
initn: 1234 init1: 565 opt: 612 Z-score: 531.9 bits: 109.2 E(32554): 2.6e-23
Smith-Waterman score: 1169; 35.2% identity (58.5% similar) in 764 aa overlap (1-662:1-750)
10 20 30 40 50
pF1KB3 MSLSE---NSVFAYESSVHSTNVLLSLNDQRKKDVLCDVTIFVEGQRFRAHRSVLAACSS
::..: . ...:::.:: ::.::.::::::::.:::::..:: ..:::::.::::::
CCDS50 MSVDEKPDSPMYVYESTVHCTNILLGLNDQRKKDILCDVTLIVERKEFRAHRAVLAACSE
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB3 YFHSRIVGQADGELNITLPEEVTVKGFEPLIQFAYTAKLILSKENVDEVCKCVEFLSVHN
:: . .:::. ..: ..::::::..:: ::.::::::::.::.::. :: .:.::: .::
CCDS50 YFWQALVGQTKNDLVVSLPEEVTARGFGPLLQFAYTAKLLLSRENIREVIRCAEFLRMHN
70 80 90 100 110 120
120 130 140 150 160 170
pF1KB3 IEESCFQFLKFKFLDSTADQQECPRKKCFSSHCQKT--DLKLSLLDQRDLETDEVEEFLE
.:.:::.::. ..:.: : :: .. ::. : . : ...: : . .. :
CCDS50 LEDSCFSFLQTQLLNSEDGLFVC-RK---DAACQRPHEDCENSAGEEEDEEEETMDS--E
130 140 150 160 170
180 190 200 210 220
pF1KB3 NKNVQTPQCKL--RRYQGNAKASP---------PLQDSASQTYESMCLEKDAALALPSLC
. .. :. .. . . .: : : : : ..: :: :::: :
CCDS50 TAKMACPRDQMLPEPISFEAAAIPVAEKEEALLPEPDVPTDTKESS--EKDALTQYP---
180 190 200 210 220
230 240 250 260
pF1KB3 PKYRKFQKA---------------FGTD-RVRTGESSVKDIHASVQ-----PNERSENE-
.:.:.: : :.. : .. .:.: : : :.:..:.:
CCDS50 -RYKKYQLACTKNVYNASSHSTSGFASTFREDNSSNSLKPGLARGQIKSEPPSEENEEES
230 240 250 260 270 280
270 280 290 300 310
pF1KB3 ---CLGG-VPECRDLQVMLKCDESKLAMEPEETKKDPASQCPTEKSEVTP------FPHN
::.: :. .: .. :... . : : :. .: ..: : .
CCDS50 ITLCLSGDEPDAKDRAGDVEMDRKQPSPAPTPTAPAGAACLERSRSVASPSCLRSLFSIT
290 300 310 320 330 340
320 330 340
pF1KB3 SSIDPHGLYSLSLLH-------TYDQ---YGDLN-----FAGMQNTTVLTEK--------
.:.. :: : : : .:. :::. :.: . . .:
CCDS50 KSVELSGLPSTSQQHFARSPACPFDKGITQGDLKTDYTPFTGNYGQPHVGQKEVSNFTMG
350 360 370 380 390 400
350 360 370 380 390 400
pF1KB3 -PLSGTDVQE--KTFGE--SQDLPLKSDLGTREDSSVAS-SDRSSVEREVAEHLAKGFWS
:: : .. : :: ... ..:. . ..:: : : ::......: . ::.:
CCDS50 SPLRGPGLEALCKQEGELDRRSVIFSSSACDQVSTSVHSYSGVSSLDKDLSEPVPKGLWV
410 420 430 440 450 460
410 420 430 440 450 460
pF1KB3 DICSTDTPCQMQLSPAVAKDGSEQISQKRSECPWLGIRIS-ESP--EPGQRTFTTLSSVN
.. : .. : . . :: . :.. .:: : :: .. :: .
CCDS50 GAGQSLPSSQAYSHGGLMADHLPGRMRPNTSCP-VPIKVCPRSPPLETRTRTSSSCSSYS
470 480 490 500 510 520
470 480 490
pF1KB3 ---------------CPFISTLSTEGC----SSNLE---IGNDDYVSEPQQEPCPYACVI
: : :. ..: . . : .:. : . : : .
CCDS50 YAEDGSGGSPCSLPLCEFSSSPCSQGARFLATEHQEPGLMGDGMYNQVRPQIKCEQSYGT
530 540 550 560 570 580
500 510 520 530 540 550
pF1KB3 SLGDDSETDTEGDSESCSAREQECEVKLPFNAQRIISLSRNDFQSLLKMHKLTPEQLDCI
. .:.: . .:.::::: .... :::::: ...: .: ::::: ..:::::: :::. :
CCDS50 NSSDESGSFSEADSESCPVQDRGQEVKLPFPVDQITDLPRNDFQMMIKMHKLTSEQLEFI
590 600 610 620 630 640
560 570 580 590 600 610
pF1KB3 HDIRRRSKNRIAAQRCRKRKLDCIQNLESEIEKLQSEKESLLKERDHILSTLGETKQNLT
::.::::::::::::::::::::::::: ::.:: :::.::.::... . .:: .:..
CCDS50 HDVRRRSKNRIAAQRCRKRKLDCIQNLECEIRKLVCEKEKLLSERNQLKACMGELLDNFS
650 660 670 680 690 700
620 630 640 650 660 670
pF1KB3 GLCQKVCKEAALSQEQIQILAKYSAADCPLSFLISEKDKSTPDGELALPSIFSLSDRPPA
: :.::.. : :::: : .: . :... . . . .: :
CCDS50 CLSQEVCRDIQ-SPEQIQALHRYCPVLRPMDLPTASSINPAPLGAEQNIAASQCAVGENV
710 720 730 740 750 760
680 690 700 710 720 730
pF1KB3 VLPPCARGNSEPGYARGQESQQMSTATSEQAGPAEQCRQSGGISDFCQQMTDKCTTDE
CCDS50 PCCLEPGAAPPGPPWAPSNTSENCTSGRRLEGTDPGTFSERGPPLEPRSQTVTVDFCQEM
770 780 790 800 810 820
736 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 20:30:30 2016 done: Thu Nov 3 20:30:31 2016
Total Scan time: 4.210 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]