FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB3214, 841 aa
1>>>pF1KB3214 841 - 841 aa - 841 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.7996+/-0.000869; mu= 11.5319+/- 0.053
mean_var=171.5991+/-34.184, 0's: 0 Z-trim(113.2): 139 B-trim: 0 in 0/52
Lambda= 0.097908
statistics sampled from 13675 (13822) to 13675 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.747), E-opt: 0.2 (0.425), width: 16
Scan time: 4.070
The best scores are:                               opt  bits E(32554)
CCDS5026.1 BACH2 gene_id:60468|Hs108|chr6  ( 841) 5749 824.5        0
CCDS13585.1 BACH1 gene_id:571|Hs108|chr21  ( 736)  614  99.1  2.9e-20
>>CCDS5026.1 BACH2 gene_id:60468|Hs108|chr6 (841 aa)
initn: 5749 init1: 5749 opt: 5749 Z-score: 4396.5 bits: 824.5 E(32554): 0
Smith-Waterman score: 5749; 99.9% identity (100.0% similar) in 841 aa overlap (1-841:1-841)
10 20 30 40 50 60
pF1KB3 MSVDEKPDSPMYVYESTVHCTNILLGLNDQRKKDILCDVTLIVERKEFRAHRAVLAACSE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS50 MSVDEKPDSPMYVYESTVHCTNILLGLNDQRKKDILCDVTLIVERKEFRAHRAVLAACSE
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB3 YFWQALVGQTKNDLAVSLPEEVTARGFGPLLQFAYTAKLLLSRENIREVIRCAEFLRMHN
::::::::::::::.:::::::::::::::::::::::::::::::::::::::::::::
CCDS50 YFWQALVGQTKNDLVVSLPEEVTARGFGPLLQFAYTAKLLLSRENIREVIRCAEFLRMHN
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB3 LEDSCFSFLQTQLLNSEDGLFVCRKDAACQRPHEDCENSAGEEEDEEEETMDSETAKMAC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS50 LEDSCFSFLQTQLLNSEDGLFVCRKDAACQRPHEDCENSAGEEEDEEEETMDSETAKMAC
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB3 PRDQMLPEPISFEAAAIPVAEKEEALLPEPDVPTDTKESSEKDALTQYPRYKKYQLACTK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS50 PRDQMLPEPISFEAAAIPVAEKEEALLPEPDVPTDTKESSEKDALTQYPRYKKYQLACTK
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB3 NVYNASSHSTSGFASTFREDNSSNSLKPGLARGQIKSEPPSEENEEESITLCLSGDEPDA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS50 NVYNASSHSTSGFASTFREDNSSNSLKPGLARGQIKSEPPSEENEEESITLCLSGDEPDA
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB3 KDRAGDVEMDRKQPSPAPTPTAPAGAACLERSRSVASPSCLRSLFSITKSVELSGLPSTS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS50 KDRAGDVEMDRKQPSPAPTPTAPAGAACLERSRSVASPSCLRSLFSITKSVELSGLPSTS
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB3 QQHFARSPACPFDKGITQGDLKTDYTPFTGNYGQPHVGQKEVSNFTMGSPLRGPGLEALC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS50 QQHFARSPACPFDKGITQGDLKTDYTPFTGNYGQPHVGQKEVSNFTMGSPLRGPGLEALC
370 380 390 400 410 420
430 440 450 460 470 480
pF1KB3 KQEGELDRRSVIFSSSACDQVSTSVHSYSGVSSLDKDLSEPVPKGLWVGAGQSLPSSQAY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS50 KQEGELDRRSVIFSSSACDQVSTSVHSYSGVSSLDKDLSEPVPKGLWVGAGQSLPSSQAY
430 440 450 460 470 480
490 500 510 520 530 540
pF1KB3 SHGGLMADHLPGRMRPNTSCPVPIKVCPRSPPLETRTRTSSSCSSYSYAEDGSGGSPCSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS50 SHGGLMADHLPGRMRPNTSCPVPIKVCPRSPPLETRTRTSSSCSSYSYAEDGSGGSPCSL
490 500 510 520 530 540
550 560 570 580 590 600
pF1KB3 PLCEFSSSPCSQGARFLATEHQEPGLMGDGMYNQVRPQIKCEQSYGTNSSDESGSFSEAD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS50 PLCEFSSSPCSQGARFLATEHQEPGLMGDGMYNQVRPQIKCEQSYGTNSSDESGSFSEAD
550 560 570 580 590 600
610 620 630 640 650 660
pF1KB3 SESCPVQDRGQEVKLPFPVDQITDLPRNDFQMMIKMHKLTSEQLEFIHDVRRRSKNRIAA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS50 SESCPVQDRGQEVKLPFPVDQITDLPRNDFQMMIKMHKLTSEQLEFIHDVRRRSKNRIAA
610 620 630 640 650 660
670 680 690 700 710 720
pF1KB3 QRCRKRKLDCIQNLECEIRKLVCEKEKLLSERNQLKACMGELLDNFSCLSQEVCRDIQSP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS50 QRCRKRKLDCIQNLECEIRKLVCEKEKLLSERNQLKACMGELLDNFSCLSQEVCRDIQSP
670 680 690 700 710 720
730 740 750 760 770 780
pF1KB3 EQIQALHRYCPVLRPMDLPTASSINPAPLGAEQNIAASQCAVGENVPCCLEPGAAPPGPP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS50 EQIQALHRYCPVLRPMDLPTASSINPAPLGAEQNIAASQCAVGENVPCCLEPGAAPPGPP
730 740 750 760 770 780
790 800 810 820 830 840
pF1KB3 WAPSNTSENCTSGRRLEGTDPGTFSERGPPLEPRSQTVTVDFCQEMTDKCTTDEQPRKDY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS50 WAPSNTSENCTSGRRLEGTDPGTFSERGPPLEPRSQTVTVDFCQEMTDKCTTDEQPRKDY
790 800 810 820 830 840
pF1KB3 T
:
CCDS50 T
>>CCDS13585.1 BACH1 gene_id:571|Hs108|chr21 (736 aa)
initn: 1236 init1: 567 opt: 614 Z-score: 477.3 bits: 99.1 E(32554): 2.9e-20
Smith-Waterman score: 1276; 34.7% identity (58.5% similar) in 850 aa overlap (1-834:1-736)
10 20 30 40 50 60
pF1KB3 MSVDEKPDSPMYVYESTVHCTNILLGLNDQRKKDILCDVTLIVERKEFRAHRAVLAACSE
::..: . ...:::.:: ::.::.::::::::.:::::..:: ..:::::.::::::
CCDS13 MSLSE---NSVFAYESSVHSTNVLLSLNDQRKKDVLCDVTIFVEGQRFRAHRSVLAACSS
10 20 30 40 50
70 80 90 100 110 120
pF1KB3 YFWQALVGQTKNDLAVSLPEEVTARGFGPLLQFAYTAKLLLSRENIREVIRCAEFLRMHN
:: . .:::. ..: ..::::::..:: ::.::::::::.::.::. :: .:.::: .::
CCDS13 YFHSRIVGQADGELNITLPEEVTVKGFEPLIQFAYTAKLILSKENVDEVCKCVEFLSVHN
60 70 80 90 100 110
130 140 150 160 170
pF1KB3 LEDSCFSFLQTQLLNSEDGLFVC-RK---DAACQRPHEDCENSAGEEEDEEEETMDS--E
.:.:::.::. ..:.: : :: .. ::. : . : ...: : . .. :
CCDS13 IEESCFQFLKFKFLDSTADQQECPRKKCFSSHCQKT--DLKLSLLDQRDLETDEVEEFLE
120 130 140 150 160 170
180 190 200 210 220
pF1KB3 TAKMACPRDQMLPEPISFEAAAIPVAEKEEALLPEPDVPTDTKESS--EKDALTQYP---
. .. :. .. . . .: : : : : ..: :: :::: :
CCDS13 NKNVQTPQCKL--RRYQGNAKASP---------PLQDSASQTYESMCLEKDAALALPSLC
180 190 200 210 220
230 240 250 260 270 280
pF1KB3 -RYKKYQLACTKNVYNASSHSTSGFASTFREDNSSNSLKPGLARGQIKSEPPSEENEEES
.:.:.: : :.. : .. .:.: : : :.:..:.:
CCDS13 PKYRKFQKA---------------FGTD-RVRTGESSVKDIHASVQ-----PNERSENE-
230 240 250 260
290 300 310 320 330 340
pF1KB3 ITLCLSGDEPDAKDRAGDVEMDRKQPSPAPTPTAPAGAACLERSRSVASPSCLRSLFSIT
::.: :. .: .. :... . : : :. .: ..: : .
CCDS13 ---CLGG-VPECRDLQVMLKCDESKLAMEPEETKKDPASQCPTEKSEVTP------FPHN
270 280 290 300 310
350 360 370 380 390 400
pF1KB3 KSVELSGLPSTSQQHFARSPACPFDKGITQGDLKTDYTPFTGNYGQPHVGQKEVSNFTMG
.:.. :: : : : .:. ::: :.. :..... .:
CCDS13 SSIDPHGLYSLSLLH-------TYDQ---YGDL---------NFA----GMQNTTVLTE-
320 330 340
410 420 430 440 450 460
pF1KB3 SPLRGPGLEALCKQEGELDRRSVIFSSSACDQVSTSVHSYSGVSSLDKDLSEPVPKGLWV
.:: : .. : :: ... ..:. . ..:: : : ::......: . ::.:
CCDS13 KPLSGTDVQE--KTFGE--SQDLPLKSDLGTREDSSVAS-SDRSSVEREVAEHLAKGFWS
350 360 370 380 390 400
470 480 490 500 510 520
pF1KB3 GAGQSLPSSQAYSHGGLMADHLPGRMRPNTSCP-VPIKVCPRSPPLETRTRTSSSCSSYS
.. : .. : . . :: . :.. .:: : :: .. :: .
CCDS13 DICSTDTPCQMQLSPAVAKDGSEQISQKRSECPWLGIRIS-ESP--EPGQRTFTTLSSVN
410 420 430 440 450 460
530 540 550 560 570 580
pF1KB3 YAEDGSGGSPCSLPLCEFSSSPCSQGARFLATEHQEPGLMGDGMYNQVRPQIKCEQSYGT
: : :. ..: . . : .:. : . : : .
CCDS13 ---------------CPFISTLSTEGC----SSNLE---IGNDDYVSEPQQEPCPYACVI
470 480 490
590 600 610 620 630 640
pF1KB3 NSSDESGSFSEADSESCPVQDRGQEVKLPFPVDQITDLPRNDFQMMIKMHKLTSEQLEFI
. .:.: . .:.::::: .... :::::: ...: .: ::::: ..:::::: :::. :
CCDS13 SLGDDSETDTEGDSESCSAREQECEVKLPFNAQRIISLSRNDFQSLLKMHKLTPEQLDCI
500 510 520 530 540 550
650 660 670 680 690 700
pF1KB3 HDVRRRSKNRIAAQRCRKRKLDCIQNLECEIRKLVCEKEKLLSERNQLKACMGELLDNFS
::.::::::::::::::::::::::::: ::.:: :::.::.::... . .:: .:..
CCDS13 HDIRRRSKNRIAAQRCRKRKLDCIQNLESEIEKLQSEKESLLKERDHILSTLGETKQNLT
560 570 580 590 600 610
710 720 730 740 750 760
pF1KB3 CLSQEVCRDIQ-SPEQIQALHRYCPVLRPMDLPTASSINPAPLGAEQNIAASQCAVGENV
: :.::.. : :::: : .: . :... . . . .: : ..: .
CCDS13 GLCQKVCKEAALSQEQIQILAKYSAADCPLSFLISEKDKSTPDG---ELA---------L
620 630 640 650 660
770 780 790 800 810 820
pF1KB3 PCCLEPGAAPPG--PPWAPSNTSENCTSGRRLEGTDPGTFSERGPPLEPRSQTVTVDFCQ
: . . ::. :: : .:. . . :.. . . .: . :: . :.. ::::
CCDS13 PSIFSLSDRPPAVLPPCARGNSEPGYARGQESQQMSTATSEQAGPAEQCRQSGGISDFCQ
670 680 690 700 710 720
830 840
pF1KB3 EMTDKCTTDEQPRKDYT
.:::::::::
CCDS13 QMTDKCTTDE
730
841 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 00:27:42 2016 done: Sat Nov 5 00:27:42 2016
Total Scan time: 4.070 Total Display time: 0.030
Function used was FASTA [36.3.4 Apr, 2011]