FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1282, 794 aa
1>>>pF1KE1282 794 - 794 aa - 794 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 12.3793+/-0.00103; mu= -8.5854+/- 0.062
mean_var=471.0500+/-98.159, 0's: 0 Z-trim(117.6): 16 B-trim: 485 in 1/53
Lambda= 0.059094
statistics sampled from 18405 (18417) to 18405 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.812), E-opt: 0.2 (0.566), width: 16
Scan time: 4.750
The best scores are: opt bits E(32554)
CCDS9854.1 IRF2BPL gene_id:64207|Hs108|chr14 ( 796) 5436 478.0 2.6e-134
CCDS41475.1 IRF2BP2 gene_id:359948|Hs108|chr1 ( 571) 750 78.4 3.7e-14
CCDS1602.1 IRF2BP2 gene_id:359948|Hs108|chr1 ( 587) 750 78.4 3.7e-14
>>CCDS9854.1 IRF2BPL gene_id:64207|Hs108|chr14 (796 aa)
initn: 4833 init1: 4833 opt: 5436 Z-score: 2524.9 bits: 478.0 E(32554): 2.6e-134
Smith-Waterman score: 5436; 99.7% identity (99.7% similar) in 796 aa overlap (1-794:1-796)
10 20 30 40 50 60
pF1KE1 MSAAQVSSSRRQSCYLCDLPRMPWAMIWDFSEPVCRGCVNYEGADRIEFVIETARQLKRA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 MSAAQVSSSRRQSCYLCDLPRMPWAMIWDFSEPVCRGCVNYEGADRIEFVIETARQLKRA
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 HGCFQDGRSPGPPPPVGVKTVALSAKEAAAAAAAAAAAAAAAQQQQQQQQQQQQQQQQQQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 HGCFQDGRSPGPPPPVGVKTVALSAKEAAAAAAAAAAAAAAAQQQQQQQQQQQQQQQQQQ
70 80 90 100 110 120
130 140 150 160 170
pF1KE1 QQQQQ--LNHVDGSSKPAVLAAPSGLERYGLSAAAAAAAAAAAAVEQRSRFEYPPPPVSL
::::: :::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 QQQQQQQLNHVDGSSKPAVLAAPSGLERYGLSAAAAAAAAAAAAVEQRSRFEYPPPPVSL
130 140 150 160 170 180
180 190 200 210 220 230
pF1KE1 GSSSHTARLPNGLGGPNGFPKPTPEEGPPELNRQSPNSSSAAASVASRRGTHGGLVTGLP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 GSSSHTARLPNGLGGPNGFPKPTPEEGPPELNRQSPNSSSAAASVASRRGTHGGLVTGLP
190 200 210 220 230 240
240 250 260 270 280 290
pF1KE1 NPGGGGGPQLTVPPNLLPQTLLNGPASAAVLPPPPPHALGSRGPPTPAPPGAPGGPACLG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 NPGGGGGPQLTVPPNLLPQTLLNGPASAAVLPPPPPHALGSRGPPTPAPPGAPGGPACLG
250 260 270 280 290 300
300 310 320 330 340 350
pF1KE1 GTPGVSATSSSASSSTSSSVAEVGVGAGGKRPGSVSSTDQERELKEKQRNAEALAELSES
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 GTPGVSATSSSASSSTSSSVAEVGVGAGGKRPGSVSSTDQERELKEKQRNAEALAELSES
310 320 330 340 350 360
360 370 380 390 400 410
pF1KE1 LRNRAEEWASKPKMVRDTLLTLAGCTPYEVRFKKDHSLLGRVFAFDAVSKPGMDYELKLF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 LRNRAEEWASKPKMVRDTLLTLAGCTPYEVRFKKDHSLLGRVFAFDAVSKPGMDYELKLF
370 380 390 400 410 420
420 430 440 450 460 470
pF1KE1 IEYPTGSGNVYSSASGVAKQMYQDCMKDFGRGLSSGFKYLEYEKKHGSGDWRLLGDLLPE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 IEYPTGSGNVYSSASGVAKQMYQDCMKDFGRGLSSGFKYLEYEKKHGSGDWRLLGDLLPE
430 440 450 460 470 480
480 490 500 510 520 530
pF1KE1 AVRFFKEGVPGADMLPQPYLDASCPMLPTALVSLSRAPSAPPGTGALPPAAPSGRGAAAS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 AVRFFKEGVPGADMLPQPYLDASCPMLPTALVSLSRAPSAPPGTGALPPAAPSGRGAAAS
490 500 510 520 530 540
540 550 560 570 580 590
pF1KE1 LRKRKASPEPPDSAEGALKLGEEQQRQQWMANQSEALKLTMSAGGFAAPGHAAGGPPPPP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 LRKRKASPEPPDSAEGALKLGEEQQRQQWMANQSEALKLTMSAGGFAAPGHAAGGPPPPP
550 560 570 580 590 600
600 610 620 630 640 650
pF1KE1 PPLGPHSNRTTPPESAPQNGPSPMAALMSVADTLGTAHSPKDGSSVHSTTASARRNSSSP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 PPLGPHSNRTTPPESAPQNGPSPMAALMSVADTLGTAHSPKDGSSVHSTTASARRNSSSP
610 620 630 640 650 660
660 670 680 690 700 710
pF1KE1 VSPASVPGQRRLASRNGDLNLQVAPPPPSAHPGMDQVHPQNIPDSPMANSGPLCCTICHE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 VSPASVPGQRRLASRNGDLNLQVAPPPPSAHPGMDQVHPQNIPDSPMANSGPLCCTICHE
670 680 690 700 710 720
720 730 740 750 760 770
pF1KE1 RLEDTHFVQCPSVPSHKFCFPCSRESIKAQGATGEVYCPSGEKCPLVGSNVPWAFMQGEI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 RLEDTHFVQCPSVPSHKFCFPCSRESIKAQGATGEVYCPSGEKCPLVGSNVPWAFMQGEI
730 740 750 760 770 780
780 790
pF1KE1 ATILAGDVKVKKERDP
::::::::::::::::
CCDS98 ATILAGDVKVKKERDP
790
>>CCDS41475.1 IRF2BP2 gene_id:359948|Hs108|chr1 (571 aa)
initn: 1575 init1: 673 opt: 750 Z-score: 367.8 bits: 78.4 E(32554): 3.7e-14
Smith-Waterman score: 1476; 41.4% identity (55.6% similar) in 795 aa overlap (2-793:4-570)
10 20 30 40 50
pF1KE1 MSAAQVSSSRRQSCYLCDLPRMPWAMIWDFSEPVCRGCVNYEGADRIEFVIETARQLK
..: ...::::::::::::::::::::::.:::::::::::::::.:::::::::::
CCDS41 MAAAVAVAAASRRQSCYLCDLPRMPWAMIWDFTEPVCRGCVNYEGADRVEFVIETARQLK
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE1 RAHGCFQDGRSPGPPPPVGVKTVALSAKEAAAAAAAAAAAAAAAQQQQQQQQQQQQQQQQ
:::::: .:::: : .:::.::: .:.. :::
CCDS41 RAHGCFPEGRSP---P-------------GAAASAAAKPPPLSAKDILLQQQ--------
70 80 90
120 130 140 150 160 170
pF1KE1 QQQQQQQLNHVDGSSKPAVLAAPSGLERYGLSAAAAAAAAAAAAVEQRSRFEYPPPPVSL
:::.: . : ::..:::: : :::: . : : : .::
CCDS41 -----QQLGHGGPEAAP---RAPQALERYPL------AAAAERPPRLGSDFGSSRPAASL
100 110 120 130 140
180 190 200 210 220 230
pF1KE1 GSSSHTARLP-NGLGGPNGFPKPTPEEGPPELNRQSPNSSSAAASVASRRGTHGGLVTGL
.. : ::. :::: : : :::::::::: ::: :.
CCDS41 AQPPTPQPPPVNGILVPNGFSK---LEEPPELNRQSPNP---------RRG-HA------
150 160 170 180
240 250 260 270 280 290
pF1KE1 PNPGGGGGPQLTVPPNLLPQTLLNGPASAAVLPPPPPHALGSRGPPTPAPPGAPGGPACL
:::.:.: :.:: :. : : ::: :
CCDS41 ------------VPPTLVP--LMNGSAT------PLPTALG------------------L
190 200
300 310 320 330 340 350
pF1KE1 GGTPGVSATSSSASSSTS-SSVAEVGVGAGGKRPGSVSSTDQERELKEKQRNAEALAELS
:: ..: .. :.....: .:. . .:: :::.::::. ....::.: :
CCDS41 GGRAAASLAAVSGTAAASLGSAQPTDLGAH-KRPASVSSS---AAVEHEQREAAA-----
210 220 230 240 250
360 370 380 390 400 410
pF1KE1 ESLRNRAEEWASKPKMVRDTLLTLAGCTPYEVRFKKDHSLLGRVFAFDAVSKPGMDYELK
.: : :
CCDS41 -------KEKQPPPPAHR------------------------------------------
260
420 430 440 450 460 470
pF1KE1 LFIEYPTGSGNVYSSASGVAKQMYQDCMKDFGRGLSSGFKYLEYEKKHGSGDWRLLGDLL
: .. :.:.:.:. .. : : : : : ::
CCDS41 -------GPADSLSTAAGAAE------LSAEGAGKSRG---------SGEQDWVN----R
270 280 290 300
480 490 500 510 520 530
pF1KE1 PEAVRFFKEGVPGADMLPQPYLDASCPMLPTALVSLSRAPSAPPGTGALPPAAPSGRGAA
:..:: .:..: . . : . . :. ..:
CCDS41 PKTVR-------------------------DTLLALHQHGHSGPFESKFKKE-PALTAVA
310 320 330
540 550 560 570 580 590
pF1KE1 ASLRKRKASPEPPDSAEGALKLGEEQQRQQWMANQSEALKLTMS-AGGFAAPGHAAGGPP
. :::: :::: .. : :.. : : :.....:.::. :. ...:..
CCDS41 RTARKRKPSPEP-EGEVGPPKINGEA--QPWLSTSTEGLKIPMTPTSSFVS---------
340 350 360 370 380
600 610 620 630 640 650
pF1KE1 PPPPPLGPHSNRTTPPESAPQNGPSPMAALMSVADTLGTAHSPKDGSSVHSTTASARRNS
:::: .::::::::::.: ::: ::::::. :::. : .:. ::...::::: ::::
CCDS41 PPPPTASPHSNRTTPPEAA-QNGQSPMAALILVADNAGGSHASKDANQVHSTT---RRNS
390 400 410 420 430
660 670 680 690 700 710
pF1KE1 SSPVSPASVPGQRRLASRNGDLNLQVAPPPPSAHPGMDQVHPQNIPDSPMANSGPLCCTI
.:: ::.:. .::::. : .:. . :.. ::: ..::: .:.:.:::::.
CCDS41 NSPPSPSSM-NQRRLGPR------EVGGQGAGNTGGLEPVHPASLPDSSLATSAPLCCTL
440 450 460 470 480 490
720 730 740 750 760 770
pF1KE1 CHERLEDTHFVQCPSVPSHKFCFPCSRESIKAQGATGEVYCPSGEKCPLVGSNVPWAFMQ
:::::::::::::::::::::::::::.::: :::.::::::::::::::::::::::::
CCDS41 CHERLEDTHFVQCPSVPSHKFCFPCSRQSIKQQGASGEVYCPSGEKCPLVGSNVPWAFMQ
500 510 520 530 540 550
780 790
pF1KE1 GEIATILAGDVKVKKERDP
::::::::::::::::::
CCDS41 GEIATILAGDVKVKKERDS
560 570
>>CCDS1602.1 IRF2BP2 gene_id:359948|Hs108|chr1 (587 aa)
initn: 1581 init1: 673 opt: 750 Z-score: 367.6 bits: 78.4 E(32554): 3.7e-14
Smith-Waterman score: 1500; 41.7% identity (56.2% similar) in 797 aa overlap (2-793:4-586)
10 20 30 40 50
pF1KE1 MSAAQVSSSRRQSCYLCDLPRMPWAMIWDFSEPVCRGCVNYEGADRIEFVIETARQLK
..: ...::::::::::::::::::::::.:::::::::::::::.:::::::::::
CCDS16 MAAAVAVAAASRRQSCYLCDLPRMPWAMIWDFTEPVCRGCVNYEGADRVEFVIETARQLK
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE1 RAHGCFQDGRSPGPPPPVGVKTVALSAKEAAAAAAAAAAAAAAAQQQQQQQQQQQQQQQQ
:::::: .:::: : .:::.::: .:.. :::
CCDS16 RAHGCFPEGRSP---P-------------GAAASAAAKPPPLSAKDILLQQQ--------
70 80 90
120 130 140 150 160 170
pF1KE1 QQQQQQQLNHVDGSSKPAVLAAPSGLERYGLSAAAAAAAAAAAAVEQRSRFEYPPPPVSL
:::.: . : ::..:::: : :::: . : : : .::
CCDS16 -----QQLGHGGPEAAP---RAPQALERYPL------AAAAERPPRLGSDFGSSRPAASL
100 110 120 130 140
180 190 200 210 220 230
pF1KE1 GSSSHTARLP-NGLGGPNGFPKPTPEEGPPELNRQSPNSSSAAASVASRRGTHGGLVTGL
.. : ::. :::: : : :::::::::: ::: :.
CCDS16 AQPPTPQPPPVNGILVPNGFSK---LEEPPELNRQSPNP---------RRG-HA------
150 160 170 180
240 250 260 270 280 290
pF1KE1 PNPGGGGGPQLTVPPNLLPQTLLNGPASAAVLPPPPPHALGSRGPPTPAPPGAPGGPACL
:::.:.: :.:: :. : : ::: :
CCDS16 ------------VPPTLVP--LMNGSAT------PLPTALG------------------L
190 200
300 310 320 330 340 350
pF1KE1 GGTPGVSATSSSASSSTS-SSVAEVGVGAGGKRPGSVSSTDQERELKEKQRNAEALAELS
:: ..: .. :.....: .:. . .:: :::.::::. ....::.: :
CCDS16 GGRAAASLAAVSGTAAASLGSAQPTDLGAH-KRPASVSSS---AAVEHEQREAAA-----
210 220 230 240 250
360 370 380 390 400 410
pF1KE1 ESLRNRAEEWASKPKMVRDTLLTLAGCTPYEVRFKKDHSLLGRVFAFDAVSKPGMDYELK
.: : :
CCDS16 -------KEKQPPPPAHR------------------------------------------
260
420 430 440 450 460 470
pF1KE1 LFIEYPTGSGNVYSSASGVAKQMYQDCMKDFGRGLSSGFKYLEYEKKHGSGDWRLLGDLL
: .. :.:.:.:. .. : : : : : ::
CCDS16 -------GPADSLSTAAGAAE------LSAEGAGKSRG---------SGEQDWVNR----
270 280 290 300
480 490 500 510 520 530
pF1KE1 PEAVRFFKEGVPGADMLPQPYLDASCPMLPTALVSLSRAPSAPPGT--GALPPAAPSGRG
:..:: .: : :. .... :. : : .: ....
CCDS16 PKTVR--------DTLLALHQHGHSGPFES----KFKKEPALTAGRLLGFEANGANGSKA
310 320 330 340
540 550 560 570 580 590
pF1KE1 AAASLRKRKASPEPPDSAEGALKLGEEQQRQQWMANQSEALKLTMS-AGGFAAPGHAAGG
.: . :::: :::: .. : :.. : : :.....:.::. :. ...:..
CCDS16 VARTARKRKPSPEP-EGEVGPPKINGEAQ--PWLSTSTEGLKIPMTPTSSFVS-------
350 360 370 380 390
600 610 620 630 640 650
pF1KE1 PPPPPPPLGPHSNRTTPPESAPQNGPSPMAALMSVADTLGTAHSPKDGSSVHSTTASARR
:::: .::::::::::.: ::: ::::::. :::. : .:. ::...::::: ::
CCDS16 --PPPPTASPHSNRTTPPEAA-QNGQSPMAALILVADNAGGSHASKDANQVHSTT---RR
400 410 420 430 440 450
660 670 680 690 700 710
pF1KE1 NSSSPVSPASVPGQRRLASRNGDLNLQVAPPPPSAHPGMDQVHPQNIPDSPMANSGPLCC
::.:: ::.:. .::::. : .:. . :.. ::: ..::: .:.:.::::
CCDS16 NSNSPPSPSSM-NQRRLGPR------EVGGQGAGNTGGLEPVHPASLPDSSLATSAPLCC
460 470 480 490 500
720 730 740 750 760 770
pF1KE1 TICHERLEDTHFVQCPSVPSHKFCFPCSRESIKAQGATGEVYCPSGEKCPLVGSNVPWAF
:.:::::::::::::::::::::::::::.::: :::.::::::::::::::::::::::
CCDS16 TLCHERLEDTHFVQCPSVPSHKFCFPCSRQSIKQQGASGEVYCPSGEKCPLVGSNVPWAF
510 520 530 540 550 560
780 790
pF1KE1 MQGEIATILAGDVKVKKERDP
::::::::::::::::::::
CCDS16 MQGEIATILAGDVKVKKERDS
570 580
794 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 16:20:15 2016 done: Mon Nov 7 16:20:16 2016
Total Scan time: 4.750 Total Display time: 0.040
Function used was FASTA [36.3.4 Apr, 2011]