FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1282, 794 aa 1>>>pF1KE1282 794 - 794 aa - 794 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 12.3793+/-0.00103; mu= -8.5854+/- 0.062 mean_var=471.0500+/-98.159, 0's: 0 Z-trim(117.6): 16 B-trim: 485 in 1/53 Lambda= 0.059094 statistics sampled from 18405 (18417) to 18405 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.812), E-opt: 0.2 (0.566), width: 16 Scan time: 4.750 The best scores are: opt bits E(32554) CCDS9854.1 IRF2BPL gene_id:64207|Hs108|chr14 ( 796) 5436 478.0 2.6e-134 CCDS41475.1 IRF2BP2 gene_id:359948|Hs108|chr1 ( 571) 750 78.4 3.7e-14 CCDS1602.1 IRF2BP2 gene_id:359948|Hs108|chr1 ( 587) 750 78.4 3.7e-14 >>CCDS9854.1 IRF2BPL gene_id:64207|Hs108|chr14 (796 aa) initn: 4833 init1: 4833 opt: 5436 Z-score: 2524.9 bits: 478.0 E(32554): 2.6e-134 Smith-Waterman score: 5436; 99.7% identity (99.7% similar) in 796 aa overlap (1-794:1-796) 10 20 30 40 50 60 pF1KE1 MSAAQVSSSRRQSCYLCDLPRMPWAMIWDFSEPVCRGCVNYEGADRIEFVIETARQLKRA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS98 MSAAQVSSSRRQSCYLCDLPRMPWAMIWDFSEPVCRGCVNYEGADRIEFVIETARQLKRA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 HGCFQDGRSPGPPPPVGVKTVALSAKEAAAAAAAAAAAAAAAQQQQQQQQQQQQQQQQQQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS98 HGCFQDGRSPGPPPPVGVKTVALSAKEAAAAAAAAAAAAAAAQQQQQQQQQQQQQQQQQQ 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 QQQQQ--LNHVDGSSKPAVLAAPSGLERYGLSAAAAAAAAAAAAVEQRSRFEYPPPPVSL ::::: ::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS98 QQQQQQQLNHVDGSSKPAVLAAPSGLERYGLSAAAAAAAAAAAAVEQRSRFEYPPPPVSL 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE1 GSSSHTARLPNGLGGPNGFPKPTPEEGPPELNRQSPNSSSAAASVASRRGTHGGLVTGLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS98 GSSSHTARLPNGLGGPNGFPKPTPEEGPPELNRQSPNSSSAAASVASRRGTHGGLVTGLP 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE1 NPGGGGGPQLTVPPNLLPQTLLNGPASAAVLPPPPPHALGSRGPPTPAPPGAPGGPACLG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS98 NPGGGGGPQLTVPPNLLPQTLLNGPASAAVLPPPPPHALGSRGPPTPAPPGAPGGPACLG 250 260 270 280 290 300 300 310 320 330 340 350 pF1KE1 GTPGVSATSSSASSSTSSSVAEVGVGAGGKRPGSVSSTDQERELKEKQRNAEALAELSES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS98 GTPGVSATSSSASSSTSSSVAEVGVGAGGKRPGSVSSTDQERELKEKQRNAEALAELSES 310 320 330 340 350 360 360 370 380 390 400 410 pF1KE1 LRNRAEEWASKPKMVRDTLLTLAGCTPYEVRFKKDHSLLGRVFAFDAVSKPGMDYELKLF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS98 LRNRAEEWASKPKMVRDTLLTLAGCTPYEVRFKKDHSLLGRVFAFDAVSKPGMDYELKLF 370 380 390 400 410 420 420 430 440 450 460 470 pF1KE1 IEYPTGSGNVYSSASGVAKQMYQDCMKDFGRGLSSGFKYLEYEKKHGSGDWRLLGDLLPE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS98 IEYPTGSGNVYSSASGVAKQMYQDCMKDFGRGLSSGFKYLEYEKKHGSGDWRLLGDLLPE 430 440 450 460 470 480 480 490 500 510 520 530 pF1KE1 AVRFFKEGVPGADMLPQPYLDASCPMLPTALVSLSRAPSAPPGTGALPPAAPSGRGAAAS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS98 AVRFFKEGVPGADMLPQPYLDASCPMLPTALVSLSRAPSAPPGTGALPPAAPSGRGAAAS 490 500 510 520 530 540 540 550 560 570 580 590 pF1KE1 LRKRKASPEPPDSAEGALKLGEEQQRQQWMANQSEALKLTMSAGGFAAPGHAAGGPPPPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS98 LRKRKASPEPPDSAEGALKLGEEQQRQQWMANQSEALKLTMSAGGFAAPGHAAGGPPPPP 550 560 570 580 590 600 600 610 620 630 640 650 pF1KE1 PPLGPHSNRTTPPESAPQNGPSPMAALMSVADTLGTAHSPKDGSSVHSTTASARRNSSSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS98 PPLGPHSNRTTPPESAPQNGPSPMAALMSVADTLGTAHSPKDGSSVHSTTASARRNSSSP 610 620 630 640 650 660 660 670 680 690 700 710 pF1KE1 VSPASVPGQRRLASRNGDLNLQVAPPPPSAHPGMDQVHPQNIPDSPMANSGPLCCTICHE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS98 VSPASVPGQRRLASRNGDLNLQVAPPPPSAHPGMDQVHPQNIPDSPMANSGPLCCTICHE 670 680 690 700 710 720 720 730 740 750 760 770 pF1KE1 RLEDTHFVQCPSVPSHKFCFPCSRESIKAQGATGEVYCPSGEKCPLVGSNVPWAFMQGEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS98 RLEDTHFVQCPSVPSHKFCFPCSRESIKAQGATGEVYCPSGEKCPLVGSNVPWAFMQGEI 730 740 750 760 770 780 780 790 pF1KE1 ATILAGDVKVKKERDP :::::::::::::::: CCDS98 ATILAGDVKVKKERDP 790 >>CCDS41475.1 IRF2BP2 gene_id:359948|Hs108|chr1 (571 aa) initn: 1575 init1: 673 opt: 750 Z-score: 367.8 bits: 78.4 E(32554): 3.7e-14 Smith-Waterman score: 1476; 41.4% identity (55.6% similar) in 795 aa overlap (2-793:4-570) 10 20 30 40 50 pF1KE1 MSAAQVSSSRRQSCYLCDLPRMPWAMIWDFSEPVCRGCVNYEGADRIEFVIETARQLK ..: ...::::::::::::::::::::::.:::::::::::::::.::::::::::: CCDS41 MAAAVAVAAASRRQSCYLCDLPRMPWAMIWDFTEPVCRGCVNYEGADRVEFVIETARQLK 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 RAHGCFQDGRSPGPPPPVGVKTVALSAKEAAAAAAAAAAAAAAAQQQQQQQQQQQQQQQQ :::::: .:::: : .:::.::: .:.. ::: CCDS41 RAHGCFPEGRSP---P-------------GAAASAAAKPPPLSAKDILLQQQ-------- 70 80 90 120 130 140 150 160 170 pF1KE1 QQQQQQQLNHVDGSSKPAVLAAPSGLERYGLSAAAAAAAAAAAAVEQRSRFEYPPPPVSL :::.: . : ::..:::: : :::: . : : : .:: CCDS41 -----QQLGHGGPEAAP---RAPQALERYPL------AAAAERPPRLGSDFGSSRPAASL 100 110 120 130 140 180 190 200 210 220 230 pF1KE1 GSSSHTARLP-NGLGGPNGFPKPTPEEGPPELNRQSPNSSSAAASVASRRGTHGGLVTGL .. : ::. :::: : : :::::::::: ::: :. CCDS41 AQPPTPQPPPVNGILVPNGFSK---LEEPPELNRQSPNP---------RRG-HA------ 150 160 170 180 240 250 260 270 280 290 pF1KE1 PNPGGGGGPQLTVPPNLLPQTLLNGPASAAVLPPPPPHALGSRGPPTPAPPGAPGGPACL :::.:.: :.:: :. : : ::: : CCDS41 ------------VPPTLVP--LMNGSAT------PLPTALG------------------L 190 200 300 310 320 330 340 350 pF1KE1 GGTPGVSATSSSASSSTS-SSVAEVGVGAGGKRPGSVSSTDQERELKEKQRNAEALAELS :: ..: .. :.....: .:. . .:: :::.::::. ....::.: : CCDS41 GGRAAASLAAVSGTAAASLGSAQPTDLGAH-KRPASVSSS---AAVEHEQREAAA----- 210 220 230 240 250 360 370 380 390 400 410 pF1KE1 ESLRNRAEEWASKPKMVRDTLLTLAGCTPYEVRFKKDHSLLGRVFAFDAVSKPGMDYELK .: : : CCDS41 -------KEKQPPPPAHR------------------------------------------ 260 420 430 440 450 460 470 pF1KE1 LFIEYPTGSGNVYSSASGVAKQMYQDCMKDFGRGLSSGFKYLEYEKKHGSGDWRLLGDLL : .. :.:.:.:. .. : : : : : :: CCDS41 -------GPADSLSTAAGAAE------LSAEGAGKSRG---------SGEQDWVN----R 270 280 290 300 480 490 500 510 520 530 pF1KE1 PEAVRFFKEGVPGADMLPQPYLDASCPMLPTALVSLSRAPSAPPGTGALPPAAPSGRGAA :..:: .:..: . . : . . :. ..: CCDS41 PKTVR-------------------------DTLLALHQHGHSGPFESKFKKE-PALTAVA 310 320 330 540 550 560 570 580 590 pF1KE1 ASLRKRKASPEPPDSAEGALKLGEEQQRQQWMANQSEALKLTMS-AGGFAAPGHAAGGPP . :::: :::: .. : :.. : : :.....:.::. :. ...:.. CCDS41 RTARKRKPSPEP-EGEVGPPKINGEA--QPWLSTSTEGLKIPMTPTSSFVS--------- 340 350 360 370 380 600 610 620 630 640 650 pF1KE1 PPPPPLGPHSNRTTPPESAPQNGPSPMAALMSVADTLGTAHSPKDGSSVHSTTASARRNS :::: .::::::::::.: ::: ::::::. :::. : .:. ::...::::: :::: CCDS41 PPPPTASPHSNRTTPPEAA-QNGQSPMAALILVADNAGGSHASKDANQVHSTT---RRNS 390 400 410 420 430 660 670 680 690 700 710 pF1KE1 SSPVSPASVPGQRRLASRNGDLNLQVAPPPPSAHPGMDQVHPQNIPDSPMANSGPLCCTI .:: ::.:. .::::. : .:. . :.. ::: ..::: .:.:.:::::. CCDS41 NSPPSPSSM-NQRRLGPR------EVGGQGAGNTGGLEPVHPASLPDSSLATSAPLCCTL 440 450 460 470 480 490 720 730 740 750 760 770 pF1KE1 CHERLEDTHFVQCPSVPSHKFCFPCSRESIKAQGATGEVYCPSGEKCPLVGSNVPWAFMQ :::::::::::::::::::::::::::.::: :::.:::::::::::::::::::::::: CCDS41 CHERLEDTHFVQCPSVPSHKFCFPCSRQSIKQQGASGEVYCPSGEKCPLVGSNVPWAFMQ 500 510 520 530 540 550 780 790 pF1KE1 GEIATILAGDVKVKKERDP :::::::::::::::::: CCDS41 GEIATILAGDVKVKKERDS 560 570 >>CCDS1602.1 IRF2BP2 gene_id:359948|Hs108|chr1 (587 aa) initn: 1581 init1: 673 opt: 750 Z-score: 367.6 bits: 78.4 E(32554): 3.7e-14 Smith-Waterman score: 1500; 41.7% identity (56.2% similar) in 797 aa overlap (2-793:4-586) 10 20 30 40 50 pF1KE1 MSAAQVSSSRRQSCYLCDLPRMPWAMIWDFSEPVCRGCVNYEGADRIEFVIETARQLK ..: ...::::::::::::::::::::::.:::::::::::::::.::::::::::: CCDS16 MAAAVAVAAASRRQSCYLCDLPRMPWAMIWDFTEPVCRGCVNYEGADRVEFVIETARQLK 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 RAHGCFQDGRSPGPPPPVGVKTVALSAKEAAAAAAAAAAAAAAAQQQQQQQQQQQQQQQQ :::::: .:::: : .:::.::: .:.. ::: CCDS16 RAHGCFPEGRSP---P-------------GAAASAAAKPPPLSAKDILLQQQ-------- 70 80 90 120 130 140 150 160 170 pF1KE1 QQQQQQQLNHVDGSSKPAVLAAPSGLERYGLSAAAAAAAAAAAAVEQRSRFEYPPPPVSL :::.: . : ::..:::: : :::: . : : : .:: CCDS16 -----QQLGHGGPEAAP---RAPQALERYPL------AAAAERPPRLGSDFGSSRPAASL 100 110 120 130 140 180 190 200 210 220 230 pF1KE1 GSSSHTARLP-NGLGGPNGFPKPTPEEGPPELNRQSPNSSSAAASVASRRGTHGGLVTGL .. : ::. :::: : : :::::::::: ::: :. CCDS16 AQPPTPQPPPVNGILVPNGFSK---LEEPPELNRQSPNP---------RRG-HA------ 150 160 170 180 240 250 260 270 280 290 pF1KE1 PNPGGGGGPQLTVPPNLLPQTLLNGPASAAVLPPPPPHALGSRGPPTPAPPGAPGGPACL :::.:.: :.:: :. : : ::: : CCDS16 ------------VPPTLVP--LMNGSAT------PLPTALG------------------L 190 200 300 310 320 330 340 350 pF1KE1 GGTPGVSATSSSASSSTS-SSVAEVGVGAGGKRPGSVSSTDQERELKEKQRNAEALAELS :: ..: .. :.....: .:. . .:: :::.::::. ....::.: : CCDS16 GGRAAASLAAVSGTAAASLGSAQPTDLGAH-KRPASVSSS---AAVEHEQREAAA----- 210 220 230 240 250 360 370 380 390 400 410 pF1KE1 ESLRNRAEEWASKPKMVRDTLLTLAGCTPYEVRFKKDHSLLGRVFAFDAVSKPGMDYELK .: : : CCDS16 -------KEKQPPPPAHR------------------------------------------ 260 420 430 440 450 460 470 pF1KE1 LFIEYPTGSGNVYSSASGVAKQMYQDCMKDFGRGLSSGFKYLEYEKKHGSGDWRLLGDLL : .. :.:.:.:. .. : : : : : :: CCDS16 -------GPADSLSTAAGAAE------LSAEGAGKSRG---------SGEQDWVNR---- 270 280 290 300 480 490 500 510 520 530 pF1KE1 PEAVRFFKEGVPGADMLPQPYLDASCPMLPTALVSLSRAPSAPPGT--GALPPAAPSGRG :..:: .: : :. .... :. : : .: .... CCDS16 PKTVR--------DTLLALHQHGHSGPFES----KFKKEPALTAGRLLGFEANGANGSKA 310 320 330 340 540 550 560 570 580 590 pF1KE1 AAASLRKRKASPEPPDSAEGALKLGEEQQRQQWMANQSEALKLTMS-AGGFAAPGHAAGG .: . :::: :::: .. : :.. : : :.....:.::. :. ...:.. CCDS16 VARTARKRKPSPEP-EGEVGPPKINGEAQ--PWLSTSTEGLKIPMTPTSSFVS------- 350 360 370 380 390 600 610 620 630 640 650 pF1KE1 PPPPPPPLGPHSNRTTPPESAPQNGPSPMAALMSVADTLGTAHSPKDGSSVHSTTASARR :::: .::::::::::.: ::: ::::::. :::. : .:. ::...::::: :: CCDS16 --PPPPTASPHSNRTTPPEAA-QNGQSPMAALILVADNAGGSHASKDANQVHSTT---RR 400 410 420 430 440 450 660 670 680 690 700 710 pF1KE1 NSSSPVSPASVPGQRRLASRNGDLNLQVAPPPPSAHPGMDQVHPQNIPDSPMANSGPLCC ::.:: ::.:. .::::. : .:. . :.. ::: ..::: .:.:.:::: CCDS16 NSNSPPSPSSM-NQRRLGPR------EVGGQGAGNTGGLEPVHPASLPDSSLATSAPLCC 460 470 480 490 500 720 730 740 750 760 770 pF1KE1 TICHERLEDTHFVQCPSVPSHKFCFPCSRESIKAQGATGEVYCPSGEKCPLVGSNVPWAF :.:::::::::::::::::::::::::::.::: :::.:::::::::::::::::::::: CCDS16 TLCHERLEDTHFVQCPSVPSHKFCFPCSRQSIKQQGASGEVYCPSGEKCPLVGSNVPWAF 510 520 530 540 550 560 780 790 pF1KE1 MQGEIATILAGDVKVKKERDP :::::::::::::::::::: CCDS16 MQGEIATILAGDVKVKKERDS 570 580 794 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 16:20:15 2016 done: Mon Nov 7 16:20:16 2016 Total Scan time: 4.750 Total Display time: 0.040 Function used was FASTA [36.3.4 Apr, 2011]