FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1872, 393 aa 1>>>pF1KE1872 393 - 393 aa - 393 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.8913+/-0.000693; mu= 11.2054+/- 0.042 mean_var=120.0216+/-23.540, 0's: 0 Z-trim(114.3): 19 B-trim: 0 in 0/51 Lambda= 0.117070 statistics sampled from 14829 (14847) to 14829 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.793), E-opt: 0.2 (0.456), width: 16 Scan time: 3.460 The best scores are: opt bits E(32554) CCDS9615.1 IRF9 gene_id:10379|Hs108|chr14 ( 393) 2687 464.2 9.3e-131 CCDS10956.1 IRF8 gene_id:3394|Hs108|chr16 ( 426) 679 125.1 1.2e-28 CCDS4469.1 IRF4 gene_id:3662|Hs108|chr6 ( 451) 529 99.7 5.4e-21 CCDS1492.1 IRF6 gene_id:3664|Hs108|chr1 ( 467) 403 78.5 1.4e-14 CCDS5808.1 IRF5 gene_id:3663|Hs108|chr7 ( 498) 369 72.7 8e-13 CCDS43645.1 IRF5 gene_id:3663|Hs108|chr7 ( 514) 368 72.6 9.3e-13 CCDS56512.1 IRF5 gene_id:3663|Hs108|chr7 ( 412) 362 71.5 1.6e-12 CCDS4155.1 IRF1 gene_id:3659|Hs108|chr5 ( 325) 351 69.6 4.7e-12 CCDS3835.1 IRF2 gene_id:3660|Hs108|chr4 ( 349) 344 68.4 1.1e-11 CCDS12775.1 IRF3 gene_id:3661|Hs108|chr19 ( 427) 339 67.6 2.4e-11 >>CCDS9615.1 IRF9 gene_id:10379|Hs108|chr14 (393 aa) initn: 2687 init1: 2687 opt: 2687 Z-score: 2461.0 bits: 464.2 E(32554): 9.3e-131 Smith-Waterman score: 2687; 99.7% identity (100.0% similar) in 393 aa overlap (1-393:1-393) 10 20 30 40 50 60 pF1KE1 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQDAAFFK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS96 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQDAAFFK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 AWAIFKGKYKEGDTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQLLPPGIV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS96 AWAIFKGKYKEGDTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQLLPPGIV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 SGQPGTQKVPSKRQQSSVSSERKEEEDAMQNCTLSPSVLQDSLNNEEEGASGGAVHSDIG ::::::::::::::.::::::::::::::::::::::::::::::::::::::::::::: CCDS96 SGQPGTQKVPSKRQHSSVSSERKEEEDAMQNCTLSPSVLQDSLNNEEEGASGGAVHSDIG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 SSSSSSSPEPQEVTDTTEAPFQGDQRSLEFLLPPEPDYSLLLTFIYNGRVVGEAQVQSLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS96 SSSSSSSPEPQEVTDTTEAPFQGDQRSLEFLLPPEPDYSLLLTFIYNGRVVGEAQVQSLD 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 CRLVAEPSGSESSMEQVLFPKPGPLEPTQRLLSQLERGILVASNPRGLFVQRLCPIPISW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS96 CRLVAEPSGSESSMEQVLFPKPGPLEPTQRLLSQLERGILVASNPRGLFVQRLCPIPISW 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 NAPQAPPGPGPHLLPSNECVELFRTAYFCRDLVRYFQGLGPPPKFQVTLNFWEESHGSSH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS96 NAPQAPPGPGPHLLPSNECVELFRTAYFCRDLVRYFQGLGPPPKFQVTLNFWEESHGSSH 310 320 330 340 350 360 370 380 390 pF1KE1 TPQNLITVKMEQAFARYLLEQTPEQQAAILSLV ::::::::::::::::::::::::::::::::: CCDS96 TPQNLITVKMEQAFARYLLEQTPEQQAAILSLV 370 380 390 >>CCDS10956.1 IRF8 gene_id:3394|Hs108|chr16 (426 aa) initn: 729 init1: 525 opt: 679 Z-score: 627.6 bits: 125.1 E(32554): 1.2e-28 Smith-Waterman score: 740; 34.3% identity (64.9% similar) in 396 aa overlap (10-388:8-387) 10 20 30 40 50 60 pF1KE1 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQDAAFFK :.::.:..::..:...::. :.. :.:::::::::::::. .. ::..:: CCDS10 MCDRNGGRRLRQWLIEQIDSSMYPGLIWENEEKSMFRIPWKHAGKQDYNQEVDASIFK 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 AWAIFKGKYKEGDTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQLLPPGIV :::.::::.:::: . ::.:::::::::::: .:.:: .:...:..::::::...: CCDS10 AWAVFKGKFKEGDKAEPATWKTRLRCALNKSPDFEEVTDRSQLDISEPYKVYRIVPEEEQ 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE1 SGQPGTQKVPSKRQQSSVSSERKEEEDAMQNCTLSPSVLQDSLNNEEEGASGGAVHSDIG . . :. . . . . :.: .. ... ::: .: .. ... : CCDS10 KCKLGVATAGCVNEVTEMECGRSEIDELIKE----PSV-DDYMGMIKRSPSPPE------ 120 130 140 150 160 190 200 210 220 230 pF1KE1 SSSSSSSPEPQEVTDTTEAPFQGDQRSLEFLLPPEPDYS-LLLTFIYNGRVVGEAQVQSL . :. :. .: .:. . . . .: ....: :.:..::.: . CCDS10 ACRSQLLPDWWAQQPSTGVPLVTGYTTYD---AHHSAFSQMVISFYYGGKLVGQATTTCP 170 180 190 200 210 220 240 250 260 270 280 pF1KE1 D-CRL-VAEPS--GSE----SSMEQVLFPKPGPLEPTQR-------LLSQLERGILVASN . ::: ...:. :.. ..: : :: :. :..: :...::::.:. :. CCDS10 EGCRLSLSQPGLPGTKLYGPEGLELVRFP-PADAIPSERQRQVTRKLFGHLERGVLLHSS 230 240 250 260 270 280 290 300 310 320 330 340 pF1KE1 PRGLFVQRLCPIPISWNAPQAPPGPG-PHLLPSNECVELFRTAYFCRDLVRYFQGLGPPP .:.::.::: . . . .: : :. : .: :..: :. : :.: ..... : : CCDS10 RQGVFVKRLCQGRV-FCSGNAVVCKGRPNKLERDEVVQVFDTSQFFRELQQFYNSQGRLP 290 300 310 320 330 340 350 360 370 380 390 pF1KE1 KFQVTLNFWEESHGSSHTPQNLITVKMEQAFARYLLEQTPEQQAAILSLV .:.: : :: . ..:: :..:: ..: : :.. .. .: CCDS10 DGRVVLCFGEEFPDMAPLRSKLILVQIEQLYVRQLAEEAGKSCGAGSVMQAPEEPPPDQV 350 360 370 380 390 400 CCDS10 FRMFPDICASHQRSFFRENQQITV 410 420 >>CCDS4469.1 IRF4 gene_id:3662|Hs108|chr6 (451 aa) initn: 741 init1: 328 opt: 529 Z-score: 490.3 bits: 99.7 E(32554): 5.4e-21 Smith-Waterman score: 734; 34.9% identity (61.1% similar) in 398 aa overlap (11-378:23-413) 10 20 30 40 pF1KE1 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQ :::.:...:..::..::. :.. :..::::::::::: CCDS44 MNLEGGGRGGEFGMSAVSCGNGKLRQWLIDQIDSGKYPGLVWENEEKSIFRIPWKHAGKQ 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE1 DFREDQDAAFFKAWAIFKGKYKEG-DTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAE :. ...:::.:::::.::::..:: : : .::::::::::::..:.:. ::...:... CCDS44 DYNREEDAALFKAWALFKGKFREGIDKPDPPTWKTRLRCALNKSNDFEELVERSQLDISD 70 80 90 100 110 120 110 120 130 140 150 pF1KE1 PYKVYQLLPPGIVSGQPGTQKVPSKRQQSSVSSERKEEED-----AMQ--NCTLSP---- :::::...: : .. :.... . : :.: :.: : . : CCDS44 PYKVYRIVPEG---AKKGAKQLTLEDPQMSMSHPYTMTTPYPSLPAQQVHNYMMPPLDRS 130 140 150 160 170 160 170 180 190 200 pF1KE1 --SVLQDSLNNE----------EEGA--SGGAVHSDIGSSSSSSSPEPQEVTDTTEAPFQ . . :. . : .: .: : .. ... . : : ... .: . CCDS44 WRDYVPDQPHPEIPYQCPMTFGPRGHHWQGPACENGCQVTGTFYACAPPE-SQAPGVPTE 180 190 200 210 220 230 210 220 230 240 250 260 pF1KE1 GDQRSLEFLLPPEPDYSLLLTFIYNGRVVGEAQVQSLD-CRLVAEPSGSESSMEQVLFPK . :: : : : : . . : .: : ..: . ::. . . :...::::: CCDS44 PSIRSAEAL--AFSDCRLHICLYYREILVKELTTSSPEGCRISHGHTYDASNLDQVLFPY 240 250 260 270 280 290 270 280 290 300 310 pF1KE1 P---GPLEPTQRLLSQLERGILVASNPRGLFVQRLCPIPISWNAPQAPPGPGPHLLPSNE : : . ..:::.::::... : ::...::: : :..: : . :. : .. CCDS44 PEDNGQRKNIEKLLSHLERGVVLWMAPDGLYAKRLCQSRIYWDGPLALCNDRPNKLERDQ 300 310 320 330 340 350 320 330 340 350 360 370 pF1KE1 CVELFRTAYFCRDLVRYFQGLGPPPKFQVTLNFWEESHGSSHTPQNLITVKMEQAFARYL .:: : : .: . . :.::::: : :: . . ..:::...: .:: : CCDS44 TCKLFDTQQFLSELQAFAHHGRSLPRFQVTLCFGEE-FPDPQRQRKLITAHVEPLLARQL 360 370 380 390 400 410 380 390 pF1KE1 LEQTPEQQAAILSLV CCDS44 YYFAQQNSGHFLRGYDLPEHISNPEDYHRSIRHSSIQE 420 430 440 450 >>CCDS1492.1 IRF6 gene_id:3664|Hs108|chr1 (467 aa) initn: 490 init1: 222 opt: 403 Z-score: 375.1 bits: 78.5 E(32554): 1.4e-14 Smith-Waterman score: 595; 31.2% identity (58.6% similar) in 401 aa overlap (11-380:9-404) 10 20 30 40 50 60 pF1KE1 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQDAAFFK .:. :.: ::.:: .::. : . :.:::::: ... ..... ..:: CCDS14 MALHPRRVRLKPWLVAQVDSGLYPGLIWLHRDSKRFQIPWKHATRHSPQQEEENTIFK 10 20 30 40 50 70 80 90 100 110 pF1KE1 AWAIFKGKYKEG-DTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQLL---- :::. :::.:: : :: ::..:::::::: ::. . . . .: :.::. CCDS14 AWAVETGKYQEGVDDPDPAKWKAQLRCALNKSREFNLMYDGTKEVPMNPVKIYQVCDIPQ 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 PPGIVSGQPGTQKVPSKRQQSSVSSERKEEE-DAMQNCTLSPSVLQDS---LN-NEEEGA : : . . .: ..: .....:. : .:.: : :. . : .::. :: : : CCDS14 PQGSIINPGSTGSAPWDEKDNDVDEEDEEDELDQSQHHV--P--IQDTFPFLNINGSPMA 120 130 140 150 160 170 180 190 200 210 220 pF1KE1 SGGAVHSDIGSSSSSS---SPEPQEVTDTTEAPFQGDQRSLEFLLPPEPDYSLLLTFIYN ... . ..:. : . . :: :. .. .::.: : :. . : .: . : : CCDS14 PASVGNCSVGNCSPEAVWPKTEPLEM-EVPQAPIQPFYSSPELWISSLPMTDLDIKFQYR 180 190 200 210 220 230 230 240 250 260 270 pF1KE1 GRVVGEAQVQS--LDCRLV-----AEPSGSE----SSMEQVLFPKPGPLEP------TQR :. :.... : ::: :. : :.::: :: : . :.. CCDS14 GKEYGQTMTVSNPQGCRLFYGDLGPMPDQEELFGPVSLEQVKFPGPEHITNEKQKLFTSK 240 250 260 270 280 290 280 290 300 310 320 330 pF1KE1 LLSQLERGILVASNPRGLFVQRLCPIPISWNAPQAPPGPGPHLLPSNECVELFRTAYFCR ::. ..::... . ..... ::: . :..: :: .:.:. .. :.:: : CCDS14 LLDVMDRGLILEVSGHAIYAIRLCQCKVYWSGPCAPSLVAPNLIERQKKVKLFCLETFLS 300 310 320 330 340 350 340 350 360 370 380 pF1KE1 DLVRYFQG-LGPPPKFQVTLNFWEESHGSSHTPQNLITVKMEQAFARYLLEQTPEQQAAI ::. . .: . : :.. : : :: .. ..:: :.. . ::.. : CCDS14 DLIAHQKGQIEKQPPFEIYLCFGEEWPDGKPLERKLILVQVIPVVARMIYEMFSGDFTRS 360 370 380 390 400 410 390 pF1KE1 LSLV CCDS14 FDSGSVRLQISTPDIKDNIVAQLKQLYRILQTQESWQPMQPTPSMQLPPALPPQ 420 430 440 450 460 >>CCDS5808.1 IRF5 gene_id:3663|Hs108|chr7 (498 aa) initn: 474 init1: 218 opt: 369 Z-score: 343.6 bits: 72.7 E(32554): 8e-13 Smith-Waterman score: 516; 29.1% identity (53.5% similar) in 413 aa overlap (11-380:16-428) 10 20 30 40 50 pF1KE1 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQD .:. :.: ::.: :.::. : . : .: :::.:: .. .: : CCDS58 MNQSIPVAPTPPRRVRLKPWLVAQVNSCQYPGLQWVNGEKKLFCIPWRHATRHGPSQDGD 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 AAFFKAWAIFKGKYKEG-DTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQL ..::::: ::: :: : . :: ::. :::::::: .:. . . : .:::.:.. CCDS58 NTIFKAWAKETGKYTEGVDEADPAKWKANLRCALNKSRDFRLIYDGPRDMPPQPYKIYEV 70 80 90 100 110 120 120 130 140 150 160 pF1KE1 LP--PGIVSGQP--------GTQKVPSKRQQSSVSSERKEEE----DAMQNCTLSPSVLQ :. ...:: : .. .. : . : :. ..: :: : .:: CCDS58 CSNGPAPTDSQPPEDYSFGAGEEEEEEEELQRMLPSLSLTEDVKWPPTLQPPTLRPPTLQ 130 140 150 160 170 180 170 180 190 200 210 pF1KE1 D-SLNNEEEGASGGAVHSDIGSSSSSSSPEPQEVTDTTEA-------PFQGDQRSLEFLL .:. . . : .. .. . . .... : : :.: ..:. CCDS58 PPTLQPPVVLGPPAPDPSPLAPPPGNPAGFRELLSEVLEPGPLPASLPPAGEQLLPDLLI 190 200 210 220 230 240 220 230 240 250 260 pF1KE1 PPE--PDYSLLLTFIYNGRVVGEAQVQS-LDCRLVA---EPSGSES------SMEQVLFP :. : .: . : : :: ... ::: : . . :.::: :: CCDS58 SPHMLPLTDLEIKFQYRGRPPRALTISNPHGCRLFYSQLEATQEQVELFGPISLEQVRFP 250 260 270 280 290 300 270 280 290 300 310 pF1KE1 KPGPLEP------TQRLLSQLERGILVASNPRGLFVQRLCPIPISWNAPQAPPGPG-PHL .: . :..::. :.::... . . :.. ::: . :..: : . :. CCDS58 SPEDIPSDKQRFYTNQLLDVLDRGLILQLQGQDLYAIRLCQCKVFWSGPCASAHDSCPNP 310 320 330 340 350 360 320 330 340 350 360 370 pF1KE1 LPSNECVELFRTAYFCRDLVRYFQG-LGPPPKFQVTLNFWEESHGSSHTPQNLITVKMEQ . . ..:: .: .:. . .: . :: :.. . : :: . ..::::.. CCDS58 IQREVKTKLFSLEHFLNELILFQKGQTNTPPPFEIFFCFGEEWPDRKPREKKLITVQVVP 370 380 390 400 410 420 380 390 pF1KE1 AFARYLLEQTPEQQAAILSLV . :: ::: CCDS58 VAARLLLEMFSGELSWSADSIRLQISNPDLKDRMVEQFKELHHIWQSQQRLQPVAQAPPG 430 440 450 460 470 480 >>CCDS43645.1 IRF5 gene_id:3663|Hs108|chr7 (514 aa) initn: 483 init1: 218 opt: 368 Z-score: 342.5 bits: 72.6 E(32554): 9.3e-13 Smith-Waterman score: 458; 28.2% identity (52.2% similar) in 404 aa overlap (11-354:16-418) 10 20 30 40 50 pF1KE1 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQD .:. :.: ::.: :.::. : . : .: :::.:: .. .: : CCDS43 MNQSIPVAPTPPRRVRLKPWLVAQVNSCQYPGLQWVNGEKKLFCIPWRHATRHGPSQDGD 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 AAFFKAWAIFKGKYKEG-DTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQL ..::::: ::: :: : . :: ::. :::::::: .:. . . : .:::.:.. CCDS43 NTIFKAWAKETGKYTEGVDEADPAKWKANLRCALNKSRDFRLIYDGPRDMPPQPYKIYEV 70 80 90 100 110 120 120 130 140 pF1KE1 LP--PGIVSGQP--------GTQK---------VPSKRQQSSVSSERKE------EEDAM :. ...:: : .. .:: ..:.: . .::. CCDS43 CSNGPAPTDSQPPEDYSFGAGEEEEEEEELQRMLPSLSLTDAVQSGPHMTPYSLLKEDVK 130 140 150 160 170 180 150 160 170 180 190 pF1KE1 QNCTLSPSVLQDSLNNEEEGASGGAVHSDIGSSSSSSSPEP-------QEVTDTTEA--- ::.: .:. . . . .: . . . : .: : . .... : CCDS43 WPPTLQPPTLRPP-TLQPPTLQPPVVLGPPAPDPSPLAPPPGNPAGFRELLSEVLEPGPL 190 200 210 220 230 200 210 220 230 240 pF1KE1 ----PFQGDQRSLEFLLPPE--PDYSLLLTFIYNGRVVGEAQVQS-LDCRLVA---EPSG : :.: ..:. :. : .: . : : :: ... ::: : . CCDS43 PASLPPAGEQLLPDLLISPHMLPLTDLEIKFQYRGRPPRALTISNPHGCRLFYSQLEATQ 240 250 260 270 280 290 250 260 270 280 290 pF1KE1 SES------SMEQVLFPKPGPLEP------TQRLLSQLERGILVASNPRGLFVQRLCPIP . :.::: ::.: . :..::. :.::... . . :.. ::: CCDS43 EQVELFGPISLEQVRFPSPEDIPSDKQRFYTNQLLDVLDRGLILQLQGQDLYAIRLCQCK 300 310 320 330 340 350 300 310 320 330 340 350 pF1KE1 ISWNAPQAPPGPG-PHLLPSNECVELFRTAYFCRDLVRYFQG-LGPPPKFQVTLNFWEES . :..: : . :. . . ..:: .: .:. . .: . :: :.. . : :: CCDS43 VFWSGPCASAHDSCPNPIQREVKTKLFSLEHFLNELILFQKGQTNTPPPFEIFFCFGEEW 360 370 380 390 400 410 360 370 380 390 pF1KE1 HGSSHTPQNLITVKMEQAFARYLLEQTPEQQAAILSLV CCDS43 PDRKPREKKLITVQVVPVAARLLLEMFSGELSWSADSIRLQISNPDLKDRMVEQFKELHH 420 430 440 450 460 470 >>CCDS56512.1 IRF5 gene_id:3663|Hs108|chr7 (412 aa) initn: 520 init1: 218 opt: 362 Z-score: 338.4 bits: 71.5 E(32554): 1.6e-12 Smith-Waterman score: 478; 30.3% identity (53.8% similar) in 379 aa overlap (11-380:16-342) 10 20 30 40 50 pF1KE1 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQD .:. :.: ::.: :.::. : . : .: :::.:: .. .: : CCDS56 MNQSIPVAPTPPRRVRLKPWLVAQVNSCQYPGLQWVNGEKKLFCIPWRHATRHGPSQDGD 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 AAFFKAWAIFKGKYKEG-DTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQL ..::::: ::: :: : . :: ::. :::::::: .:. . . : .:::.:. CCDS56 NTIFKAWAKETGKYTEGVDEADPAKWKANLRCALNKSRDFRLIYDGPRDMPPQPYKIYE- 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 LPPGIVSGQPGTQKVPSKRQQSSVSSERKEEEDAMQNCTLSPSVLQDSLNNEEEGASGGA . :. :. .. : ..:..:::. .: . ::. ::. CCDS56 ----VCSNGPAPTDSQPPEDYSFGAGEEEEEEEELQR--MLPSL---SLT---------- 120 130 140 150 160 180 190 200 210 220 230 pF1KE1 VHSDIGSSSSSSSPEPQEVTDTTEAPFQGDQRSLEFLLPPEPDYSLLLTFIYNGRVVGEA ::: : :: : ::. .: .. .. :. . CCDS56 ------------------VTDL-EIKFQYRGR------PPR---ALTISNPHGCRLF-YS 170 180 190 240 250 260 270 280 pF1KE1 QVQSLDCRLVAEPSGSESSMEQVLFPKPGPLEP------TQRLLSQLERGILVASNPRGL :... . .. : : :.::: ::.: . :..::. :.::... . . : CCDS56 QLEATQEQV--ELFGP-ISLEQVRFPSPEDIPSDKQRFYTNQLLDVLDRGLILQLQGQDL 200 210 220 230 240 290 300 310 320 330 340 pF1KE1 FVQRLCPIPISWNAPQAPPGPG-PHLLPSNECVELFRTAYFCRDLVRYFQG-LGPPPKFQ .. ::: . :..: : . :. . . ..:: .: .:. . .: . :: :. CCDS56 YAIRLCQCKVFWSGPCASAHDSCPNPIQREVKTKLFSLEHFLNELILFQKGQTNTPPPFE 250 260 270 280 290 300 350 360 370 380 390 pF1KE1 VTLNFWEESHGSSHTPQNLITVKMEQAFARYLLEQTPEQQAAILSLV . . : :: . ..::::.. . :: ::: CCDS56 IFFCFGEEWPDRKPREKKLITVQVVPVAARLLLEMFSGELSWSADSIRLQISNPDLKDRM 310 320 330 340 350 360 CCDS56 VEQFKELHHIWQSQQRLQPVAQAPPGAGLGVGQGPWPMHPAGMQ 370 380 390 400 410 >>CCDS4155.1 IRF1 gene_id:3659|Hs108|chr5 (325 aa) initn: 306 init1: 177 opt: 351 Z-score: 329.9 bits: 69.6 E(32554): 4.7e-12 Smith-Waterman score: 351; 34.0% identity (69.9% similar) in 156 aa overlap (11-165:7-154) 10 20 30 40 50 60 pF1KE1 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQDAAFFK ..: :. :..:.:.::. : . . .:.::::::.:. . ..:: .:. CCDS41 MPITRMRMRPWLEMQINSNQIPGLIWINKEEMIFQIPWKHAAKHGWDINKDACLFR 10 20 30 40 50 70 80 90 100 110 pF1KE1 AWAIFKGKYKEGDT-GGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQLLPPGI .::: :.:: :. : .::. .:::.:. ...:: ...: . .::..::: . CCDS41 SWAIHTGRYKAGEKEPDPKTWKANFRCAMNSLPDIEEVKDQSRNKGSSAVRVYRMLPP-L 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 VSGQPGTQKVPSKRQQSSVSSERKEEEDAMQNCTLSPSVLQDSLNNEEEGASGGAVHSDI ...: .: :.:. .: ...:: :. ::....:.:.. CCDS41 TKNQRKERKSKSSRDAKS-KAKRKSCGDS------SPDTFSDGLSSSTLPDDHSSYTVPG 120 130 140 150 160 180 190 200 210 220 230 pF1KE1 GSSSSSSSPEPQEVTDTTEAPFQGDQRSLEFLLPPEPDYSLLLTFIYNGRVVGEAQVQSL CCDS41 YMQDLEVEQALTPALSPCAVSSTLPDWHIPVEVVPDSTSDLYNFQVSPMPSTSEATTDED 170 180 190 200 210 220 >>CCDS3835.1 IRF2 gene_id:3660|Hs108|chr4 (349 aa) initn: 276 init1: 181 opt: 344 Z-score: 323.1 bits: 68.4 E(32554): 1.1e-11 Smith-Waterman score: 346; 28.0% identity (63.8% similar) in 218 aa overlap (11-213:7-221) 10 20 30 40 50 60 pF1KE1 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQDAAFFK ..: :. ::..:. .::. : . : .:.::: ::... . ..:: .:. CCDS38 MPVERMRMRPWLEEQINSNTIPGLKWLNKEKKIFQIPWMHAARHGWDVEKDAPLFR 10 20 30 40 50 70 80 90 100 110 pF1KE1 AWAIFKGKYKEG-DTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQLLPPGI ::: ::.. : : : .::. .:::.:. ...:: ... . ..::..:: CCDS38 NWAIHTGKHQPGVDKPDPKTWKANFRCAMNSLPDIEEVKDKSIKKGNNAFRVYRMLP--- 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 VSGQPGTQ-KVPSKRQQSSVSSERKEEEDA---MQN--CTLSP--SVLQDSLNNEEEGAS .: .:. . : :. .....:. ..: .. ..: ::: .:: ....:: ... CCDS38 LSERPSKKGKKPKTEKEDKVKHIKQEPVESSLGLSNGVSDLSPEYAVLTSTIKNEVDSTV 120 130 140 150 160 170 180 190 200 210 220 pF1KE1 G----GAVHSDIGSSSSSSSPEPQEVTDTTEAPFQGDQR--SLEFLLPPEPDYSLLLTFI . : : : . .. .: .. ...:. ..:.. :. : : CCDS38 NIIVVGQSHLDSNIENQEIVTNPPDICQVVEVTTESDEQPVSMSELYPLQISPVSSYAES 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE1 YNGRVVGEAQVQSLDCRLVAEPSGSESSMEQVLFPKPGPLEPTQRLLSQLERGILVASNP CCDS38 ETTDSVPSDEESAEGRPHWRKRNIEGKQYLSNMGTRGSYLLPGMASFVTSNKPDLQVTIK 240 250 260 270 280 290 >>CCDS12775.1 IRF3 gene_id:3661|Hs108|chr19 (427 aa) initn: 203 init1: 178 opt: 339 Z-score: 317.2 bits: 67.6 E(32554): 2.4e-11 Smith-Waterman score: 387; 27.5% identity (52.4% similar) in 389 aa overlap (15-380:11-377) 10 20 30 40 50 60 pF1KE1 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQDAAFFK :.: :.. ::. :: : . ..: :::::::. .:: ... : ..:. CCDS12 MGTPKPRILPWLVSQLDLGQLEGVAWVNKSRTRFRIPWKHGLRQDAQQE-DFGIFQ 10 20 30 40 50 70 80 90 100 110 pF1KE1 AWAIFKGKYKEG-DTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQLLPPGI ::: : : : : .:: .: :::.. .. . .:.. : .:.:.:... :. CCDS12 AWAEATGAYVPGRDKPDLPTWKRNFRSALNRKEGLRLAEDRSK-DPHDPHKIYEFVNSGV 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 VS-GQPGTQKVPSKRQQSSVSSERKEEEDAMQ-NCTLSPSVLQDSLNNEEEGASGGAVHS . .:: :. :. .:.:. ... : . : .:.: : : : . :: CCDS12 GDFSQPDTS--PDTNGGGSTSDTQEDILDELLGNMVLAP--LPDP------GPPSLAVAP 120 130 140 150 160 180 190 200 210 220 230 pF1KE1 DIGS----SSSSSSPEPQEVTDTTEAPFQGDQRSLEFLLPPEPDYSLLLTFIYNGRVVGE . : : ..: : .: : :. :: : .. . .: .: :: : . CCDS12 EPCPQPLRSPSLDNPTPFPNLGPSENP-------LKRLLVPGEEWEFEVTAFYRGRQVFQ 170 180 190 200 210 240 250 260 270 280 pF1KE1 AQVQSLDC----RLVAEPSGSES-SMEQVLFPKPGP-------LEPTQRLLSQLERGILV :...: :::. :... : .: :: . ....:: : :. . CCDS12 ---QTISCPEGLRLVGSEVGDRTLPGWPVTLPDPGMSLTDRGVMSYVRHVLSCLGGGLAL 220 230 240 250 260 270 290 300 310 320 330 pF1KE1 ASNPRGLFVQRLCPIPISWNAPQA--P-PGPGPH-LLPSNECVELFRTAYFCRDLVRYFQ . :..::: : . . : : :: .:... .: . : ::. . . CCDS12 WRAGQWLWAQRLGHCHTYWAVSEELLPNSGHGPDGEVPKDKEGGVFDLGPFIVDLITFTE 280 290 300 310 320 330 340 350 360 370 380 390 pF1KE1 GLGPPPKFQVTLNFWEESHGSSHTPQNLITVKMEQAFARYLLEQTPEQQAAILSLV : : :.. . . : .. . :. ::. . : :.: CCDS12 GSGRSPRYALWFCVGESWPQDQPWTKRLVMVKVVPTCLRALVEMARVGGASSLENTVDLH 340 350 360 370 380 390 CCDS12 ISNSHPLSLTSDQYKAYLQDLVEGMDFQGPGES 400 410 420 393 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 01:59:02 2016 done: Sun Nov 6 01:59:03 2016 Total Scan time: 3.460 Total Display time: 0.040 Function used was FASTA [36.3.4 Apr, 2011]