FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1872, 393 aa
1>>>pF1KE1872 393 - 393 aa - 393 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.8913+/-0.000693; mu= 11.2054+/- 0.042
mean_var=120.0216+/-23.540, 0's: 0 Z-trim(114.3): 19 B-trim: 0 in 0/51
Lambda= 0.117070
statistics sampled from 14829 (14847) to 14829 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.793), E-opt: 0.2 (0.456), width: 16
Scan time: 3.460
The best scores are: opt bits E(32554)
CCDS9615.1 IRF9 gene_id:10379|Hs108|chr14 ( 393) 2687 464.2 9.3e-131
CCDS10956.1 IRF8 gene_id:3394|Hs108|chr16 ( 426) 679 125.1 1.2e-28
CCDS4469.1 IRF4 gene_id:3662|Hs108|chr6 ( 451) 529 99.7 5.4e-21
CCDS1492.1 IRF6 gene_id:3664|Hs108|chr1 ( 467) 403 78.5 1.4e-14
CCDS5808.1 IRF5 gene_id:3663|Hs108|chr7 ( 498) 369 72.7 8e-13
CCDS43645.1 IRF5 gene_id:3663|Hs108|chr7 ( 514) 368 72.6 9.3e-13
CCDS56512.1 IRF5 gene_id:3663|Hs108|chr7 ( 412) 362 71.5 1.6e-12
CCDS4155.1 IRF1 gene_id:3659|Hs108|chr5 ( 325) 351 69.6 4.7e-12
CCDS3835.1 IRF2 gene_id:3660|Hs108|chr4 ( 349) 344 68.4 1.1e-11
CCDS12775.1 IRF3 gene_id:3661|Hs108|chr19 ( 427) 339 67.6 2.4e-11
>>CCDS9615.1 IRF9 gene_id:10379|Hs108|chr14 (393 aa)
initn: 2687 init1: 2687 opt: 2687 Z-score: 2461.0 bits: 464.2 E(32554): 9.3e-131
Smith-Waterman score: 2687; 99.7% identity (100.0% similar) in 393 aa overlap (1-393:1-393)
10 20 30 40 50 60
pF1KE1 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQDAAFFK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQDAAFFK
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 AWAIFKGKYKEGDTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQLLPPGIV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 AWAIFKGKYKEGDTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQLLPPGIV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 SGQPGTQKVPSKRQQSSVSSERKEEEDAMQNCTLSPSVLQDSLNNEEEGASGGAVHSDIG
::::::::::::::.:::::::::::::::::::::::::::::::::::::::::::::
CCDS96 SGQPGTQKVPSKRQHSSVSSERKEEEDAMQNCTLSPSVLQDSLNNEEEGASGGAVHSDIG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE1 SSSSSSSPEPQEVTDTTEAPFQGDQRSLEFLLPPEPDYSLLLTFIYNGRVVGEAQVQSLD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 SSSSSSSPEPQEVTDTTEAPFQGDQRSLEFLLPPEPDYSLLLTFIYNGRVVGEAQVQSLD
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE1 CRLVAEPSGSESSMEQVLFPKPGPLEPTQRLLSQLERGILVASNPRGLFVQRLCPIPISW
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 CRLVAEPSGSESSMEQVLFPKPGPLEPTQRLLSQLERGILVASNPRGLFVQRLCPIPISW
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE1 NAPQAPPGPGPHLLPSNECVELFRTAYFCRDLVRYFQGLGPPPKFQVTLNFWEESHGSSH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 NAPQAPPGPGPHLLPSNECVELFRTAYFCRDLVRYFQGLGPPPKFQVTLNFWEESHGSSH
310 320 330 340 350 360
370 380 390
pF1KE1 TPQNLITVKMEQAFARYLLEQTPEQQAAILSLV
:::::::::::::::::::::::::::::::::
CCDS96 TPQNLITVKMEQAFARYLLEQTPEQQAAILSLV
370 380 390
>>CCDS10956.1 IRF8 gene_id:3394|Hs108|chr16 (426 aa)
initn: 729 init1: 525 opt: 679 Z-score: 627.6 bits: 125.1 E(32554): 1.2e-28
Smith-Waterman score: 740; 34.3% identity (64.9% similar) in 396 aa overlap (10-388:8-387)
10 20 30 40 50 60
pF1KE1 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQDAAFFK
:.::.:..::..:...::. :.. :.:::::::::::::. .. ::..::
CCDS10 MCDRNGGRRLRQWLIEQIDSSMYPGLIWENEEKSMFRIPWKHAGKQDYNQEVDASIFK
10 20 30 40 50
70 80 90 100 110 120
pF1KE1 AWAIFKGKYKEGDTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQLLPPGIV
:::.::::.:::: . ::.:::::::::::: .:.:: .:...:..::::::...:
CCDS10 AWAVFKGKFKEGDKAEPATWKTRLRCALNKSPDFEEVTDRSQLDISEPYKVYRIVPEEEQ
60 70 80 90 100 110
130 140 150 160 170 180
pF1KE1 SGQPGTQKVPSKRQQSSVSSERKEEEDAMQNCTLSPSVLQDSLNNEEEGASGGAVHSDIG
. . :. . . . . :.: .. ... ::: .: .. ... :
CCDS10 KCKLGVATAGCVNEVTEMECGRSEIDELIKE----PSV-DDYMGMIKRSPSPPE------
120 130 140 150 160
190 200 210 220 230
pF1KE1 SSSSSSSPEPQEVTDTTEAPFQGDQRSLEFLLPPEPDYS-LLLTFIYNGRVVGEAQVQSL
. :. :. .: .:. . . . .: ....: :.:..::.: .
CCDS10 ACRSQLLPDWWAQQPSTGVPLVTGYTTYD---AHHSAFSQMVISFYYGGKLVGQATTTCP
170 180 190 200 210 220
240 250 260 270 280
pF1KE1 D-CRL-VAEPS--GSE----SSMEQVLFPKPGPLEPTQR-------LLSQLERGILVASN
. ::: ...:. :.. ..: : :: :. :..: :...::::.:. :.
CCDS10 EGCRLSLSQPGLPGTKLYGPEGLELVRFP-PADAIPSERQRQVTRKLFGHLERGVLLHSS
230 240 250 260 270 280
290 300 310 320 330 340
pF1KE1 PRGLFVQRLCPIPISWNAPQAPPGPG-PHLLPSNECVELFRTAYFCRDLVRYFQGLGPPP
.:.::.::: . . . .: : :. : .: :..: :. : :.: ..... : :
CCDS10 RQGVFVKRLCQGRV-FCSGNAVVCKGRPNKLERDEVVQVFDTSQFFRELQQFYNSQGRLP
290 300 310 320 330 340
350 360 370 380 390
pF1KE1 KFQVTLNFWEESHGSSHTPQNLITVKMEQAFARYLLEQTPEQQAAILSLV
.:.: : :: . ..:: :..:: ..: : :.. .. .:
CCDS10 DGRVVLCFGEEFPDMAPLRSKLILVQIEQLYVRQLAEEAGKSCGAGSVMQAPEEPPPDQV
350 360 370 380 390 400
CCDS10 FRMFPDICASHQRSFFRENQQITV
410 420
>>CCDS4469.1 IRF4 gene_id:3662|Hs108|chr6 (451 aa)
initn: 741 init1: 328 opt: 529 Z-score: 490.3 bits: 99.7 E(32554): 5.4e-21
Smith-Waterman score: 734; 34.9% identity (61.1% similar) in 398 aa overlap (11-378:23-413)
10 20 30 40
pF1KE1 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQ
:::.:...:..::..::. :.. :..:::::::::::
CCDS44 MNLEGGGRGGEFGMSAVSCGNGKLRQWLIDQIDSGKYPGLVWENEEKSIFRIPWKHAGKQ
10 20 30 40 50 60
50 60 70 80 90 100
pF1KE1 DFREDQDAAFFKAWAIFKGKYKEG-DTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAE
:. ...:::.:::::.::::..:: : : .::::::::::::..:.:. ::...:...
CCDS44 DYNREEDAALFKAWALFKGKFREGIDKPDPPTWKTRLRCALNKSNDFEELVERSQLDISD
70 80 90 100 110 120
110 120 130 140 150
pF1KE1 PYKVYQLLPPGIVSGQPGTQKVPSKRQQSSVSSERKEEED-----AMQ--NCTLSP----
:::::...: : .. :.... . : :.: :.: : . :
CCDS44 PYKVYRIVPEG---AKKGAKQLTLEDPQMSMSHPYTMTTPYPSLPAQQVHNYMMPPLDRS
130 140 150 160 170
160 170 180 190 200
pF1KE1 --SVLQDSLNNE----------EEGA--SGGAVHSDIGSSSSSSSPEPQEVTDTTEAPFQ
. . :. . : .: .: : .. ... . : : ... .: .
CCDS44 WRDYVPDQPHPEIPYQCPMTFGPRGHHWQGPACENGCQVTGTFYACAPPE-SQAPGVPTE
180 190 200 210 220 230
210 220 230 240 250 260
pF1KE1 GDQRSLEFLLPPEPDYSLLLTFIYNGRVVGEAQVQSLD-CRLVAEPSGSESSMEQVLFPK
. :: : : : : . . : .: : ..: . ::. . . :...:::::
CCDS44 PSIRSAEAL--AFSDCRLHICLYYREILVKELTTSSPEGCRISHGHTYDASNLDQVLFPY
240 250 260 270 280 290
270 280 290 300 310
pF1KE1 P---GPLEPTQRLLSQLERGILVASNPRGLFVQRLCPIPISWNAPQAPPGPGPHLLPSNE
: : . ..:::.::::... : ::...::: : :..: : . :. : ..
CCDS44 PEDNGQRKNIEKLLSHLERGVVLWMAPDGLYAKRLCQSRIYWDGPLALCNDRPNKLERDQ
300 310 320 330 340 350
320 330 340 350 360 370
pF1KE1 CVELFRTAYFCRDLVRYFQGLGPPPKFQVTLNFWEESHGSSHTPQNLITVKMEQAFARYL
.:: : : .: . . :.::::: : :: . . ..:::...: .:: :
CCDS44 TCKLFDTQQFLSELQAFAHHGRSLPRFQVTLCFGEE-FPDPQRQRKLITAHVEPLLARQL
360 370 380 390 400 410
380 390
pF1KE1 LEQTPEQQAAILSLV
CCDS44 YYFAQQNSGHFLRGYDLPEHISNPEDYHRSIRHSSIQE
420 430 440 450
>>CCDS1492.1 IRF6 gene_id:3664|Hs108|chr1 (467 aa)
initn: 490 init1: 222 opt: 403 Z-score: 375.1 bits: 78.5 E(32554): 1.4e-14
Smith-Waterman score: 595; 31.2% identity (58.6% similar) in 401 aa overlap (11-380:9-404)
10 20 30 40 50 60
pF1KE1 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQDAAFFK
.:. :.: ::.:: .::. : . :.:::::: ... ..... ..::
CCDS14 MALHPRRVRLKPWLVAQVDSGLYPGLIWLHRDSKRFQIPWKHATRHSPQQEEENTIFK
10 20 30 40 50
70 80 90 100 110
pF1KE1 AWAIFKGKYKEG-DTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQLL----
:::. :::.:: : :: ::..:::::::: ::. . . . .: :.::.
CCDS14 AWAVETGKYQEGVDDPDPAKWKAQLRCALNKSREFNLMYDGTKEVPMNPVKIYQVCDIPQ
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE1 PPGIVSGQPGTQKVPSKRQQSSVSSERKEEE-DAMQNCTLSPSVLQDS---LN-NEEEGA
: : . . .: ..: .....:. : .:.: : :. . : .::. :: : :
CCDS14 PQGSIINPGSTGSAPWDEKDNDVDEEDEEDELDQSQHHV--P--IQDTFPFLNINGSPMA
120 130 140 150 160 170
180 190 200 210 220
pF1KE1 SGGAVHSDIGSSSSSS---SPEPQEVTDTTEAPFQGDQRSLEFLLPPEPDYSLLLTFIYN
... . ..:. : . . :: :. .. .::.: : :. . : .: . : :
CCDS14 PASVGNCSVGNCSPEAVWPKTEPLEM-EVPQAPIQPFYSSPELWISSLPMTDLDIKFQYR
180 190 200 210 220 230
230 240 250 260 270
pF1KE1 GRVVGEAQVQS--LDCRLV-----AEPSGSE----SSMEQVLFPKPGPLEP------TQR
:. :.... : ::: :. : :.::: :: : . :..
CCDS14 GKEYGQTMTVSNPQGCRLFYGDLGPMPDQEELFGPVSLEQVKFPGPEHITNEKQKLFTSK
240 250 260 270 280 290
280 290 300 310 320 330
pF1KE1 LLSQLERGILVASNPRGLFVQRLCPIPISWNAPQAPPGPGPHLLPSNECVELFRTAYFCR
::. ..::... . ..... ::: . :..: :: .:.:. .. :.:: :
CCDS14 LLDVMDRGLILEVSGHAIYAIRLCQCKVYWSGPCAPSLVAPNLIERQKKVKLFCLETFLS
300 310 320 330 340 350
340 350 360 370 380
pF1KE1 DLVRYFQG-LGPPPKFQVTLNFWEESHGSSHTPQNLITVKMEQAFARYLLEQTPEQQAAI
::. . .: . : :.. : : :: .. ..:: :.. . ::.. :
CCDS14 DLIAHQKGQIEKQPPFEIYLCFGEEWPDGKPLERKLILVQVIPVVARMIYEMFSGDFTRS
360 370 380 390 400 410
390
pF1KE1 LSLV
CCDS14 FDSGSVRLQISTPDIKDNIVAQLKQLYRILQTQESWQPMQPTPSMQLPPALPPQ
420 430 440 450 460
>>CCDS5808.1 IRF5 gene_id:3663|Hs108|chr7 (498 aa)
initn: 474 init1: 218 opt: 369 Z-score: 343.6 bits: 72.7 E(32554): 8e-13
Smith-Waterman score: 516; 29.1% identity (53.5% similar) in 413 aa overlap (11-380:16-428)
10 20 30 40 50
pF1KE1 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQD
.:. :.: ::.: :.::. : . : .: :::.:: .. .: :
CCDS58 MNQSIPVAPTPPRRVRLKPWLVAQVNSCQYPGLQWVNGEKKLFCIPWRHATRHGPSQDGD
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE1 AAFFKAWAIFKGKYKEG-DTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQL
..::::: ::: :: : . :: ::. :::::::: .:. . . : .:::.:..
CCDS58 NTIFKAWAKETGKYTEGVDEADPAKWKANLRCALNKSRDFRLIYDGPRDMPPQPYKIYEV
70 80 90 100 110 120
120 130 140 150 160
pF1KE1 LP--PGIVSGQP--------GTQKVPSKRQQSSVSSERKEEE----DAMQNCTLSPSVLQ
:. ...:: : .. .. : . : :. ..: :: : .::
CCDS58 CSNGPAPTDSQPPEDYSFGAGEEEEEEEELQRMLPSLSLTEDVKWPPTLQPPTLRPPTLQ
130 140 150 160 170 180
170 180 190 200 210
pF1KE1 D-SLNNEEEGASGGAVHSDIGSSSSSSSPEPQEVTDTTEA-------PFQGDQRSLEFLL
.:. . . : .. .. . . .... : : :.: ..:.
CCDS58 PPTLQPPVVLGPPAPDPSPLAPPPGNPAGFRELLSEVLEPGPLPASLPPAGEQLLPDLLI
190 200 210 220 230 240
220 230 240 250 260
pF1KE1 PPE--PDYSLLLTFIYNGRVVGEAQVQS-LDCRLVA---EPSGSES------SMEQVLFP
:. : .: . : : :: ... ::: : . . :.::: ::
CCDS58 SPHMLPLTDLEIKFQYRGRPPRALTISNPHGCRLFYSQLEATQEQVELFGPISLEQVRFP
250 260 270 280 290 300
270 280 290 300 310
pF1KE1 KPGPLEP------TQRLLSQLERGILVASNPRGLFVQRLCPIPISWNAPQAPPGPG-PHL
.: . :..::. :.::... . . :.. ::: . :..: : . :.
CCDS58 SPEDIPSDKQRFYTNQLLDVLDRGLILQLQGQDLYAIRLCQCKVFWSGPCASAHDSCPNP
310 320 330 340 350 360
320 330 340 350 360 370
pF1KE1 LPSNECVELFRTAYFCRDLVRYFQG-LGPPPKFQVTLNFWEESHGSSHTPQNLITVKMEQ
. . ..:: .: .:. . .: . :: :.. . : :: . ..::::..
CCDS58 IQREVKTKLFSLEHFLNELILFQKGQTNTPPPFEIFFCFGEEWPDRKPREKKLITVQVVP
370 380 390 400 410 420
380 390
pF1KE1 AFARYLLEQTPEQQAAILSLV
. :: :::
CCDS58 VAARLLLEMFSGELSWSADSIRLQISNPDLKDRMVEQFKELHHIWQSQQRLQPVAQAPPG
430 440 450 460 470 480
>>CCDS43645.1 IRF5 gene_id:3663|Hs108|chr7 (514 aa)
initn: 483 init1: 218 opt: 368 Z-score: 342.5 bits: 72.6 E(32554): 9.3e-13
Smith-Waterman score: 458; 28.2% identity (52.2% similar) in 404 aa overlap (11-354:16-418)
10 20 30 40 50
pF1KE1 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQD
.:. :.: ::.: :.::. : . : .: :::.:: .. .: :
CCDS43 MNQSIPVAPTPPRRVRLKPWLVAQVNSCQYPGLQWVNGEKKLFCIPWRHATRHGPSQDGD
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE1 AAFFKAWAIFKGKYKEG-DTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQL
..::::: ::: :: : . :: ::. :::::::: .:. . . : .:::.:..
CCDS43 NTIFKAWAKETGKYTEGVDEADPAKWKANLRCALNKSRDFRLIYDGPRDMPPQPYKIYEV
70 80 90 100 110 120
120 130 140
pF1KE1 LP--PGIVSGQP--------GTQK---------VPSKRQQSSVSSERKE------EEDAM
:. ...:: : .. .:: ..:.: . .::.
CCDS43 CSNGPAPTDSQPPEDYSFGAGEEEEEEEELQRMLPSLSLTDAVQSGPHMTPYSLLKEDVK
130 140 150 160 170 180
150 160 170 180 190
pF1KE1 QNCTLSPSVLQDSLNNEEEGASGGAVHSDIGSSSSSSSPEP-------QEVTDTTEA---
::.: .:. . . . .: . . . : .: : . .... :
CCDS43 WPPTLQPPTLRPP-TLQPPTLQPPVVLGPPAPDPSPLAPPPGNPAGFRELLSEVLEPGPL
190 200 210 220 230
200 210 220 230 240
pF1KE1 ----PFQGDQRSLEFLLPPE--PDYSLLLTFIYNGRVVGEAQVQS-LDCRLVA---EPSG
: :.: ..:. :. : .: . : : :: ... ::: : .
CCDS43 PASLPPAGEQLLPDLLISPHMLPLTDLEIKFQYRGRPPRALTISNPHGCRLFYSQLEATQ
240 250 260 270 280 290
250 260 270 280 290
pF1KE1 SES------SMEQVLFPKPGPLEP------TQRLLSQLERGILVASNPRGLFVQRLCPIP
. :.::: ::.: . :..::. :.::... . . :.. :::
CCDS43 EQVELFGPISLEQVRFPSPEDIPSDKQRFYTNQLLDVLDRGLILQLQGQDLYAIRLCQCK
300 310 320 330 340 350
300 310 320 330 340 350
pF1KE1 ISWNAPQAPPGPG-PHLLPSNECVELFRTAYFCRDLVRYFQG-LGPPPKFQVTLNFWEES
. :..: : . :. . . ..:: .: .:. . .: . :: :.. . : ::
CCDS43 VFWSGPCASAHDSCPNPIQREVKTKLFSLEHFLNELILFQKGQTNTPPPFEIFFCFGEEW
360 370 380 390 400 410
360 370 380 390
pF1KE1 HGSSHTPQNLITVKMEQAFARYLLEQTPEQQAAILSLV
CCDS43 PDRKPREKKLITVQVVPVAARLLLEMFSGELSWSADSIRLQISNPDLKDRMVEQFKELHH
420 430 440 450 460 470
>>CCDS56512.1 IRF5 gene_id:3663|Hs108|chr7 (412 aa)
initn: 520 init1: 218 opt: 362 Z-score: 338.4 bits: 71.5 E(32554): 1.6e-12
Smith-Waterman score: 478; 30.3% identity (53.8% similar) in 379 aa overlap (11-380:16-342)
10 20 30 40 50
pF1KE1 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQD
.:. :.: ::.: :.::. : . : .: :::.:: .. .: :
CCDS56 MNQSIPVAPTPPRRVRLKPWLVAQVNSCQYPGLQWVNGEKKLFCIPWRHATRHGPSQDGD
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE1 AAFFKAWAIFKGKYKEG-DTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQL
..::::: ::: :: : . :: ::. :::::::: .:. . . : .:::.:.
CCDS56 NTIFKAWAKETGKYTEGVDEADPAKWKANLRCALNKSRDFRLIYDGPRDMPPQPYKIYE-
70 80 90 100 110
120 130 140 150 160 170
pF1KE1 LPPGIVSGQPGTQKVPSKRQQSSVSSERKEEEDAMQNCTLSPSVLQDSLNNEEEGASGGA
. :. :. .. : ..:..:::. .: . ::. ::.
CCDS56 ----VCSNGPAPTDSQPPEDYSFGAGEEEEEEEELQR--MLPSL---SLT----------
120 130 140 150 160
180 190 200 210 220 230
pF1KE1 VHSDIGSSSSSSSPEPQEVTDTTEAPFQGDQRSLEFLLPPEPDYSLLLTFIYNGRVVGEA
::: : :: : ::. .: .. .. :. .
CCDS56 ------------------VTDL-EIKFQYRGR------PPR---ALTISNPHGCRLF-YS
170 180 190
240 250 260 270 280
pF1KE1 QVQSLDCRLVAEPSGSESSMEQVLFPKPGPLEP------TQRLLSQLERGILVASNPRGL
:... . .. : : :.::: ::.: . :..::. :.::... . . :
CCDS56 QLEATQEQV--ELFGP-ISLEQVRFPSPEDIPSDKQRFYTNQLLDVLDRGLILQLQGQDL
200 210 220 230 240
290 300 310 320 330 340
pF1KE1 FVQRLCPIPISWNAPQAPPGPG-PHLLPSNECVELFRTAYFCRDLVRYFQG-LGPPPKFQ
.. ::: . :..: : . :. . . ..:: .: .:. . .: . :: :.
CCDS56 YAIRLCQCKVFWSGPCASAHDSCPNPIQREVKTKLFSLEHFLNELILFQKGQTNTPPPFE
250 260 270 280 290 300
350 360 370 380 390
pF1KE1 VTLNFWEESHGSSHTPQNLITVKMEQAFARYLLEQTPEQQAAILSLV
. . : :: . ..::::.. . :: :::
CCDS56 IFFCFGEEWPDRKPREKKLITVQVVPVAARLLLEMFSGELSWSADSIRLQISNPDLKDRM
310 320 330 340 350 360
CCDS56 VEQFKELHHIWQSQQRLQPVAQAPPGAGLGVGQGPWPMHPAGMQ
370 380 390 400 410
>>CCDS4155.1 IRF1 gene_id:3659|Hs108|chr5 (325 aa)
initn: 306 init1: 177 opt: 351 Z-score: 329.9 bits: 69.6 E(32554): 4.7e-12
Smith-Waterman score: 351; 34.0% identity (69.9% similar) in 156 aa overlap (11-165:7-154)
10 20 30 40 50 60
pF1KE1 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQDAAFFK
..: :. :..:.:.::. : . . .:.::::::.:. . ..:: .:.
CCDS41 MPITRMRMRPWLEMQINSNQIPGLIWINKEEMIFQIPWKHAAKHGWDINKDACLFR
10 20 30 40 50
70 80 90 100 110
pF1KE1 AWAIFKGKYKEGDT-GGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQLLPPGI
.::: :.:: :. : .::. .:::.:. ...:: ...: . .::..::: .
CCDS41 SWAIHTGRYKAGEKEPDPKTWKANFRCAMNSLPDIEEVKDQSRNKGSSAVRVYRMLPP-L
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE1 VSGQPGTQKVPSKRQQSSVSSERKEEEDAMQNCTLSPSVLQDSLNNEEEGASGGAVHSDI
...: .: :.:. .: ...:: :. ::....:.:..
CCDS41 TKNQRKERKSKSSRDAKS-KAKRKSCGDS------SPDTFSDGLSSSTLPDDHSSYTVPG
120 130 140 150 160
180 190 200 210 220 230
pF1KE1 GSSSSSSSPEPQEVTDTTEAPFQGDQRSLEFLLPPEPDYSLLLTFIYNGRVVGEAQVQSL
CCDS41 YMQDLEVEQALTPALSPCAVSSTLPDWHIPVEVVPDSTSDLYNFQVSPMPSTSEATTDED
170 180 190 200 210 220
>>CCDS3835.1 IRF2 gene_id:3660|Hs108|chr4 (349 aa)
initn: 276 init1: 181 opt: 344 Z-score: 323.1 bits: 68.4 E(32554): 1.1e-11
Smith-Waterman score: 346; 28.0% identity (63.8% similar) in 218 aa overlap (11-213:7-221)
10 20 30 40 50 60
pF1KE1 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQDAAFFK
..: :. ::..:. .::. : . : .:.::: ::... . ..:: .:.
CCDS38 MPVERMRMRPWLEEQINSNTIPGLKWLNKEKKIFQIPWMHAARHGWDVEKDAPLFR
10 20 30 40 50
70 80 90 100 110
pF1KE1 AWAIFKGKYKEG-DTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQLLPPGI
::: ::.. : : : .::. .:::.:. ...:: ... . ..::..::
CCDS38 NWAIHTGKHQPGVDKPDPKTWKANFRCAMNSLPDIEEVKDKSIKKGNNAFRVYRMLP---
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE1 VSGQPGTQ-KVPSKRQQSSVSSERKEEEDA---MQN--CTLSP--SVLQDSLNNEEEGAS
.: .:. . : :. .....:. ..: .. ..: ::: .:: ....:: ...
CCDS38 LSERPSKKGKKPKTEKEDKVKHIKQEPVESSLGLSNGVSDLSPEYAVLTSTIKNEVDSTV
120 130 140 150 160 170
180 190 200 210 220
pF1KE1 G----GAVHSDIGSSSSSSSPEPQEVTDTTEAPFQGDQR--SLEFLLPPEPDYSLLLTFI
. : : : . .. .: .. ...:. ..:.. :. : :
CCDS38 NIIVVGQSHLDSNIENQEIVTNPPDICQVVEVTTESDEQPVSMSELYPLQISPVSSYAES
180 190 200 210 220 230
230 240 250 260 270 280
pF1KE1 YNGRVVGEAQVQSLDCRLVAEPSGSESSMEQVLFPKPGPLEPTQRLLSQLERGILVASNP
CCDS38 ETTDSVPSDEESAEGRPHWRKRNIEGKQYLSNMGTRGSYLLPGMASFVTSNKPDLQVTIK
240 250 260 270 280 290
>>CCDS12775.1 IRF3 gene_id:3661|Hs108|chr19 (427 aa)
initn: 203 init1: 178 opt: 339 Z-score: 317.2 bits: 67.6 E(32554): 2.4e-11
Smith-Waterman score: 387; 27.5% identity (52.4% similar) in 389 aa overlap (15-380:11-377)
10 20 30 40 50 60
pF1KE1 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQDAAFFK
:.: :.. ::. :: : . ..: :::::::. .:: ... : ..:.
CCDS12 MGTPKPRILPWLVSQLDLGQLEGVAWVNKSRTRFRIPWKHGLRQDAQQE-DFGIFQ
10 20 30 40 50
70 80 90 100 110
pF1KE1 AWAIFKGKYKEG-DTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQLLPPGI
::: : : : : .:: .: :::.. .. . .:.. : .:.:.:... :.
CCDS12 AWAEATGAYVPGRDKPDLPTWKRNFRSALNRKEGLRLAEDRSK-DPHDPHKIYEFVNSGV
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE1 VS-GQPGTQKVPSKRQQSSVSSERKEEEDAMQ-NCTLSPSVLQDSLNNEEEGASGGAVHS
. .:: :. :. .:.:. ... : . : .:.: : : : . ::
CCDS12 GDFSQPDTS--PDTNGGGSTSDTQEDILDELLGNMVLAP--LPDP------GPPSLAVAP
120 130 140 150 160
180 190 200 210 220 230
pF1KE1 DIGS----SSSSSSPEPQEVTDTTEAPFQGDQRSLEFLLPPEPDYSLLLTFIYNGRVVGE
. : : ..: : .: : :. :: : .. . .: .: :: : .
CCDS12 EPCPQPLRSPSLDNPTPFPNLGPSENP-------LKRLLVPGEEWEFEVTAFYRGRQVFQ
170 180 190 200 210
240 250 260 270 280
pF1KE1 AQVQSLDC----RLVAEPSGSES-SMEQVLFPKPGP-------LEPTQRLLSQLERGILV
:...: :::. :... : .: :: . ....:: : :. .
CCDS12 ---QTISCPEGLRLVGSEVGDRTLPGWPVTLPDPGMSLTDRGVMSYVRHVLSCLGGGLAL
220 230 240 250 260 270
290 300 310 320 330
pF1KE1 ASNPRGLFVQRLCPIPISWNAPQA--P-PGPGPH-LLPSNECVELFRTAYFCRDLVRYFQ
. :..::: : . . : : :: .:... .: . : ::. . .
CCDS12 WRAGQWLWAQRLGHCHTYWAVSEELLPNSGHGPDGEVPKDKEGGVFDLGPFIVDLITFTE
280 290 300 310 320 330
340 350 360 370 380 390
pF1KE1 GLGPPPKFQVTLNFWEESHGSSHTPQNLITVKMEQAFARYLLEQTPEQQAAILSLV
: : :.. . . : .. . :. ::. . : :.:
CCDS12 GSGRSPRYALWFCVGESWPQDQPWTKRLVMVKVVPTCLRALVEMARVGGASSLENTVDLH
340 350 360 370 380 390
CCDS12 ISNSHPLSLTSDQYKAYLQDLVEGMDFQGPGES
400 410 420
393 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 01:59:02 2016 done: Sun Nov 6 01:59:03 2016
Total Scan time: 3.460 Total Display time: 0.040
Function used was FASTA [36.3.4 Apr, 2011]