FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE9629, 1001 aa 1>>>pF1KE9629 1001 - 1001 aa - 1001 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.2644+/-0.000933; mu= 13.7532+/- 0.057 mean_var=133.4077+/-26.348, 0's: 0 Z-trim(110.9): 12 B-trim: 6 in 1/50 Lambda= 0.111041 statistics sampled from 11944 (11953) to 11944 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.711), E-opt: 0.2 (0.367), width: 16 Scan time: 3.450 The best scores are: opt bits E(32554) CCDS11194.1 TOP3A gene_id:7156|Hs108|chr17 (1001) 6931 1122.3 0 CCDS13797.1 TOP3B gene_id:8940|Hs108|chr22 ( 862) 1392 234.9 5.4e-61 >>CCDS11194.1 TOP3A gene_id:7156|Hs108|chr17 (1001 aa) initn: 6931 init1: 6931 opt: 6931 Z-score: 6003.2 bits: 1122.3 E(32554): 0 Smith-Waterman score: 6931; 100.0% identity (100.0% similar) in 1001 aa overlap (1-1001:1-1001) 10 20 30 40 50 60 pF1KE9 MIFPVARYALRWLRRPEDRAFSRAAMEMALRGVRKVLCVAEKNDAAKGIADLLSNGRMRR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MIFPVARYALRWLRRPEDRAFSRAAMEMALRGVRKVLCVAEKNDAAKGIADLLSNGRMRR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE9 REGLSKFNKIYEFDYHLYGQNVTMVMTSVSGHLLAHDFQMQFRKWQSCNPLVLFEAEIEK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 REGLSKFNKIYEFDYHLYGQNVTMVMTSVSGHLLAHDFQMQFRKWQSCNPLVLFEAEIEK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE9 YCPENFVDIKKTLERETRQCQALVIWTDCDREGENIGFEIIHVCKAVKPNLQVLRARFSE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 YCPENFVDIKKTLERETRQCQALVIWTDCDREGENIGFEIIHVCKAVKPNLQVLRARFSE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE9 ITPHAVRTACENLTEPDQRVSDAVDVRQELDLRIGAAFTRFQTLRLQRIFPEVLAEQLIS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 ITPHAVRTACENLTEPDQRVSDAVDVRQELDLRIGAAFTRFQTLRLQRIFPEVLAEQLIS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE9 YGSCQFPTLGFVVERFKAIQAFVPEIFHRIKVTHDHKDGIVEFNWKRHRLFNHTACLVLY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 YGSCQFPTLGFVVERFKAIQAFVPEIFHRIKVTHDHKDGIVEFNWKRHRLFNHTACLVLY 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE9 QLCVEDPMATVVEVRSKPKSKWRPQALDTVELEKLASRKLRINAKETMRIAEKLYTQGYI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 QLCVEDPMATVVEVRSKPKSKWRPQALDTVELEKLASRKLRINAKETMRIAEKLYTQGYI 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE9 SYPRTETNIFPRDLNLTVLVEQQTPDPRWGAFAQSILERGGPTPRNGNKSDQAHPPIHPT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 SYPRTETNIFPRDLNLTVLVEQQTPDPRWGAFAQSILERGGPTPRNGNKSDQAHPPIHPT 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE9 KYTNNLQGDEQRLYEFIVRHFLACCSQDAQGQETTVEIDIAQERFVAHGLMILARNYLDV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 KYTNNLQGDEQRLYEFIVRHFLACCSQDAQGQETTVEIDIAQERFVAHGLMILARNYLDV 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE9 YPYDHWSDKILPVYEQGSHFQPSTVEMVDGETSPPKLLTEADLIALMEKHGIGTDATHAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 YPYDHWSDKILPVYEQGSHFQPSTVEMVDGETSPPKLLTEADLIALMEKHGIGTDATHAE 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE9 HIETIKARMYVGLTPDKRFLPGHLGMGLVEGYDSMGYEMSKPDLRAELEADLKLICDGKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 HIETIKARMYVGLTPDKRFLPGHLGMGLVEGYDSMGYEMSKPDLRAELEADLKLICDGKK 550 560 570 580 590 600 610 620 630 640 650 660 pF1KE9 DKFVVLRQQVQKYKQVFIEAVAKAKKLDEALAQYFGNGTELAQQEDIYPAMPEPIRKCPQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 DKFVVLRQQVQKYKQVFIEAVAKAKKLDEALAQYFGNGTELAQQEDIYPAMPEPIRKCPQ 610 620 630 640 650 660 670 680 690 700 710 720 pF1KE9 CNKDMVLKTKKNGGFYLSCMGFPECRSAVWLPDSVLEASRDSSVCPVCQPHPVYRLKLKF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 CNKDMVLKTKKNGGFYLSCMGFPECRSAVWLPDSVLEASRDSSVCPVCQPHPVYRLKLKF 670 680 690 700 710 720 730 740 750 760 770 780 pF1KE9 KRGSLPPTMPLEFVCCIGGCDDTLREILDLRFSGGPPRASQPSGRLQANQSLNRMDNSQH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 KRGSLPPTMPLEFVCCIGGCDDTLREILDLRFSGGPPRASQPSGRLQANQSLNRMDNSQH 730 740 750 760 770 780 790 800 810 820 830 840 pF1KE9 PQPADSRQTGSSKALAQTLPPPTAAGESNSVTCNCGQEAVLLTVRKEGPNRGRQFFKCNG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 PQPADSRQTGSSKALAQTLPPPTAAGESNSVTCNCGQEAVLLTVRKEGPNRGRQFFKCNG 790 800 810 820 830 840 850 860 870 880 890 900 pF1KE9 GSCNFFLWADSPNPGAGGPPALAYRPLGASLGCPPGPGIHLGGFGNPGDGSGSGTSCLCS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 GSCNFFLWADSPNPGAGGPPALAYRPLGASLGCPPGPGIHLGGFGNPGDGSGSGTSCLCS 850 860 870 880 890 900 910 920 930 940 950 960 pF1KE9 QPSVTRTVQKDGPNKGRQFHTCAKPREQQCGFFQWVDENTAPGTSGAPSWTGDRGRTLES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 QPSVTRTVQKDGPNKGRQFHTCAKPREQQCGFFQWVDENTAPGTSGAPSWTGDRGRTLES 910 920 930 940 950 960 970 980 990 1000 pF1KE9 EARSKRPRASSSDMGSTAKKPRKCSLCHQPGHTRPFCPQNR ::::::::::::::::::::::::::::::::::::::::: CCDS11 EARSKRPRASSSDMGSTAKKPRKCSLCHQPGHTRPFCPQNR 970 980 990 1000 >>CCDS13797.1 TOP3B gene_id:8940|Hs108|chr22 (862 aa) initn: 1230 init1: 425 opt: 1392 Z-score: 1208.6 bits: 234.9 E(32554): 5.4e-61 Smith-Waterman score: 1431; 36.2% identity (62.2% similar) in 749 aa overlap (33-756:1-742) 10 20 30 40 50 60 pF1KE9 FPVARYALRWLRRPEDRAFSRAAMEMALRGVRKVLCVAEKNDAAKGIADLLSNGRMRRRE .. :: :::: . :..:: .:: : . .. CCDS13 MKTVLMVAEKPSLAQSIAKILSRGSLSSHK 10 20 30 70 80 90 100 110 120 pF1KE9 GLSKFNKIYEFDYHLYGQNVTMVMTSVSGHLLAHDFQMQFRKWQSCNPLVLF-EAEIEKY ::. ...:. . :: : . :::: ::... :: .. ::.. .: :: .: :: CCDS13 GLNGACSVHEYTGTFAGQPVRFKMTSVCGHVMTLDFLGKYNKWDKVDPAELFSQAPTEKK 40 50 60 70 80 90 130 140 150 160 170 pF1KE9 CPENFVDIKKTLERETRQCQALVIWTDCDREGENIGFEIIHVC-----KAVKPNLQVLRA . ... : :. : : :. .:.: :::.::::: ::.. . :: . :.:: CCDS13 EANPKLNMVKFLQVEGRGCDYIVLWLDCDKEGENICFEVLDAVLPVMNKAHGGEKTVFRA 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE9 RFSEITPHAVRTACENLTEPDQRVSDAVDVRQELDLRIGAAFTRFQTLRLQRIFPEVLAE ::: :: . .: : :::. . .::.::::::::: ::::::: .: . . : CCDS13 RFSSITDTDICNAMACLGEPDHNEALSVDARQELDLRIGCAFTRFQTKYFQGKYGD-LDS 160 170 180 190 200 240 250 260 270 280 290 pF1KE9 QLISYGSCQFPTLGFVVERFKAIQAFVPEIFH--RIKVTHDHKDGIVEFNWKRHRLFNHT .:::.: :: ::::: ::: ::.: :: . . ::. : :: . ..: : :.:.. CCDS13 SLISFGPCQTPTLGFCVERHDKIQSFKPETYWVLQAKVNTD-KDRSLLLDWDRVRVFDRE 210 220 230 240 250 260 300 310 320 330 340 350 pF1KE9 ACLVLYQLCVEDPMATVVEVRSKPKSKWRPQALDTVELEKLASRKLRINAKETMRIAEKL .. .. . : : . : :.: :: ::.:::. ..:: .: .. ...:. ::.: CCDS13 IAQMFLNMTKLEKEAQVEATSRKEKAKQRPLALNTVEMLRVASSSLGMGPQHAMQTAERL 270 280 290 300 310 320 360 370 380 390 400 410 pF1KE9 YTQGYISYPRTETNIFPRDLNLTVLVEQQTPDPRWGAFAQSILERGGPTPRNGNKSDQAH :::::::::::::. .:....: ..::. : :. .. .: .: ::.:. . . : CCDS13 YTQGYISYPRTETTHYPENFDLKGSLRQQANHPYWADTVKRLLAEGINRPRKGHDAGD-H 330 340 350 360 370 380 420 430 440 450 460 470 pF1KE9 PPIHPTKYTNN--LQGDEQRLYEFIVRHFLACCSQDAQGQETTVEIDIAQERFVAHGLMI ::: : : ... : :: ::::.:.:::.: :.: . ..:. . :. : :. : . CCDS13 PPITPMKSATEAELGGDAWRLYEYITRHFIATVSHDCKYLQSTISFRIGPELFTCSGKTV 390 400 410 420 430 440 480 490 500 510 520 530 pF1KE9 LARNYLDVYPYDHWS-DKILPVYEQGSHFQPSTVEMVDGETSPPKLLTEADLIALMEKHG :. .. .:.:.. .. ::. ..:. : . :.:.. .:.:: ::::.::.:::::: CCDS13 LSPGFTEVMPWQSVPLEESLPTCQRGDAFPVGEVKMLEKQTNPPDYLTEAELITLMEKHG 450 460 470 480 490 500 540 550 560 570 580 590 pF1KE9 IGTDATHAEHIETIKARMYVGLTPDKRFLPGHLGMGLVEGYDSMGYEMSKPDLRAELEAD :::::. ::..: : :: . .:. : .::. ::.:: .. :. : .:. .: . CCDS13 IGTDASIPVHINNICQRNYVTVESGRRLKPTNLGIVLVHGYYKIDAELVLPTIRSAVEKQ 510 520 530 540 550 560 600 610 620 630 640 pF1KE9 LKLICDGKKDKFVVLRQQVQKYKQVFIEAVAKAKKLDEALAQYFG----NGTELAQQEDI :.:: .:: : :: . .. .:. : : . .:: . :. .: :.. CCDS13 LNLIAQGKADYRQVLGHTLDVFKRKFHYFVDSIAGMDELMEVSFSPLAATGKPLSRCGKC 570 580 590 600 610 620 650 660 670 680 690 700 pF1KE9 YPAMP----EPIR-KCPQCNKDMVLKTKKNGGFY--LSCMGFPECRSAVWLPDSVLEASR . : .: : .: .:.. ..: . . .: : : . . . ..: : .. CCDS13 HRFMKYIQAKPSRLHCSHCDETYTLPQNGTIKLYKELRC-PLDDFELVLWSSGS---RGK 630 640 650 660 670 680 710 720 730 740 750 pF1KE9 DSSVCPVCQPHPVYR-LK--LKFKRGSLPPTMPLEFVCCIGGCDDTLREILDLRFSGGPP . .:: : :: .: .: . .. . : . . :: : . .: : ..:: CCDS13 SYPLCPYCYNHPPFRDMKKGMGCNECTHPSCQHSLSMLGIGQCVECESGVLVLDPTSGPK 690 700 710 720 730 740 760 770 780 790 800 810 pF1KE9 RASQPSGRLQANQSLNRMDNSQHPQPADSRQTGSSKALAQTLPPPTAAGESNSVTCNCGQ CCDS13 WKVACNKCNVVAHCFENAHRVRVSADTCSVCEAALLDVDFNKAKSPLPGDETQHMGCVFC 750 760 770 780 790 800 1001 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 18:47:06 2016 done: Mon Nov 7 18:47:06 2016 Total Scan time: 3.450 Total Display time: 0.070 Function used was FASTA [36.3.4 Apr, 2011]