FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KA0259, 1522 aa 1>>>pF1KA0259 1522 - 1522 aa - 1522 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.5725+/-0.00112; mu= 11.7448+/- 0.067 mean_var=99.3735+/-19.744, 0's: 0 Z-trim(104.9): 25 B-trim: 0 in 0/50 Lambda= 0.128659 statistics sampled from 8134 (8147) to 8134 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.598), E-opt: 0.2 (0.25), width: 16 Scan time: 5.990 The best scores are: opt bits E(32554) CCDS46919.1 TOPBP1 gene_id:11073|Hs108|chr3 (1522) 10112 1888.4 0 CCDS3220.1 ECT2 gene_id:1894|Hs108|chr3 ( 883) 462 97.2 2.4e-19 CCDS58860.1 ECT2 gene_id:1894|Hs108|chr3 ( 914) 455 95.9 6e-19 >>CCDS46919.1 TOPBP1 gene_id:11073|Hs108|chr3 (1522 aa) initn: 10112 init1: 10112 opt: 10112 Z-score: 10137.2 bits: 1888.4 E(32554): 0 Smith-Waterman score: 10112; 99.9% identity (100.0% similar) in 1522 aa overlap (1-1522:1-1522) 10 20 30 40 50 60 pF1KA0 MSRNDKEPFFVKFLKSSDNSKCFFKALESIKEFQSEEYLQIITEEEALKIKENDRSLYIC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 MSRNDKEPFFVKFLKSSDNSKCFFKALESIKEFQSEEYLQIITEEEALKIKENDRSLYIC 10 20 30 40 50 60 70 80 90 100 110 120 pF1KA0 DPFSGVVFDHLKKLGCRIVGPQVVIFCMHHQRCVPRAEHPVYNMVMSDVTISCTSLEKEK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 DPFSGVVFDHLKKLGCRIVGPQVVIFCMHHQRCVPRAEHPVYNMVMSDVTISCTSLEKEK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KA0 REEVHKYVQMMGGRVYRDLNVSVTHLIAGEVGSKKYLVAANLKKPILLPSWIKTLWEKSQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 REEVHKYVQMMGGRVYRDLNVSVTHLIAGEVGSKKYLVAANLKKPILLPSWIKTLWEKSQ 130 140 150 160 170 180 190 200 210 220 230 240 pF1KA0 EKKITRYTDINMEDFKCPIFLGCIICVTGLCGLDRKEVQQLTVKHGGQYMGQLKMNECTH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 EKKITRYTDINMEDFKCPIFLGCIICVTGLCGLDRKEVQQLTVKHGGQYMGQLKMNECTH 190 200 210 220 230 240 250 260 270 280 290 300 pF1KA0 LIVQEPKGQKYECAKRWNVHCVTTQWFFDSIEKGFCQDESIYKTEPRPEAKTMPNSSTPT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 LIVQEPKGQKYECAKRWNVHCVTTQWFFDSIEKGFCQDESIYKTEPRPEAKTMPNSSTPT 250 260 270 280 290 300 310 320 330 340 350 360 pF1KA0 SQINTIDSRTLSDVSNISNINASCVSESICNSLNSKLEPTLENLENLDVSAFQAPEDLLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 SQINTIDSRTLSDVSNISNINASCVSESICNSLNSKLEPTLENLENLDVSAFQAPEDLLD 310 320 330 340 350 360 370 380 390 400 410 420 pF1KA0 GCRIYLCGFSGRKLDKLRRLINSGGGVRFNQLNEDVTHVIVGDYDDELKQFWNKSAHRPH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 GCRIYLCGFSGRKLDKLRRLINSGGGVRFNQLNEDVTHVIVGDYDDELKQFWNKSAHRPH 370 380 390 400 410 420 430 440 450 460 470 480 pF1KA0 VVGAKWLLECFSKGYMLSEEPYIHANYQPVEIPVSHQPESKAALLKKKNSSFSKKDFAPS ::::::::::::::::::::::::::::::::::::.::::::::::::::::::::::: CCDS46 VVGAKWLLECFSKGYMLSEEPYIHANYQPVEIPVSHKPESKAALLKKKNSSFSKKDFAPS 430 440 450 460 470 480 490 500 510 520 530 540 pF1KA0 EKHEQADEDLLSQYENGSSTVVEAKTSEARPFNDSTHAEPLNDSTHISLQEENQSSVSHC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 EKHEQADEDLLSQYENGSSTVVEAKTSEARPFNDSTHAEPLNDSTHISLQEENQSSVSHC 490 500 510 520 530 540 550 560 570 580 590 600 pF1KA0 VPDVSTITEEGLFSQKSFLVLGFSNENESNIANIIKENAGKIMSLLSRTVADYAVVPLLG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 VPDVSTITEEGLFSQKSFLVLGFSNENESNIANIIKENAGKIMSLLSRTVADYAVVPLLG 550 560 570 580 590 600 610 620 630 640 650 660 pF1KA0 CEVEATVGEVVTNTWLVTCIDYQTLFDPKSNPLFTPVPVMTGMTPLEDCVISFSQCAGAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 CEVEATVGEVVTNTWLVTCIDYQTLFDPKSNPLFTPVPVMTGMTPLEDCVISFSQCAGAE 610 620 630 640 650 660 670 680 690 700 710 720 pF1KA0 KESLTFLANLLGASVQEYFVRKSNAKKGMFASTHLILKERGGSKYEAAKKWNLPAVTIAW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 KESLTFLANLLGASVQEYFVRKSNAKKGMFASTHLILKERGGSKYEAAKKWNLPAVTIAW 670 680 690 700 710 720 730 740 750 760 770 780 pF1KA0 LLETARTGKRADESHFLIENSTKEERSLETEITNGINLNSDTAEHPGTRLQTHRKTVVTP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 LLETARTGKRADESHFLIENSTKEERSLETEITNGINLNSDTAEHPGTRLQTHRKTVVTP 730 740 750 760 770 780 790 800 810 820 830 840 pF1KA0 LDMNRFQSKAFRAVVSQHARQVAASPAVGQPLQKEPSLHLDTPSKFLSKDKLFKPSFDVK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 LDMNRFQSKAFRAVVSQHARQVAASPAVGQPLQKEPSLHLDTPSKFLSKDKLFKPSFDVK 790 800 810 820 830 840 850 860 870 880 890 900 pF1KA0 DALAALETPGRPSQQKRKPSTPLSEVIVKNLQLALANSSRNAVALSASPQLKEAQSEKEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 DALAALETPGRPSQQKRKPSTPLSEVIVKNLQLALANSSRNAVALSASPQLKEAQSEKEE 850 860 870 880 890 900 910 920 930 940 950 960 pF1KA0 APKPLHKVVVCVSKKLSKKQSELNGIAASLGADYRWSFDETVTHFIYQGRPNDTNREYKS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 APKPLHKVVVCVSKKLSKKQSELNGIAASLGADYRWSFDETVTHFIYQGRPNDTNREYKS 910 920 930 940 950 960 970 980 990 1000 1010 1020 pF1KA0 VKERGVHIVSEHWLLDCAQECKHLPESLYPHTYNPKMSLDISAVQDGRLCNSRLLSAVSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 VKERGVHIVSEHWLLDCAQECKHLPESLYPHTYNPKMSLDISAVQDGRLCNSRLLSAVSS 970 980 990 1000 1010 1020 1030 1040 1050 1060 1070 1080 pF1KA0 TKDDEPDPLILEENDVDNMATNNKESAPSNGSGKNDSKGVLTQTLEMRENFQKQLQEIMS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 TKDDEPDPLILEENDVDNMATNNKESAPSNGSGKNDSKGVLTQTLEMRENFQKQLQEIMS 1030 1040 1050 1060 1070 1080 1090 1100 1110 1120 1130 1140 pF1KA0 ATSIVKPQGQRTSLSRSGCNSASSTPDSTRSARSGRSRVLEALRQSRQTVPDVNTEPSQN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 ATSIVKPQGQRTSLSRSGCNSASSTPDSTRSARSGRSRVLEALRQSRQTVPDVNTEPSQN 1090 1100 1110 1120 1130 1140 1150 1160 1170 1180 1190 1200 pF1KA0 EQIIWDDPTAREERARLASNLQWPSCPTQYSELQVDIQNLEDSPFQKPLHDSEIAKQAVC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 EQIIWDDPTAREERARLASNLQWPSCPTQYSELQVDIQNLEDSPFQKPLHDSEIAKQAVC 1150 1160 1170 1180 1190 1200 1210 1220 1230 1240 1250 1260 pF1KA0 DPGNIRVTEAPKHPISEELETPIKDSHLIPTPQAPSIAFPLANPPVAPHPREKIITIEET :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 DPGNIRVTEAPKHPISEELETPIKDSHLIPTPQAPSIAFPLANPPVAPHPREKIITIEET 1210 1220 1230 1240 1250 1260 1270 1280 1290 1300 1310 1320 pF1KA0 HEELKKQYIFQLSSLNPQERIDYCHLIEKLGGLVIEKQCFDPTCTHIVVGHPLRNEKYLA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 HEELKKQYIFQLSSLNPQERIDYCHLIEKLGGLVIEKQCFDPTCTHIVVGHPLRNEKYLA 1270 1280 1290 1300 1310 1320 1330 1340 1350 1360 1370 1380 pF1KA0 SVAAGKWVLHRSYLEACRTAGHFVQEEDYEWGSSSILDVLTGINVQQRRLALAAMRWRKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 SVAAGKWVLHRSYLEACRTAGHFVQEEDYEWGSSSILDVLTGINVQQRRLALAAMRWRKK 1330 1340 1350 1360 1370 1380 1390 1400 1410 1420 1430 1440 pF1KA0 IQQRQESGIVEGAFSGWKVILHVDQSREAGFKRLLQSGGAKVLPGHSVPLFKEATHLFSD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 IQQRQESGIVEGAFSGWKVILHVDQSREAGFKRLLQSGGAKVLPGHSVPLFKEATHLFSD 1390 1400 1410 1420 1430 1440 1450 1460 1470 1480 1490 1500 pF1KA0 LNKLKPDDSGVNIAEAAAQNVYCLRTEYIADYLMQESPPHVENYCLPEAISFIQNNKELG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 LNKLKPDDSGVNIAEAAAQNVYCLRTEYIADYLMQESPPHVENYCLPEAISFIQNNKELG 1450 1460 1470 1480 1490 1500 1510 1520 pF1KA0 TGLSQKRKAPTEKNKIKRPRVH :::::::::::::::::::::: CCDS46 TGLSQKRKAPTEKNKIKRPRVH 1510 1520 >>CCDS3220.1 ECT2 gene_id:1894|Hs108|chr3 (883 aa) initn: 393 init1: 249 opt: 462 Z-score: 460.9 bits: 97.2 E(32554): 2.4e-19 Smith-Waterman score: 462; 30.3% identity (60.7% similar) in 300 aa overlap (11-301:53-345) 10 20 30 40 pF1KA0 MSRNDKEPFFVKFLKSSDNSKCFFKALESIKEFQSEEYLQ : ... . ... ..:::..:: . .. CCDS32 DSKVTEISKENLLIGSTSYVEEEMPQIETRVILVQEAGKQEELIKALKDIKV--GFVKME 30 40 50 60 70 80 50 60 70 80 90 100 pF1KA0 IITEEEALKIKENDRSLYICDPFSGVVFDHLKKLGCRIVGPQVVIFCMHHQRCVPRAEHP . : :.: : . . . : :. ::. : : ::..:: ::. : .. . .: . .: CCDS32 SVEEFEGLDSPEFENVFVVTD-FQDSVFNDLYKADCRVIGPPVVLNCSQKGEPLPFSCRP 90 100 110 120 130 110 120 130 140 150 pF1KA0 VYNMVMSDVTISCTSLEKEKREEVH--KYVQMMGGRVYRDLNVSVTHLIAGEVGSKKYLV .: : .... : . ..:.: :. :. ::: . .:.: .::::.:. . ..:. : CCDS32 LYCTSMMNLVL-CFTGFRKKEELVRLVTLVHHMGGVIRKDFNSKVTHLVANCTQGEKFRV 140 150 160 170 180 190 160 170 180 190 200 210 pF1KA0 AANLKKPILLPSWIKTLWEKSQEKKITRYTDINMEDFKCPIFLGCIICVTGLCGLDRKEV :..: ::. : :: ::. .:. . .: ..:: : : ::. :. .. .. CCDS32 AVSLGTPIMKPEWIYKAWERRNEQDFYAAVDDFRNEFKVPPFQDCILSFLGFSDEEKTNM 200 210 220 230 240 250 220 230 240 250 260 270 pF1KA0 QQLTVKHGGQYMGQLKMNECTHLIVQEP--KGQKYECAKRWNVHCVTTQWFFDSIEKGFC ...: .::.:. : ..::::.:.: : .: .:. .. : .::. ::. CCDS32 EEMTEMQGGKYL-PLGDERCTHLVVEENIVKDLPFEPSKK--LYVVKQEWFWGSIQMDAR 260 270 280 290 300 310 280 290 300 310 320 330 pF1KA0 QDESIYKTEP--RPEAK---TMPNSSTPTSQINTIDSRTLSDVSNISNINASCVSESICN :..: : :: : .: . .::.: CCDS32 AGETMYLYEKANTPELKKSVSMLSLNTPNSNRKRRRLKETLAQLSRETDVSPFPPRKRPS 320 330 340 350 360 370 >>CCDS58860.1 ECT2 gene_id:1894|Hs108|chr3 (914 aa) initn: 393 init1: 249 opt: 455 Z-score: 453.7 bits: 95.9 E(32554): 6e-19 Smith-Waterman score: 456; 31.2% identity (58.1% similar) in 301 aa overlap (12-301:93-376) 10 20 30 pF1KA0 MSRNDKEPFFVKFLKSSDNS--KCFFKALESIKEFQSEEYL :..:: : : : .::..:: CCDS58 EELIKALKTIKIMEVPVIKIKESCPGKSDEKLIKSVINMDIKVGFVKMESVEEF------ 70 80 90 100 110 40 50 60 70 80 90 pF1KA0 QIITEEEALKIKENDRSLYICDPFSGVVFDHLKKLGCRIVGPQVVIFCMHHQRCVPRAEH :.: : . . . : :. ::. : : ::..:: ::. : .. . .: . . CCDS58 ------EGLDSPEFENVFVVTD-FQDSVFNDLYKADCRVIGPPVVLNCSQKGEPLPFSCR 120 130 140 150 160 100 110 120 130 140 150 pF1KA0 PVYNMVMSDVTISCTSLEKEKREEVH--KYVQMMGGRVYRDLNVSVTHLIAGEVGSKKYL :.: : .... : . ..:.: :. :. ::: . .:.: .::::.:. . ..:. CCDS58 PLYCTSMMNLVL-CFTGFRKKEELVRLVTLVHHMGGVIRKDFNSKVTHLVANCTQGEKFR 170 180 190 200 210 220 160 170 180 190 200 210 pF1KA0 VAANLKKPILLPSWIKTLWEKSQEKKITRYTDINMEDFKCPIFLGCIICVTGLCGLDRKE ::..: ::. : :: ::. .:. . .: ..:: : : ::. :. .. . CCDS58 VAVSLGTPIMKPEWIYKAWERRNEQDFYAAVDDFRNEFKVPPFQDCILSFLGFSDEEKTN 230 240 250 260 270 280 220 230 240 250 260 270 pF1KA0 VQQLTVKHGGQYMGQLKMNECTHLIVQEP--KGQKYECAKRWNVHCVTTQWFFDSIEKGF ....: .::.:. : ..::::.:.: : .: .:. .. : .::. ::. CCDS58 MEEMTEMQGGKYL-PLGDERCTHLVVEENIVKDLPFEPSKK--LYVVKQEWFWGSIQMDA 290 300 310 320 330 340 280 290 300 310 320 330 pF1KA0 CQDESIYKTEP--RPEAK---TMPNSSTPTSQINTIDSRTLSDVSNISNINASCVSESIC :..: : :: : .: . .::.: CCDS58 RAGETMYLYEKANTPELKKSVSMLSLNTPNSNRKRRRLKETLAQLSRETDVSPFPPRKRP 350 360 370 380 390 400 340 350 360 370 380 390 pF1KA0 NSLNSKLEPTLENLENLDVSAFQAPEDLLDGCRIYLCGFSGRKLDKLRRLINSGGGVRFN CCDS58 SAEHSLSIGSLLDISNTPESSINYGDTPKSCTKSSKSSTPVPSKQSARWQVAKELYQTES 410 420 430 440 450 460 1522 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Thu Nov 3 19:37:19 2016 done: Thu Nov 3 19:37:20 2016 Total Scan time: 5.990 Total Display time: 0.150 Function used was FASTA [36.3.4 Apr, 2011]