FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5794, 689 aa 1>>>pF1KE5794 689 - 689 aa - 689 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 10.0218+/-0.00101; mu= -1.0543+/- 0.060 mean_var=272.0470+/-57.054, 0's: 0 Z-trim(114.1): 28 B-trim: 407 in 1/50 Lambda= 0.077759 statistics sampled from 14662 (14678) to 14662 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.763), E-opt: 0.2 (0.451), width: 16 Scan time: 3.840 The best scores are: opt bits E(32554) CCDS45523.1 ATXN1L gene_id:342371|Hs108|chr16 ( 689) 4660 536.4 5.3e-152 CCDS34342.1 ATXN1 gene_id:6310|Hs108|chr6 ( 815) 579 78.6 3.9e-14 >>CCDS45523.1 ATXN1L gene_id:342371|Hs108|chr16 (689 aa) initn: 4660 init1: 4660 opt: 4660 Z-score: 2842.4 bits: 536.4 E(32554): 5.3e-152 Smith-Waterman score: 4660; 100.0% identity (100.0% similar) in 689 aa overlap (1-689:1-689) 10 20 30 40 50 60 pF1KE5 MKPVHERSQECLPPKKRDLPVTSEDMGRTTSCSTNHTPSSDASEWSRGVVVAGQSQAGAR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 MKPVHERSQECLPPKKRDLPVTSEDMGRTTSCSTNHTPSSDASEWSRGVVVAGQSQAGAR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 VSLGGDGAEAITGLTVDQYGMLYKVAVPPATFSPTGLPSVVNMSPLPPTFNVASSLIQHP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 VSLGGDGAEAITGLTVDQYGMLYKVAVPPATFSPTGLPSVVNMSPLPPTFNVASSLIQHP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 GIHYPPLHYAQLPSTSLQFIGSPYSLPYAVPPNFLPSPLLSPSANLATSHLPHFVPYASL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 GIHYPPLHYAQLPSTSLQFIGSPYSLPYAVPPNFLPSPLLSPSANLATSHLPHFVPYASL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 LAEGATPPPQAPSPAHSFNKAPSATSPSGQLPHHSSTQPLDLAPGRMPIYYQMSRLPAGY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 LAEGATPPPQAPSPAHSFNKAPSATSPSGQLPHHSSTQPLDLAPGRMPIYYQMSRLPAGY 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 TLHETPPAGASPVLTPQESQSALEAAAANGGQRPRERNLVRRESEALDSPNSKGEGQGLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 TLHETPPAGASPVLTPQESQSALEAAAANGGQRPRERNLVRRESEALDSPNSKGEGQGLV 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE5 PVVECVVDGQLFSGSQTPRVEVAAPAHRGTPDTDLEVQRVVGALASQDYRVVAAQRKEEP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 PVVECVVDGQLFSGSQTPRVEVAAPAHRGTPDTDLEVQRVVGALASQDYRVVAAQRKEEP 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE5 SPLNLSHHTPDHQGEGRGSARNPAELAEKSQARGFYPQSHQEPVKHRPLPKAMVVANGNL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 SPLNLSHHTPDHQGEGRGSARNPAELAEKSQARGFYPQSHQEPVKHRPLPKAMVVANGNL 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE5 VPTGTDSGLLPVGSEILVASSLDVQARATFPDKEPTPPPITSSHLPSHFMKGAIIQLATG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 VPTGTDSGLLPVGSEILVASSLDVQARATFPDKEPTPPPITSSHLPSHFMKGAIIQLATG 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE5 ELKRVEDLQTQDFVRSAEVSGGLKIDSSTVVDIQESQWPGFVMLHFVVGEQQSKVSIEVP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 ELKRVEDLQTQDFVRSAEVSGGLKIDSSTVVDIQESQWPGFVMLHFVVGEQQSKVSIEVP 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE5 PEHPFFVYGQGWSSCSPGRTTQLFSLPCHRLQVGDVCISISLQSLNSNSVSQASCAPPSQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 PEHPFFVYGQGWSSCSPGRTTQLFSLPCHRLQVGDVCISISLQSLNSNSVSQASCAPPSQ 550 560 570 580 590 600 610 620 630 640 650 660 pF1KE5 LGPPRERPERTVLGSRELCDSEGKSQPAGEGSRVVEPSQPESGAQACWPAPSFQRYSMQG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 LGPPRERPERTVLGSRELCDSEGKSQPAGEGSRVVEPSQPESGAQACWPAPSFQRYSMQG 610 620 630 640 650 660 670 680 pF1KE5 EEARAALLRPSFIPQEVKLSIEGRSNAGK ::::::::::::::::::::::::::::: CCDS45 EEARAALLRPSFIPQEVKLSIEGRSNAGK 670 680 >>CCDS34342.1 ATXN1 gene_id:6310|Hs108|chr6 (815 aa) initn: 916 init1: 565 opt: 579 Z-score: 367.2 bits: 78.6 E(32554): 3.9e-14 Smith-Waterman score: 794; 30.8% identity (53.7% similar) in 760 aa overlap (82-689:77-815) 60 70 80 90 100 110 pF1KE5 AGQSQAGARVSLGGDGAEAITGLTVDQYGMLYKVAVPPATFSPTGLPSVVNMSPLPPTFN :.:. .:: . : : :. :. CCDS34 PGNPGGRGHGGGRHGPAGTSVELGLQQGIGLHKALSTGLDYSPPSAPRSV---PVATTLP 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE5 VASSLIQHPGIHYPPLHYAQLPSTSLQFIGSP-YSLPYAVPPNFLPSPLLSPSANLATSH .: . : :: :..::.:: : .::::: :: :: .:.:: :. :.:: .:: CCDS34 AAYATPQ-PGTPVSPVQYAHLPHT-FQFIGSSQYSGTYA---SFIPSQLIPPTANPVTSA 110 120 130 140 150 180 190 pF1KE5 L--------P----HFVPYASLLAE-GA---TPPPQAPSPAHS----------------- . : .. :..:::. :. :: .: . .. CCDS34 VASAAGATTPSQRSQLEAYSTLLANMGSLSQTPGHKAEQQQQQQQQQQQQHQHQQQQQQQ 160 170 180 190 200 210 200 210 220 230 pF1KE5 --------FNKAPSATSPSGQLPH------HSSTQPLDL----APGRMPIYYQMSRLPAG ...::. .:.. : : :..: . .: .:.. . . CCDS34 QQQQQQQHLSRAPGLITPGSPPPAQQNQYVHISSSPQNTGRTASPPAIPVHLHPHQTMIP 220 230 240 250 260 270 240 250 260 270 280 290 pF1KE5 YTLHETPP-------AGASPVLTPQESQSALEAAAANGGQRPRER-NLVRRESEALDSPN .:: :: : .. ..:.:. . :.. . . . .: : ..:. .:. CCDS34 HTLTLGPPSQVVMQYADSGSHFVPREATKKAESSRLQQAIQAKEVLNGEMEKSRRYGAPS 280 290 300 310 320 330 300 310 320 330 pF1KE5 SKGEGQGL-----VP----VVECVVDGQLFS-GSQTP---RVEVAAPAHRGTPDTDLEVQ : : : :: . :: . . .:. : :. : . . .:: .::::: CCDS34 SADLGLGKAGGKSVPHPYESRHVVVHPSPSDYSSRDPSGVRASVMVLPNSNTPAADLEVQ 340 350 360 370 380 390 340 350 360 370 380 pF1KE5 RVVGALASQDYRVVAAQRKEEPSPLN----LSHHTPDHQG---------EGRGSARNPAE . :..:. :: :: : : :.. . :: .: CCDS34 Q-------------ATHREASPSTLNDKSGLHLGKPGHRSYALSPHTVIQTTHSASEPLP 400 410 420 430 440 390 400 410 420 430 pF1KE5 LAEKSQA--RGFYP------QSHQEPVKHR-PLPKAMVVANGN--LVPTG-TD---SGLL .. . : : : ...:. . . ::. .:. . . :.:.: :: :: CCDS34 VGLPATAFYAGTQPPVIGYLSGQQQAITYAGSLPQHLVIPGTQPLLIPVGSTDMEASGAA 450 460 470 480 490 500 440 450 460 pF1KE5 P--VGS--------EILVASSLD-------------------VQARATFPDKEPTPPPIT : : : . .:...: :::. .: . . : . CCDS34 PAIVTSSPQFAAVPHTFVTTALPKSENFNPEALVTQAAYPAMVQAQIHLPVVQSVASPAA 510 520 530 540 550 560 470 480 490 500 510 520 pF1KE5 SSH-LPSHFMKGAIIQLATGELKRVEDLQTQDFVRSAEVSGGLKIDSSTVVDIQESQWPG . :: .::::.:::::.::::.::::.:.::..:::.:. :::::::: :..:. :: CCDS34 APPTLPPYFMKGSIIQLANGELKKVEDLKTEDFIQSAEISNDLKIDSSTVERIEDSHSPG 570 580 590 600 610 620 530 540 550 560 570 580 pF1KE5 FVMLHFVVGEQQSKVSIEVPPEHPFFVYGQGWSSCSPGRTTQLFSLPCHRLQVGDVCISI ....:.:::....::.:: :.::::.::::::: : ::.:::.::: .:.:::::::. CCDS34 VAVIQFAVGEHRAQVSVEVLVEYPFFVFGQGWSSCCPERTSQLFDLPCSKLSVGDVCISL 630 640 650 660 670 680 590 600 610 620 pF1KE5 SLQSLNSNSVSQASCAPPSQ----------LGPPRER---PERTV-LGSRELCDSEGKSQ .:..:...::.... . :.. :. :.: : . :: .. . .:. . CCDS34 TLKNLKNGSVKKGQPVDPASVLLKHSKADGLAGSRHRYAEQENGINQGSAQMLSENGELK 690 700 710 720 730 740 630 640 650 660 670 pF1KE5 -------PAGEGSRVVEPSQPESGAQACWPAPSFQRYSMQGEEARAALLRPSFIPQEVKL ::. .:::.: . . : :: .. . .: .: .::.::::::. CCDS34 FPEKMGLPAAPFLTKIEPSKPAATRKRRWSAPESRKLEKSEDEPPLTLPKPSLIPQEVKI 750 760 770 780 790 800 680 pF1KE5 SIEGRSNAGK ::::::.:: CCDS34 CIEGRSNVGK 810 689 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 06:42:21 2016 done: Tue Nov 8 06:42:22 2016 Total Scan time: 3.840 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]