FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7066, 815 aa
  1>>>pF1KB7066 815 - 815 aa - 815 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 11.7437+/-0.000424; mu= -8.8551+/- 0.026
 mean_var=423.5842+/-87.775, 0's: 0 Z-trim(122.1): 131 B-trim: 16 in 1/58
 Lambda= 0.062317
 statistics sampled from 39645 (39808) to 39645 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.76), E-opt: 0.2 (0.467), width: 16
 Scan time: 13.800

The best scores are:                                       opt  bits E(85289)
NP_001121636 (OMIM: 164400,601556) ataxin-1 [Homo  ( 815) 5438 503.9 1.2e-141
NP_000323 (OMIM: 164400,601556) ataxin-1 [Homo sap ( 815) 5438 503.9 1.2e-141
NP_001131147 (OMIM: 614301) ataxin-1-like [Homo sa ( 689)  579  67.0 3.3e-10

>>NP_001121636 (OMIM: 164400,601556) ataxin-1 [Homo sapi (815 aa)
 initn: 5438 init1: 5438 opt: 5438 Z-score: 2664.2 bits: 503.9 E(85289): 1.2e-141
Smith-Waterman score: 5438; 100.0% identity (100.0% similar) in 815 aa overlap (1-815:1-815)

               10        20        30        40        50        60
pF1KB7 MKSNQERSNECLPPKKREIPATSRSSEEKAPTLPSDNHRVEGTAWLPGNPGGRGHGGGRH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MKSNQERSNECLPPKKREIPATSRSSEEKAPTLPSDNHRVEGTAWLPGNPGGRGHGGGRH
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 GPAGTSVELGLQQGIGLHKALSTGLDYSPPSAPRSVPVATTLPAAYATPQPGTPVSPVQY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GPAGTSVELGLQQGIGLHKALSTGLDYSPPSAPRSVPVATTLPAAYATPQPGTPVSPVQY
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 AHLPHTFQFIGSSQYSGTYASFIPSQLIPPTANPVTSAVASAAGATTPSQRSQLEAYSTL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 AHLPHTFQFIGSSQYSGTYASFIPSQLIPPTANPVTSAVASAAGATTPSQRSQLEAYSTL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 LANMGSLSQTPGHKAEQQQQQQQQQQQQHQHQQQQQQQQQQQQQQHLSRAPGLITPGSPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 LANMGSLSQTPGHKAEQQQQQQQQQQQQHQHQQQQQQQQQQQQQQHLSRAPGLITPGSPP
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB7 PAQQNQYVHISSSPQNTGRTASPPAIPVHLHPHQTMIPHTLTLGPPSQVVMQYADSGSHF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 PAQQNQYVHISSSPQNTGRTASPPAIPVHLHPHQTMIPHTLTLGPPSQVVMQYADSGSHF
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB7 VPREATKKAESSRLQQAIQAKEVLNGEMEKSRRYGAPSSADLGLGKAGGKSVPHPYESRH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 VPREATKKAESSRLQQAIQAKEVLNGEMEKSRRYGAPSSADLGLGKAGGKSVPHPYESRH
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB7 VVVHPSPSDYSSRDPSGVRASVMVLPNSNTPAADLEVQQATHREASPSTLNDKSGLHLGK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 VVVHPSPSDYSSRDPSGVRASVMVLPNSNTPAADLEVQQATHREASPSTLNDKSGLHLGK
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KB7 PGHRSYALSPHTVIQTTHSASEPLPVGLPATAFYAGTQPPVIGYLSGQQQAITYAGSLPQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 PGHRSYALSPHTVIQTTHSASEPLPVGLPATAFYAGTQPPVIGYLSGQQQAITYAGSLPQ
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KB7 HLVIPGTQPLLIPVGSTDMEASGAAPAIVTSSPQFAAVPHTFVTTALPKSENFNPEALVT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 HLVIPGTQPLLIPVGSTDMEASGAAPAIVTSSPQFAAVPHTFVTTALPKSENFNPEALVT
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KB7 QAAYPAMVQAQIHLPVVQSVASPAAAPPTLPPYFMKGSIIQLANGELKKVEDLKTEDFIQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 QAAYPAMVQAQIHLPVVQSVASPAAAPPTLPPYFMKGSIIQLANGELKKVEDLKTEDFIQ
              550       560       570       580       590       600

              610       620       630       640       650       660
pF1KB7 SAEISNDLKIDSSTVERIEDSHSPGVAVIQFAVGEHRAQVSVEVLVEYPFFVFGQGWSSC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SAEISNDLKIDSSTVERIEDSHSPGVAVIQFAVGEHRAQVSVEVLVEYPFFVFGQGWSSC
              610       620       630       640       650       660

              670       680       690       700       710       720
pF1KB7 CPERTSQLFDLPCSKLSVGDVCISLTLKNLKNGSVKKGQPVDPASVLLKHSKADGLAGSR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 CPERTSQLFDLPCSKLSVGDVCISLTLKNLKNGSVKKGQPVDPASVLLKHSKADGLAGSR
              670       680       690       700       710       720

              730       740       750       760       770       780
pF1KB7 HRYAEQENGINQGSAQMLSENGELKFPEKMGLPAAPFLTKIEPSKPAATRKRRWSAPESR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 HRYAEQENGINQGSAQMLSENGELKFPEKMGLPAAPFLTKIEPSKPAATRKRRWSAPESR
              730       740       750       760       770       780

              790       800       810
pF1KB7 KLEKSEDEPPLTLPKPSLIPQEVKICIEGRSNVGK
       :::::::::::::::::::::::::::::::::::
NP_001 KLEKSEDEPPLTLPKPSLIPQEVKICIEGRSNVGK
              790       800       810

>>NP_000323 (OMIM: 164400,601556) ataxin-1 [Homo sapiens (815 aa)
 initn: 5438 init1: 5438 opt: 5438 Z-score: 2664.2 bits: 503.9 E(85289): 1.2e-141
Smith-Waterman score: 5438; 100.0% identity (100.0% similar) in 815 aa overlap (1-815:1-815)

               10        20        30        40        50        60
pF1KB7 MKSNQERSNECLPPKKREIPATSRSSEEKAPTLPSDNHRVEGTAWLPGNPGGRGHGGGRH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 MKSNQERSNECLPPKKREIPATSRSSEEKAPTLPSDNHRVEGTAWLPGNPGGRGHGGGRH
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 GPAGTSVELGLQQGIGLHKALSTGLDYSPPSAPRSVPVATTLPAAYATPQPGTPVSPVQY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 GPAGTSVELGLQQGIGLHKALSTGLDYSPPSAPRSVPVATTLPAAYATPQPGTPVSPVQY
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 AHLPHTFQFIGSSQYSGTYASFIPSQLIPPTANPVTSAVASAAGATTPSQRSQLEAYSTL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 AHLPHTFQFIGSSQYSGTYASFIPSQLIPPTANPVTSAVASAAGATTPSQRSQLEAYSTL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 LANMGSLSQTPGHKAEQQQQQQQQQQQQHQHQQQQQQQQQQQQQQHLSRAPGLITPGSPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 LANMGSLSQTPGHKAEQQQQQQQQQQQQHQHQQQQQQQQQQQQQQHLSRAPGLITPGSPP
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB7 PAQQNQYVHISSSPQNTGRTASPPAIPVHLHPHQTMIPHTLTLGPPSQVVMQYADSGSHF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 PAQQNQYVHISSSPQNTGRTASPPAIPVHLHPHQTMIPHTLTLGPPSQVVMQYADSGSHF
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB7 VPREATKKAESSRLQQAIQAKEVLNGEMEKSRRYGAPSSADLGLGKAGGKSVPHPYESRH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 VPREATKKAESSRLQQAIQAKEVLNGEMEKSRRYGAPSSADLGLGKAGGKSVPHPYESRH
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB7 VVVHPSPSDYSSRDPSGVRASVMVLPNSNTPAADLEVQQATHREASPSTLNDKSGLHLGK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 VVVHPSPSDYSSRDPSGVRASVMVLPNSNTPAADLEVQQATHREASPSTLNDKSGLHLGK
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KB7 PGHRSYALSPHTVIQTTHSASEPLPVGLPATAFYAGTQPPVIGYLSGQQQAITYAGSLPQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 PGHRSYALSPHTVIQTTHSASEPLPVGLPATAFYAGTQPPVIGYLSGQQQAITYAGSLPQ
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KB7 HLVIPGTQPLLIPVGSTDMEASGAAPAIVTSSPQFAAVPHTFVTTALPKSENFNPEALVT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 HLVIPGTQPLLIPVGSTDMEASGAAPAIVTSSPQFAAVPHTFVTTALPKSENFNPEALVT
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KB7 QAAYPAMVQAQIHLPVVQSVASPAAAPPTLPPYFMKGSIIQLANGELKKVEDLKTEDFIQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 QAAYPAMVQAQIHLPVVQSVASPAAAPPTLPPYFMKGSIIQLANGELKKVEDLKTEDFIQ
              550       560       570       580       590       600

              610       620       630       640       650       660
pF1KB7 SAEISNDLKIDSSTVERIEDSHSPGVAVIQFAVGEHRAQVSVEVLVEYPFFVFGQGWSSC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 SAEISNDLKIDSSTVERIEDSHSPGVAVIQFAVGEHRAQVSVEVLVEYPFFVFGQGWSSC
              610       620       630       640       650       660

              670       680       690       700       710       720
pF1KB7 CPERTSQLFDLPCSKLSVGDVCISLTLKNLKNGSVKKGQPVDPASVLLKHSKADGLAGSR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 CPERTSQLFDLPCSKLSVGDVCISLTLKNLKNGSVKKGQPVDPASVLLKHSKADGLAGSR
              670       680       690       700       710       720

              730       740       750       760       770       780
pF1KB7 HRYAEQENGINQGSAQMLSENGELKFPEKMGLPAAPFLTKIEPSKPAATRKRRWSAPESR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 HRYAEQENGINQGSAQMLSENGELKFPEKMGLPAAPFLTKIEPSKPAATRKRRWSAPESR
              730       740       750       760       770       780

              790       800       810
pF1KB7 KLEKSEDEPPLTLPKPSLIPQEVKICIEGRSNVGK
       :::::::::::::::::::::::::::::::::::
NP_000 KLEKSEDEPPLTLPKPSLIPQEVKICIEGRSNVGK
              790       800       810

>>NP_001131147 (OMIM: 614301) ataxin-1-like [Homo sapien (689 aa)
 initn: 916 init1: 565 opt: 579 Z-score: 304.3 bits: 67.0 E(85289): 3.3e-10
Smith-Waterman score: 910; 31.3% identity (54.7% similar) in 830 aa overlap (1-815:1-689)

       10 20 30 40 50
pF1KB7 MKSNQERSNECLPPKKREIPATSRSSEEKAPTLPSDNHRVEGTAWLPGNP-GGRGHGGGR
       :: .:::.::::::::..:.::.. . . . . ... : : .:....:.:
NP_001 MKPVHERSQECLPPKKRDLPVTSEDMGRTTSCSTNHTPSSDASEWSRGVVVAGQSQAGAR
       10 20 30 40 50 60

       60 70 80 90 100 110
pF1KB7 HGPAGTSVE--LGL---QQGIGLHKALSTGLDYSPPSAPRSV---PVATTLPAAYATPQ-
       . .: ..: :: : :. :.:. .:: . : : :. :. .: . :
NP_001 VSLGGDGAEAITGLTVDQYGM-LYKVAVPPATFSPTGLPSVVNMSPLPPTFNVASSLIQH
       70 80 90 100 110

       120 130 140 150 160
pF1KB7 PGTPVSPVQYAHLPHT-FQFIGSSQYSGTYA---SFIPSQLIPPTANPVTSAVASAAGAT
       :: :..::.:: : .::::: :: :: .:.:: :. :.:: .:: .
NP_001 PGIHYPPLHYAQLPSTSLQFIGSP-YSLPYAVPPNFLPSPLLSPSANLATSHL-------
       120 130 140 150 160 170

       170 180 190 200 210 220
pF1KB7 TPSQRSQLEAYSTLLANMGSLSQTPGHKAEQQQQQQQQQQQQHQHQQQQQQQQQQQQQQH
       : .. :..:::. :. :: : . :.
NP_001 -P----HFVPYASLLAE-GA---TP---------PPQAPSPAHS----------------
       180 190

       230 240 250 260 270 280
pF1KB7 LSRAPGLITPGSPPPAQQNQYVHISSSPQNTGRTASPPAIPVHLHPHQTMIPHTLTLGPP
       ...::. .:.. : : :..: . .: .:.. . . .:: ::
NP_001 FNKAPSATSPSGQLPH------HSSTQPLDL----APGRMPIYYQMSRLPAGYTLHETPP
       200 210 220 230 240

       290 300 310 320 330 340
pF1KB7 SQVVMQYADSGSHFVPREATKKAESSRLQQAIQAKEVLNGEMEKSRRYGAPSSADLGLGK
       : .. ..:.:. . :.. . . . .: : ..:. .:.: : :
NP_001 -------AGASPVLTPQESQSALEAAAANGGQRPRER-NLVRRESEALDSPNSKGEGQGL
       250 260 270 280 290

       350 360 370 380 390 400
pF1KB7 AGGKSVPHPYESRHVVVHPSPSDYSSRDPSGVRASVMVLPNSNTPAADLEVQQATHREAS
       :: . :: . . .:. : :. : . . .:: .:::::... ::
NP_001 -----VP----VVECVVDGQLFS-GSQTP---RVEVAAPAHRGTPDTDLEVQRVVGALAS
       300 310 320 330 340

       410 420 430 440 450 460
pF1KB7 PSTLNDKSGLHLGKPGHRSYALSPHTVI-QTTHSASEPLPVGLPATAFYAGTQPPVIGYL
       . . : :: :: : .: :. : . : :
NP_001 Q---DYRVVAAQRKEEPSPLNLSHHTPDHQGEGRGSARNPAELAEKSQARGFYP------
       350 360 370 380 390

       470 480 490 500 510 520
pF1KB7 SGQQQAITYAGSLPQHLVIPGTQPLLIPVGSTDMEASGAAPAIVTSSPQFAAVPHTFVTT
       ...:. . . ::. .:. . . :.:.: :: :: : : : . .:..
NP_001 QSHQEPVKHR-PLPKAMVVANGN--LVPTG-TD---SGLLP--VGS--------EILVAS
       400 410 420 430 440

       530 540 550 560 570 580
pF1KB7 ALPKSENFNPEALVTQAAYPAMVQAQIHLPVVQSVASPAAAPPTLPPYFMKGSIIQLANG
       .: :::. .: . . : .. :: .::::.:::::.:
NP_001 SLD-------------------VQARATFPDKEPTPPPITSSH-LPSHFMKGAIIQLATG
       450 460 470 480

       590 600 610 620 630 640
pF1KB7 ELKKVEDLKTEDFIQSAEISNDLKIDSSTVERIEDSHSPGVAVIQFAVGEHRAQVSVEVL
       :::.::::.:.::..:::.:. :::::::: :..:. :: ....:.:::....::.::
NP_001 ELKRVEDLQTQDFVRSAEVSGGLKIDSSTVVDIQESQWPGFVMLHFVVGEQQSKVSIEVP
       490 500 510 520 530 540

       650 660 670 680 690 700
pF1KB7 VEYPFFVFGQGWSSCCPERTSQLFDLPCSKLSVGDVCISLTLKNLKNGSVKKGQPVDPAS
       :.::::.::::::: : ::.:::.::: .:.:::::::..:..:...::.... . :..
NP_001 PEHPFFVYGQGWSSCSPGRTTQLFSLPCHRLQVGDVCISISLQSLNSNSVSQASCAPPSQ
       550 560 570 580 590 600

       710 720 730 740 750 760
pF1KB7 VLLKHSKADGLAGSRHRYAEQENGINQGSAQMLSENGELKFPEKMGLPAAPFLTKIEPSK
       :. :.: : . :: .. . .:. . ::. .:::.
NP_001 ----------LGPPRER---PERTV-LGSRELCDSEGKSQ-------PAGEGSRVVEPSQ
       610 620 630

       770 780 790 800 810
pF1KB7 PAATRKRRWSAPESRKLEKSEDEPPLTLPKPSLIPQEVKICIEGRSNVGK
       : . . : :: .. . .: .: .::.::::::. ::::::.::
NP_001 PESGAQACWPAPSFQRYSMQGEEARAALLRPSFIPQEVKLSIEGRSNAGK
       640 650 660 670 680

815 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 04:19:04 2016 done: Tue Nov 8 04:19:06 2016
 Total Scan time: 13.800 Total Display time: 0.080

Function used was FASTA [36.3.4 Apr, 2011]