FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1289, 929 aa 1>>>pF1KE1289 929 - 929 aa - 929 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 15.2055+/-0.000428; mu= -23.6518+/- 0.027 mean_var=551.5238+/-109.948, 0's: 0 Z-trim(125.2): 62 B-trim: 0 in 0/60 Lambda= 0.054613 statistics sampled from 48485 (48579) to 48485 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.816), E-opt: 0.2 (0.57), width: 16 Scan time: 17.560 The best scores are: opt bits E(85289) NP_114129 (OMIM: 606576) transcription initiation ( 929) 6223 505.3 5.5e-142 XP_011518043 (OMIM: 606576) PREDICTED: transcripti (1005) 6205 503.9 1.6e-141 NP_066554 (OMIM: 105400,162230,616924) neurofilame (1020) 403 46.8 0.00065 >>NP_114129 (OMIM: 606576) transcription initiation fact (929 aa) initn: 6223 init1: 6223 opt: 6223 Z-score: 2670.0 bits: 505.3 E(85289): 5.5e-142 Smith-Waterman score: 6223; 99.8% identity (99.9% similar) in 929 aa overlap (1-929:1-929) 10 20 30 40 50 60 pF1KE1 MCESYSRSLLRVSVAQICQALGWDSVQLSACHLLTDVLQRYLQQLGRGCHRYSELYGRTD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_114 MCESYSRSLLRVSVAQICQALGWDSVQLSACHLLTDVLQRYLQQLGRGCHRYSELYGRTD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 PILDDVGEAFQLMGVSLHELEDYIHNIEPVTFPHQIPSFPVSKNNVLQFPQPGSKDAEER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_114 PILDDVGEAFQLMGVSLHELEDYIHNIEPVTFPHQIPSFPVSKNNVLQFPQPGSKDAEER 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 KEYIPDYLPPIVSSQEEEEEEQVPTDGGTSAEAMQVPLEEDDELEEEEIINDENFLGKRP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_114 KEYIPDYLPPIVSSQEEEEEEQVPTDGGTSAEAMQVPLEEDDELEEEEIINDENFLGKRP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 LDSPEAEELPAMKRPRLLSTKGDTLDVVLLEAREPLSSINTQKIPPMLSPVHVQDSTDLA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_114 LDSPEAEELPAMKRPRLLSTKGDTLDVVLLEAREPLSSINTQKIPPMLSPVHVQDSTDLA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 PPSPEPPMLAPVAKSQMPTAKPLETKSFTPKTKTKTSSPGQKTKSPKTAQSPAMVGSPIR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_114 PPSPEPPMLAPVAKSQMPTAKPLETKSFTPKTKTKTSSPGQKTKSPKTAQSPAMVGSPIR 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 SPKTVSKEKKSPGRSKSPKSPKSPKVTTHIPQTPVRPETPNRTPSATLSEKISKETIQVK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_114 SPKTVSKEKKSPGRSKSPKSPKSPKVTTHIPQTPVRPETPNRTPSATLSEKISKETIQVK 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE1 QIQTPPDAGKLNSENQPKKAVVADKTIEASIDAVIARACAEREPDPFEFSSGSESEGDIF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_114 QIQTPPDAGKLNSENQPKKAVVADKTIEASIDAVIARACAEREPDPFEFSSGSESEGDIF 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE1 TSPKRISGPECTTPKASTSANSFTKSGSTPLPLSGGTSSSDNSWTMDASIDEVVRKAKLG :::::::::::::::::::::.:::::::::::::::::::::::::::::::::::::: NP_114 TSPKRISGPECTTPKASTSANNFTKSGSTPLPLSGGTSSSDNSWTMDASIDEVVRKAKLG 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE1 TPSNMPPNFPYISSPSVSPPTPEPLHKVYEEKTKLPSSVEVKKKLKKELKTKMKKKEKQR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_114 TPSNMPPNFPYISSPSVSPPTPEPLHKVYEEKTKLPSSVEVKKKLKKELKTKMKKKEKQR 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE1 DREREKDKNKDKSKEKDKVKEKEKDKETGRETKYPWKEFLKEEEADPYKFKIKEFEDVDP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_114 DREREKDKNKDKSKEKDKVKEKEKDKETGRETKYPWKEFLKEEEADPYKFKIKEFEDVDP 550 560 570 580 590 600 610 620 630 640 650 660 pF1KE1 KVKLKDGLVRKEKEKHKDKKKDREKGKKDKDKREKEKVKDKGREDKMKAPAPPLVLPPKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_114 KVKLKDGLVRKEKEKHKDKKKDREKGKKDKDKREKEKVKDKGREDKMKAPAPPLVLPPKE 610 620 630 640 650 660 670 680 690 700 710 720 pF1KE1 LALPLFSPATASRVPAMLPSLLPVLPEKLFEEKEKPKEKEKKKDKKEKKKKKEKEKEKKE ::::::::::::::::::::::::::::::::::: :::::::::::::::::::::::: NP_114 LALPLFSPATASRVPAMLPSLLPVLPEKLFEEKEKVKEKEKKKDKKEKKKKKEKEKEKKE 670 680 690 700 710 720 730 740 750 760 770 780 pF1KE1 KEREKEKREREKREKEKEKHKHEKIKVEPVALAPSPVIPRLTLRVGAGQDKIVISKVVPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_114 KEREKEKREREKREKEKEKHKHEKIKVEPVALAPSPVIPRLTLRVGAGQDKIVISKVVPA 730 740 750 760 770 780 790 800 810 820 830 840 pF1KE1 PEAKPAPSQNRPKTPPPAPAPAPGPMLVSPAPVPLPLLAQAAAGPALLPSPGPAASGASA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_114 PEAKPAPSQNRPKTPPPAPAPAPGPMLVSPAPVPLPLLAQAAAGPALLPSPGPAASGASA 790 800 810 820 830 840 850 860 870 880 890 900 pF1KE1 KAPVRSVVTETVSTYVIRDEWGNQIWICPGCNKPDDGSPMIGCDDCDDWYHWPCVGIMTA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_114 KAPVRSVVTETVSTYVIRDEWGNQIWICPGCNKPDDGSPMIGCDDCDDWYHWPCVGIMTA 850 860 870 880 890 900 910 920 pF1KE1 PPEEMQWFCPKCANKKKDKKHKKRKHRAH ::::::::::::::::::::::::::::: NP_114 PPEEMQWFCPKCANKKKDKKHKKRKHRAH 910 920 >>XP_011518043 (OMIM: 606576) PREDICTED: transcription i (1005 aa) initn: 5305 init1: 5305 opt: 6205 Z-score: 2661.9 bits: 503.9 E(85289): 1.6e-141 Smith-Waterman score: 6205; 99.7% identity (99.8% similar) in 929 aa overlap (1-929:78-1005) 10 20 30 pF1KE1 MCESYSRSLLRVSVAQICQALGWDSVQLSA :::::::::::::::::::::::::::::: XP_011 LEDESGTLLRSGGGAGEQWQQGLRWRPRSGMCESYSRSLLRVSVAQICQALGWDSVQLSA 50 60 70 80 90 100 40 50 60 70 80 90 pF1KE1 CHLLTDVLQRYLQQLGRGCHRYSELYGRTDPILDDVGEAFQLMGVSLHELEDYIHNIEPV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 CHLLTDVLQRYLQQLGRGCHRYSELYGRTDPILDDVGEAFQLMGVSLHELEDYIHNIEPV 110 120 130 140 150 160 100 110 120 130 140 150 pF1KE1 TFPHQIPSFPVSKNNVLQFPQPGSKDAEERKEYIPDYLPPIVSSQEEEEEEQVPTDGGTS :::::::::::::::::::::::::::::::::::::::::::::::::: ::::::::: XP_011 TFPHQIPSFPVSKNNVLQFPQPGSKDAEERKEYIPDYLPPIVSSQEEEEE-QVPTDGGTS 170 180 190 200 210 220 160 170 180 190 200 210 pF1KE1 AEAMQVPLEEDDELEEEEIINDENFLGKRPLDSPEAEELPAMKRPRLLSTKGDTLDVVLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 AEAMQVPLEEDDELEEEEIINDENFLGKRPLDSPEAEELPAMKRPRLLSTKGDTLDVVLL 230 240 250 260 270 280 220 230 240 250 260 270 pF1KE1 EAREPLSSINTQKIPPMLSPVHVQDSTDLAPPSPEPPMLAPVAKSQMPTAKPLETKSFTP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 EAREPLSSINTQKIPPMLSPVHVQDSTDLAPPSPEPPMLAPVAKSQMPTAKPLETKSFTP 290 300 310 320 330 340 280 290 300 310 320 330 pF1KE1 KTKTKTSSPGQKTKSPKTAQSPAMVGSPIRSPKTVSKEKKSPGRSKSPKSPKSPKVTTHI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 KTKTKTSSPGQKTKSPKTAQSPAMVGSPIRSPKTVSKEKKSPGRSKSPKSPKSPKVTTHI 350 360 370 380 390 400 340 350 360 370 380 390 pF1KE1 PQTPVRPETPNRTPSATLSEKISKETIQVKQIQTPPDAGKLNSENQPKKAVVADKTIEAS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 PQTPVRPETPNRTPSATLSEKISKETIQVKQIQTPPDAGKLNSENQPKKAVVADKTIEAS 410 420 430 440 450 460 400 410 420 430 440 450 pF1KE1 IDAVIARACAEREPDPFEFSSGSESEGDIFTSPKRISGPECTTPKASTSANSFTKSGSTP :::::::::::::::::::::::::::::::::::::::::::::::::::.:::::::: XP_011 IDAVIARACAEREPDPFEFSSGSESEGDIFTSPKRISGPECTTPKASTSANNFTKSGSTP 470 480 490 500 510 520 460 470 480 490 500 510 pF1KE1 LPLSGGTSSSDNSWTMDASIDEVVRKAKLGTPSNMPPNFPYISSPSVSPPTPEPLHKVYE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 LPLSGGTSSSDNSWTMDASIDEVVRKAKLGTPSNMPPNFPYISSPSVSPPTPEPLHKVYE 530 540 550 560 570 580 520 530 540 550 560 570 pF1KE1 EKTKLPSSVEVKKKLKKELKTKMKKKEKQRDREREKDKNKDKSKEKDKVKEKEKDKETGR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 EKTKLPSSVEVKKKLKKELKTKMKKKEKQRDREREKDKNKDKSKEKDKVKEKEKDKETGR 590 600 610 620 630 640 580 590 600 610 620 630 pF1KE1 ETKYPWKEFLKEEEADPYKFKIKEFEDVDPKVKLKDGLVRKEKEKHKDKKKDREKGKKDK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 ETKYPWKEFLKEEEADPYKFKIKEFEDVDPKVKLKDGLVRKEKEKHKDKKKDREKGKKDK 650 660 670 680 690 700 640 650 660 670 680 690 pF1KE1 DKREKEKVKDKGREDKMKAPAPPLVLPPKELALPLFSPATASRVPAMLPSLLPVLPEKLF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 DKREKEKVKDKGREDKMKAPAPPLVLPPKELALPLFSPATASRVPAMLPSLLPVLPEKLF 710 720 730 740 750 760 700 710 720 730 740 750 pF1KE1 EEKEKPKEKEKKKDKKEKKKKKEKEKEKKEKEREKEKREREKREKEKEKHKHEKIKVEPV ::::: :::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 EEKEKVKEKEKKKDKKEKKKKKEKEKEKKEKEREKEKREREKREKEKEKHKHEKIKVEPV 770 780 790 800 810 820 760 770 780 790 800 810 pF1KE1 ALAPSPVIPRLTLRVGAGQDKIVISKVVPAPEAKPAPSQNRPKTPPPAPAPAPGPMLVSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 ALAPSPVIPRLTLRVGAGQDKIVISKVVPAPEAKPAPSQNRPKTPPPAPAPAPGPMLVSP 830 840 850 860 870 880 820 830 840 850 860 870 pF1KE1 APVPLPLLAQAAAGPALLPSPGPAASGASAKAPVRSVVTETVSTYVIRDEWGNQIWICPG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 APVPLPLLAQAAAGPALLPSPGPAASGASAKAPVRSVVTETVSTYVIRDEWGNQIWICPG 890 900 910 920 930 940 880 890 900 910 920 pF1KE1 CNKPDDGSPMIGCDDCDDWYHWPCVGIMTAPPEEMQWFCPKCANKKKDKKHKKRKHRAH ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 CNKPDDGSPMIGCDDCDDWYHWPCVGIMTAPPEEMQWFCPKCANKKKDKKHKKRKHRAH 950 960 970 980 990 1000 >>NP_066554 (OMIM: 105400,162230,616924) neurofilament h (1020 aa) initn: 421 init1: 143 opt: 403 Z-score: 191.2 bits: 46.8 E(85289): 0.00065 Smith-Waterman score: 460; 27.1% identity (50.0% similar) in 664 aa overlap (89-744:419-993) 60 70 80 90 100 110 pF1KE1 TDPILDDVGEAFQLMGVSLHELEDYIHNIEPVTFPHQIPSFPVSKNNVLQFPQPGSKDAE : ..:. .:..: .... . .: NP_066 LNVKMALDIEIAAYRKLLEGEECRIGFGPIPFSLPEGLPKIPSVSTHI-------KVKSE 390 400 410 420 430 440 120 130 140 150 160 170 pF1KE1 ERKEYIPDYLPPIVSSQEEEEEEQVPTDGGTSAEAMQVPLEEDDELEEEEIINDENFLGK :. . . : .:. :: :: :. : : .. :: : : : ..: :. NP_066 EKIKVVEKSEKETVIVEEQTEETQV-TEEVTEEEEKEAKEEEGKEEEGGE--EEEAEGGE 450 460 470 480 490 180 190 200 210 220 230 pF1KE1 RPLDSPEAEELPAMKRPRLLSTKGDTLDVVLLEAREPLSSINTQKIPPMLSPVHVQDSTD . :: ::: : . .. . : ::. : . . .: ::..:.. NP_066 EETKSPPAEEAA--------SPEKEAKSPVKEEAKSPAEAKSPEK-EEAKSPAEVKSPEK 500 510 520 530 540 240 250 260 270 280 290 pF1KE1 LAPPSPEPPMLAPVAKSQMPTAKPLETKSFTPKTKTKTSSPGQKTKSP--KTAQSPAMVG :. : : ::: : . .. .: ....:: .:.::: . :.::: . NP_066 AKSPAKEEAKSPPEAKS------PEKEEAKSP---AEVKSP-EKAKSPAKEEAKSPAEAK 550 560 570 580 590 300 310 320 330 340 350 pF1KE1 SPIRSPKTVSKEKKSPGRSKSPKSPKSPKVTTHIPQTPVRPETPNRTPSATLSEKISKET :: .. . :..: :::...::: . .. ..:.. ..:... : : : : : NP_066 SPEKAKSPVKEEAKSPAEAKSPVKEEA--------KSPAEVKSPEKAKSPTKEEAKSPEK 600 610 620 630 640 650 360 370 380 390 400 410 pF1KE1 IQVKQIQTPPDAGKLNSENQPKKAVVAD-KTIEASIDAVIARACAEREPDPFEFSSGSES . :. . .: .. :. : :. :. : . . : :.: . :. . :. NP_066 AK------SPEKEEAKSPEKAKSPVKAEAKSPEKAKSPVKAEA---KSPEKAKSPVKEEA 660 670 680 690 700 420 430 440 450 460 470 pF1KE1 EGDIFTSPKRISGPECTTPKASTSANSFTKS-GSTPLPLSGGTSSSDNSWTMDASIDEVV . ::.. ..: :. .:.: .: ..:: .. .. .: :. : . NP_066 K-----SPEKAKSPVKEEAKSPEKAKSPVKEEAKTPEKAKSPVKEEAKS-PEKAKSPEKA 710 720 730 740 750 480 490 500 510 520 530 pF1KE1 RKAKLGTPSNMPPNFPYISSPSVSPPTPEPLHKVYEEKTKLPSSVEVKKKLKKELKTKMK . . .: : ::. . : ::.: : ::...:. :.: NP_066 KTLDVKSPEAKTPAKEEARSPADKFP----------EKAKSP----VKEEVKSPEKAKSP 760 770 780 790 800 540 550 560 570 580 590 pF1KE1 KKEKQRDREREKDKNKDKSKEKDKVKEKEKDKETGRETKYPWKEFLKEEEADPYKFKIKE :: . :.: : : . :. :::.:: .:. ..: : :. ::: : : .: NP_066 LKEDAKAPEKEIPK---KEEVKSPVKEEEKPQEV--KVKEPPKK--AEEEKAPATPKTEE 810 820 830 840 850 600 610 620 630 640 650 pF1KE1 FEDVDPKVKLKDGLVRKEKEKHKDKKKDREKGKKDKDKREKEKVKDKGREDKMKAPAP-- .: . : :: : : ..... .: :. : ..: . ::: :.:.: NP_066 KKDSKKEEAPK-------KEAPKPKVEEKKEPAVEKPKESKVEAKKEEAEDKKKVPTPEK 860 870 880 890 900 660 670 680 690 700 710 pF1KE1 --PLVLPPKELALPLFSPATASRVPAMLPSLLPVLPEKLFEEKEKPKEKEKKKDKKEKKK : . :: : : . .:.. : . : : :.:: :: :: ::.: NP_066 EAPAKVEVKEDAKPKEKTEVAKKEPDDAKAKEPSKPA---EKKEAAPEK---KDTKEEKA 910 920 930 940 950 960 720 730 740 750 760 770 pF1KE1 KKEKEKEKKEKEREKEKREREKREKEKEKHKHEKIKVEPVALAPSPVIPRLTLRVGAGQD :: .:: : : : :.. . :: : : :: NP_066 KKPEEKPKTE---AKAKEDDKTLSKEPSKPKAEKAEKSSSTDQKDSKPPEKATEDKAAKG 970 980 990 1000 1010 780 790 800 810 820 830 pF1KE1 KIVISKVVPAPEAKPAPSQNRPKTPPPAPAPAPGPMLVSPAPVPLPLLAQAAAGPALLPS NP_066 K 1020 929 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 04:36:01 2016 done: Sun Nov 6 04:36:04 2016 Total Scan time: 17.560 Total Display time: 0.070 Function used was FASTA [36.3.4 Apr, 2011]