FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1595, 337 aa 1>>>pF1KE1595 337 - 337 aa - 337 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 13.0953+/-0.000524; mu= -19.9288+/- 0.033 mean_var=479.8602+/-102.152, 0's: 0 Z-trim(119.9): 153 B-trim: 63 in 1/57 Lambda= 0.058549 statistics sampled from 34339 (34514) to 34339 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.712), E-opt: 0.2 (0.405), width: 16 Scan time: 7.480 The best scores are: opt bits E(85289) NP_003185 (OMIM: 168600,600075,607136) TATA-box-bi ( 339) 2194 199.5 8.6e-51 NP_001165556 (OMIM: 168600,600075,607136) TATA-box ( 319) 2055 187.7 2.8e-47 NP_950248 (OMIM: 608964) TATA box-binding protein- ( 375) 1231 118.2 2.8e-26 XP_016867002 (OMIM: 605521) PREDICTED: TATA box-bi ( 186) 460 52.8 6.7e-07 NP_004856 (OMIM: 605521) TATA box-binding protein- ( 186) 460 52.8 6.7e-07 NP_001240605 (OMIM: 605521) TATA box-binding prote ( 186) 460 52.8 6.7e-07 XP_011541326 (OMIM: 607537) PREDICTED: mastermind- ( 928) 330 42.4 0.0046 XP_011541325 (OMIM: 607537) PREDICTED: mastermind- (1009) 330 42.4 0.0049 NP_683697 (OMIM: 602081,605317) forkhead box prote ( 432) 318 41.1 0.0052 NP_001166238 (OMIM: 602081,605317) forkhead box pr ( 457) 318 41.1 0.0054 NP_115803 (OMIM: 607537) mastermind-like protein 2 (1156) 330 42.5 0.0055 XP_011541327 (OMIM: 607537) PREDICTED: mastermind- ( 735) 324 41.8 0.0055 NP_683698 (OMIM: 602081,605317) forkhead box prote ( 732) 322 41.6 0.0062 NP_055306 (OMIM: 602081,605317) forkhead box prote ( 715) 318 41.3 0.0076 XP_016868290 (OMIM: 602081,605317) PREDICTED: fork ( 740) 318 41.3 0.0078 NP_683696 (OMIM: 602081,605317) forkhead box prote ( 740) 318 41.3 0.0078 NP_001166237 (OMIM: 602081,605317) forkhead box pr ( 714) 314 41.0 0.0096 >>NP_003185 (OMIM: 168600,600075,607136) TATA-box-bindin (339 aa) initn: 2557 init1: 1830 opt: 2194 Z-score: 1032.9 bits: 199.5 E(85289): 8.6e-51 Smith-Waterman score: 2194; 99.4% identity (99.4% similar) in 339 aa overlap (1-337:1-339) 10 20 30 40 50 60 pF1KE1 MDQNNSLPPYAQGLASPQGAMTPGIPIFSPMMPYGTGLTPQPIQNTNSLSILEEQQRQQQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 MDQNNSLPPYAQGLASPQGAMTPGIPIFSPMMPYGTGLTPQPIQNTNSLSILEEQQRQQQ 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQ--AVAAAAVQQSTSQQATQGTSGQAPQ ::::::::::::::::::::::::::::::::: ::::::::::::::::::::::::: NP_003 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQAVAAAAVQQSTSQQATQGTSGQAPQ 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE1 LFHSQTLTTAPLPGTTPLYPSPMTPMTPITPATPASESSGIVPQLQNIVSTVNLGCKLDL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 LFHSQTLTTAPLPGTTPLYPSPMTPMTPITPATPASESSGIVPQLQNIVSTVNLGCKLDL 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE1 KTIALRARNAEYNPKRFAAVIMRIREPRTTALIFSSGKMVCTGAKSEEQSRLAARKYARV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 KTIALRARNAEYNPKRFAAVIMRIREPRTTALIFSSGKMVCTGAKSEEQSRLAARKYARV 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE1 VQKLGFPAKFLDFKIQNMVGSCDVKFPIRLEGLVLTHQQFSSYEPELFPGLIYRMIKPRI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 VQKLGFPAKFLDFKIQNMVGSCDVKFPIRLEGLVLTHQQFSSYEPELFPGLIYRMIKPRI 250 260 270 280 290 300 300 310 320 330 pF1KE1 VLLIFVSGKVVLTGAKVRAEIYEAFENIYPILKGFRKTT ::::::::::::::::::::::::::::::::::::::: NP_003 VLLIFVSGKVVLTGAKVRAEIYEAFENIYPILKGFRKTT 310 320 330 >>NP_001165556 (OMIM: 168600,600075,607136) TATA-box-bin (319 aa) initn: 2418 init1: 1830 opt: 2055 Z-score: 969.8 bits: 187.7 E(85289): 2.8e-47 Smith-Waterman score: 2055; 99.4% identity (99.4% similar) in 319 aa overlap (21-337:1-319) 10 20 30 40 50 60 pF1KE1 MDQNNSLPPYAQGLASPQGAMTPGIPIFSPMMPYGTGLTPQPIQNTNSLSILEEQQRQQQ :::::::::::::::::::::::::::::::::::::::: NP_001 MTPGIPIFSPMMPYGTGLTPQPIQNTNSLSILEEQQRQQQ 10 20 30 40 70 80 90 100 110 pF1KE1 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQ--AVAAAAVQQSTSQQATQGTSGQAPQ ::::::::::::::::::::::::::::::::: ::::::::::::::::::::::::: NP_001 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQAVAAAAVQQSTSQQATQGTSGQAPQ 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE1 LFHSQTLTTAPLPGTTPLYPSPMTPMTPITPATPASESSGIVPQLQNIVSTVNLGCKLDL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 LFHSQTLTTAPLPGTTPLYPSPMTPMTPITPATPASESSGIVPQLQNIVSTVNLGCKLDL 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE1 KTIALRARNAEYNPKRFAAVIMRIREPRTTALIFSSGKMVCTGAKSEEQSRLAARKYARV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 KTIALRARNAEYNPKRFAAVIMRIREPRTTALIFSSGKMVCTGAKSEEQSRLAARKYARV 170 180 190 200 210 220 240 250 260 270 280 290 pF1KE1 VQKLGFPAKFLDFKIQNMVGSCDVKFPIRLEGLVLTHQQFSSYEPELFPGLIYRMIKPRI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 VQKLGFPAKFLDFKIQNMVGSCDVKFPIRLEGLVLTHQQFSSYEPELFPGLIYRMIKPRI 230 240 250 260 270 280 300 310 320 330 pF1KE1 VLLIFVSGKVVLTGAKVRAEIYEAFENIYPILKGFRKTT ::::::::::::::::::::::::::::::::::::::: NP_001 VLLIFVSGKVVLTGAKVRAEIYEAFENIYPILKGFRKTT 290 300 310 >>NP_950248 (OMIM: 608964) TATA box-binding protein-like (375 aa) initn: 1366 init1: 1150 opt: 1231 Z-score: 592.7 bits: 118.2 E(85289): 2.8e-26 Smith-Waterman score: 1231; 59.6% identity (78.1% similar) in 342 aa overlap (1-335:33-374) 10 20 pF1KE1 MDQNNS-LPPYAQGLASPQGAMTPGIPIFS :.:... : : . :. .: : :.:: NP_950 SAPWPERVPRLLAPRLPSYPPPPPTVGLRSMEQEETYLELYLDQCAAQDGLAPPRSPLFS 10 20 30 40 50 60 30 40 50 60 70 80 pF1KE1 PMMPYGTGL--TPQPIQNTNSLSILEEQQRQQQQQQ----QQQQQQQQQQQQQQQQQQQQ :..:: . . .: :: ..: . . .. . .. :....: .... . NP_950 PVVPYDMYILNASNPDTAFNSNPEVKETSGDFSSVDLSFLPDEVTQENKDQPVISKHETE 70 80 90 100 110 120 90 100 110 120 130 140 pF1KE1 QQQQQQQQQQAVAAAAVQQSTSQQATQGTSGQAPQLFHSQTLTTAPLPGTTPLYPSPMTP .....:. :. . . . :. ... :.. :: ..: .. : : .. NP_950 ENSESQSPQSRLPSPSEQDVGLGLNSSSLSNSHSQLHPGDTDSVQPSPEKPNSDSLSLAS 130 140 150 160 170 180 150 160 170 180 190 200 pF1KE1 MTPITPATPASESSGIVPQLQNIVSTVNLGCKLDLKTIALRARNAEYNPKRFAAVIMRIR .::.:: :: :: :::::::::::::::.:::::: :::.:.::::::::::::::::: NP_950 ITPMTPMTPISECCGIVPQLQNIVSTVNLACKLDLKKIALHAKNAEYNPKRFAAVIMRIR 190 200 210 220 230 240 210 220 230 240 250 260 pF1KE1 EPRTTALIFSSGKMVCTGAKSEEQSRLAARKYARVVQKLGFPAKFLDFKIQNMVGSCDVK :::::::::::::::::::::::::::::::::::::::::::.:::::::::::::::. NP_950 EPRTTALIFSSGKMVCTGAKSEEQSRLAARKYARVVQKLGFPARFLDFKIQNMVGSCDVR 250 260 270 280 290 300 270 280 290 300 310 320 pF1KE1 FPIRLEGLVLTHQQFSSYEPELFPGLIYRMIKPRIVLLIFVSGKVVLTGAKVRAEIYEAF ::::::::::::::::::::::::::::::.:::::::::::::::::::: :.:::::: NP_950 FPIRLEGLVLTHQQFSSYEPELFPGLIYRMVKPRIVLLIFVSGKVVLTGAKERSEIYEAF 310 320 330 340 350 360 330 pF1KE1 ENIYPILKGFRKTT ::::::::::.: NP_950 ENIYPILKGFKKA 370 >>XP_016867002 (OMIM: 605521) PREDICTED: TATA box-bindin (186 aa) initn: 443 init1: 359 opt: 460 Z-score: 244.9 bits: 52.8 E(85289): 6.7e-07 Smith-Waterman score: 460; 42.2% identity (74.0% similar) in 173 aa overlap (163-335:13-183) 140 150 160 170 180 190 pF1KE1 TTPLYPSPMTPMTPITPATPASESSGIVPQLQNIVSTVNLGCKLDLKTIALRARNAEYNP . :.: . :.:.:. :::.. :. :. XP_016 MDADSDVALDILITNVVCVFRTRCHLNLRKIALEGANVIYK- 10 20 30 40 200 210 220 230 240 250 pF1KE1 KRFAAVIMRIREPRTTALIFSSGKMVCTGAKSEEQSRLAARKYARVVQKLGFPAKFLDFK . . :.:..:.:: :: :.::::..:::: :::.....::. :: .::::: . : ::: XP_016 RDVGKVLMKLRKPRITATIWSSGKIICTGATSEEEAKFGARRLARSLQKLGFQVIFTDFK 50 60 70 80 90 100 260 270 280 290 300 310 pF1KE1 IQNMVGSCDVKFPIRLEGLVLTHQQFSSYEPELFPGLIYRMIKPRIVLLIFVSGKVVLTG . :... :.. : ::: .. ... .:::::: :.. ::. . : .: :: .:....:: XP_016 VVNVLAVCNMPFEIRLPEFTKNNRPHASYEPELHPAVCYRIKSLRATLQIFSTGSITVTG 110 120 130 140 150 160 320 330 pF1KE1 AKVRAEIYEAFENIYPILKGFRKTT .:.: . : :.:::.. :: XP_016 PNVKA-VATAVEQIYPFVFESRKEIL 170 180 >>NP_004856 (OMIM: 605521) TATA box-binding protein-like (186 aa) initn: 443 init1: 359 opt: 460 Z-score: 244.9 bits: 52.8 E(85289): 6.7e-07 Smith-Waterman score: 460; 42.2% identity (74.0% similar) in 173 aa overlap (163-335:13-183) 140 150 160 170 180 190 pF1KE1 TTPLYPSPMTPMTPITPATPASESSGIVPQLQNIVSTVNLGCKLDLKTIALRARNAEYNP . :.: . :.:.:. :::.. :. :. NP_004 MDADSDVALDILITNVVCVFRTRCHLNLRKIALEGANVIYK- 10 20 30 40 200 210 220 230 240 250 pF1KE1 KRFAAVIMRIREPRTTALIFSSGKMVCTGAKSEEQSRLAARKYARVVQKLGFPAKFLDFK . . :.:..:.:: :: :.::::..:::: :::.....::. :: .::::: . : ::: NP_004 RDVGKVLMKLRKPRITATIWSSGKIICTGATSEEEAKFGARRLARSLQKLGFQVIFTDFK 50 60 70 80 90 100 260 270 280 290 300 310 pF1KE1 IQNMVGSCDVKFPIRLEGLVLTHQQFSSYEPELFPGLIYRMIKPRIVLLIFVSGKVVLTG . :... :.. : ::: .. ... .:::::: :.. ::. . : .: :: .:....:: NP_004 VVNVLAVCNMPFEIRLPEFTKNNRPHASYEPELHPAVCYRIKSLRATLQIFSTGSITVTG 110 120 130 140 150 160 320 330 pF1KE1 AKVRAEIYEAFENIYPILKGFRKTT .:.: . : :.:::.. :: NP_004 PNVKA-VATAVEQIYPFVFESRKEIL 170 180 >>NP_001240605 (OMIM: 605521) TATA box-binding protein-l (186 aa) initn: 443 init1: 359 opt: 460 Z-score: 244.9 bits: 52.8 E(85289): 6.7e-07 Smith-Waterman score: 460; 42.2% identity (74.0% similar) in 173 aa overlap (163-335:13-183) 140 150 160 170 180 190 pF1KE1 TTPLYPSPMTPMTPITPATPASESSGIVPQLQNIVSTVNLGCKLDLKTIALRARNAEYNP . :.: . :.:.:. :::.. :. :. NP_001 MDADSDVALDILITNVVCVFRTRCHLNLRKIALEGANVIYK- 10 20 30 40 200 210 220 230 240 250 pF1KE1 KRFAAVIMRIREPRTTALIFSSGKMVCTGAKSEEQSRLAARKYARVVQKLGFPAKFLDFK . . :.:..:.:: :: :.::::..:::: :::.....::. :: .::::: . : ::: NP_001 RDVGKVLMKLRKPRITATIWSSGKIICTGATSEEEAKFGARRLARSLQKLGFQVIFTDFK 50 60 70 80 90 100 260 270 280 290 300 310 pF1KE1 IQNMVGSCDVKFPIRLEGLVLTHQQFSSYEPELFPGLIYRMIKPRIVLLIFVSGKVVLTG . :... :.. : ::: .. ... .:::::: :.. ::. . : .: :: .:....:: NP_001 VVNVLAVCNMPFEIRLPEFTKNNRPHASYEPELHPAVCYRIKSLRATLQIFSTGSITVTG 110 120 130 140 150 160 320 330 pF1KE1 AKVRAEIYEAFENIYPILKGFRKTT .:.: . : :.:::.. :: NP_001 PNVKA-VATAVEQIYPFVFESRKEIL 170 180 >>XP_011541326 (OMIM: 607537) PREDICTED: mastermind-like (928 aa) initn: 1235 init1: 287 opt: 330 Z-score: 175.9 bits: 42.4 E(85289): 0.0046 Smith-Waterman score: 346; 44.2% identity (61.9% similar) in 147 aa overlap (14-144:314-458) 10 20 30 40 pF1KE1 MDQNNSLPPYAQGLASPQGAMTPGIPIFSPMMPYGTGLTPQ-- . .:. :: : .:.. ... . : XP_011 NYMYKAGPSAQGGHLDVLMQQKPQDLSRSFINNPHPAMEPRQGNTKPLFHFNSDQANQQM 290 300 310 320 330 340 50 60 70 80 90 pF1KE1 ----PIQNTNSLSILEEQQRQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQ-------QQQ : :: :: .::.:::::::::::::::::::::::::::::: ::: XP_011 PSVLPSQNKPSLLHYTQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQSSISAQQQQQ 350 360 370 380 390 400 100 110 120 130 140 pF1KE1 QQQAVAAAAVQQSTSQQATQGTSGQAPQLFHSQTLTTAP---LPGTTPLYPSPMTPMTPI ::....: ::. .:: : . : : ..: .. : :: . :: ::. :. XP_011 QQSSISAQQQQQQQQQQQQQQQQQQQQQQQQQQQPSSQPAQSLP-SQPLLRSPL-PLQQK 410 420 430 440 450 460 150 160 170 180 190 200 pF1KE1 TPATPASESSGIVPQLQNIVSTVNLGCKLDLKTIALRARNAEYNPKRFAAVIMRIREPRT XP_011 LLLQQMQNQPIAGMGYQVSQQQRQDQHSVVGQNTGPSPSPNPCSNPNTGSGYMNSQQSLL 470 480 490 500 510 520 >>XP_011541325 (OMIM: 607537) PREDICTED: mastermind-like (1009 aa) initn: 1235 init1: 287 opt: 330 Z-score: 175.4 bits: 42.4 E(85289): 0.0049 Smith-Waterman score: 346; 44.2% identity (61.9% similar) in 147 aa overlap (14-144:395-539) 10 20 30 40 pF1KE1 MDQNNSLPPYAQGLASPQGAMTPGIPIFSPMMPYGTGLTPQ-- . .:. :: : .:.. ... . : XP_011 NYMYKAGPSAQGGHLDVLMQQKPQDLSRSFINNPHPAMEPRQGNTKPLFHFNSDQANQQM 370 380 390 400 410 420 50 60 70 80 90 pF1KE1 ----PIQNTNSLSILEEQQRQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQ-------QQQ : :: :: .::.:::::::::::::::::::::::::::::: ::: XP_011 PSVLPSQNKPSLLHYTQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQSSISAQQQQQ 430 440 450 460 470 480 100 110 120 130 140 pF1KE1 QQQAVAAAAVQQSTSQQATQGTSGQAPQLFHSQTLTTAP---LPGTTPLYPSPMTPMTPI ::....: ::. .:: : . : : ..: .. : :: . :: ::. :. XP_011 QQSSISAQQQQQQQQQQQQQQQQQQQQQQQQQQQPSSQPAQSLP-SQPLLRSPL-PLQQK 490 500 510 520 530 540 150 160 170 180 190 200 pF1KE1 TPATPASESSGIVPQLQNIVSTVNLGCKLDLKTIALRARNAEYNPKRFAAVIMRIREPRT XP_011 LLLQQMQNQPIAGMGYQVSQQQRQDQHSVVGQNTGPSPSPNPCSNPNTGSGYMNSQQSLL 550 560 570 580 590 600 >>NP_683697 (OMIM: 602081,605317) forkhead box protein P (432 aa) initn: 821 init1: 298 opt: 318 Z-score: 175.0 bits: 41.1 E(85289): 0.0052 Smith-Waterman score: 318; 47.9% identity (63.9% similar) in 119 aa overlap (44-162:143-258) 20 30 40 50 60 70 pF1KE1 LASPQGAMTPGIPIFSPMMPYGTGLTPQPIQNTNSLSILEEQQRQQQQQQQQQQQQQQQQ :. :..:..::.:::::::::::::::: NP_683 LSPQQLQALLQQQQAVMLQQQQLQEFYKKQQEQLHLQLLQQQQQQQQQQQQQQQQQQQQQ 120 130 140 150 160 170 80 90 100 110 120 130 pF1KE1 QQQQQQQQQQQQQQQQQQQQAVAAAAVQQSTSQQATQGTSGQAPQLFHSQTLTTAPLPGT :::::::::::::::::::. : ::. .:: : .. : .:..: : : NP_683 QQQQQQQQQQQQQQQQQQQHPGKQAKEQQQQQQQQQQLAAQQ--LVFQQQLLQMQQLQQQ 180 190 200 210 220 230 140 150 160 170 180 190 pF1KE1 TPLYPSPMTPMTPITPATPASESSGIVPQLQNIVSTVNLGCKLDLKTIALRARNAEYNPK : . : :. : ... :: NP_683 QHLLSLQRQGLISIPPGQAALPVQSL-PQAGLSPAEIQQLWKEVTGVHSMEDNGIKHGGL 240 250 260 270 280 >>NP_001166238 (OMIM: 602081,605317) forkhead box protei (457 aa) initn: 821 init1: 298 opt: 318 Z-score: 174.7 bits: 41.1 E(85289): 0.0054 Smith-Waterman score: 318; 47.9% identity (63.9% similar) in 119 aa overlap (44-162:168-283) 20 30 40 50 60 70 pF1KE1 LASPQGAMTPGIPIFSPMMPYGTGLTPQPIQNTNSLSILEEQQRQQQQQQQQQQQQQQQQ :. :..:..::.:::::::::::::::: NP_001 LSPQQLQALLQQQQAVMLQQQQLQEFYKKQQEQLHLQLLQQQQQQQQQQQQQQQQQQQQQ 140 150 160 170 180 190 80 90 100 110 120 130 pF1KE1 QQQQQQQQQQQQQQQQQQQQAVAAAAVQQSTSQQATQGTSGQAPQLFHSQTLTTAPLPGT :::::::::::::::::::. : ::. .:: : .. : .:..: : : NP_001 QQQQQQQQQQQQQQQQQQQHPGKQAKEQQQQQQQQQQLAAQQ--LVFQQQLLQMQQLQQQ 200 210 220 230 240 250 140 150 160 170 180 190 pF1KE1 TPLYPSPMTPMTPITPATPASESSGIVPQLQNIVSTVNLGCKLDLKTIALRARNAEYNPK : . : :. : ... :: NP_001 QHLLSLQRQGLISIPPGQAALPVQSL-PQAGLSPAEIQQLWKEVTGVHSMEDNGIKHGGL 260 270 280 290 300 310 337 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 06:36:37 2016 done: Sun Nov 6 06:36:38 2016 Total Scan time: 7.480 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]