FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3424, 504 aa 1>>>pF1KE3424 504 - 504 aa - 504 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 9.0694+/-0.000369; mu= 4.2277+/- 0.023 mean_var=333.1524+/-67.646, 0's: 0 Z-trim(123.5): 43 B-trim: 0 in 0/56 Lambda= 0.070267 statistics sampled from 43284 (43329) to 43284 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.787), E-opt: 0.2 (0.508), width: 16 Scan time: 10.640 The best scores are: opt bits E(85289) NP_004843 (OMIM: 604894) one cut domain family mem ( 504) 3466 365.0 2.9e-100 XP_016881585 (OMIM: 604894) PREDICTED: one cut dom ( 480) 2860 303.5 8.8e-82 NP_004489 (OMIM: 604164) hepatocyte nuclear factor ( 465) 1849 201.0 6.2e-51 XP_011519789 (OMIM: 604164) PREDICTED: hepatocyte ( 402) 1348 150.1 1.1e-35 NP_001073957 (OMIM: 611294) one cut domain family ( 494) 1038 118.8 3.6e-26 >>NP_004843 (OMIM: 604894) one cut domain family member (504 aa) initn: 3466 init1: 3466 opt: 3466 Z-score: 1921.0 bits: 365.0 E(85289): 2.9e-100 Smith-Waterman score: 3466; 100.0% identity (100.0% similar) in 504 aa overlap (1-504:1-504) 10 20 30 40 50 60 pF1KE3 MKAAYTAYRCLTKDLEGCAMNPELTMESLGTLHGPAGGGSGGGGGGGGGGGGGGPGHEQE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 MKAAYTAYRCLTKDLEGCAMNPELTMESLGTLHGPAGGGSGGGGGGGGGGGGGGPGHEQE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 LLASPSPHHAGRGAAGSLRGPPPPPTAHQELGTAAAAAAAASRSAMVTSMASILDGGDYR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 LLASPSPHHAGRGAAGSLRGPPPPPTAHQELGTAAAAAAAASRSAMVTSMASILDGGDYR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 PELSIPLHHAMSMSCDSSPPGMGMSNTYTTLTPLQPLPPISTVSDKFHHPHPHHHPHHHH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 PELSIPLHHAMSMSCDSSPPGMGMSNTYTTLTPLQPLPPISTVSDKFHHPHPHHHPHHHH 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 HHHHQRLSGNVSGSFTLMRDERGLPAMNNLYSPYKEMPGMSQSLSPLAATPLGNGLGGLH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 HHHHQRLSGNVSGSFTLMRDERGLPAMNNLYSPYKEMPGMSQSLSPLAATPLGNGLGGLH 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 NAQQSLPNYGPPGHDKMLSPNFDAHHTAMLTRGEQHLSRGLGTPPAAMMSHLNGLHHPGH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 NAQQSLPNYGPPGHDKMLSPNFDAHHTAMLTRGEQHLSRGLGTPPAAMMSHLNGLHHPGH 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 TQSHGPVLAPSRERPPSSSSGSQVATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 TQSHGPVLAPSRERPPSSSSGSQVATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRV 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE3 LCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEPNK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 LCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEPNK 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE3 DRNNSQKKSRLVFTDLQRRTLFAIFKENKRPSKEMQITISQQLGLELTTVSNFFMNARRR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 DRNNSQKKSRLVFTDLQRRTLFAIFKENKRPSKEMQITISQQLGLELTTVSNFFMNARRR 430 440 450 460 470 480 490 500 pF1KE3 SLEKWQDDLSTGGSSSTSSTCTKA :::::::::::::::::::::::: NP_004 SLEKWQDDLSTGGSSSTSSTCTKA 490 500 >>XP_016881585 (OMIM: 604894) PREDICTED: one cut domain (480 aa) initn: 2860 init1: 2860 opt: 2860 Z-score: 1589.2 bits: 303.5 E(85289): 8.8e-82 Smith-Waterman score: 2860; 100.0% identity (100.0% similar) in 410 aa overlap (1-410:1-410) 10 20 30 40 50 60 pF1KE3 MKAAYTAYRCLTKDLEGCAMNPELTMESLGTLHGPAGGGSGGGGGGGGGGGGGGPGHEQE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 MKAAYTAYRCLTKDLEGCAMNPELTMESLGTLHGPAGGGSGGGGGGGGGGGGGGPGHEQE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 LLASPSPHHAGRGAAGSLRGPPPPPTAHQELGTAAAAAAAASRSAMVTSMASILDGGDYR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 LLASPSPHHAGRGAAGSLRGPPPPPTAHQELGTAAAAAAAASRSAMVTSMASILDGGDYR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 PELSIPLHHAMSMSCDSSPPGMGMSNTYTTLTPLQPLPPISTVSDKFHHPHPHHHPHHHH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 PELSIPLHHAMSMSCDSSPPGMGMSNTYTTLTPLQPLPPISTVSDKFHHPHPHHHPHHHH 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 HHHHQRLSGNVSGSFTLMRDERGLPAMNNLYSPYKEMPGMSQSLSPLAATPLGNGLGGLH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 HHHHQRLSGNVSGSFTLMRDERGLPAMNNLYSPYKEMPGMSQSLSPLAATPLGNGLGGLH 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 NAQQSLPNYGPPGHDKMLSPNFDAHHTAMLTRGEQHLSRGLGTPPAAMMSHLNGLHHPGH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 NAQQSLPNYGPPGHDKMLSPNFDAHHTAMLTRGEQHLSRGLGTPPAAMMSHLNGLHHPGH 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 TQSHGPVLAPSRERPPSSSSGSQVATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 TQSHGPVLAPSRERPPSSSSGSQVATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRV 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE3 LCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEPNK :::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 LCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAAAILMGMRSNK 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE3 DRNNSQKKSRLVFTDLQRRTLFAIFKENKRPSKEMQITISQQLGLELTTVSNFFMNARRR XP_016 LSTGRTGCQSSEGEPGFQTPAQCLATSLLKLKRNHLPYSMCKFSRQILLYPALTPGSRRC 430 440 450 460 470 480 >>NP_004489 (OMIM: 604164) hepatocyte nuclear factor 6 [ (465 aa) initn: 1348 init1: 1017 opt: 1849 Z-score: 1035.5 bits: 201.0 E(85289): 6.2e-51 Smith-Waterman score: 1890; 63.4% identity (76.3% similar) in 514 aa overlap (20-504:1-465) 10 20 30 40 50 60 pF1KE3 MKAAYTAYRCLTKDLEGCAMNPELTMESLGTLHGPAGGGSGGGGGGGGGGGGGGPGHEQE :: .::::..: ::: . . . :: NP_004 MNAQLTMEAIGELHGVSHEPVPAPADLLGG----------- 10 20 30 70 80 90 100 110 pF1KE3 LLASPSPHHAGRGAAGSLRGPPPPPTAHQELGTAAAAAAAASRSAMVTSMASILDGG--- ::: .:.... :: ::. . .: :::.:::: NP_004 -----SPH--ARSSVAH-RGSHLPPAHPRSMG-----------------MASLLDGGSGG 40 50 60 120 130 140 150 160 pF1KE3 -DYR-----PELSI--PLHHAMSMSCDSSPPGMGMSNTYTTLTPLQPLPPISTVSDKFHH ::. :: :. ::: .:.:.:.. ::::.: .::::::::::::::::::::: NP_004 GDYHHHHRAPEHSLAGPLHPTMTMACET-PPGMSMPTTYTTLTPLQPLPPISTVSDKF-- 70 80 90 100 110 120 170 180 190 200 210 220 pF1KE3 PHPHHHPHHHHH-HHHQRLSGNVSGSFTLMRDERGLPAMNNLYSPY-KEMPGMSQSLSPL :: ::: ::::: ::::::.:::::::::::::::: .:::::.:: :.. ::.:::::: NP_004 PHHHHHHHHHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPYHKDVAGMGQSLSPL 130 140 150 160 170 180 230 240 250 260 270 280 pF1KE3 AATPLGNGLGGLHNAQQSLPNYGPPGH----DKMLSPN-FDAHHTAMLTR-GEQHLSRGL ... :::..::.::.::.:. :: ::::.:: :.::: ::: : ::::: NP_004 SSS----GLGSIHNSQQGLPHYAHPGAAMPTDKMLTPNGFEAHHPAMLGRHGEQHL---- 190 200 210 220 230 290 300 310 320 330 pF1KE3 GTPPAAMMSHLNGL--HHP-GH--TQSHGPVLAPSRERPPSSSSGSQVAT---SGQLEEI :: .: : .::: ::: .: .:.:: .:. .:: : : .:.::.. :::.::: NP_004 -TPTSAGMVPINGLPPHHPHAHLNAQGHGQLLGTARE-PNPSVTGAQVSNGSNSGQMEEI 240 250 260 270 280 290 340 350 360 370 380 390 pF1KE3 NTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK ::::::::::.::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 NTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 300 310 320 330 340 350 400 410 420 430 440 450 pF1KE3 WLQEPEFQRMSALRLAACKRKEQEPNKDRNNSQKKSRLVFTDLQRRTLFAIFKENKRPSK :::::::::::::::::::::::: .:::.:. :: ::::::.::::: ::::::::::: NP_004 WLQEPEFQRMSALRLAACKRKEQEHGKDRGNTPKKPRLVFTDVQRRTLHAIFKENKRPSK 360 370 380 390 400 410 460 470 480 490 500 pF1KE3 EMQITISQQLGLELTTVSNFFMNARRRSLEKWQDDLST--GGSSSTSSTCTKA :.::::::::::::.::::::::::::::.::::. :. :.:::.::::::: NP_004 ELQITISQQLGLELSTVSNFFMNARRRSLDKWQDEGSSNSGNSSSSSSTCTKA 420 430 440 450 460 >>XP_011519789 (OMIM: 604164) PREDICTED: hepatocyte nucl (402 aa) initn: 877 init1: 547 opt: 1348 Z-score: 761.7 bits: 150.1 E(85289): 1.1e-35 Smith-Waterman score: 1389; 59.2% identity (72.4% similar) in 417 aa overlap (20-409:1-368) 10 20 30 40 50 60 pF1KE3 MKAAYTAYRCLTKDLEGCAMNPELTMESLGTLHGPAGGGSGGGGGGGGGGGGGGPGHEQE :: .::::..: ::: . . . :: XP_011 MNAQLTMEAIGELHGVSHEPVPAPADLLGG----------- 10 20 30 70 80 90 100 110 pF1KE3 LLASPSPHHAGRGAAGSLRGPPPPPTAHQELGTAAAAAAAASRSAMVTSMASILDGG--- ::: .:.... :: ::. . .: :::.:::: XP_011 -----SPH--ARSSVAH-RGSHLPPAHPRSMG-----------------MASLLDGGSGG 40 50 60 120 130 140 150 160 pF1KE3 -DYR-----PELSI--PLHHAMSMSCDSSPPGMGMSNTYTTLTPLQPLPPISTVSDKFHH ::. :: :. ::: .:.:.:.. ::::.: .::::::::::::::::::::: XP_011 GDYHHHHRAPEHSLAGPLHPTMTMACET-PPGMSMPTTYTTLTPLQPLPPISTVSDKF-- 70 80 90 100 110 120 170 180 190 200 210 220 pF1KE3 PHPHHHPHHHHH-HHHQRLSGNVSGSFTLMRDERGLPAMNNLYSPY-KEMPGMSQSLSPL :: ::: ::::: ::::::.:::::::::::::::: .:::::.:: :.. ::.:::::: XP_011 PHHHHHHHHHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPYHKDVAGMGQSLSPL 130 140 150 160 170 180 230 240 250 260 270 280 pF1KE3 AATPLGNGLGGLHNAQQSLPNYGPPGH----DKMLSPN-FDAHHTAMLTR-GEQHLSRGL ... :::..::.::.::.:. :: ::::.:: :.::: ::: : ::::: XP_011 SSS----GLGSIHNSQQGLPHYAHPGAAMPTDKMLTPNGFEAHHPAMLGRHGEQHL---- 190 200 210 220 230 290 300 310 320 330 pF1KE3 GTPPAAMMSHLNGL--HHP-GH--TQSHGPVLAPSRERPPSSSSGSQVAT---SGQLEEI :: .: : .::: ::: .: .:.:: .:. .:: : : .:.::.. :::.::: XP_011 -TPTSAGMVPINGLPPHHPHAHLNAQGHGQLLGTARE-PNPSVTGAQVSNGSNSGQMEEI 240 250 260 270 280 290 340 350 360 370 380 390 pF1KE3 NTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK ::::::::::.::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 NTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWK 300 310 320 330 340 350 400 410 420 430 440 450 pF1KE3 WLQEPEFQRMSALRLAACKRKEQEPNKDRNNSQKKSRLVFTDLQRRTLFAIFKENKRPSK :::::::::::::::: XP_011 WLQEPEFQRMSALRLADQWGKLNKQTSEARHLVTSVLAADSATEVRERSG 360 370 380 390 400 >>NP_001073957 (OMIM: 611294) one cut domain family memb (494 aa) initn: 1167 init1: 949 opt: 1038 Z-score: 590.8 bits: 118.8 E(85289): 3.6e-26 Smith-Waterman score: 1452; 52.5% identity (69.5% similar) in 531 aa overlap (23-504:2-494) 10 20 30 40 50 60 pF1KE3 MKAAYTAYRCLTKDLEGCAMNPELTMESLGTLHGPAGGGSGGGGGGGGGGGGGGPGHEQE ::..:::: ::. : . .: : NP_001 MELSLESLGGLHSVAHAQAG------------------E 10 20 70 80 90 100 110 pF1KE3 LLASPSPHHAGRGAAGSLRGPPPPPTAHQELGTAA----AAAAAASRSAMVTSMASILDG :: :: :: :.::.. :: : : :. ....... .. . . .: : NP_001 LL---SPGHA-RSAAAQHRGLVAPGRPGLVAGMASLLDGGGGGGGGGAGGAGGAGSAGGG 30 40 50 60 70 120 130 140 150 160 170 pF1KE3 GDYRPELSIPLHHAMSMSCDSSPPGMGMSNTYTTLTPLQPLPPISTVSDKFHHP------ .:.: ::. ::: ::.:.:.. ::.: .::::::::: :::...:.::::. NP_001 ADFRGELAGPLHPAMGMACEA--PGLG--GTYTTLTPLQHLPPLAAVADKFHQHAAAAAV 80 90 100 110 120 130 180 190 200 210 pF1KE3 ------HPHHHPHHHHHHHH----QRLSGNVSGSFTLMRDERG-LPAMNNLYSPY-KEMP ::: ::: :::...::::::::::::. : ....::.:: ::.: NP_001 AGAHGGHPHAHPHPAAAPPPPPPPQRLAASVSGSFTLMRDERAALASVGHLYGPYGKELP 140 150 160 170 180 190 220 230 240 250 260 pF1KE3 GMSQSLSPLAATPLGNGLG-GLHNAQQSLP--------NYGPPGH---DKMLSPNFDAHH .:. ::: .:: :.: .::.: : : :::::: ::.: : : NP_001 AMG---SPL--SPLPNALPPALHGAPQPPPPPPPPPLAAYGPPGHLAGDKLLPPAAFEPH 200 210 220 230 240 270 280 290 300 310 pF1KE3 TAMLTRGEQHLSRGL------------GTPPAA-MMSHLNGLHHPGHTQSHGPVLAPSRE .:.: :.:. :.::: :. :: ... :.:: : .::: . . NP_001 AALLGRAEDALARGLPGGGGGTGSGGAGSGSAAGLLAPLGGLAAAG---AHGPHGGGG-- 250 260 270 280 290 300 320 330 340 350 360 370 pF1KE3 RPPSSSSGSQVATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLR :..:.:. : .. :::::::::::::::::::::::::::::.::::::::::::: NP_001 -GPGGSGGGPSAGAAA-EEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLR 310 320 330 340 350 360 380 390 400 410 420 430 pF1KE3 NPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEPNKDRNNSQKKSRLVF :::::::::::::::::::::::::::::::::::::::::::: .:.: . ::.:::: NP_001 NPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEQQKERALQPKKQRLVF 370 380 390 400 410 420 440 450 460 470 480 490 pF1KE3 TDLQRRTLFAIFKENKRPSKEMQITISQQLGLELTTVSNFFMNARRRSLEKWQDDLST-- ::::::::.::::::::::::::.::::::::::.:::::::::::: ...: .. :: NP_001 TDLQRRTLIAIFKENKRPSKEMQVTISQQLGLELNTVSNFFMNARRRCMNRWAEEPSTAP 430 440 450 460 470 480 500 pF1KE3 GGSSSTSSTCTKA :: .....: .:: NP_001 GGPAGATATFSKA 490 504 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 04:32:38 2016 done: Tue Nov 8 04:32:39 2016 Total Scan time: 10.640 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]