FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1249, 494 aa 1>>>pF1KE1249 494 - 494 aa - 494 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 12.5132+/-0.000444; mu= -10.7850+/- 0.028 mean_var=603.1425+/-125.707, 0's: 0 Z-trim(125.1): 27 B-trim: 861 in 1/57 Lambda= 0.052223 statistics sampled from 48121 (48170) to 48121 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.807), E-opt: 0.2 (0.565), width: 16 Scan time: 10.320 The best scores are: opt bits E(85289) NP_001073957 (OMIM: 611294) one cut domain family ( 494) 3398 270.5 7.6e-72 NP_004843 (OMIM: 604894) one cut domain family mem ( 504) 1038 92.7 2.6e-18 NP_004489 (OMIM: 604164) hepatocyte nuclear factor ( 465) 1028 91.9 4.1e-18 XP_016881585 (OMIM: 604894) PREDICTED: one cut dom ( 480) 612 60.6 1.1e-08 XP_011519789 (OMIM: 604164) PREDICTED: hepatocyte ( 402) 593 59.1 2.8e-08 >>NP_001073957 (OMIM: 611294) one cut domain family memb (494 aa) initn: 3398 init1: 3398 opt: 3398 Z-score: 1410.9 bits: 270.5 E(85289): 7.6e-72 Smith-Waterman score: 3398; 100.0% identity (100.0% similar) in 494 aa overlap (1-494:1-494) 10 20 30 40 50 60 pF1KE1 MELSLESLGGLHSVAHAQAGELLSPGHARSAAAQHRGLVAPGRPGLVAGMASLLDGGGGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MELSLESLGGLHSVAHAQAGELLSPGHARSAAAQHRGLVAPGRPGLVAGMASLLDGGGGG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 GGGGAGGAGGAGSAGGGADFRGELAGPLHPAMGMACEAPGLGGTYTTLTPLQHLPPLAAV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 GGGGAGGAGGAGSAGGGADFRGELAGPLHPAMGMACEAPGLGGTYTTLTPLQHLPPLAAV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 ADKFHQHAAAAAVAGAHGGHPHAHPHPAAAPPPPPPPQRLAASVSGSFTLMRDERAALAS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 ADKFHQHAAAAAVAGAHGGHPHAHPHPAAAPPPPPPPQRLAASVSGSFTLMRDERAALAS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 VGHLYGPYGKELPAMGSPLSPLPNALPPALHGAPQPPPPPPPPPLAAYGPPGHLAGDKLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 VGHLYGPYGKELPAMGSPLSPLPNALPPALHGAPQPPPPPPPPPLAAYGPPGHLAGDKLL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 PPAAFEPHAALLGRAEDALARGLPGGGGGTGSGGAGSGSAAGLLAPLGGLAAAGAHGPHG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PPAAFEPHAALLGRAEDALARGLPGGGGGTGSGGAGSGSAAGLLAPLGGLAAAGAHGPHG 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 GGGGPGGSGGGPSAGAAAEEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 GGGGPGGSGGGPSAGAAAEEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE1 RNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEQQKERALQPKKQRLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 RNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEQQKERALQPKKQRLV 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE1 FTDLQRRTLIAIFKENKRPSKEMQVTISQQLGLELNTVSNFFMNARRRCMNRWAEEPSTA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 FTDLQRRTLIAIFKENKRPSKEMQVTISQQLGLELNTVSNFFMNARRRCMNRWAEEPSTA 430 440 450 460 470 480 490 pF1KE1 PGGPAGATATFSKA :::::::::::::: NP_001 PGGPAGATATFSKA 490 >>NP_004843 (OMIM: 604894) one cut domain family member (504 aa) initn: 1167 init1: 949 opt: 1038 Z-score: 449.8 bits: 92.7 E(85289): 2.6e-18 Smith-Waterman score: 1452; 52.4% identity (69.1% similar) in 531 aa overlap (2-494:23-504) 10 20 pF1KE1 MELSLESLGGLHSVAHAQAG------------------E ::..:::: ::. : . .: : NP_004 MKAAYTAYRCLTKDLEGCAMNPELTMESLGTLHGPAGGGSGGGGGGGGGGGGGGPGHEQE 10 20 30 40 50 60 30 40 50 60 70 pF1KE1 LL---SPGHA-RSAAAQHRGLVAPGRPGLVAGMASLLDGGGGGGGGGAGGAGGAGSAGGG :: :: :: :.::.. :: : : :. ....... .. . . .: : NP_004 LLASPSPHHAGRGAAGSLRGPPPPPTAHQELGTAA----AAAAAASRSAMVTSMASILDG 70 80 90 100 110 80 90 100 110 120 130 pF1KE1 ADFRGELAGPLHPAMGMACEA--PGLG--GTYTTLTPLQHLPPLAAVADKFHQHAAAAAV .:.: ::. ::: ::.:.:.. ::.: .::::::::: :::...:.::::. NP_004 GDYRPELSIPLHHAMSMSCDSSPPGMGMSNTYTTLTPLQPLPPISTVSDKFHHP------ 120 130 140 150 160 170 140 150 160 170 180 190 pF1KE1 AGAHGGHPHAHPHPAAAPPPPPPPQRLAASVSGSFTLMRDERAALASVGHLYGPYGKELP ::: ::: :::...::::::::::::. : ....::.:: ::.: NP_004 ------HPHHHPHHHHH----HHHQRLSGNVSGSFTLMRDERG-LPAMNNLYSPY-KEMP 180 190 200 210 200 210 220 230 240 pF1KE1 AMGSPLSPLPNALP-----PALHGAPQPPPPPPPPPLAAYGPPGHLAGDKLLPPAAFEPH .:.. :::: : : .::.: : : :::::: ::.: : : NP_004 GMSQSLSPLA-ATPLGNGLGGLHNAQQSLP--------NYGPPGH---DKMLSPNFDAHH 220 230 240 250 260 250 260 270 280 290 300 pF1KE1 AALLGRAEDALARGLPGGGGGTGSGGAGSGSAAGLLAPLGGLAAAG---AHGPHGGGG-- .:.: :.:. :.::: :. :: ... :.:: : .::: . . NP_004 TAMLTRGEQHLSRGL------------GTPPAA-MMSHLNGLHHPGHTQSHGPVLAPSRE 270 280 290 300 310 310 320 330 340 350 360 pF1KE1 -GPGGSGGGPSAGAAA-EEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLR :..:.:. : .. :::::::::::::::::::::::::::::.::::::::::::: NP_004 RPPSSSSGSQVATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 320 330 340 350 360 370 370 380 390 400 410 420 pF1KE1 NPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEQQKERALQPKKQRLVF :::::::::::::::::::::::::::::::::::::::::::: .:.: . ::.:::: NP_004 NPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEPNKDRNNSQKKSRLVF 380 390 400 410 420 430 430 440 450 460 470 480 pF1KE1 TDLQRRTLIAIFKENKRPSKEMQVTISQQLGLELNTVSNFFMNARRRCMNRWAEEPSTAP ::::::::.::::::::::::::.::::::::::.:::::::::::: ...: .. :: NP_004 TDLQRRTLFAIFKENKRPSKEMQITISQQLGLELTTVSNFFMNARRRSLEKWQDDLST-- 440 450 460 470 480 490 490 pF1KE1 GGPAGATATFSKA :: .....: .:: NP_004 GGSSSTSSTCTKA 500 >>NP_004489 (OMIM: 604164) hepatocyte nuclear factor 6 [ (465 aa) initn: 1320 init1: 963 opt: 1028 Z-score: 446.1 bits: 91.9 E(85289): 4.1e-18 Smith-Waterman score: 1500; 54.3% identity (70.6% similar) in 506 aa overlap (2-494:4-465) 10 20 30 40 50 pF1KE1 MELSLESLGGLHSVAH----AQAGELLSPGHARSAAAQHRGL-VAPGRPGLVAGMASL .:..:..: ::.:.: : : : . ::::..: ::: . :..: . ::::: NP_004 MNAQLTMEAIGELHGVSHEPVPAPADLLGGSPHARSSVA-HRGSHLPPAHPRSM-GMASL 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 LDGGGGGGGGGAGGAGGAGSAGGGADFRGELAGPLHPAMGMACEAP-GLG--GTYTTLTP ::::.::: . : :::::::.: ::::.: :.. ::::::: NP_004 LDGGSGGGDYHHHHRAPEHS----------LAGPLHPTMTMACETPPGMSMPTTYTTLTP 60 70 80 90 100 120 130 140 150 160 pF1KE1 LQHLPPLAAVADKF-HQHAAAAAVAGAHGGHPHAHPHPAAAPPPPPPPQRLAASVSGSFT :: :::...:.::: :.: : : : ::: ::::..:::::: NP_004 LQPLPPISTVSDKFPHHH---------HHHHHHHHPHHH---------QRLAGNVSGSFT 110 120 130 140 150 170 180 190 200 210 220 pF1KE1 LMRDERAALASVGHLYGPYGKELPAMGSPLSPLPNALPPALHGAPQPPPPPPPPPLAAYG ::::::. :::...:: :: :.. .::. :::: .. ..:.. : : :. NP_004 LMRDERG-LASMNNLYTPYHKDVAGMGQSLSPLSSSGLGSIHNSQQGLPH--------YA 160 170 180 190 200 230 240 250 260 270 280 pF1KE1 PPGH-LAGDKLLPPAAFEPH-AALLGR-AEDALARGLPGGGGGTGSGGAGSGSAAGLLAP :: . ::.: : .:: : :.::: .:. :. : ..: . .: . : NP_004 HPGAAMPTDKMLTPNGFEAHHPAMLGRHGEQHLT---PTSAGMVPINGLPPHHPHAHLNA 210 220 230 240 250 290 300 310 320 330 340 pF1KE1 LG-GLAAAGAHGPHGGGGGPGGSGGGPSAGAAAEEINTKEVAQRITAELKRYSIPQAIFA : : . :. :. . : :.: : .. :::::::::::::.::::::::::::: NP_004 QGHGQLLGTAREPNPSVTGAQVSNG--SNSGQMEEINTKEVAQRITTELKRYSIPQAIFA 260 270 280 290 300 310 350 360 370 380 390 400 pF1KE1 QRILCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQE ::.::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 QRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQE 320 330 340 350 360 370 410 420 430 440 450 460 pF1KE1 QQKERALQPKKQRLVFTDLQRRTLIAIFKENKRPSKEMQVTISQQLGLELNTVSNFFMNA . :.:. ::: ::::::.::::: ::::::::::::.:.::::::::::.::::::::: NP_004 HGKDRGNTPKKPRLVFTDVQRRTLHAIFKENKRPSKELQITISQQLGLELSTVSNFFMNA 380 390 400 410 420 430 470 480 490 pF1KE1 RRRCMNRWAEEPSTAPGGPAGATATFSKA ::: ...: .: :. :. .....: .:: NP_004 RRRSLDKWQDEGSSNSGNSSSSSSTCTKA 440 450 460 >>XP_016881585 (OMIM: 604894) PREDICTED: one cut domain (480 aa) initn: 755 init1: 537 opt: 612 Z-score: 276.6 bits: 60.6 E(85289): 1.1e-08 Smith-Waterman score: 1026; 48.7% identity (64.8% similar) in 435 aa overlap (2-398:23-410) 10 20 pF1KE1 MELSLESLGGLHSVAHAQAG------------------E ::..:::: ::. : . .: : XP_016 MKAAYTAYRCLTKDLEGCAMNPELTMESLGTLHGPAGGGSGGGGGGGGGGGGGGPGHEQE 10 20 30 40 50 60 30 40 50 60 70 pF1KE1 LL---SPGHA-RSAAAQHRGLVAPGRPGLVAGMASLLDGGGGGGGGGAGGAGGAGSAGGG :: :: :: :.::.. :: : : :. ....... .. . . .: : XP_016 LLASPSPHHAGRGAAGSLRGPPPPPTAHQELGTAA----AAAAAASRSAMVTSMASILDG 70 80 90 100 110 80 90 100 110 120 130 pF1KE1 ADFRGELAGPLHPAMGMACEA--PGLG--GTYTTLTPLQHLPPLAAVADKFHQHAAAAAV .:.: ::. ::: ::.:.:.. ::.: .::::::::: :::...:.:::: : XP_016 GDYRPELSIPLHHAMSMSCDSSPPGMGMSNTYTTLTPLQPLPPISTVSDKFH-HP----- 120 130 140 150 160 170 140 150 160 170 180 190 pF1KE1 AGAHGGHPHAHPHPAAAPPPPPPPQRLAASVSGSFTLMRDERAALASVGHLYGPYGKELP ::: ::: :::...::::::::::::. : ....::.:: ::.: XP_016 ------HPHHHPHHHHHH----HHQRLSGNVSGSFTLMRDERG-LPAMNNLYSPY-KEMP 180 190 200 210 200 210 220 230 240 pF1KE1 AMGSPLSPLPNALP-----PALHGAPQPPPPPPPPPLAAYGPPGHLAGDKLLPPAAFEPH .:.. :::: : : .::.: : : :::::: ::.: : : XP_016 GMSQSLSPLA-ATPLGNGLGGLHNAQQSLP--------NYGPPGH---DKMLSPNFDAHH 220 230 240 250 260 250 260 270 280 290 300 pF1KE1 AALLGRAEDALARGLPGGGGGTGSGGAGSGSAAGLLAPLGGLAAAG---AHGPHGGGG-- .:.: :.:. :.::: :. :: ... :.:: : .::: . . XP_016 TAMLTRGEQHLSRGL------------GTPPAA-MMSHLNGLHHPGHTQSHGPVLAPSRE 270 280 290 300 310 310 320 330 340 350 360 pF1KE1 -GPGGSGGGPSAGAAA-EEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLR :..:.:. : .. :::::::::::::::::::::::::::::.::::::::::::: XP_016 RPPSSSSGSQVATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 320 330 340 350 360 370 370 380 390 400 410 420 pF1KE1 NPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEQQKERALQPKKQRLVF ::::::::::::::::::::::::::::::::::::: XP_016 NPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAAAILMGMRSNKLSTGRTGCQSSEG 380 390 400 410 420 430 430 440 450 460 470 480 pF1KE1 TDLQRRTLIAIFKENKRPSKEMQVTISQQLGLELNTVSNFFMNARRRCMNRWAEEPSTAP XP_016 EPGFQTPAQCLATSLLKLKRNHLPYSMCKFSRQILLYPALTPGSRRC 440 450 460 470 480 >>XP_011519789 (OMIM: 604164) PREDICTED: hepatocyte nucl (402 aa) initn: 880 init1: 523 opt: 593 Z-score: 269.8 bits: 59.1 E(85289): 2.8e-08 Smith-Waterman score: 1065; 50.2% identity (66.0% similar) in 420 aa overlap (2-409:4-380) 10 20 30 40 50 pF1KE1 MELSLESLGGLHSVAH----AQAGELLSPGHARSAAAQHRGL-VAPGRPGLVAGMASL .:..:..: ::.:.: : : : . ::::..: ::: . :..: . ::::: XP_011 MNAQLTMEAIGELHGVSHEPVPAPADLLGGSPHARSSVA-HRGSHLPPAHPRSM-GMASL 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 LDGGGGGGGGGAGGAGGAGSAGGGADFRGELAGPLHPAMGMACEAP-GLG--GTYTTLTP ::::.::: . : :::::::.: ::::.: :.. ::::::: XP_011 LDGGSGGGDYHHHHRAPEHS----------LAGPLHPTMTMACETPPGMSMPTTYTTLTP 60 70 80 90 100 120 130 140 150 160 170 pF1KE1 LQHLPPLAAVADKFHQHAAAAAVAGAHGGHPHAHPHPAAAPPPPPPPQRLAASVSGSFTL :: :::...:.::: .: : : : ::: ::::..::::::: XP_011 LQPLPPISTVSDKFPHHH--------HHHHHHHHPHHH---------QRLAGNVSGSFTL 110 120 130 140 150 180 190 200 210 220 230 pF1KE1 MRDERAALASVGHLYGPYGKELPAMGSPLSPLPNALPPALHGAPQPPPPPPPPPLAAYGP :::::. :::...:: :: :.. .::. :::: .. ..:.. : : :. XP_011 MRDERG-LASMNNLYTPYHKDVAGMGQSLSPLSSSGLGSIHNSQQGLPH--------YAH 160 170 180 190 200 240 250 260 270 280 pF1KE1 PGH-LAGDKLLPPAAFEPH-AALLGR-AEDALARGLPGGGGGTGSGGAGSGSAAGLLAPL :: . ::.: : .:: : :.::: .:. :. : ..: . .: . : XP_011 PGAAMPTDKMLTPNGFEAHHPAMLGRHGEQHLT---PTSAGMVPINGLPPHHPHAHLNAQ 210 220 230 240 250 290 300 310 320 330 340 pF1KE1 G-GLAAAGAHGPHGGGGGPGGSGGGPSAGAAAEEINTKEVAQRITAELKRYSIPQAIFAQ : : . :. :. . : :.: : .. :::::::::::::.:::::::::::::: XP_011 GHGQLLGTAREPNPSVTGAQVSNG--SNSGQMEEINTKEVAQRITTELKRYSIPQAIFAQ 260 270 280 290 300 310 350 360 370 380 390 400 pF1KE1 RILCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEQ :.::::::::::::::::::::::::::::::::::::::::::::::::: : ..: XP_011 RVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLADQWGKLNKQ 320 330 340 350 360 370 410 420 430 440 450 460 pF1KE1 QKERALQPKKQRLVFTDLQRRTLIAIFKENKRPSKEMQVTISQQLGLELNTVSNFFMNAR .: XP_011 TSEARHLVTSVLAADSATEVRERSG 380 390 400 494 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 04:29:19 2016 done: Tue Nov 8 04:29:21 2016 Total Scan time: 10.320 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]