FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9730, 465 aa
1>>>pF1KB9730 465 - 465 aa - 465 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.6342+/-0.000368; mu= 9.0639+/- 0.023
mean_var=209.1881+/-43.503, 0's: 0 Z-trim(120.0): 68 B-trim: 279 in 1/57
Lambda= 0.088676
statistics sampled from 34513 (34597) to 34513 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.741), E-opt: 0.2 (0.406), width: 16
Scan time: 10.640
The best scores are:                                       opt  bits E(85289)
NP_004489 (OMIM: 604164) hepatocyte nuclear factor ( 465) 3225 425.3 1.8e-118
XP_011519789 (OMIM: 604164) PREDICTED: hepatocyte  ( 402) 2590 344.0 4.5e-94
NP_004843 (OMIM: 604894) one cut domain family mem ( 504) 1849 249.3 1.8e-65
XP_016881585 (OMIM: 604894) PREDICTED: one cut dom ( 480) 1353 185.8 2.2e-46
NP_001073957 (OMIM: 611294) one cut domain family  ( 494) 1028 144.2 7.4e-34
>>NP_004489 (OMIM: 604164) hepatocyte nuclear factor 6 [ (465 aa)
initn: 3225 init1: 3225 opt: 3225 Z-score: 2248.1 bits: 425.3 E(85289): 1.8e-118
Smith-Waterman score: 3225; 100.0% identity (100.0% similar) in 465 aa overlap (1-465:1-465)
               10        20        30        40        50        60
pF1KB9 MNAQLTMEAIGELHGVSHEPVPAPADLLGGSPHARSSVAHRGSHLPPAHPRSMGMASLLD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 MNAQLTMEAIGELHGVSHEPVPAPADLLGGSPHARSSVAHRGSHLPPAHPRSMGMASLLD
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 GGSGGGDYHHHHRAPEHSLAGPLHPTMTMACETPPGMSMPTTYTTLTPLQPLPPISTVSD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 GGSGGGDYHHHHRAPEHSLAGPLHPTMTMACETPPGMSMPTTYTTLTPLQPLPPISTVSD
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 KFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPYHKDVAGMGQSLS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 KFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPYHKDVAGMGQSLS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB9 PLSSSGLGSIHNSQQGLPHYAHPGAAMPTDKMLTPNGFEAHHPAMLGRHGEQHLTPTSAG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 PLSSSGLGSIHNSQQGLPHYAHPGAAMPTDKMLTPNGFEAHHPAMLGRHGEQHLTPTSAG
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB9 MVPINGLPPHHPHAHLNAQGHGQLLGTAREPNPSVTGAQVSNGSNSGQMEEINTKEVAQR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 MVPINGLPPHHPHAHLNAQGHGQLLGTAREPNPSVTGAQVSNGSNSGQMEEINTKEVAQR
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB9 ITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 ITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQ
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB9 RMSALRLAACKRKEQEHGKDRGNTPKKPRLVFTDVQRRTLHAIFKENKRPSKELQITISQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 RMSALRLAACKRKEQEHGKDRGNTPKKPRLVFTDVQRRTLHAIFKENKRPSKELQITISQ
              370       380       390       400       410       420

              430       440       450       460
pF1KB9 QLGLELSTVSNFFMNARRRSLDKWQDEGSSNSGNSSSSSSTCTKA
       :::::::::::::::::::::::::::::::::::::::::::::
NP_004 QLGLELSTVSNFFMNARRRSLDKWQDEGSSNSGNSSSSSSTCTKA
              430       440       450       460

>>XP_011519789 (OMIM: 604164) PREDICTED: hepatocyte nucl (402 aa)
initn: 2590 init1: 2590 opt: 2590 Z-score: 1809.9 bits: 344.0 E(85289): 4.5e-94
Smith-Waterman score: 2590; 100.0% identity (100.0% similar) in 368 aa overlap (1-368:1-368)
               10        20        30        40        50        60
pF1KB9 MNAQLTMEAIGELHGVSHEPVPAPADLLGGSPHARSSVAHRGSHLPPAHPRSMGMASLLD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 MNAQLTMEAIGELHGVSHEPVPAPADLLGGSPHARSSVAHRGSHLPPAHPRSMGMASLLD
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 GGSGGGDYHHHHRAPEHSLAGPLHPTMTMACETPPGMSMPTTYTTLTPLQPLPPISTVSD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 GGSGGGDYHHHHRAPEHSLAGPLHPTMTMACETPPGMSMPTTYTTLTPLQPLPPISTVSD
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 KFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPYHKDVAGMGQSLS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 KFPHHHHHHHHHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPYHKDVAGMGQSLS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB9 PLSSSGLGSIHNSQQGLPHYAHPGAAMPTDKMLTPNGFEAHHPAMLGRHGEQHLTPTSAG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 PLSSSGLGSIHNSQQGLPHYAHPGAAMPTDKMLTPNGFEAHHPAMLGRHGEQHLTPTSAG
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB9 MVPINGLPPHHPHAHLNAQGHGQLLGTAREPNPSVTGAQVSNGSNSGQMEEINTKEVAQR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 MVPINGLPPHHPHAHLNAQGHGQLLGTAREPNPSVTGAQVSNGSNSGQMEEINTKEVAQR
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB9 ITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 ITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQ
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB9 RMSALRLAACKRKEQEHGKDRGNTPKKPRLVFTDVQRRTLHAIFKENKRPSKELQITISQ
       ::::::::
XP_011 RMSALRLADQWGKLNKQTSEARHLVTSVLAADSATEVRERSG
              370       380       390       400

>>NP_004843 (OMIM: 604894) one cut domain family member (504 aa)
initn: 1348 init1: 1017 opt: 1849 Z-score: 1296.3 bits: 249.3 E(85289): 1.8e-65
Smith-Waterman score: 1890; 63.4% identity (76.2% similar) in 513 aa overlap (1-465:20-504)
10 20 30
pF1KB9 MNAQLTMEAIGELHGVSHEPVPAPADLLGG-----------
:: .::::..: ::: . . . ::
NP_004 MKAAYTAYRCLTKDLEGCAMNPELTMESLGTLHGPAGGGSGGGGGGGGGGGGGGPGHEQE
10 20 30 40 50 60
40 50 60
pF1KB9 -----SPH--ARSSVAH-RGSHLPPAHPRSMG-----------------MASLLDGGSGG
::: .:.... :: ::. . .: :::.:: :
NP_004 LLASPSPHHAGRGAAGSLRGPPPPPTAHQELGTAAAAAAAASRSAMVTSMASILD----G
70 80 90 100 110
70 80 90 100 110 120
pF1KB9 GDYHHHHRAPEHSLAGPLHPTMTMACET-PPGMSMPTTYTTLTPLQPLPPISTVSDKFPH
:::. :: :. ::: .:.:.:.. ::::.: .::::::::::::::::::::: :
NP_004 GDYR-----PELSI--PLHHAMSMSCDSSPPGMGMSNTYTTLTPLQPLPPISTVSDKFHH
120 130 140 150 160
130 140 150 160 170 180
pF1KB9 HHHHHH-HHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPYHKDVAGMGQSLSPLS
: ::: :::: ::::::.:::::::::::::::: .:::::.:: :.. ::.::::::.
NP_004 PHPHHHPHHHHHHHHQRLSGNVSGSFTLMRDERGLPAMNNLYSPY-KEMPGMSQSLSPLA
170 180 190 200 210 220
190 200 210 220 230
pF1KB9 SS----GLGSIHNSQQGLPHYAHPGAAMPTDKMLTPNGFEAHHPAMLGRHGEQHL-----
.. :::..::.::.::.:. :: ::::.:: :.::: ::: : :::::
NP_004 ATPLGNGLGGLHNAQQSLPNYGPPGH----DKMLSPN-FDAHHTAMLTR-GEQHLSRGLG
230 240 250 260 270 280
240 250 260 270 280 290
pF1KB9 TPTSAGMVPINGLPPHHPHAHLNAQGHGQLLGTARE-PNPSVTGAQVSNGSNSGQMEEIN
:: .: : .::: ::: .: .:.:: .:. .:: : : .:.::.. :::.::::
NP_004 TPPAAMMSHLNGL--HHP-GH--TQSHGPVLAPSRERPPSSSSGSQVAT---SGQLEEIN
290 300 310 320 330
300 310 320 330 340 350
pF1KB9 TKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKW
:::::::::.::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 TKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKW
340 350 360 370 380 390
360 370 380 390 400 410
pF1KB9 LQEPEFQRMSALRLAACKRKEQEHGKDRGNTPKKPRLVFTDVQRRTLHAIFKENKRPSKE
::::::::::::::::::::::: .:::.:. :: ::::::.::::: ::::::::::::
NP_004 LQEPEFQRMSALRLAACKRKEQEPNKDRNNSQKKSRLVFTDLQRRTLFAIFKENKRPSKE
400 410 420 430 440 450
420 430 440 450 460
pF1KB9 LQITISQQLGLELSTVSNFFMNARRRSLDKWQDEGSSNSGNSSSSSSTCTKA
.::::::::::::.::::::::::::::.::::. :. :.:::.:::::::
NP_004 MQITISQQLGLELTTVSNFFMNARRRSLEKWQDDLST--GGSSSTSSTCTKA
460 470 480 490 500
>>XP_016881585 (OMIM: 604894) PREDICTED: one cut domain (480 aa)
initn: 882 init1: 552 opt: 1353 Z-score: 953.6 bits: 185.8 E(85289): 2.2e-46
Smith-Waterman score: 1394; 59.2% identity (72.4% similar) in 417 aa overlap (1-369:20-410)
10 20 30
pF1KB9 MNAQLTMEAIGELHGVSHEPVPAPADLLGG-----------
:: .::::..: ::: . . . ::
XP_016 MKAAYTAYRCLTKDLEGCAMNPELTMESLGTLHGPAGGGSGGGGGGGGGGGGGGPGHEQE
10 20 30 40 50 60
40 50 60
pF1KB9 -----SPH--ARSSVAH-RGSHLPPAHPRSMG-----------------MASLLDGGSGG
::: .:.... :: ::. . .: :::.::::
XP_016 LLASPSPHHAGRGAAGSLRGPPPPPTAHQELGTAAAAAAAASRSAMVTSMASILDGG---
70 80 90 100 110
70 80 90 100 110 120
pF1KB9 GDYHHHHRAPEHSLAGPLHPTMTMACET-PPGMSMPTTYTTLTPLQPLPPISTVSDKFPH
::. :: :. ::: .:.:.:.. ::::.: .::::::::::::::::::::: :
XP_016 -DYR-----PELSI--PLHHAMSMSCDSSPPGMGMSNTYTTLTPLQPLPPISTVSDKFHH
120 130 140 150 160
130 140 150 160 170 180
pF1KB9 HHHHHH-HHHHPHHHQRLAGNVSGSFTLMRDERGLASMNNLYTPYHKDVAGMGQSLSPLS
: ::: :::: ::::::.:::::::::::::::: .:::::.:: :.. ::.::::::.
XP_016 PHPHHHPHHHHHHHHQRLSGNVSGSFTLMRDERGLPAMNNLYSPY-KEMPGMSQSLSPLA
170 180 190 200 210 220
190 200 210 220 230
pF1KB9 SS----GLGSIHNSQQGLPHYAHPGAAMPTDKMLTPNGFEAHHPAMLGRHGEQHL-----
.. :::..::.::.::.:. :: ::::.:: :.::: ::: : :::::
XP_016 ATPLGNGLGGLHNAQQSLPNYGPPGH----DKMLSPN-FDAHHTAMLTR-GEQHLSRGLG
230 240 250 260 270 280
240 250 260 270 280 290
pF1KB9 TPTSAGMVPINGLPPHHPHAHLNAQGHGQLLGTARE-PNPSVTGAQVSNGSNSGQMEEIN
:: .: : .::: ::: .: .:.:: .:. .:: : : .:.::. .:::.::::
XP_016 TPPAAMMSHLNGL--HHP-GH--TQSHGPVLAPSRERPPSSSSGSQVA---TSGQLEEIN
290 300 310 320 330
300 310 320 330 340 350
pF1KB9 TKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKW
:::::::::.::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 TKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKW
340 350 360 370 380 390
360 370 380 390 400 410
pF1KB9 LQEPEFQRMSALRLAACKRKEQEHGKDRGNTPKKPRLVFTDVQRRTLHAIFKENKRPSKE
::::::::::::::::
XP_016 LQEPEFQRMSALRLAAAILMGMRSNKLSTGRTGCQSSEGEPGFQTPAQCLATSLLKLKRN
400 410 420 430 440 450
>>NP_001073957 (OMIM: 611294) one cut domain family memb (494 aa)
initn: 1320 init1: 963 opt: 1028 Z-score: 728.8 bits: 144.2 E(85289): 7.4e-34
Smith-Waterman score: 1500; 54.3% identity (70.6% similar) in 506 aa overlap (4-465:2-494)
10 20 30 40 50
pF1KB9 MNAQLTMEAIGELHGVSHEPVPAPADLLGGSPHARSSVA-HRGSHLPPAHPRSM-GMASL
.:..:..: ::.:.: : : : . ::::..: ::: . :..: . :::::
NP_001 MELSLESLGGLHSVAH----AQAGELLSPGHARSAAAQHRGL-VAPGRPGLVAGMASL
10 20 30 40 50
60 70 80 90 100
pF1KB9 LDGGSGGGDYHHHHRAPEHS----------LAGPLHPTMTMACETPPGMSMPTTYTTLTP
::::.::: . : :::::::.: ::::.: :.. :::::::
NP_001 LDGGGGGGGGGAGGAGGAGSAGGGADFRGELAGPLHPAMGMACEAP-GLG--GTYTTLTP
60 70 80 90 100 110
110 120 130 140 150
pF1KB9 LQPLPPISTVSDKFPHHH---------HHHHHHHHPHHH---------QRLAGNVSGSFT
:: :::...:.::: :.: : : : ::: ::::..::::::
NP_001 LQHLPPLAAVADKF-HQHAAAAAVAGAHGGHPHAHPHPAAAPPPPPPPQRLAASVSGSFT
120 130 140 150 160
160 170 180 190 200
pF1KB9 LMRDERG-LASMNNLYTPYHKDVAGMGQSLSPLSSSGLGSIHNSQQGLPH--------YA
::::::. :::...:: :: :.. .::. :::: .. ..:.. : : :.
NP_001 LMRDERAALASVGHLYGPYGKELPAMGSPLSPLPNALPPALHGAPQPPPPPPPPPLAAYG
170 180 190 200 210 220
210 220 230 240 250
pF1KB9 HPGAAMPTDKMLTPNGFEAHHPAMLGRHGEQHLT---PTSAGMVPINGLPPHHPHAHLNA
:: . ::.: : .:: : :.::: .:. :. : ..: . .: . :
NP_001 PPGH-LAGDKLLPPAAFEPH-AALLGR-AEDALARGLPGGGGGTGSGGAGSGSAAGLLAP
230 240 250 260 270 280
260 270 280 290 300 310
pF1KB9 QGHGQLLGTAREPNPSVTGAQVSNG--SNSGQMEEINTKEVAQRITTELKRYSIPQAIFA
: : . :. :. . : :.: : .. :::::::::::::.:::::::::::::
NP_001 LG-GLAAAGAHGPHGGGGGPGGSGGGPSAGAAAEEINTKEVAQRITAELKRYSIPQAIFA
290 300 310 320 330 340
320 330 340 350 360 370
pF1KB9 QRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQE
::.:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 QRILCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQE
350 360 370 380 390 400
380 390 400 410 420 430
pF1KB9 HGKDRGNTPKKPRLVFTDVQRRTLHAIFKENKRPSKELQITISQQLGLELSTVSNFFMNA
. :.:. ::: ::::::.::::: ::::::::::::.:.::::::::::.:::::::::
NP_001 QQKERALQPKKQRLVFTDLQRRTLIAIFKENKRPSKEMQVTISQQLGLELNTVSNFFMNA
410 420 430 440 450 460
440 450 460
pF1KB9 RRRSLDKWQDEGSSNSGNSSSSSSTCTKA
::: ...: .: :. :. .....: .::
NP_001 RRRCMNRWAEEPSTAPGGPAGATATFSKA
470 480 490
465 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 18:33:56 2016 done: Fri Nov 4 18:33:58 2016
Total Scan time: 10.640 Total Display time: 0.030
Function used was FASTA [36.3.4 Apr, 2011]