FASTA searches a protein or DNA sequence data bank
 version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9595, 553 aa
1>>>pF1KB9595 553 - 553 aa - 553 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 14.0995+/-0.000416; mu= -19.3295+/- 0.026
mean_var=609.4967+/-126.141, 0's: 0 Z-trim(125.9): 115 B-trim: 0 in 0/61
Lambda= 0.051950
statistics sampled from 50452 (50600) to 50452 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.826), E-opt: 0.2 (0.593), width: 16
Scan time: 9.420
The best scores are: opt bits E(85289)
NP_001444 (OMIM: 601090,601631,602482) forkhead bo ( 553) 3820 300.9 6.9e-81
NP_005242 (OMIM: 153400,602402) forkhead box prote ( 501) 1171 102.3 3.8e-21
NP_004109 (OMIM: 602939) forkhead box protein S1 [ ( 330) 655 63.5 1.2e-09
NP_004463 (OMIM: 601091) forkhead box protein D1 [ ( 465) 649 63.2 2.1e-09
NP_004465 (OMIM: 602211) forkhead box protein D2 [ ( 495) 644 62.8 2.9e-09
NP_005241 (OMIM: 603252) forkhead box protein L1 [ ( 345) 637 62.1 3.2e-09
NP_004464 (OMIM: 241850,602617,616534) forkhead bo ( 373) 629 61.6 5.1e-09
XP_016876735 (OMIM: 602294) PREDICTED: hepatocyte ( 439) 622 61.1 8.3e-09
NP_004487 (OMIM: 602294) hepatocyte nuclear factor ( 472) 622 61.1 8.8e-09
NP_036318 (OMIM: 107250,601094,610256) forkhead bo ( 319) 611 60.2 1.2e-08
NP_710141 (OMIM: 600288) hepatocyte nuclear factor ( 457) 589 58.7 4.8e-08
NP_068556 (OMIM: 600288) hepatocyte nuclear factor ( 463) 589 58.7 4.8e-08
NP_036316 (OMIM: 611084) forkhead box protein D4-l ( 408) 570 57.2 1.2e-07
NP_036320 (OMIM: 274600,600791,601093) forkhead bo ( 378) 560 56.4 1.9e-07
NP_036315 (OMIM: 607836,611539) forkhead box prote ( 478) 560 56.5 2.2e-07
NP_075555 (OMIM: 110100,605597,608996) forkhead bo ( 376) 548 55.5 3.5e-07
NP_004488 (OMIM: 602295) hepatocyte nuclear factor ( 350) 545 55.3 3.8e-07
NP_001129121 (OMIM: 612351) forkhead box protein I ( 420) 539 54.9 6e-07
NP_954714 (OMIM: 611085) forkhead box protein D4-l ( 416) 535 54.6 7.3e-07
NP_997188 (OMIM: 601092) forkhead box protein D4 [ ( 439) 535 54.6 7.6e-07
NP_954586 (OMIM: 611086) forkhead box protein D4-l ( 417) 531 54.3 9e-07
NP_001443 (OMIM: 603250) forkhead box protein F2 [ ( 444) 521 53.5 1.6e-06
NP_001442 (OMIM: 265380,601089) forkhead box prote ( 379) 512 52.8 2.3e-06
NP_005240 (OMIM: 164874,613454) forkhead box prote ( 489) 482 50.7 1.3e-05
NP_150285 (OMIM: 612788) forkhead box protein Q1 [ ( 403) 477 50.2 1.5e-05
NP_004505 (OMIM: 147685) forkhead box protein K2 [ ( 660) 423 46.4 0.00035
XP_011513493 (OMIM: 616302) PREDICTED: forkhead bo ( 570) 413 45.5 0.00052
NP_003914 (OMIM: 603621) forkhead box protein H1 [ ( 365) 405 44.8 0.00057
NP_001032242 (OMIM: 616302) forkhead box protein K ( 733) 413 45.6 0.00063
NP_001445 (OMIM: 602291) forkhead box protein J1 [ ( 421) 400 44.5 0.00082
XP_006710521 (OMIM: 616035) PREDICTED: forkhead bo ( 607) 386 43.5 0.0022
NP_055762 (OMIM: 616035) forkhead box protein J3 i ( 622) 386 43.6 0.0023
NP_001185780 (OMIM: 616035) forkhead box protein J ( 622) 386 43.6 0.0023
XP_011539328 (OMIM: 616035) PREDICTED: forkhead bo ( 622) 386 43.6 0.0023
NP_001185779 (OMIM: 616035) forkhead box protein J ( 622) 386 43.6 0.0023
XP_005270689 (OMIM: 616035) PREDICTED: forkhead bo ( 630) 386 43.6 0.0023
XP_016856182 (OMIM: 616035) PREDICTED: forkhead bo ( 573) 383 43.3 0.0025
NP_658982 (OMIM: 274600,600791,601093) forkhead bo ( 283) 373 42.3 0.0025
NP_001185781 (OMIM: 616035) forkhead box protein J ( 588) 383 43.3 0.0025
XP_006710522 (OMIM: 616035) PREDICTED: forkhead bo ( 596) 383 43.3 0.0026
XP_016880718 (OMIM: 600838,601705) PREDICTED: fork ( 567) 381 43.1 0.0027
XP_011523671 (OMIM: 600838,601705) PREDICTED: fork ( 436) 357 41.2 0.0079
XP_011523672 (OMIM: 600838,601705) PREDICTED: fork ( 436) 357 41.2 0.0079
XP_011523670 (OMIM: 600838,601705) PREDICTED: fork ( 436) 357 41.2 0.0079
XP_011523669 (OMIM: 600838,601705) PREDICTED: fork ( 462) 357 41.3 0.0082
NP_005188 (OMIM: 602628) forkhead box protein N3 i ( 468) 354 41.1 0.0097
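
Note on the bits and E(85289) columns above: the listed values are numerically consistent with E = N * m * n * 2^(-bits), where N is the number of library sequences (85289), m is the query length (553 aa), and n is the subject length shown in parentheses. This relation is inferred from the numbers in this report, not quoted from the FASTA documentation. The function name below is hypothetical, so treat this as a sketch for sanity-checking the table rather than as the program's own calculation.

```python
def expect_from_bits(bits, query_len, subject_len, n_library_seqs):
    """Approximate E-value implied by a bit score (assumed relation).

    Assumption: E = N * m * n * 2**(-bits), inferred from the values
    printed in the report above. Hypothetical helper, not part of FASTA.
    """
    search_space = n_library_seqs * query_len * subject_len
    return search_space * 2.0 ** (-bits)


# Values copied from the first three rows of the table above.
N_LIBRARY = 85289
QUERY_LEN = 553

rows = [
    ("NP_001444", 553, 300.9, 6.9e-81),
    ("NP_005242", 501, 102.3, 3.8e-21),
    ("NP_004109", 330, 63.5, 1.2e-09),
]

for acc, subject_len, bits, reported in rows:
    estimate = expect_from_bits(bits, QUERY_LEN, subject_len, N_LIBRARY)
    # Bit scores are printed to one decimal place, so agreement is
    # only expected to within a few percent.
    print(f"{acc}: reported {reported:.1e}, estimated {estimate:.1e}")
```

Because the bit scores are rounded to 0.1, the estimate can differ from the printed E-value by a few percent.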
>>NP_001444 (OMIM: 601090,601631,602482) forkhead box pr (553 aa)
initn: 3820 init1: 3820 opt: 3820 Z-score: 1573.1 bits: 300.9 E(85289): 6.9e-81
Smith-Waterman score: 3820; 100.0% identity (100.0% similar) in 553 aa overlap (1-553:1-553)
10 20 30 40 50 60
pF1KB9 MQARYSVSSPNSLGVVPYLGGEQSYYRAAAAAAGGGYTAMPAPMSVYSHPAHAEQYPGGM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MQARYSVSSPNSLGVVPYLGGEQSYYRAAAAAAGGGYTAMPAPMSVYSHPAHAEQYPGGM
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 ARAYGPYTPQPQPKDMVKPPYSYIALITMAIQNAPDKKITLNGIYQFIMDRFPFYRDNKQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 ARAYGPYTPQPQPKDMVKPPYSYIALITMAIQNAPDKKITLNGIYQFIMDRFPFYRDNKQ
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 GWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTLDPDSYNMFENGSFLRRRRRFKKKDAV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTLDPDSYNMFENGSFLRRRRRFKKKDAV
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB9 KDKEEKDRLHLKEPPPPGRQPPPAPPEQADGNAPGPQPPPVRIQDIKTENGTCPSPPQPL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 KDKEEKDRLHLKEPPPPGRQPPPAPPEQADGNAPGPQPPPVRIQDIKTENGTCPSPPQPL
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB9 SPAAALGSGSAAAVPKIESPDSSSSSLSSGSSPPGSLPSARPLSLDGADSAPPPPAPSAP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SPAAALGSGSAAAVPKIESPDSSSSSLSSGSSPPGSLPSARPLSLDGADSAPPPPAPSAP
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB9 PPHHSQGFSVDNIMTSLRGSPQSAAAELSSGLLASAAASSRAGIAPPLALGAYSPGQSSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 PPHHSQGFSVDNIMTSLRGSPQSAAAELSSGLLASAAASSRAGIAPPLALGAYSPGQSSL
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB9 YSSPCSQTSSAGSSGGGGGGAGAAGGAGGAGTYHCNLQAMSLYAAGERGGHLQGAPGGAG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 YSSPCSQTSSAGSSGGGGGGAGAAGGAGGAGTYHCNLQAMSLYAAGERGGHLQGAPGGAG
370 380 390 400 410 420
430 440 450 460 470 480
pF1KB9 GSAVDDPLPDYSLPPVTSSSSSSLSHGGGGGGGGGGQEAGHHPAAHQGRLTSWYLNQAGG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GSAVDDPLPDYSLPPVTSSSSSSLSHGGGGGGGGGGQEAGHHPAAHQGRLTSWYLNQAGG
430 440 450 460 470 480
490 500 510 520 530 540
pF1KB9 DLGHLASAAAAAAAAGYPGQQQNFHSVREMFESQRIGLNNSPVNGNSSCQMAFPSSQSLY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 DLGHLASAAAAAAAAGYPGQQQNFHSVREMFESQRIGLNNSPVNGNSSCQMAFPSSQSLY
490 500 510 520 530 540
550
pF1KB9 RTSGAFVYDCSKF
:::::::::::::
NP_001 RTSGAFVYDCSKF
550
>>NP_005242 (OMIM: 153400,602402) forkhead box protein C (501 aa)
initn: 1177 init1: 813 opt: 1171 Z-score: 500.7 bits: 102.3 E(85289): 3.8e-21
Smith-Waterman score: 1524; 49.9% identity (66.8% similar) in 587 aa overlap (1-553:1-501)
10 20 30 40 50 60
pF1KB9 MQARYSVSSPNSLGVVPYLGGEQSYYRAAAAAAGGGYTAMPAPMSVYSHPAHAEQYPGGM
::::::::.::.:::::::. ::.::::: :.: .: .::.::: .: ::: .::
NP_005 MQARYSVSDPNALGVVPYLS-EQNYYRAA-----GSYGGMASPMGVYS--GHPEQYSAGM
10 20 30 40 50
70 80 90 100 110
pF1KB9 ARAYGPYTP-QPQ-PKDMVKPPYSYIALITMAIQNAPDKKITLNGIYQFIMDRFPFYRDN
.:.:.:: :: :::.:::::::::::::::::::.::::::::::::::::::::.:
NP_005 GRSYAPYHHHQPAAPKDLVKPPYSYIALITMAIQNAPEKKITLNGIYQFIMDRFPFYREN
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB9 KQGWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTLDPDSYNMFENGSFLRRRRRFKKKD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 KQGWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTLDPDSYNMFENGSFLRRRRRFKKKD
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB9 AVKDKEEKDRLHLKEPPPPGRQPPPAPPEQADGNAPGPQPPPVRIQDIKTENGTCPSPP-
. :.::: : ::::::: . . :: :. ::. :. .. ::.: .. :. :
NP_005 VSKEKEE--RAHLKEPPPAASKGAPATPHLADA----PKEAEKKVV-IKSEAAS-PALPV
180 190 200 210 220
240 250 260 270 280 290
pF1KB9 ----QPLSPAAALGSGSAAAVPKIESPDSSSSSLSSGSSPPGSLPSARPLSLDGADSAPP
. ::: .:: .:: :. :..: .:: : :::: .
NP_005 ITKVETLSPESAL-QGS----PR------SAASTPAGS-PDGSLPEHH------------
230 240 250 260
300 310 320 330 340 350
pF1KB9 PPAPSAPPPHHSQGFSVDNIMTSLRGSPQSAAAELSSGLLASAAASSRAG-IAPPLALGA
::.. : ::::.:::: :: :: .. ::: : ..::: ..:::::
NP_005 AAAPNGLP-----GFSVENIMT-LRTSPPGG--ELSPG-------AGRAGLVVPPLAL-P
270 280 290 300
360 370 380 390 400 410
pF1KB9 YSPGQSSLYSSPCSQTSSAGSSGGGGGGAGAAGGAGGAGTYHCNLQAMSLYAAGERGGHL
:. . . :..::.: : :::::: :.:...:::::...:: .:.
NP_005 YAAAPPAAYGQPCAQ----------GLEAGAAGG------YQCSMRAMSLYTGAERPAHM
310 320 330 340
420 430 440 450 460
pF1KB9 QGAPGGAGGSAVDDPLPDYSLPPVTSSSSSSLSHGGGGGGGGGG---QEAGHH-------
: :.:. : :. :.. :. .:. : :. .. : :. :::
NP_005 CVPP------ALDEALSDHPSGPTSPLSALNLAAGQEGALAATGHHHQHHGHHHPQAPPP
350 360 370 380 390 400
470 480 490 500 510
pF1KB9 -----------PAAHQGRLTSWYLNQAGGDLGHLASAAAAAAAAGYPGQQQNFHSVREMF
:.: .. .:::::..: ::.:: . . :: :::.: .:::::
NP_005 PPAPQPQPTPQPGAAAAQAASWYLNHSG-DLNHLPGHTFAA-------QQQTFPNVREMF
410 420 430 440 450
520 530 540 550
pF1KB9 ESQRIGLNNSP-----VNGNSSCQMAFPSSQSLYRTSGAFVYDCSKF
.:.:.:..:: :.::.:::. . :. ::: .. . :::.:.
NP_005 NSHRLGIENSTLGESQVSGNASCQLPYRSTPPLYRHAAPYSYDCTKY
460 470 480 490 500
>>NP_004109 (OMIM: 602939) forkhead box protein S1 [Homo (330 aa)
initn: 658 init1: 571 opt: 655 Z-score: 294.1 bits: 63.5 E(85289): 1.2e-09
Smith-Waterman score: 684; 44.6% identity (62.2% similar) in 278 aa overlap (65-301:8-282)
40 50 60 70 80 90
pF1KB9 GGYTAMPAPMSVYSHPAHAEQYPGGMARAYGPYTPQPQPKDMVKPPYSYIALITMAIQNA
:: .: .: .::::::::::.::::..
NP_004 MQQQPLPGPGAPTTEP---TKPPYSYIALIAMAIQSS
10 20 30
100 110 120 130 140 150
pF1KB9 PDKKITLNGIYQFIMDRFPFYRDNKQGWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTL
: .. ::.:::..:: :: ::: :. :::::::::::::::::::::::.::::::::::
NP_004 PGQRATLSGIYRYIMGRFAFYRHNRPGWQNSIRHNLSLNECFVKVPRDDRKPGKGSYWTL
40 50 60 70 80 90
160 170 180 190 200
pF1KB9 DPDSYNMFENGSFLRRRRRFKKKDAV-------KDKEEKDRLHLKEPPPP----GRQ---
::: ..:::.:::::::::: .. .. : .. : ..: : :::
NP_004 DPDCHDMFEHGSFLRRRRRFTRQTGAEGTRGPAKARRGPLRATSQDPGVPNATTGRQCSF
100 110 120 130 140 150
210 220 230 240
pF1KB9 PP--PAPPEQADGNAPGPQP-------------PPVRIQDIKTENGTCPSP-PQPLSPAA
:: : : . :. : .: ::.. ..:.: . .::. : : ..
NP_004 PPELPDPKGLSFGGLVGAMPASMCPATTDGRPRPPMEPKEISTPKPACPGELPVATSSSS
160 170 180 190 200 210
250 260 270 280 290
pF1KB9 ALGSGSAAAVPKIESPDSSSSSLSSGSSPPGSLPSARPLSLD---GAD--------SAPP
. : :. . :: ... . . : : :: . : .:. ::: :: :
NP_004 CPAFGFPAGFSEAESFNKAPTPVLSPESGIGSSYQCRLQALNFCMGADPGLEHLLASAAP
220 230 240 250 260 270
300 310 320 330 340 350
pF1KB9 PPAPSAPPPHHSQGFSVDNIMTSLRGSPQSAAAELSSGLLASAAASSRAGIAPPLALGAY
::: .::
NP_004 SPAPPTPPGSLRAPLPLPTDHKEPWVAGGFPVQGGSGYPLGLTPCLYRTPGMFFFE
280 290 300 310 320 330
>>NP_004463 (OMIM: 601091) forkhead box protein D1 [Homo (465 aa)
initn: 590 init1: 514 opt: 649 Z-score: 289.7 bits: 63.2 E(85289): 2.1e-09
Smith-Waterman score: 655; 37.7% identity (58.4% similar) in 361 aa overlap (31-387:80-419)
10 20 30 40 50 60
pF1KB9 MQARYSVSSPNSLGVVPYLGGEQSYYRAAAAAAGGGYTAMPAPMSVYSHPAHAEQYPGGM
: .:: : :.: . . : : ::
NP_004 AQRRRRRRSYAGEDELEDLEEEEDDDDILLAPPAGGSPAPPGPAPAAG--AGAGGGGGGG
50 60 70 80 90 100
70 80 90 100 110 120
pF1KB9 ARAYGPYTPQPQPKDMVKPPYSYIALITMAIQNAPDKKITLNGIYQFIMDRFPFYRDNKQ
. . : . . . .::::::::::::::: ..: :..::. : .:: :::.::..
NP_004 GAGGGGSAGSGAKNPLVKPPYSYIALITMAILQSPKKRLTLSEICEFISGRFPYYREKFP
110 120 130 140 150 160
130 140 150 160 170 180
pF1KB9 GWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTLDPDSYNMFENGSFLRRRRRFKKKDAV
.::::::::::::.::::.::. .::::.::::::.: .::.::::::::.:::.. .
NP_004 AWQNSIRHNLSLNDCFVKIPREPGNPGKGNYWTLDPESADMFDNGSFLRRRKRFKRQPLL
170 180 190 200 210 220
190 200 210 220 230
pF1KB9 K-DKEEKDRLHLKEPPPPGRQPPPAPPEQADGNAPGPQPPPVRIQDIKTENGTCPSPPQP
. . : :. : :: : :.: ::: : . : :
NP_004 PPNAAAAESLLLRGAGAAGGAGDPAAA--AALFPPAPPPPPHAYGYGPYGCGYGLQLP-P
230 240 250 260 270 280
240 250 260 270 280 290
pF1KB9 LSPAAALGSGSAAAVPKIESPDSSSSSLSSGSSPPGSLP--SARPLSLDGADSAPPPPAP
.: .:: ...::: ...... : :: : .: :. . : : .
NP_004 YAPPSALFAAAAAA--------AAAAAFHPHSPPPPPPPHGAAAELARTAFGYRPHPLGA
290 300 310 320 330
300 310 320 330 340 350
pF1KB9 SAPPPHHSQGFSVDNIMTS-LRGSPQSAAAELSSGLLASAAASSRAGIAPPLALGAYSPG
. : : ... .. . .: : :: : . . .: :. :::.. :. : : .. ::
NP_004 ALPGPLPASAAKAGGPGASALARSPFSIES-IIGGSLGPAAAAAAAAQAAAAAQASPSP-
340 350 360 370 380 390
360 370 380 390 400 410
pF1KB9 QSSLYSSPCSQTSSAGSSGGGGGGAGAAGGAGGAGTYHCNLQAMSLYAAGERGGHLQGAP
:: . . :::::: .. .:.: :
NP_004 ------SPVAAPPAPGSSGGGCAAQAAVGPAAALTRSLVAAAAAAASSVSSSAALGTLHQ
400 410 420 430 440
420 430 440 450 460 470
pF1KB9 GGAGGSAVDDPLPDYSLPPVTSSSSSSLSHGGGGGGGGGGQEAGHHPAAHQGRLTSWYLN
NP_004 GTALSSVENFTARISNC
450 460
>>NP_004465 (OMIM: 602211) forkhead box protein D2 [Homo (495 aa)
initn: 716 init1: 509 opt: 644 Z-score: 287.3 bits: 62.8 E(85289): 2.9e-09
Smith-Waterman score: 679; 33.9% identity (52.3% similar) in 499 aa overlap (1-459:39-484)
10 20 30
pF1KB9 MQARYSVSSPNSLGVVPYLGGEQSYYRAAA
. :: . .: . :.:. : : .: :
NP_004 EIMSSESSPAALSEADADIDVVGGGSGGGELPARSGPRAPRD--VLPH-GHEPPAEEAEA
10 20 30 40 50 60
40 50 60 70
pF1KB9 AAA------GGGYTAMPAPMSVYSHPAHAEQYPG-GMARAYGPYTPQPQPKD--------
: :: . : .. . : : :: : : : : : : : .
NP_004 DLAEDEEESGGCSDGEPRALASRG-AAAAAGSPGPGAAAARGAAGPGPGPPSGGAATRSP
70 80 90 100 110 120
80 90 100 110 120 130
pF1KB9 MVKPPYSYIALITMAIQNAPDKKITLNGIYQFIMDRFPFYRDNKQGWQNSIRHNLSLNEC
.::::::::::::::: ..: :..::. : .:: :::.::.. .::::::::::::.:
NP_004 LVKPPYSYIALITMAILQSPKKRLTLSEICEFISGRFPYYREKFPAWQNSIRHNLSLNDC
130 140 150 160 170 180
140 150 160 170 180 190
pF1KB9 FVKVPRDDKKPGKGSYWTLDPDSYNMFENGSFLRRRRRFKKKDAVKDKEEKDRLHLKEPP
:::.::. .::::.::::::.: .::.::::::::.:::.. :
NP_004 FVKIPREPGNPGKGNYWTLDPESADMFDNGSFLRRRKRFKRQPL---------------P
190 200 210 220
200 210 220 230
pF1KB9 PPGRQPPPAPPEQADGNA-----PGPQPPPVRIQD---------IKTENGTCPSP---PQ
:: .: : : :.: :: : . . .. :.: :.
NP_004 PPHPHPHPHPELLLRGGAAAAGDPGAFLPGFAAYGAYGYGYGLALPAYGAPPPGPAPHPH
230 240 250 260 270 280
240 250 260 270 280 290
pF1KB9 PLSPAAALGSGSAAAVPKIESPDSSSSSLSSGSSPPGSLPSARPLSLDGADSAPPPPAPS
: : :.....::: .. : . ... ::: :.: .. :. :: : . :
NP_004 PHPHAFAFAAAAAAAPCQLSVPPGRAAA-----PPPGP-PTASVFAGAGSAPAPAPASGS
290 300 310 320 330 340
300 310 320 330 340 350
pF1KB9 APPPHHSQGFSVDNIMTSLRGSPQSAAAELSSGLLASAAASSRAGIAPPLALGAYSPGQS
.: : . : : .:::. : : :...:: : :. . .
NP_004 GPGPGPA-------------GLPAFLGAELGC-----AKAFYAASLSPPAA-GTAAGLPT
350 360 370 380
360 370 380 390 400 410
pF1KB9 SLYSSPCSQTSSAGSSGGGGGGAGAAG--------GAGGAGTYHCNLQAMSLYAAGERGG
.: . .:...:..::::.::: : ::.:. : . : :
NP_004 ALLRQGL-KTDAGGGAGGGGAGAGQRPSFSIDHIMGHGGGGA------APPGAGEGSPGP
390 400 410 420 430
420 430 440 450 460 470
pF1KB9 HLQGAPGGAGGSAVDDPLPDYSLPPVTSSSSSSLSHGGGGGGGGGGQEAGHHPAAHQGRL
. .: : .: . : : .: :: .. ::: : . ..:.. :
NP_004 PFAAAAGPGGQAQVLAMLTAPALAPV--AGHIRLSHPGDALLSSGSRFASKVAGLSGCHF
440 450 460 470 480 490
480 490 500 510 520 530
pF1KB9 TSWYLNQAGGDLGHLASAAAAAAAAGYPGQQQNFHSVREMFESQRIGLNNSPVNGNSSCQ
>>NP_005241 (OMIM: 603252) forkhead box protein L1 [Homo (345 aa)
initn: 632 init1: 562 opt: 637 Z-score: 286.5 bits: 62.1 E(85289): 3.2e-09
Smith-Waterman score: 671; 42.6% identity (64.4% similar) in 298 aa overlap (78-367:49-319)
50 60 70 80 90 100
pF1KB9 SHPAHAEQYPGGMARAYGPYTPQPQPKDMVKPPYSYIALITMAIQNAPDKKITLNGIYQF
::::::::::.::::.::....::::::::
NP_005 YLYGPERPGLPLAFAPAAALAASGRAETPQKPPYSYIALIAMAIQDAPEQRVTLNGIYQF
20 30 40 50 60 70
110 120 130 140 150 160
pF1KB9 IMDRFPFYRDNKQGWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTLDPDSYNMFENGSF
::::::::.::.::::::::::::::.:::::::. .::::::::::: .:::::..
NP_005 IMDRFPFYHDNRQGWQNSIRHNLSLNDCFVKVPREKGRPGKGSYWTLDPRCLDMFENGNY
80 90 100 110 120 130
170 180 190 200 210 220
pF1KB9 LRRRRRFKK-KDAVKDKEEKDRLHLKEPPPPGRQPPPAPPEQADGNAPGPQPPPVRIQDI
::.:. : : . :. . . : .. : :: ..: : : : :.:
NP_005 RRRKRKPKPGPGAPEAKRPRAETH--------QRSAEAQPEAGSG-AGGSGPAISRLQ--
140 150 160 170 180
230 240 250 260 270 280
pF1KB9 KTENGTCPSPPQPL----SPAAALGSGSAAAVPKIESPDSSSSSLSSGSSPPGSLPSARP
. :. :.:: :: : : : ::. .... ..:.. . .::
NP_005 -----AAPAGPSPLLDGPSPPAPLH------WPGTASPNEDAGDAAQGAAAVAVGQAAR-
190 200 210 220 230
290 300 310 320 330 340
pF1KB9 LSLDGADSAPPPPAPSAPPPH-HSQGFSVDNIMTSLRGSPQSAAAELSSGLLASAAASSR
. :: : : . :.: .:..::.:.:... .:. .. :: .: :. . ..:
NP_005 -TGDGPGSPLRPASRSSPKSSDKSKSFSIDSILAGKQGQKPPSGDELLGG--AKPGPGGR
240 250 260 270 280 290
350 360 370 380 390
pF1KB9 AGIAPPLALGA--YSPGQSSLYSSPCSQTSSAGSSGGGGGGAGAAGGAGGAGTYHCNLQA
: : :: .. : ..::. .: :
NP_005 LG-ASLLAASSSLRPPFNASLMLDPHVQGGFYQLGIPFLSYFPLQVPDTVLHFQ
300 310 320 330 340
>>NP_004464 (OMIM: 241850,602617,616534) forkhead box pr (373 aa)
initn: 684 init1: 540 opt: 629 Z-score: 282.9 bits: 61.6 E(85289): 5.1e-09
Smith-Waterman score: 641; 37.5% identity (57.3% similar) in 363 aa overlap (57-393:32-369)
30 40 50 60 70 80
pF1KB9 RAAAAAAGGGYTAMPAPMSVYSHPAHAEQYPG-GMARAYGPYTPQPQPKDMVKPPYSYIA
:: . .:. : . .: . ::::::::
NP_004 TAESGPPPPQPEVLATVKEERGETAAGAGVPGEATGRGAGGRR-RKRPLQRGKPPYSYIA
10 20 30 40 50 60
90 100 110 120 130 140
pF1KB9 LITMAIQNAPDKKITLNGIYQFIMDRFPFYRDNKQGWQNSIRHNLSLNECFVKVPRDDKK
::.::: .::....::.:::.:: .:::::::: . :::::::::.::.::.:.::. .
NP_004 LIAMAIAHAPERRLTLGGIYKFITERFPFYRDNPKKWQNSIRHNLTLNDCFLKIPREAGR
70 80 90 100 110 120
150 160 170 180 190
pF1KB9 PGKGSYWTLDPDSYNMFENGSFLRRRRRFKKKD-AVKDKEEKDRLHLKEPPP--------
::::.::.:::.. .:::.:::::::.:::..: .. .:
NP_004 PGKGNYWALDPNAEDMFESGSFLRRRKRFKRSDLSTYPAYMHDAAAAAAAAAAAAAAAAI
130 140 150 160 170 180
200 210 220 230 240
pF1KB9 -PGRQP---PPAPPEQADGNAPGPQ---PPPVRIQDIKTENGTCPS----PPQPLSPAAA
:: : :: : : :: :. :::: . : : : .::::
NP_004 FPGAVPAARPPYPGAVYAGYAP-PSLAAPPPVYYP--AASPGPCRVFGLVPERPLSPE--
190 200 210 220 230
250 260 270 280 290 300
pF1KB9 LGSGSAAAVPKIESPDSSSSSLSSGSSPPGSLPSARPLSLDGADSAPPPPAPSAPPPHHS
:: : .: .: . :.:. :.. . .: . :: : ::: ..
NP_004 LG-------PAPSGPGGSCAFASAGA--PATTTGYQPAGCTGAR----PANPSAYAAAYA
240 250 260 270 280
310 320 330 340 350 360
pF1KB9 QGFSVDNIMTSLRGSPQSAAAELSSGLLASAAASSRAGIAPPLAL-GAYSPGQSSLYSSP
. :. . . :: ::: .: . :..: .:. . . : :::: . ..
NP_004 ---GPDGAYPQGAGSAIFAAAGRLAGPASPPAGGSSGGVETTVDFYGRTSPGQFGALGA-
290 300 310 320 330
370 380 390 400 410 420
pF1KB9 CSQTSSAGSSGGGGGGA----GAAGGAGGAGTYHCNLQAMSLYAAGERGGHLQGAPGGAG
: . .:. ::...:: ::. :: .
NP_004 CY--NPGGQLGGASAGAYHARHAAAYPGGIDRFVSAM
340 350 360 370
430 440 450 460 470 480
pF1KB9 GSAVDDPLPDYSLPPVTSSSSSSLSHGGGGGGGGGGQEAGHHPAAHQGRLTSWYLNQAGG
>>XP_016876735 (OMIM: 602294) PREDICTED: hepatocyte nucl (439 aa)
initn: 556 init1: 523 opt: 622 Z-score: 279.1 bits: 61.1 E(85289): 8.3e-09
Smith-Waterman score: 645; 37.5% identity (60.7% similar) in 392 aa overlap (9-369:76-439)
10 20 30
pF1KB9 MQARYSVSSPNSLGVVPYLGGEQSYYRAAAAAAGGGYT
::...:. .:..: ::. . : :.
XP_016 PGAVAGMPGGSAGAMNSMTAAGVTAMGTALSPSGMGA---MGAQQ----AASMNGLGPYA
50 60 70 80 90
40 50 60 70 80 90
pF1KB9 AMPAP-MSVYSH-PAH---AEQYPGGMARAYGPYTPQPQPKDMVKPPYSYIALITMAIQN
: : :: ... :.. .. :: :... :. .:::::::.:::::::.
XP_016 AAMNPCMSPMAYAPSNLGRSRAGGGGDAKTFKRSYPH------AKPPYSYISLITMAIQQ
100 110 120 130 140 150
100 110 120 130 140 150
pF1KB9 APDKKITLNGIYQFIMDRFPFYRDNKQGWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWT
::.: .::. :::.::: ::.::.:.: :::::::.::.:.::::: :. :::::::::
XP_016 APSKMLTLSEIYQWIMDLFPYYRQNQQRWQNSIRHSLSFNDCFVKVARSPDKPGKGSYWT
160 170 180 190 200 210
160 170 180 190 200 210
pF1KB9 LDPDSYNMFENGSFLRRRRRFK---KKDAVKDKEEKDRLHLKEPPPPGRQPPPAPPEQAD
: ::: :::::: .:::..::: . : . . : .:. : . ..
XP_016 LHPDSGNMFENGCYLRRQKRFKCEKQPGAGGGGGSGSGGSGAKGGPESRKDPSG---ASN
220 230 240 250 260
220 230 240 250 260
pF1KB9 GNAPGPQPPPVRIQDIKTENGTCPSP---PQPLSPAAALGSGSAAAVPKIESPDSSSSS-
.: .: :. . . :.. :.: :: :. ..: ..:.:. ....: ::..
XP_016 PSADSPLHRGVHGKTGQLEGAPAPGPAASPQTLDHSGATATGGAS---ELKTPASSTAPP
270 280 290 300 310 320
270 280 290 300 310
pF1KB9 LSSGSSPPGSLPSARPLSLDGADSAPPPPAPSAP-PPHHS--QGFSVDNIMTS-------
.::: ::.: :. : : . :: ::.: . ::..:.:.:
XP_016 ISSG---PGALASV-PASHPAHGLAPHESQLHLKGDPHYSFNHPFSINNLMSSSEQQHKL
330 340 350 360 370 380
320 330 340 350 360
pF1KB9 --------LRGSPQSAAAELSSGL-LASAAASSRAGIAPPLALGAYSPGQSSLYSSPCSQ
:. :: ... : ..: :.::....:. : : :: : .:: : .
XP_016 DFKAYEQALQYSPYGST--LPASLPLGSASVTTRSPIEPSALEPAYYQG---VYSRPVLN
390 400 410 420 430
370 380 390 400 410 420
pF1KB9 TSSAGSSGGGGGGAGAAGGAGGAGTYHCNLQAMSLYAAGERGGHLQGAPGGAGGSAVDDP
::
XP_016 TS
>>NP_004487 (OMIM: 602294) hepatocyte nuclear factor 3-a (472 aa)
initn: 556 init1: 523 opt: 622 Z-score: 278.7 bits: 61.1 E(85289): 8.8e-09
Smith-Waterman score: 645; 37.5% identity (60.7% similar) in 392 aa overlap (9-369:109-472)
10 20 30
pF1KB9 MQARYSVSSPNSLGVVPYLGGEQSYYRAAAAAAGGGYT
::...:. .:..: ::. . : :.
NP_004 PGAVAGMPGGSAGAMNSMTAAGVTAMGTALSPSGMGA---MGAQQ----AASMNGLGPYA
80 90 100 110 120 130
40 50 60 70 80 90
pF1KB9 AMPAP-MSVYSH-PAH---AEQYPGGMARAYGPYTPQPQPKDMVKPPYSYIALITMAIQN
: : :: ... :.. .. :: :... :. .:::::::.:::::::.
NP_004 AAMNPCMSPMAYAPSNLGRSRAGGGGDAKTFKRSYPH------AKPPYSYISLITMAIQQ
140 150 160 170 180
100 110 120 130 140 150
pF1KB9 APDKKITLNGIYQFIMDRFPFYRDNKQGWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWT
::.: .::. :::.::: ::.::.:.: :::::::.::.:.::::: :. :::::::::
NP_004 APSKMLTLSEIYQWIMDLFPYYRQNQQRWQNSIRHSLSFNDCFVKVARSPDKPGKGSYWT
190 200 210 220 230 240
160 170 180 190 200 210
pF1KB9 LDPDSYNMFENGSFLRRRRRFK---KKDAVKDKEEKDRLHLKEPPPPGRQPPPAPPEQAD
: ::: :::::: .:::..::: . : . . : .:. : . ..
NP_004 LHPDSGNMFENGCYLRRQKRFKCEKQPGAGGGGGSGSGGSGAKGGPESRKDPSG---ASN
250 260 270 280 290 300
220 230 240 250 260
pF1KB9 GNAPGPQPPPVRIQDIKTENGTCPSP---PQPLSPAAALGSGSAAAVPKIESPDSSSSS-
.: .: :. . . :.. :.: :: :. ..: ..:.:. ....: ::..
NP_004 PSADSPLHRGVHGKTGQLEGAPAPGPAASPQTLDHSGATATGGAS---ELKTPASSTAPP
310 320 330 340 350
270 280 290 300 310
pF1KB9 LSSGSSPPGSLPSARPLSLDGADSAPPPPAPSAP-PPHHS--QGFSVDNIMTS-------
.::: ::.: :. : : . :: ::.: . ::..:.:.:
NP_004 ISSG---PGALASV-PASHPAHGLAPHESQLHLKGDPHYSFNHPFSINNLMSSSEQQHKL
360 370 380 390 400 410
320 330 340 350 360
pF1KB9 --------LRGSPQSAAAELSSGL-LASAAASSRAGIAPPLALGAYSPGQSSLYSSPCSQ
:. :: ... : ..: :.::....:. : : :: : .:: : .
NP_004 DFKAYEQALQYSPYGST--LPASLPLGSASVTTRSPIEPSALEPAYYQG---VYSRPVLN
420 430 440 450 460 470
370 380 390 400 410 420
pF1KB9 TSSAGSSGGGGGGAGAAGGAGGAGTYHCNLQAMSLYAAGERGGHLQGAPGGAGGSAVDDP
::
NP_004 TS
>>NP_036318 (OMIM: 107250,601094,610256) forkhead box pr (319 aa)
initn: 580 init1: 525 opt: 611 Z-score: 276.5 bits: 60.2 E(85289): 1.2e-08
Smith-Waterman score: 638; 40.8% identity (57.9% similar) in 292 aa overlap (32-297:11-301)
10 20 30 40 50
pF1KB9 QARYSVSSPNSLGVVPYLGGEQSYYRAAAAAAGGGYTAMPA--PMSVYSHP-AHAE--QY
:: .:. :.:: : . : : :: .
NP_036 MAGRSDMDPPAAFSGFPALPAVAPSGPPPSPLAGAEPGRE
10 20 30 40
60 70 80 90 100
pF1KB9 PGGMARAYGPYTPQP---------QPKDMVKPPYSYIALITMAIQNAPDKKITLNGIYQF
: : . : .: : .: . ::::::::::.::. .:: ...:: .::.:
NP_036 PEEAAAGRGEAAPTPAPGPGRRRRRPLQRGKPPYSYIALIAMALAHAPGRRLTLAAIYRF
50 60 70 80 90 100
110 120 130 140 150 160
pF1KB9 IMDRFPFYRDNKQGWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTLDPDSYNMFENGSF
: .:: ::::. . :::::::::.::.:::::::. .::::.:::::: . .::.::::
NP_036 ITERFAFYRDSPRKWQNSIRHNLTLNDCFVKVPREPGNPGKGNYWTLDPAAADMFDNGSF
110 120 130 140 150 160
170 180 190 200 210 220
pF1KB9 LRRRRRFKKKDAVKDKEEKDRLHLKEPPPPGRQPPPAPPEQADGNAPGPQP-PPVRIQDI
::::.:::. . : : : : :.: . . :: : ::.:. ..
NP_036 LRRRKRFKRAELPAHAAAAPGPPLPFPYAP-YAPAPGPALLVPPPSAGPGPSPPARLFSV
170 180 190 200 210
230 240 250 260 270
pF1KB9 KTENGT--------CPSPP---QPLSPAAALGSGSAAAVPKIESPDSSSSSLSSGSSPPG
. . : :: : . :::. .::: : . : . : . ::
NP_036 DSLVNLQPELAGLGAPEPPCCAAPDAAAAAFPPCAAAASPPLYSQVPDRLVLPATRPGPG
220 230 240 250 260 270
280 290 300 310 320 330
pF1KB9 SLPSARPLSLDGADSAPPPPAPSAPPPHHSQGFSVDNIMTSLRGSPQSAAAELSSGLLAS
::. :.: : .: : .:
NP_036 PLPAEPLLALAGPAAALGPLSPGEAYLRQPGFASGLERYL
280 290 300 310
553 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 03:43:52 2016 done: Tue Nov 8 03:43:53 2016
Total Scan time: 9.420 Total Display time: 0.070
Function used was FASTA [36.3.4 Apr, 2011]
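
Note on post-processing: each row of the "The best scores are:" table in this report has the same layout (accession, truncated description, subject length in parentheses, opt score, bit score, E-value). A minimal sketch for pulling those fields out of one row follows. The regular expression and the `parse_hit` name are assumptions written for the layout shown here, and other FASTA output options (for example `-m` formats) would need a different pattern.

```python
import re

# Pattern for one row of the best-scores table as printed in this report.
# Hypothetical helper; the layout is assumed from the rows shown above.
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+"                      # accession, e.g. NP_001444
    r"(?P<desc>.*?)\s*"                      # description (may contain parentheses)
    r"\(\s*(?P<length>\d+)\)\s+"             # subject length, e.g. ( 553)
    r"(?P<opt>\d+)\s+"                       # opt score
    r"(?P<bits>\d+(?:\.\d+)?)\s+"            # bit score
    r"(?P<evalue>[\d.]+(?:[eE][-+]?\d+)?)"   # E-value, plain or scientific
    r"\s*$"
)


def parse_hit(line):
    """Return a dict of fields for a best-scores row, or None if no match."""
    m = HIT_RE.match(line)
    if m is None:
        return None
    return {
        "acc": m["acc"],
        "desc": m["desc"],
        "length": int(m["length"]),
        "opt": int(m["opt"]),
        "bits": float(m["bits"]),
        "evalue": float(m["evalue"]),
    }


row = "NP_005242 (OMIM: 153400,602402) forkhead box prote ( 501) 1171 102.3 3.8e-21"
print(parse_hit(row))
```

Lines that are not table rows, such as the column header or the `>>` alignment headers, do not match the pattern and yield `None`.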