FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9595, 553 aa
  1>>>pF1KB9595 553 - 553 aa - 553 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 14.0995+/-0.000416; mu= -19.3295+/- 0.026
 mean_var=609.4967+/-126.141, 0's: 0 Z-trim(125.9): 115 B-trim: 0 in 0/61
 Lambda= 0.051950
 statistics sampled from 50452 (50600) to 50452 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.826), E-opt: 0.2 (0.593), width: 16
 Scan time: 9.420

The best scores are: opt bits E(85289)
NP_001444 (OMIM: 601090,601631,602482) forkhead bo ( 553) 3820 300.9 6.9e-81
NP_005242 (OMIM: 153400,602402) forkhead box prote ( 501) 1171 102.3 3.8e-21
NP_004109 (OMIM: 602939) forkhead box protein S1 [ ( 330) 655 63.5 1.2e-09
NP_004463 (OMIM: 601091) forkhead box protein D1 [ ( 465) 649 63.2 2.1e-09
NP_004465 (OMIM: 602211) forkhead box protein D2 [ ( 495) 644 62.8 2.9e-09
NP_005241 (OMIM: 603252) forkhead box protein L1 [ ( 345) 637 62.1 3.2e-09
NP_004464 (OMIM: 241850,602617,616534) forkhead bo ( 373) 629 61.6 5.1e-09
XP_016876735 (OMIM: 602294) PREDICTED: hepatocyte ( 439) 622 61.1 8.3e-09
NP_004487 (OMIM: 602294) hepatocyte nuclear factor ( 472) 622 61.1 8.8e-09
NP_036318 (OMIM: 107250,601094,610256) forkhead bo ( 319) 611 60.2 1.2e-08
NP_710141 (OMIM: 600288) hepatocyte nuclear factor ( 457) 589 58.7 4.8e-08
NP_068556 (OMIM: 600288) hepatocyte nuclear factor ( 463) 589 58.7 4.8e-08
NP_036316 (OMIM: 611084) forkhead box protein D4-l ( 408) 570 57.2 1.2e-07
NP_036320 (OMIM: 274600,600791,601093) forkhead bo ( 378) 560 56.4 1.9e-07
NP_036315 (OMIM: 607836,611539) forkhead box prote ( 478) 560 56.5 2.2e-07
NP_075555 (OMIM: 110100,605597,608996) forkhead bo ( 376) 548 55.5 3.5e-07
NP_004488 (OMIM: 602295) hepatocyte nuclear factor ( 350) 545 55.3 3.8e-07
NP_001129121 (OMIM: 612351) forkhead box protein I ( 420) 539 54.9 6e-07
NP_954714 (OMIM: 611085) forkhead box protein D4-l ( 416) 535 54.6 7.3e-07
NP_997188 (OMIM: 601092) forkhead box protein D4 [ ( 439) 535 54.6 7.6e-07
NP_954586 (OMIM: 611086) forkhead box protein D4-l ( 417) 531 54.3 9e-07
NP_001443 (OMIM: 603250) forkhead box protein F2 [ ( 444) 521 53.5 1.6e-06
NP_001442 (OMIM: 265380,601089) forkhead box prote ( 379) 512 52.8 2.3e-06
NP_005240 (OMIM: 164874,613454) forkhead box prote ( 489) 482 50.7 1.3e-05
NP_150285 (OMIM: 612788) forkhead box protein Q1 [ ( 403) 477 50.2 1.5e-05
NP_004505 (OMIM: 147685) forkhead box protein K2 [ ( 660) 423 46.4 0.00035
XP_011513493 (OMIM: 616302) PREDICTED: forkhead bo ( 570) 413 45.5 0.00052
NP_003914 (OMIM: 603621) forkhead box protein H1 [ ( 365) 405 44.8 0.00057
NP_001032242 (OMIM: 616302) forkhead box protein K ( 733) 413 45.6 0.00063
NP_001445 (OMIM: 602291) forkhead box protein J1 [ ( 421) 400 44.5 0.00082
XP_006710521 (OMIM: 616035) PREDICTED: forkhead bo ( 607) 386 43.5 0.0022
NP_055762 (OMIM: 616035) forkhead box protein J3 i ( 622) 386 43.6 0.0023
NP_001185780 (OMIM: 616035) forkhead box protein J ( 622) 386 43.6 0.0023
XP_011539328 (OMIM: 616035) PREDICTED: forkhead bo ( 622) 386 43.6 0.0023
NP_001185779 (OMIM: 616035) forkhead box protein J ( 622) 386 43.6 0.0023
XP_005270689 (OMIM: 616035) PREDICTED: forkhead bo ( 630) 386 43.6 0.0023
XP_016856182 (OMIM: 616035) PREDICTED: forkhead bo ( 573) 383 43.3 0.0025
NP_658982 (OMIM: 274600,600791,601093) forkhead bo ( 283) 373 42.3 0.0025
NP_001185781 (OMIM: 616035) forkhead box protein J ( 588) 383 43.3 0.0025
XP_006710522 (OMIM: 616035) PREDICTED: forkhead bo ( 596) 383 43.3 0.0026
XP_016880718 (OMIM: 600838,601705) PREDICTED: fork ( 567) 381 43.1 0.0027
XP_011523671 (OMIM: 600838,601705) PREDICTED: fork ( 436) 357 41.2 0.0079
XP_011523672 (OMIM: 600838,601705) PREDICTED: fork ( 436) 357 41.2 0.0079
XP_011523670 (OMIM: 600838,601705) PREDICTED: fork ( 436) 357 41.2 0.0079
XP_011523669 
(OMIM: 600838,601705) PREDICTED: fork ( 462) 357 41.3 0.0082 NP_005188 (OMIM: 602628) forkhead box protein N3 i ( 468) 354 41.1 0.0097 >>NP_001444 (OMIM: 601090,601631,602482) forkhead box pr (553 aa) initn: 3820 init1: 3820 opt: 3820 Z-score: 1573.1 bits: 300.9 E(85289): 6.9e-81 Smith-Waterman score: 3820; 100.0% identity (100.0% similar) in 553 aa overlap (1-553:1-553) 10 20 30 40 50 60 pF1KB9 MQARYSVSSPNSLGVVPYLGGEQSYYRAAAAAAGGGYTAMPAPMSVYSHPAHAEQYPGGM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MQARYSVSSPNSLGVVPYLGGEQSYYRAAAAAAGGGYTAMPAPMSVYSHPAHAEQYPGGM 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 ARAYGPYTPQPQPKDMVKPPYSYIALITMAIQNAPDKKITLNGIYQFIMDRFPFYRDNKQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 ARAYGPYTPQPQPKDMVKPPYSYIALITMAIQNAPDKKITLNGIYQFIMDRFPFYRDNKQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 GWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTLDPDSYNMFENGSFLRRRRRFKKKDAV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 GWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTLDPDSYNMFENGSFLRRRRRFKKKDAV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB9 KDKEEKDRLHLKEPPPPGRQPPPAPPEQADGNAPGPQPPPVRIQDIKTENGTCPSPPQPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 KDKEEKDRLHLKEPPPPGRQPPPAPPEQADGNAPGPQPPPVRIQDIKTENGTCPSPPQPL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB9 SPAAALGSGSAAAVPKIESPDSSSSSLSSGSSPPGSLPSARPLSLDGADSAPPPPAPSAP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SPAAALGSGSAAAVPKIESPDSSSSSLSSGSSPPGSLPSARPLSLDGADSAPPPPAPSAP 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB9 PPHHSQGFSVDNIMTSLRGSPQSAAAELSSGLLASAAASSRAGIAPPLALGAYSPGQSSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PPHHSQGFSVDNIMTSLRGSPQSAAAELSSGLLASAAASSRAGIAPPLALGAYSPGQSSL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB9 YSSPCSQTSSAGSSGGGGGGAGAAGGAGGAGTYHCNLQAMSLYAAGERGGHLQGAPGGAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 
YSSPCSQTSSAGSSGGGGGGAGAAGGAGGAGTYHCNLQAMSLYAAGERGGHLQGAPGGAG 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB9 GSAVDDPLPDYSLPPVTSSSSSSLSHGGGGGGGGGGQEAGHHPAAHQGRLTSWYLNQAGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 GSAVDDPLPDYSLPPVTSSSSSSLSHGGGGGGGGGGQEAGHHPAAHQGRLTSWYLNQAGG 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB9 DLGHLASAAAAAAAAGYPGQQQNFHSVREMFESQRIGLNNSPVNGNSSCQMAFPSSQSLY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 DLGHLASAAAAAAAAGYPGQQQNFHSVREMFESQRIGLNNSPVNGNSSCQMAFPSSQSLY 490 500 510 520 530 540 550 pF1KB9 RTSGAFVYDCSKF ::::::::::::: NP_001 RTSGAFVYDCSKF 550 >>NP_005242 (OMIM: 153400,602402) forkhead box protein C (501 aa) initn: 1177 init1: 813 opt: 1171 Z-score: 500.7 bits: 102.3 E(85289): 3.8e-21 Smith-Waterman score: 1524; 49.9% identity (66.8% similar) in 587 aa overlap (1-553:1-501) 10 20 30 40 50 60 pF1KB9 MQARYSVSSPNSLGVVPYLGGEQSYYRAAAAAAGGGYTAMPAPMSVYSHPAHAEQYPGGM ::::::::.::.:::::::. ::.::::: :.: .: .::.::: .: ::: .:: NP_005 MQARYSVSDPNALGVVPYLS-EQNYYRAA-----GSYGGMASPMGVYS--GHPEQYSAGM 10 20 30 40 50 70 80 90 100 110 pF1KB9 ARAYGPYTP-QPQ-PKDMVKPPYSYIALITMAIQNAPDKKITLNGIYQFIMDRFPFYRDN .:.:.:: :: :::.:::::::::::::::::::.::::::::::::::::::::.: NP_005 GRSYAPYHHHQPAAPKDLVKPPYSYIALITMAIQNAPEKKITLNGIYQFIMDRFPFYREN 60 70 80 90 100 110 120 130 140 150 160 170 pF1KB9 KQGWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTLDPDSYNMFENGSFLRRRRRFKKKD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 KQGWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTLDPDSYNMFENGSFLRRRRRFKKKD 120 130 140 150 160 170 180 190 200 210 220 230 pF1KB9 AVKDKEEKDRLHLKEPPPPGRQPPPAPPEQADGNAPGPQPPPVRIQDIKTENGTCPSPP- . :.::: : ::::::: . . :: :. ::. :. .. ::.: .. :. : NP_005 VSKEKEE--RAHLKEPPPAASKGAPATPHLADA----PKEAEKKVV-IKSEAAS-PALPV 180 190 200 210 220 240 250 260 270 280 290 pF1KB9 ----QPLSPAAALGSGSAAAVPKIESPDSSSSSLSSGSSPPGSLPSARPLSLDGADSAPP . ::: .:: .:: :. :..: .:: : :::: . 
NP_005 ITKVETLSPESAL-QGS----PR------SAASTPAGS-PDGSLPEHH------------ 230 240 250 260 300 310 320 330 340 350 pF1KB9 PPAPSAPPPHHSQGFSVDNIMTSLRGSPQSAAAELSSGLLASAAASSRAG-IAPPLALGA ::.. : ::::.:::: :: :: .. ::: : ..::: ..::::: NP_005 AAAPNGLP-----GFSVENIMT-LRTSPPGG--ELSPG-------AGRAGLVVPPLAL-P 270 280 290 300 360 370 380 390 400 410 pF1KB9 YSPGQSSLYSSPCSQTSSAGSSGGGGGGAGAAGGAGGAGTYHCNLQAMSLYAAGERGGHL :. . . :..::.: : :::::: :.:...:::::...:: .:. NP_005 YAAAPPAAYGQPCAQ----------GLEAGAAGG------YQCSMRAMSLYTGAERPAHM 310 320 330 340 420 430 440 450 460 pF1KB9 QGAPGGAGGSAVDDPLPDYSLPPVTSSSSSSLSHGGGGGGGGGG---QEAGHH------- : :.:. : :. :.. :. .:. : :. .. : :. ::: NP_005 CVPP------ALDEALSDHPSGPTSPLSALNLAAGQEGALAATGHHHQHHGHHHPQAPPP 350 360 370 380 390 400 470 480 490 500 510 pF1KB9 -----------PAAHQGRLTSWYLNQAGGDLGHLASAAAAAAAAGYPGQQQNFHSVREMF :.: .. .:::::..: ::.:: . . :: :::.: .::::: NP_005 PPAPQPQPTPQPGAAAAQAASWYLNHSG-DLNHLPGHTFAA-------QQQTFPNVREMF 410 420 430 440 450 520 530 540 550 pF1KB9 ESQRIGLNNSP-----VNGNSSCQMAFPSSQSLYRTSGAFVYDCSKF .:.:.:..:: :.::.:::. . :. ::: .. . :::.:. NP_005 NSHRLGIENSTLGESQVSGNASCQLPYRSTPPLYRHAAPYSYDCTKY 460 470 480 490 500 >>NP_004109 (OMIM: 602939) forkhead box protein S1 [Homo (330 aa) initn: 658 init1: 571 opt: 655 Z-score: 294.1 bits: 63.5 E(85289): 1.2e-09 Smith-Waterman score: 684; 44.6% identity (62.2% similar) in 278 aa overlap (65-301:8-282) 40 50 60 70 80 90 pF1KB9 GGYTAMPAPMSVYSHPAHAEQYPGGMARAYGPYTPQPQPKDMVKPPYSYIALITMAIQNA :: .: .: .::::::::::.::::.. NP_004 MQQQPLPGPGAPTTEP---TKPPYSYIALIAMAIQSS 10 20 30 100 110 120 130 140 150 pF1KB9 PDKKITLNGIYQFIMDRFPFYRDNKQGWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTL : .. ::.:::..:: :: ::: :. :::::::::::::::::::::::.:::::::::: NP_004 PGQRATLSGIYRYIMGRFAFYRHNRPGWQNSIRHNLSLNECFVKVPRDDRKPGKGSYWTL 40 50 60 70 80 90 160 170 180 190 200 pF1KB9 DPDSYNMFENGSFLRRRRRFKKKDAV-------KDKEEKDRLHLKEPPPP----GRQ--- ::: ..:::.:::::::::: .. .. : .. 
: ..: : ::: NP_004 DPDCHDMFEHGSFLRRRRRFTRQTGAEGTRGPAKARRGPLRATSQDPGVPNATTGRQCSF 100 110 120 130 140 150 210 220 230 240 pF1KB9 PP--PAPPEQADGNAPGPQP-------------PPVRIQDIKTENGTCPSP-PQPLSPAA :: : : . :. : .: ::.. ..:.: . .::. : : .. NP_004 PPELPDPKGLSFGGLVGAMPASMCPATTDGRPRPPMEPKEISTPKPACPGELPVATSSSS 160 170 180 190 200 210 250 260 270 280 290 pF1KB9 ALGSGSAAAVPKIESPDSSSSSLSSGSSPPGSLPSARPLSLD---GAD--------SAPP . : :. . :: ... . . : : :: . : .:. ::: :: : NP_004 CPAFGFPAGFSEAESFNKAPTPVLSPESGIGSSYQCRLQALNFCMGADPGLEHLLASAAP 220 230 240 250 260 270 300 310 320 330 340 350 pF1KB9 PPAPSAPPPHHSQGFSVDNIMTSLRGSPQSAAAELSSGLLASAAASSRAGIAPPLALGAY ::: .:: NP_004 SPAPPTPPGSLRAPLPLPTDHKEPWVAGGFPVQGGSGYPLGLTPCLYRTPGMFFFE 280 290 300 310 320 330 >>NP_004463 (OMIM: 601091) forkhead box protein D1 [Homo (465 aa) initn: 590 init1: 514 opt: 649 Z-score: 289.7 bits: 63.2 E(85289): 2.1e-09 Smith-Waterman score: 655; 37.7% identity (58.4% similar) in 361 aa overlap (31-387:80-419) 10 20 30 40 50 60 pF1KB9 MQARYSVSSPNSLGVVPYLGGEQSYYRAAAAAAGGGYTAMPAPMSVYSHPAHAEQYPGGM : .:: : :.: . . : : :: NP_004 AQRRRRRRSYAGEDELEDLEEEEDDDDILLAPPAGGSPAPPGPAPAAG--AGAGGGGGGG 50 60 70 80 90 100 70 80 90 100 110 120 pF1KB9 ARAYGPYTPQPQPKDMVKPPYSYIALITMAIQNAPDKKITLNGIYQFIMDRFPFYRDNKQ . . : . . . .::::::::::::::: ..: :..::. : .:: :::.::.. NP_004 GAGGGGSAGSGAKNPLVKPPYSYIALITMAILQSPKKRLTLSEICEFISGRFPYYREKFP 110 120 130 140 150 160 130 140 150 160 170 180 pF1KB9 GWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTLDPDSYNMFENGSFLRRRRRFKKKDAV .::::::::::::.::::.::. .::::.::::::.: .::.::::::::.:::.. . NP_004 AWQNSIRHNLSLNDCFVKIPREPGNPGKGNYWTLDPESADMFDNGSFLRRRKRFKRQPLL 170 180 190 200 210 220 190 200 210 220 230 pF1KB9 K-DKEEKDRLHLKEPPPPGRQPPPAPPEQADGNAPGPQPPPVRIQDIKTENGTCPSPPQP . . : :. : :: : :.: ::: : . : : NP_004 PPNAAAAESLLLRGAGAAGGAGDPAAA--AALFPPAPPPPPHAYGYGPYGCGYGLQLP-P 230 240 250 260 270 280 240 250 260 270 280 290 pF1KB9 LSPAAALGSGSAAAVPKIESPDSSSSSLSSGSSPPGSLP--SARPLSLDGADSAPPPPAP .: .:: ...::: ...... : :: : .: :. . : : . 
NP_004 YAPPSALFAAAAAA--------AAAAAFHPHSPPPPPPPHGAAAELARTAFGYRPHPLGA 290 300 310 320 330 300 310 320 330 340 350 pF1KB9 SAPPPHHSQGFSVDNIMTS-LRGSPQSAAAELSSGLLASAAASSRAGIAPPLALGAYSPG . : : ... .. . .: : :: : . . .: :. :::.. :. : : .. :: NP_004 ALPGPLPASAAKAGGPGASALARSPFSIES-IIGGSLGPAAAAAAAAQAAAAAQASPSP- 340 350 360 370 380 390 360 370 380 390 400 410 pF1KB9 QSSLYSSPCSQTSSAGSSGGGGGGAGAAGGAGGAGTYHCNLQAMSLYAAGERGGHLQGAP :: . . :::::: .. .:.: : NP_004 ------SPVAAPPAPGSSGGGCAAQAAVGPAAALTRSLVAAAAAAASSVSSSAALGTLHQ 400 410 420 430 440 420 430 440 450 460 470 pF1KB9 GGAGGSAVDDPLPDYSLPPVTSSSSSSLSHGGGGGGGGGGQEAGHHPAAHQGRLTSWYLN NP_004 GTALSSVENFTARISNC 450 460 >>NP_004465 (OMIM: 602211) forkhead box protein D2 [Homo (495 aa) initn: 716 init1: 509 opt: 644 Z-score: 287.3 bits: 62.8 E(85289): 2.9e-09 Smith-Waterman score: 679; 33.9% identity (52.3% similar) in 499 aa overlap (1-459:39-484) 10 20 30 pF1KB9 MQARYSVSSPNSLGVVPYLGGEQSYYRAAA . :: . .: . :.:. : : .: : NP_004 EIMSSESSPAALSEADADIDVVGGGSGGGELPARSGPRAPRD--VLPH-GHEPPAEEAEA 10 20 30 40 50 60 40 50 60 70 pF1KB9 AAA------GGGYTAMPAPMSVYSHPAHAEQYPG-GMARAYGPYTPQPQPKD-------- : :: . : .. . : : :: : : : : : : : . NP_004 DLAEDEEESGGCSDGEPRALASRG-AAAAAGSPGPGAAAARGAAGPGPGPPSGGAATRSP 70 80 90 100 110 120 80 90 100 110 120 130 pF1KB9 MVKPPYSYIALITMAIQNAPDKKITLNGIYQFIMDRFPFYRDNKQGWQNSIRHNLSLNEC .::::::::::::::: ..: :..::. : .:: :::.::.. .::::::::::::.: NP_004 LVKPPYSYIALITMAILQSPKKRLTLSEICEFISGRFPYYREKFPAWQNSIRHNLSLNDC 130 140 150 160 170 180 140 150 160 170 180 190 pF1KB9 FVKVPRDDKKPGKGSYWTLDPDSYNMFENGSFLRRRRRFKKKDAVKDKEEKDRLHLKEPP :::.::. .::::.::::::.: .::.::::::::.:::.. : NP_004 FVKIPREPGNPGKGNYWTLDPESADMFDNGSFLRRRKRFKRQPL---------------P 190 200 210 220 200 210 220 230 pF1KB9 PPGRQPPPAPPEQADGNA-----PGPQPPPVRIQD---------IKTENGTCPSP---PQ :: .: : : :.: :: : . . .. :.: :. 
NP_004 PPHPHPHPHPELLLRGGAAAAGDPGAFLPGFAAYGAYGYGYGLALPAYGAPPPGPAPHPH 230 240 250 260 270 280 240 250 260 270 280 290 pF1KB9 PLSPAAALGSGSAAAVPKIESPDSSSSSLSSGSSPPGSLPSARPLSLDGADSAPPPPAPS : : :.....::: .. : . ... ::: :.: .. :. :: : . : NP_004 PHPHAFAFAAAAAAAPCQLSVPPGRAAA-----PPPGP-PTASVFAGAGSAPAPAPASGS 290 300 310 320 330 340 300 310 320 330 340 350 pF1KB9 APPPHHSQGFSVDNIMTSLRGSPQSAAAELSSGLLASAAASSRAGIAPPLALGAYSPGQS .: : . : : .:::. : : :...:: : :. . . NP_004 GPGPGPA-------------GLPAFLGAELGC-----AKAFYAASLSPPAA-GTAAGLPT 350 360 370 380 360 370 380 390 400 410 pF1KB9 SLYSSPCSQTSSAGSSGGGGGGAGAAG--------GAGGAGTYHCNLQAMSLYAAGERGG .: . .:...:..::::.::: : ::.:. : . : : NP_004 ALLRQGL-KTDAGGGAGGGGAGAGQRPSFSIDHIMGHGGGGA------APPGAGEGSPGP 390 400 410 420 430 420 430 440 450 460 470 pF1KB9 HLQGAPGGAGGSAVDDPLPDYSLPPVTSSSSSSLSHGGGGGGGGGGQEAGHHPAAHQGRL . .: : .: . : : .: :: .. ::: : . ..:.. : NP_004 PFAAAAGPGGQAQVLAMLTAPALAPV--AGHIRLSHPGDALLSSGSRFASKVAGLSGCHF 440 450 460 470 480 490 480 490 500 510 520 530 pF1KB9 TSWYLNQAGGDLGHLASAAAAAAAAGYPGQQQNFHSVREMFESQRIGLNNSPVNGNSSCQ >>NP_005241 (OMIM: 603252) forkhead box protein L1 [Homo (345 aa) initn: 632 init1: 562 opt: 637 Z-score: 286.5 bits: 62.1 E(85289): 3.2e-09 Smith-Waterman score: 671; 42.6% identity (64.4% similar) in 298 aa overlap (78-367:49-319) 50 60 70 80 90 100 pF1KB9 SHPAHAEQYPGGMARAYGPYTPQPQPKDMVKPPYSYIALITMAIQNAPDKKITLNGIYQF ::::::::::.::::.::....:::::::: NP_005 YLYGPERPGLPLAFAPAAALAASGRAETPQKPPYSYIALIAMAIQDAPEQRVTLNGIYQF 20 30 40 50 60 70 110 120 130 140 150 160 pF1KB9 IMDRFPFYRDNKQGWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTLDPDSYNMFENGSF ::::::::.::.::::::::::::::.:::::::. .::::::::::: .:::::.. NP_005 IMDRFPFYHDNRQGWQNSIRHNLSLNDCFVKVPREKGRPGKGSYWTLDPRCLDMFENGNY 80 90 100 110 120 130 170 180 190 200 210 220 pF1KB9 LRRRRRFKK-KDAVKDKEEKDRLHLKEPPPPGRQPPPAPPEQADGNAPGPQPPPVRIQDI ::.:. : : . :. . . : .. 
: :: ..: : : : :.: NP_005 RRRKRKPKPGPGAPEAKRPRAETH--------QRSAEAQPEAGSG-AGGSGPAISRLQ-- 140 150 160 170 180 230 240 250 260 270 280 pF1KB9 KTENGTCPSPPQPL----SPAAALGSGSAAAVPKIESPDSSSSSLSSGSSPPGSLPSARP . :. :.:: :: : : : ::. .... ..:.. . .:: NP_005 -----AAPAGPSPLLDGPSPPAPLH------WPGTASPNEDAGDAAQGAAAVAVGQAAR- 190 200 210 220 230 290 300 310 320 330 340 pF1KB9 LSLDGADSAPPPPAPSAPPPH-HSQGFSVDNIMTSLRGSPQSAAAELSSGLLASAAASSR . :: : : . :.: .:..::.:.:... .:. .. :: .: :. . ..: NP_005 -TGDGPGSPLRPASRSSPKSSDKSKSFSIDSILAGKQGQKPPSGDELLGG--AKPGPGGR 240 250 260 270 280 290 350 360 370 380 390 pF1KB9 AGIAPPLALGA--YSPGQSSLYSSPCSQTSSAGSSGGGGGGAGAAGGAGGAGTYHCNLQA : : :: .. : ..::. .: : NP_005 LG-ASLLAASSSLRPPFNASLMLDPHVQGGFYQLGIPFLSYFPLQVPDTVLHFQ 300 310 320 330 340 >>NP_004464 (OMIM: 241850,602617,616534) forkhead box pr (373 aa) initn: 684 init1: 540 opt: 629 Z-score: 282.9 bits: 61.6 E(85289): 5.1e-09 Smith-Waterman score: 641; 37.5% identity (57.3% similar) in 363 aa overlap (57-393:32-369) 30 40 50 60 70 80 pF1KB9 RAAAAAAGGGYTAMPAPMSVYSHPAHAEQYPG-GMARAYGPYTPQPQPKDMVKPPYSYIA :: . .:. : . .: . :::::::: NP_004 TAESGPPPPQPEVLATVKEERGETAAGAGVPGEATGRGAGGRR-RKRPLQRGKPPYSYIA 10 20 30 40 50 60 90 100 110 120 130 140 pF1KB9 LITMAIQNAPDKKITLNGIYQFIMDRFPFYRDNKQGWQNSIRHNLSLNECFVKVPRDDKK ::.::: .::....::.:::.:: .:::::::: . :::::::::.::.::.:.::. . NP_004 LIAMAIAHAPERRLTLGGIYKFITERFPFYRDNPKKWQNSIRHNLTLNDCFLKIPREAGR 70 80 90 100 110 120 150 160 170 180 190 pF1KB9 PGKGSYWTLDPDSYNMFENGSFLRRRRRFKKKD-AVKDKEEKDRLHLKEPPP-------- ::::.::.:::.. .:::.:::::::.:::..: .. .: NP_004 PGKGNYWALDPNAEDMFESGSFLRRRKRFKRSDLSTYPAYMHDAAAAAAAAAAAAAAAAI 130 140 150 160 170 180 200 210 220 230 240 pF1KB9 -PGRQP---PPAPPEQADGNAPGPQ---PPPVRIQDIKTENGTCPS----PPQPLSPAAA :: : :: : : :: :. :::: . : : : .:::: NP_004 FPGAVPAARPPYPGAVYAGYAP-PSLAAPPPVYYP--AASPGPCRVFGLVPERPLSPE-- 190 200 210 220 230 250 260 270 280 290 300 pF1KB9 LGSGSAAAVPKIESPDSSSSSLSSGSSPPGSLPSARPLSLDGADSAPPPPAPSAPPPHHS :: : .: .: . :.:. :.. . .: . :: : ::: .. 
NP_004 LG-------PAPSGPGGSCAFASAGA--PATTTGYQPAGCTGAR----PANPSAYAAAYA 240 250 260 270 280 310 320 330 340 350 360 pF1KB9 QGFSVDNIMTSLRGSPQSAAAELSSGLLASAAASSRAGIAPPLAL-GAYSPGQSSLYSSP . :. . . :: ::: .: . :..: .:. . . : :::: . .. NP_004 ---GPDGAYPQGAGSAIFAAAGRLAGPASPPAGGSSGGVETTVDFYGRTSPGQFGALGA- 290 300 310 320 330 370 380 390 400 410 420 pF1KB9 CSQTSSAGSSGGGGGGA----GAAGGAGGAGTYHCNLQAMSLYAAGERGGHLQGAPGGAG : . .:. ::...:: ::. :: . NP_004 CY--NPGGQLGGASAGAYHARHAAAYPGGIDRFVSAM 340 350 360 370 430 440 450 460 470 480 pF1KB9 GSAVDDPLPDYSLPPVTSSSSSSLSHGGGGGGGGGGQEAGHHPAAHQGRLTSWYLNQAGG >>XP_016876735 (OMIM: 602294) PREDICTED: hepatocyte nucl (439 aa) initn: 556 init1: 523 opt: 622 Z-score: 279.1 bits: 61.1 E(85289): 8.3e-09 Smith-Waterman score: 645; 37.5% identity (60.7% similar) in 392 aa overlap (9-369:76-439) 10 20 30 pF1KB9 MQARYSVSSPNSLGVVPYLGGEQSYYRAAAAAAGGGYT ::...:. .:..: ::. . : :. XP_016 PGAVAGMPGGSAGAMNSMTAAGVTAMGTALSPSGMGA---MGAQQ----AASMNGLGPYA 50 60 70 80 90 40 50 60 70 80 90 pF1KB9 AMPAP-MSVYSH-PAH---AEQYPGGMARAYGPYTPQPQPKDMVKPPYSYIALITMAIQN : : :: ... :.. .. :: :... :. .:::::::.:::::::. XP_016 AAMNPCMSPMAYAPSNLGRSRAGGGGDAKTFKRSYPH------AKPPYSYISLITMAIQQ 100 110 120 130 140 150 100 110 120 130 140 150 pF1KB9 APDKKITLNGIYQFIMDRFPFYRDNKQGWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWT ::.: .::. :::.::: ::.::.:.: :::::::.::.:.::::: :. ::::::::: XP_016 APSKMLTLSEIYQWIMDLFPYYRQNQQRWQNSIRHSLSFNDCFVKVARSPDKPGKGSYWT 160 170 180 190 200 210 160 170 180 190 200 210 pF1KB9 LDPDSYNMFENGSFLRRRRRFK---KKDAVKDKEEKDRLHLKEPPPPGRQPPPAPPEQAD : ::: :::::: .:::..::: . : . . : .:. : . .. XP_016 LHPDSGNMFENGCYLRRQKRFKCEKQPGAGGGGGSGSGGSGAKGGPESRKDPSG---ASN 220 230 240 250 260 220 230 240 250 260 pF1KB9 GNAPGPQPPPVRIQDIKTENGTCPSP---PQPLSPAAALGSGSAAAVPKIESPDSSSSS- .: .: :. . . :.. :.: :: :. ..: ..:.:. ....: ::.. 
XP_016 PSADSPLHRGVHGKTGQLEGAPAPGPAASPQTLDHSGATATGGAS---ELKTPASSTAPP 270 280 290 300 310 320 270 280 290 300 310 pF1KB9 LSSGSSPPGSLPSARPLSLDGADSAPPPPAPSAP-PPHHS--QGFSVDNIMTS------- .::: ::.: :. : : . :: ::.: . ::..:.:.: XP_016 ISSG---PGALASV-PASHPAHGLAPHESQLHLKGDPHYSFNHPFSINNLMSSSEQQHKL 330 340 350 360 370 380 320 330 340 350 360 pF1KB9 --------LRGSPQSAAAELSSGL-LASAAASSRAGIAPPLALGAYSPGQSSLYSSPCSQ :. :: ... : ..: :.::....:. : : :: : .:: : . XP_016 DFKAYEQALQYSPYGST--LPASLPLGSASVTTRSPIEPSALEPAYYQG---VYSRPVLN 390 400 410 420 430 370 380 390 400 410 420 pF1KB9 TSSAGSSGGGGGGAGAAGGAGGAGTYHCNLQAMSLYAAGERGGHLQGAPGGAGGSAVDDP :: XP_016 TS >>NP_004487 (OMIM: 602294) hepatocyte nuclear factor 3-a (472 aa) initn: 556 init1: 523 opt: 622 Z-score: 278.7 bits: 61.1 E(85289): 8.8e-09 Smith-Waterman score: 645; 37.5% identity (60.7% similar) in 392 aa overlap (9-369:109-472) 10 20 30 pF1KB9 MQARYSVSSPNSLGVVPYLGGEQSYYRAAAAAAGGGYT ::...:. .:..: ::. . : :. NP_004 PGAVAGMPGGSAGAMNSMTAAGVTAMGTALSPSGMGA---MGAQQ----AASMNGLGPYA 80 90 100 110 120 130 40 50 60 70 80 90 pF1KB9 AMPAP-MSVYSH-PAH---AEQYPGGMARAYGPYTPQPQPKDMVKPPYSYIALITMAIQN : : :: ... :.. .. :: :... :. .:::::::.:::::::. NP_004 AAMNPCMSPMAYAPSNLGRSRAGGGGDAKTFKRSYPH------AKPPYSYISLITMAIQQ 140 150 160 170 180 100 110 120 130 140 150 pF1KB9 APDKKITLNGIYQFIMDRFPFYRDNKQGWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWT ::.: .::. :::.::: ::.::.:.: :::::::.::.:.::::: :. ::::::::: NP_004 APSKMLTLSEIYQWIMDLFPYYRQNQQRWQNSIRHSLSFNDCFVKVARSPDKPGKGSYWT 190 200 210 220 230 240 160 170 180 190 200 210 pF1KB9 LDPDSYNMFENGSFLRRRRRFK---KKDAVKDKEEKDRLHLKEPPPPGRQPPPAPPEQAD : ::: :::::: .:::..::: . : . . : .:. : . .. NP_004 LHPDSGNMFENGCYLRRQKRFKCEKQPGAGGGGGSGSGGSGAKGGPESRKDPSG---ASN 250 260 270 280 290 300 220 230 240 250 260 pF1KB9 GNAPGPQPPPVRIQDIKTENGTCPSP---PQPLSPAAALGSGSAAAVPKIESPDSSSSS- .: .: :. . . :.. :.: :: :. ..: ..:.:. ....: ::.. 
NP_004 PSADSPLHRGVHGKTGQLEGAPAPGPAASPQTLDHSGATATGGAS---ELKTPASSTAPP 310 320 330 340 350 270 280 290 300 310 pF1KB9 LSSGSSPPGSLPSARPLSLDGADSAPPPPAPSAP-PPHHS--QGFSVDNIMTS------- .::: ::.: :. : : . :: ::.: . ::..:.:.: NP_004 ISSG---PGALASV-PASHPAHGLAPHESQLHLKGDPHYSFNHPFSINNLMSSSEQQHKL 360 370 380 390 400 410 320 330 340 350 360 pF1KB9 --------LRGSPQSAAAELSSGL-LASAAASSRAGIAPPLALGAYSPGQSSLYSSPCSQ :. :: ... : ..: :.::....:. : : :: : .:: : . NP_004 DFKAYEQALQYSPYGST--LPASLPLGSASVTTRSPIEPSALEPAYYQG---VYSRPVLN 420 430 440 450 460 470 370 380 390 400 410 420 pF1KB9 TSSAGSSGGGGGGAGAAGGAGGAGTYHCNLQAMSLYAAGERGGHLQGAPGGAGGSAVDDP :: NP_004 TS >>NP_036318 (OMIM: 107250,601094,610256) forkhead box pr (319 aa) initn: 580 init1: 525 opt: 611 Z-score: 276.5 bits: 60.2 E(85289): 1.2e-08 Smith-Waterman score: 638; 40.8% identity (57.9% similar) in 292 aa overlap (32-297:11-301) 10 20 30 40 50 pF1KB9 QARYSVSSPNSLGVVPYLGGEQSYYRAAAAAAGGGYTAMPA--PMSVYSHP-AHAE--QY :: .:. :.:: : . : : :: . NP_036 MAGRSDMDPPAAFSGFPALPAVAPSGPPPSPLAGAEPGRE 10 20 30 40 60 70 80 90 100 pF1KB9 PGGMARAYGPYTPQP---------QPKDMVKPPYSYIALITMAIQNAPDKKITLNGIYQF : : . : .: : .: . ::::::::::.::. .:: ...:: .::.: NP_036 PEEAAAGRGEAAPTPAPGPGRRRRRPLQRGKPPYSYIALIAMALAHAPGRRLTLAAIYRF 50 60 70 80 90 100 110 120 130 140 150 160 pF1KB9 IMDRFPFYRDNKQGWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTLDPDSYNMFENGSF : .:: ::::. . :::::::::.::.:::::::. .::::.:::::: . .::.:::: NP_036 ITERFAFYRDSPRKWQNSIRHNLTLNDCFVKVPREPGNPGKGNYWTLDPAAADMFDNGSF 110 120 130 140 150 160 170 180 190 200 210 220 pF1KB9 LRRRRRFKKKDAVKDKEEKDRLHLKEPPPPGRQPPPAPPEQADGNAPGPQP-PPVRIQDI ::::.:::. . : : : : :.: . . :: : ::.:. .. NP_036 LRRRKRFKRAELPAHAAAAPGPPLPFPYAP-YAPAPGPALLVPPPSAGPGPSPPARLFSV 170 180 190 200 210 230 240 250 260 270 pF1KB9 KTENGT--------CPSPP---QPLSPAAALGSGSAAAVPKIESPDSSSSSLSSGSSPPG . . : :: : . :::. .::: : . : . : . 
:: NP_036 DSLVNLQPELAGLGAPEPPCCAAPDAAAAAFPPCAAAASPPLYSQVPDRLVLPATRPGPG 220 230 240 250 260 270 280 290 300 310 320 330 pF1KB9 SLPSARPLSLDGADSAPPPPAPSAPPPHHSQGFSVDNIMTSLRGSPQSAAAELSSGLLAS ::. :.: : .: : .: NP_036 PLPAEPLLALAGPAAALGPLSPGEAYLRQPGFASGLERYL 280 290 300 310

553 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 03:43:52 2016 done: Tue Nov 8 03:43:53 2016
 Total Scan time: 9.420 Total Display time: 0.070

Function used was FASTA [36.3.4 Apr, 2011]