Result of FASTA (ccds) for pF1KB9731
FASTA searches a protein or DNA sequence data bank, version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9731, 474 aa
  1>>>pF1KB9731 474 - 474 aa - 474 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 6.7474+/-0.000863; mu= 13.1508+/- 0.053
 mean_var=122.0177+/-23.824, 0's: 0 Z-trim(111.1): 126  B-trim: 0 in 0/51
 Lambda= 0.116108
 statistics sampled from 12017 (12143) to 12017 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.721), E-opt: 0.2 (0.373), width:  16
 Scan time:  3.410

The best scores are:                                      opt bits E(32554)
CCDS13330.1 HNF4A gene_id:3172|Hs108|chr20         ( 474) 3183 544.0 1.2e-154
CCDS74728.1 HNF4A gene_id:3172|Hs108|chr20         ( 449) 2945 504.2 1.2e-142
CCDS42876.1 HNF4A gene_id:3172|Hs108|chr20         ( 452) 2945 504.2 1.2e-142
CCDS46605.1 HNF4A gene_id:3172|Hs108|chr20         ( 464) 2812 481.9 6.2e-136
CCDS46604.1 HNF4A gene_id:3172|Hs108|chr20         ( 442) 2574 442.0  6e-124
CCDS13331.1 HNF4A gene_id:3172|Hs108|chr20         ( 417) 2497 429.1 4.3e-120
CCDS68131.1 HNF4A gene_id:3172|Hs108|chr20         ( 395) 2259 389.2 4.2e-108
CCDS6220.2 HNF4G gene_id:3174|Hs108|chr8           ( 445) 2080 359.3 4.9e-99
CCDS83303.1 HNF4G gene_id:3174|Hs108|chr8          ( 408) 1946 336.8 2.6e-92
CCDS35172.1 RXRA gene_id:6256|Hs108|chr9           ( 462)  907 162.8 7.1e-40
CCDS4768.1 RXRB gene_id:6257|Hs108|chr6            ( 533)  907 162.8 7.9e-40
CCDS1248.1 RXRG gene_id:6258|Hs108|chr1            ( 463)  903 162.1 1.1e-39
CCDS72970.1 RXRG gene_id:6258|Hs108|chr1           ( 340)  896 160.9   2e-39
CCDS10375.1 NR2F2 gene_id:7026|Hs108|chr15         ( 414)  787 142.7 7.3e-34
CCDS4068.1 NR2F1 gene_id:7025|Hs108|chr5           ( 423)  786 142.5 8.3e-34
CCDS59007.1 RXRB gene_id:6257|Hs108|chr6           ( 537)  602 111.7 1.9e-24
CCDS45359.1 NR2F2 gene_id:7026|Hs108|chr15         ( 261)  431 82.9 4.5e-16
CCDS45358.1 NR2F2 gene_id:7026|Hs108|chr15         ( 281)  431 82.9 4.8e-16
CCDS41821.1 NR2C1 gene_id:7181|Hs108|chr12         ( 467)  431 83.1 7.2e-16
CCDS44953.1 NR2C1 gene_id:7181|Hs108|chr12         ( 483)  431 83.1 7.3e-16
CCDS9051.1 NR2C1 gene_id:7181|Hs108|chr12          ( 603)  431 83.1 8.7e-16
CCDS74905.1 NR2C2 gene_id:7182|Hs108|chr3          ( 596)  414 80.3 6.2e-15
CCDS2621.1 NR2C2 gene_id:7182|Hs108|chr3           ( 615)  414 80.3 6.4e-15
CCDS12352.1 NR2F6 gene_id:2063|Hs108|chr19         ( 404)  402 78.2 1.9e-14
CCDS8850.1 RARG gene_id:5916|Hs108|chr12           ( 454)  393 76.7 5.8e-14
CCDS58236.1 RARG gene_id:5916|Hs108|chr12          ( 382)  389 76.0   8e-14
CCDS41790.1 RARG gene_id:5916|Hs108|chr12          ( 443)  389 76.0   9e-14
CCDS2642.1 RARB gene_id:5915|Hs108|chr3            ( 448)  387 75.7 1.1e-13
CCDS69165.1 NR2E1 gene_id:7101|Hs108|chr6          ( 422)  381 74.7 2.2e-13
CCDS5063.1 NR2E1 gene_id:7101|Hs108|chr6           ( 385)  376 73.8 3.7e-13
CCDS42317.1 RARA gene_id:5914|Hs108|chr17          ( 457)  377 74.0 3.7e-13
CCDS11366.1 RARA gene_id:5914|Hs108|chr17          ( 462)  377 74.0 3.8e-13
CCDS2201.1 NR4A2 gene_id:4929|Hs108|chr2           ( 598)  369 72.7 1.2e-12
CCDS73751.1 NR2E3 gene_id:10002|Hs108|chr15        ( 367)  363 71.6 1.6e-12
CCDS73750.1 NR2E3 gene_id:10002|Hs108|chr15        ( 410)  363 71.6 1.7e-12
CCDS6744.1 NR4A3 gene_id:8013|Hs108|chr9           ( 443)  358 70.8 3.3e-12
CCDS6743.1 NR4A3 gene_id:8013|Hs108|chr9           ( 626)  357 70.8 4.9e-12
CCDS6742.1 NR4A3 gene_id:8013|Hs108|chr9           ( 637)  357 70.8 4.9e-12
CCDS58060.1 ESRRG gene_id:2104|Hs108|chr1          ( 396)  352 69.8 6.1e-12
CCDS2641.1 THRB gene_id:7068|Hs108|chr3            ( 461)  353 70.0 6.1e-12
CCDS42316.1 THRA gene_id:7067|Hs108|chr17          ( 410)  346 68.8 1.2e-11
CCDS58546.1 THRA gene_id:7067|Hs108|chr17          ( 451)  346 68.8 1.3e-11
CCDS11360.1 THRA gene_id:7067|Hs108|chr17          ( 490)  346 68.8 1.4e-11
CCDS8818.1 NR4A1 gene_id:3164|Hs108|chr12          ( 598)  345 68.7 1.9e-11
CCDS55828.1 NR4A1 gene_id:3164|Hs108|chr12         ( 611)  345 68.7 1.9e-11
CCDS73471.1 NR4A1 gene_id:3164|Hs108|chr12         ( 652)  345 68.8   2e-11
CCDS30856.1 RORC gene_id:6097|Hs108|chr1           ( 497)  340 67.8 2.9e-11
CCDS1004.1 RORC gene_id:6097|Hs108|chr1            ( 518)  340 67.8   3e-11
CCDS8757.1 VDR gene_id:7421|Hs108|chr12            ( 427)  337 67.3 3.7e-11
CCDS6646.1 RORB gene_id:6096|Hs108|chr9            ( 459)  337 67.3 3.9e-11


>>CCDS13330.1 HNF4A gene_id:3172|Hs108|chr20              (474 aa)
 initn: 3183 init1: 3183 opt: 3183  Z-score: 2889.7  bits: 544.0 E(32554): 1.2e-154
Smith-Waterman score: 3183; 100.0% identity (100.0% similar) in 474 aa overlap (1-474:1-474)

               10        20        30        40        50        60
pF1KB9 MRLSKTLVDMDMADYSAALDPAYTTLEFENVQVLTMGNDTSPSEGTNLNAPNSLGVSALC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MRLSKTLVDMDMADYSAALDPAYTTLEFENVQVLTMGNDTSPSEGTNLNAPNSLGVSALC
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 AICGDRATGKHYGASSCDGCKGFFRRSVRKNHMYSCRFSRQCVVDKDKRNQCRYCRLKKC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 AICGDRATGKHYGASSCDGCKGFFRRSVRKNHMYSCRFSRQCVVDKDKRNQCRYCRLKKC
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 FRAGMKKEAVQNERDRISTRRSSYEDSSLPSINALLQAEVLSRQITSPVSGINGDIRAKK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 FRAGMKKEAVQNERDRISTRRSSYEDSSLPSINALLQAEVLSRQITSPVSGINGDIRAKK
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB9 IASIADVCESMKEQLLVLVEWAKYIPAFCELPLDDQVALLRAHAGEHLLLGATKRSMVFK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 IASIADVCESMKEQLLVLVEWAKYIPAFCELPLDDQVALLRAHAGEHLLLGATKRSMVFK
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB9 DVLLLGNDYIVPRHCPELAEMSRVSIRILDELVLPFQELQIDDNEYAYLKAIIFFDPDAK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 DVLLLGNDYIVPRHCPELAEMSRVSIRILDELVLPFQELQIDDNEYAYLKAIIFFDPDAK
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB9 GLSDPGKIKRLRSQVQVSLEDYINDRQYDSRGRFGELLLLLPTLQSITWQMIEQIQFIKL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 GLSDPGKIKRLRSQVQVSLEDYINDRQYDSRGRFGELLLLLPTLQSITWQMIEQIQFIKL
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB9 FGMAKIDNLLQEMLLGGSPSDAPHAHHPLHPHLMQEHMGTNVIVANTMPTHLSNGQMCEW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 FGMAKIDNLLQEMLLGGSPSDAPHAHHPLHPHLMQEHMGTNVIVANTMPTHLSNGQMCEW
              370       380       390       400       410       420

              430       440       450       460       470    
pF1KB9 PRPRGQAATPETPQPSPPGGSGSEPYKLLPGAVATIVKPLSAIPQPTITKQEVI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 PRPRGQAATPETPQPSPPGGSGSEPYKLLPGAVATIVKPLSAIPQPTITKQEVI
              430       440       450       460       470    

>>CCDS74728.1 HNF4A gene_id:3172|Hs108|chr20              (449 aa)
 initn: 2945 init1: 2945 opt: 2945  Z-score: 2674.6  bits: 504.2 E(32554): 1.2e-142
Smith-Waterman score: 2945; 100.0% identity (100.0% similar) in 436 aa overlap (39-474:14-449)

       10        20        30        40        50        60        
pF1KB9 DMDMADYSAALDPAYTTLEFENVQVLTMGNDTSPSEGTNLNAPNSLGVSALCAICGDRAT
                                     ::::::::::::::::::::::::::::::
CCDS74                  MSDWGQGFPQDPPDTSPSEGTNLNAPNSLGVSALCAICGDRAT
                                10        20        30        40   

       70        80        90       100       110       120        
pF1KB9 GKHYGASSCDGCKGFFRRSVRKNHMYSCRFSRQCVVDKDKRNQCRYCRLKKCFRAGMKKE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 GKHYGASSCDGCKGFFRRSVRKNHMYSCRFSRQCVVDKDKRNQCRYCRLKKCFRAGMKKE
            50        60        70        80        90       100   

      130       140       150       160       170       180        
pF1KB9 AVQNERDRISTRRSSYEDSSLPSINALLQAEVLSRQITSPVSGINGDIRAKKIASIADVC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 AVQNERDRISTRRSSYEDSSLPSINALLQAEVLSRQITSPVSGINGDIRAKKIASIADVC
           110       120       130       140       150       160   

      190       200       210       220       230       240        
pF1KB9 ESMKEQLLVLVEWAKYIPAFCELPLDDQVALLRAHAGEHLLLGATKRSMVFKDVLLLGND
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 ESMKEQLLVLVEWAKYIPAFCELPLDDQVALLRAHAGEHLLLGATKRSMVFKDVLLLGND
           170       180       190       200       210       220   

      250       260       270       280       290       300        
pF1KB9 YIVPRHCPELAEMSRVSIRILDELVLPFQELQIDDNEYAYLKAIIFFDPDAKGLSDPGKI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 YIVPRHCPELAEMSRVSIRILDELVLPFQELQIDDNEYAYLKAIIFFDPDAKGLSDPGKI
           230       240       250       260       270       280   

      310       320       330       340       350       360        
pF1KB9 KRLRSQVQVSLEDYINDRQYDSRGRFGELLLLLPTLQSITWQMIEQIQFIKLFGMAKIDN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 KRLRSQVQVSLEDYINDRQYDSRGRFGELLLLLPTLQSITWQMIEQIQFIKLFGMAKIDN
           290       300       310       320       330       340   

      370       380       390       400       410       420        
pF1KB9 LLQEMLLGGSPSDAPHAHHPLHPHLMQEHMGTNVIVANTMPTHLSNGQMCEWPRPRGQAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 LLQEMLLGGSPSDAPHAHHPLHPHLMQEHMGTNVIVANTMPTHLSNGQMCEWPRPRGQAA
           350       360       370       380       390       400   

      430       440       450       460       470    
pF1KB9 TPETPQPSPPGGSGSEPYKLLPGAVATIVKPLSAIPQPTITKQEVI
       ::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 TPETPQPSPPGGSGSEPYKLLPGAVATIVKPLSAIPQPTITKQEVI
           410       420       430       440         

>>CCDS42876.1 HNF4A gene_id:3172|Hs108|chr20              (452 aa)
 initn: 2945 init1: 2945 opt: 2945  Z-score: 2674.5  bits: 504.2 E(32554): 1.2e-142
Smith-Waterman score: 2945; 100.0% identity (100.0% similar) in 436 aa overlap (39-474:17-452)

       10        20        30        40        50        60        
pF1KB9 DMDMADYSAALDPAYTTLEFENVQVLTMGNDTSPSEGTNLNAPNSLGVSALCAICGDRAT
                                     ::::::::::::::::::::::::::::::
CCDS42               MVSVNAPLGAPVESSYDTSPSEGTNLNAPNSLGVSALCAICGDRAT
                             10        20        30        40      

       70        80        90       100       110       120        
pF1KB9 GKHYGASSCDGCKGFFRRSVRKNHMYSCRFSRQCVVDKDKRNQCRYCRLKKCFRAGMKKE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 GKHYGASSCDGCKGFFRRSVRKNHMYSCRFSRQCVVDKDKRNQCRYCRLKKCFRAGMKKE
         50        60        70        80        90       100      

      130       140       150       160       170       180        
pF1KB9 AVQNERDRISTRRSSYEDSSLPSINALLQAEVLSRQITSPVSGINGDIRAKKIASIADVC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 AVQNERDRISTRRSSYEDSSLPSINALLQAEVLSRQITSPVSGINGDIRAKKIASIADVC
        110       120       130       140       150       160      

      190       200       210       220       230       240        
pF1KB9 ESMKEQLLVLVEWAKYIPAFCELPLDDQVALLRAHAGEHLLLGATKRSMVFKDVLLLGND
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 ESMKEQLLVLVEWAKYIPAFCELPLDDQVALLRAHAGEHLLLGATKRSMVFKDVLLLGND
        170       180       190       200       210       220      

      250       260       270       280       290       300        
pF1KB9 YIVPRHCPELAEMSRVSIRILDELVLPFQELQIDDNEYAYLKAIIFFDPDAKGLSDPGKI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 YIVPRHCPELAEMSRVSIRILDELVLPFQELQIDDNEYAYLKAIIFFDPDAKGLSDPGKI
        230       240       250       260       270       280      

      310       320       330       340       350       360        
pF1KB9 KRLRSQVQVSLEDYINDRQYDSRGRFGELLLLLPTLQSITWQMIEQIQFIKLFGMAKIDN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 KRLRSQVQVSLEDYINDRQYDSRGRFGELLLLLPTLQSITWQMIEQIQFIKLFGMAKIDN
        290       300       310       320       330       340      

      370       380       390       400       410       420        
pF1KB9 LLQEMLLGGSPSDAPHAHHPLHPHLMQEHMGTNVIVANTMPTHLSNGQMCEWPRPRGQAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 LLQEMLLGGSPSDAPHAHHPLHPHLMQEHMGTNVIVANTMPTHLSNGQMCEWPRPRGQAA
        350       360       370       380       390       400      

      430       440       450       460       470    
pF1KB9 TPETPQPSPPGGSGSEPYKLLPGAVATIVKPLSAIPQPTITKQEVI
       ::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 TPETPQPSPPGGSGSEPYKLLPGAVATIVKPLSAIPQPTITKQEVI
        410       420       430       440       450  

>>CCDS46605.1 HNF4A gene_id:3172|Hs108|chr20              (464 aa)
 initn: 2783 init1: 2783 opt: 2812  Z-score: 2554.0  bits: 481.9 E(32554): 6.2e-136
Smith-Waterman score: 3061; 97.7% identity (97.9% similar) in 474 aa overlap (1-474:1-464)

               10        20        30        40        50        60
pF1KB9 MRLSKTLVDMDMADYSAALDPAYTTLEFENVQVLTMGNDTSPSEGTNLNAPNSLGVSALC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 MRLSKTLVDMDMADYSAALDPAYTTLEFENVQVLTMGNDTSPSEGTNLNAPNSLGVSALC
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 AICGDRATGKHYGASSCDGCKGFFRRSVRKNHMYSCRFSRQCVVDKDKRNQCRYCRLKKC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 AICGDRATGKHYGASSCDGCKGFFRRSVRKNHMYSCRFSRQCVVDKDKRNQCRYCRLKKC
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 FRAGMKKEAVQNERDRISTRRSSYEDSSLPSINALLQAEVLSRQITSPVSGINGDIRAKK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 FRAGMKKEAVQNERDRISTRRSSYEDSSLPSINALLQAEVLSRQITSPVSGINGDIRAKK
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB9 IASIADVCESMKEQLLVLVEWAKYIPAFCELPLDDQVALLRAHAGEHLLLGATKRSMVFK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 IASIADVCESMKEQLLVLVEWAKYIPAFCELPLDDQVALLRAHAGEHLLLGATKRSMVFK
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB9 DVLLLGNDYIVPRHCPELAEMSRVSIRILDELVLPFQELQIDDNEYAYLKAIIFFDPDAK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 DVLLLGNDYIVPRHCPELAEMSRVSIRILDELVLPFQELQIDDNEYAYLKAIIFFDPDAK
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB9 GLSDPGKIKRLRSQVQVSLEDYINDRQYDSRGRFGELLLLLPTLQSITWQMIEQIQFIKL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 GLSDPGKIKRLRSQVQVSLEDYINDRQYDSRGRFGELLLLLPTLQSITWQMIEQIQFIKL
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB9 FGMAKIDNLLQEMLLGGSPSDAPHAHHPLHPHLMQEHMGTNVIVANTMPTHLSNGQMCEW
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::::   
CCDS46 FGMAKIDNLLQEMLLGGSPSDAPHAHHPLHPHLMQEHMGTNVIVANTMPTHLSNGQM---
              370       380       390       400       410          

              430       440       450       460       470    
pF1KB9 PRPRGQAATPETPQPSPPGGSGSEPYKLLPGAVATIVKPLSAIPQPTITKQEVI
              .::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 -------STPETPQPSPPGGSGSEPYKLLPGAVATIVKPLSAIPQPTITKQEVI
              420       430       440       450       460    

>>CCDS46604.1 HNF4A gene_id:3172|Hs108|chr20              (442 aa)
 initn: 2545 init1: 2545 opt: 2574  Z-score: 2338.8  bits: 442.0 E(32554): 6e-124
Smith-Waterman score: 2823; 97.5% identity (97.7% similar) in 436 aa overlap (39-474:17-442)

       10        20        30        40        50        60        
pF1KB9 DMDMADYSAALDPAYTTLEFENVQVLTMGNDTSPSEGTNLNAPNSLGVSALCAICGDRAT
                                     ::::::::::::::::::::::::::::::
CCDS46               MVSVNAPLGAPVESSYDTSPSEGTNLNAPNSLGVSALCAICGDRAT
                             10        20        30        40      

       70        80        90       100       110       120        
pF1KB9 GKHYGASSCDGCKGFFRRSVRKNHMYSCRFSRQCVVDKDKRNQCRYCRLKKCFRAGMKKE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 GKHYGASSCDGCKGFFRRSVRKNHMYSCRFSRQCVVDKDKRNQCRYCRLKKCFRAGMKKE
         50        60        70        80        90       100      

      130       140       150       160       170       180        
pF1KB9 AVQNERDRISTRRSSYEDSSLPSINALLQAEVLSRQITSPVSGINGDIRAKKIASIADVC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 AVQNERDRISTRRSSYEDSSLPSINALLQAEVLSRQITSPVSGINGDIRAKKIASIADVC
        110       120       130       140       150       160      

      190       200       210       220       230       240        
pF1KB9 ESMKEQLLVLVEWAKYIPAFCELPLDDQVALLRAHAGEHLLLGATKRSMVFKDVLLLGND
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 ESMKEQLLVLVEWAKYIPAFCELPLDDQVALLRAHAGEHLLLGATKRSMVFKDVLLLGND
        170       180       190       200       210       220      

      250       260       270       280       290       300        
pF1KB9 YIVPRHCPELAEMSRVSIRILDELVLPFQELQIDDNEYAYLKAIIFFDPDAKGLSDPGKI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 YIVPRHCPELAEMSRVSIRILDELVLPFQELQIDDNEYAYLKAIIFFDPDAKGLSDPGKI
        230       240       250       260       270       280      

      310       320       330       340       350       360        
pF1KB9 KRLRSQVQVSLEDYINDRQYDSRGRFGELLLLLPTLQSITWQMIEQIQFIKLFGMAKIDN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 KRLRSQVQVSLEDYINDRQYDSRGRFGELLLLLPTLQSITWQMIEQIQFIKLFGMAKIDN
        290       300       310       320       330       340      

      370       380       390       400       410       420        
pF1KB9 LLQEMLLGGSPSDAPHAHHPLHPHLMQEHMGTNVIVANTMPTHLSNGQMCEWPRPRGQAA
       :::::::::::::::::::::::::::::::::::::::::::::::::          .
CCDS46 LLQEMLLGGSPSDAPHAHHPLHPHLMQEHMGTNVIVANTMPTHLSNGQM----------S
        350       360       370       380       390                

      430       440       450       460       470    
pF1KB9 TPETPQPSPPGGSGSEPYKLLPGAVATIVKPLSAIPQPTITKQEVI
       ::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 TPETPQPSPPGGSGSEPYKLLPGAVATIVKPLSAIPQPTITKQEVI
        400       410       420       430       440  

>>CCDS13331.1 HNF4A gene_id:3172|Hs108|chr20              (417 aa)
 initn: 2495 init1: 2495 opt: 2497  Z-score: 2269.4  bits: 429.1 E(32554): 4.3e-120
Smith-Waterman score: 2501; 95.3% identity (95.8% similar) in 403 aa overlap (1-389:1-403)

               10        20        30        40        50        60
pF1KB9 MRLSKTLVDMDMADYSAALDPAYTTLEFENVQVLTMGNDTSPSEGTNLNAPNSLGVSALC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MRLSKTLVDMDMADYSAALDPAYTTLEFENVQVLTMGNDTSPSEGTNLNAPNSLGVSALC
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 AICGDRATGKHYGASSCDGCKGFFRRSVRKNHMYSCRFSRQCVVDKDKRNQCRYCRLKKC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 AICGDRATGKHYGASSCDGCKGFFRRSVRKNHMYSCRFSRQCVVDKDKRNQCRYCRLKKC
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 FRAGMKKEAVQNERDRISTRRSSYEDSSLPSINALLQAEVLSRQITSPVSGINGDIRAKK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 FRAGMKKEAVQNERDRISTRRSSYEDSSLPSINALLQAEVLSRQITSPVSGINGDIRAKK
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB9 IASIADVCESMKEQLLVLVEWAKYIPAFCELPLDDQVALLRAHAGEHLLLGATKRSMVFK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 IASIADVCESMKEQLLVLVEWAKYIPAFCELPLDDQVALLRAHAGEHLLLGATKRSMVFK
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB9 DVLLLGNDYIVPRHCPELAEMSRVSIRILDELVLPFQELQIDDNEYAYLKAIIFFDPDAK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 DVLLLGNDYIVPRHCPELAEMSRVSIRILDELVLPFQELQIDDNEYAYLKAIIFFDPDAK
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB9 GLSDPGKIKRLRSQVQVSLEDYINDRQYDSRGRFGELLLLLPTLQSITWQMIEQIQFIKL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 GLSDPGKIKRLRSQVQVSLEDYINDRQYDSRGRFGELLLLLPTLQSITWQMIEQIQFIKL
              310       320       330       340       350       360

              370                    380        390       400      
pF1KB9 FGMAKIDNLLQEMLLGG-------------SPSDAPHA-HHPLHPHLMQEHMGTNVIVAN
       :::::::::::::::::             ::.: ::.   ::                 
CCDS13 FGMAKIDNLLQEMLLGGPCQAQEGRGWSGDSPGDRPHTVSSPLSSLASPLCRFGQVA   
              370       380       390       400       410          

        410       420       430       440       450       460      
pF1KB9 TMPTHLSNGQMCEWPRPRGQAATPETPQPSPPGGSGSEPYKLLPGAVATIVKPLSAIPQP

>>CCDS68131.1 HNF4A gene_id:3172|Hs108|chr20              (395 aa)
 initn: 2257 init1: 2257 opt: 2259  Z-score: 2054.3  bits: 389.2 E(32554): 4.2e-108
Smith-Waterman score: 2263; 94.8% identity (95.3% similar) in 365 aa overlap (39-389:17-381)

       10        20        30        40        50        60        
pF1KB9 DMDMADYSAALDPAYTTLEFENVQVLTMGNDTSPSEGTNLNAPNSLGVSALCAICGDRAT
                                     ::::::::::::::::::::::::::::::
CCDS68               MVSVNAPLGAPVESSYDTSPSEGTNLNAPNSLGVSALCAICGDRAT
                             10        20        30        40      

       70        80        90       100       110       120        
pF1KB9 GKHYGASSCDGCKGFFRRSVRKNHMYSCRFSRQCVVDKDKRNQCRYCRLKKCFRAGMKKE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS68 GKHYGASSCDGCKGFFRRSVRKNHMYSCRFSRQCVVDKDKRNQCRYCRLKKCFRAGMKKE
         50        60        70        80        90       100      

      130       140       150       160       170       180        
pF1KB9 AVQNERDRISTRRSSYEDSSLPSINALLQAEVLSRQITSPVSGINGDIRAKKIASIADVC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS68 AVQNERDRISTRRSSYEDSSLPSINALLQAEVLSRQITSPVSGINGDIRAKKIASIADVC
        110       120       130       140       150       160      

      190       200       210       220       230       240        
pF1KB9 ESMKEQLLVLVEWAKYIPAFCELPLDDQVALLRAHAGEHLLLGATKRSMVFKDVLLLGND
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS68 ESMKEQLLVLVEWAKYIPAFCELPLDDQVALLRAHAGEHLLLGATKRSMVFKDVLLLGND
        170       180       190       200       210       220      

      250       260       270       280       290       300        
pF1KB9 YIVPRHCPELAEMSRVSIRILDELVLPFQELQIDDNEYAYLKAIIFFDPDAKGLSDPGKI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS68 YIVPRHCPELAEMSRVSIRILDELVLPFQELQIDDNEYAYLKAIIFFDPDAKGLSDPGKI
        230       240       250       260       270       280      

      310       320       330       340       350       360        
pF1KB9 KRLRSQVQVSLEDYINDRQYDSRGRFGELLLLLPTLQSITWQMIEQIQFIKLFGMAKIDN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS68 KRLRSQVQVSLEDYINDRQYDSRGRFGELLLLLPTLQSITWQMIEQIQFIKLFGMAKIDN
        290       300       310       320       330       340      

      370                    380        390       400       410    
pF1KB9 LLQEMLLGG-------------SPSDAPHA-HHPLHPHLMQEHMGTNVIVANTMPTHLSN
       :::::::::             ::.: ::.   ::                         
CCDS68 LLQEMLLGGPCQAQEGRGWSGDSPGDRPHTVSSPLSSLASPLCRFGQVA           
        350       360       370       380       390                

          420       430       440       450       460       470    
pF1KB9 GQMCEWPRPRGQAATPETPQPSPPGGSGSEPYKLLPGAVATIVKPLSAIPQPTITKQEVI

>>CCDS6220.2 HNF4G gene_id:3174|Hs108|chr8                (445 aa)
 initn: 2028 init1: 1163 opt: 2080  Z-score: 1891.5  bits: 359.3 E(32554): 4.9e-99
Smith-Waterman score: 2153; 71.8% identity (86.8% similar) in 447 aa overlap (10-456:1-434)

               10        20        30        40        50        60
pF1KB9 MRLSKTLVDMDMADYSAALDPAYTTLEFENVQVLTMGNDTSPSEGTNLNAPNSLGVSALC
                ::::.:: .:::.:::::::..:.:  ..:.:  : :..:. .. ::. ::
CCDS62          MDMANYSEVLDPTYTTLEFETMQILYNSSDSSAPE-TSMNTTDN-GVNCLC
                        10        20        30         40          

               70        80        90       100       110       120
pF1KB9 AICGDRATGKHYGASSCDGCKGFFRRSVRKNHMYSCRFSRQCVVDKDKRNQCRYCRLKKC
       :::::::::::::::::::::::::::.::.:.::::::::::::::::::::::::.::
CCDS62 AICGDRATGKHYGASSCDGCKGFFRRSIRKSHVYSCRFSRQCVVDKDKRNQCRYCRLRKC
      50        60        70        80        90       100         

              130       140       150       160       170       180
pF1KB9 FRAGMKKEAVQNERDRISTRRSSYEDSSLPSINALLQAEVLSRQITSPVSGINGDIRAKK
       ::::::::::::::::::::::... :..::::.: :::: ::::.    : . :: .::
CCDS62 FRAGMKKEAVQNERDRISTRRSTFDGSNIPSINTLAQAEVRSRQISVSSPGSSTDINVKK
     110       120       130       140       150       160         

              190       200       210       220       230       240
pF1KB9 IASIADVCESMKEQLLVLVEWAKYIPAFCELPLDDQVALLRAHAGEHLLLGATKRSMVFK
       ::::.:::::::.::::::::::::::::::::::::::::::::::::::::::::..:
CCDS62 IASIGDVCESMKQQLLVLVEWAKYIPAFCELPLDDQVALLRAHAGEHLLLGATKRSMMYK
     170       180       190       200       210       220         

              250       260       270       280       290       300
pF1KB9 DVLLLGNDYIVPRHCPELAEMSRVSIRILDELVLPFQELQIDDNEYAYLKAIIFFDPDAK
       :.:::::.:.. :.  :. :.:::. :.::::: ::::.:::::::: ::::.:::::::
CCDS62 DILLLGNNYVIHRNSCEV-EISRVANRVLDELVRPFQEIQIDDNEYACLKAIVFFDPDAK
     230       240        250       260       270       280        

              310       320       330       340       350       360
pF1KB9 GLSDPGKIKRLRSQVQVSLEDYINDRQYDSRGRFGELLLLLPTLQSITWQMIEQIQFIKL
       ::::: ::: .: :::..:::::::::::::::::::::::::::::::::::::::.::
CCDS62 GLSDPVKIKNMRFQVQIGLEDYINDRQYDSRGRFGELLLLLPTLQSITWQMIEQIQFVKL
      290       300       310       320       330       340        

              370       380       390       400       410       420
pF1KB9 FGMAKIDNLLQEMLLGGSPSDAPHAHHPLHPHLMQEHMGTNVIVANTMPTHLSNGQMCEW
       :::.:::::::::::::. .:. : :::.:::: :. .  ..:. . : : .   :.   
CCDS62 FGMVKIDNLLQEMLLGGASNDGSHLHHPMHPHLSQDPLTGQTILLGPMSTLVHADQI---
      350       360       370       380       390       400        

              430       440       450       460       470    
pF1KB9 PRPRGQAATPETPQPSPPGGSGSEPYKLLPGAVATIVKPLSAIPQPTITKQEVI
              .::::: :::: :::.: ::.  . ...:                  
CCDS62 -------STPETPLPSPPQGSGQEQYKIAANQASVISHQHLSKQKQL       
                410       420       430       440            

>>CCDS83303.1 HNF4G gene_id:3174|Hs108|chr8               (408 aa)
 initn: 2028 init1: 1163 opt: 1946  Z-score: 1770.8  bits: 336.8 E(32554): 2.6e-92
Smith-Waterman score: 2019; 74.1% identity (87.3% similar) in 402 aa overlap (55-456:7-397)

           30        40        50        60        70        80    
pF1KB9 TLEFENVQVLTMGNDTSPSEGTNLNAPNSLGVSALCAICGDRATGKHYGASSCDGCKGFF
                                     ::. ::::::::::::::::::::::::::
CCDS83                         MNTTDNGVNCLCAICGDRATGKHYGASSCDGCKGFF
                                       10        20        30      

           90       100       110       120       130       140    
pF1KB9 RRSVRKNHMYSCRFSRQCVVDKDKRNQCRYCRLKKCFRAGMKKEAVQNERDRISTRRSSY
       :::.::.:.::::::::::::::::::::::::.::::::::::::::::::::::::..
CCDS83 RRSIRKSHVYSCRFSRQCVVDKDKRNQCRYCRLRKCFRAGMKKEAVQNERDRISTRRSTF
         40        50        60        70        80        90      

          150       160       170       180       190       200    
pF1KB9 EDSSLPSINALLQAEVLSRQITSPVSGINGDIRAKKIASIADVCESMKEQLLVLVEWAKY
       . :..::::.: :::: ::::.    : . :: .::::::.:::::::.:::::::::::
CCDS83 DGSNIPSINTLAQAEVRSRQISVSSPGSSTDINVKKIASIGDVCESMKQQLLVLVEWAKY
        100       110       120       130       140       150      

          210       220       230       240       250       260    
pF1KB9 IPAFCELPLDDQVALLRAHAGEHLLLGATKRSMVFKDVLLLGNDYIVPRHCPELAEMSRV
       :::::::::::::::::::::::::::::::::..::.:::::.:.. :.  :. :.:::
CCDS83 IPAFCELPLDDQVALLRAHAGEHLLLGATKRSMMYKDILLLGNNYVIHRNSCEV-EISRV
        160       170       180       190       200       210      

          270       280       290       300       310       320    
pF1KB9 SIRILDELVLPFQELQIDDNEYAYLKAIIFFDPDAKGLSDPGKIKRLRSQVQVSLEDYIN
       . :.::::: ::::.:::::::: ::::.:::::::::::: ::: .: :::..::::::
CCDS83 ANRVLDELVRPFQEIQIDDNEYACLKAIVFFDPDAKGLSDPVKIKNMRFQVQIGLEDYIN
         220       230       240       250       260       270     

          330       340       350       360       370       380    
pF1KB9 DRQYDSRGRFGELLLLLPTLQSITWQMIEQIQFIKLFGMAKIDNLLQEMLLGGSPSDAPH
       :::::::::::::::::::::::::::::::::.:::::.:::::::::::::. .:. :
CCDS83 DRQYDSRGRFGELLLLLPTLQSITWQMIEQIQFVKLFGMVKIDNLLQEMLLGGASNDGSH
         280       290       300       310       320       330     

          390       400       410       420       430       440    
pF1KB9 AHHPLHPHLMQEHMGTNVIVANTMPTHLSNGQMCEWPRPRGQAATPETPQPSPPGGSGSE
        :::.:::: :. .  ..:. . : : .   :.          .::::: :::: :::.:
CCDS83 LHHPMHPHLSQDPLTGQTILLGPMSTLVHADQI----------STPETPLPSPPQGSGQE
         340       350       360                 370       380     

          450       460       470    
pF1KB9 PYKLLPGAVATIVKPLSAIPQPTITKQEVI
        ::.  . ...:                  
CCDS83 QYKIAANQASVISHQHLSKQKQL       
         390       400               

>>CCDS35172.1 RXRA gene_id:6256|Hs108|chr9                (462 aa)
 initn: 910 init1: 515 opt: 907  Z-score: 829.4  bits: 162.8 E(32554): 7.1e-40
Smith-Waterman score: 909; 41.0% identity (70.5% similar) in 366 aa overlap (34-384:101-459)

            10        20        30        40          50           
pF1KB9 SKTLVDMDMADYSAALDPAYTTLEFENVQVLTMGNDTSPSEGTN--LNAP-NSLGVSA--
                                     .. ..: .:  : :  :..: .  :  :  
CCDS35 PMGPHSMSVPTTPTLGFSTGSPQLSSPMNPVSSSEDIKPPLGLNGVLKVPAHPSGNMASF
               80        90       100       110       120       130

          60        70        80        90       100       110     
pF1KB9 ---LCAICGDRATGKHYGASSCDGCKGFFRRSVRKNHMYSCRFSRQCVVDKDKRNQCRYC
          .:::::::..:::::. ::.::::::.:.:::.  :.:: ...:..:: .::.:.::
CCDS35 TKHICAICGDRSSGKHYGVYSCEGCKGFFKRTVRKDLTYTCRDNKDCLIDKRQRNRCQYC
              140       150       160       170       180       190

         120       130       140             150       160         
pF1KB9 RLKKCFRAGMKKEAVQNERDRISTRR------SSYEDSSLPSINALLQAEVLSRQITSPV
       : .::.  :::.::::.::.: . :       .:  . ..: .. .:.::.  .  :   
CCDS35 RYQKCLAMGMKREAVQEERQRGKDRNENEVESTSSANEDMP-VERILEAELAVEPKTETY
              200       210       220       230        240         

     170       180        190       200       210       220        
pF1KB9 SGINGDIRAKKIAS-IADVCESMKEQLLVLVEWAKYIPAFCELPLDDQVALLRAHAGEHL
          :  .  ..  . ....:..  .::..:::::: :: : :::::::: ::::  .: :
CCDS35 VEANMGLNPSSPNDPVTNICQAADKQLFTLVEWAKRIPHFSELPLDDQVILLRAGWNELL
     250       260       270       280       290       300         

      230       240       250       260       270       280        
pF1KB9 LLGATKRSMVFKDVLLLGNDYIVPRHCPELAEMSRVSIRILDELVLPFQELQIDDNEYAY
       . . ..::.. :: .::..   : :.  . : .. .  :.: :::  ....:.: .: . 
CCDS35 IASFSHRSIAVKDGILLATGLHVHRNSAHSAGVGAIFDRVLTELVSKMRDMQMDKTELGC
     310       320       330       340       350       360         

      290       300       310       320       330       340        
pF1KB9 LKAIIFFDPDAKGLSDPGKIKRLRSQVQVSLEDYINDRQYDSRGRFGELLLLLPTLQSIT
       :.::..:.::.::::.:.... :: .: .::: : . .  .. :::..::: ::.:.:: 
CCDS35 LRAIVLFNPDSKGLSNPAEVEALREKVYASLEAYCKHKYPEQPGRFAKLLLRLPALRSIG
     370       380       390       400       410       420         

      350       360       370       380       390       400        
pF1KB9 WQMIEQIQFIKLFGMAKIDNLLQEMLLGGSPSDAPHAHHPLHPHLMQEHMGTNVIVANTM
        . .:.. :.::.: . ::..:.:::      .:::                        
CCDS35 LKCLEHLFFFKLIGDTPIDTFLMEML------EAPHQMT                     
     430       440       450             460                       

      410       420       430       440       450       460        
pF1KB9 PTHLSNGQMCEWPRPRGQAATPETPQPSPPGGSGSEPYKLLPGAVATIVKPLSAIPQPTI




474 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov  4 18:34:31 2016 done: Fri Nov  4 18:34:32 2016
 Total Scan time:  3.410 Total Display time:  0.090

Function used was FASTA [36.3.4 Apr, 2011]