Result of FASTA (omim) for pFN21AB7841
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7841, 504 aa
  1>>>pF1KB7841 504 - 504 aa - 504 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 7.3728+/-0.000392; mu= 9.2007+/- 0.025
 mean_var=128.0934+/-25.715, 0's: 0 Z-trim(115.8): 28  B-trim: 293 in 1/57
 Lambda= 0.113321
 statistics sampled from 26479 (26507) to 26479 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.674), E-opt: 0.2 (0.311), width:  16
 Scan time:  9.040

The best scores are:                                      opt bits E(85289)
NP_001121632 (OMIM: 609784) upstream-binding prote ( 504) 3376 563.4 5.4e-160
NP_005644 (OMIM: 189889) alpha-globin transcriptio ( 502) 2517 423.0  1e-117
NP_001166923 (OMIM: 189889) alpha-globin transcrip ( 501) 2500 420.2  7e-117
NP_001121633 (OMIM: 609784) upstream-binding prote ( 540) 1867 316.7 1.1e-85
NP_055332 (OMIM: 609784) upstream-binding protein  ( 540) 1867 316.7 1.1e-85
XP_016859391 (OMIM: 609785) PREDICTED: transcripti ( 406) 1863 316.0 1.3e-85
NP_055368 (OMIM: 609785) transcription factor CP2- ( 479) 1854 314.6 4.2e-85
XP_016859393 (OMIM: 609785) PREDICTED: transcripti ( 273) 1399 240.0 6.4e-63
XP_016859394 (OMIM: 609785) PREDICTED: transcripti ( 273) 1391 238.7 1.6e-62
NP_001166924 (OMIM: 189889) alpha-globin transcrip ( 450) 1099 191.1 5.7e-48
XP_016859392 (OMIM: 609785) PREDICTED: transcripti ( 274)  790 140.5 6.1e-33
XP_016859390 (OMIM: 609786) PREDICTED: grainyhead- ( 467)  385 74.4 8.1e-13
XP_005246216 (OMIM: 609786) PREDICTED: grainyhead- ( 429)  383 74.0 9.5e-13
NP_937825 (OMIM: 609786) grainyhead-like protein 1 ( 618)  385 74.5   1e-12
XP_016859389 (OMIM: 609786) PREDICTED: grainyhead- ( 479)  360 70.3 1.4e-11
XP_006711947 (OMIM: 609786) PREDICTED: grainyhead- ( 441)  358 70.0 1.6e-11
XP_011508645 (OMIM: 609786) PREDICTED: grainyhead- ( 583)  360 70.4 1.7e-11
XP_006711945 (OMIM: 609786) PREDICTED: grainyhead- ( 630)  360 70.4 1.8e-11
XP_011540172 (OMIM: 606713,608317) PREDICTED: grai ( 509)  351 68.9 4.1e-11
NP_001181939 (OMIM: 606713,608317) grainyhead-like ( 556)  351 68.9 4.4e-11
XP_011540171 (OMIM: 606713,608317) PREDICTED: grai ( 556)  351 68.9 4.4e-11
NP_937816 (OMIM: 606713,608317) grainyhead-like pr ( 602)  351 68.9 4.7e-11
NP_067003 (OMIM: 606713,608317) grainyhead-like pr ( 607)  351 68.9 4.7e-11
NP_937817 (OMIM: 606713,608317) grainyhead-like pr ( 626)  351 68.9 4.9e-11
XP_011515609 (OMIM: 608576,608641,616029) PREDICTE ( 591)  343 67.6 1.1e-10
NP_001317522 (OMIM: 608576,608641,616029) grainyhe ( 609)  343 67.6 1.2e-10
XP_011515608 (OMIM: 608576,608641,616029) PREDICTE ( 609)  343 67.6 1.2e-10
NP_079191 (OMIM: 608576,608641,616029) grainyhead- ( 625)  343 67.6 1.2e-10


>>NP_001121632 (OMIM: 609784) upstream-binding protein 1  (504 aa)
 initn: 3376 init1: 3376 opt: 3376  Z-score: 2993.3  bits: 563.4 E(85289): 5.4e-160
Smith-Waterman score: 3376; 100.0% identity (100.0% similar) in 504 aa overlap (1-504:1-504)

               10        20        30        40        50        60
pF1KB7 MAWVLKMDEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDGETE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MAWVLKMDEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDGETE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 HPPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSIIRVV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 HPPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSIIRVV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 FHDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPAKRT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 FHDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPAKRT
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 SAFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKPKGA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SAFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKPKGA
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB7 DRKQKTDREKMEKRTAHEKEKYQPSYDTTILTECSPWPDAPTAYVNNSPSPAPTFTSPQQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 DRKQKTDREKMEKRTAHEKEKYQPSYDTTILTECSPWPDAPTAYVNNSPSPAPTFTSPQQ
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB7 STCSVPDSNSSSPNHQGDGASQTSGEQIQPSATIQETQQWLLKNRFSSYTRLFSNFSGAD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 STCSVPDSNSSSPNHQGDGASQTSGEQIQPSATIQETQQWLLKNRFSSYTRLFSNFSGAD
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB7 LLKLTKEDLVQICGAADGIRLYNSLKSRSVRPRLTIYVCREQPSSTVLQGQQQAASSASE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 LLKLTKEDLVQICGAADGIRLYNSLKSRSVRPRLTIYVCREQPSSTVLQGQQQAASSASE
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KB7 NGSGAPYVYHAIYLEEMIASEVARKLALVFNIPLHQINQVYRQGPTGIHILVSDQMVQNF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 NGSGAPYVYHAIYLEEMIASEVARKLALVFNIPLHQINQVYRQGPTGIHILVSDQMVQNF
              430       440       450       460       470       480

              490       500    
pF1KB7 QDESCFLFSTVKAESSDGIHIILK
       ::::::::::::::::::::::::
NP_001 QDESCFLFSTVKAESSDGIHIILK
              490       500    

>>NP_005644 (OMIM: 189889) alpha-globin transcription fa  (502 aa)
 initn: 2124 init1: 1693 opt: 2517  Z-score: 2234.4  bits: 423.0 E(85289): 1e-117
Smith-Waterman score: 2517; 73.3% identity (90.4% similar) in 509 aa overlap (1-504:1-502)

                  10        20        30        40        50       
pF1KB7 MAWVLKM---DEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDG
       :::.::.   :::::::::.::::::::::::::::::::::::::::::::.:::: :.
NP_005 MAWALKLPLADEVIESGLVQDFDASLSGIGQELGAGAYSMSDVLALPIFKQEESSLPPDN
               10        20        30        40        50        60

        60        70        80        90       100       110       
pF1KB7 ETEHPPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSII
       :..  :::::.:::::::::::::::::::::::::::::::::.:..:::::::::::.
NP_005 ENKILPFQYVLCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKLGELPEINGKLVKSIF
               70        80        90       100       110       120

       120       130       140       150       160       170       
pF1KB7 RVVFHDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPA
       ::::::::::::::::::::.:::::::.::.:::::::::: :.::.:::.::::::::
NP_005 RVVFHDRRLQYTEHQQLEGWRWNRPGDRILDIDIPMSVGIIDPRANPTQLNTVEFLWDPA
              130       140       150       160       170       180

       180       190       200       210       220       230       
pF1KB7 KRTSAFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKP
       ::::.:::::::::::: ::::::::::::.:.::::.:::::::.::::::::::::::
NP_005 KRTSVFIQVHCISTEFTMRKHGGEKGVPFRVQIDTFKENENGEYTEHLHSASCQIKVFKP
              190       200       210       220       230       240

       240       250       260       270       280       290       
pF1KB7 KGADRKQKTDREKMEKRTAHEKEKYQPSYDTTILTECSPWPDAPTAYVNNSPSPAPTFTS
       :::::::::::::::::: ::::::::::.:::::::::::.   .::::::::.  :.:
NP_005 KGADRKQKTDREKMEKRTPHEKEKYQPSYETTILTECSPWPE--ITYVNNSPSPG--FNS
              250       260       270       280         290        

       300       310       320       330       340       350       
pF1KB7 PQQSTCSVPDSNSSSPNHQGDGASQTSGEQIQPSATIQETQQWLLKNRFSSYTRLFSNFS
        ..:. :. ..:.: :::: .    .. ... :..: ::.:::: .::::..::::.:::
NP_005 -SHSSFSLGEGNGS-PNHQPEPPPPVT-DNLLPTTTPQEAQQWLHRNRFSTFTRLFTNFS
         300        310       320        330       340       350   

       360       370       380       390       400       410       
pF1KB7 GADLLKLTKEDLVQICGAADGIRLYNSLKSRSVRPRLTIYVCREQPSSTVLQGQQQAASS
       ::::::::..:..:::: ::::::.:.::.: ::::::::::.:. .    : :::  ..
NP_005 GADLLKLTRDDVIQICGPADGIRLFNALKGRMVRPRLTIYVCQESLQLREQQQQQQQQQQ
           360       370       380       390       400       410   

       420         430       440       450       460       470     
pF1KB7 ASENG--SGAPYVYHAIYLEEMIASEVARKLALVFNIPLHQINQVYRQGPTGIHILVSDQ
         :.:  .:. .:::::::::. : :...:.: .:.:   ::.:.:.:::::::.:.::.
NP_005 KHEDGDSNGTFFVYHAIYLEELTAVELTEKIAQLFSISPCQISQIYKQGPTGIHVLISDE
           420       430       440       450       460       470   

         480       490       500    
pF1KB7 MVQNFQDESCFLFSTVKAESSDGIHIILK
       :.::::.:.::...:.:::..:. :::::
NP_005 MIQNFQEEACFILDTMKAETNDSYHIILK
           480       490       500  

>>NP_001166923 (OMIM: 189889) alpha-globin transcription  (501 aa)
 initn: 2457 init1: 1693 opt: 2500  Z-score: 2219.4  bits: 420.2 E(85289): 7e-117
Smith-Waterman score: 2500; 73.1% identity (90.2% similar) in 509 aa overlap (1-504:1-501)

                  10        20        30        40        50       
pF1KB7 MAWVLKM---DEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDG
       :::.::.   :::::::::.::::::::::::::::::::::::::::::::.:::: :.
NP_001 MAWALKLPLADEVIESGLVQDFDASLSGIGQELGAGAYSMSDVLALPIFKQEESSLPPDN
               10        20        30        40        50        60

        60        70        80        90       100       110       
pF1KB7 ETEHPPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSII
       :..  :::::.:::::::::::::::::::::::::::::::::.:..:::::::::::.
NP_001 ENKILPFQYVLCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKLGELPEINGKLVKSIF
               70        80        90       100       110       120

       120       130       140       150       160       170       
pF1KB7 RVVFHDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPA
       ::::::::::::::::::::.:::::::.::.:::::::::: :.::.:::.::::::::
NP_001 RVVFHDRRLQYTEHQQLEGWRWNRPGDRILDIDIPMSVGIIDPRANPTQLNTVEFLWDPA
              130       140       150       160       170       180

       180       190       200       210       220       230       
pF1KB7 KRTSAFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKP
       ::::.:::::::::::: ::::::::::::.:.::::.:::::::.::::::::::::::
NP_001 KRTSVFIQVHCISTEFTMRKHGGEKGVPFRVQIDTFKENENGEYTEHLHSASCQIKVFKP
              190       200       210       220       230       240

       240       250       260       270       280       290       
pF1KB7 KGADRKQKTDREKMEKRTAHEKEKYQPSYDTTILTECSPWPDAPTAYVNNSPSPAPTFTS
       :::::::::::::::::: ::::::::::.:::::::::::.   .::::::::.  :.:
NP_001 KGADRKQKTDREKMEKRTPHEKEKYQPSYETTILTECSPWPE--ITYVNNSPSPG--FNS
              250       260       270       280         290        

       300       310       320       330       340       350       
pF1KB7 PQQSTCSVPDSNSSSPNHQGDGASQTSGEQIQPSATIQETQQWLLKNRFSSYTRLFSNFS
        ..:. :. ..:.: :::: .    .. ... :..: ::.:::: .::::..::::.:::
NP_001 -SHSSFSLGEGNGS-PNHQPEPPPPVT-DNLLPTTTPQEAQQWLHRNRFSTFTRLFTNFS
         300        310       320        330       340       350   

       360       370       380       390       400       410       
pF1KB7 GADLLKLTKEDLVQICGAADGIRLYNSLKSRSVRPRLTIYVCREQPSSTVLQGQQQAASS
       ::::::::..:..:::: ::::::.:.::.: ::::::::::.:. .    : :::  ..
NP_001 GADLLKLTRDDVIQICGPADGIRLFNALKGRMVRPRLTIYVCQESLQLREQQQQQQQQQQ
           360       370       380       390       400       410   

       420         430       440       450       460       470     
pF1KB7 ASENG--SGAPYVYHAIYLEEMIASEVARKLALVFNIPLHQINQVYRQGPTGIHILVSDQ
         :.:  .:. .:::::::::. : :...:.: .:.:   ::.:.:.:::::::.:.::.
NP_001 KHEDGDSNGTFFVYHAIYLEELTAVELTEKIAQLFSISPCQISQIYKQGPTGIHVLISDE
           420       430       440       450       460       470   

         480       490       500    
pF1KB7 MVQNFQDESCFLFSTVKAESSDGIHIILK
       :.::::.:.::...:.: :..:. :::::
NP_001 MIQNFQEEACFILDTMK-ETNDSYHIILK
           480       490        500 

>>NP_001121633 (OMIM: 609784) upstream-binding protein 1  (540 aa)
 initn: 3362 init1: 1847 opt: 1867  Z-score: 1659.6  bits: 316.7 E(85289): 1.1e-85
Smith-Waterman score: 3273; 93.3% identity (93.3% similar) in 536 aa overlap (1-500:1-536)

               10        20        30        40        50        60
pF1KB7 MAWVLKMDEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDGETE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MAWVLKMDEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDGETE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 HPPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSIIRVV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 HPPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSIIRVV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 FHDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPAKRT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 FHDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPAKRT
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 SAFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKPKGA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SAFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKPKGA
              190       200       210       220       230       240

              250       260       270                              
pF1KB7 DRKQKTDREKMEKRTAHEKEKYQPSYDTTILTE---------------------------
       :::::::::::::::::::::::::::::::::                           
NP_001 DRKQKTDREKMEKRTAHEKEKYQPSYDTTILTEMRLEPIIEDAVEHEQKKSSKRTLPADY
              250       260       270       280       290       300

                    280       290       300       310       320    
pF1KB7 ---------CSPWPDAPTAYVNNSPSPAPTFTSPQQSTCSVPDSNSSSPNHQGDGASQTS
                :::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GDSLAKRGSCSPWPDAPTAYVNNSPSPAPTFTSPQQSTCSVPDSNSSSPNHQGDGASQTS
              310       320       330       340       350       360

          330       340       350       360       370       380    
pF1KB7 GEQIQPSATIQETQQWLLKNRFSSYTRLFSNFSGADLLKLTKEDLVQICGAADGIRLYNS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GEQIQPSATIQETQQWLLKNRFSSYTRLFSNFSGADLLKLTKEDLVQICGAADGIRLYNS
              370       380       390       400       410       420

          390       400       410       420       430       440    
pF1KB7 LKSRSVRPRLTIYVCREQPSSTVLQGQQQAASSASENGSGAPYVYHAIYLEEMIASEVAR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 LKSRSVRPRLTIYVCREQPSSTVLQGQQQAASSASENGSGAPYVYHAIYLEEMIASEVAR
              430       440       450       460       470       480

          450       460       470       480       490       500    
pF1KB7 KLALVFNIPLHQINQVYRQGPTGIHILVSDQMVQNFQDESCFLFSTVKAESSDGIHIILK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::    
NP_001 KLALVFNIPLHQINQVYRQGPTGIHILVSDQMVQNFQDESCFLFSTVKAESSDGIHIILK
              490       500       510       520       530       540

>>NP_055332 (OMIM: 609784) upstream-binding protein 1 is  (540 aa)
 initn: 3362 init1: 1847 opt: 1867  Z-score: 1659.6  bits: 316.7 E(85289): 1.1e-85
Smith-Waterman score: 3273; 93.3% identity (93.3% similar) in 536 aa overlap (1-500:1-536)

               10        20        30        40        50        60
pF1KB7 MAWVLKMDEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDGETE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 MAWVLKMDEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDGETE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 HPPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSIIRVV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 HPPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSIIRVV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 FHDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPAKRT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 FHDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPAKRT
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 SAFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKPKGA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 SAFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKPKGA
              190       200       210       220       230       240

              250       260       270                              
pF1KB7 DRKQKTDREKMEKRTAHEKEKYQPSYDTTILTE---------------------------
       :::::::::::::::::::::::::::::::::                           
NP_055 DRKQKTDREKMEKRTAHEKEKYQPSYDTTILTEMRLEPIIEDAVEHEQKKSSKRTLPADY
              250       260       270       280       290       300

                    280       290       300       310       320    
pF1KB7 ---------CSPWPDAPTAYVNNSPSPAPTFTSPQQSTCSVPDSNSSSPNHQGDGASQTS
                :::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 GDSLAKRGSCSPWPDAPTAYVNNSPSPAPTFTSPQQSTCSVPDSNSSSPNHQGDGASQTS
              310       320       330       340       350       360

          330       340       350       360       370       380    
pF1KB7 GEQIQPSATIQETQQWLLKNRFSSYTRLFSNFSGADLLKLTKEDLVQICGAADGIRLYNS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 GEQIQPSATIQETQQWLLKNRFSSYTRLFSNFSGADLLKLTKEDLVQICGAADGIRLYNS
              370       380       390       400       410       420

          390       400       410       420       430       440    
pF1KB7 LKSRSVRPRLTIYVCREQPSSTVLQGQQQAASSASENGSGAPYVYHAIYLEEMIASEVAR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 LKSRSVRPRLTIYVCREQPSSTVLQGQQQAASSASENGSGAPYVYHAIYLEEMIASEVAR
              430       440       450       460       470       480

          450       460       470       480       490       500    
pF1KB7 KLALVFNIPLHQINQVYRQGPTGIHILVSDQMVQNFQDESCFLFSTVKAESSDGIHIILK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::    
NP_055 KLALVFNIPLHQINQVYRQGPTGIHILVSDQMVQNFQDESCFLFSTVKAESSDGIHIILK
              490       500       510       520       530       540

>>XP_016859391 (OMIM: 609785) PREDICTED: transcription f  (406 aa)
 initn: 1850 init1: 1446 opt: 1863  Z-score: 1657.9  bits: 316.0 E(85289): 1.3e-85
Smith-Waterman score: 1863; 69.4% identity (90.1% similar) in 395 aa overlap (32-425:16-402)

              10        20        30        40        50        60 
pF1KB7 AWVLKMDEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDGETEH
                                     .:.: . :::::::::::. .:  ..:.. 
XP_016                MLFWHTQPEHYNQHNSGSY-LRDVLALPIFKQEEPQLSPENEARL
                              10         20        30        40    

              70        80        90       100       110       120 
pF1KB7 PPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSIIRVVF
       ::.:::.:::::::::::.:::::::::::::::.:.:::.::. ..: : :::::::::
XP_016 PPLQYVLCAATSPAVKLHEETLTYLNQGQSYEIRLLENRKLGDFQDLNTKYVKSIIRVVF
           50        60        70        80        90       100    

             130       140       150       160       170       180 
pF1KB7 HDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPAKRTS
       ::::::::::::::::.:.:::::.::.:::.::::.: :..:.::::::::::::::.:
XP_016 HDRRLQYTEHQQLEGWRWSRPGDRILDIDIPLSVGILDPRASPTQLNAVEFLWDPAKRAS
          110       120       130       140       150       160    

             190       200       210       220       230       240 
pF1KB7 AFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKPKGAD
       ::::::::::::::::::::::::::.:.::::::::::::.::::::::::::::::::
XP_016 AFIQVHCISTEFTPRKHGGEKGVPFRVQIDTFKQNENGEYTEHLHSASCQIKVFKPKGAD
          170       180       190       200       210       220    

             250       260       270       280        290       300
pF1KB7 RKQKTDREKMEKRTAHEKEKYQPSYDTTILTECSPWPDAPTAY-VNNSPSPAPTFTSPQQ
       :::::::::::::::.:::::::::.::::::::::::.  :: ::..:::. .  ::  
XP_016 RKQKTDREKMEKRTAQEKEKYQPSYETTILTECSPWPDV--AYQVNSAPSPSYN-GSP--
          230       240       250       260         270            

              310       320       330       340       350       360
pF1KB7 STCSVPDSNSSSPNHQGDGASQTSGEQIQPSATIQETQQWLLKNRFSSYTRLFSNFSGAD
       .. .. ..:.: :.:  . :  ...... :::.::..:::: .::::.. :::..:::::
XP_016 NSFGLGEGNAS-PTHPVE-ALPVGSDHLLPSASIQDAQQWLHRNRFSQFCRLFASFSGAD
     280       290         300       310       320       330       

              370       380       390       400       410       420
pF1KB7 LLKLTKEDLVQICGAADGIRLYNSLKSRSVRPRLTIYVCREQPSSTVLQGQQQAASSASE
       :::....::::::: ::::::.:..:.:.:::..:::::.:  .. :   :.. .:. :.
XP_016 LLKMSRDDLVQICGPADGIRLFNAIKGRNVRPKMTIYVCQELEQNRVPLQQKRDGSGDSN
       340       350       360       370       380       390       

              430       440       450       460       470       480
pF1KB7 NGSGAPYVYHAIYLEEMIASEVARKLALVFNIPLHQINQVYRQGPTGIHILVSDQMVQNF
        ..::                                                       
XP_016 LSDGAELPR                                                   
       400                                                         

>>NP_055368 (OMIM: 609785) transcription factor CP2-like  (479 aa)
 initn: 2156 init1: 1446 opt: 1854  Z-score: 1648.9  bits: 314.6 E(85289): 4.2e-85
Smith-Waterman score: 2176; 68.1% identity (89.2% similar) in 474 aa overlap (32-504:16-476)

              10        20        30        40        50        60 
pF1KB7 AWVLKMDEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDGETEH
                                     .:.: . :::::::::::. .:  ..:.. 
NP_055                MLFWHTQPEHYNQHNSGSY-LRDVLALPIFKQEEPQLSPENEARL
                              10         20        30        40    

              70        80        90       100       110       120 
pF1KB7 PPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSIIRVVF
       ::.:::.:::::::::::.:::::::::::::::.:.:::.::. ..: : :::::::::
NP_055 PPLQYVLCAATSPAVKLHEETLTYLNQGQSYEIRLLENRKLGDFQDLNTKYVKSIIRVVF
           50        60        70        80        90       100    

             130       140       150       160       170       180 
pF1KB7 HDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPAKRTS
       ::::::::::::::::.:.:::::.::.:::.::::.: :..:.::::::::::::::.:
NP_055 HDRRLQYTEHQQLEGWRWSRPGDRILDIDIPLSVGILDPRASPTQLNAVEFLWDPAKRAS
          110       120       130       140       150       160    

             190       200       210       220       230       240 
pF1KB7 AFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKPKGAD
       ::::::::::::::::::::::::::.:.::::::::::::.::::::::::::::::::
NP_055 AFIQVHCISTEFTPRKHGGEKGVPFRVQIDTFKQNENGEYTEHLHSASCQIKVFKPKGAD
          170       180       190       200       210       220    

             250       260       270       280        290       300
pF1KB7 RKQKTDREKMEKRTAHEKEKYQPSYDTTILTECSPWPDAPTAY-VNNSPSPAPTFTSPQQ
       :::::::::::::::.:::::::::.::::::::::::  .:: ::..:::. .  ::  
NP_055 RKQKTDREKMEKRTAQEKEKYQPSYETTILTECSPWPD--VAYQVNSAPSPSYN-GSP--
          230       240       250       260         270            

              310       320       330       340       350       360
pF1KB7 STCSVPDSNSSSPNHQGDGASQTSGEQIQPSATIQETQQWLLKNRFSSYTRLFSNFSGAD
       .. .. ..:.: :.:  . :  ...... :::.::..:::: .::::.. :::..:::::
NP_055 NSFGLGEGNAS-PTHPVE-ALPVGSDHLLPSASIQDAQQWLHRNRFSQFCRLFASFSGAD
     280       290         300       310       320       330       

              370       380       390       400       410       420
pF1KB7 LLKLTKEDLVQICGAADGIRLYNSLKSRSVRPRLTIYVCREQPSSTVLQGQQQAASSASE
       :::....::::::: ::::::.:..:.:.:::..:::::.:  .. :   ::.  .:.. 
NP_055 LLKMSRDDLVQICGPADGIRLFNAIKGRNVRPKMTIYVCQELEQNRV-PLQQKRDGSGDS
       340       350       360       370       380        390      

              430       440       450       460       470       480
pF1KB7 NGSGAPYVYHAIYLEEMIASEVARKLALVFNIPLHQINQVYRQGPTGIHILVSDQMVQNF
       : :    :::::.:::. . :. .:.: ...:  ..:..::::::::::..::..:::::
NP_055 NLS----VYHAIFLEELTTLELIEKIANLYSISPQHIHRVYRQGPTGIHVVVSNEMVQNF
            400       410       420       430       440       450  

              490       500       
pF1KB7 QDESCFLFSTVKAESSDGIHIILK   
       ::::::..::.::::.:: :::::   
NP_055 QDESCFVLSTIKAESNDGYHIILKCGL
            460       470         

>>XP_016859393 (OMIM: 609785) PREDICTED: transcription f  (273 aa)
 initn: 1417 init1: 1389 opt: 1399  Z-score: 1250.5  bits: 240.0 E(85289): 6.4e-63
Smith-Waterman score: 1399; 79.3% identity (92.6% similar) in 256 aa overlap (32-279:16-270)

              10        20        30        40        50        60 
pF1KB7 AWVLKMDEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDGETEH
                                     .:.: . :::::::::::. .:  ..:.. 
XP_016                MLFWHTQPEHYNQHNSGSY-LRDVLALPIFKQEEPQLSPENEARL
                              10         20        30        40    

              70        80        90       100       110       120 
pF1KB7 PPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSIIRVVF
       ::.:::.:::::::::::.:::::::::::::::.:.:::.::. ..: : :::::::::
XP_016 PPLQYVLCAATSPAVKLHEETLTYLNQGQSYEIRLLENRKLGDFQDLNTKYVKSIIRVVF
           50        60        70        80        90       100    

             130       140       150       160       170       180 
pF1KB7 HDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPAKRTS
       ::::::::::::::::.:.:::::.::.:::.::::.: :..:.::::::::::::::.:
XP_016 HDRRLQYTEHQQLEGWRWSRPGDRILDIDIPLSVGILDPRASPTQLNAVEFLWDPAKRAS
          110       120       130       140       150       160    

             190       200       210       220       230       240 
pF1KB7 AFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKPKGAD
       ::::::::::::::::::::::::::.:.::::::::::::.::::::::::::::::::
XP_016 AFIQVHCISTEFTPRKHGGEKGVPFRVQIDTFKQNENGEYTEHLHSASCQIKVFKPKGAD
          170       180       190       200       210       220    

             250       260       270               280       290   
pF1KB7 RKQKTDREKMEKRTAHEKEKYQPSYDTTILTE----CS----PWPDAPTAYVNNSPSPAP
       :::::::::::::::.:::::::::.::::::    ::    :: .              
XP_016 RKQKTDREKMEKRTAQEKEKYQPSYETTILTEPQPLCSSADCPWEERSW           
          230       240       250       260       270              

           300       310       320       330       340       350   
pF1KB7 TFTSPQQSTCSVPDSNSSSPNHQGDGASQTSGEQIQPSATIQETQQWLLKNRFSSYTRLF

>>XP_016859394 (OMIM: 609785) PREDICTED: transcription f  (273 aa)
 initn: 1385 init1: 1385 opt: 1391  Z-score: 1243.5  bits: 238.7 E(85289): 1.6e-62
Smith-Waterman score: 1391; 82.2% identity (95.9% similar) in 242 aa overlap (32-273:16-256)

              10        20        30        40        50        60 
pF1KB7 AWVLKMDEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDGETEH
                                     .:.: . :::::::::::. .:  ..:.. 
XP_016                MLFWHTQPEHYNQHNSGSY-LRDVLALPIFKQEEPQLSPENEARL
                              10         20        30        40    

              70        80        90       100       110       120 
pF1KB7 PPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSIIRVVF
       ::.:::.:::::::::::.:::::::::::::::.:.:::.::. ..: : :::::::::
XP_016 PPLQYVLCAATSPAVKLHEETLTYLNQGQSYEIRLLENRKLGDFQDLNTKYVKSIIRVVF
           50        60        70        80        90       100    

             130       140       150       160       170       180 
pF1KB7 HDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPAKRTS
       ::::::::::::::::.:.:::::.::.:::.::::.: :..:.::::::::::::::.:
XP_016 HDRRLQYTEHQQLEGWRWSRPGDRILDIDIPLSVGILDPRASPTQLNAVEFLWDPAKRAS
          110       120       130       140       150       160    

             190       200       210       220       230       240 
pF1KB7 AFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKPKGAD
       ::::::::::::::::::::::::::.:.::::::::::::.::::::::::::::::::
XP_016 AFIQVHCISTEFTPRKHGGEKGVPFRVQIDTFKQNENGEYTEHLHSASCQIKVFKPKGAD
          170       180       190       200       210       220    

             250       260       270       280       290       300 
pF1KB7 RKQKTDREKMEKRTAHEKEKYQPSYDTTILTECSPWPDAPTAYVNNSPSPAPTFTSPQQS
       :::::::::::::::.:::::::::.::::::                            
XP_016 RKQKTDREKMEKRTAQEKEKYQPSYETTILTEEAQRDPLGTSLAPGVTY           
          230       240       250       260       270              

             310       320       330       340       350       360 
pF1KB7 TCSVPDSNSSSPNHQGDGASQTSGEQIQPSATIQETQQWLLKNRFSSYTRLFSNFSGADL

>>NP_001166924 (OMIM: 189889) alpha-globin transcription  (450 aa)
 initn: 1850 init1: 1074 opt: 1099  Z-score: 982.2  bits: 191.1 E(85289): 5.7e-48
Smith-Waterman score: 2057; 64.0% identity (80.4% similar) in 509 aa overlap (1-504:1-450)

                  10        20        30        40        50       
pF1KB7 MAWVLKM---DEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDG
       :::.::.   :::::::::.::::::::::::::::::::::::::::::::.:::: :.
NP_001 MAWALKLPLADEVIESGLVQDFDASLSGIGQELGAGAYSMSDVLALPIFKQEESSLPPDN
               10        20        30        40        50        60

        60        70        80        90       100       110       
pF1KB7 ETEHPPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSII
       :..  :::::.:::::::::::::::::::::::::::::::::.:..:::::::::::.
NP_001 ENKILPFQYVLCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKLGELPEINGKLVKSIF
               70        80        90       100       110       120

       120       130       140       150       160       170       
pF1KB7 RVVFHDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPA
       ::::::::::::::::::::.:::::::.::.:::::::::: :.::.:::.::::::::
NP_001 RVVFHDRRLQYTEHQQLEGWRWNRPGDRILDIDIPMSVGIIDPRANPTQLNTVEFLWDPA
              130       140       150       160       170       180

       180       190       200       210       220       230       
pF1KB7 KRTSAFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKP
       ::::.:::                                                   :
NP_001 KRTSVFIQ---------------------------------------------------P
                                                                   

       240       250       260       270       280       290       
pF1KB7 KGADRKQKTDREKMEKRTAHEKEKYQPSYDTTILTECSPWPDAPTAYVNNSPSPAPTFTS
       :::::::::::::::::: ::::::::::.:::::::::::.   .::::::::.  :.:
NP_001 KGADRKQKTDREKMEKRTPHEKEKYQPSYETTILTECSPWPE--ITYVNNSPSPG--FNS
     190       200       210       220       230         240       

       300       310       320       330       340       350       
pF1KB7 PQQSTCSVPDSNSSSPNHQGDGASQTSGEQIQPSATIQETQQWLLKNRFSSYTRLFSNFS
        ..:. :. ..:.: :::: .    .. ... :..: ::.:::: .::::..::::.:::
NP_001 -SHSSFSLGEGNGS-PNHQPEPPPPVT-DNLLPTTTPQEAQQWLHRNRFSTFTRLFTNFS
          250        260       270        280       290       300  

       360       370       380       390       400       410       
pF1KB7 GADLLKLTKEDLVQICGAADGIRLYNSLKSRSVRPRLTIYVCREQPSSTVLQGQQQAASS
       ::::::::..:..:::: ::::::.:.::.: ::::::::::.:. .    : :::  ..
NP_001 GADLLKLTRDDVIQICGPADGIRLFNALKGRMVRPRLTIYVCQESLQLREQQQQQQQQQQ
            310       320       330       340       350       360  

       420         430       440       450       460       470     
pF1KB7 ASENG--SGAPYVYHAIYLEEMIASEVARKLALVFNIPLHQINQVYRQGPTGIHILVSDQ
         :.:  .:. .:::::::::. : :...:.: .:.:   ::.:.:.:::::::.:.::.
NP_001 KHEDGDSNGTFFVYHAIYLEELTAVELTEKIAQLFSISPCQISQIYKQGPTGIHVLISDE
            370       380       390       400       410       420  

         480       490       500    
pF1KB7 MVQNFQDESCFLFSTVKAESSDGIHIILK
       :.::::.:.::...:.: :..:. :::::
NP_001 MIQNFQEEACFILDTMK-ETNDSYHIILK
            430        440       450




504 residues in 1 query   sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov  5 09:52:07 2016 done: Sat Nov  5 09:52:08 2016
 Total Scan time:  9.040 Total Display time:  0.070

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or Suggestions ?
Send a message to flexiclone AT kazusagt.com