Result of FASTA (omim) for pF1KB7554
FASTA searches a protein or DNA sequence data bank, version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7554, 270 aa
  1>>>pF1KB7554 270 - 270 aa - 270 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 8.0692+/-0.00032; mu= 5.7305+/- 0.020
 mean_var=253.4285+/-51.818, 0's: 0 Z-trim(124.1): 174  B-trim: 2320 in 1/57
 Lambda= 0.080565
 statistics sampled from 44944 (45176) to 44944 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.83), E-opt: 0.2 (0.53), width:  16
 Scan time:  8.760

The best scores are:                                      opt bits E(85289)
NP_002720 (OMIM: 604420) hematopoietically-express ( 270) 1870 229.3 5.7e-60
NP_057254 (OMIM: 604240) T-cell leukemia homeobox  ( 284)  322 49.4 8.6e-06
NP_068777 (OMIM: 142995) H2.0-like homeobox protei ( 488)  326 50.2 8.8e-06
NP_004088 (OMIM: 600034) homeobox protein EMX1 [Ho ( 290)  296 46.4 7.1e-05
NP_066305 (OMIM: 604640) T-cell leukemia homeobox  ( 291)  285 45.2 0.00017
NP_006159 (OMIM: 602563) homeobox protein Nkx-6.1  ( 367)  276 44.2 0.00041
NP_006158 (OMIM: 602041) homeobox protein Nkx-3.1  ( 234)  259 42.0  0.0012
XP_011538046 (OMIM: 186770) PREDICTED: T-cell leuk ( 342)  258 42.1  0.0017
NP_002132 (OMIM: 142953) homeobox protein Hox-A4 [ ( 320)  257 41.9  0.0017
NP_001243268 (OMIM: 602041) homeobox protein Nkx-3 ( 159)  251 40.9  0.0018
NP_000200 (OMIM: 125853,260370,600733,606176,60639 ( 283)  254 41.5   0.002
NP_001092304 (OMIM: 603354) homeobox protein GBX-1 ( 363)  255 41.8  0.0022
XP_016868632 (OMIM: 610772) PREDICTED: homeobox pr ( 265)  249 40.9  0.0029
NP_004089 (OMIM: 269160,600035) homeobox protein E ( 252)  248 40.8  0.0031
XP_016867454 (OMIM: 603354) PREDICTED: homeobox pr ( 203)  246 40.4  0.0031
XP_016867453 (OMIM: 603354) PREDICTED: homeobox pr ( 204)  246 40.4  0.0032
NP_076920 (OMIM: 142965) homeobox protein Hox-B4 [ ( 251)  246 40.5  0.0036
NP_005510 (OMIM: 600647) homeobox protein HMX2 [Ho ( 273)  245 40.5  0.0041
XP_005269800 (OMIM: 600647) PREDICTED: homeobox pr ( 273)  245 40.5  0.0041
XP_011530999 (OMIM: 600034) PREDICTED: homeobox pr ( 119)  238 39.2  0.0042
NP_005512 (OMIM: 186770) T-cell leukemia homeobox  ( 330)  246 40.7  0.0043
NP_055435 (OMIM: 142974) homeobox protein Hox-C4 [ ( 264)  244 40.3  0.0044
NP_705897 (OMIM: 142974) homeobox protein Hox-C4 [ ( 264)  244 40.3  0.0044
NP_001795 (OMIM: 600746) homeobox protein CDX-1 [H ( 265)  243 40.2  0.0048
NP_001256 (OMIM: 600297) homeobox protein CDX-2 [H ( 313)  244 40.4  0.0049
NP_796374 (OMIM: 605955) homeobox protein Nkx-6.2  ( 277)  243 40.2  0.0049
NP_067545 (OMIM: 603260) homeobox protein BarH-lik ( 254)  242 40.1   0.005
XP_011533177 (OMIM: 600297) PREDICTED: homeobox pr ( 321)  243 40.3  0.0054
XP_011538047 (OMIM: 186770) PREDICTED: T-cell leuk ( 269)  241 40.0  0.0056
XP_016872278 (OMIM: 605955) PREDICTED: homeobox pr ( 277)  241 40.0  0.0058
NP_006483 (OMIM: 136760,606014) homeobox protein a ( 343)  240 40.0  0.0072
XP_011541345 (OMIM: 604823) PREDICTED: homeobox pr ( 233)  235 39.2  0.0083
NP_003649 (OMIM: 604823) homeobox protein BarH-lik ( 279)  235 39.3  0.0094
NP_004378 (OMIM: 108900,187500,217095,225250,60058 ( 324)  236 39.5  0.0095
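Note: each row of the best-scores table above has the layout accession, truncated description, (library sequence length), opt score, bit score, E-value. The following is a minimal Python sketch for reading one such row. It is an illustration based only on the rows shown in this report, not part of the FASTA package; the names HIT_RE and parse_hit are hypothetical. The description is kept as a raw string because FASTA truncates it, sometimes in the middle of the OMIM list.

```python
import re

# Hypothetical pattern inferred from the rows in this report:
# accession, free-text description, "( length)", opt, bits, E-value.
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*"
    r"\(\s*(?P<length>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>[0-9.eE+-]+)\s*$"
)


def parse_hit(line):
    """Return a dict for one best-scores row, or None if the line is not a row."""
    m = HIT_RE.match(line)
    if m is None:
        return None
    return {
        "accession": m.group("acc"),
        "description": m.group("desc"),  # truncated by FASTA, kept verbatim
        "length": int(m.group("length")),
        "opt": int(m.group("opt")),
        "bits": float(m.group("bits")),
        "evalue": float(m.group("evalue")),
    }


if __name__ == "__main__":
    row = ("NP_002720 (OMIM: 604420) hematopoietically-express "
           "( 270) 1870 229.3 5.7e-60")
    print(parse_hit(row))
```

Lines that do not end in the three numeric columns, such as the table header, return None, so the function can be applied to every line between the header and the first ">>" alignment record.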


>>NP_002720 (OMIM: 604420) hematopoietically-expressed h  (270 aa)
 initn: 1870 init1: 1870 opt: 1870  Z-score: 1197.6  bits: 229.3 E(85289): 5.7e-60
Smith-Waterman score: 1870; 100.0% identity (100.0% similar) in 270 aa overlap (1-270:1-270)

               10        20        30        40        50        60
pF1KB7 MQYPHPGPAAGAVGVPLYAPTPLLQPAHPTPFYIEDILGRGPAAPTPAPTLPSPNSSFTS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 MQYPHPGPAAGAVGVPLYAPTPLLQPAHPTPFYIEDILGRGPAAPTPAPTLPSPNSSFTS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 LVSPYRTPVYEPTPIHPAFSHHSAAALAAAYGPGGFGGPLYPFPRTVNDYTHALLRHDPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 LVSPYRTPVYEPTPIHPAFSHHSAAALAAAYGPGGFGGPLYPFPRTVNDYTHALLRHDPL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 GKPLLWSPFLQRPLHKRKGGQVRFSNDQTIELEKKFETQKYLSPPERKRLAKMLQLSERQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 GKPLLWSPFLQRPLHKRKGGQVRFSNDQTIELEKKFETQKYLSPPERKRLAKMLQLSERQ
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 VKTWFQNRRAKWRRLKQENPQSNKKEELESLDSSCDQRQDLPSEQNKGASLDSSQCSPSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 VKTWFQNRRAKWRRLKQENPQSNKKEELESLDSSCDQRQDLPSEQNKGASLDSSQCSPSP
              190       200       210       220       230       240

              250       260       270
pF1KB7 ASQEDLESEISEDSDQEVDIEGDKSYFNAG
       ::::::::::::::::::::::::::::::
NP_002 ASQEDLESEISEDSDQEVDIEGDKSYFNAG
              250       260       270

>>NP_057254 (OMIM: 604240) T-cell leukemia homeobox prot  (284 aa)
 initn: 301 init1: 238 opt: 322  Z-score: 225.0  bits: 49.4 E(85289): 8.6e-06
Smith-Waterman score: 322; 36.3% identity (62.6% similar) in 171 aa overlap (41-204:56-224)

               20        30        40        50         60         
pF1KB7 GAVGVPLYAPTPLLQPAHPTPFYIEDILGRGPAAPTPAPTL-PSPNSSFTSLVSPYRTPV
                                     : ..  :: .: : :.:: ..  .  :.:.
NP_057 SGPETPGGGLGLGRGGQGHGENGAFSGGYHGASGYGPAGSLAPLPGSSGVGPGGVIRVPA
          30        40        50        60        70        80     

      70        80        90        100           110       120    
pF1KB7 YEPTPIHPAFSHHSAAALAAAYG-PGGFGGPLYPFP----RTVNDYTHALLRHDPL-GKP
       ..: :. :  .   :.   .. :  ::..:  .:.     : ..:   : :  .:. :  
NP_057 HRPLPVPPPAGGAPAVPGPSGLGGAGGLAGLTFPWMDSGRRFAKDRLTAAL--SPFSGTR
          90       100       110       120       130         140   

           130       140       150       160       170       180   
pF1KB7 LLWSPFLQRPLHKRKGGQVRFSNDQTIELEKKFETQKYLSPPERKRLAKMLQLSERQVKT
        .  :. .:   :::  .. :: .:..:::..:  ::::.  ::  ::: :.... ::::
NP_057 RIGHPYQNRTPPKRKKPRTSFSRSQVLELERRFLRQKYLASAERAALAKALRMTDAQVKT
           150       160       170       180       190       200   

           190       200       210       220       230       240   
pF1KB7 WFQNRRAKWRRLKQENPQSNKKEELESLDSSCDQRQDLPSEQNKGASLDSSQCSPSPASQ
       ::::::.::::   :. ....                                       
NP_057 WFQNRRTKWRRQTAEEREAERHRAGRLLLHLQQDALPRPLRPPLPPDPLCLHNSSLFALQ
           210       220       230       240       250       260   

>>NP_068777 (OMIM: 142995) H2.0-like homeobox protein [H  (488 aa)
 initn: 296 init1: 252 opt: 326  Z-score: 224.8  bits: 50.2 E(85289): 8.8e-06
Smith-Waterman score: 326; 32.0% identity (53.7% similar) in 281 aa overlap (2-260:136-406)

                                            10            20       
pF1KB7                              MQYPHPGPAAGAVGVPL----YAPTPLLQPA
                                     : : : : :::.  :      .:.:  . .
NP_068 AGFPQRLSPLSAAYHHHHPQQQQQQQQPQQQQPPPPPRAGALQPPASGTRVVPNPHHSGS
         110       120       130       140       150       160     

        30              40        50        60        70        80 
pF1KB7 HPTP------FYIEDILGRGPAAPTPAPTLPSPNSSFTSLVSPYRTPVYEPTPIHPAFSH
        :.:      : :. ::.   :   :     .   ..:::..  :     :. .: .  .
NP_068 APAPSSKDLKFGIDRILS---AEFDPKVKEGNTLRDLTSLLTGGR-----PAGVHLSGLQ
         170       180          190       200            210       

              90           100       110       120       130       
pF1KB7 HSAAALAAAYGP----GGFGGPLYPFPRTVNDYTHALLRHDPLGKPLLWSPFL-QRPLHK
        ::. . :.  :    ... .::   ::  :.  : .    :    .: .  . :   .:
NP_068 PSAGQFFASLDPINEASAILSPLNSNPR--NSVQHQFQDTFPGPYAVLTKDTMPQTYKRK
       220       230       240         250       260       270     

        140       150       160       170       180       190      
pF1KB7 RKGGQVRFSNDQTIELEKKFETQKYLSPPERKRLAKMLQLSERQVKTWFQNRRAKWRRLK
       :. ... ::: :   :::.:: :::.. :.::.:: :: :.. :::.:::::: :::. :
NP_068 RSWSRAVFSNLQRKGLEKRFEIQKYVTKPDRKQLAAMLGLTDAQVKVWFQNRRMKWRHSK
         280       290       300       310       320       330     

        200       210           220         230       240          
pF1KB7 QENPQSNKKEELESLDS----SCDQRQDL--PSEQNKGASLDSSQCSPSPASQEDLE-SE
       . . :..: .:     :    . : .::   ::...  :  .::.      .  : : .:
NP_068 EAQAQKDKDKEAGEKPSGGAPAADGEQDERSPSRSEGEAESESSDSESLDMAPSDTERTE
         340       350       360       370       380       390     

     250       260       270                                       
pF1KB7 ISEDSDQEVDIEGDKSYFNAG                                       
        :: : ... .                                                 
NP_068 GSERSLHQTTVIKAPVTGALITASSAGSGGSSGGGGNSFSFSSASSLSSSSTSAGCASSL
         400       410       420       430       440       450     

>>NP_004088 (OMIM: 600034) homeobox protein EMX1 [Homo s  (290 aa)
 initn: 259 init1: 190 opt: 296  Z-score: 208.6  bits: 46.4 E(85289): 7.1e-05
Smith-Waterman score: 299; 31.5% identity (56.8% similar) in 241 aa overlap (7-232:58-285)

                                       10        20        30      
pF1KB7                         MQYPHPGPAAGAVGVPLYAPTPLLQPAHPTPFYIED
                                     : ..:..:  : : .   .: .:: .    
NP_004 APAAATMFQPAAKRGFTIESLVAKDGGTGGGTGGGGAGSHLLAAAASEEPLRPTALNYPH
        30        40        50        60        70        80       

         40        50        60           70        80        90   
pF1KB7 ILGRGPAAPTPAPTLPSPNSSFTSL-VSPYRTP--VYEPTPIHPAFSHHSAAALAAA--Y
            :.:   : .   : .. ..   : :  :  :.  .  :::.. : :  :.:.   
NP_004 -----PSAAEAAFVSGFPAAAAAGAGRSLYGGPELVFPEAMNHPALTVHPAHQLGASPLQ
             90       100       110       120       130       140  

                  100         110        120       130       140   
pF1KB7 GPGGFGG-----PLYPFPRTVND--YTHALLRHD-PLGKPLLWSPFLQRPLHKRKGGQVR
        : .: :     ::. .: .. .  . : .   : :    :: .:: ..: . : .    
NP_004 PPHSFFGAQHRDPLHFYPWVLRNRFFGHRFQASDVPQDGLLLHGPFARKPKRIRTA----
            150       160       170       180       190            

           150       160       170       180       190         200 
pF1KB7 FSNDQTIELEKKFETQKYLSPPERKRLAKMLQLSERQVKTWFQNRRAKWRR--LKQENPQ
       :: .: ..::. :: ..:.   :::.::  :.::: :::.::::::.:..:  :..:.:.
NP_004 FSPSQLLRLERAFEKNHYVVGAERKQLAGSLSLSETQVKVWFQNRRTKYKRQKLEEEGPE
      200       210       220       230       240       250        

             210       220       230       240       250       260 
pF1KB7 SNKKEELESLDSSCDQRQDLPSEQNKGASLDSSQCSPSPASQEDLESEISEDSDQEVDIE
       :..:..     :   .:  . ..: .: ..:                             
NP_004 SEQKKK----GSHHINRWRIATKQANGEDIDVTSND                        
      260           270       280       290                        

             270
pF1KB7 GDKSYFNAG

>>NP_066305 (OMIM: 604640) T-cell leukemia homeobox prot  (291 aa)
 initn: 356 init1: 239 opt: 285  Z-score: 201.6  bits: 45.2 E(85289): 0.00017
Smith-Waterman score: 316; 32.0% identity (53.2% similar) in 231 aa overlap (2-206:8-235)

                      10        20        30        40        50   
pF1KB7       MQYPHPG-PAAGAVGVPLYAPTPLLQPAHPTPFYIEDILGRGPAAPTPAPTLPS
              : :::  : . ..   : .:     :: :      . :: :: .  :. : ::
NP_066 MEAPASAQTPHPHEPISFGIDQILNSPDQDSAPA-PRGPDGASYLG-GPPGGRPGATYPS
               10        20        30         40         50        

            60                          70          80        90   
pF1KB7 PNSSFTSLVSPY------------------RTPVYEPTP--IHPAFSHHSAAALAAAYGP
         .::..: .:.                  :.:...: :  . : .   .  :. ..   
NP_066 LPASFAGLGAPFEDAGSYSVNLSLAPAGVIRVPAHRPLPGAVPPPLPS-ALPAMPSVPTV
       60        70        80        90       100        110       

           100           110       120        130       140        
pF1KB7 GGFGGPLYPFP----RTVNDYTHALLRHDPLGKPL-LWSPFLQRPLHKRKGGQVRFSNDQ
       ...::  .:.     : :.:   :     :.     .  :. .:   :::  .. ::  :
NP_066 SSLGGLNFPWMESSRRFVKDRFTAAAALTPFTVTRRIGHPYQNRTPPKRKKPRTSFSRVQ
       120       130       140       150       160       170       

      150       160       170       180       190       200        
pF1KB7 TIELEKKFETQKYLSPPERKRLAKMLQLSERQVKTWFQNRRAKWRRLKQENPQSNKKEEL
         ::::.:. ::::.  ::  ::: :.... ::::::::::.::::   :. ......  
NP_066 ICELEKRFHRQKYLASAERAALAKSLKMTDAQVKTWFQNRRTKWRRQTAEEREAERQQAS
       180       190       200       210       220       230       

      210       220       230       240       250       260        
pF1KB7 ESLDSSCDQRQDLPSEQNKGASLDSSQCSPSPASQEDLESEISEDSDQEVDIEGDKSYFN
                                                                   
NP_066 RLMLQLQHDAFQKSLNDSIQPDPLCLHNSSLFALQNLQPWEEDSSKVPAVTSLV      
       240       250       260       270       280       290       

>>NP_006159 (OMIM: 602563) homeobox protein Nkx-6.1 [Hom  (367 aa)
 initn: 280 init1: 231 opt: 276  Z-score: 194.8  bits: 44.2 E(85289): 0.00041
Smith-Waterman score: 297; 32.0% identity (51.2% similar) in 297 aa overlap (5-257:68-337)

                                             10        20        30
pF1KB7                           MQYPHPG----PAAGAVGVPLYAPTPLLQPAHPT
                                     .::    ::.:...  : .:   :. :  :
NP_006 AAYPPLPAGPPSSSSSSSSSSSPSPPLGTHNPGGLKPPATGGLS-SLGSPPQQLSAA--T
        40        50        60        70        80         90      

               40         50        60        70        80         
pF1KB7 PFYIEDILGRGPAAPTPA-PTLPSPNSSFTSLVSPYRTPVYEPTPIHPAFSHHSAAALAA
       :  :.:::.: :. :. .  .::: . : .:  :   . .   .    :    .::: ::
NP_006 PHGINDILSR-PSMPVASGAALPSASPSGSSSSSSSSASASSASAAAAA----AAAAAAA
          100        110       120       130       140             

      90             100       110               120            130
pF1KB7 AYGPGGF--GGP----LYPFPRTVNDY-------THALLRHD-PLGK-----PLLWSPFL
       : .:.:.  : :    : : :   . :       . :. :.  ::..     :..:   .
NP_006 ASSPAGLLAGLPRFSSLSPPPPPPGLYFSPSAAAVAAVGRYPKPLAELPGRTPIFWPGVM
     150       160       170       180       190       200         

                                  140       150       160       170
pF1KB7 QRPL----------H----------KRKGGQVRFSNDQTIELEKKFETQKYLSPPERKRL
       : :           :          :::  .  ::..: . ::: ::  :::. ::: ::
NP_006 QSPPWRDARLACTPHQGSILLDKDGKRKHTRPTFSGQQIFALEKTFEQTKYLAGPERARL
     210       220       230       240       250       260         

              180       190       200       210       220       230
pF1KB7 AKMLQLSERQVKTWFQNRRAKWRRLKQENPQSNKKEELESLDSSCDQRQDLPSEQNKGAS
       :  : ..: :::.::::::.:::. .  .  . ::            .::  .:. ::::
NP_006 AYSLGMTESQVKVWFQNRRTKWRKKHAAEMATAKK------------KQDSETERLKGAS
     270       280       290       300                   310       

              240       250       260       270                 
pF1KB7 LDSSQCSPSPASQEDLESEISEDSDQEVDIEGDKSYFNAG                 
        .  .       ..: .. .. .::.:                              
NP_006 ENEEE-------DDDYNKPLDPNSDDEKITQLLKKHKSSSGGGGGLLLHASEPESSS
       320              330       340       350       360       

>>NP_006158 (OMIM: 602041) homeobox protein Nkx-3.1 isof  (234 aa)
 initn: 266 init1: 220 opt: 259  Z-score: 186.4  bits: 42.0 E(85289): 0.0012
Smith-Waterman score: 292; 35.4% identity (53.1% similar) in 254 aa overlap (4-241:7-232)

                  10        20         30        40        50      
pF1KB7    MQYPHPGPAAGAVGVPLYAPTPLLQPAHP-TPFYIEDILGRGPAAPTPAPTLPSPNS
             :.:: : .  ..:   ::    :..: : : :.:::  :      :    . .:
NP_006 MLRVPEPRPGEAKAEGAAP---PT----PSKPLTSFLIQDILRDG------AQRQGGRTS
               10           20            30              40       

         60        70        80        90       100       110      
pF1KB7 SFTSLVSPYRTPVYEPTPIHPAFSHHSAAALAAAYGPGGFGGPLYPFPRTVNDYTHALLR
       :     .  : :  :: : .:  ..  :.:     . :         ::.. . ...: .
NP_006 S-----QRQRDPEPEPEP-EPEGGRSRAGAQNDQLSTG---------PRAAPEEAETLAE
             50        60         70                 80        90  

           120              130        140       150       160     
pF1KB7 HDP---LGKPLLWS-------PFL-QRPLHKRKGGQVRFSNDQTIELEKKFETQKYLSPP
        .:   ::. :: :       : : : : . .: ... ::. :.::::.::  ::::: :
NP_006 TEPERHLGSYLLDSENTSGALPRLPQTPKQPQKRSRAAFSHTQVIELERKFSHQKYLSAP
            100       110       120       130       140       150  

         170       180       190         200       210       220   
pF1KB7 ERKRLAKMLQLSERQVKTWFQNRRAKWRR--LKQENPQSNKKEELESLDSSCDQRQDLPS
       :: .::: :.:.: ::: :::::: : .:  :..:  . .:.  : .:     .: .: :
NP_006 ERAHLAKNLKLTETQVKIWFQNRRYKTKRKQLSSELGDLEKHSSLPALKEEAFSRASLVS
            160       170       180       190       200       210  

           230         240       250       260       270
pF1KB7 EQNKGASLDSSQC--SPSPASQEDLESEISEDSDQEVDIEGDKSYFNAG
         :.        :  : :::                             
NP_006 VYNSYPYYPYLYCVGSWSPAFW                           
            220       230                               

>>XP_011538046 (OMIM: 186770) PREDICTED: T-cell leukemia  (342 aa)
 initn: 265 init1: 213 opt: 258  Z-score: 183.8  bits: 42.1 E(85289): 0.0017
Smith-Waterman score: 263; 28.4% identity (55.1% similar) in 236 aa overlap (6-224:82-306)

                                        10        20        30     
pF1KB7                          MQYPHPGPAAGAVGVPLYAPTPLLQPAHPTPFYIE
                                     ::  ::. :.  ..:           . ..
XP_011 GGAYTYGGGGSAAATGAGGAGAYGTGGPGGPGGPAGGGGACSMGPLT-------GSYNVN
              60        70        80        90              100    

          40        50        60        70              80         
pF1KB7 DILGRGPAAPTPAPTLPSPNSSFTSLVSPYRTPVYEPTP---IHP---AFSHHSAAALAA
         :. ::.    . .  : ...  : ..  :.:...:      ::   : .  .. .. :
XP_011 MALAGGPGPGGGGGS--SGGAGALSAAGVIRVPAHRPLAGAVAHPQPLATGLPTVPSVPA
          110         120       130       140       150       160  

      90       100       110       120            130       140    
pF1KB7 AYGPGGFGGPLYPFPRTVNDYTHALLRHDPLGKPLLWS-----PFLQRPLHKRKGGQVRF
         : ... :  .:. ..   ::.   :     .:.  .     :. .:   :.:  .. :
XP_011 MPGVNNLTGLTFPWMESNRRYTKD--RFTVALSPFTVTRRIGHPYQNRTPPKKKKPRTSF
            170       180         190       200       210       220

          150       160       170       180       190       200    
pF1KB7 SNDQTIELEKKFETQKYLSPPERKRLAKMLQLSERQVKTWFQNRRAKWRRLKQENPQSNK
       .  :  ::::.:. ::::.  ::  ::: :.... ::::::::::.::::   :. ....
XP_011 TRLQICELEKRFHRQKYLASAERAALAKALKMTDAQVKTWFQNRRTKWRRQTAEEREAER
              230       240       250       260       270       280

             210          220       230       240       250        
pF1KB7 KEE---LESLDSSCDQR---QDLPSEQNKGASLDSSQCSPSPASQEDLESEISEDSDQEV
       ..    : .:..   :.   : ::..                                  
XP_011 QQANRILLQLQQEAFQKSLAQPLPADPLCVHNSSLFALQNLQPWSDDSTKITSVTSVASA
              290       300       310       320       330       340

>>NP_002132 (OMIM: 142953) homeobox protein Hox-A4 [Homo  (320 aa)
 initn: 247 init1: 171 opt: 257  Z-score: 183.6  bits: 41.9 E(85289): 0.0017
Smith-Waterman score: 272; 27.8% identity (52.0% similar) in 273 aa overlap (2-250:59-320)

                                                10             20  
pF1KB7                              MQYPHPG----PAAGAVGV-----PLYAPTP
                                     : :: :    :.:.  .      : : :. 
NP_002 SGGADGGPGGGPGYQQPPAPPTQHLPLQQPQLPHAGGGREPTASYYAPRTAREPAY-PAA
       30        40        50        60        70        80        

             30        40          50               60        70   
pF1KB7 LLQPAHPTPFYIEDILGRGPAAP--TPAPTLP-----SPNSSF--TSLVSPYRTPVYEPT
        : ::: .         :: :.:   : :  :     .:  ..  . ...:   :  .: 
NP_002 ALYPAHGAADTAYPYGYRGGASPGRPPQPEQPPAQAKGPAHGLHASHVLQPQLPPPLQPR
        90       100       110       120       130       140       

            80        90       100       110        120       130  
pF1KB7 PIHPAFSHHSAAALAAAYGPGGFGGPLYPFPRTVNDYTHALLR-HDPLGKPLLWSPFLQR
        . ::  ..  :: :.   :.: ..:  :.   . : .   :. ..:.  : . .  .. 
NP_002 AVPPAAPRRCEAAPATPGVPAGGSAPACPL--LLADKSPLGLKGKEPVVYPWMKKIHVSA
       150       160       170         180       190       200     

            140            150       160       170       180       
pF1KB7 PLHKRKGGQVR-----FSNDQTIELEKKFETQKYLSPPERKRLAKMLQLSERQVKTWFQN
          . .::. .     .. .:..::::.:. ..::.  .: ..:. : ::::::: ::::
NP_002 VNPSYNGGEPKRSRTAYTRQQVLELEKEFHFNRYLTRRRRIEIAHTLCLSERQVKIWFQN
         210       220       230       240       250       260     

       190       200       210       220       230       240       
pF1KB7 RRAKWRRLKQENPQSNKKEELESLDSSCDQRQDLPSEQNKGASLDSSQCSPSPASQEDLE
       :: ::   :...   : :  ..: .:.  .     . :...  :      : :...  . 
NP_002 RRMKW---KKDHKLPNTK--MRSSNSASASAGPPGKAQTQSPHLHPH---PHPSTSTPVP
         270          280         290       300          310       

       250       260       270
pF1KB7 SEISEDSDQEVDIEGDKSYFNAG
       : :                    
NP_002 SSI                    
       320                    

>>NP_001243268 (OMIM: 602041) homeobox protein Nkx-3.1 i  (159 aa)
 initn: 236 init1: 220 opt: 251  Z-score: 183.3  bits: 40.9 E(85289): 0.0018
Smith-Waterman score: 252; 41.9% identity (56.8% similar) in 155 aa overlap (102-241:5-157)

              80        90       100       110          120        
pF1KB7 PTPIHPAFSHHSAAALAAAYGPGGFGGPLYPFPRTVNDYTHALL---RHDPLGKPLLWS-
                                     : ::  .  : :     ::  ::. :: : 
NP_001                           MLRVPEPRPGEAETLAETEPERH--LGSYLLDSE
                                         10        20          30  

             130        140       150       160       170       180
pF1KB7 ------PFL-QRPLHKRKGGQVRFSNDQTIELEKKFETQKYLSPPERKRLAKMLQLSERQ
             : : : : . .: ... ::. :.::::.::  ::::: ::: .::: :.:.: :
NP_001 NTSGALPRLPQTPKQPQKRSRAAFSHTQVIELERKFSHQKYLSAPERAHLAKNLKLTETQ
             40        50        60        70        80        90  

              190         200       210       220       230        
pF1KB7 VKTWFQNRRAKWRR--LKQENPQSNKKEELESLDSSCDQRQDLPSEQNKGASLDSSQC--
       :: :::::: : .:  :..:  . .:.  : .:     .: .: :  :.        :  
NP_001 VKIWFQNRRYKTKRKQLSSELGDLEKHSSLPALKEEAFSRASLVSVYNSYPYYPYLYCVG
            100       110       120       130       140       150  

        240       250       260       270
pF1KB7 SPSPASQEDLESEISEDSDQEVDIEGDKSYFNAG
       : :::                             
NP_001 SWSPAFW                           
                                         




270 residues in 1 query   sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov  5 18:27:13 2016 done: Sat Nov  5 18:27:15 2016
 Total Scan time:  8.760 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or suggestions?
Send a message to flexiclone AT kazusagt.com