Result of FASTA (omim) for pFN21ASDA0140
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KSDA0140, 422 aa
  1>>>pF1KSDA0140 422 - 422 aa - 422 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 7.0721+/-0.000294; mu= 10.2957+/- 0.018
 mean_var=122.0027+/-24.544, 0's: 0 Z-trim(121.9): 10  B-trim: 635 in 1/53
 Lambda= 0.116115
 statistics sampled from 39131 (39143) to 39131 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.787), E-opt: 0.2 (0.459), width:  16
 Scan time:  8.260

The best scores are:                                      opt bits E(85289)
NP_057689 (OMIM: 609372) protein FAM53C [Homo sapi ( 392)  432 82.9 1.6e-15
NP_001129119 (OMIM: 609372) protein FAM53C [Homo s ( 392)  432 82.9 1.6e-15


>>NP_057689 (OMIM: 609372) protein FAM53C [Homo sapiens]  (392 aa)
 initn: 391 init1: 179 opt: 432  Z-score: 399.6  bits: 82.9 E(85289): 1.6e-15
Smith-Waterman score: 443; 30.4% identity (50.4% similar) in 450 aa overlap (1-422:1-392)

               10        20        30        40             50     
pF1KSD MVMVLSESLSTRGADSIACGTFSRELHTPKKMSQGPTLFSCG-----IMENDRWRDLDR-
       :. ...:.:. .  : . :  ::  :  : . .    . .::     . :.  :: : . 
NP_057 MITLITEQLQKQTLDELKCTRFSISLPLPDHAD----ISNCGNSFQLVSEGASWRGLPHC
               10        20        30            40        50      

           60               70        80        90       100       
pF1KSD KCPLQID------QPS-TSIWECLPEKDSSLWHREAVTACAVTSLIKDLSISDHNGNPSA
       .:    :      .::  :.    : . .:   .:   . ..     :    ..   : :
NP_057 SCAEFQDSLNFSYHPSGLSLHLRPPSRGNS--PKEQPFSQVLRPEPPD---PEKLPVPPA
         60        70        80          90       100          110 

       110       120       130       140       150       160       
pF1KSD PPSKRQCRSLSFSDEMSSCRTSWRPLGSKVWTPVEKRRCYSGGSVQRYSNGFSTMQRSSS
       :::::.:::::   ..:  .  :::  ::.:::...:   .::. :   .  :  .: ::
NP_057 PPSKRHCRSLSVPVDLSRWQPVWRPAPSKLWTPIKHRGSGGGGGPQVPHQ--SPPKRVSS
             120       130       140       150       160           

       170       180       190           200          210          
pF1KSD FSLPSRANVLSSPCDQAGLHHRFGGQPCQGVP----GSAPCG---QAGDTWSPD---LHP
       . .  .:   :: :  :   ::  . :  ..     .: ::.   :.: .:  :   : :
NP_057 LRF-LQAPSASSQCAPA---HRPYSPPFFSLALAQDSSRPCAASPQSG-SWESDAESLSP
     170        180          190       200       210        220    

       220        230       240       250       260          270   
pF1KSD VGGGR-LDLQRSLSCSHEQFSFVEYCPPSANSTPASTPELARRSSGLS---RSRSQPCVL
           : ..:. ::. .  .:       ::: :.:::.:::  :  ::    ::::::: :
NP_057 CPPQRRFSLSPSLGPQASRFL------PSARSSPASSPELPWRPRGLRNLPRSRSQPCDL
          230       240             250       260       270        

           280       290       300       310       320       330   
pF1KSD NDKKVGVKRRRPEEVQEQRPSLDLAKMAQNCQTFSSLSCLSAGTEDCGPQSPFARHVSNT
       . .:.:::::. :. .. :::::. :: :  . .:.  ::.  ... .  ::        
NP_057 DARKTGVKRRHEEDPRRLRPSLDFDKMNQ--KPYSGGLCLQETAREGSSISP--------
      280       290       300         310       320                

           340       350       360       370       380       390   
pF1KSD RAWTALLSASGPGGRTPAGTPVPEPLPPSFDDHLVCQEDLSCEESDSCALDEDCGRRAEP
         :  ... :            : ::  :      :.   .  .  : . .:. :     
NP_057 -PW--FMACS------------PPPLSAS------CSPTGGSSQVLSESEEEEEG-----
       330                     340             350       360       

           400       410        420  
pF1KSD AAAWRDRGAPGNSLCSLD-GELDIEQIEKN
       :. :  ..    .::. : :.::.. ::.:
NP_057 AVRWGRQALSKRTLCQRDFGDLDLNLIEEN
            370       380       390  

>>NP_001129119 (OMIM: 609372) protein FAM53C [Homo sapie  (392 aa)
 initn: 391 init1: 179 opt: 432  Z-score: 399.6  bits: 82.9 E(85289): 1.6e-15
Smith-Waterman score: 443; 30.4% identity (50.4% similar) in 450 aa overlap (1-422:1-392)

               10        20        30        40             50     
pF1KSD MVMVLSESLSTRGADSIACGTFSRELHTPKKMSQGPTLFSCG-----IMENDRWRDLDR-
       :. ...:.:. .  : . :  ::  :  : . .    . .::     . :.  :: : . 
NP_001 MITLITEQLQKQTLDELKCTRFSISLPLPDHAD----ISNCGNSFQLVSEGASWRGLPHC
               10        20        30            40        50      

           60               70        80        90       100       
pF1KSD KCPLQID------QPS-TSIWECLPEKDSSLWHREAVTACAVTSLIKDLSISDHNGNPSA
       .:    :      .::  :.    : . .:   .:   . ..     :    ..   : :
NP_001 SCAEFQDSLNFSYHPSGLSLHLRPPSRGNS--PKEQPFSQVLRPEPPD---PEKLPVPPA
         60        70        80          90       100          110 

       110       120       130       140       150       160       
pF1KSD PPSKRQCRSLSFSDEMSSCRTSWRPLGSKVWTPVEKRRCYSGGSVQRYSNGFSTMQRSSS
       :::::.:::::   ..:  .  :::  ::.:::...:   .::. :   .  :  .: ::
NP_001 PPSKRHCRSLSVPVDLSRWQPVWRPAPSKLWTPIKHRGSGGGGGPQVPHQ--SPPKRVSS
             120       130       140       150       160           

       170       180       190           200          210          
pF1KSD FSLPSRANVLSSPCDQAGLHHRFGGQPCQGVP----GSAPCG---QAGDTWSPD---LHP
       . .  .:   :: :  :   ::  . :  ..     .: ::.   :.: .:  :   : :
NP_001 LRF-LQAPSASSQCAPA---HRPYSPPFFSLALAQDSSRPCAASPQSG-SWESDAESLSP
     170        180          190       200       210        220    

       220        230       240       250       260          270   
pF1KSD VGGGR-LDLQRSLSCSHEQFSFVEYCPPSANSTPASTPELARRSSGLS---RSRSQPCVL
           : ..:. ::. .  .:       ::: :.:::.:::  :  ::    ::::::: :
NP_001 CPPQRRFSLSPSLGPQASRFL------PSARSSPASSPELPWRPRGLRNLPRSRSQPCDL
          230       240             250       260       270        

           280       290       300       310       320       330   
pF1KSD NDKKVGVKRRRPEEVQEQRPSLDLAKMAQNCQTFSSLSCLSAGTEDCGPQSPFARHVSNT
       . .:.:::::. :. .. :::::. :: :  . .:.  ::.  ... .  ::        
NP_001 DARKTGVKRRHEEDPRRLRPSLDFDKMNQ--KPYSGGLCLQETAREGSSISP--------
      280       290       300         310       320                

           340       350       360       370       380       390   
pF1KSD RAWTALLSASGPGGRTPAGTPVPEPLPPSFDDHLVCQEDLSCEESDSCALDEDCGRRAEP
         :  ... :            : ::  :      :.   .  .  : . .:. :     
NP_001 -PW--FMACS------------PPPLSAS------CSPTGGSSQVLSESEEEEEG-----
       330                     340             350       360       

           400       410        420  
pF1KSD AAAWRDRGAPGNSLCSLD-GELDIEQIEKN
       :. :  ..    .::. : :.::.. ::.:
NP_001 AVRWGRQALSKRTLCQRDFGDLDLNLIEEN
            370       380       390  




422 residues in 1 query   sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov  3 00:12:24 2016 done: Thu Nov  3 00:12:25 2016
 Total Scan time:  8.260 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or Suggestions ?
Send a message to flexiclone AT kazusagt.com