Result of FASTA (ccds) for pFN21AE6254
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6254, 295 aa
  1>>>pF1KE6254 295 - 295 aa - 295 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 5.5580+/-0.000697; mu= 14.9641+/- 0.042
 mean_var=72.6815+/-14.710, 0's: 0 Z-trim(110.7): 20  B-trim: 60 in 1/50
 Lambda= 0.150440
 statistics sampled from 11793 (11803) to 11793 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.722), E-opt: 0.2 (0.363), width:  16
 Scan time:  2.150

The best scores are:                                      opt bits E(32554)
CCDS46560.1 AQP12B gene_id:653437|Hs108|chr2       ( 307) 1697 376.9  1e-104
CCDS8251.1 AQP11 gene_id:282679|Hs108|chr11        ( 271)  347 83.9 1.5e-16


>>CCDS46560.1 AQP12B gene_id:653437|Hs108|chr2            (307 aa)
 initn: 1695 init1: 1695 opt: 1697  Z-score: 1993.6  bits: 376.9 E(32554): 1e-104
Smith-Waterman score: 1912; 95.1% identity (95.8% similar) in 307 aa overlap (1-295:1-307)

               10        20        30        40                    
pF1KE6 MAGLNVSLSFFFATFALCEAARRASKALLPVGAYEVFAREA------------MRTLVEL
       :::::::::::::::.:::::::::::::::::::::::::            :::::::
CCDS46 MAGLNVSLSFFFATFTLCEAARRASKALLPVGAYEVFAREAVGAVQLGACFLEMRTLVEL
               10        20        30        40        50        60

       50        60        70        80        90       100        
pF1KE6 GPWAGDFGPDLLLTLLFLLFLAHGVTLDGASANPTVSLQEFLMAEQSLPGTLLKLAAQGL
       :::::::::::::::::::::::::::::::::::::::::::::.::::::::::::::
CCDS46 GPWAGDFGPDLLLTLLFLLFLAHGVTLDGASANPTVSLQEFLMAEESLPGTLLKLAAQGL
               70        80        90       100       110       120

      110       120       130       140       150       160        
pF1KE6 GMQAACTLMRLCWAWELSDLHLLQSLMAQSCSSALRTSVPHGALVEAACAFCFHLTLLHL
       :::::::: :::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 GMQAACTLTRLCWAWELSDLHLLQSLMAQSCSSALRTSVPHGALVEAACAFCFHLTLLHL
              130       140       150       160       170       180

      170       180       190       200       210       220        
pF1KE6 RHSPPAYSGPAVALLVTVTAYTAGPFTSAFFNPALAASVTFACSGHTLLEYVQVYWLGPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 RHSPPAYSGPAVALLVTVTAYTAGPFTSAFFNPALAASVTFACSGHTLLEYVQVYWLGPL
              190       200       210       220       230       240

      230       240       250       260       270       280        
pF1KE6 TGMVLAVLLHQGRLPHLFQRNLFYGQKNKYRAPRGKPAPASGDTQTPAKGSSVREPGRSG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 TGMVLAVLLHQGRLPHLFQRNLFYGQKNKYRAPRGKPAPASGDTQTPAKGSSVREPGRSG
              250       260       270       280       290       300

      290     
pF1KE6 VEGPHSS
       :::::::
CCDS46 VEGPHSS
              

>>CCDS8251.1 AQP11 gene_id:282679|Hs108|chr11             (271 aa)
 initn: 362 init1: 224 opt: 347  Z-score: 410.9  bits: 83.9 E(32554): 1.5e-16
Smith-Waterman score: 347; 32.6% identity (65.3% similar) in 193 aa overlap (57-245:75-261)

         30        40        50        60        70        80      
pF1KE6 ALLPVGAYEVFAREAMRTLVELGPWAGDFGPDLLLTLLFLLFLAHGVTLDGASANPTVSL
                                     :   :::.... :.::.:: :.:.::   .
CCDS82 HAFVLEFLATFQLCCCTHELQLLSEQHPAHPTWTLTLVYFFSLVHGLTLVGTSSNPCGVM
           50        60        70        80        90       100    

         90       100       110       120           130       140  
pF1KE6 QEFLMAEQSLPGTLLKLAAQGLGMQAACTLMRLC----WAWELSDLHLLQSLMAQSCSSA
       ...... .:     ..: :: ..  : :.  : :    :.  :.. :. .  .:  :.. 
CCDS82 MQMMLGGMSPETGAVRLLAQLVS--ALCS--RYCTSALWSLGLTQYHVSERSFA--CKNP
          110       120         130         140       150          

            150       160       170       180       190       200  
pF1KE6 LRTSVPHGALVEAACAFCFHLTLLHLRHSPPAYSGPAVALLVTVTAYTAGPFTSAFFNPA
       .:... .....::.:.: :: .:::...         .: :.:  .:..: .:.: ::::
CCDS82 IRVDLLKAVITEAVCSFLFHSALLHFQEVRTKLRIHLLAALITFLVYAGGSLTGAVFNPA
      160       170       180       190       200       210        

            210       220       230       240       250       260  
pF1KE6 LAASVTFACSGHTLLEYVQVYWLGPLTGMVLAVLLHQGRLPHLFQRNLFYGQKNKYRAPR
       :: :. : :  ... ..  ::::.:  :..: .:. .  :: :                 
CCDS82 LALSLHFMCFDEAFPQFFIVYWLAPSLGILLMILMFSFFLPWLHNNHTINKKE       
      220       230       240       250       260       270        

            270       280       290     
pF1KE6 GKPAPASGDTQTPAKGSSVREPGRSGVEGPHSS




295 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov  8 11:30:07 2016 done: Tue Nov  8 11:30:07 2016
 Total Scan time:  2.150 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]