Result of FASTA (ccds) for pFN21AE5404
FASTA searches a protein or DNA sequence data bank
 version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE5404, 310 aa
  1>>>pF1KE5404 310 - 310 aa - 310 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 5.3201+/-0.00106; mu= 15.2912+/- 0.063
 mean_var=62.8704+/-13.131, 0's: 0 Z-trim(101.6): 80  B-trim: 0 in 0/51
 Lambda= 0.161753
 statistics sampled from 6526 (6607) to 6526 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.561), E-opt: 0.2 (0.203), width:  16
 Scan time:  1.820

The best scores are:                                      opt bits E(32554)
CCDS6716.1 HSD17B3 gene_id:3293|Hs108|chr9         ( 310) 1996 474.8 3.7e-134
CCDS7905.1 HSD17B12 gene_id:51144|Hs108|chr11      ( 312)  707 174.0 1.3e-43
CCDS10942.1 HSDL1 gene_id:83693|Hs108|chr16        ( 330)  550 137.4 1.5e-32
CCDS54046.1 HSDL1 gene_id:83693|Hs108|chr16        ( 275)  268 71.5 8.2e-13
CCDS9743.1 DHRS7 gene_id:51635|Hs108|chr14         ( 339)  250 67.4 1.8e-11
CCDS9605.1 DHRS4 gene_id:10901|Hs108|chr14         ( 278)  242 65.4 5.6e-11


>>CCDS6716.1 HSD17B3 gene_id:3293|Hs108|chr9              (310 aa)
 initn: 1996 init1: 1996 opt: 1996  Z-score: 2522.0  bits: 474.8 E(32554): 3.7e-134
Smith-Waterman score: 1996; 100.0% identity (100.0% similar) in 310 aa overlap (1-310:1-310)

               10        20        30        40        50        60
pF1KE5 MGDVLEQFFILTGLLVCLACLAKCVRFSRCVLLNYWKVLPKSFLRSMGQWAVITGAGDGI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS67 MGDVLEQFFILTGLLVCLACLAKCVRFSRCVLLNYWKVLPKSFLRSMGQWAVITGAGDGI
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 GKAYSFELAKRGLNVVLISRTLEKLEAIATEIERTTGRSVKIIQADFTKDDIYEHIKEKL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS67 GKAYSFELAKRGLNVVLISRTLEKLEAIATEIERTTGRSVKIIQADFTKDDIYEHIKEKL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE5 AGLEIGILVNNVGMLPNLLPSHFLNAPDEIQSLIHCNITSVVKMTQLILKHMESRQKGLI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS67 AGLEIGILVNNVGMLPNLLPSHFLNAPDEIQSLIHCNITSVVKMTQLILKHMESRQKGLI
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE5 LNISSGIALFPWPLYSMYSASKAFVCAFSKALQEEYKAKEVIIQVLTPYAVSTAMTKYLN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS67 LNISSGIALFPWPLYSMYSASKAFVCAFSKALQEEYKAKEVIIQVLTPYAVSTAMTKYLN
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE5 TNVITKTADEFVKESLNYVTIGGETCGCLAHEILAGFLSLIPAWAFYSGAFQRLLLTHYV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS67 TNVITKTADEFVKESLNYVTIGGETCGCLAHEILAGFLSLIPAWAFYSGAFQRLLLTHYV
              250       260       270       280       290       300

              310
pF1KE5 AYLKLNTKVR
       ::::::::::
CCDS67 AYLKLNTKVR
              310

>>CCDS7905.1 HSD17B12 gene_id:51144|Hs108|chr11           (312 aa)
 initn: 691 init1: 345 opt: 707  Z-score: 896.3  bits: 174.0 E(32554): 1.3e-43
Smith-Waterman score: 707; 41.7% identity (71.2% similar) in 295 aa overlap (18-306:19-312)

                10        20        30        40         50        
pF1KE5  MGDVLEQFFILTGLLVCLACLAKCVRFSRCVLLNYWKVLPKSFLR-SMGQWAVITGAGD
                         .: ::  . .:  . :  : :  .. .  ..:.:::.::. :
CCDS79 MESALPAAGFLYWVGAGTVAYLALRISYSLFTALRVWGVGNEAGVGPGLGEWAVVTGSTD
               10        20        30        40        50        60

       60        70        80        90       100       110        
pF1KE5 GIGKAYSFELAKRGLNVVLISRTLEKLEAIATEIERTTGRSVKIIQADFTKDDIYEHIKE
       ::::.:. ::::.:..::::::. .::. ...::..     .. : .::...:::..:: 
CCDS79 GIGKSYAEELAKHGMKVVLISRSKDKLDQVSSEIKEKFKVETRTIAVDFASEDIYDKIKT
               70        80        90       100       110       120

      120       130       140          150       160       170     
pF1KE5 KLAGLEIGILVNNVGMLPNLLPSHFLNAPDE---IQSLIHCNITSVVKMTQLILKHMESR
        :::::::::::::::  .  : .::..::    :...:. :: :: :::::.:  :  :
CCDS79 GLAGLEIGILVNNVGMSYEY-PEYFLDVPDLDNVIKKMININILSVCKMTQLVLPGMVER
              130       140        150       160       170         

         180       190       200       210       220       230     
pF1KE5 QKGLILNISSGIALFPWPLYSMYSASKAFVCAFSKALQEEYKAKEVIIQVLTPYAVSTAM
       .:: ::::::: ...: :: ..:::.:.::  ::. :.:::..: :..: . :: :.: .
CCDS79 SKGAILNISSGSGMLPVPLLTIYSATKTFVDFFSQCLHEEYRSKGVFVQSVLPYFVATKL
     180       190       200       210       220       230         

         240       250       260       270       280       290     
pF1KE5 TKYLNTNVITKTADEFVKESLNYVTIGGETCGCLAHEILAGFLSLIPAWAFYSGAFQ--R
       .:  . ..   . . ::: ... : . ..: : : : ......: .:.: . . ...  .
CCDS79 AKIRKPTLDKPSPETFVKSAIKTVGLQSRTNGYLIHALMGSIISNLPSWIYLKIVMNMNK
     240       250       260       270       280       290         

           300       310
pF1KE5 LLLTHYVAYLKLNTKVR
          .::.   : :    
CCDS79 STRAHYLKKTKKN    
     300       310      

>>CCDS10942.1 HSDL1 gene_id:83693|Hs108|chr16             (330 aa)
 initn: 487 init1: 286 opt: 550  Z-score: 697.9  bits: 137.4 E(32554): 1.5e-32
Smith-Waterman score: 550; 38.8% identity (68.2% similar) in 255 aa overlap (43-290:62-314)

             20        30        40        50        60        70  
pF1KE5 GLLVCLACLAKCVRFSRCVLLNYWKVLPKSFLRSMGQWAVITGAGDGIGKAYSFELAKRG
                                     .....:.:::..:: :::::::. :::.::
CCDS10 YTARKSITVICDFYSLIRLHFIPRLGSRADLIKQYGRWAVVSGATDGIGKAYAEELASRG
              40        50        60        70        80        90 

             80        90       100       110        120       130 
pF1KE5 LNVVLISRTLEKLEAIATEIERTTGRSVKIIQADFTKD-DIYEHIKEKLAGLEIGILVNN
       ::..::::. :::...: .:  :    . :: :::..  .::  :.: :   ..::::::
CCDS10 LNIILISRNEEKLQVVAKDIADTYKVETDIIVADFSSGREIYLPIREALKDKDVGILVNN
             100       110       120       130       140       150 

              140        150       160       170       180         
pF1KE5 VGML-PNLLPSHFLN-APDEIQSLIHCNITSVVKMTQLILKHMESRQKGLILNISSGIAL
       ::.. :   :..: . . :.. ..:. ::...  :....:  :  :.:: :..::::   
CCDS10 VGVFYP--YPQYFTQLSEDKLWDIINVNIAAASLMVHVVLPGMVERKKGAIVTISSGSCC
               160       170       180       190       200         

     190       200       210       220       230       240         
pF1KE5 FPWPLYSMYSASKAFVCAFSKALQEEYKAKEVIIQVLTPYAVSTAMTKYLN----TNVIT
        : :  . .:::::..  ::.::: :: .: ...: : :. :.:.::   :     . ..
CCDS10 KPTPQLAAFSASKAYLDHFSRALQYEYASKGIFVQSLIPFYVATSMTAPSNFLHRCSWLV
     210       220       230       240       250       260         

         250       260       270       280       290       300     
pF1KE5 KTADEFVKESLNYVTIGGETCGCLAHEILAGFLSLIPAWAFYSGAFQRLLLTHYVAYLKL
        .   ....... . :. .: :  .: :   : . .: : .  ::               
CCDS10 PSPKVYAHHAVSTLGISKRTTGYWSHSIQFLFAQYMPEWLWVWGANILNRSLRKEALSCT
     270       280       290       300       310       320         

         310
pF1KE5 NTKVR
            
CCDS10 A    
     330    

>>CCDS54046.1 HSDL1 gene_id:83693|Hs108|chr16             (275 aa)
 initn: 405 init1: 227 opt: 268  Z-score: 343.5  bits: 71.5 E(32554): 8.2e-13
Smith-Waterman score: 337; 31.0% identity (55.6% similar) in 252 aa overlap (43-290:62-259)

             20        30        40        50        60        70  
pF1KE5 GLLVCLACLAKCVRFSRCVLLNYWKVLPKSFLRSMGQWAVITGAGDGIGKAYSFELAKRG
                                     .....:.:::..:: :::::::. :::.::
CCDS54 YTARKSITVICDFYSLIRLHFIPRLGSRADLIKQYGRWAVVSGATDGIGKAYAEELASRG
              40        50        60        70        80        90 

             80        90       100       110       120       130  
pF1KE5 LNVVLISRTLEKLEAIATEIERTTGRSVKIIQADFTKDDIYEHIKEKLAGLEIGILVNNV
       ::..::::. :::.                    ::.      ..:              
CCDS54 LNIILISRNEEKLQY-------------------FTQ------LSE--------------
             100                                110                

            140       150       160       170       180       190  
pF1KE5 GMLPNLLPSHFLNAPDEIQSLIHCNITSVVKMTQLILKHMESRQKGLILNISSGIALFPW
                      :.. ..:. ::...  :....:  :  :.:: :..::::    : 
CCDS54 ---------------DKLWDIINVNIAAASLMVHVVLPGMVERKKGAIVTISSGSCCKPT
                           120       130       140       150       

            200       210       220       230       240            
pF1KE5 PLYSMYSASKAFVCAFSKALQEEYKAKEVIIQVLTPYAVSTAMTKYLN----TNVITKTA
       :  . .:::::..  ::.::: :: .: ...: : :. :.:.::   :     . .. . 
CCDS54 PQLAAFSASKAYLDHFSRALQYEYASKGIFVQSLIPFYVATSMTAPSNFLHRCSWLVPSP
       160       170       180       190       200       210       

      250       260       270       280       290       300        
pF1KE5 DEFVKESLNYVTIGGETCGCLAHEILAGFLSLIPAWAFYSGAFQRLLLTHYVAYLKLNTK
         ....... . :. .: :  .: :   : . .: : .  ::                  
CCDS54 KVYAHHAVSTLGISKRTTGYWSHSIQFLFAQYMPEWLWVWGANILNRSLRKEALSCTA  
       220       230       240       250       260       270       

      310
pF1KE5 VR

>>CCDS9743.1 DHRS7 gene_id:51635|Hs108|chr14              (339 aa)
 initn: 232 init1: 118 opt: 250  Z-score: 319.4  bits: 67.4 E(32554): 1.8e-11
Smith-Waterman score: 250; 27.7% identity (57.8% similar) in 256 aa overlap (3-247:4-253)

                10        20          30         40        50      
pF1KE5  MGDVLEQFFILTGLLVCLACLAKCVRFSR--CVLLNYWK-VLPKSFLRSMGQWAVITGA
          ..:  ...: .::. :. : . .: .    .:   :.   :.  : .:  :  .:::
CCDS97 MNWELLLWLLVLCALLLLLVQLLRFLRADGDLTLLWAEWQGRRPEWELTDMVVW--VTGA
               10        20        30        40        50          

         60        70        80        90          100       110   
pF1KE5 GDGIGKAYSFELAKRGLNVVLISRTLEKLEAIAT---EIERTTGRSVKIIQADFTKDDIY
       ..:::.  ...:.: :...:: .: ...:: .     :      ... ..  :.:  :  
CCDS97 SSGIGEELAYQLSKLGVSLVLSARRVHELERVKRRCLENGNLKEKDILVLPLDLT--DTG
       60        70        80        90       100       110        

           120           130       140       150       160         
pF1KE5 EHIKEKLAGLE----IGILVNNVGMLPNLLPSHFLNAPDEIQSLIHCNITSVVKMTQLIL
        :     : :.    : ::::: ::    :     .. :  ..::. :  ..:..:. .:
CCDS97 SHEAATKAVLQEFGRIDILVNNGGMSQRSLCMD--TSLDVYRKLIELNYLGTVSLTKCVL
        120       130       140         150       160       170    

     170       180       190       200       210       220         
pF1KE5 KHMESRQKGLILNISSGIALFPWPLYSMYSASKAFVCAFSKALQEEYKAKE-VIIQVLTP
        ::  :..: :....: ....  ::   : :::  . .: ..:. :  .   .:.. . :
CCDS97 PHMIERKQGKIVTVNSILGIISVPLSIGYCASKHALRGFFNGLRTELATYPGIIVSNICP
          180       190       200       210       220       230    

      230       240       250       260       270       280        
pF1KE5 YAVSTAMTKYLNTNVITKTADEFVKESLNYVTIGGETCGCLAHEILAGFLSLIPAWAFYS
         :.. ...   .. .:::                                         
CCDS97 GPVQSNIVENSLAGEVTKTIGNNGDQSHKMTTSRCVRLMLISMANDLKEVWISEQPFLLV
          240       250       260       270       280       290    

>>CCDS9605.1 DHRS4 gene_id:10901|Hs108|chr14              (278 aa)
 initn: 124 init1: 105 opt: 242  Z-score: 310.6  bits: 65.4 E(32554): 5.6e-11
Smith-Waterman score: 242; 27.9% identity (62.2% similar) in 222 aa overlap (51-266:35-248)

               30        40        50        60        70        80
pF1KE5 LAKCVRFSRCVLLNYWKVLPKSFLRSMGQWAVITGAGDGIGKAYSFELAKRGLNVVLISR
                                     :..:.. :::: : . .::. : .::. ::
CCDS96 GLLGLCARAWNSVRMASSGMTRRDPLANKVALVTASTDGIGFAIARRLAQDGAHVVVSSR
           10        20        30        40        50        60    

                90          100       110       120       130      
pF1KE5 TLEKLE-AIAT---EIERTTGRSVKIIQADFTKDDIYEHIKEKLAGLEIGILVNNVGMLP
         .... :.::   :   .::   .. .:.  .  .   .:  : :  : :::.:... :
CCDS96 KQQNVDQAVATLQGEGLSVTGTVCHVGKAEDRERLVATAVK--LHG-GIDILVSNAAVNP
           70        80        90       100          110       120 

        140       150        160       170       180       190     
pF1KE5 NLLPSHFLNAPDEI-QSLIHCNITSVVKMTQLILKHMESRQKGLILNISSGIALFPWPLY
        .  . .... .:. .. .  :. . . ::. .. .::.:  : .. .::  :. : : .
CCDS96 FF--GSIMDVTEEVWDKTLDINVKAPALMTKAVVPEMEKRGGGSVVIVSSIAAFSPSPGF
               130       140       150       160       170         

         200       210       220       230       240       250     
pF1KE5 SMYSASKAFVCAFSKALQEEYKAKEVIIQVLTPYAVSTAMTKYLNTNVITKTADEFVKES
       : :..::. . ...:.:  :   ... .. :.:  ..:.....:    . :  .: .::.
CCDS96 SPYNVSKTALLGLTKTLAIELAPRNIRVNCLAPGLIKTSFSRML---WMDKEKEESMKET
     180       190       200       210       220          230      

         260        270       280       290       300       310
pF1KE5 LNYVTIG-GETCGCLAHEILAGFLSLIPAWAFYSGAFQRLLLTHYVAYLKLNTKVR
       :    .:  : :                                            
CCDS96 LRIRRLGEPEDCAGIVSFLCSEDASYITGETVVVGGGTPSRL              
        240       250       260       270                      




310 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov  8 00:24:28 2016 done: Tue Nov  8 00:24:28 2016
 Total Scan time:  1.820 Total Display time:  0.000

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or suggestions?
Send a message to flexiclone AT kazusagt.com