Result of FASTA (ccds) for pFN21AE4306
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE4306, 306 aa
  1>>>pF1KE4306 306 - 306 aa - 306 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 5.0103+/-0.000693; mu= 17.5596+/- 0.042
 mean_var=63.1138+/-12.626, 0's: 0 Z-trim(109.7): 10  B-trim: 0 in 0/51
 Lambda= 0.161440
 statistics sampled from 11075 (11080) to 11075 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.715), E-opt: 0.2 (0.34), width:  16
 Scan time:  2.380

The best scores are:                                      opt bits E(32554)
CCDS447.1 PPT1 gene_id:5538|Hs108|chr1             ( 306) 2100 497.2 6.3e-141
CCDS44119.1 PPT1 gene_id:5538|Hs108|chr1           ( 203) 1108 266.1 1.6e-71
CCDS4742.1 PPT2 gene_id:9374|Hs108|chr6            ( 302)  257 68.0   1e-11
CCDS4740.1 PPT2 gene_id:9374|Hs108|chr6            ( 308)  257 68.0   1e-11
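
Note: the rows of the "best scores" table above can be read into records programmatically. The following is a minimal sketch, not part of the FASTA output; the names HIT_RE and parse_hit are hypothetical, and the pattern assumes the column layout shown in this report (accession, description, length in parentheses, opt, bits, E-value).

```python
import re

# Hypothetical pattern for one row of the "The best scores are:" table.
# Assumed layout: accession, free-text description, "( length)", opt, bits, E().
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<length>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<evalue>[\d.eE+-]+)\s*$"
)


def parse_hit(line):
    """Return a dict for one hit row, or None if the line does not match."""
    m = HIT_RE.match(line)
    if m is None:
        return None
    return {
        "accession": m.group("acc"),
        "description": m.group("desc"),
        "length": int(m.group("length")),
        "opt": int(m.group("opt")),
        "bits": float(m.group("bits")),
        "evalue": float(m.group("evalue")),
    }


# Two rows copied from the table above.
rows = [
    "CCDS447.1 PPT1 gene_id:5538|Hs108|chr1             ( 306) 2100 497.2 6.3e-141",
    "CCDS4742.1 PPT2 gene_id:9374|Hs108|chr6            ( 302)  257 68.0   1e-11",
]
hits = [parse_hit(r) for r in rows]
```

Lines that are not hit rows (headers, blank lines) yield None, so a caller can filter them out before use.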


>>CCDS447.1 PPT1 gene_id:5538|Hs108|chr1                  (306 aa)
 initn: 2100 init1: 2100 opt: 2100  Z-score: 2643.6  bits: 497.2 E(32554): 6.3e-141
Smith-Waterman score: 2100; 100.0% identity (100.0% similar) in 306 aa overlap (1-306:1-306)

               10        20        30        40        50        60
pF1KE4 MASPGCLWLLAVALLPWTCASRALQHLDPPAPLPLVIWHGMGDSCCNPLSMGAIKKMVEK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MASPGCLWLLAVALLPWTCASRALQHLDPPAPLPLVIWHGMGDSCCNPLSMGAIKKMVEK
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE4 KIPGIYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKDPKLQQGYNAMGFSQGGQF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 KIPGIYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKDPKLQQGYNAMGFSQGGQF
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE4 LRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHICDFIRKTLNAGAYSKVVQERL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 LRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHICDFIRKTLNAGAYSKVVQERL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE4 VQAEYWHDPIKEDVYRNHSIFLADINQERGINESYKKNLMALKKFVMVKFLNDSIVDPVD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 VQAEYWHDPIKEDVYRNHSIFLADINQERGINESYKKNLMALKKFVMVKFLNDSIVDPVD
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE4 SEWFGFYRSGQAKETIPLQETSLYTQDRLGLKEMDNAGQLVFLATEGDHLQLSEEWFYAH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 SEWFGFYRSGQAKETIPLQETSLYTQDRLGLKEMDNAGQLVFLATEGDHLQLSEEWFYAH
              250       260       270       280       290       300

             
pF1KE4 IIPFLG
       ::::::
CCDS44 IIPFLG
             

>>CCDS44119.1 PPT1 gene_id:5538|Hs108|chr1                (203 aa)
 initn: 1098 init1: 1098 opt: 1108  Z-score: 1397.5  bits: 266.1 E(32554): 1.6e-71
Smith-Waterman score: 1190; 66.3% identity (66.3% similar) in 306 aa overlap (1-306:1-203)

               10        20        30        40        50        60
pF1KE4 MASPGCLWLLAVALLPWTCASRALQHLDPPAPLPLVIWHGMGDSCCNPLSMGAIKKMVEK
       ::::::::::::::::::::::::::::::::::::::::::                  
CCDS44 MASPGCLWLLAVALLPWTCASRALQHLDPPAPLPLVIWHGMG------------------
               10        20        30        40                    

               70        80        90       100       110       120
pF1KE4 KIPGIYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKDPKLQQGYNAMGFSQGGQF
                                                                   
CCDS44 ------------------------------------------------------------
                                                                   

              130       140       150       160       170       180
pF1KE4 LRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHICDFIRKTLNAGAYSKVVQERL
                                :::::::::::::::::::::::::::::::::::
CCDS44 -------------------------VFGLPRCPGESSHICDFIRKTLNAGAYSKVVQERL
                                      50        60        70       

              190       200       210       220       230       240
pF1KE4 VQAEYWHDPIKEDVYRNHSIFLADINQERGINESYKKNLMALKKFVMVKFLNDSIVDPVD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 VQAEYWHDPIKEDVYRNHSIFLADINQERGINESYKKNLMALKKFVMVKFLNDSIVDPVD
        80        90       100       110       120       130       

              250       260       270       280       290       300
pF1KE4 SEWFGFYRSGQAKETIPLQETSLYTQDRLGLKEMDNAGQLVFLATEGDHLQLSEEWFYAH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 SEWFGFYRSGQAKETIPLQETSLYTQDRLGLKEMDNAGQLVFLATEGDHLQLSEEWFYAH
       140       150       160       170       180       190       

             
pF1KE4 IIPFLG
       ::::::
CCDS44 IIPFLG
       200   
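
Note: the 66.3% identity reported for this alignment is consistent with counting identical columns over the full overlap, gap columns included: 203 identical positions in 306 columns. The sketch below is not part of the FASTA output; percent_identity is a hypothetical helper, and the assumption that gap columns count toward the denominator is inferred from the numbers in this report.

```python
def percent_identity(query_row, subject_row):
    """Percent of alignment columns where both rows carry the same residue.

    Gap columns ('-') are counted in the denominator but never as identical.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("aligned rows must have equal length")
    identical = sum(
        1 for q, s in zip(query_row, subject_row) if q == s and q != "-"
    )
    return 100.0 * identical / len(query_row)


# Third 60-column block of the alignment above (query positions 121-180).
query_block = "LRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHICDFIRKTLNAGAYSKVVQERL"
subject_block = "-" * 25 + "VFGLPRCPGESSHICDFIRKTLNAGAYSKVVQERL"
block_identity = percent_identity(query_block, subject_block)

# Whole alignment: identical columns per block are 42, 0, 35, 60, 60 and 6.
identical_total = 42 + 0 + 35 + 60 + 60 + 6
overall_identity = round(100.0 * identical_total / 306, 1)
```

Here identical_total comes to 203, which equals the length of the shorter PPT1 entry: every residue of CCDS44119.1 aligns to an identical query residue, and the remaining 103 columns are gaps.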

>>CCDS4742.1 PPT2 gene_id:9374|Hs108|chr6                 (302 aa)
 initn: 245 init1: 107 opt: 257  Z-score: 323.8  bits: 68.0 E(32554): 1e-11
Smith-Waterman score: 296; 28.5% identity (56.6% similar) in 288 aa overlap (8-281:13-274)

                    10        20        30             40        50
pF1KE4      MASPGCLWLLAVALLPWTCASRALQHLDPPAPL-----PLVIWHGMGDSCCNPLS
                   :.:   :::.      :  :  :::      :... ::. ::     :
CCDS47 MLGLCGQRLPAAWVLL--LLPFL----PLLLLAAPAPHRASYKPVIVVHGLFDSS---YS
               10          20            30        40           50 

               60          70        80        90       100        
pF1KE4 MGAIKKMVEKKIPG--IYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKDPKLQQG
       .  . .....  ::  . ::.:  :.  ..     .. .:..   .:   .:: :   ::
CCDS47 FRHLLEYINETHPGTVVTVLDLFDGRESLRP----LWEQVQGFREAVVPIMAKAP---QG
              60        70        80            90       100       

      110       120       130       140       150       160        
pF1KE4 YNAMGFSQGGQFLRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHICDF----IR
        . . .::::   ::. .   .  . ..::... ..: .:      .....  .    .:
CCDS47 VHLICYSQGGLVCRALLSVMDDHNVDSFISLSSPQMGQYG------DTDYLKWLFPTSMR
          110       120       130       140             150        

          170       180       190       200       210         220  
pF1KE4 KTLNAGAYSKVVQERLVQAEYWHDPIKEDVYRNHSIFLADINQERGINES--YKKNLMAL
       ..:    ::   ::  .  .::::: ..:.: : : ::: :: ::   ..  ..::.. .
CCDS47 SNLYRICYSPWGQEFSI-CNYWHDPHHDDLYLNASSFLALINGERDHPNATVWRKNFLRV
      160       170        180       190       200       210       

            230       240       250        260       270       280 
pF1KE4 KKFVMVKFLNDSIVDPVDSEWFGFYRSGQAKETI-PLQETSLYTQDRLGLKEMDNAGQLV
        ..:..   .:... : .: .::::   .:.::.  ..:  .: .: .::: .   : .:
CCDS47 GHLVLIGGPDDGVITPWQSSFFGFY---DANETVLEMEEQLVYLRDSFGLKTLLARGAIV
       220       230       240          250       260       270    

             290       300         
pF1KE4 FLATEGDHLQLSEEWFYAHIIPFLG   
                                   
CCDS47 RCPMAGISHTAWHSNRTLYETCIEPWLS
          280       290       300  
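
Note: the E(32554) values in this report appear consistent with the relation E = m * n * D * 2^(-bits), where m is the query length, n the library sequence length, and D the number of library sequences. The sketch below is not part of the FASTA output; expect_from_bits is a hypothetical helper, and the relation is an observation checked against the numbers printed here, not a description of how FASTA computes its statistics internally.

```python
def expect_from_bits(bits, query_len, subject_len, n_library_seqs):
    """Approximate expectation value implied by a bit score.

    Assumed relation: E = m * n * D * 2**(-bits).
    """
    return query_len * subject_len * n_library_seqs * 2.0 ** (-bits)


# PPT2 hit CCDS4742.1: 68.0 bits, 302 aa, reported E(32554) = 1e-11.
e_ppt2 = expect_from_bits(68.0, 306, 302, 32554)

# PPT1 hit CCDS44119.1: 266.1 bits, 203 aa, reported E(32554) = 1.6e-71.
e_ppt1_short = expect_from_bits(266.1, 306, 203, 32554)
```

Because the printed bit scores are rounded to one decimal place, the recomputed values can only be expected to agree with the reported E-values to about one significant figure.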

>>CCDS4740.1 PPT2 gene_id:9374|Hs108|chr6                 (308 aa)
 initn: 245 init1: 107 opt: 257  Z-score: 323.7  bits: 68.0 E(32554): 1e-11
Smith-Waterman score: 296; 28.5% identity (56.6% similar) in 288 aa overlap (8-281:19-280)

                          10        20        30             40    
pF1KE4            MASPGCLWLLAVALLPWTCASRALQHLDPPAPL-----PLVIWHGMGDS
                         :.:   :::.      :  :  :::      :... ::. ::
CCDS47 MKSCGSMLGLCGQRLPAAWVLL--LLPFL----PLLLLAAPAPHRASYKPVIVVHGLFDS
               10        20              30        40        50    

           50        60          70        80        90       100  
pF1KE4 CCNPLSMGAIKKMVEKKIPG--IYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKD
            :.  . .....  ::  . ::.:  :.  ..     .. .:..   .:   .:: 
CCDS47 S---YSFRHLLEYINETHPGTVVTVLDLFDGRESLRP----LWEQVQGFREAVVPIMAKA
              60        70        80            90       100       

            110       120       130       140       150       160  
pF1KE4 PKLQQGYNAMGFSQGGQFLRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHICDF
       :   :: . . .::::   ::. .   .  . ..::... ..: .:      .....  .
CCDS47 P---QGVHLICYSQGGLVCRALLSVMDDHNVDSFISLSSPQMGQYG------DTDYLKWL
          110       120       130       140       150              

                170       180       190       200       210        
pF1KE4 ----IRKTLNAGAYSKVVQERLVQAEYWHDPIKEDVYRNHSIFLADINQERGINES--YK
           .:..:    ::   ::  .  .::::: ..:.: : : ::: :: ::   ..  ..
CCDS47 FPTSMRSNLYRICYSPWGQEFSI-CNYWHDPHHDDLYLNASSFLALINGERDHPNATVWR
      160       170       180        190       200       210       

        220       230       240       250        260       270     
pF1KE4 KNLMALKKFVMVKFLNDSIVDPVDSEWFGFYRSGQAKETI-PLQETSLYTQDRLGLKEMD
       ::.. . ..:..   .:... : .: .::::   .:.::.  ..:  .: .: .::: . 
CCDS47 KNFLRVGHLVLIGGPDDGVITPWQSSFFGFY---DANETVLEMEEQLVYLRDSFGLKTLL
       220       230       240          250       260       270    

         280       290       300         
pF1KE4 NAGQLVFLATEGDHLQLSEEWFYAHIIPFLG   
         : .:                            
CCDS47 ARGAIVRCPMAGISHTAWHSNRTLYETCIEPWLS
          280       290       300        




306 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov  5 23:28:03 2016 done: Sat Nov  5 23:28:04 2016
 Total Scan time:  2.380 Total Display time:  0.010

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or suggestions?
Send a message to flexiclone AT kazusagt.com