Result of FASTA (ccds) for pFN21AB4637
FASTA searches a protein or DNA sequence data bank
 version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB4637, 300 aa
  1>>>pF1KB4637 300 - 300 aa - 300 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 6.7072+/-0.000713; mu= 10.4802+/- 0.043
 mean_var=116.7727+/-22.883, 0's: 0 Z-trim(113.8): 21  B-trim: 0 in 0/51
 Lambda= 0.118687
 statistics sampled from 14368 (14383) to 14368 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.787), E-opt: 0.2 (0.442), width:  16
 Scan time:  2.450

The best scores are:                                      opt bits E(32554)
CCDS85.1 PHF13 gene_id:148479|Hs108|chr1           ( 300) 2093 368.5 3.4e-102
CCDS67144.1 PHF23 gene_id:79142|Hs108|chr17        ( 336)  434 84.5 1.2e-16
CCDS67143.1 PHF23 gene_id:79142|Hs108|chr17        ( 399)  434 84.5 1.4e-16
CCDS42250.1 PHF23 gene_id:79142|Hs108|chr17        ( 403)  434 84.5 1.4e-16


>>CCDS85.1 PHF13 gene_id:148479|Hs108|chr1                (300 aa)
 initn: 2093 init1: 2093 opt: 2093  Z-score: 1948.1  bits: 368.5 E(32554): 3.4e-102
Smith-Waterman score: 2093; 100.0% identity (100.0% similar) in 300 aa overlap (1-300:1-300)

               10        20        30        40        50        60
pF1KB4 MDSDSCAAAFHPEEYSPSCKRRRTVEDFNKFCTFVLAYAGYIPYPKEELPLRSSPSPANS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 MDSDSCAAAFHPEEYSPSCKRRRTVEDFNKFCTFVLAYAGYIPYPKEELPLRSSPSPANS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB4 TAGTIDSDGWDAGFSDIASSVPLPVSDRCFSHLQPTLLQRAKPSNFLLDRKKTDKLKKKK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 TAGTIDSDGWDAGFSDIASSVPLPVSDRCFSHLQPTLLQRAKPSNFLLDRKKTDKLKKKK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB4 KRKRRDSDAPGKEGYRGGLLKLEAADPYVETPTSPTLQDIPQAPSDPCSGWDSDTPSSGS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 KRKRRDSDAPGKEGYRGGLLKLEAADPYVETPTSPTLQDIPQAPSDPCSGWDSDTPSSGS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB4 CATVSPDQVKEIKTEGKRTIVRQGKQVVFRDEDSTGNDEDIMVDSDDDSWDLVTCFCMKP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 CATVSPDQVKEIKTEGKRTIVRQGKQVVFRDEDSTGNDEDIMVDSDDDSWDLVTCFCMKP
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB4 FAGRPMIECNECHTWIHLSCAKIRKSNVPEVFVCQKCRDSKFDIRRSNRSRTGSRKLFLD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 FAGRPMIECNECHTWIHLSCAKIRKSNVPEVFVCQKCRDSKFDIRRSNRSRTGSRKLFLD
              250       260       270       280       290       300

>>CCDS67144.1 PHF23 gene_id:79142|Hs108|chr17             (336 aa)
 initn: 585 init1: 395 opt: 434  Z-score: 412.2  bits: 84.5 E(32554): 1.2e-16
Smith-Waterman score: 569; 36.8% identity (56.1% similar) in 321 aa overlap (10-286:17-326)

                      10        20        30        40        50   
pF1KB4        MDSDSCAAAFHPEEYSPSCKRRRTVEDFNKFCTFVLAYAGYIPYPKEELPLRS
                       ..::   :  :::::.:::::::.:::::::::: :..: :  .
CCDS67 MLEAMAEPSPEDPPPTLKPETQPPE-KRRRTIEDFNKFCSFVLAYAGYIP-PSKEAPDSA
               10        20         30        40         50        

            60         70                   80        90           
pF1KB4 SPSPANSTAGTI-DSDGWDAG--FSDI---------ASSVPLPVSDRCFSH---------
       .     .   .. : ::  ..  .:           :. .:.:.:.  .::         
CCDS67 TLLEKMKLKDSLFDLDGPKVASPLSPTSLTHTSRPPAALTPVPLSQGDLSHPPRKKDRKN
       60        70        80        90       100       110        

                    100       110       120             130        
pF1KB4 --LQPT------LLQRAKPSNFLLDRKKTDKLKKKKKRKRRDSD------APGKEGYRGG
         : :       .:.: .:.    : .: ...::.:::: . ..       ::      .
CCDS67 RKLGPGAGAGFGVLRRPRPTP--GDGEKRSRIKKSKKRKLKKAERGDRLPPPGPPQAPPS
      120       130         140       150       160       170      

      140       150       160         170       180       190      
pF1KB4 LLKLEAADPYVETPTSPTLQDI--PQAPSDPCSGWDSDTPSSGSCATVSPDQVKEIKTEG
           :  .   :      .  .   .::  :      ..:     ::: :. :    .:.
CCDS67 DTDSEEEEEEEEEEEEEEMATVVGGEAPV-PVLPTPPEAPRPP--ATVHPEGVPPADSES
        180       190       200        210         220       230   

        200       210              220       230       240         
pF1KB4 KRTIVRQGKQVVFRDEDSTGN-------DEDIMVDSDDDSWDLVTCFCMKPFAGRPMIEC
       :..    :.  . .: :....       ::::::.: ::::::.::.: :::::::::::
CCDS67 KEV----GSTETSQDGDASSSEGEMRVMDEDIMVESGDDSWDLITCYCRKPFAGRPMIEC
               240       250       260       270       280         

     250       260       270       280       290       300
pF1KB4 NECHTWIHLSCAKIRKSNVPEVFVCQKCRDSKFDIRRSNRSRTGSRKLFLD
       . : ::::::::::.:.:::. : ::::.. . . ::              
CCDS67 SLCGTWIHLSCAKIKKTNVPDFFYCQKCKELRPEARRLGGPPKSGEP    
     290       300       310       320       330          

>>CCDS67143.1 PHF23 gene_id:79142|Hs108|chr17             (399 aa)
 initn: 667 init1: 395 opt: 434  Z-score: 411.1  bits: 84.5 E(32554): 1.4e-16
Smith-Waterman score: 435; 33.7% identity (53.9% similar) in 258 aa overlap (39-286:158-389)

       10        20        30        40        50        60        
pF1KB4 AFHPEEYSPSCKRRRTVEDFNKFCTFVLAYAGYIPYPKEELPLRSSPSPANSTAGTIDSD
                                     :.  : :  .  :   :   .     . . 
CCDS67 KLKDSLFDLDGPKVASPLSPTSLTHTSRPPAALTPVPLSQGDLSHPPRKKDRKNRKL-GP
       130       140       150       160       170       180       

       70        80        90       100       110       120        
pF1KB4 GWDAGFSDIASSVPLPVSDRCFSHLQPTLLQRAKPSNFLLDRKKTDKLKKKKKRKRRDSD
       :  :::. .    : : . .  :... .  .. : .      .. :.:      .   ::
CCDS67 GAGAGFGVLRRPRPTPGDGEKRSRIKKSKKRKLKKA------ERGDRLPPPGPPQAPPSD
        190       200       210       220             230       240

      130       140          150       160       170       180     
pF1KB4 APGKEGYRGGLLKLEAADPYV---ETPTSPTLQDIPQAPSDPCSGWDSDTPSSGSCATVS
       . ..:  .    . :     :   :.:. :.:   :.::  :              ::: 
CCDS67 TDSEEEEEEEEEEEEEEMATVVGGEAPV-PVLPTPPEAPRPP--------------ATVH
              250       260        270       280                   

         190       200       210              220       230        
pF1KB4 PDQVKEIKTEGKRTIVRQGKQVVFRDEDSTGN-------DEDIMVDSDDDSWDLVTCFCM
       :. :    .:.:..    :.  . .: :....       ::::::.: ::::::.::.: 
CCDS67 PEGVPPADSESKEV----GSTETSQDGDASSSEGEMRVMDEDIMVESGDDSWDLITCYCR
         290           300       310       320       330       340 

      240       250       260       270       280       290        
pF1KB4 KPFAGRPMIECNECHTWIHLSCAKIRKSNVPEVFVCQKCRDSKFDIRRSNRSRTGSRKLF
       :::::::::::. : ::::::::::.:.:::. : ::::.. . . ::            
CCDS67 KPFAGRPMIECSLCGTWIHLSCAKIKKTNVPDFFYCQKCKELRPEARRLGGPPKSGEP  
             350       360       370       380       390           

      300
pF1KB4 LD

>>CCDS42250.1 PHF23 gene_id:79142|Hs108|chr17             (403 aa)
 initn: 641 init1: 395 opt: 434  Z-score: 411.1  bits: 84.5 E(32554): 1.4e-16
Smith-Waterman score: 435; 33.7% identity (53.9% similar) in 258 aa overlap (39-286:162-393)

       10        20        30        40        50        60        
pF1KB4 AFHPEEYSPSCKRRRTVEDFNKFCTFVLAYAGYIPYPKEELPLRSSPSPANSTAGTIDSD
                                     :.  : :  .  :   :   .     . . 
CCDS42 KLKDSLFDLDGPKVASPLSPTSLTHTSRPPAALTPVPLSQGDLSHPPRKKDRKNRKL-GP
             140       150       160       170       180        190

       70        80        90       100       110       120        
pF1KB4 GWDAGFSDIASSVPLPVSDRCFSHLQPTLLQRAKPSNFLLDRKKTDKLKKKKKRKRRDSD
       :  :::. .    : : . .  :... .  .. : .      .. :.:      .   ::
CCDS42 GAGAGFGVLRRPRPTPGDGEKRSRIKKSKKRKLKKA------ERGDRLPPPGPPQAPPSD
              200       210       220             230       240    

      130       140          150       160       170       180     
pF1KB4 APGKEGYRGGLLKLEAADPYV---ETPTSPTLQDIPQAPSDPCSGWDSDTPSSGSCATVS
       . ..:  .    . :     :   :.:. :.:   :.::  :              ::: 
CCDS42 TDSEEEEEEEEEEEEEEMATVVGGEAPV-PVLPTPPEAPRPP--------------ATVH
          250       260       270        280                       

         190       200       210              220       230        
pF1KB4 PDQVKEIKTEGKRTIVRQGKQVVFRDEDSTGN-------DEDIMVDSDDDSWDLVTCFCM
       :. :    .:.:..    :.  . .: :....       ::::::.: ::::::.::.: 
CCDS42 PEGVPPADSESKEV----GSTETSQDGDASSSEGEMRVMDEDIMVESGDDSWDLITCYCR
     290       300           310       320       330       340     

      240       250       260       270       280       290        
pF1KB4 KPFAGRPMIECNECHTWIHLSCAKIRKSNVPEVFVCQKCRDSKFDIRRSNRSRTGSRKLF
       :::::::::::. : ::::::::::.:.:::. : ::::.. . . ::            
CCDS42 KPFAGRPMIECSLCGTWIHLSCAKIKKTNVPDFFYCQKCKELRPEARRLGGPPKSGEP  
         350       360       370       380       390       400     

      300
pF1KB4 LD




300 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov  4 03:07:17 2016 done: Fri Nov  4 03:07:17 2016
 Total Scan time:  2.450 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or suggestions?
Send a message to flexiclone AT kazusagt.com