Result of FASTA (ccds) for pFN21AB8983
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8983, 384 aa
  1>>>pF1KB8983 384 - 384 aa - 384 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 9.9681+/-0.00087; mu= -0.1608+/- 0.053
 mean_var=334.2642+/-68.978, 0's: 0 Z-trim(118.2): 43  B-trim: 188 in 1/53
 Lambda= 0.070150
 statistics sampled from 19003 (19046) to 19003 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.839), E-opt: 0.2 (0.585), width:  16
 Scan time:  3.760

The best scores are:                                      opt bits E(32554)
CCDS13552.1 SOX18 gene_id:54345|Hs108|chr20        ( 384) 2712 287.3 1.6e-77
CCDS6159.1 SOX17 gene_id:64321|Hs108|chr8          ( 414)  683 81.9 1.1e-15
CCDS5977.1 SOX7 gene_id:83595|Hs108|chr8           ( 388)  559 69.4 6.4e-12


>>CCDS13552.1 SOX18 gene_id:54345|Hs108|chr20             (384 aa)
 initn: 2712 init1: 2712 opt: 2712  Z-score: 1505.2  bits: 287.3 E(32554): 1.6e-77
Smith-Waterman score: 2712; 100.0% identity (100.0% similar) in 384 aa overlap (1-384:1-384)

               10        20        30        40        50        60
pF1KB8 MQRSPPGYGAQDDPPARRDCAWAPGHGAAADTRGLAAGPAALAAPAAPASPPSPQRSPPR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MQRSPPGYGAQDDPPARRDCAWAPGHGAAADTRGLAAGPAALAAPAAPASPPSPQRSPPR
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 SPEPGRYGLSPAGRGERQAADESRIRRPMNAFMVWAKDERKRLAQQNPDLHNAVLSKMLG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 SPEPGRYGLSPAGRGERQAADESRIRRPMNAFMVWAKDERKRLAQQNPDLHNAVLSKMLG
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 KAWKELNAAEKRPFVEEAERLRVQHLRDHPNYKYRPRRKKQARKARRLEPGLLLPGLAPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 KAWKELNAAEKRPFVEEAERLRVQHLRDHPNYKYRPRRKKQARKARRLEPGLLLPGLAPP
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB8 QPPPEPFPAASGSARAFRELPPLGAEFDGLGLPTPERSPLDGLEPGEAAFFPPPAAPEDC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 QPPPEPFPAASGSARAFRELPPLGAEFDGLGLPTPERSPLDGLEPGEAAFFPPPAAPEDC
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB8 ALRPFRAPYAPTELSRDPGGCYGAPLAEALRTAPPAAPLAGLYYGTLGTPGPYPGPLSPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 ALRPFRAPYAPTELSRDPGGCYGAPLAEALRTAPPAAPLAGLYYGTLGTPGPYPGPLSPP
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB8 PEAPPLESAEPLGPAADLWADVDLTEFDQYLNCSRTRPDAPGLPYHVALAKLGPRAMSCP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 PEAPPLESAEPLGPAADLWADVDLTEFDQYLNCSRTRPDAPGLPYHVALAKLGPRAMSCP
              310       320       330       340       350       360

              370       380    
pF1KB8 EESSLISALSDASSAVYYSACISG
       ::::::::::::::::::::::::
CCDS13 EESSLISALSDASSAVYYSACISG
              370       380    

>>CCDS6159.1 SOX17 gene_id:64321|Hs108|chr8               (414 aa)
 initn: 819 init1: 541 opt: 683  Z-score: 395.0  bits: 81.9 E(32554): 1.1e-15
Smith-Waterman score: 824; 44.9% identity (60.0% similar) in 390 aa overlap (23-346:3-380)

               10        20        30          40          50      
pF1KB8 MQRSPPGYGAQDDPPARRDCAWAPGHGAAAD--TRGLAAGPAALAA--PAAPASPPSPQR
                             .:  : :.:  ..  .: ::..:.  :   :   ::  
CCDS61                     MSSPDAGYASDDQSQTQSALPAVMAGLGPCPWAESLSPIG
                                   10        20        30        40

         60         70        80        90       100       110     
pF1KB8 SPPRSPE-PGRYGLSPAGRGERQAADESRIRRPMNAFMVWAKDERKRLAQQNPDLHNAVL
       .   . : :.  : .::: . : :  :::::::::::::::::::::::::::::::: :
CCDS61 DMKVKGEAPANSG-APAGAAGR-AKGESRIRRPMNAFMVWAKDERKRLAQQNPDLHNAEL
               50         60         70        80        90        

         120       130       140       150       160       170     
pF1KB8 SKMLGKAWKELNAAEKRPFVEEAERLRVQHLRDHPNYKYRPRRKKQARKARRLEPGLLLP
       ::::::.:: :. :::::::::::::::::..:::::::::::.::... .:.: :.:  
CCDS61 SKMLGKSWKALTLAEKRPFVEEAERLRVQHMQDHPNYKYRPRRRKQVKRLKRVEGGFL-H
      100       110       120       130       140       150        

         180                          190            200        210
pF1KB8 GLAPPQ-----P--------------PPEPFPAASG-----SARAFRELPPLGAE-FDGL
       ::: ::     :              : . :::.        .  .:.   :::  .:: 
CCDS61 GLAEPQAAALGPEGGRVAMDGLGLQFPEQGFPAGPPLLPPHMGGHYRDCQSLGAPPLDGY
       160       170       180       190       200       210       

              220       230       240       250       260       270
pF1KB8 GLPTPERSPLDGLEPGEAAFFPPPAAPEDCALRPFRAPYAPTELSRDPGGCYGAPLAEAL
        ::::. :::::..: . :::  :  : ::   :  . :. ...: : .:    : .   
CCDS61 PLPTPDTSPLDGVDP-DPAFFAAPM-PGDC---PAAGTYSYAQVS-DYAGPPEPPAGPMH
       220       230        240           250        260       270 

                 280                290                            
pF1KB8 -RTAP-PAAP-LAGL---------YYGTLGTPG-----------------------PYPG
        : .: ::.: . ::         :::..:.::                       : ::
CCDS61 PRLGPEPAGPSIPGLLAPPSALHVYYGAMGSPGAGGGRGFQMQPQHQHQHQHQHHPPGPG
             280       290       300       310       320       330 

         300        310       320       330       340       350    
pF1KB8 PLSPPPEAPPL-ESAEPLGPAADLWADVDLTEFDQYLNCSRTRPDAPGLPYHVALAKLGP
         :::::: :  ....:  :: .: ..:: :::.:::.    .:.  ::::.        
CCDS61 QPSPPPEALPCRDGTDPSQPA-ELLGEVDRTEFEQYLHFV-CKPEM-GLPYQGHDSGVNL
             340       350        360       370         380        

          360       370       380    
pF1KB8 RAMSCPEESSLISALSDASSAVYYSACISG
                                     
CCDS61 PDSHGAISSVVSDASSAVYYCNYPDV    
      390       400       410        

>>CCDS5977.1 SOX7 gene_id:83595|Hs108|chr8                (388 aa)
 initn: 745 init1: 480 opt: 559  Z-score: 327.5  bits: 69.4 E(32554): 6.4e-12
Smith-Waterman score: 758; 42.3% identity (58.2% similar) in 397 aa overlap (30-371:2-376)

               10        20        30        40        50        60
pF1KB8 MQRSPPGYGAQDDPPARRDCAWAPGHGAAADTRGLAAGPAALAAPAAPASPPSPQRSPPR
                                    :.  :    : .:  ::  :   . : ::: 
CCDS59                             MASLLGAYPWPEGLECPALDAELSDGQ-SPPA
                                           10        20         30 

               70        80        90       100       110       120
pF1KB8 SPEPGRYGLSPAGRGERQAADESRIRRPMNAFMVWAKDERKRLAQQNPDLHNAVLSKMLG
        :.:      :. .:     .::::::::::::::::::::::: :::::::: ::::::
CCDS59 VPRP------PGDKG-----SESRIRRPMNAFMVWAKDERKRLAVQNPDLHNAELSKMLG
                    40             50        60        70        80

              130       140       150       160        170         
pF1KB8 KAWKELNAAEKRPFVEEAERLRVQHLRDHPNYKYRPRRKKQARK-ARRLEPGLLLPGLAP
       :.:: :. ..:::.:.::::::.::..:.:::::::::::::..  .:..::.:: .:. 
CCDS59 KSWKALTLSQKRPYVDEAERLRLQHMQDYPNYKYRPRRKKQAKRLCKRVDPGFLLSSLSR
               90       100       110       120       130       140

     180        190                         200       210          
pF1KB8 PQPP-PEPFPAASG------------------SARAFRELPPLGAEFDGL---------G
        :   ::   .. :                  : :.  .  : :.   :          :
CCDS59 DQNALPEKRSGSRGALGEKEDRGEYSPGTALPSLRGCYHEGPAGGGGGGTPSSVDTYPYG
              150       160       170       180       190       200

              220       230       240       250            260     
pF1KB8 LPTP-ERSPLDGLEPGEAAFFPPPAAPEDCALRPFRAPYAP-----TELSRDPGGCYGAP
       :::: : :::: ::: : .::  :   :    .: : :. :      : . .:  : . :
CCDS59 LPTPPEMSPLDVLEP-EQTFFSSPCQEEHG--HPRRIPHLPGHPYSPEYAPSPLHC-SHP
              210        220         230       240       250       

          270               280       290              300         
pF1KB8 LAE-ALRTAP--------PAAPLAGLYYGTLGTPGP-------YPGPLSPPPEAPPLESA
       :.  ::  .:        :. : .  ::.  .:  :       . : :::::: : ... 
CCDS59 LGSLALGQSPGVSMMSPVPGCPPSPAYYSP-ATYHPLHSNLQAHLGQLSPPPEHPGFDAL
        260       270       280        290       300       310     

     310       320       330        340          350       360     
pF1KB8 EPLGPAADLWADVDLTEFDQYLNCSRTRPD-APG---LPYHVALAKLGPRAMSCPEESSL
       . :. . .: .:.: .::::::: .  .:: : :   :  :: .... :   . : :.::
CCDS59 DQLSQV-ELLGDMDRNEFDQYLN-TPGHPDSATGAMALSGHVPVSQVTP---TGPTETSL
         320        330        340       350       360          370

         370       380    
pF1KB8 ISALSDASSAVYYSACISG
       ::.:.:             
CCDS59 ISVLADATATYYNSYSVS 
              380         




384 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov  8 04:25:10 2016 done: Tue Nov  8 04:25:11 2016
 Total Scan time:  3.760 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]