Result of FASTA (ccds) for pF1KB8940
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8940, 297 aa
  1>>>pF1KB8940 297 - 297 aa - 297 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 9.2632+/-0.000799; mu= -0.4885+/- 0.048
 mean_var=222.1917+/-45.694, 0's: 0 Z-trim(116.1): 170  B-trim: 159 in 1/51
 Lambda= 0.086042
 statistics sampled from 16472 (16645) to 16472 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.81), E-opt: 0.2 (0.511), width:  16
 Scan time:  2.150

The best scores are:                                      opt bits E(32554)
CCDS3378.2 MSX1 gene_id:4487|Hs108|chr4            ( 303) 1963 255.4 3.8e-68
CCDS4392.1 MSX2 gene_id:4488|Hs108|chr5            ( 267)  839 115.9 3.4e-26


>>CCDS3378.2 MSX1 gene_id:4487|Hs108|chr4                 (303 aa)
 initn: 1963 init1: 1963 opt: 1963  Z-score: 1337.0  bits: 255.4 E(32554): 3.8e-68
Smith-Waterman score: 1963; 100.0% identity (100.0% similar) in 297 aa overlap (1-297:7-303)

                     10        20        30        40        50    
pF1KB8       MTSLPLGVKVEDSAFGKPAGGGAGQAPSAAAATAAAMGADEEGAKPKVSPSLLP
             ::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 MAPAADMTSLPLGVKVEDSAFGKPAGGGAGQAPSAAAATAAAMGADEEGAKPKVSPSLLP
               10        20        30        40        50        60

           60        70        80        90       100       110    
pF1KB8 FSVEALMADHRKPGAKESALAPSEGVQAAGGSAQPLGVPPGSLGAPDAPSSPRPLGHFSV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 FSVEALMADHRKPGAKESALAPSEGVQAAGGSAQPLGVPPGSLGAPDAPSSPRPLGHFSV
               70        80        90       100       110       120

          120       130       140       150       160       170    
pF1KB8 GGLLKLPEDALVKAESPEKPERTPWMQSPRFSPPPARRLSPPACTLRKHKTNRKPRTPFT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 GGLLKLPEDALVKAESPEKPERTPWMQSPRFSPPPARRLSPPACTLRKHKTNRKPRTPFT
              130       140       150       160       170       180

          180       190       200       210       220       230    
pF1KB8 TAQLLALERKFRQKQYLSIAERAEFSSSLSLTETQVKIWFQNRRAKAKRLQEAELEKLKM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 TAQLLALERKFRQKQYLSIAERAEFSSSLSLTETQVKIWFQNRRAKAKRLQEAELEKLKM
              190       200       210       220       230       240

          240       250       260       270       280       290    
pF1KB8 AAKPMLPPAAFGLSFPLGGPAAVAAAAGASLYGASGPFQRAALPVAPVGLYTAHVGYSMY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 AAKPMLPPAAFGLSFPLGGPAAVAAAAGASLYGASGPFQRAALPVAPVGLYTAHVGYSMY
              250       260       270       280       290       300

          
pF1KB8 HLT
       :::
CCDS33 HLT
          

>>CCDS4392.1 MSX2 gene_id:4488|Hs108|chr5                 (267 aa)
 initn: 829 init1: 600 opt: 839  Z-score: 583.7  bits: 115.9 E(32554): 3.4e-26
Smith-Waterman score: 911; 58.9% identity (78.9% similar) in 275 aa overlap (25-297:16-267)

               10        20        30        40         50         
pF1KB8 MTSLPLGVKVEDSAFGKPAGGGAGQAPSAAAATAAAMGADEEGAKPK-VSPSLLPFSVEA
                               ..:...:. . . :. : .:. . :. : :::::::
CCDS43          MASPSKGNDLFSPDEEGPAVVAGPGPGPGGAEGAAEERRVKVSSLPFSVEA
                        10        20        30        40        50 

      60        70        80        90       100       110         
pF1KB8 LMADHRKPGAKESALAPSEGVQAAGGSAQPLGVPPGSLGAPDAPSSPRPLGHFSVGGLLK
       ::.:.. :  ::..  :.:.. .::.. .:: .  .. :: .:  :: ::        .:
CCDS43 LMSDKKPP--KEASPLPAESA-SAGATLRPLLL--SGHGAREA-HSPGPL--------VK
                60        70         80           90               

     120       130       140        150       160       170        
pF1KB8 LPEDALVKAESPEKPERTPWMQSP-RFSPPPARRLSPPACTLRKHKTNRKPRTPFTTAQL
         : : ::.:. :  . . ::: : :.:::: :..:: .::::::::::::::::::.::
CCDS43 PFETASVKSENSE--DGAAWMQEPGRYSPPP-RHMSPTTCTLRKHKTNRKPRTPFTTSQL
       100       110         120        130       140       150    

      180       190       200       210       220       230        
pF1KB8 LALERKFRQKQYLSIAERAEFSSSLSLTETQVKIWFQNRRAKAKRLQEAELEKLKMAAKP
       :::::::::::::::::::::::::.::::::::::::::::::::::::::::::::::
CCDS43 LALERKFRQKQYLSIAERAEFSSSLNLTETQVKIWFQNRRAKAKRLQEAELEKLKMAAKP
          160       170       180       190       200       210    

      240       250       260       270       280       290       
pF1KB8 MLPPAAFGLSFPLGGPAAVAAAAGASLYGASGPFQRAALPVAPVGLYTAHVGYSMYHLT
       ::: ..:.: ::...:  .     ::.:::: ::.: .::. :::::.. :::.::::.
CCDS43 MLP-SSFSLPFPISSPLQA-----ASIYGASYPFHRPVLPIPPVGLYATPVGYGMYHLS
           220       230            240       250       260       




297 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov  5 01:10:47 2016 done: Sat Nov  5 01:10:47 2016
 Total Scan time:  2.150 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or Suggestions?
Send a message to flexiclone AT kazusagt.com