Result of FASTA (ccds) for pFN21AB8921
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8921, 267 aa
  1>>>pF1KB8921 267 - 267 aa - 267 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 7.7990+/-0.000747; mu= 5.8772+/- 0.045
 mean_var=184.5384+/-37.546, 0's: 0 Z-trim(116.0): 171  B-trim: 10 in 1/53
 Lambda= 0.094413
 statistics sampled from 16361 (16545) to 16361 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.812), E-opt: 0.2 (0.508), width:  16
 Scan time:  3.010

The best scores are:                                      opt bits E(32554)
CCDS4392.1 MSX2 gene_id:4488|Hs108|chr5            ( 267) 1777 253.2 1.4e-67
CCDS3378.2 MSX1 gene_id:4487|Hs108|chr4            ( 303)  835 125.0 6.3e-29


>>CCDS4392.1 MSX2 gene_id:4488|Hs108|chr5                 (267 aa)
 initn: 1777 init1: 1777 opt: 1777  Z-score: 1327.0  bits: 253.2 E(32554): 1.4e-67
Smith-Waterman score: 1777; 99.6% identity (99.6% similar) in 267 aa overlap (1-267:1-267)

               10        20        30        40        50        60
pF1KB8 MASPSKGNDLFSPDEEGPAVVAGPGPGPGGAEGAAEERRVKVSSLPFSVEALMSDKKPPK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 MASPSKGNDLFSPDEEGPAVVAGPGPGPGGAEGAAEERRVKVSSLPFSVEALMSDKKPPK
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 EASPLPAESASAGATLRPLLLSGHGAREAHSPGPLVKPFETASVKSENSEDGAAWMQEPG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 EASPLPAESASAGATLRPLLLSGHGAREAHSPGPLVKPFETASVKSENSEDGAAWMQEPG
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 RYSPPPRHTSPTTCTLRKHKTNRKPRTPFTTSQLLALERKFRQKQYLSIAERAEFSSSLN
       :::::::: :::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 RYSPPPRHMSPTTCTLRKHKTNRKPRTPFTTSQLLALERKFRQKQYLSIAERAEFSSSLN
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB8 LTETQVKIWFQNRRAKAKRLQEAELEKLKMAAKPMLPSSFSLPFPISSPLQAASIYGASY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 LTETQVKIWFQNRRAKAKRLQEAELEKLKMAAKPMLPSSFSLPFPISSPLQAASIYGASY
              190       200       210       220       230       240

              250       260       
pF1KB8 PFHRPVLPIPPVGLYATPVGYGMYHLS
       :::::::::::::::::::::::::::
CCDS43 PFHRPVLPIPPVGLYATPVGYGMYHLS
              250       260       

>>CCDS3378.2 MSX1 gene_id:4487|Hs108|chr4                 (303 aa)
 initn: 825 init1: 596 opt: 835  Z-score: 632.8  bits: 125.0 E(32554): 6.3e-29
Smith-Waterman score: 907; 58.9% identity (78.5% similar) in 275 aa overlap (16-267:31-303)

                              10        20        30        40     
pF1KB8                MASPSKGNDLFSPDEEGPAVVAGPGPGPGGAEGAAEERRVKVSSL
                                     ..:...:. . . :. : .:. . :. : :
CCDS33 MAPAADMTSLPLGVKVEDSAFGKPAGGGAGQAPSAAAATAAAMGADEEGAKPK-VSPSLL
               10        20        30        40        50          

          50          60        70         80           90         
pF1KB8 PFSVEALMSDKKPP--KEASPLPAESA-SAGATLRPLLL--SGHGAREA-HSPGPL----
       ::::::::.:.. :  ::..  :.:.. .::.. .:: .  .. :: .:  :: ::    
CCDS33 PFSVEALMADHRKPGAKESALAPSEGVQAAGGSAQPLGVPPGSLGAPDAPSSPRPLGHFS
      60        70        80        90       100       110         

             100       110         120        130       140        
pF1KB8 ----VKPFETASVKSENSE--DGAAWMQEPGRYSPPP-RHTSPTTCTLRKHKTNRKPRTP
           .:  : : ::.:. :  . . ::: : :.:::: :. :: .:::::::::::::::
CCDS33 VGGLLKLPEDALVKAESPEKPERTPWMQSP-RFSPPPARRLSPPACTLRKHKTNRKPRTP
     120       130       140        150       160       170        

      150       160       170       180       190       200        
pF1KB8 FTTSQLLALERKFRQKQYLSIAERAEFSSSLNLTETQVKIWFQNRRAKAKRLQEAELEKL
       :::.:::::::::::::::::::::::::::.::::::::::::::::::::::::::::
CCDS33 FTTAQLLALERKFRQKQYLSIAERAEFSSSLSLTETQVKIWFQNRRAKAKRLQEAELEKL
      180       190       200       210       220       230        

      210        220       230            240       250       260  
pF1KB8 KMAAKPMLP-SSFSLPFPISSPLQAA-----SIYGASYPFHRPVLPIPPVGLYATPVGYG
       ::::::::: ..:.: ::...:  .:     :.:::: ::.: .::. :::::.. :::.
CCDS33 KMAAKPMLPPAAFGLSFPLGGPAAVAAAAGASLYGASGPFQRAALPVAPVGLYTAHVGYS
      240       250       260       270       280       290        

            
pF1KB8 MYHLS
       ::::.
CCDS33 MYHLT
      300   

267 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Mon Nov  7 01:21:20 2016 done: Mon Nov  7 01:21:21 2016
 Total Scan time:  3.010 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]