Result of FASTA (ccds) for pFN21AE2181
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE2181, 240 aa
  1>>>pF1KE2181 240 - 240 aa - 240 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 5.4193+/-0.000687; mu= 14.1186+/- 0.041
 mean_var=65.9634+/-13.184, 0's: 0 Z-trim(110.7): 13  B-trim: 358 in 2/49
 Lambda= 0.157915
 statistics sampled from 11839 (11844) to 11839 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.742), E-opt: 0.2 (0.364), width:  16
 Scan time:  2.110

The best scores are:                                      opt bits E(32554)
CCDS1006.1 THEM4 gene_id:117145|Hs108|chr1         ( 240) 1636 380.8 4.3e-106
CCDS1005.1 THEM5 gene_id:284486|Hs108|chr1         ( 247)  547 132.7 2.1e-31
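
The rows of the best-scores table above can be read programmatically. The following is a minimal sketch, not part of the FASTA package: the helper name `parse_hit` and the regular expression are assumptions based only on the two hit lines shown in this report, and other FASTA versions or output options may lay the columns out differently.

```python
import re

# Assumed layout (inferred from this report only):
#   accession  description  ( length)  opt  bits  E-value
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s+\(\s*(?P<length>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<evalue>\S+)\s*$"
)

def parse_hit(line):
    """Return a dict for one best-scores row, or None if the line is not a hit row."""
    m = HIT_RE.match(line)
    if m is None:
        return None
    return {
        "accession": m["acc"],
        "description": m["desc"],
        "length": int(m["length"]),   # library sequence length in aa
        "opt": int(m["opt"]),         # optimized score
        "bits": float(m["bits"]),     # bit score
        "evalue": float(m["evalue"]), # expectation value E(32554)
    }

rows = [
    "CCDS1006.1 THEM4 gene_id:117145|Hs108|chr1         ( 240) 1636 380.8 4.3e-106",
    "CCDS1005.1 THEM5 gene_id:284486|Hs108|chr1         ( 247)  547 132.7 2.1e-31",
]
hits = [parse_hit(r) for r in rows]
for h in hits:
    print(h["accession"], h["length"], h["opt"], h["bits"], h["evalue"])
```

Lines that do not fit the assumed layout, such as the table header, yield `None` and can be skipped.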


>>CCDS1006.1 THEM4 gene_id:117145|Hs108|chr1              (240 aa)
 initn: 1636 init1: 1636 opt: 1636  Z-score: 2018.3  bits: 380.8 E(32554): 4.3e-106
Smith-Waterman score: 1636; 99.6% identity (99.6% similar) in 240 aa overlap (1-240:1-240)

               10        20        30        40        50        60
pF1KE2 MLRSCAARLRTLGALCRPPVGRRLPGSEPRPELRSFSSEEVILKDCSVPNPSWNKDLRLL
       :::::::::::::::: :::::::::::::::::::::::::::::::::::::::::::
CCDS10 MLRSCAARLRTLGALCLPPVGRRLPGSEPRPELRSFSSEEVILKDCSVPNPSWNKDLRLL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE2 FDQFMKKCEDGSWKRLPSYKRTPTEWIQDFKTHFLDPKLMKEEQMSQAQLFTRSFDDGLG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 FDQFMKKCEDGSWKRLPSYKRTPTEWIQDFKTHFLDPKLMKEEQMSQAQLFTRSFDDGLG
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE2 FEYVMFYNDIEKRMVCLFQGGPYLEGPPGFIHGGAIATMIDATVGMCAMMAGGIVMTANL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 FEYVMFYNDIEKRMVCLFQGGPYLEGPPGFIHGGAIATMIDATVGMCAMMAGGIVMTANL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE2 NINYKRPIPLCSVVMINSQLDKVEGRKFFVSCNVQSVDEKTLYSEATSLFIKLNPAKSLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 NINYKRPIPLCSVVMINSQLDKVEGRKFFVSCNVQSVDEKTLYSEATSLFIKLNPAKSLT
              190       200       210       220       230       240
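
The identity figure reported for this alignment (99.6% in 240 aa overlap) can be checked from the marker lines between the two sequences, where `:` marks an identical pair and `.` a conservative substitution. The sketch below is illustrative only: `percent_identity` is a hypothetical helper, and the marker strings are rebuilt by hand from the four alignment blocks above (the single gap in the markers is position 17, R in the query versus L in CCDS1006.1).

```python
def percent_identity(marker_lines, overlap):
    """Count ':' (identical) and ':' + '.' (similar) markers over an alignment."""
    markers = "".join(marker_lines)
    identical = markers.count(":")
    similar = identical + markers.count(".")
    return (
        identical,
        round(100.0 * identical / overlap, 1),
        round(100.0 * similar / overlap, 1),
    )

# Marker lines of the THEM4 alignment, one per 60-column block.
# Block 1 has a mismatch at position 17; blocks 2-4 are fully identical.
marker_lines = [
    ":" * 16 + " " + ":" * 43,
    ":" * 60,
    ":" * 60,
    ":" * 60,
]

identical, pct_identity, pct_similar = percent_identity(marker_lines, overlap=240)
print(identical, pct_identity, pct_similar)  # 239 99.6 99.6
```

With 239 identical positions out of 240, both percentages round to the 99.6% shown in the alignment header.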

>>CCDS1005.1 THEM5 gene_id:284486|Hs108|chr1              (247 aa)
 initn: 517 init1: 388 opt: 547  Z-score: 677.2  bits: 132.7 E(32554): 2.1e-31
Smith-Waterman score: 547; 38.7% identity (68.7% similar) in 243 aa overlap (1-233:1-239)

                   10        20             30        40        50 
pF1KE2 MLRSC---AARL-RTLGALCRPPVGRRL-P----GSEPRPELRSFSSEEVILKDCSVPNP
       :.: :   :::: .  : :  : .  :: :    ::     .  :  :.. ::: ..:: 
CCDS10 MIRRCFQVAARLGHHRGLLEAPRILPRLNPASAFGSSTDSMFSRFLPEKTDLKDYALPNA
               10        20        30        40        50        60

              60        70        80        90       100       110 
pF1KE2 SWNKDLRLLFDQFMKKCEDGSWKRLPSYKRTPTEWIQDFKTHFLDPKLMKEEQMSQAQLF
       :: .:.  :...:..: ....: .:::.: .  . :. .:   :   :    . .. ..:
CCDS10 SWCSDMLSLYQEFLEKTKSSGWIKLPSFK-SNRDHIRGLK---LPSGLAVSSDKGDCRIF
               70        80         90          100       110      

              120       130       140       150       160       170
pF1KE2 TRSFD-DGLGFEYVMFYNDIEKRMVCLFQGGPYLEGPPGFIHGGAIATMIDATVGMCAMM
       :: .. .: :::::.:..  .:. ::::: : :::::::: :::..:.:.: : .  :..
CCDS10 TRCIQVEGQGFEYVIFFQPTQKKSVCLFQPGSYLEGPPGFAHGGSLAAMMDETFSKTAFL
        120       130       140       150       160       170      

              180       190       200       210       220       230
pF1KE2 AGGIVMTANLNINYKRPIPLCSVVMINSQLDKVEGRKFFVSCNVQSVDEKTLYSEATSLF
       ::  ..: .::: .:  ::. :.:... .:::.: .:...:: ..: :..:.:......:
CCDS10 AGEGLFTLSLNIRFKNLIPVDSLVVMDVELDKIEDQKLYMSCIAHSRDQQTVYAKSSGVF
        180       190       200       210       220       230      

              240 
pF1KE2 IKLNPAKSLT 
       ..:        
CCDS10 LQLQLEEESPQ
        240       




240 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Mon Nov  7 15:48:07 2016 done: Mon Nov  7 15:48:07 2016
 Total Scan time:  2.110 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or suggestions?
Send a message to flexiclone AT kazusagt.com