Result of FASTA (ccds) for pFN21AA0116
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KA0116, 291 aa
  1>>>pF1KA0116 291 - 291 aa - 291 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 5.5859+/-0.000825; mu= 12.8372+/- 0.049
 mean_var=53.0006+/-10.558, 0's: 0 Z-trim(104.6): 22  B-trim: 38 in 1/51
 Lambda= 0.176171
 statistics sampled from 7989 (7996) to 7989 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.623), E-opt: 0.2 (0.246), width:  16
 Scan time:  2.250

The best scores are:                                      opt bits E(32554)
CCDS2725.1 EXOSC7 gene_id:23016|Hs108|chr3         ( 291) 1904 491.8 2.4e-139
CCDS31958.1 EXOSC8 gene_id:11340|Hs108|chr13       ( 276)  374 103.0 2.6e-22
CCDS3722.2 EXOSC9 gene_id:5393|Hs108|chr4          ( 439)  375 103.2 3.5e-22
CCDS34057.1 EXOSC9 gene_id:5393|Hs108|chr4         ( 456)  375 103.2 3.6e-22


>>CCDS2725.1 EXOSC7 gene_id:23016|Hs108|chr3              (291 aa)
 initn: 1904 init1: 1904 opt: 1904  Z-score: 2615.1  bits: 491.8 E(32554): 2.4e-139
Smith-Waterman score: 1904; 99.7% identity (100.0% similar) in 291 aa overlap (1-291:1-291)

               10        20        30        40        50        60
pF1KA0 MASVTLSEAEKVYIVHGVQEDLRVDGRGCEDYRCVEVETDVVSNTSGSARVKLGHTDILV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 MASVTLSEAEKVYIVHGVQEDLRVDGRGCEDYRCVEVETDVVSNTSGSARVKLGHTDILV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KA0 GVKAEMGTPKLEKPNEGYLEFFVDCSASATPEFEGRGGDDLGTEIANTLYRIFNNKSSVD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 GVKAEMGTPKLEKPNEGYLEFFVDCSASATPEFEGRGGDDLGTEIANTLYRIFNNKSSVD
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KA0 LKTLCISPREHCWVLYVDVLLLECGGNLFDAISIAVKAALFNTRIPRVRVLEDEEGSKDI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 LKTLCISPREHCWVLYVDVLLLECGGNLFDAISIAVKAALFNTRIPRVRVLEDEEGSKDI
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KA0 ELSDDPYDCIRLSVENVPCIVTLCKIGYRHVVDATLQEEACSLASLLVSVTSKGVVTCMR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 ELSDDPYDCIRLSVENVPCIVTLCKIGYRHVVDATLQEEACSLASLLVSVTSKGVVTCMR
              190       200       210       220       230       240

              250       260       270       280       290 
pF1KA0 KVGKGSLDPESIFEMMETGKRVGKVLHASLQSVLHKEESLGPKRQKVGFLG
       :::::::::::::::::::::::::::::::::.:::::::::::::::::
CCDS27 KVGKGSLDPESIFEMMETGKRVGKVLHASLQSVVHKEESLGPKRQKVGFLG
              250       260       270       280       290 

>>CCDS31958.1 EXOSC8 gene_id:11340|Hs108|chr13            (276 aa)
 initn: 391 init1: 316 opt: 374  Z-score: 513.9  bits: 103.0 E(32554): 2.6e-22
Smith-Waterman score: 374; 28.9% identity (62.8% similar) in 266 aa overlap (18-276:18-276)

               10        20        30        40        50        60
pF1KA0 MASVTLSEAEKVYIVHGVQEDLRVDGRGCEDYRCVEVETDVVSNTSGSARVKLGHTDILV
                        ..:. : :::   ..: . :.   .:...::: ::::.: .. 
CCDS31 MAAGFKTVEPLEYYRRFLKENCRPDGRELGEFRTTTVNIGSISTADGSALVKLGNTTVIC
               10        20        30        40        50        60

               70        80        90        100       110         
pF1KA0 GVKAEMGTPKLEKPNEGYLEFFVDCSASATPEFE-GRGGDDLGTEIANTLYR-IFNNKSS
       :::::...:. . :..::.   ::     . .:. :  :..  ...:. .   ...:.. 
CCDS31 GVKAEFAAPSTDAPDKGYVVPNVDLPPLCSSRFRSGPPGEE--AQVASQFIADVIENSQI
               70        80        90       100         110        

      120       130       140       150       160       170        
pF1KA0 VDLKTLCISPREHCWVLYVDVLLLECGGNLFDAISIAVKAALFNTRIPRVRVLEDEEGSK
       .. . ::::: .  :::: :.. :.  ::..:: ..:. ::: :...:.: . : : .  
CCDS31 IQKEDLCISPGKLVWVLYCDLICLDYDGNILDACTFALLAALKNVQLPEVTINE-ETALA
      120       130       140       150       160       170        

      180       190       200       210        220       230       
pF1KA0 DIELSDDPYDCIRLSVENVPCIVTLCKIGYRH-VVDATLQEEACSLASLLVSVTSKGVVT
       ...:.   :    :.... :  ...  .     .:: : .::  . ..: . .  .: . 
CCDS31 EVNLKKKSY----LNIRTHPVATSFAVFDDTLLIVDPTGEEEHLATGTLTIVMDEEGKLC
       180           190       200       210       220       230   

       240       250           260       270       280       290 
pF1KA0 CMRKVGKGSLDPESIFEMMETG----KRVGKVLHASLQSVLHKEESLGPKRQKVGFLG
       :..: : ..:   .. . :  .    :.: :..   ..:.  :               
CCDS31 CLHKPGGSGLTGAKLQDCMSRAVTRHKEVKKLMDEVIKSMKPK               
           240       250       260       270                     

>>CCDS3722.2 EXOSC9 gene_id:5393|Hs108|chr4               (439 aa)
 initn: 355 init1: 262 opt: 375  Z-score: 511.7  bits: 103.2 E(32554): 3.5e-22
Smith-Waterman score: 375; 28.9% identity (60.2% similar) in 294 aa overlap (1-289:1-284)

               10        20        30        40        50        60
pF1KA0 MASVTLSEAEKVYIVHGVQEDLRVDGRGCEDYRCVEVETDVVSNTSGSARVKLGHTDILV
       :  . ::. :. .......:  :.:::   ::: ...     ..  :   :.::.: .: 
CCDS37 MKETPLSNCERRFLLRAIEEKKRLDGRQTYDYRNIRIS---FGTDYGCCIVELGKTRVLG
               10        20        30           40        50       

               70        80        90       100       110       120
pF1KA0 GVKAEMGTPKLEKPNEGYLEFFVDCSASATPEFEGRGGDDLGTEIANTLYRIFNNKSSVD
        :. :. .:::.. .:: : : .. :  :.: ::    .:: ...   . : . :.. .:
CCDS37 QVSCELVSPKLNRATEGILFFNLELSQMAAPAFEPGRQSDLLVKLNRLMERCLRNSKCID
        60        70        80        90       100       110       

              130       140       150       160       170       180
pF1KA0 LKTLCISPREHCWVLYVDVLLLECGGNLFDAISIAVKAALFNTRIPRVRVLEDEEGSKDI
        ..::.   :. : . ::. ::.  ::..:: :::. .:: . : : : :  ::     .
CCDS37 TESLCVVAGEKVWQIRVDLHLLNHDGNIIDAASIAAIVALCHFRRPDVSVQGDE-----V
       120       130       140       150       160       170       

               190        200        210       220       230       
pF1KA0 EL-SDDPYDCIRLSVENVP-CI-VTLCKIGYRHVVDATLQEEACSLASLLVSVTSKGVVT
        : . .  : . ::....: :.  .. . :   .:: . .::   . .::: . .:    
CCDS37 TLYTPEERDPVPLSIHHMPICVSFAFFQQGTYLLVDPNEREERV-MDGLLVIAMNKHREI
            180       190       200       210        220       230 

        240       250        260       270       280       290     
pF1KA0 C-MRKVGKGSLDPESIFEMME-TGKRVGKVLHASLQSVLHKEESLGPKRQKVGFLG    
       : ... :   :  .....  . .: .:... .  :.. :......  .  : ::      
CCDS37 CTIQSSGGIMLLKDQVLRCSKIAGVKVAEITELILKA-LENDQKVRKEGGKFGFAESIAN
             240       250       260        270       280       290

CCDS37 QRITAFKMEKAPIDTSDVEEKAEEIIAEAEPPSEVVSTPVLWTPGTAQIGEGVENSWGDL
              300       310       320       330       340       350

>>CCDS34057.1 EXOSC9 gene_id:5393|Hs108|chr4              (456 aa)
 initn: 355 init1: 262 opt: 375  Z-score: 511.4  bits: 103.2 E(32554): 3.6e-22
Smith-Waterman score: 375; 28.9% identity (60.2% similar) in 294 aa overlap (1-289:1-284)

               10        20        30        40        50        60
pF1KA0 MASVTLSEAEKVYIVHGVQEDLRVDGRGCEDYRCVEVETDVVSNTSGSARVKLGHTDILV
       :  . ::. :. .......:  :.:::   ::: ...     ..  :   :.::.: .: 
CCDS34 MKETPLSNCERRFLLRAIEEKKRLDGRQTYDYRNIRIS---FGTDYGCCIVELGKTRVLG
               10        20        30           40        50       

               70        80        90       100       110       120
pF1KA0 GVKAEMGTPKLEKPNEGYLEFFVDCSASATPEFEGRGGDDLGTEIANTLYRIFNNKSSVD
        :. :. .:::.. .:: : : .. :  :.: ::    .:: ...   . : . :.. .:
CCDS34 QVSCELVSPKLNRATEGILFFNLELSQMAAPAFEPGRQSDLLVKLNRLMERCLRNSKCID
        60        70        80        90       100       110       

              130       140       150       160       170       180
pF1KA0 LKTLCISPREHCWVLYVDVLLLECGGNLFDAISIAVKAALFNTRIPRVRVLEDEEGSKDI
        ..::.   :. : . ::. ::.  ::..:: :::. .:: . : : : :  ::     .
CCDS34 TESLCVVAGEKVWQIRVDLHLLNHDGNIIDAASIAAIVALCHFRRPDVSVQGDE-----V
       120       130       140       150       160       170       

               190        200        210       220       230       
pF1KA0 EL-SDDPYDCIRLSVENVP-CI-VTLCKIGYRHVVDATLQEEACSLASLLVSVTSKGVVT
        : . .  : . ::....: :.  .. . :   .:: . .::   . .::: . .:    
CCDS34 TLYTPEERDPVPLSIHHMPICVSFAFFQQGTYLLVDPNEREERV-MDGLLVIAMNKHREI
            180       190       200       210        220       230 

        240       250        260       270       280       290     
pF1KA0 C-MRKVGKGSLDPESIFEMME-TGKRVGKVLHASLQSVLHKEESLGPKRQKVGFLG    
       : ... :   :  .....  . .: .:... .  :.. :......  .  : ::      
CCDS34 CTIQSSGGIMLLKDQVLRCSKIAGVKVAEITELILKA-LENDQKVRKEGGKFGFAESIAN
             240       250       260        270       280       290

CCDS34 QRITAFKMEKAPIDTSDVEEKAEEIIAEAEPPSEVVSTPVLWTPGTAQIGEGVENSWGDL
              300       310       320       330       340       350




291 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Wed Nov  2 18:08:56 2016 done: Wed Nov  2 18:08:56 2016
 Total Scan time:  2.250 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or Suggestions ?
Send a message to flexiclone AT kazusagt.com