Result of FASTA (ccds) for pFN21AA0943
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KA0943, 393 aa
  1>>>pF1KA0943 393 - 393 aa - 393 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 5.1890+/-0.0008; mu= 17.5222+/- 0.048
 mean_var=69.7510+/-13.816, 0's: 0 Z-trim(108.4): 12  B-trim: 0 in 0/50
 Lambda= 0.153567
 statistics sampled from 10150 (10158) to 10150 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.69), E-opt: 0.2 (0.312), width:  16
 Scan time:  2.900

The best scores are:                                      opt bits E(32554)
CCDS46564.1 ATG4B gene_id:23192|Hs108|chr2         ( 393) 2739 615.8 2.1e-176
CCDS46565.1 ATG4B gene_id:23192|Hs108|chr2         ( 380) 2589 582.6  2e-166
CCDS14538.1 ATG4A gene_id:115201|Hs108|chrX        ( 398) 1507 342.9 3.1e-94
CCDS14539.1 ATG4A gene_id:115201|Hs108|chrX        ( 336)  889 205.9 4.4e-53
CCDS12241.1 ATG4D gene_id:84971|Hs108|chr19        ( 474)  588 139.3 6.9e-33


>>CCDS46564.1 ATG4B gene_id:23192|Hs108|chr2              (393 aa)
 initn: 2739 init1: 2739 opt: 2739  Z-score: 3280.6  bits: 615.8 E(32554): 2.1e-176
Smith-Waterman score: 2739; 99.7% identity (99.7% similar) in 393 aa overlap (1-393:1-393)

               10        20        30        40        50        60
pF1KA0 MDAATLTYDTLRFAEFEDFPETSEPVWILGRKYSIFTEKDEILSDVASRLWFTYRKNFPA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 MDAATLTYDTLRFAEFEDFPETSEPVWILGRKYSIFTEKDEILSDVASRLWFTYRKNFPA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KA0 IGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 IGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKD
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KA0 SYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 SYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIR
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KA0 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KA0 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KA0 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::: ::::::
CCDS46 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLA
              310       320       330       340       350       360

              370       380       390   
pF1KA0 CPDVLNLSLDSSDVERLERFFDSEDEDFEILSL
       :::::::::::::::::::::::::::::::::
CCDS46 CPDVLNLSLDSSDVERLERFFDSEDEDFEILSL
              370       380       390   

>>CCDS46565.1 ATG4B gene_id:23192|Hs108|chr2              (380 aa)
 initn: 2589 init1: 2589 opt: 2589  Z-score: 3101.2  bits: 582.6 E(32554): 2e-166
Smith-Waterman score: 2589; 99.2% identity (99.2% similar) in 372 aa overlap (1-372:1-372)

               10        20        30        40        50        60
pF1KA0 MDAATLTYDTLRFAEFEDFPETSEPVWILGRKYSIFTEKDEILSDVASRLWFTYRKNFPA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 MDAATLTYDTLRFAEFEDFPETSEPVWILGRKYSIFTEKDEILSDVASRLWFTYRKNFPA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KA0 IGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 IGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKD
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KA0 SYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 SYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIR
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KA0 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KA0 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KA0 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::: ::::::
CCDS46 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLA
              310       320       330       340       350       360

              370       380       390   
pF1KA0 CPDVLNLSLDSSDVERLERFFDSEDEDFEILSL
       :::::::::  :                     
CCDS46 CPDVLNLSLGESCQVQILLM             
              370       380             

>>CCDS14538.1 ATG4A gene_id:115201|Hs108|chrX             (398 aa)
 initn: 1517 init1: 877 opt: 1507  Z-score: 1805.3  bits: 342.9 E(32554): 3.1e-94
Smith-Waterman score: 1507; 55.4% identity (81.4% similar) in 392 aa overlap (13-393:15-398)

                 10         20        30        40        50       
pF1KA0   MDAATLTYDTLRFAEF-EDFPETSEPVWILGRKYSIFTEKDEILSDVASRLWFTYRKN
                     :... :..:.:.: :::::... . :::...:::...:::::::..
CCDS14 MESVLSKYEDQITIFTDYLEEYPDTDELVWILGKQHLLKTEKSKLLSDISARLWFTYRRK
               10        20        30        40        50        60

        60        70        80        90       100       110       
pF1KA0 FPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFID
       :  ::::::.::.:::::::::::..::::.:::::::: : ..:.::  :  .:. :.:
CCDS14 FSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLD
               70        80        90       100       110       120

       120       130       140       150       160       170       
pF1KA0 RKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVME
       :::  :::::.::::::::::::.:.:::::::::::::.:: :.::::...::::::.:
CCDS14 RKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIE
              130       140       150       160       170       180

       180       190       200       210          220       230    
pF1KA0 EIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAE---VTNRPSPWRPLVLLIPLRLGLTD
       .:...::. .: .. ::    .::  ... :. .   ..   : :.::.:..:::::...
CCDS14 DIKKMCRV-LPLSADTA----GDRPPDSLTASNQSKGTSAYCSAWKPLLLIVPLRLGINQ
               190           200       210       220       230     

          240       250       260       270       280       290    
pF1KA0 INEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFI
       :: .::...:.:: ::::::..:::::.:.::::..:.:::.:::::::  :.  ..  .
CCDS14 INPVYVDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTV
         240       250       260       270       280       290     

          300       310       320       330       340       350    
pF1KA0 PDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQ
        :..::: . : ::.: .::::.:.::::: : ::..::. :.:  .:   : :::::..
CCDS14 NDQTFHCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQK-EILKENLRMFELVQK
         300       310       320       330        340       350    

                360       370        380       390   
pF1KA0 QPSHL------ACPDVLNLSLDSSD-VERLERFFDSEDEDFEILSL
       .:::       : :.: . . .  : .:.::.: : : :::::::.
CCDS14 HPSHWPPFVPPAKPEVTTTGAEFIDSTEQLEEF-DLE-EDFEILSV
          360       370       380        390         

>>CCDS14539.1 ATG4A gene_id:115201|Hs108|chrX             (336 aa)
 initn: 1266 init1: 877 opt: 889  Z-score: 1066.4  bits: 205.9 E(32554): 4.4e-53
Smith-Waterman score: 1137; 48.1% identity (68.9% similar) in 389 aa overlap (13-393:15-336)

                 10         20        30        40        50       
pF1KA0   MDAATLTYDTLRFAEF-EDFPETSEPVWILGRKYSIFTEKDEILSDVASRLWFTYRKN
                     :... :..:.:.: :::::... . :::...:::...:::::::..
CCDS14 MESVLSKYEDQITIFTDYLEEYPDTDELVWILGKQHLLKTEKSKLLSDISARLWFTYRRK
               10        20        30        40        50        60

        60        70        80        90       100       110       
pF1KA0 FPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFID
       :  ::::::.::.:::::::::::..::::.:::::::: : ..:.::  :  .:. :.:
CCDS14 FSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLD
               70        80        90       100       110       120

       120       130       140       150       160       170       
pF1KA0 RKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVME
       :::  :::::.::::::::::::.:.:::::::::::::.:: :.::::...::::::.:
CCDS14 RKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIE
              130       140       150       160       170       180

       180       190       200       210       220       230       
pF1KA0 EIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINE
       .:...::.         .: ..:       ::    .::                     
CCDS14 DIKKMCRV---------LPLSADT------AG----DRP---------------------
                       190                 200                     

       240       250       260       270       280       290       
pF1KA0 AYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDE
                   :.:: .     :..        .:::.:::::::  :.  ..  . :.
CCDS14 ------------PDSLTA----SNQS--------DELIFLDPHTTQTFVDTEENGTVNDQ
                              210               220       230      

       300       310       320       330       340       350       
pF1KA0 SFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPS
       .::: . : ::.: .::::.:.::::: : ::..::. :.:  .:   : :::::...::
CCDS14 TFHCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQK-EILKENLRMFELVQKHPS
        240       250       260       270        280       290     

             360       370        380       390   
pF1KA0 HL------ACPDVLNLSLDSSD-VERLERFFDSEDEDFEILSL
       :       : :.: . . .  : .:.::.: : : :::::::.
CCDS14 HWPPFVPPAKPEVTTTGAEFIDSTEQLEEF-DLE-EDFEILSV
         300       310       320         330      

>>CCDS12241.1 ATG4D gene_id:84971|Hs108|chr19             (474 aa)
 initn: 565 init1: 249 opt: 588  Z-score: 703.9  bits: 139.3 E(32554): 6.9e-33
Smith-Waterman score: 656; 32.5% identity (55.6% similar) in 412 aa overlap (26-391:94-474)

                    10        20        30        40          50   
pF1KA0      MDAATLTYDTLRFAEFEDFPETSEPVWILGRKYSIFTEKD--EILSDVASRLWFT
                                     . . ::.: .  : :  ..  : .::::.:
CCDS12 KFKAKFLTAWNNVKYGWVVKSRTSFSKISSIHLCGRRYRFEGEGDIQRFQRDFVSRLWLT
            70        80        90       100       110       120   

            60        70        80        90       100             
pF1KA0 YRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQ-------------
       ::..:: . :   ::: ::::::: :::..::.:. . : ::: :..             
CCDS12 YRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSA
           130       140       150       160       170       180   

                                   110       120       130         
pF1KA0 ---RKRQPDSYF------------------SVLNAFIDRKDSYYSIHQIAQMGVGEGKSI
          : . :  ..                  .... : :.  . ...:.....: . ::. 
CCDS12 SPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKA
           190       200       210       220       230       240   

     140       150       160       170       180       190         
pF1KA0 GQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADS
       :.::::. ::..:.:          ::.   : :      : .  .:  :   :.. :: 
CCDS12 GDWYGPSLVAHILRK----------AVESCSDVT------RLVVYVSQDC---TVYKADV
           250                 260             270          280    

     200       210          220       230       240       250      
pF1KA0 DRHCNGFPAGAEVTNRPSP---WRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVI
        :          .. ::.:   :. .:.:.:.:::   .: .::  .:. .     ::..
CCDS12 AR----------LVARPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIM
                    290       300       310       320       330    

        260       270       280       290       300       310      
pF1KA0 GGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPCRMSIAELDPS
       ::::  . :::::  . :.:::::  ::.:. ... : : :::::  :  .:..:..:::
CCDS12 GGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADF-PLESFHCTSPR-KMAFAKMDPS
          340       350       360       370        380        390  

        320       330       340          350         360           
pF1KA0 IAVGFFCKTEDDFNDWCQQVKKLSLLGGAL---PMFELVE--QQPSHLA--CPDVLNLSL
        .:::.   . .:.  :... ..   ..:    ::: :.:   :   :   : .. . .:
CCDS12 CTVGFYAGDRKEFETLCSELTRVLSSSSATERYPMFTLAEGHAQDHSLDDLCSQLAQPTL
            400       410       420       430       440       450  

     370       380       390   
pF1KA0 DSSDVERLERFFDSEDEDFEILSL
           . :: :     .::: .:  
CCDS12 RLPRTGRLLRAKRPSSEDFVFL  
            460       470      




393 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Wed Nov  2 20:05:04 2016 done: Wed Nov  2 20:05:05 2016
 Total Scan time:  2.900 Total Display time:  0.040

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or suggestions?
Send a message to flexiclone AT kazusagt.com