Result of FASTA (ccds) for pFN21AE4488
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE4488, 510 aa
  1>>>pF1KE4488 510 - 510 aa - 510 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 7.0117+/-0.00092; mu= 11.1584+/- 0.055
 mean_var=131.8774+/-26.370, 0's: 0 Z-trim(109.9): 17  B-trim: 0 in 0/52
 Lambda= 0.111683
 statistics sampled from 11223 (11228) to 11223 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.703), E-opt: 0.2 (0.345), width:  16
 Scan time:  3.370

The best scores are:                                      opt bits E(32554)
CCDS11974.1 LMAN1 gene_id:3998|Hs108|chr18         ( 510) 3454 568.0 8.5e-162
CCDS10270.1 LMAN1L gene_id:79748|Hs108|chr15       ( 526) 1000 172.7 9.3e-43
CCDS4417.1 LMAN2 gene_id:10960|Hs108|chr5          ( 356)  570 103.2 4.9e-22
CCDS2023.1 LMAN2L gene_id:81562|Hs108|chr2         ( 348)  528 96.5 5.3e-20
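
The "best scores" table above is fixed-layout text, so it can be turned into records with a small parser. The following is a minimal sketch, not part of the FASTA distribution: the `Hit` type, the `HIT_RE` pattern, and the `parse_best_scores` function are hypothetical names introduced here, and the pattern assumes the `CCDS… GENE gene_id:N|…|chrN (len) opt bits E` layout seen in this report.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    """One row of the 'best scores' summary table (hypothetical record type)."""
    accession: str
    gene: str
    gene_id: int
    length: int
    opt: int
    bits: float
    evalue: float


# Assumed layout: accession, gene symbol, gene_id:<n>|<build>|<chr>,
# "( len)", opt score, bit score, E-value.
HIT_RE = re.compile(
    r"^(?P<acc>CCDS\S+)\s+(?P<gene>\S+)\s+gene_id:(?P<gid>\d+)\|\S+"
    r"\s+\(\s*(?P<len>\d+)\)\s+(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<e>\S+)\s*$"
)


def parse_best_scores(text: str) -> List[Hit]:
    """Return one Hit per summary line; lines that do not match are skipped."""
    hits = []
    for line in text.splitlines():
        m = HIT_RE.match(line)
        if m:
            hits.append(Hit(m["acc"], m["gene"], int(m["gid"]), int(m["len"]),
                            int(m["opt"]), float(m["bits"]), float(m["e"])))
    return hits


SAMPLE = """\
The best scores are:                                      opt bits E(32554)
CCDS11974.1 LMAN1 gene_id:3998|Hs108|chr18         ( 510) 3454 568.0 8.5e-162
CCDS10270.1 LMAN1L gene_id:79748|Hs108|chr15       ( 526) 1000 172.7 9.3e-43
CCDS4417.1 LMAN2 gene_id:10960|Hs108|chr5          ( 356)  570 103.2 4.9e-22
CCDS2023.1 LMAN2L gene_id:81562|Hs108|chr2         ( 348)  528 96.5 5.3e-20
"""

hits = parse_best_scores(SAMPLE)
for h in hits:
    print(h.accession, h.gene, h.opt, h.bits, h.evalue)
```

Because the pattern is anchored on a leading `CCDS`, the `>>CCDS…` alignment headers further down the report are not picked up as table rows.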


>>CCDS11974.1 LMAN1 gene_id:3998|Hs108|chr18              (510 aa)
 initn: 3454 init1: 3454 opt: 3454  Z-score: 3018.2  bits: 568.0 E(32554): 8.5e-162
Smith-Waterman score: 3454; 99.8% identity (100.0% similar) in 510 aa overlap (1-510:1-510)

               10        20        30        40        50        60
pF1KE4 MAGSRQRGLRARVRPLFCALLLSLGRFVRGDGVGGDPAVALPHRRFEYKYSFKGPHLVQS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MAGSRQRGLRARVRPLFCALLLSLGRFVRGDGVGGDPAVALPHRRFEYKYSFKGPHLVQS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE4 DGTVPFWAHAGNAIPSSDQIRVAPSLKSQRGSVWTKTKAAFENWEVEVTFRVTGRGRIGA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 DGTVPFWAHAGNAIPSSDQIRVAPSLKSQRGSVWTKTKAAFENWEVEVTFRVTGRGRIGA
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE4 DGLAIWYAENQGLEGPVFGSADLWNGVGIFFDSFDNDGKKNNPAIVIIGNNGQIHYDHQN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 DGLAIWYAENQGLEGPVFGSADLWNGVGIFFDSFDNDGKKNNPAIVIIGNNGQIHYDHQN
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE4 DGASQALASCQRDFRNKPYPVRAKITYYQNTLTVMINNGFTPDKNDYEFCAKVENMIIPA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 DGASQALASCQRDFRNKPYPVRAKITYYQNTLTVMINNGFTPDKNDYEFCAKVENMIIPA
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE4 QGHFGISAATGGLADDHDVLSFLTFQLTEPGKEPPTPDKEISEKEKEKYQEEFEHFQQEL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 QGHFGISAATGGLADDHDVLSFLTFQLTEPGKEPPTPDKEISEKEKEKYQEEFEHFQQEL
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE4 DKKKEEFQKGHPDLQGQPAEEIFESVGDRELRQVFEGQNRIHLEIKQLNRQLDMILDEQR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 DKKKEEFQKGHPDLQGQPAEEIFESVGDRELRQVFEGQNRIHLEIKQLNRQLDMILDEQR
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE4 RYVSSLTEEISKRGAGMPGQHGQITQQELDTVVKTQHEILRQVNEMKNSLSETVRLVSGM
       :::::::::::::::::::::::::::::::::::::::::::::::::.::::::::::
CCDS11 RYVSSLTEEISKRGAGMPGQHGQITQQELDTVVKTQHEILRQVNEMKNSMSETVRLVSGM
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KE4 QHPGSAGGVYETTQHFIDIKEHLHIVKRDIDNLVQRNMPSNEKPKCPELPPFPSCLSTVH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 QHPGSAGGVYETTQHFIDIKEHLHIVKRDIDNLVQRNMPSNEKPKCPELPPFPSCLSTVH
              430       440       450       460       470       480

              490       500       510
pF1KE4 FIIFVVVQTVLFIGYIMYRSQQEAAAKKFF
       ::::::::::::::::::::::::::::::
CCDS11 FIIFVVVQTVLFIGYIMYRSQQEAAAKKFF
              490       500       510
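
Each alignment record opens with a "Smith-Waterman score" line giving the score, percent identity, percent similarity, overlap length, and the query:subject coordinate ranges. A minimal sketch for extracting those fields follows; `SW_RE` and `parse_sw_summary` are hypothetical names, and the query-coverage figure is a convenience calculation added here (query span divided by query length), not a value FASTA reports.

```python
import re

# Assumed layout of the per-alignment summary line in this report.
SW_RE = re.compile(
    r"Smith-Waterman score:\s*(?P<score>\d+);\s*(?P<ident>[\d.]+)% identity"
    r"\s*\((?P<sim>[\d.]+)% similar\) in (?P<overlap>\d+) aa overlap"
    r"\s*\((?P<qs>\d+)-(?P<qe>\d+):(?P<ss>\d+)-(?P<se>\d+)\)"
)


def parse_sw_summary(line: str, query_len: int) -> dict:
    """Parse one Smith-Waterman summary line into a dict of numeric fields."""
    m = SW_RE.search(line)
    if m is None:
        raise ValueError("not a Smith-Waterman summary line")
    qs, qe = int(m["qs"]), int(m["qe"])
    ss, se = int(m["ss"]), int(m["se"])
    return {
        "score": int(m["score"]),
        "identity": float(m["ident"]) / 100.0,    # fraction, not percent
        "similar": float(m["sim"]) / 100.0,       # fraction, not percent
        "overlap": int(m["overlap"]),             # aligned columns, gaps included
        "query_range": (qs, qe),
        "subject_range": (ss, se),
        "query_coverage": (qe - qs + 1) / query_len,  # derived here, not by FASTA
    }


line = ("Smith-Waterman score: 1000; 35.5% identity (66.3% similar) "
        "in 501 aa overlap (15-501:9-486)")
summary = parse_sw_summary(line, query_len=510)
print(summary)
```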

>>CCDS10270.1 LMAN1L gene_id:79748|Hs108|chr15            (526 aa)
 initn: 900 init1: 435 opt: 1000  Z-score: 881.1  bits: 172.7 E(32554): 9.3e-43
Smith-Waterman score: 1000; 35.5% identity (66.3% similar) in 501 aa overlap (15-501:9-486)

               10        20        30        40        50        60
pF1KE4 MAGSRQRGLRARVRPLFCALLLSLGRFVRGDGVGGDPAVALPHRRFEYKYSFKGPHLVQS
                     :::: ::: :       :    :    : :::::: :::::.:.  
CCDS10       MPAVSGPGPLFCLLLLLLDPHSPETGC---P----PLRRFEYKLSFKGPRLALP
                     10        20               30        40       

               70        80        90       100       110       120
pF1KE4 DGTVPFWAHAGNAIPSSDQIRVAPSLKSQRGSVWTKTKAAFENWEVEVTFRVTGRGRIGA
        . .:::.: :.:: . ...:..::.... :.::..... :  ::::: .:::: :: ::
CCDS10 GAGIPFWSHHGDAILGLEEVRLTPSMRNRSGAVWSRASVPFSAWEVEVQMRVTGLGRRGA
        50        60        70        80        90       100       

              130       140       150       160       170       180
pF1KE4 DGLAIWYAENQGLEGPVFGSADLWNGVGIFFDSFDNDGKKNNPAIVIIGNNGQIHYDHQN
       .:.:.::....:  : :.:.   :.:.::::::  .:  ...::: .....:.:  .. .
CCDS10 QGMAVWYTRGRGHVGSVLGGLASWDGIGIFFDSPAED-TQDSPAIRVLASDGHIPSEQPG
       110       120       130       140        150       160      

              190       200       210       220       230       240
pF1KE4 DGASQALASCQRDFRNKPYPVRAKITYYQNTLTVMINNGFTPDKNDYEFCAKVENMIIPA
       :::::.:.::. ::::.:.: ::.:::. . : . .:.:.::. .  :::. :  ...  
CCDS10 DGASQGLGSCHWDFRNRPHPFRARITYWGQRLRMSLNSGLTPS-DPGEFCVDVGPLLLVP
        170       180       190       200        210       220     

              250       260       270        280       290         
pF1KE4 QGHFGISAATGGLADDHDVLSFLTFQLTEPGKE-PPTPDKEISEKEKEKYQEEFEHFQQE
        : ::.::::: :::::::::::::.:.::. : :: :  :. .. .   : :    .  
CCDS10 GGFFGVSAATGTLADDHDVLSFLTFSLSEPSPEVPPQPFLEM-QQLRLARQLEGLWARLG
         230       240       250       260        270       280    

     300       310       320           330       340       350     
pF1KE4 LDKKKEEFQKGHPDLQGQPAEEIF---ESVG-DRELRQVFEGQNRIHLEIKQLNRQLDMI
       :  ...   :.  . ::. .:..:   :..:  :.. :...: ..   .. : .::    
CCDS10 LGTREDVTPKSDSEAQGE-GERLFDLEETLGRHRRILQALRGLSK---QLAQAERQWKKQ
          290       300        310       320          330       340

           360       370       380             390       400       
pF1KE4 LDE--QRRYVSSLTEEISKRGAGMPGQHGQITQQ------ELDTVVKTQHEILRQVNEMK
       :    : :  .. . . : .  . ::. :.....      .. .... :  .:. ..::.
CCDS10 LGPPGQARPDGGWALDASCQIPSTPGRGGHLSMSLNKDSAKVGALLHGQWTLLQALQEMR
              350       360       370       380       390       400

       410       420       430       440       450       460       
pF1KE4 NSLSETVRLVSGMQHPGSAGGVYETTQHFIDIKEHLHIVKRDIDNLVQRNMPSNEKPKCP
       ..   .::...  :      :.    .::... . : ...... . ..    . . :. :
CCDS10 DA---AVRMAAEAQVSYLPVGI---EHHFLELDHILGLLQEELRGPAK---AAAKAPRPP
                 410          420       430       440          450 

       470        480       490       500       510                
pF1KE4 ELPPFPS-CLSTVHFIIFVVVQTVLFIGYIMYRSQQEAAAKKFF                
         ::  : ::.   :......::: :.::. .:..                         
CCDS10 GQPPRASSCLQPGIFLFYLLIQTVGFFGYVHFRQELNKSLQECLSTGSLPLGPAPHTPRA
             460       470       480       490       500       510 

CCDS10 LGILRRQPLPASMPA
             520      

>>CCDS4417.1 LMAN2 gene_id:10960|Hs108|chr5               (356 aa)
 initn: 408 init1: 160 opt: 570  Z-score: 509.1  bits: 103.2 E(32554): 4.9e-22
Smith-Waterman score: 570; 33.2% identity (64.8% similar) in 301 aa overlap (15-310:32-316)

                               10        20        30        40    
pF1KE4                 MAGSRQRGLRARVRPLFCALLLSLGRFVRGDGVGGDPAVALPHR
                                     :::  ::: ::  : .: . :.      : 
CCDS44 AAEGWIWRWGWGRRCLGRPGLLGPGPGPTTPLF--LLLLLGS-VTADITDGNS----EHL
              10        20        30          40         50        

           50        60        70        80        90       100    
pF1KE4 RFEYKYSFKGPHLVQSDGTVPFWAHAGNAIPSSDQIRVAPSLKSQRGSVWTKTKAAFENW
       . :.  :.  :.   .....:.:   :... .:. .:..:. .:..::.:..    ...:
CCDS44 KREH--SLIKPYQGVGSSSMPLWDFQGSTMLTSQYVRLTPDERSKEGSIWNHQPCFLKDW
             60        70        80        90       100       110  

          110         120       130       140       150         160
pF1KE4 EVEVTFRVTGRGR--IGADGLAIWYAENQGLEGPVFGSADLWNGVGIFFDSFDND--GKK
       :..: :.: : :.  . .::.:.::.... . :::::: : ..:..::.:.. ::   ..
CCDS44 EMHVHFKVHGTGKKNLHGDGIALWYTRDRLVPGPVFGSKDNFHGLAIFLDTYPNDETTER
            120       130       140       150       160       170  

              170       180       190       200       210       220
pF1KE4 NNPAIVIIGNNGQIHYDHQNDGASQALASCQRDFRNKPYPVRAKITYYQNTLTVMINNGF
         : : .. :::.. :::..::    ::.:  ::::. . .   . : .. :::: .   
CCDS44 VFPYISVMVNNGSLSYDHSKDGRWTELAGCTADFRNRDHDTFLAVRYSRGRLTVMTD---
            180       190       200       210       220            

              230       240       250       260       270       280
pF1KE4 TPDKNDYEFCAKVENMIIPAQGHFGISAATGGLADDHDVLSFLTFQLTEPGKEPPTPDKE
         :::... :  . .. .:.  .:: ::.:: :.:.::..:.  :::        :::.:
CCDS44 LEDKNEWKNCIDITGVRLPTGYYFGASAGTGDLSDNHDIISMKLFQLMV----EHTPDEE
     230       240       250       260       270           280     

              290        300       310       320       330         
pF1KE4 ISEKEKEKYQEEF-EHFQQELDKKKEEFQKGHPDLQGQPAEEIFESVGDRELRQVFEGQN
         .  : . . .: .  ....:    .:..:                             
CCDS44 SIDWTKIEPSVNFLKSPKDNVDDPTGNFRSGPLTGWRVFLLLLCALLGIVVCAVVGAVVF
         290       300       310       320       330       340     

     340       350       360       370       380       390         
pF1KE4 RIHLEIKQLNRQLDMILDEQRRYVSSLTEEISKRGAGMPGQHGQITQQELDTVVKTQHEI
                                                                   
CCDS44 QKRQERNKRFY                                                 
         350                                                       

>>CCDS2023.1 LMAN2L gene_id:81562|Hs108|chr2              (348 aa)
 initn: 265 init1: 180 opt: 528  Z-score: 472.6  bits: 96.5 E(32554): 5.3e-20
Smith-Waterman score: 528; 33.6% identity (62.7% similar) in 271 aa overlap (6-268:15-275)

                        10        20        30        40           
pF1KE4          MAGSRQRGLRARVRPLFCALLLSLGRFVRGDGVGGDPAVALPHRRFEY---
                     .: : ::    .  ::: ::    :.:    :  .   . :::   
CCDS20 MAATLGPLGSWQQWRRCLSARDGSRMLLLLLLLGS---GQG----PQQVGAGQTFEYLKR
               10        20        30               40        50   

       50        60        70        80        90       100        
pF1KE4 KYSFKGPHLVQSDGTVPFWAHAGNAIPSSDQIRVAPSLKSQRGSVWTKTKAAFENWEVEV
       ..:.. :.   . :.  .:   :::.  .. ::..:...:..:..:...   ...::..:
CCDS20 EHSLSKPYQGVGTGSSSLWNLMGNAMVMTQYIRLTPDMQSKQGALWNRVPCFLRDWELQV
            60        70        80        90       100       110   

      110         120       130       140       150       160      
pF1KE4 TFRVTGRGR--IGADGLAIWYAENQGLEGPVFGSADLWNGVGIFFDSFDNDGKKNN---P
        :.. :.:.  . .:::::::....   :::::. : . :.:.: :.. :. :...   :
CCDS20 HFKIHGQGKKNLHGDGLAIWYTKDRMQPGPVFGNMDKFVGLGVFVDTYPNEEKQQERVFP
           120       130       140       150       160       170   

           170       180       190       200       210       220   
pF1KE4 AIVIIGNNGQIHYDHQNDGASQALASCQRDFRNKPYPVRAKITYYQNTLTVMINNGFTPD
        :  . :::.. :::. ::    :..:    ::  : .   : : .  ::.:..      
CCDS20 YISAMVNNGSLSYDHERDGRPTELGGCTAIVRNLHYDTFLVIRYVKRHLTIMMD---IDG
           180       190       200       210       220          230

           230       240       250       260       270       280   
pF1KE4 KNDYEFCAKVENMIIPAQGHFGISAATGGLADDHDVLSFLTFQLTEPGKEPPTPDKEISE
       :.... : .: .. .:   .:: :. :: :.:.:::.:.  :.::               
CCDS20 KHEWRDCIEVPGVRLPRGYYFGTSSITGDLSDNHDVISLKLFELTVERTPEEEKLHRDVF
              240       250       260       270       280       290

           290       300       310       320       330       340   
pF1KE4 KEKEKYQEEFEHFQQELDKKKEEFQKGHPDLQGQPAEEIFESVGDRELRQVFEGQNRIHL
                                                                   
CCDS20 LPSVDNMKLPEMTAPLPPLSGLALFLIVFFSLVFSVFAIVIGIILYNKWQEQSRKRFY  
              300       310       320       330       340          




510 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov  6 00:48:31 2016 done: Sun Nov  6 00:48:31 2016
 Total Scan time:  3.370 Total Display time:  0.020

Function used was FASTA [36.3.4 Apr, 2011]