Result of FASTA (ccds) for pFN21AE1249
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1249, 494 aa
  1>>>pF1KE1249 494 - 494 aa - 494 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 11.3036+/-0.00109; mu= -3.2891+/- 0.066
 mean_var=556.1011+/-114.278, 0's: 0 Z-trim(117.3): 26  B-trim: 0 in 0/53
 Lambda= 0.054387
 statistics sampled from 18013 (18034) to 18013 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.799), E-opt: 0.2 (0.554), width:  16
 Scan time:  3.610

The best scores are:                                      opt bits E(32554)
CCDS45900.1 ONECUT3 gene_id:390874|Hs108|chr19     ( 494) 3398 281.0 2.1e-75
CCDS42440.1 ONECUT2 gene_id:9480|Hs108|chr18       ( 504) 1038 95.8 1.2e-19
CCDS10150.1 ONECUT1 gene_id:3175|Hs108|chr15       ( 465) 1028 95.0 1.9e-19
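
Note: the summary lines above can be pulled into records for downstream filtering. The following is only a minimal sketch, assuming every hit line follows the same layout as the three shown here (accession, gene symbol, gene_id field, library sequence length in parentheses, then opt, bits and E-value). The names HIT_RE and parse_hit are hypothetical and are not part of FASTA.

```python
import re

# Hypothetical pattern inferred from the three "best scores" lines above;
# other FASTA library formats may need a different expression.
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<gene>\S+)\s+gene_id:(?P<gene_id>\d+)\|\S+\s+"
    r"\(\s*(?P<length>\d+)\)\s+(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<evalue>\S+)\s*$"
)

def parse_hit(line):
    """Return a dict for one summary line, or None if the line does not match."""
    m = HIT_RE.match(line)
    if m is None:
        return None
    return {
        "accession": m["acc"],
        "gene": m["gene"],
        "gene_id": int(m["gene_id"]),
        "length": int(m["length"]),   # library sequence length in aa
        "opt": int(m["opt"]),         # optimized score
        "bits": float(m["bits"]),
        "evalue": float(m["evalue"]),
    }

example = "CCDS42440.1 ONECUT2 gene_id:9480|Hs108|chr18       ( 504) 1038 95.8 1.2e-19"
hit = parse_hit(example)
```

Lines that do not match, such as the column header, return None and can simply be skipped.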


>>CCDS45900.1 ONECUT3 gene_id:390874|Hs108|chr19          (494 aa)
 initn: 3398 init1: 3398 opt: 3398  Z-score: 1467.2  bits: 281.0 E(32554): 2.1e-75
Smith-Waterman score: 3398; 100.0% identity (100.0% similar) in 494 aa overlap (1-494:1-494)

               10        20        30        40        50        60
pF1KE1 MELSLESLGGLHSVAHAQAGELLSPGHARSAAAQHRGLVAPGRPGLVAGMASLLDGGGGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 MELSLESLGGLHSVAHAQAGELLSPGHARSAAAQHRGLVAPGRPGLVAGMASLLDGGGGG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 GGGGAGGAGGAGSAGGGADFRGELAGPLHPAMGMACEAPGLGGTYTTLTPLQHLPPLAAV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 GGGGAGGAGGAGSAGGGADFRGELAGPLHPAMGMACEAPGLGGTYTTLTPLQHLPPLAAV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 ADKFHQHAAAAAVAGAHGGHPHAHPHPAAAPPPPPPPQRLAASVSGSFTLMRDERAALAS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 ADKFHQHAAAAAVAGAHGGHPHAHPHPAAAPPPPPPPQRLAASVSGSFTLMRDERAALAS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE1 VGHLYGPYGKELPAMGSPLSPLPNALPPALHGAPQPPPPPPPPPLAAYGPPGHLAGDKLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 VGHLYGPYGKELPAMGSPLSPLPNALPPALHGAPQPPPPPPPPPLAAYGPPGHLAGDKLL
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE1 PPAAFEPHAALLGRAEDALARGLPGGGGGTGSGGAGSGSAAGLLAPLGGLAAAGAHGPHG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 PPAAFEPHAALLGRAEDALARGLPGGGGGTGSGGAGSGSAAGLLAPLGGLAAAGAHGPHG
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE1 GGGGPGGSGGGPSAGAAAEEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 GGGGPGGSGGGPSAGAAAEEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLL
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE1 RNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEQQKERALQPKKQRLV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 RNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEQQKERALQPKKQRLV
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KE1 FTDLQRRTLIAIFKENKRPSKEMQVTISQQLGLELNTVSNFFMNARRRCMNRWAEEPSTA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 FTDLQRRTLIAIFKENKRPSKEMQVTISQQLGLELNTVSNFFMNARRRCMNRWAEEPSTA
              430       440       450       460       470       480

              490    
pF1KE1 PGGPAGATATFSKA
       ::::::::::::::
CCDS45 PGGPAGATATFSKA
              490    

>>CCDS42440.1 ONECUT2 gene_id:9480|Hs108|chr18            (504 aa)
 initn: 1167 init1: 949 opt: 1038  Z-score: 466.3  bits: 95.8 E(32554): 1.2e-19
Smith-Waterman score: 1452; 52.4% identity (69.1% similar) in 531 aa overlap (2-494:23-504)

                                    10        20                   
pF1KE1                      MELSLESLGGLHSVAHAQAG------------------E
                             ::..:::: ::. : . .:                  :
CCDS42 MKAAYTAYRCLTKDLEGCAMNPELTMESLGTLHGPAGGGSGGGGGGGGGGGGGGPGHEQE
               10        20        30        40        50        60

                  30        40        50        60        70       
pF1KE1 LL---SPGHA-RSAAAQHRGLVAPGRPGLVAGMASLLDGGGGGGGGGAGGAGGAGSAGGG
       ::   :: :: :.::.. ::   :       : :.    ....... .. . . .:   :
CCDS42 LLASPSPHHAGRGAAGSLRGPPPPPTAHQELGTAA----AAAAAASRSAMVTSMASILDG
               70        80        90           100       110      

        80        90         100         110       120       130   
pF1KE1 ADFRGELAGPLHPAMGMACEA--PGLG--GTYTTLTPLQHLPPLAAVADKFHQHAAAAAV
       .:.: ::. ::: ::.:.:..  ::.:  .::::::::: :::...:.::::.       
CCDS42 GDYRPELSIPLHHAMSMSCDSSPPGMGMSNTYTTLTPLQPLPPISTVSDKFHHP------
        120       130       140       150       160       170      

           140       150       160       170       180       190   
pF1KE1 AGAHGGHPHAHPHPAAAPPPPPPPQRLAASVSGSFTLMRDERAALASVGHLYGPYGKELP
             ::: :::           :::...::::::::::::. : ....::.:: ::.:
CCDS42 ------HPHHHPHHHHH----HHHQRLSGNVSGSFTLMRDERG-LPAMNNLYSPY-KEMP
                    180           190       200        210         

           200            210       220       230       240        
pF1KE1 AMGSPLSPLPNALP-----PALHGAPQPPPPPPPPPLAAYGPPGHLAGDKLLPPAAFEPH
       .:.. ::::  : :      .::.: :  :         ::::::   ::.: :     :
CCDS42 GMSQSLSPLA-ATPLGNGLGGLHNAQQSLP--------NYGPPGH---DKMLSPNFDAHH
      220        230       240               250          260      

      250       260       270       280       290          300     
pF1KE1 AALLGRAEDALARGLPGGGGGTGSGGAGSGSAAGLLAPLGGLAAAG---AHGPHGGGG--
       .:.: :.:. :.:::            :.  :: ... :.::   :   .:::  . .  
CCDS42 TAMLTRGEQHLSRGL------------GTPPAA-MMSHLNGLHHPGHTQSHGPVLAPSRE
        270       280                    290       300       310   

            310        320       330       340       350       360 
pF1KE1 -GPGGSGGGPSAGAAA-EEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLR
         :..:.:.  : ..  :::::::::::::::::::::::::::::.:::::::::::::
CCDS42 RPPSSSSGSQVATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLR
           320       330       340       350       360       370   

             370       380       390       400       410       420 
pF1KE1 NPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEQQKERALQPKKQRLVF
       :::::::::::::::::::::::::::::::::::::::::::: .:.:  . ::.::::
CCDS42 NPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEPNKDRNNSQKKSRLVF
           380       390       400       410       420       430   

             430       440       450       460       470       480 
pF1KE1 TDLQRRTLIAIFKENKRPSKEMQVTISQQLGLELNTVSNFFMNARRRCMNRWAEEPSTAP
       ::::::::.::::::::::::::.::::::::::.:::::::::::: ...: .. ::  
CCDS42 TDLQRRTLFAIFKENKRPSKEMQITISQQLGLELTTVSNFFMNARRRSLEKWQDDLST--
           440       450       460       470       480       490   

             490    
pF1KE1 GGPAGATATFSKA
       :: .....: .::
CCDS42 GGSSSTSSTCTKA
             500    

>>CCDS10150.1 ONECUT1 gene_id:3175|Hs108|chr15            (465 aa)
 initn: 1320 init1: 963 opt: 1028  Z-score: 462.5  bits: 95.0 E(32554): 1.9e-19
Smith-Waterman score: 1500; 54.3% identity (70.6% similar) in 506 aa overlap (2-494:4-465)

                 10            20        30         40        50   
pF1KE1   MELSLESLGGLHSVAH----AQAGELLSPGHARSAAAQHRGL-VAPGRPGLVAGMASL
          .:..:..: ::.:.:    : :  : .  ::::..: :::  . :..:  . :::::
CCDS10 MNAQLTMEAIGELHGVSHEPVPAPADLLGGSPHARSSVA-HRGSHLPPAHPRSM-GMASL
               10        20        30         40        50         

            60        70        80        90        100         110
pF1KE1 LDGGGGGGGGGAGGAGGAGSAGGGADFRGELAGPLHPAMGMACEAP-GLG--GTYTTLTP
       ::::.:::       .   :          :::::::.: ::::.: :..   :::::::
CCDS10 LDGGSGGGDYHHHHRAPEHS----------LAGPLHPTMTMACETPPGMSMPTTYTTLTP
       60        70                  80        90       100        

              120        130       140       150       160         
pF1KE1 LQHLPPLAAVADKF-HQHAAAAAVAGAHGGHPHAHPHPAAAPPPPPPPQRLAASVSGSFT
       :: :::...:.::: :.:         :  : : :::           ::::..::::::
CCDS10 LQPLPPISTVSDKFPHHH---------HHHHHHHHPHHH---------QRLAGNVSGSFT
      110       120                130                140       150

     170       180       190       200       210       220         
pF1KE1 LMRDERAALASVGHLYGPYGKELPAMGSPLSPLPNALPPALHGAPQPPPPPPPPPLAAYG
       ::::::. :::...:: :: :.. .::. :::: ..   ..:.. :  :         :.
CCDS10 LMRDERG-LASMNNLYTPYHKDVAGMGQSLSPLSSSGLGSIHNSQQGLPH--------YA
               160       170       180       190               200 

     230        240        250        260       270       280      
pF1KE1 PPGH-LAGDKLLPPAAFEPH-AALLGR-AEDALARGLPGGGGGTGSGGAGSGSAAGLLAP
        ::  .  ::.: : .:: :  :.::: .:. :.   : ..: .  .:       . :  
CCDS10 HPGAAMPTDKMLTPNGFEAHHPAMLGRHGEQHLT---PTSAGMVPINGLPPHHPHAHLNA
             210       220       230          240       250        

         290       300       310       320       330       340     
pF1KE1 LG-GLAAAGAHGPHGGGGGPGGSGGGPSAGAAAEEINTKEVAQRITAELKRYSIPQAIFA
        : :   . :. :. .  :   :.:  : ..  :::::::::::::.:::::::::::::
CCDS10 QGHGQLLGTAREPNPSVTGAQVSNG--SNSGQMEEINTKEVAQRITTELKRYSIPQAIFA
      260       270       280         290       300       310      

         350       360       370       380       390       400     
pF1KE1 QRILCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQE
       ::.:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 QRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQE
        320       330       340       350       360       370      

         410       420       430       440       450       460     
pF1KE1 QQKERALQPKKQRLVFTDLQRRTLIAIFKENKRPSKEMQVTISQQLGLELNTVSNFFMNA
       . :.:.  ::: ::::::.::::: ::::::::::::.:.::::::::::.:::::::::
CCDS10 HGKDRGNTPKKPRLVFTDVQRRTLHAIFKENKRPSKELQITISQQLGLELSTVSNFFMNA
        380       390       400       410       420       430      

         470       480       490    
pF1KE1 RRRCMNRWAEEPSTAPGGPAGATATFSKA
       ::: ...: .: :.  :. .....: .::
CCDS10 RRRSLDKWQDEGSSNSGNSSSSSSTCTKA
        440       450       460     




494 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov  8 04:29:19 2016 done: Tue Nov  8 04:29:19 2016
 Total Scan time:  3.610 Total Display time:  0.000
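
Note: as a rough sanity check on the bits and E(32554) columns reported above, the E-value can be approximated from the bit score as query length x library residues x 2^(-bits). This is a sketch under that assumption only. FASTA derives its reported E-values from its own statistical fit, so the figures agree only approximately. The function name approx_evalue is hypothetical.

```python
def approx_evalue(bits, query_len, library_residues):
    """Approximate E-value from a bit score: m * n * 2**(-bits).

    This is an assumed approximation for illustration, not the exact
    calculation FASTA performs.
    """
    return query_len * library_residues * 2.0 ** (-bits)

# Values taken from this report: 494 aa query, 18511270 library residues.
QUERY_LEN = 494
LIBRARY_RESIDUES = 18511270

for gene, bits in (("ONECUT3", 281.0), ("ONECUT2", 95.8), ("ONECUT1", 95.0)):
    print(gene, "%.1e" % approx_evalue(bits, QUERY_LEN, LIBRARY_RESIDUES))
```

The approximations land in the same order of magnitude as the reported 2.1e-75, 1.2e-19 and 1.9e-19.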

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or suggestions?
Send a message to flexiclone AT kazusagt.com