Result of FASTA (ccds) for pFN21AB5806
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB5806, 452 aa
  1>>>pF1KB5806 452 - 452 aa - 452 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 6.2066+/-0.000819; mu= 12.8892+/- 0.049
 mean_var=74.5473+/-15.318, 0's: 0 Z-trim(107.6): 22  B-trim: 47 in 1/49
 Lambda= 0.148545
 statistics sampled from 9681 (9692) to 9681 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.68), E-opt: 0.2 (0.298), width:  16
 Scan time:  3.260

The best scores are:                                      opt bits E(32554)
CCDS4933.1 TFAP2D gene_id:83741|Hs108|chr6         ( 452) 3014 655.2 3.8e-188
CCDS4510.1 TFAP2A gene_id:7020|Hs108|chr6          ( 437) 1185 263.3 3.6e-70
CCDS34337.1 TFAP2A gene_id:7020|Hs108|chr6         ( 431) 1181 262.4 6.4e-70
CCDS43422.1 TFAP2A gene_id:7020|Hs108|chr6         ( 433) 1179 262.0 8.7e-70
CCDS4934.2 TFAP2B gene_id:7021|Hs108|chr6          ( 460) 1105 246.1 5.4e-65
CCDS393.2 TFAP2E gene_id:339488|Hs108|chr1         ( 442) 1069 238.4 1.1e-62
CCDS13454.1 TFAP2C gene_id:7022|Hs108|chr20        ( 450) 1008 225.3 9.7e-59


>>CCDS4933.1 TFAP2D gene_id:83741|Hs108|chr6              (452 aa)
 initn: 3014 init1: 3014 opt: 3014  Z-score: 3491.4  bits: 655.2 E(32554): 3.8e-188
Smith-Waterman score: 3014; 100.0% identity (100.0% similar) in 452 aa overlap (1-452:1-452)

               10        20        30        40        50        60
pF1KB5 MSTTFPGLVHDAEIRHDGSNSYRLMQLGCLESVANSTVAYSSSSPLTYSTTGTEFASPYF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 MSTTFPGLVHDAEIRHDGSNSYRLMQLGCLESVANSTVAYSSSSPLTYSTTGTEFASPYF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 STNHQYTPLHHQSFHYEFQHSHPAVTPDAYSLNSLHHSQQYYQQIHHGEPTDFINLHNAR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 STNHQYTPLHHQSFHYEFQHSHPAVTPDAYSLNSLHHSQQYYQQIHHGEPTDFINLHNAR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB5 ALKSSCLDEQRRELGCLDAYRRHDLSLMSHGSQYGMHPDQRLLPGPSLGLAAAGADDLQG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 ALKSSCLDEQRRELGCLDAYRRHDLSLMSHGSQYGMHPDQRLLPGPSLGLAAAGADDLQG
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB5 SVEAQCGLVLNGQGGVIRRGGTCVVNPTDLFCSVPGRLSLLSSTSKYKVTIAEVKRRLSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 SVEAQCGLVLNGQGGVIRRGGTCVVNPTDLFCSVPGRLSLLSSTSKYKVTIAEVKRRLSP
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB5 PECLNASLLGGILRRAKSKNGGRCLREKLDRLGLNLPAGRRKAANVTLLTSLVEGEALHL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 PECLNASLLGGILRRAKSKNGGRCLREKLDRLGLNLPAGRRKAANVTLLTSLVEGEALHL
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB5 ARDFGYTCETEFPAKAVGEHLARQHMEQKEQTARKKMILATKQICKEFQDLLSQDRSPLG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 ARDFGYTCETEFPAKAVGEHLARQHMEQKEQTARKKMILATKQICKEFQDLLSQDRSPLG
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB5 SSRPTPILDLDIQRHLTHFSLITHGFGTPAICAALSTFQTVLSEMLNYLEKHTTHKNGGA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 SSRPTPILDLDIQRHLTHFSLITHGFGTPAICAALSTFQTVLSEMLNYLEKHTTHKNGGA
              370       380       390       400       410       420

              430       440       450  
pF1KB5 ADSGQGHANSEKAPLRKTSEAAVKEGKTEKTD
       ::::::::::::::::::::::::::::::::
CCDS49 ADSGQGHANSEKAPLRKTSEAAVKEGKTEKTD
              430       440       450  

>>CCDS4510.1 TFAP2A gene_id:7020|Hs108|chr6               (437 aa)
 initn: 1150 init1: 1098 opt: 1185  Z-score: 1373.3  bits: 263.3 E(32554): 3.6e-70
Smith-Waterman score: 1218; 50.3% identity (70.2% similar) in 453 aa overlap (11-437:13-437)

                 10        20          30        40        50      
pF1KB5   MSTTFPGLVHDAEIRHDGSN--SYRLMQLGCLESVANSTVAYSSSSPLTYSTTGTEFA
                   : : ::::..  . :: :::   .:..:   :.:. ::.. : ...: 
CCDS45 MLWKLTDNIKYEDCEDRHDGTSNGTARLPQLG---TVGQSP--YTSAPPLSH-TPNADFQ
               10        20        30             40         50    

         60        70        80        90       100       110      
pF1KB5 SPYFSTNHQYTPLHHQSFHYEFQHSHPAVTPDAYSLNSLHHSQQYYQQIHHGEPTDFIN-
        :::   .:  :.. ::   .  .::   . : :::: :: . :     : : : .  . 
CCDS45 PPYFPPPYQ--PIYPQS---QDPYSH---VNDPYSLNPLHAQPQPQ---HPGWPGQRQSQ
           60             70           80        90          100   

             120         130       140                        150  
pF1KB5 ----LHNARAL--KSSCLDEQRRELGCLDAYRRH---------------DLSLMS--HGS
           ::. :.:  . : ::  ::.      ::::               :::. :  :. 
CCDS45 ESGLLHTHRGLPHQLSGLDP-RRD------YRRHEDLLHGPHALSSGLGDLSIHSLPHAI
           110       120              130       140       150      

            160       170       180       190       200       210  
pF1KB5 QYGMHPDQRLLPGPSLGLAAAGADDLQGSVEAQCGLVLNGQGGVIRRGGTCVVNPTDLFC
       .   : ..  .  :.  .   :  .:. :     . .  .. ...  ::  ::::...::
CCDS45 EEVPHVEDPGINIPDQTVIKKGPVSLSKSNSNAVSAIPINKDNLF--GG--VVNPNEVFC
        160       170       180       190       200           210  

            220       230       240       250       260       270  
pF1KB5 SVPGRLSLLSSTSKYKVTIAEVKRRLSPPECLNASLLGGILRRAKSKNGGRCLREKLDRL
       ::::::::::::::::::.:::.::::::::::::::::.::::::::::: ::::::..
CCDS45 SVPGRLSLLSSTSKYKVTVAEVQRRLSPPECLNASLLGGVLRRAKSKNGGRSLREKLDKI
            220       230       240       250       260       270  

            280       290       300       310       320       330  
pF1KB5 GLNLPAGRRKAANVTLLTSLVEGEALHLARDFGYTCETEFPAKAVGEHLARQHMEQKEQT
       :::::::::::::::::::::::::.::::::::.::::::::::.: : ::: . .::.
CCDS45 GLNLPAGRRKAANVTLLTSLVEGEAVHLARDFGYVCETEFPAKAVAEFLNRQHSDPNEQV
            280       290       300       310       320       330  

            340       350       360       370       380       390  
pF1KB5 ARKKMILATKQICKEFQDLLSQDRSPLGSSRPTPILDLDIQRHLTHFSLITHGFGTPAIC
       .::.:.:::::::::: :::.:::::::.:::.:::.  ::  ::::.::.::::.::.:
CCDS45 TRKNMLLATKQICKEFTDLLAQDRSPLGNSRPNPILEPGIQSCLTHFNLISHGFGSPAVC
            340       350       360       370       380       390  

            400       410       420       430       440       450  
pF1KB5 AALSTFQTVLSEMLNYLEKHTTHKNGGAADSGQGHANSEKAPLRKTSEAAVKEGKTEKTD
       ::....:. :.: :. ..:    .: ..  ..........   ::               
CCDS45 AAVTALQNYLTEALKAMDKMYLSNNPNSHTDNNAKSSDKEEKHRK               
            400       410       420       430                      

>>CCDS34337.1 TFAP2A gene_id:7020|Hs108|chr6              (431 aa)
 initn: 1150 init1: 1098 opt: 1181  Z-score: 1368.7  bits: 262.4 E(32554): 6.4e-70
Smith-Waterman score: 1214; 50.2% identity (70.1% similar) in 458 aa overlap (8-437:2-431)

               10          20          30        40        50      
pF1KB5 MSTTFPGLVHD--AEIRHDGSN--SYRLMQLGCLESVANSTVAYSSSSPLTYSTTGTEFA
              :::.  :  ::::..  . :: :::   .:..:   :.:. ::.. : ...: 
CCDS34       MLVHSFSAMDRHDGTSNGTARLPQLG---TVGQSP--YTSAPPLSH-TPNADFQ
                     10        20           30          40         

         60        70        80        90       100       110      
pF1KB5 SPYFSTNHQYTPLHHQSFHYEFQHSHPAVTPDAYSLNSLHHSQQYYQQIHHGEPTDFIN-
        :::   .:  :.. ::   .  .::   . : :::: :: . :     : : : .  . 
CCDS34 PPYFPPPYQ--PIYPQS---QDPYSH---VNDPYSLNPLHAQPQ---PQHPGWPGQRQSQ
       50          60              70        80           90       

             120         130       140                        150  
pF1KB5 ----LHNARAL--KSSCLDEQRRELGCLDAYRRH---------------DLSLMS--HGS
           ::. :.:  . : ::  ::.      ::::               :::. :  :. 
CCDS34 ESGLLHTHRGLPHQLSGLDP-RRD------YRRHEDLLHGPHALSSGLGDLSIHSLPHAI
       100       110        120             130       140       150

            160       170       180       190       200       210  
pF1KB5 QYGMHPDQRLLPGPSLGLAAAGADDLQGSVEAQCGLVLNGQGGVIRRGGTCVVNPTDLFC
       .   : ..  .  :.  .   :  .:. :     . .  .. ...  ::  ::::...::
CCDS34 EEVPHVEDPGINIPDQTVIKKGPVSLSKSNSNAVSAIPINKDNLF--GG--VVNPNEVFC
              160       170       180       190           200      

            220       230       240       250       260       270  
pF1KB5 SVPGRLSLLSSTSKYKVTIAEVKRRLSPPECLNASLLGGILRRAKSKNGGRCLREKLDRL
       ::::::::::::::::::.:::.::::::::::::::::.::::::::::: ::::::..
CCDS34 SVPGRLSLLSSTSKYKVTVAEVQRRLSPPECLNASLLGGVLRRAKSKNGGRSLREKLDKI
        210       220       230       240       250       260      

            280       290       300       310       320       330  
pF1KB5 GLNLPAGRRKAANVTLLTSLVEGEALHLARDFGYTCETEFPAKAVGEHLARQHMEQKEQT
       :::::::::::::::::::::::::.::::::::.::::::::::.: : ::: . .::.
CCDS34 GLNLPAGRRKAANVTLLTSLVEGEAVHLARDFGYVCETEFPAKAVAEFLNRQHSDPNEQV
        270       280       290       300       310       320      

            340       350       360       370       380       390  
pF1KB5 ARKKMILATKQICKEFQDLLSQDRSPLGSSRPTPILDLDIQRHLTHFSLITHGFGTPAIC
       .::.:.:::::::::: :::.:::::::.:::.:::.  ::  ::::.::.::::.::.:
CCDS34 TRKNMLLATKQICKEFTDLLAQDRSPLGNSRPNPILEPGIQSCLTHFNLISHGFGSPAVC
        330       340       350       360       370       380      

            400       410       420       430       440       450  
pF1KB5 AALSTFQTVLSEMLNYLEKHTTHKNGGAADSGQGHANSEKAPLRKTSEAAVKEGKTEKTD
       ::....:. :.: :. ..:    .: ..  ..........   ::               
CCDS34 AAVTALQNYLTEALKAMDKMYLSNNPNSHTDNNAKSSDKEEKHRK               
        390       400       410       420       430                

>>CCDS43422.1 TFAP2A gene_id:7020|Hs108|chr6              (433 aa)
 initn: 1150 init1: 1098 opt: 1179  Z-score: 1366.4  bits: 262.0 E(32554): 8.7e-70
Smith-Waterman score: 1212; 50.1% identity (70.2% similar) in 453 aa overlap (11-437:9-433)

               10        20          30        40        50        
pF1KB5 MSTTFPGLVHDAEIRHDGSN--SYRLMQLGCLESVANSTVAYSSSSPLTYSTTGTEFASP
                 : . ::::..  . :: :::   .:..:   :.:. ::.. : ...:  :
CCDS43   MSILAKMGDWQDRHDGTSNGTARLPQLG---TVGQSP--YTSAPPLSH-TPNADFQPP
                 10        20           30          40         50  

       60        70        80        90       100       110        
pF1KB5 YFSTNHQYTPLHHQSFHYEFQHSHPAVTPDAYSLNSLHHSQQYYQQIHHGEPTDFIN---
       ::   .:  :.. ::   .  .::   . : :::: :: . :     : : : .  .   
CCDS43 YFPPPYQ--PIYPQS---QDPYSH---VNDPYSLNPLHAQPQPQ---HPGWPGQRQSQES
               60           70           80           90       100 

           120         130       140                        150    
pF1KB5 --LHNARAL--KSSCLDEQRRELGCLDAYRRH---------------DLSLMS--HGSQY
         ::. :.:  . : ::  ::.      ::::               :::. :  :. . 
CCDS43 GLLHTHRGLPHQLSGLDP-RRD------YRRHEDLLHGPHALSSGLGDLSIHSLPHAIEE
             110        120             130       140       150    

          160       170       180       190       200       210    
pF1KB5 GMHPDQRLLPGPSLGLAAAGADDLQGSVEAQCGLVLNGQGGVIRRGGTCVVNPTDLFCSV
         : ..  .  :.  .   :  .:. :     . .  .. ...  ::  ::::...::::
CCDS43 VPHVEDPGINIPDQTVIKKGPVSLSKSNSNAVSAIPINKDNLF--GG--VVNPNEVFCSV
          160       170       180       190           200       210

          220       230       240       250       260       270    
pF1KB5 PGRLSLLSSTSKYKVTIAEVKRRLSPPECLNASLLGGILRRAKSKNGGRCLREKLDRLGL
       ::::::::::::::::.:::.::::::::::::::::.::::::::::: ::::::..::
CCDS43 PGRLSLLSSTSKYKVTVAEVQRRLSPPECLNASLLGGVLRRAKSKNGGRSLREKLDKIGL
              220       230       240       250       260       270

          280       290       300       310       320       330    
pF1KB5 NLPAGRRKAANVTLLTSLVEGEALHLARDFGYTCETEFPAKAVGEHLARQHMEQKEQTAR
       :::::::::::::::::::::::.::::::::.::::::::::.: : ::: . .::..:
CCDS43 NLPAGRRKAANVTLLTSLVEGEAVHLARDFGYVCETEFPAKAVAEFLNRQHSDPNEQVTR
              280       290       300       310       320       330

          340       350       360       370       380       390    
pF1KB5 KKMILATKQICKEFQDLLSQDRSPLGSSRPTPILDLDIQRHLTHFSLITHGFGTPAICAA
       :.:.:::::::::: :::.:::::::.:::.:::.  ::  ::::.::.::::.::.:::
CCDS43 KNMLLATKQICKEFTDLLAQDRSPLGNSRPNPILEPGIQSCLTHFNLISHGFGSPAVCAA
              340       350       360       370       380       390

          400       410       420       430       440       450  
pF1KB5 LSTFQTVLSEMLNYLEKHTTHKNGGAADSGQGHANSEKAPLRKTSEAAVKEGKTEKTD
       ....:. :.: :. ..:    .: ..  ..........   ::               
CCDS43 VTALQNYLTEALKAMDKMYLSNNPNSHTDNNAKSSDKEEKHRK               
              400       410       420       430                  

>>CCDS4934.2 TFAP2B gene_id:7021|Hs108|chr6               (460 aa)
 initn: 1239 init1: 1049 opt: 1105  Z-score: 1280.3  bits: 246.1 E(32554): 5.4e-65
Smith-Waterman score: 1212; 51.4% identity (69.2% similar) in 451 aa overlap (13-432:27-454)

                             10          20        30        40    
pF1KB5               MSTTFPGLVHDAEIRHDG--SNSYRLMQLGCLESVANSTVAYSSSS
                                 : ::::  :.: :: :::   ::...   :::. 
CCDS49 MHSPPRDQAAIMLWKLVENVKYEDIYEDRHDGVPSHSSRLSQLG---SVSQG--PYSSAP
               10        20        30        40             50     

           50        60        70        80        90       100    
pF1KB5 PLTYSTTGTEFASPYFSTNHQYTPLHHQSFHYEFQHSHPAVTPDAYSLNSLHHSQQY---
       ::.. : ...:  :::   .:  : .:::   .  .::   . : :::: ::. ::.   
CCDS49 PLSH-TPSSDFQPPYFPPPYQPLP-YHQS---QDPYSH---VNDPYSLNPLHQPQQHPWG
           60        70         80              90       100       

               110       120       130       140         150       
pF1KB5 -YQQIHHG-EPTDFINLHNARALKSSCLDEQRRELGCLDAYRRHDLSLMS--HGSQYGMH
         :. . : :  ...    :   . : ::  ::.     . :: :. : :  :: . :: 
CCDS49 QRQRQEVGSEAGSLLPQPRAALPQLSGLDP-RRD---YHSVRRPDVLLHSAHHGLDAGMG
       110       120       130        140          150       160   

       160       170       180        190                       200
pF1KB5 PDQRLLPGPSLGLAAAGADDLQGSVEAQ-CGLVLNGQGGVIRR----------------G
        .  :      ::.  : .:.:.  .:.  :. :  :. ::..                :
CCDS49 DSLSLH-----GLGHPGMEDVQSVEDANNSGMNLLDQS-VIKKVPVPPKSVTSLMMNKDG
                170       180       190        200       210       

                210       220       230       240       250        
pF1KB5 --GTCVVNPTDLFCSVPGRLSLLSSTSKYKVTIAEVKRRLSPPECLNASLLGGILRRAKS
         :   ::  ..::::::::::::::::::::..::.::::::::::::::::.::::::
CCDS49 FLGGMSVNTGEVFCSVPGRLSLLSSTSKYKVTVGEVQRRLSPPECLNASLLGGVLRRAKS
       220       230       240       250       260       270       

      260       270       280       290       300       310        
pF1KB5 KNGGRCLREKLDRLGLNLPAGRRKAANVTLLTSLVEGEALHLARDFGYTCETEFPAKAVG
       ::::: :::.:...:::::::::::::::::::::::::.:::::::: ::::::::::.
CCDS49 KNGGRSLRERLEKIGLNLPAGRRKAANVTLLTSLVEGEAVHLARDFGYICETEFPAKAVS
       280       290       300       310       320       330       

      320       330       340       350       360       370        
pF1KB5 EHLARQHMEQKEQTARKKMILATKQICKEFQDLLSQDRSPLGSSRPTPILDLDIQRHLTH
       :.: ::: . ..  .::.:.:::::.:::: :::.:::.:.:.:::.:::.  ::  :::
CCDS49 EYLNRQHTDPSDLHSRKNMLLATKQLCKEFTDLLAQDRTPIGNSRPSPILEPGIQSCLTH
       340       350       360       370       380       390       

      380       390       400       410          420       430     
pF1KB5 FSLITHGFGTPAICAALSTFQTVLSEMLNYLEK---HTTHKNGGAADSGQGHANSEKAPL
       :::::::::.:::::::...:. :.: :. ..:   ..:  :  ..  : :  ...:   
CCDS49 FSLITHGFGAPAICAALTALQNYLTEALKGMDKMFLNNTTTNRHTSGEGPGSKTGDKEEK
       400       410       420       430       440       450       

         440       450  
pF1KB5 RKTSEAAVKEGKTEKTD
                        
CCDS49 HRK              
       460              

>>CCDS393.2 TFAP2E gene_id:339488|Hs108|chr1              (442 aa)
 initn: 1152 init1: 1051 opt: 1069  Z-score: 1238.8  bits: 238.4 E(32554): 1.1e-62
Smith-Waterman score: 1122; 50.2% identity (66.0% similar) in 430 aa overlap (39-437:31-442)

       10        20        30        40           50        60     
pF1KB5 VHDAEIRHDGSNSYRLMQLGCLESVANSTVAYSSSSPLTYS---TTGTEFASPYFSTNHQ
                                     ::. . :: ..   :...::  :::   . 
CCDS39 MLVHTYSAMERPDGLGAAAGGARLSSLPQAAYGPAPPLCHTPAATAAAEFQPPYFPPPYP
               10        20        30        40        50        60

          70        80        90        100       110       120    
pF1KB5 YTPLHHQSFHYEFQHSHPAVTPDAYS-LNSLHHSQQYYQQIHHGEPTDFINLHNARALKS
         :: . .   .   . : .. : :. :  : . :    :   . :      :.      
CCDS39 QPPLPYGQ-APDAAAAFPHLAGDPYGGLAPLAQPQP--PQAAWAAPRAAARAHEE---PP
                70        80        90         100       110       

          130       140         150         160        170         
pF1KB5 SCLDEQRRELGCLDAYRRHDLSL--MSHGSQYGMH--PDQRL-LPGPSLGLAAA-GADDL
       . :    : :: ::  : .  ..  . ::   : :   :  : ::    ::::: : .::
CCDS39 GLLAPPARALG-LDPRRDYATAVPRLLHGLADGAHGLADAPLGLP----GLAAAPGLEDL
          120        130       140       150           160         

      180       190                            200       210       
pF1KB5 QGSVEAQCGLVLNGQGGVIRR---------------------GGTCVVNPTDLFCSVPGR
       :.  :   :. :  :. ::..                     ::  ..:: ..:::::::
CCDS39 QAMDEP--GMSLLDQS-VIKKVPIPSKASSLSALSLAKDSLVGG--ITNPGEVFCSVPGR
     170         180        190       200       210         220    

       220       230       240       250       260       270       
pF1KB5 LSLLSSTSKYKVTIAEVKRRLSPPECLNASLLGGILRRAKSKNGGRCLREKLDRLGLNLP
       :::::::::::::..::.::::::::::::::::.:::::::::::::::.:...:::::
CCDS39 LSLLSSTSKYKVTVGEVQRRLSPPECLNASLLGGVLRRAKSKNGGRCLRERLEKIGLNLP
          230       240       250       260       270       280    

       280       290       300       310       320       330       
pF1KB5 AGRRKAANVTLLTSLVEGEALHLARDFGYTCETEFPAKAVGEHLARQHMEQKEQTARKKM
       ::::::::::::::::::::.::::::::.:::::::::..:.: ::: .  :  .::.:
CCDS39 AGRRKAANVTLLTSLVEGEAVHLARDFGYVCETEFPAKAAAEYLCRQHADPGELHSRKSM
          290       300       310       320       330       340    

       340       350       360       370       380       390       
pF1KB5 ILATKQICKEFQDLLSQDRSPLGSSRPTPILDLDIQRHLTHFSLITHGFGTPAICAALST
       .::.::::::: ::..:::::::.:::. ::.  .:  :::::::::::: :::::::..
CCDS39 LLAAKQICKEFADLMAQDRSPLGNSRPALILEPGVQSCLTHFSLITHGFGGPAICAALTA
          350       360       370       380       390       400    

       400       410       420       430       440       450  
pF1KB5 FQTVLSEMLNYLEKHTTHKNGGAADSGQGHANSEKAPLRKTSEAAVKEGKTEKTD
       ::. : : :. :.:    . :  .  :. .:. . :  ::               
CCDS39 FQNYLLESLKGLDKMFLSSVG--SGHGETKASEKDAKHRK               
          410       420         430       440                 

>>CCDS13454.1 TFAP2C gene_id:7022|Hs108|chr20             (450 aa)
 initn: 1114 init1: 708 opt: 1008  Z-score: 1168.1  bits: 225.3 E(32554): 9.7e-59
Smith-Waterman score: 1082; 47.4% identity (68.1% similar) in 458 aa overlap (11-432:14-447)

                  10        20          30        40        50     
pF1KB5    MSTTFPGLVHDAEIRHDGSNSY--RLMQLGCLESVANSTVAYSSSSPLTYSTTGTEF
                    : : :::::..   :. .:    : :.. . :: . ::... ..   
CCDS13 MLWKITDNVKYEEDCEDRHDGSSNGNPRVPHL----SSAGQHL-YSPAPPLSHTGVAEYQ
               10        20        30             40        50     

          60               70          80         90       100     
pF1KB5 ASPYFSTNHQ---YT----PLHH--QSFHYEFQHSH-PAVTPDAYSLNSLHHSQQYYQ-Q
         :::   .:   :.    :  :  ...   ..  : :: : .  .    ..::.     
CCDS13 PPPYFPPPYQQLAYSQSADPYSHLGEAYAAAINPLHQPAPTGSQQQAWPGRQSQEGAGLP
          60        70        80        90       100       110     

          110       120       130       140       150       160    
pF1KB5 IHHGEPTDFINLHNARALKSSCLDEQRRELGCLDAYRRHDLSLMSHGSQYGMHPDQRLLP
        :::.:. .  : .  .:... .. .:      ::::: :: :. :.       :   : 
CCDS13 SHHGRPAGL--LPHLSGLEAGAVSARR------DAYRRSDL-LLPHAHAL----DAAGL-
         120         130       140              150           160  

          170         180       190       200                      
pF1KB5 GPSLGL--AAAGADDLQGSVEAQCGLVLNGQGGVIRRGGT----------C-------VV
       . .:::       :..: .:. :  :.:. :  :::.:            :       :.
CCDS13 AENLGLHDMPHQMDEVQ-NVDDQ-HLLLHDQT-VIRKGPISMTKNPLNLPCQKELVGAVM
             170        180        190        200       210        

         210       220       230       240       250       260     
pF1KB5 NPTDLFCSVPGRLSLLSSTSKYKVTIAEVKRRLSPPECLNASLLGGILRRAKSKNGGRCL
       :::..::::::::::::::::::::.:::.::::::::::::::::.::::::::::: :
CCDS13 NPTEVFCSVPGRLSLLSSTSKYKVTVAEVQRRLSPPECLNASLLGGVLRRAKSKNGGRSL
      220       230       240       250       260       270        

         270       280       290       300       310       320     
pF1KB5 REKLDRLGLNLPAGRRKAANVTLLTSLVEGEALHLARDFGYTCETEFPAKAVGEHLARQH
       :::::..::::::::::::.::::::::::::.::::::.:.::.:::.: :.:.:.: :
CCDS13 REKLDKIGLNLPAGRRKAAHVTLLTSLVEGEAVHLARDFAYVCEAEFPSKPVAEYLTRPH
      280       290       300       310       320       330        

          330       340       350       360       370       380    
pF1KB5 MEQK-EQTARKKMILATKQICKEFQDLLSQDRSPLGSSRPTPILDLDIQRHLTHFSLITH
       .  . :..:::.:.::..:.:::: .::::::.: :.:: .:.:. .::  :.:::::::
CCDS13 LGGRNEMAARKNMLLAAQQLCKEFTELLSQDRTPHGTSRLAPVLETNIQNCLSHFSLITH
      340       350       360       370       380       390        

          390       400       410          420       430       440 
pF1KB5 GFGTPAICAALSTFQTVLSEMLNYLEKHTTHKNGG---AADSGQGHANSEKAPLRKTSEA
       :::. :::::.:..:. ..: :  ..:  .. : :    :::..   . ::         
CCDS13 GFGSQAICAAVSALQNYIKEALIVIDK--SYMNPGDQSPADSNKTLEKMEKHRK      
      400       410       420         430       440       450      

             450  
pF1KB5 AVKEGKTEKTD




452 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov  6 03:09:46 2016 done: Sun Nov  6 03:09:47 2016
 Total Scan time:  3.260 Total Display time:  0.060

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or Suggestions ?
Send a message to flexiclone AT kazusagt.com