Result of FASTA (ccds) for pFN21AE5056
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE5056, 224 aa
  1>>>pF1KE5056 224 - 224 aa - 224 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 5.4330+/-0.000724; mu= 12.7556+/- 0.043
 mean_var=58.0059+/-11.471, 0's: 0 Z-trim(107.8): 17  B-trim: 0 in 0/50
 Lambda= 0.168399
 statistics sampled from 9797 (9813) to 9797 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.698), E-opt: 0.2 (0.301), width:  16
 Scan time:  2.050

The best scores are:                                      opt bits E(32554)
CCDS4848.1 APOBEC2 gene_id:10930|Hs108|chr6        ( 224) 1489 369.7 8.5e-103
CCDS33648.1 APOBEC3F gene_id:200316|Hs108|chr22    ( 373)  403 105.9 3.6e-23
CCDS13983.1 APOBEC3C gene_id:27350|Hs108|chr22     ( 190)  383 101.0 5.6e-22
CCDS41747.1 AICDA gene_id:57379|Hs108|chr12        ( 198)  358 94.9 3.9e-20
CCDS58807.1 APOBEC3B gene_id:9582|Hs108|chr22      ( 357)  326 87.2 1.5e-17
CCDS13982.1 APOBEC3B gene_id:9582|Hs108|chr22      ( 382)  326 87.2 1.6e-17
CCDS13984.1 APOBEC3G gene_id:60489|Hs108|chr22     ( 384)  294 79.4 3.4e-15
CCDS46709.1 APOBEC3D gene_id:140564|Hs108|chr22    ( 386)  272 74.1 1.4e-13
CCDS81662.1 AICDA gene_id:57379|Hs108|chr12        ( 188)  265 72.3 2.4e-13
CCDS13981.1 APOBEC3A gene_id:200315|Hs108|chr22    ( 199)  254 69.6 1.6e-12
CCDS54531.1 APOBEC3H gene_id:164668|Hs108|chr22    ( 182)  243 66.9 9.3e-12
CCDS13985.1 APOBEC3H gene_id:164668|Hs108|chr22    ( 183)  243 66.9 9.4e-12
CCDS54530.1 APOBEC3H gene_id:164668|Hs108|chr22    ( 200)  243 67.0   1e-11
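
The rows of the score table above follow a regular layout: accession, gene symbol, annotation field, library sequence length in parentheses, then the opt score, bit score, and E-value. A minimal sketch of reading such rows in Python follows. The helper name `parse_hit`, the regular expression, and the dictionary keys are illustrative assumptions, not part of the FASTA package, and the pattern is only checked against rows shaped like the ones in this report.

```python
import re

# Hypothetical pattern for one row of the "best scores" table.
# Assumes three whitespace-free leading fields, as in this report.
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<gene>\S+)\s+(?P<desc>\S+)\s+"
    r"\(\s*(?P<length>\d+)\)\s+(?P<opt>\d+)\s+"
    r"(?P<bits>[\d.]+)\s+(?P<evalue>\S+)\s*$"
)


def parse_hit(line):
    """Return a dict for a score-table row, or None if the line does not match."""
    m = HIT_RE.match(line)
    if m is None:
        return None
    return {
        "accession": m["acc"],
        "gene": m["gene"],
        "length": int(m["length"]),    # library sequence length (aa)
        "opt": int(m["opt"]),          # optimized score
        "bits": float(m["bits"]),      # bit score
        "evalue": float(m["evalue"]),  # E(32554) expectation value
    }


# Two rows copied from the table above.
rows = [
    "CCDS4848.1 APOBEC2 gene_id:10930|Hs108|chr6        ( 224) 1489 369.7 8.5e-103",
    "CCDS54530.1 APOBEC3H gene_id:164668|Hs108|chr22    ( 200)  243 67.0   1e-11",
]
hits = [parse_hit(r) for r in rows]
```

Lines that do not fit the row layout, such as the table header, return `None`, so they can be filtered out when scanning a whole report.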


>>CCDS4848.1 APOBEC2 gene_id:10930|Hs108|chr6             (224 aa)
 initn: 1489 init1: 1489 opt: 1489  Z-score: 1959.1  bits: 369.7 E(32554): 8.5e-103
Smith-Waterman score: 1489; 100.0% identity (100.0% similar) in 224 aa overlap (1-224:1-224)

               10        20        30        40        50        60
pF1KE5 MAQKEEAAVATEAASQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 MAQKEEAAVATEAASQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 YSSGRNKTFLCYVVEAQGKGGQVQASRGYLEDEHAAAHAEEAFFNTILPAFDPALRYNVT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 YSSGRNKTFLCYVVEAQGKGGQVQASRGYLEDEHAAAHAEEAFFNTILPAFDPALRYNVT
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE5 WYVSSSPCAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 WYVSSSPCAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMK
              130       140       150       160       170       180

              190       200       210       220    
pF1KE5 PQDFEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK
       ::::::::::::::::::::::::::::::::::::::::::::
CCDS48 PQDFEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK
              190       200       210       220    

>>CCDS33648.1 APOBEC3F gene_id:200316|Hs108|chr22         (373 aa)
 initn: 541 init1: 219 opt: 403  Z-score: 529.5  bits: 105.9 E(32554): 3.6e-23
Smith-Waterman score: 403; 32.2% identity (68.3% similar) in 199 aa overlap (30-224:184-373)

                10        20        30        40        50         
pF1KE5  MAQKEEAAVATEAASQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNV
                                     :::... :       : .  ..: :.:.:.
CCDS33 YCWENFVYSEGQPFMPWYKFDDNYAFLHRTLKEILRNPM------EAMYPHIFYFHFKNL
           160       170       180       190             200       

      60        70        80        90          100        110     
pF1KE5 EYSSGRNKTFLCYVVEAQGKGGQVQASRGYLE---DEHAAAHAEEAFFNTILP-AFDPAL
       . . :::...::...:.  . . :. .:: ..   : ..  :::. :.. .    ..:  
CCDS33 RKAYGRNESWLCFTMEVVKHHSPVSWKRGVFRNQVDPETHCHAERCFLSWFCDDILSPNT
       210       220       230       240       250       260       

         120       130       140       150       160       170     
pF1KE5 RYNVTWYVSSSPCAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCK
        :.::::.: :::  :: .. . :.. .:. : :...::... . . : .:..:.. : .
CCDS33 NYEVTWYTSWSPCPECAGEVAEFLARHSNVNLTIFTARLYYFWDTDYQEGLRSLSQEGAS
       270       280       290       300       310       320       

         180       190       200       210       220    
pF1KE5 LRIMKPQDFEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK
       ..::  .::.: :.::: ...   . :.::. .. :::. . :: .::.
CCDS33 VEIMGYKDFKYCWENFVYNDD---EPFKPWKGLKYNFLFLDSKLQEILE
       330       340          350       360       370   

>--
 initn: 337 init1: 206 opt: 332  Z-score: 436.3  bits: 88.7 E(32554): 5.6e-18
Smith-Waterman score: 332; 32.6% identity (64.1% similar) in 184 aa overlap (37-214:3-179)

         10        20        30        40        50        60      
pF1KE5 AAVATEAASQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRN
                                     : :.  : ::.  . :...: :    : ::
CCDS33                             MKPHFRN-TVERMYRDTFSYNFYNRPILSRRN
                                            10        20        30 

         70        80             90       100       110        120
pF1KE5 KTFLCYVVEAQGKG-GQVQAS--RG--YLEDEHAAAHAEEAFFNTILPAFDPALR-YNVT
        ..::: :...: .  ...:.  ::  : . ::   :::  :.. .     :: . ...:
CCDS33 TVWLCYEVKTKGPSRPRLDAKIFRGQVYSQPEH---HAEMCFLSWFCGNQLPAYKCFQIT
              40        50        60           70        80        

              130       140       150       160       170       180
pF1KE5 WYVSSSPCAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMK
       :.:: .::  :. .. . :..  :. : : ..::... : . . :: .:..:: ...:: 
CCDS33 WFVSWTPCPDCVAKLAEFLAEHPNVTLTISAARLYYYWERDYRRALCRLSQAGARVKIMD
       90       100       110       120       130       140        

              190       200       210       220                    
pF1KE5 PQDFEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK                
        ..: : :.:::    .:.. :.::  ...:. .                          
CCDS33 DEEFAYCWENFVY---SEGQPFMPWYKFDDNYAFLHRTLKEILRNPMEAMYPHIFYFHFK
      150       160          170       180       190       200     

CCDS33 NLRKAYGRNESWLCFTMEVVKHHSPVSWKRGVFRNQVDPETHCHAERCFLSWFCDDILSP
         210       220       230       240       250       260     

>>CCDS13983.1 APOBEC3C gene_id:27350|Hs108|chr22          (190 aa)
 initn: 361 init1: 243 opt: 383  Z-score: 508.1  bits: 101.0 E(32554): 5.6e-22
Smith-Waterman score: 383; 32.6% identity (69.1% similar) in 181 aa overlap (48-224:14-190)

        20        30        40        50        60        70       
pF1KE5 GEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVVEAQ
                                     :..:. :::.:.  .. ::.:.::..::. 
CCDS13                  MNPQIRNPMKAMYPGTFY-FQFKNLWEANDRNETWLCFTVEGI
                                10         20        30        40  

        80        90          100        110       120       130   
pF1KE5 GKGGQVQASRGYLE---DEHAAAHAEEAFFNTILP-AFDPALRYNVTWYVSSSPCAACAD
        . . :. . : ..   : ..  :::. :.. .    ..:  .:.::::.: :::  :: 
CCDS13 KRRSVVSWKTGVFRNQVDSETHCHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCAG
             50        60        70        80        90       100  

           140       150       160       170       180       190   
pF1KE5 RIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQDFEYVWQNFVE
       .. . :.. .:. : :...::.... :  : .:..:.. :  ..::  .::.: :.::: 
CCDS13 EVAEFLARHSNVNLTIFTARLYYFQYPCYQEGLRSLSQEGVAVEIMDYEDFKYCWENFVY
            110       120       130       140       150       160  

           200       210       220    
pF1KE5 QEEGESKAFQPWEDIQENFLYYEEKLADILK
       ..   .. :.::. .. ::   ...: . :.
CCDS13 ND---NEPFKPWKGLKTNFRLLKRRLRESLQ
               170       180       190

>>CCDS41747.1 AICDA gene_id:57379|Hs108|chr12             (198 aa)
 initn: 271 init1: 199 opt: 358  Z-score: 474.9  bits: 94.9 E(32554): 3.9e-20
Smith-Waterman score: 358; 35.1% identity (68.4% similar) in 174 aa overlap (52-223:11-180)

              30        40        50        60        70        80 
pF1KE5 ENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVVEAQGKGG
                                     : .::.::....:: .:.:::::. . .. 
CCDS41                     MDSLLMNRRKFLYQFKNVRWAKGRRETYLCYVVKRRDSAT
                                   10        20        30        40

              90       100        110       120       130       140
pF1KE5 QVQASRGYLEDEHAAAHAEEAFFNTILP-AFDPALRYNVTWYVSSSPCAACADRIIKTLS
       . . . :::.....  :.:  :.  :    .::.  : :::..: :::  :: ..   : 
CCDS41 SFSLDFGYLRNKNGC-HVELLFLRYISDWDLDPGRCYRVTWFTSWSPCYDCARHVADFLR
               50         60        70        80        90         

              150       160        170       180       190         
pF1KE5 KTKNLRLLILVGRLFMWEEPEIQA-ALKKLKEAGCKLRIMKPQDFEYVWQNFVEQEEGES
        . :: : :...::.. :. . .  .:..:..:: .. ::  .:. : :..:::..:   
CCDS41 GNPNLSLRIFTARLYFCEDRKAEPEGLRRLHRAGVQIAIMTFKDYFYCWNTFVENHE---
     100       110       120       130       140       150         

     200       210       220                     
pF1KE5 KAFQPWEDIQENFLYYEEKLADILK                 
       ..:. :: ..:: .   ..:  ::                  
CCDS41 RTFKAWEGLHENSVRLSRQLRRILLPLYEVDDLRDAFRTLGL
        160       170       180       190        

>>CCDS58807.1 APOBEC3B gene_id:9582|Hs108|chr22           (357 aa)
 initn: 379 init1: 204 opt: 326  Z-score: 428.7  bits: 87.2 E(32554): 1.5e-17
Smith-Waterman score: 326; 31.6% identity (63.1% similar) in 187 aa overlap (45-224:10-190)

           20        30        40        50        60        70    
pF1KE5 SQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVV
                                     ::.  . :  .:.:     ::. :.::: :
CCDS58                      MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEV
                                    10        20        30         

            80             90       100       110        120       
pF1KE5 EAQ-GKGGQVQAS---RG--YLEDEHAAAHAEEAFFNTILPAFDPALR-YNVTWYVSSSP
       . . :... .  .   ::  :.. ..   :::  :.. .     :: . ...::.:: .:
CCDS58 KIKRGRSNLLWDTGVFRGQVYFKPQY---HAEMCFLSWFCGNQLPAYKCFQITWFVSWTP
      40        50        60           70        80        90      

       130       140       150       160       170       180       
pF1KE5 CAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQDFEYV
       :  :. .. . ::.  :. : : ..::... : . . :: .:..:: .. ::  ..: : 
CCDS58 CPDCVAKLAEFLSEHPNVTLTISAARLYYYWERDYRRALCRLSQAGARVTIMDYEEFAYC
        100       110       120       130       140       150      

       190       200       210       220                           
pF1KE5 WQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK                       
       :.::: .:   .. :.::  ..::. . .. : .::.                       
CCDS58 WENFVYNE---GQQFMPWYKFDENYAFLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQ
        160          170       180       190       200       210   

CCDS58 TYLCYEVERLDNGTWVLMDQHMGFLCNELDPAQIYRVTWFISWSPCFSWGCAGEVRAFLQ
           220       230       240       250       260       270   

>--
 initn: 287 init1:  94 opt: 288  Z-score: 378.8  bits: 78.0 E(32554): 8.8e-15
Smith-Waterman score: 288; 31.7% identity (57.2% similar) in 180 aa overlap (47-224:193-353)

         20        30        40        50        60        70      
pF1KE5 NGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVVEA
                                     .  . : :.: :      : .:.::: :: 
CCDS58 NEGQQFMPWYKFDENYAFLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQTYLCYEVER
            170       180       190       200       210       220  

         80        90       100       110       120       130      
pF1KE5 QGKGGQVQASRGYLEDEHAAAHAEEAFFNTILPAFDPALRYNVTWYVSSSPCAA--CADR
         .:  :      : :.: .   .:         .:::  : :::..: ::: .  :: .
CCDS58 LDNGTWV------LMDQHMGFLCNE---------LDPAQIYRVTWFISWSPCFSWGCAGE
                  230       240                250       260       

          140       150       160       170       180       190    
pF1KE5 IIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQDFEYVWQNFVEQ
       .   :... ..:: :...:.. .. :  . ::. :..:: .. ::  ..::: :..:: .
CCDS58 VRAFLQENTHVRLRIFAARIYDYD-PLYKEALQMLRDAGAQVSIMTYDEFEYCWDTFVYR
       270       280       290        300       310       320      

          200       210       220        
pF1KE5 EEGESKAFQPWEDIQENFLYYEEKLADILK    
       .   .  ::::. ..:.      .:  ::.    
CCDS58 Q---GCPFQPWDGLEEHSQALSGRLRAILQNQGN
           330       340       350       

>>CCDS13982.1 APOBEC3B gene_id:9582|Hs108|chr22           (382 aa)
 initn: 379 init1: 204 opt: 326  Z-score: 428.2  bits: 87.2 E(32554): 1.6e-17
Smith-Waterman score: 326; 31.6% identity (63.1% similar) in 187 aa overlap (45-224:10-190)

           20        30        40        50        60        70    
pF1KE5 SQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVV
                                     ::.  . :  .:.:     ::. :.::: :
CCDS13                      MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEV
                                    10        20        30         

            80             90       100       110        120       
pF1KE5 EAQ-GKGGQVQAS---RG--YLEDEHAAAHAEEAFFNTILPAFDPALR-YNVTWYVSSSP
       . . :... .  .   ::  :.. ..   :::  :.. .     :: . ...::.:: .:
CCDS13 KIKRGRSNLLWDTGVFRGQVYFKPQY---HAEMCFLSWFCGNQLPAYKCFQITWFVSWTP
      40        50        60           70        80        90      

       130       140       150       160       170       180       
pF1KE5 CAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQDFEYV
       :  :. .. . ::.  :. : : ..::... : . . :: .:..:: .. ::  ..: : 
CCDS13 CPDCVAKLAEFLSEHPNVTLTISAARLYYYWERDYRRALCRLSQAGARVTIMDYEEFAYC
        100       110       120       130       140       150      

       190       200       210       220                           
pF1KE5 WQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK                       
       :.::: .:   .. :.::  ..::. . .. : .::.                       
CCDS13 WENFVYNE---GQQFMPWYKFDENYAFLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQ
        160          170       180       190       200       210   

CCDS13 TYLCYEVERLDNGTWVLMDQHMGFLCNEAKNLLCGFYGRHAELRFLDLVPSLQLDPAQIY
           220       230       240       250       260       270   

>--
 initn: 287 init1:  94 opt: 309  Z-score: 405.9  bits: 83.1 E(32554): 2.7e-16
Smith-Waterman score: 309; 31.9% identity (59.7% similar) in 191 aa overlap (47-224:193-378)

         20        30        40        50        60        70      
pF1KE5 NGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVVEA
                                     .  . : :.: :      : .:.::: :: 
CCDS13 NEGQQFMPWYKFDENYAFLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQTYLCYEVER
            170       180       190       200       210       220  

         80          90              100       110         120     
pF1KE5 QGKGGQVQASR--GYLEDEHA-------AAHAEEAFFNTILPAF--DPALRYNVTWYVSS
         .:  :  ..  :.: .:         . :::  :.. ..:..  :::  : :::..: 
CCDS13 LDNGTWVLMDQHMGFLCNEAKNLLCGFYGRHAELRFLD-LVPSLQLDPAQIYRVTWFISW
            230       240       250       260        270       280 

         130         140       150       160       170       180   
pF1KE5 SPCAA--CADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQD
       ::: .  :: ..   :... ..:: :...:.. .. :  . ::. :..:: .. ::  ..
CCDS13 SPCFSWGCAGEVRAFLQENTHVRLRIFAARIYDYD-PLYKEALQMLRDAGAQVSIMTYDE
             290       300       310        320       330       340

           190       200       210       220        
pF1KE5 FEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK    
       ::: :..:: ..   .  ::::. ..:.      .:  ::.    
CCDS13 FEYCWDTFVYRQ---GCPFQPWDGLEEHSQALSGRLRAILQNQGN
              350          360       370       380  

>>CCDS13984.1 APOBEC3G gene_id:60489|Hs108|chr22          (384 aa)
 initn: 316 init1: 143 opt: 294  Z-score: 386.2  bits: 79.4 E(32554): 3.4e-15
Smith-Waterman score: 300; 32.6% identity (62.5% similar) in 184 aa overlap (52-224:202-380)

              30        40        50        60        70        80 
pF1KE5 ENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVVEAQGKGG
                                     : :.: :  .  ::..:.::: :: . .  
CCDS13 FEPWNNLPKYYILLHIMLGEILRHSMDPPTFTFNFNNEPWVRGRHETYLCYEVERMHNDT
             180       190       200       210       220       230 

                90              100         110       120       130
pF1KE5 QV--QASRGYLEDE----HA---AAHAEEAFFNTILP--AFDPALRYNVTWYVSSSPCAA
        :  .  ::.: ..    :.   . :::  :.. ..:   .:    : :: ..: ::: .
CCDS13 WVLLNQRRGFLCNQAPHKHGFLEGRHAELCFLD-VIPFWKLDLDQDYRVTCFTSWSPCFS
             240       250       260        270       280       290

              140       150       160       170       180       190
pF1KE5 CADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQDFEYVWQN
       ::... : .::.:.. : :...:..  .. . : .:. : ::: :. ::  ..:.. :..
CCDS13 CAQEMAKFISKNKHVSLCIFTARIYD-DQGRCQEGLRTLAEAGAKISIMTYSEFKHCWDT
              300       310        320       330       340         

              200       210       220        
pF1KE5 FVEQEEGESKAFQPWEDIQENFLYYEEKLADILK    
       ::...   .  ::::. ..:.      .:  ::.    
CCDS13 FVDHQ---GCPFQPWDGLDEHSQDLSGRLRAILQNQEN
     350          360       370       380    

>--
 initn: 316 init1: 143 opt: 294  Z-score: 386.2  bits: 79.4 E(32554): 3.4e-15
Smith-Waterman score: 294; 29.4% identity (62.4% similar) in 197 aa overlap (37-224:3-194)

         10        20        30        40        50        60      
pF1KE5 AAVATEAASQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRN
                                     : :.  : ::.  . :...: :    : ::
CCDS13                             MKPHFRN-TVERMYRDTFSYNFYNRPILSRRN
                                            10        20        30 

         70        80           90       100         110       120 
pF1KE5 KTFLCYVVEAQGKGG---QVQASRGYLEDEHAAAHAEEAFFN--TILPAFDPALRYNVTW
        ..::: :...: .    ...  :: . .:    : :  ::.  .    .    .:.:::
CCDS13 TVWLCYEVKTKGPSRPPLDAKIFRGQVYSE-LKYHPEMRFFHWFSKWRKLHRDQEYEVTW
              40        50        60         70        80        90

             130       140       150       160         170         
pF1KE5 YVSSSPCAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKL--KEAGCK--LR
       :.: :::. :.  .   :..  .. : :.:.::... .:. : ::..:  :. : .  ..
CCDS13 YISWSPCTKCTRDMATFLAEDPKVTLTIFVARLYYFWDPDYQEALRSLCQKRDGPRATMK
              100       110       120       130       140       150

       180       190       200       210       220                 
pF1KE5 IMKPQDFEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK             
       ::. ..:.. :..:: ...   . :.::... . ..  .  :..::.             
CCDS13 IMNYDEFQHCWSKFVYSQR---ELFEPWNNLPKYYILLHIMLGEILRHSMDPPTFTFNFN
              160          170       180       190       200       

CCDS13 NEPWVRGRHETYLCYEVERMHNDTWVLLNQRRGFLCNQAPHKHGFLEGRHAELCFLDVIP
       210       220       230       240       250       260       

>>CCDS46709.1 APOBEC3D gene_id:140564|Hs108|chr22         (386 aa)
 initn: 488 init1: 203 opt: 272  Z-score: 357.3  bits: 74.1 E(32554): 1.4e-13
Smith-Waterman score: 365; 31.5% identity (66.3% similar) in 184 aa overlap (45-224:206-386)

           20        30        40        50        60        70    
pF1KE5 SQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVV
                                     : .  ..: :.:.:.  . :::...::...
CCDS46 EGQPFMPWYKFDDNYASLHRTLKEILRNPMEAMYPHIFYFHFKNLLKACGRNESWLCFTM
         180       190       200       210       220       230     

           80        90          100        110       120       130
pF1KE5 EAQGKGGQVQASRGYLE---DEHAAAHAEEAFFNTILP-AFDPALRYNVTWYVSSSPCAA
       :.  . . :  .:: ..   : ..  :::. :.. .    ..:   :.::::.: :::  
CCDS46 EVTKHHSAVFRKRGVFRNQVDPETHCHAERCFLSWFCDDILSPNTNYEVTWYTSWSPCPE
         240       250       260       270       280       290     

              140       150       160       170       180       190
pF1KE5 CADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRIMKPQDFEYVWQN
       :: .. . :.. .:. : :...:: .. . . : .: .:.. : ...::  .::   :.:
CCDS46 CAGEVAEFLARHSNVNLTIFTARLCYFWDTDYQEGLCSLSQEGASVKIMGYKDFVSCWKN
         300       310       320       330       340       350     

              200       210       220    
pF1KE5 FVEQEEGESKAFQPWEDIQENFLYYEEKLADILK
       :: ...   . :.::. .: ::   ...: .::.
CCDS46 FVYSDD---EPFKPWKGLQTNFRLLKRRLREILQ
         360          370       380      

>--
 initn: 512 init1: 203 opt: 305  Z-score: 400.6  bits: 82.1 E(32554): 5.4e-16
Smith-Waterman score: 305; 29.6% identity (61.2% similar) in 196 aa overlap (45-224:10-202)

           20        30        40        50        60        70    
pF1KE5 SQNGEDLENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVV
                                     ::.  . :  .:.:     ::. :.::: :
CCDS46                      MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEV
                                    10        20        30         

            80           90       100              110             
pF1KE5 EAQ-GKGGQVQAS---RGYLEDEHAAAHAEEAFFN-------TILPAFD----PA-LRYN
       . . :... .  .   :: .  .. . : .:..:         .:  :     ::  :..
CCDS46 KIKRGRSNLLWDTGVFRGPVLPKRQSNHRQEVYFRFENHAEMCFLSWFCGNRLPANRRFQ
      40        50        60        70        80        90         

      120       130       140       150       160       170        
pF1KE5 VTWYVSSSPCAACADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLRI
       .::.:: .::  :. .. : :..  :. : : ..::..... . . .: .:..:: ...:
CCDS46 ITWFVSWNPCLPCVVKVTKFLAEHPNVTLTISAARLYYYRDRDWRWVLLRLHKAGARVKI
     100       110       120       130       140       150         

      180       190       200       210       220                  
pF1KE5 MKPQDFEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK              
       :  .:: : :.::: .:   .. :.::  ...:.   .. : .::.              
CCDS46 MDYEDFAYCWENFVCNE---GQPFMPWYKFDDNYASLHRTLKEILRNPMEAMYPHIFYFH
     160       170          180       190       200       210      

CCDS46 FKNLLKACGRNESWLCFTMEVTKHHSAVFRKRGVFRNQVDPETHCHAERCFLSWFCDDIL
        220       230       240       250       260       270      

>>CCDS81662.1 AICDA gene_id:57379|Hs108|chr12             (188 aa)
 initn: 287 init1: 211 opt: 265  Z-score: 353.2  bits: 72.3 E(32554): 2.4e-13
Smith-Waterman score: 292; 32.8% identity (64.4% similar) in 174 aa overlap (52-223:11-170)

              30        40        50        60        70        80 
pF1KE5 ENLDDPEKLKELIELPPFEIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVVEAQGKGG
                                     : .::.::....:: .:.:::::. . .. 
CCDS81                     MDSLLMNRRKFLYQFKNVRWAKGRRETYLCYVVKRRDSAT
                                   10        20        30        40

              90       100        110       120       130       140
pF1KE5 QVQASRGYLEDEHAAAHAEEAFFNTILP-AFDPALRYNVTWYVSSSPCAACADRIIKTLS
       . . . :::.....  :.:  :.  :    .::.  : :::..: :::  :: ..   : 
CCDS81 SFSLDFGYLRNKNGC-HVELLFLRYISDWDLDPGRCYRVTWFTSWSPCYDCARHVADFLR
               50         60        70        80        90         

              150       160        170       180       190         
pF1KE5 KTKNLRLLILVGRLFMWEEPEIQA-ALKKLKEAGCKLRIMKPQDFEYVWQNFVEQEEGES
        . :: : :...::.. :. . .  .:..:..:: .. ::          .: :..:   
CCDS81 GNPNLSLRIFTARLYFCEDRKAEPEGLRRLHRAGVQIAIM----------TFKENHE---
     100       110       120       130                 140         

     200       210       220                     
pF1KE5 KAFQPWEDIQENFLYYEEKLADILK                 
       ..:. :: ..:: .   ..:  ::                  
CCDS81 RTFKAWEGLHENSVRLSRQLRRILLPLYEVDDLRDAFRTLGL
        150       160       170       180        

>>CCDS13981.1 APOBEC3A gene_id:200315|Hs108|chr22         (199 aa)
 initn: 286 init1:  87 opt: 254  Z-score: 338.3  bits: 69.6 E(32554): 1.6e-12
Smith-Waterman score: 311; 31.5% identity (62.4% similar) in 197 aa overlap (43-224:7-195)

             20        30        40          50        60        70
pF1KE5 AASQNGEDLENLDDPEKLKELIELPPFEIVTGER--LPANFFKFQFRNVEYSSGRNKTFL
                                     .: :  .  ..:  .: :   . ::.::.:
CCDS13                         MEASPASGPRHLMDPHIFTSNFNN---GIGRHKTYL
                                       10        20           30   

               80          90              100       110           
pF1KE5 CYVVEAQGKGGQVQAS--RGYLEDEHA-------AAHAEEAFFNTILPAF--DPALRYNV
       :: ::   .: .:. .  ::.:...         . :::  :.. . :..  :::  : :
CCDS13 CYEVERLDNGTSVKMDQHRGFLHNQAKNLLCGFYGRHAELRFLDLV-PSLQLDPAQIYRV
            40        50        60        70         80        90  

     120       130         140       150       160       170       
pF1KE5 TWYVSSSPCAA--CADRIIKTLSKTKNLRLLILVGRLFMWEEPEIQAALKKLKEAGCKLR
       ::..: ::: .  :: ..   :... ..:: :...:.. .. :  . ::. :..:: .. 
CCDS13 TWFISWSPCFSWGCAGEVRAFLQENTHVRLRIFAARIYDYD-PLYKEALQMLRDAGAQVS
            100       110       120       130        140       150 

       180       190       200       210       220        
pF1KE5 IMKPQDFEYVWQNFVEQEEGESKAFQPWEDIQENFLYYEEKLADILK    
       ::  ..:.. :..::...   .  ::::. ..:.      .:  ::.    
CCDS13 IMTYDEFKHCWDTFVDHQ---GCPFQPWDGLDEHSQALSGRLRAILQNQGN
             160          170       180       190         




224 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov  8 04:35:59 2016 done: Tue Nov  8 04:35:59 2016
 Total Scan time:  2.050 Total Display time:  0.010

Function used was FASTA [36.3.4 Apr, 2011]