Result of FASTA (ccds) for pFN21AE0115
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0115, 228 aa
  1>>>pF1KE0115 228 - 228 aa - 228 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 5.7180+/-0.000812; mu= 12.3463+/- 0.049
 mean_var=75.0082+/-14.861, 0's: 0 Z-trim(108.1): 31  B-trim: 67 in 1/49
 Lambda= 0.148088
 statistics sampled from 9976 (9999) to 9976 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.69), E-opt: 0.2 (0.307), width:  16
 Scan time:  2.220

The best scores are:                                      opt bits E(32554)
CCDS9001.1 THAP2 gene_id:83591|Hs108|chr12         ( 228) 1515 332.6 1.3e-91
CCDS86.1 THAP3 gene_id:90326|Hs108|chr1            ( 175)  280 68.7 2.7e-12
CCDS6136.1 THAP1 gene_id:55145|Hs108|chr8          ( 213)  277 68.1   5e-12
CCDS55572.1 THAP3 gene_id:90326|Hs108|chr1         ( 239)  275 67.7 7.4e-12
CCDS55573.1 THAP3 gene_id:90326|Hs108|chr1         ( 238)  270 66.6 1.6e-11
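
Note: the "best scores" table above is fixed-layout text, so it can be read back into records for further filtering or sorting. The following is a minimal sketch, not part of the FASTA program itself. The names Hit, HIT_RE, parse_hits and SAMPLE are hypothetical and introduced only for illustration. The regular expression assumes the "gene_id:" description format used by this library (human.CCDS.faa) and may need adjusting for other libraries.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    """One row of the 'best scores' table (hypothetical record type)."""
    accession: str
    gene: str
    gene_id: int
    length: int      # library sequence length in aa
    opt: int         # optimized score
    bits: float      # bit score
    evalue: float    # E() against the library


# Assumed row layout: accession, gene symbol, gene_id:N|build|chrom,
# "( len)", opt, bits, E-value.
HIT_RE = re.compile(
    r"^(\S+)\s+(\S+)\s+gene_id:(\d+)\|\S+\s+"
    r"\(\s*(\d+)\)\s+(\d+)\s+([\d.]+)\s+(\S+)\s*$"
)


def parse_hits(text: str) -> List[Hit]:
    """Return every line of `text` that matches the summary-row layout."""
    hits = []
    for line in text.splitlines():
        m = HIT_RE.match(line)
        if m:
            acc, gene, gid, length, opt, bits, e = m.groups()
            hits.append(Hit(acc, gene, int(gid), int(length),
                            int(opt), float(bits), float(e)))
    return hits


# The five summary rows copied from the report above.
SAMPLE = """\
CCDS9001.1 THAP2 gene_id:83591|Hs108|chr12         ( 228) 1515 332.6 1.3e-91
CCDS86.1 THAP3 gene_id:90326|Hs108|chr1            ( 175)  280 68.7 2.7e-12
CCDS6136.1 THAP1 gene_id:55145|Hs108|chr8          ( 213)  277 68.1   5e-12
CCDS55572.1 THAP3 gene_id:90326|Hs108|chr1         ( 239)  275 67.7 7.4e-12
CCDS55573.1 THAP3 gene_id:90326|Hs108|chr1         ( 238)  270 66.6 1.6e-11
"""

if __name__ == "__main__":
    for hit in parse_hits(SAMPLE):
        print(hit.accession, hit.gene, hit.opt, hit.evalue)
```

The ">>" alignment header lines are skipped because their length field reads "(228 aa)" rather than "( 228)", so only the summary rows match.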


>>CCDS9001.1 THAP2 gene_id:83591|Hs108|chr12              (228 aa)
 initn: 1515 init1: 1515 opt: 1515  Z-score: 1758.4  bits: 332.6 E(32554): 1.3e-91
Smith-Waterman score: 1515; 100.0% identity (100.0% similar) in 228 aa overlap (1-228:1-228)

               10        20        30        40        50        60
pF1KE0 MPTNCAAAGCATTYNKHINISFHRFPLDPKRRKEWVRLVRRKNFVPGKHTFLCSKHFEAS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS90 MPTNCAAAGCATTYNKHINISFHRFPLDPKRRKEWVRLVRRKNFVPGKHTFLCSKHFEAS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 CFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQQV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS90 CFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQQV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 LLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEANS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS90 LLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEANS
              130       140       150       160       170       180

              190       200       210       220        
pF1KE0 VLPKGTSEHMLPTALSSLPLEDFKILEQDQQDKTLLSLNLKQTKSTFI
       ::::::::::::::::::::::::::::::::::::::::::::::::
CCDS90 VLPKGTSEHMLPTALSSLPLEDFKILEQDQQDKTLLSLNLKQTKSTFI
              190       200       210       220        

>>CCDS86.1 THAP3 gene_id:90326|Hs108|chr1                 (175 aa)
 initn: 257 init1: 185 opt: 280  Z-score: 334.1  bits: 68.7 E(32554): 2.7e-12
Smith-Waterman score: 280; 34.6% identity (63.9% similar) in 133 aa overlap (1-131:1-130)

               10         20         30        40        50        
pF1KE0 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE
       :: .:::  : . :. .. ...:::::.. :.  ::::  . : :: : .:: .::.::.
CCDS86 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
               10        20        30        40        50        60

       60        70        80        90       100       110        
pF1KE0 ASCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQ
         ::.  :. . :: .::::.: :    .... ..    ...:. :    .. :..   .
CCDS86 PECFSAFGNRKNLKHNAVPTVFAFQDPTQQVRENTDPASERGNASSS---QKEKTSPCRS
               70        80        90       100          110       

      120       130       140       150       160       170        
pF1KE0 QVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEA
       ::: : . .  .:                                               
CCDS86 QVLPEAGAGEDSPGRNMDTALEELQLPPNAEGHVKQAMLFNVENGTPASREALWLSEE  
       120       130       140       150       160       170       

>>CCDS6136.1 THAP1 gene_id:55145|Hs108|chr8               (213 aa)
 initn: 339 init1: 171 opt: 277  Z-score: 329.4  bits: 68.1 E(32554): 5e-12
Smith-Waterman score: 366; 35.3% identity (63.2% similar) in 201 aa overlap (1-185:1-197)

               10        20         30        40        50         
pF1KE0 MPTNCAAAGCATTYNKHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFEA
       :  .:.: :: . :.:   .:::.:::  :.  :::   :::::: : :.. .::.::  
CCDS61 MVQSCSAYGCKNRYDKDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFTP
               10        20        30        40        50        60

      60        70        80        90       100           110     
pF1KE0 SCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAG-P---SNLKSNI
       .::    ... :: .:::::: .::. ..   :...::. ...  :   :   :.. . :
CCDS61 DCFKRECNNKLLKENAVPTIF-LCTEPHD---KKEDLLEPQEQLPPPPLPPPVSQVDAAI
               70        80            90       100       110      

                   120       130       140       150       160     
pF1KE0 S----------SQQVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRR
       .          . .:. .:.:. .. :. .::: .::... .::.:.::  :. ::  :.
CCDS61 GLLMPPLQTPVNLSVFCDHNYTVEDTMHQRKRIHQLEQQVEKLRKKLKTAQQRCRRQERQ
        120       130       140       150       160       170      

         170        180       190       200       210       220    
pF1KE0 WIKATCLVK-NLEANSVLPKGTSEHMLPTALSSLPLEDFKILEQDQQDKTLLSLNLKQTK
         :   .:. . : ..:  .:                                       
CCDS61 LEKLKEVVHFQKEKDDVSERGYVILPNDYFEIVEVPA                       
        180       190       200       210                          

>>CCDS55572.1 THAP3 gene_id:90326|Hs108|chr1              (239 aa)
 initn: 309 init1: 185 opt: 275  Z-score: 326.3  bits: 67.7 E(32554): 7.4e-12
Smith-Waterman score: 275; 37.7% identity (67.0% similar) in 106 aa overlap (1-104:1-106)

               10         20         30        40        50        
pF1KE0 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE
       :: .:::  : . :. .. ...:::::.. :.  ::::  . : :: : .:: .::.::.
CCDS55 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
               10        20        30        40        50        60

       60        70        80        90       100       110        
pF1KE0 ASCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQ
         ::.  :. . :: .::::.: :    .... ..    ...:. :              
CCDS55 PECFSAFGNRKNLKHNAVPTVFAFQDPTQQVRENTDPASERGNASSSQKEKVLPEAGAGE
               70        80        90       100       110       120

      120       130       140       150       160       170        
pF1KE0 QVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEA
                                                                   
CCDS55 DSPGRNMDTALEELQLPPNAEGHVKQVSPRRPQATEAVGRPTGPAGLRRTPNKQPSDHSY
              130       140       150       160       170       180

>>CCDS55573.1 THAP3 gene_id:90326|Hs108|chr1              (238 aa)
 initn: 309 init1: 185 opt: 270  Z-score: 320.6  bits: 66.6 E(32554): 1.6e-11
Smith-Waterman score: 270; 45.2% identity (70.2% similar) in 84 aa overlap (1-82:1-84)

               10         20         30        40        50        
pF1KE0 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE
       :: .:::  : . :. .. ...:::::.. :.  ::::  . : :: : .:: .::.::.
CCDS55 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
               10        20        30        40        50        60

       60        70        80        90       100       110        
pF1KE0 ASCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQ
         ::.  :. . :: .::::.: :                                    
CCDS55 PECFSAFGNRKNLKHNAVPTVFAFQDPTQVRENTDPASERGNASSSQKEKVLPEAGAGED
               70        80        90       100       110       120




228 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov  4 02:28:16 2016 done: Fri Nov  4 02:28:16 2016
 Total Scan time:  2.220 Total Display time: -0.030

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or suggestions?
Send a message to flexiclone AT kazusagt.com