Result of FASTA (ccds) for pFN21AE0563
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0563, 222 aa
  1>>>pF1KE0563 222 - 222 aa - 222 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 5.5946+/-0.000756; mu= 12.0751+/- 0.045
 mean_var=59.7259+/-11.934, 0's: 0 Z-trim(106.8): 23  B-trim: 0 in 0/52
 Lambda= 0.165956
 statistics sampled from 9196 (9215) to 9196 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.672), E-opt: 0.2 (0.283), width:  16
 Scan time:  1.520

The best scores are:                                      opt bits E(32554)
CCDS4946.1 GSTA5 gene_id:221357|Hs108|chr6         ( 222) 1423 348.8 1.6e-96
CCDS4945.1 GSTA1 gene_id:2938|Hs108|chr6           ( 222) 1287 316.3   1e-86
CCDS4944.1 GSTA2 gene_id:2939|Hs108|chr6           ( 222) 1265 311.0 3.9e-85
CCDS4947.1 GSTA3 gene_id:2940|Hs108|chr6           ( 222) 1222 300.7 4.9e-82
CCDS4948.1 GSTA4 gene_id:2941|Hs108|chr6           ( 222)  822 204.9 3.3e-53
CCDS41679.1 GSTP1 gene_id:2950|Hs108|chr11         ( 210)  279 74.9 4.3e-14
CCDS3640.1 HPGDS gene_id:27306|Hs108|chr4          ( 199)  241 65.8 2.2e-11


>>CCDS4946.1 GSTA5 gene_id:221357|Hs108|chr6              (222 aa)
 initn: 1423 init1: 1423 opt: 1423  Z-score: 1846.4  bits: 348.8 E(32554): 1.6e-96
Smith-Waterman score: 1423; 100.0% identity (100.0% similar) in 222 aa overlap (1-222:1-222)

               10        20        30        40        50        60
pF1KE0 MAEKPKLHYSNARGSMESIRWLLAAAGVELEEKFLESAEDLDKLRNDGSLLFQQVPMVEI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 MAEKPKLHYSNARGSMESIRWLLAAAGVELEEKFLESAEDLDKLRNDGSLLFQQVPMVEI
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 DGMKLVQTRAILNYIASKYNLYGKDMKERALIDMYTEGIVDLTEMILLLLICQPEERDAK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 DGMKLVQTRAILNYIASKYNLYGKDMKERALIDMYTEGIVDLTEMILLLLICQPEERDAK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 TALVKEKIKNRYFPAFEKVLKSHRQDYLVGNKLSWADIHLVELFYYVEELDSSLISSFPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 TALVKEKIKNRYFPAFEKVLKSHRQDYLVGNKLSWADIHLVELFYYVEELDSSLISSFPL
              130       140       150       160       170       180

              190       200       210       220  
pF1KE0 LKALKTRISNLPTVKKFLQPGSQRKPPMDEKSLEEARKIFRF
       ::::::::::::::::::::::::::::::::::::::::::
CCDS49 LKALKTRISNLPTVKKFLQPGSQRKPPMDEKSLEEARKIFRF
              190       200       210       220  

>>CCDS4945.1 GSTA1 gene_id:2938|Hs108|chr6                (222 aa)
 initn: 1287 init1: 1287 opt: 1287  Z-score: 1670.4  bits: 316.3 E(32554): 1e-86
Smith-Waterman score: 1287; 90.1% identity (94.6% similar) in 222 aa overlap (1-222:1-222)

               10        20        30        40        50        60
pF1KE0 MAEKPKLHYSNARGSMESIRWLLAAAGVELEEKFLESAEDLDKLRNDGSLLFQQVPMVEI
       ::::::::: :::: ::: ::::::::::.::::..:::::::::::: :.:::::::::
CCDS49 MAEKPKLHYFNARGRMESTRWLLAAAGVEFEEKFIKSAEDLDKLRNDGYLMFQQVPMVEI
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 DGMKLVQTRAILNYIASKYNLYGKDMKERALIDMYTEGIVDLTEMILLLLICQPEERDAK
       :::::::::::::::::::::::::.::::::::: :::.:: :::::: .: :::.:::
CCDS49 DGMKLVQTRAILNYIASKYNLYGKDIKERALIDMYIEGIADLGEMILLLPVCPPEEKDAK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 TALVKEKIKNRYFPAFEKVLKSHRQDYLVGNKLSWADIHLVELFYYVEELDSSLISSFPL
        ::.::::::::::::::::::: :::::::::: ::::::::.::::::::::::::::
CCDS49 LALIKEKIKNRYFPAFEKVLKSHGQDYLVGNKLSRADIHLVELLYYVEELDSSLISSFPL
              130       140       150       160       170       180

              190       200       210       220  
pF1KE0 LKALKTRISNLPTVKKFLQPGSQRKPPMDEKSLEEARKIFRF
       :::::::::::::::::::::: :::::::::::::::::::
CCDS49 LKALKTRISNLPTVKKFLQPGSPRKPPMDEKSLEEARKIFRF
              190       200       210       220  

>>CCDS4944.1 GSTA2 gene_id:2939|Hs108|chr6                (222 aa)
 initn: 1265 init1: 1265 opt: 1265  Z-score: 1642.0  bits: 311.0 E(32554): 3.9e-85
Smith-Waterman score: 1265; 88.7% identity (94.6% similar) in 222 aa overlap (1-222:1-222)

               10        20        30        40        50        60
pF1KE0 MAEKPKLHYSNARGSMESIRWLLAAAGVELEEKFLESAEDLDKLRNDGSLLFQQVPMVEI
       ::::::::::: :: ::::::::::::::.::::..:::::::::::: :.:::::::::
CCDS49 MAEKPKLHYSNIRGRMESIRWLLAAAGVEFEEKFIKSAEDLDKLRNDGYLMFQQVPMVEI
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 DGMKLVQTRAILNYIASKYNLYGKDMKERALIDMYTEGIVDLTEMILLLLICQPEERDAK
       :::::::::::::::::::::::::.::.:::::: :::.:: :::::: . ::::.:::
CCDS49 DGMKLVQTRAILNYIASKYNLYGKDIKEKALIDMYIEGIADLGEMILLLPFSQPEEQDAK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 TALVKEKIKNRYFPAFEKVLKSHRQDYLVGNKLSWADIHLVELFYYVEELDSSLISSFPL
        ::..:: ::::::::::::::: :::::::::: ::::::::.::::::::::::::::
CCDS49 LALIQEKTKNRYFPAFEKVLKSHGQDYLVGNKLSRADIHLVELLYYVEELDSSLISSFPL
              130       140       150       160       170       180

              190       200       210       220  
pF1KE0 LKALKTRISNLPTVKKFLQPGSQRKPPMDEKSLEEARKIFRF
       :::::::::::::::::::::: ::::::::::::.::::::
CCDS49 LKALKTRISNLPTVKKFLQPGSPRKPPMDEKSLEESRKIFRF
              190       200       210       220  

>>CCDS4947.1 GSTA3 gene_id:2940|Hs108|chr6                (222 aa)
 initn: 1222 init1: 1222 opt: 1222  Z-score: 1586.3  bits: 300.7 E(32554): 4.9e-82
Smith-Waterman score: 1222; 85.6% identity (93.2% similar) in 222 aa overlap (1-222:1-222)

               10        20        30        40        50        60
pF1KE0 MAEKPKLHYSNARGSMESIRWLLAAAGVELEEKFLESAEDLDKLRNDGSLLFQQVPMVEI
       :: :::::: :.:: :: :::::::::::.::::. ::::: ::::::::.:::::::::
CCDS49 MAGKPKLHYFNGRGRMEPIRWLLAAAGVEFEEKFIGSAEDLGKLRNDGSLMFQQVPMVEI
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 DGMKLVQTRAILNYIASKYNLYGKDMKERALIDMYTEGIVDLTEMILLLLICQPEERDAK
       :::::::::::::::::::::::::.::::::::::::..::.:::::: .:.:::.:::
CCDS49 DGMKLVQTRAILNYIASKYNLYGKDIKERALIDMYTEGMADLNEMILLLPLCRPEEKDAK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 TALVKEKIKNRYFPAFEKVLKSHRQDYLVGNKLSWADIHLVELFYYVEELDSSLISSFPL
        ::.::: :.::::::::::.:: :::::::::: ::: ::::.::::::::::::.:::
CCDS49 IALIKEKTKSRYFPAFEKVLQSHGQDYLVGNKLSRADISLVELLYYVEELDSSLISNFPL
              130       140       150       160       170       180

              190       200       210       220  
pF1KE0 LKALKTRISNLPTVKKFLQPGSQRKPPMDEKSLEEARKIFRF
       :::::::::::::::::::::: :::: : :.::::::::::
CCDS49 LKALKTRISNLPTVKKFLQPGSPRKPPADAKALEEARKIFRF
              190       200       210       220  

>>CCDS4948.1 GSTA4 gene_id:2941|Hs108|chr6                (222 aa)
 initn: 822 init1: 822 opt: 822  Z-score: 1068.7  bits: 204.9 E(32554): 3.3e-53
Smith-Waterman score: 822; 53.4% identity (84.2% similar) in 221 aa overlap (1-221:1-221)

               10        20        30        40        50        60
pF1KE0 MAEKPKLHYSNARGSMESIRWLLAAAGVELEEKFLESAEDLDKLRNDGSLLFQQVPMVEI
       :: .::::: :.:: :::.::.:::::::..:.:::. :.: ::.. . :::::::::::
CCDS49 MAARPKLHYPNGRGRMESVRWVLAAAGVEFDEEFLETKEQLYKLQDGNHLLFQQVPMVEI
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 DGMKLVQTRAILNYIASKYNLYGKDMKERALIDMYTEGIVDLTEMILLLLICQPEERDAK
       :::::::::.::.:::.:.::.::..:::.:::::.:: .:: :....  . .:.... .
CCDS49 DGMKLVQTRSILHYIADKHNLFGKNLKERTLIDMYVEGTLDLLELLIMHPFLKPDDQQKE
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 TALVKEKIKNRYFPAFEKVLKSHRQDYLVGNKLSWADIHLVELFYYVEELDSSLISSFPL
       .. . .:   ::::.:::.:..: :..::::.:: ::. :.. .  .::   ...:.::.
CCDS49 VVNMAQKAIIRYFPVFEKILRGHGQSFLVGNQLSLADVILLQTILALEEKIPNILSAFPF
              130       140       150       160       170       180

              190       200       210       220  
pF1KE0 LKALKTRISNLPTVKKFLQPGSQRKPPMDEKSLEEARKIFRF
       :.   ...::.::.:.::.:::..::: ::  .. . .::: 
CCDS49 LQEYTVKLSNIPTIKRFLEPGSKKKPPPDEIYVRTVYNIFRP
              190       200       210       220  

>>CCDS41679.1 GSTP1 gene_id:2950|Hs108|chr11              (210 aa)
 initn: 253 init1: 124 opt: 279  Z-score: 366.5  bits: 74.9 E(32554): 4.3e-14
Smith-Waterman score: 279; 30.0% identity (59.6% similar) in 203 aa overlap (9-209:8-205)

               10        20        30        40        50        60
pF1KE0 MAEKPKLHYSNARGSMESIRWLLAAAGVELEEKFLESAEDLDKLRNDGSLLFQQVPMVEI
               :  .::   ..: :::  :   .:. . ..:  ..    .: :. :.:  . 
CCDS41  MPPYTVVYFPVRGRCAALRMLLADQGQSWKEEVV-TVETWQEGSLKASCLYGQLPKFQD
                10        20        30         40        50        

               70        80        90       100       110       120
pF1KE0 DGMKLVQTRAILNYIASKYNLYGKDMKERALIDMYTEGIVDLTEMILLLLICQPEERDAK
         . : :. .:: ...   .:::::..: ::.:: ..:. ::    . :.  . :   .:
CCDS41 GDLTLYQSNTILRHLGRTLGLYGKDQQEAALVDMVNDGVEDLRCKYISLIYTNYEA--GK
       60        70        80        90       100       110        

              130       140         150       160       170        
pF1KE0 TALVKEKIKNRYFPAFEKVLKSHR--QDYLVGNKLSWADIHLVELFYYVEELDSSLISSF
          ::  . ..  : :: .:....  . ..::...:.:: .:..:.   : :  . ...:
CCDS41 DDYVKA-LPGQLKP-FETLLSQNQGGKTFIVGDQISFADYNLLDLLLIHEVLAPGCLDAF
        120         130       140       150       160       170    

      180       190       200       210       220  
pF1KE0 PLLKALKTRISNLPTVKKFLQPGSQRKPPMDEKSLEEARKIFRF
       :::.:   :.:  : .: ::      . :..             
CCDS41 PLLSAYVGRLSARPKLKAFLASPEYVNLPINGNGKQ        
          180       190       200       210        

>>CCDS3640.1 HPGDS gene_id:27306|Hs108|chr4               (199 aa)
 initn: 131 init1:  58 opt: 241  Z-score: 317.7  bits: 65.8 E(32554): 2.2e-11
Smith-Waterman score: 241; 25.2% identity (60.9% similar) in 202 aa overlap (6-205:5-198)

               10        20        30        40        50        60
pF1KE0 MAEKPKLHYSNARGSMESIRWLLAAAGVELEEKFLESAEDLDKLRNDGSLLFQQVPMVEI
            :: : : ::  : ::...:   .. :.. .:.: :  ....  .: : ..:..:.
CCDS36  MPNYKLTYFNMRGRAEIIRYIFAYLDIQYEDHRIEQA-DWPEIKS--TLPFGKIPILEV
                10        20        30         40          50      

               70        80        90       100       110       120
pF1KE0 DGMKLVQTRAILNYIASKYNLYGKDMKERALIDMYTEGIVDLTEMILLLLICQPEERDAK
       ::. : :. ::  :.... .: :.   :.  .:    .:::  . ..  .    ...:.:
CCDS36 DGLTLHQSLAIARYLTKNTDLAGNTEMEQCHVD----AIVDTLDDFMSCFPWAEKKQDVK
         60        70        80            90       100       110  

              130       140         150       160       170        
pF1KE0 TALVKEKIKNRYFPAFEKVLKSHR--QDYLVGNKLSWADIHLVELFYYVEELDSSLISSF
         . .: . .   : . . : ..   ...:.::...:::..       .  .  .:... 
CCDS36 EQMFNELL-TYNAPHLMQDLDTYLGGREWLIGNSVTWADFYWEICSTTLLVFKPDLLDNH
            120        130       140       150       160       170 

      180       190       200       210       220  
pF1KE0 PLLKALKTRISNLPTVKKFLQPGSQRKPPMDEKSLEEARKIFRF
       : : .:. ... .:.: ....   : :                 
CCDS36 PRLVTLRKKVQAIPAVANWIKRRPQTKL                
             180       190                         




222 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Wed Nov  2 23:16:42 2016 done: Wed Nov  2 23:16:42 2016
 Total Scan time:  1.520 Total Display time:  0.000

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or Suggestions ?
Send a message to flexiclone AT kazusagt.com