Result of FASTA (omim) for pFN21AE0517
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0517, 309 aa
  1>>>pF1KE0517 309 - 309 aa - 309 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 9.2958+/-0.000303; mu= -0.0872+/- 0.019
 mean_var=271.1693+/-54.827, 0's: 0 Z-trim(125.1): 17  B-trim: 21 in 1/61
 Lambda= 0.077885
 statistics sampled from 48136 (48157) to 48136 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.842), E-opt: 0.2 (0.565), width:  16
 Scan time:  8.560

The best scores are:                                      opt bits E(85289)
NP_001008695 (OMIM: 609518) THAP domain-containing ( 309) 2149 253.7 3.5e-67
NP_085050 (OMIM: 609518) THAP domain-containing pr ( 309) 2149 253.7 3.5e-67
NP_078948 (OMIM: 612537) DNA transposase THAP9 iso ( 903)  292 45.5 0.00049


>>NP_001008695 (OMIM: 609518) THAP domain-containing pro  (309 aa)
 initn: 2149 init1: 2149 opt: 2149  Z-score: 1327.2  bits: 253.7 E(85289): 3.5e-67
Smith-Waterman score: 2149; 99.7% identity (99.7% similar) in 309 aa overlap (1-309:1-309)

               10        20        30        40        50        60
pF1KE0 MPRHCSAAGCCTRDTRETRNRGISFHRLPKKDNPRRGLWLANCQRLDPSGQGLWDPASEY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MPRHCSAAGCCTRDTRETRNRGISFHRLPKKDNPRRGLWLANCQRLDPSGQGLWDPASEY
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 IYFCSKHFEEDCFELVGISGYHRLKEGAVPTIFESFSKLRRTTKTKGHSYPPGPPEVSRL
       :::::::::::::::::::::::::::::::::::::::::::::::::::::: :::::
NP_001 IYFCSKHFEEDCFELVGISGYHRLKEGAVPTIFESFSKLRRTTKTKGHSYPPGPAEVSRL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 RRCRKRCSEGRGPTTPFSPPPPADVTCFPVEEASAPATLPASPAGRLEPGLSSPFSDLLG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 RRCRKRCSEGRGPTTPFSPPPPADVTCFPVEEASAPATLPASPAGRLEPGLSSPFSDLLG
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 PLGAQADEAGCSAQPSPERQPSPLEPRPVSPSAYMLRLPPPAGAYIQNEHSYQVGSALLW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 PLGAQADEAGCSAQPSPERQPSPLEPRPVSPSAYMLRLPPPAGAYIQNEHSYQVGSALLW
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE0 KRRAEAALDALDKAQRQLQACKRREQRLRLRLTKLQQERAREKRAQADARQTLKEHVQDF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 KRRAEAALDALDKAQRQLQACKRREQRLRLRLTKLQQERAREKRAQADARQTLKEHVQDF
              250       260       270       280       290       300

                
pF1KE0 AMQLSSSMA
       :::::::::
NP_001 AMQLSSSMA
                

>>NP_085050 (OMIM: 609518) THAP domain-containing protei  (309 aa)
 initn: 2149 init1: 2149 opt: 2149  Z-score: 1327.2  bits: 253.7 E(85289): 3.5e-67
Smith-Waterman score: 2149; 99.7% identity (99.7% similar) in 309 aa overlap (1-309:1-309)

               10        20        30        40        50        60
pF1KE0 MPRHCSAAGCCTRDTRETRNRGISFHRLPKKDNPRRGLWLANCQRLDPSGQGLWDPASEY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_085 MPRHCSAAGCCTRDTRETRNRGISFHRLPKKDNPRRGLWLANCQRLDPSGQGLWDPASEY
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 IYFCSKHFEEDCFELVGISGYHRLKEGAVPTIFESFSKLRRTTKTKGHSYPPGPPEVSRL
       :::::::::::::::::::::::::::::::::::::::::::::::::::::: :::::
NP_085 IYFCSKHFEEDCFELVGISGYHRLKEGAVPTIFESFSKLRRTTKTKGHSYPPGPAEVSRL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 RRCRKRCSEGRGPTTPFSPPPPADVTCFPVEEASAPATLPASPAGRLEPGLSSPFSDLLG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_085 RRCRKRCSEGRGPTTPFSPPPPADVTCFPVEEASAPATLPASPAGRLEPGLSSPFSDLLG
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 PLGAQADEAGCSAQPSPERQPSPLEPRPVSPSAYMLRLPPPAGAYIQNEHSYQVGSALLW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_085 PLGAQADEAGCSAQPSPERQPSPLEPRPVSPSAYMLRLPPPAGAYIQNEHSYQVGSALLW
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE0 KRRAEAALDALDKAQRQLQACKRREQRLRLRLTKLQQERAREKRAQADARQTLKEHVQDF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_085 KRRAEAALDALDKAQRQLQACKRREQRLRLRLTKLQQERAREKRAQADARQTLKEHVQDF
              250       260       270       280       290       300

                
pF1KE0 AMQLSSSMA
       :::::::::
NP_085 AMQLSSSMA
                

>>NP_078948 (OMIM: 612537) DNA transposase THAP9 isoform  (903 aa)
 initn: 211 init1: 123 opt: 292  Z-score: 193.4  bits: 45.5 E(85289): 0.00049
Smith-Waterman score: 292; 43.1% identity (71.6% similar) in 109 aa overlap (1-109:1-103)

               10        20        30        40        50        60
pF1KE0 MPRHCSAAGCCTRDTRETRNRGISFHRLPKKDNPRRGLWLANCQRLDPSGQGLWDPASEY
       : : :::.:: ::::  .:.::.:::..:  :. .:. :.   .:.:: .. .: :.   
NP_078 MTRSCSAVGCSTRDTVLSRERGLSFHQFPT-DTIQRSKWIRAVNRVDPRSKKIWIPGPGA
               10        20        30         40        50         

               70        80        90       100       110       120
pF1KE0 IYFCSKHFEEDCFELVGISGYHRLKEGAVPTIFESFSKLRRTTKTKGHSYPPGPPEVSRL
       : .:::::.:. ::  ::   ..::.::::..  :. :. . .. ::..           
NP_078 I-LCSKHFQESDFESYGIR--RKLKKGAVPSV--SLYKIPQGVHLKGKARQKILKQPLPD
      60         70          80          90       100       110    

              130       140       150       160       170       180
pF1KE0 RRCRKRCSEGRGPTTPFSPPPPADVTCFPVEEASAPATLPASPAGRLEPGLSSPFSDLLG
                                                                   
NP_078 NSQEVATEDHNYSLKTPLTIGAEKLAEVQQMLQVSKKRLISVKNYRMIKKRKGLRLIDAL
          120       130       140       150       160       170    




309 residues in 1 query   sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov  3 02:21:31 2016 done: Thu Nov  3 02:21:32 2016
 Total Scan time:  8.560 Total Display time: -0.030

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or suggestions?
Send a message to flexiclone AT kazusagt.com