FASTA searches a protein or DNA sequence data bank
 version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0115, 228 aa
1>>>pF1KE0115 228 - 228 aa - 228 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.7180+/-0.000812; mu= 12.3463+/- 0.049
mean_var=75.0082+/-14.861, 0's: 0 Z-trim(108.1): 31 B-trim: 67 in 1/49
Lambda= 0.148088
statistics sampled from 9976 (9999) to 9976 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.69), E-opt: 0.2 (0.307), width: 16
Scan time: 2.220
The best scores are:                                opt  bits E(32554)
CCDS9001.1 THAP2 gene_id:83591|Hs108|chr12  ( 228) 1515 332.6 1.3e-91
CCDS86.1 THAP3 gene_id:90326|Hs108|chr1     ( 175)  280  68.7 2.7e-12
CCDS6136.1 THAP1 gene_id:55145|Hs108|chr8   ( 213)  277  68.1   5e-12
CCDS55572.1 THAP3 gene_id:90326|Hs108|chr1  ( 239)  275  67.7 7.4e-12
CCDS55573.1 THAP3 gene_id:90326|Hs108|chr1  ( 238)  270  66.6 1.6e-11
>>CCDS9001.1 THAP2 gene_id:83591|Hs108|chr12 (228 aa)
initn: 1515 init1: 1515 opt: 1515 Z-score: 1758.4 bits: 332.6 E(32554): 1.3e-91
Smith-Waterman score: 1515; 100.0% identity (100.0% similar) in 228 aa overlap (1-228:1-228)
               10        20        30        40        50        60
pF1KE0 MPTNCAAAGCATTYNKHINISFHRFPLDPKRRKEWVRLVRRKNFVPGKHTFLCSKHFEAS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS90 MPTNCAAAGCATTYNKHINISFHRFPLDPKRRKEWVRLVRRKNFVPGKHTFLCSKHFEAS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 CFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQQV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS90 CFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQQV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 LLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEANS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS90 LLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEANS
              130       140       150       160       170       180

              190       200       210       220
pF1KE0 VLPKGTSEHMLPTALSSLPLEDFKILEQDQQDKTLLSLNLKQTKSTFI
       ::::::::::::::::::::::::::::::::::::::::::::::::
CCDS90 VLPKGTSEHMLPTALSSLPLEDFKILEQDQQDKTLLSLNLKQTKSTFI
              190       200       210       220
>>CCDS86.1 THAP3 gene_id:90326|Hs108|chr1 (175 aa)
initn: 257 init1: 185 opt: 280 Z-score: 334.1 bits: 68.7 E(32554): 2.7e-12
Smith-Waterman score: 280; 34.6% identity (63.9% similar) in 133 aa overlap (1-131:1-130)
10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE
:: .::: : . :. .. ...:::::.. :. :::: . : :: : .:: .::.::.
CCDS86 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE0 ASCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQ
::. :. . :: .::::.: : .... .. ...:. : .. :.. .
CCDS86 PECFSAFGNRKNLKHNAVPTVFAFQDPTQQVRENTDPASERGNASSS---QKEKTSPCRS
70 80 90 100 110
120 130 140 150 160 170
pF1KE0 QVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEA
::: : . . .:
CCDS86 QVLPEAGAGEDSPGRNMDTALEELQLPPNAEGHVKQAMLFNVENGTPASREALWLSEE
120 130 140 150 160 170
>>CCDS6136.1 THAP1 gene_id:55145|Hs108|chr8 (213 aa)
initn: 339 init1: 171 opt: 277 Z-score: 329.4 bits: 68.1 E(32554): 5e-12
Smith-Waterman score: 366; 35.3% identity (63.2% similar) in 201 aa overlap (1-185:1-197)
10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYNKHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFEA
: .:.: :: . :.: .:::.::: :. ::: :::::: : :.. .::.::
CCDS61 MVQSCSAYGCKNRYDKDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFTP
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE0 SCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAG-P---SNLKSNI
.:: ... :: .:::::: .::. .. :...::. ... : : :.. . :
CCDS61 DCFKRECNNKLLKENAVPTIF-LCTEPHD---KKEDLLEPQEQLPPPPLPPPVSQVDAAI
70 80 90 100 110
120 130 140 150 160
pF1KE0 S----------SQQVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRR
. . .:. .:.:. .. :. .::: .::... .::.:.:: :. :: :.
CCDS61 GLLMPPLQTPVNLSVFCDHNYTVEDTMHQRKRIHQLEQQVEKLRKKLKTAQQRCRRQERQ
120 130 140 150 160 170
170 180 190 200 210 220
pF1KE0 WIKATCLVK-NLEANSVLPKGTSEHMLPTALSSLPLEDFKILEQDQQDKTLLSLNLKQTK
: .:. . : ..: .:
CCDS61 LEKLKEVVHFQKEKDDVSERGYVILPNDYFEIVEVPA
180 190 200 210
>>CCDS55572.1 THAP3 gene_id:90326|Hs108|chr1 (239 aa)
initn: 309 init1: 185 opt: 275 Z-score: 326.3 bits: 67.7 E(32554): 7.4e-12
Smith-Waterman score: 275; 37.7% identity (67.0% similar) in 106 aa overlap (1-104:1-106)
10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE
:: .::: : . :. .. ...:::::.. :. :::: . : :: : .:: .::.::.
CCDS55 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE0 ASCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQ
::. :. . :: .::::.: : .... .. ...:. :
CCDS55 PECFSAFGNRKNLKHNAVPTVFAFQDPTQQVRENTDPASERGNASSSQKEKVLPEAGAGE
70 80 90 100 110 120
120 130 140 150 160 170
pF1KE0 QVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEA
CCDS55 DSPGRNMDTALEELQLPPNAEGHVKQVSPRRPQATEAVGRPTGPAGLRRTPNKQPSDHSY
130 140 150 160 170 180
>>CCDS55573.1 THAP3 gene_id:90326|Hs108|chr1 (238 aa)
initn: 309 init1: 185 opt: 270 Z-score: 320.6 bits: 66.6 E(32554): 1.6e-11
Smith-Waterman score: 270; 45.2% identity (70.2% similar) in 84 aa overlap (1-82:1-84)
10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE
:: .::: : . :. .. ...:::::.. :. :::: . : :: : .:: .::.::.
CCDS55 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE0 ASCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQ
::. :. . :: .::::.: :
CCDS55 PECFSAFGNRKNLKHNAVPTVFAFQDPTQVRENTDPASERGNASSSQKEKVLPEAGAGED
70 80 90 100 110 120
228 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 02:28:16 2016 done: Fri Nov 4 02:28:16 2016
Total Scan time: 2.220 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]
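
The "best scores" table in the report above has a regular shape: an identifier, a free-text description, the library sequence length in parentheses, then the opt score, bit score and E() value. As a minimal sketch, assuming that layout holds for every hit line, the rows could be pulled out with a regular expression. The names HIT_RE and parse_best_scores are hypothetical helpers invented for this illustration and are not part of the FASTA package; the sample lines are copied from the report.

```python
import re

# Hypothetical pattern for one row of the "best scores" table.
# Assumed layout: id, description, "( length)", opt, bits, E-value.
HIT_RE = re.compile(
    r"^(?P<id>\S+)\s+(?P<desc>.*?)\s*"
    r"\(\s*(?P<length>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<evalue>[\d.eE+-]+)\s*$"
)


def parse_best_scores(lines):
    """Return one dict per line that looks like a best-scores row."""
    hits = []
    for line in lines:
        m = HIT_RE.match(line)
        if m is None:
            # Headers, alignment blocks and statistics lines are skipped.
            continue
        hits.append({
            "id": m.group("id"),
            "description": m.group("desc"),
            "length": int(m.group("length")),
            "opt": int(m.group("opt")),
            "bits": float(m.group("bits")),
            "evalue": float(m.group("evalue")),
        })
    return hits


# Sample lines copied from the report (table header, three hits, and one
# alignment header that should be ignored).
sample = [
    "The best scores are: opt bits E(32554)",
    "CCDS9001.1 THAP2 gene_id:83591|Hs108|chr12 ( 228) 1515 332.6 1.3e-91",
    "CCDS86.1 THAP3 gene_id:90326|Hs108|chr1 ( 175) 280 68.7 2.7e-12",
    "CCDS6136.1 THAP1 gene_id:55145|Hs108|chr8 ( 213) 277 68.1 5e-12",
    ">>CCDS9001.1 THAP2 gene_id:83591|Hs108|chr12 (228 aa)",
]
hits = parse_best_scores(sample)
```

The pattern tolerates either single or padded spacing between columns, so it should work whether or not the column alignment of the original output has been preserved. For production use, an established parser for FASTA output is likely a safer choice than a hand-written expression.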