FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0295, 236 aa
1>>>pF1KE0295 236 - 236 aa - 236 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.0881+/-0.000701; mu= 15.1697+/- 0.042
mean_var=60.7294+/-12.310, 0's: 0 Z-trim(108.9): 22 B-trim: 0 in 0/51
Lambda= 0.164579
statistics sampled from 10536 (10552) to 10536 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.708), E-opt: 0.2 (0.324), width: 16
Scan time: 2.430
The best scores are:                                  opt  bits E(32554)
CCDS35437.1 TREX2 gene_id:11219|Hs108|chrX    ( 236) 1592 386.0 1.1e-107
CCDS59451.1 TREX1 gene_id:11277|Hs108|chr3    ( 304)  605 151.7 4.9e-37
CCDS2769.1 TREX1 gene_id:11277|Hs108|chr3     ( 314)  605 151.7 5.1e-37
CCDS43086.1 TREX1 gene_id:11277|Hs108|chr3    ( 369)  605 151.8 5.8e-37
>>CCDS35437.1 TREX2 gene_id:11219|Hs108|chrX (236 aa)
initn: 1592 init1: 1592 opt: 1592 Z-score: 2046.5 bits: 386.0 E(32554): 1.1e-107
Smith-Waterman score: 1592; 100.0% identity (100.0% similar) in 236 aa overlap (1-236:1-236)
               10        20        30        40        50        60
pF1KE0 MSEAPRAETFVFLDLEATGLPSVEPEIAELSLFAVHRSSLENPEHDESGALVLPRVLDKL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 MSEAPRAETFVFLDLEATGLPSVEPEIAELSLFAVHRSSLENPEHDESGALVLPRVLDKL
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE0 TLCMCPERPFTAKASEITGLSSEGLARCRKAGFDGAVVRTLQAFLSRQAGPICLVAHNGF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 TLCMCPERPFTAKASEITGLSSEGLARCRKAGFDGAVVRTLQAFLSRQAGPICLVAHNGF
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE0 DYDFPLLCAELRRLGARLPRDTVCLDTLPALRGLDRAHSHGTRARGRQGYSLGSLFHRYF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 DYDFPLLCAELRRLGARLPRDTVCLDTLPALRGLDRAHSHGTRARGRQGYSLGSLFHRYF
              130       140       150       160       170       180
              190       200       210       220       230
pF1KE0 RAEPSAAHSAEGDVHTLLLIFLHRAAELLAWADEQARGWAHIEPMYLPPDDPSLEA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 RAEPSAAHSAEGDVHTLLLIFLHRAAELLAWADEQARGWAHIEPMYLPPDDPSLEA
              190       200       210       220       230
>>CCDS59451.1 TREX1 gene_id:11277|Hs108|chr3 (304 aa)
initn: 412 init1: 412 opt: 605 Z-score: 778.3 bits: 151.7 E(32554): 4.9e-37
Smith-Waterman score: 605; 46.2% identity (68.2% similar) in 223 aa overlap (8-226:2-223)
10 20 30 40 50
pF1KE0 MSEAPRAETFVFLDLEATGLPSVEPEIAELSLFAVHRSSLENPEHDESGALVLP---RVL
.:..:.:.:::::: .:...:: :.:::: .::.: ... ..: ::.
CCDS59 MQTLIFFDMEATGLPFSQPKVTELCLLAVHRCALESPPTSQGPPPTVPPPPRVV
10 20 30 40 50
60 70 80 90 100 110
pF1KE0 DKLTLCMCPERPFTAKASEITGLSSEGLARCRKAGFDGAVVRTLQAFLSRQAGPICLVAH
:::.::. : . . ::::::::. :: . :: .. : ::: :: : :::::
CCDS59 DKLSLCVAPGKACSPAASEITGLSTAVLAAHGRQCFDDNLANLLLAFLRRQPQPWCLVAH
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE0 NGFDYDFPLLCAELRRLGARLPRD-TVCLDTLPALRGLDRAHSHGTRARGRQGYSLGSLF
:: :::::: ::: :: : . :.:.. ::..:.:: : . .. :..:::::..
CCDS59 NGDRYDFPLLQAELAMLGLTSALDGAFCVDSITALKALERASSPSEHG-PRKSYSLGSIY
120 130 140 150 160 170
180 190 200 210 220 230
pF1KE0 HRYFRAEPSAAHSAEGDVHTLLLIFLHRAAELLAWADEQARGWAHIEPMYLPPDDPSLEA
: . : .:.::::: .:: : : :: :.: .:: .. :.:::
CCDS59 TRLYGQSPPDSHTAEGDVLALLSICQWRPQALLRWVDAHARPFGTIRPMYGVTASARTKP
180 190 200 210 220 230
CCDS59 RPSAVTTTAHLATTRNTSPSLGESRGTKDLPPVKDPGALSREGLLAPLGLLAILTLAVAT
240 250 260 270 280 290
>>CCDS2769.1 TREX1 gene_id:11277|Hs108|chr3 (314 aa)
initn: 412 init1: 412 opt: 605 Z-score: 778.1 bits: 151.7 E(32554): 5.1e-37
Smith-Waterman score: 605; 46.2% identity (68.2% similar) in 223 aa overlap (8-226:12-233)
10 20 30 40 50
pF1KE0 MSEAPRAETFVFLDLEATGLPSVEPEIAELSLFAVHRSSLENPEHDESGALVLP--
.:..:.:.:::::: .:...:: :.:::: .::.: ... ..:
CCDS27 MGSQALPPGPMQTLIFFDMEATGLPFSQPKVTELCLLAVHRCALESPPTSQGPPPTVPPP
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE0 -RVLDKLTLCMCPERPFTAKASEITGLSSEGLARCRKAGFDGAVVRTLQAFLSRQAGPIC
::.:::.::. : . . ::::::::. :: . :: .. : ::: :: : :
CCDS27 PRVVDKLSLCVAPGKACSPAASEITGLSTAVLAAHGRQCFDDNLANLLLAFLRRQPQPWC
70 80 90 100 110 120
120 130 140 150 160 170
pF1KE0 LVAHNGFDYDFPLLCAELRRLGARLPRD-TVCLDTLPALRGLDRAHSHGTRARGRQGYSL
:::::: :::::: ::: :: : . :.:.. ::..:.:: : . .. :..:::
CCDS27 LVAHNGDRYDFPLLQAELAMLGLTSALDGAFCVDSITALKALERASSPSEHG-PRKSYSL
130 140 150 160 170
180 190 200 210 220 230
pF1KE0 GSLFHRYFRAEPSAAHSAEGDVHTLLLIFLHRAAELLAWADEQARGWAHIEPMYLPPDDP
::.. : . : .:.::::: .:: : : :: :.: .:: .. :.:::
CCDS27 GSIYTRLYGQSPPDSHTAEGDVLALLSICQWRPQALLRWVDAHARPFGTIRPMYGVTASA
180 190 200 210 220 230
pF1KE0 SLEA
CCDS27 RTKPRPSAVTTTAHLATTRNTSPSLGESRGTKDLPPVKDPGALSREGLLAPLGLLAILTL
240 250 260 270 280 290
>>CCDS43086.1 TREX1 gene_id:11277|Hs108|chr3 (369 aa)
initn: 412 init1: 412 opt: 605 Z-score: 777.1 bits: 151.8 E(32554): 5.8e-37
Smith-Waterman score: 605; 46.2% identity (68.2% similar) in 223 aa overlap (8-226:67-288)
10 20 30
pF1KE0 MSEAPRAETFVFLDLEATGLPSVEPEIAELSLFAVHR
.:..:.:.:::::: .:...:: :.::::
CCDS43 THTPTPCSSPGSAAGTYPTMGSQALPPGPMQTLIFFDMEATGLPFSQPKVTELCLLAVHR
40 50 60 70 80 90
40 50 60 70 80 90
pF1KE0 SSLENPEHDESGALVLP---RVLDKLTLCMCPERPFTAKASEITGLSSEGLARCRKAGFD
.::.: ... ..: ::.:::.::. : . . ::::::::. :: . ::
CCDS43 CALESPPTSQGPPPTVPPPPRVVDKLSLCVAPGKACSPAASEITGLSTAVLAAHGRQCFD
100 110 120 130 140 150
100 110 120 130 140 150
pF1KE0 GAVVRTLQAFLSRQAGPICLVAHNGFDYDFPLLCAELRRLGARLPRD-TVCLDTLPALRG
.. : ::: :: : ::::::: :::::: ::: :: : . :.:.. ::..
CCDS43 DNLANLLLAFLRRQPQPWCLVAHNGDRYDFPLLQAELAMLGLTSALDGAFCVDSITALKA
160 170 180 190 200 210
160 170 180 190 200 210
pF1KE0 LDRAHSHGTRARGRQGYSLGSLFHRYFRAEPSAAHSAEGDVHTLLLIFLHRAAELLAWAD
:.:: : . .. :..:::::.. : . : .:.::::: .:: : : :: :.:
CCDS43 LERASSPSEHG-PRKSYSLGSIYTRLYGQSPPDSHTAEGDVLALLSICQWRPQALLRWVD
220 230 240 250 260 270
220 230
pF1KE0 EQARGWAHIEPMYLPPDDPSLEA
.:: .. :.:::
CCDS43 AHARPFGTIRPMYGVTASARTKPRPSAVTTTAHLATTRNTSPSLGESRGTKDLPPVKDPG
280 290 300 310 320 330
236 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 17:28:40 2016 done: Thu Nov 3 17:28:41 2016
Total Scan time: 2.430 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]