FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1734, 241 aa
1>>>pF1KE1734 241 - 241 aa - 241 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.1167+/-0.000653; mu= 15.4151+/- 0.039
mean_var=61.1838+/-12.282, 0's: 0 Z-trim(110.7): 29 B-trim: 343 in 2/50
Lambda= 0.163967
statistics sampled from 11818 (11839) to 11818 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.75), E-opt: 0.2 (0.364), width: 16
Scan time: 2.380
The best scores are:                                 opt  bits E(32554)
CCDS7555.1 GSTO1 gene_id:9446|Hs108|chr10    ( 241) 1627 392.7 1.1e-109
CCDS53573.1 GSTO1 gene_id:9446|Hs108|chr10   ( 213) 1440 348.4 2.1e-96
CCDS7556.1 GSTO2 gene_id:119391|Hs108|chr10  ( 243) 1097 267.3 6.4e-72
CCDS53575.1 GSTO2 gene_id:119391|Hs108|chr10 ( 215)  940 230.2 8.7e-61
CCDS53572.1 GSTO1 gene_id:9446|Hs108|chr10   ( 208)  835 205.3 2.5e-53
CCDS53574.1 GSTO2 gene_id:119391|Hs108|chr10 ( 209)  668 165.8 2e-41
>>CCDS7555.1 GSTO1 gene_id:9446|Hs108|chr10 (241 aa)
initn: 1627 init1: 1627 opt: 1627 Z-score: 2082.3 bits: 392.7 E(32554): 1.1e-109
Smith-Waterman score: 1627; 99.6% identity (99.6% similar) in 241 aa overlap (1-241:1-241)
               10        20        30        40        50        60
pF1KE1 MSGESARSLGKGSAPPGPVPEGSIRIYSMRFCPFAERTRLVLKAKGIRHEVININLKNKP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 MSGESARSLGKGSAPPGPVPEGSIRIYSMRFCPFAERTRLVLKAKGIRHEVININLKNKP
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE1 EWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMILELF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 EWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMILELF
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE1 SKVPSLVGSFIRSQNKEDYDGLKEEFRKEFTKLEEVLTNKKTTFFGGNSISMIDYLIWPW
       ::::::::::::::::::: ::::::::::::::::::::::::::::::::::::::::
CCDS75 SKVPSLVGSFIRSQNKEDYAGLKEEFRKEFTKLEEVLTNKKTTFFGGNSISMIDYLIWPW
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KE1 FERLEAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTSEKDWQGFLELYLQNSPEACDYG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 FERLEAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTSEKDWQGFLELYLQNSPEACDYG
              190       200       210       220       230       240
pF1KE1 L
       :
CCDS75 L
>>CCDS53573.1 GSTO1 gene_id:9446|Hs108|chr10 (213 aa)
initn: 1440 init1: 1440 opt: 1440 Z-score: 1844.1 bits: 348.4 E(32554): 2.1e-96
Smith-Waterman score: 1440; 99.5% identity (99.5% similar) in 213 aa overlap (29-241:1-213)
               10        20        30        40        50        60
pF1KE1 MSGESARSLGKGSAPPGPVPEGSIRIYSMRFCPFAERTRLVLKAKGIRHEVININLKNKP
                                   ::::::::::::::::::::::::::::::::
CCDS53                             MRFCPFAERTRLVLKAKGIRHEVININLKNKP
                                           10        20        30
               70        80        90       100       110       120
pF1KE1 EWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMILELF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 EWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMILELF
             40        50        60        70        80        90
              130       140       150       160       170       180
pF1KE1 SKVPSLVGSFIRSQNKEDYDGLKEEFRKEFTKLEEVLTNKKTTFFGGNSISMIDYLIWPW
       ::::::::::::::::::: ::::::::::::::::::::::::::::::::::::::::
CCDS53 SKVPSLVGSFIRSQNKEDYAGLKEEFRKEFTKLEEVLTNKKTTFFGGNSISMIDYLIWPW
            100       110       120       130       140       150
              190       200       210       220       230       240
pF1KE1 FERLEAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTSEKDWQGFLELYLQNSPEACDYG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 FERLEAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTSEKDWQGFLELYLQNSPEACDYG
            160       170       180       190       200       210
pF1KE1 L
       :
CCDS53 L
>>CCDS7556.1 GSTO2 gene_id:119391|Hs108|chr10 (243 aa)
initn: 1103 init1: 683 opt: 1097 Z-score: 1404.7 bits: 267.3 E(32554): 6.4e-72
Smith-Waterman score: 1097; 64.0% identity (86.8% similar) in 242 aa overlap (1-241:1-242)
               10        20        30        40        50        60
pF1KE1 MSGESARSLGKGSAPPGPVPEGSIRIYSMRFCPFAERTRLVLKAKGIRHEVININLKNKP
       :::...:.::::: :::::::: ::::::::::...::::::::: :::::.::::.:::
CCDS75 MSGDATRTLGKGSQPPGPVPEGLIRIYSMRFCPYSHRTRLVLKAKDIRHEVVNINLRNKP
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE1 EWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMILELF
       ::.. :.::: .::::.:: ::::::.:.:::::.::::.::.: ::::.: :::.::::
CCDS75 EWYYTKHPFGHIPVLETSQCQLIYESVIACEYLDDAYPGRKLFPYDPYERARQKMLLELF
               70        80        90       100       110       120
                130       140       150       160       170
pF1KE1 SKVPSLVGS-FIRSQNKEDYDGLKEEFRKEFTKLEEVLTNKKTTFFGGNSISMIDYLIWP
        ::: :.   ..  .  ..  .::  .:.::..:::.:  ..::::::. :::::::.::
CCDS75 CKVPHLTKECLVALRCGRECTNLKAALRQEFSNLEEILEYQNTTFFGGTCISMIDYLLWP
              130       140       150       160       170       180
     180       190       200       210       220       230
pF1KE1 WFERLEAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTSEKDWQGFLELYLQNSPEACDY
       :::::... . .::.::: :.::..::: :::: ::: ... .::::.::.::.:.: :.
CCDS75 WFERLDVYGILDCVSHTPALRLWISAMKWDPTVCALLMDKSIFQGFLNLYFQNNPNAFDF
              190       200       210       220       230       240
     240
pF1KE1 GL
       ::
CCDS75 GLC
>>CCDS53575.1 GSTO2 gene_id:119391|Hs108|chr10 (215 aa)
initn: 946 init1: 526 opt: 940 Z-score: 1204.8 bits: 230.2 E(32554): 8.7e-61
Smith-Waterman score: 940; 62.1% identity (86.0% similar) in 214 aa overlap (29-241:1-214)
               10        20        30        40        50        60
pF1KE1 MSGESARSLGKGSAPPGPVPEGSIRIYSMRFCPFAERTRLVLKAKGIRHEVININLKNKP
                                   :::::...::::::::: :::::.::::.:::
CCDS53                             MRFCPYSHRTRLVLKAKDIRHEVVNINLRNKP
                                           10        20        30
               70        80        90       100       110       120
pF1KE1 EWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMILELF
       ::.. :.::: .::::.:: ::::::.:.:::::.::::.::.: ::::.: :::.::::
CCDS53 EWYYTKHPFGHIPVLETSQCQLIYESVIACEYLDDAYPGRKLFPYDPYERARQKMLLELF
             40        50        60        70        80        90
                130       140       150       160       170
pF1KE1 SKVPSLVGS-FIRSQNKEDYDGLKEEFRKEFTKLEEVLTNKKTTFFGGNSISMIDYLIWP
        ::: :.   ..  .  ..  .::  .:.::..:::.:  ..::::::. :::::::.::
CCDS53 CKVPHLTKECLVALRCGRECTNLKAALRQEFSNLEEILEYQNTTFFGGTCISMIDYLLWP
            100       110       120       130       140       150
     180       190       200       210       220       230
pF1KE1 WFERLEAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTSEKDWQGFLELYLQNSPEACDY
       :::::... . .::.::: :.::..::: :::: ::: ... .::::.::.::.:.: :.
CCDS53 WFERLDVYGILDCVSHTPALRLWISAMKWDPTVCALLMDKSIFQGFLNLYFQNNPNAFDF
            160       170       180       190       200       210
     240
pF1KE1 GL
       ::
CCDS53 GLC
>>CCDS53572.1 GSTO1 gene_id:9446|Hs108|chr10 (208 aa)
initn: 835 init1: 835 opt: 835 Z-score: 1070.8 bits: 205.3 E(32554): 2.5e-53
Smith-Waterman score: 1349; 86.3% identity (86.3% similar) in 241 aa overlap (1-241:1-208)
               10        20        30        40        50        60
pF1KE1 MSGESARSLGKGSAPPGPVPEGSIRIYSMRFCPFAERTRLVLKAKGIRHEVININLKNKP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 MSGESARSLGKGSAPPGPVPEGSIRIYSMRFCPFAERTRLVLKAKGIRHEVININLKNKP
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE1 EWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMILELF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 EWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMILELF
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE1 SKVPSLVGSFIRSQNKEDYDGLKEEFRKEFTKLEEVLTNKKTTFFGGNSISMIDYLIWPW
       :::                                 ::::::::::::::::::::::::
CCDS53 SKV---------------------------------LTNKKTTFFGGNSISMIDYLIWPW
                                               130       140
              190       200       210       220       230       240
pF1KE1 FERLEAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTSEKDWQGFLELYLQNSPEACDYG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 FERLEAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTSEKDWQGFLELYLQNSPEACDYG
       150       160       170       180       190       200
pF1KE1 L
       :
CCDS53 L
>>CCDS53574.1 GSTO2 gene_id:119391|Hs108|chr10 (209 aa)
initn: 668 init1: 668 opt: 668 Z-score: 857.2 bits: 165.8 E(32554): 2e-41
Smith-Waterman score: 970; 60.2% identity (78.4% similar) in 241 aa overlap (1-241:1-208)
               10        20        30        40        50        60
pF1KE1 MSGESARSLGKGSAPPGPVPEGSIRIYSMRFCPFAERTRLVLKAKGIRHEVININLKNKP
       :::...:.::::: :::::::: ::::::::::...::::::::: :::::.::::.:::
CCDS53 MSGDATRTLGKGSQPPGPVPEGLIRIYSMRFCPYSHRTRLVLKAKDIRHEVVNINLRNKP
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE1 EWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMILELF
       ::.. :.::: .::::.:: ::::::.:.:::::.::::.::.: ::::.: :::.::::
CCDS53 EWYYTKHPFGHIPVLETSQCQLIYESVIACEYLDDAYPGRKLFPYDPYERARQKMLLELF
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE1 SKVPSLVGSFIRSQNKEDYDGLKEEFRKEFTKLEEVLTNKKTTFFGGNSISMIDYLIWPW
        :.       .. ::                          ::::::. :::::::.:::
CCDS53 CKI-------LEYQN--------------------------TTFFGGTCISMIDYLLWPW
                                               130       140
              190       200       210       220       230       240
pF1KE1 FERLEAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTSEKDWQGFLELYLQNSPEACDYG
       ::::... . .::.::: :.::..::: :::: ::: ... .::::.::.::.:.: :.:
CCDS53 FERLDVYGILDCVSHTPALRLWISAMKWDPTVCALLMDKSIFQGFLNLYFQNNPNAFDFG
       150       160       170       180       190       200
pF1KE1 L
       :
CCDS53 LC
241 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 15:22:41 2016 done: Sun Nov 6 15:22:42 2016
Total Scan time: 2.380 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]