FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1734, 241 aa 1>>>pF1KE1734 241 - 241 aa - 241 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.1167+/-0.000653; mu= 15.4151+/- 0.039 mean_var=61.1838+/-12.282, 0's: 0 Z-trim(110.7): 29 B-trim: 343 in 2/50 Lambda= 0.163967 statistics sampled from 11818 (11839) to 11818 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.75), E-opt: 0.2 (0.364), width: 16 Scan time: 2.380 The best scores are: opt bits E(32554) CCDS7555.1 GSTO1 gene_id:9446|Hs108|chr10 ( 241) 1627 392.7 1.1e-109 CCDS53573.1 GSTO1 gene_id:9446|Hs108|chr10 ( 213) 1440 348.4 2.1e-96 CCDS7556.1 GSTO2 gene_id:119391|Hs108|chr10 ( 243) 1097 267.3 6.4e-72 CCDS53575.1 GSTO2 gene_id:119391|Hs108|chr10 ( 215) 940 230.2 8.7e-61 CCDS53572.1 GSTO1 gene_id:9446|Hs108|chr10 ( 208) 835 205.3 2.5e-53 CCDS53574.1 GSTO2 gene_id:119391|Hs108|chr10 ( 209) 668 165.8 2e-41 >>CCDS7555.1 GSTO1 gene_id:9446|Hs108|chr10 (241 aa) initn: 1627 init1: 1627 opt: 1627 Z-score: 2082.3 bits: 392.7 E(32554): 1.1e-109 Smith-Waterman score: 1627; 99.6% identity (99.6% similar) in 241 aa overlap (1-241:1-241) 10 20 30 40 50 60 pF1KE1 MSGESARSLGKGSAPPGPVPEGSIRIYSMRFCPFAERTRLVLKAKGIRHEVININLKNKP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 MSGESARSLGKGSAPPGPVPEGSIRIYSMRFCPFAERTRLVLKAKGIRHEVININLKNKP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 EWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMILELF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 EWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMILELF 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 SKVPSLVGSFIRSQNKEDYDGLKEEFRKEFTKLEEVLTNKKTTFFGGNSISMIDYLIWPW ::::::::::::::::::: :::::::::::::::::::::::::::::::::::::::: CCDS75 SKVPSLVGSFIRSQNKEDYAGLKEEFRKEFTKLEEVLTNKKTTFFGGNSISMIDYLIWPW 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 FERLEAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTSEKDWQGFLELYLQNSPEACDYG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 FERLEAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTSEKDWQGFLELYLQNSPEACDYG 190 200 210 220 230 240 pF1KE1 L : CCDS75 L >>CCDS53573.1 GSTO1 gene_id:9446|Hs108|chr10 (213 aa) initn: 1440 init1: 1440 opt: 1440 Z-score: 1844.1 bits: 348.4 E(32554): 2.1e-96 Smith-Waterman score: 1440; 99.5% identity (99.5% similar) in 213 aa overlap (29-241:1-213) 10 20 30 40 50 60 pF1KE1 MSGESARSLGKGSAPPGPVPEGSIRIYSMRFCPFAERTRLVLKAKGIRHEVININLKNKP :::::::::::::::::::::::::::::::: CCDS53 MRFCPFAERTRLVLKAKGIRHEVININLKNKP 10 20 30 70 80 90 100 110 120 pF1KE1 EWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMILELF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 EWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMILELF 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE1 SKVPSLVGSFIRSQNKEDYDGLKEEFRKEFTKLEEVLTNKKTTFFGGNSISMIDYLIWPW ::::::::::::::::::: :::::::::::::::::::::::::::::::::::::::: CCDS53 SKVPSLVGSFIRSQNKEDYAGLKEEFRKEFTKLEEVLTNKKTTFFGGNSISMIDYLIWPW 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE1 FERLEAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTSEKDWQGFLELYLQNSPEACDYG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 FERLEAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTSEKDWQGFLELYLQNSPEACDYG 160 170 180 190 200 210 pF1KE1 L : CCDS53 L >>CCDS7556.1 GSTO2 gene_id:119391|Hs108|chr10 (243 aa) initn: 1103 init1: 683 opt: 1097 Z-score: 1404.7 bits: 267.3 E(32554): 6.4e-72 Smith-Waterman score: 1097; 64.0% identity (86.8% similar) in 242 aa overlap (1-241:1-242) 10 20 30 40 50 60 pF1KE1 MSGESARSLGKGSAPPGPVPEGSIRIYSMRFCPFAERTRLVLKAKGIRHEVININLKNKP :::...:.::::: :::::::: ::::::::::...::::::::: :::::.::::.::: CCDS75 MSGDATRTLGKGSQPPGPVPEGLIRIYSMRFCPYSHRTRLVLKAKDIRHEVVNINLRNKP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 EWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMILELF ::.. :.::: .::::.:: ::::::.:.:::::.::::.::.: ::::.: :::.:::: CCDS75 EWYYTKHPFGHIPVLETSQCQLIYESVIACEYLDDAYPGRKLFPYDPYERARQKMLLELF 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 SKVPSLVGS-FIRSQNKEDYDGLKEEFRKEFTKLEEVLTNKKTTFFGGNSISMIDYLIWP ::: :. .. . .. .:: .:.::..:::.: ..::::::. :::::::.:: CCDS75 CKVPHLTKECLVALRCGRECTNLKAALRQEFSNLEEILEYQNTTFFGGTCISMIDYLLWP 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE1 WFERLEAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTSEKDWQGFLELYLQNSPEACDY :::::... . .::.::: :.::..::: :::: ::: ... .::::.::.::.:.: :. CCDS75 WFERLDVYGILDCVSHTPALRLWISAMKWDPTVCALLMDKSIFQGFLNLYFQNNPNAFDF 190 200 210 220 230 240 240 pF1KE1 GL :: CCDS75 GLC >>CCDS53575.1 GSTO2 gene_id:119391|Hs108|chr10 (215 aa) initn: 946 init1: 526 opt: 940 Z-score: 1204.8 bits: 230.2 E(32554): 8.7e-61 Smith-Waterman score: 940; 62.1% identity (86.0% similar) in 214 aa overlap (29-241:1-214) 10 20 30 40 50 60 pF1KE1 MSGESARSLGKGSAPPGPVPEGSIRIYSMRFCPFAERTRLVLKAKGIRHEVININLKNKP :::::...::::::::: :::::.::::.::: CCDS53 MRFCPYSHRTRLVLKAKDIRHEVVNINLRNKP 10 20 30 70 80 90 100 110 120 pF1KE1 EWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMILELF ::.. :.::: .::::.:: ::::::.:.:::::.::::.::.: ::::.: :::.:::: CCDS53 EWYYTKHPFGHIPVLETSQCQLIYESVIACEYLDDAYPGRKLFPYDPYERARQKMLLELF 40 50 60 70 80 90 130 140 150 160 170 pF1KE1 SKVPSLVGS-FIRSQNKEDYDGLKEEFRKEFTKLEEVLTNKKTTFFGGNSISMIDYLIWP ::: :. .. . .. .:: .:.::..:::.: ..::::::. :::::::.:: CCDS53 CKVPHLTKECLVALRCGRECTNLKAALRQEFSNLEEILEYQNTTFFGGTCISMIDYLLWP 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE1 WFERLEAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTSEKDWQGFLELYLQNSPEACDY :::::... . .::.::: :.::..::: :::: ::: ... .::::.::.::.:.: :. CCDS53 WFERLDVYGILDCVSHTPALRLWISAMKWDPTVCALLMDKSIFQGFLNLYFQNNPNAFDF 160 170 180 190 200 210 240 pF1KE1 GL :: CCDS53 GLC >>CCDS53572.1 GSTO1 gene_id:9446|Hs108|chr10 (208 aa) initn: 835 init1: 835 opt: 835 Z-score: 1070.8 bits: 205.3 E(32554): 2.5e-53 Smith-Waterman score: 1349; 86.3% identity (86.3% similar) in 241 aa overlap (1-241:1-208) 10 20 30 40 50 60 pF1KE1 MSGESARSLGKGSAPPGPVPEGSIRIYSMRFCPFAERTRLVLKAKGIRHEVININLKNKP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 MSGESARSLGKGSAPPGPVPEGSIRIYSMRFCPFAERTRLVLKAKGIRHEVININLKNKP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 EWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMILELF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 EWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMILELF 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 SKVPSLVGSFIRSQNKEDYDGLKEEFRKEFTKLEEVLTNKKTTFFGGNSISMIDYLIWPW ::: :::::::::::::::::::::::: CCDS53 SKV---------------------------------LTNKKTTFFGGNSISMIDYLIWPW 130 140 190 200 210 220 230 240 pF1KE1 FERLEAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTSEKDWQGFLELYLQNSPEACDYG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 FERLEAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTSEKDWQGFLELYLQNSPEACDYG 150 160 170 180 190 200 pF1KE1 L : CCDS53 L >>CCDS53574.1 GSTO2 gene_id:119391|Hs108|chr10 (209 aa) initn: 668 init1: 668 opt: 668 Z-score: 857.2 bits: 165.8 E(32554): 2e-41 Smith-Waterman score: 970; 60.2% identity (78.4% similar) in 241 aa overlap (1-241:1-208) 10 20 30 40 50 60 pF1KE1 MSGESARSLGKGSAPPGPVPEGSIRIYSMRFCPFAERTRLVLKAKGIRHEVININLKNKP :::...:.::::: :::::::: ::::::::::...::::::::: :::::.::::.::: CCDS53 MSGDATRTLGKGSQPPGPVPEGLIRIYSMRFCPYSHRTRLVLKAKDIRHEVVNINLRNKP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 EWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMILELF ::.. :.::: .::::.:: ::::::.:.:::::.::::.::.: ::::.: :::.:::: CCDS53 EWYYTKHPFGHIPVLETSQCQLIYESVIACEYLDDAYPGRKLFPYDPYERARQKMLLELF 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 SKVPSLVGSFIRSQNKEDYDGLKEEFRKEFTKLEEVLTNKKTTFFGGNSISMIDYLIWPW :. .. :: ::::::. :::::::.::: CCDS53 CKI-------LEYQN--------------------------TTFFGGTCISMIDYLLWPW 130 140 190 200 210 220 230 240 pF1KE1 FERLEAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTSEKDWQGFLELYLQNSPEACDYG ::::... . .::.::: :.::..::: :::: ::: ... .::::.::.::.:.: :.: CCDS53 FERLDVYGILDCVSHTPALRLWISAMKWDPTVCALLMDKSIFQGFLNLYFQNNPNAFDFG 150 160 170 180 190 200 pF1KE1 L : CCDS53 LC 241 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 15:22:41 2016 done: Sun Nov 6 15:22:42 2016 Total Scan time: 2.380 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]