FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE5246, 243 aa
  1>>>pF1KE5246 243 - 243 aa - 243 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 4.7433+/-0.000643; mu= 17.2063+/- 0.039
 mean_var=55.4585+/-11.073, 0's: 0 Z-trim(110.3): 19 B-trim: 618 in 1/52
 Lambda= 0.172223
 statistics sampled from 11464 (11480) to 11464 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.734), E-opt: 0.2 (0.353), width: 16
 Scan time: 1.960

The best scores are:                                  opt  bits E(32554)
CCDS7556.1 GSTO2 gene_id:119391|Hs108|chr10   ( 243) 1718 434.3 3.5e-122
CCDS53575.1 GSTO2 gene_id:119391|Hs108|chr10  ( 215) 1527 386.8 6.2e-108
CCDS7555.1 GSTO1 gene_id:9446|Hs108|chr10     ( 241) 1098 280.2 8.3e-76
CCDS53573.1 GSTO1 gene_id:9446|Hs108|chr10    ( 213)  941 241.2 4.1e-64
CCDS53574.1 GSTO2 gene_id:119391|Hs108|chr10  ( 209)  862 221.6 3.3e-58
CCDS53572.1 GSTO1 gene_id:9446|Hs108|chr10    ( 208)  669 173.6 8.9e-44


>>CCDS7556.1 GSTO2 gene_id:119391|Hs108|chr10 (243 aa)
 initn: 1718 init1: 1718 opt: 1718 Z-score: 2307.0 bits: 434.3 E(32554): 3.5e-122
Smith-Waterman score: 1718; 100.0% identity (100.0% similar) in 243 aa overlap (1-243:1-243)

               10        20        30        40        50        60
pF1KE5 MSGDATRTLGKGSQPPGPVPEGLIRIYSMRFCPYSHRTRLVLKAKDIRHEVVNINLRNKP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 MSGDATRTLGKGSQPPGPVPEGLIRIYSMRFCPYSHRTRLVLKAKDIRHEVVNINLRNKP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 EWYYTKHPFGHIPVLETSQCQLIYESVIACEYLDDAYPGRKLFPYDPYERARQKMLLELF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 EWYYTKHPFGHIPVLETSQCQLIYESVIACEYLDDAYPGRKLFPYDPYERARQKMLLELF
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE5 CKVPHLTKECLVALRCGRECTNLKAALRQEFSNLEEILEYQNTTFFGGTCISMIDYLLWP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 CKVPHLTKECLVALRCGRECTNLKAALRQEFSNLEEILEYQNTTFFGGTCISMIDYLLWP
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE5 WFERLDVYGILDCVSHTPALRLWISAMKWDPTVCALLMDKSIFQGFLNLYFQNNPNAFDF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 WFERLDVYGILDCVSHTPALRLWISAMKWDPTVCALLMDKSIFQGFLNLYFQNNPNAFDF
              190       200       210       220       230       240

pF1KE5 GLC
       :::
CCDS75 GLC

>>CCDS53575.1 GSTO2 gene_id:119391|Hs108|chr10 (215 aa)
 initn: 1527 init1: 1527 opt: 1527 Z-score: 2051.3 bits: 386.8 E(32554): 6.2e-108
Smith-Waterman score: 1527; 100.0% identity (100.0% similar) in 215 aa overlap (29-243:1-215)

               10        20        30        40        50        60
pF1KE5 MSGDATRTLGKGSQPPGPVPEGLIRIYSMRFCPYSHRTRLVLKAKDIRHEVVNINLRNKP
                                   ::::::::::::::::::::::::::::::::
CCDS53                             MRFCPYSHRTRLVLKAKDIRHEVVNINLRNKP
                                           10        20        30

               70        80        90       100       110       120
pF1KE5 EWYYTKHPFGHIPVLETSQCQLIYESVIACEYLDDAYPGRKLFPYDPYERARQKMLLELF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 EWYYTKHPFGHIPVLETSQCQLIYESVIACEYLDDAYPGRKLFPYDPYERARQKMLLELF
             40        50        60        70        80        90

              130       140       150       160       170       180
pF1KE5 CKVPHLTKECLVALRCGRECTNLKAALRQEFSNLEEILEYQNTTFFGGTCISMIDYLLWP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 CKVPHLTKECLVALRCGRECTNLKAALRQEFSNLEEILEYQNTTFFGGTCISMIDYLLWP
            100       110       120       130       140       150

              190       200       210       220       230       240
pF1KE5 WFERLDVYGILDCVSHTPALRLWISAMKWDPTVCALLMDKSIFQGFLNLYFQNNPNAFDF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 WFERLDVYGILDCVSHTPALRLWISAMKWDPTVCALLMDKSIFQGFLNLYFQNNPNAFDF
            160       170       180       190       200       210

pF1KE5 GLC
       :::
CCDS53 GLC

>>CCDS7555.1 GSTO1 gene_id:9446|Hs108|chr10 (241 aa)
 initn: 1103 init1: 683 opt: 1098 Z-score: 1474.5 bits: 280.2 E(32554): 8.3e-76
Smith-Waterman score: 1098; 64.0% identity (87.2% similar) in 242 aa overlap (1-242:1-241)

               10        20        30        40        50        60
pF1KE5 MSGDATRTLGKGSQPPGPVPEGLIRIYSMRFCPYSHRTRLVLKAKDIRHEVVNINLRNKP
       :::...:.::::: :::::::: ::::::::::...::::::::: :::::.::::.:::
CCDS75 MSGESARSLGKGSAPPGPVPEGSIRIYSMRFCPFAERTRLVLKAKGIRHEVININLKNKP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 EWYYTKHPFGHIPVLETSQCQLIYESVIACEYLDDAYPGRKLFPYDPYERARQKMLLELF
       ::.. :.::: .::::.:: ::::::.:.:::::.::::.::.: ::::.: :::.::::
CCDS75 EWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMILELF
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE5 CKVPHLTKECLVALRCGRECTNLKAALRQEFSNLEEILEYQNTTFFGGTCISMIDYLLWP
        ::: :.   ..  .  .. ..::  .:.::..:::.:  ..::::::. :::::::.::
CCDS75 SKVPSLVGS-FIRSQNKEDYAGLKEEFRKEFTKLEEVLTNKKTTFFGGNSISMIDYLIWP
               130       140       150       160       170

              190       200       210       220       230       240
pF1KE5 WFERLDVYGILDCVSHTPALRLWISAMKWDPTVCALLMDKSIFQGFLNLYFQNNPNAFDF
       :::::... . .::.::: :.::..::: :::: ::: ... .::::.::.::.:.: :.
CCDS75 WFERLEAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTSEKDWQGFLELYLQNSPEACDY
     180       190       200       210       220       230

pF1KE5 GLC
       ::
CCDS75 GL
     240

>>CCDS53573.1 GSTO1 gene_id:9446|Hs108|chr10 (213 aa)
 initn: 946 init1: 526 opt: 941 Z-score: 1264.5 bits: 241.2 E(32554): 4.1e-64
Smith-Waterman score: 941; 62.1% identity (86.4% similar) in 214 aa overlap (29-242:1-213)

               10        20        30        40        50        60
pF1KE5 MSGDATRTLGKGSQPPGPVPEGLIRIYSMRFCPYSHRTRLVLKAKDIRHEVVNINLRNKP
                                   :::::...::::::::: :::::.::::.:::
CCDS53                             MRFCPFAERTRLVLKAKGIRHEVININLKNKP
                                           10        20        30

               70        80        90       100       110       120
pF1KE5 EWYYTKHPFGHIPVLETSQCQLIYESVIACEYLDDAYPGRKLFPYDPYERARQKMLLELF
       ::.. :.::: .::::.:: ::::::.:.:::::.::::.::.: ::::.: :::.::::
CCDS53 EWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMILELF
             40        50        60        70        80        90

              130       140       150       160       170       180
pF1KE5 CKVPHLTKECLVALRCGRECTNLKAALRQEFSNLEEILEYQNTTFFGGTCISMIDYLLWP
        ::: :.   ..  .  .. ..::  .:.::..:::.:  ..::::::. :::::::.::
CCDS53 SKVPSLVGS-FIRSQNKEDYAGLKEEFRKEFTKLEEVLTNKKTTFFGGNSISMIDYLIWP
            100        110       120       130       140       150

              190       200       210       220       230       240
pF1KE5 WFERLDVYGILDCVSHTPALRLWISAMKWDPTVCALLMDKSIFQGFLNLYFQNNPNAFDF
       :::::... . .::.::: :.::..::: :::: ::: ... .::::.::.::.:.: :.
CCDS53 WFERLEAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTSEKDWQGFLELYLQNSPEACDY
             160       170       180       190       200       210

pF1KE5 GLC
       ::
CCDS53 GL

>>CCDS53574.1 GSTO2 gene_id:119391|Hs108|chr10 (209 aa)
 initn: 884 init1: 862 opt: 862 Z-score: 1158.5 bits: 221.6 E(32554): 3.3e-58
Smith-Waterman score: 1411; 86.0% identity (86.0% similar) in 243 aa overlap (1-243:1-209)

               10        20        30        40        50        60
pF1KE5 MSGDATRTLGKGSQPPGPVPEGLIRIYSMRFCPYSHRTRLVLKAKDIRHEVVNINLRNKP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 MSGDATRTLGKGSQPPGPVPEGLIRIYSMRFCPYSHRTRLVLKAKDIRHEVVNINLRNKP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 EWYYTKHPFGHIPVLETSQCQLIYESVIACEYLDDAYPGRKLFPYDPYERARQKMLLELF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 EWYYTKHPFGHIPVLETSQCQLIYESVIACEYLDDAYPGRKLFPYDPYERARQKMLLELF
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE5 CKVPHLTKECLVALRCGRECTNLKAALRQEFSNLEEILEYQNTTFFGGTCISMIDYLLWP
       ::                                  ::::::::::::::::::::::::
CCDS53 CK----------------------------------ILEYQNTTFFGGTCISMIDYLLWP
                                                130       140

              190       200       210       220       230       240
pF1KE5 WFERLDVYGILDCVSHTPALRLWISAMKWDPTVCALLMDKSIFQGFLNLYFQNNPNAFDF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 WFERLDVYGILDCVSHTPALRLWISAMKWDPTVCALLMDKSIFQGFLNLYFQNNPNAFDF
        150       160       170       180       190       200

pF1KE5 GLC
       :::
CCDS53 GLC

>>CCDS53572.1 GSTO1 gene_id:9446|Hs108|chr10 (208 aa)
 initn: 669 init1: 669 opt: 669 Z-score: 899.4 bits: 173.6 E(32554): 8.9e-44
Smith-Waterman score: 969; 59.9% identity (77.7% similar) in 242 aa overlap (1-242:1-208)

               10        20        30        40        50        60
pF1KE5 MSGDATRTLGKGSQPPGPVPEGLIRIYSMRFCPYSHRTRLVLKAKDIRHEVVNINLRNKP
       :::...:.::::: :::::::: ::::::::::...::::::::: :::::.::::.:::
CCDS53 MSGESARSLGKGSAPPGPVPEGSIRIYSMRFCPFAERTRLVLKAKGIRHEVININLKNKP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 EWYYTKHPFGHIPVLETSQCQLIYESVIACEYLDDAYPGRKLFPYDPYERARQKMLLELF
       ::.. :.::: .::::.:: ::::::.:.:::::.::::.::.: ::::.: :::.::::
CCDS53 EWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMILELF
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE5 CKVPHLTKECLVALRCGRECTNLKAALRQEFSNLEEILEYQNTTFFGGTCISMIDYLLWP
        ::                                  :  ..::::::. :::::::.::
CCDS53 SKV----------------------------------LTNKKTTFFGGNSISMIDYLIWP
                                                130       140

              190       200       210       220       230       240
pF1KE5 WFERLDVYGILDCVSHTPALRLWISAMKWDPTVCALLMDKSIFQGFLNLYFQNNPNAFDF
       :::::... . .::.::: :.::..::: :::: ::: ... .::::.::.::.:.: :.
CCDS53 WFERLEAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTSEKDWQGFLELYLQNSPEACDY
        150       160       170       180       190       200

pF1KE5 GLC
       ::
CCDS53 GL


243 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Mon Nov 7 22:50:36 2016 done: Mon Nov 7 22:50:37 2016
 Total Scan time: 1.960 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]