FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5188, 129 aa 1>>>pF1KE5188 129 - 129 aa - 129 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.9588+/-0.000558; mu= 12.6253+/- 0.034 mean_var=54.7398+/-10.699, 0's: 0 Z-trim(112.1): 16 B-trim: 0 in 0/52 Lambda= 0.173350 statistics sampled from 12881 (12895) to 12881 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.774), E-opt: 0.2 (0.396), width: 16 Scan time: 1.760 The best scores are: opt bits E(32554) CCDS4948.1 GSTA4 gene_id:2941|Hs108|chr6 ( 222) 838 216.7 5.5e-57 CCDS4944.1 GSTA2 gene_id:2939|Hs108|chr6 ( 222) 412 110.2 6.5e-25 CCDS4945.1 GSTA1 gene_id:2938|Hs108|chr6 ( 222) 397 106.4 8.7e-24 CCDS4947.1 GSTA3 gene_id:2940|Hs108|chr6 ( 222) 395 105.9 1.2e-23 CCDS4946.1 GSTA5 gene_id:221357|Hs108|chr6 ( 222) 377 101.4 2.8e-22 >>CCDS4948.1 GSTA4 gene_id:2941|Hs108|chr6 (222 aa) initn: 838 init1: 838 opt: 838 Z-score: 1136.6 bits: 216.7 E(32554): 5.5e-57 Smith-Waterman score: 838; 100.0% identity (100.0% similar) in 129 aa overlap (1-129:94-222) 10 20 30 pF1KE5 MYVEGTLDLLELLIMHPFLKPDDQQKEVVN :::::::::::::::::::::::::::::: CCDS49 KLVQTRSILHYIADKHNLFGKNLKERTLIDMYVEGTLDLLELLIMHPFLKPDDQQKEVVN 70 80 90 100 110 120 40 50 60 70 80 90 pF1KE5 MAQKAIIRYFPVFEKILRGHGQSFLVGNQLSLADVILLQTILALEEKIPNILSAFPFLQE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS49 MAQKAIIRYFPVFEKILRGHGQSFLVGNQLSLADVILLQTILALEEKIPNILSAFPFLQE 130 140 150 160 170 180 100 110 120 pF1KE5 YTVKLSNIPTIKRFLEPGSKKKPPPDEIYVRTVYNIFRP ::::::::::::::::::::::::::::::::::::::: CCDS49 YTVKLSNIPTIKRFLEPGSKKKPPPDEIYVRTVYNIFRP 190 200 210 220 >>CCDS4944.1 GSTA2 gene_id:2939|Hs108|chr6 (222 aa) initn: 412 init1: 412 opt: 412 Z-score: 560.8 bits: 110.2 E(32554): 6.5e-25 Smith-Waterman score: 412; 43.8% identity (78.1% similar) in 128 aa overlap (1-128:94-221) 10 20 30 pF1KE5 MYVEGTLDLLELLIMHPFLKPDDQQKEVVN ::.:: :: :.... :: .:..:. ... CCDS49 KLVQTRAILNYIASKYNLYGKDIKEKALIDMYIEGIADLGEMILLLPFSQPEEQDAKLAL 70 80 90 100 110 120 40 50 60 70 80 90 pF1KE5 MAQKAIIRYFPVFEKILRGHGQSFLVGNQLSLADVILLQTILALEEKIPNILSAFPFLQE . .:. ::::.:::.:..:::..::::.:: ::. :.. . .:: ...:.::.:. CCDS49 IQEKTKNRYFPAFEKVLKSHGQDYLVGNKLSRADIHLVELLYYVEELDSSLISSFPLLKA 130 140 150 160 170 180 100 110 120 pF1KE5 YTVKLSNIPTIKRFLEPGSKKKPPPDEIYVRTVYNIFRP ...::.::.:.::.::: .::: :: .. .::: CCDS49 LKTRISNLPTVKKFLQPGSPRKPPMDEKSLEESRKIFRF 190 200 210 220 >>CCDS4945.1 GSTA1 gene_id:2938|Hs108|chr6 (222 aa) initn: 397 init1: 397 opt: 397 Z-score: 540.5 bits: 106.4 E(32554): 8.7e-24 Smith-Waterman score: 397; 42.2% identity (76.6% similar) in 128 aa overlap (1-128:94-221) 10 20 30 pF1KE5 MYVEGTLDLLELLIMHPFLKPDDQQKEVVN ::.:: :: :.... : :.... ... CCDS49 KLVQTRAILNYIASKYNLYGKDIKERALIDMYIEGIADLGEMILLLPVCPPEEKDAKLAL 70 80 90 100 110 120 40 50 60 70 80 90 pF1KE5 MAQKAIIRYFPVFEKILRGHGQSFLVGNQLSLADVILLQTILALEEKIPNILSAFPFLQE . .: ::::.:::.:..:::..::::.:: ::. :.. . .:: ...:.::.:. CCDS49 IKEKIKNRYFPAFEKVLKSHGQDYLVGNKLSRADIHLVELLYYVEELDSSLISSFPLLKA 130 140 150 160 170 180 100 110 120 pF1KE5 YTVKLSNIPTIKRFLEPGSKKKPPPDEIYVRTVYNIFRP ...::.::.:.::.::: .::: :: .. . .::: CCDS49 LKTRISNLPTVKKFLQPGSPRKPPMDEKSLEEARKIFRF 190 200 210 220 >>CCDS4947.1 GSTA3 gene_id:2940|Hs108|chr6 (222 aa) initn: 395 init1: 395 opt: 395 Z-score: 537.8 bits: 105.9 E(32554): 1.2e-23 Smith-Waterman score: 395; 41.4% identity (77.3% similar) in 128 aa overlap (1-128:94-221) 10 20 30 pF1KE5 MYVEGTLDLLELLIMHPFLKPDDQQKEVVN ::.:: :: :.... :. .:.... ... CCDS49 KLVQTRAILNYIASKYNLYGKDIKERALIDMYTEGMADLNEMILLLPLCRPEEKDAKIAL 70 80 90 100 110 120 40 50 60 70 80 90 pF1KE5 MAQKAIIRYFPVFEKILRGHGQSFLVGNQLSLADVILLQTILALEEKIPNILSAFPFLQE . .:. ::::.:::.:..:::..::::.:: ::. :.. . .:: ...: ::.:. CCDS49 IKEKTKSRYFPAFEKVLQSHGQDYLVGNKLSRADISLVELLYYVEELDSSLISNFPLLKA 130 140 150 160 170 180 100 110 120 pF1KE5 YTVKLSNIPTIKRFLEPGSKKKPPPDEIYVRTVYNIFRP ...::.::.:.::.::: .::: : .. . .::: CCDS49 LKTRISNLPTVKKFLQPGSPRKPPADAKALEEARKIFRF 190 200 210 220 >>CCDS4946.1 GSTA5 gene_id:221357|Hs108|chr6 (222 aa) initn: 377 init1: 377 opt: 377 Z-score: 513.5 bits: 101.4 E(32554): 2.8e-22 Smith-Waterman score: 377; 40.6% identity (78.1% similar) in 128 aa overlap (1-128:94-221) 10 20 30 pF1KE5 MYVEGTLDLLELLIMHPFLKPDDQQKEVVN ::.:: .:: :.... . .:.... ... CCDS49 KLVQTRAILNYIASKYNLYGKDMKERALIDMYTEGIVDLTEMILLLLICQPEERDAKTAL 70 80 90 100 110 120 40 50 60 70 80 90 pF1KE5 MAQKAIIRYFPVFEKILRGHGQSFLVGNQLSLADVILLQTILALEEKIPNILSAFPFLQE . .: ::::.:::.:..: :..::::.:: ::. :.. . .:: ...:.::.:. CCDS49 VKEKIKNRYFPAFEKVLKSHRQDYLVGNKLSWADIHLVELFYYVEELDSSLISSFPLLKA 130 140 150 160 170 180 100 110 120 pF1KE5 YTVKLSNIPTIKRFLEPGSKKKPPPDEIYVRTVYNIFRP ...::.::.:.::.:::..::: :: .. . .::: CCDS49 LKTRISNLPTVKKFLQPGSQRKPPMDEKSLEEARKIFRF 190 200 210 220 129 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 22:24:56 2016 done: Mon Nov 7 22:24:56 2016 Total Scan time: 1.760 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]