FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6178, 118 aa
  1>>>pF1KE6178 118 - 118 aa - 118 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 7.5131+/-0.000876; mu= 1.7466+/- 0.053
 mean_var=159.1704+/-31.257, 0's: 0 Z-trim(112.1): 18  B-trim: 11 in 1/52
 Lambda= 0.101658
 statistics sampled from 12884 (12902) to 12884 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.75), E-opt: 0.2 (0.396), width:  16
 Scan time:  1.290

The best scores are:                                      opt  bits E(32554)
CCDS4698.1 ATP6V1G2 gene_id:534|Hs108|chr6        ( 118)  753 120.9 1.9e-28
CCDS4699.1 ATP6V1G2 gene_id:534|Hs108|chr6        (  77)  506  84.5 1.1e-17
CCDS6807.1 ATP6V1G1 gene_id:9550|Hs108|chr9       ( 118)  497  83.3 3.8e-17
CCDS1395.1 ATP6V1G3 gene_id:127124|Hs108|chr1     ( 118)  437  74.5 1.7e-14
CCDS81414.1 ATP6V1G3 gene_id:127124|Hs108|chr1    ( 124)  411  70.7 2.5e-13

>>CCDS4698.1 ATP6V1G2 gene_id:534|Hs108|chr6 (118 aa)
 initn: 753 init1: 753 opt: 753  Z-score: 624.3  bits: 120.9 E(32554): 1.9e-28
Smith-Waterman score: 753; 100.0% identity (100.0% similar) in 118 aa overlap (1-118:1-118)

               10        20        30        40        50        60
pF1KE6 MASQSQGIQQLLQAEKRAAEKVADARKRKARRLKQAKEEAQMEVEQYRREREHEFQSKQQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 MASQSQGIQQLLQAEKRAAEKVADARKRKARRLKQAKEEAQMEVEQYRREREHEFQSKQQ
               10        20        30        40        50        60

               70        80        90       100       110
pF1KE6 AAMGSQGNLSAEVEQATRRQVQGMQSSQQRNRERVLAQLLGMVCDVRPQVHPNYRISA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 AAMGSQGNLSAEVEQATRRQVQGMQSSQQRNRERVLAQLLGMVCDVRPQVHPNYRISA
               70        80        90       100       110

>>CCDS4699.1 ATP6V1G2 gene_id:534|Hs108|chr6 (77 aa)
 initn: 506 init1: 506 opt: 506  Z-score: 431.0  bits: 84.5 E(32554): 1.1e-17
Smith-Waterman score: 506; 100.0% identity (100.0% similar) in 77 aa overlap (42-118:1-77)

              20        30        40        50        60        70
pF1KE6 LQAEKRAAEKVADARKRKARRLKQAKEEAQMEVEQYRREREHEFQSKQQAAMGSQGNLSA
                                     ::::::::::::::::::::::::::::::
CCDS46                               MEVEQYRREREHEFQSKQQAAMGSQGNLSA
                                             10        20        30

              80        90       100       110
pF1KE6 EVEQATRRQVQGMQSSQQRNRERVLAQLLGMVCDVRPQVHPNYRISA
       :::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 EVEQATRRQVQGMQSSQQRNRERVLAQLLGMVCDVRPQVHPNYRISA
               40        50        60        70

>>CCDS6807.1 ATP6V1G1 gene_id:9550|Hs108|chr9 (118 aa)
 initn: 546 init1: 497 opt: 497  Z-score: 421.4  bits: 83.3 E(32554): 3.8e-17
Smith-Waterman score: 497; 64.1% identity (89.7% similar) in 117 aa overlap (1-117:1-117)

               10        20        30        40        50        60
pF1KE6 MASQSQGIQQLLQAEKRAAEKVADARKRKARRLKQAKEEAQMEVEQYRREREHEFQSKQQ
       ::::::::::::::::::::::..::::: ::::::::::: :.:::: .::.::..:.
CCDS68 MASQSQGIQQLLQAEKRAAEKVSEARKRKNRRLKQAKEEAQAEIEQYRLQREKEFKAKEA
               10        20        30        40        50        60

               70        80        90       100       110
pF1KE6 AAMGSQGNLSAEVEQATRRQVQGMQSSQQRNRERVLAQLLGMVCDVRPQVHPNYRISA
       ::.::.:. :.:::. :....  .:.  ..::..:: .::..:::.::..: ::::.
CCDS68 AALGSRGSCSTEVEKETQEKMTILQTYFRQNRDEVLDNLLAFVCDIRPEIHENYRING
               70        80        90       100       110

>>CCDS1395.1 ATP6V1G3 gene_id:127124|Hs108|chr1 (118 aa)
 initn: 484 init1: 430 opt: 437  Z-score: 373.8  bits: 74.5 E(32554): 1.7e-14
Smith-Waterman score: 437; 54.7% identity (83.8% similar) in 117 aa overlap (1-117:1-117)

               10        20        30        40        50        60
pF1KE6 MASQSQGIQQLLQAEKRAAEKVADARKRKARRLKQAKEEAQMEVEQYRREREHEFQSKQQ
       :.::::::.::::::::: .:. .:.:::..:::::::::..:..::: .:..::. ::.
CCDS13 MTSQSQGIHQLLQAEKRAKDKLEEAKKRKGKRLKQAKEEAMVEIDQYRMQRDKEFRLKQS
               10        20        30        40        50        60

               70        80        90       100       110
pF1KE6 AAMGSQGNLSAEVEQATRRQVQGMQSSQQRNRERVLAQLLGMVCDVRPQVHPNYRISA
         ::::.::: :.:. :  ..: ...  ..  : :. :::.::::..:..: ::: .
CCDS13 KIMGSQNNLSDEIEEQTLGKIQELNGHYNKYMESVMNQLLSMVCDMKPEIHVNYRATN
               70        80        90       100       110

>>CCDS81414.1 ATP6V1G3 gene_id:127124|Hs108|chr1 (124 aa)
 initn: 454 init1: 300 opt: 411  Z-score: 352.9  bits: 70.7 E(32554): 2.5e-13
Smith-Waterman score: 411; 51.2% identity (79.7% similar) in 123 aa overlap (1-117:1-123)

               10        20              30        40        50
pF1KE6 MASQSQGIQQLLQAEKRAAEKVADARKR------KARRLKQAKEEAQMEVEQYRREREHE
       :.::::::.::::::::: .:. .:.:.      :..:::::::::..:..::: .:..:
CCDS81 MTSQSQGIHQLLQAEKRAKDKLEEAKKKTGTASGKGKRLKQAKEEAMVEIDQYRMQRDKE
               10        20        30        40        50        60

           60        70        80        90       100       110
pF1KE6 FQSKQQAAMGSQGNLSAEVEQATRRQVQGMQSSQQRNRERVLAQLLGMVCDVRPQVHPNY
       :. ::.  ::::.::: :.:. :  ..: ...  ..  : :. :::.::::..:..: ::
CCDS81 FRLKQSKIMGSQNNLSDEIEEQTLGKIQELNGHYNKYMESVMNQLLSMVCDMKPEIHVNY
               70        80        90       100       110       120

pF1KE6 RISA
       : .
CCDS81 RATN

118 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov  8 10:07:13 2016 done: Tue Nov  8 10:07:13 2016
 Total Scan time:  1.290 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]