FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6255, 226 aa 1>>>pF1KE6255 226 - 226 aa - 226 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.3748+/-0.00115; mu= 4.5471+/- 0.069 mean_var=111.9817+/-22.340, 0's: 0 Z-trim(104.8): 20 B-trim: 0 in 0/51 Lambda= 0.121199 statistics sampled from 8064 (8068) to 8064 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.615), E-opt: 0.2 (0.248), width: 16 Scan time: 1.650 The best scores are: opt bits E(32554) CCDS1826.1 ATP6V1E2 gene_id:90423|Hs108|chr2 ( 226) 1397 255.0 2.9e-68 CCDS13745.1 ATP6V1E1 gene_id:529|Hs108|chr22 ( 226) 1080 199.5 1.4e-51 CCDS42977.1 ATP6V1E1 gene_id:529|Hs108|chr22 ( 204) 898 167.7 4.9e-42 CCDS42978.1 ATP6V1E1 gene_id:529|Hs108|chr22 ( 196) 523 102.1 2.6e-22 >>CCDS1826.1 ATP6V1E2 gene_id:90423|Hs108|chr2 (226 aa) initn: 1397 init1: 1397 opt: 1397 Z-score: 1339.0 bits: 255.0 E(32554): 2.9e-68 Smith-Waterman score: 1397; 99.6% identity (99.6% similar) in 226 aa overlap (1-226:1-226) 10 20 30 40 50 60 pF1KE6 MALRDVDVKKQIKHMMAFIEQEANEKAEEIDAKAEEEFNIEKGRLVQTQRLKIMEYYEKK ::: :::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 MALSDVDVKKQIKHMMAFIEQEANEKAEEIDAKAEEEFNIEKGRLVQTQRLKIMEYYEKK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 EKQIEQQKKILMSTMRNQARLKVLRARNDLISDLLSEAKLRLSRIVEDPEVYQGLLDKLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 EKQIEQQKKILMSTMRNQARLKVLRARNDLISDLLSEAKLRLSRIVEDPEVYQGLLDKLV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 LQGLLRLLEPVMIVRCRPQDLLLVEAAVQKAIPEYMTISQKHVEVQIDKEAYLAVNAAGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 LQGLLRLLEPVMIVRCRPQDLLLVEAAVQKAIPEYMTISQKHVEVQIDKEAYLAVNAAGG 130 140 150 160 170 180 190 200 210 220 pF1KE6 VEVYSGNQRIKVSNTLESRLDLSAKQKMPEIRMALFGANTNRKFFI :::::::::::::::::::::::::::::::::::::::::::::: CCDS18 VEVYSGNQRIKVSNTLESRLDLSAKQKMPEIRMALFGANTNRKFFI 190 200 210 220 >>CCDS13745.1 ATP6V1E1 gene_id:529|Hs108|chr22 (226 aa) initn: 1097 init1: 1080 opt: 1080 Z-score: 1039.4 bits: 199.5 E(32554): 1.4e-51 Smith-Waterman score: 1080; 76.4% identity (89.3% similar) in 225 aa overlap (1-225:1-225) 10 20 30 40 50 60 pF1KE6 MALRDVDVKKQIKHMMAFIEQEANEKAEEIDAKAEEEFNIEKGRLVQTQRLKIMEYYEKK ::: :.::.::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 MALSDADVQKQIKHMMAFIEQEANEKAEEIDAKAEEEFNIEKGRLVQTQRLKIMEYYEKK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 EKQIEQQKKILMSTMRNQARLKVLRARNDLISDLLSEAKLRLSRIVEDPEVYQGLLDKLV :::::::::: ::.. :::::::::::.:::.:::.::: :::..:.: :: ::: :: CCDS13 EKQIEQQKKIQMSNLMNQARLKVLRARDDLITDLLNEAKQRLSKVVKDTTRYQVLLDGLV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 LQGLLRLLEPVMIVRCRPQDLLLVEAAVQKAIPEYMTISQKHVEVQIDKEAYLAVNAAGG :::: .:::: :::::: ::. ::.:::::::: : ... :.::::.:.:: . ::: CCDS13 LQGLYQLLEPRMIVRCRKQDFPLVKAAVQKAIPMYKIATKNDVDVQIDQESYLPEDIAGG 130 140 150 160 170 180 190 200 210 220 pF1KE6 VEVYSGNQRIKVSNTLESRLDLSAKQKMPEIRMALFGANTNRKFFI ::.:.:...::::::::::::: :.: :::.: ::::::.::::. CCDS13 VEIYNGDRKIKVSNTLESRLDLIAQQMMPEVRGALFGANANRKFLD 190 200 210 220 >>CCDS42977.1 ATP6V1E1 gene_id:529|Hs108|chr22 (204 aa) initn: 955 init1: 891 opt: 898 Z-score: 868.1 bits: 167.7 E(32554): 4.9e-42 Smith-Waterman score: 898; 72.1% identity (87.8% similar) in 197 aa overlap (29-225:7-203) 10 20 30 40 50 60 pF1KE6 MALRDVDVKKQIKHMMAFIEQEANEKAEEIDAKAEEEFNIEKGRLVQTQRLKIMEYYEKK ... .::::::::::::::::::::::::::: CCDS42 MALSDADVQKQAEEEFNIEKGRLVQTQRLKIMEYYEKK 10 20 30 70 80 90 100 110 120 pF1KE6 EKQIEQQKKILMSTMRNQARLKVLRARNDLISDLLSEAKLRLSRIVEDPEVYQGLLDKLV :::::::::: ::.. :::::::::::.:::.:::.::: :::..:.: :: ::: :: CCDS42 EKQIEQQKKIQMSNLMNQARLKVLRARDDLITDLLNEAKQRLSKVVKDTTRYQVLLDGLV 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE6 LQGLLRLLEPVMIVRCRPQDLLLVEAAVQKAIPEYMTISQKHVEVQIDKEAYLAVNAAGG :::: .:::: :::::: ::. ::.:::::::: : ... :.::::.:.:: . ::: CCDS42 LQGLYQLLEPRMIVRCRKQDFPLVKAAVQKAIPMYKIATKNDVDVQIDQESYLPEDIAGG 100 110 120 130 140 150 190 200 210 220 pF1KE6 VEVYSGNQRIKVSNTLESRLDLSAKQKMPEIRMALFGANTNRKFFI ::.:.:...::::::::::::: :.: :::.: ::::::.::::. CCDS42 VEIYNGDRKIKVSNTLESRLDLIAQQMMPEVRGALFGANANRKFLD 160 170 180 190 200 >>CCDS42978.1 ATP6V1E1 gene_id:529|Hs108|chr22 (196 aa) initn: 954 init1: 522 opt: 523 Z-score: 514.0 bits: 102.1 E(32554): 2.6e-22 Smith-Waterman score: 894; 67.6% identity (78.7% similar) in 225 aa overlap (1-225:1-195) 10 20 30 40 50 60 pF1KE6 MALRDVDVKKQIKHMMAFIEQEANEKAEEIDAKAEEEFNIEKGRLVQTQRLKIMEYYEKK ::: :.::.::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 MALSDADVQKQIKHMMAFIEQEANEKAEEIDAKAEEEFNIEKGRLVQTQRLKIMEYYEKK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 EKQIEQQKKILMSTMRNQARLKVLRARNDLISDLLSEAKLRLSRIVEDPEVYQGLLDKLV :::::::::: ::.. :::::::::::.:::. CCDS42 EKQIEQQKKIQMSNLMNQARLKVLRARDDLIT---------------------------- 70 80 90 130 140 150 160 170 180 pF1KE6 LQGLLRLLEPVMIVRCRPQDLLLVEAAVQKAIPEYMTISQKHVEVQIDKEAYLAVNAAGG :: .:::: :::::: ::. ::.:::::::: : ... :.::::.:.:: . ::: CCDS42 --GLYQLLEPRMIVRCRKQDFPLVKAAVQKAIPMYKIATKNDVDVQIDQESYLPEDIAGG 100 110 120 130 140 150 190 200 210 220 pF1KE6 VEVYSGNQRIKVSNTLESRLDLSAKQKMPEIRMALFGANTNRKFFI ::.:.:...::::::::::::: :.: :::.: ::::::.::::. CCDS42 VEIYNGDRKIKVSNTLESRLDLIAQQMMPEVRGALFGANANRKFLD 160 170 180 190 226 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 11:30:56 2016 done: Tue Nov 8 11:30:57 2016 Total Scan time: 1.650 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]