FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6257, 247 aa
  1>>>pF1KE6257 247 - 247 aa - 247 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.5338+/-0.000919; mu= 12.6656+/- 0.055
 mean_var=63.4571+/-13.358, 0's: 0 Z-trim(104.4): 19 B-trim: 0 in 0/52
 Lambda= 0.161003
 statistics sampled from 7853 (7859) to 7853 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.599), E-opt: 0.2 (0.241), width: 16
 Scan time: 1.400

The best scores are:                                opt  bits E(32554)
CCDS41391.1 APH1A gene_id:51107|Hs108|chr1   ( 247) 1609 382.5 1.4e-106
CCDS41390.1 APH1A gene_id:51107|Hs108|chr1   ( 265) 1598 379.9   9e-106
CCDS10184.1 APH1B gene_id:83464|Hs108|chr15  ( 257) 1005 242.2 2.5e-64
CCDS58025.1 APH1A gene_id:51107|Hs108|chr1   ( 195)  837 203.1 1.1e-52
CCDS45276.1 APH1B gene_id:83464|Hs108|chr15  ( 216)  486 121.6 4.2e-28

>>CCDS41391.1 APH1A gene_id:51107|Hs108|chr1 (247 aa)
 initn: 1609 init1: 1609 opt: 1609 Z-score: 2026.7 bits: 382.5 E(32554): 1.4e-106
Smith-Waterman score: 1609; 100.0% identity (100.0% similar) in 247 aa overlap (1-247:1-247)

               10        20        30        40        50        60
pF1KE6 MGAAVFFGCTFVAFGPAFALFLITVAGDPLRVIILVAGAFFWLVSLLLASVVWFILVHVT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 MGAAVFFGCTFVAFGPAFALFLITVAGDPLRVIILVAGAFFWLVSLLLASVVWFILVHVT
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 DRSDARLQYGLLIFGAAVSVLLQEVFRFAYYKLLKKADEGLASLSEDGRSPISIRQMAYV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 DRSDARLQYGLLIFGAAVSVLLQEVFRFAYYKLLKKADEGLASLSEDGRSPISIRQMAYV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 SGLSFGIISGVFSVINILADALGPGVVGIHGDSPYYFLTSAFLTAAIILLHTFWGVVFFD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 SGLSFGIISGVFSVINILADALGPGVVGIHGDSPYYFLTSAFLTAAIILLHTFWGVVFFD
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 ACERRRYWALGLVVGSHLLTSGLTFLNPWYEASLLPIYAVTVSMGLWAFITAGGSLRSIQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 ACERRRYWALGLVVGSHLLTSGLTFLNPWYEASLLPIYAVTVSMGLWAFITAGGSLRSIQ
              190       200       210       220       230       240

pF1KE6 RSLLCKD
       :::::::
CCDS41 RSLLCKD

>>CCDS41390.1 APH1A gene_id:51107|Hs108|chr1 (265 aa)
 initn: 1598 init1: 1598 opt: 1598 Z-score: 2012.4 bits: 379.9 E(32554): 9e-106
Smith-Waterman score: 1598; 99.6% identity (100.0% similar) in 246 aa overlap (1-246:1-246)

               10        20        30        40        50        60
pF1KE6 MGAAVFFGCTFVAFGPAFALFLITVAGDPLRVIILVAGAFFWLVSLLLASVVWFILVHVT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 MGAAVFFGCTFVAFGPAFALFLITVAGDPLRVIILVAGAFFWLVSLLLASVVWFILVHVT
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 DRSDARLQYGLLIFGAAVSVLLQEVFRFAYYKLLKKADEGLASLSEDGRSPISIRQMAYV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 DRSDARLQYGLLIFGAAVSVLLQEVFRFAYYKLLKKADEGLASLSEDGRSPISIRQMAYV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 SGLSFGIISGVFSVINILADALGPGVVGIHGDSPYYFLTSAFLTAAIILLHTFWGVVFFD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 SGLSFGIISGVFSVINILADALGPGVVGIHGDSPYYFLTSAFLTAAIILLHTFWGVVFFD
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 ACERRRYWALGLVVGSHLLTSGLTFLNPWYEASLLPIYAVTVSMGLWAFITAGGSLRSIQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 ACERRRYWALGLVVGSHLLTSGLTFLNPWYEASLLPIYAVTVSMGLWAFITAGGSLRSIQ
              190       200       210       220       230       240

pF1KE6 RSLLCKD
       :::::.
CCDS41 RSLLCRRQEDSRVMVYSALRIPPED
              250       260

>>CCDS10184.1 APH1B gene_id:83464|Hs108|chr15 (257 aa)
 initn: 978 init1: 536 opt: 1005 Z-score: 1268.2 bits: 242.2 E(32554): 2.5e-64
Smith-Waterman score: 1005; 59.1% identity (83.0% similar) in 247 aa overlap (1-247:1-246)

               10        20        30        40        50        60
pF1KE6 MGAAVFFGCTFVAFGPAFALFLITVAGDPLRVIILVAGAFFWLVSLLLASVVWFILVHVT
       : :::::::.:.:::::.::...:.: .:::.:.:.:::::::::::..:.:::.   .
CCDS10 MTAAVFFGCAFIAFGPALALYVFTIATEPLRIIFLIAGAFFWLVSLLISSLVWFMARVII
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 DRSDARLQYGLLIFGAAVSVLLQEVFRFAYYKLLKKADEGLASLSEDGRSPISIRQMAYV
       : .:.  :  :::::: ::: .::.::::::::::::.::: :..  :..  :.: .:::
CCDS10 DNKDGPTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINP-GETAPSMRLLAYV
               70        80        90       100        110

              130       140       150       160       170       180
pF1KE6 SGLSFGIISGVFSVINILADALGPGVVGIHGDSPYYFLTSAFLTAAIILLHTFWGVVFFD
       :::.:::.::::: .: :.:.::::.:::::::: .:: :::.: .:::::.:::.::::
CCDS10 SGLGFGIMSGVFSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFD
     120       130       140       150       160       170

              190       200       210       220       230       240
pF1KE6 ACERRRYWALGLVVGSHLLTSGLTFLNPWYEASLLPIYAVTVSMGLWAFITAGGSLRSIQ
       .::....  : .:. .:::.:. ::.. .:  .:   . . : :: :::..:::: ::..
CCDS10 GCEKKKWGILLIVLLTHLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLK
     180       190       200       210       220       230

pF1KE6 RSLLCKD
         :::.:
CCDS10 LCLLCQDKNFLLYNQRSR
     240       250

>>CCDS58025.1 APH1A gene_id:51107|Hs108|chr1 (195 aa)
 initn: 1071 init1: 837 opt: 837 Z-score: 1059.3 bits: 203.1 E(32554): 1.1e-52
Smith-Waterman score: 933; 67.9% identity (69.5% similar) in 246 aa overlap (1-246:1-176)

               10        20        30        40        50        60
pF1KE6 MGAAVFFGCTFVAFGPAFALFLITVAGDPLRVIILVAGAFFWLVSLLLASVVWFILVHVT
       ::::::::::::::::::::::::::::::::::::::
CCDS58 MGAAVFFGCTFVAFGPAFALFLITVAGDPLRVIILVAG----------------------
               10        20        30

               70        80        90       100       110       120
pF1KE6 DRSDARLQYGLLIFGAAVSVLLQEVFRFAYYKLLKKADEGLASLSEDGRSPISIRQMAYV
        : .:                                   : . :              .
CCDS58 -RCSA-----------------------------------LPTTS------------CLI
        40                                                       50

              130       140       150       160       170       180
pF1KE6 SGLSFGIISGVFSVINILADALGPGVVGIHGDSPYYFLTSAFLTAAIILLHTFWGVVFFD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 SGLSFGIISGVFSVINILADALGPGVVGIHGDSPYYFLTSAFLTAAIILLHTFWGVVFFD
               60        70        80        90       100       110

              190       200       210       220       230       240
pF1KE6 ACERRRYWALGLVVGSHLLTSGLTFLNPWYEASLLPIYAVTVSMGLWAFITAGGSLRSIQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 ACERRRYWALGLVVGSHLLTSGLTFLNPWYEASLLPIYAVTVSMGLWAFITAGGSLRSIQ
              120       130       140       150       160       170

pF1KE6 RSLLCKD
       :::::.
CCDS58 RSLLCRRQEDSRVMVYSALRIPPED
              180       190

>>CCDS45276.1 APH1B gene_id:83464|Hs108|chr15 (216 aa)
 initn: 771 init1: 456 opt: 486 Z-score: 617.9 bits: 121.6 E(32554): 4.2e-28
Smith-Waterman score: 703; 47.0% identity (68.0% similar) in 247 aa overlap (1-247:1-205)

               10        20        30        40        50        60
pF1KE6 MGAAVFFGCTFVAFGPAFALFLITVAGDPLRVIILVAGAFFWLVSLLLASVVWFILVHVT
       : :::::::.:.:::::.::...:.: .:::.:.:.:::::::::::..:.:::.   .
CCDS45 MTAAVFFGCAFIAFGPALALYVFTIATEPLRIIFLIAGAFFWLVSLLISSLVWFMARVII
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 DRSDARLQYGLLIFGAAVSVLLQEVFRFAYYKLLKKADEGLASLSEDGRSPISIRQMAYV
       : .:.  :  :::::: ::: .::.::::::::::::.::: :..  :..  :.: .::
CCDS45 DNKDGPTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINP-GETAPSMRLLAY-
               70        80        90       100        110

              130       140       150       160       170       180
pF1KE6 SGLSFGIISGVFSVINILADALGPGVVGIHGDSPYYFLTSAFLTAAIILLHTFWGVVFFD
                                               ::.: .:::::.:::.::::
CCDS45 ----------------------------------------AFMTLVIILLHVFWGIVFFD
                                              120       130

              190       200       210       220       230       240
pF1KE6 ACERRRYWALGLVVGSHLLTSGLTFLNPWYEASLLPIYAVTVSMGLWAFITAGGSLRSIQ
       .::....  : .:. .:::.:. ::.. .:  .:   . . : :: :::..:::: ::..
CCDS45 GCEKKKWGILLIVLLTHLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLK
      140       150       160       170       180       190

pF1KE6 RSLLCKD
         :::.:
CCDS45 LCLLCQDKNFLLYNQRSR
      200       210

247 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 11:32:33 2016 done: Tue Nov 8 11:32:33 2016
 Total Scan time: 1.400 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]