FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5006, 321 aa 1>>>pF1KE5006 321 - 321 aa - 321 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.7071+/-0.00108; mu= 13.0977+/- 0.065 mean_var=61.3127+/-11.995, 0's: 0 Z-trim(101.8): 24 B-trim: 0 in 0/49 Lambda= 0.163794 statistics sampled from 6642 (6652) to 6642 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.576), E-opt: 0.2 (0.204), width: 16 Scan time: 1.600 The best scores are: opt bits E(32554) CCDS701.1 MCOLN3 gene_id:55283|Hs108|chr1 ( 553) 2115 508.7 4.1e-144 CCDS58009.1 MCOLN3 gene_id:55283|Hs108|chr1 ( 497) 1239 301.7 7.7e-82 CCDS81347.1 MCOLN2 gene_id:255231|Hs108|chr1 ( 538) 999 245.0 9.8e-65 CCDS30762.1 MCOLN2 gene_id:255231|Hs108|chr1 ( 566) 999 245.0 1e-64 CCDS12180.1 MCOLN1 gene_id:57192|Hs108|chr19 ( 580) 637 159.5 5.9e-39 >>CCDS701.1 MCOLN3 gene_id:55283|Hs108|chr1 (553 aa) initn: 2115 init1: 2115 opt: 2115 Z-score: 2700.7 bits: 508.7 E(32554): 4.1e-144 Smith-Waterman score: 2115; 99.7% identity (100.0% similar) in 315 aa overlap (1-315:1-315) 10 20 30 40 50 60 pF1KE5 MADPEVVVSSCSSHEEENRCNFNQQTSPSEELLLEDQMRRKLKFFFMNPCEKFWARGRKP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS70 MADPEVVVSSCSSHEEENRCNFNQQTSPSEELLLEDQMRRKLKFFFMNPCEKFWARGRKP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 WKLAIQILKIAMVTIQLVLFGLSNQMVVAFKEENTIAFKHLFLKGYMDRMDDTYAVYTQS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS70 WKLAIQILKIAMVTIQLVLFGLSNQMVVAFKEENTIAFKHLFLKGYMDRMDDTYAVYTQS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 DVYDQLIFAVNQYLQLYNVSVGNHAYENKGTKQSAMAICQHLYKRGNIYPGNDTFDIDPE :::::::::::::::::::::::::::::::::::::::::.:::::::::::::::::: CCDS70 DVYDQLIFAVNQYLQLYNVSVGNHAYENKGTKQSAMAICQHFYKRGNIYPGNDTFDIDPE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 IETECFFVEPDEPFHIGTPAENKLNLTLDFHRLLTVELQFKLKAINLQTVRHQELPDCYD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS70 IETECFFVEPDEPFHIGTPAENKLNLTLDFHRLLTVELQFKLKAINLQTVRHQELPDCYD 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 FTLTITFDNKAHSGRIKISLDNDISIRECKDWHVSGSIQKNTHYMMIFDAFVILTCLVSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS70 FTLTITFDNKAHSGRIKISLDNDISIRECKDWHVSGSIQKNTHYMMIFDAFVILTCLVSL 250 260 270 280 290 300 310 320 pF1KE5 ILCIRSVIRGLQLQQVGNVAF ::::::::::::::: CCDS70 ILCIRSVIRGLQLQQEFVNFFLLHYKKEVSVSDQMEFVNGWYIMIIISDILTIIGSILKM 310 320 330 340 350 360 >>CCDS58009.1 MCOLN3 gene_id:55283|Hs108|chr1 (497 aa) initn: 1239 init1: 1239 opt: 1239 Z-score: 1582.7 bits: 301.7 E(32554): 7.7e-82 Smith-Waterman score: 1633; 81.9% identity (82.2% similar) in 315 aa overlap (1-315:1-259) 10 20 30 40 50 60 pF1KE5 MADPEVVVSSCSSHEEENRCNFNQQTSPSEELLLEDQMRRKLKFFFMNPCEKFWARGRKP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 MADPEVVVSSCSSHEEENRCNFNQQTSPSEELLLEDQMRRKLKFFFMNPCEKFWARGRKP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 WKLAIQILKIAMVTIQLVLFGLSNQMVVAFKEENTIAFKHLFLKGYMDRMDDTYAVYTQS :::::::::::::::: CCDS58 WKLAIQILKIAMVTIQ-------------------------------------------- 70 130 140 150 160 170 180 pF1KE5 DVYDQLIFAVNQYLQLYNVSVGNHAYENKGTKQSAMAICQHLYKRGNIYPGNDTFDIDPE :::::::::::::::::::::::::::::.:::::::::::::::::: CCDS58 ------------YLQLYNVSVGNHAYENKGTKQSAMAICQHFYKRGNIYPGNDTFDIDPE 80 90 100 110 120 190 200 210 220 230 240 pF1KE5 IETECFFVEPDEPFHIGTPAENKLNLTLDFHRLLTVELQFKLKAINLQTVRHQELPDCYD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 IETECFFVEPDEPFHIGTPAENKLNLTLDFHRLLTVELQFKLKAINLQTVRHQELPDCYD 130 140 150 160 170 180 250 260 270 280 290 300 pF1KE5 FTLTITFDNKAHSGRIKISLDNDISIRECKDWHVSGSIQKNTHYMMIFDAFVILTCLVSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 FTLTITFDNKAHSGRIKISLDNDISIRECKDWHVSGSIQKNTHYMMIFDAFVILTCLVSL 190 200 210 220 230 240 310 320 pF1KE5 ILCIRSVIRGLQLQQVGNVAF ::::::::::::::: CCDS58 ILCIRSVIRGLQLQQEFVNFFLLHYKKEVSVSDQMEFVNGWYIMIIISDILTIIGSILKM 250 260 270 280 290 300 >>CCDS81347.1 MCOLN2 gene_id:255231|Hs108|chr1 (538 aa) initn: 1032 init1: 646 opt: 999 Z-score: 1275.6 bits: 245.0 E(32554): 9.8e-65 Smith-Waterman score: 999; 51.2% identity (81.3% similar) in 283 aa overlap (35-315:10-292) 10 20 30 40 50 60 pF1KE5 EVVVSSCSSHEEENRCNFNQQTSPSEELLLEDQMRRKLKFFFMNPCEKFWARGRKPWKLA :. .:. :::.::.::::. :: . ::::. CCDS81 MAHRDSEMKEECLREDLKFYFMSPCEKYRARRQIPWKLG 10 20 30 70 80 90 100 110 120 pF1KE5 IQILKIAMVTIQLVLFGLSNQMVVAFKEENTIAFKHLFLKGYMDRMDDTYA--VYTQSDV .:::::.::: ::: ::::::.::::::.::.:::::::::: .: :. :::: :. CCDS81 LQILKIVMVTTQLVRFGLSNQLVVAFKEDNTVAFKHLFLKGYSGTDEDDYSCSVYTQEDA 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE5 YDQLIFAVNQYLQLYNVSVGNHAYENKGTKQSAMAICQHLYKRGNIYPGNDTFDIDPEIE :....::.::: :: ....:. .: .. .. .. .:.. ::.:...:.:.:..:: ..: CCDS81 YESIFFAINQYHQLKDITLGTLGYGENEDNRIGLKVCKQHYKKGTMFPSNETLNIDNDVE 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE5 TECFFVEPDEPFHIGTPAENKLNLTLDFHRLLTVELQFKLKAINLQTVRHQELPDCYDFT .: .. .. . .:. . :.:.::: ::..:.::.:.:::.. .:::::: : CCDS81 LDCVQLDLQDLSKKPPDWKNSSFFRLEFYRLLQVEISFHLKGIDLQTIHSRELPDCYVFQ 160 170 180 190 200 210 250 260 270 280 290 300 pF1KE5 LTITFDNKAHSGRIKISLDNDISIRECKDWHVSGSIQKNTHYMMIFDAFVILTCLVSLIL :: ::::::::.::: .:.: .:.:::: .. :: :::..:...::::::. ::.:::: CCDS81 NTIIFDNKAHSGKIKIYFDSDAKIEECKDLNIFGSTQKNAQYVLVFDAFVIVICLASLIL 220 230 240 250 260 270 310 320 pF1KE5 CIRSVIRGLQLQQVGNVAF : ::.. .:.:.. CCDS81 CTRSIVLALRLRKRFLNFFLEKYKRPVCDTDQWEFINGWYVLVIISDLMTIIGSILKMEI 280 290 300 310 320 330 >>CCDS30762.1 MCOLN2 gene_id:255231|Hs108|chr1 (566 aa) initn: 1032 init1: 646 opt: 999 Z-score: 1275.3 bits: 245.0 E(32554): 1e-64 Smith-Waterman score: 999; 51.2% identity (81.3% similar) in 283 aa overlap (35-315:38-320) 10 20 30 40 50 60 pF1KE5 EVVVSSCSSHEEENRCNFNQQTSPSEELLLEDQMRRKLKFFFMNPCEKFWARGRKPWKLA :. .:. :::.::.::::. :: . ::::. CCDS30 FPQARIPERGSGVFRLTVRNAMAHRDSEMKEECLREDLKFYFMSPCEKYRARRQIPWKLG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 IQILKIAMVTIQLVLFGLSNQMVVAFKEENTIAFKHLFLKGYMDRMDDTYA--VYTQSDV .:::::.::: ::: ::::::.::::::.::.:::::::::: .: :. :::: :. CCDS30 LQILKIVMVTTQLVRFGLSNQLVVAFKEDNTVAFKHLFLKGYSGTDEDDYSCSVYTQEDA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 YDQLIFAVNQYLQLYNVSVGNHAYENKGTKQSAMAICQHLYKRGNIYPGNDTFDIDPEIE :....::.::: :: ....:. .: .. .. .. .:.. ::.:...:.:.:..:: ..: CCDS30 YESIFFAINQYHQLKDITLGTLGYGENEDNRIGLKVCKQHYKKGTMFPSNETLNIDNDVE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 TECFFVEPDEPFHIGTPAENKLNLTLDFHRLLTVELQFKLKAINLQTVRHQELPDCYDFT .: .. .. . .:. . :.:.::: ::..:.::.:.:::.. .:::::: : CCDS30 LDCVQLDLQDLSKKPPDWKNSSFFRLEFYRLLQVEISFHLKGIDLQTIHSRELPDCYVFQ 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 LTITFDNKAHSGRIKISLDNDISIRECKDWHVSGSIQKNTHYMMIFDAFVILTCLVSLIL :: ::::::::.::: .:.: .:.:::: .. :: :::..:...::::::. ::.:::: CCDS30 NTIIFDNKAHSGKIKIYFDSDAKIEECKDLNIFGSTQKNAQYVLVFDAFVIVICLASLIL 250 260 270 280 290 300 310 320 pF1KE5 CIRSVIRGLQLQQVGNVAF : ::.. .:.:.. CCDS30 CTRSIVLALRLRKRFLNFFLEKYKRPVCDTDQWEFINGWYVLVIISDLMTIIGSILKMEI 310 320 330 340 350 360 >>CCDS12180.1 MCOLN1 gene_id:57192|Hs108|chr19 (580 aa) initn: 1002 init1: 475 opt: 637 Z-score: 812.8 bits: 159.5 E(32554): 5.9e-39 Smith-Waterman score: 980; 51.0% identity (76.2% similar) in 302 aa overlap (27-314:34-327) 10 20 30 40 50 pF1KE5 MADPEVVVSSCSSHEEENRCNFNQQTSPSEELLLEDQMRRKLKFFFMNPCEKFWAR .: :: :: .::.::.:::.::.:: :. CCDS12 PAGPRGSETERLLTPNPGYGTQAGPSPAPPTPPEE---ED-LRRRLKYFFMSPCDKFRAK 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 GRKPWKLAIQILKIAMVTIQLVLFGLSNQMVVAFKEENTIAFKHLFLKGYMDRMDDTYAV :::: :: .:..:: .::.::.:::::::..:.:.:::::::.:::: :: : :::.:. CCDS12 GRKPCKLMLQVVKILVVTVQLILFGLSNQLAVTFREENTIAFRHLFLLGYSDGADDTFAA 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 YTQSDVYDQLIFAVNQYLQLYNVSVGNHAYENKG----TKQSAMAICQHLYKRGNIYPGN ::. ..:. .. ::.::: : .::.: .:: : :. :..:.::. :.::.. :.: CCDS12 YTREQLYQAIFHAVDQYLALPDVSLGRYAYVRGGGDPWTNGSGLALCQRYYHRGHVDPAN 120 130 140 150 160 170 180 190 200 210 220 pF1KE5 DTFDIDPEIETECFFVEPDEPFHIGTPAENKL----------NLTLDFHRLLTVELQFKL ::::::: . :.:. :.: : . : . : :::: ::.:..: ..:.: CCDS12 DTFDIDPMVVTDCIQVDP--PERPPPPPSDDLTLLESSSSYKNLTLKFHKLVNVTIHFRL 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE5 KAINLQTVRHQELPDCYDFTLTITFDNKAHSGRIKISLDNDISIRECKDWHVSGSIQKNT :.::::.. ..:.:::: :.. :::::::::::: :::... :.::: : : . .. CCDS12 KTINLQSLINNEIPDCYTFSVLITFDNKAHSGRIPISLETQAHIQECK--HPSVFQHGDN 240 250 260 270 280 290 290 300 310 320 pF1KE5 HYMMIFDAFVILTCLVSLILCIRSVIRGLQLQQVGNVAF . ..::. ::::: .:..:: ::..::. :: CCDS12 SFRLLFDVVVILTCSLSFLLCARSLLRGFLLQNEFVGFMWRQRGRVISLWERLEFVNGWY 300 310 320 330 340 350 CCDS12 ILLVTSDVLTISGTIMKIGIEAKNLASYDVCSILLGTSTLLVWVGVIRYLTFFHNYNILI 360 370 380 390 400 410 321 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 04:05:30 2016 done: Tue Nov 8 04:05:31 2016 Total Scan time: 1.600 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]