FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE3184, 422 aa
  1>>>pF1KE3184 422 - 422 aa - 422 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.7253+/-0.000833; mu= 15.6588+/- 0.050
 mean_var=77.2194+/-15.029, 0's: 0 Z-trim(107.3): 26 B-trim: 0 in 0/53
 Lambda= 0.145952
 statistics sampled from 9474 (9497) to 9474 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.669), E-opt: 0.2 (0.292), width: 16
 Scan time: 3.090

The best scores are:                                opt  bits E(32554)
CCDS5865.1 AGK gene_id:55750|Hs108|chr7     ( 422) 2821 603.5 1.3e-172
CCDS45785.1 SPHK1 gene_id:8877|Hs108|chr17  ( 384)  278  68.0 1.8e-11
CCDS59297.1 SPHK1 gene_id:8877|Hs108|chr17  ( 398)  278  68.0 1.9e-11
CCDS11744.1 SPHK1 gene_id:8877|Hs108|chr17  ( 470)  278  68.0 2.1e-11

>>CCDS5865.1 AGK gene_id:55750|Hs108|chr7 (422 aa)
 initn: 2821 init1: 2821 opt: 2821 Z-score: 3212.7 bits: 603.5 E(32554): 1.3e-172
Smith-Waterman score: 2821; 100.0% identity (100.0% similar) in 422 aa overlap (1-422:1-422)

               10        20        30        40        50        60
pF1KE3 MTVFFKTLRNHWKKTTAGLCLLTWGGHWLYGKHCDNLLRRAACQEAQVFGNQLIPPNAQV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 MTVFFKTLRNHWKKTTAGLCLLTWGGHWLYGKHCDNLLRRAACQEAQVFGNQLIPPNAQV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE3 KKATVFLNPAACKGKARTLFEKNAAPILHLSGMDVTIVKTDYEGQAKKLLELMENTDVII
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 KKATVFLNPAACKGKARTLFEKNAAPILHLSGMDVTIVKTDYEGQAKKLLELMENTDVII
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE3 VAGGDGTLQEVVTGVLRRTDEATFSKIPIGFIPLGETSSLSHTLFAESGNKVQHITDATL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 VAGGDGTLQEVVTGVLRRTDEATFSKIPIGFIPLGETSSLSHTLFAESGNKVQHITDATL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE3 AIVKGETVPLDVLQIKGEKEQPVFAMTGLRWGSFRDAGVKVSKYWYLGPLKIKAAHFFST
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 AIVKGETVPLDVLQIKGEKEQPVFAMTGLRWGSFRDAGVKVSKYWYLGPLKIKAAHFFST
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE3 LKEWPQTHQASISYTGPTERPPNEPEETPVQRPSLYRRILRRLASYWAQPQDALSQEVSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 LKEWPQTHQASISYTGPTERPPNEPEETPVQRPSLYRRILRRLASYWAQPQDALSQEVSP
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE3 EVWKDVQLSTIELSITTRNNQLDPTSKEDFLNICIEPDTISKGDFITIGSRKVRNPKLHV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 EVWKDVQLSTIELSITTRNNQLDPTSKEDFLNICIEPDTISKGDFITIGSRKVRNPKLHV
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE3 EGTECLQASQCTLLIPEGAGGSFSIDSEEYEAMPVEVKLLPRKLQFFCDPRKREQMLTSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 EGTECLQASQCTLLIPEGAGGSFSIDSEEYEAMPVEVKLLPRKLQFFCDPRKREQMLTSP
              370       380       390       400       410       420

pF1KE3 TQ
       ::
CCDS58 TQ

>>CCDS45785.1 SPHK1 gene_id:8877|Hs108|chr17 (384 aa)
 initn: 184 init1: 70 opt: 278 Z-score: 319.4 bits: 68.0 E(32554): 1.8e-11
Smith-Waterman score: 278; 26.5% identity (62.3% similar) in 215 aa overlap (62-270:16-227)

40 50 60 70 80 90
pF1KE3 KHCDNLLRRAACQEAQVFGNQLIPPNAQVKKATVFLNPAACKGKARTLFEKNAAPILHLS
.. :.::: . :::: ::.... :.: .
CCDS45 MDPAGGPRGVLPRPCRVLVLLNPRGGKGKALQLFRSHVQPLLAEA
10 20 30 40

100 110 120 130 140
pF1KE3 GMDVTIVKTDYEGQAKKLL--ELMENTDVIIVAGGDGTLQEVVTGVLRRTDEATFSKIPI
.. :.. :. ...:..:. : . :...: .::: ..:::.:...: : : . :.
CCDS45 EISFTLMLTERRNHARELVRSEELGRWDALVVMSGDGLMHEVVNGLMERPDWETAIQKPL
50 60 70 80 90 100

150 160 170 180 190 200
pF1KE3 GFIPLGETSSLSHTLFAESG-NKVQH---ITDATLAIVKGETVPLDVLQIKGEKEQPVFA
.: : ..:. .: .: ..: . .:. :: . . :...:... . .:.
CCDS45 CSLPAGSGNALAASLNHYAGYEQVTNEDLLTNCTLLLCRRLLSPMNLLSLHTASGLRLFS
110 120 130 140 150 160

210 220 230 240 250 260
pF1KE3 MTGLRWGSFRDAGVKVSKYWYLGPLKIKAAHFFSTLKEWPQTHQASISYTGPTERPPNEP
. .: :: . :. .. :: :: ... . :. .:... ..: :. : ..
CCDS45 VLSLAWGFIADVDLESEKYRRLGEMRFTLGTFLRLAAL--RTYRGRLAYL-PVGRVGSKT
170 180 190 200 210 220

270 280 290 300 310 320
pF1KE3 EETPVQRPSLYRRILRRLASYWAQPQDALSQEVSPEVWKDVQLSTIELSITTRNNQLDPT
.::
CCDS45 PASPVVVQQGPVDAHLVPLEEPVPSHWTVVPDEDFVLVLALLHSHLGSEMFAAPMGRCAA
230 240 250 260 270 280

>>CCDS59297.1 SPHK1 gene_id:8877|Hs108|chr17 (398 aa)
 initn: 184 init1: 70 opt: 278 Z-score: 319.1 bits: 68.0 E(32554): 1.9e-11
Smith-Waterman score: 278; 26.5% identity (62.3% similar) in 215 aa overlap (62-270:30-241)

40 50 60 70 80 90
pF1KE3 KHCDNLLRRAACQEAQVFGNQLIPPNAQVKKATVFLNPAACKGKARTLFEKNAAPILHLS
.. :.::: . :::: ::.... :.: .
CCDS59 MDPVVGCGRGLFGFVFSAGGPRGVLPRPCRVLVLLNPRGGKGKALQLFRSHVQPLLAEA
10 20 30 40 50

100 110 120 130 140
pF1KE3 GMDVTIVKTDYEGQAKKLL--ELMENTDVIIVAGGDGTLQEVVTGVLRRTDEATFSKIPI
.. :.. :. ...:..:. : . :...: .::: ..:::.:...: : : . :.
CCDS59 EISFTLMLTERRNHARELVRSEELGRWDALVVMSGDGLMHEVVNGLMERPDWETAIQKPL
60 70 80 90 100 110

150 160 170 180 190 200
pF1KE3 GFIPLGETSSLSHTLFAESG-NKVQH---ITDATLAIVKGETVPLDVLQIKGEKEQPVFA
.: : ..:. .: .: ..: . .:. :: . . :...:... . .:.
CCDS59 CSLPAGSGNALAASLNHYAGYEQVTNEDLLTNCTLLLCRRLLSPMNLLSLHTASGLRLFS
120 130 140 150 160 170

210 220 230 240 250 260
pF1KE3 MTGLRWGSFRDAGVKVSKYWYLGPLKIKAAHFFSTLKEWPQTHQASISYTGPTERPPNEP
. .: :: . :. .. :: :: ... . :. .:... ..: :. : ..
CCDS59 VLSLAWGFIADVDLESEKYRRLGEMRFTLGTFLRLAAL--RTYRGRLAYL-PVGRVGSKT
180 190 200 210 220 230

270 280 290 300 310 320
pF1KE3 EETPVQRPSLYRRILRRLASYWAQPQDALSQEVSPEVWKDVQLSTIELSITTRNNQLDPT
.::
CCDS59 PASPVVVQQGPVDAHLVPLEEPVPSHWTVVPDEDFVLVLALLHSHLGSEMFAAPMGRCAA
240 250 260 270 280 290

>>CCDS11744.1 SPHK1 gene_id:8877|Hs108|chr17 (470 aa)
 initn: 184 init1: 70 opt: 278 Z-score: 318.1 bits: 68.0 E(32554): 2.1e-11
Smith-Waterman score: 278; 26.5% identity (62.3% similar) in 215 aa overlap (62-270:102-313)

40 50 60 70 80 90
pF1KE3 KHCDNLLRRAACQEAQVFGNQLIPPNAQVKKATVFLNPAACKGKARTLFEKNAAPILHLS
.. :.::: . :::: ::.... :.: .
CCDS11 TAPGTPWQREPRVEVMDPAGGPRGVLPRPCRVLVLLNPRGGKGKALQLFRSHVQPLLAEA
80 90 100 110 120 130

100 110 120 130 140
pF1KE3 GMDVTIVKTDYEGQAKKLL--ELMENTDVIIVAGGDGTLQEVVTGVLRRTDEATFSKIPI
.. :.. :. ...:..:. : . :...: .::: ..:::.:...: : : . :.
CCDS11 EISFTLMLTERRNHARELVRSEELGRWDALVVMSGDGLMHEVVNGLMERPDWETAIQKPL
140 150 160 170 180 190

150 160 170 180 190 200
pF1KE3 GFIPLGETSSLSHTLFAESG-NKVQH---ITDATLAIVKGETVPLDVLQIKGEKEQPVFA
.: : ..:. .: .: ..: . .:. :: . . :...:... . .:.
CCDS11 CSLPAGSGNALAASLNHYAGYEQVTNEDLLTNCTLLLCRRLLSPMNLLSLHTASGLRLFS
200 210 220 230 240 250

210 220 230 240 250 260
pF1KE3 MTGLRWGSFRDAGVKVSKYWYLGPLKIKAAHFFSTLKEWPQTHQASISYTGPTERPPNEP
. .: :: . :. .. :: :: ... . :. .:... ..: :. : ..
CCDS11 VLSLAWGFIADVDLESEKYRRLGEMRFTLGTFLRLAAL--RTYRGRLAYL-PVGRVGSKT
260 270 280 290 300

270 280 290 300 310 320
pF1KE3 EETPVQRPSLYRRILRRLASYWAQPQDALSQEVSPEVWKDVQLSTIELSITTRNNQLDPT
.::
CCDS11 PASPVVVQQGPVDAHLVPLEEPVPSHWTVVPDEDFVLVLALLHSHLGSEMFAAPMGRCAA
310 320 330 340 350 360

422 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 01:37:40 2016 done: Sun Nov 6 01:37:40 2016
 Total Scan time: 3.090 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]