FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE4234, 854 aa
  1>>>pF1KE4234 854 - 854 aa - 854 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 6.5476+/-0.00108; mu= 14.6491+/- 0.065
 mean_var=87.7153+/-17.447, 0's: 0 Z-trim(104.4): 24  B-trim: 0 in 0/50
 Lambda= 0.136942
 statistics sampled from 7879 (7885) to 7879 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.592), E-opt: 0.2 (0.242), width:  16
 Scan time:  3.870

The best scores are:                                opt   bits E(32554)
CCDS5457.1 VPS41 gene_id:27072|Hs108|chr7   ( 854) 5705 1137.8        0
CCDS5458.2 VPS41 gene_id:27072|Hs108|chr7   ( 829) 5018 1002.1        0

>>CCDS5457.1 VPS41 gene_id:27072|Hs108|chr7   (854 aa)
 initn: 5705 init1: 5705 opt: 5705  Z-score: 6089.6  bits: 1137.8 E(32554): 0
Smith-Waterman score: 5705; 100.0% identity (100.0% similar) in 854 aa overlap (1-854:1-854)

               10        20        30        40        50        60
pF1KE4 MAEAEEQETGSLEESTDESEEEESEEEPKLKYERLSNGVTEILQKDAASCMTVHDKFLAL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MAEAEEQETGSLEESTDESEEEESEEEPKLKYERLSNGVTEILQKDAASCMTVHDKFLAL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE4 GTHYGKVYLLDVQGNITQKFDVSPVKINQISLDESGEHMGVCSEDGKVQVFGLYSGEEFH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 GTHYGKVYLLDVQGNITQKFDVSPVKINQISLDESGEHMGVCSEDGKVQVFGLYSGEEFH
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE4 ETFDCPIKIIAVHPHFVRSSCKQFVTGGKKLLLFERSWMNRWKSAVLHEGEGNIRSVKWR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 ETFDCPIKIIAVHPHFVRSSCKQFVTGGKKLLLFERSWMNRWKSAVLHEGEGNIRSVKWR
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE4 GHLIAWANNMGVKIFDIISKQRITNVPRDDISLRPDMYPCSLCWKDNVTLIIGWGTSVKV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 GHLIAWANNMGVKIFDIISKQRITNVPRDDISLRPDMYPCSLCWKDNVTLIIGWGTSVKV
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE4 CSVKERHASEMRDLPSRYVEIVSQFETEFYISGLAPLCDQLVVLSYVKEISEKTEREYCA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 CSVKERHASEMRDLPSRYVEIVSQFETEFYISGLAPLCDQLVVLSYVKEISEKTEREYCA
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE4 RPRLDIIQPLSETCEEISSDALTVRGFQENECRDYHLEYSEGESLFYIVSPRDVVVAKER
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 RPRLDIIQPLSETCEEISSDALTVRGFQENECRDYHLEYSEGESLFYIVSPRDVVVAKER
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE4 DQDDHIDWLLEKKKYEEALMAAEISQKNIKRHKILDIGLAYINHLVERGDYDIAARKCQK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 DQDDHIDWLLEKKKYEEALMAAEISQKNIKRHKILDIGLAYINHLVERGDYDIAARKCQK
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KE4 ILGKNAALWEYEVYKFKEIGQLKAISPYLPRGDPVLKPLIYEMILHEFLESDYEGFATLI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 ILGKNAALWEYEVYKFKEIGQLKAISPYLPRGDPVLKPLIYEMILHEFLESDYEGFATLI
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KE4 REWPGDLYNNSVIVQAVRDHLKKDSQNKTLLKTLAELYTYDKNYGNALEIYLTLRHKDVF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 REWPGDLYNNSVIVQAVRDHLKKDSQNKTLLKTLAELYTYDKNYGNALEIYLTLRHKDVF
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KE4 QLIHKHNLFSSIKDKIVLLMDFDSEKAVDMLLDNEDKISIKKVVEELEDRPELQHVYLHK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 QLIHKHNLFSSIKDKIVLLMDFDSEKAVDMLLDNEDKISIKKVVEELEDRPELQHVYLHK
              550       560       570       580       590       600

              610       620       630       640       650       660
pF1KE4 LFKRDHHKGQRYHEKQISLYAEYDRPNLLPFLRDSTHCPLEKALEICQQRNFVEETVYLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 LFKRDHHKGQRYHEKQISLYAEYDRPNLLPFLRDSTHCPLEKALEICQQRNFVEETVYLL
              610       620       630       640       650       660

              670       680       690       700       710       720
pF1KE4 SRMGNSRSALKMIMEELHDVDKAIEFAKEQDDGELWEDLILYSIDKPPFITGLLNNIGTH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 SRMGNSRSALKMIMEELHDVDKAIEFAKEQDDGELWEDLILYSIDKPPFITGLLNNIGTH
              670       680       690       700       710       720

              730       740       750       760       770       780
pF1KE4 VDPILLIHRIKEGMEIPNLRDSLVKILQDYNLQILLREGCKKILVADSLSLLKKMHRTQM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 VDPILLIHRIKEGMEIPNLRDSLVKILQDYNLQILLREGCKKILVADSLSLLKKMHRTQM
              730       740       750       760       770       780

              790       800       810       820       830       840
pF1KE4 KGVLVDEENICESCLSPILPSDAAKPFSVVVFHCRHMFHKECLPMPSMNSAAQFCNICSA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 KGVLVDEENICESCLSPILPSDAAKPFSVVVFHCRHMFHKECLPMPSMNSAAQFCNICSA
              790       800       810       820       830       840

              850
pF1KE4 KNRGPGSAILEMKK
       ::::::::::::::
CCDS54 KNRGPGSAILEMKK
              850

>>CCDS5458.2 VPS41 gene_id:27072|Hs108|chr7   (829 aa)
 initn: 5018 init1: 5018 opt: 5018  Z-score: 5356.3  bits: 1002.1 E(32554): 0
Smith-Waterman score: 5476; 97.1% identity (97.1% similar) in 854 aa overlap (1-854:1-829)

               10        20        30        40        50        60
pF1KE4 MAEAEEQETGSLEESTDESEEEESEEEPKLKYERLSNGVTEILQKDAASCMTVHDKFLAL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MAEAEEQETGSLEESTDESEEEESEEEPKLKYERLSNGVTEILQKDAASCMTVHDKFLAL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE4 GTHYGKVYLLDVQGNITQKFDVSPVKINQISLDESGEHMGVCSEDGKVQVFGLYSGEEFH
       ::::::::::::::::::::::                         :::::::::::::
CCDS54 GTHYGKVYLLDVQGNITQKFDV-------------------------VQVFGLYSGEEFH
               70        80                                 90

              130       140       150       160       170       180
pF1KE4 ETFDCPIKIIAVHPHFVRSSCKQFVTGGKKLLLFERSWMNRWKSAVLHEGEGNIRSVKWR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 ETFDCPIKIIAVHPHFVRSSCKQFVTGGKKLLLFERSWMNRWKSAVLHEGEGNIRSVKWR
         100       110       120       130       140       150

              190       200       210       220       230       240
pF1KE4 GHLIAWANNMGVKIFDIISKQRITNVPRDDISLRPDMYPCSLCWKDNVTLIIGWGTSVKV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 GHLIAWANNMGVKIFDIISKQRITNVPRDDISLRPDMYPCSLCWKDNVTLIIGWGTSVKV
         160       170       180       190       200       210

              250       260       270       280       290       300
pF1KE4 CSVKERHASEMRDLPSRYVEIVSQFETEFYISGLAPLCDQLVVLSYVKEISEKTEREYCA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 CSVKERHASEMRDLPSRYVEIVSQFETEFYISGLAPLCDQLVVLSYVKEISEKTEREYCA
         220       230       240       250       260       270

              310       320       330       340       350       360
pF1KE4 RPRLDIIQPLSETCEEISSDALTVRGFQENECRDYHLEYSEGESLFYIVSPRDVVVAKER
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 RPRLDIIQPLSETCEEISSDALTVRGFQENECRDYHLEYSEGESLFYIVSPRDVVVAKER
         280       290       300       310       320       330

              370       380       390       400       410       420
pF1KE4 DQDDHIDWLLEKKKYEEALMAAEISQKNIKRHKILDIGLAYINHLVERGDYDIAARKCQK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 DQDDHIDWLLEKKKYEEALMAAEISQKNIKRHKILDIGLAYINHLVERGDYDIAARKCQK
         340       350       360       370       380       390

              430       440       450       460       470       480
pF1KE4 ILGKNAALWEYEVYKFKEIGQLKAISPYLPRGDPVLKPLIYEMILHEFLESDYEGFATLI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 ILGKNAALWEYEVYKFKEIGQLKAISPYLPRGDPVLKPLIYEMILHEFLESDYEGFATLI
         400       410       420       430       440       450

              490       500       510       520       530       540
pF1KE4 REWPGDLYNNSVIVQAVRDHLKKDSQNKTLLKTLAELYTYDKNYGNALEIYLTLRHKDVF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 REWPGDLYNNSVIVQAVRDHLKKDSQNKTLLKTLAELYTYDKNYGNALEIYLTLRHKDVF
         460       470       480       490       500       510

              550       560       570       580       590       600
pF1KE4 QLIHKHNLFSSIKDKIVLLMDFDSEKAVDMLLDNEDKISIKKVVEELEDRPELQHVYLHK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 QLIHKHNLFSSIKDKIVLLMDFDSEKAVDMLLDNEDKISIKKVVEELEDRPELQHVYLHK
         520       530       540       550       560       570

              610       620       630       640       650       660
pF1KE4 LFKRDHHKGQRYHEKQISLYAEYDRPNLLPFLRDSTHCPLEKALEICQQRNFVEETVYLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 LFKRDHHKGQRYHEKQISLYAEYDRPNLLPFLRDSTHCPLEKALEICQQRNFVEETVYLL
         580       590       600       610       620       630

              670       680       690       700       710       720
pF1KE4 SRMGNSRSALKMIMEELHDVDKAIEFAKEQDDGELWEDLILYSIDKPPFITGLLNNIGTH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 SRMGNSRSALKMIMEELHDVDKAIEFAKEQDDGELWEDLILYSIDKPPFITGLLNNIGTH
         640       650       660       670       680       690

              730       740       750       760       770       780
pF1KE4 VDPILLIHRIKEGMEIPNLRDSLVKILQDYNLQILLREGCKKILVADSLSLLKKMHRTQM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 VDPILLIHRIKEGMEIPNLRDSLVKILQDYNLQILLREGCKKILVADSLSLLKKMHRTQM
         700       710       720       730       740       750

              790       800       810       820       830       840
pF1KE4 KGVLVDEENICESCLSPILPSDAAKPFSVVVFHCRHMFHKECLPMPSMNSAAQFCNICSA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 KGVLVDEENICESCLSPILPSDAAKPFSVVVFHCRHMFHKECLPMPSMNSAAQFCNICSA
         760       770       780       790       800       810

              850
pF1KE4 KNRGPGSAILEMKK
       ::::::::::::::
CCDS54 KNRGPGSAILEMKK
         820


854 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Mon Nov  7 17:08:41 2016 done: Mon Nov  7 17:08:42 2016
 Total Scan time:  3.870 Total Display time:  0.050

Function used was FASTA [36.3.4 Apr, 2011]