FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6430, 614 aa 1>>>pF1KE6430 614 - 614 aa - 614 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.2466+/-0.000976; mu= 19.4695+/- 0.059 mean_var=65.9716+/-13.140, 0's: 0 Z-trim(103.9): 25 B-trim: 232 in 1/51 Lambda= 0.157905 statistics sampled from 7612 (7619) to 7612 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.604), E-opt: 0.2 (0.234), width: 16 Scan time: 2.730 The best scores are: opt bits E(32554) CCDS7385.1 PAPSS2 gene_id:9060|Hs108|chr10 ( 614) 4171 959.5 0 CCDS44453.1 PAPSS2 gene_id:9060|Hs108|chr10 ( 619) 4151 955.0 0 CCDS3676.1 PAPSS1 gene_id:9061|Hs108|chr4 ( 624) 3381 779.6 0 >>CCDS7385.1 PAPSS2 gene_id:9060|Hs108|chr10 (614 aa) initn: 4171 init1: 4171 opt: 4171 Z-score: 5131.1 bits: 959.5 E(32554): 0 Smith-Waterman score: 4171; 100.0% identity (100.0% similar) in 614 aa overlap (1-614:1-614) 10 20 30 40 50 60 pF1KE6 MSGIKKQKTENQQKSTNVVYQAHHVSRNKRGQVVGTRGGFRGCTVWLTGLSGAGKTTISF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 MSGIKKQKTENQQKSTNVVYQAHHVSRNKRGQVVGTRGGFRGCTVWLTGLSGAGKTTISF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 ALEEYLVSHAIPCYSLDGDNVRHGLNRNLGFSPGDREENIRRIAEVAKLFADAGLVCITS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 ALEEYLVSHAIPCYSLDGDNVRHGLNRNLGFSPGDREENIRRIAEVAKLFADAGLVCITS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 FISPFAKDRENARKIHESAGLPFFEIFVDAPLNICESRDVKGLYKRARAGEIKGFTGIDS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 FISPFAKDRENARKIHESAGLPFFEIFVDAPLNICESRDVKGLYKRARAGEIKGFTGIDS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 DYEKPETPERVLKTNLSTVSDCVHQVVELLQEQNIVPYTIIKDIHELFVPENKLDHVRAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 DYEKPETPERVLKTNLSTVSDCVHQVVELLQEQNIVPYTIIKDIHELFVPENKLDHVRAE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 AETLPSLSITKLDLQWVQVLSEGWATPLKGFMREKEYLQVMHFDTLLDDGVINMSIPIVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 AETLPSLSITKLDLQWVQVLSEGWATPLKGFMREKEYLQVMHFDTLLDDGVINMSIPIVL 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE6 PVSAEDKTRLEGCSKFVLAHGGRRVAILRDAEFYEHRKEERCSRVWGTTCTKHPHIKMVM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 PVSAEDKTRLEGCSKFVLAHGGRRVAILRDAEFYEHRKEERCSRVWGTTCTKHPHIKMVM 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE6 ESGDWLVGGDLQVLEKIRWNDGLDQYRLTPLELKQKCKEMNADAVFAFQLRNPVHNGHAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 ESGDWLVGGDLQVLEKIRWNDGLDQYRLTPLELKQKCKEMNADAVFAFQLRNPVHNGHAL 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE6 LMQDTRRRLLERGYKHPVLLLHPLGGWTKDDDVPLDWRMKQHAAVLEEGVLDPKSTIVAI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 LMQDTRRRLLERGYKHPVLLLHPLGGWTKDDDVPLDWRMKQHAAVLEEGVLDPKSTIVAI 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE6 FPSPMLYAGPTEVQWHCRSRMIAGANFYIVGRDPAGMPHPETKKDLYEPTHGGKVLSMAP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 FPSPMLYAGPTEVQWHCRSRMIAGANFYIVGRDPAGMPHPETKKDLYEPTHGGKVLSMAP 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE6 GLTSVEIIPFRVAAYNKAKKAMDFYDPARHNEFDFISGTRMRKLAREGENPPDGFMAPKA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 GLTSVEIIPFRVAAYNKAKKAMDFYDPARHNEFDFISGTRMRKLAREGENPPDGFMAPKA 550 560 570 580 590 600 610 pF1KE6 WKVLTDYYRSLEKN :::::::::::::: CCDS73 WKVLTDYYRSLEKN 610 >>CCDS44453.1 PAPSS2 gene_id:9060|Hs108|chr10 (619 aa) initn: 2269 init1: 2269 opt: 4151 Z-score: 5106.5 bits: 955.0 E(32554): 0 Smith-Waterman score: 4151; 99.2% identity (99.2% similar) in 619 aa overlap (1-614:1-619) 10 20 30 40 50 60 pF1KE6 MSGIKKQKTENQQKSTNVVYQAHHVSRNKRGQVVGTRGGFRGCTVWLTGLSGAGKTTISF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 MSGIKKQKTENQQKSTNVVYQAHHVSRNKRGQVVGTRGGFRGCTVWLTGLSGAGKTTISF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 ALEEYLVSHAIPCYSLDGDNVRHGLNRNLGFSPGDREENIRRIAEVAKLFADAGLVCITS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 ALEEYLVSHAIPCYSLDGDNVRHGLNRNLGFSPGDREENIRRIAEVAKLFADAGLVCITS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 FISPFAKDRENARKIHESAGLPFFEIFVDAPLNICESRDVKGLYKRARAGEIKGFTGIDS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 FISPFAKDRENARKIHESAGLPFFEIFVDAPLNICESRDVKGLYKRARAGEIKGFTGIDS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 DYEKPETPERVLKTNLSTVSDCVHQVVELLQEQNIVPYTIIKDIHELFVPENKLDHVRAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 DYEKPETPERVLKTNLSTVSDCVHQVVELLQEQNIVPYTIIKDIHELFVPENKLDHVRAE 190 200 210 220 230 240 250 260 270 280 290 pF1KE6 AETLPSLSITKLDLQWVQVLSEGWATPLKGFMREKEYLQVMHFDTLLD-----DGVINMS :::::::::::::::::::::::::::::::::::::::::::::::: ::::::: CCDS44 AETLPSLSITKLDLQWVQVLSEGWATPLKGFMREKEYLQVMHFDTLLDGMALPDGVINMS 250 260 270 280 290 300 300 310 320 330 340 350 pF1KE6 IPIVLPVSAEDKTRLEGCSKFVLAHGGRRVAILRDAEFYEHRKEERCSRVWGTTCTKHPH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 IPIVLPVSAEDKTRLEGCSKFVLAHGGRRVAILRDAEFYEHRKEERCSRVWGTTCTKHPH 310 320 330 340 350 360 360 370 380 390 400 410 pF1KE6 IKMVMESGDWLVGGDLQVLEKIRWNDGLDQYRLTPLELKQKCKEMNADAVFAFQLRNPVH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 IKMVMESGDWLVGGDLQVLEKIRWNDGLDQYRLTPLELKQKCKEMNADAVFAFQLRNPVH 370 380 390 400 410 420 420 430 440 450 460 470 pF1KE6 NGHALLMQDTRRRLLERGYKHPVLLLHPLGGWTKDDDVPLDWRMKQHAAVLEEGVLDPKS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 NGHALLMQDTRRRLLERGYKHPVLLLHPLGGWTKDDDVPLDWRMKQHAAVLEEGVLDPKS 430 440 450 460 470 480 480 490 500 510 520 530 pF1KE6 TIVAIFPSPMLYAGPTEVQWHCRSRMIAGANFYIVGRDPAGMPHPETKKDLYEPTHGGKV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 TIVAIFPSPMLYAGPTEVQWHCRSRMIAGANFYIVGRDPAGMPHPETKKDLYEPTHGGKV 490 500 510 520 530 540 540 550 560 570 580 590 pF1KE6 LSMAPGLTSVEIIPFRVAAYNKAKKAMDFYDPARHNEFDFISGTRMRKLAREGENPPDGF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 LSMAPGLTSVEIIPFRVAAYNKAKKAMDFYDPARHNEFDFISGTRMRKLAREGENPPDGF 550 560 570 580 590 600 600 610 pF1KE6 MAPKAWKVLTDYYRSLEKN ::::::::::::::::::: CCDS44 MAPKAWKVLTDYYRSLEKN 610 >>CCDS3676.1 PAPSS1 gene_id:9061|Hs108|chr4 (624 aa) initn: 3381 init1: 3381 opt: 3381 Z-score: 4158.4 bits: 779.6 E(32554): 0 Smith-Waterman score: 3381; 78.4% identity (94.0% similar) in 601 aa overlap (13-613:23-623) 10 20 30 40 50 pF1KE6 MSGIKKQKTENQQKSTNVVYQAHHVSRNKRGQVVGTRGGFRGCTVWLTGL :..:::.::::::::::::::::::::::::::::::: CCDS36 MEIPGSLCKKVKLSNNAQNWGMQRATNVTYQAHHVSRNKRGQVVGTRGGFRGCTVWLTGL 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 SGAGKTTISFALEEYLVSHAIPCYSLDGDNVRHGLNRNLGFSPGDREENIRRIAEVAKLF :::::::.:.::::::: :.::::.:::::.:.:::.:::::: :::::.:::::::::: CCDS36 SGAGKTTVSMALEEYLVCHGIPCYTLDGDNIRQGLNKNLGFSPEDREENVRRIAEVAKLF 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE6 ADAGLVCITSFISPFAKDRENARKIHESAGLPFFEIFVDAPLNICESRDVKGLYKRARAG ::::::::::::::...::.:::.:::.:.:::::.::::::..::.::::::::.:::: CCDS36 ADAGLVCITSFISPYTQDRNNARQIHEGASLPFFEVFVDAPLHVCEQRDVKGLYKKARAG 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE6 EIKGFTGIDSDYEKPETPERVLKTNLSTVSDCVHQVVELLQEQNIVPYTIIKDIHELFVP ::::::::::.:::::.:: ::::. :.:::.::::::::..::: ...::.:: CCDS36 EIKGFTGIDSEYEKPEAPELVLKTDSCDVNDCVQQVVELLQERDIVPVDASYEVKELYVP 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE6 ENKLDHVRAEAETLPSLSITKLDLQWVQVLSEGWATPLKGFMREKEYLQVMHFDTLLDDG :::: ....:::::.:.:.:.:.::::::.:::::::.:::::.:::: .::: ::: : CCDS36 ENKLHLAKTDAETLPALKINKVDMQWVQVLAEGWATPLNGFMREREYLQCLHFDCLLDGG 250 260 270 280 290 300 300 310 320 330 340 350 pF1KE6 VINMSIPIVLPVSAEDKTRLEGCSKFVLAHGGRRVAILRDAEFYEHRKEERCSRVWGTTC :::.:.:::: .. ::: ::.::. :.: . ::::::::. ::.::::::::.: ::::: CCDS36 VINLSVPIVLTATHEDKERLDGCTAFALMYEGRRVAILRNPEFFEHRKEERCARQWGTTC 310 320 330 340 350 360 360 370 380 390 400 410 pF1KE6 TKHPHIKMVMESGDWLVGGDLQVLEKIRWNDGLDQYRLTPLELKQKCKEMNADAVFAFQL .::.::::::.::::.:::::::... :::::::::::: ::::: :.::::::::::: CCDS36 KNHPYIKMVMEQGDWLIGGDLQVLDRVYWNDGLDQYRLTPTELKQKFKDMNADAVFAFQL 370 380 390 400 410 420 420 430 440 450 460 470 pF1KE6 RNPVHNGHALLMQDTRRRLLERGYKHPVLLLHPLGGWTKDDDVPLDWRMKQHAAVLEEGV :::::::::::::::...::::::..::::::::::::::::::: :::::::::::::: CCDS36 RNPVHNGHALLMQDTHKQLLERGYRRPVLLLHPLGGWTKDDDVPLMWRMKQHAAVLEEGV 430 440 450 460 470 480 480 490 500 510 520 530 pF1KE6 LDPKSTIVAIFPSPMLYAGPTEVQWHCRSRMIAGANFYIVGRDPAGMPHPETKKDLYEPT :.:..:.::::::::.::::::::::::.::.:::::::::::::::::::: ::::::. CCDS36 LNPETTVVAIFPSPMMYAGPTEVQWHCRARMVAGANFYIVGRDPAGMPHPETGKDLYEPS 490 500 510 520 530 540 540 550 560 570 580 590 pF1KE6 HGGKVLSMAPGLTSVEIIPFRVAAYNKAKKAMDFYDPARHNEFDFISGTRMRKLAREGEN ::.:::.::::: ..::.::::::::: :: ::.:: .:..:.::::::::::::::.. CCDS36 HGAKVLTMAPGLITLEIVPFRVAAYNKKKKRMDYYDSEHHEDFEFISGTRMRKLAREGQK 550 560 570 580 590 600 600 610 pF1KE6 PPDGFMAPKAWKVLTDYYRSLEKN ::.:::::::: :::.::.:::: CCDS36 PPEGFMAPKAWTVLTEYYKSLEKA 610 620 614 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 13:13:28 2016 done: Tue Nov 8 13:13:29 2016 Total Scan time: 2.730 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]