FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5754, 736 aa 1>>>pF1KE5754 736 - 736 aa - 736 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.7487+/-0.00115; mu= 8.1287+/- 0.069 mean_var=117.6872+/-23.117, 0's: 0 Z-trim(105.4): 27 B-trim: 5 in 2/52 Lambda= 0.118225 statistics sampled from 8391 (8401) to 8391 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.615), E-opt: 0.2 (0.258), width: 16 Scan time: 3.260 The best scores are: opt bits E(32554) CCDS1863.1 PAPOLG gene_id:64895|Hs108|chr2 ( 736) 4845 838.1 0 CCDS9946.1 PAPOLA gene_id:10914|Hs108|chr14 ( 745) 2817 492.2 1.2e-138 CCDS78202.1 PAPOLB gene_id:56903|Hs108|chr7 ( 637) 2540 444.9 1.8e-124 CCDS58334.1 PAPOLA gene_id:10914|Hs108|chr14 ( 285) 1457 260.1 3.4e-69 CCDS58335.1 PAPOLA gene_id:10914|Hs108|chr14 ( 238) 994 181.1 1.7e-45 >>CCDS1863.1 PAPOLG gene_id:64895|Hs108|chr2 (736 aa) initn: 4845 init1: 4845 opt: 4845 Z-score: 4472.0 bits: 838.1 E(32554): 0 Smith-Waterman score: 4845; 100.0% identity (100.0% similar) in 736 aa overlap (1-736:1-736) 10 20 30 40 50 60 pF1KE5 MKEMSANTVLDSQRQQKHYGITSPISLASPKEIDHIYTQKLIDAMKPFGVFEDEEELNHR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 MKEMSANTVLDSQRQQKHYGITSPISLASPKEIDHIYTQKLIDAMKPFGVFEDEEELNHR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 LVVLGKLNNLVKEWISDVSESKNLPPSVVATVGGKIFTFGSYRLGVHTKGADIDALCVAP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 LVVLGKLNNLVKEWISDVSESKNLPPSVVATVGGKIFTFGSYRLGVHTKGADIDALCVAP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 RHVERSDFFQSFFEKLKHQDGIRNLRAVEDAFVPVIKFEFDGIEIDLVFARLAIQTISDN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 RHVERSDFFQSFFEKLKHQDGIRNLRAVEDAFVPVIKFEFDGIEIDLVFARLAIQTISDN 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 LDLRDDSRLRSLDIRCIRSLNGCRVTDEILHLVPNKETFRLTLRAVKLWAKRRGIYSNML :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 LDLRDDSRLRSLDIRCIRSLNGCRVTDEILHLVPNKETFRLTLRAVKLWAKRRGIYSNML 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 GFLGGVSWAMLVARTCQLYPNAAASTLVHKFFLVFSKWEWPNPVLLKQPEESNLNLPVWD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 GFLGGVSWAMLVARTCQLYPNAAASTLVHKFFLVFSKWEWPNPVLLKQPEESNLNLPVWD 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE5 PRVNPSDRYHLMPIITPAYPQQNSTYNVSTSTRTVMVEEFKQGLAVTDEILQGKSDWSKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 PRVNPSDRYHLMPIITPAYPQQNSTYNVSTSTRTVMVEEFKQGLAVTDEILQGKSDWSKL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE5 LEPPNFFQKYRHYIVLTASASTEENHLEWVGLVESKIRVLVGNLERNEFITLAHVNPQSF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 LEPPNFFQKYRHYIVLTASASTEENHLEWVGLVESKIRVLVGNLERNEFITLAHVNPQSF 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE5 PGNKEHHKDNNYVSMWFLGIIFRRVENAESVNIDLTYDIQSFTDTVYRQANNINMLKEGM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 PGNKEHHKDNNYVSMWFLGIIFRRVENAESVNIDLTYDIQSFTDTVYRQANNINMLKEGM 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE5 KIEATHVKKKQLHHYLPAEILQKKKKQSLSDVNRSSGGLQSKRLSLDSSCLDSSRDTDNG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 KIEATHVKKKQLHHYLPAEILQKKKKQSLSDVNRSSGGLQSKRLSLDSSCLDSSRDTDNG 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE5 TPFNSPASKSDSPSVGETERNSAEPAAVIVEKPLSVPPAQGLSIPVIGAKVDSTVKTVSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 TPFNSPASKSDSPSVGETERNSAEPAAVIVEKPLSVPPAQGLSIPVIGAKVDSTVKTVSP 550 560 570 580 590 600 610 620 630 640 650 660 pF1KE5 PTVCTIPTVVGRNVIPRITTPHNPAQGQPHLNGMSNITKTVTPKRSHSPSIDGTPKRLKD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 PTVCTIPTVVGRNVIPRITTPHNPAQGQPHLNGMSNITKTVTPKRSHSPSIDGTPKRLKD 610 620 630 640 650 660 670 680 690 700 710 720 pF1KE5 VEKFIRLESTFKDPRTAEERKRKSVDAIGGESMPIPTIDTSRKKRLPSKELPDSSSPVPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 VEKFIRLESTFKDPRTAEERKRKSVDAIGGESMPIPTIDTSRKKRLPSKELPDSSSPVPA 670 680 690 700 710 720 730 pF1KE5 NNIRVIKNSIRLTLNR :::::::::::::::: CCDS18 NNIRVIKNSIRLTLNR 730 >>CCDS9946.1 PAPOLA gene_id:10914|Hs108|chr14 (745 aa) initn: 2784 init1: 2669 opt: 2817 Z-score: 2602.5 bits: 492.2 E(32554): 1.2e-138 Smith-Waterman score: 2844; 61.2% identity (78.8% similar) in 747 aa overlap (12-736:13-745) 10 20 30 40 50 pF1KE5 MKEMSANTVLDSQRQQKHYGITSPISLASPKEIDHIYTQKLIDAMKPFGVFEDEEELNH .: :::::::::::::.::: : . :::::...:::::::.::::.. CCDS99 MPFPVTTQGSQQTQPPQKHYGITSPISLAAPKETDCVLTQKLIETLKPFGVFEEEEELQR 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE5 RLVVLGKLNNLVKEWISDVSESKNLPPSVVATVGGKIFTFGSYRLGVHTKGADIDALCVA :...:::::::::::: ..::::::: ::. .:::::::::::::::::::::::::::: CCDS99 RILILGKLNNLVKEWIREISESKNLPQSVIENVGGKIFTFGSYRLGVHTKGADIDALCVA 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE5 PRHVERSDFFQSFFEKLKHQDGIRNLRAVEDAFVPVIKFEFDGIEIDLVFARLAIQTISD ::::.::::: ::..::: :. ...:::::.:::::::. :::::::..:::::.::: . CCDS99 PRHVDRSDFFTSFYDKLKLQEEVKDLRAVEEAFVPVIKLCFDGIEIDILFARLALQTIPE 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE5 NLDLRDDSRLRSLDIRCIRSLNGCRVTDEILHLVPNKETFRLTLRAVKLWAKRRGIYSNM .::::::: :..:::::::::::::::::::::::: ..:::::::.::::::..::::. CCDS99 DLDLRDDSLLKNLDIRCIRSLNGCRVTDEILHLVPNIDNFRLTLRAIKLWAKRHNIYSNI 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE5 LGFLGGVSWAMLVARTCQLYPNAAASTLVHKFFLVFSKWEWPNPVLLKQPEESNLNLPVW ::::::::::::::::::::::: :::::::::::::::::::::::::::: ::::::: CCDS99 LGFLGGVSWAMLVARTCQLYPNAIASTLVHKFFLVFSKWEWPNPVLLKQPEECNLNLPVW 250 260 270 280 290 300 300 310 320 330 340 350 pF1KE5 DPRVNPSDRYHLMPIITPAYPQQNSTYNVSTSTRTVMVEEFKQGLAVTDEILQGKSDWSK ::::::::::::::::::::::::::::::.::: :::::::::::.::::: .:..::: CCDS99 DPRVNPSDRYHLMPIITPAYPQQNSTYNVSVSTRMVMVEEFKQGLAITDEILLSKAEWSK 310 320 330 340 350 360 360 370 380 390 400 410 pF1KE5 LLEPPNFFQKYRHYIVLTASASTEENHLEWVGLVESKIRVLVGNLERNEFITLAHVNPQS :.: :::::::.::::: ::: ::...::::::::::::.:::.::.::::::::::::: CCDS99 LFEAPNFFQKYKHYIVLLASAPTEKQRLEWVGLVESKIRILVGSLEKNEFITLAHVNPQS 370 380 390 400 410 420 420 430 440 450 460 470 pF1KE5 FPGNKEHHKDNNYVSMWFLGIIFRRVENAESVNIDLTYDIQSFTDTVYRQANNINMLKEG ::. ::. ... .:: .:..:...::.:....::::::::::::::::: : .:.. CCDS99 FPAPKENPDKEEFRTMWVIGLVFKKTENSENLSVDLTYDIQSFTDTVYRQAINSKMFEVD 430 440 450 460 470 480 480 490 500 510 520 530 pF1KE5 MKIEATHVKKKQLHHYLPAEILQKKKKQSLSDVNRSSGGLQSKRLSLDSSCLDSSRDTDN ::: : :::.::::. :: ..::::::.: : : .:..: :: : :.:: CCDS99 MKIAAMHVKRKQLHQLLPNHVLQKKKKHSTEGV---------KLTALNDSSLDLSMDSDN 490 500 510 520 530 540 550 560 570 580 pF1KE5 GTPFNSPASKSD-SP--SVGETE-RNSAEPAAVIVEKPLSVPPAQGLSIPVI-------G . ::.: . :: : : .. ::: :: ... .. : .:.: . : CCDS99 SMSVPSPTSATKTSPLNSSGSSQGRNS--PAPAVTAASVTNIQATEVSVPQVNSSESSGG 540 550 560 570 580 590 600 610 620 630 640 pF1KE5 AKVDSTVKTVSPPTVCTIPT-VVGRNVIP-RITTPHNPAQGQPHLNGMSNITKTVTP--- .. .: .:.. :.. : .:.: : :...: ..:. .: . :: :: CCDS99 TSSESIPQTATQPAISPPPKPTVSRVVSSTRLVNPPPRSSGNAATSG-NAATKIPTPIVG 590 600 610 620 630 640 650 660 670 680 690 pF1KE5 -KRSHSPSIDGTPKRLKDVEKFIR-----LESTFKDPRTAEERKRKSVDAIGGESMPIPT ::. :: . .::. : : : . .: :.:. ... .:.. . CCDS99 VKRTSSPHKEESPKKTKTEEDETSEDANCLALSGHDKTEAKEQLDTETSTTQSETIQTAA 650 660 670 680 690 700 700 710 720 730 pF1KE5 IDTSRKKRLPSKELPDSSSPVPANNIRVIKNSIRLTLNR . .: : .: : . .::: : ::::::.: ::: CCDS99 SLLASQKT-SSTDLSDIPA-LPANPIPVIKNSIKLRLNR 710 720 730 740 >>CCDS78202.1 PAPOLB gene_id:56903|Hs108|chr7 (637 aa) initn: 2503 init1: 2503 opt: 2540 Z-score: 2348.3 bits: 444.9 E(32554): 1.8e-124 Smith-Waterman score: 2540; 63.7% identity (82.9% similar) in 615 aa overlap (19-626:21-627) 10 20 30 40 50 pF1KE5 MKEMSANTVLDSQRQQKHYGITSPISLASPKEIDHIYTQKLIDAMKPFGVFEDEEELN ::..:::::: ::: : . ::.::....::::::.::::. CCDS78 MMPFPVTTQGPPQPAPPPNRYGVSSPISLAVPKETDCLLTQRLIETLRPFGVFEEEEELQ 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE5 HRLVVLGKLNNLVKEWISDVSESKNLPPSVVATVGGKIFTFGSYRLGVHTKGADIDALCV .:..:: :::::::::: ..::::.:: ::. .::::::::::::::::::::::::::: CCDS78 RRILVLEKLNNLVKEWIREISESKSLPQSVIENVGGKIFTFGSYRLGVHTKGADIDALCV 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE5 APRHVERSDFFQSFFEKLKHQDGIRNLRAVEDAFVPVIKFEFDGIEIDLVFARLAIQTIS :: ::.::::: ::. ::: :. ...:::::.:::::::. :::::::..:::::.::: CCDS78 APSHVDRSDFFTSFYAKLKLQEEVKDLRAVEEAFVPVIKLCFDGIEIDILFARLALQTIP 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE5 DNLDLRDDSRLRSLDIRCIRSLNGCRVTDEILHLVPNKETFRLTLRAVKLWAKRRGIYSN ..::::::: :..:::::::::::::::::::::::: ..:::::::.::::: ..:::: CCDS78 EDLDLRDDSLLKNLDIRCIRSLNGCRVTDEILHLVPNIDNFRLTLRAIKLWAKCHNIYSN 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE5 MLGFLGGVSWAMLVARTCQLYPNAAASTLVHKFFLVFSKWEWPNPVLLKQPEESNLNLPV .:::::::::::::::::::::::.:::::.:::::::.::::::::::.::: :::::: CCDS78 ILGFLGGVSWAMLVARTCQLYPNAVASTLVRKFFLVFSEWEWPNPVLLKEPEERNLNLPV 250 260 270 280 290 300 300 310 320 330 340 350 pF1KE5 WDPRVNPSDRYHLMPIITPAYPQQNSTYNVSTSTRTVMVEEFKQGLAVTDEILQGKSDWS ::::::::::::::::::::::::::::::: ::: ::.::::::::.: ::: .:..:: CCDS78 WDPRVNPSDRYHLMPIITPAYPQQNSTYNVSISTRMVMIEEFKQGLAITHEILLSKAEWS 310 320 330 340 350 360 360 370 380 390 400 410 pF1KE5 KLLEPPNFFQKYRHYIVLTASASTEENHLEWVGLVESKIRVLVGNLERNEFITLAHVNPQ ::.: :.:::::.::::: ::::::..:::::::::::::.:::.::.:::::::::::: CCDS78 KLFEAPSFFQKYKHYIVLLASASTEKQHLEWVGLVESKIRILVGSLEKNEFITLAHVNPQ 370 380 390 400 410 420 420 430 440 450 460 470 pF1KE5 SFPGNKEHHKDNNYVSMWFLGIIFRRVENAESVNIDLTYDIQSFTDTVYRQANNINMLKE :::. ::. ... .:: .:. ... .:.: ..:::::::::::::::::: : .:.. CCDS78 SFPAPKENPDMEEFRTMWVIGLGLKKPDNSEILSIDLTYDIQSFTDTVYRQAVNSKMFEM 430 440 450 460 470 480 480 490 500 510 520 530 pF1KE5 GMKIEATHVKKKQLHHYLPAEILQKKKKQS-----LSDVNRSSGGLQSKRLSLDSSCLDS :::: : :...:.::. :: ..:: :: .: :.:.: :: :.. .: . : CCDS78 GMKITAMHLRRKELHQLLPHHVLQDKKAHSTEGRRLTDLNDSSFDLSAG--CENSMSVPS 490 500 510 520 530 540 550 560 570 580 590 pF1KE5 SRDTDNGTPFNSPASKSDSPSVGETERNSAEPAAV-IVEKPLSVPPAQGLSIPVIGAKVD : .: . :. : .. .::... . :. :. . . ... ..:... . CCDS78 STSTMKTGPLISSSQGRNSPALAVMTASVANIQATEFSLQQVNTNESSGVALN------E 540 550 560 570 580 590 600 610 620 630 640 650 pF1KE5 STVKTVSPPTVCTIP-TVVGRNVIPRITTPHNPAQGQPHLNGMSNITKTVTPKRSHSPSI : ..:: :.. : ..:.: : : : CCDS78 SIPHAVSQPAISPSPKAMVARVVSSTCLISHPDLQETQQQTYLIL 600 610 620 630 660 670 680 690 700 710 pF1KE5 DGTPKRLKDVEKFIRLESTFKDPRTAEERKRKSVDAIGGESMPIPTIDTSRKKRLPSKEL >>CCDS58334.1 PAPOLA gene_id:10914|Hs108|chr14 (285 aa) initn: 1457 init1: 1457 opt: 1457 Z-score: 1355.8 bits: 260.1 E(32554): 3.4e-69 Smith-Waterman score: 1457; 79.8% identity (94.0% similar) in 267 aa overlap (12-278:13-279) 10 20 30 40 50 pF1KE5 MKEMSANTVLDSQRQQKHYGITSPISLASPKEIDHIYTQKLIDAMKPFGVFEDEEELNH .: :::::::::::::.::: : . :::::...:::::::.::::.. CCDS58 MPFPVTTQGSQQTQPPQKHYGITSPISLAAPKETDCVLTQKLIETLKPFGVFEEEEELQR 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE5 RLVVLGKLNNLVKEWISDVSESKNLPPSVVATVGGKIFTFGSYRLGVHTKGADIDALCVA :...:::::::::::: ..::::::: ::. .:::::::::::::::::::::::::::: CCDS58 RILILGKLNNLVKEWIREISESKNLPQSVIENVGGKIFTFGSYRLGVHTKGADIDALCVA 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE5 PRHVERSDFFQSFFEKLKHQDGIRNLRAVEDAFVPVIKFEFDGIEIDLVFARLAIQTISD ::::.::::: ::..::: :. ...:::::.:::::::. :::::::..:::::.::: . CCDS58 PRHVDRSDFFTSFYDKLKLQEEVKDLRAVEEAFVPVIKLCFDGIEIDILFARLALQTIPE 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE5 NLDLRDDSRLRSLDIRCIRSLNGCRVTDEILHLVPNKETFRLTLRAVKLWAKRRGIYSNM .::::::: :..:::::::::::::::::::::::: ..:::::::.::::::..::::. CCDS58 DLDLRDDSLLKNLDIRCIRSLNGCRVTDEILHLVPNIDNFRLTLRAIKLWAKRHNIYSNI 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE5 LGFLGGVSWAMLVARTCQLYPNAAASTLVHKFFLVFSKWEWPNPVLLKQPEESNLNLPVW ::::::::::::::::::::::: ::::::::::::::: CCDS58 LGFLGGVSWAMLVARTCQLYPNAIASTLVHKFFLVFSKWYVFRLY 250 260 270 280 300 310 320 330 340 350 pF1KE5 DPRVNPSDRYHLMPIITPAYPQQNSTYNVSTSTRTVMVEEFKQGLAVTDEILQGKSDWSK >>CCDS58335.1 PAPOLA gene_id:10914|Hs108|chr14 (238 aa) initn: 989 init1: 989 opt: 994 Z-score: 930.2 bits: 181.1 E(32554): 1.7e-45 Smith-Waterman score: 994; 75.6% identity (92.2% similar) in 193 aa overlap (12-204:13-205) 10 20 30 40 50 pF1KE5 MKEMSANTVLDSQRQQKHYGITSPISLASPKEIDHIYTQKLIDAMKPFGVFEDEEELNH .: :::::::::::::.::: : . :::::...:::::::.::::.. CCDS58 MPFPVTTQGSQQTQPPQKHYGITSPISLAAPKETDCVLTQKLIETLKPFGVFEEEEELQR 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE5 RLVVLGKLNNLVKEWISDVSESKNLPPSVVATVGGKIFTFGSYRLGVHTKGADIDALCVA :...:::::::::::: ..::::::: ::. .:::::::::::::::::::::::::::: CCDS58 RILILGKLNNLVKEWIREISESKNLPQSVIENVGGKIFTFGSYRLGVHTKGADIDALCVA 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE5 PRHVERSDFFQSFFEKLKHQDGIRNLRAVEDAFVPVIKFEFDGIEIDLVFARLAIQTISD ::::.::::: ::..::: :. ...:::::.:::::::. :::::::..:::::.::: . CCDS58 PRHVDRSDFFTSFYDKLKLQEEVKDLRAVEEAFVPVIKLCFDGIEIDILFARLALQTIPE 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE5 NLDLRDDSRLRSLDIRCIRSLNGCRVTDEILHLVPNKETFRLTLRAVKLWAKRRGIYSNM .::::::: :..::::::::::: : CCDS58 DLDLRDDSLLKNLDIRCIRSLNGMRKPTSFCVLQFLSDISCFYTSFVLKLFIAILLTQ 190 200 210 220 230 736 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 06:22:27 2016 done: Tue Nov 8 06:22:27 2016 Total Scan time: 3.260 Total Display time: 0.050 Function used was FASTA [36.3.4 Apr, 2011]