FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5002, 636 aa 1>>>pF1KE5002 636 - 636 aa - 636 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.2892+/-0.000814; mu= 15.6973+/- 0.050 mean_var=87.3237+/-17.233, 0's: 0 Z-trim(108.9): 13 B-trim: 0 in 0/53 Lambda= 0.137249 statistics sampled from 10486 (10494) to 10486 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.669), E-opt: 0.2 (0.322), width: 16 Scan time: 3.850 The best scores are: opt bits E(32554) CCDS78202.1 PAPOLB gene_id:56903|Hs108|chr7 ( 637) 4169 835.4 0 CCDS9946.1 PAPOLA gene_id:10914|Hs108|chr14 ( 745) 3519 706.8 2.7e-203 CCDS1863.1 PAPOLG gene_id:64895|Hs108|chr2 ( 736) 2539 512.7 6.9e-145 CCDS58334.1 PAPOLA gene_id:10914|Hs108|chr14 ( 285) 1711 348.6 6.9e-96 CCDS58335.1 PAPOLA gene_id:10914|Hs108|chr14 ( 238) 1234 254.1 1.6e-67 >>CCDS78202.1 PAPOLB gene_id:56903|Hs108|chr7 (637 aa) initn: 4169 init1: 4169 opt: 4169 Z-score: 4459.9 bits: 835.4 E(32554): 0 Smith-Waterman score: 4169; 99.5% identity (100.0% similar) in 636 aa overlap (1-636:2-637) 10 20 30 40 50 pF1KE5 MPFPVTTQGPPQPAPPPNRYGVSSPISLAVPKETDCLLTQRLIETLRPFGVFEEEEELQ ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 MMPFPVTTQGPPQPAPPPNRYGVSSPISLAVPKETDCLLTQRLIETLRPFGVFEEEEELQ 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE5 RRILVLDKLNNLVKEWIREISESKSLPQSVIENVGGKIFTFGSYRLGVHTKGADIDALCV ::::::.::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 RRILVLEKLNNLVKEWIREISESKSLPQSVIENVGGKIFTFGSYRLGVHTKGADIDALCV 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE5 APSHVDRSDFFTSFYAKLKLQEEVKDLRAVQEAFVPVIKLCFDGIEIDILFARLALQTIP ::::::::::::::::::::::::::::::.::::::::::::::::::::::::::::: CCDS78 APSHVDRSDFFTSFYAKLKLQEEVKDLRAVEEAFVPVIKLCFDGIEIDILFARLALQTIP 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE5 EDLDLRDDSLLKNLDIRCIRSLNGCRVTDEILHLVPNIDNFRLTLRAIKLWAKCHNIYSN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 EDLDLRDDSLLKNLDIRCIRSLNGCRVTDEILHLVPNIDNFRLTLRAIKLWAKCHNIYSN 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE5 ILGFLGGVSWAMLVARTCQLYPNAVASTLVRKFFLVFSEWEWPNPVLLKEPEERNLNLPV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 ILGFLGGVSWAMLVARTCQLYPNAVASTLVRKFFLVFSEWEWPNPVLLKEPEERNLNLPV 250 260 270 280 290 300 300 310 320 330 340 350 pF1KE5 WDPRVNPSDRYHLMPIITPAYPQQNSTYNVSISTRMVMIEEFKQGLAITHEILLSKAEWS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 WDPRVNPSDRYHLMPIITPAYPQQNSTYNVSISTRMVMIEEFKQGLAITHEILLSKAEWS 310 320 330 340 350 360 360 370 380 390 400 410 pF1KE5 KLFEAPSFFQKYKHYIVLLASASTEKQHLEWVGLVESKIRILVGSLEKNEFITLAHVNPQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 KLFEAPSFFQKYKHYIVLLASASTEKQHLEWVGLVESKIRILVGSLEKNEFITLAHVNPQ 370 380 390 400 410 420 420 430 440 450 460 470 pF1KE5 SFPAPKENPDMEEFRTMWVIGLGLKKPDNSEILSIDLTYDIQSFTDTVYRQAVNSKMFEM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 SFPAPKENPDMEEFRTMWVIGLGLKKPDNSEILSIDLTYDIQSFTDTVYRQAVNSKMFEM 430 440 450 460 470 480 480 490 500 510 520 530 pF1KE5 GMKITAVHLRRKELHQLLPHHVLQDKKAHSTEGRRLTDLNDSSFDLSAGCENSMSVPSST ::::::.::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 GMKITAMHLRRKELHQLLPHHVLQDKKAHSTEGRRLTDLNDSSFDLSAGCENSMSVPSST 490 500 510 520 530 540 540 550 560 570 580 590 pF1KE5 STMKTGPLISSSQGRNSPALAVMTASVANIQATEFSLQQVNTNESSGVALNESIPHAVSQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 STMKTGPLISSSQGRNSPALAVMTASVANIQATEFSLQQVNTNESSGVALNESIPHAVSQ 550 560 570 580 590 600 600 610 620 630 pF1KE5 PAISPSPKAMVARVVSSTCLISHPDLQETQQQTYLIL ::::::::::::::::::::::::::::::::::::: CCDS78 PAISPSPKAMVARVVSSTCLISHPDLQETQQQTYLIL 610 620 630 >>CCDS9946.1 PAPOLA gene_id:10914|Hs108|chr14 (745 aa) initn: 3525 init1: 3245 opt: 3519 Z-score: 3763.3 bits: 706.8 E(32554): 2.7e-203 Smith-Waterman score: 3519; 85.8% identity (93.6% similar) in 625 aa overlap (1-623:1-625) 10 20 30 40 50 60 pF1KE5 MPFPVTTQGPPQPAPPPNRYGVSSPISLAVPKETDCLLTQRLIETLRPFGVFEEEEELQR ::::::::: : :: ..::..::::::.::::::.:::.:::::.::::::::::::: CCDS99 MPFPVTTQGSQQTQPPQKHYGITSPISLAAPKETDCVLTQKLIETLKPFGVFEEEEELQR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 RILVLDKLNNLVKEWIREISESKSLPQSVIENVGGKIFTFGSYRLGVHTKGADIDALCVA :::.: :::::::::::::::::.:::::::::::::::::::::::::::::::::::: CCDS99 RILILGKLNNLVKEWIREISESKNLPQSVIENVGGKIFTFGSYRLGVHTKGADIDALCVA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 PSHVDRSDFFTSFYAKLKLQEEVKDLRAVQEAFVPVIKLCFDGIEIDILFARLALQTIPE : :::::::::::: ::::::::::::::.:::::::::::::::::::::::::::::: CCDS99 PRHVDRSDFFTSFYDKLKLQEEVKDLRAVEEAFVPVIKLCFDGIEIDILFARLALQTIPE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 DLDLRDDSLLKNLDIRCIRSLNGCRVTDEILHLVPNIDNFRLTLRAIKLWAKCHNIYSNI :::::::::::::::::::::::::::::::::::::::::::::::::::: ::::::: CCDS99 DLDLRDDSLLKNLDIRCIRSLNGCRVTDEILHLVPNIDNFRLTLRAIKLWAKRHNIYSNI 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 LGFLGGVSWAMLVARTCQLYPNAVASTLVRKFFLVFSEWEWPNPVLLKEPEERNLNLPVW :::::::::::::::::::::::.:::::.:::::::.::::::::::.::: ::::::: CCDS99 LGFLGGVSWAMLVARTCQLYPNAIASTLVHKFFLVFSKWEWPNPVLLKQPEECNLNLPVW 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE5 DPRVNPSDRYHLMPIITPAYPQQNSTYNVSISTRMVMIEEFKQGLAITHEILLSKAEWSK ::::::::::::::::::::::::::::::.::::::.:::::::::: ::::::::::: CCDS99 DPRVNPSDRYHLMPIITPAYPQQNSTYNVSVSTRMVMVEEFKQGLAITDEILLSKAEWSK 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE5 LFEAPSFFQKYKHYIVLLASASTEKQHLEWVGLVESKIRILVGSLEKNEFITLAHVNPQS :::::.::::::::::::::: ::::.::::::::::::::::::::::::::::::::: CCDS99 LFEAPNFFQKYKHYIVLLASAPTEKQRLEWVGLVESKIRILVGSLEKNEFITLAHVNPQS 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE5 FPAPKENPDMEEFRTMWVIGLGLKKPDNSEILSIDLTYDIQSFTDTVYRQAVNSKMFEMG ::::::::: ::::::::::: .:: .::: ::.:::::::::::::::::.::::::. CCDS99 FPAPKENPDKEEFRTMWVIGLVFKKTENSENLSVDLTYDIQSFTDTVYRQAINSKMFEVD 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE5 MKITAVHLRRKELHQLLPHHVLQDKKAHSTEGRRLTDLNDSSFDLSAGCENSMSVPSSTS :::.:.:..::.::::::.:::: :: ::::: .:: :::::.::: .::::::: :: CCDS99 MKIAAMHVKRKQLHQLLPNHVLQKKKKHSTEGVKLTALNDSSLDLSMDSDNSMSVPSPTS 490 500 510 520 530 540 550 560 570 580 590 pF1KE5 TMKTGPLISS--SQGRNSPALAVMTASVANIQATEFSLQQVNTNESSGVALNESIPHAVS . ::.:: :: :::::::: :: .:::.:::::: :. :::..:::: . .::::.... CCDS99 ATKTSPLNSSGSSQGRNSPAPAVTAASVTNIQATEVSVPQVNSSESSGGTSSESIPQTAT 550 560 570 580 590 600 600 610 620 630 pF1KE5 QPAISPSPKAMVARVVSSTCLISHPDLQETQQQTYLIL :::::: :: :.:::::: :.. : CCDS99 QPAISPPPKPTVSRVVSSTRLVNPPPRSSGNAATSGNAATKIPTPIVGVKRTSSPHKEES 610 620 630 640 650 660 >>CCDS1863.1 PAPOLG gene_id:64895|Hs108|chr2 (736 aa) initn: 2502 init1: 2502 opt: 2539 Z-score: 2714.6 bits: 512.7 E(32554): 6.9e-145 Smith-Waterman score: 2539; 63.6% identity (83.1% similar) in 615 aa overlap (20-626:19-626) 10 20 30 40 50 60 pF1KE5 MPFPVTTQGPPQPAPPPNRYGVSSPISLAVPKETDCLLTQRLIETLRPFGVFEEEEELQR ::..:::::: ::: : . ::.::....::::::.::::.. CCDS18 MKEMSANTVLDSQRQQKHYGITSPISLASPKEIDHIYTQKLIDAMKPFGVFEDEEELNH 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 RILVLDKLNNLVKEWIREISESKSLPQSVIENVGGKIFTFGSYRLGVHTKGADIDALCVA :..:: :::::::::: ..::::.:: ::. .:::::::::::::::::::::::::::: CCDS18 RLVVLGKLNNLVKEWISDVSESKNLPPSVVATVGGKIFTFGSYRLGVHTKGADIDALCVA 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE5 PSHVDRSDFFTSFYAKLKLQEEVKDLRAVQEAFVPVIKLCFDGIEIDILFARLALQTIPE : ::.::::: ::. ::: :. ...::::..:::::::. :::::::..:::::.::: . CCDS18 PRHVERSDFFQSFFEKLKHQDGIRNLRAVEDAFVPVIKFEFDGIEIDLVFARLAIQTISD 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE5 DLDLRDDSLLKNLDIRCIRSLNGCRVTDEILHLVPNIDNFRLTLRAIKLWAKCHNIYSNI .::::::: :..:::::::::::::::::::::::: ..:::::::.::::: ..::::. CCDS18 NLDLRDDSRLRSLDIRCIRSLNGCRVTDEILHLVPNKETFRLTLRAVKLWAKRRGIYSNM 180 190 200 210 220 230 250 260 270 280 290 300 pF1KE5 LGFLGGVSWAMLVARTCQLYPNAVASTLVRKFFLVFSEWEWPNPVLLKEPEERNLNLPVW :::::::::::::::::::::::.:::::.:::::::.::::::::::.::: ::::::: CCDS18 LGFLGGVSWAMLVARTCQLYPNAAASTLVHKFFLVFSKWEWPNPVLLKQPEESNLNLPVW 240 250 260 270 280 290 310 320 330 340 350 360 pF1KE5 DPRVNPSDRYHLMPIITPAYPQQNSTYNVSISTRMVMIEEFKQGLAITHEILLSKAEWSK :::::::::::::::::::::::::::::: ::: ::.::::::::.: ::: .:..::: CCDS18 DPRVNPSDRYHLMPIITPAYPQQNSTYNVSTSTRTVMVEEFKQGLAVTDEILQGKSDWSK 300 310 320 330 340 350 370 380 390 400 410 420 pF1KE5 LFEAPSFFQKYKHYIVLLASASTEKQHLEWVGLVESKIRILVGSLEKNEFITLAHVNPQS :.: :.:::::.::::: ::::::..:::::::::::::.:::.::.::::::::::::: CCDS18 LLEPPNFFQKYRHYIVLTASASTEENHLEWVGLVESKIRVLVGNLERNEFITLAHVNPQS 360 370 380 390 400 410 430 440 450 460 470 480 pF1KE5 FPAPKENPDMEEFRTMWVIGLGLKKPDNSEILSIDLTYDIQSFTDTVYRQAVNSKMFEMG ::. ::. ... .:: .:. ... .:.: ..:::::::::::::::::: : .:.. : CCDS18 FPGNKEHHKDNNYVSMWFLGIIFRRVENAESVNIDLTYDIQSFTDTVYRQANNINMLKEG 420 430 440 450 460 470 490 500 510 520 530 pF1KE5 MKITAVHLRRKELHQLLPHHVLQDKKAHSTEGRRLTDLNDSSFDLSAG--CENSMSVPSS ::: :.:...:.::. :: ..:: :: .: :.:.: :: :.. .: . :: CCDS18 MKIEATHVKKKQLHHYLPAEILQKKKKQS-----LSDVNRSSGGLQSKRLSLDSSCLDSS 480 490 500 510 520 530 540 550 560 570 580 590 pF1KE5 TSTMKTGPLISSSQGRNSPALAVMTASVANIQATEFSLQQVNTNESSGVALN------ES .: . :. : .. .::... . :. :. . . ... ..:... .: CCDS18 RDTDNGTPFNSPASKSDSPSVGETERNSAE-PAAVIVEKPLSVPPAQGLSIPVIGAKVDS 540 550 560 570 580 590 600 610 620 630 pF1KE5 IPHAVSQPAISPSPKAMVARVVSSTCLISHPDLQETQQQTYLIL ..:: :.. : ..:.: : : : CCDS18 TVKTVSPPTVCTIP-TVVGRNVIPRITTPHNPAQGQPHLNGMSNITKTVTPKRSHSPSID 600 610 620 630 640 650 CCDS18 GTPKRLKDVEKFIRLESTFKDPRTAEERKRKSVDAIGGESMPIPTIDTSRKKRLPSKELP 660 670 680 690 700 710 >>CCDS58334.1 PAPOLA gene_id:10914|Hs108|chr14 (285 aa) initn: 1711 init1: 1711 opt: 1711 Z-score: 1835.0 bits: 348.6 E(32554): 6.9e-96 Smith-Waterman score: 1711; 91.8% identity (96.8% similar) in 279 aa overlap (1-279:1-279) 10 20 30 40 50 60 pF1KE5 MPFPVTTQGPPQPAPPPNRYGVSSPISLAVPKETDCLLTQRLIETLRPFGVFEEEEELQR ::::::::: : :: ..::..::::::.::::::.:::.:::::.::::::::::::: CCDS58 MPFPVTTQGSQQTQPPQKHYGITSPISLAAPKETDCVLTQKLIETLKPFGVFEEEEELQR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 RILVLDKLNNLVKEWIREISESKSLPQSVIENVGGKIFTFGSYRLGVHTKGADIDALCVA :::.: :::::::::::::::::.:::::::::::::::::::::::::::::::::::: CCDS58 RILILGKLNNLVKEWIREISESKNLPQSVIENVGGKIFTFGSYRLGVHTKGADIDALCVA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 PSHVDRSDFFTSFYAKLKLQEEVKDLRAVQEAFVPVIKLCFDGIEIDILFARLALQTIPE : :::::::::::: ::::::::::::::.:::::::::::::::::::::::::::::: CCDS58 PRHVDRSDFFTSFYDKLKLQEEVKDLRAVEEAFVPVIKLCFDGIEIDILFARLALQTIPE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 DLDLRDDSLLKNLDIRCIRSLNGCRVTDEILHLVPNIDNFRLTLRAIKLWAKCHNIYSNI :::::::::::::::::::::::::::::::::::::::::::::::::::: ::::::: CCDS58 DLDLRDDSLLKNLDIRCIRSLNGCRVTDEILHLVPNIDNFRLTLRAIKLWAKRHNIYSNI 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 LGFLGGVSWAMLVARTCQLYPNAVASTLVRKFFLVFSEWEWPNPVLLKEPEERNLNLPVW :::::::::::::::::::::::.:::::.:::::::.: CCDS58 LGFLGGVSWAMLVARTCQLYPNAIASTLVHKFFLVFSKWYVFRLY 250 260 270 280 310 320 330 340 350 360 pF1KE5 DPRVNPSDRYHLMPIITPAYPQQNSTYNVSISTRMVMIEEFKQGLAITHEILLSKAEWSK >>CCDS58335.1 PAPOLA gene_id:10914|Hs108|chr14 (238 aa) initn: 1227 init1: 1227 opt: 1234 Z-score: 1325.7 bits: 254.1 E(32554): 1.6e-67 Smith-Waterman score: 1234; 81.9% identity (91.4% similar) in 232 aa overlap (1-230:1-231) 10 20 30 40 50 60 pF1KE5 MPFPVTTQGPPQPAPPPNRYGVSSPISLAVPKETDCLLTQRLIETLRPFGVFEEEEELQR ::::::::: : :: ..::..::::::.::::::.:::.:::::.::::::::::::: CCDS58 MPFPVTTQGSQQTQPPQKHYGITSPISLAAPKETDCVLTQKLIETLKPFGVFEEEEELQR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 RILVLDKLNNLVKEWIREISESKSLPQSVIENVGGKIFTFGSYRLGVHTKGADIDALCVA :::.: :::::::::::::::::.:::::::::::::::::::::::::::::::::::: CCDS58 RILILGKLNNLVKEWIREISESKNLPQSVIENVGGKIFTFGSYRLGVHTKGADIDALCVA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 PSHVDRSDFFTSFYAKLKLQEEVKDLRAVQEAFVPVIKLCFDGIEIDILFARLALQTIPE : :::::::::::: ::::::::::::::.:::::::::::::::::::::::::::::: CCDS58 PRHVDRSDFFTSFYDKLKLQEEVKDLRAVEEAFVPVIKLCFDGIEIDILFARLALQTIPE 130 140 150 160 170 180 190 200 210 220 230 pF1KE5 DLDLRDDSLLKNLDIRCIRSLNGCRVTDE--ILHLVPNIDNFRLTLRAIKLWAKCHNIYS ::::::::::::::::::::::: : .:... .:. : .. ..::. CCDS58 DLDLRDDSLLKNLDIRCIRSLNGMRKPTSFCVLQFLSDISCFYTSF-VLKLFIAILLTQ 190 200 210 220 230 240 250 260 270 280 290 pF1KE5 NILGFLGGVSWAMLVARTCQLYPNAVASTLVRKFFLVFSEWEWPNPVLLKEPEERNLNLP 636 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 04:02:50 2016 done: Tue Nov 8 04:02:50 2016 Total Scan time: 3.850 Total Display time: 0.050 Function used was FASTA [36.3.4 Apr, 2011]