FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6771, 563 aa
  1>>>pF1KE6771 563 - 563 aa - 563 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.6101+/-0.000965; mu= 16.4720+/- 0.058
 mean_var=64.3245+/-12.796, 0's: 0 Z-trim(104.3): 7 B-trim: 20 in 1/49
 Lambda= 0.159914
 statistics sampled from 7833 (7836) to 7833 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.616), E-opt: 0.2 (0.241), width: 16
 Scan time: 1.990

The best scores are:                                      opt  bits E(32554)
CCDS34184.1 MCCC2 gene_id:64087|Hs108|chr5         ( 563) 3746 873.4       0
CCDS3089.1 PCCB gene_id:5096|Hs108|chr3            ( 539)  770 186.8 5.8e-47
CCDS54643.1 PCCB gene_id:5096|Hs108|chr3           ( 559)  619 152.0 1.8e-36

>>CCDS34184.1 MCCC2 gene_id:64087|Hs108|chr5 (563 aa)
 initn: 3746 init1: 3746 opt: 3746 Z-score: 4666.9 bits: 873.4 E(32554): 0
Smith-Waterman score: 3746; 100.0% identity (100.0% similar) in 563 aa overlap (1-563:1-563)

               10        20        30        40        50        60
pF1KE6 MWAVLRLALRPCARASPAGPRAYHGDSVASLGTQPDLGSALYQENYKQMKALVNQLHERV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MWAVLRLALRPCARASPAGPRAYHGDSVASLGTQPDLGSALYQENYKQMKALVNQLHERV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 EHIKLGGGEKARALHISRGKLLPRERIDNLIDPGSPFLELSQFAGYQLYDNEEVPGGGII
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 EHIKLGGGEKARALHISRGKLLPRERIDNLIDPGSPFLELSQFAGYQLYDNEEVPGGGII
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 TGIGRVSGVECMIIANDATVKGGAYYPVTVKKQLRAQEIAMQNRLPCIYLVDSGGAYLPR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 TGIGRVSGVECMIIANDATVKGGAYYPVTVKKQLRAQEIAMQNRLPCIYLVDSGGAYLPR
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 QADVFPDRDHFGRTFYNQAIMSSKNIAQIAVVMGSCTAGGAYVPAMADENIIVRKQGTIF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 
QADVFPDRDHFGRTFYNQAIMSSKNIAQIAVVMGSCTAGGAYVPAMADENIIVRKQGTIF 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 LAGPPLVKAATGEEVSAEDLGGADLHCRKSGVSDHWALDDHHALHLTRKVVRNLNYQKKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 LAGPPLVKAATGEEVSAEDLGGADLHCRKSGVSDHWALDDHHALHLTRKVVRNLNYQKKL 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE6 DVTIEPSEEPLFPADELYGIVGANLKRSFDVREVIARIVDGSRFTEFKAFYGDTLVTGFA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 DVTIEPSEEPLFPADELYGIVGANLKRSFDVREVIARIVDGSRFTEFKAFYGDTLVTGFA 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE6 RIFGYPVGIVGNNGVLFSESAKKGTHFVQLCCQRNIPLLFLQNITGFMVGREYEAEGIAK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 RIFGYPVGIVGNNGVLFSESAKKGTHFVQLCCQRNIPLLFLQNITGFMVGREYEAEGIAK 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE6 DGAKMVAAVACAQVPKITLIIGGSYGAGNYGMCGRAYSPRFLYIWPNARISVMGGEQAAN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 DGAKMVAAVACAQVPKITLIIGGSYGAGNYGMCGRAYSPRFLYIWPNARISVMGGEQAAN 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE6 VLATITKDQRAREGKQFSSADEAALKEPIIKKFEEEGNPYYSSARVWDDGIIDPADTRLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 VLATITKDQRAREGKQFSSADEAALKEPIIKKFEEEGNPYYSSARVWDDGIIDPADTRLV 490 500 510 520 530 540 550 560 pF1KE6 LGLSFSAALNAPIEKTDFGIFRM ::::::::::::::::::::::: CCDS34 LGLSFSAALNAPIEKTDFGIFRM 550 560 >>CCDS3089.1 PCCB gene_id:5096|Hs108|chr3 (539 aa) initn: 814 init1: 237 opt: 770 Z-score: 956.6 bits: 186.8 E(32554): 5.8e-47 Smith-Waterman score: 811; 31.9% identity (61.3% similar) in 530 aa overlap (28-538:7-512) 10 20 30 40 50 60 pF1KE6 MWAVLRLALRPCARASPAGPRAYHGDSVASLGTQPDLGSALYQENYKQMKALVNQLHERV ::..:.. .. .. . ... . .....::. CCDS30 MAAALRVAAVGARLSVLASGLRAAVRSLCSQATSVNERI 10 20 30 70 80 90 100 110 pF1KE6 EHIK----LGGGEKARALHISRGKLLPRERIDNLIDPGSPFLELSQF-----AGYQLY-D :. . ::::.. . .:::: ::::. :.:::: :.: ..: : . . 
: CCDS30 ENKRRTALLGGGQRRIDAQHKRGKLTARERISLLLDPGS-FVESDMFVEHRCADFGMAAD 40 50 60 70 80 90 120 130 140 150 160 170 pF1KE6 NEEVPGGGIITGIGRVSGVECMIIANDATVKGGAYYPVTVKKQLRAQEIAMQNRLPCIYL ... :: ...:: ::..: .....: :: ::. . ..: . .. :. : : : CCDS30 KNKFPGDSVVTGRGRINGRLVYVFSQDFTVFGGSLSGAHAQKICKIMDQAITVGAPVIGL 100 110 120 130 140 150 180 190 200 210 220 pF1KE6 VDSGGAYLPRQADVFPDR-DHFGRTFYNQAIMSSKNIAQIAVVMGSCTAGGAYVPAMADE ::::: . . .. . : : :. . .: : ::...:: :..:..: ::..: CCDS30 NDSGGARIQEGVESLAGYADIFLRN-----VTASGVIPQISLIMGPCAGGAVYSPALTDF 160 170 180 190 200 210 230 240 250 260 270 280 pF1KE6 NIIVRKQGTIFLAGPPLVKAATGEEVSAEDLGGADLHCRKSGVSDHWALDDHHALHLTRK ...:. . .:..:: .::..:.:.:. :.:::: : :::. . .: :: : CCDS30 TFMVKDTSYLFITGPDVVKSVTNEDVTQEELGGAKTHTTMSGVAHRAFENDVDALCNLRD 220 230 240 250 260 270 290 300 310 320 330 340 pF1KE6 VVRNLNYQKKLDVTIEPSEEP---LFPADELYGIVGANLKRSFDVREVIARIVDGSRFTE : ... . .. ..: : : :: :: . ..... ..: .:: .: : CCDS30 FFNYLPLSSQDPAPVRECHDPSDRLVP--ELDTIVPLESTKAYNMVDIIHSVVDEREFFE 280 290 300 310 320 330 350 360 370 380 390 400 pF1KE6 FKAFYGDTLVTGFARIFGYPVGIVGN-----NGVLFSESAKKGTHFVQLCCQRNIPLLFL . :. ....::::. : :::::: .: : .:. ::..::..: ::::. . CCDS30 IMPNYAKNIIVGFARMNGRTVGIVGNQPKVASGCLDINSSVKGARFVRFCDAFNIPLITF 340 350 360 370 380 390 410 420 430 440 450 460 pF1KE6 QNITGFMVGREYEAEGIAKDGAKMVAAVACAQVPKITLIIGGSYGAGNYGMCGRAYSPRF .. ::. : : :: . :::.. : : : :::.:.: .::.. : .. CCDS30 VDVPGFLPGTAQEYGGIIRHGAKLLYAFAEATVPKVTVITRKAYGGAYDVMSSKHLCGDT 400 410 420 430 440 450 470 480 490 500 510 520 pF1KE6 LYIWPNARISVMGGEQAANVLATITKDQRAREGKQFSSADEAALKEPIIKKFEEEGNPYY : ::.:.:.:::.. :... : : .. :. : :.:: .::. CCDS30 NYAWPTAEIAVMGAKGAVEI---IFKGHENVEAAQ----------AEYIEKF---ANPFP 460 470 480 490 530 540 550 560 pF1KE6 SSARVWDDGIIDPADTRLVLGLSFSAALNAPIEKTDFGIFRM ...: . 
: ::.:..:: CCDS30 AAVRGFVDDIIQPSSTRARICCDLDVLASKKVQRPWRKHANIPL 500 510 520 530 >>CCDS54643.1 PCCB gene_id:5096|Hs108|chr3 (559 aa) initn: 789 init1: 237 opt: 619 Z-score: 768.1 bits: 152.0 E(32554): 1.8e-36 Smith-Waterman score: 766; 30.9% identity (58.9% similar) in 550 aa overlap (28-538:7-532) 10 20 30 40 50 60 pF1KE6 MWAVLRLALRPCARASPAGPRAYHGDSVASLGTQPDLGSALYQENYKQMKALVNQLHERV ::..:.. .. .. . ... . .....::. CCDS54 MAAALRVAAVGARLSVLASGLRAAVRSLCSQATSVNERI 10 20 30 70 80 90 100 110 pF1KE6 EHIK----LGGGEKARALHISRGKLLPRERIDNLIDPGSPFLELSQF-----AGYQLY-D :. . ::::.. . .:::: ::::. :.:::: :.: ..: : . . : CCDS54 ENKRRTALLGGGQRRIDAQHKRGKLTARERISLLLDPGS-FVESDMFVEHRCADFGMAAD 40 50 60 70 80 90 120 130 140 150 pF1KE6 NEEVPGGGIITGIGRVSG--------------------VECMIIANDATVKGGAYYPVTV ... :: ...:: ::..: . . :.: :: ::. . . CCDS54 KNKFPGDSVVTGRGRINGRLVYVFSQQIIGWAQWLPLVISALWEAEDFTVFGGSLSGAHA 100 110 120 130 140 150 160 170 180 190 200 pF1KE6 KKQLRAQEIAMQNRLPCIYLVDSGGAYLPRQADVFPDR-DHFGRTFYNQAIMSSKNIAQI .: . .. :. : : : ::::: . . .. . : : :. . .: : :: CCDS54 QKICKIMDQAITVGAPVIGLNDSGGARIQEGVESLAGYADIFLRN-----VTASGVIPQI 160 170 180 190 200 210 210 220 230 240 250 260 pF1KE6 AVVMGSCTAGGAYVPAMADENIIVRKQGTIFLAGPPLVKAATGEEVSAEDLGGADLHCRK ...:: :..:..: ::..: ...:. . .:..:: .::..:.:.:. :.:::: : CCDS54 SLIMGPCAGGAVYSPALTDFTFMVKDTSYLFITGPDVVKSVTNEDVTQEELGGAKTHTTM 220 230 240 250 260 270 270 280 290 300 310 320 pF1KE6 SGVSDHWALDDHHALHLTRKVVRNLNYQKKLDVTIEPSEEP---LFPADELYGIVGANLK :::. . .: :: : : ... . .. ..: : : :: :: . CCDS54 SGVAHRAFENDVDALCNLRDFFNYLPLSSQDPAPVRECHDPSDRLVP--ELDTIVPLEST 280 290 300 310 320 330 330 340 350 360 370 380 pF1KE6 RSFDVREVIARIVDGSRFTEFKAFYGDTLVTGFARIFGYPVGIVGN-----NGVLFSESA ..... ..: .:: .: :. :. ....::::. : :::::: .: : .:. CCDS54 KAYNMVDIIHSVVDEREFFEIMPNYAKNIIVGFARMNGRTVGIVGNQPKVASGCLDINSS 340 350 360 370 380 390 390 400 410 420 430 440 pF1KE6 KKGTHFVQLCCQRNIPLLFLQNITGFMVGREYEAEGIAKDGAKMVAAVACAQVPKITLII ::..::..: ::::. . .. ::. : : :: . :::.. 
: : : :::.:.: CCDS54 VKGARFVRFCDAFNIPLITFVDVPGFLPGTAQEYGGIIRHGAKLLYAFAEATVPKVTVIT 400 410 420 430 440 450 450 460 470 480 490 500 pF1KE6 GGSYGAGNYGMCGRAYSPRFLYIWPNARISVMGGEQAANVLATITKDQRAREGKQFSSAD .::.. : .. : ::.:.:.:::.. :... : : .. :. : CCDS54 RKAYGGAYDVMSSKHLCGDTNYAWPTAEIAVMGAKGAVEI---IFKGHENVEAAQ----- 460 470 480 490 500 510 520 530 540 550 560 pF1KE6 EAALKEPIIKKFEEEGNPYYSSARVWDDGIIDPADTRLVLGLSFSAALNAPIEKTDFGIF :.:: .::. ...: . : ::.:..:: CCDS54 -----AEYIEKF---ANPFPAAVRGFVDDIIQPSSTRARICCDLDVLASKKVQRPWRKHA 510 520 530 540 550 pF1KE6 RM CCDS54 NIPL 563 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 16:13:28 2016 done: Tue Nov 8 16:13:28 2016 Total Scan time: 1.990 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]
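
The per-hit summary lines in this report (the ">>" header, the initn/init1/opt/Z-score/bits/E() line, and the Smith-Waterman line) follow a regular pattern, so they can be pulled out with a regular expression. The following is only a minimal sketch, not part of FASTA itself: the names HIT_RE and parse_hits are hypothetical, and the pattern assumes the field order seen in this report. It tolerates both the normal multi-line layout and text whose line breaks have been collapsed into spaces.

```python
import re

# Hypothetical helper, not part of the FASTA package. The pattern assumes the
# field order seen in this report; other FASTA output formats may differ.
HIT_RE = re.compile(
    r"(?<!>)>>(?P<acc>[^\s>]+)\s+(?P<desc>[^>()]*?)\s+\((?P<length>\d+) aa\)\s+"
    r"initn:\s*(?P<initn>\d+)\s+init1:\s*(?P<init1>\d+)\s+opt:\s*(?P<opt>\d+)\s+"
    r"Z-score:\s*(?P<zscore>[\d.]+)\s+bits:\s*(?P<bits>[\d.]+)\s+"
    r"E\(\d+\):\s*(?P<evalue>[\d.eE+-]+)\s+"
    r"Smith-Waterman score:\s*(?P<sw>\d+);\s*(?P<identity>[\d.]+)% identity\s+"
    r"\((?P<similar>[\d.]+)% similar\) in (?P<overlap>\d+) aa overlap"
)

INT_FIELDS = ("length", "initn", "init1", "opt", "sw", "overlap")
FLOAT_FIELDS = ("zscore", "bits", "evalue", "identity", "similar")


def parse_hits(text):
    """Return one dict per hit found in the report text, with numeric fields converted."""
    hits = []
    for match in HIT_RE.finditer(text):
        hit = match.groupdict()
        for name in INT_FIELDS:
            hit[name] = int(hit[name])
        for name in FLOAT_FIELDS:
            hit[name] = float(hit[name])
        hits.append(hit)
    return hits


# Summary text for the second hit, copied from the report above.
sample = (
    ">>CCDS3089.1 PCCB gene_id:5096|Hs108|chr3 (539 aa) "
    "initn: 814 init1: 237 opt: 770 Z-score: 956.6 bits: 186.8 "
    "E(32554): 5.8e-47 Smith-Waterman score: 811; 31.9% identity "
    "(61.3% similar) in 530 aa overlap (28-538:7-512)"
)
hits = parse_hits(sample)
for hit in hits:
    print(hit["acc"], hit["opt"], hit["bits"], hit["evalue"], hit["identity"])
```

The lookbehind on ">>" keeps the pattern from matching the ">>>" query line in the report header, and the description field is barred from containing parentheses so that it cannot run past the "(NNN aa)" length.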