FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5631, 477 aa 1>>>pF1KE5631 477 - 477 aa - 477 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.7417+/-0.000773; mu= 15.5469+/- 0.047 mean_var=66.7777+/-13.542, 0's: 0 Z-trim(107.8): 19 B-trim: 0 in 0/50 Lambda= 0.156949 statistics sampled from 9786 (9796) to 9786 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.671), E-opt: 0.2 (0.301), width: 16 Scan time: 2.780 The best scores are: opt bits E(32554) CCDS10939.1 OSGIN1 gene_id:29948|Hs108|chr16 ( 477) 3232 740.6 8.2e-214 CCDS6248.1 OSGIN2 gene_id:734|Hs108|chr8 ( 505) 782 185.9 8.6e-47 CCDS47888.1 OSGIN2 gene_id:734|Hs108|chr8 ( 549) 782 185.9 9.2e-47 >>CCDS10939.1 OSGIN1 gene_id:29948|Hs108|chr16 (477 aa) initn: 3232 init1: 3232 opt: 3232 Z-score: 3952.1 bits: 740.6 E(32554): 8.2e-214 Smith-Waterman score: 3232; 100.0% identity (100.0% similar) in 477 aa overlap (1-477:1-477) 10 20 30 40 50 60 pF1KE5 MSSSRKDHLGASSSEPLPVIIVGNGPSGICLSYLLSGYTPYTKPDAIHPHPLLQRKLTEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MSSSRKDHLGASSSEPLPVIIVGNGPSGICLSYLLSGYTPYTKPDAIHPHPLLQRKLTEA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 PGVSILDQDLDYLSEGLEGRSQSPVALLFDALLRPDTDFGGNMKSVLTWKHRKEHAIPHV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 PGVSILDQDLDYLSEGLEGRSQSPVALLFDALLRPDTDFGGNMKSVLTWKHRKEHAIPHV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 VLGRNLPGGAWHSIEGSMVILSQGQWMGLPDLEVKDWMQKKRRGLRNSRATAGDIAHYYR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 VLGRNLPGGAWHSIEGSMVILSQGQWMGLPDLEVKDWMQKKRRGLRNSRATAGDIAHYYR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 DYVVKKGLGHNFVSGAVVTAVEWGTPDPSSCGAQDSSPLFQVSGFLTRNQAQQPFSLWAR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 DYVVKKGLGHNFVSGAVVTAVEWGTPDPSSCGAQDSSPLFQVSGFLTRNQAQQPFSLWAR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 NVVLATGTFDSPARLGIPGEALPFIHHELSALEAATRVGAVTPASDPVLIIGAGLSAADA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 NVVLATGTFDSPARLGIPGEALPFIHHELSALEAATRVGAVTPASDPVLIIGAGLSAADA 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE5 VLYARHYNIPVIHAFRRAVDDPGLVFNQLPKMLYPEYHKVHQMMREQSILSPSPYEGYRS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 VLYARHYNIPVIHAFRRAVDDPGLVFNQLPKMLYPEYHKVHQMMREQSILSPSPYEGYRS 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE5 LPRHQLLCFKEDCQAVFQDLEGVEKVFGVSLVLVLIGSHPDLSFLPGAGADFAVDPDQPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LPRHQLLCFKEDCQAVFQDLEGVEKVFGVSLVLVLIGSHPDLSFLPGAGADFAVDPDQPL 370 380 390 400 410 420 430 440 450 460 470 pF1KE5 SAKRNPIDVDPFTYQSTRQEGLYAMGPLAGDNFVRFVQGGALAVASSLLRKETRKPP ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 SAKRNPIDVDPFTYQSTRQEGLYAMGPLAGDNFVRFVQGGALAVASSLLRKETRKPP 430 440 450 460 470 >>CCDS6248.1 OSGIN2 gene_id:734|Hs108|chr8 (505 aa) initn: 1545 init1: 769 opt: 782 Z-score: 953.6 bits: 185.9 E(32554): 8.6e-47 Smith-Waterman score: 1572; 50.6% identity (74.4% similar) in 480 aa overlap (13-475:13-492) 10 20 30 40 50 60 pF1KE5 MSSSRKDHLGASSSEPLPVIIVGNGPSGICLSYLLSGYTPYTKPDAIHPHPLLQRKLTEA :: .::.:.:::::::::::.:::: :: . .::::. .:. :: :: CCDS62 MPLVEETSLLEDSSVTFPVVIIGNGPSGICLSYMLSGYRPYLSSEAIHPNTILNSKLEEA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 PGVSILDQDLDYLSEGLEGRSQSPVALLFDALLRPDTDFGGNMKSVLTWKHRKEHAIPHV .::.::::.::::::::::..:::.:::.::.::.::: .. ::: :: ...: :::: CCDS62 RHLSIVDQDLEYLSEGLEGRSSNPVAVLFDTLLHPDADFGYDYPSVLHWKLEQHHYIPHV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 VLGRNLPGGAWHSIEGSMVILSQGQWMGLPDLEVKDWMQKKRRGLRNSRATAGDIAHYYR :::.. ::::::..::::. .: :.:: :: :. :::...:::.:...:. .::.::. CCDS62 VLGKGPPGGAWHNMEGSMLTISFGSWMELPGLKFKDWVSSKRRSLKGDRVMPEEIARYYK 130 140 150 160 170 180 190 200 210 220 230 pF1KE5 DYVVKKGLGHNFVSGAVVTAVEWGTPDPSSCGAQD---SSPLFQV--SGFLTRNQ----- :: :: .:: .. .:.: : .. :: :. .:. :.:. :: CCDS62 HYVKVMGLQKNFRENTYITSVSRLYRDQDDDDIQDRDISTKHLQIEKSNFIKRNWEIRGY 190 200 210 220 230 240 240 250 260 270 280 pF1KE5 ------AQQPFSLWARNVVLATGTFDSPARLGIPGEALPFIHHELSALEAATRVGAVTPA .. :: :.:.::.:::::.::::.: : :: .::. : . . :: : . CCDS62 QRIADGSHVPFCLFAENVALATGTLDSPAHLEIEGEDFPFVFHSMPEFGAAINKGKLRGK 250 260 270 280 290 300 290 300 310 320 330 340 pF1KE5 SDPVLIIGAGLSAADAVLYARHYNIPVIHAFRRAVDDPGLVFNQLPKMLYPEYHKVHQMM :::::.:.::.:::::: : . ::::::.::: : ::.:.:.:::: ::::::::..:: CCDS62 VDPVLIVGSGLTAADAVLCAYNSNIPVIHVFRRRVTDPSLIFKQLPKKLYPEYHKVYHMM 310 320 330 340 350 360 350 360 370 380 390 400 pF1KE5 REQSILSPSPY-EGYRSLPRHQLLCFKEDCQAVFQDLEGVEKVFGVSLVLVLIGSHPDLS :: : : :.:.:..: :: : . :.:.. :..:.: .: ..:::::::.:: CCDS62 CTQSYSVDSNLLSDYTSFPEHRVLSFKSDMKCVLQSVSGLKKIFKLSAAVVLIGSHPNLS 370 380 390 400 410 420 410 420 430 440 450 460 pF1KE5 FLPGAGADFAVDPDQPLSAKRNPIDVDPFTYQSTRQEGLYAMGPLAGDNFVRFVQGGALA :: : .. .::.. : ::...: .::. .. .:.:.:::.:::::::..::::. CCDS62 FLKDQGCYLGHKSSQPITCKGNPVEIDTYTYECIKEANLFALGPLVGDNFVRFLKGGALG 430 440 450 460 470 480 470 pF1KE5 VASSLLRKETRKPP :. : .. .: CCDS62 VTRCLATRQKKKHLFVERGGGDGIA 490 500 >>CCDS47888.1 OSGIN2 gene_id:734|Hs108|chr8 (549 aa) initn: 1545 init1: 769 opt: 782 Z-score: 953.0 bits: 185.9 E(32554): 9.2e-47 Smith-Waterman score: 1572; 50.6% identity (74.4% similar) in 480 aa overlap (13-475:57-536) 10 20 30 40 pF1KE5 MSSSRKDHLGASSSEPLPVIIVGNGPSGICLSYLLSGYTPYT :: .::.:.:::::::::::.:::: :: CCDS47 FNSLVQYFGDNLGRKVKAMPLVEETSLLEDSSVTFPVVIIGNGPSGICLSYMLSGYRPYL 30 40 50 60 70 80 50 60 70 80 90 100 pF1KE5 KPDAIHPHPLLQRKLTEAPGVSILDQDLDYLSEGLEGRSQSPVALLFDALLRPDTDFGGN . .::::. .:. :: :: .::.::::.::::::::::..:::.:::.::.::.::: . CCDS47 SSEAIHPNTILNSKLEEARHLSIVDQDLEYLSEGLEGRSSNPVAVLFDTLLHPDADFGYD 90 100 110 120 130 140 110 120 130 140 150 160 pF1KE5 MKSVLTWKHRKEHAIPHVVLGRNLPGGAWHSIEGSMVILSQGQWMGLPDLEVKDWMQKKR . ::: :: ...: :::::::.. ::::::..::::. .: :.:: :: :. :::...:: CCDS47 YPSVLHWKLEQHHYIPHVVLGKGPPGGAWHNMEGSMLTISFGSWMELPGLKFKDWVSSKR 150 160 170 180 190 200 170 180 190 200 210 pF1KE5 RGLRNSRATAGDIAHYYRDYVVKKGLGHNFVSGAVVTAVEWGTPDPSSCGAQD---SSPL :.:...:. .::.::. :: :: .:: .. .:.: : .. :: :. CCDS47 RSLKGDRVMPEEIARYYKHYVKVMGLQKNFRENTYITSVSRLYRDQDDDDIQDRDISTKH 210 220 230 240 250 260 220 230 240 250 260 pF1KE5 FQV--SGFLTRNQ-----------AQQPFSLWARNVVLATGTFDSPARLGIPGEALPFIH .:. :.:. :: .. :: :.:.::.:::::.::::.: : :: .::. CCDS47 LQIEKSNFIKRNWEIRGYQRIADGSHVPFCLFAENVALATGTLDSPAHLEIEGEDFPFVF 270 280 290 300 310 320 270 280 290 300 310 320 pF1KE5 HELSALEAATRVGAVTPASDPVLIIGAGLSAADAVLYARHYNIPVIHAFRRAVDDPGLVF : . . :: : . :::::.:.::.:::::: : . ::::::.::: : ::.:.: CCDS47 HSMPEFGAAINKGKLRGKVDPVLIVGSGLTAADAVLCAYNSNIPVIHVFRRRVTDPSLIF 330 340 350 360 370 380 330 340 350 360 370 380 pF1KE5 NQLPKMLYPEYHKVHQMMREQSILSPSPY-EGYRSLPRHQLLCFKEDCQAVFQDLEGVEK .:::: ::::::::..:: :: : : :.:.:..: :: : . :.:.. :..: CCDS47 KQLPKKLYPEYHKVYHMMCTQSYSVDSNLLSDYTSFPEHRVLSFKSDMKCVLQSVSGLKK 390 400 410 420 430 440 390 400 410 420 430 440 pF1KE5 VFGVSLVLVLIGSHPDLSFLPGAGADFAVDPDQPLSAKRNPIDVDPFTYQSTRQEGLYAM .: .: ..:::::::.:::: : .. .::.. : ::...: .::. .. .:.:. CCDS47 IFKLSAAVVLIGSHPNLSFLKDQGCYLGHKSSQPITCKGNPVEIDTYTYECIKEANLFAL 450 460 470 480 490 500 450 460 470 pF1KE5 GPLAGDNFVRFVQGGALAVASSLLRKETRKPP :::.:::::::..::::.:. : .. .: CCDS47 GPLVGDNFVRFLKGGALGVTRCLATRQKKKHLFVERGGGDGIA 510 520 530 540 477 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 05:21:23 2016 done: Tue Nov 8 05:21:24 2016 Total Scan time: 2.780 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]