GENSCAN 1.0 Date run: 2-Nov-116 Time: 20:57:34 Sequence gi568815596r:96502752_96737135 : 234384 bp : 48.19% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 6317 6457 141 2 0 84 67 124 0.629 9.37 1.02 Intr + 21231 21476 246 0 0 87 90 62 0.412 2.87 1.03 Intr + 23286 23334 49 0 1 115 82 0 0.279 0.78 1.04 Intr + 27242 27313 72 1 0 86 105 11 0.755 2.20 1.05 Intr + 29039 29218 180 1 0 113 82 10 0.755 2.96 1.06 Intr + 35443 35570 128 0 2 90 111 65 0.969 8.48 1.07 Intr + 44253 44344 92 0 2 79 3 62 0.209 -3.56 1.08 Intr + 44651 44766 116 0 2 119 105 150 0.981 19.97 1.09 Intr + 46570 46683 114 0 0 112 11 220 0.733 17.34 1.10 Intr + 46968 47054 87 2 0 62 95 136 0.700 11.87 1.11 Intr + 47275 47534 260 0 2 -9 80 260 0.406 11.76 1.12 Intr + 47823 47982 160 0 1 59 96 231 0.958 20.99 1.13 Term + 48348 49562 1215 2 0 77 47 789 0.657 65.35 1.14 PlyA + 49858 49863 6 1.05 2.00 Prom + 62834 62873 40 -3.66 2.01 Init + 73705 73760 56 0 2 93 78 50 0.663 5.26 2.02 Intr + 92074 92179 106 1 1 87 89 28 0.843 3.02 2.03 Term + 92640 92813 174 2 0 4 54 140 0.659 -0.04 2.04 PlyA + 94384 94389 6 1.05 3.19 PlyA - 94846 94841 6 1.05 3.18 Term - 97752 97723 30 0 0 128 44 22 0.899 -0.35 3.17 Intr - 99025 98892 134 2 2 104 109 47 0.976 8.76 3.16 Intr - 99587 99365 223 2 1 133 109 168 0.993 21.10 3.15 Intr - 100111 100002 110 2 2 101 55 18 0.904 -0.20 3.14 Intr - 101629 101499 131 0 2 97 70 25 0.789 1.94 3.13 Intr - 102112 102028 85 2 1 38 89 58 0.395 -0.22 3.12 Intr - 102745 102569 177 2 0 40 113 64 0.661 3.99 3.11 Intr - 105913 105757 157 1 1 93 75 127 0.995 11.48 3.10 Intr - 106313 106113 201 2 0 65 57 179 0.983 11.98 3.09 Intr - 106811 106748 64 1 1 70 97 14 0.977 -0.78 3.08 Intr - 108132 107975 158 0 2 35 94 173 0.951 11.41 3.07 Intr - 108387 108313 75 0 0 97 78 4 0.499 0.01 3.06 Intr - 110183 110067 117 2 0 63 95 55 0.950 4.36 3.05 Intr - 110868 110737 132 0 0 103 100 33 0.982 6.84 3.04 Intr - 116793 116608 186 0 0 88 116 230 0.999 25.69 3.03 Intr - 117011 116921 91 1 1 63 78 91 0.336 5.60 3.02 Intr - 128731 128561 171 0 0 67 96 240 0.934 21.76 3.01 Init - 134384 134170 215 2 2 86 84 260 0.992 23.62 3.00 Prom - 137714 137675 40 -9.95 4.00 Prom + 139175 139214 40 -8.66 4.01 Init + 140086 140170 85 0 1 73 78 106 0.984 9.10 4.02 Intr + 143650 143702 53 2 2 92 59 60 0.970 2.13 4.03 Intr + 144313 144404 92 0 2 105 100 66 0.988 8.29 4.04 Intr + 145027 145135 109 1 1 126 76 91 0.988 11.99 4.05 Intr + 146872 146926 55 0 1 105 91 82 0.988 8.75 4.06 Intr + 147429 147538 110 1 2 123 94 11 0.991 5.20 4.07 Intr + 149141 149269 129 1 0 101 64 216 0.998 21.39 4.08 Intr + 150889 150951 63 0 0 79 94 34 0.485 2.01 4.09 Intr + 157590 157620 31 2 1 129 96 3 0.703 2.90 4.10 Intr + 158574 158689 116 1 2 113 93 72 0.980 10.37 4.11 Intr + 158917 159040 124 0 1 96 -24 182 0.817 7.96 4.12 Intr + 159464 159516 53 0 2 60 115 51 0.828 3.63 4.13 Intr + 160688 160756 69 1 0 52 80 79 0.753 2.88 4.14 Intr + 166000 166043 44 0 2 121 93 14 0.984 2.24 4.15 Intr + 166135 166253 119 1 2 80 -54 175 0.991 2.61 4.16 Intr + 166292 166386 95 0 2 132 61 74 0.998 8.78 4.17 Intr + 167368 167496 129 0 0 82 78 72 0.956 6.49 4.18 Intr + 170326 170503 178 0 1 28 46 191 0.929 8.39 4.19 Intr + 181576 181700 125 2 2 103 105 250 0.991 28.50 4.20 Intr + 182578 182678 101 0 2 95 21 135 0.999 6.41 4.21 Intr + 183189 183366 178 0 1 87 90 240 0.997 24.02 4.22 Intr + 183444 183599 156 2 0 51 75 73 0.400 2.51 4.23 Intr + 185065 185196 132 0 0 15 101 185 0.867 13.34 4.24 Intr + 185517 185645 129 2 0 87 57 31 0.592 0.79 4.25 Intr + 186462 186625 164 2 2 47 116 111 0.986 8.57 4.26 Intr + 186893 187007 115 2 1 49 77 156 0.987 11.05 4.27 Intr + 187736 187832 97 1 1 120 70 105 0.413 11.48 4.28 Intr + 188439 188602 164 1 2 62 81 176 0.997 13.99 4.29 Intr + 188694 188861 168 2 0 112 31 260 0.983 22.84 4.30 Intr + 189074 189212 139 1 1 14 66 91 0.491 -0.46 4.31 Intr + 189353 189430 78 0 0 90 100 162 0.995 17.12 4.32 Intr + 190755 190936 182 1 2 99 106 202 0.998 22.69 4.33 Intr + 191160 191321 162 2 0 99 57 70 0.947 5.17 4.34 Intr + 191609 191713 105 1 0 97 38 63 0.534 2.61 4.35 Intr + 192758 192910 153 1 0 93 83 166 0.999 16.87 4.36 Intr + 192991 193153 163 0 1 65 42 250 0.998 17.65 4.37 Intr + 193301 193326 26 0 2 101 105 40 0.897 4.74 4.38 Intr + 194775 194825 51 2 0 97 49 52 0.696 1.30 4.39 Intr + 194909 195010 102 1 0 69 42 171 0.987 10.97 4.40 Intr + 195286 195405 120 0 0 50 93 212 0.939 18.59 4.41 Intr + 195920 196081 162 1 0 80 63 209 0.808 17.77 4.42 Intr + 196294 196385 92 0 2 60 58 123 0.999 5.29 4.43 Intr + 196799 196969 171 2 0 65 44 199 0.985 12.26 4.44 Intr + 197181 197329 149 0 2 114 85 82 0.998 10.38 4.45 Intr + 197581 197720 140 2 2 66 80 172 0.994 14.38 4.46 Intr + 199204 199292 89 0 2 41 113 88 0.816 5.37 4.47 Intr + 199528 199650 123 1 0 -6 49 217 0.650 8.10 4.48 Intr + 199849 199990 142 1 1 21 87 252 0.999 18.76 4.49 Intr + 200227 200326 100 0 1 107 66 140 0.992 13.38 4.50 Intr + 200402 200595 194 0 2 59 66 171 0.987 11.21 4.51 Intr + 200772 200881 110 2 2 95 67 95 0.999 7.18 4.52 Intr + 201464 201611 148 2 1 91 117 86 0.984 12.04 4.53 Term + 201717 201941 225 2 0 97 32 43 0.669 -3.62 4.54 PlyA + 202113 202118 6 1.05 5.10 PlyA - 204121 204116 6 -1.75 5.09 Term - 204647 204505 143 0 2 123 52 165 0.977 14.49 5.08 Intr - 205082 204963 120 0 0 80 44 72 0.843 2.47 5.07 Intr - 209019 208905 115 0 1 138 84 166 0.989 21.22 5.06 Intr - 209274 209113 162 0 0 102 93 268 0.457 28.87 5.05 Intr - 209812 209758 55 0 1 64 68 30 0.470 -2.42 5.04 Intr - 215969 215862 108 1 0 87 75 87 0.739 6.70 5.03 Intr - 219637 219579 59 1 2 112 89 -24 0.210 -2.22 5.02 Intr - 230850 230768 83 1 2 125 80 118 0.883 14.06 5.01 Intr - 231775 231658 118 1 1 81 72 101 0.968 7.84 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 91349 91407 59 1 2 67 60 38 0.850 -0.30 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:96502752_96737135|GENSCAN_predicted_peptide_1|953_aa XGCSRPSFPQPSAGPKLPHMRAMEYCANDNKLLLPATTRINLTAVRLIGGTTNSKRTYGL QTKGPWFIACSSHGCSGFYEKAANHKKTISLGGLNWDRNCRWMPDVQRHPNPSPKSPQRS SVDSPDPLTRLPSNVTTEWGVTLGPLESRAGQGAPDVGKKKLQGLLVGSLADPQPFTPTV DLKLSRYQLNILGGRGDTEEKKNKTAAPRRDNDSPKSRQSRTSPSTLNNEVKWEHTKPSE ELSLSVTTKCGPDSVGRDSPILELGLLGHPRARCGEIVGAEERKGTGVERMQRVVGERGG GPVSAPVKGNRKQSTEGDALDPPASPKPAGKQNGIQNPISLEDSPEAGGEREEEQEREEE QAFLVSLYKFMKERHTPIERRLTASLLFSPPVNLWKIYKAVEKLGAYELQSMALGERIGW PLLEPQSSCQTAVLRVPARAAGAARTPPPGGARRPRPHEVPLQVTGRRLWKNVYDELGGS PGSTSAATCTRRHYERLVLPYVRHLKGEDDKPLPTSKPRKQYKMAKENRGDDGATERPKK AKEERRMDQMMPGKTKADAADPAPLPSQEPPRNSTEQQGLASGSSVSFVGASGCPEAYKR LLSSFYCKGTHGIMSPLAKKKLLAQVSKVEALQCQEEGCRHGAEPQASPAVHLPESPQSP KGLTENSRHRLTPQEGLQAPGGSLREEAQAGPCPAAPIFKGCFYTHPTEVLKPVSQHPRD FFSRLKDGVLLGPPGKEGLSVKEPQLVWGGDANRPSAFHKGGSRKGILYPKPKACWVSPM AKVPAESPTLPPTFPSSPGLGSKRSLEEEGAAHSGKRLRAVSPFLKEADAKKCGAKPAGS GLVSCLLGPALGPVPPEAYRGTMLHCPLNFTGTPGPLKGQAALPFSPLVIPAFPAHFLAT AGPSPMAAGLMHFPPTSFDSALRHRLCPASSAWHAPPVTTYAAPHFFHLNTKL >gi568815596r:96502752_96737135|GENSCAN_predicted_CDS_1|2862_bp nnaggctgctcccggcccagcttcccacaaccatcagcaggacctaaactgccacatatg cgcgccatggaatactgtgccaatgacaacaaattactgctacctgcaacgacgaggata aacctcacagccgtgaggctgataggaggcacaacaaactcaaagagaacgtatgggctc cagaccaaaggtccttggttcattgcatgcagctcccacggctgcagcggcttttatgag aaagcagcaaatcataagaaaactatctccttgggaggacttaattgggacagaaactgc agatggatgccagatgtgcaacggcatccaaacccgagtcccaaatccccacaaagatcc tcagtcgactctcccgatcctttgaccagactccccagcaatgtgacaactgagtggggg gtgacactaggtcctctggagtccagggcaggccagggtgccccagatgtggggaagaaa aagcttcaggggcttcttgtaggctctctggccgatccccagccctttactccaacggta gacctgaagctgtcgagatatcaactcaacatcttaggagggaggggagacactgaagaa aaaaaaaacaaaacagctgcccccaggagggacaacgattctcccaagagcaggcagtcc cgaacttctcccagcaccctgaacaatgaggtgaaatgggaacacacgaaaccatctgag gagctctcactgtccgtcaccaccaaatgtggcccagactccgtggggagggacagtccc atcctggaactggggcttttaggacatcccagagcccgatgtggggagatagttggagca gaggaaagaaaagggacaggtgtggagcggatgcagcgggttgtcggggagcggggagga gggcctgtgtcagcccctgtcaaagggaacaggaagcagtccacggagggtgacgcccta gacccacctgcatcccccaaacctgctggcaagcagaacggaatccagaaccccatctcg ctggaggactcccccgaggcaggcggggagcgggaggaggagcaggagcgggaggaggag caggccttcctggtcagcctctacaagttcatgaaggagcgacacacgcccatcgagagg agactgacggccagcctgctcttctctcccccagttaacctgtggaagatctacaaagca gtggagaagctgggggcctatgagctgcagtccatggccctaggagagagaatcggctgg ccgctgctggagccgcagagcagctgccaaactgcagtccttcgagtccctgcgagggcg gccggagctgcaaggaccccgccgccagggggcgcccgccggccgcgccctcacgaggtg cccttgcaggtgaccgggcgccgcctctggaagaacgtgtacgacgagctggggggcagc ccaggcagcaccagcgcggccacgtgcacgcgccgccactacgagaggctggtcctgcca tacgtgcggcacctgaagggggaggatgacaagccgctgcccacctccaagcccaggaaa cagtacaagatggctaaggagaacaggggggatgatggggccaccgagaggccgaagaag gccaaggaggagcggcgcatggaccagatgatgccaggaaagaccaaagcagatgctgct gacccagcaccacttcccagccaggagccccccaggaacagcacagaacagcagggcctg gcctctgggtcttctgtgtcctttgtgggtgccagcggctgtcctgaggcctacaagcgg ctcctatccagcttctactgcaaggggacacacggcatcatgtcaccactggccaaaaag aagctcctggcccaggtgagcaaggtggaggccttgcagtgccaggaggagggctgccgc catggggcagagccccaggcgtccccagctgttcacctcccagagagtccccagagcccc aaagggctgactgagaactccaggcaccggctgacccctcaggagggattgcaggcccca ggtggcagcctcagagaggaggcgcaggcaggcccctgcccggcagcccccatcttcaag ggctgcttctacacccaccccaccgaggtgctgaagcctgtcagccagcaccccagggac ttcttctctagacttaaagatggggtgctattggggcctcctggcaaagaggggctgtca gtgaaagagccccagctggtgtggggcggagacgctaaccgcccttctgcgttccataaa ggtggctccagaaagggcatcctctaccccaagcccaaagcctgctgggtgtcccccatg gccaaggtcccagccgagagccccacgctcccgcccaccttccccagtagcccaggcctg ggcagcaagcgcagcctggaggaagagggtgctgcccacagtgggaagagactgcgggcc gtgtctccctttcttaaggaggcggatgccaagaagtgtggggccaaacctgcagggtcc ggcctggtctcctgccttctgggcccagccctggggcctgtgcccccagaggcctacagg ggcaccatgctgcactgcccgctgaacttcactggcaccccgggccccttgaagggccag gctgcactccccttcagccccctggtcatcccggccttcccggcccacttcctggccacc gcaggcccctcgcccatggccgctggcctgatgcacttccccccaacgtccttcgacagt gccctccgccacagactttgcccggcctcatctgcctggcacgcaccaccagtcacaacc tatgcagcgccccacttcttccacctcaacaccaagctgtag >gi568815596r:96502752_96737135|GENSCAN_predicted_peptide_2|111_aa MGLWFMKLIGLTRDPIVKSLSEPADSLDTETAQSMVKKTKGPGFTWQPLLPTSREPDLQI LDDGASLAGTVSPNLMVSLKTVGGEAKMTVLNRSSDTWTAQQDHTPVQGPP >gi568815596r:96502752_96737135|GENSCAN_predicted_CDS_2|336_bp atgggcttgtggttcatgaaactcattggtcttaccagagatcccatcgtgaagagctta tctgaacctgcggactctctggatacagaaactgcccaaagcatggtgaaaaagacaaag ggcccaggtttcacctggcaaccactgctcccaacatctagagaaccagatttgcagatc ctggacgatggtgcttcccttgctggcaccgtatcacctaacctaatggtttctctgaag acagttggcggagaggcaaaaatgactgtcttaaacaggtcttccgacacgtggacagct cagcaggaccacacacctgtccagggcccaccatga >gi568815596r:96502752_96737135|GENSCAN_predicted_peptide_3|818_aa MAHRGGERDFQTSARRMGTSLLFQLSVHERELDLVFLDHSYAKPWSAHPDASSARPTRML FVTPRRQHESTIESDVPIDVETVTSTPMPLYDNQKARSVMNECERHVIFARTDADAPPPP EDWEEHVNRTGWTMAQNKLFNKILKALQSDRLARLANEGACNEPVLRRVAVDKCARRVRQ ALASVSWDTKLIQWLHTTLVETLSLPMLAAYLDALQTLKGKIPTLIDRMLVSSNTKTGAA GAEALSLLLKRPWDPAVGVLSHNKPSKLPGSPLILIASSGPSSSVFPTSRRHRFWQSQLS CLGKVSVMEYVTAVVCLGFPLLTVDGPRGDVDDPLLDMKTPVLFVIGQNSLQCHPEAMED FREKIRAENSLVVVGGADDNLRISKAKKKSEGLTQSMVDRCIQDEIVDFLTGVLTRAEGH MGSEPRDQDAEKKKKPRDVARRDLAFEVPERGSRPASPAAKLPASPSGSEDLSSVSSSPT SSPKTKVTTVTSAQKSSQIGSSQLLKRHVQRTEAVLTHKQAQEPPEEGEKEDLRVQLKRH HPSSPLPGSKTSKRPKIKVSLISQGDTAGGPCAPSQGSAPEAAGGKPITMTLGQASAGAK ELTGLLTTAKSSSSEGGVSASPVPSVVSSSTAPSALHTLQSRLVATSPGSSLPGATSASS LLQGLSFSLQDISSKTSGLPANPSPGPAPQATSVKLPTPMQSLGAITTGTSTIVRTIPVA TTLSSLGATPGGKPTAIHQLLTNGGLAKLASSLPGLAQISNQASGLKVPTTITLTLRGQP SRITTLSPMGSGAAPSEESSSQVLPSSSQGKYCTEVNT >gi568815596r:96502752_96737135|GENSCAN_predicted_CDS_3|2457_bp atggcccaccggggtggggagagggacttccagacttcagctcgacgcatgggcacctcg ctgctcttccagctttcagtgcatgaacgggagctggacctggtttttctggatcatagc tatgccaagccttggagtgcccacccagatgccagtagtgcccgccccacccgcatgctc tttgtcactccccggcggcagcacgaaagtaccattgaatcagacgtcccaatagatgtg gagacggtcacatcaacgcctatgccactctatgacaatcagaaggcacgcagcgtgatg aatgagtgtgaacggcatgtcatctttgccaggactgatgcagatgcccctcctccacca gaggactgggaggagcatgtcaacaggactggctggacaatggcccagaacaagctattc aacaagatcctcaaagccctgcagtctgaccggcttgcccgcttggccaacgaaggggct tgtaatgagccagtgctgcgccgtgttgctgtggacaagtgtgcaaggagagtgcggcag gctctggcaagtgtgagctgggataccaagctgatccagtggctgcacaccacccttgtg gagaccttgagtctgcccatgctggcagcctacctggatgctttgcagacgctgaagggg aagatcccaaccttgattgaccggatgcttgtgtcatccaacacaaagactggggctgca ggagctgaggccttgtctctcctactgaagaggccctgggaccctgctgtgggtgtgctt tctcataacaaaccaagcaaactccctggctctccgctgattctcatcgcctcctctggt ccctccagctctgtgtttcccacttcacgccgccaccgcttctggcaatctcagctgtcc tgcttgggcaaggtgtcagtaatggagtatgtcactgcagttgtctgccttgggtttcct ctgcttactgtggatggccccagaggggatgtagatgatcccctcttggatatgaagact ccagtcctctttgtcattggtcagaattcccttcaatgtcaccctgaagccatggaggac ttccgggagaagattcgagctgagaacagcttggtggtggttgggggagctgatgacaat ctcagaataagcaaagcaaagaagaaatcagaagggttgactcagagcatggtggacaga tgtattcaggatgagattgtggactttctgactggagtgctcactcgtgctgagggtcac atgggctctgaacctcgggatcaggatgctgagaagaagaagaagccccgcgatgtggcc cgcagagacttggcctttgaagtccctgagcggggcagtcgacctgcctccccagctgcc aagctgcccgcctcaccctcaggctcagaggatctctccagtgtgtccagcagccccacc tccagtcccaagaccaaagtgaccacagtgacctctgcccagaagtccagtcagattgga agttctcagctgctgaagagacatgtgcagcggacagaagctgtgctgacccacaaacaa gctcaagaaccaccagaggaaggagagaaagaggatcttagggttcagctgaagcgacac catccctcgagtccccttcctggcagtaagacctccaaacgaccgaagatcaaggtgtcc cttatctcccaaggggacacagctggagggccttgtgctccttcccaaggaagtgctcca gaagctgcaggtgggaagcccatcaccatgacactggggcaggcttcagcaggggccaag gagctcacaggacttctcaccacagccaagtccagttcttctgaaggtggagtctcagcc agcccagtcccttcagtggtctccagcagcactgcacccagtgccttgcacacactgcag agccgcctggtggccacatctcctggcagctccctcccaggggccacatcagccagcagc ctcctccaaggcctcagcttcagcttgcaggatatcagcagcaagacctctggccttcca gcaaatccctccccaggaccagccccacaggccaccagtgtgaagttgcccacccccatg cagagcctgggtgccatcaccacgggcaccagcaccattgtccgtaccattcctgtggcc accactctctcctccttgggtgccactcctggtgggaagcccacagccatccaccagctg ctgaccaatgggggcctcgctaagttggcaagcagcctccctggcctggctcagatctct aaccaagcatcaggcttgaaggtccccaccaccattactctgacacttcgtggccagccg agcaggatcactacactgagccctatgggctcaggagcagccccatccgaggagtcctct tcccaggtgctgccctccagctcacagggaaaatactgcacagaggtcaacacctga >gi568815596r:96502752_96737135|GENSCAN_predicted_peptide_4|2102_aa MLRLVVQSAKIDPPLAPLPRPCMSIDFRDIKKRTRVVEGNDPVWNETLIWHLWNRPLEND SFLQVTLQDMGSQKKERFIGLATVLLKPLLKQPSEVLFVKDLTLLNHSMKPTDCTVTLQV AHMSNQDIEKTGAEDHLGITAREAASQKLMVPGSTAHRALSSKPQHFQVRVKVFEARQLM GNNIKPVVKVSIAGQQHQTRIKMGNNPFFNEIFFQNFHEVPAKFFDETILIQTDIGFIYH SPGHTLLRKWLGLCQPNNPGSGVTGYLKVTIYALGVGDQALIDQKLLYGTDDTDIQIFKS AVVPINMAYLQLFIYCAEDLHLKKHQSVNPQLEVELIGEKLRTHMQTQTDNPIWNQILTF RIQLPCLSSYIKFRVLDCRKKDCPDEIGTASLSLNQISSTGEEIEGKQSLEPTSYTPRVY SGFLPCFGPSFLTLHGGKKAPFRIQEEGACIPDSVRDGLAYRGRVFLELITQIKSYQDST IKDLSHEVTRIEKHQNRQKYGLCVIFLSCTMMPNFKELIHFEVSIGHYGNKMDLNYKPLV SSTPYSPVIYDGNIYHYVPWYNTKPVVAVTSNWEDVSFRMNCLNLLHFTRDRLKANLDTL KSTRNPKDPALLYQWEKLLRELAEDCKRPLPCMTYQPKATSLDRKRWQLRSLLLQELAQK AKQAKPKDMVATAEDWLYRLNTVLPEPQMGLPDVMIWLVAKEQRVAYAQVPAHSVLFSPA GALHSGRLCGKIQTLFLQYPEGEGQKDVLPAHLRVCMWLGNVTDSKDLQLLRQGDTAVYA EMVYSELSPSSSTAREVSKALTCRGLTHHQQGSPGPLQAPVGSQEYENQAKYKDQWGQQG LYHCPNFSDVMGNKTLPMTDFQPPLGWHWQDSWTVEPQRRLLLDIDINKSQVLEEVYENQ GRDTRGAWGPAAIPNTDVNGQPMEARENVKCPQGWHFKKDWVVELNHAVDSWEYGVGIPP SGLPQVWSPVEKTYHSCRRRRWARVRFRNHGELSHEQETLSFLQLGLAKGEEEGWEYDTF GSKFHLNPQPQSRFRRRCWRRRLAPNKDKGIAPIFLLEGSLAMDLKYHAGKEEDSKTWPW GLDRQFRDPQRQDTRPPNLPFIYCTFNKPHYYQLFCYIYQARNLVSNQILTFQGPFIRVV FLNHSQCTQTLRSSAGPTWAQTLIFQHLLLYENPQDTKESPPLVVLELWQRDFWGKESLW GRSVWPPMVWLDLQDRILPPMRWHPLVKELGKEEGEILASCELILQTEKLGEKQLPILSV PWKNGAYTLPKSIQPTIKRMAIEILAWGLRNMKKASSPQLLVEFGEESLRTEPIRDFQTN PNFPESESVLVLTVLMPTEEAYALPLVVKVVDNWAFGQQTVTGQANIDFLQPYFCDPWAQ DYMHPKLPTLSEKKHQDFLGYLYRKFWFKSSKAEDEYEHEVDWWSKLFWATDEHKSLKYK YKDYHTLKVYECELEAVPAFQGLQDFCQTFKLYQEQPKLDSPVVGEFKGLFRIYPFPENP EAPKPPLQFLVWPEREDFPQPCLVRVYMVRAINLQPQDYNGLCDPYVILKLGKTELGNRD MYQPNTLDPIFGMMFELTCNIPLEKDLEIQLYDFDLFSPDDKIGTTVIDLENRLLSGFGA HCGLSKSYCQSGPFRWRDQMPPSYLLERYAKRKGLPPPLFSPEEDAVFYNGKKFKLQSFE PKTPTVHGLGPKKERLALYLLHTQGLVPEHVETRTLYSHSQPGIDQGKVQMWVDIFPKKL GPPGPQVNINPRKPKRKASEHSGHRYELRCIIWKTANVDLVDDNLSREKTSDIYIKGWLY GLEKDMQKTDIHYHSLTGEADFNWRFIFTMDYLAAERTCVQSQKDYIWSLDATSMKFPAR LIIQVWDNDIFSPDDFLGVLELDLSDMPLPARHAKQCSIRMMDADPKWPYFIQYKHFSLF KKKTVTGWWPCQVLDGGKWRLSGKVKMSLEILSEKEALIKPAGRGQSEPNQYPTLHPPLR TNTSFTWLRSPVQNFCYIFWKRYRFKLIAFMVISIIALMLFNFIYSAPHYLAMSWIKPQL QLYPPIKIFNIINSLNTSNASSSILPTQDPNLKPTIDHEWKLHPGPTNHLSDIFPELPAP GD >gi568815596r:96502752_96737135|GENSCAN_predicted_CDS_4|6309_bp atgctgcggcttgtggtgcagtcggccaagattgacccaccactagccccactacccagg ccctgcatgtccatcgacttcagagatatcaagaaaagaactcgtgtggtggaagggaat gatcccgtgtggaatgagaccctaatctggcacctctggaaccgccccctggaaaatgac tccttcctgcaagtcacccttcaggacatgggctcacaaaagaaagaaagattcattggc ctggccacagtactgctcaagccattgttgaaacaaccaagtgaggtcctttttgtgaag gacttgaccctgctcaaccattccatgaagcccacagattgtactgtcaccctacaggtg gcccacatgagcaaccaggatattgagaagacaggagctgaagaccacctgggcataacg gcaagagaggcagccagtcagaaactgatggtccctggctccactgcgcacagggctctg tcctcaaagcctcagcactttcaggttcgagtgaaggtgtttgaagcccgacagctcatg ggcaacaacatcaaaccagtggtgaaggtgtccatcgcaggccagcagcaccagacacgc atcaagatgggaaacaaccctttctttaatgagatcttcttccagaattttcatgaggtt cctgcaaagttctttgatgagaccatcttaatccagacagatattgggtttatctaccat tctccaggtcacacactcctaaggaaatggctaggcctctgccagccaaataaccctggc agtggtgtgacaggctacctgaaagtcaccatctatgccctcggtgtgggagaccaggcc ctgatagatcaaaagctgctctatggcaccgatgacaccgatattcagatcttcaagtca gcggtagtcccgatcaacatggcttacttacagctcttcatctactgcgcagaggacctt cacctcaagaaacaccagtcagtgaatcctcagttggaggtggaactaattggggaaaag ctcaggacacacatgcagacccaaaccgacaacccgatatggaaccagatcctgaccttc cggattcagctaccctgcctctccagctacatcaagttcagagtcttggactgccgcaag aaggactgcccggatgagattgggactgccagcctgtccctcaaccagatctcgtccacc ggagaagagatagaaggcaagcaaagcctcgagcccacttcctacacccctcgagtgtac tccggcttcctgccctgctttggccccagcttcctgactctgcatgggggtaaaaaggcc cctttcaggatccaggaagaaggcgcttgtattcccgactctgttagggatggtttagct tatcgaggccgagtcttcctggagttaatcacccaaatcaagtcctatcaagactccacg ataaaggatctctcccatgaagtgaccaggatagagaagcaccagaaccgccaaaagtat gggctgtgcgtcatcttcctttcctgtaccatgatgcccaactttaaagagctgatccat ttcgaggtcagcatcggtcactatgggaacaagatggacctgaattacaagcctctagtc tcaagcacaccgtacagcccagtgatatatgatgggaacatctaccattatgtgccctgg tacaacaccaagcctgtcgtggccgtgacctccaactgggaggacgtcagcttccgcatg aactgcctcaacctcctccacttcactcgggaccgcctgaaagccaacctggacaccctg aaatccacgcggaatccgaaggatccagctctcctctaccagtgggagaaactgctgagg gagctggcagaggactgcaagcgccctctgccctgcatgacctatcagcccaaagccacc agcctggacaggaagaggtggcagctccgcagcctcctcctgcaggaactggcccaaaag gccaagcaagccaagcccaaggacatggtggccacagcggaggactggctgtaccgcctc aacaccgtgctccctgagccccagatgggcctccctgacgtgatgatttggctggtggcc aaggagcagcgagtggcctatgcacaggtgcctgcccactccgtcctcttctccccggca ggggctctgcactccggcaggctctgtgggaagatacagacactcttcctacagtaccca gagggtgaaggacagaaggatgtgctcccagctcacctccgggtctgcatgtggcttggc aatgtcacagacagcaaggacctgcagctgctccgccagggtgacacagcggtgtacgcc gagatggtttattccgagctttctcctagttcatcaacagcacgtgaagtgagcaaggct ctgacttgcaggggcctcacccatcaccaacagggcagcccgggacccctgcaggcccca gtggggtcccaggagtatgagaatcaggccaagtataaagaccagtgggggcagcagggg ctgtatcactgccccaacttctcggatgtcatggggaacaagaccctccccatgacggat ttccaaccacccctgggatggcactggcaggacagctggacagtggaacctcagagaagg ctcctcctggacatagacatcaacaagagccaggtgctggaggaggtatatgagaaccag ggccgtgacaccagaggggcctgggggcctgccgccatcccaaacacagacgtgaatgga cagcccatggaggcccgggagaacgtgaagtgcccccaaggctggcactttaagaaggac tgggtggtggagctgaaccacgcagtggacagctgggagtatggagtggggatcccaccg tcgggcctgccccaggtctggagcccggtggagaagacctaccactcgtgccgccgccgg cgctgggcgcgtgtgcgcttcaggaaccatggggagctgagccacgagcaggagaccctc tccttcctgcagctgggcctggccaagggcgaggaggagggctgggagtatgacaccttc ggctccaagttccacctcaaccctcagccccagagccggttccgccgccgctgctggcgc cgcaggctggcccccaacaaggacaagggcatcgcgcccatattcctcctggaggggtcc ttggctatggatctgaaataccacgctgggaaggaagaggacagcaagacatggccatgg ggtctggacagacagttcagggacccccagaggcaggacacccggccccccaacttgccc ttcatctactgcaccttcaataagccccactactaccagctcttctgctacatctaccag gcccggaacctggtgtccaatcagatcctgacattccaagggcccttcattcgggtggtc ttcctgaaccacagccagtgcacccaaaccctgaggagctctgcaggccccacatgggcc cagacactcatcttccagcacctccttctgtacgagaacccacaggacaccaaagagagc ccaccgcttgtggtgctggagctgtggcagcgtgacttctggggcaaggagagcttgtgg ggacggagcgtgtggcccccaatggtctggctggatctccaggaccggatcctgcccccc atgaggtggcatccccttgtaaaggagttggggaaggaagagggcgagatcttggcatcc tgtgagctgatcctccagactgagaagcttggagagaagcagctgcctatcttaagcgtt ccctggaagaatggggcatacacactccccaagagcatccagcccacgataaagaggatg gccattgagatcctggcctggggccttcggaacatgaagaaggcgagctccccccagctc ctggtggaattcggggaagagtccctgaggacagaacccatcagggactttcagaccaac cccaacttccccgagtctgagtctgtcctagtcctcacagtgctcatgccgacggaggag gcctatgcactgcccctcgtggtgaaggtggtagacaactgggccttcggccagcagacc gtgacgggccaggccaacatcgacttcctccagccctacttctgtgacccctgggctcaa gactatatgcacccaaagcttccaacgctgtctgagaagaagcaccaagacttcctaggc tacctctacagaaagttctggttcaagtccagtaaagcagaggatgagtatgagcatgag gtggactggtggagcaagctgttctgggccacagatgagcacaagtccctgaagtacaag tacaaagactaccacaccctcaaggtgtatgagtgtgagctggaggccgtgccagccttc cagggcctgcaggacttctgccagaccttcaaactctaccaggagcagcccaagttggac agccccgtggtaggggagttcaagggccttttccgcatctacccctttcctgagaatcca gaagccccaaagcccccgctgcagttcttggtttggccagagagagaggacttcccccag ccgtgcttggtgcgggtgtacatggtacgagccatcaacctgcagccccaggactacaat ggcctgtgtgacccttatgtgatcctgaaactgggcaagacagagcttggcaaccgggac atgtaccagcccaacactctggatcccatctttggcatgatgtttgaactcacctgcaac atacccctggagaaggacctagagatccagctctatgacttcgacctattttcacctgat gataagataggaaccacagtcatcgaccttgaaaaccgactcctatctggctttggagct cattgtgggctctccaaatcctactgccagtcagggccctttagatggcgggatcagatg cccccaagctacctcctagaacgctatgccaagcggaaagggctacctccgcctctgttc agtcctgaggaagatgctgttttctataatgggaaaaagttcaagctgcaaagctttgag cccaaaacccctactgttcatggtttgggacccaagaaggaacgccttgcactgtacctc ctgcacacccaggggctggtacctgagcacgtggagacccgcacactgtacagccacagc cagccaggcatcgaccagggaaaggtgcaaatgtgggtggacatcttccccaagaagctg gggcctcctggcccccaagtcaacatcaaccccagaaagcctaaacgcaaagcctcagag cacagtggccacaggtatgagctgcgatgcatcatctggaagactgccaatgtggacctg gtggatgacaatttaagtagagagaagacgagcgacatctacatcaaagggtggttatac gggctggagaaggacatgcagaagacagacatccactaccactcgctgactggggaggcc gacttcaactggcggttcatctttaccatggactacctggcggcggagcgcacgtgtgtc cagagccagaaggattacatatggagcctggatgccacgtccatgaagttcccagcccga cttatcatccaggtctgggacaatgacatcttctcccccgacgacttcctaggggtcctg gagctggatttgtctgacatgcccctcccggctcggcacgccaagcagtgctccatcagg atgatggacgccgaccccaagtggccctatttcatccaatacaagcacttctccctcttt aagaagaagactgtgactggctggtggccttgccaggtcctcgatggtggcaaatggcgc ttgtcgggcaaggtgaagatgagcctggagattctgtcagagaaggaagccttaatcaag ccagccgggcgaggccagtcggaacccaaccagtaccccacacttcatcctcccctacgc accaacacctctttcacgtggctgcggtcaccagttcaaaacttctgctatattttctgg aaacgctatcgcttcaaactcatagcctttatggtcatatcgattatagcacttatgctg tttaacttcatctattcagctccgcactatttggccatgagctggatcaaacctcaactt cagctgtatcctcccattaaaatattcaatatcatcaattcactaaacaccagcaacgcc agctcttccatccttcccacccaggatccaaacctaaagcctacaatagaccatgagtgg aaactccacccaggacccacaaatcacctgagtgatattttcccagaacttccagcccca ggagactaa >gi568815596r:96502752_96737135|GENSCAN_predicted_peptide_5|320_aa PCFLRDWELQVHFKIHGQGKKNLHGDGLAIWYTKDRMQPGPVFGNMDKFVGLGVFVDTYP NEEKQQEMGLSSCRKTSLGFPLISTSWMGIVSSLSGFLHKGLDEDYCRTSSLYSFEKVTG LRVILELKRAILNGKIPLLKQRVFPYISAMVNNGSLSYDHERDGRPTELGGCTAIVRNLH YDTFLVIRYVKRHLTIMMDIDGKHEWRDCIEVPGVRLPRGYYFGTSSITGDLSDNHDVIS LKLFELTVERTPEEEKLHRDVFLPSVDNMKLPEMTAPLPPLSGLALFLIVFFSLVFSVFA IVIGIILYNKWQEQSRKRFY >gi568815596r:96502752_96737135|GENSCAN_predicted_CDS_5|963_bp ccatgtttcctgagagactgggagttgcaggtgcacttcaaaatccatggacaaggaaag aagaatctgcatggggatggcttggcaatctggtacacaaaggatcggatgcagccaggg cctgtgtttggaaacatggacaaatttgtggggctgggagtatttgtagacacctacccc aatgaggagaagcagcaagagatgggactatctagttgcaggaaaacaagcttagggttc ccgctgatttctacatcatggatgggtattgtctcctctttgtctggatttctgcataaa gggctagatgaagattattgtaggaccagcagcttatacagctttgaaaaagtgacaggt ctacgtgtaattttagaattgaaaagagctatcctgaatggaaagattcccctgctcaaa cagcgggtattcccctacatctcagccatggtgaacaacggctccctcagctatgatcat gagcgggatgggcggcctacagagctgggaggctgcacagccattgtccgcaatcttcat tacgacaccttcctggtgattcgctacgtcaagaggcatttgacgataatgatggatatt gatggcaagcatgagtggagggactgcattgaagtgcccggagtccgcctgccccgcggc tactacttcggcacctcctccatcactggggatctctcagataatcatgatgtcatttcc ttgaagttgtttgaactgacagtggagagaaccccagaagaggaaaagctccatcgagat gtgttcttgccctcagtggacaatatgaagctgcctgagatgacagctccactgccgccc ctgagtggcctggccctcttcctcatcgtctttttctccctggtgttttctgtatttgcc atagtcattggtatcatactctacaacaaatggcaggaacagagccgaaagcgcttctac tga