GENSCAN 1.0    Date run: 5-Nov-116    Time: 02:22:20

Sequence gi568815581f:38358099_38598510 : 240412 bp : 52.77% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3613   3677   65  1  2  128  110    14 0.939   6.45
 1.02 Intr +   8189   8319  131  0  2  118  100   151 0.995  20.22
 1.03 Intr +   9784   9952  169  0  1   65   94   106 0.896   8.93
 1.04 Intr +  16510  16721  212  2  2    6   -1   216 0.066   3.66
 1.05 Intr +  17257  17347   91  0  1  128   -7    62 0.478   0.77
 1.06 Intr +  19616  19744  129  0  0   86  110     4 0.408   3.57
 1.07 Intr +  37211  37400  190  0  1   77   77    37 0.005   0.66
 1.08 Term +  37750  37870  121  1  1   83   44   117 0.010   5.05
 1.09 PlyA +  39044  39049    6                               1.05

 2.09 PlyA -  39577  39572    6                               1.05
 2.08 Term -  40104  39980  125  1  2   99   42    80 0.791   3.35
 2.07 Intr -  44092  43997   96  2  0   92   98    63 0.871   8.18
 2.06 Intr -  56693  56673   21  0  0  104  111     0 0.471   1.90
 2.05 Intr -  57071  56967  105  0  0   93   94    58 0.621   7.59
 2.04 Intr -  57830  57775   56  1  2   76   90    22 0.330   0.31
 2.03 Intr -  59051  58977   75  1  0   81   77    34 0.345   0.72
 2.02 Intr -  60467  60342  126  1  0   60   64    69 0.569   2.00
 2.01 Init -  61279  61182   98  1  2   57   13   185 0.590   5.63
 2.00 Prom -  63472  63433   40                               0.49

 3.05 PlyA -  65103  65098    6                              -0.45
 3.04 Term -  70861  70700  162  1  0  114   48    99 0.536   6.75
 3.03 Intr -  76895  76844   52  1  1  119   59    26 0.261   2.20
 3.02 Intr -  81569  81514   56  2  2   98   78    -2 0.097  -1.93
 3.01 Init -  83003  82833  171  2  0   49   79    95 0.414   4.12
 3.00 Prom -  87071  87032   40                              -5.21

 4.00 Prom +  87389  87428   40                              -3.51
 4.01 Init +  89757  89802   46  2  1   76   84    52 0.034   2.51
 4.02 Intr +  92164  92211   48  2  0   38  107    45 0.004   0.54
 4.03 Intr +  95816  96042  227  0  2    0   76   177 0.019   5.93
 4.04 Intr +  97774  97914  141  0  0   51   58    68 0.016   1.16
 4.05 Intr + 100058 100165  108  1  0   76   91   186 0.916  18.68
 4.06 Intr + 102807 102834   28  2  1  128   94    22 0.975   4.97
 4.07 Intr + 104748 104843   96  1  0   60   99   121 0.998  10.88
 4.08 Intr + 105020 105098   79  0  1  106   95    99 0.995  11.50
 4.09 Intr + 105230 105284   55  2  1   98  105    92 0.997  11.37
 4.10 Intr + 108069 109233 1165  2  1  102   97   770 0.647  68.37
 4.11 Intr + 109809 109955  147  1  0   68   81    55 0.918   3.52
 4.12 Intr + 111046 111201  156  2  0   71   47   110 0.974   5.69
 4.13 Intr + 111426 111537  112  1  1   57   77   246 0.999  20.34
 4.14 Intr + 111749 111806   58  2  1   94   64   107 0.977   8.28
 4.15 Intr + 113765 113908  144  1  0  106  116    92 0.999  14.79
 4.16 Intr + 119481 119798  318  2  0  113   78   454 0.995  43.60
 4.17 Intr + 121338 121399   62  2  2  117   78   100 0.999   9.92
 4.18 Intr + 121655 121785  131  2  2   96   53    67 0.954   4.84
 4.19 Intr + 123924 124045  122  1  2  109  109   133 0.922  18.22
 4.20 Intr + 124425 124580  156  2  0  113   90   193 0.999  22.72
 4.21 Intr + 127964 128042   79  1  1  100  101   130 0.994  15.12
 4.22 Intr + 132004 132077   74  2  2   73   95   226 0.991  21.42
 4.23 Intr + 132364 132453   90  0  0  107   98    84 0.991  11.99
 4.24 Intr + 133309 133434  126  0  0   97   87   220 0.999  24.18
 4.25 Intr + 133589 133707  119  1  2   26   70    23 0.544  -5.93
 4.26 Intr + 134610 134864  255  0  0   60   60    94 0.448   0.89
 4.27 Intr + 135939 136311  373  0  1   64   53   156 0.249   5.33
 4.28 Intr + 139687 139728   42  0  0  105   89    59 0.943   6.62
 4.29 Intr + 140316 140412   97  2  1  115   99   115 0.993  15.38
 4.30 Intr + 142499 142530   32  0  2   87  102    17 0.356   1.33
 4.31 Term + 151846 152874 1029  0  0   94   42  1282 0.878 116.84
 4.32 PlyA + 154269 154274    6                               1.05

 5.22 PlyA - 155180 155175    6                               1.05
 5.21 Term - 175333 175199  135  1  0  135   43    17 0.176   0.23
 5.20 Intr - 183397 183341   57  1  0   64   84    70 0.883   3.77
 5.19 Intr - 185871 185725  147  0  0  106   94   137 0.797  17.04
 5.18 Intr - 187361 186777  585  2  0  116   39   225 0.790  13.86
 5.17 Intr - 189783 189661  123  0  0  133   92    41 0.985  10.29
 5.16 Intr - 190611 190459  153  0  0   51   92   304 0.931  27.88
 5.15 Intr - 191112 190958  155  1  2  105   76   195 0.994  20.30
 5.14 Intr - 191440 191276  165  2  0  119   94    13 0.961   5.45
 5.13 Intr - 193291 193057  235  1  1   60   80   257 0.800  19.89
 5.12 Intr - 194034 193788  247  2  1   90   38   308 0.455  24.00
 5.11 Intr - 194496 194349  148  1  1  118   36   324 0.984  30.00
 5.10 Intr - 194757 194627  131  2  2   45   59   273 0.842  20.94
 5.09 Intr - 200304 200129  176  0  2  101   61   427 0.468  40.55
 5.08 Intr - 201674 201487  188  0  2   96   86   221 0.994  22.63
 5.07 Intr - 201999 201956   44  2  2  108   94    50 0.999   5.97
 5.06 Intr - 202327 202235   93  0  0   85   94   134 0.997  13.28
 5.05 Intr - 204230 203365  866  2  2   73   94  1402 0.999 130.12
 5.04 Intr - 204822 204729   94  2  1   96   55   105 0.998   8.47
 5.03 Intr - 205504 205225  280  2  1   35   68   528 0.678  42.58
 5.02 Intr - 206215 206020  196  1  1   89   84   226 0.845  21.81
 5.01 Init - 208059 207880  180  0  0  120  100    71 0.992   8.79
 5.00 Prom - 209059 209020   40                             -10.58

 6.11 PlyA - 209112 209107    6                              -0.45
 6.10 Term - 210035 209873  163  1  1  116   44   138 0.979   9.92
 6.09 Intr - 212666 212492  175  0  1   81   35    51 0.033  -1.39
 6.08 Intr - 214798 214570  229  1  1  -25  110   115 0.033   0.37
 6.07 Intr - 216876 216778   99  0  0   84  109    37 0.385   6.21
 6.06 Intr - 220693 220392  302  2  2  113  109   587 0.968  60.30
 6.05 Intr - 226724 226661   64  2  1  154   55    18 0.336   3.88
 6.04 Intr - 229394 229344   51  2  0   78   90    15 0.227   0.29
 6.03 Intr - 230183 230080  104  0  2  124   28     0 0.270  -2.01
 6.02 Intr - 231309 231181  129  1  0   51   91    93 0.848   7.07
 6.01 Intr - 235282 235183  100  1  1   69   75    99 0.769   6.88

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581f:38358099_38598510|GENSCAN_predicted_peptide_1|369_aa
XKPKLTRTQSAFSPVSFSPLFTDAFPRIAPIRAAESLHSQPPQHLQCPLYRPDSSSFAAS
LRELEKCGWYWGPMNWEDAEMKLKGKPDGSFLVRDSSDPRYILSLSFRSQGITHHTRMEH
YREPIKEIMKEIWQKKKVKGEGFQDRDLREIQELIDTTSEELTEGDLMEMSSSESVPDDE
EEDVEETVPENINSIWFENNEDENLYEDPLYDHFMMIHFHLMNRTFSLWCHPKFEDRCQS
VVEFIKRAIMHSKNGKFLYFLRSRVPGLPPTPVQLLYPVSRFSNVKSLQHLCRFRIRQLV
RIDHIPDLPLPKYNGVVRFGTGMSKGVVGRPLISYIRKFYYYDPQEEVYLSLKEAQLISK
QKQEVEPST

>gi568815581f:38358099_38598510|GENSCAN_predicted_CDS_1|1110_bp
nngaaacccaagttgacaagaactcaaagtgccttttctccggtctccttcagccccctg
ttcacagatgcatttccccggattgctcccatccgagcagctgaatccctgcacagccaa
cccccacagcacctccagtgtcccctctaccggcctgactcgagcagctttgcagccagc
cttcgagagttggagaagtgtggttggtattgggggccaatgaattgggaagatgcagag
atgaagctgaaagggaaaccagatggttctttcctggtacgagacagttctgatcctcgt
tacatcctgagcctcagtttccgatcacagggtatcacccaccacactagaatggagcac
tacagagagccaatcaaggaaatcatgaaagagatatggcaaaaaaagaaggtgaagggt
gaaggatttcaagatagggatcttagagaaattcaagaactcatagataccacatcagag
gaattaactgaaggtgacttgatggagatgagttcttctgagtcagtgccagatgacgag
gaagaagacgtagaagaaacagtgccagaaaacattaattctatttggtttgaaaacaat
gaggatgaaaacctttatgaagatccactttatgatcactttatgatgatccacttccat
ttaatgaatagaaccttcagcctgtggtgtcatcccaagtttgaggaccgctgtcaatct
gttgtagagtttattaagagagccattatgcactccaagaatggaaagtttctctatttc
ttaagatccagggttccaggactgccaccaactcctgtccagctgctctatccagtgtcc
cgattcagcaatgtcaaatccctccagcacctttgcagattccggatacgacagctcgtc
aggatagatcacatcccagatctcccactgcctaagtacaatggggttgtcaggtttggg
acaggaatgagtaagggggttgtggggagacctctgatctcttatatccgaaagttctac
tactatgatcctcaggaagaggtatacctgtctctaaaggaagcgcagctcatttccaaa
cagaagcaagaggtggaaccctccacgtag

>gi568815581f:38358099_38598510|GENSCAN_predicted_peptide_2|233_aa
MPSSPRRPRGAQGEAAPPCALSGQLHPATAKRKEGGSEPPDTTGEHNVWAAGPPTLVKTK
EMAKRHLRSLTPSIRLDPPVLLFFYIPQGPEPSATMLPQSLQGSLGNREKVERELRATGV
RLGPPTLPRSQENKGFSLSLHLEEGVVSHTPGPARSSAASAHSTEKGLGCGQGWDQCTDT
DTVQHDTPQEEKGLLGSTPGASSALRLLFVTGINTDVTAKRDPGSSVQTFANI

>gi568815581f:38358099_38598510|GENSCAN_predicted_CDS_2|702_bp
atgccgtcctcgccgcgccggccccggggggcgcagggggaggcggcccctccatgcgcc
ctgtccgggcagctgcacccggcgaccgccaagcgcaaagaagggggatctgaaccccca
gatactactggagaacataatgtgtgggcagcggggccgcccactctggtgaagaccaag
gagatggcaaaaagacaccttcgctctctgacaccctccatcaggctggatcccccagtg
cttctcttcttctacatcccacagggaccagaaccttcagcaacaatgttgcctcagagc
ctccagggctccctggggaacagggagaaggtagaaagggagctaagagccacgggtgtc
cgcctcggccccccaactctgccccggagccaggagaacaaaggcttctcgctctccctc
cacctggaggagggggtcgtgtcccacacaccagggccagcccgctcctctgcggcctcc
gcccactccacagagaaaggactaggctgcggccagggctgggaccagtgcactgacaca
gacacagttcaacacgacactccccaggaggagaaagggctcctagggtctacaccagga
gcaagttcagcattacgcttactctttgtcacaggcatcaatactgatgtcacagcaaaa
agggatcctgggagttcagttcaaaccttcgccaatatatga

>gi568815581f:38358099_38598510|GENSCAN_predicted_peptide_3|146_aa
MVTQQKAVPNLQHNPGPLPTCAVATAAAFPRLGGSFWSPVAKIPLTLCSLWRPLKVVLLN
RDNPSPMGLLCRLKRRNNTEFLHLEHKGDSMRELCRGLANGKRKLGQRKRRQAPRLKNPN
KSRGPPAPAAQPLGAWHRHLGLCPLS

>gi568815581f:38358099_38598510|GENSCAN_predicted_CDS_3|441_bp
atggtcactcagcaaaaggcagtgcccaacctgcagcacaacccagggcctctgcccacc
tgcgctgtggccactgcagctgcatttcccaggcttggaggttccttctggtcacccgtg
gccaaaatcccactgacgctgtgcagcctctggaggcctctaaaggtcgtgttattaaat
agagataatccatcgcccatgggattgttgtgcagattaaagagacggaacaatacagag
ttcctccaccttgagcataagggagactctatgcgggagctgtgccggggcctcgccaac
ggaaaacgaaaactcgggcagaggaaacggcggcaggcgccccgcctgaaaaacccgaac
aaaagccgcgggcctcccgcccccgctgcccagccgctgggggcctggcaccgccacctg
ggtctgtgccccctgtcctga

>gi568815581f:38358099_38598510|GENSCAN_predicted_peptide_4|1904_aa
MALVFTLWLARRTLFSGTQDVVKQEGNGKGSARPGAYWGRGSGRAREPPPAAELVGSAVW
RVERAGAGGWRWERAVGVDCSAPEPRCLWLPCLSHSDRRAPGPRRARAPMGVQGTVGETG
LKLQVEDAGLPLMSKPLLRPPGLGVFQLLRGPCLGPRTLLLYKSPQDGFGFTLRHFIVYP
PESAVHCSLKEEENGGRGGGPSPRYRLEPMDTIFVKNVKEDGPAHRAGLRTGDRLVKVNG
ESVIGKTYSQVIALIQNSDDTLELSIMPKDEDILQLAYSQDAYLKGNEPYSGEARSIPEP
PPICYPRKTYAPPARASTRATMVPEPTSALPSDPRSPAAWSDPGLRVPPAARAHLDNSSL
GMSQPRPSPGAFPHLSSEPRTPRAFPEPGSRVPPSRLECQQALSHWLSNQVPRRAGERRC
PAMAPRARSASQDRLEEVAAPRPWPCSTSQDALSQLGQEGWHRARSDDYLSRATRSAEAL
GPGALVSPRFERCGWASQRSSARTPACPTRDLPGPQAPPPSGLQGLDDLGYIGYRSYSPS
FQRRTGLLHALSFRDSPFGGLPTFNLAQSPASFPPEASEPPRVVRPEPSTRALEPPAEDR
GDEVVLRQKPPTGRKVQLTPARQMNLGFGDESPEPEASGRGERLGRKVAPLATTEDSLAS
IPFIDICVRPCAWLHALLDWMAQQRVLDMQGQSIGMGVSWDRELEETGSEKGYDEPTSPS
IDLQAKHVPASAVVSSAMNSAPVLGTSPSSPTFTFTLGRHYSQDCSSIKAGRRSSYLLAI
TTERSKSCDDGLNTFRDEGRVLRRLPNRIPSLRMLRSFFTDGSLDSWGTSEDADAPSKRH
STSDLSDATFSDIRREGWLYYKQILTKKGKKAGSGLRQWKRVYAALRARSLSLSKERREP
GPAAAGAAAAGAGEDEAAPVCIGSCLVDISYSETKRRHVFRLTTADFCEYLFQAEDRDDM
LGWIRAIRENSRAEGEDPGCANQALISKKLNDYRKVSHSSGPKADSSPKGSRGLGGLKSE
FLKQSAARGLRTQDLPAGSKDDSAAAPKTPWGINIIKKNKKAAPRAFGVRLEECQPATEN
QRVPLIVAACCRIVEARGLESTGIYRVPGNNAVVSSLQEQLNRGPGDINLQDERWQDLNV
ISSLLKSFFRKLPEPLFTDDKYNDFIEANRIEDARERMRTLRKLIRDLPGHYYETLKFLV
GHLKTIADHSEKNKMEPRNLALVFGPTLVRTSEDNMTDMVTHMPDRYKIVETLIQHPWAA
FQGKAGMDPGIPAALGCCVPSRRVKRPPSFRDAWERAPSHRREDHSLLHPDTGNTTPELS
GRPGLPCSVGDPDEALGKLSSPGHQGAPEPPVRVCFGGSSREEWVSGPVLLEKSLEAQGL
RTFSSIAGFYLLDAGNITLVVTIENVSRHCQVPTGVGGLTAVALGPACSYVLCGLGQVPY
LLCLSLHTCKAEMTILFTSHRCECLLKHSRRCLAGPKCRVRTDCHPLRRGHYSTPETDPR
AGNTESDWFFSDEEDKGERTPVGDKEPQAVPNIEYLLPNIGRTVPPGDPGSDSTTCSSAK
SKGSWAPKKEPYAREMLAISFISAVNRKRKKRREARGLGSSTDDDSEQEAHKPGAGATAP
GTQERPQGPLPGAVAPEAPGRLSPPAAPEERPAADTRSIVSGYSTLSTMDRSVCSGASGR
RAGAGDEADDERSELSHVETDTEGAAGAGPGGRLTRRPSFSSHHLMPCDTLARRRLARGR
PDGEGAGRGGPRAPEPPGSASSSSQESLRPPAAALASRPSRMEALRLRLRGTADDMLAVR
LRRPLSPETRRRRSSWRRHTVVVQSPLTDLNFNEWKELGGGGPPEPAGARAHSDNKDSGL
SSLESTKARAPSSAASQPPAPGDTGSLQSQPPRRSAASRLHQCL

>gi568815581f:38358099_38598510|GENSCAN_predicted_CDS_4|5715_bp
atggccctggtgtttactttgtggctggctagacgcacccttttctcaggtactcaagat
gtggttaaacaagagggtaatggaaaaggatctgcccggcctggggcgtactggggccgg
ggcagtggccgggcccgggagcccccgccggccgcggagctggtgggcagcgccgtgtgg
cgcgtggagcgcgcgggggccgggggctggcgctgggagcgcgccgtcggcgtggactgc
agcgccccggaaccgcgctgcctctggctgccctgcctcagccacagcgaccgccgcgcg
cccgggccgcgccgggccagggcccccatgggggtgcagggcaccgtgggagagactggg
ctaaagctacaggtggaagatgctggactacccctgatgtcaaaacccctgctgagacct
cctggactgggggtcttccagcttttacgtggcccctgtctggggccgaggacgctgctg
ctgtacaaaagtccccaggacggctttggcttcactctgcgccacttcatcgtgtaccca
cccgagtcggccgtgcactgcagcctgaaggaggaagagaatggaggccgtggaggagga
ccctccccccggtaccgcctggagcccatggacaccatctttgtcaagaatgtgaaggaa
gacggccctgcccatagggcggggcttcgcacaggagaccggctggtaaaggtgaatggg
gaaagcgtcattgggaagacctactctcaggtcatagctctgatccagaatagtgatgac
actctggagctgtctatcatgcccaaggacgaggacatcctccagctggcctactcccag
gatgcctacctgaaagggaacgagccgtattctggagaggcccgcagcatcccagagcca
cccccgatctgctacccccgcaagacctacgcccctcctgcccgggcctccaccagggcc
actatggtgcctgagcccacctcagcactgcccagtgacccccggagtcctgctgcctgg
agtgacccggggctccgtgtgccacctgctgcccgtgcccacctggacaactcttccttg
gggatgagccagccccgccccagccctggtgccttcccccacctctcctcggagccccgg
acgccccgtgccttcccagagcctggcagccgggtgccccccagcagactggagtgccag
caggccttgtcacactggctgtcaaaccaggtaccccgccgggcgggggagagacggtgc
ccagccatggccccccgggcccgcagcgcctcccaggaccggttggaggaggtggctgcc
ccccgcccgtggccctgctccacctcccaggatgctttgagccagctgggccaggagggc
tggcaccgagctcgctcagatgactacttgagccgggccacccgttctgccgaggcactg
gggccaggggcactggtgtcaccccgctttgagcggtgtggctgggcttcccagcgttcg
tctgcccgcacccccgcctgcccaactcgggacctgccagggccccaggccccacccccg
tctggcctgcagggcctggatgacctcgggtacatcggctaccggagctacagcccatca
ttccagcgccggaccggcctcctccatgcgctctccttccgggactcaccctttgggggg
ctgcctaccttcaacctggcccagtcccctgcgtcattcccaccagaggcctccgagcca
cccagggttgtacggccggaacccagcacccgggccctggagcctcctgcggaggatcgc
ggcgatgaggtggtcctgaggcagaagcccccgacgggccgcaaggttcagctgaccccc
gcaagacagatgaaccttggatttggtgacgagtccccagagccagaggccagtgggcga
ggggaacgcctgggcaggaaggtggcccctttggccaccaccgaagactctctggcttcc
atcccctttattgatatttgtgtcaggccctgtgcatggctccatgctctcctggactgg
atggcccagcagagggtgttggacatgcagggacaatccattgggatgggcgtgtcgtgg
gacagggagcttgaggagacgggctcagagaagggctatgatgagcccaccagccccagc
attgacctccaagccaagcacgtccctgcctctgctgtggtctccagtgccatgaactca
gcccctgtcctgggcaccagcccatcttccccgaccttcactttcaccctcggacgccat
tactcgcaggactgcagcagcatcaaggctggccgccgctcctcctacctgctggccatc
accacggagcgctccaagtcctgcgatgatggactcaacaccttccgcgacgagggccgg
gttctgcggcgcctgccaaaccgcatacccagcctgcggatgctccggagcttcttcacc
gacgggtccttggatagctggggcacctctgaagatgctgacgctccttctaagcgacac
tcaacctctgacctctcagatgcgaccttcagcgatatcaggagagaaggctggttgtat
tataagcagattctcaccaagaaggggaagaaagcgggcagcggcctgcgccagtggaag
cgggtgtacgccgcgctgcgggcgcgctcgctctcgctgagcaaggagcggcgggagccc
gggccggcggcggcgggggctgcggcggccggcgcaggtgaggacgaggcggcgcccgtc
tgcatcggctcctgcctcgtggacatctcctacagcgagaccaagaggaggcacgtgttc
cggctgaccaccgctgacttctgtgaatatctctttcaggctgaggaccgggatgacatg
ctgggctggatcagagcgatccgggagaacagcagggccgagggcgaggaccccggctgt
gccaaccaagctctgatcagcaagaagcttaacgattatcgcaaagtgagccatagctct
gggcccaaagctgattcctcccccaaaggctctcgcggcctggggggcctcaagtctgag
ttcctcaagcagagtgcggcacgtggcctcaggactcaggacctgcccgcagggagcaag
gatgacagtgctgcagcccccaaaaccccctggggcatcaacatcatcaagaaaaataag
aaggccgctccgagggcgtttggggtcaggctggaggagtgccagccagccacggagaac
cagcgcgtccccttaatcgtggctgcatgctgtcgcattgtggaggcacgagggctggag
tccacaggcatttaccgagtgcccggcaacaatgcagtggtgtccagcctacaggagcag
ctcaaccgcgggcctggtgacatcaacctgcaggatgagcgctggcaagacctcaatgtg
atcagcagcctgctcaagtccttcttccgaaagctgcccgagcctcttttcactgatgac
aaatacaacgacttcatcgaggccaaccgcattgaggacgcgcgggagcgaatgaggacg
ctgcggaagctgatccgggatctcccaggacactactatgaaacgctcaaattccttgtg
ggccatctcaagaccatcgctgaccactctgagaaaaacaagatggaaccccggaacctg
gccctggtctttgggccgacactggtgaggacgtctgaggacaacatgacagacatggtg
acccacatgcctgaccgctacaagatcgtggagacactgatccagcacccatgggctgca
tttcaaggcaaggcaggaatggatcctggaatccctgctgccctgggatgttgtgtcccc
agcaggagagtcaagaggcccccgagcttccgagatgcctgggaaagggctccgagccac
aggagggaagatcacagcctgctgcatccggacactgggaacaccactccagagctgtct
gggaggccagggctgccctgcagcgtcggggatcctgatgaggccttgggaaagcttagc
tctccagggcaccagggagccccggaacctcccgtgagggtgtgttttgggggcagcagc
agggaagaatgggtgtctggccctgtgctcctggagaagtccctggaagcccaaggtctg
aggacatttagcagcatcgctggcttctacctgctggatgctgggaatatcactctagtt
gtcacaatcgaaaatgtctccagacattgccaagtgcccactggggtggggggattgacg
gcagtggctctggggccagcctgctcctatgtgctgtgtggccttggacaagttccttac
ctgctgtgcctcagtctccacacctgtaaagcagagatgacaatactgtttacctcacat
cgttgtgaatgtctgctaaagcactcgcggcggtgcctggcaggtcctaagtgtcgtgtg
agaacggactgtcatcctcttcgtcgtggtcattattccacgccagagacagatccccgt
gctgggaacacggagtcagactggttcttcagtgacgaagaggacaagggagagagaacc
cctgtgggcgacaaggagcctcaggcagtgcccaacattgagtacctcctgcccaacatt
ggcaggacagtgccccctggcgacccggggtcagattctaccacctgtagttcagccaag
tccaagggttcgtgggcccccaagaaggagccgtacgcccgggagatgctggcgatctcc
ttcatctcggccgtcaaccgcaagcgcaagaagcggcgggaggcgcgggggctgggcagc
agcaccgacgacgactcggagcaggaggcgcacaagcctggggcgggggccacagcgccg
gggactcaggagcggccgcaggggccgctgcctggcgccgtcgcccccgaggcccccgga
cgcctcagtcccccggcggcgccggaggagcggccggccgcggacacgcgctccattgtg
tcgggctactccaccctgtccaccatggaccgcagcgtgtgctcgggcgctagcggtcgg
cgggcaggggcgggggatgaggcggacgacgagcgtagcgagctgagccacgtggagacg
gacactgagggcgcggcgggcgcggggcctggggggcgcctgacacgccggccgtccttc
agctcgcaccacctcatgccctgcgacactctggcgcgccgccgcctggcccggggccgc
ccagacggcgagggcgcgggccggggcggtccccgcgccccggagccgcccggctcggcg
tcgtccagcagccaggagtcgctgcggcccccggcggcggcgctggcctcccggccctcg
cgcatggaggcgctgcgtctaaggctccgcggcacggcggacgacatgctcgccgtgcgc
ctgcggcggccgctgtcgcccgagacccggcggcgccggagcagctggcgccgccacacc
gtggtggtgcagagcccgctgactgacctcaacttcaacgagtggaaggagctgggcgga
gggggccccccggagcctgcgggcgcgcgggcgcacagtgacaacaaggactccggactc
agcagcctggagtccaccaaggcgcgggccccgtcgtccgctgcctcgcagccgcccgcg
cccggggacacggggtccctgcagagccagcccccgcgccgctcggccgcctcccgcctg
catcagtgtctgtga

>gi568815581f:38358099_38598510|GENSCAN_predicted_peptide_5|1465_aa
MATPYSSSLLAASCCLNSVALIRTSLATTDLLVARQPHDHQTRPGSPHCHRAPRPSAGMK
TRSSRHTQGAQPGLADQAAKLSYASAESLETMSEAELPLGFSRMNRFRQSLPLSRSASQT
KLRSPAPKSSPRSPGGRRGDGSSDSEGGVSFAGVLFLQFGEETRRVHITHEVSSLDTLHA
LIAHMFPQKLTMGMLKSPNTAILIKDEARNVFYELEDVRDIQDRSIIKIYRKEPLYAAFP
GSHLTNGDLRREMVYASRESSPTRRLNNLSPAPHLASGSPPPGLPSGLPSGLQSGSPSRS
RLSYAGGRPPSYAGSPVHHAAERLGGAPAAQGVSPSPSAILERRDVKPDEDLASKAGGMV
LVKGEGLYADPYGLLHEGRLSLAAAAGDPFAYPGAGGLYKRGSVRSLSTYSAAALQSDLE
DSLYKAAGGGGPLYGDGYGFRLPPSSPQKLADVAAPPGGPPPPHSPYSGPPSRGSPVRQS
FRKDSGSSSVFAESPGGKTRSAGSASTAGAPPSELFPGPGERSLVGFGPPVPAKDTETRE
RMEAMEKQIASLTGLVQSALLRGSEPETPSEKIEGSNGAATPSAPCGSGGRSSGATPVSG
PPPPSASSTPAGQPTAVSRLQMQLHLRGLQNSASDLRGQLQQLRKLQLQNQESVRALLKR
TEAELSMRVSEAARRQEDPLQRQRTLVEEERLRYLNDEELITQQLNDLEKSVEKIQRDVS
HNHRLVPGPELEEKALVLKQLGETLTELKAHFPGLQSKMRVVLRVEVEAVKFLKEEPQRL
DGLLKRCRGVTDTLAQIRRQVDEGVWPPPNNLLSQSPKKVTAETDFNKSVDFEMPPPSPP
LNLHELSGPAEGASLTPKGGNPTKGLDTPGKRSVDKAVSVEAAERDWEEKRAALTQYSAK
DINRLLEETQAELLKAIPDLDCASKAHPGPAPTPDHKPPKAPHGQKAAPRTEPSGRRGSG
SVQPDPGLAAQRWGLSCSSDKLPGAASAAGPQTTLGVARVLSGEGQGLGLVCRRDELTVP
RYRTEKPSKSPPPPPPRRSFPSSHGLTTTRTGEVVVTSKKDSAFIKKAESEELEVQKPQV
KLRRAVSEVARPASTPPIMASAIKDEDDEDRIIAELEVFERSSVSSLPPTPRRQLIPTLL
SPQDLGPPGGSAPGPTRKAAPGPRAFCVPRIILTECAPNPPSPPEARLEELGPRTAPTPR
PQTLADSTRGWDGPQSPPGVVGETSGPRSSFMPRKEGAALKRLGGGGSSLEDGGARVQCP
QGPAQDGTPETSTADTYPEEILKDSGHDAQTCSREHQGQAAANSGRTTWGATAQQMDSLE
ETLRELEATLSNMGTGPAMGSPGSPPPLPLRPQSGGGSVPPMKVVTPGASRLKAAQGQAG
SPDKSKHGKQRAEYMRIQAQQQAPRMRVSRHNAPPTTLPQPATKPSKEMSGSNETSSPVS
EKPSASRTSIPVLTSFGARNSSISF

>gi568815581f:38358099_38598510|GENSCAN_predicted_CDS_5|4398_bp
atggccaccccctactcttcatcgttgctggctgcctcgtgctgcctgaattctgtagcc
ctcatccgcactagcctggcaaccactgaccttttagttgccaggcaaccccatgaccac
cagaccagacctggttccccacactgccacagagctccccgcccttcagctgggatgaag
acccgcagctcacgccacactcagggagcccagcccgggctggcagaccaggcggcaaag
ctgtcctacgcctccgccgagtcgctggagaccatgtcggaggccgagctgcccctgggc
ttcagcaggatgaaccgcttccgacagagcctgcctctctcccgctcggccagccagacc
aagctgcgctccccagctccgaaaagctccccgcgctctcctggagggcggcgtggagac
ggcagcagtgacagcgagggcggcgtctccttcgcaggggtgctgttcctgcagttcggg
gaggagactcggcgcgtgcacatcacgcacgaggtcagcagcctggacacgctgcacgca
ctcatcgcgcacatgttcccgcagaagctcaccatgggcatgcttaagtcgcccaatacc
gccatcctcatcaaagacgaggctcgcaacgtcttctacgagctggaggacgtccgggac
atccaggaccgcagtattatcaagatctacagaaaggagcccctctacgctgccttccct
ggctcacatctcaccaacggggacctccggagagagatggtgtacgcatcgcgggagtcc
tcgcccacgcggcgcctcaacaacctgtcaccagcgccgcacctggcatccggctcgccg
ccgcccgggctgccgtcggggctgccgtccgggctgcagtccggttcgccgtcgcgttcg
cgcctatcgtacgccggggggcgcccgccttcgtacgccggcagcccggtgcaccacgcg
gccgagaggctgggaggcgccccggccgcccagggcgtcagccccagccccagcgccatc
ctggagcggcgcgacgtgaagccggacgaggacctggcgagcaaggcgggcggcatggtg
ctggtgaaaggcgagggcctctatgctgacccctacgggctgctgcacgagggccgtctg
agcctggccgcggccgccggcgacccgttcgcctacccgggcgccggcggcctctacaag
cgcggctcggtgcgctcgctcagcacctactcggccgccgcgctgcagtccgatctggag
gactccctgtacaaggcggcgggcggcggcggcccgctgtacggcgacggctacggcttc
cgcctgccgccttcgtcaccgcagaagctggccgacgtggcagcaccccccggaggtccc
ccgccaccgcacagcccctactcggggccgcccagccgcggctcgccagtgcgccagtcc
ttccgcaaggactcgggctcctcgtccgtctttgccgagagtcctggagggaagacccgc
agcgcggggagcgcctcgacggccggagctcccccttcggagctcttccctgggcctggg
gaacgctcgctggttgggttcgggccgccagtgccagccaaggacacggagaccagggag
cgcatggaggccatggagaagcagattgccagcctcacaggcctggtgcagagcgcctta
ctgcgaggctctgagcctgagacccccagcgagaagattgaaggctccaatggagcagcc
accccctcagcaccctgtgggtcaggcggccggagcagcggggccaccccggtgtccggc
ccgcccccgccctcggccagcagcacccccgcaggtcagcctaccgccgttagccggctg
cagatgcagcttcacctgcgaggcctgcagaacagcgccagtgacttgcgcggccagctc
cagcagttgcgcaagctccagctacagaaccaggagtcggtgcgcgcgctgctgaagcgc
acggaggcagagctgagcatgcgcgtgtcggaggcggcgcggcggcaggaggacccgctg
cagcggcagcgcaccctggtggaagaggaacggctgcgctatctcaacgacgaggagctt
attacccagcagctcaatgacctggagaaatcggtggagaagatccagagagacgtgtcc
cacaaccaccggctggtgcccggccctgagctggaggagaaggcactggtgctgaagcag
ctcggggagacgctgacagagctcaaggctcacttcccgggcctgcagagcaagatgcgg
gtggtgctgcgcgtggaggtggaggcggtgaagttcctgaaggaggagccccagcgcctg
gatgggctcctcaagcgctgccgcggggtcacggacacgctggcccagatccgaaggcaa
gtggatgagggtgtgtggccaccccccaacaatctcctgagtcagtcccccaagaaggtg
acggcagagactgacttcaacaagagcgtggacttcgaaatgccaccccccagccccccg
ctgaacctgcatgagctgagcgggccagctgaaggagcctctcttacccccaaggggggc
aaccccaccaaaggcctggacactcctggcaagagaagcgtggacaaagctgtgtctgtt
gaggctgcagagcgagactgggaggagaagcgggcagccctgacccagtacagtgccaag
gacatcaaccggctgctggaagagacacaggcagagctgctcaaggccatccctgacctg
gactgtgccagcaaggcccatccaggcccggcccccactccagatcacaagccccccaag
gccccccacggccagaaggcagccccccgaacggagcccagtgggaggaggggctcagga
tctgttcaaccagatcctggtttagcagcccagaggtggggcctgagctgcagttctgac
aagctgccaggcgctgccagtgctgctggtccacagacaacacttggagtagcaagagtg
ctgtcaggagagggacagggcttggggcttgtgtgcaggagagatgagttgaccgtgccc
cgataccgcacagagaagccctccaagtcgcccccaccgccccctccccgccggagcttc
ccctcctcccatggcctgaccaccacacgtaccggagaggtggtggtcaccagcaagaag
gactcggccttcatcaagaaggctgagtccgaggagctggaggtgcagaagccccaggtg
aagctgcgccgggctgtgtctgaggtggcccgcccagcctccacaccacccatcatggcc
tcggccatcaaggacgaggatgacgaggatcgcatcatcgcagagctagaggtgtttgag
agaagctcagtgtcttccctcccccccacgccccgccgccagctgatccccaccttgctg
tccccccaggacctggggccccccgggggctcagccccaggccctacacggaaggctgcc
ccaggtcccagggccttctgcgtcccaaggatcatcttgacagagtgtgcccccaaccct
ccctccccgccagaggccagacttgaggaactgggacccaggacagctcccaccccaaga
cctcagaccctggctgacagcacgaggggctgggatggcccacagtccccaccaggggta
gtgggggagacttctgggccgaggagcagcttcatgccccggaaggagggggcagctctg
aagagactgggtggaggaggcagcagcctagaggatggaggagccagagtacagtgtcct
cagggaccagcccaggatgggactccagagacctctactgctgacacctacccagaggag
atcctcaaggactctggacatgatgcccaaacctgcagtagggagcatcagggccaggct
gctgccaactcaggccgcaccacgtggggcgccactgcccagcagatggacagcttggag
gagacgctccgggagctggaagccaccctgagcaacatgggcacaggccctgccatgggg
tcccctggcagccccccacccctacccctccgcccccagagtggcggaggcagtgtacca
cccatgaaggtggtgactccgggggcctctcggctgaaggcggcccagggccaggcgggc
agccccgacaaaagcaaacatggcaagcagagggccgagtacatgcggatccaggcccag
cagcaggctcctcgaatgcgtgtgtccagacataatgctcccccgaccacgctgccccag
ccggccactaaaccatctaaagagatgagcgggtcgaatgagacctcgagcccagtctca
gaaaagccctcggcttccagaacctctatccctgtattgacttcctttggggcaaggaat
tcttccatctccttctag

>gi568815581f:38358099_38598510|GENSCAN_predicted_peptide_6|471_aa
QGSSVAQRHGEFTLRFITFLPNPVPPEESVLQPAGDASTWRFSEPQMPCMSCVDWHDILP
VCVASKVAEWGTLVGPGKPISQIFIRSQKSQPHSPGSDTTLTVGGRKSGSGERHLAGERA
LWFPDTGRMTSCVNVAQGQPEDRVEGYIRDPERSSPPMLSADDAEYPREYRTLGGGGGGG
SGGRRFSNVGLVHTSERRHTVIAAQSLEALSGLQKADADRKRDAFMDHLKSKYPQHALAL
RGQQDRMREQVGGWTVDPVCLLSSLCSHLHGDSAPSGAGQPAQRRARWSRGYRGDWGPGR
HGGGRGGSRIAAKGTGGGGRGCGGGRGREEPGRVGVGALPPQPDAGPGAGDPARTPVAAG
DCPMDVHSLGTLGQAHVEAIASVVVARPCLYLPPRTTRWASVIRTRYPESDVLTLVAGLC
TGVRICTDTFWYGRWASGLQAVAAALGESGFACQGLVDGVGVWATPAGDLG

>gi568815581f:38358099_38598510|GENSCAN_predicted_CDS_6|1416_bp
cagggctcctctgtggcccagcgccatggagaattcacactgagattcatcaccttcctc
cccaaccctgttcctcctgaggagtccgtgctgcagccagctggagatgcatccacctgg
cgctttagcgagccccagatgccctgcatgagctgcgtggactggcatgacattctccct
gtctgcgtggcatctaaggtggcagagtggggcaccctggtgggcccagggaagccaatt
agccagatcttcattagaagccaaaaatcacaaccccacagtcctgggtctgatacaact
ctgacagtgggtggaaggaaatcgggctccggggaaaggcatctggctggagagagagca
ctgtggtttcccgacacaggacggatgacatcatgtgtgaatgttgcccaggggcagcca
gaggacagagtagagggttacatccgggatccggagcggagcagcccccccatgctgtct
gcggacgatgcggagtacccgcgggagtaccggaccctggggggcgggggcggcgggggc
agcgggggccggcgcttctccaacgtggggctggtgcacacgtccgagcggcggcacacg
gtgatcgcggcccagagtctggaggcgctcagcgggctccagaaggcggacgccgaccgc
aagcgtgatgccttcatggaccacctgaagagcaagtacccacagcacgccctggccctg
cgaggccagcaggacaggatgcgagagcaggttggcggctggaccgtggaccccgtatgc
ctcctcagctccctctgctcccacctccatggcgactccgccccctccggggctggccag
ccggcccagcgccgcgccaggtggagtcgcggttaccggggcgactggggcccgggccgg
cacggcggcggccgcggcggctctcgcattgcagcaaagggcaccggcggcggcggccgc
ggctgcggaggcggccgggggagagaagagcccgggcgggtgggagtaggggccctccct
ccccagccggacgcaggacctggggctggggaccccgcacgtacacctgtggccgcaggt
gactgccccatggatgtgcacagccttgggaccctggggcaggcccatgtggaggccata
gccagtgtggttgtggccaggccctgtctttatctccctcccaggaccacgaggtgggcc
tcagtcatccgaacaaggtaccctgagtctgatgtgttgacactggtggcagggctttgc
accggggtgcgcatctgcacggacaccttctggtatgggcggtgggcctcggggctgcag
gctgtggctgctgctttgggagagtctgggttcgcctgtcaggggcttgtggatggagta
ggagtttgggccacccctgctggggatctgggctag