GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:02:58

Sequence gi568815591f:101122603_101323845 : 201243 bp : 52.23% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   4522   4642  121  2  1   37   91    96 0.131   4.86
 1.02 Intr +   5791   6086  296  1  2   84   90   280 0.207  25.00
 1.03 Intr +   7819   8052  234  2  0   93  105   298 0.999  30.19
 1.04 Intr +   9273   9467  195  1  0   95   92   278 0.999  28.81
 1.05 Intr +  11093  11291  199  0  1   89   82   238 0.999  22.23
 1.06 Intr +  12892  12992  101  1  2   97  110    86 0.999  11.95
 1.07 Intr +  13113  13199   87  1  0   89  109   131 0.873  15.74
 1.08 Intr +  14399  14482   84  0  0  105   99   105 0.981  13.59
 1.09 Term +  14626  14840  215  2  2   71   46    80 0.722  -0.18
 1.10 PlyA +  16595  16600    6                               1.05

 2.00 Prom +  30819  30858   40                              -0.21
 2.01 Init +  31201  31206    6  0  0   74   66     7 0.744  -2.45
 2.02 Intr +  31508  31765  258  1  0  110   35   104 0.616   5.49
 2.03 Intr +  33992  34170  179  1  2  113   47   212 0.920  18.83
 2.04 Intr +  34775  34883  109  2  1   97   84   207 0.999  21.99
 2.05 Intr +  36457  36610  154  0  1  127   51   298 0.478  30.16
 2.06 Term +  37134  37228   95  1  2   87   45    71 0.500   0.99
 2.07 PlyA +  38651  38656    6                               1.05

 3.02 PlyA -  39061  39056    6                               1.05
 3.01 Sngl -  42241  40394 1848  1  0   49   42  2187 0.931 202.98
 3.00 Prom -  43006  42967   40                              -2.11

 4.08 PlyA -  47915  47910    6                               1.05
 4.07 Term -  50049  49477  573  0  0  131   54   917 0.521  87.46
 4.06 Intr -  50918  50694  225  2  0   54   94   438 0.999  39.71
 4.05 Intr -  52209  51894  316  2  1  115   77   453 0.598  43.52
 4.04 Intr -  56534  56495   40  0  1  124   89     7 0.056   2.17
 4.03 Intr -  58124  57972  153  0  0  107   38    27 0.029   0.16
 4.02 Intr -  59632  59512  121  1  1   85   37    90 0.080   4.17
 4.01 Init -  63289  63287    3  1  0  108   81     0 0.188   1.23
 4.00 Prom -  68903  68864   40                              -2.81

 5.13 PlyA -  70821  70816    6                               1.05
 5.12 Term -  73498  73344  155  2  2  104   47   204 0.992  16.30
 5.11 Intr -  73787  73585  203  1  2  136   92   198 0.995  24.45
 5.10 Intr -  75763  75589  175  2  1  113  100   194 0.999  22.61
 5.09 Intr -  76277  76024  254  1  2   29   85   268 0.358  18.21
 5.08 Intr -  77913  77745  169  1  1  117   47    23 0.339   0.62
 5.07 Intr -  78339  78144  196  0  1   24   72    76 0.032  -0.89
 5.06 Intr -  79353  79211  143  1  2   82  -12   250 0.042  14.98
 5.05 Intr -  79696  79542  155  0  2  103   71   189 0.997  18.93
 5.04 Intr -  79997  79841  157  0  1   97   76    71 0.963   6.38
 5.03 Intr -  80673  80469  205  0  1   66   69   180 0.439  13.20
 5.02 Intr -  81443  81211  233  0  2  114   30   215 0.341  16.32
 5.01 Init -  81866  81455  412  2  1   36 -155   344 0.421   1.62
 5.00 Prom -  82166  82127   40                              -8.48

 6.20 PlyA -  82290  82285    6                               1.05
 6.19 Term -  83834  83679  156  2  0  137   47   247 0.999  23.75
 6.18 Intr -  84302  84177  126  2  0  121   69   253 0.999  28.08
 6.17 Intr -  84929  84795  135  2  0   86  100   -14 0.639   0.87
 6.16 Intr -  85122  84976  147  0  0   51   91   327 0.999  30.24
 6.15 Intr -  86355  86251  105  0  0   86   94   114 0.933  12.71
 6.14 Intr -  87559  87491   69  1  0  101   99   142 0.926  16.47
 6.13 Intr -  88071  87729  343  2  1   85   72   416 0.867  35.69
 6.12 Intr -  89114  88989  126  1  0  114   65   304 0.984  31.00
 6.11 Intr -  89366  89244  123  1  0   74   39   249 0.508  18.81
 6.10 Intr -  89772  89651  122  0  2  110   58    96 0.935   8.60
 6.09 Intr -  90053  89928  126  2  0   80   55   177 0.992  14.98
 6.08 Intr -  90341  90240  102  2  0   79   94    89 0.992   9.47
 6.07 Intr -  90602  90505   98  0  2   94   45   196 0.963  16.13
 6.06 Intr -  92550  92487   64  0  1   73   66    79 0.583   2.98
 6.05 Intr -  93418  93306  113  2  2   74   49   247 0.999  20.00
 6.04 Intr -  93724  93561  164  0  2  103   89   254 0.983  27.13
 6.03 Intr -  93944  93808  137  2  2   77   65   154 0.999  11.87
 6.02 Intr -  94184  94093   92  0  2   76   76   161 0.988  13.91
 6.01 Init -  94672  94564  109  1  1  106   94   228 0.988  23.54
 6.00 Prom -  96685  96646   40                              -1.31

 7.00 Prom +  97437  97476   40                              -9.55
 7.01 Init +  97932  98010   79  2  1   57   92     1 0.516  -1.42
 7.02 Intr + 100002 100172  171  1  0   75   81   295 0.929  27.93
 7.03 Intr + 100875 100954   80  1  2   46   94    75 0.999   3.67
 7.04 Intr + 101071 101240  170  0  2   73   99   147 0.994  13.56
 7.05 Term + 101335 101356   22  1  1  143   52    17 0.955   1.67
 7.06 PlyA + 101567 101572    6                               1.05

 8.06 PlyA - 101827 101822    6                               1.05
 8.05 Term - 109913 109808  106  1  1   94   43   194 0.999  13.68
 8.04 Intr - 110118 110002  117  2  0   93   99    73 0.998   8.98
 8.03 Intr - 110312 110231   82  0  1   90   89   182 0.690  17.69
 8.02 Intr - 111840 111676  165  1  0   49   50   184 0.643  11.15
 8.01 Init - 114979 114763  217  1  1   96   60   377 0.970  34.34
 8.00 Prom - 115432 115393   40                              -7.99

 9.06 PlyA - 117035 117030    6                               1.05
 9.05 Term - 117301 117204   98  2  2  141   49   121 0.999  11.93
 9.04 Intr - 117645 117540  106  0  1   28   56   240 0.585  15.09
 9.03 Intr - 118304 118228   77  0  2  108  100   149 0.998  17.83
 9.02 Intr - 121537 121405  133  1  1   93   72   301 0.947  29.72
 9.01 Init - 122402 122358   45  2  0  101   81   129 0.860  14.13
 9.00 Prom - 150625 150586   40                              -2.51

10.04 PlyA - 150671 150666    6                               1.05
10.03 Term - 156293 156190  104  0  2  109   34    61 0.045   1.54
10.02 Intr - 165141 164881  261  1  0  116   73   274 0.680  26.70
10.01 Init - 167162 167087   76  2  1   72  101   127 0.795  11.61
10.00 Prom - 167493 167454   40                              -9.84

11.00 Prom + 168755 168794   40                              -0.71
11.01 Sngl + 178225 178614  390  0  0   58   48   184 0.488   7.96
11.02 PlyA + 178723 178728    6                               1.05

12.08 PlyA - 178976 178971    6                               1.05
12.07 Term - 180297 180207   91  2  1   53   52   116 0.025   1.99
12.06 Intr - 180609 180486  124  1  1  100   32   134 0.011   9.15
12.05 Intr - 186194 186015  180  0  0   99   46    86 0.010   5.86
12.04 Intr - 193940 193738  203  1  2  147   47   148 0.803  16.05
12.03 Intr - 195611 195522   90  1  0  120   44   105 0.986   8.91
12.02 Intr - 196430 196354   77  2  2   60   83    65 0.691   2.01
12.01 Init - 199107 199069   39  0  0   60  107    65 0.708   5.85

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  78252  78144  109  0  1   50   72    83 0.923   3.24
S.002 Term -  79353  79160  194  1  2   82   42   235 0.881  16.21
S.003 Init + 180212 180269   58  1  1   91   94   110 0.947  11.33
S.004 Term - 192680 192532  149  0  2   71   42   102 0.931   2.28

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:101122603_101323845|GENSCAN_predicted_peptide_1|510_aa XPRALSRRPTRPPPAAEFLQLSSRRQSRTNRQSQGTSENFRMQMSPALTCLVLGLALVFG EGSAVHHPPSYVAHLASDFGVRVFQQVAQASKDRNVVFSPYGVASVLAMLQLTTGGETQQ QIQAAMGFKIDGEPRDTRGDKGMAPALRHLYKELMGPWNKDEISTTDAIFVQRDLKLVQG FMPHFFRLFRSTVKQVDFSEVERARFIINDWVKTHTKGMISNLLGKGAVDQLTRLVLVNA LYFNGQWKTPFPDSSTHRRLFHKSDGSTVSVPMMAQTNKFNYTEFTTPDGHYYDILELPY HGDTLSMFIAAPYEKEVPLSALTNILSAQLISHWKGNMTRLPRLLVLPKFSLETEVDLRK PLENLGMTDMFRQFQADFTSLSDQEPLHVAQALQKVKIEVNESGTVASSSTAVIVSARMA PEEIIMDRPFLFVVRHNPTEEATFYHPQTVPGPRSVTQAAATPDSSLICLDLSPSTHIQP DELSHSFCFSTPMVLPPSGTVLFMGQVMEP >gi568815591f:101122603_101323845|GENSCAN_predicted_CDS_1|1533_bp nggccaagagcgctgtcaagaagacccacacgcccccctccagcagctgaattcctgcag ctcagcagccgccgccagagcaggacgaaccgccaatcgcaaggcacctctgagaacttc aggatgcagatgtctccagccctcacctgcctagtcctgggcctggcccttgtctttggt gaagggtctgctgtgcaccatcccccatcctacgtggcccacctggcctcagacttcggg gtgagggtgtttcagcaggtggcgcaggcctccaaggaccgcaacgtggttttctcaccc tatggggtggcctcggtgttggccatgctccagctgacaacaggaggagaaacccagcag cagattcaagcagctatgggattcaagattgatggtgagccacgggacaccaggggagac aagggcatggcccccgccctccggcatctgtacaaggagctcatggggccatggaacaag gatgagatcagcaccacagacgcgatcttcgtccagcgggatctgaagctggtccagggc ttcatgccccacttcttcaggctgttccggagcacggtcaagcaagtggacttttcagag gtggagagagccagattcatcatcaatgactgggtgaagacacacacaaaaggtatgatc agcaacttgcttgggaaaggagccgtggaccagctgacacggctggtgctggtgaatgcc ctctacttcaacggccagtggaagactcccttccccgactccagcacccaccgccgcctc ttccacaaatcagacggcagcactgtctctgtgcccatgatggctcagaccaacaagttc aactatactgagttcaccacgcccgatggccattactacgacatcctggaactgccctac cacggggacaccctcagcatgttcattgctgccccttatgaaaaagaggtgcctctctct 
gccctcaccaacattctgagtgcccagctcatcagccactggaaaggcaacatgaccagg ctgccccgcctcctggttctgcccaagttctccctggagactgaagtcgacctcaggaag cccctagagaacctgggaatgaccgacatgttcagacagtttcaggctgacttcacgagt ctttcagaccaagagcctctccacgtcgcgcaggcgctgcagaaagtgaagatcgaggtg aacgagagtggcacggtggcctcctcatccacagctgtcatagtctcagcccgcatggcc cccgaggagatcatcatggacagacccttcctctttgtggtccggcacaaccccacagag gaggctaccttctatcacccacagacagtgccgggtccccgctctgtgactcaggcagct gcgactccagacagctcactcatctgcctagatctcagtccttccacccacatccagcct gatgagctgtcccactccttctgcttctcaacccccatggttcttccaccctcaggaaca gtccttttcatgggccaagtgatggaaccctga >gi568815591f:101122603_101323845|GENSCAN_predicted_peptide_2|266_aa MTESCLLLRTPMIVPFTSHPFSRLFAPQGSKARGPYCPPPPSSNQPTGQPRYHDDRAPRS CSQGPGCHGFRFSASSPARAPRGVGACAMRFMLLFSRQGKLRLQKWYLATSDKERKKMVR ELMQVVLARKPKMCSFLEWRDLKVVYKRYASLYFCCAIEGQDNELITLELIHRYVELLDK YFGSVCELDIIFNFEKAYFILDEFLMGGDVQDTSKKSVLKAIEQADLLQEVRARDTPPPS PHYCLPPLCCAITEPWCREMSHRNLQ >gi568815591f:101122603_101323845|GENSCAN_predicted_CDS_2|801_bp atgacggagtcctgcctgctgcttcgaactccaatgatcgtcccattcacctctcacccc ttttcccgcctttttgccccacaaggctccaaagcgcgaggaccttattgtcctcctccc ccatcctccaaccagcccacaggacagccgcgttaccatgacgaccgggctcctagaagc tgcagtcaaggacctggttgccatggtttccgcttctccgcctccagccccgcccgcgct ccccgcggcgtcggcgcctgcgcaatgcggttcatgctattattcagccggcagggaaaa ctgcggctgcaaaaatggtacctggccacttcggacaaggaacggaagaagatggtgcgc gagctcatgcaggttgtcctggctcgaaagcccaagatgtgcagcttcctggagtggagg gacctcaaagttgtctataagagatatgccagcctctacttctgctgcgccatcgagggc caagacaatgagctcatcacactggagctgatccaccgatacgtggagctcttagacaaa tactttggcagtgtgtgcgagctggacatcatcttcaactttgagaaggcctacttcatc ctggatgagtttttgatggggggggatgtccaggacacctccaagaagagtgtgctgaaa gccatcgagcaggctgacctactgcaagaggtacgggccagggacacgcccccgccctca ccccattactgcctgccccctctctgctgtgccatcaccgagccctggtgcagggagatg agtcaccggaatctgcagtga >gi568815591f:101122603_101323845|GENSCAN_predicted_peptide_3|615_aa MKALRLSASALFCLLLINGLGAAPPGRPEAQPPPLSSEHKEPVAGDAVPGPKDGSAPEVR GARNSEPQDEGELFQGVDPRALAAVLLQALDRPASPPAPSGSQQGPEEEAAEALLTETVR 
SQTHSLPAPESPEPAAPPRPQTPENGPEASDPSEELEALASLLQELRDFSPSSAKRQQET AAAETETRTHTLTRVNLESPGPERVWRASWGEFQARVPERAPLPPPAPSQFQARMPDSGP LPETHKFGEGVSSPKTHLGEALAPLSKAYQGVAAPFPKARRPESALLGGSEAGERLLQQG LAQVEAGRRQAEATRQAAAQEERLADLASDLLLQYLLQGGARQRGLGGRGLQEAAEERES AREEEEAEQERRGGEERVGEEDEEAAEAEAEAEEAERARQNALLFAEEEDGEAGAEDKRS QEETPGHRRKEAEGTEEGGEEEDDEEMDPQTIDSLIELSTKLHLPADDVVSIIEEVEEKR KRKKNAPPEPVPPPRAAPAPTHVRSPQPPPPAPAPARDELPDWNEVLPPWDREEDEVYPP GPYHPFPNYIRPRTLQPPSALRRRHYHHALPPSRHYPGREAQARRAQEEAEAEERRLQEQ EELENYIEHVLLRRP >gi568815591f:101122603_101323845|GENSCAN_predicted_CDS_3|1848_bp atgaaagccctcagattgtcggcttccgccctcttctgccttctgctgatcaacgggtta ggggcagcaccccctggtcgccctgaggcgcagcctcctcctctcagctctgagcataaa gagccggtagccggggacgcagtgcccgggccaaaggatggcagcgccccagaggtccga ggcgctcggaattccgagccgcaggacgagggagagcttttccagggcgtggatccccgg gcgctggccgcggtgctgctgcaggcactcgaccgtcccgcctcacccccggcaccaagc ggctcccagcaggggccggaggaagaagcagctgaagctctgctgaccgagaccgtgcgc agccagacccacagcctcccggcgccggagagcccggagcccgcggctccgcctcgccct cagactccggagaatgggcccgaggcgagcgatccctccgaggagctcgaggcgctagcg tccctgctccaggaactgcgagatttcagtccaagtagcgccaagcgccagcaggagacg gcggcagcagagacggaaacccgcacgcacacgctgacccgagtgaatctggagagcccg gggccagagcgcgtatggcgcgcttcctggggagagttccaggcgcgtgtcccggagcgc gcgcccctgccgcccccggccccctctcaattccaggcgcgtatgcccgacagcgggccc cttcccgaaacccacaagttcggggaaggagtgtcctcccccaaaacacacctaggcgag gcattggcacccctgtccaaggcgtaccaaggcgtggccgccccgttccccaaggcgcgc cggccggagagcgcactcctgggcggctccgaggcgggcgagcgccttctccagcaaggg ctggcgcaggtggaggccgggcggcggcaggcggaggccacgcggcaggccgcggcgcag gaagagcggctggccgacctcgcctcggacctgctgctccagtatttgctgcagggcggg gcccggcagcgcggcctcgggggtcgggggctgcaggaggcggcggaggagcgagagagt gcaagggaggaggaggaggcggagcaggagagacgcggcggggaggagagggtgggggaa gaggatgaggaggcggccgaggcggaggcagaggcggaggaggcggagagggcgcggcag aacgcgctcctgttcgcggaggaggaggacggggaagccggcgccgaggacaagcgctcc caggaggagacgccgggccaccggcggaaggaggccgaggggacagaggagggcggggag gaggaggacgacgaggagatggatccgcagacgatcgacagcctcattgagctgtccacc 
aaactccacctgccagcggacgacgtggtcagcatcatcgaggaggtggaggagaagcgg aagcggaagaagaacgcccctcccgagcccgtgccgcccccccgtgccgcccccgccccc acccacgtccgctccccgcagcccccgccccccgcccccgctcccgcacgagacgagctg ccggactggaacgaggtgctcccgccctgggatcgggaggaggacgaggtgtacccgcca gggccgtaccaccctttccccaactacatccggccgcggacactgcagccgccctcggcc ttgcgccgccgccactaccaccacgccttgccgccttcgcgccactatcccggccgggag gcccaggcgcggcgcgcgcaggaggaggcggaggcggaggagcgccggctgcaggagcag gaggagctggagaattacatcgagcacgtgctgctccggcgcccgtga >gi568815591f:101122603_101323845|GENSCAN_predicted_peptide_4|476_aa MPFNPRLEFLYVGFQEHDLPSYCIGTSNGISLPDYEVITSERFLIPSISFRGRRLGVPFP AALIMERGDSIEGWLLEEGYFRAVGDPRTENGVSRSHRPQGHRYFRVMKLEASCGTATSE VPKPEKKTARDAEPSSETRPQEVEAEPRSGSGPEAEAEPLDFVVATEREFEEVLAISGGI YGGLDYLPSRYHSWLRDPDRTVVLAKRNGGVIALESVNVIDAGETVLVEGLRVAPWERGK GVAGLLQRFCSQLVKRQHPGVKVARLTRDDQLGPRELKKYRLITKQGILLVRFNASALLA GLGARLAALRTSGTFSPLPTEAVSEAGGDVARLLLSPSVQRDVLPGGTIIQDWQPYRPSE SNLRLLAAKGLEWRVDSRARPRVLTLCTRPFPIPHGGDGTWRYLNIDAFGSDGAQVQSQL LWHLQRQAPRLVGLNVMCQLFLEPQLWSQLADFCQVGLGLELVKGYTEQYLLEADI >gi568815591f:101122603_101323845|GENSCAN_predicted_CDS_4|1431_bp atgccctttaaccctcggttggaattcctgtatgtgggattccaggaacatgaccttcct tcttattgcattggcacatccaatgggatcagtttgcctgattatgaagttatcacctcg gaaaggtttctgattccctctatcagtttccggggacgaaggctcggagtccccttccca gccgctcttataatggagagaggagattccattgaggggtggctcctggaggaggggtac ttcagggcagtgggggacccccggacagagaacggagtgtcaagaagccatcggcctcaa gggcatcgatatttcagggtcatgaagctggaagccagctgtggcacagccacctcagag gtccctaagccggaaaagaagactgcccgagatgcagagccaagctctgaaacccggcca caggaggtggaggccgagcccaggtcgggatcggggcctgaggctgaggccgagccattg gacttcgtggtggccacggaacgggagtttgaggaagtgctggccatctcggggggcatc tacggcggcctggactaccttcctagccgctaccacagctggctccgggaccccgaccgc acggtggtgctggccaagcgcaacggaggcgtgatcgcgctggagtcggtgaacgtgatc gacgccggggagacggtgctggtggaggggctgcgcgtggcgccctgggagcgcgggaag ggcgtggccgggctgctgcagcgcttctgctcgcagctggtcaagagacagcacccgggg gtcaaggtggcacggctcacccgggacgaccagctgggcccccgggagctgaagaaatac 
cgcctaatcaccaagcagggcatccttttggtccgattcaacgcgtccgcgctgctggcc gggctgggcgcgcggctggcggcgctgcggacctctggcaccttctcgccgctgcccacc gaggccgtgtccgaggcaggcggcgacgtggcacgcctcctgctgtcaccctccgtgcag cgcgacgtgcttccaggcgggaccatcatccaggactggcagccctaccggcctagcgaa agcaacctgcgcctgctggcggccaagggcctggagtggcgcgtggacagccgcgcgcgc ccgcgcgtgctcacgctgtgcacgcgccccttccccatcccgcacggaggggacggcact tggcgctatctcaacatcgacgccttcggtagcgacggcgcgcaggtgcagagccagctg ctgtggcacctgcagcgccaggccccgcgcctcgttggcctcaacgtcatgtgccagctc ttcctggagccccagctgtggtcacagctggctgacttctgccaggtcgggctgggactg gagctggtgaagggttatactgaacagtacctgctggaggccgacatctga >gi568815591f:101122603_101323845|GENSCAN_predicted_peptide_5|818_aa MKEEDVLNFLAAGNQLGGTNLDLQMEQYMYKRKSDGIYIPNLKRTWEKLLLAARGIVAIE NPADVSVIPSRNTGQRAVLKFAAATGATPIAGRFTPGTFTNQIQAAFREPRLLVFTDPRA DQQPLTEASYVNLPTLLYSPLRHVDIAIPSNNKGAHSVGLMWWMLAREVLRMRGTISCEH PWEVMPDLYFYRDHEDIEKEEQAAAEKAVTRRNFRLVKTAKLGTSWNYLFDFHPHRVLVV GAFANFCTEPTGCSCLFPKLPPHLLMLPCWFHLLFFQDYIMSGGLVSFVKAPLPQWWPGG CPGVGGPLQALEAKPGQLSLPIRNQKRLVKSALELGENELFQQFPNPQSSWVQRTQEALR PLLSVALQLFLGRRGLPLPFRAPIRTVVGSAIPVQQSPPPSPAQVDTLQARYVGRLTQLF EEHQARYGVPADRHLGESLPRSTSKPSSFVLRVLGSCWKRGASAMGVATTLQPPTTSKTL QKQHLEAVGAYQYVLTFLFMGPFFSLLVFVLLFTSLWPFSVFYLVWLYVDWDTPNQGELR MGAQVARDGGDMGGEGSRCLAFHPPFILLNTPKLVKTAELPPDRNYVLGAHPHGIMCTGF LCNFSTESNGFSQLFPGLRPWLAVLAGLFYLPVYRDYIMSFGLCPVSRQSLDFILSQPQL GQAVVIMVGGAHEALYSVPGEHCLTLQKRKGFVRLALRHGASLVPVYSFGENDIFRLKAF ATGSWQHWCQLTFKKLMGFSPCIFWGRGLFSATSWGLLPFAVPITTVVGRPIPVPQRLHP TEEEVNHYHALYMTALEQLFEEHKESCGVPASTCLTFI >gi568815591f:101122603_101323845|GENSCAN_predicted_CDS_5|2457_bp atgaaggaggaagatgtccttaatttccttgcagcaggaaaccaattaggtggcaccaac cttgacttacagatggaacagtacatgtataaaaggaaaagtgatggcatctacatccca aatctgaagaggacctgggagaagcttctgctggcagctcgtggcattgttgccattgaa aaccccgctgatgtcagtgtcataccctccagaaatactggccagagggccgtgctgaag tttgctgctgccactggagccactccaatcgctggccgcttcactcctggaaccttcact aaccagatccaggcagccttccgggagccacggcttcttgtgtttactgaccccagggct gatcagcagcctctcacagaggcatcttatgttaacctacctaccttgctctattctcct 
ctgcgccatgtggacattgccatcccaagcaacaacaagggagctcactcagtgggtctg atgtggtggatgctggctcgggaagttctgcgcatgcgtggcaccatttcctgtgaacac ccgtgggaggtcatgcctgatctctacttctacagagatcatgaagacattgaaaaagaa gagcaggctgctgctgaaaaggctgtgacaaggaggaatttcaggctagttaaaactgca aagttgggcacctcctggaactacctctttgacttccaccctcacagggtcctggtcgtg ggagccttcgccaacttctgcacagagcccacgggctgctcctgcctcttccccaaactc ccgccacacctgctcatgctgccttgttggttccatctcctcttcttccaggactacatc atgtcaggtggtttggtctcctttgtcaaggccccgctgcctcagtggtggccaggtggc tgtcctggcgtgggagggcccctgcaggcgctggaggcaaaacccggacaactgagcttg ccgattcggaatcagaagagattggttaagtcagctctggaactcggggagaatgagctc ttccagcagttcccgaacccgcagagctcgtgggtgcagaggacgcaggaggctctgcgt ccgctgctaagcgtggccctgcagctgttcctgggccgccggggcctcccgctgcccttc cgcgcgcccatccgcaccgtagtggggtcggcgattcccgtgcagcagagccccccgccc agtccggcccaggtggacacgctgcaagcgcgctacgtggggcgactcacgcagctcttc gaggagcaccaggcgcgctatggtgtccccgccgacagacacctgggagaaagtctgccc aggtccacatccaagccttcatcgtttgtcctccgggttctgggatcctgctggaagagg ggagcttctgcaatgggagttgccacaaccctgcagcccccaaccacttccaaaaccttg cagaagcagcatctagaagcagtgggcgcctaccaatatgtgctcactttcctcttcatg ggccctttcttctcccttcttgtctttgtcctcctcttcacgtcactctggcccttctct gttttttacttggtgtggctctatgtggactgggacacacccaaccaaggtgagctccgg atgggggcccaggtagccagagatggtggggacatggggggtgaggggagccggtgtctt gccttccatcctcccttcatcctgctcaacaccccgaagctggtgaaaacagcagagctg cccccggatcggaactacgtgctgggcgcccaccctcatgggatcatgtgtacaggcttc ctctgtaatttctccaccgagagcaatggcttctcccagctcttcccggggctccggccc tggttagccgtgctggctggcctcttctacctcccggtctatcgcgactacatcatgtcc tttggactctgtccggtgagccgccagagcctggacttcatcctgtcccagccccagctc gggcaggccgtggtcatcatggtggggggtgcgcacgaggccctgtattcagtccccggg gagcactgccttacgctccagaagcgcaaaggcttcgtgcgcctggcgctgaggcacggg gcgtccctggtgcccgtgtactcctttggggagaatgacatctttagacttaaggctttt gccacaggctcctggcagcattggtgccagctcaccttcaagaagctcatgggcttctct ccttgcatcttctggggtcgcggtctcttctcagccacctcctggggcctgctgcccttt gctgtgcccatcaccactgtggtgggccgccccatccccgtcccccagcgcctccacccc 
accgaggaggaagtcaatcactatcacgccctctacatgacggccctggagcagctcttc gaggagcacaaggaaagctgtggggtccccgcttccacctgcctcaccttcatctag >gi568815591f:101122603_101323845|GENSCAN_predicted_peptide_6|818_aa MTSSGPGPRFLLLLPLLLPPAASASDRPRGRDPVNPEKLLVITVATAETEGYLRFLRSAE FFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWLKKEMEKYADREDMIIMFVDSYDVILAG SPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQYPEVGTGKRFLNSGGFIGFATTIHQIV RQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLDHKSRIFQNLNGALDEVVLKFDRNRVRI RNVAYDTLPIVVHGNGPTKLQLNYLGNYVPNGWTPEGGCGFCNQDRRTLPGGQPPPRVFL AVFVEQPTPFLPRFLQRLLLLDYPPDRVTLFLHNNEVFHEPHIADSWPQLQDHFSAVKLV GPEEALSPGEARDMAISPLTSRDLCRQDPECEFYFSLDADAVLTNLQTLRILIEENRKVI APMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQRKRVGVWNVPYISQAYVIRGDTLRM ELPQRDVFSGSDTDPDMAFCKSFRDKVSAGARSGLGAVPQTPGIACITDTPTPLQGIFLH LSNQHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPD VYWFPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQ LLRTYVGPMTESLFPGYHTKSRTGSLHADPFRPRQHLGSMHFSSQGNETLQEAPQGREGA RLRPKARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYEGGGCRFLRYDCVI SSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP >gi568815591f:101122603_101323845|GENSCAN_predicted_CDS_6|2457_bp atgacctcctcggggcctggaccccggttcctgctgctgctgccgctgctgctgccccct gcggcctcagcctccgaccggccccggggccgagacccggtcaacccagagaagctgctg gtgatcactgtggccacagctgaaaccgaggggtacctgcgtttcctgcgctctgcggag ttcttcaactacactgtgcggaccctgggcctgggagaggagtggcgagggggtgatgtg gctcgaacagttggtggaggacagaaggtccggtggttaaagaaggaaatggagaaatac gctgaccgggaggatatgatcatcatgtttgtggatagctacgacgtgattctggccggc agccccacagagctgctgaagaagttcgtccagagtggcagccgcctgctcttctctgca gagagcttctgctggcccgagtgggggctggcggagcagtaccctgaggtgggcacgggg aagcgcttcctcaattctggtggattcatcggttttgccaccaccatccaccaaatcgtg cgccagtggaagtacaaggatgatgacgacgaccagctgttctacacacggctctacctg gacccaggactgagggagaaactcagccttaatctggatcataagtctcggatctttcag aacctcaacggggctttagatgaagtggttttaaagtttgatcggaaccgtgtgcgtatc cggaacgtggcctacgacacgctccccattgtggtccatggaaacggtcccactaagctg cagctcaactacctgggaaactacgtccccaatggctggactcctgagggaggctgtggc 
ttctgcaaccaggaccggaggacactcccgggggggcagcctcccccccgggtgtttctg gccgtgtttgtggaacagcctactccgtttctgccccgcttcctgcagcggctgctactc ctggactatccccccgacagggtcacccttttcctgcacaacaacgaggtcttccatgaa ccccacatcgctgactcctggccgcagctccaggaccacttctcagctgtgaagctcgtg gggccggaggaggctctgagcccaggcgaggccagggacatggccatctcgcctctcacc tccagggacctgtgtcggcaggaccccgagtgtgagttctacttcagcctggacgccgac gctgtcctcaccaacctgcagaccctgcgtatcctcattgaggagaacaggaaggtgatc gcccccatgctgtcccgccacggcaagctgtggtccaacttctggggcgccctgagcccc gatgagtactacgcccgctccgaggactacgtggagctggtgcagcggaagcgagtgggt gtgtggaatgtaccatacatctcccaggcctatgtgatccggggtgataccctgcggatg gagctgccccagagggatgtgttctcgggcagtgacacagacccggacatggccttctgt aagagctttcgagacaaggtgagcgcgggtgcacggtctggcctgggggcagtcccccag actccaggcatcgcctgcatcacggacacccccaccccactacagggcatcttcctccat ctgagcaatcagcatgaatttggccggctcctggccacttccagatacgacacggagcac ctgcaccccgacctctggcagatcttcgacaaccccgtcgactggaaggagcagtacatc cacgagaactacagccgggccctggaaggggaaggaatcgtggagcagccatgcccggac gtgtactggttcccactgctgtcagaacaaatgtgtgatgagctggtggcagagatggag cactacggccagtggtcaggcggccggcatgaggattcaaggctggctggaggctacgag aatgtgcccaccgtggacatccacatgaagcaggtggggtacgaggaccagtggctgcag ctgctgcggacgtatgtgggccccatgaccgagagcctgtttcccggttaccacaccaag tcccggacaggaagtctccacgcagacccctttcggccccggcagcatttgggctccatg cattttagttcccaggggaatgagacgctgcaggaggcaccccaggggagggagggcgcc aggctgagacccaaggcgcgggcggtgatgaactttgtggttcgctaccggccagacgag cagccgtctctgcggccacaccacgactcatccaccttcaccctcaacgttgccctcaac cacaagggcctggactatgagggaggtggctgccgcttcctgcgctacgactgtgtgatc tcctccccgaggaagggctgggcactcctgcaccccggccgcctcacccactaccacgag gggctgccaacgacctggggcacacgctacatcatggtgtcctttgtcgacccctga >gi568815591f:101122603_101323845|GENSCAN_predicted_peptide_7|173_aa MILSHTAVVRVKWFKCCKELSTCLAHIRSQDPGQRRVLDRAARQRRINRQLEALENDNFQ DDPHAGLPQLGKRLPQFDDDADTGKKKKKTRGDHFKLRFRKNFQALLEEQNLSVAEGPNY LTACAGPPSRPQRPFCAVCGFPSPYTCVSCGARYCTVRCLGTHQETRCLKWTV >gi568815591f:101122603_101323845|GENSCAN_predicted_CDS_7|522_bp 
atgatactgtcccatacagctgttgtaagggttaaatggttcaagtgttgcaaagagctg agcacgtgcctggctcatattcgctcccaggaccccgggcagcggcgggtgctggaccgg gctgcccggcagcgtcgcatcaaccggcagctggaggccctggagaatgacaacttccag gatgacccccacgcgggactccctcagctcggcaagagactgcctcagtttgatgacgat gcggacactggaaagaaaaagaagaaaacccgaggtgatcattttaaacttcgcttccga aaaaactttcaggccctgttggaggagcagaacttgagtgtggccgagggccctaactac ctgacggcctgtgcgggacccccatcgcggccccagcgccccttctgtgctgtctgtggc ttcccatccccctacacctgtgtcagctgcggtgcccggtactgcactgtgcgctgtctg gggacccaccaggagaccaggtgtctgaagtggactgtgtga >gi568815591f:101122603_101323845|GENSCAN_predicted_peptide_8|228_aa MSMAVETFGFFMATVGLLMLGVTLPNSYWRVSTVHGNVITTNTIFENLWFSCATDSLGVY NCWEFPSMLALSGYIQACRALMITAILLGFLGLLLGIAGLRCTNIGGLELSRKAKLAATA GALHILAGICGMVAISWYAFNITRDFFDPLYPGTKYELGPALYLGWSASLISILGGLCLC SACCCGSDEDPAASARRPYQAPVSVMPVATSDQEGDSSFGKYGRNAYV >gi568815591f:101122603_101323845|GENSCAN_predicted_CDS_8|687_bp atgtcgatggctgtggaaacctttggcttcttcatggcaactgtggggctgctgatgctg ggggtgactctgccaaacagctactggcgagtgtccactgtgcacgggaacgtcatcacc accaacaccatcttcgagaacctctggtttagctgtgccaccgactccctgggcgtctac aactgctgggagttcccgtccatgctggccctctctgggtatattcaggcctgccgggca ctcatgatcaccgccatcctcctgggcttcctcggcctcttgctaggcatagcgggcctg cgctgcaccaacattgggggcctggagctctccaggaaagccaagctggcggccaccgca ggggccctccacattctggccggtatctgcgggatggtggccatctcctggtacgccttc aacatcacccgggacttcttcgaccccttgtaccccggaaccaagtacgagctgggcccc gccctctacctggggtggagcgcctcactgatctccatcctgggtggcctctgcctctgc tccgcctgctgctgcggctctgacgaggacccagccgccagcgcccggcggccctaccag gctccagtgtccgtgatgcccgtcgccacctcggaccaagaaggcgacagcagctttggc aaatacggcagaaacgcctacgtgtag >gi568815591f:101122603_101323845|GENSCAN_predicted_peptide_9|152_aa MEAVLNELVSVEDLLKFEKKFQSEKAAGSVSKSTQFEYAWCLVRSKYNDDIRKGIVLLEE LLPKGSKEEQRDYVFYLAVGNYRLKEYEKALKYVRGLLQTEPQNNQAKELERLIDKAMKK DGLVGMAIVGGMALGVAGLAGLIGLAVSKSKS >gi568815591f:101122603_101323845|GENSCAN_predicted_CDS_9|459_bp atggaggccgtgctgaacgagctggtgtctgtggaggacctgctgaagtttgaaaagaaa 
tttcagtctgagaaggcagcaggctcggtgtccaagagcacgcagtttgagtacgcctgg tgcctggtgcggagcaagtacaatgatgacatccgtaaaggcatcgtgctgctcgaggag ctgctgcccaaagggagcaaggaggaacagcgggattacgtcttctacctggccgtgggg aactaccggctcaaggaatacgagaaggccttaaagtacgtccgcgggttgctgcagaca gagccccagaacaaccaggccaaggaactggagcggctcattgacaaggccatgaagaaa gatggactcgtgggcatggccatcgtgggaggcatggccctgggtgtggcgggactggcc ggactcatcggacttgctgtgtccaagtccaaatcctga >gi568815591f:101122603_101323845|GENSCAN_predicted_peptide_10|146_aa MVRMVPVLLSLLHLLGPAIPQETQDGHYSLTYLYTGLSRPGKGTHRLQGTVFLNGRAFFH YNSEDRKPEPLGPWRHVEGVEDWEKQSQVQKAREDIFMETLNNIMEYYNDSNASCVICDR STPRASSSVTSTRPLSSGNHQLMNKG >gi568815591f:101122603_101323845|GENSCAN_predicted_CDS_10|441_bp atggtaagaatggtgcctgtcctgctgtctctgctgcaccttctgggtcctgctatcccc caggagacccaagatggtcattactctctgacctatctctacactgggctgtccaggcct ggcaaaggcacccacaggctgcagggtactgtcttcctcaatggccgtgccttcttccac tacaacagtgaagacaggaagcctgagcccctgggaccatggagacacgtggaaggagta gaggactgggagaagcagagccaagttcagaaggccagggaggacatctttatggagacc ctgaacaacatcatggagtattacaatgacagtaacgcctcttgtgttatatgtgatcgg agcacccccagagcttcgtcatccgtgacgagcacccgccctctctccagtgggaatcat cagctgatgaataagggctga >gi568815591f:101122603_101323845|GENSCAN_predicted_peptide_11|129_aa MRRSQRKDPGDMKKQGNVTPSEETSNSPATDPHPKEIQESSNRELSRELTGLIVKKVINM EESFEQQHKEMRKRGGEIYEMITCQKEVLKFNRRVFLELKKSLDEIQSTLKSFNDRLEQI EEKLSGHKI >gi568815591f:101122603_101323845|GENSCAN_predicted_CDS_11|390_bp atgcgcagatctcaacggaaggacccaggagacatgaaaaagcaaggaaatgtgacgcct tcagaagaaaccagtaattctccagcaacagatccccatccaaaagaaattcaggaaagc tcaaacagagaattgagcagagaattgaccggactgattgtaaagaaggtcatcaacatg gaagagtcttttgaacaacaacataaagaaatgaggaaaagaggtggggagatatatgag atgattacctgccagaaagaggttttaaaattcaacagaagagtatttctggaactgaag aaatcattggatgaaatacaaagtacactcaaaagcttcaatgatagactagaacaaata gaagaaaaactctcagggcataaaatctga >gi568815591f:101122603_101323845|GENSCAN_predicted_peptide_12|267_aa MLKAKILFVGPCESGKTVLANFLTESSDITEYSPTQGVRILEFENPHVTSNNKGTGCEFE LWDCGGDAKFESCWPALMKDAHGVVIVFNADIPSHRKEMEMWYSCFVQQPSLQDTQCMLI 
AHHKPGSGDDKGSLSLSGGGGNRRQGRSCPRGWTLGVAGAAAAAAQGKEARGLHVGSTWA PATPLGNLSPAAPYSGAGRRRLQLQLLRAGKPVGSTWAPATPLGNLSPAAPYSGGPRGRS AASVKTYRRLRFPPEERRVPAASTRPS >gi568815591f:101122603_101323845|GENSCAN_predicted_CDS_12|804_bp atgctgaaagccaagatcctcttcgtggggccttgcgagagtggaaaaactgttttggcc aactttctgacagaatcttctgacatcactgaatacagcccaacccaaggagtgaggatc ctagaatttgagaacccgcatgttaccagcaacaacaaaggcacgggctgtgaattcgag ctatgggactgtggtggcgatgctaagtttgagtcctgctggccggccctgatgaaggat gctcatggagtggtgatcgtcttcaatgctgacatcccaagccaccggaaggaaatggag atgtggtattcctgctttgtccaacagccgtccttacaggacacacagtgtatgctaatt gcacaccacaaaccaggctctggagatgataaaggaagcctgtctttgtctggcggcggc ggtaaccgcaggcaaggcaggagctgccctcggggctggactctcggggtggcaggggct gcagctgcagctgctcagggcaaggaagcccgtgggctccacgtgggctccacgtgggct cctgcaacgccgctggggaatctctcgcccgctgccccctattctggagctgggcggcgg cggctgcagctgcagctgctcagggcagggaagcccgtgggctccacgtgggctcctgca acgccgctggggaatctctcgcccgctgccccctactctggaggtccacgaggccgatcg gctgcaagtgtgaaaacctaccggcgccttcggttcccgccagaggagaggcgcgtgccg gcggcatccaccaggccatcttga