GENSCAN 1.0 Date run: 5-Nov-116 Time: 06:40:12

Sequence gi568815576r:42068656_42315305 : 246650 bp : 49.30% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5362   5686  325  0  1   80   78    69 0.097   0.65
 1.02 Term +   8625   9418  794  1  2  116   41   770 0.460  68.56
 1.03 PlyA +   9707   9712    6                              -5.80

 2.02 PlyA -   9888   9883    6                               1.05
 2.01 Sngl -  11280  10753  528  0  0  104   41   178 0.246  11.00
 2.00 Prom -  14287  14248   40                              -0.96

 3.04 PlyA -  14780  14775    6                               1.05
 3.03 Term -  17659  17528  132  1  0  113   45    45 0.957   0.79
 3.02 Intr -  18520  18405  116  2  2   89   88   114 0.990  11.67
 3.01 Init -  22089  21951  139  0  1  110   85   223 0.951  24.50
 3.00 Prom -  23754  23715   40                              -3.06

 4.03 PlyA -  24934  24929    6                               1.05
 4.02 Term -  39716  39128  589  1  1   22   42   293 0.701  12.09
 4.01 Init -  40298  40090  209  2  2   88   42    87 0.666   2.39
 4.00 Prom -  43992  43953   40                              -4.86

 5.00 Prom +  46332  46371   40                              -6.36
 5.01 Init +  51563  51580   18  1  0   79  117    38 0.980   5.88
 5.02 Intr +  55111  55149   39  0  0  101   94     7 0.669   1.02
 5.03 Intr +  55392  55550  159  2  0  138   94    84 0.843  14.18
 5.04 Intr +  55865  55907   43  1  1  103   99    16 0.858   2.01
 5.05 Term +  56220  56302   83  1  2   76   52   102 0.736   3.26
 5.06 PlyA +  56348  56353    6                               1.05

 6.10 PlyA -  57867  57862    6                               1.05
 6.09 Term -  58097  57919  179  0  2  137   39   144 0.573  12.25
 6.08 Intr -  58337  58196  142  2  1   82   51   236 0.896  19.23
 6.07 Intr -  58979  58792  188  0  2   54  102   273 0.999  24.71
 6.06 Intr -  59328  59187  142  0  1   52   49   222 0.924  14.63
 6.05 Intr -  59695  59519  177  1  0   50  100   146 0.926  12.12
 6.04 Intr -  60289  60129  161  2  2  101   64   268 0.963  25.41
 6.03 Intr -  60530  60378  153  0  0   84   80   282 0.999  27.04
 6.02 Intr -  61254  60930  325  0  1   55   33   377 0.706  24.25
 6.01 Init -  62136  61957  180  0  0   69   94   306 0.987  28.48
 6.00 Prom -  62261  62222   40                             -10.35

 7.02 PlyA -  63231  63226    6                               1.05
 7.01 Sngl -  64292  63942  351  2  0   49   44   312 0.979  16.98
 7.00 Prom -  65205  65166   40                              -8.96

 8.00 Prom +  65673  65712   40                              -8.26
 8.01 Init +  67035  67088   54  2  0   82   94   -40 0.490  -2.61
 8.02 Intr +  67263  67301   39  2  0  101   94     7 0.650   1.02
 8.03 Intr +  67544  67702  159  1  0  138   94    84 0.823  14.18
 8.04 Intr +  68017  68059   43  0  1  103   99    16 0.860   2.01
 8.05 Term +  68606  68739  134  0  2  114   48    68 0.364   3.65
 8.06 PlyA +  70598  70603    6                               1.05

 9.11 PlyA -  71571  71566    6                               1.05
 9.10 Term -  71801  71623  179  0  2  137   39   221 0.999  19.95
 9.09 Intr -  72041  71900  142  2  1   82   51   244 0.896  20.03
 9.08 Intr -  72684  72497  188  1  2   34  102   222 0.998  17.61
 9.07 Intr -  73020  72879  142  0  1   76   49   188 0.983  13.63
 9.06 Intr -  73389  73213  177  0  0   50  100   162 0.935  13.72
 9.05 Intr -  73984  73824  161  2  2   88   64   299 0.999  27.21
 9.04 Intr -  74225  74073  153  0  0  101   80   291 0.999  29.64
 9.03 Intr -  74701  74589  113  0  2    4   64    94 0.497  -1.38
 9.02 Intr -  74949  74745  205  1  1  -35   40   352 0.515  16.36
 9.01 Init -  75809  75629  181  2  1  110   94   224 0.978  24.54
 9.00 Prom -  78209  78170   40                              -9.95

10.09 PlyA -  81200  81195    6                               1.05
10.08 Term -  81409  81231  179  2  2  145   40   202 0.986  18.95
10.07 Intr -  81649  81508  142  1  1   79   51   232 0.896  18.53
10.06 Intr -  82236  82100  137  1  2  -24  102   205 0.911  10.99
10.05 Intr -  82640  82489  152  1  2   16   49   204 0.845   9.11
10.04 Intr -  83603  83450  154  0  1  112   28   254 0.933  20.93
10.03 Intr -  83844  83692  153  1  0  120   31   273 0.999  24.84
10.02 Intr -  84545  84329  217  2  1   65   43   314 0.984  22.68
10.01 Init -  86346  86170  177  0  0  110   94   301 0.960  30.16
10.00 Prom -  87562  87523   40                              -2.56

11.08 PlyA -  88049  88044    6                               1.05
11.07 Term -  88845  88776   70  2  1   98   54    68 0.107   1.81
11.06 Intr - 100048 100002   47  1  2  129   60    34 0.092   1.91
11.05 Intr - 101241 101192   50  1  2  127  113    37 0.740   8.50
11.04 Intr - 107610 107575   36  1  0   99   65    37 0.533   0.73
11.03 Intr - 111047 110954   94  2  1  127  110    95 0.998  15.14
11.02 Intr - 146686 140996 5691  1  0  108  101  2535 0.966 244.28
11.01 Init - 158792 158730   63  2  0   99   88     4 0.308   2.65
11.00 Prom - 163475 163436   40                              -3.36

12.00 Prom + 175381 175420   40                              -3.46
12.01 Init + 178809 178912  104  2  2   74   17    99 0.472   1.11
12.02 Intr + 199725 199790   66  0  0   64   46    96 0.212   1.02
12.03 Intr + 200852 200951  100  2  1   60   30    68 0.247  -1.69
12.04 Intr + 201239 201370  132  1  0   64   74   127 0.690   9.74
12.05 Intr + 203977 204134  158  0  2  118   26    47 0.416   0.31
12.06 Intr + 205709 205785   77  2  2   98   77    -3 0.204  -1.24
12.07 Intr + 207888 207947   60  1  0   95   61    67 0.228   3.41
12.08 Intr + 209397 209580  184  1  1   54   26   103 0.024  -0.45
12.09 Intr + 216106 216198   93  1  0   61   99    90 0.077   6.48
12.10 Intr + 227375 227493  119  2  2   89  109    50 0.002   7.31
12.11 Intr + 230956 231065  110  2  2   98   91   -13 0.007   0.00
12.12 Intr + 240905 241102  198  1  0   89    3    90 0.305   0.15
12.13 Term + 241152 241259  108  2  0  120   51    92 0.556   7.21
12.14 PlyA + 243722 243727    6                               1.05

13.02 PlyA - 244948 244943    6                              -0.45
13.01 Sngl - 245841 245476  366  0  0   77   49   232 0.832  12.44

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 211280 211358   79  1  1   24  109   121 0.874   7.03
S.002 Term - 229752 229569  184  2  1  112   47   128 0.842   8.12
S.003 Intr - 232531 232436   96  0  0   69  119    55 0.809   5.82

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576r:42068656_42315305|GENSCAN_predicted_peptide_1|372_aa
MDRVFRTVGWVRGPRTVRGGPLGSRPPAPTSAVARAVHRPLGPQFPALADLEAASEIPGL
ANTGRGGALTPASRAGSPGSRGHVAAQAEARDLELGSPSWASVRSAAGVPVGAMKLNERS
VAHYALSDSPADHMGFLRTWGGPGTPPTPSGTGRRCWFVLKGNLLFSFESREGRAPLSLV
VLEGCTVELAEAPVPEEFAFAICFDAPGVRPHLLAAEGPAAQEAWVKVLSRASFGYMRLV
VRELESQLQDARQSLALQRRSSWKSVASRCKPQAPNHRAAGLENGHCLSKDSSPVGLVEE
AGSRSAGWGLAEWELQGPASLLLGKGQSPVSPETSCFSTLHDWYGQEIVELRQCWQKRAQ
GSHSKCEEQDRP

>gi568815576r:42068656_42315305|GENSCAN_predicted_CDS_1|1119_bp
atggacagagttttccggacggtgggctgggtccgaggcccgcggaccgtccgagggggg
cctctgggttcgaggccgcccgcaccgaccagtgctgtggctcgggcagttcaccgccct
ctcgggcctcagtttcccgccctggcagacctcgaggcggcctcggagatcccgggcctg
gcgaatactgggcggggaggggcgttaactccagcctcccgggcggggtctcccggctcc
cgcggccacgttgcagcccaggcggaggcgcgggatctcgagctgggatcgccctcttgg
gcttcggtgcgatcggcggcgggagttcccgtgggagccatgaagctgaacgagaggagt
gtagcccactatgcactcagcgactccccagcggaccacatgggcttcctgcgcacctgg
gggggcccagggaccccaccgacccccagtggcactggccgaagatgctggtttgtcctc
aagggcaacctgctattctcctttgagagtcgcgagggccgggccccactgagcctggtg
gtgctggaaggctgcacagtggaactggccgaggctcccgtgcccgaggagtttgccttt
gccatctgctttgatgcccctggagtgcgcccacacctgctggccgcagaagggccggcg
gcccaggaggcctgggtgaaggtgctgtcccgggcaagctttggctacatgcgcctggtg
gtacgcgagttggagagccagttgcaggacgcacgccagagcctggctttgcaacgccgc
tcatcctggaagtctgttgccagccgctgtaagccccaggctcctaaccaccgagctgcg
ggcctggagaatggccactgcctctccaaggacagcagccctgtgggcttggttgaagaa
gcgggcagcaggtctgcagggtgggggttggctgagtgggagctgcagggccctgccagc
ctcctcctaggcaaggggcagagccctgtgtcccctgagacctcctgcttctctaccctg
catgactggtatggccaggagatcgtggagctgcggcagtgttggcagaagagggcccag
gggagccactcaaaatgtgaggaacaggataggccctaa

>gi568815576r:42068656_42315305|GENSCAN_predicted_peptide_2|175_aa
MAPLRVTMTDLDGTRLRPEPLHAAETSPSFLKLGPLRRAPDLTGANTSQRAAPDAMPPAP
PLTRAPAPHRPALVPPSGRKPPEASREFEEYTRSPALPLAQCRRLLGRREGGVCHWPAWW
RHESGGAAPAPSVRVLLGVAAVLGPPGRQNRPQRPVVEINPGSGRSNNEKALLVF

>gi568815576r:42068656_42315305|GENSCAN_predicted_CDS_2|528_bp
atggcgccgctgcgggtaacgatgactgacctcgacggtaccaggctccggcctgagccg
ctccatgcggcggagacatcgccatctttcctcaagctaggcccgctccggagagccccg
gacctgacgggtgccaatactagccagcgagccgctccggacgccatgcccccagctccg
cccctcacccgggcacccgcaccccaccgcccagccctcgtgccgccctcgggaagaaag
ccgcccgaagcctcgcgagagtttgaggagtacacgagaagcccagccctcccactcgcg
cagtgccggcggctcctggggcggcgggagggaggtgtttgtcactggccggcatggtgg
cgtcacgagagtggcggcgcagccccggctccctcagtccgcgtcctgctgggcgtggcc
gcggtcttgggtccgcccggccgtcagaacaggcctcagcggccggttgtggaaataaac
ccagggagcggaaggagtaataacgaaaaagctttactagtgttttaa

>gi568815576r:42068656_42315305|GENSCAN_predicted_peptide_3|128_aa
MAGSGVRQATSTASTFVKPIFSRDMNEAKRRVRELYRAWYREVPNTVHQFQLDITVKMGR
DKVREMFMKNAHVTDPRVVDLLVIKGKIELEETIKVWKQRTHVMRFFHETEAPRPKDFLS
KFYVGHDP

>gi568815576r:42068656_42315305|GENSCAN_predicted_CDS_3|387_bp
atggcggggagcggcgtccgccaagctacttctaccgccagcaccttcgtgaagcccatt
ttcagtcgggacatgaacgaggccaagcggagggtgcgcgagctctaccgcgcctggtat
cgggaggtgccgaacactgtgcaccaattccagctggacatcactgtgaaaatgggacgg
gataaagtccgagaaatgtttatgaagaatgcccatgtcacagaccccagggtggttgat
cttctggtcattaagggaaagatcgaactggaagaaacaattaaagtatggaagcagcgg
acacatgttatgcggttcttccatgaaacagaagcgccaaggccaaaggatttcctatcc
aagttctatgttggccacgatccatga

>gi568815576r:42068656_42315305|GENSCAN_predicted_peptide_4|265_aa
MPPQKGGDGIKPPPIIGRFGTSLKIGIVGLPNVGKSTFFNVLTNSQASAENFPFCTIDPN
ESRVPVPDERSKEPVRFYHDWNDKEIEVLNKHLFLTSKPMVYLVNLSEKDYIRKKNKWLI
KIKEWVDKYDPGALVIPFSGALELKLQELSAEERQQYLEANMTQSALPKIIKAGFAALQL
KYFFTAGPDEVRARTIRKGTKAPQAAGKIHTDFEKGFIMAEVMKYEDFKEEGSENAVKAA
GKYRQQGRNYIVEDGDIIFFKFNTP

>gi568815576r:42068656_42315305|GENSCAN_predicted_CDS_4|798_bp
atgccccctcaaaagggaggtgatggaattaaaccacccccaatcattggaagatttgga
acctcactgaaaattggtattgttggattgccaaatgttgggaaatctactttcttcaat
gtattaaccaatagtcaggcttcagcagaaaacttcccattctgcactattgatcctaat
gagagcagagtacctgtgccagatgaaagatcaaaagaacctgttcgcttctatcatgat
tggaatgacaaagagattgaagtgttgaataaacacttatttttgacttcaaaaccaatg
gtctacttggttaatctttctgaaaaagactacattagaaagaaaaacaaatggctgata
aaaattaaagagtgggtggacaagtatgacccaggtgccttggtcattccttttagtggg
gccttggaactcaagttgcaagaattgagtgctgaggagagacagcagtatctggaagcg
aacatgacacaaagtgctttgccaaagatcattaaggctgggtttgcagcactccaacta
aaatactttttcactgcaggcccagatgaagtgcgtgcacggaccatcaggaaagggact
aaggctcctcaggctgcaggaaagattcacacagattttgaaaagggattcattatggct
gaagtaatgaaatatgaagattttaaagaggaaggttctgaaaatgcagtcaaggctgct
ggaaagtacagacaacaaggcagaaattatattgttgaagatggagatattatcttcttc
aaatttaacacaccttaa

>gi568815576r:42068656_42315305|GENSCAN_predicted_peptide_5|113_aa
MAYEKPRRKVKSRKKPTSEVTTPRRPGGLNAAAPKEEAAVLSQEGEQVKSPGEEAPSPIP
AEQEVAGTPDWEENKKVQKEVAAYPSEASEDSKEQRPWDRVYVPMTELWLDWF

>gi568815576r:42068656_42315305|GENSCAN_predicted_CDS_5|342_bp
atggcctatgagaagccgagacgcaaagtcaagagtaggaagaagccaacctctgaggtc
acgacccccaggagacctggaggactgaatgctgctgcccccaaggaggaggctgccgtc
ttatcccaggagggagagcaggtgaagtccccaggggaggaagcacctagccccattcct
gctgagcaggaggtggcaggtaccccagactgggaggaaaataaaaaggttcaaaaggaa
gttgctgcgtatccatctgaggcctctgaggacagcaaagagcaaaggccctgggaccgg
gtctacgtgcccatgacagagctctggctggactggttctga

>gi568815576r:42068656_42315305|GENSCAN_predicted_peptide_6|548_aa
MGLEALVPLAVIVAIFLLLVDLMHRRQRWAARYPPGPLPLPGLGNLLHVDFQNTPYCFDQ
LRRRFGDVFSLQLAWTPVVVLNGLAAVREALVTHGEDTADRPPVPITQILGFGPRSQGKQ
RWGQRQISVGPGWVMTVVRAGQRGRGVVDMKQASEWGQRAKKPPALGRWVFLARYGPAWR
EQRRFSVSTLRNLGLGKKSLEQWVTEEAACLCAAFANHSGRPFRPNGLLDKAVSNVIASL
TCGRRFEYDDPRFLRLLDLAQEGLKEESGFLREVLNAVPVLLHIPALAGKVLRFQKAFLT
QLDELLTEHRMTWDPAQPPRDLTEAFLAEMEKAKGNPESSFNDENLRIVVADLFSAGMVT
TSTTLAWGLLLMILHPDVQRRVQQEIDDVIGQVRRPEMGDQAHMPYTTAVIHEVQRFGDI
VPLGVTHMTSRDIEVQGFRIPKGTTLITNLSSVLKDEAVWEKPFRFHPEHFLDAQGHFVK
PEAFLPFSAGRRACLGEPLARMELFLFFTSLLQHFSFSVPTGQPRPSHHGVFAFLVSPSP
YELCAVPR

>gi568815576r:42068656_42315305|GENSCAN_predicted_CDS_6|1647_bp
atggggctagaagcactggtgcccctggccgtgatagtggccatcttcctgctcctggtg
gacctgatgcaccggcgccaacgctgggctgcacgctacccaccaggccccctgccactg
cccgggctgggcaacctgctgcatgtggacttccagaacacaccatactgcttcgaccag
ttgcggcgccgcttcggggacgtgttcagcctgcagctggcctggacgccggtggtcgtg
ctcaatgggctggcggccgtgcgcgaggcgctggtgacccacggcgaggacaccgccgac
cgcccgcctgtgcccatcacccagatcctgggtttcgggccgcgttcccaaggcaagcag
cggtggggacagagacagatttccgtgggacccgggtgggtgatgaccgtagtccgagct
gggcagagagggcgcggggtcgtggacatgaaacaggccagcgagtggggacagcgggcc
aagaaaccacctgcactagggaggtgggtgttcctggcgcgctatgggcccgcgtggcgc
gagcagaggcgcttctccgtgtccaccttgcgcaacttgggcctgggcaagaagtcgctg
gagcagtgggtgaccgaggaggccgcctgcctttgtgccgccttcgccaaccactccgga
cgcccctttcgccccaacggtctcttggacaaagccgtgagcaacgtgatcgcctccctc
acctgcgggcgccgcttcgagtacgacgaccctcgcttcctcaggctgctggacctagct
caggagggactgaaggaggagtcgggctttctgcgcgaggtgctgaatgctgtccccgtc
ctcctgcatatcccagcgctggctggcaaggtcctacgcttccaaaaggctttcctgacc
cagctggatgagctgctaactgagcacaggatgacctgggacccagcccagcccccccga
gacctgactgaggccttcctggcagagatggagaaggccaaggggaaccctgagagcagc
ttcaatgatgagaacctgcgcatagtggtggctgacctgttctctgccgggatggtgacc
acctcgaccacgctggcctggggcctcctgctcatgatcctacatccggatgtgcagcgc
cgtgtccaacaggagatcgacgacgtgatagggcaggtgcggcgaccagagatgggtgac
caggctcacatgccctacaccactgccgtgattcatgaggtgcagcgctttggggacatc
gtccccctgggtgtgacccatatgacatcccgtgacatcgaagtacagggcttccgcatc
cctaagggaacgacactcatcaccaacctgtcatcggtgctgaaggatgaggccgtctgg
gagaagcccttccgcttccaccccgaacacttcctggatgcccagggccactttgtgaag
ccggaggccttcctgcctttctcagcaggccgccgtgcatgcctcggggagcccctggcc
cgcatggagctcttcctcttcttcacctccctgctgcagcacttcagcttctcggtgccc
actggacagccccggcccagccaccatggtgtctttgctttcctggtgagcccatccccc
tatgagctttgtgctgtgccccgctag

>gi568815576r:42068656_42315305|GENSCAN_predicted_peptide_7|116_aa
MQQGLHLSLSVVPHLEPRAICLFPDHGRQHSTRLRLEQAQDSLIEGCGLAQIIAYTQGQL
CHFCHHHIRLLQPHQVPLMDEPVTPSGLINPASSSTAATCSVSTGSKHGSCHTIHL

>gi568815576r:42068656_42315305|GENSCAN_predicted_CDS_7|351_bp
atgcagcagggcctccacctgtccctctccgtagtcccacacctggaacccagagccatc
tgcctcttcccagatcatggccgacagcactccaccagactgcggctggagcaggcacag
gattcacttattgagggctgtggcctggcacagatcatagcctatacccagggacagttg
tgtcacttctgccaccaccacatccgccttctgcagccacatcaagtaccactcatggat
gagcccgtcacccccagcggacttatcaaccccgcgtccagctccacagccgccacgtgc
tcggtgagcactggctccaagcatggcagctgccatacaatccacctgtag

>gi568815576r:42068656_42315305|GENSCAN_predicted_peptide_8|142_aa
MVRPVWGLMNKCYTVPTQRRKVKSRKKPTSEVTTPRRPGGLNAAAPKEEAAVLSQEGEQV
KSPGEEAPSPIPAEQEVAGTPDWEENKKVQKEVAAYPSAAGTALPATEPCDHSGLSLEGP
KGALATLLTDSVPGDRPTSLSC

>gi568815576r:42068656_42315305|GENSCAN_predicted_CDS_8|429_bp
atggtcaggccagtgtgggggctaatgaataaatgctacactgtgcccactcagagacgc
aaagtcaagagtaggaagaagccaacctctgaggtcacgacccccaggagacctggagga
ctgaatgctgctgcccccaaggaggaggctgccgtcttatcccaggagggagagcaggtg
aagtccccaggggaggaagcacctagccccattcctgctgagcaggaggtggcaggtacc
ccagactgggaggaaaataaaaaggttcaaaaggaagttgctgcgtatccatctgctgca
ggaacagcacttcctgccaccgagccgtgtgaccacagtggattgtctctggaggggccc
aagggggccctggccacccttctgactgactcggtgccaggggacagaccaacgtccctc
tcgtgctga

>gi568815576r:42068656_42315305|GENSCAN_predicted_peptide_9|546_aa
MGLEALVPLAMIVAIFLLLVDLMHRHQRWAARYPPGPLPLPGLGNLAACGLPEHTILLRP
EALTLPLQLRRRFGDVFSLQLAWTPVVVLNGLAAVREAMVTRGEDTADRPPAPIYQVLGF
GPRSQGKRRDRSPSSADRAWGPGRETEIKASEWAEDSGPGNHLHGGGVILSRYGPAWREQ
RRFSVSTLRNLGLGKKSLEQWVTEEAACLCAAFADQAGRPFRPNGLLDKAVSNVIASLTC
GRRFEYDDPRFLRLLDLAQEGLKEESGFLREVLNAVPVLPHIPALAGKVLRFQKAFLTQL
DELLTEHRMTWDPAQPPRDLTEAFLAKKEKAKGSPESSFNDENLRIVVGNLFLAGMVTTS
TTLAWGLLLMILHLDVQLRVQQEIDDVIGQVRRPEMGDQAHMPCTTAVIHEVQHFGDIVP
LGVTHMTSRDIEVQGFRIPKGTTLITNLSSVLKDEAVWKKPFRFHPEHFLDAQGHFVKPE
AFLPFSAGRRACLGEPLARMELFLFFTSLLQHFSFSVAAGQPRPSHSRVVSFLVTPSPYE
LCAVPR

>gi568815576r:42068656_42315305|GENSCAN_predicted_CDS_9|1641_bp
atggggctagaagcactggtgcccctggccatgatagtggccatcttcctgctcctggtg
gacctgatgcaccggcaccaacgctgggctgcacgctacccgccaggtcccctgccactg
cccgggctgggcaaccttgctgcatgtggacttccagaacacaccatactgcttcgacca
gaggccctgaccctccctctgcagttgcggcgccgcttcggggacgtgttcagcctgcag
ctggcctggacgccggtggtcgtgctcaatgggctggcggccgtgcgcgaggcgatggtg
acccgcggcgaggacacggccgaccgcccgcctgcgcccatctaccaggtcctgggcttc
gggccgcgttcccaaggcaagcggcgtgaccgtagcccaagcagcgccgacagggcgtgg
ggtcctggacgtgaaacagagataaaggccagcgagtgggctgaggacagtgggccagga
aaccacctgcacgggggaggggtgatcctgtcgcgctatgggcccgcgtggcgcgagcag
aggcgcttctccgtgtccaccttgcgcaacttgggcctgggcaagaagtcgctggagcag
tgggtgaccgaggaggccgcctgcctttgtgccgccttcgccgaccaagccggacgcccc
tttcgccccaacggcctcttggacaaagccgtgagcaacgtgatcgcctccctcacctgc
gggcgccgcttcgagtacgacgaccctcgcttcctcaggctgctggacctagctcaggag
ggactgaaggaggagtcgggcttcctgcgcgaggtgctgaatgctgtccccgtcctcccg
cacatcccagcgctggctggcaaggtcctacgcttccaaaaggctttcctgacccagctg
gatgagctgctaactgagcacaggatgacctgggacccagcccagccaccccgagacctg
actgaggccttcctggcaaagaaggagaaggccaaggggagccctgagagcagcttcaat
gatgagaacctgcgcatagtggtgggtaacctgttccttgccgggatggtgaccacctcg
accacgctggcctggggcctcctgctcatgatcctacacctggatgtgcagctccgtgtc
caacaggagatcgacgacgtgatagggcaggtgcggcgaccagagatgggtgaccaggct
cacatgccctgcaccactgccgtgattcacgaggtgcagcactttggggacatcgtcccc
ctgggtgtgacccatatgacatcccgtgacatcgaagtacagggcttccgcatccctaag
ggaacgacactcatcaccaacctgtcatcggtgctgaaggatgaggccgtctggaagaag
cccttccgcttccaccccgaacacttcctggatgcccagggccactttgtgaagccggag
gccttcctgcctttctcagcaggccgccgtgcatgcctcggggagcccctggcccgcatg
gagctcttcctcttcttcacctccctgctgcagcacttcagcttctccgtggccgccgga
cagccccggcccagccactctcgtgtcgtcagctttctggtgaccccatccccctacgag
ctttgtgctgtgccccgctag

>gi568815576r:42068656_42315305|GENSCAN_predicted_peptide_10|436_aa
MGLDALVPLAVTVAIFLLLVDLMQQHQRWTARYPPGPLPLPGLGNLLHVDFQNIYTFNQL
RHRFGDVFSLQLAWMPVVVLNGLAAVREALVTCGEDTADRPPAPIYQVLGIGPRSQGKRR
WGTETAFPWVLGVFLAHYGHAWREQRRFSVSTLRNLGLGKKSLERWVTEEAACLCAAFAD
QARRPFHPNGLLNKAASNVIASLTCGCRFEYDDPRFLRLLDLAQKGLKEELGFLPAQAKG
NPESSFNDENLRMVVADLFFAGMVTTSITLAWGLLLMILRPDVQQMGDQARMPCTTAVIH
EVQRFGDIVPLGVTHMTSRDIEVQGFRIPKGMMLFTNLSSVLKDEAVWEKPFRFHPEHFL
DAQGHFVKLEAFLPFSAGRRACLGEPLARIELFLFFTSLLQHFSFSVPTGQPRPSHSRVV
GFLVTPSPYELCAVPR

>gi568815576r:42068656_42315305|GENSCAN_predicted_CDS_10|1311_bp
atggggctggatgcactggtgcccctggcagtgacagtggccatcttcctgctcctggtg
gacctgatgcagcagcaccaacgctggactgcacgctacccgccaggccccctgccactg
cccgggctgggcaacttgctgcatgtggacttccagaacatatacaccttcaaccagctg
cggcaccgctttggggacgtgttcagcctgcagctggcctggatgccggtggtcgtgctc
aatgggctggcggccgtgcgtgaggctctggtgacctgcggcgaggacaccgccgaccgc
ccgcctgcgcccatctaccaggtcctgggcatcgggccgcgctcccaaggcaagcggcgg
tgggggacagagactgcgtttccgtgggtcctgggggtgtttctggcacactacggacac
gcgtggcgcgagcagaggcgcttctccgtgtccaccttgcgcaacttgggcctgggcaag
aagtccctggagcggtgggtgaccgaggaggccgcctgcctctgtgccgccttcgccgac
caagccagacgcccctttcaccccaacggcctcctgaacaaagcggcgagcaacgtgatc
gcctccctcacctgcgggtgccgcttcgagtacgacgaccctcgcttcctcaggctactg
gacctagctcagaagggattgaaggaggagctgggctttctccctgctcaggccaagggg
aaccctgagagcagcttcaatgatgagaacctgcgcatggtggtggctgacctgttcttt
gccgggatggtgaccacctcgatcacgctggcctggggcctcctgctcatgatcctacgc
ccggatgtgcagcagatgggtgaccaggctcgcatgccctgcaccactgccgtgattcac
gaggtgcagcgctttggggacatcgtccccctgggtgtgacccatatgacatcccgtgac
atcgaagtacagggcttccgcatccctaaggggatgatgctcttcaccaacctgtcatcg
gtgctgaaggatgaggccgtctgggagaagcccttccgcttccaccccgaacacttcctg
gatgcccagggccactttgtgaagctggaggccttcctgcctttctcagcaggccgccgt
gcatgcctcggggagcccctggcccgcatagagctcttcctcttcttcacctccctgctg
cagcacttcagcttctcggtgcccaccggacagccccggcccagccactctcgtgtcgtc
ggctttctggtgacgccatccccctatgagctttgtgctgtgccccgctag

>gi568815576r:42068656_42315305|GENSCAN_predicted_peptide_11|2016_aa
MELHAFSLLSKGSGKDNGANEEGCWPAAVLLNSMQSFREQSSYHGNQQSYPQEVHGSSRL
EEFSPRQAQMFQNFGGTGGSSGSSGSGSGGGRRGAAAAAAAMASETSGHQGYQGFRKEAG
DFYYMAGNKDPVTTGTPQPPQRRPSGPVQSYGPPQGSSFGNQYGSEGHVGQFQAQHSGLG
GVSHYQQDYTGPFSPGSAQYQQQASSQQQQQQVQQLRQQLYQSHQPLPQATGQPASSSSH
LQPMQRPSTLPSSAAGYQLRVGQFGQHYQSSASSSSSSSFPSPQRFSQSGQSYDGSYNVN
AGSQYEGHNVGSNAQAYGTQSNYSYQPQSMKNFEQAKIPQGTQQGQQQQQPQQQQHPSQH
VMQYTNAATKLPLQSQVGQYNQPEVPVRSPMQFHQNFSPISNPSPAASVVQSPSCSSTPS
PLMQTGENLQCGQGSVPMGSRNRILQLMPQLSPTPSMMPSPNSHAAGFKGFGLEGVPEKR
LTDPGLSSLSALSTQVANLPNTVQHMLLSDALTPQKKTSKRPSSSKKADSCTNSEGSSQP
EEQLKSPMAESLDGGCSSSSEDQGERVRQLSGQSTSSDTTYKGGASEKAGSSPAQGAQNE
PPRLNASPAAREEATSPGAKDMPLSSDGNPKVNEKTVGVIVSREAMTGRVEKPGGQDKGS
QEDDPAATQRPPSNGGAKETSHASLPQPEPPGGGGSKGNKNGDNNSNHNGEGNGQSGHSA
AGPGFTSRTEPSKSPGSLRYSYKDSFGSAVPRNVSGFPQYPTGQEKGDFTGHGERKGRNE
KFPSLLQEVLQGYHHHPDRRYSRSTQEHQGMAGSLEGTTRPNVLVSQTNELASRGLLNKS
IGSLLENPHWGPWERKSSSTAPEMKQINLTDYPIPRKFEIEPQSSAHEPGGSLSERRSVI
CDISPLRQIVRDPGAHSLGHMSADTRIGRNDRLNPTLSQSVILPGGLVSMETKLKSQSGQ
IKEEDFEQSKSQASFNNKKSGDHCHPPSIKHESYRGNASPGAATHDSLSDYGPQDSRPTP
MRRVPGRVGGREGMRGRSPSQYHDFAEKLKMSPGRSRGPGGDPHHMNPHMTFSERANRSS
LHTPFSPNSETLASAYHANTRAHAYGDPNAGLNSQLHYKRQMYQQQPEEYKDWSSGSAQG
VIAAAQHRQEGPRKSPRQQQFLDRVRSPLKNDKDGMMYGPPVGTYHDPSAQEAGRCLMSS
DGLPNKGMELKHGSQKLQESCWDLSRQTSPAKSSGPPGMSSQKRYGPPHETDGHGLAEAT
QSSKPGSVMLRLPGQEDHSSQNPLIMRRRVRSFISPIPSKRQSQDVKNSSTEDKGRLLHS
SKEGADKAFNSYAHLSHSQDIKSIPKRDSSKDLPSPDSRNCPAVTLTSPAKTKILPPRKG
RGLKLEAIVQKITSPNIRRSASSNSAEAGGDTVTLDDILSLKSGPPEGGSVAVQDADIEK
RKGEVASDLVSPANQELHVEKPLPRSSEEWRGSVDDKVKTETHAETVTAGKEPPGAMTST
TSQKPGSNQGRPDGSLGGTAPLIFPDSKNVPPVGILAPEANPKAEEKENDTVTISPKQEG
FPPKGYFPSGKKKGRPIGSVNKQKKQQQPPPPPPQPPQIPEGSADGEPKPKKQRQRRERR
KPGAQPRKRKTKQAVPIVEPQEPEIKLKYATQPLDKTDAKNKSFYPYIHVVNKCELGAVC
TIINAEEEEQTKLVRGRKGQRSLTPPPSSTESKALPASSFMLQGPVVTESSVMGHLVCCL
CGKWASYRNMGDLFGPFYPQDYAATLPKNPPPKRATEMQSKVKVRHKSASNGSKTDTEEE
EEQQQQQKEQRSLAAHPRFKRRHRSEDCGGGPRSLSRGLPCKKAATEGSSEKTVLDSKPS
VPTTSEGGPELELQIPELPLDSNEFWVHEGCILWANGIYLVCGRLYGLQEALEIAREMKC
SHCQEAGATLGCYNKGCSFRYHYPCAIDAASVGETVKKNKRDCLLHEENFSVRCPKHKNK
TAKGSLSTEQSERGVQAALDTHRLRIHKFNHTSKIT

>gi568815576r:42068656_42315305|GENSCAN_predicted_CDS_11|6051_bp
atggagcttcatgccttcagcctgctctcaaaggggtcaggaaaagataatggagcaaat
gaggagggctgttggcctgctgctgtgctgctgaacagtatgcagtcctttcgggagcaa
agcagttaccacggaaaccagcaaagctacccacaggaggtacacggctcatcccggcta
gaagagttcagccctcgtcaggcccagatgttccagaattttggaggtacaggtggcagt
agtggcagcagtggcagtggcagtggtggtggacgacgaggagcagcagctgctgcggca
gcgatggctagcgagacctctggccatcaaggttaccagggtttcaggaaagaggctgga
gatttttactacatggcaggcaacaaagaccccgtgactacaggaaccccacagcctcct
cagcgaaggccttctgggcctgtgcagagctatggacccccccaggggagcagctttggc
aatcagtatgggagtgagggtcatgtgggccagtttcaagcacagcactctggccttggc
ggtgtgtcacattatcagcaggattacactgggcctttctctccagggagtgctcagtac
caacagcaggcttccagccagcagcagcagcagcaagtccagcagttgagacaacagctt
taccagtcccatcagcccctgccacaggccactggccaaccagcatccagctcatcccat
ctacagccaatgcagcggccctcaactctgccatcctctgctgctggttaccagttaaga
gtgggtcagtttggccaacactatcagtcttctgcttcctcctcctcctcctcctccttc
ccttcaccacagcgttttagccagtctggacagagctatgatggcagttacaatgtgaat
gctggatctcagtatgaaggacacaatgtgggttctaatgcacaggcttatggaacacaa
tccaattacagctatcagcctcaatctatgaagaattttgaacaggcaaagattccacaa
gggacccaacaggggcagcagcagcagcaaccgcagcaacaacaacacccttctcagcat
gtgatgcagtatactaacgctgccaccaagctgcccctgcaaagccaagtggggcagtac
aaccagcctgaggttcctgtgaggtcccccatgcagtttcaccagaacttcagccccatt
tctaacccttctccagctgcctctgtggttcagtctccaagctgtagttctaccccatct
cctctcatgcagactggggagaatctccagtgtgggcaaggcagtgtgcctatgggttcc
agaaacagaattttacagttaatgcctcaactcagtccaaccccatcaatgatgcccagt
cctaattctcatgctgcaggcttcaaagggtttggactagaaggggtaccagaaaagcga
ctgacagatcctgggttgagtagtttgagtgctctgagtactcaagtggccaatcttcct
aacactgtccagcacatgttactttctgatgccctgactcctcagaagaagacctccaag
aggccctcatcttccaagaaagcagatagctgcacaaattctgaaggctcctcacaacct
gaagaacagctgaagtcccctatggcagagtcattagatggaggctgctccagcagttca
gaggatcaaggcgagagagtgcggcaactaagtggccagagcaccagctctgacaccacc
tacaagggtggagcctctgagaaagctggctcctcaccggcacaaggtgctcagaatgaa
ccccccagactcaatgctagtcctgccgcaagagaagaggccacctcaccaggcgctaag
gacatgccattgtcatccgacgggaacccaaaggttaatgagaagactgttggggtgatt
gtctcccgggaagccatgacaggtcgggtagaaaagcctggtggacaagataaaggctcc
caagaggatgatcctgcagccactcaaaggccacctagcaatggtggggcaaaggaaacc
agtcatgcatcacttccccagccagagcctccaggaggaggagggagcaaaggaaacaag
aatggcgataacaactccaaccataatggagaaggaaatggccagagtggccactctgca
gcgggccctggttttacgagcagaactgagcctagcaaatctcctggaagtctgcgctat
agttacaaagatagtttcgggtcagccgtgccacgaaatgtcagtggctttcctcagtat
cctacagggcaagaaaagggagatttcactggccatggggaacgaaagggtagaaatgaa
aaattcccaagcctcctgcaggaagtgcttcagggttaccaccaccaccctgacaggaga
tattctaggagtactcaagagcatcaggggatggctggtagcctagaaggaaccacaagg
cccaatgtcttggttagtcaaaccaatgaattagctagcaggggccttctgaacaaaagc
attgggtctctattagaaaatccccactggggcccctgggaaaggaaatcaagcagcaca
gctcctgaaatgaaacagatcaatttgactgactatccaattcccagaaagtttgaaata
gagcctcagtcatcagcacatgagcctgggggttccctctctgaaagaagatcagtgatc
tgtgatatttctccactaagacagattgtcagggacccaggggctcactcactgggacac
atgagtgccgacaccagaattgggaggaatgaccgtctcaatccaactttaagtcagtcg
gtcattcttcctggtggtttggtgtccatggaaaccaagctgaaatcccagagcgggcag
ataaaagaggaagactttgaacagtctaaatctcaagctagtttcaacaacaagaaatct
ggagaccactgccatcctcctagcatcaagcatgagtcttaccgcggcaatgccagccct
ggagcagcaacccatgattccctttcagactatggcccgcaagacagcagacccacgcca
atgcggcgggtccctggcagagttggtggtcgggagggcatgaggggtcggtccccttct
caatatcatgactttgcagaaaaattgaaaatgtctcctgggcggagcagaggcccaggg
ggagaccctcatcacatgaatccacacatgaccttttcagagagggctaaccggagttct
ttacacactcccttttctcccaactcagaaaccctggcctctgcttatcatgcaaatact
cgggctcatgcttatggggaccctaacgcaggtttgaattctcagctgcattataagaga
cagatgtaccaacagcaaccagaggagtataaagactggagcagcggttctgctcaggga
gtaattgctgcagcacagcacaggcaggaggggccacggaagagtccaaggcagcagcag
tttcttgacagagtacggagccctctgaaaaatgacaaagatggtatgatgtatggccca
ccagtggggacttaccatgaccccagtgcccaggaggctgggcgctgcctaatgtctagt
gatggtctgcctaacaagggcatggaattaaagcatggctcccagaagttacaagaatcc
tgttgggatctttctcggcaaacttctccagccaaaagcagcggtcctccaggaatgtcc
agtcaaaaaaggtatgggccgccccatgagactgatggacatggactagctgaggctaca
cagtcatccaaacctggtagtgttatgctgagacttccaggccaggaggatcattcttct
caaaaccccttaatcatgaggaggcgtgttcgttcttttatctctcccattcccagtaag
agacagtcacaagatgtaaagaacagtagcactgaagataaaggtcgcctccttcactca
tcaaaagaaggcgctgataaagcattcaattcctatgcccatctttctcacagtcaggat
atcaagtctatccctaagagagattcctccaaggaccttccaagtccagatagtagaaac
tgccctgctgttaccctcacaagccctgctaagaccaaaatactgcccccacggaaagga
cggggattgaaattggaagctatagttcagaagattacatccccaaatattaggaggagc
gcatcttcgaacagtgcggaggctgggggagacacggttacgcttgatgatatactgtct
ttgaagagtggtcctcctgaaggtgggagtgttgctgttcaggatgctgacatagagaag
agaaaaggtgaggtggcttcggacctagtcagtccagcaaaccaggagttgcacgtagag
aaacctcttccaaggtcttcagaagagtggcgtggcagcgtggatgacaaagtgaagaca
gagacacatgcagaaacagttactgccggaaaggaaccccctggtgccatgacatccaca
acctcacagaagcctggtagtaaccaagggagaccagatggttccctgggtggaacagca
cctttaatctttccagactcaaagaatgtacctccagtgggcatattggcccctgaggca
aaccccaaggctgaagagaaggagaacgatacagtgacgatttcaccgaagcaagagggt
ttccctccaaagggatatttcccatcaggaaagaagaaggggagacccattggtagtgtg
aataagcaaaagaaacagcagcagccaccgcctccaccccctcagcccccacagatacca
gaaggttctgcagatggagagccaaagccaaaaaaacagaggcaaaggagggagagaagg
aagcctggggcccagccgaggaagcgaaaaaccaaacaagcagttcccattgtggaaccc
caagaacctgagatcaaactaaaatatgccacccagccactggataaaactgatgccaag
aacaagtctttttacccttacatccatgtagtaaataagtgtgaacttggagccgtttgt
acaatcatcaatgctgaggaagaagaacagaccaaattagtgaggggcaggaagggtcag
aggtcactgacccctccacctagcagcactgaaagcaaggcgctcccggcctcgtccttt
atgctgcagggacctgttgtgacagagtcttcggttatggggcacctggtttgctgtctg
tgtggcaagtgggccagttaccggaacatgggtgacctctttggacctttttatccccaa
gattatgcagccactctcccgaagaatccacctcctaagagggccacagaaatgcagagc
aaagttaaggtacggcacaaaagtgcttctaatggctccaagacggacactgaggaggag
gaagagcagcagcagcagcagaaggagcagagaagcctggccgcacaccccaggtttaag
cggcgccaccgctcggaagactgtggtggaggccctcggtccctgtccagggggctccct
tgtaaaaaagcagccactgagggcagcagtgaaaagactgttttggactcgaagccctcc
gtgcccaccacttcagaaggtggccctgagctggagttacaaatccctgaactacctctt
gacagcaatgaattttgggtccatgagggttgtattctctgggccaatggaatctacctg
gtttgtggcaggctctatggcctgcaggaagcgctggaaatagccagagagatgaaatgt
tcccactgccaggaggcaggcgccaccttgggctgctacaacaaaggctgctccttccga
taccattacccgtgtgccattgatgcagcttctgtaggagaaacagttaaaaagaacaag
agagattgtttgctacatgaggagaacttctcggtgaggtgccctaagcacaagaacaag
accgcgaaaggcagcctcagcacagagcagtcggagcggggggtacaagcagccctcgat
acccacaggctccgcatccacaaattcaaccacacatcaaagataacttga

>gi568815576r:42068656_42315305|GENSCAN_predicted_peptide_12|502_aa
MENRKEDTDPDLEIKAPLGTPSCSRGYNWGNQEMCICGALATSKEKVGDADNQCTLRGQC
TSRRSLRGIGPIGHAPSNHHFPPKKGRKPQPVAKRRARPVPPGLPGAPKQAALTPGRRPL
TPAAFAPKASRDQQTLPEHCPNALHLETSPVLGPGQLGLFIPESLPLSLLVPSGKRTCMG
IIANRKMSPCFVQVSEPAWREHGGSEQFFLGQGWSFSVKIPGPLINRRHQDTEITKFKGF
SEATTQGALESQAGIPLLCTPSLGTQYDPAQASACGRLVNDQKQNRKLCKVLSSSVRSEG
LSTYYLQAQHRRFTPPSYPMAMPESYRWICFLIRKMGVRTVLSPSEFCEDYRVKMCQDLD
PCLAASSSPTSVPLPERRPPCPHPNTSRPPRSDLAGVSLGQEFPTHLRLLHTPTVPVALH
TSAHTVPNASNAPATPHCLENLYSSFQSQLRLLSSQSASGLPCLAFHNRGDTSTSQCCGE
DEVRKTQTAWEAASVIIAMRML

>gi568815576r:42068656_42315305|GENSCAN_predicted_CDS_12|1509_bp
atggagaacagaaaagaggacacggacccagacctggaaataaaggcacctctggggaca
ccgagctgcagcagaggatacaactggggaaaccaggagatgtgcatctgcggagcactg
gctaccagtaaggagaaagttggggatgcagataatcagtgcaccctgagaggacagtgc
acctcccgccgtagcctgagagggatcggcccgattggccacgctccttcaaaccaccac
tttcccccgaaaaagggacgaaagcctcagcccgtagccaagaggcgagcgcggccggtg
cccccggggctgccaggggccccgaagcaggcggcgctgactcccggacgccgcccgctc
acacccgccgcctttgcacccaaggcctcgcgcgaccagcagaccctgccagaacattgc
cccaatgccctccacctggaaacttccccagtccttggaccaggccagctgggcctcttt
atccctgaaagcctgccactttctcttttggttccatctggaaaaagaacgtgtatgggc
atcatagccaacaggaagatgagcccctgctttgtgcaagtctctgagcccgcatggagg
gagcatgggggctcagaacagttctttctggggcaaggttggtccttcagcgtcaagatt
cctgggccacttattaatagaaggcaccaggacacagaaattacaaagttcaagggtttt
agtgaggccacgactcagggagcccttgagagccaagctggcatcccactgctgtgcaca
cccagcctgggcacacagtatgaccctgcccaggcatctgcctgcggccgtctggtgaat
gaccagaaacagaaccgaaagctctgtaaagtgctctcaagcagcgtccgttccgagggg
ctgagcacctactacctgcaagcccagcacaggcgcttcacacccccttcctaccccatg
gcaatgcctgagagctatagatggatctgtttcctcatccgtaaaatgggggtaagaaca
gtccttagcccatctgagttttgtgaggactaccgagtgaagatgtgtcaggacttagac
ccctgcctggcagctagcagctctccaacctctgtacctctccccgaacgcaggcccccc
tgcccgcaccccaacacttcaagacctcctaggagtgacctggctggggtctctctaggt
caggagttccctacccatctacggctgctgcacaccccaaccgtgcccgtggccctccac
acctctgcccacactgtccccaatgccagcaacgcccctgcaactccccactgcctagaa
aacctgtattcttccttccagtctcagctcagacttctttcctctcaaagtgcctctgga
ctgccctgtctggcttttcataataggggtgacaccagcacctcacagtgttgtggtgag
gatgaagtaagaaaaacacaaactgcctgggaagctgcttctgtcatcatcgccatgagg
atgctctga

>gi568815576r:42068656_42315305|GENSCAN_predicted_peptide_13|121_aa
MPGRLPACRRRHRCLGFKLLRGPAMWPWGNHFLTMAINFAESPSALWQPVPRQRQLVRVK
PPGVLMTEAIRIRHLPQSTHPEETDSGQFQTCTTSPPSLSSGPWVSGDVFSLPWSGHKGP
L

>gi568815576r:42068656_42315305|GENSCAN_predicted_CDS_13|366_bp
atgcctggccgcctgcccgcctgccgcaggaggcatcgatgcctgggattcaagctcctc
cgagggcctgcgatgtggccatggggaaatcattttctgacaatggccatcaatttcgct
gagagcccatcggcgctctggcagcctgtcccccggcagcggcagctggtccgtgtcaag
cccccaggtgtcctgatgacagaagcgattcgcattaggcatctgccccagagcacacat
ccggaagagacggattctggccagttccagacatgcaccacatcacctccaagtttatct
tcaggtccttgggttagcggagatgtcttctctctgccctggagtggccacaaggggccc
ctatag