GENSCAN 1.0 Date run: 3-Nov-116 Time: 09:27:15 Sequence gi568815579r:18894369_19131071 : 236703 bp : 52.63% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Term - 1052 659 394 1 1 90 47 161 0.385 7.07 1.01 Init - 1704 1325 380 0 2 80 31 504 0.250 38.30 1.00 Prom - 3437 3398 40 -5.91 2.11 PlyA - 3746 3741 6 1.05 2.10 Term - 5358 5311 48 0 0 56 54 69 0.304 -2.11 2.09 Intr - 5579 5505 75 2 0 155 113 115 0.995 20.91 2.08 Intr - 6081 6013 69 0 0 118 99 128 0.999 16.77 2.07 Intr - 9055 8900 156 1 0 52 102 283 0.980 26.82 2.06 Intr - 10484 10403 82 1 1 115 101 102 0.964 14.34 2.05 Intr - 11261 11208 54 1 0 112 105 118 0.999 14.48 2.04 Intr - 12744 12592 153 2 0 98 89 351 0.999 35.90 2.03 Intr - 16703 16603 101 2 2 90 78 181 0.985 16.71 2.02 Intr - 18678 18616 63 0 0 81 109 85 0.996 9.31 2.01 Init - 24980 24855 126 2 0 90 86 201 0.970 20.22 2.00 Prom - 25286 25247 40 -13.88 3.00 Prom + 25294 25333 40 -14.13 3.01 Init + 25374 25488 115 2 1 71 96 161 0.999 15.67 3.02 Intr + 26212 26335 124 2 1 80 115 73 0.999 9.35 3.03 Intr + 27295 27380 86 1 2 91 55 147 0.999 11.66 3.04 Intr + 27475 27596 122 2 2 105 105 168 0.999 20.92 3.05 Intr + 27958 28145 188 0 2 76 86 357 0.999 33.31 3.06 Intr + 28236 28376 141 0 0 74 63 380 0.936 34.08 3.07 Intr + 29865 29940 76 0 1 62 94 83 0.959 6.31 3.08 Intr + 30255 30331 77 2 2 68 105 150 0.956 13.51 3.09 Intr + 30514 30611 98 1 2 61 99 169 0.995 15.45 3.10 Intr + 31767 32009 243 1 0 86 70 222 0.950 18.20 3.11 Intr + 33398 33486 89 0 2 106 78 172 0.988 18.19 3.12 Intr + 33597 33668 72 2 0 77 100 137 0.997 13.90 3.13 Term + 33760 33948 189 0 0 80 52 199 0.802 13.27 3.14 PlyA + 34237 34242 6 1.05 4.11 PlyA - 34318 34313 6 -6.28 4.10 Term - 35266 35075 192 1 0 107 52 230 0.997 19.04 4.09 Intr - 37043 36957 87 2 0 75 89 95 0.967 8.96 4.08 Intr - 37257 37141 117 0 0 102 109 117 0.999 16.37 4.07 Intr - 37764 37608 157 2 1 103 76 234 0.978 24.23 4.06 Intr - 38677 38556 122 1 2 109 60 191 0.999 18.20 4.05 Intr - 40042 39935 108 1 0 85 76 68 0.860 6.28 4.04 Intr - 44116 43985 132 1 0 79 80 191 0.601 18.75 4.03 Intr - 44516 44360 157 1 1 115 77 214 0.873 23.53 4.02 Intr - 45647 45617 31 0 1 125 31 9 0.767 -3.43 4.01 Init - 46866 46683 184 0 1 93 84 209 0.956 18.25 4.00 Prom - 55545 55506 40 -6.01 5.00 Prom + 56295 56334 40 -7.20 5.01 Init + 57415 57602 188 0 2 66 87 152 0.456 11.41 5.02 Term + 61115 61139 25 2 1 121 42 13 0.660 -2.02 5.03 PlyA + 61322 61327 6 1.05 6.00 Prom + 61785 61824 40 -1.11 6.01 Init + 63966 63969 4 2 1 80 92 0 0.531 -0.31 6.02 Term + 67106 67200 95 0 2 105 49 73 0.755 3.39 6.03 PlyA + 67933 67938 6 1.05 7.17 PlyA - 70241 70236 6 1.05 7.16 Term - 81805 81731 75 1 0 101 34 56 0.401 -0.36 7.15 Intr - 83943 83783 161 1 2 92 29 63 0.110 1.02 7.14 Intr - 90126 90012 115 0 1 73 53 67 0.187 2.22 7.13 Intr - 93942 93780 163 2 1 138 3 38 0.048 0.79 7.12 Intr - 99097 98909 189 0 0 84 48 97 0.028 4.52 7.11 Intr - 100912 100755 158 1 2 117 27 263 0.912 22.42 7.10 Intr - 107306 107245 62 0 2 107 105 41 0.961 6.64 7.09 Intr - 110278 109800 479 0 2 98 113 466 0.971 43.38 7.08 Intr - 114060 113949 112 1 1 75 76 74 0.713 4.84 7.07 Intr - 115974 115487 488 2 2 92 29 329 0.948 20.65 7.06 Intr - 124909 124741 169 2 1 97 82 85 0.565 8.32 7.05 Intr - 131858 130251 1608 0 0 62 91 736 0.084 60.50 7.04 Intr - 136714 136583 132 2 0 100 93 99 0.355 12.72 7.03 Intr - 139287 139121 167 2 2 32 76 93 0.005 2.62 7.02 Intr - 143557 143460 98 1 2 40 63 88 0.011 0.81 7.01 Init - 146606 146571 36 2 0 75 97 28 0.517 2.63 7.00 Prom - 146949 146910 40 -2.31 8.00 Prom + 147307 147346 40 -2.81 8.01 Init + 148389 148509 121 2 1 90 97 232 0.963 24.61 8.02 Intr + 149624 149706 83 0 2 94 91 72 0.907 7.95 8.03 Intr + 157254 157827 574 2 1 94 116 888 0.958 85.12 8.04 Intr + 159784 159953 170 2 2 132 80 281 0.999 31.88 8.05 Intr + 160897 161028 132 0 0 67 80 235 0.999 21.95 8.06 Intr + 161423 161560 138 1 0 88 80 245 0.948 24.87 8.07 Term + 163048 163260 213 0 0 80 46 300 0.972 22.55 8.08 PlyA + 163783 163788 6 1.05 9.11 PlyA - 166405 166400 6 -0.45 9.10 Term - 166887 166870 18 0 0 72 47 -2 0.030 -7.30 9.09 Intr - 167237 167146 92 0 2 85 77 38 0.045 2.61 9.08 Intr - 167467 167355 113 0 2 79 77 83 0.075 6.83 9.07 Intr - 167594 167538 57 1 0 87 72 47 0.058 1.49 9.06 Intr - 177953 177922 32 2 2 100 111 40 0.104 4.91 9.05 Intr - 186947 186833 115 1 1 64 30 67 0.126 -0.55 9.04 Intr - 190759 190633 127 2 1 92 105 -2 0.249 2.14 9.03 Intr - 191424 191355 70 0 1 149 64 15 0.617 4.45 9.02 Intr - 193766 193645 122 0 2 -41 65 160 0.393 1.52 9.01 Init - 198519 198447 73 0 1 71 72 79 0.384 4.24 9.00 Prom - 198672 198633 40 -7.00 10.00 Prom + 199091 199130 40 -3.11 10.01 Init + 200117 200142 26 1 2 81 75 18 0.635 -1.16 10.02 Intr + 201287 201435 149 2 2 7 30 199 0.456 6.39 10.03 Intr + 201949 202171 223 2 1 96 99 27 0.540 2.31 10.04 Intr + 207051 207087 37 0 1 86 55 17 0.395 -3.05 10.05 Intr + 207413 207518 106 1 1 127 115 113 0.998 18.19 10.06 Intr + 210545 210570 26 0 2 98 105 8 0.943 1.73 10.07 Intr + 211193 211359 167 1 2 63 42 360 0.944 28.17 10.08 Intr + 211901 212017 117 2 0 102 82 95 0.972 10.38 10.09 Intr + 213526 213677 152 1 2 102 94 208 0.993 23.12 10.10 Term + 216201 216508 308 1 2 91 50 773 0.999 69.43 10.11 PlyA + 218499 218504 6 1.05 11.08 PlyA - 219841 219836 6 1.05 11.07 Term - 225815 225562 254 0 2 99 37 224 0.803 14.53 11.06 Intr - 226493 226397 97 2 1 110 72 208 0.977 21.48 11.05 Intr - 226798 226624 175 0 1 72 51 308 0.999 25.96 11.04 Intr - 227053 226940 114 0 0 111 38 108 0.713 8.17 11.03 Intr - 227300 227136 165 1 0 77 31 193 0.976 12.09 11.02 Intr - 227451 227391 61 1 1 113 69 35 0.589 2.38 11.01 Init - 235935 235788 148 0 1 60 119 270 0.894 27.52 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 100118 99998 121 1 1 112 43 118 0.962 7.95 S.002 Init - 131823 130251 1573 0 1 67 91 754 0.897 66.54 S.003 Intr + 169554 169678 125 1 2 77 93 61 0.873 6.31 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579r:18894369_19131071|GENSCAN_predicted_peptide_1|257_aa MAAAGPAAGPTGPEPMPSYAQLVQRGWGSALAAARGCTDCGWGLARRGLAEHAHLAPPEL LLLALGALGWTALRSAATARLFRVSVAGGRDEGTPDWGKPGPGTWGRRRVPFFQRWLLRM KGSPERGAEPDTEKPRTGNGRARRTAACAMRLPAMRAAPRPGARSAARFLLGPCGGGAGL PAVIGKRQELPGRCLVPIWGQRHPPTRDGKGLGALATPVVAPLEVTWPGASGLDCAASKR RQQNSPVSAETIAPARE >gi568815579r:18894369_19131071|GENSCAN_predicted_CDS_1|774_bp atggcggcggcggggcccgcggcggggccgacggggcccgagcccatgccgagctacgcg cagctagtgcagcgcggctggggcagcgcgctggcggcggcgcggggctgcacggactgc ggctgggggctggcgcgtcgcggcctggctgagcacgcgcacctggcgccgcccgagctg ctgctgctggcgctcggcgcgctgggctggaccgccctgcgctccgcggccactgcgcgc ctctttcgggtcagtgtggccgggggccgggacgaggggaccccggactgggggaagccg ggaccggggacctggggccgccggcgcgttcctttcttccagcgctggctgctgcggatg aaggggtcgccggaacgtggggccgagccagacactgagaagcccaggaccgggaacggg agggctcggcgcaccgcggcctgcgccatgcgcctccctgccatgcgggctgcgccccgg cccggagcgaggtccgctgcccgttttctgctgggtccgtgcggcggcggggccggcttg cccgctgtaatcgggaagaggcaggagctgcccggtcgctgccttgtgcctatctggggt cagcgccaccctccgaccagggatggcaagggcctgggtgctctggccacgccagtggtc gctcccttggaggtgacatggcccggggcttcagggttggactgtgctgcctccaagagg cgacagcagaattccccagtttctgcggagaccatcgcccctgcgagagagtga >gi568815579r:18894369_19131071|GENSCAN_predicted_peptide_2|308_aa MAPPAPGPASGGSGEVDELFDVKNAFYIGSYQQCINEAQRVKLSSPERDVERDVFLYRAY LAQRKFGVVLDEIKPSSAPELQAVRMFADYLAHESRRDSIVAELDREMSRSVDVTNTTFL LMAASIYLHDQNPDAALRALHQGDSLECTAMTVQILLKLDRLDLARKELKRMQDLDEDAT LTQLATAWVSLATGGEKLQDAYYIFQEMADKCSPTLLLLNGQAACHMAQGRWEAAEGLLQ EALDKDSGYPETLVNLIVLSQHLGKPPEVTNRYLSQLKDAHRSHPFIKEYQAKENDFDRL VLQYAPSA >gi568815579r:18894369_19131071|GENSCAN_predicted_CDS_2|927_bp atggcgcctccggcccccggcccggcctccggcggctccggggaggtagacgagctgttc gacgtaaagaacgccttctacatcggcagctaccagcagtgcataaacgaggcgcagcgg gtgaagctatcaagcccagagagagacgtggagagggacgtcttcctgtatagagcgtac ctggcgcagaggaagttcggtgtggtcctggatgagatcaagccctcctcggcccctgag ctccaggccgtgcgcatgtttgctgactacctcgcccacgagagtcggagggacagcatc gtggccgagctggaccgagagatgagcaggagcgtggacgtgaccaacaccaccttcctg ctcatggccgcctccatctatctccacgaccagaacccggatgccgccctgcgtgcgctg caccagggggacagcctggagtgcacagccatgacagtgcagatcctgctgaagctggac cgcctggacctcgcccggaaggagctgaagagaatgcaggacctggacgaggatgccacc ctcacccagctcgccactgcctgggtcagcctggccacgggtggtgagaagctgcaggat gcctactacatcttccaggagatggctgacaagtgctcgcccaccctgctgctgctcaat gggcaggcggcctgccacatggcccagggccgctgggaggccgctgagggcctgctgcag gaggcgctagacaaggatagtggctacccagagacgctggtcaacctcatcgtcctgtcc cagcacctgggcaagccccctgaggtgacaaaccgatacctgtcccagctgaaggatgcc cacaggtcccatcccttcatcaaggagtaccaggccaaggagaacgactttgacaggctg gtgctacagtacgctcccagcgcctga >gi568815579r:18894369_19131071|GENSCAN_predicted_peptide_3|539_aa MAGFAELGLSSWLVEQCRQLGLKQPTPVQLGCIPAILEGRDCLGCAKTGSGKTAAFVLPI LQKLSEDPYGIFCLVLTPTRELAYQIAEQFRVLGKPLGLKDCIIVGGMDMVAQALELSRK PHVVIATPGRLADHLRSSNTFSIKKIRFLVMDEADRLLEQGCTDFTVDLEAILAAVPARR QTLLFSATLTDTLRELQGLATNQPFFWEAQAPVSTVEQLDQRYLLVPEKVKDAYLVHLIQ RFQDEHEDWSIIIFTNTCKTCQILCMMLRKFSFPTVALHSMMKQKERFAALAKFKSSIYR ILIATDVASRGLDIPTVQVVINHNTPGLPKIYIHRVGRTARAVVLLGVKPRHSGLIPPKP KPLFSISRTPELCSCQDLPLFAPCPDAQHIADNRHIPTGRQGQAITLVTQYDIHLVHAIE EQIKKKLEEFSVEEAEVLQILTQVNVVRRECEIKLEAAHFDEKKEINKRKQLILEGKDPD LEAKRKAELAKIKQKNRRFKEKVEETLKRQKAGRAGHKGRPPRTPSGSHSGPVPSQGLV >gi568815579r:18894369_19131071|GENSCAN_predicted_CDS_3|1620_bp atggcaggcttcgcggagctcgggctgtcatcgtggctcgtggaacaatgtcggcagctg ggtttgaagcagcccacgcccgtgcagctcggctgcatccccgccatcctggagggtcga gactgcttgggctgtgctaagacaggcagtgggaagacagcagcgtttgtccttcccatc ttgcagaagctgtctgaggatccctatggcatcttctgcctcgtcctgacacccaccagg gagctggcctaccagatcgcagagcagttccgggtcctggggaagcctctagggctgaaa gactgcatcatcgtcggtggcatggacatggtggcccaggcgctggagctctctcggaaa ccacacgtggtcatcgccacgccggggcgcctggcagatcacctgcgcagctccaacact tttagtataaagaagatccgcttcctggtgatggatgaggcagaccggctgctggaacag ggctgcactgacttcaccgtggacctggaggccatcctggcggctgtgccggcccgcagg cagacactgctgttcagcgccacgctgaccgacacactccgggagctgcagggtctggcc accaaccagcccttcttctgggaagcacaggccccggtgagcaccgtggagcagctggac cagcgctacctgctggtgcctgagaaggtcaaggacgcctacctggtccacctgatccag cgcttccaggatgagcacgaggactggtccattatcatcttcaccaacacgtgcaagacc tgccagattctgtgcatgatgctgcgcaaattcagcttccccaccgtggctctgcactcc atgatgaagcagaaagaacgctttgccgccctagccaagttcaagtccagcatctaccgg atcctgatcgcaacagacgtggcctcccggggcctggacatccctacggtacaggtggtc atcaaccacaacacccccgggctccccaagatctacatccaccgagtcggccggacggcc cgtgcagtggtcctgcttggggtcaagcccagacactctgggctgatccctcccaagccc aagcccttgttcagcatctccaggacccctgagctgtgtagctgtcaggatctaccttta ttcgccccatgtcctgacgcccagcacatagcagataaccggcacatccccacagggcgg cagggtcaggccatcacgctggtgacacagtacgacatccacctggtgcacgccatcgag gagcagatcaagaagaagctggaggagttctccgtggaagaggccgaggtgctacagatc ctcacacaggtcaacgtggtgcgaagagagtgtgagatcaaactggaggcggcccacttt gacgaaaagaaggagatcaacaaacggaagcagctgatcctggaggggaaggaccctgac ctggaggccaagcgcaaggctgagctggccaagatcaagcagaagaaccggcgcttcaag gagaaggtggaggagacgctgaagcgacagaaggctggcagggctggccacaaggggcgt ccacccaggacaccgtctgggtcccactcaggcccagtcccctcccagggcctggtctga >gi568815579r:18894369_19131071|GENSCAN_predicted_peptide_4|428_aa MGPEAARAAAAQTKARAAWRAARRARAREPGSERGARQRGVSAAGAAPDSRLDTRGAPTW QDSGLLNGPMGREQPIFSTRAHVFQIDPATKRNWIPAGKHALTVSYFYDATRNVYRIISI GGAKAIINSTVTPNMTFTKTSQKFGQWADSRANTVYGLGFASEQHLTQFAEKFQEVKEAA RLAREKSQDGGELTSPALGLASHQVPPSPLVSANGPGEEKLFRSQSADAPGPTERERLKK MLSEGSVGEVQWEAEFFALQDSNNKLAGALREANAAAAQWRQQLEAQRAEAERLRQRVAE LEAQAASEVTPTGEKEGLGQGQSLEQLEALVQTKDQEIQTLKSQTGGPREALEAAEREET QQKVQDLETRNAELEHQLRAMERSLEEARAERERARAEVGRAAQLLDVSLFELSELREGL ARLAEAAP >gi568815579r:18894369_19131071|GENSCAN_predicted_CDS_4|1287_bp atgggaccggaggcggcgcgggcggcggcggcgcagacaaaggcacgggcggcgtggagg gcggcgcggagggcgcgggcccgggagccagggagcgagcggggcgcccggcagcgcgga gtcagcgccgcgggggccgcacccgactcgcgcctggacactcgcggggcgccgacctgg caggattcggggttgctgaatggcccaatggggagggagcagccaatcttcagcacacgg gcgcacgtgttccaaattgacccagccaccaagcgaaactggatcccagcgggcaagcac gcactcactgtctcctatttctacgatgccacccgcaatgtgtaccgcatcatcagcatc ggaggcgccaaggccatcatcaacagcactgtcactcccaacatgaccttcaccaaaact tcccagaagttcgggcagtgggccgacagtcgcgccaacacagtctacggcctgggcttt gcctctgaacagcatctgacacagtttgccgagaagttccaggaagtgaaggaagcagcc aggctggccagggagaaatctcaggatggcggggagctcaccagtccagccctggggctc gcctcccaccaggtgcccccgagccctctcgtcagtgccaacggccccggcgaggaaaaa ctgttccgcagccagagcgctgatgcccccggccccacagagcgcgagcggctaaagaag atgttgtctgagggctccgtgggcgaggtacagtgggaggccgagtttttcgcactgcag gacagcaacaacaagctggcaggcgccctgcgagaggccaacgccgccgcagcccagtgg aggcagcagctggaggctcagcgtgcagaggccgagcggctgcggcagcgggtggctgag ctggaggctcaggcagcttcagaggtgacccccaccggtgagaaggaggggctgggccag ggccagtcgctggaacagctggaagctctggtgcaaaccaaggaccaggagattcagacc ctgaagagtcagactggggggccccgcgaggccctggaggctgccgagcgtgaggagact cagcagaaggtgcaggacctggagacccgcaatgcggagttggagcaccagctgcgggcg atggagcgcagcctggaggaggcacgggcagagcgggagcgggcgcgggctgaggtgggc cgggcagcgcagctgctggacgtcagcctgtttgagctgagtgagctgcgtgagggcctg gcccgcctggctgaggctgcgccctga >gi568815579r:18894369_19131071|GENSCAN_predicted_peptide_5|70_aa MTSDRILQKKENQAKHQVKTAVVAEGYFVEVVIQGNSTPKGPEAGMCSAPWDSIEEAAVT EAKPNPWSEE >gi568815579r:18894369_19131071|GENSCAN_predicted_CDS_5|213_bp atgacttctgacaggattttgcaaaagaaagaaaaccaggcaaagcaccaggtcaagact gcagtggttgctgaaggctactttgtggaggtggtgattcagggcaatagcacgcccaaa ggccctgaggcagggatgtgttcagccccttgggacagcattgaggaggcggctgtgact gaagcaaagcccaacccatggagcgaagagtga >gi568815579r:18894369_19131071|GENSCAN_predicted_peptide_6|32_aa MDKRKKQGHLLSAGDMLSAQPASSKIVDPTAC >gi568815579r:18894369_19131071|GENSCAN_predicted_CDS_6|99_bp atggacaaacggaagaaacaaggccatttattgagcgccggtgatatgctgagcgctcaa cctgcatcctcaaagatcgtcgatccaacagcctgctag >gi568815579r:18894369_19131071|GENSCAN_predicted_peptide_7|1403_aa MADSKSQQVMGQVLTGQGPLDQYGSVTQELGTPAVDDLNQKVLTGASAQKKRTARLRKRA GPRRPAPAHQDSAAACACAARRCGDRWFICMSPPRAAAAAGQNNMAARRITQETFDAVLQ EKAKRYHMDASGEAVSETLQFKAQDLLRAVPRSRAEMYDDVHSDGRYSLSGSVAHSRDAG REGLRSDVFPGPSFRSSNPSISDDSYFRKECGRDLEFSHSDSRDQVIGHRKLGHFRSQDW KFALRGSWEQDFGHPVSQESSWSQEYSFGPSAVLGDFGSSRLIEKECLEKESRDYDVDHP GEADSVLRGGSQVQARGRALNIVDQEGSLLGKGETQGLLTAKGGVGKLVTLRNVSTKKIP TVNRITPKTQGTNQIQKNTPSPDVTLGTNPGTEDIQFPIQKIPLGLDLKNLRLPRRKMSF DIIDKSDVFSRFGIEIIKWAGFHTIKDDIKFSQLFQTLFELETETCAKMLASFKCSLKPE HRDFCFFTIKFLKHSALKTPRVDNEFLNMLLDKGAVKTKNCFFEIIKPFDKYIMRLQDRL LKSVTPLLMACNAYELSVKMKTLSNPLDLALALETTNSLCRKSLALLGQTFSLASSFRQE KILEAVGLQDIAPSPAAFPNFEDSTLFGREYIDHLKAWLVSSGCPLQVKKAEPEPMREEE KMIPPTKPEIQAKAPSSLSDEPTFSIFLREAVLCVPAVPQRADHRVVGTIDQLVKRVIEG SLSPKERTLLKEDPAYWFLSDENSLEYKYYKLKLAEMQRMSENLRGADQKPTSADCAVRA MLYSRAVRNLKKKLLPWQRRGLLRAQGLRGWKARRATTGTQTLLSSGTRLKHHGRQAPGL SQAKPSLPDRNDAAKDCPPDPVGPSPQDPSLEASGPSPKPAGVDISEAPQTSSPCPSADI DMKTMETAEKLARFVAQVGPEIEQFSIENSTDNPDLWFLHDQNSSAFKFYRKKVFELCPS ICFTSSPHNLHTGGGDTTGSQESPVDLMEGEAEFEDEPPPREAELESPEVMPEEEDEDDE DGGEEAPAPGGAGKSEGSTPADGLPGEAAEDDLAGAPALSQASSGTCFPRKRISSKSLKV GMIPAPKRVCLIQEPKVHEPVRIAYDRPRGRPMSKKKKPKDLDFAQQKLTDKNLGFQMLQ KMGWKEGHGLGSLGKGIREPVSVYAAGSLGRWRCPRKAPQPVSAEEPAPLADFPVGLAIA LQKCSVSKRDVETITDGVFSLFARIYTGETNSRSLNSVPVTPTLSPAPPQQCRSLWWFGV TVFPEGPSQASISPVSTYYPNAHPWWRPHLMAGTCAGYQGYPGLLEHPLPECPLEPEVPG IDIPQGQPKQICDPARAVVFYGWLIKQHPALGLFWLRVAGLVSTSGLALPRTVKPKASNM GLNFCSFHPPNAFDILLRVIILL >gi568815579r:18894369_19131071|GENSCAN_predicted_CDS_7|4212_bp atggctgattccaagtcccagcaggtcatggggcaggtcctaacaggccagggaccactg gaccagtatgggtctgtgacccaggaattggggacccctgctgtagatgatcttaaccag aaggtcttgacaggagcgtccgcccagaagaagcggacagcgcgcttgcgcaagagagcc gggccgcggcggcccgcgcctgcgcaccaggactcggcggcggcttgcgcctgcgcggcg cggcgctgcggagaccgttggttcatttgcatgtccccgcctcgcgcggcggcggcggcg gggcaaaataacatggcagccagacgaattacacaggagacttttgatgctgtattacaa gaaaaagccaaacgatatcacatggatgccagtggtgaggctgtaagcgaaactcttcag tttaaagctcaagatctcttaagggcagtcccaagatccagagcagagatgtatgatgac gtccacagcgatggcagatactccctcagtggatctgtagctcactctagagatgccgga agagaaggcctgagaagtgacgtatttccagggccttccttcagatcaagcaacccttcc atcagtgatgacagctactttcgcaaagaatgtggccgggatctggaattttctcactct gattctcgggaccaggtcattggccaccggaaattggggcatttccgttctcaggactgg aaatttgcgctccgtggttcttgggaacaagactttggccatccagtttctcaagagtcc tcttggtcacaggagtatagttttggtccctctgcagttttgggggactttggatcttcc aggctgattgagaaagagtgtttggagaaggagagtcgggattatgacgtggaccatcct ggggaggctgactctgtgcttaggggcggcagtcaagtccaggccagaggtcgagctcta aacatcgttgaccaggaaggttccctcctaggaaagggggagactcagggcctgctcaca gctaaggggggtgttgggaaacttgtcacattgagaaatgtgagcacaaaaaaaataccc accgtgaatcgtattactcccaaaactcagggcactaaccaaatccagaaaaacactcca agtcctgatgtgaccctggggacaaacccagggacagaagatatccagttccccattcag aagatccctctggggctggatctgaagaatcttcggctccccagaagaaagatgagcttt gacatcatagataagtctgatgttttttcaagatttgggatagaaataatcaaatgggca ggattccacaccataaaagatgatattaaattttcccaacttttccagactctctttgaa cttgaaacagaaacctgtgctaaaatgcttgcctcattcaaatgttccttaaaaccagag cacagagatttttgcttttttactatcaaatttttaaagcactctgctttgaaaacaccc agagttgataatgagtttttaaacatgcttttagacaaaggtgctgtgaagaccaaaaat tgcttttttgaaatcataaagccttttgacaagtacataatgagacttcaagaccggctt ctgaagagtgtcacacctttgcttatggcctgcaatgcctacgagctaagtgtcaagatg aagaccctcagtaaccccctggacttggctcttgccctagaaaccaccaactctctctgc cggaagtctttggcccttttgggacagacattttccttggcctcttctttccggcaggag aaaatcttagaagctgtcggcctgcaagatatagctccctcacctgctgcgtttccaaac ttcgaagactccactttgtttgggcgagagtacatagaccacctgaaggcctggctagtc agcagcggatgtcccctccaggttaagaaagccgaaccagagccgatgcgagaggaggag aaaatgattcctcctacgaaacctgaaattcaggccaaggctccaagtagtctgagtgat gagcccacattcagcatatttctcagagaagctgttctgtgtgttccagctgtcccccag cgagcagatcacagggtagtgggcaccatcgaccagcttgtgaaacgtgtcatcgaaggc agcctgtctcccaaagagagaactcttctcaaagaggaccctgcttactggtttttgtct gatgaaaatagtctggagtataaatattacaagctgaagttggcagaaatgcagcggatg agcgagaacttgcgaggagccgaccagaagccgacctcagcagactgtgcagtgagggcc atgctgtactcccgggctgtccgcaacctcaagaagaaactccttccgtggcagcggcgg gggctcctccgtgctcaagggctccggggctggaaggcgaggagagcgaccaccgggacc cagaccctcctatcctcaggcaccaggctgaaacaccacggccggcaggctccaggcctc tcacaggcaaaaccatccctgccagacagaaatgatgctgccaaggactgcccgccagac ccagttggaccttctcctcaggaccccagcttagaagcctcaggcccatcccccaagcca gcaggagtggacatctctgaagcacctcagacctcttctccctgcccatctgctgacatt gacatgaagacaatggagactgcagagaaactggctagatttgttgctcaggtgggacca gagatcgaacaattcagcatagaaaacagcaccgataaccctgacctgtggtttctacat gaccaaaatagttctgctttcaaattctatcgaaagaaagtgtttgaactatgtccatca atttgtttcacgtcatctccgcacaaccttcacactggtggtggtgacaccacgggttct caggagagccccgtggacctcatggaaggggaagcagagtttgaagacgagccccctccg cgggaggctgagctggagagcccagaggtgatgcctgaggaggaggacgaggacgatgag gatgggggagaggaggcccccgctcctggaggggcgggcaagtctgagggcagcacccct gccgacggccttcccggcgaggctgccgaggacgacctggctggagcacctgccttgtca caggcctcctcaggtacctgcttccctcggaagaggatcagcagcaagtcattgaaggtt ggcatgattccagctcccaagagagtgtgtctcatccaggagccaaaagtccatgaacca gttcgaattgcctatgacaggcctcggggtcgtcccatgtccaaaaagaagaaacccaag gacttggacttcgcccagcagaagctgaccgataagaacctgggcttccagatgctgcag aagatgggctggaaggagggccatggcctgggctccctcggaaagggcatcagggagccg gtcagcgtgtacgcagcaggcagcctgggaaggtggcgctgcccccgaaaggcacctcag cctgtgagtgctgaggaaccagctcctctggctgattttccagttggactggccattgct ctccagaagtgctctgttagcaaacgtgatgtggaaacgatcacagatggtgttttctcg ttgttcgccagaatttatacgggggagacaaattcccggtccctcaactctgtccctgtc acccccaccctgtcaccagccccacctcaacagtgcaggagcctgtggtggtttggggtc accgtgttccctgaggggcccagccaggcctctatctcccctgtgtccacatactacccc aatgcccacccatggtggaggccccatttaatggcaggcacctgtgcagggtatcagggc taccctgggctgctggagcaccctttgcctgaatgcccactggaaccagaagtgcctgga attgacattccccagggccagccaaagcagatctgtgacccggctcgggctgtggtgttc tatggatggctgatcaagcagcatcctgccctaggtttattttggttgagggtagcaggc ttggtcagcacctcaggcctggccctgcccaggaccgtgaaacccaaggcctctaatatg gggctcaatttctgcagcttccaccctccaaatgcctttgacattcttctacgagttata attcttctctga >gi568815579r:18894369_19131071|GENSCAN_predicted_peptide_8|476_aa MVSKRIAQETFDAAVRENIEEFAMGPEEAVKEAVEQFESQGVDLSNIVKTAPKVSADGSQ EPTHDILQMLSDLQESVASSRPQEVSAYLTRFCDQCKQDKACRFLAAQKGAYPIIFTAWK LATAGDQGLLLQSLNALSVLTDGQPDLLDAQGLQLLVATLTQNADEADLTCSGIRCVRHA CLKHEQNRQDLVKAGVLPLLTGAITHHGHHTDVVREACWALRVMTFDDDIRVPFGHAHNH AKMIVQENKGLKVLIEATKAFLDNPGILSELCGTLSRLAIRNEFCQEVVDLGGLSILVSL LADCNDHQMRDQSGVQELVKQVLSTLRAIAGNDDVKDAIVRAGGTESIVAAMTQHLTSPQ VCEQSCAALCFLALRKPDNSRIIVEGGGAVAALQAMKAHPQKAGVQKQACMLIRNLVAHG QAFSKPILDLGAEALIMQARSAHRDCEDVAKAALRDLGCHVELRELWTGQRGNLAP >gi568815579r:18894369_19131071|GENSCAN_predicted_CDS_8|1431_bp atggtctccaagcgcattgcccaggagacctttgatgcagctgtgcgcgagaacatcgag gagtttgcgatggggccagaggaggcagtgaaagaggccgtggagcagtttgaatcgcaa ggggttgatctgagcaacattgtaaagacggcacctaaagtctctgcagacggatcccag gagcccacacatgacatcctgcagatgctcagtgacctccaggagtctgtggccagctct cgcccccaggaggtgtcagcatacctcacccgcttctgcgaccagtgcaaacaggacaag gcctgccgcttcctcgcggcccagaagggggcctaccccatcatcttcactgcctggaag ctggccactgcaggtgaccagggccttctgctccagtccctcaatgccctgtcggtgctg actgatggacagccagacctcctggatgcccagggcctgcagctcctagtggccacgctg acccagaatgctgatgaggctgacctgacctgctctgggatccgctgtgtgcgtcacgct tgcctgaaacatgaacagaatcggcaagacctggtgaaagctggcgtgctgcctctgctg actggtgccatcacccatcatggccaccacactgacgtggtcagggaagcctgctgggcc ctgcgtgtcatgaccttcgatgacgacatccgtgtgccctttggccatgcccacaaccat gccaagatgattgtgcaggagaacaaaggcttgaaggtgctcatcgaagccaccaaagcg ttcctggataaccctggcatcctgagcgagctctgtggaaccctgtcccgcctggccatt cgcaacgagttctgccaggaggtcgtcgacctcgggggcctgagcattctggtgtccctg ctagccgactgcaatgaccaccagatgagggaccagagcggcgttcaggagctcgtgaag caagtgctgagcaccctgcgagccatcgcaggcaacgacgacgtgaaagatgctattgtc cgtgctggtgggacggagtccatcgtggctgctatgacccagcatctgaccagcccccag gtgtgtgagcagagctgcgcggccctgtgcttcctggccctgcgtaagcccgacaacagc cgcatcatcgtggagggtggcggggctgtggcagcactgcaggccatgaaggcacacccg cagaaggccggcgtgcagaaacaggcttgcatgctgatccgaaacctggtggcccacggc caggccttctcgaagcccatcctggacctgggggctgaggcactcatcatgcaggcccga tctgcccaccgtgactgtgaggacgtggccaaggccgccctgcgggacctgggttgtcat gtcgagctccgagagctgtggacaggccagaggggcaacctggcgccatga >gi568815579r:18894369_19131071|GENSCAN_predicted_peptide_9|272_aa MSDYCPAPPLPLQTLGAVLALGRLEAEEVQDNGQSIHGNKTVAQNLCSNQLRKPTTNLCS NAQDLGVWGHAQAKAFLPPGPVSASACGDETQGSLAKSKDRGNSGPSAGSESHNRVLLHG AWPLGSPQPLRLVIVVVPVVPYGSSAGKSPWGHLGHPAEMCPNDKSVGRDSATRVGDKYW ADVSIDEMEAKHLQWLRPHASWTQTNYGRGLPHQEGDIYAEERPLPVPTCQALGGGGWRQ PTSLPPSSNGRQSTLAIAMDTPTMGTHPLKCP >gi568815579r:18894369_19131071|GENSCAN_predicted_CDS_9|819_bp atgtctgactactgcccagcaccacccctgccgctacagaccctgggggctgtccttgca cttgggcgcttggaggcagaagaggtgcaggacaatggccaatccatacatggcaacaaa actgtggcccaaaacctgtgcagcaaccagctccggaaaccaaccaccaacctctgcagc aatgctcaggacctgggtgtctggggccatgctcaggcaaaggccttccttcctccaggg cctgtttctgcctctgcatgtggggatgaaactcagggaagtttggccaagtccaaggac agagggaactcaggccccagtgcaggatccgaaagccacaaccgggtccttctccatgga gcctggcccctaggcagcccacagccactgaggctggtcatagtggtggtccctgtggtt ccttatgggtccagtgctggaaagtcaccctggggccacttgggacatcctgcagaaatg tgccccaatgacaagtctgtggggagggatagtgccactcgagtgggtgacaagtattgg gctgatgtgagcattgatgaaatggaagcaaaacacttacaatggcttcgtccacacgct tcatggacacagaccaactatgggcgtggcttgccacaccaagagggagatatctacgcc gaggagcgcccactcccagtgcccacctgccaggccctgggagggggaggctggcgccag ccaacctccctaccgccttccagcaatggcaggcaaagcaccttggcaatagcaatggac acccccaccatgggcactcatcccctcaaatgcccatga >gi568815579r:18894369_19131071|GENSCAN_predicted_peptide_10|436_aa MPSMTGTSTDPRVRADTSPLDVWPDSGIGTEHLGAHWMFGLTVVLALSTSATLAPQPAGS GIPLLCSMQPITNHPVPRMCMLVPLPTWAEGGVVEPLCSHGPKYSIMGWRTPATLYSTKS PTCDVNPTSPATCPCVPTMPQCRSQRDHRQVLSSLLSGALAGALAKTAVAPLDRTKIIFQ VSSKRFSAKEAFRVLYYTYLNEGFLSLWRGNSATMVRVVPYAAIQFSAHEEYKRILGSYY GFRGEALPPWPRLFAGALAGTTAASLTYPLDLVRARMAVTPKEMYSNIFHVFIRISREEG LKTLYHGFMPTVLGVIPYAGLSFFTYETLKSLHREYSGRRQPYPFERMIFGACAGLIGQS ASYPLDVVRRRMQTAGVTGYPRASIARTLRTIVREEGAVRGLYKGLSMNWVKGPIAVGIS FTTFDLMQILLRHLQS >gi568815579r:18894369_19131071|GENSCAN_predicted_CDS_10|1311_bp atgccctccatgactggcacctccacagaccctcgggtccgagctgacacaagcccactg gacgtttggcctgacagtggtattggcaccgagcacctcggtgcccactggatgtttggc ctgacagtggtgttggcactgagcacctcggccacgctggccccacaacccgctggctca gggattcctctgctgtgctccatgcagcccatcacaaatcaccctgtacctaggatgtgc atgttggtaccccttcctacctgggctgagggtggagtggtggagcctctctgctcccac ggccccaaatactccatcatgggatggaggacgccagccaccctctattccaccaaaagc cccacttgtgatgtgaatccaaccagcccagccacctgtccctgtgtacctacaatgcca cagtgccgtagtcagcgtgaccacaggcaagtgctcagctccctgctgtctggggccctg gctggtgcccttgccaaaacagcggtagctcccctggaccgaaccaaaatcatcttccaa gtgtcttcaaaaagattttctgccaaggaggccttccgggtcctctactacacctacctc aacgagggatttctcagcttgtggcgcgggaactcggccaccatggtgcgcgtggtgccc tacgccgccatccagttcagcgcacacgaggagtacaagcgcatcctgggcagctactat ggcttccgtggagaagccctgcccccttggcctcgcctcttcgccggcgcactggctgga acgacagccgcttcactgacctaccccctggacctggtcagagcgcggatggccgtaacc ccgaaggaaatgtacagcaacatctttcatgtcttcatccgcatctcgagagaagagggg ctgaagactctctaccatggatttatgcccaccgtgctgggggtcattccctacgctggc ctgagcttcttcacctatgagacgctcaagagcttgcacagagagtacagcggccgccgg cagccctaccccttcgagcgcatgatcttcggcgcctgcgctggcctcatcgggcagtcg gcctcgtacccgctggatgtggtgcggcggcgcatgcagacggccggcgtcacgggctac ccgcgcgcctccatcgcccgcacgctgcgcaccatcgtgcgggaggagggcgccgtgcgc ggcctctacaaaggcttgagcatgaactgggtcaagggtcccatcgccgtgggcatcagc ttcaccaccttcgacctcatgcagatcctgctgcggcacctgcagagctag >gi568815579r:18894369_19131071|GENSCAN_predicted_peptide_11|337_aa MFLTVTRLYFSAEEGGERSVCLTFAFLFLLLAMLVQVVREETLELGLEPGLASMTQNLEP LLKKQGWDWALPVAKLAIRVGLAVVGSVLGAFLTFPGLRLAQTHRDALTMSEDRPMLQLS GSLGGFLLHTSFLSPLFILWLWTKPIARDFLHQPPFGETRFSLLSDSAFDSGRLWLLVVL CLLRLAVTRPHLQAYLCLAKARVEQLRREAGRIEAREIQQRVVRVYCYVTVVSLQYLTPL ILTLNCTLLLKTLGGYSWGLGPAPLLSPDPSSASAAPIGSGEDEVQQTAARIAGALGGLL TPLFLRGVLAYLIWWTAACQLLASLFGLYFHQHLAGS >gi568815579r:18894369_19131071|GENSCAN_predicted_CDS_11|1014_bp atgttcctgacagtgacacggctgtacttcagcgccgaggaggggggtgagcgctctgtc tgcctcacctttgccttcctcttcctgctgctggccatgctggtgcaagtggtgcgggag gagaccctcgagctgggcctggagcctggtctggccagcatgacccagaacttagagcca cttctgaagaagcagggctgggactgggcgcttcctgtggccaagctggctatccgcgtg ggactggcagtggtgggctctgtgctgggtgccttcctcaccttcccaggcctgcggctg gcccagacccaccgggacgcactgaccatgtcggaggacagacccatgctgcagttaagt gggtcgcttggtgggttcctcctgcacaccagcttcctgtctcccctgttcatcctgtgg ctctggacaaagcccattgcacgggacttcctgcaccagccgccgtttggggagacgcgt ttctccctgctgtccgattctgccttcgactctgggcgcctctggttgctggtggtgctg tgcctgctgcggctggcggtgacccggccccacctgcaggcctacctgtgcctggccaag gcccgggtggagcagctgcgaagggaggctggccgcatcgaagcccgtgaaatccagcag agggtggtccgagtctactgctatgtgaccgtggtgagcttgcagtacctgacgccgctc atcctcaccctcaactgcacacttctgctcaagacgctgggaggctattcctggggcctg ggcccagctcctctactatcccccgacccatcctcagccagcgctgcccccatcggctct ggggaggacgaagtccagcagactgcagcgcggattgccggggctctgggtggcctgctt actcccctcttcctccgtggcgtcctggcctacctcatctggtggacggctgcctgccag ctgctcgccagccttttcggcctctacttccaccagcacttggcaggctcctag