GENSCAN 1.0 Date run: 7-Nov-116 Time: 20:02:57 Sequence gi568815581r:63840774_64072841 : 232068 bp : 50.51% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 699 585 115 0 1 115 15 99 0.314 5.32 1.01 Init - 1796 1686 111 2 0 79 99 195 0.503 17.91 1.00 Prom - 4869 4830 40 -6.46 2.00 Prom + 8978 9017 40 -5.66 2.01 Init + 14044 14098 55 0 1 54 94 89 0.890 5.67 2.02 Intr + 16384 16526 143 2 2 49 80 51 0.004 0.47 2.03 Intr + 18656 18800 145 1 1 69 81 51 0.001 2.36 2.04 Intr + 19101 19304 204 1 0 102 18 70 0.594 0.57 2.05 Term + 21952 22175 224 2 2 101 53 172 0.992 12.08 2.06 PlyA + 23582 23587 6 1.05 3.06 PlyA - 23655 23650 6 1.05 3.05 Term - 31550 31353 198 2 0 90 43 276 0.447 20.70 3.04 Intr - 31968 31804 165 0 0 97 60 380 0.988 36.26 3.03 Intr - 32181 32062 120 0 0 48 75 50 0.533 0.39 3.02 Intr - 32566 32406 161 2 2 115 78 124 0.984 13.81 3.01 Init - 32841 32832 10 0 1 93 115 12 0.997 4.85 3.00 Prom - 36943 36904 40 -6.56 4.06 PlyA - 38250 38245 6 1.05 4.05 Term - 39745 39548 198 1 0 90 37 363 0.992 28.80 4.04 Intr - 40163 39999 165 2 0 102 68 390 0.986 38.56 4.03 Intr - 40375 40256 120 1 0 65 65 115 0.967 7.59 4.02 Intr - 40746 40586 161 1 2 116 52 125 0.882 11.41 4.01 Init - 41028 41019 10 0 1 93 115 12 0.999 4.85 4.00 Prom - 42966 42927 40 -8.96 5.00 Prom + 44028 44067 40 -7.96 5.01 Init + 45093 45279 187 2 1 59 86 180 0.704 12.89 5.02 Term + 45473 45696 224 0 2 101 53 172 0.993 12.08 5.03 PlyA + 47114 47119 6 1.05 6.06 PlyA - 50251 50246 6 1.05 6.05 Term - 54446 54249 198 2 0 90 43 354 0.989 28.50 6.04 Intr - 54864 54700 165 0 0 97 60 380 0.692 36.26 6.03 Intr - 55077 54958 120 0 0 48 75 50 0.533 0.39 6.02 Intr - 55462 55302 161 2 2 115 78 123 0.982 13.71 6.01 Init - 55738 55729 10 1 1 83 115 -5 0.992 2.54 6.00 Prom - 66717 66678 40 -6.66 7.06 PlyA - 67121 67116 6 1.05 7.05 Term - 69135 68938 198 0 0 72 50 308 0.997 22.80 7.04 Intr - 69553 69389 165 1 0 102 60 353 0.971 34.06 7.03 Intr - 69693 69647 47 1 2 99 75 58 0.981 3.73 7.02 Intr - 70151 69972 180 0 0 117 41 58 0.541 3.84 7.01 Init - 70423 70414 10 1 1 93 115 12 0.998 4.85 7.00 Prom - 70521 70482 40 -0.06 8.11 PlyA - 75548 75543 6 1.05 8.10 Term - 76733 76536 198 2 0 90 37 340 0.999 26.50 8.09 Intr - 77151 76987 165 0 0 103 60 381 0.985 36.96 8.08 Intr - 77363 77244 120 2 0 72 75 47 0.839 2.49 8.07 Intr - 77733 77573 161 1 2 113 52 165 0.992 15.11 8.06 Intr - 88551 88476 76 0 1 82 71 129 0.751 9.69 8.05 Intr - 88702 88661 42 1 0 125 94 40 0.977 6.74 8.04 Intr - 89115 88997 119 1 2 131 53 283 0.999 29.28 8.03 Intr - 89612 89301 312 0 0 79 57 469 0.962 39.06 8.02 Intr - 90615 90562 54 1 0 79 86 68 0.736 4.65 8.01 Init - 91488 91422 67 0 1 106 92 131 0.999 14.56 8.00 Prom - 93618 93579 40 -9.06 9.13 PlyA - 95675 95670 6 1.05 9.12 Term - 101220 99998 1223 1 2 89 43 2472 0.990 234.59 9.11 Intr - 102323 102053 271 2 1 59 80 714 0.852 64.81 9.10 Intr - 103073 102973 101 0 2 56 92 145 0.780 11.53 9.09 Intr - 104037 103896 142 0 1 84 -57 384 0.779 23.43 9.08 Intr - 104287 104234 54 1 0 71 105 96 0.987 8.78 9.07 Intr - 104865 104587 279 0 0 62 89 542 0.973 49.37 9.06 Intr - 106394 106272 123 2 0 126 59 142 0.957 15.88 9.05 Intr - 107290 107117 174 1 0 81 76 482 0.963 46.44 9.04 Intr - 107992 107838 155 2 2 64 70 211 0.842 16.69 9.03 Intr - 108580 108488 93 2 0 112 -7 85 0.462 1.34 9.02 Intr - 108755 108620 136 2 1 90 72 214 0.797 20.14 9.01 Init - 111136 110651 486 1 0 71 105 976 0.957 90.85 9.00 Prom - 113015 112976 40 -2.16 10.18 PlyA - 113163 113158 6 -1.95 10.17 Term - 113915 113778 138 2 0 84 40 154 0.581 8.16 10.16 Intr - 116745 116389 357 0 0 117 89 710 0.987 69.45 10.15 Intr - 118665 118492 174 0 0 78 46 266 0.697 21.54 10.14 Intr - 120658 120420 239 2 2 85 58 424 0.987 36.43 10.13 Intr - 122867 122814 54 0 0 59 113 33 0.782 1.85 10.12 Intr - 123167 122899 269 1 2 22 113 204 0.756 13.48 10.11 Intr - 123904 123606 299 1 2 103 72 495 0.577 45.07 10.10 Intr - 125470 125329 142 0 1 93 52 209 0.998 18.16 10.09 Intr - 125771 125708 64 0 1 80 92 119 0.983 9.28 10.08 Intr - 127582 127250 333 2 0 93 116 630 0.997 61.84 10.07 Intr - 130480 130389 92 0 2 96 84 160 0.999 16.04 10.06 Intr - 131063 130949 115 0 1 62 43 211 0.397 13.51 10.05 Intr - 131697 131547 151 0 1 92 8 157 0.831 7.84 10.04 Intr - 132077 131796 282 2 0 46 47 473 0.804 36.62 10.03 Intr - 132743 132184 560 0 2 120 78 85 0.097 3.25 10.02 Intr - 138653 138583 71 1 2 61 48 106 0.188 2.73 10.01 Init - 142083 142034 50 0 2 95 67 67 0.784 5.72 10.00 Prom - 144746 144707 40 -7.46 11.10 PlyA - 150820 150815 6 1.05 11.09 Term - 151357 151281 77 2 2 106 48 107 0.729 6.50 11.08 Intr - 152090 151895 196 2 1 123 90 -34 0.422 -0.71 11.07 Intr - 156652 156539 114 1 0 85 81 26 0.008 2.24 11.06 Intr - 158388 158197 192 0 0 90 30 113 0.001 5.39 11.05 Intr - 162152 162019 134 0 2 107 37 256 0.009 22.76 11.04 Intr - 163191 162871 321 1 0 135 77 284 0.997 27.83 11.03 Intr - 164600 164334 267 0 0 103 84 246 0.827 23.20 11.02 Intr - 165962 165858 105 0 0 93 115 45 0.131 7.89 11.01 Init - 186449 186380 70 2 1 64 48 123 0.867 7.11 11.00 Prom - 187082 187043 40 -5.86 12.18 PlyA - 187840 187835 6 1.05 12.17 Term - 203427 203215 213 0 0 87 53 326 0.977 26.23 12.16 Intr - 204154 204087 68 2 2 94 121 43 0.998 6.82 12.15 Intr - 204709 204586 124 1 1 126 58 94 0.908 10.36 12.14 Intr - 207212 207085 128 0 2 85 80 132 0.999 12.50 12.13 Intr - 208429 208282 148 1 1 117 101 150 0.998 19.01 12.12 Intr - 212206 212007 200 2 2 -40 76 210 0.863 6.07 12.11 Intr - 212598 212499 100 0 1 101 89 116 0.873 12.68 12.10 Intr - 213666 213477 190 2 1 100 30 398 0.972 34.79 12.09 Intr - 214055 213965 91 0 1 79 117 39 0.997 4.95 12.08 Intr - 215175 214902 274 0 1 82 88 437 0.995 40.21 12.07 Intr - 217220 217029 192 2 0 31 63 311 0.955 22.59 12.06 Intr - 219814 219696 119 2 2 91 81 92 0.893 8.98 12.05 Intr - 223378 223213 166 1 1 114 82 183 0.999 19.83 12.04 Intr - 224514 224436 79 2 1 92 103 10 0.991 2.45 12.03 Intr - 226159 225898 262 2 1 63 86 345 0.998 28.35 12.02 Intr - 227518 227417 102 2 0 91 98 219 0.999 23.35 12.01 Intr - 231330 231208 123 1 0 133 110 74 0.999 14.66 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 18531 18800 270 1 0 84 81 148 0.811 11.21 S.002 Term + 60301 60524 224 2 2 101 54 163 0.990 11.28 S.003 Init - 78003 77994 10 0 1 81 115 -2 0.889 2.64 S.004 Intr + 160311 160537 227 2 2 82 78 148 0.955 9.98 S.005 Intr + 160694 160764 71 2 2 99 91 53 0.824 5.53 S.006 Term - 162152 161974 179 0 2 107 43 289 0.991 24.15 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:63840774_64072841|GENSCAN_predicted_peptide_1|76_aa MLPGPALRGPGPAGGVGGPGAAAFRPMGPAGPAAQYQVLTPPRPKPCAQAFDTGFLLIYG FAVGFLCFRLMSPGRX >gi568815581r:63840774_64072841|GENSCAN_predicted_CDS_1|228_bp atgctgcccggaccggcgctccggggaccgggtccggcaggaggcgtggggggccccggg gccgccgccttccgccccatgggccccgcgggccccgcggcgcagtaccaggttctaact ccaccacgtccgaagccatgtgcccaagcatttgacactggcttcttgctcatctacggg tttgctgtgggcttcctctgttttcgactcatgtctccaggcagagnn >gi568815581r:63840774_64072841|GENSCAN_predicted_peptide_2|256_aa MKTLLFGVWTLLALILCPGVPEELFEVSIWPSQALVEFGQSLVVNCSTTCPDPGPSGIET FLKKTQGNQELHRKNFTSLAVASQRAEVIISVRAQKENDRCNSSCHAELDLSLQEFSQSP HIWVSSLLEAGMAETVSCEVARVFPAKEVMFHMFLEDQELSSFLSWEGDTAWANATIRTM EAGLDEGISSTLFVIITVALGVGVITIALYLSYRPCKVDRRKLLYRQKEEDKEEESQFAV QEEKSTTHIIDNCLIE >gi568815581r:63840774_64072841|GENSCAN_predicted_CDS_2|771_bp atgaaaacgcttctgtttggtgtctggaccctgctggccttgatcctttgcccaggggtc ccggaagagttgtttgaggtttctatttggccaagtcaggccctggtggagtttggacag tccctagtggtcaactgcagcactacttgcccagacccaggacccagtggaattgagacc ttcttaaagaaaactcagggtaaccaagaactgcatagaaagaactttacgagcttggct gtggcctcccaaagagctgaagtcatcatcagtgtcagagcccaaaaggagaatgacaga tgcaattcttcctgccatgcagaactggacttgagtttgcaagaattctctcagagtccc cacatctgggtctcttcccttttggaggctgggatggcggagactgtgagctgcgaggtg gctagggtgtttccagccaaagaagttatgttccacatgttcctggaagaccaagagctg agctccttcctttcctgggagggggacacagcatgggccaatgctaccattcggaccatg gaggctggactggatgaaggaataagctctaccctctttgtcattattaccgttgccctt ggagtgggtgtcatcaccatagcactgtatttgagctatcggccctgcaaagtggacagg aggaaattgctctataggcagaaagaggaggacaaagaggaggaaagccagtttgctgtt caggaagagaaaagtacaactcatataattgacaactgtttgattgaatga >gi568815581r:63840774_64072841|GENSCAN_predicted_peptide_3|217_aa MAAGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFDHAMLQAHRAHQLAIDTYQEFEET YIPKDQKYSFLHDSQTSFCFSDSIPTPSNMEETQQKSNLELLRISLLLIESWLEPVRFLR SMFANNLVYDTSDSDDYHLLKDLEEGIQTLMGRLEDGSRRTGQILKQTYSKFDTNSHNHD ALLKNYGLLYCFRKDMDKVETFLRMVQCRSVEGSCGF >gi568815581r:63840774_64072841|GENSCAN_predicted_CDS_3|654_bp atggctgcaggctcccggacgtccctgctcctggcttttgccctgctctgcctgccctgg cttcaagaggctggtgccgtccaaaccgttccgttatccaggctttttgaccacgctatg ctccaagcccatcgcgcgcaccagctggccattgacacctaccaggagtttgaagaaacc tatatcccaaaggaccagaagtattcattcctgcatgactcccagacctccttctgcttc tcagactctattccgacaccctccaacatggaggaaacgcaacagaaatccaatctagag ctgctccgcatctccctgctgctcatcgagtcgtggctggagcccgtgcggttcctcagg agtatgttcgccaacaacctggtgtatgacacctcggacagcgatgactatcacctccta aaggacctagaggaaggcatccaaacgctgatggggaggctggaagacggcagccgccgg actgggcagatcctcaagcagacctacagcaagtttgacacaaactcacacaaccatgac gcactgctcaagaactacgggctgctctactgcttcaggaaggacatggacaaggtcgag acattcctgcgcatggtgcagtgccgctctgtagagggtagctgtggcttctag >gi568815581r:63840774_64072841|GENSCAN_predicted_peptide_4|217_aa MAAGSRTSLLLAFGLLCLSWLQEGSAFPTIPLSRLFDNAMLRARRLYQLAYDTYQEFEEA YILKEQKYSFLQNPQTSLCFSESIPTPSNRVKTQQKSNLELLRISLLLIQSWLEPVQLLR SVFANSLVYGASDSNVYRHLKDLEEGIQTLMWRLEDGSPRTGQIFNQSYSKFDTKSHNDD ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF >gi568815581r:63840774_64072841|GENSCAN_predicted_CDS_4|654_bp atggctgcaggctcccggacgtccctgctcctggcttttggcctgctctgcctgtcctgg cttcaagagggcagtgccttcccaaccattcccttatccaggctttttgacaacgctatg ctccgcgcccgtcgcctgtaccagctggcatatgacacctatcaggagtttgaagaagcc tatatcctgaaggagcagaagtattcattcctgcagaacccccagacctccctctgcttc tcagagtctattccaacaccttccaacagggtgaaaacgcagcagaaatctaacctagag ctgctccgcatctccctgctgctcatccagtcatggctggagcccgtgcagctcctcagg agcgtcttcgccaacagcctggtgtatggcgcctcggacagcaacgtctatcgccacctg aaggacctagaggaaggcatccaaacgctgatgtggaggctggaagatggcagcccccgg actgggcagatcttcaatcagtcctacagcaagtttgacacaaaatcgcacaacgatgac gcactgctcaagaactacgggctgctctactgcttcaggaaggacatggacaaggtcgag acattcctgcgcatcgtgcagtgccgctctgtggagggcagctgtggcttctag >gi568815581r:63840774_64072841|GENSCAN_predicted_peptide_5|136_aa MEHTLTCVPKGNPAPALVCTWNGVVFDLEVPQKATQNHTGTYRCTATNQLGSVSKDIAVI VQGLDEGISSTLFVIITVALGVGVITIALYLSYRPCKVDRRKLLYRQKEEDKEEESQFAV QEEKSTTHIIDNCLIE >gi568815581r:63840774_64072841|GENSCAN_predicted_CDS_5|411_bp atggaacacacgctcacctgcgtcccaaagggaaacccagctccagccttggtgtgtacc tggaatggggtggtctttgaccttgaagtgccacagaaggcaacccagaaccacactgga acctaccgctgcacagccactaaccagctgggctctgtcagcaaagacattgctgtcatt gttcaaggactggatgaaggaataagctctaccctctttgtcattattaccgttgccctt ggagtgggtgtcatcaccatagcactgtatttgagctatcggccctgcaaagtggacagg aggaaattgctctataggcagaaagaggaggacaaagaggaggaaagccagtttgctgtt caggaagagaaaagtacaactcatataattgacaactgtttgattgaatga >gi568815581r:63840774_64072841|GENSCAN_predicted_peptide_6|217_aa MAPGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFDHAMLQAHRAHQLAIDTYQEFEET YIPKDQKYSFLHDSQTSFCFSDSIPTPSNMEETQQKSNLELLRISLLLIESWLEPVRFLR SMFANNLVYDTSDSDDYHLLKDLEEGIQTLMGRLEDGSRRTGQILKQTYSKFDTNSHNHD ALLKNYGLLYCFRKDMDKVETFLRMVQCRSVEGSCGF >gi568815581r:63840774_64072841|GENSCAN_predicted_CDS_6|654_bp atggctccaggctcccggacgtccctgctcctggcttttgccctgctctgcctgccctgg cttcaagaggctggtgccgtccaaaccgttcccttatccaggctttttgaccacgctatg ctccaagcccatcgcgcgcaccagctggccattgacacctaccaggagtttgaagaaacc tatatcccaaaggaccagaagtattcattcctgcatgactcccagacctccttctgcttc tcagactctattccgacaccctccaacatggaggaaacgcaacagaaatccaatctagag ctgctccgcatctccctgctgctcatcgagtcgtggctggagcccgtgcggttcctcagg agtatgttcgccaacaacctggtgtatgacacctcggacagcgatgactatcacctccta aaggacctagaggaaggcatccaaacgctgatggggaggctggaagacggcagccgccgg actgggcagatcctcaagcagacctacagcaagtttgacacaaactcgcacaaccatgac gcactgctcaagaactacgggctgctctactgcttcaggaaggacatggacaaggtcgag acattcctgcgcatggtgcagtgccgctctgtggagggcagctgtggcttctag >gi568815581r:63840774_64072841|GENSCAN_predicted_peptide_7|199_aa MAAGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFKEAMLQAHRAHQLAIDTYQEFISS WGMDSIPTSSNMEETQQKSNLELLHISLLLIESRLEPVRFLRSTFTNNLVYDTSDSDDYH LLKDLEEGIQMLMGRLEDGSHLTGQTLKQTYSKFDTNSHNHDALLKNYGLLHCFRKDMDK VETFLRMVQCRSVEGSCGF >gi568815581r:63840774_64072841|GENSCAN_predicted_CDS_7|600_bp atggctgcaggctcccggacgtccctgctcctggcttttgccctgctctgcctgccctgg cttcaagaggctggtgccgtccaaaccgttcccttatccaggctttttaaagaggctatg ctccaagcccatcgcgcacaccagctggccattgacacctaccaggagtttataagctct tggggaatggactctattccgacatcctccaacatggaggaaacgcagcagaaatccaac ttagagctgctccacatctccctgctgctcatcgagtcgcggctggagcccgtgcggttc ctcaggagtaccttcaccaacaacctggtgtatgacacctcggacagcgatgactatcac ctcctaaaggacctagaggaaggcatccaaatgctgatggggaggctggaagacggcagc cacctgactgggcagaccctcaagcagacctacagcaagtttgacacaaactcgcacaac catgacgcactgctcaagaactacgggctgctccactgcttcaggaaggacatggacaag gtcgagacattcctgcgcatggtgcagtgccgctctgtggagggcagctgtggcttctag >gi568815581r:63840774_64072841|GENSCAN_predicted_peptide_8|437_aa MARLALSPVPSHWMVALLLLLSAAEPVPAARSEDRYRNPKGSACSRIWQSPRFIARKRGF TVKMHCYMNSASGNVSWLWKQEMDENPQQLKLEKGRMEESQNESLATLTIQGIRFEDNGI YFCQQKCNNTSEVYQGCGTELRVMGFSTLAQLKQRNTLKDGIIMIQTLLIILFIIVPIFL LLDKDDSKAGMEEDHTYEGLDIDQTATYEDIVTLRTGEVKWSVGSRTSLLLAFGLLCLPW LQEGSAFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCF SESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLL KDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVE TFLRIVQCRSVEGSCGF >gi568815581r:63840774_64072841|GENSCAN_predicted_CDS_8|1314_bp atggccaggctggcgttgtctcctgtgcccagccactggatggtggcgttgctgctgctg ctctcagcagctgagccagtaccagcagccagatcggaggaccggtaccggaatcccaaa ggtagtgcttgttcgcggatctggcagagcccacgtttcatagccaggaaacggggcttc acggtgaaaatgcactgctacatgaacagcgcctccggcaatgtgagctggctctggaag caggagatggacgagaatccccagcagctgaagctggaaaagggccgcatggaagagtcc cagaacgaatctctcgccaccctcaccatccaaggcatccggtttgaggacaatggcatc tacttctgtcagcagaagtgcaacaacacctcggaggtctaccagggctgcggcacagag ctgcgagtcatgggattcagcaccttggcacagctgaagcagaggaacacgctgaaggat ggtatcatcatgatccagacgctgctgatcatcctcttcatcatcgtgcctatcttcctg ctgctggacaaggatgacagcaaggctggcatggaggaagatcacacctacgagggcctg gacattgaccagacagccacctatgaggacatagtgacgctgcggacaggggaagtgaag tggtctgtaggctcccggacgtccctgctcctggcttttggcctgctctgcctgccctgg cttcaagagggcagtgccttcccaaccattcccttatccaggctttttgacaacgctatg ctccgcgcccatcgtctgcaccagctggcctttgacacctaccaggagtttgaagaagcc tatatcccaaaggaacagaagtattcattcctgcagaacccccagacctccctctgtttc tcagagtctattccgacaccctccaacagggaggaaacacaacagaaatccaacctagag ctgctccgcatctccctgctgctcatccagtcgtggctggagcccgtgcagttcctcagg agtgtcttcgccaacagcctggtgtacggcgcctctgacagcaacgtctatgacctccta aaggacctagaggaaggcatccaaacgctgatggggaggctggaagatggcagcccccgg actgggcagatcttcaagcagacctacagcaagttcgacacaaactcacacaacgatgac gcactactcaagaactacgggctgctctactgcttcaggaaggacatggacaaggtcgag acattcctgcgcatcgtgcagtgccgctctgtggagggcagctgtggcttctag >gi568815581r:63840774_64072841|GENSCAN_predicted_peptide_9|1078_aa MPQVLNLFLALLLSSFSADSLAASDEDGEMNNLQIAIGRIKLGIGFAKAFLLGLLHGKIL SPKDIMLSLGEADGAGEAGEAGETAPEDEKKEPPEEDLKKDNHILNHMGLADGPPSSLEL DHLNFINNPYLTIQVPIASEESDLEMPTEEETDTFSEPEDSKKPPQPLYDGNSSVCSTAD YKPPEEDPEEQAEENPEGEQPEECFTEGLQPLQGAGYTRTKSTRTWGCLAIPVPHVSRAC VQRWPCLYVDISQGRGKKWWTLRRACFKIVEHNWFETFIVFMILLSSGALAFEDIYIEQR RVIRTILEYADKVFTYIFIMEMLLKWVAYGFKVYFTNAWCWLDFLIVDVSIISLVANWLG YSELGPIKSLRTLRALRPLRALSRFEGMRVVVNALLGAIPSIMNVLLVCLIFWLIFSIMG VNLFAGKFYYCINTTTSERFDISEVNNKSECESLMHTGQVRWLNVKVNYDNVGLGYLSLL QVATFKGWMDIMYAAVDSREKEEQPQYEVNLYMYLYFVIFIIFGSFFTLNLFIGVIIDNF NQQKKKMRGKDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPQNKIQGMVYDLVTKQAFDIT IMILICLNMVTMMVETDNQSQLKVDILYNINMIFIIIFTGECVLKMLALRQYYFTVGWNI FDFVVVILSIVGLALSDLIQKYFVSPTLFRVIRLARIGRVLRLIRGAKGIRTLLFALMMS LPALFNIGLLLFLVMFIYSIFGMSNFAYVKKESGIDDMFNFETFGNSIICLFEITTSAGW DGLLNPILNSGPPDCDPNLENPGTSVKGDCGNPSIGICFFCSYIIISFLIVVNMYIAIIL ENFNVATEESSEPLGEDDFEMFYETWEKFDPDATQFIAYSRLSDFVDTLQEPLRIAKPNK IKLITLDLPMVPGDKIHCLDILFALTKEVLGDSGEMDALKQTMEEKFMAANPSKVSYEPI TTTLKRKHEEVCAIKIQRAYRRHLLQRSMKQASYMYRHSHDGSGDDAPEKEGLLANTMSK MYGHENGNSSSPSPEEKGEAGDAGPTMGLMPISPSDTAWPPAPPPGQTVRPGVKESLV >gi568815581r:63840774_64072841|GENSCAN_predicted_CDS_9|3237_bp atgccccaggtcctgaacctgttcctggctctgctgctgagctccttcagcgccgacagt ctggcagcctcggatgaggatggcgagatgaacaacctgcagattgccatcgggcgcatc aagttgggcatcggctttgccaaggccttcctcctggggctgctgcatggcaagatcctg agccccaaggacatcatgctcagcctcggggaggctgacggggccggggaggctggagag gcgggggagactgcccccgaggatgagaagaaggagccgcccgaggaggacctgaagaag gacaatcacatcctgaaccacatgggcctggctgacggccccccatccagcctcgagctg gaccaccttaacttcatcaacaacccctacctgaccatacaggtgcccatcgcctccgag gagtccgacctggagatgcccaccgaggaggaaaccgacactttctcagagcctgaggat agcaagaagccgccgcagcctctctatgatgggaactcgtccgtctgcagcacagctgac tacaagccccccgaggaggaccctgaggagcaggcagaggagaaccccgagggggagcag cctgaggagtgcttcactgagggcctgcagcccctgcaaggggcaggctacaccaggacc aaaagcacacggacctggggctgcctggccatccccgtgcctcacgtgtctagggcctgc gtgcagcgctggccctgcctctacgtggacatctcccagggccgtgggaagaagtggtgg actctgcgcagggcctgcttcaagattgtcgagcacaactggttcgagaccttcattgtc ttcatgatcctgctcagcagtggggctctggccttcgaggacatctacattgagcagcgg cgagtcattcgcaccatcctagaatatgccgacaaggtcttcacctacatcttcatcatg gagatgctgctcaaatgggtggcctacggctttaaggtgtacttcaccaacgcctggtgc tggctcgacttcctcatcgtggatgtctccatcatcagcttggtggccaactggctgggc tactcggagctgggacccatcaaatccctgcggacactgcgggccctgcgtcccctgagg gcactgtcccgattcgagggcatgagggtggtggtgaacgccctcctaggcgccatcccc tccatcatgaatgtgctgcttgtctgcctcatcttctggctgatcttcagcatcatgggt gtcaacctgtttgccggcaagttctactactgcatcaacaccaccacctctgagaggttc gacatctccgaggtcaacaacaagtctgagtgcgagagcctcatgcacacaggccaggtc cgctggctcaatgtcaaggtcaactacgacaacgtgggtctgggctacctctccctcctg caggtggccaccttcaagggttggatggacatcatgtatgcagccgtggactcccgggag aaggaggagcagccgcagtacgaggtgaacctctacatgtacctctactttgtcatcttc atcatctttggctccttcttcaccctcaacctcttcattggcgtcatcattgacaacttc aaccagcagaagaagaagatgagggggaaagacatctttatgacggaggaacagaagaaa tactataacgccatgaagaagcttggctccaagaagcctcagaagccaattccccggccc cagaacaagatccagggcatggtgtatgacctcgtgacgaagcaggccttcgacatcacc atcatgatcctcatctgcctcaacatggtcaccatgatggtggagacagacaaccagagc cagctcaaggtggacatcctgtacaacatcaacatgatcttcatcatcatcttcacaggg gagtgcgtgctcaagatgctcgccctgcgccagtactacttcaccgttggctggaacatc tttgacttcgtggtcgtcatcctgtccattgtgggccttgccctctctgacctgatccag aagtacttcgtgtcacccacgctgttccgtgtgatccgcctggcgcggattgggcgtgtc ctgcggctgatccgcggggccaagggcatccggacgctgctgttcgccctcatgatgtcg ctgcctgccctcttcaacatcggcctcctcctcttcctggtcatgttcatctactccatc ttcggcatgtccaactttgcctacgtcaagaaggagtcgggcatcgatgatatgttcaac ttcgagaccttcggcaacagcatcatctgcctgttcgagatcaccacgtcggccggctgg gacgggctcctcaaccccatcctcaacagcgggcccccagactgtgaccccaacctggag aacccgggcaccagtgtcaagggtgactgcggcaacccctccatcggcatctgcttcttc tgcagctatatcatcatctccttcctcatcgtggtcaacatgtacatcgccatcatcctg gagaacttcaatgtggccacagaggagagcagcgagccccttggtgaagatgactttgag atgttctacgagacatgggagaagttcgaccccgacgccacccagttcatcgcctacagc cgcctctcagacttcgtggacaccctgcaggaaccgctgaggattgccaagcccaacaag atcaagctcatcacactggacttgcccatggtgccaggggacaagatccactgcctggac atcctctttgccctgaccaaagaggtcctgggtgactctggggaaatggacgccctcaag cagaccatggaggagaagttcatggcagccaacccctccaaggtgtcctacgagcccatc accaccaccctcaagaggaagcacgaggaggtgtgcgccatcaagatccagagggcctac cgccggcacctgctacagcgctccatgaagcaggcatcctacatgtaccgccacagccac gacggcagcggggatgacgcccctgagaaggaggggctgcttgccaacaccatgagcaag atgtatggccacgagaatgggaacagcagctcgccaagcccggaggagaagggcgaggca ggggacgccggacccactatggggctgatgcccatcagcccctcagacactgcctggcct cccgcccctcccccagggcagactgtgcgcccaggtgtcaaggagtctcttgtctag >gi568815581r:63840774_64072841|GENSCAN_predicted_peptide_10|1129_aa MAEGKEEQVTSYVDGSRHTEIVGQRTTHAASDGTTPAHLPCQGWQTQAEAPRSSSLRPWQ TSLINHSNLFAQSLAATFMVSTYISRWPISGSTSALRVKSLCQSHFDLTPQPGPSSWPIQ RSPEHSQLWFSLCPIRLKPHSSRSLGPGGSRRHWGVDPKSQGHKESGWFAVGPEGSRCHF WGKCGGGARGDALCPAPVQQQHLAGTSPSPDRVCTPTSPARPLGKQEDARMARPSLCTLV PLGPECLRPFTRESLAAIEQRAVEEEARLQRNKQMEIEEPERKPRSDLEAGKNLPMIYGD PPPEVIGIPLEDLDPYYSNKKTFIVLNKGKAIFRFSATPALYLLSPFSVVRRGAIKVLIH AYPAQDGRGGRGIYTFESLIKILARGFCVDDFTFLRDPWNWLDFSVIMMAYLTEFVDLGN ISALRTFRVLRALKTITVIPGLKTIVGALIQSVKKLSDVMILTVFCLSVFALVGLQLFMG NLRQKCVRWPPPFNDTNTTWYSNDTWYGNDTWYGNEMWYGNDSWYANDTWNSHASWATND TFDWDAYISDEGNFYFLEGSNDALLCGNSSDAGHCPEGYECIKTGRNPNYGYTSYDTFSW AFLALFRLMTQDYWENLFQLTLRAAGKTYMIFFVVIIFLGSFYLINLILAVVAMAYAEQN EATLAEDKEKEEEFQQMLEKFKKHQEELEKVWTQGKEIEDPGFYLPSAGAATGFSLGGRG PGRGAATLKDSGRSSEGQLVHPSGSELDFLPVTLAGLQAKAAQALEGGEADGDPAHGKDC NGSLDTSQGEKGAPRQSSSGDSGISDAMEGRGGAWLDWIQEDRDAMEELEEAHQKCPPWW YKCAHKVLIWNCCAPWLKFKNIIHLIVMDPFVDLGITICIVLNTLFMAMEHYPMTEHFDN VLTVGNLVFTGIFTAEMVLKLIAMDPYEYFQQGWNIFDSIIVTLSLVELGLANVQGLSVL RSFRLLRVFKLAKSWPTLNMLIKIIGNSVGALGNLTLVLAIIVFIFAVVGMQLFGKSYKE CVCKIALDCNLPRWHMHDFFHSFLIVFRILCGEWIETMWDCMEVAGQAMCLTVFLMVMVI GNLVASEKQVSARGHQGTLPESVRTRIPCTYGTWSLPAGNCEEEEEVTR >gi568815581r:63840774_64072841|GENSCAN_predicted_CDS_10|3390_bp atggcggaaggcaaggaggagcaagtcacatcttacgtggatggcagcagacacacggaa attgtgggccagcggaccacacatgctgccagtgatggcaccaccccagcccacctgccc tgccagggatggcaaacacaggcagaggcaccacggtcttcctccttgcgcccctggcag acatcactaatcaatcacagcaacctttttgcacagagcctggctgccaccttcatggtc tccacctacatctccaggtggccaatatcaggatccacgtcggccctcagggtgaagtcc ctgtgccagtcccactttgatctaaccccccaaccaggcccctcatcctggcctatccag aggtcgcctgagcacagccagctgtggttctctctctgtcccattaggctgaagccacac agttcccggagcttgggcccaggaggcagtcgaaggcactggggtgttgaccccaagtcc caaggccacaaagaatctggttggtttgccgtcggtcctgagggcagccgctgccacttc tggggaaagtgtggtgggggggccaggggtgatgctctgtgcccagcacctgtgcagcag cagcaccttgcaggcacatctcccagtcctgatagagtctgcacgcccacctccccggct cggccactgggcaagcaggaggatgcgaggatggccagaccatctctgtgcaccctggtg cctctgggccctgagtgcttgcgccccttcacccgggagtcactggcagccatagaacag cgggcggtggaggaggaggcccggctgcagcggaataagcagatggagattgaggagccc gaacggaagccacgaagtgacttggaggctggcaagaacctacccatgatctacggagac cccccgccggaggtcatcggcatccccctggaggacctggatccctactacagcaataag aagaccttcatcgtactcaacaagggcaaggccatcttccgcttctccgccacacctgct ctctacctgctgagccccttcagcgtagtcaggcgcggggccatcaaggtgctcatccat gcatatcctgcccaagatgggagaggcgggagggggatctacacctttgagtccctcatc aagatactggcccgaggcttctgtgtcgacgacttcacattcctccgggacccctggaac tggctggacttcagtgtcatcatgatggcgtacctgacagagtttgtggacttgggcaac atctcagccctgaggaccttccgggtgctgcgggccctcaaaaccatcacggtcatccca gggctgaagacgatcgtgggggccctgatccagtcggtgaaaaagctgtcggatgtgatg atcctcactgtcttctgcctgagcgtctttgcgctggtaggactgcagctcttcatggga aacctgaggcagaagtgtgtgcgctggcccccgccgttcaacgacaccaacaccacgtgg tacagcaatgacacgtggtacggcaatgacacatggtatggcaatgagatgtggtacggc aatgactcatggtatgccaacgacacgtggaacagccatgcaagctgggccaccaacgat acctttgattgggacgcctacatcagtgatgaagggaacttctacttcctggagggctcc aacgatgccctgctctgtgggaacagcagtgatgctgggcactgccctgagggttatgag tgcatcaagaccgggcggaaccccaactatggctacaccagctatgacaccttcagctgg gccttcttggctctcttccgcctcatgacacaggactattgggagaacctcttccagctg acccttcgagcagctggcaagacctacatgatcttcttcgtggtcatcatcttcctgggc tctttctacctcatcaatctgatcctggccgtggtggccatggcatatgccgagcagaat gaggccaccctggccgaggataaggagaaagaggaggagtttcagcagatgcttgagaag ttcaaaaagcaccaggaggagctggagaaggtctggactcagggaaaggagatagaggac ccagggttctacctgccctcagccggggctgccactggcttctccctggggggccgaggg ccaggaagaggggctgccaccctgaaggactctggaagaagctctgagggtcaattagtc catccctctggctcagagctggacttcctgccagtcactcttgctggtctacaggccaag gccgcccaagctctggaaggtggggaggcagatggggacccagcccatggcaaagactgc aatggcagcctggacacatcgcaaggggagaagggagccccgaggcagagcagcagcgga gacagcggcatctccgacgccatggaaggtaggggtggagcctggctggactggattcag gaggatagggatgccatggaagaactggaagaggcccaccaaaagtgcccaccatggtgg tacaagtgcgcccacaaagtgctcatatggaactgctgcgccccgtggctgaagttcaag aacatcatccacctgatcgtcatggacccgttcgtggacctgggcatcaccatctgcatc gtgctcaacaccctcttcatggccatggaacattaccccatgacggagcactttgacaac gtgctcactgtgggcaacctggtcttcacaggcatcttcacagcagagatggttctgaag ctgattgccatggacccctacgagtatttccagcagggttggaatatcttcgacagcatc atcgtcaccctcagcctggtagagctaggcctggccaacgtacagggactgtctgtgcta cgctccttccgtctgctgcgggtcttcaagctggccaagtcgtggccaacgctgaacatg ctcatcaagatcattggcaattcagtgggggcgctgggtaacctgacgctggtgctggct atcatcgtgttcatcttcgccgtggtgggcatgcagctgtttggcaagagctacaaggag tgcgtgtgcaagattgccttggactgcaacctgccgcgctggcacatgcatgatttcttc cactccttcctcatcgtcttccgcatcctgtgcggggagtggatcgagaccatgtgggac tgcatggaggtggccggccaagccatgtgcctcaccgtcttcctcatggtcatggtcatc ggcaatcttgtggcttcagagaagcaggtgtctgccagaggccaccagggcaccctgcca gagagtgtgaggacccgcatcccctgcacctatggcacttggtcacttcctgcgggtaac tgtgaagaagaagaggaagtgactaggtaa >gi568815581r:63840774_64072841|GENSCAN_predicted_peptide_11|491_aa MIGSFLRPSKKQMLLRFLYSLQNCSPWLVPASPWRLPEMSSFGYRTLTVALFTLICCPGS DEKVFEVHVRPKKLAVEPKGSLEVNCSTTCNQPEVGGLETSLDKILLDEQAQWKHYLVSN ISHDTVLQCHFTCSGKQESMNSNVSVYQPPRQVILTLQPTLVAVGKSFTIECRVPTVEPL DSLTLFLFRGNETLHYETFGKAAPAPQEATATFNSTADREDGHRNFSCLAVLDLMSRGGN IFHKHSAPKMLEIYEPVSDSQMVIIVTVVSVLLSLFVTSVLLCFIFGQHLRQQRMGTYGF FSPLNRNPRWIRIPGPTGERPRRPTHTHLRAGRGRPGLQRSSHQATQQQLVHLRVLQHHQ FQQYLSLGEQACGHYFTVPMGLMASTAINKLRSRHWGHSPPTSFLHTCVKETPWLLQTHL ETPKTKEIVFPLISVHITPREGLGLVQLESCPQVLDEPLWPKGLARDREQRPSDYRGGDQ QSQMVTSSKVN >gi568815581r:63840774_64072841|GENSCAN_predicted_CDS_11|1476_bp atgatcgggagcttcctgaggccttccaagaagcagatgctgctacgcttcctgtacagc ctgcagaactgcagcccttggctggtccctgcgagcccgtggagactgccagagatgtcc tctttcggttacaggaccctgactgtggccctcttcaccctgatctgctgtccaggatcg gatgagaaggtattcgaggtacacgtgaggccaaagaagctggcggttgagcccaaaggg tccctcgaggtcaactgcagcaccacctgtaaccagcctgaagtgggtggtctggagacc tctctagataagattctgctggacgaacaggctcagtggaaacattacttggtctcaaac atctcccatgacacggtcctccaatgccacttcacctgctccgggaagcaggagtcaatg aattccaacgtcagcgtgtaccagcctccaaggcaggtcatcctgacactgcaacccact ttggtggctgtgggcaagtccttcaccattgagtgcagggtgcccaccgtggagcccctg gacagcctcaccctcttcctgttccgtggcaatgagactctgcactatgagaccttcggg aaggcagcccctgctccgcaggaggccacagccacattcaacagcacggctgacagagag gatggccaccgcaacttctcctgcctggctgtgctggacttgatgtctcgcggtggcaac atctttcacaaacactcagccccgaagatgttggagatctatgagcctgtgtcggacagc cagatggtcatcatagtcacggtggtgtcggtgttgctgtccctgttcgtgacatctgtc ctgctctgcttcatcttcggccagcacttgcgccagcagcggatgggcacctacgggttc ttttcccccctgaacaggaaccccaggtggatccgcatccctggacccacaggtgagcga ccccggcggcccacccacacgcacctgagggcagggcgaggcaggccggggctgcagcgc tccagccaccaggcgactcagcagcagctggtgcatctgcgcgttctgcagcatcatcag ttccagcagtacctgtctctaggggagcaggcttgtgggcattactttactgtccccatg ggtcttatggcttcaacagccattaacaagctgagatccaggcactggggtcacagcccc ccaacaagcttcctccacacatgtgtgaaggagacaccatggctccttcagacacattta gaaactcccaagacgaaagagattgtctttcctctaatctcagtacacataactcccagg gaaggactaggattggttcagcttgagtcttgtccccaggttttggatgaaccactatgg cccaaaggattggccagagacagggagcagcggcccagtgattatcggggtggtgaccag cagtcccagatggtgaccagcagcaaagtgaactga >gi568815581r:63840774_64072841|GENSCAN_predicted_peptide_12|859_aa XKKQDIWYVIDLLTGEKQQTLSSAFADSLCPSTSLLYLGRTEYTITMYDTKTRELRWNAT YFDYAASLPEDDVDYKMSHFVSNGDGLVVTVDSESGDVLWIQNYASPVVAFYVWQREGLR KVMHINVAVETLRYLTFMSGEVGRITKWKYPFPKETEAKSKLTPTLYVGKYSTSLYASPS MVHEGVAVVPRGSTLPLLEGPQTDGVTIGDKGECVITPSTDVKFDPGLKSKNKLNYLRNY WLLIGHHETPLSASTKMLERFPNNLPKHRENVIPADSEKKSFEEVINLVDQTSENAPTTV SRDVEEKPAHAPARPEAPVDSMLKDMATIILSTFLLIGWVAFIITYPLSMHQQQQLQHQQ FQKELEKIQLLQQQQQQLPFHPPGDTAQDGELLDTSGPYSESSGTSSPSTSPRASNHSLC SGSSASKAGSSPSLEQDDGDEETSVVIVGKISFCPKDVLGHGAEGTIVYRGMFDNRDVAV KRILPECFSFADREVQLLRESDEHPNVIRYFCTEKDRQFQYIAIELCAATLQEYVEQKDF AHLGLEPITLLQQTTSGLAHLHSLNIVHRDLKPHNILISMPNAHGKIKAMISDFGLCKKL AVGRHSFSRRSGVPGTEGWIAPEMLSEDCKENPTYTVDIFSAGCVFYYVISEGSHPFGKS LQRQANILLGACSLDCLHPEKHEDVIARELIEKMIAMDPQKRPSAKHVLKHPFFWSLEKQ LQFFQDVSDRIEKESLDGPIVKQLERGGRAVVKMDWRENITVPLQTDLRKFRTYKGGSVR DLLRAMRNKKHHYRELPAEVRETLGSLPDDFVCYFTSRFPHLLAHTYRAMELCSHERLFQ PYYFHEPPEPQPPVTPDAL >gi568815581r:63840774_64072841|GENSCAN_predicted_CDS_12|2580_bp ngtaaaaagcaggacatctggtatgttattgacctcctgaccggagagaagcagcagact ttgtcatcggcctttgcagatagtctctgcccatcaacctctcttctgtatcttgggcga acagaatacaccatcaccatgtacgacaccaaaacccgagagctccggtggaatgccacc tactttgactatgcggcctcactgcctgaggacgacgtggactacaagatgtcccacttt gtgtccaatggtgatgggctggtggtgactgtggacagtgaatctggggacgtcctgtgg atccaaaactacgcctcccctgtggtggccttttatgtctggcagcgggagggtctgagg aaggtgatgcacatcaatgtcgctgtggagaccctgcgctatctgaccttcatgtctggg gaggtggggcgcatcacaaagtggaagtacccgttccccaaggagacagaggccaagagc aagctgacgcccactctgtatgttgggaaatactctaccagcctctatgcctctccctca atggtacacgagggggttgctgtcgtgccccgcggcagcacacttcctttgctggaaggg ccccagactgatggcgtcaccattggggacaagggggagtgtgtgatcacgcccagcacg gacgtcaagtttgatcccggactcaaaagcaagaacaagctcaactacttgaggaattac tggcttctgataggacaccatgaaaccccactgtctgcgtctaccaagatgctggagaga tttcccaacaatctacccaaacatcgggaaaatgtgattcctgctgattcagagaaaaag agctttgaggaagttatcaacctggttgaccagacttcagaaaacgcacctaccaccgtg tctcgggatgtggaggagaagcccgcccatgcccctgcccggcccgaggcccccgtggac tccatgcttaaggacatggctaccatcatcctgagcaccttcctgctgattggctgggtg gccttcatcatcacctatcccctgagcatgcatcagcagcagcagctccagcaccagcag ttccagaaggaactggagaagatccagctcctgcagcagcagcagcagcagctgcccttc cacccacctggagacacggctcaggacggcgagctcctggacacgtctggcccgtactca gagagctcgggcaccagcagccccagcacgtcccccagggcctccaaccactcgctctgc tccggcagctctgcctccaaggctggcagcagcccctccctggaacaagacgatggagat gaggaaaccagcgtggtgatagttgggaaaatttccttctgtcccaaggatgtcctgggc catggagctgagggcacaattgtgtaccggggcatgtttgacaaccgcgacgtggccgtg aagaggatcctccccgagtgttttagcttcgcagaccgtgaggtccagctgttgcgagaa tcggatgagcacccgaacgtgatccgctacttctgcacggagaaggaccggcaattccag tacattgccatcgagctgtgtgcagccaccctgcaagagtatgtggagcagaaggacttt gcgcatctcggcctggagcccatcaccttgctgcagcagaccacctcgggcctggcccac ctccactccctcaacatcgttcacagagacctaaagccacacaacatcctcatatccatg cccaatgcacacggcaagatcaaggccatgatctccgactttggcctctgcaagaagctg gcagtgggcagacacagtttcagccgccgatctggggtgcctggcacagaaggctggatc gctccagagatgctgagcgaagactgtaaggagaaccctacctacacggtggacatcttt tctgcaggctgcgtcttttactacgtaatctctgagggcagccacccttttggcaagtcc ctgcagcggcaggccaacatcctcctgggtgcctgcagccttgactgcttgcacccagag aagcacgaagacgtcattgcacgtgaattgatagagaagatgattgcgatggatcctcag aaacgcccctcagcgaagcatgtgctcaaacacccgttcttctggagcctagagaagcag ctccagttcttccaggacgtgagcgacagaatagaaaaggaatccctggatggcccgatc gtgaagcagttagagagaggcgggagagccgtggtgaagatggactggcgggagaacatc actgtccccctccagacagacctgcgtaaattcaggacctataaaggtggttctgtcaga gatctcctccgagccatgagaaataagaagcaccactaccgggagctgcctgcagaggtg cgggagacgctggggtccctccccgacgacttcgtgtgctacttcacatctcgcttcccc cacctcctcgcacacacctaccgggccatggagctgtgcagccacgagagactcttccag ccctactacttccacgagcccccagagccccagcccccagtgactccagacgccctctga