GENSCAN 1.0	Date run: 6-Nov-116	Time: 12:10:57

Sequence gi568815579r:45251632_45470232 : 218601 bp : 52.72% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   7358   7558  201  1  0   65  105   317 0.995  31.10
 1.02 Intr +  11482  11535   54  0  0  143  100    84 0.999  14.76
 1.03 Intr +  11688  11736   49  2  1   69   85    97 0.999   6.24
 1.04 Intr +  13053  13118   66  1  0   62   93   143 0.955  11.57
 1.05 Intr +  13209  13279   71  1  2  121   47   137 0.880  12.29
 1.06 Intr +  14594  14650   57  1  0  112  121     8 0.980   6.07
 1.07 Intr +  19841  20077  237  1  0  130   81   519 0.995  53.74
 1.08 Intr +  26292  26411  120  2  0   91   83   131 0.977  14.09
 1.09 Intr +  26885  26984  100  1  1   88   73   223 0.840  20.98
 1.10 Intr +  28743  28852  110  1  2   87   55   127 0.997   9.80
 1.11 Intr +  28944  29103  160  2  1   60   65   115 0.992   6.47
 1.12 Intr +  31835  31885   51  0  0   87   99    -2 0.374   0.17
 1.13 Intr +  35816  36033  218  0  2   88   86   248 0.997  23.35
 1.14 Intr +  42718  42821  104  0  2  126   96     2 0.242   4.27
 1.15 Intr +  46045  46323  279  1  0  143  105    78 0.931  12.13
 1.16 Intr +  48180  48224   45  0  0  117  113    35 0.025   6.91
 1.17 Term +  50743  51079  337  1  1  104   52   416 0.605  34.00
 1.18 PlyA +  51710  51715    6                               1.05

 2.09 PlyA -  52030  52025    6                               1.05
 2.08 Term -  55297  55119  179  2  2  139   48   357 0.929  34.97
 2.07 Intr -  56019  55830  190  0  1   88   96   262 0.919  26.68
 2.06 Intr -  56901  56778  124  2  1   58   98   188 0.874  17.89
 2.05 Intr -  60289  60118  172  2  1  126   81   418 0.980  44.42
 2.04 Intr -  63966  63834  133  0  1   76  105   254 0.999  26.62
 2.03 Intr -  66348  66194  155  1  2  100   71   282 0.693  28.00
 2.02 Intr -  68100  67890  211  0  1   82   99   340 0.365  33.51
 2.01 Init -  76907  76857   51  2  0   83   53    40 0.559   1.10
 2.00 Prom -  78108  78069   40                              -2.91

 3.03 PlyA -  78135  78130    6                               1.05
 3.02 Term -  81219  81067  153  0  0  -16   46   247 0.538   8.33
 3.01 Init -  81493  81221  273  1  0   99  -20   367 0.575  23.97
 3.00 Prom -  87737  87698   40                               0.39

 4.00 Prom +  88351  88390   40                              -1.81
 4.01 Init +  93911  94168  258  1  0   71   80   312 0.025  25.83
 4.02 Intr +  94913  95143  231  1  0   72   50   509 0.998  44.00
 4.03 Intr +  95816  95885   70  1  1   91   98    43 0.983   4.85
 4.04 Intr +  96310  96529  220  2  1   48  105   375 0.965  33.09
 4.05 Intr +  97015  97102   88  1  1   89  101   162 0.998  18.07
 4.06 Intr +  97189  97290  102  0  0  121   26   211 0.999  19.07
 4.07 Intr +  97798  97971  174  0  0  106   81   363 0.992  38.05
 4.08 Intr +  98710  98800   91  0  1   92   97    87 0.999  10.07
 4.09 Intr +  98883  98920   38  1  2  130   94    34 0.990   6.67
 4.10 Intr +  99010  99116  107  0  2   99  100    92 0.999  11.01
 4.11 Intr +  99323  99428  106  2  1  113   69    62 0.625   7.52
 4.12 Term +  99655  99726   72  0  0  100   39    99 0.987   4.40
 4.13 PlyA +  99863  99868    6                              -1.95

 5.35 PlyA -  99981  99976    6                              -3.94
 5.34 Term - 100090  99998   93  1  0  130   48   118 0.997  10.13
 5.33 Intr - 100721 100578  144  2  0   93   76   284 0.999  28.69
 5.32 Intr - 101018 100875  144  2  0  106   56   300 0.999  29.59
 5.31 Intr - 101185 101115   71  2  2   97  113   150 0.999  17.79
 5.30 Intr - 101524 101452   73  1  1   92   97   140 0.999  14.77
 5.29 Intr - 101703 101611   93  0  0  118   90   138 0.999  17.66
 5.28 Intr - 103220 103099  122  0  2   54  101   278 0.999  26.42
 5.27 Intr - 104097 104034   64  0  1  110   64   111 0.669   9.68
 5.26 Intr - 105740 105639  102  2  0   85  101   147 0.994  16.57
 5.25 Intr - 105912 105843   70  2  1  120   74   125 0.999  13.98
 5.24 Intr - 106068 105984   85  1  1  124   93   149 0.658  18.38
 5.23 Intr - 110011 109893  119  0  2  120  121   187 0.999  25.81
 5.22 Intr - 112280 112112  169  0  1   62   80   322 0.995  28.32
 5.21 Intr - 112488 112355  134  2  2  114   84   255 0.993  28.50
 5.20 Intr - 112700 112604   97  0  1  104   75   206 0.997  20.47
 5.19 Intr - 112916 112793  124  2  1   96   73   302 0.999  30.16
 5.18 Intr - 113323 113207  117  1  0  105   42   175 0.662  15.77
 5.17 Intr - 113527 113411  117  1  0   58   17   123 0.909   3.37
 5.16 Intr - 117112 116999  114  1  0    6   83   180 0.994  10.45
 5.15 Intr - 117361 117299   63  1  0   64  121    85 0.992   8.81
 5.14 Intr - 117516 117439   78  0  0   51   70    93 0.936   4.04
 5.13 Intr - 118601 118502  100  1  1   38   98   316 0.993  28.21
 5.12 Intr - 119103 119030   74  0  2   20   71    34 0.038  -6.40
 5.11 Intr - 131095 130896  200  2  2   94   72   420 0.953  40.59
 5.10 Intr - 134097 133931  167  2  2   41   80   363 0.903  30.92
 5.09 Intr - 134326 134193  134  1  2   64   93   285 0.999  26.55
 5.08 Intr - 134549 134397  153  2  0  100   64   248 0.520  24.38
 5.07 Intr - 140709 140249  461  1  2   94  115   435 0.952  40.29
 5.06 Intr - 144255 143805  451  0  1  126  113   298 0.932  29.55
 5.05 Intr - 144628 144537   92  2  2   88  100    78 0.902   9.21
 5.04 Intr - 144805 144707   99  2  0   77   92   214 0.900  21.28
 5.03 Intr - 145427 144914  514  2  1   83   61   529 0.988  42.83
 5.02 Intr - 146516 146374  143  0  2   63   98   176 0.955  16.68
 5.01 Init - 146714 146633   82  2  1  102   58    99 0.752   8.65
 5.00 Prom - 147185 147146   40                              -4.01

 6.00 Prom + 150497 150536   40                              -4.11
 6.01 Init + 150931 150991   61  0  1   49   39    79 0.570  -1.51
 6.02 Intr + 151094 151135   42  0  0   94   78    18 0.340   0.10
 6.03 Intr + 151206 151326  121  1  1   95    9    89 0.491   1.76
 6.04 Intr + 154709 155349  641  2  2   72   19   208 0.624   4.63
 6.05 Intr + 155463 155604  142  1  1  101  101   113 0.977  13.82
 6.06 Term + 156502 157870 1369  1  1  110   34  1168 0.857 105.04
 6.07 PlyA + 158110 158115    6                              -0.45

 7.12 PlyA - 158182 158177    6                               1.05
 7.11 Term - 159853 159716  138  1  0    6   48   144 0.770   0.47
 7.10 Intr - 162114 162046   69  0  0   99   69    84 0.951   7.47
 7.09 Intr - 162403 162332   72  1  0  117   55    73 0.985   7.00
 7.08 Intr - 163329 163230  100  2  1   65   76   202 0.999  17.31
 7.07 Intr - 165293 165190  104  2  2   84   78    96 0.546   7.67
 7.06 Intr - 167566 167467  100  0  1   84   91   184 0.987  18.91
 7.05 Intr - 168796 168693  104  1  2  109   99   127 0.999  15.37
 7.04 Intr - 169762 169547  216  1  0  117   94   189 0.957  21.63
 7.03 Intr - 171750 171639  112  2  1  104   46   103 0.943   8.58
 7.02 Intr - 172840 172706  135  0  0  104   77    28 0.382   3.49
 7.01 Init - 174241 174159   83  1  2   46   76    10 0.203  -4.01
 7.00 Prom - 177891 177852   40                              -1.71

 8.00 Prom + 178585 178624   40                              -3.21
 8.01 Init + 190836 190897   62  2  2   72   70    63 0.101   3.57
 8.02 Intr + 196104 196146   43  0  1  108   89    19 0.050   2.73
 8.03 Intr + 204144 204456  313  2  1   88   44   100 0.011   1.91
 8.04 Intr + 216747 217081  335  1  2   25  111   217 0.173  13.54

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr +  46512  46591   80  0  2  122   89    58 0.975   8.99
S.002 Term +  48275  48414  140  0  2  123   43    24 0.837  -0.17
S.003 Intr +  93903  94168  266  1  2   87   80   312 0.966  28.07

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815579r:45251632_45470232|GENSCAN_predicted_peptide_1|752_aa
HGTLGSGRSSDKGPSWSSRSLGARCRNSIASCPEEQPHVGNYRLLRTIGKGNFAKVKLAR
HILTGREVAIKIIDKTQLNPSSLQKLFREVRIMKGLNHPNIVKLFEVIETEKTLYLVMEY
ASAGEVFDYLVSHGRMKEKEARAKFRQIVSAVHYCHQKNIVHRDLKAENLLLDAEANIKI
ADFGFSNEFTLGSKLDTFCGSPPYAAPELFQGKKYDGPEVDIWSLGVILYTLVSGSLPFD
GHNLKELRERVLRGKYRVPFYMSTDCESILRRFLVLNPAKRCTLEQIMKDKWINIGYEGE
ELKPYTEPEEDFGDTKRIEVMVGMGYTREEIKESLTSQKYNEVTATYLLLGRKTEEGGDR
GAPGLALARVRAPSDTTNGTSSSKGTSHSKGQRSSSSTYHRQRRHSDFSTNLLSMSMNLA
ILDISGGPSPAPLHPKRSPTSTGEAELKEERLPGRKASCSTAGSGSRGLPPSSPMVSSAH
NPNKAEIPERRKDSTSTPNNLPPSMMTRRNTYVCTERPGAERPSLLPNGKENSSGTPRVP
PASPSSHSLAPPSGERSRLARGSTIRSTFHGGQVRDRRAGGGGGGGVQNGPPASPTLAHE
AAPLPAGRPRPTTNLFTKLTSKLTRRVADEPERIGGPEVTSCHLPWDQTETAPRLLRFPW
SVKLTSSRPPEALMAALRQATAAARCRCRQPQPFLLACLHGGAGGPEPLSHFEVEVCQLP
RPGLRGVLFRRVAGTALAFRTLVTRISNDLEL

>gi568815579r:45251632_45470232|GENSCAN_predicted_CDS_1|2259_bp
catggcaccttgggcagtggccgctcctcggacaaaggcccgtcctggtccagccgctca
ctgggtgcccgttgccggaactccatcgcctcctgtcccgaggagcagccccacgtgggc
aactaccgcctgctgaggaccattgggaagggcaactttgccaaagtcaagctggctcgg
cacatcctcactggtcgggaggttgccatcaagattatcgacaaaacccagctgaatccc
agcagcctgcagaagctgttccgagaagtccgcatcatgaagggcctaaaccaccccaac
atcgtgaagctctttgaggtgattgagactgagaagacgctgtacctggtgatggagtac
gcaagtgctggagaagtgtttgactacctcgtgtcgcatggccgcatgaaggagaaggaa
gctcgagccaagttccgacagattgtttcggctgtgcactattgtcaccagaaaaatatt
gtacacagggacctgaaggctgagaacctcttgctggatgccgaggccaacatcaagatt
gctgactttggcttcagcaacgagttcacgctgggatcgaagctggacacgttctgcggg
agccccccatatgccgccccggagctgtttcagggcaagaagtacgacgggccggaggtg
gacatctggagcctgggagtcatcctgtacaccctcgtcagcggctccctgcccttcgac
gggcacaacctcaaggagctgcgggagcgagtactcagagggaagtaccgggtccctttc tacatgtcaacagactgtgagagcatcctgcggagatttttggtgctgaacccagctaaa cgctgtactctcgagcaaatcatgaaagacaaatggatcaacatcggctatgagggtgag gagttgaagccatacacagagcccgaggaggacttcggggacaccaagagaattgaggtg atggtgggtatgggctacacacgggaagaaatcaaagagtccttgaccagccagaagtac aacgaagtgaccgccacctacctcctgctgggcaggaagactgaggagggtggggaccgg ggcgccccagggctggccctggcacgggtgcgggcgcccagcgacaccaccaacggaaca agttccagcaaaggcaccagccacagcaaagggcagcggagttcctcttccacctaccac cgccagcgcaggcatagcgatttctctactaatctactttctatgtcgatgaatttggct attctagacatttcaggtggcccatcccctgcacccctgcaccccaaacgcagcccgacg agcacgggggaggcggagctgaaggaggagcggctgccaggccggaaggcgagctgcagc accgcggggagtgggagtcgagggctgcccccctccagccccatggtcagcagcgcccac aaccccaacaaggcagagatcccagagcggcggaaggacagcacgagcacccccaacaac ctccctcctagcatgatgacccgcagaaacacctacgtttgcacagaacgcccgggggct gagcgcccgtcactgttgccaaatgggaaagaaaacagctcaggcaccccacgggtgccc cctgcctccccctccagtcacagcctggcacccccatcaggggagcggagccgcctggca cgtggttccaccatccgcagcaccttccatggtggccaggtccgggaccggcgggcaggg ggtgggggtggtgggggtgtgcagaatgggccccctgcctctcccacactggcccatgag gctgcacccctgcccgccgggcggccccgccccaccaccaacctcttcaccaagctgacc tccaaactgacccgaagggtcgcagacgaacctgagagaatcgggggacctgaggtcaca agttgccatctaccttgggatcaaacggaaaccgccccccggctgctccgattcccctgg agtgtgaagctgaccagctcgcgccctcctgaggccctgatggcagctctgcgccaggcc acagcagccgcccgctgccgctgccgccagccacagccgttcctgctggcctgcctgcac gggggtgcgggcgggcccgagcccctgtcccacttcgaagtggaggtctgccagctgccc cggccaggcttgcggggagttctcttccgccgtgtggcgggcaccgccctggccttccgc accctcgtcacccgcatctccaacgacctcgagctctga >gi568815579r:45251632_45470232|GENSCAN_predicted_peptide_2|404_aa MKDVILFTTHYPMDQTLVSYTATMPFGNTHNKFKLNYKPEEEYPDLSKHNNHMAKVLTLE LYKKLRDKETPSGFTVDDVIQTGVDNPGHPFIMTVGCVAGDEESYEVFKELFDPIISDRH GGYKPTDKHKTDLNHENLKGGDDLDPNYVLSSRVRTGRSIKGYTLPPHCSRGERRAVEKL SVEALNSLTGEFKGKYYPLKSMTEKEQQQLIDDHFLFDKPVSPLLLASGMARDWPDARGI WHNDNKSFLVWVNEEDHLRVISMEKGGNMKEVFRRFCVGLQKIEEIFKKAGHPFMWNQHL GYVLTCPSNLGTGLRGGVHVKLAHLSKHPKFEEILTRLRLQKRGTGGVDTAAVGSVFDVS 
NADRLGSSEVEQVQLVVDGVKLMVEMEKKLEKGQSIDDMIPAQK >gi568815579r:45251632_45470232|GENSCAN_predicted_CDS_2|1215_bp atgaaagacgttattttattcaccacccactaccctatggaccagaccctggtctcctac accgccaccatgccattcggtaacacccacaacaagttcaagctgaattacaagcctgag gaggagtaccccgacctcagcaaacataacaaccacatggccaaggtactgacccttgaa ctctacaagaagctgcgggacaaggagactccatctggcttcactgtagacgatgtcatc cagacaggagtggacaacccaggtcaccccttcatcatgaccgtgggctgcgtggctggt gatgaggagtcctacgaagttttcaaggaactctttgaccccatcatctcggatcgccac gggggctacaaacccactgacaagcacaagactgacctcaaccatgaaaacctcaagggt ggagacgacctggaccctaactacgtgctcagcagccgcgtccgcactggccgcagcatc aagggctacacgttgcccccacactgctcccgtggcgagcgccgggcggtggagaagctc tctgtggaagctctcaacagcctgacgggcgagttcaaagggaagtactaccctctgaag agcatgacggagaaggagcagcagcagctcatcgatgaccacttcctgttcgacaagccc gtgtccccgctgctgctggcctcaggcatggcccgcgactggcccgacgcccgtggcatc tggcacaatgacaacaagagcttcctggtgtgggtgaacgaggaggatcacctccgggtc atctccatggagaaggggggcaacatgaaggaggttttccgccgcttctgcgtagggctg cagaagattgaggagatctttaagaaagctggccaccccttcatgtggaaccagcacctg ggctacgtgctcacctgcccatccaacctgggcactgggctgcgtggaggcgtgcatgtg aagctggcgcacctgagcaagcaccccaagttcgaggagatcctcacccgcctgcgtctg cagaagaggggtacaggtggcgtggacacagctgccgtgggctcagtatttgacgtgtcc aacgctgatcggctgggctcgtccgaagtagaacaggtgcagctggtggtggatggtgtg aagctcatggtggaaatggagaagaagttggagaaaggccagtccattgacgacatgatc cccgcccagaagtag >gi568815579r:45251632_45470232|GENSCAN_predicted_peptide_3|141_aa MRSEGQLQSVQVFGGKKTATAVVHCKHGDGLIKVNGRLLEMIKLRTLRYKLLEPVLFLGP GKERFAGVDIHVHVKGPRFMLSVHLQSPGGLYQKYADESSKEIKDILIQYDQTLLVADPR RSESKKFGGPGARACYQKSYQ >gi568815579r:45251632_45470232|GENSCAN_predicted_CDS_3|426_bp atgcggtccgagggccagctgcaatctgtgcaggtcttcggtggcaagaaaacagccaca gctgtggtgcactgcaaacacggcgatggcctcatcaaggtgaatgggcggctcctggag atgatcaagctgcgcacactacgatacaagctgctggagccagttctctttctcggtccc ggcaaggagcgatttgctggtgtggacatccatgtccatgtgaaggggcctagatttatg ctgtcagtccatctccagagccctggtggcctataccagaaatatgcggatgagtcttcc aaggagatcaaagacatcctcatccagtatgaccagaccctgctggtagctgatccccgt 
cgctctgagtccaaaaagtttggaggtcctggtgcccgtgcttgttaccaaaaatcctac caataa >gi568815579r:45251632_45470232|GENSCAN_predicted_peptide_4|518_aa MSVQVAAPGSAGLGPERLSPEELVRQTRQVVQGLEALRAEHHGLAGHLAEALAGQGPAAG LEMLEEKQQVVSHSLEAIELGLGEAQVLLALSAHVGALEAEKQRLRSQARRLAQENVWLR EELEETQRRLRASEESVAQLEEEKRHLEFLGQLRQYDPPAESQQSESPPRRDSLASLFPS EEEERKGPEAAGAAAAQQGGYEIPARLRTLHNLVIQYAGQGRYEVAVPLCRQALEDLERS SGHCHPDVATMLNILALVYRDQNKYKEATDLLHDALQIREQTLGPEHPAVAATLNNLAVL YGKRGRYREAEPLCQRALEIREKVLGADHPDVAKQLNNLALLCQNQGKFEDVERHYARAL SIYEALGGPHDPNVAKTKNNLASAYLKQNKYQQAEELYKEILHKEDLPAPLGAPNTGTAG DAEQALRRSSSLSKIRESIRRGSEKLVSRLRGEAAAGAAGMKRAMSLNTLNVDAPRAPGT QVRGTSGSKIEEAMWFPSWHLDKAPRTLSASTQDLSPH >gi568815579r:45251632_45470232|GENSCAN_predicted_CDS_4|1557_bp atgtctgtgcaggtagcggctcctggaagtgcagggctgggcccagagcgcctgagccct gaggagctggtgcggcagacgcggcaagtggtccaggggctggaggcgctgcgggcagag caccatggcctggctgggcacctggcggaggccctggcgggacagggcccggcagccggc ttggagatgctggaggaaaagcagcaggtggtgagccactcgctggaggccatcgagctg gggctgggcgaggcccaggtgctgctggccctgtcggcacatgtgggtgcactggaggca gagaagcagcggctgcgctcgcaggcccggcggctggcccaggagaacgtgtggctgcgg gaggaactggaggagacgcagcggcggcttcgggccagcgaggagtccgtggcccagctg gaggaggagaagcgccacctggagttcctggggcagctgcgacagtacgacccaccggcg gagagccagcagtctgagtccccgcctcgccgagacagcctggcctccctgttccccagc gaggaggaggagaggaaaggtcctgaggccgcaggagcagcagctgctcagcagggtggc tatgagatccctgcccgccttcggaccctgcataacctcgtgatccagtacgcggggcag ggccgctatgaggtggcggtgcctctgtgccgccaggccttggaggacctggagcgcagc tcgggccactgccaccctgacgtggccaccatgctcaacatcctggcgctggtgtaccgg gaccagaacaagtacaaagaagccacagaccttctccatgatgccctgcagatccgggag cagacgctgggccctgagcaccccgcggtggccgccacgctcaacaacttggctgtcctc tatgggaagcgtgggcgttaccgggaggcagagcccctgtgccagcgcgctttggagatc cgagagaaggtcctgggtgctgaccacccagatgtggccaagcagctcaacaacctggcc ctgctgtgccagaaccagggcaagtttgaggacgtggagcggcactatgcccgggccctg agcatctatgaggcactgggcgggccccatgaccccaacgtggccaagaccaagaacaac ctggcctcagcctacctgaaacagaacaagtatcaacaagcggaagagctgtacaaagaa 
atcctccacaaggaggacctacccgcccctctcggtgcccccaacacaggcacagctggt gacgcagaacaggcccttcgccgcagcagctcactctccaagatccgtgagtctatcagg cgaggaagtgagaagctggtctcccggctccgaggcgaggcggcggcaggagcagccgga atgaagagagccatgtcactcaacacactgaacgtggatgctccaagggctcctgggact caggtgagggggacatctgggtcaaaaatagaggaggccatgtggtttcccagctggcac ctggacaaggcccctcggaccctcagcgccagcacccaggacctgagcccccactaa >gi568815579r:45251632_45470232|GENSCAN_predicted_peptide_5|1620_aa MQAPAPAGTMDSEAFQSARDFLDMNFQSLAMKHMDLKQMELDTAAAKVDELTKQLESLWS DSPAPPGPQAGPPSRPPRYSSSSIPEPFGSRGSPRKAATDGADTPFGRSESAPTLHPYSP LSPKGRPSSPRTPLYLQPDAYGSLDRATSPRPRAFDGAGSSLGRAPSPRPGPGPLRQQGP PTPFDFLGRAGSPRGSPLAEGPQAFFPERGPSPRPPATAYDAPASAFGSSLLGSGGSAFA PPLRAQDDLTLRRRPPKAWNESDLDVAYEKKPSQTASYERLDVFARPASPSLQLLPWRES SLDGLGGTGKDNLTSATLPRNYKVSPLASDRRSDAGSYRRSLGSAGPSGTLPRSWQPVSR IPMPPSSPQPRGAPRQRPIPLSMIFKLQNAFWEHGASRAMLPGSPLFTRAPPPKLQPQPQ PQPQPQSQPQPQLPPQPQTQPQTPTPAPQHPQQTWPPVNEGPPKPPTELEPEPEIEGLLT PVLEAGDVDEGPVARPLSPTRLQPALPPEAQSVPELEEVARVLAEIPRPLKRRGSMEQAP AVALPPTHKKQYQQIISRLFHRHGGPGPGGPEPELSPITEGSEARAGPPAPAPPAPIPPP APSQSSPPEQPQSMEMRSVLRKAGSPRKARRARLNPLVLLLDAALTGELEVVQQAVKEVS AAEREMNDPSQPNEEGITALHNAICGANYSIVDFLITAGANVNSPDSHGWTPLHCAASCN DTVICMALVQHGAAIFATTLSDGATAFEKCDPYREGYADCATYLADVEQSMGLMNSGAVY ALWDYSAEFGDELSFREGESVTVLRRDGPEETDWWWAALHGQEGYVPRNYFGSQPPSLFR AHAQYPDWLCPSGLTGRLNVDGLLVYFPYDYIYPEQFSYMRELKRTLDAKGHGVLEMPSG TGKTVSLLALIMAYQRAYPLEVTKLIYCSRTVPEIEKVIEELRKLLNFYEKQEGEKLPFL GLALSSRKNLCIHPEVTPLRFGKDVDGKCHSLTASYVRAQYQHDTSLPHCRFYEEFDAHG REVPLPAGIYNLDDLKALGRRQGWCPYFLARYSILHANVVVYSYHYLLDPKIADLVSKEL ARKAVVVFDEAHNIDNVCIDSMSVNLTRRTLDRCQGNLETLQKTVLRIKETDEQRLRDEY RRLVEGLREASAARETDAHLANPVLPDEVLQEAVPGSIRTAEHFLGFLRRLLEYVKWRLR VQHVVQESPPAFLSGLAQRVCIQRKPLRFCAERLRSLLHTLEITDLADFSPLTLLANFAT LVSTYAKGFTIIIEPFDDRTPTIANPILHFRWDPARCMDASLAIKPVFERFQSVIITSGT LSPLDIYPKILDFHPVTMATFTMTLARVCLCPMIIGRGNDQVAISSKFETREDIAVIRNY GNLLLEMSAVVPDGIVAFFTSYQYMESTVASWYEQGILENIQRNKLLFIETQDGAETSVA LEKYQEACENGRGAILLSVARGKVSEGIDFVHHYGRAVIMFGVPYVYTQSRILKARLEYL 
RDQFQIRENDFLTFDAMRHAAQCVGRAIRGKTDYGLMVFADKRFARGDKRGKLPRWIQEH LTDANLNLTVDEGVQVAKYFLRQMAQPFHREDQLGLSLLSLEQLESEETLKRIEQIAQQL >gi568815579r:45251632_45470232|GENSCAN_predicted_CDS_5|4863_bp atgcaggcgcccgctccggccggcaccatggacagcgaggcattccagagcgcgcgggac tttctggacatgaacttccagtcgctggccatgaaacacatggatctgaagcagatggag ctggacacggcggcggccaaggtggatgaactgaccaagcagctggagtcgctgtggtca gactctcccgcgcctcctggcccgcaggccggacccccttctaggccgccccggtacagc tccagctcgatccctgagcccttcggcagccgagggtccccccggaaggcggccaccgac ggcgcagacaccccgttcggacgatcagagagtgccccaaccctacacccctacagcccg ctgtcccccaagggacggccgtcgtcgccgcgcaccccgctctacctgcagccggacgcc tacggcagcctggaccgcgcgacctcgccccggccccgcgccttcgatggcgcaggcagc tccctcggccgtgcgccctccccgcggcccgggccaggcccgctccgccagcagggtccc cccacgcctttcgacttcctgggccgcgcaggctccccccgcggcagccccctggcggag gggccccaggccttcttccccgagcgtgggccgtcaccgcgcccccctgccacagcctac gacgcgccagcgtccgccttcgggagctccctgctaggctccggcggcagcgcattcgcc ccgcctctgcgcgcgcaagacgacctgacgctgcgccggcggcctccgaaagcctggaac gagtctgacctggacgtggcgtacgagaagaagccttcgcagacagcgagctatgaacgc ctggacgtcttcgcaaggcctgcctcgccgagcctgcagctgttgccttggagggagagc agcctggatggactggggggcaccggcaaggacaacctcactagcgccaccctgccgcgc aattacaaggtctctcctctggccagcgaccggcgttcagacgcgggcagctaccggcgc tcgctgggctccgcggggccgtcgggcactttgcctcgcagctggcagcccgtcagccgc atccccatgcccccctccagcccccagccccgcggggccccgcgccagcgtcccatcccc ctcagcatgatcttcaagctgcagaacgccttctgggagcacggggccagccgcgccatg ctccctgggtcccccctcttcacccgagcacccccgcctaagctgcagccccaaccacaa ccacagccccagccacaatcacaaccacagccccagctgcccccacagccccagacccaa ccccaaacccctaccccagccccccagcatccccaacagacatggccccctgtgaacgaa ggaccccccaaaccccccaccgagctggagcctgagccggagatagaggggctgctgaca ccagtgctggaggctggcgatgtggatgaaggccctgtagcaaggcctctcagccccacg aggctgcagccagcactgccaccggaggcacagtcggtgcccgagctggaggaggtggca cgggtgttggcggaaattccccggcccctcaaacgcaggggctccatggagcaggcccct gctgtggccctgccccctacccacaagaaacagtaccagcagatcatcagccgcctcttc catcgtcatggggggccagggcccggggggccggagccagagctgtcccccatcactgag 
ggatctgaggccagggcagggccccctgctcctgccccaccagctcccattccacccccg gccccgtcccagagcagcccaccagagcagccgcagagcatggagatgcgctctgtgctg cggaaggcgggctccccgcgcaaggcccgccgcgcgcgcctcaaccctctggtgctcctc ctggacgcggcgctgaccggggagctggaggtggtgcagcaggcggtgaaggaggtgagc gctgctgagcgggagatgaacgacccgagccagcccaacgaggagggcatcactgccttg cacaacgccatctgcggcgccaactactctatcgtggatttcctcatcaccgcgggtgcc aatgtcaactcccccgacagccacggctggacacccttgcactgcgcggcgtcgtgcaac gacacagtcatctgcatggcgctggtgcagcacggcgctgcaatcttcgccaccacgctc agcgacggcgccaccgccttcgagaagtgcgacccttaccgcgagggttatgctgactgc gccacctacctggcagacgtcgagcagagtatggggctgatgaacagcggggcagtgtac gctctctgggactacagcgccgagttcggggacgagctgtccttccgcgagggcgagtcg gtcaccgtgctgcggagggacgggccggaggagaccgactggtggtgggccgcgctgcac ggccaggagggctacgtgccgcggaactacttcgggagtcagccgccctcgcttttccgt gcgcacgcgcagtatcccgattggctctgccctagcggattgacgggcaggctcaacgtg gacgggctcctggtctacttcccgtacgactacatctaccccgagcagttctcctacatg cgggagctcaaacgcacgctggacgccaagggtcatggagtcctggagatgccctcaggc accgggaagacagtatccctgttggccctgatcatggcataccagagagcatatccgctg gaggtgaccaaactcatctactgctcaagaactgtgccagagattgagaaggtgattgaa gagcttcgaaagttgctcaacttctatgagaagcaggagggcgagaagctgccgtttctg ggactggctctgagctcccgcaaaaacttgtgtattcaccctgaggtgacacccctgcgc tttgggaaggacgtcgatgggaaatgccacagcctcacagcctcctatgtgcgggcgcag taccagcatgacaccagcctgccccactgccgattctatgaggaatttgatgcccatggg cgtgaggtgcccctccccgctggcatctacaacctggatgacctgaaggccctggggcgg cgccagggctggtgcccatacttccttgctcgatactcaatcctgcatgccaatgtggtg gtttatagctaccactacctcctggaccccaagattgcagacctggtgtccaaggaactg gcccgcaaggccgtcgtggtcttcgacgaggcccacaacattgacaacgtctgcatcgac tccatgagcgtcaacctcacccgccggacccttgaccggtgccagggcaacctggagacc ctgcagaagacggtgctcaggatcaaagagacagacgagcagcgcctgcgggacgagtac cggcgtctggtggaggggctgcgggaggccagcgccgcccgggagacggacgcccacctg gccaaccccgtgctgcccgacgaagtgctgcaggaggcagtgcctggctccatccgcacg gccgagcatttcctgggcttcctgaggcggctgctggagtacgtgaagtggcggctgcgt gtgcagcatgtggtgcaggagagcccgcccgccttcctgagcggcctggcccagcgcgtg 
tgcatccagcgcaagcccctcagattctgtgctgaacgcctccggtccctgctgcatact ctggagatcaccgaccttgctgacttctccccgctcaccctccttgctaactttgccacc cttgtcagcacctacgccaaaggcttcaccatcatcatcgagccctttgacgacagaacc ccgaccattgccaaccccatcctgcacttcaggtgggaccctgcccgctgcatggacgcc tcgctggccatcaaacccgtatttgagcgtttccagtctgtcatcatcacatctgggaca ctgtccccgctggacatctaccccaagatcctggacttccaccccgtcaccatggcaacc ttcaccatgacgctggcacgggtctgcctctgccctatgatcatcggccgtggcaatgac caggtggccatcagctccaaatttgagacccgggaggatattgctgtgatccggaactat gggaacctcctgctggagatgtccgctgtggtccctgatggcatcgtggccttcttcacc agctaccagtacatggagagcaccgtggcctcctggtatgagcaggggatccttgagaac atccagaggaacaagctgctctttattgagacccaggatggtgccgaaaccagtgtcgcc ctggagaagtaccaggaggcctgcgagaatggccgcggggccatcctgctgtcagtggcc cggggcaaagtgtccgagggaatcgactttgtgcaccactacgggcgggccgtcatcatg tttggcgtcccctacgtctacacacagagccgcattctcaaggcgcggctggaatacctg cgggaccagttccagattcgtgagaatgactttcttaccttcgatgccatgcgccacgcg gcccagtgtgtgggtcgggccatcaggggcaagacggactacggcctcatggtctttgcc gacaagcggtttgcccgtggggacaagcgggggaagctgccccgctggatccaggagcac ctcacagatgccaacctcaacctgaccgtggacgagggtgtccaggtggccaagtacttc ctgcggcagatggcacagcccttccaccgggaggatcagctgggcctgtccctgctcagc ctggagcagctagaatcagaggagacgctgaagaggatagagcagattgctcagcagctc tga >gi568815579r:45251632_45470232|GENSCAN_predicted_peptide_6|791_aa MRSGPGQVLYIPGQGSWAGRALAPSSPAGSGPAAARFFRAKDPVDFLEESSSVGTGSESQ TELGTPRLDSYSPVGLRKSSVAAPLRRRCNKPYWTDDGGDPSGPDGPLTPAIQRISGFRE YIFRKKTSSSVFCSAHVEIFPSFSCRSGVEQCLPPRSRTSWALPDDALPLWIHVVCNLVR AARATGLPEVWVPGWRSPRPAVRVRVDGVRRVRWWKEKGASERVRAEKEAYLQAGLAKSV HSQWANGNSNGERVILWGATRGEQGAPRGWIGEIEVAPGEQVGRKEKPDAARFSCPPNFT AKPPASESPRFSLEALTGPDTELWLIQAPADFAPECFNGRHVPLSGSQIVKGKLAGKRHR YRVLSSCPQAGEATLLAPSTEAGGGLTCASAPQGTLRILEGPQQSLSGSPLQPIPASPPP QIPPGLRPRFCAFGGNPPVTGPRSALAPNLLTSGKKKKEMQVTEAPVTQEAVNGHGALEV DMALGSPEMDVRKKKKKKNQQLKEPEAAGPVGTEPTVETLEPLGVLFPSTTKKRKKPKGK ETFEPEDKTVKQEQINTEPLEDTVLSPTKKRKRQKGTEGMEPEEGVTVESQPQVKVEPLE EAIPLPPTKKRKKEKGQMAMMEPGTEAMEPVEPEMKPLESPGGTMAPQQPEGAKPQAQAA 
LAAPKKKTKKEKQQDATVEPETEVVGPELPDDLEPQAAPTSTKKKKKKKERGHTVTEPIQ PLEPELPGEGQPEARATPGSTKKRKKQSQESRMPETVPQEEMPGPPLNSESGEEAPTGRD KKRKQQQQQPV >gi568815579r:45251632_45470232|GENSCAN_predicted_CDS_6|2376_bp atgcgcagcgggcccgggcaggtgctgtacatcccggggcaagggagctgggccgggcgg gctctagctccgagctctcccgcgggctctgggccagccgcagccaggttctttagggct aaggatcctgtggacttcctggaggagtcatcttcagtaggaaccgggtcagagagccag actgagctgggaacacccaggctggactcctacagccctgtcgggcttaggaagagcagt gtggctgcccctttaaggaggcgttgcaacaaaccatattggacagacgatgggggcgac ccatcgggacccgacgggcctctgactccagcaatacagcgaatcagcggctttcgggaa tacatttttcggaaaaagacttcttcctcggttttctgctctgcacacgttgaaattttc cccagtttttcctgcagatcgggagtcgagcaatgcctacccccgcgctcccgcaccagt tgggcgctcccggatgatgccctacccctttggatccacgtggtctgcaacctggtgcga gcagcccgggctacagggttgcctgaggtgtgggtcccaggatggaggagccccaggccg gcggtgagggtgcgggttgacggggtgcggagggtgcgttggtggaaggagaaaggggcg tccgagagggttcgggcggaaaaggaggcgtacctgcaagcaggacttgcgaagagcgtg cattcccagtgggcgaacgggaattcgaacggagagagggttatcttgtggggggctacc cgtggagagcaaggcgcccccaggggttggatcggtgaaattgaggtcgcccctggggaa caggtgggcagaaaggagaaaccagatgctgctcggttctcttgtccccccaactttacc gcgaagcccccagcctcagagtcccctcgtttctccttggaggcgctgacgggtccagat acggagctgtggcttattcaggcccctgcagactttgccccagaatgcttcaatgggcgg catgtgcctctctctggctcccagatcgtcaagggcaaattggcaggcaagcggcaccgc tatcgagtcctcagcagctgtccccaagctggagaagcgaccctgctggccccctcaacg gaggcaggaggtggactcacctgtgcctcagccccccagggcaccctaaggatccttgag ggtccccagcaatccctgtcagggagccctctgcagcccatcccagcaagtcccccacca cagatccctcctggcctgaggcctcggttctgtgcctttgggggcaacccaccagtcaca gggcctaggtcagccttggcccccaacctgctcacctcagggaagaagaaaaaggagatg caggtgacagaggccccagtcactcaggaggcagtgaatgggcacggggccctggaggtg gacatggctttggggtcgccagaaatggatgtgcggaagaagaagaagaaaaaaaatcag cagctgaaagaaccagaggcagcagggcctgtggggacagagcccacagtggagacactg gagcctctgggagtgctgttcccgtccaccaccaagaagaggaagaagcccaaagggaaa gaaaccttcgagccagaagacaagacagtgaagcaggaacagattaacactgagcctcta gaagacacagtcctgtccccgaccaaaaagagaaagaggcaaaaggggacggaagggatg 
gagccagaggagggggtgacagttgagtctcagccacaggtgaaggtggagccactggag gaagccatccctctgccccctacgaagaagaggaaaaaagaaaagggacagatggcaatg atggagccagggacggaggcgatggagccagtggagccggagatgaagcctctggagtcc ccaggggggaccatggcgcctcaacagccagaaggagcgaagcctcaggcccaggcagct ctggcagctcccaaaaagaagacgaagaaagaaaaacagcaagatgccacagtggagcca gagacagaggtggtggggcctgagctgccggatgaccttgagcctcaggcagctcccaca tccaccaagaagaagaagaagaagaaagagagaggtcacacagtgactgagccaattcag ccactagagcctgaactgccaggggagggacagcctgaagccagggcaactccgggatcc accaagaagaggaagaagcagagtcaggaaagccggatgccagagacagtgccccaagag gagatgccagggccgccactgaattcagagtctggggaggaggctcccacaggccgggac aagaagcggaagcagcagcagcagcagcctgtgtag >gi568815579r:45251632_45470232|GENSCAN_predicted_peptide_7|410_aa MYMQARINKKINEHNRNVRNRRVCMGIWKPFRTPGLSQALPLTPIPRLSRDLPSQTCPFW VSTGNVSVSLTEPLQMDPGKDKEGVPQPSGPPARKKFVIPLDEDEVPPGVAKPLFRSTQS LPTVDTSAQAAPQTYAEYAISQPLEGAGATCPTGSEPLAGETPNQALKPGAKSNSIIVSP RQRGNPVLKFVRNVPWEFGDVIPDYVLGQSTCALFLSLRYHNLHPDYIHGRLQSLGKNFA LRVLLVQVDVVSNSDFLLQKDPQQALKELAKMCILADCTLILAWSPEEAGRYLETYKAYE QKPADLLMEKLEQDFVSRVTECLTTVKSVNKTDSQTLLTTFGSLEQLIAASREDLALCPG LGPQKAITNGGQAITNGGEDVKKREPSYTFDGNVNQYNHYGKQFGGSSKN >gi568815579r:45251632_45470232|GENSCAN_predicted_CDS_7|1233_bp atgtacatgcaggcaagaataaataaaaagatcaatgaacataacagaaatgtcagaaat agacgtgtgtgtatgggaatttggaagcccttccggactccggggctgagtcaggcgctc cccctgacccccatcccacggctctcccgggatctcccatcccagacctgcccattctgg gtcagcactgggaacgtgtctgtgtcccttactgaaccgctccagatggaccctgggaag gacaaagagggggtgccccagccctcagggccgccagcaaggaagaaatttgtgataccc ctcgacgaggatgaggtccctcctggagtggccaagcccttattccgatctacacagagc cttcccactgtggacacctcggcccaggcggcccctcagacctacgccgaatatgccatc tcacagcctctggaaggggctggggccacgtgccccacagggtcagagcccctggcagga gagacgcccaaccaggccctgaaacccggggcaaaatccaacagcatcattgtgagccct cggcagaggggcaatcccgtactgaagttcgtgcgcaatgtgccctgggaatttggcgac gtaattcccgactatgtgctgggccagagcacctgtgccctgttcctcagcctccgctac cacaacctgcacccagactacatccatgggcggctgcagagcctggggaagaacttcgcc ttgcgggtcctgcttgtccaggtggatgtggtttctaattctgattttctcctccagaaa 
gatccccagcaggccctcaaggagctggctaagatgtgtatcctggccgactgcacattg atcctcgcctggagccccgaggaagctgggcggtacctggagacctacaaggcctatgag cagaaaccagcggacctcctgatggagaagctagagcaggacttcgtctcccgggtgact gaatgtctgaccaccgtgaagtcagtcaacaaaacggacagtcagaccctcctgaccaca tttggatctctggaacagctcatcgccgcatcaagagaagatctggccttatgcccaggc ctgggccctcagaaagcaataacaaatggtggacaggcaataacaaatggtggcgaggat gtgaagaaaagggaaccctcatatacttttgatgggaatgtaaatcagtacaaccactat ggaaaacagtttggaggctcctctaaaaactga >gi568815579r:45251632_45470232|GENSCAN_predicted_peptide_8|251_aa MSLAYAKPITEMKWLESFSTQTSPIMYHENLASGQLCAEEGVEGAREGSVRQPAGCLPWT AAQSVCRRERVVQEPESPWEKPQPHTWPLTSPGQGTPGSLDKLTESQAEFSHPGHVACCD PPQHPRVAVWLRGLGRAHRGRLHRNFAIVGTGRCSFPELPRTAYFEDSLSSPGTPTAHPG LAPYFPNPAIALASRRPQRGHRGPPVPREMFQAFPGDYDSGSRCSSSPSAESQYLSSVDS FGSPPTAAASQ >gi568815579r:45251632_45470232|GENSCAN_predicted_CDS_8|753_bp atgtccctggcctatgcaaagcccatcacagaaatgaagtggctggaatcattcagcact cagacctcacctattatgtaccatgagaacctggcctctggacagctctgtgctgaagag ggcgtggagggggccagggaagggagtgtcaggcagccagccggctgcctgccctggaca gcagcccagagtgtctgcaggagggagagggtagttcaggagcctgagtcaccctgggag aaaccccagccacatacctggccgctgacatcacccggccagggcacccccggcagccta gacaagctgactgaatcacaggcggaattcagccaccccgggcacgtggcctgctgtgac cccccgcaacacccccgagtggccgtctggctgcgggggttgggccgggcacacagggga agactgcacagaaactttgccattgttggaacgggacgttgctccttccccgagcttccc cggacagcgtactttgaggactcgctcagctcaccggggactcccacggctcaccccgga cttgcaccttacttccccaacccggccatagccttggcttcccggcgacctcagcgtggt cacaggggcccccctgtgcccagggaaatgtttcaggctttccccggagactacgactcc ggctcccggtgcagctcctcaccctctgccgagtctcaatatctgtcttcggtggactcc ttcggcagtccacccaccgccgccgcctcccag