GENSCAN 1.0 Date run: 3-Nov-116 Time: 04:46:46 Sequence gi568815587f:117238587_117512165 : 273579 bp : 48.47% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 216 525 310 0 1 52 53 140 0.605 3.22 1.02 Intr + 1215 1274 60 2 0 90 111 55 0.984 6.93 1.03 Intr + 5859 5999 141 2 0 75 68 85 0.973 5.75 1.04 Intr + 8223 8362 140 2 2 13 88 206 0.903 12.36 1.05 Intr + 41322 41502 181 0 1 81 74 75 0.378 5.27 1.06 Intr + 41585 41673 89 1 2 67 108 55 0.981 4.17 1.07 Intr + 42728 42818 91 2 1 61 86 51 0.932 2.20 1.08 Intr + 43014 43112 99 2 0 93 94 5 0.736 1.91 1.09 Intr + 43308 43684 377 2 2 132 97 160 0.639 15.11 1.10 Intr + 43818 43950 133 0 1 103 94 179 0.999 20.65 1.11 Intr + 44160 44309 150 2 0 73 78 58 0.735 3.66 1.12 Term + 49492 49611 120 0 0 36 54 72 0.138 -2.93 1.13 PlyA + 49731 49736 6 1.05 2.11 PlyA - 50437 50432 6 -0.45 2.10 Term - 51221 50980 242 0 2 77 55 381 0.945 29.89 2.09 Intr - 52073 51902 172 2 1 59 81 331 0.999 29.02 2.08 Intr - 52463 52314 150 2 0 91 91 98 0.999 10.76 2.07 Intr - 53227 53126 102 1 0 115 75 145 0.470 16.27 2.06 Intr - 54602 54468 135 2 0 102 94 221 0.999 24.86 2.05 Intr - 55422 55285 138 0 0 78 101 122 0.984 13.16 2.04 Intr - 56761 56545 217 0 1 103 76 369 0.996 35.71 2.03 Intr - 58623 58449 175 1 1 73 41 154 0.716 8.20 2.02 Intr - 63375 63303 73 0 1 41 84 31 0.023 -3.02 2.01 Init - 77209 76949 261 1 0 115 103 555 0.918 54.76 2.00 Prom - 82591 82552 40 -6.06 3.00 Prom + 83057 83096 40 -2.36 3.01 Init + 100001 100082 82 1 1 77 87 108 0.924 10.73 3.02 Intr + 105580 105691 112 2 1 119 115 133 0.998 18.54 3.03 Intr + 113204 113402 199 2 1 89 105 148 0.995 15.95 3.04 Intr + 113837 113966 130 1 1 64 52 33 0.359 -2.43 3.05 Intr + 116413 118031 1619 2 2 78 96 725 0.448 60.22 3.06 Intr + 123249 123407 159 2 0 71 119 4 0.608 1.98 3.07 Intr + 123818 123952 135 1 0 54 95 85 0.319 6.56 3.08 Intr + 124843 124920 78 0 0 93 94 31 0.771 3.95 3.09 Intr + 132494 132880 387 1 0 97 108 348 0.999 32.69 3.10 Intr + 135165 135245 81 2 0 92 105 56 0.994 7.53 3.11 Intr + 137122 137205 84 0 0 96 109 73 0.998 10.22 3.12 Intr + 142028 142119 92 1 2 103 66 128 0.708 10.89 3.13 Intr + 143115 143282 168 0 0 73 94 147 0.998 12.86 3.14 Intr + 144210 144356 147 0 0 46 50 119 0.452 3.25 3.15 Intr + 148617 148826 210 0 0 68 34 240 0.981 14.63 3.16 Intr + 152191 152322 132 1 0 92 111 186 0.999 21.06 3.17 Intr + 152413 152629 217 1 1 74 55 202 0.999 14.01 3.18 Intr + 153640 153717 78 0 0 128 94 101 0.999 14.45 3.19 Intr + 153910 154041 132 0 0 80 105 94 0.991 11.14 3.20 Intr + 154418 154540 123 1 0 111 55 165 0.998 16.28 3.21 Intr + 155764 155907 144 0 0 79 94 184 0.997 18.58 3.22 Intr + 156334 156417 84 0 0 102 94 73 0.993 9.32 3.23 Intr + 156961 157136 176 0 2 123 43 278 0.999 25.54 3.24 Intr + 157468 157594 127 1 1 115 53 67 0.892 6.58 3.25 Intr + 157964 158025 62 1 2 69 109 30 0.689 0.73 3.26 Intr + 158505 158727 223 0 1 65 77 287 0.894 23.43 3.27 Intr + 169339 169446 108 0 0 93 46 157 0.697 12.48 3.28 Intr + 170304 170418 115 2 1 76 84 89 0.514 7.32 3.29 Intr + 171032 171379 348 0 0 90 77 222 0.999 16.63 3.30 Intr + 172242 172308 67 1 1 85 99 66 0.997 5.36 3.31 Intr + 173209 173328 120 1 0 120 6 87 0.647 3.31 3.32 Term + 173486 173582 97 2 1 55 54 102 0.883 0.94 3.33 PlyA + 174645 174650 6 1.05 4.25 PlyA - 176131 176126 6 1.05 4.24 Term - 181740 181529 212 1 2 57 38 119 0.206 1.26 4.23 Intr - 187712 187493 220 2 1 62 94 109 0.340 6.77 4.22 Intr - 190217 189802 416 0 2 98 37 225 0.276 12.22 4.21 Intr - 192447 192136 312 1 0 114 70 586 0.998 55.56 4.20 Intr - 193142 192948 195 0 0 78 69 226 0.998 19.09 4.19 Intr - 193918 193766 153 2 0 101 36 254 0.891 21.54 4.18 Intr - 194670 194552 119 2 2 111 92 194 0.998 22.21 4.17 Intr - 194885 194855 31 0 1 128 98 12 0.981 3.39 4.16 Intr - 197213 197058 156 0 0 84 59 270 0.999 23.68 4.15 Intr - 198823 198536 288 2 0 84 73 432 0.998 38.52 4.14 Intr - 199497 199309 189 1 0 109 60 435 0.918 42.36 4.13 Intr - 200397 200299 99 1 0 77 78 76 0.838 5.58 4.12 Intr - 200843 200680 164 1 2 91 100 332 0.999 34.32 4.11 Intr - 201350 201233 118 0 1 64 89 113 0.914 8.52 4.10 Intr - 205453 205300 154 1 1 89 76 320 0.960 30.55 4.09 Intr - 212102 211963 140 0 2 88 131 121 0.642 16.58 4.08 Intr - 220323 220168 156 1 0 41 113 302 0.932 27.98 4.07 Intr - 223010 222864 147 0 0 111 84 283 0.979 30.41 4.06 Intr - 226596 226356 241 0 1 70 77 594 0.998 53.62 4.05 Intr - 231394 231324 71 2 2 75 115 58 0.987 6.10 4.04 Intr - 233450 233283 168 0 0 89 105 292 0.891 30.92 4.03 Intr - 241985 241857 129 0 0 90 96 284 0.978 30.07 4.02 Intr - 242684 242588 97 2 1 73 77 105 0.985 7.48 4.01 Init - 243568 243377 192 1 0 82 105 466 0.582 46.67 4.00 Prom - 244222 244183 40 -10.55 5.00 Prom + 245549 245588 40 -7.56 5.01 Sngl + 247629 248018 390 2 0 99 42 308 0.389 23.52 5.02 PlyA + 249812 249817 6 1.05 6.09 PlyA - 250217 250212 6 1.05 6.08 Term - 254654 254363 292 1 1 68 38 176 0.383 5.42 6.07 Intr - 255713 255469 245 2 2 90 53 25 0.196 -4.50 6.06 Intr - 259853 259720 134 0 2 56 103 44 0.510 3.06 6.05 Intr - 265435 265259 177 2 0 75 70 379 0.999 34.69 6.04 Intr - 266457 266338 120 1 0 103 98 281 0.999 31.07 6.03 Intr - 267146 266868 279 0 0 74 77 480 0.824 42.95 6.02 Intr - 267527 267442 86 1 2 77 105 2 0.649 0.26 6.01 Init - 269748 269669 80 0 2 67 86 44 0.258 2.63 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:117238587_117512165|GENSCAN_predicted_peptide_1|630_aa XGSQSDLKDVASTAGEEGDTSLRESLHPVTRSLKAGCHTKQLASRNCSEEKSPQTSILKE GNRDTSLDFRPVVSPANGVEGVRVDQDDDQDSSSLKLSQNIAVQTDFKTADSEVNTDQDI EKNLDKMMTERTLLKERYQEVLDKQRQVENQLQVQLKQLQQRREEEMKNHQEILKAIQDV TIKREETKKKIEKEKKEFLQKEQDLKAEIEKLCEKGRREVWEMELDRLKNQDGEINRNIM EETERAWKAEVRRGADILKVLVSGFQMVSTFGIVIGEPILSLESRKELLVLKLEEAEKEA ELHLTYLKSTPPTLETVRSKQEWETRLNGVRIMKKNVRDQFNSHIQLVRNGAKLSSLPQI PTPTLPPPPSETDFMLQVFQPSPSLAPRMPFSIGQVTMPMVMPSADPRSLSFPILNPALS QPSQPSSPLPGSHGRNSPGLGSLVSPHGPHMPPAASIPPPPGLGGVKASAETPRPQPVDK LEKILEKLLTRFPQCNKAQMTNILQQIKTARTTMAGLTMEELIQLVAARLAEHERVAAST QPLGRIRALFPAPLAQISTPMFLPSAQVSYPGRSSHVRLFFFEHCDMEKLEPKNQPGYPR TIRSGEGKNRIGEKDDLHQGALVVEDIRKP >gi568815587f:117238587_117512165|GENSCAN_predicted_CDS_1|1893_bp nntgggtctcagtctgatttgaaggatgtggccagcacagcaggagaggagggggacaca agccttcgggagagcctccatccagtcactcggtctcttaaggcagggtgccatactaag cagcttgcctccaggaattgctctgaagagaaatccccacaaacctccatcctaaaggaa ggtaacagggacacaagcttggatttccgacctgtagtgtctccagcaaatggggttgaa ggagtccgagtggatcaggatgatgatcaagatagctcttccctgaagctttctcagaac attgctgtacagactgactttaagacagctgattcagaggtaaacacagatcaagatatt gaaaagaatttggataaaatgatgacagagagaaccctgttgaaagagcgttaccaggag gtcctggacaaacagaggcaagtggagaatcagctccaagtgcaattaaagcagcttcag caaaggagagaagaggaaatgaagaatcaccaggagatattaaaggctattcaggatgtg acaataaagcgggaagaaacaaagaagaagatagagaaagagaagaaggagtttttgcag aaggagcaggatctgaaagctgaaattgagaagctttgtgagaagggcagaagagaggtg tgggaaatggaactggatagactcaagaatcaggatggcgaaataaataggaacattatg gaagagactgaacgggcctggaaggcagaggtgagaagaggagctgatatcttgaaagtc ttagtttcaggcttccagatggtatctacttttggcattgtaataggagagccaatctta tcactagagagccggaaagagttactggtactgaaactagaagaagcagaaaaagaggca gaattgcaccttacttacctcaagtcaactcccccaacactggagacagttcgttccaaa caggagtgggagacgagactgaatggagttcggataatgaaaaagaatgttcgtgaccaa tttaatagtcatatccagttagtgaggaacggagccaagctgagcagccttcctcaaatc cctactcccactttacctccacccccatcagagacagacttcatgcttcaggtgtttcaa cccagtccctctctggctcctcggatgcccttctccattgggcaggtcacaatgcccatg gttatgcccagtgcagatccccgctccttgtctttcccaatcctgaaccctgccctttcc cagcccagccagccttcctcaccccttcctggctcccatggcagaaatagccctggcttg ggttcccttgtcagcccccacggtccacacatgccccctgccgcctccatcccacctccc ccaggcttgggcggtgttaaggcttctgctgaaactccccggccccaaccagtagacaaa ctggagaagatcctggagaagctgctgacccggttcccacagtgcaataaggcccagatg accaacattcttcagcagatcaagacagcacgtaccaccatggcaggcctgaccatggag gaacttatccagttggttgctgcacgactggcagaacatgagcgggtggcagcaagtact cagccacttggtcgcatccgggccttgttccctgctccactggcccaaatcagtacccca atgttcttgccttctgcccaagtttcatatcctggaaggtcttcacatgtaagactcttt ttctttgaacactgtgatatggagaagctggagcccaagaaccagcctgggtacccacgt accattaggagcggggaggggaagaacaggataggggaaaaagatgaccttcatcaagga gctcttgtggtggaggacataaggaagccctga >gi568815587f:117238587_117512165|GENSCAN_predicted_peptide_2|554_aa MAQALPWLLLWMGAGVLPAHGTQHGIRLPLRSGLGGAPLGLRLPRETDEEPEEPGRRGSF VEMVDNLRGKSGQGYYVEMTVGSPPQTCTKQIGQVKRRCAKSSWETTVIFQVGFSADLQR AQGKCSFGPCSTTGPQVLSVVVVVTGASLDQSCIVELHSGHSKGQTVAGASSTYRDLRKG VYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITESDKFFINGSNWEGILGLAYAEI ARPDDSLEPFFDSLVKQTHVPNLFSLQLCGAGFPLNQSEVLASVGGSMIIGGIDHSLYTG SLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTTNLRLPKKVFEAAVK SIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMGEVTNQSFRITILPQQYL RPVEDVATSQDDCYKFAISQSSTGTVMGAVIMEGFYVVFDRARKRIGFAVSACHVHDEFR TAAVEGPFVTLDMEDCGYNIPQTDESTLMTIAYVMAAICALFMLPLCLMVCQWRCLRCLR QQHDDFADDISLLK >gi568815587f:117238587_117512165|GENSCAN_predicted_CDS_2|1665_bp atggcccaagccctgccctggctcctgctgtggatgggcgcgggagtgctgcctgcccac ggcacccagcacggcatccggctgcccctgcgcagcggcctggggggcgcccccctgggg ctgcggctgccccgggagaccgacgaagagcccgaggagcccggccggaggggcagcttt gtggagatggtggacaacctgaggggcaagtcggggcagggctactacgtggagatgacc gtgggcagccccccgcagacgtgcacaaaacagattggacaggtcaagagaagatgtgcc aaaagcagctgggaaactactgtgatattccaggttggtttttcagcagacctgcagagg gcacaggggaagtgttcctttggcccctgcagcaccacaggaccccaggtcctgtctgtg gtcgtggtggtaacgggtgccagcctggaccagtcctgcatcgtggagcttcactctgga cattctaaggggcagactgtggctggggcgtccagcacataccgggacctccggaagggt gtgtatgtgccctacacccagggcaagtgggaaggggagctgggcaccgacctggtaagc atcccccatggccccaacgtcactgtgcgtgccaacattgctgccatcactgaatcagac aagttcttcatcaacggctccaactgggaaggcatcctggggctggcctatgctgagatt gccaggcctgacgactccctggagcctttctttgactctctggtaaagcagacccacgtt cccaacctcttctccctgcagctttgtggtgctggcttccccctcaaccagtctgaagtg ctggcctctgtcggagggagcatgatcattggaggtatcgaccactcgctgtacacaggc agtctctggtatacacccatccggcgggagtggtattatgaggtgatcattgtgcgggtg gagatcaatggacaggatctgaaaatggactgcaaggagtacaactatgacaagagcatt gtggacagtggcaccaccaaccttcgtttgcccaagaaagtgtttgaagctgcagtcaaa tccatcaaggcagcctcctccacggagaagttccctgatggtttctggctaggagagcag ctggtgtgctggcaagcaggcaccaccccttggaacattttcccagtcatctcactctac ctaatgggtgaggttaccaaccagtccttccgcatcaccatccttccgcagcaatacctg cggccagtggaagatgtggccacgtcccaagacgactgttacaagtttgccatctcacag tcatccacgggcactgttatgggagctgttatcatggagggcttctacgttgtctttgat cgggcccgaaaacgaattggctttgctgtcagcgcttgccatgtgcacgatgagttcagg acggcagcggtggaaggcccttttgtcaccttggacatggaagactgtggctacaacatt ccacagacagatgagtcaaccctcatgaccatagcctatgtcatggctgccatctgcgcc ctcttcatgctgccactctgcctcatggtgtgtcagtggcgctgcctccgctgcctgcgc cagcagcatgatgactttgctgatgacatctccctgctgaagtga >gi568815587f:117238587_117512165|GENSCAN_predicted_peptide_3|2011_aa MAGRPLRIGDQLVLEEDYDETYIPSEQEILEFAREIGIDPIKEPELMWLAREGIVAPLPG EWKPCQDITGDIYYFNFANGQSMWDHPCDEHYRSLVIQERAKLSTSGAIKKKKKKKEKKD KKDRDPPKSSLWCEFLDVRCSHNLVVKTPVRSRFERKALGFIYSSRLLPVSSVSAPGLAD LDLDQEMQAISEGSYNKGKSPGMLGDTPWRFMGALPRKLQPLSKGQPSQIHQVFADMEKI LGRAPAQCRRELGDQQGLEKLQKATEKIYLGFSDPETEELEMRSRQQKLGTPAPQNTGLL QNMQDVVESRNQASVHSKLSEAIKGLPLKGEQHSHSLAKLSPTGPGGDKGQSPIPMSSPE EEPSLSSCSSGHMLPARKSKLLLDSSPTEDLSWQGVPGEGGSIGSGRRRREPPGLWMEQV SKLVNKDIPGSCKETEPSDPEALGASAEDLPQGLLLIPPETLASEPAQNPLLGRAPEEPP ASEKRQALGSAELPYKDQKPSLSGPDLESSSSSSSNSNLASHLGSPVLDEVNNFPWNLQS SRGSEEGMAQSDSGLRDQHFSPFLDPHMSHMQSPDEEQSESEDYSEDQRFYQHILQMVKI SRWPEGLGLPESMQDMPCRHSASTVCCMAAESSRMSSEGEHEAIRVMERDSRFLSWEPEL LEHPQEVALAPAWQEASQQAHFQPSSSTLRQGLVQQSSSRGLTTEPGKMQHLNQALGSSL APVHVPLGGLAPLRGLVDTPPSALRGSQSVSLGSSVESGRQLGELMLPSQGLKTSAYTKG LLGSIYEDKTALSLLGLGEETNEEDEEESDNQSVHSSSEPLRNLHLDIGALGGDFEYEES LRTSQPEEKKDVSLDSDAAGPPTPCKPSSPGADSSLSSAVGKGRQGSGARPGLPEKEENE KSEPKICRNLVTPKADPTGSEPAKASEKEAPEDTVDAGEEGSRREEAAKEPKKKASALEE GSSDASQELEISEHMKEPQLSDSIASDPKSFHGLDFGFRSRISEHLLDVDVLSPVLGGAC RQAQQPLGIEDKDDSQSSQDELQSKQSKGLEERLSPPLPHEERAQSPPRSLATEEEPPQG PEGQPEWKEAEELGEDSAASLSLQLSLQREQAPSPPAACEKGKEQHSQAEELGPGQEEAE DPEEKVAVSPTPPVSPEVRSTEPVAPPEQLSEAALKAMEEAVAQVLEQDQRHLLESKQEK MQQLREKLCQEEEEEILRLHQQKEQSLSSLRERLQKAIEEEEARMREEESQRLSWLRAQV QSSTQADEDQIRAEQEASLQKLREELESQQKAERASLEQKNRQMLEQLKEEIEASEKSEQ AALNAAKEKALQQLREQLEGERKEAVATLEKEHSAELERLCSSLEAKHREVVSSLQKKIQ EAQQKEEAQLQKCLGQVEHRVHQKSYHVAGYEHELSSLLREKRQEVEGEHERRLDKMKEE HQQVMAKAREQYEAEERKQRAELLGHLTGELERLQRAHERELETVRQEQHKRLEDLRRRH REQERKLQDLELDLETRAKDVKARLALLEVQEATATHQQLEEAQKEHTHLLQSNQQLREI LDELQARKLKLESQVDLLQAQSQQLQKHFSSLEAEAQKKQHLLREVTVEENNASPHFEPD LHIEDLRKSLGTNQTKEVSSSLSQSKEDLYLDSLSSHNVWHLLSAEGVALRSAKEFLVQQ TRSMRRRQTALKAAQQHWRHELASAQEVAKDPPGIKALEDMRKNLEKETRHLDEMKSAMR KGHNLLKKKEEKLNQLESSLWEEASDEGTLGGSPTKKAVTFDLSDMDSLSSESSESFSPP HLDSTPSLTSRKIHGLSHSLRQISSQLSSVLSILDSLNPQSPPPLLASMPAQLPPRDPKS TPTPTYYGSLARFSALSSATPTSTQWAWDSGQGPRLPSSVAQTVDDFLLEKWRKYFPSGI PLLSNSPTPLESRLGYMSASEQLRLLQHSHSQVPEAGSTTFQGIIEANRRWLERVKNDPR PLFSSTPKPKATLSLLQLGLDEHNRVKVYRF >gi568815587f:117238587_117512165|GENSCAN_predicted_CDS_3|6036_bp atggctggacgacccctccgcataggagatcagctggttctggaagaagattatgatgag acctacattcctagtgagcaagaaattcttgaatttgcccgggagattggtattgatccc atcaaggaaccagaactgatgtggctggcgcgagagggcatcgtggccccactgcctgga gagtggaaaccatgccaggacatcacaggtgacatttactatttcaacttcgccaacggg cagtctatgtgggaccatccatgtgacgaacactatcggagcttggtgatccaagagcgg gcaaagctgtcaacttctggggccattaagaagaagaaaaaaaaaaaggaaaagaaagac aagaaggacagagacccccccaaaagttcgctgtggtgtgaattcctagatgtcagatgt agccataatttagttgtcaaaacaccagtgcgttcaagatttgaaagaaaagccttgggc tttatctactcatctagattgttgccagtgtcttcagtgagtgcacctgggcttgctgat ctggacctagatcaagagatgcaggctataagtgagggatcctataacaaaggaaagagc ccaggcatgctgggtgacactccctggcgtttcatgggtgcccttcccagaaagctgcag ccactctccaaaggccaaccttcccaaatccaccaggtctttgctgacatggagaaaatc ttaggcagggccccagcccaatgcaggagagaattaggtgatcagcagggtctggagaaa ctccagaaggcgacagagaaaatctacctggggttttcagatcctgaaacagaagagctg gaaatgagaagcaggcagcagaaactgggcactccagcccctcagaacactgggcttctc caaaatatgcaggatgtggtagaaagcaggaaccaggcctctgtccactcaaagctttct gaagccatcaaaggcctgccactgaaaggggagcagcacagccacagcttagccaaactg agccccactggccctggaggggacaagggccaaagccccattcccatgtcatcccctgag gaggagccctccctgtcctcttgttcttccggccacatgttgcctgccaggaagagcaag ttgttgttagatagcagccctactgaagacctgagctggcagggagttcctggggaaggt gggagcataggcagcgggaggaggaggagagaacctcctggactgtggatggaacaggtc tccaagcttgtcaataaggatatcccaggaagctgcaaggaaacagagcccagtgaccct gaggctctaggggcctcagctgaagatctacctcagggtctcttgctaataccacctgaa accctggcatcagaaccagctcagaatccccttttagggagggctcccgaggagcctcct gccagtgagaagagacaggccctagggtctgcagagctcccttataaagatcagaagcct agcttgtctgggcctgacttggagagcagcagcagcagcagcagcaacagcaacctggcc tcacacctgggctctcctgtcctggatgaggtgaacaacttcccttggaacctgcagagc tcacggggatctgaggagggtatggctcagtcagactcgggtctcagagatcaacacttc agccccttcttagatcctcacatgtcccacatgcagagccctgacgaggagcagtcagaa agtgaagactactctgaggaccagaggttctaccagcacatcctgcagatggtcaagatc tccaggtggccggagggcctggggctgcctgagagcatgcaggacatgccgtgcagacac agcgccagcacagtctgttgcatggcagctgagtcttctaggatgtctagtgagggtgag cacgaggccatcagagtcatggagagggactcgaggtttctgtcatgggagccagagctg ctggaacatcctcaggaggtggccctcgcccctgcttggcaagaggcctctcagcaagcc catttccagccaagcagcagcaccctcaggcaggggctagtccagcagagctccagcaga gggcttactacagagccaggcaagatgcagcatctcaaccaggccttgggttcctcatta gccccagttcatgttcctcttgggggcctggctcctttacgaggtcttgtggatacccca ccctctgctcttcgtggatctcaaagcgtgagcctggggagctcagtggagtctggacgt cagcttggagaactcatgctgccttcacagggtctcaagacctctgcttatacaaagggt ctcttgggctccatatatgaggacaagactgctctcagcctcttgggtttaggagaagaa accaatgaggaggatgaggaggaaagtgacaaccagagtgtccacagctcaagtgagcct cttaggaacctacacctggacattggggcactggggggtgactttgagtatgaggagtct ctgagaacaagccagccagaggagaagaaggatgtttctctggattcagatgctgccggt ccccctactccctgcaagccctccagcccaggtgcagacagcagtctgagcagtgctgtt ggcaaagggcgacagggaagtggagcaagacctggtcttccagaaaaagaggaaaatgag aagagtgaacctaagatttgcaggaatctggtgacccccaaggcagaccctacaggcagt gagcctgccaaagcctctgaaaaggaagcaccagaggacacagtagatgcaggagaggag ggttccaggagggaagaggcagccaaggagccaaagaagaaggcttctgctctggaagag ggcagttcagacgccagccaagaactggaaattagtgaacacatgaaggaaccacagctc tcagactccatagcttctgaccccaagtccttccatggcctggacttcggttttcgcagc cggatctcggagcacctgctggatgttgatgtgctttccccagtcctgggtggagcttgt cggcaggcccagcaaccactgggaatagaagacaaggatgacagccagtccagccaagat gagctgcagagcaagcagtccaaaggcctggaggagaggttatctcctccacttccacac gaggagcgggcccagagtccccctcgcagcctggccactgaagaagagcctccccagggc cccgaggggcagcccgagtggaaggaggcagaggagcttggggaggactctgcagccagc ctcagcctgcagctgtccctccagagggagcaggccccaagcccacctgctgcctgtgag aagggcaaggagcagcattcccaggccgaggagctgggccctgggcaggaagaggcagag gatcctgaggagaaggtggcggtcagccccaccccgccagtctctccagaggtgcgatcc acagagcctgtggctcccccagagcagctctcagaggctgcactaaaggccatggaagag gcagtggcccaagtactcgagcaagaccagaggcacctgctggaatccaagcaagagaag atgcagcaactgcgggagaagctgtgccaagaggaggaagaggagatcctccggcttcac cagcagaaagagcaatctctcagttccttgagggagcggctgcagaaagccattgaggag gaggaggcccggatgagagaggaggaaagccagaggctatcctggctccgagctcaggtc cagtccagcacacaagcagatgaggaccaaatcagggctgagcaagaggcttccctgcag aaactgagagaagagttggagtctcaacagaaggctgagagggccagcttggaacagaaa aataggcaaatgctggagcagctcaaggaagagatagaggcttcggagaagagcgagcag gctgccctgaatgctgcaaaggagaaggctctgcagcagctgagggagcagctggaaggg gagaggaaagaagctgtggcaacgctggagaaggagcacagtgctgagctggagcggctc tgctcctcattggaggccaagcaccgggaggtggtctccagcctccagaagaagatacag gaagctcaacagaaagaggaggcccagctgcagaagtgccttgggcaagtggagcacaga gttcaccagaagtcttatcacgtggctgggtatgagcacgagctcagcagtctcctgcga gagaagcgccaggaagtggaaggggagcatgagaggaggttggacaagatgaaggaggag caccagcaagtgatggctaaggccagagagcagtatgaagctgaggagaggaagcagcgg gctgagcttctggggcacctgaccggagagctggagcgcctgcagagggcccatgaacga gaactggagactgtgaggcaggagcaacacaagcgtcttgaggacttgcggcgccggcac agggagcaggaaaggaagctccaggatttagagttggaccttgaaaccagagctaaagat gtcaaggccagattggctctgctggaggtccaggaagccacagccacccatcagcagctg gaggaggcacagaaggagcacacccacctgttgcagtcaaaccagcagctccgagaaatt cttgatgagctgcaggcccgcaagctgaagctggagtcccaagtggatctgctgcaggct cagagccagcaactgcagaaacacttcagcagcctggaggctgaagctcaaaagaagcag cacctgttgagagaagtgacagttgaggaaaataatgcttccccacattttgagccagat ctccatattgaggacctgaggaaatcccttggaacaaaccagaccaaagaggtgtcttct tctctctcccagagcaaggaggacttatacttggacagcctgtcctcccacaatgtctgg cacctcctctctgctgagggggtagccctccgtagtgccaaggagttccttgtgcagcag acacgctccatgcggaggcggcagacagctctgaaagctgcccagcagcattggcgccat gagctggccagtgcgcaggaggtggccaaagacccaccaggcatcaaggccctggaagat atgcgcaagaacctggagaaggagaccaggcacctggatgagatgaagtcggccatgcgg aaaggccacaacctgctgaagaagaaagaggagaagctgaatcagttggagtcctctctt tgggaagaggcctcagatgagggcactctgggaggatcccccaccaagaaggcagtaacc ttcgacctcagtgacatggacagcctgagcagtgaaagttctgaatctttttccccgcct cacctcgactcaaccccgagtctcacctcccgcaagatccacgggcttagccactccctc cggcagatcagcagccagctgagcagtgtcctcagcatcctggacagcctcaaccctcag tcgccgccgccgctcctcgcctccatgccagcccagctccctccccgggaccctaagagc acccccacccccacctactatggctccctggccaggttctcagccttatcatctgctaca cccacgtccacccaatgggcctgggattcagggcaggggcccaggctcccctcctctgtg gctcaaacggtggacgacttcctgttggagaagtggcgcaagtattttccatctggcatc ccgctgctcagcaacagccccaccccgctggagagcaggctgggttacatgtctgccagt gagcagctccggctcctacagcactcccattcgcaagtccctgaggcgggcagcaccacc tttcagggcataattgaggccaaccggaggtggctggaacgtgtcaagaatgaccccaga cctctcttctcgtcaacacccaagccaaaagctactttgagcctcctgcagctgggcctt gatgagcacaacagagtgaaggtgtatcgcttctga >gi568815587f:117238587_117512165|GENSCAN_predicted_peptide_4|1388_aa MITSHPNTTIAIKGHAKELNCTARGERPIIIRWEKGDTVIDPDRVMRYAIATKDNGDEVV STLKLKPADRGDSVFFSCHAINSYGEDRGLIQLTVQEPPDPPELEIREVKARSMNLRWTQ RFDGNSIITGFDIEYKNKSDSWDFKQSTRNISPTINQANIVDLHPASVYSIRMYSFNKIG RSEPSKELTISTEEAAPDGPPMDVTLQPVTSQSIQVTWKAPKKELQNGVIRGYQIGYREN SPGSNGQYSIVEMKATGDSEVYTLDNLKKFAQYGVVVQAFNRAGTGPSSSEINATTLEDV PSQPPENVRALSITSDVAVISWSEPPRSTLNGVLKGYRVIFWSLYVDGEWGEMQNITTTR ERVELRGMEKFTNYSVQVLAYTQAGDGVRSSVLYIQTKEDVPGPPAGIKAVPSSASSVVV SWLPPTKPNGVIRKYTIFCSSPGSGQPAPSEYETSPEQLFYRIAHLNRGQQYLLWVAAVT SAGRGNSSEKVTIEPAGKAPAKIISFGGTVTTPWMKDVRLPCNSVGDPAPAVKWTKDSED SAIPVSMDGHRLIHTNGTLLLRAVKAEDSGYYTCTATNTGGFDTIIVNLLVQVPPDQPRL TVSKTSASSITLTWIPGDNGGSSIRGFVLQYSVDNSEEWKDVFISSSERSFKLDSLKCGT WYKVKLAAKNSVGSGRISEIIEAKTHGREPSFSKDQHLFTHINSTHARLNLQGWNNGGCP ITAIVLEYRPKGTWAWQGLRANSSGEVFLTELREATWYELRMRACNSAGCGNETAQFATL DYDGSTIPPIKSAQGEGDDVKKLFTIGCPVILATLGVALLFIVRKKRKEKRLKRLRDAKS LAEMLISKNNRSFDTPVKGPPQGPRLHIDIPRVQLLIEDKEGIKQLGDDKATIPVTDAEF SQAVNPQSFCTGVSLHHPTLIQSTGPLIDMSDIRPGTNPVSRKNVKSAHSTRNRYSSQWT LTKCQASTPARTLTSDWRTVGSQHGVTVTESDSYSASLSQDTDKGRNSMVSTESASSTYE ELARAYEHAKLEEQLQHAKFEITECFISDSSSDQMTTGTNENADSMTSMSTPSEPGICRF TASPPKPQDADRGKNVAVPIPHRANKSDYCNLPLYAKSEAFFRKADGREPCPVVPPREAS IRNLARTYHTQARHLTLDPASKSLGLPHPGAPAAASTATLPQRTLAMPAPPAGTAPPAPG PTPAEPPTAPSAAPPAPSTEPPRAGGPHTKMGGSRDSLLEMSTSGILGKQTRIRATSNQQ EGMKNSSEARGAEDRSSLRAPKGNAVMKIHDGSRYYVPGAACNLIADLGERYHDYHVTAE PRQGLCDKWHLQYALTGHIRAQHPGSHHQPPLRPAASDQCPARKPGLQPPWGSVGNTKGE GTWQVLQA >gi568815587f:117238587_117512165|GENSCAN_predicted_CDS_4|4167_bp atgatcacttcccaccccaacaccaccatcgccatcaagggccatgcgaaggagctaaac tgcacggcacggggtgagcggcccatcatcatccgctgggagaagggggacacagtcatc gaccctgaccgcgtcatgcggtatgccatcgccaccaaggacaacggcgacgaggtcgtc tccacactgaagctcaagcccgctgaccgtggggactctgtgttcttcagctgccatgcc atcaactcgtatggggaggaccggggcttgatccaactcactgtgcaagagccccccgac cccccagagctggagatccgggaggtgaaggcccggagcatgaacctgcgctggacccag cgattcgacgggaacagcatcatcacgggcttcgacattgaatacaagaacaaatcagat tcctgggacttcaagcagtccacacgcaacatctcccccaccatcaaccaggccaacatt gtggacttgcacccggcatctgtgtacagcatccgcatgtactctttcaacaagattggc cgcagtgaaccaagcaaggagctcaccatcagcactgaggaggccgctcccgatgggccc cccatggatgttaccttgcagccagtgacctcacagagcatccaggtgacctggaaggca cccaagaaggagctgcagaacggtgtcatccggggctaccagattggctacagagagaac agccccggcagcaacgggcagtacagcatcgtggagatgaaggccacgggggacagcgag gtctacaccctggacaacctcaagaagttcgcccagtatggggtggtggtccaagccttc aatcgggctggcacggggccctcttccagcgagatcaatgccaccactctggaggatgtg cccagccagccccctgagaacgtccgggccctgtccatcacttctgacgtggccgtcatc tcctggtcagagcccccgcgcagcaccctcaatggcgtcctcaaaggctatcgggtcatc ttctggtccctctatgttgatggggagtggggcgagatgcagaacatcaccaccacgcgg gagcgggtggagctgcggggcatggagaagttcaccaactacagcgtccaggtgctggcc tacacccaggctggggacggcgtacgcagcagtgtgctctacatccagaccaaggaggac gttccaggtccccctgctggcatcaaagctgtcccttcatcagctagcagtgtggttgtg tcttggctcccccctaccaagcccaacggggtgatccgcaagtacaccatcttctgttcc agccccgggtctggccagccggctcccagcgagtacgagacgagtccagagcagctcttc taccggatcgcccacctaaaccgcggtcagcagtatctgctgtgggtggccgccgtcacc tctgccggccggggcaacagcagcgagaaggtgaccatcgagcctgctggcaaggcccca gcaaagatcatctcctttgggggcaccgtgacaacaccttggatgaaagatgttcggctg ccttgcaattcagtgggagatccagcccctgctgtgaagtggaccaaggacagtgaagac tcggccattccagtgtccatggatgggcaccggctcatccacaccaatggcacactgctg ctgcgtgcagtgaaggctgaggactctggctactacacgtgcacggccaccaacactggt ggctttgacaccatcatcgtcaaccttctggtgcaagttcccccggaccagccccgcctc actgtctccaaaacctcagcttcgtccatcaccctgacctggattccaggtgacaatggg ggcagctccatccgaggcttcgtgctacagtactcggtggacaacagcgaggagtggaag gatgtgttcatcagctccagcgagcgctccttcaagctggacagcctcaagtgtggcacg tggtacaaggtgaagctggcagccaagaacagcgtgggctctgggcgcatcagcgagatc atcgaggccaagacccacgggcgggagccctccttcagcaaagaccaacacctcttcacc cacatcaactccacgcatgctcggcttaacctgcagggctggaacaatgggggctgccct atcacagccatcgttctggagtaccggcccaaggggacctgggcctggcagggcctccgg gccaacagctccggggaggtgtttctgacggaactgcgagaggccacgtggtacgagctg cgcatgagggcttgcaacagtgcgggctgcggcaatgaaacagcccagttcgccaccctg gactacgatggcagcaccattccacccatcaagtctgctcaaggtgaaggggatgatgtg aagaagctgttcaccatcggctgccctgtcatcctggccacactgggggtggcactgctc ttcatcgtacgcaagaagaggaaggagaaacggctgaagcgactccgagatgcaaagagt ttggcagaaatgttgataagcaagaacaatagaagctttgacacccctgtgaaagggcca ccccagggcccacggctacacattgacatccccagggtccagctgctcatcgaggacaaa gaaggcatcaagcaactgggagatgacaaggccaccatccctgtgacagatgctgagttc agccaagctgtcaacccacagagcttctgtactggcgtctccttgcaccacccaaccctc atccagagcacaggacccctcatcgacatgtctgacatccggccaggaaccaatccagtg tccaggaagaatgtgaagtcagcccacagcacccggaaccggtactcaagccagtggacc ctgaccaagtgccaggcctccacacctgcccgcaccctcacctccgactggcgcaccgtg ggctcccagcatggtgtcacggtcactgagagtgacagctacagtgccagcctgtcccag gacacagacaaaggaaggaacagcatggtgtccactgagagtgcctcttccacctacgag gagctggcccgggcctatgagcatgccaagctggaggagcagctgcagcacgccaagttt gagatcaccgagtgcttcatctctgacagttcctctgaccagatgaccacaggcaccaac gagaacgccgacagcatgacatccatgagcacaccctcagagcctggcatctgccgcttt accgcctcaccacccaagccccaggatgcggaccggggcaaaaacgtggctgtgcccatc cctcaccgggccaacaagagtgactactgcaacctgcccctgtatgccaagtcagaggcc ttctttcgaaaggcagatggacgtgagccctgccccgtggtcccaccccgtgaggcctcc atccggaacctggctcgaacctaccacacccaggctcgccacctgaccctggaccctgcc agcaagtccttgggccttccccacccaggggcccccgctgccgcctccacagccacctta cctcagaggactctggccatgccagcccccccagccggcacagcccccccagcccccggc cccacccctgctgagccacccaccgcccccagcgctgcccctccggcccccagcaccgag cctccacgagccgggggcccacacaccaaaatggggggctccagggactcgcttctcgag atgagcacatcggggatattgggcaaacaaacacgaatcagagccacctccaaccagcaa gagggaatgaaaaactcctcagaggctagaggcgcagaagatagaagcagcctaagggct ccgaaggggaacgctgtcatgaaaatacatgacggatcccgatactacgtgccaggtgct gcttgtaacctcattgcagacttgggagaaaggtatcacgattaccacgtgacagctgag ccccgccagggcctttgtgacaagtggcatctccagtacgcgctgactggacacatccgg gctcagcacccgggcagccaccaccagccaccactgcggccagccgcctcagaccaatgc cccgcacgaaagccaggcctccagccaccatggggctctgtcggcaacaccaagggagag gggacttggcaggtcctgcaagcttag >gi568815587f:117238587_117512165|GENSCAN_predicted_peptide_5|129_aa MADGKVADGKVADGKVVDGKVADVKMADVKVVDVKVADVKMADVIMADVKVADVKMADVK VADGKVADVKMADVKVADMKVADVKMVDVKMVDVKGACLRQEGKQRPGAVRWMYILPWEY CQLPSRPCP >gi568815587f:117238587_117512165|GENSCAN_predicted_CDS_5|390_bp atggcggatgggaaagtggcggatgggaaagtggcggatgggaaagtggtggatgggaaa gtggcggatgtgaaaatggcagatgtgaaagtagtggatgtgaaagtagcggatgtgaaa atggcggatgtgataatggcggatgtgaaagtggctgatgtgaaaatggcggatgtgaaa gtggcggatgggaaagtggcggatgtgaaaatggcggatgtgaaagtggcagatatgaaa gtagcggatgtgaaaatggtggatgtgaaaatggtggatgtgaaaggcgcatgtctcagg caagaaggaaagcagagacctggtgcagtaagatggatgtatatccttccatgggaatac tgccagcttccttccaggccttgcccctaa >gi568815587f:117238587_117512165|GENSCAN_predicted_peptide_6|470_aa MQLPRQPCDTCPNPGVDRRLQGCVEGRPPESCELHHWDEKVGTGVPRSSREAGGKVPPLI QPFEFPPASIGQLLYIPCVVSSGDMPIRITWRKDGQVIISGSGVTIESKEFMSSLQISSV SLKHNGNYTCIASNAAATVSRERQLIVRVPPRFVVQPNNQDGIYGKAGVLNCSVDGYPPP KVMWKHAKGSGNPQQYHPVPLTGRIQILPNSSLLIRHVLEEDIGYYLCQASNGVGTDISK SMFLTVKNHSPGTQGIPCFQDLPRAPPQTVLPRMGPATAVTPRAEGRLLAEHVQQQSGVL ATETYGPQSLKYLLQALYSSRVFTAMTWKYLPSDQRKMASPSVSERILMFRVIFLMSKME IKNTTHILGLCLENKEPREEVTGPILKLDKVRPGEFEGLNWTPSAAETALQSKSPTILCD PHPYPLHTYCSAQFCSLCTGAVDSQGPMGPSSPGTSPLALGAEADCFHAI >gi568815587f:117238587_117512165|GENSCAN_predicted_CDS_6|1413_bp atgcagttaccccgacagccatgtgatacgtgccctaatccaggtgtggaccgaaggctt cagggctgtgtggaaggaaggcccccagagagctgtgagctgcatcactgggatgagaag gtgggtacaggggtgccgaggagcagcagggaagctggagggaaagtgccccctctgatc cagcccttcgaattcccacccgcctccatcggccagctgctctacattccctgtgtggtg tcctcgggggacatgcccatccgtatcacctggaggaaggacggacaggtgatcatctca ggctcgggcgtgaccatcgagagcaaggaattcatgagctccctgcagatctctagcgtc tccctcaagcacaacggcaactatacatgcatcgccagcaacgcagccgccaccgtgagc cgggagcgccagctcatcgtgcgtgtgccccctcgatttgtggtgcaacccaacaaccag gatggcatctacggcaaagctggtgtgctcaactgctcggtggacggctaccccccaccc aaggtcatgtggaagcatgccaaggggagcgggaacccccagcagtaccaccctgtgccc ctcactggccgcatccagatcctgcccaacagctcgctgctgatccgccacgtcctagaa gaggacatcggctactacctctgccaggccagcaacggcgtaggcaccgacatcagcaag tccatgttcctcacagtcaagaaccacagtccaggtacccagggaattccctgcttccag gacctcccgagggctcctccacagactgtcctcccacggatgggccctgccaccgccgtg acccctagagctgagggacggctgttggcagagcatgtacaacagcagagtggagtcctt gcaacggagacctatggcccacaaagcctaaaatatttacttcaggccctttacagctct agggtattcacagcaatgacctggaagtaccttccaagtgatcaaaggaagatggcatct ccttctgtttctgaaaggattctgatgttcagagtcatttttctcatgtctaaaatggaa ataaaaaatactacccacatcttagggctgtgtttggagaacaaggagccaagagaagag gtgactggccccattttaaagctggacaaagtaaggcctggcgagtttgagggcctcaac tggacccccagtgcagcagaaacagctctgcagtccaagtccccaaccattctctgtgac cctcacccctaccccttgcacacatactgttcggcccagttctgctccctgtgtactgga gcagtggacagccagggtcccatgggccccagctcccctggcacctccccactggctctt ggcgctgaagctgattgcttccatgcaatctga