GENSCAN 1.0 Date run: 6-Nov-116 Time: 17:16:16 Sequence gi568815593f:36853697_37164889 : 311193 bp : 36.56% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 8673 8746 74 2 2 69 79 66 0.161 4.39 1.02 Intr + 22490 22710 221 2 2 70 -35 310 0.131 13.92 1.03 Intr + 23190 23343 154 1 1 26 -22 195 0.070 0.51 1.04 Intr + 31602 32067 466 0 1 23 36 549 0.501 35.40 1.05 Intr + 32101 32370 270 0 0 46 52 441 0.891 33.12 1.06 Term + 32511 32801 291 2 0 63 41 328 0.985 19.96 1.07 PlyA + 33591 33596 6 1.05 2.00 Prom + 38770 38809 40 -3.65 2.01 Init + 40119 40175 57 2 0 64 53 78 0.919 3.26 2.02 Term + 46278 46373 96 2 0 55 49 169 0.943 6.69 2.03 PlyA + 46513 46518 6 1.05 3.02 PlyA - 46920 46915 6 1.05 3.01 Sngl - 65266 63848 1419 1 0 58 45 371 0.934 25.45 3.00 Prom - 67071 67032 40 -3.85 4.00 Prom + 96185 96224 40 -4.75 4.01 Init + 100001 100064 64 1 1 56 103 33 0.546 2.98 4.02 Intr + 101776 101941 166 2 1 57 103 133 0.999 9.80 4.03 Intr + 104408 104535 128 2 2 54 89 122 0.983 8.30 4.04 Intr + 107788 107887 100 2 1 68 90 33 0.412 -0.25 4.05 Intr + 108446 108578 133 2 1 55 116 43 0.374 3.53 4.06 Intr + 118249 118345 97 0 1 54 74 36 0.097 -2.54 4.07 Intr + 122080 122706 627 2 0 51 93 511 0.683 39.15 4.08 Intr + 130980 132605 1626 1 0 80 116 1214 0.965 110.41 4.09 Intr + 141926 142108 183 0 0 83 101 233 0.982 22.94 4.10 Intr + 146677 146874 198 2 0 41 53 270 0.863 17.20 4.11 Intr + 147121 147192 72 2 0 93 87 39 0.871 2.76 4.12 Intr + 148966 149069 104 2 2 75 65 134 0.999 8.77 4.13 Intr + 149565 149651 87 2 0 46 103 41 0.610 0.65 4.14 Intr + 152661 152892 232 2 1 66 88 200 0.999 14.22 4.15 Intr + 153627 153778 152 1 2 85 110 24 0.999 3.26 4.16 Intr + 154312 154392 81 0 0 72 92 31 0.627 0.82 4.17 Intr + 154716 154843 128 2 2 56 91 18 0.784 -2.54 4.18 Intr + 156391 156529 139 1 1 61 78 153 0.989 11.15 4.19 Intr + 160987 161069 83 0 2 102 87 84 0.982 7.22 4.20 Intr + 162342 162474 133 0 1 68 91 50 0.981 3.03 4.21 Intr + 163323 163466 144 2 0 30 79 122 0.951 5.06 4.22 Intr + 165615 165704 90 2 0 85 46 84 0.900 3.17 4.23 Intr + 166763 166977 215 1 2 68 69 169 0.996 9.59 4.24 Intr + 167079 167181 103 0 1 59 97 78 0.999 5.06 4.25 Intr + 168355 168453 99 0 0 42 65 151 0.994 7.59 4.26 Intr + 168548 168694 147 1 0 86 76 183 0.999 16.41 4.27 Intr + 170889 171023 135 2 0 118 67 90 0.998 9.74 4.28 Intr + 172533 172631 99 2 0 106 83 51 0.899 5.79 4.29 Intr + 173663 173716 54 1 0 109 92 58 0.984 6.56 4.30 Intr + 182683 182791 109 0 1 1 99 123 0.937 3.64 4.31 Intr + 184906 185042 137 2 2 91 84 16 0.921 0.87 4.32 Intr + 190651 190791 141 0 0 63 67 133 0.983 8.33 4.33 Intr + 190940 191033 94 1 1 97 115 -47 0.617 -2.48 4.34 Intr + 191747 191901 155 0 2 61 98 59 0.560 3.07 4.35 Intr + 192413 192503 91 1 1 122 97 55 0.999 8.45 4.36 Intr + 194806 194979 174 2 0 118 93 38 0.981 6.29 4.37 Intr + 195415 195605 191 2 2 106 115 122 0.999 15.08 4.38 Intr + 198083 198190 108 1 0 87 50 124 0.996 8.06 4.39 Intr + 198670 198870 201 0 0 64 66 154 0.991 9.36 4.40 Intr + 205195 205469 275 0 2 50 82 201 0.968 11.11 4.41 Intr + 207148 207322 175 1 1 43 76 131 0.833 6.42 4.42 Intr + 210094 210282 189 0 0 59 60 216 0.999 14.76 4.43 Intr + 210831 211067 237 2 0 62 9 280 0.022 14.39 4.44 Intr + 218264 218392 129 1 0 48 18 146 0.018 3.57 4.45 Intr + 229508 229653 146 1 2 17 39 135 0.224 -0.34 4.46 Intr + 231461 231821 361 2 1 57 94 278 0.063 19.60 4.47 Intr + 234923 235497 575 1 2 -50 52 285 0.482 1.71 4.48 Intr + 236449 236979 531 1 0 35 40 193 0.382 0.32 4.49 Term + 238125 238257 133 0 1 42 42 216 0.948 8.98 4.50 PlyA + 240107 240112 6 1.05 5.10 PlyA - 245131 245126 6 1.05 5.09 Term - 254082 253906 177 0 0 66 47 324 0.927 22.80 5.08 Intr - 259794 259691 104 1 2 22 53 90 0.472 -2.13 5.07 Intr - 261353 261264 90 0 0 79 116 93 0.860 10.25 5.06 Intr - 289329 289221 109 0 1 83 65 56 0.169 1.74 5.05 Intr - 300297 300044 254 1 2 64 69 177 0.898 9.63 5.04 Intr - 303724 303617 108 2 0 53 92 92 0.415 5.44 5.03 Intr - 304172 303974 199 2 1 59 86 152 0.864 10.10 5.02 Intr - 304649 304528 122 0 2 110 10 87 0.816 2.39 5.01 Intr - 308870 308769 102 0 0 56 99 87 0.556 5.83 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 84544 84409 136 0 1 19 106 109 0.811 5.45 S.002 Term + 210831 211196 366 2 0 62 39 338 0.965 19.92 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:36853697_37164889|GENSCAN_predicted_peptide_1|491_aa MIKFITMEAEERVEGMSVFVDGRSPNNTQSGVRSRLSKGKVYDPSIHPAAGTPFKTGIIA IESRASSVGPPGDRLPKANDDLGQPARPPAAAVGDLRARRDAPADRRIGSRARGDAPPVA RARGRVFVSVSMWEKEEEEEEEATICLLGCAASVYAGARGSGSRISVSCSTSFRGGLGSR GLAKWMAQGLAGMGGIQNKETLQSLNNHLASYLDRVRSLETKNQRLESNTREHLEKKGPQ VRDWGHYFKTIEDLRAQIFANTVDNACIVLQIDNAYLAADGFRVKCETELAMCQSVENDI RGLCKLETDIEALREELLLMKENHEEEVKGLQAQLTSSGLTVKVDAPKSQDLAKIMADIQ AQYDELAQKNQEELDKYWSQQIEESTTVVTTQSAQVEQLNRILLHLESELAQTQAEGQHQ AQEYETMLNIKVKLEAESATCHRLLEDGKNFSLDDALDSSNSMQTIQKTTTRWIVDGRVV SETSDTKVLRH >gi568815593f:36853697_37164889|GENSCAN_predicted_CDS_1|1476_bp atgatcaaatttatcactatggaagcagaagaaagagttgaggggatgagtgtctttgtg gatggcagaagcccaaataacacgcagtcgggcgtaaggtcccggctctcaaaaggcaag gtgtacgatccatccatccacccggctgcagggacaccatttaaaacgggcatcatcgcc atcgaatcgcgggcgtcctctgtggggccgcccggagatcggctccctaaagctaacgac gacctgggacagccagcgcggcctccagccgccgccgtcggcgacctgagagcccggagg gacgcccccgccgacaggagaattggttcccgggcccgcggcgatgcccccccggtagct cgggcccgtggtcgggtgtttgtgagtgtttctatgtgggagaaggaggaggaggaggaa gaagaagcaacgatttgtcttctcggctgtgcagccagcgtctatgcaggcgccaggggc tctggttcgcggatctccgtgtcctgttccaccagcttccgcggcggcttggggtccagg ggcctggccaagtggatggcccagggtctggcaggaatgggaggcatccagaacaaggag accctccaaagcctgaacaaccacctggcctcctacctggacagagtgaggagcctagag accaagaaccagagactggagagcaacacccgggagcacctggagaagaagggaccccag gtcagagactggggccattacttcaagaccatcgaggatctgagggctcagatctttgca aatactgtggacaatgcctgcattgttctgcagattgacaatgcctaccttgctgctgat ggctttagagtcaagtgtgagacagagctggccatgtgccagtctgtagagaacgacatc cgtgggctctgcaagctagagacagacatcgaggctctcagggaggagctgctcttaatg aaggagaaccacgaagaggaagtaaaaggcctacaagcccagctcaccagctctgggttg accgtgaaggtagatgctcccaaatctcaggacctcgccaagatcatggcagacatacag gcccaatacgacgagctagctcagaagaaccaagaggagctagacaagtactggtctcag cagattgaggagagcaccacagtggtcaccactcagtccgcgcaggtagagcagctcaac agaatcctgctgcacctggagtcagagctggcacagacccaggcagaggggcagcaccag gcccaggagtacgagaccatgctgaacatcaaggtcaagctggaggctgagagcgccacc tgccaccgcctgcttgaagatggcaagaacttcagtcttgatgatgccctggacagcagc aactccatgcaaactatccaaaagaccaccacccgctggatagtggatggcagagtggtg tctgagaccagtgacaccaaagttctgagacattaa >gi568815593f:36853697_37164889|GENSCAN_predicted_peptide_2|50_aa MEEELLDTSEDVELRKEVRKIEKVFAKAFNSLILSDDDDYINSNDGQYLS >gi568815593f:36853697_37164889|GENSCAN_predicted_CDS_2|153_bp atggaggaggagttgctggatactagcgaggacgtagaattgaggaaagaagttaggaaa atagaaaaagtcttcgctaaagcctttaatagccttattcttagtgatgatgatgattat attaatagcaatgatggccagtatttgtcatga >gi568815593f:36853697_37164889|GENSCAN_predicted_peptide_3|472_aa MDIDAKILNKILAKRIQQHIKKLIHRDQVGFILGMQDWFNIQKSINVIHHINRTKDKNHM IISVDAEKAFDKIQQPFMLKTLNKLGIDGTYLKVTRAIYDKPTANIILNGQKLEAFLLKT GTRQGRPLSPLLFNIVLEVLARAIRQEKKIKGIQLGKEEVKLSLFADDMTVYLENPIISD QNLLRLINNFSKVSGYKINVQKSQAFLNTNNRQTESQIMSELPLTVASKRIKYLGIQLKR DVKDLFKENYIPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAISIKLPMT FFTELEKTTLKFTWNQKRAHIAKTILSQKNKAGGIMLPDFKLYYKATVTQTAWYWYQNRD IDQWNRTEPSEIIPHIYNHLIFDKPDKNKKWGKDSLFNKWCWENWLAICRKLKLNPFLTP YTKINSRRIKDLNVRPKTIKTLEENLGNTIQDIGMGKDFMSLLNDMSFTECK >gi568815593f:36853697_37164889|GENSCAN_predicted_CDS_3|1419_bp atggacattgatgcaaaaatcctcaataaaatattggcaaagcgaatccagcagcacatc aaaaagcttatccaccgtgatcaagtgggtttcatccttgggatgcaagactggttcaac atacaaaaatcaataaacgtaatccatcatataaacagaaccaaagacaaaaaccacatg attatctcagtagatgcagaaaaggcctttgacaaaattcaacagcccttcatgctaaaa actctcaataaattaggtattgatggaacgtatctcaaagtaacaagagctatctatgac aaacccacagccaatatcatactgaatgggcaaaaactggaagcattccttttgaaaact ggcacaagacagggacgccctctctcaccactcctattcaacatagtgttggaagttctg gccagggcaatcaggcaggagaaaaaaataaagggtattcaattaggaaaagaggaagtc aaattgtccctgtttgcagatgacatgactgtatatttagaaaaccccatcatctcagac caaaatctccttaggctgataaacaacttcagcaaagtctcaggatacaaaatcaatgtg caaaaatcacaagcattcttaaacaccaataacagacaaacagagagccaaatcatgagt gaactcccactcacagttgcttcaaagagaataaaatacctaggaatccaacttaaaagg gatgtgaaggacctcttcaaggagaactacataccactgctcaacgaaataaaagaggat acaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatg gccatactgcccaaggtaatttatagattcaatgccatctccatcaagctaccaatgact ttcttcacagaattggaaaaaactactttaaagttcacatggaaccaaaaaagagcccac attgccaagacaatcctaagccaaaagaacaaagctggaggcatcatgctacctgacttc aaactatactacaaggctacagtaacccaaacagcatggtactggtaccaaaacagagat atagaccaatggaacagaacagagccctcagaaataataccacacatctacaaccatctg atttttgacaaacctgacaaaaacaagaaatggggaaaggattccctatttaataaatgg tgctgggaaaactggctagccatatgtagaaagttgaaactgaatcccttccttacacct tatacaaaaattaattcaagaaggattaaagacttaaatgttagacctaaaaccataaaa accctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttcatg tcattactaaatgacatgtcatttaccgaatgtaaatga >gi568815593f:36853697_37164889|GENSCAN_predicted_peptide_4|3256_aa MNGDMPHVPITTLAGIASLTDLLNQLPLPSPLPATTTKSLLFNARIAEEVNCLLACRDDN LVSQLVHSLNQVSTDHIELKDNLGSDDPEGDIPVLLQAVLARSPNVFREKSMQNRYVQSG MMMSQYKLSQNSMHSSPASSNYQQTTISHSPSRQALGTDLCHSKIAQCLVHTPHKALQDT CHIPILQVTQHIHRCNKDGDSSTMRNAASFPLRSPQPVCSPAGSEGTPKGSRPPLILQSQ SLPCSSPRDVPPDILLDSPERKQKKQKKMKLGKDEKEQSEKAAMYDIISSPSKDSTKLTL RLSRVRSSDMDQQEDMISGVENSNVSENDIPFNVQYPGQTSKTPITPQDINRPLNAAQCL SQQEQTAFLPANQVPVLQQNTSVAAKQPQTSVVQNQQQISQQGPIYDEVELDALAEIERI ERESAIERERFSKEVQDKDKPLKKRKQDSYPQEAGGATGGNRPASQETGSTGNGSRPALM VSIDLHQAGRVDSQASITQDSDSIKKPEEIKQCNDAPVSVLQEDIVGSLKSTPENHPETP KKKSDPELSKSEMKQSESRLAESKPNENRLVETKSSENKLETKVETQTEELKQNESRTTE CKQNESTIVEPKQNENRLSDTKPNDNKQNNGRSETTKSRPETPKQKGESRPETPKQKSDG HPETPKQKGDGRPETPKQKGESRPETPKQKNEGRPETPKHRHDNRRDSGKPSTEKKPEVS KHKQDTKSDSPRLKSERAEALKQRPDGRSVSESLRRDHDNKQKSDDRGESERHRGDQSRV RRPETLRSSSRNEHGIKSDSSKTDKLERKHRHESGDSRERPSSGEQKSRPDSPRVKQGDS NKSRSDKLGFKSPTSKDDKRTEGNKSKVDTNKAHPDNKAEFPSYLLGGRSGALKNFVIPK IKRDKDGNVTQETKKMEMKGEPKDKVEKIGLVEDLNKGAKPVVVLQKLSLDDVQKLIKDR EDKSRSSLKPIKNKPSKSNKGSIDQSVLKELPPELLAEIESTMPLCERVKMNKRKRSTVN EKPKYAEISSDEDNDSDEAFESSRKRHKKDDDKAWEYEERDRRSSGDHRRSGHSHEGRRS SGGGRYRNRSPSDSDMEDYSPPPSLSEVARKMKKKEKQKKRKAYEPKLTPEGDDDEIPQE LLLGKHQLNELGSESAKIKAMGIMDKLSTDKTVKVLNILEKNIQDGSKLSTLLNHNNDTE EEERLWRDLIMERVTKSADACLTTINIMTSPNMPKAVYIEDVIERVIQYTKFHLQNTLYP QYDPVYRLDPHGGGLLSSKAKRAKCSTHKQRVIVMLYNKVCDIVSSLSELLEIQLLTDTT ILQVSSMGITPFFVENVSELQLCAIKLVTAPRFHSPLNETSVLFDFYKNVIINDRKIEQL TLDTENIFILNGRLNSSDMDGEPMYIQMVTALVLQLIQCVVHLPSSEKDSNAEEDSNKKI DQDVVITNSYETAMRTAQNFLSIFLKKCGSKQGEEDYRPLFENFVQDLLSTVNKPEWPAA ELLLSLLGRLLVHQFSNKSTEMALRVASLDYLGTVAARLRKDAVTSKMDQGSIERILKQV SGGEDEIQQLQKALLDYLDENTETDPSLVFSRKFYIAQWFRDTTLETEKAMKSQKDEESS EGTHHAKEIETTGQIMHRAENRKKFLRSIIKTTPSQFSTLKMNSDTVDYDDACLIVRYLA SMRPFAQSFDIYLTQILRVLGENAIAVRTKAMKCLSEVVAVDPSILARLDMQRGVHGRLM DNSTSVREAAVELLGRFVLCRPQLAEQYYDMLIERILDTGISVRKRVIKILRDICIEQPT FPKITEMCVKMIRRVNDEEGIKKLVNETFQKLWFTPTPHNDKEAMTRKILNITDVVAACR DTGYDWFEQLLQNLLKSEEDSSYKPVKKACTQLVDNLVEHILKYEESLADSDNKGVNSGR LVACITTLFLFSKIRPQLMVKHAMTMQPYLTTKCSTQNDFMVICNVAKILELVVPLMEHP SETFLATIEEDLMKLIIKYGMTVVQHCVSCLGAVVNKVTQNFKFVWACFNRYYGAISKLK SQHQEDPNNTSLLTNKPALLRSLFTVGALCRHFDFDLEDFKGNSKVNIKDKVLELLMYFT KHSDEEVQTKAIIGLGFAFIQHPSLMFEQEVKNLYNNILSDKNSSVNLKIQVLKNLQTYL QEEDTRMQQADRDWKKVAKQEDLKEMGDVSSGMSSSIMQLYLKQVLEAFFHTQSSVRHFA LNVIALTLNQGLIHPVQCVPYLIAMGTDPEPAMRNKADQQLVEIDKKYAGFIHMKAVAGM KMSYQVQQAINTCLKDPVRGFRQDESSSALCSHLYSMIRGNRQHRRAFLISLLNLFDDTA SMVKDKRKERKSSPSKENESSDSEEEVSRPRKSRKRVDSDSDSDSEDDINSVMKCLPENS APLIEFANVSQGILLLLMLKQHLKNLCGFSDSKIQKYSPSESAKVYDKAINRKTGVHFHP KQTLDFLRSDMANSKITEEVKRSIVKQYLDFKLLMEHLDPDEEEEEGEVSASTNARNKAI TSLLGGGSPKNNTAAETEDDESDGEDRGGGTSGSLRRSKRNSDSTELAAQMNESVDVMDV IAICCPKYKDRPQIARVVQKTSSGFSVQWMAGSYSGSWTEAKRRDGRKLVPWYSAPNKEL SMASSQDAGIALVEELDSEKMEESKLRSLHEVGEQEATSIGKGGEYYIKRTPCRTKESEQ QPSALDLPSDRAYPNEKESENQLCPHKLRECLPLIIFLRNRLKYALTGDEVKICMQGFIK VDGKVRTDITYPAGFMDVISIDKTGDNFHLIYDTKGRFAVHRITPEEAKYKLGKVRKIFV GTKGIPHLVTHDAHSIRYLDPLIKEQAPGPARNWGNASSFPRSDGKPTVLSLRAATTGST GLDLLCLNKLMLKEGEDPKRVATGIWGLLPPGTVGLVLGWSSLSSKGINVLTVVIDSDYH REILVMMDCKGLHILPPGSKIAQLLILSYWVPSLYGKERGKGSFGSTGATGVYWNQLITD QGPMKKLEIRILLAYWAQGQTFQSLVIKTGQKLGFGYLTPAAKREIEEIEQAVSQGQLDR IDPRYSIQLFYLSHQTLPYRVNRTDGPRAMLPRMGFLPHTGTKTLSPYSQLLTKVIYSGH KQCNQSLGYDPDVIRIPLSKKQFKAVLPVSINLQIAFSDYTGQIEHILPADKLLYFLSHT LVILPTKIVHSPIPNALTLFTDGSGKHGKAAVWIQPGTGDEGTDPTGSTAPDDEASMDDT SPGRHLGDAEEDNSGG >gi568815593f:36853697_37164889|GENSCAN_predicted_CDS_4|9771_bp atgaatggggatatgccccatgtccccattactactcttgcggggattgctagtctcaca gacctcctgaaccagctgcctcttccatctcctttacctgctacaactacaaagagcctt ctctttaatgcacgaatagcagaagaggtgaactgccttttggcttgtagggatgacaat ttggtttcacagcttgtccatagcctcaaccaggtatcaacagatcacatagagttgaaa gataaccttggcagtgatgacccagaaggtgacataccagtcttgttgcaggccgtcctg gcaaggagtcctaatgttttcagggagaaaagcatgcagaacagatatgtacaaagtgga atgatgatgtctcagtataaactttctcagaattccatgcacagtagtcctgcatcttcc aattatcaacaaaccactatctcacatagcccctccagacaagctctgggaacagattta tgccacagcaaaatagcccagtgcctagtccatacgccccacaaagccctgcaggataca tgccatattcccatccttcaagttacacaacacatccacagatgcaacaaggatggagat tcttcaacaatgaggaatgctgcatcttttcccttgagatctccacagccagtatgctcc cctgctggaagtgaaggaactcctaaaggctcaagaccacctttaatcctacaatctcag tctctaccttgttcatcacctcgagatgttccaccagatatcttgctagattctccagaa agaaaacaaaagaagcagaagaaaatgaaattaggcaaggatgaaaaagagcagagtgag aaagcggcaatgtatgatataattagttctccatccaaggactctactaaacttacatta agactttctcgtgtaaggtcttcagacatggaccagcaagaggatatgatttctggtgtg gaaaatagcaatgtttcagaaaatgatattccttttaatgtgcagtacccaggacagact tcaaaaacacccattactccacaagatataaaccgcccactaaatgctgctcaatgtttg tcgcagcaagaacaaacagcattccttccagcaaatcaagtgcctgttttacaacagaac acttcagttgctgcaaaacaaccccagacttctgtggtacagaatcaacaacagatatca caacagggacctatatatgatgaagtggaattggatgcattggctgaaattgagcgaata gagagagaatcagctattgaaagggagcgcttctcaaaagaagttcaagataaagataag cctttgaaaaaaagaaaacaagattcttacccacaggaggctgggggtgctacaggaggt aatagaccagcttctcaggagacgggttctacgggaaatgggtcaaggccagcattaatg gttagcattgatcttcatcaggcaggaagagtggactctcaggcttctataactcaggat tcagactccataaaaaagcctgaagaaatcaaacaatgtaatgatgcacctgtttctgtt cttcaggaagatattgttggaagtcttaaatctacaccagaaaaccatcctgagacacct aaaaaaaagtctgatcctgagctttcaaagagtgaaatgaaacaaagtgaaagtagatta gcagaatctaaaccaaatgaaaaccgattggtggagacaaaatcaagtgaaaataagtta gaaactaaagttgagacccaaacagaagaacttaaacagaatgagagcagaacaactgaa tgcaaacaaaacgagagcaccatagttgagcctaaacaaaatgaaaatagactgtctgac acaaaaccaaatgacaacaaacaaaataatggcagatcagaaacaacaaaatcaaggcct gaaaccccaaagcaaaagggtgaaagccggcctgagactccaaaacaaaagagtgatggg catcctgaaaccccaaaacagaagggtgatggaaggcctgaaactccaaagcaaaaaggt gagagccgccctgaaactccaaagcaaaaaaatgaagggcgacctgaaacaccaaaacac aggcatgacaataggagggattctggaaagccatctacagagaaaaaacctgaagtgtct aaacataaacaagatactaaatctgactcacctcggttaaaatcagaacgagctgaagcc ttaaagcagagacctgatgggcgatctgtttctgagtcactaagacgtgaccatgataat aaacaaaaatcagatgacaggggtgaatcagagcgacatcgaggggatcagtctagggtt cgaagaccagaaacattgagatcctctagtagaaatgaacatggcattaaatctgatagt tcaaaaactgataaactagaacgaaaacacaggcatgaatcaggggactcaagggaaaga ccatcttctggggaacaaaaatcaagacctgacagtcctcgtgttaaacaaggagattct aataaatcaagatctgataaacttggttttaaatcaccaactagtaaagatgacaaaagg acagagggtaacaagagtaaagtagacactaataaagcacaccctgacaataaggcagaa tttccaagttatttgttggggggcaggtctggtgcgttgaaaaattttgtcattccgaaa atcaagagggataaagatggcaatgttactcaggagacaaagaaaatggaaatgaaagga gagccgaaagacaaagtagaaaaaataggattagttgaagatctaaataaaggagctaag cctgtagttgtgctacaaaaactgtctttggatgatgttcagaaacttattaaagataga gaggacaaatcaagaagttcccttaaacctatcaagaataaaccatcaaagtcaaataaa ggtagtatagatcaatcagtgttaaaagaattaccccctgaactcctggcagaaattgag tccaccatgccactttgtgaacgtgtgaaaatgaacaaacgcaagcgtagcacagttaat gaaaagccaaaatatgctgaaatcagttcagatgaagataatgatagtgatgaagctttt gaatcctctaggaaacgacataaaaaagatgatgataaagcttgggaatatgaagagcgt gacagaagaagctctggggatcataggagaagtggccactctcatgaaggaagaaggagt tcaggtggtggtcgttatcgaaaccgaagtccgtcagattctgacatggaagattattct cctcctcccagccttagtgaggttgctaggaaaatgaagaaaaaagaaaaacagaagaaa aggaaagcatatgaaccaaaactaacacctgaaggtgatgatgatgaaattcctcaggaa ctgctcttaggaaaacatcagcttaatgaacttggcagtgaatctgctaaaataaaagca atgggtataatggataagctttcaactgacaaaactgtgaaagtcttaaatatcttggag aagaatattcaggatgggtcaaagctttccactttgttaaatcataataacgatactgaa gaagaagaaaggttatggagagaccttattatggagagagttacaaaatcagcggatgct tgtcttacaactatcaacattatgacatcccctaacatgccaaaagctgtgtacattgag gatgtaattgaaagagttatacagtacactaaatttcatttgcagaatacactttatcct cagtatgatcctgtttacagattagatcctcatggaggaggcttattaagttcaaaagca aaacgggctaaatgttctacccataagcagagagtaatagtaatgctttataacaaagtt tgtgacattgttagcagcttatcagaattgctagagatacaacttcttacagacacaaca attcttcaggtttcatctatgggaataacaccattttttgtggaaaatgtcagtgaacta cagttgtgtgccattaagttagtcactgcaccaagatttcatagtcctttaaatgaaact agtgtactctttgacttctataagaatgttataattaatgataggaaaatagagcagctt accttagatactgaaaacattttcattctaaatggcaggttaaacagtagtgatatggat ggagaacctatgtatattcagatggttacagcactggttttacaacttattcagtgtgtg gtacacttaccatcatcagagaaggactctaatgcagaagaagattcaaataaaaaaatt gaccaggatgttgtcattactaactcttatgaaacagctatgcgaacagcccaaaacttc ctctccatcttccttaaaaaatgtggtagtaagcaaggtgaagaagattacagaccactg tttgaaaattttgttcaagaccttctttcaacagtcaataagcctgaatggccagctgct gaactactccttagtttgttagggagactgttggttcatcagttcagtaacaagtcaaca gagatggctttaagagtggcatctcttgattaccttggaactgttgctgcacggctaaga aaagatgctgttacaagcaaaatggatcaaggatctatagaacgcattttaaaacaggtt tcaggaggggaagatgaaatccaacaattacaaaaagcattgcttgattacttggatgaa aacactgagactgatccttcactagtgttttctcgtaaattctatatagcccagtggttt cgagacacaactctggaaacagaaaaagcaatgaaatcacaaaaagatgaagaatcatct gaaggaacacatcatgcaaaggaaattgagacaactggccaaattatgcatcgagctgaa aaccgaaaaaagtttcttagaagcattatcaaaaccacaccttctcagtttagcacatta aagatgaactctgatactgtggactatgatgatgcttgcttgattgttcgatacttggcc tccatgaggccgtttgcccagagctttgatatttatttgacacagatcctacgagttctt ggtgaaaatgcaattgctgttcgaacaaaagccatgaagtgtttgtctgaggttgttgct gtagaccccagtattctagcaaggcttgatatgcaacgaggtgttcatggacgattgatg gataattcgactagtgtccgagaagcagcagtagaattactaggtcgatttgtcctttgt cgacctcagcttgctgaacagtattatgatatgctgattgaaagaatattggatactggt atcagtgtcaggaaaagagtaataaagattctcagagacatttgtattgaacaaccaaca tttccaaaaatcacagaaatgtgtgtaaaaatgattcgcagagtcaatgatgaagagggc attaagaaattagtaaatgaaacattccagaaactctggtttactccaactccacacaat gacaaagaagcaatgacaaggaaaattttaaacattaccgatgtggttgcagcatgcaga gatactggatatgactggtttgagcaactgcttcaaaacttgttgaagtccgaagaggat tcctcatataaacctgtgaagaaagcttgtactcaacttgttgataacctagttgagcac attcttaaatatgaggaatctctagctgactctgacaataaaggtgtgaattctggaaga ttggtagcttgcataaccactttgttcttattcagcaaaataagaccccagctcatggtt aaacatgcaatgactatgcaaccataccttaccactaaatgtagtacgcaaaatgatttc atggttatctgcaatgttgcaaaaatcctagagctagttgtaccactgatggagcatcca agtgaaacttttcttgccactattgaggaagatctaatgaagctcatcatcaaatatggc atgactgtagtgcaacattgtgtgagctgtcttggagctgttgtaaataaagtgacacaa aattttaaatttgtgtgggcttgtttcaatagatactatggtgccatttcaaaattaaaa agtcaacaccaagaggacccaaataacacttcacttctaacaaacaaaccagcacttctt agatcccttttcaccgttggagcactatgtcggcattttgattttgatctggaagatttt aaaggcaacagcaaggttaacataaaagataaagtacttgaactattgatgtattttaca aaacactcagatgaagaagtacaaacaaaagctatcattggtctaggatttgcctttatt cagcatccaagtctaatgttcgagcaagaagtgaagaatctatataataatattttatct gataagaactcctcagtcaatttaaaaatacaagtgttaaaaaacctccagacctaccta caagaagaagatacacgtatgcagcaggcagatagagactggaagaaagttgcaaaacag gaagacttaaaagaaatgggtgatgtttcctcagggatgagtagttccatcatgcagctt tatctcaaacaggtgcttgaggcattttttcacacccagtcaagtgtacgccactttgcc ctaaatgtcattgcattgactctaaatcaaggtcttattcatccagttcagtgtgtgcca tatttaattgctatgggcacagacccagaacctgctatgcggaacaaggctgatcagcaa cttgtggaaatagacaaaaaatatgctggattcattcatatgaaagcagtggctggtatg aagatgtcttaccaggtacaacaggcaatcaacacatgcctaaaagatcctgtaaggggt ttcagacaagacgagtcctctagcgctttgtgttcacacctttactccatgatccgtgga aaccgccaacacagacgagcctttcttatttctttactcaacctctttgatgacacagca tctatggtaaaggacaaaaggaaagagagaaaatcatcacctagtaaggaaaatgagtca agcgacagtgaagaagaagtttccaggcctcggaagtcacggaaacgtgtagattcagat tcagattcagattcagaagacgatataaattcagtgatgaaatgtttgccagaaaattca gctcctttaatcgaatttgcaaatgtgtcccagggtattttattacttctcatgttaaaa caacatttgaagaatctttgtggattttctgatagtaaaattcagaagtactctccatct gaatctgcaaaagtatatgataaagcgataaaccgaaaaacaggagttcattttcatcca aaacaaacactggacttcctgcggagtgacatggctaattccaaaatcacagaagaggtg aaaaggagtatagtaaaacagtatctagatttcaaacttctcatggaacatctggaccct gatgaagaagaagaagaaggggaggtttcagctagcacaaatgctcggaacaaagcaatt acctcactgcttggaggaggcagccctaaaaataatacagcagcagagacagaagatgat gaaagtgatggggaggatagaggaggaggcacttcagggtcattgagaaggtcaaaacga aattcagactctacggagttggcagcacagatgaatgaaagtgttgacgtcatggatgtc atcgctatttgctgtccaaagtacaaagatcgaccacaaattgcaagagtagtgcagaaa accagcagtggcttcagtgttcagtggatggcaggctcctacagtggctcctggactgag gctaagcgccgtgatggccgcaaactggtgccttggtactcagctcccaacaaagaattg agtatggcatcaagccaggatgcggggatagctctagttgaagagctagattcagagaaa atggaagaatccaagctcagaagtctccatgaagtgggagagcaagaagccacatccata ggaaaagggggagagtactacatcaagagaacaccctgcaggacaaaagaatctgaacaa cagccttcagccctagaccttccctctgacagagcctacccaaatgagaaggaatcagaa aaccaactctgtcctcacaagttgagagagtgtctccccctcatcattttcctaaggaac agacttaagtatgccctgacaggagatgaagtaaagatttgcatgcaggggttcattaag gttgatggcaaggtacgaactgatataacctaccctgctggattcatggatgtcatcagc attgacaagacgggagataatttccatctgatctatgacaccaagggtcgctttgctgta catcgtattacacccgaggaggctaagtacaaattgggtaaagtgagaaaaatctttgtg ggcacaaaaggaatccctcatctggtgactcatgatgctcacagcatccgctaccttgat cccctcatcaaggagcaagccccaggccccgctcgcaactggggcaatgccagcagcttt cctcgatcagatgggaagcccacagtcctctctctcagagcagccaccacagggagcaca ggactggacttactctgcctcaacaaattaatgctaaaagaaggagaagaccctaaaagg gttgcaaccgggatctggggcctgctgcctccaggaacagtgggattagtcctagggtgg tctagcctatctagtaaaggaattaatgtgctcaccgtggtaattgatagtgattatcac agagagatattggttatgatggactgtaaaggtctgcatattcttccccctggatcaaag atagctcagttactgattttatcatactgggtccccagtctctatggaaaggaaaggggg aagggaagttttgggagcacaggagccacaggagtatattggaatcaattaatcactgat caaggacccatgaagaaattggaaataagaattttactggcttactgggcacaggggcag acatttcaatcattagtgatcaaaactggccagaaacttggctttggatatttaacccct gcagcaaaaagggaaattgaggaaatagaacaagccgtctctcaggggcagctagatcgc attgatccacgttattcaatccaattgttttatctttctcaccaaacactcccctacagg gttaataggacagatggcccccgggctatgcttcctagaatgggttttttgccacatacc gggactaaaacactatctccctatagtcagttacttactaaagtcatctattcaggccac aaacaatgcaatcagtcactaggttatgaccctgatgtcatcaggattcctttaagtaaa aagcaattcaaagcagtattgccagtatctattaatctgcaaatagctttctctgattac acaggacaaatagagcacatacttcctgctgataaactcctttatttcttatctcatacc ctggtaatcttacccacaaaaatagttcactcccccatacctaatgctttaacactgttt actgatggttctggtaaacatggaaaagcagcagtctggatccaacctggtaccggagat gaaggaactgaccctacaggatccacagccccagacgatgaggcttccatggatgacaca agccccggacgtcacctgggggatgctgaagaggacaactcaggaggctga >gi568815593f:36853697_37164889|GENSCAN_predicted_peptide_5|421_aa XMLQDDNTSAGLHFMASVKKKAIGSQDASTNTDPEHEPLTAPQLLVPDVYLNLKLSSEMS EKPWSPSIPHTVTNLVGHTYINVIDIEANDLLQELPVREEPSNDNVIKQQSDHLAVPSSA ELHYMAASVTNAVPPHNFKSQEVTPACLDGKSLRAGITEVKEPSVTSPTPSDIQQNKGLP KPEFRFKGQSTKSDSAEDYLLWKRLQGVSAACPAPSSAAHQLEHLSAKLQKIDEQLLAIQ NIAENIEQDFPKPEMLDLHCDKLLSMQIENLFAVSPGERFSFSYFMDKAVLQCSKIHEGT ATFTIQKKAGGAKAAVRKATQSPVTFQKVCSHAANTYLKLGNLKGKKFNGLTVSHGCGGL TIMDLSPTEEEEPEHPFGVGGVDSVSESTGSILSKLDWNAIEDMVASVEDQGLSVHWALD L >gi568815593f:36853697_37164889|GENSCAN_predicted_CDS_5|1266_bp naaatgctacaagatgataatacttcagctggattgcatttcatggcctctgtaaaaaag aaagctataggaagtcaagatgcaagtacaaatacagacccagaacatgagcctttgact gctcctcagctcttggtcccagatgtctatctaaatctgaagctttccagtgaaatgtca gagaaaccttggtcaccctcaatacctcatacagtaacaaacttggttggacatacttat ataaatgtgattgacattgaagctaatgatcttctacaggaattacctgtgagagaagag ccttcaaatgataatgttatcaaacagcaaagcgatcatctagcagttccatcgtctgca gagttacattatatggcagcttcagttactaatgctgttcccccacataattttaagagt caagaagtaactccagcttgtctggatggaaaaagcttgagagcaggcattacagaagtg aaggagcctagtgtcacctcacctacaccatcagacatacagcagaacaaaggtctgcca aaaccagagttccgattcaaaggacagagcacaaagtcagactctgcagaagattatcta ttgtggaaacggctgcaaggtgtctctgcagcttgccctgcaccaagctctgcagctcac caactagagcatctcagtgctaagcttcagaaaattgacgagcagttgctagcaatacag aacattgctgaaaacatagaacaggatttccccaagcctgaaatgctagatctacattgt gataagctgctgtccatgcagatagaaaacttatttgctgtttcaccaggggagagattt tctttttcctattttatggacaaagctgtccttcagtgctctaagatacatgaaggaact gccactttcaccatacagaaaaaagctggtggagccaaagcagcagtaagaaaggctacg cagtctccagttaccttccaaaaagtctgttctcatgctgctaacacatacctgaaactg ggtaatttaaaaggaaagaagtttaatggactcacagtttcacatggctgtggaggcctc acaatcatggacttgtctccaactgaagaggaagagccagagcatccttttggggtgggc ggtgtggacagcgtgtctgagagcactggcagcatcctcagcaagctggactggaatgcc atcgaagacatggtggccagcgtggaggaccagggcctgtctgtccactgggccctggac ctgtaa