GENSCAN 1.0 Date run: 3-Nov-116 Time: 11:27:50 Sequence gi568815592f:33320265_33551903 : 231639 bp : 48.34% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 Intr - 327 121 207 1 0 90 81 157 0.748 14.25 1.02 Intr - 1303 472 832 1 1 76 91 990 0.983 89.07 1.01 Init - 1661 1455 207 2 0 53 83 250 0.847 19.82 1.00 Prom - 5134 5095 40 -3.26 2.00 Prom + 5622 5661 40 -7.56 2.01 Init + 10806 10892 87 2 0 69 98 67 0.736 4.44 2.02 Term + 18551 19231 681 1 0 -39 48 760 0.487 52.96 2.03 PlyA + 20097 20102 6 1.05 3.02 PlyA - 27369 27364 6 1.05 3.01 Sngl - 45964 45284 681 1 0 73 36 760 0.996 65.49 3.00 Prom - 59651 59612 40 -4.86 4.03 PlyA - 61224 61219 6 1.05 4.02 Term - 71345 71140 206 0 2 105 42 105 0.926 5.13 4.01 Init - 71832 71406 427 0 1 80 80 103 0.603 3.86 4.00 Prom - 75651 75612 40 -3.06 5.02 PlyA - 78207 78202 6 1.05 5.01 Sngl - 80292 79795 498 0 0 87 47 520 0.991 43.85 5.00 Prom - 81785 81746 40 -3.16 6.00 Prom + 82555 82594 40 -9.85 6.01 Init + 83010 83103 94 2 1 40 93 8 0.610 -2.91 6.02 Intr + 83221 83271 51 2 0 78 92 73 0.938 5.58 6.03 Intr + 83465 83865 401 0 2 94 94 282 0.729 23.62 6.04 Intr + 84588 85367 780 2 0 76 53 755 0.981 62.48 6.05 Intr + 85932 86222 291 2 0 114 81 271 0.982 26.33 6.06 Intr + 86328 86401 74 2 2 54 76 89 0.983 2.50 6.07 Intr + 86536 86611 76 1 1 75 86 95 0.917 7.52 6.08 Intr + 90129 90281 153 2 0 38 64 87 0.496 1.57 6.09 Intr + 92244 92325 82 2 1 95 99 7 0.902 1.71 6.10 Intr + 92434 92529 96 2 0 96 81 26 0.821 2.68 6.11 Intr + 92932 93032 101 2 2 69 98 45 0.988 3.43 6.12 Intr + 93145 93293 149 0 2 103 64 100 0.999 8.13 6.13 Intr + 93472 93567 96 1 0 98 94 94 0.999 10.12 6.14 Intr + 93777 93845 69 0 0 76 93 49 0.756 2.50 6.15 Intr + 93979 94102 124 1 1 79 94 30 0.745 3.29 6.16 Intr + 94213 94280 68 0 2 98 117 53 0.878 6.90 6.17 Intr + 94461 94572 112 0 1 103 2 27 0.759 -4.02 6.18 Intr + 94707 94880 174 2 0 51 84 80 0.730 4.04 6.19 Intr + 94971 95065 95 2 2 98 82 139 0.889 13.06 6.20 Intr + 95326 95406 81 1 0 79 94 79 0.799 6.35 6.21 Term + 95546 95834 289 2 1 123 47 73 0.904 1.55 6.22 PlyA + 96156 96161 6 1.05 7.06 PlyA - 96306 96301 6 1.05 7.05 Term - 96512 96386 127 1 1 65 45 103 0.905 1.46 7.04 Intr - 96705 96656 50 0 2 116 89 -16 0.749 -1.22 7.03 Intr - 96878 96833 46 1 1 87 81 58 0.813 3.41 7.02 Intr - 97046 96987 60 1 0 92 84 77 0.973 5.55 7.01 Init - 97404 97217 188 0 2 28 91 294 0.968 20.23 7.00 Prom - 99109 99070 40 -5.86 8.00 Prom + 99229 99268 40 -11.14 8.01 Init + 100001 100067 67 1 1 52 94 64 0.454 3.16 8.02 Intr + 103213 103334 122 2 2 123 105 163 0.884 21.71 8.03 Intr + 105534 105639 106 2 1 87 64 126 0.996 9.89 8.04 Intr + 108327 108496 170 1 2 96 89 20 0.526 2.57 8.05 Intr + 111681 111768 88 2 1 90 66 5 0.505 -1.96 8.06 Intr + 111897 111988 92 1 2 105 109 48 0.991 8.31 8.07 Intr + 112421 112581 161 1 2 126 23 162 0.563 12.29 8.08 Intr + 114888 115041 154 0 1 86 69 78 0.679 5.77 8.09 Intr + 115251 115349 99 2 0 95 67 81 0.983 7.01 8.10 Intr + 117404 118027 624 1 0 66 113 761 0.982 68.84 8.11 Intr + 118155 118299 145 2 1 75 52 195 0.991 14.46 8.12 Intr + 118511 118655 145 0 1 83 85 192 0.999 17.74 8.13 Intr + 120465 120701 237 0 0 106 61 285 0.996 24.33 8.14 Intr + 120909 121110 202 0 1 83 82 188 0.968 16.99 8.15 Intr + 121317 121495 179 2 2 42 99 218 0.921 17.02 8.16 Intr + 122189 122230 42 2 0 109 94 77 0.995 7.76 8.17 Intr + 122625 123696 1072 0 1 89 96 704 0.922 61.53 8.18 Intr + 124180 124353 174 0 0 73 63 260 0.983 22.14 8.19 Intr + 126311 126522 212 1 2 32 94 396 0.982 32.31 8.20 Intr + 127579 127669 91 1 1 81 44 176 0.999 12.50 8.21 Intr + 128913 129036 124 2 1 122 96 -62 0.086 -1.84 8.22 Term + 134766 136258 1493 1 2 121 42 779 0.869 67.33 8.23 PlyA + 137256 137261 6 1.05 9.09 PlyA - 137907 137902 6 1.05 9.08 Term - 146174 146049 126 2 0 96 44 69 0.648 1.58 9.07 Intr - 178456 178397 60 1 0 90 71 49 0.348 2.33 9.06 Intr - 182543 182258 286 1 1 66 45 175 0.394 8.34 9.05 Intr - 192243 192177 67 1 1 105 61 8 0.098 -2.24 9.04 Intr - 199227 199101 127 0 1 121 92 10 0.033 4.95 9.03 Intr - 212595 212411 185 1 2 58 35 100 0.037 1.21 9.02 Intr - 213490 213404 87 2 0 106 101 -13 0.091 1.74 9.01 Init - 219540 219471 70 0 1 52 110 48 0.446 4.62 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 44663 44649 15 2 0 144 49 13 0.833 1.24 S.002 Init - 44817 44761 57 0 0 103 81 121 0.826 12.21 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:33320265_33551903|GENSCAN_predicted_peptide_1|416_aa MATANSIIVLDDDDEDEAAAQPGPSHPLPNAASPGAEAPSSSEPHGARGSSSSGGKKCYK LENEKLFEEFLELCKMQTADHPEVVPFLYNRQQRAHSLFLASAEFCNILSRVLSRARSRP AKLYVYINELCTVLKAHSAKKKLNLAPAATTSNEPSGNNPPTHLSLDPTNAENTASQSPR TRGSRRQIQRLEQLLALYVAEIRRLQEKELDLSELDDPDSAYLQEARLKRKLIRLFGRLC ELKDCSSLTGRVIEQRIPYRGTRYPEVNRRIERLINKPGPDTFPDYGDVLRAVEKAAARH SLGLPRQQLQLMAQDAFRDVGIRLQERRHLDLIYNFGCHLTDDYRPGVDPALSDPVLARR LRENRSLAMSRLDEVISKYAMLQDKSEEGERKKRRARLQGTSSHSADTPEASLDSX >gi568815592f:33320265_33551903|GENSCAN_predicted_CDS_1|1248_bp atggccaccgctaacagcatcatcgtgctggatgatgatgacgaagatgaagcagctgct cagccagggccctcccacccactccccaatgcggcctcacctggggcagaagcccctagc tcctctgagcctcatggggccagaggaagcagtagttcgggcggcaagaaatgctacaag ctggagaatgagaagctgttcgaagagttccttgaactttgtaagatgcagacagcagac caccctgaggtggtcccattcctctataaccggcagcaacgtgcccactctctgtttttg gcctcggcggagttctgcaacatcctctctagggtcctgtctcgggcccggagccggcca gccaagctctatgtctacatcaatgagctctgcactgttctcaaggcccactcagccaaa aagaagctgaacttggcccctgccgccaccacctccaatgagccctctgggaataaccct cccacacacctctccttggaccccacaaatgctgaaaacactgcctctcagtctccaagg acccgtggttcccggcggcagatccagcgtttggagcagctgctggcgctctatgtggca gagatccggcggctgcaggaaaaggagttggatctctcagaattggatgacccagactcc gcatacctgcaggaggcacggttgaagcgtaagctgatccgcctctttgggcgactatgt gagctgaaagactgctcttcactgaccggccgtgtcatagagcagcgcatcccctaccgt ggcacccgctacccagaggttaacaggcgcattgagcggctcatcaacaagccagggcct gataccttccctgactatggggatgtgcttcgggctgtagagaaggcagctgcccgacac agccttggcctcccccgacagcagctccagctcatggctcaggatgccttccgagatgtg ggcatcaggttacaggagcgacgtcacctcgatctcatctacaactttggctgccacctc acagatgactataggccaggcgttgaccctgcactatcagatcctgtgttggcccggcgc cttcgggaaaaccggagtttggccatgagtcggctggatgaggtcatctccaaatatgca atgttgcaagacaaaagtgaggagggcgagagaaaaaagagaagagctcggctccaaggc acctcttcccactctgcagacacccccgaagcctccttggattctgnn >gi568815592f:33320265_33551903|GENSCAN_predicted_peptide_2|255_aa MARSRLTATSASRVQTILLPQPPKSLELQCPVKEHQIFEEDFRHEQKPKIKSKNDTALCS SPSLIVWRRAAHFWCLALELNNHHVEQKGKTKTTKKLPQRTTSNVFAMFDQSQIQEFKEA FNMIDQNRDGFINKEDLHDMLVSLGKNPTDAYLDAIMNEAPGPIDFTMFLTIFGEKLNGT DPEDVIGNAFACFDEEATGIIQEDYLRELLITMWDRFTDEEVDELYREAPINKKGNFNYI EFTCILKHGAKDKDD >gi568815592f:33320265_33551903|GENSCAN_predicted_CDS_2|768_bp atggcacgatctcggctcactgcaacctccgcctcccgggttcaaaccattctcctgcct cagcctcccaagtcgctggaattacagtgcccagttaaggaacaccagatatttgaggaa gacttcagacatgagcaaaaacccaaaattaaaagtaaaaacgacacagcattgtgctct tcgccttccctcatcgtctggcgcagggcagcccacttctggtgtttggcgctggaatta aacaaccaccatgtggagcaaaaaggcaagaccaagaccaccaaaaagctccctcagcgc acaacatccaacgtgtttgccatgtttgaccagtcacagattcaggagttcaaagaggcc ttcaacatgattgatcagaacagagatggtttcatcaacaaagaagatttgcatgatatg cttgtttccctagggaagaatcccaccgatgcataccttgatgccataatgaatgaggca ccagggcccatcgatttcaccatgttcctcaccatatttggtgagaagttaaatggcaca gatcctgaagatgtcattggaaatgcttttgcttgctttgatgaagaagcaacaggcatt attcaggaagattacctgagagagctgctgataaccatgtgggatcggtttacggatgag gaagtggatgagctgtacagagaagcgcctattaacaaaaaggggaatttcaattacatc gagttcacatgcatcctgaaacatggagcaaaagacaaagacgactga >gi568815592f:33320265_33551903|GENSCAN_predicted_peptide_3|226_aa MSVPLLNDAATVSGAERETAVVIFLHGLGDTGHSWADALSTIRLPHVKYICSHEPRIPVT LNMKMVMPSWFDLMGLSPDAPEDEAGIKKAAENIKALIEHEMKNGIPANQIILGGFSQGR ALSLYMALTCPHPLAGILALSCWPPLHRAFPQAANGSAKDLAILQCHGELDPMVPVRFGA LMAEKLRSVVTPARVQFQTYLGVMHSSCPQEMAAVKEFLEKLLPPV >gi568815592f:33320265_33551903|GENSCAN_predicted_CDS_3|681_bp atgtctgtgcccctgctcaacgatgctgccaccgtgtctggagctgagcgggaaacggcc gtggttatttttttacatggacttggagacacagggcacagctgggctgacgccctctcc accattcggctccctcacgtcaagtacatctgttcccatgagcctaggatccctgtgacc ctcaacatgaagatggtgatgccctcctggtttgacctgatggggctgagtccagatgcc ccagaggacgaggctggcatcaagaaggcagcagagaacatcaaggccttgattgagcat gaaatgaagaacgggatccctgccaatcaaatcatcctgggaggcttttcacagggccgg gccctgtccctctacatggccctcacctgcccccaccctctggctggcatcctggctttg agctgctggccgcctctgcaccgggccttcccccaggcagctaatggcagtgccaaggac ctggccatcctccagtgccatggggagctggaccccatggtgcccgtacggtttggggcc ctgatggctgagaagctccggtctgttgtcacacctgccagggtccagttccagacatac ctgggtgtcatgcacagctcctgtcctcaggagatggcagctgtgaaggaatttcttgag aagctgctgcctcctgtctaa >gi568815592f:33320265_33551903|GENSCAN_predicted_peptide_4|210_aa MTKADREPGARQHRSGQPSSPSPGQDTCAAPTHLRIHVHLGPTRAGGMQWKRTGCRGSWD SRSLARSRSRVENSHGSGRYQPRATPATFEFQRPPPTLTALCTQGPIVAVATAEGQSQRS PPPEATPRPTGALSFLFRRRRRAAIVLDFRSAPRRRSLRQKMRGRRRTNATGQAKLVAAG GEGAGTLEERNDACANEDPCRETGGTSASW >gi568815592f:33320265_33551903|GENSCAN_predicted_CDS_4|633_bp atgacaaaggccgaccgggagccgggggcgcgacagcatcggagcggtcagccttcgtcc ccatccccagggcaggacacctgcgccgcccctactcacctgcggatccatgtccacctc ggtcccacacgcgccgggggaatgcagtggaagagaactgggtgccggggatcctgggac tcgcgttctctcgcccgctcgcgaagcagggtagagaactcgcacggctccggccgctac cagccccgcgccacacccgccacttttgaattccaacggccaccacccactctcaccgcg ctctgcacgcagggaccaatcgtcgctgtcgccacagccgagggccaatcgcagcgttct ccgccacccgaagccacaccccgcccgacaggcgccttgtcttttctgtttcgcaggcgc aggagagcggcaatagtgctggacttccgctcggctccccgccgtcgctcgctacgtcag aaaatgcgtggacgtcgccgcacgaacgcaactggccaagcgaaactggtggcggccgga ggagaaggggcggggacgctggaggaaagaaatgacgcgtgcgcaaacgaggacccgtgc cgggagacaggcgggactagcgcctcctggtga >gi568815592f:33320265_33551903|GENSCAN_predicted_peptide_5|165_aa MPPKFDPNEIKVVYLRCTRGEVGATSALAPKIGPLGLSPKKVGDDIAKATGDWKGLRITV KLTIENRQAQIEVVPSASAPIIKALKKPPRDRKKQKNIKHNGNITFDEIVNIARQMRHRS LARELSGTIKEIPGTAQSMGCNVDGHHPHDIIDDINSGAVECPAS >gi568815592f:33320265_33551903|GENSCAN_predicted_CDS_5|498_bp atgccaccgaagttcgaccccaacgagatcaaggtcgtatacctgaggtgcaccagaggt gaagtcggtgccacttctgccctggcccccaagatcggccccctgggtctgtctccaaaa aaggttggtgatgacattgccaaggcaacgggtgactggaagggcctgaggattacagtg aaactgaccattgagaacagacaggcccagattgaggtggtgccttctgcctctgccccg atcatcaaagccctcaagaaaccaccaagagacagaaagaaacagaaaaacattaaacac aatgggaatatcacttttgatgagatcgtcaacattgctcgacagatgcggcaccgatcc ttagccagagaactctctggaaccattaaagagattccggggactgcccagtctatgggc tgtaatgttgatggccaccaccctcatgacatcatagatgacatcaacagtggtgctgtg gaatgcccagctagttaa >gi568815592f:33320265_33551903|GENSCAN_predicted_peptide_6|1151_aa MIILPLLSPISWAAQKVSKKTGPRCSTAIATGLKNQKPVPAVPVQKSGTSGVPPMAGGKK PSKRPAWDLKGQLCDLNAELKRCRERTQTLDQENQQLQDQLRDAQQQVKALGTERTTLEG HLAKVQAQAEQGQQELKNLRACVLELEERLSTQEGLVQELQKKQVELQEERRGLMSQLEE KERRLQTSEAALSSSQAEVASLRQETVAQAALLTEREERLHGLEMERRRLHNQLQELKGN IRVFCRVRPVLPGEPTPPPGLLLFPSGPGGPSDPPTRLSLSRSDERRGTLSGAPAPPTRH DFSFDRVFPPGSGQDEVFEEIAMLVQSALDGYPVCIFAYGQTGSGKTFTMEGGPGGDPQL EGLIPRALRHLFSVAQELSGQGWTYSFVASYVEIYNETVRDLLATGTRKGQGGECEIRRA GPGSEELTVTNARYVPVSCEKEVDALLHLARQNRAVARTAQNERSSRSHSVFQLQISGEH SSRGLQCGAPLSLVDLAGSERLDPGLALGPGERERLRETQAINSSLSTLGLVIMALSNKE SHVPYRNSKLTYLLQNSLGGSAKMLMFVNISPLEENVSESLNSLRFASKPWKHGAGRPRE AARTPADSLAPTADGRRPVKEGLSPPRCRAQEGTRVRKRAVDSAREVCLVQFEDDSQFLV LWKDISPAALPGEELLCCVCRSETVVPGNRLVSCEKCRHAYHQDCHVPRAPAPGEGEGTS WVCRQCVFAIATKRGGALKKGPYARAMLGMKLSLPYGLKGLDWDAGHLSNRQQSYCYCGG PGEWNLKMLQCRSCLQWFHEACTQCLSKPLLYGDRFYEFECCVCRGGPEKVRRLQLRWVD VAHLVLYHLSVCCKKKYFDFDREILPFTSENWDSLLLGELSDTPKGERSSRLLSALNSHK DRFISGREIKKRKCLFGLHARMPPPVEPPTGDGALTRSLGPGGGVSRPLGKRRRPEPEPL RRRQKGKVEELGPPSAVRNQPEPQEQRERAHLQRALQASVSPPSPSPNQSYQGSSGYNFR PTDARCLPSSPIRMFASFHPSASTAGTSGDSGPPDRSPLELHIGFPTDIPKSAPHSMTAS SSSVSSPSPGLPRRSAPPSPLCRSLSPGTGGGVRGGVGYLSRGDPVRVLARRVRPDGSVQ YLVEWGGGGIF >gi568815592f:33320265_33551903|GENSCAN_predicted_CDS_6|3456_bp atgatcattctacctttgctctctcccatctcctgggcagctcaaaaagtttccaagaag acaggaccccggtgttccacagctattgccacagggttgaagaaccagaagccagttcct gctgttcctgtccagaagtctggcacatcaggtgttcctcccatggcaggagggaagaaa cccagcaaacgtccagcctgggacttaaagggtcagttatgtgacctaaatgcagaacta aaacggtgccgtgagaggactcaaacgttggaccaagagaaccagcagcttcaggaccag ctcagagatgcccagcagcaggtcaaggccctggggacagagcgcacaacactggagggg catttagccaaggtacaggcccaggctgagcagggccaacaggagctgaagaacttgcgt gcttgtgtcctggagctggaagagcggctgagcacgcaggagggcttggtgcaagagctt cagaaaaaacaggtggaattgcaggaagaacggaggggactgatgtcccaactagaggag aaggagaggaggctgcagacatcagaagcagccctgtcaagcagccaagcagaggtggca tctctgcggcaggagactgtggcccaggcagccttactgactgagcgggaagaacgtctt catgggctagaaatggagcgccggcgactgcacaaccagctgcaggaactcaagggcaac atccgtgtattctgccgggtccgccctgtcctgccgggggagcccactccaccccctggc ctcctcctgtttccctctggccctggtgggccctctgatcctccaacccgccttagcctc tcccggtctgacgagcggcgtgggaccctgagtggggcaccagctcccccaactcgccat gatttttcctttgaccgggtattcccaccaggaagtggacaggatgaagtgtttgaagag attgccatgcttgtccagtcagccctggatggctatccagtatgcatctttgcctatggc cagacaggcagtggcaagaccttcacaatggagggtgggcctgggggagacccccagttg gaggggctgatccctcgggccctgcggcacctcttctctgtggctcaggagctgagtggt cagggctggacctacagctttgtagcaagctacgtagagatctacaatgagactgtccgg gacctgctggccactggaacccggaagggtcaagggggcgagtgtgagattcgccgtgca gggccagggagtgaggagctcactgtcaccaatgctcgatatgtccctgtctcctgtgag aaagaagtggacgccctgcttcatctggcccgccagaatcgggctgtggcccgcacagcc cagaatgaacggtcatcacgcagccacagtgtattccagctacagatttctggggagcac tccagccgaggcctgcagtgtggggcccccctcagtcttgtggacctggccgggagtgag cgacttgaccccggcttagccctcggccccggggagcgggaacgccttcgggaaacacag gccattaacagcagcctgtccacgctggggctggttatcatggccctgagcaacaaggag tcccacgtgccttaccggaacagcaaactgacctacctgctgcagaactctctgggtggt agtgctaagatgctcatgtttgtgaacatttctccactggaagagaacgtctccgagtcc ctcaactctctacgctttgcctccaagccctggaagcacggggcgggacgtccacgggaa gcggcgcgcacgcccgccgactccctcgcgccaaccgccgacggccgccgcccggtgaag gaggggctcagtcctcccaggtgccgcgcgcaggaggggacacgcgtgcgcaaaagggcg gtggacagtgctagggaggtgtgtctggtccagtttgaggatgattcgcagtttctggtt ctatggaaagacattagccctgctgccctccctggagaggaactcctctgttgtgtctgt cgctctgagactgtggtccctgggaaccggctggtcagctgtgagaagtgtcgccatgct tatcaccaggactgccatgttcccagggctccagcccctggagagggagagggcacatcc tgggtatgccgccagtgtgtctttgcgatcgccaccaagaggggaggtgccctgaagaag ggcccctatgcccgggccatgctgggtatgaagctttctctgccatatggactgaagggg ctggactgggatgctggacatctgagcaaccgacagcagagttactgttactgtggtggc cctggggagtggaacctgaaaatgctgcagtgccggagctgcctgcagtggttccatgag gcctgcacccagtgtctgagcaagcccctcctctatggggacaggttctatgaatttgaa tgctgtgtgtgtcgcgggggccctgagaaagtccggagactacagcttcgctgggtggat gtggcccatcttgtcctgtatcacctcagtgtttgctgtaagaagaaatactttgatttt gatcgtgagatcctccccttcacttctgagaattgggacagtttgctcctgggggagctt tcagacacccccaaaggagaacgttcttccaggctcctctctgctcttaacagccacaag gaccgtttcatttcagggagagagattaagaagaggaaatgtttgtttggtctccatgct cggatgcctccccctgtggagccccctactggagatggagcactcaccaggtcactgggc cctgggggaggggtctcacgtcccctggggaagcgccggaggccggagccagagcccctg aggaggaggcagaaggggaaagtggaggagctggggccaccctcagcagtgcgcaatcag cccgagccccaggagcagagggagcgggctcatctgcagagggcactgcaggcctcagtg tctccaccatcccccagccctaaccagagttaccagggcagcagcggctacaacttccgg cccacagatgcccgctgcctgcccagcagccccatccggatgtttgcttccttccaccct tctgccagcaccgcagggacctctggggacagtggacccccagacaggtcacccctggaa cttcacattggtttccccacagacatccctaaaagtgccccccactcgatgactgcctca tcttcctcagtttcatccccatccccaggtcttcctagacgctcagcacccccttctccc ctgtgccgtagtttgtctcctgggactgggggaggagtccgaggtggggttggttacctg tcccgaggggaccctgtccgggtccttgctcggagagtacggcctgatggctctgtgcag tacctggttgagtggggaggagggggcatcttctga >gi568815592f:33320265_33551903|GENSCAN_predicted_peptide_7|156_aa MPALLPVASRLLLLPRVLLTMASGSPPTQPSPASDSGSGYVPGSVSAAFVTCPNEKVAKE IARAVVEKRLAACVNLIPQITSIYEWKGKIEEDSEVLMMIKTQSSLVPALTDFVRSVHPY EVAEVIALPVEQGNFPYLQWVRQVTESVSDSITVLP >gi568815592f:33320265_33551903|GENSCAN_predicted_CDS_7|471_bp atgccggcgctgctgcctgtggcctcccgccttttgttgctaccccgagtcttgctgacc atggcctctggaagccctccgacccagccctcgccggcctcggattccggctctggctac gttccgggctcggtctctgcagcctttgttacttgccccaacgagaaggtcgccaaggag atcgccagggccgtggtggagaagcgcctagcagcctgcgtcaacctcatccctcagatt acatccatctatgagtggaaagggaagatcgaggaagacagtgaggtgctgatgatgatt aaaacccaaagttccttggtcccagctttgacagattttgttcgttctgtgcacccttac gaagtggccgaggtaattgcattgcctgtggaacaggggaactttccgtacctgcagtgg gtgcgccaggtcacagagtcagtttctgactctatcacagtcctgccatga >gi568815592f:33320265_33551903|GENSCAN_predicted_peptide_8|1932_aa MSRSRASIHRGSIPAMSYAPFRDVRGPSMHRTQYVHSPYDRPGWNPRFCIISGNQLLMLD EDEIHPLLIRDRRSESSRNKLLRRTVSVPVEGRPHGEHGGCWASHGLVGQSAALLVHVTS DPEHAYPWLGLTSCQASVCGNVWPYEQDFLCPWWQGAPAQVPCPLLPAASLSAVAALPAA FRGVEYHLGRSRRKSVPGGKQYSMEGAPAAPFRPSQGFLSRRLKSSIKRTKSQPKLDRTS SFRQILPRFRSADHDRYRGWSMWDEIDVMARLMQSFKESHSHESLLSPSSAAEALELNLD EDSIIKPVHSSILGQEFCFEVTTSSGTKCFACRSAAERDKWIENLQRAVKPNKDNSRRVD NVLKLWIIEARELPPKKRYYCELCLDDMLYARTTSKPRSASGDTVFWGEHFEFNNLPAVR ALRLHLYRDSDKKRKKDKAGYVGLVTVPVATLAGRHFTEQWYPVTLPTGSGGSGGMGSGG GGGSGGGSGGKGKGGCPAVRLKARYQTMSILPMELYKEFAEYVTNHYRMLCAVLEPALNV KGKEEVASALVHILQSTGKAKDFLSDMAMSEVDRFMEREHLIFRENTLATKAIEEYMRLI GQKYLKDAIGEFIRALYESEENCEVDPIKCTASSLAEHQANLRMCCELALCKVVNSHCVF PRELKEVFASWRLRCAERGREDIADRLISASLFLRFLCPAIMSPSLFGLMQEYPDEQTSR TLTLIAKVIQNLANFSKFTSKEDFLGFMNEFLELEWGSMQQFLYEISNLDTLTNSSSFEG YIDLGRELSTLHALLWEVLPQLSKEALLKLGPLPRLLNDISTALRNPNIQRQPSRQSERP RPQPVVLRGPSAEMQGYMMRDLNSSIDLQSFMARGLNSSMDMARLPSPTKEKPPPPPPGG GKDLFYVSRPPLARSSPAYCTSSSDITEPEQKMLSVNKSVSMLDLQGDGPGGRLNSSSVS NLAAVGDLLHSSQASLTAALGLRPAPAGRLSQGSGSSITAAGMRLSQMGVTTDGVPAQQL RIPLSFQNPLFHMAADGPGPPGGHGGGGGHGPPSSHHHHHHHHHHRGGEPPGDTFAPFHG YSKSEDLSSGVPKPPAASILHSHSYSDEFGPSGTDFTRRQLSLQDNLQHMLSPPQITIGP QRPAPSGPGGGSGGGSGGGGGGQPPPLQRGKSQQLTVSAAQKPRPSSGNLLQSPEPSYGP ARPRQQSLSKEGSIGGSGGSGGGGGGGLKPSITKQHSQTPSTLNPTMPASERTVAWVSNM PHLSADIESAHIEREEYKLKEYSKSMDESRLDRVKEYEEEIHSLKERLHMSNRKLEEYER RLLSQEEQTSKILMQYQARLEQSEKRLRQQQAEKDSQIKSIIGRLMLVEEELRRDHPAMA EPLPEPKKRLLDAQCMSVTPHFTRASLGAGGGFVNGVEAMMGWRILAIGAVLTAAAFIPR GVYPQALLLFPILVTFEEAMETPTPLPPVPASPTCNPAPRTIQIEFPQHSSSLLESLNRH RLEGKFCDVSLLVQGRELRAHKAVLAAASPYFHDKLLLGDAPRLTLPSVIEADAFEGLLQ LIYSGRLRLPLDALPAHLLVASGLQMWQVVDQCSEILRELETSGGGISARGGNSYHALLS TTSSTGGWCIRSSPFQTPVQSSASTESPASTESPVGGEGSELGEVLQIQVEEEEEEEEDD DDEDQGSATLSQTPQPQRVSGVFPRPHGPHPLPMTATPRKLPEGESAPLELPAPPALPPK IFYIKQEPFEPKEEISGSGTQPGGAKEETKVFSGGDTEGNGELGFLLPSGPGPTSGGGGP SWKPVDLHGNEILSGGGGPGGAGQAVHGPVKLGGTPPADGKRFGCLCGKRFAVKPKRDRH IMLTFSLRPFGCGICNKRFKLKHHLTEHMKTHAGALHACPHCGRRFRVHACFLRHRDLCK GQGWATAHWTYK >gi568815592f:33320265_33551903|GENSCAN_predicted_CDS_8|5799_bp atgagcaggtctcgagcctccatccatcgggggagcatccccgcgatgtcctatgccccc ttcagagatgtacggggaccctctatgcaccgaacccaatacgttcattccccgtatgat cgtcctggttggaaccctcggttctgcatcatctcggggaaccagctgctcatgctggat gaggatgagatacaccccctactgatccgggaccggaggagcgagtccagtcgcaacaaa ctgctgagacgcacagtctccgtgccggtggaggggcggccccacggcgagcatgggggc tgttgggccagccacgggcttgtggggcagtccgcggccttgctggtccatgtcacctct gaccctgagcatgcatatccctggctggggctcacctcctgtcaggcttctgtgtgtggg aatgtgtggccctatgagcaggattttctgtgtccctggtggcagggggctcctgctcag gttccttgccccctccttcccgctgccagcctctccgccgtcgctgctcttcctgctgct ttccggggggtagaataccacttgggtcgctcgaggaggaagagtgtcccaggggggaag cagtacagcatggagggtgcccctgctgcgcccttccggccctcgcaaggcttcctgagc cgacggctaaaaagctccatcaaacgaacgaagtcacaacccaaacttgaccggaccagc agctttcgccagatcctgcctcgcttccgaagtgctgaccatgaccggtacaggggctgg agcatgtgggatgagattgatgtaatggcccggctgatgcaaagctttaaggagtcacac tctcatgagtccttgctgagtcctagcagtgcagctgaggcattggagctcaacttggat gaagattccattatcaagccagtgcacagctccatcctgggccaggagttctgttttgag gtaacaacttcatcaggaacaaaatgctttgcctgtcggtctgcggccgaaagagacaaa tggattgagaatctgcagcgggcagtaaagcccaacaaggacaacagccgccgggtagac aatgtgctaaagctgtggatcatagaggcccgggagctgccccccaagaagcggtactac tgtgagctctgcctggatgacatgctgtatgcacgcaccacctccaagccccgctctgcc tctggggacaccgtcttctggggcgagcacttcgagtttaacaacctgccggctgtccgt gccctgcggctgcatctgtaccgtgactcagacaaaaagcgcaagaaggacaaggcaggc tatgtcggcctggtgactgtgccagtggccaccctggctgggcgccacttcacagagcag tggtaccctgtaaccctgccaacaggcagtgggggatctgggggcatgggttcgggaggg ggagggggctcggggggtggctcagggggcaagggcaaaggaggttgcccggctgtgcgg ctgaaagcacgttaccagacaatgagcatcttgcccatggagctatataaagagtttgca gagtatgtcaccaaccattatcggatgctgtgtgcagtcttggagcccgccctgaatgtc aaaggcaaggaggaggttgccagtgcactagttcacatcctgcagagtacaggcaaggcc aaggacttcctttcagacatggccatgtctgaggtagaccggttcatggaacgggagcac ctcatattccgcgagaacacgcttgccactaaagccatagaagagtatatgagactgatt ggtcagaaatacctcaaggatgccattggagaattcatccgtgctctgtatgaatctgag gaaaactgcgaggtagaccctatcaagtgcacagcatccagtttggcagagcaccaggcc aacctgcgaatgtgctgtgagttggccctgtgcaaggtggtcaactcccactgcgtgttc ccgagggagctgaaggaggtgtttgcttcgtggcggctgcgctgcgcagagcgaggccgg gaggacatcgcagacaggcttatcagcgcctcactcttcctgcgcttcctctgcccagcg attatgtcgcccagtctctttgggcttatgcaggagtacccagatgagcagacctcacga accctcaccctcattgccaaggtcatccagaacctggccaacttttccaagtttacctca aaggaggactttctgggcttcatgaatgagtttctggagctggaatggggttccatgcag cagtttttgtatgagatctccaatctggacacgctaaccaacagcagtagctttgagggt tacatcgacttgggccgagagctctccacactgcatgccctactctgggaggtgctgccc cagctcagcaaggaagccctcctgaagctgggtccactgccccggctcctcaacgacatc agcacagctctgaggaaccccaacatccaaaggcagccaagccgccagagtgagcggccc cggcctcagcctgtggtactgcgggggccatcggctgagatgcagggctacatgatgcgg gacctcaacagctccatcgaccttcagtccttcatggctcgaggcctcaacagctctatg gacatggctcgcctcccctccccaaccaaggaaaagccacccccaccaccgcctggtggt ggtaaagacctgttctatgtaagccgtccacccctggcccgttcctcaccagcatactgc acgagcagctcggacatcacagagccagagcagaagatgctgagtgtcaacaagagtgtg tccatgctggacttacagggtgatgggcctggtggccgcctcaacagcagcagtgtttcg aacctggcggccgtaggggacctgctgcactcaagccaggcctcgctgacagcagccttg gggctacggcctgcgcctgccggacgcctctcccaggggagtggctcatccatcacggcg gctggcatgcgcctcagccagatgggtgtcaccacagacggtgtccctgcccagcaactg cgaatccccctctccttccagaaccctctcttccacatggctgctgatgggccaggtccc ccaggcggccatggagggggcggtggccatggcccaccttcctcccatcaccaccaccac caccatcaccaccaccgaggtggagagccccctggggacacctttgccccattccatggc tatagcaagagtgaggacctctcttccggggtccccaagccccctgctgcctccatcctt catagccacagctacagtgatgagtttggaccctctggcactgacttcacccgtcggcag ctttcactccaggacaacctgcagcacatgctgtcccctccccagatcaccattggtccc cagaggccagccccctcagggcctggaggtgggagcggtgggggcagcggtgggggtggc gggggccagccgcctccattgcagaggggcaagtctcagcagttgacagtcagcgcagcc cagaaaccccggccatccagcgggaatctattgcagtccccagagccaagttatggcccc gcccgtccacggcaacagagcctcagcaaggagggcagcattgggggcagcgggggcagc ggtggcggagggggtggggggctgaagccctccatcaccaagcagcattctcagacacca tccacattgaaccccacaatgccagcctctgagcggacagtggcctgggtctccaacatg cctcacctgtcggctgacatcgagagtgcccacatcgagcgggaagagtacaagctcaag gagtactcaaaatcgatggatgagagccggctggatagggtgaaggagtacgaggaggag attcactcactgaaagagcggctgcacatgtccaaccggaagctggaagagtatgagcgg aggctgctgtcccaggaagaacaaaccagcaaaatcctgatgcagtatcaggcccgactg gagcagagtgagaagaggctaaggcagcagcaggcagagaaggattcccagatcaagagc atcattggcaggctgatgctggtggaggaggagctgcgccgggaccaccccgccatggct gagccgctgccagaacccaagaagaggctgctcgacgctcagtgtatgtctgtcaccccc catttcaccagagcgtccttaggggctgggggtgggtttgttaatggggtggaggcaatg atgggttggaggatcttggctataggggctgtgctgactgcagcagccttcatcccgcgt ggagtctacccccaagcccttctcctcttcccaattcttgtcaccttcgaggaggccatg gaaaccccaacacctttgccgcctgtacccgcctccccgacctgcaacccagccccacgg acaatccagatcgagttcccacagcatagctcgtctctgctggaatctctgaaccgccac aggctagagggaaagttctgtgatgtgtccctcctggtgcagggccgggaacttagggct cataaagcagtgttagctgctgcctctccttacttccatgacaagctgcttctgggggat gcgcctcgtctcactctaccgagtgtcattgaagccgatgccttcgaggggctgctccag ctcatttattcagggcgtctccgcctgccactggatgctcttcctgctcatctccttgtg gccagtggccttcaaatgtggcaggtagtagatcagtgctcagaaattcttagagaatta gaaacttcaggtggtggaatttcagcccgtggaggaaactcctaccatgcccttctttcc actacatcctctacaggaggctggtgcattcgctcttcgcctttccagaccccagtacag tcctctgcttctactgaaagccctgcttccactgagagccctgtgggaggggagggaagt gaactgggagaagtgctgcaaattcaggtggaagaagaagaggaggaggaggaagatgat gatgatgaggaccaggggtcagccacactctctcagactcctcagccccagagagtatca ggggtttttccccgtcctcatggaccccacccactgcccatgactgctactccccgaaag cttccagagggtgagagtgcaccacttgagcttcctgcccctcctgcactgccccccaaa atcttctacattaagcaggaacccttcgagcctaaggaggagatatcaggaagcggaact cagcctggaggagcaaaggaggaaaccaaagtgttttctggaggggacactgaagggaat ggggagctagggttcttgttgccttcagggccagggccaacatctgggggagggggtcca tcctggaaaccagtggatcttcatgggaatgaaatcctgtcagggggtggaggacctggg ggagcaggccaggccgtgcatgggcctgtgaagctaggggggacaccccctgcagatgga aaacgctttggttgcctgtgtgggaagcggtttgcagtgaagccaaagcgtgaccggcac atcatgctgaccttcagccttcggccttttggctgtggcatctgcaacaagcgcttcaag ctgaagcaccatctgacagagcacatgaagacccatgctggagccctgcatgcctgtccc cactgtggccgtcggttccgagtccatgcctgttttctccgccaccgggacctatgcaag ggccagggctgggccactgcccactggacttacaagtga >gi568815592f:33320265_33551903|GENSCAN_predicted_peptide_9|335_aa MNGKGLKESLPAAQIDFARGITTGGTTGTCHCVRSWWVLGLTDLKNEAADPHDSGAQLAS PSGSRTGAAGGAACQSCAVRSHSSALGWSMGLAALEQGVVLVREARAAQEPMEWRYEQNF IFSNLNQSSCIGMDGLRERSWYQICRQLGTEVFTCMAHNTAGDGEGPTLGCSPGVTEDEA SGAGRPLRVWGHTHLELALARDRLPRLSLHIFLQAEGAGSDLGQPREGLPQCGGGLKGSS STVRVGVEAEQTPRASEGCQHDVTSHQHFGKPRQVHREAKGEEDEEDLYLVLEQAQHHME ATKAWGFHPLKPQPKLYIGPFQSWLELLGHRTPGP >gi568815592f:33320265_33551903|GENSCAN_predicted_CDS_9|1008_bp atgaatggaaaaggcttgaaggaaagcctgccagccgctcagattgactttgcaaggggg atcaccacaggtgggactacaggcacatgccactgtgtccggagttggtgggttcttggt ctgactgacttgaagaacgaagccgcggaccctcatgactcaggagcccagctggcttca cccagtggatcccgcactggggctgcaggtggagctgcctgccagtcctgcgccgtgcgc tcgcactcctcagcccttgggtggtcaatgggactggccgccctggagcagggggtggtg ctcgtccgggaggctcgggcggcacaggagcccatggagtggcgttatgaacagaacttt attttttccaacctaaaccaatcatcctgcattgggatggatggattgagggaaagatca tggtaccagatctgcaggcagctggggacagaagtcttcacttgcatggcccataacact gcgggggatggagaagggccgaccctcggatgctcacctggagtaacggaggatgaggcc agtggcgctggccggccactccgagtgtggggccacacccacctggaactcgcgctggcc cgtgatcgtctcccacgcctctccctccacatcttcctgcaagcagagggagccggctcc gacctcggccagcccagagaggggctcccacagtgcggcggcgggctgaagggctcttca agcacggtcagagtgggcgtcgaggccgagcagacaccaagagcgagcgagggctgccag cacgatgtcacctctcaccagcactttgggaagccaaggcaggtacacagagaagcaaag ggtgaagaagatgaagaggacctttatttagtgttagaacaggctcaacaccacatggaa gctaccaaggcttggggcttccaccctctgaagccacagcccaagctgtacattggcccc tttcagtcatggctggagttgctgggacacaggacaccaggcccctag