GENSCAN 1.0 Date run: 16-Jul-119 Time: 15:55:07 Sequence gi568815583r:77014159_77281926 : 267768 bp : 41.78% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1703 1837 135 2 0 42 44 135 0.346 3.26 1.02 Intr + 2381 2465 85 2 1 64 41 89 0.388 0.70 1.03 Intr + 2656 2814 159 0 0 101 25 59 0.304 0.16 1.04 Intr + 3990 4090 101 2 2 106 83 158 0.898 15.09 1.05 Intr + 4299 4373 75 0 0 110 71 79 0.730 6.11 1.06 Intr + 7619 7752 134 2 2 66 110 32 0.330 2.57 1.07 Intr + 8866 8990 125 2 2 -5 94 104 0.346 1.08 1.08 Intr + 9412 9536 125 0 2 63 100 118 0.801 8.96 1.09 Intr + 11126 11160 35 2 2 134 94 15 0.964 3.65 1.10 Intr + 11340 11446 107 1 2 72 100 102 0.874 8.81 1.11 Intr + 12535 12804 270 0 0 34 14 172 0.306 1.22 1.12 Intr + 13342 13464 123 0 0 25 52 162 0.863 6.16 1.13 Intr + 13694 13756 63 1 0 90 83 102 0.606 7.90 1.14 Intr + 14396 14516 121 1 1 63 17 253 0.494 14.85 1.15 Intr + 14712 14846 135 1 0 64 98 121 0.641 10.32 1.16 Intr + 16128 16423 296 1 2 13 105 248 0.778 14.80 1.17 Intr + 17022 17146 125 2 2 135 74 161 0.620 17.86 1.18 Intr + 17376 17569 194 0 2 91 55 169 0.568 12.11 1.19 Intr + 17947 18053 107 2 2 10 80 38 0.706 -5.79 1.20 Intr + 18140 18236 97 1 1 92 93 215 0.603 21.06 1.21 Intr + 18704 18794 91 0 1 74 78 94 0.530 5.13 1.22 Intr + 21350 21405 56 2 2 145 63 39 0.483 4.80 1.23 Intr + 21644 21777 134 0 2 39 94 149 0.351 10.04 1.24 Intr + 21848 21947 100 1 1 83 44 30 0.342 -3.14 1.25 Term + 22849 23018 170 2 2 64 55 226 0.377 13.96 1.26 PlyA + 23151 23156 6 1.05 2.12 PlyA - 28750 28745 6 1.05 2.11 Term - 29117 28822 296 0 2 13 43 212 0.821 3.68 2.10 Intr - 29690 29527 164 1 2 50 89 77 0.738 2.70 2.09 Intr - 30585 30371 215 0 2 93 55 102 0.675 3.99 2.08 Intr - 32465 32251 215 0 2 57 31 220 0.717 10.71 2.07 Intr - 37782 37749 34 0 1 128 39 -17 0.510 -5.72 2.06 Intr - 38310 38227 84 0 0 77 115 98 0.994 10.50 2.05 Intr - 38771 38619 153 2 0 94 105 40 0.980 5.65 2.04 Intr - 40121 40020 102 2 0 71 99 100 0.998 8.85 2.03 Intr - 41705 41631 75 2 0 60 95 40 0.618 0.69 2.02 Intr - 42097 41906 192 1 0 79 107 281 0.999 27.87 2.01 Init - 56796 56734 63 0 0 109 80 48 0.008 5.74 2.00 Prom - 62103 62064 40 -4.55 3.00 Prom + 65069 65108 40 -4.95 3.01 Init + 67814 67968 155 1 2 80 41 63 0.158 0.63 3.02 Intr + 69691 69862 172 1 1 44 64 165 0.256 8.72 3.03 Term + 73182 73304 123 2 0 82 52 107 0.274 3.90 3.04 PlyA + 73818 73823 6 1.05 4.08 PlyA - 74004 73999 6 1.05 4.07 Term - 80831 80655 177 2 0 27 49 146 0.034 1.30 4.06 Intr - 82829 82765 65 0 2 89 75 56 0.075 1.92 4.05 Intr - 84722 84634 89 1 2 98 23 86 0.054 1.80 4.04 Intr - 92067 91906 162 2 0 64 94 111 0.530 7.47 4.03 Intr - 93909 93725 185 0 2 85 41 22 0.300 -5.14 4.02 Intr - 95517 95300 218 1 2 16 84 176 0.431 7.30 4.01 Init - 97925 97862 64 2 1 70 81 107 0.987 7.60 4.00 Prom - 99120 99081 40 -8.65 5.03 PlyA - 99190 99185 6 -3.94 5.02 Term - 101161 99998 1164 1 0 90 35 1072 0.947 92.95 5.01 Init - 119575 118847 729 1 0 82 116 729 0.961 69.93 5.00 Prom - 122198 122159 40 -6.35 6.03 PlyA - 124267 124262 6 1.05 6.02 Term - 125126 124993 134 0 2 34 47 146 0.140 2.37 6.01 Init - 131028 130944 85 0 1 72 106 32 0.181 4.43 6.00 Prom - 136839 136800 40 -3.65 7.02 PlyA - 137600 137595 6 1.05 7.01 Sngl - 139155 138514 642 0 0 55 51 371 0.961 26.03 7.00 Prom - 140332 140293 40 -4.75 8.05 PlyA - 140570 140565 6 1.05 8.04 Term - 141381 141019 363 0 0 61 47 421 0.921 28.88 8.03 Intr - 143281 143193 89 2 2 50 105 -10 0.911 -4.33 8.02 Intr - 144538 144345 194 0 2 42 97 165 0.752 11.01 8.01 Init - 167768 164632 3137 2 2 44 91 2800 0.472 267.16 8.00 Prom - 179133 179094 40 -2.35 9.00 Prom + 184370 184409 40 -5.15 9.01 Init + 185542 185687 146 0 2 64 50 244 0.088 17.84 9.02 Intr + 214771 214976 206 1 2 91 81 84 0.020 5.92 9.03 Intr + 222606 222677 72 1 0 96 73 23 0.006 0.06 9.04 Term + 238514 238611 98 0 2 74 50 67 0.017 -1.35 9.05 PlyA + 239409 239414 6 1.05 10.00 Prom + 242494 242533 40 -7.15 10.01 Sngl + 247007 247399 393 1 0 88 43 460 0.983 37.49 10.02 PlyA + 247408 247413 6 1.05 11.00 Prom + 248406 248445 40 -6.15 11.01 Sngl + 248493 249125 633 2 0 49 42 187 0.360 6.13 11.02 PlyA + 249207 249212 6 1.05 12.00 Prom + 250265 250304 40 -5.55 12.01 Init + 250325 250525 201 1 0 60 86 132 0.616 9.12 12.02 Term + 256699 257061 363 0 0 55 40 192 0.198 4.68 12.03 PlyA + 257104 257109 6 -1.95 13.00 Prom + 257300 257339 40 -3.65 13.01 Sngl + 264307 264579 273 0 0 98 48 392 0.771 31.28 13.02 PlyA + 265658 265663 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 87087 87196 110 1 2 100 37 103 0.862 4.09 S.002 Sngl + 185542 185715 174 0 0 64 49 270 0.882 15.34 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:77014159_77281926|GENSCAN_predicted_peptide_1|1054_aa XPCQQPKALSGGLGGILEPAASGDLSTAVTACRASRCFLTPPEWPALDTLLVALWTAEPG PGPPQEFLGMLMEGRALLDFDSIPVPLTPLRWHLRGDMGQGKEYQALPESCVCDPGVCVT LETVMCVCRDFTAHTGYEVLLQRLLDGRKMCKDMEELLRQRAQAEERYGKELVQIARKAG GQTEINLWAFAPAVALLLPCPVPPIPQGLPEMSPPSESCHLHSPIAHPASVLYVQAGSQL SGGGWEKGQQLPEPAIPYSTPQPETMNRGGPQLVDSGVFAKRGALEEERFAGQVLSPVLD TLNLRYLSDMRPKSSLRASFDSLKQQMENVGSSHIQLALTLREELRSLEEFRERQKEQRK KLGFQDSKCGVLSRQGVPRSQPRLEVPEQAPSLSWDVPAEPAENLPGRRDPGPAILGPPV ARGAALAAKAVLDRAFFLQVARAPQAERCVVSVHTRVSDCDACRGSGMQDERLCAHVCWG GNSPAGDEALAKYEAVMDRVQKSKLSLYKKAMESKKTYEQKCRDADDAEQAFERISANGH QKQVEKVRWAAGPCGTQGPIPRCRPTEAHRVMRADANPHTTLEKGSVGPCHREETEAHTL RQLRAQLGSSSSTSVGGPDGHSDEAPPSARAGPMGEAGLPVGGGIQDGTCWSTGVEELVS GPSLRLPALSERVYRQSIAQLEKVRAEWEQEHRTTCEAFQLQEFDRLTILRNALWVHSNQ LSMQCVKDDEVGAEGLGVGATEHLPPLPGMGGEVDTAVRGWQKLEDRGCSALGLVLAGAE AQSFLCSPHLQGVLVPLTEPNLLGPCTQKVFRKQEAVSQGRDPSGSKTPSRAQWPVRRPL YEEVRLTLEGCSIDADIDSFIQAKSTGTEPPAPVPYQNYYDREVTPLTSSPGIQPSCGMI KRFSGLLHGSPKTTSLAASAASTETLTPTPERNEGVYTAIAVQEIQGNPASPAQEYRALY DYTAQCLASSSLLMVSLIFEHPLLLRRARVPECVHTSKGPSNVMRFQSLGQNPDELDLSA GDILEVILEGEDGWWTVERNGQRGFVPGSYLEKL >gi568815583r:77014159_77281926|GENSCAN_predicted_CDS_1|3165_bp nncccctgccagcagcccaaagctctgagcgggggactgggcgggatcctggagcctgca gcctctggggacttgagcacggcggtcacagcctgcagggcctctcgctgctttctgact cctcccgaatggccagcgctagacacgctgcttgttgctctgtggacagctgagcctgga ccaggccctcctcaggaatttctagggatgcttatggaggggagagctctcctggatttc gactccatccctgttcccctcacacccctaaggtggcatttgaggggagacatgggtcag gggaaggagtaccaggctttacctgagtcatgtgtgtgcgaccctggagtgtgcgtgacc ttggagactgtaatgtgtgtgtgcagggacttcacagcccacacgggctacgaggtgctg ctgcagcggcttctggatggcaggaagatgtgcaaagacatggaggagctactgaggcag agggcccaggcggaggagcggtacgggaaggagctggtgcagatcgcacggaaggcaggt ggccagacggagatcaacctctgggcctttgctcctgctgttgccctgttgctcccctgc ccagtcccacccatccctcaaggcctgcctgaaatgtcacctccctcagaaagctgtcac ctccactcccctattgctcatcctgcgtcagtgctgtacgttcaggcaggctcccagctt tcaggtgggggctgggaaaagggccagcagctcccagaacctgcgattccttacagcacc cctcagcctgagaccatgaaccggggtggcccacagctggtggacagtggtgtctttgcc aagaggggagccctggaagaggagaggtttgcagggcaggtgctgagtccggttttggac acgctgaatttgaggtatctgtcagatatgagacccaaaagctccctgagggcctccttt gactccttgaagcagcaaatggagaatgtgggcagctcacacatccagctggccctgacc ctgcgtgaggagctgcggagtctcgaggagtttcgtgagaggcagaaggagcagaggaag aagctgggctttcaggattctaaatgtggggttctgagcagacagggggtccccagaagc cagcccaggcttgaggtgccggagcaggcaccctctctgagctgggacgtgcccgcagag cctgcagagaacctgccaggaaggagggaccccggccctgctattttgggccctcctgtg gccagaggagctgcactggctgcaaaggctgttttggacagagcattctttctccaagtg gccagggccccgcaggcagagcgctgtgttgtaagtgtacacacacgtgtgagcgactgt gatgcctgcagaggctcagggatgcaggatgaacgactgtgtgcgcacgtgtgttggggt gggaactccccagctggagacgaagctctggccaagtatgaggccgtcatggaccgggtc cagaagagcaagctgtcgctctacaagaaggccatggagtccaagaagacatacgagcag aagtgccgggacgcggacgacgcggagcaggccttcgagcgcattagcgccaacggccac cagaagcaggtggagaaggtgcgctgggctgctgggccgtgcggcactcagggacccatt cctcggtgcaggcccactgaggcccacagggtgatgagagcagacgccaaccctcacaca accttggagaagggctcagtaggcccatgtcacagagaggaaactgaggctcacaccctc aggcagctcagagcacagttggggagcagcagcagcaccagcgtgggtggccctgacggg cattcagatgaggccccgccatctgctagggcaggtcccatgggggaggcggggctccca gtgggaggaggcatccaggatgggacctgctggagtacaggggtggaggagctcgtgtca gggccctccctgaggctgcctgcgctttcagagcgggtatacaggcagagcattgcgcag ctggagaaggtccgggctgagtgggagcaggagcaccggaccacctgtgaggcctttcag ctgcaagagtttgaccggctgaccattctccgcaacgccctgtgggtgcacagcaaccag ctctccatgcagtgtgtcaaggatgatgaggtgggggctgagggccttggtgtgggagcc acagagcacctgccccctctgccggggatgggaggggaggtggacacagctgtcagaggt tggcagaagctggaggaccgtggctgctctgctctaggcctggtgcttgcaggagccgag gcgcagtccttcctctgctccccacatctccagggtgtcctggttccccttactgagccc aacctgctggggccttgcacacagaaggtgttcaggaaacaggaagctgtcagccagggc cgtgacccctcaggatcaaagaccccgagccgcgcacaatggcctgtgaggaggccgctc tacgaggaagtgcggctgacgctggaaggctgcagcatagacgccgacatcgacagtttc atccaggccaagagcacgggcacagagccccccgctccggtgccctaccagaactattac gatcgggaggtcaccccgctgaccagcagccctggcatacagccgtcctgcggcatgata aagaggttctctggactgctgcacggaagtcccaagaccacttcgttggcagcttctgct gcgtccacagagaccctgacccccacccccgagcggaatgagggtgtctacacagccatc gcagtgcaggagatacagggaaacccggcctcaccagcccaggagtaccgggcgctctac gattatacagcgcagtgccttgcgtcctcatctctcctcatggtttcactcatcttcgag catcctctcctcctcaggagggcacgtgtgcccgagtgtgtccacactagcaagggccct tccaacgtcatgcgctttcaatctcttggccagaacccagatgagctggacctgtccgcg ggagacatcctggaggtgatcctggaaggggaggatggctggtggactgtggagaggaac gggcagcgtggcttcgtccctggttcctacctggagaagctttga >gi568815583r:77014159_77281926|GENSCAN_predicted_peptide_2|530_aa MGQCGITSSKTVLVFLNLIFWGAAGILCYVGAYVFITYDDYDHFFEDVYTLIPAVVIIAV GALLFIIGLIGCCATIRESRCGLATFVIILLLVFVTEVVVVVLGYVYRAKVENEVDRSIQ KVYKTYNGTNPDAASRAIDYVQRQLHCCGIHNYSDWENTDWFKETKNQSVPLSCCRETAS NCNGSLAHPSDLYAEGCEALVVKKLQEIMMHVIWAALAFAAIQVLYIHIRKSRTDFKYDG ERSNFRTNCESCSVVELKEDTSLLLLPPSGNRTDAEMSPGPFSLHSGELLAKGFCGEEEE TSFLVKDRTGPLEWEMALGNMPSSSLVLQLAQEDNSPHPSCTQSSVCLVREQPPPVCSAS WTQTGHITSETKKMEVGSGYLPRRRYSLTTAVQLSNSVHLTLIEYLTTHIPIGRIVSSIA PFPLRCWIWSGSGLREEKTEKERGNSEEVPQGGETSEGWLWLPGICCLSHAGESTYWRCP PAKPQEQRAVTVGNAAGTRAGRKETEEKGHLPPLALLAPSEPTVGQAGKG >gi568815583r:77014159_77281926|GENSCAN_predicted_CDS_2|1593_bp atgggccagtgcggcatcacctcctccaagaccgtgctggtctttctcaacctcatcttc tggggggcagctggcattttatgctatgtgggagcctatgtcttcatcacttatgatgac tatgaccacttctttgaagatgtgtacacgctcatccctgctgtagtgatcatagctgta ggagccctgcttttcatcattgggctaattggctgctgtgccacaatccgggaaagtcgc tgtggacttgccacgtttgtcatcatcctgctcttggtttttgtcacagaagttgttgta gtggttttgggatatgtttacagagcaaaggtggaaaatgaggttgatcgcagcattcag aaagtgtataagacctacaatggaaccaaccctgatgctgctagccgggctattgattat gtacagagacagctgcattgttgtggaattcacaactactcagactgggaaaatacagat tggttcaaagaaaccaaaaaccagagtgtccctcttagctgctgcagagagactgccagc aattgtaatggcagcctggcccacccttccgacctctatgctgaggggtgtgaggctcta gtagtgaagaagctacaagaaatcatgatgcatgtgatctgggccgcactggcatttgca gctattcaggttttatatattcatatacgtaaatcaagaactgattttaagtacgatggt gaacgttctaatttcagaaccaattgcgagtcatgtagtgtggtagaattaaaggaggac acgagcctgcttctgttacctccaagtggtaacaggactgatgccgaaatgtcaccaggt cctttcagtcttcacagtggagaactcttggccaaaggtttttgcggggaggaggaggaa accagctttctggttaaggacagaacagggcctctagagtgggaaatggctttgggaaat atgccaagcagtagccttgttcttcaacttgcccaagaggataattctccacacccttcc tgtactcagtcctcagtttgcctggtgagagagcagcctcctcccgtgtgctctgccagc tggacccagactggccatattaccagtgagaccaaaaagatggaggtggggagtgggtac ttaccaagaagaagatactctctgacgactgcagtacagttgtcaaattccgtccatcta acactgatagaatacctcaccactcatattcccattggccgcatcgtgtcctctatagca cctttccctctgcggtgctggatctggtctggatcagggctaagggaagagaaaacagag aaggaacgtggaaactcagaggaggttccccaaggtggagagacctccgaggggtggctg tggttgcctgggatatgctgcctgtcccatgctggagaatcaacttactggaggtgcccc cccgccaagccacaggagcagagagctgtcacggtggggaatgctgctgggacccgtgca ggacgaaaggagacagaagaaaaaggccatcttcctcctctagccttgttagccccttca gagcccactgtgggtcaggctggcaaagggtaa >gi568815583r:77014159_77281926|GENSCAN_predicted_peptide_3|149_aa MAPDGPREREAACPGPTFLGTARMEAMGEGGPLGAGNRGGPSSTGDRERRGRAAAVSGGL LAPVGRSVGASAPLPAGSAFASVPASLRSLRLATYPVGFAQTGLALPDQWVRHPHLLPEC TAAGTAMDLSHAQPQAPAQGTAAGTSPEE >gi568815583r:77014159_77281926|GENSCAN_predicted_CDS_3|450_bp atggccccggatgggccgagggaaagggaagctgcctgtcctgggcccacattcctgggt actgcccggatggaggccatgggtgagggaggacccttgggagcagggaacagaggtggg cccagcagtactggggacagggagagacgggggagggcggcggctgtctctgggggcctc ctcgctccggttggtcggtcggtcggtgcctctgcgcctctgccagcgggctctgcattt gcatccgtcccggcttctctccgatcactcaggcttgccacctacccagtcggctttgct cagactggattggctctcccggatcagtgggtgagacacccccatctcttaccagagtgc actgcagctggcactgccatggatctgagccatgcccagccccaggcccctgcacagggg acagcagcaggcacctccccggaggaatga >gi568815583r:77014159_77281926|GENSCAN_predicted_peptide_4|319_aa MLLQASSLLLVSCGMAAPCLEGPKCCAETNPSAALAEVRTEVQLSVLTAVPKVGIPWTKN GAIREARRVAVQDPGRDYTKEIKPFYPDASKMMRGRCPVHNTGNDGFQPVCLLHSGSSSP ADMDYISLYSLSNLRKSGPKEKAKAFCFSQTSEGCRNKNSLLSRFICISLLGRKNKQPDP QFGPETPGALKGYRPVDGGTSSGLPKPDSRPQELTRSASPLPGLPWSIRKATCVGLRAHG FTGLELRAESERGLGSLRGTKQRSEYLDGLLNFPECCLAVWHGCGGSAENGKAEVAILAG SWMTIWWVTGSPATAGLAG >gi568815583r:77014159_77281926|GENSCAN_predicted_CDS_4|960_bp atgctgctgcaggccagcagtctgctcttggtgtcctgtggcatggccgcgccttgcctt gaaggcccaaagtgctgtgccgagacaaatccaagtgctgctcttgctgaagtgcgcact gaagtccagctgtccgtgcttacagcagtgccaaaagtggggattccatggaccaaaaac ggtgccataagggaagccagaagagttgcagtgcaagatcctgggagagattataccaaa gaaatcaagcctttttatcctgatgcttctaagatgatgaggggtaggtgccctgtccat aacacaggaaatgatggctttcagcctgtgtgtcttctgcactctgggtcctcatcccct gctgacatggattatatttctctttattccctgtctaatctgaggaaatcaggccccaaa gagaaagcaaaggcattctgcttttcccagacctcagaaggctgcagaaataaaaactcc cttctctcccggttcatctgcatctcgttactgggccgcaaaaataagcagcctgacccc cagtttggtccagaaacacctggagccctgaaaggatacaggccagtggatggtgggacc agttcaggattgcccaagccagactccaggcctcaggaactgaccaggtctgcatcaccc ctgccagggctgccatggagtatccggaaagccacatgtgtaggacttcgggcacatggc tttactgggttagaactgcgtgctgagtccgaaaggggtttgggcagcctgcgtggaaca aagcaaaggtcagagtatttggatgggcttctgaactttcctgaatgctgcttggccgtg tggcatggctgtggagggagcgctgaaaacgggaaggcagaagttgccatccttgcagga agctggatgaccatctggtgggtgactggcagccctgccactgctggtcttgctggctga >gi568815583r:77014159_77281926|GENSCAN_predicted_peptide_5|630_aa MIPPKQPRQPKGAVDDAIAFGGKTDQEAPNASQPTPPPLPKKMIIRANTEPISKDLQKSM ESSLCVMANPTYDIDPNWDASSAGSSISYELKGLDIESYDSLERPLRKERPVPSAANSIS SLTTLSIKDRFSNSMESLSSRRGPSCRQGRGIQKPQRQALYRGLENREEVVGKIRSLHTD ALKKLAVKCEDLFMAGQKDQLRFGVDSWSDFRLTSDKPCCEAGDAVYYTASYAKDPLNNY AVKICKSKAKESQQYYHSLAVRQSLAVHFNIQQDCGHFLAEVPNRLLPWEDPDDPEKDED DMEETEEDAKGETDGKNPKPCSEAASSQKENQGVMSKKQRSHVVVITREVPCLTVADFVR DSLAQHGKSPDLYERQVCLLLLQLCSGLEHLKPYHVTHCDLRLENLLLVHYQPGGTAQGF GPAEPSPTSSYPTRLIVSNFSQAKQKSHLVDPEILRDQSRLAPEIITATQYKKCDEFQTG ILIYEMLHLPNPFDENPELKEREYTRADLPRIPFRSPYSRGLQQLASCLLNPNPSERILI SDAKGILQCLLWGPREDLFQTFTACPSLVQRNTLLQNWLDIKRTLLMIKFAEKSLDREGG ISLEDWLCAQYLAFATTDSLSCIVKILQHR >gi568815583r:77014159_77281926|GENSCAN_predicted_CDS_5|1893_bp atgatacctcccaagcagccacgacagcccaagggagctgtggacgatgccatcgccttt ggagggaaaacagaccaagaagcacccaatgcttcccaacctacaccacccccactgcca aagaagatgatcataagagccaatacagagccaatctccaaggacctccaaaaatccatg gaaagtagtctttgtgtcatggctaatcccacctatgatatcgaccccaactgggatgcc agcagtgctggttcttccatcagctatgaactcaaaggactggacattgagtcttatgac tccttggaaaggcctttgcgcaaggagagacctgtcccctcagcagcaaacagcatttcc agcttaaccactctcagtattaaggatagattttccaacagcatggaatccctctccagc cggcgtgggccctcttgcagacagggccgaggcatccagaagccgcagagacaagcactt tatcgaggacttgagaatcgggaggaagtagtgggtaaaatccgaagccttcatacagat gccttgaagaaactggctgttaaatgcgaagaccttttcatggctgggcagaaagaccag ctccgttttggagtggacagctggtcagacttcaggctaaccagtgacaaaccatgttgt gaggcaggtgatgcggtttactatactgcttcatatgcaaaagatccacttaataactat gcagtcaagatctgtaagagcaaagctaaagaatctcagcagtattatcacagcttggct gtccggcagagtctggctgtccattttaacattcagcaggactgtggtcatttccttgct gaagtccctaaccgtctgcttccctgggaggatccagatgaccctgaaaaggatgaggat gacatggaagagactgaagaagacgccaaaggagaaacggatgggaaaaacccaaagccc tgttctgaagcagcatcatcccagaaagagaatcagggagtcatgagcaagaagcagagg agccacgttgtggtcatcaccagggaggttccatgtcttactgtggctgattttgtgcga gactctctggcccagcatgggaaaagccctgatttgtatgagaggcaggtgtgtctgctg ctcttacagctatgctctggtcttgagcacctcaaaccctaccatgtcactcactgcgat ctacgcctagagaacctgctacttgtccactaccagcctggggggactgcccaaggcttt gggcctgcagagcccagccccacctcatcttatcccactaggcttatagtgagcaacttc tctcaggccaagcagaagagccatctggtggaccccgagatcctccgggaccagtctcgc cttgccccagagatcataacagctacccagtataaaaagtgtgatgagttccagacaggc atcctcatctatgagatgctgcacctacccaacccctttgatgagaacccagagctgaag gagagggaatacacacgagcagacctgcctcgcatcccattccgctccccctactcccgg ggtctgcagcagctggccagctgcctcctgaatcccaacccttctgagcggatcctcatt tcagacgccaaaggcatcctccagtgtctgctctggggcccccgcgaagatctcttccag actttcaccgcctgccctagcctagtacagaggaacaccctgctccaaaactggctagac atcaagcgaacactgctcatgatcaagtttgctgagaagtccctggacagggaaggtgga atcagccttgaggactggctttgtgctcagtatttggcttttgccactacagactccctc agttgtattgtgaaaattctgcagcaccgttaa >gi568815583r:77014159_77281926|GENSCAN_predicted_peptide_6|72_aa MSHRTWPEDIFKWGESKEVDYGSSEKEKRGIPEEGIVILGDDSSMCVTAPEDLPLGQDVE VEDRDIDILDPV >gi568815583r:77014159_77281926|GENSCAN_predicted_CDS_6|219_bp atgagccaccgcacctggcctgaagacatttttaagtggggagaaagtaaagaggtagat tatggaagctcagaaaaagagaaaagaggtataccagaagaaggcattgttatcttagga gatgacagctccatgtgtgttactgcccctgaagacctgccactgggacaggatgtggag gtggaagaccgtgatattgatattcttgaccctgtgtag >gi568815583r:77014159_77281926|GENSCAN_predicted_peptide_7|213_aa MQTTIREYYRHLYTNKLENLEEMDKFLDTYTHPRLNQEEVESLNRPITGSEIVAIINSLL TKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTK KQNFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHI NRTKDKNHMIISIDTEKAFDKIQQPFMLKTLNK >gi568815583r:77014159_77281926|GENSCAN_predicted_CDS_7|642_bp atgcaaactaccatcagagaatactacagacacctctacacaaataaactggaaaaccta gaagaaatggataaattcctcgacacatacacccatccaagactaaaccaggaagaagtt gaatctctgaatagaccaataacaggctctgaaattgtggcaataatcaatagcttacta accaaaaaaagtccaggaccagatggattcacagccgaattctaccagaggtacaaggag gagctggtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccct aactcattttatgaggccagcatcatcctgataccaaagcctggcagagacacaaccaaa aaacagaattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaata ctggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttc atccctgggatgcaaggctggttcaacattcgcaaatcaataaatgtaatccagcatata aacagaaccaaagacaaaaaccacatgattatctcaatagatacagaaaaggcctttgac aaaattcaacaacccttcatgctaaaaactctcaataaatga >gi568815583r:77014159_77281926|GENSCAN_predicted_peptide_8|1260_aa MSACNTFTEHVWKPGECKNCFKPKSLHQLPPDPEKAPITHGNVKTNANHSNNHRIRNTGN FRPPVAKKPTIAVKPTMIVADGQSICGELSIQEHCENKPVIIGWNRNRAALSQKPLNNNN EDDEGISHVPKPYGNNDSAKKMSDNNNGLTEVLKEIAGLDTAPQIRGNETNSRETFLGRI NDCYKRSLERKLPPSCMIGGIKETQGKHVILSGSTEVISNEGGRFCYPEFSSGEESEEDV LFSNMEEEHESWDESDEELLAMEIRMRGQPRFANFRANTLSPVRFFVDKKWNTIPLRNKS LQRICAVDYDDSYDEILNGYEENSVVSYGQGSIQSMVSSDSTSPDSSLTEESRSETASSL SQKICNGGLSPGNPGDSKDMKEIEPNYESPSSNNQDKDSSQASKSSIKVPETHKAVLALR LEEKDGKIAVQTEKEESKASTDVAGQAVTINLVPTEEQAKPYRVVNLEQPLCKPYTVVDV SAAMASEHLEGPVNSPKTKSSSSTPNSPVTSSSLTPGQISAHFQKSSAIRYQEVWTSSTS PRQKIPKVELITSGTGPNVPPRKNCHKSAPTSPTATNISSKTIPVKSPNLSEIKFNSYNN AGMPPFPIIIHDEPTYARSSKNAIKVPIVINPNAYDNLAIYKSFLGTSGELSVKEKTTSV ISHTYEEIETESKVPDNTTSKTTDCLQTKGFSNSTEHKRGSVAQKVQEFNNCLNRGQSSP QRSYSSSHSSPAKIQRATQEPVAKIEGTQESQMVGSSSTREKASTVLSQIVASIQPPQSP PETPQSGPKACSVEELYAIPPDADVAKSTPKSTPVRPKSLFTSQPSGEAEAPQTTDSPTT KVQKDPSIKPVTPSPSKLVTSPQSEPPAPFPPPRSTSSPYHAGNLLQRHFTNWTKPTSPT RSTEAESVLHSEGSRRAADAKPKRWISFKSFFRRRKTDEEDDKEKEREKGKLVGLDGTVI HMLPPPPVQRHHWFTEAKGESSEKPAIVFMYRCDPAQGQLSVDQSKARTDQAAVMEKGRA ENALLQDSEKKRSHSSPSQIPKKILSHMTHEVTEDFSPRDPRTVVGKQDGRGCTSVTTAL SLPELEREDGKEDISDPMDPNPCSATYSNLVPHTFLPSEKQSLRQNKDFSVLFFHEVRRE EHSASPATEQSWMENDFDELREEGFRRSNYSELQEEILTNGKEVRNFEKKLDEWITRITN AEKSLKELMELKAKTRELCEECRRLRSRCDQLEERVSVMEDEMNEMRREGKFREKKNKKK >gi568815583r:77014159_77281926|GENSCAN_predicted_CDS_8|3783_bp atgtctgcttgtaacacctttactgaacatgtttggaaacctggtgaatgcaagaattgc tttaaacctaaaagtttgcaccagcttcccccagaccctgagaaggcacccatcacccat ggcaatgtgaaaactaatgccaatcacagtaacaaccaccgcatcaggaacacgggcaat ttccggcctcctgtggctaaaaaacccactatagctgtgaagcccactatgatagtggca gatgggcaaagtatatgtggtgagcttagcatccaagaacactgtgagaacaaacctgtc atcatagggtggaaccgaaacagagctgccttgagtcagaaaccacttaacaataataat gaagatgatgaaggaattagccatgttcctaagccttatggcaataatgatagtgcaaag aagatgtcagataacaataatggactaactgaagtgttaaaggagatagcaggcttggat actgcccctcagataagaggaaatgaaacaaactccagagaaacattcttgggaagaata aatgattgctataaacgatcattggaaagaaagcttccaccaagttgcatgataggtggg ataaaggaaactcagggcaagcatgttattctgagtgggagcacagaagtgattagtaat gaagggggccggttctgttacccagagttttccagtggcgaggagagtgaagaggatgta cttttcagtaacatggaggaggagcacgagagttgggatgagagtgatgaagagctgttg gccatggagattcgcatgagagggcaacctcgctttgccaacttcagagcaaacacattg tctcctgttcgattctttgtggacaaaaaatggaataccatccccctgcgaaacaagtct ctgcagagaatctgtgctgtggactatgatgacagctatgatgaaatcctgaatggttat gaggaaaattctgtggtctcttatggacaaggaagcattcagagcatggtgtcatctgac tccacatcaccagattcttctttaacagaagaatcacgttctgagacagccagtagttta tcccagaagatttgtaatgggggattatctcctggtaacccaggagattctaaggacatg aaggaaattgagcccaattatgaaagtccctctagtaataatcaggataaagattcatca caggcttccaaaagctcaataaaagttccagagacccacaaagcagtccttgctctccga ttagaagagaaagatggcaagattgctgtacaaactgagaaggaagaaagtaaagcctct acagatgttgctgggcaagcagtaaccataaaccttgtccccacagaagagcaagcaaaa ccttaccgagttgtgaacctggaacagccattgtgcaagccatatactgtcgtggatgtg tcagcagccatggccagtgagcacctcgagggccctgttaacagccccaagacaaaaagc tcatcctctactccaaactctccagttacatcatcttcattgacaccaggacaaataagt gcccatttccaaaaatccagtgcaattcgataccaagaagtatggacttctagcaccagt ccacgacaaaagatacctaaagtagaactaattactagtggaactggaccaaatgttcct ccaaggaaaaactgtcacaaatcagcacctacatcacccacagctacaaacatttcctcc aaaaccatccctgttaagtcacctaatttgtctgaaattaaatttaatagttataacaat gctggtatgccaccttttccaattatcattcatgacgagccaacttatgctcggagttcc aaaaatgctatcaaagttcccattgttatcaatccaaatgcatatgacaatctagctatc tacaaaagttttctgggaacaagtggagaactctcagtgaaggaaaaaaccacaagtgta ataagccatacttatgaagaaatagaaacagaaagcaaagtgcctgataacaccactagc aaaaccactgactgtcttcaaactaaagggttttcaaacagcacagagcataaaaggggc tcagtggctcagaaggttcaagagtttaacaactgtctcaacagaggtcagtcttcacca cagagaagctatagttccagccacagctccccagcaaagatccagagagccactcaagag cctgtggccaaaatagaaggcactcaggagtctcagatggtgggcagcagcagcaccaga gagaaagcaagcacagtgctttctcagattgtggcttcaatccaacccccacagtctcct ccagaaacacctcaatctggccctaaagcttgcagtgtggaagagctttatgccattcct ccagatgctgatgttgctaagagcacacctaagagtacgccagtccggcccaaatctctc tttacatctcagcctagtggtgaggctgaagcacctcagaccacagacagtcctaccacc aaagtacagaaagacccatccataaagccagtcaccccctctccctccaaattagtgact agcccccaaagtgagccaccagctccctttcccccgccacgctctacttcttctccttac catgcaggtaaccttttgcagaggcatttcaccaactggaccaagccaaccagccctacc aggtcaacagaagctgaatcagttttgcactctgaaggcagcaggcgggcagctgatgca aaacctaagcgctggatatcatttaaaagcttcttccgccgtcggaaaacagatgaggag gatgacaaagagaaagagcgagagaaagggaaactggtgggcctggatggcacagtcatt cacatgctgcctcctcctccagttcagcgccatcactggttcacagaggcgaaaggagag tccagtgagaaaccagccattgtcttcatgtacaggtgcgaccctgctcaaggccagctc agtgtggatcagagcaaggctaggacagaccaggcagcagtcatggagaagggtagagca gagaatgcattactacaggactcagagaagaagaggagtcattcttctccatcacagatt cctaaaaagattctcagtcacatgacccatgaagtaacagaggatttttctcctcgggat ccaagaactgttgttgggaagcaagatggcaggggctgcacttcagtcacaacagcattg tccctacctgaactggaaagggaagatggaaaagaagacatttcagatcctatggacccg aacccttgtagtgcaacatacagcaacttagtgccacacactttccttccttctgaaaag caaagcctgaggcagaataaggatttctcagtgctcttcttccatgaagtcaggagagag gaacacagtgcctcaccagcaacggaacaaagctggatggagaatgactttgacgagttg agagaagaaggcttcagacgatcaaactactctgagctacaggaggaaattctaaccaat ggcaaagaagttagaaactttgaaaaaaaattagacgaatggataactagaataaccaat gcagagaagtccttaaaggagctgatggagctgaaagctaagactcgagaactatgtgaa gaatgcagacgcctcaggagccgatgcgatcaactggaagaaagggtatcagtgatggaa gatgaaatgaatgaaatgaggcgagaagggaagtttagagaaaaaaagaataaaaagaaa tga >gi568815583r:77014159_77281926|GENSCAN_predicted_peptide_9|173_aa MALGDTEELIYLTSEEFTEDDLMEMTASKPAPDNEDEDVEEAVPENRLRRLSSAIGDSLC GFSMTQDVLALESLAFYYQATPEVLIPAASQDLLPHSEPGSPLENHSKRLWSTQQRSGII GGTWWEVMGSSGQFPPCCSSDSGDTHRTSHTQDLPMAYQRTSNLQEPKEHRCT >gi568815583r:77014159_77281926|GENSCAN_predicted_CDS_9|522_bp atggctcttggagacactgaagagctaatatacctcacatcagaagaatttacggaagat gacttgatggagatgactgcttccaaaccagcgccagacaatgaagatgaagatgtagaa gaagcagtgccagaaaacagattgagaagacttagttctgccataggggactctttatgt ggcttctctatgactcaggatgtattggccctagaaagtttggccttttattatcaggct actccagaggttctcattccagcagccagtcaggacctacttccccattctgaaccaggc agtcccctagaaaaccattccaagagactgtggagtactcaacagagatcaggtatcata ggagggacctggtgggaagtgatgggatcatcggggcagtttcccccatgctgttctagt gatagtggagacacgcatagaacctcacacactcaagacctccccatggcataccaaaga accagtaacttacaagaaccgaaagagcaccgatgcacttga >gi568815583r:77014159_77281926|GENSCAN_predicted_peptide_10|130_aa MGKKQSRNTENSKNQSASPPPKECSSSPAMEQSRMENDFDELREEGFRRSNYSKLKEEVR THGKEVKNLETRLDEWLTRITNAEKSLKELMELKTTARELCDECTSLSSRFDQLEERVSV MEDQMNEMRV >gi568815583r:77014159_77281926|GENSCAN_predicted_CDS_10|393_bp atggggaaaaaacagagcagaaacactgaaaattctaaaaatcagagcgcctctcctcct ccaaaggaatgcagctcctcaccagcaatggaacaaagccggatggagaatgactttgat gagttgagagaagaaggcttcagacgatcaaactactccaagctaaaggaggaagttcga acccatggcaaagaagttaaaaaccttgaaacaagattagacgaatggctaactagaata accaatgcagagaagtccttaaaggagctgatggagctgaaaaccacggcacgagaacta tgtgatgaatgcacgagcctcagcagccgattcgatcaactggaagaaagggtatcagtg atggaagatcaaatgaatgaaatgcgagtttag >gi568815583r:77014159_77281926|GENSCAN_predicted_peptide_11|210_aa MGHFNTPQTTLDRPTRQKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS KIDHIVGSKAVLSKCKRTEIITNCLSGHSTIKLELRIKKLTQNHSATWKLNNLLLNDYWV HNEMKAEIKMFFETNENKDTTYQNLWDTFKAVRRGKFIALNAHKRNQERSKIDTITSQLK ELEKQEETHSKASRRQEITKIRAELKEIDT >gi568815583r:77014159_77281926|GENSCAN_predicted_CDS_11|633_bp atgggacactttaacaccccacagacaacattagacagaccaacgagacagaaagttaac aaggatacgcaggaattgaactcagctctgcaccaagcagacctaatagacatctacaga actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatagttggaagtaaagcagtcctcagcaaatgtaaaagaacagaaatt ataacaaactgtctctcaggccacagtacaatcaaactagaactcaggattaagaaactc actcaaaaccactcagctacatggaaactgaacaacctgctcctgaatgactactgggta cataatgaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacaca acataccagaatctctgggacacattcaaagcagtgcgtagagggaaatttatagcacta aatgcccacaagagaaaccaagaaagatctaaaattgacaccataacatcacaattaaaa gaactagagaagcaagaggaaacacattcaaaagctagtagaaggcaagaaataactaag atcagagcagaactgaaggaaatagacacataa >gi568815583r:77014159_77281926|GENSCAN_predicted_peptide_12|187_aa MSELPFTIASKRIQYLGIQFTRDVKDLFKENYKPLLNEIKEDTNKWKNIPRSWVGRINIV KMAILPKDVGMGKDFMSKIPKAMATKAKIHKWDLIKLKSFRTAKETTIRVNRQPTEWEKI FAFYSSDKDSVSRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKRHMKKCPSSL AIRETQN >gi568815583r:77014159_77281926|GENSCAN_predicted_CDS_12|564_bp atgagtgaactcccattcacaattgcttcaaagagaatacaatacctaggaatccaattt acaagggacgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa gaggataccaacaaatggaagaacattccacgctcatgggtaggaagaatcaatatcgtg aaaatggccatactgcccaaggatgtaggcatgggcaaggacttcatgtctaaaatacca aaagcaatggcaacaaaagccaaaattcacaaatgggatctaattaaactaaagagcttc cgcacggcaaaagaaactaccatcagagtgaacaggcaacctacagaatgggagaaaatt tttgcattctactcatctgacaaagattctgtatccagaatctacaatgaactcaaacaa atttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaaggacatgaacagacac ttctcaaaagaagatatttatgcagccaaaagacacatgaaaaaatgcccatcatcactg gccatcagagaaacgcaaaactaa >gi568815583r:77014159_77281926|GENSCAN_predicted_peptide_13|90_aa MPKRKAEGDAKGDKAKVRDKPQRRSARLSTKPAPPKPETKPKKVPAKNGEKVLKGEMGKA DTGKEGNKPAENGDAKTDQTQKGEGAGDAK >gi568815583r:77014159_77281926|GENSCAN_predicted_CDS_13|273_bp atgcccaagagaaaggctgaaggggatgctaaaggagataaagccaaagtgagggacaaa ccacagagaagatctgcaaggttgtctactaaacctgctcctccaaagccagagaccaag cctaaaaaggtccctgcaaagaacggagagaaggtactcaaaggagaaatgggaaaagct gacactggcaaggaggggaataagcctgcagaaaatggagatgccaaaacagaccagaca cagaaaggtgagggtgctggagatgccaagtga