GENSCAN 1.0    Date run: 3-Nov-116    Time: 19:49:38

Sequence gi568815595r:51298772_51583828 : 285057 bp : 45.13% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   5362   5535  174  2  0   63   55    65 0.137   0.61
 1.02 Intr +  11461  11555   95  2  2   64  108    97 0.235   8.98
 1.03 Intr +  13705  13805  101  0  2  117   60     5 0.947  -0.49
 1.04 Intr +  14073  14131   59  0  2   77   69    55 0.888   1.03
 1.05 Intr +  16209  16357  149  1  2   88  110   100 0.991  12.15
 1.06 Intr +  31367  31452   86  1  2  113   67   117 0.347  10.72
 1.07 Intr +  34128  34256  129  0  0   82  110    24 0.748   3.81
 1.08 Intr +  39588  39648   61  0  1  102   81    45 0.942   3.94
 1.09 Intr +  40164  40257   94  2  1   77  111   117 0.999  12.44
 1.10 Intr +  42466  42614  149  2  2  108   91   147 0.999  16.95
 1.11 Intr +  44249  44452  204  1  0   74   67    55 0.454   1.40
 1.12 Intr +  50081  50167   87  1  0   75   88    93 0.996   8.17
 1.13 Intr +  51517  51621  105  0  0   84  100    68 0.992   8.01
 1.14 Intr +  56111  56177   67  1  1   41   70   130 0.827   4.98
 1.15 Intr +  57318  57484  167  1  2   65   61   221 0.828  16.78
 1.16 Intr +  57636  57722   87  2  0  122   85    61 0.960   9.37
 1.17 Intr +  58191  58370  180  2  0  108   99   242 0.999  27.36
 1.18 Intr +  58987  59070   84  0  0   96   91   117 0.871  12.82
 1.19 Intr +  59190  59306  117  2  0   31  103    87 0.848   5.16
 1.20 Intr +  60184  60562  379  0  1   78   75    94 0.496   1.64
 1.21 Intr +  61720  61861  142  2  1   30   80    79 0.479   0.71
 1.22 Intr +  63088  63226  139  1  1  107   92    77 0.973  10.47
 1.23 Intr +  63756  63903  148  2  1  122   94     3 0.806   4.11
 1.24 Intr +  75698  75816  119  0  2   89   90   146 0.797  15.08
 1.25 Intr +  75934  76119  186  0  0   28   86    78 0.671   1.49
 1.26 Intr +  76977  77064   88  2  1  108   67    33 0.940   2.74
 1.27 Intr +  80678  80800  123  0  0   83   95    67 0.975   7.46
 1.28 Intr +  80940  81160  221  1  2   62   64   170 0.934  10.02
 1.29 Intr +  81354  81436   83  2  2  140   90    15 0.899   5.34
 1.30 Term +  82314  82788  475  0  1   33   55   546 0.866  40.16
 1.31 PlyA +  84178  84183    6                              -0.45

 2.00 Prom +  85101  85140   40                              -9.26
 2.01 Init +  86460  86791  332  2  2   57   30   445 0.753  30.38
 2.02 Intr +  87459  87564  106  0  1   21  111    81 0.827   3.92
 2.03 Intr +  88966  89107  142  0  1   53  101   227 0.839  20.43
 2.04 Term +  90134  90318  185  0  2  123   34   343 0.999  30.11
 2.05 PlyA +  90349  90354    6                              -0.45

 3.00 Prom +  92086  92125   40                              -7.16
 3.01 Sngl +  92629  95301 2673  0  0   95   41  3411 0.993 330.01
 3.02 PlyA +  99021  99026    6                              -0.45

 4.20 PlyA -  99095  99090    6                               1.05
 4.19 Term - 100056  99998   59  1  2  126   55    79 0.984   6.25
 4.18 Intr - 104624 104372  253  2  1   96   98   496 0.468  48.41
 4.17 Intr - 113709 113608  102  0  0   63   82   158 0.870  13.07
 4.16 Intr - 114295 114222   74  2  2   61   88    98 0.999   6.23
 4.15 Intr - 114615 114511  105  1  0   70   87   105 0.979   8.79
 4.14 Intr - 115272 115179   94  0  1   86   75    37 0.971   1.74
 4.13 Intr - 116086 115874  213  1  0   77   43   243 0.971  17.61
 4.12 Intr - 118100 118016   85  1  1  110  113   110 0.999  15.52
 4.11 Intr - 119427 119345   83  0  2  104   47    10 0.923  -3.06
 4.10 Intr - 120105 119907  199  2  1   96   86   159 0.957  15.85
 4.09 Intr - 122226 120963 1264  1  1   81  110   706 0.916  59.40
 4.08 Intr - 123660 123536  125  2  2   90   82   124 0.999  12.23
 4.07 Intr - 128770 128601  170  1  2  112   81    28 0.967   3.24
 4.06 Intr - 130699 130490  210  1  0   71  100   143 0.999  12.91
 4.05 Intr - 131441 131262  180  2  0   90   73   118 0.882  10.56
 4.04 Intr - 142300 142199  102  1  0   69   98    62 0.980   5.67
 4.03 Intr - 143126 142614  513  2  0   75   99   316 0.999  24.56
 4.02 Intr - 145132 144995  138  1  0   50   95   122 0.968   9.76
 4.01 Init - 152516 152514    3  2  0  108   81     0 0.857   1.30
 4.00 Prom - 153177 153138   40                              -5.26

 5.00 Prom + 154402 154441   40                              -5.36
 5.01 Sngl + 158545 159201  657  0  0   59   43   282 0.953  16.98
 5.02 PlyA + 159470 159475    6                              -0.45

 6.00 Prom + 159598 159637   40                              -4.96
 6.01 Sngl + 160541 161098  558  1  0   70   42   270 0.939  16.74
 6.02 PlyA + 161107 161112    6                               1.05

 7.00 Prom + 161208 161247   40                              -8.66
 7.01 Sngl + 161365 163086 1722  0  0   58   47   261 0.328  13.94
 7.02 PlyA + 163295 163300    6                               1.05

 8.06 PlyA - 164193 164188    6                               1.05
 8.05 Term - 164456 164325  132  2  0  108   39    30 0.101  -1.81
 8.04 Intr - 168105 168032   74  1  2  114   42    44 0.657   1.53
 8.03 Intr - 172234 172158   77  0  2   90   99   115 0.563  11.96
 8.02 Intr - 185065 184948  118  2  1   97   80    52 0.586   4.82
 8.01 Init - 206952 206754  199  0  1   66  107    70 0.225   6.05
 8.00 Prom - 217951 217912   40                              -5.06

 9.05 PlyA - 218079 218074    6                               1.05
 9.04 Term - 232507 232237  271  0  1   80   41   126 0.609   2.06
 9.03 Intr - 234490 234442   49  2  1  119   92    63 0.844   7.54
 9.02 Intr - 235278 235119  160  0  1   66   81    67 0.871   3.36
 9.01 Init - 237988 237983    6  1  0   90  101     0 0.360   2.34
 9.00 Prom - 269446 269407   40                              -4.56

10.02 PlyA - 269581 269576    6                               1.05
10.01 Sngl - 276152 275292  861  2  0   81   47   160 0.474   7.06

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:51298772_51583828|GENSCAN_predicted_peptide_1|1432_aa
XRPHPLKKNGSGSDLKWQPGHSLPQPVWQFGSKPSSFPSTSRGKTEDWSYSDGCCPSPRI
SAEDFLRVPEPDEDECLPSGLDGNETAHKQVWNSYFSLAVLFINQPSLQLEIITSAKRKK
ILDKYGDMRVMMAYELFSMWQNLGEHKIHFIPGMIGPFLGVTLVPQPEVRNIMIPIFHDM
MDWEQRKNGNFKQVEAELIDKLDSMVSEGKGDESYRELFSLLGILRNLYMEIEDVFSFVA
ASSVTSLSLLSAAWSRTQLFGPYPRDCMKGEETENKKIGCTVNLMNFYKSEINKEEMYIR
YIHKLCDMHLQAENYTEAAFTLLLYCELLQWEDRPLREFLHYPSQTEWQRKEGLCRKIIH
YFNKGKLLMKLLGEINQSFLKIFRVFKKKTMVLEKQNPDAMGKKNAPVGQGCLAPREKPP
WQQEKGKERKEEQVSWEFGIPLCRELACQYESLYDYQSLSWIRKMEASYYDNIMEQQRLE
PEFFRVGFYGRKFPFFLRNKEYVCRGHDYERLEAFQQRMLNLQIYAVTPIPDYVDVLQMD
RVPDRVKSFYRVNNVRKFRYDRPFHKGPKDKENEFKSLWIERTTLTLTHSLPGISRWFEV
ERRELVEVSPLENAIQVVENKNQELRSLISQYQHKQVHGNINLLSMCLNGVIDAAVNGGI
ARYQEAFFDKDYINKHPGDAEKITQLKELMQEQVHVLGVGLAVHEKFVHPEMRPLHKKLI
DQFQMMRASLYHAGSQILRDMSPITCCYLAPKPSSLLITQDHWTEVAGRQEVELFNAARG
CHEQKIPPLGITMATGALREVALLKLSGPDPFGADLLCSWMLQVLVISVPPGSQCRSQPV
WTSVDSMKREKEACCSLRGAFISQQEFPGLDKLSPACSGTSTPRGNVLASHSPMSPESIK
MTHRHSPMNLMGTGRHSSSSLSSHASSEAGNMVMLGDGSMGDAPEDLYHHMQLAYPNPRY
QGSVTNVSVLSSSQASPSSSSLSSTHSAPSQMITSAPSSARGSPSLPDKYRHAREMMLLL
PTYRDRPSSAMYPAAILENGQIPQKAAGLLDVECSEPEDQQSRQSVGVSRMVAGTTIGTS
IAHVAICPSRQILDSSSLNSLAKPPNFQRALFQQVVGACKPCSDPNLSVAEKAVPAAPSS WSLDSGAQEAQPFLSAHMGRILAPPVPPRSLLHGRILLGFGVSEVSEYSRLAQEGHPNQK AEDNGIDCARHTSHQTVSSPSSGLELPLPEACSTDSEALNFSDFLCGVITPYTLTPSTTL WVIPPQPSLPGPCASPQSGLDGSNSTLSGSASSGVSSLSESNFGHSSEAPPRTDTMDSMP SQAWNADEDLEPPYLPVHYSLSESAVLDSIKAQPCRSHSAPGCVIPQDPMDPPALPPKPY HPRLPALEHDEGVLLREETERPRGLHRKAPLPPGSAKEEQARMAWEHGRGEQ >gi568815595r:51298772_51583828|GENSCAN_predicted_CDS_1|4299_bp nggaggccccacccactgaagaagaatgggtcagggtccgacctaaagtggcagcctggc cacagtctgccacagccagtgtggcagtttggatccaaaccgtccagtttccccagcacc agcaggggaaaaactgaagactggagctacagtgatggctgctgcccctctcccagaatt tctgctgaagattttttgcgtgttccggaacctgatgaagatgagtgtcttccctcggga ctggatggtaatgagactgctcacaagcaagtgtggaattcttactttagcctggcagtt ctattcataaatcagccaagccttcagctagaaattatcacctcagccaaaaggaagaag attctagataagtatggggacatgcgtgtaatgatggcctatgaactgttcagcatgtgg cagaatttgggtgaacataagatccactttattccgggaatgattggtccttttctgggt gtgacactggtcccacagccagaagtacggaatatcatgattcccatctttcatgacatg atggactgggagcagagaaaaaatggcaacttcaaacaggtggaggccgagttgattgac aagctggacagcatggtgtcagaagggaaaggtgacgagagctacagggagctcttcagc ctacttggcatccttaggaacttgtacatggaaatagaagatgtgttttcatttgttgct gccagttcagtaacctccttatctttgctttctgctgcatggagtagaacccagctgttt gggccctaccccagggactgcatgaaaggagaggaaacagagaataagaagataggctgc actgttaacctgatgaatttttacaaatctgagattaacaaggaagaaatgtatatccgc tacatccataagctttgtgacatgcacttgcaggccgaaaactacacagaggccgcattt accctgctcctttactgtgagctgctgcagtgggaggaccggccactacgggaattcctc cactacccatcgcagacagagtggcagcggaaggagggactgtgccggaagatcattcac tacttcaacaaaggcaagctcctgatgaaactattaggtgagattaaccagtccttcttg aaaatattccgggttttcaagaagaaaaccatggttctagaaaagcaaaacccagatgct atgggaaaaaaaaatgctcccgtggggcagggatgtttagcaccaagagaaaagccacct tggcaacaagagaagggcaaggaaagaaaggaggagcaggtaagctgggagtttgggatc ccactgtgcagggagctggcgtgtcagtacgagagcctctatgattaccagagcctcagc tggattcggaaaatggaggccagctactatgacaacattatggagcagcaacgcctggag cctgagttctttcgggtcggcttctatggcaggaagtttcctttctttcttcggaacaaa 
gaatacgtgtgccgtggccatgactacgagaggctggaggccttccagcagaggatgctc aacttgcagatctatgcagtgacgcccattccagattatgtggatgttctgcagatggat agggtaccagatcgagtcaagagcttctatcgcgtcaacaatgtgaggaagttccggtat gacaggccttttcacaaaggccccaaggacaaggagaatgaattcaagagcctgtggatt gaacgtaccacactgaccctgacccacagcttgcctggcatctctcggtggtttgaagtg gagaggagggaactggtggaggtgagccctctggagaatgccatccaagtggttgagaat aagaaccaggagctacgctccctgatcagccagtatcaacacaagcaggtgcatggcaac attaacctgctaagcatgtgcctgaatggtgtcattgatgcagctgtcaatggaggcatt gcacgctatcaggaggccttctttgataaagattacatcaacaagcacccaggagatgct gagaagatcacccagctcaaggagcttatgcaggagcaggttcatgtccttggagttggg ctagcagttcatgagaagtttgtgcacccagaaatgcggcctctgcataagaagctaatt gatcagttccagatgatgcgggccagtctctaccatgctggctcccagatcttaagggat atgagcccaattacctgctgctacctggcacccaagccaagttctcttctcatcacccag gaccactggacagaagtggcagggagacaagaagtagagctgttcaatgctgcacgaggt tgtcatgagcaaaaaattccccctctagggataaccatggccacaggagccctgagggag gttgcactacttaaactcagtgggcctgacccatttggggcggacctgctgtgttcttgg atgctgcaggttctggtcatctccgtccctccaggctcccagtgcaggtctcagcctgtg tggacaagtgtggacagcatgaaaagagaaaaagaagcttgttgtagtttgagaggggca ttcatttctcaacaggagtttccaggtttggataagctaagtcctgcatgttcaggcacc agcaccccacggggaaatgttctggcatcccatagccccatgagtccggagagcatcaag atgacccaccggcacagccccatgaacttgatgggcacaggccgccattcatcatcctct ctctcctcacatgcgtctagtgaagcaggaaacatggtgatgctgggtgacggctccatg ggtgatgctcctgaggacctgtaccaccacatgcagctcgcgtatcccaaccccaggtac caaggctcagtcaccaacgtctctgttctgtcctcgtcccaggcaagcccttcttcctcc agcctgagttccactcactcagcaccatcccagatgattacctctgccccttccagtgcc cgaggctctccctctctgccagataagtaccgccatgcccgtgaaatgatgttgttgctg cccacataccgggaccgcccaagcagtgccatgtatccagcagccatcctggagaacgga cagatcccgcaaaaggccgctgggcttctggatgtggagtgcagtgaacctgaggaccag caatccagacagagtgtaggcgtgagtagaatggtggcaggaaccaccattggaaccagc attgctcatgtagctatctgtccttcccgccagatcctggactcgtcatccctaaacagt ctagcaaagccgccgaatttccagcgagccctgttccagcaagtggtcggagcctgcaaa ccctgcagtgatcccaatctgtctgtggctgaaaaagctgtgccagcagcccccagcagc 
tggagtctggatagcggagcccaagaggcccagcccttcctgtctgcccatatggggcgc atcctggcccccccagtgcctccccgaagcctgctgcatggaaggatcctgctgggcttt ggggtcagtgaggtttccgagtacagcaggttagctcaggaaggccatcccaaccagaag gctgaagacaatggaattgactgtgccaggcacacttctcaccagactgtctcatcgcct agcagtgggctagagctgccgctgcctgaggcctgttccacagactctgaagcactcaac ttctctgacttcctgtgtggagtcattactccctacactttgacgccttccaccaccctc tgggtgatacccccccagccctccctgcccggaccctgcgcaagcccccagtcaggtctg gacggcagcaactctacgctgtccggcagtgccagcagcggcgtgtcctccttgagtgag agtaactttgggcactcctcggaggccccacctcgcactgacaccatggactccatgcca agtcaggcctggaatgctgacgaagatcttgagccaccctacctccctgtccactacagc ctctctgagtctgccgtcctggactccatcaaggcccagccatgccgaagccactcagcc ccagggtgcgtcatccctcaggaccccatggacccgcctgcgctgccgcccaagccctac cacccccgcctgccggccctggagcacgatgagggggtgctgctgcgtgaagagactgag aggcctcgaggcctgcaccgcaaggctccattgcctcctgggagcgctaaggaggagcag gcccgcatggcctgggagcacggccgaggggagcagtga >gi568815595r:51298772_51583828|GENSCAN_predicted_peptide_2|254_aa MGSYGARPGLGGGAARRVRFSRSAAAAEEEEEEEDEEDEEDVGHAGAGGGAGSERAAGQP GAAAGRLRRCGIGGRGPRSVTVLAAALNPTTSPDGRGQEGEGGGGGGGDSRFYQDLKDRD VTFSPATIENELIKFCREARGKENRLCYYIGATDDAATKIINEVSKPLAHHIPVEKICEK LKKKDSQICELKYDKQIDLSTVDLKKLRVKELKKILDDWGETCKGCAEKSDYIRKINELM PKYAPKAASARTDL >gi568815595r:51298772_51583828|GENSCAN_predicted_CDS_2|765_bp atggggagctacggcgcgcggccgggacttggaggcggtgcggcgcggcgggtgcggttc agtcggtcggcggcggcagcggaggaggaggaggaggaggaggatgaggaggatgaggag gatgtgggccacgcaggggctggcggtggcgctggctctgagcgtgctgccgggcagccg ggcgctgcggccgggcgactgcgaaggtgcgggatagggggccgggggccgcgctccgtg accgttctggcggcggcgctgaaccccacaacatcaccagacggccggggccaagagggc gagggcgggggcgggggcgggggcgacagcagattttaccaggacctcaaagacagagat gtcacattctcaccagccactattgaaaacgaacttataaagttctgccgggaagcaaga ggcaaagagaatcggttgtgctactatatcggggccacagatgatgcagccaccaaaatc atcaatgaggtatcaaagcctctggcccaccacatccctgtggagaagatctgtgagaag cttaagaagaaggacagccagatatgtgagcttaagtatgacaagcagatcgacctgagc acagtggacctgaagaagctccgagttaaagagctgaagaagattctggatgactggggg 
gagacatgcaaaggctgtgcagaaaagtctgactacatccggaagataaatgaactgatg cctaaatatgcccccaaggcagccagtgcacggaccgatttgtag >gi568815595r:51298772_51583828|GENSCAN_predicted_peptide_3|890_aa MKRQSERDSSPSGRGSSSSAKRPREREREAEAGGRRAAHKASGGAKHPVPARARDKPRGS GSGGGGHRDGRGTGDANHRASSGRSSGSGAGGGGRGGKASGDPGASGMSPRASPLPPPPP PPGAEPACPGSSAAAPEYKTLLISSLSPALPAEHLEDRLFHQFKRFGEISLRLSHTPELG RVAYVNFRHPQDAREARQHALARQLLLYDRPLKVEPVYLRGGGGSSRRSSSSSAAASTPP PGPPAPADPLGYLPLHGGYQYKQRSLSPVAAPPLREPRARHAAAAFALDAAAAAAVGLSR ERALDYYGLYDDRGRPYGYPAVCEEDLMPEDDQRATRNLFIGNLDHSVSEVELRRAFEKY GIIEEVVIKRPARGQGGAYAFLKFQNLDMAHRAKVAMSGRVIGRNPIKIGYGKANPTTRL WVGGLGPNTSLAALAREFDRFGSIRTIDHVKGDSFAYIQYESLDAAQAACAKMRGFPLGG PDRRLRVDFAKAEETRYPQQYQPSPLPVHYELLTDGYTRHRNLDADLVRDRTPPHLLYSD RDRTFLEGDWTSPSKSSDRRNSLEGYSRSVRSRSGERWGADGDRGLPKPWEERRKRRSLS SDRGRTTHSPYEERSRTKGSGQQSERGSDRTPERSRKENHSSEGTKESSSNSLSNSRHGA EERGHHHHHHEAADSSHGKKARDSERNHRTTEAEPKPLEEPKHETKKLKNLSEYAQTLQL GWNGLLVLKNSCFPTSMHILEGDQGVISSLLKDHTSGSKLTQLKIAQRLRLDQPKLDEVT RRIKQGSPNGYAVLLATQATPSGLGTEGMPTVEPGLQRRLLRNLVSYLKQKQAAGVISLP VGGSKGRDGTGMLYAFPPCDFSQQYLQSALRTLGKLEEEHMVIVIVRDTA >gi568815595r:51298772_51583828|GENSCAN_predicted_CDS_3|2673_bp atgaagcggcagagcgagcgagactctagcccgagcgggcgcggctcgtcatcgtccgcc aagcgtccgcgggagcgcgaacgggaggcggaggcgggcgggcggcgggcggcgcacaag gcctctggcggcgccaagcacccggttccagcgcgggcccgcgacaaaccccgcggcagc ggaagcggcgggggcgggcatcgcgacggccgcggcaccggggacgcgaatcaccgcgcg agtagcgggcgctcctcgggctccggcgctggcggcgggggacgcggcggcaaggcctcg ggggacccgggcgcctccggcatgtcgccccgcgcgtctcctctgccgccgcctccgcca ccgcctggggccgagcccgcgtgtcccggctcatccgcggccgcgcctgagtacaagacg ttgctcatcagcagcttgagccccgcgctgcccgccgagcacctcgaggaccggctcttc caccagttcaagcgcttcggcgagatcagcctccgcctgtcgcacacgcctgagctgggc cgtgtggcctacgtgaatttccggcacccacaggacgcacgcgaggcccgccagcacgcc ctggcccggcagctgctgctctacgaccgcccgctcaaggtagagcccgtgtacctgcgt ggcggcggcgggagcagtcggcgaagtagcagcagcagcgccgccgcttccacgcctccc ccagggccgcccgcgcccgccgacccgctcggctacctcccgctacacggaggctaccag tacaagcagcgctcgctgtcccccgtcgctgccccgcccctgcgggagccccgtgcccgt 
cacgccgccgcagccttcgccctggatgccgctgctgccgccgccgtgggactgtcccgg gagcgggccctggactactacgggctgtacgacgaccgtgggcgcccctatggctaccca gctgtgtgtgaggaggacctgatgcccgaggatgaccagcgggccacgcgcaacctcttc attggtaacctggaccacagcgtatctgaggtggagctgcgaagggccttcgagaaatat ggcatcatcgaggaggtggtcatcaagaggcctgcccgtggccagggcggtgcctatgcc ttcctcaagttccagaacctggacatggcccatagggctaaggtggccatgtcgggccga gtgattggtcgcaaccccattaagataggctatggcaaggccaaccccaccactcgtctc tgggtgggtggcctgggacctaacacgtcactggcggctctggcccgagagtttgaccgc tttgggagcattcggaccattgatcacgtcaaaggagatagctttgcctatattcagtac gagagcttggacgcagcccaggccgcctgtgctaaaatgaggggttttcccttgggtgga ccagaccgcaggctccgcgtggattttgccaaagcagaggagactcggtacccccagcag taccagccctcgccactccctgtgcattatgagctgctcacagatggatacacccggcac cgcaacctggacgccgacctggtgcgggacaggacgcccccacaccttctgtactcagac cgagaccggacttttttggaaggggactggaccagccccagtaaaagctctgaccgccga aacagccttgagggctacagtcgctcagtgcgcagccggagtggtgagcgttggggggca gatggagaccgtggtttgcccaagccctgggaagagaggcggaaacggagaagcctttcc agtgaccgtgggaggacaacccattcaccatatgaggaacggagtaggaccaagggcagt gggcagcagtcagagcggggctccgaccgcacccctgagcgcagccgcaaggagaaccac tccagtgaagggaccaaggagtccagcagcaactccctcagcaacagcagacatggggct gaggaacggggccaccaccaccaccaccacgaggctgcagactcttcccacgggaagaag gcaagagacagcgagcgcaatcaccggaccacagaggccgagcccaagcctctggaagag ccaaaacacgagaccaaaaagctgaagaatctttcagagtacgctcagacactacagctg ggttggaatgggcttctggtgttgaaaaacagctgcttccccacgtctatgcatatccta gagggggaccagggggtgatcagcagtctcctcaaagaccacacttctgggagcaagctg acccagctgaagatcgcccagcgccttcgactggaccagcccaagcttgacgaggtcaca cgacgcatcaagcaggggagccccaacggctatgcggtcctcttagccacccaggcaacc cccagtgggcttggcactgaggggatgcccacagtagagcccggtctgcagaggcggctt ctcaggaacctggtctcctacttgaaacagaagcaggccgcaggggtgatcagcttgcca gtgggggggtccaagggcagagacggcacaggcatgctctacgccttcccaccctgcgac ttttcccagcagtacctccagtcagcactaaggacattgggcaagctagaagaagaacac atggtgatagtcatcgtcagagacactgcctag >gi568815595r:51298772_51583828|GENSCAN_predicted_peptide_4|1323_aa MEGIVENLFKWAREADQPLRTYSTGLLGGAMENQDIAANYRDENSQLVAIVLRRLRELQL 
QEVALRQENKRPSPRKLSSEPLLPLDEEAVDMDYGDMAVDVVDGDQEEASGDMEISFHLD SGHKTSSRVNSTTKPEDGGLKKNKSAKQGDRENFRKAKQKLGFSSSDPDRMFVELSNSSW SEMSPWVIGTNYTLYPMTPAIEQRLILQYLTPLGEYQELLPIFMQLGSRELMMFYIDLKQ TNDVLLTFEALKVCMHPHNVLSDVVNYTLWLMECSHASGCCHATMFFSICFSFRAVLELF DRYDGLRRLVNLISTLEILNLEDQGALLSDDEIFASRQTGKHTCMALRKYFEAHLAIKLE QVKQSLQRTEGGILVHPQPPYKACSYTHEQIVEMMEFLIEYGPAQLYWEPAEVFLKLSCV QLLLQLISIACNWKTYYARNDTVRFALDVLAILTVVPKIQLQLAESVDVLDEAGSTVSTV GISIILGVAEGEFFIHDAEIQKSALQIIINCVCGPDNRISSIGKFISGTPRRKLPQNPKS SEHTLAKMWNVVQSNNGIKVLLSLLSIKMPITDADQIRALACKALVGLSRSSTVRQIISK LPLFSSCQIQQLMKEPVLQDKRSDHVKFCKYAAELIERVSGKPLLIGTDVSLARLQKADV VAQSRISFPEKELLLLIRNHLISKGLGETATVLTKEADLPMTAASHSSAFTPVTAAASPV SLPRTPRIANGIATRLGSHAAVGASAPSAPTAHPQPRPPQGPLALPGPSYAGNSPLIGRI SFIRERPSPCNGRKIRVLRQKSDHGAYSQSPAIKKQLDRHLPSPPTLDSIITEYLREQHA RCKNPVATCPPFSLFTPHQCPEPKQRRQAPINFTSRLNRRASFPKYGGVDGGCFDRHLIF SRFRPISVFREANEDESGFTCCAFSARERFLMLGTCTGQLKLYNVFSGQEEASYNCHNSA ITHLEPSRDGSLLLTSATWSQPLSALWGMKSVFDMKHSFTEDHYVEFSKHSQDRVIGTKG DIAHIYDIQTGNKLLTLFNPDLANNYKRNCATFNPTDDLVLNDGVLWDVRSAQAIHKFDK FNMNISGVFHPNGLEWDLRTFHLLHTVPALDQCRVVFNHTGTVMYGAMLQADDEDDLMEE RMKSPFGSSFRTFNATDYKPIATIDVKRNIFDLCTDTKDCYLAVIENQGSMDALNMDTVC RLYEVGRQRLAEDEDEEEDQEEEEQEEEDDDEDDDDTDDLDELDTDQLLEAELEEDDNNE NAGEDGDNDFSPSDEELANLLEEGEDGEDEDSDADEEVELILGDTDSSDNSDLEDDIILS LNE >gi568815595r:51298772_51583828|GENSCAN_predicted_CDS_4|3972_bp atggagggaattgtcgagaatcttttcaaatgggcccgagaggccgatcaaccattgagg acatattctactggactgttaggaggtgctatggaaaatcaagacattgctgccaactat agagatgaaaattcacagctggtggcaatagtgcttcgaagactgagggagctacagcta caggaagtggctttgcggcaggaaaacaagcgtcccagtccacggaagctctcttctgaa ccccttttgcctctggatgaggaggctgtggatatggactatggtgacatggctgtagat gtagtggatggagaccaagaggaagcttctggagacatggagatctcctttcatcttgat tcaggccacaagactagtagcagagtgaactcaacaaccaaacctgaggatggaggatta aagaaaaacaagtcagcaaaacagggtgacagagagaactttaggaaagccaagcaaaag ttgggtttctcatcttctgatccagatcgcatgtttgttgagctgtctaatagcagttgg tcagaaatgtctccctgggtgattggcaccaattataccctttatcctatgactcctgct 
atcgagcagcgactcattctccaatatttgacccctctaggagaatatcaggagctactt cccatattcatgcaacttggatcacgggagctgatgatgttctatattgacctgaagcaa actaatgatgtcctgcttacatttgaggcactaaaggtttgcatgcatccccacaatgtt ctgtctgatgtggtgaactataccctgtggttaatggagtgttctcatgcttcaggatgc tgccatgctaccatgtttttttcaatttgcttctcatttcgggccgtcttggagctcttt gaccgctatgatggtcttcgtcgtctggtgaacttgatcagtactttggagattctaaat ttggaagatcagggtgcacttctgagtgatgatgaaatatttgctagccgccaaactggg aaacatacctgcatggccttgcgcaaatactttgaggctcacctggccattaaattggaa caagtgaagcagtcacttcagaggactgagggtggcattcttgtccacccacaacccccg tacaaggcatgctcatatactcatgaacagattgtggaaatgatggaatttttgatagaa tatggcccagcgcagctatattgggaaccagctgaagttttcctcaaactttcttgtgtg caactcttgttgcagcttatttctattgcctgcaattggaagacctattatgcaaggaat gacactgtgcgctttgctttggatgtcctggctattcttactgtggtgccaaaaatccag ctccagttggcagaatcagtggacgtgttggatgaggctggatctacagtctctactgta ggtatcagcattattttgggagtggctgagggtgagttcttcatccatgatgctgaaatt cagaagtcagcacttcagattatcatcaattgtgtgtgtggcccagataaccgaatatcc agtattggtaaatttatctctggtactcctcggagaaagctgcctcagaaccctaaaagc agtgagcacaccctggccaagatgtggaatgtggttcagtccaacaacggcatcaaggtg ctcctgtccttactgtccattaagatgcccatcacagatgcagaccaaatccgggccctg gcctgcaaagccctagtgggcctgtctcgcagtagcactgtccggcagatcatcagtaaa ctgccccttttcagcagctgccagatccagcagctgatgaaggagcctgtgctgcaggac aagcgcagtgaccatgtcaagttctgcaagtatgctgctgaactcattgaacgggtgtca ggaaaaccacttctcattggcactgatgtttccctagcacgactgcagaaagcagatgtt gttgcccagtcaaggatctccttccctgagaaagagctgcttttgttgatacgaaaccat cttatttctaaagggcttggagaaacagcaaccgtgctgacaaaagaggctgacctgccc atgactgctgcctcccattcttctgcctttaccccagtcactgctgctgcttctcctgtc tctctaccccgaacccctcgtatcgctaatggcattgcaactcgtctgggcagccatgct gctgtgggtgcctctgcgccttctgcccctactgctcatcctcagccacggcccccccag ggtccgctagctctgcccggcccatcttatgcaggcaactcccctttgattggtagaatc agttttatcagagagaggccatcaccctgcaatggcaggaaaatcagagtgttgcggcag aagtcggaccatggtgcctacagccaaagcccagccataaaaaaacagctggacagacat cttccttccccacctacgctggacagtataatcacagagtatcttagagaacaacatgct 
cgctgcaagaatccagttgccacctgcccacctttctccctctttactcctcaccaatgt cctgagccaaaacagaggcggcaagcgccaataaactttacgtcaaggctaaaccgcagg gcatcatttccaaagtatggaggggtggatggcggatgctttgataggcaccttatcttt agcagattccgtcctatttcagtgttccgggaagccaatgaagatgagagtggcttcacc tgctgtgcattctcagcacgggagcggttcctgatgcttggcacctgcacagggcagctg aagctctataatgtgtttagtggacaggaggaggccagctataactgtcacaactcagcc atcacacatcttgaaccttccagggatgggtccttgctgctgacatctgctacttggagc cagcctttgtctgcactttggggaatgaagtcagtatttgatatgaagcattccttcaca gaagatcactatgttgagttcagtaagcactcccaggatcgggtcatcggcacaaaagga gacattgcccacatttatgatattcagactggcaacaagctgttgactctgtttaaccca gatcttgccaacaactacaagaggaactgtgccacctttaatcctacagatgatcttgtc ttaaatgatggcgtcctctgggatgtccgctctgcacaggccatccacaagtttgacaag ttcaatatgaacatcagtggtgttttccatccaaatggactggagtgggaccttcgaact tttcatcttttgcatactgttcccgctctggatcagtgtcgcgtggtgttcaatcacacg ggaacagtgatgtatggagctatgttgcaggcagatgatgaagatgacttaatggaagag aggatgaaaagcccctttgggtcatccttccgaacatttaatgcaactgactacaaacct atagcaaccattgatgtgaaacggaacatctttgacctgtgtacagacaccaaagactgc tatcttgctgtcattgagaatcaaggcagcatggatgccctgaacatggacacagtatgc aggctgtatgaagtgggcaggcagcgtctggcagaggatgaggatgaagaggaggaccag gaagaggaagaacaggaggaagaagatgatgatgaagatgatgatgacaccgatgattta gatgagcttgacactgaccagttgctggaggcggagttggaggaggacgacaataatgag aacgcaggggaagatggggacaatgacttctctccctctgatgaggagctagcaaacctt ctagaggagggagaggacggggaggatgaagactctgatgcagatgaggaggtggaactg atcctgggggacactgacagctctgacaactctgatttggaagatgacatcatcttatct ctgaatgagtga >gi568815595r:51298772_51583828|GENSCAN_predicted_peptide_5|218_aa MEDEINEMKREEKFTEKRIKRNEQSLQEIWDYVKTPNLRLIGVPESDRENGTKLENTLQD IIQENFPNLARQANIQIKEIHRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRV THKGKPIRLTADLSAETLQARREWEPIFNTLKEKNFQPRISYPAKLRFISEGEIKYFTDK QMLRDFVTTRPALQELLKEALNMERNNRYQPLQKHAKL >gi568815595r:51298772_51583828|GENSCAN_predicted_CDS_5|657_bp atggaagacgaaattaatgaaatgaagcgagaagagaagtttacagaaaaaagaataaaa agaaacgaacaaagcctccaagaaatatgggactatgtgaaaacaccaaatctacgtctg 
attggtgtacctgaaagtgacagggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatctagcaaggcaggccaacattcaaattaaggaaata cacagaacaccacaaagatactcctcaagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggtt acccacaaagggaagcccatcagactaacagctgatctctctgcagaaactctacaagcc agaagagagtgggaaccaatattcaacactcttaaagaaaagaattttcaacccagaatc tcatatccagccaaactacgcttcataagtgaaggagaaataaaatacttcacagacaag caaatgctgagagattttgtcaccaccaggcctgccttacaagagctcctgaaggaagca ctaaacatggaaagaaacaaccggtaccagccactgcaaaaacatgccaaattgtaa >gi568815595r:51298772_51583828|GENSCAN_predicted_peptide_6|185_aa MDKFLDTYILPRLNQEEVESLNRPITGSEIEAIINSLPTKKSPGPDGFTAEFYQRYKEEL VPFLLKLFQSTEKEGILPNSFYEASIILIPKPGKDTTKKENFRPISLMNIDAKILNKILA NQIQQHIKKLIYHDQVGFIPGMQGWFNICKSINVIQHINRTKDKNHMIISIDAETAFDKI QQPSC >gi568815595r:51298772_51583828|GENSCAN_predicted_CDS_6|558_bp atggataaattcctcgacacatacatcctcccaagactaaaccaggaagaagttgaatct ctgaacagaccaataacaggctctgaaattgaggcaataatcaatagcttaccaaccaaa aaaagtccaggaccagatggattcacagccgaattctaccagaggtacaaagaggagctg gtaccattccttctgaaattattccaatcaacagaaaaagagggaatcctccctaactca ttttatgaggccagcatcatcctgataccaaagcctggcaaagacacaaccaaaaaagag aattttagaccaatatccttgatgaacatcgatgcaaaaatcctcaataaaatactggca aaccaaatccagcagcacatcaaaaagcttatctaccatgatcaagtgggcttcatccct gggatgcaaggctggttcaacatatgcaaatcaataaatgtaatccagcatataaacaga accaaagacaaaaaccacatgattatctcaatagatgcagaaacggcctttgacaaaatt caacaaccttcatgctaa >gi568815595r:51298772_51583828|GENSCAN_predicted_peptide_7|573_aa MIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNTDKQESQIMSELPFTTA SKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNVPCSWVGRISIVKMAILPKVI YRFNAIPIKLPMTFFTELEKTTLKFIWNQKRAHITKSILSQKNKAGGITLPDFKLYYKAT VTKTAWYWYQNRYIDQWNRIEPSEIMPHIYNYLIFDKPDKNKKWGKDSLFNKWCWENWLA ICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDTGMGKDFMSKTPKAI ATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFATYSSDKGLISRIYNELKQIYK KKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSPSLAIREMQIKTTMRYHVTPVRMAII KKSGSNRCWRGCGEIGTLLHCWWDCKLVQPLWKSVWRFLRDLELEIPFDPAIPLLGIYPK 
DYKSCCYKDTFTRMFIAALFTIAKTWNQPKCPTTIDWIKKIWHIYTMEYYAAIKNDEFMS FVGTWMKLPTIILSKLSQGQKTKHRMFSLIGGN >gi568815595r:51298772_51583828|GENSCAN_predicted_CDS_7|1722_bp atgattgtatatctagaaaaccccatcgtctcagcccaaaatctcctcaagctgataagc aacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatac accaataacacagacaaacaagagagccaaatcatgagtgaactcccattcacaactgct tcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaag gagaactacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaacgtt ccatgttcatgggtaggaagaatcagtatcgtgaaaatggccatactgcccaaggtaatt tatagattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaa actactttaaagttcatatggaaccaaaaaagagcccacatcaccaagtcaatcctaagc caaaagaacaaagctggaggcatcacgctacctgacttcaaactatactacaaggctaca gtaaccaaaacagcatggtactggtaccaaaacagatatatagaccaatggaacagaata gagccctcagaaataatgccgcatatctacaactatctgatctttgacaaacctgacaaa aacaagaaatggggaaaggattccctatttaataaatggtgctgggaaaactggctagcc atatgtagaaagctgaaactggatcccttccttacaccttatacaaaaattaattcaaga tggattaaagacttaaatgttagacctaaaaccataaaaaccctagaagaaaacctaggc aataccattcaggacacaggcatgggcaaggacttcatgtctaaaacaccaaaagcaata gcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcacagca aaagaaactaccatcagagtgaacaggcaacctacagaatgggagaaaatttttgcaacc tactcatctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttataag aaaaaaacaaacaaccccatcaaaaagtgggcgaaggatatgaacagacacttctcaaaa gaagacatttatgcagccaaaaaacacatgaaaaaatgctcaccatcactggccatcaga gaaatgcaaatcaaaaccacaatgagataccatgtcacaccagttagaatggcaatcatt aaaaagtcaggaagcaacaggtgctggagaggatgcggagaaataggaacacttttacac tgttggtgggactgtaaactagttcaaccattgtggaagtcagtgtggcgattcctcagg gatctagaactagaaataccatttgacccagccatcccattactgggtatatacccaaag gattataaatcatgctgctataaagacacattcacacgtatgtttattgcggcactattc acaatagcaaagacttggaaccaacctaaatgtccaacaacgatagactggattaagaaa atatggcacatatataccatggaatactatgcagccataaaaaatgatgagttcatgtcc tttgtagggacatggatgaaactgccaaccatcattctcagcaaactatcgcaaggacaa aaaaccaaacaccgcatgttctcactcataggtgggaattga >gi568815595r:51298772_51583828|GENSCAN_predicted_peptide_8|199_aa 
MPTSPWLSWSLVLWGGNQLCDFTKGYLVRSLDLLKQEQGRGGSPQEHHETFHTETSPPLR DEELPHGKAMTTVVVHVDSKAELTTLLEQWEKEHGSGQDMVPILTRMSQLIEKETEEYRK GDPDPFDDRHPGRADPECMLGHLLRILFKNDDFMNALVNAYVMTSREPPLNTAACRLLLD IMPGLETAVVFQEKVLSEK >gi568815595r:51298772_51583828|GENSCAN_predicted_CDS_8|600_bp atgcccactagtccctggttgtcctggtccctggtcctgtggggaggcaaccagctttgt gacttcacaaaaggctatcttgtcagaagcctggacctgctgaagcaggagcagggcaga gggggctcccctcaggaacatcatgagacttttcacacagagacatcgccacctctaagg gatgaagaacttccccacggcaaagccatgactacagtagtggtacatgtggactccaaa gctgagctcactaccctgctggagcagtgggaaaaggaacatggcagtgggcaggacatg gtacctatccttaccaggatgtctcaattgattgaaaaagaaactgaagagtatcgtaaa ggggatccagacccatttgatgatcgacatcctggtcgagctgatccagagtgtatgctg ggccacttgctgagaatactcttcaagaatgatgatttcatgaatgcactggtgaatgca tatgtgatgacaagccgagagccccctttaaacactgcagcttgcagactcctattagac atcatgccagggctggaaactgctgtcgtctttcaagaaaaggtacttagtgaaaagtaa >gi568815595r:51298772_51583828|GENSCAN_predicted_peptide_9|161_aa MNAQSQGKRPLALAVCVSKIAEAGREPAGRHLPRPEDTSPGRKTRNSEDREKGHPEEKED FQAQNLQKQGPRSPPSCHECYQSFYYGGKIQQSFTYHTHIERSCYGVLVKECVESGKSYY KVKHLGVSGSRNGAICHYAIMGLYAAKGSSGFVSPKLDNAE >gi568815595r:51298772_51583828|GENSCAN_predicted_CDS_9|486_bp atgaatgcccagtcccaaggcaaaaggccacttgcgctagcagtgtgcgtcagcaagata gcagaagcaggaagagagccggccggaagacacctacccaggccggaagacacctcccct ggccggaagacacgtaactctgaagatcgagaaaaaggccacccagaagaaaaggaggac ttccaggcccagaacctccaaaagcaaggaccacgcagccctccaagctgccatgagtgt tatcagtctttctactatggaggaaaaatccaacagtcctttacttaccatacccatata gaaagatcctgttatggagtcttagtcaaagaatgtgttgaatcagggaaaagttattat aaagtaaagcatctaggagtatctggcagtcgtaatggggctatatgccattatgccata atggggctatatgccgcaaagggaagcagtggctttgtttcaccaaaactggacaacgcg gagtaa >gi568815595r:51298772_51583828|GENSCAN_predicted_peptide_10|286_aa MGKDFMTKTPKSMATKAKIDKCDLIKLKRFCTAKEMTIRMNRQPTEWEKIFTIYPSDKGL ISRIYKEVKQIYKKKSNNPIKKWLKDMNRYFSKEDIYAANRHMKKCSSSLAIREMQIKTT MRYHFIPVRMAIIKKSGNNRCWRGCGEIGTLLHCWWDCKLVQPLWKTVWRFLKDLELEIP FDPAIPLLGIYPKDYKSCFYKDTCTRMFIVALFTIAKTWNQPKCPSVIGWIKRMWYTYTM 
EYYAAIKKDEFMSFVGTWMKLETIILSKLSQVQKTKHRMFSLMGGN >gi568815595r:51298772_51583828|GENSCAN_predicted_CDS_10|861_bp atgggcaaggatttcatgactaaaacaccaaaatcaatggcaacaaaagccaaaattgac aaatgtgatctaattaaactaaagagattctgcacagcaaaagaaatgaccatcagaatg aacaggcaacctacagaatgggagaaaatttttacaatctacccatctgacaaagggcta atatccagaatctacaaagaagttaaacaaatttacaagaaaaaatcaaacaaccccatc aaaaagtggttaaaggatatgaacagatacttctcaaaagaagacatttatgcagccaac agacacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccaca atgagataccatttcataccagttagaatggcgatcattaaaaaatcaggaaacaacagg tgctggagaggatgtggagaaataggaacacttttacactgttggtgggactgtaaacta gttcaaccattgtggaagacagtgtggcgattcctcaaggatctagaactagaaatacca tttgacccggccatcccattactgggtatatacccaaaggattataaatcatgcttctat aaagacacatgcacacgtatgtttatcgtggcactattcacaatagcaaagacttggaac caacccaaatgtccatcagtgataggctggattaagagaatgtggtacacatacaccatg gaatactatgcagccataaaaaaagatgagttcatgtcctttgtagggacatggatgaag ctggaaaccatcattctgagcaaactatcacaagtacagaaaaccaaacaccgcatgttc tcgctcatgggtgggaactga