GENSCAN 1.0    Date run: 2-Nov-116    Time: 23:57:19

Sequence gi568815583r:55729900_56093555 : 363656 bp : 40.34% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 PlyA -     85     80    6                               1.05
 1.04 Term -   4212   4109  104  1  2   61   54    59 0.133  -2.74
 1.03 Intr -  10785  10483  303  1  0  116  108   197 0.992  20.14
 1.02 Intr -  12434  12267  168  0  0   62  100   130 0.987  10.60
 1.01 Init -  13032  12939   94  0  1   49   80   257 0.994  19.59
 1.00 Prom -  27413  27374   40                              -7.35

 2.00 Prom +  28847  28886   40                              -3.45
 2.01 Init +  45313  45448  136  0  1   94   91    44 0.819   5.65
 2.02 Intr +  49484  49662  179  0  2   98   15    51 0.162  -2.48
 2.03 Term +  50340  50570  231  2  0  -99   48   376 0.766  10.39
 2.04 PlyA +  52441  52446    6                               1.05

 3.00 Prom +  60145  60184   40                              -7.05
 3.01 Init +  64130  64532  403  1  1  100   75   131 0.432   9.84
 3.02 Intr +  73382  73550  169  0  1   48   76   143 0.161   7.18
 3.03 Intr +  78579  78697  119  0  2   93   35    68 0.172   1.19
 3.04 Intr +  80335  80449  115  2  1  111   89    25 0.625   3.49
 3.05 Intr +  83725  83929  205  1  1   54   63   124 0.105   4.78
 3.06 Intr +  89115  89253  139  2  1   49   94    64 0.064   2.22
 3.07 Intr +  90754  90867  114  2  0   94   80    40 0.165   3.30
 3.08 Intr +  95928  96049  122  1  2   55   64    99 0.102   3.49
 3.09 Term +  97355  97420   66  1  0   57   53    80 0.097  -1.64
 3.10 PlyA +  98752  98757    6                               1.05

 4.18 PlyA -  98977  98972    6                              -0.45
 4.17 Term - 100100  99998  103  1  1  116   33   164 0.999  10.47
 4.16 Intr - 100687 100615   73  2  1   76  107    52 0.988   3.45
 4.15 Intr - 103205 103109   97  2  1   83  103    41 0.980   3.76
 4.14 Intr - 104387 104328   60  2  0   81   94    49 0.862   2.81
 4.13 Intr - 108705 108610   96  0  0   80   72    91 0.778   5.99
 4.12 Intr - 110618 110548   71  0  2   30   68    53 0.562  -4.52
 4.11 Intr - 110828 110707  122  1  2   59   49    71 0.563  -0.48
 4.10 Intr - 112264 112035  230  1  2   66  102   257 0.658  20.64
 4.09 Intr - 118531 118207  325  0  1  104   11   165 0.336   5.25
 4.08 Intr - 121316 121249   68  2  2   54   76    49 0.726  -2.92
 4.07 Intr - 122644 122525  120  1  0   81   68    62 0.783   3.27
 4.06 Intr - 126360 126232  129  0  0   79   89    11 0.375   0.27
 4.05 Intr - 130675 130508  168  1  0   65   52   141 0.844   7.42
 4.04 Intr - 130879 130762  118  0  1   52   60   135 0.394   6.65
 4.03 Intr - 133180 133014  167  1  2  104   36    87 0.238   2.94
 4.02 Intr - 139782 139680  103  2  1   76   78    29 0.432  -0.04
 4.01 Init - 143338 143280   59  1  2   65   92    76 0.917   6.53
 4.00 Prom - 146827 146788   40                              -7.65

 5.04 PlyA - 146937 146932    6                               1.05
 5.03 Term - 153398 153206  193  1  1  117   52   151 0.777  10.31
 5.02 Intr - 178675 178635   41  1  2   58  100    22 0.007  -3.50
 5.01 Init - 186932 185385 1548  2  0   43  115   638 0.817  54.46
 5.00 Prom - 188344 188305   40                              -6.45

 6.00 Prom + 195403 195442   40                              -3.65
 6.01 Sngl + 202706 203101  396  1  0   71   39   170 0.470   6.40
 6.02 PlyA + 204699 204704    6                               1.05

 7.00 Prom + 207440 207479   40                              -6.55
 7.01 Sngl + 215149 215487  339  0  0   96   37   182 0.745   9.68
 7.02 PlyA + 215538 215543    6                               1.05

 8.00 Prom + 216563 216602   40                              -6.15
 8.01 Init + 217025 217255  231  1  0   66  116    99 0.539   8.81
 8.02 Intr + 217446 217684  239  2  2   43   82   162 0.647   6.59
 8.03 Intr + 219837 219876   40  0  1   82   71    29 0.121  -2.09
 8.04 Intr + 220819 220968  150  0  0   83   98    17 0.195   1.64
 8.05 Term + 230119 230262  144  0  0   75   41   104 0.238   1.33
 8.06 PlyA + 230449 230454    6                               1.05

 9.08 PlyA - 232052 232047    6                               1.05
 9.07 Term - 234524 234381  144  2  0    3   55   193 0.742   4.43
 9.06 Intr - 239961 239817  145  2  1   73   72    56 0.203   1.86
 9.05 Intr - 258102 257232  871  1  1   72   53   191 0.104   3.46
 9.04 Intr - 263709 263264  446  2  2  124   53   228 0.717  15.02
 9.03 Intr - 277381 275749 1633  2  1   95   53  1167 0.080 100.26
 9.02 Intr - 284920 284761  160  1  1   47   14    76 0.283  -5.26
 9.01 Init - 285598 285389  210  1  0   50   57   205 0.750  12.43
 9.00 Prom - 286199 286160   40                              -8.75

10.05 PlyA - 286690 286685    6                               1.05
10.04 Term - 289394 289255  140  0  2  102   48    68 0.637   1.34
10.03 Intr - 304955 304836  120  0  0   71   72    66 0.011   2.85
10.02 Intr - 314186 314055  132  0  0   45   20   125 0.081   1.10
10.01 Init - 322331 322268   64  2  1   89  115    15 0.703   5.66
10.00 Prom - 323886 323847   40                              -5.25

11.05 PlyA - 326122 326117    6                               1.05
11.04 Term - 327189 326769  421  2  1   23   32   203 0.591   1.98
11.03 Intr - 327738 327387  352  1  1   62   64   168 0.043   5.26
11.02 Intr - 330333 330231  103  0  1   67  107    51 0.052   3.73
11.01 Init - 338498 337821  678  2  0   42   86   246 0.111  14.98
11.00 Prom - 338958 338919   40                              -4.95

12.03 PlyA - 339468 339463    6                               1.05
12.02 Term - 343988 343912   77  0  2   83   47    73 0.216  -0.28
12.01 Init - 351442 351322  121  1  1   75   75    68 0.483   4.60

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  19871  20068  198  1  0   79   44   200 0.836   9.62
S.002 Term - 277381 275718 1664  2  2   95   43  1168 0.801 101.48
S.003 Sngl - 338498 337632  867  2  0   42   41   301 0.854  16.44

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583r:55729900_56093555|GENSCAN_predicted_peptide_1|222_aa
MAPPLRPLARLRPPGMLLRALLLLLLLSPLPGLREGIGELITPIGTSLPDLDPARRRWEG
GIGRVGSEVADLCPGKEGGKVPEAEKEGVWCFSELSFVKEPQDVTVTRKDPVVLDCQAHG
EVPIKVTWLKNGAKMSENKRIEVLSNGSLYISEVEGRRGEQSDEGFYQCLAMNKYGAILS
QKAHLALSMLAASLASTLPIPIDSLPVVTTKKVLLWGKITPD

>gi568815583r:55729900_56093555|GENSCAN_predicted_CDS_1|669_bp
atggcgcctcctctgcgacccctcgcccggctgcgaccgccggggatgctgctccgcgcg
ctcctgctcctgctgctgctcagtcctttgccagggctgcgagagggaataggtgaactc
ataaccccaatcggcaccagcttgccggatctggatccagccaggaggagatgggagggt
ggaattggcagggttggaagtgaagtggccgatttgtgccccggaaaggaggggggaaaa
gtccccgaagctgaaaaggaaggagtgtggtgctttagcgaactgtcttttgtaaaagaa
ccacaggatgtaactgtcacaagaaaggacccagtcgttttagattgccaggctcacgga
gaagttcctattaaggtcacatggttgaaaaatggagcaaaaatgtctgaaaataaacgg
atcgaggttctttctaacggctctttatacatcagtgaggtggaaggcaggcgaggagag
cagtccgatgaaggattttatcagtgcttggcaatgaacaaatatggagccattcttagt
caaaaagctcatcttgccttatcaatgttagcggcatccctggcctctaccctcccaata
ccgatagactctcttccagttgtgacaaccaagaaagttctactgtggggcaaaataact
cctgattga

>gi568815583r:55729900_56093555|GENSCAN_predicted_peptide_2|181_aa
MRVCGDVELMVENCNCKSGNQKLILTITKAGMNLHACQRGKRNQSGRDPWRDYLLLDQSA
MAKVTWPQSMDVPFGGRPFELGPLFRKGVFGSCAEPQEASTSPSEKKKKEEVRGRRRKKK RKKEEEEGGSRRRKRKKEVEEEEEEEKEEGRGGEERRWRKEEEERKEDERVEERGRRMEE D >gi568815583r:55729900_56093555|GENSCAN_predicted_CDS_2|546_bp atgcgggtatgtggtgatgtggaactcatggtagagaactgcaattgtaaatctgggaat caaaaacttattctcacaataaccaaagctggaatgaatctccatgcatgtcaaagggga aaaagaaatcaatcagggagggacccatggagggattatctactcctggatcaatcagct atggccaaagtcacctggccacaaagtatggatgtgccctttgggggccgcccctttgag ttagggccactcttcagaaaaggggtctttgggagctgtgcagaaccacaagaagcatct acctcaccatcagagaagaagaagaaggaagaagtgaggggaagaagaaggaagaagaag agaaagaaggaagaagaggaaggaggaagcagaaggaggaagaggaaaaaggaggtggag gaggaggaggaagaagaaaaagaagaaggaagaggaggagaagaaagaaggtggaggaag gaggaggaagaaagaaaagaagatgagagggtagaggagcgggggagaaggatggaggag gattaa >gi568815583r:55729900_56093555|GENSCAN_predicted_peptide_3|483_aa MAERGQLRAQPVASECASPKPWQRPCAIESASAQKSRTGVWEPLPRFQKMYGNAWMPRQK FAAGVGPSWRTSTKAMWKGNVGSEPPCRVSTGPLPSRAVRSRPPSARLQNGRSTDSLHRE PGKATDTQRQPVKAELAQLLRLMAAQGTPLSSCLYQLTLLRLGVEVPVEPAVGGNLGPVM LQSLTQGVPGRDMDEARNHHSQQTIARTENQTPYVLTHRWELNNENTWTQGFNPGTHNSF SSHIALCQLQSGTVTQSFLVFHDFDSFEEIGGGPSLSQQLREQGGNQSWTGHHPNTERTR TLTHPGQLRHAYSPKEYTFGMWEESRVSKENPHLPGRGCYPAVTLSTGAGARAPTAFGLL APRRQSEEQQEDHFIVQGTGRVSASMTFSREHLQYCGDHLSTGWATFSKIPASIGATVAH SESGVTYQVDMRSESIQVAPESAVIRGWKDSDLEGNQGHLSEKHMMSLLYENDPTDCPAV ECI >gi568815583r:55729900_56093555|GENSCAN_predicted_CDS_3|1452_bp atggctgaaaggggccaacttagagctcagcctgtagcttcagagtgtgcaagccccaag ccttggcagcgtccatgtgccattgagtctgcgagtgcacagaagtcaagaactggggtt tgggaacctctgcctagatttcagaagatgtatggaaatgcctggatgcccaggcagaag tttgctgcaggggtggggccctcgtggagaacctctactaaggcaatgtggaagggaaat gtggggtcagagccaccatgcagagtctctactgggccactgcctagtagggctgtgaga agcaggccaccatccgccagactccagaatggtagatccactgacagcttgcaccgtgaa cctggaaaagccacagacactcagcgccagcctgtgaaagcagagcttgcccagctcctc aggcttatggctgcacagggaacacctctcagcagctgcctttaccagctgaccttgctc agattgggggtggaggtccccgtggaacctgcagttggaggaaacctgggccctgtgatg ctgcagtcactcacacaaggtgtcccaggaagggacatggatgaagctagaaaccatcat 
tctcagcaaactattgcaaggacagaaaaccaaacaccctatgttctcactcacaggtgg gaattgaacaatgagaacacttggacacagggattcaacccaggaacacataattcattt agcagtcacatagctttatgtcagctccaatctgggacagttactcagtcttttctggtc ttccatgattttgacagttttgaagagatcggaggtggcccaagtctatcccaacagctc agggagcaaggtgggaaccaatcctggacaggacaccatcccaacacagagcgcacacgc acgctcactcatcctggccaactgagacatgcctattcacctaaagagtacacctttggg atgtgggaggaaagcagagtatccaaagaaaacccacacttacctgggaggggctgttac ccagcagtgacgttatccacaggtgcaggggccagagctcctacagcttttggccttcta gcacctagacgccaaagtgaagagcaacaagaggaccactttatagtgcaaggcacaggc agggtctcagcttccatgaccttcagtagggaacaccttcagtattgtggggaccacctc agtactgggtgggcgactttcagcaaaattcctgcctctataggagctacagtggcacac tcagaaagtggagtgacttaccaagtagacatgagaagtgaaagcattcaggtggcaccg gagtctgcagtgattaggggctggaaggactctgatttggaggggaaccaagggcacctg agtgagaaacatatgatgtccttgctttatgaaaatgatcctactgactgtcccgcagta gaatgcatttga >gi568815583r:55729900_56093555|GENSCAN_predicted_peptide_4|702_aa MGSPDERDKNGGPAIDPQSTHKSRVKGYLRLKMTYLPKTSGSEDDNAEQAEELEPGWVVL DQPDAACHLQQQQEPSPLPPGWEERQDILGRTYYVNHESRRTQWKRPTPQDNLTDAENGN IQLQAQRAFTTRRQISEETESVDNRESSENWEIIREDEATMYSNQAFPSPPPSSNLDVPT HLAEELNARLTIFGNSAVSQPASSSLSLKGKSQISRYSSVVLFFHKNHSSRRGSLQAYTF EEQPTLPVLLPTSSGLPPGWEEKQDERGRSYYVDHNSRTTTWTKPTVQIFAIHFVCPSFE STSVYEEFPILYKKNTMGRSSVGECSNNWTSKSLCQCALMGLSAHLSLQEQIVTGITLQA LRGKHIGRSLLSPISHTPSLSYLKRFYNVIILTLRNSSEGTSEPDSQNAKAVWAAELRLN DIPNKFEMKLRRATVLEDSYRRIMGVKRADFLKARLWIEFDGEKGLDYGGVAREWFFLIS KEMFNPYYGLFEYSATDNYTLQINPNSGLCNEDHLSYFKFIGRVAGMAVYHGKLLDGFFI RPFYKMMLHKPITLHDMESVDSEYYNSLRWILENDPTELDLRFIIDEELFGQGFFELIPQ DLIKIFDENELEAVLMMDSEKRIRLLQFVTGTSRVPMNGFAELYGSNGPQSFTVEQWGTP EKLPRAHTCFNRLDLPPYESFEELWDKLQMAIENTQGFDGVD >gi568815583r:55729900_56093555|GENSCAN_predicted_CDS_4|2109_bp atgggctctcctgatgaaagagataagaacggagggccagccattgaccctcagtcaact cacaaatcaagagttaaaggttatctgagactaaaaatgacttatttacctaaaaccagt ggctcagaagatgataatgcagaacaggctgaggaattagagcctggctgggttgttttg gaccaaccagatgctgcttgccatttgcagcaacaacaagaaccttctcctctacctcca 
gggtgggaagagaggcaggatatccttggaaggacctattatgtaaaccatgaatctaga agaacacagtggaaaagaccaacccctcaggacaacctaacagatgctgagaatggcaac attcaactgcaagcacaacgtgcatttaccaccaggcggcagatatccgaggaaacagaa agtgttgacaaccgagagtcttccgagaactgggaaattataagagaagatgaagccacc atgtatagcaaccaggccttcccatcacctccaccgtcaagtaacttggatgttccaact catcttgcagaagaattgaatgccagactcaccatttttggaaattcagccgtgagccag ccagcatcgagctcactgtcactcaaaggcaaatcacaaatatcaaggtattcttctgtt gtcttgttttttcacaagaatcattccagcagaagaggcagcttacaagcctatactttt gaggaacaacctacacttcctgtgcttttgcctacttcatctggattaccaccaggttgg gaagaaaaacaagatgaaagaggaagatcatattatgtagatcacaattccagaacgact acttggacaaagcccactgtacagatatttgccattcattttgtctgcccatcatttgaa tctacttctgtgtatgaggaattccccatcctatataaaaagaacacaatgggaagatcc tcggttggagaatgtagcaataactggaccagtaagtctctctgccagtgtgctctgatg ggactgtccgctcacctgtctcttcaagagcagatagtcacgggcataaccctacaggca ttgagaggaaagcacattggccgttctctcctctcccccatttcccatactccaagtctc agttatttgaaaagattctataacgtcattatccttaccctacgcaatagcagtgagggg acaagtgagccagacagtcagaatgctaaggcagtgtgggctgctgagctcaggctgaat gacattccaaacaaatttgaaatgaaacttcgccgagcaactgttcttgaagactcttac cggagaattatgggtgtcaagagagcagacttcctgaaggctcgactgtggattgagttt gatggtgaaaagggattggattatggaggagttgccagagaatggttcttcctgatctca aaggaaatgtttaacccttattatgggttgtttgaatattctgctacggacaattatacc ctacagataaatccaaactctggattgtgtaacgaagatcacctctcttacttcaagttt attggtcgggtagctggaatggcagtttatcatggcaaactgttggatggttttttcatc cgcccattttacaagatgatgcttcacaaaccaataacccttcatgatatggaatctgtg gatagtgaatattacaattccctaagatggattcttgaaaatgacccaacagaattggac ctcaggtttatcatagatgaagaactttttggacagggattctttgaactaataccacag gatctcatcaaaatttttgatgaaaatgaactagaggctgttttaatgatggattcagaa aaaagaataagattacttcagtttgtcactggcacatctcgggtgcctatgaatggattt gctgaactatacggttcaaatggaccacagtcatttacagttgaacagtggggtactcct gaaaagctgccaagagctcatacctgttttaatcgcctggacttgccaccttatgaatca tttgaagaattatgggataaacttcagatggcaattgaaaacacccagggctttgatgga gttgattag >gi568815583r:55729900_56093555|GENSCAN_predicted_peptide_5|593_aa 
MAQSLRLHFAARRSNTYPLSETSGDDLDSHVHMCFKRPTRISTSNVVQMKLTPRQTALAP LIKENVQSQERSSVPSSENVNKKSSCLQISLQPTRYSGYLQSSNVLADSDDASFTCILKD GIYSSAVVDNELNAVNDGHLVSSPAICSGSLSNFSTSDNGSYSSNGSDFGSCASITSGGS YTNSVISDSSSYTFPPSDDTFLGGNLPSDSTSNRSVPNRNTTPCEIFSRSTSTDPFVQDD LEHGLEIMKLPVSRNTKIPLKRYSSLVIFPRSPSTTRPTSPTSLCTLLSKGSYQTSHQFI ISPSEIAHNEDGTSAKGFLSTAVNGLRLSKTICTPGEVRDIRPLHRKGSLQKKIVLSNNT PRQTVCEKSSEGYSCVSVHFTQRKAATLDCETTNGDCKPEMSEIKLNSDSEYIKLMHRTS ACLPSSQNVDCQININGELERPHSQMNKNHGILRRSISLGGAYPNISCLSSLKHNCSKGG PSQLLIKFASGNEGKVDNLSRDSNRDCTNELSNSCKIVPMLVIPQTHFEKQSLSVATPAT GLWGVLPDYHQCSLKAPGLFSHLVVNAARPGIHSTEKWAPLYPREGPEMPSKN >gi568815583r:55729900_56093555|GENSCAN_predicted_CDS_5|1782_bp atggcacaaagcttacgattgcactttgcagccagaagaagcaatacttaccctttgtca gaaacctccggagatgacttggatagccatgttcacatgtgcttcaaaagaccaacacgg atttcaacgtctaacgttgttcaaatgaagctgactcccagacagactgcactagctccg ttaataaaggaaaacgttcagtctcaagaaagatcatctgttccctcatctgaaaatgtt aataaaaagagcagctgtctacagatttcactacagccaacaaggtacagtggatatctt cagtctagcaatgtcttagctgatagtgatgatgcttcgtttacttgtatcttgaaggat ggtatttacagtagtgctgtggtcgataatgaattgaatgctgtgaatgatggtcacctt gtaagcagtccagccatttgtagtggtagccttagtaacttttcaaccagtgataatggg tcttacagcagcaacggtagtgattttgggtcatgtgcaagtatcacaagtggaggttca tatactaacagtgtcatcagtgacagtagtagttatacttttccaccaagtgatgatact tttttgggtggaaacttaccttctgacagcacctccaatagaagtgtgccaaacaggaat actactccttgtgaaattttttcaagaagtacaagtacagatccttttgtccaggatgac ttggaacatggattagagattatgaaattgccagtgagcaggaacacaaaaattccacta aaacgttactcctccttagtcatttttcctaggagtccttcaactacccgaccgacttct ccaacaagtctgtgtactcttctgagcaaaggatcctatcaaacttcacaccagtttatt atttctcctagtgaaattgcacataatgaggatggcactagtgctaaaggatttctttca acagctgtcaatggacttcggttatctaaaacaatttgtacccccggagaagtaagagac atacggccgcttcacaggaagggctcgttacagaagaaaattgttctttcgaataatact cccagacagactgtctgtgaaaagtcatctgaaggatattcttgtgtttcagtgcatttc acccaacgaaaagcagctacattagactgtgaaacaacaaatggtgattgtaaaccagaa atgtcagaaattaagcttaattctgattcagagtatattaagctcatgcataggacatct 
gcatgtttgccatcctcccaaaatgtagattgtcaaataaatatcaatggagaattggaa agaccacattcacagatgaacaaaaaccatggtattttacgaagaagtatttcattggga ggagcttatccaaatatttcctgtctatccagccttaagcacaattgttctaaaggggga ccatctcagttactcataaagtttgcatctggaaatgaaggtaaagtggataatttatca agagacagcaacagagattgcacaaatgaactgtctaattcttgcaagatcgtaccaatg ctggtaattccccagacacactttgagaagcagagtctctccgtggccactcctgccaca gggctatggggagtactgccagactaccaccagtgttcactcaaggccccagggctcttc agtcacctggtggtgaatgctgccaggcctggaattcactctactgagaagtgggctccc ctgtatcccagggaaggtccagaaatgccatccaagaactga >gi568815583r:55729900_56093555|GENSCAN_predicted_peptide_6|131_aa MGKDFMTKTPKAMATKAKIDKLELIKLKSFCTAKETTIRVNRQPTEWEKIFTIYPSDKGL ISKIYKELKQIYKKKKSNNSIKKWAKDMHRHFSKEDICSQKTHEKMLIITGHQRVCINAN QNDNEIPSHTS >gi568815583r:55729900_56093555|GENSCAN_predicted_CDS_6|396_bp atgggcaaggacttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgac aaattggagctaattaaactaaagagcttctgtacagcaaaagaaactaccatcagagtg aacaggcaacctacagaatgggagaaaatttttacaatctacccatctgacaaagggcta atatccaaaatctacaaagaacttaaacaaatttacaagaaaaaaaaatcaaacaactcc atcaaaaagtgggcaaaggatatgcacagacacttctcaaaagaagacatatgcagccaa aagacacatgaaaaaatgctcatcatcactggccatcagagagtttgcataaatgcaaat caaaacgacaatgagataccatctcacaccagttag >gi568815583r:55729900_56093555|GENSCAN_predicted_peptide_7|112_aa MGRNQSRKAENSKNQSASSPPKDRSPSPATEQSWKENDFDELTEVGFRRSVKTNFSKLKK HVLTHCKEPKNLEKRLAQWLTRINSVEKTLNDLMELKTMAQELCDTCTSFSS >gi568815583r:55729900_56093555|GENSCAN_predicted_CDS_7|339_bp atggggagaaaccagagcagaaaagctgaaaattctaaaaaccagagcgcatcttctcct ccaaaggatcgcagcccctcgccagcaacagaacaaagctggaaggagaatgactttgat gagttgacagaagtaggcttcagaaggtcggtaaaaacaaacttctccaagctaaagaag catgttctaacccattgcaaggaacctaaaaaccttgaaaaaaggttagcccaatggcta actagaataaatagtgtagagaagaccttaaatgacctgatggagctgaaaaccatggca caagaactttgtgatacatgcacaagcttcagtagctga >gi568815583r:55729900_56093555|GENSCAN_predicted_peptide_8|267_aa MKAETKMFFETNENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELE KQEQTHSKASRRQEITKKYKLQTTIREYYKHLYANKLEKLEEMDKFLDTYTLPRLNQEEV 
ESLNRPITGSEIETIINSLPTKKSPGPDVFTAEFYQRWELNNENTWTQEGVNRETTQQLK GNMGSRKNILGMIQVGIIQRGEIPKMHLYHCSGRVSILATLLNVVNPDNYLTQLPPGNHL PKKQPDSTYLTCSTDPIPPKDCRNKPQ >gi568815583r:55729900_56093555|GENSCAN_predicted_CDS_8|804_bp atgaaggcagaaacaaagatgttctttgaaaccaatgagaacaaagacacaacataccag aatctctgggacacatttaaagcagtgtgtagagggaaatttatagcactaaatgcccac aagagaaagcaggaaagatctaaaattgacaccctaacatcacaattgaaagaactagag aagcaagagcaaacacattcaaaagctagcagaagacaagaaataactaagaaatacaaa ctacaaactaccatcagagaatactataaacacctctatgcaaataaactagaaaagcta gaagaaatggataaattcctcgacacatacaccctcccaagactaaaccaggaagaagtt gaatctctgaatagaccaataacaggctctgaaattgagacaataattaatagcctacca accaaaaaaagtccaggaccagacgtattcacagctgaattctaccaaaggtgggaattg aacaatgagaacacttggacacaggaaggggtgaacagagaaacgactcaacagctaaag gggaatatggggtcaaggaagaacattttagggatgatacaagtgggaataatccagaga ggggaaattcctaaaatgcatctctaccactgctctggtagggtttcgattctggcaaca ctcctcaatgtggtcaatccagataactaccttacacaactgcctcctggtaaccatctc cctaagaaacaaccagattcaacctacttgacttgctccactgaccccatacccccaaag gactgcagaaataagccacagtaa >gi568815583r:55729900_56093555|GENSCAN_predicted_peptide_9|1202_aa MWKRLWNWVTQRGWNSLEGSEEDRKMWESLEPPRDLLNGFEKKADSDMNNKVQADVVSDG DEELVGTGAKLLQTLLKGANTELGLWLQRVKASSLSSFHMVLSVRVHRRQELRFENLCLD FRRSPAPAAAAPAPAAAAPAPAAPSPPPPPPPGSWRAECRLTGMPKEKYDPPDPRRIYTI MSAEEVANGKKSHWAELEISGNCLLRVLPYELGGLFQLQTLGLKGNPLSQDILNLYQDPD GTRNLLNFMLDNLAVHPEQLPPRPWITLKERDQILPSASFTVICYNVLCDKYATRQLYGY CPSWALNLEYRKKGIMEEIVNCDADIISLQEVETEQYFTLFLPALKEREYDGFFSPKSRA KIMSEQERKHVDGCAIFFKREKFTLVQKHAVEFNQVAMANSDGSEAMLNRVMTKDNIGVT VVLEVHKELFGAGMKPIHAADKQLLIVANAHMHWDPEYSDVKLIQTMMFVSEVKTILEKA SSRSGSPTADPNSIPLVLCADLNSLPDSGVVEYLSNGGIADNHKDFKELRYNKCLMNFSC NGKNGSSEGRITHGFQLKSAYENNLMPYTNYTFDFKGVIDYIFYSKTHMNVLGVLGPLDP QWLVENNITGCPHPHIPSDHFSLLTQLELHPPLLPLVNGVHLPNRRWWSTAPPRRGSVAM DLYSCKSNCLPWAGASASGLLEAFGNGNLRGGGVRAPGGRGAGTAGLRGGLPFGLFSSRT RGGSGGGHDRRLCCQPLLPRREPGERARRRGDCGGGSARGRWGLAGGSKLVGSTPAPSRS GRRLTCQFPSGSVEEGARRSAGDPHHRASPFLPPPPELEKTTLKFIWNQKRARIAKSILS 
QKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEK NKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLG ITIQDIGVGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTTWEKIFAT YSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAIR EMQIKTTMRYHLTPVRMAIIKKSGNNRSLPVAATAGNVLDHTRSQHNTESCPRPVATAAW LPRCLFKAQELFSQQICQPNGEEERLTELKDRSVEIVTQLEDERDERDKDWRKTTTTTTT EP >gi568815583r:55729900_56093555|GENSCAN_predicted_CDS_9|3609_bp atgtggaagcgactttggaactgggtaacacaaagaggttggaacagtttggagggctca gaagaagacaggaaaatgtgggaaagtttggaacctcctagagacttgttgaatggcttt gaaaaaaaggctgatagtgatatgaacaataaggtccaggctgacgtggtctcagatgga gatgaggaacttgttggaactggagcaaagctgctccagacattgctgaaaggggccaac acagagcttggtctgtggcttcagagggtgaaagcctcaagccttagcagcttccacatg gttttgagcgtgcgggttcacagacgtcaagaattgaggtttgagaacctctgcctagat ttcagaagatcaccagcaccagccgcggcagcaccagcaccagccgccgcagcaccagca ccagccgccccatcgccaccgccgccgccgccgcccggatcctggcgcgctgaatgcaga ctaacagggatgccaaaggaaaaatatgatcctccagatcctcgcagaatttataccatc atgtcagcagaggaggtagccaatgggaaaaaatctcactgggcagaattagaaatctcg gggaattgcttgttacgggttttgccttatgaacttggtgggctcttccagctacaaact ctaggtttgaaaggcaatcctttatcacaggatattctcaacttataccaggacccagat ggaacccgaaatctactgaacttcatgcttgacaatctcgcagttcatccagagcagctt cctccgaggccatggattacattaaaagaacgagaccaaattctgccatcagcatcattc acggttatctgttacaatgtgttatgtgataaatacgctacccggcagctatatggttat tgcccatcctgggcattaaacttggaatacaggaaaaagggaattatggaagaaattgtt aactgtgacgcagatatcattagtcttcaggaagtggaaacagagcaatacttcactctc tttctgccagcattgaaggagcgtgaatatgatggatttttttctccaaagtcacgtgcc aaaatcatgtctgagcaggagagaaagcatgtagatggttgtgcaatatttttcaaaaga gaaaaatttacattggtgcagaagcatgcagtggaatttaaccaagtggcaatggctaat tcagatggatccgaagctatgctgaacagagtgatgacaaaagataacattggtgtcact gtggtattagaggtccacaaagaactatttggagcaggtatgaagcctattcatgctgca gacaaacagctgcttatagtggcaaatgcccacatgcattgggacccagagtattctgat gtgaagctcatccagaccatgatgtttgtctcagaggttaaaaccattctggagaaagcc tctagtaggtctggcagcccaactgcagatcctaattccatcccgctggtgctatgtgca 
gatcttaactcattgccagattcaggtgttgtggaatacttaagcaatggaggaatagct gacaaccataaagacttcaaggaactaaggtacaataagtgtcttatgaacttcagctgc aatggaaagaatggaagctcagaagggagaatcacacatggcttccaacttaagagcgcc tacgaaaataacttgatgccttacaccaattacacctttgatttcaaaggcgtgattgac tacattttctattccaagactcatatgaatgtgcttggtgtcctggggcctttagatcct caatggctggttgagaacaacatcactgggtgtccacaccctcacatcccttcagaccac ttctcactgttaacacaacttgaactccaccctccactcctgcctcttgtcaatggtgtt cacttgcctaatcggaggtggtggagtactgccccgccaagacggggatctgttgctatg gacctgtacagttgtaaatcaaattgcctgccctgggcgggggcgagcgcgtccggtttg ctggaagcgttcggaaatggcaacttgcgcggtggaggtgttcgggctcctggaggacga ggtgcggggaccgcggggctgcggggcgggcttcccttcgggttattcagcagccggacc cggggaggtagcggcggcggccacgaccggaggctctgctgtcagcccctcctccctcgg cgcgagcctggggagcgcgcgaggcgccgcggggactgtgggggcggctcggcgcgcggg cgctgggggctcgctggagggagcaagcttgtcgggtccacacccgccccttcccggagc ggccgccgccttacttgtcagtttccttcaggaagtgttgaggagggggctaggcggtcg gcgggggacccacatcaccgcgccagccccttcctgccacccccgccggaattggaaaaa actactttaaagttcatatggaaccaaaaaagagcccgcatcgccaagtcaatcctaagc caaaagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggctaca gtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaaca gagccctcagaaataatgccacatatctacaactatctgatctttgacaaacctgagaaa aacaagcaatggggaaaggattccctatttaataaatggtgctgggaaaactggctagcc atatgtagaaagctgaaactggatcccttccttacaccttatacaaaaatcaattcaaga tggattaaagatttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggc attaccattcaggacataggcgtgggcaaggacttcatgtccaaaacaccaaaagcaatg gcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcacagca aaagaaactaccatcagagtgaacaggcaacctacaacatgggagaaaattttcgcaacc tactcatctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttacaag aaaaaaacaaacaaccccatcaaaaagtgggcgaaggacatgaacagacacttctcaaaa gaagacatttatgcagccaaaaaacacatgaagaaatgctcatcatcactggccatcaga gaaatgcaaatcaaaaccactatgagatatcatctcacaccagttagaatggcaatcatt aaaaagtcaggaaacaacaggagtctccctgtggccgccacagctgggaatgtgttggat cacacccgaagtcagcataatactgagtcttgcccaaggcctgtggcaactgctgcctgg 
ctaccccgatgtttattcaaggcccaagagctctttagtcagcagatttgtcagccaaat ggagaagaggaaagattaactgaacttaaagatagatcagtagaaattgttacgcagttg gaagatgagagagatgagagagacaaagattggagaaaaacaacaacaacaacaacaaca gagccttga >gi568815583r:55729900_56093555|GENSCAN_predicted_peptide_10|151_aa MKRGYRFHTNENLPKTMPYYEAPKVNALLVEGVQVLGIFHKELDTMHKQSKKRMKQQNQR FTENEIKFGGGTELLKITGANENKTLAGRSVREPSSRGGWQKATDTSLAANWMVPTHMEG GSSSPSPLTQMLISSGNTLTDTPRNNTLPVM >gi568815583r:55729900_56093555|GENSCAN_predicted_CDS_10|456_bp atgaaaagaggatatagatttcatactaatgagaatcttccaaaaactatgccatattat gaagccccaaaggtcaatgcgttactggtcgagggtgtccaggttcttggcatcttccac aaagaattggacacaatgcacaaacaaagcaagaaaagaatgaagcagcaaaaccagaga tttactgaaaacgaaatcaaatttgggggaggaacagagctcttaaagatcacaggggca aatgagaacaagactttggctggaaggagtgtaagagagcccagcagtcggggaggatgg cagaaagccacagacacctctctggcagccaattggatggtgcccacccacatggagggt gggtcttcctctcccagtccgctgactcaaatgttaatctcctctggcaacaccctcaca gacacacccagaaacaatactttaccagttatgtag >gi568815583r:55729900_56093555|GENSCAN_predicted_peptide_11|517_aa MIISIDAEKAFDKIHQPFMLKALNKSGIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLK TGTRQGCPLSPLLFNIVLEVLARATRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVS AQNLLKLISNFSKVSGYKINVQKSQAFLYTNNTQTESQIMSELPFTIASKRIKYLGIQLT TDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIMKMAILPKVLFTLTLQQFINYS SGVPNLALAPVVVSTHESPPGDFETEPWGRMEAWQASMEIVIRGRKDGHLCFKIVEQLVK LMATVTRKIENLMWLKEISKLNAEGANLFILAAYDKIWEGEMNQRKSNSISKENLELIVL DYGLLGRKIKLSPQGFSISCTQTKGTKAIQNEERPLDPQLTMRRQLRNYSAAKMNHLSWK RETVSGWSQQPNNSELSNHLQGAETGLHRETVLPLESWGLAMCAWLDFRIAKDLCLPCAF NLLPLGMGVSLTVISSLSHYDIWGWARAVMVADNLNY >gi568815583r:55729900_56093555|GENSCAN_predicted_CDS_11|1554_bp atgattatctcaatagatgcagaaaaggcctttgacaaaattcaccaacccttcatgcta aaagctctcaataaatcaggtattgatgggacgtatctcaaaataataagagctatctat gacaaacccacagccaatatcatactgaatgggcaaaaactagaagcattccctttgaaa actggcacaagacagggatgccctctctcaccacttctgttcaacatagtgttggaagtt ctggccagggcaactaggcaggagaaagaaataaagggtattcaattaggaaaagaggaa gtcaaattgtccctgtttgcagatgacatgattgtatatctagaaaaccccattgtctca 
gcccaaaatctccttaagctgataagcaatttcagcaaagtctcaggatacaaaatcaat gtacaaaaatcacaagcattcttatacaccaataacacacaaacagagagccaaatcatg agtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttaca acggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagag gatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcatgaaa atggccatactgcccaaggttttgttcacactgactctccagcaattcatcaattacagt tcaggtgttcctaacttggcactggctccagtggtggtttccactcatgagtctccgcct ggtgattttgagactgagccatggggaaggatggaagcatggcaagccagcatggaaatt gttattagaggccgcaaagatggtcatctgtgtttcaagatagtggaacaattagtgaag ctgatggctacagttacacggaagatagaaaacttgatgtggctaaaggaaatctccaag ttgaatgccgaaggggctaatttgtttattttagctgcctatgataaaatatgggaagga gagatgaaccaaaggaagagcaattccatttccaaggagaatttagagttaatagttctg gactatggattgctgggacggaaaataaaactctctccccaggggttctcaataagttgt acacagactaaaggaaccaaggcaattcaaaatgaagagagacctctggatccccaactt accatgagacgccaactgaggaattactcagctgcaaaaatgaaccatttgtcatggaaa agggaaacggtctcaggatggagccaacagccaaataacagcgaattaagtaaccatttg cagggagcagaaacaggccttcatcgagaaacagtcctgcctctggagtcgtggggactg gcaatgtgtgcctggctggatttcagaattgctaaggatctgtgcctgccatgtgccttc aatctcctccccttgggaatgggagtatctcttacagttatctcatcgctttcacactat gatatctggggttgggcgagggcagtgatggtggcagataacttgaattattag >gi568815583r:55729900_56093555|GENSCAN_predicted_peptide_12|65_aa MNSWNILVYNSKEASKDYRGHVKKTQEPTCRSCHWTQMGQCSFLMMEKNDFGHQTTVATL DYRNI >gi568815583r:55729900_56093555|GENSCAN_predicted_CDS_12|198_bp atgaactcatggaacatcttggtgtacaatagcaaggaagcttccaaagactatagaggt catgtcaaaaagactcaagagccaacttgcagaagctgccactggacacagatgggacaa tgtagttttctaatgatggaaaaaaatgactttggtcatcaaactactgtggccacatta gattaccggaacatctaa