GENSCAN 1.0 Date run: 5-Jan-118 Time: 10:03:58 Sequence gi568815575r:154517764_154719218 : 201455 bp : 48.84% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.17 PlyA - 5041 5036 6 1.05 1.16 Term - 14327 14237 91 1 1 71 52 138 0.930 5.69 1.15 Intr - 14517 14425 93 2 0 53 94 99 0.985 6.08 1.14 Intr - 14699 14623 77 2 2 80 94 186 0.999 16.71 1.13 Intr - 15039 14774 266 1 2 64 47 659 0.865 56.63 1.12 Intr - 15365 15179 187 2 1 90 78 285 0.994 26.96 1.11 Intr - 15906 15813 94 2 1 105 97 162 0.999 18.77 1.10 Intr - 16397 16272 126 1 0 100 97 271 0.992 28.99 1.09 Intr - 16773 16575 199 1 1 47 59 384 0.712 29.81 1.08 Intr - 17016 16921 96 1 0 114 39 44 0.550 2.08 1.07 Intr - 17367 17102 266 2 2 78 19 126 0.767 1.86 1.06 Intr - 17622 17405 218 0 2 99 67 419 0.999 38.20 1.05 Intr - 18282 18174 109 2 1 76 98 238 0.998 23.89 1.04 Intr - 18415 18378 38 1 2 114 119 71 0.998 9.86 1.03 Intr - 28400 28273 128 0 2 79 109 183 0.689 19.90 1.02 Intr - 29134 28989 146 0 2 -4 -2 112 0.304 -7.07 1.01 Init - 29373 29300 74 0 2 89 55 109 0.468 8.26 1.00 Prom - 31184 31145 40 -7.76 2.00 Prom + 33597 33636 40 -8.76 2.01 Init + 33823 33879 57 0 0 74 37 19 0.277 -3.16 2.02 Intr + 34225 34426 202 0 1 128 78 90 0.684 10.86 2.03 Intr + 38402 38613 212 0 2 99 66 315 0.927 29.03 2.04 Intr + 40769 40887 119 1 2 48 109 110 0.798 8.36 2.05 Intr + 42644 42796 153 2 0 94 81 334 0.999 32.49 2.06 Intr + 43925 44021 97 2 1 36 72 179 0.982 11.11 2.07 Intr + 45047 45190 144 1 0 99 94 283 0.999 30.48 2.08 Intr + 45796 45938 143 0 2 37 80 275 0.968 20.75 2.09 Intr + 46196 46257 62 2 2 68 105 49 0.961 2.98 2.10 Term + 46556 46698 143 0 2 110 49 118 0.852 8.19 2.11 PlyA + 56524 56529 6 1.05 3.00 Prom + 60917 60956 40 -2.76 3.01 Init + 62354 62408 55 1 1 100 61 33 0.862 2.35 3.02 Intr + 63082 63645 564 2 0 11 7 488 0.822 25.77 3.03 Intr + 63689 64000 312 0 0 113 76 131 0.939 10.56 3.04 Term + 64054 64235 182 2 2 -49 33 225 0.887 1.07 3.05 PlyA + 64301 64306 6 1.05 4.02 PlyA - 66073 66068 6 -0.45 4.01 Sngl - 67638 67348 291 0 0 107 50 225 0.188 14.25 4.00 Prom - 70484 70445 40 -2.36 5.03 PlyA - 70856 70851 6 1.05 5.02 Term - 86492 86372 121 1 1 89 54 111 0.952 5.75 5.01 Init - 88282 88197 86 1 2 85 113 56 0.997 8.09 5.00 Prom - 92347 92308 40 -2.76 6.09 PlyA - 92521 92516 6 -1.95 6.08 Term - 93767 93534 234 2 0 44 48 220 0.938 9.92 6.07 Intr - 95232 94628 605 1 2 58 56 174 0.546 3.40 6.06 Intr - 96089 95954 136 2 1 80 58 60 0.773 2.34 6.05 Intr - 96736 96419 318 1 0 24 75 149 0.715 3.35 6.04 Intr - 102671 102558 114 2 0 102 81 39 0.041 5.24 6.03 Intr - 105211 104799 413 2 2 113 105 150 0.020 13.11 6.02 Intr - 105818 105255 564 0 0 11 7 488 0.822 25.77 6.01 Init - 106546 106492 55 1 1 100 61 33 0.861 2.35 6.00 Prom - 107983 107944 40 -2.76 7.10 PlyA - 108713 108708 6 1.05 7.09 Term - 122355 122213 143 1 2 110 49 118 0.804 8.19 7.08 Intr - 122715 122654 62 2 2 68 105 49 0.963 2.98 7.07 Intr - 123115 122973 143 1 2 37 80 275 0.968 20.75 7.06 Intr - 123865 123722 144 1 0 99 94 283 0.999 30.48 7.05 Intr - 124987 124891 97 0 1 36 72 179 0.982 11.11 7.04 Intr - 126268 126116 153 0 0 94 81 334 0.999 32.49 7.03 Intr - 128143 128025 119 1 2 48 109 110 0.808 8.36 7.02 Intr - 130510 130299 212 2 2 99 66 315 0.921 29.03 7.01 Init - 131426 131336 91 2 1 71 105 18 0.280 2.48 7.00 Prom - 132929 132890 40 -7.76 8.00 Prom + 133306 133345 40 -2.46 8.01 Init + 135558 135787 230 2 2 96 77 229 0.842 18.45 8.02 Term + 138104 138455 352 2 1 55 47 163 0.513 2.86 8.03 PlyA + 138606 138611 6 1.05 9.00 Prom + 138887 138926 40 -1.76 9.01 Init + 139028 139138 111 1 0 88 99 82 0.673 9.51 9.02 Term + 139454 139456 3 1 0 109 47 0 0.730 -5.10 9.03 PlyA + 139762 139767 6 -0.45 10.05 PlyA - 139884 139879 6 -0.45 10.04 Term - 141565 141479 87 1 0 67 46 65 0.589 -2.14 10.03 Intr - 141907 141632 276 1 0 54 91 253 0.502 19.91 10.02 Intr - 149243 148203 1041 2 0 32 80 398 0.287 24.07 10.01 Init - 152689 152465 225 1 0 88 72 118 0.266 8.67 10.00 Prom - 154261 154222 40 -5.36 11.02 PlyA - 154398 154393 6 -0.45 11.01 Sngl - 154656 154408 249 0 0 66 42 685 0.969 56.78 11.00 Prom - 157718 157679 40 -3.36 12.04 PlyA - 158405 158400 6 -0.45 12.03 Term - 160531 160415 117 1 0 110 54 81 0.967 5.44 12.02 Intr - 162485 162369 117 2 0 98 91 89 0.972 10.86 12.01 Init - 167094 167029 66 0 0 49 92 29 0.753 0.47 12.00 Prom - 167644 167605 40 -5.76 13.00 Prom + 168234 168273 40 -5.66 13.01 Sngl + 171228 173450 2223 2 0 37 47 548 0.984 39.33 13.02 PlyA + 173659 173664 6 1.05 14.08 PlyA - 176138 176133 6 1.05 14.07 Term - 179088 178899 190 2 1 88 42 62 0.622 -1.48 14.06 Intr - 179450 179369 82 0 1 59 91 80 0.948 4.10 14.05 Intr - 181750 181531 220 1 1 85 75 107 0.974 6.97 14.04 Intr - 182296 182241 56 2 2 96 111 22 0.987 4.00 14.03 Intr - 195660 195520 141 1 0 44 81 104 0.680 5.62 14.02 Intr - 198566 198263 304 2 1 98 110 404 0.966 39.76 14.01 Intr - 199957 199886 72 1 0 71 101 16 0.590 0.80 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 104846 104665 182 0 2 -49 33 225 0.886 1.07 S.002 Intr - 105211 104900 312 2 0 113 76 131 0.939 10.56 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:154517764_154719218|GENSCAN_predicted_peptide_1|735_aa MGMREHYAELHPCPPELGMQSSGSGRPPPPPPIKWAGGAQPPETVVHFGAASAEGDDDEA QVTGRAGAAQAEEHSVMAEQVALSRTQVCGILREELFQGDAFHQSDTHIFIIMGASGDLA KKKIYPTIWWLFRDGLLPENTFIVGYARSRLTVADIRKQSEPFFKATPEEKLKLEDFFAR NSYVAGQYDDAASYQRLNSHMNALHLGSQANRLFYLALPPTVYEAVTKNIHESCMSQMLA QAVLPPLYERVRGRGSGQHPWCRGHPQRDHKVAALLHETPPFRSASPKARPGRRVAALLC ECSMARAGWFPNPARGSCPLAGFECGGLQATQTHRGPSMRQEGNGPPAACQQCHPGTQEG FKGVTQLRAPSRGWNRIIVEKPFGRDLQSSDRLSNHISSLFREDQIYRIDHYLGKEMVQN LMVLRFANRIFGPIWNRDNIACVILTFKEPFGTEGRGGYFDEFGIIRDVMQNHLLQMLCL VAMEKPASTNSDDVRDEKVKVLKCISEVQANNVVLGQYVGNPDGEGEATKGYLDDPTVPR GSTTATFAAVVLYVENERWDGVPFILRCGKALNERKAEVRLQFHDVAGDIFHQQCKRNEL VIRVQPNEAVYTKMMTKKPGMFFNPEESELDLTYGNRYKVPYREGAVWRNVKLPDAYERL ILDVFCGSQMHFVRSDELREAWRIFTPLLHQIELEKPKPIPYIYGSRGPTEADELMKRVG FQYEGTYKWVNPHKL >gi568815575r:154517764_154719218|GENSCAN_predicted_CDS_1|2208_bp atggggatgcgggagcactacgcggagctgcacccgtgcccgccggaattggggatgcag agcagcggcagcgggcgcccgcccccgcccccgccgattaaatgggccggcggggctcag cccccggaaacggtcgtacacttcggggctgcgagcgcggagggcgacgacgacgaagcg caggtaaccggccgggcgggcgccgcgcaggcggaggagcacagcgtcatggcagagcag gtggccctgagccggacccaggtgtgcgggatcctgcgggaagagcttttccagggcgat gccttccatcagtcggatacacacatattcatcatcatgggtgcatcgggtgacctggcc aagaagaagatctaccccaccatctggtggctgttccgggatggccttctgcccgaaaac accttcatcgtgggctatgcccgttcccgcctcacagtggctgacatccgcaaacagagt gagcccttcttcaaggccaccccagaggagaagctcaagctggaggacttctttgcccgc aactcctatgtggctggccagtacgatgatgcagcctcctaccagcgcctcaacagccac atgaatgccctccacctggggtcacaggccaaccgcctcttctacctggccttgcccccg accgtctacgaggccgtcaccaagaacattcacgagtcctgcatgagccagatgctggcc caggcagtgctcccaccactctatgagcgtgtccggggccggggatctgggcagcatcca tggtgccggggccatccccagcgggaccacaaggtggcagcgttgctccacgaaacaccg cctttccgctctgcttccccaaaggcccggccaggccgcagggtggcagccttgctctgc gaatgcagcatggcccgcgctgggtggtttcccaacccagccagaggctcttgtcctctg gctggttttgaatgcgggggccttcaggccactcagacccaccggggacccagcatgagg caggaggggaacgggcccccggcagcatgccagcaatgccaccctggcacccaggagggg ttcaagggggtaacgcagctccgggctcccagcagaggctggaaccgcatcatcgtggag aagcccttcgggagggacctgcagagctctgaccggctgtccaaccacatctcctccctg ttccgtgaggaccagatctaccgcatcgaccactacctgggcaaggagatggtgcagaac ctcatggtgctgagatttgccaacaggatcttcggccccatctggaaccgggacaacatc gcctgcgttatcctcaccttcaaggagccctttggcactgagggtcgcgggggctatttc gatgaatttgggatcatccgggacgtgatgcagaaccacctactgcagatgctgtgtctg gtggccatggagaagcccgcctccaccaactcagatgacgtccgtgatgagaaggtcaag gtgttgaaatgcatctcagaggtgcaggccaacaatgtggtcctgggccagtacgtgggg aaccccgatggagagggcgaggccaccaaagggtacctggacgaccccacggtgccccgc gggtccaccaccgccacttttgcagccgtcgtcctctatgtggagaatgagaggtgggat ggggtgcccttcatcctgcgctgcggcaaggccctgaacgagcgcaaggccgaggtgagg ctgcagttccatgatgtggccggcgacatcttccaccagcagtgcaagcgcaacgagctg gtgatccgcgtgcagcccaacgaggccgtgtacaccaagatgatgaccaagaagccgggc atgttcttcaaccccgaggagtcggagctggacctgacctacggcaacagatacaaggtg ccctacagagaaggagcagtgtggaggaacgtgaagctccctgacgcctatgagcgcctc atcctggacgtcttctgcgggagccagatgcacttcgtgcgcagcgacgagctccgtgag gcctggcgtattttcaccccactgctgcaccagattgagctggagaagcccaagcccatc ccctatatttatggcagccgaggccccacggaggcagacgagctgatgaagagagtgggt ttccagtatgagggcacctacaagtgggtgaacccccacaagctctga >gi568815575r:154517764_154719218|GENSCAN_predicted_peptide_2|443_aa MGISFRGATIQPTTLDEGPPLPCWMNRHLWKSQLCEMVQPSGGPAADQDVLGEESPLGKP AMLHLPSEQGAPETLQRCLEENQELRDAIRQSNQILRERCEELLHFQASQREEKEFLMCK FQEARKLVERLGLEKLDLKRQKEQALREVEHLKRCQQQMAEDKASVKAQVTSLLGELQES QSRLEAATKECQALEGRARAASEQARQLESEREALQQQHSVQVDQLRMQGQSVEAALRME RQAASEEKRKLAQLQVAYHQLFQEYDNHIKSSVVGSERKRGMQLEDLKQQLQQAEEALVA KQEVIDKLKEEAEQHKIVMETVPVLKAQADIYKADFQAERQAREKLAEKKELLQEQLEQL QREYSKLKASCQESARIEDMRKRHVEVSQAPLPPAPAYLSSPLALPSQRRSPPEEPPDFC CPKCQYQAPDMDTLQIHVMECIE >gi568815575r:154517764_154719218|GENSCAN_predicted_CDS_2|1332_bp atgggcatatcttttcggggggctaccattcagcccactacactggatgaggggccgccc ttgccctgttggatgaataggcacctctggaagagccaactgtgtgagatggtgcagccc agtggtggcccggcagcagatcaggacgtactgggcgaagagtctcctctggggaagcca gccatgctgcacctgccttcagaacagggcgctcctgagaccctccagcgctgcctggag gagaatcaagagctccgagatgccatccggcagagcaaccagattctgcgggagcgctgc gaggagcttctgcatttccaagccagccagagggaggagaaggagttcctcatgtgcaag ttccaggaggccaggaaactggtggagagactcggcctggagaagctcgatctgaagagg cagaaggagcaggctctgcgggaggtggagcacctgaagagatgccagcagcagatggct gaggacaaggcctctgtgaaagcccaggtgacgtccttgctcggggagctgcaggagagc cagagtcgcttggaggctgccactaaggaatgccaggctctggagggtcgggcccgggcg gccagcgagcaggcgcggcagctggagagtgagcgcgaggcgctgcagcagcagcacagc gtgcaggtggaccagctgcgcatgcagggccagagcgtggaggccgcgctccgcatggag cgccaggccgcctcggaggagaagaggaagctggcccagttgcaggtggcctatcaccag ctcttccaagaatacgacaaccacatcaagagcagcgtggtgggcagtgagcggaagcga ggaatgcagctggaagatctcaaacagcagctccagcaggccgaggaggccctggtggcc aaacaggaggtgatcgataagctgaaggaggaggccgagcagcacaagattgtgatggag accgttccggtgctgaaggcccaggcggatatctacaaggcggacttccaggctgagagg caggcccgggagaagctggccgagaagaaggagctcctgcaggagcagctggagcagctg cagagggagtacagcaaactgaaggccagctgtcaggagtcggccaggatcgaggacatg aggaagcggcatgtcgaggtctcccaggcccccttgccccccgcccctgcctacctctcc tctcccctggccctgcccagccagaggaggagcccccccgaggagccacctgacttctgc tgtcccaagtgccagtatcaggcccctgatatggacaccctgcagatacatgtcatggag tgcattgagtag >gi568815575r:154517764_154719218|GENSCAN_predicted_peptide_3|370_aa MATRAPDTPPAAALEGASSFSSSIAVTDKDTFELSTFVDSSKALHHDRDELPEQRGVGGG LMSPLDQSGLGTEESLGLLDDYLEVAKHFKPHGFSSDKAKADSSEWLAVDGLVSASNDGK EDAFFGTDWMLEKTDLKEFDFDALLGIDDLETMPDELLATLDDSCDLFAPLVQETIKEPP QSRRLLRSPPQMVNPICHLPESLTRPGVQSSTPDYSFSLELGSEVDIFEGARKPDSTAYI SKIPHCTKEEDAPSDNDSGICMSPESYLGSPSTSRGSPSRSPPSPGVLCGSACPKPYDPP GEKMVAAQVKGTRYRQKKRVEQEVLTGECKAVEKKNEALQERADSLAEEIQYMKDSIEEV CKARGKKRVL >gi568815575r:154517764_154719218|GENSCAN_predicted_CDS_3|1113_bp atggctacaagggccccagatacgcctccggctgctgctctggagggtgcaagctcattc agcagcagcattgctgtaaccgacaaagacaccttcgaattaagcacattcgttgattcc agcaaagcactgcaccatgaccgagatgagcttcctgagcagcgaggtgttggtgggggc ttgatgtccccccttgaccagtcaggtttggggactgaagaaagcctaggtctcttagat gactacctggaggtggccaagcacttcaaacctcatgggttctccagcgacaaggctaag gcagactcctccgagtggctggctgtggatgggttggtcagtgcctccaacgatggcaag gaggatgctttctttgggacagattggatgttggagaaaactgatctgaaggagttcgac tttgatgccctgttgggtatagatgacctggaaaccatgccagacgagcttctggccacg ttggatgactcgtgtgatctctttgcccccctagtccaggagactattaaggagcccccc cagtccaggagactattaaggagccccccccagatggtgaacccgatttgccatctccca gaaagtttaacccgaccaggggtccagtcctccactccagattattcctttagtctagag ctgggcagtgaagtggatatctttgaaggagctaggaagccagactccactgcttacatt tccaagatccctcactgcacaaaggaggaagacgccccctcagataatgatagtggcatc tgtatgagcccagagtcctatctgggctctccctctacctccaggggctctccgagtagg agcccgccatctccaggtgtcctctgtggctctgcctgccccaaaccttatgaccctcct ggagagaagatggtagcagcacaagtaaagggaactaggtatcgccagaagaagagggtg gagcaggaggtcctcactggtgagtgcaaagcggtggaaaagaagaacgaggctctgcaa gagagggcggattccctggccgaggagatccagtacatgaaagattcgatagaagaggtc tgcaaggcaagggggaagaaaagggtcctctag >gi568815575r:154517764_154719218|GENSCAN_predicted_peptide_4|96_aa MRTPRGASSRPRGPCCPCAPGTSAARGTRLSWAASIAPWAIRNARASWAISIARRTPCAP AFGLHGSGASARLSERRSGPTRMRRQRGCRKFRPLA >gi568815575r:154517764_154719218|GENSCAN_predicted_CDS_4|291_bp atgcggaccccgcggggcgcctcctcccggccccgaggcccttgctgcccctgcgccccg gggacctctgccgcccgtggcacccgcctctcctgggccgccagcattgccccctgggcc atcaggaatgccagggcctcctgggccatcagcatcgcccgtcgaaccccctgtgccccg gccttcggcctgcatggctccggagcctctgcccggctctcagagagaaggtcagggccc acgaggatgcggaggcagagaggctgcaggaagttccgccccctggcgtga >gi568815575r:154517764_154719218|GENSCAN_predicted_peptide_5|68_aa MTLRASAGEPDENSQQTCIVWKEVRSQKRSFDSAGVDFTPDSRWNDSSFLSHYVMPAFRQ FLPEVDVV >gi568815575r:154517764_154719218|GENSCAN_predicted_CDS_5|207_bp atgactctgagagcatcagctggggaaccggatgaaaattcacagcagacgtgcattgtt tggaaggaggtgaggagccagaaacgcagttttgattctgctggagttgacttcacccca gattccaggtggaatgactcatccttcctctctcactatgtgatgccagctttccgccag ttcctgccagaagtagatgtggtctga >gi568815575r:154517764_154719218|GENSCAN_predicted_peptide_6|812_aa MATRAPDTPPAAALEGASSFSSSIAVTDKDTFELSTFVDSSKALHHDRDELPEQRGVGGG LMSPLDQSGLGTEESLGLLDDYLEVAKHFKPHGFSSDKAKADSSEWLAVDGLVSASNDGK EDAFFGTDWMLEKTDLKEFDFDALLGIDDLETMPDELLATLDDSCDLFAPLVQETIKEPP QSRRLLRSPPQMVNPICHLPESLTRPGVQSSTPDYSFSLELGSEVDIFEGARKPDSTAYI SKIPHCTKEEDAPSDNDSGICMSPESYLGSPSTSRGSPSRSPPSPGVLCGSACPKPYDPP GEKMVAAQVKGEKLDKKLKKKWSKTKQQELGIARRRGWSRRSSLRLVTIILLYLQKINVF SFHMSANMLFPGAEAADSWIWKFPEQGTLDLKDWEKIGKELKQASREGKIIPLTVCNDWA IIKAALEPFQTEEDSVLVSDAPESCVIDCEEEAGTEFKKGTESSHCENVAESVMARSTQS VDYNQLQENSPVFVIQKKSGRWRMLTDLRAANAVIQPMGALQRRLPSPAVIPKGIIVQNT DLVEWSFLPHSTIKTFTLYLDQMATLIGQARLRIVKLCGSDPDKIIVPLNKEQVTQAFIN SGALQIGLADFVGIIDNHYPKTKTFQFLKLTTWILPKITRHTPLENALTVFTDGSSNGKV AYTRPKKRVTETQYHSAQRAELVAVISVLQDFNQLINLVSDSAYVVQATKDVETALVKYS MDDRLHQLFNLLQQTEKSQLPVWIPTRHLKFYNEPIGNAKKSASAETENPQSSIIDSPGE QNGGIRRTDEVDIHQGSGAADLGPTKEVDTVS >gi568815575r:154517764_154719218|GENSCAN_predicted_CDS_6|2439_bp atggctacaagggccccagatacgcctccggctgctgctctggagggtgcaagctcattc agcagcagcattgctgtaaccgacaaagacaccttcgaattaagcacattcgttgattcc agcaaagcactgcaccatgaccgagatgagcttcctgagcagcgaggtgttggtgggggc ttgatgtccccccttgaccagtcaggtttggggactgaagaaagcctaggtctcttagat gactacctggaggtggccaagcacttcaaacctcatgggttctccagcgacaaggctaag gcagactcctccgagtggctggctgtggatgggttggtcagtgcctccaacgatggcaag gaggatgctttctttgggacagattggatgttggagaaaactgatctgaaggagttcgac tttgatgccctgttgggtatagatgacctggaaaccatgccagacgagcttctggccacg ttggatgactcgtgtgatctctttgcccccctagtccaggagactattaaggagcccccc cagtccaggagactattaaggagccccccccagatggtgaacccgatttgccatctccca gaaagtttaacccgaccaggggtccagtcctccactccagattattcctttagtctagag ctgggcagtgaagtggatatctttgaaggagctaggaagccagactccactgcttacatt tccaagatccctcactgcacaaaggaggaagacgccccctcagataatgatagtggcatc tgtatgagcccagagtcctatctgggctctccctctacctccaggggctctccgagtagg agcccgccatctccaggtgtcctctgtggctctgcctgccccaaaccttatgaccctcct ggagagaagatggtagcagcacaagtaaagggtgagaaactggataagaagctgaaaaaa aaatggagcaaaaccaaacagcaggaactaggtatcgccagaagaagagggtggagcagg aggtcctcactgcgtctggtaaccatcattctgctctacctccaaaagatcaacgttttt agtttccacatgagtgccaatatgcttttcccaggtgcagaggctgctgattcctggatt tggaagtttccagaacagggaactttagatctaaaagactgggaaaaaattggcaaagaa ttaaaacaagcaagtagggaaggcaaaatcatcccgcttacagtatgcaatgattgggcc attattaaagcagctttagaaccgtttcaaacagaagaagatagcgttttggtttctgat gcccctgaaagctgtgtaatagattgtgaagaagaggcggggacagagttcaagaaagga acggaaagttcacattgtgaaaatgtagcagagtctgtaatggctcggtcaacacaaagt gttgactacaatcaattacaggagaattcgccagtgtttgtaattcagaaaaaatccggc agatggcgcatgctaaccgacttaagagccgctaatgccgtaattcaacccatgggggct ctccaacgcaggctgccctctccggccgtgatccccaaaggcatcattgttcaaaataca gatcttgtggagtggtccttccttcctcacagtacgattaagacttttacattgtacttg gatcaaatggctacattaattggtcaggcaagactacgaatagtaaaattgtgtggaagt gacccagataaaatcattgttcctttaaacaaggaacaggttacacaagcctttatcaat tctggtgcattgcagattggtcttgctgattttgtgggaattattgacaatcattaccca aaaacaaaaaccttccagtttttaaaattgactacttggattttacctaaaattaccaga catacacctttagaaaatgctctgacagtgtttactgatggttccagcaatggaaaggtg gcttacaccaggccaaaaaaacgagtcactgaaactcaatatcactcagctcaaagagca gagttggttgctgtcatttcagtgttacaagattttaatcagcttattaaccttgtatca gattctgcatatgtagtacaggctacaaaggatgttgagacagccctagtcaaatacagt atggatgatcggttacaccagctgtttaatttgttacaacaaactgaaaaaagtcagctt cctgtttggatacccactagacatttaaagttctacaatgaacccatcggaaatgcaaag aaaagcgcctccgcggagacagaaaacccgcaatcgagcatcatcgactcgccaggtgaa caaaatggtggtatcagaagaacagatgaagttgacatccaccaaggaagtggagccgcc gacctgggcccaactaaagaagttgacacagttagctga >gi568815575r:154517764_154719218|GENSCAN_predicted_peptide_7|387_aa MPPIYTHTGAGMARLVRRGNGRGSHADTQRDAIRQSNQILRERCEELLHFQASQREEKEF LMCKFQEARKLVERLGLEKLDLKRQKEQALREVEHLKRCQQQMAEDKASVKAQVTSLLGE LQESQSRLEAATKECQALEGRARAASEQARQLESEREALQQQHSVQVDQLRMQGQSVEAA LRMERQAASEEKRKLAQLQVAYHQLFQEYDNHIKSSVVGSERKRGMQLEDLKQQLQQAEE ALVAKQEVIDKLKEEAEQHKIVMETVPVLKAQADIYKADFQAERQAREKLAEKKELLQEQ LEQLQREYSKLKASCQESARIEDMRKRHVEVSQAPLPPAPAYLSSPLALPSQRRSPPEEP PDFCCPKCQYQAPDMDTLQIHVMECIE >gi568815575r:154517764_154719218|GENSCAN_predicted_CDS_7|1164_bp atgcccccaatttacacccacacgggcgccggaatggccaggttggtgaggagggggaat gggcgaggctcccatgcagacacccagagagatgccatccggcagagcaaccagattctg cgggagcgctgcgaggagcttctgcatttccaagccagccagagggaggagaaggagttc ctcatgtgcaagttccaggaggccaggaaactggtggagagactcggcctggagaagctc gatctgaagaggcagaaggagcaggctctgcgggaggtggagcacctgaagagatgccag cagcagatggctgaggacaaggcctctgtgaaagcccaggtgacgtccttgctcggggag ctgcaggagagccagagtcgcttggaggctgccactaaggaatgccaggctctggagggt cgggcccgggcggccagcgagcaggcgcggcagctggagagtgagcgcgaggcgctgcag cagcagcacagcgtgcaggtggaccagctgcgcatgcagggccagagcgtggaggccgcg ctccgcatggagcgccaggccgcctcggaggagaagaggaagctggcccagttgcaggtg gcctatcaccagctcttccaagaatacgacaaccacatcaagagcagcgtggtgggcagt gagcggaagcgaggaatgcagctggaagatctcaaacagcagctccagcaggccgaggag gccctggtggccaaacaggaggtgatcgataagctgaaggaggaggccgagcagcacaag attgtgatggagaccgttccggtgctgaaggcccaggcggatatctacaaggcggacttc caggctgagaggcaggcccgggagaagctggccgagaagaaggagctcctgcaggagcag ctggagcagctgcagagggagtacagcaaactgaaggccagctgtcaggagtcggccagg atcgaggacatgaggaagcggcatgtcgaggtctcccaggcccccttgccccccgcccct gcctacctctcctctcccctggccctgcccagccagaggaggagcccccccgaggagcca cctgacttctgctgtcccaagtgccagtatcaggcccctgatatggacaccctgcagata catgtcatggagtgcattgagtag >gi568815575r:154517764_154719218|GENSCAN_predicted_peptide_8|193_aa MRTPRGASSRPRGPCCPCAPGTSAARGTRLSWAASIAPWAIRNARASWAISIARRTPCAP AFGLHGSGASARLSERRAPPVTVISGLLLRLCVRRHLLTSQILLSLWAPCTCLLPLGCWG GDLCALCQRLSCSSQHICFFSCALHRVGLEALGKLYPWTQPPTDGAKSHWIKVLVSHPLK DNSEVRLTVERMK >gi568815575r:154517764_154719218|GENSCAN_predicted_CDS_8|582_bp atgcggaccccgcggggcgcctcctctcggccccgaggcccttgctgcccctgcgccccg gggacctctgccgcccgtggcacccgcctctcctgggccgccagcattgccccctgggcc atcaggaatgccagggcctcctgggccatcagcatcgcccgtcgaaccccctgtgccccg gccttcggcctgcatggctccggagcctctgcccggctctcagagagaagagcccctcca gtgaccgtcatctccggcctcctcctgcgcctctgtgttcgtagacatcttctcacatct cagatcctgctgagcctctgggccccctgcacgtgccttctgcctctcggctgctggggt ggggacctctgtgccctgtgccagcgcctctcctgcagctcacaacacatctgctttttc tcctgcgccctgcacagggtgggcctggaagctctggggaagttgtacccttggacacag ccccccaccgatggggccaagagtcactggataaaagtcttagtctctcaccctttgaag gacaactctgaggtgcggctgactgtggagcgaatgaaatga >gi568815575r:154517764_154719218|GENSCAN_predicted_peptide_9|37_aa MKENEFMSFAETWMKLETIIFSKLTQEQKTKHRMFSL >gi568815575r:154517764_154719218|GENSCAN_predicted_CDS_9|114_bp atgaaagagaatgagttcatgtcgtttgcagagacatggatgaagctggaaaccatcatt ttcagcaaattaacacaggaacagaaaaccaaacaccgcatgttctcactctga >gi568815575r:154517764_154719218|GENSCAN_predicted_peptide_10|542_aa MGRNQSRKAKNSKNESASSPPKDCSSSPAMEQSWMENDFDELTEVGFRRSVITNFSELKE HILTHCKEAKNLERKLISNYSKVSGYKINVQKSQAFLYINSKQKENQIMSEPPFTIATKR IKYLGIQLTRDVKDLFKNYKPLLNKIKEDTNKWKNVPWLWIGRINIVKMAILPKVIYRFN AIPIKLPMTFFTELEKTTLKFTWNQKRACIAKTIISKKNKAGGIMLPDFKLYYKATVTKT AWYWYQNRDIDQWNRTEPSEIMPHIYNHLIFDKPDKKKKWGKDSLFNKWCWENWLAICRK LKLDPFLTPYTKINSRWIKDLNVRPKTKKSLEEILGSTTQDIGMGKDFMTKTPKAIATKA KIDKWDLIKLKSFCTAKETTISVNRQPTEWEKMFAIYPSDKGLISKIYKELKQIYNKQPH QKVQGSVRGSKQLRNGTLVSQFLLKGLRDSKAWRPLLFTTFLLIYIVVVVGSHMFTVDYR RHTPMYFFLGGHSLMDAACISNMVTQVLVHLLAQESFLLTAVAYDSMQLSASHCTTLSSW AD >gi568815575r:154517764_154719218|GENSCAN_predicted_CDS_10|1629_bp atggggagaaaccagagcagaaaagctaaaaattctaaaaacgagagtgcctcttctcct ccaaaggattgcagctcctcgccagcaatggaacaaagctggatggagaatgactttgac gagttgacagaagtaggcttcagaaggtcagtaataacaaacttctctgagctaaaggag catattctaacccattgcaaggaagctaaaaaccttgaaagaaagctgataagcaactac agcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattcctatacatcaat agcaaacaaaaggagaaccaaatcatgagtgaacccccattcacaattgctacaaagaga ataaaatacctaggaatacaacttacaagggatgtgaaggacctcttcaagaactacaaa ccactgctcaacaaaataaaagaggacacaaacaaatggaagaatgttccatggttatgg ataggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttatagattcaat gccatccccatcaagctaccaatgactttcttcacagaattggaaaaaactactttaaag ttcacatggaaccaaaaaagagcctgcattgccaagacaatcataagcaaaaagaacaaa gctggaggcatcatgctacctgacttcaaactatactacaaggctacagtaaccaaaaca gcatggtactggtaccaaaacagagatatagaccaatggaacagaacagagccctcagaa ataatgccacacatctacaaccatctgatctttgacaaacctgacaaaaagaagaaatgg ggaaaggattccctatttaataaatggtgctgggaaaactggctagccatatgtagaaag ctgaaattggatcccttccttacaccttatacaaaaattaactcaagatggattaaagac ttaaatgttagacctaaaaccaaaaaatctctagaagaaatcctaggcagtaccactcag gacataggcatgggcaaggacttcatgactaaaacaccaaaagcaatagcaaccaaagcc aaaattgacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaaacaacc atcagtgtgaacaggcaacctacagaatgggagaaaatgtttgcaatctacccatctgac aaagggctaatatccaaaatctacaaagaacttaaacaaatttacaacaaacaaccccat caaaaagtgcagggttctgtgcgtggttccaagcagctgcggaatgggaccctagtgtcc cagtttcttctgaaaggcctgagggacagcaaggcttggaggcccctgctgttcaccacc tttctgctaatctacatagtggttgtggttgggagccacatgttcacagtggactaccga cgccacactcccatgtacttcttcttgggcggccactcgctgatggatgccgcctgtatc tccaacatggtgactcaggtgctggtgcatttgctggctcaggagtccttcctcctcaca gccgtggcctatgattctatgcagctatctgccagccattgcactactttgtcctcgtgg gccgactga >gi568815575r:154517764_154719218|GENSCAN_predicted_peptide_11|82_aa MKKKKKKEEEEEEEEEEEEEEEEEEEEEEGEEEEEEEEEEEEEEEEEEEEEEEEEEEEER RRRELNDRTFEIIQSKEQKKGS >gi568815575r:154517764_154719218|GENSCAN_predicted_CDS_11|249_bp atgaagaagaagaagaagaaagaagaagaagaagaagaagaagaagaagaagaagaagaa gaagaagaagaagaagaagaagaagaaggggaggaagaagaagaagaagaagaagaagaa gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaaaga agaagaagggagctcaatgacaggacatttgaaattatccaatcaaaggagcaaaaaaaa ggatcataa >gi568815575r:154517764_154719218|GENSCAN_predicted_peptide_12|99_aa MLELLTVHERYHSSTSPKTYKQEEHRTASSLSSGALTWTKKFSLDYLALDFNSASPAPMQ QKLLLSEEQRVDYVQVDEQKTQALQSTKQEWTDERQSKV >gi568815575r:154517764_154719218|GENSCAN_predicted_CDS_12|300_bp atgctagagttgctcacagttcatgaaaggtaccattcctccacaagcccaaagacttac aagcaggaggagcaccgaacagccagttccctgagcagtggtgcccttacgtggacaaag aaattcagcctagattatttggccctggacttcaattcagcatcaccagcccccatgcag cagaaacttctcctttcagaagaacaaagagtagactatgtccaagtggatgagcagaag acacaggctctccagagcacaaaacaggagtggacggatgaaaggcaatccaaagtatga >gi568815575r:154517764_154719218|GENSCAN_predicted_peptide_13|740_aa MNIDAKILNKILAKRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSVNVIQHINRTKDKNHM IISIDAEKAFDKIQQPFMLKSLNKLGIDGTDLKIIRAIYDKPTANIILNGQKLEAFPLKT GTRQGCPLSPLLFNIVLEVLARAVRQEKEIKAIQLGKEEVKLSLFADDMIVYLENPIVSA QNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTTASKRIKYLGIQLTR DMKDLFRENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMP FFTELEKTTLKFIWNQKRACIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRD IDQWNRTEPSETTPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTP YTKINSRWIKDLSVRPKTIKALEENLGITIQDTGMGKDFMSKTPKAMATKAKIDKSELIK LKSFCTAKETTIRVNRQPTKWEKTFATYSSDNGLISRIYKELKQIYKEKTNNPIKKWAKD MNRHFAKEDIYAAKKHMKKCSPSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRCWRGCG EIGTLLHCWWDCKLVQPLWKSVWRFLRDLELEIPFDPVIPLLGIYPKDYKSCCYKDTCTR MFIAALFTIAKTWNQAKCPTMTDWIKKMWHIYAMEHYAAIKNDEFMSFVGTWMKLEIIIL SKLSQGQKTKHRMFSLIDGN >gi568815575r:154517764_154719218|GENSCAN_predicted_CDS_13|2223_bp atgaacattgatgcaaaaatcctcaataaaatactggcaaaacgaatccagcagcacatc aaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaat atacgcaaatcagtaaatgtaatccagcatataaacagaaccaaagacaaaaaccacatg attatctcaatagatgcagaaaaggcctttgacaaaattcaacagcccttcatgctaaaa tctctcaataaattaggtattgatgggacggatctcaaaataataagagctatctatgac aaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaact ggtacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagttctt gccagggcagttaggcaggagaaggaaataaaggctattcaattaggaaaagaggaagtc aaattgtccctgtttgcagacgacatgattgtatatctagaaaaccccattgtctcagcc caaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtg caaaaatcacaagcattcttatacaccaacaacagacaaacggagagccaaatcatgagt gaactcccattcacaactgcttcaaagagaataaaatacctaggaatccaacttacaagg gatatgaaggacctcttcagggagaactacaaaccactgctcaaggaaataaaagaggat acaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatg gccatactgcccaaggtaatttacagattcaatgccatccccatcaagctaccaatgcct ttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcctgc atcgccaagtcaatcctgagccaaaagaacaaagctggaggcatcacactacctgacttc aaactgtactacaaagctacagtaaccaaaacagcatggtactggtaccaaaacagagat atagatcaatggaacagaacagagccctcagaaacaacgccgcatatctacaactatctg atctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatgg tgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttacacct tatacaaaaattaattcaagatggattaaagacttaagcgttagacctaaaaccataaaa gccctagaagaaaacctaggcattaccattcaggacacaggcatgggcaaggacttcatg tctaaaacaccaaaagcaatggcaaccaaagccaaaattgacaaatcggaactaattaaa ctaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctacaaaa tgggagaaaactttcgcaacctactcatctgacaatgggctaatatccagaatctacaaa gaactcaaacaaatttacaaggaaaaaacaaacaaccccatcaaaaaatgggcaaaggac atgaacagacacttcgcaaaagaagatatttatgcagctaaaaaacacatgaaaaaatgc tcaccatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctcaca ccagttagaatggcaatcattaaaaagtcaggaaacaataggtgctggagaggatgtgga gaaataggaacacttttacactgttggtgggactgtaaactagttcaaccattgtggaag tcagtgtggcgattcctcagggatctagaactagaaataccatttgacccagtgatccca ttactgggtatatacccaaaggactataaatcatgctgctataaagacacatgcacacgt atgtttattgcagcattattcacaatagcaaagacttggaaccaagccaaatgtccaaca atgacagactggattaagaaaatgtggcacatatacgccatggaacactatgcagccata aaaaatgatgagttcatgtcctttgtagggacatggatgaaattggaaatcatcattctc agtaaactatcgcaaggacaaaaaaccaaacaccgcatgttctcactcatagatgggaat tga >gi568815575r:154517764_154719218|GENSCAN_predicted_peptide_14|354_aa ARDLHPFFVLSQDGTVYIKNSKSMAWRKRWFVLRRGRMSGNPDVLEYYRNKHSSKPIRVI DLSECAVWKHVGPSFVRKEFQNNFVFIVKTTSRTFYLVAKTEQEMQVWVHSISQVCNLGH LEDGADSMESLSYTPSSLQPSSASSLLTAHAASSSLPRDDPNTNAVATEETRTDVEGQSL RHRDKRLSLNLPCRFSPMYPTASASIEDSYVPMSPQAGASGLGPHCSPDDYIPMNSGSIS SPLPELPANLEPPPVNRDLKPQRKSRPPPLDLRNLSIIREHASLTRTRTVPCILFYCTIL MVKDSCTIPPELVCMLQLQYQQMGSFSVPIQISWGERIVFPSLSGTGYGPSPTL >gi568815575r:154517764_154719218|GENSCAN_predicted_CDS_14|1065_bp gccagagatctgcaccctttctttgtactgagccaagatggtactgtctacatcaaaaat tctaagtcaatggcctggcgcaagcgctggtttgtcctccggcgaggccgcatgagcggc aaccccgatgtcttggagtactacaggaacaagcactccagcaagcccatccgggtgata gacctcagcgagtgtgcagtgtggaagcatgtgggccccagctttgttcggaaggaattt cagaataatttcgtgttcattgtcaagactacttcccgtacattctacctggtggccaaa actgagcaagaaatgcaggtgtgggtgcacagcatcagtcaggtctgcaaccttggccac ctggaggatggtgcagattccatggagagcctctcttacacgccctcctccctgcagcca tcctctgccagctcccttcttaccgcccatgctgccagctcctctttgccaagagatgac ccaaacactaatgccgtagccactgaggaaaccagaactgatgtagaaggccaatcctta agacaccgagacaagcggcttagtttgaatttgccatgcaggttctccccgatgtacccc acagcttcagccagtatcgaagacagctatgtgcccatgagcccccaggctggtgcctct ggtcttggaccccactgcagccctgatgactacattccaatgaactcaggaagcatctca agcccgttgcctgagctgcctgcaaacctggaacctcccccagtgaatagagatctcaag cctcagaggaaatcacggccacctcctctggacctgagaaacctctcgatcatccgggaa catgcatctcttaccaggacccgcactgtgccttgtattcttttctattgcacaatcctc atggtcaaagatagctgtaccataccccctgaacttgtttgcatgctacaactccagtac caacagatgggctctttttcagtgccaattcagatttcctggggagagagaattgtgttc cccagtctcagtggaactggctatggcccaagcccaactttgtga