GENSCAN 1.0 Date run: 5-Nov-116 Time: 08:32:41 Sequence gi568815595r:142211574_142547944 : 336371 bp : 38.07% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 4119 4026 94 0 1 100 86 62 0.632 5.82 1.01 Init - 13882 13736 147 1 0 58 57 227 0.716 16.74 1.00 Prom - 28064 28025 40 -6.15 2.00 Prom + 29541 29580 40 -6.55 2.01 Init + 33769 33889 121 0 1 90 64 75 0.525 5.70 2.02 Intr + 40203 40364 162 1 0 56 17 138 0.616 2.53 2.03 Term + 43637 44004 368 0 2 38 36 226 0.791 6.18 2.04 PlyA + 44142 44147 6 1.05 3.02 PlyA - 44986 44981 6 1.05 3.01 Sngl - 63382 62771 612 1 0 59 40 213 0.855 9.54 3.00 Prom - 68093 68054 40 -7.35 4.00 Prom + 72411 72450 40 -7.15 4.01 Init + 79894 80546 653 0 2 71 -21 332 0.527 15.41 4.02 Intr + 81933 83999 2067 0 0 43 53 726 0.471 50.97 4.03 Intr + 87366 87535 170 0 2 50 87 49 0.190 -0.33 4.04 Term + 91372 91721 350 2 2 56 47 155 0.733 1.86 4.05 PlyA + 92005 92010 6 1.05 5.22 PlyA - 92461 92456 6 1.05 5.21 Term - 100240 99938 303 1 0 94 32 194 0.799 8.59 5.20 Intr - 101185 101025 161 2 2 42 110 38 0.161 0.19 5.19 Intr - 107121 107019 103 0 1 62 99 39 0.219 1.23 5.18 Intr - 107330 107217 114 2 0 68 91 84 0.135 6.42 5.17 Intr - 118042 117861 182 2 2 57 80 71 0.133 1.87 5.16 Intr - 120961 120802 160 1 1 17 97 113 0.257 3.74 5.15 Intr - 121784 121674 111 2 0 36 93 91 0.397 4.06 5.14 Intr - 123936 123875 62 1 2 89 115 12 0.121 1.53 5.13 Intr - 135769 135661 109 1 1 56 101 17 0.039 -1.26 5.12 Intr - 143923 143828 96 1 0 51 96 67 0.816 3.09 5.11 Intr - 145546 145339 208 0 1 73 71 164 0.941 11.36 5.10 Intr - 148358 148289 70 0 1 73 76 88 0.994 3.42 5.09 Intr - 153606 153474 133 0 1 86 75 178 0.991 15.60 5.08 Intr - 153793 153737 57 1 0 93 89 14 0.489 0.16 5.07 Intr - 159047 158912 136 1 1 68 74 106 0.999 6.85 5.06 Intr - 164371 164225 147 0 0 12 107 139 0.981 6.63 5.05 Intr - 165021 164906 116 0 2 80 106 -7 0.928 -1.37 5.04 Intr - 168607 168509 99 1 0 72 84 58 0.774 3.19 5.03 Intr - 171840 171727 114 0 0 62 24 103 0.634 1.02 5.02 Intr - 173112 172950 163 2 1 68 115 107 0.988 10.46 5.01 Init - 177308 177007 302 2 2 50 100 138 0.803 7.97 5.00 Prom - 178554 178515 40 -3.25 6.14 PlyA - 179416 179411 6 1.05 6.13 Term - 181063 181027 37 0 1 93 37 50 0.138 -3.77 6.12 Intr - 185887 185756 132 0 0 53 79 135 0.607 7.94 6.11 Intr - 188974 188871 104 1 2 68 97 99 0.986 6.85 6.10 Intr - 192199 192101 99 1 0 27 89 109 0.346 4.29 6.09 Intr - 201090 200971 120 0 0 64 78 84 0.604 4.77 6.08 Intr - 202718 202562 157 1 1 86 94 134 0.989 12.89 6.07 Intr - 207036 206931 106 1 1 101 42 106 0.940 5.65 6.06 Intr - 207308 207242 67 2 1 59 110 28 0.953 -0.34 6.05 Intr - 209580 209443 138 0 0 53 34 236 0.772 14.44 6.04 Intr - 209970 209903 68 1 2 63 95 58 0.911 1.71 6.03 Intr - 215268 215171 98 2 2 61 99 55 0.488 2.63 6.02 Intr - 221320 221088 233 1 2 124 66 177 0.539 14.75 6.01 Init - 225739 225728 12 1 0 73 95 13 0.688 0.03 6.00 Prom - 226242 226203 40 -11.14 7.00 Prom + 226566 226605 40 -12.62 7.01 Init + 228205 228523 319 0 1 76 59 164 0.917 9.84 7.02 Term + 228588 229297 710 1 2 18 35 339 0.499 14.28 7.03 PlyA + 233039 233044 6 1.05 8.03 PlyA - 233537 233532 6 1.05 8.02 Term - 235952 235829 124 1 1 92 43 161 0.996 8.88 8.01 Init - 236489 236293 197 2 2 31 80 219 0.996 11.86 8.00 Prom - 237414 237375 40 -8.25 9.12 PlyA - 237497 237492 6 1.05 9.11 Term - 238029 237856 174 0 0 85 44 154 0.403 7.48 9.10 Intr - 241660 241555 106 0 1 52 96 75 0.411 4.00 9.09 Intr - 246182 246031 152 2 2 23 74 158 0.621 5.94 9.08 Intr - 247538 247385 154 1 1 76 95 67 0.975 5.35 9.07 Intr - 247810 247654 157 2 1 106 115 113 0.999 13.95 9.06 Intr - 250517 250367 151 2 1 73 41 157 0.815 8.31 9.05 Intr - 253667 253524 144 2 0 95 53 74 0.924 4.16 9.04 Intr - 254960 254751 210 2 0 83 52 86 0.858 2.69 9.03 Intr - 256495 256361 135 1 0 63 36 114 0.917 3.54 9.02 Intr - 265798 265105 694 0 1 27 86 243 0.142 8.58 9.01 Init - 266511 265853 659 0 2 72 67 318 0.988 22.88 9.00 Prom - 267224 267185 40 -6.15 10.17 PlyA - 267393 267388 6 1.05 10.16 Term - 268067 267619 449 0 2 16 43 225 0.630 5.19 10.15 Intr - 268398 268139 260 2 2 -33 -2 265 0.093 1.58 10.14 Intr - 273709 273567 143 1 2 74 109 97 0.188 8.63 10.13 Intr - 284947 284788 160 0 1 91 90 124 0.999 11.97 10.12 Intr - 285619 285440 180 0 0 74 4 160 0.889 4.26 10.11 Intr - 287201 287024 178 0 1 106 53 52 0.923 1.56 10.10 Intr - 288145 288054 92 0 2 68 98 71 0.986 4.82 10.09 Intr - 291880 291789 92 1 2 69 90 37 0.960 -0.13 10.08 Intr - 293730 293566 165 0 0 4 110 145 0.121 7.54 10.07 Intr - 296536 296358 179 2 2 43 41 102 0.073 -0.28 10.06 Intr - 300897 300687 211 0 1 33 92 103 0.577 2.76 10.05 Intr - 302065 301928 138 1 0 83 78 61 0.944 4.34 10.04 Intr - 311268 311155 114 0 0 -18 110 131 0.833 4.42 10.03 Intr - 312626 312420 207 2 0 84 103 126 0.919 12.05 10.02 Intr - 323632 323507 126 1 0 70 85 169 0.928 14.76 10.01 Init - 331136 331092 45 2 0 61 115 57 0.851 6.43 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 109423 109213 211 1 1 75 88 150 0.800 12.69 S.002 Intr - 257996 257764 233 0 2 91 93 82 0.952 5.57 S.003 Init - 293715 293566 150 0 0 53 110 149 0.878 13.69 S.004 Intr - 303942 303822 121 2 1 45 116 50 0.932 3.08 S.005 Init - 304103 304066 38 2 2 68 121 45 0.861 5.33 S.006 Init - 314305 314216 90 1 0 79 81 70 0.977 5.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:142211574_142547944|GENSCAN_predicted_peptide_1|81_aa MSGLLTDPEQRAQEPRYPGFVLGLDVGSSVIRCHVYDRAARVCGSSVQKVENLYPQIGWV EIDPDVLWIQFVAVIKEAVKX >gi568815595r:142211574_142547944|GENSCAN_predicted_CDS_1|243_bp atgtcggggctgctcacggacccggagcagagagcgcaggagccgcggtaccccggcttc gtgctggggctggatgtgggcagttctgtgatccgctgccacgtctatgaccgggcggcg cgggtctgcggctccagcgtgcagaaggtagaaaatctttatcctcaaattggctgggta gaaattgatcctgatgttctttggattcaatttgttgccgtaataaaagaagcagtcaaa gnn >gi568815595r:142211574_142547944|GENSCAN_predicted_peptide_2|216_aa MTSPSKLNKAPVTNTRVIELCDLSDPEFKIAVLMKLNKIQGPSGKAASGISEGSQGERSF QLNFLIILTEYKLLSRIWRRMGTAAERTAGVAADNAEKAFDKIQHHFMIKTFSKISIEET YLKAIKTVYDKPTANIILNRENSKAFPLRRGTRQGCPLSPLLFNIVLEVLARAIRQEKEI KGIQISKEEVQLSLFADDMIIYLENPKDSSKNLLHR >gi568815595r:142211574_142547944|GENSCAN_predicted_CDS_2|651_bp atgacctcaccaagcaaactaaataaggcaccagtgaccaataccagagtgatagagcta tgtgatctttcagacccagaattcaaaatagctgttttgatgaagctcaacaaaattcaa gggccttcggggaaggcagccagtggaatttcagaggggtcacaaggtgaaagaagtttc caattgaactttctaataattttgaccgagtacaaactgttgagcagaatctggagacga atgggaactgctgcagaaaggacagcaggagttgcagctgacaatgcagaaaaagcattt gacaaaatccagcatcactttatgattaaaaccttcagcaaaatcagcatagaagagaca tacctcaaggcaataaaaaccgtctatgacaaacccacagccaacattatactgaatagg gaaaactcaaaagcattccccctgagaaggggaacaagacaaggatgcccactttcacca cttctattcaacatagtactggaagtgctagccagagcaatcagacaagagaaagaaatt aagggcatccaaattagtaaagaggaagtccagctgtcactgttcgccgatgatatgatt atatacctagaaaaccctaaagactcatccaaaaatctcctacatcgataa >gi568815595r:142211574_142547944|GENSCAN_predicted_peptide_3|203_aa MILYLENHKDSSRKLLEIIKEFSKVSRYKINVHNSVAVLYTNSGPSGESNQELNHFYDSC KKKKKVEIYLTKEAKDLHKENYKTLLKETIDNTNKWKHIPCLWMNRINIVKMTVLPKAIY KFNAIPIKIPPSFFTELEETILKFIWNQKRPHIAKERLSKKNKSGGVTLPDFKLHYKAIV TKTAWYWYKNRHIDQWSKIGTQK >gi568815595r:142211574_142547944|GENSCAN_predicted_CDS_3|612_bp atgatcctttaccttgaaaaccataaagactcctccagaaagctcctagaaattataaaa gaattcagcaaagtttccagatacaagatcaatgtacacaattcagtagctgttctatac accaacagtggaccaagcggagaatcaaatcaagaactcaaccatttttacgatagctgc aaaaaaaaaaaaaaggtagaaatatacctaaccaaggaggcaaaagacctccacaaggaa aactacaaaacactgctgaaagaaaccatagacaacacaaacaaatggaaacacatccca tgcttatggatgaatagaatcaatattgtgaaaatgaccgtactgccaaaagcaatctac aaattcaatgcaattcccatcaaaataccaccatcattcttcacagaattagaagaaaca attctaaaattcatatggaaccaaaaaagaccccacatagccaaagaaaggctaagcaaa aagaacaaatctggaggtgtcacactacctgatttcaaactacactacaaggccatagtc accaaaacagcatggtactggtataaaaataggcacatagaccaatggagcaaaataggg actcagaaataa >gi568815595r:142211574_142547944|GENSCAN_predicted_peptide_4|1079_aa MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDRENGTKLENTLQD IIQENFPNLARQANIQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGWV TLKGKPIRLTAELSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFTDK QMLRDFVTTRPVLKELLKEALNMERNDRHQPLQNHAKILNQEEVESLNRPITGAEIVAII NSLPTKKSQGPDGFTAEFYQRYKEELVPFLLKLFPSIEKEGILTNSFYEASIILIPKLGR DTTKKENFRPISLMNIDAKILNKILANQIQQHIKKLIHHDQVGFIPGMQVWFNICKSINV IQHINRTKDKNHMIISIDAEKAFDKIQQRFMLKTLNKLGIDGMYLKIIRAIYDKPTANII LNGQKLEAFPLKTSTRQGCPLSPLLFNIVLEVLTRAIRQEKEIKGIQLGKEEVKLSLFAD DMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKPQAFLYTNNRQTESQIMSELPFTVA SKRIKYLGIQLTRDVKELFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMATLPKVI YRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKAT VTKTSWYWYQNRDIDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDFLFNKWCWENWLA ICRKLKLDAFLIPYTKINSRWIKDFHVRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAM ATKAKIDKWDLIQLKSFCTAKETTIRVNRQPTKWEKIFATYSSDKGLISRIYNELKQIYK KKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSPSPAIREMQIKTTMRYHLTPVRMMII KKSGNNRFRKRSCPLAAQQGLPLLMVALPSPSLLWINEQRPDLQSHRPHRPQDPMEKRQS ELEVLEFLARAIRQEKEIKGIQIGKEEIKLSLFADDMIVYLENHKDSSRKLLELIKEFSK ISRYEINVHKSVALLHNNSDQAENQNSCKKNKILRSIPNQGVERPLQGKLQNTAERNHR >gi568815595r:142211574_142547944|GENSCAN_predicted_CDS_4|3240_bp atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa agaaatgaacaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagtgacagggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatctagcaaggcaggccaacattcagattcaggaaata cagagaacgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggttgggtt accctcaaagggaagcccatcagactaacagcagagctctcggcagaaactctacaagcc agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttacagacaag caaatgctgagagattttgtcaccaccaggcctgtcctaaaagagctcctgaaggaagcg ctaaacatggaaaggaacgaccggcaccagccactgcaaaatcatgccaaaatactaaac caggaagaagttgaatctctgaatagaccaataacaggagctgaaattgtggcaataatc aatagcttaccaaccaaaaagagtcaaggaccagatggattcacagccgaattctaccag aggtacaaggaggaactggtaccattccttctgaaactattcccatcaatagaaaaagag ggaatcctcactaactcattttatgaggccagcatcatcctgataccaaagctgggcaga gacacaaccaaaaaagagaattttaggccaatatccttgatgaacattgatgcaaaaatc ctcaataaaatactggcaaaccaaatccagcagcacatcaaaaagcttatccaccatgat caagtgggcttcatccctgggatgcaagtctggttcaatatatgcaaatcaataaatgta atccagcatataaacagaaccaaagacaaaaaccacatgattatctcaatagatgcagaa aaggcctttgacaaaattcaacaacgcttcatgctaaaaactctcaataaattaggtatt gatgggatgtatctcaaaataataagagctatctatgacaaacccacagccaatatcata ctgaatgggcaaaaactggaagcattccctttgaaaactagcacaagacagggatgccct ctctcaccactcctattcaacatagtgttggaagttctgaccagggcaattaggcaggag aaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagac gacatgattgtgtatctagaaaaccccatcgtctcagcccaaaatctccttaagctgata agcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaaccacaagcattctta tacaccaacaacagacaaacagagagccaaatcatgagtgaactcccattcacagttgct tcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggaactcttcaag gagaactacaaaccattgctcaaggaaataaaagaggatacaaacaaatggaagaacatt ccatgctcatgggtaggaagaatcaatattgtgaaaatggccacactgcccaaggtaatt tacagattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaa actactttaaagttcatatggaaccaaaaaagagcccgcatcgccaagtcaatcctaagc caaaagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggctaca gtaaccaaaacatcatggtactggtaccaaaacagagatatagatcaatggaacagaaca gagccctcagaaataacgccacatatctacaactatctgatctttgacaaaccggagaaa aacaagcaatggggaaaggatttcctatttaataaatggtgctgggaaaactggctagcc atatgtagaaagctgaaactggatgccttccttataccttatacaaaaatcaattcaaga tggattaaagacttccatgttagacctaaaaccataaaaaccctagaagaaaacctaggc attaccattcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaagcaatg gcaaccaaagccaaaattgacaaatgggatctaattcaactaaagagcttctgcacagca aaagaaactaccatcagagtgaacaggcaacctacaaaatgggagaaaattttcgcaacc tactcatctgataaagggctaatatccagaatctacaatgaactcaaacaaatttacaag aaaaaaacaaacaaccccatcaaaaagtgggcaaaggacatgaacagacacttctcaaaa gaagacatttatgcagccaaaaaacacatgaaaaaatgctcaccatcaccggccatcaga gaaatgcaaatcaaaaccacaatgagataccatctcacaccagttagaatgatgatcatt aaaaagtcaggaaacaacagattccgtaagagaagttgtccactcgcagctcagcaggga ctgcccctcctcatggtggccttgccctccccatccctcctctggataaatgaacaaaga cctgatctgcagagtcatagaccacacagaccccaagaccccatggagaagagacagagt gaattggaagtactggaattcctagccagagcaatcagacaagagaaagaaataaagggc atccaaattggtaaagaggaaatcaaactgtcgctgtttgctgatgatatgattgtttac ctagaaaaccacaaagactcctccagaaagctgctagaactgataaaagaattcagcaaa atttccagatacgaaattaatgtacacaaatcagttgctcttctacacaacaacagcgac caagctgagaatcaaaatagctgcaaaaaaaataaaatacttaggagtatacctaaccaa ggagttgaaagacctctacaaggaaaactacaaaacactgctgaaagaaatcacagatga >gi568815595r:142211574_142547944|GENSCAN_predicted_peptide_5|981_aa MESTHSEDAVNIVEMTAKDLGYYINLVEKQWQTLRALTPSWKELLLWVKCYKIALHSAEK SFMKSQLIQQTLLLSYFQKLRQPSQPSSATALTNQQPSISSYLRRKGIIINETSAVVYAQ LLTGRKYQINQNGEVRLEKQWSKQVVPFVYQTIVKDIRAFDSRFSNIKTLDDLFPLRSMV FMLGTPYYGCTGEVQDSGDVITEGRIRVIFSIPCEPNLDALIQNQHKYSIKYNPGYVLAS RLGVSGYLVSRFTGSIFIGRGSRRNPHGDHKANVGLNLKFNKKNEEVPGYTKKVGSEWMY SSAAEQLLAEYLESAEKVQEIITWLKGHPVSTLSRSSCDLQILDAAIVEKIEEEVEKCKQ RKNNKKVRVTVKPHLLYRPLEQQHGVIPDRDAEFCLFDRVVNVRENFSVPVGLRGTIIGI KGANREADVLFEVLFDEEFPGGLTIRCSPGRGYRLPTSALVNLSHGSRSETGNQKLTAIV KPQPAVHQHSSSSSVSSGHLGALNHSPQSLFVPTQVPTKDDDEFCNIWQSLQGSGKMQYF QPTIQEKGAVLPQEISQVNQHHKSGFNDNSVKYQQRKHDPHRKFKEECKSPKAECWSQKM SNKQTLEGPAKYNIKLLKRNESPEVSETQKVVTGYPNAIDKKGTRMLKEILKIDGSNTVD HKNEIKQIANEIPVSSNRRDEYGLPSQPKQNKKLASYMNKPHSANEYHNVQSMDNMCWPA PSQIPPVSTPVTELSRICSLVGMPQPDFSFLRMPQTMTVCQVKLSNGLLVHGPQCHSENE AKEKAALFALQQLGSLGMNFPLPSQVFANYPSAVPPGTIPPAFPPPTANIMPSSSHLFGS MPWGPSVPVPGKPFHHTLYSGTMPMAGGIPGGVHNQFIPLQVTKKRVANKKNFENKEAQS SQATPVQTSQPDSSNIVKVSPRESSSASLKSSPIAQPASSFQVETASQGHSISHHKSTPI SSSRRKSRKLAVNFGVSKPSE >gi568815595r:142211574_142547944|GENSCAN_predicted_CDS_5|2946_bp atggaatctactcatagtgaagatgctgtgaacattgttgaaatgacagcaaaggattta ggatattacataaatttagttgaaaagcagtggcagactttgagagcattgactccaagt tggaaagaacttctcttgtgggtaaaatgctataaaatagcattgcattctgcagagaaa tctttcatgaagagtcagctgatacagcaaactttattattgtcttattttcaaaaattg cgtcagccatcccaaccttcatcggccactgccctgactaatcaacagccatcaatatca agctacctgagaagaaaaggaataataataaatgaaacatctgcagttgtgtatgctcag ttactcacaggtcgtaaatatcaaataaatcaaaatggtgaagttcgtctagagaaacag tggtcaaaacaagttgttccttttgtttatcaaactattgtcaaggacatccgagctttc gactcccgtttctccaatatcaaaacattggatgatttgtttcctctgagaagtatggtc tttatgctgggaactccctattatggctgcactggagaagttcaggattcaggtgatgtg attacagaaggtaggattcgtgtgattttcagcattccatgtgaacccaatcttgatgct ttaatacagaaccagcataaatattctataaagtacaacccaggatatgtgttggccagt cgccttggagtgagtggataccttgtttcaaggtttacaggaagtatttttattggaaga ggatctaggagaaaccctcatggagaccataaagcaaatgtgggtttaaatctcaaattc aacaagaaaaatgaggaggtacctggatatactaagaaagttggaagtgaatggatgtat tcatctgcagcagaacaacttctggcagagtacttagagagtgctgaaaaagttcaagaa attattacttggctaaaaggacatcctgtcagtactttatctcgttcttcttgtgattta caaattctggatgcagctattgttgagaaaattgaggaagaagtcgaaaagtgcaagcaa agaaagaataataagaaggtgcgagtaacagtgaaaccccatttgctatacagaccttta gaacagcaacatggagtcattcctgatcgggatgcagaattttgtctttttgaccgtgtt gtaaatgtgagagaaaacttctcagttccagttggccttcgaggcaccatcataggaata aaaggagctaatagagaagccgatgtactatttgaagtattatttgatgaagaatttcct ggagggttaacaataagatgctcacctggtagaggttatcgactgccaacaagtgccttg gtgaacctttctcatgggagtcgctctgaaactggaaatcagaagttgacagccatcgta aaaccacaaccagctgtacatcaacatagctcaagttcatcagtttcctctgggcatttg ggagccctcaaccattcccctcaatcactttttgttcctactcaagtacctactaaagat gatgatgaattctgcaacatttggcagtccttacagggatctggaaagatgcaatacttt cagccaactatacaagagaagggtgcagttctacctcaagaaataagccaagtaaatcaa catcataaatctggctttaatgacaacagtgttaaatatcagcaaagaaaacatgaccct cacagaaaatttaaagaagagtgtaagagtcctaaagctgagtgttggtcccaaaaaatg tccaataagcagactttagaagggcctgccaaatataatattaagctattgaagagaaat gaaagtccagaagtctcagaaacccaaaaggttgtgactggttatccaaatgctattgat aagaagggaacacggatgcttaaagaaattctaaaaattgatggctctaacactgtggac cataagaatgaaatcaaacagattgctaatgaaatccctgtttcctctaacagaagagat gaatatggattaccctctcagcctaaacaaaataagaaattagcatcttatatgaacaag cctcacagtgctaatgagtaccataatgttcagtctatggacaatatgtgttggcctgcc cccagccagatccctcctgtatccacaccagtaactgaactttctcgaatttgttccctt gttggaatgccacaacctgatttctcctttcttaggatgccacagacaatgaccgtttgc caagtaaaattatctaatggcttactggtacatgggccacagtgccactctgaaaatgaa gccaaagagaaagctgcactttttgctttacaacagttgggctccttaggcatgaatttc cctttgccttcacaagtatttgcaaattatccttcagctgtaccacctggaaccattcct ccagcctttcccccacctactgctaatataatgccttcgtcgtctcatctctttggctca atgccatggggaccatcggtgccagttcctgggaagcccttccatcatactttatattct gggaccatgcccatggctgggggaataccagggggtgtgcacaatcagtttatacctctg caggttactaaaaaaagggttgcaaacaaaaagaactttgagaataaggaagcccagagt tctcaagccactccagttcagactagccagccagattcttccaacattgtcaaagtaagt ccacgggagagctcatcagcttctttgaagtcctctccgattgctcaacctgcatcttct tttcaagttgaaactgcctctcaaggccatagtatatctcaccataagtcaacaccaatc tcttcttcaagaagaaaatcaagaaaactggctgttaattttggtgtttctaaaccttct gagtaa >gi568815595r:142211574_142547944|GENSCAN_predicted_peptide_6|456_aa MDKMIPEFDNLYLDMNGIIHQCSHPNDDDVHFRISDDKIFTDIFHYLEVLFRIIKPRKVF FMAVDGVAPRAKMNQQRGRRFRSAKEAEDKIKKAIEKGETLPTEARFDSNCITPGYINES GHLNLPRFEKYLVKLSDFDREHFSEVFVDLKWFESKVGNKYLNEAAGVAAEEARNYKEKK KLKGQENSLCWTALDKNEGEMITSKDNLEDETEDDDLFETEFRQYKRTYYMTKMGVDVVS EYYPYHYAPFLSDIHNISTLKIHFELGKPFKPFEQLLAVLPAASKNLLPACYQHLMTNED SPIIEYYPPDFKTDLNGKQQEWEAVVLIPFIDEFFLKKSGVQVFQQSSRGENMMLEILVD AESDELTVENVASSVLGKSVFVNWPHLEEARVVAVSDGETKFYLEEPPGTQKLYSGRTAP PSKVVHLGDKEQSNWAKEVQGISEQDMDEIGNHHSQ >gi568815595r:142211574_142547944|GENSCAN_predicted_CDS_6|1371_bp atggataagatgattcctgaatttgacaacttgtacctggatatgaatggaattatacat cagtgctcccatcctaatgatgatgatgttcactttagaatttcagatgataaaatcttt actgatatttttcactacctggaggtgttgtttcgcattattaaacccaggaaagtgttc tttatggctgtagatggtgtggctcctcgagcaaaaatgaaccagcagcgtgggaggcgt tttaggtcagcaaaggaggcagaagacaaaattaaaaaggcaatagagaagggagaaact cttcctacagaggccagatttgattccaactgtatcacaccaggttatattaatgaaagt gggcacctcaacttacctcgatttgagaaataccttgtgaaactatcagattttgatcgg gagcacttcagtgaagtttttgtggacctaaaatggtttgaaagcaaagttggtaacaag tacctcaatgaagcagcaggtgtcgcagcagaagaagccaggaactacaaggaaaagaaa aagttaaagggccaggaaaattctctgtgttggactgctttagacaaaaatgaaggcgaa atgataacttctaaggataatttagaagatgagactgaagatgatgacctatttgaaact gagtttagacaatataaaagaacatattacatgacgaagatgggggttgacgtagtatct gagtattatccttatcattatgcacctttcctgtctgatatacacaacatcagtacactc aaaatccattttgaactaggaaaaccttttaagccatttgaacagcttcttgctgtactt ccagcagccagcaaaaatttacttcctgcatgctaccagcatttgatgaccaatgaagac tcaccaattatagaatattacccacctgattttaaaactgacctaaatgggaaacaacag gaatgggaagctgtggtgttaatcccttttattgatgagttttttttgaagaaaagtggt gttcaagtattccagcaaagcagtcgtggagaaaacatgatgttggaaatcttagtggat gcagaatcagatgaacttaccgtagaaaatgtagcttcatcagtgcttggaaaatctgtc tttgttaattggcctcaccttgaggaagctagagtcgtggctgtatcagatggagaaact aagttttacttggaagaacctccaggaacacagaagctttattcaggaagaactgcccca ccatctaaagtggttcatcttggagataaagaacaatctaactgggcaaaagaagtacaa ggaatttcagaacaggacatggatgaaattggaaatcatcattctcagtaa >gi568815595r:142211574_142547944|GENSCAN_predicted_peptide_7|342_aa MALRQVDFGGSGKGKSWANRMPKRACFQCGPQGHFKKDCPSRNKPPPCPCPLCQGNHWKA HYLRRRRSSETKVSGDEGPAEPDDPAAGLRVPGESASPCHHPHRALVLLSCPGQLSSRSV TIRGVLGQPVTRYFSQPLSCDWGTLLFSHAFLITPESPTPLLRRDILAKVGVIIHLNIGE GTPVCCPLLEEGINPEVWATERQHGRAKNAHPGQVKLKDSTSFPYQRQYPLRPEAQQGLQ KIVKDLKAQGLVKPCNSSCNTPNLGVQKPNRQWRLVQDLRIMNEAVVPLYPAVPNPYTLL SQIPEEAERFTVLDLKDAFFCNPVHPDSQFLFAFEEPSNPTS >gi568815595r:142211574_142547944|GENSCAN_predicted_CDS_7|1029_bp atggcccttaggcaagtggactttggaggctctggaaaagggaaaagctgggcaaatcga atgcctaaaagggcttgcttccagtgcggtccgcaaggacactttaaaaaagattgtcca agtagaaataagccgcccccttgtccatgccccttatgtcaagggaatcactggaaggcc cactacctcaggagacgaaggtcctctgagacgaaggtctcaggagacgaaggtcctgct gaaccagatgatccagcagcaggactgagggtgcccggggaaagcgccagcccatgccat caccctcacagagccctggtcttactctcctgtcctggacaactgtcctccagatctgtc actatccgaggggtcctaggacagccagtcactagatacttctcccagccactaagttgt gactggggaactttactcttttcacatgcttttctaattacgcctgaaagccccactccc ttgttaaggagagacattctagcaaaagtaggggtcattatacacctgaacataggagaa ggaacacccgtttgttgtcccctgcttgaggaaggaattaatcctgaagtctgggcaaca gaaagacaacatggacgagcaaagaatgcccatcctggtcaagttaaactaaaggattcc acctcctttccctaccaaaggcagtacccccttagacccgaggcccaacaaggactccaa aagattgttaaggacctaaaagcccaaggcctagtaaaaccatgcaatagctcctgcaat actccaaatttaggagtacagaaacccaacagacagtggaggttagtgcaagatctcagg attatgaatgaggctgttgttcctctatacccagctgtacctaacccttatactctgctt tcccaaataccagaggaagcagagcggtttacagtcctggaccttaaggatgcctttttc tgcaaccctgtacatcctgactctcaattcttgtttgcctttgaagagccttcgaaccca acgtcttaa >gi568815595r:142211574_142547944|GENSCAN_predicted_peptide_8|106_aa MVGRAVCCHPRSDDRRLRRSPGGAFGFGLALTGISVDDRNGSPQVLQMDLRAVSLSQRSG ERASGGGCEAGVGVLVGSPIPPRAADPLTDDNSRFRTQSPCLLSLI >gi568815595r:142211574_142547944|GENSCAN_predicted_CDS_8|321_bp atggtcggccgggcggtgtgttgtcatccgcggagcgacgaccggaggctgcggcggagc cccggcggggcgtttggtttcggtttggccctgactgggattagtgttgacgatcgaaat gggagtccccaagttttacagatggatctcagagcggtatccctgtctcagcgaagtggt gaaagagcatcaggtggaggctgcgaagctggagtgggtgtcctcgtaggttcccccatc ccacccagagcggcagacccacttacagacgataacagccgcttccgcacgcagtcacct tgtttactttccctgatctag >gi568815595r:142211574_142547944|GENSCAN_predicted_peptide_9|911_aa METQKTLHKINESRSWFLEKINKIDRLLARLIKKKREKNQIDAIKNVKGDITDPTEIQTT IREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESLNGPITGSEIEAIINSLPTKKS PGPDGFTAEFYQRYKEELVPFILKLFQSIEKEGVLPNSFYEASIILIPKPGRDTTKKENF KPISLMNIDAKILNKILANRIQQHIKKLIHHDEVGFIPGISKDKNHMIMSIDAEKAFDKI QQPFMLKTLNKLGIDGTYLKIIRAIYDRLTANILNGQKLEAFLLKTGIRQGCPLSPLLFN IVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIISAPNFFKLISNFTKVS GYKINVQKSQAYLYTNNRQTETQIMSELLFTIASKRIKYPGIQLTRDVKDLFKENYKPLL KEIKEDTNKWKNIPCSWIGRINIVKMAILPKSSYPMRVNRCKEILNKAIHMKKSLEKFVG DATRLTDKLLELCNKPVDGSSSTLSMSTHFKMLKKLVEEATFSEILIPLQSVMIPTLPSI LGTHANHASHEPFPGHWAYIAGFDDMVEILASLQKPKKISLKGSDGKFYIMMCKPKDDLR KDCRLMEFNSLINKCLRKDAESRRRELHIRTYAVIPLNDECGIIEWVNNTAGLRPILTKL YKEKGVYMTGKELRQCMLPKSAALSEKLKVFREFLLPRHPPIFHEWFLRTFPDPTSWYSS RSAYCRSTAVMSMVGYILGLGDRHGENILFDSLTGECVHVDFNCLFNKGETFEVPEIVPF RLTHNMVNGMGPMGTEGLFRRACEVTMRLMRDQREPLMSVLKTFLHDPLVEWSKPVKGHS KAPLNETGEVVNEKAKTHVLDIEQRLQGVIKTRNRVTGLPLSIEGHVHYLIQEATDENLL CQMYLGWTPYM >gi568815595r:142211574_142547944|GENSCAN_predicted_CDS_9|2736_bp atggagacacaaaaaacccttcacaaaatcaatgaatccaggagctggtttttggaaaag atcaacaaaattgatagactgctagctagactaataaagaagaaaagagagaagaatcaa atagatgcaataaaaaatgttaaaggggatatcaccgatcccacagaaatacaaactacc atcagagaatactataaacacctctatgcaaataagctagaaaatctagaagaaatggat aaattcctggacacatacaccctcccaagactaaaccaggaagaagttgaatctcttaat ggaccaataacaggctctgaaattgaggcaataattaatagcttaccaaccaaaaaaagt ccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactggtacca ttcattctgaaactatttcaatcaatagagaaagagggagtcctccctaactcattttat gaggccagcatcatcctgataccaaagcctggcagagacacaacaaaaaaagagaatttt aaaccaatatccctgatgaacattgatgcaaaaatcctcaataaaatactggcaaaccga atccagcagcacatcaaaaagcttatccaccatgatgaagtgggcttcatccctgggata agcaaagacaaaaaccacatgattatgtcaatagatgcagaaaaggcctttgacaaaatt caacagcccttcatgctaaaaactctcaataaattaggtattgatgggacatatctaaaa ataataagagctatttatgacagactcacagccaatatactgaatgggcaaaaactggaa gcattccttttgaaaactggcataagacagggatgccctctctcaccactcctattcaac atagtgttggaagttctggccagggcaatcaggcaggagaaagaaataaagggcattcaa ctaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattgtatatctagaa aaccccatcatctcagccccaaatttctttaagctgataagcaacttcaccaaagtctca ggatacaaaatcaatgtgcaaaaatcacaagcatacttatacaccaataacagacaaaca gagacccaaatcatgagtgaactcctattcacaattgcttcaaagagaataaaataccca ggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccattgctc aaggaaataaaagaggacacaaacaaatggaagaacattccatgctcatggataggaaga atcaatatcgtgaaaatggccatactgcccaagtcatcttatcccatgcgtgtgaacaga tgcaaggaaatcctcaataaagctattcatatgaaaaaatccttagagaagtttgttgga gatgcaactcgcctaacagataagcttctagaattgtgcaataaaccggttgatggaagt agttccacattaagcatgagcactcattttaaaatgcttaaaaagctggtagaagaagca acatttagtgaaatcctcattcctctacaatcagtcatgatacctacacttccatcaatt ctgggtacccatgctaaccatgctagccatgaaccatttcctggacattgggcctatatt gcagggtttgatgatatggtggaaattcttgcttctcttcagaaaccaaagaagatttct ttaaaaggctcagatggaaagttctacatcatgatgtgtaagccaaaagatgacctgaga aaggattgtagactaatggaattcaattccttgattaataagtgcttaagaaaagatgca gagtctcgtagaagagaacttcatattcgaacatatgcagttattccactaaatgatgaa tgtgggattattgaatgggtgaacaacactgctggtttgagacctattctgaccaaacta tataaagaaaagggagtgtatatgacaggaaaagaacttcgccagtgtatgctaccaaag tcagcagctttatctgaaaaactcaaagtattccgagaatttctcctgcccaggcatcct cctatttttcatgagtggtttctgagaacattccctgatcctacatcatggtacagtagt agatcagcttactgccgttccactgcagtaatgtcaatggttggttatattctggggctt ggagaccgtcatggtgaaaatattctctttgattctttgactggtgaatgcgtacatgta gatttcaattgtcttttcaataagggagaaacctttgaagttccagaaattgtgccattt cgcctgactcataatatggttaatggaatgggtcctatgggaacagagggtctttttcga agagcatgtgaagttacaatgaggctgatgcgtgatcagcgagagcctttaatgagtgtc ttaaagacttttctacatgatcctcttgtggaatggagtaaaccagtgaaagggcattcc aaagcgccactgaatgaaactggagaagttgtcaatgaaaaggccaagacccatgttctt gacattgagcagcgactacaaggtgtaatcaagactcgaaatagagtgacaggactgccg ttatctattgaaggacatgtgcattaccttatacaggaagctactgatgaaaacttacta tgccagatgtatcttggttggactccatatatgtga >gi568815595r:142211574_142547944|GENSCAN_predicted_peptide_10|912_aa MQLLSSSVGIEDKKMETSESTDLQTTLQLSMKAIQHENVDVRIHALTSLKETLYKNQEKL IKYATDSETVEPIISQLVTVLLKGCQDANSQARLLCGECLGELGAIDPGRLDFSTTETQG KDFTFVTGVEDSSFAYGLLMELTRAYLAYADNSRAQDSAAYAIQVRHDLASKIFTCCSIM MKHDFKVTIYLLPHILVYVLLGCNQEDQQEVYAEIMAVLKHDDQHTINTQDIASDLCQLS TQTVFSMLDHLTQWARHKFQALKAEKCPHSKSNRNKVDSMVSTVDYEDYQSVTRFLDLIP QDTLAVASFRSKAYTRAVMHFESFITEKKQNIQEHLGFLQKLYAAMHEPDGVAGVSAIRK AEPSLKEQILEHESLGLLRDATACYDRAIQLEPDQIIHYHGVVKSMLGLGQLSTVITQVN GVHANRSEWTDELNTYRVEAAWKLSQWDLVENYLAADGKSTTWSVRLGQLLLSAKKRDIT AFYDSLKLVRAEQIVPLSAASFERGSYQRGYEYIVRLHMLCELEHSIKPLFQHSPGDSSQ EDSLNWVARLEMTQNSYRAKEPILALRRALLSLNKRPDYNEMVGECWLQSARVARKAGHH QTAYNALLNAGESRLAELYVERAKWLWSKDVTACLPEWEDGHFYLAKYYDKLMPMVTDNK MEKQGDLIRYIVLHFGRITNAEKSLKDLMELKSTAQELRDECTSFSSRFDQLEERVSVIE DQMNEMKREEKFREKRIKRNEQSLQEIWDYVKRPNLCLIGVPETRQANIQIQEIQRTPQR YSSRRATPRHIIVRFTKVEMKERTLRAAREKGRVTHKGNPIRLTADLSAEILQARREWGP IFKILKEKNFQPRISYPAKLSFKSEGEIKSFTDKQMLRDFVTTRPVLKELLKEALNMERN NRYQPLQKHAKL >gi568815595r:142211574_142547944|GENSCAN_predicted_CDS_10|2739_bp atgcagttactgagctctagtgttggcattgaagataagaaaatggagacctctgagagc actgatcttcagacaactcttcagctctctatgaaggccattcaacatgaaaatgtcgat gttcgtattcatgctcttacaagcttgaaggaaaccttgtataaaaatcaggaaaaactg ataaagtatgcaacagacagtgaaacagtagaacctattatctcacagttggtgacagtg cttttgaaaggttgccaagatgcaaactctcaagctcggttgctctgtggggaatgttta ggggaattgggggcgatagatccaggtcgattagatttctcaacaactgaaactcaagga aaagattttacatttgtgactggagtagaagattcaagctttgcctatggattattgatg gagctaacaagagcttaccttgcgtatgctgataatagccgagctcaagattcagctgcc tatgccattcaggttcgacatgatcttgccagtaaaattttcacctgctgtagcattatg atgaagcatgatttcaaagtgaccatctatcttcttccacatattctggtgtatgtctta ctgggttgtaatcaagaagatcagcaggaggtttatgcagaaattatggcagttctaaag catgacgatcagcataccataaatacccaagacattgcatctgatctgtgtcaactcagt acacagactgtgttctccatgcttgaccatctcacacagtgggcaaggcacaaatttcag gcactgaaagctgagaaatgtccacacagcaaatcaaacagaaataaggtagactcaatg gtatctactgtggattatgaagactatcagagtgtaacccgttttctagacctcataccc caggatactctggcagtagcttcctttcgctccaaagcatacacacgagctgtaatgcac tttgaatcatttattacagaaaagaagcaaaatattcaggaacatcttggatttttacag aaattgtatgctgctatgcatgaacctgatggagtggccggagtcagtgcaattagaaag gcagaaccatctctaaaagaacagatccttgaacatgaaagccttggcttgctgagggat gccactgcttgttatgacagggctattcagctagaaccagaccagatcattcattatcat ggtgtagtaaagtccatgttaggtcttggtcagctgtctactgttatcactcaggtgaat ggagtgcatgctaacaggtccgagtggacagatgaattaaacacgtacagagtggaagca gcttggaaattgtcacagtgggatttggtggaaaactatttggcagcagatggaaaatct acaacatggagtgtcagactgggacagctattattatcagccaaaaaaagagatatcaca gctttttatgactcactgaaactagtgagagcagaacaaattgtacctctttcagctgca agctttgaaagaggctcctaccaacgaggatatgaatatattgtgagattgcacatgtta tgtgagttggagcatagcatcaaaccacttttccagcattctccaggtgacagttctcaa gaagattctctaaactgggtagctcgactagaaatgacccagaattcctacagagccaag gagcctatcctggctctccggagggctttactaagcctcaacaaaagaccagattacaat gaaatggttggagaatgctggctgcagagtgccagggtagctagaaaggctggtcaccac cagacagcctacaatgctctccttaatgcaggggaatcacgactcgctgaactgtacgtg gaaagggcaaagtggctctggtccaaggatgtgaccgcgtgcctgccagaatgggaggat gggcatttttaccttgccaagtactatgacaaattgatgcccatggtcacagacaacaaa atggaaaagcaaggtgatctcatccggtatatagttcttcattttggcagaataaccaat gcagagaaatccttaaaggacctgatggagctgaaatccacggcacaggaactacgtgac gaatgcacaagcttcagtagccgattcgatcaactggaagaaagggtatcagtgattgaa gatcaaatgaatgaaatgaagcgagaagagaagtttagagaaaaaagaataaaaagaaac gaacaaagcctccaagaaatatgggactatgtgaaaagaccaaatctatgtctgattggt gtacctgaaacaaggcaggccaacattcaaattcaggaaatacagagaacgccacaaaga tactcctcgagaagagcaactccaagacacataattgtcagattcaccaaagttgaaatg aaggaacgaacgttaagggcagccagagagaaaggtcgggttacccacaaagggaacccc atcagactaacagctgatctctcggcagaaattctacaagccagaagagagtgggggcca atattcaaaattcttaaggaaaagaattttcaacccagaatttcatatccagccaaacta agcttcaaaagtgaaggagaaataaaatcctttacagacaagcaaatgctgagagatttt gtcaccaccaggcctgtcctaaaagagctcctgaaggaagcactaaacatggaaaggaac aatcggtaccagccactgcagaaacatgccaaattgtaa