GENSCAN 1.0 Date run: 6-Nov-116 Time: 00:59:38 Sequence gi568815597f:40158272_40392666 : 234395 bp : 42.89% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3129 3365 237 2 0 101 94 290 0.187 26.96 1.02 Intr + 30784 30938 155 0 2 87 93 125 0.993 10.85 1.03 Intr + 32501 32582 82 2 1 64 109 31 0.966 1.52 1.04 Intr + 37361 37493 133 1 1 87 115 23 0.932 4.20 1.05 Intr + 44141 44343 203 0 2 79 105 61 0.911 4.98 1.06 Intr + 64303 64439 137 0 2 52 111 92 0.971 6.35 1.07 Intr + 73246 73387 142 1 1 112 81 8 0.975 1.93 1.08 Intr + 77521 82161 4641 0 0 99 -12 2596 0.047 237.12 1.09 Intr + 99199 99353 155 0 2 117 -50 195 0.369 6.55 1.10 Intr + 99443 99557 115 2 1 43 98 54 0.263 1.43 1.11 Intr + 99943 100123 181 0 1 -10 99 268 0.392 16.62 1.12 Intr + 105620 105666 47 0 2 88 39 25 0.208 -5.19 1.13 Intr + 109515 109601 87 2 0 74 87 74 0.925 5.15 1.14 Intr + 110148 110264 117 2 0 83 75 52 0.870 3.14 1.15 Intr + 113623 113764 142 0 1 85 116 113 0.934 12.81 1.16 Term + 115443 115465 23 1 2 124 37 9 0.578 -3.10 1.17 PlyA + 118360 118365 6 1.05 2.04 PlyA - 118689 118684 6 1.05 2.03 Term - 121558 121270 289 0 1 21 37 209 0.337 2.96 2.02 Intr - 122022 121824 199 1 1 -29 28 385 0.350 18.29 2.01 Init - 122115 122046 70 0 1 67 39 158 0.773 10.06 2.00 Prom - 137556 137517 40 -4.75 3.33 PlyA - 138331 138326 6 1.05 3.32 Term - 143110 142911 200 2 2 128 53 134 0.693 10.48 3.31 Intr - 143618 143541 78 0 0 105 115 14 0.932 4.40 3.30 Intr - 144538 144350 189 2 0 127 96 156 0.972 18.94 3.29 Intr - 144914 144662 253 2 1 59 36 281 0.862 16.08 3.28 Intr - 145405 145259 147 1 0 112 105 76 0.998 11.21 3.27 Intr - 145568 145536 33 2 0 119 109 26 0.974 5.30 3.26 Intr - 145766 145657 110 0 2 27 105 50 0.835 -0.32 3.25 Intr - 145828 145793 36 2 0 93 89 33 0.617 1.22 3.24 Intr - 146120 145904 217 2 1 124 25 172 0.541 11.65 3.23 Intr - 146258 146205 54 2 0 118 109 33 0.986 6.66 3.22 Intr - 146576 146523 54 2 0 94 115 92 0.834 10.76 3.21 Intr - 148012 147872 141 1 0 59 94 49 0.525 2.23 3.20 Intr - 149228 149175 54 2 0 83 115 55 0.963 5.96 3.19 Intr - 149485 149432 54 1 0 88 97 29 0.783 2.06 3.18 Intr - 149974 149921 54 1 0 133 90 60 0.969 8.96 3.17 Intr - 151720 151541 180 1 0 90 56 101 0.764 6.24 3.16 Intr - 151893 151840 54 0 0 147 119 52 0.999 12.46 3.15 Intr - 152046 151993 54 0 0 102 127 32 0.905 6.76 3.14 Intr - 152496 152443 54 0 0 104 107 70 0.997 8.76 3.13 Intr - 152875 152822 54 1 0 98 117 37 0.967 5.86 3.12 Intr - 153444 153391 54 0 0 109 95 66 0.732 7.66 3.11 Intr - 153587 153498 90 2 0 58 98 54 0.822 2.67 3.10 Intr - 153841 153788 54 1 0 118 105 71 0.255 10.06 3.09 Intr - 154208 154185 24 2 0 113 95 23 0.210 2.80 3.08 Intr - 154338 154303 36 0 0 123 117 2 0.227 4.14 3.07 Intr - 154513 154460 54 1 0 118 116 30 0.184 7.06 3.06 Intr - 155996 155934 63 2 0 104 76 45 0.029 2.90 3.05 Intr - 156116 156081 36 2 0 77 117 41 0.030 3.44 3.04 Intr - 157018 156856 163 0 1 87 72 54 0.043 2.76 3.03 Intr - 157393 157319 75 0 0 95 105 77 0.043 7.81 3.02 Intr - 158241 158067 175 1 1 40 40 82 0.019 -3.32 3.01 Init - 158926 158707 220 1 1 104 72 223 0.031 19.14 3.00 Prom - 162304 162265 40 -5.15 4.02 PlyA - 162439 162434 6 1.05 4.01 Sngl - 175203 174805 399 0 0 54 38 377 0.621 25.31 4.00 Prom - 176041 176002 40 -7.55 5.04 PlyA - 176603 176598 6 1.05 5.03 Term - 179972 179938 35 0 2 125 40 13 0.450 -3.03 5.02 Intr - 180689 180461 229 2 1 66 72 195 0.271 12.22 5.01 Init - 185951 185895 57 2 0 88 107 15 0.751 4.76 5.00 Prom - 195454 195415 40 -4.25 6.00 Prom + 200741 200780 40 -7.85 6.01 Sngl + 207474 208490 1017 2 0 88 43 769 0.911 69.17 6.02 PlyA + 208718 208723 6 1.05 7.00 Prom + 208887 208926 40 -6.15 7.01 Init + 209837 212369 2533 1 1 70 47 537 0.078 35.35 7.02 Intr + 215857 215952 96 2 0 52 69 176 0.083 11.16 7.03 Intr + 226728 226830 103 1 1 130 66 128 0.503 13.11 7.04 Term + 229888 230080 193 1 1 68 36 177 0.391 6.41 7.05 PlyA + 230438 230443 6 1.05 8.02 PlyA - 231044 231039 6 -0.45 8.01 Term - 231420 231280 141 0 0 94 50 96 0.179 3.35 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 77521 82176 4656 0 0 99 42 2603 0.939 241.19 S.002 Init + 122687 122732 46 1 1 60 85 11 0.811 -1.10 S.003 Intr + 123072 123256 185 1 2 97 87 117 0.888 11.09 S.004 Intr + 127654 127758 105 0 0 93 111 53 0.967 7.59 S.005 Term + 134174 134398 225 1 0 115 50 66 0.896 1.30 S.006 Init + 156497 156648 152 1 2 56 40 122 0.927 3.76 S.007 Intr + 156863 156998 136 2 1 56 72 114 0.948 6.25 S.008 Term + 204506 204667 162 1 0 60 39 209 0.886 10.15 S.009 Sngl + 208980 209573 594 2 0 49 36 250 0.857 11.84 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:40158272_40392666|GENSCAN_predicted_peptide_1|2198_aa MADGKGDAAAVAGAGAEAPAVAGAGDGVETESMVRGHRPVSPAPGASGLRPCLWQLETEL REQEVSEVSSLNYCRSFCQTLLQYASNKNASEHIVYLLEVYRLAIQSFASARPYLTTECE DVLLVLGRLVLSCFELLLSVSESELPCEVWLPFLQSLQESHDALLEFGNNNLQILVHVTK EGVWKNPVLLKILSQQPVETEEVNKLIAQEGPSFLQMRIKHLLKSNCIPQATALSKLCAE SKEISNVSSFQQAYITCLCSMLPNEDAIKEIAKVDCKEVLDIICNLESEGQDNTAFVLCT TYLTQQLQTASVYCSWELTLFWSKLQRRIDPSLDTFLERCRQFGVIAKTQQHLFCLIRVI QTEAQDAGLGVSILLCVRALQLRSSEDEEMKASVCKTIACLLPEDLEVRRACQLTEFLIE PSLDGFNMLEELYLQPDQKFDEENAPVPNSLRCELLLALKAHWPFDPEFWDWKTLKRHCH QLLGQEASDSDDDLSGYEMSINDTDVLESFLSDYDEGKEDKQYRRRDLTDQHKEKRDKKP IGSSERYQRWLQYKFFCLLCKRECIEARILHHSKMHMEDGIYTCPVCIKKFKRKEMFVPH VMEHVKMPPSRRDRSKKKLLLKGSQKGICPKSPSAIPEQNHSLNDQAKGESHEYVTFSKL EDCHLQDRDLYPCPGTDCSRVFKQFKYLSVHLKAEHQNNDENAKHYLDMKNRREKCTYCR RHFMSAFHLREHEQVHCGPQPYMCVSIDCYARFGSVNELLNHKQKHDDLRYKCELNGCNI VFSDLGQLYHHEAQHFRDASYTCNFLGCKKFYYSKIEYQNHLSMHNVENSNGDIKKSVKL EESATGEKQDCINQPHLLNQTDKSHLPEDLFCAESANSQIDTETAENLKENSDSNSSDQL SHSSSASMNEELIDTLDHSETMQDVLLSNEKVFGPSSLKEKCSSMAVCFDGTKFTCGFDG CGSTYKNARGMQKHLRKVHPYHFKPKKIKTKDLFPSLGNEHNQTTEKLDAEPKPCSDTNS DSPDEGLDHNIHIKCKREHQGYSSESSICASKRPCTEDTMLELLLRLKHLSLKNSITHGS FSGSLQGYPSSGAKSLQSVSSISDLNFQNQDENMPSQYLAQLAAKPFFCELQGCKYEFVT REALLMHYLKKHNYSKEKVLQLTMFQHRYSPFQCHICQRSFTRKTHLRIHYKNKHQIGSD RATHKLLDNEKCDHEGPCSVDRLKGDCSAELGGDPSSNSEKPHCHPKKDECSSETDLESS CEETESKTSDISSPIGSHREEQEGREGRGSRRTVAKGNLCYILNKYHKPFHCIHKTCNSS FTNLKGLIRHYRTVHQYNKEQLCLEKDKARTKRELVKCKKIFACKYKECNKRFLCSKALA KHCSDSHNLDHIEEPKVLSEAGSAARFSCNQPQCPAVFYTFNKLKHHLMEQHNIEGEIHS DYEIHCDLNGCGQIFTHRSNYSQHVYYRHKDYYDDLFRSQKVANERLLRSEKVCQTADTQ GHEHQTTRRSFNAKSKKCGLIKEKKAPISFKTRAEALHMCVEHSEHTQYPCMVQGCLSVV KLESSIVRHYKRTHQMSSAYLEQQMENLVVCVKYGTKIKEEPPSEADPCIKKEENRSCES ERTEHSHSPGDSSAPIQNTDCCHSSERDGGQKGCIESSSVFDADTLLYRGTLKCNHSSKT TSLEQCNIVQPPPPCKIENSIPNPNGTESGTYFTSFQLPLPRIKESETRQHSSGQENTVK NPTHVPKENFRKHSQPRSFDLKTYKPMGFESSFLKFIQESEEKEDDFDDWEPSEHLTLSN SSQSSNDLTGNVVANNMVNDSEPEVDIPHSSSDSTIHENLTAIPPLIVAETTTVPSLENL RVVLDKALTDCGELALKQLHYLRPVVVLERSKFSTPILDLFPTKKTDELCGTLAGKTPPL TTLGVVKTRSSNENSQRYRPLSHTLEFRTPQYWPKDSISHAWREPRPTQSVAAHSVSSGP SPDYLSPCWGKEGRYRSPVRGRVSVWHRCTLKEPAEPGGHGDVGIAGRFVGDAGREAYLR GRAALFLDSVSLGDLPSTAADINKGPTEEMEWLGSRLILLFGGIPYLWRLSGRFCGYAGF GPEYEITQSLVFLLLATLFSALTGLPWSLYNTFVIEEKHGFNQQVLVTIYADYIAPLFDK FTPLPEGKLKEEIEVMAKSIDFPLTKVYVVEGNIKLEI >gi568815597f:40158272_40392666|GENSCAN_predicted_CDS_1|6597_bp atggcggacggaaagggagacgccgccgctgtcgccggggctggggctgaggctccggcg gtagcgggagccggagatggagtcgagactgagtccatggttcggggtcatcgccccgta tctccagcgccgggagcctcgggactgcggccgtgtctgtggcagctggagacagagctg agggagcaagaggtgtcggaggtctcatctttgaactactgccggagcttctgccagacc ttattgcaatatgcaagcaacaagaatgcatcagaacatattgtgtatcttctggaggta tatcgacttgccatccaaagctttgccagtgcacgtccatacttaactactgaatgtgaa gatgtcctcttagtgcttggcagattagtactgagttgtttcgaattactgctttcagtg tctgaaagtgaactgccatgtgaagtctggctaccattccttcagtctctacaggagtca catgatgcattattggaatttgggaataataacctacaaatattggttcatgttaccaag gaaggggtgtggaaaaacccagttcttcttaaaattctgtctcaacagccagtagaaacg gaggaagtcaataaattgattgcacaagaaggaccttcctttctgcaaatgcgaataaaa catttgttgaaatctaactgcatcccccaggctactgctttatcaaaactatgtgcagaa tctaaagaaatttcaaatgtgtcatcttttcagcaagcctatatcacatgtttatgttct atgctccctaatgaagatgctattaaggagattgcaaaggtcgactgcaaggaagtacta gacatcatttgtaatctggaatctgaggggcaggataacacagcatttgttctttgtacg acttaccttacccagcagctccaaactgcaagtgtatattgttcttgggaactgactctt ttttggagtaaactgcaaagaagaattgacccttctttagatacttttttggagcgctgt cgtcagtttggtgtcatagctaaaacgcagcagcatttattttgcctcattagagttata caaactgaagcacaagatgctggtcttggggtgtcaattttactgtgtgtcagagctctt caactcagatcaagtgaagatgaggaaatgaaggcatcagtttgtaaaacaattgcctgt cttttaccagaagatttagaagttagacgagcctgtcagcttacagaattcttaattgaa cccagtttggatggatttaatatgttagaagaactatatttgcaaccagatcaaaaattt gatgaagaaaatgcaccggttccaaattctcttcgatgtgagctcttactagctttaaaa gcccactggccttttgatcctgagttttgggactggaaaactttaaaacgacactgccac caacttttaggacaagaagcctcagattctgatgatgatttaagtggctatgaaatgtcc attaatgacacagatgttttagagtcatttctcagtgactatgatgagggtaaagaagat aaacaatatagaagaagagatttgacagatcagcataaggagaaaagagacaaaaaacct attggctcttctgaaagatatcagaggtggcttcagtacaagtttttctgtttgttatgt aagcgggaatgtatagaggctagaattcttcatcattctaagatgcatatggaagatgga atttacacctgtccagtttgtattaaaaaatttaagagaaaagaaatgtttgttcctcat gtgatggagcatgttaaaatgccaccaagcagaagggaccgctctaaaaagaaattactg ttaaaaggctctcaaaagggtatttgtcctaagagcccctctgcaatcccagagcaaaac cattcattgaatgaccaagccaaaggagagtctcatgaatatgtcacattcagcaaatta gaagattgccacctgcaagacagagatttgtatccatgtcccggtacagactgttcccgt gtgtttaagcaatttaaatacttaagtgtgcatcttaaagctgaacaccaaaataatgat gaaaatgccaagcactacttggatatgaaaaatagaagagagaagtgtacttactgtcga cgacattttatgtctgcttttcaccttcgagagcacgaacaagtgcattgtgggcctcag ccttatatgtgtgtatctatagattgctatgctaggtttggatcagtaaatgaactactt aaccataaacaaaagcatgacgatctgcgttacaaatgtgaattaaatggctgtaatatt gttttcagtgacttgggacagctttaccaccatgaagcacaacactttagggatgcatct tacacatgcaacttccttggctgtaaaaagttctattactccaaaattgaataccagaat cacctctcaatgcataatgttgaaaattcaaatggagacataaagaaatcagtgaaactt gaggagtctgcaacaggtgaaaagcaagattgtattaatcagccccatctacttaaccaa actgataaatcacatttacctgaagatcttttctgtgcagaatcagctaattctcaaata gatacagaaactgcagaaaacctgaaagaaaacagtgacagtaattctagtgatcagtta agtcatagctcttcagcttcaatgaatgaagagctaattgacacactagatcactctgaa actatgcaggatgtattgttatctaatgagaaagtctttgggccctccagtttaaaagaa aaatgttccagtatggcagtttgttttgacgggactaagtttacctgtggttttgatggc tgtggttccacatacaaaaatgcaagaggaatgcagaaacatttacggaaggttcatcca taccatttcaagcccaaaaagataaagacgaaagatctgtttccctctttgggtaatgaa cataatcagacaactgaaaagttggatgcagaacctaaaccctgctcagatacaaacagt gactccccagatgaaggtctagatcacaatattcacattaaatgtaaacgagaacatcaa ggttattcctcagaatcctccatttgtgcttctaaaaggccctgtacagaggataccatg ttggaacttctgttacgcttgaaacatttaagcttgaaaaactcaataacacatggatct ttctcagggtcattgcaggggtacccatccagtggtgctaagtctcttcagtcagtttca tctatctcagaccttaattttcagaatcaagatgaaaacatgccaagtcagtaccttgca cagttggcggctaagccgtttttctgtgagcttcaaggatgcaaatatgaatttgtgacc agagaggctctgttaatgcattatcttaaaaagcataattattcaaaagaaaaagtcctt cagttaaccatgttccaacatcggtattccccatttcagtgtcatatttgccaaaggtca tttacaagaaaaacacaccttaggattcattataaaaataaacatcaaattggcagtgac agagcaactcacaaactattagataatgaaaagtgtgatcatgaaggcccatgttcagta gataggttgaaaggtgattgttctgcagaacttggaggtgatcccagtagtaactctgag aaaccacactgtcatcctaaaaaggatgaatgtagttctgaaacagatttggaatcatct tgtgaagaaacagaaagtaaaacatctgacatttcatcaccaataggcagccatagagaa gaacaagaaggaagagagggcagaggtagcaggcgaactgttgctaaaggaaatctgtgt tatattttgaataaataccacaaaccattccattgtattcataaaacttgcaactcctca ttcaccaatctaaaaggcttaattcgccattacagaactgtacatcagtacaacaaagaa cagttatgtttggagaaagacaaagcaagaaccaaaagggaacttgtcaaatgtaaaaag atatttgcttgcaaatataaggaatgtaataaacgcttcctgtgttccaaagctcttgct aagcactgtagtgattctcataacctagaccatattgaagagcctaaagtactttccgaa gctggatctgcagcaaggttttcttgtaaccagcctcagtgccctgctgttttttataca ttcaacaagttgaagcaccacttgatggaacagcataatattgaaggggaaatacattca gattatgaaattcattgtgatcttaatggctgtggccagattttcacccatcgcagtaat tactcacaacatgtatattaccgacataaagactattatgatgatttgtttagaagccag aaagtagcaaatgagagactactaaggagtgaaaaggtatgtcaaacagctgatactcag gggcatgaacatcagaccaccaggagatcatttaatgctaagtctaaaaaatgtggctta atcaaagaaaagaaagccccaataagttttaaaaccagagctgaggccctccatatgtgt gtggagcactctgagcacacacagtacccctgcatggttcaaggatgcttatctgtggtg aagttggagagcagcattgtgaggcattacaaacgcactcatcagatgagtagtgcctat ttagagcaacagatggagaatcttgttgtttgcgttaagtacggtaccaaaattaaggag gaacccccttctgaagcagatccctgtataaagaaagaagaaaatagaagctgtgaatca gagcgcacagaacacagccattccccgggtgacagtagtgcacccatccagaacactgat tgctgtcattcaagtgaaagggatggaggtcagaaagggtgcatagaaagcagctcagta tttgatgcagatactctgctctacaggggaactttgaaatgtaatcatagttccaaaacc acttccctagaacagtgtaatatagttcagcctcctcctccttgtaaaatagaaaattcc atacctaatcccaatgggactgaaagtgggacttatttcacaagtttccagctgccttta ccaaggatcaaagaatcagaaactaggcagcatagttcagggcaagaaaacactgtaaaa aatccaacccatgtcccaaaagagaattttaggaaacattcacagccccggtcatttgat ttgaagacttacaaacctatgggatttgaatcttcatttctgaaatttattcaggaaagt gaagagaaagaagatgattttgatgattgggagccttcagagcacttaacattaagtaat tcttcacagtccagtaatgatttaacagggaatgttgtggcaaataatatggtgaatgac agtgaacctgaagttgacatacctcattcttccagtgactctacaattcatgagaacctg actgcaatcccacctttaatagtagctgaaacaacaacagttccttccttggaaaacctg agggttgtattggacaaagcattaacagactgtggagagcttgccttaaaacagcttcat tatcttcggccagtggtggttcttgaaagatctaagttttccacaccaattttagactta tttccaacaaaaaagacagatgagctttgtggcactctcgcagggaaaacgccgcctctg actacacttggggtggttaagacgagatcttcaaatgaaaattctcaacgctaccgtcca ctctctcatacgcttgaattccgaacaccccaatactggccaaaagactccatttctcat gcttggcgggagccgcggcccactcagtcagttgcggcccactcagtcagttccggcccc agccccgactacctgtcgccttgttgggggaaggaaggcagataccgcagcccagtcagg ggacgagtgtcggtgtggcaccggtgcacgctgaaggagccggcggaaccgggtggccat ggggatgtgggcatcgctggacgctttgtgggagatgccggccgagaagcgtatcttcgg ggccgtgctgctcttttcctggacagtgtatctttgggagaccttcctagcacagcggca gatatcaataaagggcccactgaagaaatggaatggctggggagcagacttattcttctc tttggaggaataccttatctctggagactttctggacggttctgtggttatgctggcttt ggaccagaatatgagatcactcagtccctggtgtttctgctgttggctacacttttcagt gcattgactggtttgccatggagtctttataatacttttgtgatagaagaaaaacatggc ttcaatcaacaggttcttgtcacaatctatgctgattatattgcccctttatttgacaaa ttcacacctctgcctgagggaaagcttaaagaagaaattgaagtaatggcaaagagtatt gactttcctttgacgaaggtgtatgttgtggaaggtaatattaaactagaaatttaa >gi568815597f:40158272_40392666|GENSCAN_predicted_peptide_2|185_aa MGRKAAETTRNINNAFGPGTANEQVCKGDESLEDEERSGRPSEVDNDQWRAIIGADPLTT TQEVAEELNVDHSTVVWHLKQTGKVKKPDNFLNPGKTITSEKYAQQIDEMHRKLQRLQPA LGNRKGPILLQDNTQPNIAKSILQKLNKLGYEVLPDLPHSPDLLPTDYHLFKHLNNFLKR KRFHN >gi568815597f:40158272_40392666|GENSCAN_predicted_CDS_2|558_bp atgggtcgtaaagcagcagagacaactcgcaacatcaacaacgcatttggcccaggaact gctaacgaacaagtttgcaaaggagatgagagccttgaagatgaggagcgtagtggcagg ccatcggaagttgacaatgaccaatggagagcaatcatcggagctgatccccttacaact acacaagaagttgccgaagaactcaatgtcgaccattctacggtcgtttggcatttgaag caaactggaaaggtgaaaaagcctgataactttctgaatcctggcaaaaccattacatct gagaagtatgcacagcaaatcgatgagatgcacagaaaattgcaacgcctgcaaccagca ctgggcaacagaaagggcccaattcttctccaagacaacacccaaccgaacatcgcaaaa tcaatccttcaaaagttgaacaaattgggctatgaagttttgcctgatctgccacattca cctgacctcttgccaactgactaccacttattcaagcatctcaacaactttttaaagaga aaacgttttcacaactag >gi568815597f:40158272_40392666|GENSCAN_predicted_peptide_3|1037_aa MAAATASPRSLLVLLQVVVLALAQIVSFAAPPLPGSRVRGLESGMEELCLCPAPTDTPSP RGHLRTAPRAGPLGPPFKISDLFRDGQQDGDSPAVSSPHLDMTCDLPETSHPPRGIELQA NGIQREGLLSGREVHRESGAPRVPRDRRECLDPTASTRRLNPSPCLWHIWTLLQVRLGAL ARPPTSLGQTPVSSLRFSLPDLESAGLVPCRGDNGPPGKAGPPGPKGEPGKAGPDGPDGK PGIDGLTGAKGEPGPMGIPGVKGQPGLPGPPGLPGPGFAGPPGPPGPVGLPGEIGIRGPK AFISQDGGSHKVTGKTGWRGQRQDLEEGTRGDPGPDGPSGPPGPPGKPGHAGKRGILGDP GHQGKPGPKGDVGASGEQGIPGPPGPQGIRGYPGMAGPKGETGPHGYKGMVGAIGATGPP GEEGPRGPPGRAGEKGDEVSPQAPIVQSGPLGSTGQDKAPPKAVCVRVHECVRECECVVC GSPGIRGPQGITGPKGATGPPGINGKDGTPGTPGMKGSAGQAGQPGSPGHQGLAVGEIWI GDGKQSSRCMGLPHARAETDFNFFLQGVPGQPGTKGGPGDQGEPGPRGEIGPQGIMGQKG DQGERGPVGQPGPQGRQGPKGEQGPPGIPGPQGLPGVKGDKVPDGAGKHLGKGPYNRGSG VGRTQILPEPPNLRISGFWSGRRGGVGKRGLPREDRAPRQSATPNPALRPRRHRASDPWF SLQGDPGVAGLPGEKGEKGESGEPGPKGQQGVRGEPGYPGPSGDAGAPGVQGYPGPPGPR GLAGNRGVPGQPGRQGVEGRDATDQHIVDVALKMLQGEGQQPLLTVSSRASPPPHPLPEP PRVHLSENLESWGLLPVQSLKGFGTLNKSLWAFDFRKTEPTEEQLAEVAVSAKREALGAV GMMGPPGPPGPPGYPGKQGPHGHPGPRGVPGIVGAVGQIGNTGPKGKRGEKGDPGEVGRG HPGMPGPPGIPGLPGRPGQAINGKDGDRGSPGAPGEAGRPGLPGPVGLPGFCEPAACLGA SAYASARLTEPGSIKGP >gi568815597f:40158272_40392666|GENSCAN_predicted_CDS_3|3114_bp atggccgccgctacggcctccccccgcagcctccttgttctcctccaggtggtagtgctc gctctggcgcagattgtaagtttcgcagcccctccgctgccagggtccagggtgcggggt ctggagtccgggatggaggagctctgcctgtgtcccgcgcctactgacacccctagcccg agaggccacctgaggacagcgccgcgggcaggtcccctcgggccgccttttaagatctct gatctgttcagagatgggcagcaggatggagactctccagctgtgagctcgcctcatctt gatatgacttgtgacctccctgaaacctcacacccacccaggggcattgaattgcaggca aacgggatccagagagagggtctgctttctgggagagaggtccaccgggagagcggggcc ccccgggtcccccgggaccgccgggagtgcctggatccgacggcatcgacgaggcgactc aacccctcaccctgcctctggcacatctggacgcttcttcaggttcgcttgggagccttg gccagaccaccgaccagcttgggccagactcctgtctcttcccttcggttttctctcccc gatttagaatcagctgggcttgttccctgcaggggtgacaatgggccccctggaaaagct ggccctccgggacccaagggcgagcctggcaaagctgggccagatgggccagacgggaag cccgggattgatggtttaactggagccaagggggagcctggccccatggggatccctgga gtcaagggccagcccgggcttcctggtcctcctggccttccgggccctggttttgctgga cctcctgggcctcctggacctgttggcctccctggtgagattggaatccgaggccccaag gccttcatctcccaagatggtggatctcacaaagtgaccgggaagacagggtggagaggg cagaggcaggacctggaggagggcactaggggggaccctggaccagatggaccatcgggg cccccaggaccccctgggaaacctgggcatgcgggcaaacgcgggattctgggtgatcct ggccaccaggggaagccgggtcccaagggagatgtgggtgcctctggagagcaaggcatc cctggaccaccgggtccccagggcatcaggggctacccaggcatggcagggcccaaggga gagacgggccctcatggatataaaggcatggtgggcgctatcggtgccactgggccaccg ggtgaggaaggtcctaggggaccgccaggccgagctggggagaagggtgacgaggtgagt cctcaggcacccattgttcagtcaggacccctggggagtactgggcaggacaaggcaccc cctaaggctgtgtgtgtgagagtgcatgagtgtgtgcgtgagtgtgaatgtgtagtgtgt ggcagcccaggtattcgtggaccccaggggatcacaggcccgaaaggagcaacgggcccc ccaggcatcaacggcaaggatgggaccccaggcacgcctggcatgaagggcagtgcagga caggcgggacagcccggaagtccaggccaccagggcctagcggtgggggaaatctggatt ggggatgggaagcaaagcagcaggtgcatggggctccctcatgccagggcagaaactgac ttcaacttctttctgcagggtgtgccaggccagcctgggacaaaaggaggccctggagac cagggagagccagggcctcgaggagaaattggtccccagggcatcatgggacagaagggt gaccaaggcgagaggggtccagtggggcaaccaggccctcagggaaggcagggccctaag ggggagcagggcccccccggaattccagggccccaaggcttgccaggcgtcaaaggagac aaggtgccagatggggctgggaaacacctgggaaaggggccctataacagggggagtggg gtcggcaggactcagatccttccggagcctccaaacctgcggatctcagggttctggtct ggtcggcgaggcggagttggaaagagggggctccccagggaagaccgggccccgcggcaa agtgcgacccccaaccctgctctgcgtccccgccgccaccgcgcgtctgacccgtggttc tctctgcagggtgacccaggggtggccggcctccccggagagaaaggcgagaagggcgag tccggcgagccggggcccaagggacagcaaggagtacgtggagaacccggctaccctggc cccagcggggatgcgggcgccccaggggttcagggctaccctggtccccccggccctcga ggactggccgggaaccgaggcgtgccaggacagcccgggagacagggcgtggagggccgg gatgccactgaccagcacatcgtggatgtggcgctgaagatgctgcaaggtgaggggcag caacccctcctcacagtcagttcgagggcatcgccgccccctcaccccctcccggagcct ccacgtgttcacttgtctgaaaatctggagtcctgggggctccttccagtccagtctctg aagggttttgggaccttgaataagtcactctgggcctttgacttccgcaaaacagagccc acggaagagcaactggcagaggtcgccgtgagtgccaagcgggaagccctgggtgcggtg ggcatgatgggtcctccaggacctcctgggccccctgggtacccaggcaagcagggcccc catgggcaccctggccctcggggcgttcctggcatcgtgggagccgtgggtcagatcggc aacacggggcccaagggaaaacgtggagagaagggtgatccaggagaagtgggacggggg caccccgggatgcctgggcccccagggatcccaggactccctggccggcctggccaggca atcaacggcaaggatggagatcgagggtccccaggggctccaggagaggcaggtcgacct ggcctgccaggccccgtggggctgccgggcttctgtgaacctgccgcctgccttggagct tcggcctatgcctctgcccgccttacagagcctggatccatcaaggggccttga >gi568815597f:40158272_40392666|GENSCAN_predicted_peptide_4|132_aa MYMRIYKKGDIVDIKGMGTVQKGMSHKCYHGKTGRVYNVPQHAVGIVVNKQVKGKILAKR INVCIEHIKHSKSRDSFLKRVKENDQKKKEAKEKVTWVQRKHQPAPPREAQCVRTNGKEP ELLETIPYEFMA >gi568815597f:40158272_40392666|GENSCAN_predicted_CDS_4|399_bp atgtatatgcgaatctataagaaaggtgatattgtagacatcaagggaatgggtactgtt caaaaaggaatgtcccacaagtgttaccatggcaaaactggaagagtctacaatgttccc cagcatgctgttggcattgttgtaaacaaacaagttaagggcaagattcttgccaagaga attaatgtgtgtattgagcacattaagcactctaaaagccgagatagcttcctgaaacgt gtgaaggaaaatgatcagaaaaagaaagaagccaaagagaaagttacctgggttcaacgg aagcaccagcctgctccacccagagaagcacagtgtgtgagaaccaatgggaaggagcct gagctgctggaaactattccctatgaattcatggcataa >gi568815597f:40158272_40392666|GENSCAN_predicted_peptide_5|106_aa MMLHRKEGPTHKRCFVIATLLEKGAFNSSCQAVIMVCCRRGPLGAARAPASLIYCSPGVG LKEQLVGKGAGPFLPALQPTSPSYRRWEETTGRLEDPKMHNPMLWL >gi568815597f:40158272_40392666|GENSCAN_predicted_CDS_5|321_bp atgatgctccacaggaaagaagggccaactcacaaacgttgctttgtaatagccacgctc ctggagaaaggggcttttaattcgtcttgtcaagctgtcatcatggtgtgctgccgcaga ggaccactaggtgccgccagagcaccagccagcctgatttactgctcacctggagtcggt ctaaaggagcaactggtggggaagggagctgggccttttcttcctgccctgcagcccacc tctccttcctatcggaggtgggaagaaacgactggaagattggaagatcctaaaatgcac aaccctatgctgtggctgtag >gi568815597f:40158272_40392666|GENSCAN_predicted_peptide_6|338_aa MGKKQNRKTGSSKNQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSA MEDEMNEMKREGKFRERRVKRNEQSLQEIWDYVKRPNLHLIGVPESDGENGTKLENTLQD IIQENFPNLARQANVQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRV TLKGKPIRLTADLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFTDK QMLRDFVTTRPALKELLKEALNMERNNPYQPLQNHAKM >gi568815597f:40158272_40392666|GENSCAN_predicted_CDS_6|1017_bp atggggaaaaaacagaacagaaaaactggaagctccaaaaatcagagcgcctctcctcct ccaaaggaacgcagctcctcaccagcaacagaacaaagctggatggagaatgactttgac gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtgtcagca atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaagaagagtaaaa agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacatctg attggtgtacctgaaagtgatggggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatttagcaaggcaggccaacgttcagattcaggaaata cagagaacgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggtt accctcaaagggaagcccatcagactaacagcggatctctcagcagaaaccctacaagcc agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt tcatatccagccaagctaagcttcataagtgaaggagaaataaaatactttacagacaag caaatgctgagagattttgtcaccaccagacctgccctaaaagagctcctgaaggaagcg ctaaacatggaaaggaacaacccgtaccagccactgcaaaatcatgccaaaatgtaa >gi568815597f:40158272_40392666|GENSCAN_predicted_peptide_7|974_aa MDKFLDTYTLPRLNQEEVESLNRPITGAEIVAIINSLPTKKSPGLDGFTAEFYQRYKEEL VPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILA NRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRTKDKNHMIISIDAEKAFDKI QQPFMLKTLNKLGIDGTYFKIIRAIYDKPIANIILNGQKLEAFPLKTGTRQGCPLSPLLF NIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKV SGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPL LKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFI WNQKRAHIAKSILSQKNKAGGITLPDFKLYYKATITKTAWYWYQNRDIDQWNRTEPSEIT PHTYNCLIFDKPEKNKKWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLN VRPKTIKTLEENLGITIQNIGMGKDFMSKTPKAMATKDKIDKWDLIKLKSFCTAKETTIR VNRQPTKWEKIFATYSSDKGLISRIYNELNQIYKKKTNNPIKKWAKDMNRHFSKEDIYAA KKHMKKCSSSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRCWRGCGEIGTLLHCWWDCK LVQPLWKSVWRFLRDLELEIPFDPAIPLLGIYPNDYKSCCYKDTCTRMFIAALFTIAKTW NQPKCPTMIDWIKKMWHIYTMEYYAAIKNDEFMSFVGTWMKLEIIILSKLSQEQKTKHRI FSLIGKSVKDVDRYQAVLANLLLEEDNKFCADCQSKGFVNAGPLMAELQVSPQWKAPEMS QICLSCGHPSALSLPNETAPCSTAPGPINHPRAEVCGRTARDWRAAPPAAPVRDPLDEAS WAPESGGALENLYV >gi568815597f:40158272_40392666|GENSCAN_predicted_CDS_7|2925_bp atggataaattcctcgacacatacactctcccaagactaaaccaggaagaagttgaatct cttaatagaccaataacaggagctgaaattgtggcaataatcaatagcttaccaaccaaa aagagtccaggactagatggattcacagccgaattctaccagaggtacaaggaggaactg gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca ttttatgaggccagcatcattctgataccaaagccgggcagagacacaacaaaaaaagag aattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggca aaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct gggatgcaaggctggttcaatatacgcaaatcaataaatgtaatccagcatataaacaga accaaagacaaaaaccacatgattatctcaatagatgcagaaaaagcctttgacaaaatt caacaacccttcatgctaaaaactctcaataaattaggtattgatgggacgtatttcaaa ataataagagctatctatgacaaacccatagccaatatcatactgaatgggcaaaaactg gaagcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattc aacatagtgttggaagttctggccagggcaattaggcaggagaaggaaataaagggtatt caattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatcta gaaaaccccattgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtc tcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaacaacagacaa acagagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatac ctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactg ctcaaggaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggtagga agaatcaatattgtgaaaatggccatactgcccaaggtaatttacagattcaatgccatc cccatcaagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcata tggaaccaaaaaagagcccacatcgccaagtcaatcctaagccaaaagaacaaagctgga ggcatcacactacctgacttcaaactttactacaaggctacaataaccaaaacagcatgg tactggtaccaaaacagagatatagatcaatggaacagaacagagccctcagaaataacg ccgcatacctacaactgtctgatctttgacaaacctgagaaaaacaagaaatggggaaag gattccctatttaataaatggtgctgggaaaactggctagccatatgtagaaagctgaaa ctggatcccttccttacaccttatacaaaaatcaattcaagatggattaaagatttaaac gttagacctaaaaccataaaaaccctagaagaaaacctaggcattaccattcagaacata ggcatgggcaaggacttcatgtccaaaacaccaaaagcaatggcaacaaaagacaaaatt gacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcaga gtgaacaggcaacctacaaaatgggagaaaattttcgcaacctactcatctgacaaaggg ctaatatccagaatctacaatgaactcaaccaaatttacaagaaaaaaacaaacaacccc atcaaaaagtgggcaaaggacatgaacagacacttctcaaaagaagacatttatgcagcc aaaaaacacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaacc acaatgagataccatctcacaccagttagaatggcaatcattaaaaagtcaggaaacaac aggtgctggagaggatgtggagaaataggaacacttttacattgttggtgggactgtaaa ctagttcaaccattgtggaagtcagtgtggcgattcctcagggatctagaactagaaata ccatttgacccagccatcccattactgggtatatacccaaatgactataaatcatgctgc tataaagacacatgcacacgtatgtttattgcggcattattcacaatagcaaagacttgg aaccaacccaaatgtccaacaatgatagactggattaagaaaatgtggcacatatacacc atggaatactatgcagccataaaaaatgatgagttcatgtcctttgtagggacatggatg aaattggaaatcatcattcttagtaaactatcacaagaacaaaaaaccaaacaccgcata ttctcactcataggcaagtcggtgaaggacgtggatcggtaccaggctgtcctggccaac ctgctgctggaggaggataacaagttttgtgcagattgccagtctaaaggctttgtgaat gccggtccactgatggctgaactgcaggtctctccccagtggaaagccccagagatgagc cagatctgcctcagctgtggccatccgtcagccctgagcctccccaacgagaccgccccc tgctccacggcacccggtcccatcaaccacccaagggctgaggtgtgtgggcgcacagcg cgggactggcgggcagctccacctgcagccccagtgcgagatccactggatgaagccagc tgggctcctgagtctggtggggccttggagaacctttatgtctag >gi568815597f:40158272_40392666|GENSCAN_predicted_peptide_8|46_aa DDVSHQNLLLRGQRIRFMSSSSFQSSTGGRSTEMKWHKEAITTKQP >gi568815597f:40158272_40392666|GENSCAN_predicted_CDS_8|141_bp gatgatgtgtcacaccaaaacctcctactgagagggcagaggattcgcttcatgtctagc tccagcttccaaagcagcacaggtggcagaagcacagaaatgaaatggcacaaagaagca attaccacaaagcaaccatga