GENSCAN 1.0    Date run: 4-Nov-116    Time: 15:28:38

Sequence gi568815575f:102502142_102703815 : 201674 bp : 39.06% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   4945   5085  141  0  0   77   43   129 0.279   4.25
 1.02 PlyA +   5358   5363    6                               1.05

 2.06 PlyA -   6286   6281    6                               1.05
 2.05 Term -   8824   8651  174  1  0   87   42    60 0.055  -1.92
 2.04 Intr -  11857  11701  157  0  1   52   22   115 0.025   0.39
 2.03 Intr -  15013  14872  142  2  1   26   70   139 0.014   4.39
 2.02 Intr -  19223  19152   72  0  0   70  101    48 0.014   2.76
 2.01 Init -  39952  39796  157  1  1   55   91    98 0.693   6.92
 2.00 Prom -  40347  40308   40                              -4.35

 3.02 PlyA -  40427  40422    6                               1.05
 3.01 Sngl -  45179  44526  654  2  0   61   39   770 0.723  65.32
 3.00 Prom -  45259  45220   40                             -12.33

 4.00 Prom +  45703  45742   40                             -11.93
 4.01 Init +  46060  46409  350  0  2   99   24   155 0.915   6.90
 4.02 Intr +  47617  47693   77  1  2  122  116    13 0.981   5.74
 4.03 Intr +  47791  47954  164  2  2  120   49   123 0.497  10.37
 4.04 Intr +  48002  48044   43  1  1   71  113    15 0.443  -0.81
 4.05 Intr +  59311  59494  184  2  1   62  106   175 0.991  14.62
 4.06 Intr +  59737  59869  133  1  1   72   79    50 0.444   2.23
 4.07 Intr +  60655  60759  105  0  0   93   71    93 0.533   7.59
 4.08 Intr +  61502  61690  189  1  0   40   67    93 0.346   1.26
 4.09 Intr +  61796  61905  110  1  2   91   86    17 0.773  -0.04
 4.10 Intr +  62668  62898  231  1  0   72   94    70 0.607   1.97
 4.11 Intr +  63954  64095  142  0  1  149  116    46 0.771  12.93
 4.12 Intr +  64174  64229   56  0  2   53   89    36 0.998  -2.94
 4.13 Intr +  64376  64483  108  2  0   88   84   152 0.999  13.28
 4.14 Intr +  64599  64881  283  0  1  118  101   181 0.846  18.90
 4.15 Intr +  64993  65035   43  0  1  112  113    39 0.993   5.69
 4.16 Intr +  65712  65784   73  1  1   15  108    48 0.929  -2.95
 4.17 Intr +  66277  66459  183  1  0   70   81   137 0.744   9.18
 4.18 Term +  73426  73579  154  1  1  -16   54   243 0.453   6.91
 4.19 PlyA +  73603  73608    6                               1.05

 5.00 Prom +  74051  74090   40                             -10.55
 5.01 Init +  74694  75250  557  2  2   64   -2   234 0.176   6.49
 5.02 Term +  75356  76931 1576  2  1   -9   38   630 0.099  37.09
 5.03 PlyA +  77064  77069    6                               1.05

 6.00 Prom +  77823  77862   40                              -3.65
 6.01 Init +  86026  86376  351  0  0   58   86   156 0.426   9.61
 6.02 Intr +  97207  97268   62  0  2  104   54   114 0.275   6.21
 6.03 Intr +  97345  97431   87  1  0   15   71   139 0.788   3.07
 6.04 Intr +  97571  97794  224  2  2   39   81   170 0.669   8.25
 6.05 Intr +  98779  98903  125  2  2   78   87    97 0.206   7.98
 6.06 Term + 100106 101677 1572  1  0   -2   42   851 0.034  60.83
 6.07 PlyA + 101989 101994    6                               1.05

 7.00 Prom + 106471 106510   40                              -6.05
 7.01 Init + 108897 109059  163  2  1   62   64   146 0.330   9.54
 7.02 Term + 114170 115479 1310  0  2   63   50   153 0.113  -0.25
 7.03 PlyA + 116335 116340    6                               1.05

 8.09 PlyA - 116578 116573    6                               1.05
 8.08 Term - 119060 118573  488  0  2   29   54   288 0.380  13.38
 8.07 Intr - 133116 132980  137  2  2   60   72    78 0.002   2.69
 8.06 Intr - 139412 139262  151  0  1  104   33    45 0.002  -1.10
 8.05 Intr - 143906 143747  160  2  1  -59   77   164 0.096  -0.76
 8.04 Intr - 144147 143965  183  0  0   57   98   139 0.440  10.86
 8.03 Intr - 149405 149182  224  0  2  115   64    56 0.495   2.82
 8.02 Intr - 150066 149655  412  0  1   86    6   261 0.341  10.53
 8.01 Init - 150811 150668  144  1  0   68   69    65 0.504   0.94
 8.00 Prom - 150895 150856   40                              -8.25

 9.00 Prom + 151289 151328   40                             -12.33
 9.01 Sngl + 151773 155960 4188  2  0   70   39  3007 0.984 282.43
 9.02 PlyA + 156287 156292    6                               1.05

10.00 Prom + 156384 156423   40                             -11.64
10.01 Init + 156779 156781    3  1  0   76  115     0 0.055   1.55
10.02 Intr + 157987 158127  141  0  0   17   90   131 0.067   5.83
10.03 Intr + 158974 158987   14  0  2   90  115    15 0.080  -2.94
10.04 Intr + 166949 167262  314  2  2  105   89   134 0.128  10.10
10.05 Term + 167566 167924  359  2  2   76   42   210 0.125   8.89
10.06 PlyA + 169181 169186    6                               1.05

11.00 Prom + 170425 170464   40                              -5.35
11.01 Init + 192457 192644  188  0  2   73   87   183 0.889  15.28
11.02 Intr + 196155 196204   50  0  2   65   62    17 0.233  -5.79
11.03 Term + 198958 199073  116  2  2   90   39   122 0.710   5.25
11.04 PlyA + 200015 200020    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  86845  86865   21  0  0  125   44    20 0.883  -1.37
S.002 Sngl + 100001 101677 1677  1  0   75   42   898 0.958  78.79

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575f:102502142_102703815|GENSCAN_predicted_peptide_1|46_aa
TVKKPKDHKTTETSLARQVNEFIKTYIQDIPGQQQDNYRELPTTHL

>gi568815575f:102502142_102703815|GENSCAN_predicted_CDS_1|141_bp
actgtgaagaagccaaaggaccacaaaacaactgagacaagtctagctcggcaagtaaat
gagtttattaagacttacatacaggacattcctgggcagcagcaggacaactacagagaa
ctgcccaccactcatctctaa

>gi568815575f:102502142_102703815|GENSCAN_predicted_peptide_2|233_aa
MDVVGNQGSFFATISEEKLKVAQSQFVKSEEWILFSRQNPPPGRRLSVALASGLGEEEEL
VRMDKIGECSVLEAMQSSLGPNGYKKPKEPKYRSFSLDLGVPDVLSSLGEMCLLIAEASV
IVKEDFRRLRISSHLCSLAKKSEVANVSLNSFLNLPLVHMFQMADAVNNLTIDDLCNVSF
LKIGIVDILRGTDGVISWQYFYFSLLVLQVLVHGTATLASPGLVSSTNSKALS

>gi568815575f:102502142_102703815|GENSCAN_predicted_CDS_2|702_bp
atggatgtagtcggcaatcagggaagcttctttgccacaatctctgaagaaaagttaaag
gtagcacagagtcagtttgtgaagagcgaagaatggattctgttcagtagacagaaccca
cctccagggaggagactttctgtggctctggcttcaggtctgggagaagaggaagagcta
gtgaggatggataaaataggagagtgcagtgtcctggaagccatgcaatcatccttgggc
cccaatggctataaaaagccgaaagagcctaaataccggtcattcagcttggacctcgga
gttccagatgttctaagctcactgggcgagatgtgtctgcttattgcagaggcctctgtt
attgtgaaagaagattttaggcgtcttcggatatcttctcacctatgttccctggctaag
aagtcagaggtagccaatgtttccttaaattcatttttaaacttaccattggtgcatatg
ttccagatggcagatgctgtcaataatctcaccattgatgacctttgtaatgtttccttc
ctgaaaattggtattgttgatatcttgagaggaacagatggagtaatcagttggcaatac
ttttactttagcttacttgttctccaagtattggttcatggaacagcaacattagcatca
cctgggcttgttagcagtacaaattctaaggctctatcctag

>gi568815575f:102502142_102703815|GENSCAN_predicted_peptide_3|217_aa
MTAAMMYGITSCRMRTESEPSCGSPVVSGDPKEDHNCTNAKSSNARSTLPTSDSIPSSSL
ADDHYKFASKESQEGNEGSEGSFRSHESHSDREEDDRKHSQKEPRDALGDSGYASQHKKR
QDFAKARKVPSDTLPLKKRRTVKHRPRTAPHQGEEEEIKEAAGSLLHLAEIRSCLNNITN
RTAKGQEEQNEITKKLKTSHCFVLNLRPFGFSMSGDF
>gi568815575f:102502142_102703815|GENSCAN_predicted_CDS_3|654_bp atgacggctgccatgatgtacggcatcaccagctgccggatgcggactgagagcgagcca tcgtgtggctccccggtggttagcggagaccctaaggaggatcacaactgcaccaatgcc aaatcctccaacgcccggagcaccttgcccaccagcgactccatcccctcctcctccttg gccgacgaccactacaagtttgccagcaaggagagccaggagggcaatgagggcagcgag gggagcttccggagccacgagagccacagcgacagggaagaggacgacaggaaacacagc cagaaggagcccagggatgctctgggggacagcgggtacgcatcccagcacaagaaacgc caggactttgctaaggccaggaaggtccccagcgacacactgcccctcaaaaagagacgc actgtaaagcaccgcccccgcaccgccccccaccagggcgaagaagaggagattaaagaa gcggcggggtcgctcctgcacttagcagagatccgatcctgtttgaataacatcaccaat cggacggcaaaggggcaggaagagcaaaacgaaatcacaaaaaaattaaaaacaagtcac tgttttgttttgaacttaaggccatttggtttcagcatgtcaggagatttctaa >gi568815575f:102502142_102703815|GENSCAN_predicted_peptide_4|875_aa MAGPIYMKAPESEWVQETEVRIASVTEGQKGLHNHENASLLGASITWPRVASHGRIPGMR RRWTHLWPSRWVAQSPSRAAAGTPAPVPGARPGQKLGFEFTKGLGLIVRETVLHISGSSL CKYFKDSRNIKTLKLVMGKTGQDLWGQLLKHTKHDIVDSLSVLSKTQHDLSSFLVDMCYQ KASTCLLPWTGPESQRWKWMLYFSVYGVLKEEHSDGSSSPQGENKGGDSSQGNFGKENLH DEHDGYELPPPHLQKVDENVEVGDVHEDPQIRNNPSTLQPDSRSVKCHSEYQDRIPPERE VEKNTQNGDPGTWFKVTFHYNKNRAHFFIQDASAACALKKVNCKIHDEENQKLLSLNLCD NKLYHLDGLPDIIEKAPKVKTLNLSKNKVRRGNQINFGWRVDGSTYQNNGNKQAELKSAW ELGKVKGLKLEELWLEGNSLCSTFSDQSAYVRGALGSPRWVAFLGQGVSTEGHRSQGPSE ENAVVWRGGPQTLNPMLSWGLTLHSSREKVSCPRGCSVSHAQLRLSSHRPVHLLTHPALP PWYALLTASFLSQDGQELASPIIIGIEAPEIIKPCKESYKGSETIKSLVLQFLLQYYLIY DSEDRTGLLSVYHDKACFSLTITLNPEDPEPSSLEKYFKDSRNIKNIKDPCECVMGEDQH GKWGGESIIGYRVQGVAALNASLSHTGLRIQLLKHTKREIVDSLSVLPRTQHDLNSYVVD LCIQTERMLVFSVNGVFKEVERESPGSVLAFTRTFILTSVGNSNLYIVNDKLIVRNASTK ETQSAFSIPVPAPSSSSLPTLSQKQQEMVETVSTQSGMKLEQSQKITNAEKSLKDLMELK TMVRELREKCTSFSSRFDQLEERVSVIEDQMNEMK >gi568815575f:102502142_102703815|GENSCAN_predicted_CDS_4|2628_bp atggcgggacccatatacatgaaggctcctgagtcagaatgggtccaggaaaccgaggtc cgcattgcttccgtcacagaaggacagaagggcctccataatcacgagaatgctagtctt ctaggggcttctatcacctggccccgcgttgcttctcacggccgcattccggggatgcgg cgacgctggactcacctgtggcccagccgctgggtcgctcagtctcccagccgagctgct 
gccggaaccccggctcctgtgcccggcgcccgcccgggccagaagttaggtttcgaattc actaaaggtttaggattaatagttcgtgaaactgttttacacatttcagggagcagcttg tgcaagtacttcaaggatagcagaaatataaaaactctcaagcttgtgatggggaagact gggcaagacctgtgggggcagctgctgaagcacacaaaacatgacattgtggactccctc agtgtgttgtccaaaactcagcatgacctcagctccttcctggtggacatgtgttaccag aaggcaagcacctgcttactcccttggacaggccctgagagccagaggtggaaatggatg ctctacttttctgtctacggtgtgttgaaagaagaacacagtgatggtagcagctctcct caaggagaaaataaaggtggagattcttcccaggggaattttggaaaggagaaccttcat gatgaacatgatgggtatgagctcccacctccccatcttcagaaggtagatgaaaatgtg gaggtgggggatgtccacgaggatccccaaataagaaacaacccctctaccttacaaccc gatagtaggagtgtgaaatgccatagtgaataccaagatagaattcctccagagagagaa gtggagaagaacacacagaatggagacccagggacctggttcaaggtcacattccactac aacaaaaatcgggcccatttctttattcaggatgctagtgctgcctgtgcattaaagaaa gtcaactgcaagattcatgatgaggaaaaccaaaagctattatctttgaacttgtgcgac aacaaactgtaccacctggatggcctgcctgacattatagagaaggctcccaaagtcaag accctgaatctctccaaaaataaggtgagaagggggaaccagatcaactttggatggagg gtggatggcagtacatatcagaataatggcaacaagcaggcagagctgaagtcggcttgg gagttgggcaaggtgaaagggttgaagctcgaagagctatggctggaagggaactcattg tgcagcaccttctctgaccagtccgcctatgtaagaggagccctgggtagcccaagatgg gtagcattcctcggtcaaggcgtcagcacagaggggcacaggagtcagggaccatcagaa gagaatgcagtggtctggagagggggtccccagactctgaaccccatgctgagctggggc ctgactcttcactcctcccgggagaaggtctcctgtccccgtggctgctctgtttcccat gcccagctcagactgagctcacacaggcctgttcatctccttactcatccagctctacct ccctggtatgcactgctgactgcctctttcctttctcaggatggccaggagttagcatct ccaattataattggcattgaagcccctgagataataaaaccttgtaaggaaagctataaa ggatctgagaccataaagagtctggtgcttcagttcctgcttcagtattacttgatctat gactctgaagatcgaacgggtctcctcagtgtttaccatgacaaggcctgcttctccctg accattaccctcaaccctgaggacccagaaccgagcagcttggaaaaatacttcaaggat agcaggaatataaagaatatcaaggacccttgtgagtgtgtgatgggtgaagatcagcat gggaaatggggtggagagtcaataatagggtacagggtacagggtgtggcagccctcaat gcttctctttctcacacaggcctgaggattcagctgctgaagcacacaaaacgtgagatt gtggactccctcagtgtattgcccagaactcagcatgaccttaactcctatgtggtagac 
ttgtgcatccaaacggaaaggatgctcgtcttttctgtcaatggagtatttaaggaagtg gaaagagagtctccaggttctgttcttgccttcacccgaaccttcatcttgacttctgtc ggcaattccaatctgtatattgtgaatgacaagctgattgtgaggaatgccagcacgaag gagacccagagtgccttctccatcccagtgcctgcaccctcctccagctccttgcctacc ctctcccagaagcagcaggaaatggtggagactgtctccacccagtctgggatgaaactt gagcagtctcagaaaataaccaatgcagagaagtccttaaaggacctgatggagctgaaa accatggtacgagaactacgtgaaaaatgcacaagcttcagcagccgattcgatcaactg gaagaaagggtatcagtgattgaagatcaaatgaatgaaatgaaatga >gi568815575f:102502142_102703815|GENSCAN_predicted_peptide_5|710_aa MGDVNTPLSTLDRSTRQKVHKDIQELNSALHQVDLIDIYRTLHPKSTEYTFFLARHHTYS KIDHILGSKALLSKCKRTEIITNCLSDHSAIKLELKIKKLTQNQSTTWKLNNLLLYDYWV HNEMKAEIKMFFETNENKDTAYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQERWFFERINKIDRLLARLIKKKREKNQIDAIKNDKGDITTDPTEIQTTIREYYKH LYANKLENLEEMDKFLDTYTLPRLNQEEVESLNRPITGSEIEATINSLPTKKGPGPDGFT AEFYQRYKKELVPFLLKLFQLIEEKGILPNSFYEDSIILIPKPGRDTTKKENSRPISLMN IDGKILNKILANQIQQHIKKLIHHDQVGFIPGMQGWFSTRKSINVIQHINRTNNKNHMII SIDAEKAFDKIQQPFMLKTLNKLDIDGTYFKIITAIYDKPTANIIPNGQKLEAFPLKTGT RQGCPLSPLLFNIVLEVLARAIRQEKEIRGIQLGKDEVKLSLFADDMIVYLENPIVSAQN LLKLISNFSKFSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIELTRDV KDLFKENYKPLLKEIKADTNKWKNIPCSWIGRINIVKMAILPKVIYRFNAIPIKLPMTFF TQLEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTA >gi568815575f:102502142_102703815|GENSCAN_predicted_CDS_5|2133_bp atgggggacgttaacaccccactgtcaacattagacagatcaacgagacagaaagttcac aaggatatccaggaattgaactcagctctacaccaagtggacctaatagacatctacaga actctccaccccaaatcaacagaatatacattctttttagcacgacaccacacctattcc aaaattgaccacatacttggaagtaaagcactcctcagcaaatgtaaaagaacagaaatt ataacaaactgtctttcagaccacagtgcaatcaaactagaactcaagattaagaaactc actcaaaaccagtcaactacatggaaactgaacaacctgctcctgtatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca gcataccagaatctctgggacacattcaaagcagtgtgtagggggaaatttatagcgcta aatgcccacaagagaaagcaggaaagatctaaaattgacactctaacatcacaattaaaa gaactagagaagcaagagaggtggttttttgaaaggatcaacaaaattgatagactgcta 
gcaagattaataaagaagaaaagagagaagaatcaaatagacgcaataaaaaatgataaa ggggatatcaccaccgatcccacagaaatacaaactaccatcagagaatactacaaacac ctctatgcgaataaactagaaaatctagaagaaatggataaattcctcgacacatacacc ctcccaagactaaaccaggaagaagttgaatctctgaatagaccaataacaggctctgaa attgaggcaacaattaatagcttaccaaccaaaaaaggtccaggaccagatggattcaca gccgaattctaccagaggtacaagaaggagctggtaccattccttctgaaactattccaa ttgatagaagaaaagggaatcctccctaactcattttatgaggacagcatcatcctgata ccaaagccgggcagagacacaacaaaaaaagagaattctagaccaatatccctgatgaac atcgatggaaaaatcctcaataaaatactggcaaaccaaatccagcagcacatcaaaaag cttatccaccatgatcaagtgggtttcatccctgggatgcaaggctggttcagcacacgc aaatcaataaacgtaatccagcatataaacagaaccaacaacaaaaaccacatgattatc tcaatagatgcagaaaaggcctttgacaaaattcaacaacccttcatgctaaaaactctc aataaattagatattgatgggacatatttcaaaataataacagctatctatgacaaaccc acagccaatatcataccgaatgggcaaaaactggaagcattccctttgaaaactggcaca agacagggatgccctctctcaccactcctattcaacatagtgttggaagttctggccagg gcaatcaggcaggagaaggaaataaggggtattcaattaggaaaagacgaagtcaaattg tccctgtttgcagacgacatgattgtatatctagaaaaccccattgtctcagcccaaaat ctccttaagctgataagcaactttagcaaattctcaggatacaaaatcaatgtacaaaaa tcacaagcattcttatacaccaacaacagacaaacagagagccaaatcatgagtgaactc ccattcacaattgcttcaaagagaataaaatacctaggaatcgaacttacaagggatgtg aaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagcggatacaaac aaatggaagaacattccatgctcatggataggaagaatcaatattgtgaaaatggccata ctgcccaaggtaatttacagattcaatgccatccccatcaagctaccaatgactttcttc acacaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatcgcc aagtcaatcctaagccaaaagaacaaagctggaggcatcacactacctgacttcaaacta tactacaaggctacagtaaccaaaacagcatag >gi568815575f:102502142_102703815|GENSCAN_predicted_peptide_6|806_aa MIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYISNRQTESQIMSELPFTIAS KRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKRRF PYIRNSSLRIPIPEAATGRIPINYALAPPTAVVGNADVEEPRLRRKRCRVNPHTQPPPLP RVLVSDCLFLLLYDTAPRGKEIRKLPLIQAEKVSEKVRIAWGWMWVSEVNTGYATWSWEA GGRSVPRSVELSVVKFCKMVTQLKTRKLRRRENCPVSVKVGVKAEAVAEAELKTESVTQA KAGDGAMTRTHTVTYREAMAVTREVIKVEDTTKTRVMVETKTKPLAERSIVPQTKSKAMP 
MSRVSTVTKSEVKVVAVIEANIRSYAKSHDKANTGSRPDRREETSIGMKSSDEDEENICS WFWTGEEPSVGSWFWPEEETSLQVYKPLPKIQEKPKPTHKPTLTIKQKVIAWSRARYIVL VPVEGGEQSLPPEGNWTLVETLIETPLGIRPLTKIPPYHGPYYQTLAEIKKQIRQREKYG PNPKACHCKSRGFSLEPKEFDKLVALLKLTKDPFIHEIATMIMGISPAYPFTQDIIHDVG ITVMIENLVNNPNVKEHPGALSMVDDSSESSEEPKSGESYIHQVCKGIISCPLNSPVQLA GLKLLGHLSIKFEDHYVITSYIPDFLTLLNKGSVKTKFYVLKVFSCLSKNHANTRELISA KVLSSLVAPFNKNESKANILNIIEIFENINFQFKTKAKLFTKEKFTKSELISIFQEAKQF GQKLQDLAEHSDPEVRDKVIRLILKL >gi568815575f:102502142_102703815|GENSCAN_predicted_CDS_6|2421_bp atgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatac atcagcaacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttca aagagaataaaatacctaggaatccaacttacaagggacgtgaaggacctcttcaaggag aactacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaacattcca tgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaagcgccgtttt ccctatatccggaattcgtccctgcgcattccaatacctgaagcggccacaggccggatc cccataaactacgctctagctcctcccactgccgttgtgggtaacgcggacgtggaagaa cctcgtctgcggaggaaaaggtgccgagtcaatccccatacacagccgccgccattgcct cgagtccttgtgtctgactgtctgttcctgctgctgtatgacacagcacctcgaggcaag gaaataagaaaactgcctctgatccaagcagagaaggtcagtgagaaggtgcgcatcgct tgggggtggatgtgggtatcagaggtgaacactggatacgcgacctggtcatgggaggcg ggcggtagatctgtccccaggtctgtggaactgtcagttgtgaagttttgtaaaatggtc acccaacttaaaactaggaaattacgaagaagagaaaattgccctgtatctgttaaggtt ggtgtaaaggccgaggcagtggctgaggcagaactgaaaacagaatcagtgacccaggcc aaagctggtgatggagcaatgaccaggacacatacagtgacctacagggaggctatggct gtgacaagggaagtgatcaaggtggaagatacaactaagactagagtcatggttgagact aagacaaaacccctggcagaacgcagtatagtgccacaaaccaagtcaaaggccatgcct atgtctagggtcagtactgtaaccaagtctgaagtcaaggttgttgctgtcattgaggca aatattaggtcctatgccaagtcacatgataaggccaatactgggtccagacctgacaga agggaagagaccagcattgggatgaaatccagtgatgaggatgaagaaaatatatgctcc tggttctggactggagaagagcctagtgtagggtcctggttctggcctgaagaagagacc tctcttcaagtttataagcccctacctaagatccaggaaaagcccaagcccacacacaaa cccacacttactataaaacaaaaggtaatagcatggtcaagggccaggtatattgtccta 
gttccagttgaaggaggggagcaatccttgcctccagaaggaaactggaccctggttgag accttgattgaaactcctctggggattcgacctttgaccaagatcccaccttatcatggg ccttattaccagaccttagctgagatcaaaaaacagattaggcaaagggaaaagtatggg cctaatccgaaggcctgccactgcaaatcacgtggctttagtttagagcctaaagagttt gataaacttgttgccctccttaagttaactaaggatcctttcattcatgaaatagctaca atgataatgggcatcagtcctgcttatccatttactcaagatataattcatgatgtaggt attactgttatgattgaaaacttggtcaataatcccaatgttaaagaacaccctggagct ttaagtatggtggatgacagctctgagtcttccgaagaaccaaaatcaggggagtcatat atacatcaagtttgtaaaggcataatctcttgccccttgaactcccctgtgcagctggct ggactgaaattactagggcacttgagtataaaatttgaagatcactatgtgattaccagt tatattccagatttcctcaccttgttaaacaagggaagtgtcaaaaccaagttttatgtt ttaaaagtgttttcgtgtttgtctaaaaatcacgccaatacaagagaattgatcagtgcc aaagtactgtcatcattggttgcaccctttaacaagaatgagtcaaaggccaatattctt aatattattgaaatatttgagaatataaattttcagttcaaaacaaaggcaaagctattt accaaggaaaagttcactaaatctgagcttatttcaatattccaggaagcaaaacagttt ggtcagaaactccaagacttagcagagcacagtgatcccgaagtgagagataaagtcata cgattaatactaaaactctga >gi568815575f:102502142_102703815|GENSCAN_predicted_peptide_7|490_aa MVDPLTPCTVHLEKSQTLNNKPVKAAGSGAVLCKATGSELPRAVGAHLLHQHNPGRINIV KMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGSTLP DFKLYYKATVTKTAWYWYQNRDIDQWNRTGPSEITPHTYNYLIFDKPEKNKQWGNDSLFN KWCWENWLAICRKLKLDPFLTPYTRINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKD FMSKTPKAMATKDKIDKWDLIKLKSFCTAKETTIRVNRQPTKWEKIFATYSSDKGLISRI YNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKRHMKKCSSSLAIREMQIKTTMRYH LTPVRMAIIKKSGNNRCWRGCGEIGTLLHCWWDWKLVQPLWKSVWRFLRDLELEIPFDPA IPLLGIYPNDYKSCCYKDTCTRMFIVALFTIAKTWNQPKCPTMIDWIKKMWHIYTMEYYA AIKNDEFMSL >gi568815575f:102502142_102703815|GENSCAN_predicted_CDS_7|1473_bp atggtagatccactgacaccttgcactgtgcacctggaaaagtcacagacactcaacaac aagcctgtgaaagcagctgggagtggggctgtactctgcaaagccacagggtcagaactg cccagggccgttggagcccacctcttgcatcagcataatccaggaagaatcaatatcgtg aaaatggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcccgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcagcacactacct 
gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatatagatcaatggaacagaacagggccctcagaaataacgccgcatacctacaac tatctgatctttgacaaacctgagaaaaacaagcaatggggaaacgattccctatttaat aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaagaatcaattcaagatggattaaagatttaaacgttagacctaaaacc ataaaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggat ttcatgtccaaaacaccaaaagcaatggcaacaaaagacaaaattgacaaatgggatcta attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaaccc acaaaatgggagaaaattttcgcaacctactcatctgacaaagggctaatatccagaatc tacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcg aaggacatgaacagacacttctcaaaagaagacatttatgcagccaaaagacacatgaaa aaatgctcatcatcactagccatcagagaaatgcaaatcaaaaccactatgagataccat ctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacaggtgctggagagga tgtggagaaataggaacacttttacactgttggtgggactggaaactagttcaaccattg tggaagtcagtgtggcgattcctcagggatctagaactagaaataccatttgacccagcc atcccattactgggtatatacccaaatgactataaatcatgctgctataaagacacatgc acacgtatgtttattgtggcattattcacaatagcaaagacttggaaccaacccaaatgt ccaacaatgatagactggattaagaaaatgtggcacatatacaccatggaatactatgca gccataaaaaatgatgagttcatgtctttgtag >gi568815575f:102502142_102703815|GENSCAN_predicted_peptide_8|632_aa MLSFSNCPLNCPLPLLTNLQRCEPVSSSLPFLQNWPALKQAYGQTDEKMLTVSCQNTPDL PRRQTLTSTWSLWEGTAPRTRGPTGPFPESGPGAANATHLPRARHRAGTLAHEGPPRRPP TASRAPPNTAPVWPFLPQEPPPQPSLGTETVEGCARAPAKLAGKRSARFSSRRSDGASGA SPSALALLWLLFSWIPTAFPAIRMAKTSPAPHASAPNWAGAAQDAARTALREGVGKGGGG IRRESAGRPPSQNHHSSDPELLQLWLKGANIQLGLWLQRVEAASLGSFHVVLSMQVYRSQ ELRFGNLCLYFRRCKKRLDAQKGNVGSEPSHRVPTEAPPSGAVRRGPPSSRPQNDRSTDG LHCVRGKATDNANPWTWMKLETIILSKLTQEQKTKHCMFSLKWELNNGNTWTQGGEHHTL GSARGFVKNQMVVGVQPYFWAPYFVPLVYVSVLVPVPCCFGYCSPYSLKSDKTQDKNHMI LSIDAEKAFDKIQHPFMLETFKKLGIEEPYCKMIRVIYDKPTANITGNGQKLKAFPLKTG TRQGCPLSTLLFNIVLEVLAKAIRQEKEIKGIQIGREEVKLPLFADNIILYLENLIIYDQ KLLELINNFSSFRIQNQHRKTNSIPVYQQHPS >gi568815575f:102502142_102703815|GENSCAN_predicted_CDS_8|1899_bp atgctctctttctccaactgtcccctgaattgccccctcccccttctcaccaacctccag 
aggtgtgagccagtctcctcctccttgcctttcttgcagaactggccagccttaaagcag gcctatggacaaacagatgaaaaaatgctcactgtgtcctgtcagaatacgccagaccta ccgcgcagacagactttgacatcaacctggagtctgtgggaagggacagcacccaggaca cggggccccacaggcccctttccggaatcagggcccggagcagcgaacgcgacccacctc ccacgggctaggcacagagctgggacgctggcccacgagggcccccccaggcgcccccca actgcctcccgggcgccccccaacacggctcccgtttggccatttctcccgcaggagcct cccccacagccgtctctcggtaccgaaaccgtcgagggctgcgcgcgggcgcctgcaaag ctggctgggaagcggtcagcgcggttctcctccagacgatctgacggtgcctctggagcc agcccctcagcccttgctctcctctggctgctcttcagttggattcccaccgctttccca gcgatccgaatggcaaagacgagtcccgccccccacgcctctgcaccaaactgggcaggg gctgcgcaggatgcggcaagaaccgctctccgcgagggtgtcgggaaggggggcggaggg atacggcgcgaaagtgcaggacgccccccctcacagaaccaccatagttccgacccagag ctgctccagctgtggctgaaaggggccaacatacagcttgggctgtggcttcagagggtg gaagccgcaagccttggcagcttccacgtggtgttgagcatgcaggtgtacagaagtcaa gaactgaggtttgggaacctctgcctatatttcagaagatgtaagaaacgcttggatgcc cagaagggaaatgtggggtcagaaccctcacacagagtccctactgaggcaccacctagt ggagctgtgagaagagggccaccatcctccagaccccagaatgatagatccactgacggc ttgcactgtgtgcgtggaaaagccacagataatgccaacccatggacatggatgaagctg gaaaccatcattctcagcaaactaacacaggaacagaaaaccaaacactgcatgttctca ctcaagtgggagttgaacaatgggaacacatggacacagggaggggaacatcacacactg gggtctgctcgggggtttgtcaaaaatcagatggtcgtaggtgtacagccttacttctgg gctccctactttgttccattggtctatgtgtctgttcttgtaccagtaccatgctgtttt ggttactgtagcccgtatagcttgaagtcagacaaaactcaagacaaaaaccacatgatt ttatcaatagatgcagaaaaggcttttgataaaattcagcaccccttcatgttagaaacc ttcaaaaaactaggcattgaagaaccatactgcaaaatgataagagtcatctatgacaaa cccacagccaacatcacagggaatgggcaaaagctgaaagcattccccttgaaaactggc acaaggcaaggatgccctctctcaacactcttgttcaatatagtattggaagttctggcc aaagcaatcagacaagaaaaagaaataaaaggcatccaaataggacgagaggaagtcaaa ctacccctgtttgcagacaacataattctgtatctagaaaaccttatcatctatgaccaa aagctccttgagctgataaacaactttagcagtttcagaatacaaaatcaacataggaaa accaatagcattcctgtgtaccaacaacatccaagctga >gi568815575f:102502142_102703815|GENSCAN_predicted_peptide_9|1395_aa 
MTGAEIESGAQVKPEKKPGEEVVGGAEIENDVPLVVRPKVRTQAQIMPGARPKNKSKVMP GASTKVETSAVGGARPKSKAKAIPVSRFKEEAQMWAQPRFGAERLSKTERNSQTNIIASP LVSTDSVLVAKTKYLSEDRELVNTDTESFPRRKAHYQAGFQPSFRSKEETNMGSWCCPRP TSKQEASPNSDFKWVDKSVSSLFWSGDEVTAKFHPGNRVKDSNRSMHMANQEANTMSRSQ TNQELYIASSSGSEDESVKTPWFWARDKTNTWSGPREDPNSRSRFRSKKEVYVESSSGSE HEDHLESWFGAGKEAKFRSKMRAGKEANNRARHRAKREACIDFMPGSIDVIKKESCFWPE ENANTFSRPMIKKEARARAMTKEEAKTKARARAKQEARSEEEALIGTWFWATDESSMADE ASIESSLQVEDESIIGSWFWTEEEASMGTGASSKSRPRTDGERIGDSLFGAREKTSMKTG AEATSESILAADDEQVIIGSWFWAGEEVNQEAEEETIFGSWFWVIDAASVESGVGVSCES RTRSEEEEVIGPWFWSGEQVDIEAGIGEEARPGAEEETIFGSWFWAENQTYMDCRAETSC DTMQGAEEEEPIIGSWFWTRVEACVEGDVNSKSSLEDKEEAMIPCFGAKEEVSMKHGTGV RCRFMAGAEETNNKSCFWAEKEPCMYPAGGGSWKSRPEEEEDIVNSWFWSRKYTKPEAII GSWLWATEESNIDGTGEKAKLLTEEETIINSWFWKEDEAISEATDREESRPEAEEGDIIG SWFWAGEEDRLEPAAETREEDRLAAEKEGIVGSWFGAREETIRREAGSCSKSSPKAEEEE VIIGSWFWEEEASPEAVAGVGFESKPGTEEEEITVGSWFWPEEEASIQAGSQAVEEMESE TEEETIFGSWFWDGKEVSEEAGPCCVSKPEDDEEMIVESWFWSRDKAIKETGTVATCESK PENEEGAIVGSWFEAEDEVDNRTDNGSNCGSRTLADEDEAIVGSWFWAGDEAHFESNPSP VFRAICRSTCSVEQEPDPSRRPQSWEEVTVQFKPGPWGRVGFPSISPFRFPKEAASLFCE MFGGKPRNMVLSPEGEDQESLLQPDQPSPEFPFQYDPSYRSVQEIREHLRAKESTEPESS SCNCIQCELKIGSEEFEELLLLMEKIRDPFIHEISKIAMGMRSASQFTRDFIRDSGVVSL IETLLNYPSSRVRTSFLENMIRMAPPYPNLNIIQTYICKVCEETLAYSVDSPEQLSGIRM IRHLTTTTDYHTLVANYMSGFLSLLATGNAKTRFHVLKMLLNLSENLFMTKELLSAEAVS EFIGLFNREETNDNIQIVLAIFENIGNNIKKETVFSDDDFNIEPLISAFHKVEKFAKELQ GKTDNQNDPEGDQEN >gi568815575f:102502142_102703815|GENSCAN_predicted_CDS_9|4188_bp atgactggggcagagattgagtctggtgcccaggtcaagcctgaaaagaagcctggggaa gaggttgtaggtggggctgagatagagaatgatgtccctctggtggtcagacccaaggtt aggacccaggcccagataatgcctggggcaaggcccaagaataagtccaaggttatgcct ggagcaagcaccaaagttgagacaagtgcagtgggtggggcacgccctaagagtaaggcc aaggcaatacctgtttcacgatttaaggaagaagcccagatgtgggctcagcccaggttt ggtgctgaaagattgtctaagacagagagaaactcccagaccaatatcatagcctctcca cttgtcagtactgattctgtcttggttgctaaaacaaagtacctgtctgaggatagagaa ctggttaatacagacactgagagctttcctagaaggaaggcccattaccaagcaggattc 
cagccttcttttaggtcaaaggaggagaccaatatggggtcctggtgctgtcctaggcct acatccaaacaagaagcctctcctaattctgatttcaaatgggtagacaaatctgtgagt tccttgttctggagtggagatgaggtcactgcaaaatttcatcctgggaatagggtaaaa gacagtaacagatccatgcacatggccaatcaagaggctaataccatgtctaggtcccaa actaaccaggagctctatattgcatctagttctggttctgaggatgagtctgttaagaca ccctggttctgggccagagataaaaccaatacctggtctgggcccagggaagatcccaat agcaggtccaggtttaggtctaagaaagaagtctatgttgaatcaagttctggatctgag catgaagaccatttggagtcctggtttggggctggaaaggaggccaaattcaggtccaaa atgagagctgggaaggaggccaataacagggccaggcacagggccaagcgagaagcttgc attgatttcatgcctgggtctatagatgtaattaaaaaagagtcctgtttctggcctgaa gaaaatgctaataccttttcaaggcccatgatcaagaaagaggccagggccagagcaatg acaaaggaagaggccaaaaccaaggcccgagccagggccaagcaagaagccaggtcagag gaggaagccctcattgggacctggttctgggctacagacgagtccagcatggcagatgaa gccagcatagagtccagtctacaagtggaggatgagtccataattgggagttggttctgg actgaagaagaggccagtatggggactggggctagcagtaaatccagaccaaggactgat ggggagcgtattggtgattccttatttggggctagggaaaagaccagtatgaaaactggg gctgaggccacctctgaatctatactagcagctgatgatgaacaggtcattattggttcc tggttctgggctggtgaagaggtcaaccaagaggctgaggaagagaccatttttgggtcg tggttctgggtcattgatgcggccagtgtggaatctggtgttggggtcagctgtgagtcc aggacaaggtctgaggaagaagaggtcattggtccctggttttggtctggagaacaagtt gatatagaggctggaatcggagaagaggccaggccaggagctgaagaagagacaatattc gggtcctggttttgggctgaaaaccagacctatatggattgtagggctgaaactagctgt gacaccatgcaaggggctgaggaggaggagcccattattgggtcctggttttggaccaga gtagaagcttgtgtggagggtgatgtcaacagcaagtctagcctggaggacaaggaagag gccatgataccatgttttggagccaaagaagaggtcagtatgaagcatgggactggtgtc agatgcagatttatggcaggggctgaggagaccaataataagtcttgcttctgggcagaa aaagaaccctgtatgtatcctgccggtggaggaagttggaagtctaggccagaggaggaa gaggacattgtcaattcgtggttctggtccagaaaatacacaaagccagaggccattata gggtcctggttatgggctacagaagagagtaatatagatgggactggagaaaaggccaag ttactgactgaagaggagaccataatcaattcctggttctggaaagaagatgaagccatt tcagaggctactgacagagaagagtccaggccagaagctgaggagggggacattattggt tcttggttctgggctggagaagaggacagactagagccagctgctgagactagagaagaa 
gacaggctagcagctgagaaagaaggtattgttgggtcctggtttggggccagagaagag accattagaagagaggctgggtcttgcagcaaatccagtcctaaagctgaagaggaagaa gtcattattgggtcctggttctgggaagaagaggccagtccggaggcagtggcaggagtc ggctttgagtcaaagcctgggactgaggaggaagaaatcactgttgggtcctggttctgg cctgaagaagaagccagtatacaggctggatctcaggcagtagaggaaatggagtcagag actgaagaggaaaccatttttgggtcctggttctgggatggaaaagaagtcagtgaagaa gcaggaccatgctgtgtatccaagccagaggatgatgaagagatgattgttgagtcctgg ttctggtctagagacaaagccattaaggaaactggaactgtggccacctgtgagtccaag ccagaaaatgaggaaggggccattgttgggtcttggtttgaggctgaagatgaggtagat aacaggactgacaatggaagcaactgtgggtccaggacattagctgatgaagatgaggcc atagtggggtcctggttctgggcaggagatgaggcccattttgaatcaaatcctagcccc gtgttcagggccatttgcaggtccacgtgttcagttgaacaggagcctgatccttcacgc aggcctcagagttgggaggaggtcactgttcagttcaagcctggtccatggggtagggtc ggcttcccatctataagcccctttagatttccgaaagaggcagcatctttattctgtgaa atgtttgggggcaaacccaggaacatggtacttagcccagaaggggaagatcaggaatct ttgcttcagcctgatcagcctagtcctgagttcccatttcagtatgatccttcctacagg tcagtccaggaaattcgagagcatcttagggccaaggagagtacagagcctgagagttca tcctgtaactgcatacaatgtgagctgaaaattggttctgaagagtttgaagaactcctt ttattaatggaaaaaattcgggatccttttattcatgaaatatctaaaatcgcaatgggt atgagaagtgcttctcaatttacccgagatttcattcgagattcaggtgttgtctcactt attgaaaccttgcttaattatccgtcctcccgagttagaacaagttttttggaaaatatg attcgcatggccccaccttatccgaatctaaacataattcagacatacatatgtaaagtg tgtgaggaaacccttgcttatagcgtggattccccggaacagctgtctggaataaggatg attagacatctcactactactactgactatcacacactggttgccaattatatgtctggg tttctctccttattagctacaggcaatgccaaaacaaggtttcatgttttgaaaatgcta ctgaatttgtctgaaaatcttttcatgacaaaagaactactcagtgctgaagcagtgtca gaatttataggcctctttaacagggaagagacaaatgacaatattcaaattgttcttgca atatttgagaatattggcaacaatatcaaaaaagaaacagtgttctctgatgatgatttc aatattgagccgcttatttctgcattccacaaagttgagaaatttgctaaggaactgcaa ggcaaaacagacaatcaaaatgaccctgaaggggaccaagaaaattag >gi568815575f:102502142_102703815|GENSCAN_predicted_peptide_10|276_aa MHFLVEETIPQPTASGLRRSVFCPGAQEAETLPALGVREEKQLLLLEEVAAPCNQVPLLQ 
VCCSLLGVHSRRCSPGYHQWSLQNSKDCCLFLPLEALSQRDTDLMPARTLLYEVSGYPCW EVSPSQEAWDQEEAVWLPLSGAGVLCLGTPPCQQAGKRTWSSSADSRLLCCPAGIPSQWV LACGILWLWDLLSKATWLSDFSPLSMGVDGSSATLEFWEPPEYVKTPAAQCLPQLLLTGA AAVGLPSFVFGTQGPGGISTQRNLLICRLQKSVGKV >gi568815575f:102502142_102703815|GENSCAN_predicted_CDS_10|831_bp atgcatttcttggtagaggaaactatccctcagccaaccgctagcggtcttagacggtct gtcttttgtccaggagctcaagaagcagagactctacctgccctaggcgttcgtgaagag aagcagctgttgctattggaggaggtagcagccccatgcaatcaggtccctcttctgcag gtctgctgcagtttgctgggggtccactccagacgttgttcacctgggtatcaccagtgg agcctgcagaacagcaaagattgctgcctgttccttcctctggaagctttgtcccagagg gataccgacctgatgccagccagaactctcctgtatgaggtgtctggctacccctgttgg gaggtctcacccagtcaggaggcatgggatcaggaggaagctgtctggcttccgcttagt ggagctggcgtgctgtgcttggggactcccccttgtcagcaggcaggaaagagaacttgg tcatcttccgcagactccaggctgttgtgctgtccagcagggattccaagccagtgggtc ttagcttgtgggattctgtggttgtgggacctactgagcaaggctacttggctctctgac ttcagccccctttccatgggagtggacggctcttctgccaccctggagttctgggagcca ccagagtatgtaaaaactcctgcagctcagtgcctgccccaactgctactgactggtgca gctgccgtaggtctgcccagttttgtgtttgggacccaaggccctggtggtataagcaca caaaggaatcttctgatctgcagattgcagaaatccgtgggaaaagtgtag >gi568815575f:102502142_102703815|GENSCAN_predicted_peptide_11|117_aa MPEEAQMPDLLDRDVKTVILNMLKMLKEDMNNDSKMVYVQNGSINKDIEIMNKERKGNSG AERKHVWVMALITIKAQPAVAQKRLSNDQFKYLNIEEEQSLNGLERLVKNSCQDFFK >gi568815575f:102502142_102703815|GENSCAN_predicted_CDS_11|354_bp atgcctgaggaagcacagatgccagacttactagacagagatgttaaaactgtgatctta aatatgctcaaaatgctaaaggaagacatgaataatgacagtaaaatggtgtatgtacaa aatggcagtatcaataaagatatagaaattatgaataaagaacgcaaaggaaattctgga gctgaaaggaaacatgtttgggtaatggctttgatcacaataaaggcccagcctgcagtg gcacagaaacgcttatccaatgatcaattcaagtacctgaacatcgaagaggagcagagt ttaaatggccttgagagacttgtgaaaaatagctgtcaagacttctttaagtag