GENSCAN 1.0	Date run: 7-Nov-116	Time: 16:35:45

Sequence gi568815596f:165474359_165779750 : 305392 bp : 37.04% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   3912   3951   40                              -3.75
 1.01 Sngl +  12480  13496 1017  2  0   88   43   740 0.994  66.27
 1.02 PlyA +  13724  13729    6                               1.05

 2.00 Prom +  13893  13932   40                              -6.15
 2.01 Init +  13986  17142 3157  2  1   44   60   795 0.548  64.10
 2.02 Intr +  20401  20570  170  2  2  120   81   206 0.939  21.84
 2.03 Intr +  23628  23734  107  2  2   90   97    48 0.341   3.99
 2.04 Intr +  57410  58699 1290  2  0  -13   31   464 0.039  16.96
 2.05 Intr + 117502 117772  271  1  1   29   39   171 0.083   2.92
 2.06 Intr + 119936 119996   61  1  1   49   82    46 0.233  -2.51
 2.07 Intr + 120685 120855  171  2  0  127   86   177 0.058  20.39
 2.08 Intr + 139392 139467   76  1  1   76   95    20 0.002  -0.85
 2.09 Intr + 148956 149145  190  0  1   44   53   158 0.009   6.57
 2.10 Intr + 150864 150936   73  2  1   60   92    57 0.014   1.36
 2.11 Intr + 152276 152398  123  0  0   64   35    88 0.020   0.74
 2.12 Intr + 183403 183662  260  2  2   93   93   217 0.890  18.96
 2.13 Intr + 201954 202250  297  2  0  106  110   386 0.928  38.75
 2.14 Term + 204343 205395 1053  0  0   83   39   992 0.309  84.94
 2.15 PlyA + 206059 206064    6                               1.05

 3.07 PlyA - 207075 207070    6                               1.05
 3.06 Term - 213659 213441  219  2  0   52   41   239 0.408  11.66
 3.05 Intr - 224539 224457   83  2  2   42   72    83 0.001   0.54
 3.04 Intr - 229444 229348   97  1  1   77   87    75 0.004   4.96
 3.03 Intr - 232999 232892  108  1  0   39   98    53 0.003   0.96
 3.02 Intr - 235467 234009 1459  2  1  110   11   361 0.004  19.36
 3.01 Init - 238169 235885 2285  2  2   63  -19  1035 0.831  77.84
 3.00 Prom - 249972 249933   40                              -1.25

 4.02 PlyA - 251796 251791    6                               1.05
 4.01 Sngl - 260924 258123 2802  2  0   49   47   976 0.862  82.15
 4.00 Prom - 261017 260978   40                              -6.15

 5.02 PlyA - 261186 261181    6                               1.05
 5.01 Sngl - 262430 262101  330  2  0   88   44   337 0.944  25.07
 5.00 Prom - 265287 265248   40                              -5.45

 6.12 PlyA - 265478 265473    6                               1.05
 6.11 Term - 266026 265796  231  1  0   53   37   188 0.656   5.69
 6.10 Intr - 271719 271634   86  1  2  -10   99    78 0.184  -2.28
 6.09 Intr - 273146 273023  124  2  1   47   78   114 0.480   5.54
 6.08 Intr - 275536 275384  153  1  0   65   77   161 0.903  12.05
 6.07 Intr - 280705 280574  132  1  0   83   83    70 0.982   5.92
 6.06 Intr - 282889 282689  201  1  0   54  108   137 0.912  10.86
 6.05 Intr - 284506 284389  118  0  1   53   89    62 0.903   2.35
 6.04 Intr - 285212 284978  235  0  1   95   60   165 0.926  10.22
 6.03 Intr - 287696 287547  150  0  0   92   93    57 0.979   5.81
 6.02 Intr - 290698 290526  173  0  2   99   95   102 0.993  10.66
 6.01 Init - 296177 295768  410  2  2   70   72   334 0.422  26.07

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 120685 120866  182  2  2  127   44   233 0.927  19.59
S.002 Term - 143554 143415  140  2  2   41   49   155 0.882   4.04
S.003 Intr - 144010 143863  148  1  1    2  101   115 0.862   3.09
S.004 Init - 149015 148866  150  2  0   40   81   120 0.806   6.59
S.005 Term + 229330 229489  160  1  1   89   45   179 0.986  10.13

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:165474359_165779750|GENSCAN_predicted_peptide_1|338_aa
MGKKQNRKTGNSKTQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSDYSELREDIQ
TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSA
MEDEMNEMKQEGKFREKRIKRKEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD
IIQENFPNLARQANVQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRV
TLKGKPIRLTADLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFTDK
QMLRDFVTTRPALKELLKEVLNMERNNRYQPLQNHAKM

>gi568815596f:165474359_165779750|GENSCAN_predicted_CDS_1|1017_bp
atggggaaaaaacagaacagaaaaactggaaactctaaaacgcagagcgcctctcctcct
ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagaatgattttgac
gagctgagagaagaaggcttcagacgatcagattactctgagctacgggaggacattcaa
accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata
accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta
cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagca
atggaagatgaaatgaatgaaatgaagcaagaagggaagtttagagaaaaaagaataaaa agaaaggagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagtgatggggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatctagcaaggcaggccaatgttcagattcaggaaata cagagaacgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaagggtcgggtt accctcaaagggaagcccatcagactaacagcggatctctcggcagaaaccctacaagcc agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt tcatatccagccaagctaagcttcataagtgaaggagaaataaaatactttacagacaag caaatgctgagagattttgtcaccaccaggcctgccctaaaagaactcctgaaggaagtg ctaaacatggaaaggaacaaccggtaccagccgctgcaaaatcatgccaaaatgtaa >gi568815596f:165474359_165779750|GENSCAN_predicted_peptide_2|2432_aa MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS KIDHIVGSKTLLSKCKRTEIITNYLSDHSAMKLELRIKNLTQSRSTTWKLNNLLLNDYWV HNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIK RKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYTNKLENLEEMNTFLNTYTLPRLN QEEVESLNRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSTEKE GILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIQKLIHHD QVGFIPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDAEKDFDKIQQPFMLKTLNKLGI DGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQE KKIKGIQLGKEEVKLSLFADDMIVYLEIPIVSAQNLLKLISNFSKVSGYKINVQKSQAFL YTNNRQTESQIMSELPFTIASKRIKNLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKKI PCSWVGRINILKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILS QKNKAGGITLPDFKLYYKATVTKTAWYLYQNRDIDQWNRTEPSEITPHTYNYLIFDKPEK NKQWGKDSLFNKWCWENWLAICRKLKLDPFLTLYTKINSRWIKDLNVRPKSIKTLEENLG ITIQDIGVGKDFMSKTPKPMATKAKIDKWDLFKLKSFCTAKETTIRVNRQPTTWEKIFAT YSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAIR EMQIKTTMRYHLTPVRMAIIKKSGNNRCWRGCGEIGTLLHCWWDCKLVQPLWKSVWRFLR DLELEIPFDPAIPLLGIYPNDYKSCCYKDTCTRTCDSVAAMSGILKGKFEEVNGSSPCSS VQESDDEVFSCDSTESVDSVNRSVLMILPGSASDEFMNKFNELNEIVGREFNLSVKEHFT SPFCKSWFFEKINKIDTPLARLIKKKREKTQIDAIKNDKGDITTDPTETQTTIREYYKHL 
YTNKLENLEEVDKFLDTYILPRRNQEEVESLNRPITGSEIEATINSLPTKKSPGPDGFTA EFYQRYKEELVPFLLKLFQSREKEGILPNSFYEASIILIPKPGRDTTKKGNFRPISLMNI DAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNICKSINVIQHINRTKDKNHMIIS IDAEKAFDKIQQQFMLKTFNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEGFPLKTGTR QGCPLSPLLFNIVLEVLARVIRQEKEIKDIQLGKEEVKLSLFADDMIVYLENPIISAQNL LKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSEFPFTIASKRIKYLGIQLTRDVK ILIFPKMIFQGRIIWAVQKGSLGLEPPHRVPTGALPSGTVRRGLLSSRTQNGRSTDSLHH APGKATNTQCQPVKAARRGAVPCKATGAELPKAVGDHLLHQHALDKVCAPYLTVPVHGKM EKVEKGTCDSTAAMSGILKRKFEEVDGSSPCSSVRESDDEVSSSESADSGDSVNPSTSSH FTLVYVCVLMPVPHYFGYYSFVVDFEVRSVGALDSRGSTDPIVNRSCMGSRLRAPYENLM INVMHLNYPETIPPQSVEKLSCTKPVPGAKKVLDMAGSETGTTRFSKNQHLGKETGQYRL PVAATSCSATFHGCCPLLLVPSGQMTSPQCGPPFLIASSILKREKRLRTKNVHFSCVTVY YFTRRQGFTSVPSQGGSTLGMSSRHNSVRQYTLGEFAREQERLHREMLREHLREEKLNSL KLKMTKNGTVESEEASTLTLDDISDDDIDLDNTEVDEYFFLQPLPTKKRRALLRASGVKK IDVEEKHELRAIRLSREDCGCDCRVFCDPDTCTCSLAGIKCQVDRMSFPCGCTKEGCSNT AGRIEFNPIRVRTHFLHTIMKLELEKNREQQIPTLNGCHSEISAHSSSMGPVAHSVEYSI ADSFEIETEPQAAVLHLQSAEELDCQGEEEEEEEDGSSFCSGVTDSSTQSLAPSESDEEE EEEEEEEEEEDDDDDKGDGFVEGLGTHAEVVPLPSVLCYSDGTAVHESHAKNASFYANSS TLYYQIDSHIPGTPNQISENYSERDTVKNGTLSLVPYTMTPEQFVDYARQAEEAYGASHY PAANPSVIVCCSSSENDSGVPCNSLYPEHRSNHPQVEFHSYLKGPSQEGFVSALNGDSHI SEHPAENSLSLAEKSILHEECIKSPVVETVPV >gi568815596f:165474359_165779750|GENSCAN_predicted_CDS_2|7299_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagtcaac aaggatacccaggaattgaactcagctctgcaccaagcagacctaatagacatctacaga actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatagttggaagtaaaactctcctcagcaaatgtaaaagaacagaaatt ataacaaactatctctcagaccacagtgcaatgaaactagaactcaggattaagaatctc actcaaagccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacaca acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaatcaatgaatcc 
aggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaag agaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggggatatcaccact gatcccacagaaatacaaactaccatcagagaatactacaaacacctctacacaaataaa ctagaaaatctagaagaaatgaatacattcctcaacacatacaccctcccaagactaaac caggaagaagttgaatctctgaatagaccaataacaggagctgaaattgtggcaataatc aatagtttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccag aggtacaaggaggaactggtaccattccttctgaaactattccaatcaacagaaaaagag ggaatcctccctaactcattttatgaggccagcatcattctgataccaaagccgggcaga gacacaaccaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatc ctcaataaaatactggcaaaccgaatccagcagcacatccaaaagcttatccaccatgat caagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaatcaataaatgta atccagcatataaacagagccaaagacaaaaaccacatgattatctcaatagatgcagaa aaagactttgacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtatt gatgggacgtatttcaaaataataagagctatctatgacaaacccacagccaatatcata ctgaatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccct ctctcaccgctcctattcaacatagtgttggaagttctggccagggcaatcaggcaggag aagaaaataaaaggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagat gacatgattgtttatctagaaatccccatcgtctcagcccaaaatctccttaagctgata agcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattctta tacaccaacaacagacaaacagagagccaaatcatgagtgaactcccattcacaattgct tcaaagagaataaaaaacctaggaatccaacttacaagggatgtgaaggacctcttcaag gagaactacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaaaatt ccatgctcatgggtaggaagaatcaatatcttgaaaatggccatactgcccaaggtaatt tacagattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaa actactttaaagttcatatggaaccaaaaaagagcccgcatcgccaagtcaatcctaagc caaaagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggctaca gtaaccaaaacagcatggtacttgtaccaaaacagagatatagatcaatggaacagaaca gagccctcagaaataacgccgcatacctacaactatctgatctttgacaaacctgagaaa aacaagcaatggggaaaggattccctatttaataaatggtgctgggaaaactggctagcc atatgtagaaagctgaaactggatcccttccttacactttatacaaaaatcaattcaaga tggattaaagatttaaacgttagacccaaaagcataaaaaccctagaagaaaacctaggc attaccattcaggacataggcgtgggcaaggacttcatgtccaaaacaccaaaaccaatg 
gcaacaaaagccaaaattgacaaatgggatctatttaaactaaagagcttctgcacagca aaagaaactaccatcagagtgaacaggcaacctacaacatgggagaaaattttcgcaacc tactcgtctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttacaag aaaaaaacaaacaaccccatcaaaaagtgggcgaaggacatgaacagacacttctcaaaa gaagacatttatgcagccaaaaaacacatgaaaaaatgctcatcatcactggccatcaga gaaatgcaaatcaaaaccactatgagatatcatctcacaccagttagaatggcaatcatt aaaaagtcaggaaacaacagatgctggagaggatgtggagaaataggaacacttttacac tgttggtgggactgtaaactagttcaaccattgtggaagtcagtgtggcgattcctcagg gatctagaactagaaataccatttgacccagccatcccattactgggtatatacccaaat gactataaatcatgctgctataaagacacatgcacacgtacatgtgacagcgttgcagct atgagtggaattttaaaggggaagtttgaagaagtcaacggctcctcaccctgctcttca gtgcaggaatcagatgatgaagttttcagctgtgacagtactgagagtgttgatagtgtc aatcgttcagttttaatgattttaccaggcagtgctagtgatgagtttatgaacaaattc aatgagctaaatgaaatagtgggccgagaattcaatttgagtgtgaaagaacattttact tcccctttctgtaagagctggttttttgaaaagatcaacaaaattgatacaccgctagca agactaataaagaagaaaagagagaagactcaaatagacgcaataaaaaatgataaaggg gatatcaccaccgatcccacagaaacacaaactaccatcagagaatactataaacacctc tacacaaataaactagaaaatctagaagaagtggataaattcctcgacacatacatcctc ccaagacgaaaccaggaagaagttgaatctctgaatagaccaataacaggctctgaaatt gaggcaacaatcaatagcttaccaaccaaaaaaagtccaggaccagatggattcacagcc gaattctaccagaggtacaaagaggagctggtaccattccttctgaaactattccaatca agagaaaaagagggaatcctccctaactcattttatgaggccagcatcatcctgatacca aagcctggcagagacacaaccaaaaaagggaattttagaccaatatccttgatgaacatc gatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagctt atccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaacatatgcaaa tcaataaatgtaatccagcatataaacagaaccaaagacaaaaaccacatgattatctca atagatgcagaaaaggcctttgacaaaattcaacaacaattcatgctaaaaactttcaat aaattaggtattgatgggacgtatctcaaaataataagagctatctatgacaaacccaca gccaatatcatactgaatgggcaaaaactggaaggattccctttgaaaactggcacaaga cagggatgccctctctcaccgctcctattcaacatagtgttggaagttctggccagggtg atcaggcaggagaaggaaataaaggatattcaattaggaaaagaggaagtcaaattgtcc ctgtttgcagatgacatgattgtatatctagaaaaccccatcatctcagcccaaaatctc 
ctcaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatca caagcattcttatacaccaataacagacaaacagagagccaaatcatgagtgaattccca ttcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaa attcttattttccctaaaatgatttttcaaggcaggataatatgggcagtgcagaaggga agtttggggttggagcccccacacagagtccccactggggcactgcctagtggaactgtg agaagagggctgctgtcctccagaacccaaaatggaagatccactgacagcttgcaccat gcacctggaaaagccacaaacactcaatgtcagcccgtaaaagcagccaggaggggagct gtaccctgcaaagccacaggggcagagttgcccaaggctgtgggagaccacctcttgcat cagcatgccctggataaagtttgtgctccttacttgactgtgccagttcatggtaaaatg gagaaggtggagaaaggtacatgtgacagcactgcagcgatgagtggaattttaaagagg aagtttgaagaagttgacggctcctcaccctgctcctctgtgagggaatcagatgatgaa gtttccagcagtgaaagtgctgacagtggggacagtgtcaatccatccacttctagtcat tttaccctggtctatgtgtgtgttttgatgccagtaccacactattttggttactacagc tttgtagtggattttgaagtcagatcagtgggggcattagattctcgtgggagcacagat cctattgtgaaccgctcatgcatgggatctaggttgcgtgctccttatgagaatctaatg ataaacgtaatgcacttgaattatcccgaaaccatccctccccagtccgtggaaaaattg tcttgcacaaaaccagtccctggtgccaaaaaggtattggatatggcagggagtgagaca ggcactacccggttctcaaagaaccaacatttgggtaaggagacaggtcagtacagactg cctgttgctgctacctcgtgttctgccacctttcatggctgctgccctctcctcttggta cccagtggccaaatgacttcaccccagtgtggaccaccctttctcatagcttcctccatt ctcaaaagggagaaacgactgaggacaaagaatgtacattttagttgtgtcaccgtgtac tacttcaccaggaggcaaggcttcacaagtgtgcccagtcaagggggaagcaccctgggg atgtccagccgccataacagcgtgcgccagtacactcttggcgagtttgcaagggagcag gagaggctccaccgggagatgttgagagaacaccttagggaggaaaagctgaactcctta aaactaaagatgactaagaatggcacagtagaatcagaagaagccagcactcttacactg gatgacatttctgatgatgacattgacctggacaacacagaggtagatgagtacttcttc ctacaacctttgccaacaaaaaaacgaagagctctgctgcgtgcctctggagtgaaaaag attgacgtggaagaaaagcacgaactccgagccatccgcctctcacgagaggactgtggc tgtgactgccgagtgttctgtgatccagacacgtgcacctgcagcctggctggcattaag tgccaggtggatcgtatgtctttcccatgcggctgcactaaagaaggatgtagtaacaca gcaggtagaattgaatttaatcctatccgtgttcggactcactttttgcacacaataatg aaacttgaactggagaaaaaccgagagcagcaaatccccacgctgaatggctgccacagt 
gagataagtgctcacagtagttctatgggccctgtcgctcactccgtagaatattcaatc gcagacagttttgagattgaaactgagccccaggctgcagtgctgcacctgcagtcggct gaagaattagattgccaaggagaggaggaggaagaagaggaggatgggagcagcttttgc agcggagtcacagattctagcacgcaaagcttggcacctagtgagtcagacgaggaggag gaggaagaagaagaggaagaggaggaggaggatgacgatgatgacaaaggagatggcttc gtggaaggtttgggcacccatgccgaagttgtccctcttccttcagttctttgttattct gatggcaccgccgttcacgaaagccatgcaaagaatgcttctttttatgccaactcttca actctgtattaccaaatagatagccacattccaggaactccaaatcagatctctgagaac tattctgaaagagacactgtcaaaaatggtaccctttcgctggtgccttacaccatgacc ccggagcaattcgttgactatgcccgacaagcagaagaggcctatggtgcctcccactac ccagctgccaacccctctgtaatcgtttgctgctcctcttccgaaaatgatagcggtgtg ccctgcaatagtttatatcctgaacacaggtccaatcaccctcaagtggaatttcactca tacttgaaaggcccctcccaagaagggtttgtctctgcattgaatggtgacagtcacatt tcagagcatcctgctgaaaattctttgagccttgcagaaaagagcatattgcatgaagag tgcatcaaatcacccgtggttgagacagtccctgtttag >gi568815596f:165474359_165779750|GENSCAN_predicted_peptide_3|1416_aa MVGYFRLWIPGFAILTKPLYKLTKGNLADPIDPKSFPHSSFCSLKTALETAPTLALPDSS QPASLHTAEVQGCAVGILTQGPGSRPVAFLSKQLDLTVLGWPSCLRAAAAAALILLEALK ITNYAQLTLYSSHNFQNLFSSSHLTHILSAPGLLQLYLHFVESPTITIVPGPDFNPASHI IPDTTPDPHDCISLIHLTFTPFPHISFFPVSHPDHTWFIDGSSTRPNRHSPAKAGYAIVS STSIIEATALPPSTTSQQAELVALTQALTLAKGLCVNIYTDSKYAFHILHHHAVIWAERG FLTTQGSSIINASLIKTLLKAALLPKEAGVIHCKGHQKASDPIALGNAYVDKVARQAASS PTSVPHGQFFSFTSVTPTYSPTETSTYQSLPTQGKWFLDQGKYLLPASQAHSILSSFHNL FYVGYKPLARLLEPLISFPSWKSILKEITSQCSICYSTTPQGLFRPPPFPTHQARGFAPA QDWQIDFTHMPRVRKLKYLLVWVDTFTGWVEAFPTGSEKSTVVISSLLSDIIPRFGLPTS IQSDNGPAFTSQITQVVSQALGIQWNLHTPYRPQSSGKVERTNGLLKAHLTKFSLQLTKD WTALLPLALLKIRACPRDATGYSPFELLYGRTFLLGPNLIPDTSPLGDYLPVLQQARQEI RQAANLLLPTPDPQPYEDNLAGRSVLVKNLTPQTLQPRWTGPYLVIYSTPTAVCLQNPPH WVHHSRIKLCPSDSQPNPSSSSWKSQVLSPTSLKLTHISEEHYLHHTINLTHSLLAASNP SLVNNCWLCISLSSSAYTAVPALQTDWATSPISLHLRTSFNSPHLYPPEELIYFLDRSSK TSPDISHQQAAALLRTYLKNLSPYINSTPPIFGPLTTQTTIPVAAPLCISWQRPTGIPLG NLSPSRCSFTLHLRSPTTNINETIGAFQLHITDKPSINTDKLKNISSNYCLGRHLPCISL 
HPWLSSPCSSDSPPRPSSCLLIPSPENNSERLLVDTRRFLIHHENRTFPSTQLPHQSPLQ PLTAAALAGSLGVWVQDTPFSTPSHLFTLHLQFCLAQGLFFLCGSSTYMCLPANWTGTCT LVFLTPKIQFANGTEELPVPLMTPTQQKRVIPLIPLMVGLGLSASTVALGTGIAGISTSV MTFRSLSNDFSASITDISQTLSVLQAQVDSLAAVVLQNRRGLDLLTAEKGGLCIFLNEEC CFYLNQSGLVYDNIKKLKDRAQKLANQASNYAEPPWALSNWMSWVLPILCRPVLLAPFSP LGKASVRHGTDSDKSNILAKFFLKGPYDNIVTTCITKITLLLKILHHICKVPFVVEGNLE INQVASALPPPTSLQDLGRSARWPCYLWGVTTLMWLSFHKSIRVWEICSCTKIEKNPMEK GNIDFGGVPHLLVSSDSELGVNSEQIKKALKAPGSA >gi568815596f:165474359_165779750|GENSCAN_predicted_CDS_3|4251_bp atggttggatactttcgcctttggatacctggttttgccatcctaacaaaaccattatat aaactcacaaaaggaaacttagctgaccccatagatcctaaatcctttccccactcctct ttctgttccttgaagacagctttagaaactgcccccactctagctctccctgactcatcc caacctgcttcattacacacagccgaagtgcagggctgtgcagtcggaattcttacacaa ggaccaggatcgcgtcctgtagcctttttgtccaaacaacttgatcttactgttttaggc tggccatcatgtctccgtgcagccgctgctgctgccctaatacttttagaggccctcaaa atcacaaactatgctcaactcactctctacagctctcataatttccaaaatctattttct tcctcacacctgacacatatactttctgctcccggactccttcagctatacttacacttt gttgagtctcccacaattaccattgttcctggcccggacttcaatccggcctcccacatt attccggataccacacctgaccctcatgactgcatctctctgatccacctgacattcacc ccatttccccacatttccttcttccctgtttctcaccctgatcacacttggtttattgat ggcagttccaccaggcctaatcgccactcaccagcgaaggcaggctatgctatagtatct tccacatctatcattgaggctactgctctgcctccctccactacctctcagcaagctgaa ctagttgccttaactcaagccctcactcttgcaaaaggactatgcgtcaatatctatact gattctaaatatgccttccatatcctgcaccaccatgcagtcatatgggctgaaagaggt ttcctcactacacaagggtcctccatcattaatgcctctttaataaaaactctgctcaaa gccgctttacttccaaaagaagctggggtcattcactgcaaggggcatcaaaaggcatca gatcccattgctctaggcaatgcttatgttgataaagtggctagacaagcagctagctct ccaacttctgtccctcacggccagtttttctccttcacttcagtcactcccacctactcc cccactgaaacttccacctatcaatctcttcccacacaaggcaaatggttcttagaccaa ggaaaatatcttcttccagcctcacaggcccattctattctgtcatcatttcataacctc ttctatgtaggttacaagccgctagcccgtctcttagaacctctcatttcctttccatca tggaaatctatcctcaaggagatcacttctcagtgttccatctgctattctactacccct cagggattgttcaggcctcctcccttccctacacatcaagctcggggatttgcccctgcc 
caggactggcaaattgactttactcacatgcctcgagtcagaaaactaaaatatctctta gtctgggtagacactttcactggatgggtagaggccttccccacagggtctgagaagtcc actgtggtcatttcttcccttctgtcagacataattcctcggtttggccttcccacctct atacagtctgataacggaccagcctttactagccaaatcacccaagtagtttctcaggct cttggtattcagtggaaccttcataccccttaccgtcctcaatcttcaggaaaggtggaa cggactaatggtcttttaaaggcacacctcaccaagtttagcctccaacttacaaaggat tggacagcacttttacctcttgctcttctcaaaatcagagcctgtcctcgagatgctaca ggatacagtccatttgaacttttatatggacgcactttcttgcttggccccaacctcatc ccagacaccagccctctaggcgactatcttccagtactccagcaggctagacaggaaatt cgccaggctgctaatcttctcttgcctactccagatccccagccatatgaagacaaccta gctggacgatcagttcttgttaagaatctgacccctcaaactctacaacctcgatggacc ggaccctacttagtcatctatagtaccccgactgccgtctgcctgcagaatcctccccac tgggttcaccattccagaataaagctgtgtccatcggacagccagcctaatccctcctct tcctcctggaagtcgcaagtactctcccccacttcccttaaactcactcatatttctgaa gaacactatctccaccacactatcaaccttacccattctctcctagccgcttctaatccc tccttagtgaacaactgctggctttgcatttccctttcttccagtgcctacacagctgtc cccgccttacagacagactgggcaacatctcccatctccctacacctccgaacttccttt aacagccctcacctttaccctcctgaagaactcatttactttctagacaggtccagcaag acttccccagacatttcacatcagcaagctgccgccctccttcgcacttatttaaaaaac ctttctccttatatcaactctactccccccatatttggacctctcacaacacaaactact attcctgtggccgctcctttgtgtatctcttggcaaagacccactggaattcccctaggt aatctttcaccttctcgatgttcctttactcttcatctccgaagtccaactacaaacatc aatgaaacaattggagccttccagctccatattacagacaagccctctatcaatactgac aaacttaaaaacattagcagtaattattgcttaggaagacacttgccctgtatttcactc catccttggctatcttccccttgctcatcagactctcctcccaggccctcttcttgttta cttatacccagccccgaaaataacagtgaaagattgctcgtagatactcgacgttttctc atacaccatgaaaatcgaaccttcccctctacgcagttaccccatcagtccccattacaa cctctgacagctgccgccctagctggatccctaggagtctgggtacaagacacccctttc agcactccttctcacctttttactttacatctccagttttgcctcgcacaaggtctcttc ttcctctgtggatcctctacctacatgtgcctacctgccaattggacaggcacatgtaca ctagtcttccttacccccaaaattcaatttgcaaatgggaccgaagagctccctgttccc ctcatgacaccgacacaacaaaaaagagttattccactaattcccttgatggtcggttta 
ggactttctgcctccactgttgctctcggtactggaatagcaggcatttcaacgtctgtc atgaccttccgtagcctgtctaatgacttctctgctagcatcacagacatatcacaaact ttatcagtcctccaggcccaagttgactctttagctgcagttgtcctccaaaaccgccga ggccttgacttactcactgctgaaaaaggaggactctgcatattcttaaatgaggagtgt tgtttttacctaaatcaatctggcctggtgtatgacaacattaaaaaactcaaggataga gcccaaaaacttgccaaccaagcaagtaattacgctgaacccccttgggcactctctaat tggatgtcctgggtcctcccaattctctgtaggcctgtattactggctccattttcccca cttgggaaagcctcagtgagacatgggacagactcagataagtcaaacatcttggccaag tttttcctgaagggtccttatgataacattgtgaccacctgcataaccaaaataaccctc cttctcaagatccttcatcacatctgcaaagtcccttttgtcgttgaaggaaacttagaa attaaccaggttgcctcagcgttacctcctcctacttctcttcaggatctaggacgttct gcccgttggccatgttatttgtggggagtaaccacactgatgtggctgtcatttcacaag agcatccgagtttgggagatatgcagctgtaccaagattgaaaaaaatcccatggagaaa ggaaacattgatttcgggggtgttcctcatcttctggtgagttctgactctgagttgggt gtcaactctgagcagataaaaaaggctttgaaagctccagggagtgcttga >gi568815596f:165474359_165779750|GENSCAN_predicted_peptide_4|933_aa MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQVDLIDIYRTLHPKSTEYTFFSAPHHTYS KIDHVVGSKALLSKCKRTEIITNYLSDHSAMKLELRIKKLTQSRSTTWKLNNLLLNDYWV HNKMKAEIKMFFETNKNKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIK KKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLN QEEVESLNRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKE GILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANQIQQHIKKLIHHD QVGFIPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDAEKDFDKIQQPFMLKTLNNLGI DGTYFKIIRAIYDKPTANIMLNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQE KEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFL YTNNRQTESQIMSEVPFTIASKRIKYLGIQLTRDVKDLFEENYKPLLKEIKEDTNKWKNI PCSWVGRINIVKMAILPKVIYRFNAIAIKLPMTFFTELEKTTLKFMWNQKRACIAKSILS QKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITRHTYNCLIFDKSEK NKKWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLG ITIQDIGMGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTARETTIRVNRQPTTWEKIFAT YSSDKGLISRIYNELNQIYKKKKTTPSKSGRRT >gi568815596f:165474359_165779750|GENSCAN_predicted_CDS_4|2802_bp 
atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagtcaac aaggatacccaggaattgaactcagctctgcaccaagtggacctaatagacatctacaga actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacgtagttggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt ataacaaactatctctcagaccacagtgcaatgaaactagaactcaggattaagaaactc actcaaagccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacaaaatgaaggcagaaataaagatgttctttgaaaccaacaagaacaaagacaca acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaatcaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaag aaaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggggatatcaccacc gatcccacagaaatacaaactaccatcagagaatactacaaacacctctatgcaaataaa ctagaaaatctagaagaaatggataaattcctcgacacatacactctcccaagactaaac caggaagaagttgaatctcttaatagaccaataacaggagctgaaattgtggcaataatc aatagcttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccag aggtacaaggaggaactggtaccattccttctgaaactattccaatcaatagaaaaagag ggaatcctccctaactcattttatgaggccagcatcattctgataccaaagccaggcaga gacacaacaaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatc ctcaataaaatactggcaaaccaaatccagcagcacatcaaaaagcttatccaccatgat caagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaatcaataaatgta atccagcatataaacagagccaaagacaaaaaccacatgattatctcaatagatgcagaa aaagactttgacaaaattcaacaacccttcatgctaaaaactctcaataacttaggtatt gatgggacgtatttcaaaataataagagctatctatgacaaacccacagccaatatcatg ctgaatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccct ctctcaccactcctattcaacatagtgttggaagttctggccagggcaattaggcaggag aaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagat gacatgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaagctgata agcaacttcagcaaagtctcaggatataaaatcaatgtacaaaaatcacaagcattctta tacaccaacaacagacaaacagagagccaaatcatgagtgaagtcccattcacaattgct tcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcgag 
gagaactacaaaccactgctcaaggaaataaaagaggatacaaacaaatggaagaacatt ccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatt tacagattcaatgccatcgccatcaagctaccaatgactttcttcacagaattggaaaaa actactttaaagttcatgtggaaccaaaaaagagcctgcatcgccaagtcaatcctaagc caaaagaacaaagctggaggcatcacactacctgacttcaaactttactacaaggctaca gtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaaca gagccctcagaaataacacgacatacctacaactgtctgatctttgacaaatctgagaaa aacaagaaatggggaaaggattccctatttaataaatggtgctgggaaaactggctagcc atatgtagaaagctgaaactggatcccttccttacaccttatacaaaaatcaattcaaga tggattaaagatttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggc attaccattcaggacataggcatgggcaaggacttcatgtccaaaacaccaaaagcaatg gcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcacagca agagaaactaccatcagagtgaacaggcaacctacaacatgggagaaaatttttgcaacc tactcatctgacaaagggctaatatccagaatctacaatgaactcaaccaaatttacaag aaaaaaaaaacaaccccatcaaaaagtgggcgaaggacatga >gi568815596f:165474359_165779750|GENSCAN_predicted_peptide_5|109_aa MGKKQNRKTGSSKNQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRS >gi568815596f:165474359_165779750|GENSCAN_predicted_CDS_5|330_bp atggggaaaaaacagaacagaaaaactggaagctctaaaaatcagagcgcctctcctcct ccaaaggaacgcagctcctcaccagcaacagaacaaagctggatggagaatgactttgac gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa accaaaggcaaagaagttgaaaacttcgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagctga >gi568815596f:165474359_165779750|GENSCAN_predicted_peptide_6|670_aa MKNKNKMLDLMLEAVNNIKDAMPKMQIGAPVRQNIDAGERPCLQGYYTAAELKPVLDRPP QDSNAPGASGKAFKTTNLSVEEQKEKERGEAKHCFNAFASDRISLHRDLGPDTRPPEYVE EYLLFILYHQALQGREGCIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAI LLKEIILVDDASVDEYLHDKLDEYVKQFSIVKIVRQRERKGLITARLLGATVATAETLTF LDAHCECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSL SFGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMS FRVWQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDEYKEIFYRRN 
TDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYYFEYSAQ HEIRHNIQKELCLHAAQGLVQLKACTYKGHKTVVTGEQIWEIQKISCILVPEVKYDRFHI LNLAEPQLYEKEHILVIDPIFSLFVSAMANGLVATEDLACCLHSRAPSPKNSRLILEVTA TIAARTNSPLSASFLTSSQVNIQAEVRVTEPIHVPYPQSNGGYESKYLALQLLQLKVEIS QAQKGSSDAR >gi568815596f:165474359_165779750|GENSCAN_predicted_CDS_6|2013_bp atgaaaaacaaaaacaagatgttggatttaatgctagaagctgtaaacaatattaaggat gccatgccaaaaatgcaaataggagcacctgtcaggcaaaacattgatgctggtgagaga ccttgtttgcaaggatattatacagcagcagaattgaagcctgtccttgaccgtccacct caggattcaaatgcacctggtgcttctggtaaagcattcaagacaaccaatttaagtgtt gaagagcaaaaggaaaaggaacgtggggaagctaaacactgctttaatgctttcgcaagt gacaggatttctttgcaccgagatcttggaccagacactcgacctcctgagtatgttgaa gaatatttattgtttattctttatcaccaggctttgcaggggagagagggatgtattgaa caaaaatttaagcgctgccctcccctgcccaccaccagtgtcataatagtttttcataat gaagcgtggtccacgttgcttagaactgtccacagtgtgctctattcttcacctgcaata ctgctgaaggaaatcattttggtggatgatgctagtgtagatgagtacttacatgataaa ctagatgaatatgtaaaacaattttctatagtaaaaatagtcagacaaagagaaagaaaa ggtctgatcactgctcggttgctaggagcaacagtcgcaacagctgaaacgctcacattt ttagatgctcactgtgagtgtttctatggttggctagaacctctgttggccagaatagct gagaactacacggctgtcgtaagtccagatattgcatccatagatctgaacacgtttgaa ttcaacaaaccttctccttatggaagtaaccataaccgtggaaattttgactggagtctt tcatttggctgggagtcgcttcctgatcatgagaagcaaagaaggaaagatgaaacctac ccaattaaaacacccacttttgcaggaggacttttttccatatcaaaagaatattttgag tatattggaagctatgatgaagaaatggaaatctggggaggtgaaaatatagaaatgtct ttcagagtatggcaatgtggtgggcagttggagattatgccttgctctgttgttggacat gtttttcgcagcaaaagccctcatagctttccaaaaggcactcaggtgattgctagaaac caagttcgccttgcagaagtctggatggatgaatacaaggaaatattttataggagaaat acagatgcagcaaaaattgttaaacaaaaagcatttggtgatctttcaaaaagatttgaa ataaaacaccgccttcagtgtaaaaattttacatggtatctgaacaacatttatccagag gtgtatgtgccagaccttaatcctgttatatctggatactactttgaatactctgctcaa catgaaattcggcacaacatccagaaggaattatgtcttcatgctgctcaaggtctcgtt cagctgaaggcatgtacctacaaaggtcacaagacagttgtcactggagagcagatatgg gagatccagaagatctcctgtattcttgttcccgaagtaaaatacgatcggtttcatatt 
ttaaatctggcagagcctcagctgtacgaaaaagagcatatactggttattgaccctatc ttctcattgtttgtttctgccatggcaaatggtctggttgctacagaagacctggcatgc tgccttcattccagagcaccaagtcctaagaactctcggttgatcctcgaggtaactgct actattgctgccagaactaattctccactgagtgcttcttttctcactagttcccaagtc aatatccaagctgaagtcagagtaactgagcctattcatgtgccctaccctcaaagtaat ggaggctatgaaagcaagtatctggctttgcagcttctgcagttgaaggtggaaatttca caggcacagaaagggagttcagatgccaggtag