GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:50:22 Sequence gi568815592f:110080445_110330128 : 249684 bp : 37.95% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 PlyA - 1124 1119 6 1.05 1.08 Term - 20235 20078 158 1 2 44 48 280 0.745 16.71 1.07 Intr - 21772 21144 629 0 2 80 86 399 0.975 29.82 1.06 Intr - 23113 22934 180 0 0 70 91 65 0.868 3.06 1.05 Intr - 25132 24963 170 1 2 38 4 189 0.579 3.32 1.04 Intr - 26750 26633 118 1 1 69 62 49 0.934 -0.05 1.03 Intr - 28237 28084 154 2 1 89 82 129 0.965 10.61 1.02 Intr - 33016 32882 135 2 0 68 75 156 0.944 11.92 1.01 Init - 52454 52376 79 2 1 84 93 53 0.242 6.67 1.00 Prom - 54302 54263 40 -2.05 2.06 PlyA - 54672 54667 6 1.05 2.05 Term - 65306 64266 1041 2 0 41 38 370 0.012 18.43 2.04 Intr - 72226 72133 94 0 1 36 28 74 0.010 -4.75 2.03 Intr - 78048 75769 2280 2 0 -1 53 805 0.031 54.48 2.02 Intr - 80288 80191 98 2 2 43 93 63 0.019 0.19 2.01 Init - 85405 85280 126 1 0 45 52 103 0.012 2.61 2.00 Prom - 91025 90986 40 -6.35 3.00 Prom + 92468 92507 40 -3.55 3.01 Init + 98440 98670 231 0 0 81 51 154 0.429 9.31 3.02 Intr + 99974 100189 216 1 0 104 121 351 0.990 37.78 3.03 Intr + 112738 112824 87 0 0 90 94 84 0.988 8.45 3.04 Intr + 115254 115418 165 2 0 29 67 127 0.838 3.94 3.05 Intr + 121114 121243 130 0 1 56 91 123 0.987 8.75 3.06 Intr + 127062 127145 84 1 0 116 86 66 0.994 8.07 3.07 Intr + 128640 128779 140 1 2 93 74 146 0.998 12.96 3.08 Intr + 130263 130359 97 2 1 81 47 121 0.996 5.96 3.09 Intr + 137258 137359 102 0 0 87 110 65 0.846 7.83 3.10 Intr + 138920 139035 116 0 2 54 108 116 0.874 9.45 3.11 Intr + 139292 139425 134 1 2 47 101 228 0.927 18.52 3.12 Intr + 145723 145799 77 1 2 51 101 45 0.298 0.34 3.13 Intr + 148388 148532 145 0 1 68 107 29 0.235 1.22 3.14 Intr + 149510 149610 101 2 2 86 29 61 0.021 -1.17 3.15 Intr + 163625 163698 74 0 2 36 103 85 0.005 3.01 3.16 Term + 165643 165822 180 0 0 -18 42 177 0.010 -0.87 3.17 PlyA + 166578 166583 6 1.05 4.00 Prom + 177012 177051 40 -6.65 4.01 Init + 177132 177277 146 2 2 93 99 179 0.909 19.14 4.02 Term + 179539 179893 355 1 1 21 37 372 0.674 18.38 4.03 PlyA + 179977 179982 6 1.05 5.00 Prom + 180977 181016 40 -6.15 5.01 Init + 181112 182052 941 1 2 8 -24 513 0.016 25.30 5.02 Term + 182202 183822 1621 0 1 -11 32 609 0.015 33.80 5.03 PlyA + 183832 183837 6 -0.45 6.04 PlyA - 184239 184234 6 1.05 6.03 Term - 185490 185360 131 1 2 44 42 225 0.702 10.86 6.02 Intr - 206371 206335 37 1 1 78 59 64 0.021 -0.58 6.01 Init - 218672 218478 195 2 0 80 100 139 0.535 13.28 6.00 Prom - 220539 220500 40 -6.45 7.00 Prom + 221338 221377 40 -5.55 7.01 Init + 222649 222743 95 0 2 87 18 71 0.794 -0.10 7.02 Intr + 223190 223398 209 2 2 101 17 125 0.663 4.50 7.03 Term + 223662 224098 437 1 2 -3 41 333 0.787 13.96 7.04 PlyA + 224149 224154 6 1.05 8.00 Prom + 225410 225449 40 -3.65 8.01 Init + 226349 226388 40 1 1 89 76 54 0.497 4.70 8.02 Term + 240044 240204 161 0 2 63 42 126 0.461 2.62 8.03 PlyA + 240497 240502 6 -0.45 9.02 PlyA - 240992 240987 6 1.05 9.01 Term - 242428 242303 126 1 0 128 44 76 0.212 4.50 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 34091 34016 76 2 1 86 5 71 0.854 -0.10 S.002 Sngl + 35469 35636 168 2 0 68 49 229 0.938 11.51 S.003 Init + 41989 42203 215 0 2 67 98 106 0.839 7.76 S.004 Sngl - 65276 64266 1011 2 0 42 38 352 0.940 22.34 S.005 Sngl + 181112 182080 969 1 0 8 38 497 0.983 33.16 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:110080445_110330128|GENSCAN_predicted_peptide_1|540_aa MEYYTAIKKNENNDIHSKLNGIGDYYCKYAEDIFGELFNEAHSFSFRVNSLQERVDRLSV SVTQLDPKEEELSLQDITMRKAFRSSTIQDQQLFDRKTLPIPLQETYDVCEQPPPLNILT PYRDDGKEGLKFYTNPSYFFDLWKEKMLQDTEDKRKEKRKQKKNLDRPHEPEKVPRAPHD RRREWQKLAQGPELAEDDANLLHKHIEVANGPASHFETRPQTYVDHMDGSYSLSALPFSQ MSELLTRAEERVLVRPHEPPPPPPMHGAGDAKPIPTCISSATGLIENRPQSPATGRTPVF VSPTPPPPPPPLPSALSTSSLRASMTSTPPPPVPPPPPPPATALQAPAVPPPPAPLQIAP GVLHPAPPPIAPPLVQPSPPVARAAPVCETVPVHPLPQGEVQGLPPPPPPPPLPPPGIRP SSPVTVTALAHPPSGLHPTPSTAPGPHVPLMPPSPPSQVIPASEPKRHPSTLPVISDARS VLLEAIRKGIQLRKVEEQREQEAKHERIENDVATILSRRIAVEYSDSEDDSEFDEVDWLE >gi568815592f:110080445_110330128|GENSCAN_predicted_CDS_1|1623_bp atggaatactacacagccataaaaaagaatgaaaataatgacattcatagcaagctgaat ggaattggagactattactgtaaatatgctgaagatatatttggagaattattcaatgaa gcacatagtttttccttcagagtcaactcattgcaagaacgtgtggaccgtttatctgtt agtgttacacagcttgatccaaaggaagaagaattgtctttgcaagatataacaatgagg aaagctttccgaagttctacaattcaagaccagcagcttttcgatcgcaagactttgcct attccattacaggagacgtacgatgtttgtgaacagcctccacctctcaatatactcact ccttatagagatgatggtaaagaaggtctgaagttttataccaatccttcgtatttcttt gatctatggaaagaaaaaatgttgcaagatacagaggataagaggaaggaaaagaggaag cagaagaaaaatctagatcgtcctcatgaaccagaaaaagtgccaagagcacctcatgac aggcggcgagaatggcagaagctggcccaaggtccagagctggctgaagatgatgctaat ctcttacataagcatattgaagttgctaatggcccagcctctcattttgaaacaagacct cagacatacgtggatcatatggatggatcttactcactttctgccttgccatttagtcag atgagtgagcttctgactagagctgaggaaagggtattagtcagaccacatgaaccacct ccacctccaccaatgcatggagcaggagatgcaaaaccgatacccacctgtatcagttct gctacaggtttgatagaaaatcgccctcagtcaccagctacaggcagaacacctgtgttt gtgagccccactcccccacctcctccaccacctcttccatctgccttgtcaacttcctca ttaagagcttcaatgacttcaactcctccccctccagtacctcccccacctccacctcca gccactgctttgcaagctccagcagtaccaccacctccagctcctcttcagattgcccct ggagttcttcacccagctcctcctccaattgcacctcctctagtacagccctctccacca gtagctagagctgccccagtatgtgagactgtaccagttcatccactcccacaaggtgaa gttcaggggctgcctccacccccaccaccgcctcctctgcctccacctggcattcgacca tcatcacctgtcacagttacagctcttgctcatcctccctctgggctacatccaactcca tctactgccccaggtccccatgttccattaatgcctccatctcctccatcacaagttata cctgcttctgagccaaagcgccatccatcaaccctacctgtaatcagtgatgccaggagt gtgctactggaagcaatacgaaaaggtattcagctacgcaaagtagaagagcagcgtgaa caggaagctaagcatgaacgcattgaaaacgatgttgccaccatcctgtctcgccgtatt gctgttgaatatagtgattcggaagatgattcagaatttgatgaagtagattggttggag taa >gi568815592f:110080445_110330128|GENSCAN_predicted_peptide_2|1212_aa MHLVKKQILIQKFRGDTKICSSYKFLGDVDAADAGLRTTLSLCWRNQHVKTTVDDSCWSK VAVEAMAAKSLKSSRINKIDRPLARLIKKKREKNQIDTIKNDKGDITTDPTEIQTTIREY YKHLYANKLENLEEMDTFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPD GFTAEFYQRYMEELVPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPIS LMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRAKDKNH MIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLE TGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVS AQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMGELPFTIASKRIKYLEIQLT RDVKDLFKENYKPLLKEIKEETNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPM TFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNR DIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLT PYTKINSRWIKDLNVKPKTIKTLEENLGITIQDIGVGKDFMSKTPKAMATKDKIDKWDLI KLKSFCTAKETTIRVNRQPTTWEKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKKWAK DMNRHFSKEDIHAAKKHMKKCSSSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRDSEVQ NHGTDGYISFLLALGNNHLASSFKFLHINRTKDKNHMIISIDAEKAFDKIQQPFMLKTLN KLGIDGMYLRIIRATYDKPTANIILNGQKLEAFPLKTGTRQGCPLSLLLFNIALEVLARA IRQEKGIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKFSGYKINVQKS QAFLYTNNRQTESQIMTELPFTIASKRIKYLGIQLARDVKDLFKENYKPLLNEIKEDTNK WKNIPCSWVGRNSFVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAK SILSQKNKAGGVTLPDFKLHYKATVTKTAWYWYQNREIDQWNRTEPSEIMPHIYNYLIFD KPDKEMGKGFPI >gi568815592f:110080445_110330128|GENSCAN_predicted_CDS_2|3639_bp atgcatcttgttaaaaagcagattctgattcagaagttcaggggtgataccaaaatatgc agttcttacaagttcctaggtgatgttgatgctgctgatgctggtttaaggaccacactt tcactgtgttggagaaatcagcatgttaaaacaactgttgatgatagctgttggagtaaa gttgcagtggaagctatggctgcaaaatcgttaaaatcttcaaggatcaacaaaattgat agaccgctagcaagactaataaagaaaaagagagagaagaatcaaatagacacaataaaa aatgataaaggggatatcaccaccgatcccacagaaatacaaactaccatcagagaatac tacaaacacctctacgcaaataaactagaaaatctagaagaaatggatacattcctcgac acatacactctcccaagactaaaccaggaagaagttgaatctctgaatagaccaataaca ggctctgaaattgtggcaataatcaatagtttaccaaccaaaaagagtccaggaccagat ggattcacagctgaattctaccagaggtacatggaggaactggtaccattccttctgaaa ctattccaatcaatagaaaaagagggaatcctccctaactcattttatgaggccagcatc attctgataccaaagccgggcagagacacaaccaaaaaagagaattttagaccaatatcc ttgatgaacattgatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcac atcaaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttc aatatacgcaaatcaataaatgtaatccagcatataaacagagccaaagacaaaaaccac atgattatctcaatagatgcagaaaaagcctttgacaaaattcaacaacccttcatgcta aaaactctcaataaattaggtattgatgggacgtatttcaaaataataagagctatctat gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttggaa accggcacaagacagggatgccctctctcaccgctcctattcaacatagtgttggaagtt ctggccagggcaatcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaa gtcaaattgtccctgtttgcagacgacatgattgtttatctagaaaaccccatcgtctca gcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaat gtacaaaaatcacaagcattcttatacaccaacaacagacaaacagagagccagatcatg ggtgaactcccattcacaattgcttcaaagagaataaaatacctagaaatccaacttaca agggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagag gagacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaa atggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctaccaatg actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc cgcattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacctgac ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga gatatagatcaatggaacagaacagagccctcagaaataatgccgcatatctacaactat ctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca ccttatacaaaaatcaattcaagatggattaaagatttaaacgttaaacctaaaaccata aaaaccctagaagaaaacctaggcattaccattcaggacataggcgtgggcaaggacttc atgtccaaaacaccaaaagcaatggcaacaaaagacaaaattgacaaatgggatctaatt aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctaca acatgggagaaaatttttgcaacctactcatctgacaaagggctaatatccagaatctac aatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaag gacatgaacagacacttctcaaaagaagacattcatgcagccaaaaaacacatgaagaaa tgctcatcatcactggccatcagagaaatgcaaatcaaaaccactatgagatatcatctc acaccagttagaatggcaatcattaaaaagtcaggaaacaacagggactcagaagtccaa aatcatggcactgacggctacatttcttttctgttggctctggggaacaatcaccttgca agctcattcaagttcctgcatataaacagaaccaaagacaaaaaccacatgattatctca atagatgcagaaaaggcctttgacaaaattcagcaacccttcatgctaaaaactctcaat aaattaggtattgatgggatgtatctcagaatcataagagctacctatgacaaacccaca gccaatatcatactgaatggacaaaaactggaagcattccctttgaaaactggcacaaga cagggatgccctctctcactactcctattcaacatagcgttggaagttctggccagggca atcaggcaggagaagggaataaagggcattcaattaggaaaagaggaagtcaaattgtcc ctgtttgcagatgacatgattgtatatctagaaaaccccattgtctcagcccaaaatctc cttaagctgataagcaacttcagcaaattctcaggatacaaaatcaatgtgcaaaaatca caagcattcttatacaccaataacagacaaacagagagccaaatcatgactgaactccca ttcacaattgcttcaaagagaataaaatacctaggaatccaacttgcaagggatgtgaag gacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggatacaaacaaa tggaagaacattccatgctcatgggtaggaagaaacagtttcgtgaaaatggccatactg cccaaggtaatttatagattcaatgccatccccatcaagctaccaatgactttcttcaca gaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatcgccaag tcaatcctaagccaaaagaacaaagctggaggcgtcacgctacctgacttcaaactacac tacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagagatagaccaa tggaacagaacagagccctcagaaataatgccgcatatctacaactatctgatctttgac aaacctgataaagaaatggggaaaggattccctatttaa >gi568815592f:110080445_110330128|GENSCAN_predicted_peptide_3|692_aa MNRHQKRQHRFASPSNEEKDARTRFTVMFSDTDSRSKHANARARFTEPRARRASRGFSVA RGYRLPRKEARGGPRARGLRRRFVAVMSAAIAALAASYGSGSGSESDSDSESSRCPLPAA DSLMHLTKSPSSKPSLAVAVDSAPEVAVKEDLETGVHLDPAVKEVQYNPTYETMFAPEVG FYEERLQEENKQHEPNHKRSKEQECPSPYQKKELRDKTRMVLCITLRKALGAKFGPENPF RTQQMAAPRNMLSGYAEPAHINDFMFEQQRRTFATYGYALDPSLDNHQVSAKYIGSVEEA EKNQGLTVFETGQKKTEKRKKFKENDASNIDGFLGPWAKYVDEKDVAKPSEEEQKELDEI TAKRQKKGKQEEEKPGEEKTILHGHSKAVRDICFNTAGTQFLSAAYDRYLKLWDTETGQC ISRFTNRKVPYCVKFNPDEDKQNLFVAGMSDKKIVQWDIRSGEIVQEYDRHLGAVNTIVF VDENRRFVSTSDDKSLRVWEWDIPVDFKYIAEPSMHSMPAVTLSPNGKWLACQSMDNQIL IFGAQNRFRLNKKKIFKGHMVAGYACQVDFSPDMSYVISGDGNGKLNIWDWKTTKLYSRF KAHDKVCIERIGAIEKLSNYPTKVRRWSSGHNHKAVPEPHNAAVTATDLKPRPVEMDLKD ELSNLLKNIFQNKIFQNFPFCTFQICLENVNL >gi568815592f:110080445_110330128|GENSCAN_predicted_CDS_3|2079_bp atgaacaggcaccagaagcgacagcaccgcttcgcctccccttccaacgaagaaaaggat gcccgcactcgctttaccgttatgttttcagacaccgactcgcggagcaaacacgcaaac gcacgagctcgctttaccgagcctcgcgcacgtagggcgagcaggggattctctgtcgcc cggggctaccgcctcccccggaaagaggcacggggcggaccgcgggcgcggggtctccgc agaagatttgttgccgtcatgtcggctgcgattgcagctctggccgcttcctatggttcg ggttcagggtccgaatcggactcggacagtgagagcagtcggtgtccgctgccagccgcc gactccctcatgcacttgactaaatcgccttcatcaaagccgtctctagcagtggcagtg gactcggctccggaggtggcagttaaggaagatttggagactggagttcaccttgaccct gccgtcaaagaagttcagtataatcctacctatgagaccatgtttgctcctgaggtaggc ttttatgaggagagacttcaagaagagaacaaacagcatgagccgaatcataaaagatct aaagaacaggaatgcccgagtccttaccagaagaaagaattgagagataagactaggatg gtactttgtataacattgcgcaaggctttgggggccaagtttggaccagaaaatcccttt aggacacagcaaatggctgcccctagaaatatgctttctggatatgccgaaccagctcat atcaatgatttcatgtttgagcagcaaaggagaacttttgcaacatatggttatgcatta gacccttcattagataatcatcaagtgtctgctaaatatattggttctgtagaagaagct gaaaaaaatcaaggtttaactgtatttgaaactggtcagaagaaaacagaaaagaggaaa aagtttaaagaaaatgatgcatccaatattgatggttttttgggaccatgggcaaaatat gtggatgaaaaagatgtagccaaaccttcagaagaagagcaaaaagaattggatgaaatc acagcaaagaggcagaaaaaaggaaaacaggaagaagagaaacctggggaggagaagaca atcttacatggtcacagtaaggctgttagggatatctgcttcaatactgcaggaacacag ttcctcagtgcagcctatgacaggtatcttaagctctgggacactgagacaggacagtgt atatcaagatttacaaaccgaaaagtaccttattgtgtcaaattcaatcctgatgaagat aagcaaaatctctttgtggctgggatgtctgataagaagattgtgcaatgggacattcga agtggagaaattgtgcaggaatatgatcggcatttgggagctgtcaacaccattgttttt gtggatgagaataggagatttgtgagcacatctgatgataaaagcctaagagtttgggaa tgggatatccctgtggatttcaagtacatagcagaacccagtatgcactcaatgcctgca gtgactttgtctccaaatggaaaatggctagcatgccaatcaatggacaaccaaatctta atttttggagcacagaacagatttagattaaataagaaaaaaatttttaagggccatatg gtagcaggctatgcttgtcaggtggacttttcaccagacatgagttatgtgatttcagga gatggaaatggaaaattaaacatttgggactggaagaccacaaaactctacagtcgattt aaagctcatgataaagtgtgtatagaaaggatcggtgctattgaaaaactgtctaactat cctacgaaagtgagacgatggtcttcaggccacaatcataaggctgtaccagaaccgcac aacgctgctgtcactgccactgacctcaaacccaggccagtggagatggatctcaaagat gagctgtccaatctgctcaagaacatcttccagaataagattttccaaaactttccattc tgcactttccagatctgccttgagaacgtcaatctgtaa >gi568815592f:110080445_110330128|GENSCAN_predicted_peptide_4|166_aa MARKNTVSHSVDLKWSLSTCVSTKSPAADDAAGTTTTLCESLALADNTRPKVDKTTKMGK KQSRKTENSKSQNASPPPKERSSSPAMEQSWTENDFDELREEGFRQSNFSELKEEVRTHG KEDKNFEKRLDEWLTRITNAEKSLKDLMELKTMARELRDECTSFIS >gi568815592f:110080445_110330128|GENSCAN_predicted_CDS_4|501_bp atggctcgtaaaaacacagtttctcactcagtagatctgaagtggagcctaagtacttgt gtttctactaagagcccagctgccgatgatgctgctggtacaacgaccacactttgtgaa tcactagccttggctgataacaccagaccaaaggtagataaaaccacaaagatggggaaa aaacagagcagaaaaactgaaaattctaaaagtcagaatgcctctcctcctccaaaggaa cgcagctcctcaccagcaatggaacaaagctggacggagaatgactttgacgagctgaga gaagaaggcttcagacaatcaaacttctccgaactaaaggaggaagttcgaacccacggc aaagaagataaaaactttgaaaaaagattagacgaatggctaactagaataaccaatgca gagaagtccttaaaggacctgatggagctgaaaaccatggcacgagaactacgtgatgaa tgcacaagctttatcagctga >gi568815592f:110080445_110330128|GENSCAN_predicted_peptide_5|853_aa MRQKVNKDIQELNSALRQVDLIDIYRTLHPKSTIYTFFSAPHHTYSKTDHIVGSKALLSK CQRTEIITKCISDHSAIKLELRIKKLTQNRSTTWKLNNLLLNDYWVHNEMKAEIKMFFET NENKDTTYRNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQEQPHSKASR RQEITKIRAELKEIEIQKTLQKINESRSWFFEKINRIDRLLARLIKKKREKNQIDTIIID KGDITTDPTEIQTTIREYYKHLYTNKLENLEEMDKFLDTYTIPRLNQEEIESLNRPITGS EIETIINSLPTKKNLAETQQKKENFRAISLMNINAKILNKILTNRIQQHIKKLIHHDQVG FIPGMQGWFNIHKSINVIQHVNRTNDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGT YLKIIRAINDKPTANIILNVQKLDVFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEI KGIQLGKGEVKLSLFADDMIVYLENPIISAQNLLKLISNFSKVSGYKINVQKSQAFLYTN NRQTESQIMTELPFTIASKRIEYLGIQLTKDAKDLFKENYKPLLNEIKEDTNKWKNISCS WVGIINIVKMAILPKVIYRFNAIPIELPMTFFTELEKTTLKFMWNQKRARIAKSILSQKN KAGGVTLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKSDKKKK WGKDSLFNKWCWENWLAICRKLKLDPFLRPYTKINSRWIKDLKVRPKTIKTLEENLGNTI QDIGMGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTTRVNRQPTEWEKIFAIYSS DKGLTSRIYKELK >gi568815592f:110080445_110330128|GENSCAN_predicted_CDS_5|2562_bp atgagacagaaagttaacaaggatatccaggaattgaactcagctctgcgccaagtggac ctaatagacatctacagaactctccaccccaaatcaacaatatatacattcttctcagca ccacaccacacttattccaaaactgaccacatagttggaagtaaagcactcctcagcaaa tgtcaaagaacagaaattataacaaagtgtatctcagaccacagtgcaatcaaactagaa ctcaggattaagaaactcactcaaaaccgctcaactacatggaaactcaacaacctgctc ctgaatgactactgggtacataacgaaatgaaggcagaaataaagatgttctttgaaacc aacgagaacaaagacacaacgtaccggaatctctgggacacatttaaagcagtgtgtaga gggaaatttatagcactaaatgcccacaagagaaagcaggaaagatctaaaattgacacc ctaacatcacaattaaaagaactagagaagcaagagcaaccacattcaaaagctagcaga aggcaagaaataactaagatcagagcagaactgaaggagatagagatacaaaaaaccctt caaaaaatcaatgaatccaggagctggttttttgaaaagatcaacagaattgatagactg ctagcaagattaataaaaaagaaaagagagaagaatcaaatagacacaataataattgat aaaggggatatcaccaccgatcccacagaaatacaaactaccatcagagaatactataaa cacctctacacaaataaacttgaaaatctagaagaaatggataaattcctcgacacatac accatcccaagactaaaccaagaagaaattgaatcactgaatagaccaataacaggctct gaaattgagacaataattaatagcttaccaaccaaaaaaaacctggcagagacacaacaa aaaaaagagaattttagagcaatatccttgatgaacatcaatgcaaaaatcctcaataaa atactgacaaaccgaatccagcagcatatcaaaaagcttatccaccatgatcaagtgggc ttcatccctgggatgcaaggctggttcaacatacacaaatcaataaacgtaatccagcat gtaaacagaaccaacgacaaaaaccacatgattatctcaatagatgcagaaaaggccttt gacaaaattcaacagcccttcatgctaaaaactctcaataaattaggtattgatgggacg tatctcaaaataatcagagctattaatgacaaacccacagccaatatcatactgaatgtg caaaaactggatgtgttccctttgaaaactggcacaagacaaggatgccctctctcacca ctcctattcaacatagtgttggaagttctggccagggcaatcaggcaggagaaggaaata aagggtattcaattaggaaaaggggaagttaaattgtccctgtttgcagatgacatgatt gtatatctagaaaaccccatcatctcagcccaaaatctccttaagctgataagcaacttc agcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatacaccaat aacagacaaacagagagccaaatcatgactgaactcccattcacaattgcttcaaagaga atagaatacctaggaatccaacttacaaaggatgcgaaggacctcttcaaggagaactac aagccactgctcaacgaaataaaagaggatacaaacaaatggaagaacatttcatgctca tgggtaggaataatcaatatcgtgaaaatggccatactgcccaaagtaatttatagattc aatgccatccccatcgagctaccaatgactttcttcacagaattggaaaaaactacttta aagttcatgtggaaccaaaaaagagcccgcatcgccaagtcaatcctaagccaaaagaac aaagctggaggcgtcacgctacctgacttcaaactatactacaaggctacagtaaccaaa acagcatggtactggtaccaaaacagagatatagaccaatggaacagaacagagccctca gaaataatgccacatatctacaactatctgatctttgacaaatctgacaaaaagaagaaa tggggaaaggattccctatttaataaatggtgctgggaaaactggctagccatatgtaga aagttgaaactggatcccttccttagaccttatacaaaaattaattcaaggtggattaaa gacttaaaggttagacctaaaaccataaaaaccctagaagaaaacctaggcaataccatt caggacataggcatgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaa gccaaaattgacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaaact accaccagagtgaacaggcaacccacagaatgggagaaaatttttgcaatctactcatct gacaaagggctaacatccagaatctacaaagaactgaaataa >gi568815592f:110080445_110330128|GENSCAN_predicted_peptide_6|120_aa MANNGCEVHRFDPSVKSAHILESQHLWYHRLSIDWRDPHPAVAAQKPHSNTRKLGSILNE FGHHKLCWQLIVPTRIKEQEPVKKKKKKEEEEEEEEEGGGGGGRGRGGGIKKKNFLPSLT >gi568815592f:110080445_110330128|GENSCAN_predicted_CDS_6|363_bp atggccaacaacggatgtgaagtgcatcgttttgatcctagtgtcaagtcagctcacatt ctggagagtcagcacctttggtatcaccgcttgtccattgactggcgggatccccatcca gctgttgctgcccaaaaaccacatagcaacaccagaaaactgggaagcattttgaatgaa tttggacatcacaagctttgctggcagctgattgtgcccactcggattaaggagcaagaa cctgtgaagaagaagaagaagaaagaggaggaggaggaggaagaggaagaaggaggagga ggaggaggaagaggaagaggaggaggaataaaaaagaaaaattttcttccttctcttaca taa >gi568815592f:110080445_110330128|GENSCAN_predicted_peptide_7|246_aa MRDPAMRDGAFQPGYYAFPMVFITHRPGDSLGASLKEMQQPHSGAYRQNFHLPGSEHLGE GAAAGAASADLNIPAYWLRRQQWISQYNTRALLRYRLPLQVESNRININKKEDRAKTPSE GHQQQRPKVDKSMKMRKNQRKKAENSKNQNTSSPPKDHHSSPERKQNWMENEFDKLTEVG FRRWVITNSCELKEHVLTQCKEAKNLDKRLEELLTKITSLEKNMNNLMELKNTARELHEP YTNINS >gi568815592f:110080445_110330128|GENSCAN_predicted_CDS_7|741_bp atgagggaccctgccatgagggatggtgcattccagcccggatactacgcttttcccatg gtcttcataacccacagaccaggagattccctcggggcatccctgaaagaaatgcagcaa ccccactcaggggcttatagacaaaacttccatctccctgggtcagagcacctgggggaa ggggcggctgcgggtgcagcttcagcagacttaaacattcctgcctactggctccgaaga cagcagtggatctcccagtacaacactcgagctctgctaaggtacagactgcctctgcaa gtggaaagcaatagaatcaacatcaacaaaaaggaagaccgggcaaaaactccatctgaa ggccaccaacagcaaagaccaaaggtagataaatccatgaagatgaggaaaaaccagcgc aaaaaggctgaaaattccaaaaaccagaacacctcttctcctccaaaggatcaccactcc tcgccagaaaggaaacaaaactggatggagaatgaatttgacaaactgacagaagtaggc ttcagaaggtgggtaataacaaactcctgtgagctaaaggagcatgttctaacccaatgc aaggaagctaagaaccttgataaaaggttagaagaattgctaactaaaataactagttta gagaagaacatgaataacctgatggaactgaaaaacacagcacgagaacttcatgaacca tacacaaatatcaatagctga >gi568815592f:110080445_110330128|GENSCAN_predicted_peptide_8|66_aa MAQSGKPGVKSNPVTRKYIKAQNLIWLKWPRIKNHCEVRNFTSFFLEDKNPSSELQQQIL YYRSSS >gi568815592f:110080445_110330128|GENSCAN_predicted_CDS_8|201_bp atggcccaaagcggtaaaccaggagtcaaatccaatccagtaacccggaagtacatcaaa gcacagaaccttatctggctaaaatggccccgcattaaaaaccattgtgaggtccggaat tttacgtcattctttctggaggacaaaaatccttctagtgagcttcaacaacaaattctc tattatcgctcatcaagctga >gi568815592f:110080445_110330128|GENSCAN_predicted_peptide_9|41_aa GPRWHIDLQPWAGSAQSLDEEAWRFLRYISTTQVVFQAQRS >gi568815592f:110080445_110330128|GENSCAN_predicted_CDS_9|126_bp ggtccccggtggcatatagatctccagccttgggcaggctctgctcagtccctggatgaa gaagcctggaggttcctgagatatatcagcaccacccaggttgtgtttcaggctcaaaga agttag