GENSCAN 1.0 Date run: 16-Jul-119 Time: 15:57:59 Sequence gi568815597r:94073872_94331611 : 257740 bp : 40.72% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.18 PlyA - 1809 1804 6 1.05 1.17 Term - 3184 2960 225 1 0 50 55 93 0.325 -2.00 1.16 Intr - 4016 3819 198 2 0 71 94 157 0.968 13.23 1.15 Intr - 4835 4719 117 2 0 94 61 80 0.977 5.64 1.14 Intr - 5590 5451 140 2 2 93 64 56 0.999 2.96 1.13 Intr - 6847 6607 241 1 1 75 108 205 0.737 17.40 1.12 Intr - 9554 9481 74 0 2 56 105 12 0.128 -2.09 1.11 Intr - 14801 14612 190 2 1 48 92 123 0.196 6.94 1.10 Intr - 25120 24923 198 1 0 99 79 300 0.934 28.83 1.09 Intr - 27733 27565 169 0 1 40 44 180 0.873 7.83 1.08 Intr - 27804 27744 61 1 1 89 52 25 0.698 -4.13 1.07 Intr - 28120 27955 166 1 1 68 64 80 0.547 2.21 1.06 Intr - 29271 29144 128 1 2 93 91 93 0.993 9.58 1.05 Intr - 32345 32204 142 2 1 59 80 37 0.383 -0.99 1.04 Intr - 33585 33127 459 0 0 6 50 309 0.745 11.35 1.03 Intr - 33675 33622 54 0 0 107 65 68 0.724 4.66 1.02 Intr - 34534 34524 11 2 2 92 97 -10 0.425 -6.64 1.01 Init - 34808 34706 103 2 1 45 115 126 0.364 11.45 1.00 Prom - 34897 34858 40 -10.15 2.00 Prom + 35209 35248 40 -11.54 2.01 Init + 35328 35508 181 2 1 74 96 67 0.734 3.57 2.02 Intr + 36627 36689 63 1 0 70 48 90 0.482 0.97 2.03 Term + 37884 38230 347 1 2 28 44 400 0.582 23.27 2.04 PlyA + 38382 38387 6 1.05 3.06 PlyA - 39381 39376 6 1.05 3.05 Term - 41264 40957 308 0 2 45 45 218 0.975 7.49 3.04 Intr - 47175 47109 67 0 1 69 87 96 0.116 5.16 3.03 Intr - 54165 54037 129 0 0 55 89 104 0.445 7.17 3.02 Intr - 55029 54889 141 0 0 97 78 55 0.445 5.03 3.01 Init - 55515 55513 3 0 0 54 101 0 0.161 -2.05 3.00 Prom - 56338 56299 40 -8.35 4.00 Prom + 59786 59825 40 -7.35 4.01 Sngl + 61273 61566 294 0 0 88 54 222 0.821 14.25 4.02 PlyA + 61686 61691 6 1.05 5.00 Prom + 62688 62727 40 -7.35 5.01 Init + 62781 63799 1019 2 2 51 72 403 0.089 28.15 5.02 Term + 64115 65006 892 2 1 -47 43 446 0.142 17.51 5.03 PlyA + 66377 66382 6 1.05 6.26 PlyA - 66832 66827 6 1.05 6.25 Term - 89390 88846 545 0 2 85 39 260 0.526 14.24 6.24 Intr - 90552 90399 154 0 1 -23 36 182 0.410 0.62 6.23 Intr - 90984 90829 156 0 0 78 54 92 0.789 4.09 6.22 Intr - 91732 91573 160 0 1 64 44 145 0.868 6.77 6.21 Intr - 92471 92266 206 2 2 61 50 117 0.081 2.28 6.20 Intr - 93034 93000 35 2 2 85 75 28 0.041 -1.78 6.19 Intr - 100878 100021 858 1 0 81 62 564 0.071 43.49 6.18 Intr - 103849 103741 109 1 1 64 89 103 0.999 6.94 6.17 Intr - 104296 103981 316 0 1 83 81 181 0.728 12.04 6.16 Intr - 106086 105854 233 0 2 65 95 38 0.405 -2.15 6.15 Intr - 111189 111001 189 0 0 65 88 87 0.581 5.26 6.14 Intr - 111610 111471 140 2 2 15 94 62 0.647 -1.24 6.13 Intr - 112726 112628 99 2 0 81 32 184 0.902 11.26 6.12 Intr - 115070 114966 105 0 0 68 88 44 0.755 1.67 6.11 Intr - 115481 115345 137 1 2 81 68 109 0.993 7.49 6.10 Intr - 116212 116055 158 1 2 85 72 179 0.703 13.89 6.09 Intr - 127986 127849 138 0 0 3 87 92 0.404 0.34 6.08 Intr - 128861 128673 189 2 0 53 31 159 0.733 5.56 6.07 Intr - 130123 130059 65 2 2 82 111 37 0.982 2.92 6.06 Intr - 131327 131190 138 0 0 95 92 57 0.776 6.31 6.05 Intr - 131812 131764 49 1 1 42 90 67 0.920 -0.37 6.04 Intr - 135033 134961 73 2 1 51 121 69 0.957 4.99 6.03 Intr - 135479 135383 97 0 1 73 92 54 0.955 2.45 6.02 Intr - 146521 146387 135 2 0 78 116 84 0.867 9.82 6.01 Init - 157740 157536 205 0 1 54 72 68 0.357 0.86 6.00 Prom - 160428 160389 40 -5.35 7.00 Prom + 161221 161260 40 -5.95 7.01 Init + 165546 165587 42 2 0 65 59 38 0.039 -0.92 7.02 Intr + 172973 173104 132 1 0 108 93 -18 0.433 0.62 7.03 Intr + 173589 173972 384 2 0 56 47 264 0.860 13.22 7.04 Term + 179752 179826 75 0 0 6 49 188 0.610 3.56 7.05 PlyA + 182477 182482 6 1.05 8.06 PlyA - 182860 182855 6 1.05 8.05 Term - 191643 191366 278 1 2 89 50 99 0.088 0.84 8.04 Intr - 198517 198432 86 0 2 65 88 60 0.048 2.24 8.03 Intr - 206531 206453 79 0 1 95 91 67 0.615 5.39 8.02 Intr - 213942 213824 119 2 2 73 35 84 0.467 0.79 8.01 Init - 214703 214240 464 2 2 74 40 144 0.318 3.41 8.00 Prom - 214757 214718 40 -4.25 9.02 PlyA - 215367 215362 6 1.05 9.01 Sngl - 218710 218363 348 1 0 77 42 417 0.973 31.79 9.00 Prom - 221763 221724 40 -7.55 10.00 Prom + 222617 222656 40 -4.75 10.01 Init + 228167 228502 336 1 0 83 3 349 0.667 23.22 10.02 Intr + 228610 229084 475 0 1 38 59 592 0.062 43.01 10.03 Intr + 231304 231460 157 2 1 81 65 46 0.102 -0.25 10.04 Intr + 245356 245525 170 1 2 78 55 92 0.113 3.57 10.05 Intr + 256264 256394 131 2 2 91 63 91 0.002 6.39 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 42021 41967 55 0 1 93 111 28 0.830 7.10 S.002 Term - 100878 99998 881 1 2 81 50 591 0.926 46.32 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:94073872_94331611|GENSCAN_predicted_peptide_1|891_aa MNAPESQHLGRIWTELHILSQFMDTLRTHPERIAEAWKEKNKKQSESNNTNVPLQSAPLW MARGTGHGHWIPLARAEEKSGREEVKPPKIQGLLQLWVWSVPSEERLLWGRLPRKRGSEA TKAAVRGFWEMCAPRIRELTLRMTLEVNRDEKGVTKGSTRGMGEAVLIEQLLLQVPNKKC GRMVEISPGGTFPCMRAAPSTFSRPSVPPASGCEKSLTEQIWGQGLRKGWPRPPRNGGSA RSLALEAVRGNRGPSPRRGIRIRDILKDEETLTLFLIKNIGLSDSVVYLLINSQVRPEQS CGSQSPETSSISDAWELVRNANYQGPLQVNWVQSPGIRPSNLCFTSPPGDSAGSDSRRER DINRTSEPVLTLRSRVLSESRGVRDPTPPGQHQLLEAGSHTGSLHNEGAHTCFSGFSDPE VFTETRQGEPRFAHGVPDLALKDIACSEALLERFIIFSQRRGAKTVRYALCSLSQGTLQW IEDTLYANVDFFKLFRVHQFLDLQAGDLDNGMGRTHDGSQTESGASHRKGTQAQKLVWGK TMWSVQNTLDLGRCPADSWIYSRSQGINLRSWGGILSDMSPRIQEFIHRPSMQDLLWVTR PLMQNGGPETFTKLMGILSDLLCGYPEGGGSRVLSFNWYEDNNYKAFLGIDSTRKDPIYS YDRRTTSFCNALIQSLESNPLTKIAWRAAKPLLMGKILYTPDSPAARRILKNANSTFEEL EHVRKLVKAWEEVGPQIWYFFDNSTQMNMIRDTLGNPTVKDFLNRQLGEEGITAEAILNF LYKGPRESQADDMANFDWRDIFNITDRTLRLVNQYLELHEACTIIIANLQMRSQGLKRLN KIPRMTIKRSEQDPALGVPEFRACLSDFSYQALSPLPPKYLSTLPLAFQAL >gi568815597r:94073872_94331611|GENSCAN_predicted_CDS_1|2676_bp atgaatgcaccagagagccagcaccttggccgtatttggacagagctacacatcttgtcc caattcatggacaccctccggactcacccggagagaattgcagaagcctggaaggagaag aacaagaaacaatccgaaagtaataacaccaatgtgcctttacaaagtgctcctctgtgg atggctagagggactggacatggccactggatcccacttgcaagagcagaggaaaagagt ggtcgtgaggaagtaaagccccccaaaatccaggggttgctgcagctttgggtgtggagc gtgccctctgaggaaaggctgctctgggggagattgcccaggaaacggggctcagaggcc acgaaagcagctgttaggggcttctgggagatgtgtgctcctaggattagggagttgact ctaaggatgaccttagaggttaacagggatgagaaaggggtcaccaaggggtctaccagg ggaatgggagaggctgtattgatagaacagcttctgctgcaggttccaaacaagaaatgt gggagaatggttgaaatcagccccgggggcaccttcccgtgcatgcgtgcagctccttca acattcagtcgaccttcagtgcctcctgcttcgggctgtgagaagtccctaacagagcaa atctggggacaagggctcaggaaaggttggccacggccccctaggaatgggggctctgca agatccctggccttagaggctgtgagagggaacaggggtccatccccaagaagaggaata cgaataagggatatcttgaaagatgaagaaacactgacactatttctcattaaaaacatc ggcctgtctgactcagtggtctaccttctgatcaactctcaagtccgtccagagcagtcc tgtggctctcaaagcccagagaccagcagcatcagcgatgcctgggagcttgttaggaat gcaaattatcagggcccactccaggtgaactgggtccaaagccctgggataaggcctagc aatctgtgcttcacaagccctccaggtgattccgcaggctcagactccaggagagaacga gacataaacagaacttcagagcctgtgttaaccctgagatcaagggtgctgtctgagtcc agaggagtgagggaccccaccccacctggtcagcaccagctcctggaagcaggttctcac actggttccctgcacaatgaaggagctcatacctgcttttctggcttctcagaccctgag gttttcaccgaaactagacaaggggaacctaggttcgctcatggagtcccggacctggcg ctgaaggacatcgcctgcagcgaggccctcctggagcgcttcatcatcttcagccagaga cgcggggcaaagacggtgcgctatgccctgtgctccctctcccagggcaccctacagtgg atagaagacactctgtatgccaacgtggacttcttcaagctcttccgtgtgcaccaattt ctggatctccaggctggagatttagacaatgggatgggaagaacccatgatgggtcccag acagaaagtggtgccagccacagaaagggcacacaggcacagaagttggtttggggtaag acgatgtggtcagttcagaacacgctggatctaggcagatgcccagcagacagttggata tacagccgttctcaaggtatcaatctgagatcttggggaggaatattatctgatatgtca ccaagaattcaagagtttatccatcggccgagtatgcaggacttgctgtgggtgaccagg cccctcatgcagaatggtggtccagagacctttacaaagctgatgggcatcctgtctgac ctcctgtgtggctaccccgagggaggtggctctcgggtgctctccttcaactggtatgaa gacaataactataaggcctttctggggattgactccacaaggaaggatcctatctattct tatgacagaagaacaacatccttttgtaatgcattgatccagagcctggagtcaaatcct ttaaccaaaatcgcttggagggcggcaaagcctttgctgatgggaaaaatcctgtacact cctgattcacctgcagcacgaaggatactgaagaatgccaactcaacttttgaagaactg gaacacgttaggaagttggtcaaagcctgggaagaagtagggccccagatctggtacttc tttgacaacagcacacagatgaacatgatcagagataccctggggaacccaacagtaaaa gactttttgaataggcagcttggtgaagaaggtattactgctgaagccatcctaaacttc ctctacaagggccctcgggaaagccaggctgacgacatggccaacttcgactggagggac atatttaacatcactgatcgcaccctccgcctggtcaatcaatacctggagctccatgag gcttgcaccattatcattgccaatttgcagatgagaagccagggcttaaagaggttaaat aagatcccacgcatgaccattaagaggagcgaacaggatccagctctgggggtgcctgag ttcagagcctgcctttctgatttctcttaccaagctttgtctcctctccctcctaaatat ctctcaactctgcctcttgcattccaggctctctga >gi568815597r:94073872_94331611|GENSCAN_predicted_peptide_2|196_aa MVSDLFSLPVLAGSARLSVPCSVCLLPREALNRVTLVDITGGFIQFLPNSLRPALPPIHA GRVKPAPQQAFAGGLQVWAAQETWPLLSSSRCSGETDGCCNRHGWKCEGEARGDTSFCAE RGIGVLVQEDSTQLLLLLEQTLVYSFNHGELSTYPKHAVGLQVMGSSGRGKEESDGQEAD SELGPETHEREGEDQE >gi568815597r:94073872_94331611|GENSCAN_predicted_CDS_2|591_bp atggtgagcgatctattcagtctcccagttttggcagggtctgctcgcctctcggtccca tgctctgtgtgtctcttgcccagagaagccttaaacagggtgacattggtggatatcact ggagggtttatccagtttttacccaattctctgagacctgcccttccacccattcacgca ggtcgggtaaagcctgctccccaacaggcctttgctggtggccttcaggtatgggctgca caagagacatggcctcttctctcaagctcccggtgtagtggggagacagacggatgttgt aataggcatgggtggaaatgtgaaggggaagcaaggggggacacgtcattttgtgccgag aggggcataggagtgttagttcaagaagactccacgcagttgctgctgctgcttgagcag acgctcgtctattccttcaaccacggtgaactgagcacctaccccaagcatgcagtaggg ctgcaggttatgggctcctcggggagaggtaaagaagagtctgatggacaggaggctgat tcagaactggggcctgaaactcatgaacgagaaggtgaagatcaagaatag >gi568815597r:94073872_94331611|GENSCAN_predicted_peptide_3|215_aa MGKDLLDPVFSFPAAAVATGKSSSHLQGLALQPSAVLFLPPIKTVLSESPGIFGIFGGTL EYLEEVMDKDCLDIKAVTLFSFPRKTHENVQHGLRETDTAFALEELDPAEKAKALPTDGM IQNGRQDTNVHDPGGGCFLNLQTVGDLSPMSPKAFLEKPPRPVLTKAHSFRGYLLGTYYE FCPMWGAGITVVTTTDEVPVLRKLTALEGFTYLEE >gi568815597r:94073872_94331611|GENSCAN_predicted_CDS_3|648_bp atgggaaaagacctcctggacccagtgttctcgttccctgcagctgctgtggccactgga aagagtagctcccatctccagggtcttgctcttcaaccctctgctgtcttgtttctgcct cccattaaaactgttctttcagagtcccctggaatatttggaatatttggaggaactctg gaatatttggaggaggtgatggacaaagattgtttagacataaaagcagtaactctcttc tcattcccaagaaaaactcatgaaaatgttcaacatgggcttcgtgagacagatacagct tttgctctggaagaactggaccctgcggaaaaggcaaaagccttacccactgatggaatg atccagaatggaagacaagacaccaatgtacatgaccctgggggaggctgtttcttaaat ctacagactgttggtgacctgagccccatgtcaccaaaggctttcctggagaagcctcct agaccagtcttgacaaaggctcactcattccgtggatatttattgggcacctattatgag ttctgccccatgtggggtgctggaatcacagtagtgacaacgacagatgaggttcctgtc ctcaggaagcttactgcccttgagggcttcacttacttggaggagtga >gi568815597r:94073872_94331611|GENSCAN_predicted_peptide_4|97_aa MGKKQSRKTGNSKNQSPSPPPKECSSSPAMTQSWTENNFDELREEGFRRSNYSELKEEVR THDKEVKNLEKKLVEWLTRITNAEKSLKDLMELKATA >gi568815597r:94073872_94331611|GENSCAN_predicted_CDS_4|294_bp atggggaaaaaacagagcagaaaaactggaaactctaaaaatcagagcccctctcctcct ccaaaggaatgcagctcctcaccagcaatgacacaaagctggacggagaataactttgac gaactgagagaagaaggcttcagacgatcaaactactccgagctaaaggaggaagttcga acccatgacaaagaagttaaaaaccttgaaaaaaaattagttgaatggctaactagaata accaatgctgagaagtccttaaaggacctgatggagctgaaagccactgcatga >gi568815597r:94073872_94331611|GENSCAN_predicted_peptide_5|636_aa MEDFNTPLSTLDRLRQKVNKDIQELNSPLHQADLIDIYRTFHPKSMEYTFFSAPHHTYSK IDHIVGSKALLSKCKRTEIITNCLSDHSALKLELRIKKLNQNHSTTWKLNNLLLNDYWVH NEMKAEIKMFFETNKNKDTTYQNLWDTFKAVCRGKFIALNAHKKKQERSKTDTLTSQLKE LGKQEQTHSKASRRQEITKIRKELKEMETQKTLQKIIESRSWFFEKINKIDRPLARLIRK KREKNQIDKIKNDKGDITTNPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQ EEVESLNRPITGSEIEAIINSLPTKKSPGPDGFTAEFYQRAKDKNHMIISIDAEKAFDKI QQPFMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAFQGTRQGCPLSPLLFNIV LEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLIGNFSKVSGY KISVQRSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNE IKEDTNKWKNIPRSWAGRINIMKTAILPEVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQ KRARIAKSILSQKNKAGGIMPADFKLYYKATVTKTA >gi568815597r:94073872_94331611|GENSCAN_predicted_CDS_5|1911_bp atggaagactttaacaccccactgtcaacattagacagattgagacagaaagttaacaag gatatccaggaactgaactcacctctgcaccaagcggacttaatagacatctatagaact ttccaccccaaatcaatggaatatacattcttttcagcaccacaccacacctactccaaa attgaccacatagttggaagtaaagcactcctcagcaaatgtaaaagaacagaaattata acaaactgtctctcagaccacagtgcactcaaactagaactcaggattaagaaactcaat caaaaccactcaactacatggaaactgaataacctgctcctgaatgactactgggtacat aacgaaatgaaggcagaaataaagatgttctttgaaaccaacaagaacaaagacacaaca taccagaatctctgggacacattcaaagcagtgtgtagagggaaatttatagcactaaat gcccacaagaaaaagcaggaaagatctaaaactgacaccctaacatcacaattaaaagaa ctagggaagcaagagcaaacgcattcaaaagctagcagaaggcaagaaataactaagatc agaaaagaactgaaggaaatggagacacaaaaaacccttcaaaaaatcattgaatccagg agctggttttttgaaaagatcaacaaaattgatagaccgctagcaagactaataaggaag aaaagagagaagaatcaaatagacaaaataaaaaatgataaaggtgatatcaccaccaat cccacagaaatacaaactaccatcagagaatactataaacacctttacgcaaataaacta gaaaatctagaagaaatggataaattcctcgacacatacaccctcccaagactaaaccag gaagaagttgaatccctgaatagaccaataacaggatctgaaattgaggcaataattaat agcttaccaaccaaaaaaagtccaggaccagatggattcacagccgaattctaccagaga gccaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaatt caacagcccttcatgctaaaaactctcaataaattaggtattgatgggacgtatctcaaa ataataagagctatctatgacaaacccacagccaatatcatactgaatggacaaaaactg gaagcattccagggcacaagacagggatgccctctctcaccactgctattcaacatagtg ttggaagttctggccagggcaatcaggcaggagaaggaaataaagggtattcagttagga aaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatacctagaaaacccc atcgtctcagcccaaaatctccttaagctgataggcaacttcagcaaagtctcaggatac aaaatcagtgtgcaaagatcacaagcattcttatacacaaataacagacaaacagagagc caaatcatgagtgaactcccattcacaattgcttcaaaaagaataaaatacctaggaatc caacttacaagggacgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaa ataaaagaggatacaaacaaatggaagaacattccacgctcatgggcaggaagaatcaat atcatgaaaacggccatactgcccgaggtaatttatagattcaatgccatccccatcaag ttaccaatgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaa aaaagagcccgcattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcatg ccagctgacttcaaactatactacaaggctacagtaaccaaaacagcatga >gi568815597r:94073872_94331611|GENSCAN_predicted_peptide_6|1562_aa MIAHKQKKTKKKRAWASGQLSTDITTSEMGLKSLSSNSIFDPDYIKELVNDIRKFSHMLL YLKEAIFSDCFKEVIHIRLEELLRVLKSIMNKHQNLNSVDLQNAAEMLTAKVKAVNFTEV NEENKNDLFQEVFSSIETLAFTFGNILTNFLMGDVGNDSLLRLPVSRETKSFENVSVESV DSSSEKGNFSPLELDNVLLKNTDSIELALSYAKTWSKYTKNIVSWVEKKLNLELESTRNM VKLAEATRTNIGIQLEAENALKKAKLLCMQRQDEYEKAKSSMFRAEEEHLSSSGGLAKNL NKQLEKKRRLEEEALQKVEEANELYKVCVTNVEERRNDLENTKREILAQLRTLVFQCDLT LKAVTVNLFHMQHLQAASLADSLQSLCDSAKLYDPGQEYSEFVKATNSTEEEKVDGNVNK HLNSSQPSGFGPANSLEDVVRLPDSSNKIEEDRCSNSADITGPSFIRSWTFGMFSDSEST GGSSESRSLDSESISPGDFHRKLPRTPSSGTMSSADDLDEREPPSPSETGPNSLGTFKKT LMSKAALTHKFRKLRSPTKCRDCEGIVVFQGVECEECLLVCHRKCLENLVIICGHQKLPG KIHLFGAEFTQVAKKEPDGIPFILKICASEIENRALCLQLPEPFILFRLYKEFIDLAKEI QHVNEEQETKKNSLEDKKWPNMCIEINRILLKSKDLLRQLPASNFNSLHFLIVHLKRVVD HAEENKMNSKNLGVIFGPSLIRPRPTTAPITISSLAEYSNQARLVEFLITYSQKIFDGSL QPQDVMCSIGVVDQGCFPKPLLSPEERDIERSMKSLFFSSKEDIHTSESESKIFERATSF EESERKQNALGKCDACLSDKAQLLLDQEAESASQKIEDGKTPKPLSLKSDRSTNNVERHT PRTKIRPVSLPVDRLLLASPPNERNGRNMGNVNLDKFCKNPAFEGVNRKDAATTVCSKFN GFDQQTLQKIQDKQYEQNSLTAKTTMIMPSALQEKGVTTSLQISGDHSINATQPSKPYAE PVRSVREASERRSSDSYPLAPVRAPRTLQPQHWTTFYKPHAPIISIRGNEEKPASPSAAV PPGTDHDPHGLVVKSMPDPDKASACPGQATGQPKEDSEELGLPDVNPMCQRPRLKRMQQF EDLEACRHPIVGPCVCKELKLWKNKHKLLSCEWLTCNKRCMHQVFTVKVRASTGKEWDPE TWNGDMWDCQIPLMKLGTLSCGIDLDLWPFTRVTVHWGKGKDQTFRGLLDTGSELTLIPG DPKTSCGPPVKVGAYGDQRYINSPAMCHNLIHRDLDFFSLPQDITLAHYIDDIMLIGSSG QEVANTLDLLWVPEQKKDLQQVQAAVEAAVPLGPYDPADPLVLEVSVADRDASWSLWQAA IATIHGSRNQGVEVQVAPLTITPSDPLAKFLLPLPATLRSAGLEVLVPEGGMLPPGDTTI SLNWKLRLPTGHFGLLLPLSQQAKKGVAVLAGVIDLDYQDEISLLLHNRSKKEYAWNTGD PLGRPLVLPCSVIKVNRKLQQPNPGRTTNGSDPSGMTVWVTPPGKKHNLLRCLLKAKRIQ NG >gi568815597r:94073872_94331611|GENSCAN_predicted_CDS_6|4689_bp atgattgctcacaaacagaaaaagacaaagaaaaaacgtgcttgggcatcaggtcaactc tctactgatattacaacttctgaaatggggctcaagtccttaagttccaactctattttt gatccggattacatcaaggagttggtgaatgatatcaggaagttctcccacatgttacta tatttgaaagaagccatattttcagactgttttaaagaagttattcatatacgtctagag gaactgctccgtgttttaaagtctataatgaataaacatcagaacctcaattctgttgat cttcaaaatgctgcagaaatgctcactgcaaaagtgaaagctgtgaacttcacagaagtt aatgaagaaaacaaaaacgatctcttccaggaagtgttttcttctattgaaactttggca tttacctttggaaatatccttacaaacttccttatgggagatgtaggcaatgattcatta ttgcgactgcctgtttctcgagaaactaagtcgtttgaaaatgtttctgtggaatcagtg gactcatccagtgaaaaaggaaatttttcccctttagaactagacaacgtgctgttaaag aacactgactctatcgagctggctttgtcatatgctaaaacttggtcaaaatatactaag aacatagtttcatgggttgaaaaaaagcttaacttggaattggagtccactagaaatatg gtcaagttggcagaggcaactagaactaacattggaattcagcttgaagcagagaatgct ctcaaaaaggcaaaattattatgcatgcaacgtcaagatgaatatgagaaagcaaagtct tccatgtttcgtgcagaagaggagcatctgtcttcaagtggcggattagcaaaaaatctc aacaagcaactagaaaaaaagcgaaggttggaagaggaggctctccaaaaagtagaagaa gcaaatgaactttacaaagtttgtgtgacaaatgttgaagaaagaagaaatgatctagaa aataccaaaagagaaattttagcacaactccggacacttgttttccagtgtgatcttacc cttaaagctgtaacagttaacctcttccacatgcagcatctgcaggctgcttcccttgca gacagtttacagtctctctgtgatagtgccaaactctatgacccaggccaagagtacagt gaatttgtcaaggccacaaattcaactgaagaagaaaaagttgatggaaatgtaaataaa catttaaatagttcccaaccttcaggatttggacctgccaactctttagaggatgttgta cgccttcctgacagttctaataaaattgaagaggacagatgctctaacagtgcagatata acaggtccttcctttataagatcatggacatttgggatgtttagtgattctgagagcact ggagggagcagcgaatctagatctctggattcagaatctataagtccaggagactttcat cgaaaacttccacgaacaccatccagtggaactatgtcctctgcagatgatctagatgaa agagagccaccttccccttcagaaactggacccaattcccttggaacatttaagaaaaca ttgatgtcaaaggcagctctcacacacaagtttcgcaaattgagatcccccacgaaatgt agggattgtgaaggcattgtagtgttccaaggtgttgaatgtgaagagtgtctccttgtt tgtcatcgaaagtgtttggaaaatttagtcattatttgtggtcatcagaaacttccagga aaaatacacttatttggagcagaattcacacaagttgcaaaaaaggaaccagatggtatc ccttttatactcaaaatatgtgcctcagagattgaaaatagagctttgtgtctacagctc ccagaaccatttattttatttcgattgtacaaggaatttatagaccttgcaaaagagatc caacatgtaaatgaagaacaagagacaaaaaagaatagtcttgaagacaaaaaatggcca aatatgtgtatagaaataaaccgaattcttctaaaaagcaaagaccttctaagacaattg ccagcatcaaattttaacagtcttcatttccttatagtacatctaaagcgggtagtagat catgcagaagaaaacaagatgaactccaaaaacttgggggtgatatttggaccaagtctc attaggccaaggcccacaactgctcctatcaccatctcctcccttgcagagtattcaaat caagcacgcttggtagagtttctcattacttactcacagaagatcttcgatgggtcccta caaccacaagatgttatgtgtagcataggtgttgttgatcaaggctgttttccaaagcct ctgttatcaccagaagaaagagacattgaacgttccatgaagtcactatttttttcttca aaggaagatatccatacttcagagagtgaaagcaaaatttttgaacgagctacatcattt gaggaatcagaacgcaagcaaaatgcgttaggaaaatgtgatgcatgtctcagtgacaaa gcacagttgcttctagaccaagaggctgaatcagcatcccaaaagatagaagatggtaaa acccctaagccactttctctgaaatctgataggtcaacaaacaatgtggagaggcatact ccaaggaccaagattagacctgtaagtttgcctgtagatagactacttcttgcaagtcct cctaatgagagaaatggcagaaatatgggaaatgtaaatttagacaagttttgcaagaat cctgcctttgaaggagttaatagaaaagacgctgctactactgtttgttccaaatttaat ggctttgaccagcaaactctacagaaaattcaggacaaacagtatgaacaaaacagccta actgccaagactacaatgatcatgcccagtgcactccaggaaaaaggagtgacaacaagc ctccagattagtggggaccattctatcaatgccactcaacccagtaagccatatgcagag ccagtcaggtcagtgagagaggcatctgagagacggtcttcagattcctaccctctcgct cctgtcagagcacccagaacactgcagcctcaacattggacaacattttataaaccacat gctcccatcatcagtatcagggggaatgaggagaagccagcttcaccctcagcagcagtg cctcctggcacagatcacgatccccacggtctcgtggtgaagtcaatgccagacccagac aaagcatcagcttgtcctgggcaagcaactggtcaacctaaagaagactctgaggagctt ggcttgcctgatgtgaatccaatgtgtcagagaccaaggctaaaacgaatgcaacagttt gaagacctcgaagcctgcagacatcctattgtgggaccttgtgtttgtaaagagctgaaa ttgtggaaaaacaaacacaagctcttatcatgtgagtggctgacttgtaacaaaagatgc atgcaccaggtgtttactgttaaagtgagggcatctactggaaaagaatgggaccctgaa acttggaatggggacatgtgggactgccagatccccctgatgaaactagggacactgagt tgtggaatagatttagatctctggccttttaccagggtaacagtgcattggggaaaagga aaggatcagacatttcggggactactggacactggctctgagctgacattgattccaggg gacccaaaaacgtcatgtggtcctccagttaaagtaggggcttatggagatcagaggtat atcaactctccggctatgtgtcataatcttattcacagagacctagatttcttttcactt ccgcaagatatcacactggcccattacattgatgacattatgctgattggatccagtggg caagaagtagcaaacacactggatttattgtgggttccagaacagaagaaggatctgcaa caggtccaggctgctgtggaagctgctgtcccacttgggccatatgatccagcagatcca ctggtacttgaggtgtcagtggcagatagggatgcttcttggagcctttggcaggctgcc atagccacgattcacgggtccaggaatcaaggggtggaagtgcaagtggcaccactcacc atcacccctagtgacccactagcaaaattcttgcttcctcttcccgcgacattacgttct gctggcctagaggtcttagttccagagggaggaatgctgccaccaggagacacaacaatt tcattaaactggaagttaagattgccaactggacactttgggctcctcctacctttaagt caacaggctaagaaaggagttgcagtgttggctggggtgattgacctggactatcaagat gaaatcagtctactactccacaacagaagtaagaaagagtatgcatggaacacaggagat ccattagggcgtccattagtattaccatgctctgtgattaaggtcaataggaaactacag cagcccaatccaggcaggactacaaatggctcagacccctcaggaatgacggtttgggtc actccaccaggaaaaaaacacaacctgctgaggtgcttgctgaaagcaaagagaatacag aatgggtag >gi568815597r:94073872_94331611|GENSCAN_predicted_peptide_7|210_aa MNKFLMEREGVWMESTRLALEPWLGKGGRARPGSNEWGERRGGVGWEGTTETLQKGHTPL PDPGRQLAGARAGDPSPEPPPPRPKRAPPPHTHHGAPSRVPDADGRPTPTAAATAEGWSS PPPSPTARRRPAKWQPQPQTPPPPPPQRPPPLQLPPRLRRPPAPPLGAQPIGLARAPGGR GRRPRQASGTHTHARTHARTHAHAHIKAFL >gi568815597r:94073872_94331611|GENSCAN_predicted_CDS_7|633_bp atgaacaagttcctgatggagagagagggtgtatggatggagagcacacgacttgcccta gagccatggctgggaaaaggtggaagggcgagaccaggcagtaatgagtggggagagaga aggggaggagtagggtgggagggaacgacagagacgctccagaaggggcacacgcccctg ccggaccctggacggcaactagccggcgcccgcgcgggcgaccccagcccggagccacca ccgccgcggccgaagcgagcgccacctcctcacactcaccacggcgctccatcccgcgtc ccggacgcggacggccgccccacacctacggccgccgccaccgccgagggctggagctcg ccgcccccatcccccacggcccgcagacgcccggccaagtggcagccgcagccacagaca ccaccaccaccaccaccacagcggccgccgcctttgcagctaccgccacggctgcgccgg ccgcctgccccgccccttggagcccagcccattggcctggcccgcgctcccggggggcgg ggtcggcgaccgcggcaggcatctggaacacacacgcacgcacgcacgcacgcacgcacg cacgcacatgcacacattaaggcttttctctaa >gi568815597r:94073872_94331611|GENSCAN_predicted_peptide_8|341_aa MGKDFMTKTPKAMATKAKIDKWDLIKLKSFCTAKETIIRVDRQPTEWEKIFAIYPSDKGL ISRIYKELRQIYKKKTNNPIKKWAKDTNRHFSKKHIYAANIHMKKCSSSLVIREMQIKTT MRYYLTPVRNAIIKTSGNNRCERGCGEIEMLLHCWDMDEAENHHSQQTITRTENQTSHVL THKWELNNENTWTQAEEELLEDRKMVCSIVSPTQPGTAFGRSPSPHKFKEQLYHGSHSPV FLKRILEKGGMSLERICWKKDGLRLDCPYSHIRPHILSRKLKLEGLKYPIHIASGAGPEP RRKQAHQECEVMPARAAYRFSSQGSQTRSAMPRQKGSQTGH >gi568815597r:94073872_94331611|GENSCAN_predicted_CDS_8|1026_bp atgggcaaagacttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgac aaatgggatctaattaaactaaagagcttctgcactgcaaaagaaactatcatcagagtg gacaggcaacctacagaatgggagaaaatttttgcaatctacccatctgacaaagggcta atatccagaatctacaaagaacttagacaaatttacaagaaaaaaacaaacaaccccatc aaaaagtgggcaaaggatacgaacagacacttctcaaaaaaacacatttatgcagccaac atacatatgaaaaaatgctcatcatcactggtcatcagagaaatgcaaatcaaaaccaca atgagatactatctcacaccagttagaaatgcgatcattaaaacgtcaggaaacaacaga tgcgagagaggatgtggagaaatagaaatgcttttacactgttgggacatggatgaagct gagaaccatcattctcagcaaactatcacaaggacagaaaaccaaacatcgcatgttctc acgcataagtgggagttgaacaatgagaacacatggacacaggctgaagaggagctgttg gaggacaggaagatggtctgcagcattgtgtctcccacccaacctggcacagcctttggc agatccccttctccacacaagttcaaggagcagctctaccatggttcccatagtcctgtc ttcctgaagagaatcttagaaaaaggaggtatgtccctggaaaggatttgctggaaaaag gatggtctgagactggactgcccatactcacatattaggccccacattctcagtagaaaa ctgaaattggaggggttaaaatatcctatccacatagcttctggagcagggccagagcca aggaggaagcaggcccatcaggaatgcgaggtcatgcctgccagggcagcttatagattc agctcacagggaagccagacccggtcagcaatgccaaggcagaaggggagccaaactggg cactga >gi568815597r:94073872_94331611|GENSCAN_predicted_peptide_9|115_aa MERNQSRKAENSKNQNASSPPKEHNSLPAREQKWTENEFDKLTEVGFRRSVITNFSELKD HILTHRKEAKNLEKRLDEWLTRITSAEKSLNDVMELKTTVRELREAYTGFSSQFD >gi568815597r:94073872_94331611|GENSCAN_predicted_CDS_9|348_bp atggagagaaatcagagcagaaaggctgaaaattccaaaaaccagaatgcctcttctcct ccaaaggaacacaactccttgccagcaagggaacaaaaatggacagagaatgagtttgac aagttgacggaagtaggcttcagaaggtcagtaataacaaacttctccgagctaaaggac cacattctaactcatcgcaaggaagctaaaaaccttgaaaaaaggttagacgaatggcta actagaataaccagtgcagagaagagcttaaatgacgtgatggagctgaaaaccacagta cgagaacttcgtgaagcatacacaggcttcagtagccaatttgattaa >gi568815597r:94073872_94331611|GENSCAN_predicted_peptide_10|423_aa MMKVKARVNGFGHTGHLVTTAAFNSGKVDIITINDLNYMVYHGITWFQYDSTHSKFHCTI KAENGNPITIFQEGDPTKITQGDAGADYVMESTGIFTTMEKAGAHLEGGAKRLPNLLAKV IHNNSGIMKRLSITVHAITATQKTLMAPLGNCGVMAMGLSITSSLHVSTGAAKAVGKVIP KPNRKLSGVAFHVPTANVSVMDLTCSLEKPAKYDDIKEVVKRHRRGPSGASWATLSTRLS PPTLTVTPTLPPSTLGLALPSTASLSGSFPAPAQRSSVDHVRDLQSCQGKNQSHGAGNGS ISQYSPGCSCVPILSGHGVCSDWMCKGALSLGEFHSRFNCSLNLLNFCIRVPISSLTVLQ VKGQKGIFPVNILWATPGKAPSAVSENTVPHFFPKLKPLWQQRTRLLWIQQLTVDLQQVG APS >gi568815597r:94073872_94331611|GENSCAN_predicted_CDS_10|1269_bp atgatgaaagtgaaggccagagtaaatggatttggccatactgggcacctggtcaccacg gctgcttttaactctggcaaagtggatatcatcaccatcaatgacctcaactacatggtc taccatggaatcacatggttccagtatgattccacccacagcaagttccactgcaccatc aaggctgaaaatggaaatcccatcaccatcttccaggagggagatcccaccaaaatcaca cagggtgacgctggtgctgattatgttatggagtccactggcatcttcactaccatggag aaggctggggctcacttagagggtggagccaaaaggctgcctaacctcctggccaaggtc atccataataactctggcatcatgaagcgactcagcatcacagtccatgccatcactgct acccagaagactctcatggcccctctgggaaactgtggtgtgatggccatggggctctcc attacatcatccctgcatgtgtctactggtgctgccaaggctgtgggcaaggtcatcccc aagccgaacaggaagctcagtggtgtggccttccatgtacccactgccaatgtgtcagtc atggacctgacctgctctctggagaaaccagccaaatatgatgacatcaaggaggtggtg aagcggcatcggagggggccatcgggggcatcgtgggccacactgagcaccaggctgtct cctccaactttaacagtgacacccactcttccaccttcaacactggggctggcattgccc tcaacggcctctttgtcaggctcgtttccggcccctgctcaaaggtcctctgtggaccat gtaagggatctgcagtcatgccagggaaagaatcaaagccacggtgctggaaatggcagc atcagccaatattcaccaggatgctcttgtgtgccaatcctgagtggtcatggggtttgt tctgactggatgtgcaagggtgctttgtctcttggggagttccactcccgtttcaactgt tctctgaatctgctcaacttctgcattagagtccctatttctagtctgacagtcctacaa gtcaaagggcagaaaggaatctttcctgtaaacatcctctgggccactcctggaaaagcc ccctctgctgtttctgagaacacagtgccccatttctttccaaagctaaaacctctctgg caacagagaacccgactgttatggattcagcagttgacggttgacctccagcaggttggt gctccatca