GENSCAN 1.0 Date run: 8-Nov-116 Time: 05:48:44 Sequence gi568815582r:84122220_84337485 : 215266 bp : 49.47% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 PlyA - 91 86 6 1.05 1.07 Term - 619 568 52 0 1 108 37 36 0.216 -2.60 1.06 Intr - 4163 4038 126 1 0 75 79 86 0.506 6.19 1.05 Intr - 7556 7432 125 2 2 107 47 27 0.555 -0.12 1.04 Intr - 8212 7767 446 2 2 93 77 477 0.786 40.22 1.03 Intr - 9108 8883 226 0 1 133 33 107 0.930 7.26 1.02 Intr - 19461 19294 168 0 0 35 69 153 0.603 8.24 1.01 Init - 20827 20711 117 1 0 82 98 7 0.494 1.35 1.00 Prom - 21276 21237 40 -11.23 2.00 Prom + 21414 21453 40 -8.46 2.01 Init + 22167 22459 293 2 2 58 80 188 0.879 9.63 2.02 Intr + 23191 23345 155 1 2 23 50 154 0.876 4.82 2.03 Intr + 26788 26923 136 2 1 87 91 52 0.914 5.03 2.04 Intr + 28032 28123 92 0 2 75 116 19 0.911 3.04 2.05 Intr + 32358 32579 222 1 0 71 56 316 0.853 24.80 2.06 Intr + 33376 33530 155 2 2 73 26 155 0.831 7.59 2.07 Intr + 37456 37577 122 0 2 106 110 20 0.993 5.29 2.08 Intr + 43564 43730 167 1 2 92 69 135 0.998 11.60 2.09 Intr + 47640 48137 498 1 0 71 90 280 0.820 19.46 2.10 Intr + 49469 49602 134 0 2 68 77 39 0.869 1.16 2.11 Intr + 52450 52503 54 0 0 104 42 91 0.722 5.28 2.12 Intr + 53231 53458 228 1 0 61 81 128 0.913 7.47 2.13 Intr + 53714 54080 367 1 1 72 94 280 0.968 21.72 2.14 Term + 56018 56112 95 0 2 62 39 62 0.457 -3.31 2.15 PlyA + 56140 56145 6 -0.45 3.16 PlyA - 56557 56552 6 -3.64 3.15 Term - 57599 56722 878 0 2 90 55 566 0.689 46.24 3.14 Intr - 57864 57727 138 1 0 109 41 122 0.725 10.04 3.13 Intr - 58125 57951 175 0 1 54 94 155 0.808 12.21 3.12 Intr - 58776 58645 132 0 0 89 75 29 0.823 2.54 3.11 Intr - 59244 59109 136 2 1 88 96 159 0.963 17.27 3.10 Intr - 59644 59527 118 2 1 110 94 137 0.995 16.02 3.09 Intr - 59812 59723 90 2 0 60 78 51 0.528 1.27 3.08 Intr - 60221 59938 284 1 2 104 40 260 0.680 19.86 3.07 Intr - 60713 60643 71 2 2 57 35 0 0.411 -10.42 3.06 Intr - 61114 61025 90 1 0 128 84 87 0.977 12.49 3.05 Intr - 61288 61191 98 2 2 96 109 149 0.587 17.53 3.04 Intr - 61559 61478 82 2 1 106 99 27 0.987 4.81 3.03 Intr - 62770 62632 139 0 1 88 78 12 0.379 0.67 3.02 Intr - 64742 64682 61 0 1 45 113 82 0.550 4.19 3.01 Init - 66067 65959 109 1 1 79 98 -2 0.492 0.28 3.00 Prom - 66582 66543 40 -8.36 4.00 Prom + 66817 66856 40 -8.66 4.01 Init + 69282 69429 148 2 1 56 94 197 0.471 15.67 4.02 Intr + 72223 72363 141 2 0 104 95 155 0.999 18.12 4.03 Intr + 72714 72761 48 1 0 93 82 33 0.783 1.85 4.04 Intr + 72850 72939 90 2 0 66 67 110 0.500 6.67 4.05 Intr + 73047 73227 181 1 1 32 78 275 0.022 19.83 4.06 Intr + 73311 73478 168 0 0 98 15 258 0.999 18.56 4.07 Intr + 73596 73825 230 0 2 83 80 284 0.735 24.61 4.08 Intr + 73908 74151 244 1 1 114 52 213 0.949 16.86 4.09 Intr + 74428 74548 121 1 1 81 81 49 0.570 4.00 4.10 Intr + 74651 74734 84 1 0 116 56 34 0.780 3.02 4.11 Intr + 76055 76155 101 1 2 70 44 73 0.455 0.01 4.12 Intr + 76530 76558 29 0 2 119 59 25 0.233 0.46 4.13 Term + 81627 81802 176 1 2 37 43 146 0.102 2.92 4.14 PlyA + 85663 85668 6 1.05 5.03 PlyA - 86779 86774 6 1.05 5.02 Term - 92511 92369 143 1 2 96 49 105 0.969 5.49 5.01 Init - 96270 96168 103 0 1 76 78 135 0.999 11.70 5.00 Prom - 98013 97974 40 -4.56 6.08 PlyA - 98431 98426 6 -4.04 6.07 Term - 100801 99998 804 1 0 150 35 1346 0.999 128.73 6.06 Intr - 108758 108531 228 2 0 102 80 60 0.632 4.67 6.05 Intr - 115204 114511 694 0 1 70 99 1156 0.121 106.41 6.04 Intr - 125858 125803 56 2 2 76 93 55 0.664 2.58 6.03 Intr - 126513 126460 54 0 0 120 97 18 0.955 5.08 6.02 Intr - 138713 138491 223 1 1 105 72 58 0.034 4.03 6.01 Init - 140376 140294 83 0 2 73 81 47 0.096 2.98 6.00 Prom - 141169 141130 40 -2.06 7.00 Prom + 150305 150344 40 -2.06 7.01 Init + 158037 158203 167 2 2 89 43 97 0.782 4.42 7.02 Intr + 158598 158730 133 0 1 118 80 69 0.735 9.75 7.03 Intr + 170280 170331 52 2 1 104 119 -6 0.592 2.58 7.04 Intr + 172690 172896 207 2 0 92 86 152 0.995 14.45 7.05 Intr + 177323 177423 101 0 2 32 39 79 0.188 -2.77 7.06 Intr + 179062 179262 201 0 0 49 99 38 0.318 0.48 7.07 Intr + 184442 184649 208 1 1 72 110 153 0.729 14.55 7.08 Intr + 186396 186760 365 1 2 18 -53 441 0.063 17.90 7.09 Intr + 190742 190934 193 1 1 123 96 248 0.973 28.17 7.10 Intr + 196059 196136 78 1 0 61 77 93 0.820 5.02 7.11 Intr + 196917 197102 186 1 0 44 68 118 0.896 5.06 7.12 Intr + 197212 197352 141 2 0 57 109 186 0.545 17.92 7.13 Intr + 202200 202241 42 1 0 83 98 19 0.052 0.61 7.14 Term + 213135 213421 287 1 2 56 48 157 0.339 4.07 7.15 PlyA + 213515 213520 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 73077 73227 151 1 1 86 78 273 0.978 25.32 S.002 Init - 129917 129864 54 2 0 82 73 57 0.986 5.17 S.003 Term - 138713 138567 147 2 0 105 50 119 0.837 7.70 S.004 Term - 149194 149049 146 2 2 84 39 166 0.824 9.37 S.005 Term + 197413 197474 62 2 2 60 54 84 0.823 0.17 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582r:84122220_84337485|GENSCAN_predicted_peptide_1|419_aa MMVKSVGSWMRIQVSNSMLLLFRLSFIVRLVLYQKLAVKKVKLWTLPASDIEEWHEPAAG GSVNGFCRFAKAFGSGMHVDQQLNHGQIPLRTLAQVAMAAVDSFYLLYREIARSCNCYME ALALVGAWYTARKSITVICDFYSLIRLHFIPRLGSRADLIKQYGRWAVVSGATDGIGKAY AEELASRGLNIILISRNEEKLQVVAKDIADTYKVETDIIVADFSSGREIYLPIREALKDK DVGILVNNVGVFYPYPQYFTQLSEDKLWDIINVNIAAASLMVHVVLPGMVERKKGAIVTI SSGSCCKPTPQLAAFSASKAYLDHFSRALQYEYASKGIFVQSLIPFYVATSMTAPSNFLH RHCPMSAASTKALSETLKQSSISRTPFPGPLSTPIIGQDTIPRTWANMLIGANVVPSCP >gi568815582r:84122220_84337485|GENSCAN_predicted_CDS_1|1260_bp atgatggtgaagagtgttggatcttggatgagaatccaggtttctaattccatgttgctg ctctttaggctcagcttcattgtgcgtttagtcctataccaaaaattggcagtgaagaag gtgaagctttggacgctaccagccagtgacattgaggagtggcacgaacctgctgctggc gggagtgtgaatgggttctgccgctttgcaaaagcatttggcagtggaatgcatgtggac cagcagttaaaccatgggcaaatccctttgcgaactcttgcacaggttgccatggctgct gttgacagtttctacctcttgtacagggaaatcgccaggtcttgcaattgctatatggaa gctctagctttggttggagcctggtatacggccagaaaaagcatcactgtcatctgtgac ttttacagcctgatcaggctgcattttatcccccgcctggggagcagagcagacttgatc aagcagtatggaagatgggccgttgtcagcggtgcaacagatgggattggaaaagcctac gctgaagagttagcaagccgaggtctcaatataatcctgattagtcggaacgaggagaag ttgcaggttgttgctaaagacatagccgacacgtacaaagtggaaactgatattatagtt gcggacttcagcagcggtcgtgagatctaccttccaattcgagaagccctgaaggacaaa gacgttggcatcttggtaaataacgtgggtgtgttttatccctacccgcagtatttcact cagctgtccgaggacaagctctgggacatcataaatgtgaacattgccgccgctagtttg atggtccatgttgtgttaccgggaatggtggagagaaagaaaggtgccatcgtcacgatc tcttctggctcctgctgcaaacccactcctcagctggctgcattttctgcttctaaggct tatttagaccacttcagcagagccttgcaatatgaatatgcctctaaaggaatctttgta cagagtctaatccctttctatgtagccaccagcatgacagcacccagcaactttctgcac agacattgccctatgtctgctgctagcacaaaggccctgagtgagaccctgaagcagagc agcatcagcaggactcccttccctggtcccctgagcacccccatcataggtcaagatacc atcccaaggacatgggcaaacatgctgattggtgcaaatgtggtgccgagctgtccatag >gi568815582r:84122220_84337485|GENSCAN_predicted_peptide_2|905_aa MLGSPRGAAAIRFLGLRLGGTFRTGTCLVLLPSPSPPAPGKAQSTRANSEILAEPTTAAY FHFRKTRQQQLKPNRSEMRIKKHRQRRRLLGPLGGGAGCGAFGVAEVNMHPEPSEPATGG AAELDCAQEPGVEESAGDHGSAGRGGCKEEINDPKEICVGSSDTSYHSQQKQSGDNGSGG HFAHPREDREDRGPRMTKSSLQKLCKQHKLYITPALNDTLYLHFKGFDRIENLEEYTGLR CLWLQSNGIQKIENLEAQTELRCLFLQMNLLRKIENLEPLQKLDALNLSNNYIKTIENLF LNTLQMAHNHLETVEDIQHLQECLRLCVLDLSHNKLSDPEILSILESMPDLRVLNLMGNP VIRQIPNYRRTVTVRLKHLTYLDDRPVFPKDRACAEAWARGGYAAEKEERQQWESRERKK ITDSIEALAMIKQRAEERKRQRESQERGEMTSSDDGENVPASAEGKEEPPGDRETRQKME LFVKESFEAKDELCPEKPSGEEPPVEAKREDGGPEPEGTLPAETLLLSSPVEVKGEDGDG EPEGTLPAEAPPPPPPVEVKGEDGDQEPEGTLPAETLLLSPPVKVKGEDGDREPEGTLPA EAPPPPPLGAAREVGCHFQDSCGCCFDSWPVLFISWDSPPDYRKGQPFRSFSSRQKPVDL PDLEDDDETGKSLEDQRPLLALPGAHTCQTRTLREKEASAIISEHELSSKENRHSYCGGP AVQGAKSQRVCEGNGVQGHAKQSPGLRGAPSQNMCFPKIEVISSLSDDSDPELDYTSLPV LENLPTDTLSNIFAVSKDTSKAARVPFTDIFKKEAKRDLEIRKQDTKSPRPLIQELSDED PSGQLLMPPTCQRDAAPLTSSGDRDSDFLAASSPEGTIDSPQVPDPCPKGEGTSAIISLH ELSVR >gi568815582r:84122220_84337485|GENSCAN_predicted_CDS_2|2718_bp atgttgggatcgccgaggggcgctgcagcgatccggtttctggggctgcgtctgggtggc acgttcagaacggggacctgcctggttcttcttcccagcccttcccctccagcacccggc aaggcccagtcaacacgagcaaactcggaaatcctggctgaacccacgaccgctgcatat ttccattttagaaaaacacggcaacaacaactcaaaccaaatcgttcagaaatgagaata aagaagcaccgacaaaggaggcgattgctggggccgctcgggggaggagccggctgcggg gcgttcggtgtcgccgaagtaaacatgcaccctgagccctcggagcctgcgacaggtggt gcagcagagctggattgcgcgcaggagcccggcgtggaggagtctgcgggtgaccacggg agcgcaggccgagggggctgcaaggaagaaattaatgatcctaaggaaatatgtgtgggt tcttctgacacatcctaccacagccagcagaaacagagtggtgataatgggtcaggtggt cacttcgcacacccaagagaagacagggaagatcggggccccagaatgactaaaagttcc ctgcaaaaactctgcaagcagcacaagctttatattaccccagcattgaatgatacgctg tatttacactttaaaggttttgatcgcattgagaacctggaagagtacacagggctgcgc tgtctctggctgcagagcaatggaatacagaaaatcgaaaacctggaggcccaaactgag ttgcgttgcctcttcttgcaaatgaacttgctccgtaaaattgagaacctggaacctctg cagaaactggatgctcttaacctcagcaacaattacatcaagaccattgaaaacctcttc ctgaacacattgcagatggcccacaatcacctggagaccgtggaggacattcagcatcta caagagtgtttgaggctttgtgtccttgacctttcgcacaacaagctgagtgacccggag atcctgagcattctggaaagcatgcccgatttgcgtgtactgaatttgatgggaaacccg gttatcagacagattcctaattacagaaggacagtcactgtacgactaaagcacttaaca tacctggatgatagaccagtgtttccaaaggacagagcttgtgcggaggcctgggctagg ggagggtacgcagctgaaaaggaggagagacagcagtgggagagcagggagcggaagaag atcacagacagcattgaagccttggccatgatcaagcagcgggcagaggagaggaaaaga cagagagagagtcaagagagaggggagatgacatcttcagatgatggtgagaatgtgccc gccagtgcggaaggcaaggaggagcctcccggggacagagaaacaaggcagaagatggag ctatttgttaaggaaagctttgaggccaaggacgagctctgcccggaaaagccaagtgga gaggagccgcctgtggaggctaaaagagaggatggaggtccagagccagaggggaccctc ccagctgagaccctgctactgtcgtcacctgtggaggttaaaggagaggacggagatgga gagccagaggggaccctcccagctgaggccccaccacccccgccacctgtggaggttaaa ggagaggatggagatcaagagccagaggggaccctcccagctgagaccctgctactgtca ccgcctgtgaaggttaaaggagaggatggagatcgagagccagaggggaccctcccagct gaggccccaccaccaccgcccctgggagctgccagggaagtcggctgccatttccaagat tcctgcggctgctgctttgactcctggccggtgctcttcatctcatgggattcgcctcca gattacagaaaagggcagccgttccgttctttttccagcagacagaagcctgtggaccta cctgacttggaagatgatgatgaaacaggcaaatctctggaagaccagaggccattactg gcattaccaggagcccacacgtgccagacccgtaccctgagggagaaggaggcatcagcc atcatctctgagcacgagctcagcagcaaagaaaacagacacagttactgtggggggcct gctgtccaaggagctaagtcccagagagtctgtgaaggaaacggggttcaggggcatgcc aagcagagcccagggctcaggggtgcgccctctcagaatatgtgctttccgaagattgag gtcatctcgagcttgagtgatgacagtgaccctgaactggactacacgtcactccctgtg ctggaaaacctccccacagacactctgtcaaatatatttgcagtctctaaagacacctca aaggcggctcgggtgcccttcacagacatctttaaaaaagaagctaagagggacttggaa atccgaaaacaagacaccaagtccccaagacccctgatccaggagctcagcgacgaggac ccctctggccagctactgatgccccccacctgccaaagagatgctgcaccactcacttcc agtggagacagggacagcgacttccttgcagcctcttctccggagggcactatcgacagc ccacaggtgccagacccatgtcccaagggagaaggaacatcagccatcatctctctgcat gagctcagtgtcaggtag >gi568815582r:84122220_84337485|GENSCAN_predicted_peptide_3|866_aa MSQGMQAASRSQKRQDSGLFLRASQKEHSPAGTFSPGSGEHHAEAHFPDDHDEAGVTMDF PSSLRPALFLTGPLGLSDVPDLSFMCSWRDALTLPEAQPQNSENGALHVTKDLLWEPATP GPLPMLPPLIDPWDPGLTARDLLFRGGCRYRKRPRVVLDVTEQISRFLLDHGDVAFAPLG KLMLENFKLEGAGTSNSKSSQGKADLHLWSSGTAASRCPWAYLSNRQRRFSILGGPILGT SVASHLAELLHEELVLRWEQLLLDEACTGGALAWVPGRTPQFGQLVYPAGGAQDRLRILS CETGDSSGLASGDNPQFLGKPGRIQLQGPVRQVVTCTVQGETLLAVRSDYHCAVWKFGKQ WQPTLLQAMQVEKGATGISLRLRQIYRDPETLVFRDSSSWRWADFTAHPRVLTVGDRTGV KMLDTQMVTRGCCLPAALLGPDNHPANQTQPWMNLWPKHAVDPTVGANRQFSLYLVDERL PLVPMLKWNHGLPSPLLLARLLPPPRPSCVQPLLLGGQGGQLQLLHLAGEGASVPRLAGP PQSLPSRIDSLPAFPLLEPKIQWRLQERLKAPTIAPTPGLVLFQLSAAGDVFYQQLRPQV DSSLRRDAGPPGDTQPDCHAPTASWTSQDTAGCSQWLKALLKVPLAPPVWTAPTFTHRQM LGSTELRREEEEGQRLGVLRKAMARGQLLLQRDLGSLPAAEPPPAPESGLEDKLSERLGE AWAGRGAAWWERQQGRTSEPGRQTRRPKRRTQLSSSFSLSGHVDPSEDTSSPHSPEWPPA DALPLPPTTPPSQELTPDACAQGVPSEQRQMLRDYMAKLPPQRDTPGCATTPPHSQASSV RATRSQQHTPVLSSSQPLRKKPRMGF >gi568815582r:84122220_84337485|GENSCAN_predicted_CDS_3|2601_bp atgagccaaggaatgcaggcagcttccagaagccagaaaagacaagatagtggactcttc cttagagcctctcaaaaggaacacagccctgctggcacttttagtccagggtccggcgag catcacgccgaggcccattttccagacgaccacgacgaggccggggtcacgatggacttc cccagctccctccgccctgcattgtttctgaccggcccccttggtctgagcgacgtccct gacctctctttcatgtgcagctggcgagacgcactgactctgccagaggcccagccccag aactcagagaatggggcactgcatgtgaccaaggacctgctgtgggagccggcaacccct gggcctctccccatgctgcctcccctcatcgatccctgggaccctggcctgactgcccgg gacctgcttttccgcggagggtgccggtatcggaagcggccccgagtcgtgctggatgtg actgagcagatcagccggttcctcttggatcatggagacgtagcctttgcgcccctgggg aagctgatgctggagaatttcaagctggagggagcggggacttccaactccaaatcatct caggggaaagcagatttacatttgtggagttcaggcacagctgcatccaggtgtccctgg gcttacctcagcaaccgacagcgccgcttctctatcctcgggggccccatcctgggcacg tcggtggcgagccacttggcagagctgctgcacgaggagctggtgctgcggtgggagcag ctgcttctggatgaggcctgcactgggggcgcgctggcctgggttcctggaaggacaccc cagttcgggcagctggtctaccctgctggaggcgcccaggacaggctgcgtatcctttct tgcgagacaggagattccagcgggctcgcctctggtgacaatccccaattccttgggaaa cctggacgcatccagctccagggacctgtccggcaagtggtgacatgcaccgtccaggga gaaactctgctggccgtccgctctgactaccactgtgccgtgtggaagtttggtaaacag tggcagccaacccttctgcaggcaatgcaggtggagaaaggggccacggggatcagcctc aggctgcggcaaatctacagggaccctgagaccctcgtgttccgggactcctcttcgtgg cgttgggcagacttcactgcgcaccctcgggtgctgaccgtgggtgaccgcaccggagtg aagatgctggacactcagatggtcactagaggctgctgcttaccagcggctctgctgggc ccagacaaccacccagcaaaccaaactcaaccctggatgaatctgtggcccaaacacgca gtggatcccacagtcggtgctaacaggcagttctctctctacctagtggacgagcgcctt cccctggtgccgatgctgaagtggaaccatggcctcccctccccgctcctgctggcccga ctgctgcctccgccccggcccagctgcgtgcagcccctgctcctcggaggccagggtggg cagctgcagctgctgcacctggcaggagaaggggcgtcggtgccccgcctggcaggcccc ccccagtctcttccttccaggatcgactccctccctgcatttcctctgctggagcctaag atccagtggcggctgcaggagcgcctgaaagcaccgaccatagcgcccacaccaggcctg gtgctcttccagctctcggcggcgggagatgtcttctaccagcagctccgcccccaggtg gactccagcctccgcagagatgctgggcctcctggcgacacccaacctgactgccatgcc cccacagcttcctggacctcccaggacactgccggctgcagccagtggctgaaggccctg ctaaaagtgcccctggctcctcctgtgtggacagcacccaccttcacccaccgccagatg ctgggcagcacagagctgcggagggaggaagaggaagggcagcggctgggtgtgctccgc aaggccatggcccgagggcagctcctgctgcagagagacctgggctccctccctgcggca gagccaccccctgcacccgagtcaggcctagaggacaagctcagtgagcgcctgggggaa gcctgggcaggccgaggggctgcctggtgggagaggcagcagggcaggacctcggagccc gggagacagaccaggcggcccaagcgccggacccagctgtccagcagcttttcgctcagt ggccatgtggatccctcagaggacaccagctcccctcatagccctgagtggccacctgct gatgctctgcccctgccccccacgaccccgccctcccaggagttgactccggatgcatgc gcccagggcgtcccatcagagcagcggcagatgctccgtgactacatggccaagctacca ccccagagggacaccccaggctgtgccaccacacctccccactcccaggcctccagcgtc cgggccactcgctcccagcagcacacacccgtcctctctagctctcagcccctccggaag aagcctcgaatgggcttctga >gi568815582r:84122220_84337485|GENSCAN_predicted_peptide_4|586_aa MGKAPRVPVPPAGLSLPLKDPPASQAVSLLTEYAASLGIFLLFREDQPPGPCFPFSVSAE LDGVVCPAGTANSKTEAKQQAALSALCYIRSQLENPESPQTSSRPPLAPLSVENILTHEQ RCAALVSAGFDLLLDERSPYWACWAVSAPSCTEIPRARGHVKEIYKLVALGTGSSCCAGW LEFSGQQLHDCHGLVIARRALLRFLFRQLLLATQGGPKGKEQSVLAPQPGPGPPFTLKPR VFLHLYISNTPKGAARDIYLPPTSEGGLPHSPPMRLQAHVLGQLKPVCYVAPSLCDTHVG CLSASDKLARWAVLGLGGALLAHLVSPLYSTSLILADSCHDPPTLSRAIHTRPCLDSVLG PCLPPPYVRTALHLFAGPPVAPSEPTPDTCRGLSLNWSLGDPGIEVVDVATGRVKANAAL GPPSRLCKASFLRAFHQAARAVGKPYLLALKTYEAAKAGPYQEARRQLSLLLDQQGLGAW PSKPLTWTRTHNIGSPSSQAFEFGLERHHVGPQHSHSKCDDEMGESEGGVKLQTFAVSVT ALKVARLELFVPPGGLLGSLASGVKLQTFVVSVTAHKSSVDLKSEQ >gi568815582r:84122220_84337485|GENSCAN_predicted_CDS_4|1761_bp atggggaaggccccgagggtccctgtgcccccagcagggctcagcctgccgctcaaagac ccacctgccagccaggccgtgtccttgctcacggagtacgcggccagcctgggcatcttc ctgctcttccgggaggaccagccaccaggtccctgcttccccttctcggtgagcgcggaa ctggatggggtggtctgccctgcgggcactgcgaatagcaagacggaggccaaacagcag gcagcgctctctgccctctgctacatccggagtcagctggagaacccagagtccccccag acctccagccggcctccactggcccccctgagcgtagagaacatcctgacccatgagcag cgctgcgcagcgttggtgagcgccggctttgacctcctgttggacgagcgctcgccatac tgggcctgctgggccgtctctgccccctcctgcacagagatcccgcgtgccaggggccac gtgaaggagatctacaagctggtggctctgggcaccggcagcagctgctgtgctggctgg ctggagttctcgggccagcagctccacgactgccatggcctggtcatcgcccgcagggcc ctgctgaggttcttgttccggcagctcctgctggccacacaggggggccccaagggcaag gagcagtccgtgctggccccccagccagggcccggacccccattcaccctcaagccccgc gtcttcctgcacctctacatcagcaacacccccaagggcgcggcccgtgacatctacctg ccccccacctcggaaggtggcctcccgcacagcccacccatgcgcctgcaggcccatgtg ctcgggcagctgaagcctgtgtgctacgtggcgccctcgctctgtgacacccacgtgggc tgcctgtcagccagtgacaagctggcacgctgggccgtgctggggctgggtggtgccctg ctggcccacctggtgtccccactctacagcaccagcctcatcctggctgactcatgccac gaccctccgactctgagcagggccatccacacccggccctgcctggacagtgtcctgggg ccatgcctgccacctccctacgtccggaccgccctgcacctgtttgcagggcccccggtg gccccttccgaacccacccctgacacctgccgtggcctgagcctcaactggagcctgggg gaccctggcatcgaggttgtggatgtggccaccgggcgtgtgaaggccaatgccgccctg gggcctccctcccgtctctgcaaggcctcctttctccgggcctttcaccaggcggccagg gctgtggggaagccctacctcctggccttgaagacctacgaggctgccaaggctgggccc taccaggaggctcgcaggcagctgtctctcctcctggaccagcagggcctgggggcttgg ccctcgaagccactgacttggaccaggactcataacattggttctcccagttctcaggcc ttcgagtttggactagaacgccaccacgtgggcccccagcactcacacagcaaatgtgac gatgaaatgggagaatctgagggaggagtgaagctgcagaccttcgcagtgagtgttaca gctcttaaggtggcgcgtctggagttgttcgttcctcctggtgggctcctgggctccctg gcttcaggagtgaagctgcagaccttcgtggtgagtgttacagctcataaaagcagtgtg gacctaaagagtgagcagtag >gi568815582r:84122220_84337485|GENSCAN_predicted_peptide_5|81_aa MTIAKGGHLPAEGKEKTALYGEPFSCCIVDVVARAVRSEADFAELSRAQPCLHGSLLLDD TWRAMESVTQLLHDFRNSPLP >gi568815582r:84122220_84337485|GENSCAN_predicted_CDS_5|246_bp atgactatagccaaaggtggacatcttccagctgaaggcaaagagaaaacggccctttac ggcgagccattttcgtgttgtattgtcgacgtggttgccagagctgtccgttctgaggcg gactttgctgagttgagcagagcacagccctgcctccacggcagcctacttctagatgac acgtggagagccatggagtcagtgactcagctgctgcacgactttcgtaactcacctttg ccataa >gi568815582r:84122220_84337485|GENSCAN_predicted_peptide_6|713_aa MQTWAAATVSAITHGLTTVANSTPLSDSRATAHAHLTTKLPSSPLAFLATREGPAQGHPA TSSCFIVARATRCTRAEMATRSKGVPGEVYSDMDTKLIKDCLDLSIPPGREAEEDEGQSQ WAFQSVAVMKQHCDKSPNCPWSQLLSSPMETPSIKGLYYRRVRKVGALDASPVDLKKEIL INVGGRRYLLPWSTLDRFPLSRLSKLRLCRSYEEIVQLCDDYDEDSQEFFFDRSPSAFGV IVSFLAAGKLVLLQEMCALSFQEELAYWGIEEAHLERCCLRKLLRKLEELEELAKLHRED VLRQQRETRRPASHSSRWGLCMNRLREMVENPQSGLPGKVFACLSILFVATTAVSLCVST MPDLRAEEDQCLPSFLLCSGCQRVFSESLLWLETRWEKQANSPRAGPALPAFPLFSSHLG CYACPTALPGAFRDIVRGTCTPYGNQGECSRKCYYIFIVETICVAWFSLEFCLRFVQAQD KCQFFQGPLNIIDILAISPYYVSLAVSEEPPEDGERPSGSSYLEKVGLVLRVLRALRILY VMRLARHSLGLQTLGLTVRRCTREFGLLLLFLAVAITLFSPLVYVAEKESGRVLEFTSIP ASYWWAIISMTTVGYGDMVPRSVPGQMVALSSILSGILIMAFPATSIFHTFSHSYLELKK EQEQLQARLRHLQNTGPASECELLDPHVASEHELMNDVNDLILEGPALPIMHM >gi568815582r:84122220_84337485|GENSCAN_predicted_CDS_6|2142_bp atgcagacctgggccgctgcaacggtcagtgcaatcacccacggcttaaccacagttgca aattctaccccattaagtgacagcagagccacagcacatgcacaccttaccacaaagctc ccgtcttcaccactcgcattcctggccaccagagagggtcctgcccaagggcaccctgca acatcgtcatgcttcatcgttgcaagggccacaagatgcaccagggctgagatggcaacc aggagcaaaggagtgcctggtgaggtgtactctgacatggacacaaagctgataaaagac tgcctggacctcagcattccaccaggaagggaggcagaggaggacgaagggcagtcacag tgggcattccaatctgtggctgtcatgaagcagcactgtgacaaatctcctaactgccct tggagtcagctcctgtccagccccatggagacgccgtccatcaagggcctttactaccgg agggtgcggaaggtgggtgccctggacgcctccccagtggacctgaagaaggagatcctg atcaacgtggggggcaggaggtatctcctcccctggagcacactggaccggttcccgctg agccgcctgagcaaactcaggctctgtcggagctacgaggagatcgtgcagctctgcgat gattacgacgaggacagccaggagttcttcttcgacaggagccccagcgccttcggggtg atcgtgagcttcctggcggccgggaagctggtgcttctgcaggagatgtgcgcgctgtcc ttccaggaggagctggcctactggggcatcgaggaggcccacctggagaggtgctgcctg cggaagctgctgaggaagctggaggagctggaggagctggccaagctgcacagggaggac gtactgaggcagcagagggagacccgccgccccgcctcgcactcctcgcgctggggcctg tgcatgaaccggctgcgcgagatggtggaaaacccgcagtccgggctgcccgggaaggtc ttcgcttgcctctccatcctcttcgtggccaccacagccgtcagcctgtgtgtcagcacc atgcccgacctcagggcagaggaggaccagtgtctgccctcattcctcctctgctccggc tgtcagcgcgtcttttctgagagcctgctctggctggagacccggtgggagaagcaggcg aactcgccacgcgcaggaccagccctgccggccttcccactgttctcctcccatcttggg tgctacgcctgccccacagcattgcctggggctttcagggatattgtcagagggacatgc acaccctatggcaaccaaggcgaatgctctcggaagtgctactatattttcatcgtggag accatctgcgtggcctggttctccctggagttctgcctgcggtttgtccaggcccaagac aagtgtcagttcttccaggggcccctgaacatcatcgacatcctggccatctccccatac tacgtgtcgctggcggtgtctgaggagcccccggaggacggcgagaggccgagcgggagc tcctacctggagaaggtggggctggtcctgcgtgtgctgcgagcgctgcgcatcctctac gtgatgcgcctggctcgccactcgctggggctgcagacgctggggctcaccgtgcgccgt tgcacacgtgagttcggcctgctccttctcttcctggccgtggccatcaccctcttctcc cctttggtctacgtggccgagaaggagtccgggcgggtgctggagttcaccagcatcccc gcctcctattggtgggccatcatctccatgacaacggtgggctacggggacatggtgccc cgcagtgtgccaggccagatggtggccctcagcagcatcctgagcgggatcctcatcatg gccttcccggccacgtctatcttccacaccttctcccactcctacctggagctcaagaag gagcaggagcagcttcaggcccgcctccgccacctccaaaacaccggtccagccagtgaa tgtgaactcctggacccccatgtggccagtgaacatgagctcatgaacgatgtcaatgac ctaatcctggagggcccagccttgcctatcatgcacatgtaa >gi568815582r:84122220_84337485|GENSCAN_predicted_peptide_7|786_aa MVIGELSVLYCPMDHSKLSLTSRKEERRVPCGVSMQVVTVGPVVEWHPVEFLISLGCSSV WLDENVMCISQDVVGGQQKLNSTSLTLKKISLSHVTRSPEMRKQRQRDGAGPVPGRPGPR SEGGPSSVCVWKVAAQGGNAFNRRGAGQLQEADHPGSVPLATSPPRRLCQEYLETGIACE AGREIPYETPENNDLLSAYTVSGQEFATYSLLGASRQIHRHPRTRIQSALGIKGVSFCWL QPATRHFTDKETEAIAENDLSVLFWLGTSGAITISPESGSFVCSLNQNPLHRAHVQISEL MHVKHLEGCPANCQGVLTTIILIFIIIIIITAFVPPEEYLLSTCYVPGTVLRARDTDASL SVDARLGVHSILSVDTSLGVDAILGVDAILSVDASLGVDAILSVDASLGVDAMLSVDASL GVDAILSVDASLGVDASLGVDASLGIDAILGVDTSLGIDAILGVDTILGADSGYPRREAE EAGAPGGPRQPRADRCPPPPRTLPPGACQAARCQADSECPRHRRCCYNGCAYACLEAVPP PPDWLVQPKPRWLGGNGWLLDGPEEVLQGCIREHVYMDVAEPVRVCLGSGELGLPVFVCV FVGVHVCHMPLSETQGLAWNLGYRGEAASLAEACSTTEDGAEPLLCPSGYECHILSPGDV AEGIPNRGQCVKQRRQADGRILRHKLYKEYPDGEFKIAVLRKLKEIQDNADKEFRILSDK FNKGIEIIKKNQAKILELKNAIDIVKNASESLNNRIDRAEERISELKDRLFENTQSEDRR RNNKKE >gi568815582r:84122220_84337485|GENSCAN_predicted_CDS_7|2361_bp atggtcatcggagagctcagtgtgctctactgtcccatggatcactccaagctgtcactg acctccaggaaggaagaacgaagggttccatgcggcgtctccatgcaggtcgtgacagtg gggcccgtggttgaatggcatccagttgagttcctgatctctcttggctgctcatcagtc tggctggatgagaacgtgatgtgtatcagtcaggatgtagttggtggtcagcaaaagctc aactcaactagcttaaccctaaagaagattagtttgtcccatgtaacaagaagtccagag atgagaaaacagaggcagagagatggggcaggacctgttccaggtcgtcctggcccacgc agcgaggggggcccctcttctgtgtgcgtctggaaggtcgctgcccagggaggaaatgcc tttaaccggcgtggggccgggcagctgcaggaggcagatcatccgggctctgtgcctctt gctacttctcctccacgccggctctgccaagaatatctggaaacgggcattgcctgcgag gctggccgagaaatcccgtatgaaactcctgagaacaatgatttactgagcgcttacact gtgtcagggcaagagtttgccacgtattcgctcttgggagcctctcgacaaatccaccgg caccctcggacacgaatacagtctgctctgggcatcaagggagtcagcttctgttggttg caaccggcaacgagacatttcacagataaggaaactgaggccattgcagagaatgactta tctgtgctgttttggctggggacaagtggggctatcactatatccccagagtctgggagc tttgtctgctccttgaaccagaacccactgcacagggcccatgtgcagatcagcgagtta atgcatgttaagcacttagaagggtgcccagcaaactgccagggtgtgttgaccaccatc atcctcatcttcatcatcatcatcatcatcacagcttttgttccacccgaagaatattta ctgagtacttgctatgtgccaggcactgttctacgagccagggacacagatgccagcctg agtgtagatgcccgcctgggtgtacactccatcctgagtgtagacaccagcctgggtgta gacgccatccttggtgtagatgccatcctgagtgtagatgccagcctgggtgtagacgcc atcctgagtgtagatgccagcctgggtgtagacgccatgctgagtgtagacgccagcctg ggtgtagatgccatcctcagtgtagatgccagcctgggtgtagatgccagcctgggtgta gacgccagcctgggcatagatgccatcctgggtgtagacaccagcctgggcatagatgcc atcctgggtgtagacaccatcctgggtgcagactcaggataccccaggagggaagccgag gaggcgggcgcgcccggcggcccccggcagccccgagcagaccgctgcccgccgcctccg cggacgctgccccccggcgcctgccaggccgcgcgctgtcaggcggactccgagtgcccg cggcaccggcgctgctgctacaacggatgcgcctacgcctgcctagaagctgtgccgccc ccgccagactggctggtgcagccgaaacctcgatggcttggtggcaatggctggctcctg gatggccctgaggaggtgttacaagggtgtatccgtgaacatgtctatatggatgttgct gagcctgtgagagtatgtttgggatcaggtgagcttggactgcctgtgtttgtgtgcgtg tttgtgggtgtgcacgtgtgccacatgccgctgagtgagacccagggcctagcctggaac ctggggtacagaggcgaggccgcctctctggcagaggcgtgcagcaccacggaggatggg gccgaacccctgctctgtccctcgggctatgagtgccacatcctgagcccaggtgacgtg gccgaaggtatccccaaccgtgggcagtgcgtcaagcagcgccggcaagcagatgggcga atcctacgacacaaactttacaaagaatatccagacggagaattcaagatagctgtgttg aggaagctcaaagaaattcaagataacgcagataaggaattcagaattctatcagataaa tttaacaaagggattgaaattattaaaaagaatcaagcaaaaattctagagttgaaaaat gcaattgacatagtgaagaatgcatcagagtctcttaataacagaattgatcgagcagaa gaaagaattagtgagcttaaagacagactatttgaaaatacacagtcagaggacagaaga agaaataataaaaaagaatga