GENSCAN 1.0 Date run: 6-Nov-116 Time: 06:13:40 Sequence gi568815597r:149843860_150077510 : 233651 bp : 43.62% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init - 6914 6539 376 2 1 78 89 675 0.970 63.70 1.00 Prom - 7003 6964 40 -16.58 2.00 Prom + 7167 7206 40 -5.66 2.01 Init + 7255 7643 389 0 2 62 61 701 0.090 61.08 2.02 Term + 8654 9206 553 2 1 -5 48 1085 0.930 88.99 2.03 PlyA + 10392 10397 6 1.05 3.06 PlyA - 12109 12104 6 1.05 3.05 Term - 17323 16961 363 1 0 45 47 680 0.924 54.17 3.04 Intr - 21579 21507 73 2 1 66 66 31 0.243 -1.89 3.03 Intr - 26808 26687 122 0 2 120 68 50 0.309 5.49 3.02 Intr - 35071 34906 166 0 1 20 21 119 0.089 -1.64 3.01 Init - 42781 42405 377 1 2 69 89 997 0.486 94.31 3.00 Prom - 42857 42818 40 -7.96 4.00 Prom + 42976 43015 40 -10.64 4.01 Sngl + 43116 43505 390 2 0 59 36 717 0.999 59.82 4.02 PlyA + 43560 43565 6 -0.45 5.02 PlyA - 43601 43596 6 -5.80 5.01 Sngl - 44057 43665 393 2 0 60 32 563 0.854 42.14 5.00 Prom - 44141 44102 40 -6.76 6.00 Prom + 51861 51900 40 -7.26 6.01 Sngl + 56201 56614 414 1 0 38 46 492 0.876 34.30 6.02 PlyA + 60553 60558 6 1.05 7.13 PlyA - 60650 60645 6 1.05 7.12 Term - 61338 61155 184 2 1 1 49 226 0.817 7.02 7.11 Intr - 62180 62021 160 0 1 93 53 115 0.998 7.55 7.10 Intr - 62997 62791 207 1 0 113 87 234 0.999 24.85 7.09 Intr - 63974 63841 134 1 2 70 95 170 0.996 16.19 7.08 Intr - 64347 64183 165 2 0 79 55 187 0.975 13.58 7.07 Intr - 65421 65333 89 0 2 86 94 10 0.972 0.17 7.06 Intr - 65712 65602 111 0 0 113 95 128 0.985 16.58 7.05 Intr - 66031 65942 90 1 0 65 63 128 0.912 8.19 7.04 Intr - 66844 66711 134 2 2 126 49 91 0.975 9.36 7.03 Intr - 67118 66967 152 1 2 68 89 129 0.871 10.81 7.02 Intr - 68121 67941 181 1 1 141 92 212 0.999 25.83 7.01 Init - 69981 69360 622 0 1 86 82 973 0.999 92.01 7.00 Prom - 76743 76704 40 -3.46 8.21 PlyA - 78758 78753 6 1.05 8.20 Term - 79870 79683 188 2 2 139 32 69 0.983 4.05 8.19 Intr - 80155 79982 174 2 0 85 106 87 0.983 10.11 8.18 Intr - 83059 82517 543 2 0 124 103 667 0.993 64.67 8.17 Intr - 83435 83307 129 0 0 91 113 87 0.997 12.17 8.16 Intr - 84017 83788 230 1 2 23 27 138 0.092 -1.29 8.15 Intr - 85458 85274 185 0 2 103 32 21 0.060 -3.41 8.14 Intr - 86057 85764 294 2 0 105 110 -4 0.208 0.71 8.13 Intr - 86688 86506 183 0 0 81 72 43 0.631 1.98 8.12 Intr - 87106 86933 174 1 0 105 82 23 0.635 3.54 8.11 Intr - 87567 87401 167 1 2 89 78 20 0.607 0.78 8.10 Intr - 88155 88085 71 2 2 100 113 55 0.990 8.03 8.09 Intr - 88471 88405 67 2 1 90 78 30 0.954 0.16 8.08 Intr - 89671 89547 125 0 2 116 85 22 0.959 4.93 8.07 Intr - 90467 90332 136 0 1 109 58 159 0.984 14.63 8.06 Intr - 90667 90589 79 1 1 81 105 63 0.999 6.42 8.05 Intr - 91362 91127 236 1 2 114 110 48 0.641 7.01 8.04 Intr - 91500 91440 61 0 1 62 119 88 0.996 7.61 8.03 Intr - 91846 91725 122 2 2 116 94 26 0.969 6.21 8.02 Intr - 95655 95482 174 1 0 112 82 -11 0.556 0.61 8.01 Init - 95808 95739 70 0 1 72 94 4 0.816 0.63 8.00 Prom - 97963 97924 40 -3.76 9.07 PlyA - 98972 98967 6 1.05 9.06 Term - 101206 99998 1209 1 0 116 47 720 0.997 62.36 9.05 Intr - 103476 103365 112 2 1 99 70 30 0.659 2.68 9.04 Intr - 105224 105110 115 0 1 102 115 107 0.934 14.31 9.03 Intr - 105919 105770 150 2 0 80 48 126 0.979 7.93 9.02 Intr - 106362 106235 128 2 2 128 113 86 0.974 15.42 9.01 Init - 109973 109570 404 2 2 60 27 168 0.086 4.00 9.00 Prom - 111891 111852 40 -4.96 10.09 PlyA - 112052 112047 6 1.05 10.08 Term - 113458 113002 457 0 1 29 48 288 0.415 13.50 10.07 Intr - 115937 115825 113 2 2 73 106 82 0.779 7.68 10.06 Intr - 120490 120363 128 2 2 78 59 109 0.964 7.40 10.05 Intr - 122019 121918 102 1 0 103 97 4 0.849 2.95 10.04 Intr - 123662 123435 228 0 0 87 82 138 0.890 10.94 10.03 Intr - 127392 127204 189 1 0 90 79 95 0.881 8.36 10.02 Intr - 133676 133567 110 1 2 54 100 50 0.345 2.73 10.01 Init - 140650 140598 53 1 2 64 49 79 0.233 2.23 10.00 Prom - 147248 147209 40 -3.46 11.00 Prom + 154489 154528 40 -4.46 11.01 Init + 179274 179280 7 2 1 62 115 0 0.810 1.19 11.02 Term + 188748 188884 137 1 2 41 48 200 0.574 9.48 11.03 PlyA + 189288 189293 6 1.05 12.00 Prom + 199247 199286 40 -2.46 12.01 Sngl + 210005 210424 420 1 0 79 52 292 0.259 21.00 12.02 PlyA + 212175 212180 6 1.05 13.00 Prom + 214217 214256 40 -5.66 13.01 Init + 223999 224091 93 0 0 107 113 116 0.998 16.38 13.02 Intr + 224771 224905 135 1 0 78 79 92 0.958 8.06 13.03 Intr + 228307 228367 61 0 1 105 78 7 0.460 -0.29 13.04 Intr + 232374 232453 80 1 2 66 97 113 0.305 9.27 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 7255 7647 393 0 0 62 54 715 0.909 61.74 S.002 Init + 8512 8567 56 0 2 62 119 73 0.841 8.56 S.003 Init - 84037 83788 250 1 1 76 27 146 0.809 5.04 S.004 Sngl - 109973 109008 966 2 0 60 41 246 0.842 13.81 S.005 Sngl - 111798 111223 576 0 0 49 43 180 0.828 5.77 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:149843860_150077510|GENSCAN_predicted_peptide_1|126_aa MPEPAKFAPAPKKGSKKAVTKAQKKDGKKRKRSRKESYSIYVYKVLKRVHPDTGIWCKAM GIMNSFLNDIFERIAGEASRLAHYNKRSTITSRRSRRPCACCCPASWPSTPCPRAPRRSP STPAPX >gi568815597r:149843860_150077510|GENSCAN_predicted_CDS_1|378_bp atgcctgagcctgcaaagttcgcgccggctcccaagaagggctccaagaaagccgtcacc aaagcccagaagaaagacggcaagaagcgcaagcgcagccgcaaggagagctactccatc tacgtgtacaaggtgctgaagcgggtccaccccgacaccggcatctggtgcaaggccatg ggcatcatgaactccttcctcaacgacatcttcgagcgcatcgcgggagaggcgtcccgc ctggcgcactacaacaagcgctccaccatcacgtcccggagatccagacggccgtgcgcc tgctgctgcccggcgagctggccaagcacgccgtgtccgagggcaccaaggcggtcacca agtacgccagctccaann >gi568815597r:149843860_150077510|GENSCAN_predicted_peptide_2|313_aa MSGRGKQGGKARAKAKSRSSRAGLQFPVGRVHRLLRKGNYAERVGAGAPVYMAAVLEYLT AEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGKVTIAQGGVLPNIQAVLLPKK TESHHKAKGNQWTASAGFSIIVPPNREKTVLIKTAAAGLGARFSPRRCAGKPVFWFAMAR TKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHRYRPGTVALREIRRYQKSTELLI RKLPFQRLVREIAQDFKTDLRFQSSAVMALQEASEAYLVGLFEDTNLCAIHAKRVTIMPK DIQLARRIRGERA >gi568815597r:149843860_150077510|GENSCAN_predicted_CDS_2|942_bp atgtctggtcgtggcaagcaaggaggcaaggcccgcgccaaggccaagtcgcgctcgtcc cgcgctggccttcagttcccggtagggcgagtgcatcgcttgctgcgcaaaggcaactac gcggagcgagtgggggccggcgcgcccgtctacatggctgcggtcctcgagtatctgacc gccgagatcctggagctggcgggcaacgcggctcgggacaacaagaagacgcgcatcatc cctcgtcacctccagctggccatccgcaacgacgaggaactgaacaagctgctgggcaaa gtcaccatcgcccagggcggcgtcttgcctaacatccaggccgtactgctccctaagaag acggagagtcaccacaaggcaaagggcaaccaatggacagccagcgcgggattttcaatt attgttccgcccaatcgggaaaagactgtgcttataaagacggctgcggcggggctagga gctcgtttttctccccgccgctgcgctggtaagcctgtgttttggttcgctatggcccgt actaagcagactgctcgcaagtcgaccggcggcaaggccccgaggaagcagctggccacc aaggcggcccgcaagagcgcgccggccacgggcggggtgaagaagccgcaccgctaccgg cccggcaccgtagccctgcgggagatccggcgctaccagaagtccacggagctgctgatc cgcaagctgcccttccagcggctggtacgcgagatcgcgcaggactttaagacggacctg cgcttccagagctcggccgtgatggcgctgcaggaggccagcgaggcctacctggtgggg ctgttcgaagacacgaacctgtgcgccatccacgccaagcgcgtgaccattatgcccaag gacatccagctggcccgccgcatccgtggagagcgggcttaa >gi568815597r:149843860_150077510|GENSCAN_predicted_peptide_3|366_aa MPEPAKSAPAPKKGSKKAVTKAQKKDGKKRKRSRKESYSIYVYKVLKQVHPDTGISSKAM GIMNSFVNDIFERIAGEASRLAHYNKRSTITSREIQTAVRLLLPGELAKHAVSEGTKAVT KYTSSNQLMKDLNYICKVFLINSKYRFCPYSKGGDLNNGYQGPGLTGAIAEFCLLQEYRG TLFQASKCRPQSASCPTIQAFHYYRKHLAKDHHHLPVLPLKLTDLAEDIINSDQRTLSLS NNTTFMGKTVLALTEAVYRAPAVMSGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRR LARRGGVKRISGLIYEETRGVLKVFLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRT LYGFGG >gi568815597r:149843860_150077510|GENSCAN_predicted_CDS_3|1101_bp atgcctgaaccggcaaaatccgctccggcccctaaaaagggctccaagaaagccgtcacc aaagcccagaagaaagacggcaagaagcgcaagcgcagccgcaaagagagctactccatc tacgtgtacaaggtgctgaagcaggtccaccccgacaccggcatctcgtccaaggccatg ggcatcatgaactccttcgtcaacgacatcttcgagcgcatcgcgggagaggcttcccgc ctggcgcactacaacaagcgctccaccatcacatcccgcgagatccagacggccgtgcgc ctgctgctgcccggcgagctggccaagcacgccgtgtccgagggcaccaaggcggtcacc aagtacaccagctccaatcaactgatgaaggatcttaactacatctgcaaagttttcttg attaacagcaagtacaggttctgcccatactcaaagggaggggatttaaacaacgggtac caggggccaggactcacgggagccatcgcagaattttgcctactacaggaatatagagga actcttttccaggcaagtaaatgcaggccacagtctgccagctgtccaaccattcaggct tttcattattataggaagcatctggccaaggaccaccaccatcttcctgtcctgcccctg aagctcactgacttagctgaggacatcatcaattctgatcaaaggaccctaagtctaagt aacaataccactttcatggggaagacggtgctcgccttgacagaagctgtctatcgggct ccagcggtcatgtccggcagaggaaagggcggaaaaggcttaggcaaagggggcgctaag cgccaccgcaaggtcttgagagacaacattcagggcatcaccaagcctgccattcggcgt ctagctcggcgtggcggcgttaagcggatctctggcctcatttacgaggagacccgcggt gtgctgaaggtgttcctggagaatgtgattcgggacgcagtcacctacaccgagcacgcc aagcgcaagaccgtcacagccatggatgtggtgtacgcgctcaagcgccaggggcgcacc ctgtacggcttcggaggctag >gi568815597r:149843860_150077510|GENSCAN_predicted_peptide_4|129_aa MSGRGKQGGKARAKAKSRSSRAGLQFPVGRVHRLLRKGNYAERVGAGAPVYMAAVLEYLT AEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGKVTIAQGGVLPNIQAVLLPKK TESHKAKSK >gi568815597r:149843860_150077510|GENSCAN_predicted_CDS_4|390_bp atgtctggtcgtggcaaacaaggaggcaaggcccgcgccaaggccaagtcgcgctcgtcc cgcgctggcctccagttcccggtagggcgagtgcaccgcttgctgcgcaaaggcaactac gcggagcgggtgggggccggcgcgcccgtctacatggcggcggtcctcgagtacctgacc gccgagatcctggagctggcgggcaacgcggctcgggacaacaagaagacgcgcatcatc cctcgtcacctccagctggccatccgcaacgacgaggaactgaacaagctgctgggcaaa gtcaccatcgcccagggcggcgttttgcctaacatccaggccgttctgttaccaaagaaa accgaaagccacaaagccaaaagcaaataa >gi568815597r:149843860_150077510|GENSCAN_predicted_peptide_5|130_aa MSGRGKQGGKARAKAKSRSSRAGLQFPVGRVHRLLRKGNYAERVGAGAPVYLAAVLEYLT AEILELAGNAARDNKKTRIIPRHLQLAVRNDEELNKLLGGVTIAQGGVLPNIQAVLLPKK TESHKPGKNK >gi568815597r:149843860_150077510|GENSCAN_predicted_CDS_5|393_bp atgtcaggacgcggaaagcagggaggcaaggcccgcgctaaggccaagtcgcgctcgtcc cgcgctggtctccagttcccggtggggcgagtgcaccgcttgctgcgcaaaggcaactac gcggagcgggtcggggcaggcgccccggtgtacctggcggcggtcctcgagtacctgacc gcggaaattctggagctggcgggcaacgcggctcgggacaacaagaagacgcgcatcatc cctcgccatctgcaactagccgtgaggaatgacgaagagctcaacaagttactcgggggt gtcaccattgcccagggcggcgtcttgcccaatatccaggctgtcctgttgcccaagaaa acggagagtcacaagcctggcaagaacaagtaa >gi568815597r:149843860_150077510|GENSCAN_predicted_peptide_6|137_aa MLSGRLVLGLVSMAGRVCLCQGSAGSGAIGPVEAAIRTKLEEALSPEVLELRNESGGHAV PPGSETHFRVAVVSSRFEGLSPLQRHRLVHAALAEELGGPVHALAIQARTPAQWRENSQL DTSPPCLGGNKKTLGTP >gi568815597r:149843860_150077510|GENSCAN_predicted_CDS_6|414_bp atgctgagtgggcggctggtcctgggtctggtctccatggctggccgcgtttgtttgtgc cagggcagcgcgggatccggggccatcggtccggtggaggccgccattcgcacgaagttg gaggaggccctgagccccgaggtgctagagcttcgcaacgagagcggtggccacgcggtc ccgcctggcagtgagactcacttccgcgtggctgtggtgagctctcgtttcgagggactg agccccctacaacgacaccggctggtccacgcagcgctggccgaggagctgggaggtccg gtccatgcgctggccatccaggcacggacccccgcccagtggagagagaactctcagctg gacactagccccccatgcctgggtgggaacaagaaaactctaggaaccccctga >gi568815597r:149843860_150077510|GENSCAN_predicted_peptide_7|742_aa MEEGFRDRAAFIRGAKDIAKEVKKHAAKKVVKGLDRVQDEYSRRSYSRFEEEDDDDDFPA PSDGYYRGEGTQDEEEGGASSDATEGHDEDDEIYEGEYQGIPRAESGGKGERMADGAPLA GVRGGLSDGEGPPGGRGEAQRRKEREELAQQYEAILRECGHGRFQWTLYFVLGLALMADG VEVFVVGFVLPSAEKDMCLSDSNKGMLGLIVYLGMMVGAFLWGGLADRLGRRQCLLISLS VNSVFAFFSSFVQGYGTFLFCRLLSGVGIGGSIPIVFSYFSEFLAQEKRGEHLSWLCMFW MIGGVYAAAMAWAIIPHYGWSFQMGSAYQFHSWRVFVLVCAFPSVFAIGALTTQPESPRF FLENGKHDEAWMVLKQVHDTNMRAKGHPERVFSVTHIKTIHQEDELIEIQSDTGTWYQRW GVRALSLGGQVWGNFLSCFGPEYRRITLMMMGVWFTMSFSYYGLTVWFPDMIRHLQAVDY ASRTKVFPGERVEHVTFNFTLENQIHRGGQYFNDKFIGLRLKSVSFEDSLFEECYFEDVT SSNTFFRNCTFINTVFYNTDLFEYKFVNSRLINSTFLHNKEGCPLDVTGTGEGAYMVYFV SFLGTLAVLPGNIVSALLMDKIGRLRMLAGSSVMSCVSCFFLSFGNSESAMIALLCLFGG VSIASWNALDVLTVELYPSDKRTTAFGFLNALCKLAAVLGISIFTSFVGITKAAPILFAS AALALGSSLALKLPETRGQVLQ >gi568815597r:149843860_150077510|GENSCAN_predicted_CDS_7|2229_bp atggaagagggcttccgagaccgggcagctttcatccgtggggccaaagacattgctaag gaagtcaaaaagcatgcggccaagaaggtggtgaagggcctggacagagtccaggacgaa tattcccgaagatcgtactcccgctttgaggaggaggatgatgatgatgacttccctgct cccagtgatggttattaccgaggagaagggacccaggatgaggaggaaggtggtgcatcc agtgatgctactgagggccatgacgaggatgatgagatctatgaaggggaatatcagggc attccccgggcagagtctgggggcaaaggcgagcggatggcagatggggcgcccctggct ggagtaagggggggcttgagtgatggggagggtccccctgggggccggggggaggcacaa cgacggaaagaacgagaagaactggcccaacagtatgaagccatcctacgggagtgtggc cacggccgcttccagtggacactgtattttgtgcttggtctggcgctgatggctgacggt gtggaggtctttgtggtgggcttcgtgctgcccagcgctgagaaagacatgtgcctgtcc gactccaacaaaggcatgctaggcctcatcgtctacctgggcatgatggtgggagccttc ctctggggaggtctggctgaccggctgggtcggaggcagtgtctgctcatctcgctctca gtcaacagcgtcttcgccttcttctcatcttttgtccagggttacggcactttcctcttc tgccgcctactttctggggttgggattggagggtccatccccattgtcttctcctatttc tccgagtttctggcccaggagaaacgaggggagcatttgagctggctctgcatgttttgg atgattggtggcgtgtacgcagctgctatggcctgggccatcatcccccactatgggtgg agttttcagatgggttctgcctaccagttccacagctggagggtcttcgtcctcgtctgc gcctttccttctgtgtttgccattggggctctgaccacgcagcctgagagcccccgtttc ttcctagagaatggaaagcatgatgaggcctggatggtgctgaagcaggtccatgatacc aacatgcgagccaaaggacatcctgagcgagtgttctcagtaacccacattaagacgatt catcaggaggatgaattgattgagatccagtcggacacagggacctggtaccagcgctgg ggggtccgggccttgagcctaggggggcaggtttgggggaattttctctcctgttttggt cccgaatatcggcgcatcactctgatgatgatgggtgtgtggttcaccatgtcattcagc tactatggcctgaccgtctggtttcctgacatgatccgccatctccaggcagtggactac gcatcccgcaccaaagtgttccccggggagcgcgtagagcatgtaacttttaacttcacg ttggagaatcagatccaccgaggcgggcagtacttcaatgacaagttcattgggctgcgg ctcaagtcagtgtcctttgaggattccctgtttgaagagtgttattttgaggatgtcaca tccagcaacacgtttttccgcaactgcacattcatcaacactgtgttctataacactgac ctgttcgagtacaagtttgtgaacagccgtctgataaacagtacattcctgcacaacaag gagggctgcccgctagacgtgacagggacgggcgaaggtgcctacatggtatactttgtg agcttcctggggacactggcagtgcttcctgggaatatcgtgtctgccctgctcatggac aagatcggcaggctcagaatgcttgctggctccagcgtgatgtcctgtgtctcctgcttc ttcctgtcttttgggaacagtgagtcggccatgatcgctctgctctgcctttttggcggg gtcagcattgcatcctggaatgcgctggacgtgttgactgttgaactctacccctcagac aagaggaccacagcttttggcttcctgaatgccctgtgtaagctggcagctgtgctgggg atcagcatcttcacatccttcgtgggaatcaccaaggctgcacccatcctctttgcctca gctgcccttgcccttggcagctctctggccctgaagctgcctgagacccgggggcaggtg ctgcagtga >gi568815597r:149843860_150077510|GENSCAN_predicted_peptide_8|1135_aa MSHRDWPLCLPSPMSQCPRHIATGLSTIDLDESCRHSVSSTLQTASLQITGYFATIHQLG NKILRCPHVDFLCAISCIGQVGEQILAWAPGVRKGLEPELSGTLICTNFRVTFQPCGWQW NQDTPLNSEYDFALVNIGRLEAENSRDGVGWAARVPSGETLVHGYTVLYLLTSVSGLSRV QLLRPGSLHKFIPEEILIHGRDFRLLRVGFEAGGLEPQAFQVTMAIVQARAQSNQAQQYS GITLSKAGQGSGSRKPPIPLMETAEDWETERKKQAARGWRVSTVNERFDVATRAVELMLQ AGHSDVVLVDTMDELPSLADVQLAHLRLRALCLPDSSVAEDKWLSALEGTRWLDYVRACL RKASDISVLVTSRVRSVILQERGDRDLNGLLSSLVQLLSAPEARTLFGFQSLVQREWVAA GHPFLTRLGGTGASEEAPVFLLFLDCVWQLLQQFPADFEFSEFFLLALHDSVRVPDTLTF LRNTPWERGKQSGQLNSYTQVYTPGYSQPPAGNSFNLQLSVWDWDLRYSNAQILQFQNPG YDPEHCPDSWLPRPQPSFMVPGPPSSVWLFSRGALTPLNQLCPWRDSPSLLAVSSRWLPR PAISSESLADQEWGLPSHWGACPLPPGLLLPGYLGPQIRLWRRCYLRGRPEVQMGLSAPT ISGLQDELSHLQELLRKWTPRISPEDHSKKRDPHTILNPTEIAGILKGRAEGDLGHLLFR GPAHRYVRLRPGSGSRAEVRRRNRCWETAGSLSPWLPGRSPSGIRVTGGGKSGMGSAWLG GPDYSQQCLCRDATVYVGGLDEKVSEPLLWELFLQAGPVVNTHMPKDRVTGQHQGYGFVE FLSEEDADYAIKIMNMIKLYGKPIRVNKASAHNKNLDVGANIFIGNLDPEIDEKLLYDTF SAFGVILQTPKIMRDPDTGNSKGYAFINFASFDASDAAIEAMNGQYLCNRPITVSYAFKK DSKGERHGSAAERLLAAQNPLSQADRPHQLFADAPPPPSAPNPVVSSLGSGLPPPGMSQM QLAHHGPHGLGHPHAGPPGSGGQPPPRPPPGMPHPGPPPMGMPPRGPPFGSPMGHPGPMP PHGMRGPPPLMPPHGYTGPPRPPPYGYQRGPLPPPRPTPRPPVPPRGPLRGPLPQ >gi568815597r:149843860_150077510|GENSCAN_predicted_CDS_8|3408_bp atgagccaccgcgactggcctctgtgtcttccttctccaatgagtcagtgccccagacat atagccacaggcctttcaactatagatctggatgagtcatgcaggcattctgtctcttct actttgcagacagccagtctgcaaatcacagggtattttgccaccatacatcagttgggt aacaaaatattacgatgtccccatgtagactttctctgtgccatctcttgcattggccaa gtgggggagcagatcctagcatgggccccaggggtgaggaagggcctggaaccagaattg tctggaaccctgatctgtaccaactttagggtcaccttccagccctgtggatggcagtgg aatcaggacactcccttgaacagtgaatacgattttgccctggtcaacattggacgatta gaggctgaaaacagcagggatggagtgggatgggctgcaagagtcccttctggagagact ctggtccatggttacactgttctgtacctcttgacttcagtgagcggcttgtcccgagtc cagctcctccgtccagggtccctgcataaatttatccctgaggagattctgattcatggc cgagacttccggctgctcagagttggttttgaggctggaggcctagagcctcaggctttt caggtgaccatggccattgtccaagccagagctcagagcaatcaagcccaacagtattcg gggataaccctgagcaaggctggccagggttctggctccagaaaaccaccaattcctctc atggagacagcggaagactgggagactgagcggaagaagcaggcagccagaggctggagg gtcagcacggtcaacgagaggttcgacgtagccaccagagcagtggagttgatgctccag gctgggcattcagatgttgtcctggtagacactatggatgagctgcccagccttgcagat gtccaacttgcccacctgaggctgagggccctctgcctgcctgattcatctgtagctgag gataaatggctttcagccctggaaggaacacgatggctggactatgtcagggcttgtctt cgaaaggccagtgacatttcagtattagtgacatccagggttcgttctgtaatacttcaa gagcgcggtgatcgtgatctcaatggcctcctctcttcactcgtccagctgctttcagcc cccgaagcccgaacactgtttggcttccaatcactagtacagcgagagtgggtggcagct ggacatcccttcctgactcggcttgggggaactggggccagtgaagaggctccggtgttt ctcctcttccttgattgtgtctggcagctcctccagcagtttccagctgattttgaattc tctgagtttttccttcttgctcttcatgacagtgtcagggttcctgacacccttaccttc ctgagaaataccccctgggagcgcggaaagcagagcggacagttaaactcctatacacaa gtctacaccccaggatactcccagcctccagctgggaactcttttaacctgcagctgtct gtctgggactgggatttacgttatagcaatgcacagatactacaattccagaatcctggc tatgacccagaacactgtccagattcctggctccctagaccacagccaagcttcatggtt cctggaccccccagttctgtgtggctcttctctagaggagcattgacccccctgaatcag ctctgtccttggcgggacagtccttccctgctggcagtctcttctcgttggctccctcga cctgctatctcctctgaaagcctggctgaccaggaatggggtctcccctcacattgggga gcttgccctttacctccagggctgctgctgcctgggtatctgggaccccagatcaggctc tggagacgctgctacctgaggggaaggcctgaggtccagatgggcctctcagctcccaca atctctggcctccaggatgagctatcccatcttcaggagttattacggaaatggacacca agaatatctcctgaggatcactccaagaaaagagatccacataccattctcaatcccact gaaattgctggcattctcaaaggcagggcagagggggatctggggcacctcctcttccgc ggccccgcccaccgctacgtccgtcttcgccccggaagtggaagtcgtgctgaggtcaga aggcggaaccgctgctgggagacggcgggatctctttcgccatggctgccgggccgatct ccgagcggaatcagggtaactggagggggaaagtctggaatgggctcagcgtggctagga ggccccgactacagccagcagtgtctctgcagggatgccactgtgtacgtggggggcctg gatgagaaggttagtgaaccgctgctgtgggaactgtttctccaggctggaccagtagtc aacacccacatgccaaaggatagagtcactggccagcaccaaggctatggctttgtggaa ttcttgagtgaggaagatgctgactatgccattaagatcatgaacatgatcaaactctat gggaagccaatacgggtgaacaaagcatcagctcacaacaaaaacctggatgtaggggcc aacattttcattgggaacctggaccctgagattgatgagaagttgctttatgatactttc agcgcctttggggtcatcttacaaacccccaaaattatgcgggaccctgacacaggcaac tccaaaggttatgcctttattaattttgcttcatttgatgcttcggatgcagcaattgaa gccatgaatgggcagtacctctgtaaccgtcctatcaccgtatcttatgccttcaagaag gactccaagggtgagcgccatggctcagcagccgaacgacttctggcagctcagaacccg ctctcccaggctgatcgccctcatcagctgtttgcagatgcacctcctccaccctctgct cccaatcctgtggtatcatcattggggtctgggcttcctccaccagggatgtctcagatg cagcttgcacaccatggccctcatggcttaggacatccccacgctggacccccaggctct gggggccagccaccgccccgaccaccacctggaatgcctcatcctggacctcctccaatg ggcatgcccccccgagggcctccattcggatctcccatgggtcacccaggtcctatgcct ccgcatggtatgcgtggacctcctccactgatgcccccccatggatacactggccctcca cgacccccaccctatggctaccagcgggggcctctccctccacccagacccactccccgg ccaccagttccccctcgaggcccacttcgaggccctctccctcagtaa >gi568815597r:149843860_150077510|GENSCAN_predicted_peptide_9|705_aa MSELPFTIASKRIKYLEIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNISCSWIGRLSIV KMAILPKVIYRLSAIPIKPPMTFFTELEKTTLKSIWNQKRACIAKTILSQKNKAGGITLP DFKLYYKAAVTKTAWVESSEEPVYESLEEFHVFVLAHVLRRPIVVVADTMLRDSGGEAFA PIPFGGIYLPLEVPASQCHRSPLVLAYDQAHFSALVSMEQKENTKEQAVIPLTDSEYKLL PLHFAVDPGKGWEWGKDDSDNVRLASVILSLEVKLHLLHSYMNVKWIPLSSDAQVKTSPT STLAPLAQPESPTASAGDEPRSTPESGDSDKESVGSSSTSNEGGRRKEKSKRDREKDKKR ADSVANKLGSFGKTLGSKLKKNMGGLMHSKGSKPGGVGTGLGGSSGTETLEKKKKNSLKS WKGGKEEAAGDGPVSEKPPAESVGNGGSKYSQEVMQSLSILRTAMQGEGKFIFVGTLKMG HRHQYQEEMIQRYLSDAEERFLAEQKQKEAERKIMNGGIGGGPPPAKKPEPDAREEQPTG PPAESRAMAFSTGYPGDFTIPRPSGGGVHCQEPRRQLAGGPCVGGLPPYATFPRQCPPGR PYPHQDSIPSLEPGSHSKDGLHRGALLPPPYRVADSYSNGYREPPEPDGWAGGLRGLPPT QTKCKQPNCSFYGHPETNNFCSCCYREELRRREREPDGELLVHRF >gi568815597r:149843860_150077510|GENSCAN_predicted_CDS_9|2118_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctagaaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa gaggacacaaacaaatggaagaacatttcatgctcttggataggaagactcagtatcgtg aaaatggccatactgcccaaggtaatttatagactcagtgccatccccatcaaaccacca atgactttcttcacagaattggaaaaaactactttaaagtccatatggaaccaaaaaaga gcctgcattgccaagacaatcctaagccaaaagaacaaagctggaggcatcacgctacct gacttcaaactatactacaaggctgcagtaaccaaaacagcatgggtggagagttctgag gagcctgtatatgagagccttgaagagtttcacgtctttgtccttgctcatgtgcttagg aggcccatagtcgtcgtggcagacaccatgctgagggactccggaggggaagcatttgcc cctattccctttggaggaatctatctgcctttggaggtcccagccagccagtgtcaccgc tcccctctggtgctcgcctatgatcaggcccacttttctgcactcgtgtccatggagcag aaggagaataccaaggaacaagctgtgatcccacttacagattcagagtataagctgctg cccttgcactttgctgtggaccctggaaagggctgggagtggggcaaagatgatagtgac aatgtccgattggccagtgtaattctgtccctagaggtcaaattgcatctgctgcatagc tacatgaatgtgaagtggatcccactgtcctctgatgcacaggtgaagacttctcccaca tctaccctggctcctctggcccagcctgagtcccccaccgcctcagctggagatgagccc cggtccactcctgagtctggagactcagacaaggagtcagttggcagcagttccaccagc aacgagggcggccggcggaaggagaagtcaaagcgagatcgggagaaggacaagaagaga gcagattctgtggctaacaaactgggcagctttggcaaaaccttgggcagcaagctcaag aagaacatggggggcctgatgcacagcaagggttcaaagcctggaggggtggggacaggg ttgggaggaagcagcggcactgagacactggagaagaagaagaaaaactcactgaagagc tggaagggtggcaaggaggaggcagctggggatgggcctgtgtctgagaagcccccagct gagtctgttggtaacggagggagcaagtatagccaggaggtgatgcagagcctgagcatt ctgaggactgccatgcaaggggaggggaagtttatttttgttggaaccctgaagatgggt caccgtcaccagtatcaggaggaaatgatccagcgctacctttctgatgctgaggagaga ttcctggcagaacagaagcagaaggaggcagagaggaagatcatgaatggaggaataggg ggtggccctcctccagccaaaaagccagagccagatgctagggaagagcagccgaccggt cccccagcagagtccagggcaatggcattttccactggctaccctggggactttactatc cctcggccgtctgggggcggagtccactgccaggaaccccggaggcagttggcagggggt ccatgtgtcgggggcctaccaccatatgccaccttccccagacagtgccctcctgggcga ccctacccccaccaggacagcatcccttctctggagccaggcagccactctaaggatgga cttcacaggggtgccttgttaccacccccctaccgagtggctgattcctatagcaatggc tacagagagccccctgagccagatggatgggctggaggtctccggggccttcccccaact cagaccaaatgcaaacaaccgaactgcagcttctatggacaccctgagacaaacaacttc tgttcctgttgttacagggaagaactgaggaggagggagcgggaaccggatggggagctc ctggtgcacaggttctga >gi568815597r:149843860_150077510|GENSCAN_predicted_peptide_10|459_aa MGVHKDDQDTGPEKQEECWLKVLEDHMTLDMDAVLSDFVRSTGAEPGLARDLLEGKNWDV NAALSDFEQLRQVHAGNLPPSFSEGSGGSRTPEKGFSDREPTRPPRPILQRQDDIVQEKR LSRGISHASSSIVSLARSHVSSNGGGGGSNEHPLEMPICAFQLPDLTVYNEDFRSFIERD LIEQSMLVALEQAGRLNWWVSVDPTSQRLLPLATTGDGNCLLHAASLGMWGFHDRDLMLR KALYALMEKGVEKEALKRRWRWQQTQQNKESGLVYTEDEWQKEWNELIKLASSEPRMHLG TNGANCGGQTGSGVDLQQTPTDLQLRVLTVRRKTNRKDIHTNTPSVCHHHQRPKVDKTTK MGRNQSRKAENSKNQSASSPPKERSSSPATEQSWTENDFDELREEGFRRSVITNSELKED VRTHRKEAKNLEKRLDEWLTRINSIEKTLNDLMELKTMA >gi568815597r:149843860_150077510|GENSCAN_predicted_CDS_10|1380_bp atgggagtccacaaagacgaccaagacacagggccagagaagcaggaagagtgctggctg aaagtacttgaggatcacatgaccctggacatggatgctgttctgtcagattttgtccgt tccacaggagcagagccagggctagcgcgagatctcctagaaggaaagaattgggatgtg aatgccgccctcagtgattttgaacagctacgtcaagtccatgctggaaacctaccccca tcctttagtgaggggagtggtggctccaggacccctgaaaaagggttttctgacagagag cctactcgccctccccgacccatcctccagcggcaggatgacatcgttcaagaaaaacgc ctgtctaggggcatctcccacgccagctccagcattgtttccctggcccggtcccatgtc tcctccaatggtgggggtggggggagcaatgagcaccccctggaaatgcccatctgtgcc ttccagcttccagatctcactgtatacaatgaagacttccgcagcttcatagagagagac ctcattgagcagtccatgctggttgccttggaacaggcagggcgtttgaactggtgggtg agtgtggatcccacctctcagaggctgcttcctttggcaactactggagatgggaactgc ctcctgcatgcagcctcccttggaatgtggggtttccatgatcgggacttgatgctgcgg aaagctttgtatgcactgatggagaagggagttgagaaggaagcgttgaaaaggcgctgg aggtggcagcagacacagcagaataaagagtcagggctggtatacacagaagatgaatgg cagaaggagtggaatgaactgatcaagcttgcctcaagtgaaccccgaatgcatctaggt accaatggagccaactgtggtgggcaaacagggtctggagtggacctccagcaaactcca acagacctgcagctgagggtcctgactgttagaaggaaaactaacagaaaggacatccac accaataccccatctgtatgtcaccatcatcaaagaccaaaggtagataaaaccacaaag atggggagaaaccagagcagaaaagctgaaaattctaaaaatcagagtgcctcttctcct ccaaaggaacgcagctcctcaccagcaacagaacaaagctggacggagaatgactttgac gagttgagagaagaaggcttcagacgatcggtaataacaaactctgagctaaaggaggat gttcgaacccatcgcaaagaagctaaaaaccttgaaaaaagattagacgaatggctaact agaataaacagcatagagaagaccttaaacgacctgatggagctgaaaaccatggcataa >gi568815597r:149843860_150077510|GENSCAN_predicted_peptide_11|47_aa MKAWATEQNPVKRRKEEKKRKKRRKRNKKKKKEEEEEEEEEEEESIP >gi568815597r:149843860_150077510|GENSCAN_predicted_CDS_11|144_bp atgaaagcctgggcgacagagcaaaatcctgttaaaagaagaaaagaagaaaagaagagg aagaagaggaggaagaggaataagaagaagaagaaggaggaggaggaggaggaggaagaa gaagaagaagaaagtataccatga >gi568815597r:149843860_150077510|GENSCAN_predicted_peptide_12|139_aa MEGEQVEKPDTKEKKPEVKKADAGGKVKKGNLKAKKPKNRKPHCSQNPVIVRGIGRYSPS AMYSRKATCKRKYSAVKSKVEKQKEKFPATITKPAGGGKNGGTQVVKLCKMPTYYLTEDV LSKLLSQGKKPFSQHMRKL >gi568815597r:149843860_150077510|GENSCAN_predicted_CDS_12|420_bp atggagggtgaacaagttgagaagccagatactaaagagaagaaacctgaagtcaagaag gctgatgctggtggcaaggtgaaaaagggtaacctcaaggctaagaagcccaagaacagg aagccccattgcagtcaaaatcctgtcattgtcagagggattggcagatattctccatct gctatgtattccagaaaggccacgtgcaagaggaagtactcagctgtgaaatccaaggtt gaaaagcaaaaggagaagtttcctgcaactattacaaaaccagctggtggtggcaagaac ggtggtacgcaggtggttaaactttgcaaaatgcctacatattatcttactgaagatgtg ctttcaaagctgttgagccaaggaaaaaaacccttcagtcagcacatgagaaaactgtga >gi568815597r:149843860_150077510|GENSCAN_predicted_peptide_13|123_aa MNVVFAVKQYISKMIEDSGPGMKVLLMDKETTGIVSMVYTQSEILQKEVYLFERIDSQNR EIMKHLKAICFLRPTKENVDYIIQELRRPKYTIYFIYFSNVISKSDVKSLAEADEQEVVA EVQ >gi568815597r:149843860_150077510|GENSCAN_predicted_CDS_13|369_bp atgaacgtggtttttgctgtgaagcagtacatttccaaaatgatagaggacagcgggcct ggtatgaaagtacttctcatggataaagagacgactggcatagtgagtatggtatacaca caatcggagattctacagaaggaagtgtacctctttgaacgcattgattctcaaaatcga gagatcatgaaacacctgaaggcaatttgttttcttcgacctacaaaggagaatgtggat tatattattcaggagctccgaagacccaaatacactatatatttcatttatttcagtaat gtgatcagcaagagtgacgtgaagtcattggctgaagctgatgaacaggaagttgtggct gaggttcag