GENSCAN 1.0 Date run: 7-Nov-116 Time: 22:57:52 Sequence gi568815590f:21943145_22148246 : 205102 bp : 50.13% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 15918 15977 60 2 0 37 64 71 0.103 0.85 1.02 Intr + 23713 23859 147 0 0 76 27 103 0.470 3.43 1.03 Intr + 26339 26432 94 1 1 98 88 12 0.503 1.74 1.04 Intr + 27000 27166 167 1 2 81 92 111 0.999 10.48 1.05 Intr + 28732 28824 93 0 0 68 100 58 0.795 5.16 1.06 Intr + 33212 33377 166 1 1 57 92 155 0.993 12.33 1.07 Intr + 34626 34699 74 1 2 111 105 -6 0.995 2.53 1.08 Intr + 36940 37059 120 0 0 65 111 0 0.399 0.69 1.09 Intr + 38587 38733 147 0 0 56 72 108 0.987 6.43 1.10 Intr + 39496 39668 173 0 2 84 94 152 0.998 14.14 1.11 Intr + 41502 41695 194 0 2 23 96 280 0.999 21.44 1.12 Intr + 42442 42547 106 2 1 82 123 53 0.943 7.47 1.13 Intr + 43997 44132 136 2 1 81 87 99 0.993 9.67 1.14 Intr + 44640 44713 74 2 2 51 101 84 0.994 4.20 1.15 Intr + 47206 47263 58 1 1 79 113 35 0.693 3.99 1.16 Intr + 47667 47775 109 2 1 97 54 31 0.940 0.46 1.17 Intr + 48724 48830 107 2 2 72 113 170 0.947 17.83 1.18 Intr + 52334 52455 122 1 2 20 87 7 0.131 -6.91 1.19 Intr + 55611 55693 83 0 2 104 115 72 0.977 10.78 1.20 Intr + 55947 56161 215 1 2 29 99 278 0.914 21.43 1.21 Intr + 56392 56530 139 0 1 154 99 22 0.971 9.94 1.22 Intr + 58968 59128 161 1 2 115 110 192 0.974 23.81 1.23 Intr + 60075 60173 99 2 0 100 100 99 0.999 12.61 1.24 Intr + 60759 60886 128 2 2 116 109 158 0.999 20.18 1.25 Term + 61851 61944 94 0 1 115 47 117 0.976 7.60 1.26 PlyA + 62900 62905 6 1.05 2.00 Prom + 63513 63552 40 -2.96 2.01 Init + 65682 65833 152 2 2 57 70 65 0.235 1.11 2.02 Term + 67084 67516 433 1 1 84 46 144 0.500 4.57 2.03 PlyA + 68205 68210 6 1.05 3.00 Prom + 70089 70128 40 -2.86 3.01 Init + 75606 75665 60 2 0 71 110 34 0.746 5.15 3.02 Intr + 81955 82162 208 0 1 46 84 131 0.589 7.15 3.03 Intr + 82292 82377 86 0 2 139 90 -3 0.996 4.54 3.04 Intr + 82503 82628 126 2 0 54 62 147 0.989 9.58 3.05 Intr + 89986 90079 94 0 1 142 115 13 0.903 8.94 3.06 Intr + 90965 91131 167 0 2 71 81 261 0.971 23.38 3.07 Intr + 91366 91400 35 0 2 105 88 43 0.963 3.02 3.08 Intr + 93349 93382 34 1 1 95 113 17 0.919 3.13 3.09 Intr + 98418 98522 105 2 0 64 36 89 0.133 1.71 3.10 Intr + 100001 100037 37 1 1 102 96 16 0.167 1.64 3.11 Intr + 100311 100363 53 1 2 88 94 14 0.167 0.63 3.12 Intr + 102970 103147 178 0 1 16 77 300 0.261 21.19 3.13 Intr + 103383 103489 107 1 2 82 100 193 0.987 19.83 3.14 Intr + 104812 105101 290 0 2 112 18 477 0.903 39.14 3.15 Intr + 106316 106450 135 2 0 87 76 65 0.852 4.88 3.16 Term + 107726 107864 139 2 1 78 48 93 0.793 1.74 3.17 PlyA + 109198 109203 6 1.05 4.00 Prom + 118214 118253 40 -2.76 4.01 Init + 123732 123788 57 2 0 38 39 139 0.814 3.35 4.02 Intr + 124383 124538 156 2 0 89 78 341 0.897 33.41 4.03 Intr + 125872 125916 45 0 0 116 105 -2 0.876 2.91 4.04 Intr + 126275 126374 100 1 1 79 82 15 0.938 -0.32 4.05 Intr + 126737 126793 57 0 0 125 78 58 0.980 7.36 4.06 Intr + 127038 127190 153 1 0 85 49 205 0.992 16.34 4.07 Intr + 129182 129306 125 0 2 107 64 155 0.190 15.30 4.08 Intr + 130586 130691 106 1 1 126 76 55 0.187 7.89 4.09 Intr + 137036 137100 65 0 2 86 90 13 0.817 -0.26 4.10 Intr + 137268 137316 49 2 1 134 94 9 0.822 4.35 4.11 Intr + 137474 137601 128 0 2 145 74 0 0.877 4.70 4.12 Intr + 137661 137726 66 2 0 95 100 6 0.752 1.60 4.13 Intr + 137969 138049 81 1 0 83 80 109 0.953 9.43 4.14 Term + 138206 138319 114 1 0 57 45 160 0.943 7.17 4.15 PlyA + 141713 141718 6 1.05 5.00 Prom + 142760 142799 40 -5.96 5.01 Init + 146110 146154 45 0 0 94 77 148 0.999 12.99 5.02 Intr + 151296 151374 79 2 1 110 80 196 0.989 20.12 5.03 Intr + 153193 153365 173 2 2 41 59 305 0.821 22.56 5.04 Intr + 154372 154476 105 0 0 59 94 144 0.816 12.51 5.05 Intr + 154573 154695 123 0 0 104 55 183 0.995 17.38 5.06 Intr + 154924 155196 273 0 0 91 23 195 0.500 11.03 5.07 Intr + 155279 155475 197 1 2 66 90 255 0.992 21.71 5.08 Intr + 155804 155912 109 2 1 123 101 90 0.999 14.09 5.09 Intr + 156140 156216 77 1 2 57 86 125 0.998 7.51 5.10 Intr + 156560 156749 190 2 1 83 77 220 0.997 19.99 5.11 Intr + 157450 157595 146 0 2 77 95 230 0.999 21.68 5.12 Intr + 157700 157828 129 2 0 66 75 240 0.999 20.31 5.13 Intr + 158296 158386 91 1 1 62 105 158 0.997 14.90 5.14 Intr + 158564 158707 144 1 0 85 70 121 0.636 10.48 5.15 Intr + 159031 159165 135 0 0 80 73 75 0.492 5.96 5.16 Intr + 159384 159484 101 2 2 96 73 144 0.999 12.61 5.17 Term + 159649 159787 139 1 1 104 52 192 0.859 14.64 5.18 PlyA + 161214 161219 6 1.05 6.23 PlyA - 162624 162619 6 1.05 6.22 Term - 164751 164156 596 1 2 123 44 545 0.997 48.29 6.21 Intr - 165202 164989 214 1 1 69 80 252 0.965 20.79 6.20 Intr - 166799 165995 805 1 1 72 99 575 0.405 48.46 6.19 Intr - 167209 167107 103 2 1 52 35 60 0.214 -3.67 6.18 Intr - 169712 169610 103 2 1 96 91 27 0.322 3.55 6.17 Intr - 173284 173156 129 1 0 91 109 41 0.969 7.39 6.16 Intr - 173895 173731 165 0 0 72 49 146 0.915 9.26 6.15 Intr - 175921 175806 116 2 2 113 77 71 0.829 8.67 6.14 Intr - 176139 176020 120 1 0 101 100 141 0.999 17.07 6.13 Intr - 176782 176616 167 0 2 40 101 100 0.578 6.10 6.12 Intr - 177029 176960 70 0 1 73 89 70 0.951 3.84 6.11 Intr - 177363 177198 166 0 1 107 94 -37 0.932 -1.67 6.10 Intr - 177814 177572 243 1 0 111 94 266 0.988 27.19 6.09 Intr - 179464 179349 116 2 2 70 110 107 0.971 11.27 6.08 Intr - 179735 179646 90 0 0 128 84 70 0.986 10.57 6.07 Intr - 180669 180505 165 1 0 86 91 90 0.518 9.03 6.06 Intr - 182360 182167 194 1 2 135 61 174 0.794 18.54 6.05 Intr - 183082 182764 319 2 1 64 -6 175 0.708 0.72 6.04 Intr - 184685 183893 793 2 1 95 116 365 0.628 31.04 6.03 Intr - 186066 185415 652 2 1 44 113 345 0.520 24.51 6.02 Intr - 187459 187284 176 1 2 65 115 209 0.996 20.04 6.01 Init - 188705 188436 270 2 0 56 94 147 0.682 7.24 6.00 Prom - 192891 192852 40 -9.16 7.10 PlyA - 194894 194889 6 1.05 7.09 Term - 195408 195343 66 0 0 128 46 117 0.999 9.44 7.08 Intr - 195649 195495 155 2 2 90 78 216 0.848 20.59 7.07 Intr - 195917 195782 136 2 1 107 85 148 0.976 16.54 7.06 Intr - 196385 196086 300 2 0 90 68 309 0.631 26.03 7.05 Intr - 196939 196819 121 0 1 116 75 217 0.999 23.70 7.04 Intr - 197104 197028 77 1 2 118 60 18 0.994 0.31 7.03 Intr - 197402 197292 111 2 0 90 68 38 0.829 2.58 7.02 Intr - 197553 197463 91 2 1 93 57 71 0.664 4.50 7.01 Init - 198338 198307 32 2 2 98 59 51 0.667 0.58 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 23149 23169 21 0 0 71 100 -12 0.862 -1.98 S.002 Intr + 129182 129301 120 0 0 107 61 161 0.809 15.77 S.003 Term + 130586 130671 86 0 2 126 47 65 0.808 4.02 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:21943145_22148246|GENSCAN_predicted_peptide_1|1021_aa MNDCRSTLSEYGFDDYKYILSLAQLENLCKQLYETTDTTTRLQAEKALVEFTNSPDCLSK CQLLLERGSSSYSQLLAATCLTKLVSRTNNPLPLEQRIDIRNYVLNYLATRPKLATFVTQ ALIQLYARITKLGWFDCQKDDYVFRNAITDVTRFLQDSVEYCIIGVTILSQLTNEINQVS ATAFLIEASGKNLNLNDESQHGLLMQLLKLTHNCLNFDFIGTSTDESSDDLCTVQIPTSW RSAFLDSSTLQLFFDLYHSIPPSFSPLVLSCLVQIASVRRSLFNNAERAKFLSHLVDGVK RILENPQSLSDPNNYHEFCRLLARLKSNYQLGELVKVENYPEVIRLIANFTVTSLQHWEF APNSVHYLLSLWQRLAASVPYVKATEPHMLETYTPEVTKAYITSRLESVHIILRDGLEDP LEDTGLVQQQLDQLSTIGRCEYEKTCALLVQLFDQSAQSYQELLQSASASPMDIAVQEGR LTWLVYIIGAVIGGRVSFASTDEQDAMDGELVCRVLQLMNLTDSRLAQAGNEKLELAMLS FFEQFRKIYIGDQVQKSSKLYRRLSEVLGLNDETMVLSVFIGKISVRKLVKLSAVQFMLN NHTSEHFSFLGINNQSNLTDMRCRTTFYTALGRLLMVDLGEDEDQYEQFMLPLTAAFEAV AQMFSTNSFNEQEAKLLIYRYPSYMPILQRAIELWYHDPACTTPVLKLMAELVHNRSQRL QFDVSSPNGILLFRETSKMITMYGNRILTLGEVPKDQVYALKLKGISICFSMLKAALSGS YVNFGVFRLYGDDALDNALQTFIKLLLSIPHSDLLDYPKLSQSYYSLLEVLTQDHMNFIA SLEPHVIMYILSSISEGLTALDTMVCTGCCSCLDHIVTYLFKQLSRSTKKRTTPLNQESD RFLHIMQQHPEMIQQMLSTVLNIIIFEDCRNQWSMSRPLLGLILLNEKYFSDLRNSIVNS QPPEKQQAMHLCFENLMEGIERNLLTKNRDRFTQNLSAFRREVNDSMKNSTYGVNSNDMM S >gi568815590f:21943145_22148246|GENSCAN_predicted_CDS_1|3066_bp atgaatgactgcagaagcaccctgagcgaatatggatttgatgattacaaatacatttta agcctggcccaactagagaatctgtgcaaacagctgtatgaaaccacagacacaaccact cgactccaggcagagaaagccttggttgaatttaccaacagccctgattgcctgagcaag tgccagctactcctcgaaagaggaagttcctcttactcccagttactggcagctacatgc cttaccaagcttgtatcacgcacaaacaaccccctaccattggaacagcgaatagatatt cggaactatgtgctcaactaccttgccactcggccgaagttggctactttcgtgacacaa gcacttattcagttatatgccagaatcacaaaactgggctggtttgactgtcagaaggat gactatgtcttcagaaatgcaatcacagacgtcacaaggtttttacaggatagtgttgaa tactgcatcattggtgtcacaattttatctcagctaaccaatgaaattaatcaagtaagt gctacagccttcctcattgaagcttcaggaaagaatctaaacttgaatgatgaaagtcag catggcttgctcatgcaactgctcaagctcactcataactgcctcaactttgacttcatc ggcacttccactgatgagtcctcagacgacctgtgtacagtgcagattcccaccagctgg agatcagccttcttagattcttcaaccttgcagctgttttttgacctgtatcattccatc cctccttcattttcacctctggtattatcctgcttggtacagatcgcttcagtcagaaga tccctgtttaacaatgcagagagggccaagtttctctctcatcttgttgatggtgttaaa cgaatactggaaaacccacagagtttatcagacccaaacaattaccatgagttttgcaga ctactggcccgattgaagagtaactatcaactgggagaattggtaaaggtggaaaactac cctgaggtcatccgattgatagccaacttcacagtgaccagcctacagcactgggaattt gctccaaatagtgtgcactatcttctgagcctgtggcagcggctggcagcctctgtgccg tatgtcaaagccacagagccccacatgctggaaacttacactcctgaggtcaccaaagcc tacatcacatcccggttggaatctgtgcacatcatactgagagatggcctggaagatccc ctggaggatacggggctggtccagcagcagttggaccagctgtccaccattgggcgttgt gaatatgagaagacgtgtgcactcctcgtgcagttgtttgaccagtcggcccagtcgtac caggagctgctacagagcgccagcgcaagcccaatggacattgcagtgcaggagggaagg ctgacatggctggtttacattattggagcagtgatcggtggccgggtttcttttgccagc actgatgagcaagacgccatggatggtgagcttgtctgtcgggtgctccagctgatgaac ctaacagattctcgtttggcccaggcgggtaatgagaagctagagttggccatgctgagc ttttttgaacagtttcgtaagatctacattggggaccaagtgcagaaatcctctaagctg taccgccgactctcagaagttctgggcttgaatgatgagaccatggtcctaagcgtcttc ataggaaaaattagcgtaaggaagctagtgaagcttagtgcggtacagttcatgctgaac aatcacacgagcgagcacttttcatttttgggtattaacaatcagtccaacctgacagac atgcggtgtcggactaccttctacacagcacttgggcgtctcctcatggtggatttagga gaggatgaagatcagtatgagcagttcatgctgccactcacagcagcatttgaggctgtg gcccagatgtttagcaccaatagtttcaacgagcaggaggcaaagcttctcatttacaga tatccatcctatatgccaattctccaacgggcaattgagctctggtaccatgatccagcc tgtactacacctgtactcaagttgatggctgaattggttcataataggtcccagcgactc cagtttgatgtctcttcccccaatggcatcttactcttccgagaaaccagcaagatgata acaatgtatggcaatcgcatcctgacactaggagaggtcccaaaggatcaggtctatgct ctgaagctcaagggcatctccatctgcttctccatgctgaaggctgctctcagtgggagt tacgtcaatttcggagtctttcgtctctatggagacgatgccctggacaatgctctgcag accttcatcaagctgctcctctctattcctcacagtgatctcttggattaccccaagctc agccagtcttattattcactactggaagtcctgacccaggaccatatgaactttattgca agcctggaacctcacgtcatcatgtatattctctcttccatttctgaaggacttactgca cttgacaccatggtatgcacaggctgctgctcctgcctggaccacattgtgacatacctc ttcaagcagctgtcacgtagcaccaagaagaggaccacacccctgaaccaggagagcgac cgctttctgcacatcatgcagcagcatccagagatgatccagcagatgctgtccacggtg ctgaacatcatcatctttgaagactgtaggaaccagtggtctatgtcccgaccactactt ggcttgatattgcttaatgaaaagtatttttctgacctaagaaacagtattgtgaacagc cagccaccggagaagcagcaggccatgcacctgtgttttgagaacctgatggaaggcatc gagcgaaatcttcttacgaaaaacagagacaggttcacccagaacctgtcagcattccgt cgagaagtcaacgactcaatgaagaattccacttatggcgtgaatagcaatgacatgatg agctga >gi568815590f:21943145_22148246|GENSCAN_predicted_peptide_2|194_aa MGRETVKVEDDIKFGKSFPFLRKAFWFSGQPDCYPLSSVHQILIQHLLWASAAAPEVISR VSSAGRWRLRTQVSTASVPALPQLSPPRAGAVQTRSWPGVLVLPLEERGLNCFAGAFLGK SRAGFLRPLGPHGAVCLAEVRVHTHRGLQQAAAVLISGAVPGLGAAAKDKAPAPKNAVAR GYPERDGHGAHGMP >gi568815590f:21943145_22148246|GENSCAN_predicted_CDS_2|585_bp atgggacgtgagactgtcaaggtagaggatgacatcaagtttggaaaatcattccccttc ctcagaaaagccttctggttctcaggacagcctgactgctaccctttgtcatctgttcat caaatacttattcagcatctattatgggccagcgctgcagccccggaagtaatttcgcga gtttcttccgccggaaggtggcgcctgcgcactcaggtgtccacggcctctgttccggct ctcccccagctttcgccgccgcgcgcaggcgcagtccagactcggtcctggccgggggtt ctagtgttgccgctggaagagcgaggtcttaattgctttgcgggagcgttcctggggaag tccagagctgggttcctgcggcccttgggcccccacggcgccgtgtgcctggcagaggtt cgagttcacacgcaccgtggcttgcagcaggcagccgcagtgctaatcagcggcgctgtt cccgggctgggtgcagctgctaaggacaaggcccctgctccgaagaacgcggtggctcgg ggataccctgaaagggacggccatggcgcacatgggatgccctag >gi568815590f:21943145_22148246|GENSCAN_predicted_peptide_3|617_aa MPTFTSRIQHSSVSTSQNNQVPSSRERPLSTCRCLLVSGILQSRVRVREQGLTRAPSLQL PGQPASLPGAMNLSSASSTEEKAVTTVLWGCELSQERRTWTFRPQLEGKQSCRLLLHTIC LGEKAKEEMHRVEILPPANQEDKKMQPVTIASLQASVLPMVSMVGVQLSPPVTFQLRAGS GPVFLSGQERYEASDLTWEEEEEEEGEEEEEEEEDDEDEDADISLEEQSPVKQVKRLVPQ KQASVAKKKKLEKEEEEIRASVRDKSPVKKGGNSMELSVHRSPGPRIPPLILKTQEGFCI PTMKQVLTAADSLLSNSDLRDGAWSGWKQVEVMAQGENHPSPNFNQYVRDQGAMTDQLSR RQIREYQLYSRTSGKHVQVTGRRISATAEDGNKFAKLIVETDTFGSRVRIKGAESEKYIC MNKRGKLIGKPSGKSKDCVFTEIVLENNYTAFQNARHEGWFMAFTRQGRPRQASRSRQNQ REAHFIKRLYQGQLPFPNHAEKQKQFEFVGSAPTRRTKRTRRPQPLTVLQVCVWGVELGG HKTRLFSALAFNLVEGEVFLAGTTPPPRHPTRWVFRLWKSVQLIPPELVVTSKMLKGRSH KNRSLMMALQRRFNMHI >gi568815590f:21943145_22148246|GENSCAN_predicted_CDS_3|1854_bp atgcccaccttcaccagtcgtattcaacacagttctgtaagtactagccagaacaaccag gtcccctccagccgcgagcgacccctcagtacctgccgatgcctgctggtctctggcatc ctccagtcgagggtcagggtcagggagcaaggcctcacgcgggcgccctccttgcagctg cccggccagcccgcttctctgcccggagccatgaatctcagtagcgccagtagcacggag gaaaaggcagtgacgaccgtgctctggggctgcgagctcagtcaggagaggcggacttgg accttcagaccccagctggaggggaagcagagctgcaggctgttgcttcatacgatttgc ttgggggagaaagccaaagaggagatgcatcgcgtggagatcctgcccccagcaaaccag gaggacaagaagatgcagccggtcaccattgcctcactccaggcctcagtcctccccatg gtctccatggtaggagtgcagctttctcccccagttactttccagctccgggctggctca ggacccgtgttcctcagtggccaggaacgttatgaagcatcagacctaacctgggaggag gaggaggaagaagaaggggaggaggaggaagaggaagaggaagatgatgaggatgaggat gcagatatatctctggaggagcaaagccctgtcaaacaagtcaaaaggctggtgccccag aagcaggcgagcgtggctaagaaaaaaaagctggaaaaagaagaagaggaaataagagcc agcgttagagacaagagccctgtgaaaaagggagggaacagcatggagctctctgttcac cggtctccaggacctcggattccacctttaatcctgaaaacccaggaaggcttctgtatc cctacaatgaagcaggtgcttacagctgctgattctctgctgtcaaactcagacttgagg gatggagcctggagcggttggaagcaggtggaggtgatggctcagggggagaatcacccg tctcctaattttaaccagtacgtgagggaccagggcgccatgaccgaccagctgagcagg cggcagatccgcgagtaccaactctacagcaggaccagtggcaagcacgtgcaggtcacc gggcgtcgcatctccgccaccgccgaggacggcaacaagtttgccaagctcatagtggag acggacacgtttggcagccgggttcgcatcaaaggggctgagagtgagaagtacatctgt atgaacaagaggggcaagctcatcgggaagcccagcgggaagagcaaagactgcgtgttc acggagatcgtgctggagaacaactatacggccttccagaacgcccggcacgagggctgg ttcatggccttcacgcggcaggggcggccccgccaggcttcccgcagccgccagaaccag cgcgaggcccacttcatcaagcgcctctaccaaggccagctgcccttccccaaccacgcc gagaagcagaagcagttcgagtttgtgggctccgcccccacccgccggaccaagcgcaca cggcggccccagcccctcacggtcctccaggtgtgcgtgtggggagtggaactcgggggc cataaaacccgcctcttcagtgcgctggcttttaatcttgtcgagggggaggtcttcctc gcagggacaaccccccccccccgccaccccaccaggtgggtcttcaggctctggaagtct gttcagctcattccccctgaactggtggtgacttcgaagatgctaaaaggaagaagccac aagaatagatccctaatgatggcacttcagaggagatttaacatgcacatctag >gi568815590f:21943145_22148246|GENSCAN_predicted_peptide_4|433_aa MERLQKVRGAAPGRGRRGRAKMDNQVLGYKDLAAIPKDKAILDIERPDLMIYEPHFTYSL LEHVELPRSRERSLSPKSTSPPPSPEVWADSRSPGIISQASAPRTTGTPRTSLPHFHHPE TSRPDSNIYKKPPIYKQRESVGGSPQTKHLIEDLIIESSKFPAAQPPDPNQPAKIETDYW PCPPSLAVVETEWRKRKASRRGAEEEEEEEDDDSGEEMKALRERQREELSKVTSNLGKMI LKEEMEKSLPIRRKTRSLPDRTPFHTSLHQGTSKSSSLPAYGRTTLSRLQSTEFSPSGSE TGSPGLQVSASWRGTGRAAEAGSWHPKVSGEGGCGATELPGEEEEAGNGEGQRGRMDRGN SLPCVLEQKIYPYEMLVVTNKGRTKLPPGVDRMRLERHLSAEDFSRVFAMSPEEFGKLAL WKRNELKKKASLF >gi568815590f:21943145_22148246|GENSCAN_predicted_CDS_4|1302_bp atggaacggctgcagaaggtgcgcggcgccgccccgggccggggccgccgagggcgggcc aagatggacaatcaggtgctgggctacaaggacctggctgccatccccaaggacaaggcc atcctggacatcgagcggcccgacctcatgatctacgagcctcacttcacttattccctc ctggaacacgtggagctgcctcgcagccgcgagcgctcgctgtcacccaaatccacatcc cccccaccatccccagaggtgtgggcggacagccggtcgcctggaatcatctctcaggcc tcggcccccagaaccactggaaccccccggaccagcctgccccatttccaccaccctgag acctcccgcccagattccaacatctacaagaagcctcccatctataagcagagagagtcc gtgggaggcagccctcagaccaagcacctcatcgaggatctcatcatcgagtcatccaag tttcctgcagcccagcccccagaccccaaccagccagccaaaatcgaaaccgactactgg ccatgccccccgtctctggctgttgtggagacagaatggaggaagcggaaggcgtctcgg aggggagcagaggaagaggaggaggaggaagatgacgactctggagaggagatgaaggct ctcagggagcgtcagagagaggaactcagtaaggttacttccaacttgggaaagatgatc ttgaaagaagagatggaaaagtcattgccgatccgaaggaaaacccgctctctgcctgac cggacacccttccatacctccttgcaccagggaacgtctaaatcttcctctctccccgcc tatggcaggaccaccctgagccggctacagtccacagagttcagcccatcagggagtgag actggaagcccaggcctgcaggtgagtgcctcctggaggggaacaggcagagcagcagaa gctgggtcctggcacccgaaagtgtcaggggaaggggggtgtggggccacagagttgcct ggtgaagaagaagaagccgggaacggagagggccagagggggaggatggaccgggggaac tccctgccctgtgtgctggagcagaagatctatccctatgaaatgctagtggtgaccaac aaggggcgaaccaagctgccaccgggggtggatcggatgcggcttgagaggcatctgtct gccgaggacttctcaagggtatttgccatgtcccctgaagagtttggcaagctggctctg tggaagcggaatgagctcaagaagaaggcctctctcttctga >gi568815590f:21943145_22148246|GENSCAN_predicted_peptide_5|751_aa MLSRLGALLQEAVGAREPSIDLLQAFVEHWKGITHYYIESTDESTPAKKTDIPWRLKQML DILVYEEQQQAAAGEAGPCLEYLLQHKILETLCTLGKAEYPPGMRQQVFQFFSKVLAQVQ HPLLHYLSVHRPVQKLLRLGGTASGSVTEKEEVQFTTVLCSKIQQDPELLAYILEGKKIV GRKKACGEPTALPKDTTSHGDKDCSHDGAPARPQLDGESCGAQALNSHMPAETEELDGGT TESNLITSLLGLCQSKVLAAEMERLGKSRVALKAQENLLLLVSMASPAAATYLVQSSACC PAIVRHLCQLYRSMPVFLDPADIATLEGISWRLPSAPSDEASFPGKEALAAFLGWFDYCD HLITEAHTVVADALAKAVAENFFVETLQPQLLHVSEQSILTSTALLTAMLRQLRSPALLR EAVAFLLGTDRQPEAPGDNPHTLYAHLIGHCDHLSDEISITTLRLFEELLQKPHEGIIHS LVLRNLEGRPYVAWGSPEPESYEDTLDLEEDPYFTDSFLDSGFQTPAKPRLAPATSYDGK TAVTEIVNSFLCLVPEEAKTSAFLEETGYDTYVHDAYGLFQECSSRVASWGWPLTPTPLD PHEPERPFFEGHFLRVLFDRMSRILDQPYSLNLQVTSVLSRLALFPHPHIHEYLLDPYIS LAPGCRSLFSVLVIGDLMQRIQRVPQFPGKLLLVRKQLTGQAPGEQLDHQTLLQGVVVLE EFCKELAAIAFVKFPPHDPRQNVSPAPEGQV >gi568815590f:21943145_22148246|GENSCAN_predicted_CDS_5|2256_bp atgctgagccggctcggggcgctgctgcaggaagccgtgggggcgcgcgagcccagcatt gacctgctgcaggccttcgtggagcactggaagggcatcacgcactactacatcgagagc acagatgaaagcacccccgccaagaagacagacattccctggcggctgaagcagatgctg gatatcctggtgtatgaagagcagcagcaggcggccgcgggtgaggcagggccctgcctg gagtacctgctgcagcacaagatcctggagactctctgcacgctgggcaaggccgagtac cccccaggcatgcggcagcaggtgttccagttcttcagcaaggttctggcgcaggtgcag caccccctgctgcattacctcagcgtccacaggcctgtgcagaaactcctccgacttggt gggactgcttccggatccgttacagaaaaggaggaggtgcagttcaccaccgtcctctgc tccaagatccagcaggacccagagctgctcgcctacatcctggaaggtaaaaagattgta ggtaggaagaaagcatgcggagaacccactgccctgcctaaggacacaaccagccacggg gacaaggactgctcccacgatggtgctcctgccaggccccagctggacggggagtcctgt ggggcccaggccttgaacagccacatgcctgctgagaccgaggagctggacggtgggacc acagagagcaacctgattacctccctgcttgggctgtgccagagcaaggtgctggcagca gagatggagaggttggggaagagtcgggtggccttgaaggcccaggagaacctgctgctc ctggtgagcatggcctccccagcagctgccacctacctggtacagagcagcgcctgctgc cctgcgatcgtccggcacctttgccagttgtaccggtccatgcctgtcttcctggacccc gcagacattgccaccttagagggcatcagctggaggttacccagtgccccgtctgatgag gcttccttccctggcaaggaggccttggctgccttcttgggctggtttgattactgcgac cacctcatcacagaggcacacacggtggttgcggacgccttggcgaaggctgtggctgag aacttcttcgtggagaccctgcagccccagctcctgcacgtgtccgagcagagcatcttg acctccaccgccctcctcacagccatgctgcgccagcttcgctcccctgcgctgctgcgg gaggccgtggctttcctcctgggcacagaccggcagcctgaagcccccggggacaacccc cacaccctgtatgctcatctcatcgggcattgtgaccacctctctgatgagatcagcatc accacactccggctgtttgaggagctgctgcagaagccccacgaggggatcatccacagc ctggtcctgcgcaaccttgagggccgcccttacgtggcctggggctcaccagagcctgag agctatgaggacaccctagacctggaggaagacccctacttcaccgacagcttcctggat tccggctttcaaactcccgcaaagcctcgcctagctcctgctaccagttacgatggcaaa acagcagtgaccgagatcgtcaacagtttcctgtgcctggtccccgaggaagccaagacc tctgccttcctggaggagacaggctatgacacatacgtccacgatgcttatggcctgttc caggagtgcagctcccgcgtcgcctcctggggctggcctctgacccccacacctttggac ccccatgagcccgagcgacctttcttcgagggccacttcctccgagtgctgtttgaccgc atgtcccggattctggatcagccatacagcctgaacctgcaggtgacctcggtcctgtcc cggcttgccctcttcccccacccccatattcatgagtacctgctggatccgtacatcagc ctggcccccggctgcaggagcctattctccgtgttggtgatcggggacttgatgcagaga atccagagggtaccccagttcccaggcaagctgctcctggtgcgcaagcagttgacgggc caggctcctggggagcagctggaccaccagaccctcctccagggcgtggtggtgctggag gagttctgcaaggagctggctgccattgccttcgtcaagtttcccccacatgatcctcgc cagaacgtctccccagccccggaagggcaggtctga >gi568815590f:21943145_22148246|GENSCAN_predicted_peptide_6|1923_aa MGASPAPGLEAPGTLGMGQTLTARGLEPVSELIWAHDLSRHLAKSRPLDQGQTQAQALPE VLGRNPLCPWALEEVWEGFGVEDGKEQLGQSAPAAPGEGERERAEQTERENASSPAGREA PELAHGEQAPGAGHDDRHRPRRDRPVKPRDPPLGEPHEGRRVMESTPSFLKGTPTWEKTA PENGIVRQEPGSPPRDGLHHGPLCLGEPAPFWRGVLSTPDSWLPPGFPQGPKDMLPLVEG EGPQNGERKVNWLGSKEGLRWKEAMLTHPLAFCGPACPPRCGPLMPEHSGGHLKSDPVAF RPWHCPFLLETKILERAPFWVPTCLPPYLVSGLPPEHPCDWPLTPHPWVYSGGQPKVPSA FSLGSKGFYYKDPSIPRLAKEPLAAAEPGLFGLNSGGHLQRAGEAERPSLHQRDGEMGAG RQQNPCPLFLGQPDTVPWTSWPACPPGLVHTLGNVWAGPGDGNLGYQLGPPATPRCPSPE PPVTQRGCCSSYPPTKGGGLGPCGKCQEGLEGGASGASEPSEEVNKASGPRACPPSHHTK LKKTWLTRHSEQFECPRGCPEVEERPVARLRALKRAGSPEVQGAMGSPAPKRPPDPFPGT AEQGAGGWQEVRDTSIGNKDVDSGQHDEQKGLCPGWEFVGPFAFLGYCTCGLCRNSLALL TFRRPPSTLGLSPHVTQPVVSLSPQWSYPKSTLAGSCTCVRTIAPAAQSLGLNTPDNLFA RLPVHVTASSDSLLSLASPLGGELQQEEDTATNSSSEEGPGSGPDSRLSTGLAKHLLSGL GDRLCRLLRREREALAWAQREGQGPAVTEDSPGIPRCCSRCHHGLFNTHWRCPRCSHRLC VACGRVAGTGRAREKAGFQEQSAEECTQEAGHAACSLMLTQFVSSQALAELSTAMHQVWV KFDIRGHCPCQADARVWAPGDAGQQDDRITNILDSIIAQVVERKIQEKALGPGLRAGPGL RKGLGLPLSPVRPRLPPPGALLWLQEPQPCPRRGFHLFQEHWRQGQPVLVSGIQRTLQGN LWGTEALGALGGQVQALSPLGPPQPSSLGSTTFWEGFSWPELRPKSDEGSVLLLHRALGD EDTSRLGLLGMTPYPDRVENLAASLPLPEYCALHGKLNLASYLPPGLALRPLEPQLWAAY GVSPHRGHLGTKNLCVEVADLVSILVHADTPLPAWHRAQKDFLSGLDGEGLWSPGSQVST VWHVFRAQDAQRIRRFLQMVCPAGAGALEPGAPGSCYLDAGLRRRLREEWGVSCWTLLQA PGEAVLVPAGAPHQVQGLVSTVSVTQHFLSPETSALSAQLCHQGPSLPPDCHLLYAQVGH NALEDIFGLFVNRSESMFICSSTVPENAAGPVLRWGEGGKAELGEEQALATTTPTLLSTR QVLHAGISFKAGVYVPHPTGHVTFITLWWNEKKGIWDMINSGNAIVCLRQQRDSGSRGRP RASVTSPDCRVTVAYPGGATRPAGKMTSPSELLQTSARSGSWRAGGGWETSRAHGTDRRQ KPGGVRWAPDPCPPSSRAAPGGPAPSVNAAGRPIRAGRGAAQPISGQSSRALPRSRALPR SRELPARCRRDWERAPQRTLARGSAQSVCEDPARRPPGDPMASEGLAGALASVLAGQGSS VHSCDSAPAGEPPAPVRLRKNVCYVVLAVFLSEQDEVLLIQEAKRECRGSWYLPAGRMEP GETIVEALQREVKEEAGLHCEPETLLSVEERGPSWVRFVFLARPTGGILKTSKEADAESL QAAWYPRTSLPTPLRAHDILHLVELAAQYRQQARHPLILPQELPCDLVCQRLVATFTSAQ TVWVLVGTVGMPHLPVTACGLDPMEQRGGMKMAVLRLLQECLTLHHLVVEIKGLLGLQHL GRDHSDGICLNVLVTVAFRSPGIQDEPPKVRGENFSWWKVMEEDLQSQLLQRLQGSSVVP VNR >gi568815590f:21943145_22148246|GENSCAN_predicted_CDS_6|5772_bp atgggcgcctctccagcccctggcctggaagcaccaggaaccctggggatggggcagacc ctcacagcccggggtctggagccggtgtcggagctcatctgggcccatgacctctccaga catttggcaaaatcaaggcccttagaccagggacagacccaagcccaggccctcccagag gtcctaggacgcaaccctttgtgcccttgggctctggaagaggtttgggaagggtttggg gtggaagatggcaaagagcagcttggccagagcgcccccgccgccccgggggaaggagag cgcgagcgcgctgagcagacagagcgggagaacgcgtcctcgcccgccggccgggaggcc ccggagctggcccatggggagcaggcgcccggtgccggccacgacgaccgccaccgcccg cgccgcgaccggccggtgaagcccagggacccccctctgggagagccccatgagggcagg agagtgatggagagtacgcccagcttcctgaagggcaccccaacctgggagaagacggcc ccagagaacggcatcgtgagacaggagcccggcagcccgcctcgagatggactgcaccat gggccgctgtgcctgggagagcctgctcccttttggaggggcgtcctgagcaccccagac tcctggcttccccctggcttcccccagggccccaaggacatgctcccacttgtggagggc gagggcccccagaatggggagaggaaggtcaactggctgggcagcaaagagggactgcgc tggaaggaggccatgcttacccatccgctggcattctgcgggccagcgtgcccacctcgc tgtggccccctgatgcctgagcatagtggtggccatctcaagagtgaccctgtggccttc cggccctggcactgccctttccttctggagaccaagatcctggagcgagctcccttctgg gtgcccacctgcttgccaccctacctagtgtctggcctgcccccagagcatccatgtgac tggcccctgaccccgcacccctgggtatactccgggggccagcccaaagtgccctctgcc ttcagcttaggcagcaagggcttttactacaaggatccgagcattcccaggttggcaaag gagcccttggcagctgcggaacctgggttgtttggcttaaactctggtgggcacctgcag agagccggggaggccgaacgcccttcactgcaccagagggatggagagatgggagctggc cggcagcagaatccttgcccgctcttcctggggcagccagacactgtgccctggacctcc tggcccgcttgtcccccaggccttgttcatactcttggcaacgtctgggctgggccaggc gatgggaaccttgggtaccagctggggccaccagcaacaccaaggtgcccctctcctgag ccgcctgtcacccagcggggctgctgttcatcctacccacccactaaaggtgggggtctt ggcccttgtgggaagtgccaggagggcctggaggggggtgccagtggagccagcgaaccc agcgaggaagtgaacaaggcctctggccccagggcctgtccccccagccaccacaccaag ctgaagaagacatggctcacacggcactcggagcagtttgaatgtccacgcggctgccct gaggtcgaggagaggccggttgctcggctccgggccctcaaaagggcaggcagccccgag gtccagggagcaatgggcagtccagcccccaagcggccaccggacccttttccaggcact gcagaacagggggctgggggttggcaggaggtgcgggacacatcgatagggaacaaggat gtggactcgggacagcatgatgagcagaaaggcctctgccctggctgggaatttgttggc cccttcgccttcttgggttactgcacctgtggcctgtgtcgaaactctctggcccttctc accttccggcgccctccgtccacgttgggattgtctcctcatgtcacgcagccagtggta tcactaagccctcagtggagctatccaaagtctacactggctggatcctgcacttgtgtc aggacgatcgccccagctgcccaaagccttgggctcaacacaccagacaacctcttcgct cggctgccggtccatgtcacggcatcctcagactccctgctcagcctggcatcgcctctg ggaggggagctgcagcaggaggaagacacagccaccaactccagctctgaggaaggccca gggtccggccctgacagccggctcagcacaggcctcgccaagcacctgctcagtggtttg ggggaccgactgtgccgcctgctgcggagggagcgggaggccctggcttgggcccagcgg gaaggccaagggccagccgtgacagaggacagcccaggcattccacgctgctgcagccgt tgccaccatggactcttcaacacccactggcgatgtccccgctgcagccaccggctgtgt gtggcctgtggtcgtgtggcaggcactgggcgggccagggagaaagcaggctttcaggag cagtccgcggaggagtgcacgcaggaggccgggcacgctgcctgttccctgatgctgacc cagtttgtctccagccaggctttggcagagctgagcactgcaatgcaccaggtctgggtc aagtttgatatccgggggcactgcccctgccaagctgatgcccgggtatgggcccccggg gatgcaggccagcaggatgaccgcatcaccaacatcctggacagcattatcgcacaggtg gtggaacggaagatccaggagaaagccctggggccggggcttcgagctggcccgggtctg cgcaagggcctgggcctgcccctctctccagtgcggccccggctgcctcccccaggggct ttgctgtggctgcaggagccccagccttgccctcggcgtggcttccacctcttccaggag cactggaggcagggccagcctgtgttggtgtcagggatccaaaggacattgcagggcaac ctgtgggggacagaagctcttggggcacttggaggccaggtgcaggcgctgagccccctc ggacctccccagcccagcagcctgggcagcacaacattctgggagggcttctcctggcct gagcttcgcccaaagtcagacgagggctctgtcctcctgctgcaccgagctttgggggat gaggacaccagcaggttgggcctgctgggcatgaccccctaccctgacagggtggagaac ctagctgccagtctgccacttccggagtactgcgccctccatggaaaactcaacctggct tcctacctcccaccgggccttgccctgcgtccactggagccccagctctgggcagcctat ggtgtgagcccgcaccggggacacctggggaccaagaacctctgtgtggaggtggccgac ctggtcagcatcctggtgcatgccgacacaccactgcctgcctggcaccgggcacagaaa gacttcctttcaggcctggacggggaggggctctggtctccgggcagccaggtcagcact gtgtggcacgtgttccgggcacaggacgcccagcgcatccgccgctttctccagatggtg tgcccggccggggcaggcgccctggagcctggcgccccaggcagctgctacctggatgca gggctgcggcggcgcctgcgggaggagtggggcgtgagctgctggaccctgctccaggcc cccggagaggccgtgctggtgcctgcaggggctccccaccaggtgcagggcctggtgagc acagtcagcgtcactcagcacttcctctcccctgagacctctgccctctctgctcagctc tgccaccagggacccagccttccccctgactgccacctgctttatgcccaggtgggtcac aatgcccttgaggacatatttggtctctttgtaaacagaagtgaatcaatgttcatttgt tccagcactgttcctgagaatgcagccggaccagtgctgaggtggggtgaaggaggaaag gccgagctgggagaggagcaggcgctcgccacaaccacccccaccctgctctccacgcgc caggtcctgcacgctggaatctccttcaaggcaggggtttatgtcccccaccccacgggc catgtgacatttataactctttggtggaatgagaagaaaggtatttgggatatgatcaac tccggcaatgccattgtgtgtttacggcaacagcgggacagtggttccagggggcggccc cgggcctccgtgacgtcaccggattgtcgcgtcaccgtcgcctaccccggcggcgcaacg cgccctgcaggaaagatgacgtcaccgtcggagctcctgcagaccagtgcgcgctcgggg agttggcgagcgggtggcggctgggagacgtcccgagcgcacgggactgacaggcggcag aagccgggcggggtccgctgggctccggacccgtgcccccccagttccagggcggccccg ggcggccccgccccctcggtgaatgccgcgggccggccaatccgggcaggccgcggcgcc gcgcagcctatcagcggccagagctcgcgtgcgcttccgcgttcgcgtgcgcttccgcgt tctcgtgagctcccggcccgctgccgcagggactgggagcgggctccgcagcgcactcta gcccgcggctcggctcagtcggtctgcgaggatccggcccgccgccccccgggggacccg atggcctcggagggcctggcgggggcgctggcttccgtgctggctggccaggggtccagc gtgcacagctgcgactcggcgccggccggggagccgccggcgcccgtgcggctgcggaag aacgtgtgctacgtggtgctggccgtgttcctcagcgagcaggatgaggtgctactgatc caggaggccaagagggagtgccgggggtcgtggtacctgcctgcggggagaatggagcca ggggagaccatcgtggaggcgctgcagcgggaggtgaaggaggaggcggggctgcactgt gagcccgagacactgctgtccgtggaggagcggggcccctcctgggtccgcttcgtgttc ctcgctcgccccacaggtggaattctcaagacttccaaggaggccgatgcggagtccctg caggctgcctggtacccacggacctccctgcccactccgctgcgagcccatgacatcctg cacctggttgaactagccgcccagtatcgccagcaagccaggcaccctctcattctgccc caagagctaccctgtgatctggtctgccagcggctcgtggctacctttaccagcgcccag acagtgtgggtgttagtgggcacagtggggatgcctcacttgcctgtcactgcctgtggc ctcgaccctatggagcagaggggtggcatgaagatggccgtcctgcggctgctgcaggag tgtctgaccctgcaccacttggtggtggagatcaaggggttgcttggactgcagcacctg ggccgagatcacagtgatggcatctgtttgaatgtgctggtgaccgtggcttttcggagc ccagggatccaggatgaacccccaaaagttcggggtgagaacttctcttggtggaaggtg atggaggaagacctgcaaagccagctcctccagcggcttcagggatcctctgttgtccca gtgaacagatag >gi568815590f:21943145_22148246|GENSCAN_predicted_peptide_7|362_aa MVSWMICRLVVLVFGMLCPAYASYKAVKTKNIREYVSVGVWGQGCVAHPEWSGEGERMGP VQEGVDRGQAIGVPVEDLVRWMMYWIVFALFMAAEIVTDIFISWFPFYYEIKMAFVLWLL SPYTKGASLLYRKFVHPSLSRHEKEIDAYIVQAKERSYETVLSFGKRGLNIAASAAVQAA TKVLWAPSPPAAPIPPLRPFGLIQLLCQGAQRWQPGSRGEKVTGGYWQKRGAGVRPSFLA LALSSQGALAGRLRSFSMQDLRSISDAPAPAYHDPLYLEDQVSHRRPPIGYRAGGLQDSD TEDECWSDTEAVPRAPARPREKPLIRSQSLRVVKRKPPVREGTSRSLKVRTRKKTVPSDV DS >gi568815590f:21943145_22148246|GENSCAN_predicted_CDS_7|1089_bp atggtgtcctggatgatctgtcgcctggtggtgctggtgtttgggatgctgtgtccagct tatgcttcctataaggctgtgaagaccaagaacattcgtgaatatgtgagcgtgggggtt tgggggcaaggctgtgtggcacatcctgagtggagtggggagggagagagaatgggccct gttcaagagggtgtagatagagggcaggccatcggggtgcctgtggaagatctggtgcgg tggatgatgtactggattgtttttgcactcttcatggcagcagagatcgttacagacatt tttatctcctggttccctttctactatgagatcaagatggccttcgtgctgtggctgctc tcaccctacaccaagggcgccagcctgctttaccgcaagtttgtccacccgtccctgtcc cgccatgagaaggagatcgacgcgtacatcgtgcaggccaaggagcgcagctacgagacc gtgctcagcttcgggaagcggggcctcaacattgccgcctccgctgctgtgcaggctgcc accaaggtgctctgggcccccagccctccagcagcccccatcccacccctaaggcccttt gggctcattcagctcctctgccagggagcccagagatggcagccggggagcaggggtgag aaggtgacgggggggtattggcagaagcgtggagctggagtcagaccttccttcctggct ctggcactgagcagtcagggggcgctggccggcaggctgcggagcttctccatgcaggac ctgcgctccatctctgacgcacctgcccctgcctaccatgaccccctctacctggaggac caggtgtcccaccggaggccacccattgggtaccgggccgggggcctgcaggacagcgac accgaggatgagtgttggtcagatactgaggcagtcccccgggcgccagcccggccccga gagaagcccctaatccgcagccagagcctgcgtgtggtcaagaggaagccaccggtgcgg gagggcacctcgcgctccctgaaggttcggacgaggaaaaagactgtgccctcagacgtg gacagctag