GENSCAN 1.0 Date run: 4-Nov-116 Time: 09:44:24

Sequence gi568815590f:21925249_22136679 : 211431 bp : 48.66% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  33814  33873   60  0  0   37   64    71 0.086   0.85
 1.02 Intr +  41609  41755  147  1  0   76   27   103 0.471   3.43
 1.03 Intr +  44235  44328   94  2  1   98   88    12 0.503   1.74
 1.04 Intr +  44896  45062  167  2  2   81   92   111 0.999  10.48
 1.05 Intr +  46628  46720   93  1  0   68  100    58 0.795   5.16
 1.06 Intr +  51108  51273  166  2  1   57   92   155 0.993  12.33
 1.07 Intr +  52522  52595   74  2  2  111  105    -6 0.995   2.53
 1.08 Intr +  54836  54955  120  1  0   65  111     0 0.399   0.69
 1.09 Intr +  56483  56629  147  1  0   56   72   108 0.987   6.43
 1.10 Intr +  57392  57564  173  1  2   84   94   152 0.998  14.14
 1.11 Intr +  59398  59591  194  1  2   23   96   280 0.999  21.44
 1.12 Intr +  60338  60443  106  0  1   82  123    53 0.943   7.47
 1.13 Intr +  61893  62028  136  0  1   81   87    99 0.993   9.67
 1.14 Intr +  62536  62609   74  0  2   51  101    84 0.994   4.20
 1.15 Intr +  65102  65159   58  2  1   79  113    35 0.693   3.99
 1.16 Intr +  65563  65671  109  0  1   97   54    31 0.940   0.46
 1.17 Intr +  66620  66726  107  0  2   72  113   170 0.947  17.83
 1.18 Intr +  70230  70351  122  2  2   20   87     7 0.131  -6.91
 1.19 Intr +  73507  73589   83  1  2  104  115    72 0.977  10.78
 1.20 Intr +  73843  74057  215  2  2   29   99   278 0.914  21.43
 1.21 Intr +  74288  74426  139  1  1  154   99    22 0.971   9.94
 1.22 Intr +  76864  77024  161  2  2  115  110   192 0.974  23.81
 1.23 Intr +  77971  78069   99  0  0  100  100    99 0.999  12.61
 1.24 Intr +  78655  78782  128  0  2  116  109   158 0.999  20.18
 1.25 Term +  79747  79840   94  1  1  115   47   117 0.976   7.60
 1.26 PlyA +  80796  80801    6                               1.05

 2.00 Prom +  81409  81448   40                              -2.96
 2.01 Init +  83578  83729  152  0  2   57   70    65 0.235   1.11
 2.02 Term +  84980  85412  433  2  1   84   46   144 0.500   4.57
 2.03 PlyA +  86101  86106    6                               1.05

 3.00 Prom +  87985  88024   40                              -2.86
 3.01 Init +  93502  93561   60  0  0   71  110    34 0.746   5.15
 3.02 Intr +  99851 100058  208  1  1   46   84   131 0.589   7.15
 3.03 Intr + 100188 100273   86  1  2  139   90    -3 0.996   4.54
 3.04 Intr + 100399 100524  126  0  0   54   62   147 0.989   9.58
 3.05 Intr + 107882 107975   94  1  1  142  115    13 0.903   8.94
 3.06 Intr + 108861 109027  167  1  2   71   81   261 0.971  23.38
 3.07 Intr + 109262 109296   35  1  2  105   88    43 0.963   3.02
 3.08 Intr + 111245 111278   34  2  1   95  113    17 0.919   3.13
 3.09 Intr + 116314 116418  105  0  0   64   36    89 0.133   1.71
 3.10 Intr + 117897 117933   37  2  1  102   96    16 0.167   1.64
 3.11 Intr + 118207 118259   53  2  2   88   94    14 0.167   0.63
 3.12 Intr + 120866 121043  178  1  1   16   77   300 0.261  21.19
 3.13 Intr + 121279 121385  107  2  2   82  100   193 0.987  19.83
 3.14 Intr + 122708 122997  290  1  2  112   18   477 0.903  39.14
 3.15 Intr + 124212 124346  135  0  0   87   76    65 0.852   4.88
 3.16 Term + 125622 125760  139  0  1   78   48    93 0.793   1.74
 3.17 PlyA + 127094 127099    6                               1.05

 4.00 Prom + 136110 136149   40                              -2.76
 4.01 Init + 141628 141684   57  0  0   38   39   139 0.814   3.35
 4.02 Intr + 142279 142434  156  0  0   89   78   341 0.897  33.41
 4.03 Intr + 143768 143812   45  1  0  116  105    -2 0.876   2.91
 4.04 Intr + 144171 144270  100  2  1   79   82    15 0.938  -0.32
 4.05 Intr + 144633 144689   57  1  0  125   78    58 0.980   7.36
 4.06 Intr + 144934 145086  153  2  0   85   49   205 0.992  16.34
 4.07 Intr + 147078 147202  125  1  2  107   64   155 0.190  15.30
 4.08 Intr + 148482 148587  106  2  1  126   76    55 0.187   7.89
 4.09 Intr + 154932 154996   65  1  2   86   90    13 0.817  -0.26
 4.10 Intr + 155164 155212   49  0  1  134   94     9 0.822   4.35
 4.11 Intr + 155370 155497  128  1  2  145   74     0 0.877   4.70
 4.12 Intr + 155557 155622   66  0  0   95  100     6 0.752   1.60
 4.13 Intr + 155865 155945   81  2  0   83   80   109 0.953   9.43
 4.14 Term + 156102 156215  114  2  0   57   45   160 0.943   7.17
 4.15 PlyA + 159609 159614    6                               1.05

 5.00 Prom + 160656 160695   40                              -5.96
 5.01 Init + 164006 164050   45  1  0   94   77   148 0.999  12.99
 5.02 Intr + 169192 169270   79  0  1  110   80   196 0.989  20.12
 5.03 Intr + 171089 171261  173  0  2   41   59   305 0.821  22.56
 5.04 Intr + 172268 172372  105  1  0   59   94   144 0.816  12.51
 5.05 Intr + 172469 172591  123  1  0  104   55   183 0.995  17.38
 5.06 Intr + 172820 173092  273  1  0   91   23   195 0.500  11.03
 5.07 Intr + 173175 173371  197  2  2   66   90   255 0.992  21.71
 5.08 Intr + 173700 173808  109  0  1  123  101    90 0.999  14.09
 5.09 Intr + 174036 174112   77  2  2   57   86   125 0.998   7.51
 5.10 Intr + 174456 174645  190  0  1   83   77   220 0.997  19.99
 5.11 Intr + 175346 175491  146  1  2   77   95   230 0.999  21.68
 5.12 Intr + 175596 175724  129  0  0   66   75   240 0.999  20.31
 5.13 Intr + 176192 176282   91  2  1   62  105   158 0.997  14.90
 5.14 Intr + 176460 176603  144  2  0   85   70   121 0.636  10.48
 5.15 Intr + 176927 177061  135  1  0   80   73    75 0.492   5.96
 5.16 Intr + 177280 177380  101  0  2   96   73   144 0.999  12.61
 5.17 Term + 177545 177683  139  2  1  104   52   192 0.859  14.64
 5.18 PlyA + 179110 179115    6                               1.05

 6.23 PlyA - 180520 180515    6                               1.05
 6.22 Term - 182647 182052  596  2  2  123   44   545 0.997  48.29
 6.21 Intr - 183098 182885  214  2  1   69   80   252 0.965  20.79
 6.20 Intr - 184695 183891  805  2  1   72   99   575 0.405  48.46
 6.19 Intr - 185105 185003  103  0  1   52   35    60 0.214  -3.67
 6.18 Intr - 187608 187506  103  0  1   96   91    27 0.322   3.55
 6.17 Intr - 191180 191052  129  2  0   91  109    41 0.969   7.39
 6.16 Intr - 191791 191627  165  1  0   72   49   146 0.915   9.26
 6.15 Intr - 193817 193702  116  0  2  113   77    71 0.829   8.67
 6.14 Intr - 194035 193916  120  2  0  101  100   141 0.999  17.07
 6.13 Intr - 194678 194512  167  1  2   40  101   100 0.578   6.10
 6.12 Intr - 194925 194856   70  1  1   73   89    70 0.951   3.84
 6.11 Intr - 195259 195094  166  1  1  107   94   -37 0.932  -1.67
 6.10 Intr - 195710 195468  243  2  0  111   94   266 0.988  27.19
 6.09 Intr - 197360 197245  116  0  2   70  110   107 0.971  11.27
 6.08 Intr - 197631 197542   90  1  0  128   84    70 0.986  10.57
 6.07 Intr - 198565 198401  165  2  0   86   91    90 0.518   9.03
 6.06 Intr - 200256 200063  194  2  2  135   61   174 0.794  18.54
 6.05 Intr - 200978 200660  319  0  1   64   -6   175 0.708   0.72
 6.04 Intr - 202581 201789  793  0  1   95  116   365 0.628  31.04
 6.03 Intr - 203962 203311  652  0  1   44  113   345 0.520  24.51
 6.02 Intr - 205355 205180  176  2  2   65  115   209 0.996  20.04
 6.01 Init - 206601 206332  270  0  0   56   94   147 0.517   7.24

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  41045  41065   21  1  0   71  100   -12 0.880  -1.98
S.002 Intr + 147078 147197  120  1  0  107   61   161 0.809  15.77
S.003 Term + 148482 148567   86  1  2  126   47    65 0.808   4.02

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590f:21925249_22136679|GENSCAN_predicted_peptide_1|1021_aa
MNDCRSTLSEYGFDDYKYILSLAQLENLCKQLYETTDTTTRLQAEKALVEFTNSPDCLSK
CQLLLERGSSSYSQLLAATCLTKLVSRTNNPLPLEQRIDIRNYVLNYLATRPKLATFVTQ
ALIQLYARITKLGWFDCQKDDYVFRNAITDVTRFLQDSVEYCIIGVTILSQLTNEINQVS
ATAFLIEASGKNLNLNDESQHGLLMQLLKLTHNCLNFDFIGTSTDESSDDLCTVQIPTSW
RSAFLDSSTLQLFFDLYHSIPPSFSPLVLSCLVQIASVRRSLFNNAERAKFLSHLVDGVK
RILENPQSLSDPNNYHEFCRLLARLKSNYQLGELVKVENYPEVIRLIANFTVTSLQHWEF
APNSVHYLLSLWQRLAASVPYVKATEPHMLETYTPEVTKAYITSRLESVHIILRDGLEDP
LEDTGLVQQQLDQLSTIGRCEYEKTCALLVQLFDQSAQSYQELLQSASASPMDIAVQEGR
LTWLVYIIGAVIGGRVSFASTDEQDAMDGELVCRVLQLMNLTDSRLAQAGNEKLELAMLS
FFEQFRKIYIGDQVQKSSKLYRRLSEVLGLNDETMVLSVFIGKISVRKLVKLSAVQFMLN
NHTSEHFSFLGINNQSNLTDMRCRTTFYTALGRLLMVDLGEDEDQYEQFMLPLTAAFEAV
AQMFSTNSFNEQEAKLLIYRYPSYMPILQRAIELWYHDPACTTPVLKLMAELVHNRSQRL
QFDVSSPNGILLFRETSKMITMYGNRILTLGEVPKDQVYALKLKGISICFSMLKAALSGS
YVNFGVFRLYGDDALDNALQTFIKLLLSIPHSDLLDYPKLSQSYYSLLEVLTQDHMNFIA
SLEPHVIMYILSSISEGLTALDTMVCTGCCSCLDHIVTYLFKQLSRSTKKRTTPLNQESD
RFLHIMQQHPEMIQQMLSTVLNIIIFEDCRNQWSMSRPLLGLILLNEKYFSDLRNSIVNS
QPPEKQQAMHLCFENLMEGIERNLLTKNRDRFTQNLSAFRREVNDSMKNSTYGVNSNDMM
S

>gi568815590f:21925249_22136679|GENSCAN_predicted_CDS_1|3066_bp
atgaatgactgcagaagcaccctgagcgaatatggatttgatgattacaaatacatttta
agcctggcccaactagagaatctgtgcaaacagctgtatgaaaccacagacacaaccact
cgactccaggcagagaaagccttggttgaatttaccaacagccctgattgcctgagcaag
tgccagctactcctcgaaagaggaagttcctcttactcccagttactggcagctacatgc
cttaccaagcttgtatcacgcacaaacaaccccctaccattggaacagcgaatagatatt
cggaactatgtgctcaactaccttgccactcggccgaagttggctactttcgtgacacaa
gcacttattcagttatatgccagaatcacaaaactgggctggtttgactgtcagaaggat
gactatgtcttcagaaatgcaatcacagacgtcacaaggtttttacaggatagtgttgaa
tactgcatcattggtgtcacaattttatctcagctaaccaatgaaattaatcaagtaagt
gctacagccttcctcattgaagcttcaggaaagaatctaaacttgaatgatgaaagtcag
catggcttgctcatgcaactgctcaagctcactcataactgcctcaactttgacttcatc
ggcacttccactgatgagtcctcagacgacctgtgtacagtgcagattcccaccagctgg
agatcagccttcttagattcttcaaccttgcagctgttttttgacctgtatcattccatc
cctccttcattttcacctctggtattatcctgcttggtacagatcgcttcagtcagaaga
tccctgtttaacaatgcagagagggccaagtttctctctcatcttgttgatggtgttaaa
cgaatactggaaaacccacagagtttatcagacccaaacaattaccatgagttttgcaga
ctactggcccgattgaagagtaactatcaactgggagaattggtaaaggtggaaaactac
cctgaggtcatccgattgatagccaacttcacagtgaccagcctacagcactgggaattt
gctccaaatagtgtgcactatcttctgagcctgtggcagcggctggcagcctctgtgccg
tatgtcaaagccacagagccccacatgctggaaacttacactcctgaggtcaccaaagcc
tacatcacatcccggttggaatctgtgcacatcatactgagagatggcctggaagatccc
ctggaggatacggggctggtccagcagcagttggaccagctgtccaccattgggcgttgt
gaatatgagaagacgtgtgcactcctcgtgcagttgtttgaccagtcggcccagtcgtac
caggagctgctacagagcgccagcgcaagcccaatggacattgcagtgcaggagggaagg
ctgacatggctggtttacattattggagcagtgatcggtggccgggtttcttttgccagc
actgatgagcaagacgccatggatggtgagcttgtctgtcgggtgctccagctgatgaac
ctaacagattctcgtttggcccaggcgggtaatgagaagctagagttggccatgctgagc
ttttttgaacagtttcgtaagatctacattggggaccaagtgcagaaatcctctaagctg
taccgccgactctcagaagttctgggcttgaatgatgagaccatggtcctaagcgtcttc
ataggaaaaattagcgtaaggaagctagtgaagcttagtgcggtacagttcatgctgaac
aatcacacgagcgagcacttttcatttttgggtattaacaatcagtccaacctgacagac
atgcggtgtcggactaccttctacacagcacttgggcgtctcctcatggtggatttagga
gaggatgaagatcagtatgagcagttcatgctgccactcacagcagcatttgaggctgtg
gcccagatgtttagcaccaatagtttcaacgagcaggaggcaaagcttctcatttacaga
tatccatcctatatgccaattctccaacgggcaattgagctctggtaccatgatccagcc
tgtactacacctgtactcaagttgatggctgaattggttcataataggtcccagcgactc
cagtttgatgtctcttcccccaatggcatcttactcttccgagaaaccagcaagatgata
acaatgtatggcaatcgcatcctgacactaggagaggtcccaaaggatcaggtctatgct
ctgaagctcaagggcatctccatctgcttctccatgctgaaggctgctctcagtgggagt
tacgtcaatttcggagtctttcgtctctatggagacgatgccctggacaatgctctgcag
accttcatcaagctgctcctctctattcctcacagtgatctcttggattaccccaagctc
agccagtcttattattcactactggaagtcctgacccaggaccatatgaactttattgca
agcctggaacctcacgtcatcatgtatattctctcttccatttctgaaggacttactgca
cttgacaccatggtatgcacaggctgctgctcctgcctggaccacattgtgacatacctc
ttcaagcagctgtcacgtagcaccaagaagaggaccacacccctgaaccaggagagcgac
cgctttctgcacatcatgcagcagcatccagagatgatccagcagatgctgtccacggtg
ctgaacatcatcatctttgaagactgtaggaaccagtggtctatgtcccgaccactactt
ggcttgatattgcttaatgaaaagtatttttctgacctaagaaacagtattgtgaacagc
cagccaccggagaagcagcaggccatgcacctgtgttttgagaacctgatggaaggcatc
gagcgaaatcttcttacgaaaaacagagacaggttcacccagaacctgtcagcattccgt
cgagaagtcaacgactcaatgaagaattccacttatggcgtgaatagcaatgacatgatg
agctga

>gi568815590f:21925249_22136679|GENSCAN_predicted_peptide_2|194_aa
MGRETVKVEDDIKFGKSFPFLRKAFWFSGQPDCYPLSSVHQILIQHLLWASAAAPEVISR
VSSAGRWRLRTQVSTASVPALPQLSPPRAGAVQTRSWPGVLVLPLEERGLNCFAGAFLGK
SRAGFLRPLGPHGAVCLAEVRVHTHRGLQQAAAVLISGAVPGLGAAAKDKAPAPKNAVAR
GYPERDGHGAHGMP

>gi568815590f:21925249_22136679|GENSCAN_predicted_CDS_2|585_bp
atgggacgtgagactgtcaaggtagaggatgacatcaagtttggaaaatcattccccttc
ctcagaaaagccttctggttctcaggacagcctgactgctaccctttgtcatctgttcat
caaatacttattcagcatctattatgggccagcgctgcagccccggaagtaatttcgcga
gtttcttccgccggaaggtggcgcctgcgcactcaggtgtccacggcctctgttccggct
ctcccccagctttcgccgccgcgcgcaggcgcagtccagactcggtcctggccgggggtt
ctagtgttgccgctggaagagcgaggtcttaattgctttgcgggagcgttcctggggaag
tccagagctgggttcctgcggcccttgggcccccacggcgccgtgtgcctggcagaggtt
cgagttcacacgcaccgtggcttgcagcaggcagccgcagtgctaatcagcggcgctgtt
cccgggctgggtgcagctgctaaggacaaggcccctgctccgaagaacgcggtggctcgg
ggataccctgaaagggacggccatggcgcacatgggatgccctag

>gi568815590f:21925249_22136679|GENSCAN_predicted_peptide_3|617_aa
MPTFTSRIQHSSVSTSQNNQVPSSRERPLSTCRCLLVSGILQSRVRVREQGLTRAPSLQL
PGQPASLPGAMNLSSASSTEEKAVTTVLWGCELSQERRTWTFRPQLEGKQSCRLLLHTIC
LGEKAKEEMHRVEILPPANQEDKKMQPVTIASLQASVLPMVSMVGVQLSPPVTFQLRAGS
GPVFLSGQERYEASDLTWEEEEEEEGEEEEEEEEDDEDEDADISLEEQSPVKQVKRLVPQ
KQASVAKKKKLEKEEEEIRASVRDKSPVKKGGNSMELSVHRSPGPRIPPLILKTQEGFCI
PTMKQVLTAADSLLSNSDLRDGAWSGWKQVEVMAQGENHPSPNFNQYVRDQGAMTDQLSR
RQIREYQLYSRTSGKHVQVTGRRISATAEDGNKFAKLIVETDTFGSRVRIKGAESEKYIC
MNKRGKLIGKPSGKSKDCVFTEIVLENNYTAFQNARHEGWFMAFTRQGRPRQASRSRQNQ
REAHFIKRLYQGQLPFPNHAEKQKQFEFVGSAPTRRTKRTRRPQPLTVLQVCVWGVELGG
HKTRLFSALAFNLVEGEVFLAGTTPPPRHPTRWVFRLWKSVQLIPPELVVTSKMLKGRSH
KNRSLMMALQRRFNMHI

>gi568815590f:21925249_22136679|GENSCAN_predicted_CDS_3|1854_bp
atgcccaccttcaccagtcgtattcaacacagttctgtaagtactagccagaacaaccag
gtcccctccagccgcgagcgacccctcagtacctgccgatgcctgctggtctctggcatc
ctccagtcgagggtcagggtcagggagcaaggcctcacgcgggcgccctccttgcagctg
cccggccagcccgcttctctgcccggagccatgaatctcagtagcgccagtagcacggag
gaaaaggcagtgacgaccgtgctctggggctgcgagctcagtcaggagaggcggacttgg
accttcagaccccagctggaggggaagcagagctgcaggctgttgcttcatacgatttgc
ttgggggagaaagccaaagaggagatgcatcgcgtggagatcctgcccccagcaaaccag
gaggacaagaagatgcagccggtcaccattgcctcactccaggcctcagtcctccccatg
gtctccatggtaggagtgcagctttctcccccagttactttccagctccgggctggctca
ggacccgtgttcctcagtggccaggaacgttatgaagcatcagacctaacctgggaggag
gaggaggaagaagaaggggaggaggaggaagaggaagaggaagatgatgaggatgaggat
gcagatatatctctggaggagcaaagccctgtcaaacaagtcaaaaggctggtgccccag
aagcaggcgagcgtggctaagaaaaaaaagctggaaaaagaagaagaggaaataagagcc
agcgttagagacaagagccctgtgaaaaagggagggaacagcatggagctctctgttcac
cggtctccaggacctcggattccacctttaatcctgaaaacccaggaaggcttctgtatc
cctacaatgaagcaggtgcttacagctgctgattctctgctgtcaaactcagacttgagg
gatggagcctggagcggttggaagcaggtggaggtgatggctcagggggagaatcacccg
tctcctaattttaaccagtacgtgagggaccagggcgccatgaccgaccagctgagcagg
cggcagatccgcgagtaccaactctacagcaggaccagtggcaagcacgtgcaggtcacc
gggcgtcgcatctccgccaccgccgaggacggcaacaagtttgccaagctcatagtggag
acggacacgtttggcagccgggttcgcatcaaaggggctgagagtgagaagtacatctgt
atgaacaagaggggcaagctcatcgggaagcccagcgggaagagcaaagactgcgtgttc
acggagatcgtgctggagaacaactatacggccttccagaacgcccggcacgagggctgg
ttcatggccttcacgcggcaggggcggccccgccaggcttcccgcagccgccagaaccag
cgcgaggcccacttcatcaagcgcctctaccaaggccagctgcccttccccaaccacgcc
gagaagcagaagcagttcgagtttgtgggctccgcccccacccgccggaccaagcgcaca
cggcggccccagcccctcacggtcctccaggtgtgcgtgtggggagtggaactcgggggc
cataaaacccgcctcttcagtgcgctggcttttaatcttgtcgagggggaggtcttcctc
gcagggacaaccccccccccccgccaccccaccaggtgggtcttcaggctctggaagtct
gttcagctcattccccctgaactggtggtgacttcgaagatgctaaaaggaagaagccac
aagaatagatccctaatgatggcacttcagaggagatttaacatgcacatctag

>gi568815590f:21925249_22136679|GENSCAN_predicted_peptide_4|433_aa
MERLQKVRGAAPGRGRRGRAKMDNQVLGYKDLAAIPKDKAILDIERPDLMIYEPHFTYSL
LEHVELPRSRERSLSPKSTSPPPSPEVWADSRSPGIISQASAPRTTGTPRTSLPHFHHPE
TSRPDSNIYKKPPIYKQRESVGGSPQTKHLIEDLIIESSKFPAAQPPDPNQPAKIETDYW
PCPPSLAVVETEWRKRKASRRGAEEEEEEEDDDSGEEMKALRERQREELSKVTSNLGKMI
LKEEMEKSLPIRRKTRSLPDRTPFHTSLHQGTSKSSSLPAYGRTTLSRLQSTEFSPSGSE
TGSPGLQVSASWRGTGRAAEAGSWHPKVSGEGGCGATELPGEEEEAGNGEGQRGRMDRGN
SLPCVLEQKIYPYEMLVVTNKGRTKLPPGVDRMRLERHLSAEDFSRVFAMSPEEFGKLAL
WKRNELKKKASLF

>gi568815590f:21925249_22136679|GENSCAN_predicted_CDS_4|1302_bp
atggaacggctgcagaaggtgcgcggcgccgccccgggccggggccgccgagggcgggcc
aagatggacaatcaggtgctgggctacaaggacctggctgccatccccaaggacaaggcc
atcctggacatcgagcggcccgacctcatgatctacgagcctcacttcacttattccctc
ctggaacacgtggagctgcctcgcagccgcgagcgctcgctgtcacccaaatccacatcc
cccccaccatccccagaggtgtgggcggacagccggtcgcctggaatcatctctcaggcc
tcggcccccagaaccactggaaccccccggaccagcctgccccatttccaccaccctgag
acctcccgcccagattccaacatctacaagaagcctcccatctataagcagagagagtcc
gtgggaggcagccctcagaccaagcacctcatcgaggatctcatcatcgagtcatccaag
tttcctgcagcccagcccccagaccccaaccagccagccaaaatcgaaaccgactactgg
ccatgccccccgtctctggctgttgtggagacagaatggaggaagcggaaggcgtctcgg
aggggagcagaggaagaggaggaggaggaagatgacgactctggagaggagatgaaggct
ctcagggagcgtcagagagaggaactcagtaaggttacttccaacttgggaaagatgatc
ttgaaagaagagatggaaaagtcattgccgatccgaaggaaaacccgctctctgcctgac
cggacacccttccatacctccttgcaccagggaacgtctaaatcttcctctctccccgcc
tatggcaggaccaccctgagccggctacagtccacagagttcagcccatcagggagtgag
actggaagcccaggcctgcaggtgagtgcctcctggaggggaacaggcagagcagcagaa
gctgggtcctggcacccgaaagtgtcaggggaaggggggtgtggggccacagagttgcct
ggtgaagaagaagaagccgggaacggagagggccagagggggaggatggaccgggggaac
tccctgccctgtgtgctggagcagaagatctatccctatgaaatgctagtggtgaccaac
aaggggcgaaccaagctgccaccgggggtggatcggatgcggcttgagaggcatctgtct
gccgaggacttctcaagggtatttgccatgtcccctgaagagtttggcaagctggctctg
tggaagcggaatgagctcaagaagaaggcctctctcttctga

>gi568815590f:21925249_22136679|GENSCAN_predicted_peptide_5|751_aa
MLSRLGALLQEAVGAREPSIDLLQAFVEHWKGITHYYIESTDESTPAKKTDIPWRLKQML
DILVYEEQQQAAAGEAGPCLEYLLQHKILETLCTLGKAEYPPGMRQQVFQFFSKVLAQVQ
HPLLHYLSVHRPVQKLLRLGGTASGSVTEKEEVQFTTVLCSKIQQDPELLAYILEGKKIV
GRKKACGEPTALPKDTTSHGDKDCSHDGAPARPQLDGESCGAQALNSHMPAETEELDGGT
TESNLITSLLGLCQSKVLAAEMERLGKSRVALKAQENLLLLVSMASPAAATYLVQSSACC
PAIVRHLCQLYRSMPVFLDPADIATLEGISWRLPSAPSDEASFPGKEALAAFLGWFDYCD
HLITEAHTVVADALAKAVAENFFVETLQPQLLHVSEQSILTSTALLTAMLRQLRSPALLR
EAVAFLLGTDRQPEAPGDNPHTLYAHLIGHCDHLSDEISITTLRLFEELLQKPHEGIIHS
LVLRNLEGRPYVAWGSPEPESYEDTLDLEEDPYFTDSFLDSGFQTPAKPRLAPATSYDGK
TAVTEIVNSFLCLVPEEAKTSAFLEETGYDTYVHDAYGLFQECSSRVASWGWPLTPTPLD
PHEPERPFFEGHFLRVLFDRMSRILDQPYSLNLQVTSVLSRLALFPHPHIHEYLLDPYIS
LAPGCRSLFSVLVIGDLMQRIQRVPQFPGKLLLVRKQLTGQAPGEQLDHQTLLQGVVVLE
EFCKELAAIAFVKFPPHDPRQNVSPAPEGQV

>gi568815590f:21925249_22136679|GENSCAN_predicted_CDS_5|2256_bp
atgctgagccggctcggggcgctgctgcaggaagccgtgggggcgcgcgagcccagcatt
gacctgctgcaggccttcgtggagcactggaagggcatcacgcactactacatcgagagc
acagatgaaagcacccccgccaagaagacagacattccctggcggctgaagcagatgctg
gatatcctggtgtatgaagagcagcagcaggcggccgcgggtgaggcagggccctgcctg
gagtacctgctgcagcacaagatcctggagactctctgcacgctgggcaaggccgagtac
cccccaggcatgcggcagcaggtgttccagttcttcagcaaggttctggcgcaggtgcag
caccccctgctgcattacctcagcgtccacaggcctgtgcagaaactcctccgacttggt
gggactgcttccggatccgttacagaaaaggaggaggtgcagttcaccaccgtcctctgc
tccaagatccagcaggacccagagctgctcgcctacatcctggaaggtaaaaagattgta
ggtaggaagaaagcatgcggagaacccactgccctgcctaaggacacaaccagccacggg
gacaaggactgctcccacgatggtgctcctgccaggccccagctggacggggagtcctgt
ggggcccaggccttgaacagccacatgcctgctgagaccgaggagctggacggtgggacc
acagagagcaacctgattacctccctgcttgggctgtgccagagcaaggtgctggcagca
gagatggagaggttggggaagagtcgggtggccttgaaggcccaggagaacctgctgctc
ctggtgagcatggcctccccagcagctgccacctacctggtacagagcagcgcctgctgc
cctgcgatcgtccggcacctttgccagttgtaccggtccatgcctgtcttcctggacccc
gcagacattgccaccttagagggcatcagctggaggttacccagtgccccgtctgatgag
gcttccttccctggcaaggaggccttggctgccttcttgggctggtttgattactgcgac
cacctcatcacagaggcacacacggtggttgcggacgccttggcgaaggctgtggctgag
aacttcttcgtggagaccctgcagccccagctcctgcacgtgtccgagcagagcatcttg
acctccaccgccctcctcacagccatgctgcgccagcttcgctcccctgcgctgctgcgg
gaggccgtggctttcctcctgggcacagaccggcagcctgaagcccccggggacaacccc
cacaccctgtatgctcatctcatcgggcattgtgaccacctctctgatgagatcagcatc
accacactccggctgtttgaggagctgctgcagaagccccacgaggggatcatccacagc
ctggtcctgcgcaaccttgagggccgcccttacgtggcctggggctcaccagagcctgag
agctatgaggacaccctagacctggaggaagacccctacttcaccgacagcttcctggat
tccggctttcaaactcccgcaaagcctcgcctagctcctgctaccagttacgatggcaaa
acagcagtgaccgagatcgtcaacagtttcctgtgcctggtccccgaggaagccaagacc
tctgccttcctggaggagacaggctatgacacatacgtccacgatgcttatggcctgttc
caggagtgcagctcccgcgtcgcctcctggggctggcctctgacccccacacctttggac
ccccatgagcccgagcgacctttcttcgagggccacttcctccgagtgctgtttgaccgc
atgtcccggattctggatcagccatacagcctgaacctgcaggtgacctcggtcctgtcc
cggcttgccctcttcccccacccccatattcatgagtacctgctggatccgtacatcagc
ctggcccccggctgcaggagcctattctccgtgttggtgatcggggacttgatgcagaga
atccagagggtaccccagttcccaggcaagctgctcctggtgcgcaagcagttgacgggc
caggctcctggggagcagctggaccaccagaccctcctccagggcgtggtggtgctggag
gagttctgcaaggagctggctgccattgccttcgtcaagtttcccccacatgatcctcgc
cagaacgtctccccagccccggaagggcaggtctga

>gi568815590f:21925249_22136679|GENSCAN_predicted_peptide_6|1923_aa
MGASPAPGLEAPGTLGMGQTLTARGLEPVSELIWAHDLSRHLAKSRPLDQGQTQAQALPE
VLGRNPLCPWALEEVWEGFGVEDGKEQLGQSAPAAPGEGERERAEQTERENASSPAGREA
PELAHGEQAPGAGHDDRHRPRRDRPVKPRDPPLGEPHEGRRVMESTPSFLKGTPTWEKTA
PENGIVRQEPGSPPRDGLHHGPLCLGEPAPFWRGVLSTPDSWLPPGFPQGPKDMLPLVEG
EGPQNGERKVNWLGSKEGLRWKEAMLTHPLAFCGPACPPRCGPLMPEHSGGHLKSDPVAF
RPWHCPFLLETKILERAPFWVPTCLPPYLVSGLPPEHPCDWPLTPHPWVYSGGQPKVPSA
FSLGSKGFYYKDPSIPRLAKEPLAAAEPGLFGLNSGGHLQRAGEAERPSLHQRDGEMGAG
RQQNPCPLFLGQPDTVPWTSWPACPPGLVHTLGNVWAGPGDGNLGYQLGPPATPRCPSPE
PPVTQRGCCSSYPPTKGGGLGPCGKCQEGLEGGASGASEPSEEVNKASGPRACPPSHHTK
LKKTWLTRHSEQFECPRGCPEVEERPVARLRALKRAGSPEVQGAMGSPAPKRPPDPFPGT
AEQGAGGWQEVRDTSIGNKDVDSGQHDEQKGLCPGWEFVGPFAFLGYCTCGLCRNSLALL
TFRRPPSTLGLSPHVTQPVVSLSPQWSYPKSTLAGSCTCVRTIAPAAQSLGLNTPDNLFA
RLPVHVTASSDSLLSLASPLGGELQQEEDTATNSSSEEGPGSGPDSRLSTGLAKHLLSGL
GDRLCRLLRREREALAWAQREGQGPAVTEDSPGIPRCCSRCHHGLFNTHWRCPRCSHRLC
VACGRVAGTGRAREKAGFQEQSAEECTQEAGHAACSLMLTQFVSSQALAELSTAMHQVWV
KFDIRGHCPCQADARVWAPGDAGQQDDRITNILDSIIAQVVERKIQEKALGPGLRAGPGL
RKGLGLPLSPVRPRLPPPGALLWLQEPQPCPRRGFHLFQEHWRQGQPVLVSGIQRTLQGN
LWGTEALGALGGQVQALSPLGPPQPSSLGSTTFWEGFSWPELRPKSDEGSVLLLHRALGD
EDTSRLGLLGMTPYPDRVENLAASLPLPEYCALHGKLNLASYLPPGLALRPLEPQLWAAY
GVSPHRGHLGTKNLCVEVADLVSILVHADTPLPAWHRAQKDFLSGLDGEGLWSPGSQVST
VWHVFRAQDAQRIRRFLQMVCPAGAGALEPGAPGSCYLDAGLRRRLREEWGVSCWTLLQA
PGEAVLVPAGAPHQVQGLVSTVSVTQHFLSPETSALSAQLCHQGPSLPPDCHLLYAQVGH
NALEDIFGLFVNRSESMFICSSTVPENAAGPVLRWGEGGKAELGEEQALATTTPTLLSTR
QVLHAGISFKAGVYVPHPTGHVTFITLWWNEKKGIWDMINSGNAIVCLRQQRDSGSRGRP
RASVTSPDCRVTVAYPGGATRPAGKMTSPSELLQTSARSGSWRAGGGWETSRAHGTDRRQ
KPGGVRWAPDPCPPSSRAAPGGPAPSVNAAGRPIRAGRGAAQPISGQSSRALPRSRALPR
SRELPARCRRDWERAPQRTLARGSAQSVCEDPARRPPGDPMASEGLAGALASVLAGQGSS
VHSCDSAPAGEPPAPVRLRKNVCYVVLAVFLSEQDEVLLIQEAKRECRGSWYLPAGRMEP
GETIVEALQREVKEEAGLHCEPETLLSVEERGPSWVRFVFLARPTGGILKTSKEADAESL
QAAWYPRTSLPTPLRAHDILHLVELAAQYRQQARHPLILPQELPCDLVCQRLVATFTSAQ
TVWVLVGTVGMPHLPVTACGLDPMEQRGGMKMAVLRLLQECLTLHHLVVEIKGLLGLQHL
GRDHSDGICLNVLVTVAFRSPGIQDEPPKVRGENFSWWKVMEEDLQSQLLQRLQGSSVVP
VNR

>gi568815590f:21925249_22136679|GENSCAN_predicted_CDS_6|5772_bp
atgggcgcctctccagcccctggcctggaagcaccaggaaccctggggatggggcagacc
ctcacagcccggggtctggagccggtgtcggagctcatctgggcccatgacctctccaga
catttggcaaaatcaaggcccttagaccagggacagacccaagcccaggccctcccagag
gtcctaggacgcaaccctttgtgcccttgggctctggaagaggtttgggaagggtttggg
gtggaagatggcaaagagcagcttggccagagcgcccccgccgccccgggggaaggagag
cgcgagcgcgctgagcagacagagcgggagaacgcgtcctcgcccgccggccgggaggcc
ccggagctggcccatggggagcaggcgcccggtgccggccacgacgaccgccaccgcccg
cgccgcgaccggccggtgaagcccagggacccccctctgggagagccccatgagggcagg
agagtgatggagagtacgcccagcttcctgaagggcaccccaacctgggagaagacggcc
ccagagaacggcatcgtgagacaggagcccggcagcccgcctcgagatggactgcaccat
gggccgctgtgcctgggagagcctgctcccttttggaggggcgtcctgagcaccccagac
tcctggcttccccctggcttcccccagggccccaaggacatgctcccacttgtggagggc
gagggcccccagaatggggagaggaaggtcaactggctgggcagcaaagagggactgcgc
tggaaggaggccatgcttacccatccgctggcattctgcgggccagcgtgcccacctcgc
tgtggccccctgatgcctgagcatagtggtggccatctcaagagtgaccctgtggccttc
cggccctggcactgccctttccttctggagaccaagatcctggagcgagctcccttctgg
gtgcccacctgcttgccaccctacctagtgtctggcctgcccccagagcatccatgtgac
tggcccctgaccccgcacccctgggtatactccgggggccagcccaaagtgccctctgcc
ttcagcttaggcagcaagggcttttactacaaggatccgagcattcccaggttggcaaag
gagcccttggcagctgcggaacctgggttgtttggcttaaactctggtgggcacctgcag
agagccggggaggccgaacgcccttcactgcaccagagggatggagagatgggagctggc
cggcagcagaatccttgcccgctcttcctggggcagccagacactgtgccctggacctcc
tggcccgcttgtcccccaggccttgttcatactcttggcaacgtctgggctgggccaggc
gatgggaaccttgggtaccagctggggccaccagcaacaccaaggtgcccctctcctgag
ccgcctgtcacccagcggggctgctgttcatcctacccacccactaaaggtgggggtctt
ggcccttgtgggaagtgccaggagggcctggaggggggtgccagtggagccagcgaaccc
agcgaggaagtgaacaaggcctctggccccagggcctgtccccccagccaccacaccaag
ctgaagaagacatggctcacacggcactcggagcagtttgaatgtccacgcggctgccct
gaggtcgaggagaggccggttgctcggctccgggccctcaaaagggcaggcagccccgag
gtccagggagcaatgggcagtccagcccccaagcggccaccggacccttttccaggcact
gcagaacagggggctgggggttggcaggaggtgcgggacacatcgatagggaacaaggat
gtggactcgggacagcatgatgagcagaaaggcctctgccctggctgggaatttgttggc
cccttcgccttcttgggttactgcacctgtggcctgtgtcgaaactctctggcccttctc
accttccggcgccctccgtccacgttgggattgtctcctcatgtcacgcagccagtggta
tcactaagccctcagtggagctatccaaagtctacactggctggatcctgcacttgtgtc
aggacgatcgccccagctgcccaaagccttgggctcaacacaccagacaacctcttcgct
cggctgccggtccatgtcacggcatcctcagactccctgctcagcctggcatcgcctctg
ggaggggagctgcagcaggaggaagacacagccaccaactccagctctgaggaaggccca
gggtccggccctgacagccggctcagcacaggcctcgccaagcacctgctcagtggtttg
ggggaccgactgtgccgcctgctgcggagggagcgggaggccctggcttgggcccagcgg
gaaggccaagggccagccgtgacagaggacagcccaggcattccacgctgctgcagccgt
tgccaccatggactcttcaacacccactggcgatgtccccgctgcagccaccggctgtgt
gtggcctgtggtcgtgtggcaggcactgggcgggccagggagaaagcaggctttcaggag
cagtccgcggaggagtgcacgcaggaggccgggcacgctgcctgttccctgatgctgacc
cagtttgtctccagccaggctttggcagagctgagcactgcaatgcaccaggtctgggtc
aagtttgatatccgggggcactgcccctgccaagctgatgcccgggtatgggcccccggg
gatgcaggccagcaggatgaccgcatcaccaacatcctggacagcattatcgcacaggtg
gtggaacggaagatccaggagaaagccctggggccggggcttcgagctggcccgggtctg
cgcaagggcctgggcctgcccctctctccagtgcggccccggctgcctcccccaggggct
ttgctgtggctgcaggagccccagccttgccctcggcgtggcttccacctcttccaggag
cactggaggcagggccagcctgtgttggtgtcagggatccaaaggacattgcagggcaac
ctgtgggggacagaagctcttggggcacttggaggccaggtgcaggcgctgagccccctc
ggacctccccagcccagcagcctgggcagcacaacattctgggagggcttctcctggcct
gagcttcgcccaaagtcagacgagggctctgtcctcctgctgcaccgagctttgggggat
gaggacaccagcaggttgggcctgctgggcatgaccccctaccctgacagggtggagaac
ctagctgccagtctgccacttccggagtactgcgccctccatggaaaactcaacctggct
tcctacctcccaccgggccttgccctgcgtccactggagccccagctctgggcagcctat
ggtgtgagcccgcaccggggacacctggggaccaagaacctctgtgtggaggtggccgac
ctggtcagcatcctggtgcatgccgacacaccactgcctgcctggcaccgggcacagaaa
gacttcctttcaggcctggacggggaggggctctggtctccgggcagccaggtcagcact
gtgtggcacgtgttccgggcacaggacgcccagcgcatccgccgctttctccagatggtg
tgcccggccggggcaggcgccctggagcctggcgccccaggcagctgctacctggatgca
gggctgcggcggcgcctgcgggaggagtggggcgtgagctgctggaccctgctccaggcc
cccggagaggccgtgctggtgcctgcaggggctccccaccaggtgcagggcctggtgagc
acagtcagcgtcactcagcacttcctctcccctgagacctctgccctctctgctcagctc
tgccaccagggacccagccttccccctgactgccacctgctttatgcccaggtgggtcac
aatgcccttgaggacatatttggtctctttgtaaacagaagtgaatcaatgttcatttgt
tccagcactgttcctgagaatgcagccggaccagtgctgaggtggggtgaaggaggaaag
gccgagctgggagaggagcaggcgctcgccacaaccacccccaccctgctctccacgcgc
caggtcctgcacgctggaatctccttcaaggcaggggtttatgtcccccaccccacgggc
catgtgacatttataactctttggtggaatgagaagaaaggtatttgggatatgatcaac
tccggcaatgccattgtgtgtttacggcaacagcgggacagtggttccagggggcggccc
cgggcctccgtgacgtcaccggattgtcgcgtcaccgtcgcctaccccggcggcgcaacg
cgccctgcaggaaagatgacgtcaccgtcggagctcctgcagaccagtgcgcgctcgggg
agttggcgagcgggtggcggctgggagacgtcccgagcgcacgggactgacaggcggcag
aagccgggcggggtccgctgggctccggacccgtgcccccccagttccagggcggccccg
ggcggccccgccccctcggtgaatgccgcgggccggccaatccgggcaggccgcggcgcc
gcgcagcctatcagcggccagagctcgcgtgcgcttccgcgttcgcgtgcgcttccgcgt
tctcgtgagctcccggcccgctgccgcagggactgggagcgggctccgcagcgcactcta
gcccgcggctcggctcagtcggtctgcgaggatccggcccgccgccccccgggggacccg
atggcctcggagggcctggcgggggcgctggcttccgtgctggctggccaggggtccagc
gtgcacagctgcgactcggcgccggccggggagccgccggcgcccgtgcggctgcggaag
aacgtgtgctacgtggtgctggccgtgttcctcagcgagcaggatgaggtgctactgatc
caggaggccaagagggagtgccgggggtcgtggtacctgcctgcggggagaatggagcca
ggggagaccatcgtggaggcgctgcagcgggaggtgaaggaggaggcggggctgcactgt
gagcccgagacactgctgtccgtggaggagcggggcccctcctgggtccgcttcgtgttc
ctcgctcgccccacaggtggaattctcaagacttccaaggaggccgatgcggagtccctg
caggctgcctggtacccacggacctccctgcccactccgctgcgagcccatgacatcctg
cacctggttgaactagccgcccagtatcgccagcaagccaggcaccctctcattctgccc
caagagctaccctgtgatctggtctgccagcggctcgtggctacctttaccagcgcccag
acagtgtgggtgttagtgggcacagtggggatgcctcacttgcctgtcactgcctgtggc
ctcgaccctatggagcagaggggtggcatgaagatggccgtcctgcggctgctgcaggag
tgtctgaccctgcaccacttggtggtggagatcaaggggttgcttggactgcagcacctg
ggccgagatcacagtgatggcatctgtttgaatgtgctggtgaccgtggcttttcggagc
ccagggatccaggatgaacccccaaaagttcggggtgagaacttctcttggtggaaggtg
atggaggaagacctgcaaagccagctcctccagcggcttcagggatcctctgttgtccca
gtgaacagatag