GENSCAN 1.0 Date run: 5-Nov-116 Time: 22:39:29 Sequence gi568815596f:238040755_238251062 : 210308 bp : 51.18% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 430 584 155 2 2 46 44 178 0.642 7.60 1.02 PlyA + 1820 1825 6 1.05 2.00 Prom + 4790 4829 40 -0.61 2.01 Init + 10686 10725 40 2 1 76 92 28 0.588 2.59 2.02 Intr + 16404 16496 93 1 0 102 101 18 0.841 4.93 2.03 Intr + 16607 16731 125 0 2 24 89 118 0.569 6.31 2.04 Intr + 19672 19803 132 0 0 81 5 131 0.524 5.45 2.05 Intr + 19916 19993 78 1 0 83 42 71 0.398 2.24 2.06 Intr + 20255 20389 135 1 0 64 69 88 0.633 5.77 2.07 Intr + 20957 21111 155 1 2 0 51 91 0.274 -4.02 2.08 Intr + 23603 23715 113 2 2 127 91 107 0.967 15.43 2.09 Intr + 27311 27411 101 0 2 84 88 28 0.647 2.73 2.10 Intr + 28543 28723 181 0 1 78 109 105 0.544 11.56 2.11 Intr + 39894 40043 150 1 0 101 -5 86 0.692 1.25 2.12 Intr + 40955 41082 128 0 2 72 101 194 0.995 20.00 2.13 Intr + 41291 41455 165 1 0 69 115 249 0.991 26.37 2.14 Intr + 42494 42600 107 1 2 75 86 20 0.351 0.01 2.15 Intr + 50464 50500 37 1 1 93 105 14 0.036 2.35 2.16 Intr + 53107 53190 84 0 0 83 55 179 0.471 14.61 2.17 Intr + 53666 53768 103 1 1 82 59 26 0.223 -0.65 2.18 Intr + 56047 56122 76 2 1 113 100 84 0.950 11.17 2.19 Term + 57448 57601 154 1 1 90 42 323 0.977 25.61 2.20 PlyA + 58638 58643 6 1.05 3.00 Prom + 58820 58859 40 -6.30 3.01 Init + 59666 59959 294 1 0 66 99 421 0.921 38.19 3.02 Intr + 60445 60495 51 0 0 72 84 48 0.833 2.39 3.03 Intr + 61187 61377 191 1 2 55 110 216 0.999 19.40 3.04 Intr + 63902 64088 187 2 1 97 119 276 0.973 31.81 3.05 Intr + 67037 67219 183 1 0 128 94 233 0.946 28.40 3.06 Intr + 74466 74591 126 2 0 67 109 14 0.002 2.78 3.07 Intr + 76149 76280 132 2 0 87 107 254 0.933 28.55 3.08 Intr + 79087 79227 141 0 0 104 39 33 0.270 1.06 3.09 Intr + 81458 81604 147 1 0 57 47 66 0.304 0.34 3.10 Intr + 84516 84630 115 2 1 93 78 70 0.596 7.02 3.11 Intr + 87464 87645 182 0 2 51 74 66 0.332 1.60 3.12 Intr + 87953 88150 198 1 0 96 96 302 0.981 31.87 3.13 Term + 89374 90978 1605 0 0 110 46 2087 0.999 197.44 3.14 PlyA + 92509 92514 6 1.05 4.00 Prom + 93911 93950 40 -4.21 4.01 Init + 100001 100774 774 1 0 97 84 1379 0.959 133.21 4.02 Intr + 102045 102177 133 2 1 109 81 177 0.936 19.82 4.03 Intr + 102227 102352 126 0 0 84 65 29 0.353 1.36 4.04 Intr + 104148 104234 87 1 0 117 80 107 0.891 13.24 4.05 Intr + 104923 105078 156 2 0 51 77 234 0.974 19.09 4.06 Intr + 107080 107268 189 2 0 108 94 424 0.749 45.08 4.07 Intr + 108253 108398 146 2 2 92 86 263 0.996 27.01 4.08 Intr + 110060 110274 215 1 2 55 -18 325 0.002 16.64 4.09 Intr + 113807 114068 262 2 1 80 89 140 0.029 11.33 4.10 Term + 114731 114850 120 1 0 93 44 34 0.245 -1.72 4.11 PlyA + 116320 116325 6 -0.45 5.00 Prom + 116635 116674 40 -3.11 5.01 Init + 118254 118451 198 2 0 82 111 377 0.999 36.28 5.02 Intr + 118988 119034 47 1 2 124 48 21 0.459 -0.60 5.03 Intr + 119838 120021 184 0 1 100 -6 92 0.607 1.31 5.04 Intr + 120086 120223 138 1 0 97 1 67 0.491 0.07 5.05 Intr + 120795 120962 168 2 0 40 100 182 0.590 15.26 5.06 Intr + 121982 122084 103 1 1 62 97 133 0.426 11.85 5.07 Intr + 122983 123245 263 2 2 59 105 372 0.994 33.74 5.08 Intr + 123321 123429 109 2 1 88 68 167 0.999 15.06 5.09 Intr + 123516 123606 91 1 1 116 77 113 0.994 12.55 5.10 Intr + 124852 124930 79 1 1 84 109 156 0.870 17.35 5.11 Term + 126202 126300 99 0 0 64 53 125 0.730 5.03 5.12 PlyA + 128116 128121 6 1.05 6.12 PlyA - 129674 129669 6 1.05 6.11 Term - 129922 129782 141 1 0 101 55 238 0.847 20.04 6.10 Intr - 130270 130189 82 0 1 96 78 44 0.980 4.34 6.09 Intr - 132899 132780 120 1 0 88 98 174 0.999 18.51 6.08 Intr - 141432 141311 122 0 2 82 91 74 0.930 6.90 6.07 Intr - 142986 142899 88 2 1 109 79 27 0.954 4.37 6.06 Intr - 143359 143266 94 2 1 59 100 63 0.911 4.12 6.05 Intr - 144533 144427 107 1 2 88 100 -23 0.886 -0.74 6.04 Intr - 147503 147377 127 0 1 86 84 232 0.982 22.84 6.03 Intr - 149218 149099 120 2 0 96 0 101 0.852 3.07 6.02 Intr - 154116 154051 66 1 0 93 92 61 0.200 6.37 6.01 Init - 163036 162745 292 1 1 43 119 255 0.941 19.41 6.00 Prom - 165185 165146 40 -1.31 7.00 Prom + 168816 168855 40 -2.11 7.01 Init + 169080 169134 55 2 1 90 106 22 0.793 5.59 7.02 Intr + 174816 174913 98 1 2 70 80 39 0.402 1.53 7.03 Term + 175640 175789 150 1 0 3 45 117 0.162 -2.88 7.04 PlyA + 178544 178549 6 1.05 8.02 PlyA - 179643 179638 6 1.05 8.01 Sngl - 180779 180432 348 2 0 74 52 179 0.596 7.73 8.00 Prom - 181487 181448 40 -2.31 9.05 PlyA - 183145 183140 6 1.05 9.04 Term - 183791 183578 214 1 1 42 44 114 0.364 -0.57 9.03 Intr - 184745 184536 210 1 0 65 26 146 0.359 4.65 9.02 Intr - 187116 186930 187 1 1 102 109 91 0.378 11.87 9.01 Init - 190764 190455 310 0 1 75 44 99 0.259 1.69 9.00 Prom - 190926 190887 40 -11.55 10.00 Prom + 191013 191052 40 -4.41 10.01 Init + 191127 191382 256 2 1 46 72 264 0.973 17.98 10.02 Intr + 191438 191835 398 0 2 -9 91 407 0.903 26.17 10.03 Term + 192927 192959 33 2 0 96 40 44 0.617 -1.73 10.04 PlyA + 193575 193580 6 1.05 11.08 PlyA - 197174 197169 6 1.05 11.07 Term - 198497 198073 425 0 2 54 47 541 0.894 42.43 11.06 Intr - 198814 198733 82 1 1 98 101 135 0.988 15.51 11.05 Intr - 198993 198907 87 0 0 92 64 241 0.999 22.76 11.04 Intr - 199179 199071 109 2 1 74 86 89 0.005 8.09 11.03 Intr - 205770 205625 146 0 2 112 28 145 0.008 10.49 11.02 Intr - 208458 208308 151 2 1 -11 66 267 0.983 15.28 11.01 Intr - 209989 209797 193 2 1 77 48 235 0.930 17.47 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 110060 110311 252 1 0 55 40 361 0.918 23.97 S.002 Term - 112479 112361 119 1 2 88 43 71 0.814 1.61 S.003 Term - 162599 162163 437 0 2 53 44 146 0.963 2.53 S.004 Init - 199151 199071 81 2 0 103 86 83 0.995 10.52 S.005 Init + 199334 199430 97 1 1 34 0 239 0.856 8.12 S.006 Intr + 200112 200223 112 0 1 113 41 159 0.921 14.58 S.007 Term - 205770 205621 150 0 0 112 40 160 0.991 11.82 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:238040755_238251062|GENSCAN_predicted_peptide_1|51_aa XDLGAGGSDLSLKRCFTHSLTPPVLLSVPNAVTPQEDFRNKVDDYIKRYAR >gi568815596f:238040755_238251062|GENSCAN_predicted_CDS_1|156_bp ngggaccttggtgctggtggatctgacttgtccctgaagcgctgcttcactcactcactc actcctcctgttcttctttctgtccccaatgctgttactccacaggaggacttccggaat aaagtggatgactacatcaaacgttatgccagatga >gi568815596f:238040755_238251062|GENSCAN_predicted_peptide_2|718_aa MVAGLFRAVLVVMGENSIWCTCYCVRNWWVLGLTDFKNEAADPRGVKLRTFAVSATALKA VRLELFVPPGGFVVSLASGVKLQTFVVPYGAAPTNGRAGPADLLITVQGGLRETPGALAL LPRPRHFPSRVQLGPRLLLRRLTGRGPMAALPQTAKRSAGRRLDARQQWGGDGGGRGAGE GCAGTRGESAQRLRETQLAGEEAGALGQVKGNQGFSQTAGPAAGNGMATAEKRGDPKHVV SILVKSEKSRRQGKVYMDYNATTPLEPEVIQAMTKAMWEAWGNPSSPYSAGRKAKDIINA ARESLAKMIGGKPQDIIFTSGGTESNNLVIHSVVKHFHANQTSKGHTGGHHSPVKGAKPH FITSSVEHDSIRLPLEHLVEEQVADPSPILGAHLVPALSGRVVLSQSEVSTSAADLVYGS RACSDIGKLTSAGLAVTFVPVSKVSGQAEVDDILAAVRPTTRLVTIMLANNETGIVMPVP EISQRIKALNQERVAAGLPPILVHTDAAQALGKQRVDVEDLGVDFLTIVGHKFYGPRIGA LYIRGLGEFTPLYPMLFGGGQERNFRPGTENTPMIAGLGKAAELVTQNCEAYEAHMRDVR DYLEERLEAEFGQKRIHLNSQFPGTQRLPNTCNFSIRGPRLQGHVVLAQCRVLMASVGAA CHSDHGDQPSPVLLSYGVPFDVARNALRLSVGRSTTRAEVDLVVQDLKQAVAQLEDQA >gi568815596f:238040755_238251062|GENSCAN_predicted_CDS_2|2157_bp atggtggcaggtcttttccgtgctgtcctcgtggtgatgggtgagaattctatttggtgc acgtgctattgtgtccggaattggtgggttcttggtctcactgacttcaagaatgaagcc gcggaccctcgcggagtgaagctgcggaccttcgcggtgagtgctacagctcttaaggcg gtgcgtctggagttgttcgttcctcccggtgggttcgtggtctcgctggcttcaggagtg aagctgcagaccttcgtggtcccctatggagccgctccgactaacggacgcgcaggtcca gcggacctgctcatcactgtccagggtggcctgcgggagacccccggggccctggccctg ctgccgcggccccggcacttcccctcgagggtccagctcgggccccgcctgctcttacgc cgactaaccggccgcggcccgatggcggccctgccgcagacagcaaagcgctccgcggga aggaggctggatgcccggcagcagtggggcggggatggaggcggccgtggcgccggggag ggatgcgccggcacccgcggcgagtcagcccagcggctgcgggaaacacaactcgccgga gaggaagctggtgcactggggcaggtgaaagggaatcaagggttctcccagactgctggc ccagctgcaggaaacgggatggccaccgctgagaagcgcggagacccgaagcacgtggta tccatactagttaagtccgagaagagcaggaggcaggggaaagtttatatggactataat gcaacgactcccctggagccagaagttatccaggccatgaccaaggccatgtgggaagcc tggggaaatcccagcagcccgtattcagcaggaagaaaggccaaggatattataaatgca gctcgggaaagcctcgcgaagatgataggggggaaacctcaagatataatcttcacttcc gggggcactgagtcaaataatttagtaatccattctgtggtgaaacatttccacgcaaac cagacctcaaagggacacacaggtgggcaccacagcccagtgaagggggccaagccccat ttcattacttcctcggtggaacacgactccatccggctgcccctggagcacctggtggaa gaacaagtggcagatccttcccccatcctgggagcccaccttgtgcccgccctctctggg cgtgtggtgctgtctcagtctgaggtcagcacctctgccgctgacttggtgtacgggagc agggcgtgttcggacattgggaagctgacctctgccggcctcgcggtcacctttgtcccg gtgtccaaggtgagcgggcaggcagaggtggacgacatcctcgcggcagtccgcccgacc acacgcctcgtgaccatcatgctggccaacaatgagactggcattgtcatgcctgtccct gaaatcagtcagcgcattaaagccctgaaccaggaacgggtggcagctgggctacctccc atcctcgtgcacacggatgctgcacaggccttggggaagcagcgcgtggatgtggaggac ctgggcgtggacttccttacaatcgtggggcacaagttttatggtcccaggattggcgca ctttatatacgaggacttggtgaatttacccctctctaccctatgctatttggaggtgga caagaacggaatttcaggccagggacagagaacaccccaatgattgctggccttgggaag gccgcggagctggtgacccagaactgcgaggcttatgaggcccacatgagggacgtccgc gactacctggaagagaggctggaagctgaattcggtcagaagagaatccatctgaatagc cagtttccaggcacccagcggcttcccaatacctgtaacttttccatccggggaccccgg cttcaaggccacgtggtgcttgcgcagtgccgagtgctgatggccagtgtgggggccgcg tgccactcggaccacggggaccagccgtccccagtgctgctgagctacggtgtccccttc gacgtggccaggaacgcgctccggctcagcgtgggccgcagcaccaccagggccgaggtg gacctcgtcgtgcaggacctgaagcaggccgtggcgcagctggaggaccaggcctag >gi568815596f:238040755_238251062|GENSCAN_predicted_peptide_3|1183_aa MEKQRALVAAKDGDVATLERLLEAGALGPGITDALGAGLVHHATRAGHLDCVKFLVQRAQ LPGNQRAHNGATPAHDAAATGSLAELCWLVREGGCGLQARPLELSLSPVELSGLRDQDAS GVSPLHLAARFGHPVLVEWLLHEGHSATLETREGARPLHHAAVSGDLTCLKLLTAAHGSS VNRRTRSGASPLYLACQEGHLHLAQFLVKDCGADVHLRALDGMSALHAAAARGHYSLVVW LVTFTDIGLTARDNEGATALHFAARGGHTPILDRLLLMGTPILRDSWGGTPLHDAAENGQ MEAVATLEAMELRAEEGGEGSPPEDSPRTRHSPSRSQADLLAAQCCQTLVSHHVDPSLRD EDGYTAADLAEYHGHRDCAQYLREVAQPSIWDQLGQSHSPPHPSPWGLLDTCHTATTGAS VGTDGNPHQSCRAAQIPEAPSCPGPSYGSQGCGYSRCRRNATARLLIPVKLRDHPNTEGG SSVKVPLLMTPPPPPFPPPPLLATRRSLEDGRRGGPGPGNPSHSQPLRCSVGVWETPRGA AARRGELERARHIEGSDKPSAAQDWLPGFVYVAVPRASLPRDTGTETALAGDTSDGLAAL QLDGLPSGDIDGLVPTRDERGQPIPEWKRQVMVRKLQARLGAESSAEAQDNGGSSGPTEQ AAWRYSQTHQAILGPFGELLTEDDLVYLEKQIADLQLRRRCQEYESELGRLAAELQALLP EPLVSITVNSHFLPRAPGLEVEEASIPAAEPAGSAEASEVAPGVQPLPFWCSHISRLVRS LSLLLKGVHGLVQGDEKPSTRPLQDTCREASASPPRSEAQRQIQEWGVSVRTLRGNFESA SGPLCGFNPGPCEPGAQHRQCLSGCWPALPKPRSGLASGEPRPGDTEEASDSGISCEEVP SEAGAAAGPDLASLRKERIIMLFLSHWRRSAYTPALKTVACRTLGARHAGLRGQEAARSP GPPSPPSEGPRLGHLWQQRSTITHLLGNWKAIMAHVPARQLRRLSRQPRGALSPEQFLPH VDGAPVPYSSLSLDLFMLGYFQLLECDLPAEERKLRHLLCFEVFEHLGTHGWEAVRAFHK AVTDEVAAGRRAWTDGFEDIKARFFGSSQRPAWDTEPGRKSGLTLLGPLPHAAVPCSGPE PTAQRLGSRSQQGSFNGEDICGYINRSFAFWKEKEAEMFNFGE >gi568815596f:238040755_238251062|GENSCAN_predicted_CDS_3|3552_bp atggagaagcagcgggcactcgtggccgccaaggatggggatgtggcgacgttggagcgg ctgctggaggctggcgccctgggcccgggcatcaccgatgctctgggggccggcctggtt caccacgccacccgggctggccacctggactgcgtcaagttcttggtgcagcgggcccag ctgcccggcaaccagcgggcccacaacggggccaccccagcgcatgacgccgctgccacg ggcagcctggccgagctgtgctggctggtccgcgaggggggctgcggtctgcaggcgagg cccttagagctgtcgctcagccctgtggagctttcagggctgagggaccaagatgcctcg ggcgtctccccgctgcacctggccgcccgttttggacacccagtgctggtggagtggctg ctccacgagggccactcggccacgctagagacccgggagggagcccggccgctgcaccac gctgccgtcagtggggacctgacctgcctcaagctcctgacagccgcgcatggcagcagc gtgaaccggcggacacgcagtggcgcctccccactctacctggcctgccaggagggccac ctgcacctggcccagttcctggtgaaggactgtggcgctgacgtgcaccttcgtgctctc gatggcatgagcgccctgcacgctgccgccgcccgtggccactactccctcgtcgtctgg ctggtcacattcaccgacatcggactcacggcacgggacaatgagggggccacggccctg cactttgcagcccgaggcggccacacgcccattctagaccgactcctgctcatgggtacc cccatcctgagagactcctggggtgggacccccctccacgacgcagcagagaacgggcag atggaggctgtggccaccttggaggccatggaactcagggcagaggagggaggggaagga agccctccagaggattcccccaggacaaggcactcgccctcgaggtctcaagctgacctg ctagctgcgcagtgctgccagaccctagtctcccaccacgtggacccctccctgcgggat gaagatggttacacggcggcagacctggcggagtaccatggacaccgggactgcgcccag tacctgcgggaggtggcccagccgagcatctgggaccaactggggcagagccacagtccc ccccaccccagcccctggggcctgctggacacttgtcacactgccaccactggggcctct gttggcactgacggaaacccacaccagtcctgccgtgctgcacagatccctgaggctcca tcgtgtccagggccttcctatgggagccaggggtgtggttactcccgttgccggagaaac gccacagccagacttctcatccctgttaaactcagagaccatcctaataccgagggtggc agcagcgtcaaggtgcccctgctgatgacgcccccaccaccaccgttccccccacctcca ctgttggccacgaggcgctccctggaggatggaagaagaggaggcccagggccagggaac cccagccattctcaaccattgaggtgttcggtgggagtctgggaaacgccacgtggcgct gctgcccgtcgtggagagctggagcgcgcacggcacattgaaggctctgacaagccctcc gcggcacaggattggctgccaggctttgtctatgtagctgtgcccagagcaagtttgccc agggatacggggacagagacggcgctggcgggggacacctcagatggcctggccgcacta cagctggatgggctgccctcaggcgacatcgacgggctggtgcccacgcgggatgagcgc ggccagcccatcccagagtggaagcggcaggtgatggtgcggaagctgcaggcgcgcctg ggcgcagagagctccgcagaggcccaggacaatggtgggagctcaggccccacggagcag gcggcctggaggtactcacagactcatcaggccatcctggggccctttggggagctgctg acagaggatgacctggtctacctggagaagcagattgcagacctgcagcttcggcgccgc tgtcaggagtatgagagtgagctgggccggttggcggctgagctgcaggccctgctgccc gagcccctggtcagcatcacggtcaacagccacttcctgccccgggcgcccggactggag gttgaggaggcctcaatcccagcggctgagcccgcagggtctgcggaggcctcagaggtg gcccccggggtgcagcccctgcccttctggtgcagccacatctcccgcctggtacgcagc ctgtccctgctgctgaagggcgtgcatgggctagtacagggggatgagaagccatccacc cggcccctgcaggacacctgcagggaggcctcggccagcccccctcggagcgaggcccag cgccagatccaggagtggggggtgtctgtgcggacgctgcggggcaacttcgagtcggcc tctggcccactctgtggcttcaaccctggcccctgcgagccgggggcccagcacaggcag tgcctgagtggctgctggccagccctgcctaagccccgcagtggcctggcttcaggggag cccaggcctggcgacacagaggaggccagcgactctggcatcagctgcgaggaggtgcca tcagaggcgggtgccgcagccggcccagacctggccagcctgcgcaaggagcgcatcatc atgctcttcctcagccactggaggagatcggcctacacgccggccctcaagacagtggcc tgcaggaccctaggagcccgccacgcggggttgcggggccaggaggccgccaggagccct gggccaccctccccgcccagcgagggcccccggctgggccacctgtggcagcagcgcagc accatcacccacctgctgggcaactggaaggccatcatggctcacgtgcccgcccggcag ctgcggcggctgagccggcagccccgcggggctttgtcccccgagcagttcctgccccac gtggacggggctccggtgccctacagcagcctctcactggatctcttcatgctgggttac ttccagctgctggagtgcgacctgccggcggaggagcggaagctgcgccacctgctgtgc ttcgaggtcttcgagcacctgggcacccacggctgggaggctgtgcgcgccttccacaag gccgtgaccgacgaggtggccgccggccgccgggcctggaccgacggcttcgaggacatc aaagcccgcttctttggctccagccagcgtcccgcctgggatacggagcctggccgcaag tcaggtctgaccctgctcgggcccctgcctcacgccgccgtcccctgcagcggccctgag cccacagcacagcggctggggtcccgctcccagcagggcagcttcaacggtgaggacatc tgcggctacatcaaccgcagctttgccttctggaaggagaaggaagctgagatgttcaac tttggagaatga >gi568815596f:238040755_238251062|GENSCAN_predicted_peptide_4|735_aa MVRNVDDLDFHLPSHAQDMLDGLQRLRSQPKLADVTLLVGGRELPCHRGLLALSSPYFHA MFAGDFAESFSARVELRDVEPAVVGQLVDFVYTGRLTITQGNVEALTRTAARLHFPSVQK VCGRYLQQQLDAANCLGICEFGEQQGLLGVAAKAWAFLRENFEAVAREDEFLQLPRERLV TCLAGDLLQVQPEQSRLEALMRWVRHDPQARAAHLPELLSLVHLDAVPRPCVQQLLASEP LIQESEACRAALSQGHDGAPLALQQKLEEVLVVVGGQALEEEEAGEEPTPGLGNFAFYNS KATGAAVSSLQVERMRLAEDRELCEAPVRGVPGWASPGGCRGARERWMALPDFPDYHKWG FSLAALNNNIYVTGGSRGTKTDTWSTTQAWCFPLKEASWKPVAPMLKPRTNHASAALNGE IYVIGGTTLDVVEVESYDPYTDSWTPVSPALKYVSNFSAAGCRGRLYLVGSSACKYNALA LQCYNPVTDAWSVIASPFLPKYLSSPRCAALHGELYLIGDNTKKVYVYDPGANLWQKVQS QHSLHENGALVPLGDALYVTGGRWQGMEGDYHVEMEAYDTVRDTWTRHGALPRLWLYHGA STVFLDVSNKKTEGYRPLLQLVGPSVDPAESMCMDGECANGLRREGTATYTGKMLHLRGR VIHRPSPVPGVVSADRSEVWLLLPDSAAPGTWQGSQGRGRFYSHDFIGKGPSSGGREKSG NSLAASGCWEARPGP >gi568815596f:238040755_238251062|GENSCAN_predicted_CDS_4|2208_bp atggtgcggaacgtggatgacctggatttccacctgccctcgcatgcccaggacatgctg gatggcctgcagcgcctgcgctctcagcccaagctggccgacgtcacactgctggtgggc ggccgggagctgccatgccaccgcggcctcctggcgctcagcagcccctacttccatgcc atgtttgcgggtgacttcgccgagagcttctctgcgcgcgtggagctgcgggacgtggag cccgccgtggtgggacaactggtggacttcgtgtacacaggccggctgaccatcacgcag ggcaacgtggaggcgctgacacgcacggctgcgcgcctgcacttcccctcggtgcagaag gtctgcggccgctacctgcagcagcaactggatgccgccaactgcctgggcatctgtgag ttcggggagcagcaagggctgctgggcgtggctgccaaggcctgggccttcctgcgagag aactttgaggctgtggcacgtgaggacgagttcctgcagcttccccgagagcggctggtc acttgtctggccggcgacctgctgcaggtacagccggagcaaagccgactcgaggccctg atgcgctgggtgcgccatgacccgcaggcccgggccgcccacctgcccgagctgctcagc ctagtgcacctggacgccgtgcccaggccctgcgtgcagcaactgctggcctcagagccc ctgatccaggagtcagaggcatgccgggcagccctgtcccagggccatgatggggcacca ctcgccctccagcagaagctggaggaggtcctggtggtggtgggcgggcaggcgctggag gaggaggaggcaggtgaggagcccacccccggccttgggaactttgccttctacaacagc aaggccacgggagccgctgtgtcctccttgcaggtggagcgcatgaggctcgcagaggac cgggagctgtgtgaggcccctgtgaggggcgtgcctgggtgggcctcacctgggggctgc agaggagcccgagagaggtggatggcacttccagacttccccgactatcacaagtggggt ttctccctggcggccctgaacaacaacatctatgtcacaggtggctctcggggcacaaag acagacacctggtcaaccacccaggcctggtgcttccccctgaaggaggcctcctggaag cccgtggcgcccatgctgaagccccgcaccaaccacgccagcgcggccctcaatggggag atctacgttatcggcggcaccaccctggacgtggtggaggtggagagctatgacccctac acggacagctggacgcccgtcagcccggccctcaaatacgtcagcaacttctcggctgcc ggctgccggggccggctctacctggtgggctccagcgcctgcaagtacaacgccctggcc ctgcagtgctacaaccctgtcacagatgcgtggagtgtgatcgcctcgcccttcctgccc aagtacctgtcctcgcctcgctgtgctgcactgcacggggagctctacctcattggggac aacaccaagaaggtctacgtgtacgaccccggggccaacctgtggcagaaggtgcagtca cagcacagcctgcatgagaatggcgcgctggtgccactgggtgatgcgctgtacgtgacg ggcggccgctggcagggcatggaaggtgactaccacgtggagatggaggcctacgacacg gttcgggacacctggacccgccacggcgccctgccccggctctggctctaccacggggcc tccaccgtcttcctggatgtctccaacaagaaaacagaaggatacaggccactactccag ttggttgggccatcagttgatccagctgaaagcatgtgtatggatggagaatgtgccaat ggcttaaggagagagggcacagccacgtacacaggaaagatgctgcacctccggggccgt gtcattcaccgtccctccccagtgcccggcgtggtatcagctgatcggtctgaagtgtgg ctgcttctcccggactctgcagcaccaggtacgtggcagggcagccagggcagagggagg ttttattcccatgacttcatcggcaaaggtccttcctctggaggaagagagaagagtggg aacagcttggctgcatcaggctgctgggaagccaggcccgggccgtga >gi568815596f:238040755_238251062|GENSCAN_predicted_peptide_5|492_aa MAPARRPAGARLLLVYAGLLAAAAAGLGSPEPGAPSRSRARREPPPGNELPRGPGESRAG PAARPPGFGAWTAEQRVVRLGRLLGTAAPIFPPQHPGLRIGQPAPKRRPCIPPDQPGTKG SDLAAGFVSDKRAQSALKSQGPSLAALPGAQIAQGLSQEGPSQNCAQCQHQSGCSADRNG NFRKHHCPCEPSEANRPAGLAVFQEPTAERAHSVDPRDAWMLFVRQSDKGVNGKKRSRGK AKKLKFGLPGPPGPPGPQGPPGPIIPPEALLKEFQLLLKGAVRQRERAEPEPCTCGPAGP VAASLAPVSATAGEDDDDVVGDVLALLAAPLAPGPRAPRVEAAFLCRLRRDALVERRALH ELGVYYLPDAEGAFRRGPGLNLTSGQYRAPVAGFYALAATLHVALGEPPRRGPPRPRDHL RLLICIQSRCQRNASLEAIMGLESSSELFTISVNGVLYLQMGQWTSVFLDNASGCSLTVR SGSHFSAVLLGV >gi568815596f:238040755_238251062|GENSCAN_predicted_CDS_5|1479_bp atggccccggcccgccgccccgccggagcccgcctgctgctcgtctacgcgggcctgctg gccgccgccgccgcgggcctggggtccccggagcctggggcgccctcgaggagccgcgcc cgcagggagccgccgcccgggaacgagctgccccggggccccggggagagccgcgcgggg ccggccgctcgtccgccgggttttggagcctggactgctgagcagagggtggtccggttg gggaggctcctcggcaccgcagcccccatcttcccaccacaacaccctggcctgcgcatt ggtcagccagctcccaaaagaaggccctgcatccccccagatcagccaggaaccaaaggc agtgacttggcagccggctttgtctctgataaaagggcccagtcagctctaaaatcccag ggtccttccctggcagccttgcctggagcccagatagcccaaggcctctcccaggaaggt ccctcccaaaactgtgctcagtgtcagcatcagtcaggatgcagcgcggacaggaatggg aacttccggaagcatcactgtccatgtgagcccagtgaggccaaccgccctgctgggctg gctgtgttccaggagcccaccgctgagcgtgcacacagcgtcgacccccgggacgcctgg atgctcttcgtcaggcagagtgacaagggtgtcaatggcaagaagaggagcaggggcaag gccaagaagctgaagttcggcttgccagggccccctgggcctcccggtccccagggcccc ccaggccccatcatcccacccgaggcgctgctgaaggagttccagctgctgctgaaaggt gcggtgcggcagcgggagcgcgcggagcccgaaccctgtacgtgtggccccgccgggccg gtcgctgcgagcctcgccccggtctcggccaccgccggggaggacgacgacgacgtggtg ggggacgtgctggcactgctggccgcgcccctggccccggggccgcgggcgccgcgcgtg gaggccgctttcctctgccgcctgcgccgggacgcgttggtggagcggcgcgcgctgcac gagcttggcgtctactacctgcccgacgccgagggtgccttccgccgcggcccgggcctg aacttgaccagcggccagtacagggcgcccgtggctggcttctacgctctcgccgccacg ctgcacgtggcgctcggggagccgccgaggagggggccgccgcgcccccgggaccacctg cgcctgctcatctgcatccagtcccggtgccagcgcaacgcctccctggaggccatcatg ggcctggagagcagcagtgagctcttcaccatctctgtgaatggcgtcctgtacctgcag atggggcagtggacctccgtgttcttggacaacgccagcggctgctccctcacagtgcgc agtggctcccacttcagtgctgtcctcctgggcgtgtga >gi568815596f:238040755_238251062|GENSCAN_predicted_peptide_6|452_aa MRTAPPPAAPPSPRAALWGGRATPRADWWRPGLSLQVTDGAGRPCPSPARCCRPPGVWSP AAARGLSVCRCCRLHPASAMDLFGDLPEPERSPRPAAGKEAQKGPLLFDDLPPASSTDSG SLATSISQMVKTEGKGAKRKTSEEEKNGSEELVEKKVCKASSVIFGLKGYVAERKGEREE MQDAHVILNDITEECRPPSSLITRVSYFAVFDGHGGIRASKFAAQNLHQNLIRKFPKGDV ISVEKTVKRCLLDTFKHTDEEFLKQASSQKPAWKDGSTATCVLAVDNILYIANLGDSRAI LCRYNEESQKHAALSLSKEHNPTQYEERMRIQKAGGNVRDGRVLGVLEVSRSIGDGQYKR CGVTSVPDIRRCQLTPNDRFILLACDGLFKVFTPEEAVNFILSCLEDEKIQTREGKSAAD ARYEAACNRLANKAVQRGSADNVTVMVVRIGH >gi568815596f:238040755_238251062|GENSCAN_predicted_CDS_6|1359_bp atgcgcaccgccccgcccccagccgccccgcccagcccgcgcgcagccctctggggcggc cgggccacgccgcgcgccgattggtggcgtccgggactctccctgcaggtgactgacggc gccggccgcccctgcccgtcgcccgcccgctgctgccgcccgcccggggtgtggagcccg gccgctgctcgcgggctgagtgtctgtcgctgctgccgcctccacccagcctccgccatg gacctcttcggggacctgccggagcccgagcgctcgccgcgcccggctgccgggaaagaa gctcagaaaggacccctgctctttgatgacctccctccggccagcagtactgactcaggt tctcttgccacatcaatatcccagatggtaaagactgaagggaaaggagcaaagagaaaa acctccgaggaagagaagaatggcagtgaagagcttgtggaaaagaaagtttgtaaagcc tcttcggtgatctttggtctgaagggctatgtggctgagcggaagggtgagagggaggag atgcaggatgcccacgtcatcctgaacgacatcaccgaggagtgtaggcccccatcgtcc ctcattactcgggtttcatattttgctgtttttgatggacatggaggaattcgagcctca aaatttgctgcacagaatttgcatcaaaacttaatcagaaaatttcctaaaggagatgta atcagtgtagagaaaaccgtgaagagatgccttttggacactttcaagcatactgatgaa gagttccttaaacaagcttccagccagaagcctgcctggaaagatgggtccactgccacg tgtgttctggctgtagacaacattctttatattgccaacctcggagatagtcgggcaatc ttgtgtcgttataatgaggagagtcaaaaacatgcagccttaagcctcagcaaagagcat aatccaactcagtatgaagagcggatgaggatacagaaggctggaggaaacgtcagggat gggcgtgttttgggcgtgctagaggtgtcacgctccattggggacgggcagtacaagcgc tgcggtgtcacctctgtgcccgacatcagacgctgccagctgacccccaatgacaggttc attttgttggcctgtgatgggctcttcaaggtctttaccccagaagaagccgtgaacttc atcttgtcctgtctcgaggatgaaaagatccagacccgggaagggaagtccgcagccgac gcccgctacgaagcagcctgcaacaggctggccaacaaggcggtgcagcggggctcggcc gacaacgtcactgtgatggtggtgcggatagggcactga >gi568815596f:238040755_238251062|GENSCAN_predicted_peptide_7|100_aa MAKFAIHIWKYPTNPKKETVAALGTTPSPVTLWFLQTRRGSKIWRNSLDYQGALLIATTA VNVLVTAETSMSQSLTQGPWLYYLVTTADYSDSKGSFISR >gi568815596f:238040755_238251062|GENSCAN_predicted_CDS_7|303_bp atggcaaaatttgctatccatatttggaaatatcccactaatccgaagaaggaaactgta gctgcattaggtactactccaagcccagtaacactgtggttcttgcagactcgtagagga tctaagatctggaggaattccctggattaccagggggccctcctcatagccaccacagct gtgaatgtcctggtcacagctgaaaccagcatgtctcagagtctcactcaaggcccatgg ctgtactacctggttaccactgctgattattcagattccaagggctctttcatcagcaga tga >gi568815596f:238040755_238251062|GENSCAN_predicted_peptide_8|115_aa MQWPLQTACRCQEPGTSRSPAPSELGQEQLPGAAAAAQTVAANPGLPLHGAGRNLTGPFP AQRRLPKPRLQTQASLHSWGPGKALLPSQAQKCLLPVPGLSLFLVPAPISEQSRG >gi568815596f:238040755_238251062|GENSCAN_predicted_CDS_8|348_bp atgcagtggccgctccagacggcctgccgctgccaggagccggggaccagcaggagccct gccccttctgagctggggcaggagcagctccccggtgccgctgcagctgcccaaaccgtg gctgccaacccgggcctcccgctccacggagcaggcaggaacctcaccggccccttcccc gcccagcggcggctgcccaaaccgcggctgcagactcaggcatccctgcactcttggggg cccgggaaggccctcctcccctcacaggcccagaagtgcctgctcccagtgcctggcctc tccctgttcctggtgcctgctccaatctcagagcaaagtcggggctga >gi568815596f:238040755_238251062|GENSCAN_predicted_peptide_9|306_aa MKALQPPPPDLRLRKRRRERSSSPWGPSPRERGAGCGARRPGRRSVESSPRDRKCSRNQR LRSVKERTLEALEGKQVSPGGNPGLRFAQRSPRECVEHLLCAGVGVEGVTAQLLQRQRLP PPCSRPAGSEHLPHADRAPAPMFTMLRKTCGGQQEDSRGMGPHLPRVPDGVVLGVEEPEG KAQGRVRGELCPGALARIHMGHSCTRLSAEGIAMLLGTAGPSRLPAAPDASQAPLSAHIV TVHLPPGFPLDGKLGVQCNALLTSEYLLSFDKCVLFCASNPSQGPERGHQKVPSHTFQPV PSRAGE >gi568815596f:238040755_238251062|GENSCAN_predicted_CDS_9|921_bp atgaaggctctgcagcccccgccgcccgatttacgcttgagaaagcgaaggcgagagagg tcgtccagcccctgggggcccagccctagagagcggggcgcgggctgcggggcccggcgg cctggacggcggagcgttgaaagctcccccagggataggaagtgcagccgaaatcaacga ttgaggtcagtgaaggagcggacgcttgaggccttggaggggaagcaggtttctccagga gggaacccgggcctccgctttgctcagcgctcccctcgcgaatgtgttgagcacctgtta tgtgctggggtgggtgtggagggggtgactgcccagcttctgcagcgccagcgacttcct ccaccttgttcccgccctgcagggtctgagcacctccctcatgcagacagagccccggct cccatgttcaccatgctgagaaagacctgcgggggacagcaggaggacagccggggaatg ggcccgcacttgcccagagtccctgatggtgtcgtcttgggagtggaggaacctgaggga aaagcccagggacgagtcaggggagagctctgccctggtgctcttgcacgtatccacatg ggccacagctgcacccggctctcagcagagggcatcgccatgctcctgggcacagccggg ccctctcgcctccctgcggctcctgatgcctcccaggctccgctgagtgcgcacatcgtt acagtccatttgccaccgggttttcctttggatggaaagcttggcgttcagtgcaatgcc ctgctcacgagtgaatacctgctgagttttgacaagtgtgtgctgttttgtgcctcaaac ccttctcaaggtcctgagcgtggccatcagaaggttccctcacacaccttccaaccagtg ccctcccgtgcaggagagtga >gi568815596f:238040755_238251062|GENSCAN_predicted_peptide_10|228_aa MSEYIRVTEDENDEPIEIPSEDDGTVLLSTVTAQFPGACGLRCRNPVSQCMRGVRLVEGI LHAPDAGWRNPVYVVNYPKDNKRKMVLGLPWKTTEQDLKEYCSTFGDVLMVRQSQDEPLR SRKVFVGRRTEDMTEDELWEFFSQYGDVMDVFVPKPFRAFAFVPFADDQIARSLCGEDSM KGISVHISNAEPKRNSNRQQEVEDLVVIQVALGIRVDLSRINAVNARL >gi568815596f:238040755_238251062|GENSCAN_predicted_CDS_10|687_bp atgtctgaatatattcgggtaaccgaagatgagaacgatgagcccattgaaataccatcg gaagacgatgggacggtgctgctgtccacggttacagcccagtttccaggggcgtgtggg cttcgctgcaggaatccggtgtctcaatgtatgagaggtgtccggctggtagaaggaatt ctgcatgcccccgatgctggctggagaaatccggtgtatgttgtcaactatccgaaagat aacaaaagaaaaatggtgttgggtctcccatggaaaacaactgaacaggacctgaaagag tactgtagtacctttggagacgttcttatggtgcggcaaagccaagatgagcctttgaga agcagaaaagtgtttgtggggcgccgtacggaggacatgactgaggatgagctgtgggag ttcttctctcagtatggggatgtgatggatgtcttcgtccccaagccgttcagggccttt gcctttgttccatttgcagatgatcagattgcgcggtctctttgtggagaggactcgatg aaaggaatcagcgttcacatatccaatgccgaacccaagcgcaatagcaatagacagcaa gaagtggaagatttggtggtaatccaggtggctttgggaatcagggtggatttgagtcgt atcaacgctgtgaacgcaaggctgtga >gi568815596f:238040755_238251062|GENSCAN_predicted_peptide_11|397_aa XSSDTSHTSKYFGSIDSSENNHKAKMNTGMEESEHFIKCVLQDPIWLLMADADSSVMMTY QLPSRNLEAVLKEDREKLKLLQKLQPRFTESQKQELREVHQWMQTGGLPAAIDVAECVYC ENKEKGNICIPYEEDIPSLGLSEVSDTKEDENGSPLNHRIEEQTPCRVPGGAGMAPPAAP GRDRVGREDEDGWETRGDRKARKPLVEKKRRARINESLQELRLLLAGAEVQAKLENAEVL ELTVRRVQGVLRGRAREREQLQAEASERFAAGYIQCMHEVHTFVSTCQAIDATVAAELLN HLLESMPLREGSSFQDLLGDALAGPPRAPGRSGWPAGGAPGSPIPSPPGPGDDLCSDLEE APEAELSQAPAEGPDLVPAALGSLTTAQIARSVWRPW >gi568815596f:238040755_238251062|GENSCAN_predicted_CDS_11|1194_bp ngcagtagtgacacaagtcataccagcaaatattttggaagcattgactcctcagagaat aatcacaaagcaaaaatgaacactggtatggaagaaagtgagcatttcattaagtgcgtc ctgcaggatcccatctggctgctgatggcagatgcggacagcagcgtcatgatgacgtac cagctgccttcccgaaatttagaagcggttttgaaggaggacagagagaagctgaagctc ctacagaaactccagcccaggttcacggagagtcagaagcaggagctgcgcgaggtccac cagtggatgcagacgggcggcctgcccgcagccatcgacgtggcagaatgtgtttactgt gaaaacaaggaaaaaggtaatatttgcataccatatgaggaagatattccttctctggga ctcagcgaagtgtcggacaccaaagaagacgaaaatggatcccccttgaatcacaggatc gaagagcagaccccctgccgcgtccccggcggagcgggcatggcgccacccgcggcgcct ggccgggaccgtgtgggccgtgaggatgaggacggctgggagacgcgaggggaccgcaag gcccggaagcccctggtggagaagaagcggcgcgcgcggatcaacgagagcctgcaggag ctgcggctgctgctggcgggcgccgaggtgcaggccaagctggagaacgccgaagtgctg gagctgacggtgcggcgggtccagggtgtgctgcggggccgggcgcgcgagcgcgagcag ctgcaggcggaagcgagcgagcgcttcgctgccggctacatccagtgcatgcacgaggtg cacacgttcgtgtccacgtgccaggccatcgacgctaccgtcgctgccgagctcctgaac catctgctcgagtccatgccgctgcgtgagggcagcagcttccaggatctgctgggggac gccctggcggggccacctagagcccctggacggagtggctggcctgcggggggcgctccg ggatccccaatacccagccccccgggtcctggggacgacctgtgctccgacctggaggag gcccctgaggctgaactgagtcaggctcctgctgaggggcccgacttggtgcccgcagcc ctgggcagcctgaccacagcccaaattgcccggagtgtctggaggccttggtga