GENSCAN 1.0 Date run: 8-Nov-116 Time: 12:48:51 Sequence gi568815596f:226764694_227095519 : 330826 bp : 40.36% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 4565 4897 333 1 0 43 43 346 0.346 21.37 1.02 PlyA + 5335 5340 6 1.05 2.11 PlyA - 6057 6052 6 1.05 2.10 Term - 30188 30064 125 0 2 100 45 95 0.812 3.97 2.09 Intr - 34067 30325 3743 1 2 60 29 3472 0.263 324.90 2.08 Intr - 34801 34594 208 2 1 73 27 209 0.212 10.51 2.07 Intr - 43952 43889 64 2 1 77 95 64 0.004 3.37 2.06 Intr - 70949 70755 195 2 0 26 22 191 0.020 4.99 2.05 Intr - 71525 71353 173 0 2 91 56 166 0.045 12.44 2.04 Intr - 74868 74740 129 1 0 81 110 131 0.949 14.35 2.03 Intr - 84055 83943 113 0 2 88 107 -18 0.017 -0.80 2.02 Intr - 86711 86605 107 2 2 67 63 34 0.123 -3.11 2.01 Init - 87274 87137 138 1 0 56 103 145 0.980 12.99 2.00 Prom - 93599 93560 40 -6.95 3.00 Prom + 97333 97372 40 -5.55 3.01 Init + 100001 100433 433 1 1 65 110 127 0.877 9.02 3.02 Term + 103581 103669 89 1 2 108 41 106 0.779 4.74 3.03 PlyA + 104650 104655 6 1.05 4.03 PlyA - 104834 104829 6 1.05 4.02 Term - 113882 113783 100 1 1 103 43 105 0.982 4.12 4.01 Init - 114130 113964 167 1 2 36 78 105 0.825 3.45 4.00 Prom - 116960 116921 40 -3.75 5.03 PlyA - 116985 116980 6 1.05 5.02 Term - 127338 127171 168 0 0 28 47 144 0.620 1.20 5.01 Init - 134269 134192 78 1 0 56 116 71 0.423 7.81 5.00 Prom - 134387 134348 40 -7.25 6.09 PlyA - 134697 134692 6 1.05 6.08 Term - 137479 137426 54 1 0 86 49 122 0.929 4.68 6.07 Intr - 139713 139629 85 2 1 124 61 72 0.349 7.00 6.06 Intr - 140943 140846 98 0 2 44 82 21 0.001 -5.01 6.05 Intr - 149603 149431 173 0 2 86 63 80 0.002 4.04 6.04 Intr - 150610 150511 100 1 1 52 52 64 0.002 -2.04 6.03 Intr - 160475 160278 198 2 0 78 84 120 0.402 9.23 6.02 Intr - 167980 167840 141 1 0 70 59 100 0.119 4.93 6.01 Init - 173174 173124 51 2 0 62 76 25 0.050 -0.09 6.00 Prom - 173234 173195 40 -3.55 7.11 PlyA - 175833 175828 6 1.05 7.10 Term - 182938 182355 584 2 2 77 29 185 0.633 5.17 7.09 Intr - 190786 190692 95 0 2 66 100 64 0.168 4.09 7.08 Intr - 196757 196597 161 2 2 6 96 126 0.101 3.06 7.07 Intr - 207654 207468 187 2 1 79 99 48 0.107 3.77 7.06 Intr - 210314 210161 154 0 1 4 67 177 0.240 5.41 7.05 Intr - 211308 211178 131 2 2 18 84 121 0.467 4.12 7.04 Intr - 215712 215626 87 2 0 114 44 37 0.146 0.07 7.03 Intr - 215853 215780 74 0 2 46 94 13 0.097 -5.01 7.02 Intr - 219547 219205 343 0 1 -1 70 262 0.287 10.01 7.01 Init - 219892 219804 89 1 2 83 76 41 0.779 2.57 7.00 Prom - 220960 220921 40 -8.05 8.00 Prom + 222164 222203 40 -3.65 8.01 Init + 223715 223959 245 1 2 56 97 174 0.950 12.36 8.02 Intr + 229310 229499 190 2 1 95 53 58 0.765 1.67 8.03 Term + 231052 231426 375 0 0 91 43 135 0.505 3.05 8.04 PlyA + 231817 231822 6 1.05 9.24 PlyA - 232933 232928 6 1.05 9.23 Term - 234371 234189 183 2 0 56 53 132 0.944 3.06 9.22 Intr - 234850 234719 132 1 0 62 42 122 0.666 4.92 9.21 Intr - 242895 242653 243 0 0 85 46 178 0.265 10.07 9.20 Intr - 243881 243325 557 0 2 25 101 423 0.559 28.93 9.19 Intr - 245808 245620 189 1 0 121 52 92 0.593 7.54 9.18 Intr - 247604 247488 117 0 0 41 80 126 0.025 6.62 9.17 Intr - 257480 257355 126 0 0 42 80 153 0.884 9.63 9.16 Intr - 263316 263209 108 1 0 104 88 51 0.632 6.04 9.15 Intr - 265905 265750 156 1 0 64 86 82 0.698 4.66 9.14 Intr - 267362 267252 111 0 0 80 64 132 0.988 9.43 9.13 Intr - 267583 267455 129 2 0 39 115 107 0.993 8.25 9.12 Intr - 268788 268717 72 1 0 88 115 66 0.750 7.76 9.11 Intr - 277562 277455 108 0 0 93 115 15 0.930 4.04 9.10 Intr - 278491 278384 108 2 0 111 82 112 0.986 12.24 9.09 Intr - 282856 282782 75 2 0 102 80 39 0.418 3.07 9.08 Intr - 285438 285375 64 0 1 49 86 70 0.908 0.27 9.07 Intr - 286465 286284 182 2 2 44 110 124 0.640 8.87 9.06 Intr - 287719 287612 108 2 0 51 82 84 0.589 3.44 9.05 Intr - 290017 289901 117 2 0 57 94 112 0.660 8.22 9.04 Intr - 292907 292746 162 0 0 86 111 34 0.576 4.53 9.03 Intr - 294930 294712 219 1 0 77 86 50 0.312 1.05 9.02 Intr - 295550 295443 108 0 0 81 65 85 0.534 4.84 9.01 Init - 296595 296544 52 0 1 61 13 66 0.195 -2.23 9.00 Prom - 298761 298722 40 -8.75 10.00 Prom + 299132 299171 40 -6.35 10.01 Sngl + 301075 302082 1008 0 0 88 43 700 0.996 62.22 10.02 PlyA + 302310 302315 6 1.05 11.00 Prom + 302479 302518 40 -6.15 11.01 Init + 302572 303862 1291 0 1 44 -14 493 0.754 26.88 11.02 Term + 304155 304765 611 1 2 67 38 283 0.948 14.87 11.03 PlyA + 304941 304946 6 1.05 12.07 PlyA - 305737 305732 6 1.05 12.06 Term - 312794 312778 17 0 2 89 41 18 0.447 -5.38 12.05 Intr - 313384 313201 184 1 1 34 86 147 0.677 7.54 12.04 Intr - 315856 315750 107 2 2 70 47 120 0.725 5.11 12.03 Intr - 317494 317422 73 1 1 63 105 48 0.552 2.06 12.02 Intr - 324123 323960 164 1 2 75 81 87 0.296 5.47 12.01 Intr - 329596 329432 165 2 0 102 48 93 0.437 5.71 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 71067 71004 64 2 1 24 50 170 0.812 3.18 S.002 Term - 255940 255819 122 2 2 92 43 69 0.882 0.46 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:226764694_227095519|GENSCAN_predicted_peptide_1|110_aa MMKLKTTKAISLALPKMVTGSAVHMDTELRTVQVQIISDRQGNRSCCKTARSVFRKQLDL WKTRPRQCRPAAPQGTTEPFRGCPSSQPDMDGRSRHADVVLGFRRVSWEA >gi568815596f:226764694_227095519|GENSCAN_predicted_CDS_1|333_bp atgatgaaactgaagacaactaaggccatcagccttgccctgccaaagatggtcactggc agtgcagtacacatggacactgaactcaggacagtccaggttcaaataatcagtgaccgg caaggaaacagaagctgctgcaagactgcccggagtgtcttcagaaagcaattagatcta tggaagactagaccaaggcaatgcaggcctgcagcaccccaaggaaccacagagccattt cgaggctgtccttccagccaaccagacatggatggaagatctagacatgcagatgttgtt ttaggtttccgacgcgtttcatgggaggcttag >gi568815596f:226764694_227095519|GENSCAN_predicted_peptide_2|1664_aa MAEELKGKTSQQKATNTTATNAKLYYTDDWLLRVKDPKELHEHWFQAQLDHQVALLHTEL RNSNSSLKTENLSPSVVSALGGKEFLILKLVYTQSPTKFTNSPKLPFKSPYWLLSLVASE FHVQLPGIRGSLGTPPNIAAKDQEPVPFNLTVLKASRLLSFTAPREAPTSAQQPQPGRQL PQPGETARDAAHNQRQQPDQTDAETHALYDPPASPRLRAHVLCREASSFRAPAQPEAGEL LPGRVRYRLLGGGWERHSPASGAIADNRRAAYRRVPEPASKARSRTLTFEIQPPRCEKSK SHGETGSAVAARYRFAWKSHFLHPPRWARMGAAEDAPAGGGSSSSSSSSNSNSRSAAVSA TELVFGRLVAAGTVGGVGGGGGSMASPPESDGFSDVRKVGYLRKPKSMHKRFFVLRAASE AGGPARLEYYENEKKWRHKSSAPKRSIPLESCFNINKRADSKNKHLVALYTRDEHFAIAA DSEAEQDSWYQALLQLHNRAKGHHDGAAALGAGGGGGSCSGSSGLGEAGEDLSYGDVPPG PAFKEVWQVILKPKGLGQTKNLIGIYRLCLTSKTISFVKLNSEAAAVVLQLMNIRRCGHS ENFFFIEVGRSAVTGPGEFWMQVDDSVVAQNMHETILEAMRAMSDEFRPRSKSQSSSNCS NPISVPLRRHHLNNPPPSQVGLTRRSRTESITATSPASMVGGKPGSFRVRASSDGEGTMS RPASVDGSPVSPSTNRTHAHRHRGSARLHPPLNHSRSIPMPASRCSPSATSPVSLSSSST SGHGSTSDCLFPRRSSASVSGSPSDGGFISSDEYGSSPCDFRSSFRSVTPDSLGHTPPAR GEEELSNYICMGGKGPSTLTAPNGHYILSRGGNGHRCTPGTGLGTSPALAGDEAASAADL DNRFRKRTHSAGTSPTITHQKTPSQSSVASIEEYTEMMPAYPPGGGSGGRLPGHRHSAFV PTRSYPEEGLEMHPLERRGGHHRPDSSTLHTDDGYMPMSPGVAPVPSGRKGSGDYMPMSP KSVSAPQQIINPIRRHPQRVDPNGYMMMSPSGGCSPDIGGGPSSSSSSSNAVPSGTSYGK LWTNGVGGHHSHVLPHPKPPVESSGGKLLPCTGDYMNMSPVGDSNTSSPSDCYYGPEDPQ HKPVLSYYSLPRSFKHTQRPGEPEEGARHQHLRLSTSSGRLLYAATADDSSSSTSSDSLG GGYCGARLEPSLPHPHHQVLQPHLPRKVDTAAQTNSRLARPTRLSLGDPKASTLPRAREQ QQQQQPLLHPPEPKSPGEYVNIEFGSDQSGYLSGPVAFHSSPSVRCPSQLQPAPREEETG TEEYMKMDLGPGRRAAWQESTGVEMGRLGPAPPGAASICRPTRAVPSSRGDYMTMQMSCP RQSYVDTSPAAPVSYADMRTGIAAEEVSLPRATMAAASSSSAASASPTGPQGAAELAAHS SLLGGPQGPGGMSAFTRVNLSPNRNQSAKVIRADPQGCRRRHSSETFSSTPSATRVGNTV PFGAGAAVGGGGGSSSSSEDVKRHSSASFENVWLRPGELGGAPKEPAKLCGAAGGLENGL NYIDLDLVKDFKQCPQECTPEPQPPPPPPPHQPLGSGESSSTRRSSEDLSAYASISFQKQ PEDRRMHLHSQGTQDAHPTLTSGRESNKHVGAATGGLFRLRIPK >gi568815596f:226764694_227095519|GENSCAN_predicted_CDS_2|4995_bp atggctgaagaactaaaaggtaaaacaagccaacaaaaagccaccaacaccaccgcaaca aacgccaagctttactacactgacgactggctgttgagggtaaaagatccaaaggaacta catgagcattggtttcaggctcagttggatcatcaagttgctctgctgcacacggagctc aggaactctaattctagtcttaagactgaaaatctttctccatctgttgtctctgcattg gggggaaaagagttcttaattctcaagctagtctacactcagagtcccaccaaatttacc aactcaccaaaattaccatttaagagtccctactggttactgtctttagtggcttctgag tttcacgtccagctccccggaatccgaggttccctaggcacgcctcctaatatcgctgcc aaggaccaagagcctgtgcccttcaacctcactgttcttaaagcttctcggctcctgagt tttacagccccccgggaggcgccgacctccgcccagcagccccagcccggccgccagctc ccgcagcccggggaaacggcgagagatgccgctcacaaccaacgacagcagccggaccaa acagacgcggaaactcacgctctctacgacccgcctgcgtccccacgtcttcgcgcgcac gttctctgccgggaagcttcgtccttccgggccccagcgcagccggaggctggggagctg ctgcctggaagggtcagataccgcctgttaggcggagggtgggaaagacactctcctgct tcaggagctatcgcagacaaccggagggcggcttatcgcagagtcccagagccagccagc aaggcaaggtcgcggacactcacctttgaaatccagcctccacgctgtgaaaagtccaag tcacatggagagaccggatctgcagtggctgcccggtatcgtttcgcatggaaaagccac tttctccacccgccgagatgggcccggatgggggctgcagaggacgcgcccgcgggcggc ggcagcagcagcagcagcagcagcagcaacagcaacagccgcagcgccgcggtctctgcg actgagctggtatttgggcggctggtggcggctgggacggttgggggcgttggtggtggc ggtggcagcatggcgagccctccggagagcgatggcttctcggacgtgcgcaaggtgggc tacctgcgcaaacccaagagcatgcacaaacgcttcttcgtactgcgcgcggccagcgag gctgggggcccggcgcgcctcgagtactacgagaacgagaagaagtggcggcacaagtcg agcgcccccaaacgctcgatcccccttgagagctgcttcaacatcaacaagcgggctgac tccaagaacaagcacctggtggctctctacacccgggacgagcactttgccatcgcggcg gacagcgaggccgagcaagacagctggtaccaggctctcctacagctgcacaaccgtgct aagggccaccacgacggagctgcggccctcggggcgggaggtggtgggggcagctgcagc ggcagctccggccttggtgaggctggggaggacttgagctacggtgacgtgcccccagga cccgcattcaaagaggtctggcaagtgatcctgaagcccaagggcctgggtcagacaaag aacctgattggtatctaccgcctttgcctgaccagcaagaccatcagcttcgtgaagctg aactcggaggcagcggccgtggtgctgcagctgatgaacatcaggcgctgtggccactcg gaaaacttcttcttcatcgaggtgggccgttctgccgtgacggggcccggggagttctgg atgcaggtggatgactctgtggtggcccagaacatgcacgagaccatcctggaggccatg cgggccatgagtgatgagttccgccctcgcagcaagagccagtcctcgtccaactgctct aaccccatcagcgtccccctgcgccggcaccatctcaacaatcccccgcccagccaggtg gggctgacccgccgatcacgcactgagagcatcaccgccacctccccggccagcatggtg ggcgggaagccaggctccttccgtgtccgcgcctccagtgacggcgaaggcaccatgtcc cgcccagcctcggtggacggcagccctgtgagtcccagcaccaacagaacccacgcccac cggcatcggggcagcgcccggctgcaccccccgctcaaccacagccgctccatccccatg ccggcttcccgctgctcgccttcggccaccagcccggtcagtctgtcgtccagtagcacc agtggccatggctccacctcggattgtctcttcccacggcgatctagtgcttcggtgtct ggttcccccagcgatggcggtttcatctcctcggatgagtatggctccagtccctgcgat ttccggagttccttccgcagtgtcactccggattccctgggccacaccccaccagcccgc ggtgaggaggagctaagcaactatatctgcatgggtggcaaggggccctccaccctgacc gcccccaacggtcactacattttgtctcggggtggcaatggccaccgctgcaccccagga acaggcttgggcacgagtccagccttggctggggatgaagcagccagtgctgcagatctg gataatcggttccgaaagagaactcactcggcaggcacatcccctaccattacccaccag aagaccccgtcccagtcctcagtggcttccattgaggagtacacagagatgatgcctgcc tacccaccaggaggtggcagtggaggccgactgccgggacacaggcactccgccttcgtg cccacccgctcctacccagaggagggtctggaaatgcaccccttggagcgtcgggggggg caccaccgcccagacagctccaccctccacacggatgatggctacatgcccatgtcccca ggggtggccccagtgcccagtggccgaaagggcagtggagactatatgcccatgagcccc aagagcgtatctgccccacagcagatcatcaatcccatcagacgccatccccagagagtg gaccccaatggctacatgatgatgtcccccagcggtggctgctctcctgacattggaggt ggccccagcagcagcagcagcagcagcaacgccgtcccttccgggaccagctatggaaag ctgtggacaaacggggtagggggccaccactctcatgtcttgcctcaccccaaaccccca gtggagagcagcggtggtaagctcttaccttgcacaggtgactacatgaacatgtcacca gtgggggactccaacaccagcagcccctccgactgctactacggccctgaggacccccag cacaagccagtcctctcctactactcattgccaagatcctttaagcacacccagcgcccc ggggagccggaggagggtgcccggcatcagcacctccgcctttccactagctctggtcgc cttctctatgctgcaacagcagatgattcttcctcttccaccagcagcgacagcctgggt gggggatactgcggggctaggctggagcccagccttccacatccccaccatcaggttctg cagccccatctgcctcgaaaggtggacacagctgctcagaccaatagccgcctggcccgg cccacgaggctgtccctgggggatcccaaggccagcaccttacctcgggcccgagagcag cagcagcagcagcagcccttgctgcaccctccagagcccaagagcccgggggaatatgtc aatattgaatttgggagtgatcagtctggctacttgtctggcccggtggctttccacagc tcaccttctgtcaggtgtccatcccagctccagccagctcccagagaggaagagactggc actgaggagtacatgaagatggacctggggccgggccggagggcagcctggcaggagagc actggggtcgagatgggcagactgggccctgcacctcccggggctgctagcatttgcagg cctacccgggcagtgcccagcagccggggtgactacatgaccatgcagatgagttgtccc cgtcagagctacgtggacacctcgccagctgcccctgtaagctatgctgacatgcgaaca ggcattgctgcagaggaggtgagcctgcccagggccaccatggctgctgcctcctcatcc tcagcagcctctgcttccccgactgggcctcaaggggcagcagagctggctgcccactcg tccctgctggggggcccacaaggacctgggggcatgagcgccttcacccgggtgaacctc agtcctaaccgcaaccagagtgccaaagtgatccgtgcagacccacaagggtgccggcgg aggcatagctccgagactttctcctcaacacccagtgccacccgggtgggcaacacagtg ccctttggagcgggggcagcagtagggggcggtggcggtagcagcagcagcagcgaggat gtgaaacgccacagctctgcttcctttgagaatgtgtggctgaggcctggggagcttggg ggagcccccaaggagccagccaaactgtgtggggctgctgggggtttggagaatggtctt aactacatagacctggatttggtcaaggacttcaaacagtgccctcaggagtgcacccct gaaccgcagcctcccccacccccaccccctcatcaacccctgggcagcggtgagagcagc tccacccgccgctcaagtgaggatttaagcgcctatgccagcatcagtttccagaagcag ccagaggaccggagaatgcacttacattctcagggcacacaagatgctcaccccacactg acatctggcagagagtcaaacaaacatgtaggagcagccacaggagggctttttcgtttg agaattcccaagtga >gi568815596f:226764694_227095519|GENSCAN_predicted_peptide_3|173_aa MQRRSRGINTGLILLLSQIFHVGINNIPPVTLATLALNIWFFLNPQKPLYSSCLSVEKCY QQKDWQRLLLSPLHHADDWHLYFNMASMLWKGINLERRLGSRWFAYVITAFSVLTGVVYL LLQFAVAEFMDEPDFKRSCAVGFSEKDLEIRKVKRLINKQFVVDTSLVRIQAP >gi568815596f:226764694_227095519|GENSCAN_predicted_CDS_3|522_bp atgcaacggagatcaagagggataaatactggacttattctactcctttctcaaatcttc catgttgggatcaacaatattccacctgtcaccctagcaactttggccctcaacatctgg ttcttcttgaaccctcagaagccactgtatagctcctgccttagtgtggagaagtgttac cagcaaaaagactggcagcgtttactgctctctccccttcaccatgctgatgattggcat ttgtatttcaatatggcatccatgctctggaaaggaataaatctagaaagaagactggga agtagatggtttgcctatgttatcaccgcattttctgtacttactggagtggtatacctg ctcttgcaatttgctgttgccgaatttatggatgaacctgacttcaaaaggagctgtgct gtaggtttctcagagaaggacttggaaattcggaaggttaagagacttatcaacaaacag tttgtggtagacaccagccttgtgaggattcaggccccctga >gi568815596f:226764694_227095519|GENSCAN_predicted_peptide_4|88_aa MVAITTSTEQVSPNLVGPKKRFLGVTDSVGQEFRRSTRGGRSLLHNVSSLSRENWSLVTS GKLYAFDMGSRDKHFNSQGKKLLVSEAM >gi568815596f:226764694_227095519|GENSCAN_predicted_CDS_4|267_bp atggttgccattaccacttctacagaacaagtctccccaaatctagtgggaccaaagaaa cgttttcttggagtcacagattctgtgggtcaggaattcagaaggagcacacggggtggt cggtctctgctccataatgtcagcagcctcagcagagaaaactggagcctggtgacatcg ggaaagctgtatgcttttgacatgggctccagagacaagcatttcaactcacaaggcaag aaactgcttgtttcagaagcaatgtga >gi568815596f:226764694_227095519|GENSCAN_predicted_peptide_5|81_aa MYAEQMTSLRTQLGDTVLDKVKEQGKCHSQRNTAPPPVEAQEVQIVPQLPGSQSTGMRPA LANQIPFVSNTKSSDDSPVKH >gi568815596f:226764694_227095519|GENSCAN_predicted_CDS_5|246_bp atgtatgcagagcaaatgacctcattaaggacgcaacttggagacacagtgttggacaag gtaaaggaacaaggtaagtgtcacagtcaaagaaacacagccccacctccagtggaggct caagaggtgcagatagtccctcagcttcctggcagccagagcacaggcatgcgacctgca ctggccaatcagataccctttgtatctaatacaaagtctagtgatgacagtccagtgaag cactga >gi568815596f:226764694_227095519|GENSCAN_predicted_peptide_6|299_aa MVSTQMPINDRLDKENVTTKHRATSNCTILPGEGQGSGRRTYPSHSTKRGPETEVQADTE KNPLKGYTNLHSYQQHIEMSVITTNIKCGEMEGGGAKRPNRKLYRSSFPEEHQTEELSAE KITFIRTKDHTSAENARPQLFKFYTFLKVQLKCYLLCENLIVLVLLCTCHSSWVLLHSDL ACHADNNPDIQSCLKAQGKGSRTQLWTVVLYFLFCISKDKRCIPHTAISYPPEAEGKWCH CHPLKSTGSHRILRMCHSGVTCLFGDRACLNPLREGEHTDGQLHVEDAIRSSIQGQSCN >gi568815596f:226764694_227095519|GENSCAN_predicted_CDS_6|900_bp atggtatcaacccaaatgcccatcaatgatagactggataaagaaaatgtgactacaaag catagagcaacaagtaattgcactatactaccgggagagggacaagggagtggaaggaga acctatccgagtcacagcacaaaacgaggacctgaaactgaagttcaagcagataccgag aaaaaccctctgaaaggttataccaatttgcattcatatcagcagcatatagaaatgtct gttatcactaccaacattaaatgtggtgagatggagggaggtggagcaaaacggccaaat agaaagctttacagatcatccttcccagaggaacaccaaactgaagaactatctgcagaa aaaatcaccttcataagaaccaaagatcatacttctgcagagaatgcccgtccacagctc ttcaagttctacacattcttgaaggtccaactcaaatgctatctcctctgtgaaaatctg atagttctagtcctgctgtgtacgtgtcatagttcctgggtgcttcttcatagtgatctg gcctgccatgcggataataatcctgatatccagagctgcctaaaggcacaaggaaaaggt tctagaacacagctatggactgttgttttatacttcctgttttgtataagcaaagacaag cgatgcatcccacacacggccatcagttatcctcctgaggcagaaggcaagtggtgtcat tgccatcctctgaaatccactggttctcacagaatactccggatgtgccacagtggtgta acttgtctgtttggtgaccgtgcatgcttaaaccccttacgggaaggggagcacacagat gggcagctccacgtggaagatgccatccgaagcagcattcagggacaatcctgcaactag >gi568815596f:226764694_227095519|GENSCAN_predicted_peptide_7|634_aa MDSIFVFPQNPYVEVQTLNVMACGGGAFGRWPKLTKTASLKLAAWQLVVQQVLAVTAAGQ TPSDQSGEQMCFLVEKKAHLSWDSVLSGGFIIAGDSSSLPPRDTMPAAWLTLLYEQGQFL HSQITRSPTPLRFAQALKSSVQLKNTRGVFRMVDLTCSCQRWEMEVQRRHFILLRTLLTK SGVDTGVLQSCPSATSLWRPIFNWNHECDCSPDLDTEHNNPGATYITPAAKCTQSEPSEG QGVTAADSRVAIRAVELGLICATGAQPGRDTWPENGNGSTKISRMSLSHALARSGLFPSM IDWIKKMWHIYTMEYYAAIQKNEIMSFAGTWMKLEAIILSKLTQEQKTKHRMFSLKFPKL IEIEVKRTISISDLSDERKHVEEEKEKEVRKGNRPSAFLVPVPSISYLRSVTLDAKKLHI VQQRVTSKQHSEKLYMPLGQELEKTTLKFIWNQKRACIAKSILSQKNKAGGITLPDFKLY YKATVTKTAWYWYQNRDIDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCWE NWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKDFMSKT PKAMATKDKIDKWDLIKLKSFCTAKETTIRVNNT >gi568815596f:226764694_227095519|GENSCAN_predicted_CDS_7|1905_bp atggactcaatatttgtgttccctcaaaatccatacgttgaagtccaaaccctcaatgtg atggcatgtggaggtggggcctttgggaggtggcctaaactgactaagacagcatctttg aagttggcagcttggcaactggtagttcagcaggtgctggctgtcacagctgctgggcag acccccagtgatcagagcggtgaacagatgtgctttctggtggagaagaaagcacatctt tcttgggactctgtcctaagtggtggcttcataattgccggagacagcagctccctccct ccaagggacacaatgccagctgcatggctcaccctcctatacgagcagggacagttcctc cattcacaaataacaagatcccccacaccattgcgttttgcccaggctctcaaatcttct gttcagctaaagaacacgaggggtgtcttcaggatggtagatctgacttgctcatgtcag agatgggaaatggaagttcagagaaggcatttcattctgcttagaaccttgcttaccaaa tctggagtggataccggtgtcttacaaagctgcccttctgctacctcgctctggagacct attttcaactggaaccatgagtgtgactgttccccagatctagatacagaacacaacaac cctggggccacttacattactcctgcagccaaatgcactcaatcagaaccctcagaggga caaggagtcactgctgctgattcacgtgtggccatcagagcagtggaactaggcctcatc tgtgccacaggagcacagcctggaagagatacatggcctgaaaatggtaatgggagcacc aagatctccaggatgtccctgagccacgcactggcgagaagtgggctttttccatcaatg attgactggataaagaaaatgtggcacatatacaccatggaatactatgcagccatacaa aagaatgagatcatgtcctttgcagggacatggatgaagctggaagccatcattctcagc aaactaacacaggaacagaaaaccaaacaccgcatgttctcactcaaatttccaaaattg attgaaatagaagtaaaaagaacgatttctataagtgacctaagtgatgagagaaaacat gtggaagaggagaaagagaaagaggtaagaaaaggaaacagacccagtgctttcctggtg ccagtgccctccatctcctatctcagatcagtgaccctagatgctaaaaagctgcatatt gtacaacaaagagtaacatcaaaacaacattcagaaaagctgtatatgcccctggggcaa gaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcctgcatcgccaag tcaatcctaagccaaaagaacaaagctggaggcatcacactacctgacttcaaactatac tacaaggctacagtcaccaaaacagcatggtactggtaccaaaacagagatatagatcaa tggaacagaacagagccctcagaaataacgccgcatatctacaactatctgatctttgac aaacctgagaaaaataagcaatggggaaaggattccctatttaataaatggtgctgggaa aactggctagccatatgtaggaagctgaaactggatcccttccttacaccttatacaaaa atcaattcaagatggattaaagatttaaacgttagacctaaaaccataaaaaccctagaa gaaaacctaggcattaccattcaggacataggcatgggcaaggacttcatgtccaaaaca ccaaaagcaatggcaacaaaagacaaaattgacaaatgggatctaattaaactaaagagc ttctgcacagcaaaagaaactaccatcagagtgaacaatacctag >gi568815596f:226764694_227095519|GENSCAN_predicted_peptide_8|269_aa MPGKPEAGRSLDLRNPLRLQGPCSGVEQRLRPCEVDSSCPHCARLALSADWCGICIRNIR RGRGLQLPLRADTLALELLRGSQASAARTFGFCKWVQGHVCWLLLLRAGLRAPSQSASSN SSMQFPSWAMSCVRTSLGSIGSHKQVTSSMKRPVSTLPSLTADSAMPLPRSAFEDCDLHQ EVLLTPVGKISPSFCLECPVFDLAAGVEPSPRPPGALPLWLSRNRMWLPWPNVVLLSPTP APQLLSSSGPCVWLVFAGLSLCSGIALPL >gi568815596f:226764694_227095519|GENSCAN_predicted_CDS_8|810_bp atgcctgggaagcctgaagcaggccgcagtttggacctaaggaatcctttgaggctgcag ggtccctgtagtggagttgagcaaagactcaggccttgcgaggtggatagcagctgcccg cactgtgccaggctggccctctctgcggactggtgtggaatctgcatcagaaacatcaga agaggccgggggctccagctgcccctccgagcagacacattggccctggagcttctgaga ggaagccaggctagtgctgccagaacatttgggttctgcaaatgggtccaaggccatgtg tgctggctgctgcttttgagagctggactgcgggcaccttcccagtctgcctcttctaat tcctccatgcagttcccttcatgggcaatgagttgtgtcagaacctcactgggcagcatt ggaagtcataagcaggtcacttcctccatgaagagaccagtttccacgctcccatctctc actgctgactcagcgatgcctctgcctcggtctgcttttgaagactgtgaccttcaccag gaggttttacttacaccagtcgggaagattagtccctcattctgcctggagtgccccgtg tttgacttggcagcgggtgtggagccatccccgcgtcctcctggcgcattgccactgtgg ctgtccaggaacaggatgtggctgccttggccgaatgttgtcctactctccccaaccccg gcgcctcagctcctcagctcctcgggcccctgcgtctggctggtgtttgcagggctttcg ctctgctctggtattgctctgcctttatag >gi568815596f:226764694_227095519|GENSCAN_predicted_peptide_9|1141_aa MARVRKYNSVSAEWRIAGPKGFPGPQGAPGLSGSDGHKGRPGTPGTAEIPGPPGFRGDMG DPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPGDPAFGHLGPPGKRGLSGVPGIKGPRGDP GCPGAEGPAGIPGFLGLKGPKGREGHAGFPGVPGPPGHSCERGAPGIPGQPGLPGYPGSP GFPGERGKPGAEGCPGAKGEPGEKGMSGLPGDRGLRGAKGAIGPPGDEGEMAIISQKGTP GEPGPPGDDGFPGERGDKGTPGMQGRRGEPGRYGPPGFHRGEPGEKGQPGPPGPPGPPGS TGLRGFIGFPGLPGDQGEPGSPGPPGFSGIDGARGPKGNKGDPASHFGPPGPKGEPGSPG CPGHFGASGEQGLPGIQGPRGSPGRPGPPGSSGPPGCPGDHGMPGLRGQPGEMGDPGPRG LQGDPGIPGPPGIKGPSGSPGLNGLHGLKGQKGTKGASGLHDVGPPGPVGIPGLKGERGD PGSPGISPPGPRGKKGPPGPPGSSGPPGPAGATGRAPKDIPDPGPPGDQGPPGPDGPRGA PGPPGLPGSVDLLRGEPGDCGLPGPPGPPGPPGPPGYKGFPGCDGKDGQKGPVGFPGPQG PHGFPGPPGEKGLPGPPGRKGPTGLPGEPGPPADVDDCPRIPGLPGAPGMRGPEGAMGLP GMRGPSGPGCKGEPGLDGRRGVDGVPGSPGPPGRKGDTGEDGYPGGPGPPGPIGDPGPKG FGPGYLGGFLLVLHSQTDQEPTCPLGMPRLWTGYSLLYLEGQEKAHNQDLGHELPEHRDV LGAHLQHLAPCLARVRDEGMRAGPQDQGGVRNGFLTTEAGFGEGLQISEHVSAKARWRKE GSGNTYMGCLDTSCLFFPSQGLAGSCLPVFSTLPFAYCNIHQVCHYAQRNDRSYWLASAA PLPMMPLSEEAIRPYVSRCAVCEAPAQAVAVHSQDQSIPPCPQTWRSLWIGYSFLMHTGA GDQGGGQALMSPGSCLEDFRAAPFLECQGRQGTCHFFANKYSFWLTTVKADLQFSSAPAP DTLKESQAQRQKISRCQKEKMLAQTHPTQTCKKHMARFGTTHRDSLVTLLGRVGILSFVY QHNQQFQVVCGNCVDVVVSVTGFRWGLDEEDLNGPRERKELFHVKAQHRMSQDQEHSQPE Q >gi568815596f:226764694_227095519|GENSCAN_predicted_CDS_9|3426_bp atggcaagagtgcggaaatataattccgtatcagcagagtggagaattgcaggtccgaag ggatttccaggtccccaaggtgcccctgggctgagtggttcagatgggcataaaggcaga cctggcacaccaggaacagcggaaataccaggtccacctggttttcgtggtgacatggga gatccgggttttggaggtgaaaaggggtcctcccctgttgggcccccaggccctcccggc tcaccaggagtgaatggtcagaaaggaatcccgggagaccctgcatttggtcacctggga cccccgggaaagaggggtctttcaggagtgccagggataaaaggacccagaggtgatccg ggatgtccaggggctgaagggccagctggcattcctggattcctaggtctcaaaggtccc aaaggcagagagggacatgctgggtttccaggtgtcccaggtccacctggccattcctgt gaaagaggtgctccagggataccagggcaaccgggactccctgggtatccaggtagccca ggttttcccggagaaagaggaaagcctggtgcagagggatgtcctggcgcaaagggagaa cctggagagaagggcatgtctggccttcctggagaccggggactgagaggggccaaagga gccataggacctcccggagatgaaggagaaatggctatcatttcacaaaagggaacacct ggggaacctggacctcctggagatgatggattcccaggagaaagaggtgataaaggaact cccgggatgcaagggagaagaggagagccgggaagatacggaccacctggatttcacaga ggggaacctggtgagaaaggtcagccagggcctcctggacccccaggccctccaggctca actggtctaagagggttcattggttttccaggacttccaggtgaccagggtgagccaggt tctccaggtccccctggattttcaggaattgatggagcaagaggacctaaaggaaacaaa ggtgaccctgccagtcactttggtccacctggtccaaagggtgagccaggtagccctgga tgtccagggcattttggagcatccggagagcagggcttgcctggtattcaagggcccaga ggatcacctggaaggccagggccacctggctcctctggaccaccagggtgcccaggtgat cacgggatgcctgggctgaggggacagccaggagaaatgggagaccctgggccaagaggc ctccagggggatccagggataccaggtcctccgggaataaaaggtccctccggatcacct ggcctgaacggcttgcatggattgaaaggtcagaaaggaactaaaggtgcttcaggtttg catgatgtggggccacctggtccagtgggaatacctgggctaaaaggggagagaggagac cctgggagcccaggaatctctcctccaggtcctcgtggaaagaaaggtcccccaggaccc ccagggagttcaggaccacctggtcctgcaggtgccacaggaagagctcctaaggacatt cctgacccgggtccacctggagatcagggacctcctggtcctgatggcccaagaggagca cctgggcctccaggcctccctgggagtgttgaccttctgagaggggagccaggtgactgt ggtctaccagggccaccaggtccccctggcccaccaggccctccaggatacaaaggcttt ccaggatgtgatggaaaagatggccagaaaggaccagtgggattcccgggaccgcaggga ccacatggatttcctgggccacctggagagaagggtttacctggacctccagggagaaaa gggcccactggtcttccgggtgaaccggggccacctgcagatgtggatgactgtccccga atcccaggccttcctggggcgccaggcatgagaggaccagaaggagccatggggctccct ggaatgagaggcccctcaggaccagggtgcaaaggagagcctgggctggatggcaggagg ggtgtggatggcgtccctgggtctcctgggcctcccggacgtaaaggtgacacaggagaa gacggctaccctggaggaccagggcctcctggtcccattggggatcctgggcccaaaggg tttggccctggatacctcggtggcttcctcctggttctccacagtcagacggaccaggag cccacctgccccctgggcatgcccaggctctggactgggtatagtctgttatacctggaa gggcaagagaaagctcacaatcaagaccttggccatgagcttcctgaacacagagatgtg cttggtgcccacctccagcacctggccccctgcttggctcgagtgagggatgaggggatg agggctggcccacaggaccagggaggtgtcagaaatggcttcctcaccacagaagcaggc tttggcgagggcctccagatttcagaacacgtatctgcaaaggccagatggaggaaggaa gggtctgggaacacctacatggggtgcttggacaccagctgtctcttcttcccttcccaa ggtctggcagggtcttgccttcccgtatttagcacgctgccctttgcctactgcaacatc caccaggtgtgccactatgcccagagaaacgacagatcctactggctggccagcgctgcg cccctccccatgatgccactctctgaagaggcgatccgcccctatgtcagccgctgtgcg gtatgcgaggccccggcccaggcggtggcggtgcacagccaggaccagtccatcccccca tgtccgcagacctggaggagcctctggatcgggtattcattcctgatgcacacaggagct ggggaccaaggaggagggcaggcccttatgtcacctggcagctgcctggaagatttcaga gcagcaccattccttgaatgccagggccggcagggaacttgccactttttcgcaaataag tatagcttctggctcacaacggtgaaagcagacttgcagttttcctctgctccagcacca gacaccttaaaagaaagccaggcccaacgccagaaaatcagccggtgccagaaggaaaaa atgttagcacaaacacatccaacgcagacctgtaaaaagcacatggcaagattcggtact actcacagggactccttggtgacacttcttggtcgtgtgggaatcttgagctttgtgtac cagcataaccagcaattccaagtggtatgtggaaactgtgtagatgtggttgtgtctgtt acaggattcagatgggggctggatgaggaggatctgaatgggccaagagagaggaaagag ctgttccatgtaaaagcacagcacagaatgtcacaagatcaggaacactcacagcctgag cagtga >gi568815596f:226764694_227095519|GENSCAN_predicted_peptide_10|335_aa MGKKQSRKTGNSKNQSTSPLPKERSSSPAMEQSWTENHFDELREEGFRRSNYSELRGNIQ TKGKEVENFEKNLEECITIITNIEKCLKELMELKTKPRELREECRSLRSRCDQLEQRVSA MEDEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLHLIGVPESDRENGTKLENTLQDIIQ ENFPNLARQANIQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRATREKGRVTLK GKPIRLTVDLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFTDKQML RDFVTTRPALKELLKEALNMERNNRYQLLQNHAKM >gi568815596f:226764694_227095519|GENSCAN_predicted_CDS_10|1008_bp atggggaaaaaacagagcagaaaaactggaaactctaaaaaccagagcacctctcctctt ccaaaggaacgaagttcctcgccagcaatggaacaaagctggacggagaatcactttgac gagctgagagaagaaggcttcagacgatcaaattactccgagctacgggggaacattcaa accaaaggcaaagaagttgaaaactttgaaaagaatttagaagaatgtataactataata accaatatagagaagtgcttaaaggagctgatggagctgaaaaccaagcctcgagaacta cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaacaaagggtatcagcg atggaagatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaaagaaatgag caaagcctccaagaaatatgggactatgtgaaaagaccaaatctacatctgattggtgta cctgaaagtgacagggagaatggaaccaagttggaaaacactctgcaggatattatccag gagaacttccccaatctagcaaggcaggccaacattcagattcaggaaatacagagaacg ccacaaagatactcctcgagaagagcaactccaagacacataattgtcagattcaccaaa gttgaaatgaaggaaaaaatgttaagggcaaccagagagaaaggtcgggttaccctcaaa gggaagcccatcagactaacagtggatctttcggcagaaactctacaagccagaagagag tgggggccaatattcaacattctcaaagaaaagaattttcaacccagaatttcatatcca gccaaactaagcttcataagtgaaggagaaataaaatactttacagacaagcaaatgctg agagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagcgctaaacatg gaaaggaacaaccggtaccagctgctgcaaaatcatgccaaaatgtaa >gi568815596f:226764694_227095519|GENSCAN_predicted_peptide_11|633_aa MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS KIDHILGSKAFLSKCKRTEIITNYLSDHNAIKLELRIKNLTQNHSTTWKLNNLLLNDYWV HNEMKAEIKMFFETNENKDTTYQNLWDALKAVCRGKFIALNALKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKIRAGLKEIETQKTLQKINESRSWFFERINKIDRPLARLIK KKREKNQIDAIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLN QEEVESLNRPITGSEIVAIINSLPTKKGPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKE GILPNSFYEVSIILIPKPGRDTTKKENFRPISLMNIDAKILKKYWQTESSSTSKSLSTMI KWASPLGCKAVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLK LISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDL FKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKAIYRFNAIPIKLPMTFFTEL EKNYFKVHMEPKKSRHRQVNPKPKEQSRRHHTT >gi568815596f:226764694_227095519|GENSCAN_predicted_CDS_11|1902_bp atgggagactttaacactccactgtcaacattagacagatcaacgagacagaaagtcaac aaggatacacaggaattgaactcagctctgcaccaagcagacctaatagacatctacaga actctccaccccaaatcaacagaatatacatttttttctgcaccacaccacacctattcc aaaattgaccacatacttggaagtaaagcattcctcagcaaatgtaaaagaacagaaatt ataacaaactatctctcagaccacaatgcaatcaaactagaactcaggattaagaatctc actcaaaaccactcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagagataaagatgttctttgaaaccaacgagaacaaagacaca acataccagaatctctgggatgcactcaaagcagtgtgtagagggaaatttatagcacta aatgccctcaagagaaagcaggaaagatctaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atcagagcaggactgaaagaaatagagacacaaaaaacccttcaaaaaattaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaag aaaaaaagagagaagaatcaaatagatgcaataaaaaatgataaaggggatatcaccacc gatcccacagaaatacaaactaccatcagagaatactacaaacacctctacgcaaataaa ctagaaaatctagaagaaatggataaattcctcgacacatacactctcccaagactaaac caggaagaagttgaatctctgaatagaccaataacaggatctgaaattgtggcaataatc aatagcttaccaactaaaaagggtccaggaccagatggattcacagctgaattctaccag aggtacaaggaggaactggtaccattccttctgaaactattccaatcaatagaaaaagag ggaatcctccctaactcattttatgaggtcagcatcatcctgataccaaagccaggcaga gacacaaccaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatc ctcaaaaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgatc aagtgggcttcacccctgggatgcaaggctgtgttggaagttctggccagggcaattagg caggagaaggaaataaagggtattcaattgggaaaagaggaagtcaaattgtccctgttt gcagacgacatgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaag ctgataagtaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagca ttcttatacaccaacaacagacaaacagagagccaaatcatgagtgaactcccattcaca attgcttcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctc ttcaaggagaactacaaaccactgctcaaggaaataaaagaggatacaaacaaatggaag aatattccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaag gcaatttacagattcaatgccatccccatcaagctaccaatgactttcttcacagaattg gaaaaaaactactttaaagttcatatggaaccaaaaaagagccggcatcgccaagtcaat cctaagccaaaagaacaaagccggaggcatcacactacctga >gi568815596f:226764694_227095519|GENSCAN_predicted_peptide_12|236_aa XMIGPPGPQGFPGLPGLPGEAGIPGRPDSAPGKPGKPGSPGLPGAPGLQGLPGSSGNEGL CACEPGPMGPPGPPGLPGRQGSKGDLGLPGWLGTKGDPGPPGAEGPPGLPGKHGASGPPG NKGAKGDMVVSRVKGHKGERGPDGPPGFPGQPGSHGRDGHAGEKGDPGPPGDHEDATPGG KGFPGPLGPPGKAGPVGPPGLGFPGPPGERGHPGVPGHPGVRGPDGLKGQKGLDIM >gi568815596f:226764694_227095519|GENSCAN_predicted_CDS_12|711_bp ngcatgataggaccccctgggccacaaggatttcctggtcttcctgggcttccaggagaa gctggtattcctgggagacctgattctgctccaggaaaaccagggaagccaggatcacct ggcttgcctggagcaccaggcctgcagggcctcccaggatcaagtggaaatgaaggactc tgtgcctgtgagcctggacccatgggcccccctggccctccaggacttcctgggaggcag gggagtaagggagacttggggctccctggctggcttggaacaaaaggtgacccaggacct cctggtgctgaaggacctccagggctaccaggaaagcatggtgcctctggaccacctggc aacaaaggggcgaagggtgacatggttgtatcaagagttaaagggcacaaaggagaaaga ggtcctgatgggcccccaggatttccagggcagccaggatcacatggtcgggatggacat gctggagaaaaaggggatccaggacctccaggggatcatgaagatgcgaccccaggtggt aaaggatttcctggacctctgggccccccaggcaaagcaggacctgtggggcccccagga ctgggatttcctggtccaccaggagagcgaggccacccaggagttccaggccacccaggt gtgaggggccctgatggcttgaagggtcagaaaggtctcgatattatgtga