GENSCAN 1.0 Date run: 3-Nov-116 Time: 15:49:14 Sequence gi568815587r:111686400_111971482 : 285083 bp : 42.48% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1602 1763 162 1 0 117 93 187 0.846 21.13 1.02 Intr + 14487 14611 125 1 2 62 19 162 0.668 6.08 1.03 Intr + 15053 15176 124 1 1 78 84 76 0.994 5.44 1.04 Intr + 16696 17024 329 2 2 84 88 319 0.823 25.99 1.05 Intr + 18588 18740 153 2 0 88 90 166 0.911 16.15 1.06 Intr + 25812 25976 165 2 0 79 102 115 0.455 11.24 1.07 Intr + 33376 33604 229 0 1 85 96 184 0.983 15.42 1.08 Intr + 34079 34363 285 0 0 81 84 223 0.999 17.59 1.09 Intr + 34500 34663 164 1 2 96 94 86 0.966 8.77 1.10 Intr + 35431 35541 111 0 0 84 94 139 0.996 13.76 1.11 Intr + 36266 36357 92 1 2 113 66 101 0.998 8.27 1.12 Intr + 37097 37711 615 2 0 29 79 666 0.348 50.34 1.13 Intr + 38355 38605 251 0 2 45 44 164 0.329 3.86 1.14 Intr + 40938 41080 143 1 2 105 13 73 0.115 0.65 1.15 Intr + 43902 43954 53 2 2 90 83 26 0.048 -0.91 1.16 Intr + 45594 45756 163 0 1 50 75 137 0.176 7.66 1.17 Intr + 47097 47397 301 2 1 55 59 145 0.019 3.68 1.18 Term + 51069 51547 479 1 2 101 48 232 0.821 14.52 1.19 PlyA + 53360 53365 6 1.05 2.13 PlyA - 53833 53828 6 -0.45 2.12 Term - 56266 56120 147 1 0 90 44 81 0.671 0.82 2.11 Intr - 57131 56977 155 0 2 62 101 100 0.960 7.57 2.10 Intr - 61615 61555 61 1 1 79 101 50 0.982 2.79 2.09 Intr - 65933 65760 174 2 0 56 105 215 0.931 19.21 2.08 Intr - 67178 67044 135 2 0 67 90 32 0.589 1.14 2.07 Intr - 68695 68588 108 1 0 66 32 127 0.463 4.46 2.06 Intr - 69051 68896 156 0 0 72 88 186 0.996 16.29 2.05 Intr - 73552 73405 148 0 1 82 97 191 0.998 18.72 2.04 Intr - 74652 74420 233 0 2 45 72 201 0.999 9.85 2.03 Intr - 78506 78406 101 0 2 93 105 64 0.994 7.51 2.02 Intr - 78985 78895 91 1 1 58 91 48 0.996 0.75 2.01 Init - 79962 79849 114 0 0 91 84 118 0.906 12.24 2.00 Prom - 93094 93055 40 -4.65 3.12 PlyA - 93156 93151 6 1.05 3.11 Term - 100121 99998 124 1 1 105 43 109 0.343 4.98 3.10 Intr - 123374 123244 131 2 2 73 77 47 0.003 0.67 3.09 Intr - 149895 149766 130 2 1 96 64 143 0.844 12.48 3.08 Intr - 151216 151069 148 2 1 131 108 126 0.992 17.27 3.07 Intr - 158324 158202 123 0 0 126 101 50 0.991 9.74 3.06 Intr - 167086 166981 106 1 1 93 91 79 0.955 7.57 3.05 Intr - 170341 170308 34 0 1 85 91 -3 0.421 -2.99 3.04 Intr - 174236 174148 89 2 2 103 111 76 0.900 9.25 3.03 Intr - 182337 182203 135 0 0 91 116 13 0.918 4.24 3.02 Intr - 183971 183833 139 1 1 81 50 87 0.935 3.75 3.01 Init - 185083 184953 131 1 2 77 53 144 0.743 9.56 3.00 Prom - 185883 185844 40 -3.95 4.06 PlyA - 187195 187190 6 1.05 4.05 Term - 189705 188523 1183 2 1 46 29 621 0.973 42.24 4.04 Intr - 190240 190082 159 0 0 76 91 78 0.924 5.08 4.03 Intr - 190612 190409 204 0 0 80 82 163 0.858 12.29 4.02 Intr - 192313 192157 157 2 1 38 100 145 0.999 8.85 4.01 Init - 192733 192562 172 1 1 99 59 134 0.999 9.25 4.00 Prom - 193731 193692 40 -10.35 5.00 Prom + 194232 194271 40 -6.35 5.01 Init + 195124 195216 93 0 0 70 64 61 0.829 2.33 5.02 Intr + 195969 196199 231 2 0 86 89 162 0.995 13.15 5.03 Term + 197372 197461 90 1 0 123 42 69 0.865 2.54 5.04 PlyA + 197649 197654 6 1.05 6.02 PlyA - 199554 199549 6 1.05 6.01 Sngl - 203006 202800 207 2 0 67 37 183 0.935 6.04 6.00 Prom - 218387 218348 40 -6.55 7.04 PlyA - 219844 219839 6 1.05 7.03 Term - 222568 222365 204 1 0 72 37 237 0.999 13.39 7.02 Intr - 224050 223928 123 1 0 97 91 134 0.994 14.46 7.01 Init - 225325 225125 201 1 0 119 105 79 0.991 10.28 7.00 Prom - 225450 225411 40 -10.25 8.00 Prom + 225992 226031 40 -10.84 8.01 Init + 226431 226524 94 2 1 95 86 60 0.689 6.95 8.02 Term + 227042 227496 455 0 2 135 38 549 0.999 48.93 8.03 PlyA + 227674 227679 6 1.05 9.00 Prom + 229052 229091 40 -7.35 9.01 Init + 232574 232605 32 1 2 87 105 52 0.836 6.10 9.02 Intr + 237927 237964 38 0 2 91 93 31 0.858 0.89 9.03 Intr + 239254 239315 62 2 2 84 90 75 0.953 4.83 9.04 Term + 239561 239800 240 1 0 95 53 357 0.778 27.94 9.05 PlyA + 240453 240458 6 1.05 10.10 PlyA - 242992 242987 6 1.05 10.09 Term - 243527 243401 127 1 1 40 48 110 0.173 -1.03 10.08 Intr - 251248 251132 117 0 0 37 100 115 0.492 6.26 10.07 Intr - 251795 251431 365 2 2 45 45 166 0.261 1.06 10.06 Intr - 256628 256575 54 2 0 55 69 85 0.060 1.56 10.05 Intr - 258083 257970 114 2 0 86 47 53 0.040 0.72 10.04 Intr - 262142 262040 103 1 1 23 74 73 0.132 -1.34 10.03 Intr - 262651 262536 116 1 2 61 116 96 0.977 8.03 10.02 Intr - 263607 263331 277 2 1 36 26 221 0.583 7.30 10.01 Init - 265423 265311 113 1 2 83 0 83 0.544 -1.27 10.00 Prom - 268181 268142 40 -4.45 11.00 Prom + 268702 268741 40 -4.95 11.01 Init + 271825 272069 245 0 2 89 45 119 0.355 5.26 11.02 Intr + 272412 272509 98 0 2 34 89 72 0.544 0.63 11.03 Term + 272809 273263 455 2 2 63 47 257 0.885 13.43 11.04 PlyA + 273826 273831 6 1.05 12.00 Prom + 276358 276397 40 -2.25 12.01 Init + 278043 278093 51 2 0 50 94 -21 0.819 -4.09 12.02 Intr + 278150 278279 130 1 1 86 57 248 0.969 20.85 12.03 Intr + 282114 282284 171 1 0 107 81 136 0.646 13.79 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 46959 47397 439 2 1 68 59 248 0.962 16.22 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:111686400_111971482|GENSCAN_predicted_peptide_1|1314_aa XYLANHGRLNESEARRKFWQILSAVDYCHGRKIVHRDLKAENLLLDNNMNIKIADFGFGN FFKSGELLATWCGSPPYAAPEVFEGQQYEGPQLDIWSMGVVLYVLVCGALPFDGPTLPIL RQRVLEGRFRIPYFMSEVEDTKINTLVQSHMSQANNPEVQGDSCDFCNIVFSIDCEHLIR RMLVLDPSKRLTIAQIKEHKWMLIEVPVQRPVLYPQEQENEPSIGEFNEQVLRLMHSLGI DQQKTIESLQNKSYNHFAAIYFLLVERLKSHRSSFPVEQRLDGRQRRPSTIAEQTVAKAQ TVGLPVTMHSPNMRLLRSALLPQASNVEAFSFPASGCQAEAAFMEEECVDTPKVNGCLLD PVPPVLVRKGCQSLPSNMMETSIDEGLETEGEAEEDPAHAFEAFQSTRSGQRRHTLSEVT NQLVVMPGAGKIFSMNDSPSLDSVDSEYDMGSVQRDLNFLEDNPSLKDIMLANQPSPRMT SPFISLRPTNPAMQALSSQKREVHNRSPVSFREGRRASDTSLTQGIVAFRQHLQNLARTK GILELNKVQLLYEQIGPEADPNLAPAAPQLQDLASSCPQEEVSQQQESVSTLPASVHPQL SPRQSLETQYLQHRLQKPSLLSKAQNTCQLYCKEPPRSLEQQLQEHRLQQKRLFLQKQSQ LQAYFNQMQIAESSYPQPSQQLPLPRQETPPPSQQAPPFSLTQPLSPVLEPSSEQMQYSP FLSQYQEMQLQPLPSTSGPRAAPPLPTQLQQQQPPPPPPPPPPRQPGAAPAPLQFSYQTC ELPSAASPAPDYPTPCQYPVDGAQQSDLTGPDCPRSPGLQEAPSSYDPLALSELPGLFDC EMLDAVDPQHNGNRLHLHSEAGDTPLQKVPWVAECQNILRTPEVSHVELTGDPPPWRQGA RNSRTHQEHQPWAREDRLFLQFLVDTAGLRAVGLQGKKKSVALSSSDIKNSQVFKPYAGV SGWDRQEPKTEASPCLLCPFPTLKICSVKEIQVVRPFHHGWTLFSSAAFVSRRLQQGGEG EQQGGLHPKPRASAQWMEPDSSACTSQVLHQDAGQVGWDSSGSSSSLPKSEMPRVGLTDK SDPEETVHCHPRATHIVGGGTQWNGGKAHQALSSKHATHSSRSAQPCLSLLSGIRQNCLP VATEPKCLAVPLCQQSTVGLGKWFLSARGTLGSPLSFTGNSSFLSLCATTRDTDSGAVTV PDQGTSLPSPPVLPSQDALSCASRVGCPAAAYKYMGHWAEKAGAAQGRAEVPFPRGPGKR HPIGPFGVHLAVPHILVTDGLQSPNHRKDLAPFRTHISLREQLWRQKWLGFPTQ >gi568815587r:111686400_111971482|GENSCAN_predicted_CDS_1|3945_bp nactatcttgctaatcatggccggttaaatgagtctgaagccaggcgaaaattctggcaa atcctgtctgctgttgattattgtcatggtcggaagattgtgcaccgtgacctcaaagct gaaaatctcctgctggataacaacatgaatatcaaaatagcagatttcggttttggaaat ttctttaaaagtggtgaactgctggcaacatggtgtggcagccccccttatgcagcccca gaagtctttgaagggcagcagtatgaaggaccacagctggacatctggagtatgggagtt gttctttatgtccttgtctgtggagctctgccctttgatggaccgactcttccaattttg aggcagagggttctggaaggaagattccggattccgtatttcatgtcagaagtagaggat acaaagattaacacccttgtacaaagtcacatgagtcaagcaaataaccctgaagtgcaa ggtgattcttgtgacttttgtaacattgtgttttctatagattgcgagcaccttatccga aggatgttggtcctagacccatccaaacggctaaccatagcccaaatcaaggagcataaa tggatgctcatagaagttcctgtccagagacctgttctctatccacaagagcaagaaaat gagccatccatcggggagtttaatgagcaggttctgcgactgatgcacagccttggaata gatcagcagaaaaccattgagtctttgcagaacaagagctataaccactttgctgccatt tatttcttgttggtggagcgcctgaaatcacatcggagcagtttcccagtggagcagaga cttgatggccgccagcgtcggcctagcaccattgctgagcaaacagttgccaaggcacag actgtggggctcccagtgaccatgcattcaccgaacatgaggctgctgcgatctgccctc ctcccccaggcatccaacgtggaggccttttcatttccagcatctggctgtcaggcggaa gctgcattcatggaagaagagtgtgtggacactccaaaggtcaatggctgtctgcttgac cctgtgcctcctgtcctggtgcggaagggatgccagtcactgcccagcaacatgatggag acctccattgacgaagggctggagacagaaggagaggccgaggaagaccccgctcatgcc tttgaggcatttcagtccacacgcagcgggcagagacggcacactctgtcagaagtgacc aatcaactggtcgtgatgcctggggcagggaaaattttctccatgaatgacagcccctcc cttgacagtgtggactctgagtatgatatggggtctgttcagagggacctgaactttctg gaagacaacccttcccttaaggacatcatgttagccaatcagccttcaccccgcatgaca tctcccttcataagcctgagacctaccaacccagccatgcaggctctgagctcccagaaa cgagaggtccacaacaggtctccagtgagcttcagagagggccgcagagcatcagatacc tccctcacccagggaattgtagcatttagacaacatcttcagaatctggctagaaccaaa ggaattctagagttgaacaaagtgcagttgttgtatgaacaaataggaccggaggcagac cctaacctggcgccggcggctcctcagctccaggaccttgctagcagctgccctcaggaa gaagtttctcagcagcaggaaagcgtctccactctccctgccagcgtgcatccccagctg tccccacggcagagcctggagacccagtacctgcagcacagactccagaagcccagcctt ctgtcaaaggcccagaacacctgtcagctttattgcaaagaaccaccgcggagccttgag cagcagctgcaggaacataggctccagcagaagcgactctttcttcagaagcagtctcaa ctgcaggcctattttaatcagatgcagatagcagagagctcctacccacagccaagtcag cagctgccccttccccgccaggagactccaccgccttctcagcaggccccaccgttcagc ctgacccagcccctgagccccgtcctggagccttcctccgagcagatgcaatacagccct ttcctcagccagtaccaagagatgcagcttcagcccctgccctccacttccggtccccgg gctgctcctcctctgcccacgcagctacagcagcagcagccgccaccgccaccaccccct ccaccaccacgacagccaggagctgccccagcccccttacagttctcctatcagacttgt gagctgccaagcgctgcttcccctgcgccagactatcccactccctgtcagtatcctgtg gatggagcccagcagagcgacctaacggggccagactgtcccagaagcccaggactgcaa gaggccccctccagctacgacccactagccctctctgagctacctggactctttgattgt gaaatgctagacgctgtggatccacaacacaacggaaaccgccttcatctccattcggaa gcaggtgacacaccccttcagaaggtgccctgggttgccgagtgtcagaatatactcagg actccagaggtgtcacacgtggaactgacaggagacccgccaccgtggaggcagggggca agaaactcaagaacgcatcaagagcaccagccctgggccagggaagacaggctcttcctg cagtttctcgtggacactgctggcttgcgggcagtcggtctccagggaaagaaaaagtca gtggccctttcttcctcagatatcaagaactcccaagtgtttaaaccgtatgctggagtc agtggttgggacagacaggagcccaagactgaagccagcccttgcctcttgtgtcccttc ccaactctgaagatttgctcagtcaaggaaattcaagtggtgagacctttccaccatggg tggacactcttcagttctgcagcctttgtgagtcgaaggctccagcagggtggggaagga gagcagcagggaggcctgcaccccaaacccagggcctctgcccagtggatggaaccagac agcagtgcctgcacttcccaagttctccatcaagacgcaggacaggtcggctgggacagt tctggctccagctcctcactccccaaaagtgaaatgccccgagtggggctgactgacaaa tcagaccctgaggagactgttcattgtcaccccagggccacccacatagttgggggtggg acacaatggaatggaggaaaagcccaccaagccctttcctccaagcacgccacacatagc tcccggagcgcacagccttgcctgtctcttctgtctgggatccgccagaactgccttcca gtcgctacagagcccaaatgcttagcagtgcccctgtgccagcaaagcactgtgggtctt gggaagtggttcttgtcagcccgagggacactgggttctccactgtccttcacaggaaat tctagcttcctcagcctttgtgccaccactagagacacagacagtggcgctgtaactgtc cctgaccagggcacatccctcccaagcccacctgtgcttcccagccaggatgccctcagt tgtgccagcagagttgggtgtccagcagcagcctacaagtatatggggcactgggcagag aaagctggggcagcccagggtcgtgctgaggtgcctttccctcggggccctggaaagcgc caccccatcgggccttttggtgtccacctggctgttccccacatcctggtcactgatggt ctccagagccccaaccacaggaaagacctggctcctttcagaactcatatcagtttaaga gaacaactctggagacagaaatggcttggctttccgacgcaatga >gi568815587r:111686400_111971482|GENSCAN_predicted_peptide_2|540_aa MAGASELGTGPGAAGGDGDDSLYPIAVLIDELRNEDVQLRLNSIKKLSTIALALGVERTR SELLPFLTDTIYDEDEVLLALAEQLGNFTGLVGGPDFAHCLLPPLENLATVEETVVRDKA VESLRQISQEHTPVALEAYFVPLVKRLASGDWFTSRTSACGLFSVCYPRASNAVKAEIRQ QFRSLCSDDTPMVRRAAASKLGEFAKVLELDSVKSEIVPLFTSLASDEQDSVRLLAVEAC VSIAQLLSQDDLETLVMPTLRQAAEDKSWRVRYMVADRFSELQKAMGPKITLNDLIPAFQ NLLKDCEAEVRAAAAHKELVSDTNQHVKSALASVIMGLSTILGKENTIEHLLPLFLAQLK DECPDVRLNIISNLDCVNEVIGIRQLSQSLLPAIVELAEDAKWRVRLAIIEYMPLLAGQL GVEFFDEKLNSLCMAWLVDHVYAIREAATNNLMKLVQKFGTEWAQNTIVPKVLVMANDPN YLHRMTTLFCINALSEACGQEITTKQMLPIVLKMAGDQVANVRFNVAKSLQKIGPILDTK >gi568815587r:111686400_111971482|GENSCAN_predicted_CDS_2|1623_bp atggcgggcgcatcagagctcgggaccggcccaggagcagcgggtggagatggagatgat tcgctatacccgatcgcggttttaatcgacgagctccgcaatgaagacgtgcagctccga ctcaacagtattaagaagttatcaacaattgccctagcacttggagtagaaaggacccga agtgaattgttgccatttcttacagatacaatttatgatgaagatgaggtactattagct cttgctgagcagctgggaaatttcactggcctagtgggaggtcctgactttgcccactgt ctgctgcctcctttggaaaatctggcaactgtggaagagactgttgttcgtgacaaggct gtggagtccctgagacagatctcccaggagcatactcctgttgctctggaagcttatttt gtacctctggtgaaacgcttagcaagtggggattggttcacctctcgcacatctgcatgt ggtttgttcagcgtttgctatcccagggcatcaaatgctgttaaagcagaaatcagacag caattccgttccttgtgctcagatgacacaccaatggtacgacgtgctgctgcttccaaa ttgggtgaatttgcaaaagttttggaattagacagtgtgaaaagtgaaattgttccactg ttcactagtctagcttcagatgaacaggattcagtgcgcctccttgctgtggaagcttgt gtcagtattgcccagttattgtctcaggatgaccttgagactttggtgatgcctacactt cgacaagcagcagaagataaatcttggcgcgttcgctatatggtggctgacagattttca gagctccagaaagccatgggtcctaaaatcaccctaaatgacctcatccccgcctttcag aacctacttaaagactgtgaagctgaagtccgggcagctgctgcccacaaagaattagta tccgataccaatcaacatgtcaaatcggctctagcttctgtaattatgggattgtctact attttgggcaaagaaaataccattgaacatcttctacctcttttcttagctcagttaaag gatgagtgtcctgacgttcgtttgaatatcatctccaatttggattgtgtaaatgaagtg attggaatccgtcagctctctcagtctctccttcctgccatagtggagctggcagaagat gccaaatggagggtccgcctggccatcattgagtatatgccgctgctggcaggccagctg ggtgtggaattctttgatgaaaagctgaattctttatgtatggcttggctcgtggaccat gtatacgccatccgagaagctgccaccaacaacctcatgaaactagttcagaagtttggt acagagtgggcccaaaatactattgttcccaaagtgttagtaatggcaaatgatcctaat tacttgcatagaatgaccactttattctgcattaatgcactgtctgaggcctgtggtcag gaaataactactaagcaaatgctgcccatcgtattaaaaatggcaggagaccaagtagca aatgttcgcttcaatgtggccaaatctctacaaaagattggaccaattctagataccaag taa >gi568815587r:111686400_111971482|GENSCAN_predicted_peptide_3|429_aa MASRGARQRLKGSGASSGDTAPAADKLRELLGSREAGGAEHRTELSGNKAGQVWAPEGST AFKCLLSARLCAALLSNISDCDETFNYWEPTHYLIYGEGFQTWEYSPAYAIRSYAYLLLH AWPAAFHARILQTNKGCVQEVWVARESNDASLLGSQHWHVLLIISVWYLNFLPVPQVPVV VIDSYYYGKLVIAPLNIVLYNVFTPHGPDLYGTEPWYFYLINGFLNFNVAFALALLVLPL TSLMEYLLQRFHGYHGPLDLYPEFYRIATDPTIHTVPEGRPVNVCVGKEWYRFPSSFLLP DNWQLQFIPSEFRGQLPKPFAEGPLATRIVPTDMNDQNLEEPSRYIDISKCHYLVDLDTM RETPREPKYSSNKEEWISLAYRPFLDASRSSKLLRAFYVPFLSDQYTVYVNYTILKPRKA KQIRKKSGG >gi568815587r:111686400_111971482|GENSCAN_predicted_CDS_3|1290_bp atggctagtcgaggggctcggcagcgcctgaagggcagcggggccagcagtggggatacg gccccggctgcggacaagctgcgggagctgctgggcagccgagaggcgggcggcgcggag caccggaccgagttatctgggaacaaagcaggacaagtctgggcacctgaaggatctact gctttcaagtgtctgctttcagcaaggttatgtgctgctctcctgagcaacatctctgac tgtgatgaaacattcaactactgggagccaacacactacctcatctatggggaagggttt cagacttgggaatattccccagcatatgccattcgctcctatgcttacctgttgcttcat gcctggccagctgcatttcatgcaagaattctacaaactaataagggctgtgtgcaagaa gtttgggttgcacgtgagtcgaatgatgctagccttcttggttctcagcactggcatgtt ttgctcatcatcagtgtgtggtaccttaatttcttacctgtaccacaggtgcctgtggtg gtcattgacagctactattatgggaagttggtgattgcaccactcaacattgttttgtat aatgtctttactcctcatggacctgatctttatggtacagaaccctggtatttctattta attaatggatttctgaatttcaatgtagcctttgctttggctctcctagtcctaccactg acttctcttatggaatacctgctgcagagatttcatggatatcacgggccccttgatttg tatccagaattttaccgaattgctacagacccaaccatccacactgtcccagaaggcaga cctgtgaatgtctgtgtgggaaaagagtggtatcgatttcccagcagcttccttcttcct gacaattggcagcttcagttcattccatcagagttcagaggtcagttaccaaaacctttt gcagaaggacctctggccacccggattgttcctactgacatgaatgaccagaatctagaa gagccatccagatatattgatatcagtaaatgccattatttagtggatttggacaccatg agagaaacaccccgggagccaaaatattcatccaataaagaagaatggatcagcttggcc tatagaccattccttgatgcttctagatcttcaaagctgctgcgggcattctatgtcccc ttcctgtcagatcagtatacagtgtacgtaaactacaccatcctcaaaccccggaaagca aagcaaatcaggaagaaaagtggaggttag >gi568815587r:111686400_111971482|GENSCAN_predicted_peptide_4|624_aa MAPRRLLLVGEGNFSFAAALSETLDQSTQLTATCLQRPAELARDPLAWENLQCLRERGID VRFGVDCTQLADVFELHEREFDQIYFIFPHCGRKAGVAKNRELLAKFFQSCADVLAEEGE VHVALCRGQGGTPADKPQREWHNSWQVVAMAALGGLILSDVYPFSCKAVAGYKCTGYRSQ DKSFHVEGALNHIFTRSLPFEGSQPRIFRIKLGNQWFSFPEPEALVGKLNRGFLEAPSCH PIKTINEKLIAELGKVFPLKRLKCSYPLLPQEGTSVLPFWNCDFLSAAFWISLHEDNSNS ESLTGGTSQDVEDFLVSFSELSLLKNPGRDGKEEACEGTCGQAKICLRPSLLVHVQDVIE VPDFLSGSLHILSGPVFQKCHILPFTMPAFHETLFILGVNQNLKDGCLQSLLDHLKGILD SLLTQTLPESSKLSSLVKFVLQSNGKDYMIRVKTHNFSPDCTEDLIIGSVITSATSVIHK DQCFVFVSMNLDLLAMLVWCISDWRMLWTFDNRFLKNFVPGKIEPFKSHSLYPPCYVHDV SFWIDQKKGFDELEFHTVARAVSQDTIISIQFLSRFQHPKTQQVSLCYRLTYQTCDKALT QQQVASMQSQFRKEIQQHLYVIPR >gi568815587r:111686400_111971482|GENSCAN_predicted_CDS_4|1875_bp atggcccctcggcgcctcctgttggttggggaggggaatttctccttcgccgccgctctg agcgaaaccctggatcagagcactcaacttaccgccacctgcctccagcgcccggccgag ttggctcgggatccactggcctgggagaatctgcagtgcctgcgcgagcgaggtatcgat gtacgtttcggtgtggactgcacccagctggcagatgtctttgaactgcacgagagagaa tttgatcaaatttatttcatcttcccgcattgtggacgcaaagctggcgtagctaagaac agggaactgcttgccaaatttttccaaagctgtgcagacgttcttgcagaggaaggagaa gtccacgtggcattatgtagaggacaaggtggaactcctgcggataagccccagagagaa tggcacaacagttggcaagtggttgccatggcagccctgggggggctcattttaagcgac gtgtatccattcagctgtaaggctgtggcagggtacaagtgcactggatataggagtcaa gataagtcctttcatgtagaaggtgctttgaaccatatcttcaccaggagcttacctttt gaaggttctcaacccagaatcttcaggatcaaactgggtaaccagtggttttcctttcca gaaccagaagcacttgtaggaaagttgaacaggggtttcctggaagcaccttcatgtcat cctatcaaaaccataaatgagaaactcattgctgaattaggcaaagttttcccgctaaaa aggctgaagtgttcctaccctttgctgccacaggaaggtaccagtgttcttcctttctgg aattgtgactttctgtcagctgctttttggattagtctccatgaagataactcaaattct gagtccctgactggtgggacatcacaagatgtggaagactttctagtgtcattttcagaa cttagccttctcaagaatcctggaagagatggtaaagaagaagcttgtgaaggaacctgt ggccaggccaagatctgccttagaccttctctcctagtgcatgttcaggatgtcatcgaa gtaccagacttcctctcaggttctctgcacatcctcagtggacctgtctttcagaagtgc cacattttgcctttcacaatgccagcatttcatgaaactttatttatccttggggttaat caaaatctgaaggatggctgtcttcaatcactgctggatcatctgaagggcattctagat agcctgctgacccagacattgccggagagctctaagctgagcagtttagtcaaatttgtc cttcagtcaaatggaaaggattatatgattcgtgtgaagactcataattttagcccagat tgtactgaggatctaattattgggtctgttatcacatctgccactagtgttatacataaa gaccagtgttttgtgtttgtgtctatgaacttggacttattagccatgcttgtctggtgt atctctgactggagaatgttgtggacgtttgataaccgtttcctgaaaaattttgtccct ggcaaaatagaaccctttaaaagtcattctctgtatcctccatgttatgtgcatgatgtt agtttttggatagatcagaagaaaggatttgatgaactagagtttcacactgtggcccga gcagtgtctcaggacactattatatccatacagtttcttagccgtttccagcatccaaag actcaacaggtcagtctctgctatagattgacctaccagacctgtgacaaggccctcacc cagcagcaagtagcatcaatgcagtcccagtttaggaaggagattcaacaacacctatat gttatacctcggtag >gi568815587r:111686400_111971482|GENSCAN_predicted_peptide_5|137_aa MEQPSKDFCDSHKKSSKSSTYSASSYSSSGKRQNLACFLTNPHCGSLVNADGHGEVWTDW NNMSKFFQYGWRCTTNENTYSNRTLMGNWNQERYDLRNIVQPKPLPSQDLSESLTGSQDI NLNWILPDTNAQKSQLT >gi568815587r:111686400_111971482|GENSCAN_predicted_CDS_5|414_bp atggaacagccaagtaaggatttctgtgactctcataagaaatcttcaaagagctcaacc tactcagcatcaagttactcttcttcaggaaagagacagaacctcgcctgtttcctcaca aacccacactgtggcagccttgttaatgcagatggccatggtgaagtgtggacagattgg aataatatgtccaagtttttccagtatggatggcgatgcaccactaatgagaatacctat tcaaaccgtaccctgatgggcaactggaaccaggaaagatatgacctgaggaatatcgtg cagcccaaacccttgccttcccaggatttaagcgagagcctcactggttcccaggacatc aacctgaactggatcctccccgatacaaatgcacagaaaagtcaacttacatga >gi568815587r:111686400_111971482|GENSCAN_predicted_peptide_6|68_aa MVKEIEISQHTKYTCSFCGKTKMKRRAVKIRHCNSCMKTVAGSAWTYNTTSAVMVKSAIR RLKELKDQ >gi568815587r:111686400_111971482|GENSCAN_predicted_CDS_6|207_bp atggtgaaggaaattgaaatcagccagcacaccaagtacacttgctctttctgtggcaaa accaagatgaagagacgagctgtgaaaatcaggcactgtaattcctgcatgaagacagtg gctggcagtgcctggacctacaacaccacttcggctgtcatggtgaagtctgccatcaga agactgaaggaattgaaagaccagtag >gi568815587r:111686400_111971482|GENSCAN_predicted_peptide_7|175_aa MDIAIHHPWIRRPFFPFHSPSRLFDQFFGEHLLESDLFPTSTSLSPFYLRPPSFLRAPSW FDTGLSEMRLEKDRFSVNLDVKHFSPEELKVKVLGDVIEVHGKHEERQDEHGFISREFHR KYRIPADVDPLTITSSLSSDGVLTVNGPRKQVSGPERTIPITREEKPAVTAAPKK >gi568815587r:111686400_111971482|GENSCAN_predicted_CDS_7|528_bp atggacatcgccatccaccacccctggatccgccgccccttctttcctttccactccccc agccgcctctttgaccagttcttcggagagcacctgttggagtctgatcttttcccgacg tctacttccctgagtcccttctaccttcggccaccctccttcctgcgggcacccagctgg tttgacactggactctcagagatgcgcctggagaaggacaggttctctgtcaacctggat gtgaagcacttctccccagaggaactcaaagttaaggtgttgggagatgtgattgaggtg catggaaaacatgaagagcgccaggatgaacatggtttcatctccagggagttccacagg aaataccggatcccagctgatgtagaccctctcaccattacttcatccctgtcatctgat ggggtcctcactgtgaatggaccaaggaaacaggtctctggccctgagcgcaccattccc atcacccgtgaagagaagcctgctgtcaccgcagcccccaagaaatag >gi568815587r:111686400_111971482|GENSCAN_predicted_peptide_8|182_aa MSGRSVPHAHPATAEYEFANPSRLGEQRFGEGLLPEEILTPTLYHGYYVRPRAAPAGEGS RAGASELRLSEGKFQAFLDVSHFTPDEVTVRTVDNLLEVSARHPQRLDRHGFVSREFCRT YVLPADVDPWRVRAALSHDGILNLEAPRGGRHLDTEVNEVYISLLPAPPDPEEEEEAAIV EP >gi568815587r:111686400_111971482|GENSCAN_predicted_CDS_8|549_bp atgtcgggccgctcagtgccacatgcccacccggccaccgccgagtacgaatttgccaac ccgagccgcctgggtgagcagcgcttcggagaaggcctcctgccagaagagatcctgacc cccacactctaccatggctactatgtccggcctcgggccgccccagctggggagggcagc agggcaggggcctccgagcttaggctcagtgagggcaagttccaggcatttctggatgtg agccactttaccccagacgaggtgactgtgaggactgtggataacctgctggaggtgtct gcccggcacccccagcgcctggaccgccacggcttcgtgtcccgagagttctgccgcacc tatgtcctgcctgctgatgtcgacccctggcgagtccgagctgctctctcccatgatggc atcttaaacctggaagcacctcggggtggccgacatttggacacagaggtcaatgaggtc tacatctccctgctccctgcgcctcctgatccagaggaagaggaggaggcagccatagtt gagccctga >gi568815587r:111686400_111971482|GENSCAN_predicted_peptide_9|123_aa MGNRVCCGGSWSCPSTFQKKKKTGSQTRRTLKPQPQQLQQNLPKGHETTGHTYERVLQQQ GSQERSPGLMSEDSNLHYADIQVCSRPHAREVKHVHLENATEYATLRFPQATPRYDSKNG TLV >gi568815587r:111686400_111971482|GENSCAN_predicted_CDS_9|372_bp atgggaaaccgggtctgctgcggaggaagctggagctgcccatcaactttccagaagaaa aagaaaacaggaagccaaacaagacggacactgaagccgcagccacaacagctgcagcag aatctcccaaagggccatgaaacaacaggacatacgtatgaacgggtgttacagcagcaa gggtctcaagagaggagtccaggcctcatgtcggaagacagcaacttacattatgctgac attcaagtgtgcagccgtccccatgcccgggaagtgaaacacgtgcatttagaaaacgct acagagtatgcgacccttcgcttcccccaggccacacctcgctatgacagcaagaacggg accctggtgtga >gi568815587r:111686400_111971482|GENSCAN_predicted_peptide_10|461_aa MAEGKEEQVTFYMDGSRQKERQSLCRETLPHKPSDIVRHCAKVSAYRHINNGHKVIAATR EKRLKKKAIYAEKEQQDGRLEESGKGQQGRRHLAGSLRMSANCREDKGRGDGRRKGQKTH TILEEQPVEGMWKPRFRGICPQLQGCSVTEQGFEFSDDAKSWALQHPKRPAQQDWAMSRG NQGWANLGLKIIRGEAVTTRAKEVCKPSLSSLSPPRLKLTEFLQGTYLLELRKHQWGFTR EHIQAVCATRHLLIGTTAKSEAPVPLLSHGPVHALAPWTQDPGPLSLSSSSQTTMSLIAE KSTAPHSERDSDMKRALHMDHLQSPALSNFLPMQYHSTLPWLRWLAPDIDEHFCQISRDS LRISLEREALGRMETPSSGRRMEEEEDPFRRLGGRRWGRGAEEWEEQDSYLIEALLQDVH TPKTYRRLCWQQGVIESHNHLGPHLQESLEETNQGHLKQEE >gi568815587r:111686400_111971482|GENSCAN_predicted_CDS_10|1386_bp atggcagaaggcaaggaggagcaagtcacattttacatggatggcagcagacaaaaagag agacagagtttgtgcagggaaactctgcctcataagccatcagatatcgtgaggcattgt gccaaggtcagtgcttacaggcacataaataacggacacaaagtgattgctgctacaaga gagaaacgtttaaaaaaaaaagctatttatgcagagaaggagcaacaggatggaaggctg gaggaatcaggaaaaggtcaacagggaaggagacatctggctggttctttacggatgtct gccaattgcagagaggacaagggcagaggtgatggaaggaggaagggtcagaagacacat actatcttagaggaacagccagttgaggggatgtggaaaccaaggttcagaggaatatgc ccacagttgcagggctgctcagtgacagaacaagggtttgaattcagtgatgacgctaaa tcctgggctctacagcatcccaaaagacctgctcagcaggactgggccatgagcagggga aaccaaggatgggctaatctaggactgaaaatcatcagaggagaggcagtgaccaccagg gcaaaggaagtttgtaagccgagcctcagttctctatctcctccaagactcaagctcaca gaattcctgcaggggacttatttgctagagttgcgaaagcaccaatggggcttcaccagg gaacatattcaagctgtctgtgctacccgtcatcttcttattggaacgactgccaagagt gaggccccagttcctctgctctcacatggaccggtgcacgcgctggctccctggacccag gacccaggtcccctctcgctcagtagctcctctcagacaaccatgtccttgattgcagag aaatctactgccccgcactcagaaagagattcagacatgaagagggcacttcacatggac catctacagtcccctgccctctcaaactttctaccaatgcagtaccactccaccctccct tggctccgctggttagctcctgatatagatgagcacttttgccaaatatccagggattcc ctgaggatttccttggagagagaagctctagggaggatggagaccccctcttctggaagg aggatggaggaggaggaggaccccttccggagactgggcgggagacgctgggggagggga gctgaggagtgggaggagcaggacagttacctcattgaagccctcctgcaggacgtccat acaccaaaaacttacagaaggctctgttggcagcaaggagttattgaatctcataaccac ttgggtccccatcttcaagagtccctagaagagacgaatcagggccatctaaaacaggaa gaataa >gi568815587r:111686400_111971482|GENSCAN_predicted_peptide_11|265_aa MAVDQASLCSWGPGAGRSPNPSGIAAAAAAQATAVDPGIAALSGARLGNACSCCLASPDS LHLLRSWSKVEAKPGHCHNLAGPSKPQTQPDLENDGTTCLRRGATHSRVSSLLNGNKKER TEERRAAERRAPFGEPRPRSSPSQGCDTLFWALWFLESPSFQAPPHSLVPAVEAACSMPG PAAASQGASTCASPWSCPPHHSQCFWLCTVAGPHVHLITHPLPLHLPLAGTGSKPAVRAC QTEWAQWAQAKLRQRHHWPGFQLAK >gi568815587r:111686400_111971482|GENSCAN_predicted_CDS_11|798_bp atggctgtggaccaggcatccctgtgctcttgggggccaggagcaggcaggagccccaac ccctcaggtatagctgcagctgcagctgcccaagccacagctgtggacccaggcatcgct gcactctcaggggctaggcttggaaatgcttgctcctgctgcctggcctctcccgactcc ctgcacctgctccgatcttggagcaaagttgaggccaagcccgggcactgccacaacctg gctggcccatcaaaaccccagactcaaccagacttggaaaatgatgggacgacctgcctg cggagaggagctacccactccagggtctcctccctgctgaatggcaacaagaaggagaga acagaggagagaagagctgcagagagaagagctccctttggggagcccagacctaggagc tccccaagccagggctgtgacaccctcttttgggctctgtggttcttggagtctccaagc ttccaggcgccaccacattccctggtgccagctgtggaagctgcttgcagtatgcctggt ccagctgcagcatcgcagggagccagcacctgtgccagtccctggagctgcccaccccac cacagtcagtgtttctggctgtgcacagtggctggaccccatgttcacttgatcacacac cccttgccactccacttgcccttagcaggcacgggatccaagccggcagtgcgagcttgc cagactgagtgggcccagtgggcccaagcaaaactcaggcaaaggcaccactggccaggt ttccagctagcaaagtga >gi568815587r:111686400_111971482|GENSCAN_predicted_peptide_12|118_aa MGCLLYPSHKRFLRGYEQQLQAYVAWVNAQLKKRPAVKPVQDLRQDLRDGVILAYLIEIV AGEKLSGVQLSPGNQQEMKNNVEKVLQFVASKKIRMHQTSAKGQCLITSLVHEYTLTX >gi568815587r:111686400_111971482|GENSCAN_predicted_CDS_12|354_bp atgggttgtttattataccccagtcacaaacgcttcctcagagggtatgagcaacagctg caggcctatgtggcctgggtgaatgcacagctgaagaagaggccagcagtgaagcctgtg caggacctgcgacaagatctccgggatggggtgatcctggcatatctcatcgagattgtt gcaggagaaaagctgagtggggtacagctgagtcccggtaaccaacaggagatgaagaat aatgtggagaaagtgctacagtttgtggcctctaaaaagattcgtatgcaccagacttcg gctaaaggtcagtgcctcatcacatccttggtgcatgagtatactttaaccann