GENSCAN 1.0 Date run: 3-Nov-116 Time: 16:08:01 Sequence gi568815595r:119726791_120193434 : 466644 bp : 39.42% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3430 3587 158 1 2 22 34 217 0.209 8.43 1.02 Intr + 5504 5686 183 0 0 18 121 122 0.249 7.34 1.03 Intr + 6574 6716 143 2 2 95 70 73 0.975 5.35 1.04 Intr + 12465 12536 72 2 0 95 66 72 0.949 4.38 1.05 Intr + 13759 13905 147 0 0 36 0 233 0.900 8.81 1.06 Intr + 17185 17424 240 0 0 61 57 329 0.967 23.92 1.07 Intr + 20325 20473 149 2 2 84 82 127 0.951 9.81 1.08 Intr + 21021 21112 92 0 2 50 121 42 0.953 2.42 1.09 Term + 24147 24307 161 1 2 36 45 178 0.991 5.42 1.10 PlyA + 26102 26107 6 1.05 2.00 Prom + 30269 30308 40 -5.85 2.01 Init + 36430 36526 97 0 1 61 106 40 0.176 3.62 2.02 Intr + 41065 41175 111 2 0 65 69 71 0.028 2.33 2.03 Intr + 42022 42189 168 2 0 47 17 152 0.005 3.00 2.04 Intr + 53763 53837 75 1 0 71 44 108 0.047 3.27 2.05 Intr + 53914 54083 170 2 2 94 47 79 0.031 3.14 2.06 Intr + 65216 65476 261 1 0 27 99 313 0.242 23.06 2.07 Intr + 65557 65759 203 0 2 73 53 173 0.042 9.46 2.08 Intr + 80439 80657 219 0 0 90 80 234 0.989 19.30 2.09 Intr + 82852 83052 201 1 0 47 13 191 0.687 4.98 2.10 Intr + 83271 83404 134 0 2 101 44 146 0.999 10.87 2.11 Intr + 84749 84936 188 0 2 74 57 239 0.999 17.89 2.12 Intr + 85896 86170 275 2 2 78 91 212 0.811 15.91 2.13 Intr + 88189 88331 143 1 2 73 49 182 0.158 11.88 2.14 Intr + 88533 88649 117 1 0 105 94 151 0.987 16.92 2.15 Intr + 88936 89041 106 2 1 111 71 107 0.996 9.65 2.16 Term + 89204 89357 154 2 1 45 48 146 0.115 2.71 2.17 PlyA + 89387 89392 6 -3.94 3.09 PlyA - 89470 89465 6 1.05 3.08 Term - 90443 90192 252 2 0 6 33 301 0.032 11.05 3.07 Intr - 90967 90824 144 1 0 84 42 130 0.055 7.56 3.06 Intr - 98200 98107 94 0 1 118 21 76 0.078 2.95 3.05 Intr - 100065 99932 134 0 2 100 7 169 0.399 8.52 3.04 Intr - 103455 103346 110 1 2 26 63 21 0.007 -7.52 3.03 Intr - 116563 116465 99 2 0 83 110 148 0.717 15.66 3.02 Intr - 126880 126715 166 1 1 -41 86 161 0.071 1.61 3.01 Init - 128376 127708 669 0 0 69 41 320 0.074 20.64 3.00 Prom - 128842 128803 40 -8.55 4.10 PlyA - 129243 129238 6 1.05 4.09 Term - 130425 130352 74 1 2 69 47 108 0.745 1.89 4.08 Intr - 136815 136629 187 0 1 116 88 204 0.989 21.54 4.07 Intr - 146886 146782 105 0 0 57 98 30 0.486 0.39 4.06 Intr - 149718 149623 96 0 0 59 113 43 0.530 3.19 4.05 Intr - 179062 178965 98 2 2 115 100 50 0.616 7.71 4.04 Intr - 186020 185914 107 1 2 72 97 77 0.969 5.94 4.03 Intr - 189384 189254 131 0 2 43 121 119 0.718 9.27 4.02 Intr - 196693 196583 111 1 0 41 95 92 0.966 4.86 4.01 Init - 197531 197472 60 2 0 78 116 19 0.680 5.00 4.00 Prom - 200638 200599 40 -4.45 5.00 Prom + 214848 214887 40 -4.35 5.01 Init + 237035 237268 234 1 0 36 52 144 0.115 3.79 5.02 Intr + 251921 251984 64 1 1 85 99 36 0.002 1.77 5.03 Intr + 257762 257967 206 0 2 46 72 142 0.165 6.40 5.04 Term + 258284 259078 795 1 0 17 38 281 0.230 8.29 5.05 PlyA + 259310 259315 6 1.05 6.00 Prom + 260071 260110 40 -3.65 6.01 Init + 260188 260320 133 0 1 64 75 75 0.507 4.15 6.02 Intr + 277749 277888 140 1 2 23 77 84 0.013 0.06 6.03 Intr + 278087 278244 158 1 2 80 72 121 0.008 7.59 6.04 Intr + 294183 294439 257 0 2 -12 49 252 0.467 7.46 6.05 Term + 294759 294928 170 1 2 0 47 194 0.957 3.56 6.06 PlyA + 295478 295483 6 1.05 7.00 Prom + 304166 304205 40 -4.35 7.01 Init + 313245 313303 59 2 2 86 61 70 0.434 4.93 7.02 Intr + 314322 314494 173 0 2 72 80 72 0.715 3.46 7.03 Intr + 316984 317517 534 2 0 84 53 316 0.060 19.66 7.04 Intr + 317758 317852 95 2 2 0 106 87 0.019 0.46 7.05 Intr + 321282 321358 77 2 2 48 93 47 0.030 -1.41 7.06 Intr + 322558 322702 145 1 1 7 111 117 0.433 5.26 7.07 Term + 338649 338780 132 2 0 18 41 141 0.307 -0.49 7.08 PlyA + 338934 338939 6 1.05 8.00 Prom + 339692 339731 40 -4.25 8.01 Init + 356455 356638 184 0 1 38 75 180 0.484 11.03 8.02 Term + 366221 366342 122 0 2 74 36 100 0.175 1.06 8.03 PlyA + 366539 366544 6 1.05 9.02 PlyA - 367034 367029 6 -0.45 9.01 Sngl - 368853 367537 1317 0 0 56 48 719 0.985 58.51 9.00 Prom - 384077 384038 40 -3.95 10.21 PlyA - 384456 384451 6 1.05 10.20 Term - 406071 405982 90 0 0 71 55 100 0.173 1.74 10.19 Intr - 409818 409751 68 1 2 75 108 22 0.049 0.61 10.18 Intr - 414603 414352 252 1 0 39 22 187 0.050 3.68 10.17 Intr - 428562 428432 131 2 2 85 87 29 0.380 1.92 10.16 Intr - 430457 430319 139 0 1 60 108 18 0.189 -0.40 10.15 Intr - 431152 430949 204 2 0 35 97 173 0.265 11.15 10.14 Intr - 435316 435099 218 0 2 1 78 214 0.284 8.92 10.13 Intr - 435797 435671 127 0 1 -3 66 82 0.146 -4.28 10.12 Intr - 438918 438747 172 0 1 52 45 127 0.198 3.39 10.11 Intr - 439576 439433 144 1 0 90 39 84 0.443 3.26 10.10 Intr - 441594 440248 1347 0 0 96 50 1110 0.177 95.57 10.09 Intr - 444801 444630 172 2 1 106 38 130 0.234 8.82 10.08 Intr - 446680 446151 530 1 2 111 79 160 0.942 8.21 10.07 Intr - 454571 454335 237 2 0 110 80 107 0.252 8.99 10.06 Intr - 456713 456651 63 2 0 81 65 53 0.147 0.30 10.05 Intr - 458604 458509 96 0 0 124 81 49 0.977 7.09 10.04 Intr - 460034 459868 167 0 2 40 89 158 0.933 9.86 10.03 Intr - 465908 465773 136 2 1 79 108 21 0.870 2.42 10.02 Intr - 466259 466149 111 2 0 109 89 53 0.913 7.16 10.01 Init - 466624 466505 120 1 0 69 96 35 0.759 2.67 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 65557 65763 207 0 0 73 55 183 0.918 9.86 S.002 Intr + 90278 90414 137 2 2 58 70 201 0.849 14.59 S.003 Term + 90866 91086 221 0 2 44 38 134 0.835 0.22 S.004 Term - 252169 252051 119 2 2 41 55 134 0.896 3.12 S.005 Init - 252732 252654 79 0 1 70 91 81 0.909 7.88 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:119726791_120193434|GENSCAN_predicted_peptide_1|448_aa XLQEIRLEVLKELLRKREENQNEVNMKHLNARWSKLQEGKEAKMAKIQRTHVSTIRKLVG KRKNIEGKLERRNIIKDYSDYASQVYGPLSRLGCFPDNNSEDFVVKNYYLNTYEGLVELE SCLPDFVTQPQIRAPKPKVITTKAGFLKRAARLDYELAEVHKEEEEMEMAVIYLQKLLRG RVVQNMMFEGKEKRLELIQELRTCHALQEDEKLVKKAEKQVTLALQRQRNLHEHKVSLVE NHLAGLEGRALADMFDFLSKELVRLQEERRIHAFVMLAERQRRVREAEESGRRQVEKQRL REEDEIFKEASRGSWVVKVHHSTISSYLEDIILNTEANTAEEQARAEIEKMAEKINDIAY EMESRRTYLQSEEIVAELVYSFLIPEVQKYFVKEKVRNAQRKHILAAHQIIHSYTESMVQ KKLTEGEQDEASNAAMLLEKETQNENNS >gi568815595r:119726791_120193434|GENSCAN_predicted_CDS_1|1347_bp nnactgcaggagattcgcctggaagttctaaaagagctgttgaggaagcgtgaagagaat cagaatgaagtgaatatgaagcacttgaatgcccggtggtctaaactgcaggagggaaaa gaggcaaaaatggcaaaaattcagcgcacgcatgtatcaacaatcagaaaacttgtagga aagagaaagaatatagaagggaagttggagagaagaaatatcatcaaggattattctgat tatgcatcacaggtctatggacctctgtctcgtcttgggtgtttcccagacaacaactca gaggactttgtagtaaaaaactactatctcaacacctatgaaggattagtggaacttgag tcatgtctcccagattttgtgacacaaccccaaatcagagctccaaaacctaaagtcatt accaccaaagctggttttctgaagagggcagcaaggttggactatgagttggcagaggtt cataaggaagaagaagaaatggaaatggctgtgatctaccttcaaaagttactccggggc agagtcgttcagaacatgatgtttgaagggaaagaaaagcgactggagttgatccaggag ttgcgcacctgccacgcactacaagaagatgaaaagctggtgaaaaaagccgagaagcaa gtgaccctggccttacagcggcagaggaacttgcatgagcacaaggtgtcactggttgaa aaccatttggccggactggaaggaagggcactagcagacatgtttgacttcctgtccaaa gagctggtgagactgcaggaggagaggaggatccatgcctttgtcatgctggctgagcgc cagcggcgggtacgagaggctgaagagagtggtcggcgccaggtggaaaaacagcgcctg cgggaggaggacgagatatttaaggaggcaagtaggggcagctgggtggttaaagttcac catagtactataagctcctacctagaagacataatactgaataccgaagcgaatactgca gaagaacaagccagggcagaaatagagaagatggctgagaaaatcaatgacattgcttat gaaatggaaagccgccgaacctatcttcagtcagaggagattgttgctgagttggtttat agttttctgatcccagaggtgcaaaaatactttgtcaaagaaaaagtgaggaacgcacag cggaaacatattcttgcagcccatcagatcatccacagttacacggaaagcatggttcaa aagaaattaactgagggagagcaagatgaggcctcaaatgctgccatgttacttgagaaa gaaactcaaaatgagaacaacagctaa >gi568815595r:119726791_120193434|GENSCAN_predicted_peptide_2|873_aa MPNITNRSRNANKSHNVISPHTVRMAIIKKTKSRSSGRVFSSQQKGGNGVGGSSLQAGHQ DEYSALSREAPTSSMEGAALAAPPCCSQCDGSGRPSDVATAISSMEVKRAGSGPRLSGLK SQSKFRDLQAPDIAPCCPPTPEHTVQATSTGPPCLRPKSTFCPVGFVIQYLLALAESHVD NQKCDWKKDRALWLWVPVPTAGSKSPAKPIIFDHRSQPHNVPVITGSKDLQNVNIIPCIL FGPVTSQLPRIFTRIGEDYDERVLPSITTEILKSVVARFDAGELITQRELVSRQEGVHRS SGSQTGGSAGCREGQNSLATAGDGLMELCKLEAAEDITYQLSRSWNITNLPAGQSVLLQL PHPRGPEANLEVRPKESWNHADFVHCEDTESVPGKPSVNADEEVGGPQICRVCGDKATGY HFNVMTCEGCKGFFRRGNRGRGPSPWKLSGNKAVKEKGTGVSGRREGAEDGPGEEVSLKV VLGLPAPAKSTCPISCRNGQKRRAMKRNARLRCPFRKGACEITRKTRRQCQACRLRKCLE SGMKKEMIMSDEAVEERRALIKRKKSERTGTQPLGVQGLTEEQRMMIRELMDAQMKTFDT TFSHFKNFRLPGVLSSGCELPESLQAPSREEAAKWSQVRKDLCSLKVSLQLRGEDGSVWN YKPPADSGGKEIFSLLPHMADMSTYMFKGIISFAKVISYFRDLPIEDQISLLKGAAFELC QLRFNTVFNAETGTWECGRLSYCLEDTAGGFQQLLLEPMLKFHYMLKKLQLHEEEYVLMQ AISLFSPDRPGVLQHRVVDQLQEQFAITLKSYIECNRPQPAHRKNTSKQGDVAQKGASIP SLVLSSSCDLLPRLMPTDPETNLDFTEPKAMKG >gi568815595r:119726791_120193434|GENSCAN_predicted_CDS_2|2622_bp atgcccaacattactaatcgttcgagaaatgcaaacaaaagccacaatgtgatatcacct catactgttagaatggctattatcaaaaagacaaaaagcaggtcatctggtcgagtgttc agctctcagcaaaaaggaggcaatggagtgggtggctcctctctgcaggcaggtcatcag gatgagtattcagctctcagcagagaagctcccaccagctccatggagggtgcagccttg gctgcacctccctgctgcagccagtgtgatggcagtggcagaccgtctgatgtggccact gccatcagtagcatggaagttaagagagcaggctctggacccagactgtccggattgaaa tcacagtccaagttcagggacctgcaggccccagatatagccccatgctgtcctcctacc ccagagcacactgttcaggctacttccactggtcctccctgcctaaggcccaagtcaact ttctgtccagtgggatttgtaatccaatacctcctagccctagcagaatcccatgtggat aatcagaaatgtgactggaaaaaggacagagctctatggctgtgggtcccagtccccact gctggcagtaagtccccagcaaaaccaattatctttgaccaccgttctcaaccacataat gtgccagtcatcactggtagcaaagatttacagaatgtcaatatcattccgtgcatcctc tttgggcctgtcactagccagcttcctcgcatcttcaccaggatcggagaagactatgat gagcgtgtgctgccatccatcactactgagatcctcaagtcagtggtggctcgctttgat gctggagaactaatcacccagagagaactggtctccaggcaggaaggagttcacagaagc agtggaagccaaacaggtggctcagcaggatgcagagagggccagaactcactggccact gcaggggacggcctgatggagctgtgcaagctggaagctgcagaggacatcacgtaccag ctctctcgctcttggaacatcaccaatctgccggcagggcagtccgtgctcctccagctg ccccatccaagaggcccagaagcaaacctggaggtgagacccaaagaaagctggaaccat gctgactttgtacactgtgaggacacagagtctgttcctggaaagcccagtgtcaacgca gatgaggaagtcggaggtccccaaatctgccgtgtatgtggggacaaggccactggctat cacttcaatgtcatgacatgtgaaggatgcaagggctttttcaggaggggaaatcgagga agaggcccatctccatggaaactttctggtaacaaggctgtgaaagagaaggggacaggt gtttcaggaagaagggaaggcgctgaggatgggcctggagaggaagtgtccctgaaagtg gttttggggctgcccgcgccagcgaaaagcacgtgtcccatttcctgccggaatggccag aaaaggagggccatgaaacgcaacgcccggctgaggtgccccttccggaagggcgcctgc gagatcacccggaagacccggcgacagtgccaggcctgccgcctgcgcaagtgcctggag agcggcatgaagaaggagatgatcatgtccgacgaggccgtggaggagaggcgggccttg atcaagcggaagaaaagtgaacggacagggactcagccactgggagtgcaggggctgaca gaggagcagcggatgatgatcagggagctgatggacgctcagatgaaaacctttgacact accttctcccatttcaagaatttccggctgccaggggtgcttagcagtggctgcgagttg ccagagtctctgcaggccccatcgagggaagaagctgccaagtggagccaggtccggaaa gatctgtgctctttgaaggtctctctgcagctgcggggggaggatggcagtgtctggaac tacaaacccccagccgacagtggcgggaaagagatcttctccctgctgccccacatggct gacatgtcaacctacatgttcaaaggcatcatcagctttgccaaagtcatctcctacttc agggacttgcccatcgaggaccagatctccctgctgaagggggccgctttcgagctgtgt caactgagattcaacacagtgttcaacgcggagactggaacctgggagtgtggccggctg tcctactgcttggaagacactgcaggtggcttccagcaacttctactggagcccatgctg aaattccactacatgctgaagaagctgcagctgcatgaggaggagtatgtgctgatgcag gccatctccctcttctccccagaccgcccaggtgtgctgcagcaccgcgtggtggaccag ctgcaggagcaattcgccattactctgaagtcctacattgaatgcaatcggccccagcct gctcataggaaaaacacaagcaaacagggggacgtggcccagaaaggggcttctatacct tcactggtcctcagtagctcctgtgacctactgcccagactgatgcccacagacccagag acaaacttggattttacggagcccaaggccatgaagggttaa >gi568815595r:119726791_120193434|GENSCAN_predicted_peptide_3|555_aa MKAEIKMFFETNENKDTTYQNLWDTFKAAYRGKFIALNAHKRKKERSKIDTLTSQLKELE KQEQTRSKDSRRQEITKVRAELKEIETQKTLQKVSESRSWFFEKINKIDRPPARLIKKKR EKNRIDAIKNDKGGITTDPTEIQTAIREYYEHFYANKLENLEEMDKFLETYTLPRLNQEE VESLNRPITGSEVEAIINSLPTKKSPGPDRFTAKIYQRYTEELNKIPGNPTYKGREGPRQ GELQTTAQRNKRGHKQMKEHSMLMDRRNQCRENGPTAQELSSNPPLATILIPPHARIQAA ASTPTNATAASVSSQPPRGQLLLLYLAQGPVHLYDGIVEIMKCNFTQSMLILETVDRPIM LLLHQLPTPPEQSRAASCTGKTTSYLSVTQQHWSLYGSRPPEGSEVESVGSPPAGPSTLS CSLTRTPTDTHAPADEYRPGAHKAARKRRGTSGKWETGLLEVAMKRLRKRTNVSPKGSRS ATCDAEQLLHEGRSKGVYVLDAQQPLGVLSIDAAELGEHSHDLQEQEPACQPEKRLWVQP ALTHKPQGCSEAAQA >gi568815595r:119726791_120193434|GENSCAN_predicted_CDS_3|1668_bp atgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacacaacataccag aatctctgggacacatttaaagcagcatatagaggaaaatttatagcactaaatgcccac aagagaaagaaggaaagatctaaaattgacaccctaacatcacaattaaaagaactagag aagcaagagcaaacacgttcaaaagatagcagaaggcaagaaataactaaggtcagagca gaactgaaggagatagagacgcaaaaaacccttcaaaaagtcagtgaatccaggagctgg ttttttgaaaagatcaacaaaattgatagaccgccagcaaggctaataaagaagaaaaga gagaagaatcgaatagacgcaataaaaaatgataaagggggtatcaccaccgatcccaca gaaatacaaactgccatcagagaatactatgaacacttctacgcaaataaactagaaaat ctagaagaaatggataaattcctggaaacatacaccctcccaagactaaaccaggaagaa gttgaatccctgaatagaccaataacaggctctgaagttgaggcaataattaatagcctc ccaaccaaaaaaagtccaggaccagacagattcacagccaaaatctaccagaggtacaca gaggagctgaataaaatacctgggaatccaacttacaagggacgtgaaggacctcgtcaa ggagaactacaaaccactgctcaacgaaataaaagaggacacaaacaaatgaaagaacat tccatgctcatggataggaggaatcaatgtcgtgaaaatggccctactgcccaagaactg tcaagtaatccacctctggctaccatccttattcctcctcatgctcggattcaagcagct gcttcaacccccacaaatgccacagcagcgtcagtttcttcccagccaccaagaggtcag ctcctcctactttacctggcccaggggccagtccatctttatgatgggattgtagagatt atgaaatgtaactttactcagtcaatgctaatactggagaccgtggacagaccaataatg ctgcttctgcatcagcttccaactccacctgaacagtcccgagcagccagctgcacagga aaaaccaccagttacttgagtgtcactcagcaacactggtcactttatgggtctcgtcct ccagagggttcggaagtggagtctgttggcagccctcctgcaggccctagcaccctgtcc tgctccttaactaggactcccacagatactcatgcgcctgccgatgagtacaggcctgga gcccacaaagcagctcggaagaggaggggaacgagtgggaagtgggagacaggactatta gaggtagcaatgaaaagactcaggaagcgaacaaacgtgtcacccaagggcagccgctca gctacctgtgatgccgaacaactcctgcatgaggggcgtagcaaaggggtgtatgtcctg gatgcgcagcagccgctgggtgtgctgagcattgatgctgcggagctcggtgagcatagc catgatcttcaggaacaagaacctgcatgccagccagagaaaagattgtgggtgcagccc gccctgacccacaagcctcagggctgctctgaggctgcacaagcataa >gi568815595r:119726791_120193434|GENSCAN_predicted_peptide_4|322_aa MVILNFVFCFNLPDTGHMNKKDEVYLNLVLDYVPETVYRVARHYSRAKQTLPVIYVKLYM YQLFRSLAYIHSFGICHRDIKPQNLLLDPDTAVLKLCDFGSAKQLVRGEPNVSYICSRYY RAPELIFGATDYTSSIDVWSAGCVLAELLLGQPIFPGDSGVDQLVEIIKVLGTPTREQIR EMNPNYTEFKFPQIKAHPWTKVIIRIKMQNLKFLEGTQTPFPTTVIVDSNLRNALRVFRP RTPPEAIALCSRLLEYTPTARLTPLEACAHSFFDELRDPNVKLPNGRDTPALFNFTTQEI AIATLTFSNQYPGQSAAIDIKV >gi568815595r:119726791_120193434|GENSCAN_predicted_CDS_4|969_bp atggtgattctcaactttgttttctgcttcaatttacctgatactggccacatgaacaag aaagatgaggtctatcttaatctggtgctggactatgttccggaaacagtatacagagtt gccagacactatagtcgagccaaacagacgctccctgtgatttatgtcaagttgtatatg tatcagctgttccgaagtttagcctatatccattcctttggaatctgccatcgggatatt aaaccgcagaacctcttgttggatcctgatactgctgtattaaaactctgtgactttgga agtgcaaagcagctggtccgaggagaacccaatgtttcgtatatctgttctcggtactat agggcaccagagttgatctttggagccactgattatacctctagtatagatgtatggtct gctggctgtgtgttggctgagctgttactaggacaaccaatatttccaggggatagtggt gtggatcagttggtagaaataatcaaggtcctgggaactccaacaagggagcaaatcaga gaaatgaacccaaactacacagaatttaaattccctcaaattaaggcacatccttggact aaggttattataagaattaaaatgcaaaacttaaaattcctagagggaactcagactcct ttccctaccactgtgatagtggacagcaacttgagaaatgctcttagggtcttccgaccc cgaactccaccggaggcaattgcactgtgtagccgtctgctggagtatacaccaactgcc cgactaacaccactggaagcttgtgcacattcattttttgatgaattacgggacccaaat gtcaaactaccaaatgggcgagacacacctgcactcttcaacttcaccactcaagaaatt gccatagccaccctaaccttcagcaaccaatatcctggtcaatcagcagccatcgacata aaggtataa >gi568815595r:119726791_120193434|GENSCAN_predicted_peptide_5|432_aa MKLEPYLTPYTKINLKWIKDLNIRSENVEPFEEIIRENLLDTVLVNNFLDNRQQKPKIDK WDYIKLKSFYTAKETINRSRRPFAKSNRNIQHWTLIQPTEIQTTIREYYKHLYANKLENQ EETDKFLDTYTLPRINQEEVESLNRPITGSEIEAMINSLPTKKSPGPENQRQKPHDYLNR CRKGLRQNSTAPHAKILNKRGIDGTYLKIRAIYDKLTINIILNGQKLEAFPLKTNTRQGC PLSPVLFNIVLEVLVRAIRQEKEIKGIQLGNKEVKLSLFADDMIVYLENSTVSAQNLKLI TNFSKVSGYKINVQKSQAFLDTNNRPTESQIMSELPFTIATKRIKYLGIQLTKNVKDLFK NNYKPLLNEIKEDINKWKNIPCSWIGKINMVKTAILPKVIYRFNVIPMKLPMTFFTELEK TTLNFILNQKRA >gi568815595r:119726791_120193434|GENSCAN_predicted_CDS_5|1299_bp atgaaattggaaccttatcttacaccatacacaaaaattaacttaaaatggattaaagat ttaaacataagatctgaaaacgtagaaccctttgaagaaatcatcagagaaaaccttctt gacactgttcttgtcaataatttcttggataacaggcaacaaaagccaaaaatagacaag tgggactatatcaaactaaaaagcttctacacagcaaaagaaaccatcaacagatccagg aggccctttgccaaatccaaccggaacatccaacattggacactaatccagccaactgaa atacaaactaccatcagagaatactataaacacctctatgcaaataaactagaaaatcaa gaagaaacagataaattcctggacacatacaccctcccaagaataaaccaggaagaagtt gaatctctgaatagaccaataacaggctctgaaattgaggcaatgattaatagtctacca accaaaaaaagtccaggaccagagaaccaaagacaaaaaccacatgattatctcaataga tgcagaaaaggccttcgacaaaattcaacagcccctcatgctaaaattctcaataaacga ggtattgatggaacgtatctcaaaataagagctatttatgacaaactcacaatcaatatc atactgaatgggcaaaaactggaagcattccctttgaaaaccaacacaagacaaggatgc cctctctcaccagtcctattcaacatagtgttggaagttctggtcagggcaatcaggcaa gagaaagaaataaagggcattcaattaggaaacaaggaagtcaaattgtccctgtttgca gatgacatgattgtatatttggaaaactccaccgtctcagcccaaaatctcaagctgata accaacttcagcaaagtctcaggatacaaaatcaacgtgcaaaaatcacaagcattccta gacaccaataacagaccaacagagagccaaatcatgagtgaactcccattcacaattgct acaaagagaattaaatacctaggaatacaacttacaaagaatgtgaaggacctcttcaag aataactacaaaccactgctcaacgaaataaaagaggacataaacaaatggaagaatatt ccatgctcatggataggaaaaatcaatatggtgaaaacggccatactgcccaaagtaatt tatagattcaatgtcatccccatgaagctaccaatgactttcttcacagaattggaaaaa actactttaaacttcatattgaatcaaaaaagagcctga >gi568815595r:119726791_120193434|GENSCAN_predicted_peptide_6|285_aa MEYYAAIKKDEFLSFAGTQMKLETILLSKLSQGPKTKDRMFSLTGGCPSGTKHPEEGSGS NICCSAASSGDTQANRVWSGPLANSNRPAAKKHNYSPAREQNWMQNEFDKMTEVGFRRSV ITNFSKLKKYVLTHCKEAKNLDKSQSLNQSKAGTLFNSLKADRSLQGRLPSLKLAEERCP LHTIKVQGKAVSADLEAVASYPEDLAKTTDEDNYTKQLLDAIKNIHDSAGTGTGVWKMIP ILDDSEEFKTSVEEVTADVVEIEREPELETEPEDVTELLQSCDQT >gi568815595r:119726791_120193434|GENSCAN_predicted_CDS_6|858_bp atggaatactatgcagccataaaaaaggatgagttcctgtcctttgcagggacacagatg aagctagaaaccatccttctcagcaaactatcacaaggacctaaaaccaaagaccgcatg ttctcactcacaggtgggtgcccctctgggacgaagcatccagaggaaggatcaggcagc aatatctgctgttctgcagcctcctctggtgatacccaggcaaacagggtctggagtgga cctctagcaaactccaatagacctgcagctaagaaacacaactactcgccagcaagggaa caaaactggatgcagaatgagtttgacaagatgacagaagtaggcttcagaaggtcagta ataacaaacttctccaagttaaagaagtatgttctaacccattgcaaggaagctaaaaac cttgacaaaagccaaagcctaaaccagagcaaggccggaaccctcttcaattctctgaag gccgacagaagccttcaaggaaggctgccaagtttgaagctagcagaggaaagatgccct ctccataccataaaagtacaaggtaaagcagtaagtgctgatttagaagctgtagcaagt tatccagaagatttagctaagaccactgatgaagataactacactaaacaacttctagat gccattaagaatattcatgattcagcgggaacaggaacaggagtttggaagatgatacca atcctggatgattctgaggagttcaaaacttcagtggaggaagtcactgcagatgtggta gaaatagaaagagaaccagaattagaaacagagcctgaagatgtgactgaactgctgcaa tcttgtgatcaaacttga >gi568815595r:119726791_120193434|GENSCAN_predicted_peptide_7|404_aa MGKKQAGSSGKGGSHTTATRVQAPCKGWLCKTCRPGHGPNPEQATLWDICLTSAGAKWCR DLAAAPSSWASLQLIFLKHFYGLRVKTNGYNPICFFKMRFINLPPPSPSKPSTLPYNNAE VNIVEADDLRQTLAIKIGYQDTNVWLKCIKNSVRTLNKSNCYACVHGRPEAQIVPFPLEW CSSQPGMGCMVTLFQDSTARSNKSCQALSLLYPKVQHPVGQPPRAIQLPSLNVNFTLCLS RNGKNLAFLGSLMEHIYLDAIGISQGVPDKFKAQDQIAAGFESLPPWNYVNTITMSRTVK ANHLVSINWLQIRKYGQMDKSCTMLNTTKAVEGMKEARKRKIWNVRRLPGKVASVLDVER QDSNNQKIEIRLEEDAEKWEPYYTAGGNVKWCSHYGKQFGSPVN >gi568815595r:119726791_120193434|GENSCAN_predicted_CDS_7|1215_bp atgggaaagaaacaggccggcagctcggggaaaggaggctcacataccacagcgactagg gttcaggcaccatgtaaagggtggctctgcaaaacatgcagaccaggccacgggccaaac cccgaacaggctacattatgggacatctgccttacttctgctggggccaagtggtgcagg gatttggctgcggctccaagcagctgggcatccctgcagctaattttcctgaagcacttt tacggcttaagagtaaagaccaatggatacaaccctatatgcttctttaaaatgcgtttt attaacctccctcctccttctccatctaaaccttctaccctgccctacaacaatgctgaa gtaaacattgtagaagcagatgatctaaggcaaactctagcaattaaaataggatatcaa gatacaaatgtctggttgaaatgcatcaaaaattctgtccgcacattaaacaaaagcaat tgttatgcttgtgtgcatggccggccagaggcccagattgtcccctttccactcgaatgg tgctccagtcaaccgggcatgggctgcatggtaactctcttccaggattctacagctagg agtaataaatcatgccaagctctctccctgctctatcccaaagtccaacaccctgtgggt cagcccccaagggccatccagcttccatctctcaatgtcaatttcaccttgtgtctctca cggaatgggaaaaacttggcgttcctgggaagcttaatggaacacatttatttagacgca attggaatctcacagggagtacctgataaattcaaagcccaagaccaaatagctgcagga tttgaatcattgcctccatggaactatgtaaatacaattacaatgtctagaaccgttaaa gccaatcatttagtaagcatcaactggctacagataaggaaatacggtcaaatggataaa agctgcacaatgttaaatacaacaaaagcggtagagggtatgaaggaagcaagaaaaaga aaaatctggaatgtcagaagacttcctgggaaggtagcatctgtattagatgttgagaga caggatagcaataatcaaaaaatagaaataaggcttgaagaagatgcggagaaatgggaa ccctactacactgctggaggaaatgtgaaatggtgcagccattatggaaaacagtttggc agtccagtaaattaa >gi568815595r:119726791_120193434|GENSCAN_predicted_peptide_8|101_aa MKCYLEIWDPGNWRANPEEQRKKSEDDSCTTGPGEQLDQIPAVGRGLQEKGHSRKRKGQE AGWASKHCHHYFNVETLEHLMKLTTNSKKSTRKQFEIESSK >gi568815595r:119726791_120193434|GENSCAN_predicted_CDS_8|306_bp atgaaatgttatttagagatatgggatccaggaaactggagagccaaccctgaagagcag cgaaagaagtcagaggatgacagctgtacaacggggcctggagaacaactagatcagatc ccagcagttggacgagggctccaggaaaaaggacactccaggaaaagaaaggggcaggag gcaggatgggcaagtaaacactgtcatcactactttaacgtggaaacactcgagcacttg atgaaactgactaccaacagcaagaaaagtacaagaaagcagtttgaaatcgaatcatcc aaataa >gi568815595r:119726791_120193434|GENSCAN_predicted_peptide_9|438_aa MAPAAGRGAAAQRRCLSGSFVPILASESLQPAGKAPLVRLALSGKRPTLGTRDPRQRGAL GAQNWLRPAESLLWVGRAPPPLCSLERSEPRAADRSTFRFAAPRARGPCVRFPPLLEAAA GEPSGRQPRPESGISPLLGPGPKANHTAARSCDWTAVRADWPPPRVSGSLPLPFAPTPAP KLQANETALYRRPPFAFFSPPWRGRGSPDWPDCPRKPPSLHQSPKEVRSQRSFPANHKAA AAPSRAVSPASLGFTVLNSSSLFVRAGDSDWPLLPLSNVSSSQPPNKANSPFPFGYLRHF PLFGRGFPASLSNWFLRLIGFFREEPQTNDVRDWLRSGFGSQPKRASVGLGRRFPDACYA GGGALGGVRLGGGGRRWRTSRRGEEEWGSARRLRRQRSSHRAAPDPGECGLVPPPPPPPP GPSDKGRKEARRSRPRSR >gi568815595r:119726791_120193434|GENSCAN_predicted_CDS_9|1317_bp atggcccccgcggctgggcggggcgcggccgcccaaaggcgctgcctttctggaagcttt gttcccattttagcgtctgagagcttgcagccggctgggaaggcccccttggtccgtctg gccctttcggggaagaggccaacactcggcacacgcgatccacggcagaggggagccttg ggcgcgcagaattggctgcgccccgccgagagcctcctgtgggtggggagagcccctcca cccctctgctcgcttgagcgctcagagcccagggccgccgaccgcagcactttccgattt gctgcaccgagggcccgcggtccctgtgtgcggtttccaccgttgttggaggcggccgca ggcgaaccgtcgggtcgtcagccacgacccgagtcaggcatctccccgctcctgggaccg gggccgaaggccaatcacactgcagctaggtcttgcgattggacggcagtgagagccgat tggccgccgccgcgagtttcgggctccctccctctcccttttgccccaactccagcgcct aagcttcaggccaatgagacagcgctttatagacgccctcccttcgctttcttctcccca ccttggagagggaggggaagtcctgactggccagactgtcctcggaagcccccttctctt caccaatcaccgaaggaggtacgctcccagcggagttttccagccaatcacaaagcggcg gcggcgcccagccgtgcagtttcaccagcgtctctgggtttcaccgtcctcaactcttca agcctcttcgtaagggccggcgactctgattggccactgttgccattgtcgaatgtctcc tccagccagccaccgaacaaggcgaattcaccctttccgttcggctaccttcggcatttt ccgctctttgggcgtggcttcccagcgtcactttctaattggtttctcaggctgatcggc tttttccgggaggagccgcaaacaaacgacgtccgtgattggctccgttcgggcttcggc tcccagccgaagcgggcgagcgtggggctcggccggcgattcccagacgcctgttacgcg ggcggcggggcgctgggcggtgtaaggctgggtgggggaggaaggaggtggaggacgagt aggaggggggaggaggagtggggaagtgcaaggcggctgcgcagacagcgctcctcacac agagcagctcctgacccgggcgaatgcgggcttgtgccgccgccgccgccgccgccgccc gggccaagtgacaaaggaaggaaggaagcgaggaggagccggccccgcagccgctga >gi568815595r:119726791_120193434|GENSCAN_predicted_peptide_10|1507_aa MSSPNLNIVTLLGSCLTYSSAYLFGIQDVLVGSSMETLIQTRLSMLCIGTSLVFGPILGK SWRLYKVFTQRVPDKRVALWGHVSWPIPGTSVRQNKIPRDCCLAQVRAGSEGRKSDYRET IATSQQGEVLTLTLLQIIKDLQLLGLVAALLMADVILLMTWVLTDPIQCLQILSVSMTVT GKDVSCTSTSTHFCASRYSDVWIALIWGCKEKEGKGWSIRKLTEPDPIDSLGLLLLYGAY LAGLTGHVSSPPVNQSLTIMVGVNLLVLAAGLLFVVTRYLHSWPNLVFGLTSGGIFVCTT TINCFIFIPQLKQWKAFEEENQTIRRMAKYFSTPNKSFHTQYGEEENCHPRGEKSSMERL LTEVSSAPAWKIPCKGSNHASVAPKALIADVLFSSHSPTSLNSKQNLTISTVTRIYFIHS VDRCIMSHSYVPGIVPVNNEQKHGSYPHRAHSLVEEIEIHSSSKHVIIHCHKNYERKIRV VRQNNVWDSICYIQKDALSLQRKPFNGHRCTSGKKLASPVHATPGISFPEQAREQAASPR ARWQKNAVIESLQEQVNNAKEKIVRLMSAECTYDLPEGAAPPASSPNKDVQAVASVHTLA AAQGPSGHLSDFQNDPGMAARDSQCTSGPSSYAQSLEGPGKDSSFSPGKEEKISDSKDFS DHLDSGCSQKPWTEQSLGPERGDQVPMNPSQSLLPERGGSDPQRQRHLENSEEPPERRSR VSSVIREKLQEVLQDLGLGPEASLSTAPSCHQQTWKNSAAFSPQKMPLSKELGFSPYMVR RRRAAQRARSHFPGSAPSSVGHRANRTVPGAHSRLHVQNGDSPSLAPQTTDSRVRRPSSR KPSLPSDPQDRPGTLEGSKQSQTEPEGARGSKAAFLRQPSGSGRAPSPAAPCLSKASPDL PEQWQLWPPVPSGCASLSSQHSYFDTESSSSDEFFCRCHRPYCEICFQSSSDSSDSGTSD TDPEPTGGLASWEKLWARSKPIVNFKDDLKPTLASQISPLGSDVPEVLTVPTVPLSCGSS SIHPLMQFCTAGRHTPAISVGATLAPPKGAGELVKDECEELSHHQSHRESTPGSTVLIAE LAVMSRLSPAPSPVYTSVVTRRQEMPMSQIKTCCELEKVCVKVSQETLGSENKCLGDTLE RKRKRKRKRKKKRRGRRRREEKRKKKKSKQQRDEKRDVVKEQANQRTKGILCGQFVAIAS HINKCTDASSKLRGQGGHTNGMQPGALFPAVTLITPELRVSHDWARVAWALRSGATAVNP HPKRISSLSGPACRSRDVPGQVTSMECCAHNRHKRCLNIDGRKRGKETRKDKFLFSISHE ACGETMNKPRGLGGRNGFMGLAQGPAALWSLGTWCPASQPLQLQLWLKKGQGVKLQTFAV SVTAHRGSADPKSEQQQDLLQRVKEQSFHSVERTRAGCHCRLGQLAFIPLSGPSHILLIG PFYRELIGLFYRELIASGNHPSTFYIHEPNHFDFYVLQSHEVLHSTDHTDCLLTECGISS CLQSSRL >gi568815595r:119726791_120193434|GENSCAN_predicted_CDS_10|4524_bp atgtccagtcccaatctgaacattgtgaccttactgggcagttgtctcacttacagtagc gcttacctctttgggattcaggatgttttagtggggagctcaatggaaactctcattcag acaagactgtccatgctgtgcattgggacctcccttgtgtttggccccattctgggaaag agctggcgactctacaaggtgtttacccaaagggtcccggacaagagagtggccctgtgg ggacatgtgagctggccaatacctggcaccagtgtgaggcagaacaagatacccagagac tgttgtttggcacaggtgcgagctggaagtgaagggaggaagagtgattacagagaaact atagcaaccagtcaacaaggagaagttttgaccttgactcttttgcagattatcaaagac ctgcagttgctggggttggtggcagccctgttgatggctgatgtgatcctgctcatgacg tgggtgctgactgatcccatccagtgcctccagattctcagtgtcagtatgacggtgaca gggaaagacgtgtcctgcacttcgaccagcacccacttctgtgcttcccggtattccgat gtttggattgctctcatttggggatgcaaggaaaaagaaggaaaaggatggagcatcagg aaacttactgagccagaccccatagactccctgggtctgctcctgctgtatggtgcctac ctggctggcctgactggccatgtcagctcccctcctgtgaatcagtccttaaccatcatg gtgggggtcaacctccttgtactggctgctgggctgctttttgtagtcaccagatacttg cattcctggcccaacctggtctttggactcacatctggagggatctttgtttgtacaact acaatcaactgcttcatcttcattccccagctgaagcaatggaaggcatttgaagaggaa aaccaaacaatcagacgcatggccaaatatttcagcactcccaacaaaagcttccatacc cagtatggtgaggaggagaactgccacccgaggggagagaaaagctccatggagaggctc ctcacagaagtaagctctgcccctgcctggaagattccctgtaaaggatccaatcatgca tcggtggcccccaaggctcttattgctgatgttctcttttcctctcattctcccacttct cttaacagcaagcaaaacctcaccatttccaccgtgacacgcatctacttcattcattca gtggacagatgtataatgagtcactcctatgtgccaggcattgtgccagtgaacaatgaa caaaagcatggttcctaccctcacagagctcacagtctagtggaagaaatagagattcat tcatcaagcaaacatgtaattatacattgtcataagaattatgaaagaaaaatcagagtg gtaagacagaataatgtgtgggactcaatctgctatatccaaaaagatgctttgagtctg cagaggaagcccttcaatgggcacaggtgcacaagtggtaagaagctggcctccccagtg cacgctacacctggaatctctttcccagaacaggccagagaacaagcggccagtcctagg gcccggtggcagaaaaatgctgtgattgaaagcctgcaggaacaagtaaacaacgccaaa gagaagattgtgaggctgatgtcagctgagtgcacctatgacctcccagagggggctgcc ccacctgcctcttccccgaacaaggacgtccaggcggtagcctcggtccacaccctggca gctgctcaggggccttcgggtcacctctctgactttcagaatgatcctggcatggctgcc cgggattcccagtgcacttcagggccctcctcatatgcacaaagccttgaggggcctggg aaggactccagcttctccccagggaaggaggagaagatatctgactcaaaagacttttct gatcatttagactcaggttgtagccagaagccatggactgagcaaagcctgggtccagaa agaggagaccaagtccccatgaaccccagccagagtctcctaccagaaagaggcggctca gatccccagagacagaggcatctggagaactcagaggagcccccagagcggcggtcacgg gttagttcagtaatcagggagaaacttcaggaggtcttacaagatctgggcctgggccct gaggcttccctctccaccgccccctcttgtcatcagcaaacctggaagaacagtgctgcc ttcagcccccaaaagatgcccctctccaaggagctcggctttagcccttacatggtgagg agaaggcgggcagctcagcgggcccgctcacactttcctggctctgcaccctcatctgtg gggcatcgggcaaacaggactgttcctggggcacacagcaggctacatgtgcagaatggg gacagccccagcctggccccacaaactactgattccagagtacgaagaccttcttccagg aagccttcactaccttcagatcctcaagacagaccaggtaccctggagggcagcaaacaa agccagacagagcccgagggggctagagggagcaaagcagcctttcttcgccagccttct ggttctggccgggccccaagtcctgctgccccatgcctctccaaagcctcacctgacttg cctgaacagtggcagctgtggcccccagtgccctcaggctgtgcctccctgtcttctcaa cacagctattttgatactgagtccagcagctcagatgagttcttctgccgctgccaccgg ccctactgtgaaatctgcttccagagctcttctgactctagtgacagtggcacatcagac actgaccctgagcctactggggggctggcttcctgggaaaagctgtgggcccgctccaag cctattgtgaacttcaaagatgacttgaaacccacgctggcttcccagatctctcccctg ggatcggatgtacctgaggtcctgactgtccccactgtccccttgtcctgtggctcatcc tcgattcacccactgatgcagttttgtactgcggggaggcacacaccagcaatttctgtt ggggccacactggcacctcccaagggagctggggagctggtgaaggatgagtgcgaggaa ctcagccaccaccagagccacagggagagcacacctgggtccactgtgcttattgctgag ctggctgtcatgtccaggctgtcccctgccccgagtcctgtgtatacatctgtggtcaca agaagacaggaaatgccaatgtcacaaatcaaaacctgttgtgaacttgaaaaggtctgt gtcaaggtttcccaagaaacattaggctcagagaataagtgcctgggagacaccctggaa agaaagaggaagagaaagaggaagaggaagaagaagagaagaggaagaagaagaagagaa gagaagaggaagaagaagaagtccaaacaacagagagatgagaagagagatgtggtgaag gagcaagccaatcaaagaacaaaaggcatcctatgtggacagtttgtggcaattgcctct cacataaataaatgtactgacgccagttctaagttgagaggacaaggaggccataccaat ggaatgcagcctggtgccttatttccagcagttaccctcatcactccagaactgcgggtg tcccatgactgggccagggtggcctgggccttgcggtcaggagccactgctgtcaatcca catcctaaaaggatttcttctctctcaggtcctgcctgcagatcccgggatgtacccggt caagtgacaagcatggaatgctgtgcacacaatagacataagaggtgcttaaatattgac gggaggaagagagggaaggaaacaaggaaggataaattcttattcagcatctctcatgaa gcttgtggtgaaacaatgaacaagcccagaggcctaggaggaagaaatggttttatgggc ctggcccagggccctgctgctttgtggagccttgggacttggtgccctgcatcccagcca ctccagctccagctgtggctaaaaaagggccaaggagtgaagctgcagaccttcgcggtg agtgttacagctcatagaggcagtgcggacccaaagagtgagcagcagcaagatttattg caaagagtgaaagaacaaagcttccacagtgtggaaaggacccgagcgggttgccactgc cggctggggcagcttgcttttattcccttatctggccccagccacatcctgctgattggt ccattttatagagagctgattggtctgttttacagagagctgattgcctctggtaaccat ccttccactttctatatccatgagcccaatcattttgacttttatgtcctacaaagccat gaggtcctgcactccactgatcacactgactgtctcctgactgaatgtggcatcagcagc tgcctgcaaagctccaggctttga