GENSCAN 1.0 Date run: 8-Nov-116 Time: 15:35:43 Sequence gi568815585r:23630351_23989320 : 358970 bp : 41.17% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 529 524 6 1.05 1.02 Term - 4029 3829 201 0 0 56 38 212 0.931 9.41 1.01 Init - 13946 13911 36 2 0 70 68 48 0.284 1.06 1.00 Prom - 14494 14455 40 -4.85 2.00 Prom + 20515 20554 40 -4.95 2.01 Init + 23341 23343 3 0 0 75 101 0 0.590 0.05 2.02 Intr + 23508 23651 144 2 0 75 38 73 0.384 0.56 2.03 Intr + 23996 24151 156 1 0 81 66 133 0.665 9.69 2.04 Intr + 28603 28864 262 0 1 27 87 357 0.700 25.64 2.05 Intr + 30015 30140 126 1 0 67 115 231 0.988 23.43 2.06 Intr + 37630 37732 103 2 1 104 88 47 0.951 4.61 2.07 Intr + 38342 38747 406 2 1 67 110 277 0.849 21.23 2.08 Intr + 46791 46959 169 2 1 72 81 85 0.006 4.80 2.09 Intr + 58300 58403 104 2 2 94 45 97 0.037 4.97 2.10 Term + 59073 59198 126 2 0 75 49 124 0.228 4.50 2.11 PlyA + 60888 60893 6 1.05 3.00 Prom + 61790 61829 40 -4.35 3.01 Init + 64641 64729 89 2 2 80 86 37 0.259 2.87 3.02 Intr + 71595 71797 203 0 2 4 89 171 0.506 6.81 3.03 Intr + 72302 72404 103 0 1 35 105 129 0.536 7.61 3.04 Term + 74780 74912 133 2 1 44 42 110 0.340 -1.42 3.05 PlyA + 77284 77289 6 1.05 4.10 PlyA - 78755 78750 6 1.05 4.09 Term - 84570 84451 120 0 0 71 49 184 0.457 10.29 4.08 Intr - 85163 84963 201 2 0 78 51 76 0.421 1.46 4.07 Intr - 85687 85306 382 0 1 31 -38 300 0.670 5.69 4.06 Intr - 86387 86220 168 1 0 101 105 144 0.976 15.54 4.05 Intr - 91399 91086 314 1 2 60 45 142 0.039 1.16 4.04 Intr - 99146 98901 246 2 0 75 30 184 0.087 8.03 4.03 Intr - 100095 100061 35 1 2 127 88 37 0.140 4.62 4.02 Intr - 110396 110268 129 0 0 85 87 36 0.008 2.95 4.01 Init - 126381 126195 187 0 1 42 93 145 0.469 9.67 4.00 Prom - 126721 126682 40 -8.25 5.00 Prom + 128262 128301 40 -10.45 5.01 Init + 128929 129527 599 0 2 92 49 204 0.253 11.70 5.02 Intr + 134035 134077 43 1 1 66 64 43 0.252 -2.98 5.03 Intr + 137546 137666 121 1 1 119 68 80 0.402 8.25 5.04 Intr + 139974 140275 302 1 2 99 46 61 0.149 -1.47 5.05 Term + 140607 141056 450 2 0 26 42 239 0.599 7.30 5.06 PlyA + 141651 141656 6 1.05 6.00 Prom + 142204 142243 40 -5.75 6.01 Sngl + 150417 151163 747 2 0 58 43 375 0.973 25.93 6.02 PlyA + 151391 151396 6 1.05 7.00 Prom + 151560 151599 40 -6.15 7.01 Sngl + 151653 152906 1254 2 0 49 43 613 0.970 48.50 7.02 PlyA + 152958 152963 6 1.05 8.00 Prom + 153699 153738 40 -3.55 8.01 Init + 153867 154583 717 2 0 74 42 247 0.677 13.60 8.02 Intr + 159982 160154 173 0 2 66 78 146 0.864 9.22 8.03 Intr + 161431 161617 187 1 1 118 76 6 0.467 1.17 8.04 Intr + 167279 167374 96 1 0 77 77 72 0.199 4.29 8.05 Term + 174510 174629 120 2 0 86 48 55 0.109 -1.21 8.06 PlyA + 174760 174765 6 1.05 9.17 PlyA - 176984 176979 6 1.05 9.16 Term - 181944 181792 153 0 0 82 54 152 0.580 8.14 9.15 Intr - 203733 203515 219 0 0 67 50 93 0.168 0.98 9.14 Intr - 205999 205890 110 2 2 117 91 64 0.949 8.68 9.13 Intr - 207406 207202 205 1 1 58 93 129 0.904 8.25 9.12 Intr - 211138 210985 154 0 1 105 93 43 0.154 5.65 9.11 Intr - 228562 228510 53 1 2 66 115 35 0.118 0.79 9.10 Intr - 230779 230648 132 1 0 69 72 41 0.327 0.52 9.09 Intr - 232012 231952 61 0 1 55 94 66 0.142 1.72 9.08 Intr - 233839 233791 49 2 1 55 96 30 0.116 -2.68 9.07 Intr - 239098 238942 157 1 1 59 115 71 0.852 5.56 9.06 Intr - 239845 239663 183 1 0 71 76 194 0.998 15.56 9.05 Intr - 244559 244496 64 1 1 91 83 66 0.709 4.20 9.04 Intr - 249004 248918 87 0 0 92 73 56 0.831 2.67 9.03 Intr - 250348 250128 221 1 2 6 19 236 0.162 4.68 9.02 Intr - 256156 255983 174 1 0 60 87 242 0.968 20.51 9.01 Init - 258970 258620 351 1 0 55 36 383 0.893 25.01 9.00 Prom - 260428 260389 40 -8.75 10.04 PlyA - 260479 260474 6 -0.45 10.03 Term - 261711 260939 773 1 2 97 48 601 0.999 49.56 10.02 Intr - 263851 263789 63 2 0 95 88 64 0.820 4.87 10.01 Init - 266636 266471 166 2 1 83 78 121 0.835 10.45 10.00 Prom - 269328 269289 40 -4.45 11.06 PlyA - 270082 270077 6 1.05 11.05 Term - 289316 289269 48 2 0 104 42 43 0.038 -2.37 11.04 Intr - 290741 290658 84 2 0 67 79 74 0.052 3.50 11.03 Intr - 311119 310940 180 1 0 74 75 199 0.918 16.34 11.02 Intr - 311407 311276 132 1 0 16 76 106 0.486 2.12 11.01 Init - 316120 315380 741 1 0 70 76 341 0.337 25.88 11.00 Prom - 316923 316884 40 -11.24 12.00 Prom + 316966 317005 40 -18.01 12.01 Init + 317244 317641 398 2 2 87 91 161 0.577 12.65 12.02 Intr + 317780 317987 208 2 1 85 84 135 0.985 10.96 12.03 Intr + 320075 320374 300 1 0 101 52 103 0.643 4.01 12.04 Intr + 321021 321082 62 2 2 99 4 80 0.147 -2.69 12.05 Intr + 321785 321930 146 2 2 87 84 74 0.222 5.91 12.06 Intr + 322416 322950 535 1 1 17 35 246 0.361 3.25 12.07 Term + 323264 323765 502 2 1 72 46 259 0.900 13.07 12.08 PlyA + 323985 323990 6 1.05 13.07 PlyA - 326586 326581 6 1.05 13.06 Term - 338606 338366 241 1 1 17 49 185 0.013 2.01 13.05 Intr - 349413 349219 195 2 0 83 49 198 0.031 13.01 13.04 Intr - 349735 349461 275 1 2 67 61 157 0.543 6.31 13.03 Intr - 353110 353039 72 1 0 52 60 89 0.444 1.18 13.02 Intr - 354414 354303 112 2 1 15 57 171 0.274 6.16 13.01 Init - 357042 356990 53 0 2 83 101 10 0.412 2.48 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 45430 45332 99 1 0 54 43 163 0.913 5.55 S.002 Sngl + 338357 338542 186 1 0 78 48 180 0.819 7.83 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:23630351_23989320|GENSCAN_predicted_peptide_1|78_aa MWRLWELRDEIWRLKIKAREIRFIGDWVIINCYCSQKAPPKAQAHSDFEEQLPIQGVLST HSVKAFNFSIFTDNRQSQ >gi568815585r:23630351_23989320|GENSCAN_predicted_CDS_1|237_bp atgtggagattatgggaactacgagatgagatttggagactgaagataaaagccagagaa attcgatttattggtgactgggtaataatcaactgctactgttctcagaaggccccacca aaggctcaggcacacagcgactttgaagagcagctcccaattcaaggggtgctgtcaact cactcagtcaaggcattcaacttttccatttttactgataacagacaaagtcaatga >gi568815585r:23630351_23989320|GENSCAN_predicted_peptide_2|532_aa MGEHSQATVAMGLSGQQIYWSVPGSWFCRSSPALIECSGPSASVGLGVCMQSPPQNAPRP CPRTPRRCSACLLPPRLITVARHADTKTNGKAEDPEHLCGQHGKATGKGACQLRHKHCIC SCSEHADPIPVVSGASKVNLVKIASTASSPRDTALAAVICSALATVLLALLILCVIYCKR QFMEKKPSWSLRSQDIQYNGSELSCFDRPQLHEYAHRACCQCRRDSVQTCGPVRLLPSMC CEEACSPNPATLGCGVHSAASLQARNAGPAGEMVPTFFGSLTQSICGEFSDAWPLMQNPM GGDNISFCDSYPELTGEDIHSLNPELESSTSLDSNSSQDLVGGAVPVQSHSENFTAATDL SRYNNTLVESASTQDALTMRSQLDQESGAVIHPATQTSLQWVHGLSGFRSEAADLHGVTA LKAALWSCSFLLSGVVHSSRWVRGLTGLRSEAADLPNIWVYKYHCVATGYSIHSNMLCTC VAWEQQALIAWKWSPPIDEWMDRQNVAYAYNGISLSLQKEGNSDIRYSLDEP >gi568815585r:23630351_23989320|GENSCAN_predicted_CDS_2|1599_bp atgggagagcacagccaggccacagtggccatgggactttctgggcagcagatctattgg agtgtgcctggcagctggttctgcaggagctcacctgccctaatagaatgttcaggccct tctgcctcagtgggtcttggtgtctgcatgcagtcacctccacagaatgccccccgcccc tgccccaggacacccagacgttgctcagcatgcttactgcccccacgtctgatcacagtg gcccgtcatgcagacaccaagaccaatgggaaggcggaggaccccgaacatctctgtggg cagcatgggaaggccactggtaagggtgcctgccagcttcgccataaacactgcatctgc agttgttcagagcatgctgaccccattcctgtggtttcaggtgccagcaaggtcaacctc gtgaagatcgcgtccacggcctccagcccacgggacacggcgctggctgccgttatctgc agcgctctggccaccgtcctgctggccctgctcatcctctgtgtcatctattgtaagaga cagtttatggagaagaaacccagctggtctctgcggtcacaggacattcagtacaacggc tctgagctgtcgtgttttgacagacctcagctccacgaatatgcccacagagcctgctgc cagtgccgccgtgactcagtgcagacctgcgggccggtgcgcttgctcccatccatgtgc tgtgaggaggcctgcagccccaacccggcgactcttggttgtggggtgcattctgcagcc agtcttcaggcaagaaacgcaggcccagccggggagatggtgccgactttcttcggatcc ctcacgcagtccatctgtggcgagttttcagatgcctggcctctgatgcagaatcccatg ggtggtgacaacatctctttttgtgactcttatcctgaactcactggagaagacattcat tctctcaatccagaacttgaaagctcaacgtctttggattcaaatagcagtcaagatttg gttggtggggctgttccagtccagtctcattctgaaaactttacagcagctactgattta tctagatataacaacacactggtagaatcagcatcaactcaggatgcactaactatgaga agccagctagatcaggagagtggtgctgtcatccacccagccactcagacgtccctccag tgggttcatggtctctctggcttcaggagtgaagctgctgaccttcacggtgttacagct cttaaggcggcactctggagttgttcgttcctcctgtctggagttgttcattcctcccgg tgggttcgtggtctcactggcctcaggagtgaagctgcagaccttcccaacatttgggtg tacaaataccactgtgttgcaactggctacagtattcacagcaacatgctgtgcacgtgt gtagcttgggagcaacaggctctcattgcctggaaatggtctccacccatcgatgaatgg atggatagacaaaatgtggcctatgcatacaatggaatatcactcagccttcaaaaggaa ggaaattctgacatacgctacagcctggatgaaccttga >gi568815585r:23630351_23989320|GENSCAN_predicted_peptide_3|175_aa MMKEWTYDIVQFGGPLGKLLSRGRETLAGGPAARHSGQQLDIEAVTEGAGPWLWGHLGAS GPSRWPSQHSRCPGRHAALRSGTGSAQQGAHKPLRAPACARCLKCEQWKAGASAGPQEQG DGGERRGSSAKSIWVEASCLEFLPRDSSIKARSHSPVSMAGIASHTVRTTKDEHT >gi568815585r:23630351_23989320|GENSCAN_predicted_CDS_3|528_bp atgatgaaggaatggacatatgacattgtgcaatttggcggcccacttgggaagttgctg agccggggtcgggagacactggctggagggccagcagccagacacagcggccagcagctg gacatagaggccgtaactgagggtgctgggccatggctgtggggacacctgggagcctca gggccatcaagatggccctcacagcacagcaggtgccctggcaggcatgcagcattgcga tctggaacaggatctgcacagcagggcgctcataagcccctaagagctccagcctgtgct agatgcttaaaatgcgaacaatggaaagcaggagcctcagcggggccacaggaacaaggg gatggaggtgaacgtcgaggttccagcgccaagagtatttgggtagaggcctcctgcctg gagttcctgcccagagacagcagcatcaaagccagatcccatagccctgtgtccatggct ggaatagcctctcacaccgtcagaaccactaaagatgagcacacctag >gi568815585r:23630351_23989320|GENSCAN_predicted_peptide_4|593_aa MVAAFMNMASVWLSLRIQDKCEWRLCKQESVTEPLFPRAAGERYRREMLAHGGGREPMLM VEGTTPAFPVKVGSRNQALKLACVSPRLYESEERQNHLSEKTAASGMLQKCPSVDDFTND ERKDSKYPKLGHTFNLFAESGGISSCSEMGLDGETSSVAGLCQVLLDARGIDPSCFPTSS SPLEIQIGTDQKAEQPAPQDADMAETAGRGAQEISVPSTPFCCEPKTAEWIMYVLRSSCA ICTLQHFPSIPWKGREVLETDRAQKSDRDTRSEWSQCPQPLQHLIRFGSVSPPNPTCRRR PGGRLAAAATTSTDANTTTSAVTLNALAHPIRPLSSGCHRHPLTTTVKLQSLSPPPTAAR IPEEMLCACGAPPGAGRCRPLESGVQEEENSPFFGGHHYSLPPLLPTATSSAAPDRAPDP SLPQAFQHIVGLQTVPSLAQAVQSQIAPPTCYPPWPRIVPPASPELPDNECPTPLSVPPN CPPPPPPLAPPPAMQTRIAPQTSPPYHAQCSHGQCSPNSTPNRSLRLPQRAYLPCSNFST TLADLQFPSPPPTTLRGPEGEATCYTLDAAACATVAHIAVVGGSNGDCSQLEW >gi568815585r:23630351_23989320|GENSCAN_predicted_CDS_4|1782_bp atggtggcagcttttatgaatatggcatcagtgtggctttctttgagaattcaagacaag tgtgaatggaggctgtgcaagcaggagtctgtaacagaacctctcttccccagggctgcc ggggagcgctatcgcagggagatgctggcccacggtggaggcagggagcccatgctcatg gttgaaggaacaacccctgcttttccagtgaaggtggggagtagaaatcaggcattaaag cttgcttgtgtatcacccagactttatgaatcggaggaaagacagaatcatttaagtgaa aaaacagctgcttcaggtatgcttcagaagtgtccttctgttgatgacttcaccaatgat gaaaggaaagattcaaaatacccaaaactaggacacacattcaacctatttgcagaatct ggaggaatttcttcatgctcagagatgggactggatggagaaacatcatcagtggccggc ctgtgccaggtgctcctggatgcaagggggatcgacccttcatgctttcccacctcctcc tcacccttagaaatccaaatcggtactgaccagaaagcagagcagccagcaccccaagat gccgatatggcggagacagcggggaggggtgcacaggaaatttctgttccttccactcca ttttgctgcgaacctaaaactgctgagtggataatgtatgtcttaaggagttcctgtgca atttgcacattacagcactttccttctataccatggaaggggagggaggtcctggagaca gacagagcccagaagagtgacagagacaccaggtctgagtggtctcagtgtccccagccc ctgcagcatctgatacggtttggctctgtgtccccacccaatcccacgtgtcggaggaga cctggtgggagacttgctgctgctgccaccactagcactgatgccaatacaaccacctct gctgtcaccctcaatgcactggcccaccctataaggcccctatcatctggctgccacagg catcctcttaccactacagtcaagctgcagtctctgtcaccaccaccaactgcagcaagg attcctgaggaaatgctgtgcgcctgcggagctccaccaggagcaggacgttgtcgccct ctagagtctggagtccaggaagaggagaacagccccttctttggaggccaccactattcg ctgccacctctgctgcccacagccaccagcagtgcagcccctgatagggcccccgaccca tccctgccacaggcattccagcacatcgtagggcttcaaaccgtgccctccctggcacag gcagtgcagtcccagatagcacccccaacttgctaccctccatggccccgtatagtgccc ccagccagccctgagctgcctgataacgaatgccccacacccctgtcagtgcccccaaac tgcccccccccacccccgccactggccccaccgcctgcaatgcaaacccggatagcaccc caaaccagccccccttaccatgcacagtgcagccatgggcagtgcagccctaacagcaca cccaaccgcagtctgcggttgccccagagagcatacctaccctgcagcaacttttctacc actctggccgatctgcagtttccatcaccaccacctaccacacttagaggaccagaggga gaagccacctgctacacgctggatgctgcagcctgtgccactgtggctcacatcgctgtg gttggtggcagcaacggagactgcagccagctggagtggtag >gi568815585r:23630351_23989320|GENSCAN_predicted_peptide_5|504_aa MGSSAPPQTAREPSHPSPSTGPPRTSPDSTGYPTPTRQHRIPRNPSASTRSPYLPRQHRI PHTPKTARDGIPHTPLDSTGSPASPSHSIESLTPSRKQGISCTSPPNGSLTTTWPLERDW ESRTTDPRHPYFLPTMLPASPVPCFASILWKGVGIPYHVVRSEKPRKPTGAAGAASGSTG ALIQGFPQTPHSHLYTVAAGPTLMPRFSRAGNGKKQNNTLFLLSVNRTLDCCGSDCISQM PRGVWLCDYILENGYTLPPGRSIWLILTFLNVFLNYASSTRPSLMSKKRGGGQDAVSVPI PGTIGTTGFTAVPLPPGSWDLSIEVCADELPLHTLLLNQGLTPLPASLASPRCSLAQRLE NHPSFQTSLNSQLASTREDPDLMAAQSLGGADSKSGITGQSHCNKCYGNAGSALTGCTAN TRSHNHPDRGHTHLLGLWHGLKERREGKSVHCAGPRDGPVCQLLTESFLSRVDIRHLFVS VTSGVSSINIYTNNDKRSKNIRHK >gi568815585r:23630351_23989320|GENSCAN_predicted_CDS_5|1515_bp atgggatcctctgcacctccccagacagcacgggaaccctcgcacccttcacccagcaca ggacccccacgcacctctccagacagcacgggataccccacccccaccagacagcacagg atcccccgcaacccctcagccagcacaagatccccatacctccccagacagcacaggatt ccccacacccccaagacagcacgggacgggatcccccacacccccctagacagcacagga tcccctgcatcccccagccacagcatagaatccctcacaccctccaggaagcaagggatc tcctgtacctctccacccaatggaagtttaaccactacctggcctttagaaagagattgg gaaagcaggactactgatcctaggcacccctacttcctcccgacaatgctcccagcctct ccagtgccttgttttgcctccattctgtggaaaggggtgggcatcccgtatcacgtggtg aggtcagaaaagccaagaaaacccacaggagcagcaggggctgcttcaggtagcacagga gccctcatacaagggttccctcagacaccacacagccacctctacacagtggctgcaggc cctactctaatgccacgattttccagggccggaaatggaaagaaacagaataataccctt ttcttgctaagtgtgaacaggactctcgattgctgtggcagtgactgcatttcccagatg ccccgaggcgtgtggctgtgcgactatattcttgaaaatggctatactcttcccccaggt cgatccatatggctcattctcactttcctcaatgtctttttaaattacgcttcttcaaca agaccttctctgatgagtaaaaaaaggggagggggtcaggatgctgtctcagtgccaatt ccaggcacaattggaacaaccggtttcactgctgtccccctgcccccaggcagctgggac ctcagcatcgaagtctgtgcagatgaactcccactccataccctcttgctcaatcagggc ctaactcccctgcctgccagcttggcctcccccaggtgctctctggcacagagacttgaa aatcatccatctttccaaacctcactgaattcacaactggccagcactagagaggaccct gacctcatggctgcacagtcactggggggtgcagacagtaaatccgggatcactggacag tcacactgcaacaagtgctatgggaatgcaggctctgcacttaccggctgcacagccaac accagatcccacaaccaccctgaccgtggccacacccacctcctagggctatggcatggg ctaaaggagagaagggaagggaaaagtgtgcactgtgctggtcctcgggatggtccagtt tgtcagctgctaacagaatcctttctaagcagggtggacatcagacacctctttgtatca gtgactagtggggtgtcttcaataaatatttacacaaacaatgataaaaggagcaagaat attagacataaatga >gi568815585r:23630351_23989320|GENSCAN_predicted_peptide_6|248_aa MELKTTAQELRDECTSFSSPFDQLEERVSVIKDQMNEMKREEKFRKKRVKRNEQSLQEIR DYAKRPNLRLIGVPESDGENGTKLENTLQDIIQENFPNLARQANIQIQEIQRTPQRYSSR RATPRHIIIRFTKVEMTEKMLRAAREKGRVTHKRKPIRLTANLLAETLQARREWGPIFNI LKEKNFQPRISYPAKLSFISEGEIKYFTDKQMLGDFVTTRLALQELPKEALNMERNNWYQ PLQKHAKL >gi568815585r:23630351_23989320|GENSCAN_predicted_CDS_6|747_bp atggagttgaaaactacggcacaagaactacgtgacgaatgcacaagcttcagtagccca ttcgatcaactggaagaaagggtatcagtgattaaagatcaaatgaatgaaatgaagcga gaagagaagtttagaaaaaaaagggtaaaaagaaatgaacaaagcctccaagaaataagg gactatgcgaaaagaccaaatctacgtctgattggtgtacctgaaagtgacggggagaat ggaaccaagttggaaaacactctgcaggatattatccaggagaacttcccgaacttagca aggcaggccaacattcaaattcaggaaatacagagaacaccacaaagatactcctcgaga agagcaactccaagacacataattatcagattcaccaaagttgaaatgacggaaaaaatg ttaagggcagccagagagaaaggtcgggttacccacaaaaggaaacccatcagactaaca gcaaatctcttggcagaaactctacaagccagaagagagtgggggccaatattcaacatt cttaaagaaaagaattttcaacccagaatttcatatccagccaaactaagcttcataagt gaaggagaaataaaatactttacagacaagcaaatgctgggagattttgtcaccaccagg cttgccctacaagagctcccaaaggaagcactaaacatggaaaggaacaactggtaccag ccactgcaaaaacatgccaaattgtaa >gi568815585r:23630351_23989320|GENSCAN_predicted_peptide_7|417_aa MGDFNTPLSTLDRSTRQKVNKDIQELNTALHQADLIDTYRTLHPKSTEYTFFSAPHHTYS KIDHIVGSKALLSKCKRTEIITNCLSDHSAIKLELRIKKLTQNCSTTWKLNNLLLNDHRV HDKMKAEIKMFFETSENKDTTYQNLWDTFKAVCRGKFIPLNAHKRKQERSKNDTLTSQLK ELEKQEQTHSEASRRQEITKIREELKETETQKSLQKINESRSWFFEKINKIDRPVARLIK KKREKNQIDAIKNDKGDISTNPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLN QEEVESLNRPITGSEIEAIINSLPTKKNPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKE GILPNSFYEASIILIPKPGRDTTEKENFRPISLMNISAKILNTGKLNPAAHQKAYPP >gi568815585r:23630351_23989320|GENSCAN_predicted_CDS_7|1254_bp atgggagactttaacaccccactgtcaacattagacagatcaacaagacagaaagttaac aaggatatccaggaattgaacacagctctgcaccaagcagacctaatagacacctacaga actctccaccccaaatcaacagaatatacattcttctcagcaccacatcacacttattcc aaaattgaccacatagttggaagtaaagcactcctcagcaaatgtaaaagaacagaaatt ataacaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaaactc actcaaaactgctcaactacatggaaactgaacaacctgctcctgaacgaccaccgggta catgacaaaatgaaggcagaaataaagatgttctttgaaaccagtgagaacaaagacaca acataccagaatctctgggacacatttaaagcagtgtgtagagggaaatttataccacta aatgcccacaagagaaagcaggaaagatctaaaaatgacaccctaacatcacaattaaaa gaactagagaagcaagagcaaacacattcagaagctagcagaaggcaagaaataaccaag atcagagaagaactgaaggagacagagacacaaaaaagtcttcaaaaaatcaatgaatcc aggagctggttttttgaaaagatcaacaaaattgatagaccagtagcaagactaataaag aagaaaagagagaagaatcaaatagacgcaataaaaaatgataaaggggatatctccacc aatcccacagaaatacaaactaccatcagagaatactataaacacctctacgcaaataaa ctagaaaatctagaagaaatggataaattccttgacacatacactctcccaagactaaac caggaagaagttgaatccctgaatagaccaataacaggctctgaaattgaggcaataatt aatagcctaccaaccaaaaaaaatccaggaccagacggattcacagctgaattctaccag aggtacaaggaggagctggtaccattccttctgaaactattccaatcaatagaaaaagag ggaatcctccctaactcattttatgaggccagcatcatcctgataccaaagccgggcaga gacacaaccgaaaaagagaattttagaccaatatccctgatgaacatcagtgcaaaaatc ctcaatactggcaaactgaatccagcagcacatcaaaaagcttatccaccatga >gi568815585r:23630351_23989320|GENSCAN_predicted_peptide_8|430_aa MLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITPHIYDHLIFDKPDKNKKWGKDS LFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLHVRPKTIKTQEENLGNTIQDIGM GKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPIEWEEIFAIYSSDKGLI SRIYKELKQIYKNKTNNPIKKWVKDMNRHFSKEDIYAANRHMKKCSSSLAIREMHIKTTN SDDSNAPKANLSNVLCGRKNNALSSKDVHILSLEPVNVTLFGKKMDFADVIKLKILRKTP EVPLLTASNSSLSLYLIPTPFSLCPTNHSTDNVLIQVNNDLHLAKSNSCCSVLSSSTLKP PGKLADAQQIGVGCHIIKSEEECKSPTKTQKFTKHFQVSSPSVRPTRLEGHRPEPTGSYQ TIQSGGIARE >gi568815585r:23630351_23989320|GENSCAN_predicted_CDS_8|1293_bp atgctacctgacttcaaactatactacaaggctacggtaaccaaaacagcatggtactgg taccaaaacagagatatagaccaatggaacagaacagagccctcagaaataacaccacac atctatgaccatctgatctttgacaaacctgacaaaaacaagaaatggggaaaggattcc ctctttaataaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggat cccttccttacaccttatacaaaaattaattcaaggtggattaaagacttacatgttaga cctaaaaccataaaaacccaagaagaaaacctaggcaataccattcaggacataggcatg ggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaa tgggatctaattaagctaaagagcttctgcacagcaaaagaaactaccatcagagtgaac aggcaacctatagaatgggaggaaatttttgcaatctactcatctgacaaagggctaata tctaggatctacaaagaactcaaacaaatttacaagaataaaacaaacaaccccatcaaa aagtgggtgaaggatatgaacagacacttctcaaaagaagacatttatgcagccaacaga cacatgaaaaaatgttcatcatcactggccatcagagaaatgcacatcaaaaccacaaat tctgatgacagcaatgcccccaaagctaatctttcaaatgttctgtgtggcagaaagaat aatgccctctcctccaaagatgtccacatcctatccctagaacctgtgaatgtcacctta tttgggaaaaaaatggactttgcagatgtgattaagctaaagatcctgaggaaaacacca gaagtgcctctactcactgcctccaattcttctctctcactttacttgatacccactcca tttagcctttgtcccaccaaccactcaactgataatgttcttatccaagtcaataacgac cttcaccttgctaaatccaatagttgttgctcagttctcagctcttctaccctaaagcct ccaggaaagcttgctgatgcacaacaaataggtgtaggttgccacatcatcaagagtgag gaagaatgcaagagccctacaaaaacacaaaagtttacaaaacactttcaagttagctct ccttctgttagaccaactaggctagaaggacatcgacctgaacctaccggtagttatcaa accattcagtctggggggattgccagagaatga >gi568815585r:23630351_23989320|GENSCAN_predicted_peptide_9|790_aa MLCVGRLGGLGARAAALPPRRAGRGSLEAGIRARRVSTSWSPVGAAFNVKPQGSRLDLFG ERRVSGAGGAPLSPRAKPTPAPRGQQQSGRDCLGLGGGEQRTHLLQWRLVCALDEGKGLF GVPELSAPEGFHIAQEKALRKTELLVDRACSTPPGPQTVLIFDELSDSLCRVADLNGALQ SADTGHADRAESGDSLGESFASRGKAPGKESAVGCLLEALCMLCPAELDVGWSHGQALFI LSEDKYRLKLNTNVDLYQSLQKLLADKKLVDSLDPETRRVAELFMFDFEISGIHLDKEKR KRAVDLNVKILDLSSTFLMGTNFPNKIEKHLLPEHIRRNFTSAGDHIIIDGLHAESPDDL VREAAYKIFLYPNAGQLKCLEELLSSRDLLAKLVGYSTFSHRALQGTIAKNPETVMQFLE KLSDKLSERTLKDFEMIRGMKMKLNPQNSDLYLIQTSGSQSVVPGPEAVASHGSLLEMQI LEPHLRPTESKTGEVMPWDPPYYSGVIRAERYNIEPSLYCPFFSLGACMEGLNILLNRLL GISLYAEQPAKGEVWSEDVRKLDCHFTIRGGRLKEDGDYQLPVVVLMLNLPRSSRSSPTL LTPSMMENLFHEMGHAMHSMLGRTRYQHVTGTRCPTDFAEVPSILMEYFANDYRVVNQFA RHYQTGQLWRPSNPQTSTRLVKGSGWEAGDSKGASCFRGFRRVRPREAPLSVGDVGSITA ARRGFCEFEESEVSEVAAAKLTPIHLANLTPTSIKPFRVTAHIPSPTGLFFPARVTVSNC GIGAYVENAS >gi568815585r:23630351_23989320|GENSCAN_predicted_CDS_9|2373_bp atgctgtgcgtcggaaggctgggcggcttgggagccagagcagcagctctgccgccccgc cgggcgggccggggaagcctcgaagccgggatccgggcccgaagggtcagcaccagctgg tctcccgtgggcgccgccttcaatgtcaagccccagggcagccgcttggacctgttcggc gagcgccgggtgagcggcgcaggaggagctcccctcagtccccgagctaagccgacccct gctccgcgcggccagcaacaatcagggcgagattgtttgggcctgggaggaggcgagcag cgaacccaccttttacagtggcgtcttgtgtgcgctctcgatgagggcaagggtcttttt ggagttcctgagctgagtgccccagaaggatttcatattgcacaagaaaaagccttgaga aagacagaattgcttgtggaccgtgcatgttccaccccacctgggccccagaccgtgctg atcttcgatgagctctcggattccttatgcagagtggccgacttgaacggggccctgcag agtgcagatactgggcatgctgacagggctgaaagtggggactccctgggagaaagcttt gcctctcggggaaaggcccctgggaaggagtcagcagtgggctgcctgctcgaggctctc tgcatgctgtgtcctgccgaattagacgtgggctggtctcatggacaagcactcttcatt ttgtcggaggacaaatacagactgaagttgaacacaaatgtggatttatatcaaagtttg caaaaattactagctgataaaaaacttgtggattcccttgatccagaaacaaggcgagtg gctgaactgtttatgtttgattttgaaattagtggaatccatctagacaaagaaaagcgt aaaagagcagtggacctcaatgttaaaatcttggatttgagtagtacatttcttatggga accaattttcccaacaagattgagaagcatctcttaccagaacacattcgtcgtaacttt acatctgctggggatcatatcataattgatggtctccacgcagaatcaccagatgacttg gtgcgagaagctgcttataaaatttttctttatcccaatgctggtcaattgaaatgttta gaagaattgctcagcagcagagatcttctggcaaagttggtggggtattccacgttttct cacagggctctccaaggaacgatagctaaaaatccagagactgtcatgcagttccttgaa aaactatctgacaaactttctgaaagaactctgaaagattttgagatgatacgagggatg aaaatgaaactgaatcctcaaaattccgacctatacttaatccagaccagtggttctcaa agtgtggttcctggaccagaagcagtagcatcacatgggagcctgttagaaatgcaaatt cttgagccccacctcagacccactgaatcaaaaactggggaagtaatgccctgggacccc ccttactacagtggtgtgattcgtgcagaaaggtataatattgagcccagcctatattgc ccgtttttctctcttggagcatgcatggaaggcctgaatattttgcttaacagactgttg gggatttcattatatgcagagcagcctgcaaaaggagaggtgtggagcgaagatgtccga aaactggattgccatttcactatccgtggaggcagactaaaggaagatggagactatcaa ctcccagttgtagttcttatgctgaatcttccccgttcctcaaggagttctccaactttg ctaactcctagcatgatggaaaatcttttccatgaaatgggacatgccatgcattcaatg ctaggacgtactcgttaccaacacgtcactgggaccaggtgccctactgattttgctgag gttccttctattctgatggagtactttgcaaatgattatcgagtagttaaccaatttgcc agacattatcagactggacagctctggcgacccagcaacccgcagacaagtacacggctg gtgaaggggtcagggtgggaggcaggggacagcaaaggggcatcctgcttccgagggttc cgcagggtcaggccaagggaggcgccacttagtgtgggagacgtggggagcatcactgcc gccaggaggggtttctgcgagtttgaggagtctgaggtttcggaggtggccgctgccaaa ttaacacctattcatttggcaaatctcactcctacatccatcaagcctttcagggtcaca gctcacataccgtcccctacgggccttttcttccctgcacgcgttacagttagtaactgt ggaattggtgcttacgtggagaatgctagctga >gi568815585r:23630351_23989320|GENSCAN_predicted_peptide_10|333_aa MRIWWLLLAIEICTGNINSQDTCRQGHPGIPGNPGHNGLPGRDGRDGAKGDKGDAGEPGC PGSPGKDGTSGEKGERGADGKVEAKGIKGDQGSRGSPGKHGPKGLAGPMGEKGLRGETGP QGQKGNKGDVGPTGPEGPRGNIGPLGPTGLPGPMGPIGKPGPKGEAGPTGPQGEPGVRGI RGWKGDRGEKGKIGETLVLPKSAFTVGLTVLSKFPSSDVPIKFDKILYNEFNHYDTAVGK FTCHIAGVYYFTYHITVFSRNVQVSLVKNGVKILHTRDAYVSSEDQASGSIVLQLKLGDE MWLQVTGGERFNGLFADEDDDTTFTGFLLFSSQ >gi568815585r:23630351_23989320|GENSCAN_predicted_CDS_10|1002_bp atgaggatctggtggcttctgcttgccattgaaatctgcacagggaacataaactcacag gacacctgcaggcaagggcaccctggaatccctgggaaccccggtcacaatggtctgcct ggaagagatggacgagacggagcgaagggtgacaaaggcgatgcaggagaaccaggatgt cctggcagcccggggaaggatgggacgagtggagagaagggagaacgaggagcagatgga aaagttgaagcaaaaggcatcaaaggtgatcaaggctcaagaggatccccaggaaaacat ggccccaaggggcttgcagggcccatgggagagaaaggcctccgaggagagactgggcct caggggcagaaggggaataagggtgacgtgggtcccactggtcctgaggggccaaggggc aacattgggcctttgggcccaactggtttaccgggccccatgggccctattggaaagcct ggtcccaagggagaagctggacccacggggccccagggtgagccaggagtccggggaata agaggctggaaaggagatcgaggagagaaagggaaaatcggtgagactctagtcttgcca aaaagtgctttcactgtggggctcacggtgctgagcaagtttccttcttcagatgtgccc attaaatttgataagatcctgtataatgaattcaaccattatgatacagcagtggggaaa ttcacgtgccacattgctggggtctattacttcacctaccacatcactgttttctccagg aatgttcaggtgtctttggtcaaaaacggagtaaaaatactgcacaccagagatgcttac gtgagctctgaggaccaggcctctggcagcattgtcctgcagctgaagctcggggatgag atgtggctgcaggtgacaggaggagagaggttcaatggcttgtttgctgatgaggacgat gacacaactttcacagggttccttctgttcagcagccagtga >gi568815585r:23630351_23989320|GENSCAN_predicted_peptide_11|394_aa MSQGPKTLVKEWRSQHWSARGEWELDGLALGWTHGLQPQLSCAVEPSSHLGMSCPLRLPG AGRPPRCRRALKGFRGSLCRRGQRLSVAAEGCGGPGWIADSRLDRRFGIADWGLDWGFGV GSGMWGLDRGFEDWIGDLGLDPGFGAGSAVGGSEKVTESSPCSGAGGWGSEKSPPPRSSS ASGAAGARWPGLHRPRLHGSGYRIRDAEQRKIQGRLSRTGGGARPGAQEPETWRPGTSST GSWGQPRSSYSCRTALHLACASGHVEVVTLLVNRQCQIDICDKEKSVPLIQAQAVHCQEE ACAIILLERGANPNLKDVYSNTALQYAVYSESTSLAKKLLSHDANTEALDKIQGAHVQVC YVAVLHDAEVCIVIDSITQDFFGYMGSFSVPYED >gi568815585r:23630351_23989320|GENSCAN_predicted_CDS_11|1185_bp atgtcccaaggccccaagaccctggtgaaggaatggaggagccagcattggtcggcgaga ggggagtgggagttggatggcctggctctgggctggacacacggtttgcagccacagctg agctgtgctgtggagcccagttcccacctgggcatgtcctgtcctctgaggcttccggga gccggacggccaccgcggtgcaggcgcgcactcaagggctttcgaggctcactctgtcgg agaggtcagagactgagtgtcgctgctgaaggctgtggtggaccgggctggatcgcggat tctaggttagatcgcagatttgggatcgcggattggggattggattggggatttggggtt ggatcagggatgtgggggctggataggggatttgaggattggatcggggatttggggttg gatcctggatttggggccgggtcggcggtggggggcagtgaaaaggtgacagagagcagc ccctgctcaggagccggtggttgggggtctgagaagtcaccaccaccaagaagttcttcg gcttcgggagccgcgggggcgagatggcccgggctccatagaccacgtctgcacggttcc gggtaccgaatccgggacgcggaacagcggaaaatccaagggcggctgtcaaggaccgga ggtggagcgcggcctggagcgcaggagccggagacctggaggcccgggacaagcagcaca ggtagctggggtcagcccaggtcctcttattcttgtagaactgctctacatttggcctgt gccagcggccatgtggaagtggtcactctcctggtgaacagacaatgccaaattgacatc tgtgacaaagaaaagagtgtgcctttgatacaggcacaggctgtgcattgccaagaagag gcttgtgccattattctgctggaacgtggcgccaatccaaaccttaaggatgtctacagc aacactgctctccagtatgctgtgtatagtgagagcacatcactggcaaaaaaactgctt tcccatgatgcaaatactgaagcactggacaagattcagggagcccacgtgcaggtttgt tacgtggctgtattgcatgatgctgaggtttgcattgtgattgattccatcacccaggac ttcttcggctatatgggctctttctcggttccatatgaggattag >gi568815585r:23630351_23989320|GENSCAN_predicted_peptide_12|716_aa MHSPSPLPAYPIQAWAGDPEPQWLSHQPCQPPAQKHAPQATQTRGAQESSSQGHSTTPQP PFTLPCTWRGSVTHPPHKNSQERPSLFGRKFCWHQGPLREASTWRAGLGSPMTVPQQPHC QTLLSCHGYPFHRLPLGCGCPPAKLCPRHPGLSCAELAPVAVGATVAGPQVFTPPICAAT ANGTEDNCPQMGLLAWGLPLPRAWVWVCCRPGFRALPRLWTRGVMRAEHTYLSNGMNFRA PGYGPSGTLERVGRALGHPIRHPTTNTQPQWSIQWPLGKLRAQEKDLRWLIGKPSPLSWD PVLPEEPLGSHGPGITECRGSAGLCHGYAPCLVQTRVVCPSLWVRGLAGLRSETADLHVS VTAHKGGTDPKEAHSASPLTGTRSGTLRHLAWALWQPRGSSSPDQAQQVLAGCVECGACP ACAHPEPVPAASACRHSPSSRQCLSLHTSLQAEGAGSSLGQPQRGVPTAQWQAEGLLEHG QNGRRGQGGTKSERGLLALCHLSPPQPSQTQPALSTQVPIPNNSICRALSHTPATSPPRI VRYFMPGGPRQTVQPHSTPNLLPTAAQTAPPNCSPRRPPLTASIVAPDSSPNPSPHHWQC SIADTTCFPVPPHSTGSVAPHTLLTRPTASNATLDRVPNQPPHATGSAHMGGAALTAPKP ALRGCPDNAPTPQPTTLAELKSPSLPTAASVATNHSEASYSGAGSSLQHAPSTRVF >gi568815585r:23630351_23989320|GENSCAN_predicted_CDS_12|2151_bp atgcactcaccctcccctctccctgcctaccctattcaggcctgggctggggatcctgag ccacagtggctcagccaccagccctgccagccgcctgcccaaaagcatgctccccaggcc acccagacacgaggagcccaagagagctcttcccagggccattcaaccacccctcagcct cccttcaccctgccctgcacttggagaggctctgtgacccaccccccacacaagaactca caggaacgcccttctctttttgggagaaagttctgctggcatcagggaccactgagggaa gcgagcacatggagagcaggactcgggtcccccatgactgtgccccagcagccccactgc cagacacttctgagctgccatgggtacccgtttcacaggctccccctgggatgtggctgc ccacctgcaaagctatgtcctcgccacccaggtctgagttgtgcagagctggctccagtg gctgtaggtgctacagtcgcagggcctcaggtcttcacaccccccatctgtgctgccaca gccaatggcactgaagacaactgtcctcagatggggctgctggcctgggggttgccactg ccaagggcctgggtctgggtgtgctgcaggccaggattcagggctttacccaggctgtgg acaagaggagtgatgagagctgagcacacttatctgagcaacggcatgaatttcagggca cctggctatgggccctcaggcaccttggaaagagtgggaagagccctggggcaccccatt aggcaccccacaaccaatacacagccacagtggtcaatccaatggcccctggggaaactg agggcccaagagaaagatttaagatggcttattgggaagccaagtcctctctcctgggac cctgtgctccctgaggagcctctggggtcccacggtcctggaatcactgagtgccgtggg agtgcaggcctttgccatggctatgccccctgcctggtgcaaaccagagttgtttgtccc tccctgtgggttcgtggtcttgctggcctcaggagtgaaactgcagaccttcacgtgagt gttacagctcataaaggtggcacagacccaaaggaagcccactcggcttcacctctcact ggcactcgtagcgggactttgcggcacctagcctgggcactctggcagcccagagggagc tcatcccccgatcaagcccagcaggtgcttgccggctgtgtggagtgcggggcctgccca gcctgtgcccacccggaacctgtgccagctgcgagtgcctgtaggcacagccccagctcc cgccagtgtctctccctccacacctctctgcaagcagagggagctggctccagccttggc cagccccagagaggggtccccacagcgcagtggcaggctgaagggctccttgagcatggc cagaatggacggcgaggccaaggaggcaccaagagcgagagagggctgctagcactttgt cacctctcaccaccacaaccaagtcagacccagccagccctctccacccaagtgcccatt cccaacaactccatctgcagggccctgtcccacaccccagccacctcccctccccgcatt gtccgctacttcatgcctgggggtcccaggcagacagtgcagccccatagcacccccaac ttgctccctactgcagcccagacagcccccccaaactgctccccccgccggccaccactc actgccagtatcgtagccccggatagctcacccaacccatctccccaccactggcaatgc tcaatagcggacacaacctgcttccctgttcccccgcactccaccggcagtgtagcccct catacgctgctaacccgccccactgccagcaatgcaaccctggatagggtcccaaaccag cccccccatgccacaggcagtgcacacatgggcggtgcagccctaacagcacccaaaccc gcactccgtggctgcccagataacgcgcctaccccgcagcctaccactctggctgagctg aagtctccgtcactaccaaccgcagcctccgtcgccacaaaccacagtgaagcaagctac agtggtgcaggctccagcctccagcatgctccctctacacgcgttttctaa >gi568815585r:23630351_23989320|GENSCAN_predicted_peptide_13|315_aa MGTIDFYIIQAIFYPLKICSTQEPVSESTDQGKLLDLVQAVHSEHFPCSLEKLHQDEVLT PVPQNGTVFGDRAFIAVIELYPFSYPEPLIINFHPPLQEKNVPRAQDFGQMLLGSTFSKS AGEVTICREIPLACLSIPPTSSEGPWRDPSIPLRAGRGRKVTEKPTGNCPGAGPPSRWAQ DRSESVSGSPPSRTQTGSGRLSECRVSLAMSSSASPSAASSASRAGAVLLFAVLRGCCHE EELGFSVDQCCLQALEFSVHVINLLSTLLRGNGFTGIQKAAVDQTSRRPPNSDHNLFWCK FGFGKCLGASSWSNH >gi568815585r:23630351_23989320|GENSCAN_predicted_CDS_13|948_bp atgggaactattgatttttacattattcaagctatcttttatccactgaaaatctgttct acacaagaacctgtttctgaaagcaccgatcaaggaaagctgctggatcttgtccaagca gtgcattccgaacactttccgtgttccttggaaaagctccaccaggatgaagtcctgact ccagtacctcagaatgggactgtgtttggagatagggccttcatagcggtaattgagctc tatcccttctcctacccggagccgctgatcatcaactttcaccccccacttcaggaaaaa aatgttccacgggcacaggatttcggccaaatgctgcttggctctaccttctccaaaagt gcgggcgaggtgacaatatgtagagaaatcccacttgcctgtctcagcatccctcctacg tcctcagagggtccttggagggatcccagcatcccgctccgagctggaagggggaggaag gtcacggagaagccgactggaaactgtcccggggcgggacctcccagccgctgggcccag gaccgcagcgaatcagtctctggctccccgcccagcagaactcagacaggctctggacgg ctgtccgaatgccgcgtttccctggcgatgagcagttccgcatcaccctctgcggctagc tcggcttccagggccggggctgtgctattgtttgctgttcttcgtgggtgctgtcatgaa gaagaattgggcttttctgttgatcaatgctgcctgcaggcgctggagttttcagtgcat gtcatcaatttgctgagtacactcctcagaggcaatggtttcactggcattcagaaagct gcagtggatcagacgagccgacgaccaccaaatagtgaccataaccttttttggtgcaag tttggctttgggaagtgccttggagcgtcttcttggtccaaccactga