GENSCAN 1.0    Date run: 4-Nov-116    Time: 19:32:31

Sequence gi568815588r:47267810_47468829 : 201020 bp : 48.62% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

1.01  Init +   7397   7467   71  1  2   79   26    79 0.308   1.32
1.02  Intr +  11504  11698  195  2  0   79   50   112 0.098   5.03
1.03  Intr +  15364  15467  104  1  2   89   83    59 0.308   5.32
1.04  Term +  18341  18471  131  0  2   22   48   136 0.110   1.34
1.05  PlyA +  20886  20891    6                               1.05

2.00  Prom +  25610  25649   40                              -6.56
2.01  Init +  32843  33161  319  1  1   94  106   453 0.997  43.20
2.02  Intr +  41987  42912  926  0  2  108   97  1152 0.995 109.05
2.03  Intr +  44659  44742   84  0  0   92   55    59 0.616   3.02
2.04  Intr +  44792  44979  188  1  2  135   97   229 0.815  26.99
2.05  Intr +  52212  52314  103  0  1   33   75   110 0.583   4.38
2.06  Intr +  54923  55205  283  1  1  105   97   279 0.129  27.49
2.07  Intr +  57032  57971  940  0  1   82   67  1209 0.659 108.43
2.08  Term +  62860  63049  190  1  1   22   42   140 0.142  -0.28
2.09  PlyA +  65085  65090    6                               1.05

3.00  Prom +  79510  79549   40                              -4.66
3.01  Init +  80676  83729 3054  2  0   69   94  4672 0.503 455.96
3.02  Intr +  85516  85706  191  0  2   90   78   326 0.722  30.18
3.03  Intr +  87567  87709  143  0  2  108   84   165 0.949  18.10
3.04  Term +  89293  89648  356  2  2   90   50   440 0.818  35.06
3.05  PlyA +  90027  90032    6                               1.05

4.02  PlyA -  90183  90178    6                               1.05
4.01  Sngl - 100969  99998  972  1  0   93   43   656 0.991  56.35
4.00  Prom - 109360 109321   40                              -3.96

5.12  PlyA - 115087 115082    6                               1.05
5.11  Term - 116062 116028   35  2  2  125   46    22 0.623  -0.55
5.10  Intr - 117816 117734   83  2  2  125   74    44 0.469   5.98
5.09  Intr - 121307 121149  159  1  0  122   50    63 0.182   5.00
5.08  Intr - 138290 138153  138  1  0  102   99    28 0.430   4.88
5.07  Intr - 139008 138864  145  1  1   46   49    80 0.257  -0.76
5.06  Intr - 142478 142387   92  1  2   14  100    81 0.243   1.54
5.05  Intr - 146487 146278  210  2  0   37   83    97 0.266   2.13
5.04  Intr - 146833 146625  209  1  2  120   97    32 0.252   5.18
5.03  Intr - 147273 146901  373  2  1   57   51   154 0.627   3.66
5.02  Intr - 147825 147697  129  2  0   57   94    40 0.593   1.31
5.01  Init - 150193 148566 1628  1  2   59   55   498 0.445  35.33
5.00  Prom - 160266 160227   40                              -2.46

6.02  PlyA - 161770 161765    6                               1.05
6.01  Sngl - 165150 164128 1023  0  0   88   43   404 0.999  32.97
6.00  Prom - 173886 173847   40                              -2.36

7.05  PlyA - 174011 174006    6                               1.05
7.04  Term - 185465 185351  115  1  1   86   52    98 0.729   4.04
7.03  Intr - 187630 187500  131  1  2  -38   68   152 0.532   0.09
7.02  Intr - 187781 187690   92  0  2   50   89    43 0.778   0.31
7.01  Init - 188369 188252  118  2  1   59   84    85 0.816   5.73
7.00  Prom - 192382 192343   40                              -6.16

8.00  Prom + 192654 192693   40                              -2.66
8.01  Init + 193983 194027   45  2  0   93   47    48 0.607   1.98
8.02  Intr + 195257 195368  112  1  1   50   71    82 0.543   2.65
8.03  Term + 196392 196483   92  1  2   31   49   119 0.501   0.18
8.04  PlyA + 199265 199270    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  54860  55205  346  1  1   45   97   419 0.862  33.88
S.002 Init - 122871 122849   23  0  2   49   76    58 0.927  -0.13

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588r:47267810_47468829|GENSCAN_predicted_peptide_1|166_aa
MKIENTGNQKKILKLPEGKTGEESPQKLPPPNQLWTPLLLFSASPELPALVLSSWKFLRK
HLHMDVEEKKISRSHGVLWESKIRTPKGLTPRNVHHYLHFSAEENAKVLYLQQEAGPREA
DPRVGEQVQKEKQKIPGSKLGIPEPLGNASLRSEWRCRIGKWMETF
>gi568815588r:47267810_47468829|GENSCAN_predicted_CDS_1|501_bp
atgaagattgagaacacagggaatcaaaagaagatcctaaaacttccagagggcaaaact
ggagaagaaagtccccagaagctaccaccccccaaccagctttggacacccctgctctta
ttttctgcatccccagaactacctgccctagtgctgagctcatggaagttcctcaggaag
catcttcacatggatgttgaagagaaaaaaatctccaggtcccacggggtgctgtgggag
tccaaaatcaggactccaaaaggcctcaccccgaggaacgttcatcattatctgcatttt
tcagcagaggagaatgccaaagtcttgtacctgcagcaagaagccggccccagagaggct
gaccccagagtaggggagcaggtgcagaaggagaagcagaagatccctggaagcaagctt
ggtatcccagagccactggggaatgccagcctgaggtctgagtggagatgccggattggg
aagtggatggagacgttctga
>gi568815588r:47267810_47468829|GENSCAN_predicted_peptide_2|1010_aa
MAHVPARTSPGPGPQLLLLLLPLFLLLLRDVAGSHRAPAWSALPAAADGLQGDRDLQRHP
GDAAATLGPSAQDMVAVHMHRLYEKYSRQGARPGGGNTVRSFRARLEVVDQKAVYFFNLT
SMQDSEMILTATFHFYSEPPRWPRALEVLCKPRAKNASGRPLPLGPPTRQHLLFRSLSQN
TATQGLLRGAMALAPPPRGLWQAKDISPIVKAARRDGELLLSAQLDSEERDPGVPRPSPY
APYILVYANDLAISEPNSVAVTLQRYDPFPAGDPEPRAAPNNSADPRVRRAAQATGPLQD
NELPGLDERPPRAHAQHFHKHQLWPSPFRALKPRPGRKDRRKKGQEVFMAASQVLDFDEK
TMQKARRKQWDEPRVCSRRYLKVDFADIGWNEWIISPKSFDAYYCAGACEFPMPKVDAYS
VASAGEQQQSSMAWDCEDGMGAWIVRPSNHATIQSIVRAVGIIPGIPEPCCVPDKMNSLG
VLFLDENRNVVLKVYPNMSVDTCACRPGLEDIPAAKNKQPFRHVGPVPNSKASVVEKQQQ
GKPLQSWGRGSAGGNAHSPLGVPGGGLPEHTFNLKMFLENVKVDFLRSLNLSGVPSQDKT
RVEPPQYMIDLYNRYTSDKSTTPASNIVRSFSMEDAISITATEDFPFQKHILLFNISIPR
HEQITRAELRLYVSCQNHVDPSHDLKGSVVIYDVLDGTDAWDSATETKTFLVSQDIQDEG
WETLEVSSAVKRWVRSDSTKSKNKLEVTVESHRKGCDTLDISVPPGSRNLPFFVVFSNDH
SSGTKETRLELREMISHEQESVLKKLSKDGSTEAGESSHEEDTDGHVAAGSTLARRKRSA
GAGSHCQKTSLRVNFEDIGWDSWIIAPKEYEAYECKGGCFFPLADDVTPTKHAIVQTLVH
LKFPTKVGKACCVPTKLSPISVLYKDDMGVPTLKYHYEGMSVAECGCRRQKHLEPSHFAR
ATSQQLLEPEFELQFVTPSPDSLYCMDYCPGAFLSVKVKVTLKGKIKTRA
>gi568815588r:47267810_47468829|GENSCAN_predicted_CDS_2|3033_bp
atggctcatgtccccgctcggaccagcccgggacccgggccccagctgctgctgctgctg
ctgccgttgtttctgctgttgctccgggatgtggccggcagccacagggcccccgcctgg
tccgcactgcccgcggccgccgacggcctgcagggggacagggatctccagcggcaccct
ggggacgcggccgccacgttgggccccagcgcccaggacatggtcgctgtccacatgcac
aggctctatgagaagtacagccggcagggcgcgcggccgggagggggcaacacggtccgc
agcttcagggccaggctggaagtggtcgaccagaaggccgtgtatttcttcaacctgact
tccatgcaagactcggaaatgatccttacggccactttccacttctactcagagccgcct
cggtggcctcgagcgctcgaggtgctatgcaagccgcgggccaagaacgcttcaggccgc
ccgctgcccctgggcccgcccacacgccagcacctgctcttccgcagcctctcgcagaac
acggccacacaggggctactccgcggggccatggccctggcgcccccaccgcgcggcctg
tggcaggccaaggacatctcccccatcgtcaaggcggcccgccgggatggcgagctgctc
ctctccgcccagctggattctgaggagagggacccgggggtgccccggcccagcccctat
gcgccctacatcctagtctatgccaacgatctggccatctcggagcccaacagcgtggca
gtgacgctgcagagatacgaccccttccctgccggagaccccgagccccgcgcagccccc
aacaactcagcggacccccgcgtgcgccgagccgcgcaggccactgggcccctccaggac
aacgagctgccggggctggatgagaggccgccgcgcgcccacgcacagcacttccacaag
caccagctgtggcccagccccttccgggcgctgaaaccccggccagggcgcaaagaccgc
aggaagaagggccaggaggtgttcatggccgcctcgcaggtgctggactttgacgagaag
acgatgcagaaagcccggaggaagcagtgggatgagccgagggtgtgctcccggaggtac
ctgaaggtggacttcgcagacatcggctggaatgaatggataatctcaccgaaatctttt
gatgcctactactgcgcgggagcatgtgagttccccatgcctaaggtggatgcctattct
gtggcctcagccggggaacaacagcagagcagtatggcctgggactgtgaggatggcatg
ggtgcgtggatcgttcgtccatccaaccatgccaccatccagagcattgtcagggctgtg
ggcatcatccctggcatcccagagccctgctgtgttcccgataagatgaactcccttggg
gtcctcttcctggatgagaatcggaatgtggttctgaaggtgtaccccaacatgtccgtg
gacacctgtgcctgccgacctgggctagaggacattccagcagctaagaacaagcagccg
tttagacatgtaggaccagtgcccaacagcaaagccagtgttgttgagaagcaacagcag
gggaagccactgcagagctggggacgagggtctgctgggggaaacgcccacagcccactg
ggggtgcctggaggtgggctgcctgagcacaccttcaacctgaagatgtttctggagaac
gtgaaggtggatttcctgcgcagccttaacctgagtggggtcccttcgcaggacaaaacc
agggtggagccgccgcagtacatgattgacctgtacaacaggtacacgtccgataagtcg
actacgccagcgtccaacattgtgcggagcttcagcatggaagatgccatctccataact
gccacagaggacttccccttccagaagcacatcttgctcttcaacatctccattcctagg
catgagcagatcaccagagctgagctccgactctatgtctcctgtcaaaatcacgtggac
ccctctcatgacctgaaaggaagcgtggtcatttatgatgttctggatggaacagatgcc
tgggatagtgctacagagaccaagaccttcctggtgtcccaggacattcaggatgagggc
tgggagaccttggaagtgtccagcgccgtgaagcgctgggtccggtccgactccaccaag
agcaaaaataagctggaagtgactgtggagagccacaggaagggctgcgacacgctggac
atcagtgtccccccaggttccagaaacctgcccttctttgttgtcttctccaatgaccac
agcagtgggaccaaggagaccaggctggagctgagggagatgatcagccatgaacaagag
agcgtgctcaagaagctgtccaaggacggctccacagaggcaggtgagagcagtcacgag
gaggacacggatggccacgtggctgcggggtcgactttagccaggcggaaaaggagcgcc
ggggctggcagccactgtcaaaagacctccctgcgggtaaacttcgaggacatcggctgg
gacagctggatcattgcacccaaggagtatgaagcctacgagtgtaagggcggctgcttc
ttccccttggctgacgatgtgacgccgacgaaacacgctatcgtgcagaccctggtgcat
ctcaagttccccacaaaggtgggcaaggcctgctgtgtgcccaccaaactgagccccatc
tccgtcctctacaaggatgacatgggggtgcccaccctcaagtaccattacgagggcatg
agcgtggcagagtgtgggtgcaggagacagaagcacttggagccctcacactttgctaga
gccacaagccagcaactgctggagccagaatttgagctccagtttgtgacaccaagtcca
gattctctctactgtatggactactgtcctggtgctttcctcagtgtcaaggttaaagtc
acactgaaaggaaaaatcaaaacgagggcataa
>gi568815588r:47267810_47468829|GENSCAN_predicted_peptide_3|1247_aa
MMREWVLLMSVLLCGLAGPTHLFQPSLVLDMAKVLLDNYCFPENLLGMQEAIQQAIKSHE
ILSISDPQTLASVLTAGVQSSLNDPRLVISYEPSTPEPPPQVPALTSLSEEELLAWLQRG
LRHEVLEGNVGYLRVDSVPGQEVLSMMGEFLVAHVWGNLMGTSALVLDLRHCTGGQVSGI
PYIISYLHPGNTILHVDTIYNRPSNTTTEIWTLPQVLGERYGADKDVVVLTSSQTRGVAE
DIAHILKQMRRAIVVGERTGGGALDLRKLRIGESDFFFTVPVSRSLGPLGGGSQTWEGSG
VLPCVGTPAEQALEKALAILTLRSALPGVVHCLQEVLKDYYTLVDRVPTLLQHLASMDFS
TVVSEEDLVTKLNAGLQAASEDPRLLVRAIGPTETPSWPAPDAAAEDSPGVAPELPEDEA
IRQALVDSVFQVSVLPGNVGYLRFDSFADASVLGVLAPYVLRQVWEPLQDTEHLIMDLRH
NPGGPSSAVPLLLSYFQGPEAGPVHLFTTYDRRTNITQEHFSHMELPGPRYSTQRGVYLL
TSHRTATAAEEFAFLMQSLGWATLVGEITAGNLLHTRTVPLLDTPEGSLALTVPVLTFID
NHGEAWLGGGVVPDAIVLAEEALDKAQEVLEFHQSLGALVEGTGHLLEAHYARPEVVGQT
SALLRAKLAQGAYRTAVDLESLASQLTADLQEVSGDHRLLVFHSPGELVVEEAPPPPPAV
PSPEELTYLIEALFKTEVLPGQLGYLRFDAMAELETVKAVGPQLVRLVWQQLVDTAALVI
DLRYNPGSYSTAIPLLCSYFFEAEPRQHLYSVFDRATSKVTEVWTLPQVAGQRYGSHKDL
YILMSHTSGSAAEAFAHTMQDLQRATVIGEPTAGGALSVGIYQVGSSPLYASMPTQMAMS
ATTGKAWDLAGVEPDITVPMSEALSIAQDIVALRAKVPTVLQTAGKLVADNYASAELGAK
MATKLSGLQSRYSRVTSEVALAEILGADLQMLSGDPHLKAAHIPENAKDRIPGIVPMQIP
SPEVFEELIKFSFHTNVLEDNIGYLRFDMFGDGELLTQVSRLLVEHIWKKIMHTDAMIID
MRFNIGGPTSSIPILCSYFFDEGPPVLLDKIYSRPDDSVSELWTHAQVVGERYGSKKSMV
ILTSSVTAGTAEEFTYIMKRLGRALVIGEVTSGGCQPPQTYHVDDTNLYLTIPTARSVGA
SDGSSWEGVGVTPHVVVPAEEALARAKEMLQHNQLRVKRSPGLQDHL
>gi568815588r:47267810_47468829|GENSCAN_predicted_CDS_3|3744_bp
atgatgagagaatgggttctgctcatgtccgtgctgctctgtggcctggctggccccaca
cacctgttccagccaagcctggtgctggacatggccaaggtcctcttggataactactgc
ttcccggagaacctgctgggcatgcaggaagccatccagcaggccatcaagagccatgag
attctgagcatctcagacccgcagacgctggccagtgtgctgacagccggggtgcagagc
tccctgaacgatcctcgcctggtcatctcctatgagcccagcacccccgagcctccccca
caagtcccagcactcaccagcctctcagaagaggaactgcttgcctggctgcaaaggggc
ctccgccatgaggttctggagggtaatgtgggctacctgcgggtggacagcgtcccgggc
caggaggtgctgagcatgatgggggagttcctggtggcccacgtgtgggggaatctcatg
ggcacctccgccttagtgctggatctccggcactgcacaggaggccaggtctctggcatt
ccctacatcatctcctacctgcacccagggaacaccatcctgcacgtggacactatctac
aaccgcccctccaacaccaccacggagatctggaccttgccccaggtcctgggagaaagg
tacggtgccgacaaggatgtggtggtcctcaccagcagccagaccaggggcgtggccgag
gacatcgcgcacatccttaagcagatgcgcagggccatcgtggtgggcgagcggactggg
ggaggggccctggacctccggaagctgaggataggcgagtctgacttcttcttcacggtg
cccgtgtccaggtccctggggccccttggtggaggcagccagacgtgggagggcagcggg
gtgctgccctgtgtggggactccggccgagcaggccctggagaaagccctggccatcctc
actctgcgcagcgcccttccaggggtagtccactgcctccaggaggtcctgaaggactac
tacacgctggtggaccgtgtgcccaccctgctgcagcacttggccagcatggacttctcc
acggtggtctccgaggaagatctggtcaccaagctcaatgccggcctgcaggctgcgtct
gaggatcccaggctcctggtgcgagccatcgggcccacagaaactccttcttggcccgcg
cccgacgctgcagccgaagactcaccaggggtggccccagagttgcctgaggacgaggct
atccggcaagcactggtggactctgtgttccaggtgtcggtgctgccaggcaatgtgggc
tacctgcgcttcgatagttttgctgacgcctccgtcctgggtgtgttggccccatatgtc
ctgcgccaggtgtgggagccgctacaggacacggagcacctcatcatggacctgcgccac
aaccctggagggccatcctctgctgtgcccctgctcctgtcctacttccagggccctgag
gccggccccgtgcacctcttcaccacctatgatcgccgcaccaacatcacgcaggagcac
ttcagccacatggagctcccgggcccacgctacagcacccaacgtggggtgtatctgctc
accagccaccgcaccgccacggccgcggaggagttcgccttccttatgcagtcgctgggc
tgggccacactggtaggtgagatcaccgcgggcaacctgctgcacacccgcacggtgccg
ctgctggacacacccgaaggcagcctcgcgctcaccgtgccggtcctcaccttcatcgac
aatcacggcgaggcctggctgggtggtggagtggtgcccgatgccatcgtgctggccgag
gaggccctggacaaagcccaggaagtgctggagttccaccaaagcctgggggccttggtg
gagggcacagggcacctgctggaggcccactatgctcggccagaggtcgtggggcagacc
agtgccctcctgcgggccaagctggcccagggcgcctaccgcacagctgtggacttggag
tctctggcctctcagctcacagcagacctccaggaggtgtctggggaccaccgcttgcta
gtgttccacagccctggcgagctggtggtagaggaagcacccccaccaccccctgctgtc
ccctctccagaggagctcacctaccttattgaggccctgttcaagacagaggtgctgccc
ggccagctgggctacctgcgttttgacgccatggctgaactggagacagtgaaggccgtg
gggccacagctggtgcggctggtatggcaacagctggtggacacggctgcgctggtgatc
gacctgcgctacaaccctggcagctactccacggccatcccgctgctctgctcctacttc
tttgaggcagagccccgccagcacctgtattctgtctttgacagggccacctcaaaagtc
acggaggtgtggaccttgccccaggtcgccggccagcgctacggctcacacaaggacctc
tacatcctgatgagccacaccagtggctctgcggccgaggcctttgcacacaccatgcag
gacctgcagcgggccacggtcattggggagcccacggccggaggcgcactctctgtgggc
atctaccaggtgggcagcagccccttatatgcatccatgcccacccagatggccatgagt
gccaccacaggcaaggcctgggacctggctggtgtggagcccgacatcactgtgcccatg
agcgaagccctttccatagcccaggacatagtggctctgcgtgccaaggtgcccacggtg
ctgcagacggccgggaagctggtggctgataactatgcctctgccgagctgggggccaag
atggccaccaaactgagcggtctgcagagccgctactccagggtgacctcagaagtggcc
ctagccgagatcctgggggctgacctgcagatgctctccggagacccacacctgaaggca
gcccatatccctgagaatgccaaggaccgcattcctggaattgtgcccatgcagatccct
tcccctgaagtatttgaagagctgatcaagttttccttccacactaacgtgcttgaggac
aacattggctacttgaggtttgacatgtttggggacggtgagctgctcacccaggtctcc
aggctgctggtggagcacatctggaagaagatcatgcacacggatgccatgatcatcgac
atgaggttcaacatcggtggccccacatcctccattcccatcttgtgctcctacttcttt
gatgaaggccctccagttctgctggacaagatctacagccggcctgatgactctgtcagt
gaactctggacacacgcccaggttgtaggtgaacgctatggctccaagaagagcatggtc
attctgaccagcagtgtgacggccggcaccgcggaggagttcacctatatcatgaagagg
ctgggccgggccctggtcattggggaggtgaccagtgggggctgccagccaccacagacc
taccacgtggatgacaccaacctctacctcactatccccacggcccgttctgtgggggcc
tcggatggcagctcctgggaaggggtgggggtgacaccccatgtggttgtccctgcagaa
gaggctctcgccagggccaaggagatgctccagcacaaccagctgagggtgaagcggagc
ccaggcctgcaggaccacctgtag
>gi568815588r:47267810_47468829|GENSCAN_predicted_peptide_4|323_aa
MAAGKGAPLSPSAENRWRLSEPELGRGCKPVLLEKTNRLGPEAAVGRAGRDVGSAELALL
VAPGKPRPGKPLPPKTRGEQRQSAFTELPRMKDRQVDAQAQEREHDDPTGQPGAPQLTQN
IPRGPAGSKVFSVWPSGARSEQRSAFSKPTKRPAERPELTSVFPAGESADALGELSGLLN
TTDLACWGRLSTPKLLVGDLWNLQALPQNAPLCSTFLGAPTLWLEHTQAQVPPPSSSSTT
SWALLPPTLTSLGLSTQNWCAKCNLSFRLTSDLVFHMRSHHKKEHAGPDPHSQKRREEAL
ACPVCQEHFRERHHLSRHMTSHS
>gi568815588r:47267810_47468829|GENSCAN_predicted_CDS_4|972_bp
atggcagctgggaagggagccccgttgagcccatcggctgaaaacagatggcgacttagc
gaacctgagctgggccggggctgcaagccagtgctgctcgagaagacgaaccgcctgggc
cctgaggctgctgtgggcagggcgggccgggatgtgggcagtgcggagctggcactgttg
gtagccccaggcaagccccgacctggcaagccgctgcccccgaagacacgtggagagcag
aggcagagcgccttcacggagctgccgaggatgaaggaccggcaggtggatgctcaggcc
caggagagggagcacgatgaccccacaggccaacctggtgccccacagctgacccagaac
atccccagaggcccagctggcagcaaagtcttctctgtgtggcccagcggagcacgaagt
gagcaaagaagcgcctttagcaaaccaaccaagcgaccagcagagaggcctgagctaacc
tcagtcttccctgcaggggaatctgcagatgccctgggggagctgtctggactcctcaac
actacagacctcgcttgttggggtcgactttcaactcccaagcttttggttggtgatttg
tggaacttgcaagcattgccacagaatgctccactctgtagcacttttctgggtgcccct
acactgtggctggagcatacccaggcccaggtgcccccaccctcatcatcctccaccact
tcgtgggccctcctgccacccaccctcacctccctgggcttgtccacccagaactggtgt
gcaaagtgcaacctgtcctttcgcctaacgtccgacctggtctttcacatgcgatcccac
cacaaaaaggagcatgcggggcctgacccacattctcagaagcggagagaagaggccctt
gcctgccctgtgtgccaggagcacttccgggagcgccaccacctctcccggcacatgact
tctcacagctag
>gi568815588r:47267810_47468829|GENSCAN_predicted_peptide_5|1066_aa
MDGGSVVGEWPVMDGCSVVNGWPVMDGWPVVDGCSVVNGGSVVNGGSVVNGWPVMDGGSV
ANGWPVLNGWPVMDGWPVMDGCSVVNGWPVMDGWPVMDGCSVVNDWPVMAGQPVVDGCSV
VNGWPVMDGCSVVNEWPVMDGWPVVDGCSVVNGGSVVNGWPVMNDWPMMDGGSVVNGWPM
MDGGSVANGWPVLNGWPVMDGWPMMDGCSVVNGWPVMAGWPVMDGCSVVNDWPMMAGQPV
VDGCSVVNGWPVMNGWPVVDGYSVVNGWPVMDGGTVMDGCSVVNGWPVMNGWPVVDGCSV
VNGWPVMNGWPVVDGYSVVNGWPVMDGGTVMDGCSVVNGWPVMNGWPVVDGCSVVNGWPV
MNGWPVVDGCSVVNGWPVMNGWPVMDGCSVVNDWPVMAGRPVVDGCSVVNDWPVMNGWPV
VDGCSVVNGWPMMNGWPVVDGCSVVNGWPVMDGGTVMDGCSVVNGWPVMNGWPVVDGCSV
VNGWPVMDGWPVTDGCSVVNGWPVVNGWPVMDGCSVVNGWPVMDGCSVVDGCSVVNGGSM
VKRLCPRALPRKSQKLPGLMNLASLNTSLAAQGFCEDHKIYRYKDSLPIGSRRRHTAGLS
EGAERDSSGAQAGRRVRKEADSIWEMTADDHQAGSGDRVVSGFRIISAAGAPVQRHTHPE
GSAWNPPDPLDLRGPRGEGGDPRGPGFWLLAHSLQSSSTSASWNRIKQAQGVSHSSAGPL
DSPGWWHPHPGTHTGGALHSDVIRKDISAIVQAGFNKDLLKAQKAKMVRASPERCSSCPC
DHKWLLGVNHSGSSVFTESIGLTLKGDLEQRYHSSGNHLLHPALLTDPRTLQIPRTLQIP
EHLGGHQQGRKSEGLVIREGFQDKVPFEQGCKGQVGSAFQAIIVLVTLLSAEEARSDLLS
KGIRSDLQKLRDPCSWLGPQHCSFQGMASPAGGQAPKGYEHQNCTRRIPHSSQNSMERGI
TMLIAPGRAPGARSRAKHRLHPYFMAGGKDKREMWAGTGDPTAQASSARSLGQPEIRGQA
EGSEQGSHHPVKKDLFCFHFHHDCKFPEASASMLNSLKNSAYEFFS
>gi568815588r:47267810_47468829|GENSCAN_predicted_CDS_5|3201_bp
atggatgggggctctgtggtgggcgagtggcctgtgatggatgggtgctctgtggtgaat
gggtggcctgtgatggatggctggcctgtggtggatgggtgctctgtggtgaatgggggc
tctgtggtgaatgggggctctgtggtgaatgggtggcctgtgatggatgggggctctgtg
gcaaatgggtggcctgtgttgaatggctggcctgtgatggatggctggcctgtgatggat
ggatgttctgtggtgaatgggtggcctgtgatggatggctggcctgtgatggatgggtgc
tctgtggtgaacgattggcctgtgatggctggccagcctgtggtggatgggtgctctgtg
gtgaatgggtggcctgtgatggatgggtgttctgtggtgaacgagtggcctgtgatggat
ggctggcctgtggtggatgggtgctctgtggtgaatgggggctctgtggtgaatgggtgg
cctgtgatgaatgattggcctatgatggatgggggctctgtggtgaatgggtggcctatg
atggatgggggctctgtggcaaatgggtggcctgtgttgaatggctggcctgtgatggat
ggatggcctatgatggatggatgttctgtggtgaatggctggcctgtgatggctggctgg
cctgtgatggatgggtgctctgtggtgaacgattggcctatgatggctggccagcctgtg
gtggatgggtgctctgtggtgaatggatggcctgtgatgaatggctggcctgtggtggat
gggtactcagtggtgaatggctggcctgtgatggatgggggcactgtaatggatgggtgc
tctgtggtgaatggctggcctgtgatgaatggctggcctgtggtggatgggtgctctgtg
gtgaatgggtggcctgtgatgaatggttggcctgtggtggatgggtactcagtggtgaat
ggctggcctgtgatggatgggggcactgtaatggatgggtgctctgtggtgaatggctgg
cctgtgatgaatggctggcctgtggtggatgggtgctctgtggtgaatgggtggcctgtg
atgaatggctggcctgtggtggacgggtgctctgtggtgaatgggtggcctgtgatgaat
ggctggcctgtgatggatgggtgctctgtggtgaacgactggcctgtgatggctggccgg
cctgtggtggatgggtgctctgtggtgaatgactggcctgtgatgaatggctggcctgtg
gtggatgggtgctctgtggtgaatgggtggcctatgatgaacggctggcctgtggtggat
gggtgctctgtggtgaatgggtggcctgtgatggatgggggcactgtaatggatgggtgc
tctgtggtgaatggctggcctgtgatgaacggctggcctgtggtggatgggtgctctgtg
gtgaatgggtggcctgtgatggatggttggccagtgacagatgggtgctctgtggtgaat
gggtggcctgtggtgaatggctggcctgtgatggatgggtgctctgtggtgaatgggtgg
cctgtgatggacgggtgttctgtggtggatggttgctctgtggtgaatgggggctctatg
gtgaaaaggctctgccctagagccctgccaagaaagtcccagaagcttcctggtctcatg
aacctagcctcactgaacacctcgcttgcagcacagggcttctgtgaggaccacaaaatc
tacaggtacaaagacagcctccccatcggcagcaggagaagacatactgcagggctcagt
gagggagcagagagggacagctccggggcccaagcaggtaggagggtcagaaaggaggct
gacagcatctgggaaatgacagcagatgaccaccaggcaggctcgggcgacagggttgtg
agtgggtttaggatcatatcagcagctggggcaccggttcagcggcacacgcatcctgaa
ggtagtgcgtggaatcctccagatcccctggacctgagaggcccacgtggagaaggtgga
gaccctagaggtcctggattctggctcctggcacacagcctccaatcctcatccaccagt
gccagctggaacaggatcaagcaggcccagggtgtatctcattccagtgcagggcctctg
gacagcccaggttggtggcatccccaccctggtactcacacaggtggtgctcttcatagt
gatgtcattagaaaggacatttctgccattgttcaggctggcttcaacaaagacctcctt
aaagcacagaaagcgaagatggtgagagccagcccggagagatgcagttcttgcccttgt
gaccacaaatggctgcttggtgtcaaccattctggctcttctgtatttacagagtccatt
ggcttaactctgaaaggcgacctggagcaacggtaccattcctcgggcaatcatctcctg
catcctgccctcctcacagaccctaggacactgcagatccctaggacactgcagatccct
gaacaccttgggggccaccaacaagggaggaagtcagaggggctggtcatcagggaaggc
ttccaggacaaggtgcccttcgagcagggctgcaaagggcaggtgggctcggcctttcaa
gcaataatcgtgctggtaaccttgttgtctgcagaagaagcaaggagtgacctgctcagc
aaaggcatcaggagtgacctccagaagctccgggacccctgcagttggctgggcccgcag
cactgctcattccaaggcatggccagcccagctggagggcaagcaccaaagggttatgag
caccagaactgcacacgcaggatcccacacagttctcagaactccatggaacgggggatt
actatgctcattgcacctgggagggcacctggagctcggagcagggctaaacatcgtctc
cacccctacttcatggctggtggtaaagataagcgggaaatgtgggctggtacaggggac
cccacagcacaggcctccagtgcacggtcactaggtcagcctgaaatcagaggccaggct
gaaggaagtgagcaaggaagccaccaccctgtgaagaaggacctgttttgcttccacttc
caccatgattgtaagtttcccgaggcctctgcatccatgctgaactcactcaagaattct
gcctatgaatttttctcctag
>gi568815588r:47267810_47468829|GENSCAN_predicted_peptide_6|340_aa
MGRNQHKKAENSKSQNASFPPKDDNSSPAREQNSTENEFDELTEVGFRRWVINPSKLKKH
VLTQCKEAKNLEKRLDELLTRITSVEKNINDLMELKNTARELHEAYTIINSQINGVEERI
SVIEDQLNEIKREDKIREKRIKRNEQSLQEIWDCVKRPNLHLIVEPESDKENGTKLENTL
QDIIQENFPNLARKANIQIQEIQRTPQRYSLRRATPRHIIVRFTEVEMKEKMLRAARQTD
WIIHKGKPIRLTADLSAETLQARREWGPTFNIFKEKNFQPRISYPAKLSFISEREIKSFT
NKQMLRDFVTTRPALQELLKEALNMERNNWYQRLQKHTKL
>gi568815588r:47267810_47468829|GENSCAN_predicted_CDS_6|1023_bp
atggggagaaaccagcacaaaaaggctgaaaattccaaaagccagaacgcctcttttcct
ccaaaggatgacaactcctcgccagcaagggaacaaaactcgacagagaatgagtttgat
gaattgacagaagtaggcttcagaaggtgggtaataaacccctccaagctaaagaagcat
gttctaacccaatgcaaggaagctaagaacctcgaaaaaaggttagatgaattgctaact
agaataaccagtgtagagaagaacataaatgacctgatggagctgaaaaacacagcacga
gaacttcatgaagcatacacaattattaatagccaaatcaatggagtggaagaaaggata
tcagtgattgaagatcaacttaatgaaataaagagagaagacaagattagagaaaaaaga
ataaaaaggaatgaacaaagcctccaagaaatatgggactgtgtgaaaagaccaaattta
catttgattgttgaacctgaaagtgacaaagagaatggaaccaagctggaaaacactctt
caggatattatccaggagaacttccccaatctagcaagaaaagccaacattcaaattcag
gaaatacagagaacaccacaaagatactccttgagaagagcaaccccaagacacataatt
gtcagattcaccgaggttgaaatgaaagaaaaaatgttaagggcagccagacagacagat
tggattatccacaaagggaagcccatcagactaacagcggatctctctgcagaaacccta
caagccagaagagagtgggggccaacattcaacatttttaaagaaaagaattttcaaccc
agaatttcatatccagccaaactaagcttcataagtgaacgagaaataaaatcctttaca
aacaagcaaatgctgagagattttgtcaccaccaggcctgccttacaagagctcctgaag
gaagcactaaacatggaaaggaacaactggtaccagcgactgcaaaaacataccaaattg
taa
>gi568815588r:47267810_47468829|GENSCAN_predicted_peptide_7|151_aa
MWVPSLFWGSDPLLMIPVEVLGSRDIQARMPSSSAHESKESWNKYHGAAPSTEEGPLPWA
SVGNQGPKSARTCQAPTKPNGKLSPGDTVLIQGNQDSHGTSFSTEEERRGKTPRATCFLS
SPRTRLSFITYCMEDSIVLQLTLTLGRGVLL
>gi568815588r:47267810_47468829|GENSCAN_predicted_CDS_7|456_bp
atgtgggtgccctcattgttctggggcagtgaccccctgctcatgatccctgtggaggtt
ctgggctccagggacatccaggccagaatgcccagcagttcagctcatgagtccaaagag
tcatggaacaaataccacggagcagcacctagcacagaggaaggccccctgccctgggca
tctgtgggaaaccagggccccaagtctgcgagaacctgccaggctccaacaaagccaaac
gggaaactctccccgggagacaccgtcctcatccagggaaaccaggattctcatggaacg
tcattttctactgaagaggagcgcagaggaaaaactcccagggccacgtgcttcctctcc
agcccgagaacacggctgtccttcatcacctactgcatggaggacagcatcgtcctgcag
ctcacactcacactaggcaggggagtgctgctttga
>gi568815588r:47267810_47468829|GENSCAN_predicted_peptide_8|82_aa
MESGSNCVGMGRLPQGGDPKHAKSNTLALPPESMRTTRMGVCPDLALPSWTAGDLLKKGQ
GDQTSPGKVEDEDPDRILGVVR
>gi568815588r:47267810_47468829|GENSCAN_predicted_CDS_8|249_bp
atggagagcggcagcaactgcgttgggatggggcggctgccccagggaggtgaccctaaa
catgccaagtccaacacactagctctaccccctgagagcatgagaacaacacggatgggc
gtgtgcccggatctggccttgccctcatggactgcaggagacctgctgaagaaaggccag
ggtgaccagacatcacctgggaaggtggaggatgaggaccctgatcggatattgggagtg
gtcagatag