GENSCAN 1.0 Date run: 13-Jun-118 Time: 19:13:50 Sequence gi568815588f:47200652_47412789 : 212138 bp : 47.70% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 10265 10341 77 0 2 86 98 19 0.307 1.93 1.02 Term + 12026 12289 264 1 0 54 37 172 0.346 4.21 1.03 PlyA + 16229 16234 6 1.05 2.03 PlyA - 19458 19453 6 1.05 2.02 Term - 24278 24133 146 0 2 99 44 95 0.904 4.27 2.01 Init - 33534 33465 70 0 1 78 93 51 0.286 4.50 2.00 Prom - 33920 33881 40 -1.66 3.00 Prom + 35516 35555 40 -5.46 3.01 Init + 36229 36272 44 0 2 62 48 59 0.237 -0.81 3.02 Intr + 38035 38128 94 1 1 84 71 45 0.331 2.37 3.03 Intr + 39244 39302 59 0 2 77 75 56 0.196 0.88 3.04 Intr + 40609 40726 118 1 1 49 74 54 0.276 0.57 3.05 Intr + 47867 48049 183 1 0 -39 69 179 0.154 3.28 3.06 Intr + 52834 53010 177 0 0 28 48 110 0.144 1.12 3.07 Intr + 55095 55316 222 2 0 42 76 110 0.216 3.52 3.08 Intr + 64650 64831 182 2 2 15 58 67 0.001 -4.93 3.09 Intr + 65983 66057 75 1 0 88 80 69 0.007 4.73 3.10 Intr + 78662 78856 195 2 0 79 50 112 0.099 5.03 3.11 Intr + 82522 82625 104 1 2 89 83 59 0.308 5.32 3.12 Term + 85499 85629 131 0 2 22 48 136 0.110 1.34 3.13 PlyA + 88044 88049 6 1.05 4.00 Prom + 92768 92807 40 -6.56 4.01 Init + 100001 100319 319 1 1 94 106 453 0.997 43.20 4.02 Intr + 109145 110070 926 0 2 108 97 1152 0.995 109.05 4.03 Intr + 111817 111900 84 0 0 92 55 59 0.616 3.02 4.04 Intr + 111950 112137 188 1 2 135 97 229 0.815 26.99 4.05 Intr + 119370 119472 103 0 1 33 75 110 0.583 4.38 4.06 Intr + 122081 122363 283 1 1 105 97 279 0.129 27.49 4.07 Intr + 124190 125129 940 0 1 82 67 1209 0.659 108.43 4.08 Term + 130018 130207 190 1 1 22 42 140 0.142 -0.28 4.09 PlyA + 132243 132248 6 1.05 5.00 Prom + 146668 146707 40 -4.66 5.01 Init + 147834 150887 3054 2 0 69 94 4672 0.503 455.96 5.02 Intr + 152674 152864 191 0 2 90 78 326 0.722 30.18 5.03 Intr + 154725 154867 143 0 2 108 84 165 0.949 18.10 5.04 Term + 156451 156806 356 2 2 90 50 440 0.818 35.06 5.05 PlyA + 157185 157190 6 1.05 6.02 PlyA - 157341 157336 6 1.05 6.01 Sngl - 168127 167156 972 1 0 93 43 656 0.991 56.35 6.00 Prom - 176518 176479 40 -3.96 7.08 PlyA - 182245 182240 6 1.05 7.07 Term - 183220 183186 35 2 2 125 46 22 0.623 -0.55 7.06 Intr - 184974 184892 83 2 2 125 74 44 0.469 5.98 7.05 Intr - 188465 188307 159 1 0 122 50 63 0.182 5.00 7.04 Intr - 205448 205311 138 1 0 102 99 28 0.687 4.88 7.03 Intr - 206166 206022 145 1 1 46 49 80 0.406 -0.76 7.02 Intr - 209636 209545 92 1 2 14 100 81 0.439 1.54 7.01 Init - 211935 211880 56 0 2 103 65 56 0.768 5.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 122018 122363 346 1 1 45 97 419 0.862 33.88 S.002 Init - 190029 190007 23 0 2 49 76 58 0.918 -0.13 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:47200652_47412789|GENSCAN_predicted_peptide_1|113_aa XVACFYKLKICDNPGSSKSISAILPTVWEKYIENLLKGFIIVDATKRIHHAWEEVKISTL MGIWKKLIPNLMDNFERFKISVGKVTADVVEIARDLQLEAEPEYVAELLIFGD >gi568815588f:47200652_47412789|GENSCAN_predicted_CDS_1|342_bp ngtgttgcatgtttttacaaattgaagatctgtgacaaccctgggtcaagcaagtctatc agtgccattcttccaacagtctgggaaaaatacattgaaaaccttctgaaaggattcatc atcgtagatgccactaagagaattcatcatgcatgggaggaggtcaaaatatcaacatta atgggaatttggaagaagttgatcccaaaccttatggataactttgaaagattcaagatt tcggtagggaaagtaactgcagatgtggtggaaatagcaagagacctacaattagaagca gagcctgaatatgtggctgaattgctgatttttggtgattaa >gi568815588f:47200652_47412789|GENSCAN_predicted_peptide_2|71_aa MKSFPVLALDSSADGGMQSCLQKELNDCGVWGEIKAECQLFFLLLELVHQLVEACGASVG SWGMDKWESVS >gi568815588f:47200652_47412789|GENSCAN_predicted_CDS_2|216_bp atgaagtccttccctgtgcttgccctggattcttccgcagatggagggatgcagagttgc ctgcagaaagaattgaatgactgtggtgtttggggagaaataaaggctgagtgccagctt ttcttcctcctcttggaactggtgcatcagcttgtggaagcatgtggagcttctgtgggg tcctggggcatggacaagtgggaatcagtcagctga >gi568815588f:47200652_47412789|GENSCAN_predicted_peptide_3|527_aa MTSDKKYNSSEATMIPGSNSTLRMEKRSHMEVKPFNWAFSKQQESQVPYRDEDETTERQT QESDFRHPLGGTSSSPTYGWQDVTFILAKLIEECSIRCAAIQTCQINKPSGNAYDAASLA AQSQGPPQNCCAKGQLPRPPEANVFMVLLVEPPAAPSPAATQRVRQPHTAGYPRGLEQGW RDEAGLDCVLRKTEAIVDMAVPLSPPSCSYSKHTRADSSPHRSMENSEGNNSNCLLVKCL ELRGEKVSDKCKMLLLAQARGWNGKSQQKLGTQRKPFVSSSEQMDTGKGAFLWQPGTAQT TAAHSPHQERNIEGKRGGEEEASNIAHSQTQTNLLVLDKEFCEQRTHAALLGHMEIGILR SPHKAPEKSGHSPGDRASPMVLHGSPQKLPPPNQLWTPLLLFSASPELPALVLSSWKFLR KHLHMDVEEKKISRSHGVLWESKIRTPKGLTPRNVHHYLHFSAEENAKVLYLQQEAGPRE ADPRVGEQVQKEKQKIPGSKLGIPEPLGNASLRSEWRCRIGKWMETF >gi568815588f:47200652_47412789|GENSCAN_predicted_CDS_3|1584_bp atgacatcagataagaagtacaactcttctgaagccaccatgattcctggctccaattcc acactaaggatggaaaaacggagtcacatggaagtgaagccatttaactgggcattcagc aaacagcaagagagccaggttccctacagggacgaagatgagacaacagagcgacaaaca caggaaagtgacttcaggcatccattaggagggacaagcagtagccctacttatgggtgg caagatgtcaccttcattctggcaaagctgattgaagaatgcagcattagatgtgcagct atacagacctgccagatcaacaagccctcgggcaatgcctatgatgctgcttcactggct gcacagtcacagggacctccccagaactgctgtgcaaagggtcaacttcccagaccccca gaggccaatgtcttcatggtgctcctggtggagccaccagctgctcccagcccggctgcc acacagagagtacggcagcctcacactgctggttatcccagaggcttggagcagggttgg agggatgaagcaggcctggactgcgtccttaggaagacagaagccatcgtcgatatggcc gttcccttgtcccctccctcctgctcctacagcaagcataccagggctgactcctctcca caccgcagcatggagaattcagagggcaataatagcaactgcttgcttgtaaagtgcttg gagctcagaggagaaaaggtgtcagacaagtgtaaaatgttactactggcacaggccaga ggatggaatggaaaatcccagcagaaattgggaactcagaggaagccatttgtcagctcc tctgaacaaatggatacaggaaaaggggccttcctctggcagccaggcacagcccagacc actgctgcccacagccctcaccaagagaggaacatagaggggaagcggggaggggaggag gaggctagtaatatagctcatagtcagactcagaccaatctgctggtcttggacaaggag ttttgtgagcagagaactcatgctgctctccttgggcacatggaaattggaatcctaagg agtcctcacaaagccccagagaaatctggccattctccaggggaccgagcatcccccatg gtgctgcatgggagtccccagaagctaccaccccccaaccagctttggacacccctgctc ttattttctgcatccccagaactacctgccctagtgctgagctcatggaagttcctcagg aagcatcttcacatggatgttgaagagaaaaaaatctccaggtcccacggggtgctgtgg gagtccaaaatcaggactccaaaaggcctcaccccgaggaacgttcatcattatctgcat ttttcagcagaggagaatgccaaagtcttgtacctgcagcaagaagccggccccagagag gctgaccccagagtaggggagcaggtgcagaaggagaagcagaagatccctggaagcaag cttggtatcccagagccactggggaatgccagcctgaggtctgagtggagatgccggatt gggaagtggatggagacgttctga >gi568815588f:47200652_47412789|GENSCAN_predicted_peptide_4|1010_aa MAHVPARTSPGPGPQLLLLLLPLFLLLLRDVAGSHRAPAWSALPAAADGLQGDRDLQRHP GDAAATLGPSAQDMVAVHMHRLYEKYSRQGARPGGGNTVRSFRARLEVVDQKAVYFFNLT SMQDSEMILTATFHFYSEPPRWPRALEVLCKPRAKNASGRPLPLGPPTRQHLLFRSLSQN TATQGLLRGAMALAPPPRGLWQAKDISPIVKAARRDGELLLSAQLDSEERDPGVPRPSPY APYILVYANDLAISEPNSVAVTLQRYDPFPAGDPEPRAAPNNSADPRVRRAAQATGPLQD NELPGLDERPPRAHAQHFHKHQLWPSPFRALKPRPGRKDRRKKGQEVFMAASQVLDFDEK TMQKARRKQWDEPRVCSRRYLKVDFADIGWNEWIISPKSFDAYYCAGACEFPMPKVDAYS VASAGEQQQSSMAWDCEDGMGAWIVRPSNHATIQSIVRAVGIIPGIPEPCCVPDKMNSLG VLFLDENRNVVLKVYPNMSVDTCACRPGLEDIPAAKNKQPFRHVGPVPNSKASVVEKQQQ GKPLQSWGRGSAGGNAHSPLGVPGGGLPEHTFNLKMFLENVKVDFLRSLNLSGVPSQDKT RVEPPQYMIDLYNRYTSDKSTTPASNIVRSFSMEDAISITATEDFPFQKHILLFNISIPR HEQITRAELRLYVSCQNHVDPSHDLKGSVVIYDVLDGTDAWDSATETKTFLVSQDIQDEG WETLEVSSAVKRWVRSDSTKSKNKLEVTVESHRKGCDTLDISVPPGSRNLPFFVVFSNDH SSGTKETRLELREMISHEQESVLKKLSKDGSTEAGESSHEEDTDGHVAAGSTLARRKRSA GAGSHCQKTSLRVNFEDIGWDSWIIAPKEYEAYECKGGCFFPLADDVTPTKHAIVQTLVH LKFPTKVGKACCVPTKLSPISVLYKDDMGVPTLKYHYEGMSVAECGCRRQKHLEPSHFAR ATSQQLLEPEFELQFVTPSPDSLYCMDYCPGAFLSVKVKVTLKGKIKTRA >gi568815588f:47200652_47412789|GENSCAN_predicted_CDS_4|3033_bp atggctcatgtccccgctcggaccagcccgggacccgggccccagctgctgctgctgctg ctgccgttgtttctgctgttgctccgggatgtggccggcagccacagggcccccgcctgg tccgcactgcccgcggccgccgacggcctgcagggggacagggatctccagcggcaccct ggggacgcggccgccacgttgggccccagcgcccaggacatggtcgctgtccacatgcac aggctctatgagaagtacagccggcagggcgcgcggccgggagggggcaacacggtccgc agcttcagggccaggctggaagtggtcgaccagaaggccgtgtatttcttcaacctgact tccatgcaagactcggaaatgatccttacggccactttccacttctactcagagccgcct cggtggcctcgagcgctcgaggtgctatgcaagccgcgggccaagaacgcttcaggccgc ccgctgcccctgggcccgcccacacgccagcacctgctcttccgcagcctctcgcagaac acggccacacaggggctactccgcggggccatggccctggcgcccccaccgcgcggcctg tggcaggccaaggacatctcccccatcgtcaaggcggcccgccgggatggcgagctgctc ctctccgcccagctggattctgaggagagggacccgggggtgccccggcccagcccctat gcgccctacatcctagtctatgccaacgatctggccatctcggagcccaacagcgtggca gtgacgctgcagagatacgaccccttccctgccggagaccccgagccccgcgcagccccc aacaactcagcggacccccgcgtgcgccgagccgcgcaggccactgggcccctccaggac aacgagctgccggggctggatgagaggccgccgcgcgcccacgcacagcacttccacaag caccagctgtggcccagccccttccgggcgctgaaaccccggccagggcgcaaagaccgc aggaagaagggccaggaggtgttcatggccgcctcgcaggtgctggactttgacgagaag acgatgcagaaagcccggaggaagcagtgggatgagccgagggtgtgctcccggaggtac ctgaaggtggacttcgcagacatcggctggaatgaatggataatctcaccgaaatctttt gatgcctactactgcgcgggagcatgtgagttccccatgcctaaggtggatgcctattct gtggcctcagccggggaacaacagcagagcagtatggcctgggactgtgaggatggcatg ggtgcgtggatcgttcgtccatccaaccatgccaccatccagagcattgtcagggctgtg ggcatcatccctggcatcccagagccctgctgtgttcccgataagatgaactcccttggg gtcctcttcctggatgagaatcggaatgtggttctgaaggtgtaccccaacatgtccgtg gacacctgtgcctgccgacctgggctagaggacattccagcagctaagaacaagcagccg tttagacatgtaggaccagtgcccaacagcaaagccagtgttgttgagaagcaacagcag gggaagccactgcagagctggggacgagggtctgctgggggaaacgcccacagcccactg ggggtgcctggaggtgggctgcctgagcacaccttcaacctgaagatgtttctggagaac gtgaaggtggatttcctgcgcagccttaacctgagtggggtcccttcgcaggacaaaacc agggtggagccgccgcagtacatgattgacctgtacaacaggtacacgtccgataagtcg actacgccagcgtccaacattgtgcggagcttcagcatggaagatgccatctccataact gccacagaggacttccccttccagaagcacatcttgctcttcaacatctccattcctagg catgagcagatcaccagagctgagctccgactctatgtctcctgtcaaaatcacgtggac ccctctcatgacctgaaaggaagcgtggtcatttatgatgttctggatggaacagatgcc tgggatagtgctacagagaccaagaccttcctggtgtcccaggacattcaggatgagggc tgggagaccttggaagtgtccagcgccgtgaagcgctgggtccggtccgactccaccaag agcaaaaataagctggaagtgactgtggagagccacaggaagggctgcgacacgctggac atcagtgtccccccaggttccagaaacctgcccttctttgttgtcttctccaatgaccac agcagtgggaccaaggagaccaggctggagctgagggagatgatcagccatgaacaagag agcgtgctcaagaagctgtccaaggacggctccacagaggcaggtgagagcagtcacgag gaggacacggatggccacgtggctgcggggtcgactttagccaggcggaaaaggagcgcc ggggctggcagccactgtcaaaagacctccctgcgggtaaacttcgaggacatcggctgg gacagctggatcattgcacccaaggagtatgaagcctacgagtgtaagggcggctgcttc ttccccttggctgacgatgtgacgccgacgaaacacgctatcgtgcagaccctggtgcat ctcaagttccccacaaaggtgggcaaggcctgctgtgtgcccaccaaactgagccccatc tccgtcctctacaaggatgacatgggggtgcccaccctcaagtaccattacgagggcatg agcgtggcagagtgtgggtgcaggagacagaagcacttggagccctcacactttgctaga gccacaagccagcaactgctggagccagaatttgagctccagtttgtgacaccaagtcca gattctctctactgtatggactactgtcctggtgctttcctcagtgtcaaggttaaagtc acactgaaaggaaaaatcaaaacgagggcataa >gi568815588f:47200652_47412789|GENSCAN_predicted_peptide_5|1247_aa MMREWVLLMSVLLCGLAGPTHLFQPSLVLDMAKVLLDNYCFPENLLGMQEAIQQAIKSHE ILSISDPQTLASVLTAGVQSSLNDPRLVISYEPSTPEPPPQVPALTSLSEEELLAWLQRG LRHEVLEGNVGYLRVDSVPGQEVLSMMGEFLVAHVWGNLMGTSALVLDLRHCTGGQVSGI PYIISYLHPGNTILHVDTIYNRPSNTTTEIWTLPQVLGERYGADKDVVVLTSSQTRGVAE DIAHILKQMRRAIVVGERTGGGALDLRKLRIGESDFFFTVPVSRSLGPLGGGSQTWEGSG VLPCVGTPAEQALEKALAILTLRSALPGVVHCLQEVLKDYYTLVDRVPTLLQHLASMDFS TVVSEEDLVTKLNAGLQAASEDPRLLVRAIGPTETPSWPAPDAAAEDSPGVAPELPEDEA IRQALVDSVFQVSVLPGNVGYLRFDSFADASVLGVLAPYVLRQVWEPLQDTEHLIMDLRH NPGGPSSAVPLLLSYFQGPEAGPVHLFTTYDRRTNITQEHFSHMELPGPRYSTQRGVYLL TSHRTATAAEEFAFLMQSLGWATLVGEITAGNLLHTRTVPLLDTPEGSLALTVPVLTFID NHGEAWLGGGVVPDAIVLAEEALDKAQEVLEFHQSLGALVEGTGHLLEAHYARPEVVGQT SALLRAKLAQGAYRTAVDLESLASQLTADLQEVSGDHRLLVFHSPGELVVEEAPPPPPAV PSPEELTYLIEALFKTEVLPGQLGYLRFDAMAELETVKAVGPQLVRLVWQQLVDTAALVI DLRYNPGSYSTAIPLLCSYFFEAEPRQHLYSVFDRATSKVTEVWTLPQVAGQRYGSHKDL YILMSHTSGSAAEAFAHTMQDLQRATVIGEPTAGGALSVGIYQVGSSPLYASMPTQMAMS ATTGKAWDLAGVEPDITVPMSEALSIAQDIVALRAKVPTVLQTAGKLVADNYASAELGAK MATKLSGLQSRYSRVTSEVALAEILGADLQMLSGDPHLKAAHIPENAKDRIPGIVPMQIP SPEVFEELIKFSFHTNVLEDNIGYLRFDMFGDGELLTQVSRLLVEHIWKKIMHTDAMIID MRFNIGGPTSSIPILCSYFFDEGPPVLLDKIYSRPDDSVSELWTHAQVVGERYGSKKSMV ILTSSVTAGTAEEFTYIMKRLGRALVIGEVTSGGCQPPQTYHVDDTNLYLTIPTARSVGA SDGSSWEGVGVTPHVVVPAEEALARAKEMLQHNQLRVKRSPGLQDHL >gi568815588f:47200652_47412789|GENSCAN_predicted_CDS_5|3744_bp atgatgagagaatgggttctgctcatgtccgtgctgctctgtggcctggctggccccaca cacctgttccagccaagcctggtgctggacatggccaaggtcctcttggataactactgc ttcccggagaacctgctgggcatgcaggaagccatccagcaggccatcaagagccatgag attctgagcatctcagacccgcagacgctggccagtgtgctgacagccggggtgcagagc tccctgaacgatcctcgcctggtcatctcctatgagcccagcacccccgagcctccccca caagtcccagcactcaccagcctctcagaagaggaactgcttgcctggctgcaaaggggc ctccgccatgaggttctggagggtaatgtgggctacctgcgggtggacagcgtcccgggc caggaggtgctgagcatgatgggggagttcctggtggcccacgtgtgggggaatctcatg ggcacctccgccttagtgctggatctccggcactgcacaggaggccaggtctctggcatt ccctacatcatctcctacctgcacccagggaacaccatcctgcacgtggacactatctac aaccgcccctccaacaccaccacggagatctggaccttgccccaggtcctgggagaaagg tacggtgccgacaaggatgtggtggtcctcaccagcagccagaccaggggcgtggccgag gacatcgcgcacatccttaagcagatgcgcagggccatcgtggtgggcgagcggactggg ggaggggccctggacctccggaagctgaggataggcgagtctgacttcttcttcacggtg cccgtgtccaggtccctggggccccttggtggaggcagccagacgtgggagggcagcggg gtgctgccctgtgtggggactccggccgagcaggccctggagaaagccctggccatcctc actctgcgcagcgcccttccaggggtagtccactgcctccaggaggtcctgaaggactac tacacgctggtggaccgtgtgcccaccctgctgcagcacttggccagcatggacttctcc acggtggtctccgaggaagatctggtcaccaagctcaatgccggcctgcaggctgcgtct gaggatcccaggctcctggtgcgagccatcgggcccacagaaactccttcttggcccgcg cccgacgctgcagccgaagactcaccaggggtggccccagagttgcctgaggacgaggct atccggcaagcactggtggactctgtgttccaggtgtcggtgctgccaggcaatgtgggc tacctgcgcttcgatagttttgctgacgcctccgtcctgggtgtgttggccccatatgtc ctgcgccaggtgtgggagccgctacaggacacggagcacctcatcatggacctgcgccac aaccctggagggccatcctctgctgtgcccctgctcctgtcctacttccagggccctgag gccggccccgtgcacctcttcaccacctatgatcgccgcaccaacatcacgcaggagcac ttcagccacatggagctcccgggcccacgctacagcacccaacgtggggtgtatctgctc accagccaccgcaccgccacggccgcggaggagttcgccttccttatgcagtcgctgggc tgggccacactggtaggtgagatcaccgcgggcaacctgctgcacacccgcacggtgccg ctgctggacacacccgaaggcagcctcgcgctcaccgtgccggtcctcaccttcatcgac aatcacggcgaggcctggctgggtggtggagtggtgcccgatgccatcgtgctggccgag gaggccctggacaaagcccaggaagtgctggagttccaccaaagcctgggggccttggtg gagggcacagggcacctgctggaggcccactatgctcggccagaggtcgtggggcagacc agtgccctcctgcgggccaagctggcccagggcgcctaccgcacagctgtggacttggag tctctggcctctcagctcacagcagacctccaggaggtgtctggggaccaccgcttgcta gtgttccacagccctggcgagctggtggtagaggaagcacccccaccaccccctgctgtc ccctctccagaggagctcacctaccttattgaggccctgttcaagacagaggtgctgccc ggccagctgggctacctgcgttttgacgccatggctgaactggagacagtgaaggccgtg gggccacagctggtgcggctggtatggcaacagctggtggacacggctgcgctggtgatc gacctgcgctacaaccctggcagctactccacggccatcccgctgctctgctcctacttc tttgaggcagagccccgccagcacctgtattctgtctttgacagggccacctcaaaagtc acggaggtgtggaccttgccccaggtcgccggccagcgctacggctcacacaaggacctc tacatcctgatgagccacaccagtggctctgcggccgaggcctttgcacacaccatgcag gacctgcagcgggccacggtcattggggagcccacggccggaggcgcactctctgtgggc atctaccaggtgggcagcagccccttatatgcatccatgcccacccagatggccatgagt gccaccacaggcaaggcctgggacctggctggtgtggagcccgacatcactgtgcccatg agcgaagccctttccatagcccaggacatagtggctctgcgtgccaaggtgcccacggtg ctgcagacggccgggaagctggtggctgataactatgcctctgccgagctgggggccaag atggccaccaaactgagcggtctgcagagccgctactccagggtgacctcagaagtggcc ctagccgagatcctgggggctgacctgcagatgctctccggagacccacacctgaaggca gcccatatccctgagaatgccaaggaccgcattcctggaattgtgcccatgcagatccct tcccctgaagtatttgaagagctgatcaagttttccttccacactaacgtgcttgaggac aacattggctacttgaggtttgacatgtttggggacggtgagctgctcacccaggtctcc aggctgctggtggagcacatctggaagaagatcatgcacacggatgccatgatcatcgac atgaggttcaacatcggtggccccacatcctccattcccatcttgtgctcctacttcttt gatgaaggccctccagttctgctggacaagatctacagccggcctgatgactctgtcagt gaactctggacacacgcccaggttgtaggtgaacgctatggctccaagaagagcatggtc attctgaccagcagtgtgacggccggcaccgcggaggagttcacctatatcatgaagagg ctgggccgggccctggtcattggggaggtgaccagtgggggctgccagccaccacagacc taccacgtggatgacaccaacctctacctcactatccccacggcccgttctgtgggggcc tcggatggcagctcctgggaaggggtgggggtgacaccccatgtggttgtccctgcagaa gaggctctcgccagggccaaggagatgctccagcacaaccagctgagggtgaagcggagc ccaggcctgcaggaccacctgtag >gi568815588f:47200652_47412789|GENSCAN_predicted_peptide_6|323_aa MAAGKGAPLSPSAENRWRLSEPELGRGCKPVLLEKTNRLGPEAAVGRAGRDVGSAELALL VAPGKPRPGKPLPPKTRGEQRQSAFTELPRMKDRQVDAQAQEREHDDPTGQPGAPQLTQN IPRGPAGSKVFSVWPSGARSEQRSAFSKPTKRPAERPELTSVFPAGESADALGELSGLLN TTDLACWGRLSTPKLLVGDLWNLQALPQNAPLCSTFLGAPTLWLEHTQAQVPPPSSSSTT SWALLPPTLTSLGLSTQNWCAKCNLSFRLTSDLVFHMRSHHKKEHAGPDPHSQKRREEAL ACPVCQEHFRERHHLSRHMTSHS >gi568815588f:47200652_47412789|GENSCAN_predicted_CDS_6|972_bp atggcagctgggaagggagccccgttgagcccatcggctgaaaacagatggcgacttagc gaacctgagctgggccggggctgcaagccagtgctgctcgagaagacgaaccgcctgggc cctgaggctgctgtgggcagggcgggccgggatgtgggcagtgcggagctggcactgttg gtagccccaggcaagccccgacctggcaagccgctgcccccgaagacacgtggagagcag aggcagagcgccttcacggagctgccgaggatgaaggaccggcaggtggatgctcaggcc caggagagggagcacgatgaccccacaggccaacctggtgccccacagctgacccagaac atccccagaggcccagctggcagcaaagtcttctctgtgtggcccagcggagcacgaagt gagcaaagaagcgcctttagcaaaccaaccaagcgaccagcagagaggcctgagctaacc tcagtcttccctgcaggggaatctgcagatgccctgggggagctgtctggactcctcaac actacagacctcgcttgttggggtcgactttcaactcccaagcttttggttggtgatttg tggaacttgcaagcattgccacagaatgctccactctgtagcacttttctgggtgcccct acactgtggctggagcatacccaggcccaggtgcccccaccctcatcatcctccaccact tcgtgggccctcctgccacccaccctcacctccctgggcttgtccacccagaactggtgt gcaaagtgcaacctgtcctttcgcctaacgtccgacctggtctttcacatgcgatcccac cacaaaaaggagcatgcggggcctgacccacattctcagaagcggagagaagaggccctt gcctgccctgtgtgccaggagcacttccgggagcgccaccacctctcccggcacatgact tctcacagctag >gi568815588f:47200652_47412789|GENSCAN_predicted_peptide_7|235_aa MPGHFCKGTNQEQGGHLQRKSEGLVIREGFQDKVPFEQGCKGQVGSAFQAIIVLVTLLSA EEARSDLLSKGIRSDLQKLRDPCSWLGPQHCSFQGMASPAGGQAPKGYEHQNCTRRIPHS SQNSMERGITMLIAPGRAPGARSRAKHRLHPYFMAGGKDKREMWAGTGDPTAQASSARSL GQPEIRGQAEGSEQGSHHPVKKDLFCFHFHHDCKFPEASASMLNSLKNSAYEFFS >gi568815588f:47200652_47412789|GENSCAN_predicted_CDS_7|708_bp atgcccggccacttctgcaaaggcaccaaccaggagcagggtggccatctgcagaggaag tcagaggggctggtcatcagggaaggcttccaggacaaggtgcccttcgagcagggctgc aaagggcaggtgggctcggcctttcaagcaataatcgtgctggtaaccttgttgtctgca gaagaagcaaggagtgacctgctcagcaaaggcatcaggagtgacctccagaagctccgg gacccctgcagttggctgggcccgcagcactgctcattccaaggcatggccagcccagct ggagggcaagcaccaaagggttatgagcaccagaactgcacacgcaggatcccacacagt tctcagaactccatggaacgggggattactatgctcattgcacctgggagggcacctgga gctcggagcagggctaaacatcgtctccacccctacttcatggctggtggtaaagataag cgggaaatgtgggctggtacaggggaccccacagcacaggcctccagtgcacggtcacta ggtcagcctgaaatcagaggccaggctgaaggaagtgagcaaggaagccaccaccctgtg aagaaggacctgttttgcttccacttccaccatgattgtaagtttcccgaggcctctgca tccatgctgaactcactcaagaattctgcctatgaatttttctcctag