GENSCAN 1.0 Date run: 5-Nov-116 Time: 00:48:55

Sequence gi568815594r:13441996_13644414 : 202419 bp : 39.87% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4202   4511  310  1  1   79   84    98 0.078   6.02
 1.02 Term +  11763  11845   83  1  2  120   49    43 0.087   0.48
 1.03 PlyA +  11907  11912    6                               1.05

 2.07 PlyA -  12054  12049    6                               1.05
 2.06 Term -  13020  12916  105  0  0   69   54    59 0.353  -1.97
 2.05 Intr -  13850  13804   47  0  2   77  121    39 0.678   3.31
 2.04 Intr -  18833  18704  130  2  1   64  106    46 0.568   3.35
 2.03 Intr -  32411  32323   89  0  2   90   80    60 0.256   4.17
 2.02 Intr -  37531  37435   97  1  1   84  110   -23 0.219  -1.84
 2.01 Init -  42155  42081   75  2  0   92   86   135 0.974  14.84
 2.00 Prom -  54416  54377   40                              -4.25

 3.00 Prom +  59777  59816   40                              -3.25
 3.01 Init +  72404  72485   82  1  1   81  100    76 0.623   9.28
 3.02 Term +  74214  74221    8  1  2  137   48     0 0.414  -2.05
 3.03 PlyA +  74484  74489    6                               1.05

 4.00 Prom +  79495  79534   40                              -4.55
 4.01 Init +  82027  82135  109  0  1   31   89   207 0.546  13.53
 4.02 Intr +  82192  82309  118  2  1  118   70    63 0.541   6.10
 4.03 Intr +  82890  83084  195  0  0   68    9   226 0.480  10.31
 4.04 Intr +  85601  85799  199  2  1   43   35   159 0.163   4.53
 4.05 Intr +  86059  86232  174  0  0   75   68    73 0.695   3.21
 4.06 Intr +  87101  87398  298  1  1   77   46   170 0.278   7.22
 4.07 Intr +  87884  88023  140  0  2   63   28    96 0.140   0.36
 4.08 Intr +  89201  89365  165  1  0  -19   23   188 0.081   0.84
 4.09 Intr +  92654  92983  330  1  0   23   58   273 0.317  12.70
 4.10 Intr +  93197  93294   98  1  2   94   73    55 0.225   2.49
 4.11 Intr +  95308  95501  194  1  2   31   80   219 0.114  13.61
 4.12 Term +  95942  96156  215  0  2   60   44   197 0.147   8.91
 4.13 PlyA +  98156  98161    6                               1.05

 5.03 PlyA -  99711  99706    6                              -1.75
 5.02 Term - 100533  99998  536  1  2   68   46   579 0.648  45.12
 5.01 Init - 102419 101738  682  2  1   69   60   418 0.943  32.42
 5.00 Prom - 103892 103853   40                              -9.45

 6.00 Prom + 104676 104715   40                              -9.35
 6.01 Init + 104880 105349  470  2  2   53   42   239 0.221  10.92
 6.02 Intr + 106681 106737   57  1  0  107   84    49 0.118   3.48
 6.03 Intr + 107310 107427  118  0  1   56   68    57 0.054   0.05
 6.04 Term + 120777 120950  174  2  0   68   55   103 0.078   1.78
 6.05 PlyA + 121521 121526    6                               1.05

 7.19 PlyA - 122287 122282    6                               1.05
 7.18 Term - 128133 128016  118  2  1   72   36   150 0.880   5.23
 7.17 Intr - 130847 130707  141  1  0   66  115    72 0.658   6.25
 7.16 Intr - 134996 134843  154  0  1   51   80   282 0.812  21.91
 7.15 Intr - 135492 135408   85  0  1  104   84    85 0.998   8.17
 7.14 Intr - 140315 140242   74  0  2   74   78   101 0.457   5.91
 7.13 Intr - 140741 140657   85  2  1   53  113    47 0.312   2.17
 7.12 Intr - 146797 146727   71  2  2   51   42   119 0.579   1.58
 7.11 Intr - 153949 153865   85  1  1   79   71    86 0.326   4.47
 7.10 Intr - 163089 156964 6126  0  0   88   29  6196 0.386 600.14
 7.09 Intr - 165194 165122   73  1  1  128   69   108 0.999  11.39
 7.08 Intr - 166673 166535  139  0  1   96   83   153 0.999  14.20
 7.07 Intr - 167411 167300  112  2  1   81   75    95 0.989   6.53
 7.06 Intr - 169105 168939  167  2  2   99   87   141 0.999  13.86
 7.05 Intr - 171666 171517  150  1  0   45   58   212 0.665  13.11
 7.04 Intr - 172815 172201  615  1  0   97   71   592 0.999  49.79
 7.03 Intr - 173507 173317  191  1  2   74   44   215 0.997  14.01
 7.02 Intr - 178072 177948  125  1  2   88  108    70 0.989   7.46
 7.01 Init - 181430 181395   36  2  0   64   99    57 0.816   4.50
 7.00 Prom - 183118 183079   40                              -5.85

 8.03 PlyA - 183670 183665    6                               1.05
 8.02 Term - 185670 185185  486  0  0   88   36   248 0.648  13.31
 8.01 Init - 190123 189938  186  1  0   83   77   130 0.888  10.54
 8.00 Prom - 192843 192804   40                              -4.95

 9.00 Prom + 193421 193460   40                              -7.85
 9.01 Init + 194021 194242  222  1  0   95   85    75 0.484   6.50
 9.02 Term + 200086 200202  117  0  0   95   42   104 0.241   4.06
 9.03 PlyA + 201266 201271    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------
S.001 Sngl +  67423  67704  282  0  0   48   42   199 0.905   6.54

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594r:13441996_13644414|GENSCAN_predicted_peptide_1|130_aa
MAVAPPPTKLGPPSPTPDCCAGSGNFKLVWFLVCGALWEWDPLSKIAWLPGFSPLSRGVD
SFPVSLEFQVPQEYEKIPAAQCLSKQPPSFVLKTRGPGGVGSQEFSLTFDNLAIMCLEED
LFGLNLLQVL

>gi568815594r:13441996_13644414|GENSCAN_predicted_CDS_1|393_bp
atggcggtagcccctcccccaaccaaacttggtcctcccagtccgactccagactgctgt
gctggcagtgggaatttcaagctagtgtggttcttagtttgcggggctctgtgggagtgg
gatccactgagcaagatcgcttggctccctggcttcagccccctttccaggggagtggac
agttttcctgtctcactggagttccaggtgccacaggaatatgaaaaaattcctgctgct
cagtgcctgtccaaacagccacccagttttgtacttaaaacccggggccctggtggtgta
ggctcacaagaattctctttgacttttgacaatttggctataatgtgccttgaagaggac
ctctttgggttaaatctacttcaggttctttga

>gi568815594r:13441996_13644414|GENSCAN_predicted_peptide_2|180_aa
MSDSEEESQDRQLKIVVLGDGASGKTSLTTCFAQETFGKQYKQTIGLDFFLRRITLPGNL
NVTLQIWDIGGQTIGGKMLDKYIYGAQGVLLVYDITNYQSFENLEDWYTVVKKVSEESET
QPLVALVGNKNQIGPETGETSPCGEKASHQYSGSEKQPCKPPLAEMPLGQLSSCVHLSQA

>gi568815594r:13441996_13644414|GENSCAN_predicted_CDS_2|543_bp
atgtcggactctgaggaggagagccaggaccggcaactgaaaatcgtcgtgctgggggac
ggcgcctccgggaagacctccttaactacgtgttttgctcaagaaacttttgggaaacag
tacaaacaaactataggactggatttctttttgagaaggataacattgccaggaaacttg
aatgttacccttcaaatttgggatataggagggcagacaataggaggcaaaatgttggat
aaatatatctatggagcacagggagtcctcttggtatatgatattacaaattatcaaagc
tttgagaatttagaagattggtatactgtggtgaagaaagtgagcgaggagtcagaaact
cagccactggttgccttggtaggcaataaaaatcagattggtccagagacaggagagact
tccccttgtggggaaaaggctagccatcaatattctgggtctgagaaacaaccctgcaag
ccacctctggcagaaatgcccctaggccagctgagcagctgtgtgcacctgtcccaggcc
tga

>gi568815594r:13441996_13644414|GENSCAN_predicted_peptide_3|29_aa
MVRQTIGNPYKMRGSPWQPIHPEVIEEEL

>gi568815594r:13441996_13644414|GENSCAN_predicted_CDS_3|90_bp
atggtccgccaaactattggtaacccttataagatgaggggttctccttggcagccaatt
caccctgaagtgatagaggaagaattgtaa

>gi568815594r:13441996_13644414|GENSCAN_predicted_peptide_4|744_aa MGSRRRRRAALHSWLALSASGTLDIPRLGNGKCEVSGREPSASRSRVSSAISSVSMRMSQ IQCSAWDRRTRVLKASLADNLSLGPKGGDADEKNRRTRRHAEVLLRKPNLGSPKQAGKAE STVTSFALPSRWVAATRRGDYHWVRTEMEALAATACDPQQTSEAGRGVLRGGTRAGAVAQ APGGRCSTEPGAPLQAGARVARMGRCRGSAGTTDSLWGFPASLRRAQGFGPASLPDWGVP AVFPFQPLSSGASASSTCVGGVSAKSPSIKQVTEDATVSEAPSKLWSLLLGVGGPQVGGS GSKAKVGSVPWAPPTPPFWARLGGGSPGRQHSFAAAGKEDRRAQSSRRHIWSVDLGVKNR RIRNGIIPHTSWVLPRIHRKVWSVQARGNRRICSTFFPFLELQSLPYPGEVDAKIGPHRL GGLLPREKEKQLSGPFGNFSASLASLNVSDPDLRSGFRGRQGPRAESIKSIDALPKNTSQ KEVDRHKPQACRSLEASGTIQGHLYGHPSSLATSEPTQPGNLQWPQSSVRRTMRLDRWGT LRFRVGIVGIAQPLLPGQVLCEESGGTPARLCLWVLRVFHLRTGDSPVLQRLVRPQRRIS LRPSPYRAQSLCCCFLRGYGRAFTHRGGFRSSCEPRAVATSGAATCLQRFPTYKAPPSIW HLLETPRRSVRLPRCSVCHCRGLLQSASLRGNQEPAFSTLPTPQKPPDVTLPRAQFLRVL GTDMPEPQLRGQALVFEPDYNLLE >gi568815594r:13441996_13644414|GENSCAN_predicted_CDS_4|2235_bp atgggatcgcgccgccgccgccgagcagccctacactcctggctcgcgctctcggcctcc ggcaccttggacatcccgcgtttagggaatggaaagtgtgaagtcagtggccgagaaccc tcagcgtcgaggagccgggtgtcgagtgcaatcagctctgtctctatgcggatgtcacag atccagtgctctgcttgggacagaaggacccgggtgttgaaggcaagtcttgctgacaac ctttccttaggaccaaagggaggagatgcagacgagaagaaccgaaggactaggcggcat gccgaggtgctcttgcggaagccaaacttgggctctccgaaacaggccgggaaagctgaa agcacagtgacctccttcgctctcccaagccgctgggtcgccgctacgcgccgaggcgat tatcactgggtcagaaccgaaatggaggctttggcagcgaccgcgtgtgatccacagcag acgtcggaagcaggaaggggcgtcctgcgcggtgggacccgcgcgggggcagtggcgcag gcgccaggaggcaggtgcagcaccgagcccggtgcgccgctgcaggcgggtgcacgggtg gcgagaatggggagatgtcgagggtcagccgggactacagatagcctctggggattcccc gccagcctccggagagctcagggctttggtcctgcctccttgcctgactggggcgtacct gcagtgttccccttccaaccgctcagcagtggcgcaagtgcttcctccacttgcgtgggt ggcgtctcagcgaagtctccatctataaaacaagtcacggaagatgcaacagtctctgag gccccgtccaagctctggtctttgctccttggtgtagggggccctcaggttggaggaagt ggttcaaaggctaaagttgggtccgtcccgtgggcccctcccactccccccttttgggcg cggctcgggggtggaagccctgggcgacagcacagcttcgctgcggcggggaaggaggac cgtcgggcccagagttcccgccgccacatatggtctgtggatttaggtgtcaaaaaccgg 
agaattagaaatggtataatcccacacacttcatgggttctccccaggattcaccgaaag gtatggtcagtgcaagcacgtggaaaccgcagaatctgttcaaccttctttcctttcctg gagctccagtctctgccttaccctggtgaagtggatgcaaagattggcccgcacaggctt ggaggtttgctgcccagggagaaggagaagcagctctcaggccctttcggcaacttctcg gcttcccttgcatctctgaacgttagtgatccggacctcagatccggtttccgagggcgc cagggtcctagggctgagagcatcaaatccattgacgcccttcccaaaaacacaagccaa aaggaagtagaccggcacaaaccccaggcctgccgatccctcgaagcttcggggacaatt cagggccatttatatggtcacccaagctccttagcgacctctgaacccacgcagcctggt aatctccaatggccacagtccagcgtgcggcgcacaatgaggttggaccgctgggggacg ctgaggttccgagttggcattgtgggcattgcccagccactcctgcccggccaggtcctc tgtgaggaaagcggtggcaccccagcgcgcttgtgcttgtgggtgttgcgcgtttttcat ctaaggacaggggactctccggttctccagagactggtgcggccacagaggaggattagc ctcaggccttctccctaccgagcgcaaagcctctgctgctgcttccttcgaggctacggt cgggccttcacccatcgcggtggtttccgcagcagctgtgagcccagagcggttgcaacc agcggcgcagctacctgccttcagcgttttccgacttacaaagcaccaccaagcatttgg catctcttggagaccccacgcaggtctgtgcggttgccaagatgctcagtgtgccattgc cggggtctgctccagagcgcttccctgagaggaaaccaggagccggcattcagcacactc cccactcctcaaaagcccccagatgtcacccttccacgggcccagtttctcagggttctg ggcactgacatgcctgagccccaactcagagggcaggcacttgtgtttgagcccgattac aatcttctcgaatga >gi568815594r:13441996_13644414|GENSCAN_predicted_peptide_5|405_aa MAVRGANTLTSFSIQAILNKKEERGGLAAPEGRPAPGGTAASVAAAPAVCCWRLFGERDA GALGGAEDSLLASPAGTRTAAGRTAESPEGWDSDSALSEENESRRRCADARGASGAGLAG GSLSLGQPVCELAASKDLEEEAAGRSDSEMSASVSGLLRLRGDGGRAEGMDAEGNHAEWG GRLEHPAPWGSEAGQIPKFCALARAALSLRELSPRETGVAPQSPTSWGDRSPRTEDDGVG PRGAHVSALCSGAGGGGGSGPAGVAEEEEEPAAPKPRKKRSRAAFSHAQVFELERRFNHQ RYLSGPERADLAASLKLTETQVKIWFQNRRYKTKRRQMAADLLASAPAAKKVAVKVLVRD DQRQYLPGEVLRPPSLLPLQPSYYYPYYCLPGWALSTCAAAAGTQ >gi568815594r:13441996_13644414|GENSCAN_predicted_CDS_5|1218_bp atggctgtgcgcggcgccaacaccttgacgtccttctccatccaggcgatcctcaacaag aaagaggagcgcggcgggctggccgcgccagaggggcgcccggcgcccgggggcacagcg gcatcggtggccgcggctcccgctgtctgctgttggcggctctttggggagagggacgcg ggcgcgttggggggcgccgaggactctctgctggcgtctcctgccggtaccagaacagct 
gcggggcggactgcggagagcccggaaggctgggactcggactccgcgctcagcgaggag aacgagagcaggcggcgctgcgcggacgcgcggggggccagcggggccggccttgcgggg ggatccttgagcctcggccagccggtctgtgagctggccgcttccaaagacctagaggag gaagccgcgggccggagcgacagcgagatgtccgccagcgtctcaggtctgctgcggctt cgcggggatgggggccgggctgaggggatggacgcggagggcaaccacgccgagtggggt ggtcggcttgagcaccctgcgccctggggctcggaggctggccaaatcccgaagttctgc gctctggcgcgagctgctctatcacttcgggaactgagcccgcgagagaccggagtagct cctcagagccctacgtcctggggcgaccgcagcccaaggaccgaggacgacggtgttggc cccagaggtgcacacgtgtccgcgctgtgcagcggggccggcggcgggggcggcagcggg ccggcaggcgtcgcggaggaggaggaggagccggcggcgcccaagccacgcaagaagcgc tcgcgggccgctttctcccacgcgcaggtcttcgagctggagcgccgctttaaccaccag cgctacctgtccgggcccgagcgcgcagacctggccgcgtcgctgaagctcaccgagacg caggtgaaaatctggttccagaaccgtcgctacaagacaaagcgccggcagatggcagcc gacctgctggcctcggcgcccgccgccaagaaggtggccgtaaaggtgctggtgcgcgac gaccagagacaatacctgcccggcgaagtgctgcggccaccctcgcttctgccactgcag ccctcctactattacccgtactactgcctcccaggctgggcgctctccacctgcgcagct gccgcaggcacccagtga >gi568815594r:13441996_13644414|GENSCAN_predicted_peptide_6|272_aa MKRCGLATLSCRVFLCPRSVRKGLRSASSGGAQQGSLLVSSRAITALSRSLGLWVTVGRA GSWVLGAEGELAGLELQLEAAPSKGSGGDEKTVGKGRWLYAQRYPGVTTESRPRVAVCGE GAAGLRLSEDGRSGRDRAGVPGSRFLPIEGDFLRAQLPYPFLIAYSLEKELRSDLRAKGK GLEQDAGVLGSVSERRPQQAPGPPPTLSCIPQLLATFAQNATGASVKYRSTQARGRHRIQ VLRDPMGREVKGNPRTMIRGASRMRWQPVQTA >gi568815594r:13441996_13644414|GENSCAN_predicted_CDS_6|819_bp atgaaaagatgtggtttggcgactttgagttgtcgggtatttttgtgtcccaggagtgtg agaaaggggctacgttccgcgtcttccggaggcgcacagcaagggagtctcctggtgtca tctcgggccattacggcgcttagtcgatcattaggtctgtgggtgactgtaggcagagcc gggagttgggtgcttggcgcggagggtgaactcgcgggtcttgagctgcaactagaagct gctcctagcaaggggtcgggtggagatgaaaagactgtggggaaggggaggtggttgtat gcccagagatatcccggcgtaactacggagtccaggcccagagtggcggtgtgtggcgaa ggcgcagcaggcctgcgcctgtcggaggacggcagatctggacgcgaccgcgcgggcgtg cctgggtcccgcttcctgcccatcgagggcgacttccttcgggctcagctaccttaccca tttttaattgcttattcattggaaaaggagcttcgttcggatttaagggccaaggggaag 
ggactggagcaggacgcgggcgtgttgggatcggtttcagaacggaggccacagcaagcc cctggcccacctcctaccctatcctgcattccacagctgcttgctacctttgctcagaat gctacaggtgcttcagtaaaatacaggagtacacaagcaagaggaagacataggatacag gtattaagagatccaatgggaagagaagtgaagggtaaccctaggacaatgattagagga gcctccaggatgagatggcagccagtccagactgcatga >gi568815594r:13441996_13644414|GENSCAN_predicted_peptide_7|2848_aa MIKRPFRELASQPAYQNLRQRVDNFVANHLATHTWSPHLNKNQLRNNIRQQVLKSGMLES GIDRIISQVVDPKINHTFRPQVEKAVHEFLATLNHKEEGSGNTAPDDEKPDTSLITQGVP TPGPSANVANDAMSILETITSLNQEASAARASTETSNAKTSERASKKLPSQPTTDTSTDK ERTSEDMADKEKSTADSGGEGLETAPKSEEFSDLPCPVEEIKNYTKEHNNLILLNKDVQQ ESSEQKNKSTDKGEKKPDSNEKGERKKEKKEKTEKKFDHSKKSEDTQKVKDEKQAKEKEV ESLKLPSEKNSNKAKTVEGTKEDFSLIDSDVDGLTDITVSSVHTSDLSSFEEDTEEEVVT SDSMEEGEITSDDEEKNKQNKTKTQTSDSSEGKTKSVRHAYVHKPYLYSKYYSDSDDELT VEQRRQSIAKEKEERLLRRQINREKLEEKRKQKAEKTKSSKTKGQGRSSVDLEESSTKSL EPKAARIKEVLKERKVLEKKVALSKKRKKDSRNVEENSKKKQQYEEDSKETLKTSEHCEK EKISSSKELKHVHAKSEPSKPARRLSESLHVVDENKNESKLEREHKRRTSTPVIMEGVQE ETDTRDVKRQVERSEICTEEPQKQKSTLKNEKHLKKDDSETPHLKSLLKKEVKSSKEKPE REKTPSEDKLSVKHKYKGDCMHKTGDETELHSSEKGLKVEENIQKQSQQTKLSSDDKTER KSKHRNERKLSVLGKDGKPVSEYIIKTDENVRKENNKKERRLSAEKTKAEHKSRRSSDSK IQKDSLGSKQHGITLQRRSESYSEDKCDMDSTNMDSNLKPEEVVHKEKRRTKSLLEEKLV LKSKSKTQGKQVKVVETELQEGATKQATTPKPDKEKNTEENDSEKQRKSKVEDKPFEETG VEPVLETASSSAHSTQKDSSHRAKLPLAKEKYKSDKDSTSTRLERKLSDGHKSRSLKHSS KDIKKKDENKSDDKDGKEVDSSHEKARGNSSLMEKKLSRRLCENRRGSLSQEMAKGEEKL AANTLSTPSGSSLQRPKKSGDMTLIPEQEPMEIDSEPGVENVFEVSKTQDNRNNNSQQDI DSENMKQKTSATVQKDELRTCTADSKATAPAYKPGRGTGVNSNSEKHADHRSTLTKKMHI QSAVSKMNPGEKEPIHRGTTEVNIDSETVHRMLLSAPSENDRVQKNLKNTAAEEHVAQGD ATLEHSTNLDSSPSLSSVTVVPLRESYDPDVIPLFDKRTVLEGSTASTSPADHSALPNQS LTVRESEVLKTSDSKEGGEGFTVDTPAKASITSKRHIPEAHQATLLDGKQGKVIMPLGSK LTGVIVENENITKEGGLVDMAKKENDLNAEPNLKQTIKATVENGKKDGIAVDHVVGLNTE KYAETVKLKHKRSPGKVKDISIDVERRNENSEVDTSAGSGSAPSVLHQRNGQTEDVATGP RRAEKTSVATSTEGKDKDVTLSPVKAGPATTTSSETRQSEVALPCTSIEADEGLIIGTHS RNNPLHVGAEASECTVFAAAEEGGAVVTEGFAESETFLTSTKEGESGECAVAESEDRAAD LLAVHAVKIEANVNSVVTEEKDDAVTSAGSEEKCDGSLSRDSEIVEGTITFISEVESDGA 
VTSAGTEIRAGSISSEEVDGSQGNMMRMGPKKETEGTVTCTGAEGRSDNFVICSVTGAGP REERMVTGAGVVLGDNDAPPGTSASQEGDGSVNDGTEGESAVTSTGITEDGEGPASCTGS EDSSEGFAISSESEENGESAMDSTVAKEGTNVPLVAAGPCDDEGIVTSTGAKEEDEEGED VVTSTGRGNEIGHASTCTGLGEESEGVLICESAEGDSQIGTVVEHVEAEAGAAIMNANEN NVDSMSGTEKGSKDTDICSSAKGIVESSVTSAVSGKDEVTPVPGGCEGPMTSAASDQSDS QLEKVEDTTISTGLVGGSYDVLVSGEVPECEVAHTSPSEKEDEDIITSVENEECDGLMAT TASGDITNQNSLAGGKNQGKVLIISTSTTNDYTPQVSAITDVEGGLSDALRTEENMEGTR VTTEEFEAPMPSAVSGDDSQLTASRSEEKDECAMISTSIGEEFELPISSATTIKCAESLQ PVAAAVEERATGPVLISTADFEGPMPSAPPEAESPLASTSKEEKDECALISTSIAEECEA SVSGVVVESENERAGTVMEEKDGSGIISTSSVEDCEGPVSSAVPQEEGDPSVTPAEEMGD TAMISTSTSEGCEAVMIGAVLQDEDRLTITRVEDLSDAAIISTSTAECMPISASIDRHEE NQLTADNPEGNGDLSATEVSKHKVPMPSLIAENNCRCPGPVRGGKEPGPVLAVSTEEGHN GPSVHKPSAGQGHPSAVCAEKEEKHGKECPEIGPFAGRGQKESTLHLINAEEKNVLLNSL QKEDKSPETGTAGGSSTASYSAGRGLEGNANSPAHLRGPEQTSGQTAKDPSVSIRYLAAV NTGAIKADDMPPVQGTVAEHSFLPAEQQGSEDNLKTSTTKCITGQESKIAPSHTMIPPAT YSVALLAPKCEQDLTIKNDYSGKWTDQASAEKTGDDNSTRKSFPEEGDIMVTVSSEENAY VPSEEEKNGEILAPPESLCGGKPSGIEISSGRKDNAEAISGHSVEADPKEEENSRDLEEL PKTSSETNSTTSRVMEEKDEYSSSETTGEKPEQNDDDTIKSQEDDGEEKIVTSVRRRGRK PKRSLTVSDDAESSEPERKRQKSVSDPVEDKKEQESDEEEEEEEEDEPSGATTRSTTRSE AQRKQHSKPSARATSKLGSPDTVSPRNRQKLAKEKLPTSEKVSNSPPLGRSKTQLSPSIK RKREVSPPGARTRGQQRVEEAPVKKAKR >gi568815594r:13441996_13644414|GENSCAN_predicted_CDS_7|8547_bp atgatcaagaggcccttcagggagctcgccagccagcctgcgtatcagaatctgagacag cgtgttgacaactttgttgcaaatcacttggcaactcacacatggagtccgcatctcaat aagaaccagctaagaaacaacattagacaacaagtcctcaaatcaggaatgttggagtct ggtattgaccgaattatttctcaggttgtggacccaaagatcaaccacacattcagacct caggtagagaaagctgtgcatgagtttttggccacgctaaatcacaaagaggaaggaagt ggcaacacagctcccgatgatgagaaaccagacacttcccttattacacaaggtgttcct actcctgggcccagtgctaatgtagccaatgatgccatgtcgatattggaaaccataact tctcttaaccaagaagccagtgctgctagggcttcaacagaaacatcaaatgccaagacc agtgagagagcgtcaaaaaaacttccatctcagccaaccactgatactagtactgacaaa gaaagaacttcagaggacatggctgataaagaaaaatctacagctgactctggaggtgaa ggactggaaacagccccaaagtctgaagagttcagcgacctcccctgtccagtcgaagaa 
attaaaaattacacaaaagagcataataatttaattctgctaaataaggatgttcaacag gaaagcagtgagcaaaaaaataaatcaacagacaaaggtgaaaagaagccagacagcaat gagaaaggagaaagaaagaaagaaaagaaggaaaagactgaaaagaaatttgatcactca aaaaagagtgaagatacacagaaagttaaagatgaaaaacaagcaaaggaaaaagaagta gagagtttaaaacttccttcagaaaagaacagtaataaagctaaaactgttgaagggaca aaagaagatttctctttgatagattctgatgtggatggacttacagacatcacagttagc tctgttcataccagtgacctttcatcttttgaagaagatactgaggaggaagttgtaacg tctgatagcatggaagaaggagagattacgtcagatgatgaagagaagaacaaacagaat aaaacaaaaactcaaactagtgattctagtgaaggaaaaacaaaaagtgtacggcatgcg tatgtccacaaaccatatctttactcaaaatactatagtgattctgatgatgagcttact gtagaacaacgacgacagtccattgccaaagaaaaagaagagaggcttttaagaaggcaa atcaatagagaaaaacttgaagaaaaacgaaaacagaaagcagaaaagacaaagtcttca aaaaccaagggtcaaggcaggagtagtgtggacttagaagaatcatcaacaaagagtttg gaacctaaagccgccagaattaaagaagtccttaaagaacggaaagttttagaaaaaaaa gtagccttaagcaaaaagagaaaaaaagattcaaggaatgttgaagagaactccaaaaag aaacagcaatatgaagaagattccaaagaaacccttaaaacaagtgagcattgtgaaaag gaaaaaatttcttcttcaaaggagctgaagcatgttcatgcaaaaagtgaaccaagtaaa cctgcccggagactttcagagtctttgcatgtagttgacgaaaacaaaaatgaatccaaa ttagaaagagaacataaaagacggacatctacccctgttatcatggagggggtacaggaa gagactgacacaagagatgtaaaaaggcaagtagaacgctcagaaatttgcaccgaagag ccccagaaacagaaaagcacacttaaaaacgaaaagcatctaaagaaagatgattctgaa acaccacatttgaaaagcctacttaagaaagaggtgaaatcctccaaggagaagcctgaa agagagaaaactccatcggaagacaaattgtctgtgaaacataaatataaaggtgattgt atgcataaaacaggtgatgagactgagcttcactcttctgagaaaggtttaaaagtagag gaaaatattcaaaagcaaagtcaacaaacaaagctttcttcagatgataaaaccgaacga aaaagtaaacataggaatgaaaggaaattatcagtattaggcaaagatggaaagccagtt tctgaatatattataaaaacagatgagaatgttcgtaaagaaaacaacaaaaaagagaga cgcttgtcagctgaaaaaactaaggcagagcacaaatcaagaaggtcaagtgattctaaa attcagaaagattctctgggttccaagcaacatggtatcacattacagagaagaagtgaa agttattcggaagataagtgtgatatggactccactaacatggatagtaatttgaaacca gaagaggttgttcacaaggagaaacgacgaacaaagagcttgttagaagagaaacttgtg ttgaagtctaaatcaaaaactcaaggcaaacaggtaaaagttgtagaaacagaattacaa 
gaaggtgccacaaaacaggcaaccactccaaaaccagacaaggagaagaacacagaagaa aatgactcagaaaaacagcgtaagtctaaagttgaagacaaaccttttgaagaaactggt gttgaacctgtattagagactgcttcttcttcagcacatagtacacagaaggattctagt catagagccaagttaccattagcaaaggagaaatataagagtgataaagactccacttcc accaggcttgagagaaagttgtcagatggccacaaaagcagaagcttaaagcatagtagt aaagacataaaaaagaaggacgaaaataaatcagatgacaaggatggtaaagaagttgac agtagtcatgaaaaggccagaggtaatagttcactcatggaaaagaaattaagtagaagg ttgtgcgaaaatcggagaggaagcttgtcacaagaaatggccaaaggagaagaaaaatta gcagcaaacactttgagcactcccagcggttcctcccttcagagaccaaaaaagagtggt gatatgacattgatccctgaacaagagccaatggaaattgattctgagccaggtgttgaa aatgtgtttgaagtatctaaaacccaagacaaccgcaataataattctcagcaagacatt gactctgaaaatatgaaacaaaaaacttctgccactgttcaaaaggatgaattgagaact tgcacagcagattcaaaagcaacagctccagcttataagccaggccgtggaacaggagtt aatagtaattctgaaaagcatgccgatcatagaagcaccttgaccaagaaaatgcatata caaagtgctgtgtccaaaatgaaccctggggagaaagaacccattcatagaggaactact gaagtgaatatagattctgaaactgttcatagaatgttactgagtgccccatcagaaaat gatagggtacagaagaatttgaaaaacacagctgctgaagaacatgttgctcaaggagat gccactcttgaacattccacaaatttagactcctcaccatccttaagttcagtgactgtt gtgcctctgagggaatcgtatgatccagatgtaattcctctgtttgacaaaagaactgtt ttggaaggtagcacagccagcacctcccctgcggatcactctgctctccctaaccaaagt ctgactgttagggaatcagaagtccttaagacaagtgacagcaaagaaggtggtgaaggt ttcacagtagatacaccagcaaaagcaagcatcactagcaaaagacacattccagaagct caccaggctactttattggatggtaaacaaggaaaggtaatcatgcctcttggaagtaag ttaacgggcgtgattgtggaaaatgagaatattaccaaagaaggtggcttagtggacatg gccaagaaagaaaatgacttaaatgcagagcccaatttaaagcagacaattaaagcaaca gtagagaatggcaagaaggatggcattgctgttgatcatgttgtaggcctgaatacagaa aaatatgctgaaactgtcaaacttaagcataaaagaagcccaggtaaagtaaaagacata tcaattgatgttgaaagaaggaatgaaaacagtgaggtagacaccagtgctggaagtggc tctgcaccctctgttttacaccaaaggaacggacaaactgaggatgtggcaactgggcct aggagagcagaaaagacttctgttgccactagtactgaagggaaggacaaagatgtcacc ttaagtccagtgaaggctgggcctgccacaaccacttcttcagaaacaagacaaagtgag gtggctttgccttgcaccagcattgaggcagatgaaggcctcataataggaacacattcc 
agaaataatcctcttcatgttggtgcagaagccagtgaatgcactgtttttgctgcagct gaagaaggtggggctgttgtcacagagggatttgctgaaagtgaaaccttcctcacaagc actaaggaaggggaaagtggggagtgtgctgtggctgaatctgaggacagagcagcagac ctactggctgtgcatgcagttaaaatcgaagccaatgtaaatagcgttgtgacagaggaa aaggatgatgctgtaaccagtgcaggctctgaagaaaaatgtgatggttctttaagtaga gactcagaaatagttgaaggaactattacttttattagtgaagttgaaagtgatggagca gttacaagtgctggaacagagataagagcaggatctataagcagtgaagaggtggatggc tcccagggaaatatgatgagaatgggtcccaaaaaagaaacagagggcactgtgacatgt acaggagcagaaggcagaagtgataactttgtgatctgctcagtaactggagcagggccc cgggaggaacgcatggttacaggtgcaggtgttgtcctgggagataatgatgcaccacca ggaacaagtgccagccaagaaggagatggttctgtgaatgatggtacagaaggtgagagt gcagtcaccagcacggggataacagaagatggagaggggccagcaagttgcacaggttca gaagatagcagcgaaggctttgctataagttctgaatcggaagaaaatggagagagtgca atggacagcacagtggccaaagaaggcactaatgtaccattagttgctgctggtccttgt gatgatgaaggcattgtgactagcacaggcgcaaaagaggaagacgaggaaggggaggat gttgtgactagtactggaagaggaaatgaaattgggcatgcttcaacttgtacagggtta ggagaagaaagtgaaggggtcttgatttgtgaaagtgcagaaggggacagtcagattggt actgtggtagagcatgtggaagctgaggctggagctgccatcatgaatgcaaatgaaaat aatgttgacagcatgagtggcacagagaaaggaagtaaagacacagatatctgctccagt gcaaaagggattgtagaaagcagtgtgaccagtgcagtctcaggaaaggatgaagtgaca ccagttccaggaggttgtgagggtcctatgactagtgctgcatctgatcaaagtgacagt cagctcgaaaaagttgaagataccactatttccactggcctggtcgggggtagttacgat gttcttgtatctggtgaagtcccagaatgtgaagttgctcacacatcaccaagtgaaaaa gaagatgaggacatcatcacctctgtagaaaatgaagagtgtgatggtctcatggcaact acagccagtggtgatattaccaaccagaatagcttagcagggggtaaaaatcaaggcaaa gttttgattatttccaccagtaccacaaatgattacacccctcaggtaagcgcaattaca gatgtggaaggaggtctctcagatgctctgagaactgaagaaaatatggaaggtaccaga gtaaccacagaagaatttgaggcccccatgcccagtgcagtctcaggagatgacagccaa ctcactgccagcagaagtgaagagaaagatgagtgtgccatgatttccacaagcataggg gaagaattcgaattgcctatctccagtgcaacaaccatcaagtgtgctgaaagtcttcag ccggttgctgcagcagtggaagaaagggctacaggtccagtcttgataagcaccgccgac tttgaggggcctatgcccagtgcgcccccagaagctgaaagtcctcttgcctcaaccagc 
aaggaggagaaggatgaatgtgctctcatttccactagcatagcagaagaatgtgaggct tctgtttccggtgtagttgttgaaagtgaaaatgagcgagctggcacagtcatggaagaa aaagacgggagtggcatcatctctacgagctcggtggaagactgtgagggcccagtgtcc agtgctgtccctcaagaggaaggcgacccctcagtcacaccagcggaagagatgggtgac accgccatgatttccacaagcacctctgaagggtgtgaagcagtcatgattggtgctgtc ctccaggatgaagatcggctcaccatcacaagagtagaagacttgagcgatgctgccatc atctccaccagcacagcagaatgtatgccaatttccgccagcattgacagacatgaagag aatcagctgactgcagacaacccagaagggaacggtgacctgtcagccacagaagtgagc aagcacaaggtccccatgcccagcctaattgctgagaataactgtcggtgtcctgggcca gtcaggggaggcaaagaaccgggtcccgtgttggcagtgagcaccgaggaggggcacaac gggccatcagtccacaagccctctgcagggcaaggccatccaagtgctgtttgtgcggaa aaagaagagaagcatggcaaggagtgccccgaaataggaccatttgcaggaagaggacag aaagagagcactttacacctcataaatgcagaagagaagaatgtattgttgaactccctt cagaaagaagataagagcccagagacagggacagcagggggcagtagcacagcaagttat tcagcaggaaggggcttagaggggaatgctaactcacctgcccacctgagaggaccagaa cagacgtctgggcagacggctaaggatccctctgtcagcattcgctatttggcagcagta aacaccggtgctataaaagctgatgacatgccacctgttcaagggaccgtggctgagcat tcctttcttcctgctgagcagcaggggtctgaagacaacttgaaaaccagtaccaccaaa tgtattactggccaagaatcaaaaattgctccttcccacacaatgatccctccagctact tacagtgtagctctgttggctcctaaatgtgagcaggacttgactataaagaatgattat agtggcaaatggactgatcaagcatctgctgagaaaacaggagatgataacagcacaagg aaatcattccctgaggaaggagacataatggttactgtgtcttctgaagaaaatgcttat gtgccttcagaggaagagaaaaatggtgaaattctggcaccaccagaaagtctgtgtggg ggaaagccaagtggaatagagatatccagtggtagaaaagacaacgcagaagccataagc ggtcacagtgttgaagcagatcctaaagaggaagagaactccagagatttggaagaatta cctaaaaccagttctgagacaaatagcactacctcaagggtcatggaagaaaaagatgaa tatagcagcagtgaaactactggtgaaaagccagagcagaacgatgatgacaccataaaa tctcaggaggatgatggtgaagaaaaaatagtaacaagtgtgcgtcggagaggaagaaaa cccaaacgttctctcactgtatcagatgatgctgaatcctcagagccagaaagaaaacgc cagaaatcagtttctgatccagtggaggacaagaaagagcaggagtctgatgaggaagag gaagaagaggaagaggacgagccttcaggagccaccacaagatccaccaccagatcagag gctcagagaaagcaacatagcaagccatctgcacgtgcaacatccaaacttggcagccca 
gacacagtttctcctagaaatcgccaaaaattagcaaaagagaagttacctaccagcgaa aaagttagtaactctcccccattaggaagatcaaagacacagctctccccttctatcaag cgcaagagagaagtcagccctcctggggcccgaacaagaggccagcaaagggtggaggaa gcccctgtgaaaaaagcgaagcgataa >gi568815594r:13441996_13644414|GENSCAN_predicted_peptide_8|223_aa MAQLEESTMLPSTLLALLYTNKGLILAKRINVRIEHIKLTKSRDSFLKHMKENDQKKKTK RKAAGPVQPPPPRIVRRSGPLPSPAQATMATNPQPQPPPPAPPPPPPQPQPQPPPPPPGP GAGPGAGGAGGAGAGAGDPQLVAMIVNHLKSQGLFDQFRRDCLADVDTKVRSAPGLGVCE GRGGPKGALLLQWLPLGAAAVCHGLCKAQSVHSLACPLVRFTD >gi568815594r:13441996_13644414|GENSCAN_predicted_CDS_8|672_bp atggcacaactagaagagtctacaatgttacccagcacgctgctggcattgttgtatacc aacaagggcctgattcttgccaagagaattaatgtgcgtatcgagcacattaagctcact aagagccgagatagcttcctgaaacacatgaaggaaaatgaccagaaaaagaaaacaaag agaaaggcggcgggaccagtgcagccgccgcctcccaggatcgtccgccggtcagggccc ttgccctccccggcacaggccaccatggccaccaacccacagccgcagccgcctcctccg gcgccgccgcctcccccgccgcagccgcagccgcagccaccgccgccgccgccgggcccc ggggctggccccggcgcgggcggggcgggcggcgcgggtgcgggcgccggggacccgcag ctcgtggccatgatcgtgaaccacctcaagagccaggggctcttcgaccagttccgcaga gactgcctggccgacgtggacaccaaggttaggagcgcgccgggtttgggggtctgcgag ggaaggggaggacccaagggcgcgcttcttttgcagtggcttcctcttggagctgcagct gtgtgccacggactgtgcaaagcacagagtgttcattccttagcgtgtcctttggttcgt tttaccgactag >gi568815594r:13441996_13644414|GENSCAN_predicted_peptide_9|112_aa MDKLEPFLGIFEGNWYPSQLHCKQVSGVGNNQILCHVYQANLPRKKKSTNPQSEAQMRMD NEHPACGPHIPEGQNPQETMLELNQYVVSNQTDLNMTCDSATYQLNDLVMVI >gi568815594r:13441996_13644414|GENSCAN_predicted_CDS_9|339_bp atggacaaattagaacccttccttgggatttttgaaggtaattggtatccctctcagcta cattgtaagcaggtgtctggagtcggtaataaccagatcctctgtcatgtgtatcaagcc aatctgccaagaaaaaaaaaatccactaatccacagagtgaagcacagatgaggatggac aacgaacaccctgcctgtggtccacatattcctgagggccagaatccccaagagaccatg ttagagttaaatcaatatgtggtcagcaatcagacagacctgaatatgacatgtgattct gctacttatcagctgaatgaccttgtgatggtcatctga