GENSCAN 1.0 Date run: 6-Nov-116 Time: 18:38:52 Sequence gi568815597r:229418182_229658652 : 240471 bp : 43.09% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 7587 7748 162 2 0 74 54 135 0.775 3.29 1.02 PlyA + 8007 8012 6 1.05 2.27 PlyA - 11173 11168 6 1.05 2.26 Term - 13461 13318 144 0 0 88 42 316 0.999 24.91 2.25 Intr - 13721 13540 182 0 2 105 98 366 0.999 38.89 2.24 Intr - 14004 13813 192 1 0 128 97 522 0.999 56.56 2.23 Intr - 14250 14089 162 1 0 92 85 406 0.999 40.65 2.22 Intr - 14699 14375 325 2 1 85 117 668 0.999 64.75 2.21 Intr - 14946 14806 141 0 0 101 71 274 0.989 27.55 2.20 Intr - 16701 16586 116 1 2 62 40 90 0.015 1.77 2.19 Intr - 26821 26733 89 0 2 81 115 86 0.939 10.21 2.18 Intr - 30873 30809 65 0 2 16 81 52 0.292 -5.18 2.17 Intr - 32424 32344 81 0 0 51 107 57 0.435 3.73 2.16 Intr - 42588 42430 159 0 0 79 116 43 0.949 6.38 2.15 Intr - 45495 45362 134 1 2 92 64 160 0.954 14.36 2.14 Intr - 46694 46443 252 0 0 75 88 316 0.995 27.71 2.13 Intr - 47338 47239 100 1 1 88 57 34 0.985 -0.02 2.12 Intr - 48575 48453 123 2 0 88 82 77 0.991 7.88 2.11 Intr - 52601 52399 203 0 2 0 94 141 0.307 4.90 2.10 Intr - 59579 59416 164 1 2 90 90 54 0.516 5.42 2.09 Intr - 65964 65873 92 0 2 105 75 48 0.948 3.99 2.08 Intr - 68347 68190 158 2 2 109 80 32 0.645 4.23 2.07 Intr - 69432 69255 178 0 1 62 85 40 0.319 0.59 2.06 Intr - 77866 77711 156 1 0 89 86 1 0.513 0.21 2.05 Intr - 80125 79955 171 1 0 98 98 -14 0.514 0.74 2.04 Intr - 82682 82575 108 2 0 61 96 18 0.180 0.38 2.03 Intr - 83921 83818 104 0 2 80 90 52 0.328 4.49 2.02 Intr - 87977 87859 119 1 2 21 100 66 0.271 1.21 2.01 Init - 90068 89887 182 2 2 62 115 80 0.247 4.82 2.00 Prom - 97637 97598 40 -6.16 3.19 PlyA - 98282 98277 6 1.05 3.18 Term - 100229 99998 232 1 1 49 49 182 0.707 6.35 3.17 Intr - 100694 100660 35 2 2 116 94 14 0.989 1.82 3.16 Intr - 103454 103411 44 0 2 76 121 40 0.996 4.06 3.15 Intr - 107935 107755 181 1 1 76 57 155 0.974 10.64 3.14 Intr - 109127 109048 80 0 2 82 110 45 0.879 5.37 3.13 Intr - 112227 112018 210 1 0 83 107 80 0.981 8.28 3.12 Intr - 113550 113455 96 1 0 96 76 36 0.834 3.18 3.11 Intr - 121410 121275 136 0 1 111 84 107 0.989 12.74 3.10 Intr - 122571 122425 147 0 0 88 92 111 0.994 11.93 3.09 Intr - 124190 124056 135 2 0 76 103 70 0.994 8.06 3.08 Intr - 129520 129318 203 2 2 123 80 -4 0.938 1.30 3.07 Intr - 131253 131053 201 1 0 63 92 232 0.467 20.36 3.06 Intr - 140475 139955 521 2 2 36 91 216 0.023 9.30 3.05 Intr - 141136 140927 210 0 0 67 32 127 0.006 3.03 3.04 Intr - 147906 147818 89 0 2 72 117 -4 0.024 -0.33 3.03 Intr - 156466 156247 220 0 1 22 61 136 0.012 2.60 3.02 Intr - 168455 168392 64 0 1 68 75 38 0.101 -1.72 3.01 Init - 169225 169171 55 1 1 48 75 88 0.538 4.95 3.00 Prom - 170985 170946 40 -5.06 4.06 PlyA - 171304 171299 6 1.05 4.05 Term - 176913 176116 798 0 0 85 28 870 0.980 73.97 4.04 Intr - 179243 179112 132 2 0 57 57 76 0.700 2.24 4.03 Intr - 184738 184014 725 2 2 100 64 662 0.313 56.15 4.02 Intr - 192029 191925 105 0 0 125 119 76 0.999 14.59 4.01 Init - 196301 196160 142 2 1 50 95 271 0.772 24.30 4.00 Prom - 198607 198568 40 -4.56 5.00 Prom + 203585 203624 40 -2.46 5.01 Init + 207940 208175 236 0 2 53 105 122 0.046 6.01 5.02 Intr + 208438 208485 48 1 0 84 107 34 0.043 2.70 5.03 Intr + 211218 211287 70 0 1 72 68 33 0.042 -1.12 5.04 Intr + 216736 220066 3331 0 1 84 110 1496 0.773 138.79 5.05 Intr + 225352 225512 161 2 2 96 94 27 0.974 3.81 5.06 Intr + 229329 229571 243 2 0 83 100 228 0.910 21.19 5.07 Intr + 233054 233141 88 1 1 97 86 5 0.948 0.74 5.08 Intr + 236068 236207 140 2 2 60 93 117 0.625 9.58 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 23859 23723 137 1 2 108 32 80 0.886 2.58 S.002 Init + 207125 207158 34 1 1 76 115 12 0.861 2.96 S.003 Term + 207990 208150 161 1 2 36 52 191 0.880 8.40 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:229418182_229658652|GENSCAN_predicted_peptide_1|53_aa MRRRGGGGGRREEKEEEEEEKRRRRKKRRGGGGRREEEEEEEEKRRRRKRRRG >gi568815597r:229418182_229658652|GENSCAN_predicted_CDS_1|162_bp atgaggagaagaggaggaggaggaggaagaagagaagagaaggaggaggaggaagaagag aagaggaggaggaggaagaagagaagaggaggaggaggaagaagagaagaggaggaggag gaagaagagaagagaaggaggaggaagaggaggaggggatga >gi568815597r:229418182_229658652|GENSCAN_predicted_peptide_2|1299_aa MFPAAPSPRTPGTGSRRGPLAGLGPGSTPRTASRKGLPLGSAVSSPVLFSPVGRRSSLSS RGTPTRMFPHHSITESVNYDVKTFGSSLPVKVMEALTLAEVDDQLTINIDEGGWACLVCK EKLIIWKIALSPITKLSVCKELQLPPSDFHWSADLVALSYSSPSGEAHSTQGGSFILSSS GSQLIRLIPESSGKIHQHILPQGQGMLSGIGRKVSSLFGILSPSSDLTLSSVLWDRERSS FYSLTSSNISKWELDDSSEKHAYSWDINRALKENITDAIWSEDLILCQLTVPNFSNQTAY LYNESAVYVCSTGTGKFSLPQEKIVFNAQGTVAVLESMIGDSVLGAGACGGVPIIFSRNS GLVSITSRENVSILAEDLEGSLASSVAGPNSESMIFETTTKNETIAQEDKIKLLKAAFLQ YCRKDLGHAQMVVDELFSSHSDLDSDSELDRAVTQISVDLMDDYPASDPRWAESVPEGSF PVRGTPMATRLLLCEHAEKLSAAIVLKNHHSRLSDLVNTAILIALNKREYEIPSNLTPAD VFFREVSQVDTICECLLEHEEQVLRDAPMDSIEWAEVVINVNNILKDMLQAASHYRQNRN SLYRREESLEKEPEYVPWTATSGPGGIRTVIIRQHEIVLKVAYPQADSNLRNIVTEQLVA LIDCFLDGYVSQLKSVDKSSNRERYDNLEMEYLQKRSDLLSPLLSLGQYLWAASLAEKYC DFDILVQMCEQTDNQSRLQRYMTQFADQNFSDFLFRWYLEKGKRGKLLSQPISQHGQLAN FLQAHEHLSWLHEINSQELEKLYICEENRRANEYDFKKALDLLEYIDESITEDFCDQTCW DFPPPTLGNHWSSSDGKDDPIEVSKDSIFVKILQKLLKDGLSPGAPAPPPRRSALASPRQ LSILRQLRALRPPVALCAKLDTMCDEDETTALVCDNGSGLVKAGFAGDDAPRAVFPSIVG RPRHQGVMVGMGQKDSYVGDEAQSKRGILTLKYPIEHGIITNWDDMEKIWHHTFYNELRV APEEHPTLLTEAPLNPKANREKMTQIMFETFNVPAMYVAIQAVLSLYASGRTTGIVLDSG DGVTHNVPIYEGYALPHAIMRLDLAGRDLTDYLMKILTERGYSFVTTAEREIVRDIKEKL CYVALDFENEMATAASSSSLEKSYELPDGQVITIGNERFRCPETLFQPSFIGMESAGIHE TTYNSIMKCDIDIRKDLYANNVMSGGTTMYPGIADRMQKEITALAPSTMKIKIIAPPERK YSVWIGGSILASLSTFQQMWITKQEYDEAGPSIVHRKCF >gi568815597r:229418182_229658652|GENSCAN_predicted_CDS_2|3900_bp atgttcccagccgccccttctccgcggaccccgggtaccgggtcccgaaggggcccgctg gccggactcgggcccggctccacgccccggacggctagcaggaagggtctgcccctgggg tctgcagtcagctccccagtgctcttctcgccggtcggccggcgtagctcgctaagctcg cggggaacaccaacacgaatgttcccacaccactccataactgagtctgtgaactatgat gtgaaaacgtttggatcttctcttcctgttaaagtcatggaagccctaacattggctgaa gtcgatgaccagctgaccattaacatagatgaaggtggatgggcttgtctggtgtgcaaa gagaagctcattatttggaagattgctctgtcacctattactaagttatccgtttgcaaa gaacttcagctgccacctagtgatttccactggagtgccgacttagtggctctttcttac tcttctccctcaggtgaagcacattctactcagggaggaagttttattttgtcttcatca ggaagccaactaattcggttgatacctgagagctcaggaaagattcatcagcatatcctg cctcaggggcaaggcatgctttcaggaattggtcgaaaagtttcttctctttttggaatt ttatctcctagtagtgatctcacactttcaagtgttctctgggatagagagagatcaagc ttttatagcctgacgagttcaaacatcagtaaatgggaattagatgattcttcagaaaag catgcatacagttgggatataaatagagccctgaaggaaaacattaccgatgctatttgg tctgaagacctgattttgtgtcagttgacggtcccaaacttttcaaaccagactgcctat ctgtataacgaaagtgctgtctatgtgtgctccacaggaactgggaaattttctcttccc caggagaaaattgtctttaatgcacaaggtacagtagcagttttagaaagcatgattgga gatagtgttttaggtgctggtgcctgtggtggtgttcctatcattttttctagaaacagt ggactggtgtctattacttcaagggaaaatgtgtctatattggcagaagacttggaaggg tctttagcatcttcagttgctggaccaaacagtgagagtatgatttttgagaccactaca aagaatgaaactatagcccaggaagataaaatcaagttgctgaaagctgcctttctgcaa tactgcagaaaagatttaggtcatgctcaaatggtggttgatgagctcttttcctctcac tctgatttggattctgattctgaactagacagggcagttacccaaatcagtgtagacctg atggatgactacccagcatctgacccacggtgggctgagtctgtccctgagggcagtttt ccagttagagggacaccgatggccactcgactgttgctctgtgagcatgccgaaaagctg tcagccgccattgttctcaagaaccaccactcccggctttctgaccttgtcaacacagcc atattgattgctttgaacaagagggagtatgaaatcccatccaacctgactcctgcagat gtctttttcagggaggtatcccaagtagataccatctgtgagtgcttactggagcatgag gagcaagtcttgagggatgcacctatggattccattgaatgggctgaagtggtgatcaat gtgaacaatattctcaaggatatgctgcaggctgctagtcattatcgccaaaatagaaac tctttgtatagaagagaagaatcactagaaaaagaacctgaatatgttccatggacggca acaagtggtcctggtggcatccgaacggtaataatacgccagcatgagattgtcctgaag gtggcttatccacaggcagacagcaacctccgaaacatcgtgaccgagcagctggtagcc ctgatcgattgcttcctggatggttatgtttctcagcttaagtctgtggataaatccagt aatcgggaaagatatgacaatctggagatggaatacctacagaaaagatcagatctctta tctcctcttctttcactaggccagtacctgtgggctgcttctctagcagagaaatactgt gactttgatatattggtacaaatgtgtgagcagactgacaaccagagccgactccagcgc tacatgacccagtttgctgatcagaatttttcagactttctcttccgttggtatctggag aaaggaaagcgaggcaaattattatctcagcccatttctcagcatggacagttggcaaat tttttgcaagctcatgaacatctcagctggttacatgaaattaatagccaagaattagaa aagctatatatctgtgaagaaaatagaagagctaatgaatatgatttcaagaaagctttg gacttgttggaatatattgatgagtcaataacagaagacttctgtgaccagacgtgttgg gatttcccacctcctacactgggcaaccactggtccagttctgatggcaaagatgatcca attgaagtatctaaagacagtatatttgtgaagatcttacagaaacttttaaaagatggc ctatccccgggagcccccgcgcctcctccccggcgctccgccctcgcctccccccgccag ttgtctatcctgcgacagctgcgcgccctccggccgccggtggccctctgtgcgaaacta gacacaatgtgcgacgaagacgagaccaccgccctcgtgtgcgacaatggctccggcctg gtgaaagccggcttcgccggggatgacgcccctagggccgtgttcccgtccatcgtgggc cgcccccgacaccagggcgtcatggtcggtatgggtcagaaagattcctacgtgggcgac gaggctcagagcaagagaggtatcctgaccctgaagtaccctatcgagcacggcatcatc accaactgggatgacatggagaagatctggcaccacaccttctacaacgagcttcgcgtg gctcccgaggagcaccccaccctgctcaccgaggcccccctcaatcccaaggccaaccgc gagaagatgacccagatcatgtttgagaccttcaacgtgcccgccatgtacgtggccatc caggccgtgctgtccctctacgcctccggcaggaccaccggcatcgtgctggactccggc gacggcgtcacccacaacgtgcccatttatgagggctacgcgctgccgcacgccatcatg cgcctggacctggcgggccgcgatctcaccgactacctgatgaagatcctcactgagcgt ggctactccttcgtgaccacagctgagcgcgagatcgtgcgcgacatcaaggagaagctg tgctacgtggccctggacttcgagaacgagatggcgacggccgcctcctcctcctccctg gaaaagagctacgagctgccagacgggcaggtcatcaccatcggcaacgagcgcttccgc tgcccggagacgctcttccagccctccttcatcggtatggagtcggcgggcattcacgag accacctacaacagcatcatgaagtgtgacatcgacatcaggaaggacctgtatgccaac aacgtcatgtcggggggcaccacgatgtaccctgggatcgctgaccgcatgcagaaagag atcaccgcgctggcacccagcaccatgaagatcaagatcatcgccccgccggagcgcaaa tactcggtgtggatcggcggctccatcctggcctcgctgtccaccttccagcagatgtgg atcaccaagcaggagtacgacgaggccggcccttccatcgtccaccgcaaatgcttctag >gi568815597r:229418182_229658652|GENSCAN_predicted_peptide_3|952_aa MWTTVEVESMDKSSEHVREQPVKPLQFGLGLQSLNQYLTRMKPDWILEATVPHSGLLFQS ICQVTQNAPHFECGLEQEGALQKAQAAGKEALLWTIWFSRPKDAGGINGGGDSHFGRVHE PLYPAGTPFRTTQIQNSDNTKCRAELTRSEVLSVVPPHLSPVLGSETSDRAAPAWVSRAA DPPHAVSSTLSSRPIFVPAIVWSRLRAPPGLPAAMRGPPAWPLRLLEPPSPAEPGRLLPV ACVWAAASRVPGSLSPFTGLRPARLWGAGPALLWGVGAARRWRSGCRGGGPGASRGVLGL ARLLGLWARGPGSCRCGAFAGPGAPRLPRARFPGGPAAAAWAGDEAWRRGPAAPPGDKGR LRPAAAGLPEARKLLGLAYPERRRLAAAVGFLTMSSVISMSAPFFLGKIIDVIYTNPTVD YSDNLTRLCLGLSAVFLCGAAANAIRVYLMQTSGQRIVNRLRTSLFSSILRQEVAFFDKT RTGELINRLSSDTALLGRSVTENLSDGLRAGAQASVGISMMFFVSPNLATFVLSVVPPVS IIAVIYGRYLRKLTKVTQDSLAQATQLAEERIGNVRTVRAFGKEMTEIEKYASKVDHVMQ LARKEAFARAGFFGATGLSGNLIVLSVLYKGGLLMGSAHMTVGELSSFLMYAFWVGISIG GLSSFYSELMKGLGAGGRLWELLEREPKLPFNEGVILNEKSFQGALEFKNVHFAYPARPE VPIFQDFSLSIPSGSVTALVGPSGSGKSTVLSLLLRLYDPASGTISLDGHDIRQLNPVWL RSKIGTVSQEPILFSCSIAENIAYGADDPSSVTAEEIQRVAEVANAVAFIRNFPQGFNTV VGEKGVLLSGGQKQRIAIARALLKNPKILLLDEATSALDAENEYLVQEALDRLMDGRTVL VIAHRLSTIKNANMVAVLDQGKITEYGKHEELLSKPNGIYRKLMNKQSFISA >gi568815597r:229418182_229658652|GENSCAN_predicted_CDS_3|2859_bp atgtggaccacagtggaagtggagagcatggacaagtccagtgagcacgttagagagcag cccgttaaacctctccagtttggattaggccttcagtcattgaaccagtacctgacaaga atgaagcctgactggatcctggaggcaacagttccacactcaggactacttttccagtcc atatgccaagtgacacagaacgctcctcactttgaatgcggcctagagcaggaaggagct ctgcagaaggcccaggctgcaggcaaagaagccttgctttggaccatatggttcagcagg cccaaagatgctggaggtatcaatggtggaggagattcacactttgggagggtgcatgag ccactgtacccagccggcacaccttttagaacgactcaaatccaaaattctgacaacacc aaatgccgagctgagctgacacgcagcgaagtcctctcagtggtccctccgcacctcagc cctgtgcttggcagcgagacgagtgacagggccgccccagcatgggtctcccgggccgcg gacccgccccacgccgtatccagcacgctgtcctcccgccccatctttgtgccggccatc gtgtggtcgcgtctgcgcgctccgcccggtctgccggccgccatgcgaggcccccctgcc tggccgctgcggctgctcgagccaccgagccctgccgagccaggtcggctcctgccggta gcctgcgtgtgggccgcggccagccgcgttcccgggtccctatcgccgttcactggcctg aggccggcgcggctatggggcgcggggcccgcgctgctctggggcgttggagccgcgcgc cgctggaggagcggctgccggggcgggggtccgggcgcctcgcggggcgtcctgggcctc gcgcggctcctggggctgtgggctcgcggccccggcagctgcaggtgcggggcttttgcc gggccaggcgctcctcggctcccgcgcgcccggttcccgggcggtcccgcagccgctgcc tgggcaggggacgaggcctggcggcgcgggccggcggcgcctcccggggacaaggggcgg ctgcgccccgcagcggccggactcccggaggcccggaagctcctggggctggcgtaccct gagcgccggaggctggcagctgcggttggatttctcacgatgtccagtgttatctccatg tctgcccctttcttcctggggaagatcattgatgtcatctataccaaccccactgtggac tacagcgacaacctgacccgcctctgcctagggctcagtgccgtgtttctgtgtggtgct gccgccaatgccattcgtgtctacctcatgcaaacttcaggtcagcgcattgtgaatagg ctgagaacttcattattctcctccattctgaggcaggaggttgctttctttgacaagact cgcacaggagaattgattaaccgcctctcatcagacactgcactcctggggcgctcagtg actgaaaacctctcagatgggctcagggccggggcccaggcttccgtaggcatcagtatg atgttttttgtctcacctaatctggccacctttgttttgagcgtggtgcctccagtgtca atcattgctgtaatttatgggcgatatctacggaaactgaccaaagtcactcaggattcc ctggcacaagccactcagctagctgaggaacgtattggaaatgtaagaactgttcgagct tttgggaaagaaatgactgaaatcgagaaatatgccagcaaagtggaccatgtaatgcag ttagcaaggaaagaggcattcgcccgggctggtttctttggagcaactgggctctccgga aacctgatcgtgctttctgtcctgtacaaaggagggctgctgatgggcagtgcccacatg accgtgggtgaactctcttccttcctaatgtatgctttctgggttggaataagcattgga ggtctgagctctttctactcggagctgatgaaaggactgggtgcaggggggcgcctctgg gagctcctggagagagagcccaagctgccttttaacgagggggtcatcttaaatgagaaa agcttccagggtgctttggagtttaagaacgtgcattttgcctatccagctcgcccagag gtgcccatatttcaggatttcagcctttccattccgtcaggatctgtcacggcactggtt ggcccaagtggttctggcaaatcaacagtgctttcactcctgctgaggttgtacgaccct gcttctggaactattagtcttgatggccatgacatccgtcagctaaacccagtgtggctg agatccaaaattgggacagtgagtcaggaacccattttgttttcttgctctattgctgag aacattgcttatggtgctgatgacccttcctctgtgaccgctgaggaaatccagagagtg gctgaagtggccaatgcagtggccttcatccggaatttcccccaagggttcaacactgtg gttggagaaaagggtgttctcctctcaggtgggcagaaacagcggattgcgattgcccgt gctctgctaaagaatcccaaaattcttctcctagatgaagcaaccagtgcgctggatgcc gaaaatgagtaccttgttcaagaagctctagatcgactgatggatggaagaacggtgtta gttattgcccatcgtctgtccaccattaagaatgctaatatggttgctgttcttgaccaa ggaaaaattactgaatatggaaaacatgaagagctgctttcaaaaccaaatgggatatac agaaaactaatgaacaaacaaagttttatttcagcataa >gi568815597r:229418182_229658652|GENSCAN_predicted_peptide_4|633_aa MKRVRTEQIQMAVSCYLKRRQYVDSDGPLKQGLRLSQTAEEMAANLTVQSESGCANIVSA APCQAEPQQYEVQFGRLRNFLTDSDSQHSHEVMPLLYPLFVYLHLNLVQNSPKSTVESFY SRFHGMFLQNASQKDVIEQLQTTQTIQDILSNFKLRAFLDNKYVVRLQEDSYNYLIRYLQ SDNNTALCKVLTLHIHLDVQPAKRTDYQLYASGSSSRSENNGLEPPDMPSPILQNEAALE VLQESIKRVKDGPPSLTTICFYAFYNTEQLLNTAEISPDSKLLAAGFDNSCIKLWSLRSK KLKSEPHQVDVSRIHLACDILEEESQLDTFPPAWDNHQDPWGAPAGLPAHAPSPCTFISN CDAFTTEEDDEDDNAGTEMKILRGHCGPVYSTRFLADSSGLLSCSEDMSIRYWDLGSFTN TVLYQGHAYPVWDLDISPYSLYFASGSHDRTARLWSFDRTYPLRIYAGHLADVDCVKFHP NSNYLATGSTDKTVRLWSAQQGNSVRLFTGHRGPVLSLAFSPNGKYLASAGEDQRLKLWD LASGTLYKELRGHTDNITSLTFSPDSGLIASASMDNSVRVWDIRNTYCSAPADGSSSELV GVYTGQMSNVLSVQFMACNLLLVTGITQENQEH >gi568815597r:229418182_229658652|GENSCAN_predicted_CDS_4|1902_bp atgaaacgagtgcgtaccgagcagattcagatggcagtgtcctgctacctcaaacgccgg cagtacgtggactcagatggtcccctgaagcaaggactgcggctgtcacagactgctgaa gagatggcggccaatctaacagtgcaatcagaatctggttgtgccaacatagtgtctgca gccccttgccaggcagaaccccagcaatatgaagtacagtttggacgactgcggaatttt ctcactgattctgattcccagcatagccacgaagtgatgcctctcctctatcctctcttt gtctacctccatctcaacctggtccaaaacagtccgaagagcacagtggaaagtttttac agccgcttccatggaatgtttctgcagaatgctagccagaaggatgtcattgagcagcta cagaccactcaaaccatccaggacatcctatctaacttcaagcttcgagcattcctagat aacaagtacgtggtccgtctccaagaagacagctacaactaccttatccgctacctccaa agtgacaacaatactgccctgtgcaaagtcctcaccttacatattcatcttgacgtgcag cctgccaagagaacagactatcagctgtatgccagtggcagctcctcccgcagtgagaac aacggtttggagccccccgacatgcccagccctattctgcagaacgaggctgccctagag gtcttacaggagagcattaagcgagtcaaggatgggcctccctccctcactaccatctgc ttctatgccttctataacacagagcagctgttgaacactgcagaaatctcccccgatagc aagctgcttgctgctgggtttgacaactcctgtataaaactttggagtttacgatccaag aagttaaaatcagagccccaccaagtagacgtgtcccgcatccatttggcttgtgatatt ctggaggaggagagtcagctggacacgtttccaccagcctgggacaatcatcaagatccc tggggagcaccagctggactcccagcccatgctcctagcccatgcaccttcatctccaat tgtgatgccttcaccactgaggaggatgatgaggatgataatgcaggcacggagatgaag atactgcggggccactgcggaccagtgtacagcacgaggttcctcgcggacagctcaggg ttgctctcttgttctgaagacatgtccatcagatactgggatctggggagtttcaccaac actgtgttgtaccaaggacatgcctatcctgtgtgggatctggacatcagtccatatagc ctgtacttcgccagcgggtcccacgaccgcaccgccaggctgtggtcatttgatcggacg tacccgctgaggatatatgcaggacacctggcagatgtggactgtgtcaaattccaccct aattcaaactacttggccacgggctcaaccgacaagaccgtccggctgtggagcgctcag caggggaactcggtgaggcttttcacaggccaccgtggccccgtgctttctctcgccttt tctcccaacggtaagtacttggcgtctgctggcgaggaccagcggttgaagctgtgggac ttggcctctgggaccctttataaagagttgagaggccacacagacaatatcaccagcctc accttcagtccagacagcggcttgattgcctctgcctccatggacaactcggtgcgcgtc tgggacatcaggaacacttactgcagtgcacctgccgacggctcctccagcgagctcgtg ggcgtgtacaccgggcagatgagcaacgtcctgagcgtgcagttcatggcctgtaacctt cttctggtgactggaattacacaagaaaatcaggaacattaa >gi568815597r:229418182_229658652|GENSCAN_predicted_peptide_5|1439_aa MGEAPRRSAGDAWRACSPSPLRRRRPAEGARVPVPRGGDAGPVQRPPPHRDSSRRRCRRP PLSTRSCLELSPLPGRIPRECWQGCAVRPGLRIARTWYMHISEILQVWFHVTEIDHNIII NERVAEFSLSGSQRNICAVLRCCQGILSTPALAVIYTAKQELMVALLSQLCWSACRQPEG AVVAQLFEVIHLALGHYLLILQQQVNPRRAFGDVTAHLLQPCLVLRHLLSGGTWTQAGQG QLRQVLSRDIRSQIEAMFRGGIFQPELLSSYKEGLLDQQQGDVKTGAMKNLLAPMDTVLN RLVDAGYCAASLHTSVVANSVALLYKLFLDSYFKEGNQLLCFQVLPRLFGCLKISHLQEE QSKALSTSDWTTELLVVEQLLNSVANNNIYNIAADRIRHEEAQFRFYRHVAELLINHAQA PIPAWFRCLKTLISLNHLILEPDLDDLLASAWIDAEVTEFRTKKAQEALIRTVFQTYAKL RQVPRLFEEVLGVICRPAAEALRQPVLASGPSTVLSACLLELPPSQILDTWSLVLEKFQS LVLPYLQSDADMALKSLSLSLLLHCIMFNMRSLDSSTPLPIVRRTQCMMERMMRELVQPL LALLPDTPGPEPELWLQKVSDSVLLLSYTWAQVDAMFSLNCSQYHSMSGPLIGVALEISN LPSLLPGVKTQHWKKIEKFTAQFSSLGTYCLEQLYLQKMKRTLMQTSFRSEGAIQSLRCD AAFIIGSGRKSLNQRTTASWDGQVGMVSGLTYPVAHWHLIVSNLTILISYLCPDDVGYLA SVLLRTLPMGKAQEVSIDEEAYITLEKISKAFLHSPLFPEMQSLHSAFLTCVTTSCSSIL CSGAQRDSGLVSQQLPWLFEKDHMVVGHWENRFAKAGPEGIEPRGEIAQNLLSLVKSDFP IQLEGEQLESILGLLEVISALQLDSLLPPYHVHYFLVLLSMAVTKLGCSCSSSLALKFLT TCYQLLGYLQKGKSARSVFKIMYGSDIFEVVLTSLFRASSRFLIEMDDPAWLEFLQVIGT FLEELMQMLIQMKLSLVLNFRKITAFLSSSKPYTEAASSKQLENQNPQGRQLLLVSLTRL CHVLGPFLKEQKLGQEAPAALSELLQQVVLQTGAVLQLCSVPGARGWRLPSVLISSVSTL LEADLGQHCRDGGADISQGSDRTLLSHVALYQGVYSQILLELPALAGHDQSFQAALQFLT LFFLAPELHPKKDSVFTSMFHSVRRVLADPEIPVQVTQDIEPHLGALFTQMLEVGTTEDL RLVMQCILQGLDVSNMWKADVQLLNREASQEQPVSLTVVGPVLDVLAALLRQGEEAIGNP HHVSLAFSILLTVPLDHLKPLEYGSVFPRLHNVLFSILQCHPKVMLKAIPSFLNSFNRLV FSVMREGRQKDKGSIDDLPTVLKCARLVERMYSHIAARAEEFAVFSPFMVAQYVLEVQK >gi568815597r:229418182_229658652|GENSCAN_predicted_CDS_5|4317_bp atgggcgaggcgcctcggcgctcggctggcgacgcctggcgtgcgtgcagcccctccccg ctgcgccgccggcgtccggcggaaggagcgcgggtaccggtcccacgtggtggggacgcg ggacccgtacagcggcctccgccgcaccgggacagcagccgccgccgctgccgccgtcct cccctgtctacccggagctgtctcgagctgagccccctaccgggccggatcccgagggag tgctggcagggctgtgctgtccgccctggacttcgcattgcccgaacttggtacatgcat atctcagagatactgcaggtttggttccacgtcactgaaatagatcacaatattatcatc aatgagagagtagctgagttctctctttcgggatcccaaagaaacatctgtgctgtcctt cgatgttgccagggcatcctgtcgacacctgccctggctgtcatctacacggccaaacag gagctgatggtggccttgctgagccagctttgctggtcggcctgcaggcagcccgaagga gctgtggtagcccagttgtttgaggtcattcacctggcccttggccattatctcttgatc ctgcagcagcaggtcaacccaagacgtgcctttggggatgtgactgctcacctgctccag ccgtgcctggtcctgaggcacttactctctgggggcacatggacgcaggctggccagggc cagctgaggcaggtgctgagccgggacatcaggagtcagattgaggccatgttccgagga gggatttttcagcctgagctactgtcatcctacaaggaggggctcttggaccagcagcaa ggggatgtgaagacgggagccatgaagaaccttctggctcccatggacaccgtgcttaac aggctggttgatgctggctactgtgcagcatcccttcatacctctgttgtggccaactca gtggccttgctgtataagctctttctagattcttactttaaggagggaaaccagcttctc tgcttccaggttctccccaggttgtttggctgcttgaagatttcacacctgcaggaggag cagagcaaagccctgtccacatcagattggaccacagagcttttggttgtggaacagcta ctaaactcagtggccaacaacaatatctacaacatcgctgccgacagaattcggcacgaa gaggctcagttccgcttttaccgccacgtggctgagctgctgataaaccatgcacaagca cccataccggcctggttccgctgtctgaagactttgatatctctgaatcatttgattttg gagccagacctggatgacctgctggcttcagcgtggatcgatgccgaggtaacagagttt cgaaccaaaaaagcccaggaggcgcttattcgtactgtcttccagacttatgccaaactc cgacaagtgccacggttgtttgaagaggttttgggggtgatctgtcgtccagctgctgag gcactgaggcagcctgtgctggcctcgggcccctccacggtactctctgcatgcctcctg gagctgcctccaagtcagatcctggacacgtggtcccttgtgctggagaagttccagtct ttagtcttgccctatttgcagagtgatgccgacatggccctgaaatcactgtcactgagc ttgctgctgcactgcatcatgttcaacatgaggagcctggacagcagcacgcctctgccc attgtcagacggacacagtgcatgatggagaggatgatgagggagctcgtgcagcccctg ctggcccttctcccggacaccccaggcccagagccagagctgtggctgcagaaggtcagt gactctgtgctcctgctctcttacacttgggcccaggtggacgctatgttcagtttgaac tgtagccagtatcactctatgtctgggccccttataggtgttgctctggagatctcgaac ctcccttcgttgctcccaggtgtaaaaacacagcattggaagaagatagagaagtttaca gctcagttcagctctcttggtacatattgcttagaacagctgtacctgcagaaaatgaaa aggactttaatgcaaactagtttccggtctgaaggagccatccaaagtttgaggtgcgat gctgcctttattattggttccggcagaaaaagcttgaatcagagaacgacggcttcctgg gatggccaagttgggatggtgagtggactcacataccctgtagcacactggcacttgatt gtgtcaaatctcacaattttaatatcctatctgtgtccagatgatgtgggatacctggcc agtgtcctgctgagaactttacccatgggcaaagcccaggaagtctcaatagatgaagag gcatacatcacactggaaaaaatatccaaagccttccttcatagccctctctttccagag atgcagtcccttcattctgctttcttaacgtgcgtaaccacaagttgctccagcattctg tgttctggtgcccagcgtgactcaggtcttgtcagtcagcagcttccctggctttttgaa aaggaccacatggttgtgggtcattgggaaaacagatttgcaaaagctggacccgaaggt atagaacctagaggagaaattgcccagaacttactgtccctggtcaagagtgacttccct atccagctggagggagagcagttggaaagcatcctggggcttttggaagtgatttctgcc ttacagctggacagcctcttgccaccctatcatgtgcattattttcttgtgttactgtcc atggccgtcaccaaactaggatgctcttgctcctcctcactggctctcaagttcttgacg acttgctaccaacttcttggttacttgcaaaaggggaaaagtgctcgctctgtgttcaag atcatgtatggtagtgatatttttgaggttgtactgacctcattgttcagagctagtagt aggttccttattgagatggatgatcccgcttggctggaattcctccaagtgatagggacg ttcttagaggagctaatgcagatgctcatccaaatgaagctgagcttggtgctcaatttt agaaaaatcaccgcattcctctctagttccaaaccatacacggaggcagcttcaagcaaa caattagaaaatcagaacccccagggcaggcagctccttctggtgtctttaaccaggttg tgccatgtcctgggacctttcctcaaagagcagaagctgggccaagaggccccagcagca ctgtctgagctgctgcagcaggttgtgctgcagacaggagctgtgctgcagctctgctca gtgccgggggcccggggctggcgccttccctcggtcctcatctcatccgtcagcacgctc ttggaagccgacctgggtcagcactgcagggatggaggggccgacatttcccaaggaagc gacaggacgctgctctcccatgttgccctctaccagggtgtttactctcagatactgttg gagttgccagctctcgcgggacatgatcagtcttttcaggcagccttgcagtttttgact ctgttctttttggccccagaactgcatcccaaaaaggactccgtgtttacctccatgttt cattctgtgagaagagttcttgcagatcctgaaattcctgttcaggtcactcaggatatt gagcctcatttgggagccttgttcacccaaatgttagaggttgggacgacagaggacttg aggctggtgatgcagtgtattctccagggactggatgtcagtaacatgtggaaagcagat gtgcagctcttaaaccgagaagcttctcaggagcagcctgtgtccctcacagtggtcggg cctgtcttagatgtcctggctgcactgctgcggcagggggaggaggccatcggcaacccc caccacgtcagcctggccttcagcatccttctcactgtccctttggaccatctgaagccg ctggagtatggaagcgtcttcccgaggctgcacaacgtgctcttctcaatcctgcagtgt caccctaaggtaatgctgaaagccatcccttctttcttgaactctttcaatagattggtg ttttcagttatgcgggaagggcggcagaaggacaaaggaagcatagatgacctgcctacg gtcctaaagtgtgcacgcctggttgaaagaatgtacagccacatcgccgcacgagctgag gagtttgctgtgttttccccatttatggtggcccagtacgtgttggaggtacagaag