GENSCAN 1.0    Date run:  5-Nov-116    Time: 14:54:38

Sequence gi568815581r:5005827_5209870 : 204044 bp : 47.70% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1089   1258  170  1  2  130   75   106 0.993  13.17
 1.02 Intr +   1437   1516   80  2  2   84   48    85 0.915   2.45
 1.03 Intr +   1641   1716   76  0  1  109  113    51 0.920   9.22
 1.04 Intr +   7827   7906   80  2  2  120   74   130 0.999  13.15
 1.05 Intr +   8917   9011   95  1  2   93   90   111 0.911  11.41
 1.06 Intr +  11126  11227  102  0  0   47   81    64 0.678   1.75
 1.07 Intr +  14170  14253   84  2  0   77   75   119 0.997   9.29
 1.08 Intr +  14666  14852  187  0  1   92   64   265 0.999  23.25
 1.09 Intr +  14980  15052   73  1  1   74   81   134 0.907  10.71
 1.10 Intr +  15967  16065   99  0  0   27   61    97 0.606   1.21
 1.11 Intr +  16266  16883  618  2  0   88   91  1023 0.999  95.11
 1.12 Term +  17642  18325  684  1  0  112   47   409 0.991  32.84
 1.13 PlyA +  18856  18861    6                               1.05

 2.05 PlyA -  19690  19685    6                               1.05
 2.04 Term -  27343  27131  213  1  0  145   52    80 0.935   7.33
 2.03 Intr -  27558  27435  124  2  1  101   77    37 0.896   4.49
 2.02 Intr -  28532  27653  880  0  1  119   56   697 0.948  59.99
 2.01 Init -  28780  28651  130  1  1   44   96   241 0.806  19.00
 2.00 Prom -  38669  38630   40                              -4.86

 3.00 Prom +  45830  45869   40                              -5.56
 3.01 Init +  62762  62764    3  1  0  100   77     0 0.147   0.10
 3.02 Intr +  72472  72749  278  0  2   70   81   162 0.504  10.01
 3.03 Intr +  82003  82073   71  1  2  122   66    73 0.906   7.33
 3.04 Term +  85671  87187 1517  1  2  109   33  1095 0.888  96.78
 3.05 PlyA +  88401  88406    6                              -0.45

 4.00 Prom +  90102  90141   40                              -6.66
 4.01 Init +  91271  91366   96  1  0   44  102   173 0.991  14.87
 4.02 Intr +  91614  91805  192  2  0   76   94    98 0.913   8.89
 4.03 Term +  95673  95705   33  2  0  119   48    -2 0.306  -3.51
 4.04 PlyA +  96189  96194    6                               1.05

 5.06 PlyA -  96284  96279    6                              -0.45
 5.05 Term - 100707  99998  710  1  2   54   38   404 0.768  25.67
 5.04 Intr - 103226 103100  127  2  1   84   65    54 0.977   2.95
 5.03 Intr - 104042 103568  475  1  1   49   75   388 0.764  26.67
 5.02 Intr - 106341 105974  368  0  2   70  105   123 0.524   6.14
 5.01 Init - 110337 110320   18  0  0   89   69    48 0.259   1.60
 5.00 Prom - 111280 111241   40                             -10.15

 6.00 Prom + 111727 111766   40                              -5.16
 6.01 Init + 114849 114962  114  2  0   86   64   137 0.623  11.31
 6.02 Intr + 115707 115924  218  2  2   98   70   315 0.015  28.00
 6.03 Intr + 122907 123096  190  0  1   65  109    80 0.014   7.39
 6.04 Intr + 123273 123307   35  2  2   37   99     2 0.040  -6.78
 6.05 Intr + 124110 124235  126  0  0   68   43   110 0.109   4.29
 6.06 Intr + 124541 124613   73  2  1   83   87    72 0.129   6.01
 6.07 Intr + 124776 124858   83  2  2   95   73   126 0.990  10.24
 6.08 Intr + 126570 126609   40  0  1   79  116     5 0.907   0.63
 6.09 Intr + 127617 127724  108  2  0  113   85   160 0.974  18.68
 6.10 Intr + 128061 128170  110  2  2   97   82   138 0.999  13.18
 6.11 Intr + 129408 129456   49  0  1   91  117    57 0.991   7.58
 6.12 Intr + 129982 130102  121  0  1   92   96   113 0.985  12.57
 6.13 Intr + 130417 130682  266  2  2   99   54    43 0.571  -0.77
 6.14 Intr + 131825 131924  100  1  1   95   89    44 0.970   4.88
 6.15 Intr + 132295 132447  153  2  0   79   80   163 0.969  14.64
 6.16 Intr + 133429 133848  420  2  0  106   72   110 0.868   5.12
 6.17 Intr + 135599 135673   75  0  0   95  110    13 0.887   3.69
 6.18 Intr + 136177 136315  139  2  1   53   97    32 0.689   0.12
 6.19 Intr + 136571 136676  106  2  1   65  102   -22 0.907  -2.88
 6.20 Intr + 138864 139037  174  2  0   85   68   191 0.883  16.94
 6.21 Intr + 139639 139753  115  0  1   46   96    -6 0.434  -3.98
 6.22 Intr + 140197 140348  152  2  2   84   90    80 0.887   7.68
 6.23 Intr + 141257 141368  112  1  1   95   82    28 0.950   2.85
 6.24 Intr + 142730 142941  212  0  2   59   49   149 0.432   6.73
 6.25 Intr + 149596 149780  185  0  2   88  109    11 0.584   1.79
 6.26 Intr + 155702 155788   87  2  0  109   71    23 0.770   1.79
 6.27 Intr + 157058 157178  121  2  1   98   90    58 0.984   7.50
 6.28 Intr + 162106 162297  192  0  0  110   84   179 0.999  19.39
 6.29 Intr + 162941 163229  289  1  1   98  100    94 0.634   8.42
 6.30 Intr + 164653 165089  437  2  2  105   89   401 0.999  35.20
 6.31 Intr + 165761 165853   93  1  0  109   82    80 0.999   9.66
 6.32 Intr + 166979 167133  155  1  2  105   12   221 0.312  15.07
 6.33 Term + 173436 173649  214  0  1   40   42   229 0.248  10.20
 6.34 PlyA + 173913 173918    6                               1.05

 7.04 PlyA - 174243 174238    6                              -0.45
 7.03 Term - 176032 175070  963  1  0  -38   45   688 0.811  43.96
 7.02 Intr - 178450 176091 2360  2  2   29   93  1481 0.140 130.05
 7.01 Intr - 203606 203557   50  1  2  118   86    24 0.308   3.52

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 115707 115928  222  2  0   98   48   311 0.983  25.02
S.002 Init - 123259 123208   52  1  1  114  117    90 0.933  15.83

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581r:5005827_5209870|GENSCAN_predicted_peptide_1|782_aa XLKTEEGSVRGALPAVSSPPAPVSPSSPTTHNGELEPSFSPNTESQIGPEEAMERLQETE KIIAELNETWEEKLRKTEALRMEREALLAEMGVAVREDGGTVGVFSPKKTPHLVNLNEDP LMSECLLYHIKDGVTRVGQVDMDIKLTGQFIREQHCLFRSIPQPDGEGSANQGSCSGLVQ VLSDSKCFRHFGLKAVESVQPMVVTLEPCEGAETYVNGKLVTEPLVLKSGNRIVMGKNHV FRFNHPEQARLERERGVPPPPGPPSEPVDWNFAQKELLEQQGIDIKLEMEKRLQDLENQY RKEKEEADLLLEQQRLSCTKFHTRLATTNTYGNFLGAGHGSKYSTWIWEYADSDSGDDSD KRSCEESWRLISSLREQLPPTTVQTIVKRCGLPSSGKRRAPRRVYQIPQRRRLQGKDPRW ATMADLKMQAVKEICYEVALADFRHGRAEIEALAALKMRELCRTYGKPDGPGDAWRAVAR DVWDTVGEEEGGGAGSGGGSEEGARGAEVEDLRAHIDKLTGILQEVKLQNSSKDRELQAL RDRMLRMERVIPLAQDHEDENEEGGEVPWAPPEGSEAAEEAAPSDRMPSARPPSPPLSSW ERVSRLMEEDPAFRRGRLRWLKQEQLRLQGLQGSGGRGGGLRRPPARFVPPHDCKLRFPF KSNPQHRESWPGMGSGEAPTPLQPPEEVTPHPATPARRPPSPRRSHHPRRNSLDGGGRSR GAGSAQPEPQHFQPKKHNSYPQPPQPYPAQRPPGPRYPPYTTPPRMRRQRSAPDLKESGA AV >gi568815581r:5005827_5209870|GENSCAN_predicted_CDS_1|2349_bp ngcctgaagacggaagaagggagtgtcagaggcgccctgccagctgtgtcatctccccca gctccagtttcaccctcatcacccaccacacataatggggagctggagccgtcattctcc cccaacacggagtcccagattgggcctgaggaagccatggagaggctgcaggagacagag aagattatagctgagctgaacgagacatgggaggagaagctacgcaagacagaagccctg aggatggagagagaagcattgctggctgagatgggggtggccgtccgggaggatggggga actgtgggcgtcttctctccaaagaagactccccacctggtgaacctgaacgaagaccct ctgatgtctgagtgtctgctctaccacatcaaagatggcgtcaccagggtcggccaagta gatatggacatcaagctgaccggacagttcattcgggagcaacactgtctgttccggagc 
atcccccagccagatggagaagggtcagccaatcaaggcagttgttctggtctggttcaa gtcctcagtgacagcaaatgcttccggcattttgggttgaaagctgttgaatcagttcag cccatggtggtcactctggagccttgtgaaggagctgagacatatgtgaatgggaagctt gtgacggagccgctggtgctgaagtcagggaataggattgtgatgggcaagaaccacgtt ttccgcttcaaccacccggagcaggcaaggctggaacgggaacgaggggtccccccaccc ccaggaccgccctctgagccagtcgactggaactttgcccagaaggaactgctggagcag caaggcatcgacataaagctggaaatggagaagaggctgcaggatctggagaatcagtac cggaaagaaaaggaagaagccgatcttctgctggagcagcagcgactgtcttgcacaaag ttccacactcgattagctacaactaatacttatggtaactttctcggtgccggccatggt tctaagtactccacgtggatctgggagtatgcagactcggacagcggggatgactctgac aagcgctcttgtgaagagagctggaggctcatctcctccttgcgggagcagctgccgccc accacggtccagaccattgtcaaacgctgtggtctgcccagcagtggcaagcgcagggcc cctcgcagggtttatcagatcccccagcgacgcaggctgcagggcaaagacccccgctgg gccaccatggctgacctgaagatgcaggcggtgaaggagatctgctacgaggtggccctg gctgacttccgccacgggcgggctgagattgaggccctggccgccctcaagatgcgggag ctgtgtcgcacctatggcaagccagacggccccggagacgcctggagggctgtggcccgg gatgtctgggacactgtaggcgaggaggaaggaggtggagctggcagtggtggtggcagt gaggagggagcccgaggggcggaggtggaggacctccgggcccacatcgacaagctgacg gggattctgcaggaggtgaagctgcagaacagcagcaaggaccgggagctgcaggccctg cgggaccgcatgctccgcatggagagggtcatccccctggcccaggatcatgaggatgag aatgaagaaggtggtgaggtcccctgggccccgcctgaaggatcagaggcagcagaggag gcagcccccagtgaccgcatgccgtcagcccggcccccctcgccaccactgtcaagctgg gagcgggtgtcacggctcatggaggaggaccctgccttccgtcgtggtcgtcttcgctgg ctcaagcaggagcagctacggctgcagggactgcagggctctgggggccggggcgggggg ctgcgcaggcccccagcccgctttgtgccccctcacgactgcaagctacgcttccccttc aagagcaacccccagcaccgggagtcttggccagggatggggagcggggaggctccaact ccgctccaaccccctgaggaggtcactccccatccagccacccctgcccgccggcctccg agtccccgaaggtcccaccatccccgcaggaactccctggatggagggggccgatcccgg ggagcgggttctgcacagcctgaaccccagcacttccagcccaaaaagcacaactcttat ccccagccaccccaaccctacccagcccagcggcccccagggccccgctaccccccatac actactcccccacgaatgagacggcagcgttctgcccctgacctcaaggagagtggggca gctgtgtga >gi568815581r:5005827_5209870|GENSCAN_predicted_peptide_2|448_aa 
MAAPTLGRLVLTHLLVALFGMGSWAAVNGIWVELPVVVKDLPEGWSLPSYLSVVVALGNL GLLVVTLWRQLAPGKGEQVPIQVVQVLSVVGTALLAPLWHHVAPVAGQLHSVAFLTLALV LAMACCTSNVTFLPFLSHLPPPFLRSFFLGQGLSALLPCVLALVQGVGRLECPPAPTNGT SGPPLDFPERFPASTFFWALTALLVTSAAAFRGLLLLLPSLPSVTTGGSGPELQLGSPGA EEEEKEEEEALPLQEPPSQAAGTIPGPDPEAHQLFSAHGAFLLGLMAFTSAVTNGVLPSV QSFSCLPYGRLAYHLAVVLGSAANPLACFLAMGVLCRSLAGLVGLSLLGMLFGAYLMALA ILSPCPPLVGTTAGVVLVVLSWVLCLCVFSYVKVAASSLLHGGGRPALLAAGVAIQVGSL LGAGAMFPPTSIYHVFQSRKDCVDPCGP >gi568815581r:5005827_5209870|GENSCAN_predicted_CDS_2|1347_bp atggcagcacccacgctgggccgtctggtgctgacccacctgctggtggccctttttggc atgggctcctgggctgctgtgaacgggatctgggtggagctgcctgtggtggtaaaagac cttccagagggttggagcctcccctcatacctctctgtggttgtggcgctgggaaacctg ggtctgctggtggtgaccctgtggaggcagctggccccgggcaagggcgagcaggtcccc atccaggtggtacaggtgctgagtgtagtgggcacagccctgctggcccctctgtggcac cacgtggccccagtggcagggcagctccactctgtggccttcctaactctggccttggtg ttggcaatggcctgttgtacctctaatgtcactttcctgcccttcctgagccacctgcca cctcctttcttacggtctttcttcctgggtcagggtctcagtgccctactcccctgtgtg ctggccctagtgcaaggtgtgggccgcctcgagtgcccaccagcgcccaccaatggcacc tctgggcctcccctcgacttccctgagcgttttcctgccagcaccttcttctgggcactg actgcccttctggtcacttcagctgccgccttccggggtctcctgttgctgttgccatca ctaccctctgtaaccacagggggctcagggcctgaacttcaactgggatccccaggagca gaggaggaagagaaggaggaagaagaggctttgccattgcaggagccaccgagccaggca gcaggcaccatccctggcccagaccctgaggcccatcagctgttctcagcccatggtgcc ttcctgctgggcctgatggccttcaccagtgccgtgaccaatggcgtgctgccttctgtg cagagcttttcctgtttgccctatgggcgcctggcctaccacctggctgtggtgctgggc agtgccgccaacccccttgcctgcttcctggccatgggcgtgctgtgcaggtccctggca gggctggttggtctttctctgctgggcatgctctttggggcctacctgatggcactggca atcctgagcccctgcccacccctggtgggcaccactgcaggggtggtccttgtggtgctg tcgtgggtgctgtgtctgtgtgtgttctcatatgtgaaggtggctgcaagctccctgctg catggtgggggtcggccggcattgctggcagctggtgtggccatccaagtgggctccctg cttggtgccggtgccatgttccctcccaccagcatctaccacgtgtttcaaagcagaaag gactgtgtagacccctgtggcccctga >gi568815581r:5005827_5209870|GENSCAN_predicted_peptide_3|622_aa 
MGAGSAGGGDVELGLGLEPAKSKPEKARGRRCADHLFLQRSATRADWLECRPGTGSGRCP SPVTPGQQLRALCERPSSAEPQRSERARGSSGLRPLLGLLIGGPQLEARGSVDVFHTDCE MGTENKEVIPKEEISEESEPHGSLLEKFPKVVYQGHEFGAGCEEDMLEGHSRESMEEVIE QMSPQERDFPSGLMIFKKSPSSEKDRENNESERGCSPSPNLVTHQGDTTEGVSAFATSGQ NFLEILESNKTQRSSVGEKPHTCKECGKAFNQNSHLIQHMRVHSGEKPFECKECGKTFGT NSSLRRHLRIHAGEKPFACNECGKAFIQSSHLIHHHRIHTGERPYKCEECGKAFSQNSAL ILHQRIHTGEKPYECNECGKTFRVSSQLIQHQRIHTEERYHECNECGKAFKHSSGLIRHQ KIHTGEKPYLCNECGKGFGQSSELIRHQRIHTGDKPYECNECGKTFGQNSEIIRHIRIHT GEKPYVCKECGKAFRGNSELLRHERIHTGEKPYECFECGKAFRRTSHLIVHQRIHTGEKP HQCNECARTFWDNSELLLHQKIHIGEKPYECSECEKTFSQHSQLIIHQRIHTGEKPYECQ ECQKTFSRSSHLLRHQSVHCME >gi568815581r:5005827_5209870|GENSCAN_predicted_CDS_3|1869_bp atgggagccggctcggcaggcggcggcgacgtggaacttgggctagggctagaaccggcg aaaagcaagccggaaaaggcaagggggcgtcggtgcgctgaccacctgttcctccagcgg agtgcgacgcgcgccgattggctggagtgccgcccgggtaccggaagtggccgatgcccg tcgccagtgacaccgggacaacagctgcgggctctgtgcgagcggcccagcagcgcggag cctcagcggagtgagcgagcgcggggcagtagcggcctgcgacccctgctggggctcctc attggtggaccacagctggaagccagaggctctgttgatgtgttccatacagattgtgag atggggactgagaacaaggaggtgattcccaaggaagaaatttctgaagaatctgagcca catgggtcattattagaaaaatttccaaaagtggtttaccaaggtcatgagtttggagca ggatgtgaagaagacatgttggagggacattcgagagagtccatggaagaggttatagag cagatgtctcctcaggagagagactttccatcagggttgatgatctttaagaaatcaccc tcaagtgagaaagaccgggagaataatgagagtgagagaggctgcagtcccagcccaaat ctggttacacatcagggagatacaacagagggagttagtgcatttgctacctctggccaa aacttcctagagattttagaatctaacaaaacacagagaagttctgtgggagaaaagcct catacatgtaaagaatgtgggaaagcctttaatcagaactcacatctcatccagcatatg agagttcatagtggagaaaaaccctttgaatgtaaagaatgtggaaagacatttggaact aattcaagccttcgacggcacctgagaattcatgctggagaaaaaccctttgcttgtaat gaatgtggaaaggccttcattcagagttcacaccttattcaccatcatagaattcatact ggagagagaccctataaatgtgaagaatgtggtaaagccttcagtcaaaattcagccctt attctacaccagagaatccatactggagagaaaccatatgaatgtaatgaatgtgggaag acctttagggttagttcacagcttattcagcatcagagaattcatactgaagaaagatac catgaatgcaatgagtgtggcaaagccttcaagcatagctcaggccttattagacaccag 
aaaattcatactggagaaaaaccatatctgtgtaatgaatgtgggaagggcttcgggcag agttctgagcttatccggcatcagagaattcatacaggggacaaaccctatgaatgtaat gaatgtgggaaaacttttggccagaactcagagattattagacatattagaattcatact ggtgagaagccctatgtatgtaaggaatgtgggaaggccttcagggggaactcagaactt cttagacatgagagaattcacactggagagaaaccctatgaatgctttgagtgtggaaag gctttcaggcggacctctcaccttattgtccaccagagaattcatactggagagaaaccc catcaatgtaatgagtgtgcaagaaccttttgggataattctgagctgcttctccaccag aaaattcatattggagagaaaccttatgaatgtagcgagtgtgagaaaacatttagccag cattcccaacttatcatacatcagagaattcacactggagagaagccttatgagtgccaa gaatgtcagaagacttttagtcggagctctcacctcctccgacatcaaagtgttcactgt atggagtaa >gi568815581r:5005827_5209870|GENSCAN_predicted_peptide_4|106_aa MRKGAASAAQVDTVVNHSLAACDVKHLRWARRASFPCPCLLVLGIHGHSTLLQYRGQEHH VLFTGRCLLVDPTLLVLVVSKKSPNALEASQPGIREGSFTLPVAQF >gi568815581r:5005827_5209870|GENSCAN_predicted_CDS_4|321_bp atgaggaagggagcagcctcagcagcgcaggtggacacagtcgtcaaccactcgctggcc gcctgcgacgtgaagcacctgcgctgggcgcggcgggcgtccttcccatgtccttgtctc ctagtcctgggaatccacggccactccactctcctgcagtaccgcggacaagaacaccat gtactcttcacaggcaggtgtcttctggtggacccaactctgttggtacttgtcgtctcc aaaaagtccccaaatgcgctagaggccagccagcccggtatccgagagggctctttcaca ttgcctgttgcccagttctga >gi568815581r:5005827_5209870|GENSCAN_predicted_peptide_5|565_aa MGAAAKVQFHKRPPKLDEVTRLQGLTLHLRKDCNSQGPPGHPAAPVQSCRRRALSRSAQV QEPVPGLPASRQRPRAWAKVRERKPAARRILGVGGRGCDLRRRKPAPGVRAVSGAIRGSA PMEPPGPVRGPLQDSSWYEPSAELVQTRMAVSLTAAETLALQGTQGQEKMMMMGPKEEEQ SCEYETRLPGNHSTSQEIFRQRFRHLRYQETPGPREALSQLRVLCCEWLRPEKHTKEQIL EFLVLEQFLTILPEELQSWVRGHHPKSGEEAVTVLEDLEKGLEPEPQVPGPAHGPAQEEP WEKKESLGAAQEALSIQLQPKETQPFPKSEQVYLHFLSVVTEDGPEPKDKGSLPQPPITE VESQVFSEKLATDTSTFEATSEGTLELQQRNPKAERLRWSPAQEESFRQMVVIHKEIPTG KKDHECSECGKTFIYNSHLVVHQRVHSGEKPYKCSDCGKTFKQSSNLGQHQRIHTGEKPF ECNECGKAFRWGAHLVQHQRIHSGEKPYECNECGKAFSQSSYLSQHRRIHSGEKPFICKE CGKAYGWCSELIRHRRVHARKEPSH >gi568815581r:5005827_5209870|GENSCAN_predicted_CDS_5|1698_bp atgggcgcggccgccaaggtacaattccacaagagaccccccaaacttgatgaagtcacc 
cgcttacaagggctcactctccacctgcgcaaggactgcaattcccaggggcctccaggg cacccggccgctccagtccagtcgtgcaggcggcgcgctctttctcgctccgctcaagtc caggaaccggttcccgggctcccagcctctcggcagcgcccacgcgcctgggccaaagtc cgcgaacggaagccggcggcgaggaggattctgggagttggaggccgaggctgcgacctg cgcaggcgcaaacctgcccctggggtgagggctgtaagtggcgcgattcgcggcagcgcc ccgatggaacctcctggtcctgtgagggggcccttgcaagattccagctggtatgagcct tctgcagagctagtgcagactaggatggctgtatcactaacagcagctgaaactctggcc cttcagggtacacagggacaagagaagatgatgatgatgggaccaaaggaagaggaacag tcttgtgagtatgagaccaggctacctgggaaccactctaccagtcaagagatcttccgc caacgcttcaggcatctccgctaccaggagactcctggtccccgggaggccttgagccaa ctacgagtactctgctgtgagtggctgaggccagagaaacacacgaaggagcagatcctg gagttcctggtgctggaacaattcttgaccatcctgcctgaggagctccaatcctgggtg cggggacatcaccctaagagtggagaggaggctgtgactgtgctggaggatttagagaaa ggacttgaaccagagccgcaggtcccaggccctgcacatggacctgcacaggaagagcca tgggagaagaaggaatctctgggagcagcccaggaagcactgagcatccagctccagcct aaggagacccagcctttcccaaagagtgaacaggtatatttacattttctgtcagttgtt acagaagatggcccagagcccaaggacaaaggatcattgccacaaccacccattactgaa gtggaatcacaggtgttctcagaaaaacttgctactgacacctctacatttgaagctacc tctgagggtaccttagaactgcagcagagaaatcccaaagcggagagactgaggtggtcc cctgcccaggaggaaagtttcaggcagatggttgtcatccataaggaaattcccacaggg aagaaagaccatgaatgtagtgaatgtggtaaaaccttcatttataactcacatcttgtt gtccaccagagagttcattctggagagaaaccctataagtgtagtgactgtgggaaaact ttcaaacagagctcaaacctcggtcagcatcagagaattcatacaggagagaaacccttc gaatgtaatgaatgtgggaaggccttcagatggggtgctcatcttgttcagcatcagagg attcactcaggagagaagccctatgagtgtaatgagtgtgggaaggcctttagtcaaagc tcatatctaagtcagcatcggagaattcacagtggagagaaaccttttatatgtaaagaa tgtgggaaagcttatggatggtgctcagagctcattagacatcggagagttcatgccaga aaagagccttcccattga >gi568815581r:5005827_5209870|GENSCAN_predicted_peptide_6|1687_aa MDTVPGVNLSWILNEMHDQDKKLVEKSYKDAKGCFFSMGEPSLIDPCLEVCYGTQLAQLQ GLIRSMEQQLCELCCDAEHQDHEHQVLLDVKTRLEQEIATYSRLLEVEDAQQGPIVMRAM TSECHWCLAPIRSPDCVCVLRQLHTRSGIHKRFCGLKGFLPERHNPGQLIHQNQSGCCSR MYPFDRKLGISVALNIPGGRSYRDLWCLTAVHIHIPGIDHHRLFTLGRMDMVENADSLQA 
QERKDILMKYDKGHRAGLPEDKGPEPVGINSSIDRFGILHETELPPVTAREAKLIDRVYK GIPMNIRGPVWSVLLNIQEIKLKNPGRYQIMKERGKRSSEHIHHIDLDVRTTLRNHVFFR DRYGAKQRELFYILLAYSEYNPEVGYCRDLSHITALFLLYLPEEDAFWALVQLLASERHS LPAAWAPAALGAHKWADQAQVAASHHPVSPGPTPLPGDDQEAQHPPCSGRSVVASKSGLP FCTLAQEASRGTSSQAPGDVPVPPPQGKGRMISLGLTLRLWDVYLVEGEQVLMPITSIAL KVQQKRLMKTSRCGLWARLRNQFFDTWAMNDDTVLKHLRASTKKLTRKQGDLPPPAKREQ GSLAPRPVPASRGGKTLCKGYRQAPPGPPAQFQRPICSASPPWASRFSTPCPGGAVREDT YPVGTQGVPSLALAQGGPQGSWRFLEWKSMPRLPTDLDIGGPWFPHYDFEWSCWVRAISQ EDQLATCWQAEHCGEVHNKDMSWPEEMSFTANSSKIDRQKVPTEKGATGLSNLGNTCFMN SSIQCVSNTQPLTQYFISGRHLYELNRTNPIGMKGHMAKCYGDLVQELWSGTQKSVAPLK LRRTIAKYAPKFDGFQQQDSQELLAFLLDGLHEDLNRVHEKPYVELKDSDGRPDWEVAAE LRSQVKCKTCGHISVRFDPFNFLSLPLPMDSYMDLEITVIKLDGTTPVRYGLRLNMDEKY TGLKKQLRDLCGLNSEQILLAEVHDSNIKNFPQDNQKVQLSVSGFLCAFEIPVPSSPISA SSPTQIDFSSSPSTNGMFTLTTNGDLPKPIFIPNGMPNTVVPCGTEKNFTNGMVNGHMPS LPDSPFTGYIIAVHRKMMRTELYFLSPQENRPSLFGMPLIVPCTVHTRKKDLYDAVWIQV SWLARPLPPQEASIHAQDRDNCMGYQYPFTLRVVQKDGNSCAWCPQYRFCRGCKIDCGED RAFIGNAYIAVDWHPTALHLRYQTSQERVVDKHESVEQSRRAQAEPINLDSCLRAFTSEE ELGESEMYYCSKCKTHCLATKKLDLWRLPPFLIIHLKRFQFVNDQWIKSQKIVRFLRESF DPSAFLVPRDPALCQHKPLTPQGDELSKPRILAREVKKVDAQSSAGKEDMLLSKSPSSLS ANISSSPKGSPSSSRKSGTSCPSSKNSSPNSSPRTLGRSKGRLRLPQIGSKNKPSSSKKN LDASKENGAGQICELADALSRGHMRGGSQPELVTPQDHEVALANGFLYEHEACGNGCGDG YSNGQLGNHSEEDSTDDQREDTHIKPIYNLYAISCHSGILSGGHYITYAKNPNCKWYCYN DSSCEELHPDEIDTDSAYILFYEQQGIDYAQFLPKIDGKKMADTSSTDEDSESDYEKLTR PETLAEWRSAGYRWMMPTWLPMTSAPSEAGSLELHQGLDKLTLAGANLEMQPENLKEDLV YLKKIQQ >gi568815581r:5005827_5209870|GENSCAN_predicted_CDS_6|5064_bp atggacactgtgcctggagtgaacctgagctggatcctgaatgagatgcatgaccaggac aagaaattggtggagaagagctacaaggatgccaagggctgtttcttcagcatgggtgag cccagccttatagacccctgcttggaggtgtgttacgggacccaactggcccagctgcag gggctcatcagaagcatggagcagcagctgtgcgagctctgctgcgacgcggagcaccag gaccatgagcaccaggtccttctggacgtgaagacacggctggagcaggagatcgccacc tacagccgcttgctagaggttgaggacgctcagcaaggccccattgtgatgcgtgccatg acctcagaatgtcactggtgcttagcacctatccgctctccagactgcgtctgtgttcta 
cggcagttacacacacgcagtggtattcacaagcggttttgtggactcaaaggttttctc cctgagaggcataacccaggccagctgattcatcagaatcagagtgggtgctgttcccga atgtacccattcgacaggaaactgggcatctctgtggccctgaacatcccaggaggccga tcgtacagagacctctggtgcctgaccgcagttcacatccacatccctggaatagaccat cacaggctcttcacccttggcaggatggacatggtagagaatgcagatagtttgcaggca caggagcggaaggacatacttatgaagtatgacaagggacaccgagctgggctgccagag gacaaggggcctgagcccgttggaatcaacagcagcattgatcgttttggcattttgcat gagacggagctgcctcctgtgactgcacgggaggcgaagctcatagatcgagtgtacaag ggaattcccatgaacatccggggcccggtgtggtcagtcctcctgaacattcaggaaatc aagttgaaaaaccccggaagataccagatcatgaaggagaggggcaagaggtcatctgaa cacatccaccacatcgacctggacgtgaggacgactctccggaaccatgtcttctttagg gatcgatatggagccaagcagagggaactattctacatcctcctggcctattcggagtat aacccggaggtgggctactgcagggacctgagccacatcaccgccttgttcctcctttat ctgcctgaggaggacgcattctgggcactggtgcagctgctggccagtgagaggcactcc ctgccagctgcctgggctcctgctgcccttggtgcccacaaatgggctgaccaagcccag gtggcagcatctcaccatcccgtgtcccctggcccgaccccacttccaggagatgaccag gaagctcagcacccaccctgttctggccgctctgttgtggcctcaaagtcaggcttgccc ttttgcaccctggcccaggaggcttccaggggaacctccagccaggctccaggggatgtt cctgtcccacctccccagggcaaaggccgcatgatctctctcgggctcaccctgcgcctg tgggacgtgtatttggtggaaggagaacaggtgttgatgccaataaccagcattgctctt aaggttcagcagaagcgcctcatgaagacatccaggtgtggcctgtgggcacgtctgcgg aaccaattcttcgatacctgggccatgaacgatgacaccgtgctcaagcatcttagggcc tctacgaagaaactaacaaggaagcaaggggacctgccacccccagccaaacgcgagcaa gggtccttggcacccaggcctgtgccggcttcacgtggtgggaagaccctctgcaagggg tataggcaggcccctccaggcccaccagcccagttccagcggcccatttgctcagcttcc ccgccatgggcatctcgtttttccacgccctgtcctggtggggctgtccgggaagacacg taccctgtgggcactcagggtgtgcccagcctggccctggctcagggaggacctcagggt tcctggagattcctggagtggaagtcaatgccccggctcccaacggacctggatataggg ggcccttggttcccccattatgattttgaatggagctgctgggtccgtgccatatcccag gaggaccagctggccacctgctggcaggctgaacactgcggagaggttcacaacaaagat atgagttggcctgaggagatgtcttttacagcaaatagtagtaaaatagatagacaaaag gttcccacagaaaagggagccacaggtctaagcaacctgggaaacacatgcttcatgaac 
tcaagcatccagtgcgttagtaacacacagccactgacacagtattttatctcagggaga catctttatgaactcaacaggacaaatcccattggtatgaaggggcatatggctaaatgc tatggtgatttagtgcaggaactctggagtggaactcagaagagtgttgccccattaaag cttcggcggaccatagcaaaatatgctcccaagtttgatgggtttcagcaacaagactcc caagaacttctggcttttctcttggatggtcttcatgaagatctcaaccgagtccatgaa aagccatatgtggaactgaaggacagtgatggccgaccagactgggaagtagctgcagag ctaagatctcaagtcaaatgcaagacatgtgggcatataagtgtccgatttgaccctttc aattttttgtctttgccactaccaatggacagttacatggacttagaaataacagtgatt aagttagatggtactacccctgtacggtatggactaagactgaatatggatgaaaagtac acaggtttaaaaaaacagctgagggatctctgtggacttaattcagaacaaatcctacta gcagaagtacatgattccaacataaagaactttcctcaggataaccaaaaagtacaactc tcagtgagcggatttttgtgtgcatttgaaattcctgtcccttcatctccaatttcagct tctagtccaacacaaatagatttctcctcttcaccatctacaaatggaatgttcacccta actaccaatggggacctacccaaaccaatattcatccccaatggaatgccaaacactgtt gtgccatgtggaactgagaagaacttcacaaatggaatggttaatggtcacatgccatct cttcctgacagcccctttacaggttacatcattgcagtccaccgaaaaatgatgaggaca gaactgtatttcctgtcacctcaggagaatcgccccagcctctttggaatgccattgatt gttccatgcactgtgcatacccggaagaaagacctatatgatgcggtttggattcaagta tcctggttagcaagaccactcccacctcaggaagctagtattcatgcccaggatcgtgat aactgtatgggctatcaatatccattcactctacgagttgtgcagaaagatgggaactcc tgtgcttggtgcccacagtatagattttgcagaggctgtaaaattgattgtggggaagac agagctttcattggaaatgcctatattgctgtggattggcaccccacagcccttcacctt cgctatcaaacatcccaggaaagggttgtagataagcatgagagtgtggagcagagtcgg cgagcgcaagccgagcccatcaacctggacagctgtctccgtgctttcaccagtgaggaa gagctaggggaaagtgagatgtactactgttccaagtgtaagacccactgcttagcaaca aagaagctggatctctggaggcttccacccttcctgattattcaccttaagcgatttcaa tttgtaaatgatcagtggataaaatcacagaaaattgtcagatttcttcgggaaagtttt gatccgagtgcttttttggtaccacgagacccggccctctgccagcataaaccactcaca ccccagggggatgagctctccaagcccaggattctggcaagagaggtgaagaaagtggat gcgcagagttcggctggaaaagaggacatgctcctaagcaaaagcccatcctcactcagc gctaacatcagcagcagcccaaaaggttctccttcttcatcaagaaaaagtggaaccagc tgtccctccagcaaaaacagcagccctaatagcagcccacggactttggggaggagcaaa 
gggaggctccggctgccccagattggcagcaaaaataagccgtcaagtagtaagaagaac ttggatgccagcaaagagaatggggctgggcagatctgtgagctggctgacgccttgagc cgagggcatatgcgggggggcagccaaccagagctggtcactcctcaggaccatgaggta gctttggccaatggattcctttatgagcatgaagcatgtggcaatggctgtggcgatggc tacagcaatggtcagcttggaaaccacagtgaagaagacagcactgatgaccaaagagaa gacactcatattaagcctatttataatctatatgcaatttcatgccattcaggaattctg agtgggggccattacatcacttatgccaaaaacccaaactgcaagtggtactgttataat gacagcagctgtgaggaacttcaccctgatgaaattgacaccgactctgcctacattctt ttctatgagcagcaggggatagactacgcacaatttctgccaaagattgatggcaaaaag atggcagacacaagcagtacggatgaagactctgagtctgattacgaaaaactcaccagg ccggagacgttggcagaatggagaagtgctggctacaggtggatgatgcccacctggctg ccaatgacttctgcaccaagcgaggctgggtctctggagctgcaccaggggctggacaag ctgaccctggccggagccaacctggagatgcagcctgagaacctcaaggaggacctggtc tacctgaagaagattcaacaatga >gi568815581r:5005827_5209870|GENSCAN_predicted_peptide_7|1124_aa XLYEGFHLAINTSLGGQDSEDRNKMKEWKSKMEISEEKKSARAASEKLQRQITQECELVE TSNSEDRLLKHWVSPLKDAMRHLPSQESGIREMHIIPQKAIVGEIGHGCNEGEKILSAGE SSHRYEVSGQNFKQKSGLTEHQKIHNINKTYECKECEKTFNRSSNLIIHQRIHTGNKPYV CNECGKDSNQSSNLIIHQRIHTGKKPYICHECGKDFNQSSNLVRHKQIHSGGNPYECKEC GKAFKGSSNLVLHQRIHSRGKPYLCNKCGKAFSQSTDLIIHHRIHTGEKPYECYDCGQMF SQSSHLVPHQRIHTGEKPLKCNECEKAFRQHSHLTEHQRLHSGEKPYECHRCGKTFSGRT AFLKHQRLHAGEKIEECEKTFSKDEELREEQRIHQEEKAYWCNQCGRNFQGTSDLIRHQV THTGEKPYECKECGKTFNQSSDLLRHHRIHSGEKPCVCSKCGKSFRGSSDLIRHHRVHTG EKPYECSECGKAFSQRSHLVTHQKIHTGEKPYQCTECGKAFRRRSLLIQHRRIHSGEKPY ECKECGKLFIWRTAFLKHQSLHTGEKLECEKTFSQDEELRGEQKIHQEAKAYWCNQCGRA FQGSSDLIRHQVTHTREKPYECKECGKTFNQSSDLLRHHRIHSGEKPYVCNKCGKSFRGS SDLIKHHRIHTGEKPYECSECGKAFSQRSHLATHQKIHTGEKPYQCSECGNAFRRRSLLI QHRRLHSGEKPYECKECGKLFMWHTAFLKHQRLHAGEKLEECEKTFSKDEELRKEQRTHQ EKKVYWCNQCSRTFQGSSDLIRHQSSDLLRHHRIHSGEKPYVCNKCGESFRSSSDLIKHH RVHTGEKPHECSECGKVFSQRSHLVTHQKIHTGEKPYQCTECEKAFRRRSLLIQRRRIHS GEKPYECKECGKLFMWHTAFLKHQRLHAGEKLEEREKTFSKDEELRGEQKIHQEEKAYWC NQCGRAFQGSSDLIGHQVTHTGEKPYECKECGKTFNQSSDLLRHHRIHSGEKPYVCNKCG KSFRGSSDLIRHHRVHTGEKPYECPECWKAFSQNSHLVSHQRIHTREKPFECSNCGKAFS 
GWTAFLKHQKLHIGKEFEDCKSLQTGPILIGSRNLMNAVKLGKV >gi568815581r:5005827_5209870|GENSCAN_predicted_CDS_7|3375_bp nnactctatgaaggcttccaccttgctattaatacctccttgggaggccaagattctgaa gacaggaataagatgaaggaatggaaatcaaagatggaaatttctgaagaaaagaagtca gcaagggctgcatccgaaaaactccaaagacaaatcacccaggaatgtgagttagttgaa accagtaattctgaggacagattattgaagcactgggtaagccctttaaaggatgcaatg agacatctcccttcccaagagagcggtatcagggaaatgcatattatcccccagaaagcc attgtgggagagattggccatggatgtaatgaaggagaaaaaatactttctgcaggagaa agctcccatagatatgaggttagtggccaaaacttcaaacagaagtcaggattaactgaa catcagaaaattcataatataaataagacctatgaatgtaaggaatgtgaaaaaaccttc aacaggagttcaaacctgatcatacatcagagaattcatacaggaaataagccatatgtg tgtaatgaatgtgggaaagactctaatcaaagttcaaatcttattatacatcagagaatt catacaggaaagaaaccttatatatgtcatgaatgtggaaaagacttcaatcagagctcc aatctggtgagacataagcaaattcacagtggtgggaatccctatgagtgcaaagagtgt gggaaggcttttaagggaagctcaaaccttgtcctgcaccagagaatccacagtaggggg aagccatatttatgcaataaatgtgggaaggctttcagtcaaagcacagatcttattata catcacagaattcacactggagagaaaccctatgaatgttatgactgtggacagatgttc agtcaaagttcacaccttgtcccacatcagagaattcacactggagagaaacccctcaaa tgtaatgaatgtgaaaaagccttcaggcagcattctcaccttactgaacaccagagactc cacagtggagagaaaccctatgaatgtcacagatgtgggaagaccttcagtgggcgcaca gcttttcttaaacatcagagattgcatgctggagagaaaattgaagaatgtgagaaaacc ttcagcaaggatgaggagcttagggaagagcagagaattcaccaggaagagaaagcttat tggtgtaatcagtgtggtaggaatttccagggcacctcagacctcatcagacatcaggta actcatacaggagagaaaccatatgaatgtaaagaatgtgggaaaactttcaatcagagc tcagaccttctgagacatcatagaattcacagtggagaaaaaccttgtgtatgtagcaaa tgtgggaaatcttttaggggcagctcagatcttattagacaccatcgtgttcatactgga gagaaaccctatgaatgtagtgaatgtgggaaagcctttagccagaggtcacaccttgtt acacaccagaaaatccatactggagagaagccctatcagtgcactgaatgtgggaaagcc ttcaggcggcgttcactccttattcaacatcggagaattcatagtggtgagaaaccctat gaatgtaaggaatgtgggaagctcttcatttggcgcacagctttcctcaaacatcagagc ctgcatactggagagaaacttgaatgtgagaaaaccttcagccaggatgaggagcttagg ggagagcagaaaattcaccaggaagcgaaagcttattggtgtaatcagtgtggtagggct ttccagggcagctcagacctcatcagacatcaggtaactcatacaagagagaaaccatat 
gaatgcaaagaatgtgggaaaactttcaatcagagctcagaccttctgagacatcataga attcacagtggagaaaaaccttatgtatgcaacaaatgtgggaaatcttttaggggtagc tcagatcttattaaacaccatcgtattcatactggagagaaaccctatgaatgtagtgaa tgtgggaaagccttcagccagaggtcacaccttgctacacaccagaaaatccatactgga gagaaaccctatcagtgcagtgaatgtgggaatgccttcaggcggcgttccctccttatt caacatcggagacttcatagtggtgagaaaccctatgaatgtaaggaatgtgggaaactc ttcatgtggcacacggctttcctcaaacatcagagactgcatgctggagagaaacttgaa gaatgtgagaaaaccttcagcaaggatgaggagcttagaaaagagcagagaactcaccag gaaaagaaagtttattggtgtaatcagtgtagtaggaccttccagggcagctcagatctc atcagacatcagagctcagaccttctgagacatcatagaattcacagtggagaaaaacct tacgtatgcaataaatgtggggaatcttttaggagcagctcagatcttattaaacaccat cgtgttcatactggagagaaacctcatgaatgtagtgaatgtgggaaagtctttagccag aggtcccaccttgtcacacaccagaaaatccacactggagagaagccctatcagtgcact gaatgtgaaaaagccttcaggcggcgttcactccttattcaacgtcggagaattcatagt ggtgagaaaccctatgaatgtaaggaatgtgggaaactcttcatgtggcacacagctttc ctcaaacatcagagactgcatgctggagagaaacttgaagaacgtgagaaaaccttcagc aaggatgaggagcttaggggagagcagaaaattcaccaagaagagaaagcttattggtgt aatcagtgtggtagggctttccagggcagctcagacctcatcggacatcaggtaactcat acaggagagaaaccatatgaatgtaaagaatgtgggaaaactttcaatcagagctcagac cttctgagacatcatagaattcacagtggagaaaaaccttatgtatgcaacaaatgtggg aaatcttttaggggcagctcagatcttattagacaccatcgtgttcatactggagagaaa ccctatgaatgccctgaatgttggaaggccttcagtcagaactcacaccttgtcagtcat caaagaattcataccagagagaaaccctttgaatgtagcaactgtggtaaggccttcagt gggtggacagcttttcttaagcaccagaaacttcacattggaaaggaatttgaagactgt aagagtctacaaacaggacctattctaataggtagcagaaacctaatgaatgcagtaaaa ctaggaaaagtctga