GENSCAN 1.0 Date run: 4-Nov-116 Time: 09:30:29 Sequence gi568815583f:41393928_41603163 : 209236 bp : 49.33% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 Intr - 1117 932 186 1 0 50 87 137 0.593 9.69 1.02 Intr - 3088 2560 529 0 1 47 73 236 0.683 10.94 1.01 Init - 10075 10002 74 1 2 88 116 45 0.917 7.84 1.00 Prom - 18849 18810 40 -3.46 2.00 Prom + 21504 21543 40 -6.76 2.01 Init + 23189 23386 198 1 0 54 94 392 0.996 33.30 2.02 Intr + 44394 44504 111 2 0 76 90 92 0.913 8.78 2.03 Intr + 58974 59121 148 2 1 101 105 100 0.836 12.81 2.04 Intr + 63745 63949 205 2 1 80 116 208 0.984 21.06 2.05 Intr + 70844 70958 115 2 1 40 91 116 0.914 7.55 2.06 Intr + 72214 72325 112 0 1 108 115 173 0.999 21.95 2.07 Intr + 76330 76465 136 2 1 98 57 261 0.981 23.63 2.08 Intr + 77245 77422 178 1 1 89 11 106 0.752 2.92 2.09 Intr + 80693 80775 83 1 2 94 70 30 0.567 0.24 2.10 Intr + 81598 81685 88 1 1 34 88 65 0.694 1.07 2.11 Intr + 81785 81892 108 1 0 75 94 58 0.915 5.58 2.12 Intr + 82519 82596 78 0 0 85 116 66 0.992 8.85 2.13 Intr + 83238 83359 122 2 2 25 89 188 0.995 11.89 2.14 Intr + 83531 83588 58 2 1 117 83 117 0.999 12.99 2.15 Intr + 84621 84698 78 2 0 76 82 58 0.930 3.75 2.16 Intr + 85176 85280 105 2 0 61 26 93 0.606 0.81 2.17 Intr + 86287 86398 112 0 1 76 91 113 0.942 10.35 2.18 Term + 86654 86760 107 0 2 121 53 70 0.921 5.37 2.19 PlyA + 88343 88348 6 1.05 3.00 Prom + 88384 88423 40 -8.76 3.01 Init + 88518 88525 8 2 2 85 89 4 0.759 0.41 3.02 Intr + 89737 89807 71 1 2 90 58 89 0.832 4.93 3.03 Term + 92251 92468 218 2 2 89 43 133 0.614 6.21 3.04 PlyA + 95226 95231 6 -0.45 4.00 Prom + 95593 95632 40 -7.56 4.01 Init + 100028 100489 462 1 0 92 96 766 0.964 71.50 4.02 Intr + 107536 107924 389 0 2 78 85 655 0.610 57.79 4.03 Intr + 108070 108274 205 1 1 125 100 489 0.999 53.00 4.04 Intr + 108475 108576 102 0 0 86 85 153 0.999 15.17 4.05 Intr + 108861 108932 72 2 0 96 99 121 0.985 13.60 4.06 Term + 109006 109239 234 0 0 4 54 380 0.754 22.52 4.07 PlyA + 109605 109610 6 1.05 5.44 PlyA - 109734 109729 6 1.05 5.43 Term - 110317 110069 249 1 0 108 52 32 0.708 -2.70 5.42 Intr - 110505 110415 91 2 1 87 69 139 0.993 11.90 5.41 Intr - 110713 110579 135 0 0 80 56 136 0.998 9.28 5.40 Intr - 110947 110846 102 0 0 113 19 118 0.985 6.69 5.39 Intr - 111137 111045 93 1 0 73 66 89 0.940 4.28 5.38 Intr - 111378 111281 98 0 2 86 66 166 0.998 13.01 5.37 Intr - 111603 111474 130 2 1 52 85 0 0.405 -3.20 5.36 Intr - 111850 111786 65 1 2 107 55 72 0.362 3.32 5.35 Intr - 112078 111988 91 0 1 112 100 44 0.957 8.00 5.34 Intr - 113363 113168 196 0 1 68 111 83 0.956 7.07 5.33 Intr - 113730 113587 144 1 0 92 55 49 0.695 2.25 5.32 Intr - 114294 114142 153 1 0 51 71 119 0.489 6.54 5.31 Intr - 115202 115104 99 0 0 81 72 92 0.917 6.98 5.30 Intr - 117419 117237 183 0 0 126 61 112 0.997 12.06 5.29 Intr - 117690 117495 196 0 1 33 94 185 0.551 12.59 5.28 Intr - 118036 117890 147 1 0 115 87 52 0.911 8.23 5.27 Intr - 118338 118188 151 2 1 91 38 155 0.696 10.96 5.26 Intr - 118951 118780 172 2 1 96 84 127 0.994 12.10 5.25 Intr - 119193 119050 144 1 0 63 88 72 0.938 4.95 5.24 Intr - 119833 119740 94 1 1 -25 114 111 0.178 1.94 5.23 Intr - 121706 121623 84 2 0 18 90 77 0.076 0.92 5.22 Intr - 123764 123630 135 2 0 118 21 89 0.173 5.96 5.21 Intr - 123930 123871 60 0 0 142 75 64 0.998 9.43 5.20 Intr - 124255 124079 177 1 0 90 89 177 0.992 18.12 5.19 Intr - 127220 126464 757 1 1 82 80 574 0.980 47.47 5.18 Intr - 127953 127811 143 0 2 105 105 156 0.968 18.15 5.17 Intr - 128323 128171 153 1 0 52 78 93 0.843 4.97 5.16 Intr - 130045 129844 202 0 1 47 89 19 0.497 -2.81 5.15 Intr - 130327 130169 159 0 0 89 68 117 0.817 8.90 5.14 Intr - 131221 131064 158 1 2 84 94 164 0.999 15.41 5.13 Intr - 133141 132971 171 1 0 139 93 -15 0.948 4.24 5.12 Intr - 133374 133240 135 0 0 63 76 221 0.998 19.16 5.11 Intr - 133678 133496 183 1 0 102 98 239 0.999 26.28 5.10 Intr - 134100 133933 168 0 0 77 66 156 0.974 12.44 5.09 Intr - 134409 134308 102 0 0 93 96 26 0.939 4.27 5.08 Intr - 135641 135543 99 2 0 87 78 85 0.994 7.71 5.07 Intr - 136052 135937 116 0 2 85 78 158 0.997 14.67 5.06 Intr - 137275 137096 180 2 0 114 41 77 0.700 5.44 5.05 Intr - 141008 140787 222 0 0 76 52 197 0.903 13.00 5.04 Intr - 141705 141585 121 0 1 117 84 119 0.999 14.47 5.03 Intr - 142291 142202 90 1 0 83 96 50 0.944 5.49 5.02 Intr - 142722 142574 149 1 2 103 30 -9 0.860 -5.25 5.01 Init - 143198 143018 181 2 1 76 79 170 0.871 14.25 5.00 Prom - 144750 144711 40 -3.76 6.00 Prom + 159557 159596 40 -6.46 6.01 Init + 165349 165454 106 0 1 83 91 281 0.827 26.28 6.02 Intr + 167200 167383 184 2 1 112 61 316 0.841 30.15 6.03 Intr + 167612 167712 101 2 2 66 100 83 0.587 7.05 6.04 Intr + 168621 168791 171 1 0 132 99 31 0.983 8.51 6.05 Intr + 171099 171214 116 1 2 72 96 126 0.894 11.97 6.06 Intr + 173433 173610 178 2 1 101 82 165 0.998 16.69 6.07 Intr + 174290 174435 146 0 2 69 110 121 0.984 12.40 6.08 Intr + 174951 175095 145 2 1 70 94 78 0.936 6.46 6.09 Intr + 176100 176229 130 1 1 113 82 177 0.998 19.35 6.10 Intr + 176313 176413 101 0 2 89 84 197 0.998 19.15 6.11 Intr + 176677 176772 96 2 0 76 95 170 0.998 16.48 6.12 Intr + 177111 177191 81 1 0 52 113 122 0.996 10.71 6.13 Intr + 177668 177760 93 0 0 87 85 99 0.999 9.44 6.14 Intr + 178516 178637 122 2 2 113 59 97 0.996 9.51 6.15 Intr + 179075 179208 134 1 2 92 39 217 0.888 16.64 6.16 Intr + 179381 179540 160 2 1 97 77 214 0.992 21.19 6.17 Intr + 179752 179888 137 0 2 65 67 226 0.972 17.57 6.18 Intr + 183959 184193 235 2 1 133 23 160 0.643 11.69 6.19 Intr + 191533 191779 247 0 1 52 43 145 0.402 3.43 6.20 Intr + 194256 194347 92 1 2 49 88 79 0.594 3.71 6.21 Intr + 194756 194933 178 1 1 27 115 87 0.604 4.79 6.22 Intr + 195091 195320 230 2 2 57 58 86 0.213 0.09 6.23 Term + 195362 195580 219 1 0 -17 55 154 0.425 -1.36 6.24 PlyA + 195712 195717 6 1.05 7.03 PlyA - 198263 198258 6 1.05 7.02 Term - 205481 205297 185 0 2 105 43 70 0.892 1.91 7.01 Init - 207643 207583 61 1 1 92 66 35 0.807 3.11 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 123764 123615 150 2 0 118 45 105 0.821 7.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:41393928_41603163|GENSCAN_predicted_peptide_1|263_aa MGHGETARIKGCGKRIEESVMGAERKFSKPTSALYPFLGIRFAEYSSSLQKPVASPGKAS SQRKTEGDLQGDHQKEVALDITSSEEKPDVSFDKAIRDEAIYHFRLLKDEIVDHWRGPEG HPLHEVLLEQAKVVWQFRGKEDLDKWTVTSDKTIGGRSEVFLKMGKNNQSALLYGTLSSE APQDGESTRSGYCAMISRIPRGAFERKMSYDWSQFNTLYLRVRGDGRPWMVNIKEDTDFF QRTNQMYSYFMFTRGGPYWQEVK >gi568815583f:41393928_41603163|GENSCAN_predicted_CDS_1|789_bp atgggccatggagagacagcccgcattaaaggttgtgggaagagaatagaagagtcagtg atgggggcagaaagaaaattctctaagccaacttctgccttgtatccatttttgggtatt cgctttgcagagtattccagtagtcttcagaaaccagtggcttctcctggcaaagcctcc tcacagaggaagactgaaggggatttgcaaggagatcaccagaaagaagttgctttggat ataacttcttctgaggagaagcctgatgttagtttcgataaagcaattagagatgaagca atataccattttaggcttttgaaggatgaaattgtggatcattggagaggaccggaaggc caccctctgcatgaggtcttgctggaacaagccaaggttgtctggcaattccgggggaaa gaagatttggataagtggacagtgacttctgataagacgattggaggcagaagtgaagtg tttttgaaaatgggcaagaataaccaaagtgcactgctatatggaactctgagctctgag gcgcctcaggacggggagtctacccgaagtgggtactgtgcaatgatatccaggattcca aggggtgcttttgagaggaagatgtcttacgattggtcccagttcaatactctgtatctc cgtgtacgtggggatggtcggccttggatggtgaatatcaaggaggacacagatttcttc cagaggacgaatcagatgtatagttacttcatgttcacccgcgggggaccctactggcag gaggtcaag >gi568815583f:41393928_41603163|GENSCAN_predicted_peptide_2|713_aa MRGRLCVGRAAAAAAAVAVPLAGGQEGSPGGGRRGSRGTTMVKKRKGRVVIDSDTEDSGS DENLDQELLSLAKRKRSDSEEKEPPVSQPAASSDSETSDSDDEWTFGSNKNKKKGKARKI EKKGTMKKQANKTASSGSSDKDSSAESSAPEEGEVSDSDSNSSSSSSDSDSSSEDEEFHD GYGEDLMGDEEDRARLEQMTEKEREQELFNRIEKREVLKRRFEIKKKLKTAKKKEKKEKK KKQEEEQEKKKLTQIQESQVTSHNKERRSKRDEKLDKKSQAMEELKAEREKRKNRTAELL AKKQPLKTSEVYSDDEEEEEDDKSSEKSDRSSRTSSSDEEEEKEEIPPKSQPVSLPEELN RVRLSRHKLERWCHMPFFAKTVTGCFVRIGIGNHNSKPVYRVAEITGVVETAKVYQLGGT RTNKGLQLRHGNDQRVFRLEFVSNQEFTESEFMKWKEAMFSAGMQLPTLDEINKKELSIK EALNYKFNDQDIEEIVKEKERFRKAPPNYAMKKTQLLKEKAMAEDLGDQDKAKQIQDQLN ELEERAEALDRQRTKNISAISYINQRNREWNIVESEKALVAESHNMKNQQMDPFTRRQCK PTIVSNSRDPAVQAAILAQLNAKYGSGVLPDAPKEMSKASVGQGKDKDLNSKSASDLSED LFKVHDFDVKIDLQVPSSESKALAITSKAPPAKDGAPRRSLNLEDYKKRRGLI >gi568815583f:41393928_41603163|GENSCAN_predicted_CDS_2|2142_bp atgcgcggtcgcctttgtgtgggtcgagcagcggcggcggcggcggcagtggcggtccca ctggcaggcgggcaagaggggagtccgggcggcggccggcgtgggagccgggggaccacc atggtaaagaagcggaaaggccgcgtcgtgatcgactcggacacagaggacagcggcagc gacgagaacctggatcaggagctcttgtccctggcaaagcgaaagcgcagtgactctgag gagaaggagccgcctgtgagtcagcctgcagcctcgtcagactcggagacgtctgacagt gacgatgagtggacatttgggagcaataaaaataagaagaaaggaaaagccagaaaaata gagaagaaaggaaccatgaagaaacaggccaacaaaactgcctcctcaggcagttcagac aaagacagttcagctgagagctcagcccctgaggaaggtgaagtgtcagactctgacagc aacagctcctcttccagttcagattcagactcttcctcagaagatgaagagttccatgat ggctatggagaagacctcatgggagatgaggaagacagggcccgtctggaacagatgaca gagaaagagagagagcaagaactgttcaatcgcatagagaagagggaggtgttgaaaaga agatttgaaatcaagaaaaaactaaaaacagccaaaaagaaagaaaagaaagaaaagaag aaaaagcaagaagaggagcaagaaaagaaaaaactgacacagattcaagaatctcaggta acatcccacaacaaggaacggcgttccaagcgggatgagaaactagacaagaaatctcaa gccatggaggagctaaaagcagagcgagaaaaacgaaagaacagaacagctgagctcctt gccaaaaaacagccattaaaaaccagtgaggtctactctgatgatgaagaggaggaagag gatgacaaatccagtgaaaagtcagaccgctcatcacgaacatcatcgtctgatgaagaa gaggagaaagaagagatccctcccaaatcccaaccagtttccttacctgaagaattgaat cgggttcgattatcacggcataagctagaacgctggtgtcacatgcccttctttgctaaa actgtcacaggatgttttgtgcggattggcatcggaaaccacaacagcaaaccagtttac cgggtcgctgagattacgggtgttgtggaaactgccaaagtttaccaactaggtggcacc agaacaaacaaagggctgcaactacggcatggcaatgaccaacgcgtgttccgtttagag tttgtctcaaaccaagaattcaccgaaagtgagtttatgaagtggaaagaagcgatgttc tctgctggcatgcagttgcccactctagatgaaatcaataaaaaggaattatctattaaa gaagctcttaattataaattcaatgatcaggacattgaagagattgtaaaagagaaagaa aggttcagaaaagctccacccaactacgctatgaagaagactcagctactgaaggaaaag gccatggctgaggacctgggggatcaggacaaggccaaacaaatccaagatcaactgaat gagctggaggaacgggcagaggccctggaccgccagcggaccaagaacatatccgctatc agttacatcaaccagcggaaccgggagtggaacattgtagagtctgagaaggcccttgtg gctgaaagtcacaacatgaaaaaccaacagatggatccctttactcggcggcagtgcaag cctaccatcgtttctaattccagagacccagctgttcaagctgccatcttggcccagctg aatgcaaaatacggttctggagtgttaccagatgctccaaaggaaatgagcaaggcaagt gtgggtcaaggcaaagataaagatttgaattctaagtcagccagtgacctctcagaagat ctgttcaaagtacacgattttgatgtgaagattgacttacaagttcccagctcagagtca aaggctttagccatcacctccaaggctccgccagccaaggatggggctccaaggagatct ctgaacttggaagactacaaaaaacgacgagggcttatttga >gi568815583f:41393928_41603163|GENSCAN_predicted_peptide_3|98_aa MDFPVPTKVKVAIVNSLAAWFPGSLSAMDSPASLSACDAAQPFTWQARKPQVDSISFAGR ALRRSPLGVSTTPRTGLGATLVRANGPRIPGPVRLLRR >gi568815583f:41393928_41603163|GENSCAN_predicted_CDS_3|297_bp atggacttccctgtgcccacaaaagtgaaagtggccatcgtcaactctttagcagcctgg ttccccggaagcctctctgccatggatagccctgcttcgctaagcgcgtgcgatgcagca cagcccttcacctggcaagcccggaagcctcaggttgactccatcagttttgccgggaga gcccttcggcgctccccgcttggtgtctccaccaccccccgcaccggcctgggcgccacc cttgtccgcgccaacggtccccgcatccctggccccgtgcgcctcctgcgccgttag >gi568815583f:41393928_41603163|GENSCAN_predicted_peptide_4|487_aa MARPGGARPCSPGLERAPRRSVGELRLLFEARCAAVAAAAAAGEPRARGAKRRGGQVPNG LPRAPPAPVIPQLTVTAEEPDVPPTSPGPPERERDCLPAAGSSHLQQPRRLSTSSVSSTG SSSLLEDSEDDLLSDSESRSRGNVQLEAGEDVGQKNHWQKIRTMVNLPVISPFKKRYAWV QLAGHTGEQWGGWAGARDGKGLGGADGCRSSGSFKAAGTSGLILKRCSEPERYCLARLMA DALRGCVPAFHGVVERDGESYLQLQDLLDGFDGPCVLDCKMGVRTYLEEELTKARERPKL RKDMYKKMLAVDPEAPTEEEHAQRAVTKPRYMQWREGISSSTTLGFRIEGIKKADGSCST DFKTTRSREQVLRVFEEFVQGDEEVLRRYLNRLQQIRDTLEVSEFFRRHEGRGLTVRGSQ VIGSSLLFVHDHCHRAGVWLIDFGKTTPLPDGQILDHRRPWEEGNREDGYLLGLDNLIGI LASLAER >gi568815583f:41393928_41603163|GENSCAN_predicted_CDS_4|1464_bp atggcgcggccggggggcgcgaggccctgcagcccggggctggagcgggccccgcgccgg agtgtcggggagctgcgcctgctcttcgaggcgcgctgtgcggcggtcgctgcggccgcc gccgcgggggagccccgggcccgcggggccaagcggcgtgggggacaggtccccaacggg cttccgcgggctcccccggccccggtgatccctcagctgaccgtgacagccgaggagccc gacgtgcccccgaccagccctgggccgccggagcgggagagggactgcctcccggcagcg ggctcttcgcacctgcagcagccgcgccgcctttccacctcgtcggtctcctccactggc tcctcgtcgctgctcgaggactcggaggacgacctgctgagcgacagtgagagccggagc cgcggcaacgtgcagctggaagcgggcgaggacgtgggtcagaaaaaccactggcagaag atccggaccatggtcaatctgccggtcataagccctttcaagaagcgctacgcctgggtg cagctggcagggcacactggtgagcagtggggcgggtgggcgggtgcccgcgacgggaag gggctgggcggcgctgacggatgccggtcctcagggagttttaaggcggcgggcaccagc gggctgatcctgaagcgctgctcggagccggagcgctactgcctggcgcggctgatggct gacgcgctgcgcggctgcgtgcctgccttccacggcgtggtggagcgcgacggcgaaagc tacctgcagctgcaggacctgctcgatggcttcgacggaccttgtgtgctcgactgcaaa atgggcgtcaggacttacctagaggaggagctgaccaaggcccgtgagcggcccaagctg cggaaggacatgtacaagaaaatgctggcggtggatcctgaagctcccacggaggaggag cacgcgcagcgcgccgtcaccaagccgcgctacatgcagtggcgggaaggcatcagctcc agcaccaccctcggcttccgcatcgagggcatcaagaaagcggacggctcctgcagcacc gacttcaagactacgcgaagccgagagcaggtgcttcgcgtctttgaagagtttgtgcaa ggagatgaggaagtgctgaggcggtatctgaaccgcctgcagcagatccgggacaccctg gaggtatccgagttcttcaggaggcacgagggccgcggcctgacggtgcggggctcgcag gtgatcggcagctcgctcctctttgtgcacgatcactgccatcgcgccggcgtgtggctc atcgacttcggcaagaccacgcccctccccgatggccagatcctggaccaccggcggccc tgggaggagggcaaccgcgaggacggctatttgctggggctggacaatctcattggcatc ctggccagcctggctgagagatga >gi568815583f:41393928_41603163|GENSCAN_predicted_peptide_5|2225_aa MLSRPKPGESEVDLLHFQSQFLAAGAAPAVQLVKKGNRGGGDANSDRPPLQDHRDVVMLD NLPDLPPALVPSPPKRARPSPGHCLPEDEDPEERLRRHDQHITAVLTKIIERDTSSVAVN LPVPSGVAFPAVFLRSRDTQGKSATSGKRSIFAQEIAARRIAEAKGPSVGEVVPNVGPPE GAVTCETPTPRNQGCQLPGSSHSFQGPNLVTGKGLRDQEAEQEAQTIHEENIARLQAMAP EEILQEQQRLLAQLDPSLVAFLRSHSHTQEQTGETASEEQRPGGPSANVTKEEPLMSAFA SEPRKRDKLEPEAPALALPVTPQKEWLHMDTVELEKLHWTQDLPPVRRQQTQERMQARFS LQGELLAPDVDLPTHLGLHHHGEEAERAGYSLQELFHLTRSQVSQQRALALHVLAQVISR AQAGEFGDRLAGSVLSLLLDAGFLFLLRFSLDDRVDGVIATAIRALRALLVAPGDEELLD STFSWYHGALTFPLMPSQEDKEDEDEDEECPAGKAKRKSPEEESRPPPDLARHDVIKGLL ATSLLPRLRYVLEVTYPGPAVVLDILAVLIRLARHSLESATRVLECPRLIETIVREFLPT SWSPVGAGPTPSLYKVPCATAMKLLRVLASAGRNIAARLLSSFDLRSRLCRIIAEAPQEL ALPPEEAEMLSTEALRLWAVAASYGQGGYLYRELYPVLMRALQVVPRELSTHPPQPLSMQ RIASLLTLLTQLTLAAGSTPAETISDSAEASLSATPSLVTWTQVSGLQPLVEPCLRQTLK LLSRPEMWRAVGPVPVACLLFLGAYYQAWSQQLAAILAAPGLQNYFLQCVAPGAAPHLTP FSAWALRHEYHLQYLALALAQKAAALQPLPATHAALYHGMALALLSRLLPGSEYLTHELL LSCVFRLEFLPERTSGGPEAADFSDQLSLGSSRVPRCGQGTLLAQACQDLPSIRNCYLTH CSPARASLLASQALHRGELQRVPTLLLPMPTEPLLPTDWPFLPLIRLYHRASDTPSGLSP TDTMGTAMRVLQWVLVLESWRPQALWAVPPAARLARLMCVFLVDSELFRESPVQHLVAAL LAQLCQPQVLPNLNLDCRLPGLTSFPDLYANFLDHFEAVSFGDHLFGALVLLPLQRRFSV TLRLALFGEHVGALRALSLPLTQLPVSLECYTVPPEDNLALLQLYFRTLVTGALRPRWCP VLYAVAVAHVNSFIFSQDPQSSDEVKAARRSMLQKTWLLADEGLRQHLLHYKLPNSTLPE GFELYSQLPPLRQHYLQRLTSTVLQNGITGSYHQKLLIQYIWGETQESAFLTSSQGCGRD RKGFCCRVDPTGMGCWGQLLVWFGAAGAILCSSPGSQETFLRSSPLPLASPSPRDPKVSA PPSILEPASPLNSPGTEGSWLFSTCGASGRHGPTQTQCDGAYAGTSVVVTVGAAGQLRGV QLWRVPGPGQYLISAYGAAGGKGAKNHLSRAHGVFVSAIFSLGLGESLYILVGQQGEDAC PGGSPESQLVCLGESRAVEEHAAMDGSEGVPGSRRWAGGGGGGGGATYVFRLEGASWNTP LAPQVRAGELEPLLVAAGGGGRAYLRPRDRGRTQASPEKLENRSEAPGSGGRGGAAGGGG GWTSRAPSPQAGRSLQEGAEGGQGCSEAWATLGWAAAGGFGGGGGACTAGGGGGGYRGGD ASETDNLWADGEDGVSFIHPSSELFLQPLAVTENHGEVEIRRHLNCSHCPLRDCQWQAEL QLAECLCPEGMELAVDNVTCMDLHKPPGPLVLMVAVVATSTLSLLMVCGVLILGTKRLAG TVDSRLLLSMKQKKWQGLQEMRLPSPELELSKLRTSAIRTAPNPYYCQVGLGPAQSWPLP PGVTEVSPANVTLLRALGHGAFGEVYEGLVIGLPGDSSPLQVAIKTLPELCSPQDELDFL MEALIISKFRHQNIVRCVGLSLRATPRLILLELMSGGDMKSFLRHSRPHLGQPSPLVMRD LLQLAQDIAQGCHYLEENHFIHRDIAARNCLLSCAGPSRVAKIGDFGMARDIYRASYYRR GDRALLPVKWMPPEAFLEGIFTSKTDSWSFGVLLWEIFSLGYMPYPGRTNQEVLDFVVGG GRMDPPRGCPGPVYRIMTQCWQHEPELRPSFASILERLQYCTQDPDVLNSLLPMELGPTP EEEGTSGLGNRSLECLRPPQPQELSPEKLKSWGGSPLGPWLSSGLKPLKSRGLQPQNLWN PTYRS >gi568815583f:41393928_41603163|GENSCAN_predicted_CDS_5|6678_bp atgctgtcgagaccgaagccaggggagtccgaggtggacctgctgcacttccagagtcag tttctcgcagctggtgcagccccagcagtgcagttggtgaagaaaggaaataggggcggt ggtgatgccaactcagaccggcctccgctccaggaccatcgggatgtggtgatgttggac aatctcccagatttgcccccagctttggtcccttctcctccaaagagagccaggcccagc cctggccactgcctgcctgaggatgaggacccagaagagaggctgaggaggcatgatcag cacatcactgctgtcttgactaagattattgaacgagatacaagttcagtggccgtgaat ctgcctgtgcccagtggtgttgctttccctgctgtgttccttcgctcgcgggacacacag gggaaatcagcaacatctggtaagagaagcatctttgcccaggaaattgcggcaaggagg atagctgaagccaagggcccatcagttggggaagttgtgcccaacgtgggcccaccagag ggtgccgtgacctgtgagacacccactcctaggaaccagggctgccagcttcctgggagc agccacagctttcagggacccaatctggtcacagggaaggggctcagggatcaagaagct gagcaggaagcccagactatccatgaagagaacatagcaagactgcaggccatggctcct gaggagatcctgcaggaacagcagcggttgctggcccagcttgaccccagcttggttgct ttcttgagatctcacagccacacgcaagagcaaacaggagagacagcctctgaggagcag aggccaggaggaccctctgctaatgtcaccaaggaggaacccctcatgtcagcttttgcc agtgagcccaggaagagagacaagctggagccagaagccccagctctggcattgcccgtg acccctcagaaagaatggctgcacatggacactgtcgagctggagaagctccactggacc caggacttgccccctgtccggcggcagcagacacaggagaggatgcaggctcgattcagt cttcagggagaactactggcccctgacgtggacctgcccacccacctgggtctgcaccac catggagaggaggcagagagagcggggtattccctacaggagctgttccacctgacccgc agccaggtttcccagcagagagcactggcactgcatgtgttagcccaggtcatcagcagg gcccaggctggtgagtttggggaccggctagcaggcagtgtcttaagcctccttttggat gctggtttcctcttcctactgcgcttctccttggatgacagagtggatggggtcattgca accgccatccgtgctcttcgggctctgctggtggctcctggagatgaggagctcctcgac agcaccttctcttggtaccatggagctttgacgttccctctgatgcccagccaggaggac aaggaggatgaggacgaggatgaagaatgcccagcaggaaaagcaaaaaggaaaagccct gaagaagaaagccggcctccacctgacctggcccgacatgatgtcatcaaggggctcctg gctaccagcctgctgcctcggctgcgctacgtgctggaggtgacatacccaggacctgcg gtggtccttgacatcctggctgtgctcatccgcctggcccggcattccctggaatcagcc acaagggtcctggagtgccctcggctgatagagactatagttcgagagttcttgcccacc agttggtctcctgtgggggcagggcctacccctagtctatacaaagtaccctgtgctact gccatgaaactacttcgtgtcctggcctcagctgggaggaatattgctgcccggctgttg agcagctttgatctccggagccgcctgtgccgcatcatagctgaggctccccaagaactg gccttgcccccagaggaagctgagatgctgagcaccgaggccctccgtctgtgggctgtg gctgcctcctatggccagggcggttacctttacagggagctctacccagtgctgatgcgg gccttgcaggtggtgccgcgggagctcagcacccacccacctcaacccctgtccatgcag cggatagcctcactgctcactctcctcacccagctaaccctggcagccggcagtacccct gctgaaaccatcagtgattctgctgaggccagcctctcggccaccccttccttagtcact tggacacaggtgtctgggctccagcctcttgttgagccgtgtctaaggcagaccttgaag ttgctgtccagacctgagatgtggagagccgtgggcccagtgcccgttgcctgcctgttg ttcctgggagcctactaccaggcctggagccagcaactggctgccatattggctgccccg ggactccagaattacttcctccagtgtgtggctcctggggctgccccacacctcacacct ttctctgcatgggccctgcgccatgagtaccacctgcagtacctggcactcgctctggcc cagaaagcggcagcgctgcagccactgccagccacccatgctgccctctatcatggtatg gccttggccctgctgagccggctgctgcccggaagtgagtacctcacccatgagctgctg ctgagctgtgtattccggctggagttcctcccggaaagaacatcagggggtccagaggca gccgacttctctgaccagctgtcgttaggaagcagcagggtccctcggtgtgggcaaggg actctgctggctcaggcctgccaggacctccccagcatccgcaactgctacctgactcat tgctcgccagcccgagccagtctgctggcctcccaggctctgcaccgaggggagctacag cgagtcccaaccctgctactgcccatgcctacggagccgctgctgcccaccgactggccc ttcctgccactgattcgcctctaccaccgggcttcagacaccccctcgggactctctccc acagacaccatgggcacagccatgcgggtcctgcagtgggtgctagttttggagagctgg cgcccccaggctctctgggctgtgccccctgctgcccgcctggcacggctcatgtgtgtg ttcctggtggacagtgagctgttccgggagtccccagtacagcatctggtggcagccctc ctcgcccagctctgtcagcctcaagtcttgccaaacctcaacctggactgccgactccct ggcctgacgtctttccctgacctctatgccaacttcctggatcattttgaggctgtctct tttggggaccacctctttggggccctggtcctcctgcccctgcagcgtcggttcagtgtc accttgcgccttgccctctttggggaacacgtgggagccttgcgagctctgagcctgcct ctgacccagttgcctgtgtccctggagtgttacacagtgcctcctgaagacaacctggcc ctccttcagctctacttccggaccctggttactggtgcgctccgcccacgttggtgcccc gtgctctatgctgtggctgtggctcatgtcaatagcttcatcttctctcaggacccacag agctcagatgaggtcaaagctgcccgcaggagtatgctgcagaaaacatggctgctggca gatgagggtctccggcagcacctcctgcactataagcttcccaattccacgctcccagag ggctttgagctctattctcagttgccccctctgcgtcagcactacctccagagactgact tcaacagtgctccaaaatgggatcactgggtcctaccaccagaagctgctgattcagtac atctggggtgaaacccaagaatctgcatttctaacaagttcccaggggtgtggccgcgac cgcaagggcttttgttgccgggtggacccaacagggatgggctgctggggacagctgctg gtgtggttcggagccgcgggcgccattctctgctctagcccggggtcccaggagactttt ctgcggtcctcgcccctgccgctggcaagtcccagcccccgggacccgaaagtcagcgcc ccgcctagtatcttggagccagcctccccgctgaattctccgggcaccgaggggtcttgg ctgttttctacctgcggggccagcggccggcatgggcccacacagacacaatgtgacggg gcgtacgcggggaccagcgtggtggtgaccgtgggggccgccgggcagctgagaggcgtg cagctgtggcgcgtgccgggccctggccagtatctgatctcagcctacggagccgcgggc ggcaaaggcgccaagaaccacctgtcgcgggcgcatggcgtcttcgtctcagcaatcttc tccctcggtctcggggagtcgctgtacatcctggtggggcagcagggagaggacgcctgt cccggaggtagcccggagagccagctcgtctgcctcggggagtctcgagccgttgaagag cacgcggcgatggatgggagcgaaggggtcccggggtcgcggcgctgggcgggaggtggc gggggtggcgggggcgccacctacgttttccggctggagggcgcttcctggaacacgccg ctggccccacaggtgcgcgctggcgagctggaaccgttgctggtggcggccggaggcggc ggtcgggcctacctgaggccgcgggaccgaggccggactcaggcctcccccgagaaactg gagaaccgctcggaggcgcccgggagcggcgggagaggcggggcggcaggtggtgggggc ggctggacgtcgcgggctccctctccgcaggccggccgctcactgcaggagggggcggag ggcggccagggctgctccgaggcttgggcgacccttggctgggccgcggccggcggcttc gggggcggcggcggggcctgcactgcgggcggaggcggcggcggctacagggggggcgac gcttcagagactgacaacctctgggctgatggggaagatggagtatccttcatacacccc agcagcgagctcttcctgcagcctctggcagtcaccgagaaccacggagaggtagagatc cgaaggcacctcaactgcagtcactgccctttgagagactgccaatggcaggcagagctc cagctggctgaatgcctgtgcccagaaggcatggagctagctgtggataacgtcacctgc atggacctgcacaagcccccaggccctctggttctgatggtggctgtggtggcaacctca acactgagcctccttatggtgtgtggggtcctgattctgggtacgaagcgtctagcaggc acagttgattcaaggctgctcctctccatgaagcagaagaagtggcagggcctgcaggag atgaggctgccgagccctgagcttgagctgagcaagcttcgaacctctgccatcaggaca gcccccaatccctattattgccaggtggggcttggcccggcccagtcctggcctctgcca ccaggtgtcaccgaggtttccccagccaatgttactctgctcagagccctgggccatggt gcctttggggaggtgtatgagggactggtaattggccttcctggggactccagtcccctg caggtagctatcaagaccctgccagaactctgctcgcctcaggatgagctggatttcctc atggaggccctcatcatcagcaagtttcgccatcagaacattgtgcggtgtgtggggctc agcctcagggccacccctcgcctcattctgctggaactgatgtctggaggggacatgaag agtttcctgaggcacagtcggccacacctgggccagccatcacctctggtcatgcgggac ctgctgcaactggcccaggacatagcccagggctgccactacctggaggaaaatcacttc atccacagggatattgccgcccggaactgcctgctgagctgcgctggacccagccgagtg gccaagattggggactttgggatggcacgagatatctaccgggccagttattaccgcagg ggggaccgggccttgctcccagtcaagtggatgcccccagaggccttcctggagggcatc ttcacatccaagacagattcctggtcttttggggtgctgctctgggagatcttctcactg ggctacatgccctatcctgggcgcaccaaccaggaggtgctggacttcgtcgttggagga ggccggatggaccctcctaggggctgcccagggcctgtgtaccgcatcatgacccagtgt tggcagcacgagcctgagctccgccctagctttgccagcatcttggagcgtctgcagtac tgcactcaggacccggatgtgctgaattcactcctgccaatggagctggggcccacccca gaggaggaagggacttctgggctggggaacagatctttggagtgcctaagacccccacag ccccaggaactgagtccagagaagttgaaaagctggggaggtagccctcttggcccctgg ctgtcctctggcctcaagcccctcaaatccaggggcctccaacctcagaacctttggaat cccacttatcgctcctga >gi568815583f:41393928_41603163|GENSCAN_predicted_peptide_6|1133_aa MGRPGLPPLPLPPPPRLGLLLAALASLLLPESAAAGLKLMGAPVKLTVSQGQPVKLNCSV EGMEEPDIQWVKDGAVVQNLDQLYIPVSEQHWIGFLSLKSVERSDAGRYWCQVEDGGETE ISQPVWLTVEGVPFFTVEPKDLAVPPNAPFQLSCEAVGPPEPVTIVWWRGTTKIGGPAPS PSVLNVTALPAAPFNITVTKLSSSNASVAWMPGADGRALLQSCTVQVTQAPGGWEVLAVV VPVPPFTCLLRDLVPATNYSLRVRCANALGPSPYADWVPFQTKGLAPASAPQNLHAIRTD SGLILEWEEVIPEAPLEGPLGPYKLSWVQDNGTQDELTVEGTRANLTGWDPQKDLIVRVC VSNAVGCGPWSQPLVVSSHDRAGQQGPPHSRTSWVPVVLGVLTALVTAAALALILLRKRR KETRFGQAFDSVMARGEPAVHFRAARSFNRERPERIEATLDSLGISDELKEKLEDVLIPE QQFTLGRMLGKGEFGSVREAQLKQEDGSFVKVAVKMLKADIIASSDIEEFLREAACMKEF DHPHVAKLVGVSLRSRAKGRLPIPMVILPFMKHGDLHAFLLASRIGENPFNLPLQTLIRF MVDIACGMEYLSSRNFIHRDLAARNCMYEFWRTRGLAEDMTVCVADFGLSRKIYSGDYYR QGCASKLPVKWLALESLADNLYTVQSDVWAFGVTMWEIMTRGQTPYAGIENAEIYNYLIG GNRLKQPPECMEDVYDLMYQCWSADPKQRPSFTCLRMELENILGQLSVLSASQDPLYINI ERAEEPTAGGSLELPGRDQPYSGAGDGSGMGAKGPSCGQSRRPAGRGEERGGRRRWRPVG AGPLQGRGQTRTEREPGSSGSTGSILPRVPENTPSSRNARASFVKAVFEFASSGEQVLEK VVILEQICLLQYVDDILISGEDIEKCGSFRSADSRTQRPPAARSLPIKGVGPSHLWMASM HPVHRGCGNTGRGKQEVNLWRKIDGNPNLWREHTCLDLIDYHTKVRPDLGETPFRTGRHL FIDSSSGVIEGKRHNGYSVIDGEILIEIESGKLPTIGLLKHNQEGTIYTDSKYACVVAHM FGKIWTERGLISSKGQDLVHKELITQVLNNLQLPEEIAIVRVPGHQKNLSFES >gi568815583f:41393928_41603163|GENSCAN_predicted_CDS_6|3402_bp atggggcggccggggctcccgccgctgccgctgccgccgccaccgcggctcgggctgctg ctggcggctctggcttctctgctgctcccggagtccgccgccgcaggtctgaagctcatg ggagccccggtgaagctgacagtgtctcaggggcagccggtgaagctcaactgcagtgtg gaggggatggaggagcctgacatccagtgggtgaaggatggggctgtggtccagaacttg gaccagttgtacatcccagtcagcgagcagcactggatcggcttcctcagcctgaagtca gtggagcgctctgacgccggccggtactggtgccaggtggaggatgggggtgaaaccgag atctcccagccagtgtggctcacggtagaaggtgtgccatttttcacagtggagccaaaa gatctggcagtgccacccaatgcccctttccaactgtcttgtgaggctgtgggtccccct gaacctgttaccattgtctggtggagaggaactacgaagatcgggggacccgctccctct ccatctgttttaaatgtaacagcactgcctgcagcccccttcaacatcaccgtgacaaag ctttccagcagcaacgctagtgtggcctggatgccaggtgctgatggccgagctctgcta cagtcctgtacagttcaggtgacacaggccccaggaggctgggaagtcctggctgttgtg gtccctgtgcccccctttacctgcctgctccgggacctggtgcctgccaccaactacagc ctcagggtgcgctgtgccaatgccttggggccctctccctatgctgactgggtgcccttt cagaccaagggtctagccccagccagcgctccccaaaacctccatgccatccgcacagat tcaggcctcatcttggagtgggaagaagtgatccccgaggcccctttggaaggccccctg ggaccctacaaactgtcctgggttcaagacaatggaacccaggatgagctgacagtggag gggaccagggccaatttgacaggctgggatccccaaaaggacctgatcgtacgtgtgtgc gtctccaatgcagttggctgtggaccctggagtcagccactggtggtctcttctcatgac cgtgcaggccagcagggccctcctcacagccgcacatcctgggtacctgtggtccttggt gtgctaacggccctggtgacggctgctgccctggccctcatcctgcttcgaaagagacgg aaagagacgcggtttgggcaagcctttgacagtgtcatggcccggggagagccagccgtt cacttccgggcagcccggtccttcaatcgagaaaggcccgagcgcatcgaggccacattg gacagcttgggcatcagcgatgaactaaaggaaaaactggaggatgtgctcatcccagag cagcagttcaccctgggccggatgttgggcaaaggagagtttggttcagtgcgggaggcc cagctgaagcaagaggatggctcctttgtgaaagtggctgtgaagatgctgaaagctgac atcattgcctcaagcgacattgaagagttcctcagggaagcagcttgcatgaaggagttt gaccatccacacgtggccaaacttgttggggtaagcctccggagcagggctaaaggccgt ctccccatccccatggtcatcttgcccttcatgaagcatggggacctgcatgccttcctg ctcgcctcccggattggggagaacccctttaacctacccctccagaccctgatccggttc atggtggacattgcctgcggcatggagtacctgagctctcggaacttcatccaccgagac ctggctgctcggaattgcatgtacgaattctggaggactcgagggctggcagaggacatg acagtgtgtgtggctgacttcggactctcccggaagatctacagtggggactactatcgt caaggctgtgcctccaaactgcctgtcaagtggctggccctggagagcctggccgacaac ctgtatactgtgcagagtgacgtgtgggcgttcggggtgaccatgtgggagatcatgaca cgtgggcagacgccatatgctggcatcgaaaacgctgagatttacaactacctcattggc gggaaccgcctgaaacagcctccggagtgtatggaggacgtgtatgatctcatgtaccag tgctggagtgctgaccccaagcagcgcccgagctttacttgtctgcgaatggaactggag aacatcttgggccagctgtctgtgctatctgccagccaggaccccttatacatcaacatc gagagagctgaggagcccactgcgggaggcagcctggagctacctggcagggatcagccc tacagtggggctggggatggcagtggcatgggggcaaagggccccagctgcggacagagc cggcgccctgcaggccgcggggaggagcgcggcgggcgcaggcggtggcggcccgtgggc gcggggcccctgcagggacggggacagacgcgcacggagcgggagcccggcagctccggg tctacgggatccatcctcccaagggtgcccgagaacactccatcgtcgcggaatgcccgc gcttctttcgttaaagctgtctttgagtttgcctcctctggtgaacaggtactagaaaaa gttgtcatactagaacaaatatgccttcttcagtacgtagacgacattcttatatctggt gaagatatagagaagtgcggtagttttaggagtgctgactcaagaacacagaggccgccg gcagcccgtagccttcctatcaaaggtgttggacccagtcacttgtggatggcctcaatg catccagtccatcggggctgcggcaatactggtcgaggaaagcaggaagttaacctttgg aggaaaattgacgggaatccaaatctatggagggaacacacatgtttagatttaattgat taccatacaaaggttcgaccagacctaggagaaacccccttccggactggacggcactta ttcatagacagttcctccggggtgattgagggaaaaagacacaatgggtattcagtgatt gatggagaaattctcatagaaatagaatctggaaaattgccaacaattggtctgctcaaa cataaccaggaaggaaccatctatacagattccaagtatgcctgtgtagtggcccatatg tttgggaaaatttggactgaaagaggtctcattagtagtaaaggtcaagaccttgttcac aaggagctgatcacccaagtattgaataatcttcagttgccagaagaaatagctattgtc cgtgttcccggacaccagaaaaacctttcttttgaaagttga >gi568815583f:41393928_41603163|GENSCAN_predicted_peptide_7|81_aa MVLTSLAGGQAAPRNEDAEKEEPNSSILGTLNPSGTGFGWLTWGVEDIIENHIDMLGPVN LDWSGLSLLSYWQQLRESRWP >gi568815583f:41393928_41603163|GENSCAN_predicted_CDS_7|246_bp atggtccttaccagtcttgctggaggtcaagctgccccaaggaatgaagatgctgaaaaa gaagaaccaaacagttccattttagggactctgaaccccagtggaacaggatttggttgg ctgacctggggagtggaagacatcatagaaaaccacattgatatgttggggccggttaac ctggactggtctggactttccctgctgtcctattggcagcaactcagagagtccagatgg ccctag