GENSCAN 1.0 Date run: 6-Nov-116 Time: 12:16:11 Sequence gi568815581r:28973855_29174199 : 200345 bp : 52.17% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 Intr - 5959 5826 134 2 2 95 113 108 0.919 14.87 1.08 Intr - 8185 7517 669 2 0 84 76 464 0.988 37.10 1.07 Intr - 11743 11640 104 0 2 116 25 9 0.029 -2.28 1.06 Intr - 19656 19568 89 0 2 87 68 26 0.137 -0.23 1.05 Intr - 22007 21864 144 2 0 63 86 73 0.515 5.59 1.04 Intr - 23888 23694 195 2 0 99 72 50 0.766 4.63 1.03 Intr - 24871 24848 24 1 0 105 92 18 0.550 2.60 1.02 Intr - 30709 30501 209 2 2 33 -9 145 0.037 -1.38 1.01 Init - 32015 31961 55 2 1 90 103 229 0.999 23.99 1.00 Prom - 36992 36953 40 -4.11 2.02 PlyA - 38226 38221 6 1.05 2.01 Sngl - 47786 47466 321 2 0 89 53 256 0.991 18.51 2.00 Prom - 48019 47980 40 -3.71 3.00 Prom + 49106 49145 40 -4.01 3.01 Init + 51517 51592 76 0 1 83 90 61 0.793 7.00 3.02 Intr + 53577 53718 142 1 1 59 81 85 0.577 4.82 3.03 Term + 54033 54036 4 0 1 127 41 0 0.308 -4.15 3.04 PlyA + 54558 54563 6 1.05 4.05 PlyA - 55258 55253 6 1.05 4.04 Term - 61434 61326 109 2 1 -6 48 263 0.615 11.18 4.03 Intr - 62142 62042 101 0 2 88 60 -12 0.632 -4.59 4.02 Intr - 62336 62241 96 2 0 77 61 71 0.419 4.01 4.01 Init - 62940 62887 54 0 0 88 65 3 0.433 -0.61 4.00 Prom - 65621 65582 40 -2.81 5.00 Prom + 68062 68101 40 -1.31 5.01 Init + 69372 69485 114 2 0 112 84 86 0.951 11.09 5.02 Intr + 71005 71153 149 0 2 88 80 89 0.743 7.64 5.03 Intr + 79066 79279 214 1 1 73 95 84 0.795 6.84 5.04 Intr + 79559 79741 183 1 0 136 110 119 0.999 19.40 5.05 Intr + 80691 80837 147 2 0 81 105 123 0.775 14.24 5.06 Intr + 81209 81367 159 1 0 -2 103 237 0.994 16.90 5.07 Intr + 81959 82034 76 1 1 43 105 90 0.994 5.78 5.08 Term + 82321 82451 131 2 2 104 43 85 0.997 4.25 5.09 PlyA + 83325 83330 6 1.05 6.00 Prom + 84092 84131 40 -4.31 6.01 Init + 86880 86958 79 2 1 74 71 78 0.965 5.88 6.02 Term + 87516 87826 311 1 2 89 35 106 0.664 0.97 6.03 PlyA + 88143 88148 6 1.05 7.00 Prom + 88358 88397 40 -4.11 7.01 Init + 88559 88808 250 1 1 48 89 124 0.573 6.12 7.02 Term + 91753 91892 140 2 2 44 49 133 0.659 3.43 7.03 PlyA + 97526 97531 6 1.05 8.03 PlyA - 98916 98911 6 1.05 8.02 Term - 101060 100916 145 1 1 98 41 191 0.992 13.09 8.01 Init - 103274 103174 101 2 2 92 92 1 0.792 0.62 8.00 Prom - 103815 103776 40 -1.21 9.50 PlyA - 104178 104173 6 -0.45 9.49 Term - 107146 105877 1270 0 1 11 39 1377 0.990 116.73 9.48 Intr - 108584 108462 123 1 0 102 99 134 0.854 16.11 9.47 Intr - 112723 112584 140 1 2 94 94 254 0.998 26.37 9.46 Intr - 113267 113082 186 2 0 69 55 358 0.935 31.00 9.45 Intr - 116244 116107 138 0 0 99 90 159 0.984 18.37 9.44 Intr - 116761 116678 84 1 0 110 100 112 0.999 15.11 9.43 Intr - 117072 116863 210 0 0 125 49 334 0.927 32.83 9.42 Intr - 118647 118489 159 0 0 81 77 246 0.877 23.50 9.41 Intr - 119147 119001 147 2 0 59 68 321 0.999 28.14 9.40 Intr - 119573 119469 105 2 0 26 71 238 0.933 16.81 9.39 Intr - 120236 120126 111 2 0 50 116 160 0.940 16.08 9.38 Intr - 120416 120297 120 2 0 39 75 53 0.531 0.29 9.37 Intr - 120996 120796 201 0 0 100 92 261 0.831 27.70 9.36 Intr - 121205 121082 124 1 1 73 90 194 0.893 19.19 9.35 Intr - 123061 122907 155 1 2 88 90 354 0.999 34.98 9.34 Intr - 123496 123369 128 2 2 103 76 296 0.986 30.80 9.33 Intr - 124045 123934 112 1 1 97 80 257 0.999 26.26 9.32 Intr - 124370 124251 120 2 0 62 94 194 0.998 18.59 9.31 Intr - 124591 124502 90 1 0 108 100 207 0.999 24.59 9.30 Intr - 125115 124972 144 0 0 86 52 251 0.593 22.29 9.29 Intr - 125908 125780 129 1 0 114 48 125 0.993 12.50 9.28 Intr - 129810 129745 66 0 0 80 100 109 0.996 10.89 9.27 Intr - 133335 133226 110 1 2 91 77 152 0.978 14.90 9.26 Intr - 135698 135585 114 0 0 75 77 31 0.389 1.62 9.25 Intr - 136247 136004 244 2 1 81 81 300 0.987 26.20 9.24 Intr - 136768 136582 187 0 1 87 89 305 0.974 30.71 9.23 Intr - 137729 137570 160 0 1 94 63 130 0.999 10.66 9.22 Intr - 138009 137868 142 0 1 82 100 139 0.961 14.94 9.21 Intr - 140243 140157 87 2 0 90 62 185 0.988 16.76 9.20 Intr - 141024 140839 186 0 0 7 57 110 0.369 0.20 9.19 Intr - 141245 141053 193 1 1 71 55 272 0.998 22.22 9.18 Intr - 141587 141497 91 0 1 63 95 130 0.999 10.75 9.17 Intr - 141986 141810 177 0 0 69 80 232 0.952 20.91 9.16 Intr - 143856 143836 21 1 0 102 60 35 0.420 0.10 9.15 Intr - 144335 144191 145 2 1 67 86 210 0.820 19.07 9.14 Intr - 144586 144523 64 0 1 85 100 43 0.999 4.41 9.13 Intr - 145581 145481 101 0 2 79 94 158 0.999 14.91 9.12 Intr - 146904 146762 143 1 2 75 94 148 0.917 14.68 9.11 Intr - 147357 147144 214 0 1 84 81 267 0.988 24.41 9.10 Intr - 147869 147693 177 2 0 117 56 238 0.999 24.13 9.09 Intr - 148103 147997 107 0 2 105 100 209 0.999 24.23 9.08 Intr - 148399 148312 88 1 1 115 92 79 0.999 11.04 9.07 Intr - 154690 153998 693 1 0 54 72 539 0.021 41.24 9.06 Intr - 157588 157529 60 1 0 139 87 46 0.035 9.02 9.05 Intr - 164633 164506 128 0 2 101 82 44 0.008 6.00 9.04 Intr - 166419 166326 94 0 1 83 91 6 0.017 0.44 9.03 Intr - 176709 176497 213 0 0 78 78 47 0.017 2.14 9.02 Intr - 181616 181488 129 2 0 128 28 17 0.006 1.10 9.01 Init - 193086 192088 999 0 0 76 76 1425 0.247 133.66 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 30709 30495 215 2 2 33 41 155 0.831 3.02 S.002 Init - 154576 153998 579 1 0 87 72 572 0.937 48.73 S.003 Init + 164333 164412 80 1 2 46 93 81 0.821 4.93 S.004 Term + 170558 170662 105 1 0 106 41 53 0.839 1.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:28973855_29174199|GENSCAN_predicted_peptide_1|541_aa MRPVALLLLPSLLALLAHGRKTVAPETQKPFSEPLAPLAHSPVSSPRYRAAHAGSSGSAQ NLKPCAHVLGRGEESCKTHPRDEDREGEVRCAPEEQNSMKIKGNRGGGDGDWEETTRPLE RSPSNSQNCPSPGEADLSGISGGAESYNPGVQSWTAEAAVGQGFRPLLQLNSAKRPPLEL IAFHTLEPSPWVPPPPAAAALETGTSWEKLTPPRFSIQRNMVSAADMVVVEVAGTGRGLL QNSPIQSMQIHTKPFQSAHFIAEEAKALRGNLPGLSLEAPTVGKGQAPGIEETDGELTAA PTPEQPERGVHFVTTAPTLKLLNHHPLLEEFLQEGLEKGDEELRPALPFQPDPPAPFTPS PLPRLANQDSRPVFTSPTPAMAAVPTQPQSKEGPWSPESESPMLRITAPLPPGPSMAVPT LGPGEIASTTPPSRAWTPTQEGPGDMGRPWVAEVVSQGAGIGIQGTITSSTASGDDEETT TTTTIITTTITTVQTPGPCSWNFSGPEGSLDSPTDLSSPTDVGLDCFFYISVYPGYGVEI K >gi568815581r:28973855_29174199|GENSCAN_predicted_CDS_1|1623_bp atgcgcccggtagccctgctgctcctgccctcgctgctggcgctcctggctcacggaaga aaaactgtggccccggagacgcagaagcctttttctgagcctctagctccacttgcacac agcccagtgtcctcaccccgctaccgggctgcccatgctggatcctccggcagcgcccag aacttgaaaccgtgtgctcacgttctggggagaggggaagagtcctgcaagacccacccc agggatgaagaccgggaaggagaagtcaggtgtgcccctgaagagcagaacagcatgaag atcaaaggaaatcggggtgggggggatggtgactgggaagaaaccacaagaccccttgag aggagcccctccaacagccagaactgtccatcaccaggagaggctgatttgtcaggcatc tctggaggggctgaatcatacaacccaggagtccagagctggaccgctgaggcagcagtg gggcagggcttcaggcccctactacagctcaactctgccaaaaggccaccattagagctc attgccttccacacccttgagccttcaccctgggtcccaccaccaccagctgcagctgct ctggaaactggcaccagctgggagaagctgacacctccaagattttccattcagagaaat atggtctctgctgctgatatggtggtggtagaagtggctggcacagggagagggctcctt cagaactctcctattcagagcatgcagattcacacaaaacccttccaaagtgcccatttt atagctgaggaggcaaaggctctgagaggcaacttgccaggactctctttagaggcccca accgtggggaaaggacaagccccaggcatcgaggagacagatggcgagctgacagcagcc cccacacctgagcagccagaacgaggcgtccactttgtcacaacagcccccaccttgaag ctgctcaaccaccacccgctgcttgaggaattcctacaagaggggctggaaaagggagat gaggagctgaggccagcactgcccttccagcctgacccacctgcacccttcaccccaagt ccccttccccgcctggccaaccaggacagccgccctgtctttaccagccccactccagcc atggctgcggtacccactcagccccagtccaaggagggaccctggagtccggagtcagag tcccctatgcttcgaatcacagctcccctacctccagggcccagcatggcagtgcccacc ctaggcccaggggagatagccagcactacaccccccagcagagcctggacaccaacccaa gagggtcctggagacatgggaaggccgtgggttgcagaggttgtgtcccagggcgcaggg atcgggatccaggggaccatcacctcctccacagcttcaggagatgatgaggagaccacc actaccaccaccatcatcaccaccaccatcaccacagtccagacaccaggcccttgtagc tggaatttctcaggcccagagggctctctggactcccctacagacctcagctcccccact gatgttggcctggactgcttcttctacatctctgtctaccctggctatggcgtggaaatc aag >gi568815581r:28973855_29174199|GENSCAN_predicted_peptide_2|106_aa MPFLAIQKRFGLNIDQWWTIQSAEQPYKIAARCHAFEKEWIECAYGISVIRAEKECKIES DDFVECLLRQKTMRRAGTIRKQQDKLIKEGKYTPPPHHIGKEEPQP >gi568815581r:28973855_29174199|GENSCAN_predicted_CDS_2|321_bp atgcctttcttggccatccagaaaagattcggccttaacatagatcaatggtggacaatc cagagtgctgaacagccctacaagattgctgctcgatgccatgcttttgaaaaagaatgg atagaatgtgcatatggaatcagtgttatccgggcagagaaagagtgcaagatagaatct gatgatttcgtagagtgtttgcttcggcagaaaacgatgagacgtgcaggtaccatcagg aagcagcaggataagctgataaaggaagggaagtacacccctccacctcaccacattggc aaggaggagcctcagccctga >gi568815581r:28973855_29174199|GENSCAN_predicted_peptide_3|73_aa MLPVTLRVSLQADLSAKDPIQGQTEVPQVAAVNWDLCAAAKRHLRIPKFGTGGRSLPKQA VKLDFGEDFKCGC >gi568815581r:28973855_29174199|GENSCAN_predicted_CDS_3|222_bp atgcttccagtcactcttcgtgtctctctgcaggccgaccttagtgctaaggatcctatc caaggtcagaccgaggtcccccaagttgcagctgtgaactgggacttatgcgcagcagcc aagagacacctcagaattcccaagtttggaacaggaggaaggagtctgccaaagcaagcg gtgaaactagattttggagaggacttcaagtgtggatgctga >gi568815581r:28973855_29174199|GENSCAN_predicted_peptide_4|119_aa MPVLGWPSKVYQRLTAPSAFGEGPIKAQQAVGKMRNQSAQKWSCFSVEVRPGYHYTLKWG VGERVEIGSRKSKQLLHELDWLDRKKKKKKKKKKKKKKEKEKEEKEEKKEKKEGKRERE >gi568815581r:28973855_29174199|GENSCAN_predicted_CDS_4|360_bp atgcctgttctgggctggcccagcaaggtttatcaaaggctgacagcgccctctgctttt ggagaaggccccattaaagctcagcaggcagtgggaaaaatgagaaatcagagtgctcag aagtggtcctgtttctctgtggaggtgaggcctggatatcactataccctaaagtgggga gttggggaaagagtggaaattgggagcaggaaaagtaaacagcttttgcatgagctggat tggcttgataggaagaagaagaagaagaagaagaagaagaagaagaagaagaaggagaaa gagaaggaggagaaggaggagaagaaggagaagaaagaaggaaagagagagagagagtaa >gi568815581r:28973855_29174199|GENSCAN_predicted_peptide_5|390_aa MAAQKDLWDAIVIGAGIQGCFTAYHLAKHRKRILLLEQFFLPHSRGSSHGQSRIIRKAYL EDFYTRMMHECYQIWAQLEHEAGTQLHRQTGLLLLGMKENQELKTIQANLSRQRVEHQCL SSEELKQRFPNIRLPRGEVGLLDNSGGVIYAYKALRALQDAIRQLGGIVRDGEKVVEINP GLLVTVKTTSRSYQAKSLVITAGPWTNQLLRPLGIEMPLQTLRINVCYWREMVPGSYGVS QAFPCFLWLGLCPHHIYGLPTGEYPGLMKVSYHHGNHADPEERDCPTARTDIGDVQILSS FVRDHLPDLKPEPAVIESCMYTNTPDEQFILDRHPKYDNIVIGAGFSGHGFKLAPVVGKI LYELSMKLTPSYDLAPFRISRFPSLGKAHL >gi568815581r:28973855_29174199|GENSCAN_predicted_CDS_5|1173_bp atggcggctcagaaagatctctgggacgccattgtgattggggcggggatccagggctgc ttcactgcataccacctggccaaacacaggaagaggatcctcctgctggagcagttcttt ctaccacactcccgaggaagctcccatggacaaagccggataatccgaaaggcgtacctg gaagacttttacacccggatgatgcatgagtgctatcagatatgggcccagctggagcac gaggctggaacccaattgcacaggcagactggattactgctgctgggaatgaaagagaat caagaattaaagacaatccaggccaatctgtcgaggcagagggtagaacaccagtgtctt tcatctgaggaactgaagcaacgtttcccaaatattcggttgcccaggggagaagtgggg ctcttggacaattccggaggagttatctatgcatataaggccctcagagccctgcaggat gcaattcgacagctaggaggcatagtgcgtgacggagagaaggtggtggagataaaccca gggctactggtcacggtgaaaaccacctccaggagctaccaagctaagagcttggtcatc acagcaggtccttggaccaaccagctcctccgtcccctgggcattgagatgcctctccag accctgcggatcaacgtgtgttactggcgagagatggttcctgggagctatggtgtgtcc caggcctttccgtgcttcctgtggctgggcttgtgtccccaccacatctacggactgccc acaggagagtacccagggctgatgaaggtcagctatcaccacggcaaccacgcagaccct gaggagcgggactgccccacagcacgcacagacatcggagacgtccagatcctgagcagc tttgtcagagatcacttacctgatctgaagcccgagcctgctgtcattgagagctgcatg tacacgaatacccctgatgagcagttcattctcgatcgccacccaaagtatgacaacatt gtcattggtgctggattctctgggcacgggttcaagctggcccctgtggtggggaagatc ctgtatgaattaagcatgaaattaacaccatcttatgacttggcaccttttcgaatcagc cgtttcccaagcctgggcaaagcccacctttga >gi568815581r:28973855_29174199|GENSCAN_predicted_peptide_6|129_aa MDRASEERRQTRSRKEETGNVQTTKAASPKDEEMVGKSGKTGKEGGESGQSNLSSKTQAI VGYQSGIVVGRGGGVYYPQYQFGPQQSLASQNGFNSLRNVSVPRKEGQLSRKVKGENMAE SVWRQSATK >gi568815581r:28973855_29174199|GENSCAN_predicted_CDS_6|390_bp atggatcgggccagcgaagaaaggagacagacaagaagcaggaaagaagagactggcaat gtgcagaccacgaaggcagcctcaccaaaagatgaggagatggtggggaaatcgggaaag actggcaaagagggaggagagtcagggcaaagtaatttgagttctaagacccaggcgatc gtggggtaccagtcagggatcgtggtggggagaggaggaggtgtatactatcctcagtac cagtttggaccccagcagagtttggcatcccagaatgggtttaacagcttaaggaatgtg agtgtacccaggaaggagggtcaactatctagaaaggtcaaaggagaaaacatggccgag tcggtctggagacagtcggctacaaagtaa >gi568815581r:28973855_29174199|GENSCAN_predicted_peptide_7|129_aa METAAVGREAHSCEQRHRAKDVGITLQNAPGASIFRREKFLRFPDMCRVQVPLESPGSLT TAAKAPSRPRAQLQHSTEALEQSMPGNVSPADFFGDRGRLNANLYNCGHPDTHRQREQEK TPVVAARTV >gi568815581r:28973855_29174199|GENSCAN_predicted_CDS_7|390_bp atggaaacagcagccgtgggaagggaggctcacagctgcgagcagaggcacagagctaag gacgtgggcatcactctccaaaatgctcctggggccagcatcttcaggagagaaaagttt cttagattccccgacatgtgccgggttcaagtcccactcgaatccccgggctcccttaca actgctgccaaggcccccagcaggccccgtgctcagctccagcacagcactgaggcgttg gagcagtcaatgccagggaacgttagcccagctgacttcttcggggatcggggaaggctg aacgcaaatctgtacaactgtgggcacccagacacccatcgacagagggagcaggagaag accccagtggttgctgccagaactgtgtga >gi568815581r:28973855_29174199|GENSCAN_predicted_peptide_8|81_aa MEFTTSPAAPATAQRLDETVWKHGCHLSRCLPSSPTSYWKSLAPDRSDDEHDPLDNTSRP RYSHSYLSDSDTEAKLTETNA >gi568815581r:28973855_29174199|GENSCAN_predicted_CDS_8|246_bp atggagtttaccacctccccggcagctcctgccactgcccagcgtcttgatgaaacagta tggaaacacggctgtcatttatccaggtgtctgcctagcagccccaccagctactggaag tcccttgcccctgatcggtcagatgatgagcacgaccctctcgacaacacctccagaccg cgatactcccacagttatctgagtgacagcgacacagaggccaagctgacggagactaac gcatag >gi568815581r:28973855_29174199|GENSCAN_predicted_peptide_9|3042_aa MFNLMKKDKDKDGGRKEKKEKKEKKERMSAAELRSLEEMSLRRGFFNLNRSSKRESKTRL EISNPIPIKVASGSDLHLTDIDSDSNRGSVILDSGHLSTASSSDDLKGEEGSFRGSVLQR AAKFGSLAKQNSQMIVKRFSFSQRSRDESASETSTPSEHSAAPSPQVEVRTLEGQLVQHP GPGIPRPGHRSRAPELVTKKFPVDLRLPPVVPLPPPTLRELELQRRPTGDFGFSLRRTTM LDRGPEGQACRRVVHFAEPGAGTKDLALGLVPGDRLVEINGHNVESKSRDEIVEMIRQSG DSVRLKVQPIPELSELSRSWLRSGEGPRREPSDHLSLPGPGASRVHTGRQVLPMELFSGS RTEAAPALGQKRAACVSPPGKAGDGGISPSSQEGRAVLSGNFYTSTTTSSGILMPQGQQV PGCVPPQLGQRFLSSPRMSSAQLCNTQERRDLFVISPDVAEVGGERGGAALGLLSELLDK LVLGWDWEEEVSWELPSVPLNGKEGGEENALPLGPQKEKPPVKEEDKTLPKPGSPGKEEG ALEGSSKEGSGPSRSPQPPTSPIPPETSQRARSPAPTLAMNGPGAATAEGLSEEAQGLSR KRVANAVRKVVSKVLPSEELGNAKETPGRGVKSPEHPTRSKRGEKAASSPKPPPPPPPPP APPKPEVKKEAAKDELSLGLRSLMSRGRGKEHKARGKQSSGKGEKPSSQEPGSPDWADSP EKAGSPAKPEAPKKQRSPAPPEELVTPGSTGPKSDLTGEQQSKSPVPGVKQEAKTEEQIA AEEAWNETEKVWLVHRDGFSLASQLKSEELNLPEGKVRVKLDHDGAILDVDEDDVEKANA PSCDRLEDLASLVYLNESSVLHTLRQRYGASLLHTYAGPSLLVLGPRGAPAVYSEKVMHM FKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIILLGSSGSGKTTSCQHLVQYLATIAGI SGNKVFSVEKWQALYTLLEAFGNSPTIINGNATRFSQILSLDFDQAGQVASASIQTMLLE KLRVARRPASEATFNVFYYLLACGDGTLRTELHLNHLAENNVFGIVPLAKPEEKQKAAQQ FSKLQAAMKVLGISPDEQKACWFILAAIYHLGAAGATKEAPEEQAAGRKQFARHEWAQKA AYLLGCSLEELSSAIFKHQHKGGTLQRSTSFRQGPEESGLGDGTGPKLSALECLEGMAAG LYSELFTLLVSLVNRALKSSQHSLCSMMIVDTPGFQNPEQGGSARGASFEELCHNYTQDR LQRLFHERTFVQELERYKEPQILALAPTARASVDAEGITLLSISVLLRSLEQESFLAPLS VASGFGYVLTWWRVYSPGSLQENIELAFDDLEPPTDDSVAAVDQASHQSLVRSLARTDEA RGLLWLLEEEALVPGASEDTLLERLFSYYGPQEGDKKGQSPLLHSSKPHHFLLGHSHGTN WVEYNVTGWLNYTKQNPATQNAPRLLQDSQKKIISNLFLGRAGSATVLSGSIAGLEGGSQ LALRRATSMRKTFTTGMAAVKKKSLCIQMKLQVDALIDTIKKSKLHFVHCFLPVAEGWAG EPRSASSRRVSSSSELDLPSGDHCEAGLLQLDVPLLRTQLRGSRLLDAMRMYRQVWTVQL CPQQQTQCPFGAICPVRNEVLNLSEVLKIPKKGYPDHMVFSEFRRRFDVLAPHLTKKHGR NYIVVDERRAVEELLECLDLEKSSCCMGLSRVFFRAGTLARLEEQRDEQTSRNLTLFQAA CRGYLARQHFKKRKIQDLAIRCVQKNIKKNKGVKDWPWWKLFTTVRPLIEVQLSEEQIRN KDEEIQQLRSKLEKAEKERNELRLNSDRLESRISELTSELTDERNTGESASQLLDAETAE RLRAEKEMKELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERA VREVDFTKKRLQQEFEDKLEVEQQNKRQLERRLGDLQADSEESQRALQQLKKKCQRLTAE LQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEEAQREKLQREKLQREKDMLLAEAFSL KQQLEEKDMDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEEL DEQAGTIQMLEQTSSKGVQLLWRSCCHGSKVPAKGTGWWTFSAPEPRESHAQAKLRLEME MERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKVLREKRELEGKLAT LSDQVNRRDFESEKRLRKDLKRTKALLADAQLMLDHLKNSAPSKREIAQLKNQEWLTSLP GAGVVCVQLEESEFTCAAAVKARKAMEVEIEDLHLQIDDIAKAKTALEEQLSRLQREKNE IQNRLEEDQEDMNELMKKHKAAVAQVPLPGVPVTLCTVTILVVLLMGPPHNRLFWEASRD LAQINDLQAQLEEANKEKQELQEKLQALQSQVEFLEQSMVDKSLVSRQEAKIRELETRLE FERTQVKRLESLASRLKENMEKLTEERDQRIAAENREKEQNKRLQRQLRDTKEEMGELAR KEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMESDENEDLINSE GDSDVDSELEDRVDGVKSWLSKNKGPSKAASDDGSLKSSSLSKEAPGVEERPSSVVSSLS YRKRLTLKDSIGGTGDADSLFTSLSERAASPERPPRKAHVGPREEPCPGRKSEEPEECGS VRSGTGGRAGRGPQKRWGSDFSPASTVSAPVSRASSATRRGSGEDRAGSSLSFSLSGSPG SRRSTSRLDSLSRTLSPSLSRASGLGRESPDSRLSLGRSCLEEWDDGASMALSEACSQYS HPSLARSLSVPPRPRSSASAVDEPPSSSVRSVSRHSYLDPDLEAAVNEVLSYKPVPFQRS SLEPDSEEDDRKSIQSARSAQLDPPERAASIRRSASAADVSRSRSGRKSRSRRRSGRSSS SSSSSSGSEASSEHKRRKKGRSRKSKKSKSRRKRTETESESSSSSSSGSTVSSHSCSSVK KGPAAESEETGQTHRPSRKEEKKRKKEVDSLMMRYLYRPESD >gi568815581r:28973855_29174199|GENSCAN_predicted_CDS_9|9129_bp atgtttaacctaatgaagaaagacaaggacaaagatggcgggcggaaggagaagaaggag aaaaaggagaaaaaggagcggatgtcagcggcagagcttcggagcctggaggagatgagc ctgcgacgtggcttcttcaacctgaaccgctcctccaagcgtgaatccaagacgcgcctg gaaatctccaaccccatccccatcaaggtggccagcggctctgacctgcacctgactgac attgactccgatagtaaccggggcagcgtcatcctggactcgggccacctaagtacagcc agctccagcgatgacctcaagggtgaggagggtagcttccgtggctcggtgctgcagcgg gcagccaagttcggctcactggccaagcagaactcacagatgattgtcaagcgcttttcc ttctcccagcgtagccgggatgagagcgcctcagaaacctcgacgccctcagagcactct gccgccccctcgccacaggtggaggtgaggactctagagggacagctggtgcagcatcct ggcccaggcatccctcgaccagggcaccgatcccgagcccctgagctagtgactaaaaag ttcccagtcgacctgcgcctgccccccgtggtgcccctgcccccacctaccctccgggag ctggagctgcaacgacggcccactggagactttggcttctccctgcggcgcacaaccatg ctggatcggggccccgagggccaggcctgtcggcgtgtggtccactttgctgagcctggt gcaggcaccaaggacctggccctggggctggtgccaggagatcgactggtggagattaat gggcacaatgtggagagcaagtccagggatgagattgtggagatgatccggcagtcaggg gacagcgtgcggctcaaggtgcagcccattccagagctcagcgagctcagcaggagctgg ctgcggagcggcgagggacctcgcagggagccatccgatcacctctcactgccaggccct ggggcaagcagggtccacacagggcggcaggttcttcccatggagctcttctctggcagc aggactgaggctgccccggccctggggcagaagagggctgcatgtgtgagtccccctggt aaggcaggggatggtggcatttccccttcctctcaagaaggaagagctgtcctctctggc aatttctacacctcaaccaccaccagctctggaatcctaatgccccagggccagcaggtc cctggctgtgtgccaccacaactggggcagcgcttcctttcctcgccaaggatgtcttct gcccagctttgtaacacacaggaaaggagggacctttttgtgatttccccagatgtggct gaagtaggtggggagaggggtggggcagccttggggctcctctccgagcttcttgataag ttggtgctaggttgggactgggaggaagaagtctcctgggagctgccctctgtgcctctg aatgggaaagagggaggagaggagaatgccctgcctcttggtcctcagaaggagaagcct ccggttaaagaggaggacaaaactctgccaaagcctggctctcctggcaaggaggaaggg gctctagagggcagctcaaaggaaggcagtggcccatctaggagtcctcagcctcccacc agcccgatacccccagagacttcccagagagccaggagccctgcacccacactcgccatg aacggcccgggagctgccacagcagaaggcctaagcgaagaggcccagggcctgtcccgg aagcgggtggcaaatgcagtgaggaaggtggtgagtaaggtgctgcccagcgaggagctt gggaatgccaaggagaccccaggcagaggagtcaagtcccccgagcacccgactcgaagc aagaggggagaaaaggcagcttctagtcccaagccgccaccccctcccccacctccccct gctccgcctaagcctgaagtgaagaaggaggcagccaaggatgagctctccctgggcctg cggagcctgatgtctcggggcaggggcaaggagcacaaggcccgcggcaagcagtcttct gggaagggggagaagccctccagccaggagccaggctccccagactgggcagactcccct gagaaagcaggatccccagccaagcctgaagccccaaagaagcagcgctccccagcccca ccggaagagctggtgaccccaggctccaccggcccgaagtcagacctcactggagagcag cagtccaagtcaccagtccctggtgtgaagcaggaggcgaaaacagaagaacagattgca gcagaagaggcctggaatgagacggagaaggtgtggctggtccatagggacggcttctca ctggccagtcaactcaaatctgaggagctcaacttgcctgaggggaaggtgcgtgtgaag ctggaccacgatggggccatcctggatgtggatgaggatgacgttgagaaggctaatgct ccctcctgcgaccgtctggaggatctggcctcactggtgtacctcaatgagtccagcgtc ctgcacaccttgcgccagcgctatggcgctagcctgctgcacacgtatgctggccccagc ctgctggttcttggcccccgtggggcccctgctgtgtactctgagaaggtgatgcacatg ttcaagggttgtcggcgggaggacatggcaccccacatctatgcagtggcccagaccgca tacagggcgatgctgatgagccgtcaggatcagtcaatcatcctcctgggcagtagtggc agtggcaagaccaccagctgccagcatctggtgcagtacctggccaccatcgcgggcatc agcgggaacaaggtgttttctgtggagaagtggcaggctctgtacaccctcctggaagcc tttgggaacagccccaccatcattaatggcaatgccacccgcttctcccagatcctctcc ctggactttgaccaagctggccaggtggcctcagcctccattcagacaatgcttctggag aagctgcgtgtggctcggcgcccagccagtgaagccacattcaacgtcttctactacctg ctggcctgtggggatggcaccctcaggacagagctccacctcaaccacttggcagagaac aatgtgtttgggattgtgccactggccaagcctgaggaaaagcagaaggcagctcagcag tttagtaagctgcaggcggccatgaaggtgctgggcatctcccccgatgaacagaaggcc tgctggttcattctggctgccatctaccacctgggggctgcgggagccaccaaagaggcc cccgaggagcaagcggctgggcgcaagcagtttgcccgccatgagtgggcccagaaggct gcgtacctactgggctgcagcctggaggagctgtcctcagccatcttcaagcaccagcac aagggtggcaccctgcagcgctccacctccttccgccagggccccgaggagagtggcctg ggagatgggacaggcccgaaactgagtgcactggagtgccttgagggcatggcggccggc ctctacagcgagctcttcacccttctcgtctccctggtgaatagggctctcaagtccagc cagcactcactctgctccatgatgattgtcgacaccccgggcttccagaaccctgagcag ggtgggtcagcccgcggagcctcctttgaggagctgtgccacaactacacccaagaccgg ctgcagaggctcttccacgagcgcaccttcgtgcaggagttggaaagatacaaggagcct cagattctggccttggcacccacagcgagggcatctgtggatgcagaaggcatcacctta ctctccatttctgtcctcctgcgctctctggagcaggagtcgttccttgcccccttgtcc gtggcctctggctttggctacgttcttacttggtggagagtttactctccaggaagccta caggagaacatcgagctggcgtttgacgacttggaacccccgacggatgactctgtggct gctgtggaccaggcctcccatcagtccctggtccgctcgctggcccgcacagacgaggcg aggggcctgctctggctattggaagaggaggctctggtgccaggggccagtgaggacacc ctcctggagcgccttttctcctattatggcccccaggaaggtgacaaaaaaggccaaagc ccccttctgcacagcagcaaaccacaccactttctcctgggccacagccatggcaccaac tgggtagagtacaatgtgactggctggctgaactacaccaagcagaacccagccacccag aatgccccccggctcctgcaggactcccagaaaaaaatcatcagcaacctgtttctgggc cgcgcaggcagtgccacggtgctctctggctccatcgcgggcctggagggcggctcgcag ctggcactgcgccgggccaccagcatgcggaaaacctttaccacaggcatggcggctgtc aaaaagaagtcactgtgcatccagatgaagctacaggtggacgccctcatcgacaccatc aagaagtcaaagctgcattttgtgcactgcttcctgcctgtagctgagggctgggctggg gagccccgttccgcctcctcccgccgagtcagcagcagcagtgagctggacctgccctcg ggagaccactgcgaggctgggctcctgcagctcgacgtgcccctgctccgcacccagctc cgcggctcccgcctgctcgatgccatgcgcatgtaccgccaagtttggacagttcagctt tgcccacagcagcaaacccagtgccctttcggagccatctgcccagtgagaaatgaggtg ctaaatctttctgaagtacttaaaattcccaagaaaggttaccctgaccacatggtgttt tccgagttccgccgccgctttgatgtcctggccccgcacctgaccaagaaacacgggcgt aactacatcgtggtggatgaaaggcgggcagtggaggagctgctggagtgcttggatctg gagaagagcagctgctgcatgggcctgagccgggtgttcttccgggcgggcaccttggca cggctggaggagcagcgggatgaacaaaccagcaggaacctaaccctgttccaagcagcc tgcaggggctacctggcccgccagcacttcaagaagagaaagatccaggacctggccatt cgctgtgtacagaagaacatcaagaagaacaaaggggtgaaggactggccctggtggaag ctttttaccacagtgaggcccctcatcgaagtacagctgtcagaggagcagatccggaac aaagacgaggagatccagcagctgcggagcaagctcgagaaggcggagaaggagaggaac gagctgcggctcaacagtgaccggctggagagccggatctcagagctgacatcggagctg acagatgagcgtaacacaggagagtccgcctcccagctgctggacgcggagacagcagag aggctccgggctgagaaggagatgaaggaactgcagacccagtacgatgcactgaagaag cagatggaggttatggaaatggaggtgatggaggcccgtctcatccgggcagcggagatc aacggggaagtggatgatgatgatgcaggtggcgagtggcggctgaagtatgagcgggct gtgcgggaggtggacttcaccaagaaacggctccagcaggagtttgaggacaagctggag gtggagcagcagaacaagaggcagctggaacggcggctcggggacctgcaggcagatagt gaggagagtcagcgggctctgcagcagctcaagaagaagtgccagcgactgacggctgag ctgcaagacaccaagctgcacctggagggccagcaggtccgcaaccacgaactggagaag aagcagaggaggtttgacagtgagctctcgcaggcgcatgaggaggcccagcgggagaag ctgcagcgggagaagctgcagcgggagaaggacatgctcctcgctgaggctttcagcctg aagcagcaactagaggaaaaagacatggacattgcagggttcacccagaaggttgtgtct ctagaggcagagctccaggacatttcttcccaagagtccaaggatgaggcttctctggcc aaggtcaagaaacagctccgggacctggaggccaaagtcaaggatcaggaagaagagctg gatgagcaggcagggaccatccagatgctggaacagacctctagtaagggagttcagctg ctctggagaagctgctgccacgggagcaaggtgccagcaaagggaactggctggtggacg ttctcggcccctgagcctagggagagccatgcacaggccaagctgcgtctggagatggag atggagcggatgagacagacccattctaaggagatggagagtcgggatgaggaggtggag gaggcccggcagtcgtgtcagaagaagttaaaacagatggaggtgcagctagaggaagag tatgaggacaagcagaaggttctgcgagagaagcgggagctggagggcaagctcgccacc ctcagcgaccaggtgaaccggcgggactttgagtcagagaagcggctgcggaaggacctg aagcgcaccaaggccctgctggcagatgcccagctcatgctggaccacctgaagaacagt gctcccagcaagcgagagattgcccagctcaagaaccaggaatggctgacatctctccct ggggcgggggtggtctgtgtccagctggaggagtcagagttcacctgtgcggcagccgtg aaagcacggaaagcaatggaggtggagatcgaagacctgcacctgcagattgatgacatc gccaaagccaagacagcgctggaggagcagctgagccgccttcagcgtgagaagaatgag atccagaaccggctggaggaagatcaggaagacatgaacgaattgatgaagaagcacaag gctgccgtggctcaggtacccctgccaggagtgcccgtcacactctgcaccgtcaccatc cttgtggtcctcctcatgggtcccccgcacaaccgcttgttttgggaggcttcccgggac ctggctcagataaatgatctccaagctcagctagaagaagccaacaaagagaagcaggag ctgcaggagaagctacaagccctccagagccaggtggagttcctggagcagtccatggtg gacaagtccctggtgagcaggcaggaagctaagatacgggagctggagacacgcctggag tttgaaaggacgcaagtgaaacggctggagagcctggctagccgtctcaaggaaaacatg gagaagctgactgaggagcgggatcagcgcattgcagccgagaaccgggagaaggaacag aacaagcggctacagaggcagctccgggacaccaaggaggagatgggcgagcttgccagg aaggaggccgaggcgagccgcaagaagcacgaactggagatggatctagaaagcctggag gctgctaaccagagcctgcaggctgacctaaagttggcattcaagcgcatcggggacctg caggctgccattgaggatgagatggagagtgatgagaatgaggacctcatcaacagtgag ggagactctgatgtggactcggagctggaggaccgtgttgacggggtcaagtcctggttg tcaaaaaacaagggaccttccaaggcagcttctgatgatggcagcttaaagagttccagc ctctctaaggaggccccgggggtggaggagaggccgtcctcggtggtgagctccctgagc tatcggaagcggctcaccctaaaggactccatcgggggcaccggggacgcggattcgctc ttcacctccctgagcgagcgggcggcctcccctgagaggccccctaggaaggcccatgtg ggccccagggaggagccatgcccaggcaggaagtccgaggagccggaggagtgcggctcc gtccgctcggggaccggggggcgcgctggccgggggccgcagaagcggtggggctccgac ttcagcccggcctccaccgtctctgcaccggtcagccgggcctcctcggccacgcggcgg ggctctggcgaagacagggctggctcttccctgagtttctcgctgtcgggctcgcccggt tcccgccgcagcacctcccggctcgacagcctctccaggacactcagcccttccctgagc cgggcctctggcctaggccgggagagccctgattctcgcctgtccctgggccggagctgc ctagaggagtgggatgatggagccagtatggccctgagcgaggcctgctcgcagtacagc cacccgtcgctggcccgcagtctgtcggtgccaccccggccacgcagctctgcctcggca gtggatgagcctcccagctccagcgtccgctccgtcagccgtcactcctacctggaccca gacctggaggctgccgtcaacgaggtcttgagctacaagcctgttccattccagcggagc agcctggagcccgactccgaggaggatgacaggaagagcatccaaagtgcccggagcgcc caactggaccccccggagcgagctgccagcatccgccgctccgcctctgctgctgatgtg tcccggtcccgcagcggccggaagagccggagccggcgaaggagcggaaggagcagctcc agctccagctccagctccggctccgaagcttcctcagagcacaagaggcggaagaagggg cgctctcggaagagcaagaagtccaagtcgagaagaaagagaacggagacggagtctgag tcctcctcgtcgtcatccagcggctcgaccgtctccagccacagctgctccagtgtgaag aagggcccagctgcagaaagtgaagaaactgggcagacgcaccggccgtcgaggaaggag gaaaagaagcgcaagaaagaggtggacagcctgatgatgcggtacctgtaccggcccgag agcgactag