GENSCAN 1.0 Date run: 2-Nov-116 Time: 19:47:27 Sequence gi568815587r:46758986_47018379 : 259394 bp : 42.21% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.32 Intr - 457 284 174 0 0 58 89 229 0.874 18.13 1.31 Intr - 1799 1627 173 2 2 42 55 212 0.969 11.12 1.30 Intr - 3208 3015 194 2 2 76 82 186 0.956 15.09 1.29 Intr - 3777 3642 136 0 1 100 72 151 0.999 13.92 1.28 Intr - 4194 3991 204 0 0 31 90 105 0.900 3.47 1.27 Intr - 4645 4496 150 1 0 113 95 104 0.999 13.04 1.26 Intr - 6271 6146 126 1 0 43 116 128 0.998 11.06 1.25 Intr - 8678 8590 89 0 2 42 95 85 0.996 3.37 1.24 Intr - 11113 10978 136 1 1 106 93 83 0.999 9.82 1.23 Intr - 11997 11803 195 0 0 58 74 163 0.983 10.59 1.22 Intr - 17398 17270 129 1 0 69 88 157 0.997 13.77 1.21 Intr - 18567 18454 114 0 0 66 92 112 0.996 9.12 1.20 Intr - 19328 19154 175 1 1 63 76 102 0.999 5.52 1.19 Intr - 19614 19475 140 0 2 54 96 175 0.933 13.24 1.18 Intr - 21334 21209 126 1 0 98 70 100 0.964 9.16 1.17 Intr - 21500 21443 58 1 1 96 81 53 0.992 3.37 1.16 Intr - 24383 24289 95 2 2 105 106 33 0.990 4.64 1.15 Intr - 25688 25503 186 2 0 78 92 180 0.831 16.36 1.14 Intr - 31201 31091 111 1 0 111 67 67 0.841 6.56 1.13 Intr - 31598 31485 114 2 0 68 69 110 0.854 6.82 1.12 Intr - 36791 36609 183 2 0 97 33 119 0.755 6.36 1.11 Intr - 37955 37827 129 2 0 37 100 221 0.925 18.17 1.10 Intr - 38984 38820 165 2 0 59 87 83 0.948 4.54 1.09 Intr - 39187 39098 90 1 0 88 93 77 0.994 7.47 1.08 Intr - 42319 42215 105 1 0 104 103 100 0.999 12.59 1.07 Intr - 49159 49046 114 1 0 81 94 78 0.991 7.42 1.06 Intr - 50515 50415 101 2 2 84 87 127 0.999 11.11 1.05 Intr - 50889 50757 133 0 1 32 99 266 0.999 21.40 1.04 Intr - 52193 52022 172 1 1 62 94 124 0.998 9.42 1.03 Intr - 57419 57213 207 1 0 46 116 212 0.999 17.07 1.02 Intr - 59518 59325 194 1 2 74 106 147 0.999 12.47 1.01 Init - 62246 62190 57 2 0 74 116 72 0.999 9.96 1.00 Prom - 64970 64931 40 -8.35 2.04 PlyA - 65236 65231 6 1.05 2.03 Term - 67974 67642 333 0 0 -2 49 323 0.754 13.03 2.02 Intr - 68391 68353 39 0 0 72 100 37 0.380 0.80 2.01 Init - 69054 69034 21 0 0 74 85 15 0.473 -0.39 2.00 Prom - 72231 72192 40 -6.05 3.00 Prom + 72325 72364 40 -7.65 3.01 Init + 78593 78653 61 1 1 84 64 36 0.312 2.26 3.02 Intr + 86640 86914 275 1 2 55 27 269 0.793 13.83 3.03 Term + 87067 87522 456 0 0 95 41 189 0.740 9.04 3.04 PlyA + 89567 89572 6 1.05 4.39 PlyA - 89894 89889 6 1.05 4.38 Term - 97830 97681 150 0 0 101 45 172 0.765 11.13 4.37 Intr - 100330 100004 327 1 0 113 25 436 0.999 34.77 4.36 Intr - 103762 103621 142 0 1 66 83 179 0.998 14.63 4.35 Intr - 105550 105463 88 2 1 96 93 63 0.996 5.71 4.34 Intr - 106201 106134 68 0 2 87 115 64 0.999 6.63 4.33 Intr - 109129 108994 136 2 1 76 94 120 0.956 10.11 4.32 Intr - 109728 109615 114 1 0 75 97 64 0.951 5.50 4.31 Intr - 110147 110003 145 2 1 27 80 208 0.871 12.83 4.30 Intr - 112648 112540 109 0 1 109 84 144 0.881 15.47 4.29 Intr - 114249 114115 135 2 0 106 75 199 0.999 19.16 4.28 Intr - 114608 114390 219 1 0 45 98 237 0.954 16.90 4.27 Intr - 116118 115815 304 1 1 68 80 281 0.633 20.02 4.26 Intr - 116696 116471 226 2 1 128 66 318 0.677 30.24 4.25 Intr - 116981 116819 163 1 1 80 78 174 0.999 14.66 4.24 Intr - 117652 117481 172 2 1 95 64 246 0.999 20.98 4.23 Intr - 118354 118214 141 2 0 81 66 143 0.992 10.80 4.22 Intr - 120053 119922 132 0 0 91 109 81 0.995 10.20 4.21 Intr - 120330 120141 190 0 1 40 99 203 0.864 14.84 4.20 Intr - 122918 122717 202 1 1 78 116 230 0.943 23.17 4.19 Intr - 124991 124886 106 0 1 78 100 194 0.999 17.95 4.18 Intr - 127187 127106 82 2 1 63 85 83 0.978 3.79 4.17 Intr - 127548 127340 209 1 2 25 75 368 0.926 27.07 4.16 Intr - 130548 130426 123 1 0 26 89 156 0.806 9.14 4.15 Intr - 131108 130959 150 0 0 85 98 136 0.620 13.51 4.14 Intr - 131509 131292 218 0 2 102 84 207 0.645 18.92 4.13 Intr - 134144 133988 157 0 1 86 98 219 0.998 20.85 4.12 Intr - 135834 135604 231 1 0 65 91 367 0.998 31.52 4.11 Intr - 136306 136181 126 2 0 62 82 120 0.996 8.53 4.10 Intr - 137033 136899 135 0 0 59 105 73 0.967 5.72 4.09 Intr - 137350 137225 126 2 0 125 78 157 0.649 18.13 4.08 Intr - 137988 137884 105 1 0 58 41 152 0.364 6.77 4.07 Intr - 139692 139573 120 1 0 33 73 203 0.937 12.85 4.06 Intr - 140047 139895 153 2 0 102 59 237 0.636 21.32 4.05 Intr - 140518 140402 117 2 0 103 72 144 0.830 13.82 4.04 Intr - 140991 140878 114 1 0 83 52 177 0.907 13.10 4.03 Intr - 141393 141277 117 1 0 57 72 153 0.999 10.12 4.02 Intr - 143944 143798 147 2 0 125 86 153 0.999 18.09 4.01 Init - 151117 150853 265 1 1 85 63 115 0.497 5.92 4.00 Prom - 151336 151297 40 -4.25 5.00 Prom + 153215 153254 40 -9.55 5.01 Init + 154215 154440 226 2 1 62 0 125 0.143 -2.09 5.02 Intr + 155237 155309 73 0 1 108 62 54 0.298 2.35 5.03 Intr + 157778 158075 298 2 1 69 68 179 0.467 9.95 5.04 Intr + 158674 159276 603 0 0 103 36 361 0.433 24.15 5.05 Intr + 175015 175101 87 0 0 102 72 22 0.124 1.25 5.06 Term + 177582 177827 246 2 0 17 47 285 0.708 12.11 5.07 PlyA + 182711 182716 6 1.05 6.00 Prom + 186745 186784 40 -0.85 6.01 Init + 206028 206034 7 2 1 75 109 0 0.595 1.92 6.02 Intr + 209067 209153 87 1 0 59 77 52 0.329 0.22 6.03 Term + 228223 228326 104 2 2 80 39 160 0.225 7.76 6.04 PlyA + 229260 229265 6 1.05 7.03 PlyA - 230324 230319 6 1.05 7.02 Term - 235324 234447 878 2 2 67 38 249 0.760 9.55 7.01 Init - 236907 235473 1435 0 1 49 72 559 0.709 41.72 7.00 Prom - 237000 236961 40 -9.95 8.02 PlyA - 237169 237164 6 1.05 8.01 Sngl - 238405 237629 777 1 0 81 38 494 0.621 39.46 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 169064 169720 657 1 0 87 53 219 0.858 14.22 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:46758986_47018379|GENSCAN_predicted_peptide_1|1492_aa MGDDSEWLKLPVDQKCEHKLWKARLSGYEEALKIFQKIKDEKSPEWSKFLGLIKKFVTDS NAVVQLKGLEAALVYVENAHVAGKTTGEVVSGVVSKVFNQPKAKAKELGIEICLMYIEIE KGEAVQEELLKGLDNKNPKIIVACIETLRKALSEFGSKIILLKPIIKVLPKLFESREKAV RDEAKLIAVEIYRWIRDALRPPLQNINSVQLKELEEEWVKLPTSAPRPTRFLRSQQELEA KLEQQQSAGGDAEGGGDDGDEVPQIDAYELLEAVEILSKLPKDFYDKIEAKKWQERKEAL ESVEVLIKNPKLEAGDYADLVKALKKVVGKDTNVMLVALAAKCLTGLAVGLRKKFGQYAG HVVPTILEKFKEKKPQVVQALQEAIDAIFLTTTLQNISEDVLAVMDNKNPTIKQQTSLFI ARSFRHCTASTLPKSLLKPFCAALLKHINDSAPEVRDAAFEALGTALKVVGEKAVNPFLA DVDKLKLDKIKECSEKVELIHGKKAGLAADKKEFKPLPGRTAASGAAGDKDTKDISAPKP GPLKKAPAAKAGGPPKKGKPAAPGGAGNTGTKNKKGLETKEIVEPELSIEVCEEKASAVL PPTCIQLLDSSNWKERLACMEEFQKVMQMKLHIVALIAQKGNFSKTSAQVVLDGLVDKIG DVKCGNNAKEAMTAIAEACMLPWTAEQVVSMAFSQKNPKNQSETLNWLSNAIKEFGFSGL NVKAFISNVKTALAATNPAVRTAAITLLGVMYLYVGPSLRMFFEDEKPALLSQIDAEFEK MQGQSPPAPTRGISKHSTSGTDEGEDGDEPDDGSNDVVDLLPRTEISDKITSELVSKIGD KNWKIRKEGLDEVAGIINDAKFIQPNIGELPTALKGRLNDSNKILVQQTLNILQQLAVAM GPNIKQHVKNLGIPIITVLGDSKNNVRAAALATVNAWAEQTGMKEWLEGEDLSEELKKEN PFLRQELLGWLAEKLPTLRSTPTDLILCVPHLYSCLEDRNGDVRKKAQDALPFFMMHLGY EKMAKATGKLKPTSKDQVLAMLEKAKVNMPAKPAPPTKATSKPMGGSAPAKFQPASAPAE DCISSSTEPKPDPKKAKAPGLSSKAKSAQGKKMPSKTSLKEDEDKSGPIFIVVPNGKEQR MKDEKGLKVLKWNFTTPRDEYIEQLKTQMSSCVAKWLQDEMFHSDFQHHNKALAVMVDHL ESEKEGVIGCLDLILKWLTLRFFDTNTSVLMKALEYLKLLFTLLSEEEYHLTENEASSFI PYLVVKVGEPKDVIRKDVRAILNRMCLVYPASKMFPFIMEGTKSKNSKQRAECLEELGCL VESYGMNVCQPTPGKALKEIAVHIGDRDNAVRNAALNTIVTVYNVHGDQVFKLIGNLSEK DMSMLEERIKRSAKRPSAAPIKQVEEKPQRAQNISSNANMLRKGPAEDMSSKLNQARSMS GHPEAAQMVRREFQLDLDEIENDNGTVRCEMPELVQHKLDDIFEPVLIPEPN >gi568815587r:46758986_47018379|GENSCAN_predicted_CDS_1|4476_bp atgggagatgacagtgagtggttgaaactgccagttgatcagaaatgtgaacacaagctg tggaaagcaaggttaagtgggtatgaagaggccctgaagatcttccagaaaataaaggat gaaaagagcccagagtggtccaaatttttaggattgatcaaaaaatttgtcactgattcc aatgcagtggttcaattgaaaggattagaagctgcacttgtttatgttgaaaatgcccat gtagcaggaaaaaccacaggagaagttgtgtcaggtgttgtaagtaaggtgttcaatcaa cctaaagctaaagccaaggagctgggcatagagatctgtcttatgtacatagagattgag aaaggagaggctgttcaagaagagctcctgaaaggcttggacaataagaatcccaagatc atagtggcctgtatagagacactgaggaaagccttaagtgaatttggttccaaaatcatc ttgcttaagccaattatcaaagtgttgccaaaactctttgagtctcgagagaaggctgtt cgagatgaagccaaactaattgctgtggagatttacagatggattcgggatgctctgaga cccccattacaaaatataaactctgttcagttgaaagaactagaagaagaatgggtcaaa ctgccaacaagtgctcctagacctactcgatttcttcgttcccaacaagaactagaagct aaattggaacaacaacagtctgctggtggagatgctgaaggaggtggtgatgatggtgat gaggtgccacaaatagatgcttatgagcttttagaagctgtagaaatcctttccaaactt cccaaagacttttatgacaaaattgaggcaaaaaaatggcaagagagaaaagaggccctg gagtctgtagaagtactaataaaaaaccccaaactggaagctggcgattatgcagattta gtaaaagcattaaagaaggttgttggaaaggacaccaatgtcatgttggtggctttggca gcaaaatgtcttactggcctggctgttgggctaaggaagaaatttggacaatatgcagga catgttgtgccaaccatcttggagaaattcaaagagaagaaacctcaagtggtacaagcc ctgcaggaggcaattgatgcaatcttccttactaccacactacagaacatcagtgaggat gttttagcagtaatggataataaaaatccaaccatcaagcagcagacatctctttttatt gcaagaagtttccgccactgcactgcttctaccctgccaaagagcttgctaaagcccttt tgtgctgcactacttaagcacatcaatgattctgctcctgaagtcagagatgccgcattt gaagcattgggtactgctttgaaggtggttggcgagaaagcagtaaacccattcctagct gatgtggacaaactcaagcttgataagatcaaagaatgttcagaaaaggtagaactgata catggtaagaaagctggactagctgctgataagaaggaattcaaacctctgcctggaagg actgctgcttcaggggctgcaggagataaggacacaaaggacatttctgcacccaaacca ggacctctaaaaaaggcacctgctgctaaggctggtgggccaccaaaaaaggggaaacca gctgcaccaggaggcgcagggaatactggaaccaagaacaagaaaggactggagactaaa gaaatagtggagcctgagctctcgatagaagtatgtgaagaaaaagcttcagctgttctt ccccctacctgtatacagcttcttgacagcagtaactggaaagaaaggctggcttgtatg gaagagttccagaaggtgatgcaaatgaagcttcatatagttgctttgattgcccagaag ggaaatttttccaaaacgtcagctcaggttgtattagatggccttgtggacaagattgga gatgtgaaatgtgggaacaatgcaaaagaagctatgacagcaatagccgaagcctgtatg ttaccatggactgctgaacaggttgtgtcaatggctttctcacaaaagaatcccaaaaat cagtcagaaactctgaattggctatcaaatgccataaaagaatttggtttttctgggttg aatgtcaaagctttcattagcaatgtgaagacagctcttgctgcaacaaacccagctgtg aggactgctgccataaccctgcttggcgtgatgtatctgtatgttggtccctctttgcga atgttctttgaggatgagaagcctgccctcctatcccagatagatgcagaatttgagaag atgcagggacaaagtccacctgctccaaccagaggaatttccaagcatagcacaagtggt acagatgaaggagaagatggagatgaaccagatgacgggagcaatgatgtcgttgatctt ttgccgaggacggagatcagtgataaaatcacttcagagttggtatctaagattggtgat aagaattggaagattaggaaagaaggcctagatgaagtggcaggtattattaatgacgca aaatttatccaaccgaatataggtgaacttccaactgccttgaagggtcgactcaatgat tcaaataaaatcttggtacagcaaacgctgaatatcctgcaacaactggcagtagccatg ggcccaaatattaagcaacatgtaaaaaatttaggcatccctatcatcacagtccttgga gacagcaagaacaatgttcgagctgctgccctagcgactgtgaatgcttgggcagaacag actggcatgaaggaatggctggaaggagaagatctttctgaagagctcaaaaaggaaaat cctttcttgaggcaagagcttctgggctggctggctgagaaactacctactcttcgttcc acccctacagaccttatcctttgtgttcctcatctctactcctgcctagaagatcgaaat ggagatgtgcgaaagaaggcccaagatgccttgccattcttcatgatgcatttaggatat gaaaaaatggccaaggctactgggaaactaaagccaacttctaaagatcaggtattggcc atgctagagaaagccaaagttaacatgccagccaagcctgctccacccactaaagcaact tctaaaccaatgggagggtccgctccagccaaattccagcctgcatcagcacctgctgaa gattgtatttccagcagtacagaacccaaacctgatccaaaaaaggccaaagctccagga ttatcctctaaagcaaagagtgcacaagggaagaagatgccaagcaaaaccagcttaaag gaggatgaagacaaatccgggcctatttttattgttgttccaaatggaaaagagcaaagg atgaaagatgaaaaaggattgaaggtgctaaagtggaattttactaccccacgggatgaa tacattgagcaactaaagactcaaatgtctagctgtgtggctaaatggttacaagatgag atgtttcactcagactttcagcatcataacaaagcccttgctgttatggttgatcacttg gagagtgaaaaagaaggagttattggttgcctggatcttatcttaaagtggcttaccctg aggttttttgacaccaatacaagcgtcctgatgaaagcactagaatatttaaaattgctc ttcaccttgctaagtgaagaagaatatcatcttactgagaatgaagcatcttccttcatc ccctatcttgtcgtcaaggttggagaaccaaaggatgtcattcgtaaagatgttcgtgcc atcctgaaccggatgtgccttgtctacccagctagcaagatgtttccctttatcatggaa ggaaccaaatccaaaaactctaagcagagagcagagtgcctggaagagctgggatgtctg gttgagtcctatggcatgaatgtttgccaaccaaccccaggaaaagccttaaaggaaata gctgttcacataggagaccgtgacaatgctgtacgcaatgctgcactcaacaccattgta acggtgtacaatgtacatggggatcaggtgttcaaactgattggaaatctttctgaaaag gatatgagcatgctcgaggagaggattaagcggtcagcaaagagaccctctgctgcacca ataaaacaggtggaagagaaacctcagcgtgcacagaacataagctccaatgccaacatg ttacgcaagggaccagctgaggacatgtcttccaaactcaaccaagcccgaagcatgagt gggcatcctgaggcagcccagatggtccgccgagaattccagctggatctagatgagatt gagaatgacaatggtacagtccgatgtgaaatgccagaacttgttcagcacaaactggat gacatttttgagccagtccttattcctgaacccaan >gi568815587r:46758986_47018379|GENSCAN_predicted_peptide_2|130_aa MAQGFKEDSSSFVKNEFREKLSECAERGAEGGAGTLTVALRAPPSEEAFPSGPQHQCCCR RRRRLPGIPADPARTLASLTSPCPGSVSVKPRAKGKHPATVLQVLVDSTGDFAGLSEAKA AKERKQCRLQ >gi568815587r:46758986_47018379|GENSCAN_predicted_CDS_2|393_bp atggcccaaggctttaaggaggattcctccagttttgtaaagaacgaattccgtgaaaag ctttcagagtgtgcggagaggggagctgagggtggggcgggcactctgacagtggcgctc cgggccccgccctcggaggaggcgtttcccagcggaccgcagcaccaatgctgctgccgc cgccgccgccgcctgccgggtattcctgctgatcctgctcggaccttagcttccctgacc tctccctgcccggggagcgtaagtgtcaaaccacgggcaaagggcaagcatcccgcgacc gtgctgcaagtcctggtggacagtactggggactttgctggcctatcagaggctaaggcg gcgaaagagcgtaaacaatgccggctccagtga >gi568815587r:46758986_47018379|GENSCAN_predicted_peptide_3|263_aa MADSSTGTRNTQDEPGTFPDRLEGHQNAPDTLRERKPVVKGGVVDDRNRSHRSKPAGTQV CGQAPVPLSLLQGDKALRNHHSPRRRLLIHSLDNYARGTLAGPALGAPALVLVPRPPARF AASDLPGSWGKWGARLETSSRPGFRGSSARAMPALEGLAGGAGPPTTHLSWVRTKRPALS RLKPLGPPHTPQCPRSLLVLDGPPIGLLLVHLKGLWARAARSLKGAARSRVSLWTRRRLH GWGRWSLPGASLPTPERKAKECV >gi568815587r:46758986_47018379|GENSCAN_predicted_CDS_3|792_bp atggctgattctagcactgggacaagaaatacacaagatgagcctggaacatttccagat aggctggaaggacaccagaacgctcccgacacgttgagagagagaaagccagtcgttaag ggcggggtggtggatgatcggaatcgttcacatcgcagcaagccagcagggacacaggtg tgcggccaggccccggtgccgctgtccttgctccagggcgacaaggcccttcgcaaccac cacagccctcgtaggcgcttacttattcattcacttgacaattacgcgagaggcactcta gcagggccggctctcggggcccccgccctagtcttggtcccccggcctccggctcgcttc gccgcctcagacctgccgggttcgtggggcaagtggggcgcgcgtctggagacctccagc cgcccaggattcaggggctcctctgcgcgcgcgatgcctgcactcgagggcctcgcgggt ggggcaggcccaccgaccactcacctcagctgggtcagaaccaagcggcctgctctaagc cgtttgaaaccgcttgggccgccgcacaccccgcagtgtcctcgttctttattggtcctc gacggaccgcccatcggacttttattggtccatttgaaaggcctgtgggcacgcgccgcc cgcagccttaaaggggccgcgcgcagccgtgtttccctctggaccagacgccgcctgcac gggtggggacgctggtcgctgccgggggcaagtctgccgactccagaaagaaaggcgaaa gaatgtgtgtag >gi568815587r:46758986_47018379|GENSCAN_predicted_peptide_4|1987_aa MSLSKRATEEKTSPGRFSKPTKLSWPLGIGNKRDHGTEISRNMACQFHSRIIQLVKGTLV GERSKQNLGRTKRLPCKREDSCSLQHWLGLASSPECACGRSHFTCAVSALGECTCIPAQW QCDGDNDCGDHSDEDGCILPTCSPLDFHCDNGKCIRRSWVCDGDNDCEDDSDEQDCPPRE CEEDEFPCQNGYCIRSLWHCDGDNDCGDNSDEQCDMRKCSDKEFRCSDGSCIAEHWYCDG DTDCKDGSDEENCPSAVPAPPCNLEEFQCAYGRCILDIYHCDGDDDCGDWSDESDCCEYS GQLGASHQPCRSGEFMCDSGLCINAGWRCDGDADCDDQSDERNCKQFRCHSGRCVRLSWR CDGEDDCADNSDEENCENTGSPQCALDQFLCWNGRCIGQRKLCNGVNDCGDNSDESPQQN CRPRTGEENCNVNNGGCAQKCQMVRGAVQCTCHTGYRLTEDGHTCQDVNECAEEGYCSQG CTNSEGAFQCWCETGYELRPDRRSCKALGPEPVLLFANRIDIRQVLPHRSEYTLLLNNLE NAIALDFHHRRELVFWSDVTLDRILRANLNGSNVEEVVSTGLESPGGLAVDWVHDKLYWT DSGTSRIEVANLDGAHRKVLLWQNLEKPRAIALHPMEGTIYWTDWGNTPRIEASSMDGSG RRIIADTHLFWPNGLTIDYAGRRMYWVDAKHHVIERANLDGSHRKAVISQVFEDSLYWTD WHTKSINSANKFTGKNQEIIRNKLHFPMDIHTLHPQRQPAGKNRCGDNNGGCTHLCLPSG QNYTCACPTGFRKISSHACAQSLDKFLLFARRMDIRRISFDTEDLSDDVIPLADVRSAVA LDWDSRDDHVYWTDVSTDTISRAKWDGTGQEVVVDTSLESPAGLAIDWVTNKLYWTDAGT DRIEVANTDGSMRTVLIWENLDRPRDIVVEPMGGYMYWTDWGASPKIERAGMDASGRQVI ISSNLTWPNGLAIDYGSQRLYWADAGMKTIEFAGLDGSKRKVLIGSQLPHPFGLTLYGER IYWTDWQTKSIQSADRLTGLDRETLQENLENLMDIHVFHRRRPPVSTPCAMENGGCSHLC LRSPNPSGFSCTCPTGINLLSDGKTCSPGMNSFLIFARRIDIRMVSLDIPYFADVVVPIN ITMKNTIAIGVDPQEGLQTTDGLAVDAIGRKVYWTDTGTNRIEVGNLDGSMRKVLVWQNL DSPRAIVLYHEMGFMYWTDWGENAKLERSGMDGSDRAVLINNNLGWPNGLTVDKASSQLL WADAHTERIEAADLNGANRHTLVSPVQHPYGLTLLDSYIYWTDWQTRSIHRADKGTGSNV ILVRSNLPGLMDMQAVDRAQPLGFNKCGSRNGGCSHLCLPRPSGFSCACPTGIQLKGDGK TCDPSPETYLLFSSRGSIRRISLDTSDHTDVHVPVPELNNVISLDYDSVDGKVYYTDVFL DVIRRADLNGSNMETVIGRGLKTTDGLAVDWVARNLYWTDTGRNTIEASRLDGSCRKVLI NNSLDEPRAIAVFPRKGYLFWTDWGHIAKIERANLDGSERKVLINTDLGWPNGLTLDYDT RRIYWVDAHLDRIESADLNGKLRQVLVSHVSHPFALTQQDRWIYWTDWQTKSIQRVDKYS GRNKETVLANVEGLMDIIVVSPQRQTGTNACGVNNGGCTHLCFARASDFVCACPDEPDSR PCSLVPGLVPPAPRATGMSEKSPVLPNTPPTTLYSSTTRTRTSLEEVEGRCSERDARLGL CARSNDAVPAAPGEGLHISYAIGGLLSILLILVVIAALMLYRHKKSKFTDPGMGNLTYSN PSYRTSTQEVKIEAIPKPAMYNQLCYKKEGGPDHNYTKEKIKIVEGICLLSGDDAEWDDL KQLRSSRGGLLRDHVCMKTDTVSIQASSGSLDDTETEQLLQEEQSECSSVHTAATPERRG SLPDTGWKHERKLSSESQTHAVENLLTDITFKSFVDVGTVKSNKECEVPAGVFSSSGEDR SRPEFLD >gi568815587r:46758986_47018379|GENSCAN_predicted_CDS_4|5964_bp atgtccctttcgaaacgggccactgaggagaagacttcacctggtagattttccaaacca accaagttaagctggcctctaggaatcgggaataaaagggaccatgggacagaaataagt agaaacatggcgtgtcagtttcacagccggataatacagctggtgaaaggaactcttgta ggagaaaggagtaaacaaaacctgggtaggacaaagaggctgccttgcaaacgagaagat tcctgttctctccagcactggttaggcctggccagcagccccgagtgtgcttgtggtcgg agccacttcacatgtgcagtgagtgctcttggagagtgtacctgcatccctgcccagtgg cagtgtgatggagacaatgactgcggggaccacagcgatgaggatggatgtatactacct acctgttcccctcttgactttcactgtgacaatggcaagtgcatccgccgctcctgggtg tgtgacggggacaacgactgtgaggatgactcggatgagcaggactgtcccccccgggag tgtgaggaggacgagtttccctgccagaatggctactgcatccggagtctgtggcactgc gatggtgacaatgactgtggcgacaacagcgatgagcagtgtgacatgcgcaagtgctcc gacaaggagttccgctgtagtgacggaagctgcattgctgagcattggtactgcgacggt gacaccgactgcaaagatggctccgatgaggagaactgtccctcagcagtgccagcgccc ccctgcaacctggaggagttccagtgtgcctatggacgctgcatcctcgacatctaccac tgcgatggcgacgatgactgtggagactggtcagacgagtctgactgctgtgagtactct ggccagctgggagcctcccaccagccctgccgctctggggagttcatgtgtgacagtggc ctgtgcatcaatgcaggctggcgctgcgatggtgacgcggactgtgatgaccagtctgat gagcgcaactgcaaacagttccgctgtcactcaggccgctgtgtccgcctgtcctggcgc tgtgatggggaggacgactgtgcagacaacagcgatgaagagaactgtgagaatacagga agcccccaatgtgccttggaccagttcctgtgttggaatgggcgctgcattgggcagagg aagctgtgcaacggggtcaacgactgtggtgacaacagcgacgaaagcccacagcagaat tgccggccccggacgggtgaggagaactgcaatgttaacaacggtggctgtgcccagaag tgccagatggtgcggggggcagtgcagtgtacctgccacacaggctaccggctcacagag gatgggcacacgtgccaagatgtgaatgaatgtgccgaggaggggtattgcagccagggc tgcaccaacagcgaaggggctttccaatgctggtgtgaaacaggctatgaactacggccc gaccggcgcagctgcaaggctctggggccagagcctgtgctgctgttcgccaatcgcatc gacatccggcaggtgctgccacaccgctctgagtacacactgctgcttaacaacctggag aatgccattgcccttgatttccaccaccgccgcgagcttgtcttctggtcagatgtcacc ctggaccggatcctccgtgccaacctcaacggcagcaacgtggaggaggttgtgtctact gggctggagagcccagggggcctggctgtggattgggtccatgacaaactctactggacc gactcaggcacctcgaggattgaggtggccaatctggatggggcccaccggaaagtgttg ctgtggcagaacctggagaagccccgggccattgccttgcatcccatggagggtaccatt tactggacagactggggcaacaccccccgtattgaggcctccagcatggatggctctgga cgccgcatcattgccgatacccatctcttctggcccaatggcctcaccatcgactatgcc gggcgccgtatgtactgggtggatgctaagcaccatgtcatcgagagggccaatctggat gggagtcaccgtaaggctgtcattagccaggtgtttgaagacagcctgtactggacagac tggcacaccaagagcatcaatagcgctaacaaatttacggggaagaaccaggaaatcatt cgcaacaaactccacttccctatggacatccacaccttgcacccccagcgccaacctgca gggaaaaaccgctgtggggacaacaacggaggctgcacgcacctgtgtctgcccagtggc cagaactacacctgtgcctgccccactggcttccgcaagatcagcagccacgcctgtgcc cagagtcttgacaagttcctgctttttgcccgaaggatggacatccgtcgaatcagcttt gacacagaggacctgtctgatgatgtcatcccactggctgacgtgcgcagtgctgtggcc cttgactgggactcccgggatgaccacgtgtactggacagatgtcagcactgataccatc agcagggccaagtgggatggaacaggacaggaggtggtagtggataccagtttggagagc ccagctggcctggccattgattgggtcaccaacaaactgtactggacagatgcaggtaca gaccggattgaagtagccaacacagatggcagcatgagaacagtactcatctgggagaac cttgatcgtcctcgggacatcgtggtggaacccatgggcgggtacatgtattggactgac tggggtgcgagccccaagattgaacgagctggcatggatgcctcaggccgccaagtcatt atctcttctaatctgacctggcctaatgggttagctattgattatgggtcccagcgtcta tactgggctgacgccggcatgaagacaattgaatttgctggactggatggcagtaagagg aaggtgctgattggaagccagctcccccacccatttgggctgaccctctatggagagcgc atctattggactgactggcagaccaagagcatacagagcgctgaccggctgacagggctg gaccgggagactctgcaggagaacctggaaaacctaatggacatccatgtcttccaccgc cgccggcccccagtgtctacaccatgtgctatggagaatggcggctgtagccacctgtgt cttaggtccccaaatccaagcggattcagctgtacctgccccacaggcatcaacctgctg tctgatggcaagacctgctcaccaggcatgaacagtttcctcatcttcgccaggaggata gacattcgcatggtctccctggacatcccttattttgctgatgtggtggtaccaatcaac attaccatgaagaacaccattgccattggagtagacccccaggaagggctacagaccaca gatgggctcgcggttgatgccattggccggaaagtatactggacagacacgggaacaaac cggattgaagtgggcaacctggacgggtccatgcggaaagtgttggtgtggcagaacctt gacagtccccgggccatcgtactgtaccatgagatggggtttatgtactggacagactgg ggggagaatgccaagttagagcggtccggaatggatggctcagaccgcgcggtgctcatc aacaacaacctaggatggcccaatggactgactgtggacaaggccagctcccaactgcta tgggccgatgcccacaccgagcgaattgaggctgctgacctgaatggtgccaatcggcat acattggtgtcaccggtgcagcacccatatggcctcaccctgctcgactcctatatctac tggactgactggcagactcggagcatccaccgtgctgacaagggtactggcagcaatgtc atcctcgtgaggtccaacctgccaggcctcatggacatgcaggctgtggaccgggcacag ccactaggttttaacaagtgcggctcgagaaatggcggctgctcccacctctgcttgcct cggccttctggcttctcctgtgcctgccccactggcatccagctgaagggagatgggaag acctgtgatccctctcctgagacctacctgctcttctccagccgtggctccatccggcgt atctcactggacaccagtgaccacaccgatgtgcatgtccctgttcctgagctcaacaat gtcatctccctggactatgacagcgtggatggaaaggtctattacacagatgtgttcctg gatgttatcaggcgagcagacctgaacggcagcaacatggagacagtgatcgggcgaggg ctgaagaccactgacgggctggcagtggactgggtggccaggaacctgtactggacagac acaggtcgaaataccattgaggcgtccaggctggatggttcctgccgcaaagtactgatc aacaatagcctggatgagccccgggccattgctgttttccccaggaaggggtacctcttc tggacagactggggccacattgccaagatcgaacgggcaaacttggatggttctgagcgg aaggtcctcatcaacacagacctgggttggcccaatggccttaccctggactatgatacc cgcaggatctactgggtggatgcgcatctggaccggatcgagagtgctgacctcaatggg aaactgcggcaggtcttggtcagccatgtgtcccacccctttgccctcacacagcaagac aggtggatctactggacagactggcagaccaagtcaatccagcgtgttgacaaatactca ggccggaacaaggagacagtgctggcaaatgtggaaggactcatggatatcatcgtggtt tcccctcagcggcagacagggaccaatgcctgtggtgtgaacaatggtggctgcacccac ctctgctttgccagagcctcggacttcgtatgtgcctgtcctgacgaacctgatagccgg ccctgctcccttgtgcctggcctggtaccaccagctcctagggctactggcatgagtgaa aagagcccagtgctacccaacacaccacctaccaccttgtattcttcaaccacccggacc cgcacgtctctggaggaggtggaaggaagatgctctgaaagggatgccaggctgggcctc tgtgcacgttccaatgacgctgttcctgctgctccaggggaaggacttcatatcagctac gccattggtggactcctcagtattctgctgattttggtggtgattgcagctttgatgctg tacagacacaaaaaatccaagttcactgatcctggaatggggaacctcacctacagcaac ccctcctaccgaacatccacacaggaagtgaagattgaagcaatccccaaaccagccatg tacaaccagctgtgctataagaaagagggagggcctgaccataactacaccaaggagaag atcaagatcgtagagggaatctgcctcctgtctggggatgatgctgagtgggatgacctc aagcaactgcgaagctcacgggggggcctcctccgggatcatgtatgcatgaagacagac acggtgtccatccaggccagctctggctccctggatgacacagagacggagcagctgtta caggaagagcagtctgagtgtagcagcgtccatactgcagccactccagaaagacgaggc tctctgccagacacgggctggaaacatgaacgcaagctctcctcagagagccagacccat gctgtggaaaacctgcttacagacatcacttttaagtcctttgtggatgtgggcacagtg aagagcaataaagagtgtgaggttcctgctggagttttcagctcatctggagaagacagg tcgaggccagagttccttgactga >gi568815587r:46758986_47018379|GENSCAN_predicted_peptide_5|510_aa MKSAWMGVRGAQGHLGCLLLAKSKTHNRKWRQTTVRKLENWLFLTAALGYGQLSAAPKNH ITGTNLKRRNKRDNRGNNPEVQGDFAQGHQELPEPRFKPRGLRGKFHPLSLQPEGGNPPK SKAVPLRGPCGGKNSYIPTPVGPLSQHAKEATNNLPGESRGGESAERSGTWAGQATALRR ERAQSALERTRKKAPENPQPPFQPPPFRTIQGWKLTGRSQERDPETPPRGDAARKQCPRA PWWEQDLESGNRSPSRGIPVPKYQILPGPLSATRPSPQPQEGGLFTIYGFSDPPHPPPFL RTHSFAPRTQSPRPNLKGKQPPLLKEQRGRASERTPRTLPAPRPRPAPRPRPEPAEAPPL MPPLGLLEPGPLRVLSPARSPRATARSEGEGSQAPARAVQVIIHAYYRKFKTIEKYKEEK CKQSVIPLPIPLKPSLQEWESKETWTIVKRLPPELEWPAPLCDTETVCVTPPPAPVPLPP RGCHGNGERRQPGFRKPESWSFEATPVKGC >gi568815587r:46758986_47018379|GENSCAN_predicted_CDS_5|1533_bp atgaagtctgcttggatgggcgtaagaggcgcccaagggcacctaggatgtttgcttctt gctaaatcaaaaacgcacaacagaaaatggaggcagacaaccgtcaggaagctagagaat tggctctttttaacagctgctcttggttacggtcaactttccgcagctcccaaaaatcac atcactggcaccaacttaaaaagaagaaacaaaagagacaacaggggtaacaaccctgag gttcagggagattttgcacagggtcaccaagaattgccagagccaagattcaaacccagg gggctgagaggaaagttccatcccctttccttgcaacccgaagggggaaaccccccaaaa agcaaagctgtccctttaaggggcccatgtggtgggaagaacagctacattcctacccca gtagggccgctgtcccaacacgccaaagaggctacaaataaccttccaggagagagccgg ggaggggagagcgcagagcggagcggaacgtgggccggccaagctaccgctcttcggaga gaaagagctcagtccgcgctagagaggacgcggaaaaaagcccctgaaaacccacagccc cctttccaaccacctccctttaggacgatccagggctggaagctgactggccgcagccaa gagagggaccccgaaacaccgccgaggggagatgctgcccgcaaacagtgccccagagcc ccttggtgggagcaagacctggagtccgggaatcgttctccgagtagaggaatcccggtg cccaagtaccagattctaccggggcctctctccgcaacccgcccgagcccccagccccag gaaggagggctatttacaatttatgggtttagcgaccctcctcacccgcctccctttctt cgcacgcattcattcgcaccaaggacacagagcccccgacctaacctgaaagggaagcag cccccgctcttgaaagagcagcgagggagggcgagcgaacgaacgccccggaccttaccc gcgccccggcccagacccgcgccccggcccagacccgaacccgcggaagcgcctccgctg atgcccccgctcggactgctggagccggggccgctccgggtcctctcccctgcacgcagc cccagggccacggctaggagcgagggcgaggggtctcaggccccggcccgcgccgtccag gttataatacatgcctattatagaaagtttaagactatagaaaaatataaagaagaaaaa tgtaaacagtcagtaattccactaccaattcctctaaaaccaagtcttcaagaatgggag tccaaagaaacctggaccatcgtgaagcgactgcccccggaacttgagtggccagctcca ctttgcgacacggagactgtttgtgtgacacctcctccagctcccgtcccccttccgccg cgtgggtgccatggcaacggagagagacggcaacctgggttccggaagccggagagctgg agctttgaagccaccccggtcaaaggatgctga >gi568815587r:46758986_47018379|GENSCAN_predicted_peptide_6|65_aa MTGSEQRRLLFMLAETGPFGDLAVISLDFSEAQRHVLTYMEDAVCQLLENREDISQYGIA RFFTE >gi568815587r:46758986_47018379|GENSCAN_predicted_CDS_6|198_bp atgacagggtcagaacagaggagacttcttttcatgctggctgaaactggcccatttgga gatttggctgttatctctcttgacttctcagaagctcagcgacatgtcctcacctacatg gaggatgcagtgtgccagctgctagaaaacagggaagatattagccaatatggaattgcc aggttcttcactgaatag >gi568815587r:46758986_47018379|GENSCAN_predicted_peptide_7|770_aa MGDFNTPLSTYDRSTRQKVNKDIQELNSALHQADLVDIYRTLHPISTEYTFYSAPHRTYS KIDHIVGSKALLSKCKTPEIITNCLSDHSAIKLELSMKKLSQNQSTTWKLNNLLLNDYWV HNEMKAEIKMFFETNENKDTTYQNLWDTFKAVCRGKLIALNAHKGKQERSKTDTLTSQLK ELKKQEQTNSKASRRQEITKIRAELKEIETQKNFQKINESRSWFFEKINKIDRPLARLIK KKREKNQIDAIKNDKGDITTDPTEIQTTIRECYKHLYANKLENLEEMDKFLDTYTLPRLN QEEVESLSRPITGSEIKAIINSLPTKKSPGPDGFTAEFYQRYKEELVLFLLKLFQSIEKE GILPNSFYEASIILIPKPGRDITKKENFRPISLMNIDAKILNKILANRIQQYIKKLIHHD QVGFIPGMQDWFNIHKSINVIHHINRTKDKNHMIISTDAEKAFDKIQQPFLLKTLNKSVL EVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIIYLENPIISAQNLLKLISNFSKVSGYK INVQKSQAFLYTNNRQTESKIMSERPFTITSKRIKYPGIQLTRDVKDLFKENYKPLLNEI KEDTNKRKNIPCSWMGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKYIWNQK RALISKTILSQKNKAGDIMLSDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIIPHIY NHLIFDKPDKNKKWGKDSLFNKWFWENWLAICRKLKLDPFLTPYTKINSR >gi568815587r:46758986_47018379|GENSCAN_predicted_CDS_7|2313_bp atgggagactttaacaccccactgtcaacatatgacagatcaacgagacagaaagttaac aaggatatccaggaattgaactcagctctgcaccaagcagacctagtagacatctacaga actctccaccccatatcaacagaatatacattctactcagcaccacatcgcacttattcc aaaattgaccacatagttggtagtaaagcactcctcagcaaatgtaaaacaccagaaatt ataacaaactgtctctcagaccacagtgcaatcaaactagaactcagcatgaagaaactc tctcaaaaccaatcaactacatggaaactgaacaacttgctcctgaatgactactgggta cataatgaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacaca acataccagaatctctgggacacatttaaagcagtgtgtagagggaaattgatagcacta aatgcccacaagggaaagcaggaaagatctaaaactgacaccctaacatcacaattaaaa gaactaaagaagcaagagcaaacaaattcaaaagctagcagaaggcaagaaataactaag atcagagcagaactgaaggagatagagacacaaaaaaactttcaaaaaatcaatgaatcc aggagctggttttttgaaaagatcaacaaaattgatagaccactagcaagactaataaag aagaaaagagagaagaatcaaatagacgcaataaaaaatgataaaggggatattaccact gatcccacagaaatacaaactaccatcagagaatgctataaacacctctacgcaaataaa ctagaaaatctagaagaaatggataaattcctggacacatacactctcccaagactaaac caggaagaagttgaatccctgagtagaccaataacaggctctgaaattaaggcaataatt aatagcctaccaaccaaaaaaagtccaggaccagatggattcacagctgaattctaccag aggtacaaagaggagctggtactattccttctgaaactattccaatcaatagaaaaagag ggaatcctccctaactcattttatgaggccagcatcatcctgataccaaagcctggcaga gacataacaaaaaaagagaattttagaccaatatccctgatgaacatcgatgcaaaaatc ctcaataaaatactggcaaaccgaatccagcagtacatcaaaaagcttatccaccatgat caagtgggcttcatccctgggatgcaagactggttcaacatacacaaatcaataaatgta atccatcatataaacagaaccaaagacaaaaaccacatgattatctcaacagatgcagaa aaggcctttgacaaaattcaacagcccttcctgctaaaaactctcaataaatcagtgttg gaagttctggccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaa gaggaagtcaaattgtccctctttgcagatgacatgattatatatttagaaaaccccatc atctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaa atcaatgtgcaaaaatcacaagcattcctatacaccaataacagacaaactgagagcaaa atcatgagtgaacgcccattcacaattacttcaaagagaataaaatacccaggaatccaa cttacaagggatgtgaaggacctcttcaaggagaattacaaaccactgctcaacgaaata aaagaggacacaaacaaaaggaagaacattccatgctcatggatgggaagaatcaatatc gtgaaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaagcta ccaatgactttcttcacagaactggaaaaaactactttaaagtacatatggaaccaaaaa agagccctcatttccaagacaatcctaagccaaaagaacaaagctggagacatcatgcta tctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaa aacagagatatagaccaatggaacagaacagagccctcagaaataataccacacatctac aaccatctgatctttgacaaacctgacaaaaacaagaaatggggaaaggattccctattt aataaatggttctgggaaaactggctagccatatgtagaaagctgaaactggatcccttc cttacaccttatacaaaaattaattcaagatga >gi568815587r:46758986_47018379|GENSCAN_predicted_peptide_8|258_aa MAQELCDTHTSFSSRFDQLEKRVRVIEDQMNEMKQEEKFREKRVKRNEQSLQEIWDYVKR PNLHLIGVPESNAENRTKLENTLQDIIQENFPNLARKANIQIQEIQRTPQRYSSRRATPR HKIVRFIKAEMKEKMLRTARGKGRVTRKGKPIRLTADLLAETLQARREWGPIFNILKEKN FQPRISYPVKLSFISEGEIKSFTDKQMLRDFITTRRAPEGSTKRGKEKPVPATAKTCQIV KTINARKKLHQLKSKIIS >gi568815587r:46758986_47018379|GENSCAN_predicted_CDS_8|777_bp atggcacaagaactatgtgacacacacacaagcttcagtagccgatttgatcaactggaa aaaagggtaagagtgattgaagatcaaatgaatgaaatgaagcaagaagagaagtttaga gaaaaaagagtaaaaagaaacgaacaaagcctccaagaaatatgggactatgtgaaaaga ccaaatctacatctgattggtgtacctgaaagtaatgcggagaacagaaccaagttggaa aacactcttcaggatattatccaggagaacttccccaacctagcgaggaaagccaacatt caaattcaggaaatacagagaacaccacaaagatactcctcgagaagagcaactccaaga cacaaaattgtcagattcatcaaagctgaaatgaaggaaaaaatgttaaggacagccaga gggaaaggtcgggttacccgcaaagggaagcccatcagactaacagcagatctcttggca gaaactctacaagccagaagagagtgggggccaatattcaacattcttaaagaaaagaat tttcaacccagaatttcatacccagtcaaactaagcttcataagtgaaggagaaataaaa tcctttacagacaagcaaatgctgagagattttatcaccaccaggcgagcccctgaagga agcactaaacgtggaaaggaaaaaccagtaccagccactgcaaaaacatgccaaattgta aagaccatcaatgctaggaagaaactgcatcaactaaagagcaaaataatcagctaa