GENSCAN 1.0 Date run: 4-Nov-116 Time: 22:08:45

Sequence gi568815595r:121531849_121758347 : 226499 bp : 40.71% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 Intr -   1361   1142  220  1  1   85   94   133 0.973  10.98
 1.05 Intr -   5360   5252  109  0  1   47  111    70 0.990   3.62
 1.04 Intr -   7741   7585  157  1  1   59   82    99 0.925   5.06
 1.03 Intr -   9631   9501  131  2  2   38   95   111 0.787   6.29
 1.02 Intr -  13058  12879  180  0  0   79   68    52 0.541   1.22
 1.01 Init -  14029  13867  163  1  1   56   47   170 0.506   9.64
 1.00 Prom -  17476  17437   40                              -8.05

 2.00 Prom +  27476  27515   40                              -6.05
 2.01 Init +  27849  28035  187  2  1   61   34   153 0.777   6.47
 2.02 Intr +  38840  38968  129  0  0   70   81    68 0.191   4.05
 2.03 Intr +  53069  53217  149  0  2   62   47   148 0.966   7.13
 2.04 Term +  54174  54752  579  2  0   99   47   287 0.776  19.20
 2.05 PlyA +  54763  54768    6                               1.05

 3.07 PlyA -  55054  55049    6                               1.05
 3.06 Term -  77146  76557  590  2  2  -33   49   351 0.584  12.79
 3.05 Intr -  79314  79108  207  1  0   79   77   127 0.898   8.83
 3.04 Intr -  80661  80631   31  0  1   72   93     5 0.531  -3.81
 3.03 Intr -  81402  81297  106  2  1   88   19   118 0.460   4.20
 3.02 Intr -  83125  82906  220  2  1   45  106   121 0.807   6.04
 3.01 Init -  88104  87996  109  0  1   71   33   104 0.482   3.63
 3.00 Prom -  88647  88608   40                             -10.45

 4.00 Prom +  89273  89312   40                              -6.55
 4.01 Init +  89807  91495 1689  1  0   38  107  1300 0.750 116.36
 4.02 Term +  94847  95062  216  1  0  103   42   110 0.575   4.06
 4.03 PlyA +  98422  98427    6                               1.05

 5.42 PlyA -  99578  99573    6                               1.05
 5.41 Term - 100134  99998  137  1  2   92   42   137 0.978   6.70
 5.40 Intr - 100336 100253   84  2  0   87   92    40 0.898   3.17
 5.39 Intr - 100715 100484  232  2  1   71  105   410 0.992  37.42
 5.38 Intr - 101323 101219  105  1  0   88   99    69 0.994   7.49
 5.37 Intr - 102570 102359  212  1  2   88   83   273 0.955  24.51
 5.36 Intr - 103956 103887   70  0  1   97   93    88 0.999   8.04
 5.35 Intr - 104641 104586   56  2  2   96   98   127 0.998  12.28
 5.34 Intr - 105408 105298  111  1  0   90  105    69 0.652   8.23
 5.33 Intr - 112931 112739  193  2  1   49   21   106 0.454  -1.86
 5.32 Intr - 113080 112970  111  1  0   73  115   106 0.974  11.46
 5.31 Intr - 115600 115471  130  0  1   53  115   164 0.218  15.38
 5.30 Intr - 125504 125431   74  2  2   98   58   122 0.084   7.49
 5.29 Intr - 126499 126416   84  1  0   84   89    99 0.100   8.70
 5.28 Intr - 132766 132471  296  2  2  108   47   181 0.011  11.80
 5.27 Intr - 133032 132858  175  0  1   12   40    92 0.953  -4.61
 5.26 Intr - 133183 133078  106  0  1   85   90   133 0.999  12.50
 5.25 Intr - 135762 135628  135  2  0   74   55   192 0.999  13.26
 5.24 Intr - 136310 136213   98  2  2   82   97   105 0.999   8.69
 5.23 Intr - 137507 137349  159  2  0   91  100   157 0.588  16.46
 5.22 Intr - 145182 145045  138  0  0   45  111   151 0.986  12.84
 5.21 Intr - 145602 145437  166  2  1  110  109    41 0.957   7.44
 5.20 Intr - 150017 149839  179  2  2   68   79   187 0.998  13.60
 5.19 Intr - 160733 158822 1912  1  1  129   75  1802 0.994 169.37
 5.18 Intr - 167081 161893 5189  2  2   54   99  5378 0.999 521.93
 5.17 Intr - 168037 167964   74  2  2   45  107    32 0.943  -0.99
 5.16 Intr - 170747 170633  115  2  1   63   80   157 0.943  11.50
 5.15 Intr - 183128 183013  116  0  2   90   91    30 0.817   2.75
 5.14 Intr - 185291 184889  403  2  1   91   86   317 0.904  24.88
 5.13 Intr - 186653 186540  114  2  0   60   84   164 0.998  12.92
 5.12 Intr - 187920 187798  123  0  0   56   89   212 0.979  17.96
 5.11 Intr - 190530 190414  117  0  0    8   91   102 0.708   2.24
 5.10 Intr - 193260 193183   78  0  0   60  103    49 0.535   2.43
 5.09 Intr - 195193 195065  129  1  0  109   91   139 0.998  16.27
 5.08 Intr - 197492 197340  153  2  0   65   86    93 0.981   6.15
 5.07 Intr - 198169 198017  153  1  0   46   79   198 0.999  14.05
 5.06 Intr - 199125 199028   98  1  2  112   76    79 0.962   7.91
 5.05 Intr - 206929 206788  142  1  1   89   82    42 0.690   2.71
 5.04 Intr - 217721 217162  560  0  2   36   64   282 0.210  12.22
 5.03 Intr - 217898 217784  115  2  1   35   95    92 0.989   3.70
 5.02 Intr - 218314 218068  247  0  1   77    0   236 0.727  10.24
 5.01 Init - 226414 226281  134  1  2   71   53    88 0.335   3.26

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl + 127802 128074  273  1  0   93   43   186 0.803   9.68
S.002 Term - 132766 132399  368  2  2  108   54   258 0.984  18.18

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:121531849_121758347|GENSCAN_predicted_peptide_1|320_aa
MNLLRRSGKRRRSESGSDSFSGSGGDSSASPQFLSGSVLSPPPGLGRCLKAAAAGECKPT
VPDYERDKLLLANWGLPKAVLEKYHSFGVKKMFEWQAECLLLGQVLEGKNLVYSAPTSAG
KTLVAELLILKRVLEMRKKALFILPFVSVAKEKKYYLQSLFQEVGIKVDGYMGSTSPSRH
FSSLDIAVCTIERANGLINRLIEENKMDLLGMVVVDELHMLGDSHRGYLLELLLTKICYI
TRKSASCQADLASSLSNAVQIVGMSATLPNLELVASWLNAELYHTDFRPVPLLESVKVGN
SIYDSSMKLVREFEPMLQVK

>gi568815595r:121531849_121758347|GENSCAN_predicted_CDS_1|960_bp
atgaatcttctgcgtcggagtgggaaacggcggcgttcagaatcaggctcagattcgttc
tcgggaagcggcggtgacagcagtgccagcccccagttcctctccgggtccgtgctgagc
ccgccgcccggccttggtcgctgcctgaaggccgcagctgcaggagaatgcaagcctaca
gttcctgactacgaaagagacaagctactattggcaaactggggacttcctaaagcagtt
ctggaaaaataccacagttttggtgtaaaaaagatgtttgaatggcaggcagagtgcctt
ttgcttggacaagtcctggaaggaaagaatttagtttattcagctcctacaagtgctggg
aagactcttgtggcagaattacttattttgaagcgggttttggaaatgcggaagaaagct
ttgtttattcttccctttgtttctgtggctaaagagaagaaatactacctccagagtctg
tttcaggaagtaggaataaaagtagacggttatatgggcagcacctctccatcaaggcat
ttctcttcattggatattgcagtctgcacaattgagagagccaatggtctgatcaatcgc
ctcatagaggaaaataagatggatctgttaggaatggtggttgtggatgaattacatatg
ctgggagactctcaccgagggtatctgctggaacttttgctgaccaagatttgctatatt
actcggaaatcagcatcttgtcaggcagatctagccagttctctgtctaatgctgtgcaa
atcgttggcatgagtgctacccttcctaatttggagcttgtggcttcctggttgaatgct
gaactctaccataccgactttcgccctgtaccgcttttggagtcagtaaaagttggaaat
tccatatatgactcttcaatgaaacttgtgagggaatttgagcccatgctacaagtgaag

>gi568815595r:121531849_121758347|GENSCAN_predicted_peptide_2|347_aa
MYVGISPQQQPILQQILQWAPAKCPLIEFNSDTTCLEIASDPAGQELSPTRLSPLQMPVA
STGLPSQISETMRNRMAPENPQPDPFINRNYSNMKVIPPQDPASPTIRRRHKERTSFTHQ
QYEELEALFSQTMFPDRNLQEKLALRLDLPESTVKVWFRNRRFKLKKQQQQQSAKQRNQI
LPSKKNVPTSPRTSPSPYAFSPVISDFYSSLPSQPLDPSNWAWNSTFTESSTSDFQMQDT
QWERLVASVPALYSDAYDIFQIIELYNLPDENEISSSSFHCLYQYLSPTKYQVGGQGSSL SIFAGPAVGLSPAQTWPNMTSQAFEAYSLTDSLEFQKTSNMVDLGFL >gi568815595r:121531849_121758347|GENSCAN_predicted_CDS_2|1044_bp atgtatgtggggatttctccccagcagcaaccaattctgcagcagattctccagtgggca ccagctaagtgtcctctaattgagttcaattccgacactacctgcctggaaatagcatct gatccagcaggtcaagagctcagtcccacaaggctgtccccacttcagatgcctgttgca agcacaggccttccatctcagatttcagaaaccatgaggaacagaatggccccagagaat ccccagccagaccctttcatcaataggaattattccaacatgaaggtgataccaccacag gatccagctagtcccacaatacggagaaggcataaagaacgtacttctttcacccaccaa cagtatgaggagctagaagctctgtttagccagaccatgttcccagatagaaatcttcag gagaaactagctttgagactcgacctaccggagtcaacagtaaaggtttggttcaggaac cggcgattcaaattgaagaagcagcagcagcagcaatcagcaaagcaacgaaaccagatc cttccatccaagaagaatgtgcccacctcccccagaacatcccccagtccttatgctttt tctcctgtgatttcagatttctacagctcccttccatctcagcccttagacccttccaat tgggcatggaactctaccttcactgagagttctaccagtgacttccaaatgcaagatact cagtgggagaggctggtggcctcggttcctgctttgtactctgatgcctatgacatattc caaatcatagaactgtacaatcttcctgatgagaatgagatatccagctcttctttccac tgtctgtatcagtatctctcacccacaaagtaccaggtaggaggacagggttcctctctc agcatctttgctggtccagctgtaggcctatctcctgcacaaacctggcccaatatgaca agccaagcctttgaagcctacagtctaacagatagcctggaattccagaaaacctccaat atggtagacttgggatttctctga >gi568815595r:121531849_121758347|GENSCAN_predicted_peptide_3|420_aa MPCLQGNEISQPWLIDAMWECVARPGFSRDSGDADFLLGLQVKASAPGKCASFTCLKSFP VRIQEPTLGVWVVTLFSCNILLVSLYGWTIRRDSHSEGKQSAQHQLADFGGARLEAVQSC GQGVVFSTFSPGGERSTYPEELTVRASPSWFQTPHFSRPTRLEIPTGVEGQATPSTYDLT WQDLHIHLSTCFTEEEKQHIMGAVKAYADGVAACNQGHDIYPVGDLKRFDWNENSRQNFN TVKAKLGSAPALGIPNLDKPFFLYVAKKQGMVLDVLVQKLGNIPQPVAYFSKQLDHVASA WPGCLWAVAATALLVDEANTLTLGQHLEVLTTHQVQGVLEAKGHQWMMGGCLLKHQALLL DTPDIILEVLQVNPAACLPESTDALNHSCIQVMEQIYSSRPNLRDEPLNNPETEWFTDGS >gi568815595r:121531849_121758347|GENSCAN_predicted_CDS_3|1263_bp atgccctgtcttcagggaaatgaaatttctcagccctggttaattgatgccatgtgggaa tgtgttgccagacctggtttttccagagacagtggggatgcggactttttgctgggatta caggtgaaagcctccgcacctggcaaatgtgcttctttcacttgcctcaaatcctttcct 
gtgagaatccaagaacccactcttggtgtctgggtcgtgaccctcttttcctgcaacatc cttctggtgtccttgtatggatggacaataagaagagactcccactcggaaggaaaacag tctgcgcagcatcagttggctgactttggtggagctcggctggaggcagtacaatcttgc ggacaaggagtggtattcagcaccttcagtcctggtggagaacgcagtacctatcctgaa gagctcactgttagggcttcccccagctggttccagactccccacttctctcgtcccacc cgactagaaatacccacaggtgtggaggggcaggccaccccttcaacatatgatttaacc tggcaagatttgcatatacatctgtctacctgcttcacagaagaagaaaagcagcatatt atgggagctgttaaagcatatgcagatggagtggcagcctgcaaccaggggcatgacatc tatccagtaggagatttaaaacgctttgattggaatgagaatagcagacaaaacttcaat actgtcaaagcgaaattgggctctgctccagccttaggaatccccaacttggataagcca tttttcctttatgtggccaaaaagcaaggtatggtcttggatgttcttgtccaaaaattg gggaatattccacagccagtagcctatttttctaagcaactcgaccatgtggcttctgca tggcctggatgcctttgggctgttgcagctactgctcttctggtagatgaagctaatacg ctaacattaggacagcacctggaggttttgaccacacaccaagtccagggggtcctagaa gctaaagggcaccagtggatgatgggaggatgtttattaaagcatcaggccttactgtta gacacaccagacataattcttgaagttctccaagtaaacccagctgcttgtctgccagag tccacagacgctctaaaccattcctgcatacaggttatggagcaaatttattctagcagg ccaaacttaagagatgagcctcttaacaatcctgagacagaatggtttacagatggaagc tga >gi568815595r:121531849_121758347|GENSCAN_predicted_peptide_4|634_aa MSRHKLAKHLQVCPASVVCCSMEWNRWPNVDSETTLHENIMKETPSEECLDTALALQDQK VLFRSLKMVELFPETREATEEEPTMNGETSVEEMGGAVGGVDIGLVPHGLSATNGEMAEL SQEEREVLAKTKEGMDLVKFGQWENIFSKEHAASALTNSSASCESKNKNDSEKEQISSGH NMVEGEGAPKKKEPQENQKQQDVRTAMETTGLAPWQDGVLERLKTAVDAKDYNMYLVHNG RMLIHFGQMPACTPKERDFVYGKLEAQEVKTVYTFKVPVSYCGKRARLGDAMLSCKPSEH KAVDTSDLGITVEDLPKSDLIKTTLQCALERELKGHVISESRSIDGLFMDFATQTYNFEP EQFSSGTVLADLTAATPGGLHVELHSECVTRRHNKSSSAFTFTCNKFFRRDEFPLHFKNV HTDIQSCLNGWFQHRCPLAYLGCTFVQNHFRPPGQKAKVIYSQELKTFAIKPEVAPELSE GRKNNHLLGHGGKSQNSLTSLPLEILKYIAGFLDSVSLAQLSQVSVLMRNICATLLQERG MVLLQWKKKRYSHGGTSWRVHREIWQFSSLFSKIKSWEFNEVTSMSEHLKSCPFNIVEHK TDPILLTSMCQPREQARESLVSTFRIRPRGRYVS >gi568815595r:121531849_121758347|GENSCAN_predicted_CDS_4|1905_bp atgtcccgccacaaactggccaagcacctgcaggtgtgccccgccagcgtggtctgctgc 
tccatggagtggaaccgctggccaaatgtggactctgaaaccacccttcatgaaaacatc atgaaagagacccccagtgaggagtgtttggacacagccctggccctgcaggatcagaag gtcctcttcagatccttgaaaatggtggaacttttcccagaaactagagaggctactgag gaggaaccaactatgaatggtgaaaccagtgtggaggaaatgggaggagcagtgggtgga gtggatatcggtttggtaccacatggtctgtcagcaactaatggggagatggcagagcta agtcaagaagaacgggaggtgctagccaaaaccaaagaagggatggacctggtcaagttt ggccagtgggaaaatattttcagcaaagagcacgcagcctctgctttaacaaattcatca gcgagctgtgagagcaagaacaagaatgactccgagaaagaacagatttccagtggccat aacatggtagaaggagagggcgctcccaaaaagaaagaaccacaggaaaatcagaagcag caggacgttcgtacagccatggaaaccacagggcttgccccttggcaggatggtgttctg gaaagactgaaaacagctgtggatgcaaaggactataacatgtatctagtgcacaatggg cggatgctgatacactttggtcagatgcctgcttgtacacccaaggagagagactttgtt tatggcaagctggaggctcaggaagttaagactgtttacaccttcaaagttcctgtgagc tactgtggaaagcgagctcgacttggagatgccatgttgagttgtaagccaagtgaacac aaggcagtggatacttcagatttggggatcactgtggaggacctgcccaaatcagatctc atcaagaccaccctccagtgtgctttggaaagagaactcaaaggccacgtcatctctgaa tccagaagcattgatggactgttcatggattttgccacacaaacatacaactttgagcca gaacagttttcctctgggacagtgctggctgacctaaccgctgccaccccagggggactc cacgtggagctccacagcgagtgtgtgaccaggagacacaacaaaagcagctctgccttc actttcacttgcaacaaattcttcaggagggatgagttccccctgcacttcaagaatgtc cacacagacattcagtcatgtctcaatggctggttccagcatcgatgccccctcgcctac ttgggatgtacatttgttcaaaaccatttccgtcccccagggcaaaaggcaaaagtaatc tatagccaggagctcaagacctttgccattaagccggaggttgctccagagctgagcgag ggaaggaagaacaaccatcttttgggtcatggaggaaaaagccagaattctttaaccagc ctgcccctggagattttgaagtacattgctgggttcttggacagcgtcagcctggcccag ctctcccaggtgtctgtgctgatgaggaatatctgtgccactttgttacaagagagagga atggtccttttgcaatggaagaaaaagaggtattcccatggaggcacctcctggagagtc cacagagagatctggcagttcagcagcctcttctccaaaatcaagagctgggagtttaat gaagtcacctccatgtctgagcacctgaagtcctgtcctttcaacattgtagagcacaaa actgacccgattcttttgactagcatgtgtcagccccgtgagcaggcccgagagagctta gtctccacctttagaatcagaccacgaggaagatacgtctcctaa >gi568815595r:121531849_121758347|GENSCAN_predicted_peptide_5|4340_aa 
MEACQKGKMQLKIAASMPQHRGPQLPDHRPVPVRGLLGTRPHSRSSFNNPNGSGCPQSQV SAEAQAAVSVSHNRRRGGIGGSQHNHFTLRTSDLSQNARPLDTSCPSPDLVPGHLLASNH SPHSLFRWLSPWRGAAVEGVAYGLPELTWLELGGGQLEPQAAALLEGPRELPKQLHRSPL LAPPLPDSVLDPALHPPPSPTRPSSSSPQASASPVGLRGHPAQTWTTCSRSVPSPVRFSS GSDGEKRLGQSCPHGLVPGAACLSRPPYAAADVGKNLLGPCAQALNKLGVPKGRRDQKKS KVFRAEITESLEILMQTEACPIILLMLMGYWKDEKRGKIPTIPSTRKSQESESLNFNSPL SRFSSVWDPILEESPNSSVLRINKGPDCLSSFSPRRPLAEMLSRLSGLANVVLHELSGDD DTDQNMRAPLDPELHQESDMEFNNTTQEDVQERLAYAEQLVVELKDIIRQKDVQLQQKDE ALQEERKAADNKIKKLKLHAKAKLTSLNKYIEEMKAQGGTVLPTEPQSEEQLSKHDKSST EEEMEIEKIKHKLQEKEELISTLQAQLTQAQAEQPAQWGSLWWFCLKTSLCLSPLAIDDI LRHSSTEMEEFVMMKQQLQEKEEFISTLQAQLSQTQAEQAAQLSSMQQVVREKDARFETQ VRLHEDELLQLVTQADVETEMQQKLRVLQRKLEEHEESLVGRAQVVDLLQQELTAAEQRN QILSQQLQQMEAEHNTLRNTVETEREESKILLEKMELEVAERKLSFHNLQEEMHHLLEQF EQAGQAQAELESRYSALEQKHKAEMEEKTSHILSLQKTGQELQSACDALKDQNSKLLQDK NEQAVQSAQTIQQLEDQLQQKSKEISQFLNRLPLQQHETASQTSFPDVYNEGTQAVTEEN IASLQKRVVELENEKGALLLSSIELEELKAENEKLSSQITLLEAQNRTGEADREVSEISI VDIANKRSSSAEESGQDVLENTFSQKHKELSVLLLEMKEAQEEIAFLKLQLQGKRAEEAD HEVLDQKEMKQMEGEGIAPIKMKVFLEDTGQDFPLMPNEESSLPAVEKEQASTEHQSRTS EEISLNDAGVELKSTKQDGDKSLSAVPDIGQCHQDELERLKSQILELELNFHKAQEIYEK NLDEKAKEISNLNQLIEEFKKNADNNSSAFTALSEERDQLLSQVKELSMVTELRAQVKQL EMNLAEAERQRRLDYESQTAHDNLLTEQIHSLSIEAKSKDVKIEVLQNELDDVQLQFSEQ STLIRSLQSQLQNKESEVLEGAERVRHISSKVEELSQALSQKELEITKMDQLLLEKKRDV ETLQQTIEEKDQQVTEISFSMTEKMVQLNEEKFSLGVEIKTLKEQLNLLSRAEEAKKEQV EEDNEVSSGLKQNYDEMSPAGQISKEELQHEFDLLKKENEQRKRKLQAALINRKELLQRV SRLEEELANLKDESKKEIPLSETERGEVEEDKENKEYSEKCVTSKCQEIEIYLKQTISEK EVELQHIRKDLEEKLAAEEQFQALVKQMNQTLQDKTNQIDLLQAEISENQAIIQKLITSN TDASDGDSVALVKETVVISPPCTGSSEHWKPELEEKILALEKEKEQLQKKLQEALTSRKA ILKKAQEKERHLREELKQQKDDYNRLQEQFDEQSKENENIGDQLRQLQIQVRESIDGKLP STDQQESCSSTPGLEEPLFKATEQHHTQPVLESNLCPDWPSHSEDASALQGGTSVAQIKA QLKEIEAEKVELELKVSSTTSELTKKSEEVFQLQEQINKQGLEIESLKTVSHEAEVHAES LQQKLESSQLQIAGLEHLRELQPKLDELQKLISKKEEDVSYLSGQLSEKEAALTKIQTEI IEQEDLIKALHTQLEMQAKEHDERIKQLQVELCEMKQKPEEIGEESRAKQQIQRKLQAAL 
ISRKEALKENKSLQEELSLARGTIERLTKSLADVESQVSAQNKEKDTVLGRLALLQEERD KLITEMDRSLLENQSLSSSCESLKLALEGLTEDKEKLVKEIESLKSSKIAESTEWQEKHK ELQKEYEILLQSYENVSNEAERIQHVVEAVRQEKQELYGKLRSTEANKKETEKQLQEAEQ EMEEMKEKMRKFAKSKQQKILELEEENDRLRAEVHPAGDTAKECMETLLSSNASMKEELE RVKMEYETLSKKFQSLMSEKDSLSEEVQDLKHQIEGNVSKQANLEATEKHDNQTNVTEEG TQSIPGETEEQDSLSMSTRPTCSESVPSAKSANPAVSKDFSSHDEINNYLQQIDQLKERI AGLEEEKQKNKEFSQTLENEKNTLLSQISTKDGELKMLQEEVTKMNLLNQQIQEELSRVT KLKETAEEEKDDLEERLMNQLAELNGSIGNYCQDVTDAQIKNELLESEMKNLKKCVSELE EEKQQLVKEKTKVESEIRKEYLEKIQGAQKEPGNKSHAKELQELLKEKQQEVKQLQKDCI RYQEKISALERTVKALEFVQTESQKDLEITKENLAQAVEHRKKAQAELASFKVLLDDTQS EAARVLADNLKLKKELQSNKESVKSQMKQKDEDLERRLEQAEEKHLKEKKNMQEKLDALR REKVHLEETIGEIQVTLNKKDKEVQQLQENLDSTVTQLAAFTKSMSSLQDDRDRVIDEAK KWERKFSDAIQSKEEEIRLKEDNCSVLKDQLRQMSIHMEELKINISRLEHDKQIWESKAQ TEVQLQQKVCDTLQGENKELLSQLEETRHLYHSSQNELAKLESELKSLKDQLTDLSNSLE KCKEQKGNLEGIIRQQEADIQNSKFSYEQLETDLQASRELTSRLHEEINMKEQKIISLLS GKEEAIQVAIAELRQQHDKEIKELENLLSQEEEENIVLEEENKKAVDKTNQLMETLKTIK KENIQQKAQLDSFVKSMSSLQNDRDRIVGDYQQLEERHLSIILEKDQLIQEAAAENNKLK EEIRGLRSHMDDLNSENAKLDAELIQYREDLNQVITIKDSQQKQLLEVQLQQNKELENKY AKLEEKLKESEEANEDLRRSFNALQEEKQDLSKEIESLKVSISQLTRQVTALQEEGTLGL YHAQLKVKEEEVHRLSALFSSSQKRIAELEEELVCVQKEAAKKVGEIEDKLKKELKHLHH DAGIMRNETETAEERVAELARDLVEMEQKLLMVTKENKGLTAQIQSFGRSMSSLQNSRDH ANEELDELKRKYDASLKELAQLKEQGLLNRERDALLSETAFSMNSTEENSLSHLEKLNQQ LLSKDEQLLHLSSQLEDSYNQVQSFSKAMASLQNERDHLWNELEKFRKSEEGKQRSAAQP STSPAEVQSLKKAMSSLQNDRDRLLKELKNLQQQYLQINQEITELHPLKAQLQEYQDKTK AFQIMQEELRQENLSWQHELHQLRMEKSSWEIHERRMKEQYLMAISDKDQQLSHLQNLIR ELRSSSSQTQPLKVQYQRQASPETSASPDGSQNLVYETELLRTQLNDSLKEIHQKELRIQ QLNSNFSQLLEEKNTLSIQLCDTSQSLRENQQHYGDLLNHCAVLEKQVQELQAVSKEKGP LNIDVAPGAPQEKNGVHRKSDPEELREPQQSFSEAQQQLCNTRQEVNELRKLLEEERDQR VAAENALSVAEEQIRRLEHSEWDSSRTPIIGSCGTQEQALLIDLTSNSCRRGWLYWGQGA EEKECGSPATEQRMSLRLEGRGRLEPGGICQVVDSEATFLSEYPFLQWLDPEWRWMEASP AFTLSFTDPSATSSSHLLSNDSCPAHSVFYGPSIDLVVTLWTTPLKTWNSLTSNIRTSIP VEQSSHLQVFSPTLHGKCLQKQRWIRGQMWKSVVGHDVSVSVETQGDDWDTDPDFVNDIS 
EKEQRWGAKTIEGSGRTEHINIHQLRNKVSEEHDVLRKKEMESGPKASHGYGGRFGVERD RMDKSAVGHEYVAEVEKHSSQTDAAKGFGGKYGVERDRADKSCENCPRSPLDGGTCFFVE RQDSGCCEDLTPSQGYLPQQVPWCQCWKPHAELKDSDSNRRRLSNYYSRGFGGRYGVEKD KWDKAALGYDYKGETEKHESQRDYAKGFGGQYGIQKDRVDKSAVGFNEMEAPTTAYKKTT PIEAASSGTRGLKAKFESMAEEKRKREEEEKAQQVARRQQERKAVTKRSPEAPQPVIAME EPAVPAPLPKKISSEAWPPVGTPPSSESEPVRTSREHPVPLLPIRQTLPEDNEEPPALPP RTLEGLQVEEEPVYEAEPEPEPEPEPEPENDYEDVEEMDRHEQEDEPEGDYEEVLEPEDS SFSSALAGSSGCPAGAGAGAVALGISAVAVYDYQGEGSDELSFDPDDVITDIEMVDEGWW RGRCHGHFGLFPANYVKLLE >gi568815595r:121531849_121758347|GENSCAN_predicted_CDS_5|13023_bp atggaagcctgccagaaaggaaaaatgcagctgaagattgctgcatccatgcctcagcac aggggtccccaactcccggaccatcgaccggtaccagtccgtggcctgttaggaaccagg ccacacagcaggagctcattcaacaatcctaatggcagcgggtgcccgcagtctcaagtc tctgccgaagcccaagccgctgtctcggtcagtcacaaccgacgaagaggaggaattgga ggaagccagcataaccacttcacactgagaacatctgacctttcccagaacgcccgccca ctcgacacttcctgtccctccccagatcttgttccggggcacctactcgcgtctaatcac tcccctcattcactctttaggtggctgagtccctggcggggcgcggcggtggaaggtgtc gcgtacgggcttcccgagctgacgtggcttgaattgggaggggggcagctggagcctcag gcggcagcgcttctagaggggccgcgggagctgcccaagcagctacaccggagtcccctc ctcgccccgcctctcccggattcagtcctggatccggcccttcacccgcctccttcacct accaggcctagttctagtagcccacaagcctcggcctctccggtgggccttcggggtcac cctgcgcagacttggaccacttgctcacggtctgtcccctccccggtacgtttctcctct gggagcgacggggaaaagcgcctaggtcagtcctgtccgcacggccttgttcctggagcg gcctgcctaagtcggcctccgtacgcagccgctgacgtcggaaagaacctgctgggcccc tgtgctcaggccctaaacaaacttggtgtcccgaaggggaggcgcgaccaaaagaaatca aaggtctttagggcggagatcacagaatcattggagatattaatgcaaacggaggcctgt ccaataatcctgctaatgctaatgggttactggaaagatgagaagagaggaaagatccca acgatcccgtcaacacgaaaatcccaggaatcagagtcacttaattttaattctcctctc agcagattctcctcagtgtgggaccctatcctggaagagagccccaactcttcagttctg agaattaataaaggtccagactgcctcagctctttcagccctcgtaggcccctggcagaa atgctgagccgattatcaggattagcaaatgttgttttgcatgaattatcaggagatgat gacactgatcagaatatgagggctcccctagaccctgaattacaccaagaatctgacatg gaatttaataatactacacaagaagatgttcaggagcgcctggcttatgcagagcaattg 
gtggtggagctaaaagatattattagacagaaggatgttcaactgcagcagaaagatgaa gctctacaggaagagagaaaagctgctgataacaaaattaaaaaactaaaacttcatgcg aaggccaaattaacttctttgaataaatacatagaagaaatgaaagcacaaggagggact gttctgcctacagaacctcagtcagaggagcaactttccaagcatgacaagagttctaca gaggaagagatggaaatagaaaagataaaacataagctccaggagaaggaggaactaatc agcactttgcaagcccagcttactcaggcacaggcagaacaacctgcacagtggggctct ctgtggtggttctgcctcaagacaagcctctgcctgagccccctggctattgatgacatc cttcgacatagttctacagagatggaagaatttgtaatgatgaagcaacagctccaggag aaggaagaattcattagcactttacaagcccagctcagccagacacaggcagagcaagct gcacagttgagttccatgcagcaggtggtccgagagaaagatgcccgctttgaaacacaa gttcgtcttcatgaagatgagcttcttcagttagtaacccaggcagatgtggaaacagag atgcaacagaaattgagggtgctgcaaaggaagcttgaggaacacgaagaatccttggtg ggccgtgctcaggtcgttgacttgctgcaacaggagctgactgctgctgagcagagaaac cagattctctctcagcagttacagcagatggaagctgagcataatactttgaggaacact gtggaaacagaaagagaggagtccaagattctactggaaaagatggaacttgaagtggca gagagaaaattatccttccataatctgcaggaagaaatgcatcatcttttagaacagttt gagcaagcaggccaagcccaggctgaactagagtctcggtatagtgctttggagcagaag cacaaagcagaaatggaagagaagacctctcatattttgagtcttcaaaagactggacaa gagctgcagtctgcctgtgatgctctaaaggatcaaaattcaaagcttctccaagataag aatgagcaagcagttcagtcagcccagaccattcagcaactggaagatcagctccagcaa aaatccaaagaaattagccaatttctaaatagactgcccttgcaacaacatgaaacagca tctcagacttctttcccagatgtttataatgagggcacacaggcagtcactgaggagaat attgcttctttgcagaagagagtggtagaactagagaatgaaaagggagccttgctcctt agttctatagagctggaggagctgaaagctgagaatgaaaaactgtcttctcagattact ctcctagaggctcagaatagaactggggaggcagacagagaagtcagtgagatcagcatt gttgatattgccaacaagaggagctcttctgctgaggaaagtggacaagatgttctagaa aacacattttctcagaaacataaagaattatcagttttattgttggaaatgaaagaagct caagaggaaattgcatttcttaaattacagctccagggaaaaagggctgaggaagcagat catgaggtccttgaccagaaagaaatgaaacagatggagggtgagggaatagctccaatt aaaatgaaagtatttcttgaagatacagggcaagattttcccttaatgccaaatgaagag agcagtcttccagcagttgaaaaagaacaggcgagcactgaacatcaaagtagaacatct gaggaaatatctttaaatgatgctggagtagaattgaaatcaacaaagcaggatggtgat 
aaatccctttctgctgtaccagatattggtcagtgtcatcaggatgagttggaaaggtta aaaagtcaaattttggagctcgagctaaactttcataaagcacaagaaatctatgagaaa aatttagatgagaaagctaaggaaattagcaacctaaaccagttgattgaggagtttaag aaaaatgctgacaacaacagcagtgcattcactgctttgtctgaagaaagagaccagctt ctctctcaggtgaaggaacttagcatggtaacagaattgagggctcaggtaaagcaactg gaaatgaaccttgcagaagcagaaaggcaaagaagacttgattatgaaagccaaactgcc catgacaacctgctcactgaacagatccatagtctcagcatagaagccaaatctaaagat gtgaaaattgaagttttacagaatgaactggatgatgtgcagcttcagttttctgagcag agtaccctgataagaagcctgcaaagccagctgcaaaataaggaaagtgaagtgcttgag ggggcagaacgtgtaaggcatatctcaagtaaagtggaagaactgtcccaggctctttca cagaaggaacttgaaataacaaaaatggatcagctcttactagagaaaaagagagatgtg gaaaccctccaacaaaccatcgaggagaaggatcaacaagtgacagaaatcagctttagt atgactgagaaaatggttcagcttaatgaagagaagttttctcttggggttgaaattaag actcttaaagaacagctaaatttattatccagagctgaggaagcaaaaaaagagcaggtg gaagaagataatgaagtttcttctggccttaaacaaaattatgatgagatgagcccagca ggacaaataagtaaggaagaacttcagcatgaatttgaccttctgaagaaagaaaatgag cagagaaagagaaagctccaggcagctcttattaacagaaaggagcttctgcaaagagtc agtagattggaagaagaattagccaacttgaaagatgaatctaagaaagaaatcccactc agtgagactgagaggggagaagtggaagaagataaagaaaacaaagaatactcagaaaaa tgtgtgacttctaagtgccaagaaatagaaatttatttaaaacagacaatatctgagaaa gaagtggaactacagcatataaggaaggatttggaagaaaagctggcagctgaagagcaa ttccaggctctggtcaaacagatgaatcagaccttgcaagataaaacaaaccaaatagat ttgctccaagcagaaatcagtgaaaaccaagcaattatccagaagttaatcacaagtaac acggatgcaagtgatggggactccgtagcacttgtaaaggaaacagtggtgataagtcca ccttgtacaggtagtagtgaacactggaaaccagaactagaagaaaagatactggccctt gaaaaagaaaaggagcaacttcaaaagaagctacaggaagccttaacctcccgcaaggca attcttaaaaaggcacaggagaaagaaagacatctcagggaggagctaaagcaacagaaa gatgactataatcgcttgcaagaacagtttgatgagcaaagcaaggaaaatgagaatatt ggagaccagctaaggcaactccagattcaagtaagggaatccatagacggaaaactccca agcacagaccagcaggaatcgtgttcttccactccaggtttagaagaacctttattcaaa gccacagaacagcatcacactcaacctgttttagagtccaacttgtgcccagactggcct tctcattctgaagatgcgagtgctctgcagggcggaacttctgttgcccagattaaggcc 
cagctgaaggaaatagaggctgagaaagtagagttagaattgaaagttagttctacaaca agtgagcttactaaaaaatcagaagaggtatttcagttacaagagcagataaataaacag ggtttagaaatcgagagtctaaagacagtatcccatgaagctgaagtccatgccgaaagc ctgcagcagaaattggaaagcagccaactacaaattgctggcctagaacatctaagagaa ttgcaacctaaactggatgaactgcaaaaactcataagcaaaaaggaagaagacgttagc tacctttctggacaacttagtgagaaagaagcagctctcactaaaatacagacagagata atagaacaagaagatttaattaaggctctgcatacacagctagaaatgcaagccaaagag catgatgagaggataaagcagctacaggtggaactttgtgaaatgaagcaaaaaccagaa gagattggagaagaaagtagagcaaagcaacaaatacaaaggaaactgcaagctgccctt atttcccgaaaagaagcactaaaagaaaacaaaagtctccaagaggaattgtctttggcc agaggtaccattgaacgtctcaccaagtctctggcagatgtggaaagccaagtttctgct caaaataaagaaaaagatacggtcttaggaaggttagctcttcttcaagaagaaagagac aaactcattacagaaatggacaggtctttattggaaaatcagagtctcagcagctcctgt gaaagtctaaaactagctctagagggtcttactgaagacaaggaaaagttagtgaaggaa attgaatctttgaaatcttctaagattgcagaaagtactgagtggcaagagaaacacaag gagctacaaaaagagtatgaaattcttctgcagtcctatgagaatgttagtaatgaagca gaaaggattcagcatgtggtggaagctgtgaggcaagagaaacaagaactgtatggcaag ttaagaagcacagaggcaaacaagaaggagacagaaaagcagttgcaggaagctgagcaa gaaatggaggaaatgaaagaaaagatgagaaagtttgctaaatctaaacagcagaaaatc ctagagctggaagaagagaatgaccggcttagggcagaggtgcaccctgcaggagataca gctaaagagtgtatggaaacacttctttcttccaatgccagcatgaaggaagaacttgaa agggtcaaaatggagtatgaaaccctttctaagaagtttcagtctttaatgtctgagaaa gactctctaagtgaagaggttcaagatttaaagcatcagatagaaggtaatgtatctaaa caagctaacctagaggccaccgagaaacatgataaccaaacgaatgtcactgaagaggga acacagtctataccaggtgagactgaagagcaagactctctgagtatgagcacaagacct acatgttcagaatcggttccatcagcgaagagtgccaaccctgctgtaagtaaggatttc agctcacatgatgaaattaataactacctacagcagattgatcagctcaaagaaagaatt gctggattagaggaggagaagcagaaaaacaaggaatttagccagactttagaaaatgag aaaaataccttactgagtcagatatcaacaaaggatggtgaactaaaaatgcttcaggag gaagtaaccaaaatgaacctgttaaatcagcaaatccaagaagaactctccagagttacc aaactaaaggagacagcagaagaagagaaagatgatttggaagagaggcttatgaatcaa ttagcagaacttaatggaagcattgggaattactgtcaggatgttacagatgcccaaata 
aaaaatgagctattggaatctgaaatgaagaaccttaaaaagtgtgtgagtgaattggaa gaagaaaagcagcagttagtcaaggaaaaaactaaggtggaatcagaaatacgaaaggaa tatttggagaaaatacaaggtgctcagaaagaacccggaaataaaagccatgcaaaggaa cttcaggaactgttaaaagaaaaacaacaagaagtaaagcagctacagaaggactgcatc aggtatcaagagaaaattagtgctctggagagaactgttaaagctctagaatttgttcaa actgaatctcaaaaagatttggaaataaccaaagaaaatctggctcaagcagttgaacac cgcaaaaaggcacaagcagaattagctagcttcaaagtcctgctagatgacactcaaagt gaagcagcaagggtcctagcagacaatctcaagttgaaaaaggaacttcagtcaaataaa gaatcagttaaaagccagatgaaacaaaaggatgaagatcttgagcgaagactggaacag gcagaagagaagcacctgaaagagaagaagaatatgcaagagaaactggatgctttgcgc agagaaaaagtccacttggaagagacaattggagagattcaggttactttgaacaagaaa gacaaggaagttcagcaacttcaggaaaacttggacagtactgtgacccagcttgcagcc tttactaagagcatgtcttccctccaggatgatcgtgacagggtgatagatgaagctaag aaatgggagaggaagtttagtgatgcgattcaaagcaaagaagaagaaattagactcaaa gaagataattgcagtgttctaaaggatcaacttagacagatgtccatccatatggaagaa ttaaagattaacatttccaggcttgaacatgacaagcagatttgggagtccaaggcccag acagaggtccagcttcagcagaaggtctgtgatactctacagggggaaaacaaagaactt ttgtcccagctagaagagacacgccacctataccacagttctcagaatgaattagctaag ttggaatcagaacttaagagtctcaaagaccagttgactgatttaagtaactctttagaa aaatgtaaggaacaaaaaggaaacttggaagggatcataaggcagcaagaggctgatatt caaaattctaagttcagttatgaacaactggagactgatcttcaggcctccagagaactg accagtaggctgcatgaagaaataaatatgaaagagcaaaagattataagcctgctttct ggcaaggaagaggcaatccaagtagctattgctgaactgcgtcagcaacatgataaagaa attaaagagctggaaaacctgctgtcccaggaggaagaggagaatattgttttagaagag gagaacaaaaaggctgttgataaaaccaatcagcttatggaaacactgaaaaccatcaaa aaggaaaacattcagcaaaaggcacagttggattcctttgttaaatccatgtcttctctc caaaatgatcgagaccgcatagtgggtgactatcaacagctggaagagcgacatctctct ataatcttggaaaaagaccaactcatccaagaggctgctgcagagaataataagcttaaa gaagaaatacgaggcttgagaagtcatatggatgatctcaattctgagaatgccaagcta gatgcagaactgatccaatatagagaagacctgaaccaagtgataacaataaaggacagc caacaaaagcagcttcttgaagttcaacttcagcaaaataaggagctggaaaataaatat gctaaattagaagaaaagctgaaggaatctgaggaagcaaatgaggatctgcggaggtcc 
tttaatgccctacaagaagagaaacaagatttatctaaagagattgagagtttgaaagta tctatatcccagctaacaagacaagtaacagccttgcaagaagaaggtactttaggactc tatcatgcccagttaaaagtaaaagaagaagaggtacacaggttaagtgctttgttttcc tcctctcaaaagagaattgcagaactggaagaagaattggtttgtgttcaaaaggaagct gccaagaaggtaggtgaaattgaagataaactgaagaaagaattaaagcatcttcatcat gatgcagggataatgagaaatgaaactgaaacagcagaagagagagtggcagagctagca agagatttggtggagatggaacagaaattactcatggtcaccaaagaaaataaaggtctc acagcacaaattcagtcttttggaaggtctatgagttccttgcaaaatagtagagatcat gccaatgaggaacttgatgaactgaaaaggaaatatgatgccagtctgaaggaattggca cagttgaaagaacagggactcttaaacagagagagagatgctcttctttctgaaaccgcc ttttcaatgaactccactgaggagaatagcttgtctcaccttgagaaacttaaccaacag ctcctatccaaagatgagcaattgcttcacttgtcctcacaactagaagattcttataac caagtgcagtccttttccaaggctatggccagtctgcagaatgagagagatcacctgtgg aatgagctggagaaatttcgaaagtcagaggaagggaagcagaggtctgcagctcagcct tccaccagcccagctgaagtacagagtttaaaaaaagctatgtcttcactccaaaatgac agagacagactactgaaggaattgaagaatctgcagcagcaatacttacagattaatcaa gagatcactgagttacatccactgaaggctcaacttcaggagtatcaagataagacaaaa gcatttcagattatgcaagaagagctcaggcaggaaaacctctcctggcagcatgagctg catcagctcaggatggagaagagttcctgggaaatacatgagaggagaatgaaggaacag taccttatggctatctcagataaagatcagcagctcagtcatctgcagaatcttataagg gaattgaggtcttcttcctcccagactcagcctctcaaagtgcaataccaaagacaggca tccccagagacatcagcttccccagatgggtcacaaaatctggtttatgagacagaactt ctcaggacccagctcaatgacagcttaaaggaaattcaccaaaaggagttaagaattcag caactgaacagcaacttctctcagctactggaagagaaaaacaccctttccattcagctc tgcgataccagtcagagtcttcgtgagaaccagcagcactatggtgaccttttaaatcac tgtgcagtcttggagaagcaggttcaagagctgcaggcggtgagtaaagagaaggggcca ctaaatatagatgttgctccaggagctccccaggaaaagaatggagttcacagaaagagt gaccctgaggaactaagggaaccgcagcaaagcttttctgaagctcagcagcagctatgc aacaccagacaggaagtgaatgaattaaggaagctgctggaagaagaacgagaccaaaga gtggctgctgagaatgctctctctgtggccgaggagcagatcagacggttagagcacagt gaatgggactcttcccggactcctatcattggctcctgtggcactcaggagcaggcactg ttaatagatcttacaagcaacagttgtcgaaggggctggctatactgggggcaaggagct 
gaggagaaggaatgtggaagtccagccactgagcagcgaatgagtctaaggctggaaggg cgtggaagattagaacctggtggaatttgccaagttgttgactctgaggcaactttcctc tcagaatacccctttctacagtggttggacccggagtggcgttggatggaagcgagtcct gcgttcactctgtcattcacggacccgagtgccacttctagcagccatctactttctaat gattcatgtcctgctcattctgtgttttacgggccatctatagacttagttgttactctt tggaccactcccctcaaaacttggaattctctcacctctaacatcagaacatcaattcca gtggaacagtcttcccatttacaggtcttctctccaactcttcacggaaagtgcctgcaa aaacagaggtggatacgaggacagatgtggaagtctgtagtgggccatgatgtgtctgtt tccgtggagacccagggtgatgattgggacacagatcctgactttgtgaatgacatctct gaaaaggagcaacgatggggagccaagaccatcgaggggtctggacgcacagaacacatc aacatccaccagctgaggaacaaagtatcagaggagcatgatgttctcaggaagaaagag atggagtcagggcccaaagcatcccatggctatggaggtcggtttggagtagaaagagac cgaatggacaagagtgcagtgggccatgagtatgttgccgaggtggagaagcactcttct cagacggatgctgccaaaggctttgggggcaagtacggagttgagagggacagggcagac aagtcctgtgaaaattgcccacgatcacccctggatggaggcacatgcttctttgttgaa agacaggacagtggatgctgtgaggacctgacaccgtcacagggatacctccctcagcaa gttccttggtgccaatgttggaagccacatgctgagctgaaagacagtgattcaaatagg agaagactctcaaactattactctcgtggctttggtggccggtacggggtggagaaggat aaatgggacaaagcagctctgggatatgactacaagggagagacggagaaacacgagtcc cagagagattatgccaagggctttggtggccagtatggaatccagaaggaccgagtggat aagagcgctgtcggcttcaatgaaatggaggccccgaccacagcttataagaagacgacg cccatagaagccgcttctagtggtacccgtgggctgaaggcgaaatttgagtccatggct gaggagaagaggaagcgagaggaagaggagaaggcacagcaggtggccaggaggcaacag gagcgaaaggctgtgacaaagaggagccctgaggctccacagccagtgatagctatggaa gagccagcagtaccggccccactgcccaagaaaatctcctcagaggcctggcctccagtt gggactcctccatcatcagagtctgagcctgtgagaaccagcagggaacacccagtgccc ttgctgcccattaggcagactctcccggaggacaatgaggagcccccagctctgccccct aggactctggaaggcctccaggtggaggaagagccagtgtacgaagcagagcctgagcct gagcccgagcctgagcccgagcctgagaatgactatgaggacgttgaggagatggacagg catgagcaggaggatgaaccagagggggactatgaggaggtgctcgagcctgaagattct tctttttcttctgctctggctggatcatcaggctgcccggctggggctggggctggggct gtggctctggggatctcagctgtggctgtatatgattaccaaggagagggaagtgatgag 
ctttcctttgatccggacgacgtaatcactgacattgagatggtggacgagggctggtgg cggggacgttgccatggccactttggactcttccctgcaaattatgtcaagcttctggag tga