GENSCAN 1.0	Date run: 8-Nov-116	Time: 03:49:09

Sequence gi568815591r:73437301_73657578 : 220278 bp : 48.12% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.19 PlyA -     55     50    6                               1.05
 1.18 Term -   5253   4896  358  2  1   95   44   367 0.506  26.98
 1.17 Intr -   5528   5425  104  2  2  129  109   186 0.999  23.77
 1.16 Intr -   6829   6684  146  2  2   71   85   114 0.927   9.40
 1.15 Intr -  10079   9964  116  1  2   88   95   279 0.997  28.69
 1.14 Intr -  12389  12242  148  0  1  109   94    93 0.989  11.29
 1.13 Intr -  13694  13547  148  2  1   67   71   118 0.985   7.81
 1.12 Intr -  22418  22236  183  2  0   81   63   137 0.891  10.48
 1.11 Intr -  25799  25622  178  1  1   94   57   125 0.997   9.92
 1.10 Intr -  28265  28139  127  0  1  104   94    29 0.706   4.84
 1.09 Intr -  32350  32217  134  0  2   48   73   104 0.937   5.19
 1.08 Intr -  33183  33045  139  1  1   62   68   178 0.999  12.72
 1.07 Intr -  41269  39568 1702  1  1   67   93  1363 0.799 122.02
 1.06 Intr -  52091  51894  198  2  0   77  116   147 0.988  15.95
 1.05 Intr -  55621  55500  122  2  2   36   96    11 0.765  -3.09
 1.04 Intr -  61398  61197  202  0  1  110   30   253 0.726  20.56
 1.03 Intr -  71171  71027  145  1  1   27  105    83 0.139   4.18
 1.02 Intr -  73552  73436  117  0  0   28   87    91 0.128   2.58
 1.01 Init -  84633  84479  155  0  2  116   37   162 0.079  13.38
 1.00 Prom -  97265  97226   40                              -2.76

 2.08 PlyA -  98439  98434    6                              -0.45
 2.07 Term - 100090  99998   93  1  0  100   42   140 0.999   8.43
 2.06 Intr - 100713 100634   80  1  2   96  115    98 0.791  12.57
 2.05 Intr - 102752 102582  171  0  0  103   96   216 0.967  23.81
 2.04 Intr - 106344 106248   97  0  1  130   96    26 0.723   7.18
 2.03 Intr - 114942 114867   76  2  1   45  107    53 0.720   2.42
 2.02 Intr - 120309 120187  123  2  0   -9   77   247 0.175  13.60
 2.01 Init - 123395 123298   98  2  2   70   76   119 0.744   8.68
 2.00 Prom - 125777 125738   40                              -5.46

 3.08 PlyA - 125870 125865    6                               1.05
 3.07 Term - 133672 133207  466  0  1  112   41   648 0.999  56.99
 3.06 Intr - 134041 133889  153  0  0   23  105   151 0.997   9.49
 3.05 Intr - 135670 135544  127  2  1  104   80   136 0.998  14.14
 3.04 Intr - 136171 136020  152  0  2   91  110   324 0.852  34.71
 3.03 Intr - 136822 136638  185  1  2   94   75   331 0.999  30.99
 3.02 Intr - 137213 137083  131  0  2   45  102   182 0.975  15.71
 3.01 Init - 141249 141120  130  0  1  108   95   190 0.739  20.01
 3.00 Prom - 153914 153875   40                              -5.76

 4.17 PlyA - 155491 155486    6                               1.05
 4.16 Term - 156683 156565  119  0  2   86   37   128 0.991   6.20
 4.15 Intr - 157126 156974  153  2  0  -48   70   195 0.649   4.14
 4.14 Intr - 158460 158288  173  2  2   94   76   196 0.683  18.59
 4.13 Intr - 158669 158542  128  2  2   98   21   279 0.999  21.68
 4.12 Intr - 159020 158853  168  2  0   67  105   163 0.530  16.04
 4.11 Intr - 159179 159064  116  0  2   82   86    98 0.727   9.17
 4.10 Intr - 159489 159339  151  0  1   99   45    38 0.839   0.34
 4.09 Intr - 160413 160276  138  0  0   93   45    51 0.790   1.96
 4.08 Intr - 162395 162226  170  0  2   74   99   111 0.721  10.47
 4.07 Intr - 168468 168388   81  1  0   92   75   139 0.962  12.61
 4.06 Intr - 168811 168610  202  1  1   88   77   167 0.985  14.46
 4.05 Intr - 169718 169674   45  2  0   83   94    20 0.619   0.71
 4.04 Intr - 170120 169836  285  2  0  102   88   280 0.975  26.94
 4.03 Intr - 170372 170290   83  0  2   83   55   102 0.994   5.76
 4.02 Intr - 178877 178771  107  1  2  106   65   123 0.993  11.66
 4.01 Init - 187192 186900  293  1  2  103   94   499 0.999  46.62
 4.00 Prom - 195801 195762   40                              -6.86

 5.00 Prom + 204757 204796   40                              -2.16
 5.01 Init + 210765 210927  163  2  1  114   80   153 0.779  14.99

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 120278 120187   92  2  2  103   77   216 0.815  21.96

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:73437301_73657578|GENSCAN_predicted_peptide_1|1473_aa
MAPLLGRKPFPLVKPLPGEEPLFTIPHTQEAFRTREYPFPAARAGLGRAGPGEYEARLER
YSERIWTCKSTGSSQLTHKEAWEEEQEVAELLKEEFPAWYEKLVLEMVHHNTASLEKLVD
TAWLEIMTKYAVGEECDFEVGKEKMLKVKIVKIHPLEKVDEEATEKKSDGACDSPSSDKE
NSSQIAQDHQKKETVVKEDEGRRESINDRARRSPRKLPTSLKKGERKWAPPKFLPHKYDV
KLQNEDKIISNVPADSLIRTERPPNKEIVRYFIRHNALRAGTGENAPWVVEDELVKKYSL
PSKFSDFLLDPYKYMTLNPSTKRKNTGSPDRKPSKKSKTDNSSLSSPLNPKLWCHVHLKK
SLSGSPLKVKNSKNSKSPEEHLEEMMKMMSPNKLHTNFHIPKKGPPAKKPGKHSDKPLKA
KGRSKGILNGQKSTGNSKSPKKGLKTPKTKMKQMTLLDMAKGTQKMTRAPRNSGGTPRTS
SKPHKHLPPAALHLIAYYKENKDREDKRSALSCVISKTARLLSSEDRARLPEELRSLVQK
RYELLEHKKRWASMSEEQRKEYLKKKREELKKKLKEKAKERREKEMLERLEKQKRYEDQE
LTGKNLPAFRLVDTPEGLPNTLFGDVAMVVEFLSCYSGLLLPDAQYPITAVSLMEALSAD
KGGFLYLNRVLVILLQTLLQDEIAEDYGELGMKLSEIPLTLHSVSELVRLCLRRSDVQEE
SEGSDTDDNKDSAAFEDNEVQDEFLEKLETSEFFELTSEEKLQILTALCHRILMTYSVQD
HMETRQQMSAELWKERLAVLKEENDKKRAEKQKRKEMEAKNKENGKVENGLGKTDRKKEI
VKFEPQVDTEAEDMISAVKSRRLLAIQAKKEREIQEREMKVKLERQAEEERIRKHKAAAE
KAFQEGIAKAKLVMRRTPIGTDRNHNRYWLFSDEVPGLFIEKGWVHDSIDYRFNHHCKDH
TVSGDEDYCPRKLSGLFCPYRFLCDSQKELDELLNCLHPQGIRESQLKERLEKRYQDIIH
SIHLARKPNLGLKSCDGNQELLNFLRSDLIEVATRLQKGGLGYVEETSEFEARVISLEKL
KDFGECVIALQASVIKKFLQGFMAPKQKRRKLQSEDSAKTEEVDEEKKMVEEAKVASALE
KWKTAIREAQTFSRMHVLLGMLDACIKWDMSAENARCKVCRKKGEDDKLILCDECNKAFH
LFCLRPALYEVPDGEWQCPACQPATARRNSRGRNYTEESASEDSEDDESDEEEEEEEEEE
EEEDYEVAGLRLRPRKTIRGKHSVIPPAARSGRRPGKKPHSTRRSQPKAPPVDDAEVDEL
VLQTKRSSRRQSLELQKCEEILHKIVKYRFSWPFREPVTRDEAEDYYDVITHPMDFQTVQ
NKCSCGSYRSVQEFLTDMKQVFTNAEVYNCRGSHVLSCMVKTEQCLVALLHKHLPGHPYV
RRKRKKFPDRLAEDEGDSEPEAVGQSRGRRQKK

>gi568815591r:73437301_73657578|GENSCAN_predicted_CDS_1|4422_bp
atggcgccgctcctgggccgcaagcccttcccgctggtgaagccgttgcccggagaggag
ccgctcttcaccatcccgcacactcaggaggccttccgcacccgggagtatccttttcca
gccgcgcgggccgggctgggccgggctgggccgggagagtatgaagcccgcttggaaagg
tacagtgagcgcatttggacgtgcaagagtactggaagcagtcagctaacacacaaggaa
gcctgggaggaagaacaggaagttgctgagcttttgaaggaggagtttcctgcctggtat
gagaagcttgttctggaaatggttcaccataacacagcctccttagagaagttagtagat
actgcttggttggagatcatgaccaaatatgctgtgggagaagagtgtgacttcgaggtt
gggaaggagaaaatgctcaaggtgaagattgtgaagattcatcctttggagaaagtggat
gaagaggccactgagaagaaatctgatggtgcctgtgattctccatcaagtgacaaagag
aactccagtcagattgctcaggaccatcagaagaaggagacagttgtgaaagaggatgaa
ggaaggagagagagtattaatgacagagcacgtagatcgccacgaaaacttcctacttca
ttaaaaaaaggagaaaggaaatgggctcctccaaaatttctgcctcacaaatatgatgtg
aaactacaaaatgaagataagatcatcagtaacgtgccagcagacagcttgattcgtaca
gagcgcccaccaaataaggagatagttcgatactttatacggcataatgcattacgagct
ggtactggtgaaaatgcaccttgggtcgtagaagatgaattggtgaagaaatactctctg
cccagcaagttcagtgactttttacttgatccatacaagtatatgactctcaacccttct
actaagaggaagaatactggatccccagacaggaagccctcaaagaaatccaagacagac
aactcttctcttagttcaccactaaatcctaagttatggtgtcacgtacacttgaagaag
tcattgagtggctcgccactcaaagtgaagaactcaaagaattccaaatctcctgaagaa
catctagaagaaatgatgaagatgatgtcgcccaataagctgcacactaactttcacatt
cctaaaaaaggcccacctgccaagaaaccagggaagcacagtgacaagcctttgaaggca
aagggcagaagcaaaggcatcctgaatggacagaaatccacagggaattccaaatctccc
aaaaaaggactgaagactcctaaaaccaaaatgaagcagatgactttgttggatatggcc
aaaggcacgcagaagatgacacgagccccacggaattctgggggtacacctaggacctct
agtaaacctcataaacatctgcctcctgcagccctacacctcattgcatactacaaagaa
aacaaagacagggaggacaagaggagcgccctgtcctgtgttatctccaaaacagctcgt
cttctctctagtgaagatagagctcgtctcccagaagaattgcgaagtcttgttcaaaaa
cgctatgaacttctagagcacaaaaagaggtgggcttctatgtctgaagaacaacggaaa
gaatatttgaaaaagaaacgggaggagctgaaaaagaagttgaaggaaaaagccaaagaa
cgaagagagaaagaaatgcttgagagattagaaaaacagaagcggtatgaggaccaagag
ttaactggcaaaaaccttccagcattcagattggtggatacccctgaagggctgcccaac
acgctgtttggggatgtggccatggtggtggaattcttgagctgttattctgggctactt
ttaccagatgctcagtatcctattactgctgtgtcccttatggaagccttgagtgcagat
aagggtggctttttataccttaacagggtgttggtcatcctcttacagaccctcctacaa
gatgagatagcagaagactatggtgaattgggaatgaagctgtcggaaatccccttgact
ctgcattctgtttcagagctggtgcggctctgcttgcgcagatctgatgttcaggaggaa
agcgagggctcagacacagatgacaataaagattcagctgcatttgaggataatgaggta
caagatgagttcctagaaaagctggagacctctgaattttttgagctgacgtcagaggag
aagctacagatcttgacagcactgtgccaccggatcctcatgacatactcagtgcaagac
cacatggagaccagacagcagatgtctgcagagttgtggaaggaacggcttgctgtgttg
aaggaagaaaatgataagaagagagcagagaaacagaaacggaaagaaatggaagccaaa
aataaagaaaatggaaaagttgagaatgggttaggcaaaactgataggaaaaaagaaatt
gtgaagtttgagccccaagtagatacagaagctgaagacatgattagtgctgtgaagagc
agaaggttgcttgccattcaagctaagaaggaacgggaaatccaggaaagagaaatgaaa
gtgaaactggaacgccaagctgaagaagaacgaatacggaagcacaaagcagctgctgag
aaagctttccaggaagggattgccaaggccaaactagtcatgcgcaggactcctattggc
acagatcgaaaccataatagatactggctcttctcagatgaagttccaggattattcatt
gaaaaaggctgggtacatgacagcattgactaccgattcaaccatcactgcaaagaccac
acagtctctggtgatgaggattactgtcctcgcaagttatcaggtctcttttgcccctac
aggtttttatgtgatagtcaaaaggagctggatgagttgctaaactgtcttcaccctcag
ggaataagagaaagtcaacttaaagagagactagagaagaggtaccaggacattattcac
tctattcatctagcacggaagccaaatttgggtctaaaatcttgtgatggcaaccaggag
cttttaaacttccttcgtagtgatctcattgaagttgcaacaaggttacaaaaaggagga
cttggatatgtggaagaaacatcagaatttgaagcccgggtcatttcattagagaaattg
aaggattttggtgagtgtgtgattgcccttcaggccagtgtcataaagaaatttctccaa
ggcttcatggctcccaagcaaaagagaagaaaactccaaagtgaagattcagcaaaaact
gaggaagtggatgaagagaagaaaatggtagaggaagcaaaggttgcatctgcactggag
aaatggaagacagcaatccgggaagctcagactttctccaggatgcacgtgctgcttggg
atgcttgatgcctgtatcaagtgggatatgtccgcagaaaatgctaggtgcaaagtttgt
cgaaagaaaggtgaggatgacaaattgatcttgtgtgatgagtgtaataaagccttccac
ctgttttgtctgaggccggccctctatgaagtaccagatggtgagtggcagtgcccagct
tgccagcccgctactgccaggcgcaactcccgtggcaggaactatactgaagagtctgct
tctgaggacagtgaagatgatgagagtgatgaagaggaggaggaggaagaagaggaggag
gaggaagaagattatgaggtggctggtttgcgattgagacctcgaaagaccatccggggc
aagcacagcgtcatcccccctgcagcaaggtcaggccggcgcccgggtaagaagccacac
tctaccaggaggtctcagcccaaggcaccacctgtggatgatgctgaggtggatgagctg
gtgcttcagaccaagcggagctcccggaggcaaagcctggagctgcagaagtgtgaagag
atcctccacaagatcgtgaagtaccgcttcagctggcccttcagggagcctgtgaccaga
gatgaggccgaggactactatgatgtgatcacgcaccccatggactttcagacagtgcag
aacaaatgttcctgtgggagctaccgctctgtgcaggagtttcttactgacatgaagcaa
gtgtttaccaatgctgaggtttacaactgccgtggcagccatgtgctaagctgcatggtg
aagacagaacagtgtctagtggctctgttgcataaacaccttcctggccacccatatgtc
cgcaggaagcgcaagaagtttcctgataggcttgctgaagatgaaggggacagtgagcca
gaggccgttggacagtccaggggacgaagacagaagaagtag

>gi568815591r:73437301_73657578|GENSCAN_predicted_peptide_2|245_aa
MCAGNLCSWEERWAVDEKEATDKKPHVGGFGQGAPAPAAPAAAMSGRSVRAETRSRAKDD
IKKVMAAIEKVRKWEKKWVTVGDTSLRIFKWVPVTDSKEKEKSKSNSSAAREPNGFPSDA
SANSSLLLEFQDENSNQSSVSDVYQLKVDSSTNSSPSPQQSESLSPAHTSDFRTDDSQPP
TLGQEILEEPSLPSSEVADEPPTLTKEEPVPLETQVVEEEEDSGAPPLKRFCVDQPTVPQ
TASES

>gi568815591r:73437301_73657578|GENSCAN_predicted_CDS_2|738_bp
atgtgtgcagggaacctgtgcagctgggaggagcgctgggctgtggacgagaaggaagct
acagacaagaaacctcacgtgggaggatttggacaaggtgccccggcccccgccgctccc
gccgccgccatgtcgggccggtcggtccgggcggagacccgcagccgggccaaggacgac
atcaagaaggtgatggcggccatcgagaaagtgcggaaatgggagaagaagtgggtgact
gtgggtgacacgtccctgaggatatttaagtgggttcctgtgacagacagcaaggagaaa
gaaaagtcaaaatcgaacagttcagcagcccgagaacctaatggctttccttctgatgcc
tcagccaattcctctctccttcttgaattccaggacgaaaacagcaaccagagttccgtg
tctgacgtctatcagcttaaggtggacagcagcaccaactcaagccccagcccccagcag
agtgagtccctgagcccagcacacacctccgacttccgcacggatgactcccagccccca
acgctgggccaggagatcctggaggagccctccctgccctcctcggaagttgctgatgaa
cctcctaccctcaccaaggaagaaccagttccactagagacacaggtcgttgaggaagag
gaagactcaggtgccccgcccctgaagcgcttctgtgtggaccaacccacagtgccgcag
acggcgtcagaaagctag

>gi568815591r:73437301_73657578|GENSCAN_predicted_peptide_3|447_aa
MELSQMSELMGLSVLLGLLALMATAAVARGWLRAGEERSGRPACQKANGFPPDKSSGSKK
QKQYQRIRKEKPQQHNFTHRLLAAALKSHSGNISCMDFSSNGKYLATCADDRTIRIWSTK
DFLQREHRSMRANVELDHATLVRFSPDCRAFIVWLANGDTLRVFKMTKREDGGYTFTATP
EDFPKKHKAPVIDIGIANTGKFIMTASSDTTVLIWSLKGQVLSTINTNQMNNTHAAVSPC
GRFVASCGFTPDVKVWEVCFGKKGEFQEVVRAFELKGHSAAVHSFAFSNDSRRMASVSKD
GTWKLWDTDVEYKKKQDPYLLKTGRFEEAAGAAPCRLALSPNAQVLALASGSSIHLYNTR
RGEKEECFERVHGECIANLSFDITGRFLASCGDRAVRLFHNTPGHRAMVEEMQGHLKRAS
NESTRQRLQQQLTQAQETLKSLGALKK

>gi568815591r:73437301_73657578|GENSCAN_predicted_CDS_3|1344_bp
atggagctctcgcagatgtcggagctcatggggctgtcggtgttgcttgggctgctggcc
ctgatggcgacggcggcggtagcgcgggggtggctgcgcgcgggggaggagaggagcggc
cggcccgcctgccaaaaagcaaatggatttccacctgacaaatcttcgggatccaagaag
cagaaacaatatcagcggattcggaaggagaagcctcaacaacacaacttcacccaccgc
ctcctggctgcagctctgaagagccacagcgggaacatatcttgcatggactttagcagc
aatggcaaatacctggctacctgtgcagatgatcgcaccatccgcatctggagcaccaag
gacttcctgcagcgagagcaccgcagcatgagagccaacgtggagctggaccacgccacc
ctggtgcgcttcagccctgactgcagagccttcatcgtctggctggccaacggggacacc
ctccgtgtcttcaagatgaccaagcgggaggatgggggctacaccttcacagccacccca
gaggacttccctaaaaagcacaaggcgcctgtcatcgacattggcattgctaacacaggg
aagtttatcatgactgcctccagtgacaccactgtcctcatctggagcctgaagggtcaa
gtgctgtctaccatcaacaccaaccagatgaacaacacacacgctgctgtatctccctgt
ggcagatttgtagcctcgtgtggcttcaccccagatgtgaaggtttgggaagtctgcttt
ggaaagaagggggagttccaggaggtggtgcgagccttcgaactaaagggccactccgcg
gctgtgcactcgtttgctttctccaacgactcacggaggatggcttctgtctccaaggat
ggtacatggaaactgtgggacacagatgtggaatacaagaagaagcaggacccctacttg
ctgaagacaggccgctttgaagaggcggcgggtgccgcgccgtgccgcctggccctctcc
cccaacgcccaggtcttggccttggccagtggcagtagtattcatctctacaatacccgg
cggggcgagaaggaggagtgctttgagcgggtccatggcgagtgtatcgccaacttgtcc
tttgacatcactggccgctttctggcctcctgtggggaccgggcggtgcggctgtttcac
aacactcctggccaccgagccatggtggaggagatgcagggccacctgaagcgggcctcc
aacgagagcacccgccagaggctgcagcagcagctgacccaggcccaagagaccctgaag
agcctgggtgccctgaagaagtga

>gi568815591r:73437301_73657578|GENSCAN_predicted_peptide_4|803_aa
MAGALAGLAAGLQVPRVAPSPDSDSDTDSEDPSLRRSAGGLLRSQVIHSGHFMVSSPHSD
SLPRRRDQEGSVGPSDFGPRSIDPTLTRLFECLSLAYSGKLVSPKWKNFKGLKLLCRDKI
RLNNAIWRAWYIQYVKRRKSPVCGFVTPLQGPEADAHRKPEAVVLEGNYWKRRIEVVMRE
YHKWRIYYKKRVSGGGPGRPQSFPPAAAGYRPPRKIPGKGILTPELAPLGPSIQSRADSA
TVWPQRLLAASLPRGRLRKPSREDDLLAPKQAEGRWPPPEQWCKQLFSSVVPVLLGDPEE
EPGGRQLLDLNCFLSDISDTLFTMTQSGPSPLQLPPEDAYVGNADMIQPDLTPLQPSLDD
FMDISDFFTNSRLPQPPMPSNFPEPPSFSPVVDSLFSSGTLGPEVPPASSAMTHLSGHSR
LQARNSCPGPLDSSAFLSSDFLLPEDPKPRLPPPPVPPPLLHYPPPAKQETVPEFPCTFL
PPTPAPTPPRPPPGPATLAPSRPLLVPKAERLSPPAPSGSERRLSGDLSSMPGPGTLSVR
VSPPQPILSRGRPDSNKALLGSFLGSPNSLLPETENRRITHISAEQKRRFNIKLGFDTLH
GLVSTLSAQPSLKVSKATTLQKTAEYILMLQQERAGLQEEAQQLRDEIEELNAAINLCQQ
QLPATGVPITHQRFDQMRDMFDDYVRTRTLHNWKFWVVSSLKPMAGGLQGLWQGSSLTWA
QFSILIRPLFESFNGMVSTASVHTLRQTSLAWLDQYCSLPALRPTVLNSLRQLGTSTSIL
TDPGRIPEQATRAVTEGTLGKPL

>gi568815591r:73437301_73657578|GENSCAN_predicted_CDS_4|2412_bp
atggccggcgcgctggcaggtctggccgcgggcttgcaggtcccgcgggtcgcgcccagc
ccagactcggactcggacacagactcggaggacccgagtctccggcgcagcgcgggcggc
ttgctccgctcgcaggtcatccacagcggtcacttcatggtgtcgtcgccgcacagcgac
tcgctgccccggcggcgcgaccaggaggggtccgtggggccctccgacttcgggccgcgc
agtatcgaccccacactcacacgcctcttcgagtgcttgagcctggcctacagtggcaag
ctggtgtctcccaagtggaagaatttcaaaggcctcaagctgctctgcagagacaagatc
cgcctgaacaacgccatctggagggcctggtatatccagtatgtgaagcggaggaagagc
cccgtgtgtggcttcgtgacccccctgcaggggcctgaggctgatgcgcaccggaagccg
gaggccgtggtcctggaggggaactactggaagcggcgcatcgaggtggtgatgcgggaa
taccacaagtggcgcatctactacaagaagcgggtcagtgggggagggccagggaggccc
cagagctttcctcctgcggctgccggctaccgcccgcctcggaagatccctggaaagggg
atcctgacccccgagcttgcgcccctcgggccctccattcagtcccgggccgacagcgcc
accgtgtggccacagcgtctcctagcggcctccttacctaggggtcggctccgtaagccc
agcagggaagatgacctcctggcccctaagcaggcggaaggcaggtggccgccgccggag
caatggtgcaaacagctcttctccagtgtggtccccgtgctgctgggggacccagaggag
gagccgggtgggcggcagctcctggacctcaattgctttttgtccgacatctcagacact
ctcttcaccatgactcagtccggcccttcgcccctgcagctgccgcctgaggatgcctac
gtcggcaatgctgacatgatccagccggacctgacgccactgcagccaagcctggatgac
ttcatggacatctcagatttctttaccaactcccgcctcccacagccgcccatgccttca
aacttcccagagccccccagcttcagccccgtggttgactccctcttcagcagtgggacc
ctgggcccagaggtgcccccggcttcctcggccatgacccacctctctggacacagccgt
ctgcaggctcggaacagctgccctggccccttggactccagcgccttcctgagttctgat
ttcctccttcctgaagaccccaagccccggctcccaccccctcctgtacccccacctctg
ctgcattaccctccccctgccaagcaggagacagtccctgaattcccctgcacattcctt
cccccgaccccggcccctacaccgccccggccacctccaggcccggccacattggcccct
tccaggcccctgcttgtccccaaagcggagcggctctcacccccagcgcccagcggcagt
gaacggcggctgtcaggggacctcagctccatgccaggccctgggactctgagcgtccgt
gtctctcccccgcaacccatcctcagccggggccgtccagacagcaacaaggccctcctg
ggctccttcctaggctcacccaactctctgctccccgagaccgagaaccggcgtatcaca
cacatctccgcggagcagaagcggcgcttcaacatcaagctggggtttgacacccttcat
gggctcgtgagcacactcagtgcccagcccagcctcaaggtgagcaaagctaccacgctg
cagaagacagctgagtacatccttatgctacagcaggagcgtgcgggcttgcaggaggag
gcccagcagctgcgggatgagattgaggagctcaatgccgccattaacctgtgccagcag
cagctgcccgccacaggggtacccatcacacaccagcgttttgaccagatgcgagacatg
tttgatgactacgtccgaacccgtacgctgcacaactggaagttctgggtggtatcctcc
ctcaagcccatggctggggggctgcaggggctgtggcagggcagctccttgacctgggcc
cagttcagcatcctcatccggcctctgtttgagtccttcaacgggatggtgtccacggca
agtgtgcacaccctccgccagacctcactggcctggctggaccagtactgctctctgccc
gctctccggccaactgtcctgaactccctacgccagctgggcacatctaccagtatcctg
accgacccgggccgcatccctgagcaagccacacgggcagtcacagagggcacccttggc
aaacctttatag

>gi568815591r:73437301_73657578|GENSCAN_predicted_peptide_5|55_aa
MARALGSGACQSPSAAGSCPLTFNPLYFPRGLGARPPAAGWARGGRSRRCGATPX

>gi568815591r:73437301_73657578|GENSCAN_predicted_CDS_5|165_bp
atggcccgggccctgggctcgggcgcctgccaatccccgtctgccgccggctcgtgcccg
ctcacctttaaccccctctactttccccgggggctcggtgcccggcccccagccgcgggc
tgggctcggggagggaggagccgcaggtgtggggccaccccagnn