GENSCAN 1.0 Date run: 8-Nov-116 Time: 13:12:21 Sequence gi568815594r:102163047_102444662 : 281616 bp : 38.73% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5606 5669 64 1 1 79 29 85 0.958 3.06 1.02 Intr + 6684 6839 156 1 0 99 58 97 0.150 6.86 1.03 Intr + 25256 25356 101 0 2 53 43 71 0.133 -1.99 1.04 Term + 25492 25665 174 0 0 71 37 183 0.269 8.28 1.05 PlyA + 26169 26174 6 1.05 2.03 PlyA - 26418 26413 6 1.05 2.02 Term - 42193 42157 37 0 1 93 37 41 0.747 -4.67 2.01 Init - 45500 42541 2960 2 2 44 53 931 0.767 76.35 2.00 Prom - 45593 45554 40 -4.95 3.02 PlyA - 45762 45757 6 1.05 3.01 Sngl - 47006 46677 330 2 0 88 44 330 0.970 24.37 3.00 Prom - 47610 47571 40 -3.55 4.00 Prom + 60061 60100 40 -2.55 4.01 Init + 72470 72649 180 1 0 64 18 189 0.664 8.78 4.02 Intr + 72829 72966 138 0 0 97 -2 85 0.345 0.14 4.03 Intr + 74516 75260 745 1 1 4 14 517 0.639 25.99 4.04 Intr + 75558 75633 76 1 1 36 68 78 0.643 -1.75 4.05 Intr + 75918 76509 592 0 1 50 48 246 0.157 8.47 4.06 Intr + 77842 78157 316 0 1 31 80 204 0.115 8.61 4.07 Intr + 78383 78789 407 0 2 -18 66 249 0.320 5.24 4.08 Intr + 79530 79754 225 2 0 83 -6 128 0.101 0.16 4.09 Intr + 82215 82395 181 2 1 57 48 62 0.030 -2.38 4.10 Intr + 86508 86759 252 1 0 61 30 279 0.393 15.88 4.11 Intr + 93626 93774 149 0 2 86 86 66 0.242 5.23 4.12 Intr + 97452 97511 60 2 0 73 81 65 0.373 2.31 4.13 Term + 104854 104982 129 0 0 43 48 229 0.487 11.60 4.14 PlyA + 106879 106884 6 1.05 5.00 Prom + 106915 106954 40 -7.65 5.01 Init + 110121 110207 87 2 0 77 38 87 0.293 3.29 5.02 Intr + 110646 110920 275 2 2 -8 71 245 0.271 8.61 5.03 Term + 111185 111524 340 2 1 -33 47 278 0.563 4.62 5.04 PlyA + 111763 111768 6 1.05 6.07 PlyA - 116258 116253 6 1.05 6.06 Term - 122027 121867 161 0 2 85 36 95 0.008 1.12 6.05 Intr - 138355 138130 226 1 1 22 65 147 0.401 2.44 6.04 Intr - 141435 141271 165 0 0 91 76 140 0.900 12.34 6.03 Intr - 142065 141943 123 0 0 118 99 41 0.996 8.06 6.02 Intr - 144559 144390 170 2 2 82 94 25 0.831 1.24 6.01 Init - 151115 151016 100 2 1 75 65 124 0.736 9.27 6.00 Prom - 152901 152862 40 -7.75 7.00 Prom + 153276 153315 40 -7.25 7.01 Init + 158348 158423 76 1 1 90 28 75 0.410 3.00 7.02 Term + 167594 168210 617 0 2 24 38 292 0.654 11.44 7.03 PlyA + 168384 168389 6 1.05 8.03 PlyA - 169343 169338 6 1.05 8.02 Term - 175467 175453 15 0 0 106 48 3 0.220 -4.64 8.01 Init - 181616 181398 219 2 0 101 110 399 0.954 40.08 8.00 Prom - 185964 185925 40 -8.05 9.05 PlyA - 186093 186088 6 -0.45 9.04 Term - 186671 186648 24 2 0 105 41 28 0.005 -2.95 9.03 Intr - 189766 189689 78 1 0 50 92 66 0.012 2.03 9.02 Intr - 200391 200110 282 0 0 68 76 139 0.567 7.39 9.01 Init - 209577 209458 120 0 0 71 91 105 0.905 9.34 9.00 Prom - 209631 209592 40 -5.75 10.03 PlyA - 209741 209736 6 1.05 10.02 Term - 211702 211208 495 1 0 25 41 242 0.881 6.78 10.01 Init - 215322 214285 1038 0 0 49 41 418 0.592 27.23 10.00 Prom - 215415 215376 40 -6.15 11.02 PlyA - 215584 215579 6 1.05 11.01 Sngl - 216826 216437 390 1 0 88 54 386 0.934 31.17 11.00 Prom - 222893 222854 40 -5.55 12.00 Prom + 227725 227764 40 -5.75 12.01 Init + 232144 232182 39 0 0 62 52 40 0.186 -1.96 12.02 Term + 232436 233323 888 1 0 43 42 541 0.932 36.47 12.03 PlyA + 234332 234337 6 1.05 13.04 PlyA - 235142 235137 6 1.05 13.03 Term - 238035 237935 101 1 2 81 48 86 0.055 1.21 13.02 Intr - 279349 279220 130 1 1 83 48 54 0.125 0.25 13.01 Intr - 280059 279921 139 2 1 68 75 108 0.550 7.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 6684 6859 176 1 2 99 47 112 0.817 5.14 S.002 Sngl - 15510 15364 147 0 0 93 44 147 0.801 4.62 S.003 Sngl + 32577 32867 291 2 0 75 46 151 0.850 5.00 S.004 Intr + 77969 78157 189 0 0 87 80 139 0.804 11.64 S.005 Term + 187270 187350 81 0 0 71 48 109 0.861 1.91 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:102163047_102444662|GENSCAN_predicted_peptide_1|164_aa MARVRANEGLNEVGQWAQKGQDLRITTTTSFQSVGLWGSTNPHSSPRNITGWAHSSRSSF SLEALASSSGEPSLVYTSSYLVTQSVAFFRYINTVDSTSGVGPEVVWEKLASFIPGMNRH LTQDLEKLKEQPLKAERTTGSADAPRWSSRCPPNYMNNGTCCLT >gi568815594r:102163047_102444662|GENSCAN_predicted_CDS_1|495_bp atggctcgagtaagagcaaatgaaggcctgaacgaagttgggcagtgggcgcagaaaggc caggatctccgaataacaactacaacctcctttcagtccgtgggcctgtgggggtccacc aaccctcactccagtcccaggaacatcaccgggtgggcccactcttccagatctagcttc tcactggaggcccttgcaagcagctcaggagagccatccttggtctatactagcagctac ttggtcacccaaagtgttgccttctttaggtacattaacacagttgattctacatccggt gttgggcctgaagttgtatgggagaagttggcatccttcatccctgggatgaacaggcac ttgacccaggacttggagaagctgaaggagcagcccctgaaagccgaaaggaccacaggc tctgctgatgctccaaggtggagcagcagatgccccccaaattatatgaacaacggcacc tgttgtcttacataa >gi568815594r:102163047_102444662|GENSCAN_predicted_peptide_2|998_aa MGDFNTPLSTLDRSTRQKVNKDTQELNSTLHQADLIGIYRTLHPKSTEYTFFSAPHHTYF KIDHILGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQSHSTTWKLNNLLLNDYWV HNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAYKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIK KKREKNQIDTIKNDKGDITTDPTKIQTTIREYYKHLYANKLENLEEMDTFLDTYTLPRLN QEEVESLNRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKE GILPNSFYEASIILISKLGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHD QVGFIPGMQSWFNIRKSINVIQHINRAKDKNHMIISIDAEKAFDKIEQPFMLKTLNKLGI DGTYLKIIRAIYDKPTANIILIGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQE KEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLFKLISNFSKVSGYKINVEKSQAFL YTNNTQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNI PCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILS QKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHTYNYLIFDEPEK NKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLG ITIQDIGVGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTTWEKIFAT YSSDKGLISRIYNELKQIYKKKTNNPIKKWARDMNRHFSKEDIYAAKKHMKTCSSSLAIR EMQIKTTMRYHLIPARMAIIKKSGNNRDMDEIGNHHSQ >gi568815594r:102163047_102444662|GENSCAN_predicted_CDS_2|2997_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagtcaac aaggatacccaggaattgaactcaactctgcaccaagcggacctaataggcatctacaga actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctatttc aaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc actcaaagccactcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca acttaccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcgtacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaatcaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaag aaaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggggatatcaccacc gatcccacaaaaatacaaactaccatcagagaatactacaaacacctctatgcaaataaa ctagaaaatctagaagaaatggatacattcctcgacacatacactctcccaagactaaac caggaagaagttgaatctctgaatagaccaataacaggagctgaaattgtggcaataatc aatagtttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccag aggtacaaggaggaactggtaccattccttctgaaactattccaatcaatagaaaaagag ggaatcctccctaactcattttatgaggccagcatcattctgatatcaaagctgggcaga gacacaaccaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatc ctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgat caagtgggcttcatccctgggatgcaaagctggttcaatatacgcaaatcaataaatgta atccagcatataaacagagccaaagacaaaaaccacatgattatctcaatagatgcagaa aaggcctttgacaaaattgaacaacccttcatgctaaaaactctcaataaattaggtatt gatgggacgtatctcaaaataataagagctatctatgacaaacccacagccaatatcata ctgattgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccct ctctcaccgctcctattcaacatagtgttggaagttctggccagggcaatcaggcaggag aaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagac gacatgattgtttatctagaaaaccccattgtctcagcccaaaatctctttaaactgata agcaacttcagcaaagtctcaggatacaaaatcaatgtagaaaaatcacaagcattccta tacaccaacaacacacaaacagagagccaaatcatgagtgaactcccattcacaattgct tcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaag gagaactacaaaccactgctcaaggaaataaaagaggatacaaacaaatggaagaacatt ccatgctcatgggtaggaagaatcaatattgtgaaaatggccatactgcccaaggtaatt tatagattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaa actactttaaagttcatatggaaccaaaaaagagcccgcattgccaagtcaatcctaagc caaaagaacaaagctggaggcatcacactacctgacttcaaactatactacaaagctaca gtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaaca gagccctcagaaataatgccgcatacctacaactatctgatctttgacgaacctgagaaa aacaagcaatggggaaaggattccctatttaataaatggtgctgggaaaactggctagcc atatgtagaaagctgaaactggatcccttccttacaccttatacaaaaatcaattcaaga tggattaaagatttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggc attaccattcaggacataggcgtgggcaaggacttcatgtccaaaacaccaaaagcaatg gcaaccaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcacagca aaagaaactaccatcagagtgaacaggcaacctacaacatgggagaaaattttcgcaacc tactcatctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttacaag aaaaaaacaaacaaccccatcaaaaagtgggcgagggacatgaacagacacttctcaaaa gaagacatttatgcagccaaaaaacacatgaaaacatgctcatcatcactggccatcaga gaaatgcaaatcaaaaccactatgagatatcatctcataccagctagaatggcaatcatt aaaaagtcaggaaacaacagggacatggatgaaattggaaaccatcattctcagtaa >gi568815594r:102163047_102444662|GENSCAN_predicted_peptide_3|109_aa MGKKQNRKTGNSKTQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRS >gi568815594r:102163047_102444662|GENSCAN_predicted_CDS_3|330_bp atggggaaaaaacagaacagaaaaactggaaactctaaaacgcagagcgcctctcctcct ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagaatgattttgac gagctgagagaggaaggcttcagacgatcaaattactctgagctacgggaggacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagctga >gi568815594r:102163047_102444662|GENSCAN_predicted_peptide_4|1149_aa MNNSRYAALRAVTLTAKVCSFTPEASETTNPPEGRNCEHIRTSEGKNSRHTILKSCNTHH NSGAKYQAPVSQLKATSVAAGLKTWVSGFLEKGSLTTPNSLELGALIKADLGKFSDDPDR YIDVLQGRGQTFDLTWRDVVLLLDQTLAFNEKNAALAAAREFGDTWYLHQVNDRMTAEGR DKFSTSQQAIPSMDPHCNLDSHHGDWSRKHLLICDLEGLRRIRKKPINYSMMSTTTQRNE ENPSAFLEWLREALRKYTPLSPESLKGQLILKDKFITQSAADIRRKLQKQALGPEQNLEA LLNLATSVFYNRDQEEQAQKEKRDQRKATALVMAFRQTNLGGSERTENGAGQSRMLISCP GQLSARSVTIRGILGQPVTRQYLLRPEAHKGLQDIVKYLKAQALVRNCSSPCNTPILGVQ KPNGQWRLLQDLRLINEADIPLYPVVPNPYTLLSQIQEEAEWFTVLDLKDAFFCIPLHSD SQFLFAFEDPTDHSSQLTGMVLPQGFRDSPHLFGQALAQDLGHFSSPGTLVLQHVDDLLL ATSSEASFQEATLDLLNFLANQGYKMSRSKAQLCLQQVPKEVYGYQTTAYLETRHYSLRD QPVLHICMCMALNPATFLPEEGEPIEHNCQQITVQTYAPRDDFLEVPLAKPDLNLYTDGS SFVENGIRRAGYAIVSDVTILEMQKPKEVAVLHCQGHQKGEGEKSEGNCRADAEAKIAVR RNPPLEIPTEGPLVWNNPLQEIKPQYSQTEAEWGLSRGHSFLPLGWLVTEEGKVLIPEAS QWKILKTLHQTFHMGIENTNQMAKSLFTGPNLLRTIRQEGTYSVILSTPTAVKVAGVESQ IHQTRVKLWTSPEEPAGPSAQESQDQPDQPRYTCEPLEDLHFLFWKETSQTKRASAASLQ GRARDLQPAMPETHPRWTPTQPRPPQQVLPPALWHQVPSTAQGLRSAGTQHETGVKLQTF AVSVTAHKGSADPKSEQQQAVLQRAKNKASTTQKETTAGYHCCLRQPAFIPLSGPTHILL IGPFYREQIGPFYRELIVSSNFTPIYALVVYSFSCQIHTVTSSSRIRRLLLIITATGETK NCHAQMRNLDLKAESVATILEHEAILKDCYGSTESLKERQGAGSPNRQAIDEIVEGIAER YHPGNRPYF >gi568815594r:102163047_102444662|GENSCAN_predicted_CDS_4|3450_bp atgaacaactccagatacgccgccttaagagctgtaacactcactgcgaaggtctgcagc ttcactcctgaagccagcgagaccaccaacccaccagaaggaagaaactgtgaacacatc cgaacatcagaaggaaaaaactccagacacaccatcctcaagagctgtaacactcaccac aattctggggctaaataccaggcacctgtcagccagttaaaagcaactagcgtggccgct ggactaaagacatgggtgtcaggctttctggaaaagggctctctaacaacccccaactct ttggagttgggggcgttgatcaaggcagacctggggaagttttcagatgatcctgataga tacatagatgtcctacagggtagagggcaaacctttgacctcacttggagagatgtcgtg ctactcttagatcaaaccctggcctttaatgaaaagaatgcagctttagctgcagcccga gagtttggagatacatggtatcttcatcaagtaaatgatagaatgacagccgaaggaagg gacaaattctctaccagtcagcaagccatccccagtatggatccccactgtaaccttgac tcacatcatggggactggagtcgtaaacatctgttgatctgtgatctagaaggactaagg agaattaggaaaaagcccataaattattcaatgatgtccaccacaactcagagaaatgaa gaaaatccttctgccttcctcgagtggctacgagaggccttaagaaaatatactcccttg tcacccgaatcactcaagggtcaattgattctaaaagataagtttattacccaatcagct gcagatatcaggagaaagctccaaaagcaagccctgggccctgaacaaaatctggaggca ttattaaacttggcaacctcagtgttctataatagggaccaagaggaacaggcccaaaag gaaaagcgagatcagagaaaggccacagccttagtcatggccttcagacaaacaaacctt ggtggttcagaaaggacagaaaatggagcaggccaatcacgcatgttaatctcctgtcct ggacaactgtccgcaaggtccgttaccattcgaggaatcctgggacagcctgtaaccagg caatatctcttaaggcctgaagctcataaaggattacaggatattgttaaatatttaaaa gctcaagccttagtaaggaattgcagcagtccctgcaacactccaattctaggagtacaa aaacctaatggtcagtggagactactgcaagatcttagactcatcaatgaggcagatatt cctctatatccagttgtacccaacccctataccctgctctctcaaatacaagaggaagca gaatggttcacagttctggacctcaaggatgccttcttctgtattcccctgcactctgac tcccagtttctctttgcctttgaggatcccacagaccactcatcccaacttacggggatg gtcttgccccaagggtttagggatagccctcacctgtttggtcaggcactggcccaagat ctaggccacttctcaagtccaggcactctggttcttcagcatgtggatgatttgcttttg gctaccagttcagaagcctcattccaggaggctactctagatctcttgaactttctagct aatcaagggtacaaaatgtctaggtctaaggcccagctttgcctacagcaggtgccaaag gaagtttatggctatcagacaaccgcctacttagaaactaggcactactccttgagggac caaccagtacttcacatatgcatgtgcatggccctcaaccctgccacttttctcccagag gagggggaaccaatcgagcataactgccaacaaattacagtccagacttatgccccccga gatgatttcttagaagtccccttagctaaacctgaccttaacctatataccgacgggagt tcatttgtggagaatggcatacgaagggcaggttacgccatagttagtgatgtaaccata cttgaaatgcaaaaacccaaggaggtggcagtcttacactgccaaggccatcagaaaggt gaaggagaaaagtcagaaggaaactgccgggcagatgctgaggccaaaattgctgtgagg cggaaccctccattagaaatacctacggaaggacccttggtatggaacaaccccctccaa gagattaagccccagtattcccaaaccgaagcagagtggggactttcacgggggcatagt tttctccccttggggtggttggtgacagaagaaggaaaggtacttatacccgaagccagc cagtggaaaatacttaaaaccctccaccaaacttttcatatgggtattgaaaacactaat caaatggccaaatccctatttacagggccaaatctcctccgtaccatccgacaggaagga acatactcggtaatcctctctactcccactgcagttaaggtggcaggagtggaatctcag attcaccaaaccagagttaaactttggacatcccctgaggaacctgcaggaccatcagct caggagtcccaagatcagccagaccagcctcgatacacctgcgaaccactggaggacttg cacttcctattttggaaggaaacctcccagactaaaagggcctcagctgcctccctgcaa ggaagggctcgggacctgcagcctgccatgcccgagacccacccccggtggactcccaca cagcccaggcctccccaacaggtgctgccccctgctctgtggcaccaggtcccatcgacc gcccaagggctgaggagtgcaggcacacagcacgagactggagtgaagctgcagaccttc gcagtgagtgttacagctcataaaggcagtgcagacccgaagagtgagcagcagcaagct gtattgcaaagagcgaagaacaaagcttccacaacacagaaggagacaacagcaggttac cactgctgccttcggcagcctgcttttattcccttatctggccccacccacatcctgctg attggtccattttacagagagcagattggtccattttacagagagctgattgtgagcagc aacttcactcccatatacgcacttgttgtgtattccttctcctgccaaattcacacagtc acctcctcttccaggataagacgtttactgctgataatcacagccacaggggaaacaaag aactgccatgctcaaatgaggaatttagaccttaaggctgaatctgtagcaaccatctta gagcatgaagcaatcttgaaggattgctatggaagtactgagtccctgaaggagagacaa ggtgcaggaagccccaatcgccaggccatcgatgaaattgtggagggcatcgcagagcgt tatcatccaggcaatcgtccctatttctga >gi568815594r:102163047_102444662|GENSCAN_predicted_peptide_5|233_aa MEPGKLRPTGLKFSLPAQQSEVNLGHSSLKGSNWHLMGAHLGRSFQRKEQTAIFAVLQPM LVIPRQTGDGVDPQQTPEDLQKRGLTVRRKTNKQKAIASTSTKRTATQKLHLKVTNSKDQ RIVSLEKNINDLIKSKNTAQELREAYTSISSQIYEAEESISEIEDQLNKIKHEDKIREKT MKRNKQRIQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQDIIQENFLNLAR >gi568815594r:102163047_102444662|GENSCAN_predicted_CDS_5|702_bp atggagcctggaaagctaagacccactggcttgaaattctcgctgccagcacagcagtct gaagtcaacctggggcactcgagcttgaagggctccaactggcatctgatgggtgcccat ctgggacgaagcttccagaggaaggagcagacagcaatctttgctgttctacagccaatg ctggtgatacccaggcaaacaggggatggagtggacccccagcaaactccagaagacctg cagaagaggggtctgactgttagaaggaaaaccaacaaacagaaagcaatagcatcaaca tcaacaaaaagaacggccactcaaaaactccatctgaaggtcaccaacagcaaagaccaa agaatagtcagtttagagaagaacataaatgatctgataaagtcgaaaaacacagcacaa gaacttcgtgaagcatacacaagtatcagtagccaaatctatgaagcagaagaaagtata tcagaaattgaagatcaacttaataaaataaagcatgaagacaagattagagaaaaaaca atgaagaggaacaagcaaagaatccaagaaatatgggactatgtgaaaagaccaaaccta cgtttgattggtgtacctgaaagtgatggggagaatggaaccaagttggaaaacacactt caggatattatccaggagaacttcctcaacctagcaagatag >gi568815594r:102163047_102444662|GENSCAN_predicted_peptide_6|314_aa MSQVLSQRNRRDRYTISRDGEDWEEPFIVEEGSVWGYGFLSVTIINLASLLGLILTPLIK KSYFPKILTFFVGLAIGTLFSNAIFQLIPEAFGFDPKVDSYVEKAVAVFGGFYLLFFFER MLKMLLKTYGQNGHTHFGNDNFGPQEKTHQPKALPAINGVTCYANPAVTEANGHIHFDNV SVVSLQKRSLKVSVGNCIMRKNQSCKYRGRKYAKDFENNQTGHTQGTWKLNGSGLPHINA GEQWHNSFNILNENNCGIINLGISKCLMNMTLSNLYTNLVKVDKSQKRLAAEETPIEITL EGSIRHSGLWSYAF >gi568815594r:102163047_102444662|GENSCAN_predicted_CDS_6|945_bp atgagtcaagtcttgagccagagaaaccggagagatcgttataccatttcccgggatgga gaagactgggaagagccctttattgtagaggaagggagtgtttggggatatggattcctg tcagtgacgattattaatctggcatctctcctcggattgattttgactccactgataaag aaatcttatttcccaaagattttgaccttttttgtggggctggctattgggactcttttt tcaaatgcaattttccaacttattccagaggcatttggatttgatcccaaagtcgacagt tatgttgagaaggcagttgctgtgtttggtggattttacctacttttcttttttgaaaga atgctaaagatgttattaaagacatatggtcagaatggtcatacccactttggaaatgat aactttggtcctcaagaaaaaactcatcaacctaaagcattacctgccatcaatggtgtg acatgctatgcaaatcctgctgtcacagaagctaatggacatatccattttgataatgtc agtgtggtatctctacagaaaaggtccctcaaggtgtcagtagggaattgtataatgaga aagaaccagtcatgcaaatatcggggaagaaaatatgctaaagactttgagaataatcaa acaggtcacacacaagggacctggaagctgaatggtagtggacttccccacatcaatgct ggagagcagtggcacaattccttcaatattctgaatgaaaacaactgtggaattataaac ctaggaatatcaaagtgcttaatgaacatgactttatcgaacctttataccaacttagtg aaagtggataaaagtcagaaacgtttagcagcagaggaaacaccaatagagattacattg gagggttccataagacattccggcctctggtcttatgctttctag >gi568815594r:102163047_102444662|GENSCAN_predicted_peptide_7|230_aa MGISRACKGQEPALFSSCVCDLEPILEKERLFPNSFYEVSIILIPKPGKDITEKENFRSI SLMNMDAKLLNKILANQFQQHIKKLIHYDQVSFIPGMQCWFNICKSINVIHHINRTNDKN HMIISIDAEKAFDKIQHHFMLKTLNKLRIDGMYLKIIRAAYDKPTANIILNGQKLEAVPL KTSTRQGCPLLPLLFNIVLEVLARAIRQEKEMKGIQTGKEAVKLSRLQMT >gi568815594r:102163047_102444662|GENSCAN_predicted_CDS_7|693_bp atgggaatttcccgtgcgtgcaaggggcaggagcctgccctcttcagttcctgtgtttgt gacctggaaccaatcttagaaaaagagagactcttccctaactcattttacgaggtcagc atcatcctgataccaaaacctggcaaagacataacagaaaaagaaaatttcaggtcaata tccctgatgaacatggatgcaaaactcctcaataaaatactggcaaaccaattccagcag cacatcaaaaagcttatccattatgatcaagtcagtttcatccctggaatgcagtgctgg ttcaacatatgcaaatcaataaacgtaatccatcacataaacagaaccaatgacaaaaac cacatgattatctcaatagatgcagaaaaggcttttgataagattcaacaccacttcatg ctaaaaacgctcaataaactacgtattgatgggatgtatctcaaaataataagagctgct tatgacaagcccacagccaatatcatactgaatgggcaaaagctggaagcagtccctttg aaaaccagcacaagacaaggatgccctctattaccactcctattcaacatagtactggaa gttctggccagggcaatcaggcaagagaaagaaatgaagggtattcaaacaggaaaagaa gcagtcaaattgtctcgtttgcagatgacatga >gi568815594r:102163047_102444662|GENSCAN_predicted_peptide_8|77_aa MAPGRAVAGLLLLAAAGLGGVAEGPGLAFSEDVLSVFGANLSLSAAQLQHLLEQMGAASR VGVPEPGQLHFNQRPYN >gi568815594r:102163047_102444662|GENSCAN_predicted_CDS_8|234_bp atggccccgggtcgcgcggtggccgggctcctgttgctggcggccgccggcctcggagga gtggcggaggggccagggctagccttcagcgaggatgtgctgagcgtgttcggcgcgaat ctgagcctgtcggcggcgcagctccagcacttgctggagcagatgggagccgcctcccgc gtgggcgtcccggagcctggccagctgcacttcaaccagagaccttataattaa >gi568815594r:102163047_102444662|GENSCAN_predicted_peptide_9|167_aa MGKDFMTKTPKAMATKAKIDKWDLIKELLHSKINYHQSEQISWETNLFTSADLKSPQVRS NAVFVCIQQSGSIGSVSNNILFTWVIAGKRNSPGNPGETLTFTKVCGSGQTQESSDFRAR TCSISRKRRKGVVRLSVIKKHASHAAWGLPVCCSGLSSHQALTLHTG >gi568815594r:102163047_102444662|GENSCAN_predicted_CDS_9|504_bp atgggcaaagacttcatgactaagacaccaaaagcaatggcaacaaaagccaaaattgac aaatgggatctaattaaagagcttctgcacagcaaaataaactatcatcagagtgaacag atttcctgggaaaccaacctcttcacttcagctgacctgaaatccccacaggtccgctca aatgctgtatttgtttgcatccaacaaagtggaagtatagggagtgtcagtaataacata ctgtttacatgggtgatagctgggaagagaaattcaccaggaaatcctggtgagactctt acattcaccaaggtgtgtggcagtgggcagactcaggagtcttcagatttcagagcaagg acctgcagcataagcaggaagaggagaaagggtgttgtcaggctttcggtgataaagaag catgccagccacgcagcatggggtcttcctgtttgctgctcaggactttcttctcatcag gctctgactctccacactggctga >gi568815594r:102163047_102444662|GENSCAN_predicted_peptide_10|510_aa MGDFNTPLSTLDRSMRQKVNKDIQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHCTYS KIDHIVGSKALLSKCKRIEIITNCLSDHSAIKLELRIKKLTQNCSTTWKLNNLLLNDYWV HNKMKAEIKMFFETNENKDTTYQNLWDTFKAVCRGKFIALNAYKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFEKINKIDRPLARLIK RKREKNQIDTIKNDNGDITTDPTEIQTTIREYCKHLYANKLENLEEMDKFLDTYTLPRLN QEKVESLNRPITGAEIEAIINSLPTKKSPGPDGFTAEFYQRYKEELYNKKKEKEKNQIGT TKNDEGDITTDPREVQTTIREYYKHLYANKLENLGEMNNFLDTYIPSRANQEEVESLNRP ITSSEIEAIINSLPTKKIPGPDGFTAEFYQRYKEELVPLLLKLFQIIEKEELLSVILIPK PGRDTTKKENFKSISLMNIDTKIVNKILAN >gi568815594r:102163047_102444662|GENSCAN_predicted_CDS_10|1533_bp atgggagactttaacaccccactgtcaacattagacagatcaatgagacagaaagttaac aaggatatccaggaattgaactcagctctgcaccaagcagacctaatagacatctacaga actctccaccccaaatcaacagaatatacattcttctcagcaccacactgcacttattcc aaaattgaccacatagttggaagtaaagcactcctcagcaaatgtaaaagaatagaaatt ataacaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaaactc actcaaaactgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacaaaatgaaggcagaaataaagatgttttttgaaaccaacgagaacaaagacaca acttaccagaatctctgggacacattcaaagcagtgtgtagagggaaatttatagcacta aatgcctataagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaag atcagagcagaactcaaggaaatagagacacaaaaaacccttcaaaaaatcaatgaatcc aggagctggttttttgaaaagatcaacaaaattgatagaccactagcaagactaataaag aggaaaagagagaagaatcaaatagacacaataaaaaatgataatggggatatcaccaca gatcccacagaaatacaaactaccatcagagaatactgtaaacacctctatgcaaataaa ctagaaaatctagaagaaatggataaattccttgacacatacaccctcccaaggctaaac caagaaaaagttgaatctctgaatagaccaataacaggtgctgaaattgaggcaataatt aatagcttaccaaccaaaaaaagtccaggaccagatggattcacagccgaattctaccag aggtacaaggaggagctgtataataaaaaaaaagaaaaagagaaaaatcaaataggcaca acaaaaaatgatgaaggagatatcaccactgatcccagagaagtacaaactaccatcaga gaatactataaacacctctatgcaaataaactagaaaatttaggagaaatgaataatttc ctggacacatacatcccctcaagagcaaaccaggaagaagttgaatccttgaatagacca ataacaagttctgaaattgaggcaataattaatagcctaccaaccaaaaaaatcccagga ccagatggatttacagctgaattctaccagaggtacaaagaggagctggtgccattactt ctgaaactattccaaataatagaaaaagaggaactcctctcagtcatcctgataccaaaa cctggcagggacacaacaaaaaaagaaaatttcaagtcaatatccctgatgaacatcgac acaaaaatcgtcaataaaatactggcaaactga >gi568815594r:102163047_102444662|GENSCAN_predicted_peptide_11|129_aa MGKKQNRKTGNSKNQSTSPPPKERSSSPAMEQSWTENDFDELREEGFRQSNYSKLKEEVR THGKEVKNLEKKLDEWLTRITNAERSLKDLKGLKTKARELCDECTSLSSRLDQLEERVSV MEDQMNEMK >gi568815594r:102163047_102444662|GENSCAN_predicted_CDS_11|390_bp atggggaaaaaacagaacagaaaaactggaaactctaaaaatcagagcacctctcctcct ccaaaggaacgcagctcctcaccagcaatggaacaaagctggacagagaatgactttgac gagttgagagaagaaggcttcagacaatcaaactactccaagctaaaggaggaagttcga acccatggtaaagaagttaaaaaccttgaaaagaaattagatgaatggctaactagaata accaatgcagagaggtccttaaaggacctgaaggggctgaaaaccaaggcacgagaacta tgtgacgaatgcacaagcctcagtagccgattggatcaactggaagaaagggtatcagtg atggaagatcaaatgaatgaaatgaagtga >gi568815594r:102163047_102444662|GENSCAN_predicted_peptide_12|308_aa MIGKLVTKKFGEDADLATATDVCSIFQQQRPTLSPQYGTIPGGDQPATWWQVDYTRLLPS WKGQRFVLTGIDTYSGYGFAYPACNASAKTTICGLTECLIHCDDIPHSIASDQGTHFTAK EMWQWLHAHGIRWSYHVLHHPEAARLIEQWNGLLKPQLQHQLSYNTSQGWGKVLQKVTCA LNQHPIYGTVSPIARIHGSRIQGVEVEVAPLTNTHSDPLATFLLPVLTTLCSASPEVLFP EAGMLPQGDTTTIPLNWKLRLPPRHFGLLLPLSQQAKKGVAVLAGVIDLDYQDEISLLLH TEVRKSMH >gi568815594r:102163047_102444662|GENSCAN_predicted_CDS_12|927_bp atgattggaaaattggtgacaaagaaattcggggaagacgctgacctggctacggccact gatgtgtgctcaattttccagcagcagagaccaacactgagccctcaatatggcaccatt cctgggggtgatcagccagctacctggtggcaggttgattatactcgacttcttccatca tggaaaggacagaggtttgtcctcactggaatagacacttactctggatatgggtttgcc tatcctgcatgcaatgcttctgccaagactaccatctgtggactcacagaatgccttatc cactgtgatgatattccacacagcattgcctctgatcaaggcactcactttaccgctaaa gaaatgtggcagtggcttcatgctcatggaattcgctggtcttaccatgttctccatcat cctgaagcagccagattaatagaacagtggaatggcctgttgaagccacaattacaacac caactaagttacaatacttcacagggctggggcaaagttctccagaaggtcacgtgtgct ctaaatcagcatccaatatatggtactgtttctcccatagccaggattcacgggtccagg attcaaggggtggaagtggaagtggcaccactcaccaacacccatagtgatccactagca acatttttgcttcctgttctcacgacattatgttctgctagcccagaggtcttatttcca gaggcaggaatgctgccacagggagacacaacaacgatcccattaaactggaagttaaga ttgccacctcgacactttgggctcctcctacctttaagtcaacaggctaagaagggagtt gcagtgttggctggggtgattgacctggactatcaagatgaaatcagtctactactccac acagaagtcaggaagagtatgcattga >gi568815594r:102163047_102444662|GENSCAN_predicted_peptide_13|123_aa XSVNKNSKKKTVVIVSFIARSRLQADKGSSENNFTLCFERDRIERKENKGTHPFSGVRIL QREEGSGVLANWLSGWKKTGFVTVADFSNVERQHKFALLPAEHGGAYITTSVSKYGGNLL ANF >gi568815594r:102163047_102444662|GENSCAN_predicted_CDS_13|372_bp nnttcggttaataaaaattcaaagaagaaaacagtggtcattgtttcctttattgccagg tctagacttcaggcagacaagggatcttcagagaacaacttcaccttgtgctttgagaga gacaggattgagcgaaaggagaacaaaggcactcatcctttctcaggtgttagaatacta caaagggaagaaggcagtggtgtgctagcaaattggctgtctgggtggaaaaagactgga tttgtaacagttgctgatttttctaatgtagaaaggcagcacaagtttgcactactgcca gccgagcatgggggtgcttatattactacatccgtgtcaaaatatggcggtaacctgctt gctaatttttga