GENSCAN 1.0 Date run: 8-Nov-116 Time: 15:12:40 Sequence gi568815597r:93788993_94009163 : 220171 bp : 41.59% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 528 523 6 1.05 1.01 Sngl - 5278 2264 3015 1 0 44 37 1016 0.377 83.65 1.00 Prom - 5371 5332 40 -9.55 2.02 PlyA - 5540 5535 6 1.05 2.01 Sngl - 6784 5768 1017 1 0 88 43 777 0.878 69.97 2.00 Prom - 10548 10509 40 -6.65 3.04 PlyA - 10584 10579 6 1.05 3.03 Term - 13615 13473 143 2 2 60 49 144 0.560 4.81 3.02 Intr - 14139 14077 63 1 0 34 81 89 0.198 0.57 3.01 Init - 25800 25533 268 0 1 33 86 227 0.275 12.63 3.00 Prom - 30392 30353 40 -4.95 4.00 Prom + 31992 32031 40 -4.75 4.01 Init + 33995 34229 235 1 1 70 68 92 0.282 3.20 4.02 Intr + 36422 36558 137 0 2 101 45 82 0.711 4.57 4.03 Intr + 37922 38101 180 1 0 47 80 98 0.826 4.04 4.04 Intr + 43532 43610 79 1 1 68 115 56 0.423 4.51 4.05 Intr + 46232 46371 140 0 2 26 65 103 0.241 1.06 4.06 Intr + 48601 48693 93 0 0 100 95 -7 0.033 0.44 4.07 Term + 58775 58975 201 1 0 101 43 325 0.737 25.71 4.08 PlyA + 59361 59366 6 1.05 5.00 Prom + 59919 59958 40 -7.25 5.01 Init + 67505 67754 250 1 1 57 71 154 0.201 8.27 5.02 Term + 69314 70746 1433 0 2 30 48 715 0.106 51.74 5.03 PlyA + 70894 70899 6 1.05 6.10 PlyA - 72018 72013 6 1.05 6.09 Term - 80952 80859 94 2 1 88 44 171 0.999 9.02 6.08 Intr - 81800 81691 110 2 2 70 98 131 0.999 10.46 6.07 Intr - 83244 83080 165 0 0 23 80 184 0.999 10.34 6.06 Intr - 84222 84127 96 0 0 74 89 103 0.988 8.29 6.05 Intr - 86791 86653 139 0 1 79 107 94 0.998 9.95 6.04 Intr - 88870 87276 1595 1 2 97 93 1392 0.991 127.13 6.03 Intr - 89470 89369 102 1 0 117 23 66 0.833 2.45 6.02 Intr - 89600 89556 45 2 0 31 116 57 0.580 0.49 6.01 Init - 90513 90085 429 0 0 58 97 297 0.796 23.90 6.00 Prom - 96227 96188 40 -7.35 7.07 PlyA - 96235 96230 6 1.05 7.06 Term - 100167 99998 170 1 2 76 36 130 0.965 3.66 7.05 Intr - 105736 105622 115 1 1 93 93 87 0.976 8.80 7.04 Intr - 107828 107626 203 0 2 79 94 210 0.997 18.78 7.03 Intr - 112677 112593 85 0 1 52 63 86 0.900 0.97 7.02 Intr - 115596 115531 66 0 0 79 110 11 0.523 0.58 7.01 Init - 120171 120046 126 0 0 93 83 347 0.999 32.91 7.00 Prom - 122225 122186 40 -3.65 8.02 PlyA - 122984 122979 6 1.05 8.01 Sngl - 123991 123074 918 1 0 42 44 443 0.961 31.38 8.00 Prom - 125311 125272 40 -9.65 9.02 PlyA - 125478 125473 6 1.05 9.01 Sngl - 126736 126398 339 1 0 88 34 229 0.986 13.28 9.00 Prom - 129190 129151 40 -2.25 10.00 Prom + 129212 129251 40 -10.94 10.01 Init + 130373 130557 185 1 2 67 52 187 0.967 11.64 10.02 Intr + 131839 131870 32 1 2 90 103 3 0.874 -1.14 10.03 Intr + 132498 132604 107 1 2 43 99 117 0.117 7.31 10.04 Intr + 133876 134013 138 0 0 20 95 71 0.172 0.74 10.05 Intr + 136741 136810 70 0 1 47 64 18 0.092 -6.86 10.06 Term + 140486 141129 644 0 2 68 42 218 0.681 8.54 10.07 PlyA + 141456 141461 6 1.05 11.00 Prom + 144068 144107 40 -5.85 11.01 Init + 167735 168085 351 1 0 59 72 184 0.091 9.11 11.02 Intr + 174620 174717 98 1 2 57 66 40 0.136 -3.41 11.03 Intr + 177116 177169 54 2 0 116 89 45 0.925 4.58 11.04 Intr + 177619 177739 121 1 1 85 59 79 0.602 4.28 11.05 Term + 180085 180237 153 0 0 -3 44 256 0.986 9.04 11.06 PlyA + 180293 180298 6 1.05 12.06 PlyA - 181266 181261 6 1.05 12.05 Term - 188111 187930 182 0 2 90 45 95 0.455 2.19 12.04 Intr - 189364 189271 94 1 1 59 60 62 0.170 -0.88 12.03 Intr - 194708 194658 51 2 0 78 89 33 0.025 0.59 12.02 Intr - 200027 199967 61 1 1 65 93 31 0.863 -0.88 12.01 Init - 200359 200193 167 1 2 85 101 104 0.921 10.55 12.00 Prom - 203059 203020 40 -8.25 13.14 PlyA - 203872 203867 6 1.05 13.13 Term - 204219 204036 184 2 1 99 45 171 0.483 9.93 13.12 Intr - 206808 206771 38 0 2 49 85 20 0.040 -6.06 13.11 Intr - 207203 207117 87 2 0 114 80 115 0.064 12.55 13.10 Intr - 208947 208813 135 0 0 12 15 178 0.050 2.74 13.09 Intr - 209118 208971 148 2 1 63 18 158 0.058 5.62 13.08 Intr - 211936 211844 93 0 0 139 82 110 0.987 13.66 13.07 Intr - 212113 211980 134 1 2 116 35 68 0.910 2.82 13.06 Intr - 213000 212572 429 0 0 118 -29 313 0.417 15.59 13.05 Intr - 213500 213306 195 2 0 78 -3 159 0.606 4.49 13.04 Intr - 216590 216449 142 1 1 94 62 185 0.682 15.93 13.03 Intr - 218748 218642 107 0 2 75 105 112 0.999 9.69 13.02 Intr - 219335 219243 93 2 0 75 90 40 0.523 2.14 13.01 Init - 219992 219759 234 2 0 46 116 270 0.623 23.79 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 209118 208839 280 2 1 63 53 325 0.896 20.43 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:93788993_94009163|GENSCAN_predicted_peptide_1|1004_aa MGDFNTPLSTLDRSTRQKVNKDTQEMNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS KIDHIVGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQSRSTTWKLNNLLLNDYWV HNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAYKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITEIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIK KKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDTFLDTYTLPRLN QEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRCKEELVPFLLKLFQSIEKE GILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHD QVGFIPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGI DGMYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQE KEIKGIQLGKEEVKLSLFADDMIVYLENPIVSPQNLLKLISNFSKVSGYKINVQKSQAFL YTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNI PCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILS QKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIIPHIYNYLIFDKPEK NKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLG ITIQDIGVGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTTWEKIFAT YSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAIR EMQIKTTMRYHLTPVRMAIIKKSGNNRCWRGCGEIGTLLHLVGL >gi568815597r:93788993_94009163|GENSCAN_predicted_CDS_1|3015_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagtcaac aaggatacccaggaaatgaactcagctctgcaccaagcggacctaatagacatctacaga actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc actcaaagccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacacc acataccagaatctctgggatgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcctacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactgaa atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaatcaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaag aaaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggggatatcaccacc gatcccacagaaatacaaactaccatcagagaatactacaaacacctctatgcaaataaa ctagaaaatctagaagaaatggatacattcctcgacacatacactctcccaagactaaac caggaagaagttgaatctctgaatagaccaataaccggctctgaaattgtggcaataatc aatagtttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccag aggtgcaaggaggaactggtaccattccttctgaaactattccaatcaatagaaaaagag ggaatcctccctaactcattttatgaggccagcatcattctgataccaaagccgggcaga gacacaaccaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatc ctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgat caagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaatcaataaatgta atccagcatataaacagagccaaagacaaaaaccacatgattatctcaatagatgcagaa aaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtatt gatgggatgtatttcaaaataataagagctatctatgacaaacccacagccaatatcata ctgaatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccct ctctcaccactcctattcaacatagtgttggaagttctggccagggcaatcaggcaggag aaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagac gacatgattgtttatctagaaaaccccatcgtctcaccccaaaatctccttaagctgata agcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattctta tacaccaacaacagacaaacagagagccaaatcatgagtgaactcccattcacaattgct tcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaag gagaactacaaaccactgctcaaggaaataaaagaggacacaaacaaatggaagaacatt ccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatt tacagattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaa actactttaaagttcatatggaaccaaaaaagagcccgcatcgccaagtcaatcctaagc caaaagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggctaca gtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaaca gagccctcagaaataatcccacatatctacaactatctgatctttgacaaacctgagaaa aacaagcaatggggaaaggattccctatttaataaatggtgctgggaaaactggctagcc atatgtagaaagctgaaactggatcccttccttacaccttatacaaaaatcaattcaaga tggattaaagatttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggc attaccattcaggacataggcgtgggcaaggacttcatgtccaaaacaccaaaagcaatg gcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcacagca aaagaaactaccatcagagtgaacaggcaacctacaacatgggagaaaattttcgcaacc tactcatctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttacaag aaaaaaacaaacaaccccatcaaaaagtgggcgaaggacatgaacagacacttctcaaaa gaagacatttatgcagccaaaaaacacatgaaaaaatgctcatcatcactggccatcaga gaaatgcaaatcaaaaccactatgagatatcatctcacaccagttagaatggcaatcatt aaaaagtcaggaaacaacaggtgctggagaggatgtggagaaataggaacacttttacac ttggtgggactgtaa >gi568815597r:93788993_94009163|GENSCAN_predicted_peptide_2|338_aa MGKKQNRKTGNSKKQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSA MEDEMNQMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDVENGTKLENTLQD IIQENFPNLARQANVQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRV TLKGKPIRLTADLSAETLQARREWGPIFNILKGKNFQPRISYPAKLSFISEGEIKYFIDK QMLRDFVTTRPALKELLKEALNMERNNRYQPLQNHAKM >gi568815597r:93788993_94009163|GENSCAN_predicted_CDS_2|1017_bp atggggaaaaaacagaacagaaaaactggaaactctaaaaagcagagcgcctctcctcct ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagaatgattttgac gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagccgatgtgatcaactggaagaaagggtatcagca atggaagatgaaatgaatcaaatgaagcgagaagggaagtttagagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagtgatgtggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatctagcaaggcaggccaacgttcagattcaggaaata cagagaacgccacaaaggtactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggtt accctcaaaggaaagcccatcagactaacagcggatctctcggcagaaaccctacaagcc agaagagagtgggggccaatattcaacattcttaaaggaaagaattttcaacccagaatt tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttatagacaag caaatgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagcg ctaaacatggaaaggaacaaccggtaccagccgctgcaaaatcatgccaaaatgtaa >gi568815597r:93788993_94009163|GENSCAN_predicted_peptide_3|157_aa MYAPPLLAATQELCSVSQGHHQPGSSCVAVTPVDAGVTAGTHTSGDPQMGPQGRNTKGQA SFSMRAAQSSFTTATKAEPKPIGGFAGSGRLHRQQKNTSIAIDPAGDAHKEVLTACIREK PDPRGKPSVAHTPWTKAELRALTKEFPDPTQDPTGFT >gi568815597r:93788993_94009163|GENSCAN_predicted_CDS_3|474_bp atgtatgcgcctcctctcctggctgccacacaggagctgtgttcagtctcacaggggcat catcagcctgggagcagctgtgtggctgtaacaccagtggatgccggtgtaacagctggt acacacaccagtggtgacccccagatgggcccacagggcagaaacactaaaggccaggcc tccttctccatgagggctgcccagagttccttcacaacagcgacaaaggcagagcccaag cccattggaggttttgcaggtagtggacggcttcaccgtcaacagaagaacacttcaatt gcaatagatcctgcaggggatgctcataaagaagttttgacagcatgcattagagaaaag cctgacccaaggggaaaaccttcagtggcccatactccctggaccaaggcagaacttaga gctcttactaaggaatttccagatcctacccaggatcctactggttttacttag >gi568815597r:93788993_94009163|GENSCAN_predicted_peptide_4|354_aa MKAIFWGPGKGPLPAVDAQGGVSLLLLSAVVSGCSSHHATLKAYSVEMTELKEGKDLGSW ESITQLTYQEAPSPGFLLGRFCRLELLPDQEKQPEQLKVKVSSERKAEVTPAGDSTISFV ERKQGKGQYDREESSVEGNKAGELSRKRLWNTRKAMGSASSRKEGTVTYLFWYYGAPQSA RAIQLKTDTAQSPRKPTGPSQMLWVTLTVEGSSIINASLIKTLLKAALLPKEAGVIHCKG HQKASDPIAQGNAYADKDPSCCSSETAGSHPFKIPSQSGSTSLPLTSPFTAFGSCGPFLS GSQGPEPLSSPGLGGPGPSSADWAVDVGPEKSGGGGGGGGEGKATLKRRLAVSA >gi568815597r:93788993_94009163|GENSCAN_predicted_CDS_4|1065_bp atgaaggcaatcttctggggccctgggaagggtcccctccctgctgtagatgcacaggga ggagtgtctctcctcctgctgagtgctgtggtatctggatgcagtagccatcatgccact ctgaaggcgtacagcgtggaaatgacagagctgaaagaggggaaggacttgggctcctgg gagtcaatcactcaattaacctaccaggaggcaccttccccaggtttcttgctggggagg ttttgcaggcttgagctcctccctgatcaagaaaagcaacctgaacagttaaaagttaag gtctcttcagagaggaaagcagaggtcaccccagcaggagatagtaccatttcttttgtg gaaaggaaacagggaaagggacaatatgacagggaggagtcatcagtggagggaaataaa gctggagagttaagtaggaagagattgtggaataccaggaaggccatgggcagtgccagc tccagaaaggaagggactgtgacctatttgttttggtattatggagcccctcagagtgcc agggccatacagctgaagactgacactgcccaatcacctcggaagcctacaggaccatca cagatgctttgggtaactcttacagtggaagggtcctccatcattaatgcctctttaata aaaactcttctcaaggctgctttacttccaaaggaagctggagtcattcactgcaagggc catcaaaaggcatcagatcccatcgctcagggcaacgcttatgctgataaggatccatct tgttgcagctctgagactgctgggtcccaccccttcaagattcctagccaatcaggctcc acttctctccctttaaccagtcctttcactgcattcggttcttgcggtcctttcttaagc ggctcgcagggtcccgagcccctcagctccccgggcctcggtggcccagggcccagctca gccgactgggcagtcgatgtaggtcctgagaagagcggcggcggcggcggcggcggcggc gaaggaaaagcgacactgaagcgaaggctcgcggtttcggcctaa >gi568815597r:93788993_94009163|GENSCAN_predicted_peptide_5|560_aa MAGYSSETKLPEERPGSNICCSPISAVLQPPLLIPRQTGSGVDLQQTPTDLQLRVLTVIK KTNKQKGHPHQNPICTSPSSKTKDRSTRQKVNKDIQELNSALHQADLIDIYRTLHPKSTE YTFFAAPHCTYSKIDHIVGSKALLSKCKRTEIITNCLLDHSAIKLELRIKKFTQNCSTTW KLNNLLLNDYWAHNEIKAEIKMFFETNENKDTTYQNFWDTFKAVCRGKFIALNAHKRKQE RSKIDTLTSQLKELEKQEQTHSKASRRQEITKITAELKEIETQKTLQKINESRSWFFEKI NKIETASKTNKKREKNPIDAIKNDKGDITTNPTEIQTTIREYYKHLYANKLENLEEMNKF LDTNTIPRLNQEEVESLNRPITGSEIEAIINSLPTKKSPGPHGFTARFYQEYKEELVPFL LKLFQSIEKEGILPNSFYEASIILTPKPGRDTTKKESFRPISLMIINAKILNKILANRIQ QHIKKLIHHDQVGFIPGMQDWFKICKSINVIQHINRTKDKNHMIISIDAEKAFDKIQQLF MLKTLSKLGIDGTYLNKSYL >gi568815597r:93788993_94009163|GENSCAN_predicted_CDS_5|1683_bp atggctgggtactcctctgagacaaaacttccagaggaacgaccaggcagcaacatctgc tgttcaccaatatctgctgttctgcagcctccgctgctgatacccaggcaaacagggtct ggagtggacctccagcaaactccaacagacctgcagctgagggtcctgactgttataaag aaaactaacaaacaaaaaggacatccacaccaaaaccccatctgtacgtccccatcatca aagaccaaagacagatcaacgagacagaaagttaacaaggatatccaggaattgaactca gctctgcaccaagcagacctaatagacatctacagaactctccaccccaaatcaacagaa tatacattcttcgcagcaccacactgcacttattccaaaatcgaccacatagttggaagt aaagcactcctcagcaaatgtaaaagaacagaaattataacaaactgtctcttagaccac agtgcaatcaaactagaactcaggattaagaaattcactcaaaactgctcaactacatgg aaattgaacaacctgctcctgaatgactactgggcacataacgaaatcaaggcagaaata aagatgttctttgaaaccaacgagaacaaagacacaacataccagaatttctgggacaca ttcaaagcagtgtgtagagggaaatttatagcactaaatgcccacaagagaaagcaggaa agatctaaaattgacaccctgacatcacaattaaaagaactagagaagcaagagcaaaca cattcaaaagctagcagaaggcaagaaataactaagatcacagcagaactgaaggaaata gagacacaaaaaacccttcaaaaaatcaatgaatccaggagctggttttttgaaaagatc aacaaaattgagaccgctagcaagactaataagaaaagagagaagaatccaatagatgca ataaaaaatgataaaggggatatcaccaccaatcccacagaaatacaaactaccatcaga gaatactataaacacctctatgcaaataaactagaaaatctagaagaaatgaataaattc ctcgacacaaacaccatcccaagactaaaccaggaagaagttgaatctctgaataggcca ataacaggctctgaaattgaggcaataattaatagcttaccaacaaaaaaaagtccagga ccacatggatttacagccagattctaccaggagtacaaggaggagctggtaccattcctt ctgaaactattccaatcaatagaaaaagagggaatcctccctaactcattttatgaggcc agcatcatcctgacaccaaagcctggcagagacacaacaaaaaaagagagttttagacca atatccctgatgatcatcaatgcaaaaatcctcaataaaatactggcaaaccgaatccag cagcacatcaagaagcttatccaccatgatcaagtgggcttcatccctgggatgcaagac tggttcaaaatatgcaaatcaataaacgtaatccagcatataaacagaaccaaagacaaa aaccacatgattatctcaatagatgcagaaaaggcctttgacaaaattcaacaactcttc atgctaaaaactctcagtaaattaggtattgatgggacgtatctcaataagagctatcta tga >gi568815597r:93788993_94009163|GENSCAN_predicted_peptide_6|924_aa MKERLPGSNDSTKQMKERMACKCVFTIHVNGYAKTLCHCGLRRPDEGLSQQKSGLPSTAL KCTLKPRPFVSDRPGLNWLGGDSREFSDLRGKRARGSSLTHFRSAREEGSRGGREGAGKM VVTRSARAKASIQAASAESSGQKRCRERSGGDFFLYVKVGQNVGLPIIFDFKQKNFNRSA SEVTSRKDEASESFAANGIQAHPESSTGSDARTTAESQTTGKQSLIPRTPKARKRKSRTT GSLPKGTEPSTDGETSEAESNYSVSEHHDTILRVTRRRQILIACSPVSSVRKKPKVTPTK ESYTEEIVSEAESHVSGISRIVLPTEKTTGARRSKAKSLTDPSQESHTEAISDAETSSSD ISFSGIATRRTRSMQRKLKAQTEKKDSKIVPGNEKQIVGTPVNSEDSDTRQTSHLQARSL SEINKPNFYNNDFDDDFSHRSSENILTVHEQANVESLKETKQNCKDLDEDANGITDEGKE INEKSSQLKNLSELQDTSLQQLVSQRHSTPQNKNAVSVHSNLNSEAVMKSLTQTFATVEV GRWNNNKKSPIKASDLTKFGDCGGSDDEEESTVISVSEDMNSEGNVDFECDTKLYTSAPN TSQGKDNSVLLVLSSDESQQSENSENEEDTLCFVENSGQRESLSGDTGSLSCDNALFVID TTPGMSADKNFYLEEEDKASEVAIEEEKEEEEDEKSEEDSSDHDENEDEFSDEEDFLNST KAKLLKLTSSSIDPGLSIKQLGGLYINFNADKLQSNKRTLTQIKEKKKNELLQKAVITPD FEKNHCVPPYSESKYQLQKKRRKERQKTAGDGWFGMKAPEMTNELKNDLKALKMRASMDP KRFYKKNDRDGFPKYFQIGTIVDNPADFYHSRIPKKQRKRTIVEELLADSEFRRYNRRKY SEIMAEKAANAAGKKFRKKKKFRN >gi568815597r:93788993_94009163|GENSCAN_predicted_CDS_6|2775_bp atgaaagaacgcctcccgggttcaaacgattctactaaacaaatgaaagaacgaatggca tgtaagtgcgtttttacaattcatgtgaacgggtacgctaagaccctctgtcactgcggg cttcgccgccccgatgaaggtctttcccagcaaaaatctggcttgccttctacagctctg aaatgcacacttaaacccaggccctttgtctctgaccgccctggcctgaactggctgggc ggagattctcgcgaattctcggatctgcgcggaaaacgcgctcgaggcagttctctgacg catttccggagcgccagggaagagggaagtcgtggtggtcgcgagggagccggaaagatg gtggttaccagatctgcacgggctaaggccagcatccaagccgcgtcggctgaaagttcc gggcaaaagcgatgcagggaacggagcggcggcgattttttcctgtacgtaaaggtgggt cagaatgttggtctacccattatatttgactttaaacagaaaaattttaatcgttctgct tctgaagttacatctaggaaggatgaggctagtgaaagttttgctgctaatgggattcaa gcgcatccagaaagtagtactggatctgatgcccgaactactgctgaatcacagaccact gggaagcaaagtttaatccctagaactcctaaagctagaaagaggaagagcagaactaca ggctcactaccaaaggggactgaaccatctacggatggagaaacctctgaggcagagtca aattattctgtgtctgagcaccatgataccattttaagggtaactaggagaaggcagatc ttaattgcatgctccccagtgtccagtgttaggaaaaagccgaaagtaactccaacaaag gagtcttacactgaagaaatagtgtctgaagcagaatctcatgtttcaggtatttctaga attgtgcttcctacagaaaaaactacaggagccagaagaagtaaggctaaatctctgaca gatccaagccaagaatctcatacagaagctatatctgatgctgagacatcaagctcagac atttcattctctggaattgcaactagaagaaccaggagtatgcagaggaaattaaaggca caaactgaaaagaaagatagtaagattgtaccaggaaatgagaaacagatcgtgggtaca cctgtgaattcagaggattcagataccagacaaacttcccatttacaagcaagatctctt tctgagataaataagccaaatttctataataatgactttgatgatgatttctcccacaga agttcagaaaatatattaacagtgcacgaacaggccaatgttgaatctcttaaagaaaca aaacagaattgtaaggatttggatgaagatgccaatggaataacagatgaggggaaagaa attaatgagaaaagttctcagctgaagaatctttctgaacttcaggacactagccttcaa cagttagtttctcagagacattcaaccccccaaaataaaaatgctgtatcagtgcactct aatctgaactctgaggctgtaatgaaatcattaactcaaacatttgcaactgtggaagta ggcagatggaataacaacaaaaagagccccataaaagcaagtgacttgacaaagtttggt gattgtggtggtagtgatgatgaagaagagtccacagttataagtgtcagtgaagacatg aacagtgaagggaatgtagattttgaatgtgataccaaactatacacgtctgcgcccaac acatctcagggtaaagataattctgtcttactagttctcagcagtgatgaaagccaacag tctgaaaacagtgagaatgaagaggatactttatgttttgttgaaaatagtggccaaagg gagtcattaagtggagacacaggaagtctgtcatgtgacaatgcattgtttgtaattgac acaactcctggaatgagtgctgataaaaatttttacttggaagaggaagacaaggcaagt gaggttgccattgaggaagaaaaagaagaggaagaggatgaaaaaagtgaagaagattca tcagaccatgacgaaaatgaagatgagtttagtgatgaagaagacttcctaaatagcaca aaggctaaacttctgaagttgacaagcagcagcatagaccctggtctgagtatcaagcag ttgggtggtttgtatattaattttaatgcagataaactacagtctaacaagagaacccta acacagatcaaggagaaaaagaaaaatgagcttctgcagaaagccgtcattacacctgat tttgaaaaaaaccactgtgttccaccatatagtgaatcaaagtatcaacttcagaaaaaa cgcagaaaagaacgacaaaaaacagcaggggatggctggtttggtatgaaagctccagaa atgacaaatgaactgaaaaatgatctcaaagcactgaagatgagagccagcatggacccg aaaagattttacaagaaaaatgatagagatggcttccccaagtacttccagattggaacc attgttgacaatccagctgatttctaccattcacgaattcccaagaagcaaaggaaaaga actattgtggaagaactgctggctgattctgaattcagaagatacaaccgaaggaagtac tcagagatcatggctgaaaaagcagcaaatgcagcaggaaaaaagttccgaaagaagaag aaatttcgcaattaa >gi568815597r:93788993_94009163|GENSCAN_predicted_peptide_7|254_aa MGTDSRAAKALLARARTLHLQTGNLLNWGRLRKKCPSTHSEELHDCIQKTLNEWSSQINP DLVREFPDVLECTVSHAVEKINPDEREEMKVSACSVLGVAQLDSVIIASPPIEDGVNLSL EHLQPYWEELENLVQSKKIVAIGTSDLDKTQLEQLYQWAQVKPNSNQVNLASCCVMPPDL TAFAKQFDIQLLTHNDPKELLSEASFQEALQESIPDIQAHEWVPLWLLRYSVIVKSRGII KSKGYILQAKRRGS >gi568815597r:93788993_94009163|GENSCAN_predicted_CDS_7|765_bp atgggcaccgacagccgcgcggccaaggcgctcctggcgcgggcccgcaccctgcacctg cagacggggaacctgctgaactggggccgcctgcggaagaagtgcccgtccacgcacagc gaggagcttcatgattgtatccaaaaaaccttgaatgaatggagttcccaaatcaaccca gatttggtcagggagtttccagatgtcttggaatgcactgtatctcatgcagtagaaaag ataaatcctgatgaaagagaagaaatgaaagtttctgcctgttcagtccttggagttgca cagctggattctgtgatcattgcttcacctcctattgaagatggagttaatctttccttg gagcatttacagccttactgggaggaattagaaaacttagttcagagcaaaaagattgtt gccataggtacctctgatctagacaaaacacagttggaacagctgtatcagtgggcacag gtaaaaccaaatagtaaccaagttaatcttgcctcctgctgtgtgatgccaccagatttg actgcatttgctaaacaatttgacatacagctgttgactcacaatgatccaaaagaactg ctttctgaagcaagtttccaagaagctcttcaggaaagcattcctgacattcaagcgcac gagtgggtgccgctgtggctactgcggtattcggtcattgtgaaaagtagaggaattatc aaatcaaaaggctacattttacaagctaaaagaaggggttcttaa >gi568815597r:93788993_94009163|GENSCAN_predicted_peptide_8|305_aa MIISIDAEKAFDKIQQPFMLKTLNKLGIRGMYLKIIRAIYDKPTANIILNGQKLEAFPLK TGTRQGCPLSTLLFHIVLEVLARAIRQEKEIKGIQIGKEEVKLSLFADDMIVYLENPIIS AQNLLKLISNFSKVSGYKFNVQKSQAFLYTNNRQTESQIMRELPFTIALKRIKYLGIHLT RDETDLFKENYKPLLNKIKEDTNKRKNIPCSWIGRNNSVKMAILPKVIYRFNAIPIKLPM TFFTEWEKTTLKFIWNQKRACIAKTILSKKNKAGGIMLPDFKPYYKATVTKTAWYKREIQ TNGTE >gi568815597r:93788993_94009163|GENSCAN_predicted_CDS_8|918_bp atgattatctcaatagatgcagaaaaggcctttgacaaaattcaacagcccttcatgcta aaaactctcaataaactcggtattcgtggaatgtatctcaaaataataagagctatttat gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa actggcacaagacagggatgccctctctcaacactcctattccacatagtgttggaagtt ctggccagggcaatcaggcaagagaaagaaataaagggtattcagataggaaaagaggaa gtcaaattgtccctgtttgcagatgacatgattgtatatttagaaaaccccatcatctca gcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaattcaat gtgcaaaaatcacaagcattcctctacaccaataacagacaaacagagagccaaatcatg agagaactgccattcacaattgctttgaaaagaataaaatacctaggaatccatcttaca agggatgagacggacctcttcaaggagaactacaaaccactgctcaacaaaataaaagag gacacaaacaaacggaagaacattccatgctcatggataggaagaaacaatagtgtgaaa atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg actttcttcacagaatgggaaaaaactactttaaagttcatatggaaccaaaaaagagcc tgcattgccaagacaatcctaagcaaaaagaacaaagctggaggcatcatgctacctgac ttcaaaccatactacaaggctacagtaaccaaaacagcatggtacaaaagagagatacag accaatggaacagaatag >gi568815597r:93788993_94009163|GENSCAN_predicted_peptide_9|112_aa MGRNQNRKAENSKNQSASSPPKDCSSSPAMEQSWMENNFDELTEIGFRRSVITNFSELKE DVQTHCKEAKNLKKRLDECLTRINSVQKHLNDLMELKTMAQELPDACTSFSS >gi568815597r:93788993_94009163|GENSCAN_predicted_CDS_9|339_bp atggggagaaaccagaacagaaaagctgagaattctaaaaaccagagtgcctcttctcct ccaaaggattgcagctcctcgccagcaatggaacaaagctggatggagaacaactttgat gagttgacagaaataggcttcagaaggtcggtaataacaaacttctccgagctaaaggag gatgttcaaacccattgcaaggaagctaaaaaccttaaaaaaagattagatgaatgccta actagaataaacagtgtacagaagcacttaaatgacttgatggagctgaaaaccatggca caagaactacctgatgcatgcacaagcttcagtagctga >gi568815597r:93788993_94009163|GENSCAN_predicted_peptide_10|391_aa MNECRGVEITTVCSEDWIQRHLPGAVVPLKGEGNVGVATEDHYQEDSFGFGLRGTGKPLK LEKSRPGAVTHTSPSDGLTQYVPDSLTAMPAAAVVIWVIRPGCGVQQQIGFLRHSNIDDI RPRTIIRAVAPVKLHGIRIKMASFTSATRLTDEQVITAPADSGIVIPRFSDNWVAGGKTH LTHKDSHKLKVKGWKKAFHANGHQKQAGVAILTSDKTNFKATAVKKDKEGHYIMVKGLDQ QENITILNIYAPNTGAPKFMKQLLIDVRNEIDSNTIIAGDFNTPLTALDRSSIQKVNKET MDLNYTLEQMDLKDIYRTLHPTTAEYTIYSTVHGTFSKIDYMTGHKTSLNKFKKSEIISS SLSDHSGIKLEIDSKRNHQNHANTWKLITCS >gi568815597r:93788993_94009163|GENSCAN_predicted_CDS_10|1176_bp atgaacgagtgcagaggagtggaaattaccacagtttgttctgaagactggatacagcgg caccttcctggagcagtggtaccccttaaaggtgaagggaacgtgggtgttgcaacagag gaccattatcaagaagactcttttgggtttggtctgagaggcactgggaaacctcttaaa cttgaaaaatcacgaccaggtgcagtgactcatacatctcccagtgatggcctgacccag tatgtgcctgacagcctgacagccatgccagctgcagcagttgtcatttgggtcatcaga cctggctgtggggtacagcagcagattgggtttttgaggcacagcaacattgatgacatt aggccacggacaattattagggctgtggctcctgtaaaacttcatggcattagaataaaa atggctagttttactagtgctacaagacttacggatgaacaggtaattactgctcctgct gatagtgggatagttatacctagattttctgataattgggtggctggtgggaagactcac ctaacacataaggactcacataaacttaaagttaaggggtggaaaaaggcatttcatgca aatggacaccaaaagcaagcaggagtagctattcttacatcggacaaaacaaactttaaa gcaacagcagttaaaaaagataaagagggacattatataatggtaaaaggccttgaccaa caggaaaatatcacaatcctaaacatatatgcacctaacactggagctcccaaatttatg aaacaattactaatagatgtaagaaatgagatagacagcaacacaataatagcaggggac ttcaatactccactgacagcactagacaggtcatcaattcagaaagtcaacaaagaaaca atggatttaaactataccttggaacaaatggacttaaaggatatatacagaacactccat ccaacaactgcagaatacacaatctattcaacagtgcatggaacattctccaagatagac tatatgacaggtcacaaaacgagcctcaataaatttaagaaaagtgaaattatatcaagc agtctgtcagaccacagtggaataaaactggaaatcgactccaaaaggaaccatcaaaac catgcaaatacatggaaattaataacctgctcctga >gi568815597r:93788993_94009163|GENSCAN_predicted_peptide_11|258_aa MAMAVGGQAGSWILWAVCMVWVMAVAVMAVAAWAPKQHVLVFAVAAAGWMGQSPTPQVVY AVGAGCGGSGKLGGPILRPPGRMHRCQWWSTGPQAVCSGTGVHVVPSQVGLSSGSLLICK MPEIEANPNRTVGFIGPKLAGGPGTSLSNRYWHALDDVIVDSCCHQQRSLEKFMGARHQS PRRRCCLQVPPESPVLGWHRWDDGVQAEAGPSGGIPGEGIAVVGNDSSMSVIAPGDFPVG QDVEVEEGGIDDPDAVFA >gi568815597r:93788993_94009163|GENSCAN_predicted_CDS_11|777_bp atggcaatggcagtgggaggccaggcaggttcatggatcctctgggcagtgtgcatggtg tgggtgatggcagtagcagtgatggcagtagctgcttgggctccaaagcaacatgtgttg gtgtttgcagtggctgcagcaggctggatgggccagtctccaacccctcaggtagtatat gcagtgggtgctggctgtggtggtagtggcaagttgggtgggcctatcctcaggccccca ggaagaatgcacagatgccagtggtggtcaacaggcccccaggcagtgtgctcgggtacg ggggtgcatgtggtaccaagccaggtgggactgtcctcaggctccctgctgatttgtaaa atgccagaaatcgaagccaacccaaacagaacagtagggtttattggcccaaaactggca gggggacccgggaccagcctctccaacaggtattggcatgcactagatgatgtcattgtg gactcatgttgtcaccagcaacgaagcttggagaagttcatgggagccagacaccaaagt ccacgtcgtaggtgctgtctccaggtgcctcctgagtctccagtccttggttggcataga tgggatgatggtgtccaggcagaggcaggtccttcaggaggaattccaggagaaggcatt gctgtggtaggaaatgacagctccatgagtgtgattgcccctggagacttcccagtggga caagatgtggaggtggaagaaggtggcattgatgaccctgatgctgtgtttgcctag >gi568815597r:93788993_94009163|GENSCAN_predicted_peptide_12|184_aa MTHTQLPEQLPDVQAQMCTSHKTCFAQTQAFLVLYFSTFHKRPQIIVAKSHSLESCCQKA KEEGGKERREETLQKKVYSVLEKLDMCTGGDAQTWYGFYLILGALTLSTPTNAHSDLLSP SKSWGAAKLRVGSESDSLLRREATGKAMSEWSWTALFCLFCSPKPKTRRCAGCESMKALG PQVH >gi568815597r:93788993_94009163|GENSCAN_predicted_CDS_12|555_bp atgacacacactcagcttcctgagcagctgcctgatgtccaggctcagatgtgcacatcc cataagacatgctttgcccagacccaagccttcttggttttatactttagtacatttcat aagagaccacagattattgtagccaaatcccacagtcttgagtcatgttgccaaaaagca aaagaggaagggggaaaggagaggagagaggaaacattgcagaagaaggtctactctgtc ttagagaaactggacatgtgcacaggaggagatgctcagacttggtatggtttttacttg atcttaggagcactgacccttagtacgccgacaaatgcccactcagacctcctttctcct tcaaagtcctggggggctgcaaagctcagagtgggctctgagagtgacagtctcctgaga agagaggccactgggaaggccatgtctgagtggtcctggactgctttgttttgcctcttt tgtagcccaaaacccaagacaaggaggtgtgctggttgtgaaagcatgaaagccctgggg ccccaggtccattga >gi568815597r:93788993_94009163|GENSCAN_predicted_peptide_13|672_aa MEAQKEAEARKWKRGPMERTVPRTLYSQPNPDPFSSSRIAEPTKEPIVDEDDDVAEERQR IITGGNKTDILRLHELTKFSVRFLRILQIYPGTSSPAVDRLCVGVRPGECFGLLGVNGAG KTTTFKMLTGDTTVTSGDATVAGKSILTNISEVHQNMGYCPQFDAIDELLTGREHLYLYA RLRGVPAEEIEKSEDSSNGCVHIVLQCLVQCLAYGQCSIGMCRMHKALGVTLLTGGINRS GTHHLLKQSLPPPLLAMVANWSIKSLGLTVYADCLAGTYSGGNKRKLSTAIALIGCPPLV LLVTAGLGRTKGLNQVLGLLGWGNRFWVGRFRNCSSLALVWTVSCVAHFERSAQCLVHTA PEKNKSRHSLDLGLRTSCVPRMSPVGPSAHSREVQPTLRPDEPTTGMDPQARRMLWNVIV SIIREGRAVVLTSHRQEIPRAGEGGMEECEALCTRLAIMVKGAFRCMGTIQHLKSKFGDG YIVTMKIKSPKDDLLPDLNPVEQFFQGNFPGSVQRERHYNMLQFQDLPAPPLPQGQPAHR GVLSHTDHTGPGKLALGHRELSKDWSRTPSVFVNFAKQQTESHDLPLHPRAAGASRQAQQ KSLDLGNKEQRSQKGTLGSWRRRSLCPYGHPNGLASVNDPTAAENKHTRSMQRIQKEVFQ KETETDLLTWNT >gi568815597r:93788993_94009163|GENSCAN_predicted_CDS_13|2019_bp atggaagcccagaaggaagcagaagcaaggaagtggaagagaggtcccatggaaaggaca gtgccaaggacactgtacagccagcccaatcctgaccccttttcttcatctaggattgcc gagcccactaaggagcccattgttgatgaagatgatgatgtggctgaagaaagacaaaga attattactggtggaaataaaactgacatcttaaggctacatgaactaaccaagttctca gtccggtttcttcgtatcttgcagatttatccaggcacctccagcccagcagtggacagg ctgtgtgtcggagttcgccctggagagtgctttggcctcctgggagtgaatggtgccggc aaaacaaccacattcaagatgctcactggggacaccacagtgacctcaggggatgccacc gtagcaggcaagagtattttaaccaatatttctgaagtccatcaaaatatgggctactgt cctcagtttgatgcaattgatgagctgctcacaggacgagaacatctttacctttatgcc cggcttcgaggtgtaccagcagaagaaatcgaaaagtctgaggatagcagcaatggctgt gttcacattgttctccagtgcctggttcagtgcctggcgtatggtcagtgctccataggt atgtgtcggatgcacaaggctttgggtgtaaccctcttgacgggtgggatcaacaggtct gggactcaccatcttctcaaacagagccttcctcctccactgctagccatggttgcaaac tggagtattaagagcctgggcctgactgtctacgccgactgcctggctggcacgtacagt gggggcaacaagcggaaactctccacagccatcgcactcattggctgcccaccgctggtg ctgctggtaactgcgggcttgggccgcaccaagggcttaaaccaagtgctgggtctcttg ggttggggaaataggttctgggtcggcagatttagaaactgcagcagtttggctttagtc tggactgtttcctgtgttgctcattttgagcgatcagcccagtgtttggttcacacagct ccggagaaaaacaagtcacggcacagccttgacttgggactgcgcacatcctgcgttccc aggatgtctcctgtggggccatcggctcacagccgggaagttcagcccactctgcggcct gatgagcccaccacagggatggacccccaggcacgccgcatgctgtggaacgtcatcgtg agcatcatcagagaagggagggctgtggtcctcacatcccacaggcaagagattcccagg gctggggaaggtggcatggaagaatgtgaggcactgtgtacccggctggccatcatggta aagggcgcctttcgatgtatgggcaccattcagcatctcaagtccaaatttggagatggc tatatcgtcacaatgaagatcaaatccccgaaggacgacctgcttcctgacctgaaccct gtggagcagttcttccaggggaacttcccaggcagtgtgcagagggagaggcactacaac atgctccagttccaggatcttccagctcctcctctcccacaaggacagcctgctcatcga ggagtactcagtcacacagaccacactggaccaggcaagttggccctggggcaccgagag ctgagcaaagactggtccagaacacccagtgtgtttgtaaattttgctaaacagcagact gaaagtcatgacctccctctgcaccctcgagctgctggagccagtcgacaagcccagcaa aaaagcttggatttggggaataaggagcagagaagccagaaaggaactctgggcagctgg aggcgcaggagcctgtgcccatatggtcatccaaatggactggccagcgtaaatgacccc actgcagcagaaaacaaacacacgaggagcatgcagcgaattcagaaagaggtctttcag aaggaaaccgaaactgacttgctcacctggaacacctga