GENSCAN 1.0    Date run: 5-Nov-116    Time: 01:19:08

Sequence gi568815595f:136828002_137048450 : 220449 bp : 41.46% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  11032  11102   71  1  2  111   55    68 0.379   3.71
 1.02 Intr +  14449  14604  156  2  0   27   81   105 0.100   2.76
 1.03 Intr +  26129  26207   79  0  1   73   82    35 0.032  -0.91
 1.04 Term +  26480  27698 1219  2  1   49   42   563 0.081  37.91
 1.05 PlyA +  28632  28637    6                               1.05

 2.02 PlyA -  28656  28651    6                               1.05
 2.01 Sngl -  34837  34424  414  1  0   69   42   292 0.777  16.75
 2.00 Prom -  36942  36903   40                              -6.75

 3.00 Prom +  43649  43688   40                              -6.65
 3.01 Sngl +  44832  45332  501  2  0   30   44   282 0.807  13.79
 3.02 PlyA +  46137  46142    6                               1.05

 4.00 Prom +  47607  47646   40                              -6.15
 4.01 Sngl +  47700  48440  741  2  0   44   42   253 0.965  12.25
 4.02 PlyA +  48445  48450    6                               1.05

 5.02 PlyA -  48661  48656    6                               1.05
 5.01 Sngl -  62112  61882  231  0  0   97   36   157 0.537   4.92
 5.00 Prom -  65134  65095   40                              -4.65

 6.02 PlyA -  66117  66112    6                               1.05
 6.01 Sngl -  72075  71431  645  0  0  103   42   528 0.800  45.62
 6.00 Prom -  86092  86053   40                              -4.35

 7.00 Prom +  91668  91707   40                              -7.15
 7.01 Init + 100001 100226  226  1  1   71   82   322 0.999  28.58
 7.02 Intr + 117582 118294  713  1  2   73  123   622 0.969  54.43
 7.03 Term + 120258 120452  195  2  0  104   43   126 0.991   6.13
 7.04 PlyA + 121100 121105    6                               1.05

 8.05 PlyA - 121220 121215    6                               1.05
 8.04 Term - 129979 129789  191  2  2   19   42   154 0.309   0.53
 8.03 Intr - 139276 136155 3122  0  2   42   60  1047 0.113  83.69
 8.02 Intr - 140470 139834  637  2  1  -61   38   362 0.551   6.52
 8.01 Init - 140816 140516  301  2  1   88   -8   319 0.897  19.76
 8.00 Prom - 141889 141850   40                              -7.65

 9.05 PlyA - 142471 142466    6                               1.05
 9.04 Term - 147290 147027  264  2  0   37   42   165 0.352   1.32
 9.03 Intr - 148014 147853  162  0  0   36   51   121 0.138   2.45
 9.02 Intr - 151849 151726  124  0  1   92   65    66 0.171   4.37
 9.01 Init - 152477 152434   44  2  2   90   80    22 0.435   1.56
 9.00 Prom - 153520 153481   40                              -3.65

10.00 Prom + 153547 153586   40                              -9.85
10.01 Init + 153590 153714  125  1  2   74   33   101 0.757   2.89
10.02 Intr + 154159 154349  191  1  2  124   80   189 0.981  20.01
10.03 Intr + 158608 158865  258  2  0   70   68   125 0.547   5.21
10.04 Intr + 158892 159016  125  1  2   28  101    89 0.757   3.58
10.05 Intr + 159544 159685  142  0  1   24   27   175 0.854   4.01
10.06 Intr + 159988 160326  339  2  0   44   10   233 0.428   5.62
10.07 Intr + 161386 161564  179  2  2   66   78   239 0.621  19.42
10.08 Intr + 163937 164087  151  1  1   70  116   107 0.859  10.51
10.09 Term + 167413 167567  155  2  2  112   32   122 0.825   6.10
10.10 PlyA + 168326 168331    6                               1.05

11.02 PlyA - 168735 168730    6                               1.05
11.01 Sngl - 175370 174426  945  2  0   86   44   167 0.967   8.39
11.00 Prom - 175853 175814   40                              -6.55

12.03 PlyA - 175952 175947    6                               1.05
12.02 Term - 186125 185934  192  2  0   80   49   217 0.958  13.44
12.01 Init - 186377 186249  129  2  0   98   34    66 0.909   2.40
12.00 Prom - 189671 189632   40                              -3.65

13.10 PlyA - 190435 190430    6                               1.05
13.09 Term - 191222 190510  713  0  2   67   34   226 0.011   7.77
13.08 Intr - 197572 197502   71  0  2   73   76    61 0.026   1.31
13.07 Intr - 197731 197597  135  0  0   64  103    29 0.366   0.76
13.06 Intr - 198430 198218  213  0  0   84  110    96 0.097   8.31
13.05 Intr - 201013 200917   97  2  1   41   81    56 0.014  -1.65
13.04 Intr - 204745 204517  229  1  1    0    7   280 0.128   7.62
13.03 Intr - 207372 207274   99  0  0  127  -46   103 0.095   0.19
13.02 Intr - 209312 209133  180  2  0   64   99    79 0.913   5.74
13.01 Init - 216461 216342  120  2  0   83   68    42 0.141   1.94

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  39348  39034  315  0  0   65   50   164 0.944   6.00
S.002 Sngl - 191628 190510 1119  0  0   49   34   261 0.882  13.27

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:136828002_137048450|GENSCAN_predicted_peptide_1|508_aa
XDGSFLPLTASPSTAEFKAADSQEGLSDNPGIPWGQYRVQGQTRLGEGFLRDLEPASPAQ
SELQPSGKDNCENNLKGGETGRNEKMEKSVCGIKIEDRAELNKKYPVKKRVKIHPNTVMV
KYTSHYPQPGDDGYEEINEGYGNFMEENPKKGLLSEMKKKGRAFFGTMDTLPPPTEDPMI
NEIGQFQSFAEKNIFQSRKMWIVLFGSALAHGCVALITRLVSDRSKVPSLELIFIRSVFQ
VLSVLVVCYYQEAPFGPSGYRLRLFFYGVCNVISITCAYTSFSIVPPSNGTTMWRATTTV
FSAILAFLLVDEKMAYVDMATVVCSILGVCLVMIPNIVDEDNSLLNAWKEAFGYTMTVMA
GLTTALSMIVYRSIKEKISMWTALFTFGWTGTIWGISTMFILQEPIIPLDGETWSYLIAI
CVCSTAAFLGVYYALDKFHPALVSTVQHLEIVVAMVLQLLVLHIFPSIYDVFGGVIIMIS
VFVLAGYKLYWRNLRKQDYQEILDSPIK

>gi568815595f:136828002_137048450|GENSCAN_predicted_CDS_1|1527_bp
nntgatggctccttcctacccttaacagcttcacccagcacagctgaatttaaagctgct
gactcacaggaaggcctttctgacaatcctgggataccctggggccagtatagagttcaa
gggcagaccagactgggggaggggttcctgagggacctggaaccagcatcaccagcacag
tcagaattgcaacctagtggcaaagataactgtgagaacaatctcaaaggtggagaaaca
ggtaggaatgaaaagatggaaaaatccgtttgtggtataaagatagaagatagagctgaa
ttaaacaaaaaatatccagttaaaaaacgggtgaaaatacatcccaacacagtgatggtg
aaatatacttctcattatccccagcctggcgatgatggatatgaagaaatcaatgaaggc
tatggaaattttatggaggaaaatccaaagaaaggtctgctgagtgaaatgaaaaaaaaa
gggagagctttctttggaaccatggataccctacctccaccaacagaagacccaatgatc
aatgagattggacaattccagagctttgcagaaaaaaacatttttcaatcccgaaaaatg
tggatagtgctgtttggatctgctttggctcatggatgtgtagctcttatcactaggctt
gtttctgatcggtctaaagttccatctctagaactgatttttatccgttctgtttttcag
gtcttatctgtgttagttgtgtgttactatcaggaggccccctttggacccagtggatac
agattacgactcttcttttatggtgtatgcaatgtcatttctatcacttgtgcttataca
tcattttcaatagttcctcccagcaatgggaccactatgtggagagccacaactacagtc
ttcagtgccattttggcttttttactcgtagatgagaaaatggcttatgttgacatggct
acagttgtttgcagcatcttaggtgtttgtcttgtcatgatcccaaacattgttgatgaa
gacaattctttgttaaatgcctggaaagaagcctttgggtacaccatgactgtgatggct
ggactgaccactgctctctcaatgatagtatacagatccatcaaggagaagatcagcatg tggactgcactgtttacttttggttggactgggacaatttggggaatatctactatgttt attcttcaagaacccatcatcccattagatggagaaacctggagttatctcattgctata tgtgtctgttctactgcagcattcttaggagtttattatgccttggacaaattccatcca gctttggttagcacagtacaacatttggagattgtggtagctatggtcttgcagcttctc gtgctgcacatatttcctagcatctatgatgtttttggaggggtaatcattatgattagt gtttttgtccttgctggctataaactttactggaggaatttaagaaagcaggactaccag gaaatactagactctcccattaaatga >gi568815595f:136828002_137048450|GENSCAN_predicted_peptide_2|137_aa MSTRSTAQADASPPLGLEVRAPFLLPRPYLPGLGLRHSWQKDFFMRKQTSQPARRTLDLP GSHGRPPGLSARHSAPGADKEAAGSRSLGQRPEGLAESRCRSLDPPSSLRAPSGSAPSLQ TAPPGLTQREKRKRRSE >gi568815595f:136828002_137048450|GENSCAN_predicted_CDS_2|414_bp atgtccacccgcagcaccgcgcaggctgacgcctcccctcctctcgggctggaggtgaga gcgcccttcctcctaccacgtccctaccttcccggcctcggattacgccacagctggcaa aaggactttttcatgagaaaacagacaagtcaaccagcgcgccgcacactcgacctccca ggctcgcacgggcggccgcccgggctctcagcccgacacagcgctccaggggctgacaag gaggccgcggggtcacgctccctggggcaacggcccgagggtctcgcggaatcccggtgc agaagtctggaccctccgagcagcctccgcgccccctccggctcggctccgagcctgcag acggcgcccccggggctaactcagcgcgagaagaggaagcgacgcagcgagtaa >gi568815595f:136828002_137048450|GENSCAN_predicted_peptide_3|166_aa MLQCGVDAASAQKSRIGVWEPPPAFQKMYGNAWMPRQKFAAGVEHSWRTSARAVQKGNVG SEPPHRVPTRVPPSGAVRRGPPSSSLQNDRSTDSLHHAPGKAADTQCKPVKAARREAVLC KTTESEVPKTMEIHFLDQHDLDVRHGVKGDHFGALRFDCPAVFWIC >gi568815595f:136828002_137048450|GENSCAN_predicted_CDS_3|501_bp atgctgcaatgtggtgttgatgctgcaagtgcacagaagtcaagaattggggtttgggaa cctccacctgcatttcagaagatgtatggaaatgcttggatgcccaggcagaagtttgca gcaggggtagagcattcgtggagaacctctgctagggcagtgcagaagggaaatgtgggg tcagagcccccacacagagttcctactcgggtaccgcctagtggagctgtgagaagaggg ccaccatcctccagcctccagaatgatagatccactgacagcttgcaccatgcgcctgga aaagctgcagacactcaatgcaagcctgtgaaagcagccaggagagaggctgtactctgc aaaaccacagagtcagaggtgcccaagaccatggaaatccacttcttggatcagcatgac ctggatgtgagacatggagtcaaaggagatcattttggagctttaagatttgactgccct gctgtattttggatttgctag 
>gi568815595f:136828002_137048450|GENSCAN_predicted_peptide_4|246_aa MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQVDLIDIYRTLHPKSTEYTFFSAPHHTYS KIDHIVGSKALLSKYKRTEIITNYLSGHSAIKLELRIKNLTQNRSTTWKLNNLLLNDYWV HNEMKAEVKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAQKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIK KKRRIK >gi568815595f:136828002_137048450|GENSCAN_predicted_CDS_4|741_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagtcaac aaggatacccaggaattgaactcagctctgcaccaagtggacctaatagacatctacaga actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatagttggaagtaaagctctcctcagcaaatataaaagaacagaaatt ataacaaactatctctcaggccacagtgcaatcaaactagaactcaggattaagaatctc actcaaaaccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaagtaaagatgttctttgaaaccaacgagaacaaagacaca acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcccaaaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atcagagcagaactgaaagaaatagagacacaaaaaacccttcaaaaaatcaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaag aaaaagagaagaatcaaatag >gi568815595f:136828002_137048450|GENSCAN_predicted_peptide_5|76_aa MPEPPRPMGCCAARASPTRTTLCSRAPNPIDHPRAQECGHTAQDWQAAPPATPVRDPLGE ASWAPESGGALENLYV >gi568815595f:136828002_137048450|GENSCAN_predicted_CDS_5|231_bp atgcctgagcctccccgccccatgggctgctgtgccgcccgagcctccccaacaaggacc accctctgctccagggcgcccaatcccattgaccacccaagggctcaggagtgcgggcac acagcgcaggactggcaggcagctccacctgcaacccctgtgcgggatccactgggtgaa gccagctgggctcctgagtctggtggggccttggagaatctttatgtctag >gi568815595f:136828002_137048450|GENSCAN_predicted_peptide_6|214_aa MVQPVRCKKPINYSQFGDSDSDDDFVSATVPLNKQSRTSKELKQDKPKPNLNNLQKEEIP LEEKTPKKKRMALDDKLYQRDLEVALALSVKELSTVTTNVQKSQDKRVEKHGNSRTETVS KSPRISNCSVASDYLDLDKITKKDNGGIQGKRKAASKAAVQQRKIFLEGSDGNSANNTKP DLATGEDSEDDSDFGESEDNDKDSSMRKSKVKEI >gi568815595f:136828002_137048450|GENSCAN_predicted_CDS_6|645_bp atggtgcagcctgtgagatgtaagaaaccaatcaattactcacagtttggcgactctgac 
agtgatgatgattttgtttctgcaactgtacctttaaacaagcaatccagaacatcaaag gagttaaaacaagataaaccaaaacctaatttgaacaatctccagaaagaagaaatccca ctagaagagaaaacccctaaaaaaaaaaggatggctttagatgataagctctaccagaga gacttagaagttgcactagctttatcagtgaaggaactttcaacagtcaccactaatgtg cagaagtctcaagataaaagagttgaaaaacatggcaatagtagaacagaaacagtgagt aagtctcctcgtatctctaattgcagtgtagccagtgattatttagatttggataagatt actaagaaagacaatggtggtattcaagggaaaagaaaagcagcatctaaagctgcggta caacagaggaaaatttttctggaaggcagtgatggcaatagtgctaataacaccaaacca gacttggcaactggtgaagattctgaggatgattctgattttggtgagagtgaggataat gacaaagactcctctatgagaaaaagtaaagttaaagaaatttaa >gi568815595f:136828002_137048450|GENSCAN_predicted_peptide_7|377_aa MAEEVVVVAKFDYVAQQEQELDIKKNERLWLLDDSKSWWRVRNSMNKTGFVPSNYVERKN SARKASIVKNLKDTLGIGKVKRKPSVPDSASPADDSFVDPGERLYDLNMPAYVKFNYMAE REDELSLIKGTKVIVMEKCSDGWWRGSYNGQVGWFPSNYVTEEGDSPLGDHVGSLSEKLA AVVNNLNTGQVLHVVQALYPFSSSNDEELNFEKGDVMDVIEKPENDPEWWKCRKINGMVG LVPKNYVTVMQNNPLTSGLEPSPPQCDYIRPSLTGKFAGNPWYYGKVTRHQAEMALNERG HEGDFLIRDSESSPNDFSVSLKAQGKNKHFKVQLKETVYCIGQRKFSTMEELVEHYKKAP IFTSEQGEKLYLVKHLS >gi568815595f:136828002_137048450|GENSCAN_predicted_CDS_7|1134_bp atggcagaagaagtggtggtagtagccaaatttgattatgtggcccaacaagaacaagag ttggacatcaagaagaatgagagattatggcttctggatgattctaagtcctggtggcga gttcgaaattccatgaataaaacaggttttgtgccttctaactatgtggaaaggaaaaac agtgctcggaaagcatctattgtgaaaaacctaaaggataccttaggcattggaaaagtg aaaagaaaacctagtgtgccagattctgcatctcctgctgatgatagttttgttgaccca ggggaacgtctctatgacctcaacatgcccgcttatgtgaaatttaactacatggctgag agagaggatgaattatcattgataaaggggacaaaggtgatcgtcatggagaaatgcagt gatgggtggtggcgtggtagctacaatggacaagttggatggttcccttcaaactatgta actgaagaaggtgacagtcctttgggtgaccatgtgggttctctgtcagagaaattagca gcagtcgtcaataacctaaatactgggcaagtgttgcatgtggtacaggctctttaccca ttcagctcatctaatgatgaagaacttaatttcgagaaaggagatgtaatggatgttatt gaaaaacctgaaaatgacccagagtggtggaaatgcaggaagatcaatggtatggttggt ctagtaccaaaaaactatgttaccgttatgcagaataatccattaacttcaggtttggaa ccatcacctccacagtgtgattacattaggccttcactcactggaaagtttgctggcaat 
ccttggtattatggcaaagtcaccaggcatcaagcagaaatggcattaaatgaaagagga catgaaggggatttcctcattcgtgatagtgaatcttcgccaaatgatttctcagtatca ctaaaagcacaagggaaaaacaagcattttaaagtccaactaaaagagactgtctactgc attgggcagcgtaaattcagcaccatggaagaacttgtagaacattacaaaaaggcacca atttttacaagtgaacaaggagaaaaattatatcttgtcaagcatttatcatga >gi568815595f:136828002_137048450|GENSCAN_predicted_peptide_8|1416_aa MGKKQNRKTGNSKTQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELQRVSAMEDEMNEMKREGKFR EKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDVENGTKLENTLQDIIQENFPNLARQANV QIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRVTLKGKPIRLTADLSA ETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFIDKQMLRDFVTTRPALKE LLKEALNMERNNRQINETESQQGYPGIELSSAQADLIEIYRTLHPKSTEYTFFSAPHHTY SKIDHIVGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQSRSTTWKLNNLLLNDYW VHNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAQKRKQERSKIDTLTSQL KELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLI KKKREKNQIDTIKNDKGDITTDNTEIQTTIREYYKHLYANKLENLEEMDTFLDTYTLPRL NQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEK EGILPNSFYEASIILIPKPGRDTTKKENLRPISLMNIDAKILNKILANRIQQHIKKLIHH DQVGFIPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKILNKLG IDGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQ EKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAF LYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKN IPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRACIAKSIL SQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPE KNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENL GITIQDIGVGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFA TYSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAI REMQIKTTMRYHLTPVRTAIIKKSGNNRCWRGCGEIGTLLHCWWDCKLVQPLWKSVWRFL RDLELEIPFDPAIPLLGIYPKDYKSCCYKDTCTQPAQEAQKGRAPKHPPFCPNPDVASAG SGLGEFQRKCNFGELENLPMSNTRTTGSVLKDSAPP >gi568815595f:136828002_137048450|GENSCAN_predicted_CDS_8|4251_bp atggggaaaaaacagaacagaaaaactggaaactctaaaacacagagcgcctctcctcct ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagaatgattttgac 
gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta caaagggtatcagcaatggaagatgaaatgaatgaaatgaagcgagaagggaagtttaga gaaaaaagaataaaaagaaatgagcaaagcctccaagaaatatgggactatgtgaaaaga ccaaatctacgtctgattggtgtacctgaaagtgatgtggagaatggaaccaagttggaa aacactctgcaggatattatccaggagaacttccccaatctagcaaggcaggccaacgtt cagattcaggaaatacagagaacgccacaaagatactcctcgagaagagcaactccaaga cacataattgtcagattcaccaaagttgaaatgaaggaaaaaatgttaagggcagccaga gagaaaggtcgggttaccctcaaaggaaagcccatcagactaacagcggatctctcggca gaaaccctacaagccagaagagagtgggggccaatattcaacattcttaaagaaaagaat tttcaacccagaatttcatatccagccaaactaagcttcataagtgaaggagaaataaaa tactttatagacaagcaaatgctgagagattttgtcaccaccaggcctgccctaaaagag ctcctgaaggaagcgctaaacatggaaaggaacaaccgacagatcaacgagacagaaagt caacaaggatacccaggaattgagctcagctctgcacaagcagacctaatagaaatctac agaactctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctat tccaaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaaagaacagaa attataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaat ctcactcaaagccgctcaactacatggaaactgaacaacctgctcctgaatgactactgg gtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagac accacataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagca ctaaatgcccagaagagaaagcaggaaagatccaaaattgacaccctaacatcacaatta aaagaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataact aaaatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaatcaatgaa tccaggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaagactaata aagaaaaaaagagaaaagaatcaaatagacacaataaaaaatgataaaggggatatcacc accgataacacagaaatacaaactaccatcagagaatactacaaacacctctacgcaaat aaactagaaaatctagaagaaatggatacattcctcgacacatacactctcccaagacta aaccaggaagaagttgaatctctgaatagaccaataacaggctccgaaattgtggcaata atcaatagtttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctac cagaggtacaaggaggaactggtaccattccttctgaaactattccaatcaatagaaaaa gagggaatcctccctaactcattttatgaggccagcatcattctgataccaaaaccaggc 
agagacacaacaaaaaaagagaatcttagacctatatccttgatgaacattgatgcaaaa atcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccat gatcaagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaatcaataaat gtaatccagcatataaacagagccaaagacaaaaaccacatgattatctcaatagatgca gaaaaagcctttgacaaaattcaacaacccttcatgctaaaaattctcaataaattaggt attgatgggacgtatttcaaaataataagagctatctatgacaaacccacagccaatatc atactgaatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgc cctctctcaccgctcctattcaacatagtgttggaagttctggccagggcaatcaggcag gagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgca gacgacatgattgtttatctagaaaaccccatcgtctcagcccaaaatctccttaagctg ataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattc ttatacaccaacaacagacaaacagagagccaaatcatgagtgaactcccattcacaatt gcttcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttc aaggagaactacaaaccactgctcaaggaaataaaagaggacacaaacaaatggaagaac attccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggta atttacagattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaa aaaactactttaaagttcatatggaaccaaaaaagagcctgcatcgccaagtcaatccta agccaaaagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggct acagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacaga acagagccctcagaaataatgccgcatatctacaactatctgatctttgacaaacctgag aaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaaaactggcta gccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaaatcaattca agatggattaaagatttaaacgttagacctaaaaccataaaaaccctagaagaaaaccta ggcattaccattcaggacataggcgtgggcaaggacttcatgtccaaaacaccaaaagca atggcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcaca gccaaagaaactaccatcagagtgaacaggcaacctacagaatgggagaaaattttcgca acctactcatctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttac aagaaaaaaacaaacaaccccatcaaaaagtgggcgaaggacatgaacagacacttctca aaagaagacatttatgcagccaaaaaacacatgaagaaatgctcatcatcactggccatc agagaaatgcaaatcaaaaccactatgagatatcatctcacaccagttagaacggcaatc attaaaaagtcaggaaacaacaggtgttggagaggatgtggagaaataggaacactttta cactgttggtgggactgtaaactagttcaaccattgtggaagtcagtgtggcgattcctc 
agggatctagaactagaaataccatttgacccagccatcccattactgggtatataccca aaggactataaatcatgctgctataaagacacatgcacacagccggcccaggaagctcag aaagggcgggctccgaagcaccctcctttctgcccaaacccagatgttgcatcagcaggg agtggcctaggagagttccagagaaaatgtaactttggggagcttgagaatctgcctatg agcaatacccggaccacaggctctgtcctaaaggactctgcacctccctaa >gi568815595f:136828002_137048450|GENSCAN_predicted_peptide_9|197_aa MATSSRIKADSLADRCQSNRFMSLNKAKWVISCQVGSTGSDLLWRESIQVGHQASSDNSR NPDNYPATATAITCAMLAVQGPKNPPPCPVHHYHCQHPSKPCGGPMISILIHLQENILPY GSKFKNGKKKLLYQMCRYQHKNTRNMKRQGHMTSPKEHNSPAIDLNQKEIFEIPEKELLN TLILKKLSGIQENLEKQ >gi568815595f:136828002_137048450|GENSCAN_predicted_CDS_9|594_bp atggccacttcatctaggatcaaggcagatagcctggctgacagatgccaaagcaatcgt ttcatgtccctcaacaaggccaaatgggtcatcagttgccaggtgggaagtacaggctca gatctgctgtggagagagtccattcaagtgggacaccaagccagctctgacaactctcgg aacccagataattatcctgccactgctactgccatcacttgtgccatgctagctgtccag ggccccaagaacccacctccttgcccagtccaccactaccactgccagcatccaagcaag ccatgtggaggcccaatgatcagcattctgatacatcttcaggaaaacatcctcccctat ggaagtaaattcaaaaatgggaagaagaaactgttataccagatgtgcagatatcaacat aagaacacaagaaacatgaaaaggcaaggacatatgacatctccaaaggaacacaattct ccagcaatagaccttaatcaaaaagaaatttttgaaatcccagagaaggaattattaaac acactgatattaaagaagctcagtgggatacaagagaatttggaaaaacaataa >gi568815595f:136828002_137048450|GENSCAN_predicted_peptide_10|554_aa MRTAGAERLRWSEQLVKGPKAKSAQHFGEMTNVARISRFGAEEYESLYTSHIWIPSSWCS LTEGPECDVTDDITATVPYNLRVRATLGSQTSAWSILKHPFNRNSSSSSTSRHKHFSDEG RDGERLCSLKTPCRNCLSSECPLVSGIGGFLISLTSRMKPQTLAVSVTALKVARLESVPS DVRMCSEFLPSGVKLQIFTVSVTALKAARLELFVPPGGLVVSLASGVKLQTFAGCRWSCV PVLRPASALLSPWVVDGTGRRGAGGGAHRGGSGRTGAHGGAEGAGSSLGQPRKGLPQCSG GLKGSSSAAKVGAQAEDAPRVSEGCEDCQHAVTSHTDPREGRFSPEHAPPLSNPRPVIRV ALLVLQYLGLKGKYNSITSNQGVETICVGPSVLGKSTLSGAGFDSPVVLPTAILTRPGME ITKDGFHLVIELEDLGPQFEFLVAYWRREPGAEEHVKMVRSGGIPVHLETMEPGAAYCVK AQTFVKAIGRYSAFSQTECVEVQGEAIPLVLALFAFVGFMLILVVVPLFVWKMGRLLQYS CCPVVVLPDTLVIE >gi568815595f:136828002_137048450|GENSCAN_predicted_CDS_10|1665_bp atgcgaacagcaggggcagagcgtttgagatggagtgaacagttagtcaaaggtcctaag 
gcaaaaagtgctcagcattttggagaaatgaccaatgtggcaagaatatcacgcttcgga gcagaggagtacgagagcctgtacacgagccacatctggatccccagcagctggtgctca ctcactgaaggtcctgagtgtgatgtcactgatgacatcacggccactgtgccatacaac cttcgtgtcagggccacattgggctcacagacctcagcctggagcatcctgaagcatccc tttaatagaaactcaagtagctcttcaacaagcaggcataaacacttctctgatgaggga cgcgatggggaaaggctttgttctctgaaaactccctgtaggaactgcctttcttctgag tgtccacttgtgtccggaattggtgggttcttgatctcactgacttcaagaatgaagccg cagaccctcgcggtgagtgttacagctcttaaggtggcgcgtctggagtctgtcccttct gatgttcggatgtgttcagagtttcttccttctggagtgaagctgcagatcttcacggtg agtgttacagctcttaaggcggcgcgtctggagttgttcgttcctcctggtgggctcgtg gtctcgctggcttcaggagtgaagctgcagaccttcgcgggctgccggtggagctgcgtg ccagtcctgcgccctgcgtccgcactcctcagcccttgggtggtcgatgggactgggcgc cgtggagcagggggtggcgctcatcggggaggctcgggccgcacaggagcccatggaggg gctgagggagccggctccagtcttggccagcccagaaaggggctcccgcagtgcagcggt gggctgaagggctcctcaagtgcggccaaagtgggagcccaggcagaggacgcgcccaga gtgagcgagggctgtgaggactgccagcatgctgtcacctctcacactgaccccagggaa ggcagatttagcccagaacatgccccacccctttccaatccaagaccagtgatcagggtg gcccttctggtcctgcagtatctgggtctgaaaggcaagtataactccatcaccagtaac cagggtgtagagactatctgtgtgggcccttctgttcttgggaaatcaaccctgtctggg gctggctttgactctcctgttgtcttgccaacagccatccttacccgacctgggatggag atcaccaaagatggcttccacctggttattgagctggaggacctggggccccagtttgag ttccttgtggcctactggaggagggagcctggtgccgaggaacatgtcaaaatggtgagg agtgggggtattccagtgcacctagaaaccatggagccaggggctgcatactgtgtgaag gcccagacattcgtgaaggccattgggaggtacagcgccttcagccagacagaatgtgtg gaggtgcaaggagaggccattcccctggtactggccctgtttgcctttgttggcttcatg ctgatccttgtggtcgtgccactgttcgtctggaaaatgggccggctgctccagtactcc tgttgccccgtggtggtcctcccagacaccttggtaatagagtag >gi568815595f:136828002_137048450|GENSCAN_predicted_peptide_11|314_aa MAILPKVIYTFNAIPIKLPITFFTELEEITLKFIWNQKRARIAKRILSQKNKAGGITLSD FKLYYKATVTKTAWYWYQNRDIDKWNRTETSDITPHIYNHLIFDEPDKNKKWGKDSLFNK WCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKAIKVLEENLGNTIQDIGMGKDF MTKTPKAMATKAKIDKWDIINLKSFCTAEETTIRVNRQPTEWEKNFAIYPSDKGLISRFY 
KELKQIYKKKKTIKKWAKDMNRHFSKGDIYAANRHMKKCSSSLVIREMQIKITMRYHLIP VRMAIIKKSGYNRC >gi568815595f:136828002_137048450|GENSCAN_predicted_CDS_11|945_bp atggccatactgcccaaagtaatttatacattcaatgccatccccatcaagctaccaata actttcttcacagaattggaagaaattactttaaaattcatatggaaccaaaaaagagcc cgcattgccaagagaatcctaagccaaaagaacaaagctggaggcatcacactatctgac ttcaaactgtactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga gatatagacaaatggaacagaacagagacctcagatataacaccacacatctacaaccat ctgatctttgatgaacctgacaaaaacaagaaatggggaaaggattccctatttaataaa tggtgctgggaaaattggctagccatatgtagaaagctgaaactggatcccttccttaca ccttatacaaaaatcaattcaagatggattaaagacttaaatgttagacctaaagccata aaagtcctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttc atgactaaaacaccaaaagcaatggcaacaaaagccaaaatagacaaatgggatataatt aacctaaagagcttctgcacagcagaagaaactaccatcagagtgaacaggcaacctaca gaatgggagaaaaattttgcaatctacccatctgacaaaggactaatatccagattctac aaagaactcaaacaaatttacaagaaaaaaaaaaccatcaaaaagtgggcaaaggatatg aacagacacttctcaaaaggagacatttatgcagccaacagacacatgaaaaaatgctca tcatcactggtcatcagagaaatgcaaatcaaaatcacaatgagataccatctcatacca gttagaatggcaatcattaaaaagtcaggatacaacagatgctag >gi568815595f:136828002_137048450|GENSCAN_predicted_peptide_12|106_aa MVERSTDRRVREAKVPEKSESRGYSVKPQTMSSGQIQTQSEEKEAQGVSLRVGTENIFST DHKVAPDARGCNGSSPPGELRSWPWDQVLQIERKPRGLQARKVTLA >gi568815595f:136828002_137048450|GENSCAN_predicted_CDS_12|321_bp atggtggaaagatcaacagacaggagagtgcgagaggccaaggtcccagaaaagagcgag agccgtgggtacagcgtgaagccccaaaccatgagctcgggacagatacagacacagagc gaggagaaggaagcccagggagtttcattgcgagtgggaacagaaaacattttcagcact gaccacaaggtggcgccagacgcccgtggatgtaatgggtcttcccctcctggtgagctc cgaagctggccatgggatcaagtcctccagattgaaaggaaaccaagaggtctgcaggcc cggaaagttactctagcctaa >gi568815595f:136828002_137048450|GENSCAN_predicted_peptide_13|618_aa MMTISCRISRPIGCAAALPWNPGDFLADLMGPLSGGRGAQVTLVQPDCRASGANLHVSFC CAKWKPYPCYVTYVGEKTPLATGPGTGLHVGPGATLSTVQLLEPGGTLSGITQTVPSGLR FAPDPERSLSYPNESFPKKVPCTTDICYFAGEDAKAPLGPNQINRSLQLTNQDLHCHLVA SPGTTAGSGGLCQSRVQRRQTMPLTLLSPRKGWNGADDLLAPSLGSHADSLVCVPATKEE 
RRGLTVDTGLLDKNLRCKCWYHIDQTLPSALKGSEDSGPQWCCSQAPALSYGYTTNPAWL ELPKPGSSNSQFRMLSWSPLHQLLWRSVHFSGQDEKGWCWAHSRPGLWGVRKACSMDRQS EPSAPPQQSGPSSLFTGTENKVLKVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLE NPIVSAQNLLKLISNFSKVSGYKINVQKSQSFLYTNNRQTGSQIMSELPFTIASKRIKYL GIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIMKMAILPKVIYRFNATP IKLPTTFFTELEKTTLKFIWNQKRARIAKSILSQKNKARIKLEASLFKLYYKATVTETAW YWYQNRDIDQGTEQSPQK >gi568815595f:136828002_137048450|GENSCAN_predicted_CDS_13|1857_bp atgatgactatctcctgcaggatctccaggcccataggttgtgcagcagccttgccctgg aaccctggggacttcctggcagatctgatggggcctctgtctggaggacggggtgcacag gtgacattagtacagccagactgccgagcatcaggtgccaatttacatgttagcttctgt tgtgctaagtggaaaccctacccttgttatgtcacttatgtaggtgagaaaactcccctg gccacagggcctggcacgggactgcatgtgggcccaggggccactttatctactgtgcag ctgctagagcctggaggcacattgagtggtatcacacagacagtgccatcaggcctgcgt tttgccccagacccagaaagaagcctctcctaccctaacgagtccttcccgaagaaagtg ccctgcacgacggacatttgttattttgccggagaggacgctaaggctccgctgggaccc aaccaaatcaaccgtagcctgcagttgaccaaccaagacctacattgtcacctagtggcg tctccgggaactacagctggctccggagggctctgccaaagccgcgttcagagacgccaa acaatgccactgacgctgctgtctccaaggaaagggtggaatggagcagatgacctctta gccccaagcttagggtcccatgcagacagcttggtttgtgtccctgccactaaggaagaa agaagaggcttgactgtggacacaggacttttggataagaatctcaggtgtaaatgctgg tatcacatagaccagactctcccttctgccctaaagggctcagaagactctgggcctcag tggtgctgctcacaagctcctgctctctcctatggctacaccaccaacccagcctggctt gagcttcccaagcctggaagctccaacagccaattcagaatgttgagctggtccccacta catcagctcttatggagaagtgtccatttttctgggcaggatgagaagggctggtgctgg gcccacagcagaccagggctgtggggagtaaggaaggcctgcagcatggacagacagtca gagcccagtgcccctccacaacaatctggtccaagctccttgttcactgggacagaaaat aaggtgttgaaagttctggccagggcaatcaggcaggagaaagaaataaagggtattcaa ttaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatctagaa aaccccattgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctca ggatacaaaatcaatgtgcaaaaatcacagtcattcttatacaccaataacagacaaaca gggagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaataccta ggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctc 
aatgaaataaaagaggatacaaacaaatggaagaacattccatgctcatggataggaaga atcaatatcatgaaaatggccatactacccaaggtaatttatagattcaatgccaccccc atcaagctaccaacgactttcttcacagaactggaaaaaactactttaaagttcatatgg aatcaaaaaagagcccgcatcgccaagtcaatcctaagccaaaagaacaaagcaagaata aagctggaggcatcactattcaaactatactacaaggctacagtaaccgaaacagcatgg tactggtaccaaaacagagatatagaccaaggaacagaacagagccctcagaaataa