GENSCAN 1.0 Date run: 4-Nov-116 Time: 21:46:19

Sequence gi568815576f:21533182_21737000 : 203819 bp : 51.83% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Sngl +   1055   1393  339  1  0   85   47   149 0.678   5.35
 1.02 PlyA +   2419   2424    6                               1.05

 2.02 PlyA -   8540   8535    6                               1.05
 2.01 Sngl -  17795  12876 4920  2  0  106   53  4839 0.985 468.83
 2.00 Prom -  19278  19239   40                              -3.11

 3.00 Prom +  22071  22110   40                              -4.61
 3.01 Init +  26710  26798   89  0  2   38   29   121 0.095  -0.72
 3.02 Intr +  34494  34590   97  0  1   98   93    44 0.008   6.41
 3.03 Intr +  39846  39915   70  2  1   57   53    79 0.009   0.55
 3.04 Intr +  56763  56842   80  1  2   83  100    -4 0.000   0.07
 3.05 Intr +  59680  59819  140  0  2   79   59   118 0.112   7.77
 3.06 Intr +  61184  61268   85  2  1   65  105    14 0.043   1.12
 3.07 Intr +  77676  77862  187  2  1  118  116   145 0.989  20.08
 3.08 Term +  88334  88488  155  0  2  105   38   159 0.992  11.00
 3.09 PlyA +  89729  89734    6                               1.05

 4.06 PlyA -  90256  90251    6                               1.05
 4.05 Term -  95606  95237  370  1  1  133   46   416 0.775  36.38
 4.04 Intr -  96006  95829  178  1  1  108   77   286 0.999  28.99
 4.03 Intr -  96226  96127  100  1  1  111   96   133 0.999  16.58
 4.02 Intr -  96580  96421  160  0  1   65   80   155 0.992  12.90
 4.01 Init -  96833  96670  164  2  2   90   94   279 0.990  27.27
 4.00 Prom -  97628  97589   40                              -8.58

 5.00 Prom +  97873  97912   40                              -2.61
 5.01 Init +  99462  99516   55  2  1   39  103    90 0.506   7.00
 5.02 Intr +  99939 100072  134  1  2   60  114    87 0.556   9.37
 5.03 Intr + 100841 101389  549  1  0   77   64   487 0.908  38.87
 5.04 Intr + 101504 102394  891  1  0   78   45   429 0.418  29.79
 5.05 Term + 103251 103889  639  2  0   80   43  1004 0.945  89.73
 5.06 PlyA + 104106 104111    6                               1.05

 6.00 Prom + 107820 107859   40                              -5.71
 6.01 Init + 109156 109342  187  0  1   71   90   441 0.999  39.80
 6.02 Intr + 109681 109877  197  2  2   49   99   233 0.994  20.15
 6.03 Term + 110713 110994  282  0  0   84   48   428 0.927  34.17
 6.04 PlyA + 113667 113672    6                               1.05

 7.00 Prom + 117566 117605   40                              -3.81
 7.01 Init + 124111 124311  201  0  0   53   46   176 0.818   6.90
 7.02 Intr + 124600 124791  192  0  0   83   32   268 0.832  20.91
 7.03 Intr + 125085 125248  164  2  2  122  -11   194 0.231  12.19
 7.04 Intr + 127656 127840  185  0  2   51   58    82 0.157   1.45
 7.05 Intr + 136732 136781   50  2  2   92  110    20 0.203   3.49
 7.06 Intr + 137829 137878   50  2  2   95   74    88 0.443   6.07
 7.07 Intr + 139149 139200   52  0  1  121   92    74 0.997  10.50
 7.08 Intr + 141883 141934   52  0  1   77   96    35 0.862   2.17
 7.09 Intr + 148118 148209   92  0  2  111   85   192 0.973  21.41
 7.10 Intr + 149256 149345   90  2  0   60  109   200 0.999  19.99
 7.11 Intr + 150001 150076   76  0  1   97   91    41 0.879   4.88
 7.12 Intr + 151572 151732  161  1  2   72   68   278 0.916  24.42
 7.13 Intr + 153302 153377   76  1  1   86   68    37 0.862   1.08
 7.14 Intr + 153711 153904  194  1  2   19   84   410 0.913  33.43
 7.15 Intr + 153922 154008   87  0  0   70   50    57 0.612   0.76
 7.16 Intr + 154462 154551   90  0  0   30   89   145 0.954   9.49
 7.17 Intr + 154892 154925   34  1  1  105   77    10 0.965  -0.12
 7.18 Intr + 155551 155668  118  2  1  116   68   189 0.867  19.73
 7.19 Intr + 160635 160691   57  0  0   86  103    86 0.985   8.39
 7.20 Intr + 161412 161484   73  0  1   85   66   105 0.994   7.90
 7.21 Intr + 161574 161636   63  2  0   66   94   170 0.999  14.81
 7.22 Intr + 161756 161889  134  1  2   64   94   186 0.962  16.75
 7.23 Intr + 162029 162108   80  2  2   71   75    28 0.682  -0.51
 7.24 Intr + 162213 162335  123  1  0   91   36    39 0.345   0.06
 7.25 Intr + 162850 162948   99  2  0   92   47    57 0.287   2.58
 7.26 Intr + 163919 164032  114  0  0   94   37    83 0.581   4.72
 7.27 Intr + 165329 165430  102  0  0   75   56    48 0.324   0.95
 7.28 Term + 166615 166730  116  2  2   50   49    58 0.292  -2.86
 7.29 PlyA + 166810 166815    6                               1.05

 8.05 PlyA - 167300 167295    6                              -0.45
 8.04 Term - 168037 167948   90  1  0   89   35    86 0.860   1.42
 8.03 Intr - 170297 170189  109  1  1  110   83   169 0.997  19.39
 8.02 Intr - 170797 170658  140  1  2   42   67    67 0.521  -0.23
 8.01 Init - 176832 176710  123  0  0   86   45    82 0.346   3.83
 8.00 Prom - 179606 179567   40                              -4.21

 9.00 Prom + 180074 180113   40                              -1.91
 9.01 Sngl + 180634 180849  216  0  0   43   50   155 0.323   2.67
 9.02 PlyA + 182378 182383    6                               1.05

10.03 PlyA - 182408 182403    6                               1.05
10.02 Term - 193326 193201  126  0  0  115   54    38 0.691   1.69
10.01 Init - 202472 202434   39  2  0   59   78   102 0.169   4.44

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  52559  52434  126  2  0  125   42    56 0.839   3.29
S.002 Init + 132919 132950   32  0  2  102  101    40 0.871   5.85

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576f:21533182_21737000|GENSCAN_predicted_peptide_1|112_aa MTPPFSSAVWVFLGVLTPGQVFCTPDLLKTKPVLPQSGNMRLREALQLSVSRLAFGSSDW VLEPSGSVALPLQPYASLKLPMASEVEDLGEQGCQRGEEGDLSVGGCQNDRA >gi568815576f:21533182_21737000|GENSCAN_predicted_CDS_1|339_bp atgacgccccccttctcatcagctgtctgggtctttctcggggtcctgacaccaggccaa gtgttctgcacacctgacctgctcaaaacaaaaccagttctgccacagtctgggaacatg aggctcagagaggccttgcagctgtctgtgagcaggctggcatttggatccagtgactgg gttctggagcccagtggctctgtggccctgcctctccagccttatgcttctttaaaactg cccatggcctctgaagttgaggatttaggggagcagggctgccagaggggtgaagaagga gatctttctgtgggtgggtgtcagaatgaccgggcctga >gi568815576f:21533182_21737000|GENSCAN_predicted_peptide_2|1639_aa MAKDSPSPLGASPKKPGCSSPAAAVLENQRRELEKLRAELEAERAGWRAERRRFAARERQ LREEAERERRQLADRLRSKWEAQRSRELRQLQEEMQREREAEIRQLLRWKEAEQRQLQQL LHRERDGVVRQARELQRQLAEELVNRGHCSRPGASEVSAAQCRCRLQEVLAQLRWQTDGE QAARIRYLQAALEVERQLFLKYILAHFRGHPALSGSPDPQAVHSLEEPLPQTSSGSCHAP KPACQLGSLDSLSAEVGVRSRSLGLVSSACSSSPDGLLSTHASSLDCFAPACSRSLDSTR SLPKASKSEERPSSPDTSTPGSRRLSPPPSPLPPPPPPSAHRKLSNPRGGEGSESQPCEV LTPSPPGLGHHELIKLNWLLAKALWVLARRCYTLQEENKQLRRAGCPYQADEKVKRLKVK RAELTGLARRLADRARELQETNLRAVSAPIPGESCAGLELCQVFARQRARDLSEQASAPL AKDKQIEELRQECHLLQARVASGPCSDLHTGRGGPCTQWLNVRDLDRLQRESQREVLRLQ RQLMLQQGNGGAWPEAGGQSATCEEVRRQMLALERELDQRRRECQELGTQAAPARRRGEE AETQLQAALLKNAWLAEENGRLQAKTDWVRKVEAENSEVRGHLGRACQERDASGLIAEQL LQQAARGQDRQQQLQRDPQKALCDLHPSWKEIQALQCRPGHPPEQPWETSQMPESQVKGS RRPKFHARPEDYAVSQPNRDIQEKREASLEESPVALGESASVPQVSETVPASQPLSKKTS SQSNSSSEGSMWATVPSSPTLDRDTASEVDDLEPDSVSLALEMGGSAAPAAPKLKIFMAQ
YNYNPFEGPNDHPEGELPLTAGDYIYIFGDMDEDGFYEGELDDGRRGLVPSNFVEQIPDS YIPGCLPAKSPDLGPSQLPAGQDEALEEDSLLSGKAQGMVDRGLCQMVRVGSKTEVATEI LDTKTEACQLGLLQSMGKQGLSRPLLGTKGVLRMAPMQLHLQNVTATSANITWVYSSHRH PHVVYLDDREHALTPAGVSCYTFQGLCPGTHYRVRVEVRLPWDLLQVYWGTMSSTVTFDT LLAGPPYPPLDVLVERHASPGVLVVSWLPVTIDSAGSSNGVQVTGYAVYADGLKVCEVAD ATAGSTVLEFSQLQVPLTWQKVSVRTMSLCGESLDSVPAQIPEDFFMCHRWPETPPFSYT CGDPSTYRVTFPVCPQKLSLAPPSAKASPHNPGSCGEPQAKFLEAFFEEPPRRQSPVSNL GSEGECPSSGAGSQAQELAEAWEGCRKDLLFQKSPQNHRPPSVSDQPGEKENCYQHMGTS KSPAPGFIHLRTECGPRKEPCQEKAALERVLRQKQDAQGFTPPQLGASQQYASDFHNVLK EEQEALCLDLRGTERREERREPEPHSRQGQALGVKRGCQLHEPSSALCPAPSAKVIKMPR GGPQQLGTGANTPARVFVALSDYNPLVMSANLKAAEEELVFQKRQLLRVWGSQDTHDFYL SECNRQVGNIPGRLVAEMEVGTEQTDRRWRSPAQGHLPSVAHLEDFQGLIIPQGSSLVLQ GNSKRLPLWTPKIMIAALDYDPGDGQMGGQGKGRLALRAGDVVMVYGPMDDQGFYYGELG GHRGLVPAHLLDHMSLHGH >gi568815576f:21533182_21737000|GENSCAN_predicted_CDS_2|4920_bp atggccaaggactcgcccagccccttgggcgcgtcgcccaagaagccgggctgctccagc ccggcggcggcagtgctggagaaccagaggcgggagctggagaagctacgggcggagctg gaggcggagcgggcaggctggcgggcggaacggcggcgcttcgctgcccgggagcgccag ctgcgtgaggaggccgagcgggagcggcggcagctggctgaccgtctgcgctccaagtgg gaggcacagcgcagccgggagttgcggcagctgcaagaggagatgcagcgggaacgcgag gccgagatccggcagctgctgcgctggaaggaggccgagcagcggcagctgcagcagctg ctgcaccgggagcgcgatggcgtggtgcgccaagcccgggagctgcagcgccagctggcc gaggagctggtgaaccgcggccactgtagccgcccgggggcgtccgaggtttccgcggcg cagtgccgctgtcgcctgcaggaagtgttggcgcagcttcgctggcagactgacggcgag caggcggcgcgcatccgctatctgcaggcggcgctggaggtggagcgccagctcttcctc aagtacatcctggcgcacttccgcgggcacccggctttgtcgggatcaccggacccccaa gctgtgcattccttggaagaaccgctgccccagacctccagcggctcttgccacgccccc aaacccgcctgccaactcggatctctagacagcctgagtgctgaagtcggtgtgcgctcc cgctcgctaggcctggtgtcctctgcgtgctccagctccccagacggcctgctctccacg cacgccagctcccttgattgcttcgcacctgcgtgttcccgctcgcttgacagcacccgg agcctccccaaggcctccaaatccgaggagcggccctcctcaccagacacctccacccct ggctcccggaggctctcgccgccaccatcgccactcccgccgccaccaccaccgtcagcc cacaggaaactcagcaacccgcggggaggagaaggctctgagagccagccctgcgaagtc 
ctgactccctcacccccgggcctgggccaccacgagctgataaagctgaactggctgctg gccaaggcgttgtgggtgctggcgcgccgctgttataccctgcaagaggagaacaagcag ctgcggcgtgcaggctgcccctaccaggcagacgagaaggtgaagcggctcaaggtaaag cgcgcggagctgaccgggctcgcgcggcgcctagctgaccgcgcccgcgagctgcaggag accaacctccgggccgtgagcgcgcctatacccggcgagagttgcgccggcctggagctg tgccaagtctttgcccgccagcgcgctcgggacctgtcggagcaggcgagcgcgccgctg gccaaggacaagcagatcgaagagctgcggcaggagtgccacctcctgcaggcgcgtgtc gcctcgggtccctgcagcgacctgcatactggaaggggcggcccctgcacccagtggctc aacgtcagagacttagaccgcctgcagcgcgagtcccagcgggaagtgctgcgcctgcag aggcagttgatgcttcagcagggcaacggtggcgcttggcccgaggcgggcggccagagc gcaacctgcgaggaggtgcgacggcagatgctggcgctggagcgcgagctggaccagcgg cggcgcgagtgccaggagctgggcacgcaggcggccccggcgcggcgacgtggcgaggag gccgagacacagctgcaggcggcgctgctcaaaaacgcctggctggcggaggagaatggg cggctgcaggccaagaccgactgggtgcggaaggtggaggctgagaatagcgaagtgcgc ggccacctgggccgcgcgtgtcaagagcgcgatgcctccggcttgatcgccgaacagctg ctgcagcaggcggcgcgcgggcaggacaggcagcagcagctgcaacgcgacccgcagaag gccctgtgtgacctccatccttcctggaaggagatacaggcgctccagtgtcggcctggt caccctcctgaacagccctgggagaccagtcaaatgccggagtcccaagttaaaggtagc agaaggcccaagttccacgcacggcctgaagactacgcagtgtcacagcccaacagagac atacaggagaaaagggaagcctccctcgaggagagcccagttgcccttggggagtcagcc agtgtcccccaagtttcagagacagtccctgccagccaacctctgtccaagaaaaccagc tcccagtcaaactcctcctctgaggggtcgatgtgggccaccgtgccgtcctcccctact ctggacagggacacagccagtgaggtggatgacctggagcctgacagcgtgtccctggcc ctggaaatggggggctcggcggctcctgctgcccccaagctcaagatcttcatggctcag tataactacaacccatttgaggggcccaatgatcaccctgagggtgagctgcccctcaca gctggggactacatatatatcttcggggacatggatgaggatggcttctatgagggggag cttgacgatggccggcgggggctggtgccctccaacttcgtggagcagattccggacagc tacatcccaggctgcctgcctgccaaatcccctgatcttggccccagtcaactcccagcg gggcaggatgaagctctggaggaagacagcttattatctgggaaagcccagggaatggtg gacagagggctgtgccagatggtcagggtgggctccaagacagaagtagcaacagagatc ctggataccaagacggaagcctgccagctgggcttgctgcagagcatggggaagcagggc ctctccagaccccttctggggaccaaaggggtgctccgtatggctcccatgcagctacac 
ctgcagaatgtcacagccacatcagccaacatcacctgggtctacagcagccaccgccac ccccatgtggtatatcttgatgaccgagagcatgccctgaccccagcgggcgtgagctgc tacaccttccagggcctgtgccccggcacgcactaccgggtgcgggtggaggtgcggctg ccatgggacttgctgcaggtgtattggggaactatgtcctccaccgtcaccttcgacaca ctcttggcaggacctccctacccaccgctggatgtgctggtggagcgccatgcctcgcca ggtgtcctggtggtcagctggctccctgtgaccattgactcagctgggtcctccaatgga gtccaggtcaccggttatgctgtgtatgcagatgggcttaaggtttgtgaggtcgccgat gccactgctgggagcaccgtattggaattctcccagctacaggtgcccctcacgtggcag aaggtctcagtgagaaccatgtcactctgtggtgagtccctggattcagtgcctgctcag atccccgaggacttcttcatgtgtcaccgatggccagagactccaccctttagctacact tgtggcgacccatccacctacagagtcaccttccccgtctgcccccagaagctgtcactg gctcctccgagtgccaaggccagcccccacaaccctggaagctgcggggagccccaggcc aagttcctagaagcattctttgaagaacccccaaggaggcaatccccagtgtccaacctg ggctcagaaggagaatgtccgagttcaggggctggcagccaagcccaggagcttgcagag gcctgggagggctgtagaaaggacctgctctttcagaagagtccccagaaccacaggcca ccttcagtcagtgaccagcctggggagaaggaaaattgctaccagcacatgggcaccagc aaaagccctgctccaggattcatccatctacgcaccgagtgtgggcccaggaaagaaccg tgtcaggaaaaggctgcccttgagagggtacttcggcaaaagcaagatgcccaagggttc acacctccccagctgggcgccagccaacagtatgcatctgacttccataacgttttgaag gaggagcaggaggcactgtgcttggatctgcggggcacagagaggcgagaggagaggagg gagcctgagccccacagcaggcaaggacaagctctgggggtgaagagagggtgccagctc catgagcccagctcggcactgtgtccagctccatccgccaaagtcatcaagatgcccagg ggtggcccccaacagctggggacgggggccaacactccagccagggtctttgtggccctc tctgattacaaccccctggtgatgtctgccaacctcaaggctgcagaggaggagctggtc ttccagaaaaggcagttgctaagagtgtggggctctcaggacacccatgatttctacctc agcgagtgcaacaggcaagtgggcaatatccccgggcgcctagtggctgagatggaggtg gggacagagcagactgataggaggtggcgttctccggcccaagggcacctgccttctgtg gcccacctcgaggactttcaggggctcatcatcccccagggttcctccctggtgctccag gggaactccaagagactcccactgtggactccaaagatcatgatagcagctctggactat gatcctggggatgggcaaatggggggccaggggaagggcaggctggcgctgagggcagga gacgtggtcatggtttacgggcccatggatgaccaaggattctattatggagagttgggc ggccacaggggcctggttcctgcccacctgctggatcacatgtccctccatggacactga 
>gi568815576f:21533182_21737000|GENSCAN_predicted_peptide_3|300_aa MGLDTWRGTVLLFLGPAACLAGPHLPGPGGKCGGSSRPAGRDAFWGRSSTKSKMAASRRL MKFRPSLFVVDTCIELLNGSSKLLTGVWLTPMAVFLLSLSVDPLDLLDTKAPELEEIRKC GMKNFRNIQVDEANLLTWQGLIVPVSIEHFHFLPDYSLRCFLPDAGYKVSMEHASSCRCF LLDAGYKDNPPYDKGAFRIEINFPAEYPFKPPKITFKTKIYHPNIDEKGQVCLPVISAEN WKPATKTDQVIQSLIALVNDPQPEHPLRADLAEEYSKDRKKFCKNAEEFTKKYGEKRPVD >gi568815576f:21533182_21737000|GENSCAN_predicted_CDS_3|903_bp atggggctggatacctggagagggaccgtgctgctcttcctgggccctgcagcctgcctg gctggtccacatcttcctgggccaggagggaagtgcgggggctccagccgcccggccggc cgcgatgcattctggggaaggagcagcaccaaatccaagatggcggccagcaggaggctg atgaagttcaggccatccctgtttgtggtagacacctgcatcgagctccttaatggttca tctaaactccttactggtgtgtggctcactcccatggcagtatttcttttatctttgtct gtggatcccctggatctccttgataccaaagcaccagagcttgaagaaatccgcaaatgt gggatgaaaaacttccgtaacatccaggttgatgaagctaatttattgacttggcaaggg cttattgttcctgtgagtattgaacacttccacttcctaccagattattctttaagatgc tttcttcctgatgctggttataaggtgagcatggagcatgcttcctcctgtagatgcttt cttcttgatgctggttataaggacaaccctccatatgataagggagccttcagaatcgaa atcaactttccagcagagtacccattcaaaccaccgaagatcacatttaaaacaaagatc tatcacccaaacatcgacgaaaaggggcaggtctgtctgccagtaattagtgccgaaaac tggaagccagcaaccaaaaccgaccaagtaatccagtccctcatagcactggtgaatgac ccccagcctgagcacccgcttcgggctgacctagctgaagaatactctaaggaccgtaaa aaattctgtaagaatgctgaagagtttacaaagaaatatggggaaaagcgacctgtggac taa >gi568815576f:21533182_21737000|GENSCAN_predicted_peptide_4|323_aa MSRPRMRLVVTADDFGYCPRRDEGIVEAFLAGAVTSVSLLVNGAATESAAELARRHSIPT GLHANLSEGRPVGPARRGASSLLGPEGFFLGKMGFREAVAAGDVDLPQVREELEAQLSCF RELLGRAPTHADGHQHVHVLPGVCQVFAEALQAYGVRFTRLPLERGVGGCTWLEAPARAF ACAVERDARAAVGPFSRHGLRWTDAFVGLSTCGRHMSAHRVSGALARVLEGTLAGHTLTA ELMAHPGYPSVPPTGGCGEGPDAFSCSWERLHELRVLTAPTLRAQLAQDGVQLCALDDLD SKRPGEEVPCEPTLEPFLEPSLL >gi568815576f:21533182_21737000|GENSCAN_predicted_CDS_4|972_bp atgtcccgccctcgcatgcgcctggtggtcaccgcggacgactttggttactgcccgcga cgcgatgagggtatcgtggaggcctttctggccggggctgtgaccagcgtgtccctgctg gtcaacggtgcggccacggagagcgcggcggagctggcccgcaggcacagcatccccacg 
ggcctccacgccaacctgtccgagggccgccccgtgggtccggcccgccgtggcgcctca tcgctgctcggcccggaaggcttcttccttggcaagatgggattccgggaggcggtggcg gccggagacgtggatttgcctcaggtgcgggaggagctcgaggcccaactaagctgcttc cgggagctgctgggcagggcccccacgcacgcggacgggcaccagcacgtgcacgtgctc ccaggcgtgtgccaggtgttcgccgaggcgctgcaggcctatggggtgcgctttacgcga ctgccgctggagcgcggtgtgggtggctgcacttggctggaggcccccgcgcgtgccttc gcctgcgccgtggagcgcgacgcccgggccgccgtgggccccttctcccgccacggcctg cggtggacagacgccttcgtgggcctgagcacttgcggccggcacatgtccgctcaccgc gtgtccggggccctggcgcgggtcctggaaggtaccctagcgggccacaccctgacagcc gagctgatggcgcaccccggctaccccagtgtgcctcccaccggcggctgcggtgaaggc cccgacgctttctcttgctcttgggagcggctgcatgagctgcgcgtcctcaccgcgccc acgctgcgggcccagcttgcccaggatggcgtgcagctttgcgccctcgacgacctggac tccaagaggccaggggaggaggtcccctgtgagcccactctggaacccttcctggaaccc tccctactctga >gi568815576f:21533182_21737000|GENSCAN_predicted_peptide_5|755_aa MWAPPPTIVSEREVDELLERNAQLKREVGSRPGHLPGDHMARCRHHSGYLADDEASHSMC SARVQLPKKPLVPEMRPACKPGRVPHPPSTCGSSALQGQRRNKRHPQPFGHFLDFLTESQ VLDSLETVVEKATERMAAMKTEAGVPLVEVQDPVEVPSGGRRAHARPSLSTVHRHRVRPT LCTGHPNNYPSSSSSMSNCHSSLMAGCLGSHSRDSDLGAQGSLPPVRDKLLLEKNLKRLL QLEREGKGLSQSCSQRDSLLWDSLGSQTSFQWTQEQPLSWFSGLLGSSSGVPEASEPRPG EQEPIFRKREFNKEIKSLLSQLESLDLPGYCPLREPHRTLNFLADHRLFPALQSVVSQAV DKLRGAHCRDGRPLFPTSLEPTSDLPPLGSEPAKPTNGGQPYASPRPTVSSPKMLQRKRK DRGGSPSMSSAQVATRFKLKVTPTEKPNVPSPSLHSREEAPDSDPKLQNPPVSLSSSQRA QPWQGLHLTLPTPGIVVEVACSQGHLRGPVTPPLSSPYPRSSCYLLPELSPVASSSPASL CPESPCSSSRFTKKKPLPSISSKSSMSHFSNRLYEELADFLTQQAASLVIRKYEFEKDLS KQLGFFSFPITHVLRDLSLGLKKVKGSRIHLSSETHRSCLLRKLEESKRARQASRLSTSH CSTETPSVQQEPATHTAQDQATEPCRSLYTNLPASRQLSPLEPKLYMSACTGMGSSPPKS KDMDNEGRDKAEIEDEDEDEFKDEDQDEDKDEDGV >gi568815576f:21533182_21737000|GENSCAN_predicted_CDS_5|2268_bp atgtgggccccgcccccaaccattgtgtcggagagggaagtggacgagctcctcgagagg aatgcgcagctgaagagagaggtgggcagcaggcccggtcacctgccaggtgaccacatg gccaggtgccgccaccactcgggttacctggccgatgacgaggccagccactccatgtgc agtgcacgggtgcagctgcccaagaagccactggtcccagaaatgcggccagcctgcaag 
ccgggccgtgtgccacacccaccatccacatgtggcagctcagcactccagggccaacgc cgaaacaagaggcaccctcagccctttggccactttctggatttcctaactgagagccag gtcctggacagcctggagacagtggtggagaaggcgactgagcgcatggctgccatgaag acggaggctggggtgccgcttgtggaggtgcaggacccagtggaggtgccaagtggtgga cggcgggcacatgcccggcccagcctcagcaccgtacaccggcaccgtgtacggccgacc ctctgcactggacaccccaacaactacccatccagctccagctccatgtccaactgccat agcagcctcatggccggctgtctgggctcccacagccgggacagtgacctaggtgcccaa ggctcattgccacctgtgagggacaaactcctgctggagaagaacctcaagcggctgcta cagctggagagggaagggaaaggcctcagtcagtcctgctcccagagggactccctgctg tgggattcgctgggtagccagaccagctttcagtggacacaggagcagcccttgtcctgg ttctcagggctgctgggctcaagctctggcgtgcctgaagcatcagagccgaggcctgga gaacaggagccaatcttccgcaagcgagagttcaataaggagatcaagtcattactgagc cagctggagtccctcgacctgcctggctactgtccgctccgtgagccccatcgcacgctg aacttcctggctgaccaccgcctcttccctgccctgcaaagcgtggtcagccaggctgtg gataagctccgtggcgcccactgccgcgacggccgtcctctgttccccaccagcttggag cccacctcagatctgccgcctctgggctctgagccagctaaacccaccaatggcgggcag ccctatgcttccccccgccccacagtctccagccccaagatgcttcagagaaaacgcaag gacagaggaggctccccctccatgtctagtgcccaggtggccaccagattcaaactcaag gtgacacccacggagaagcccaatgtccccagcccctcactccactccagggaggaggca cctgactcagatcccaaattacaaaacccacctgtttccctgagctccagccagagggcc cagccctggcagggcctgcacctcaccctgcccacgccagggattgtggtggaggtggcc tgcagccagggccacctcaggggccctgtcacacctccactttcctccccctacccccgc tcttcctgctaccttctccctgagctctctccagttgcctcttcatctcccgcctcactg tgtccagagagcccctgcagcagcagcaggttcacgaagaagaagccgctgccctccatc tcgtcgaagtccagcatgtctcacttctccaaccgcctttatgaggagctcgccgacttc ctgacccagcaggcagcctccttggtcatccgcaagtacgagttcgaaaaggacctcagt aagcagctgggcttcttctccttccccatcacccacgtgctcagggacctttccctgggc ttaaagaaggtaaaaggctcccgcatccacctgtcctcggagacccaccggagctgcctg ctgcgtaaactggaggagtccaaaagggcccggcaggcctcccggctcagcacctcccac tgcagcacagagacaccctctgtgcagcaggaaccagccacccacactgcccaggaccag gccacagagccctgccgctccctctacaccaacttgccagccagccggcagctcagccct ttggagcccaagctctacatgtctgcctgcaccggcatgggttccagtccccccaagtcc 
aaggacatggacaatgagggccgtgataaagccgagattgaagatgaagatgaggatgag ttcaaggatgaagaccaggatgaggacaaggatgaggatggagtctag >gi568815576f:21533182_21737000|GENSCAN_predicted_peptide_6|221_aa MWSAGRGGAAWPVLLGLLLALLVPGGGAAKTGAELVTCGSVLKLLNTHHRVRLHSHDIKY GSGSGQQSVTGVEASDDANSYWRIRGGSEGGCPRGSPVRCGQAVRLTHVLTGKNLHTHHF PSPLSNNQEVSAFGEDGEGDDLDLWTVRCSGQHWEREAAVRFQHVGTSVFLSVTGEQYGS PIRGQHEVHGMPSANTHNTWKAMEGIFIKPSVEPSAGHDEL >gi568815576f:21533182_21737000|GENSCAN_predicted_CDS_6|666_bp atgtggagcgcgggccgcggcggggctgcctggccggtgctgttggggctgctgctggcg ctgttagtgccgggcggtggtgccgccaagaccggtgcggagctcgtgacctgcgggtcg gtgctgaagctgctcaatacgcaccaccgcgtgcggctgcactcgcacgacatcaaatac ggatccggcagcggccagcaatcggtgaccggcgtagaggcgtcggacgacgccaatagc tactggcggatccgcggcggctcggagggcgggtgcccgcgcgggtccccggtgcgctgc gggcaggcggtgaggctcacgcatgtgcttacgggcaagaacctgcacacgcaccacttc ccgtcgccgctgtccaacaaccaggaggtgagtgcctttggggaagacggcgagggcgac gacctggacctatggacagtgcgctgctctggacagcactgggagcgtgaggctgctgtg cgcttccagcatgtgggcacctctgtgttcctgtcagtcacgggtgagcagtatggaagc cccatccgtgggcagcatgaggtccacggcatgcccagtgccaacacgcacaatacgtgg aaggccatggaaggcatcttcatcaagcctagtgtggagccctctgcaggtcacgatgaa ctctga >gi568815576f:21533182_21737000|GENSCAN_predicted_peptide_7|974_aa MRPGRLQTARRGARRRGLGACREVGTLKKLLTNHVLGAAKSPTRVKGPMAPAELGQDPRS RVRPLEVATRALQPRSLVEILDKFDSEVPQSKTCQQISEEDLERQADTYTGRALRLLFRS PVRNPSLAERVAKFFCVVQAELNHCNSMGALEMHEQVEQLKQSIHRVHLYSRGEDQPLPV LGPGRGHQEDEKEAKTEETRTQPLVGPVDLPMPATNPAATSAATTTMVSVVTSSPPGTWG CLACLKPSTLPVLNTLTFMVARSQPFVYPVCTPDGIVFDLLNIVPWLKKYGTNPSNGEKL DGRSLIKLNFSKNSEGKYHCPVLFTVFTNNTHIVAVRTTGNVYAYEAVEQLNIKAKNFRD LLTDEPFSRQDIITLQDPTNLDKFNVSNFYHVKNNMKIIDPDEEKAKQDPSYYLKNTNAE TRETLQELYKEFKGDEILAATMKAPEKKKVDKLNAAHYSTGKVSASFTSTAMVPETTHEA AAIDEDVLRYQFVKKKGYVRLHTNKGDLNLELHCDLVGVEASHSPCPKVISGSSDSHVLN IGLPLEIQPVTRRDPQQCPVSFLGGRVGSMACAQTPKTCENFIRLCKKHYYDGTIFHRSI RNFVIQGGDPTGTGTGGESYWGKPFKDEFRPNLSHTGRGILSMANSGPNSNRSQFFITFR SCAYLDKKHTIFGRVVGGFDVLTAMENVESDPKTDRPKEEIRIDATTVFVDPYEEADAQI 
AQERKTQLKVAPETKVKSSQPQAGSQGPQTFRQGVGKYINPAATRGTASEGLAGASVGGR DQIQGDVGAAGSEQQRKSPQPVPLSPCPRRSPVGVLGTSAPGSSRLAAVDLGTAYLYNQC GPLTNAITWDKFSDPRLTEPKPLQGTVLRKAWCSPGSNWLVRGSDTKLGLQRRGTVGHSD CPGWGRHEPPTVSPSPRGIGGWVPTWERNEDSLKGKVCYSIDMDQPLSNCLTPKLLSLLR AQPLVPDPVRLSPQ >gi568815576f:21533182_21737000|GENSCAN_predicted_CDS_7|2925_bp atgaggcccgggaggctacagacagcgaggaggggcgcccgtcggaggggactaggggcc tgccgagaggtggggaccttaaagaagttgctgaccaaccacgtcctcggcgcagccaag agcccgacccgggtcaaaggcccgatggcccctgcggaactgggccaggacccgcgcagc cgcgttcgccccctcgaggtcgccacccgggctctccagccgcgctcgctcgtagagatc ctagacaagttcgacagcgaggtcccgcagagcaagacctgccagcagatctccgaggag gacttggagaggcaggcggacacctacactgggcgcgcgctgcgcctcctcttccgcagc cccgtccgcaacccgtcgctggcggagagggtggcgaagttcttctgtgttgtgcaggcg gagctgaaccattgcaacagcatgggcgccctggagatgcacgagcaggtggagcagctg aagcagagcatccaccgcgtgcacctctactcccggggcgaggatcaacctctccccgtg ctgggccctgggcggggacaccaagaagacgagaaagaagccaagacagaagaaaccaga acccaacctcttgtgggaccggttgacctccccatgcctgccaccaaccctgctgccacc tctgccgctaccaccaccatggtctccgtggtgacctcatcaccccctggcacctggggc tgccttgcttgcctaaagccaagtacattacctgtgctgaatacactcacttttatggtg gcaagaagccagccctttgtctacccagtctgcactcccgatggcatcgtctttgactta ctgaacattgttccatggcttaagaagtacgggaccaaccccagcaatggagagaagctg gacgggaggtccctgatcaagctgaacttttccaagaacagtgaggggaagtaccactgc ccagtgctgtttaccgtgttcaccaacaacacccacatcgtggctgtgaggacgaccggc aacgtctacgcctatgaggcagtggaacagctaaatatcaaggccaagaacttccgggac ctgctgaccgacgagcccttctcccggcaggacatcatcaccctccaggaccccaccaat ttggacaagttcaatgtctctaacttctatcatgtgaagaataacatgaaaataatagac ccagatgaagagaaggccaaacaggacccgtcttattatctgaaaaatacaaatgccgag acccgagagaccctgcaggagctctacaaggagttcaaaggggacgagattctggcagcc accatgaaggccccggagaagaagaaagtggacaagctgaacgctgcccactattccaca gggaaggtcagcgcttccttcacctccaccgcgatggtcccggagaccacacatgaagca gctgccatcgacgaggatgtgctgcgctaccagtttgtgaagaagaagggctacgtgcgg ctgcacaccaacaagggcgacctcaacctggagctgcactgcgacctggtgggtgtggag gccagccactccccatgccccaaggtcatctctgggtcatctgacagccatgtcttaaat 
atagggctgccactggaaattcagcctgtcactaggcgggacccgcagcagtgccctgtg tccttcctagggggccgcgttggcagcatggcctgcgcacagacaccaaaaacctgcgaa aacttcatcaggctttgcaagaagcattattacgatggcaccatcttccacagatccatc cggaactttgtgatccaagggggcgaccccacaggcacaggcacgggtggggagtcatac tgggggaagcccttcaaagacgagttccggcccaacctctcgcacacgggccgcggcatc ctcagcatggccaactccgggcccaacagcaacaggtctcaattcttcatcacgtttcgc tcctgtgcctacctggacaagaagcataccatctttggacgggttgttgggggctttgac gtactgacagccatggagaatgtggagagtgaccccaaaactgaccgccctaaggaggag atccgcattgatgccactacagtgttcgtggacccctatgaggaggccgatgcccagatt gcgcaggagcggaagacacagctcaaggtagccccggagaccaaagtgaagagcagccag ccccaggcagggagccagggcccccagaccttccgccagggcgtgggcaagtacatcaac ccagcagccacccgagggactgcctctgaaggcctggccggggcctctgtgggtggaaga gatcaaattcagggggatgtgggagcagcaggaagcgagcagcagaggaagagccctcaa ccagtgccactgtccccatgtccaagaagaagcccagtcggggttttggggacttcagct cctggtagcagcaggttggccgctgtggaccttgggacagcctatttatacaaccagtgt ggtcccctgaccaacgccattacctgggacaagttttcagaccccagacttactgagcct aagcctctgcagggcactgtcctccgcaaggcctggtgcagccctggcagtaactggctt gtaagaggctcagacaccaagctgggcctgcagaggaggggcacagtaggacacagtgac tgcccaggctggggacggcacgaacctccaactgtttcccctagtccccggggcatcgga ggctgggttcccacatgggagaggaatgaagactcactcaaagggaaggtgtgttactct atagacatggaccagccactcagcaactgcctgacacctaagctgctgtcactgctgagg gcacaacccctggtaccagaccctgtgaggctgtcacctcagtaa >gi568815576f:21533182_21737000|GENSCAN_predicted_peptide_8|153_aa MEELMLSRNDSVLHPSSGHDIPPASGHELPASSYVMTTDPQLTAAGLDSPGSGQHRFVPE STRSQSHGTFFFQSFQGSQGRAYLFNSVVNVGCGPAEERVLLTGLHAVADIYCENCKTTL GWKYEHAFESSQKYKEGKFIIELAHMIKDNGWE >gi568815576f:21533182_21737000|GENSCAN_predicted_CDS_8|462_bp atggaggagctgatgctcagtcggaatgacagtgttctccacccatcctcaggtcatgac atccccccagcctcaggtcatgaactccccgcatcctcatacgtcatgaccaccgatcct cagctaacagccgcggggctggactcccctggttctgggcagcaccgcttcgttccagaa agcactcgcagccaatctcacgggaccttcttctttcagtcctttcaggggagccaggga cgcgcctacctcttcaattccgtggtgaacgtgggctgcggccctgcagaggagagggtc cttctcaccgggctgcatgcggttgccgacatctactgcgagaactgcaagaccacgctc 
gggtggaaatacgagcatgcctttgagagcagtcagaaatataaggaaggaaaattcatc attgagcttgctcatatgatcaaagacaatggctgggagtaa >gi568815576f:21533182_21737000|GENSCAN_predicted_peptide_9|71_aa MTDAAIISNRTCAKFSWWHHPSLACDKGPAESQAHRIFFQAKQKSKGFPKFESTGLGRRA KRQQCPHTCSS >gi568815576f:21533182_21737000|GENSCAN_predicted_CDS_9|216_bp atgactgatgctgccatcatcagcaacaggacgtgcgccaagttttcctggtggcaccac cccagcctggcctgtgacaaggggcctgcagagagtcaggcacaccggattttctttcag gccaaacagaagagcaagggtttccccaaattcgagagcacggggctgggaagaagagcc aagcgccagcagtgtcctcacacctgctcctcctga >gi568815576f:21533182_21737000|GENSCAN_predicted_peptide_10|54_aa MGAAGRGADRGAAARWDRERRAPVLWIEYPPVGLWWPLAGRDAYFSGCCENSTA >gi568815576f:21533182_21737000|GENSCAN_predicted_CDS_10|165_bp atgggcgctgcgggccggggcgcggaccgcggagcggccgccaggtgggaccgagagcgg agagcacctgtgctctggattgagtacccgcctgtgggactgtggtggccgctagcgggc agggatgcctatttctcaggatgttgtgagaattcaacggcctga