GENSCAN 1.0    Date run: 4-Nov-116    Time: 12:23:43

Sequence gi568815589r:34974111_35179524 : 205414 bp : 46.82% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2042   2120   79  1  1  108   95    87 0.892  10.52
 1.02 Intr +   2425   2630  206  2  2   23   94   245 0.998  17.52
 1.03 Intr +   2973   3133  161  2  2  108   46    91 0.893   5.69
 1.04 Intr +   3433   3531   99  1  0   47  109   123 0.557   9.53
 1.05 Term +   3905   4001   97  2  1  106   39   135 0.991   7.84
 1.06 PlyA +   7609   7614    6                               1.05

 2.00 Prom +   9986  10025   40                              -6.46
 2.01 Init +  16521  16702  182  2  2   40  123   135 0.612   9.44
 2.02 Intr +  17632  17760  129  1  0  130   45    63 0.864   6.01
 2.03 Intr +  19090  19334  245  1  2   58  116   409 0.973  37.84
 2.04 Intr +  19614  19664   51  1  0   93   53    47 0.598   0.58
 2.05 Intr +  22155  22756  602  1  2  118   59   969 0.937  89.23
 2.06 Term +  22916  23149  234  1  0  113   44    96 0.927   4.02
 2.07 PlyA +  24299  24304    6                               1.05

 3.00 Prom +  29453  29492   40                              -6.76
 3.01 Sngl +  36589  36789  201  0  0   71   38   240 0.686  12.48
 3.02 PlyA +  37342  37347    6                               1.05

 4.00 Prom +  55159  55198   40                              -2.26
 4.01 Init +  61734  61977  244  2  1   70   38   125 0.054   1.64
 4.02 Intr +  62062  62273  212  2  2   87   59    29 0.031  -1.47
 4.03 Intr +  68751  70443 1693  2  1   71   91   385 0.704  24.42
 4.04 Intr +  77417  77491   75  0  0   79   75    29 0.020   0.19
 4.05 Term +  82267  82298   32  2  2   55   54    38 0.004  -4.88
 4.06 PlyA +  82554  82559    6                               1.05

 5.54 PlyA -  82818  82813    6                               1.05
 5.53 Term -  83112  83007  106  2  1   99   42    86 0.984   2.98
 5.52 Intr -  83420  83266  155  2  2   51  111   237 0.999  21.17
 5.51 Intr -  85109  84954  156  2  0   89  101   210 0.945  22.61
 5.50 Intr -  85691  85362  330  2  0   83  -19   437 0.780  28.33
 5.49 Intr -  86415  86203  213  0  0   96  105   263 0.999  27.71
 5.48 Intr -  86813  86691  123  2  0   97   91   116 0.998  13.58
 5.47 Intr -  87069  86881  189  0  0   46   86   262 0.590  21.58
 5.46 Intr -  87579  87467  113  1  2  123   72   156 0.999  17.60
 5.45 Intr -  88028  87893  136  2  1   96   87   148 0.999  15.64
 5.44 Intr -  88240  88107  134  2  2   98   19   198 0.999  14.26
 5.43 Intr -  88970  88868  103  2  1   64   81   126 0.998   9.25
 5.42 Intr -  90175  90044  132  1  0   87  105   135 0.983  15.94
 5.41 Intr -  91271  91141  131  0  2  121  105    75 0.984  12.91
 5.40 Intr -  92707  92565  143  0  2   60  100   277 0.991  26.10
 5.39 Intr -  93953  93781  173  2  2   85   89   234 0.812  21.94
 5.38 Intr -  94252  94141  112  0  1   97  115   179 0.999  21.88
 5.37 Intr -  94714  94594  121  2  1   92   84    37 0.896   3.25
 5.36 Intr -  97968  97781  188  2  2   61   68    74 0.333   2.03
 5.35 Intr -  98513  98399  115  0  1   60   60   102 0.679   4.11
 5.34 Intr - 100972 100817  156  2  0   15   91   149 0.796   7.88
 5.33 Intr - 101215 101169   47  0  2  120   53    85 0.834   6.25
 5.32 Intr - 101619 101355  265  1  1  126   99    86 0.966  10.07
 5.31 Intr - 101918 101848   71  1  2   67   57    68 0.834   0.43
 5.30 Intr - 102473 102322  152  2  2   87   78   -19 0.403  -4.04
 5.29 Intr - 102760 102614  147  1  0   92   75    66 0.403   6.13
 5.28 Intr - 102991 102861  131  2  2   86   58    70 0.856   4.21
 5.27 Intr - 103289 103154  136  2  1   88   82   107 0.999  10.24
 5.26 Intr - 104233 104031  203  2  2   51   90   132 0.996   8.70
 5.25 Intr - 104626 104495  132  2  0   34   78    78 0.583   2.02
 5.24 Intr - 105131 105041   91  2  1   97  102     4 0.960   2.27
 5.23 Intr - 105425 105331   95  0  2   42   87    85 0.697   3.48
 5.22 Intr - 106154 105968  187  2  1   17   66   186 0.145   8.56
 5.21 Intr - 115111 115027   85  0  1  115   42    55 0.227   3.42
 5.20 Intr - 115340 115270   71  2  2  104   78     9 0.852  -0.52
 5.19 Intr - 116129 115956  174  2  0  -18   94   202 0.552  10.34
 5.18 Intr - 118657 117089 1569  1  0  107   76   565 0.406  46.48
 5.17 Intr - 119099 118920  180  2  0   87   96    77 0.974   8.46
 5.16 Intr - 119470 119311  160  0  1   67   78   117 0.996   8.59
 5.15 Intr - 119914 119791  124  2  1   78   81   122 0.999  10.14
 5.14 Intr - 120249 120106  144  1  0  140   55   138 0.999  15.95
 5.13 Intr - 121456 120945  512  0  2   90   82   323 0.775  24.51
 5.12 Intr - 122138 122084   55  0  1   90    7    57 0.533  -4.16
 5.11 Intr - 122648 122451  198  0  0   47   99   104 0.546   6.72
 5.10 Intr - 126062 125930  133  2  1  112    4   142 0.549   8.42
 5.09 Intr - 126616 126488  129  1  0   67   66   205 0.994  17.09
 5.08 Intr - 126901 126822   80  2  2  102   47   103 0.807   6.87
 5.07 Intr - 127169 127025  145  2  1   62   71   177 0.767  13.26
 5.06 Intr - 127450 127316  135  1  0   62   75   241 0.975  20.96
 5.05 Intr - 127701 127600  102  0  0   74  102   151 0.999  15.47
 5.04 Intr - 127852 127794   59  2  2   96   77    77 0.998   6.00
 5.03 Intr - 128084 127985  100  2  1   76   85   163 0.999  14.48
 5.02 Intr - 128713 128576  138  1  0  125   39   113 0.983  10.76
 5.01 Init - 128984 128940   45  2  0   53   76   113 0.811   5.29
 5.00 Prom - 129767 129728   40                             -13.52

 6.18 PlyA - 130036 130031    6                               1.05
 6.17 Term - 131264 131111  154  1  1   98   38   118 0.992   5.19
 6.16 Intr - 131747 131572  176  2  2  113   94   157 0.996  17.54
 6.15 Intr - 131899 131831   69  1  0  102   89    58 0.909   6.68
 6.14 Intr - 132300 132142  159  0  0   70   72   145 0.992  11.28
 6.13 Intr - 132533 132428  106  1  1  104   55    82 0.938   6.72
 6.12 Intr - 132781 132723   59  1  2  105   87    20 0.954   1.28
 6.11 Intr - 133441 133271  171  1  0   36   99   135 0.785   9.54
 6.10 Intr - 136577 136496   82  1  1   87   42    60 0.264   0.94
 6.09 Intr - 142291 142152  140  1  2   67   76    39 0.254  -0.14
 6.08 Intr - 142641 142432  210  0  0   88   70    52 0.231   2.51
 6.07 Intr - 152602 152410  193  0  1    3   81   150 0.070   5.29
 6.06 Intr - 164831 164755   77  2  2   -3   91    85 0.104  -1.99
 6.05 Intr - 165314 165130  185  0  2   97   97    40 0.107   5.31
 6.04 Intr - 174082 173398  685  1  1  -10   85   874 0.517  68.44
 6.03 Intr - 175124 174097 1028  0  2   75  -57   602 0.022  34.71
 6.02 Intr - 181188 181164   25  0  1  144   31    -5 0.048  -3.00
 6.01 Intr - 188208 188041  168  0  0   86   74   149 0.153  13.44

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:34974111_35179524|GENSCAN_predicted_peptide_1|213_aa DNINLLLTEEEMYSLTETFQRCKVIPDCSLTLEDFLRYRHQAAKRGDRDRALSEEQEEQA ARQFAALDPEHRGHIEWPDFLSHESLLLLQQLRPQNSLLRLLTVKERERARAAFLARGSG STVSEAECRRAQHSWFCKRFPEAPSCSVSSISHVGPIADSSPASSSSKSQDKTLLPTEQE SRFVDWPTFLQENVLYILAARPNSAAIHLKPPG >gi568815589r:34974111_35179524|GENSCAN_predicted_CDS_1|642_bp gacaacatcaacttgctgcttactgaggaggaaatgtatagcctcacggagacctttcag cggtgtaaagtcatccctgattgctccctgacactggaggactttctgcgttaccgccac caagcagccaagcggggggaccgtgacagggccctgagtgaggagcaagaagagcaggcg gcccgccagtttgctgccctggaccctgaacatcgaggccacatagagtggcctgacttc ttgtcccatgagtccctcctgcttctgcagcagttgcgtccccagaactctctgttgagg cttctgacagtgaaggagcgggagcgagcccgagccgccttcctggctcggggcagtggg agcaccgtcagtgaggcagagtgccgccgggcccagcactcttggttttgcaaacggttc ccagaggctccttcctgcagtgtcagcagcatcagccatgtgggtcccatcgcagatagc agcccagccagcagcagcagcaagagtcaggacaagaccctgctgcccacagagcaggag tccagatttgtggactggcctaccttcctgcaagagaatgtcctctacatcctggctgct cgccccaacagcgcagccattcacctgaagcccccaggatag >gi568815589r:34974111_35179524|GENSCAN_predicted_peptide_2|480_aa MFKRTVLSCPPPAAPPLQARGAFRSFPHSWGEDFLASLMFKIQLEPLKLRAWTLNGFVKF RRDMLVGGNGGVPAFVEPFLRQRKVTPRTMSQPSHLQQGLWGQQNKETSAGPVAVMGKDY YKILGIPSGANEDEIKKAYRKMALKYHPDKNKEPNAEEKFKEIAEAYDVLSDPKKRGLYD QYGEEALLVLPEVSAASLADEEGLKTGGGTSGGSSGSFHYTFHGDPHATFASFFGGSNPF DIFFASSRSTRPFSGFDPDDMDVDEDEDPFGAFGRFGFNGLSRGPRRAPEPLYPRRKVQD PPVVHELRVSLEEIYHGSTKRMKITRRRLNPDGRTVRTEDKILHIVIKRGWKEGTKITFP KEGDATPDNIPADIVFVLKDKPHAHFRRDGTNVLYSALISLKEALCGCTVNIPTIDGRVI PLPCNDVIKPGTVKRLRGEGLPFPKVPTQRGDLIVEFKVRFPDRLTPQTRQILKQHLPCS >gi568815589r:34974111_35179524|GENSCAN_predicted_CDS_2|1443_bp atgtttaagcgcacagtgctctcctgcccacccccagcagcacccccactgcaggcccga ggagctttccggagcttcccacactcctggggagaagacttcttagccagcttgatgttt aaaattcagctggagcccttaaaacttcgagcgtggacgctgaatgggtttgtaaagttt cgacgtgacatgctggtggggggcaatgggggagtacccgcctttgtggagccttttctt 
cgtcagagaaaggtgactcccagaaccatgagtcaacccagccacctgcagcagggcctg tgggggcagcaaaacaaggagaccagtgctggtccagtggctgtgatgggaaaagattat tacaagattcttgggatcccatcgggggccaacgaggatgagatcaagaaagcctaccgg aagatggccttgaagtaccacccagacaagaataaagaacccaacgctgaggagaagttt aaggagattgcagaggcctatgatgtgctaagtgaccccaagaaacggggcctgtatgac cagtatggggaggaagccttattagtcctgcctgaagtctccgctgcttcattggctgat gaggaaggcctgaagaccggcggtggcacatcaggtggctccagtggctcctttcactac acctttcatggggacccccatgccacctttgcctccttctttggtggctccaaccccttc gatatcttctttgccagcagccgctccactcggcccttcagtggctttgacccagatgac atggatgtggatgaagatgaggacccatttggcgctttcggccgttttggcttcaatggg ctgagtaggggtccaaggcgagccccagaaccactgtaccctcggcgcaaggtgcaggac cccccagtggtgcacgagctgcgggtgtccctggaggagatctaccatggctccaccaag cgcatgaagatcacaaggcgtcgcctcaaccctgatgggcgaactgtgcgcaccgaggac aagatcctgcacatagtcatcaagcgtggctggaaggaaggcaccaagatcaccttcccc aaagaaggcgacgccacacctgacaacatccctgctgacatcgtctttgtgctcaaagac aagccccatgcacacttccgccgagatggcaccaacgtgctctacagtgccctgatcagc ctcaaggaggcgctgtgtggctgcactgtgaacattcccactatcgacggccgagtgatc cctttgccctgcaatgatgtcatcaagccaggcaccgtgaagagactccgtggggagggc cttcccttccccaaagtgccaactcagcgaggagacctcattgttgagttcaaagttcgc ttcccagacagattaacaccacagacaagacagatccttaagcagcacctaccctgttcc tag >gi568815589r:34974111_35179524|GENSCAN_predicted_peptide_3|66_aa MAEKVSVDSMEKEAFAAAEELATQKHEPRLHKFWELHLKETEAWKLNRQEVVEDDKRLKT TCKLGS >gi568815589r:34974111_35179524|GENSCAN_predicted_CDS_3|201_bp atggctgagaaggtgtcagtggacagcatggagaaggaggcttttgcagcagctgaggag ctggccactcagaagcatgaaccaagactgcacaaattctgggagctgcacctgaaagag actgaagcttggaaattaaatcgccaagaagttgtggaggacgataaaagacttaaaact acctgcaaactgggaagttaa >gi568815589r:34974111_35179524|GENSCAN_predicted_peptide_4|751_aa MPSWAGRAGSASAPRDSISPLAADTLTHATFRGSYFPFPPAHQGHCGLTTPSSPGKQFPQ DRGHYPSSGRRPHKPLTEGRLRGNWSWVAAGPLTLHPYGEGGTGERNSRRERQRGGWRGR TREFRWRLWLRVDAPPGRWRGGAGPTSPKAREELPLLHRVAFLDHLCKQKSEVEEEGEEE EEGEDEASLDPLKPCSPTKEAPTGEQATPAPPQPSCGSEGLLKAIGIPEQTVMQPVSPSR SFPIFQILTSFPVRHKIASGNRQQQRKSQLFWGLPSLHSESLEAIFLSSGGPSPLKWSVC 
SSVFFNKLAFLPRSNLLLPQYHSSAQFSTHGAHTMEDLEGMAPDPQLLPPPSSPSVSSLL LHLRPFPVDHKGVLSGAEAPTQSPGTSPLEVLPGYETHLETTGHKKMPQAFEPPMPPPCQ SPASLSEPRKVSPEGGLAISKDFWGTVGYREKPQASESSMPVPCPPLDSLPELQRESSLE DPSRYKPQWECRENSGNLWAFESPVLDLNPELSGTSPECVPPASETPWKGMQSRENIWVP ADPVSPPSLPSVPLLESLVMGPQGVLSESKALWETMGQKENLWASDSPDPVHSTPPTTLM EPHRINPGECLATSEATWKDTEHSRNSSASRSPSLALSPPPALAPELLRVRSMGVLSDSE ARCGDIQKTKNSWASKHPACNLPQDLHGASPLGVLSDSQSIVGEMEQKENCVPVFPALFS SNLSTGSESKTHHLLAKGGTKVLSSDTADKG >gi568815589r:34974111_35179524|GENSCAN_predicted_CDS_4|2256_bp atgccatcctgggccggcagggcaggttcagcttctgcccctcgggactcgatctcacct ctggccgctgacactctgacccatgcgacttttcggggctcctacttcccattcccacca gcccaccaggggcactgcggcctcacaactccctcatcccctgggaaacaattcccccag gaccgtgggcattatccctccagcggacgtcggccccacaagcccctcacggaagggcgg ctccgtggcaactggagctgggttgctgccgggcccttaacactacatccctacggagag ggcggaacaggagagaggaatagcagaagggaaaggcagaggggagggtggcgaggaaga acccgcgagttccgatggaggctgtggctgcgcgtcgatgctccaccaggacgctggaga ggcggcgcgggtcccacaagcccgaaagcccgggaggaactaccacttctgcaccgtgtg gccttccttgatcacctgtgtaagcagaaatcagaagtggaggaagaaggggaagaagag gaagagggggaagacgaggcatctctggatccactgaagccatgttctcctaccaaagaa gctcccactggagagcaagccactccagccccaccccagccatcctgtggttctgagggc ctcctcaaggctatagggataccagagcaaacagtcatgcagcccgtgagcccttccaga tccttccccatcttccagattctgaccagctttcctgtgaggcacaagatagcatcaggg aaccgccagcagcagagaaaaagccagctcttctggggtctcccctctctgcacagcgag tccttggaggccatcttcctgagctcaggtggcccctctcctctgaagtggtctgtttgt tcttctgtcttcttcaacaagcttgccttcctacctaggtccaacctgttgcttccccag tatcactcctcagcccagttttctacccatggggcccatactatggaagatctagaaggg atggcccccgatcctcagctgcttccacctccatcttctccttctgtctcatcactactc ctccatctgaggcccttccctgtggaccacaagggagttttatctggcgctgaggcaccc acacagtcccctggaactagccccctggaagttctccctggatatgagactcatttggaa accacaggacacaaaaagatgccccaagcttttgagcctccgatgccacccccctgccaa tccccagcttctctgtcagaacccagaaaagttagccctgaaggaggacttgctatatct aaggacttctggggaaccgtgggatacagagagaaacctcaggcctctgagtcttcaatg ccagtcccttgccctcccctagactccctgccagaactccagagagagagttccctggaa 
gatccatccagatataagccccagtgggaatgcagagaaaactcaggaaacctctgggct tttgagtctccagtcttggacctcaacccagagctctctggaaccagccctgaatgtgtc ccaccagcatctgagacaccatggaagggcatgcaaagtagagaaaatatttgggtccct gcagacccagtttcacctcccagccttccctcagtccctctcctggagtctctagtaatg ggcccccagggagtcctgtctgaatccaaagctttgtgggagaccatggggcagaaagag aacctctgggcatctgattccccagaccctgttcatagcacacctccaaccacccttatg gaaccacacagaatcaatcctggggaatgcctcgctacatcagaagctacatggaaggat actgagcattccaggaattcctcggcttctaggtctccatctctggccctcagcccaccc ccagctcttgcaccggagctgctcagagttagatccatgggggtcctgtctgattctgaa gctagatgtggggacatacaaaagacaaaaaactcctgggcctctaagcacccagcttgt aacttaccccaagacctgcatggagccagccctctgggagtcttgtctgattctcagtct attgtaggggaaatggagcaaaaagaaaactgtgttcctgtgttcccagctttatttagc tcaaacctgtcaacagggtctgagtccaaaactcaccacctgctggccaagggtggaact aaagtcctatccagtgatactgcagacaaagggtga >gi568815589r:34974111_35179524|GENSCAN_predicted_peptide_5|3007_aa MLARAARGTGALLLRGSLLASGRAPRRASSGLPRNTVVLFVPQQEAWVVERMGRFHRILE PGLNILIPVLDRIRYVQSLKEIVINVPEQSAVTLDNVTLQIDGVLYLRIMDPYKASYGVE DPEYAVTQLAQTTMRSELGKLSLDKVFRERESLNASIVDAINQAADCWGIRCLRYEIKDI HVPPRVKESMQMQVEAERRKRATVLESEGTRESAINVAEGKKQAQILASEAEKAEQINQA AGEASAVLAKAKAKAEAIRILAAALTQHNGDAAASLTVAEQYVSAFSKLAKDSNTILLPS NPGDVTSMVAQAMGVYGALTKAPVPGTPDSLSSGSSRDVQGTDASLDEELDRVKMTWSPV PNFQLLNIPSNWGQPHAPGQTSTEVPADGDGATDGPLCLAHASLCCQVAGAAAAALPGAI AGGAVGWARIPLRLRSLSTGMQKASVLLFLAWVCFLFYAGIALFTSGFLLTRLELTNHSS CQEPPGPGSLPWGSQGKPGACWMASRFSRVVLVLIDALRFDFAQPQHSHVPREPPVSLPF LGKLSSLQRILEIQPHHARLYRSQVDPPTTTMQRLKALTTGSLPTFIDAGSNFASHAIVE DNLIKQLTSAGRRVVFMGDDTWKDLFPGAFSKAFFFPSFNVRDLDTVDNGILEHLYPTMD SGEWDVLIAHFLGVDHCGHKHGPHHPEMAKKLSQMDQVIQGLVERLENDTLLVVAGDHGM TTNGDHGGDSELEVSAALFLYSPTAVFPSTPPEEPEVIPQVSLVPTLALLLGLPIPFGNI GEVMAELFSGGEDSQPHSSALAQASALHLNAQQVSRFLHTYSAATQDLQAKELHQLQNLF SKASADYQWLLQSPKGAEATLPTVIAELQQFLRGARAMCIESWARFSLVRMAGGTALLAA SCFICLLASQWAISPGFPFCPLLLTPVAWGLVGAIAYAGLLGTIELKLDLVLLGAVAAVS SFLPFLWKAWAGWGSKRPLATLFPIPGPVLLLLLFRLAVFFSDSFVVAEARATPFLLGSF ILLLVVQLHWEGQLLPPKLLTMPRLGTSATTNPPRHNGAYALRLGIGLLLCTRLAGLFHR 
CPEETPVCHSSPWLSPLASMVGGRAKNLWYGACVAALVALLAAVRLWLRRYGNLKSPEPP MLFVRWGLPLMALGTAAYWALASGADEAPPRLRVLVSGASMVLPRAVAGLAASGLALLLW KPVTVLVKAGAGAPRTRTVLTPFSGPPTSQADLDYVVPQIYRHMQEEFRGRLERTKSQGP LTVAAYQLGSVYSAAMVTALTLLAFPLLLLHAERISLVFLLLFLQSFLLLHLLAAGIPVT TPGKYLSSDSLKDNSDSQGLRKRQQPPGNEADARVRPEEEEEPLMEMRLRDAPQHFYAAL LQLGLKYLFILGIQILACALAASILRRHLMVWKVFAPKFIFEAVGFIVSSVGLLLGIALV MRVDGAVLLSSASTERHCQQTTRGRKPTLVSVLVLDSEQRKDGRLRSALVSSYRFLETPS AGAELFRPASATMSRQTTSVGSSCLDLWREKNDRLVRQAKVAQNSGLTLRRQQLAQDALE GLRGLLHSLQGLPAAVPVLPLELTVTCNFIILRASLAQGFTEDQAQDIQRSLERVLETQE QQGPRLEQGLRELWDSVLRASCLLPELLSALHRLVGLQAALWLSADRLGDLALLLETLNG SQSGASKDLLLLLKTWSPPAEELDAPLTLQDAQGLKDVLLTAFAYRQGLQELITGNPDKA LSSLHEAASGLCPRPVLVQVYTALGSCHRKMGNPQRALLYLVAALKEGSAWGPPLLEASR LYQQLGDTTAELESLELLVEALNVPCSSKAPQFLIEVELLLPPPDLASPLHCGTQSQTKH ILASRCLQTGRAGDAAEHYLDLLALLLDSSEPRVGPCMPEVFLEAAVALIQAGRAQDALT LCEELLSRTSSLLPKMSRLWEDARKGTKELPYCPLWVSATHLLQGQAWVQLGAQKVAISE FSRCLELLFRATPEEKEQGAAFNCEQGCKSDAALQQLRAAALISRGLEWVASGQDTKALQ DFLLSVQMCPVSAKRLRPSFESSLPLPLPLPLPPRGSGASVVRPTPRCRPRPARLAPLER TSGPGQVFRPTPPARRPGALGRQSAVRPTTRRKPLVPGESRPREPEAPAGPEEDIKVQRL GNLPKITIKQWHNWNSDPMGLTIEFLLLTTLLSKGDDLSTAILKQKNRPNRLIVDEAINE DNSVVSLSQPKMDELQLFRGDTVLLKGKKRREAVCIVLSDDTCSDEKIRMNRVVRNNLRV RLGDVISIQPCPDVKYGKRIHVLPIDDTVEGITGNLFEVYLKPYFLEAYRPIRKGDIFLV RGGMRAVEFKVVETDPSPYCIVAPDTVIHCEGEPIKREDEEESLNEVGYDDIGGCRKQLA QIKEMVELPLRHPALFKAIGVKPPRGILLYGPPGTGKTLIARAVANETGAFFFLINGPEI MSKLAGESESNLRKAFEEAEKNAPAIIFIDELDAIAPKREKTHGEVERRIVSQLLTLMDG LKQRAHVIVMAATNRPNSIDPALRRFGRFDREVDIGIPDATGRLEILQIHTKNMKLADDV DLEQVANETHGHVGADLAALCSEAALQAIRKKMDLIDLEDETIDAEVMNSLAVTMDDFRV RTTPVPQWALSQSNPSALRETVVEVPQVTWEDIGGLEDVKRELQELVQYPVEHPDKFLKF GMTPSKGVLFYGPPGCGKTLLAKAIANECQANFISIKGPELLTMWFGESEANVREIFDKA RQAAPCVLFFDELDSIAKARGGNIGDGGGAADRVINQILTEMDGMSTKKNVFIIGATNRP DIIDPAILRPGRLDQLIYIPLPDEKSRVAILKANLRKSPVAKAGARSWADVDLEFLAKMT NGFSGADLTEICQRACKLAIRESIESEIRRERERQTNPSAMEVEEDDPVPEIRRDHFEEA MRFARRSVSDNDIRKYEMFAQTLQQSRGFGSFRFPSGNQGGAGPSQGSGGGTGGSVYTED NDDDLYG 
>gi568815589r:34974111_35179524|GENSCAN_predicted_CDS_5|9024_bp atgctggcgcgcgcggcgcggggcactggggcccttttgctgaggggctctctactggct tctggccgcgctccgcgccgcgcctcctctggattgccccgaaacaccgtggtactgttc gtgccgcagcaggaggcctgggtggtggagcgaatgggccgattccaccggatcctggag cctggtttgaacatcctcatccctgtgttagaccggatccgatatgtgcagagtctcaag gaaattgtcatcaacgtgcctgagcagtcggctgtgactctcgacaatgtaactctgcaa atcgatggagtcctttacctgcgcatcatggacccttacaaggcaagctacggtgtggag gaccctgagtatgccgtcacccagctagctcaaacaaccatgagatcagagctcggcaaa ctctctctggacaaagtcttccgggaacgggagtccctgaatgccagcattgtggatgcc atcaaccaagctgctgactgctggggtatccgctgcctccgttatgagatcaaggatatc catgtgccaccccgggtgaaagagtctatgcagatgcaggtggaggcagagcggcggaaa cgggccacagttctagagtctgaggggacccgagagtcggccatcaatgtggcagaaggg aagaaacaggcccagatcctggcctccgaagcagaaaaggctgaacagataaatcaggca gcaggagaggccagtgcagttctggcgaaggccaaggctaaagctgaagctattcgaatc ctggctgcagctctgacacaacataatggagatgcagcagcttcactgactgtggccgag cagtatgtcagcgcgttctccaaactggccaaggactccaacactatcctactgccctcc aaccctggcgatgtcaccagcatggtggctcaggccatgggtgtatatggagccctcacc aaagccccagtgccagggactccagactcactctccagtgggagcagcagagatgtccag ggtacagatgcaagtcttgatgaggaacttgatcgagtcaagatgacttggtcccctgtc cccaattttcaattactaaatatcccatcaaactggggccagccccacgctcctggccaa acttccaccgaggtgccggcagatggcgatggagccaccgacgggccactctgtctggcg cacgcatctctgtgctgccaggtcgcaggcgccgccgccgccgcacttccgggtgccatt gcaggcggcgccgtcggctgggcccggattcccctgcggcttcgatccctttccactggg atgcagaaagcctcagtgttgctcttcctggcctgggtctgcttcctcttctacgctggc attgccctcttcaccagtggcttcctgctcacccgtttggagctcaccaaccatagcagc tgccaagagcccccaggccctgggtccctgccatgggggagccaagggaaacctggggcc tgctggatggcttcccgattttcgcgggttgtgttggtgctgatagatgctctgcgattt gacttcgcccagccccagcattcacacgtgcctagagagcctcctgtctccctacccttc ctgggcaaactaagctccttgcagaggatcctggagattcagccccaccatgcccggctc taccgatctcaggttgaccctcctaccaccaccatgcagcgcctcaaggccctcaccact ggctcactgcctacctttattgatgctggtagtaacttcgccagccacgccatagtggaa gacaatctcattaagcagctcaccagtgcaggaaggcgtgtagtcttcatgggagatgat 
acctggaaagaccttttccctggtgctttctccaaagctttcttcttcccatccttcaat gtcagagacctagacacagtggacaatggcatcctggaacacctctaccccaccatggac agtggtgaatgggacgtgctgattgctcacttcctgggtgtggaccactgtggccacaag catggccctcaccaccctgaaatggccaagaaacttagccagatggaccaggtgatccag ggacttgtggagcgtctggagaatgacacactgctggtagtggctggggaccatgggatg accacaaatggagaccatggaggggacagtgagctggaggtctcagctgctctctttctg tatagccccacagcagtcttccccagcaccccaccagaggagccagaggtgattcctcaa gttagccttgtgcccacgctggccctgctgctgggcctgcccatcccatttgggaatatc ggggaagtgatggctgagctattctcagggggtgaggactcccagccccactcctctgct ttagcccaagcctcagctctccatctcaatgctcagcaggtgtcccgatttcttcatacc tactcagctgctactcaggaccttcaagctaaggagcttcatcagctgcagaacctcttc tccaaggcctctgctgactaccagtggcttctccagagccccaagggggctgaggcgaca ctgccgactgtgattgctgagctgcagcagttcctgcggggagctcgggccatgtgcatc gagtcttgggctcgtttctctctggtccgcatggcggggggtactgctctcttggctgct tcctgctttatctgcctgctggcatctcagtgggcaatatccccaggctttccattctgc cctctactcctgacacctgtggcctggggcctggttggggccatagcgtatgctggactc ctgggaactattgagctgaagctagatctagtgcttctaggggctgtggctgcagtgagc tcattcctcccttttctgtggaaagcctgggctggctgggggtccaagaggcccctggca accctgtttcccatccctgggcccgtcctgttactcctgctgtttcgcttggctgtgttc ttctctgatagttttgttgtagctgaggccagggccacccccttccttttgggctcattc atcctgctcctggttgtccagcttcactgggagggccagctgcttccacctaagctactc acaatgccccgccttggcacttcagccacaacaaaccccccacggcacaatggtgcatat gccctgaggcttggaattgggttgcttttatgtacaaggctagctgggctttttcatcgt tgccctgaagagacacctgtttgccactcctctccctggctgagtcctctggcatccatg gtgggtggtcgagccaagaatttgtggtatggagcttgtgtggcggcgctggtggccctg ttagctgccgtgcgcttgtggcttcgccgctatggtaatctcaagagccccgagccaccc atgctctttgtgcgctggggactgcccctaatggcattgggtactgctgcctactgggca ttggcgtcgggggcagatgaggctcccccccgtctccgggtcctggtctctggggcatcc atggtgctgcctcgggctgtagcagggctggctgcttcagggctcgcgctgctgctctgg aagcctgtgacagtgctggtgaaggctggggcaggcgctccaaggaccaggactgtcctc actcccttctcaggcccccccacttctcaagctgacttggattatgtggtccctcaaatc taccgacacatgcaggaggagttccggggccggttagagaggaccaaatctcagggtccc 
ctgactgtggctgcttatcagttggggagtgtctactcagctgctatggtcacagccctc accctgttggccttcccacttctgctgttgcatgcggagcgcatcagccttgtgttcctg cttctgtttctgcagagcttccttctcctacatctgcttgctgctgggatacccgtcacc acccctggtaaatatctcagctctgattcacttaaagacaatagtgatagtcaagggctg cggaagagacagcagcccccagggaatgaagctgatgccagagtcagacccgaggaggaa gaggagccactgatggagatgcggctccgggatgcgcctcagcacttctatgcagcactg ctgcagctgggcctcaagtacctctttatccttggtattcagattctggcctgtgccttg gcagcctccatccttcgcaggcatctcatggtctggaaagtgtttgcccctaagttcata tttgaggctgtgggcttcattgtgagcagcgtgggacttctcctgggcatagctttggtg atgagagtggatggtgctgtacttttgagcagtgcaagtacagagcggcactgccagcag actacacgcggtagaaagccgaccttggtgagcgtgttggtgctcgacagtgagcagaga aaggatggacgattacggagcgccctcgtctccagttaccgctttctggaaacaccatcc gccggggcggagctgttccgccccgcctcggccaccatgtcccgccagaccacctctgtg ggctccagctgcctggacctgtggagggaaaagaatgaccggctcgttcgacaggccaag gtggctcagaactccggtctgactctgaggcgacagcagttggctcaggatgcactggaa gggctcagagggctcctccatagtctgcaagggctccctgcagctgttcctgttcttccc ttggagctgactgtcacctgcaacttcattatcctgagggcaagcttggcccagggtttc acagaggatcaggcccaggatatccagcggagcctagagagagtgctggagacacaggag cagcaggggcccaggttggaacaggggctcagggagctgtgggactctgtccttcgtgct tcctgccttctgccggagctgctgtctgccctgcaccgcctggttggcctgcaggctgcc ctctggttgagtgctgaccgtcttggggacctggccttgttactagagaccctgaatggc agccagagtggagcctctaaggatctgctgttacttctgaaaacttggagtcccccagct gaggaattagatgctccattgaccctgcaggatgcccagggattgaaggatgtcctcctg acagcatttgcctaccgccaaggtctccaggagctgatcacagggaacccagacaaggca ctaagcagccttcatgaagcggcctcaggcctgtgtccacggcctgtgttggtccaggtg tacacagcactggggtcctgtcaccgtaagatgggaaatccacagagagcactgttgtac ttggttgcagccctgaaagagggatcagcctggggtcctccacttctggaggcctctagg ctctatcagcaactgggggacacaacagcagagctggagagtctggagctgctagttgag gccttgaatgtcccatgcagttccaaagccccgcagtttctcattgaggtagaattacta ctgccaccacctgacctagcctcaccccttcattgtggcactcagagccagaccaagcac atactagcaagcaggtgcctacagacggggagggcaggagacgctgcagagcattacttg gacctgctggccctgttgctggatagctcggagccaagggtggggccctgtatgcctgag 
gtgtttttggaggcagcggtagcactgatccaggcaggcagagcccaagatgccttgact ctatgtgaggagttgctcagccgcacatcatctctgctacccaagatgtcccggctgtgg gaagatgccagaaaaggaaccaaggaactgccatactgcccactctgggtctctgccacc cacctgcttcagggccaggcctgggttcaactgggtgcccaaaaagtggcaattagtgaa tttagcaggtgcctcgagctgctcttccgggccacacctgaggaaaaagaacaaggggca gctttcaactgtgagcagggatgtaagtcagatgcggcactgcagcagcttcgggcagcc gccctaattagtcgtggactggaatgggtagccagcggccaggataccaaagccttacag gacttcctcctcagtgtgcagatgtgcccagtctcagcgaagcgtctgcgaccgtcgttt gagtcgtcgctgccgctgccgctgccactgccactgccacctcgcggatcaggagccagc gttgttcgcccgacgcctcgctgccggccccggcccgcccgattggctcccttagaacgg acgtctgggcctggccaggtcttccggcccactccgccggcgcggcgccccggggctttg gggcgccagtctgccgtccggcctaccacccgccgaaagcctttggtccccggagagagc aggccccgcgagcccgaggccccagccgggcccgaggaggacatcaaagttcagaggtta ggtaacttgcccaagatcacaattaagcagtggcataattggaattcagacccaatgggt ctgactatagagttcctgctcttaaccactcttctttcaaaaggtgatgacctatcaaca gccattctcaaacagaagaaccgtcccaatcggttaattgttgatgaagccatcaatgag gacaacagtgtggtgtccttgtcccagcccaagatggatgaattgcagttgttccgaggt gacacagtgttgctgaaaggaaagaagagacgagaagctgtttgcatcgtcctttctgat gatacttgttctgatgagaagattcggatgaatagagttgttcggaataaccttcgtgta cgcctaggggatgtcatcagcatccagccatgccctgatgtgaagtacggcaaacgtatc catgtgctgcccattgatgacacagtggaaggcattactggtaatctcttcgaggtatac cttaagccgtacttcctggaagcgtatcgacccatccggaaaggagacatttttcttgtc cgtggtgggatgcgtgctgtggagttcaaagtggtggaaacagatcctagcccttattgc attgttgctccagacacagtgatccactgcgaaggggagcctatcaaacgagaggatgag gaagagtccttgaatgaagtagggtatgatgacattggtggctgcaggaagcagctagct cagataaaggagatggtggaactgcccctgagacatcctgccctctttaaggcaattggt gtgaagcctcctagaggaatcctgctttacggacctcctggaacaggaaagaccctgatt gctcgagctgtagcaaatgagactggagccttcttcttcttgatcaatggtcctgagatc atgagcaaattggctggtgagtctgagagcaaccttcgtaaagcctttgaggaggctgag aagaatgctcctgccatcatcttcattgatgagctagatgccatcgctcccaaaagagag aaaactcatggcgaggtggagcggcgcattgtatcacagttgttgaccctcatggatggc ctaaagcagagggcacatgtgattgttatggcagcaaccaacagacccaacagcattgac 
ccagctctacggcgatttggtcgctttgacagggaggtagatattggaattcctgatgct acaggacgcttagagattcttcagatccataccaagaacatgaagctggcagatgatgtg gacctggaacaggtagccaatgagactcacgggcatgtgggtgctgacttagcagccctg tgctcagaggctgctctgcaagccatccgcaagaagatggatctcattgacctagaggat gagaccattgatgccgaggtcatgaactctctagcagttactatggatgacttccgggta aggaccacacccgtgcctcagtgggccttgagccagagtaacccatcagcactgcgggaa accgtggtagaggtgccacaggtaacctgggaagacatcgggggcctagaggatgtcaaa cgtgagctacaggagctggtccagtatcctgtggagcacccagacaaattcctgaagttt ggcatgacaccttccaagggagttctgttctatggacctcctggctgtgggaaaactttg ttggccaaagccattgctaatgaatgccaggccaacttcatctccatcaagggtcctgag ctgctcaccatgtggtttggggagtctgaggccaatgtcagagaaatctttgacaaggcc cgccaagctgccccctgtgtgctattctttgatgagctggattcgattgccaaggctcgt ggaggtaacattggagatggtggtggggctgctgaccgagtcatcaaccagatcctgaca gaaatggatggcatgtccacaaaaaaaaatgtgttcatcattggcgctaccaaccggcct gacatcattgatcctgccatcctcagacctggccgtcttgatcagctcatctacatccca cttcctgatgagaagtcccgtgttgccatcctcaaggctaacctgcgcaagtccccagtt gccaaggcaggtgcaagatcatgggctgatgtggacttggagttcctggctaaaatgact aatggcttctctggagctgacctgacagagatttgccagcgtgcttgcaagctggccatc cgtgaatccatcgagagtgagattaggcgagaacgagagaggcagacaaacccatcagcc atggaggtagaagaggatgatccagtgcctgagatccgtcgagatcactttgaagaagcc atgcgctttgcgcgccgttctgtcagtgacaatgacattcggaagtatgagatgtttgcc cagacccttcagcagagtcggggctttggcagcttcagattcccttcagggaaccagggt ggagctggccccagtcagggcagtggaggcggcacaggtggcagtgtatacacagaagac aatgatgatgacctgtatggctaa >gi568815589r:34974111_35179524|GENSCAN_predicted_peptide_6|1228_aa SAHSPRTQSSDMAEDRASLCRAPAAPDRSLFPLSRCRDCSRYRPHGSGSSSWSRSQVAPR QAEPGIGRSTAETLGPPGSLEMGVLTFRDVAIEFSLEEWHCLDTAQQNLYRNVMLENYRN LVFLGIAASKPDLITCLEQGKEPWNVKRHEMVAEPPVVCSYFAQDLWPNQGIKNCFQKVI LRRYKKCGHENLQLGKYRKSMDECKVHEECCNGFNQCSRTTQSKILQCDKYVKVFHKFSN SNRYKIRHTGKKPFKCKECEKSFFMLSHSAQHKRIHSGEKPYKCKECGKAYNEASNLSTH KRIHTGKKPYKCEDCGKAFNWLSHLTTQKIIHTGKKPYKCEDCGKAFNQSANLTTHKRIH TGEKPYKCEECGKAFSQSSTLTTHKIIHAGEKPYKCEECGKSFNQSSIIHTGEKFYKCEE RGKAFSQFSHLTTRKRIHSGEKPYKCEECGKAFKQSSTLTTHKRIHSGEKFYKCEVCSKA 
FSRFSHLTTHKRIHTGEKPYKCEECGKAFNLSSHLTTHKIIHTGEKPYKCEECGKAFNQS STLSKHKVIHTGEKPYKCGECGKAFNQSSHLTTHKIIHTGEKPYKCEECGKAFNNSSILN RHKMIHTGEKLYKLESCNNACDNISNISKHKRNCAVLCPALSEPRAFMDLREEEMPANWS TGDHEQPRVAQKWHHVPPLWSVGLAARPRFRVLPGLKSPAHACFLEQEAQVCSPGCNGYS CTRPPTEKRWKIRDPWRKEVPDFSNRPISLGYKVPKPVPSTNRQAATGRVTFTQDPSVAT RCQSRIMNPHPPAQRRLQKLLNLADLNPTAQIPSFSQTPEIPCTTLQSLRTSPISANARA HPDIPTKRLEFRRQLTPTCHHRSRTWPGWDLRRPATITLSPPSSFGAATANQSQRILVGV RPPGMLGAVVLASHSLAQLFDASCCPPPERRRRLLPAGEAPDVSSEEEGPAPRRRRGSLG HPTAANSSDAKATPFWSHLLPGPKEPVLDPTDCGPMGRRLKGARRLKLSPLRSLRKGPGL LSPPSASPVPTPAVSRTLLGNFEESLLRGRFAPSGHIEGFTAEIGASGSYCPQHVTLPVT VTFFDVSEQNAPAPFLGIVDLNPLGRKGYSVPKVGTVQVTLFNPNQTVVKMFLVTFDFSD MPAAHMTFLRHRLFLVPVGEEGNANPTHRLLCYLLHLRFRSSRSGRLSLHGDIRLLFSRR SLELDTGLPYELQAVTEAPHNPRYSPLP >gi568815589r:34974111_35179524|GENSCAN_predicted_CDS_6|3687_bp tccgcgcactcaccgcgcacgcagagcagtgacatggccgaggatcgggcaagcctctgc cgcgccccagccgcgccggaccgctccctctttcctctcagcaggtgccgcgactgctct cgttaccggccccacgggtctggcagctcctcatggtcccggtcgcaggtggctcctcgc caggctgagccaggtattgggagatccacagctgagacgctaggaccccctggaagccta gaaatgggagtgttgacatttagggatgtggccatagaattctctctggaggagtggcat tgtctggacactgcacagcagaatttatataggaatgtgatgttagagaactacagaaac ctggtcttcctgggtattgctgcctctaagccagacctgatcacctgtctggagcaagga aaagaaccttggaatgtgaagagacatgagatggtagctgaacccccagttgtatgttct tattttgcccaagacctttggccaaaccagggcataaaaaattgtttccaaaaagtgata ctgagaagatataaaaaatgtggacatgaaaatttacagttaggaaaataccgtaaaagc atggatgagtgtaaggtgcatgaagaatgttgcaatggatttaaccagtgttcgagaact acccagagcaaaatattacaatgtgacaaatatgtgaaagtctttcataaattttcaaat tcaaacagatataagataagacatactggaaagaaacctttcaaatgtaaagaatgcgaa aagtcatttttcatgctttcacactcagctcaacataaaagaattcatagtggagagaaa ccctacaaatgtaaagaatgtgggaaagcctataatgaggcctcaaacctttctacacat aaaagaattcatactggaaagaaaccctacaaatgcgaagattgtggaaaagcctttaac tggctttcacatcttactacacaaaaaataattcataccggaaagaaaccctacaaatgt gaggactgtggcaaagcttttaaccaatctgcaaaccttactacacataagagaattcat actggagagaaaccctacaaatgtgaagaatgtggcaaagcttttagccagtcctcaacc 
ctcactacacataagataattcatgctggagagaaaccctacaaatgtgaagaatgtggc aaatcttttaaccaatcctcaataattcatactggagagaaattctacaaatgtgaagaa cgtggcaaagcctttagccagttctcacaccttactacacgtaagagaattcattctgga gagaaaccctacaaatgtgaagaatgtggcaaagcttttaaacaatcctcaacccttact acacataagagaattcattctggagagaaattctacaaatgtgaagtatgtagcaaagcc tttagccggttctcacaccttactacacataagagaattcatactggagagaagccctac aaatgtgaagaatgtggcaaagcttttaacctatcctcacaccttactacacataagata attcatactggagagaaaccctacaaatgtgaagaatgtggcaaagcttttaaccagtcc tcaactctttctaaacacaaggttattcatactggagagaaaccctacaaatgtggagaa tgtggcaaagcctttaaccagtcctcacaccttactacacataagataattcatactggt gagaaaccctacaagtgcgaagaatgtggcaaagcctttaacaactcctctattcttaac agacataagatgattcatacgggagagaaactctacaaacttgaaagttgtaacaatgct tgtgacaacatctcaaacatttccaaacataaaagaaattgtgctgtcctctgccctgct ctgtctgagcccagggcttttatggatctcagagaggaggaaatgcctgccaattggtcc acaggtgaccatgagcagccacgggtggcccagaaatggcaccacgtgcctccactctgg tctgtgggactggcggcacggccccgttttcgggtcctccctggtctgaagtcgccagct catgcctgcttcctggaacaggaggcccaggtctgcagccccgggtgcaatggctacagc tgcaccaggcccccaactgaaaaacgttggaagatcagggatccctggaggaaagaagtc ccagacttcagcaatcgtcctatcagtttgggctataaggtgcccaagccagtaccaagc accaataggcaagctgctacaggccgggtcaccttcactcaggatccctctgtggctacc agatgtcaatcgagaataatgaacccacacccgcctgcacagcgtaggcttcagaaactg ctcaacctcgcagacctgaaccccaccgcccagattccctcattttcacagacccctgag attccttgcactaccttgcaatccctacggacctcccctattagtgcaaatgcacgcgcc cacccagacatccctacaaagaggttagagttccggcggcagctcacaccgacttgtcat cacagaagccgcacgtggccggggtgggacctcaggcgacctgcgaccatcactttgtct cctccttcctcctttggggccgccaccgccaatcagagccagcggatcctggttggagtg cgaccgcccggcatgttgggagctgtagtccttgccagccacagcctggcacagctgttt gatgccagctgctgcccaccccctgaacgaagacgaaggctgcttcctgctggagaagcc ccagatgtcagctctgaggaagaggggccagcccctcggaggcgccggggatccctgggc caccctactgctgccaacagttctgatgccaaagccacacccttctggagccacctgctg cctgggcccaaagagcctgttttggacccaacagactgcggtcccatggggcggaggctg aaaggagcccgtcgcctgaagctgagcccccttcgaagcctccggaaggggccaggcctg 
ctgagcccccccagtgcctcccctgttcctacccctgctgtcagccgtaccctgctgggc aactttgaggaatcattgctgcgaggacgttttgcaccatctggccacattgagggcttc acagcagaaattggagctagtgggtcatactgcccccagcacgtcacgctgcctgtcact gtcacattctttgatgtttctgagcaaaatgccccggctcccttcctgggcatcgtggat ctgaaccccttggggaggaagggttacagcgtgcccaaggtgggcaccgtccaagtgacc ttatttaaccccaaccagactgtggtaaagatgttccttgtgacctttgacttctcggac atgcctgctgcccacatgaccttcctgcgccatcgcctctttttggtgcctgtgggtgag gagggaaatgctaaccccacccaccgcctcctctgctacttgctgcacctcaggttccgg agctcccgctcaggccgcttaagcctgcatggagatatccgcctgcttttttcccgccgg agcctggagctggacacagggctcccctacgaactgcaggctgtgaccgaggcccctcat aatccacgttattcacctttgccctga