GENSCAN 1.0 Date run: 7-Nov-116 Time: 04:01:22

Sequence gi568815577r:33404127_33639315 : 235189 bp : 42.48% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   7328   7445  118  1  1   70   76   111 0.267   8.51
 1.02 Intr +  10762  10894  133  2  1  103   66   100 0.458   8.08
 1.03 Intr +  17354  17559  206  2  2   99   95   181 0.898  17.82
 1.04 Intr +  21072  21212  141  1  0   20   31   134 0.019   0.30
 1.05 Intr +  23301  23374   74  1  2   13   80    82 0.014  -1.89
 1.06 Intr +  24735  25017  283  2  1  101   50    90 0.229   2.57
 1.07 Intr +  28588  28745  158  2  2   78   94   132 0.836  11.61
 1.08 Intr +  35102  35236  135  1  0  -20  110   129 0.235   4.14
 1.09 Intr +  39244  39365  122  0  2    3   76   108 0.046  -0.53
 1.10 Term +  45206  45500  295  2  1    7   43   194 0.035   0.59
 1.11 PlyA +  45783  45788    6                              -0.45

 2.06 PlyA -  45799  45794    6                               1.05
 2.05 Term -  48167  48064  104  0  2   79   48   108 0.809   3.36
 2.04 Intr -  56379  56287   93  1  0   85  106    29 0.552   3.42
 2.03 Intr -  61283  61216   68  1  2   93  103    63 0.702   5.93
 2.02 Intr -  62986  62884  103  2  1   66  103    15 0.602  -0.89
 2.01 Init -  64759  64669   91  1  1   76   76    53 0.634   3.60
 2.00 Prom -  65581  65542   40                              -4.35

 3.00 Prom +  67087  67126   40                              -4.95
 3.01 Init +  74693  74759   67  1  1   63   91    91 0.495   8.19
 3.02 Intr +  75756  75852   97  1  1   35   93    79 0.301   1.25
 3.03 Term +  75886  76189  304  1  1   45   38   129 0.348  -2.74
 3.04 PlyA +  77118  77123    6                               1.05

 4.02 PlyA -  77435  77430    6                               1.05
 4.01 Sngl -  77987  77454  534  2  0   43   34   701 0.982  56.02
 4.00 Prom -  78158  78119   40                              -4.85

 5.05 PlyA -  78395  78390    6                               1.05
 5.04 Term -  85298  84101 1198  1  1  103   41   826 0.019  69.58
 5.03 Intr -  86950  86857   94  2  1   25   65    59 0.010  -4.60
 5.02 Intr -  87776  87499  278  1  2   83   95   138 0.018  10.24
 5.01 Init -  92098  92040   59  1  2   77   49    53 0.024   1.13
 5.00 Prom -  97903  97864   40                              -4.65

 6.20 PlyA -  99066  99061    6                               1.05
 6.19 Term - 100189  99998  192  1  0   87   48   254 0.999  17.74
 6.18 Intr - 100401 100286  116  1  2  113   68    56 0.997   5.35
 6.17 Intr - 101576 101435  142  2  1   95  103    88 0.999  10.01
 6.16 Intr - 101978 101848  131  0  2  126   89   139 0.999  17.29
 6.15 Intr - 107332 107126  207  2  0   69   75   207 0.983  15.63
 6.14 Intr - 113015 112863  153  0  0   66   82    78 0.601   4.12
 6.13 Intr - 113482 113231  252  2  0   91   65   231 0.977  17.58
 6.12 Intr - 116436 116238  199  0  1   55   56   164 0.983   7.90
 6.11 Intr - 116889 116780  110  1  2   60   84   114 0.998   7.28
 6.10 Intr - 118156 118062   95  0  2   90   82   126 0.999  10.89
 6.09 Intr - 120874 120643  232  2  1   74   98   223 0.999  17.81
 6.08 Intr - 124209 124041  169  0  1   79  109   152 0.801  15.00
 6.07 Intr - 124478 124351  128  0  2   57   13    84 0.359  -2.72
 6.06 Intr - 124811 124724   88  2  1   99   96    67 0.961   7.22
 6.05 Intr - 126758 126633  126  2  0   88   80   103 0.998   9.46
 6.04 Intr - 128330 128219  112  1  1   82   96   138 0.999  13.46
 6.03 Intr - 130627 130453  175  2  1   67   92   135 0.998   9.88
 6.02 Intr - 131194 131099   96  2  0  102   63   117 0.999   9.66
 6.01 Init - 135189 135045  145  0  1   74   46   125 0.971   7.23
 6.00 Prom - 135264 135225   40                              -7.75

 7.00 Prom + 138164 138203   40                              -9.35
 7.01 Init + 138967 139043   77  0  2  112   97    91 0.878  13.01
 7.02 Intr + 142087 142253  167  1  2   21  111   141 0.846   8.38
 7.03 Intr + 145350 151265 5916  1  0   67   92  6568 0.997 641.54
 7.04 Intr + 153030 153190  161  1  2  102   99   205 0.999  21.79
 7.05 Intr + 155104 155250  147  0  0   53   82   157 0.888  11.11
 7.06 Intr + 155461 155649  189  0  0   57  107   221 0.999  19.76
 7.07 Intr + 155751 155934  184  2  1   81   61   101 0.487   5.14
 7.08 Intr + 156006 156085   80  1  2   -5   37   122 0.227  -3.85
 7.09 Intr + 163031 163141  111  1  0   83   75   155 0.683  13.36
 7.10 Intr + 164845 164961  117  0  0   91   81    83 0.965   7.64
 7.11 Intr + 169182 169329  148  2  1   83   87    96 0.999   7.89
 7.12 Intr + 171652 171767  116  2  2  109   61    77 0.999   6.35
 7.13 Term + 172292 172450  159  1  0   77   33    97 0.797   0.06
 7.14 PlyA + 173182 173187    6                               1.05

 8.09 PlyA - 173313 173308    6                               1.05
 8.08 Term - 174318 174181  138  0  0   77   47    81 0.875  -0.12
 8.07 Intr - 175310 175224   87  2  0   87  115    16 0.765   3.45
 8.06 Intr - 177374 177176  199  1  1   71   85   114 0.862   7.83
 8.05 Intr - 178120 178039   82  2  1  107   30    52 0.839  -0.92
 8.04 Intr - 179464 179362  103  1  1   44   93   121 0.504   7.03
 8.03 Intr - 182055 181852  204  0  0   74   93    58 0.899   3.37
 8.02 Intr - 184059 183832  228  0  0   31   61   238 0.627  12.54
 8.01 Init - 184515 184195  321  0  0   95  117   307 0.999  31.68
 8.00 Prom - 190753 190714   40                              -2.95

 9.09 PlyA - 191024 191019    6                               1.05
 9.08 Term - 191402 191307   96  2  0   78   47    52 0.415  -2.91
 9.07 Intr - 193275 193154  122  1  2   26   79   132 0.436   5.39
 9.06 Intr - 195122 195024   99  0  0   91   92   127 0.694  12.56
 9.05 Intr - 209480 209412   69  0  0  104   71    76 0.507   5.74
 9.04 Intr - 212624 212580   45  0  0   68  111    41 0.617   1.86
 9.03 Intr - 217942 217870   73  1  1   44  115    56 0.097   1.96
 9.02 Intr - 220634 220557   78  2  0   76   67    54 0.064   0.93
 9.01 Intr - 227431 227360   72  1  0   83  116    28 0.120   3.78

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815577r:33404127_33639315|GENSCAN_predicted_peptide_1|554_aa MVNRRVEGGIMVPKDVHALILRVREPGNVLPDVQKGLRRYPLSQLPAPQHPKIRLYNAEQ VLSWEPVALSNSTRPVVYQVQFKYTDSKWFTADIMSIGVNCTQITATECDFTAASPSAGF PMDFNVTLRLRAELGALHSAWVTMPWFQHYRNGMSHHAHPVIPLFGSDTDRERLRGKAIY LETPHIVVQCSLEPAFLQCFCEKGTCEIGGKQREKELSREQYQQDGVHPLLCTESAPRGR VATCILELGRAAQHWRTQLQDPGASLVEQQALMKSTQNRLPLPGTRLAPCHLASLGCSFL VYKMKQLSYKLEPVKRSSPSTELQQVILISVGTFSLLSVLAGACFFLVLKYRGLIKYWFH TPPSIPLQIEEEIGDRRASSVGQQDGGQRGVEDTSIAGEQERVSAQQDRGQINSVRWSTS SRGCGYIPVGCAPHNPTGAVDLNRTQRHTATASSPDTNPLKREERSDAKWAYIRAVDTTT GIIHTTKALQHPSGEKVQHKICVTKESKSSNKGPEEYKQCKYSTANSVKGVFDWSMNFQV EDIFHRNIHPKLES >gi568815577r:33404127_33639315|GENSCAN_predicted_CDS_1|1665_bp atggtgaacaggcgtgtggagggcggaataatggtccccaaagatgtccacgccctcatc ctcagagtccgtgaacctgggaatgtgctgcctgacgtacaaaagggactccgcagatac cctctttcccagctgcccgctcctcagcacccgaagattcgcctgtacaacgcagagcag gtcctgagttgggagccagtggccctgagcaatagcacgaggcctgttgtctaccaagtg cagtttaaatacaccgacagtaaatggttcacggccgacatcatgtccataggggtgaat
tgtacacagatcacagcaacagagtgtgacttcactgccgccagtccctcagcaggcttc ccaatggatttcaatgtcactctacgccttcgagctgagctgggagcactccattctgcc tgggtgacaatgccttggtttcaacactatcggaatggtatgagccaccatgcccatccc gtgatacccttatttggcagtgacactgatcgtgagaggcttcgtggaaaggccatttac ctggagactccacacattgtcgtccagtgttccctagagccagccttcctccagtgcttt tgtgaaaagggcacatgcgaaatcggggggaaacagagggagaaagagctttccagagag cagtaccagcaggatggagtccatccacttctctgcactgagtcagcccccagagggcga gttgccacttgtatcctagagctgggcagggctgcccagcactggagaactcagctccag gaccctggggccagccttgttgagcagcaagctctaatgaagagcacacagaacaggctt ccactgcctgggacccgcttagcaccatgtcacttagcctctctgggctgcagcttcctc gtgtataaaatgaaacaactgagctataagctggagcctgtgaagaggagcagcccctcc actgagcttcagcaagtcatcctgatctccgtgggaacattttcgttgctgtcggtgctg gcaggagcctgtttcttcctggtcctgaaatatagaggcctgattaaatactggtttcac actccaccaagcatcccattacagatagaagaggaaatcggagacagaagagcttcttct gttgggcagcaggatggtggccagcgaggagtggaggatacatctatagcaggagaacag gaaagagtttcagcccagcaggacagagggcaaatcaactctgttaggtggagcaccagt agtcggggctgtgggtacattcctgtaggctgtgcaccgcacaaccccacaggtgctgtt gacctaaacagaacacaaaggcacacagcaactgctagtagtcctgatacaaatcctctc aagagagaggagaggagtgatgccaaatgggcttacattagagccgtggacactaccact ggtattattcatacaaccaaggctctacaacacccctctggagaaaaagtgcaacacaaa atctgtgtaacaaaggaaagcaaaagtagcaataagggcccagaggaatacaaacagtgc aaatacagtactgcaaactcagtaaaaggagtttttgattggagtatgaactttcaagtt gaagatatatttcacaggaatattcacccaaagcttgagagctag >gi568815577r:33404127_33639315|GENSCAN_predicted_peptide_2|152_aa MAGFLDNFRWPECECIDWSERRNAVASVVAGWWIMIDAAVVYPKPEQLNHAFHTCGVFST LAFFMINAVSNAQVRGDSYESGCLGRTGARVWLFIGFMLMFGSLIASMWILFGAYVTQNH HALPGFRLVALTLLVLKDVWQFLLSYMSKTCD >gi568815577r:33404127_33639315|GENSCAN_predicted_CDS_2|459_bp atggcaggcttcctagataattttcgttggccagaatgtgaatgtattgactggagtgag agaagaaatgctgtggcatctgttgtcgcaggctggtggataatgattgatgcagctgtg gtgtatcctaagccagaacagttgaaccatgcctttcacacatgtggtgtattttccaca ttggctttcttcatgataaatgctgtatccaatgctcaggtgagaggtgatagctatgaa agcggctgtttaggaagaacaggtgctcgagtttggcttttcattggtttcatgttgatg 
tttgggtcacttattgcttccatgtggattctttttggtgcatatgttacccaaaaccac catgctctacctggatttcgccttgtggccttgacactcctggttttgaaagatgtttgg cagtttctgctgtcttatatgtcaaagacatgtgactga >gi568815577r:33404127_33639315|GENSCAN_predicted_peptide_3|155_aa MMKKCDRPESSEPLKCQGIHSAGRRLLRHNPAGVPRLHLSPGSPELGADARIERRDQSER SKWPAGGGAGREGAGRGRWLGAAGRRALLATNRRAASGRPGAGRAPRDGVGCSAGLPEPE FRPSYLRFRAERDGPLLSPAGGRPPPGSHCCVSAR >gi568815577r:33404127_33639315|GENSCAN_predicted_CDS_3|468_bp atgatgaaaaagtgtgaccggcctgagagttcagagcctctgaagtgtcaagggatccac agtgcaggaaggagactgctgcgccacaaccctgccggcgtcccgcggctccacctcagc cccgggagcccggagctgggagcagacgcgaggatagagcgccgcgaccaatcggagcgc agcaagtggccggccgggggcggggcgggacgcgagggggcggggagagggcgttggctg ggcgcagcgggacgccgggcgctcctcgcgaccaatcggcgtgcagcaagtggccggccg ggggcagggcgagctccgagggacggggtgggctgctctgcaggcctcccggagccggag ttcagacccagctatctacggttccgggcagagcgggacgggcctctcctgtcccctgcc ggtgggcggccgccgccgggctcccactgctgcgtttctgcacgctga >gi568815577r:33404127_33639315|GENSCAN_predicted_peptide_4|177_aa MQINGISLQDYTAVKEKYAKYLPHSAGWYAAKCFRKAQCPIVEPLTNSMMMHGCNNSNKL MIMCIIKHAFDFIHLLTGENPLQVLVNAIINSGPWEDSTCIGRAGTVRQQAVDMSPLHCV NQVVWLLCTGTREAAFWNIKTIAECLADELINATKASSSSYAINKKDELECGDKSNR >gi568815577r:33404127_33639315|GENSCAN_predicted_CDS_4|534_bp atgcagatcaatggcatttccctgcaggattacactgcagtgaaggagaagtatgccaag tacctgcctcacagtgctgggtggtatgcagccaaatgcttccgcaaagctcagtgcccc attgtggagcccctcactaactccatgatgatgcacggctgcaacaacagcaacaagctc atgatcatgtgcatcatcaagcatgccttcgatttcatccacctgctcacaggcgagaac cctctccaggtcctggtgaacgccatcatcaacagtggtccctgggaggactccacatgc attgggcgagcagggactgtgagacaacaggctgtggacatgtccccactgcactgtgtg aatcaggtcgtctggctgctgtgcacaggcactcgtgaggctgccttctggaacatcaag accattgctgagtgcctggcggatgagctcatcaatgccaccaaggcctcctccagctcc tatgccatcaacaagaaggatgagctggagtgtggggacaagtccaaccgctga >gi568815577r:33404127_33639315|GENSCAN_predicted_peptide_5|542_aa MTSKEESRRQQPTAGPAGQGKLPSPSEPQLPTPPTRSLHHFRRPLSPSREAQAHIAPSSE LHLPQSQSAGPPPLGAGTEVELVVPGRDEGSRGALPGSSGVKFVWRKIVRFPVSDQVRTL 
SISRLMRRLLEMMQTLVQFIIGWRSLLGRTLGTIMNTMYVMMAQILRSHLIKATVIPNRV KMLPYFGIIRNRMMSTHKSKKKIREYYRLLNVEEGCSADEVRESFHKLAKQYHPDSGSNT ADSATFIRIEKAYRKVLSHVIEQTNASQSKGEEEEDVEKFKYKTPQHRHYLSFEGIGFGT PTQREKHYRQFRADRAAEQVMEYQKQKLQSQYFPDSVIVKNIRQSKQQKITQAIERLVED LIQESMAKGDFDNLSGKGKPLKKFSDCSYIDPMTHNLNRILIDNGYQPEWILKQKEISDT IEQLREAILVSRKKLGNPMTPTEKKQWNHVCEQFQENIRKLNKRINDFNLIVPILTRQKV HFDAQKEIVRAQKIYETLIKTKEVTDRNPNNLDQGEGEKTPEIKKGFLNWMNLWKFIKIR SF >gi568815577r:33404127_33639315|GENSCAN_predicted_CDS_5|1629_bp atgacttcgaaagaggaaagcaggaggcagcagcccacagctggtcctgcagggcaggga aagttaccctcgccctccgagccacaactccccacgccgccaactcggtctttacatcat tttcgacgccccctaagtccctcccgagaggcgcaggcgcacatcgccccttctagcgaa ctacatctcccacaatcccaatcggccggaccacctccgctcggggcggggacggaggtg gagctggtggtccccggtcgggacgaaggctcccgaggtgccctgcctgggtcctccggg gtaaagttcgtttggcggaagattgtccgttttcctgtatcagaccaggtacgtaccttg tccataagtcgtcttatgagaagacttcttgagatgatgcagactctcgtgcagtttata atcggctggaggtcattgctaggaagaactctgggtacaataatgaatacaatgtatgtg atgatggctcagatcttaagatctcacctgataaaggctacagtgattcctaatcgagtg aaaatgcttccatattttggtatcattagaaatagaatgatgtcaacccataaatccaaa aagaagatcagagaatattatagactgctgaacgtggaggaaggatgctctgcagatgaa gtcagggaatcttttcataagcttgccaagcaatatcatcctgacagtggctctaatact gctgattctgcaacatttataaggattgaaaaagcttatagaaaggtgctctcccatgtg atagaacaaacaaatgccagtcagagtaaaggtgaagaagaagaagatgtagaaaaattc aaatataaaacaccccaacaccgacattatttaagttttgaaggtattggttttgggact ccaactcaacgagagaagcattataggcaatttagggcagaccgtgctgctgaacaagtg atggaatatcaaaagcagaaactacaaagccagtattttcctgatagtgtaattgttaaa aatataagacagagcaaacagcaaaagataacgcaagctatagaacgtttagtggaggac ctcattcaagaatccatggcaaaaggagactttgacaatctcagtgggaaaggaaaacct ctgaaaaagttttctgactgttcttacattgatcccatgactcacaacctgaaccgaata ctgatcgataatggataccaaccagaatggatccttaagcaaaaggaaataagcgatact attgagcaactcagagaggcaattttagtgtctaggaaaaaacttgggaatccaatgaca ccaactgaaaagaaacagtggaaccatgtttgtgagcagtttcaagaaaacatcagaaaa ttaaacaagcgaattaatgattttaatttaattgttcccatcctgaccaggcaaaaagtc 
cattttgatgctcagaaagaaattgtcagagcccagaaaatatacgagacccttataaaa acaaaagaagtcacagatagaaacccaaataaccttgatcaaggagaaggagagaaaaca cctgaaatcaagaaaggttttttaaactggatgaatctgtggaaatttattaaaatacga tcattttga >gi568815577r:33404127_33639315|GENSCAN_predicted_peptide_6|955_aa MAARVLIIGSGGREHTLAWKLAQSHHVKQVLVAPGNAGTACSEKISNTAISISDHTALAQ FCKEKKIEFVVVGPEAPLAAGIVGNLRSAGVQCFGPTAEAAQLESSKRFAKEFMDRHGIP TAQWKAFTKPEEACSFILSADFPALVVKASGLAAGKGVIVAKSKEEACKAVQEIMQCLCF TDGKTVAPMPPAQDHKRLLEGDGGPNTGGMGAYCPAPQVSNDLLLKIKDTVLQRTVDGMQ QEGTPYTGILYAGIMLTKNGPKVLEFNCRFGDPECQVGKNIKVLLGRRYVVILPLLKSDL YEVIQSTLDGLLCTSLPVWLENHTALTVVMASKGYPGDYTKGVEITGFPEAQALGLEVFH AGTALKNGKVVTHGGRVLAVTAIRENLISALEEAKKGLAAIKFEGAIYRKDVGFRAIAFL QQPRSLTYKESGVDIAAGNMLVKKIQPLAKATSRSGCKVDLGGFAGLFDLKAAGFKDPLL ASGTDGVGTKLKIAQLCNKHDTIGQDLVAMCVNDILAQGAEPLFFLDYFSCGKLDLSVTE AVVAGIAKACGKAGCALLGGETAEMPDMYPPGEYDLAGFAVGAMERDQKLPHLERITEGD VVVGIASSGLHSNGFSLVRKIVAKSSLQYSSPAPDGCGDQTLGDLLLTPTRIYSHSLLPV LRSGHVKAFAHITGGGLLENIPRVLPEKLGVDLDAQTWRIPRVFSWLQQEGHLSEEEMAR TFNCGVGAVLVVSKEQTEQILRDIQQHKEEAWVIGSVVARAEGSNLQALIDSTREPNSSA QIDIVISNKAAVAGLDKAERAGIPTRVINHKLYKNRVEFDSAIDLVLEEFSIDIVCLAGF MRILSGPFVQKWNGKMLNIHPSLLPSFKGSNAHEQALETGVTVTGCTVHFVAEDVDAGQI ILQEAVPVKRGDTVATLSERVKLAEHKIFPAALQLVASGTVQLGENGKICWVKEE >gi568815577r:33404127_33639315|GENSCAN_predicted_CDS_6|2868_bp atggcagcccgagtacttataattggcagtggaggaagggaacatacgctggcctggaaa cttgcacagtctcatcatgtcaaacaagtgttggttgccccaggaaacgcaggcactgcc tgctctgaaaagatttcaaataccgccatctcaatcagtgaccacactgcccttgctcaa ttctgcaaagagaagaaaattgaatttgtagttgttggaccagaagcacctctggctgct gggattgttgggaacctgaggtctgcaggagtgcaatgctttggcccaacagcagaagcg gctcagttagagtccagcaaaaggtttgccaaagagtttatggacagacatggaatccca accgcacaatggaaggctttcaccaaacctgaagaagcctgcagcttcattttgagtgca gacttccctgctttggttgtgaaggccagtggtcttgcagctggaaaaggggtgattgtt gcaaagagcaaagaagaggcctgcaaagctgtacaagagatcatgcagtgtctgtgtttc actgatggcaagactgtggcccccatgcccccagcacaggaccataagcgattactggag ggagatggtggccctaacacagggggaatgggagcctattgtccagcccctcaggtttct 
aatgatctattactaaaaattaaagatactgttcttcagaggacagtggatggcatgcag caagagggtactccatatacaggtattctctatgctggaataatgctgaccaagaatggc ccaaaagttctagagtttaattgccgttttggtgatccagagtgccaagtgggtaaaaat atcaaagtattacttggtagaagatatgtggtaatcctcccacttcttaaaagtgatctt tatgaagtgattcagtccaccttagatggactgctctgcacatctctgcctgtttggcta gaaaaccacaccgccctaactgttgtcatggcaagtaaaggttatcctggagactacacc aagggtgtagagataacagggtttcctgaggctcaagctctaggactggaggtgttccat gcaggcactgccctcaaaaatggcaaagtagtaactcatgggggtagagttcttgcagtc acagccatccgggaaaatctcatatcagcccttgaggaagccaagaaaggactagctgct ataaagtttgagggagcaatttataggaaagacgtcggctttcgtgccatagctttcctc cagcagcccaggagtttgacttacaaggaatctggagtagatatcgcagctggaaatatg ctggtcaagaaaattcagcctttagcaaaagccacttccagatcaggctgtaaagttgat cttggaggttttgctggtctttttgatttaaaagcagctggtttcaaagatccccttctg gcctctggaacagatggcgttggaactaaactaaagattgcccagctatgcaataaacat gataccattggtcaagatttggtagcaatgtgtgttaatgatattctggcacaaggagca gagcccctcttcttccttgattacttttcctgtggaaaacttgacctcagtgtaactgaa gctgttgttgctggaattgctaaagcttgtggaaaagctggatgtgctctccttggaggt gaaacagcagaaatgcctgacatgtatccccctggagagtatgacctagctgggtttgcc gttggtgccatggagcgagatcagaaactccctcacctggaaagaatcactgagggtgat gttgttgttggaatagcttcatctggtcttcatagcaatggatttagccttgtgaggaaa atcgtggcaaaatcttccctccagtactcctctccagcacctgatggttgtggtgaccag actttaggggacttacttctcacgcctaccagaatctacagccattcactgttacctgtc ctacgttcaggacatgtcaaagcctttgcccatattactggtggaggattactagagaac atccccagagtcctccctgagaaacttggggtagatttagatgcccagacctggaggatc cccagggtcttctcatggttgcagcaggaaggacacctctctgaggaagagatggccaga acatttaactgtggggttggcgctgtccttgtggtatcaaaggagcagacagagcagatt ctgagggatatccagcagcacaaggaagaagcctgggtgattggcagtgtggttgcacga gctgaaggatcgaacctgcaagcacttatagacagtactcgggaaccaaatagctctgca caaattgatattgttatctccaacaaagccgcagtagctgggttagataaagcggaaaga gctggtattcccactagagtaattaatcataaactgtataaaaatcgtgtagaatttgac agtgcaattgacctagtccttgaagagttctccatagacatagtctgtcttgcaggattc atgagaattctttctggcccctttgtccaaaagtggaatggaaaaatgctcaatatccac 
ccatccttgctcccttcttttaagggttcaaatgcccatgagcaagccctggaaaccgga gtcacagttactgggtgcactgtacactttgtagctgaagatgtggatgctggacagatt attttgcaagaagctgttcccgtgaagaggggtgatactgtcgcaactctttctgaaaga gtaaaattagcagaacataaaatatttcctgcagcccttcagctggtggccagtggaact gtacagcttggagaaaatggcaagatctgttgggttaaagaggaatga >gi568815577r:33404127_33639315|GENSCAN_predicted_peptide_7|2523_aa MATNIEQIFRSFVVSKFREIQQELSSGRNEGQLNGETNTPIEGNQAGDAAASARSLPNEE IVQKIEEVLSGVLDTELRYKPDLKEGSRKSRCVSVQTDPTDEIPTKKSKKHKKHKNKKKK KKKEKEKKYKRQPEESESKTKSHDDGNIDLESDSFLKFDSEPSAVALELPTRAFGPSETN ESPAVVLEPPVVSMEVSEPHILETLKPATKTAELSVVSTSVISEQSEQSVAVMPEPSMTK ILDSFAAAPVPTTTLVLKSSEPVVTMSVEYQMKSVLKSVESTSPEPSKIMLVEPPVAKVL EPSETLVVSSETPTEVYPEPSTSTTMDFPESSAIEALRLPEQPVDVPSEIADSSMTRPQE LPELPKTTALELQESSVASAMELPGPPATSMPELQGPPVTPVLELPGPSATPVPELPGPL STPVPELPGPPATAVPELPGPSVTPVPQLSQELPGLPAPSMGLEPPQEVPEPPVMAQELP GLPLVTAAVELPEQPAVTVAMELTEQPVTTTELEQPVGMTTVEHPGHPEVTTATGLLGQP EATMVLELPGQPVATTALELPGQPSVTGVPELPGLPSATRALELSGQPVATGALELPGPL MAAGALEFSGQSGAAGALELLGQPLATGVLELPGQPGAPELPGQPVATVALEISVQSVVT TSELSTMTVSQSLEVPSTTALESYNTVAQELPTTLVGETSVTVGVDPLMAPESHILASNT METHILASNTMDSQMLASNTMDSQMLASNTMDSQMLASSTMDSQMLATSSMDSQMLATSS MDSQMLATSTMDSQMLATSSMDSQMLATSSMDSQMLATSSMDSQMLATSSMDSQMLATST MDSQMLATSTMDSQMLATSSMDSQMLASGTMDSQMLASGTMDAQMLASGTMDAQMLASST QDSAMLGSKSPDPYRLAQDPYRLAQDPYRLGHDPYRLGHDAYRLGQDPYRLGHDPYRLTP DPYRMSPRPYRIAPRSYRIAPRPYRLAPRPLMLASRRSMMMSYAAERSMMSSYERSMMSY ERSMMSPMAERSMMSAYERSMMSAYERSMMSPMAERSMMSAYERSMMSAYERSMMSPMAD RSMMSMGADRSMMSSYSAADRSMMSSYSAADRSMMSSYTADRSMMSMAADSYTDSYTDTY TEAYMVPPLPPEEPPTMPPLPPEEPPMTPPLPPEEPPEGPALPTEQSALTAENTWPTEVP SSPSEESVSQPEPPVSQSEISEPSAVPTDYSVSASDPSVLVSEAAVTVPEPPPEPESSIT LTPVESAVVAEEHEVVPERPVTCMVSETPAMSAEPTVLASEPPVMSETAETFDSMRASGH VASEVSTSLLVPAVTTPVLAESILEPPAMAAPESSAMAVLESSAVTVLESSTVTVLESST VTVLEPSVVTVPEPPVVAEPDYVTIPVPVVSALEPSVPVLEPAVSVLQPSMIVSEPSVSV QESTVTVSEPAVTVSEQTQVIPTEVAIESTPMILESSIMSSHVMKGINLSSGDQNLAPEI GMQEIALHSGEEPHAEEHLKGDFYESEHGINIDLNINNHLIAKEMEHNTVCAAGTSPVGE 
IGEEKILPTSETKQRTVLDTYPGVSEADAGETLSSTGPFALEPDATGTSKGIEFTTASTL SLVNKYDVDLSLTTQDTEHDMVISTSPSGGSEADIEGPLPAKDIHLDLPSNNNLVSKDTE EPLPVKESDQTLAALLSPKESSGGEKEVPPPPKETLPDSGFSANIEDINEADLVRPLLPK DMERLTSLRAGIEGPLLASDVGRDRSAASPVVSSMPERASESSSEEKDDYEIFVKVKDTH EKSKKNKNRDKGEKEKKRDSSLRSRSKRSKSSEHKSRKRTSESRSRARKRSSKSKSHRSQ TRSRSRSRRRRRSSRSRSKSRGRRSVSKEKRKRSPKHRSKSRERKRKRSSSRDNRKTVRA RSRTPSRRSRSHTPSRRRRSRSVGRRRSFSISPSRRSRTPSRRSRTPSRRSRTPSRRSRT PSRRSRTPSRRSRTPSRRRRSRSVVRRRSFSISPVRLRRSRTPLRRRFSRSPIRRKRSRS SERGRSPKRLTDLDKAQLLEIAKANAAAMCAKAGVPLPPNLKPAPPPTIEEKVAKKSGGA TIEELTEKCKQIAQSKEDDDVIVNKPHVSDEEEEEPPFYHHPFKLSEPKPIFFNLNIAAA KPTPPKSQVTLTKEFPVSSGSQHRKKEADSVYGEWVPVEKNGEENKDDDNVFSSNLPSEG RVKRQGRVRRQMKQPAASHLTVTRCNSLCGTKPQSEKHRIAENSVITSLPNIGPSLHLWE ANLSGGGIDRRGQILRFNTRTGLEKVIPVDISTAMSERALAQKRLSENAFDLEAMSMLNR AQERIDAWAQLNSIPGQFTGSTGVQVLTQEQLANTGAQAWIKKDQFLRAAPVTGGMGAVL MRKMGWREGEGLGKNKEGNKEPILVDFKTDRKGKHPVSALMEICNKRRWQPPEFLLVHDS GPDHRKHFLFRVLINGSAYQPSFASPNKKHAKATAATVVLQAMGLVPKDLMANATCFRSA SRR >gi568815577r:33404127_33639315|GENSCAN_predicted_CDS_7|7572_bp atggcgaccaacatcgagcagatttttaggtctttcgtggtcagtaaattccgggaaatt caacaggagctttccagtggaaggaatgaaggccagctgaatggtgaaacaaatacaccc attgaaggaaaccaggcgggtgatgcagctgcctctgccaggagtctaccaaatgaagaa atagtgcagaagatagaggaagtactttctggggtcttagatacagaactacgatataag ccagacttgaaagagggctccagaaaaagtagatgcgtatctgtacaaacagatcctact gatgaaattcccactaaaaagtcaaagaagcataaaaagcacaaaaacaaaaagaagaaa aagaagaaagaaaaggaaaaaaaatataaaagacagccagaagaatctgagtcaaagacg aaatctcatgatgatgggaacatagatttagaatctgattcctttttaaagtttgattct gaaccttcagctgtggcgctggagcttcctacaagagcatttggcccatctgagaccaat gaatcccctgcagttgtgctagaacctcctgtagtatcaatggaggtatcagagccacac atcttagaaactctgaagccagctacaaaaactgcagaactgtcagttgtatctacatca gtaatctcagagcagtcagagcagtctgtggcagtaatgccagaaccatccatgacaaag attctggattcctttgcagcagcaccagtgcctactacaacactggtgttgaagtcatct gagccagttgtaacaatgtcagtggagtatcagatgaagtctgtgctgaaatctgtggag agcacatctccagagccatcaaagatcatgttggtagagcccccagtagcaaaagtgtta 
gagccttcagaaacccttgtggtatcatcagagacacctactgaggtgtaccctgagcca agcacatcaacaacaatggattttccagagtcatctgcaattgaagcgctaagattgcca gagcagcctgtagacgtaccatcggagattgcagattcatccatgacaagaccgcaggag ttgccggagctgcctaagaccacagcgttggagctgcaggagtcgtcggtggcctcagcg atggagttgccggggccacctgcgacctccatgccggagttgcaggggccccctgtgact ccagtgctggagttacctgggccctctgctaccccggtgccagagttgccagggcccctt tctaccccagtgcctgagttgccagggccccctgcgacagcagtgcctgagttgccaggg ccctctgtgacaccagtgccacagttgtcgcaggaattgccagggcttccagcaccatcc atggggttggagccaccacaggaggtaccagagccacctgtgatggcacaggagttgcca gggctgcctttggtgacagcagcagtagagttgccagagcagcctgcggtaacagtagca atggagttgaccgaacaacctgtgacgacgacagagttggagcagcctgtggggatgaca acggtggaacatcctgggcatcctgaggtgacaacggcaacagggttgctggggcagcct gaggcaacgatggtgctggagttgccaggacagccagtggcaacgacagcgctggagttg ccggggcagccttcggtgactggggtgccagagttgccagggctgccttcggcaactagg gcactggagttgtcggggcagcctgtggcaactggggcactagagttgcctgggccgctc atggcagctggggcactggagttctcggggcagtctggggcagctggagcactggagctt ttggggcagcctctggcaacaggggtgctggagttgccagggcagcctggggcgccagag ttgcctgggcagcctgtggcaactgtggcgctggagatctctgttcagtctgtggtgaca acatcggagctgtcaacgatgaccgtgtcgcagtccctggaggtgccctcgacgacagcg ctggaatcctataatacggtagcacaggagctgcctactacattagtgggggagacttct gtaacagtaggagtggatcccttgatggccccagaatcccatatattagcttctaacacc atggagacccatatattagcatccaacaccatggactcccaaatgctagcgtccaacacc atggactcccagatgctagcatccaacaccatggactcccagatgttagcgtctagcacc atggactcccagatgttagcaactagctccatggactcccagatgttagcaactagctcc atggactcccagatgttagcaactagcactatggactcccagatgttagcaaccagttcc atggactcccagatgttagcaaccagctccatggactcccagatgttagcaaccagctcc atggactcccagatgttagcaaccagctccatggactcccagatgttagcaaccagcacc atggattctcagatgttagcaaccagcaccatggactcccagatgttagcaactagctca atggattcccagatgttagcatctggcactatggactctcaaatgttagcttctggcacc atggatgctcagatgttagcgtctggtaccatggatgcccagatgttagcgtctagtacc caagattctgctatgttgggttcaaaatctcctgatccctataggttagctcaggatcct tacaggttagctcaggatccctataggttgggccatgacccctatagattaggtcatgat 
gcttacaggttaggacaagacccttatagattaggccatgatccctacagactaactcct gatccctataggatgtcacctagaccctacaggatagcacccaggtcctatagaatagca cccaggccatataggttagcacctagacccctgatgttagcatctagacgttctatgatg atgtcctatgctgcagaacgttccatgatgtcatcttacgaacgctctatgatgtcttat gagcggtctatgatgtcccctatggctgaacgctctatgatgtcagcctacgagcgctct atgatgtcagcctacgagcgctctatgatgtcccctatggctgagcgctctatgatgtca gcttatgaacgctccatgatgtcagcttatgaacgctccatgatgtccccaatggctgat cgatctatgatgtccatgggtgctgaccggtctatgatgtcgtcatactctgctgctgac cggtctatgatgtcatcgtactctgcagctgaccgatctatgatgtcatcttatactgct gatcgttcaatgatgtctatggctgctgattcttacaccgattcttacactgacacatat acagaggcatatatggtgccacctttgcctcctgaagagcccccaacaatgccaccgttg ccacctgaggagccaccaatgacaccaccattgcctcctgaggaaccaccagagggtcca gcattgcccactgagcagtcagcattaacagctgaaaatacttggcctacagaggtgcca tcatcaccatctgaagagtctgtatcgcagcctgagcctcctgtgagtcaaagtgagatt tcggagccttcagcagtgcctactgattattcagtgtcagcatcagatccctcagtttta gtatcagaggctgctgtgactgttccagaaccaccaccagagccagaatcttcaattacg ttaacacctgtagagtctgcagtagtagcagaagaacatgaagttgttccagagagacca gtgacttgtatggtatctgaaactcccgccatgtcagctgaaccaactgtgttagcatca gagcctcctgttatgtcagagacagcagaaacatttgattccatgagagcctcaggacat gttgcctcagaagtatctacatccttgttggttccagcagtaactactccagtgctggca gagagcattctggagccgccagccatggctgccccagagtcttcagctatggctgtcctg gagtcttcggctgtgaccgtcctggagtcttcgactgtgactgtcctggagtcttcgact gtaactgtcctggagccttcggttgtgactgtcccggagcctcctgttgtggctgagcca gactatgttaccattcctgtgccagttgtttctgcgctggagccttctgtgcctgttctg gaaccagcggtgtcagtccttcaaccttctatgattgtttcagaaccatctgtttctgtc caggaatcgactgtgacagtttcagagcctgctgtcacagtctcagagcagactcaagta ataccaactgaggtggctatagagtccacaccaatgatactggaatctagtatcatgtca tcacatgttatgaaaggaattaatctatcctctggtgatcaaaatcttgctccagagatt ggcatgcaggagattgcattgcattcaggtgaagaaccacatgctgaggaacacctgaaa ggtgacttttacgaaagtgaacatggtataaatatagaccttaatataaataatcattta attgctaaagagatggaacataatacagtgtgtgctgctggtactagtcctgttggggaa attggtgaagagaaaattttgcccaccagtgagactaaacagcgcacagtattggatacc 
taccctggtgttagtgaagctgatgcaggagaaactctatcttctactggtccttttgct ctggaacctgatgcaacaggaactagtaagggtattgaatttaccacagcatctactctc agtttagttaataaatatgatgttgatttatctttaactactcaagatactgaacatgac atggtaatttccaccagtcctagtggtggtagtgaagctgacattgaagggcctttgcct gctaaagatattcatcttgatttaccatctaataataaccttgttagtaaggatacagaa gaaccattacctgtaaaagagagtgaccagacattagcagctctgctcagccctaaagaa agtagtggaggagaaaaagaagtacctccccctcctaaagagacactgcctgattcagga ttttctgccaatattgaggatattaatgaagcagatttagtgagaccgttacttcctaag gacatggaacgtcttacaagccttagagctggcattgaaggacctttacttgcaagtgat gttggacgtgacagatctgctgccagcccggttgtaagtagtatgccagaaagagcttca gagtcttcttcagaggaaaaagatgattatgaaatttttgtaaaagttaaggacactcac gaaaaaagcaagaaaaataagaaccgtgataagggggagaaagagaagaaaagagactct tcattaagatctcgaagtaagcgttccaaatcttctgaacacaaatcacgcaagcgtacc agtgaatctcgttctagggcaagaaagagatcatctaagtccaagtctcatcgctctcag acacgttcacggtcacgttcaagacgcaggaggagaagcagcagatcaagatcaaagtct agaggaagaagatctgtatcaaaagagaagcgcaaaagatctccaaagcacagatccaag tctagggaaagaaaaagaaaaagatcaagctccagggataaccgaaagacagttagagct cgaagtcgaaccccaagtcgtcggagtcggagtcatactccaagtcgtcgacgaaggtct agatctgtgggtagaagaaggagctttagcatttccccaagccgccgcagccgcaccccc agccgccgcagccgcacccccagccgccgcagccgcacccccagccgccgcagccgcacc cccagccgccggagccgcacccctagccgtcggagccgcaccccaagccgccggagaaga tcaaggtctgtggtaagaagacgaagcttcagtatctcaccagtcagattaaggcgatca agaacacccttaagaagaaggtttagcagatctcccatccgtcgtaaaagatccaggtct tctgaacgaggcagatcacccaaacgtctgacagatttggataaggctcaattacttgaa atagccaaagctaatgcagctgccatgtgtgctaaggctggtgtccctttaccaccaaac ctaaagcctgcacctccacctactatagaagagaaagttgctaaaaagtcaggaggagct actatagaagaactaactgagaaatgtaaacagatcgcacagagtaaagaagatgatgat gtaatagtgaataaacctcatgtttcggatgaagaggaagaagaacctcctttttatcat catccctttaaactcagtgaacccaaacctatttttttcaatctgaatattgctgcagca aaaccaactccaccaaaaagccaggtaacattaacaaaagaattccctgtatcatctgga tctcaacatcggaaaaaagaagcggatagtgtttatggagaatgggttcctgtggagaaa aatggtgaagaaaacaaagatgatgataatgttttcagcagcaatttgccctcagagggc 
cgggttaaacggcagggccgggttagacgacagatgaaacaacccgcagcttctcatttg acagtaactcgatgcaattcactttgtggaaccaagccacaaagtgaaaagcatcgaatt gcagagaacagtgttatcacatccctacccaacattgggccctccctgcacttgtgggaa gcaaatttatcgggtggagggattgatcggagagggcagatcctcaggttcaacacaaga actggattggaaaaggtcattcctgtggacatctctacagcaatgagtgaacgggcactt gctcagaaaagactcagtgagaatgcatttgatcttgaagccatgagcatgttaaataga gctcaggaaaggattgatgcctgggctcagctgaactctattcctggccagttcacagga agtacaggagtacaggttttgacacaagaacagttggccaatactggtgcccaagcctgg attaaaaaggatcagttcttaagagcagccccggtaactggaggaatgggagccgttttg atgagaaaaatgggctggagagaaggagaaggattaggaaaaaacaaagaaggcaataag gaacccatcctagttgattttaagacagaccgaaaaggcaaacatcctgtgtctgctttg atggagatctgtaataaaagaaggtggcaaccacctgaatttctattggtccatgatagt ggccctgatcatcgcaaacattttctctttagggtattgataaatggaagcgcttaccag cccagctttgccagccctaataagaagcatgctaaagccacagcagctactgtggttctt caagcaatgggccttgtaccaaaggacctcatggctaatgccacttgcttcaggagtgcc tcacgtagatag >gi568815577r:33404127_33639315|GENSCAN_predicted_peptide_8|453_aa MALSVPGYSPGFRKPPEVVRLRRKRARSRGAAASPPRELTEPAARRAALVAGLPLRPFPA AGGRGGGSGGGPAAARRNPFARLDNRPRVAAEPPDGPAREQPEAPVPGTPPWEPRFWQGR VCGKSAPLATPLRAVLTGRVPEESEGAPEGSEEPLGEDLEVTWKRARKTLRAPILQGRCS MGQTSHVSFSEPDIPSSKSTELPVDWSIKTRLLFTSSQPFTWADHLKAQEEAQGLVQHCR ATEVTLPKSIQFTVLFRAAGLAGSDLITALISPTTRGLREAMRNEGIEFSLPLIKESGHK KETASGTSLGYGERKEKHEVQMDHRPESVVLVKGINTFTLLNFLINSKSLVATSGPQAGL PPTLLSPVAFRGATMQMLKSGSFSAVLYPHEPTAVFNICLQMDKVLDMEVVHKELTNCGL HPNTLEQLSQIPLLGKSSLRNVVLRDYIYNWRS >gi568815577r:33404127_33639315|GENSCAN_predicted_CDS_8|1362_bp atggccctttcggtgcccggctactcaccgggcttccgaaagccgcccgaggtagtgcgg ctccgacggaaaagggcccggagccgtggagctgccgcctccccgccccgtgagctgacg gagccggcggcccgccgagccgccctggtggcggggctgcctcttcgccctttccctgct gcggggggcagaggcggtggcagcggcggcggcccggccgctgctcggaggaaccccttc gcccgcctggacaaccgaccgcgggtcgccgcggagccccccgacgggccggcccgcgag cagccggaggccccggtcccgggtaccccgccctgggagccgcgattctggcaaggccgg gtttgtgggaagagcgctcccctggctaccccgctgcgggctgtgctcacgggacgcgtt 
cccgaggaaagtgagggggccccagagggcagcgaggagccccttggagaggacttagag gtgacttggaaaagagctcgaaaaaccctgagagcccccatcttgcaaggacgctgttcg atggggcagacttcacatgtatcattctccgagcctgatattccgtcctcaaaaagtact gagttacctgtggactggagtattaaaacgcgactccttttcacctcttctcaacccttt acctgggcagatcatttgaaagcacaggaagaagctcaaggtcttgtccagcattgtagg gcaacagaagttactttgcctaaaagtatacagtttactgtcctgttccgagcagcagga ttagctggaagtgacttaatcacagctctcatatctccaacaactcgaggtttaagagaa gctatgagaaatgaaggtattgaattttctctgcctttaataaaagaaagtggccataag aaggagacagcatctggaacaagcttgggatatggggagcgtaaagagaaacatgaagta caaatggatcacagacctgaatctgttgtgttggtaaaaggaatcaacacctttacattg ctcaattttttgattaactctaagagtttagttgctacctcaggtccacaggcaggactt cctccaaccctcttgtcccctgttgctttccgaggtgccacaatgcaaatgcttaagagt ggatctttctctgcagtactgtatccacacgagccaactgctgtatttaacatctgcctg caaatggacaaagtacttgatatggaggttgttcataaggagcttactaactgtggtttg caccctaacactctggagcaacttagtcaaataccgttacttgggaaatcatctttacgg aatgtggtgctgagagactacatttataattggagatcctga >gi568815577r:33404127_33639315|GENSCAN_predicted_peptide_9|217_aa VTMKGLYFQQSSTDEEITFVFQEKEDLPVTEDNFVKLQVKACALSQINTKLLAEMKMKKD LFPVGREIAGIVLDVGSKVSFFQPDDEVVGILPLDSEDPGLCEVVRVHEHYLARVIDVSN GKVHVAESCLEETGGLGVDIVLDAGVRLYSKDDEPAVKLQLLPHKHDIITLLGVGGHWVT TEENLQVKKQKHRDVDTDSRTQQVAEQESHSGSLSPF >gi568815577r:33404127_33639315|GENSCAN_predicted_CDS_9|654_bp gtgactatgaaaggcttatatttccaacagagttccacagatgaagaaataacatttgta tttcaagaaaaggaagatcttcctgttacagaggataactttgtgaaacttcaagttaaa gcttgtgctctgagccagataaatacaaagcttctggcagaaatgaagatgaaaaaggat ttatttcctgttgggagagaaattgctggaattgtattagatgttggaagcaaggtatca ttctttcaaccagatgatgaagtagttggaattttgcccctggactctgaagaccctgga ctttgtgaagttgttagagtacatgagcattacttggcccgagtgattgatgtatctaat gggaaagttcatgttgctgaaagctgtttggaagaaacaggtggcctgggagtagatatt gtcctagatgctggagtgagattatatagtaaagatgatgaaccagctgtaaaactacaa ctactaccacataaacatgatatcatcacacttcttggtgttggaggccactgggtaaca acagaagaaaaccttcaggtgaagaaacagaagcacagagatgttgatacagactcaagg acgcagcaagtggcggagcaagaatcacactcaggcagtctctctcctttctga