GENSCAN 1.0 Date run: 6-Nov-116 Time: 01:12:55 Sequence gi568815587r:47318838_47526279 : 207442 bp : 50.29% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4834 4998 165 0 0 91 107 224 0.991 24.76 1.02 Intr + 5428 5500 73 0 1 104 93 109 0.999 11.98 1.03 Intr + 5634 5740 107 1 2 102 96 130 0.972 15.13 1.04 Intr + 7901 7970 70 1 1 138 19 53 0.622 2.15 1.05 Intr + 9821 9899 79 0 1 69 74 104 0.877 5.71 1.06 Term + 10209 10398 190 0 1 111 55 55 0.454 1.42 1.07 PlyA + 11175 11180 6 1.05 2.38 PlyA - 12600 12595 6 1.05 2.37 Term - 13044 13034 11 1 2 95 47 0 0.670 -5.04 2.36 Intr - 13421 13235 187 2 1 74 78 211 0.717 17.96 2.35 Intr - 13865 13729 137 0 2 63 97 228 0.685 21.49 2.34 Intr - 14136 13977 160 0 1 105 53 275 0.838 25.26 2.33 Intr - 14496 14357 140 1 2 103 89 155 0.957 17.28 2.32 Intr - 14915 14720 196 2 1 91 70 364 0.981 33.89 2.31 Intr - 15173 15085 89 0 2 95 75 84 0.849 7.49 2.30 Intr - 16372 16205 168 2 0 108 66 165 0.875 16.22 2.29 Intr - 17174 17040 135 0 0 118 68 201 0.999 21.64 2.28 Intr - 18770 18554 217 2 1 -17 97 475 0.757 35.98 2.27 Intr - 18957 18797 161 1 2 63 36 174 0.466 9.41 2.26 Intr - 19842 19683 160 0 1 86 85 380 0.777 37.06 2.25 Intr - 20567 20487 81 2 0 95 113 88 0.970 11.83 2.24 Intr - 20953 20814 140 2 2 102 84 150 0.999 16.18 2.23 Intr - 22195 22166 30 2 0 121 110 1 0.929 3.70 2.22 Intr - 22407 22301 107 2 2 85 80 208 0.999 19.56 2.21 Intr - 23319 23154 166 1 1 90 101 299 0.999 30.42 2.20 Intr - 23907 23741 167 2 2 26 84 278 0.999 20.80 2.19 Intr - 24098 23993 106 0 1 65 101 129 0.999 11.17 2.18 Intr - 24308 24184 125 1 2 78 86 205 0.994 19.53 2.17 Intr - 24787 24655 133 2 1 -54 61 230 0.977 5.80 2.16 Intr - 27533 27370 164 1 2 -16 71 345 0.997 22.02 2.15 Intr - 28642 28589 54 0 0 84 105 42 0.908 3.59 2.14 Intr - 29068 29020 49 2 1 53 94 86 0.968 3.44 2.13 Intr - 29704 29587 118 1 1 106 89 204 0.993 22.34 2.12 Intr - 31085 30937 149 0 2 114 98 294 0.999 32.95 2.11 Intr - 31332 31177 156 1 0 43 92 149 0.495 10.78 2.10 Intr - 31778 31665 114 0 0 87 100 55 0.992 7.02 2.09 Intr - 32668 32402 267 2 0 31 110 407 0.593 34.70 2.08 Intr - 35346 35298 49 0 1 83 95 58 0.972 4.25 2.07 Intr - 36235 36104 132 1 0 124 41 77 0.932 7.44 2.06 Intr - 36709 36438 272 2 2 131 -23 720 0.860 62.46 2.05 Intr - 40169 40007 163 2 1 121 75 159 0.937 17.45 2.04 Intr - 41203 41016 188 2 2 110 57 314 0.681 29.91 2.03 Intr - 56895 56796 100 0 1 40 101 142 0.475 10.38 2.02 Intr - 64327 64226 102 1 0 95 67 25 0.124 1.47 2.01 Init - 70527 70339 189 0 0 33 84 105 0.273 3.61 2.00 Prom - 74754 74715 40 -7.66 3.00 Prom + 83756 83795 40 -7.06 3.01 Init + 91288 91558 271 0 1 70 100 186 0.900 13.14 3.02 Intr + 93089 93202 114 0 0 102 119 139 0.999 18.82 3.03 Intr + 93509 93630 122 0 2 101 80 195 0.999 20.21 3.04 Intr + 94563 94670 108 2 0 126 87 135 0.999 17.68 3.05 Intr + 94760 94849 90 1 0 141 100 218 0.983 28.49 3.06 Intr + 95588 95638 51 1 0 96 83 37 0.885 3.10 3.07 Intr + 95940 96072 133 2 1 83 68 187 0.977 16.42 3.08 Intr + 96223 96322 100 2 1 71 89 151 0.973 12.67 3.09 Term + 96451 96526 76 1 1 117 32 149 0.999 9.51 3.10 PlyA + 97641 97646 6 1.05 4.21 PlyA - 97795 97790 6 1.05 4.20 Term - 100108 99998 111 1 0 76 49 214 0.999 14.86 4.19 Intr - 100360 100279 82 0 1 88 101 163 0.933 17.24 4.18 Intr - 101572 101427 146 1 2 107 99 188 0.999 20.88 4.17 Intr - 101890 101794 97 0 1 86 113 110 0.789 13.31 4.16 Intr - 103885 103737 149 1 2 120 91 299 0.999 32.43 4.15 Intr - 104136 103993 144 0 0 72 113 110 0.996 12.38 4.14 Intr - 105346 105209 138 1 0 63 78 250 0.997 22.16 4.13 Intr - 105654 105592 63 0 0 88 89 55 0.957 4.51 4.12 Intr - 105874 105770 105 1 0 57 99 146 0.999 13.01 4.11 Intr - 106409 106284 126 2 0 68 85 162 0.999 14.78 4.10 Intr - 107113 107030 84 1 0 25 86 186 0.976 12.12 4.09 Intr - 107445 107296 150 0 0 101 24 124 0.504 7.66 4.08 Intr - 119210 119144 67 1 1 80 -12 101 0.031 -1.79 4.07 Intr - 120094 119895 200 1 2 56 75 330 0.999 26.65 4.06 Intr - 122375 122322 54 2 0 80 77 104 0.777 7.68 4.05 Intr - 122896 122774 123 1 0 79 100 162 0.999 17.28 4.04 Intr - 123084 122986 99 0 0 68 78 147 0.997 12.01 4.03 Intr - 123977 123819 159 2 0 121 59 300 0.912 30.58 4.02 Intr - 129313 128975 339 1 0 70 86 660 0.910 59.67 4.01 Init - 130127 129936 192 2 0 83 98 384 0.999 37.87 4.00 Prom - 136314 136275 40 -6.16 5.13 PlyA - 138002 137997 6 1.05 5.12 Term - 153520 153393 128 2 2 43 53 217 0.956 12.14 5.11 Intr - 154394 154251 144 0 0 42 76 168 0.943 11.25 5.10 Intr - 156684 156499 186 1 0 61 113 188 0.964 18.26 5.09 Intr - 158119 158009 111 2 0 63 110 41 0.588 4.15 5.08 Intr - 158588 158460 129 0 0 72 115 52 0.990 6.97 5.07 Intr - 160115 160040 76 2 1 58 107 36 0.681 1.59 5.06 Intr - 164019 163858 162 0 0 91 73 111 0.962 10.07 5.05 Intr - 164695 164616 80 2 2 111 75 70 0.999 7.27 5.04 Intr - 165686 165552 135 0 0 53 92 158 0.685 13.24 5.03 Intr - 167961 167913 49 0 1 112 103 45 0.983 6.65 5.02 Intr - 168404 168322 83 0 2 72 78 39 0.953 0.66 5.01 Init - 170177 170000 178 2 1 56 72 220 0.955 16.62 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 119210 119138 73 1 1 80 41 97 0.917 1.58 S.002 Init + 135351 135425 75 2 0 51 116 73 0.927 7.49 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:47318838_47526279|GENSCAN_predicted_peptide_1|227_aa VCDDCVVLRSNIGTVYERWWYEKLINMTYCPKTKVLCLWRRNGSETQLNKFYTKKCRELY YCVKDSMERAAARQQSIKPGPELGGEFPVQDLKTGEGGLLQVTLEGINLKFMHNQVFIEL NHIKKCNTVRGVFVLEEFVPEIKEVVSHKYKTPMVSVPLNPDGPRPTKSATPYYVSSRTW LQFIAVRKISEPRPGLSLADGEGLRSCPSPGHAPGPLLFPSARCCCD >gi568815587r:47318838_47526279|GENSCAN_predicted_CDS_1|684_bp gtgtgcgatgactgtgtggtgttgcgtagtaacatcggaacagtgtatgagcgctggtgg tacgagaagctcatcaacatgacctactgtcccaagacgaaggtgttgtgcttgtggcgt agaaatggctctgagacccagctcaacaagttctatactaaaaagtgtcgggagctgtac tactgtgtgaaggacagcatggagcgcgctgccgcccgacagcaaagcatcaaacccgga cctgaattgggtggcgagttccctgtgcaggacctgaagactggtgagggtggcctgctg caggtgaccctggaagggatcaacctcaaattcatgcacaatcaggttttcatagagctg aatcacattaaaaagtgcaatacagttcgaggcgtctttgtcctggaggaatttgttcct gaaattaaagaagtggtgagccacaagtacaagacaccaatggtgagtgtgccgctcaac cctgatgggccacggcccacgaaatctgctactccgtattatgtctcttctcgtacgtgg ctgcagttcatagcagtgaggaagatctcagaaccccgccccggcctgtctctagctgat ggagaggggctacgcagctgccccagcccagggcacgcccctggccccttgctgttccca agtgcacgatgctgctgtgactga >gi568815587r:47318838_47526279|GENSCAN_predicted_peptide_2|1693_aa MREPRDRDSHGGTGLRGPENHNHLLPCSRLCALGRAPYLRELHFRHLQNGTDKACLVRLR EGGELLTVRLWITKPFLFLESGSYYSQEFELPRVPGDQPSEDLVPYDTDLYQRQTHEYYP YLSSDGESHSDHYWDFHPHHVHSEFESFAENNFTELQSVQPPQLQQLYRHMELEQMHVLD TPMVPPHPSLGHQVSYLPRMCLQYPSLSPAQPSSDEEEGERQSPPLEVSDGEADGLEPGP GLLPGETGSKKKIRLYQFLLDLLRSGDMKDSIWWVDKDKGTFQFSSKHKEALAHRWGIQK GNRKKMTYQKMARALRNYGKTGEVKKVKKKLTYQFSGEAPSTSRFASLQDSTPAPGRQLG VRPHRGNLAEDDPGYCLGSLKSLLATAGEKQTKLSGEAFSAFSKKPRSVEVAAGSPAVFE AETERAGVKVRWQRGGSDISASNKYGLATEGTRHTLTVREVGPADQGSYAVIAGSSKVKF DLKVIEAEKAEPMLAPAPAPAEATGAPGEAPAPAAELGESAPSPKAWVTEQDSISNKQKK PLLTGSSSAALNGPTPGAPDDPIGLFVMRPQDGEVTVGGSITFSARVAGASLLKPPVVKW FKGKWVDLSSKVGQHLQLHDSYDRASKVYLFELHITDAQPAFTGSYRCEVSTKDKFDCSN FNLTVHEAMGTGDLDLLSAFRRTDSHEDTGILDFSSLLKKRDSKLEAPAEEDVWEILRQA PPSEYERIAFQYGVTDLRGMLKRLKGMRRDEKKSTAFQKKLEPAYQVSKGHKIRLTVELA DHDAEVKWLKNGQEIQMSGRYIFESIGAKRTLTISQCSLADDAAYQCVVGGEKCSTELFV KEPPVLITRPLEDQLVMVGQRVEFECEVSEEGAQVKWLKDGVELTREETFKYRFKKDGQR HHLIINEAMLEDAGHYALCTSGGQALAELIVQEKKLEVYQSIADLMVGAKDQAVFKCEVS DENVRGVWLKNGKELVPDSRIKVSHIGRVHKLTIDDVTPADEADYSFVPEGFACNLSAKL HFMEVKIDFVPRQEPPKIHLDCPGRIPDTIVVVAGNKLRLDVPISGDPAPTVIWQKAITQ GNKAPARPAPDAPEDTGDSDEWVFDKKLLCETEGRVRVETTKDRSIFTVEGAEKEDEGVY TVTVKNPVGEDQVNLTVKVIDVPDAPAAPKISNVGEDSCTVQWEPPAYDGGQPILGECKG TGWRCEGAKQIRGKTRAATSPEPGYILERKKKKSYRWMRLNFDLIQELSHEARRMIEGVV YEMRVYAVNAIGMSRPSPASQPFMPIGPPSEPTHLAVEDVSDTTVSLKWRPPERVGAGGL DGYSVEYCPEGCSEWVAALQGLTEHTSILVKDLPTGARLLFRVRAHNMAGPGAPVTTTEP VTVQEILQRPRLQLPRHLRQTIQKKVGEPVNLLIPFQGKPRPQVTWTKEGQPLAGEEVSI RNSPTDTILFIRAARRVHSGTYQVTVRIENMEDKATLVLQVVDKPSPPQDLRVTDAWGLN VALEWKPPQDVGNTELWGYTVQKADKKTMEWFTVLEHYRRTHCVVPELIIGNGYYFRVFS QNMVGFSDRAATTKEPVFIPRPGITYEPPNYKALDFSEAPSFTQPLVNRSVIAGYTAMLC CAVRGSPKPKISWFKNGLDLGEDARFRMFSKQGVLTLEIRKPCPFDGGIYVCRATNLQGE ARCECRLEVRVPQ >gi568815587r:47318838_47526279|GENSCAN_predicted_CDS_2|5082_bp atgagagaacctcgagatagggacagccacggaggcacaggcctgaggggccctgagaac cacaatcacctcctgccctgctcccggctgtgtgcccttggccgtgccccttacctccgg gaactgcacttccgtcatctgcagaatggcactgacaaggcctgcctcgtacggttacgt gagggaggggaactactaaccgtgaggttatggattaccaaaccgttcctgttcttggaa tctgggtcctactattcccaagaatttgaattaccacgggtccctggggatcagccatca gaagacctggtgccctatgacacggatctataccaacgccaaacgcacgagtattacccc tatctcagcagtgatggggagagccatagcgaccattactgggacttccacccccaccac gtgcacagcgagttcgagagcttcgccgagaacaacttcacggagctccagagcgtgcag cccccgcagctgcagcagctctaccgccacatggagctggagcagatgcacgtcctcgat acccccatggtgccaccccatcccagtcttggccaccaggtctcctacctgccccggatg tgcctccagtacccatccctgtccccagcccagcccagctcagatgaggaggagggcgag cggcagagccccccactggaggtgtctgacggcgaggcggatggcctggagcccgggcct gggctcctgcctggggagacaggcagcaagaagaagatccgcctgtaccagttcctgttg gacctgctccgcagcggcgacatgaaggacagcatctggtgggtggacaaggacaagggc accttccagttctcgtccaagcacaaggaggcgctggcgcaccgctggggcatccagaag ggcaaccgcaagaagatgacctaccagaagatggcgcgcgcgctgcgcaactacggcaag acgggcgaggtcaagaaggtgaagaagaagctcacctaccagttcagcggcgaagccccc tccacatcccgcttcgcctccctccaggactccaccccggctcccggacgccagctgggc gtcagaccccaccggggcaaccttgcagaggacgacccggggtactgccttgggagtctc aagtccctcctagccacagcaggggagaaacagaccaagctgagcggagaagccttctca gcttttagcaagaagccacggtcagtggaagtggccgcaggcagccctgccgtgttcgag gccgagacagagcgggcaggagtgaaggtgcgctggcagcgcggaggcagtgacatcagc gccagcaacaagtacggcctggccacagagggcacacggcatacgctgacagtgcgggaa gtgggccctgccgaccagggatcttacgcagtcattgctggctcctccaaggtcaagttc gacctcaaggtcatagaggcagagaaggcagagcccatgctggcccctgcccctgcccct gctgaggccactggagcccctggagaagccccggccccagccgctgagctgggagaaagt gccccaagtcccaaagcctgggtgacagagcaagactccatctcaaacaaacagaaaaag cctttgctcacagggtcaagctcagcagctctcaatggtcctacccctggagcccccgat gaccccattggcctcttcgtgatgcggccacaggatggcgaggtgaccgtgggtggcagc atcaccttctcagcccgcgtggccggcgccagcctcctgaagccgcctgtggtcaagtgg ttcaagggcaaatgggtggacctgagcagcaaggtgggccagcacctgcagctgcacgac agctacgaccgcgccagcaaggtctatctgttcgagctgcacatcaccgatgcccagcct gccttcactggcagctaccgctgtgaggtgtccaccaaggacaaatttgactgctccaac ttcaatctcactgtccacgaggccatgggcaccggagacctggacctcctatcagccttc cgccgcactgatagccatgaggacactgggattctggacttcagctcactgctgaaaaag agggactcgaagctggaggcaccagcagaggaggacgtgtgggagatcctacggcaggca cccccatctgagtacgagcgcatcgccttccagtacggcgtcactgacctgcgcggcatg ctaaagaggctcaagggcatgaggcgcgatgagaagaagagcacagcctttcagaagaag ctggagccggcctaccaggtgagcaaaggccacaagatccggctgaccgtggaactggct gaccatgacgctgaggtcaaatggctcaagaatggccaggagatccagatgagcggcagg tacatctttgagtccatcggtgccaagcgtaccctgaccatcagccagtgctcattggcg gacgacgcagcctaccagtgcgtggtgggtggcgagaagtgtagcacggagctctttgtg aaagagccccctgtgctcatcacgcgccccttggaggaccagctggtgatggtggggcag cgggtggagtttgagtgtgaagtatcggaggagggggcgcaagtcaaatggctgaaggac ggggtggagctgacccgggaggagaccttcaaataccggttcaagaaggacgggcagaga caccacctgatcatcaacgaggccatgctggaggacgcggggcactatgcactgtgcact agcgggggccaggcgctggctgagctcattgtgcaggaaaagaagctggaggtgtaccag agcatcgcagacctgatggtgggcgcaaaggaccaggcggtgttcaaatgtgaggtctca gatgagaatgttcggggtgtgtggctgaagaatgggaaggagctggtgcccgacagccgc ataaaggtgtcccacatcgggcgggtccacaaactgaccattgacgacgtcacacctgcc gacgaggctgactacagctttgtgcccgagggcttcgcctgcaacctgtcagccaagctc cacttcatggaggtcaagattgacttcgtacccaggcaggaacctcccaagatccacctg gactgcccaggccgcataccagacaccattgtggttgtagctggaaataagctacgtctg gacgtccctatctctggggaccctgctcccactgtgatctggcagaaggctatcacgcag gggaataaggccccagccaggccagccccagatgccccagaggacacaggtgacagcgat gagtgggtgtttgacaagaagctgctgtgtgagaccgagggccgggtccgcgtggagacc accaaggaccgcagcatcttcacggtcgagggggcagagaaggaagatgagggcgtctac acggtcacagtgaagaaccctgtgggcgaggaccaggtcaacctcacagtcaaggtcatc gacgtgccagacgcacctgcggcccccaagatcagcaacgtgggagaggactcctgcaca gtacagtgggagccgcctgcctacgatggcgggcagcccatcctgggtgagtgcaagggc accggatggaggtgtgagggcgccaaacagatccgagggaagaccagagctgccacctcc cctgagccaggctacatcctggagcgcaagaagaagaagagctaccggtggatgcggctg aacttcgacctgattcaggagctgagtcatgaagcgcggcgcatgatcgagggcgtggtg tacgagatgcgcgtctacgcggtcaacgccatcggcatgtccaggcccagccctgcctcc cagcccttcatgcctatcggtccccccagcgaacccacccacctggcagtagaggacgtc tctgacaccacggtctccctcaagtggcggcccccagagcgcgtgggagcaggaggcctg gatggctacagcgtggagtactgcccagagggctgctcagagtgggtggctgccctgcag gggctgacagagcacacatcgatactggtgaaggacctgcccacgggggcccggctgctt ttccgagtgcgggcacacaatatggcagggcctggagcccctgttaccaccacggagccg gtgacagtgcaggagatcctgcaacggccacggcttcagctgcccaggcacctgcgccag accattcagaagaaggtcggggagcctgtgaaccttctcatccctttccagggcaagccc cggcctcaggtgacctggaccaaagaggggcagcccctggcaggcgaggaggtgagcatc cgcaacagccccacagacaccatcctgttcatccgggccgctcgccgcgtgcattcaggc acttaccaggtgacggtgcgcattgagaacatggaggacaaggccacgctggtgctgcag gttgttgacaagccaagtcctccccaggatctccgggtgactgacgcctggggtcttaat gtggctctggagtggaagccaccccaggatgtcggcaacacggagctctgggggtacaca gtgcagaaagccgacaagaagaccatggagtggttcaccgtcttggagcattaccgccgc acccactgcgtggtgccagagctcatcattggcaatggctactacttccgcgtcttcagc cagaatatggttggctttagtgacagagcggccaccaccaaggagcccgtctttatcccc agaccaggcatcacctatgagccacccaactataaggccctggacttctccgaggcccca agcttcacccagcccctggtgaaccgctcggtcatcgcgggctacactgctatgctctgc tgtgctgtccggggtagccccaagcccaagatttcctggttcaagaatggcctggacctg ggagaagacgcccgcttccgcatgttcagcaagcagggagtgttgactctggagattaga aagccctgcccctttgacgggggcatctatgtctgcagggccaccaacttacagggcgag gcacggtgtgagtgccgcctggaggtgcgagtgcctcagtga >gi568815587r:47318838_47526279|GENSCAN_predicted_peptide_3|354_aa MAGPRLLFLTALALELLERAGGSQPALRSRGTATACRLDNKESESWGALLSGERLDTWIC SLLGSLMVGLSGVFPLLVIPLEMGTMLRSEAGAWRLKQLLSFALGGLLGNVFLHLLPEAW AYTCSASPGGEGQSLQQQQQLGLWVIAGILTFLALEKMFLDSKEEGTSQAPNKDPTAAAA ALNGGHCLAQPAAEPGLGAVVRSIKVSGYLNLLANTIDNFTHGLAVAASFLVSKKIGLLT TMAILLHEIPHEVGDFAILLRAGFDRWSAAKLQLSTALGGLLGAGFAICTQSPKGVEETA AWVLPFTSGGFLYIALVNVLPDLLEEEDPWRSLQQLLLLCAGIVVMVLFSLFVD >gi568815587r:47318838_47526279|GENSCAN_predicted_CDS_3|1065_bp atggcgggcccaaggctcctcttcctcactgcccttgccctggagctcttggaaagggct gggggttcccagccggccctccggagccgggggactgcgacggcctgtcgcctggacaac aaggaaagcgagtcctggggggctctgctgagcggagagcggctggacacctggatctgc tccctcctgggttccctcatggtggggctcagtggggtcttcccgttgcttgtcattccc ctagagatggggaccatgctgcgctcagaagctggggcctggcgcctgaagcagctgctc agcttcgccctggggggactcttgggcaatgtgtttctgcatctgctgcccgaagcctgg gcctacacgtgcagcgccagccctggtggtgaggggcagagcctgcagcagcagcaacag ctggggctgtgggtcattgctggcatcctgaccttcctggcgttggagaagatgttcctg gacagcaaggaggaggggaccagccaggcccccaacaaagaccccactgctgctgccgcc gcgctcaatggaggccactgtctggcccagccggctgcagagcccggcctcggtgccgtg gtccggagcatcaaagtcagcggctacctcaacctgctggccaacaccatcgataacttc acccacgggctggctgtggctgccagcttccttgtgagcaagaagatcgggctcctgaca accatggccatcctcctgcatgagatcccccatgaggtgggcgactttgccatcctgctc cgggccggctttgaccgatggagcgcagccaagctgcaactctcaacagcgctggggggc ctactgggcgctggcttcgccatctgtacccagtcccccaagggagtagaggagacggca gcctgggtcctgcccttcacctctggcggctttctctacatcgccttggtgaacgtgctc cctgacctcttggaagaagaggacccgtggcgctccctgcagcagctgcttctgctctgt gcgggcatcgtggtaatggtgctgttctcgctcttcgtggattaa >gi568815587r:47318838_47526279|GENSCAN_predicted_peptide_4|875_aa MGQDQTKQQIEKGLQLYQSNQTEKALQVWTKVLEKSSDLMGRFRVLGCLVTAHSEMGRYK EMLKFAVVQIDTARELEDADFLLESYLNLARSNEKLCEFHKTISYCKTCLGLPGTRAGAQ LGGQVSLSMGNAFLGLSVFQKALESFEKALRYAHNNDDAMLECRVCCSLGSFYAQVKDYE KALFFPCKAAELVNNYGKGWSLKYRAMSQYHMAVAYRLLGRLGSAMECCEESMKIALQHG DRPLQALCLLCFADIHRSRGDLETAFPRYDSAMSIMTEIGNRLGQVQALLGVAKCWVARK ALDKALDAIERAQDLAEEVGNKLSQLKLHCLSESIYRSKGLQRELRAHVVRFHECVEETE LYCGLCGESIGEKNSRLQALPCSHIFHLRCLQNNGTRSCPNCRRSSMKPGFEMNLLPNIE SPVTRQEKMATVWDEAEVGTGRAGGPGTGARRETPLTRGKEQDGIGEEVLKMSTEEIIQR TRLLDSEIKIMKSEVLRVTHELQAMKDKIKENSEKIKVNKTLPYLVSNVIELLDVDPNDQ EEDGANIDLDSQRKGKCAVIKTSTRQTYFLPVIGLVDAEKLKPGDLVGVNKDSYLILETL PTEYDSRVKAMEVDERPTEQYSDIGGLDKQIQELVEAIVLPMNHKEKFENLGIQPPKGVL MYGPPGTGKTLLARACAAQTKATFLKLAGPQLVQMFIGDGAKLVRDAFALAKEKAPSIIF IDELDAIGTKRFDSEKAGDREVQRTMLELLNQLDGFQPNTQVKVIAATNRVDILDPALLR SGRLDRKIEFPMPNEEARARIMQIHSRKMNVSPDVNYEELARCTDDFNGAQCKAVCVEAG MIALRRGATELTHEDYMEGILEVQAKKKANLQYYA >gi568815587r:47318838_47526279|GENSCAN_predicted_CDS_4|2628_bp atggggcaggaccagaccaagcagcagatcgagaaggggctccagctgtaccagtccaac cagacagagaaggcattgcaggtgtggacaaaggtgctggagaagagctcggacctcatg gggcgcttccgcgtgctgggctgcctggtcacagcccactcggagatgggccgctacaag gagatgctgaagttcgctgtggtccagatcgacacggcccgggagctggaggatgccgac ttcctcctggagagctacctgaacctggcacgcagcaacgagaagctgtgcgagtttcac aagaccatctcctactgcaagacctgccttgggctgcctggtaccagggcaggtgcccag ctcggaggccaggtcagcctgagcatgggcaatgccttcctgggcctcagcgtcttccag aaggccctggagagcttcgagaaggccctgcgctatgcccacaacaatgatgacgccatg ctcgagtgccgcgtgtgctgcagcctgggcagcttctatgcccaggtcaaggactacgag aaagccctgttcttcccctgcaaggcggcagagcttgtcaacaactatggcaaaggctgg agcctgaagtaccgggccatgagccagtaccacatggccgtggcctatcgcctgctgggc cgcctgggcagtgccatggagtgttgtgaggagtctatgaagatcgcgctgcagcacggg gaccggccactgcaggcgctctgcctgctctgcttcgctgacatccaccggagccgtggg gacctggagacagccttccccaggtacgactccgccatgagcatcatgaccgagatcgga aaccgcctggggcaggtgcaggcgctgctgggtgtggccaagtgctgggtggccaggaag gcgctggacaaggctctggatgccatcgagagagcccaggatctggccgaggaggtgggg aacaagctgagccagctcaagctgcactgtctgagcgagagcatttaccgcagcaaaggg ctgcagcgggaactgcgggcgcacgttgtgaggttccacgagtgcgtggaggagacggag ctctactgcggcctgtgcggcgagtccataggcgagaagaacagccggctgcaggcccta ccttgctcccacatcttccacctcaggtgcctgcagaacaacgggacccggagctgtccc aactgccgccgctcatccatgaagcctggctttgaaatgaatctgctgccgaatattgag agtccagtgactcggcaggagaagatggcgaccgtgtgggatgaggccgaggtgggcacc gggcgagctggcggtccgggaaccggggcccgcagggagacgccattgacaagagggaag gagcaagatggaattggggaggaggtgctcaagatgtccacggaggagatcatccagcgc acacggctgctggacagtgagatcaagatcatgaagagtgaagtgttgagagtcacccat gagctccaagccatgaaggacaagataaaagagaacagtgagaaaatcaaagtgaacaag accctgccgtaccttgtctccaacgtcatcgagctcctggatgttgatcctaatgaccaa gaggaggatggtgccaatattgacctggactcccagaggaagggcaagtgtgctgtgatc aaaacctctacacgacagacgtacttccttcctgtgattgggttggtggatgctgaaaag ctaaagccaggagacctggtgggtgtgaacaaagactcctatctgatcctggagacgctg cccacagagtatgactcgcgggtgaaggccatggaggtagacgagaggcccacggagcaa tacagtgacattgggggtttggacaagcagatccaggagctggtggaggccattgtcttg ccaatgaaccacaaggagaagtttgagaacttggggatccaacctccaaaaggggtgctg atgtatgggcccccagggacggggaagaccctcctggcccgggcctgtgccgcacagact aaggccaccttcctaaagctggctggcccccagctggtgcagatgttcattggagatggt gccaagctagtccgggatgcctttgccctggccaaggagaaagcgccctctatcatcttc attgatgagttggatgccatcggcaccaagcgctttgacagtgagaaggctggggaccgg gaggtgcagaggacaatgctggagcttctgaaccagctggatggcttccagcccaacacc caagttaaggtaattgcagccacaaacagggtggacatcctggaccccgccctcctccgc tcgggccgccttgaccgcaagatagagttcccgatgcccaatgaggaggcccgggccaga atcatgcagatccactcccgaaagatgaatgtcagtcctgacgtgaactacgaggagctg gcccgctgcacagatgacttcaatggggcccagtgcaaggctgtgtgtgtggaggcgggc atgatcgcactgcgcaggggtgccacggagctcacccacgaggactacatggaaggcatc ctggaggtgcaggccaagaagaaagccaacctacaatactacgcctag >gi568815587r:47318838_47526279|GENSCAN_predicted_peptide_5|486_aa MNGTLDHPDQPDLDAIKMFVGQVPRTWSEKDLRELFEQYGAVYEINVLRDRSQNPPQSKG CCFVTFYTRKAALEAQNALHNMKVLPGMHHPIQMKPADSEKNNAVEDRKLFIGMISKKCT ENDIRVMFSSFGQIEECRILRGPDGLSRGCAFVTFTTRAMAQTAIKAMHQAQTMEGCSSP MVVKFADTQKDKEQKRMAQQLQQQMQQISAASVWGNLAGLNTLGPQYLALYLQLLQQTAS SGNLNTLSSLHPMGGLNAMQLQNLAALAAAASAAQNTPSGTNALTTSSSPLSVLTSSGSS PSSSSSNSVNPIASLGALQTLAGATAGLNVGSLAGMAALNGGLGSSGLSNGTGSTMEALT QAYSGIQQYAAAALPTLYNQNLLTQQSIGAAGSQKEGPEGANLFIYHLPQEFGDQDLLQM FMPFGNVVSAKVFIDKQTNLSKCFGFVSYDNPVSAQAAIQSMNGFQIGMKRLKVQLKRSK NDSKPY >gi568815587r:47318838_47526279|GENSCAN_predicted_CDS_5|1461_bp atgaacggcaccctggaccacccagaccaaccagatcttgatgctatcaagatgtttgtg ggccaggttccaaggacctggtctgaaaaggacttgcgggaactcttcgaacagtatggt gctgtgtatgaaatcaacgtcctaagggataggagccaaaacccgcctcagagcaaaggg tgctgttttgttacattttacacccgtaaagctgcattagaagctcagaatgctcttcac aacatgaaagtcctcccagggatgcatcaccctatacagatgaaacctgctgacagtgag aagaacaatgcagtggaagacaggaagctgtttattggtatgatttccaagaagtgcact gaaaatgacatccgagtcatgttctcttcgtttggacagattgaagaatgccggatattg cggggacctgatggcctgagccgaggttgtgcatttgtgacttttacaacaagagccatg gcacagacggctatcaaggcaatgcaccaagcacagaccatggagggttgctcatcaccc atggtggtaaaatttgctgatacacagaaggacaaagaacagaagagaatggcccagcag ctccagcagcagatgcagcaaatcagcgcagcatctgtgtggggaaaccttgctggtcta aatactcttggaccccagtatttagcactttatttgcagctccttcagcagactgcctcc tctgggaacctcaacaccctgagcagcctccacccaatgggagggttgaatgcaatgcag ttacagaatttggctgcactagctgctgcagctagtgcagctcagaacacaccaagtggt accaatgctctcactacatccagcagtcccctcagcgtgctcactagttcagggtcctca cctagctctagcagcagtaattctgtcaaccccatagcctcacttggagccctgcagaca ttagctggagcaacggctggcctcaatgttggctctttggcaggaatggctgctttaaat ggtggcctgggcagcagtggcctttccaatggcaccgggagcaccatggaggccctcact caggcctactcgggtatccagcaatatgctgctgctgcgctccccactctgtacaaccag aatcttctgacacagcagagtattggtgctgctggaagccagaaggaaggtccagaggga gccaacctgttcatctaccacctgccccaggagtttggtgatcaggacctgctgcagatg tttatgccctttgggaatgtcgtgtctgccaaggttttcatagacaagcagacaaacctg agcaagtgttttggttttgtaagttacgacaatcctgtttcggcccaagctgccatccag tccatgaacggctttcagattggcatgaagcggcttaaagtgcagctcaaacgttcgaag aatgacagcaagccctactga