GENSCAN 1.0 Date run: 6-Nov-116 Time: 04:22:51 Sequence gi568815586f:53084889_53287587 : 202699 bp : 49.00% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 12830 13163 334 1 1 87 105 270 0.999 24.06 1.02 Intr + 15824 15969 146 0 2 122 74 54 0.727 7.40 1.03 Intr + 16153 16272 120 0 0 122 94 206 0.999 25.29 1.04 Intr + 18047 18170 124 1 1 97 75 -12 0.063 -1.44 1.05 Intr + 19263 19318 56 1 2 126 78 25 0.931 4.00 1.06 Intr + 20219 20355 137 1 2 86 109 59 0.964 7.17 1.07 Intr + 20673 20732 60 0 0 43 80 102 0.844 2.75 1.08 Intr + 21019 21126 108 1 0 107 96 230 0.999 25.10 1.09 Intr + 30502 30766 265 1 1 62 94 403 0.968 35.82 1.10 Intr + 31209 31278 70 2 1 105 78 10 0.933 0.45 1.11 Intr + 33462 33546 85 1 1 138 82 42 0.999 7.48 1.12 Intr + 34002 34047 46 0 1 70 94 51 0.995 2.31 1.13 Intr + 34236 34365 130 2 1 111 99 124 0.999 16.07 1.14 Intr + 35898 35995 98 1 2 72 39 162 0.955 9.43 1.15 Intr + 36415 36513 99 0 0 107 55 185 0.990 17.41 1.16 Intr + 38193 38328 136 2 1 86 83 132 0.974 12.64 1.17 Intr + 38840 38985 146 0 2 65 110 82 0.695 8.10 1.18 Intr + 45169 45286 118 0 1 144 18 -7 0.049 -2.06 1.19 Term + 46553 46767 215 0 2 7 44 153 0.121 0.19 1.20 PlyA + 47725 47730 6 1.05 2.00 Prom + 66808 66847 40 -3.16 2.01 Init + 68432 68765 334 1 1 50 -14 366 0.719 20.06 2.02 Intr + 68819 69235 417 0 0 -8 38 429 0.626 22.30 2.03 Term + 69352 69602 251 2 2 13 55 294 0.759 14.47 2.04 PlyA + 69717 69722 6 1.05 3.14 PlyA - 69803 69798 6 1.05 3.13 Term - 73796 73623 174 2 0 91 52 347 0.998 29.16 3.12 Intr - 74824 74735 90 1 0 109 102 24 0.985 6.09 3.11 Intr - 75050 74999 52 1 1 119 75 90 0.999 9.71 3.10 Intr - 75431 75232 200 2 2 68 82 312 0.980 26.75 3.09 Intr - 76501 76385 117 1 0 60 76 123 0.976 8.96 3.08 Intr - 85238 85184 55 1 1 110 105 64 0.998 9.28 3.07 Intr - 85614 85535 80 0 2 105 57 118 0.998 8.75 3.06 Intr - 86553 86438 116 1 2 109 59 163 0.999 15.67 3.05 Intr - 87100 86994 107 0 2 130 18 125 0.939 9.56 3.04 Intr - 87548 87458 91 0 1 120 55 82 0.999 7.15 3.03 Intr - 87760 87634 127 1 1 105 56 290 0.999 27.75 3.02 Intr - 88588 88457 132 1 0 72 113 117 0.637 13.44 3.01 Init - 95649 95515 135 0 0 83 80 124 0.870 9.24 3.00 Prom - 98782 98743 40 -8.36 4.00 Prom + 98966 99005 40 -1.96 4.01 Init + 100069 100152 84 0 0 60 92 80 0.155 6.42 4.02 Intr + 101066 101189 124 1 1 100 119 36 0.560 8.06 4.03 Intr + 101503 101621 119 2 2 69 110 156 0.998 16.08 4.04 Term + 103318 103491 174 0 0 60 54 114 0.609 2.96 4.05 PlyA + 104314 104319 6 1.05 5.30 PlyA - 106453 106448 6 1.05 5.29 Term - 106748 106668 81 2 0 83 49 38 0.755 -2.91 5.28 Intr - 107131 106971 161 2 2 94 94 266 0.836 27.51 5.27 Intr - 107650 107442 209 0 2 60 106 221 0.981 19.82 5.26 Intr - 108022 107803 220 2 1 75 93 113 0.999 7.86 5.25 Intr - 108475 108252 224 0 2 133 67 151 0.981 15.27 5.24 Intr - 109013 108820 194 2 2 105 94 -50 0.627 -4.41 5.23 Intr - 109456 109310 147 1 0 94 93 69 0.698 8.43 5.22 Intr - 110575 110486 90 1 0 33 54 111 0.823 2.39 5.21 Intr - 110833 110738 96 1 0 114 94 69 0.959 10.31 5.20 Intr - 111311 111153 159 2 0 37 59 148 0.990 6.98 5.19 Intr - 111932 111691 242 0 2 117 94 238 0.985 24.57 5.18 Intr - 112775 112605 171 0 0 84 85 301 0.999 29.31 5.17 Intr - 113063 112862 202 2 1 56 105 267 0.945 24.06 5.16 Intr - 115558 115355 204 1 0 95 63 71 0.130 4.70 5.15 Intr - 116369 116345 25 1 1 97 55 13 0.019 -3.07 5.14 Intr - 122369 122314 56 2 2 94 76 65 0.058 3.68 5.13 Intr - 126975 126833 143 1 2 129 6 170 0.067 12.97 5.12 Intr - 128355 128197 159 1 0 106 96 254 0.999 27.96 5.11 Intr - 128812 128608 205 1 1 97 64 359 0.999 33.17 5.10 Intr - 129347 129171 177 2 0 119 119 215 0.999 27.82 5.09 Intr - 129718 129558 161 2 2 46 89 219 0.824 17.51 5.08 Intr - 130546 130405 142 1 1 96 102 205 0.999 22.63 5.07 Intr - 130906 130758 149 2 2 123 93 160 0.989 19.95 5.06 Intr - 135334 135062 273 2 0 31 51 138 0.245 1.91 5.05 Intr - 142687 142474 214 1 1 70 56 148 0.785 8.09 5.04 Intr - 142799 142762 38 0 2 151 63 -6 0.890 1.18 5.03 Intr - 146347 146281 67 1 1 83 94 50 0.625 3.58 5.02 Intr - 147192 147086 107 1 2 48 109 62 0.885 4.23 5.01 Init - 151509 151380 130 0 1 54 71 26 0.292 -2.19 5.00 Prom - 158853 158814 40 -5.16 6.00 Prom + 160689 160728 40 -8.46 6.01 Init + 167948 169234 1287 1 0 96 79 991 0.001 89.05 6.02 Intr + 183867 183959 93 2 0 85 81 73 0.202 6.46 6.03 Intr + 184136 185197 1062 1 0 84 59 625 0.874 49.71 6.04 Intr + 185490 185594 105 2 0 115 84 72 0.994 9.91 6.05 Intr + 185790 185910 121 2 1 145 65 137 0.999 17.17 6.06 Intr + 187833 187969 137 1 2 118 82 124 0.989 15.09 6.07 Intr + 189929 190122 194 1 2 63 87 109 0.951 6.59 6.08 Intr + 191732 191971 240 2 0 70 95 355 0.995 31.06 6.09 Intr + 192195 192339 145 0 1 86 81 170 0.844 16.38 6.10 Intr + 192582 192720 139 2 1 89 81 219 0.997 21.34 6.11 Intr + 192933 193072 140 1 2 55 90 104 0.991 7.48 6.12 Intr + 194844 194978 135 2 0 27 109 130 0.944 9.76 6.13 Intr + 196619 196738 120 1 0 87 63 97 0.995 7.79 6.14 Intr + 197376 197547 172 2 1 148 62 97 0.999 12.62 6.15 Intr + 198241 198369 129 2 0 81 101 127 0.999 13.97 6.16 Intr + 198494 198650 157 0 1 84 100 93 0.999 9.17 6.17 Intr + 199170 199279 110 0 2 68 99 173 0.999 16.33 6.18 Intr + 201036 202024 989 1 2 105 100 645 0.824 57.99 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 17157 17279 123 2 0 132 42 40 0.818 2.18 S.002 Init + 18690 18771 82 2 1 108 89 20 0.858 4.44 S.003 Term - 126975 126788 188 1 2 129 47 209 0.924 18.55 S.004 Sngl + 167948 169300 1353 1 0 96 40 1075 0.998 97.65 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:53084889_53287587|GENSCAN_predicted_peptide_1|830_aa MTPHRLLPPLLLLLALLLAASPGGALARCPGCGQGVQAGCPGGCVEEEDGGSPAEGCAEA EGCLRREGQECGVYTPNCAPGLQCHPPKDDEAPLRALLLGRGRCLPARAPAVAEENPKES KPQAGTARPQDVNRRDQQRNPGTSTTPSQPNSAGVQDTEMGPCRRHLDSVLQQLQTEVYR GAQTLYVPNCDHRGFYRKRQVLNTCLFSLLSAYMLTQAGVRKRPCQFRGPDTQPFLEGPQ FRNTETHRAPDLVQWTRHMEAVKAQLLEQAQGQLRELLDRAMREAIQSYPSQDKPLPPPP PGSLSRTQEPSLGKQKVFIIRKSLLDELMEVQHFRTIYHMFIAGLCVFIISTLAIDFIDE GRLLLEFDLLIFSFGQLPLALVTWVPMFLSTLLAPYQALRLWARGTWTQATGLGCALLAA HAVVLCALPVHVAVEHQLPPASRCVLVFEQVRFLMKSYSFLREAVPGTLRARRGEGIQAP SFSSYLYFLFCPTLIYRETYPRTPYVRWNYVAKNFAQALGCVLYACFILGRLCVPVFANM SREPFSTRALVLSILHATLPGIFMLLLIFFAFLHCWLNAFAEMLRFGDRMFYRDWWNSTS FSNYYRTWNVVVHDWLYSYVYQDGLRLLGARARGVAMLGVFLVSAVAHEYIFCFVLGFFY PVMLILFLVIGGMLNFMMHDQRTGPAWNVLMWTMLFLGQGIQVSLYCQEWYARRHCPLPQ LHQAAVPLLSSLSQPHFSKELPVPTVEVFHLTFTQVLIIGTQEKLIRDFDEKQQEANKML TQMEEELHYAPVSFHNPMMSKLQDYQKDLAQFHLEARSTPLQPCLGTEET >gi568815586f:53084889_53287587|GENSCAN_predicted_CDS_1|2493_bp atgaccccccacaggctgctgccaccgctgctgctgctgctagctctgctgctcgctgcc agcccaggaggcgccttggcgcggtgcccaggctgcgggcaaggggtgcaggcgggttgt ccagggggctgcgtggaggaggaggatggggggtcgccagccgagggctgcgcggaagct gagggctgtctcaggagggaggggcaggagtgcggggtctacacccctaactgcgcccca ggactgcagtgccatccgcccaaggacgacgaggcgcctttgcgggcgctgctgctcggc cgaggccgctgccttccggcccgcgcgcctgctgttgcagaggagaatcctaaggagagt aaaccccaagcaggcactgcccgcccacaggatgtgaaccgcagagaccaacagaggaat ccaggcacctctaccacgccctcccagcccaattctgcgggtgtccaagacactgagatg ggcccatgccgtagacatctggactcagtgctgcagcaactccagactgaggtctaccga ggggctcaaacactctacgtgcccaattgtgaccatcgaggcttctaccggaagcggcag gtgctgaatacgtgtttgtttagtctcctttctgcctacatgctcacccaagcaggtgtc aggaagcggccctgtcagttcaggggccctgacactcagcctttcctggaggggccccag ttccgaaacactgagacgcacagagccccggacttggtacaatggacccgacacatggag gctgtgaaggcacaattgctggagcaagcgcagggacaactgagggagctgctggatcgg gccatgcgggaggctatacaatcctacccatcacaagacaaacctctgcccccacctccc ccaggttccttgagcaggacccaggagccatccctggggaaacagaaagttttcatcatc cgcaagtccctgcttgatgagctgatggaggtgcagcatttccgcaccatctaccacatg ttcatcgctggcctgtgtgtcttcatcatcagcaccctggccatcgacttcattgatgag ggcaggctgctgctggagtttgacctactgatcttcagcttcggacagctgccattggcg ctggtgacctgggtgcccatgtttctgtccaccctgttggcgccgtaccaggccctacgg ctgtgggccaggggcacctggacgcaggcgacgggcctgggctgtgcgctgctagccgcc cacgccgtggtgctctgcgcgctgccggtccacgtggccgtggagcatcagctcccgccg gcctcccgttgtgtcctggtcttcgagcaggttaggttcctgatgaaaagctactccttc ctgagagaggctgtgcctgggacccttcgtgccagacgaggtgaggggatccaggccccc agtttctccagctacctctacttcctcttctgcccaacactcatctacagggagacttac cctaggacgccctatgtcaggtggaattatgtggccaagaactttgcccaggccctggga tgtgtgctctatgcctgcttcatcctgggccgcctctgtgttcctgtctttgccaacatg agccgagagcccttcagcacccgtgccctggtgctctctatcctgcatgccacgttgcca ggcatcttcatgctgctgctcatcttctttgccttcctccattgctggctcaacgccttt gccgagatgctacgatttggagacaggatgttctaccgggactggtggaactcaacgtcc ttctccaactactaccgcacttggaacgtggtggtccatgactggctgtacagctacgtg tatcaggatgggctgcggctccttggtgcccgggcccgaggggtagccatgctgggtgtg ttcctggtctccgcagtggcccatgagtatatcttctgcttcgtcctggggttcttctat cccgtcatgctgatactcttccttgtcattggaggaatgttgaacttcatgatgcatgac cagcgcaccggcccggcatggaacgtgctgatgtggaccatgctgtttctaggccaggga atccaggtcagcctgtactgccaggagtggtacgcacggcggcactgccccttaccccag ctacatcaggctgctgtcccacttctcagctccctgtcacagccacacttctccaaagag ttgcctgtccccactgttgaggtttttcacctcacattcactcaagtgttgatcataggg acccaggagaagctgatcagagattttgatgaaaagcaacaggaagcaaacaaaatgctg acacagatggaggaggaactacattatgcacccgtatctttccataaccccatgatgtct aagcttcaagactatcagaaggaccttgcccaattccatctggaggcaagaagtacacct ttgcagccatgcctggggaccgaggagacatga >gi568815586f:53084889_53287587|GENSCAN_predicted_peptide_2|333_aa MEPKGVIESNWNEIVDSFDDMNLSESLLRGIYAYGLEKPSAIQQRAILPCIKGYHVIAQA QSGAGKMATFAILILQQIELDLKATQALLLAPTQELAQQIQKVVMALGDYMETADEAPHI IVGTPGHVFDMLNGRYLSPKYIKMFVLDEADEMLSRGFKDQIYDIFQKLNSNTQVVLLSA AMPSDVLEVTKKFMRDLIRILVTKEVLTLEGIRQFYINLEREEWKLDTLCDLYETLTITP GSHLHQHPEEAVLITTDLLARGIDVQQVSLVTNYDLPTNRENHIHRIGRGGRFGRKGVAI NVVTEEDKRTLRDIEIFYNSSIEEMPLNVADLI >gi568815586f:53084889_53287587|GENSCAN_predicted_CDS_2|1002_bp atggagcccaaaggtgtcattgagagtaactggaatgagattgttgacagctttgatgac atgaacctctcagagtcccttctccgtggcatctacgcctatggtttagagaagccgtct gccatccagcagcgagccattctaccttgtatcaagggttatcacgtgattgctcaagcc caatctggggctgggaaaatggccacatttgccatattgattctgcagcagattgaatta gatctaaaagccacccaggccttgctcctagcacccactcaagaattggctcagcagata cagaaggtggtcatggcactaggagactacatggaaactgcagacgaagctcctcacatc attgtgggtacccctggccatgtgtttgatatgcttaacgggagatacctgtctcccaaa tacatcaagatgtttgtactggacgaagctgacgaaatgttaagccgtggattcaaggac cagatctatgacatattccaaaagctcaacagcaacacccaggtagttttgctgtcagct gcgatgccttctgatgtgcttgaggtgaccaagaagttcatgagggacctcattcggatt cttgtcacgaaggaagtgttgaccttggagggtatccgccaattctacatcaatttggaa cgagaggagtggaagctggacacactatgtgacttgtatgaaaccctgaccatcacccca ggcagtcatcttcatcaacacccggaggaagcagttttgattaccactgacctgctggcc agaggcattgatgtgcaacaggtttctttagtcaccaactatgaccttcccaccaacagg gaaaaccatatccatagaatcggtcgaggtggacggtttggccgtaaaggtgtggctatt aacgtggtgacagaagaagacaagaggactcttcgagacatcgagatcttctacaactcc tccattgaggaaatgcccctcaatgttgctgacctcatctga >gi568815586f:53084889_53287587|GENSCAN_predicted_peptide_3|491_aa MEEGALRPARAPWPGRAPAAIETEGRTLGPSMIVRWVLDGRRVCWILMADSEALPSLAGD PVAVEALLRAVFGVVVDEAIQKGTSVSQKVCEWKEPEELKQLLDLELRSQGESQKQILER CRAVIRYSVKTGHPRFFNQLFSGLDPHALAGRIITESLNTSQYTYEIAPVFVLMEEEVLR KLRALVGWSSGDGIFCPGGSISNMYAVNLARYQRYPDCKQRGLRTLPPLALFTSKECHYS IQKGAAFLGLGTDSVRVVKADERGKMVPEDLERQIGMAEAEGAVPFLVSATSGTTVLGAF DPLEAIADVCQRHGLWLHVDNLLKRCHGSQASYLFQQDKFYDVALDTGDKVVQCGRRVDC LKLWLMWKAQGDQGLERRIDQAFVLARYLVEEMKKREGFELVMEPEFVNVCFWFVPPSLR GKQESPDYHERLSKVAPVLKERMVKEGSMMIGYQPHGTRGNFFRVVVANSALTCADMDFL LNELERLGQDL >gi568815586f:53084889_53287587|GENSCAN_predicted_CDS_3|1476_bp atggaggagggagcgctgaggccggcgcgtgcgccgtggccggggcgcgccccggccgcc atcgagactgagggcaggaccctcggcccttcgatgattgtacgctgggtgctcgatggg aggagggtttgctggatcctgatggctgactcagaagcactcccctcccttgctggggac ccagtggctgtggaagccttgctccgggccgtgtttggggttgttgtggatgaggccatt cagaaaggaaccagtgtctcccagaaggtctgtgagtggaaggagcctgaggagctgaag cagctgctggatttggagctgcggagccagggcgagtcacagaagcagatcctggagcgg tgtcgggctgtgattcgctacagtgtcaagactggtcaccctcggttcttcaaccagctc ttctctgggttggatccccatgctctggccgggcgcattatcactgagagcctcaacacc agccagtacacatatgaaatcgcccccgtgtttgtgctcatggaagaggaggtgctgagg aaactgcgggccctggtgggctggagctctggggacggaatcttctgccctggtggctcc atctccaacatgtatgctgtaaatctggcccgctatcagcgctacccggattgcaagcag aggggcctccgcacactgccgcccctggccctattcacatcgaaggagtgtcactactcc atccagaagggagctgcgtttctgggacttggcaccgacagtgtccgagtggtcaaggct gatgagagagggaaaatggtccccgaggatctggagaggcagattggtatggccgaggct gagggtgctgtgccgttcctggtcagtgccacctctggcaccactgtgctaggggccttt gaccccctggaggcaattgctgatgtgtgccagcgtcatgggctatggctgcatgtggat aacctgctcaagcgctgccatgggtcccaggccagctaccttttccagcaggacaagttc tacgatgtggctctggacacgggagacaaggtggtgcagtgtggccgccgtgtggactgt ctgaagctgtggctcatgtggaaggcacagggcgatcaagggctggagcggcgcatcgac caggcctttgtccttgcccggtacctggtggaggaaatgaagaagcgggaagggtttgag ctagtcatggagcctgagtttgtcaatgtgtgtttctggttcgtaccccccagcctgcga gggaagcaggagagtccagattaccacgaaaggctgtcaaaggtggcccccgtgctcaag gagcgcatggtgaaggagggctccatgatgattggctaccagccccacgggacccggggc aacttcttccgtgtggttgtggccaactctgcactgacctgtgctgatatggacttcctc ctcaacgagctggagcggctaggccaggacctgtga >gi568815586f:53084889_53287587|GENSCAN_predicted_peptide_4|166_aa MMLSQIASKQAENGERAGSPDVLRCSSQVVVVEQNGSFQVKIPKNFVCEHCFGAFRSSYH LKRHILIHTGEKPFECDICDMRFIQKYHLERHKRVHSGEKPYQCERCHQGQKEKRGGADK GWIQKVDGTSSFLKGKGRVVPVGVVMAHVTLAPDGISSNAFRRKLG >gi568815586f:53084889_53287587|GENSCAN_predicted_CDS_4|501_bp atgatgctgagccagattgccagcaagcaggccgagaatggcgagcgggcaggtagccct gatgtgctgaggtgctcgagtcaggtggtggtagtggaacaaaatggttcttttcaagta aagattcccaaaaattttgtttgtgaacactgctttggagcctttcggagcagttaccac ctaaagaggcacatccttattcacactggtgagaagccatttgaatgcgatatatgtgat atgcgtttcatccagaagtaccacctggaacgccacaagcgtgtgcacagtggtgaaaag ccttaccagtgtgaacggtgtcatcagggacaaaaggaaaaacggggtggagcagacaag ggctggatccaaaaagtggatggcacctcttccttcctcaagggaaaaggaagagttgtg ccagtgggtgttgtcatggcccatgtcacattggcacctgatggcatcagctcaaatgcc tttaggaggaaacttgggtga >gi568815586f:53084889_53287587|GENSCAN_predicted_peptide_5|1481_aa MEILCKTDSYIHPHTGWQTPVALASHTLQATVEADVLAPGSGTGAPQYGEPRTLRRSIQE TARRRDLGAPPPPFPLPLQQLRPSSLNLTQYVEASLCRRPAGLLEAQWAGQAGRTPGDSH TAAAMATNKERLFAAGALGPGSGYPGAGFPFAFPGALRGSPPFEMLSPSFRGLGQPDLPK EMASLSPYASPPPPPLLERGAAGGGGGIGCGSLVFPAPSFPSSRVAMYDCMETFAPGPRR LYGAAGPGAGLLRRATGGSCFAGLESFAWPQPASLQSVETQSTSSEEMVPSSPSPPPPPR VYKPCFVCNDKSSGYHYGVSSCEGCKGFFRRSIQKNMVYTCHRDKNCIINKVTRNRCQYC RLQKCFEVGMSKEAVRNDRNKKKKEVKEEGSPDSYELSPQLEELITKVSKAHQETFPSLC QLGKYTTNSSADHRVQLDLGLWDKFSELATKCIIKIVEFAKRLPGFTGLSIADQITLLKA ACLDILMLRICTRYTPEQDTMTFSDGLTLNRTQMHNAGFGPLTDLVFAFAGQLLPLEMDD TETGLLSAICLICGDRMDLEEPEKVDKLQEPLLEALRLYARRRRPSQPYMFPRMLMKITD LRGISTKGAERAITLKMEIPGPMPPLIREMLENPEMFEDDSSQPGPHPNASSEDEISTQR LLLPPEEGLLCTHLIQDCKSRQGMVALPMVLVLLLVLSRGESELDAKIPSTGDATEWRNP HLSMLGSCQPAPSCQKCILSHPSCAWCKQLNFTASGEAEARRCARREELLARGCPLEELE EPRGQQEVLQDQPLSQGARGEGATQLAPQRVRVTLRPGEPQQLQVRFLRAEGYPVDLYYL MDLSYSMKDDLERVRQLGHALLVRLQEVTHSVRIGFGSFVDKTVLPFVSTVPSKLRHPCP TRLERCQSPFSFHHVLSLTGDAQAFEREVGRQSVSGNLDSPEGGFDAILQAALCQEQIGW RNVSRLLVFTSDDTFHTAGDGKLGGIFMPSDGHCHLDSNGLYSRSTEFDYPSVGQVAQAL SAANIQPIFAVTSAALPVYQELSKLIPKSAVGELSEDSSNVVQLIMDAYNSLSSTVTLEH SSLPPGVHISYESQCEGPEKREGKAEDRGQCNHVRINQTVTFWVSLQATHCLPEPHLLRL RALGFSEELIVELHTLCDCNCSDTQPQAPHCSDGQGHLQCGVCSCAPGRLGRLCECSVAE LSSPDLESGCRAPNGTGPLCSGKGHCQCGRCSCSGQSSGHLCECDDASCERHEGILCGGF GRCQCGVCHCHANRTGRACECSGDMDSCISPEGGLCSGHGRCKCNRCQCLDGYYGALCDQ CPGCKTPCERHRDCAECGAFRTGPLATNCSTACAHTNVTLALAPILDDGWCKERTLDNQL FFFLVEDDARGTVVLRVRPQEKGADHTQAIVLGCVGGIVAVGLGLVLAYRLSVEIYDRRE YSRFEKEQQQLNWKQDSNPLYKSAITTTINPRFQEADSPTL >gi568815586f:53084889_53287587|GENSCAN_predicted_CDS_5|4446_bp atggagattctgtgcaaaacagacagctacatccacccacacactggctggcagacacct gtagcactcgcctcacacacactccaggcaactgtggaggcagacgtgctagctccaggg agtgggacaggagccccccagtacggcgagccccggacattgcgacgctccatccaagag actgcccgacgccgggacctcggggctccgccgcctcccttccccctcccactccagcag ctacggcccagttccctcaacctgacccagtatgtagaagccagtctctgcaggcggcca gcgggacttttggaggcccagtgggcaggccaggcagggcggaccccaggggactctcac accgcagctgccatggccaccaataaggagcgactctttgcggctggtgccctggggcct ggatctggctacccaggggcaggtttccccttcgccttcccaggggcactcagggggtct ccgcctttcgagatgctgagccctagcttccggggcctgggccagcctgacctccccaag gagatggcctctctgtcgccctatgctagccctccccctccccccctgctggagcggggc gccgccgggggaggagggggaatcggctgcgggtccttggtgtttccagcacccagtttc ccttcaagccgggtcgcgatgtacgactgtatggaaacgtttgccccgggtccgcgacgg ctgtacggggcggccgggcccggggccggcttgctgcgcagagccaccggcggctcctgt ttcgccggacttgaatcttttgcctggccgcaacccgccagcctgcaatcggtggagaca cagagcaccagctcagaggagatggtgcccagctcgccctcgccccctccgcctcctcgg gtctacaagccatgcttcgtgtgcaatgacaagtcctctggctaccactatggggtcagc tcttgtgaaggctgcaagggcttctttcgccgaagcatccagaagaacatggtgtacacg tgtcaccgcgacaaaaactgtatcatcaacaaggtgaccaggaatcgctgccagtactgc cggctacagaagtgcttcgaagtgggcatgtccaaggaagctgtgcgaaatgaccggaac aagaagaagaaagaggtgaaggaagaagggtcacctgacagctatgagctgagccctcag ttagaagagctcatcaccaaggtcagcaaagcccatcaggagactttcccctcgctctgc cagctgggcaagtataccacgaactccagtgcagaccaccgcgtgcagctggatctgggg ctgtgggacaagttcagtgagctggctaccaagtgcatcatcaagatcgtggagtttgcc aagcggttgcctggctttacagggctcagcattgctgaccagatcactctgctcaaagct gcctgcctagatatcctgatgctgcgtatctgcacaaggtacaccccagagcaggacacc atgaccttctccgacgggctgaccctgaaccggacccagatgcacaatgccggcttcggg cccctcacagaccttgtctttgcctttgctgggcagctcctgcccctggagatggatgac accgagacagggctgctcagcgccatctgcctcatctgcggagaccgcatggacctggag gagcccgaaaaagtggacaagctgcaggagccactgctggaagccctgaggctgtacgcc cggcgccggcggcccagccagccctacatgttcccaaggatgctaatgaaaatcaccgac ctccggggcatcagcactaagggagctgaaagggccattactctgaagatggagattcca ggcccgatgcctcccttaatccgagagatgctggagaaccctgaaatgtttgaggatgac tcctcgcagcctggtccccaccccaatgcctctagcgaggatgagatcagtacacaaagg ctgctgctgccgccagaggaaggactgctctgcacgcacctaatccaagattgtaaaagc cgccaaggcatggtggctttgccaatggtccttgttttgctgctggtcctgagcagaggt gagagtgaattggacgccaagatcccatccacaggggatgccacagaatggcggaatcct cacctgtccatgctggggtcctgccagccagccccctcctgccagaagtgcatcctctca caccccagctgtgcatggtgcaagcaactgaacttcaccgcgtcgggagaggcggaggcg cggcgctgcgcccgacgagaggagctgctggctcgaggctgcccgctggaggagctggag gagccccgcggccagcaggaggtgctgcaggaccagccgctcagccagggcgcccgcgga gagggtgccacccagctggcgccgcagcgggtccgggtcacgctgcggcctggggagccc cagcagctccaggtccgcttccttcgtgctgagggatacccggtggacctgtactacctt atggacctgagctactccatgaaggacgacctggaacgcgtgcgccagctcgggcacgct ctgctggtccggctgcaggaagtcacccattctgtgcgcattggttttggttcctttgtg gacaaaacggtgctgccctttgtgagcacagtaccctccaaactgcgccacccctgcccc acccggctggagcgctgccagtcaccattcagctttcaccatgtgctgtccctgacgggg gacgcacaagccttcgagcgggaggtggggcgccagagtgtgtccggcaatctggactcg cctgaaggtggcttcgatgccattctgcaggctgcactctgccaggagcagattggctgg agaaatgtgtcccggctgctggtgttcacttcagacgacacattccatacagctggggac gggaagttgggcggcattttcatgcccagtgatgggcactgccacttggacagcaatggc ctctacagtcgcagcacagagtttgactacccttctgtgggtcaggtagcccaggccctc tctgcagcaaatatccagcccatctttgctgtcaccagtgccgcactgcctgtctaccag gagctgagtaaactgattcctaagtctgcagttggggagctgagtgaggactccagcaac gtggtacagctcatcatggatgcttataatagcctgtcttccaccgtgacccttgaacac tcttcactccctcctggggtccacatttcttacgaatcccagtgtgagggtcctgagaag agggagggtaaggctgaggatcgaggacagtgcaaccacgtccgaatcaaccagacggtg actttctgggtttctctccaagccacccactgcctcccagagccccatctcctgaggctc cgggcccttggcttctcagaggagctgattgtggagttgcacacgctgtgtgactgtaat tgcagtgacacccagccccaggctccccactgcagtgatggccagggacacctacaatgt ggtgtatgcagctgtgcccctggccgcctaggtcggctctgtgagtgctctgtggcagag ctgtcctccccagacctggaatctgggtgccgggctcccaatggcacagggcccctgtgc agtggaaagggtcactgtcaatgtggacgctgcagctgcagtggacagagctctgggcat ctgtgcgagtgtgacgatgccagctgtgagcgacatgagggcatcctctgcggaggcttt ggtcgctgccaatgtggagtatgtcactgtcatgccaaccgcacgggcagagcatgcgaa tgcagtggggacatggacagttgcatcagtcccgagggagggctctgcagtgggcatgga cgctgcaaatgcaaccgctgccagtgcttggacggctactatggtgctctatgcgaccaa tgcccaggctgcaagacaccatgcgagagacaccgggactgtgcagagtgtggggccttc aggactggcccactggccaccaactgcagtacagcttgtgcccataccaatgtgaccctg gccttggcccctatcttggatgatggctggtgcaaagagcggaccctggacaaccagctg ttcttcttcttggtggaggatgacgccagaggcacggtcgtgctcagagtgagaccccaa gaaaagggagcagaccacacgcaggccattgtgctgggctgcgtagggggcatcgtggca gtggggctggggctggtcctggcttaccggctctcggtggaaatctatgaccgccgggaa tacagtcgctttgagaaggagcagcaacaactcaactggaagcaggacagtaatcctctc tacaaaagtgccatcacgaccaccatcaatcctcgctttcaagaggcagacagtcccact ctctga >gi568815586f:53084889_53287587|GENSCAN_predicted_peptide_6|1825_aa MLVTAYLAFVGLLASCLGLELSRCRAKPPGRACSNPSFLRFQLDFYQVYFLALAADWLQA PYLYKLYQHYYFLEGQIAILYVCGLASTVLFGLVASSLVDWLGRKNSCVLFSLTYSLCCL TKLSQDYFVLLVGRALGGLSTALLFSAFEAWYIHEHVERHDFPAEWIPATFARAAFWNHV LAVVAGVAAEAVASWIGLGPVAPFVAAIPLLALAGALALRNWGENYDRQRAFSRTCAGGL RCLLSDRRVLLLGTIQALFESVIFIFVFLWTPVLDPHGAPLGIIFSSFMAASLLGSSLYR IATSKRYHLQPMHLLSLAVLIVVFSLFMLTFSTSPGQESPVESFIAFLLIELACGLYFPS MSFLRRKVIPETEQAGVLNWFRVPLHSLACLGLLVLHDSDRKTGTRNMFSICSAVMVMAL LAVVGLFTVLSGVMRSFKRVNFGTLLSSQKEAEELLPALKEFLSNPPAGFPSSRSDAERR QACDAILRACNQQLTAKLACPRHLGSLLELAELACDGYLVSTPQRPPLYLERILFVLLRN AAAQGSPEATLRLAQPLHACLVQCSREAAPQDYEAVARGSFSLLWKGAEALLERRAAFAA RLKALSFLVLLEDESTPCEVPHFASPTACRAVAAHQLFDASGHGLNEADADFLDDLLSRH VIRALVGERGSSSGLLSPQRALCLLELTLEHCRRFCWSRHHDKAISAVEKAHSYLRNTNL APSLQLCQLGVKLLQVGEEGPQAVAKLLIKASAVLSKSMEAPSPPLRALYESCQFFLSGL ERGTKRRYRLDAILSLFAFLGGYCSLLQQLRDDGVYGGSSKQQQSFLQMYFQGLHLYTVV VYDFAQGCQIVDLADLTQLVDSCKSTVVWMLEALEGLSGQELTDHMGMTASYTSNLAYSF YSHKLYAEACAISEPLCQHLGLVKPGTYPEVPPEKLHRCFRLQVESLKKLGKQAQGCKMV ILWLAALQPCSPEHMAEPVTFWVRVKMDAARAGDKELQLKTLRDSLSGWDPETLALLLRE ELQAYKAVRADTGQERFNIICDLLELSPEETPAGAWARATHLVELAQVLCYHDFTQQTNC SALDAIREALQLLDSVRPEAQARDQLLDDKAQALLWLYICTLEAKMQEGIERDRRAQAPG NLEEFEVNDLNYEDKLQEDRFLYSNIAFNLAADAAQSKCLDQALALWKELLTKGQAPAVR CLQQTAASLQILAALYQLVAKPMQALEVLLLLRIVSERLKDHSKAAGSSCHITQLLLTLG CPSYAQLHLEEAASSLKHLDQTTDTYLLLSLTCDLLRSQLYWTHQKVTKGVSLLLSVLRD PALQKSSKAWYLLRVQVLQLVAAYLSLPSNNLSHSLWEQLCAQGWQTPEIALIDSHKLLR SIILLLMGSDILSTQKAAVETSFLDYGENLVQKWQVLSEVLSCSEKLVCHLGRLGSVSEA KAFCLEALKLTTKLQIPRQCALFLVLKGELELARNDIDLCQSDLQQVLFLLESCTEFGGV TQHLDSVKKVHLQKGKQQAQVPCPPQLPEEELFLRGPALELVATVAKEPGPIAPSTNSSP VLKTKPQPIPNFLSHSPTCDCSLCASPVLTAVCLRWVLVTAGVRLAMGHQAQGLDLLQVV LKGCPEAAERLTQALQASLNHKTPPSLVPSLLDEILAQAYTLLALEGLNQPSNESLQKVL QSGLKFVAARIPHLEPWRASLLLIWALTKLGGLSCCTTQLFASSWGWQPPLIKSVPGSEP SKTQGQKRSGRGRQKLASAPLRLNNTSQKGLEGRGLPCTPKPPDRIRQAGPHVPFTVFEE VCPTESKPEVPQAPRVQQRVQTRLK >gi568815586f:53084889_53287587|GENSCAN_predicted_CDS_6|5475_bp atgctggtgactgcctaccttgcttttgtaggcctcctggcctcctgcctggggctggaa ctgtcaagatgccgggctaaaccccctggaagggcctgcagcaatccctccttccttcgg tttcaactggacttctatcaggtctacttcctggccctggcagctgattggcttcaggcc ccctacctctataaactctaccagcattactacttcctggaaggtcaaattgccatcctc tatgtctgtggccttgcctctacagtcctctttggcctagtggcctcctcccttgtggat tggctgggtcgcaagaattcttgtgtcctcttctccctgacttactcactatgctgctta accaaactctctcaagactactttgtgctgctagtggggcgagcacttggtgggctgtcc acagccctgctcttctcagccttcgaggcctggtatatccatgagcacgtggaacggcat gacttccctgctgagtggatcccagctacctttgctcgagctgccttctggaaccatgtg ctggctgtagtggcaggtgtggcagctgaggctgtagccagctggatagggctggggcct gtagcgccctttgtggctgccatccctctcctggctctggcaggggccttggcccttcga aactggggggagaactatgaccggcagcgtgccttctcaaggacctgtgctggaggcctg cgctgcctcctgtcggaccgccgcgtgctgctgttgggcaccatacaagctctatttgag agtgtcatcttcatctttgtcttcctctggacacctgtgctggacccacacggggcccct ctgggcattatcttctccagcttcatggcagccagcctgcttggctcttccctgtaccgt atcgccacctccaagaggtaccaccttcagcccatgcacctgctgtcccttgctgtgctc atcgtcgtcttctctctcttcatgttgactttctctaccagcccaggccaggagagtccg gtggagtccttcatagcctttctacttattgagttggcttgtggattatactttcccagc atgagcttcctacggagaaaggtgatccctgagacagagcaggctggtgtactcaactgg ttccgggtacctctgcactcactggcttgcctagggctccttgtcctccatgacagtgat cgaaaaacaggcactcggaatatgttcagcatttgctctgctgtcatggtgatggctctg ctggcagtggtgggactcttcaccgtgctctccggtgtcatgaggagcttcaaaagagtc aactttgggactctgctaagcagccagaaggaggctgaagagttgctgcccgccttgaag gagttcctgtccaaccctccagctggttttcccagcagccgatctgatgctgagaggaga caagcttgtgatgccatcctgagggcttgcaaccagcagctgactgctaagctagcttgc cctaggcatctggggagcctgctggagctggcagagctggcctgtgatggctacttagtg tctaccccacagcgtcctcccctctacctggaacgaattctctttgtcttactgcggaat gctgctgcacaaggaagcccagaggccacactccgccttgctcagcccctccatgcctgc ttggtgcagtgctctcgcgaggctgctccccaggactatgaggccgtggctcggggcagc ttttctctgctttggaagggggcagaagccctgttggaacggcgagctgcatttgcagct cggctgaaggccttgagcttcctagtactcttggaggatgaaagtaccccttgtgaggtt cctcactttgcttctccaacagcctgtcgagcggtagctgcccatcagctatttgatgcc agtggccatggtctaaatgaagcagatgctgatttcctagatgacctgctctccaggcac gtgatcagagccttggtgggtgagagagggagctcttctgggcttctttctccccagagg gccctctgcctcttggagctcaccttggaacactgccgtcgcttttgctggagccgccac catgacaaagccatcagcgcagtggagaaggctcacagttacctaaggaacaccaatcta gcccctagccttcagctatgtcagctgggggttaagctgctgcaggttggggaggaagga cctcaggcagtggccaagcttctgatcaaggcatcagctgtcctgagcaagagtatggag gcaccatcacccccacttcgggcattgtatgagagctgccagttcttcctttcaggcctg gaacgaggcaccaagaggcgctatagacttgatgccattctgagcctctttgcttttctt ggagggtactgctctcttctgcagcagctgcgggatgatggtgtgtatgggggctcctcc aagcaacagcagtcttttcttcagatgtactttcagggacttcacctctacactgtggtg gtttatgactttgcccaaggctgtcagatagttgatttggctgacctgacccaactagtg gacagttgtaaatctaccgttgtctggatgctggaggccttagagggcctgtcgggccaa gagctgacggaccacatggggatgaccgcttcttacaccagtaatttggcctacagcttc tatagtcacaagctctatgccgaggcctgtgccatctctgagccgctctgtcagcacctg ggtttggtgaagccaggcacttatcccgaggtgcctcctgagaagttgcacaggtgcttc cggctacaagtagagagtttgaagaaactgggtaaacaggcccagggctgcaagatggtg attttgtggctggcagccctgcaaccctgtagccctgaacacatggctgagccagtcact ttctgggttcgggtcaagatggatgcggccagggctggagacaaggagctacagctaaag actctgcgagacagcctcagtggctgggacccggagaccctggccctcctgctgagggag gagctgcaggcctacaaggcggtgcgggccgacactggacaggaacgcttcaacatcatc tgtgacctcctggagctgagccccgaggagacaccagccggggcctgggcacgagccacc cacctggtagaactggctcaggtgctctgctaccacgactttacgcagcagaccaactgc tctgctctggatgctatccgggaagccctgcagcttctggactctgtgaggcctgaggcc caggccagagatcagcttctggacgataaagcacaggccttgctgtggctttacatctgt actctggaagccaaaatgcaggaaggtatcgagcgggatcggagagcccaggcccctggt aacttggaggaatttgaagtcaatgacctgaactatgaagataaactccaggaagatcgt ttcctatacagtaacattgccttcaacctggctgcagatgctgctcagtccaaatgcctg gaccaagccctggccctgtggaaggagctgcttacaaaggggcaggccccagctgtacgg tgtctccagcagacagcagcctcactgcagatcctagcagccctctaccagctggtggca aagcccatgcaggctctggaggtcctcctgctgctacggattgtctctgagagactgaag gaccactcgaaggcagctggctcctcctgccacatcacccagctcctcctgaccctcggc tgtcccagctatgcccagttacacctggaagaggcagcatcgagcctgaagcatctcgat cagactactgacacatacctgctcctttccctgacctgtgatctgcttcgaagtcaactc tactggactcaccagaaggtgaccaagggtgtctctctgctgctgtctgtgcttcgggat cctgccctccagaagtcctccaaggcttggtacttgctgcgtgtccaggtcctgcagctg gtggcagcttaccttagcctcccgtcaaacaacctctcacactccctgtgggagcagctc tgtgcccaaggctggcagacacctgagatagctctcatagactcccataagctcctccga agcatcatcctcctgctgatgggcagtgacattctctcaactcagaaagcagctgtggag acatcgtttttggactatggtgaaaatctggtacaaaaatggcaggttctttcagaggtg ctgagctgctcagagaagctggtctgccacctgggccgcctgggtagtgtgagtgaagcc aaggccttttgcttggaggccctaaaacttacaacaaagctgcagataccacgccagtgt gccctgttcctggtgctgaagggcgagctggagctggcccgcaatgacattgatctctgt cagtcggacctgcagcaggttctgttcttgcttgagtcttgcacagagtttggtggggtg actcagcacctggactctgtgaagaaggtccacctgcagaaggggaagcagcaggcccag gtcccctgtcctccacagctcccagaggaggagctcttcctaagaggccctgctctagag ctggtggccactgtggccaaggagcctggccccatagcaccttctacaaactcctcccca gtcttgaaaaccaagccccagcccatacccaacttcctgtcccattcacccacctgtgac tgctcgctctgcgccagccctgtcctcacagcagtctgtctgcgctgggtattggtcacg gcaggggtgaggctggccatgggccaccaagcccagggtctggatctgctgcaggtcgtg ctgaagggctgtcctgaagccgctgagcgcctcacccaagctctccaagcttccctgaat cataaaacacccccctccttggttccaagcctcttggatgagatcttggctcaagcatac acactgttggcactggagggcctgaaccagccatcaaacgagagcctgcagaaggttcta cagtcagggctgaagtttgtagcagcacggataccccacctagagccctggcgagccagc ctgctcttgatttgggccctcacaaaactaggtggcctcagctgctgtactacccaactt tttgcaagctcctggggctggcagccaccattaataaaaagtgtccctggctcagagccc tctaagactcagggccaaaaacgttctggacgagggcgccaaaagttagcctctgctccc ctgcgcctcaataatacctctcagaaaggtctggaaggtagaggactgccctgcacacct aaacccccagaccggatcaggcaagctggccctcatgtccccttcacggtgtttgaggaa gtctgccctacagagagcaagcctgaagtaccccaggcccccagggtacaacagagagtc cagacgcgcctcaag