GENSCAN 1.0 Date run: 7-Nov-116 Time: 18:01:56 Sequence gi568815575f:154233178_154370981 : 137804 bp : 52.70% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 Term - 790 434 357 1 0 93 49 150 0.347 6.47 1.08 Intr - 1545 1410 136 2 1 55 72 116 0.109 7.78 1.07 Intr - 10382 10267 116 2 2 70 32 98 0.045 2.15 1.06 Intr - 10713 10691 23 1 2 85 113 12 0.707 1.15 1.05 Intr - 11626 11527 100 1 1 80 80 54 0.710 3.98 1.04 Intr - 12732 12540 193 2 1 52 37 48 0.018 -3.88 1.03 Intr - 18112 17385 728 1 2 77 66 750 0.049 62.57 1.02 Intr - 18685 18618 68 2 2 51 54 86 0.524 0.62 1.01 Init - 21339 21276 64 0 1 33 90 67 0.588 0.66 1.00 Prom - 22485 22446 40 -5.11 2.00 Prom + 23491 23530 40 -3.21 2.01 Init + 24443 24554 112 1 1 93 83 140 0.497 14.34 2.02 Intr + 29535 29831 297 1 0 129 91 576 0.796 59.39 2.03 Intr + 31819 31987 169 2 1 118 110 120 0.975 16.72 2.04 Intr + 33455 33620 166 2 1 82 103 279 0.997 29.28 2.05 Intr + 35175 35414 240 2 0 127 115 327 0.999 37.58 2.06 Term + 37697 37807 111 1 0 116 51 123 0.996 10.26 2.07 PlyA + 37883 37888 6 1.05 3.10 PlyA - 37899 37894 6 -4.33 3.09 Term - 38594 38238 357 2 0 93 49 150 0.823 6.47 3.08 Intr - 39349 39214 136 0 1 55 72 116 0.174 7.78 3.07 Intr - 47503 47388 116 1 2 70 32 98 0.062 2.15 3.06 Intr - 47834 47812 23 0 2 85 113 12 0.737 1.15 3.05 Intr - 48747 48648 100 0 1 80 80 54 0.739 3.98 3.04 Intr - 52929 52890 40 2 1 98 37 13 0.022 -4.09 3.03 Intr - 55230 54503 728 0 2 77 66 750 0.049 62.57 3.02 Intr - 59028 58929 100 2 1 104 84 46 0.929 6.41 3.01 Init - 62846 62602 245 2 2 57 43 284 0.519 18.00 3.00 Prom - 65223 65184 40 -2.11 4.00 Prom + 69341 69380 40 -4.21 4.01 Init + 71081 71169 89 1 2 74 56 47 0.524 0.18 4.02 Intr + 72127 72244 118 1 1 82 91 186 0.940 19.27 4.03 Intr + 76168 76265 98 0 2 103 94 100 0.995 11.41 4.04 Intr + 77659 77850 192 1 0 81 28 150 0.988 7.43 4.05 Intr + 77934 78061 128 0 2 108 82 169 0.999 19.13 4.06 Intr + 79403 79596 194 0 2 93 102 142 0.945 15.83 4.07 Intr + 81996 82160 165 2 0 117 101 186 0.998 23.47 4.08 Intr + 87580 87736 157 0 1 111 87 186 0.995 20.90 4.09 Intr + 90030 90160 131 1 2 97 63 222 0.999 21.52 4.10 Intr + 92162 92245 84 1 0 80 116 96 0.999 12.11 4.11 Intr + 94414 94510 97 0 1 25 86 107 0.994 4.28 4.12 Intr + 94662 94781 120 1 0 100 93 185 0.993 21.17 4.13 Term + 96339 96511 173 1 2 80 38 170 0.949 9.50 4.14 PlyA + 97154 97159 6 1.05 5.00 Prom + 106765 106804 40 -1.91 5.01 Init + 108477 108670 194 2 2 82 36 159 0.002 6.68 5.02 Intr + 109130 109201 72 2 0 98 60 29 0.001 0.12 5.03 Intr + 111005 111090 86 2 2 113 60 26 0.338 2.26 5.04 Intr + 112013 112228 216 0 0 53 54 134 0.345 5.50 5.05 Intr + 112410 112544 135 1 0 80 72 72 0.895 5.85 5.06 Term + 114135 114313 179 1 2 88 51 166 0.997 11.07 5.07 PlyA + 115329 115334 6 -3.64 6.47 PlyA - 115374 115369 6 1.05 6.46 Term - 115859 115672 188 0 2 78 47 349 0.994 27.77 6.45 Intr - 116388 116185 204 1 0 102 80 187 0.999 19.00 6.44 Intr - 116690 116472 219 0 0 102 99 389 0.998 40.20 6.43 Intr - 117030 116854 177 1 0 108 111 240 0.977 28.71 6.42 Intr - 117864 117732 133 0 1 82 100 154 0.911 16.72 6.41 Intr - 118519 118404 116 2 2 89 94 123 0.985 13.67 6.40 Intr - 118844 118707 138 0 0 70 84 192 0.950 17.94 6.39 Intr - 119270 119004 267 0 0 89 96 263 0.999 25.24 6.38 Intr - 119498 119376 123 0 0 72 58 75 0.920 3.96 6.37 Intr - 119747 119595 153 0 0 23 105 272 0.973 22.96 6.36 Intr - 120027 119824 204 1 0 88 96 323 0.999 32.80 6.35 Intr - 120280 120119 162 2 0 83 94 253 0.982 25.86 6.34 Intr - 120550 120377 174 2 0 62 94 197 0.991 18.13 6.33 Intr - 120866 120738 129 0 0 80 84 208 0.999 20.77 6.32 Intr - 121114 120974 141 2 0 116 80 302 0.999 33.03 6.31 Intr - 121306 121204 103 1 1 119 109 158 0.999 21.25 6.30 Intr - 121534 121439 96 1 0 110 55 144 0.986 14.01 6.29 Intr - 121895 121648 248 0 2 65 89 474 0.843 42.91 6.28 Intr - 124446 124257 190 0 1 105 99 409 0.999 43.38 6.27 Intr - 125178 125022 157 2 1 73 101 270 0.998 27.33 6.26 Intr - 125391 125268 124 1 1 25 94 200 0.998 14.45 6.25 Intr - 125977 125807 171 2 0 72 113 68 0.636 8.13 6.24 Intr - 126229 126069 161 0 2 33 94 272 0.763 22.44 6.23 Intr - 126469 126307 163 2 1 101 110 294 0.999 32.45 6.22 Intr - 126728 126555 174 0 0 120 69 319 0.989 33.63 6.21 Intr - 127410 126813 598 0 1 119 96 921 0.993 88.73 6.20 Intr - 128393 128131 263 0 2 117 86 347 0.968 35.14 6.19 Intr - 128610 128493 118 0 1 107 89 105 0.999 13.04 6.18 Intr - 128971 128802 170 2 2 116 68 348 0.999 35.78 6.17 Intr - 129155 129065 91 2 1 120 81 120 0.848 14.57 6.16 Intr - 129401 129241 161 0 2 117 94 446 0.999 48.32 6.15 Intr - 129607 129484 124 1 1 130 86 159 0.999 20.56 6.14 Intr - 130988 130845 144 2 0 89 84 235 0.999 24.19 6.13 Intr - 131195 131082 114 2 0 78 80 175 0.919 16.85 6.12 Intr - 131542 131349 194 2 2 24 65 414 0.951 32.43 6.11 Intr - 131780 131644 137 1 2 86 106 259 0.985 28.22 6.10 Intr - 132082 131959 124 2 1 74 94 240 0.999 23.35 6.09 Intr - 132309 132172 138 1 0 70 100 184 0.999 18.74 6.08 Intr - 133047 132847 201 1 0 -1 102 258 0.994 18.08 6.07 Intr - 133293 133131 163 0 1 87 94 342 0.992 34.76 6.06 Intr - 133462 133385 78 1 0 72 105 163 0.986 16.64 6.05 Intr - 133673 133555 119 0 2 67 75 158 0.989 13.09 6.04 Intr - 134367 134220 148 0 1 82 94 286 0.999 28.92 6.03 Intr - 134561 134464 98 0 2 100 86 113 0.995 12.53 6.02 Intr - 134988 134665 324 1 0 55 76 554 0.394 47.20 6.01 Init - 137804 137696 109 2 1 60 85 254 0.930 22.64 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 55230 54499 732 0 0 77 54 740 0.905 63.41 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:154233178_154370981|GENSCAN_predicted_peptide_1|594_aa MLLLWPVVVAHACNPSTLGGRVEFSAVDCQLYEIKECIWLLYHVAEHTRSPSATLPSNVP SCRSLSSSEDGPSGPSSLADGGLAHNLQDSVRHRILYLSEQLRVEKASRDGNTVSYLKLV SKADRHQVPHIQQAFEKVNQRASATIAQIEHRLHQCHQQLQELEEGCRPEGLLLMAESDP ANCEPPSEKALLSEPPEPGGEDGPVNLPHASRPFILESRFQSLQQGTCLETEDVAQQQNL LLQKVKAELEEAKRFHISLQESYHSLKERSLTDLQLLLESLQEEKCRRSFRSPRRCTVSG RPPQHKPVSWGTRRPRSLSFFVGQEERRLEKLRKRSLVEEMAPKEGMGSLQKGETETRTH PEGERRVETEAEADLPWQKPRKPAASSPFELHSTAGWVVYKERKFNVKALAAGEDLCTAS SYGGRQKGKRAQALMEEQVNGRLQGQLNEIYNLKHNLACSEERMAYLSYERAKEIWEITE TFKSRISKLEMLQQVTQLEAAEHLQSRPPQMLFKFLSPRLSLATVLLVFVSTLCACPSSL ISSRLCTCTMLMLIGLGVLAWQRWRAIPATDWQEWVPSRCRLYSKDSGPPADGP >gi568815575f:154233178_154370981|GENSCAN_predicted_CDS_1|1785_bp atgttgctgctgtggccggtcgtagtggctcatgcctgtaatcccagcactttgggaggc cgagtagaattttctgccgtagactgtcagctgtatgagatcaaggaatgtatctggctt ctgtaccacgtggcggaacacaccaggagccccagcgcaaccctcccctccaatgtgcct tcatgccggtccctgtcatccagcgaagacggccccagtggcccttccagcctcgcagat ggaggcctagcccacaacttacaggatagtgtcaggcaccgcatcctctacctctcagag cagctgagagtggagaaggccagtcgggatggcaacactgtgagctacctcaagctggta tccaaagcagaccggcaccaggtgccgcacatccagcaggcctttgagaaggtgaaccag cgcgcctctgccaccatcgcccagatcgagcacaggctccaccagtgtcaccagcagctc caggagctggaggaaggctgcaggcccgagggcttactgctgatggcagaaagcgaccca gccaactgcgagccacccagtgagaaggccctgctttcagagccccccgagccaggtggg gaagacgggccggtcaacctgcctcatgccagcaggcccttcatcttggagagtcgcttc cagagcttacagcaggggacgtgcttagagacagaggatgtggcccagcaacaaaacctg ctgttgcagaaggtaaaggcagagctggaagaagccaagaggttccacatcagcctccag gagtcctatcacagcctaaaggagaggtctctgactgacctgcagctgttgctggagtcc cttcaggaggagaagtgtaggaggagcttcagatctccgagaagatgcacagtgagcggc cgacctcctcagcacaagccagtctcctgggggaccagaaggccccggtccctcagtttc ttcgtgggtcaggaggaaagaaggcttgagaaactcagaaaacgttcattggtagaagag atggcacccaaggaaggcatggggagccttcaaaaaggggagacggagacacggacacac ccagagggagagcgccgtgtggagacagaggctgaggccgacctgccgtggcagaagcca aggaagcccgcagcttcatctccttttgaacttcacagtaccgcaggctgggtggtttat aaagaaaggaagttcaatgtcaaggcgctggcagctggtgaggacctttgtacggcatca tcctatggcggaaggcagaagggcaagagggcgcaagcattgatggaagaacaggtgaat ggtcgcctgcagggacagctgaatgagatttacaacctcaaacacaatctggcctgcagc gaagagagaatggcctatctatcctatgagagagccaaggaaatatgggagatcacggag accttcaagagccgaatatccaagctggagatgctacagcaagtcacccaactggaggca gcggagcacctccaaagccgtcccccgcagatgttgttcaagttcctgagtccgcgcctc tcactggcaaccgtcctcttggtctttgtctccaccttgtgtgcctgcccctcgtcactg atcagctcacgcctgtgcacctgcaccatgctgatgctgatcgggcttggggtcctggcc tggcagaggtggcgcgccatccctgccacagactggcaggaatgggtcccctccaggtgt agactgtactccaaggactctgggcctccagcagatggaccttaa >gi568815575f:154233178_154370981|GENSCAN_predicted_peptide_2|364_aa MAQQWSLQRLAGRHPQDSYEDSTQSSIFTYTNSNSTRGPFEGPNYHIAPRWVYHLTSVWM IFVVIASVFTNGLVLAATMKFKKLRHPLNWILVNLAVADLAETVIASTISVVNQVYGYFV LGHPMCVLEGYTVSLCGITGLWSLAIISWERWMVVCKPFGNVRFDAKLAIVGIAFSWIWA AVWTAPPIFGWSRYWPHGLKTSCGPDVFSGSSYPGVQSYMIVLMVTCCITPLSIIVLCYL QVWLAIRAVAKQQKESESTQKAEKEVTRMVVVMVLAFCFCWGPYAFFACFAAANPGYPFH PLMAALPAFFAKSATIYNPVIYVFMNRQFRNCILQLFGKKVDDGSELSSASKTEVSSVSS VSPA >gi568815575f:154233178_154370981|GENSCAN_predicted_CDS_2|1095_bp atggcccagcagtggagcctccaaaggctcgcaggccgccatccgcaggacagctatgag gacagcacccagtccagcatcttcacctacaccaacagcaactccaccagaggccccttc gaaggcccgaattaccacatcgctcccagatgggtgtaccacctcaccagtgtctggatg atctttgtggtcattgcatccgtcttcacaaatgggcttgtgctggcggccaccatgaag ttcaagaagctgcgccacccgctgaactggatcctggtgaacctggcggtcgctgacctg gcagagaccgtcatcgccagcactatcagcgttgtgaaccaggtctatggctacttcgtg ctgggccaccctatgtgtgtcctggagggctacaccgtctccctgtgtgggatcacaggt ctctggtctctggccatcatttcctgggagagatggatggtggtctgcaagccctttggc aatgtgagatttgatgccaagctggccatcgtgggcattgccttctcctggatctgggct gctgtgtggacagccccgcccatctttggttggagcaggtactggccccacggcctgaag acttcatgcggcccagacgtgttcagcggcagctcgtaccccggggtgcagtcttacatg attgtcctcatggtcacctgctgcatcaccccactcagcatcatcgtgctctgctacctc caagtgtggctggccatccgagcggtggcaaagcagcagaaagagtctgaatccacccag aaggcagagaaggaagtgacgcgcatggtggtggtgatggtcctggcattctgcttctgc tggggaccctacgccttcttcgcatgctttgctgctgccaaccctggctaccccttccac cctttgatggctgccctgccggccttctttgccaaaagtgccactatctacaaccccgtt atctatgtctttatgaaccggcagtttcgaaactgcatcttgcagcttttcgggaagaag gttgacgatggctctgaactctccagcgcctccaaaacggaggtctcatctgtgtcctcg gtatcgcctgcatga >gi568815575f:154233178_154370981|GENSCAN_predicted_peptide_3|614_aa MTQTSMRRTYPELVEHVALMEWIRKRLAISCNTCKVPLSGLASSGNSALASASAISPTPL ESEDPLLRLRRLKSEWRLRAAHYSLSVSKISVNYLLDCIDAIEKAACSLKVMVLKAEHTR SPSATLPSNVPSCRSLSSSEDGPSGPSSLADGGLAHNLQDSVRHRILYLSEQLRVEKASR DGNTVSYLKLVSKADRHQVPHIQQAFEKVNQRASATIAQIEHRLHQCHQQLQELEEGCRP EGLLLMAESDPANCEPPSEKALLSEPPEPGGEDGPVNLPHASRPFILESRFQSLQQGTCL ETEDVAQQQNLLLQKVKAELEEAKRFHISLQESYHSLKERSLTDLQLLLESLQEEKCRLL LQILEACTSDPKGETETRTHPEGERRVETEAEADLPWQKPRKPAASSPFELHSTAGWVVY KERKFNVKALAAGEDLCTASSYGGRQKGKRAQALMEEQVNGRLQGQLNEIYNLKHNLACS EERMAYLSYERAKEIWEITETFKSRISKLEMLQQVTQLEAAEHLQSRPPQMLFKFLSPRL SLATVLLVFVSTLCACPSSLISSRLCTCTMLMLIGLGVLAWQRWRAIPATDWQEWVPSRC RLYSKDSGPPADGP >gi568815575f:154233178_154370981|GENSCAN_predicted_CDS_3|1845_bp atgacccagacttcaatgaggagaacttacccggagctcgtggagcatgtggccctgatg gaatggattcgcaagcggctggccatatcttgcaacacctgcaaggtgcccctgtcaggt ctggcctcctccgggaactcagccctcgcctcagcatccgccattagtccaacccctttg gagtctgaagacccactcctacgtctccggcgtctgaagagcgaatggcgcctgcgagct gcccattattctctcagtgtatcgaagatatcagtcaactatcttctggattgcattgat gctattgagaaggcagcctgcagtctaaaagtcatggttttaaaggcggaacacaccagg agccccagcgcaaccctcccctccaatgtgccttcatgccggtccctgtcatccagcgaa gacggccccagtggcccttccagcctcgcagatggaggcctagcccacaacttacaggat agtgtcaggcaccgcatcctctacctctcagagcagctgagagtggagaaggccagtcgg gatggcaacactgtgagctacctcaagctggtatccaaagcagaccggcaccaggtgccg cacatccagcaggcctttgagaaggtgaaccagcgcgcctctgccaccatcgcccagatc gagcacaggctccaccagtgtcaccagcagctccaggagctggaggaaggctgcaggccc gagggcttactgctgatggcagaaagcgacccagccaactgcgagccacccagtgagaag gccctgctttcagagccccccgagccaggtggggaagacgggccggtcaacctgcctcat gccagcaggcccttcatcttggagagtcgcttccagagcttacagcaggggacgtgctta gagacagaggatgtggcccagcaacaaaacctgctgttgcagaaggtaaaggcagagctg gaagaagccaagaggttccacatcagcctccaggagtcctatcacagcctaaaggagagg tctctgactgacctgcagctgttgctggagtcccttcaggaggagaagtgtagacttttg ctgcaaattctagaagcctgcacttctgacccgaaaggggagacggagacacggacacac ccagagggagagcgccgtgtggagacagaggctgaggccgacctgccgtggcagaagcca aggaagcccgcagcttcatctccttttgaacttcacagtaccgcaggctgggtggtttat aaagaaaggaagttcaatgtcaaggcgctggcagctggtgaggacctttgtacggcatca tcctatggcggaaggcagaagggcaagagggcgcaagcattgatggaagaacaggtgaat ggtcgcctgcagggacagctgaatgagatttacaacctcaaacacaatctggcctgcagc gaagagagaatggcctatctatcctatgagagagccaaggaaatatgggagatcacggag accttcaagagccgaatatccaagctggagatgctacagcaagtcacccaactggaggca gcggagcacctccaaagccgtcccccgcagatgttgttcaagttcctgagtccgcgcctc tcactggcaaccgtcctcttggtctttgtctccaccttgtgtgcctgcccctcgtcactg atcagctcacgcctgtgcacctgcaccatgctgatgctgatcgggcttggggtcctggcc tggcagaggtggcgcgccatccctgccacagactggcaggaatgggtcccctccaggtgt agactgtactccaaggactctgggcctccagcagatggaccttaa >gi568815575f:154233178_154370981|GENSCAN_predicted_peptide_4|581_aa MSRTVSVNEGSWRMCTPALQENVHASPAACHPTSCSSSSEIMSVLFFYIMRYKQSDPENP DNDRFVLAKRLSFVDVATGWLGQGLGVACGMAYTGKYFDRASYRVFCLMSDGESSEGSVW EAMAFASYYSLDNLVAIFDVNRLGHSGALPAEHCINIYQRRCEAFGWNTYVVDGRDVEAL CQVFWQASQVKHKPTAVVAKTFKGRGTPSIEDAESWHAKPMPRERADAIIKLIESQIQTS RNLDPQPPIEDSPEVNITDVRMTSPPDYRVGDKIATRKACGLALAKLGYANNRVVVLDGD TRYSTFSEIFNKEYPERFIECFMAEQNMVSVALGCASRGRTIAFASTFAAFLTRAFDHIR IGGLAESNINIIGSHCGVSVGDDGASQMALEDIAMFRTIPKCTIFYPTDAVSTEHAVALA ANAKGMCFIRTTRPETMVIYTPQERFEIGQAKVLRHCVSDKVTVIGAGITVYEALAAADE LSKQDIFIRVIDLFTIKPLDVATIVSSAKATEGRIITVEDHYPQGGIGEAVCAAVSMDPD IQVHSLAVSGVPQSGKSEELLDMYGISARHIIVAVKCMLLN >gi568815575f:154233178_154370981|GENSCAN_predicted_CDS_4|1746_bp atgtccaggacagtgagtgtgaacgaggggtcctggagaatgtgcacgccagccctgcag gagaatgtccacgccagccctgcagcctgccaccctacatcatgtagcagttcttctgag atcatgtctgtgctgttcttctacatcatgaggtacaagcagtcagatccagagaatccg gacaacgaccgatttgtcctcgcaaagagactgtcgtttgtggatgtggcaacaggatgg ctcggacaaggactgggagttgcatgtggaatggcatatactggcaagtacttcgacagg gccagctaccgggtgttctgcctcatgagtgatggcgagtcctcagaaggctctgtctgg gaggcaatggcctttgcttcctactacagtctggacaatcttgtggcaatctttgatgtg aaccgcctgggacacagtggtgcattgcccgccgagcactgcataaacatctatcagagg cgctgcgaagcctttgggtggaacacttatgtggtggacggccgggacgtggaggcactg tgccaggtattctggcaggcttctcaggtgaagcacaagcccactgctgtggtggccaag accttcaagggccggggcaccccaagtattgaggatgcagaaagttggcatgcaaagcca atgccgagagaaagagcagatgccattatcaaattaattgagagccagatacagaccagc aggaatcttgacccacagccccccattgaggactcacctgaagtcaacatcacagatgta aggatgacctctccacctgattacagagttggtgacaagatagctactcggaaagcatgc ggtctggctctggctaagctgggctacgcgaacaacagagtcgttgtgctggatggtgac accaggtactctactttctctgagatattcaacaaggagtaccctgagcgcttcatcgag tgctttatggctgaacaaaacatggtgagcgtggctctgggctgtgcctcccgtggacgg accattgcttttgctagcacctttgctgcctttctgactcgagcatttgatcacatccgg ataggaggcctcgctgagagcaacatcaacattattggttcccactgtggggtatctgtt ggtgacgatggtgcttcccagatggccctggaggatatagccatgttccgaaccattccc aagtgcacgatcttctacccaactgatgccgtctccacggagcatgctgttgctctggca gccaatgccaaggggatgtgcttcattcggaccacccgaccagaaactatggttatttac accccacaagaacgctttgagatcggacaggccaaggtcctccgccactgtgtcagtgac aaggtcacagttattggagctggaattactgtgtatgaagccttagcagctgctgatgag ctttcgaaacaagatatttttatccgtgtcatcgacctgtttaccattaaacctctggat gtcgccaccatcgtctccagtgcaaaagccacagagggccggatcattacagtggaggat cactacccgcaaggtggcatcggggaagctgtctgcgcagccgtctccatggatcctgac attcaggttcattcgctggcagtgtcgggagtgccccagagtgggaagtccgaggaattg ctggatatgtatggaattagtgccagacatatcatagtggccgtgaaatgcatgttgctg aactaa >gi568815575f:154233178_154370981|GENSCAN_predicted_peptide_5|293_aa MPTAAPSRAHRGDTRAALGRHGRRAALPGRFTCAHGRRPGSGGQVPRPLDVCRPFSGMER LEKRRPVVTVFPRGRGGQASPEIRAREAPVCAVCSLALHGSLSQPWSGSPGPTSPGADGH SHDSSQTLSRPMHPPGRIGALLDLLAPLRVPKELPQSTLGASGPLGPQQLPAAQAGPRGF RAQRRRSLAHPPRLLSAGAEGSCGLGGILVRECPALDTVSRKDTCSPFVGGQAQGVKEGT RGVEVSEGLGLTCAMAAGYGLVVSDAVELREDVARKLSVAPESLASWPLSADV >gi568815575f:154233178_154370981|GENSCAN_predicted_CDS_5|882_bp atgcccacggccgcgccctcccgcgcgcaccgcggcgacacccgggccgccctcggaagg cacggacgtcgcgccgcgctcccggggcgcttcacttgcgcccacggccgacgccccggc agcggcgggcaggtgccacgacctctggacgtttgtcgccccttctcgggaatggaacgc ttagagaagcgcagaccggtggtcactgtcttcccccggggacgaggcggacaggcgtcg cctgagatcagggcccgggaagccccggtctgcgccgtgtgctctctggctcttcatgga agcctttcccagccctggtcggggagccccggtccaacatccccaggagccgatgggcac agccacgacagcagccagaccctgtcccggcccatgcacccacctggacggatcggcgcc ctcctggacttactggctccattgagggtcccaaaggagctgccccagagcaccctaggg gcgtccggacccctcggcccgcagcagctaccagcagcccaggcaggacccaggggcttc cgggcacagaggcgccgctccctggctcatcccccccggctgctgtcagctggggctgag gggtcctgtggcctagggggcatcctggttcgggagtgccctgccctggacaccgtgtcc aggaaggatacatgctcaccctttgttgggggccaggcacagggggtcaaggagggtacg cgtggtgttgaggtctcagaaggtctgggtttgacgtgtgccatggctgcaggctacggc ctggtcgtgtcagacgctgtggagctgagggaggatgtggcgagaaagctgagcgttgcc cccgagagccttgcttcctggcccttgtctgcagatgtctga >gi568815575f:154233178_154370981|GENSCAN_predicted_peptide_6|2576_aa MHRKHNQRPTFRQMQLENVSVALEFLDRESIKLVSIDPDPKAWDRPLPHSRVPSVATLLL TDSKAIVDGNLKLILGLIWTLILHYSISMPMWDEEEDEEAKKQTPKQRLLGWIQNKLPQL PITNFSRDWQSGRALGALVDSCAPGLCPDWDSWDASKPVTNAREAMQQADDWLGIPQVIT PEEIVDPNVDEHSVMTYLSQFPKAKLKPGAPLRPKLNPKKARAYGPGIEPTGNMVKKRAE FTVETRSAGQGEVLVYVEDPAGHQEEAKVTANNDKNRTFSVWYVPEVTGTHKVTVLFAGQ HIAKSPFEVYVDKSQGDASKVTAQGPGLEPSGNIANKTTYFEIFTAGAGTGEVEVVIQDP MGQKGTVEPQLEARGDSTYRCSYQPTMEGVHTVHVTFAGVPIPRSPYTVTVGQACNPSAC RAVGRGLQPKGVRVKETADFKVYTKGAGSGELKVTVKGPKGEERVKQKDLGDGVYGFEYY PMVPGTYIVTITWGGQNIGRSPFEVKVGTECGNQKVRAWGPGLEGGVVGKSADFVVEAIG DDVGTLGFSVEGPSQAKIECDDKGDGSCDVRYWPQEAGEYAVHVLCNSEDIRLSPFMADI RDAPQDFHPDRVKARGPGLEKTGVAVNKPAEFTVDAKHGGKAPLRVQVQDNEGCPVEALV KDNGNGTYSCSYVPRKPVKHTAMVSWGGVSIPNSPFRVNVGAGSHPNKVKVYGPGVAKTG LKAHEPTYFTVDCAEAGQGDVSIGIKCAPGVVGPAEADIDFDIIRNDNDTFTVKYTPRGA GSYTIMVLFADQATPTSPIRVKVEPSHDASKVKAEGPGLSRTGVELGKPTHFTVNAKAAG KGKLDVQFSGLTKGDAVRDVDIIDHHDNTYTVKYTPVQQGPVGVNVTYGGDPIPKSPFSV AVSPSLDLSKIKVSGLGEKVDVGKDQEFTVKSKGAGGQGKVASKIVGPSGAAVPCKVEPG LGADNSVVRFLPREEGPYEVEVTYDGVPVPGSPFPLEAVAPTKPSKVKAFGPGLQGGSAG SPARFTIDTKGAGTGGLGLTVEGPCEAQLECLDNGDGTCSVSYVPTEPGDYNINILFADT HIPGSPFKAHVVPCFDASKVKCSGPGLERATAGEVGQFQVDCSSAGSAELTIEICSEAGL PAEVYIQDHGDGTHTITYIPLCPGAYTVTIKYGGQPVPNFPSKLQVEPAVDTSGVQCYGP GIEGQGVFREATTEFSVDARALTQTGGPHVKARVANPSGNLTETYVQDRGDGMYKVEYTP YEEGLHSVDVTYDGSPVPSSPFQVPVTEGCDPSRVRVHGPGIQSGTTNKPNKFTVETRGA GTGGLGLAVEGPSEAKMSCMDNKDGSCSVEYIPYEAGTYSLNVTYGGHQVPGSPFKVPVH DVTDASKVKCSGPGLSPGMVRANLPQSFQVDTSKAGVAPLQVKVQGPKGLVEPVDVVDNA DGTQTVNYVPSREGPYSISVLYGDEEVPRSPFKVKVLPTHDASKVKASGPGLNTTGVPAS LPVEFTIDAKDAGEGLLAVQITDPEGKPKKTHIQDNHDGTYTVAYVPDVTGRYTILIKYG GDEIPFSPYRVRAVPTGDASKCTVTGAGIGPTIQIGEETVITVDTKAAGKGKVTCTVCTP DGSEVDVDVVENEDGTFDIFYTAPQPGKYVICVRFGGEHVPNSPFQVTALAGDQPSVQPP LRSQQLAPQYTYAQGGQQTWAPERPLVGVNGLDVTSLRPFDLVIPFTIKKGEITGEVRMP SGKVAQPTITDNKDGTVTVRYAPSEAGLHEMDIRYDNMHIPGSPLQFYVDYVNCGHVTAY GPGLTHGVVNKPATFTVNTKDAGEGGLSLAIEGPSKAEISCTDNQDGTCSVSYLPVLPGD YSILVKYNEQHVPGSPFTARVTGDDSMRMSHLKVGSAADIPINISETDLSLLTATVVPPS GREEPCLLKRLRNGHVGISFVPKETGEHLVHVKKNGQHVASSPIPVVISQSEIGDASRVR VSGQGLHEGHTFEPAEFIIDTRDAGYGGLSLSIEGPSKVDINTEDLEDGTCRVTYCPTEP GNYIINIKFADQHVPGSPFSVKVTGEGRVKESITRRRRAPSVANVGSHCDLSLKIPEISI QDMTAQVTSPSGKTHEAEIVEGENHTYCIRFVPAEMGTHTVSVKYKGQHVPGSPFQFTVG PLGEGGAHKVRAGGPGLERAEAGVPAEFSIWTREAGAGGLAIAVEGPSKAEISFEDRKDG SCGVAYVVQEPGDYEVSVKFNEEHIPDSPFVVPVASPSGDARRLTVSSLQESGLKVNQPA SFAVSLNGAKGAIDAKVHSPSGALEECYVTEIDQDKYAVRFIPRENGVYLIDVKFNGTHI PGSPFKIRVGEPGHGGDPGLVSAYGAGLEGGVTGNPAEFVVNTSNAGAGALSVTIDGPSK VKMDCQECPEGYRVTYTPMAPGSYLISIKYGGPYHIGGSPFKAKVTGPRLVSNHSLHETS SVFVDSLTKATCAPQHGAPGPGPADASKVVAKGLGLSKAYVGQKSSFTVDCSKAGNNMLL VGVHGPRTPCEEILVKHVGSRLYSVSYLLKDKGEYTLVVKWGDEHIPGSPYRVVVP >gi568815575f:154233178_154370981|GENSCAN_predicted_CDS_6|7731_bp atgcaccgcaagcacaaccagcggcccactttccgccaaatgcagcttgagaacgtgtcg gtggcgctcgagttcctggaccgcgagagcatcaaactggtgtccatcgaccctgacccc aaagcctgggatcggcccctcccgcacagccgtgtgcccagcgtggccactcttctactc acagacagcaaggccatcgtggacgggaacctgaagctgatcctgggcctcatctggacc ctgatcctgcactactccatctccatgcccatgtgggacgaggaggaggatgaggaggcc aagaagcagacccccaagcagaggctcctgggctggatccagaacaagctgccgcagctg cccatcaccaacttcagccgggactggcagagcggccgggccctgggcgccctggtggac agctgtgccccgggcctgtgtcctgactgggactcttgggacgccagcaagcccgttacc aatgcgcgagaggccatgcagcaggcggatgactggctgggcatcccccaggtgatcacc cccgaggagattgtggaccccaacgtggacgagcactctgtcatgacctacctgtcccag ttccccaaggccaagctgaagccaggggctcccttgcggcccaaactgaacccgaagaaa gcccgtgcctacgggccaggcatcgagcccacaggcaacatggtgaagaagcgggcagag ttcactgtggagaccagaagtgctggccagggagaggtgctggtgtacgtggaggacccg gccggacaccaggaggaggcaaaagtgaccgccaataacgacaagaaccgcaccttctcc gtctggtacgtccccgaggtgacggggactcataaggttactgtgctctttgctggccag cacatcgccaagagccccttcgaggtgtacgtggataagtcacagggtgacgccagcaaa gtgacagcccaaggtcccggcctggagcccagtggcaacatcgccaacaagaccacctac tttgagatctttacggcaggagctggcacgggcgaggtcgaggttgtgatccaggacccc atgggacagaagggcacggtagagcctcagctggaggcccggggcgacagcacataccgc tgcagctaccagcccaccatggagggcgtccacaccgtgcacgtcacgtttgccggcgtg cccatccctcgcagcccctacactgtcactgttggccaagcctgtaacccgagtgcctgc cgggcggttggccggggcctccagcccaagggtgtgcgggtgaaggagacagctgacttc aaggtgtacacaaagggcgctggcagtggggagctgaaggtcaccgtgaagggccccaag ggagaggagcgcgtgaagcagaaggacctgggggatggcgtgtatggcttcgagtattac cccatggtccctggaacctatatcgtcaccatcacgtggggtggtcagaacatcgggcgc agtcccttcgaagtgaaggtgggcaccgagtgtggcaatcagaaggtacgggcctggggc cctgggctggagggcggcgtcgttggcaagtcagcagactttgtggtggaggctatcggg gacgacgtgggcacgctgggcttctcggtggaagggccatcgcaggctaagatcgaatgt gacgacaagggcgacggctcctgtgatgtgcgctactggccgcaggaggctggcgagtat gccgttcacgtgctgtgcaacagcgaagacatccgcctcagccccttcatggctgacatc cgtgacgcgccccaggacttccacccagacagggtgaaggcacgtgggcctggattggag aagacaggtgtggccgtcaacaagccagcagagttcacagtggatgccaagcacggtggc aaggccccacttcgggtccaagtccaggacaatgaaggctgccctgtggaggcgttggtc aaggacaacggcaatggcacttacagctgctcctacgtgcccaggaagccggtgaagcac acagccatggtgtcctggggaggcgtcagcatccccaacagccccttcagggtgaatgtg ggagctggcagccaccccaacaaggtcaaagtatacggccccggagtagccaagacaggg ctcaaggcccacgagcccacctacttcactgtggactgcgccgaggctggccagggggac gtcagcatcggcatcaagtgtgcccctggagtggtaggccccgccgaagctgacatcgac ttcgacatcatccgcaatgacaatgacaccttcacggtcaagtacacgccccggggggct ggcagctacaccattatggtcctctttgctgaccaggccacgcccaccagccccatccga gtcaaggtggagccctctcatgacgccagtaaggtgaaggccgagggccctggcctcagt cgcactggtgtcgagcttggcaagcccacccacttcacagtaaatgccaaagctgctggc aaaggcaagctggacgtccagttctcaggactcaccaagggggatgcagtgcgagatgtg gacatcatcgaccaccatgacaacacctacacagtcaagtacacgcctgtccagcagggt ccagtaggcgtcaatgtcacttatggaggggatcccatccctaagagccctttctcagtg gcagtatctccaagcctggacctcagcaagatcaaggtgtctggcctgggagagaaggtg gacgttggcaaagaccaggagttcacagtcaaatcaaagggtgctggtggtcaaggcaaa gtggcatccaagattgtgggcccctcgggtgcagcggtgccctgcaaggtggagccaggc ctgggggctgacaacagtgtggtgcgcttcctgccccgtgaggaagggccctatgaggtg gaggtgacctatgacggcgtgcccgtgcctggcagcccctttcctctggaagctgtggcc cccaccaagcctagcaaggtgaaggcgtttgggccggggctgcagggaggcagtgcgggc tcccccgcccgcttcaccatcgacaccaagggcgccggcacaggtggcctgggcctgacg gtggagggcccctgtgaggcgcagctcgagtgcttggacaatggggatggcacatgttcc gtgtcctacgtgcccaccgagcccggggactacaacatcaacatcctcttcgctgacacc cacatccctggctccccattcaaggcccacgtggttccctgctttgacgcatccaaagtc aagtgctcaggccccgggctggagcgggccaccgctggggaggtgggccaattccaagtg gactgctcgagcgcgggcagcgcggagctgaccattgagatctgctcggaggcggggctt ccggccgaggtgtacatccaggaccacggtgatggcacgcacaccattacctacattccc ctctgccccggggcctacaccgtcaccatcaagtacggcggccagcccgtgcccaacttc cccagcaagctgcaggtggaacctgcggtggacacttccggtgtccagtgctatgggcct ggtattgagggccagggtgtcttccgtgaggccaccactgagttcagtgtggacgcccgg gctctgacacagaccggagggccgcacgtcaaggcccgtgtggccaacccctcaggcaac ctgacggagacctacgttcaggaccgtggcgatggcatgtacaaagtggagtacacgcct tacgaggagggactgcactccgtggacgtgacctatgacggcagtcccgtgcccagcagc cccttccaggtgcccgtgaccgagggctgcgacccctcccgggtgcgtgtccacgggcca ggcatccaaagtggcaccaccaacaagcccaacaagttcactgtggagaccaggggagct ggcacgggcggcctgggcctggctgtagagggcccctccgaggccaagatgtcctgcatg gataacaaggacggcagctgctcggtcgagtacatcccttatgaggctggcacctacagc ctcaacgtcacctatggtggccatcaagtgccaggcagtcctttcaaggtccctgtgcat gatgtgacagatgcgtccaaggtcaagtgctctgggcccggcctgagcccaggcatggtt cgtgccaacctccctcagtccttccaggtggacacaagcaaggctggtgtggccccattg caggtcaaagtgcaagggcccaaaggcctggtggagccagtggacgtggtagacaacgct gatggcacccagaccgtcaattatgtgcccagccgagaagggccctacagcatctcagta ctgtatggagatgaagaggtaccccggagccccttcaaggtcaaggtgctgcctactcat gatgccagcaaggtgaaggccagtggccccgggctcaacaccactggcgtgcctgccagc ctgcccgtggagttcaccatcgatgcaaaggacgccggggagggcctgctggctgtccag atcacggatcccgaaggcaagccgaagaagacacacatccaagacaaccatgacggcacg tatacagtggcctacgtgccagacgtgacaggtcgctacaccatcctcatcaagtacggt ggtgacgagatccccttctccccgtaccgcgtgcgtgccgtgcccaccggggacgccagc aagtgcactgtcacaggtgctggcatcggccccaccattcagattggggaggagacggtg atcactgtggacactaaggcggcaggcaaaggcaaagtgacgtgcaccgtgtgcacgcct gatggctcagaggtggatgtggacgtggtggagaatgaggacggcactttcgacatcttc tacacggccccccagccgggcaaatacgtcatctgtgtgcgctttggtggcgagcacgtg cccaacagccccttccaagtgacggctctggctggggaccagccctcggtgcagccccct ctacggtctcagcagctggccccacagtacacctacgcccagggcggccagcagacttgg gccccggagaggcccctggtgggtgtcaatgggctggatgtgaccagcctgaggcccttt gaccttgtcatccccttcaccatcaagaagggcgagatcacaggggaggttcggatgccc tcaggcaaggtggcgcagcccaccatcactgacaacaaagacggcaccgtgaccgtgcgg tatgcacccagcgaggctggcctgcacgagatggacatccgctatgacaacatgcacatc ccaggaagccccttgcagttctatgtggattacgtcaactgtggccatgtcactgcctat gggcctggcctcacccatggagtagtgaacaagcctgccaccttcaccgtcaacaccaag gatgcaggagaggggggcctgtctctggccattgagggcccgtccaaagcagaaatcagc tgcactgacaaccaggatgggacatgcagcgtgtcctacctgcctgtgctgccgggggac tacagcattctagtcaagtacaatgaacagcacgtcccaggcagccccttcactgctcgg gtcacaggtgacgactccatgcgtatgtcccacctaaaggtcggctctgctgccgacatc cccatcaacatctcagagacggatctcagcctgctgacggccactgtggtcccgccctcg ggccgggaggagccctgtttgctgaagcggctgcgtaatggccacgtggggatttcattc gtgcccaaggagacgggggagcacctggtgcatgtgaagaaaaatggccagcacgtggcc agcagccccatcccggtggtgatcagccagtcggaaattggggatgccagtcgtgttcgg gtctctggtcagggccttcacgaaggccacacctttgagcctgcagagtttatcattgat acccgcgatgcaggctatggtgggctcagcctgtccattgagggccccagcaaggtggac atcaacacagaggacctggaggacgggacgtgcagggtcacctactgccccacagagcca ggcaactacatcatcaacatcaagtttgccgaccagcacgtgcctggcagccccttctct gtgaaggtgacaggcgagggccgggtgaaagagagcatcacccgcaggcgtcgggctcct tcagtggccaacgttggtagtcattgtgacctcagcctgaaaatccctgaaattagcatc caggatatgacagcccaggtgaccagcccatcgggcaagacccatgaggccgagatcgtg gaaggggagaaccacacctactgcatccgctttgttcccgctgagatgggcacacacaca gtcagcgtgaagtacaagggccagcacgtgcctgggagccccttccagttcaccgtgggg cccctaggggaagggggagcccacaaggtccgagctgggggccctggcctggagagagct gaagctggagtgccagccgaattcagtatctggacccgggaagctggtgctggaggcctg gccattgctgtcgagggccccagcaaggctgagatctcttttgaggaccgcaaggacggc tcctgtggtgtggcttatgtggtccaggagccaggtgactacgaagtctcagtcaagttc aacgaggaacacattcccgacagccccttcgtggtgcctgtggcttctccgtctggcgac gcccgccgcctcactgtttctagccttcaggagtcagggctaaaggtcaaccagccagcc tcttttgcagtcagcctgaacggggccaagggggcgatcgatgccaaggtgcacagcccc tcaggagccctggaggagtgctatgtcacagaaattgaccaagataagtatgctgtgcgc ttcatccctcgggagaatggcgtttacctgattgacgtcaagttcaacggcacccacatc cctggaagccccttcaagatccgagttggggagcctgggcatggaggggacccaggcttg gtgtctgcttacggagcaggtctggaaggcggtgtcacagggaacccagctgagttcgtc gtgaacacgagcaatgcgggagctggtgccctgtcggtgaccattgacggcccctccaag gtgaagatggattgccaggagtgccctgagggctaccgcgtcacctatacccccatggca cctggcagctacctcatctccatcaagtacggcggcccctaccacattgggggcagcccc ttcaaggccaaagtcacaggcccccgtctcgtcagcaaccacagcctccacgagacatca tcagtgtttgtagactctctgaccaaggccacctgtgccccccagcatggggccccgggt cctgggcctgctgacgccagcaaggtggtggccaagggcctggggctgagcaaggcctac gtaggccagaagagcagcttcacagtagactgcagcaaagcaggcaacaacatgctgctg gtgggggttcatggcccaaggaccccctgcgaggagatcctggtgaagcacgtgggcagc cggctctacagcgtgtcctacctgctcaaggacaagggggagtacacactggtggtcaaa tggggggacgagcacatcccaggcagcccctaccgcgttgtggtgccctga