GENSCAN 1.0 Date run: 24-Oct-119 Time: 21:54:09 Sequence gi568815593r:132441814_132643478 : 201665 bp : 42.85% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.29 PlyA - 18 13 6 1.05 1.28 Term - 2527 2380 148 0 1 61 45 160 0.922 5.39 1.27 Intr - 8026 7873 154 2 1 98 62 60 0.709 2.61 1.26 Intr - 11388 11180 209 2 2 0 75 373 0.988 25.00 1.25 Intr - 12345 12269 77 0 2 104 90 48 0.976 3.99 1.24 Intr - 13785 13672 114 0 0 20 85 114 0.885 4.02 1.23 Intr - 14733 14672 62 1 2 90 70 55 0.684 1.43 1.22 Intr - 18878 18727 152 1 2 -5 -5 199 0.394 0.19 1.21 Intr - 21268 21012 257 1 2 101 53 124 0.628 5.52 1.20 Intr - 22525 22234 292 0 1 2 -23 244 0.152 0.91 1.19 Intr - 23044 22938 107 1 2 89 80 111 0.673 8.49 1.18 Intr - 23345 23209 137 0 2 58 18 79 0.677 -2.73 1.17 Intr - 23721 23545 177 1 0 135 41 107 0.711 9.67 1.16 Intr - 24501 24436 66 1 0 84 90 36 0.683 1.36 1.15 Intr - 26163 25910 254 2 2 130 75 -5 0.139 -1.45 1.14 Intr - 36474 36394 81 2 0 69 77 86 0.168 3.43 1.13 Intr - 42262 42017 246 0 0 100 11 149 0.037 4.05 1.12 Intr - 42684 42443 242 0 2 96 36 247 0.138 15.73 1.11 Intr - 43903 43854 50 2 2 126 98 88 0.998 11.08 1.10 Intr - 44560 44438 123 2 0 92 81 94 0.994 8.74 1.09 Intr - 44873 44744 130 2 1 79 99 183 0.947 17.85 1.08 Intr - 45031 44910 122 2 2 94 86 38 0.383 3.49 1.07 Intr - 45317 45141 177 0 0 131 63 269 0.980 27.67 1.06 Intr - 46290 46113 178 0 1 62 89 79 0.392 3.97 1.05 Intr - 48786 48732 55 2 1 55 105 65 0.373 2.96 1.04 Intr - 49858 49541 318 0 0 92 109 100 0.122 6.65 1.03 Intr - 53578 53440 139 2 1 56 69 134 0.173 6.90 1.02 Intr - 54482 54293 190 2 1 101 24 40 0.088 -2.86 1.01 Init - 55731 54763 969 0 0 43 76 372 0.176 24.86 1.00 Prom - 65585 65546 40 -5.05 2.00 Prom + 66398 66437 40 -5.35 2.01 Sngl + 73328 74074 747 1 0 49 43 542 0.991 41.73 2.02 PlyA + 74092 74097 6 -4.04 3.00 Prom + 74174 74213 40 -5.25 3.01 Init + 74432 77523 3092 1 2 44 53 1093 0.479 92.87 3.02 Intr + 84178 84268 91 1 1 43 76 75 0.180 0.88 3.03 Term + 85382 85609 228 1 0 68 49 137 0.259 3.45 3.04 PlyA + 86854 86859 6 1.05 4.03 PlyA - 87094 87089 6 1.05 4.02 Term - 87479 87144 336 2 0 47 41 207 0.258 5.49 4.01 Init - 95192 95172 21 2 0 120 36 41 0.252 0.57 4.00 Prom - 95671 95632 40 -2.95 5.05 PlyA - 97624 97619 6 1.05 5.04 Term - 100096 99998 99 1 0 114 53 102 0.998 6.45 5.03 Intr - 100330 100202 129 1 0 27 106 111 0.987 6.77 5.02 Intr - 101313 101281 33 0 0 125 101 19 0.955 4.40 5.01 Init - 101665 101522 144 1 0 85 76 158 0.932 13.24 5.00 Prom - 102908 102869 40 -5.85 6.00 Prom + 112706 112745 40 -6.95 6.01 Init + 115512 115640 129 2 0 90 127 107 0.121 15.00 6.02 Intr + 115789 115932 144 0 0 41 22 137 0.755 1.96 6.03 Intr + 117471 117554 84 2 0 84 90 21 0.636 1.00 6.04 Intr + 133964 134115 152 1 2 91 56 144 0.989 9.54 6.05 Intr + 137504 137689 186 2 0 72 76 162 0.996 11.28 6.06 Intr + 138049 138253 205 1 1 51 88 250 0.960 19.58 6.07 Intr + 145749 145877 129 2 0 77 76 153 0.939 12.97 6.08 Intr + 146111 146276 166 1 1 81 53 138 0.997 8.21 6.09 Intr + 146874 147019 146 1 2 48 33 145 0.801 4.08 6.10 Intr + 147818 148024 207 1 0 51 50 186 0.801 9.45 6.11 Intr + 150064 150221 158 0 2 76 100 41 0.985 1.99 6.12 Intr + 153056 153231 176 2 2 48 107 188 0.975 15.36 6.13 Intr + 153760 153997 238 2 1 49 94 119 0.935 4.35 6.14 Intr + 161487 161676 190 0 1 65 111 190 0.989 17.67 6.15 Intr + 162107 162233 127 1 1 27 90 154 0.999 8.83 6.16 Intr + 162993 163186 194 1 2 43 116 210 0.999 17.59 6.17 Intr + 164606 165268 663 1 0 14 30 268 0.903 4.63 6.18 Intr + 166802 166912 111 1 0 91 69 38 0.686 1.86 6.19 Intr + 167304 167396 93 2 0 41 86 95 0.912 3.84 6.20 Intr + 167470 167583 114 0 0 57 100 121 0.998 9.92 6.21 Intr + 174190 174317 128 0 2 69 91 103 0.999 7.26 6.22 Intr + 176257 176481 225 1 0 105 100 260 0.596 25.08 6.23 Intr + 195302 195387 86 2 2 64 113 23 0.358 0.94 6.24 Intr + 196268 196414 147 0 0 72 70 217 0.980 17.59 6.25 Intr + 198215 198303 89 0 2 112 115 -15 0.973 2.37 6.26 Intr + 198859 198992 134 0 2 51 80 144 0.973 8.42 6.27 Term + 199513 199717 205 1 1 9 47 231 0.616 6.96 6.28 PlyA + 200056 200061 6 1.05 7.02 PlyA - 200923 200918 6 -0.45 7.01 Term - 201594 201409 186 0 0 16 48 193 0.843 4.61 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 122494 122889 396 0 0 -22 38 246 0.883 2.69 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:132441814_132643478|GENSCAN_predicted_peptide_1|1741_aa MFGEILQQISKRGHYGYDLSKETSSYKDNKMGCDSEGTAGDLSHLRGYPNSSDPTCLRTV PSFPACTPSNPLACRTPPSFPKGHPAGRSILHTPTLSMNTIRRGLGRCIQVGLFRGRFSP APAGTSSWPLAGNTGRPPVPRPPPLRRPQRSQRGSVPGNSSISLLPPRLQETETQAPPLR RGPAEPGDGADPHWMAGPKPSPTRPLRPAPPPAARPRRRWEPSECLRAEPRRAASVQPLS TTRASNGRAAGLSQVSGCNGWQCPGKGRNAAFPCGNLQAGAFSVHPLCPCETERKGKVGG EQGSAEVFAESSAPRLLLAPSRREAGQLGAGCSPRLTPRLGGAGMWKLARKDCAPSPSNT PPPRPPPSSGRGCGAWVTWGRRTPHHGFCENRDEGFEHFREPYYFWTWNLEGMLGSLSAL GTVAPSLGSRLHSQPGGVPGRQTEKGWGVILVTIRPCNGDLENYSASVARVEHSALQSLP ITANHQGRATIILWLKNRGPASLGGCASSGQVGISPGPAAPDSTRGRVGFLPSSLPQGSP CVAGTLARDRRIAPAAEAWVSDSAPKSVVSDLTQPYVYVCIVQEEMIFQIPWKHAAKHGW DINKDACLFRSWAIHTGRYKAGEKEPDPKTWKANFRCAMNSLPDIEEVKDQSRNKGSSAV RVYRMLPPLTKNQRKERKSKSSRDAKSKAKRKVSVVLSSQAFGHLWARVSSGRNAKSCGD SSPDTFSDGLSSSTLPDDHSSYTVPGYMQDLEVEQALTPALSPCAVSSTLPDWHIPVEVV PDSTSDLYNFQVSPMPSTSEATTDEDEEGKLPEDIMKLLEQSEWQPTNVDGKGYLLNEPG VQPTSVYGDFSCKEEPEIDSPGGKKALDPYGFLDEGEPRRDGESLGAGPGSRAVKHLGGY WAESTACLHRSEEHGCHLAGQPADPSPVALHPGHSLCTVAGPLGPSYSSRQAGPGIMVDM VQRSWTSVGPSTAKCDPTAKPGNQLSPPNQYGALVPTHIIYLCPESRTLSPPSQIALCPL ESTGWPLSSLPTLQLGGGWEEVKKEEVKTKYKGYREVRSTNLNVSAAHAVRKECASCLWS MPWGCQGHWCKAPSGLGAVLTSAISLHPCNPRKGVAHRAQCPAYLEEAVKPIYVGNGGKA RARESVPCGTPPPITSPLPAHTGDSQSIPQAGGTTQALWMHSEQHRPGPRVTDFTSLREA DIRQIITWISEKHSSYERSLQEICRFHESFFGGVFQAATKRWLCQLLLSVFVPGGAEGQE KVFQPASTLLPGHPQRAGTGLVEPTEQGVVAMPIPGSPGSEAVRLGPLVILMQAVDLTMA VVASETIRTARAHLRHGLILSKSLPATDAREGRGVPETKRRHTSPTRNKASSCSSLGPSR GQRLWDLHSFPLSGEGRSALWGVRAQPPDKLEKQDLRARINGECEAQHQVQEARGSRILA VSSRQHGCTPRGRQDGRKHYTGDGDRSTAGASGEEEFEDSLVERMEGQICVLRNTQIKQK RGLTGTRRDERQPGTEAATKKGLKNPSKMTGVLIMEEQIALLNNSKVYITLDEGMVNLKD GCKFFDISPMERKASNKERWRDREREKKKKAEGKRKKEERRGKKKKNKRKRRRKKKKKKE DDDNDNNNNNKKQPPPPLPPPGLSLYPGSQRKVSWAFPLSSCQHLAPWSSLDSFQGYVGQ SQTRCGLIVVLPCSCDPQRPLVQHKLRQRGPPGSLYWCEAQPARGLGIGGPELIVVAPAA G >gi568815593r:132441814_132643478|GENSCAN_predicted_CDS_1|5226_bp atgttcggggaaatcctccaacaaataagcaaaagaggacactacggatatgatctcagt aaggagacctcatcgtataaggacaataagatgggatgtgacagtgaggggacagctggg gacctcagtcatttgaggggatatcccaattcctcagatcccacctgcctgcgcacagtc ccctctttccccgcgtgcacaccctcaaaccccttagcttgtagaactcctccctcattc cctaaaggccaccctgcaggcaggtccatccttcacacccccaccctcagcatgaacacc atccggagaggcctgggcaggtgcattcaggtggggcttttccggggccggttttccccg gctcccgctggaacttccagctggccactggccggtaacactggacgcccgcccgtcccc cggcccccgccgctgcgcagacctcagcgctcccagcgtggttcggttcccggaaactcc tcgatttccctgttgccgccgcgtttgcaagaaaccgaaacccaggctccgccccttcgg cgcggccccgcggaacccggcgatggggctgacccccactggatggccggacccaagccc tcccctacccggcccctgcgccccgccccaccgccagctgcaaggcccaggcggaggtgg gagccatctgagtgcctgcgcgccgaaccccggcgggcagcatccgtccagccgctcagc accacccgcgcctccaacgggagagccgcaggtctatcccaagtctcaggctgcaacgga tggcagtgccccggcaagggacgcaacgcggccttcccctgcggcaatttacaagcgggc gccttttctgtgcatcccctatgtccctgcgaaaccgagcggaaggggaaggtggggggg gagcagggaagtgcagaggtgttcgcggaaagctctgcaccccgattgttgctagctcca tctcgcagggaagccggccagcttggagccggctgctcaccccgactcaccccccgcctg ggcggagcaggaatgtggaagttggcccgaaaggactgtgccccctccccgtcaaacacc ccccccccgcgtcccccaccaagttctggccggggctgtggagcgtgggtcacctggggg cgaaggactccacatcacggtttctgtgaaaatcgagatgaaggctttgagcatttcaga gagccctactacttctggacctggaacctggaaggcatgctggggagtttgtctgctttg gggaccgtggccccctctctgggtagcaggctccacagccagccagggggagtcccaggg aggcagaccgaaaaggggtggggtgtcatcctggtcactattagaccctgcaacggcgac cttgaaaactactcagcgtctgttgcccgagtggagcatagtgctttacaatctcttccc atcacagcaaaccatcaaggtagggctactattattttatggttgaaaaacagaggtcct gcgtcccttgggggctgtgccagcagcggccaagttgggatttcccctggtccagcagcc ccagacagcacacggggcagggtaggctttctgccttcttcacttccccagggcagcccc tgcgtcgccgggaccctcgcgcgcgaccgccgaatcgctcctgcagcagaggcctgggtc tcagactcagccccaaagtctgtggtctctgacctgacacagccttatgtgtatgtgtgt attgttcaggaggagatgatcttccagatcccatggaagcatgctgccaagcatggctgg gacatcaacaaggatgcctgtttgttccggagctgggccattcacacaggccgatacaaa gcaggggaaaaggagccagatcccaagacgtggaaggccaactttcgctgtgccatgaac tccctgccagatatcgaggaggtgaaagaccagagcaggaacaagggcagctcagctgtg cgagtgtaccggatgcttccacctctcaccaagaaccagagaaaagaaagaaagtcgaag tccagccgagatgctaagagcaaggccaagaggaaggtgagtgtggtcctaagcagccag gcctttggtcacctgtgggccagggtgagcagtggaagaaatgctaagtcatgtggggat tccagccctgataccttctctgatggactcagcagctccactctgcctgatgaccacagc agctacacagttccaggctacatgcaggacttggaggtggagcaggccctgactccagca ctgtcgccatgtgctgtcagcagcactctccccgactggcacatcccagtggaagttgtg ccggacagcaccagtgatctgtacaacttccaggtgtcacccatgccctccacctctgaa gctacaacagatgaggatgaggaagggaaattacctgaggacatcatgaagctcttggag cagtcggagtggcagccaacaaacgtggatgggaaggggtacctactcaatgaacctgga gtccagcccacctctgtctatggagactttagctgtaaggaggagccagaaattgacagc ccagggggtaagaaggccctggatccttatggcttcttagatgagggagaaccacgtagg gatggagaaagcttgggggcagggccagggagcagggcggtaaagcatctggggggatat tgggctgagtctacagcgtgtcttcacagatctgaagaacatggatgccacctggctgga cagcctgctgaccccagtccggttgccctccatccaggccattccctgtgcaccgtagca gggcccctgggcccctcttattcctctaggcaagcaggacctggcatcatggtggatatg gtgcagagaagctggacttctgtgggcccctcaacagccaagtgtgaccccactgccaaa ccaggaaaccaactctcaccacctaatcagtatggagccttggttccaacccacatcatc tatctgtgcccagaatccagaaccctgtcccctccatcccagatagccctgtgccctctg gaatccacaggctggcccctcagtagcctccctaccttgcagttgggtggggggtgggag gaggtcaagaaagaggaagtgaaaactaaatacaagggctacagagaagtccggtccaca aacctcaatgtttcagcagcacacgctgtgagaaaggaatgtgcaagctgtttgtggagc atgccttgggggtgccaaggccactggtgcaaagcaccttcgggtctgggagctgtgctc acatctgccatctcattacatccttgcaaccctcgcaaaggtgtggcccacagagctcag tgccctgcctatctggaagaggctgtgaagcccatctatgtaggtaacggaggcaaagca agggctagggagagtgtgccatgtgggacacctccccctatcacctccccactgcctgca cacactggggacagtcaaagcattcctcaggctgggggcactacccaggcactgtggatg cacagtgaacaacacagaccaggtccacgcgtcacagactttacttccctgagggaggca gacattaggcaaataatcacatggatctctgaaaaacatagctcctacgagaggtcgctt caggaaatctgccgcttccatgagagcttctttggtggtgtcttccaagctgctaccaag cgatggctttgccagctgttgctttcagtgtttgtgcctggaggagctgaggggcaggaa aaagtgttccagccagcaagcaccctgctccctgggcaccctcagagggcaggtactgga ctggtagaacccactgagcagggagttgttgcaatgccgattcctggctctccaggctct gaggccgtacgtttgggccctttggtgattctgatgcaggctgtggacctcaccatggca gtcgtggcctcagagaccatcagaacagctagagcacacctgaggcacggcctcatcctc tccaagtcacttcctgccacagatgctcgggaaggcagaggggtgcctgagacaaagagg agacacacttctcccacgagaaataaagcaagcagctgttcctctcttgggcccagcagg ggtcagaggctgtgggaccttcactccttccctctcagtggagagggcagatctgctctt tggggtgtgagggcacagcctcctgacaagctggagaagcaggatttaagagctagaatc aacggagaatgtgaggcccagcatcaggttcaagaagcaaggggatcaagaatcttggct gtaagcagcagacagcatggctgtactccacggggaaggcaggatggcaggaagcattat acaggtgatggagacaggagcacagcaggagccagtggagaagaagagtttgaagattcc ctggttgagagaatggaagggcagatatgtgtgcttcgaaatacacaaattaaacaaaaa cgagggctgactgggaccaggagagatgaaaggcagccagggacagaggcggccacgaag aaaggtttaaagaatcccagcaaaatgactggggtcctcattatggaagaacaaatagct ttacttaataattccaaggtatatatcactctggatgagggtatggtgaatttaaaagat ggttgcaaattctttgacatttctccaatggagaggaaagcctccaacaaggagagatgg agagacagagagagggagaagaaaaagaaagcagaaggaaaaaggaagaaggaagaaaga agagggaagaagaagaagaacaagaggaagaggaggaggaagaagaagaagaagaaggag gatgacgacaacgacaacaacaacaacaacaagaagcagccaccaccgccgctgccacct ccaggactctcactctatccaggaagtcaacgcaaagtctcttgggccttccctttatcc agctgccaacacttagcaccctggtcttccttggacagtttccaaggctacgttgggcag tcccaaacaagatgtggtcttattgttgtcttaccttgcagctgtgacccccagaggcct ctagttcagcataagctgaggcaaagggggcccccaggttccctctactggtgtgaagcc cagccggcaaggggactggggatcggcggcccagagttgattgttgtggccccagcagca ggatga >gi568815593r:132441814_132643478|GENSCAN_predicted_peptide_2|248_aa MELKTKARELREECRSLRSQCDQLEERVSAMEDEMNEMKREGKFREKIIKRNEQSLQEIW DYVKRPNLRLIGVPESDVENGTKLENTLQDIIQENFPNLARQANVQIQEIQRTPQRYSSR RATPRHIIVRFTKVEMKEKMLRAAREKGRVTLKGKPIRLTADLLAETLQARREWGPIFNI LKEKNFQPRISYPAKLSFISEGEIKYFIDKQMLRDFVTTRPALKELLKEALNMERNNRYQ PLQNHAKM >gi568815593r:132441814_132643478|GENSCAN_predicted_CDS_2|747_bp atggagctgaaaaccaaggctcgagaactacgtgaagaatgcagaagcctcaggagccaa tgcgatcaactggaagaaagggtatcagcgatggaagatgaaatgaatgaaatgaagcga gaagggaagtttagagaaaaaataataaaaagaaatgagcaaagcctccaagaaatatgg gactatgtgaaaagaccaaatctacgtctgattggtgtacctgaaagtgatgtggagaat ggaaccaagttggaaaacactctgcaggatattatccaggagaacttccccaatctagca aggcaggccaacgttcagattcaggaaatacagagaacgccacaaagatactcctcgaga agagcaactccaagacacataattgtcagattcaccaaagttgaaatgaaggaaaaaatg ttaagggcagccagagagaaaggtcgggttaccctcaaaggaaagcccatcagactaaca gcggatctcttggcagaaactctacaagccagaagagagtgggggccaatattcaacatt cttaaagaaaagaattttcaacccagaatttcatatccagccaaactaagcttcataagt gaaggagaaataaaatactttatagacaagcaaatgctgagagattttgtcaccaccagg cctgccctaaaagagctcctgaaggaagcgctaaacatggaaaggaacaaccggtaccag ccgctgcaaaatcatgccaaaatgtaa >gi568815593r:132441814_132643478|GENSCAN_predicted_peptide_3|1136_aa MVKGSIQQEELTILNIYAPNTGAPRFIKQVLSDLQRDLDSHTLIMGDFNTPLSTLDRSTR QKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSTPHHTYSKIDHIVGSKALLSKCK RTEIITNYLSDHSAIKLELRIKNLTQSRSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNE NKDTTYQNLWDAFKAVCRGKFIALNAYKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQ EITKIRAELKERETQKTLQKINESRSWFFERINKIDRPLARLIKKKREKNQIDTIKNDKG DITTNPTEIQTTIREYYKHLYANKLENLEEMDTFLDTYTLPRLNQEEVESLNRPITGSEI VAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIP KPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRK SINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDKPT ANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLS LFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELP FTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAIL PKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLY YKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWE NWLAICRKLKLDPFLTPYTKINSRWIKDLNIRPKTIKTLEENLGITIQDIGVGKDFMSKT PKAMVTKAKIDKWDLIKLKSFCTAKETTIRVNRQPTTWEKIFATYSSDKGLISRIYNELK QIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRYHLTPVR MAIIKKSGNNSLGTEQDLIKKKKEEEEEEKKSHSILPSRGRADPGDGQLGCVNGTQWKGL PSVGCRTGALLFECPDEPHTDLTMLYQERVSASHAQSCTRELANTLAKAFHLHVSK >gi568815593r:132441814_132643478|GENSCAN_predicted_CDS_3|3411_bp atggtaaagggatcaattcaacaagaagagctaactatcctaaatatttatgcaccgaat acaggagcacccagattcataaagcaagtcctgagtgacctacaaagagacttagactcc cacacattaataatgggagactttaacaccccactgtcaacattagacagatcaacgaga cagaaagtcaacaaggatacccaggaattgaactcagctctgcaccaagcagacctaata gacatctacagaactctccaccccaaatcaacagaatatacatttttttcaacaccacac cacacctattccaaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaa agaacagaaattataacaaactatctctcagaccacagtgcaatcaaactagaactcagg attaagaatctcactcaaagccgctcaactacatggaaactgaacaacctgctcctgaat gactactgggtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgag aacaaagacaccacataccagaatctctgggacgcattcaaagcagtgtgtagagggaaa tttatagcactaaatgcctataagagaaagcaggaaagatccaaaattgacaccctaaca tcacaattaaaagaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaa gaaataactaaaatcagagcagaactgaaggaaagagagacacaaaaaacccttcaaaaa atcaatgaatccaggagctggttttttgaaaggatcaacaaaattgatagaccgctagca agactaataaagaaaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggg gatatcaccaccaatcccacagaaatacaaactaccatcagagaatactacaaacacctc tacgcaaataaactagaaaatctagaagaaatggatacattcctcgacacatacactctc ccaagactaaaccaggaagaagttgaatctctgaatagaccaataacaggatctgaaatt gtggcaataatcaatagtttaccaaccaaaaagagtccaggaccagatggattcacagct gaattctaccagaggtacaaggaggagctggtaccattccttctgaaactattccaatca atagaaaaagaaggaatcctccctaactcattttatgaggccagcatcattctgatacca aagccgggcagagacacaaccaaaaaagagaattttagaccaatatccttgatgaatatt gatgcaaaaatcctcaataaaatactggcaaatcgaatccagcagcacatcaaaaagctt atccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaa tcaataaatgtaatccagcatataaacagagccaaagacaaaaaccacatgattatctca atagatgcagaaaaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaat aaattaggtattgatgggacgtatttcaaaataataagagctatctatgacaaacccaca gccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaactggcacaaga cagggatgccctctctcaccgctcctattcaacatagtgttggaagttctggccagggca atcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtcc ctgtttgcagacgacatgattgtttatctagaaaaccccatcgtctcagcccaaaatctc cttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatca caagcattcttatacaccaacaacagacaaacagagagccaaatcatgagtgaactccca ttcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagggatgtgaag gacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggacacaaacaaa tggaagaacattccatgctcctgggtaggaagaatcaatatcgtgaaaatggccatactg cccaaggtaatttacagattcaatgccatccccatcaagctaccaatgactttcttcaca gaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatcgccaag tcaatcctaagccaaaagaacaaagctggaggcatcacactacctgacttcaaactatac tacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaa tggaacagaacagagccctcagaaataatgccgcatatctacaactatctgatctttgac aaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaa aactggctagccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaa atcaattcaagatggattaaagatttaaacattagacctaaaaccataaaaaccctagaa gaaaacctaggcattaccattcaggacataggcgtgggcaaggacttcatgtccaaaaca ccaaaagcaatggtaacaaaagccaaaattgacaaatgggatctaattaaactaaagagc ttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctacaacatgggagaaa attttcgcaacctactcatctgacaaagggctaatatccagaatctacaatgaactcaaa caaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcaaaggacatgaacaga cacttctcaaaagaagacatttatgcagccaaaaaacacatgaaaaaatgctcatcatca ctggccatcagagaaatgcaaatcaaaaccactatgagatatcatctcacaccagttaga atggcaatcattaaaaagtcaggaaacaacagcctgggaacagaacaagacctcatcaaa aaaaaaaaagaagaagaagaagaagaaaagaaaagccatagtattctgccctcgaggggc agggcagatccgggtgatggtcagcttggctgtgtgaatgggactcagtggaaaggactg ccttctgtggggtgcagaacaggggcattactttttgaatgtcctgatgagccacacaca gatctaaccatgctgtatcaggagagagtcagtgctagtcatgcacagtcatgcacaagg gaacttgcaaacaccctggcaaaagctttccacttgcacgtgagcaaatag >gi568815593r:132441814_132643478|GENSCAN_predicted_peptide_4|118_aa MALLFVVVGNDPKFEDSWSRVTLAVKNERLTVISPKRCLLQQRKATSWQIPLRGYSWTGH WDPAGAWETATLAPPIPPQSSLTCLFIWLFIHSLSKYSHSCNVPGTVLSIGGTAGSRT >gi568815593r:132441814_132643478|GENSCAN_predicted_CDS_4|357_bp atggccttgctgtttgtggtggtgggaaatgaccccaaatttgaagattcatggagcagg gtgactcttgctgttaagaatgagagactcaccgtcatcagccccaagagatgccttctg caacagcgaaaagccacctcttggcagatccctttacgtgggtacagctggactgggcac tgggatccagctggggcctgggaaactgccacactggcaccccctattcctccacagtca tccctcacttgtctgttcatttggttgtttattcattcactcagcaaatactcacacagc tgcaatgtgccaggcactgttctaagtattggtggcacagcagggagcaggacatag >gi568815593r:132441814_132643478|GENSCAN_predicted_peptide_5|134_aa MRMLLHLSLLALGAAYVYAIPTEIPTSALVKETLALLSTHRTLLIANETLRIPVPVHKNH QLCTEEIFQGIGTLESQTVQGGTVERLFKNLSLIKKYIDGQKKKCGEERRRVNQFLDYLQ EFLGVMNTEWIIES >gi568815593r:132441814_132643478|GENSCAN_predicted_CDS_5|405_bp atgaggatgcttctgcatttgagtttgctagctcttggagctgcctacgtgtatgccatc cccacagaaattcccacaagtgcattggtgaaagagaccttggcactgctttctactcat cgaactctgctgatagccaatgagactctgaggattcctgttcctgtacataaaaatcac caactgtgcactgaagaaatctttcagggaataggcacactggagagtcaaactgtgcaa gggggtactgtggaaagactattcaaaaacttgtccttaataaagaaatacattgacggc caaaaaaaaaagtgtggagaagaaagacggagagtaaaccaattcctagactacctgcaa gagtttcttggtgtaatgaacaccgagtggataatagaaagttga >gi568815593r:132441814_132643478|GENSCAN_predicted_peptide_6|1541_aa MSRIEKMSILGVRSFGIEDKDKQIITFFSPLTILVGPNGAGKTATATLGVCLKQSRKDVR TWPGDKVYLRIPRYRELSPLTQSSVPTGCPLTIIECLKYICTGDFPPGTKGNTFVHDPKV AQETDVRAQIRLQFRDVNGELIAVQRSMVCTQKSKKTEFKTLEGVITRTKHGEKVSLSSK CAEIDREMISSLGVSKAVLNNVIFCHQEDSNWPLSEGKALKQKFDEIFSATRYIKALETL RQVRQTQGQKVKEYQMELKYLKQYKEKACEIRDQITSKEAQLTSSKEIVKSYENELDPLK NRLKEIEHNLSKIMKLDNEIKALDSRKKQMEKDNSELEEKMEKVFQGTDEQLNDLYHNHQ RTVREKERKLVDCHRELEKLNKESRLLNQEKSELLVEQGRLQLQADRHQEHIRARDSLIQ SLATQLELDGFERGPFSERQIKNFHKLNDFAEKETLKQKQIDEIRDKKTGLGRIIELKSE ILSKKQNELKNVKYELQQLEGSSDRILELDQELIKAADKDEQIRKIKSRHSDELTSLLGY FPNKKQLEDWLHSKSKEINQTRDRLAKLNKELASSEQNKNHINNELKRKEEQLSSYEDKL FDVCGSQDFESDLDRLKEEIEKSSKQRAMLAGATAVYSQFITQLTDENQSCCPVCQRVFQ TEAELQEVISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLVPMRQSIIDLKEKEIPE LRNKLQNVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCLTDVTIMERFQMELKDVERKI AQQAAKLQGIDLDRTVQQVNQEKQEKQHKLDTVSSKIELNRKLIQDQQEQIQHLKSTTNE LKSEKLQISTNLQRRQQLEEQTVELSTEVQSLYREIKEEVKSPNGPTTSSEIEAVINSLP TKRSPGPDRFTAEFYQRYKEELVPFLLKLFLTIEKEGILPNSFYKASIILIPKPSRDTTK KENFRPISLMNIDAKILNKILANRIQQHVKKLIHHDQVGFIPGMQGWFNICKSINVIHHM NRTKDKNHMIIPIDGEKAFDKIQHRFMLKALNKLGIDGMYFKIIRAIYDRPTANIILNGQ KLEAFPLKTSYYEDQMKMDAKEQVSPLETTLEKFQQEKEELINKKNTSNKIAQDKLNDIK EKVKNIHGYMKDIENYIQDGKDDYKKQKETELNKVIAQLSECEKHKEKINEDMRLMRQDI DTQKIQERWLQDNLTLRKRNEELKEVEEERKQHLKEMGQMQVLQMKSEHQKLEENIDNIK RNHNLALGRQKGYEEEIIHFKKELREPQFRDAEEKYREMMIVMRTTELVNKDLDIYYKTL DQAIMKFHSMKMEEINKIIRDLWRSTYRGQDIEYIEIRSDADENVSASDKRRNYNYRVVM LKGDTALDMRGRCSAGQKADGFSGSEGFSIRFLVTTKMPNSSSYLDFLQVLASLIIRLAL AETFCLNCGIIALDEPTTNLDRENIESLAHALVDVDAKVVKQLLHKDYAGREKGSFPEVK GGVSGTPGSRQSLSGREGQKESSAAQGHLSAAIITSKGECF >gi568815593r:132441814_132643478|GENSCAN_predicted_CDS_6|4626_bp atgtcccggatcgaaaagatgagcattctgggcgtgcggagttttggaatagaggacaaa gataagcaaattatcactttcttcagcccccttacaattttggttggacccaatggggcg ggaaagacggctacagcgaccttaggagtttgcctgaagcagtctcggaaggatgtccgg acctggcctggggacaaggtttacttaagaatcccacgataccgggagctgtcgccgctc actcagagttcagttcccactggatgccctttgaccatcattgaatgtctaaaatatatt tgtactggagatttccctcctggaaccaaaggaaatacatttgtacacgatcccaaggtt gctcaagaaacagatgtgagagcccagattcgtctgcaatttcgtgatgtcaatggagaa cttatagctgtgcaaagatctatggtgtgtactcagaaaagcaaaaagacagaatttaaa actctggaaggagtcattactagaacaaagcatggtgaaaaggtcagtctgagctctaag tgtgcagaaattgaccgagaaatgatcagttctcttggggtttccaaggctgtgctaaat aatgtcattttctgtcatcaagaagattctaattggcctttaagtgaaggaaaggctttg aagcaaaagtttgatgagattttttcagcaacaagatacattaaagccttagaaacactt cggcaggtacgtcagacacaaggtcagaaagtaaaagaatatcaaatggaactaaaatat ctgaagcaatataaggaaaaagcttgtgagattcgtgatcagattacaagtaaggaagcc cagttaacatcttcaaaggaaattgtcaaatcctatgagaatgaacttgatccattgaag aatcgtctaaaagaaattgaacataatctctctaaaataatgaaacttgacaatgaaatt aaagccttggatagccgaaagaagcaaatggagaaagataatagtgaactggaagagaaa atggaaaaggtttttcaagggactgatgagcaactaaatgacttatatcacaatcaccag agaacagtaagggagaaagaaaggaaattggtagactgtcatcgtgaactggaaaaacta aataaagaatctaggcttctcaatcaggaaaaatcagaactgcttgttgaacagggtcgt ctacagctgcaagcagatcgccatcaagaacatatccgagctagagattcattaattcag tctttggcaacacagctagaattggatggctttgagcgtggaccattcagtgaaagacag attaaaaattttcacaaacttaatgactttgcagaaaaagagactctgaaacaaaaacag atagatgagataagagataagaaaactggactgggaagaataattgagttaaaatcagaa atcctaagtaagaagcagaatgagctgaaaaatgtgaagtatgaattacagcagttggaa ggatcttcagacaggattcttgaactggaccaggagctcataaaagctgctgacaaagat gaacaaatcagaaaaataaaatctaggcacagtgatgaattaacctcactgttgggatat tttcccaacaaaaaacagcttgaagactggctacatagtaaatcaaaagaaattaatcag accagggacagacttgccaaattgaacaaggaactagcttcatctgagcagaataaaaat catataaataatgaactaaaaagaaaggaagagcagttgtccagttacgaagacaagctg tttgatgtttgtggtagccaggattttgaaagtgatttagacaggcttaaagaggaaatt gaaaaatcatcaaaacagcgagccatgctggctggagccacagcagtttactcccagttc attactcagctaacagacgaaaaccagtcatgttgccccgtttgtcagagagtttttcag acagaggctgagttacaagaagtcatcagtgatttgcagtctaaactgcgacttgctcca gataaactcaagtcaacagaatcagagctaaaaaaaaaggaaaagcggcgtgatgaaatg ctgggacttgtgcccatgaggcaaagcataattgatttgaaggagaaggaaataccagaa ttaagaaacaaactgcagaatgtcaatagagacatacagcgcctaaagaacgacatagaa gaacaagaaacactcttgggtacaataatgcctgaagaagaaagtgccaaagtatgcctg acagatgttacaattatggagaggttccagatggaacttaaagatgttgaaagaaaaatt gcacaacaagcagctaagctacaaggaatagacttagatcgaactgtccaacaagtcaac caggagaaacaagagaaacagcacaagttagacacagtttctagtaagattgaattgaat cgtaagcttatacaggaccagcaggaacagattcaacatctaaaaagtacaacaaatgag ctaaaatctgagaaacttcagatatccactaatttgcaacgtcgtcagcaactggaggag cagactgtggaattatccactgaagttcagtctttgtacagagagataaaggaagaagtc aaatccccaaatggaccaacaacaagttctgaaattgaggcagtaattaatagcctgcca accaaaagaagtccaggaccagacagattcacagccgaattctaccagaggtacaaagag gagctggtaccattccttctgaaactattcctaacaatagaaaaagagggaatcctccct aactcattttataaggccagcatcatcctgataccaaaacctagcagagacacaacaaaa aaagaaaatttcaggccaatatccctgatgaacatcgatgcaaaaatcctcaataaaata ctggcaaaccgaatccagcagcacgtcaaaaagcttatccaccacgatcaagtcggcttt attcctgggatgcaaggctggttcaacatatgcaaatcaataaatgtaatccatcacatg aacagaaccaaagacaaaaaccacatgattatcccaatagatggagaaaaggccttcgat aaaattcaacaccgcttcatgctaaaagctctcaataaactaggtatcgatggaatgtat ttcaaaataatacgagctatttatgacagacccacagccaatatcatactgaatgggcaa aaactggaagcattccctttgaaaaccagctattatgaggatcaaatgaaaatggatgct aaagagcaggtaagccctttggaaacaacattggaaaagttccagcaagaaaaagaagaa ttaatcaacaaaaaaaatacaagcaacaaaatagcacaggataaactgaatgatattaaa gagaaggttaaaaatattcatggctatatgaaagacattgagaattatattcaagatggg aaagacgactataagaagcaaaaagaaactgaacttaataaagtaatagctcaactaagt gaatgcgagaaacacaaagaaaagataaatgaagatatgagactcatgagacaagatatt gatacacagaagatacaagaaaggtggctacaagataaccttactttaagaaaaagaaat gaggaactaaaagaagttgaagaagaaagaaaacaacatttgaaggaaatgggtcaaatg caggttttgcaaatgaaaagtgaacatcagaagttggaagagaacatagacaatataaaa agaaatcataatttggcattagggcgacagaaaggttatgaagaagaaattattcatttt aagaaagaacttcgagaaccacaatttcgggatgctgaggaaaagtatagagaaatgatg attgttatgaggacaacagaacttgtgaacaaggatctggatatttattataagactctt gaccaagcaataatgaaatttcacagtatgaaaatggaagaaatcaataaaattatacgt gacctgtggcgaagtacctatcgtggacaagatattgaatacatagaaatacggtctgat gccgatgaaaatgtatcagcttctgataaaaggcggaattataactaccgagtggtgatg ctgaagggagacacagccttggatatgcgaggacgatgcagtgctggacaaaaggcagat ggtttcagtgggtctgagggttttagtattaggtttctagtaaccacgaaaatgcctaac agttccagttacttggattttctacaggtattagcctcactcatcattcgcctggccctg gctgaaacgttctgcctcaactgtggcatcattgccttggatgagccaacaacaaatctt gaccgagaaaacattgaatctcttgcacatgctctggttgatgtagacgccaaggttgtc aagcagctcttgcataaggactatgctggcagagaaaaaggcagtttccctgaggtcaaa ggaggtgtttcaggcactcctggctccagacagtccctttctggcagagagggccagaag gagagctcagcagcgcagggccacctttctgcagccatcatcacaagtaagggcgagtgc ttttga >gi568815593r:132441814_132643478|GENSCAN_predicted_peptide_7|61_aa HNSKACDLVAFCVEWKDSRCQLAIPGKTVIRLFLEVASEEQMERPVLPGLVGPYREQALT L >gi568815593r:132441814_132643478|GENSCAN_predicted_CDS_7|186_bp cataattcaaaagcgtgtgacttggttgcattctgtgtggaatggaaggattcaagatgt cagctggcaattccaggaaaaactgtgattaggcttttcttagaagtggcatctgaagag caaatggagaggcctgttcttccaggtctggttggaccctacagggagcaggccttgact ctgtga