GENSCAN 1.0 Date run: 16-Jul-119 Time: 16:00:54

Sequence gi568815591f:91841100_92095812 : 254713 bp : 37.52% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Sngl +   4644   4859  216  2  0   51   42   190 0.812   5.62
 1.02 PlyA +   5828   5833    6                               1.05

 2.00 Prom +   6872   6911   40                              -5.95
 2.01 Init +  10255  10376  122  0  2   72   49   159 0.933  10.11
 2.02 Intr +  15315  15496  182  0  2   37   58   124 0.241   2.89
 2.03 Term +  22779  22951  173  1  2   60   44    88 0.300  -1.39
 2.04 PlyA +  23893  23898    6                               1.05

 3.02 PlyA -  24386  24381    6                               1.05
 3.01 Sngl -  33634  32495 1140  1  0   64   49   646 0.855  54.49
 3.00 Prom -  33856  33817   40                              -6.85

 4.00 Prom +  38275  38314   40                              -6.15
 4.01 Init +  39197  39206   10  1  1   39   69     3 0.209  -5.76
 4.02 Intr +  39583  39765  183  2  0   69   89   201 0.768  17.14
 4.03 Intr +  44734  44994  261  2  0   12   53   148 0.355   0.24
 4.04 Term +  47931  48223  293  1  2   -5   42   229 0.404   3.52
 4.05 PlyA +  48593  48598    6                               1.05

 5.03 PlyA -  49441  49436    6                               1.05
 5.02 Term -  58196  58139   58  1  1   16   49   133 0.332  -1.62
 5.01 Init -  66198  63239 2960  0  2   44   53   914 0.414  74.65
 5.00 Prom -  66291  66252   40                              -6.15

 6.02 PlyA -  66460  66455    6                               1.05
 6.01 Sngl -  67704  66688 1017  0  0   59   43   768 0.994  66.17
 6.00 Prom -  71101  71062   40                              -5.25

 7.00 Prom +  73418  73457   40                              -4.35
 7.01 Init +  90716  90864  149  1  2   83   35    80 0.263   1.81
 7.02 Intr +  99703 100048  346  1  1    5   97   361 0.325  23.17
 7.03 Intr + 100630 100788  159  0  0   26    3   157 0.210   0.26
 7.04 Intr + 132612 132869  258  2  0   87  100   154 0.698  13.24
 7.05 Intr + 139190 139234   45  1  0   80   94    44 0.361   1.89
 7.06 Intr + 151786 151956  171  0  0   19   91   215 0.127  14.12
 7.07 Intr + 153522 153677  156  2  0   70  108    82 0.995   7.69
 7.08 Intr + 154504 154701  198  0  0   85   80   180 0.913  15.53
 7.09 Term + 159749 162163 2415  1  0   69   42  1965 0.402 174.35
 7.10 PlyA + 162366 162371    6                               1.05

 8.02 PlyA - 162508 162503    6                               1.05
 8.01 Sngl - 164156 163128 1029  2  0   38   38   383 0.810  25.13
 8.00 Prom - 166405 166366   40                              -7.45

 9.00 Prom + 170712 170751   40                              -5.75
 9.01 Init + 171348 171543  196  2  1   37   98   190 0.886  14.04
 9.02 Intr + 173150 173229   80  0  2   78   98    27 0.996   1.05
 9.03 Intr + 175030 175168  139  0  1   81   85   133 0.995  11.42
 9.04 Intr + 181715 181910  196  0  1   42   61   146 0.791   4.85
 9.05 Intr + 188796 188892   97  0  1   80   52    72 0.963   1.89
 9.06 Intr + 190413 190505   93  2  0   42  102   116 0.995   7.64
 9.07 Intr + 197320 197673  354  0  0  112   84   147 0.970  11.26
 9.08 Intr + 199575 199799  225  2  0   49  110   260 0.982  21.56
 9.09 Intr + 200947 201087  141  0  0   81   67   128 0.959   9.63
 9.10 Intr + 201569 201672  104  1  2   30   69    97 0.884   0.05
 9.11 Intr + 203909 204114  206  2  2   95   64    77 0.851   3.92
 9.12 Intr + 211633 211859  227  2  2   60   91   265 0.492  20.78
 9.13 Intr + 220161 220323  163  2  1   71   84   203 0.914  16.83
 9.14 Intr + 221175 221387  213  1  0   72  111   210 0.999  19.46
 9.15 Intr + 222363 222401   39  1  0  111   89    48 0.955   4.48
 9.16 Intr + 224132 224364  233  0  2   61   95   187 0.975  13.27
 9.17 Intr + 225328 225447  120  0  0   49   94    89 0.975   5.37
 9.18 Intr + 228931 229107  177  0  0  112   70   120 0.538  11.79
 9.19 Intr + 229806 229910  105  2  0   26   94   174 0.999  11.29
 9.20 Intr + 235756 235908  153  0  0   34   86   121 0.973   5.85
 9.21 Intr + 236597 236776  180  1  0   89   69   119 0.990   9.24
 9.22 Intr + 237980 239053 1074  1  0   61   96   984 0.999  85.96
 9.23 Intr + 241423 241563  141  0  0   74   91   171 0.999  15.63
 9.24 Intr + 242071 242556  486  0  0   91   53   366 0.948  25.68
 9.25 Intr + 244396 244587  192  0  0   87   76   127 0.998  10.17
 9.26 Intr + 245129 245317  189  1  0   57   75   161 0.998  10.56
 9.27 Intr + 248286 248430  145  2  1  110   95    84 0.990  10.23
 9.28 Intr + 251998 252217  220  2  1   54   69   219 0.997  13.04
 9.29 Intr + 253924 254074  151  1  1   44   69   205 0.960  13.44

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:91841100_92095812|GENSCAN_predicted_peptide_1|71_aa
MFDNSWQCGPKVWKTVAGRRASNEIRGEKRREKAAAVALVKAKLEKTNGKKSQFKKKGQS
KQKEIQDIAVS

>gi568815591f:91841100_92095812|GENSCAN_predicted_CDS_1|216_bp
atgtttgacaacagttggcaatgtggacccaaggtttggaaaacagttgctggtagaaga
gctagcaatgagattagaggagagaaaaggagagaaaaggctgctgctgtagccttggtt
aaggctaaattagagaaaacaaacgggaagaagagtcaatttaaaaaaaaaggacaaagt
aaacagaaagaaatccaggatattgcagtgtcttaa

>gi568815591f:91841100_92095812|GENSCAN_predicted_peptide_2|158_aa
MKEDNQDLSESPENEAVVQKRKAAGVQQRSTPEKPIALRKGVLELLATAIKQEKEIKDIQ
IGKEAVKLFLLADDMISYPENLKDFFKRLLDLINNFSKVSGSLFTIAKIWTQSKCPSTAE
QLKKKCGIYTQATTIQPYKEGNSVICNHMDEPGGHYIK

>gi568815591f:91841100_92095812|GENSCAN_predicted_CDS_2|477_bp
atgaaggaagacaaccaggatctgtcagagagtccagagaatgaagctgtagtgcagaaa
aggaaagcagcaggagtgcagcagagatcaacccctgagaaacccatagccctacggaaa
ggagtactagaactcctggccacagcaatcaagcaagagaaagaaataaaagatatccaa
attggaaaagaggcagtcaaactatttttgttggctgatgacatgatctcatacccagaa
aatctgaaagatttcttcaaaagactcctagacctgataaacaacttcagtaaagtttca
ggatcattattcacaatagccaagatatggactcaatctaagtgtccatcaacagctgaa
cagctaaaaaaaaaatgtggtatatatacacaagcaactactattcagccttataaagaa
ggaaattctgtcatttgcaaccacatggatgaacctggaggacactatattaagtga

>gi568815591f:91841100_92095812|GENSCAN_predicted_peptide_3|379_aa
MAPGNLWHMRNNFLFGSRCWMTRFSAENIFKSVSFRLFGVKCHNTDSEPLKNEDLLKNLL
TMGVDIDMARKRQPGVFHRMITNEQDLKMFLLSKGASKEVIASIISRYPRAITRTPENLS
KRWDLWRKIVTSDLEIVNILERSPESFFRSNNNLNLENNIKFLYSVGLTRKCLCRLLTNA
PRTFSNSLDLNKQMVEFLQAAGLSLGHNDPADFVRKIIFKNPFILIQSTKRVKANIEFLR
STFNLNSEELLVLICGPGAEILDLSNDYARRSYANIKEKLFSLGCTEEEVQKFVLSYPDV
IFLAEKKFNDKIDCLMEENISISQIIENPRVLDSSISTLKSRIKELVNAGCNLSTLNITL
LSWSKKRYEAKLKKLSRFA

>gi568815591f:91841100_92095812|GENSCAN_predicted_CDS_3|1140_bp
atggcaccaggaaacctctggcatatgagaaataactttctctttggttcaagatgttgg
atgactcgattttcagcagaaaacatcttcaaatcagtttcatttaggctttttggtgtg
aagtgtcataatacagacagtgagcctttgaaaaatgaggacctactgaaaaacttactt actatgggagtagatattgacatggcaaggaaacgacagcctggagtttttcataggatg attaccaatgagcaggacctgaagatgttccttctttccaaaggagctagcaaagaagtg atcgctagcatcatatcaagatatccacgagcaataacacgtactcccgagaatctttca aaacggtgggatctgtggagaaagattgtgacatcagaccttgaaattgtaaatattttg gaacgttctcctgaatccttttttcggtccaataacaacctaaacttagagaataatata aagttcctctactcagttggattgacccgtaaatgcctttgtcgattgttgaccaatgcc cctcgtaccttctccaatagtcttgatctgaataaacagatggttgaatttttgcaggca gccggtttgtcattgggtcacaatgatcccgcagattttgtcagaaagataatttttaaa aacccttttatcttaattcagagcaccaagcgggtgaaagctaacattgaattcttacgg tcaactttcaatttgaacagtgaggaactgctggttctgatatgtggtccaggagctgaa atcctagacctttccaatgactatgccagaagaagctacgcaaacatcaaagagaagctg ttttctcttggatgtactgaagaagaggtacagaagtttgtcttaagctatccagatgtg atcttcttggcagagaaaaagtttaatgataaaatagactgcctcatggaagaaaacatt agcatttcacaaataatcgaaaatcctcgggttctggattcaagcataagtactttaaaa agtcgaatcaaagaattggtaaatgctggctgtaacttgagtactttaaacatcactctt ctatcttggagtaaaaaaagatatgaagctaaattgaaaaagttaagcagatttgcctaa >gi568815591f:91841100_92095812|GENSCAN_predicted_peptide_4|248_aa MGKVRFYFRLWGPERLGGAEVEKTNFRPRVARRGDVVGPAPPRGGTSALRSPTLGAWQGA ARDRGSHKSLLLQGQHQEYEESARMTQTLPSRPHLQHWGLHLNTGFGGDIQTISRLYKKL GASICFYAEPQAASTHSKRQKGMQRSCGKRRMRSASAQPPIVWDVRSASAQPPIVWDVRS ASAWLPHLGSEKRLCPAISSGRCTQQLRRDSDHRERAMMTMALLSKRKGGNVGKRKRDQI VTVSGRKK >gi568815591f:91841100_92095812|GENSCAN_predicted_CDS_4|747_bp atgggaaaagttcgcttttacttccgcctctggggtccggagcggctaggaggagcggaa gtggaaaagactaacttccggcctcgggtcgcgcgcagaggtgatgtcgttggtcctgcg ccccctcgtggcggtacttcagcccttcgcagcccgacgctaggggcctggcagggggca gcaagagaccgcggatctcacaagagcttactcttacaaggacagcaccaagaatatgag gaatctgcccgcatgacccaaacacttccctccaggccccacctccaacactggggatta catctcaacactggatttggaggggacatccaaactatatcaaggctgtacaagaagctt ggtgccagcatttgcttctacgcagagcctcaggctgcttctactcacagtaaaaggcaa aagggcatgcaaagatcatgtggcaagagaagaatgaggagcgcctctgcccagccgccc atcgtctgggatgtgaggagcgcctctgcccagccgcccatcgtctgggatgtgaggagc 
gcctctgcctggctgccccatctgggaagtgagaagcgcctctgcccggccatctcatct gggaggtgtacccaacagctccgaagagacagcgaccatcgagaacgggccatgatgacg atggcgcttttgtcgaaaagaaaagggggaaatgtggggaaaagaaagagagatcagatt gttactgtgtctggtagaaagaagtag >gi568815591f:91841100_92095812|GENSCAN_predicted_peptide_5|1005_aa MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSTEHTFFSAPHHTYS KIDHILGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQNCSTTWKQNNLLLNDYWV HNEMKAEIKMFFENNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKIRAELKEIETQKSLQKINESRSWFFERINKIDRPLARLIK KKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLN QEEVESLNRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSTEKE GILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHD QVGFIPGMQGWFNICKSINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGI DGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQECLLLPLLFNIVLEVLARAIRQE KEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSEYKINVQKSQAFL YTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNI PCSWVGRINIVKMATLPKVIYRFSAIPIKLPMTFFTALEKTTLKFIWNQKRARIAKAILS QKNKAGAITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITLHIYNYLIFDKPDK NKQWGKDSLFNKWCWENWLAICRKLKLGPFLTPYTKINSRWIKDLNVRPKTIKTLEENLG ITIQDIGMGKDFMSKTPKAMATKAKIDKWDLIKRKSFCTAKETTIRVNRQPTKWEKIFAT YSSDKGLISRIYNELKQIYKKKTINPMKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAIR EMQIKTTMRYYLTPVRMAIIKKSGNNRNDDDDNDDDDNDHAIVSQ >gi568815591f:91841100_92095812|GENSCAN_predicted_CDS_5|3018_bp atgggagactttaacactccactgtcaacattagacagatcaacgagacagaaagtcaac aaggatacccaggaattgaactcagctctgcaccaagcggacctaatagacatctacaga actctccaccccaaatcaacagaacatacatttttttcagcaccacaccacacctattcc aaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaaaagaacagagatt ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc actcaaaactgctcaactacatggaaacagaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaacaacgagaacaaagacaca acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa 
atcagagcagaactgaaggaaatagagacacaaaaaagccttcaaaaaattaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagaccactagcaagactaataaag aaaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggggatatcaccact gatcccacagaaatacaaactaccataagagaatactacaaacacctctacgcaaataaa ctagaaaatctagaagaaatggataaattcctcgacacatacactctcccaagactaaac caggaagaagttgaatctctgaatagaccaataacaggagctgaaattgtggcaataatc aatagtttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccag aggtacaaggaggagctggtaccattccttctgaaactattccaatcaacagaaaaagag ggaatcctccctaactcattttatgaggccagcatcattctgataccaaagccaggcaga gacacaaccaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatc ctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgat caagtgggcttcatccctgggatgcaaggctggttcaatatatgcaaatcaataaatgta atccagcatataaacagagccaaagacaaaaaccacatgattatctcaatagatgcagaa aaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtatt gatgggacgtatttcaaaataataagagctatctatgacaaacccacagccaatatcata ctgaatgggcaaaaactggaagcattccctttgaaaactggcacaagacaggaatgcctt ctcttaccactcctattcaatatagtgttggaagttctggccagggcaatcaggcaggag aaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagac gacatgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaagctgata agcaacttcagcaaagtctcagaatacaaaatcaatgtacaaaaatcacaagcgttctta tacaccaacaacagacaaacagagagccaaatcatgagtgaactcccattcacaattgct tcaaagagaataaaatacctaggaatccagcttacaagggatgtgaaggacctcttcaag gagaactacaaaccactgctcaaggaaataaaagaggatacaaacaaatggaagaacatt ccatgctcatgggtaggaagaatcaatattgtgaaaatggccacactgcccaaggtaatt tacagattcagtgctatccccataaagctaccaatgactttcttcacagcattggaaaaa actactttaaagttcatatggaaccaaaaaagagcccgcattgccaaggcaatcctaagc caaaagaacaaagctggagccatcacactacctgacttcaaactatactacaaggctaca gtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaaca gagccctcagaaataacgctgcatatctacaactatctgatctttgacaaacctgacaaa aacaagcaatggggaaaggattccctatttaataaatggtgctgggaaaactggctagcc atatgtagaaagctgaaactgggtcccttccttacaccttatacaaaaatcaattcaaga tggattaaagacttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggc 
attaccattcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaagcaatg gcaacaaaagccaaaattgacaaatgggatctaattaaacgaaagagcttctgcacagca aaagaaactaccatcagagtgaacaggcaacctacaaaatgggagaaaatttttgcaacc tactcatctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttacaag aaaaaaacaatcaaccccatgaaaaagtgggcgaaggacatgaacagacacttctcaaaa gaagacatttatgcagccaaaaaacacatgaaaaaatgctcatcatcactggccatcaga gaaatgcaaatcaaaaccacaatgagatactatctcacaccagttagaatggcaatcatt aaaaagtcaggaaacaacagaaatgatgatgatgataatgatgatgatgataatgaccat gctattgtgagtcaataa >gi568815591f:91841100_92095812|GENSCAN_predicted_peptide_6|338_aa MGKKQNRKTGNSKTQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ TKGKEVENFEKNLEECITRITNRDKCLKELMELKTKAREPREECRSLRSRCDQLEERVLV VEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD IIQENFPNLARQANDQIQEIQRMPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRV THKGKPIRLTADLLAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFTDK QMLRDFVTTRPALKELLKEALNMERNNRYQPLQNHAKM >gi568815591f:91841100_92095812|GENSCAN_predicted_CDS_6|1017_bp atgggaaaaaaacagaacagaaaaactggaaactctaaaacgcagagcgcctctcctcct ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagaatgactttgac gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatagagacaagtgcttaaaggagctcatggagctgaaaaccaaggctcgagaacca cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtattagtg gtggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagtgatggggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatctagcaaggcaggccaacgatcagattcaggaaata cagagaatgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggtt acccacaaagggaagcccatcagactaacagcggatctcttggcagaaaccctacaagcc agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttacagacaag caaatgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagca 
ctaaacatggaaaggaacaaccggtaccagccgctgcaaaatcatgccaaaatgtaa >gi568815591f:91841100_92095812|GENSCAN_predicted_peptide_7|1298_aa MDEARNYHPQQTNTGTESQTPHVLTHNWELNNENTWTQGGEHHTLGPVAGSTCPRGGCAD RRSSPSRLAVCLRGDEDGGGGGGDGASRAAEDDPPVSAETASTSGGGAPDRIGSLGRGAC RPTSVQIDLSFLSPTTPQPLFSPAFLAEAMEDEERQKKLEAGKAKVFVMACAFVSYGMLM QSVNGTSEDRKVPYANTYRIPALNNLVVIYLSGESWLCLAQFRQRKAQSDGQSPSKKQKK KRKTSSSKHDVSAHHDLNIDQSQCNEMYINSSQRVESTVIPESTIMRTLHSGEITSHEQG FSVELESEISTTADDCSSEEEEFGVDDSYSEQGAQDSPTHLEMMESELAGKQHEIEELNR ELEEMRVTYGTEGLQQLQEFEAAIKQRDGIITQLTANLQQARREKDETMREFLELTEQSQ KLQIQFQQLQASETLRNSTHSSTAADLLQAKQQILTHQQQLEEQDHLLEDYQKKKEDFTM QISFLQEKIKVYEMEQDKKVENSNKEEIQEKETIIEELNTKIIEEEKKTLELKDKLTTAD KLLGELQEQIVQKNQEIKNMKLELTNSKQKERQSSEEIKQLMGTVEELQKRNHKDSQFET DIVQRMEQETQRKLEQLRAELDEMYGQQIVQMKQELIRQHMAQMEEMKTRHKGEMENALR SYSNITVNEDQIKLMNVAINELNIKLQDTNSQKEKLKEELGLILEEKCALQRQLEDLVEE LSFSREQIQRARQTIAEQESKLNEAHKSLSTVEDLKAEIVSASESRKELELKHEAEVTNY KIKLEMLEKEKNAVLDRMAESQEAELERLRTQLLFSHEEELSKLKEDLEIEHRINIEKLK DNLGIHYKQQIDGLQNEMSQKIETMQFEKDNLITKQNQLILEISKLKDLQQSLVNSKSEE MTLQINELQKEIEILRQEEKEKGTLEQEVQELQLKTELLEKQMKEKENDLQEKFAQLEAE NSILKDEKKTLEDMLKIHTPVSQEERLIFLDSIKSKSKDSVWEKEIEILIEENEDLKQQC IQLNEEIEKQRNTFSFAEKNFEVNYQELQEEYACLLKVKDDLEDSKNKQELEYKSKLKAL NEELHLQRINPTTVKMKSSVFDEDKTFVAETLEMGEVVEKDTTELMEKLEVTKREKLELS QRLSDLSEQLKQKHGEISFLNEEVKSLKQEKEQVSLRCRELEIIINHNRAENVQSCDTQV SSLLDGVVTMTSRGAEGSVSKVNKSFGEESKIMVEDKVSFENMTVGEESKQEQLILDHLP SVTKESSLRATQPSENDKLQKELNVLKSEQVCLLLHIW >gi568815591f:91841100_92095812|GENSCAN_predicted_CDS_7|3897_bp atggatgaagctagaaactatcatcctcagcaaactaacacaggaacagaaagccaaaca ccacatgttctcacgcataattgggagctgaacaatgagaacacatggacacagggaggg gaacatcacacactggggcctgtggcagggagcacctgcccacgaggcggctgcgcggac agacgttccagcccctcccgcctcgccgtgtgtttacgtggagacgaagatggcggcggc ggcggcggtgacggcgcttcccgtgcggctgaggacgatccgccagtgagcgcggagact gcttccacttcgggcgggggagcgccggaccgaatcggctctctaggccgtggagcttgc cgtcccacctccgtccaaatcgacctttcctttctatccccaaccacccctcaacccctg ttttcccctgccttccttgcagaggccatggaggacgaggagagacagaagaagctggag 
gccggcaaagccaaggtctttgtcatggcatgcgcttttgtgtcctatggaatgttaatg cagtctgtgaatggaacttctgaggaccgaaaggttccatatgccaacacctatagaatt cctgctttaaataatttagttgttatatatctctctggcgaatcctggttgtgtcttgcc cagtttcgacaaagaaaagctcagtcggatgggcagagtccttccaagaagcagaaaaaa aagagaaaaacgtcaagcagtaaacatgatgtgtcagcacaccatgatttgaatattgat caatcacagtgtaatgaaatgtacataaatagttctcagagagtagaatcaactgtgatt cctgaatctacaataatgagaactctacatagtggagaaataaccagtcatgagcagggc ttctctgtggaactggaaagtgaaatttcaaccacagcagatgactgcagttcagaggaa gaagaatttggtgttgatgattcttattctgaacaaggagcacaagacagtccgactcat ctagagatgatggaaagtgagttggctgggaagcagcatgagattgaagagctaaacaga gagctggaagaaatgagggttacctatgggactgaaggactgcagcagttacaagaattt gaagctgccattaaacaaagagatggcattataacccagctcactgctaatttacaacaa gcaagaagagaaaaggatgagacaatgagagaatttttagagttgacagaacagagtcaa aaattacagattcaatttcagcaattacaggctagtgaaactctgagaaacagcactcat agtagcacagctgcagacttactacaagccaaacaacagatcctcactcatcaacagcag cttgaagaacaagaccacttattagaagattatcagaaaaagaaagaagacttcacaatg caaattagtttcttgcaagagaaaattaaagtatatgaaatggaacaagataaaaaagta gaaaactcaaataaagaagaaatacaggaaaaggagacaatcattgaagaattaaacaca aaaataatagaagaagaaaagaaaactcttgagctaaaggataaattaacaactgctgat aaattactaggagaattacaagaacagattgtgcaaaagaaccaagaaataaaaaacatg aaattagagctgactaattctaagcaaaaagaaagacagtcttctgaagaaataaaacag ttaatggggacagtcgaagaacttcagaagagaaatcataaagacagccagttcgaaact gatatagtacaacgaatggaacaagaaacacaaagaaagttagaacaactccgggcagag ctggatgagatgtatgggcagcagatagtgcaaatgaaacaagaattaataagacaacac atggcacagatggaggaaatgaaaacacggcataagggagaaatggagaatgctttaagg tcatattcaaatattacagttaatgaagatcagataaagttaatgaatgtggcaataaat gaactgaatataaaattgcaagatactaactctcaaaaggaaaaactcaaggaagaacta ggactaattttagaagaaaagtgtgctctacagagacagcttgaagaccttgttgaagaa ttgagcttttcaagggaacagattcagagagctagacagacaatagctgaacaagaaagt aaacttaatgaagcacataagtcccttagtacagtggaagatttgaaagctgagattgtt tctgcatctgaatccagaaaggaactagaattaaaacatgaagcagaagttacaaattac aagataaaacttgaaatgttagaaaaagaaaagaatgctgtgttagacagaatggctgaa 
tcacaagaagctgaattagagaggctgagaacacagcttctatttagtcacgaagaagag ctttccaaactgaaggaagatttagaaattgaacatcgaataaatattgaaaaacttaaa gataatttaggcattcactataaacagcagatagatggtttacagaatgaaatgagtcaa aagatagaaaccatgcagtttgaaaaggacaatttgataactaagcagaatcaattaatt ttggaaatttcaaagctaaaagatttacagcagtctcttgtaaattcaaagtcagaagaa atgactcttcaaatcaatgaacttcaaaaagaaattgaaatactcagacaagaagaaaaa gaaaagggtacacttgaacaagaagttcaagaattacaacttaaaacagaattgttagaa aaacagatgaaggaaaaagagaatgatcttcaagaaaaatttgcacaacttgaagcagag aatagcattcttaaagatgaaaagaaaacccttgaagacatgttgaaaatacatactcct gttagccaagaagaaagattgattttcttagactccattaagtccaaatccaaagactct gtgtgggaaaaagaaatagaaatacttatagaggaaaatgaggacctcaaacaacaatgt attcagctaaatgaagagattgaaaagcaaaggaacactttttcatttgctgaaaaaaac tttgaagttaactatcaagagttacaagaggagtatgcttgccttctcaaagtaaaagat gatttagaagacagtaaaaataaacaggaattagagtataaaagtaaacttaaagcactt aatgaagagcttcatttgcaaagaataaatccaactacagtgaaaatgaaaagttctgtc tttgatgaagacaaaacttttgtagcagaaacattggaaatgggtgaggttgttgaaaag gatacaacagaactcatggaaaaacttgaggtaaccaagcgagagaaattagagctgtca cagagactgtctgatctttctgaacaattgaaacagaaacatggtgagattagttttcta aatgaagaagttaaatctttaaagcaagagaaagaacaagtttcattgagatgtagagag ctagaaatcattattaaccacaacagggcagaaaatgtacagtcatgtgatactcaagta agctctttattagatggagttgtgaccatgacaagcaggggtgctgaaggatcagtttct aaagtaaataaaagttttggtgaagaatcaaaaataatggtggaagataaagtttctttt gaaaatatgactgttggagaagaaagtaagcaagaacagttgattttggatcacttacca tctgtaacaaaggaatcatcacttagagcaactcaaccaagtgaaaatgataaacttcag aaagaactcaatgtacttaaatcagaacaggtatgtttacttcttcatatatggtaa >gi568815591f:91841100_92095812|GENSCAN_predicted_peptide_8|342_aa MNIDAKILNKILANQMQQHIRKLIHHDQVGFIPGMQGWFNIRKSVSVIQHINRTKDKNHM IISIDAEKALDKIQQHFMLKTLNKLGTDGAYLKIVRAIYDKPTANIILNGQKPEAFPLKT GTRQGCPLSPLLFNIVLEVLAREIRQEKEIKGIQLGKEEVKLSLFADDVIVYLENPIVSA QNLLKLISNFSKVSGYKINVQKSQAFLYTNNRRTESQIMSELPFTIASMRIKYLGIQLTR DMKGLFKENYKRLLNEIKEDTNKWKNIPCSWIGRINIVKMAILPKVIYRFNAIPIRNDFL HRNDFLHRIGKNYFKVHKEPKKSPHCQVNPKPREQSWRHHAT 
>gi568815591f:91841100_92095812|GENSCAN_predicted_CDS_8|1029_bp atgaacatcgatgcaaaaatcctcaataaaatactggcaaaccaaatgcagcagcacatc agaaagctcatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaac atacgcaaatcagtaagtgtaatccagcatataaacagaaccaaagacaaaaaccacatg attatctcaatagatgcagaaaaggccttggacaaaattcaacagcacttcatgctaaaa actctcaataaattaggtactgatggggcatatctcaaaatagtaagagctatttatgac aaacccacagccaatatcatactgaatggacaaaaaccggaagcattccctttgaaaact ggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagttctg gccagggaaatcaggcaggagaaagaaataaagggtattcaattaggaaaagaggaagtc aaattgtccctgtttgcagatgacgtgattgtatatctagaaaaccccatcgtctcagcc caaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtg caaaaatcacaagcattcttatacaccaacaacagacgaacagagagccaaatcatgagt gaactcccattcacaattgcttcaatgagaataaaatacctaggaatccaacttacaagg gatatgaagggcctcttcaaggagaactacaaacgactgctcaacgaaataaaagaggat acaaacaaatggaagaacattccatgctcatggataggaagaatcaatattgtgaaaatg gccatactgcccaaggtaatttatagattcaatgccattcccatcaggaatgactttctt cacagaaatgactttcttcacagaattggaaaaaactactttaaagttcataaggaacca aaaaagagcccacattgccaagtcaatcctaagccaagagaacaaagctggaggcatcac gctacctga >gi568815591f:91841100_92095812|GENSCAN_predicted_peptide_9|2013_aa MEAQRICLSLVYSTHVDQVREYMENEKDKALCSLKEELIFAQEEKIKELQKIHQLELQTM KTQETGDEGKPLHLLIGKLQKAVSEECSYFLQTLCSVLGEYYTPALKCEVNAEDKENSGD YISENEDPELQDYRYEVQDVNHKSKLSSLQDLEKTKLEEQVQELESLISSLQQQLKETEQ NYEAEIHCLQKRLQAVSESTVPPSLPVDSVVITESDAQRTMYPGSCVKKNIDGTIEFSGE FGVKEETNIVKLLEKQYQEQLEEEVAKVIVSMSIAFAQQTELSRISGGKENTASSKQAHA VCQQEQHYFNEMKLSQDQIGFQTFETVDVKFKEEFKPLSKELGEHGKEILLSNSDPHDIP ESKDCVLTISEEMFSKDKTFIVRQSIHDEISVSSMDASRQLMLNEEQLEDMRQELVRQYQ EHQQATELLRQAHMRQMERQREDQEQLQEEIKRLNRQLAQRSSIDNENLVSERERVLLEE LEALKQLSLAGREKLCCELRNSSTQTQNGNENQGEVEEQTFKEKELDRKPEDVPPEILSN ERYALQKANNRLLKILLEVVKTTAAVEETIGRHVLGILDRSSKSQSSASLIWRSEAEASV KSCVHEEHTRDESIPSYSGSDMPRNDINMWSKVTEEGTELSQRLVRSGFAGTEIDPENEE LMLNISSRLQAAVEKLLEAISETSSQLEHAKVTQTELMRESFRQKQEATESLKCQEELRE RLHEESRAREQLAVELSKAEGVIDGYADEKTLFERQIQEKTDIIDRLEQELLCASNRLQE 
LEAEQQQIQEERELLSRQKEAMKAEAGPVEQRLVDAAVDAAPGAELLQETEKLMKEKLEV QCQAEKVRDDLQKQVKALEIDVEEQVSRFIELEQEKNTELMDLRQQNQALEKQLEKMRKF LDEQAIDREHERDVFQQEIQKLEQQLKVVPRFQPISEHQTREVEQLANHLKEKTDKCSEL LLSKEQLQRDIQERNEEIEKLEFRVRELEQALLVSADTFQKVEDRKHFGAVEAKPELSLE VQLQAERDAIDRKEKEITNLEEQLEQFREELENKNEEVQQLHMQLEIQKKESTTRLQELE QENKLFKDDMEKLGLAIKESDAMSTQDQHVLFGKFAQIIQEKEVEIDQLNEQVTKLQQQL KITTDNKVIEEKNELIRDLETQIECLMSDQECVKRNREEEIEQLNEVIEKLQQELANIGQ KTSMNAHSLSEEADSLKHQLDVVIAEKLALEQQVETANEEMTFMKNVLKETNFKMNQLTQ ELFSLKRERESVEKIQSIPENSVNVAIDHLSKDKPELEVVLTEDALKSLENQTYFKSFEE NGKGSIINLETRLLQLESTVSAKDLELTQCYKQIKDMQEQGQFETEMLQKKIVNLQKIVE EKVAAALVSQIQLEAVQEYAKFCQDNQTISSEPERTNIQNLNQLREDELGSDISALTLRI SELESQVVEMHTSLILEKEQVEIAEKNVLEKEKKLLELQKLLEGNEKKQREKEKKRSPQD VEVLKTTTELFHSNEESGFFNELEALRAESVATKAELASYKEKAEKLQEELLVKETNMTS LQKDLSQVRDHLAEAKEKLSILEKEDETEVQESKKACMFEPLPIKLSKSIASQTDGTLKI SSSNQTPQILVKNAGIQINLQSECSSEEVTEIISQFTEKIEKMQELHAAEILDMESRHIS ETETLKREHYVAVQLLKEECGTLKAVIQCLRSKEGLLRAVHNEGMQVLSLTESPYSDGED HSIQQVSEPWLEERKAYINTISSLKDLITKMQLQREAEVYDSSQSHESFSDWRGELLLAL QQVFLEERSVLLAAFRTELTALGTTDAVGLLNCLEQRIQEQGVEYQAAMECLQKADRRSL LSEIQALHAQMNGRKITLKREQESEKPSQELLEYNIQQKQSQMLEMQVELSSMKDRATEL QEQLSSEKMVVAELKSELAQTKLELETTLKAQHKHLKELEAFRLEVKDKTDEVHLLNDTL ASEQKKSRELQWALEKEKAKLGRSEERDKEELE >gi568815591f:91841100_92095812|GENSCAN_predicted_CDS_9|6039_bp atggaagcccaacgcatttgcctctctctggtttattcaactcatgtggatcaggttcgt gaatatatggaaaatgaaaaagataaagctctttgcagtcttaaagaagagcttattttt gctcaagaggaaaagatcaaggaacttcagaaaatacaccagttagaactacagactatg aaaacacaagaaacaggtgatgaaggaaagcctttacatctgctcattggaaaacttcaa aaggcagtgtctgaagaatgttcttattttttacagactttatgcagtgtccttggtgaa tattatactcctgctttaaaatgtgaagtaaatgcagaagacaaagagaattctggtgat tacatttctgaaaatgaagatccagaattacaagattatagatatgaagttcaagatgtc aatcataaaagcaagttatcttctctgcaagatcttgaaaaaactaaacttgaagaacaa gttcaagaattagaaagcctcatatcctctttgcagcaacagttgaaagaaactgaacaa aactatgaggcagagatccactgtttacagaagaggcttcaagctgttagtgagtccacg gttccgccaagcttacctgttgattcggtggtaattacagaatctgatgcacagagaaca 
atgtaccctggaagttgtgtgaaaaagaatattgatggtacaatagagttttctggtgaa tttggagtgaaagaggaaacaaatatcgttaagttgcttgaaaaacaataccaagaacaa ttagaagaagaagtagctaaggttattgtgtcaatgagtatagcatttgctcaacaaact gaactgtctagaatatctgggggaaaagaaaatactgcatcatcaaagcaagcacatgct gtgtgtcagcaagaacaacattattttaatgaaatgaaattatcacaggatcaaattggt tttcagacttttgagacagtggatgtgaaatttaaagaagaatttaaaccacttagtaaa gagttaggagaacatggaaaggaaattttattatcaaatagtgatccccatgatatacca gaatcaaaggactgtgtgctgactatttcagaagaaatgttctccaaagataaaacattt atagttagacagtctattcatgatgagatttcagtgtcaagcatggatgcttctagacaa ctaatgttgaatgaagaacagttggaagatatgagacaggaacttgtacgacaataccaa gaacatcaacaggcaacggaattgttaaggcaagcacatatgcggcaaatggagagacag cgagaagaccaggaacagctacaagaagagattaagagacttaatagacaattagcccag agatcctccatagataatgaaaacctggtttcagagagagagagggtgcttttagaggag ctggaagcactaaagcagctgtctttagctggaagagagaagctgtgttgtgagctgcgc aacagcagtacgcaaacacagaatggaaatgaaaaccaaggagaagttgaagaacaaaca tttaaagaaaaggaattagacagaaaacctgaagatgtgcctcctgagattttgtctaat gaaaggtatgcactccagaaagctaataatagacttttgaagatcctcttagaagttgta aagacaacagcagctgttgaagaaacaattggtcgccatgtccttgggattctagataga tctagtaaaagccagtcatctgccagcctaatttggaggtcagaagcagaggcatctgta aagtcatgtgtccatgaggaacatacaagagatgaatccattccctcttattctggaagt gatatgccaagaaatgacattaacatgtggtcaaaagtaactgaggaaggaacagagctg tcacaacgacttgtgaggagtggttttgctggaactgaaatagaccctgaaaatgaagaa cttatgctgaacattagctctcgactacaagcagcagttgaaaaactcctagaagccata agtgaaactagcagtcagcttgaacatgcgaaagtgacacagacagagttgatgcgtgag tcatttagacagaaacaagaagcaacagagtcccttaagtgccaagaggaacttcgagag cgccttcatgaggagtccagggccagagaacagctagctgtggagctcagtaaggctgag ggcgtcattgatggctatgcagatgaaaaaactctttttgaaaggcaaattcaggaaaaa actgatataatagatcgtcttgagcaggagttgttatgtgcaagtaacaggttgcaagaa ttggaggcagagcaacagcagatccaagaagaaagagaattactgtccagacaaaaggaa gctatgaaagcagaggcaggcccagttgaacaacgactagtagatgctgcagtcgatgca gcaccaggagcagaattactacaggagacagaaaaattaatgaaggaaaaactagaagta caatgtcaagctgaaaaagtacgtgatgaccttcaaaaacaagtgaaagctctagaaata 
gatgtggaagaacaagtcagtaggtttatagagctggaacaagaaaaaaatactgaacta atggatttaagacagcaaaaccaagcattggaaaagcagttagaaaaaatgagaaaattt ttagatgagcaagccattgacagagaacatgagagagatgtattccaacaggaaatacag aaactagaacagcaacttaaggttgttcctcgattccagcctatcagtgaacatcaaact agagaggttgaacagttagcaaatcatctgaaagaaaaaacagacaaatgcagtgagctt ttgctctctaaagagcagcttcaaagggatatacaagaaaggaatgaagaaatagagaaa ctggagttcagagtaagagaactggagcaggcgcttcttgtgagtgcagatacttttcaa aaggtagaggaccgaaaacactttggagctgtagaagctaaaccagaattgtccctagaa gtacaattgcaggctgaacgagatgccatagacagaaaggaaaaagagattacaaactta gaagagcaattagaacagtttagagaagaactggaaaataagaatgaagaagttcaacaa ttacatatgcaattagaaatacagaaaaaggaatctactacccgcctacaagaacttgaa caggaaaacaaattatttaaggatgacatggagaaactgggacttgccataaaggaatct gatgccatgtctactcaagaccaacatgtgctatttgggaaatttgctcaaataatacag gaaaaagaggtagaaattgaccaattaaatgaacaagttacgaaactccagcagcaactt aaaattacaacagataacaaggttattgaagaaaaaaatgaactgataagggatcttgaa acccaaatagaatgtttgatgagtgatcaagaatgtgtgaagagaaatagagaagaagaa atagagcagctcaatgaagtgattgaaaaacttcaacaggaattggcaaatattggacag aagacatcaatgaatgctcattccctctcagaagaagcagacagtttaaaacatcaattg gatgtggttatagctgaaaagctggccttggaacagcaagtagaaaccgctaatgaagaa atgaccttcatgaaaaatgtacttaaagaaaccaattttaaaatgaatcagctaacacag gaattattcagcttaaagagagaacgtgaaagtgtggaaaagattcaaagcataccagag aatagtgttaacgtggctatagatcatctgagcaaagacaaacctgaactagaagtagtc cttacagaggatgctcttaaatccctagaaaatcagacatacttcaaatcttttgaagaa aatggcaaaggttccataattaatttggaaacaaggttgctacaacttgagagcactgtt agtgcaaaggacttagaacttacccagtgttataaacaaataaaagacatgcaagaacaa ggccagtttgaaacagaaatgcttcaaaagaagattgtaaacctacagaaaatagttgaa gaaaaagtggctgctgctcttgtcagtcaaatccaacttgaggcagttcaggaatatgca aaattctgtcaagataatcaaacaatttcatcagaacctgaaagaacaaatattcagaat ttaaatcaactaagagaagatgagttggggtcagatatatcagcattaaccttgagaata tcagaattagaaagccaggttgttgaaatgcatactagtttgattttagaaaaagaacaa gtagaaattgcagaaaaaaatgttttagaaaaagaaaagaagctgctagaactacagaag ctattggagggcaatgagaaaaaacagagagagaaagaaaagaaaagaagccctcaagat 
gttgaagttctcaagacaactactgagctatttcatagcaatgaagaaagtggatttttt aatgaactcgaggctcttagagctgaatcagtggctaccaaagcagaacttgccagttat aaagaaaaggctgaaaaacttcaagaagagcttttggtaaaagaaacaaatatgacatct cttcagaaagacttaagccaagttagggatcacctcgcagaggcaaaagagaaattgtcc attttagaaaaagaagatgagactgaggtacaagaaagcaaaaaggcctgcatgtttgag ccacttcctataaaactgagtaagagcattgcatcccagacagatgggactctgaagatc agtagcagcaatcagactccacaaattcttgttaaaaatgcaggaatacaaattaattta cagagtgaatgttcctcagaagaagttactgaaataatcagtcagtttactgaaaaaatt gagaagatgcaagaactacatgctgctgaaattttggacatggaatccagacatatttca gaaactgaaaccttaaagagggaacactatgttgccgttcagttactgaaagaggaatgt ggtaccttgaaggcagtgatacagtgtctgagaagtaaagagggattactgagagctgtc cataatgaaggcatgcaggtgctttctctcactgagtctccctatagtgatggagaggac cattctattcagcaggtttcagaaccttggctagaagagagaaaagcttacatcaataca atctcatctctaaaggatttaattacaaagatgcaactgcaaagagaagccgaggtttat gatagttctcaatctcatgagagcttctcagactggcgaggtgaactactgcttgccctt caacaagttttcttagaagagcgtagtgttttactagcagcatttcggacggagctgaca gctctaggtactacagatgcagttggtttactaaactgtttggaacagagaatacaagaa cagggtgttgaatatcaagcagctatggaatgcctccagaaagcagatagaaggagtttg ttatctgaaattcaggcactgcatgcacaaatgaatggtaggaaaattactctgaaaaga gaacaagagagtgagaaaccaagccaagaactcttggaatataatatacagcagaagcag tctcaaatgctggagatgcaagtggagctcagcagtatgaaagacagagcaacggaactg caggagcagctgagttctgagaaaatggtggttgctgaactgaagagtgagcttgcacaa actaaattggaactagaaacaacactcaaggcacagcataaacacctaaaagaattggag gctttcaggttggaagttaaagataagacagatgaagtacatttgcttaatgacacatta gcaagtgaacagaaaaaatcaagagagctccagtgggctttggagaaagagaaagccaag ttgggacgcagtgaagaacgggataaagaagaacttgag