GENSCAN 1.0 Date run: 6-Nov-116 Time: 19:22:05 Sequence gi568815597r:40201185_40417197 : 216013 bp : 43.22% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 12304 12343 40 -2.56 1.01 Init + 18747 18786 40 2 1 84 80 23 0.308 1.45 1.02 Intr + 22037 22062 26 0 2 92 98 22 0.458 1.34 1.03 Term + 34608 39263 4656 2 0 99 42 839 0.629 65.87 1.04 PlyA + 39715 39720 6 1.05 2.00 Prom + 49501 49540 40 -6.26 2.01 Init + 57088 57210 123 0 0 99 99 126 0.945 14.98 2.02 Intr + 66602 66688 87 1 0 74 87 64 0.962 5.07 2.03 Intr + 67235 67351 117 1 0 83 75 34 0.818 2.26 2.04 Intr + 70710 70851 142 2 1 85 116 107 0.970 13.13 2.05 Intr + 74955 75045 91 1 1 21 93 37 0.002 -3.45 2.06 Intr + 78936 79132 197 0 2 58 -12 193 0.010 5.36 2.07 Intr + 80159 80343 185 0 2 97 87 143 0.945 14.61 2.08 Intr + 84741 84845 105 2 0 93 111 45 0.963 7.71 2.09 Term + 91261 91485 225 0 0 115 50 12 0.271 -3.12 2.10 PlyA + 92801 92806 6 1.05 3.32 PlyA - 92840 92835 6 1.05 3.31 Term - 100197 99998 200 1 2 128 53 125 0.996 10.56 3.30 Intr - 100705 100628 78 2 0 105 115 19 0.982 5.82 3.29 Intr - 101625 101437 189 1 0 127 96 191 0.999 23.36 3.28 Intr - 102001 101947 55 1 1 59 100 130 0.748 9.85 3.27 Intr - 102492 102346 147 0 0 112 105 139 0.999 18.43 3.26 Intr - 102655 102623 33 1 0 119 109 24 0.983 6.02 3.25 Intr - 102788 102744 45 2 0 93 105 44 0.959 5.21 3.24 Intr - 102915 102880 36 0 0 93 89 19 0.640 0.96 3.23 Intr - 103207 103136 72 1 0 124 45 68 0.919 5.70 3.22 Intr - 103345 103292 54 1 0 118 109 -17 0.855 2.58 3.21 Intr - 103663 103610 54 1 0 94 115 56 0.662 8.08 3.20 Intr - 104584 104531 54 1 0 116 84 -4 0.690 1.18 3.19 Intr - 106315 106262 54 1 0 83 115 6 0.796 1.98 3.18 Intr - 106572 106519 54 0 0 88 97 57 0.981 5.78 3.17 Intr - 107061 107008 54 0 0 133 90 41 0.996 7.98 3.16 Intr - 108807 108754 54 0 0 90 105 0 0.604 1.08 3.15 Intr - 108980 108927 54 2 0 147 119 18 0.994 9.98 3.14 Intr - 109133 109080 54 2 0 102 127 31 0.993 7.58 3.13 Intr - 109583 109530 54 2 0 104 107 40 0.990 6.68 3.12 Intr - 109962 109909 54 0 0 98 117 31 0.986 6.18 3.11 Intr - 110102 110046 57 2 0 69 100 27 0.623 1.08 3.10 Intr - 110363 110316 48 2 0 91 85 15 0.543 0.38 3.09 Intr - 110531 110478 54 2 0 109 95 50 0.984 6.98 3.08 Intr - 110928 110875 54 0 0 118 105 54 0.992 9.28 3.07 Intr - 111295 111272 24 1 0 113 95 19 0.909 3.32 3.06 Intr - 111425 111390 36 2 0 123 117 9 0.981 5.76 3.05 Intr - 111600 111547 54 0 0 118 116 44 0.998 9.38 3.04 Intr - 113083 113021 63 1 0 104 76 47 0.938 4.01 3.03 Intr - 113203 113168 36 1 0 77 117 40 0.947 4.26 3.02 Intr - 114480 114406 75 0 0 95 105 67 0.738 8.81 3.01 Init - 116013 115939 75 0 0 104 87 142 0.717 14.79 3.00 Prom - 119391 119352 40 -3.96 4.02 PlyA - 119526 119521 6 1.05 4.01 Sngl - 132290 131892 399 2 0 54 38 304 0.607 18.26 4.00 Prom - 152541 152502 40 -3.06 5.00 Prom + 157828 157867 40 -6.66 5.01 Init + 163776 163987 212 2 2 80 3 199 0.043 8.86 5.02 Intr + 164536 165272 737 1 2 49 58 518 0.005 35.98 5.03 Intr + 166856 169456 2601 0 0 28 47 572 0.009 34.46 5.04 Intr + 172695 173039 345 1 0 57 69 232 0.118 13.56 5.05 Intr + 183815 183917 103 0 1 130 66 68 0.899 8.03 5.06 Intr + 186975 187138 164 0 2 68 44 99 0.258 3.12 5.07 Intr + 205552 205685 134 2 2 91 80 75 0.611 7.36 5.08 Intr + 211832 211918 87 1 0 67 115 48 0.890 5.57 5.09 Intr + 212981 213056 76 1 1 36 91 64 0.540 0.59 5.10 Intr + 214088 214197 110 0 2 92 113 42 0.994 7.10 5.11 Intr + 214992 215157 166 2 1 86 71 75 0.865 5.13 5.12 Intr + 215596 215912 317 2 2 95 115 166 0.961 15.78 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 79202 78942 261 2 0 67 48 316 0.936 20.66 S.002 Sngl + 163776 164000 225 2 0 80 46 200 0.911 10.14 S.003 Sngl + 164561 165577 1017 1 0 88 43 564 0.914 48.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:40201185_40417197|GENSCAN_predicted_peptide_1|1573_aa MEIMIIEKQKYWPEYLVAKKETAQDAGLGVSILLCVRALQLRSSEDEEMKASVCKTIACL LPEDLEVRRACQLTEFLIEPSLDGFNMLEELYLQPDQKFDEENAPVPNSLRCELLLALKA HWPFDPEFWDWKTLKRHCHQLLGQEASDSDDDLSGYEMSINDTDVLESFLSDYDEGKEDK QYRRRDLTDQHKEKRDKKPIGSSERYQRWLQYKFFCLLCKRECIEARILHHSKMHMEDGI YTCPVCIKKFKRKEMFVPHVMEHVKMPPSRRDRSKKKLLLKGSQKGICPKSPSAIPEQNH SLNDQAKGESHEYVTFSKLEDCHLQDRDLYPCPGTDCSRVFKQFKYLSVHLKAEHQNNDE NAKHYLDMKNRREKCTYCRRHFMSAFHLREHEQVHCGPQPYMCVSIDCYARFGSVNELLN HKQKHDDLRYKCELNGCNIVFSDLGQLYHHEAQHFRDASYTCNFLGCKKFYYSKIEYQNH LSMHNVENSNGDIKKSVKLEESATGEKQDCINQPHLLNQTDKSHLPEDLFCAESANSQID TETAENLKENSDSNSSDQLSHSSSASMNEELIDTLDHSETMQDVLLSNEKVFGPSSLKEK CSSMAVCFDGTKFTCGFDGCGSTYKNARGMQKHLRKVHPYHFKPKKIKTKDLFPSLGNEH NQTTEKLDAEPKPCSDTNSDSPDEGLDHNIHIKCKREHQGYSSESSICASKRPCTEDTML ELLLRLKHLSLKNSITHGSFSGSLQGYPSSGAKSLQSVSSISDLNFQNQDENMPSQYLAQ LAAKPFFCELQGCKYEFVTREALLMHYLKKHNYSKEKVLQLTMFQHRYSPFQCHICQRSF TRKTHLRIHYKNKHQIGSDRATHKLLDNEKCDHEGPCSVDRLKGDCSAELGGDPSSNSEK PHCHPKKDECSSETDLESSCEETESKTSDISSPIGSHREEQEGREGRGSRRTVAKGNLCY ILNKYHKPFHCIHKTCNSSFTNLKGLIRHYRTVHQYNKEQLCLEKDKARTKRELVKCKKI FACKYKECNKRFLCSKALAKHCSDSHNLDHIEEPKVLSEAGSAARFSCNQPQCPAVFYTF NKLKHHLMEQHNIEGEIHSDYEIHCDLNGCGQIFTHRSNYSQHVYYRHKDYYDDLFRSQK VANERLLRSEKVCQTADTQGHEHQTTRRSFNAKSKKCGLIKEKKAPISFKTRAEALHMCV EHSEHTQYPCMVQGCLSVVKLESSIVRHYKRTHQMSSAYLEQQMENLVVCVKYGTKIKEE PPSEADPCIKKEENRSCESERTEHSHSPGDSSAPIQNTDCCHSSERDGGQKGCIESSSVF DADTLLYRGTLKCNHSSKTTSLEQCNIVQPPPPCKIENSIPNPNGTESGTYFTSFQLPLP RIKESETRQHSSGQENTVKNPTHVPKENFRKHSQPRSFDLKTYKPMGFESSFLKFIQESE EKEDDFDDWEPSEHLTLSNSSQSSNDLTGNVVANNMVNDSEPEVDIPHSSSDSTIHENLT AIPPLIVAETTTVPSLENLRVVLDKALTDCGELALKQLHYLRPVVVLERSKFSTPILDLF PTKKTDELCVGSS >gi568815597r:40201185_40417197|GENSCAN_predicted_CDS_1|4722_bp atggagataatgataatagaaaaacagaagtactggccagaatacttagttgctaaaaag gaaacagcacaagatgctggtcttggggtgtcaattttactgtgtgtcagagctcttcaa ctcagatcaagtgaagatgaggaaatgaaggcatcagtttgtaaaacaattgcctgtctt ttaccagaagatttagaagttagacgagcctgtcagcttacagaattcttaattgaaccc agtttggatggatttaatatgttagaagaactatatttgcaaccagatcaaaaatttgat gaagaaaatgcaccggttccaaattctcttcgatgtgagctcttactagctttaaaagcc cactggccttttgatcctgagttttgggactggaaaactttaaaacgacactgccaccaa cttttaggacaagaagcctcagattctgatgatgatttaagtggctatgaaatgtccatt aatgacacagatgttttagagtcatttctcagtgactatgatgagggtaaagaagataaa caatatagaagaagagatttgacagatcagcataaggagaaaagagacaaaaaacctatt ggctcttctgaaagatatcagaggtggcttcagtacaagtttttctgtttgttatgtaag cgggaatgtatagaggctagaattcttcatcattctaagatgcatatggaagatggaatt tacacctgtccagtttgtattaaaaaatttaagagaaaagaaatgtttgttcctcatgtg atggagcatgttaaaatgccaccaagcagaagggaccgctctaaaaagaaattactgtta aaaggctctcaaaagggtatttgtcctaagagcccctctgcaatcccagagcaaaaccat tcattgaatgaccaagccaaaggagagtctcatgaatatgtcacattcagcaaattagaa gattgccacctgcaagacagagatttgtatccatgtcccggtacagactgttcccgtgtg tttaagcaatttaaatacttaagtgtgcatcttaaagctgaacaccaaaataatgatgaa aatgccaagcactacttggatatgaaaaatagaagagagaagtgtacttactgtcgacga cattttatgtctgcttttcaccttcgagagcacgaacaagtgcattgtgggcctcagcct tatatgtgtgtatctatagattgctatgctaggtttggatcagtaaatgaactacttaac cataaacaaaagcatgacgatctgcgttacaaatgtgaattaaatggctgtaatattgtt ttcagtgacttgggacagctttaccaccatgaagcacaacactttagggatgcatcttac acatgcaacttccttggctgtaaaaagttctattactccaaaattgaataccagaatcac ctctcaatgcataatgttgaaaattcaaatggagacataaagaaatcagtgaaacttgag gagtctgcaacaggtgaaaagcaagattgtattaatcagccccatctacttaaccaaact gataaatcacatttacctgaagatcttttctgtgcagaatcagctaattctcaaatagat acagaaactgcagaaaacctgaaagaaaacagtgacagtaattctagtgatcagttaagt catagctcttcagcttcaatgaatgaagagctaattgacacactagatcactctgaaact atgcaggatgtattgttatctaatgagaaagtctttgggccctccagtttaaaagaaaaa tgttccagtatggcagtttgttttgacgggactaagtttacctgtggttttgatggctgt ggttccacatacaaaaatgcaagaggaatgcagaaacatttacggaaggttcatccatac catttcaagcccaaaaagataaagacgaaagatctgtttccctctttgggtaatgaacat aatcagacaactgaaaagttggatgcagaacctaaaccctgctcagatacaaacagtgac tccccagatgaaggtctagatcacaatattcacattaaatgtaaacgagaacatcaaggt tattcctcagaatcctccatttgtgcttctaaaaggccctgtacagaggataccatgttg gaacttctgttacgcttgaaacatttaagcttgaaaaactcaataacacatggatctttc tcagggtcattgcaggggtacccatccagtggtgctaagtctcttcagtcagtttcatct atctcagaccttaattttcagaatcaagatgaaaacatgccaagtcagtaccttgcacag ttggcggctaagccgtttttctgtgagcttcaaggatgcaaatatgaatttgtgaccaga gaggctctgttaatgcattatcttaaaaagcataattattcaaaagaaaaagtccttcag ttaaccatgttccaacatcggtattccccatttcagtgtcatatttgccaaaggtcattt acaagaaaaacacaccttaggattcattataaaaataaacatcaaattggcagtgacaga gcaactcacaaactattagataatgaaaagtgtgatcatgaaggcccatgttcagtagat aggttgaaaggtgattgttctgcagaacttggaggtgatcccagtagtaactctgagaaa ccacactgtcatcctaaaaaggatgaatgtagttctgaaacagatttggaatcatcttgt gaagaaacagaaagtaaaacatctgacatttcatcaccaataggcagccatagagaagaa caagaaggaagagagggcagaggtagcaggcgaactgttgctaaaggaaatctgtgttat attttgaataaataccacaaaccattccattgtattcataaaacttgcaactcctcattc accaatctaaaaggcttaattcgccattacagaactgtacatcagtacaacaaagaacag ttatgtttggagaaagacaaagcaagaaccaaaagggaacttgtcaaatgtaaaaagata tttgcttgcaaatataaggaatgtaataaacgcttcctgtgttccaaagctcttgctaag cactgtagtgattctcataacctagaccatattgaagagcctaaagtactttccgaagct ggatctgcagcaaggttttcttgtaaccagcctcagtgccctgctgttttttatacattc aacaagttgaagcaccacttgatggaacagcataatattgaaggggaaatacattcagat tatgaaattcattgtgatcttaatggctgtggccagattttcacccatcgcagtaattac tcacaacatgtatattaccgacataaagactattatgatgatttgtttagaagccagaaa gtagcaaatgagagactactaaggagtgaaaaggtatgtcaaacagctgatactcagggg catgaacatcagaccaccaggagatcatttaatgctaagtctaaaaaatgtggcttaatc aaagaaaagaaagccccaataagttttaaaaccagagctgaggccctccatatgtgtgtg gagcactctgagcacacacagtacccctgcatggttcaaggatgcttatctgtggtgaag ttggagagcagcattgtgaggcattacaaacgcactcatcagatgagtagtgcctattta gagcaacagatggagaatcttgttgtttgcgttaagtacggtaccaaaattaaggaggaa cccccttctgaagcagatccctgtataaagaaagaagaaaatagaagctgtgaatcagag cgcacagaacacagccattccccgggtgacagtagtgcacccatccagaacactgattgc tgtcattcaagtgaaagggatggaggtcagaaagggtgcatagaaagcagctcagtattt gatgcagatactctgctctacaggggaactttgaaatgtaatcatagttccaaaaccact tccctagaacagtgtaatatagttcagcctcctcctccttgtaaaatagaaaattccata cctaatcccaatgggactgaaagtgggacttatttcacaagtttccagctgcctttacca aggatcaaagaatcagaaactaggcagcatagttcagggcaagaaaacactgtaaaaaat ccaacccatgtcccaaaagagaattttaggaaacattcacagccccggtcatttgatttg aagacttacaaacctatgggatttgaatcttcatttctgaaatttattcaggaaagtgaa gagaaagaagatgattttgatgattgggagccttcagagcacttaacattaagtaattct tcacagtccagtaatgatttaacagggaatgttgtggcaaataatatggtgaatgacagt gaacctgaagttgacatacctcattcttccagtgactctacaattcatgagaacctgact gcaatcccacctttaatagtagctgaaacaacaacagttccttccttggaaaacctgagg gttgtattggacaaagcattaacagactgtggagagcttgccttaaaacagcttcattat cttcggccagtggtggttcttgaaagatctaagttttccacaccaattttagacttattt ccaacaaaaaagacagatgagctttgtgtaggaagttcataa >gi568815597r:40201185_40417197|GENSCAN_predicted_peptide_2|423_aa MGMWASLDALWEMPAEKRIFGAVLLFSWTVYLWETFLAQRQLILLFGGIPYLWRLSGRFC GYAGFGPEYEITQSLVFLLLATLFSALTGLPWSLYNTFVIEEKHGFNQQVLVTIYADYIA PLFDKFTPLPEGKLKEEIEVMAKSIDFPLTKVYVVEGDSNIGQEKRLHTRIQTTNKSIDI PVEMSDSLLQMPNDRRMVDIEFFGNFLCSCKGISSDDCSPLVIVNFRWPATTLLIFKALI SFANFLNHHCAVRSKRSSHSNAYFYGFFKNKRIVLFDTLLEEYSVLNKDIQEDSGMEPRN EEEGNSEEIKAKVKNKKQGCKNEEVLAVLGHELGHWKLGHTVKNIIISQVLSFCLTVLSR RFEFQADAFAKKLGKAKDLYSALIKLNKDNLGFPVSDWLFSMWHYSHPPLLERLQALKTM KQH >gi568815597r:40201185_40417197|GENSCAN_predicted_CDS_2|1272_bp atggggatgtgggcatcgctggacgctttgtgggagatgccggccgagaagcgtatcttc ggggccgtgctgctcttttcctggacagtgtatctttgggagaccttcctagcacagcgg cagcttattcttctctttggaggaataccttatctctggagactttctggacggttctgt ggttatgctggctttggaccagaatatgagatcactcagtccctggtgtttctgctgttg gctacacttttcagtgcattgactggtttgccatggagtctttataatacttttgtgata gaagaaaaacatggcttcaatcaacaggttcttgtcacaatctatgctgattatattgcc cctttatttgacaaattcacacctctgcctgagggaaagcttaaagaagaaattgaagta atggcaaagagtattgactttcctttgacgaaggtgtatgttgtggaaggagattctaat attggacaagaaaaaagactacacacaagaatacaaacaacaaataagtctatagacatc cctgtggagatgtcagacagtttgcttcaaatgccaaacgaccgtagaatggtcgacatt gagttcttcggcaacttcttgtgtagttgtaaggggatcagctccgatgattgctctcca ttggtcattgtcaacttccgatggcctgccactacgctcctcatcttcaaggctctcatc tcctttgcaaacttcttgaaccaccactgcgctgtacgatctaaacgctcttcccacagc aatgcttatttttatggcttcttcaagaacaagcgaatagttttgtttgacactctacta gaagagtactctgtactaaacaaagacatccaggaggattctggcatggaaccccgcaat gaggaagaagggaacagtgaagaaataaaagctaaagttaaaaataagaaacaaggatgt aaaaatgaggaggtactcgctgtactaggccatgaactggggcactggaagttgggacat acagtcaaaaatatcattattagccaggttctttctttttgcctaacagtcctaagccgc agatttgagtttcaagctgatgcatttgccaagaaacttgggaaggctaaagacttatat tctgctttaatcaaacttaacaaagataacttgggattccctgtttctgactggttgttc tcaatgtggcattattctcatcctccactgctagagagacttcaagctttgaaaactatg aagcaacactga >gi568815597r:40201185_40417197|GENSCAN_predicted_peptide_3|674_aa MAAATASPRSLLVLLQVVVLALAQIRGPPGERGPPGPPGPPGVPGSDGIDGDNGPPGKAG PPGPKGEPGKAGPDGPDGKPGIDGLTGAKGEPGPMGIPGVKGQPGLPGPPGLPGPGFAGP PGPPGPVGLPGEIGIRGPKGDPGPDGPSGPPGPPGKPGRPGTIQGLEGSADFLCPTNCPP GMKGPPGLQGVKGHAGKRGILGDPGHQGKPGPKGDVGASGEQGIPGPPGPQGIRGYPGMA GPKGETGPHGYKGMVGAIGATGPPGEEGPRGPPGRAGEKGDEGSPGIRGPQGITGPKGAT GPPGINGKDGTPGTPGMKGSAGQAGQPGSPGHQGLAGEPGPQGLPGFSGPPGKEGEPGPR GEIGPQGIMGQKGDQGERGPVGQPGPQGRQGPKGEQGPPGIPGPQGLPGVKGDKGSPGKT GPRGKVGDPGVAGLPGEKGEKGESGEPGPKGQQGVRGEPGYPGPSGDAGAPGVQGYPGPP GPRGLAGNRGVPGQPGRQGVEGRDATDQHIVDVALKMLQEQLAEVAVSAKREALGAVGMM GPPGPPGPPGYPGKQGPHGHPGPRGVPGIVGAVGQIGNTGPKGKRGEKGDPGEVGRGHPG MPGPPGIPGLPGRPGQAINGKDGDRGSPGAPGEAGRPGLPGPVGLPGFCEPAACLGASAY ASARLTEPGSIKGP >gi568815597r:40201185_40417197|GENSCAN_predicted_CDS_3|2025_bp atggccgccgctacggcctccccccgcagcctccttgttctcctccaggtggtagtgctc gctctggcgcagattagaggtccaccgggagagcggggccccccgggtcccccgggaccg ccgggagtgcctggatccgacggcatcgacggtgacaatgggccccctggaaaagctggc cctccgggacccaagggcgagcctggcaaagctgggccagatgggccagacgggaagccc gggattgatggtttaactggagccaagggggagcctggccccatggggatccctggagtc aagggccagcccgggcttcctggtcctcctggccttccgggccctggttttgctggacct cctgggcctcctggacctgttggcctccctggtgagattggaatccgaggccccaagggg gaccctggaccagatggaccatcggggcccccaggaccccctgggaaacctggtcgcccg ggaaccatccagggtctggaaggcagtgcggatttcctgtgtccaaccaactgtccaccc ggaatgaaaggtcccccagggctgcagggagtgaaggggcatgcgggcaaacgcgggatt ctgggtgatcctggccaccaggggaagccgggtcccaagggagatgtgggtgcctctgga gagcaaggcatccctggaccaccgggtccccagggcatcaggggctacccaggcatggca gggcccaagggagagacgggccctcatggatataaaggcatggtgggcgctatcggtgcc actgggccaccgggtgaggaaggtcctaggggaccgccaggccgagctggggagaagggt gacgagggcagcccaggtattcgtggaccccaggggatcacaggcccgaaaggagcaacg ggccccccaggcatcaacggcaaggatgggaccccaggcacgcctggcatgaagggcagt gcaggacaggcgggacagcccggaagtccaggccaccagggcctagcgggtgagccgggc ccgcagggccttcctggattctctggtccccctgggaaagagggagagccagggcctcga ggagaaattggtccccagggcatcatgggacagaagggtgaccaaggcgagaggggtcca gtggggcaaccaggccctcagggaaggcagggccctaagggggagcagggcccccccgga attccagggccccaaggcttgccaggcgtcaaaggagacaagggctccccagggaagacc gggccccgcggcaaagtgggtgacccaggggtggccggcctccccggagagaaaggcgag aagggcgagtccggcgagccggggcccaagggacagcaaggagtacgtggagaacccggc taccctggccccagcggggatgcgggcgccccaggggttcagggctaccctggtcccccc ggccctcgaggactggccgggaaccgaggcgtgccaggacagcccgggagacagggcgtg gagggccgggatgccactgaccagcacatcgtggatgtggcgctgaagatgctgcaagag caactggcagaggtcgccgtgagtgccaagcgggaagccctgggtgcggtgggcatgatg ggtcctccaggacctcctgggccccctgggtacccaggcaagcagggcccccatgggcac cctggccctcggggcgttcctggcatcgtgggagccgtgggtcagatcggcaacacgggg cccaagggaaaacgtggagagaagggtgatccaggagaagtgggacgggggcaccccggg atgcctgggcccccagggatcccaggactccctggccggcctggccaggcaatcaacggc aaggatggagatcgagggtccccaggggctccaggagaggcaggtcgacctggcctgcca ggccccgtggggctgccgggcttctgtgaacctgccgcctgccttggagcttcggcctat gcctctgcccgccttacagagcctggatccatcaaggggccttga >gi568815597r:40201185_40417197|GENSCAN_predicted_peptide_4|132_aa MYMRIYKKGDIVDIKGMGTVQKGMSHKCYHGKTGRVYNVPQHAVGIVVNKQVKGKILAKR INVCIEHIKHSKSRDSFLKRVKENDQKKKEAKEKVTWVQRKHQPAPPREAQCVRTNGKEP ELLETIPYEFMA >gi568815597r:40201185_40417197|GENSCAN_predicted_CDS_4|399_bp atgtatatgcgaatctataagaaaggtgatattgtagacatcaagggaatgggtactgtt caaaaaggaatgtcccacaagtgttaccatggcaaaactggaagagtctacaatgttccc cagcatgctgttggcattgttgtaaacaaacaagttaagggcaagattcttgccaagaga attaatgtgtgtattgagcacattaagcactctaaaagccgagatagcttcctgaaacgt gtgaaggaaaatgatcagaaaaagaaagaagccaaagagaaagttacctgggttcaacgg aagcaccagcctgctccacccagagaagcacagtgtgtgagaaccaatgggaaggagcct gagctgctggaaactattccctatgaattcatggcataa >gi568815597r:40201185_40417197|GENSCAN_predicted_peptide_5|1684_aa MVAKVDEVKSTTQFQMKKVLCLAVAVGHVKMTDDELVHNIHLAVNFLVSLLKKNCQDIQA LYTESTVGKPTPKVDKTTKMGKKQNRKTGSSKNQSASPPPKERSSSPATEQSWMENDFDE LREEGFRRSNYSELREDIQTKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELR EECRSLRSRCDQLEERVSAMEDEMNEMKREGKFRERRVKRNEQSLQEIWDYVKRPNLHLI GVPESDGENGTKLENTLQDIIQENFPNLARQANVQIQEIQRTPQRYSSRRATPRHIIVRF TKVEMKEKMLRAAREKEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESL NRPITGAEIVAIINSLPTKKSPGLDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSF YEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPG MQGWFNIRKSINVIQHINRTKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKI IRAIYDKPIANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQ LGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQT ESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGR INIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRAHIAKSILSQKNKAGG ITLPDFKLYYKATITKTAWYWYQNRDIDQWNRTEPSEITPHTYNCLIFDKPEKNKKWGKD SLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQNIG MGKDFMSKTPKAMATKDKIDKWDLIKLKSFCTAKETTIRVNRQPTKWEKIFATYSSDKGL ISRIYNELNQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTT MRYHLTPVRMAIIKKSGNNRCWRGCGEIGTLLHCWWDCKLVQPLWKSVWRFLRDLELEIP FDPAIPLLGIYPNDYKSCCYKDTCTRMFIAALFTIAKTWNQPKCPTMIDWIKKMWHIYTM EYYAAIKNDEFMSFVGTWMKLEIIILSKLSQEQKTKHRIFSLIASLGGGPVPKPAKGRRA EGLRSGGRTPPTREGASGGAGGEGSPRSGGAPGRGTGSPQPRTEAGVSGGEEGASPSAPA GTLAMTGKSVKDVDRYQAVLANLLLEEDNKFCADCQSKGFVNAGPLMAELQVSPQWKAPE MSQICLSCGHPSALSLPNETAPCSTAPGPINHPRAEVCGRTARDWRAAPPAAPVRDPLDE ASWAPESGPRWASWNIGVFICIRCAGIHRNLGVHISRVKSVNLDQWTQEQIQKEKDDKWK RGSEPVPEKKLEPVVFEKVKMKKEDPQLPRKSSPKSTAPVMDLLGLDAPVACSIANSKTS NTLEKDLDLLASVPSPSSSGSRKVVGSMPTAGSAGSVPENLNLFPEPGSKSEEIGKKQLS KDSILSLYGSQTPQMPTQAMFMAPAQMAYPTAYPSFPGVTPPNSIMGSMMPPPVGMVAQP GASGMVAPMAMPAGYMGGMQASMMGVPNGMMTTQQAGYMAGMAAMPQTVYGVQPAQQLQW NLTQ >gi568815597r:40201185_40417197|GENSCAN_predicted_CDS_5|5052_bp atggtggccaaagtggatgaggtgaagtccacaacccagtttcagatgaagaaggtgcta tgtctggctgtggctgttggccatgtgaagatgacagacgatgagcttgtacataacatt cacctggctgtcaacttcctagtgtcattactcaagaaaaattgccaagacattcaagct ttatatactgagagcaccgtgggcaagcccacaccaaaagtagataaaaccacaaagatg gggaaaaaacagaacagaaaaactggaagctccaaaaatcagagcgcctctcctcctcca aaggaacgcagctcctcaccagcaacagaacaaagctggatggagaatgactttgacgag ctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaaacc aaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaataacc aatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaactacgt gaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtgtcagcaatg gaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaagaagagtaaaaaga aatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacatctgatt ggtgtacctgaaagtgatggggagaatggaaccaagttggaaaacactctgcaggatatt atccaggagaacttccccaatttagcaaggcaggccaacgttcagattcaggaaatacag agaacgccacaaagatactcctcgagaagagcaactccaagacacataattgtcagattc accaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaagaaatacaaact accatcagagaatactacaaacacctctacgcaaataaactagaaaatctagaagaaatg gataaattcctcgacacatacactctcccaagactaaaccaggaagaagttgaatctctt aatagaccaataacaggagctgaaattgtggcaataatcaatagcttaccaaccaaaaag agtccaggactagatggattcacagccgaattctaccagaggtacaaggaggaactggta ccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactcattt tatgaggccagcatcattctgataccaaagccgggcagagacacaacaaaaaaagagaat tttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggcaaac cgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccctggg atgcaaggctggttcaatatacgcaaatcaataaatgtaatccagcatataaacagaacc aaagacaaaaaccacatgattatctcaatagatgcagaaaaagcctttgacaaaattcaa caacccttcatgctaaaaactctcaataaattaggtattgatgggacgtatttcaaaata ataagagctatctatgacaaacccatagccaatatcatactgaatgggcaaaaactggaa gcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattcaac atagtgttggaagttctggccagggcaattaggcaggagaaggaaataaagggtattcaa ttaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatctagaa aaccccattgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctca ggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaacaacagacaaaca gagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaataccta ggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctc aaggaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaaga atcaatattgtgaaaatggccatactgcccaaggtaatttacagattcaatgccatcccc atcaagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcatatgg aaccaaaaaagagcccacatcgccaagtcaatcctaagccaaaagaacaaagctggaggc atcacactacctgacttcaaactttactacaaggctacaataaccaaaacagcatggtac tggtaccaaaacagagatatagatcaatggaacagaacagagccctcagaaataacgccg catacctacaactgtctgatctttgacaaacctgagaaaaacaagaaatggggaaaggat tccctatttaataaatggtgctgggaaaactggctagccatatgtagaaagctgaaactg gatcccttccttacaccttatacaaaaatcaattcaagatggattaaagatttaaacgtt agacctaaaaccataaaaaccctagaagaaaacctaggcattaccattcagaacataggc atgggcaaggacttcatgtccaaaacaccaaaagcaatggcaacaaaagacaaaattgac aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg aacaggcaacctacaaaatgggagaaaattttcgcaacctactcatctgacaaagggcta atatccagaatctacaatgaactcaaccaaatttacaagaaaaaaacaaacaaccccatc aaaaagtgggcaaaggacatgaacagacacttctcaaaagaagacatttatgcagccaaa aaacacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccaca atgagataccatctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacagg tgctggagaggatgtggagaaataggaacacttttacattgttggtgggactgtaaacta gttcaaccattgtggaagtcagtgtggcgattcctcagggatctagaactagaaatacca tttgacccagccatcccattactgggtatatacccaaatgactataaatcatgctgctat aaagacacatgcacacgtatgtttattgcggcattattcacaatagcaaagacttggaac caacccaaatgtccaacaatgatagactggattaagaaaatgtggcacatatacaccatg gaatactatgcagccataaaaaatgatgagttcatgtcctttgtagggacatggatgaaa ttggaaatcatcattcttagtaaactatcacaagaacaaaaaaccaaacaccgcatattc tcactcatagcctctcttggaggcgggcctgtccctaagcccgccaaggggcgccgcgcc gaggggctgcggagtgggggacggacgcccccgacccgggaaggggcgtccggcggggcc ggaggagagggctctccccgctcaggaggtgcccctgggcgggggaccgggagtcctcaa ccccggactgaggcaggggtctctgggggcgaggagggcgcgtcgccctctgcccccgcc ggcaccctggccatgacaggcaagtcggtgaaggacgtggatcggtaccaggctgtcctg gccaacctgctgctggaggaggataacaagttttgtgcagattgccagtctaaaggcttt gtgaatgccggtccactgatggctgaactgcaggtctctccccagtggaaagccccagag atgagccagatctgcctcagctgtggccatccgtcagccctgagcctccccaacgagacc gccccctgctccacggcacccggtcccatcaaccacccaagggctgaggtgtgtgggcgc acagcgcgggactggcgggcagctccacctgcagccccagtgcgagatccactggatgaa gccagctgggctcctgagtctgggccgcgatgggcctcttggaacattggtgtgttcatc tgcattcgatgtgctggaatccacaggaatctgggggtgcacatatccagggtaaagtca gttaacctcgaccagtggactcaagaacagattcagaaagaaaaagatgacaagtggaaa agagggagcgaaccagttccagaaaaaaaattggaacctgttgtttttgagaaggtgaaa atgaaaaaagaagacccacagctacctcggaaaagctccccgaaatccacagcgcctgtc atggatttgttgggccttgatgctcctgtggcctgctccattgcaaatagtaagaccagc aataccctagagaaggatttagatctgttggcctctgttccatccccttcttcttccggt tccagaaaggttgtaggttccatgccaactgcagggagtgccggctctgttcctgaaaat ctgaacctgtttccggagccagggagcaaatcagaagaaataggcaagaaacagctctct aaagactccattctttcactgtatggatcccagacgcctcaaatgcctactcaagcaatg ttcatggctcccgctcagatggcatatcccacagcctaccccagcttccccggggttaca cctcctaacagcataatggggagcatgatgcctccaccagtaggcatggttgctcagcca ggagcttctgggatggttgcccccatggccatgcctgcaggctatatgggtggcatgcag gcatcaatgatgggtgtgccgaatggaatgatgaccacccagcaggctggctacatggca ggcatggcagctatgccccagactgtgtatggggtccagccagctcagcagctgcaatgg aaccttactcag