GENSCAN 1.0 Date run: 3-Nov-116 Time: 05:37:31 Sequence gi568815583r:40880226_41196310 : 316085 bp : 46.24% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 12809 12957 149 2 2 51 85 58 0.272 1.68 1.02 Intr + 13272 13385 114 1 0 92 67 38 0.520 2.52 1.03 Intr + 14512 14634 123 2 0 83 90 176 0.690 17.86 1.04 Intr + 15713 15854 142 0 1 86 46 160 0.989 11.01 1.05 Intr + 18682 18773 92 1 2 111 119 109 0.995 15.94 1.06 Intr + 18919 20789 1871 2 2 81 115 2595 0.999 248.83 1.07 Term + 22391 23116 726 1 0 104 49 1291 0.992 120.28 1.08 PlyA + 27721 27726 6 1.05 2.00 Prom + 28254 28293 40 -5.26 2.01 Init + 32261 32304 44 1 2 62 92 7 0.003 -1.89 2.02 Intr + 37807 37947 141 1 0 141 99 0 0.003 5.87 2.03 Intr + 45729 46031 303 0 0 81 18 120 0.007 0.01 2.04 Term + 46631 46763 133 2 1 81 46 92 0.017 1.86 2.05 PlyA + 47649 47654 6 -0.45 3.00 Prom + 49076 49115 40 -0.96 3.01 Init + 49444 49509 66 0 0 79 92 151 0.999 13.68 3.02 Intr + 49622 49891 270 1 0 67 105 345 0.999 31.84 3.03 Intr + 50400 50457 58 2 1 81 109 107 0.998 10.56 3.04 Intr + 51278 51541 264 0 0 122 68 352 0.931 34.08 3.05 Intr + 51946 52006 61 2 1 93 76 37 0.984 0.79 3.06 Intr + 52092 52222 131 0 2 138 113 -5 0.990 7.34 3.07 Intr + 52597 52632 36 2 0 107 56 39 0.159 0.83 3.08 Intr + 53387 53497 111 0 0 101 59 17 0.081 0.45 3.09 Intr + 54323 54492 170 0 2 125 100 190 0.947 23.57 3.10 Intr + 54673 54892 220 0 1 101 86 310 0.943 29.97 3.11 Intr + 56003 56705 703 0 1 111 109 1006 0.999 95.52 3.12 Intr + 57193 57387 195 1 0 110 73 53 0.544 4.53 3.13 Intr + 58128 58331 204 0 0 56 105 111 0.521 7.92 3.14 Intr + 60126 60426 301 0 1 92 91 54 0.320 2.84 3.15 Intr + 62063 62158 96 1 0 88 62 47 0.155 2.31 3.16 Intr + 66050 66131 82 1 1 89 61 35 0.013 0.11 3.17 Intr + 72435 72547 113 1 2 82 56 78 0.041 4.10 3.18 Intr + 73404 73589 186 2 0 57 89 370 0.184 33.89 3.19 Intr + 74003 74038 36 1 0 90 107 46 0.970 5.16 3.20 Term + 75148 75549 402 0 0 122 55 553 0.993 50.55 3.21 PlyA + 75614 75619 6 -8.47 4.42 PlyA - 75748 75743 6 1.05 4.41 Term - 77565 77318 248 1 2 88 43 196 0.970 11.05 4.40 Intr - 77986 77778 209 0 2 99 -8 133 0.789 3.52 4.39 Intr - 79801 79655 147 0 0 61 80 83 0.795 4.15 4.38 Intr - 83380 83164 217 2 1 69 43 85 0.138 -0.34 4.37 Intr - 92874 92770 105 1 0 43 52 149 0.708 6.99 4.36 Intr - 96667 96546 122 0 2 37 77 43 0.464 -1.76 4.35 Intr - 100215 100002 214 1 1 134 43 129 0.795 10.77 4.34 Intr - 102852 102637 216 1 0 99 87 177 0.987 17.18 4.33 Intr - 103696 103537 160 1 1 118 46 155 0.524 13.86 4.32 Intr - 104127 103939 189 0 0 61 69 135 0.605 8.68 4.31 Intr - 105201 105113 89 1 2 51 63 191 0.995 12.59 4.30 Intr - 106968 106866 103 0 1 53 58 89 0.626 2.15 4.29 Intr - 107749 107591 159 1 0 70 99 192 0.986 18.68 4.28 Intr - 117376 117304 73 0 1 118 98 37 0.968 7.11 4.27 Intr - 125462 125368 95 2 2 82 95 41 0.006 2.96 4.26 Intr - 135990 135863 128 1 2 138 75 78 0.628 11.90 4.25 Intr - 140900 140675 226 2 1 43 109 121 0.193 7.26 4.24 Intr - 164850 164679 172 2 1 86 121 -23 0.406 0.75 4.23 Intr - 167276 167183 94 0 1 97 94 11 0.744 1.62 4.22 Intr - 167564 167517 48 0 0 115 90 29 0.844 4.45 4.21 Intr - 168051 167987 65 2 2 92 100 -21 0.855 -2.14 4.20 Intr - 169195 169062 134 1 2 115 91 29 0.993 5.34 4.19 Intr - 169877 169710 168 2 0 55 116 66 0.955 6.24 4.18 Intr - 173789 173704 86 0 2 51 115 32 0.678 1.74 4.17 Intr - 175139 175022 118 2 1 91 70 86 0.676 7.14 4.16 Intr - 176481 176397 85 2 1 40 80 45 0.987 -1.28 4.15 Intr - 178556 178414 143 2 2 108 92 143 0.989 15.85 4.14 Intr - 190322 190242 81 2 0 108 90 71 0.996 9.13 4.13 Intr - 191833 191624 210 1 0 105 68 159 0.999 14.71 4.12 Intr - 193270 193203 68 2 2 89 100 46 0.999 4.52 4.11 Intr - 194340 194145 196 0 1 125 101 127 0.997 16.69 4.10 Intr - 199679 199476 204 2 0 87 75 275 0.969 25.50 4.09 Intr - 200848 200795 54 1 0 62 93 56 0.865 2.68 4.08 Intr - 205358 205144 215 0 2 95 115 166 0.998 18.43 4.07 Intr - 207457 207337 121 1 1 41 105 152 0.576 12.27 4.06 Intr - 211957 211802 156 1 0 70 94 188 0.999 17.81 4.05 Intr - 215443 215376 68 2 2 93 116 -22 0.610 -0.28 4.04 Intr - 215703 215534 170 2 2 72 82 52 0.320 2.59 4.03 Intr - 235997 235748 250 0 1 91 78 79 0.011 3.70 4.02 Intr - 236321 236118 204 0 0 49 73 142 0.001 7.97 4.01 Init - 254688 254625 64 0 1 47 115 74 0.200 7.31 4.00 Prom - 268070 268031 40 -3.76 5.00 Prom + 275002 275041 40 -4.46 5.01 Init + 282937 283218 282 0 0 89 2 297 0.350 18.28 5.02 Term + 283697 284149 453 1 0 80 36 315 0.906 20.66 5.03 PlyA + 285161 285166 6 1.05 6.07 PlyA - 285188 285183 6 1.05 6.06 Term - 303831 303809 23 1 2 97 47 16 0.539 -3.13 6.05 Intr - 304368 304041 328 0 1 86 53 179 0.833 9.47 6.04 Intr - 309903 309712 192 0 0 56 92 101 0.897 6.99 6.03 Intr - 311360 311217 144 2 0 74 94 9 0.563 0.58 6.02 Intr - 315630 315550 81 0 0 116 88 27 0.963 5.33 6.01 Intr - 315812 315708 105 2 0 64 115 21 0.849 2.81 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 68723 68523 201 2 0 58 46 201 0.918 8.09 S.002 Init + 132834 132965 132 2 0 100 82 146 0.920 15.34 S.003 Sngl + 235934 236197 264 1 0 105 48 245 0.958 15.40 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:40880226_41196310|GENSCAN_predicted_peptide_1|1072_aa XWEGCIEAAGLVKRSRTLVRREMSLDSILVTCPRTSAPPSQRAQIALSGAGWAAPAFTLA PTCPSFLLGLAGICSLLLHALSAVATSLGPGQRAQRPGTMASILDEYENSLSRSAVLQPG CPSVGIPHSGYVNAQLEKEVPIFTKQRIDFTPSERITSLVVSSNQLCMSLGKDTLLRIDL GKANEPNHVELGRKDDAKVHKMFLDHTGSHLLIALSSTEVLYVNRNGQKVRPLARWKGQL VESVGWNKALGTESSTGPILVGTAQGHIFEAELSASEGGLFGPAPDLYFRPLYVLNEEGG PAPVCSLEAERGPDGRSFVIATTRQRLFQFIGRAAEGAEAQGFSGLFAAYTDHPPPFREF PSNLGYSELAFYTPKLRSAPRAFAWMMGDGVLYGALDCGRPDSLLSEERVWEYPEGVGPG ASPPLAIVLTQFHFLLLLADRVEAVCTLTGQVVLRDHFLEKFGPLKHMVKDSSTGQLWAY TERAVFRYHVQREARDVWRTYLDMNRFDLAKEYCRERPDCLDTVLAREADFCFRQRRYLE SARCYALTQSYFEEIALKFLEARQEEALAEFLQRKLASLKPAERTQATLLTTWLTELYLS RLGALQGDPEALTLYRETKECFRTFLSSPRHKEWLFASRASIHELLASHGDTEHMVYFAV IMQDYERVVAYHCQHEAYEEALAVLARHRDPQLFYKFSPILIRHIPRQLVDAWIEMGSRL DARQLIPALVNYSQGGEVQQVSQAIRYMEFCVNVLGETEQAIHNYLLSLYARGRPDSLLA YLEQAGASPHRVHYDLKYALRLCAEHGHHRACVHVYKVLELYEEAVDLALQVDVDLAKQC ADLPEEDEELRKKLWLKIARHVVQEEEDVQTAMACLASCPLLKIEDVLPFFPDFVTIDHF KEAICSSLKAYNHHIQELQREMEEATASAQRIRRDLQELRGRYGTVEPQDKCATCDFPLL NRPFYLFLCGHMFHADCLLQAVRPGLPAYKQARLEELQRKLGAAPPPAKGSARAKEAEGG AATAGPSREQLKADLDELVAAECVYCGELMIRSIDRPFIDPQRYEEEQLSWL >gi568815583r:40880226_41196310|GENSCAN_predicted_CDS_1|3219_bp nngtgggaaggctgcattgaggctgcaggcttggtcaagagaagcaggaccctggtgcga agggaaatgagcctggacagcatacttgtcacctgccccaggactagtgcccctccaagc cagagagcccagatagccctcagtggggctggctgggcggcccccgccttcacgcttgct cccacttgtccctcgttcttgctaggcttggcaggtatttgttccctgctccttcacgct ctcagtgccgtggccacgtcgctgggccccggacaaagagcccagaggccgggcaccatg gcgtccatcctggatgagtacgagaactcgctgtcccgctcggccgtcttgcagcccggc tgccctagcgtgggcatcccccactcggggtatgtgaatgcccagctggagaaggaagtg cccatcttcacaaagcagcgcattgacttcaccccttccgagcgcattaccagtcttgtc gtctccagcaatcagctgtgcatgagcctgggcaaggatacactgctccgcattgacttg ggcaaggcaaatgagcccaaccacgtggagctgggacgtaaggatgacgcaaaagttcac aagatgttccttgaccatactggctctcacctgctgattgccctgagcagcacggaggtc ctctacgtgaaccgaaatggacagaaggtacggccactagcacgctggaaggggcagctg gtggagagtgtgggttggaacaaggcactgggcacggagagcagcacaggccccatcctg gtcgggactgcccaaggccacatctttgaagcagagctctcagccagcgaaggtgggctt ttcggccctgctccggatctctacttccgcccattgtacgtgctaaatgaagaagggggt ccagcacctgtgtgctcccttgaggccgagcggggccctgatgggcgtagctttgttatt gccaccactcggcagcgcctcttccagttcataggccgagcagcagagggggctgaggcc cagggtttctcagggctctttgcagcttacacggaccacccacccccattccgtgagttt cccagcaacctgggctacagtgagttggccttctacacccccaagctgcgctccgcaccc cgggccttcgcctggatgatgggggatggtgtgttgtatggggcattggactgtgggcgc cctgactctctgctgagcgaggagcgagtctgggagtacccagagggggtagggcctggg gccagcccacccctagccatcgtcttgacccagttccacttcctgctgctactggcagac cgggtggaggcagtgtgcacactgaccgggcaggtggtgctgcgggatcacttcctggag aaatttgggccgctgaagcacatggtgaaggactcctccacaggccagctgtgggcctac actgagcgggctgtcttccgctaccacgtgcaacgggaggcccgagatgtctggcgcacc tatctggacatgaaccgcttcgatctggccaaagagtattgtcgagagcggcccgactgc ctggacacggtcctggcccgggaggccgatttctgctttcgccagcgtcgctacctggag agcgcacgctgctatgccctgacccagagctactttgaggagattgccctcaagttcctg gaggcccgacaggaggaggctctggctgagttcctgcagcgaaaactggccagtttgaag ccagccgaacgtacccaggccacactgctgaccacctggctgacagagctctacctgagc cggcttggggctctgcagggcgacccagaggccctgactctctaccgagaaaccaaggaa tgctttcgaaccttcctcagcagcccccgccacaaagagtggctctttgccagccgggcc tctatccatgagctgctcgccagtcatggggacacagaacacatggtgtactttgcagtg atcatgcaggactatgagcgggtggtggcttaccactgtcagcacgaggcctacgaggag gccctggccgtgctcgcccgccaccgtgacccccagctcttctacaagttctcacccatc ctcatccgtcacatcccccgccagcttgtagatgcctggattgagatgggcagccggctg gatgctcgtcagctcattcctgccctggtgaactacagccagggtggtgaggtccagcag gtgagccaggccatccgctacatggagttctgcgtgaacgtgctgggggagactgagcag gccatccacaactacctgctgtcactgtatgcccgtggccggccggactcactactggcc tatctggagcaggctggggccagcccccaccgggtgcattacgacctcaagtatgcgctg cggctctgcgccgagcatggccaccaccgcgcttgtgtccatgtctacaaggtcctagag ctgtatgaggaggccgtggacctggccctgcaggtggatgtggacctggccaagcagtgt gcagacctgcctgaggaggatgaggaattgcgcaagaagctgtggctgaagatcgcacgg cacgtggtgcaggaagaggaagatgtacagacagccatggcttgcctggctagctgcccc ttgctcaagattgaggatgtgctgcccttctttcctgatttcgtcaccatcgaccacttc aaggaggcgatctgcagctcacttaaggcctacaaccaccacatccaggagctgcagcgg gagatggaagaggctacagccagtgcccagcgcatccggcgagacctgcaggagctgcgg ggccgctacggcactgtggagccccaggacaaatgtgccacctgcgacttccccctgctc aaccgccctttttacctcttcctctgtggccatatgttccatgctgactgcctgctgcag gctgtgcgacctggcctgccagcctacaagcaggcccggctggaggagctgcagaggaag ctgggggctgctccacccccagccaagggctctgcccgggccaaggaggccgagggtggg gctgccacggcagggcccagccgggaacagctcaaggctgacctggatgagttggtggcc gctgagtgtgtgtactgtggggagctgatgatccgctctatcgaccggccgttcatcgac ccccagcgctacgaggaggagcagctcagttggctgtag >gi568815583r:40880226_41196310|GENSCAN_predicted_peptide_2|206_aa MPVGVFQGWVSVGCRALITVLSSELPNEAVWISCCSGFPRPLKGQHGTKSVIWVGHQLLW WRPRAAPPRLPGSASPTAPVGTRRAGLTRGSALRQMRHAPASFQCEWIEWSSHVAATVCK SPRLLCSSPAAAPCTWPLLRSDRSGAPQGLSAPPKPGLLPRPWTTRVAFFRPQVRKSRVG EEGRRSPSFGLTETLAAFALGSPPAA >gi568815583r:40880226_41196310|GENSCAN_predicted_CDS_2|621_bp atgcctgttggggtcttccagggctgggtctctgttggttgcagggcactaattactgtg ctgagctccgagctgcctaatgaggccgtttggatttcctgttgttctggcttcccaaga cccttaaagggccagcatggaactaagagtgtgatctgggtgggacaccaactgctctgg tggaggccccgggctgcgcccccacggctgccaggctctgccagtcccactgccccagtt ggcacgcggcgggccgggctgacgcgtggcagcgccctgcggcagatgaggcacgcgccg gcctcatttcaatgtgaatggatcgaatggagcagccatgtggcagcaacagtttgcaaa tctccacgccttctgtgcagctccccggccgccgctccctgcacgtggccccttctccgc agcgaccgcagcggcgcccctcaaggcttaagcgccccgcccaagccaggcctgcttccc cgcccctggaccacgcgtgtagcgttcttccgcccgcaagtccgaaagtcccgagtggga gaagaaggaaggcgcagccccagttttgggctgacagaaacacttgcggccttcgcactg ggctccccacccgcagcatga >gi568815583r:40880226_41196310|GENSCAN_predicted_peptide_3|1234_aa MAAASRSASGWALLLLVALWQQRAAGSGVFQLQLQEFINERGVLASGRPCEPGCRTFFRV CLKHFQAVVSPGPCTFGTVSTPVLGTNSFAVRDDSSGGGRNPLQLPFNFTWPGTFSLIIE AWHAPGDDLRPEALPPDALISKIAIQGSLAVGQNWLLDEQTSTLTRLRYSYRVICSDNYY GDNCSRLCKKRNDHFGHYVCQPDGNLSCLPGWTGEYCQQPICLSGCHEQNGYCSKPAECL CRPGWQGRLCNECIPHNGCRHGTCSTPWQCTCDEGWGGLFCDQGLFEQLSPKQLSGLSQL PLLHEAFSDFLPDMWPCPLLFFLIFILLGLAVYLNYCTHHSPCKNGATCSNSGQRSYTCT CRPGYTGVDCELELSECDSNPCRNGGSCKDQEDGYHCLCPPGYYGLHCEHSTLSCADSPC FNGGSCRERNQGANYACECPPNFTGSNCEKKVDRCTSNPCANGGQCLNRGPSRMCRCRPG FTGTYCELHVSDCARNPCAHGGTCHDLENGLMCTCPAGFSGRRCEVRTSIDACASSPCFN RATCYTDLSTDTFVCNCPYGFVGSRCEFPVGLPPSFPWVAVSLGVGLAVLLVLLGMVAVA VRQLRLRRPDDGSREAMNNLSDFQKDNLIPAAQLKNTNQKKELEVDCGLDKSNCGKQQNH TLDYNLAPGPLGRGTMPGKFPHSDKSLGEKAPLRLHSEKPECRISAICSPRDSMYQSVCL ISEERNECVIATEVSAGLAFPSAFCGRESGLVTLDPWAIPEGESGSSPMGPGAAVASTGI RVSKSAFGPGSTATVGPKSERREGANEGRASCGLENHWVRLLLGFALEVRHTMPTPDLGA YARDTGARTRSTPHSSLPVPFTVHTGYQTGKHSGNRSHSLSRQQGEEQHDLTLPAQNETG KQLPGPRGVREGGTGLGHEGKMTRLPCGGQQPVPLATQQASGHPGPETSSREENGVEEGR TPLRKPRTKDTVPRVTQLVRGGAELKSEQGAATTSKVTAGRSKRSNRLFPEPTWARGVPH LVRGVAHSPTPSAQFPRNDGDPQALWIFGYGSLVWRPDFAYSDSRVGFVRGYSRRFWQGD TFHRGSDKMPGRVVTLLEDHEGCTWGVAYQVQGEQVSKALKYLNVREAVLGGYDTKEVTF YPQDAPDQPLKALAYVATPQNPGYLGPAPEEAIATQILACRGFSGHNLEYLLRLADFMQL CGPQAQDEHLAAIVDAVGTMLPCFCPTEQALALV >gi568815583r:40880226_41196310|GENSCAN_predicted_CDS_3|3705_bp atggcggcagcgtcccggagcgcctctggctgggcgctactgctgctggtggcactttgg cagcagcgcgcggccggctccggcgtcttccagctgcagctgcaggagttcatcaacgag cgcggcgtactggccagtgggcggccttgcgagcccggctgccggactttcttccgcgtc tgccttaagcacttccaggcggtcgtctcgcccggaccctgcaccttcgggaccgtctcc acgccggtattgggcaccaactccttcgctgtccgggacgacagtagcggcggggggcgc aaccctctccaactgcccttcaatttcacctggccgggtaccttctcgctcatcatcgaa gcttggcacgcgccaggagacgacctgcggccagaggccttgccaccagatgcactcatc agcaagatcgccatccagggctccctagctgtgggtcagaactggttattggatgagcaa accagcaccctcacaaggctgcgctactcttaccgggtcatctgcagtgacaactactat ggagacaactgctcccgcctgtgcaagaagcgcaatgaccacttcggccactatgtgtgc cagccagatggcaacttgtcctgcctgcccggttggactggggaatattgccaacagcct atctgtctttcgggctgtcatgaacagaatggctactgcagcaagccagcagagtgcctc tgccgcccaggctggcagggccggctgtgtaacgaatgcatcccccacaatggctgtcgc cacggcacctgcagcactccctggcaatgtacttgtgatgagggctggggaggcctgttt tgtgaccaagggctgtttgagcagctctcccctaaacagctgtccggcttatctcagctg cccctcctccatgaagccttttctgacttcctccccgacatgtggccttgccctctgctc ttcttccttatcttcatcctacttgggttggcagtttatctcaactactgcacccaccac tccccatgcaagaatggggcaacgtgctccaacagtgggcagcgaagctacacctgcacc tgtcgcccaggctacactggtgtggactgtgagctggagctcagcgagtgtgacagcaac ccctgtcgcaatggaggcagctgtaaggaccaggaggatggctaccactgcctgtgtcct ccgggctactatggcctgcattgtgaacacagcaccttgagctgcgccgactccccctgc ttcaatgggggctcctgccgggagcgcaaccagggggccaactatgcttgtgaatgtccc cccaacttcaccggctccaactgcgagaagaaagtggacaggtgcaccagcaacccctgt gccaacgggggacagtgcctgaaccgaggtccaagccgcatgtgccgctgccgtcctgga ttcacgggcacctactgtgaactccacgtcagcgactgtgcccgtaacccttgcgcccac ggtggcacttgccatgacctggagaatgggctcatgtgcacctgccctgccggcttctct ggccgacgctgtgaggtgcggacatccatcgatgcctgtgcctcgagtccctgcttcaac agggccacctgctacaccgacctctccacagacacctttgtgtgcaactgcccttatggc tttgtgggcagccgctgcgagttccccgtgggcttgccgcccagcttcccctgggtggcc gtctcgctgggtgtggggctggcagtgctgctggtactgctgggcatggtggcagtggct gtgcggcagctgcggcttcgacggccggacgacggcagcagggaagccatgaacaacttg tcggacttccagaaggacaacctgattcctgccgcccagcttaaaaacacaaaccagaag aaggagctggaagtggactgtggcctggacaagtccaactgtggcaaacagcaaaaccac acattggactataatctggccccagggcccctggggcgggggaccatgccaggaaagttt ccccacagtgacaagagcttaggagagaaggcgccactgcggttacacagtgaaaagcca gagtgtcggatatcagcgatatgctcccccagggactccatgtaccagtctgtgtgtttg atatcagaggagaggaatgaatgtgtcattgccacggaggtgagtgctgggctcgccttt ccttctgccttttgtgggagggaaagtggcctggtcactcttgacccatgggccattcct gaaggagagagtggcagtagccccatggggcccggagctgctgtggcctccactggcatc cgtgtttccaaaagtgcctttggcccaggctccacggcgacagttgggcccaaatcagaa aggagagagggggccaatgagggcagggcctcctgtgggctggaaaaccactgggtgcgt ctcttgctggggtttgccctggaggtgagacacaccatgcccaccccagacctgggtgct tatgccagagacacaggagcacgcacacgctccacgcctcactcatcgctccctgttccc ttcactgtacacacagggtatcaaacaggaaagcattcagggaaccgttcccacagcctg agccggcagcagggggaagaacaacacgacctcactcttcctgcccaaaatgaaacagga aagcagctcccaggccctaggggtgtacgggaggggggaactgggctgggacacgaaggc aagatgacgaggctcccatgtgggggacagcagccagtccccctcgccacccaacaggcc tccggccatcctggtccagagacatcctcaagggaagagaatggggtcgaggagggccgg acaccgttgagaaaaccaagaactaaagacaccgtgcctagggtcactcagctggtacgt ggtggagctgagctcaaatctgagcaaggtgcagccaccacctccaaagtgactgccggt cggtccaagcgctctaacaggctcttccccgagcctacctgggctcgaggggtcccacac ctggtccggggcgtagcccattcccctacgccgtccgctcagttcccccgaaacgacggc gaccctcaagcgctgtggattttcgggtacggctccctggtgtggaggcccgacttcgcc tacagcgacagccgtgtgggcttcgtgcgcggctacagccgccgtttctggcagggagac accttccatcggggcagcgacaagatgcctggccgtgtggtgacgctccttgaagatcat gagggctgcacttggggcgtggcataccaagtgcaaggggagcaggtaagcaaggccctg aagtacctgaatgtgcgagaggcagtgcttggtggctacgataccaaggaggtcaccttc tatccccaagatgctcctgaccaaccactgaaggcattggcctatgtggccaccccacag aaccctggttacctgggccctgcgcctgaagaggccattgccacgcagatcctggcctgc cggggcttctccggccacaaccttgaatacttgctgcgtctggcagacttcatgcagctc tgtgggcctcaggcgcaggacgagcacctggcagccatcgtggacgctgtgggcaccatg ttgccctgcttctgccccaccgagcaggctctggcgctggtgtga >gi568815583r:40880226_41196310|GENSCAN_predicted_peptide_4|1957_aa MRKGLTEVMGPVPEALQQYSPAMTSSSSSPPPFGNLRQSQSRLICTQMAWIRVSPRTPVR TGRQVAAAGCELREADGSGTTGGHGQATSMSESGRRRRRPSARSRLSAAAILGSQEPRRE RARSVHAQRSHRPTASESEPRLHRRRGDPPLRTPPAPRPPSAAVAAAGPGGPSDDSEDGL DDSNPLLPQSGDPLIQVKEEPPNSLLGETSGAGSSGMLNTYSLNGVLQSESKCDKGNLYN FSKLKKSRKWLKSILLSDESSEADSQSEDDDEEELNLSREELHNMLRLHKYKKLHQNKYS KDKELQQYQYYSAGLLSTYDPFYEQQRHLLGPKKKKFKEEKKLKAKLKKVKKKRRRDEEL SSEESPRRHHHQTKVFAKFSHDAPPPGTKKKHLSIEQLNARRRKVWLSIVKKELPKANKQ KASARNLFLTNSRKLAHQCMKEVRRAALQAQKNCKETLPRARRLTKEMLLYWKKYEKVEK EHRKRAEKEALEQRKLDEEMREAKRQQRKLNFLITQTELYAHFMSRKRDMGHDGIQEEIL RKLEDSSTQRQIDIGGGVVVNITQEDYDSNHFKAQALKNAENAYHIHQARTRSFDEDAKE SRAAALRAANKSGTGFGESYSLANPSIRAGEDIPQPTIFNGKLKGYQLKGMNWLANLYEQ GINGILADEMGLGKTVQSIALLAHLAEKTLYTQDAPFHVVITSYQLVVQDVKYFQRVKWQ YMVLDEAQALKSSSSVRWKILLQFQCRNRLLLTGTPIQNTMAELWALLHFIMPTLFDSHE EFNEWFSKDIESHAENKSAIDENQLSRLHMILKPFMLRRIKKDVENELSDKIEILMYCQL TSRQKLLYQALKNKISIEDLLQSSMGSTQQAQNTTSSLMNLVMQFRKVCNHPELFERQET WSPFHISLKPYHISKFIYRHGQIRVFNHSRDRWLRVLSPFAPDYIQRSLFHRKALDKEKA FRQASKNPKGINEESCFSFLRFIDISPAEMANLMLQGLLARWLALFLSLKASYRLHQLRS WGAPEGESHQRYLRNKDFLLGVNFPLSFPNLCSCPLLKVTAVPLDSYCNDRSAEYERRVL KEGGSLAAKQCLLNGAPELAADWLNRRSQFFPEPAGGLWSIRPQNGWSFIRIPGKESLIT DSGKLYALDVLLTRLKSQGHRVLIYSQMTRMIDLLEEYMVYRKHTYMRLDGSSKISERRD MVADFQNRNDIFVFLLSTRAGGLGINLTAADTVIFYDSDWNPTVDQQAMDRAHRLGQTKQ VTVYRLICKGTIEERILQRAKEKSEIQRMVISGGNFKPDTLKPKEVVSLLLDDEELEKKL RLRQEEKRQQEETNRVKERKRKREKYAEKKKKEDELDGKRRKEGVNLVIPFVPSADNSNL SADGDDSFISVDSAMPSPFSEVRLLLDACITAISISSELHTGSIPLDESSSDMLVIVDDP ASSAPQSRATNSPASITGSVSDTVNGISIQEMPAAGRGHSARSRGRPKGSGSTAKGAGKG RSRKSTAGSAAAMAGAKAGAAAASAAAYAAYGYNVSKGISASSPLQTSLVRPAGLADFGP SSASSPLSSPLSKGNNVPGNPKNLHMTSSLAPDSLVRKQGKGTNPSGGRIFLLITTRTPG KNNTVYRELQSLRYFTVLTESLPTVPGTFVGLIQHPDLKVDLKENDLDHHNPSFIANEPL NDRHGDIRRYIQGVIAHGNISHRSSELLLGPRNCAPCWGKPFLPSGVFQLVRTDLQVDEA NNTCWAQNRSGQKRQGGAPERQPLGVALSPLPECPFYPLDFYRFSKAPRASQPCPPLSSG YPSQARGPFSGSLSKKAKKAFPDCPAQVRSPFWVLKALFSHSTLNGHHQFLDSKFLEEEE FFTAVSPCLEWCMAHSTQKKLAGLHTLPINGQDTRPPRHEHTAQAWHPPQPQRLHNLIGV DPAGDTQTNPGHQPQEQQQHIPPSLPHTSSHGIVPSQ >gi568815583r:40880226_41196310|GENSCAN_predicted_CDS_4|5874_bp atgagaaagggcttgactgaagtaatggggcctgtccctgaagccttgcagcagtacagc ccagcgatgacatcctcctcctcctccccgccgcctttcggcaatcttcgccagtcccag tcccgactaatctgtactcagatggcatggatcagggtctcccctcgaaccccggttcgc acggggcgtcaggtggcagcggcggggtgcgagctgcgcgaggccgacggcagcggcact acgggtggccacgggcaggcgacgagcatgagtgaaagtggccgccgccgccgccggccc agcgcccgcagccgcctcagcgccgccgccatcttggggtcccaggagccgcggagggag cgagctaggagcgtccacgcccaacgcagtcaccgtcccacggcctcagagagcgaaccg cggctccaccgtcggcggggcgacccccccctccggaccccgcccgcaccccgccccccc tccgccgccgtcgcggcggcggggccaggcggcccgagtgatgacagtgaagatggactg gatgacagtaatccattattgccccagtctggggatcccttaatacaagttaaggaagaa cctccaaattcattgcttggtgaaacttctggagcaggcagttctggaatgttaaacaca tattctctgaatggagttctacagtcagaatcaaaatgtgataaggggaatttatataat ttctctaagctgaagaaaagcagaaagtggctaaagagcattctgctaagtgatgaatcc agcgaggctgattctcagagtgaagacgatgatgaagaagaactcaatctcagcagagaa gaacttcacaacatgcttcgactacacaaatataagaaacttcaccaaaataagtatagt aaagacaaggagttgcagcaatatcagtactacagtgcaggcctgctctccacatatgac cctttctatgagcaacaacggcacctacttggacccaagaaaaagaaatttaaggaggaa aagaaacttaaagctaagttgaaaaaagtgaagaaaaaaagacgaagagatgaagaactt tcctctgaagaatcccctcgtcgccatcaccaccagaccaaagtctttgccaagttttct cacgatgcacctccccctggcactaagaaaaagcacttatccattgagcagctgaatgct cgtcgcaggaaagtatggctcagcattgtgaaaaaggaactaccaaaggcaaataagcag aaagcttcagctcgtaacctgtttctcaccaatagccgaaagcttgctcaccagtgcatg aaggaggtgcgtcgagctgccttgcaggcccagaagaactgtaaggaaaccttgcctcgt gcccgccgcctcaccaaggagatgcttctgtactggaagaaatatgagaaagtagagaag gagcaccgcaagagagcagagaaggaagctttggagcagcggaagttggatgaggaaatg cgggaggccaagaggcaacagcgaaaactcaacttcttaattacccagacagagttgtat gcccatttcatgagtcgcaaacgagatatgggtcatgatggtatccaggaagaaatccta aggaaactggaagacagttctacccagagacaaatcgatataggtggaggagtggtagtt aacatcacacaggaggattatgatagtaaccattttaaagcccaggccctgaagaatgct gaaaatgcttaccatattcaccaagctcggacaaggtcatttgatgaagatgcaaaagaa agtcgagcagctgccctacgggcagcaaacaagtctggcactgggtttggggagagttat agcctggctaacccatctatccgggctggtgaggatattccacagcccacaatttttaat ggcaaattgaaaggttatcaactgaaaggcatgaattggttggccaatctatatgaacag ggtattaatggcattcttgctgatgaaatgggccttggtaaaacagtacagagcattgcc cttctggcccatctggctgagaagaccctctacactcaggatgcccccttccatgtggtt attaccagctatcagctggtggtgcaggatgtaaagtatttccagcgggtcaagtggcaa tacatggtactggatgaggctcaggcgctcaagagtagttccagtgttcgttggaagatc ctcttacagttccagtgtcggaatcggcttttgctaaccgggaccccaattcagaacacc atggcagagctttgggctctgctgcatttcattatgccaacattatttgattcacatgag gaatttaatgaatggttttccaaggacattgagagccatgccgaaaacaaatctgctatt gatgagaatcaactttctcgcttacacatgattttgaagccatttatgctgaggagaatc aagaaagatgtggaaaatgaattatctgacaagattgagattctaatgtattgccaactg accagccgacagaagctgctatatcaggcactaaagaacaaaatttccattgaggattta ttgcagtcttctatgggctctacccaacaagcacagaacaccaccagcagcctcatgaat ctggtcatgcagtttaggaaggtgtgtaatcacccggagttatttgaacggcaagaaact tggtctccatttcatatttccctaaagccataccacatttcaaagtttatctaccgtcat ggacagatcagggtcttcaatcattcacgagacaggtggttaagggttctttctccattt gcaccagactatatccaacggtctctctttcacagaaaagcattggacaaagagaaggca tttaggcaagcatccaaaaatcccaaaggtattaatgaagaaagctgtttctctttcctt cgctttattgatatatctccagcagaaatggcaaaccttatgcttcagggacttttggcc agatggttagctcttttcctgtctctgaaagcctcctacaggctccatcagctacgctcc tggggagcgccagaaggggagagccaccagagatacctgaggaacaaggatttccttctt ggggttaattttccactctcctttccaaacctttgcagctgccctttgttaaaggttacc gcagtgccattggattcttactgcaatgaccgaagtgcagaatatgaaaggcgagttctg aaggaaggagggagtctggcagccaagcagtgtttgttgaatggggcccctgaactggct gcagactggctaaatagacgatcacagttcttcccagagccagctggaggtctgtggagc atcagacctcagaatggctggtctttcatcaggattccaggcaaggagagcctcatcact gacagtggaaagctgtatgcccttgatgtcctgctgactcggctcaagtctcaagggcat agggtccttatctactcccagatgaccaggatgatagacctactggaggaatacatggtt tacaggaagcatacctacatgaggcttgatggctcatccaagatctcggagaggcgagac atggttgctgattttcagaacaggaatgacatctttgtgttcctgttaagcacacgagct ggaggactgggtatcaatctcactgctgcagacacagtgattttctatgatagcgactgg aaccccactgtggaccagcaggccatggacagggcccaccgcttagggcagacaaagcag gttactgtgtaccggctcatctgtaaaggcaccattgaagaacgcattctgcaaagagcc aaggagaagagtgagattcagcggatggtgatttcaggtgggaacttcaaaccagatacc ttgaaacccaaagaggtggttagtcttcttctagacgacgaagagttggagaagaaactg aggctgcggcaggaagagaaacggcaacaggaggaaaccaaccgagtgaaagagcgcaag cggaagcgggaaaagtatgcagagaagaagaaaaaagaagatgaattggatgggaaaagg agaaaagagggtgtgaacctggtgatcccatttgttccctcggctgataactccaacctc tctgctgacggagatgactccttcattagcgtggactcagccatgccaagccctttcagt gaggtgaggctccttttagatgcctgcatcacagccatctccatcagcagtgagctgcac actggctccattcccctggacgagagcagcagtgacatgctggtcattgtggatgaccca gcctcctcagcccctcagtctcgagctaccaactctcccgcatccataacaggctccgtc tcagataccgtgaatggaatttccattcaggaaatgccagctgcaggacgtggtcactca gcccgaagccgaggccgccccaaaggttcaggaagcacagccaaaggagcagggaagggc cggagccgaaagtccacggcaggcagtgctgctgcaatggcaggagccaaagccggggct gcagcggcctctgcagctgcctatgccgcatacgggtacaacgtgtctaaaggaatctcc gccagcagtcctctgcagacatcccttgttcggcctgctggccttgctgactttggacct tcaagcgcctcttctcctttgagttcccctttgagcaagggaaataatgttcctgggaat cccaagaacctccacatgaccagcagcctagccccagactctctggtccggaaacagggc aaaggcaccaacccctctggaggacggattttcttgctcatcacaacaaggacaccaggc aaaaacaatactgtatacagagaattacaaagcttgagatacttcacagtcctcacagaa tccctgcctacagttcctggcacctttgtgggcctcatccaacatccagacctcaaagtt gacctcaaggagaatgatctggaccaccacaaccccagctttattgccaatgaacccctc aatgatcggcacggtgacatacgacgttacattcagggagtcattgcccatgggaacatt tcccacaggtcctctgagctcctcctagggccccggaactgtgctccatgctggggtaaa ccgttcctgccctcgggggtgttccagttagtaaggacagacctgcaagtagatgaagcc aataacacctgctgggcacagaacagatctgggcagaaacggcagggaggcgccccagag cgtcagccactcggcgtggccctttctcctttgccggaatgccccttctacccactggat ttctaccgcttctctaaggcccccagagccagccagccttgcccacccctgagctccggt tacccttcccaggccagaggccccttctcaggttccctctccaagaaagccaagaaggcc ttccctgactgccctgctcaagtcagaagccctttctgggtgctcaaggccctcttctcc cattccacgctgaatggtcaccatcagtttctagactccaagttccttgaggaggaggag ttctttaccgctgtatccccgtgcctggaatggtgcatggcacacagcacccagaagaag ctggctggactccacactcttcccatcaatggccaagacacaaggccgccaagacatgag cacacggcccaggcatggcaccctccccagccacaacggctgcacaacctgatcggagtt gaccctgctggagacacccagaccaatccaggacaccagccgcaagaacagcagcaacac ataccacccagcctgccccatacatcatcccacggcatcgtcccctcacagtga >gi568815583r:40880226_41196310|GENSCAN_predicted_peptide_5|244_aa MMRCTLENRNAQTKQLQTAVSNVEKHFGELCQIFAAYVRKTARLRDKADHLVNEINAYAA TETPHLKLGLMNFDDEFAKLQDYRQAEVERLEAKAETELQRTAMDASRTSRHLEETINNF ERQKMKDIKTIFSEFITIEMLFHGKALEVYTAAYRNIQNIDEDEDLEVFRNSLYAPDYSS CLDTVRANSKSPLQRSLSAKCVSGTRQVSTCRLRKDQQAEDDEDEELDVTEEENFLNYTF PFSS >gi568815583r:40880226_41196310|GENSCAN_predicted_CDS_5|735_bp atgatgaggtgcaccctggaaaatcggaacgctcaaacgaaacaactgcaaacagctgtc tcaaatgtggagaagcattttggagaactgtgccaaatcttcgctgcctatgtgcggaaa actgccaggctgagagacaaagcagaccacctggtgaatgaaatcaatgcgtatgctgct acagagaccccacatttaaagctgggcctgatgaactttgatgatgagtttgccaaactt caggattatcgacaagcagaggttgaaagacttgaagccaaagcagaaacggaattacag agaactgcaatggatgctagccgaacaagtcgtcatctggaggaaactattaacaacttt gaaaggcagaaaatgaaggatataaagactatattttctgaatttatcacaatcgaaatg ttatttcacggcaaagctttagaggtctacactgctgcctaccggaatatacaaaacatt gatgaagatgaagatttagaggttttccgaaattctctgtatgcaccagattattcatct tgtttagatactgtaagagcgaattcaaagtcacctcttcagagatcactgtcagctaag tgtgtatctggaacaagacaggtatccacttgtcgactaagaaaggatcaacaagcagaa gatgatgaggatgaagagttagatgttacagaagaagaaaattttcttaactacacattt ccattttcatcataa >gi568815583r:40880226_41196310|GENSCAN_predicted_peptide_6|290_aa VATNCRVYLFDIFLLGSRAFHNGLQMILEDKRILKVIHDCRWLSDCLSHQYGILLNNVFD TQVADVLQFSMETGGYLPNCITTLQESLIKHLQVAPKYLSFLEKRQKLIQENPEVWFIRP VSPSLLKILALEATYLLPLRLALLDEMMSDLTTLVDGYLNTYREGSADRLGGTEPTCMEL PEELLQLKDFQKQRREKAAREYRVNAQGLLIRTVLQPKKLVTETAGKEEKVKGFLFGKNF RIDKAPSFTSQDFHGDVNLLKEESLNKQATNPQHLPPTEEGETRNQSVSK >gi568815583r:40880226_41196310|GENSCAN_predicted_CDS_6|873_bp gtggccacaaattgccgagtttacttatttgacattttccttctgggaagtcgagctttc cacaatggacttcagatgatactagaagacaagagaattttgaaggttatccatgattgt cgttggctttctgattgcctctctcatcagtatggaattttgctgaataatgtctttgac acacaggtagcagatgtacttcagttttccatggaaacgggtggctatcttccaaactgc atcactactttgcaggagagtttaatcaaacaccttcaagtagcccctaaatatctctcc tttctagaaaagagacaaaaactaattcaggaaaatccagaagtatggttcatccgacct gtttcaccctctttactgaaaattttggccctggaagctacctacctgttaccccttcgc ttggcactcctagatgagatgatgtctgacctaaccaccctggtggatggttacctaaac acgtatcgcgaagggtctgcagaccggcttggaggcactgagcctacatgtatggagctg ccagaggaactgcttcaactcaaggacttccagaagcagcgcagggagaaagctgcaaga gaatatagggtgaatgcacagggactcctgataaggacagtgctacagccaaagaaatta gtgacagagacagcagggaaagaggagaaagtcaaaggcttcttatttggtaaaaatttt aggatagataaagctccaagttttacatctcaagactttcacggggatgtgaatttactg aaagaagaatctttgaataaacaagctacaaatcctcaacatctacctcccacagaggaa ggggaaaccagaaaccagagtgtctccaagtga