GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:27:02

Sequence gi568815597r:151242189_151446320 : 204132 bp : 46.46% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    250    379  130  2  1  106   81    30 0.808   3.85
 1.02 Intr +  19973  20146  174  2  0   41   80   107 0.514   4.25
 1.03 Intr +  21726  21840  115  0  1   76   84   203 0.995  19.15
 1.04 Intr +  22644  22730   87  2  0   64   76   146 0.996  11.17
 1.05 Intr +  22978  23046   69  0  0   85  105    55 0.984   6.28
 1.06 Intr +  23206  23421  216  0  0   77   85   248 0.998  22.10
 1.07 Intr +  23816  23924  109  1  1   86   62   126 0.820   9.66
 1.08 Intr +  24120  24251  132  1  0  104   99   210 0.998  24.32
 1.09 Intr +  24332  24399   68  0  2   69  100    54 0.999   3.32
 1.10 Intr +  24985  25151  167  0  2   93   44   294 0.014  24.26
 1.11 Intr +  41000  41130  131  2  2   57   86   101 0.723   7.14
 1.12 Intr +  44087  46218 2132  0  2  100   82  1463 0.983 134.24
 1.13 Intr +  46340  46518  179  1  2   74   51    91 0.998   2.72
 1.14 Intr +  46907  47083  177  2  0  111   61    95 0.992   8.13
 1.15 Intr +  47190  47352  163  0  1   99   87    94 0.995  10.38
 1.16 Intr +  47490  47819  330  2  0  108  113   210 0.995  21.33
 1.17 Intr +  47934  48046  113  2  2  100  111    17 0.999   4.38
 1.18 Intr +  48244  48385  142  1  1  121   96   125 0.999  16.96
 1.19 Term +  48527  49021  495  1  0   83   34   427 0.999  31.37
 1.20 PlyA +  49260  49265    6                               -1.75

 2.14 PlyA -  49633  49628    6                                1.05
 2.13 Term -  50845  50664  182  2  2  111   47   395 0.994  35.47
 2.12 Intr -  51274  51212   63  2  0  128   65    50 0.940   5.39
 2.11 Intr -  51950  51830  121  2  1  126   81   206 0.959  23.77
 2.10 Intr -  52353  52221  133  2  1  116   76   195 0.798  21.75
 2.09 Intr -  56885  56620  266  2  2   68   96   219 0.956  17.01
 2.08 Intr -  59779  59656  124  0  1  123  110   141 0.998  20.39
 2.07 Intr -  60110  60006  105  1  0   56  113    96 0.992   8.23
 2.06 Intr -  61462  61353  110  1  2   83   72   148 0.988  11.78
 2.05 Intr -  64175  63948  228  2  0   78   78   386 0.974  34.67
 2.04 Intr -  65613  65386  228  0  0   36   71   284 0.970  19.57
 2.03 Intr -  68067  68023   45  0  0   86   94    28 0.676   1.81
 2.02 Intr -  74321  73385  937  1  1  112   83   989 0.640  91.99
 2.01 Init -  74840  74830   11  2  2   92   31     5 0.274  -4.78
 2.00 Prom -  76617  76578   40                               -4.96

 3.13 PlyA -  79177  79172    6                                1.05
 3.12 Term -  84185  83988  198  2  0   60   50    94 0.815   0.20
 3.11 Intr -  85250  85083  168  2  0   14   99   166 0.384  10.44
 3.10 Intr - 100990 100097  894  1  0   92   59   160 0.118   5.01
 3.09 Intr - 101254 101154  101  2  2   88  110   112 0.518  13.23
 3.08 Intr - 101694 101493  202  0  1   52   70   139 0.544   7.36
 3.07 Intr - 102090 101967  124  2  1   14   98    -7 0.211  -6.51
 3.06 Intr - 102348 102229  120  2  0   93    7   141 0.543   6.11
 3.05 Intr - 102659 102540  120  1  0   85   72   134 0.620  11.11
 3.04 Intr - 103000 102918   83  1  2   97  100    72 0.997   7.74
 3.03 Intr - 103773 103740   34  2  1  115  119    30 0.997   7.03
 3.02 Intr - 104145 104017  129  2  0   86   88    45 0.300   4.11
 3.01 Init - 113227 113193   35  1  2  114   53    27 0.163  -0.54
 3.00 Prom - 114483 114444   40                               -4.26

 4.12 PlyA - 115292 115287    6                                1.05
 4.11 Term - 122517 122355  163  2  1  102   43   190 0.999  13.31
 4.10 Intr - 122856 122738  119  0  2   73   94   138 0.999  12.16
 4.09 Intr - 123093 123001   93  0  0   70  116    61 0.819   7.26
 4.08 Intr - 123496 123375  122  2  2   90   49   232 0.659  19.71
 4.07 Intr - 123658 123580   79  1  1   79  109   104 0.999  10.72
 4.06 Intr - 124265 124087  179  0  2   81   46   233 0.994  18.04
 4.05 Intr - 124716 124534  183  1  0  113   85   195 0.999  21.46
 4.04 Intr - 126131 126011  121  2  1  129   77   147 0.899  17.77
 4.03 Intr - 127001 126852  150  2  0   79   10   225 0.898  14.16
 4.02 Intr - 127366 127254  113  2  2  105   66   136 0.999  13.20
 4.01 Init - 127711 127525  187  1  1   75   86   154 0.957  11.13
 4.00 Prom - 151575 151536   40                               -1.86

 5.00 Prom + 155741 155780   40                               -4.06
 5.01 Init + 157400 157539  140  1  2   62  100    77 0.975   3.92
 5.02 Intr + 157793 157999  207  2  0  111   78   253 0.998  24.79
 5.03 Intr + 158254 158400  147  1  0   83   12   130 0.949   4.25
 5.04 Intr + 158576 158657   82  2  1   49  111    99 0.999   7.94
 5.05 Intr + 159051 159167  117  2  0   65   76   212 0.999  18.36
 5.06 Intr + 159354 159442   89  2  2   73  109   110 0.848  10.37
 5.07 Term + 159629 159641   13  2  1   95   44    14 0.823  -4.63
 5.08 PlyA + 159731 159736    6                                1.05

 6.19 PlyA - 160560 160555    6                                1.05
 6.18 Term - 164276 162614 1663  1  1  107   48  1305 0.981 117.20
 6.17 Intr - 164443 164419   25  2  1  108   87   -26 0.701  -3.62
 6.16 Intr - 164835 164723  113  2  2   73   98    55 0.708   5.02
 6.15 Intr - 165103 165047   57  0  0  104   94     6 0.625   0.80
 6.14 Intr - 166052 165912  141  1  0  119  108    87 0.999  13.27
 6.13 Intr - 166393 166221  173  1  2  101    1   108 0.953   2.14
 6.12 Intr - 166640 166506  135  2  0   80   98   138 0.997  14.76
 6.11 Intr - 169583 169437  147  2  0   77   79   100 0.935   8.43
 6.10 Intr - 170208 170108  101  1  2   81   72    27 0.895   0.23
 6.09 Intr - 181363 181209  155  0  2   81   94   113 0.979  10.92
 6.08 Intr - 182098 181761  338  1  2   67  123   199 0.923  15.62
 6.07 Intr - 182873 182767  107  0  2   96   75    33 0.712   2.73
 6.06 Intr - 185826 185635  192  1  0   94   82   152 0.960  14.66
 6.05 Intr - 186225 185935  291  1  0   46   82   182 0.589  10.51
 6.04 Intr - 187523 187415  109  2  1   60  110    -1 0.973  -0.84
 6.03 Intr - 188653 188478  176  2  2  100  109    85 0.916  11.46
 6.02 Intr - 198898 198740  159  2  0  118  115    85 0.997  14.16
 6.01 Intr - 200017 199893  125  0  2   69   92   206 0.432  19.33

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  24985  25155  171  0  0   93   53   310 0.986  25.83

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:151242189_151446320|GENSCAN_predicted_peptide_1|1709_aa
XVHLGRPDVLPQTPPLEEISEGSPIPDPSFSPLVGETLQMLTTSVDNSEYMRNGDFLPTR
LQAQQDAVNIVCHSKTRSNPENNVGLITLAKYGGQRRRGLSSDCEVLTTLTPDTGRILSK
LHTVQPKGKITFCTGIRVAHLALKHRQGKNHKMRIIAFVGSPVEDNEKDLVKLAKRLKKE
KVNVDIINFGEEEVNTEKLTAFVNTLNGKDGTGSHLVTVPPGPSLADALISSPILAGEGG
AMLGLGASDFEFGVDPSADPELALALRVSMEEQRQRQEEEARRAAAASAAEAGIATTGTE
DSDDALLKMTISQQEFGRTGLPDLSSMTEEEQIAYAMQMSLQGAEFGQAESADIDASSAM
DTSEPAKEEDDYDVMQDPEFLQSVLENLPGVDPNNEAIRNAMGSLASQATKDGKKDKKEE
DKKTCSPRQPSADGRDLIPSATQLGLNLIAASPSPSSALQALGPGKGLGSADMGDMKTPD
FDDLLAAFDIPDIDANEAIHSGPEENEGPGGPGKPEPGVGSESEDTAAASAGDGPGVPAQ
ASDHGLPPPDISVVSVIVKNTVCPEQSEALAGGSAGDGAQAAGVTKEGPVGPHRMQNGFG
SPEPSLPGTPHSPAPPSGGTWKEKGMEGKTPLDLFAHFGPEPGDHSDPLPPSAPSPTREG
ALTPPPFPSSFELAQENGPGMQPPVSSPPLGALKQESCSPHHPQVLAQQGSGSSPKATDI
PASASPPPVAGVPFFKQSPGHQSPLASPKVPVCQPLKEEDDDEGPVDKSSPGSPQSPSSG
AEAADEDSNDSPASSSSRPLKVRIKTIKTSCGNITRTVTQVPSDPDPPAPLAEGAFLAEA
SLLKLSPATPTSEGPKVVSVQLGDGTRLKGTVLPVATIQNASTAMLMAASVARKAVVLPG
GTATSPKMIAKNVLGLVPQALPKADGRAGLGTGGQKVNGASVVMVQPSKTATGPSTGGGT
VISRTQSSLVEAFNKILNSKNLLPAYRPNLSPPAEAGLALPPTGYRCLECGDAFSLEKSL
ARHYDRRSMRIEVTCNHCARRLVFFNKCSLLLHAREHKDKGLVMQCSHLVMRPVALDQMV
GQPDITPLLPVAVPPVSGPLALPALGKGEGAITSSAITTVAAEAPVLPLSTEPPAAPATS
AYTCFRCLECKEQCRDKAGMAAHFQQLGPPAPGATSNVCPTCPMMLPNRCSFSAHQRMHK
NRPPHVCPECGGNFLQANFQTHLREACLHVSRRVGYRCPSCSVVFGGVNSIKSHIQTSHC
EVFHKCPICPMAFKSGPSAHAHLYSQHPSFQTQQAKLIYKCAMCDTVFTHKPLLSSHFDQ
HLLPQRVSVFKCPSCPLLFAQKRTMLEHLKNTHQSGRLEETAGKGAGGALLTPKTEPEEL
AVSQGGAAPATEESSSSSEEEEVPSSPEPPRPAKRPRRELGSKGLKGGGGGPGGWTCGLC
HSWFPERDEYVAHMKKEHGKSVKKFPCRLCERSFCSAPSLRRHVRVNHEGIKRVYPCRYC
TEGKRTFSSRLILEKHVQVRHGLQLGAQSPGRGTTLARGSSARAQGPGRKRRQSSDSCSE
EPDSTTPPAKSPRGGPGSGGHGPLRYRSSSSTEQSLMMGLRVEDGAQQCLDCGLCFASPG
SLSRHRFISHKKRRGVGKASALGLGDGEEEAPPSRSDPDGGDSPLPASGGPLTCKVCGKS
CDSPLNLKTHFRTHGMAFIRARQGAVGDN
>gi568815597r:151242189_151446320|GENSCAN_predicted_CDS_1|5130_bp ngcgttcaccttggtcgtcctgatgttttacctcagactccacctttggaggaaatcagt gagggctcgcctattcctgaccccagtttctcacctctagttggagagactttgcaaatg ctaactacaagtgtggacaacagtgagtatatgcggaatggagacttcttacccaccagg ctgcaggcccagcaggatgctgtcaacatagtttgtcattcaaagacccgcagcaaccct gagaacaacgtgggccttatcacactggctaagtatgggggacagaggaggaggggactc agtagtgactgtgaagtgctgaccacactcaccccagacactggccgtatcctgtccaag ctacatactgtccaacccaagggcaagatcaccttctgcacgggcatccgcgtggcccat ctggctctgaagcaccgacaaggcaagaatcacaagatgcgcatcattgcctttgtggga agcccagtggaggacaatgagaaggatctggtgaaactggctaaacgcctcaagaaggag aaagtaaatgttgacattatcaattttggggaagaggaggtgaacacagaaaagctgaca gcctttgtaaacacgttgaatggcaaagatggaaccggttctcatctggtgacagtgcct cctgggcccagtttggctgatgctctcatcagttctccgattttggctggtgaaggtggt gccatgctgggtcttggtgccagtgactttgaatttggagtagatcccagtgctgatcct gagctggccttggcccttcgtgtatctatggaagagcagcggcagcggcaggaggaggag gcccggcgggcagctgcagcttctgctgctgaggccgggattgctacgactgggactgaa gactcagacgatgccctgctgaagatgaccatcagccagcaagagtttggccgcactggg cttcctgacctaagcagtatgactgaggaagagcagattgcttatgccatgcagatgtcc ctgcagggagcagagtttggccaggcggaatcagcagacattgatgccagctcagctatg gacacatctgagccagccaaggaggaggatgattacgacgtgatgcaggaccccgagttc cttcagagtgtcctagagaacctcccaggtgtggatcccaacaatgaagccattcgaaat gctatgggctccctggcctcccaggccaccaaggacggcaagaaggacaagaaggaggaa gacaagaagacttgctccccgcgccagccctcggcagatggcagggacttaattccgtct gctacccagcttggcctcaacctaatcgccgccagcccctcgccctcctctgcgctgcag gccttgggcccgggcaaaggtctgggatctgccgatatgggggatatgaagacccctgat tttgatgacctccttgctgcctttgacatccctgacattgatgcgaatgaagccatccat tctgggccagaagaaaatgaggggcctggaggcccagggaagccagaaccaggtgtagga agtgaatctgaagacacagcagcagcctctgctggggatggccctggagttccagcccag gcctctgaccatggcctgccaccgccagacatttctgtagtcagtgtcattgtcaagaac actgtgtgtcccgagcagtctgaggccctggctggaggctcagcaggagacggggcccag gctgctggggtaactaaagaagggcctgtggggcctcatcgaatgcagaatggttttggg agccctgaaccttccctcccaggaactccccactctcctgctcctcccagtgggggcacc 
tggaaagaaaaaggcatggaaggcaaaactcccttggacctgtttgctcattttggccct gagccaggggaccactcagatccgctgcctccctctgcaccctctcccactcgggagggg gctctgaccccgcctcctttcccctcttcctttgagctggcccaggagaatggcccaggc atgcagccacctgtttcttccccaccattgggggccttgaagcaggagagctgcagcccc catcatccccaggtcctagcccaacaaggctcaggctccagccctaaggccacggacatc cctgccagtgcctcgcctcccccagttgctggggtgcccttcttcaagcagtctccaggg caccagagccctcttgcctcccccaaagtgcccgtctgtcagcccttgaaggaagaagat gatgatgaggggccagtggacaagtcttccccaggaagtccccagagtccctctagtggg gccgaggctgcagatgaggacagcaatgactcccctgcctccagctcctctaggcctctt aaggtgcggatcaagaccattaaaacatcctgcgggaatatcacaaggactgtaactcag gtcccctcagatcctgatccacctgcccccttggctgagggggccttcttggctgaggct agcctcttgaagctgtcccctgcaacacctacttctgagggtccaaaggtggtgagcgta cagttgggtgatggtacaaggctgaaaggcactgtgctgcctgtggccaccatccagaac gccagtactgccatgctgatggcagccagtgtggctcgcaaggctgtggtgctgcctggg gggactgccaccagccctaagatgattgctaagaacgtgctaggcctggtgccccaagcc ctgcctaaggctgacgggcgggcagggctggggactgggggacagaaggtgaatggtgcc tcggtggtgatggtgcaaccttcaaagacagctactgggccaagtacagggggcggcaca gtgatatcacggacccagtccagcctggtggaggccttcaacaagatcctcaacagcaag aacctgctccctgcctataggccaaacctgagcccaccagctgaggctgggctggccctg cctcccaccggctaccgctgcctggagtgtggggatgccttctcattggagaagagcctg gcacggcactatgaccgtcggagcatgcgcatcgaggtcacctgcaaccactgcgcccgc cgcctggtcttcttcaacaagtgcagcctgctcctgcatgcacgtgaacacaaggacaag gggctcgtcatgcagtgctcacatttggtcatgaggcctgtagcccttgaccagatggtg gggcagccggacatcacaccgctgctgcctgtagctgtcccacctgtctctggacctctg gccttgcctgccttgggcaagggtgagggggccatcacctcctctgccattactacagtt gctgctgaggcccctgtcctgccgctctccacagagccgcctgctgccccggccacctct gcttacacatgctttcgctgcctggagtgcaaggaacagtgccgggacaaggctggcatg gcagctcacttccagcagctcggcccccctgcccctggggccaccagcaatgtgtgccca acctgccccatgatgctccccaatcgctgcagcttcagcgcccaccagcgcatgcataag aatcgacccccccatgtctgtcctgagtgtgggggcaacttcctgcaagccaattttcag acccatctccgggaggcctgtctgcacgtctctcgccgtgtaggatacaggtgccccagc tgttcagtggtgtttgggggtgtgaactccatcaagtcccacatccagacgtcgcactgc 
gaggttttccacaagtgccccatctgccccatggccttcaagtctgggccaagtgcccat gcccacctctactcccagcatcccagcttccaaactcagcaggccaagctgatctacaag tgcgccatgtgcgacacagtcttcactcacaaacccctcctctcctcacacttcgaccag cacttgctgccccagcgtgtcagtgtctttaagtgcccgtcttgtcctctgctctttgcc caaaaaaggaccatgctggaacatctcaagaacacccatcagtctgggcgcttggaggag actgctgggaaaggggccgggggtgccctgctgacccccaagactgagcctgaggagctg gctgtttctcagggaggggcagcccctgctactgaggagtcgtcttcatcttcagaagag gaggaagtacccagctcccctgagcccccccgtccagccaaacggcctcggcgggaacta gggagcaaaggcctcaagggtgggggtggggggcctggaggctggacctgtggcctgtgt cactcctggttccctgagcgtgatgaatacgtggcccacatgaagaaggagcatggcaag tcagtgaaaaagttcccctgtcgcctgtgtgagcgctccttctgctccgcccccagcctg aggcgccatgtcagagttaatcacgagggcatcaagcgagtttacccctgcaggtattgc acagagggaaaacgcaccttcagcagccgcctgatcctagagaaacatgtccaggtccgg cacggcttgcagcttggggcccagtcccctggccgggggaccaccttggctcggggttcc agtgccagagcccaggggccaggtcggaaacgccgccagtcttctgactcttgcagtgag gagcctgacagcacgacaccgccagccaagtcccccaggggcggacctggatctggaggc catggccctctgcgctaccggagcagcagctccacagaacagagcctcatgatggggttg agggtggaggatggtgcccagcagtgcctcgactgtggcttgtgctttgcctcccctggc tccctgagccgacaccgtttcatcagccacaagaagagacggggtgtgggtaaagccagt gccctggggctgggggatggggaggaagaggcccctccatcaaggtctgaccccgatggt ggagactcacccctgcctgcttctggaggcccactgacctgtaaggtctgtggcaagagc tgcgacagccctctaaacctcaagacccacttccgcacgcatggcatggcgttcatcagg gctcggcagggggctgttggggacaactag >gi568815597r:151242189_151446320|GENSCAN_predicted_peptide_2|850_aa MVKPLEARSLAVAMGDTVVEPAPLKPTSEPTSGPPGNNGGSLLSVITEGVGELSVIDPEV AQKACQEVLEKVKLLHGGVAVSSRGTPLELVNGDGVDSEIRCLDDPPAQIREEEDEMGAA VASGTAKGARRRRQNNSAKQSWLLRLFESKLFDISMAISYLYNSKEPGVQAYIGNRLFCF RNEDVDFYLPQLLNMYIHMDEDVGDAIKPYIVHRCRQSINFSLQCALLLGAYSSDMHIST QRHSRGTKLRKLILSDELKPAHRKRELPSLSPAPDTGLSPSKRTHQRSKSDATASISLSS NLKRTASNPKVENEDEELSSSTESIDNSFSSPVRLAPEREFIKSLMAIGKRLATLPTKEQ KTQRLISELSLLNHKLPARVWLPTAGFDHHVVRVPHTQAVVLNSKDKAPYLIYVEVLECE NFDTTSVPARIPENRIRSTRSVENLPECGITHEQRAGSFSTVPNYDNDDEAWSVDDIGEL QVELPEVHTNSCDNISQFSVDSITSQESKEPVFIAAGDIRRRLSEQLAHTPTAFKRDPED 
PSAVALKEPWQEKVRRIREGSPYGHLPNWRLLSVIVKCGDDLRQELLAFQVLKQLQSIWE QERVPLWIKPYKILVISADSGMIEPVVNAVSIHQVKKQSQLSLLDYFLQEHGSYTTEAFL SAQRNFVQSCAGYCLVCYLLQVKDRHNGNILLDAEGHIIHIDFGFILSSSPRNLGFETSA FKLTTEFVDVMGGLDGDMFNYYKMLMLQGLIAARKHMDKVVQIVEIMQQGCRRCSGSSPS GPMMTVAQVICSQLPCFHGSSTIRNLKERFHMSMTEEQLQLLVEQMVDGSMRSITTKLYD GFQYLTNGIM >gi568815597r:151242189_151446320|GENSCAN_predicted_CDS_2|2553_bp atggtgaaacccttggaagctcgaagtctggctgtggccatgggagatacagtagtggag cctgcccccttgaagccaacttctgagcccacttctggcccaccagggaataatgggggg tccctgctaagtgtcatcacggagggggtcggggaactatcagtgattgaccctgaggtg gcccagaaggcctgccaggaggtgttggagaaagtcaagcttttgcatggaggcgtggca gtctctagcagaggcaccccactggagttggtcaatggggatggtgtggacagtgagatc cgttgcctagatgatccacctgcccagatcagggaggaggaagatgagatgggggccgct gtggcctcaggcacagccaaaggagcaagaagacggcggcagaacaactcagctaaacag tcttggctgctgaggctgtttgagtcaaaactgtttgacatctccatggccatttcatac ctgtataactccaaggagcctggagtacaagcctacattggcaaccggctcttctgcttt cgcaacgaggacgtggacttctatctgccccagttgcttaacatgtacatccacatggat gaggacgtgggtgatgccattaagccctacatagtccaccgttgccgccagagcattaac ttttccctccagtgtgccctgttgcttggggcctattcttcagacatgcacatttccact caacgacactcccgtgggaccaagctacggaagctgatcctctcagatgagctaaagcca gctcacaggaagagggagctgccctccttgagcccggcccctgacacagggctgtctccc tccaaaaggactcaccagcgctctaagtcagatgccactgccagcataagtctcagcagc aacctgaaacgaacagccagcaaccctaaagtggagaatgaggatgaggagctctcctcc agcaccgagagtattgataattcattcagttcccctgttcgactggctcctgagagagaa ttcatcaagtccctgatggcgatcggcaagcggctggccacgctccccaccaaagagcag aaaacacagaggctgatctcagagctctccctgctcaaccataagctccctgcccgagtc tggctgcccactgctggctttgaccaccacgtggtccgtgtaccccacacacaggctgtt gtcctcaactccaaggacaaggctccctacctgatttatgtggaagtccttgaatgtgaa aactttgacaccaccagtgtccctgcccggatccccgagaaccgaattcggagtacgagg tccgtagaaaacttgcccgaatgtggtattacccatgagcagcgagctggcagcttcagc actgtgcccaactatgacaacgatgatgaggcctggtcggtggatgacataggcgagctg caagtggagctccccgaagtgcataccaacagctgtgacaacatctcccagttctctgtg gacagcatcaccagccaggagagcaaggagcctgtgttcattgcagcaggggacatccgc 
cggcgcctttcggaacagctggctcataccccgacagccttcaaacgagacccagaagat ccttctgcagttgctctcaaagagccctggcaggagaaagtacggcggatcagagagggc tccccctacggccatctccccaattggcggctcctgtcagtcattgtcaagtgtggggat gaccttcggcaagagcttctggcctttcaggtgttgaagcaactgcagtccatttgggaa caggagcgagtgcccctttggatcaagccatacaagattcttgtgatttcggctgatagt ggcatgattgaaccagtggtcaatgctgtgtccatccatcaggtgaagaaacagtcacag ctctccttgctcgattacttcctacaggagcacggcagttacaccactgaggcattcctc agtgcacagcgcaattttgtgcaaagttgtgctgggtactgcttggtctgctacctgctg caagtcaaggacagacacaatgggaatatccttttggacgcagaaggccacatcatccac atcgactttggcttcatcctctccagctcaccccgaaatctgggctttgagacgtcagcc tttaagctgaccacagagtttgtggatgtgatgggcggcctggatggcgacatgttcaac tactataagatgctgatgctgcaagggctgattgccgctcggaaacacatggacaaggtg gtgcagatcgtggagatcatgcagcaaggttgtcgccgttgctcaggatcatccccatct ggccccatgatgacggtggcccaggtcatctgttctcagcttccttgcttccatggctcc agcaccattcgaaacctcaaagagaggttccacatgagcatgactgaggagcagctgcag ctgctggtggagcagatggtggatggcagtatgcggtctatcaccaccaaactctatgac ggcttccagtacctcaccaacggcatcatgtga >gi568815597r:151242189_151446320|GENSCAN_predicted_peptide_3|735_aa MPGRLHLLTGKFPHAGMAEDEPDAKSPKTGGRAPPGGAEAGEPTTLLQRLRGTISKAVQN KVEGILQDVQKFSDNDKLYLYLQLPSGPTTGDKSSEPSTLSNEEYMYAYRWIRNHLEEHT DTCLPKQSVYDAYRKYCESLACCRPLSTANFGKIIREIFPDIKARRLGGRGQSKYCYSGI RRKTLVSMPPLPGLDLKGSESVSTKSPSNSTLLSQPEMGPEVTPAPRDELVEAACALTCD WAERILKRSFSSIVEVARFLLQQHLISARSAHAHVLKAMGLAEEDEHAPRERSSKPKNGL ENPEGGAHKKPERLAQPPKDLEARTGAGPLARGERKKSVVESSAPGANNLQVNALVARLP LLLPRAPRSLIPPIPVSPPILAPRLSSGALKVATLPLSSRAGAPPAAVPIINMILPTVPA LPGPGPGPGRAPPGGLTQPRGTENREVGIGGDQGPHDKGVKRTAEVPVSEASGQAPPAKA AKQDIEDTASDAKRKRGRPRKKSGGSGERNSTPLKSAAAMESAQSSRLPWETWGSGGEGN SAGGAERPGPMGEAEKGAVLAQGQGDGTVSKGGRGPGSQHTKEAEDKIPLVPSKVSVIKG SRSQKEAFPLAKGEAAPRAAPQPGPGAASAAAVREAQAANGTRERSLLLRRQCRQSNRDC PHPLRGGPPEDQLNLELRRKMCCEQGEPQLPQAVQDRRVFLKAGASFEEVPIRLHLVDYS GAATKRDEQACVEIE >gi568815597r:151242189_151446320|GENSCAN_predicted_CDS_3|2208_bp atgcccggccgactccacctcttgacagggaagttccctcatgccgggatggcagaagat 
gagcctgatgctaagagccccaagactgggggaagggcccccccaggtggtgctgaggct ggggaacctaccacccttcttcagaggctccgaggtaccatttccaaggccgtgcagaac aaagtagaggggatcctgcaagatgtacagaaattttctgacaatgacaagctgtatctc taccttcagctcccctcaggacccaccactggagacaaaagctcagagccaagtacactg agcaatgaggagtacatgtatgcctataggtggatccgcaaccacctggaagagcacact gacacctgtctgccaaagcaaagtgtttatgatgcctatcggaagtactgtgagagtctt gcctgttgccgcccactcagcacagccaactttggcaagatcatcagagagatcttccct gacatcaaagctcgaaggcttggtggccggggccagtccaaatattgctacagtggcata aggaggaagaccttggtgtctatgccacccctgcctggacttgacctaaagggttctgag agtgtaagtaccaaatcaccttccaattccactcttctctcccagccagaaatgggccca gaagtaaccccagcacctcgagatgaactggtggaggcagcgtgtgccctgacctgtgac tgggcagagcggatcctgaaacggtccttcagttccatcgttgaggtcgcccgcttcctg ctacagcagcatctcatctctgcccgatctgcacatgcccatgtgcttaaggccatgggg ctcgctgaagaggacgaacatgcacctcgggaacggtcatctaaaccaaagaatggttta gagaacccagagggtggagcccacaagaagccagagagactggcccagcctcctaaggat ctggaagcccgaactggggccggtcctctcgcacgtggagagcggaagaagagtgtagtt gagagctcggccccaggagccaataacctgcaggttaatgccctagtggctcggctgcct ctgctccttccccgggcccctcgctcactaattccgccaatcccagtctctccacctatt ctggcccccaggctttcttcaggtgccctgaaagtggctacactgcctctgtctagtagg gccggggcacccccagcagctgtgcccatcattaacatgatcttaccaactgttcctgct ttgcctggacctggacctgggcctgggcgagctccacctgggggactcactcagccccgg ggcacagagaacagagaggtaggcataggtggtgaccaaggaccacatgacaagggtgtc aagaggacagctgaagtacctgtgagtgaggccagtgggcaggctccaccagctaaagca gcaaagcaggatatagaggatacagcaagtgatgccaaaaggaaacgggggcgccctcga aaaaagtcaggtggaagtggggaaaggaattctacccctctcaagtcagcagctgccatg gaatctgcccagtcctcaaggttaccatgggagacatggggctcaggaggggaaggcaac tcagctggaggggcagagaggccagggccaatgggagaggctgaaaagggggcagtactt gcccagggtcagggagatggtactgtttccaaaggaggaaggggccccggttcccagcat accaaagaagcagaagataaaattcccttggtcccctcaaaagtgagtgtcatcaagggc agcagaagccaaaaggaggcttttcctttggcaaagggagaggcggcgccgcgggcagcc ccgcagccggggcctggtgcagcctccgcggccgctgtcagggaagcgcaggcggccaat ggaacccgggagcggtcgctgctgctgaggcggcagtgtcggcagtccaaccgcgactgc 
ccgcaccccctccgcgggggtcccccagaggatcaactaaaccttgaactaagaagaaaa atgtgttgtgagcagggggagcctcagctgcctcaggccgttcaggacagaagggtgttt ctgaaggccggagcaagttttgaagaagtccctatcagattacacttggttgactactcc ggagcagccactaagagggatgaacaggcctgcgtggaaattgaatga >gi568815597r:151242189_151446320|GENSCAN_predicted_peptide_4|502_aa MRLEWGPRPAALPWPAGMCAAERAEGAFTLQSVAQPMRPIASTATKCGNCGPGYSTPLEA MKGPREEIVYLPCIYRNTGTEAPDYLATVDVDPKSPQYCQVIHRLPMPNLKDELHHSGWN TCSSCFGDSTKSRTKLVLPSLISSRIYVVDVIEPKDIHAKCELAFLHTSHCLASGEVMIS SLGDVKGNGKGGFVLLDGETFEVKGTWERPGGAAPLGYDFWYQPRHNVMISTEWAAPNVL RDGFNPADVEAGLYGSHLYVWDWQRHEIVQTLSLKDGLIPLEIRFLHNPDAAQGFVGCAL SSTIQRFYKNEGGTWSVEKVIQVPPKKVKGWLLPEMPGLITDILLSLDDRFLYFSNWLHG DLRQYDISDPQRPRLTGQLFLGGSIVKGGPVQVLEDEELKSQPEPLVVKGKRVAGGPQMI QLSLDGKRLYITTSLYSAWDKQFYPDLIREGSVMLQVDVDTVKGGLKLNPNFLVDFGKEP LGPALAHELRYPGGDCSSDIWI >gi568815597r:151242189_151446320|GENSCAN_predicted_CDS_4|1509_bp atgaggctggagtggggacctaggccagccgcactgccgtggcccgctgggatgtgtgct gcagaacgtgcggagggagccttcaccctccagagcgtggcccagccaatgcgccccatt gcttccacagctacgaaatgtgggaattgtggacccggctactccacccctctggaggcc atgaaaggacccagggaagagatcgtctacctgccctgcatttaccgaaacacaggcact gaggccccagattatctggccactgtggatgttgaccccaagtctccccagtattgccag gtcatccaccggctgcccatgcccaacctgaaggacgagctgcatcactcaggatggaac acctgcagcagctgcttcggtgatagcaccaagtcgcgcaccaagctggtgctgcccagt ctcatctcctctcgcatctatgtggtggacgtcattgagcccaaggacatccatgccaag tgcgaactggcctttctccacaccagccactgcctggccagcggggaagtgatgatcagc tccctgggagacgtcaagggcaatggcaaagggggttttgtgctgctggatggggagacg ttcgaggtgaaggggacatgggagagacctgggggtgctgcaccgttgggctatgacttc tggtaccagcctcgacacaatgtcatgatcagcactgagtgggcagctcccaatgtctta cgagatggcttcaaccccgctgatgtggaggctggactgtacgggagccacttatatgta tgggactggcagcgccatgagattgtgcagaccctgtctctaaaagatgggcttattccc ttggagatccgcttcctgcacaacccagacgctgcccaaggctttgtgggctgcgcactc agctccaccatccagcgcttctacaagaacgagggaggtacatggtcagtggagaaggtg atccaggtgccccccaagaaagtgaagggctggctgctgcccgaaatgccaggcctgatc accgacatcctgctctccctggacgaccgcttcctctacttcagcaactggctgcatggg 
gacctgaggcagtatgacatctctgacccacagagaccccgcctcacaggacagctcttc ctcggaggcagcattgttaagggaggccctgtgcaagtgctggaggacgaggaactaaag tcccagccagagcccctagtggtcaagggaaaacgggtggctggaggccctcagatgatc cagctcagcctggatgggaagcgcctctacatcaccacgtcgctgtacagtgcctgggac aagcagttttaccctgatctcatcagggaaggctctgtgatgctgcaggttgatgtagac acagtaaaaggagggctgaagttgaaccccaacttcctggtggacttcgggaaggagccc cttggcccagcccttgcccatgagctccgctaccctgggggcgattgtagctctgacatc tggatttga >gi568815597r:151242189_151446320|GENSCAN_predicted_peptide_5|264_aa MEAFLGSRSGLWAGGPAPGQFYRIPSTPDSFMDPASALYRGPITRTQNPMVTGTSVLGVK FEGGVVIAADMLGSYGSLARFRNISRIMRVNNSTMLGASGDYADFQYLKQVLGQMVIDEE LLGDGHSYSPRAIHSWLTRAMYSRRSKMNPLWNTMVIGGYADGESFLGYVDMLGVAYEAP SLATGYGAYLAQPLLREVLEKQPVLSQTEARDLVERCMRVLYYRDARSYNRFQIATVTEK GVEIEGPLSTETNWDIAHMISGFE >gi568815597r:151242189_151446320|GENSCAN_predicted_CDS_5|795_bp atggaagcgtttttggggtcgcggtccggactttgggcggggggtccggccccaggacag ttttaccgcattccgtccactcccgattccttcatggatccggcgtctgcactttacaga ggtccaatcacgcggacccagaaccccatggtgaccgggacctcagtcctcggcgttaag ttcgagggcggagtggtgattgccgcagacatgctgggatcctacggctccttggctcgt ttccgcaacatctctcgcattatgcgagtcaacaacagtaccatgctgggtgcctctggc gactacgctgatttccagtatttgaagcaagttctcggccagatggtgattgatgaggag cttctgggagatggacacagctatagtcctagagctattcattcatggctgaccagggcc atgtacagccggcgctcgaagatgaaccctttgtggaacaccatggtcatcggaggctat gctgatggagagagcttcctcggttatgtggacatgcttggtgtagcctatgaagcccct tcgctggccactggttatggtgcatacttggctcagcctctgctgcgagaagttctggag aagcagccagtgctaagccagaccgaggcccgcgacttagtagaacgctgcatgcgagtg ctgtactaccgagatgcccgttcttacaaccggtttcaaatcgccactgtcaccgaaaaa ggtgttgaaatagagggaccattgtctacagagaccaactgggatattgcccacatgatc agtggctttgaatga >gi568815597r:151242189_151446320|GENSCAN_predicted_peptide_6|1402_aa XMADTDLFMECEEEELEPWQKISDVIEDSVVEDYNSVDKTTTVSVSQQPVSAPVPIAAHA SVAGHLSTSTTVSSSGAQNSDSTKKTLVTLIANNNAGNPLVQQGGQPLILTQNPAPGLGT MVTQPVLRPVQVMQNANHVTSSPVASQPIFITTQGFPVRNVRPVQNAMNQVGIVLNVQQG QTVRPITLVPAPGTQFVKPTVGVPQVFSQMTPVRPGSTMPVRPTTNTFTTVIPATLTIRS 
TVPQSQSQQTKSTPSTSTTPTATQPTSLGQLAVQSPGQSNQTTNPKLVSIASFVTVKRPG VTGENSNEVAKLVNTLNTIPSLGQSPGPVVVSNNSSAHGSQRTSGPESSMKVTSSIPVFD LQDGGRKICPRCNAQFRVTEALRGHMCYCCPEMVEYQKKGKSLDSEPSVPSAAKPPSPEK TAPVASTPSSTPIPALSPPTKVPEPNENVGDAVQTKLIMLVDDFYYGRDGGKVAQLTNFP KVATSFRCPHCTKRLKNNIRFMNHMKHHVELDQQNGEVDGHTICQHCYRQFSTPFQLQCH LENVHSPYESTTKCKICEWAFESEPLFLQHMKDTHKPGEMPYVCQVCQYRSSLYSEVDVH FRMIHEDTRHLLCPYCLKVFKNGNAFQQHYMRHQKRNVYHCNKCRLQFLFAKDKIEHKLQ HHKTFRKPKQLEGLKPGTKVTIRASRGQPRTVPVSSNDTPPSALQEAAPLTSSMDPLPVF LYPPVQRSIQKRAVRKMSVMGRQTCLECSFEIPDFPNHFPTYVHCSLCRYSTCCSRAYAN HMINNHVPRKSPKYLALFKNSVSGIKLACTSCTFVTSVGDAMAKHLVFNPSHRSSSILPR GLTWIAHSRHGQTRDRVHDRNVKNMYPPPSFPTNKAATVKSAGATPAEPEELLTPLAPAL PSPASTATPPPTPTHPQALALPPLATEGAECLNVDDQDEGSPVTQEPELASGGGGSGGVG KKEQLSVKKLRVVLFALCCNTEQAAEHFRNPQRRIRRWLRRFQASQGENLEGKYLSFEAE EKLAEWVLTQREQQLPVNEETLFQKATKIGRSLEGGFKISYEWAVRFMLRHHLTPHARRA VAHTLPKDVAENAGLFIDFVQRQIHNQDLPLSMIVAIDEISLFLDTEVLSSDDRKENALQ TVGTGEPWCDVVLAILADGTVLPTLVFYRGQMDQPANMPDSILLEAKESGYSDDEIMELW STRVWQKHTACQRSKGMLVMDCHRTHLSEEVLAMLSASSTLPAVVPAGCSSKIQPLDVCI KRTVKNFLHKKWKEQAREMADTACDSDVLLQLVLVWLGEVLGVIGDCPELVQRSFLVASV LPGPDGNINSPTRNADMQEELIASLEEQLKLSGEHSESSTPRPRSSPEETIEPESLHQLF EGESETESFYGFEEADLDLMEI >gi568815597r:151242189_151446320|GENSCAN_predicted_CDS_6|4209_bp nncatggcggacaccgacctgttcatggaatgtgaggaggaggagttggagccatggcag aaaatcagtgatgtcattgaggactctgtagttgaagattataattcagtggataaaact accacagtttctgtgagccagcagccagtctcggctccagtgcccatcgctgcccatgct tctgttgctgggcacctctctacatccaccaccgttagtagcagcggggcacagaacagc gacagtacaaagaagactcttgtcacactaattgccaacaacaatgctggcaatcctttg gtccagcaaggtggacagccactcatcctgacccagaatccagccccaggtctgggcaca atggttactcaaccagtattgaggcctgttcaggtcatgcagaatgccaatcatgtgact agttcccctgtggcctcacaaccaatatttatcactacgcagggatttcctgtaaggaat gtccggcctgtacaaaatgcaatgaatcaggttgggattgtgctgaacgtacagcaaggc caaacggttagaccaattacactagttccagccccaggtacccagtttgttaagccgaca gttggagttccacaagtgttctcccagatgacccctgtgaggccaggctccacaatgcct gtgaggcccaccaccaacaccttcaccaccgtcatcccggccactcttaccattcgaagc 
accgtcccacagtcccagtcccagcagaccaagtccactcccagcacttctaccactccc actgccacacagccaacctcactggggcaactagctgttcagtctccaggccagtcaaac cagaccacgaatcccaagctagtgagcattgccagctttgtcactgtgaagcgacctggt gttacaggcgaaaatagcaatgaagtggccaaattggtgaatacccttaacaccatccct tccctgggccagagtcctgggccagtggtggtgtccaacaacagctctgctcatggctct caaagaaccagcggacctgagtcttcaatgaaagtgacctcttccatcccagtatttgac ctccaggatggtggacggaaaatatgtccacgatgtaatgctcaatttcgtgttactgaa gctttgagaggtcacatgtgttactgttgcccagaaatggttgaataccagaagaaagga aagtccctggattcagaacccagtgtcccatcagcagcaaagcccccatcccctgagaaa acagctcctgttgcttccacaccctcttctacacctattcctgctctgtcaccgcctacc aaagtaccagaaccaaatgagaacgtgggcgatgccgtccagaccaaactcattatgctt gtagatgacttctactatggacgggatggtggcaaagtagcccagctcacaaatttccct aaggtcgccacatctttccgatgcccacattgtaccaaaaggctaaaaaacaatattcga ttcatgaaccatatgaaacaccacgtagaactcgatcagcagaacggtgaggtagatggt cacactatctgccagcactgttaccgccagttttccactcccttccagcttcagtgccac ttggaaaatgttcatagtccctatgaatctactaccaagtgcaagatctgtgaatgggcg tttgaaagtgagccactatttctccagcatatgaaggatactcataagcctggagagatg ccttatgtttgccaggtgtgtcaatatcgctcctcactctactctgaggtagatgtccat tttcggatgatccatgaggatacccggcatctgctctgcccttattgcctgaaggtcttc aaaaatggcaatgcattccaacagcattacatgaggcaccagaagagaaatgtttatcac tgcaacaaatgccggctgcagtttctctttgccaaggacaaaattgaacacaagcttcaa caccataaaaccttccgtaaacccaagcagctggagggcttgaaaccaggcaccaaggtg acaatccgggcttcccgagggcagccacgaactgttcctgtatcctctaatgatacacct cccagcgccttgcaggaggcagcaccgctgacctcctcaatggaccctctgcctgtcttc ctttatccccctgtccagcgcagcatccagaagagagctgttaggaaaatgagtgtcatg ggccggcagacatgcctggagtgcagcttcgagatcccagacttccctaatcatttccct acttacgtacactgctctctgtgtcgctatagcacctgctgttctcgagcttatgccaac cacatgatcaacaatcatgttccacggaagagccccaagtatttggctttgtttaaaaat tctgtgagtggaatcaagctggcctgcacttcatgtacctttgttacctctgtgggcgat gctatggccaagcatttggtattcaacccctctcacagatccagcagcatcctgccacgg ggactcacttggatagctcactcaaggcatggccagactcgtgaccgagtgcatgaccgg aacgtgaagaatatgtaccctcctccttccttccccactaacaaagctgccactgtgaaa 
tctgcgggggccaccccagctgagcctgaagagctactaactcccttagccccagcactc ccatcaccagcctcaactgcaaccccaccaccaacccccactcacccgcaggctttagcc cttccaccgctggctacagagggagccgaatgtctgaatgttgatgatcaggatgaaggg agcccagtcacccaagaacctgagctagcatcaggtggtggtggtagtggtggagttggc aaaaaggagcagctgtctgtgaagaagcttcgagtagtactgtttgctctatgctgcaat acagaacaggcagctgaacacttccgaaatccccagcgacgtattcgccgttggcttcga cgtttccaggcctcccagggggagaatctagagggcaaatatctgagctttgaggcagaa gagaaactggctgagtgggtgctaacccagcgcgaacaacagctacctgtaaatgaggag accttgttccagaaggccaccaaaataggacgttctttggaaggggggtttaagatctcc tatgagtgggctgtgcgtttcatgctgcggcaccacctgactccccatgcccggcgagct gtggcccacaccctacctaaggatgtagcagagaatgcaggactcttcattgattttgta caacggcagattcacaaccaggacttacccttgtctatgattgtggctattgatgagatc tctttgttcctggatacagaggtgctgagcagtgatgatcgaaaggagaatgccctgcag acagtgggcacaggggaaccttggtgtgatgtagtcctagccattctggcagatggcact gtccttcccaccctggttttctacagagggcagatggatcagcctgctaacatgccagac tccatattgctagaggcaaaggagagtggctacagtgatgacgagatcatggagctgtgg tcaactcgagtgtggcagaagcacacagcttgccagcgcagcaaaggcatgcttgtgatg gactgtcatcgcactcacttgtcagaagaggtactggctatgcttagtgcctctagcact ttgcctgcagtggtcccagcaggctgtagctccaaaattcagccattagatgtatgcatc aaaagaactgtcaagaacttcctgcataaaaaatggaaggaacaggctcgggaaatggca gatactgcatgtgattctgatgtcctgcttcagctggtgcttgtctggctgggtgaagtg ctaggtgtcattggggactgtccagagctagttcagcgctccttcctggtggctagtgtt ctgcctggccccgatggcaacattaactcacctacaagaaatgctgacatgcaggaggag ctaattgcctccctagaggagcaactgaagctgagtggggaacattctgagtcttccact ccacgacccagatcatctcctgaagagacaattgagcctgaaagtcttcaccagctcttt gagggtgaaagtgagaccgagtctttctatggctttgaagaagctgacctagatctgatg gagatttga