GENSCAN 1.0    Date run: 4-Nov-116    Time: 04:06:58

Sequence gi568815597r:44722418_44942932 : 220515 bp : 47.46% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1705   1827  123  2  0   99   79   293 0.873  30.06
 1.02 Intr +   1903   1975   73  2  1   94  102    68 0.876   7.26
 1.03 Intr +   2122   2251  130  1  1   25   69   373 0.705  29.80
 1.04 Intr +   2318   2422  105  1  0   85  119   105 0.976  13.71
 1.05 Intr +   2719   2800   82  0  1   95  113   123 0.978  14.71
 1.06 Term +   2874   2986  113  1  2  113   42   118 0.999   8.42
 1.07 PlyA +   3152   3157    6                               1.05

 2.03 PlyA -   3961   3956    6                               1.05
 2.02 Term -   4480   4345  136  0  1   34   53   132 0.302   1.79
 2.01 Init -  10770  10763    8  0  2  114   91     0 0.667   3.40
 2.00 Prom -  10951  10912   40                              -4.76

 3.00 Prom +  14900  14939   40                              -6.36
 3.01 Init +  17522  17585   64  1  1   86   77    50 0.509   5.03
 3.02 Intr +  18496  18590   95  2  2   84   92    58 0.935   5.48
 3.03 Intr +  24967  25068  102  0  0   90   96    51 0.984   6.47
 3.04 Intr +  25235  25283   49  1  1   86  106    47 0.998   4.55
 3.05 Intr +  28025  28147  123  0  0  102   80   175 0.919  18.66
 3.06 Intr +  30745  30837   93  2  0   92   74    74 0.825   6.34
 3.07 Intr +  31316  31416  101  0  2   82   82    78 0.998   6.43
 3.08 Intr +  32333  32428   96  1  0   77   95    65 0.940   6.31
 3.09 Intr +  33512  33566   55  1  1   94  115     9 0.975   2.75
 3.10 Intr +  33658  33820  163  2  1   84   92   123 0.975  11.33
 3.11 Intr +  35139  35229   91  0  1   61   84    27 0.990  -0.40
 3.12 Intr +  35491  35554   64  0  1  111   95    68 0.998   8.09
 3.13 Intr +  35632  35723   92  2  2   42   65   163 0.917   9.11
 3.14 Intr +  36789  36931  143  2  2   65   94   332 0.999  30.65
 3.15 Intr +  37863  38067  205  0  1   70   67   160 0.971  11.30
 3.16 Intr +  38175  38285  111  2  0   86  101    44 0.967   6.08
 3.17 Intr +  39499  39566   68  0  2  129   79    -5 0.993   0.40
 3.18 Intr +  39929  40034  106  2  1   93   73    79 0.981   7.12
 3.19 Intr +  40128  40241  114  2  0   78   96   173 0.987  17.74
 3.20 Intr +  44409  44532  124  2  1   74   90   156 0.986  14.56
 3.21 Term +  44680  44762   83  2  2   95   49   120 0.974   6.66
 3.22 PlyA +  44771  44776    6                               1.05

 4.00 Prom +  51602  51641   40                              -5.66
 4.01 Init +  53180  53183    4  1  1   77   92     0 0.689  -0.54
 4.02 Intr +  53617  53723  107  2  2   80   98   189 0.980  19.03
 4.03 Intr +  54258  54357  100  2  1   92   65    46 0.677   2.38
 4.04 Intr +  55197  55372  176  1  2   58   78   421 0.999  37.76
 4.05 Intr +  55583  55712  130  1  1  116   77   156 0.900  17.57
 4.06 Term +  56159  56268  110  0  2   76   35    97 0.895   1.87
 4.07 PlyA +  56294  56299    6                               1.05

 5.02 PlyA -  56309  56304    6                               1.05
 5.01 Sngl -  58298  57909  390  2  0   91   50   318 0.990  24.52
 5.00 Prom -  59291  59252   40                              -8.86

 6.10 PlyA -  59449  59444    6                               1.05
 6.09 Term -  62066  61793  274  1  1   94   51   412 0.999  33.04
 6.08 Intr -  62366  62212  155  2  2   12  101   241 0.979  16.67
 6.07 Intr -  62568  62488   81  0  0   77   99   163 0.875  16.13
 6.06 Intr -  62888  62691  198  2  0   98   86   183 0.996  18.65
 6.05 Intr -  63259  63182   78  1  0  120   32   136 0.970  10.95
 6.04 Intr -  63811  63657  155  2  2   97   81   161 0.999  16.09
 6.03 Intr -  64279  64046  234  2  0   23   85   555 0.938  46.36
 6.02 Intr -  65049  64955   95  2  2   99   82   119 0.998  12.01
 6.01 Init -  65288  65137  152  2  2  104  107   127 0.870  15.05
 6.00 Prom -  74468  74429   40                              -6.96

 7.00 Prom +  75572  75611   40                              -7.26
 7.01 Init +  78047  78256  210  1  0   75   86   207 0.705  15.99
 7.02 Intr +  78423  78530  108  2  0  101   98   234 0.996  26.18
 7.03 Intr +  78619  78735  117  0  0   86   62   177 0.997  15.56
 7.04 Intr +  79205  79334  130  1  1   35   72   205 0.990  13.87
 7.05 Intr +  79428  79515   88  1  1   75   81    67 0.999   3.73
 7.06 Intr +  80343  80438   96  0  0  120   84    47 0.987   6.62
 7.07 Intr +  80538  80736  199  0  1  113   66   286 0.999  28.25
 7.08 Intr +  80851  80974  124  0  1  107   70   133 0.998  13.56
 7.09 Intr +  81183  81274   92  1  2   98   78    51 0.886   4.81
 7.10 Intr +  81514  81607   94  0  1   99   77    14 0.808   0.94
 7.11 Intr +  81746  81829   84  0  0  111  110    52 0.993   9.49
 7.12 Intr +  81922  82084  163  2  1   82  106   193 0.999  19.53
 7.13 Intr +  82233  82362  130  0  1  122   70   149 0.999  17.20
 7.14 Intr +  82849  82962  114  0  0  100  110    86 0.997  12.64
 7.15 Term +  83070  83261  192  2  0  106   48   249 0.999  20.12
 7.16 PlyA +  83439  83444    6                              -8.70

 8.02 PlyA -  83497  83492    6                               1.05
 8.01 Sngl -  84251  83586  666  2  0   65   55   725 0.990  63.08
 8.00 Prom -  85517  85478   40                             -14.63

 9.00 Prom +  85554  85593   40                             -16.47
 9.01 Init +  86404  86489   86  0  2   58   94    34 0.992   1.48
 9.02 Intr +  87796  88009  214  1  1  126   39   171 0.962  14.72
 9.03 Intr +  88137  88190   54  2  0   73   90    71 0.932   4.98
 9.04 Intr +  89622  89681   60  2  0   82   94   102 0.951   9.13
 9.05 Intr +  90579  90647   69  2  0  140   84    98 0.999  13.98
 9.06 Intr +  90721  90852  132  0  0   83  101   269 0.999  28.54
 9.07 Term +  90957  91355  399  2  0   60   42   289 0.517  16.62
 9.08 PlyA +  93563  93568    6                               1.05

10.28 PlyA -  96111  96106    6                               1.05
10.27 Term -  98287  98272   16  0  1  128   38    25 0.399  -0.79
10.26 Intr -  98969  98882   88  0  1   31   94    36 0.251  -2.47
10.25 Intr -  99202  99050  153  2  0   79   68    59 0.462   3.04
10.24 Intr - 100156 100015  142  1  1   76   42   110 0.737   5.13
10.23 Intr - 100751 100556  196  1  1  109   82   109 0.609  11.82
10.22 Intr - 100968 100826  143  0  2   67   97   162 0.917  14.15
10.21 Intr - 103970 103833  138  2  0  121   93   187 0.998  23.16
10.20 Intr - 104351 104071  281  0  2  129   73   337 0.994  33.50
10.19 Intr - 104665 104485  181  1  1  107   73   197 0.984  19.54
10.18 Intr - 104892 104750  143  1  2   87   47   241 0.985  19.97
10.17 Intr - 105297 104985  313  0  1   76   76   504 0.774  43.76
10.16 Intr - 105774 105426  349  2  1   82  116   279 0.998  25.46
10.15 Intr - 105997 105879  119  1  2  113   84    98 0.997  11.16
10.14 Intr - 106214 106089  126  2  0   76   94   211 0.977  21.38
10.13 Intr - 106657 106565   93  1  0  128   75    72 0.994  10.06
10.12 Intr - 106895 106740  156  2  0  123  101   186 0.999  23.61
10.11 Intr - 107116 106985  132  1  0  113   64   123 0.926  13.24
10.10 Intr - 107344 107197  148  0  1   91   67   215 0.999  19.94
10.09 Intr - 107613 107492  122  0  2  125   75    46 0.999   6.29
10.08 Intr - 108626 108431  196  1  1   79   94   223 0.961  21.42
10.07 Intr - 109380 109289   92  0  2   66   78    62 0.995   1.79
10.06 Intr - 109627 109558   70  0  1  114  114    35 0.999   7.88
10.05 Intr - 109946 109735  212  2  2  104   81   120 0.957  10.61
10.04 Intr - 110784 110674  111  0  0  111   51    59 0.901   5.08
10.03 Intr - 113835 113678  158  1  2   45   21   112 0.934  -0.07
10.02 Intr - 119622 119430  193  0  1  122  115   106 0.956  15.77
10.01 Init - 120515 120444   72  2  0   98  109    36 0.993   6.93
10.00 Prom - 122870 122831   40                              -6.46

11.09 PlyA - 128446 128441    6                               1.05
11.08 Term - 128586 128534   53  1  2   88   44    97 0.627   2.89
11.07 Intr - 135390 135287  104  2  2   66   92   145 0.893  12.52
11.06 Intr - 152409 152261  149  0  2   70   75    82 0.936   4.13
11.05 Intr - 153278 153201   78  2  0   81  119    43 0.961   6.45
11.04 Intr - 157591 157401  191  2  2   22   94   117 0.969   5.00
11.03 Intr - 159322 159195  128  0  2  128   68    67 0.943   9.02
11.02 Intr - 175027 174938   90  0  0  111   86    56 0.698   6.81
11.01 Init - 190197 190151   47  0  2   74   81    56 0.509   3.65
11.00 Prom - 197227 197188   40                              -1.86

12.03 PlyA - 198022 198017    6                               1.05
12.02 Term - 204322 204207  116  2  2   59   46   158 0.962   7.43
12.01 Intr - 219248 219089  160  2  1   91  110    58 0.959   7.86

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl - 121740 121504  237  0  0   70   46   220 0.956  11.09

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:44722418_44942932|GENSCAN_predicted_peptide_1|208_aa XIELIKDLVGYDVRQALLKGLVALLIPSVKEISKLQAKILSDPSVLQLTPSLPMFLQQAA AAKAIGVLARNDMSIAEELLYLRVVRGLMAAMGNTDHSNSQRLASLTLESRRLARAAQCF VQMFPLVAEHVRKCMGEELYQLFLSNAEDLYMKIDSIQADILAANTVNVTKALCLHGSSY SMNTLYGSRDSAQMAYLTHFEEDVESKE >gi568815597r:44722418_44942932|GENSCAN_predicted_CDS_1|627_bp nccatcgagttgatcaaagacctggtcggttacgatgtgcgccaggcgctgctcaagggc ctcgtggcgctgctgataccgtcggtcaaggagatctccaaactgcaggccaagatcctc agtgacccctcggttctccagctcacccccagcctgccgatgtttttgcagcaggccgcg gccgccaaggccatcggggtcctggcgcgcaacgacatgagcatcgccgaggagctgctg tacctgcgcgtggtgcgtggcctaatggccgccatgggcaacacggaccacagcaacagc cagcggctggccagcctcacgctggagtcacgccgcctcgcccgcgcggcgcagtgcttc gtgcagatgttccccttggtggcggagcacgtgcgcaagtgcatgggggaggaactctac cagctcttcctgagcaacgctgaggacttgtacatgaaaatagacagcattcaggcggac atcttggcggccaacacagtcaatgttaccaaagccctgtgcctccatggcagctcctac agcatgaacactctctatggctcgcgcgattcggctcagatggcctacctcacacacttc
gaggaggatgtagaatcaaaggagtaa >gi568815597r:44722418_44942932|GENSCAN_predicted_peptide_2|47_aa MPRRWPLLAAGDDPPAQSSWRSLAYALVPEFSSHEPKKNEDALAIEE >gi568815597r:44722418_44942932|GENSCAN_predicted_CDS_2|144_bp atgcccaggagatggcctttacttgctgcaggagacgacccacccgctcagtccagttgg cggagtctagcttatgcactggttcccgagttctcgtcccacgagcccaagaagaatgag gatgcactggcaattgaagagtga >gi568815597r:44722418_44942932|GENSCAN_predicted_peptide_3|713_aa MDSSLQARLFPGLAIKIQRSNGLIHSANVRTVNLEKSCVSVEWAEGGATKGKEIDFDDVA AINPELLQLLPLHPKDNLPLQENVTIQKQKRRSVNSKIPAPKESLRSRSTRMSTVSELRI TAQENDMEVELPAAANSRKQFSVPLAEIPLRMVSEEMEEQVHSIRGSSSANPVNSVRRKS CLVKEVEKMKNKREEKKAQNSEMRMKRAQEYDSSFPNWEFARMIKEFRATLECHPLTMTD PIEEHRICVCVRKRPLNKQELAKKEIDVISIPSKCLLLVHEPKLKVDLTKYLENQAFCFD FAFDETASNEVVYRFTARPLVQTIFEGGKATCFAYGQTGSGKTHTMGGDLSGKAQNASKG IYAMASRDVFLLKNQPCYRKLGLEVYVTFFEIYNGKLFDLLNKKAKLRVLEDGKQQVQVV GLQEHLVNSADDVIKMIDMGSACRTSGQTFANSNSSRSHACFQIILRAKGRMHGKFSLVD LAGNERGADTSSADRQTRMEGAEINKSLLALKECIRALGQNKAHTPFRESKLTQVLRDSF IGENSRTCMIATISPGISSCEYTLNTLRYADRVKELSPHSGPSGEQLIQMETEEMEACSN GALIPGNLSKEEEELSSQMSSFNEAMTQIRELEEKAMEELKEIIQQGPDWLELSEMTEQP DYDLETFVNKAESALAQQAKHFSALRDVIKALRLAMQLEEQASRQISSKKRPQ >gi568815597r:44722418_44942932|GENSCAN_predicted_CDS_3|2142_bp atggactcgtcgcttcaggcccgcctgtttcccggtctcgctatcaagatccaacgcagt aatggtttaattcacagtgccaatgtaaggactgtgaacttggagaaatcctgtgtttca gtggaatgggcagaaggaggtgccacaaagggcaaagagattgattttgatgatgtggct gcaataaacccagaactcttacagcttcttcccttacatccgaaggacaatctgcccttg caggaaaatgtaacaatccagaaacaaaaacggagatccgtcaactccaaaattcctgct ccaaaagaaagtcttcgaagccgctccactcgcatgtccactgtctcagagcttcgcatc acggctcaggagaatgacatggaggtggagctgcctgcagctgcaaactcccgcaagcag ttttcagttcctctggctgaaataccattgaggatggtcagcgaggagatggaagagcaa gtccattccatccgaggcagctcttctgcaaaccctgtgaactcagttcggaggaaatca tgtcttgtgaaggaagtggaaaaaatgaagaacaagcgagaagagaagaaggcccagaac tctgaaatgagaatgaagagagctcaggagtatgacagtagttttccaaactgggaattt gcccgaatgattaaagaatttcgggctactttggaatgtcatccacttactatgactgat 
cctatcgaagagcacagaatatgtgtctgtgttaggaaacgcccactgaataagcaagaa ttggccaagaaagaaattgatgtgatttccattcctagcaagtgtctcctcttggtacat gaacccaagttgaaagtggacttaacaaagtatctggagaaccaagcattctgctttgac tttgcatttgatgaaacagcttcgaatgaagttgtctacaggttcacagcaaggccactg gtacagacaatctttgaaggtggaaaagcaacttgttttgcatatggccagacaggaagt ggcaagacacatactatgggcggagacctctctgggaaagcccagaatgcatccaaaggg atctatgccatggcctcccgggacgtcttcctcctgaagaatcaaccctgctaccggaag ttgggcctggaagtctatgtgacattcttcgagatctacaatgggaagctgtttgacctg ctcaacaagaaggccaagctgcgcgtgctggaggacggcaagcaacaggtgcaagtggtg gggctgcaggagcatctggttaactctgctgatgatgtcatcaagatgatcgacatgggc agcgcctgcagaacctctgggcagacatttgccaactccaattcctcccgctcccacgcg tgcttccaaattattcttcgagctaaagggagaatgcatggcaagttctctttggtagat ctggcagggaatgagcgaggcgcggacacttccagtgctgaccggcagacccgcatggag ggcgcagaaatcaacaagagtctcttagccctgaaggagtgcatcagggccctgggacag aacaaggctcacaccccgttccgtgagagcaagctgacacaggtgctgagggactccttc attggggagaactctaggacttgcatgattgccacgatctcaccaggcataagctcctgt gaatatactttaaacaccctgagatatgcagacagggtcaaggagctgagcccccacagt gggcccagtggagagcagttgattcaaatggaaacagaagagatggaagcctgctctaac ggggcgctgattccaggcaatttatccaaggaagaggaggaactgtcttcccagatgtcc agctttaacgaagccatgactcagatcagggagctggaggagaaggctatggaagagctc aaggagatcatacagcaaggaccagactggcttgagctctctgagatgaccgagcagcca gactatgacctggagacctttgtgaacaaagcggaatctgctctggcccagcaagccaag catttctcagccctgcgagatgtcatcaaggccttgcgcctggccatgcagctggaagag caggctagcagacaaataagcagcaagaaacggccccagtga >gi568815597r:44722418_44942932|GENSCAN_predicted_peptide_4|208_aa MGISRDNWHKRRKTGGKRKPYHKKRKYELGRPAANTKIGPRRIHTVRVRGGNKKYRALRL DVGNFSWGSECCTRKTRIIDVVYNASNNELVRTKTLVKNCIVLIDSTPYRQWYESHYALP LGRKKGAKLTPEEEEILNKKRSKKIQKKYDERKKNAKISSLLEEQFQQGKLLACIASRPG QCGRADGYVLEGKELEFYLRKIKARKGK >gi568815597r:44722418_44942932|GENSCAN_predicted_CDS_4|627_bp atgggcatctctcgggacaactggcacaagcgccgcaaaaccgggggcaagagaaagccc taccacaagaagcggaagtatgagttggggcgcccagctgccaacaccaagattggcccc cgccgcatccacacagtccgtgtgcggggaggtaacaagaaataccgtgccctgaggttg 
gacgtggggaatttctcctggggctcagagtgttgtactcgtaaaacaaggatcatcgat gttgtctacaatgcatctaataacgagctggttcgtaccaagaccctggtgaagaattgc atcgtgctcatcgacagcacaccgtaccgacagtggtacgagtcccactatgcgctgccc ctgggccgcaagaagggagccaagctgactcctgaggaagaagagattttaaacaaaaaa cgatctaaaaaaattcagaagaaatatgatgaaaggaaaaagaatgccaaaatcagcagt ctcctggaggagcagttccagcagggcaagcttcttgcgtgcatcgcttcaaggccggga cagtgtggccgagcagatggctatgtgctagagggcaaagagttggagttctatcttagg aaaatcaaggcccgcaaaggcaaataa >gi568815597r:44722418_44942932|GENSCAN_predicted_peptide_5|129_aa MVRMNALADALKSINNAEKRGKRQVLLRPCSKVIVQFLTVMMKHGYIGEFEITDDHRAGK IVVNLTGRLNKCGAISPRFDVQLKDLEKWQNNLLPSRQFDFIVLTTSAGIMDHEARRKHT GGKIQGFFF >gi568815597r:44722418_44942932|GENSCAN_predicted_CDS_5|390_bp atggtgcgcatgaatgccctggcagatgctctcaagagcatcaacaatgccgaaaagaga ggcaaacgccaggtgcttcttaggccatgctccaaagtcatcgtccagtttctcactgtg atgatgaagcatggttacattggcgaatttgaaatcactgatgatcacagagctgggaaa attgttgtgaacctcacaggcaggctaaacaagtgtggagcgatcagccccagatttgat gtgcaactcaaagatctggaaaaatggcagaataatctgcttccatcccgccagtttgat ttcattgtactgacaacctcagctggcatcatggaccatgaagcaagacgaaaacacaca ggagggaaaatccagggattctttttctag >gi568815597r:44722418_44942932|GENSCAN_predicted_peptide_6|473_aa MTVSYTLKVAEARFGGFSGLLLRWRGSIYKLLYKEFLLFGALYAVLSITYRLLLTQEQRY VYAQVARYCNRSADLIPLSFVLGFYVTLVVNRWWSQYTSIPLPDQLMCVISASVHGVDQR GRLLRRTLIRYANLASVLVLRSVSTRVLKRFPTMEHVVDAGFMSQEERKKFESLKSDFNK YWVPCVWFTNLAAQARRDGRIRDDIALCLLLEELNKYRAKCSMLFHYDWISIPLVYTQVV TIAVYSFFALSLVGRQFVEPEAGAAKPQKLLKPGQEPAPALGDPDMYVPLTTLLQFFFYA GWLKVAEQIINPFGEDDDDFETNQLIDRNLQVSLLSVDEMYQNLPPAEKDQYWDEDQPQP PYTVATAAESLRPSFLGSTFNLRMSDDPEQSLQVEASPGSGRPAPAAQTPLLGRFLGVGA PSPAISLRNFGRVRGTPRPPHLLRFRAEEGGDPEAAARIEEESAESGDEALEP >gi568815597r:44722418_44942932|GENSCAN_predicted_CDS_6|1422_bp atgacggtttcatacactctcaaagtggcggaggcccgcttcggaggtttctctggcctg cttctccgctggaggggaagcatctacaagctcctctacaaggaattcctcctctttggg gccttgtacgctgtgcttagcatcacctaccggctgctgctgacccaggagcagaggtac gtgtatgctcaggtggcccggtactgcaaccgctcagcagacctcattcccttgtccttt 
gtattgggtttctatgtgactctcgtggtgaaccgctggtggtcccagtacacaagcatc ccgctgccagaccagctgatgtgcgtcatctcggctagcgtgcacggcgtggaccagcgg ggccgcctgctgcgccgcaccctcatccgctacgcgaacctggcgtccgtgctggtgctg cgctcggtcagcacccgcgtgcttaagcgcttccccaccatggagcacgtggtggacgca ggtttcatgtcccaggaagagaggaaaaagtttgagagcctgaaatccgacttcaacaag tactgggtcccctgcgtctggttcaccaacctggcggcccaggcccggagggacgggcga atacgtgacgatatcgctctctgtctacttttggaagagctgaacaagtaccgagccaag tgcagcatgctattccactatgactggatcagcatccccctcgtctacacccaagtggtg accatagccgtctactctttctttgccctctccctggttggccgccagtttgtggagcca gaggcaggggctgccaaacctcagaagcttctgaagccaggccaggagccagccccagcc ctgggagacccggacatgtacgtgcctctcaccactctgctgcagttcttcttctatgct ggctggctcaaggtggctgaacagatcatcaacccatttggtgaggatgatgacgacttt gagacaaatcagctcatagaccgcaacttgcaggtgtccctgctatccgtggacgaaatg taccagaaccttccccccgctgagaaggaccagtactgggatgaggaccagccgcagcca ccctacactgtggccacggcggccgagtctctgcggccctcattcctgggctccaccttc aacctgcgcatgagcgacgaccctgagcagagcctgcaggtggaggcgtcccccggatct ggtcggcccgcgcccgccgcgcagaccccgttgctcggccgcttcctgggcgtaggggcg ccctccccggccatcagcctccggaacttcggccgcgtgcgaggcaccccccgccccccg catctgctgcgcttccgggcggaggagggcggcgaccccgaggccgcagcccgcatcgag gaggaatcggcggagtccggggacgaggccctggagccctga >gi568815597r:44722418_44942932|GENSCAN_predicted_peptide_7|646_aa MEPAAGFLSPRPFQRAAAAPAPPAGPGPPPSALRGPELEMLAGLPTSDPGRLITDPRSGR TYLKGRLLGKGGFARCYEATDTETGSAYAVKVIPQSRVAKPHQREKILNEIELHRDLQHR HIVRFSHHFEDADNIYIFLELCSRKSLAHIWKARHTLLEPEVRYYLRQILSGLKYLHQRG ILHRDLKLGNFFITENMELKVGDFGLAARLEPPEQRKKTICGTPNYVAPEVLLRQGHGPE ADVWSLGCVMYTLLCGSPPFETADLKETYRCIKQVHYTLPASLSLPARQLLAAILRASPR DRPSIDQILRHDFFTKGYTPDRLPISSCVTVPDLTPPNPARSLFAKVTKSLFGRKKKSKN HAQERDEVSGLVSGLMRTSVGHQDARPEAPAASGPAPVSLVETAPEDSSPRGTLASSGDG FEEGLTVATVVESALCALRNCIAFMPPAEQNPAPLAQPEPLVWVSKWVDYSNKFGFGYQL SSRRVAVLFNDGTHMALSANRKTVHYNPTSTKHFSFSVGAVPRALQPQLGILRYFASYME QHLMKGGDLPSVEEVEVPAPPLLLQWVKTDQALLMLFSDGTVQVNFYGDHTKLILSGWEP LLVTFVARNRSACTYLASHLRQLGCSPDLRQRLRYALRLLRDRSPA >gi568815597r:44722418_44942932|GENSCAN_predicted_CDS_7|1941_bp 
atggagcctgccgccggtttcctgtctccgcgccccttccagcgtgcggccgccgcgccc gctcccccggccgggcccgggccgcctccgagtgccttgcgcggacctgagctggagatg ctggccgggctaccgacgtcagaccccgggcgcctcatcacggacccgcgcagcggccgc acctacctcaaaggccgcttgttgggcaaggggggcttcgcccgctgctacgaggccact gacacagagactggcagcgcctacgctgtcaaagtcatcccgcagagccgcgtcgccaag ccgcatcagcgcgagaagatcctaaatgagattgagctgcaccgagacctgcagcaccgc cacatcgtgcgtttttcgcaccactttgaggacgctgacaacatctacattttcttggag ctctgcagccgaaagtccctggcccacatctggaaggcccggcacaccctgttggagcca gaagtgcgctactacctgcggcagatcctttctggcctcaagtacttgcaccagcgcggc atcttgcaccgggacctcaagttgggaaattttttcatcactgagaacatggaactgaag gtgggggattttgggctggcagcccggttggagcctccggagcagaggaagaagaccatc tgtggcacccccaactatgtggctccagaagtgctgctgagacagggccacggccctgag gcggatgtatggtcactgggctgtgtcatgtacacgctgctctgcgggagccctcccttt gagacggctgacctgaaggagacgtaccgctgcatcaagcaggttcactacacgctgcct gccagcctctcactgcctgcccggcagctcctggccgccatccttcgggcctcaccccga gaccgcccctctattgaccagatcctgcgccatgacttctttaccaagggctacaccccc gatcgactccctatcagcagctgcgtgacagtcccagacctgacaccccccaacccagct aggagtctgtttgccaaagttaccaagagcctctttggcagaaagaagaagagtaagaat catgcccaggagagggatgaggtctccggtttggtgagcggcctcatgcgcacatccgtt ggccatcaggatgccaggccagaggctccagcagcttctggcccagcccctgtcagcctg gtagagacagcacctgaagacagctcaccccgtgggacactggcaagcagtggagatgga tttgaagaaggtctgactgtggccacagtagtggagtcagccctttgtgctctgagaaat tgtatagccttcatgcccccagcggaacagaacccggcccccctggcccagccagagcct ctggtgtgggtcagcaagtgggttgactactccaataagttcggctttgggtatcaactg tccagccgccgtgtggctgtgctcttcaacgatggcacacatatggccctgtcggccaac agaaagactgtgcactacaatcccaccagcacaaagcacttctccttctccgtgggtgct gtgccccgggccctgcagcctcagctgggtatcctgcggtacttcgcctcctacatggag cagcacctcatgaagggtggagatctgcccagtgtggaagaggtagaggtacctgctccg cccttgctgctgcagtgggtcaagacggatcaggctctcctcatgctgtttagtgatggc actgtccaggtgaacttctacggggaccacaccaagctgattctcagtggctgggagccc ctccttgtgacttttgtggcccgaaatcgtagtgcttgtacttacctcgcttcccacctt cggcagctgggctgctctccagacctgcggcagcgactccgctatgctctgcgcctgctc cgggaccgcagcccagcctag 
>gi568815597r:44722418_44942932|GENSCAN_predicted_peptide_8|221_aa MASRPLPPGRQEEENAKDSGRKPSPVRPRGCLPSIDEARPAGPGPAPASRRGSMLGLAAS FSRRNSLVGPGAGPGGQRPSLGPVPPLGSRVSFSGLPLAPARWVAPSYRTEPVPGERWEA ARAQRALEAALAAGLHDACYSSDEAARLVRELCEQVHVRLRELSPPRYKLVCSVVLGPRA GQGVHVVSRALWDVARDGLASVSYTNTSLFAVATVHGLYCE >gi568815597r:44722418_44942932|GENSCAN_predicted_CDS_8|666_bp atggccagcaggcctctgcccccgggacgccaggaggaggagaatgccaaagactccggg cggaaaccctcaccggtgcggccccgaggctgcctgcccagcattgatgaggcccgaccg gcaggtccaggtccagccccggcctcgcgccggggctccatgctgggcctggccgcgtcc ttctcccgccgcaactcgctggtcgggccaggcgcgggtcctgggggtcagcggccatcc ctgggcccggtgccccctctgggctcaagggtcagcttctcagggttgcccctggcgccc gcccgttgggtggcgccctcctaccgcacggagccagtgcccggggagcgctgggaggct gcgcgtgcacagcgtgccctggaggcggcgctggccgcagggctgcacgacgcgtgctac tccagcgacgaggccgcgcggctggtgcgggagctctgcgagcaggtgcacgttcgcctg cgcgagctcagcccgccacgctacaagctggtatgcagtgtggtgctggggccgcgcgcg ggccagggcgttcacgtggtcagccgtgcgctctgggacgtggcgcgcgatgggctggcc tcggtctcctacaccaacacctcgctcttcgcggtggccacggtccacgggctctactgc gagtga >gi568815597r:44722418_44942932|GENSCAN_predicted_peptide_9|337_aa MEPLGLVVHGKAEPFSAALRSLVNNPRYSDVCFVVGQERQEVFAHRCLLACRCNFFQRLL GTEPGPGVPSPVVLSTVPTEAFLAVLEFLYTNSVKLYRHSVLEVLTAAVEYGLEELRELC LQFVVKVLDVDLVCEALQVAVTFGLGQLQERCVAFIEAHSQEALRTRGFLELSAAALLPL LRSDKLCVDEAELVRAARSWARVGAAVLERPVAEVAAPVVKELRLALLAPAELSALEEQN RQEPLIPVGTRGTGPAPLSRGYRALGGGRSSPAHGPRGEGPPRDCALTGLALQVEQIVEA WKCHALRRGDEARGAPCRRRRGTLPREHHRFLDLSFK >gi568815597r:44722418_44942932|GENSCAN_predicted_CDS_9|1014_bp atggagcccttgggactggtcgtgcatgggaaagctgaacctttttccgcagcactccga agccttgtcaacaacccgcgatacagtgatgtttgcttcgtggttggtcaagaacggcag gaggtatttgcccatcggtgcttgttggcctgtagatgcaacttcttccagcgacttctg ggcacagagccaggccccggggtgcccagtcctgtggtgctaagcactgtgccaactgag gccttcctggcagtgctggagttcctatataccaacagtgtcaagctgtaccgccactct gtgctggaagtgctgacagcggctgtggagtatgggctggaggaactgagagagctgtgc ctgcagtttgtggtgaaggtgctggatgtggacttggtttgtgaggccctgcaggtggcc gtaacctttggcctggggcagctgcaggagcgctgcgtggctttcatagaggcccacagc 
caggaggccctccggacccgaggcttcctggagctgtcggcggccgcgctgctgcccctg ctccgcagcgacaagctctgcgtggacgaggctgaactggtccgcgcggcccgaagctgg gcgcgcgtgggcgcggcggtgctggagcggccggtggctgaggtggcggccccggtggtg aaagagctgagactagccttgctggccccggcggagctgagcgccctggaagagcagaac cggcaggaaccactcatcccggtggggacgcggggaaccggcccagctccactcagcagg gggtacagggcgctgggagggggcaggagcagcccggcccacggtcctcggggcgagggg ccaccgcgggactgcgcactaactggccttgctctgcaggtggagcagattgtggaggcg tggaaatgccatgccctgcggagaggggatgaggcccggggcgccccgtgtcgccgccgg agaggcaccctgccccgggagcatcaccgctttctggacctgtccttcaaatga >gi568815597r:44722418_44942932|GENSCAN_predicted_peptide_10|1380_aa MTRSPPLRELPPSYTPPARTAAPQILAGSLKAPLWLRAYFQGLLFSLGCGIQRHCGKVLF LGLLAFGALALGLRMAIIETNLEQLWVEELCSILCRNPSLGKGNGITDWLLKLIKFHLLM LGRSPAVPNHWAFPMTATEIRVPVTEQEPVASITQDWRGAKIQDTMAGLAPGIPVVWTLS LFALAVGSRVSQELHYTKEKLGEEAAYTSQMLIQTARQEGENILTPEALGLHLQAALTAS KVQVSLYGKSWDLNKICYKSGVPLIENGMIERMIEKLFPCVILTPLDCFWEGAKLQGGSA YLPGRPDIQWTNLDPEQLLEELGPFASLEGFRELLDKAQVGQAYVGRPCLHPDDLHCPPS APNHHSRQAPNVAHELSGGCHGFSHKFMHWQEELLLGGMARDPQGELLRAEALQSTFLLM SPRQLYEHFRGDYQTHDIGWSEEQASTVLQAWQRRFVQLAQEALPENASQQIHAFSSTTL DDILHAFSEVSAARVVGGYLLMLAYACVTMLRWDCAQSQGSVGLAGVLLVALAVASGLGL CALLGITFNAATTQVLPFLALGIGVDDVFLLAHAFTEALPGTPLQERMGECLQRTGTSVV LTSINNMAAFLMAALVPIPALRAFSLQAAIVVGCTFVAVMLVFPAILSLDLRRRHCQRLD VLCCFSSPCSAQVIQILPQELGDGTVPVGIAHLTATVQAFTHCEASSQHVVTILPPQAHL VPPPSDPLGSELFSPGGSTRDLLGQEEETRQKAACKSLPCARWNLAHFARYQFAPLLLQS HAKAIVLVLFGALLGLSLYGATLVQDGLALTDVVPRGTKEHAFLSAQLRYFSLYEVALVT QGGFDYAHSQRALFDLHQRFSSLKAVLPPPATQAPRTWLHYYRNWLQGIQAAFDQDWASG RITRHSYRNGSEDGALAYKLLIQTGDAQEPLDFSQLTTRKLVDREGLIPPELFYMGLTVW VSSDPLGLAASQANFYPPPPEWLHDKYDTTGENLRIPPAQPLEFAQFPFLLRGLQKTADF VEAIEGARAACAEAGQAGVHAYPSGSPFLFWEQYLGLRRCFLLAVCILLVCTFLVCALLL LNPWTAGLIVLVLAMMTVELFGIMGFLGIKLSAIPVVILVASVGIGVEFTVHVALGFLTT QGSRNLRAAHALEHTFAPVTDGAISTLLGLLMLAGSHFDFIVRYFFAALTVLTLLGLLHG LVLLPVLLSILGPPPEVTTPSAPSLYSQPKGRGRERQGKGQSPVAHRQSFARVTTSMTVA IHPPPLPGAYIHPAPDEPPWSPAATSSGNLSSRGPDSPPPTCFTRTGNGKAEGQSHAAGG 
RRGGGRSGKQPVEAERGQQGLGSFSGAERCLWPVSQWISCGNCSETEVLAVSEDTAPEEI >gi568815597r:44722418_44942932|GENSCAN_predicted_CDS_10|4143_bp atgactcgatcgccgcccctcagagagctgcccccgagttacacacccccagctcgaacc gcagcaccccagatcctagctgggagcctgaaggctccactctggcttcgtgcttacttc cagggcctgctcttctctctgggatgcgggatccagagacattgtggcaaagtgctcttt ctgggactgttggcctttggggccctggcattaggtctccgcatggccattattgagaca aacttggaacagctctgggtagaagaactatgtagcatcctatgccgaaatccttcactt ggcaaaggcaatggaattactgattggcttcttaaactaatcaagtttcacctcctgatg ctggggaggagcccagctgtccctaatcactgggcctttccgatgacggccacggaaatc agggtcccagttacagagcaggagcctgtggccagcataacgcaggactggaggggcgct aagatccaggacactatggcaggcctggcgccgggaatccctgtggtctggacactttct ctgtttgctctggcagtgggcagccgggtgagccaggagctgcattacaccaaggagaag ctgggggaggaggctgcatacacctctcagatgctgatacagaccgcacgccaggaggga gagaacatcctcacacccgaagcacttggcctccacctccaggcagccctcactgccagt aaagtccaagtatcactctatgggaagtcctgggatttgaacaaaatctgctacaagtca ggagttccccttattgaaaatggaatgattgagcggatgattgagaagctgtttccgtgc gtgatcctcacccccctcgactgcttctgggagggagccaaactccaagggggctccgcc tacctgcccggccgcccggatatccagtggaccaacctggatccagagcagctgctggag gagctgggtccctttgcctcccttgagggcttccgggagctgctagacaaggcacaggtg ggccaggcctacgtggggcggccctgtctgcaccctgatgacctccactgcccacctagt gcccccaaccatcacagcaggcaggctcccaatgtggctcacgagctgagtgggggctgc catggcttctcccacaaattcatgcactggcaggaggaattgctgctgggaggcatggcc agagacccccaaggagagctgctgagggcagaggccctgcagagcaccttcttgctgatg agtccccgccagctgtacgagcatttccggggtgactatcagacacatgacattggctgg agtgaggagcaggccagcacagtgctacaagcctggcagcggcgctttgtgcagctggcc caggaggccctgcctgagaacgcttcccagcagatccatgccttctcctccaccaccctg gatgacatcctgcatgcgttctctgaagtcagtgctgcccgtgtggtgggaggctatctg ctcatgctggcctatgcctgtgtgaccatgctgcggtgggactgcgcccagtcccagggt tccgtgggccttgccggggtactgctggtggccctggcggtggcctcaggccttgggctc tgtgccctgctcggcatcaccttcaatgctgccactacccaggtgctgcccttcttggct ctgggaatcggcgtggatgacgtattcctgctggcgcatgccttcacagaggctctgcct ggcacccctctccaggagcgcatgggcgagtgtctgcagcgcacgggcaccagtgtcgta 
ctcacatccatcaacaacatggccgccttcctcatggctgccctcgttcccatccctgcg ctgcgagccttctccctacaggcggccatagtggttggctgcacctttgtagccgtgatg cttgtcttcccagccatcctcagcctggacctacggcggcgccactgccagcgccttgat gtgctctgctgcttctccagtccctgctctgctcaggtgattcagatcctgccccaggag ctgggggacgggacagtaccagtgggcattgcccacctcactgccacagttcaagccttt acccactgtgaagccagcagccagcatgtggtcaccatcctgcctccccaagcccacctg gtgcccccaccttctgacccactgggctctgagctcttcagccctggagggtccacacgg gaccttctaggccaggaggaggagacaaggcagaaggcagcctgcaagtccctgccctgt gcccgctggaatcttgcccatttcgcccgctatcagtttgccccgttgctgctccagtca catgctaaggccatcgtgctggtgctctttggtgctcttctgggcctgagcctctacgga gccaccttggtgcaagacggcctggccctgacggatgtggtgcctcggggcaccaaggag catgccttcctgagcgcccagctcaggtacttctccctgtacgaggtggccctggtgacc cagggtggctttgactacgcccactcccaacgcgccctctttgatctgcaccagcgcttc agttccctcaaggcggtgctgcccccaccggccacccaggcaccccgcacctggctgcac tattaccgcaactggctacagggaatccaggctgcctttgaccaggactgggcttctggg cgcatcacccgccactcgtaccgcaatggctctgaggatggggccctggcctacaagctg ctcatccagactggagacgcccaggagcctctggatttcagccagctgaccacaaggaag ctggtggacagagagggactgattccacccgagctcttctacatggggctgaccgtgtgg gtgagcagtgaccccctgggtctggcagcctcacaggccaacttctaccccccacctcct gaatggctgcacgacaaatacgacaccacgggggagaaccttcgcatcccgccagctcag cccttggagtttgcccagttccccttcctgctgcgtggcctccagaagactgcagacttt gtggaggccatcgagggggcccgggcagcatgcgcagaggccggccaggctggggtgcac gcctaccccagcggctcccccttcctcttctgggaacagtatctgggcctgcggcgctgc ttcctgctggccgtctgcatcctgctggtgtgcactttcctcgtctgtgctctgctgctc ctcaacccctggacggctggcctcatagtgctggtcctggcgatgatgacagtggaactc tttggtatcatgggtttcctgggcatcaagctgagtgccatccccgtggtgatccttgtg gcctctgtaggcattggcgttgagttcacagtccacgtggctctgggcttcctgaccacc cagggcagccggaacctgcgggccgcccatgcccttgagcacacatttgcccccgtgacc gatggggccatctccacattgctgggtctgctcatgcttgctggttcccactttgacttc attgtaaggtacttctttgcggcgctgacagtgctcacgctcctgggcctcctccatgga ctcgtgctgctgcctgtgctgctgtccatcctgggcccgccgccagaggtgaccacaccc tcggcaccatccctctactcccagcccaagggacggggtagggagaggcaagggaaggga 
cagagccctgtggcccacagacagagctttgccagagtgactacctccatgaccgtggcc atccacccaccccccctgcctggtgcctacatccatccagcccctgatgagcccccttgg tcccctgctgccactagctctggcaacctcagttccaggggaccagactcacctccacct acctgcttcacccgcacgggaaacggcaaggcagaggggcaaagccatgcagcaggtgga aggcgaggtggaggcagatcaggaaagcagccagttgaagcagagagaggtcaacagggt ctggggagcttctcaggagccgagaggtgtctttggccagtcagccagtggatcagttgc gggaactgctcagaaactgaggtgctagcagttagtgaggacacagcgcccgaggagatc tag >gi568815597r:44722418_44942932|GENSCAN_predicted_peptide_11|279_aa MGSQLNPICRECHEQRHPRIRFHTGLVDAHLYCLKKYIVDFLMENGSITSIRSELIPYLV RKQFSSASSQQGQEEKEEDLKKKELKSLDIYSFIKEANTLNLAPYDACWNACRGDRWEDL SRSQVRCYVHIMKEGLCSRVSTLGLYMEANRQVPKLLSALCPEEPPVHSSAQIVSKHLVG VDSLIGPETQIGEKSSIKRSVIGSSCLIKDRVTITNCLLMNSVTVEEGSNIQGSVICNNA VIEKGADIKDCLIGSGQRIEAKAKRVNEVIVGNDQLMEI >gi568815597r:44722418_44942932|GENSCAN_predicted_CDS_11|840_bp atgggaagccagctgaatcctatatgccgagaatgccacgagcaaaggcatcctagaata cgtttccacacgggtcttgtggatgcccacctctactgtttgaaaaaatacatcgtggat ttcctaatggaaaatgggtcaataacttctatccggagtgaactgattccatatttagtg agaaaacagttttcctcagcttcctcacaacagggacaagaagaaaaagaggaggatcta aagaaaaaggagctgaagtccttagatatctacagttttataaaagaagccaatacactg aacctggctccctatgatgcctgctggaatgcctgtcgaggagacaggtgggaagacttg tccagatcacaggtgcgctgctatgtccacatcatgaaagaggggctctgctctcgagtg agcacactgggactctacatggaagcaaacagacaggtgcccaaattgctgtctgctctc tgtccagaagaaccaccagtccattcgtcagcccagattgtcagcaaacacctggttgga gttgacagcctcattgggccagagacacagattggagagaagtcatccattaagcgctca gtcattggctcatcctgtctcataaaagatagagtgactattaccaattgccttctcatg aactcagtcactgtggaggaaggaagcaatatccaaggcagtgtcatctgcaacaatgct gtgatcgagaagggtgcagacatcaaggactgcttgattggaagtggccagaggattgaa gccaaagctaaacgagtgaatgaggtgatcgtggggaatgaccagctcatggagatctga >gi568815597r:44722418_44942932|GENSCAN_predicted_peptide_12|91_aa TDVLVLSCDLITDVALHEVVDLFRAYDASLAMLMRKGQDSIEPVPGQKGKKKAVEQRDFI GVDSTGKRLLFMANEADLDEELVIKGSILQK >gi568815597r:44722418_44942932|GENSCAN_predicted_CDS_12|276_bp acagatgtgctggtgctgagctgtgatctgataacagacgttgccttacatgaggttgtg 
gacctgtttagagcttatgatgcatcacttgctatgttgatgagaaaaggccaagatagc atagaacctgttcccggtcaaaaggggaaaaaaaaagcagtggagcagcgtgacttcatt ggagtggacagcacaggaaagaggctgctcttcatggctaatgaagcagacttggatgaa gagctggtcattaagggatccatcctacagaagtaa