GENSCAN 1.0 Date run: 5-Nov-116 Time: 02:29:49 Sequence gi568815576r:19751170_19952261 : 201092 bp : 53.55% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5824 5962 139 0 1 89 72 154 0.930 12.18 1.02 Intr + 7532 7819 288 0 0 75 61 103 0.832 4.06 1.03 Intr + 8200 8508 309 2 0 33 105 225 0.552 15.53 1.04 Intr + 9736 10111 376 2 1 55 71 699 0.095 59.54 1.05 Intr + 10976 11127 152 2 2 96 39 88 0.566 5.02 1.06 Intr + 12041 12173 133 0 1 14 105 278 0.508 22.31 1.07 Intr + 12986 13157 172 2 1 86 68 290 0.965 27.26 1.08 Intr + 13738 13944 207 0 0 96 76 264 0.614 25.80 1.09 Intr + 14589 14656 68 2 2 83 105 129 0.948 12.30 1.10 Intr + 14733 14833 101 0 2 112 97 147 0.999 18.25 1.11 Term + 15220 15698 479 2 2 113 41 594 0.918 52.59 1.12 PlyA + 20141 20146 6 -0.45 2.03 PlyA - 22042 22037 6 1.05 2.02 Term - 25182 25115 68 1 2 86 49 55 0.436 -0.21 2.01 Init - 26252 26120 133 2 1 78 47 68 0.498 1.97 2.00 Prom - 26376 26337 40 0.29 3.00 Prom + 30534 30573 40 -5.31 3.01 Init + 33130 33317 188 0 2 109 92 105 0.934 9.52 3.02 Intr + 33566 33940 375 2 0 53 52 186 0.656 6.19 3.03 Term + 35724 35880 157 0 1 50 45 85 0.067 -1.88 3.04 PlyA + 36614 36619 6 -4.83 4.03 PlyA - 37264 37259 6 1.05 4.02 Term - 37791 37540 252 0 0 88 41 410 0.906 32.27 4.01 Init - 38247 38206 42 0 0 74 34 11 0.550 -5.76 4.00 Prom - 38879 38840 40 -4.21 5.00 Prom + 39745 39784 40 -4.61 5.01 Init + 41125 41349 225 0 0 79 -15 272 0.991 14.46 5.02 Intr + 41380 41658 279 0 0 -10 -16 364 0.823 14.41 5.03 Term + 41743 41925 183 0 0 54 36 263 0.995 15.56 5.04 PlyA + 42596 42601 6 1.05 6.00 Prom + 44427 44466 40 -5.41 6.01 Init + 44631 44882 252 2 0 72 83 229 0.706 18.16 6.02 Term + 45067 45183 117 0 0 49 48 44 0.606 -4.66 6.03 PlyA + 45440 45445 6 1.05 7.40 PlyA - 46256 46251 6 -1.95 7.39 Term - 47675 47582 94 1 1 81 46 47 0.772 -2.60 7.38 Intr - 48701 48290 412 0 1 39 42 220 0.482 6.32 7.37 Intr - 49216 49075 142 1 1 57 70 41 0.758 -0.36 7.36 Intr - 51047 50832 216 2 0 135 82 341 0.985 37.43 7.35 Intr - 52148 52025 124 1 1 70 79 16 0.474 -0.01 7.34 Intr - 54824 54699 126 1 0 90 103 18 0.747 3.80 7.33 Intr - 55420 55191 230 1 2 55 77 106 0.581 3.40 7.32 Intr - 55588 55490 99 1 0 81 110 137 0.449 16.01 7.31 Intr - 56980 56816 165 1 0 -46 64 205 0.958 5.37 7.30 Intr - 61278 61116 163 2 1 80 78 157 0.836 14.39 7.29 Intr - 69554 69429 126 1 0 101 94 174 0.655 19.60 7.28 Intr - 70206 70059 148 1 1 72 96 96 0.607 8.60 7.27 Intr - 76899 76846 54 1 0 74 66 43 0.001 0.14 7.26 Intr - 90818 90721 98 1 2 81 52 78 0.334 3.65 7.25 Intr - 94140 94061 80 0 2 35 66 77 0.179 -0.96 7.24 Intr - 94890 94833 58 2 1 80 60 80 0.224 3.78 7.23 Intr - 99504 99365 140 0 2 90 81 106 0.502 9.87 7.22 Intr - 101093 100052 1042 1 1 91 26 697 0.014 54.99 7.21 Intr - 104311 104091 221 1 2 69 -22 174 0.002 2.13 7.20 Intr - 104439 104358 82 2 1 56 80 46 0.020 0.74 7.19 Intr - 119198 118998 201 1 0 65 53 112 0.016 4.32 7.18 Intr - 124462 124306 157 2 1 75 94 65 0.450 5.38 7.17 Intr - 124638 124553 86 2 2 118 37 58 0.051 3.66 7.16 Intr - 125920 125871 50 1 2 46 85 6 0.041 -5.93 7.15 Intr - 126065 125957 109 1 1 111 13 125 0.066 8.09 7.14 Intr - 127095 126921 175 1 1 47 108 69 0.496 4.31 7.13 Intr - 127485 127315 171 1 0 46 68 71 0.150 1.33 7.12 Intr - 127780 127607 174 2 0 81 80 80 0.253 6.93 7.11 Intr - 128337 128268 70 0 1 71 39 -17 0.447 -9.05 7.10 Intr - 129102 129010 93 0 0 66 78 143 0.740 11.76 7.09 Intr - 129548 129453 96 2 0 73 82 133 0.516 11.91 7.08 Intr - 132292 132156 137 2 2 134 80 183 0.484 22.90 7.07 Intr - 134605 134472 134 0 2 63 49 38 0.718 -1.80 7.06 Intr - 136402 136247 156 0 0 43 44 125 0.580 3.24 7.05 Intr - 136740 136531 210 2 0 6 103 113 0.036 3.15 7.04 Intr - 139691 139524 168 1 0 82 35 72 0.015 0.88 7.03 Intr - 142345 142282 64 2 1 51 59 73 0.168 -1.03 7.02 Intr - 144412 144238 175 1 1 96 82 203 0.985 20.53 7.01 Init - 147061 146870 192 1 0 94 72 143 0.768 12.27 7.00 Prom - 152493 152454 40 -5.81 8.00 Prom + 154028 154067 40 -4.61 8.01 Init + 155588 155853 266 1 2 37 103 317 0.481 24.58 8.02 Intr + 155859 155974 116 0 2 -67 4 189 0.217 -4.41 8.03 Intr + 155995 156247 253 2 1 17 103 319 0.462 23.33 8.04 Intr + 156575 156723 149 2 2 -35 4 216 0.080 1.29 8.05 Intr + 156744 156946 203 1 2 0 15 244 0.236 7.83 8.06 Intr + 160648 160828 181 0 1 67 98 13 0.527 0.16 8.07 Term + 161918 162084 167 0 2 93 41 108 0.784 4.99 8.08 PlyA + 162910 162915 6 1.05 9.11 PlyA - 163645 163640 6 1.05 9.10 Term - 163727 163698 30 2 0 107 46 2 0.720 -3.96 9.09 Intr - 164107 164045 63 1 0 136 94 51 0.933 9.91 9.08 Intr - 164674 164596 79 0 1 79 83 104 0.984 9.05 9.07 Intr - 166057 165894 164 1 2 95 45 84 0.904 4.09 9.06 Intr - 166565 166397 169 1 1 67 46 50 0.797 -0.84 9.05 Intr - 167048 166974 75 1 0 80 84 31 0.665 1.02 9.04 Intr - 167835 167691 145 1 1 76 95 207 0.998 20.05 9.03 Intr - 168430 168374 57 2 0 122 102 76 0.991 11.75 9.02 Intr - 179926 179861 66 2 0 102 75 85 0.588 7.97 9.01 Init - 181200 181188 13 0 1 123 100 15 0.983 6.40 9.00 Prom - 183394 183355 40 -1.61 10.00 Prom + 183961 184000 40 -6.10 10.01 Sngl + 185746 186657 912 0 0 66 40 333 0.460 23.05 10.02 PlyA + 187292 187297 6 -0.45 11.00 Prom + 187685 187724 40 -5.91 11.01 Sngl + 187902 188126 225 2 0 31 36 226 0.794 7.14 11.02 PlyA + 188645 188650 6 1.05 12.04 PlyA - 189208 189203 6 1.05 12.03 Term - 190271 190152 120 2 0 68 44 48 0.464 -2.82 12.02 Intr - 190789 190560 230 2 2 98 33 192 0.112 12.82 12.01 Intr - 200450 200280 171 0 0 55 45 114 0.456 4.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 126065 125942 124 1 1 111 55 129 0.923 10.06 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576r:19751170_19952261|GENSCAN_predicted_peptide_1|807_aa MEAVEAAAPAGGWDGLALRGLPGPFSRGLLEGPSVEQRGPGVRNQARRSEERALPTLRMY KRPGQRRAVRASARDARARGLRSLGSVPLLGSPATIVSQLQGAVGHLLGWVLSCCAKEQR TLRPWGSCQWPVREPKLPSPHPGAPGSEWHHRPPPPDSVGWAGLGAVRLGTRSRRRRRPG RRAPMDARSPLSPRASAFSIASLVAAEAAERTAHQGSGSSDRVKLRWLPGSPAGMHFSTV TRDMEAFTASSLSSLGAAGGFPGAASPGADPYGPREPPPPPPRYDPCAAAAPGAPGPPPP PHAYPFAPAAGAATSAAAEPEGPGASCAAAAKAPVKKNAKVAGVSVQLEMKALWDEFNQL GTEMIVTKAGRPSLLSPGTQALLRTAAAQPLAQASPARALSALGFTREPTSRSTLQPAEA SARVTQGLITPRRMFPTFQVKLFGMDPMADYMLLMDFVPVDDKRYRYAFHSSSWLVAGKA DPATPGRVHYHPDSPAKGAQWMKQIVSFDKLKLTNNLLDDNGHILSPAPPLELIPHLVFQ IILNSMHRYQPRFHVVYVDPRKDSEKYAEENFKTFVFEETRFTAVTAYQNHRITQLKIAS NPFAKGFRDCDPEDWPRNHRPGALPLMSAFARSRNPVASPTQPSGTEKDAAEARREFQRD AGGPAVLGDPAHPPQLLARVLSPSLPGAGGAGGLVPLPGAPGGRPSPPNPELRLEAPGAS EPLHHHPYKYPAAAYDHYLGAKSRPAPYPLPGLRGHGYHPHAHPHHHHHPVSPAAAAAAA AAAAAAAANMYSSAGAAPPGSYDYCPR >gi568815576r:19751170_19952261|GENSCAN_predicted_CDS_1|2424_bp atggaggcggtggaggcggcggctccggctggaggctgggacggcctcgcgcttcgaggg cttccgggcccctttagccgagggctcctggagggtcccagcgttgagcaacggggtccg ggggtccgaaatcaagcgaggaggtcggaggagcgggcactgcccaccctccggatgtat aagcgtcctggccagaggcgggcggtgcgcgcaagtgcgcgtgacgctcgggctcgcggg ctgcgcagcctgggcagcgtccctctgctggggtctccagcgacgatagtgagccaactt cagggggctgttgggcatttactgggctgggtcctgtcctgctgcgccaaggaacagagg acgctgcggccctggggctcctgtcagtggccggtgcgagagcccaagctgccctctcct cacccaggggcccctggcagtgagtggcaccacaggcccccgcccccggactccgtgggc tgggctgggctgggcgcggtgcggctgggcacacgcagtcggaggcggcgccggccaggc cgccgggcgcctatggacgcgcggagcccgctgtctccccgagccagtgcgttcagcatc gcctctctggttgcagcggaggcggcggagcgcaccgcccaccagggctcagggtcctcc gaccgggtgaagcttcgctggctgccaggatccccggcagggatgcacttcagcaccgtc accagggacatggaagccttcacggccagcagcctgagcagcctgggggccgcggggggc ttcccgggcgccgcgtcgcccggcgccgacccgtacggcccgcgcgagcccccgccgccg ccgccgcgctacgacccgtgcgccgccgccgcccccggcgccccgggcccgccgccgccg ccgcacgcctacccgtttgcgccggccgccggggccgccaccagcgccgccgccgagccc gagggccccggggccagctgcgcggccgcagccaaggcgccggtgaagaagaacgcgaag gtggccggtgtgagcgtgcagctagagatgaaggcgctgtgggacgagttcaaccagctg ggcaccgagatgatcgtcaccaaggccggcaggccctccttgctgtccccaggcacacag gccctcctgaggacagcggcagcccagccgctggctcaggccagcccggcaagggccttg tcggcgctgggcttcacccgggagcccacgtcccgcagcaccctccagcctgcagaggct tcggctagggtgacccaaggcctcatcacccccaggcggatgtttcccaccttccaagtg aagctcttcggcatggatcccatggccgactatatgctgctcatggacttcgtgccggtg gacgataagcgctaccggtacgccttccacagctcctcctggctggtggcggggaaggcc gaccctgccacgccaggccgcgtgcactaccacccggactcgcctgccaagggcgcgcag tggatgaagcaaatcgtgtccttcgacaagctcaagctgaccaacaacctactggacgac aacggccacatcctcagcccagccccaccgctggagctgattccccaccttgtcttccag attattctgaattccatgcacagataccagccccgcttccacgtggtctatgtggaccca cgcaaagatagcgagaaatatgccgaggagaacttcaaaacctttgtgttcgaggagaca cgattcaccgcggtcactgcctaccagaaccatcggatcacgcagctcaagattgccagc aatcccttcgcgaaaggcttccgggactgtgaccctgaggactggccccggaaccaccgg cccggcgcactgccgctcatgagcgccttcgcgcgctcgcggaaccccgtggcttccccg acgcagcccagcggcacggagaaagacgcggctgaggcccggcgagaattccagcgcgac gcgggcgggccagcagtgctcggggacccggcgcatcctccgcagctgctggcccgggtg ctaagcccctcgctgcccggggccggcggcgccggcggcttagtcccgctgcccggcgcg cccggaggccggcccagtcccccgaaccccgagctgcgcctggaggcgcccggcgcatcg gagccgctgcaccaccacccctacaaatatccggccgccgcctacgaccactatctcggg gccaagagccggccggcgccctacccgctgcccggcctgcgtggccacggctaccacccg cacgcgcatccgcaccaccaccaccaccccgtgagtccagccgccgcggccgccgccgcc gctgccgcagctgccgcggccgccaacatgtactcgtcggccggagccgcgccgcccggc tcctacgactattgccccagataa >gi568815576r:19751170_19952261|GENSCAN_predicted_peptide_2|66_aa MEYYAAIKNEELMSFIGTWMKLETIILSKLSQGQKTKHRMFSLIVREDWAFLEQGATRLP PEELGS >gi568815576r:19751170_19952261|GENSCAN_predicted_CDS_2|201_bp atggaatactatgcagccataaaaaatgaagagctcatgtccttcatagggacatggatg aaactggaaaccatcattctcagcaaactatcgcaaggacaaaaaaccaaacaccgcatg ttctcactcatagtgagggaggactgggccttcctggagcaaggggccacacgtcttcct cctgaggagctaggatcctaa >gi568815576r:19751170_19952261|GENSCAN_predicted_peptide_3|239_aa MERPAAAAGPGLCSGALPGIISRHRLTPRCLRAKPNLPAGTPTARGGGRFSWSQTQGRLL ACSCPPASGVGPAAPLTAPGNQRGPLHLHRRQRLGMLEHTAPATQDPTLGRESYSTEHPK ATPAGCEDTTRRTVPRAGAHSHENLDLNRKSPQPRCHPDAALWGPEQRTSYVVPVPRHME ILPIRTGWTLEEKQQGPPLQRSWAPGGHPCPTEPGQYPSQVVSKLGPGAAFPASTDIPV >gi568815576r:19751170_19952261|GENSCAN_predicted_CDS_3|720_bp atggagcgcccagcagcggcggcgggtcctgggctgtgctccggagcccttcccgggatc atctcccggcatcggctgacccctcggtgtctacgtgccaagcccaatctgcctgcaggg acacccactgccagaggcggaggcaggttctcctggtcccagacccagggccgtctgttg gcctgcagctgcccaccggccagtggagtgggacctgctgcccctctgacagctcctggt aatcagaggggcccactccacctgcaccggaggcagcggctggggatgctggagcatacg gctccagccactcaggatcccactctgggccgtgaaagttactccacagaacaccccaag gccacccctgctggctgtgaagacaccaccaggcggacagtgcccagggctggggctcac agccacgagaatctggatttgaacagaaaaagcccccagccgcgctgccaccctgacgca gccttgtggggccctgagcagaggaccagctacgttgtgcccgtgcctcgacacatggaa attcttccaataagaacaggatggacactcgaggaaaagcagcagggcccccctctccaa aggtcctgggcacctggtggccacccttgtccaacggagccaggccagtacccgagccag gtggtctccaagcttgggccaggggccgctttcccggccagcactgacattcccgtttga >gi568815576r:19751170_19952261|GENSCAN_predicted_peptide_4|97_aa MEFVFLTLVSFGVVVRGTHELTNPGIAEVTIRPDRKILATAGWDHRIRVFHWRTMQPLAV LAFHSAAVQCVAFTADGLLAAGSKDQRISLWSLYPRA >gi568815576r:19751170_19952261|GENSCAN_predicted_CDS_4|294_bp atggagtttgtatttctcaccctcgtcagctttggggtggttgtgcgtgggactcatgaa ctcaccaatcccgggatcgccgaggtcacgatccggccagatcgcaagatcctggccacc gcaggctgggaccaccgcatccgcgtgttccactggcggacgatgcagccactggccgtg ctggccttccacagcgccgctgtccagtgcgtggccttcaccgccgatggcttgctggcc gcgggctccaaggatcagcggatcagcctctggtcactctacccacgcgcatga >gi568815576r:19751170_19952261|GENSCAN_predicted_peptide_5|228_aa MPKGKKAKGKKVALAPAVMKKQEAKKVVNPLFEKRPKNFGIGQDIQPKRDLTHFVKWPRY IRLQRQRAILYKRLKALDCQTATQLLKLAHKYRPETKQEKKQRLLAWAEKKAAGKRDVPT KRPPVLRAGVNIVTTLVENKKAQLVVIADDVDPIELVVFLPALCRKMGVNSEDKGTLAKL MEAIRTNYNDRYDEIRRHWGGNILGPKSVARIAKLEKAKAKELATKLG >gi568815576r:19751170_19952261|GENSCAN_predicted_CDS_5|687_bp atgccaaaaggaaagaaggccaagggaaagaaggtggctctggcccctgctgtcatgaag aagcaggaggctaagaaagtggtgaatcccctgtttgagaaaaggcctaagaattttggc atcggacaggacatccagcccaaaagagacctcacccactttgtgaaatggccccgctat atcaggttgcagcggcagagagccatcctctataagcggctgaaagccctggactgccaa acagctactcagctgcttaagctggcccacaagtacagaccagagacaaagcaagagaag aagcagagactgttggcctgggctgagaagaaagctgctggcaaacgggacgtccccacc aagagaccacctgtccttcgagcaggagttaacatcgtcaccaccttggtggagaacaag aaagctcagctggtggtgattgcagacgacgtggatcccatcgagctggttgtcttcttg cctgccttgtgtcgtaaaatgggggtgaactcggaagacaaaggcactttggctaagctg atggaagctatcaggaccaattacaatgacagatacgatgagatccgccgtcactggggc ggcaacatcctgggtcctaagtctgtggctcgtatcgccaagctcgaaaaggcaaaggct aaagaacttgccactaaactgggttaa >gi568815576r:19751170_19952261|GENSCAN_predicted_peptide_6|122_aa MAQQLLLLRERPLSIQARASEATSTATKRPLPEESLKDALFGFQKEVLRCKNTQGNAHTR HAIIDVRTGSIRDVSTPGGPGRLEASFKVHKANTDRPPAKGNRTGLSAPAGDHGDLQDDL KG >gi568815576r:19751170_19952261|GENSCAN_predicted_CDS_6|369_bp atggcccagcagctgctgctgctcagagaacggcccctcagcatccaggcccgtgcatca gaggccaccagcacagccaccaagaggcccctgcctgaggaatccctaaaggatgcgctt tttggcttccagaaggaggtcttgagatgtaagaacacccagggcaacgctcacacacga catgcaatcatcgatgtgagaactggctccataagagatgtgtctacgcctggagggcca ggaaggttggaggccagcttcaaagtgcataaagcaaatacagacaggcccccagcaaaa gggaacaggacaggcctctctgcgcctgctggggaccatggtgacctgcaggatgactta aagggctaa >gi568815576r:19751170_19952261|GENSCAN_predicted_peptide_7|2145_aa MDGWGVGCGTGPWGVVPSFPKLGSTVLLSWHPTDVALECAGFLTGIGLDTTIMMRSIPLR GFDQQMSSMVIEHMASHGTRFLRGCAPSRVRRLPDGQLQVTWEDSTTGKEDTGTFDTVLW AIGLLQRRPLQMGRRKEGCGSSSGHCLLACTRQATVFAQLCSADGSRTPQTQLHKSLRIV PLLSARQQERAESKNQAGDGFANVPVVAISEGTFLGACGHTLHHGQQLFFALYVVGMACL QDGWSVRPPLAPTLTTALLAEMILLEGSFMEDAADVQLGPLATVHKQKLARVEALGGDGQ LCLLLKLVWILETISAGACPSRTSQASVQRRVWVQRQMFCSGTEQAPGSAGPGGLDTRLE RDQTAERRVPDTRSLNLEKAGVDTSPDTQKILVDSREATSVPHIYAIGDVVEGRPELTPI AIMAGRLLVQRLFGGSSDLMDYDNVPTTVFTPLEYGCVGLSEEEAVARHGQEHVEAQALT LEERGFHRASLLWGSALAAPDTLSSPSGVLKAFVQQLRLRCGHSGLAPVRAKQTPGCLRE PRWPPAHPFLPPPFVVEHGSSHWLPARERACTHPVASQALIPIPDQTFRPDGWFLPAMTL PCPCSLWELQVPRGRVQGGCRVDATQPPPGLVAEMCGFQMVCLREPPQLVLGLHFLGPNA GEVTQGFALGIKCGASYAQVMRTVGIHPTCSEEVVKLRISKRSGLDPTARAHGAPAASSS EARPRMAAGQVWGASTLSWSACEMVSVERKCWTGSWCQMMTTWVETYPVGTHVRAPWHFC NANKEGTFSEVCSACGVPDGPPGSPRGFWSPACPGPQPILCAYEDSEGAVCGAQQSILGP SLPDPSPDMVYGAQVTLIPKPKLAFPEPSLTSHLGLSREAHNYIHNKEAQRPCLQPPGMQ PAATSNRALTRFPALAMSDERHSAGTPFQGAHEAQQPRYAHIHPDTCSPPGLARPRSSPL DAQQHPSVWLTPGHMPRGRCRQQGPRIPIWAAANYANAHPWQQMDKASPGVAYTPLVDPW IERPCCGDTVCVRTTMEQKSTASGTCGGKPAERGPLAGHMPSSRPHRVDFCWVPGSDPGT FDGSPWLLDRFLAQLGDYMSFHFEHYQDNISRVCEILRRLTGRAQAWAAPYLDGDLPLPD DYELFCQDLKEVVQDPNSFAEYHAVVTCPLPLASSQLPVAPQLPVVRQYLARFLEGLALD MGTAPRSLPAAMATPAVSGSNSVSRSALFEQQLTKESTPGPKEPPVLPSSTCSSKPGPVE PASSQPEEAAPTPVPRLSESANPPAQRPDPAHPGGPKPQKTEEEVLETEGDQEVSLGTPQ ELECEEGCLEEVATLILGDEWDSILAKNDVSCGITSSSGQLEGLDAGRAYLSISCWLQQL TVWLQPPGRASGRLASVLIRMSVDVGKHIAGMHMPHGLGFMFLLAWRTSQFNIFMLERLC YFIEKLVLTPFMSLDENYDFELHPCNCILPSMTAPCPPPPPDPQFVLRGTQSPVHALHFC EGAQAQGRPLLFSGSQSGLVHIWSLQTRRAVTTLDGHGGQCVTWLQTLPQGRQLLSQGRD LKLCLWDLAEGRSAVVDSVCLESVGFCRSSILAGGQPRWTLAVPGRGSDEAELLVLSGAE ALLSEAKLLVLSGAEALLAEAELLWGPLARAACTRSAVALVRVLEVQILEMPSKTSVCAL KPKADAKLGMPMCLRLWQDLIYRRATQATPHPPAAGNLCSCLSVPTALCFLLLQGASCQC PRRGLASASSSGSRSSSFASSGGLTPQLVVPSSERPALAQGVAALLGAAHVQGCRASDPV HSLAKSPSALSCLGSPRFVCVNSVIRTQWFYKSFSGTCMRTTGPALASGSHAHILQGTAD CSSRPLLLAGYEDGSVVLWDVSEQKVCSRIACHEEPVMDLDFDSQKARGISGSAGKALAV WSLDWQQALQPDSKALKTQTASLRAPGGLFSAVSTSVPRTPGRGPPAVLAQICLVSPEAL EIAGSVKPQPQGKPVGALSEVSCLRAFLPPGSKQGDRRSSAPTGVLTHRLHCDSLAESHV GTAVSPSHPPAEASDEPRSVSPRRVLGVETRMRFLEQSMGNRNQMVASPHHAFLAPEVMR YESHWWPEILPGRPRNTGLEQALVFLQEAPPGFPHHPHFLREAGP >gi568815576r:19751170_19952261|GENSCAN_predicted_CDS_7|6438_bp atggatgggtggggtgtgggctgtggcacagggccctggggtgttgtcccatcatttcca aaattgggatctacagtgctccttagctggcaccccacagatgtggccctggagtgtgct ggcttcctcaccgggattgggctggacaccaccatcatgatgcgcagcatccccctccgc ggcttcgaccagcaaatgtcctccatggtcatagagcacatggcatctcatggcacccgg ttcctgaggggctgtgccccctcgcgggtcaggaggctccctgatggccagctgcaggtc acctgggaggacagcaccaccggcaaggaggacacgggcacctttgacaccgtcctgtgg gccataggcctgctgcagaggaggccattgcagatgggccgccgcaaggagggctgcggg tcctcgtcagggcactgtttgctggcatgtacacggcaggcaactgtctttgctcagctg tgctcggctgatggctcccgtacaccgcagactcagctacacaagtccctgcggattgtc cctctgttgagtgccaggcagcaggaaagggcagaaagcaagaaccaagctggggatggg ttcgccaacgtgccagtggttgccatctctgaagggacgttccttggggcttgcggccat actctgcaccatggtcagcagctcttctttgccttatatgtagtgggcatggcctgcctt caggatggctggtcagttcggccacctctggctcccacattgaccacagctctgttggct gagatgatcctcttggagggcagcttcatggaggacgctgccgacgtccagctgggccct cttgctacagtacacaaacagaaattggcccgtgtggaggcccttggtggtgatggacag ctctgtctgctgcttaaactggtgtggatcctggagaccatctcagcaggggcatgtcct tcaaggaccagccaggcaagcgtgcagaggagagtgtgggtgcagcgtcagatgttctgt tccggcacggagcaggcaccaggaagtgctgggcctggtgggctggacaccaggttggag agggaccagacggctgagcgtcgagtcccagacaccagaagtctgaatttggagaaggct ggggtagatactagccccgacactcagaagatcctggtggactcccgggaagccacctct gtgccccacatctacgccattggtgacgtggtggaggggcggcctgagctgacacccata gcgatcatggccgggaggctcctggtgcagcggctcttcggcgggtcctcagatctgatg gactacgacaatgttcccacgaccgtcttcaccccgctggagtatggctgtgtggggctg tccgaggaggaggcagtggctcgccacgggcaggagcatgttgaggctcaggcactcacc cttgaggaaaggggttttcatcgagcaagcctgctttgggggtctgctctggcagccccg gacacgctgtcgagcccttcaggggtgctgaaagcctttgtccaacagctccggctccgg tgtggccacagcggccttgctccagtgagggccaagcaaacaccagggtgcctgagggag cccaggtggcctcctgctcacccattcttgcctccaccatttgttgtggagcatggctcc agccactggctgcctgcacgtgagagggcctgcacacaccccgtggccagccaagccctc atccccatcccagaccagaccttcaggccagatggatggttcctgcccgccatgaccctg ccttgcccatgcagcctgtgggagctgcaggtgccacgaggcagggtgcagggtggatgc agggtggacgcaacccagccccctcctgggctggtggctgagatgtgcggctttcagatg gtgtgcctgagggagcccccacagctggtgctgggcctgcatttccttggccccaacgca ggcgaagttactcaaggatttgctctggggatcaagtgtggggcttcctatgcgcaggtg atgcggaccgtgggtatccatcccacatgctctgaggaggtagtcaagctgcgcatctcc aagcgctcaggcctggaccccacggccagggcacacggtgcgcccgccgccagctcctcg gaggccagacccaggatggctgcaggccaggtttggggggcctcaaccctctcctggagc gcctgtgagatggtcagcgtggagcgcaagtgctggacaggctcctggtgccagatgatg acgacctgggtggaaacctaccctgtgggcacccatgtccgagccccctggcatttctgc aatgcaaataaagagggtactttttctgaagtgtgctctgcctgcggtgtccctgatggt cccccaggatcgcccagagggttctggagccctgcttgtcctggcccccagccaatcctg tgtgcctatgaggacagcgagggggccgtctgtggggcccagcaaagcatcttaggcccg agcctgcctgatcccagccctgacatggtgtacggtgctcaggttaccctaatcccaaaa cccaagctggcgttcccagagccttcccttacttcccacctcggcctgagtcgggaagca cacaattatatccacaacaaggaggcccagaggccctgcctgcagcccccaggtatgcag cctgcagctacgtcgaaccgggccctaacaaggttccccgcactggcaatgagtgatgag aggcactccgcgggcacgccgttccagggcgctcacgaggcccagcagccccgctatgcc cacatccaccccgacacctgcagccccccaggcttagcccggccccgcagcagccctctc gacgcccagcagcaccccagtgtctggctgacgccagggcacatgcctcgtggccggtgc cgtcagcagggccctcgcattcccatctgggcagccgccaattatgccaacgcacatcca tggcagcagatggacaaggcgtctcctggggtggcatacaccccccttgtggatccctgg attgagcggccctgctgtggggacaccgtgtgtgtgcgaacgaccatggagcagaagagc acagctagtggcacttgtggcggtaaacctgcagaaaggggtcccctagctgggcacatg cccagctcccgaccccacagggtggacttctgctgggtaccaggctcagacccaggcacc tttgatggctccccgtggctactggaccgcttcttggcccagctgggcgattacatgtcc ttccactttgagcactatcaggacaacatcagccgtgtctgcgagatcctcaggcgccta acaggccgagcccaggcctgggcagccccctaccttgatggggacctgcccctgcctgac gattacgagctcttctgccaggatctcaaggaagttgttcaagacccgaacagttttgct gagtaccatgctgtggttacctgtcccctgcccctggcctccagccagctgccagtggcc cctcagctgcctgtggtgaggcaatacttagctaggttcttagagggcctggcactcgac atgggtactgcccccaggtctttaccagccgccatggccacccctgctgtgtctgggtcc aactctgtatctagaagtgctctgttcgagcagcagctgaccaaggagagcacccctggg cccaaggagcccccagtcctgcccagttctacatgtagctccaagcctggtcctgtggaa ccagcctcttcccagccagaggaggcagcccccacacctgtccctagactgtcggagtca gctaatcctcctgcccagagaccagacccagctcatccaggaggtccaaaaccccagaaa acagaggaggaggttttggagacagagggagaccaggaggtgtccttaggtaccccacag gagctggaatgtgaggaaggctgcctggaggaagtggccactttgattctgggggatgag tgggatagcatcttggcaaagaatgatgtttcctgtgggattaccagcagttctgggcag ctggaaggtttggatgcaggaagggcctacctcagcatcagctgctggctgcagcaactc acagtctggctgcagccacccggcagagcgagtgggaggctggcctctgtcctcattcgc atgtctgtggatgtgggcaagcacattgctggcatgcatatgcctcatggccttggcttc atgttcttgctggcctggaggaccagccaattcaacatcttcatgctagaacggctgtgc tacttcatagagaaattagttttgactcccttcatgtctcttgatgagaattatgacttt gagcttcatccatgtaactgcatcctgcccagcatgacggccccctgcccgccgccacct ccagacccccagtttgtcctccgaggcacccagtcaccggtgcatgcgctgcacttctgc gaaggagcccaggctcaggggcgcccgctcctcttctcagggtctcagagtggcctggta cacatctggagcctgcagacgcggagagcggttaccaccctggatggccacggcggccag tgtgtgacctggctgcagacgctgccccaggggcgccagctcctcagtcagggccgggac ctgaagctgtgcctgtgggacctcgcggagggcaggagcgctgtcgtggactccgtgtgc ttggagagtgtgggcttctgccggagcagcatcctggccgggggccagccacgctggacg cttgccgtgccagggaggggcagcgacgaggctgagctcctggtgctgagtggagcagaa gccctgctgtcagaggctaagctcctggtgctgagtggagcagaagccctgctggcagag gctgagctgctgtgggggcccctggcaagagcagcctgcacaagaagcgccgtggccctg gtccgcgtactggaggttcagattctggagatgccctccaagacgtcagtgtgcgccctg aagccgaaggcagatgccaagctgggcatgcccatgtgcctgcggctgtggcaggatctg atttatcgtcgagccactcaggcaacaccccacccccctgccgcagggaacctgtgcagc tgcctcagtgtccccacggccctctgcttcctgctgctgcaaggggcctcctgtcagtgc cccaggagaggcctggcttcagccagcagcagtggcagcaggtcctcgtcctttgcaagc tccggtgggctcaccccgcagctcgtggtgccctccagtgagcggccagcactggctcaa ggtgtagctgccctcctcggggccgctcatgtccagggctgcagagcaagtgatcccgtg cattcacttgcaaagagcccctcagccctgagctgcctcggaagcccacgttttgtctgt gtgaactctgttattagaacccagtggttttacaagtctttttcagggacatgtatgagg accactggaccagctctggcctcgggttcacatgcccacatcctgcagggcacagccgac tgcagctcccgcccactccttctggccggctatgaggatggatcggtggtcctgtgggac gtctctgagcagaaggtgtgcagccgcatcgcctgccatgaggagcccgtcatggacctt gactttgactcccagaaggccaggggcatctcaggctccgcggggaaggcgctggctgtc tggagcctggactggcagcaggccctgcagcctgacagcaaagcgctgaagacccagaca gcttccctgagagcccccggagggctcttctctgcagtttccacctctgtgcccaggacc ccaggaaggggccctcctgccgtcctggcccagatctgcctcgtctctcctgaggccctg gagattgcaggttctgtaaagccccagcctcagggaaagcctgtgggggcgctttcggag gtgagctgcctgagagcgttcttgccacccgggagcaagcaaggtgaccgaaggtcctca gcacccacaggtgtcctcacccaccgcctgcactgtgactccttggctgagagccacgtg ggcacggctgtgtccccctcacaccctcctgccgaagcctctgatgagcccagatcagtc tccccccggcgggtgctgggagtagagacgcgaatgaggttcctggagcagagcatgggt aatcggaaccagatggtggcttcgccgcaccacgcattcctggcacctgaggtcatgcgc tacgagagccactggtggccagagatccttcctggaaggcccaggaacacagggctggag caggccctagtctttctccaggaggccccaccaggcttcccgcatcatcctcatttcctg cgggaagctggtccttga >gi568815576r:19751170_19952261|GENSCAN_predicted_peptide_8|444_aa MEGSRQSMETSTSSAQSQESVGAVGSSDHSQESVGAVGSSDRSQEGVGAVDSSDGSQESV GTVSSSDRSQESVGTVGSSDRSQESVGAPSSDRSQESVGAVDSSDGSQESVGAMGSSDGS QESVGTMGECGRTMSSSDRSQESVGAVSSSDRSQESVGAVGSSDRSQEGVGAVDSSDRSQ ESVGTVSSSDRSQESVGTVGSSDRSQESVGAPDRSQESVGAVGSSDRSQESVGTVGSSDR SQESVGAMGSSDRSQESVGAVGECGHTMGSSDSSQESVGAVGSSDSSQESVGAVGSSDRS QESVGAVGSSDRSQESVGAVGSSDCSQESTHQTDRGMQPQGATPGFWGIQTQREEVGREQ VSKSGPAELRCVCAAGVAYKASQRRGGTGAHLLSLLQLPPNPSPRGPFLNSAPTTHKTGP GVIHLSVKNSDKTGKNWEIQNYGN >gi568815576r:19751170_19952261|GENSCAN_predicted_CDS_8|1335_bp atggaaggcagcaggcagtcgatggaaaccagcacaagctcagcacagtctcaggagagt gtgggcgccgtgggtagcagtgaccactctcaggagagtgtgggcgccgtgggtagcagt gaccgctctcaggagggtgtgggcgccgtggatagcagtgacggctctcaggagagtgtg ggcaccgtaagtagcagtgaccgctctcaggagagtgtgggcaccgtgggtagcagtgac cgctctcaggagagtgtgggcgcacctagcagtgaccgctctcaggagagtgtgggcgcc gtggatagcagtgacggctctcaggagagtgtgggcgccatgggtagcagtgacggctct caggagagtgtgggcaccatgggagagtgtgggcgcaccatgagtagcagtgaccgctct caggagagtgtgggcgccgtgagtagcagtgaccgctctcaggagagtgtgggcgccgtg ggtagcagtgaccgctctcaggagggtgtgggcgccgtggatagcagtgaccgctctcag gagagtgtgggcaccgtaagtagcagtgaccgctctcaggagagtgtgggcaccgtgggt agcagtgaccgctctcaggagagtgtgggcgcacccgaccgctctcaggagagtgtgggc gccgtgggtagcagtgaccgctctcaggagagtgtgggcaccgtgggtagcagtgaccgc tctcaggagagtgtgggcgccatgggtagcagtgaccgctctcaggagagtgtgggcgcc gtgggagagtgtgggcacaccatgggtagcagtgacagctctcaggagagtgtgggcgcc gtgggtagcagtgacagctctcaggagagtgtgggcgccgtggggagcagtgaccgttct caggagagtgtgggcgccgtgggtagcagtgaccgctctcaggagagtgtgggcgccgtg ggtagcagtgactgctctcaggagagtacacatcaaacagacaggggcatgcagccccag ggagcaacacctgggttctgggggatccaaactcagagagaggaagtggggagagaacag gtcagcaaatcagggccagcagagctcaggtgtgtctgcgcggcaggggtggcctacaag gccagccagaggagaggaggcacaggggcccatttgctttccctgctgcagctgcccccc aacccctcaccccggggccccttcctcaactctgcccccacaacccacaagactgggcca ggagttattcacctctctgttaaaaacagcgataaaactggcaagaactgggagattcag aattatggaaattaa >gi568815576r:19751170_19952261|GENSCAN_predicted_peptide_9|286_aa MEDQGQRDYDLLVVGGGSGGLACAKEAAQLGRKVAVVDYVEPSPQGTRWGLGGTCVNVGC IPKKLMHQAALLGGLIQDAPNYGWEVAQPVPHDWRKMAEAVQNHVKSLNWGHRVQLQDSS LGVGAGAWCFPLEQMSLLGHRSCQPLGISSTEAYAPLAYNLLLGLSCQHSRCLKLLLDGH QTGPAHCLLRSHLQKEAGAHLGCFCFFTLLKSAAMSIARLCPVAASWLSRKVKYFNIKAS FVDEHTVCGVAKGGKEILLSADHIIIATGGRPRYPTHPQNVGTFRI >gi568815576r:19751170_19952261|GENSCAN_predicted_CDS_9|861_bp atggaggaccaaggtcagcgggactatgatctcctggtggtcggcgggggatctggtggc ctggcttgtgccaaggaggccgcccagctgggaaggaaggtggccgtggtggactacgtg gaaccttctccccaaggcacccggtggggcctcggcggcacctgcgtcaacgtgggctgc atccccaagaagctgatgcaccaggcggcactgctgggaggcctgatccaagatgccccc aactatggctgggaggtggcccagcccgtgccgcatgactggaggaagatggcagaagct gttcaaaatcacgtgaaatccttgaactggggccaccgtgtccagcttcaggacagctcc ttgggtgtgggtgcaggagcttggtgctttccgctggagcaaatgtccctacttggtcac cgttcctgtcagcccctggggatctccagcacagaggcctatgctcccctggcctacaac cttctcctggggctcagctgccagcacagcagatgcctgaaactgctcctggacgggcac caaacaggcccagcccactgcctgctccggagccacctgcagaaggaggctggggcgcac ctgggctgtttctgctttttcactcttctgaaaagtgctgccatgagcattgcccggctg tgtcccgtggcagcttcctggctgtcgagaaaagtcaagtactttaacatcaaagccagc tttgttgacgagcacacggtttgcggcgttgccaaaggtgggaaagagattctgctgtca gccgatcacatcatcattgctactggagggcggccgagataccccacgcaccctcagaat gttggaacattcaggatataa >gi568815576r:19751170_19952261|GENSCAN_predicted_peptide_10|303_aa MLEPPNAQGVQVCRYTPLNLTFLKEFKDACTQYVPTSPYVKTVLQTLCTEVILLPIDCDL LAKAVLTPSQHLQFHTWWSEEARLQAQLNRADGIPITQAQLTGSDNYSDTTAHLGFDALT TEQVTKVCMRAWDKLRTPGQASVSFTIVKQGHNELYLDFLAKLQDAVEKSVSDEHTQGIL LHMLAFQNANHECKMAVHSILRQNLPDHEVLPAYIKACEGVGSETHRAILWAPAMKHGNQ TGSTDSFLGACYNCGQLGHTQKNYTVKNLKAAKPAQQTRPNAPATVCLRCRKGKHWASTC HSV >gi568815576r:19751170_19952261|GENSCAN_predicted_CDS_10|912_bp atgctggaacctcctaatgctcaaggggtacaagtgtgtcgatatacgccgctcaatctt acctttttaaaagaattcaaggatgcttgtactcagtatgttcctacttctccatatgtt aaaacggtattacagactctttgtaccgaggtcattttgcttcctatagactgcgacctt ttggcaaaagctgttctaactccatctcagcatttacaattccatacctggtggtcagag gaggcccgcctgcaggctcagctaaatcgggctgatggcattccaattactcaggctcag ctcacaggctccgataattactctgatactactgctcatttaggctttgatgctctcacc acggaacaagtaacaaaggtgtgtatgagagcttgggataaattacgcaccccaggccaa gcttctgtttcttttactattgttaaacagggtcacaatgaattatatcttgatttttta gctaaattacaagatgctgttgaaaaatctgtctctgatgagcacactcaaggtattctt cttcatatgttagcttttcaaaatgcgaaccatgagtgtaaaatggccgtgcattccatc ctacgtcaaaatttacctgatcatgaggtgttgcctgcatatattaaagcttgtgaaggc gttggatcggagacccacagagctattctgtgggcaccggccatgaagcacggcaaccaa actggctcgactgattcttttcttggagcctgctataattgtggccaacttggtcatacc caaaaaaactacactgttaaaaacttaaaagcggccaagccggctcaacaaacacggcca aatgctcctgctactgtttgcctgcgttgtcgtaagggtaaacattgggcaagtacttgc cactcagtctga >gi568815576r:19751170_19952261|GENSCAN_predicted_peptide_11|74_aa MLVFYQEIVNPVWVPTRRLKLRVNTDNKNHEEETFASETALIPDEICTDSSETGMPNQNG SGSILPNGNVNPSN >gi568815576r:19751170_19952261|GENSCAN_predicted_CDS_11|225_bp atgcttgtgttttaccaggagatcgtcaatcccgtctgggtgcccactaggagacttaaa cttcgtgtgaatactgacaataaaaaccacgaggaagagacgttcgcgtcagagactgcc ctcatacctgatgagatctgtaccgactcctcagaaactggcatgccaaatcaaaatggg tctggttcaatcctccctaatggcaatgtaaacccctctaactaa >gi568815576r:19751170_19952261|GENSCAN_predicted_peptide_12|173_aa XARTWIWMGTTIRSTLGCNNRSTQCTQRSVLSPLCLLMRESHGLICDPVVRKNIGCSSLK APAPNPAAFGPGLASALTSPATARPGLDPPRSGAGSAAGSPSCPRSPTTMAAMAVALRGL GGRFRWRTQAVAGGSWYPSNNYSSEDDDIPPPEVTRHRPHWPDFPTSPQTPRL >gi568815576r:19751170_19952261|GENSCAN_predicted_CDS_12|522_bp ngggccaggacttggatctggatgggaaccaccatccgatcaaccctcggatgcaacaac cggagcacacagtgtacacagcgatccgtgttatctccactctgcctgctcatgcgggag tcacatggtttgatttgtgatccagtggtccggaagaatatcggatgcagcagtctgaag gcccccgccccgaatccggccgcattcggccccggtctagccagcgctctcacctctccc gcgacggcccgcccgggactggacccgccccggtccggcgcaggcagcgcggcgggcagc cctagctgccccagaagccccacgacgatggcggcaatggcggtggcgctgcggggatta ggagggcgcttccggtggcggacgcaggccgtggcgggcgggagctggtatcccagtaac aactacagttctgaagatgatgatatcccacctcccgaggtcaccaggcaccggccccac tggccagacttcccaacttctccccagacccctagactctag