GENSCAN 1.0 Date run: 3-Nov-116 Time: 11:20:57 Sequence gi568815589r:38986689_39388064 : 401376 bp : 39.00% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 PlyA - 506 501 6 1.05 1.08 Term - 6791 6458 334 1 1 34 46 245 0.886 8.00 1.07 Intr - 7810 7577 234 0 0 -1 105 188 0.931 7.48 1.06 Intr - 11714 11570 145 0 1 25 100 78 0.567 1.12 1.05 Intr - 48275 48167 109 2 1 23 78 97 0.009 1.14 1.04 Intr - 51957 51792 166 2 1 61 37 112 0.086 2.44 1.03 Intr - 52536 52181 356 0 2 51 26 190 0.219 2.26 1.02 Intr - 55154 54838 317 0 2 4 11 217 0.196 0.46 1.01 Init - 55784 55580 205 2 1 97 44 181 0.552 13.84 1.00 Prom - 57786 57747 40 -5.05 2.04 PlyA - 58693 58688 6 1.05 2.03 Term - 67205 67015 191 0 2 64 43 145 0.436 4.23 2.02 Intr - 73453 73355 99 2 0 81 42 123 0.327 6.16 2.01 Init - 82078 82012 67 1 1 88 86 33 0.712 4.39 2.00 Prom - 86042 86003 40 -5.35 3.14 PlyA - 88460 88455 6 1.05 3.13 Term - 91768 91632 137 2 2 117 42 281 0.999 23.60 3.12 Intr - 92232 92002 231 1 0 115 89 266 0.995 26.22 3.11 Intr - 99135 99048 88 0 1 28 97 77 0.924 1.22 3.10 Intr - 100161 100028 134 1 2 117 74 105 0.918 11.44 3.09 Intr - 101959 101735 225 2 0 60 23 147 0.481 2.43 3.08 Intr - 113462 113223 240 0 0 49 115 171 0.816 12.50 3.07 Intr - 116027 115809 219 0 0 52 97 173 0.879 11.95 3.06 Intr - 117226 117056 171 2 0 64 91 67 0.489 3.59 3.05 Intr - 122599 122472 128 0 2 94 97 126 0.971 13.50 3.04 Intr - 131571 131415 157 1 1 76 15 109 0.325 0.55 3.03 Intr - 146447 146244 204 0 0 73 99 128 0.285 10.65 3.02 Intr - 153950 153831 120 0 0 81 91 132 0.562 12.35 3.01 Init - 158695 158560 136 1 1 89 90 85 0.693 9.16 3.00 Prom - 160385 160346 40 -5.45 4.00 Prom + 161471 161510 40 -5.35 4.01 Init + 167387 167479 93 1 0 67 94 73 0.760 6.23 4.02 Intr + 167934 168257 324 2 0 -21 87 342 0.629 18.25 4.03 Intr + 173180 173295 116 1 2 71 66 30 0.125 -2.57 4.04 Term + 175244 175364 121 2 1 22 41 150 0.160 0.67 4.05 PlyA + 176532 176537 6 1.05 5.06 PlyA - 177275 177270 6 1.05 5.05 Term - 179388 179177 212 1 2 90 47 163 0.398 8.87 5.04 Intr - 184942 184681 262 1 1 109 92 119 0.794 10.54 5.03 Intr - 190814 190630 185 0 2 98 15 254 0.929 17.69 5.02 Intr - 206587 206440 148 1 1 110 92 46 0.822 6.09 5.01 Init - 206641 206633 9 1 0 93 66 18 0.655 -0.39 5.00 Prom - 209330 209291 40 -6.75 6.00 Prom + 210007 210046 40 -9.25 6.01 Init + 211630 211783 154 0 1 58 95 128 0.455 10.70 6.02 Intr + 215604 215959 356 1 2 108 86 133 0.500 9.18 6.03 Intr + 218235 218373 139 2 1 33 61 114 0.201 2.32 6.04 Intr + 226390 226589 200 2 2 49 102 133 0.736 8.95 6.05 Intr + 231087 231211 125 2 2 54 65 87 0.307 1.46 6.06 Intr + 238759 238792 34 1 1 117 67 10 0.257 -0.89 6.07 Intr + 239651 239764 114 1 0 127 72 90 0.365 11.02 6.08 Term + 246091 246303 213 0 0 82 48 84 0.112 0.05 6.09 PlyA + 246993 246998 6 1.05 7.12 PlyA - 247536 247531 6 1.05 7.11 Term - 248732 248673 60 2 0 83 50 37 0.011 -3.77 7.10 Intr - 252498 252305 194 1 2 56 71 122 0.021 5.59 7.09 Intr - 256026 255954 73 0 1 89 82 49 0.014 2.46 7.08 Intr - 261254 261123 132 2 0 73 100 12 0.015 0.82 7.07 Intr - 261747 261659 89 1 2 90 47 22 0.010 -2.93 7.06 Intr - 271644 271490 155 2 2 34 99 190 0.817 13.49 7.05 Intr - 278706 278609 98 0 2 95 55 52 0.641 0.49 7.04 Intr - 280281 280208 74 1 2 103 101 68 0.697 7.81 7.03 Intr - 284834 284735 100 2 1 86 86 33 0.000 1.66 7.02 Intr - 288362 288172 191 0 2 67 32 153 0.001 5.98 7.01 Init - 290678 290603 76 2 1 47 99 36 0.002 1.90 7.00 Prom - 296264 296225 40 -1.45 8.03 PlyA - 296562 296557 6 1.05 8.02 Term - 300313 300219 95 2 2 102 33 82 0.973 1.11 8.01 Init - 301376 301292 85 2 1 88 113 128 0.999 15.98 8.00 Prom - 302123 302084 40 -7.15 9.00 Prom + 304285 304324 40 -5.65 9.01 Init + 305081 305322 242 1 2 64 64 180 0.208 10.59 9.02 Term + 322615 323275 661 1 1 91 47 267 0.376 15.29 9.03 PlyA + 324467 324472 6 1.05 10.00 Prom + 330784 330823 40 -7.35 10.01 Init + 331416 331602 187 2 1 44 55 174 0.795 8.97 10.02 Intr + 332021 332331 311 0 2 60 31 168 0.091 3.41 10.03 Term + 338897 339040 144 1 0 49 37 242 0.272 12.13 10.04 PlyA + 339193 339198 6 1.05 11.00 Prom + 341577 341616 40 -7.75 11.01 Init + 343319 343357 39 1 0 63 94 60 0.275 2.57 11.02 Intr + 358694 358816 123 1 0 31 39 143 0.020 3.56 11.03 Intr + 366747 366967 221 2 2 88 69 90 0.388 3.28 11.04 Intr + 368970 369231 262 0 1 64 116 86 0.462 5.57 11.05 Intr + 371070 371130 61 2 1 94 78 64 0.507 3.39 11.06 Intr + 371168 371300 133 0 1 92 -6 71 0.383 -3.12 11.07 Term + 371386 375121 3736 1 1 115 34 1899 0.957 171.58 11.08 PlyA + 375178 375183 6 -0.45 12.03 PlyA - 375269 375264 6 1.05 12.02 Term - 376103 375986 118 1 1 90 46 51 0.227 -1.87 12.01 Init - 377592 376847 746 0 2 85 89 319 0.610 25.94 12.00 Prom - 379074 379035 40 -7.75 13.00 Prom + 381113 381152 40 -8.45 13.01 Init + 382052 382229 178 1 1 78 42 158 0.119 9.67 13.02 Intr + 384542 384922 381 0 0 -10 89 422 0.014 26.46 13.03 Term + 395006 395115 110 0 2 94 41 49 0.021 -1.51 13.04 PlyA + 395519 395524 6 1.05 14.02 PlyA - 395772 395767 6 1.05 14.01 Term - 396570 396288 283 2 1 123 47 138 0.892 7.11 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 40028 40234 207 1 0 74 94 128 0.870 10.87 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:38986689_39388064|GENSCAN_predicted_peptide_1|621_aa MPEPPTHSMGSCAARASPTSTTPCSTAPGPIDHPRAEECERTARDWQAAPPAAPVRDPLG EASWAPESAVTLTVKICSFTPEPSETTSPPGGTNNSRRATLRAVTLTTKVCSFTPEPARP RTHQKEETPNTCERQKGQTSRCATLKAVTLTARVRGFILEVSETKNPPISGTVLRVTWIR KQMKITVLNTPSPQNHFGFLSDAVALQAISTVIVLANAVETTTVKMIPNVVCQRIQVTVF VQPVKSKEQKKTDREMSLRVLTMSVCMCFTKIIRTMAVLVHFHTADKDISETGSLSQHVG IMGAIIQDEIRVGTQRNHTNGSEDRETESYAGQCCPQCIFKIQEGHNSAIHKTEGEHQAV QALLDGRGLEKEKHGSQITSRLMRGSMKNGQPREKVIGHLFRFFLAFLYDIPFPRCKAGP LWNQGLVICYQTRKYEGRWLTRHSQEEQLPPRDQDIGKTGVLLADLQREGFESRQREDTD VGLKGEEGGNPARGNRAPGLIPGPQNLLEESVTIGTENRGPTPVLSGYYHPRECARRTHT VLCSPVLHSHANTATGTNAHKDTGVGEGASIPHPQPLHRDATVAAMNAHNEASMLAATSS LPQPVNVHHTSLPLLLLLLKV >gi568815589r:38986689_39388064|GENSCAN_predicted_CDS_1|1866_bp atgcctgagcctcccacccactccatgggctcctgtgcagcccgagcctccccgacgagc accaccccctgctccacggcgcccggtcccatcgaccaccctagggctgaggagtgcgag cgcacggcgcgggactggcaggcagctccacctgcagcccctgtgcgggatccactaggt gaagccagctgggctcctgagtctgctgtaacactcaccgtgaagatttgcagcttcact cctgagcccagtgagaccaccagcccaccgggaggaacgaacaactccagacgtgccacc ttaagagctgtaacactcaccacgaaggtctgcagcttcactcctgagccagcgagacca cgaacccaccagaaggaagaaactccgaacacatgtgaacgtcagaagggacagacttcc agatgcgccaccttaaaagctgtaacactcaccgcgagggtccgcggcttcattcttgaa gtcagtgagaccaagaacccaccaatttcgggcacagtgttaagagttacttggataagg aaacagatgaaaataactgtgctaaatactcccagtccccaaaaccactttggctttctc tctgatgctgtagccttacaggctataagcacagtgatagttttagctaacgcagttgaa acaaccacagtaaaaatgattccaaatgttgtctgtcagaggatacaagtgactgtgttt gtacagccagtgaagagcaaggagcagaagaaaacagacagagagatgtcgctgagagtc ttgactatgagtgtgtgtatgtgcttcacaaagatcataagaaccatggctgtattagtc cattttcacactgctgataaagatatatcagagactggctccctctcacaacacgtggga attatgggagctataattcaagatgagattagggtggggacacagcgaaaccataccaat ggcagtgaggacagagaaacagagagctatgctggccagtgctgtccccagtgcatcttc aaaatccaggaaggtcacaactctgcaatacacaaaactgaaggtgagcaccaagcagtg caagctttacttgatggccgtggacttgagaaggagaagcatggctcacaaatcacttct cgactaatgagaggttcaatgaagaatgggcagccacgtgaaaaggtgattggacatctc ttcagattcttcttggcctttctgtatgacattcctttccctcggtgtaaagcaggaccg ctctggaaccagggtcttgtgatctgctatcaaacaagaaaatatgagggaagatggctg actagacacagccaggaggaacagctcccaccaagggaccaggacattgggaagactggt gtgctcctggcagatcttcagagggaaggctttgagagcagacagagggaagacacagat gttgggctgaagggggaggaaggcgggaaccctgcaaggggcaaccgtgcaccaggactc attccaggtccccaaaatctcctggaggagagtgtgacaataggcaccgagaacagagga cccacccctgttctgagcggctactaccacccacgtgaatgtgcaaggagaactcacaca gtcctgtgctcaccagtgcttcactcccatgctaacaccgccaccggcacaaatgcacac aaagacacaggggtgggggagggggcaagcatcccccacccccaacccctgcaccgtgat gccacagttgctgctatgaatgcccacaatgaggccagcatgctggcagccactagctcc ctgccgcagccagtgaatgtgcaccacacctcgctgccactgctgctgctgctgctgaag gtgtga >gi568815589r:38986689_39388064|GENSCAN_predicted_peptide_2|118_aa MEIDSLRKPSKSLNGVAVAESSVNCRKAELVCTGECIPVSVFLFPSRCVWMTQQPGKSFA SKIGHMQKQPRITCVWQEEGTMSLEATSNKCGIAISGQEDSETRSMGCLRESPEGLAT >gi568815589r:38986689_39388064|GENSCAN_predicted_CDS_2|357_bp atggagatagatagcttaaggaagcctagcaagtctctgaatggagtagctgttgctgag agttcagtgaactgtcggaaagcagagctcgtgtgcactggagagtgtattcctgtgagt gtattcctgttcccttccaggtgcgtgtggatgacccagcagccagggaagagttttgca tcgaagattggacacatgcaaaaacaacccagaattacatgtgtgtggcaggaagagggg acaatgtccctggaggcgacatccaacaaatgcgggatagcaatcagtggacaggaagat tctgagacacgttccatgggctgtctcagagagtccccagagggattagctacatag >gi568815589r:38986689_39388064|GENSCAN_predicted_peptide_3|729_aa MDYDVPHGLEGQCPLESSLFTRSLAVYSAVGDSGQRLRSNPAQGRALYEQSCEAHKHRGN PSGLYYIDADGSGPLGPFLVYCNMTADAAWTVVQHGGPDAVTLRGAPSGHPRSAVSFAYA AGAGQLRSAVNLAERCEQRLALRCGTARRPDSRDGTPLSWWVGRTNETHTSWGGSLPDAQ KCTCGLEGNCIDSQYYCNCDAGRNEWTSDTIVLSQKEHLPVTQIVMTDAGRPHSEAAYTL GPLLCRGDQSFWNSASFNTETSYLHFPAFHGELTADVCFFFKTTVSSGVFMENLGITDFI RIELRAPTEVTFSFDVGNGPCEVTVQSPTPFNDNQWHHVRAERNVKGASLQVDQLPQKMQ PAPADGHVRLQLNSQLFIGGTATRQRGFLGCIRSLQLNGVALDLEERATVTPGVEPGCAG HCSTYGHLCRNGGRCREKRRGVTCDCAFSAYDGPFCSNEISAYFATGSSMTYHFQEHYTL SENSSSLVSSLHRDVTLTREMITLSFRTTRTPSLLLYVSSFYEEYLSVILANNGSLQIRY KLDRHQNPDAFTFDFKNMADGQLHQVKINREEAVVMVEVNQSTKKQVILSSGTEFNAVKS LILGKVLEAAGADPDTRRAATSGFTGCLSAVRFGRAAPLKAALRPSGPSRVTVRGHVAPM ARCAAGAASGSPARELAPRLAGGAGRSGPADEGEPLVNADRRDSAVIGGNKALNDLVLVI IALDNDTIT >gi568815589r:38986689_39388064|GENSCAN_predicted_CDS_3|2190_bp atggactatgatgtccctcacggcctggaaggtcagtgtcctttagaatcttcactcttt accaggtccttagctgtgtactcagcagtgggagacagcggccagaggctgagatccaac cctgctcaaggcagagctctctacgagcagtcttgtgaagcccacaagcaccgagggaac ccgtctgggctttactatattgatgcagatggaagtggccccctgggaccatttcttgtg tactgcaatatgacagcagacgccgcgtggacggtggtgcagcacggtggccccgacgcg gtgaccctccgaggtgcccccagcgggcacccgcgctcggctgtgtccttcgcgtacgca gcgggcgcggggcagctgcggtccgcggtgaacctggcggagcgctgcgagcagcggctg gctctgcgctgcgggacggcgcggcgcccggactcacgagatggaaccccactgagctgg tgggttggaagaaccaatgaaacacacacttcctggggaggttctctgcctgatgctcaa aagtgtacttgtggattagaggggaactgcattgattctcagtattactgcaactgtgat gctggccggaatgaatggactagtgacacaatagtcctttcccaaaaggagcacctgcca gtcactcagattgtgatgacagacgcaggccgaccacattccgaagcagcttatacactg gggccactgctctgccgcggagatcagtcattctggaattcagcttccttcaacactgag acttcataccttcatttccctgctttccacggagaactcactgctgacgtgtgcttcttt tttaagaccacagtttcctccggggtgtttatggagaacctggggatcacagatttcatc aggattgagctgcgtgctcccacagaagtgaccttttccttcgatgtggggaatggacct tgtgaggtcacggtgcagtcacccactccctttaatgacaatcagtggcaccacgtgagg gcagagagaaatgttaaaggagcgtctcttcaagttgatcagcttcctcagaagatgcag cctgcccctgctgatgggcacgttcgtttacagctcaacagccagctcttcattggtgga acggccaccagacagagaggctttctaggatgcattcggtctctgcagttgaacggggtg gccctggatctggaagaaagagccacagtgacgccaggagtggagccagggtgtgcagga cactgcagcacctatggacacttgtgtcgcaatggagggagatgcagagagaaacgcagg ggggtcacctgtgactgtgccttctcagcctatgatgggccgttctgctccaatgagatt tccgcatattttgcaactggctcctcaatgacataccattttcaagaacattacacttta agtgaaaactccagctctctcgtttcttcattacacagagatgtaacattgaccagagaa atgatcacactgagcttccgaaccacacgaactccgagcttattgctgtatgtgagctct ttctatgaggaatacctttcagttatcctcgccaacaatggaagtttgcagattaggtac aagctagatagacatcaaaatcctgatgcatttacctttgattttaaaaacatggctgat gggcaacttcaccaagtgaagattaacagagaagaagctgtggtcatggtagaggttaac cagagcacaaagaaacaagtcatcttgtcctcagggacagaattcaacgccgtcaaatct ctcatattgggaaaggttttagaggctgccggcgcggacccggacacaaggcgggcggcg actagtggcttcactggctgcctctcggcggtgcgcttcggccgcgctgctcccctgaag gcggcgctgcgccccagcggcccctcccgggtcaccgtccgcggccacgtggcccctatg gcccgctgcgcagcgggggcggcgtccggctccccggcgcgggaactggctccccgactc gcggggggcgcaggtcgttctggaccagcggatgagggagagcccttggttaatgcagac agaagagactctgctgtcatcggaggtaacaaggccctgaatgacctggtgcttgtcatt atcgctttagataatgataccattacttag >gi568815589r:38986689_39388064|GENSCAN_predicted_peptide_4|217_aa MPPYDGLRVETSDSKAEGWFQNIKLLQSQLQKDQESPRDFPYEEDSRPQSWSSKAAIPPP VKEEQDRLRSPNRPSSSFLANVVGTVAHNIMQKRGFREGRGLGEHLQGLRDAFPVGKTSR SGGKIIVGDAAEKDALKDQKCGRAGLMTLLIVLQGQNQGALGLLMKALGQNVCPSTYRTW QKFHDNDTENIATKTKVDKWDLIKLKVSAQQKKLSTE >gi568815589r:38986689_39388064|GENSCAN_predicted_CDS_4|654_bp atgcccccatacgatggcctgagagtggagaccagtgactcaaaagcagaaggctggttc caaaacatcaaacttctgcagtctcagcttcagaaagaccaagagtcaccccgagatttt ccttatgaagaggattcaagacctcagtcatggtcttccaaagcagccattcctccccca gtgaaggaggaacaggacagactgagatctccaaaccggcctagcagctccttcctcgct aacgtggtgggaacggtggcgcacaacatcatgcagaagcgcggcttccgggaaggccgg ggcctcggggagcacctgcaggggctgcgagatgcctttccggtggggaagactagcaga tcaggcggcaagatcatcgtgggcgacgccgcagagaaagatgcattgaaggatcagaag tgtgggagggctggactgatgactctgcttatagtcctacaaggacaaaatcaaggtgct cttgggctccttatgaaggccctagggcagaatgtgtgtccaagcacatacaggacatgg caaaaatttcatgacaacgacaccgaaaacattgcaacaaaaacaaaagttgacaaatgg gacctaattaaactaaaggtttctgcacagcaaaagaaactatccacagagtaa >gi568815589r:38986689_39388064|GENSCAN_predicted_peptide_5|271_aa MKMGFPGNTNADSVVHYRLQPPFEARFLRFLPLAWNPRGRIGMRIEVYGCAYSNAKLPST IAPVTLTLGSLLDDQHWHSVLIELLDTQVNFTVDKHTHHFQAKGDSSYLDLNFEGNVSFS CPQPQTVPVTFLSSRSYLALPGNSGEDKVSVTFQFRTWNRAGHLLFGELRRGSGSFVLFL KDGKLKLSLFQPGQSPRNVTAGAGLNDGQWHSVSFSAKWSHMNVVVDDDTAVQPLVAVLI DSGDTYYFGGKRRQLNDTGSGTTFYLYCFAF >gi568815589r:38986689_39388064|GENSCAN_predicted_CDS_5|816_bp atgaagatgggttttccaggaaacacaaacgcagacagtgtggtgcactacagactccag cctccctttgaagccaggttcctgcgctttctccctttagcctggaaccctaggggcagg attgggatgcggatcgaagtgtacggatgtgcatatagcaatgctaagctgccttccact attgctcctgtgaccctcaccctgggcagcctgctggacgaccagcactggcattccgtc ctcatcgagctcctcgacacgcaggtcaacttcaccgtggacaaacacactcatcatttc caagcaaagggagattccagttacttggatcttaattttgagggaaatgtgtccttctca tgtccacagccacagactgtccctgtgacttttctgagctccaggagttatctggctctg ccaggcaactctggggaggacaaagtgtctgtcacttttcaatttcgaacgtggaacaga gcaggacatttgcttttcggcgaacttcgacgtggttcagggagtttcgtcctctttctt aaggatggcaagctcaaactgagtctcttccagccgggacagtcaccaaggaatgtcaca gcaggtgctggattaaacgatgggcagtggcactctgtgtccttctctgccaagtggagc catatgaatgtggtggtggacgatgacacagctgttcagcccctggtggctgtgctcatt gattcaggtgacacctattattttggaggtaagagaaggcaactgaatgacactggcagt ggaaccactttttatctttattgctttgcattttga >gi568815589r:38986689_39388064|GENSCAN_predicted_peptide_6|444_aa MLMGEEGTPGKWDLGLPQSPPKDIDRTVRPRLQQDAGSCNCQRLQLSFGVSYDMIVYLEN PIVSAQNLLKLINNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKSIKYLG IQLTRDVKDLFKENYKPLLNQIKEDTNKWKNIPCSWIRRINIVKMAILPKFLNSPKKMGL YSQIQIIIIIIIIIIIIIAYVFEHFPHTSLSPLLRDAWVPDPATFLVPQPKELEGKDPQR GRDRGDERKERKGEERREEKRREEKRERVRMTSSVPSVLQLWKIVPNGSSRNHIHKKAPV PEIESSEFYLFMCTHETDLKHSQMCTGIHHPALPWPINRNLLVTVLETGMLKIKVVASVK AFLLYHNKAEGITWVPSQSRQLLPLGSVLMGRSPTSDLPSASSVSSTHICSSQARKIKIT IRNKYTQAMGDLLMHFVHCPKNVP >gi568815589r:38986689_39388064|GENSCAN_predicted_CDS_6|1335_bp atgctcatgggtgaagaagggactcctggcaagtgggacctggggctgccacagagccct cccaaggacatagacaggactgtgaggccacggctccagcaggatgctggctcctgcaac tgtcaaagacttcagctcagttttggagtttcctatgacatgattgtatatttagaaaac cccattgtctcagcccaaaatctccttaagctgataaacaacttcagcaaagtctcagga tacaaaatcaatgtacaaaaatcacaagcattcttatacaccaataacagacaaacagag agccaaatcatgagtgaactcccattcacaattgcttcaaagagtataaaatacctagga atccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaac caaataaaagaggacacaaacaaatggaagaacattccatgctcatggataagaagaatc aatatcgtgaaaatggccatactgcccaagtttttgaatagcccaaagaaaatgggatta tattctcaaattcaaatcatcatcatcatcatcatcatcatcatcatcatcatagcttat gtttttgagcacttcccacatacttccttaagccctttgctcagagatgcttgggtacca gatccagccacgtttttggtgccacaacccaaggagctggagggtaaggatccacagagg ggaagggatagaggagatgagaggaaggagaggaaaggagaggagagaagagaagagaag agaagagaagagaagagagaaagggtgaggatgaccagctcagtacccagtgttctgcag ctctggaagatagtgccaaatggttcttcaagaaatcatattcataaaaaggcaccagtt cctgaaattgaatcatctgagttttaccttttcatgtgcacacatgaaacagacctaaaa cattcccagatgtgcactggaatacatcatcccgctctcccctggcctataaacagaaat ttacttgtaacagttctggagactgggatgctgaagatcaaggtggtggcatctgtgaag gccttcttgctgtatcataacaaggcagaaggcatcacatgggtaccgagtcagtccaga cagctgctgcctctgggctcagtgctgatgggcagatctccaacctctgaccttccttct gcctcctccgttagttccactcacatttgctccagtcaggcccggaaaataaagataaca attaggaacaaatacacacaggccatgggagatctactaatgcactttgtccactgcccc aaaaacgtcccatga >gi568815589r:38986689_39388064|GENSCAN_predicted_peptide_7|413_aa MVPVPPCTSGRIGLVNPSGPGLFLVVSHSSNPFSRFLASLRWVRTSSFSSEKFVITDHLK PSSLNSSKSFSIQLCSVAGEELRSFGGEENHQSAGKAPFSGVRFSCSLSSPSAATRGCCV APRHPSAAPQSCPAATARGFQGLIEEMRYGLCFVTLDSIAAEPQWMVIITETVTKPGHRG GKEDASLQEQPARHPRCAAAHSVTGLDALLQLWWPVRTAPVIQGHPEVPKAGVVGGTQWE ITESWGRFPHTVLVVENQSHKDFRIALLQFAFFLLKMLSFHFFPVQKNPVITCDSSTVMQ FLARKWPSIPTIKAAFNKGTQTSAIQQCTGAGGWTPLVSNKYQWLQIDLGERMEVTAVAT QGGYGSSDWVTSYLLMFSDGGRNWKQYRREESIWGHRINLPVPGEQNAFPALS >gi568815589r:38986689_39388064|GENSCAN_predicted_CDS_7|1242_bp atggtaccagttcctccttgtacctctggtagaattgggcttgtgaatccatctggtcct ggactttttttggttgttagccattcgtctaatcctttttcaaggtttttagcttctttg cgatgggttcgaacatcctcctttagctcggagaagtttgttattaccgatcatctgaaa ccttcttctctcaactcgtcaaagtcattctccatccagctttgttccgttgctggcgag gagttgcgttcctttggaggagaagagaatcatcagtcagcaggcaaggctcctttctcg ggagtgaggttctcctgttccctctcatccccatctgccgccacacgtggctgctgtgtg gctccccgtcatccttcagcagctcctcagagctgtccagcagccacggcccggggtttt caaggcttaatcgaagagatgagatatgggctctgttttgttactttagattctattgct gcagagcctcagtggatggtcattatcactgaaacagtgactaagcctgggcacaggggt ggaaaggaggacgccagtctccaagagcagcctgcacggcatcctcgttgtgctgcggca cattcagtgacaggactggacgccctgctgcagctctggtggccggtgaggacagcccca gttatccaaggacaccctgaggtaccaaaggcaggtgttgtgggagggacccagtgggag ataactgaatcatgggggcggttcccccatactgttctcgtggttgagaatcaatctcac aaggattttcgcatcgctctcttacagtttgcattcttccttctaaaaatgttgtcgttt cacttttttcctgttcaaaaaaatccagtgataacatgtgacagctcaactgtcatgcag ttcttggcaagaaagtggccttctataccaaccatcaaagctgcctttaacaaaggcaca cagactagtgctatccagcagtgcacaggagctggtggctggactccacttgtgtcaaat aaataccaatggctgcaaattgaccttggagagagaatggaggtcactgctgtcgccacc caaggaggatatgggagctctgactgggtgaccagctacctcctgatgttcagtgatggt gggagaaactggaagcagtatcgccgagaagaaagcatctggggtcatcgtattaacctt cctgtaccaggagaacagaatgcatttccagctctctcctag >gi568815589r:38986689_39388064|GENSCAN_predicted_peptide_8|59_aa MASVAWAVLKVLLLLPTQTWSPVGAGNPLASPRCQPQPSPTKISFEISSNKRQWPCLEI >gi568815589r:38986689_39388064|GENSCAN_predicted_CDS_8|180_bp atggcttcagtggcctgggccgtcctcaaggtgctgctgcttctccccactcagacttgg agccccgtgggagcaggaaatccactggcatctccccgctgtcagcctcagccctctcct accaaaatctctttcgaaataagttccaataaacgccagtggccatgtttggaaatttag >gi568815589r:38986689_39388064|GENSCAN_predicted_peptide_9|300_aa MICSDSLKGQAEKKSEGTHQVTLWQPKTSTSRKPLLPTAEAQREERLLPDPQKAAAGEHK GLAGATVIEVKEFLVRTWKQDFFKRCKSKKNKAGDIMLPDFKVYYKATVTKTAWYWYQNR DIDQWNRTEASEITPHIYNHLIFEKLDKNKQCGKNCLFNKWYWENWLAIRRKLKLDPFLT PYTKFNSRWIKDLNIRPKTIKTLEENLGNTIQDIGMGKDFMTKTPKAMATKAKIDEWDLV KLKSFCTAKETIIRVNRQPTEWEKIFTICPSDKRRISRIYKELKQIYKRQTTPSKCGQRI >gi568815589r:38986689_39388064|GENSCAN_predicted_CDS_9|903_bp atgatttgctcagatagtttaaagggacaggctgaaaagaaaagtgaagggacccaccag gtgacattgtggcaaccaaagacaagcactagcagaaagcctttactacctacagcagag gcgcagagggaggaacggttgctgccagatccccagaaggcagcagccggggaacacaaa ggtctggcaggagccacagtcatagaagtaaaagaatttcttgtcagaacctggaaacag gacttctttaaaagatgcaaaagtaaaaagaacaaagctggagacatcatgctacctgac ttcaaagtatactataaggctacagtgaccaaaacagcatggtactggtaccaaaacaga gatatagaccaatggaacagaacagaggcctcagaaataacaccacacatttacaaccat ctgatctttgagaaacttgacaaaaacaagcaatgcggaaagaattgcctatttaataaa tggtattgggaaaactggctggccatacgcagaaaactgaaactggaccccttccttaca ccttatacaaaatttaactcacgatggattaaagacttaaacataagacctaaaaccata aaaaccctagaagaaaatctaggcaataccatccaggacataggcatgggcaaggacttt atgactaaaacaccaaaagcaatggcaacaaaagccaaaatagacgaatgggatctggtt aaactaaagagcttctgcacagctaaagaaactatcatcagagtgaacaggcaacctacg gaatgggagaaaatttttacaatctgtccatctgacaaaaggcgaatatctagaatctac aaagaacttaaacaaatttacaagagacaaacaaccccatcaaaatgtgggcaaaggata tga >gi568815589r:38986689_39388064|GENSCAN_predicted_peptide_10|213_aa MWESVELPRDLEDLEDKKMWESLGLPRDLNGFYQNVDSDMENEVQAEVVSDGDEELVGNW DKDASAPATVKRGQGTAQAIASEGPSSKPWRLPCGVEPVGSQKSRIEVWEPSPGFQRMYG NAWIPRQKFAAGAKPSWRTSARTVQNGNVGLEPPHRVLTGVLPSGAEMQQEELGATQCSD INEERSCDEKDEEAPEEVMPAKTATNFILKELL >gi568815589r:38986689_39388064|GENSCAN_predicted_CDS_10|642_bp atgtgggaaagtgtggaacttcctagagacttggaggacttggaagacaagaagatgtgg gaaagtttgggacttcctagagacttgaatggcttctaccaaaatgttgatagtgacatg gaaaatgaagtccaggctgaggtggtctcagatggagatgaggaacttgttgggaactgg gataaagatgcttcagctccagccacagttaaaaggggccaaggtacagctcaggccatt gcttcagagggtccaagctccaagccttggaggcttccatgtggtgttgagcctgtgggt tcacagaagtcaagaattgaggtttgggaaccttcacctggatttcagaggatgtatgga aatgcctggatacccaggcagaagtttgctgcaggggcaaagccctcatggagaacctct gctaggacagtgcagaacggaaatgtggggttggagcccccacacagagtcctcactggg gtactacctagtggagctgaaatgcagcaagaggaactcggcgcaacgcagtgttctgac ataaatgaggaacgcagctgtgatgaaaaggatgaagaagccccagaggaagtgatgccg gcaaagacagcaacaaacttcatattaaaggaactcttgtag >gi568815589r:38986689_39388064|GENSCAN_predicted_peptide_11|1524_aa MGERSPMLSMLCQEGYRQCQSMKATKRGAMSCKATGVELPKAMGAHLLHKCDLDNALHPS HLHFPLTHHTHPHCLACPPPPPKAFIAPPLLDCILTPSQCDSMALPLGTALKSSSPQWPE TKPTVSLRPGPQSLVPTGMPRAQLLESNAPIHMENLPFPLKLLSASSLNAPSSTPWVLDI FLTLVFALGFFFLLLPYLSYFRCDDPPSPSPGKRKLVESAREACRRLRTCFHNCRGLGRD PRATGSLELTWDGETRGTEDGSKTLGRGVAGELGNQGVGCLLGPHLDKGDFGQLSGPDPP GEVGERAPDGASQSSHEPMEDAAPILSPLASPDPQAKHPQDLASTPSPGPMTTSVSSLSA SQPPEPSLPLEHPSPEPPALFPHPPHTPDPLACSPPPPKGFTAPPLRDSTLITPSHCDSV ALPLGTVPQSLSPHEDLVASVPAISGLGGSNSHVSASSRWQETARTSCAFNSSVQQDHLS RHPPETYQMEAGSLFLLSSDGQNAVGIQVTETAKVNIWEEKENVGSFTDRMTPEKHLNSL RNLAKSLDAEQDTTNPKPFWNMGENSKQLPGPQKLSDPRLWQESFWKNYSQLFWGLPSLH SESLVANAWVTDRSYTLQSPPFLFNEMSNVCPIQRETTMSPLLFQAQPPSHLGPECQPFI SSTPQFRPTPMAQAEAQAHLQSSFPVLSPAFPSLIKNTGVACPASQNKVQALSLPETQHP EWPLLRRQLEGRLALPSRVQKSQDVFSVSTPNLPQESLTSILPENFPVSPELRRQLEQHI KKWIIQHWGNLGRIQESLDLMQLRDESPGTSQAKGKPSPWQSSMSTGESSKEAQKVKFQL ERDPCPHLGQILGETPQNLSRDMKSFPRKVLGVTSEESERNLRKPLRSDSGSDLLRCTER THIENILKAHMGRNLGQTNEGLIPVRVRRSWLAVNQALPVSNTHVKTSNLAAPKSGKACV NTAQVLSFLEPCTQQGLGAHIVRFWAKHRWGLPLRVLKPIQCFKLEKVSSLSLTQLAGPS SATCESGAGSEVEVDMFLRKPPMASLRKQVLTKASDHMPESLLASSPAWKQFQRAPRGIP SWNDHGPLKPPPAGQEGRWPSKPLTYSLTGSTQQSRSLGAQSSKAGETREAVPQCRVPLE TCMLANLQATSEDMHGFEAPGTSKSSLHPRVSVSQDPRKLCLMEEVVNEFEPGMATKSET QPQVCAAVVLLPDGQASVVPHASENLVSQVPQGHLQSMPAGNMRASQELHDLMAARRSKL VHEEPRNPNCQGSCKNQRPMFPPIHKSEKSRKPNLEKHEERLEGLRTPQLTPVRKTEDTH QDEGVQLLPSKKQPPSVSHFGGNIKQFFQWIFSKKKSKPAPVTAESQKTVKNRSCVYSSS AEAQGLMTAVGQMLDEKMSLCHARHASKVNQHKQKFQAPVCGFPCNHRHLFYSEHGRILS YAASSQQATLKSQGCPNRDRQIRNQQPLKSVRCNNEQWGLRHPQILHPKKAVSPVSPLQH WPKTSGASSHHHHCPRHCLLWEGI >gi568815589r:38986689_39388064|GENSCAN_predicted_CDS_11|4575_bp atgggagaacgaagcccgatgctgagcatgctttgccaggaaggctacagacaatgccag tccatgaaagcaaccaagaggggggctatgtcctgcaaagccacaggggtggagctgccc aaggccatgggagcccacctcttgcataagtgtgacctggataacgcccttcacccaagc cacctgcattttcccctcacccatcatacccacccccattgtctggcctgccctccacct cctccaaaagccttcattgctcccccactgctggactgcattctgactccgtctcaatgt gactcaatggcacttccattgggcactgccctaaagagctcatctccacagtggcctgaa accaaacccacagtatctctgaggcctgggccccagtccctagtccccacggggatgccc agagctcagttgcttgaaagcaacgcgcctattcacatggagaatcttccctttccttta aaattacttagtgcctcatcgctaaacgcccccagttccacaccatgggtgttggatatc ttcctcactttggtgtttgccctggggttcttcttcctattactcccctacttatcttac ttccgttgtgatgacccaccctcaccatcgcctgggaagagaaagctggtagagagtgcc cgagaggcctgcaggagacttcggacctgctttcacaactgcagagggctgggacgtgac cccagggccacaggcagcctggagctgacctgggatggggagaccaggggtacagaggat gggagtaaaaccctggggcgaggggtagcaggagaactgggcaatcagggtgtggggtgc ctcctggggccacaccttgacaaaggtgactttggtcagctctccggtccagacccccca ggtgaagtgggcgaaagagcacctgatggagcctcccagtcctctcatgagcctatggaa gatgctgctcccattctctccccgttagcttccccggatcctcaagccaagcatcctcag gatctggcctccaccccatcaccaggcccaatgaccacctcagtctcctccctaagtgcc tcccagccaccagaaccttcccttcccctagaacacccctcacccgagccacctgcactt ttccctcacccaccacacacccctgatcctctggcctgctctccgcctcctccaaaaggc ttcactgctcctcccctgcgggactccacactgataactccatctcactgtgactcagtg gcacttccactgggcaccgtccctcaaagcttgtctccacatgaggatttggtggcttct gtcccagccatctcaggccttggtggctcaaacagtcatgtttctgcctcctcccggtgg caggagactgccagaacctcgtgcgcctttaactcatcagtccagcaagatcatctttcc cgccacccaccagagacctaccagatggaagctggtagcctgtttttgctcagctctgat ggccagaatgccgtggggatacaagtcacagaaacagccaaggtcaacatttgggaagaa aaagaaaatgttggatcatttacagatcgaatgaccccagaaaagcacttaaattctttg cggaatttggctaaatcattggatgctgagcaggacaccacaaacccaaaacccttctgg aacatgggagagaactcgaaacagctgcccggacctcagaagctctcagatcctaggctc tggcaggaaagtttttggaagaattatagccagcttttctggggcctcccctctctgcac agcgagtccctggtggctaacgcctgggtaactgacaggtcttatactttacagtctcct cctttcttgttcaatgaaatgtccaatgtctgcccaattcaaagggagactacaatgtcc ccactgcttttccaggcccagcccccgtcccatctggggcccgagtgccaaccctttatt tcatccacaccccaattccggcccacacctatggctcaggccgaggctcaggcccatctt caatcttctttcccagtcctatctcctgcttttccatccctgattaagaacactggagta gcttgccctgcatcgcagaataaagtgcaagctctctccctacctgaaactcagcaccct gaatggcctttgttgaggagacaactagaaggtaggttggctttaccctctagggtccaa aaatctcaggacgtctttagtgtctccactcctaaccttccccaggaaagtttgacatcc attctgcctgagaactttccagtcagtcctgaactccggagacaactggagcaacacata aaaaagtggatcatccaacactggggcaacctgggaaggatccaagagtctctggatctg atgcagcttcgggacgaatcaccagggacaagtcaggccaagggcaaacccagtccctgg cagtcctccatgtccacaggtgaaagcagcaaggaggcacagaaggtgaagttccagcta gagagggacccgtgcccacatctggggcaaattctgggtgagaccccacaaaatctatcc agggacatgaaaagcttcccacggaaggttctgggggtgacttctgaggagtcggaaagg aacttgaggaagcccttgaggagtgactcgggaagtgatttattaagatgcacagagagg actcatatagaaaacatcctgaaagcccacatgggcaggaacttgggccagaccaacgag ggcttgatccccgtgcgtgtgcgtcgatcctggcttgctgtcaaccaggctcttcccgtg tccaacacccatgtgaaaaccagcaatctagcagccccgaaaagtgggaaagcctgtgtg aacacagcccaggtgctttccttcctcgagccgtgtactcagcaggggttgggagcccat attgtgaggttttgggccaaacacaggtggggtctacccctcagggtcctcaagcccatt cagtgctttaaactggaaaaggtttcatccttgtcccttacacagcttgctggtccctcc tcagccacctgtgaatctggggctggctcagaagttgaggtggacatgttccttagaaag ccaccaatggcaagtctgagaaagcaggtgctgaccaaagcatctgatcacatgccagag agtcttctggcctcctcacctgcatggaagcagttccagagggcaccgcgaggaatccca tcttggaatgatcatgggcccttgaagcctcctccagctggacaggagggcaggtggcca tctaagcccctcacgtacagcctcacaggcagcacccagcagagcaggagcttaggagcc caatcttcaaaggctggagagacaagggaggcagtgccacaatgcagagtccccttggaa acctgtatgctggcaaacctccaagccacaagtgaggatatgcatggtttcgaggctcca gggaccagcaaaagctctctacaccctagagtgtctgtctcccaagatccaagaaagctg tgtcttatggaggaggttgttaatgaatttgagcctggaatggccacaaagtcagagacc cagcctcaagtttgtgccgctgttgtgctccttccagatgggcaagcatctgttgtgccc cacgcttcagagaatttggtttctcaagtgccccagggccatctccagagcatgcctgct gggaacatgcgggcttcccaggagctacatgaccttatggcagccagaaggagcaaactg gtgcacgaggagcccagaaacccaaactgtcaaggctcatgcaagaaccaaaggccaatg tttccccctattcacaagagtgagaagtctaggaagcccaacttagaaaaacatgaagaa aggcttgaaggattgaggactcctcaacttaccccagtcaggaaaacagaagacacccat caggatgaaggcgtccagctactgccatcaaagaaacagcctccttcagtaagccacttt ggaggaaacatcaagcaattttttcagtggattttttcaaagaaaaaaagcaagccagca ccagtcactgctgagagccaaaaaacagtgaaaaacagatcatgtgtgtacagcagcagt gctgaagctcagggtctcatgacggcagttggacaaatgctggacgagaaaatgtcactt tgccatgcgcgccatgcctcgaaggtaaatcagcacaaacagaagtttcaagccccagtc tgtgggtttccctgcaaccacaggcacctcttctactcagaacacggcagaatactgagc tatgcagccagcagtcaacaagccactctcaagagccagggttgtcccaacagagacagg caaatcagaaatcaacagcccttgaaaagtgtgcggtgcaacaatgagcaatggggcctg cgacatccccaaatcttgcaccccaagaaagctgtatccccagtcagtccccttcagcac tggccgaagacatccggtgcctctagccaccatcaccactgtccaaggcactgtcttctt tgggaaggtatctga >gi568815589r:38986689_39388064|GENSCAN_predicted_peptide_12|287_aa MGSTSSSSTSQNSHLGFLREPSSSQAWPILSGQGPSSVPHSPELPEDRGCSLLQVPAPPA DPGSAVTAGPTLEAASVIRAPERTWPQEKPQAGDCAHCPLHPSAGQSHMLGAGPFPLSSR SVALPGTGLSTIEMRPPEVGLLHACMWHKAEKTPSRDLAPPVHDLWRPPAFPYGQGPERR KAVLKAEGDSRDGSCPAHTLPILDRSLPAPLYVQILPACIFPCWSLEQARMSGARGDLLC DPSSAAGPFSTESHHQKPLHTQAYPTPFTQNYFAEAERHVGEIDKGR >gi568815589r:38986689_39388064|GENSCAN_predicted_CDS_12|864_bp atgggctccacgtccagctcctccacatcccagaactcgcatcttggtttcctgagggaa cccagcagcagccaggcatggcccatcctcagtggtcagggtcccagctctgtgccacac tctcctgagctcccggaggaccggggctgctctctgctccaggtcccagctcctcctgct gatcctggctccgctgtcaccgctgggcccaccttggaggctgcttcagttatccgggcc ccagagaggacctggccccaggagaagccacaagctggggactgtgcccactgccccctg caccccagtgccggccagtcccacatgttgggggcagggccatttccattgtcatctaga tcagtggcactgcctggcactggcctctccaccattgaaatgaggcccccggaagtgggc ctcttgcacgcctgtatgtggcacaaggcagagaaaactccctctagagacctggctcct cctgtccatgatttgtggagacctcctgctttcccatatggacagggcccagagaggagg aaagctgtgctgaaagcagagggagacagcagggatggctcctgtcctgcccataccctg cccattctggacaggtcacttccagctcccttgtatgttcaaatcctgcctgcctgcatc ttcccttgctggtctctggaacaagcaaggatgtcaggagccaggggagatttgctgtgt gaccccagctcagctgctgggcccttctctacagagtcccaccatcagaagcctctgcac acacaggcataccctactccattcacccagaactacttcgctgaagctgagagacatgta ggtgagatagacaaaggccggtga >gi568815589r:38986689_39388064|GENSCAN_predicted_peptide_13|222_aa MLQPEEPTSVGTELLCSPQPRHAAAVQMMPTVNPRKLQNNAAPGTVGEDLEDANVFPKKR DPEKMWRELRGCPGGDVETAQRLSQRRRGKSSEAVPEKTWRAQRMSQRRRGESSEAVPEK TWKELRNSETVPEKTWKQLRRCLQEDVERVQRLSLLLHLAVFLWIIIAINFSNSGVKSQS STYLPSVPITVPGTHRPHNPSWPIELTVNYVFVHLGCYNKIP >gi568815589r:38986689_39388064|GENSCAN_predicted_CDS_13|669_bp atgttgcaaccagaagagcccacttccgtggggacagagcttctgtgctctccacagcca cgtcacgctgccgctgtgcaaatgatgccaactgtaaatcccagaaaactgcaaaataat gctgcgccaggcacagttggtgaggatttagaggatgctaacgtctttcctaagaaacga gacccagagaagatgtggagggagctcagaggctgtcctggaggagacgtggagacagct cagaggctgtcccagagaagacgtggaaagagctcagaggctgtcccagagaagacgtgg agagctcagaggatgtcccagagaagacgtggagagagttcagaggctgtcccggagaag acgtggaaagagctcagaaactcagagactgtcccggagaagacatggaaacagctcaga cgctgtctgcaagaagatgtggagagagttcagaggctgtcgttactgctgcacttggct gtttttctttggataattattgcaatcaattttagtaattcaggtgttaaatcacagtca agcacctatttacctagtgttcctatcacagtgccagggacacacaggccccataatcct tcatggccaattgaactgacagtgaactatgtcttcgtccatttgggatgctacaacaaa ataccatag >gi568815589r:38986689_39388064|GENSCAN_predicted_peptide_14|94_aa XPLYSLKQNSIEIRPINNPTIASKCSNERTNHVFLTLSQKLEMIELSEEGILETKIDGKL GLFHQTVGQAVHAKRMFLEEIKSATPVNTQVMRK >gi568815589r:38986689_39388064|GENSCAN_predicted_CDS_14|285_bp nngcctctgtattccctgaaacagaacagtattgaaattaggccaattaataatcctaca atcgcctctaagtgttcaaatgaaaggacgaatcacgtatttctcactttaagtcaaaag ctagaaatgattgagctgagtgaggaaggcatcctggaaaccaagatagacgggaagcta ggcctctttcaccaaacagttggccaagctgtgcatgcaaagagaatgttcctggaggaa attaaaagtgctactccagtaaacacacaagtgatgagaaagtga