GENSCAN 1.0 Date run: 8-Nov-116 Time: 01:52:45 Sequence gi568815587f:2803346_3025194 : 221849 bp : 54.52% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4054 4142 89 0 2 78 78 166 0.999 12.67 1.02 Intr + 6532 6632 101 1 2 99 121 -19 0.464 2.75 1.03 Intr + 8219 8366 148 0 1 -22 78 106 0.200 -1.60 1.04 Intr + 9367 9513 147 1 0 119 56 4 0.384 0.16 1.05 Intr + 9660 9802 143 0 2 76 48 63 0.337 1.61 1.06 Intr + 13802 14248 447 0 0 108 34 182 0.171 8.70 1.07 Intr + 17782 17914 133 2 1 88 61 61 0.074 3.61 1.08 Intr + 23000 23146 147 2 0 53 53 73 0.175 0.16 1.09 Intr + 29616 29732 117 0 0 127 36 26 0.135 1.38 1.10 Intr + 30306 30444 139 0 1 61 81 64 0.204 3.97 1.11 Intr + 35896 36212 317 0 2 64 78 102 0.047 2.21 1.12 Intr + 38829 38958 130 0 1 56 84 45 0.007 2.10 1.13 Intr + 44422 44651 230 0 2 162 -14 291 0.003 23.50 1.14 Intr + 49561 49656 96 1 0 78 33 94 0.013 2.52 1.15 Intr + 50665 50713 49 1 1 68 80 40 0.027 0.37 1.16 Intr + 51371 51607 237 1 0 67 60 111 0.044 4.54 1.17 Intr + 53603 53738 136 1 1 29 80 19 0.037 -4.15 1.18 Intr + 53947 54287 341 2 2 85 21 193 0.006 8.05 1.19 Intr + 55735 55896 162 0 0 86 94 69 0.015 7.99 1.20 Term + 59304 59519 216 2 0 115 37 82 0.341 3.27 1.21 PlyA + 61362 61367 6 1.05 2.00 Prom + 64940 64979 40 -1.61 2.01 Init + 66109 66479 371 0 2 74 40 324 0.728 20.68 2.02 Intr + 67216 67519 304 1 1 107 99 11 0.274 1.24 2.03 Intr + 77229 77331 103 2 1 84 77 42 0.168 2.95 2.04 Term + 77800 78062 263 2 2 29 54 163 0.610 3.02 2.05 PlyA + 79115 79120 6 1.05 3.04 PlyA - 79898 79893 6 1.05 3.03 Term - 80570 80429 142 1 1 92 48 79 0.943 2.01 3.02 Intr - 80789 80663 127 0 1 102 89 174 0.985 19.14 3.01 Init - 82144 81325 820 1 1 75 105 885 0.871 83.90 3.00 Prom - 82467 82428 40 -0.51 4.10 PlyA - 84463 84458 6 1.05 4.09 Term - 85290 85095 196 2 1 65 36 123 0.642 2.00 4.08 Intr - 86256 85963 294 2 0 18 65 290 0.129 16.37 4.07 Intr - 92081 92010 72 1 0 128 34 56 0.103 3.22 4.06 Intr - 93731 93558 174 1 0 24 80 79 0.063 0.27 4.05 Intr - 93946 93906 41 1 2 82 53 44 0.130 -2.19 4.04 Intr - 96251 96051 201 2 0 60 105 40 0.144 2.90 4.03 Intr - 98321 98296 26 0 2 114 97 25 0.082 4.23 4.02 Intr - 99364 99239 126 2 0 91 71 15 0.042 1.26 4.01 Init - 100077 99693 385 0 1 86 80 95 0.031 4.02 4.00 Prom - 102316 102277 40 -0.31 5.00 Prom + 103052 103091 40 -0.11 5.01 Init + 104593 104595 3 0 0 83 40 0 0.027 -5.37 5.02 Intr + 104888 104985 98 1 2 90 92 156 0.720 15.51 5.03 Intr + 105851 106011 161 2 2 82 91 226 0.997 22.44 5.04 Intr + 106233 106365 133 1 1 94 100 204 0.491 22.41 5.05 Intr + 112532 112650 119 2 2 67 81 26 0.206 0.41 5.06 Intr + 113258 113395 138 0 0 54 115 134 0.959 13.64 5.07 Intr + 114619 114750 132 2 0 54 109 245 0.696 24.32 5.08 Intr + 115962 116062 101 1 2 93 101 185 0.999 20.63 5.09 Intr + 118757 118858 102 1 0 109 77 221 0.862 23.97 5.10 Intr + 119091 119211 121 2 1 102 109 270 0.952 31.07 5.11 Intr + 121665 121848 184 1 1 96 17 198 0.979 12.86 5.12 Term + 122606 123140 535 2 1 37 46 511 0.832 35.92 5.13 PlyA + 125175 125180 6 1.05 6.03 PlyA - 125487 125482 6 -5.12 6.02 Term - 126037 125561 477 1 0 60 52 948 0.867 83.63 6.01 Init - 126268 126194 75 1 0 52 51 102 0.545 1.94 6.00 Prom - 127315 127276 40 -5.91 7.00 Prom + 132292 132331 40 -0.51 7.01 Init + 134407 134721 315 0 0 48 34 247 0.068 12.64 7.02 Intr + 139885 140015 131 0 2 73 94 59 0.149 5.00 7.03 Term + 141199 141346 148 1 1 31 54 127 0.132 1.29 7.04 PlyA + 141635 141640 6 1.05 8.16 PlyA - 142498 142493 6 -0.45 8.15 Term - 142627 142520 108 1 0 58 49 68 0.215 -1.29 8.14 Intr - 147970 147914 57 1 0 63 81 119 0.978 8.27 8.13 Intr - 148464 148435 30 0 0 90 94 57 0.964 5.31 8.12 Intr - 151301 151182 120 2 0 100 11 119 0.281 6.59 8.11 Intr - 152421 152399 23 1 2 115 99 12 0.839 2.75 8.10 Intr - 155199 155042 158 2 2 70 -65 180 0.039 1.07 8.09 Intr - 156564 156425 140 0 2 55 94 164 0.865 13.47 8.08 Intr - 161406 161335 72 0 0 51 101 51 0.586 2.80 8.07 Intr - 166589 166458 132 2 0 84 115 134 0.999 17.05 8.06 Intr - 168189 168103 87 0 0 41 87 109 0.990 6.76 8.05 Intr - 168898 168757 142 0 1 69 97 57 0.840 5.56 8.04 Intr - 172778 172679 100 0 1 76 111 107 0.907 11.47 8.03 Intr - 174997 174939 59 0 2 36 82 60 0.409 -0.78 8.02 Intr - 175892 175862 31 0 1 89 115 15 0.411 1.97 8.01 Init - 178600 178588 13 1 1 105 75 11 0.707 1.72 8.00 Prom - 178810 178771 40 -3.31 9.00 Prom + 185503 185542 40 -1.61 9.01 Sngl + 188803 189012 210 0 0 90 36 208 0.986 9.28 9.02 PlyA + 190806 190811 6 1.05 10.17 PlyA - 192448 192443 6 1.05 10.16 Term - 193892 193709 184 1 1 75 55 97 0.443 2.44 10.15 Intr - 197903 197773 131 2 2 56 53 219 0.434 15.20 10.14 Intr - 198708 198625 84 0 0 85 75 120 0.981 10.91 10.13 Intr - 199255 199196 60 1 0 81 19 98 0.681 1.62 10.12 Intr - 201393 201176 218 1 2 89 94 65 0.562 5.95 10.11 Intr - 203614 203534 81 2 0 71 98 151 0.699 14.51 10.10 Intr - 208931 208850 82 2 1 106 100 100 0.968 12.71 10.09 Intr - 211192 211060 133 0 1 50 89 153 0.973 12.75 10.08 Intr - 212330 212255 76 0 1 74 58 56 0.654 0.27 10.07 Intr - 212735 212528 208 2 1 78 31 69 0.653 -0.63 10.06 Intr - 213950 213761 190 1 1 93 121 336 0.999 37.51 10.05 Intr - 214609 214512 98 1 2 97 72 80 0.999 6.61 10.04 Intr - 215166 215063 104 1 2 95 82 137 0.998 14.19 10.03 Intr - 215404 215275 130 1 1 118 74 84 0.999 10.77 10.02 Intr - 215922 215794 129 0 0 87 92 86 0.997 10.30 10.01 Intr - 216987 216875 113 1 2 132 105 101 0.986 16.80 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 44422 44658 237 0 0 162 55 281 0.997 28.70 S.002 Intr + 55682 55896 215 0 2 105 94 110 0.857 12.26 S.003 Init - 130211 130164 48 2 0 96 75 50 0.825 3.42 S.004 Term - 155199 155034 166 2 1 70 34 175 0.870 8.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:2803346_3025194|GENSCAN_predicted_peptide_1|1174_aa MESGTVPPAVAALGLLAPPLRARGQRDVMRLGLLFPGTQTCLMLISSVFWGSLRGLTDKN CVEGGALAERQGGWQPPSVIVLLLGCGLGLHIQDGDVPVGAQHKGQQTPGWRRKQSRHPS MAVGPTPPRPHAPGAPCLPLQHPLPDSTQPPGGFRLKASVPVLCLCLEAVAWLSSWNRSS EKTELASTISPVRTLCLERSLVQRKNPTDCQLCARSFQGLTVQAPHGTSALAFDMKMFLQ QARMPAFLWLQSDVFNYVGQNPRRPFPWLQPVSERYLEPLLHPRRRITPVPPDPQPCADT DTLRMKLTPVVAATPAVREGPAGAFQILSSMYPVKEQVPQFWFMVPEMTASDDVPGQGGW GGSVKCSVAPGWCCSNTGVRGRTPKPGALCESRFLASQLCDSSFPGSPGSNQASWGSEPA ARPSSRLLAALMETAVRFRADSHGNSGRPGVGVTWTLASSQWETDVGNAAAVPCSGQAGA QGLPTRPGDKRGKMEVQRGEQTPGPMFAPGCDRAVHWAQQQELGQVPRDGTETTVGKAPS KGRGQRGSSATQGVLCHVGPPGSATDSLHDLCPHPEVDMNVRTEQPLQVCEAVDVTALAL AQTSAPTQMQGGRWLRVQEHGAMWHRLMSADTAHIDPLGAMTSPTTGASPEPEHRGCHDA RLSGPRLTHSLDTARVKGSGGQGEQLVTQLDQRLALITDMLHQLLSLHGGSTPGSGGPPR EGGAHITQPCGSGGSVDPELFLPSNTLPTYEQLTVPRRGPDEGSCYDLLALLFSTPYQAS YNSPPLQPPQLKAPEPRSHKALCLPDTAKGQLHRTDPRILSGREDIAKAGANARPNSNDR AHCAQGEPCTDVLFSPATWSFDTQTAPETEVSAANPPTGHFCQAEPESAQSCSRPHQAGA RATVRAQKLPPLGLPGPATPLTPLATTTVLGQRQPADGTEMRPWKLSPRPRAPSDQPHPL DFKLGALCHQEAPSPPPLKELEKRQTMLLWDVDHLELQAGEAAARGIGPEGPSPPPLRAA QRQCGSQGTFHLHLETTVTHAPSLALPRLPWPLHLGKGYGGRGFGFLALASAKSPKHLVE ENGCTWEPRGLHGGPKETRPQPQMGRECALHADERGLAAFLGPPGAAAGRAGAEGREGAA VPPLDAPPGTGGHPAARAREARFSLFISPKWDLK >gi568815587f:2803346_3025194|GENSCAN_predicted_CDS_1|3525_bp atggagagcgggactgtgccgcccgcggtcgctgcactgggcctcctcgcccccccactc cgggcccggggccagcgcgatgtcatgagattgggtttactgttccctggtactcagacc tgcttaatgctcatttcttcagttttctggggaagcctcaggggccttactgataagaac tgtgtggaaggcggggcccttgccgagcgtcagggtggctggcagccaccttctgtgatc gtcctccttctgggctgcggtctagggctgcacattcaggatggggatgtcccagtgggc gcgcagcacaagggtcagcagacccctggctggaggaggaagcaatccaggcacccttcc atggccgtgggccccacgcccccgaggccccacgcccccggggccccatgcctgcccttg caacatccccttcctgactccacccagcctcctgggggcttccgcctcaaggcctctgtg cctgtcctgtgcctgtgcctggaggctgtggcctggctctccagctggaacagaagctct gagaagacagagctggccagcaccatctctcctgtgagaaccctgtgcctagaacggagc ctggtacagaggaagaacccgacagactgccaactctgcgcacgctcctttcaagggtta acggtgcaagctcctcacggcaccagtgcgcttgcctttgacatgaaaatgtttttgcag caggctcggatgccagcatttttgtggttgcagtcagatgtttttaactacgttggacag aacccacggcggccctttccgtggcttcagccagtttctgagcgctaccttgaacccctg ttgcaccccaggaggagaatcacaccagtgccccctgacccccagccttgtgcagacact gatacccttaggatgaagctgaccccagtcgtggcagccaccccagcagtcagggaagga cctgctggggcatttcagatcctgagcagcatgtaccctgtgaaagaacaggttccccaa ttttggttcatggtacctgagatgacggcttcagatgatgtccctggccagggagggtgg ggtgggtctgtcaagtgttcggtggcccctggctggtgctgtagcaacacgggtgtcaga gggcggacaccgaagcctggtgctctgtgcgagtcccgtttcctcgcctctcagctgtgt gactccagcttcccagggtctcctggcagtaaccaggccagctgggggtcagaacccgcg gccaggcccagcagccgcctcttggcagcgctgatggaaaccgccgtgcggttccgcgca gacagtcacggaaactcaggcagacccggcgttggagtgacctggacactagcctcctcc cagtgggagacagatgtgggcaatgcagctgctgttccctgctcagggcaggctggtgcc caaggactgcccaccagaccaggggataagagaggcaaaatggaggttcagagaggtgag caaacaccaggtcccatgtttgcacctggctgtgaccgtgcagtccactgggcccagcag caggaactggggcaagtgccccgagacgggaccgagacaacagtggggaaggccccttcc aagggaagagggcaaaggggctccagtgccacccagggagtgttgtgccacgtgggcccc cctggctctgccactgactccctccatgacctctgtccccaccctgaagttgacatgaat gtccgcacggagcagcctctgcaggtctgtgaggctgtggatgtgacagctctggctctg gcacaaacgtcagccccaacccagatgcagggagggaggtggctgcgtgtccaggagcac ggggccatgtggcaccggctcatgtcagcagacacggcacacatagaccccctgggggct atgacgagccccaccacaggcgccagtcccgagcctgagcacaggggctgccacgacgct cgactctctgggccaagactcacacacagcctggacactgcccgggtgaagggctctggg ggacagggggagcagctggtgacgcagctggaccagaggctggcactcatcaccgacatg cttcaccagctgctctccttgcacggtggcagcacccccggcagcggcggcccccccaga gagggcggggcccacatcacccagccctgcggcagtggcggctccgtcgaccctgagctc ttcctgcccagcaacaccctgcccacctacgagcagctgaccgtgcccaggaggggcccc gatgaggggtcctgctatgacctcctggccctcctcttcagcaccccgtaccaggcctcc tacaactcccctcccctgcagcccccacagctcaaggcacctgagccaagaagccacaaa gcactgtgcctgcctgatacagccaagggtcagctccacagaactgaccccaggatcctc agtggaagagaagatattgcaaaagccggcgccaacgcaagaccaaacagcaatgacaga gctcactgtgctcagggagaaccatgtactgacgtcctcttcagtccagcaacctggagc tttgacacacaaacagcccctgagacagaggtctccgcagcaaatccacccacgggacac ttctgccaggcagagcctgagtcagcacagagctgttcccggccccatcaagcaggtgca cgggcaacagtccgggctcagaagctgcctcctctcgggctcccaggcccggccactcct ctgacaccactggctacgaccactgttctggggcaaagacagccagctgatgggacagag atgaggccctggaagctcagccctcgtccaagggcacccagtgaccaaccgcacccgctg gacttcaaacttggagctctgtgccaccaggaagccccctcacccccaccactgaaggag ctcgagaaacgccagactatgctgctttgggatgtagaccacctggagctgcaggcagga gaggcggccgccaggggcatcggcccagaaggcccctcacctcctccgctgagggctgcc cagaggcaatgtgggtcccaggggacttttcatctgcatctggagaccaccgtcactcat gctccctccctggccctcccacgactgccctggcctctgcatttgggcaaaggctatgga ggaagaggctttggctttttggccctggcttcagccaaatcccccaagcacctcgtggag gagaatggctgcacctgggagccacgtgggctccatggaggccctaaagagaccaggcct cagccccagatgggccgcgagtgtgcgcttcatgcagatgagcgcggcctcgccgccttc ctcggacctccgggagccgcggccgggcgtgctggggccgagggcagggagggggccgct gtccctccactcgacgcgccgccgggcacgggcgggcacccagcggcccgggcccgggaa gcgcgcttctctcttttcatttctccaaaatgggatcttaagtga >gi568815587f:2803346_3025194|GENSCAN_predicted_peptide_2|346_aa MGPCVCDRDAAARAVPAGRSRVPDGAGRAGAGRASAAGAGAPSGRGDPEGQRSRAARRDG RWAPKARSRGQARGGQQDRCTDGRTDGRADRCADRRLSRRPGAVVPVQAASSGGAGSAAA AAPLAGAESAGTQAQGPGFPLSGHLRACPAPARPSEKAIPPQRGEAVAVMGRKWSGPTAE HQLPMPPPGVRLDSWKGVASGCSPSKASQEARGKEKCPTLNGQPQQLRCFYYSLLQSLRG SPLLADKIQSLAPVHQARGEKSHCCVPGGPGVIAHMMVVDPQWSWSVTPTRQTHECAGCQ PVPVFSAWMRPGGQDARSYLLFPNSMGEKRALQTVPSGIISMAVPT >gi568815587f:2803346_3025194|GENSCAN_predicted_CDS_2|1041_bp atgggcccctgtgtgtgcgaccgggacgcagcggcgcgggccgtgccagccgggcggagc cgggtgcccgacggcgcaggacgagcgggcgcaggacgagcgagcgcagcgggcgcgggc gcaccgtcggggcgcggggaccccgagggccagcggagccgtgctgcgcgcagggatgga cgctgggcgcccaaggcccgttcccgtggacaggcgcgcggagggcagcaggaccgatgc accgacggacgcaccgacggacgcgcggacagatgcgccgaccgcaggctctcccgccgt cccggggccgtcgtgcccgtgcaggccgcgagcagcggcggggccggctctgcggctgca gcagccccattagccggggctgagtcggcagggacgcaggctcagggccctggattcccg ctttctggacacttgcgtgcatgcccagcaccagctcgcccttctgagaaagcaatccct cctcagaggggagaagccgtggctgtgatgggcaggaagtggtcaggacccactgccgag caccaactccccatgccaccaccaggggtgcgcctggactcctggaaaggggtcgcgtca gggtgcagcccatcaaaggcttcccaagaggcaagaggcaaggagaagtgtcctacttta aacggccagcctcagcagctgagatgcttctactactccctgcttcagtccctccggggc tccccactgctggctgacaaaatccaatctttggcaccagtccatcaggcccggggagag aaaagccattgctgtgtgcctggcggtcctggtgtcatcgctcatatgatggttgtggat ccccagtggagctggagcgtgactcccacacggcagacccatgagtgtgctggctgccag ccggtgcctgtgttttcagcttggatgaggcctgggggccaggacgcacgctcctatctg ctctttcccaacagcatgggtgagaagagagctttgcaaacagtgccatcaggtattatt tccatggctgtccccacgtga >gi568815587f:2803346_3025194|GENSCAN_predicted_peptide_3|362_aa MSDASLRSTSTMERLVARGTFPVLVRTSACRSLFGPVDHEELSRELQARLAELNAEDQNR WDYDFQQDMPLRGPGRLQWTEVDSDSVPAFYRETVQVGRCRLLLAPRPVAVAVAVSPPLE PAAESLDGLEEAPEQLPSVPVPAPASTPPPVPVLAPAPAPAPAPVAAPVAAPVAVAVLAP APAPAPAPAPAPAPVAAPAPAPAPAPAPAPAPAPAPDAAPQESAEQGANQGQRGQEPLAD QLHSGISGRPAAGTAAASANGAAIKKLSGPLISDFFAKRKRSAPEKSSGDVPAPCPSPSA APGVGSVEQTPRKRLRLEPKEPRGNLPGQRTLEGRWASAGTVHVAATGGGCRRAAFGFVF KF >gi568815587f:2803346_3025194|GENSCAN_predicted_CDS_3|1089_bp atgtccgacgcgtccctccgcagcacatccacgatggagcgtcttgtcgcccgtgggacc ttcccagtactagtgcgcaccagcgcctgccgcagcctcttcgggccggtggaccacgag gagctgagccgcgagctgcaggcccgcctggccgagctgaacgccgaggaccagaaccgc tgggattacgacttccagcaggacatgccgctgcggggccctggacgcctgcagtggacc gaagtggacagcgactcggtgcccgcgttctaccgcgagacggtgcaggtggggcgctgc cgcctgctgctggcgccgcggcccgtcgcggtcgcggtggctgtcagcccgcccctcgag ccggccgctgagtccctcgacggcctcgaggaggcgccggagcagctgcctagtgtcccg gtcccggccccggcgtccaccccgcccccagtcccggtcctggctccagccccggccccg gctccggctccggtcgcggctccggtcgcggctccggtcgcggtcgcggtcctggccccg gccccggccccggctccggctccggctccggccccggctccagtcgcggccccggcccca gccccggccccggccccggccccggcccccgccccggccccggccccggacgcggcgcct caagagagcgccgagcagggcgcgaaccaggggcagcgcggccaggagcctctcgctgac cagctgcactcggggatttcgggacgtcccgcggccggcaccgcggccgccagcgccaac ggcgcggcgatcaagaagctgtccgggcctctgatctccgatttcttcgccaagcgcaag agatcagcgcctgagaagtcgtcgggcgatgtccccgcgccgtgtccctctccaagcgcc gcccctggcgtgggctcggtggagcagaccccgcgcaagaggctgcgtttagagcccaaa gagccccgagggaacctgccggggcagcggacgttggaagggcgctgggcctcggctggg accgttcatgtagcagcaaccggcggcggctgccgcagagcagcgttcggttttgttttt aaattttga >gi568815587f:2803346_3025194|GENSCAN_predicted_peptide_4|504_aa MTEDRPRALILPGDRPWSLGARAPCILDRRSRETQERSKQEAGGAGGKSPVDPGRGAESL GPRQHPPGLACKRGQHCRTPSPVELSCYLDLRGHIRGSPSSTFDSSGLQAHSLPGRGDRD KLLPPTPAAECCLSPPFPARPPTGGNWILRLQRTSPNFATLTADCTSEPRGATRDASTQR KRLWLWASEPVRPQPQGIHRFREARRQFCRMRGSRLTGGRKGFGSSGLRFGRGGFSEEVM PQPVLKAPAVQVHVRKWAVRARAPEEALPTGASPLASSEPGNPDGLRAKGPVPDTSVQME GGQQRAMGFGESPVVLDGLAGSFREPGYLVSTGGSHTGISVRTGIPSPVKPPAAHIPDGA MNMPTRQLYSCHTREGEASGSRGGTGVEVTRQKAHVAKDMASWAYPRSATHGPVCLPLEA SVSSAVKRAYDIDRPPMRLWQCGVQRVLGGSLLMAPQGLQPPSGQQRAQKDCLGSSDVTA WKWCTAFLTTFPAKTGPAAHLCAR >gi568815587f:2803346_3025194|GENSCAN_predicted_CDS_4|1515_bp atgaccgaggaccggcctagagcgctcatcctgccgggggaccggccctggtccctggga gcccgagctccctgcatcctggacaggcgcagcagggagacccaggagagatccaagcag gaggctggcggagcagggggcaaaagtccagttgatccgggcaggggtgctgagtcactg ggtcccaggcagcacccgcctggccttgcctgcaagagaggacagcactgcaggaccccc tccccagtggagctctcctgctacctggaccttagaggacacatcagggggtccccctcc tccacttttgacagctctggtctccaggcacacagcctcccggggaggggagacagggac aagctgctccccccgacccctgcagcggagtgctgtctgtccccacccttccccgcccgg cctccaactggtgggaactggattttgcggctgcagaggacttccccaaattttgccaca ttaacagctgactgcacatctgagccccgaggggccaccagagatgccagcacacagagg aagcggctttggttgtgggccagcgagcctgtgcgtccccagccccagggcattcaccgc ttcagagaagcaaggcgacagttctgccggatgagaggatccaggctgactggaggaagg aagggctttggctcatctgggctcaggtttgggagaggtggcttctcagaggaggtgatg cctcagccagtcttaaaggccccggctgtgcaggtgcacgtgaggaagtgggctgtcagg gccagggccccggaggaggctctgccaacaggggcttctccactggcctctagtgaacca gggaacccagacgggcttagggccaagggcccagtccctgacaccagtgtccaaatggag ggagggcagcaacgtgctatgggctttggggagagtcctgtggtcctggatggacttgct ggcagcttcagggagcctggctatttggtttccaccgggggcagccacactggcatctca gtgaggacgggcattcctagccctgtgaagccacctgcggcacacattcctgatggggcc atgaacatgccgacaaggcagctgtacagctgccacacccgggaaggggaggcctcgggc agccggggaggcactggagtggaggtcacccgccagaaagcacatgtggccaaggacatg gcttcctgggcctaccctcgttctgctactcatggaccagtgtgtttgcctctggaagcc tctgtctcctcggctgtaaagcgggcctacgacattgaccgaccacccatgaggttgtgg caatgcggtgtgcagagggtgcttggtggttctctcctgatggccccgcagggtctgcag cctccatctggccagcagagggcgcagaaggactgcctgggcagctcggacgtgaccgcc tggaagtggtgtacagcgttcctgacaacgttcccggccaaaacgggtcccgccgcccac ttgtgtgcaagataa >gi568815587f:2803346_3025194|GENSCAN_predicted_peptide_5|608_aa MYLSRKLGLDSIAFGYLQTTFGVLQLLGGPVFGRFADQRGARAALTLSFLAALALYLLLA AASSPALPGVYLLFASRLPGALMHTLPAAQMVITDLSAPEERPAALGRLGLCFGVGVILG SLLGGTLVSAYGRQDLPIPSRGITGQVSRWGLMAGKLTVQPESQALVVAQEAEAVARRIQ CPAILAALATLLGAVLSFTCIPASTKGAKTDAQAPLPGLNGLSVPGGPRASVFDLKAIAS LLRLPDVPRIFLVKVASNCPTGLFMVMFSIISMDFFQLEAAQAGYLMSFFGLLQMVTQGL VIGQLSSHFSEEVLLRASVLVFIVVGLAMAWMSSVFHFCLLVPGLVFSLCTLNVVTDSML IKAVSTSDTGTMLGLCASVQPLLRTLGPTVGGLLYRSFGVPVFGHVQVAINTLVLLVLWR KPMPQRKDKVRTVPRRLPATRTPEPHQTLPATRTPEPHQTLPATRTPEPHQTLPATRTPE PHQTVPATRTPEPHQTLPATRTPEPHQTVPATRTPEPHQTLPATRTPEPHQTLPATRTPE PHQTLPATRTPEPYQTLPATRTPEPYQTLPATRTPEPYQTLPATRIPELHQTLPATRTPE PHQTLPAT >gi568815587f:2803346_3025194|GENSCAN_predicted_CDS_5|1827_bp atgtacctgtctcggaaactgggcctggattccattgccttcggctacctgcaaaccacc ttcggggtgctgcagctgctgggcgggccggtatttggcaggttcgcagaccagcgcggg gcgcgggcggcgctcacgctctccttcctggctgccttggcgctctacctgctcctggcg gccgcctccagcccggccctgcccggggtctacctgctcttcgcctcgcgcctgcccgga gcgctcatgcacacgctgccagccgcccagatggtcatcacggacctgtcggcacccgag gagcggcccgcggccctgggccggctgggcctctgcttcggcgtcggagtcatcctcggc tccctgctgggcgggaccctggtctccgcgtacgggagacaagacctgccgattccttcc aggggaatcacaggccaagtttccaggtggggactcatggcagggaagctgacggtacag ccagagtcccaggccttggtagtggcccaggaagccgaggctgttgcccgcaggattcag tgcccggccatcctggctgccctggccaccctcctgggagctgtcctcagcttcacctgc atccccgccagcaccaaaggggccaaaactgacgcccaggctccactgccaggcctgaat ggcctctccgtgccaggcggcccccgggccagtgtgttcgacctgaaggccatcgcctcc ctgctgcggctgccagacgtcccgaggatcttcctggtgaaggtggcctccaactgcccc acagggctcttcatggtcatgttctccatcatctccatggacttcttccagctggaggcc gcccaagctggctacctcatgtccttcttcgggctcctccagatggtgacccagggcctg gtcatcgggcagctgagcagccacttctcggaggaggtgctgctccgggccagcgtgctg gtcttcatcgtggtgggcctggccatggcctggatgtccagcgtcttccacttctgcctc ctggtgcccggcctggtgttcagcctctgcaccctcaacgtggtcaccgacagcatgctg atcaaggctgtctccacctcggacacagggaccatgctgggcctctgcgcctctgtacaa ccactgctccgaactctgggacccacggtcggcggcctcctgtaccgcagctttggcgtc cccgtcttcggccacgtgcaggttgctatcaatacccttgtcctcctggtcctctggagg aaacctatgccccagaggaaggacaaagtccgaactgtaccaaggaggctgccagccacc cgaaccccagaaccgcaccagacactgccagccacccgaaccccagaaccgcaccagaca ctgccagccacccgaaccccagaaccgcaccagacactgccagccacccgaaccccagaa ccgcaccagacagtgccagccacccgaaccccagaaccgcaccagacactgccagccacc cgaaccccagaaccgcaccagacagtgccagccacccgaaccccagaaccgcaccagaca ctgccagccacccgaaccccagaaccgcaccagacactgccagccacccgaaccccagaa ccgcaccagacactgccagccacccgaaccccagaaccgtaccagacactgccagccacc cgaaccccagaaccgtaccagacactgccagccacccgaaccccagaaccgtaccagaca ctgccagccacccgaatcccagaactgcaccagacactgccagccacccgaaccccagaa ccgcaccagacactgccagccacatga >gi568815587f:2803346_3025194|GENSCAN_predicted_peptide_6|183_aa MGGDGRGPTRARTRRCGPGRLGLGRSRARHDMKSPDEVLREGELEKRSDSLFQLWKKKRG VLTSDRLSLFPASPRARPKELRFHSILKVDCVERTGKYVYFTIVTTDHKEIDFRCAGESC WNAAIALALIDFQNRRALQDFRSRQERTAPAAPAEDAVAAAAAAPSEPSEPSRPSPQPKP RTP >gi568815587f:2803346_3025194|GENSCAN_predicted_CDS_6|552_bp atgggaggggacggccggggaccgaccagagcgcggacgcgacggtgcgggcccggacgg ctgggcttggggaggtcccgcgctcggcacgacatgaaatcccccgacgaggtgctacgc gagggcgagttggagaagcgcagcgacagcctcttccagctatggaagaagaagcgcggg gtgctcacctccgaccgcctgagcctgttccccgccagcccccgcgcgcgccccaaggag ctgcgcttccactccatcctcaaggtggactgcgtggagcgcacgggcaagtacgtgtac ttcaccatcgtcaccaccgaccacaaggagatcgacttccgctgcgcgggcgagagctgc tggaacgcggccatcgcgctggcgctcatcgatttccagaaccgccgcgccctgcaggac tttcgcagccgccaggaacgcaccgcacccgccgcacccgccgaggacgccgtggctgcc gcggccgccgcaccctccgagccctcggagccctccaggccatccccgcagcccaaaccc cgcacgccatga >gi568815587f:2803346_3025194|GENSCAN_predicted_peptide_7|197_aa MLSVNILSVNTLSVNMLSMLGVNMLSVNMLSVNMLGVNTLGVNTLGVNMLCENARCEYAW CEYACNMLGVNMLCVNMLCVKMLSVNTLSVNTLGVNMLGVNMLGVLPRLHVEVLHPDVLG TGQHPRTMHSQGQTHWRTQEPSSVKDARRPGQCGHGHPRLPESAVTHLKTPPTSRQAVTP NPGPSDRMNGGAPQWGY >gi568815587f:2803346_3025194|GENSCAN_predicted_CDS_7|594_bp atgcttagtgtgaatatactcagtgtgaatacgctcagtgtgaatatgctgagtatgctc ggtgtgaatatgctcagtgtgaatatgctcagtgtgaatatgctcggtgtgaatacgctc ggtgtgaatacgctcggtgtgaatatgctgtgtgaaaatgctcggtgtgaatacgcttgg tgtgaatacgcttgtaatatgctcggtgtgaatatgctttgtgtgaatatgctttgtgtg aaaatgcttagtgtgaatacgctcagtgtgaatacgctcggtgtgaatatgctcggtgtg aatatgctcggtgtgctccccaggcttcacgttgaggtcctgcacccagatgtccttgga actgggcagcatccacgaaccatgcactcccaaggtcagactcactggcggacacaggaa cccagctcagtgaaggatgcacgcaggcctggtcagtgcggacacggccatccccggctg ccggagagcgccgtcacccacttgaaaaccccacccaccagccgccaagcggtcacacca aacccaggaccttcagacaggatgaatggtggggccccgcaatggggctactga >gi568815587f:2803346_3025194|GENSCAN_predicted_peptide_8|423_aa MVEERDKNIQMADHSFSDGVPSDSVEAAKNASNTEKLTDQVMQNPRVLAALQERLDNVPH TPSSYIETLPKAVKRRINALKQLQVRCAHIEAKFYEEVHDLERKYAALYQPLFDKRREFI TGDVEPTDAESEWHSENEEEEKLAGDMKSKVVVTEKAAATAEEPDPKGIPEFWFTIFRNV DMLSELVQEYDEPILKHLQDIKVKFSDPGQPMSFVLEFHFEPNDYFTNSVLTKTYKMKSE PDKADPFSFEGPEIVDCDGCTIDWKKGKNVTVKTIKKKQKHKGRGTVRTITKQVPNESFF NFFNPLKGKSVASGDGESLDEDSEFTLASDFEIGHFFRERIVPRAVLYFTGEAIEDDDNF EEGEEGEEEELEGDEEGEDEDDAEINPKGFQLLPSPFMGRPLELHHPKEPQNPPAEHAMN LNA >gi568815587f:2803346_3025194|GENSCAN_predicted_CDS_8|1272_bp atggtggaggagcgggataaaaacattcagatggcagatcacagtttttcagatggggtt ccttcagattccgtggaagctgctaaaaatgcaagtaacacagaaaagctcacagatcag gtgatgcagaatcctcgagttctggcagctttacaggagcgacttgacaatgtccctcac accccttccagctacatcgaaactttacctaaagcagtaaaaagaagaattaatgcattg aaacaacttcaggtgagatgtgctcacatagaagccaagttctatgaagaggtacatgac ttggaaagaaagtatgcagcgctataccagcctctctttgacaagagaagagaatttatc accggcgatgttgaaccaacagatgcggaatcggaatggcacagtgaaaatgaagaggaa gagaaattggctggagacatgaaaagtaaagtagtcgtcacagaaaaagcagcggcaacg gctgaagagccagatcccaaaggaattccagagttctggtttaccatcttcagaaatgtg gacatgctgagtgaattagtccaggaatatgatgaaccaatcttgaaacacctgcaggat attaaagtgaaattttctgaccctggacagcctatgtcttttgtgttagagttccacttt gaacccaacgactactttaccaactcagtcctgacaaaaacctacaagatgaaatcagaa ccagataaggctgatcccttttcctttgaaggtcctgagattgtggactgtgacgggtgt actattgactggaagaaaggaaagaatgttactgtcaaaaccatcaagaaaaagcagaag cataagggtcgaggcactgttagaacaattacgaaacaagtacccaatgagtcctttttc aacttcttcaatccattgaaaggcaagtctgttgcatccggggatggagaatcactggat gaagattctgaattcacattagcctctgattttgaaattggacactttttccgtgagcgg atagtcccgcgggctgtgctgtacttcactggggaggccatagaagatgatgacaatttt gaagaaggtgaagaaggagaagaggaggaattagaaggtgacgaggagggagaagacgag gatgatgcggaaattaaccccaaggggttccagctgcttcccagccctttcatgggcagg cccctggagctccaccaccccaaggagccccaaaaccctcctgctgaacatgccatgaat ttgaatgcttga >gi568815587f:2803346_3025194|GENSCAN_predicted_peptide_9|69_aa MAAAAPAPDPRPHPPVRARPPSPARRYRPSTRVSAPHASCGSGGDPSFARFGAAPPWPPT TPAPIGRGS >gi568815587f:2803346_3025194|GENSCAN_predicted_CDS_9|210_bp atggccgccgccgcgcccgcgccggacccgcggccccacccgcccgtccgggcccggccg ccctccccagcgcgccgttaccggcccagcacccgtgtctccgcacctcacgcctcctgc ggcagtggcggcgaccctagcttcgctcgctttggggctgcgccgccgtggcctccaaca acgccggctcccattggccgaggctcctag >gi568815587f:2803346_3025194|GENSCAN_predicted_peptide_10|673_aa XDLSISADRLSEKRSPNDFALWKASKPGEPSWPCPWGKGRPGWHIECSAMAGTLLGASMD IHGGGFDLRFPHHDNELAQSEAYFENDCWVRYFLHTGHLTIAGCKMSKSLKNFITIKDAL KKHSARQLRLAFLMHSWKDTLDYSSNTMESALQYEKFLNEFFLNVKDILRAPVDITGQFE KWGEEEAELNKNFYDKKTAIHKALCDNVDTRTVMEEMRALVSQCNLYMAARKAVRKRPNQ ALLENIALYLTHMLKWLLAETELERVAVSQGHPGAMRAPSGSGFWGSWSCCFHLFPGLGA PPPVNEEHSSQLLATCRLHLAAASDPSGQRLELNGNSGSESAKCGQLPCRVFHLNGCCGC GGAFSVLWTRIRIPQAASFGLCSQLIAKKQLRRMLEATVMPYLQVLSEFREGVRKIAREQ KVPEILQLSDALRDNILPELGVRFEDHEAYSPPAPHNAPRIRDGPSSSVPNRTEEQEQKR NVRISGHSAWGTLGCRQQDERHLGPQCTPWEVGHGGCCHQQVEEEKRKKKEEAARRKQEQ EAAKLAKMKIPPSEMFLSETDKYSKFDENGLPTHDMEGKELSKGQAKKLKKLFEAQEKLY KEYLQMAQNGSFQKYWEQRENAGEAMKQPPHLCRRWGILPDARLASSIRLRQREQNRAEE KDWERLEKQNHQV >gi568815587f:2803346_3025194|GENSCAN_predicted_CDS_10|2022_bp ngtgacctgagcatctctgcagaccgcctgagtgagaagcgctctcccaacgactttgcc ttatggaaggcctctaagcccggagaaccgtcctggccgtgcccttggggaaagggtcgt ccgggctggcatatcgagtgctcggccatggcaggcaccctcctaggggcttcgatggac attcacggaggtgggttcgacctccggttcccccaccatgacaatgagctggcacagtcg gaggcctactttgaaaacgactgctgggtcaggtacttcctgcacacaggccacctgacc attgcaggctgcaaaatgtcaaagtcactaaaaaacttcatcaccattaaagatgccttg aaaaagcactcagcacggcagttgcggctggccttcctcatgcactcgtggaaggacacc ctggactactccagcaacaccatggagtcagcgcttcaatatgagaagttcttgaatgag tttttcttaaatgtgaaagatatccttcgcgctcctgttgacatcactggtcagtttgag aagtggggagaagaagaagcagaactgaataagaacttttatgacaagaagacagcaatt cacaaagccctctgtgacaatgttgacacccgcaccgtcatggaagagatgcgggccttg gtcagtcagtgcaacctctatatggcagcccggaaagccgtgaggaagaggcccaaccag gctctgctggagaacatcgccctgtacctcacccatatgctgaagtggcttctggctgaa acagagctggagagggtagctgtgagtcagggccatcctggagcaatgagggcaccctca ggctcgggtttctggggctcctggagctgctgcttccatctctttcctggtcttggggca ccccctcccgtgaacgaggaacacagctcacagctgctcgccacatgccgccttcatctt gccgcagcttcagacccttccggacagcgcctcgagctaaatggcaattctggctctgag agtgctaagtgtggtcagctgccctgccgtgtttttcatcttaacggctgctgtggttgt gggggtgccttcagcgtgctgtggacacgcatccgcatcccccaggcagcctcatttggc ctttgcagtcagctgattgcgaagaagcagctgcggcgaatgctcgaggccacagtcatg ccctaccttcaggtgttatcagaattccgagaaggagtgcggaagattgcccgagagcaa aaagtccctgagattctgcagctcagcgatgccctgcgggacaacatcctgcccgagctt ggggtgcggtttgaagaccacgaagcatactccccacctgcaccccacaacgcccccagg atcagggatggcccaagttcctcggtgcccaataggacagaagaacaggaacaaaaacgc aatgtcagaatctctgggcacagcgcatggggtaccctgggatgcaggcagcaggatgag aggcacctgggcccccagtgcacaccatgggaggtgggccatgggggctgttgccaccag caggttgaagaggagaagaggaagaagaaagaggaggcggcccggaggaaacaggaacaa gaagcagcaaagctggccaagatgaagattccccccagtgagatgttcttgtcagaaacc gacaaatactccaagtttgatgaaaatggtctgcccacacatgacatggagggcaaagag ctcagcaaagggcaagccaagaagctgaagaagctcttcgaggctcaggagaagctctac aaggaatatctgcagatggcccagaatggaagcttccagaaatactgggaacagagggag aatgcaggtgaggccatgaagcagccaccacatttgtgcagaaggtggggcattctgcca gatgcccggctggcatcttcaatacgtctgcgtcaaagggagcagaatagggcagaagag aaggactgggagagactcgagaaacaaaaccaccaggtttga