GENSCAN 1.0 Date run: 7-Nov-116 Time: 19:10:01

Sequence gi568815576f:37723407_37925467 : 202061 bp : 52.22% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    273   3097 2825  2  2   81   99   839 0.489  72.78
 1.02 Intr +   7114   7215  102  1  0   74   61    60 0.457   1.69
 1.03 Intr +   9892  10006  115  1  1   32   99    60 0.522   2.45
 1.04 Intr +  10993  12036 1044  0  0  109  110   197 0.326  15.37
 1.05 Intr +  17489  17616  128  1  2   38   43   128 0.003   3.28
 1.06 Term +  24474  24687  214  0  1   53   43   146 0.148   3.63
 1.07 PlyA +  25790  25795    6                               1.05

 2.00 Prom +  26236  26275   40                              -7.79
 2.01 Init +  27735  27794   60  2  0   69   97    75 0.981   6.16
 2.02 Intr +  28366  28422   57  0  0  129   94    69 0.977  11.17
 2.03 Intr +  31695  31784   90  2  0   50   66   216 0.927  16.29
 2.04 Intr +  32144  32253  110  1  2   66   70   154 0.990  10.98
 2.05 Intr +  34207  34732  526  1  1   87   80   796 0.999  72.24
 2.06 Intr +  35748  35858  111  2  0  113  110    69 0.971  12.68
 2.07 Intr +  42264  42411  148  2  1   88   80   228 0.346  22.32
 2.08 Intr +  44668  44770  103  2  1   94   63   141 0.999  11.93
 2.09 Intr +  45622  45781  160  1  1   92   80   314 0.999  31.50
 2.10 Intr +  45856  45969  114  0  0   93  105   152 0.997  18.55
 2.11 Intr +  48244  48462  219  0  0   63   53   187 0.764  11.73
 2.12 Intr +  48937  49137  201  0  0   64   16    96 0.354   0.00
 2.13 Term +  49195  49356  162  0  0  138   44   337 0.928  32.55
 2.14 PlyA +  51085  51090    6                               1.05

 3.04 PlyA -  53266  53261    6                               1.05
 3.03 Term -  60665  60551  115  1  1   73   47    74 0.281   0.15
 3.02 Intr -  61594  61433  162  0  0  127   94   -31 0.290   1.01
 3.01 Init -  67357  67206  152  1  2  114   66    95 0.729   9.38
 3.00 Prom -  77019  76980   40                              -2.11

 4.00 Prom +  77138  77177   40                              -3.11
 4.01 Sngl +  82139  82723  585  1  0  116   48  1040 0.991  99.37
 4.02 PlyA +  83232  83237    6                              -0.45

 5.00 Prom +  83804  83843   40                              -0.51
 5.01 Init +  84562  84757  196  0  1   72   82   301 0.587  24.97
 5.02 Intr +  86621  86751  131  0  2  113   86   130 0.904  16.22
 5.03 Intr +  89481  89582  102  2  0   94   85   153 0.968  16.47
 5.04 Intr +  90057  90203  147  2  0   98   75   461 0.957  46.74
 5.05 Intr +  91720  91874  155  0  2   96   56   217 0.958  18.68
 5.06 Intr +  92012  92094   83  2  2   69   82    99 0.997   7.18
 5.07 Intr +  92257  92428  172  2  1  122   64   225 0.754  23.02
 5.08 Intr +  92794  92915  122  1  2  109   94    96 0.996  12.94
 5.09 Intr +  93161  93282  122  0  2   93    3   231 0.002  15.82
 5.10 Intr +  94372  94623  252  0  0   51   23   128 0.000   0.76
 5.11 Intr +  95117  95265  149  1  2   16   49    98 0.000  -1.76
 5.12 Intr +  99982 100359  378  1  0   95   85   595 0.039  54.54
 5.13 Term + 101317 102064  748  1  1   75   40  1124 0.999  99.67
 5.14 PlyA + 102421 102426    6                               1.05

 6.09 PlyA - 102603 102598    6                               1.05
 6.08 Term - 108611 108537   75  2  0   97   43    65 0.156   1.04
 6.07 Intr - 109338 109231  108  0  0   63   99    90 0.983   8.58
 6.06 Intr - 109703 109552  152  0  2  118  109   178 0.982  23.29
 6.05 Intr - 109800 109753   48  1  0   96   87    80 0.987   7.84
 6.04 Intr - 110349 110278   72  1  0   91   94    29 0.795   3.67
 6.03 Intr - 115192 115094   99  2  0   58   97   126 0.995  11.08
 6.02 Intr - 116828 116781   48  0  0   91  109    78 0.998   9.34
 6.01 Init - 120832 120505  328  1  1   80  101   522 0.995  50.19
 6.00 Prom - 123428 123389   40                              -5.11

 7.00 Prom + 125775 125814   40                              -7.89
 7.01 Init + 126044 126076   33  1  0   87   94    41 0.936   4.41
 7.02 Intr + 126609 126657   49  2  1  135   94   118 0.995  15.84
 7.03 Intr + 127874 128084  211  0  1   60   80   325 0.813  27.40
 7.04 Intr + 132159 132228   70  0  1   51   52    53 0.345  -2.32
 7.05 Intr + 139563 139632   70  2  1  122   64    72 0.967   7.45
 7.06 Intr + 139866 139939   74  1  2   92   70    76 0.713   5.82
 7.07 Intr + 146770 146941  172  0  1   86  105   221 0.870  23.63
 7.08 Intr + 150964 151118  155  2  2  123   59   176 0.980  18.50
 7.09 Intr + 152435 152605  171  1  0   87   91   275 0.984  28.35
 7.10 Intr + 154268 154765  498  1  0   86   91   814 0.963  75.27
 7.11 Intr + 163359 163439   81  2  0  132   78   130 0.995  16.73
 7.12 Term + 165020 165058   39  1  0  126   45    37 0.968   0.68
 7.13 PlyA + 166701 166706    6                               1.05

 8.00 Prom + 179307 179346   40                              -1.11
 8.01 Init + 182631 182776  146  2  2   84    7   172 0.951   6.26
 8.02 Intr + 182827 183162  336  1  0   61   68   438 0.639  34.29
 8.03 Intr + 188546 188594   49  2  1  138  111    75 0.961  13.97
 8.04 Intr + 188945 189086  142  1  1   87  100   182 0.997  19.74
 8.05 Intr + 194355 194389   35  1  2   66   96    41 0.421   1.13
 8.06 Intr + 195630 195772  143  2  2  126  117   139 0.981  20.26
 8.07 Intr + 198566 199020  455  2  2  107  101   366 0.235  33.28

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  17353  17209  145  1  1   96   91   114 0.964  12.87
S.002 Init +  71819  71867   49  1  1   86   58    39 0.880  -0.25
S.003 Term +  72467  72588  122  0  2  108   49    81 0.962   5.14
S.004 Term +  93161  93312  152  0  2   93   49   246 0.998  19.58
S.005 Init + 100001 100359  359  1  2   90   85   584 0.959  54.90

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576f:37723407_37925467|GENSCAN_predicted_peptide_1|1475_aa
RENPRTPCVQQDDPRASSPNRTTQRENSRTSCAQRDNPKASRTSSPNRATRDNPRTSCAQ RDNPRASSPSRATRDNPTTSCAQRDNPRASRTSSPNRATRDNPRTSCAQRDNPRASSPSR ATRDNPTTSCAQRDNPRASRTSSPNRATRDNPRTSCAQRDNPRASSPNRAARDNPTTSCA QRDNPRASRTSSPNRATRDNPRTSCAQRDNPRASSPNRATRDNPTTSCAQRDNPRASRTS SPNRATRDNPRTSCAQRDNPRASSPNRTTQQDSPRTSCARRDDPRASSPNRTIQQENPRT SCALRDNPRASSPSRTIQQENPRTSCAQRDDPRASSPNRTTQQENPRTSCARRDNPRASS RNRTIQRDNPRTSCAQRDNPRASSPNRTIQQENLRTSCTRQDNPRTSSPNRATRDNPRTS CAQRDNLRASSPIRATQQDNPRTCIQQNIPRSSSTQQDNPKTSCTKRDNLRPTCTQRDRT QSFSFQRDNPGTSSSQCCTQKENLRPSSPHRSTQWNNPRNSSPHRTNKDIPWASFPLRPT QSDGPRTSSPSRSKQSEVPWASIALRPTQGDRPQTSSPSRPAQHDPPQSSFGPTQYNLPS RATSSSHNPGHQSTSRTSSPVYPAAYGAPLTSPEPSQPPCAVCIGHRDAPRASSPPRYLQ HDPFPFFPEPRAPESEPPHHEPPYIPPAVCIGHRDAPRASSPPRHTQFDPFPFLPDTSDA EHQCQSPQHEPLQLPAPVCIGYRDAPRASSPPRQAPEPSLLFQDLPRASTESLVPSMDSL HECPHIPTPVCIGHRDAPSFSSPPRQAPEPSLFFQDPPGTSMESLAPSTDSLHGSPVLIP QVCIGHRDAPRASSPPRHPPSDLAFLAPSPSPGSSGGSRGSAPPGETRHNLEREEYTVLA DLPPPRRLAQRQPGPQAQCSSGGRTHSPGRAEVERLFGQERREEQPTGSRLGSCFIEAPI PQFGGKFTLRPSALTEKSEAAGAFQAQDEGRSQQPSQGQSQLLRRQSSPAPSRQVTMLPA KQAELTRRSQAEPPHPWSPEKRPEGDRQLQGSPLPPRTSARTPERELRTQRPLESGQAGP RQPLGVWQSQEEPPGSQGPHRHLERSWSSQEGGLGPGGWWGCGEPSLGAAKAPEGAWGGT
SREYKESWGQPEAWEEKPTHELPRELGKRSPLTSPPENWGGPAESSQSWHSGTPTAVGWG AEGACPYPRGSERRPELDWRDLLGLLRAPGEGVWARVPSLDWEGLLELLQARLPRKDPAG HRDDLARALGPELGPPGTNDVPEQESHSQPEGWAEATPVNGHSPALQSQSPVQLPSPACT STQWPKIKVTRGPATATLAGLEQTGPLGSRSTAKGPSLPELQADKRPAEGKAGSPLKGRL VTSWRMPGDRPTLFNPFLLSLGVLSCLSWGQHVLSKGKAAGSSVWGAWKMDSTSLLVAAA FLREVSSCDYRALSGGDGLARRQQCGQKRVVMFEK >gi568815576f:37723407_37925467|GENSCAN_predicted_CDS_1|4428_bp cgggaaaaccccaggacaccctgtgtccagcaggacgatcccagagcctcctctcccaac agaaccactcaacgagagaattccagaacatcctgtgcccagcgggacaatcccaaagcc tccagaacctcctctcccaatagagccacacgagacaaccccagaacatcctgcgcccag cgggacaatcccagagcctcctctcccagtagagctacacgagacaaccccacaacatcc tgtgcccagcgggacaatcccagagcctccagaacctcctctcccaatagagccacacga gacaaccccagaacatcctgtgcccagcgggacaatcccagagcctcctctcccagtaga gctacacgagacaaccccacaacatcctgtgcccagcgggacaatcccagagcctccaga acctcctctcccaatagagccacacgagacaaccccagaacatcctgcgcccagcgggac aatcccagagcctcctctcccaatagagctgcacgagacaaccccacaacatcctgtgcc cagcgggacaatcccagagcctccagaacctcctctcccaatagagccacacgagacaac cccagaacatcctgtgcccagcgggacaatcccagagcctcctctcccaatagagctaca cgagacaaccccacaacatcctgtgcccagcgggacaatcccagagcctccagaacctcc tctcccaatagagccacacgagataaccccagaacatcctgtgcccagcgggacaatccc agagcctcctctcccaacagaaccacccaacaagacagccccagaacatcctgtgcccga cgggacgatcccagagcctcctctcctaacagaaccatccaacaagagaaccccagaaca tcctgtgccctacgggacaatcccagagcctcctctcccagcagaaccatccaacaagag aaccccagaacatcctgtgcccaacgggacgatcccagagcctcctctcctaacagaacc acccaacaagagaaccccagaacatcctgtgcccgacgggacaatcccagagcctcctct cgcaacagaaccatccagcgagacaaccccagaacatcctgtgcccagcgggacaatccc agagcctcctctcctaacagaaccatccaacaagagaacctcagaacatcctgtacccga caggacaatcccaggacctcctctcccaatagagccacacgagacaaccccagaacatcc tgtgcccagcgggacaatctcagagcctcctctcccatcagagccacccaacaggacaac cccagaacttgtattcaacagaacatccccagatcatcttctacccaacaagacaaccct aaaacctcttgtaccaaacgagataacctcagacccacttgtacacagcgggaccgcaca cagtccttttcctttcaacgagacaaccctggaacctcctcatctcaatgctgcacccaa aaggagaatctgagaccatcatctccccaccgctccactcaatggaacaatcccaggaat 
tcatctccccatcgtactaacaaagacatcccctgggcctcgtttcccctccggccaact cagagtgatggtccccgaacctcttccccatctcgctccaagcaaagcgaggttccctgg gcatccatcgccctccggccaacccaaggtgacaggcctcagacatcctctcccagcagg ccagcccagcatgacccaccccagtcctcctttggccccacccagtacaacttgccatcc cgggccacctcttcctcccataacccaggccaccagagcacctcccgaacttcctcacct gtgtaccccgctgcctatggggctcccctgacctctcctgagccctcccagcctccatgt gctgtgtgcattgggcaccgggatgcccctcgagcctcttcgccccctcgctatttgcag cacgaccccttccccttcttcccagagccccgcgcccctgagagtgaaccgccccaccac gagcctccctatataccacctgctgtgtgcattggacaccgagatgccccccgggcgtcc tcgcccccccgccacacccaatttgaccccttccccttcctcccagacacatcagatgcc gagcatcagtgtcagtccccccaacacgagccccttcagctccctgcacctgtgtgtatt gggtaccgagatgcaccccgggcctcctccccaccacgccaggccccagagccttccctc ttattccaggacctccccagggccagcacagagagccttgtcccttccatggactctctg cacgagtgcccccacatccccacccctgtgtgcattgggcaccgggatgcaccctccttc tcatccccaccacgccaggctcctgagccatccctcttcttccaggatccccctggaact agtatggagagcctggccccctccactgactctctgcatggctccccagtgctgatcccc caagtgtgcatcgggcaccgggatgcaccccgagcctcctccccaccccgccacccaccc agtgacctagcgttcctggcaccctcaccttcaccgggcagctctgggggctcccggggc tcagcgcctcccggggagaccaggcacaacttggagcgggaggagtacactgtgctggcc gacctgcccccacccaggaggctggcccagagacagccagggccccaggcgcagtgcagc agcgggggccgcacccacagccctggccgtgcagaggtggagcgcctcttcgggcaagag cgcagggaggaacagcccactgggtcacgtctgggctcttgcttcatcgaagctcctata ccccagttcgggggaaaattcactctaaggccttcagctctcacagagaagtccgaggca gcgggggccttccaggcccaggacgagggacggtcacagcagcccagccaaggccagagc caacttctccgaagacagtccagccctgcccccagcaggcaggtgaccatgctccctgcc aaacaggcagaactgacccggcggagccaagcagagccccctcatccttggagtcctgag aagagacctgagggagatcggcagctccaggggtccccgctgccccccaggacatcagcc aggacccctgagagggagctgcggacacagagacctctggagagtggccaagcaggccca agacagcctctgggggtgtggcagagtcaggaggaaccgccagggtcccagggccctcat agacacctagaaaggagctggagcagccaggagggaggcctgggccctgggggctggtgg ggatgtggagagcccagcctgggggcagccaaagccccggagggagcatgggggggcact tccagggagtacaaggagagctgggggcagccagaggcctgggaggagaagcccactcat 
gagctccccagagaactaggaaagagaagcccactcacgagcccccctgagaactgggga ggccccgcagagtcctcacaatcctggcactctgggacacccactgctgtgggctggggg gcagagggagcgtgtccatacccgcgtggctctgagaggcgacccgagcttgactggagg gatctgcttggccttctccgggcaccaggagagggggtctgggcccgtgtccccagcctg gactgggagggcctcttggagctcctgcaggccaggctgccccgcaaggacccagctgga cacagggatgacctggccagggctttagggccagagctgggtcccccaggcacaaacgat gtccctgagcaggagtcacacagccagccagaaggctgggccgaggccaccccagtcaat ggacacagccccgcactgcagtcccagagcccggtccagctgcccagccctgcctgcacc tccacccagtggccaaagatcaaagtgacaagaggaccagcgaccgcaactctggcaggc ctggagcagacgggccccctggggagcaggagcactgcgaagggccccagcttgccagag ctgcaggcagacaagaggccagcagagggcaaggctgggagcccgctcaagggccgactg gtgacctcatggcggatgcccggggaccggcccacgctgttcaatccgttcctgctgtct ctgggggtcctcagttgcctgtcctgggggcagcacgtgctgagcaagggtaaggctgcc ggaagcagcgtgtggggtgcttggaagatggacagcacatccctgctggtggcagcagcc ttcctgagggaggtgtcctcctgtgattatagggccttgtcaggtggagatggactagcg aggagacagcagtgtggacagaaacgggtggtcatgtttgagaagtag >gi568815576f:37723407_37925467|GENSCAN_predicted_peptide_2|686_aa MLQLVAPRPRGCAPLGGTQKPDLLNFKKGWMSILDEPGEADELDGEIDLRSCTDVTEYAV QRNYGFQIHTKDAVYTLSAMTSGIRRNWIEALRKTVRPTSAPDVTKLSDSNKENALHSYS TQKGPLKAGEQRAGSEVISRGGPRKADGQRQALDYVELSPLTQASPQRARTPARTPDRLA KQEELERDLAQRSEERRKWFEATDSRTPEVPAGEGPRRGLGAPLTEDQQNRLSEEIEKKW QELEKLPLRENKRVPLTALLNQSRGERRGPPSDGHEALEKEVQALRAQLEAWRLQGEAPQ SALRSQEDGHIPPGYISQEACERSLAEMESSHQQVMEELQRHHERELQRLQQEKEWLLAE ETAATASAIEAMKKAYQEELSRELSKTRSLQQGPDGLRKQHQSDVEALKRELQVLSEQYS QKCLEIGALMRQAEEREHTLRRCQQEGQELLRHNQELHGRLSEEIDQLRGFIASQGMGNG CGRSNERSSCELEVLLRVKENELQYLKKEVQCLRDELQMMQKVGPSAGLGAVGDSGAIWM PSCEHLLCAKPSTRFILPQALLCFSHPLTALRSWSKSSPPKQDEDDNDACVPGWYQGGGR GKPRTHGHLGEPPGGMKRGICGEVWLLVLPMCPDKRFTSGKYQDVYVELSHIKTRSEREI EQLKEHLRLAMAALQEKESMRNSLAE >gi568815576f:37723407_37925467|GENSCAN_predicted_CDS_2|2061_bp atgctgcagctggtagcccccagaccccggggctgtgcccccctgggcggcacccagaag cccgatctgctcaacttcaagaagggatggatgtcgatcttggacgagcctggagaggca gatgagctggatggtgagatcgacctgcgttcctgcacggatgtcactgagtacgcggtg 
cagcgcaactatggcttccagatccacaccaaggatgctgtctataccttgtcggccatg acctcaggcatccggcggaactggatcgaggctctgagaaagaccgtacgtccaacttca gccccagatgtcaccaagctctcggactctaacaaggagaacgcgctgcacagctacagc acccagaagggccccctgaaggcaggggagcagcgggcgggctctgaggtcatcagccgg ggtggccctcggaaggcggacgggcagcgtcaggccttggactacgtggagctctcgccg ctgacccaggcttccccgcagcgggcccgcaccccagcccgcactcctgaccgcctggcc aagcaggaggagctggagcgggacctggcccagcgctccgaggagcggcgcaagtggttt gaggccacagacagcaggaccccagaggtgcctgctggtgaggggccgcgccggggcctg ggtgcccccctgactgaggaccagcaaaaccggcttagtgaggagatcgagaagaagtgg caggagctggagaagctgcccctgcgggagaataagcgggtgcccctcactgccctgctc aaccaaagccgcggagagcgccgagggcccccaagtgacggccacgaggcactggagaag gaggttcaggctcttcgggcccagctggaggcgtggcgtctccaaggggaggctcctcag agtgcactgagatcccaggaggatggccacatccccccgggctacatctcacaggaggca tgtgagcgcagcctggcagagatggagtcctcgcaccagcaggtgatggaggagctgcag cggcaccacgagcgggagctgcagcgcctgcagcaggagaaggagtggctcctggctgag gagacggcagccacggcctcagccattgaagccatgaagaaggcctaccaggaagagctg agccgagagctgagcaaaacacggagtctccagcagggcccggatggcctccggaagcag caccagtcagatgtggaggcactgaagcgagagctgcaggtgctatcggagcagtactcg cagaagtgcctggagattggggcactcatgcggcaggctgaggagcgcgagcacacgctg cgccgctgccagcaggagggccaggagctgctgcgccacaaccaggagctgcatggccgc ctgtcagaggagatagaccagctgcgcggcttcattgcctcgcagggcatgggcaatggc tgcgggcgcagcaacgagcggagttcctgcgagctagaggtgctgcttcgcgtaaaagaa aacgaactccagtacctaaagaaggaggtgcagtgcctccgggacgagctccagatgatg cagaaggtaggtccttccgctgggctgggggccgtcggggactctggagccatctggatg ccatcctgtgagcacctgctctgtgccaagccctccactcgcttcatccttcctcaggct ctcctgtgcttctcccatccactcactgccctgcggtcttggtcaaaatcttctcccccg aaacaggatgaggatgacaatgacgcctgtgtccctgggtggtaccagggaggtgggagg ggtaagcccagaacccacggccatcttggggagccacctggagggatgaagcgaggtatc tgcggggaggtctggctgctggtgctgcccatgtgcccggacaagcgcttcacctcggga aagtaccaggacgtctatgtggagctgagccacatcaagacacggtctgagcgggagatc gagcagctgaaggagcacctgcgtcttgccatggccgccctccaggagaaggagtcgatg cgcaacagcctggctgagtag >gi568815576f:37723407_37925467|GENSCAN_predicted_peptide_3|142_aa 
MPGWENIFSSMNHTLNALTGAVTSTRDNRDCSLVRAMPNWGPNAAGISPCRVSAGTGRVP SCLCHSGAWNRLDQDINRNNSCYSDSVGLATLRASLNPISSSHLRHTWPVQPAVLLLSED TATWEQLECGSGDATVEVTASR >gi568815576f:37723407_37925467|GENSCAN_predicted_CDS_3|429_bp atgcccggctgggaaaatattttctcctccatgaaccatactctcaatgcactaactggt gctgtgacaagcaccagggacaacagggactgctctctggtgagggccatgcccaactgg ggcccaaatgcagctggcatcagtccctgcagggtgagcgcagggacagggcgtgtccct tcatgtctgtgtcactcaggtgcctggaacaggcttgaccaggatattaacaggaataac agctgctactcagactcagtgggcctggccacgctcagggcctcacttaatcccatttct tcatctcatctcagacacacttggccagtgcagccagctgtcctgctcttgagtgaggac acagccacgtgggagcagctggaatgtggctctggagatgccacagttgaggtcacagct agtagatga >gi568815576f:37723407_37925467|GENSCAN_predicted_peptide_4|194_aa MTENSTSAPAAKPKRAKASKKSTDHPKYSDMIVAAIQAEKNRAGSSRQSIQKYIKSHYKV GENADSQIKLSIKRLVTTGVLKQTKGVGASGSFRLAKSDEPKKSVAFKKTKKEIKKVATP KKASKPKKAASKAPTKKPKATPVKKAKKKLAATPKKAKKPKTVKAKPVKASKPKKAKPVK PKAKSSAKRAGKKK >gi568815576f:37723407_37925467|GENSCAN_predicted_CDS_4|585_bp atgaccgagaattccacgtccgcccctgcggccaagcccaagcgggccaaggcctccaag aagtccacagaccaccccaagtattcagacatgatcgtggctgccatccaggccgagaag aaccgcgctggctcctcgcgccagtccattcagaagtatatcaagagccactacaaggtg ggtgagaacgctgactcgcagatcaagttgtccatcaagcgcctggtcaccaccggtgtc ctcaagcagaccaaaggggtgggggcctcggggtccttccggctagccaagagcgacgaa cccaagaagtcagtggccttcaagaagaccaagaaggaaatcaagaaggtagccacgcca aagaaggcatccaagcccaagaaggctgcctccaaagccccaaccaagaaacccaaagcc accccggtcaagaaggccaagaagaagctggctgccacgcccaagaaagccaaaaaaccc aagactgtcaaagccaagccggtcaaggcatccaagcccaaaaaggccaaaccagtgaaa cccaaagcaaagtccagtgccaagagggccggcaagaagaagtga >gi568815576f:37723407_37925467|GENSCAN_predicted_peptide_5|918_aa MWPGNAWRAALFWVPRGRRAQSALAQLRGILEGELEGIRGAGTWKSERVITSRQGPHIRV DGVSGGILNFCANNYLGLSSHPEVIQAGLQALEEFGAGLSSVRFICGTQSIHKNLEAKIA RFHQREDAILYPSCYDANAGLFEALLTPEDAVLSDELNHASIIDGIRLCKAHKYRYRHLD MADLEAKLQEAQKHRLRLVATDGAFSMDGDIAPLQEICCLASRYGALVFMDECHATGFLG PTGRGTDELLGVMDQVTIINSTLGKALGGASGGYTTGPGPLVSLLRQRARPYLFSNSLPP 
AVVGCASKALDLLMGSNTIVQSMAAKTQRFRSKMEAAGFTISGASHPICPVMLGDARLAS RMADDMLKRGIFVIGFSYPVVPKGKARIRVQISAVHSEEDIDRCVEAFVEAYDDQPPGGA ASTGRGRRLAPPLQTGGLGPRLFRLPSSQGQERRFAAAKAPSSLVPHNGSPLSCWRGLRE EGGLSLATLWTPCGPAAHVKRHGQLRGDVWQSVCCTRRGQNRGPRSTQTDVRRHEARSRQ RRRKCPSDGEMADAQNISLDSPGSVGAVAVPVVFALIFLLGTVGNGLVLAVLLQPGPSAW QEPGSTTDLFILNLAVADLCFILCCVPFQATIYTLDAWLFGALVCKAVHLLIYLTMYASS FTLAAVSVDRYLAVRHPLRSRALRTPRNARAAVGLVWLLAALFSAPYLSYYGTVRYGALE LCVPAWEDARRRALDVATFAAGYLLPVAVVSLAYGRTLRFLWAAVGPAGAAAAEARRRAT GRAGRAMLAVAALYALCWGPHHALILCFWYGRFAFSPATYACRLASHCLAYANSCLNPLV YALASRHFRARFRRLWPCGRRRRHRARRALRRVRPASSGPPGCPGDARPSGRLLAGGGQG PEPREGPVHGGEAARGPE >gi568815576f:37723407_37925467|GENSCAN_predicted_CDS_5|2757_bp atgtggcctgggaacgcctggcgcgccgcactcttctgggtgccccgcggccgccgcgca cagtcagcgctggcccagctgcgtggcattctggagggggagctggaaggcatccgcgga gctggcacttggaagagtgagcgggtcatcacgtcccgtcaggggccgcacatccgcgtg gacggcgtctccggaggaatccttaacttctgtgccaacaactacctgggcctgagcagc caccctgaggtgatccaggcaggtctgcaggctctggaggagtttggagctggcctcagc tctgtccgctttatctgtggaacccagagcatccacaagaatctagaagcaaaaatagcc cgcttccaccagcgggaggatgccatcctctatcccagctgttatgacgccaacgccggc ctctttgaggccctgctgaccccagaggacgcagtcctgtcggacgagctgaaccatgcc tccatcatcgacggcatccggctgtgcaaggcccacaagtaccgctatcgccacctggac atggccgacctagaagccaagctgcaggaggcccagaagcatcggctgcgcctggtggcc actgatggggccttttccatggatggcgacatcgcacccctgcaggagatctgctgcctc gcctctagatatggtgccctggtcttcatggatgaatgccatgccactggcttcctgggg cccacaggacggggcacagatgagctgctgggtgtgatggaccaggtcaccatcatcaac tccaccctggggaaggccctgggtggagcatcagggggctacacgacagggcctgggccc ctggtgtccctgctgcggcagcgcgcccggccatacctcttctccaacagtctgccacct gctgtcgttggctgcgcctccaaggccctagatctgctgatggggagtaacaccattgtc cagtctatggctgccaagacccagaggttccgtagtaagatggaagctgctggcttcact atctcgggagccagtcaccccatctgccctgtgatgctgggtgatgcccggctggcctct cgcatggcggatgacatgctgaagagaggcatctttgtcatcgggttcagctaccccgtg gtccccaagggcaaggcccggatccgggtacagatctcagcagtgcatagcgaggaagac attgaccgctgcgtggaggccttcgtggaagcctacgatgatcagccaccagggggtgct 
gcgagcacgggccgcggccgccggctcgccccgcccctccagactgggggccttgggccg cggctgttcaggctgcccagcagtcaaggccaggagaggcgatttgctgctgccaaggcc ccatcctcccttgtgccccacaacggctcgccgctttcctgttggaggggcctgcgggag gagggcggtctctccctggcgaccttgtggaccccttgtgggccagcagctcatgttaag cgccacgggcagctgcggggtgacgtgtggcagtcggtgtgctgcacccggcggggccag aacagagggcccaggtccacccagaccgacgtgaggcggcacgaggcgagatccagacag cggcgcagaaagtgcccgtctgatggggagatggctgatgcccagaacatttcactggac agcccagggagtgtgggggccgtggcagtgcctgtggtctttgccctaatcttcctgctg ggcacagtgggcaatgggctggtgctggcagtgctcctgcagcctggcccgagtgcctgg caggagcctggcagcaccacggacctgttcatcctcaacctggcggtggctgacctctgc ttcatcctgtgctgcgtgcccttccaggccaccatctacacgctggatgcctggctcttt ggggccctcgtctgcaaggccgtgcacctgctcatctacctcaccatgtacgccagcagc tttacgctggctgctgtctccgtggacaggtacctggccgtgcggcacccgctgcgctcg cgcgccctgcgcacgccgcgtaacgcccgcgccgcagtggggctggtgtggctgctggcg gcgctcttctcggcgccctacctcagctactacggcaccgtgcgctacggcgcgctggag ctctgcgtgcccgcctgggaggacgcgcgccgccgcgccctggacgtggccaccttcgct gccggctacctgctgcccgtggctgtggtgagcctggcctacgggcgcacgctgcgcttc ctgtgggccgccgtgggtcccgcgggcgcggcggcggccgaggcgcggcggagggcgacg ggccgcgcggggcgcgccatgctggcggtggccgcgctctacgcgctctgctggggtccg caccacgcgctcatcctgtgcttctggtacggccgcttcgccttcagcccggccacctac gcctgccgcctggcctcacactgcctggcctacgccaactcctgcctcaacccgctcgtc tacgcgctcgcctcgcgccacttccgcgcgcgcttccgccgcctgtggccgtgcggccgc cgacgccgccaccgtgcccgccgcgccttgcgtcgcgtccgccccgcgtcctcgggccca cccggctgccccggagacgcccggcctagcgggaggctgctggctggtggcggccagggc ccggagcccagggagggacccgtccacggcggagaggctgcccgaggaccggaataa >gi568815576f:37723407_37925467|GENSCAN_predicted_peptide_6|309_aa MAAAAGDADDEPRSGHSSSEGECAVAPEPLTDAEGLFSFADFGSALGGGGAGLSGRASGG AQSPLRYLHVLWQQDAEPRDELRCKIPAGRLRRAARPHRRLGPTGKEVHALKRLRDSANA NDVETVQQLLEDGADPCAADDKGRTALHFASCNGNDQIVQLLLDHGADPNQRDGLGNTPL HLAACTNHVPVITTLLRGECSPQLVPAGARVDALDRAGRTPLHLAKSKLNILQEGHAQCL EAVRLEVKQIIHMLREYLERLGQHEQRERLDDLCTRLQMTSTKEQVDEVTDLLASFTSLS LQMQSMEKR >gi568815576f:37723407_37925467|GENSCAN_predicted_CDS_6|930_bp 
atggcagccgccgccggggacgcggacgacgagccgcgctcaggccactcgagctcggag ggcgagtgcgcggtggcgccggagccgctgactgacgctgagggcctcttctccttcgct gacttcgggtctgcgctgggcggcggcggcgcgggcctctcgggccgggcgtccggcggg gcccagtcgccgctgcgctacttgcacgtcctgtggcagcaggatgcggagccgcgcgac gagctgcgctgcaagatacccgctggccggctgaggcgcgctgccaggccccaccggcgg ctcgggcccacgggcaaggaggtgcacgctctgaagagactgagggactcggccaatgcc aatgatgtggaaacagtgcagcagctgctggaagatggcgcggatccctgtgcagctgat gacaagggccgcacagctctacactttgcctcatgcaatggcaatgaccagattgtgcag ctgctcctggaccatggtgctgatcctaaccagcgagatgggctggggaacacgccactg cacctggcggcctgcaccaaccacgttcctgtcatcaccacactgctacgaggagaatgc tcacctcagctggtacctgcaggggcccgtgtagatgccctggaccgagctggtcgcaca cccctgcacctggccaagtcaaagctgaatatcctgcaggagggccatgcccagtgccta gaggctgtgcgtctggaggtgaagcagatcatccatatgctgagggagtatctggagcgc ctagggcaacatgagcagcgagaacgcctggatgacctctgcacccgcctgcagatgacc agtaccaaagagcaggtggatgaagtgactgacctcctggccagcttcacctccctcagt ctgcagatgcagagcatggagaagaggtag >gi568815576f:37723407_37925467|GENSCAN_predicted_peptide_7|540_aa MSYPADDYESEAAYDPYAYPSDYDMHTGDPKQDLAYERQYEQQTYQVIPEVIKNFIQYFH KTVSDLIDQKVYELQASRVSSDVIDQKVYEIQDIYENSWTKLTERFFKNTPWPEAEAIAP QGGPSLEQRFESYYNYCNLFNYILNADGPAPLELPNQWLWDIIDEFIYQFQSFSQYRCKT AKKSEEEIDFLRSNPKIWNVHSVLNVLHSLVDKSNINRQLEVYTSGGDPESVAGEYGRHS LYKMLGYFSLVGLLRLHSLLGDYYQAIKVLENIELNKKSMYSRVPECQVTTYYYVGFAYL MMRRYQDAIRVFANILLYIQRTKSMFQRTTYKYEMINKQNEQMHALLAIALTMYPMRIDE SIHLQLREKYGDKMLRMQKGDPQVYEELFSYSCPKFLSPVVPNYDNVHPNYHKEPFLQQL KVFSDEVQQQAQLSTIRSFLKLYTTMPVAKLAGFLDLTEQEFRIQLLVFKHKMKNLVWTS GISALDGEFQSASEVDFYIDKDMIHIADTKVARRYGDFFIRQIHKFEELNRTLKKMGQRP >gi568815576f:37723407_37925467|GENSCAN_predicted_CDS_7|1623_bp atgtcttatcccgctgatgattatgagtctgaggcggcttatgacccctacgcttatccc agcgactatgatatgcacacaggagatccaaagcaggaccttgcttatgaacgtcagtat gaacagcaaacctatcaggtgatccctgaggtgatcaaaaacttcatccagtatttccac aaaactgtctcagatttgattgaccagaaagtgtatgagctacaggccagtcgtgtctcc agtgatgtcattgaccagaaggtgtatgagatccaggacatctatgagaacagctggacc aagctgactgaaagattcttcaagaatacaccttggcccgaggctgaagccattgctcca 
caggggggaccttccttggagcagaggtttgaatcctattacaactactgcaatctcttc aactacattcttaatgccgatggtcctgctccccttgaactacccaaccagtggctctgg gatattatcgatgagttcatctaccagtttcagtcattcagtcagtaccgctgtaagact gccaagaagtcagaggaggagattgactttcttcgttccaatcccaaaatctggaatgtt catagtgtcctcaatgtccttcattccctggtagacaaatccaacatcaaccgacagttg gaggtatacacaagcggaggtgaccctgagagtgtggctggggagtatgggcggcactcc ctctacaaaatgcttggttacttcagcctggtcgggcttctccgcctgcactccctgtta ggagattactaccaggccatcaaggtgctggagaacatcgaactgaacaagaagagtatg tattcccgtgtgccagagtgccaggtcaccacatactattatgttgggtttgcatatttg atgatgcgtcgttaccaggatgccatccgggtcttcgccaacatcctcctctacatccag aggaccaagagcatgttccagaggaccacgtacaagtatgagatgattaacaagcagaat gagcagatgcatgcgctgctggccattgccctcacgatgtaccccatgcgtattgatgag agcattcacctccagctgcgggagaaatatggggacaagatgttgcgcatgcagaaaggt gacccacaagtctatgaagaacttttcagttactcctgccccaagttcctgtcgcctgta gtgcccaactatgataatgtgcaccccaactaccacaaagagcccttcctgcagcagctg aaggtgttttctgatgaagtacagcagcaggcccagctttcaaccatccgcagcttcctg aagctctacaccaccatgcctgtggccaagctggctggcttcctggacctcacagagcag gagttccggatccagcttcttgtcttcaaacacaagatgaagaacctcgtgtggaccagc ggtatctcagccctggatggtgaatttcagtcagcctcagaggttgacttctacattgat aaggacatgatccacatcgcggacaccaaggtcgccaggcgttatggggatttcttcatc cgtcagatccacaaatttgaggagcttaatcgaaccctgaagaagatgggacagagacct tga >gi568815576f:37723407_37925467|GENSCAN_predicted_peptide_8|436_aa MGRRPVLVGVGSALEAELLLGGARLRRGRRRRDARSHDRPTMRRAGSAELGQGRGAPERG HRGPASAPPLACRSAPELGAAAAAGNRARAAAAVPAKPGPRSQSRSRAGRGVMAGPRGAL LAWCRRQCEGYRGVEIRDLSSSFRDGLAFCAILHRHRPDLLDFDSLSKDNVFENNRLAFE VAEKELGIPALLDPNDMVSMSVPDCLSIMTYVSQYYNHFCSPGQAPTPVEPEDVAQGEEL SSGSLSEQGTGQTPSSTCAACQQHVHLVQRYLADGRLYHRHCFRCRRCSSTLLPGAYENG PEEGTFVCAEHCARLGPGTRSGTRPGPFSQPKQQHQQQLAEDAKDVPGGGPSSSAPAGAE ADGPKASPEARPQIPTKPRVPGKLQELASPPAGRPTPAPRKASESTTPAPPTPRPRSSLQ QENLVEQAGSSSLVNX >gi568815576f:37723407_37925467|GENSCAN_predicted_CDS_8|1308_bp atggggcgccggcccgtcttggtaggggtgggctccgcactggaggcggagctgctgctg ggtggggcgcgcctccggcgcggacggaggcggcgggacgcccgctcccacgaccggccc 
acaatgaggcgagcgggcagcgcggagttagggcagggccgcggggcgcccgagcgagga caccgcggccccgcctccgcccctcccctcgcctgccggtcggcgcccgagctcggagcc gcagccgcagccggaaaccgggcccgcgcggcggccgccgtcccggccaagccggggccc cgaagccagagccggagccgggcgggccgcggggtcatggctgggccgcggggcgcgctg ctggcctggtgccgccgccagtgcgagggctaccgcggcgtggagatccgcgacctgagc agctccttccgggacggcctggccttctgcgccatcctgcaccggcaccggcccgacctg ctagattttgattcgctttccaaggacaatgtcttcgagaataaccgtttggcctttgaa gtggctgagaaggagctggggatccccgctctcctggaccccaatgacatggtctccatg agcgtccctgactgcctcagcatcatgacctatgtgtcccagtattacaaccacttctgc agtcctggccaagcacccactccagtggaaccagaagatgtggctcagggcgaggagctc tcctcaggcagcctgtcagagcagggcaccggccagacccccagcagcacgtgcgcagcc tgccagcagcatgtgcacttggtgcagcgctacctggctgacggcaggctgtaccatcgc cactgcttccggtgtcggcggtgctccagcaccctgctccctggggcttatgagaatggg cctgaggagggcacctttgtgtgtgcagaacactgtgccaggctgggcccggggacacgg tcggggaccaggcctgggcccttctcacagccaaagcagcagcaccagcagcaactcgca gaagatgccaaggatgttccaggaggcggccccagctccagtgctcctgcaggggctgag gccgatggacccaaggccagccctgaggcccggccgcagatccctaccaagccccgggtt cctggcaaactacaggagctggccagcccccctgcgggccgccccacccctgcccccagg aaggcctctgagagcaccaccccagcaccccccacgccccggccccgctccagtctgcag caggagaacctggtggagcaggctggcagcagcagcctggtgaacgnn