GENSCAN 1.0 Date run: 3-Nov-116 Time: 23:44:14 Sequence gi568815576f:37705545_37906126 : 200582 bp : 52.13% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 414 563 150 2 0 70 60 94 0.606 4.82 1.02 Intr + 3156 3330 175 2 1 38 39 92 0.143 -0.67 1.03 Intr + 3897 4042 146 1 2 32 89 107 0.836 5.71 1.04 Intr + 4883 5022 140 1 2 50 90 110 0.993 7.17 1.05 Intr + 7666 7867 202 1 1 108 77 96 0.762 10.31 1.06 Intr + 10219 10390 172 0 1 87 96 -3 0.808 0.43 1.07 Intr + 11697 11832 136 1 1 33 -26 156 0.545 -1.17 1.08 Intr + 12307 12485 179 1 2 53 56 146 0.828 7.98 1.09 Intr + 12539 12966 428 0 2 43 -8 221 0.016 2.08 1.10 Intr + 18135 20959 2825 2 2 81 99 839 0.237 72.78 1.11 Intr + 24976 25077 102 1 0 74 61 60 0.456 1.69 1.12 Intr + 27754 27868 115 1 1 32 99 60 0.521 2.45 1.13 Intr + 28855 29898 1044 0 0 109 110 197 0.325 15.37 1.14 Intr + 35351 35478 128 1 2 38 43 128 0.003 3.28 1.15 Term + 42336 42549 214 0 1 53 43 146 0.148 3.63 1.16 PlyA + 43652 43657 6 1.05 2.00 Prom + 44098 44137 40 -7.79 2.01 Init + 45597 45656 60 2 0 69 97 75 0.981 6.16 2.02 Intr + 46228 46284 57 0 0 129 94 69 0.977 11.17 2.03 Intr + 49557 49646 90 2 0 50 66 216 0.927 16.29 2.04 Intr + 50006 50115 110 1 2 66 70 154 0.990 10.98 2.05 Intr + 52069 52594 526 1 1 87 80 796 0.999 72.24 2.06 Intr + 53610 53720 111 2 0 113 110 69 0.971 12.68 2.07 Intr + 60126 60273 148 2 1 88 80 228 0.346 22.32 2.08 Intr + 62530 62632 103 2 1 94 63 141 0.999 11.93 2.09 Intr + 63484 63643 160 1 1 92 80 314 0.999 31.50 2.10 Intr + 63718 63831 114 0 0 93 105 152 0.997 18.55 2.11 Intr + 66106 66324 219 0 0 63 53 187 0.764 11.73 2.12 Intr + 66799 66999 201 0 0 64 16 96 0.354 0.00 2.13 Term + 67057 67218 162 0 0 138 44 337 0.928 32.55 2.14 PlyA + 68947 68952 6 1.05 3.04 PlyA - 71128 71123 6 1.05 3.03 Term - 78527 78413 115 1 1 73 47 74 0.281 0.15 3.02 Intr - 79456 79295 162 0 0 127 94 -31 0.290 1.01 3.01 Init - 85219 85068 152 1 2 114 66 95 0.729 9.38 3.00 Prom - 94881 94842 40 -2.11 4.00 Prom + 95000 95039 40 -3.11 4.01 Sngl + 100001 100585 585 1 0 116 48 1040 0.991 99.37 4.02 PlyA + 101094 101099 6 -0.45 5.00 Prom + 101666 101705 40 -0.51 5.01 Init + 102424 102619 196 0 1 72 82 301 0.587 24.97 5.02 Intr + 104483 104613 131 0 2 113 86 130 0.904 16.22 5.03 Intr + 107343 107444 102 2 0 94 85 153 0.968 16.47 5.04 Intr + 107919 108065 147 2 0 98 75 461 0.957 46.74 5.05 Intr + 109582 109736 155 0 2 96 56 217 0.958 18.68 5.06 Intr + 109874 109956 83 2 2 69 82 99 0.997 7.18 5.07 Intr + 110119 110290 172 2 1 122 64 225 0.754 23.02 5.08 Intr + 110656 110777 122 1 2 109 94 96 0.996 12.94 5.09 Intr + 111023 111144 122 0 2 93 3 231 0.002 15.82 5.10 Intr + 112234 112485 252 0 0 51 23 128 0.000 0.76 5.11 Intr + 112979 113127 149 1 2 16 49 98 0.000 -1.76 5.12 Intr + 117844 118221 378 1 0 95 85 595 0.039 54.54 5.13 Term + 119179 119926 748 1 1 75 40 1124 0.999 99.67 5.14 PlyA + 120283 120288 6 1.05 6.09 PlyA - 120465 120460 6 1.05 6.08 Term - 126473 126399 75 2 0 97 43 65 0.156 1.04 6.07 Intr - 127200 127093 108 0 0 63 99 90 0.983 8.58 6.06 Intr - 127565 127414 152 0 2 118 109 178 0.982 23.29 6.05 Intr - 127662 127615 48 1 0 96 87 80 0.987 7.84 6.04 Intr - 128211 128140 72 1 0 91 94 29 0.795 3.67 6.03 Intr - 133054 132956 99 2 0 58 97 126 0.995 11.08 6.02 Intr - 134690 134643 48 0 0 91 109 78 0.998 9.34 6.01 Init - 138694 138367 328 1 1 80 101 522 0.995 50.19 6.00 Prom - 141290 141251 40 -5.11 7.00 Prom + 143637 143676 40 -7.89 7.01 Init + 143906 143938 33 1 0 87 94 41 0.936 4.41 7.02 Intr + 144471 144519 49 2 1 135 94 118 0.995 15.84 7.03 Intr + 145736 145946 211 0 1 60 80 325 0.813 27.40 7.04 Intr + 150021 150090 70 0 1 51 52 53 0.345 -2.32 7.05 Intr + 157425 157494 70 2 1 122 64 72 0.967 7.45 7.06 Intr + 157728 157801 74 1 2 92 70 76 0.713 5.82 7.07 Intr + 164632 164803 172 0 1 86 105 221 0.870 23.63 7.08 Intr + 168826 168980 155 2 2 123 59 176 0.980 18.50 7.09 Intr + 170297 170467 171 1 0 87 91 275 0.984 28.35 7.10 Intr + 172130 172627 498 1 0 86 91 814 0.963 75.27 7.11 Intr + 181221 181301 81 2 0 132 78 130 0.995 16.73 7.12 Term + 182882 182920 39 1 0 126 45 37 0.967 0.68 7.13 PlyA + 184563 184568 6 1.05 8.00 Prom + 197169 197208 40 -1.11 8.01 Init + 200493 200553 61 2 1 84 31 163 0.669 9.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 35215 35071 145 1 1 96 91 114 0.964 12.87 S.002 Init + 89681 89729 49 1 1 86 58 39 0.880 -0.25 S.003 Term + 90329 90450 122 0 2 108 49 81 0.962 5.14 S.004 Term + 111023 111174 152 0 2 93 49 246 0.998 19.58 S.005 Init + 117863 118221 359 1 2 90 85 584 0.959 54.90 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:37705545_37906126|GENSCAN_predicted_peptide_1|2051_aa MEGRVVRVNQVTRQEKVHVLIGARDDWGFEMEANGHISELIEQARLQDLGPPFPRGENNA PKGEVIGLRSPAGNRRSQDGARGPSEARASADPPDRKSELGTDRGAHSSFHCLGGCCHHS PESGPGRAAVWGPEPEPPGDEGADSRQPPPPPEPAAQELRSPSGAEVPYCDLPRCPPAPE DPLSASTSGCQSVVDPGLRPGPKRGPSPSAGLPEEGPTAAPRSRSRELEAVPYLEGLTTS LCGSCNEDPGSDPTSSPDSATPDDTSNSSSVDWDTVERQEEEAPSWDELAVMIPRRPREG PRADSSQRAPSLLTRSPVGGDAAGQKKEGVKLQTFVVSVTAHKGSVDPKSEQQQDLLQRA KEQSFHSMEEDPSGPWVVDGTGRCGAGAALIGEARAAQEPTEAGGSSGMAGCRSRALPGG KAAKARREIQRSAGAKPLIARGRQGQPAAPSAGPTKPTPTRNSSWPTSAARSLGSRSRLS LHTSVQAEGASSGLGQPRKGLPQCSGGLKGSSSAAKVGAQAEEAPRVSEGCEGRQHAVTS QYHPSDSASITERKAPLGWGEREDTTFTKRLLSIPRRENPRTPCVQQDDPRASSPNRTTQ RENSRTSCAQRDNPKASRTSSPNRATRDNPRTSCAQRDNPRASSPSRATRDNPTTSCAQR DNPRASRTSSPNRATRDNPRTSCAQRDNPRASSPSRATRDNPTTSCAQRDNPRASRTSSP NRATRDNPRTSCAQRDNPRASSPNRAARDNPTTSCAQRDNPRASRTSSPNRATRDNPRTS CAQRDNPRASSPNRATRDNPTTSCAQRDNPRASRTSSPNRATRDNPRTSCAQRDNPRASS PNRTTQQDSPRTSCARRDDPRASSPNRTIQQENPRTSCALRDNPRASSPSRTIQQENPRT SCAQRDDPRASSPNRTTQQENPRTSCARRDNPRASSRNRTIQRDNPRTSCAQRDNPRASS PNRTIQQENLRTSCTRQDNPRTSSPNRATRDNPRTSCAQRDNLRASSPIRATQQDNPRTC IQQNIPRSSSTQQDNPKTSCTKRDNLRPTCTQRDRTQSFSFQRDNPGTSSSQCCTQKENL RPSSPHRSTQWNNPRNSSPHRTNKDIPWASFPLRPTQSDGPRTSSPSRSKQSEVPWASIA LRPTQGDRPQTSSPSRPAQHDPPQSSFGPTQYNLPSRATSSSHNPGHQSTSRTSSPVYPA AYGAPLTSPEPSQPPCAVCIGHRDAPRASSPPRYLQHDPFPFFPEPRAPESEPPHHEPPY IPPAVCIGHRDAPRASSPPRHTQFDPFPFLPDTSDAEHQCQSPQHEPLQLPAPVCIGYRD APRASSPPRQAPEPSLLFQDLPRASTESLVPSMDSLHECPHIPTPVCIGHRDAPSFSSPP RQAPEPSLFFQDPPGTSMESLAPSTDSLHGSPVLIPQVCIGHRDAPRASSPPRHPPSDLA FLAPSPSPGSSGGSRGSAPPGETRHNLEREEYTVLADLPPPRRLAQRQPGPQAQCSSGGR THSPGRAEVERLFGQERREEQPTGSRLGSCFIEAPIPQFGGKFTLRPSALTEKSEAAGAF QAQDEGRSQQPSQGQSQLLRRQSSPAPSRQVTMLPAKQAELTRRSQAEPPHPWSPEKRPE GDRQLQGSPLPPRTSARTPERELRTQRPLESGQAGPRQPLGVWQSQEEPPGSQGPHRHLE RSWSSQEGGLGPGGWWGCGEPSLGAAKAPEGAWGGTSREYKESWGQPEAWEEKPTHELPR ELGKRSPLTSPPENWGGPAESSQSWHSGTPTAVGWGAEGACPYPRGSERRPELDWRDLLG LLRAPGEGVWARVPSLDWEGLLELLQARLPRKDPAGHRDDLARALGPELGPPGTNDVPEQ ESHSQPEGWAEATPVNGHSPALQSQSPVQLPSPACTSTQWPKIKVTRGPATATLAGLEQT GPLGSRSTAKGPSLPELQADKRPAEGKAGSPLKGRLVTSWRMPGDRPTLFNPFLLSLGVL SCLSWGQHVLSKGKAAGSSVWGAWKMDSTSLLVAAAFLREVSSCDYRALSGGDGLARRQQ CGQKRVVMFEK >gi568815576f:37705545_37906126|GENSCAN_predicted_CDS_1|6156_bp atggagggcagggtggtgagagtgaatcaggtgacccgccaggaaaaggtgcacgtactc atcggtgccagagatgattggggctttgagatggaggcaaatggacacatttccgaacta attgagcaggcacgtctacaggatttggggcctccatttccgagaggagaaaacaatgct ccgaaaggggaagtcatcggcctgaggtcaccagccgggaaccggaggagccaggatggg gcccgaggtccatctgaggccagagccagtgctgacccgccagacaggaaatcagagctg gggactgaccgtggggcccactcaagtttccactgcctcggcggctgctgccaccacagc ccggagtcggggcctgggagggcagcagtgtgggggcctgagccggagccccccggggac gagggtgctgacagtcgacagccaccaccaccaccagagcccgcagcccaggagctcagg agcccttcaggtgctgaggtgccctactgcgacctgcctcgatgtccacctgcccctgag gacccactcagcgcctcaacctccggctgccagtctgtggtggacccaggcctcaggcca gggcccaagaggggcccatccccctcagcagggctcccagaagagggtcccacagctgcc cccaggagcaggagccgggagcttgaggcagtaccctatctggagggcctgaccacttcc ttgtgtggcagctgcaacgaggaccccggctctgaccccacctccagccctgactccgcc acccctgatgataccagcaactcgtcctctgtggactgggacactgttgagaggcaggag gaggaggcccccagctgggacgagctcgcagtgatgatcccgaggaggcctcgggagggg ccgagagctgacagctcccaaagggctccgtctctcctcaccaggtcccctgtgggagga gatgctgcaggccagaaaaaggagggagtgaagctgcagaccttcgtggtgagcgttaca gctcataaaggcagtgtggacccaaagagtgagcagcagcaagatttattgcaaagagcg aaagaacaaagcttccacagcatggaagaggacccgagcggcccttgggtggttgatggg actgggcgctgtggagcaggggcggcgctcatcggggaggctcgggctgcacaggagccc acggaggcggggggaagctcaggcatggcgggctgcaggtcccgagccctgcccggtggg aaggcagctaaggcccggcgagaaatccagcgcagcgctggtgctaagcccctcattgcc cggggccggcagggccagccggccgctccgagtgcggggcccaccaagcccacgcccacc cggaactccagctggcccacaagcgccgcgcgcagcctcggttcccgctcgcgcctctcc ctccacacctccgtgcaagctgagggagccagctccggcctcggccagcccaggaagggg ctcccacagtgcagcggtggcctgaagggctcctcaagtgccgccaaagtgggagcccag gcagaggaggcgccgagagtgagcgagggctgtgagggccgccagcatgctgtcacctct cagtatcacccctcagattctgcatcgattactgagagaaaggcacccttaggctgggga gagcgggaagataccactttcaccaagaggctgctcagcatccccagacgggaaaacccc aggacaccctgtgtccagcaggacgatcccagagcctcctctcccaacagaaccactcaa cgagagaattccagaacatcctgtgcccagcgggacaatcccaaagcctccagaacctcc tctcccaatagagccacacgagacaaccccagaacatcctgcgcccagcgggacaatccc agagcctcctctcccagtagagctacacgagacaaccccacaacatcctgtgcccagcgg gacaatcccagagcctccagaacctcctctcccaatagagccacacgagacaaccccaga acatcctgtgcccagcgggacaatcccagagcctcctctcccagtagagctacacgagac aaccccacaacatcctgtgcccagcgggacaatcccagagcctccagaacctcctctccc aatagagccacacgagacaaccccagaacatcctgcgcccagcgggacaatcccagagcc tcctctcccaatagagctgcacgagacaaccccacaacatcctgtgcccagcgggacaat cccagagcctccagaacctcctctcccaatagagccacacgagacaaccccagaacatcc tgtgcccagcgggacaatcccagagcctcctctcccaatagagctacacgagacaacccc acaacatcctgtgcccagcgggacaatcccagagcctccagaacctcctctcccaataga gccacacgagataaccccagaacatcctgtgcccagcgggacaatcccagagcctcctct cccaacagaaccacccaacaagacagccccagaacatcctgtgcccgacgggacgatccc agagcctcctctcctaacagaaccatccaacaagagaaccccagaacatcctgtgcccta cgggacaatcccagagcctcctctcccagcagaaccatccaacaagagaaccccagaaca tcctgtgcccaacgggacgatcccagagcctcctctcctaacagaaccacccaacaagag aaccccagaacatcctgtgcccgacgggacaatcccagagcctcctctcgcaacagaacc atccagcgagacaaccccagaacatcctgtgcccagcgggacaatcccagagcctcctct cctaacagaaccatccaacaagagaacctcagaacatcctgtacccgacaggacaatccc aggacctcctctcccaatagagccacacgagacaaccccagaacatcctgtgcccagcgg gacaatctcagagcctcctctcccatcagagccacccaacaggacaaccccagaacttgt attcaacagaacatccccagatcatcttctacccaacaagacaaccctaaaacctcttgt accaaacgagataacctcagacccacttgtacacagcgggaccgcacacagtccttttcc tttcaacgagacaaccctggaacctcctcatctcaatgctgcacccaaaaggagaatctg agaccatcatctccccaccgctccactcaatggaacaatcccaggaattcatctccccat cgtactaacaaagacatcccctgggcctcgtttcccctccggccaactcagagtgatggt ccccgaacctcttccccatctcgctccaagcaaagcgaggttccctgggcatccatcgcc ctccggccaacccaaggtgacaggcctcagacatcctctcccagcaggccagcccagcat gacccaccccagtcctcctttggccccacccagtacaacttgccatcccgggccacctct tcctcccataacccaggccaccagagcacctcccgaacttcctcacctgtgtaccccgct gcctatggggctcccctgacctctcctgagccctcccagcctccatgtgctgtgtgcatt gggcaccgggatgcccctcgagcctcttcgccccctcgctatttgcagcacgaccccttc cccttcttcccagagccccgcgcccctgagagtgaaccgccccaccacgagcctccctat ataccacctgctgtgtgcattggacaccgagatgccccccgggcgtcctcgcccccccgc cacacccaatttgaccccttccccttcctcccagacacatcagatgccgagcatcagtgt cagtccccccaacacgagccccttcagctccctgcacctgtgtgtattgggtaccgagat gcaccccgggcctcctccccaccacgccaggccccagagccttccctcttattccaggac ctccccagggccagcacagagagccttgtcccttccatggactctctgcacgagtgcccc cacatccccacccctgtgtgcattgggcaccgggatgcaccctccttctcatccccacca cgccaggctcctgagccatccctcttcttccaggatccccctggaactagtatggagagc ctggccccctccactgactctctgcatggctccccagtgctgatcccccaagtgtgcatc gggcaccgggatgcaccccgagcctcctccccaccccgccacccacccagtgacctagcg ttcctggcaccctcaccttcaccgggcagctctgggggctcccggggctcagcgcctccc ggggagaccaggcacaacttggagcgggaggagtacactgtgctggccgacctgccccca cccaggaggctggcccagagacagccagggccccaggcgcagtgcagcagcgggggccgc acccacagccctggccgtgcagaggtggagcgcctcttcgggcaagagcgcagggaggaa cagcccactgggtcacgtctgggctcttgcttcatcgaagctcctataccccagttcggg ggaaaattcactctaaggccttcagctctcacagagaagtccgaggcagcgggggccttc caggcccaggacgagggacggtcacagcagcccagccaaggccagagccaacttctccga agacagtccagccctgcccccagcaggcaggtgaccatgctccctgccaaacaggcagaa ctgacccggcggagccaagcagagccccctcatccttggagtcctgagaagagacctgag ggagatcggcagctccaggggtccccgctgccccccaggacatcagccaggacccctgag agggagctgcggacacagagacctctggagagtggccaagcaggcccaagacagcctctg ggggtgtggcagagtcaggaggaaccgccagggtcccagggccctcatagacacctagaa aggagctggagcagccaggagggaggcctgggccctgggggctggtggggatgtggagag cccagcctgggggcagccaaagccccggagggagcatgggggggcacttccagggagtac aaggagagctgggggcagccagaggcctgggaggagaagcccactcatgagctccccaga gaactaggaaagagaagcccactcacgagcccccctgagaactggggaggccccgcagag tcctcacaatcctggcactctgggacacccactgctgtgggctggggggcagagggagcg tgtccatacccgcgtggctctgagaggcgacccgagcttgactggagggatctgcttggc cttctccgggcaccaggagagggggtctgggcccgtgtccccagcctggactgggagggc ctcttggagctcctgcaggccaggctgccccgcaaggacccagctggacacagggatgac ctggccagggctttagggccagagctgggtcccccaggcacaaacgatgtccctgagcag gagtcacacagccagccagaaggctgggccgaggccaccccagtcaatggacacagcccc gcactgcagtcccagagcccggtccagctgcccagccctgcctgcacctccacccagtgg ccaaagatcaaagtgacaagaggaccagcgaccgcaactctggcaggcctggagcagacg ggccccctggggagcaggagcactgcgaagggccccagcttgccagagctgcaggcagac aagaggccagcagagggcaaggctgggagcccgctcaagggccgactggtgacctcatgg cggatgcccggggaccggcccacgctgttcaatccgttcctgctgtctctgggggtcctc agttgcctgtcctgggggcagcacgtgctgagcaagggtaaggctgccggaagcagcgtg tggggtgcttggaagatggacagcacatccctgctggtggcagcagccttcctgagggag gtgtcctcctgtgattatagggccttgtcaggtggagatggactagcgaggagacagcag tgtggacagaaacgggtggtcatgtttgagaagtag >gi568815576f:37705545_37906126|GENSCAN_predicted_peptide_2|686_aa MLQLVAPRPRGCAPLGGTQKPDLLNFKKGWMSILDEPGEADELDGEIDLRSCTDVTEYAV QRNYGFQIHTKDAVYTLSAMTSGIRRNWIEALRKTVRPTSAPDVTKLSDSNKENALHSYS TQKGPLKAGEQRAGSEVISRGGPRKADGQRQALDYVELSPLTQASPQRARTPARTPDRLA KQEELERDLAQRSEERRKWFEATDSRTPEVPAGEGPRRGLGAPLTEDQQNRLSEEIEKKW QELEKLPLRENKRVPLTALLNQSRGERRGPPSDGHEALEKEVQALRAQLEAWRLQGEAPQ SALRSQEDGHIPPGYISQEACERSLAEMESSHQQVMEELQRHHERELQRLQQEKEWLLAE ETAATASAIEAMKKAYQEELSRELSKTRSLQQGPDGLRKQHQSDVEALKRELQVLSEQYS QKCLEIGALMRQAEEREHTLRRCQQEGQELLRHNQELHGRLSEEIDQLRGFIASQGMGNG CGRSNERSSCELEVLLRVKENELQYLKKEVQCLRDELQMMQKVGPSAGLGAVGDSGAIWM PSCEHLLCAKPSTRFILPQALLCFSHPLTALRSWSKSSPPKQDEDDNDACVPGWYQGGGR GKPRTHGHLGEPPGGMKRGICGEVWLLVLPMCPDKRFTSGKYQDVYVELSHIKTRSEREI EQLKEHLRLAMAALQEKESMRNSLAE >gi568815576f:37705545_37906126|GENSCAN_predicted_CDS_2|2061_bp atgctgcagctggtagcccccagaccccggggctgtgcccccctgggcggcacccagaag cccgatctgctcaacttcaagaagggatggatgtcgatcttggacgagcctggagaggca gatgagctggatggtgagatcgacctgcgttcctgcacggatgtcactgagtacgcggtg cagcgcaactatggcttccagatccacaccaaggatgctgtctataccttgtcggccatg acctcaggcatccggcggaactggatcgaggctctgagaaagaccgtacgtccaacttca gccccagatgtcaccaagctctcggactctaacaaggagaacgcgctgcacagctacagc acccagaagggccccctgaaggcaggggagcagcgggcgggctctgaggtcatcagccgg ggtggccctcggaaggcggacgggcagcgtcaggccttggactacgtggagctctcgccg ctgacccaggcttccccgcagcgggcccgcaccccagcccgcactcctgaccgcctggcc aagcaggaggagctggagcgggacctggcccagcgctccgaggagcggcgcaagtggttt gaggccacagacagcaggaccccagaggtgcctgctggtgaggggccgcgccggggcctg ggtgcccccctgactgaggaccagcaaaaccggcttagtgaggagatcgagaagaagtgg caggagctggagaagctgcccctgcgggagaataagcgggtgcccctcactgccctgctc aaccaaagccgcggagagcgccgagggcccccaagtgacggccacgaggcactggagaag gaggttcaggctcttcgggcccagctggaggcgtggcgtctccaaggggaggctcctcag agtgcactgagatcccaggaggatggccacatccccccgggctacatctcacaggaggca tgtgagcgcagcctggcagagatggagtcctcgcaccagcaggtgatggaggagctgcag cggcaccacgagcgggagctgcagcgcctgcagcaggagaaggagtggctcctggctgag gagacggcagccacggcctcagccattgaagccatgaagaaggcctaccaggaagagctg agccgagagctgagcaaaacacggagtctccagcagggcccggatggcctccggaagcag caccagtcagatgtggaggcactgaagcgagagctgcaggtgctatcggagcagtactcg cagaagtgcctggagattggggcactcatgcggcaggctgaggagcgcgagcacacgctg cgccgctgccagcaggagggccaggagctgctgcgccacaaccaggagctgcatggccgc ctgtcagaggagatagaccagctgcgcggcttcattgcctcgcagggcatgggcaatggc tgcgggcgcagcaacgagcggagttcctgcgagctagaggtgctgcttcgcgtaaaagaa aacgaactccagtacctaaagaaggaggtgcagtgcctccgggacgagctccagatgatg cagaaggtaggtccttccgctgggctgggggccgtcggggactctggagccatctggatg ccatcctgtgagcacctgctctgtgccaagccctccactcgcttcatccttcctcaggct ctcctgtgcttctcccatccactcactgccctgcggtcttggtcaaaatcttctcccccg aaacaggatgaggatgacaatgacgcctgtgtccctgggtggtaccagggaggtgggagg ggtaagcccagaacccacggccatcttggggagccacctggagggatgaagcgaggtatc tgcggggaggtctggctgctggtgctgcccatgtgcccggacaagcgcttcacctcggga aagtaccaggacgtctatgtggagctgagccacatcaagacacggtctgagcgggagatc gagcagctgaaggagcacctgcgtcttgccatggccgccctccaggagaaggagtcgatg cgcaacagcctggctgagtag >gi568815576f:37705545_37906126|GENSCAN_predicted_peptide_3|142_aa MPGWENIFSSMNHTLNALTGAVTSTRDNRDCSLVRAMPNWGPNAAGISPCRVSAGTGRVP SCLCHSGAWNRLDQDINRNNSCYSDSVGLATLRASLNPISSSHLRHTWPVQPAVLLLSED TATWEQLECGSGDATVEVTASR >gi568815576f:37705545_37906126|GENSCAN_predicted_CDS_3|429_bp atgcccggctgggaaaatattttctcctccatgaaccatactctcaatgcactaactggt gctgtgacaagcaccagggacaacagggactgctctctggtgagggccatgcccaactgg ggcccaaatgcagctggcatcagtccctgcagggtgagcgcagggacagggcgtgtccct tcatgtctgtgtcactcaggtgcctggaacaggcttgaccaggatattaacaggaataac agctgctactcagactcagtgggcctggccacgctcagggcctcacttaatcccatttct tcatctcatctcagacacacttggccagtgcagccagctgtcctgctcttgagtgaggac acagccacgtgggagcagctggaatgtggctctggagatgccacagttgaggtcacagct agtagatga >gi568815576f:37705545_37906126|GENSCAN_predicted_peptide_4|194_aa MTENSTSAPAAKPKRAKASKKSTDHPKYSDMIVAAIQAEKNRAGSSRQSIQKYIKSHYKV GENADSQIKLSIKRLVTTGVLKQTKGVGASGSFRLAKSDEPKKSVAFKKTKKEIKKVATP KKASKPKKAASKAPTKKPKATPVKKAKKKLAATPKKAKKPKTVKAKPVKASKPKKAKPVK PKAKSSAKRAGKKK >gi568815576f:37705545_37906126|GENSCAN_predicted_CDS_4|585_bp atgaccgagaattccacgtccgcccctgcggccaagcccaagcgggccaaggcctccaag aagtccacagaccaccccaagtattcagacatgatcgtggctgccatccaggccgagaag aaccgcgctggctcctcgcgccagtccattcagaagtatatcaagagccactacaaggtg ggtgagaacgctgactcgcagatcaagttgtccatcaagcgcctggtcaccaccggtgtc ctcaagcagaccaaaggggtgggggcctcggggtccttccggctagccaagagcgacgaa cccaagaagtcagtggccttcaagaagaccaagaaggaaatcaagaaggtagccacgcca aagaaggcatccaagcccaagaaggctgcctccaaagccccaaccaagaaacccaaagcc accccggtcaagaaggccaagaagaagctggctgccacgcccaagaaagccaaaaaaccc aagactgtcaaagccaagccggtcaaggcatccaagcccaaaaaggccaaaccagtgaaa cccaaagcaaagtccagtgccaagagggccggcaagaagaagtga >gi568815576f:37705545_37906126|GENSCAN_predicted_peptide_5|918_aa MWPGNAWRAALFWVPRGRRAQSALAQLRGILEGELEGIRGAGTWKSERVITSRQGPHIRV DGVSGGILNFCANNYLGLSSHPEVIQAGLQALEEFGAGLSSVRFICGTQSIHKNLEAKIA RFHQREDAILYPSCYDANAGLFEALLTPEDAVLSDELNHASIIDGIRLCKAHKYRYRHLD MADLEAKLQEAQKHRLRLVATDGAFSMDGDIAPLQEICCLASRYGALVFMDECHATGFLG PTGRGTDELLGVMDQVTIINSTLGKALGGASGGYTTGPGPLVSLLRQRARPYLFSNSLPP AVVGCASKALDLLMGSNTIVQSMAAKTQRFRSKMEAAGFTISGASHPICPVMLGDARLAS RMADDMLKRGIFVIGFSYPVVPKGKARIRVQISAVHSEEDIDRCVEAFVEAYDDQPPGGA ASTGRGRRLAPPLQTGGLGPRLFRLPSSQGQERRFAAAKAPSSLVPHNGSPLSCWRGLRE EGGLSLATLWTPCGPAAHVKRHGQLRGDVWQSVCCTRRGQNRGPRSTQTDVRRHEARSRQ RRRKCPSDGEMADAQNISLDSPGSVGAVAVPVVFALIFLLGTVGNGLVLAVLLQPGPSAW QEPGSTTDLFILNLAVADLCFILCCVPFQATIYTLDAWLFGALVCKAVHLLIYLTMYASS FTLAAVSVDRYLAVRHPLRSRALRTPRNARAAVGLVWLLAALFSAPYLSYYGTVRYGALE LCVPAWEDARRRALDVATFAAGYLLPVAVVSLAYGRTLRFLWAAVGPAGAAAAEARRRAT GRAGRAMLAVAALYALCWGPHHALILCFWYGRFAFSPATYACRLASHCLAYANSCLNPLV YALASRHFRARFRRLWPCGRRRRHRARRALRRVRPASSGPPGCPGDARPSGRLLAGGGQG PEPREGPVHGGEAARGPE >gi568815576f:37705545_37906126|GENSCAN_predicted_CDS_5|2757_bp atgtggcctgggaacgcctggcgcgccgcactcttctgggtgccccgcggccgccgcgca cagtcagcgctggcccagctgcgtggcattctggagggggagctggaaggcatccgcgga gctggcacttggaagagtgagcgggtcatcacgtcccgtcaggggccgcacatccgcgtg gacggcgtctccggaggaatccttaacttctgtgccaacaactacctgggcctgagcagc caccctgaggtgatccaggcaggtctgcaggctctggaggagtttggagctggcctcagc tctgtccgctttatctgtggaacccagagcatccacaagaatctagaagcaaaaatagcc cgcttccaccagcgggaggatgccatcctctatcccagctgttatgacgccaacgccggc ctctttgaggccctgctgaccccagaggacgcagtcctgtcggacgagctgaaccatgcc tccatcatcgacggcatccggctgtgcaaggcccacaagtaccgctatcgccacctggac atggccgacctagaagccaagctgcaggaggcccagaagcatcggctgcgcctggtggcc actgatggggccttttccatggatggcgacatcgcacccctgcaggagatctgctgcctc gcctctagatatggtgccctggtcttcatggatgaatgccatgccactggcttcctgggg cccacaggacggggcacagatgagctgctgggtgtgatggaccaggtcaccatcatcaac tccaccctggggaaggccctgggtggagcatcagggggctacacgacagggcctgggccc ctggtgtccctgctgcggcagcgcgcccggccatacctcttctccaacagtctgccacct gctgtcgttggctgcgcctccaaggccctagatctgctgatggggagtaacaccattgtc cagtctatggctgccaagacccagaggttccgtagtaagatggaagctgctggcttcact atctcgggagccagtcaccccatctgccctgtgatgctgggtgatgcccggctggcctct cgcatggcggatgacatgctgaagagaggcatctttgtcatcgggttcagctaccccgtg gtccccaagggcaaggcccggatccgggtacagatctcagcagtgcatagcgaggaagac attgaccgctgcgtggaggccttcgtggaagcctacgatgatcagccaccagggggtgct gcgagcacgggccgcggccgccggctcgccccgcccctccagactgggggccttgggccg cggctgttcaggctgcccagcagtcaaggccaggagaggcgatttgctgctgccaaggcc ccatcctcccttgtgccccacaacggctcgccgctttcctgttggaggggcctgcgggag gagggcggtctctccctggcgaccttgtggaccccttgtgggccagcagctcatgttaag cgccacgggcagctgcggggtgacgtgtggcagtcggtgtgctgcacccggcggggccag aacagagggcccaggtccacccagaccgacgtgaggcggcacgaggcgagatccagacag cggcgcagaaagtgcccgtctgatggggagatggctgatgcccagaacatttcactggac agcccagggagtgtgggggccgtggcagtgcctgtggtctttgccctaatcttcctgctg ggcacagtgggcaatgggctggtgctggcagtgctcctgcagcctggcccgagtgcctgg caggagcctggcagcaccacggacctgttcatcctcaacctggcggtggctgacctctgc ttcatcctgtgctgcgtgcccttccaggccaccatctacacgctggatgcctggctcttt ggggccctcgtctgcaaggccgtgcacctgctcatctacctcaccatgtacgccagcagc tttacgctggctgctgtctccgtggacaggtacctggccgtgcggcacccgctgcgctcg cgcgccctgcgcacgccgcgtaacgcccgcgccgcagtggggctggtgtggctgctggcg gcgctcttctcggcgccctacctcagctactacggcaccgtgcgctacggcgcgctggag ctctgcgtgcccgcctgggaggacgcgcgccgccgcgccctggacgtggccaccttcgct gccggctacctgctgcccgtggctgtggtgagcctggcctacgggcgcacgctgcgcttc ctgtgggccgccgtgggtcccgcgggcgcggcggcggccgaggcgcggcggagggcgacg ggccgcgcggggcgcgccatgctggcggtggccgcgctctacgcgctctgctggggtccg caccacgcgctcatcctgtgcttctggtacggccgcttcgccttcagcccggccacctac gcctgccgcctggcctcacactgcctggcctacgccaactcctgcctcaacccgctcgtc tacgcgctcgcctcgcgccacttccgcgcgcgcttccgccgcctgtggccgtgcggccgc cgacgccgccaccgtgcccgccgcgccttgcgtcgcgtccgccccgcgtcctcgggccca cccggctgccccggagacgcccggcctagcgggaggctgctggctggtggcggccagggc ccggagcccagggagggacccgtccacggcggagaggctgcccgaggaccggaataa >gi568815576f:37705545_37906126|GENSCAN_predicted_peptide_6|309_aa MAAAAGDADDEPRSGHSSSEGECAVAPEPLTDAEGLFSFADFGSALGGGGAGLSGRASGG AQSPLRYLHVLWQQDAEPRDELRCKIPAGRLRRAARPHRRLGPTGKEVHALKRLRDSANA NDVETVQQLLEDGADPCAADDKGRTALHFASCNGNDQIVQLLLDHGADPNQRDGLGNTPL HLAACTNHVPVITTLLRGECSPQLVPAGARVDALDRAGRTPLHLAKSKLNILQEGHAQCL EAVRLEVKQIIHMLREYLERLGQHEQRERLDDLCTRLQMTSTKEQVDEVTDLLASFTSLS LQMQSMEKR >gi568815576f:37705545_37906126|GENSCAN_predicted_CDS_6|930_bp atggcagccgccgccggggacgcggacgacgagccgcgctcaggccactcgagctcggag ggcgagtgcgcggtggcgccggagccgctgactgacgctgagggcctcttctccttcgct gacttcgggtctgcgctgggcggcggcggcgcgggcctctcgggccgggcgtccggcggg gcccagtcgccgctgcgctacttgcacgtcctgtggcagcaggatgcggagccgcgcgac gagctgcgctgcaagatacccgctggccggctgaggcgcgctgccaggccccaccggcgg ctcgggcccacgggcaaggaggtgcacgctctgaagagactgagggactcggccaatgcc aatgatgtggaaacagtgcagcagctgctggaagatggcgcggatccctgtgcagctgat gacaagggccgcacagctctacactttgcctcatgcaatggcaatgaccagattgtgcag ctgctcctggaccatggtgctgatcctaaccagcgagatgggctggggaacacgccactg cacctggcggcctgcaccaaccacgttcctgtcatcaccacactgctacgaggagaatgc tcacctcagctggtacctgcaggggcccgtgtagatgccctggaccgagctggtcgcaca cccctgcacctggccaagtcaaagctgaatatcctgcaggagggccatgcccagtgccta gaggctgtgcgtctggaggtgaagcagatcatccatatgctgagggagtatctggagcgc ctagggcaacatgagcagcgagaacgcctggatgacctctgcacccgcctgcagatgacc agtaccaaagagcaggtggatgaagtgactgacctcctggccagcttcacctccctcagt ctgcagatgcagagcatggagaagaggtag >gi568815576f:37705545_37906126|GENSCAN_predicted_peptide_7|540_aa MSYPADDYESEAAYDPYAYPSDYDMHTGDPKQDLAYERQYEQQTYQVIPEVIKNFIQYFH KTVSDLIDQKVYELQASRVSSDVIDQKVYEIQDIYENSWTKLTERFFKNTPWPEAEAIAP QGGPSLEQRFESYYNYCNLFNYILNADGPAPLELPNQWLWDIIDEFIYQFQSFSQYRCKT AKKSEEEIDFLRSNPKIWNVHSVLNVLHSLVDKSNINRQLEVYTSGGDPESVAGEYGRHS LYKMLGYFSLVGLLRLHSLLGDYYQAIKVLENIELNKKSMYSRVPECQVTTYYYVGFAYL MMRRYQDAIRVFANILLYIQRTKSMFQRTTYKYEMINKQNEQMHALLAIALTMYPMRIDE SIHLQLREKYGDKMLRMQKGDPQVYEELFSYSCPKFLSPVVPNYDNVHPNYHKEPFLQQL KVFSDEVQQQAQLSTIRSFLKLYTTMPVAKLAGFLDLTEQEFRIQLLVFKHKMKNLVWTS GISALDGEFQSASEVDFYIDKDMIHIADTKVARRYGDFFIRQIHKFEELNRTLKKMGQRP >gi568815576f:37705545_37906126|GENSCAN_predicted_CDS_7|1623_bp atgtcttatcccgctgatgattatgagtctgaggcggcttatgacccctacgcttatccc agcgactatgatatgcacacaggagatccaaagcaggaccttgcttatgaacgtcagtat gaacagcaaacctatcaggtgatccctgaggtgatcaaaaacttcatccagtatttccac aaaactgtctcagatttgattgaccagaaagtgtatgagctacaggccagtcgtgtctcc agtgatgtcattgaccagaaggtgtatgagatccaggacatctatgagaacagctggacc aagctgactgaaagattcttcaagaatacaccttggcccgaggctgaagccattgctcca caggggggaccttccttggagcagaggtttgaatcctattacaactactgcaatctcttc aactacattcttaatgccgatggtcctgctccccttgaactacccaaccagtggctctgg gatattatcgatgagttcatctaccagtttcagtcattcagtcagtaccgctgtaagact gccaagaagtcagaggaggagattgactttcttcgttccaatcccaaaatctggaatgtt catagtgtcctcaatgtccttcattccctggtagacaaatccaacatcaaccgacagttg gaggtatacacaagcggaggtgaccctgagagtgtggctggggagtatgggcggcactcc ctctacaaaatgcttggttacttcagcctggtcgggcttctccgcctgcactccctgtta ggagattactaccaggccatcaaggtgctggagaacatcgaactgaacaagaagagtatg tattcccgtgtgccagagtgccaggtcaccacatactattatgttgggtttgcatatttg atgatgcgtcgttaccaggatgccatccgggtcttcgccaacatcctcctctacatccag aggaccaagagcatgttccagaggaccacgtacaagtatgagatgattaacaagcagaat gagcagatgcatgcgctgctggccattgccctcacgatgtaccccatgcgtattgatgag agcattcacctccagctgcgggagaaatatggggacaagatgttgcgcatgcagaaaggt gacccacaagtctatgaagaacttttcagttactcctgccccaagttcctgtcgcctgta gtgcccaactatgataatgtgcaccccaactaccacaaagagcccttcctgcagcagctg aaggtgttttctgatgaagtacagcagcaggcccagctttcaaccatccgcagcttcctg aagctctacaccaccatgcctgtggccaagctggctggcttcctggacctcacagagcag gagttccggatccagcttcttgtcttcaaacacaagatgaagaacctcgtgtggaccagc ggtatctcagccctggatggtgaatttcagtcagcctcagaggttgacttctacattgat aaggacatgatccacatcgcggacaccaaggtcgccaggcgttatggggatttcttcatc cgtcagatccacaaatttgaggagcttaatcgaaccctgaagaagatgggacagagacct tga >gi568815576f:37705545_37906126|GENSCAN_predicted_peptide_8|21_aa MGRRPVLVGVGSALEAELLLX >gi568815576f:37705545_37906126|GENSCAN_predicted_CDS_8|63_bp atggggcgccggcccgtcttggtaggggtgggctccgcactggaggcggagctgctgctg gnn