GENSCAN 1.0	Date run: 3-Nov-116	Time: 10:41:02

Sequence gi568815583f:30804664_31037253 : 232590 bp : 41.39% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   5281   5382  102  2  0   96   71    41 0.573   2.43
 1.02 Intr +  10612  10783  172  2  1   85   88   101 0.915   7.88
 1.03 Intr +  11867  12011  145  2  1   68  110   184 0.990  17.96
 1.04 Intr +  13213  13438  226  0  1   81   60   143 0.819   7.44
 1.05 Intr +  17884  18055  172  2  1   64   96   116 0.955   8.08
 1.06 Intr +  18934  19108  175  1  1   78   83   220 0.950  19.52
 1.07 Intr +  19993  20272  280  0  1   54   69   160 0.665   6.83
 1.08 Intr +  20349  20469  121  1  1   72   -5   116 0.410  -0.67
 1.09 Intr +  23037  23191  155  0  2  130   96   101 0.757  13.89
 1.10 Intr +  27578  27742  165  0  0   99   60   155 0.811  12.81
 1.11 Intr +  28128  28235  108  1  0  111  110   154 0.841  19.24
 1.12 Intr +  29965  30423  459  2  0   92   67   172 0.129   7.73
 1.13 Intr +  30508  30626  119  2  2   96   52    43 0.951   0.76
 1.14 Intr +  31531  31656  126  0  0   49   90   135 0.985   9.76
 1.15 Intr +  32092  32174   83  0  2  109   69    65 0.969   4.22
 1.16 Term +  33130  33319  190  1  1   71   45   179 0.948   7.84
 1.17 PlyA +  35628  35633    6                               1.05

 2.04 PlyA -  36828  36823    6                               1.05
 2.03 Term -  39811  39727   85  0  1  102   38    71 0.390  -0.35
 2.02 Intr -  44272  44189   84  0  0   93   75    36 0.430   0.82
 2.01 Init -  45017  44893  125  2  2   71   91    91 0.736   7.39
 2.00 Prom -  45071  45032   40                              -5.65

 3.02 PlyA -  45613  45608    6                               1.05
 3.01 Sngl -  47087  46548  540  2  0   24   31   259 0.801   9.63
 3.00 Prom -  59665  59626   40                              -4.45

 4.04 PlyA -  61130  61125    6                               1.05
 4.03 Term -  62694  62375  320  1  2   66   49   162 0.149   4.16
 4.02 Intr -  78005  77689  317  1  2   46   49   313 0.012  17.98
 4.01 Init -  78631  78426  206  1  2   66   25   298 0.968  18.57
 4.00 Prom -  82007  81968   40                              -4.85

 5.03 PlyA -  82265  82260    6                               1.05
 5.02 Term -  84905  84811   95  0  2   44   48   142 0.540   2.81
 5.01 Init -  86347  86209  139  1  1   85   81    73 0.727   6.65
 5.00 Prom -  87747  87708   40                              -5.35

 6.00 Prom +  96633  96672   40                              -2.55
 6.01 Init +  99102  99231  130  2  1   32   64   152 0.530   7.56
 6.02 Intr +  99978 101234 1257  1  0   78   70   869 0.540  70.46
 6.03 Intr + 103455 103595  141  1  0   72   80    90 0.708   6.00
 6.04 Intr + 104977 105102  126  2  0   34   82   101 0.928   3.83
 6.05 Intr + 105951 106152  202  1  1   65    6   147 0.965   1.62
 6.06 Intr + 107781 107885  105  0  0   90   34   121 0.949   5.31
 6.07 Intr + 109243 109428  186  1  0  -14  110   156 0.000   5.48
 6.08 Intr + 113501 113632  132  2  0   45   78   121 0.186   5.64
 6.09 Intr + 115882 115990  109  1  1   71   50    47 0.024  -1.43
 6.10 Intr + 120464 120628  165  1  0   65   66   177 0.020  12.44
 6.11 Intr + 121126 121276  151  0  1   92   55   116 0.909   7.51
 6.12 Intr + 121945 122102  158  2  2   62   -1   143 0.220   1.61
 6.13 Intr + 123890 124018  129  1  0   43   41   111 0.313   1.87
 6.14 Intr + 124540 124734  195  0  0   73   95   133 0.912  11.19
 6.15 Term + 132456 132593  138  2  0   84   42   146 0.974   6.58
 6.16 PlyA + 132778 132783    6                               1.05

 7.20 PlyA - 134052 134047    6                               1.05
 7.19 Term - 134830 134570  261  1  0   45   45   272 0.364  13.14
 7.18 Intr - 137409 136912  498  0  0   24   17   347 0.328  13.46
 7.17 Intr - 138409 138227  183  1  0   75   94    34 0.611   1.76
 7.16 Intr - 142637 142467  171  2  0   64  115    85 0.545   8.02
 7.15 Intr - 143808 143639  170  1  2   84   77    37 0.731   0.94
 7.14 Intr - 147375 147305   71  2  2  138   88    -1 0.475   2.81
 7.13 Intr - 148968 148899   70  1  1   43   76    56 0.303  -2.78
 7.12 Intr - 150230 150100  131  1  2   91   46    82 0.773   3.72
 7.11 Intr - 154288 154200   89  1  2   57   76    99 0.877   3.45
 7.10 Intr - 154458 154371   88  2  1  110  100   -16 0.835   0.85
 7.09 Intr - 156410 156218  193  0  1   95  111   202 0.343  20.83
 7.08 Intr - 157775 157622  154  2  1   19   71    41 0.034  -5.78
 7.07 Intr - 169793 169651  143  0  2   93   85   139 0.988  13.25
 7.06 Intr - 170340 170268   73  0  1   59   98    31 0.233  -0.74
 7.05 Intr - 172292 172156  137  0  2   58   89    74 0.080   3.87
 7.04 Intr - 181284 181184  101  2  2   78   23    42 0.004  -4.47
 7.03 Intr - 182250 182016  235  1  1   63   50   180 0.046   7.52
 7.02 Intr - 186939 186784  156  1  0   47   97   130 0.940   8.86
 7.01 Init - 187047 186969   79  0  1   71   31   215 0.998  13.37
 7.00 Prom - 187674 187635   40                              -6.75

 8.09 PlyA - 188338 188333    6                               1.05
 8.08 Term - 198407 197159 1249  1  1   80   34  1093 0.225  93.16
 8.07 Intr - 221608 221476  133  2  1   78  102   264 0.988  25.58
 8.06 Intr - 222454 222252  203  0  2   61   27   166 0.934   5.81
 8.05 Intr - 223735 223669   67  2  1   60   70    68 0.171  -0.76
 8.04 Intr - 226494 226320  175  0  1   19   66   241 0.371  13.59
 8.03 Intr - 228277 228026  252  1  0   73   49   326 0.461  23.91
 8.02 Intr - 231011 230883  129  2  0   90   24   184 0.704  12.17
 8.01 Init - 232438 232280  159  1  0   88   78    53 0.489   2.83

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  78005  77603  403  1  1   46   33   367 0.804  20.64
S.002 Init + 120419 120628  210  1  0  110   66   168 0.877  15.73
S.003 Term - 186492 186290  203  1  2   18   36   201 0.846   4.47

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583f:30804664_31037253|GENSCAN_predicted_peptide_1|932_aa
XLPNLRLSFKLALIRDMAILPFQWNLLISFNAVLGLVFSWGDGDFGKLGRGGSEGCNIPQ
NIERLNGQGVCQIECGAQFLLALTKSGVVWTWGKGDYFRLGHRSDVHVRKPQVVEGLRGK
KIVHVAVGAQHCLAVMDSGQVYAWGDNDHGQQGNGTTTVNRKPMLVQGLEGQKIMCVACG
SSHSVAWTTMDVATPSVHEPVLFQTARDPLGASYLGVPSDVNSSAASNKISGASNSKPYH
PSLAKILLSLDGNLAKQQALSHILTALQIMYARDAVVGALMPAAMIAPVECPSAAASDAF
AMASPMNGEECMLAVDIEDRLSPNPWQEKREIVSSEDVVTPSAVTPSALSASAGPFITVT
DDPGAASIFAETMTKTEEVKRHSFFALYCEITKWQVEPQIDDPNSNLEEVINEAEAITSE
NSLGLISRWPRLFSQIQPDDSVLPGMVVRESQQPNGPGFDPTVWQWQICCWSSVSPSWRM
WPQDLQSGRLSSQPVVVESSHPYTDDTSTSGTVKIPVPSSARSSSLKENSCASARTPLTP
DVRVLHTKRFSSSLDTPEVEVLLTSNAAHSPGAEGLRVEFDRQCSSEWHHDPLTIMDGVN
RIVSVWSGPVEAWNRTQRTVPAVQLSGSHSSSPGVEGVVAGVILRLLVYSGSVSVKSKAH
RLAVACVGTPAVAFWWVGLGGATLVCWLLMGWSLACETQGCLQCSFTGPKELLSDCCVLS
CPSTDLVTCLLDFRLNLASNRSVVPCLVALLAACAQLSGLATSHRMWALQKLRKLLTTEF
GQSININRLLGENDGEARALSFTGSALAALVKGLPEALQRQFEYEDPIVRGGKQLLHSPF
FKVLVALACDLELDTLPCCAETHKWAWFRRRPDDWNLSAGGSGTIYGWGHDHRGQLGGIE
GAKVKIPSPCEALTTLRPMQLIGGEQTLCCDS

>gi568815583f:30804664_31037253|GENSCAN_predicted_CDS_1|2799_bp
ngactgcccaatttaagattaagttttaaactggcgcttattagagatatggccattttg
ccatttcagtggaaccttttaatcagctttaatgctgtcttaggtttggtattttcctgg ggtgatggtgactttggaaaattgggccggggcggaagtgaaggctgcaacattccccag aacattgagagactaaatggacagggggtgtgccagattgagtgtggagctcagttccta ctggcgctcaccaagtctggagtggtgtggacatggggaaagggggattacttcaggttg ggccaccgctctgacgtgcacgttcggaagccgcaggtggtggaagggctgagagggaag aagatcgtgcatgtggctgtcggggcccagcattgcctggcggtcatggactcagggcag gtgtatgcttggggtgacaatgaccacggccagcagggcaatggcacgaccacggttaac aggaagcccatgctcgtgcaaggcttagaaggccagaagatcatgtgtgtggcttgcggg tcgtcccacagtgtggcgtggacaactatggatgtggccacgccctctgtccacgagccc gtcctcttccagactgcaagggaccctttaggtgcttcctatttaggcgtgccttcagat gtcaattcttctgctgccagtaataaaataagtggtgcaagtaattctaagccatatcac ccttctcttgccaagattctcttgtcattggatggaaacctggccaaacagcaggcctta tcgcatattcttacagcattgcaaatcatgtatgccagagatgccgtagtgggggccctg atgccggccgccatgatcgccccggtggagtgcccctctgcggctgcttcggacgcattt gcgatggctagtcccatgaatggagaagaatgcatgctggctgttgatatcgaagacaga ctgagtccaaatccgtggcaagaaaagagagagattgtttcctctgaggacgtggtgacc ccctctgcagtgactccatcggccctctcagcctccgctgggccatttatcacagtgacg gatgacccgggagctgcaagcatctttgcagaaaccatgaccaaaactgaagaggtaaag aggcattcttttttcgctttgtattgtgaaattaccaaatggcaggtggagccgcagatc gatgaccccaacagcaaccttgaggaggtgattaatgaggcagaggccatcacctctgag aacagcctgggattgatctccaggtggccgcgtttattctcacaaattcagcccgatgac tcagtccttccaggaatggtggtacgtgaatctcaacagcctaatggaccaggctttgac cccacagtgtggcagtggcagatatgctgttggagctctgtgtcaccgagttggaggatg tggccacaagacttgcagagcggccgcctctcttctcagcctgtggtggtggagagtagc cacccttacaccgacgacacctccaccagtggcacagtgaagataccagtgccatcctct gctagatcttccagcctgaaggaaaactcttgtgcctctgctcgcacacctctcacacca gatgtgcgggttctccacaccaagcggttctccagttctctggacacgccagaagttgag gtgctcctcacctccaatgccgctcactccccaggtgcagaaggactcagggtggaattt gaccggcagtgctcctcagagtggcaccacgaccctctcacaatcatggatggcgtcaac aggatcgtctccgtgtggtcaggccccgttgaggcgtggaacagaacacagagaactgtc ccagccgtccaactatcaggctcacactcctcctcacctggtgtggaaggagttgttgct ggtgtcattcttaggcttttggtttacagtggaagtgtgagtgtgaagagcaaggctcac 
cgactcgcagtggcatgcgtgggcacaccagctgttgccttctggtgggtggggctgggt ggagccacccttgtttgttggcttcttatgggttggtccctagcttgtgaaacacagggg tgtttacagtgctcattcacaggccctaaagaactcctctctgactgctgcgtcctctct tgtccatccacggacttggtgacgtgtctgttagacttccgactcaaccttgcctctaac agaagtgtcgtcccttgccttgtggccttgctggcagcttgtgcacagctgagtggccta gccaccagtcacagaatgtgggcccttcagaaattgaggaagctgcttacaactgaattt gggcaatcaattaacataaataggctgcttggagaaaatgatggggaagcaagagctttg agttttacaggtagtgctcttgctgctttggtgaaaggtcttccagaagctttgcaaagg cagtttgaatatgaagatcctattgtgaggggtggcaaacagctgctccacagcccattc tttaaggtactggtagctcttgcttgtgacctggagctggacactctgccttgctgtgcc gagacgcacaagtgggcctggttccggaggcgaccagatgattggaatctgtctgctggt ggcagtggaacaatttatggttggggacatgatcataggggccagctcgggggcattgaa ggtgcaaaagtcaaaattccctctccctgtgaagccctcacaactctcagacccatgcag ttaatcggaggggaacagaccctctgctgtgacagctga >gi568815583f:30804664_31037253|GENSCAN_predicted_peptide_2|97_aa MGKDFMSKTPKAMATKDKIDKWDLIKLNSFCTAKETTIRVNRDVDEAGNHHSQQTIARTK KQPLHVLTHSFSITHVTQNAKRDSVDDLREKDKKTLT >gi568815583f:30804664_31037253|GENSCAN_predicted_CDS_2|294_bp atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagacaaaattgac aaatgggatctaattaaactaaacagcttctgcacagcaaaagaaactaccatcagagtg aacagggacgtggatgaagctggaaaccatcattctcagcaaactattgcaaggaccaaa aaacaaccactgcatgttctcactcacagttttagtatcacgcacgttacacaaaatgcc aaacgtgattcagtggatgatttaagggagaaggacaaaaaaacacttacttag >gi568815583f:30804664_31037253|GENSCAN_predicted_peptide_3|179_aa MALNAHKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQEITNIRAELKEIETQKTLQKI NESRSWFFEKINKIDRLLARLIKKKREKNQIDAIKNDKGDITNNPTEIQTTIREYYKHLY ANKLENLEEMDKFLDTYTLPRLNQEEVEPLNRPITGSEIEAIINSLPTKKSPGPDGFTA >gi568815583f:30804664_31037253|GENSCAN_predicted_CDS_3|540_bp atggcactaaatgcccacaagagaaagcaggaaagatctaaaattgacaccctaacatca caattaaaagaactagagaagcaagagcaaacacattcaaaagctagcagaaggcaagaa ataactaacatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaatc aatgaatccaggagctggttttttgaaaagatcaacaaaattgatagactgctagcaaga ctaataaagaagaaaagagagaagaatcaaatagacgcaataaaaaatgataaaggggat 
atcaccaacaatcccacagaaatacaaactaccatcagagaatactataaacacctctat gcaaataaactagaaaatctagaagaaatggataaattccttgacacatacaccctccca agactaaaccaggaagaagttgaacctctgaatagaccaataacaggctctgaaattgag gcaataattaatagcttaccaaccaaaaaaagtccaggaccagatggattcacagcctaa >gi568815583f:30804664_31037253|GENSCAN_predicted_peptide_4|280_aa MPAEPGGAGLRLLGPALQATDVAQGPSPLGRPRPDTIALPKRGKRLKSLLCSSRVTVEDY ANSDLAVVRKIIGNRQEPMREFNFKFKKQSARLKSKCTGGLQLPIQYEDVHTNGDQDCCL LQVTTLNFIFIPIVMGMIFTLFTVNASTDTRHHRVRLVFQDSPVHGGQKLCSEQDGSAQY GKTTKSEHLQHCRAFLRYISDKLCGREIYSVGRTLDKALGCALFLEGEKASCEIICNSWV VTNSSAGGPGTWKEYDGRLVTKTFGNDMWGQISLSGKKIM >gi568815583f:30804664_31037253|GENSCAN_predicted_CDS_4|843_bp atgcccgcggagcctggtggagctggcctgcggctcctgggacccgctctccaggccaca gacgtggcccaggggcccagcccactaggcaggcctcgcccagacacgattgccctgccc aagaggggaaagcgactcaagtcgctcctctgctccagccgagtgactgtggaggattac gccaactcggatctggcggtcgtgagaaagattattgggaacagacaggaaccaatgcgg gaattcaacttcaagttcaaaaaacagtccgctaggttaaagagcaagtgtacaggagga ttgcagcttcccatccagtacgaagatgttcataccaatggagaccaggactgctgccta ctgcaggtcaccaccctcaatttcatctttattccgattgtcatgggaatgatatttact ctgtttactgtcaatgcgagcacggacacgcggcatcatcgagtgagactggtgttccaa gattcccctgttcatggtggtcagaaactgtgcagtgaacaggatggttctgcacaatat ggaaaaaccaccaaaagtgaacatctgcagcactgtcgagcttttctaagatatatctct gacaaactctgtggaagggaaatctactcagtgggcagaacgttagacaaggcacttggc tgtgcacttttcttggagggagaaaaggccagctgtgaaattatatgcaattcctgggtt gtgaccaatagttcagctggaggaccagggacttggaaggaatatgatggaagactggtg acaaagacatttgggaatgacatgtggggacagatctctctgagtgggaaaaaaataatg tga >gi568815583f:30804664_31037253|GENSCAN_predicted_peptide_5|77_aa MKSIVHLDMGPQSAHAADPLRTPSISPTAFGIPFDIMLISDPANSPNSWMSGSRKALDSV SIQDREKAPDTSVRALE >gi568815583f:30804664_31037253|GENSCAN_predicted_CDS_5|234_bp atgaagagtattgtccacctggacatgggtccccagagtgctcatgcagcagaccctctc agaactccatccatttctcccactgcctttggaataccctttgacataatgctgatttct gatccagcaaacagccccaatagttggatgtcgggttccaggaaggctttggattctgtg agcatccaggacagagaaaaagcaccagataccagcgtgagagctcttgagtga 
>gi568815583f:30804664_31037253|GENSCAN_predicted_peptide_6|1107_aa MGAFNVERKGDRSLEPPLAKPVEARRGDRLGMPGSAGARSQVGEHPVFLILMMSEGKPPD KKRPRRSLSISKNKKKASNSIISCFNNAPPAKLACPVCSKMVPRYDLNRHLDEMCANNDF VQVDPGQVGLINSNVSMVDLTSVTLEDVTPKKSPPPKTNLTPGQSDSAKREVKQKISPYF KSNDVVCKNQDELRNRSVKVICLGSLASKLSRKYVKAKKSIDKDEEFAGSSPQSSKSTVV KSLIDNSSEIEDEDQILENSSQKENVFKCDSLKEECIPEHMVRGSKIMEAESQKATRECE KSALTPGFSDNAIMLFSPDFTLRNTLKSTSEDSLVKQECIKEVVEKREACHCEEVKMTVA SEAKIQLSDSEAKSHSSADDASAWSNIQEAPLQDDSCLNNDIPHSIPLEQGSSCNGPGQT TGHPYYLRSFLVVLKTVLENEDDMLLFDEQEKGIVTKFYQLSATGQKLYVRLFQRKLSWI KMTKLEYEEIALDLTPVIEELTNAGFLQTAPWGLALAVPLAFCHGHPALSVLHQDECIPV SEPWTDFPLHPESELQELSEVLELLSAPELKSLAKTFHLVNPNGQKQQLVDAFLKLAKQR SVCTWGKNKPGIGAVILKRLEAVSTGASPGLCVEVDEMGTLGQSVDEWLSVQWWAVFSRI LLLFSLTDSMEDEDAACGGQGQLSTVLLVNLGRMEFPSYTINRKTHIFQDRDDLIRYAAA THMLSDISSAMANGNWEEAKELAQCAKRDWNRLKNHPSLRCHEDLPLFLRCFTVGWIYTR ILSRFVEILQRLHMYETIKCITEGLADPEVRTGHRLSLYQRAVRLRESPSCKKFKHLFQQ LPEMAVQDVKHVTITGRLCPQRGMCKSVFVMEAGEAADPTTVLCSVEELALAHYRRSGFD QGKGDVIGEDVQMPVKTERYSRPEMVRNRRIECTFKAPRDVETGHSLEEEAGWDGFMAKG PPSAPCMASSCGTSSSWMGFRMSSETPVRYSSAPAPRAFPLDLCTDSFFTSRRPALEARL QLIHDAPEESLRAWVAATWHEQEGRVASLVSWDRFTSLQQAQLVEVKGPNDRLSHKQMIW LAELQKLGAEVEVCHVVAVGAKSQSLS >gi568815583f:30804664_31037253|GENSCAN_predicted_CDS_6|3324_bp atgggagcctttaacgtggaaaggaaaggagatcgatcactcgagcctccacttgccaag cctgtggaagcgagacgcggggatcggctgggaatgcctggaagcgccggcgcgcggagc caggtgggagaacatccagtttttctaatactcatgatgtcagaagggaaacctcctgac aaaaaaaggcctcgtagaagcttatcaatcagcaagaataagaaaaaagcatctaattct attatttcgtgttttaacaatgcaccacctgctaaacttgcctgccccgtttgcagtaaa atggtgcctagatatgacttaaaccggcaccttgatgaaatgtgtgctaacaatgacttc gttcaagtggatccagggcaggttggcttaataaattcaaatgtgtctatggtagattta accagtgttaccttagaagatgtaacacctaagaagtcaccaccaccaaagacaaattta acccctggccaaagtgattcagcaaaaagggaagtaaagcagaagatcagtccctacttt aaaagtaatgatgtggtgtgcaaaaatcaagatgagctgagaaatcgtagtgtgaaagtc atttgtttgggaagcctagcatctaaattgtccagaaaatacgtaaaggctaaaaaatca atagataaggatgaagaatttgccggttctagtccacagagttccaaatccacagttgtt 
aagagcctgattgataactcttcagaaattgaggacgaggatcaaattttggagaacagt tctcaaaaagaaaacgtgtttaaatgtgattctctaaaggaagagtgcattcctgaacat atggtaagaggaagtaaaataatggaagccgaaagccaaaaggctacccgggaatgtgag aaatcagccctcacccctggattctcagataatgcgatcatgttattctcaccagatttc actcttaggaatacattaaagtctacttcagaagacagtcttgtaaagcaagagtgtatc aaagaagtggttgaaaaacgtgaggcatgtcattgtgaagaagtaaaaatgactgttgct tcagaagctaaaatacagctgtcagattcagaggcaaaatctcatagttctgcagatgat gcttctgcatggagtaacatccaagaggctcctctgcaggatgacagttgcttaaacaat gatatccctcacagcattcctttggagcaggggtcaagctgcaatggtcctggtcaaaca accggtcatccttactaccttcggagtttccttgtggtgctgaaaaccgtacttgagaat gaagatgatatgttgctctttgatgagcaggagaagggaattgtaactaaattttatcag ttatcagctactggtcagaagttatatgtaaggctctttcaacgtaaattaagctggatt aagatgaccaaattagagtatgaagagattgccttagacttaacacctgtgattgaagaa ttgacgaatgcaggctttctacagacagccccatggggcctggctctggctgtccctctc gctttctgccacggccacccagccctcagtgttcttcaccaggatgagtgcattcctgtc tcagagccttggactgactttcccctccacccagaatctgagttgcaagaactctctgaa gtgcttgaactcctttctgctcctgaactaaaatccctagccaagaccttccacttggtg aatcccaatggacagaaacagcagctggtggacgcctttctcaaattggccaaacagcgt tcagtctgcacttggggcaagaataagcctggaattggtgcagtgattttaaaaagactg gaggctgtgtccactggggcatccccgggactctgcgtggaggtggatgaaatgggtact ttggggcagtctgtagatgagtggctctcagttcagtggtgggctgtgttttcccgcatc ttgctactgttttcgttgaccgactcaatggaagatgaagacgccgcttgtggaggtcag ggacagctttcaacagtcctgttggtcaacctcggccgaatggagtttcctagttacacc atcaatcggaaaacccacatcttccaagacagagatgatcttatcagatatgcagcagcc acgcacatgctgagtgacatttcttccgcaatggccaatgggaactgggaagaagctaag gagctcgctcagtgtgcaaaaagggattggaacagactgaaaaaccacccttctctgaga tgccacgaagatttaccactcttcctgcggtgtttcactgttgggtggatttatacaagg attttgtctcggtttgtggaaatactgcagagacttcacatgtatgagactatcaagtgc atcacagaggggctggcggatccggaagtcagaacgggacaccgcctttcactgtatcag cgagccgtgcgcctgcgagagtctccgagctgtaaaaagttcaagcacctcttccagcag ctcccagaaatggctgtgcaagatgtgaaacacgtgaccatcacaggcaggctgtgccca cagcgtgggatgtgcaagtctgtgtttgtgatggaggccggggaggccgctgaccccacc 
acggtcctgtgctctgtggaggagctggcactggcccattacagacgcagcggttttgac caggggaagggtgatgttattggggaagacgtgcaaatgccagtgaagacagagagatac agtaggcctgaaatggttagaaatagaagaatagaatgcacattcaaggccccgagagac gtggaaactggacactcgctggaggaggaagcaggctgggacggattcatggcgaagggt ccaccttcagcaccctgtatggcctcctcctgtgggacatcatcttcatggatgggattc cggatgtcttcagaaacgcctgtcaggtactccagtgcccctgccccacgagcattcccc ctggacttgtgcacagacagcttcttcacaagcagacgcccagcccttgaggccaggctg cagctgattcatgatgcccccgaggagagcctgcgggcctgggtggcagccacgtggcat gagcaggaaggcagagtggcttcccttgtcagctgggatcgcttcacgtctcttcagcaa gctcagctggtggaagttaaaggccccaatgatcgtctttcacataagcagatgatctgg ctggctgaactgcagaagctgggggctgaagtagaagtctgccatgtggttgcagttgga gctaagagccaaagccttagctaa >gi568815583f:30804664_31037253|GENSCAN_predicted_peptide_7|1000_aa MGPGARVGLAVPGPSPLGRALRRIPAQRAWAAALRALRPEASALPLARDGNGAKRRRHHV LPQAAQTHLQVLPPATAPAAGLVDRFGEDVSISWVPGLSWIARMLEVSCACIAERRGVSP KLPVDKVEDCDLDHWYPDSSQGLVSDLDEVFTSPWSNVASKVNNPPWTASMNTGLPLDVV RLVLREEVLFGEIVVNEVNFVRKCIATDTSQYDLWGKLICSNFKISFITDDPMPLQKFHY RNLLLGEHDVPLTCIEQIVTVNDHKRKQKVLGPNQKLKFNPTELIIYCKDFRIVRFRFDE SGPESAKKGSVSCGKLLAGLNAAEYSIGDASWETCHSCSLQQKDPGPEWVVPSVTRYCQA NKINGIPSGDGGGGGGGGNGAGGGSSQKTPLFETYSDWDREIKRTGASGWRVCSINEGYM ISTCLPEYIVVPSSLADQDLKIFSHSFVGRRMPLWCWSHSNGSALVRMALIKDVLQQRKI DQRICNAITKSHPQRSDVYKSDLDKTLPNIQEVQAAFVKLKQLCVNEPFEETEEKWLSSL ENTRWLEYVRAFLKHSAELVYMLESKHLSVVLQEEEGRDLSCCVASLVQVMLDPYFRTIT GFQSLIQKEWVMAGYQFLDRCNHLKRSEKESPLFLLFLDATWQLLEQYPAAFEFSETYLA VLYDSTRISLFGTFLFNSPHQRVKQSTEFAISKNIQLGDEKGLKFPSVWDWSLQFTAKDR TLFHNPFYIGKSTPCIQNGSVKSFKRTKKSYSSTLRGMPSALKNGIISDQELLPRRNSLI LKPKPDPAQQTDSQNSDTEQYFREWFSKPANLHGVILPRVSGTHIKLWKLCYFRWVPEAQ ISLGGSITAFHKLSLLADEVDVLSRMLRQQRSGPLEACYGELGQSRMYFNASGPHHTDTS GTPEFLSSSFPFSPFIVHNMKNNYDSLQSGQGEAAYTTSAVRGTCGDSSGVAAQGALERA QHPCCPPAEPVLRECRTDADALGEISSIERKGVQCLAVIL >gi568815583f:30804664_31037253|GENSCAN_predicted_CDS_7|3003_bp atgggccccggcgcccgcgtcggcctggctgtgcccggcccctccccgctcgggcgggcg ctgcgccgtatccccgcccaaagggcctgggcggccgcactgagagctttacgcccggag 
gcgtcggcgctgccactggcccgcgacgggaacggggcgaaaaggcggcggcaccatgtt ctccctcaagccgcccaaacccaccttcaggtcctacctcctgccaccgccccagctgct ggactggtggatcggttcggtgaggatgtgagtatctcctgggtgccaggtctgtcctgg atagcgagaatgctggaggtgtcatgtgcctgtatcgcagaaaggcgtggggtgagccct aagctgcctgttgacaaggtagaagactgtgacctggatcactggtacccagattccagc cagggcctggtatcagatttggatgaagtttttaccagcccttggtcaaatgtggccagc aaggttaataacccaccctggactgcaagcatgaacacaggtctgcctctggatgttgtt aggttggtactaagggaagaggtcctctttggagaaattgtcgtaaatgaagtcaatttt gtgagaaaatgcattgcaacagacacaagccagtacgatttgtggggaaagctgatatgc agtaacttcaaaatctcctttattacagatgacccaatgccattacagaaattccattac agaaaccttcttcttggtgaacacgatgtccctttaacatgtattgagcaaattgtcaca gtaaacgaccacaagaggaagcagaaagtcctaggccccaaccagaaactgaaatttaat ccaacagagttaattatttattgtaaagatttcagaattgtcagatttcgctttgatgaa tcaggtcccgaaagtgctaaaaagggaagtgtgtcttgtggtaagctgctggctggttta aatgcagcagaatattccattggggatgccagctgggagacttgccacagttgcagcctg cagcagaaagaccctgggccagaatgggttgtgccatctgtcaccagatattgccaagca aacaaaattaatggaattccctcaggagatggaggaggaggaggaggaggaggtaatgga gctggtggtggcagcagccagaaaactccactctttgaaacttactcggattgggacaga gaaatcaagaggacaggtgcttccgggtggagagtttgttctattaacgagggttacatg atatccacttgccttccagaatacattgtagtgccaagttctttagcagaccaagatcta aagatcttttcccattcttttgttgggagaaggatgccactctggtgctggagccactct aacggcagtgctcttgtgcgaatggccctcatcaaagacgtgctgcagcagaggaagatt gaccagaggatttgtaatgcaataactaaaagtcacccacagagaagtgatgtttacaaa tcagatttggataagaccttgcctaatattcaagaagtacaggcagcatttgtaaaactg aagcagctatgcgttaatgagccttttgaagaaactgaagagaaatggttatcttcactg gaaaatactcgatggttagaatatgtaagggcattccttaagcattcagcagaacttgta tacatgctagaaagcaaacatctctctgtagtcctacaagaggaggaaggaagagacttg agctgttgtgtagcttctcttgttcaagtgatgctggatccctattttaggacaattact ggatttcagagtctgatacagaaggagtgggtcatggcaggatatcagtttctagacaga tgcaaccatctaaagagatcagagaaagagtctcctttatttttgctattcttggatgcc acctggcagctgttagaacaatatcctgcagcttttgagttctccgaaacctacctggca gtgttgtatgacagcacccggatctcactgtttggcaccttcctgttcaactcccctcac 
cagcgagtgaagcaaagcacggaatttgctataagcaaaaacatccaattgggtgatgag aagggcttaaaattcccctctgtttgggactggtctctccagtttacagcaaaggatcgc acccttttccataaccccttctacattggaaagagcacaccttgtatacagaatggctcc gtgaagtcttttaaacggacaaagaaaagctacagctccacactaagaggaatgccgtct gccttaaagaatggaatcatcagtgaccaagaattacttccaaggagaaattcattgata ttaaaaccaaagccagatccagctcagcaaaccgacagccagaacagtgatacggagcag tattttagagaatggttttccaaacccgccaacctgcacggtgttattctgccacgtgtc tctggaacacacataaaactgtggaaactgtgctacttccgctgggttcccgaggcccag atcagcctgggtggctccatcacagcctttcacaagctctccctcctggctgatgaagtc gacgtactgagcaggatgctgcggcaacagcgcagtggccccctggaggcctgctatggg gagctgggccagagcaggatgtacttcaacgccagcggccctcaccacaccgacacctcg gggacaccggagtttctctcctcctcatttccattttctccttttattgtgcataatatg aaaaacaactatgacagccttcagtcgggccagggtgaagctgcttataccacctctgcc gtcagagggacatgtggtgacagcagtggtgtggctgcacagggcgcactagagagagct cagcacccctgctgcccgccagcagagcccgtgctgagggaatgccgcacagatgctgat gcactgggtgaaatttctagtattgaacgtaaaggtgtacagtgtcttgctgttatttta tga >gi568815583f:30804664_31037253|GENSCAN_predicted_peptide_8|788_aa MAELQAVPVGVARHTVACATTSVGRRETSHEIPSIKPQPPPGRPRCCGCTGDEISYLGYL LLFNYVILVRMDGWPSLQEWIVISYIVSLALEKIREILMSEPGKLSQKIKVWLQEYWNIT DLVAISTFMIGAILRLQNQPYMGYGRVIYCVDIIFWYIRVLDIFGVNKYLGPYVMMIGKM MIDMLYFVVIMLVVLMSFGVARQAILHPEEKPSWKLARNIFYMPYWMIYGEVFADQIDPL MACYLLVANILLVNLLIAVFNNTFFEVKSISNQVWKFQRYQLIMTFHDRPVLPPPMIILS HIYIIIMRLSGRCRKKREGDQEERDRGLKLFLSDEELKRLHEFEEQCVQEHFREKEDEQQ SSSDERIRVTSERVENMSMRLEEINERETFMKTSLQTVDLRLAQLEELSNRMVNALENLA GIDRSDLIQARSRASSECEATYLLRQSSINSADGYSLYRYHFNGEELLFEDTSLSTSPGT GVRKKTCSFRIKEEKDVKTHLVPECQNSLHLSLGTSTSATPDGSHLAVDDLKNAEESKLG PDIGISKEDDERQTDSKKEETISPSLNKTDVIHGQDKSDVQNTQLTVETTNIEGTISYPL EETKITRYFPDETINACKTMKSRSFVYSRGRKLVGGVNQDVEYSSITDQQLTTEWQCQVQ KITRSHSTDIPYIVSEAAVQAEHKEQFADMQDEHHVAEAIPRIPRLSLTITDRNGMENLL SVKPDQTLGFPSLRSKSLHGHPRNVKSIQGKLDRSGHASSVSSLVIVSGMTAEEKKVKKE KASTETEC >gi568815583f:30804664_31037253|GENSCAN_predicted_CDS_8|2367_bp atggccgagctgcaggcggtcccagtgggggtggccaggcacactgtggcttgtgcaacc 
acgtcagtcgggaggagagagacttctcatgaaatcccaagcatcaagccacagcccccg ccaggcaggcctcggtgctgtggttgcactggtgatgagatatcatacttgggctacctg ctgctgtttaactacgtcatcctggtgcggatggatggctggccgtccctccaggagtgg atcgtcatctcctacatcgtgagcctggcgttagagaagatacgagagatcctcatgtca gaaccaggcaaactcagccagaaaatcaaagtttggcttcaggagtactggaacatcaca gatctcgtggccatttccacattcatgattggagcaattcttcgcctacagaaccagccc tacatgggctatggccgggtgatctactgtgtggatatcatcttctggtacatccgtgtc ctggacatctttggtgtcaacaagtatctggggccatacgtgatgatgattggaaagatg atgatcgacatgctgtactttgtggtcatcatgctggtcgtgctcatgagtttcggagta gcccgtcaagccattctgcatccagaggagaagccctcttggaaactggcccgaaacatc ttctacatgccctactggatgatctatggagaggtgtttgcagaccagatagacccactc atggcgtgctatctactggtcgccaacatcctgctggtgaacctgctgattgctgtgttc aacaataccttctttgaagtaaaatcaatatccaaccaggtgtggaagttccagcgatat cagctgattatgacatttcatgacaggccagtcctgcccccaccgatgatcattttaagc cacatctacatcatcattatgcgtctcagcggccgctgcaggaaaaagagagaaggggac caagaggaacgggatcgtggattgaagctcttccttagcgacgaggagctaaagaggctg catgagttcgaggagcagtgcgtgcaggagcacttccgggagaaggaggatgagcagcag tcgtccagcgacgagcgcatccgggtcacttctgaaagagttgaaaatatgtcaatgagg ttggaagaaatcaatgaaagagaaacttttatgaaaacttccctgcagactgttgacctt cgacttgctcagctagaagaattatctaacagaatggtgaatgctcttgaaaatcttgcg ggaatcgacaggtctgacctgatccaggcacggtcccgggcttcttctgaatgtgaggca acgtatcttctccggcaaagcagcatcaatagcgctgatggctacagcttgtatcgatat cattttaacggagaagagttattatttgaggatacatctctctccacgtcaccagggaca ggagtcaggaaaaaaacctgttccttccgtataaaggaagagaaggacgtgaaaacgcac ctagtcccagaatgtcagaacagtcttcacctttcactgggcacaagcacatcagcaacc ccagatggcagtcaccttgcagtagatgacttaaagaacgctgaagagtcaaaattaggt ccagatattgggatttcaaaggaagatgatgaaagacagacagactctaaaaaagaagaa actatttccccaagtttaaataaaacagatgtgatacatggacaggacaaatcagatgtt caaaacactcagctaacagtggaaacgacaaatatagaaggcactatttcctatcccctg gaagaaaccaaaattacacgctatttccccgatgaaacgatcaatgcttgtaaaacaatg aagtccagaagcttcgtctattcccggggaagaaagctggtcggtggggttaaccaggat gtagagtacagttcaatcacggaccagcaattgacgacggaatggcaatgccaagttcaa 
aagatcacgcgctctcatagcacagatattccttacattgtgtcggaagctgcagtgcaa gctgagcataaagagcagtttgcagatatgcaagatgaacaccatgtcgctgaagcaatt cctcgaatccctcgcttgtccctaaccattactgacagaaatgggatggaaaacttactg tctgtgaagccagatcaaactttgggattcccatctctcaggtcaaaaagtttacatgga catcctaggaatgtgaaatccattcagggaaagttagacagatctggacatgccagtagt gtaagcagcttagtaattgtgtctggaatgacagcagaagaaaaaaaggttaagaaagag aaagcttccacagaaactgaatgctag