GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:29:27 Sequence gi568815584r:22926092_23134881 : 208790 bp : 48.06% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 Intr - 205 42 164 2 2 65 113 98 0.421 9.69 1.02 Intr - 2524 2406 119 0 2 82 37 41 0.555 -1.49 1.01 Init - 3270 3161 110 0 2 91 60 98 0.598 7.05 1.00 Prom - 6456 6417 40 -6.36 2.08 PlyA - 6476 6471 6 1.05 2.07 Term - 20617 20434 184 0 1 47 49 172 0.984 6.22 2.06 Intr - 21148 21080 69 0 0 115 66 96 0.995 8.40 2.05 Intr - 21640 21510 131 1 2 78 83 184 0.734 16.39 2.04 Intr - 21922 21777 146 2 2 53 15 236 0.999 12.80 2.03 Intr - 24319 24223 97 1 1 51 109 42 0.969 2.18 2.02 Intr - 25598 25464 135 2 0 85 36 98 0.943 5.06 2.01 Init - 26128 26126 3 1 0 98 95 0 0.295 1.70 2.00 Prom - 40332 40293 40 -3.86 3.08 PlyA - 40561 40556 6 1.05 3.07 Term - 47477 47352 126 2 0 82 49 116 0.994 5.38 3.06 Intr - 48024 47956 69 0 0 66 105 77 0.944 6.58 3.05 Intr - 48799 48748 52 0 1 147 78 -17 0.981 2.11 3.04 Intr - 48979 48883 97 2 1 3 89 158 0.248 6.47 3.03 Intr - 50621 50550 72 0 0 99 52 76 0.197 4.48 3.02 Intr - 52354 52253 102 2 0 65 89 70 0.960 4.95 3.01 Init - 56175 55170 1006 0 1 83 109 940 0.990 89.23 3.00 Prom - 59655 59616 40 -9.16 4.10 PlyA - 59844 59839 6 1.05 4.09 Term - 61543 61124 420 1 0 65 38 308 0.836 18.79 4.08 Intr - 61924 61812 113 2 2 100 77 210 0.999 21.20 4.07 Intr - 63754 63651 104 0 2 84 100 62 0.931 6.82 4.06 Intr - 64036 63975 62 1 2 105 86 78 0.958 6.83 4.05 Intr - 70177 69857 321 1 0 76 65 250 0.985 17.56 4.04 Intr - 72894 72336 559 2 1 102 76 442 0.643 37.23 4.03 Intr - 73065 73013 53 0 2 97 87 45 0.563 3.01 4.02 Intr - 81160 81056 105 1 0 93 53 66 0.322 4.01 4.01 Init - 90995 90993 3 2 0 113 81 0 0.329 1.80 4.00 Prom - 96280 96241 40 -6.36 5.04 PlyA - 96825 96820 6 1.05 5.03 Term - 100284 99998 287 1 2 100 48 446 0.963 37.37 5.02 Intr - 107583 107277 307 0 1 69 105 291 0.945 24.92 5.01 Init - 108790 108593 198 1 0 94 70 71 0.500 4.88 5.00 Prom - 109819 109780 40 -6.56 6.00 Prom + 113733 113772 40 -7.96 6.01 Sngl + 116000 117037 1038 1 0 78 54 1075 0.453 98.23 6.02 PlyA + 120983 120988 6 1.05 7.12 PlyA - 121001 120996 6 1.05 7.11 Term - 122388 121889 500 1 2 80 53 1069 0.999 97.29 7.10 Intr - 123184 122936 249 2 0 112 97 160 0.531 16.71 7.09 Intr - 123647 123536 112 2 1 88 92 153 0.976 15.65 7.08 Intr - 123852 123731 122 1 2 73 99 162 0.991 16.01 7.07 Intr - 126518 126382 137 1 2 110 85 100 0.999 12.11 7.06 Intr - 127658 127405 254 2 2 111 96 338 0.999 33.13 7.05 Intr - 128237 128050 188 0 2 63 77 264 0.999 22.21 7.04 Intr - 128582 128415 168 0 0 64 61 286 0.994 23.42 7.03 Intr - 128775 128656 120 1 0 91 81 158 0.809 15.87 7.02 Intr - 129262 128968 295 1 1 92 97 413 0.996 39.18 7.01 Init - 129642 129442 201 0 0 76 116 228 0.977 21.28 7.00 Prom - 132451 132412 40 -12.96 8.20 PlyA - 132497 132492 6 1.05 8.19 Term - 133383 133057 327 0 0 84 37 355 0.976 24.71 8.18 Intr - 135093 134993 101 1 2 79 109 108 0.999 11.83 8.17 Intr - 135531 135207 325 0 1 137 100 367 0.995 38.15 8.16 Intr - 136184 136077 108 2 0 92 94 79 0.998 9.38 8.15 Intr - 136432 136325 108 1 0 80 89 16 0.695 1.38 8.14 Intr - 136983 136838 146 1 2 55 111 117 0.988 10.70 8.13 Intr - 137486 137345 142 2 1 63 113 134 0.999 13.33 8.12 Intr - 138166 138014 153 1 0 45 110 215 0.962 19.67 8.11 Intr - 138397 138264 134 2 2 34 100 75 0.980 3.66 8.10 Intr - 141874 141739 136 1 1 35 89 96 0.711 4.54 8.09 Intr - 143526 143385 142 2 1 82 82 211 0.948 20.26 8.08 Intr - 152175 152060 116 0 2 90 75 51 0.710 3.25 8.07 Intr - 152947 152729 219 1 0 69 65 71 0.455 1.40 8.06 Intr - 154718 153456 1263 2 0 70 87 772 0.970 62.60 8.05 Intr - 155745 155657 89 1 2 91 93 86 0.910 9.09 8.04 Intr - 164010 163891 120 1 0 87 62 73 0.969 5.07 8.03 Intr - 164542 164431 112 1 1 95 121 79 0.992 11.85 8.02 Intr - 167453 167388 66 2 0 100 94 16 0.824 2.50 8.01 Init - 169021 168884 138 1 0 100 60 245 0.489 23.04 8.00 Prom - 171060 171021 40 -6.36 9.00 Prom + 171534 171573 40 -9.75 9.01 Sngl + 171682 171990 309 0 0 64 44 235 0.448 12.60 9.02 PlyA + 172372 172377 6 1.05 10.10 PlyA - 172992 172987 6 1.05 10.09 Term - 174085 174076 10 0 1 93 37 4 0.198 -6.53 10.08 Intr - 174988 174759 230 1 2 101 98 97 0.513 8.67 10.07 Intr - 175473 175201 273 0 0 66 94 96 0.720 5.73 10.06 Intr - 175674 175578 97 2 1 50 84 130 0.647 8.81 10.05 Intr - 175820 175751 70 0 1 69 89 59 0.400 2.34 10.04 Intr - 176175 176001 175 0 1 78 72 144 0.974 11.31 10.03 Intr - 176960 176835 126 2 0 78 17 117 0.946 4.48 10.02 Intr - 177727 177584 144 1 0 83 57 65 0.880 3.38 10.01 Init - 178829 178737 93 2 0 64 39 252 0.924 16.29 10.00 Prom - 185781 185742 40 -5.46 11.03 PlyA - 188920 188915 6 1.05 11.02 Term - 191731 191396 336 1 0 123 54 494 0.656 44.07 11.01 Init - 193000 192491 510 1 0 80 116 562 0.999 53.13 11.00 Prom - 195238 195199 40 -7.36 12.05 PlyA - 199227 199222 6 1.05 12.04 Term - 201252 201086 167 1 2 87 55 212 0.999 15.88 12.03 Intr - 202105 201928 178 1 1 90 97 137 0.942 14.29 12.02 Intr - 203708 203559 150 2 0 79 100 239 0.914 24.56 12.01 Intr - 205466 205370 97 1 1 110 70 84 0.923 8.81 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 194968 194805 164 2 2 106 44 133 0.849 8.80 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:22926092_23134881|GENSCAN_predicted_peptide_1|131_aa MAAMAVGGAGGSRVSSGRDLNCVPEIADTLGAVAKQGFDFLCMPVFHPRFKREFIQEPAK NRPGPQTRSDLLLSGRALEIGADLPSNHVIDRWLGEPIKAAILPTSIFLTNKKGFPVLSK MHQRLIFRLLK >gi568815584r:22926092_23134881|GENSCAN_predicted_CDS_1|393_bp atggcggcgatggcggtcgggggtgctggtgggagccgcgtgtccagcgggagggacctg aattgcgtccccgaaatagctgacacactaggggctgtggccaagcaggggtttgatttc ctctgcatgcctgtcttccatccgcgtttcaagagggagttcattcaggaacctgctaag aatcggcccggtccccagacacgatcagacctactgctgtcaggaagggctcttgaaatt ggggctgacctcccatctaatcatgtcattgatcgctggcttggggagcccatcaaagca gccattctccccactagcattttcctgaccaataagaagggatttcctgttctttctaag atgcaccagaggctcatcttccggctcctcaag >gi568815584r:22926092_23134881|GENSCAN_predicted_peptide_2|254_aa MFHETLEQRLLVTELMRLLGPSQEREIPPLLGLEKADLLELMPLSEDFVWMRARLQQEVE EQLKKKCFTLLCYYDPNSDADSETVKAAKVWKLAEVLVGEQQQCQDAKSQQKEQMLLLEK KSAAYSQVLLRCLTLLQRLLQEHRLKTQSELDRINAQYLEVKCGAMILKLRMEELKILSD TYTVEKVEVHRLIRDRLEGAIHLQEQDMENSRQVLNSYEVLGEEFDRLVKEYTVLKQATE NKRWALQEFSKVYR >gi568815584r:22926092_23134881|GENSCAN_predicted_CDS_2|765_bp atgtttcatgagacccttgaacagcggctgcttgtaactgaactgatgcggctcttaggt cctagccaggagagggagatacctccactgctggggctggagaaagcggaccttctggaa ctcatgccactctcagaggattttgtgtggatgagggctcggctacagcaagaagtagag gagcagctcaaaaagaaatgtttcactctgctctgctactatgatcccaattcagatgct gacagtgaaaccgtgaaggcagcaaaggtgtggaaactcgcagaggtcctggtgggtgag cagcagcagtgccaggatgccaagagccagcagaaggagcagatgttgctgctggagaag aagagtgctgcttactcccaggtgcttctccgctgcctcactttgctgcagaggcttctt caagaacaccggctgaagactcaatccgagctagaccgcatcaatgcccagtacctggaa gtcaagtgcggtgctatgatccttaagctgaggatggaggagctaaagattttgtccgac acttacactgttgagaaagtggaagttcatcgtctgattagggaccgtttggagggagcc attcacctacaggagcaggacatggagaactcaagacaggtcctgaactcctatgaggtc cttggggaggagtttgacaggctggtgaaagagtacaccgtactcaagcaggcaacagag aacaagcggtgggccctccaggagttcagcaaggtctaccgttga >gi568815584r:22926092_23134881|GENSCAN_predicted_peptide_3|507_aa MERLGEKASRLLEKFGRRKGESSRSGSDGTPGPGKGRLSGLGGPRKSGPRGATGGPGDEP LEPAREQGSLDAERNQRGSFEAPRYEGSFPAGPPPTRALPLPQSLPPDFRLEPTAPALSP RSSFASSSASDASKPSSPRGSLLLDGAGAGGAGGSRPCSNRTSGISMGYDQRHGSPLPAG PCLFGPPLAGAPAGYSPGGVPSAYPELHAALDRLYAQRPAGFGCQESRHSYPPALGSPGA LAGAGVGAAGPLERRGAQPGRHSVTGYGDCAVGARYQDELTALLRLTVGTGGREAGARGE PSGIEPSGLEEPPGPFVPEAARARMREPEAREDYFGTCIKCNKGIYGQSNACQALDSLYH TQCFVCCSCGRTLRCKAFYSVNGSVYCEEDYLVSCFRCIVCNKCLDGIPFTVDFSNQVYC VTDYHKNYAPKCAACGQPILPSEGCEDIVRVISMDRDYHFECYHCEDCRMQLSDEEGCCC FPLDGHLLCHGCHMQRLNARQPPANYI >gi568815584r:22926092_23134881|GENSCAN_predicted_CDS_3|1524_bp atggagcggttaggagagaaagccagtcgcctgctggagaagttcggccgcagaaagggt gaatctagccggtctgggtctgacgggacccccgggccgggcaaggggcgcctaagtggg ttggggggacctaggaagtcagggccccgaggagctactgggggacctggggatgagccg ttggagccggcccgggagcaaggttccctggacgctgagcgaaatcagcgcggctccttt gaggcgccgcgctacgaaggctcttttcccgcggggccgccgcccacccgggccttgcct ctacctcagtcgttgccccccgattttcggctggagcccacggccccggccctcagcccc cgctctagcttcgccagtagctcggccagcgacgcgagcaagccgtccagcccccggggc agcctgctgctggacggggcgggggctggcggagctggaggtagccggccctgcagcaat cgcaccagcggcatcagcatgggctacgaccagcgccacgggagccccttgccagcgggg ccgtgcctgtttggcccacccctggccggagcaccggcaggctattctcccggaggggtc ccgtccgcctacccggagctccacgccgccctggaccgattgtacgctcagcggcccgcg gggttcggctgccaggaaagccgccactcgtatcccccggccctgggcagccctggagct ctagccggggccggagtgggagcggcggggcccttggagagacggggggcgcaacccgga cgacactctgtgaccggctacggggactgcgccgtgggcgcccggtaccaggacgagcta acagctttgcttcgcctgacggtgggcaccggtgggcgagaagccggagcccgcggagaa ccctcggggattgagccgtcgggtctggaggagccaccaggtcctttcgttccggaggcc gcccgggcccggatgcgggagccagaggccagggaggactacttcggcacctgtatcaag tgcaacaaaggcatctatgggcagagcaatgcctgccaggccctggacagcctctaccac acccagtgctttgtttgctgctcttgtgggcgaactttgcgttgcaaggctttctacagt gtcaatggctctgtgtactgtgaggaagattatctggtgagctgtttccgatgcattgtt tgcaacaagtgcctggatggcatccccttcacagtggacttctccaaccaagtatactgt gtcaccgactaccacaaaaattatgctcctaagtgtgcagcctgtggccaacccatcctc ccctctgagggctgtgaggacatcgtgagggtgatatccatggaccgggattatcacttt gagtgctaccactgtgaggactgccggatgcagctgagtgatgaggaaggctgctgctgt ttccctctggatgggcacttgctctgccatggttgccacatgcagcggctcaatgcccga caaccccctgccaactatatctga >gi568815584r:22926092_23134881|GENSCAN_predicted_peptide_4|579_aa MPIVSVTLLPSHRLTANHREPEGKRGGVSPRARGSRTLQLGAWAPLTILIEKGNGSEARC CCCACKSETNGGNTGSQGGNPPPSTPITVTGHGLAVQSSEQLLHVIYQRVDKAVGLAEAA LGLARANNELLKRLQEEVGDLRQGKVSIPDEDGESRAHSSPPEEPGPLKESPGEAFKALS AVEEECDSVGSGVQVVIEELRQLGAASVGPGPLGFPATQRDMRLPGCTLAASEAAPLLNP LVDDYVASEGAVQRVLVPAYAKQLSPATQLAIQRATPETGPENGTKLPPPRPEDMLNAAA ALDSALEESGPGSTGELRHSLGLTVSPCRTRGSGQKNSRRKRDLVLSKLVHNVHNHITND KRFNGSESIKSSWNISVVKFLLEKLKQELVTSPHNYTDKELKGACVAYFLTKRREYRNSL NPFKGLKEKEEKKLRSRRYRLFANRSSIMRHFGPEDQRLWNDVTEELMSDEEDSLNEPGV WVARPPRFRAQRLTELCYHLDANSKHGTKANRVYGPPSDRLPSAEAQLLPPELYNPNFQE EEDEGGDENAPGSPSFDQPHKTCCPDLNSFIEIKVEKDE >gi568815584r:22926092_23134881|GENSCAN_predicted_CDS_4|1740_bp atgccaatagtgtccgtgaccctgctccccagccatcgtctaaccgccaatcaccgggag ccagaaggcaaacggggcggtgtctccccgcgtgcgcgcggtagccggaccctacagctt ggggcctgggctcctctgaccatcctcattgagaaaggaaatggcagcgaggccagatgc tgctgctgcgcctgtaagagtgagactaatggaggcaacacaggctcccagggtgggaat cctcctcccagcacccccatcacagtgactggacatggcttggctgttcagagctcagag cagctcctgcatgttatctaccagcgggtcgataaggcagtgggtttggctgaagctgct ctgggtcttgccagggccaacaatgagttgttaaaacgtctccaggaggaagtgggtgac ctgaggcaagggaaagtgtccatccctgatgaagatggggaaagccgggcacatagttcc ccacctgaggagcctgggcctctcaaggaaagtcccggggaagcctttaaggctctgtct gccgtggaagaggagtgtgacagcgtgggcagcggcgtgcaggtggtgattgaggagctg cggcagctgggagcagcctcagtggggcctgggcctttgggcttcccagcaactcagagg gacatgcggctcccagggtgcacgctggctgccagcgaggcggcccccctgctcaatcct ctggtggatgattacgtggcctctgagggtgcagtacagcgagttctggtccctgcttat gccaagcaactctcaccagccacacaactggcaatccagcgggcaaccccagagacagga ccagaaaatggaaccaagctgccaccaccccgccctgaggacatgctcaatgccgctgct gcgctggacagtgccttggaagagtcaggccctgggagcactggggagctgagacactct ctagggctgaccgtttccccatgcaggaccagaggaagtgggcagaagaactccaggcgc aagcgggatcttgtactctctaaactggtccacaatgtgcataaccacatcaccaatgac aagagattcaatgggtctgaaagcatcaagtcctcttggaatatttcagtagtgaagttt cttctggaaaagctcaagcaagagctggtgaccagtccccacaattacactgataaggag ctaaaaggagcctgtgtggcctacttccttactaagaggcgtgagtaccgcaactccctg aacccctttaaaggcctgaaggaaaaagaggagaagaaacttcgaagtcgccgatatcgg ctttttgccaaccgatccagtatcatgaggcattttggacctgaggaccaacgtctgtgg aatgatgtgacagaggaactgatgtcagatgaagaggacagtcttaacgagccaggtgtg tgggtggcccgccctccccgtttccgggcccagcgcctcacagagctctgctaccacctg gatgctaactctaagcatggcaccaaagccaaccgtgtgtatgggcctccctcagacaga ctgccttctgctgaagcccagctccttccaccagaactttacaatcctaatttccaagaa gaggaagatgagggaggggatgagaatgcacctggctccccatcttttgaccaaccccac aaaacctgctgtcctgacttgaactcattcattgaaatcaaggtggaaaaggatgaataa >gi568815584r:22926092_23134881|GENSCAN_predicted_peptide_5|263_aa MALASVLERPLPVNQRGFFGLGGRADLLDLGPGSLSDGLSLAAPGWGVPEEPGIEMLHGT TTLAFKFRHGVIVAADSRATAGAYIASQTVKKVIEINPYLLGTMAGGAADCSFWERLLAR QCRIYELRNKERISVAAASKLLANMVYQYKGMGLSMGTMICGWDKRGPGLYYVDSEGNRI SGATFSVGSGSVYAYGVMDRGYSYDLEVEQAYDLARRAIYQATYRDAYSGGAVNLYHVRE DGWIRVSSDNVADLHEKYSGSTP >gi568815584r:22926092_23134881|GENSCAN_predicted_CDS_5|792_bp atggcgcttgccagcgtgttggagagaccgctaccggtgaaccagcgcgggtttttcgga cttgggggtcgtgcagatctgctggatctaggtccagggagtctcagtgatggtctgagc ctggccgcgccaggctggggtgtcccagaagagccaggaatcgaaatgcttcatggaaca accaccctggccttcaagttccgccatggagtcatagttgcagctgactccagggctaca gcgggtgcttacattgcctcccagacggtgaagaaggtgatagagatcaacccatacctg ctaggcaccatggctgggggcgcagcggattgcagcttctgggaacggctgttggctcgg caatgtcgaatctatgagcttcgaaataaggaacgcatctctgtagcagctgcctccaaa ctgcttgccaacatggtgtatcagtacaaaggcatggggctgtccatgggcaccatgatc tgtggctgggataagagaggccctggcctctactacgtggacagtgaagggaaccggatt tcaggggccaccttctctgtaggttctggctctgtgtatgcatatggggtcatggatcgg ggctattcctatgacctggaagtggagcaggcctatgatctggcccgtcgagccatctac caagccacctacagagatgcctactcaggaggtgcagtcaacctctaccacgtgcgggag gatggctggatccgagtctccagtgacaatgtggctgatctacatgagaagtatagtggc tctaccccctga >gi568815584r:22926092_23134881|GENSCAN_predicted_peptide_6|345_aa MKRQLTHLPGRFWLWPSFSVASLLSHQTPATNSWLASSKLHSAPGMALQDVCKWQSPDTQ GPSPHLPRAGGWAVPRGCDPQTFLQIHGPRLAHGTTTLAFRFRHGVIAAADTRSSCGSYV ACPASCKVIPVHQHLLGTTSGTSADCATWYRVLQRELRLRELREGQLPSVASAAKLLSAM MSQYRGLDLCVATALCGWDRSGPELFYVYSDGTRLQGDIFSVGSGSPYAYGVLDRGYRYD MSTQEAYALARCAVAHATHRDAYSGGSVDLFHVRESGWEHVSRSDACVLYVELQKLLEPE PEEDASHAHPEPATAHRAAEDRELSVGPGEVTPGDSRMPAGTETV >gi568815584r:22926092_23134881|GENSCAN_predicted_CDS_6|1038_bp atgaagcgtcagctcacacaccttcctggccggttctggctgtggcccagcttctctgta gcgtccctcctatcccaccagaccccagccacaaattcctggcttgcttcttccaaactt cattcagccccagggatggctctgcaggatgtgtgcaagtggcagtcccctgacacccag ggaccatcacctcacctgcctcgggctggcggctgggctgtgccccggggttgtgaccct caaaccttcctgcagatccatggccccagactggcccacggcaccaccactctggccttc cgcttccgtcatggagtcattgctgcagctgacacgcgttcctcctgtggcagctatgtg gcgtgtccagcctcatgcaaggtcatccctgtgcaccagcacctcctgggtaccacctct ggcacctctgccgactgtgctacctggtatcgggtattacagcgggagctgcggcttcgg gaactgagggagggtcagctgcccagtgtggccagtgctgccaagctcttgtcagccatg atgtctcaataccggggactggatctctgtgtggccactgccctctgcggctgggaccgc tctggccctgagctcttctacgtctatagcgacggcacccgcctgcagggggacatcttc tctgtgggctctggatctccctatgcctacggcgtgctagaccgtggctatcgctacgac atgagcacccaggaagcctacgccctggctcgctgcgccgtggcccacgccacccaccgt gatgcctattcagggggctctgtagaccttttccacgtgcgggagagtggatgggagcat gtgtcacgcagtgatgcctgtgtgctgtacgtggagttacagaagctcctggagccggag ccagaggaggatgccagccatgcccatcctgagcctgccactgcccacagagctgcagaa gatagagagctctctgtggggccaggggaggtgacaccaggagactccaggatgccagca gggactgagacggtgtga >gi568815584r:22926092_23134881|GENSCAN_predicted_peptide_7|781_aa MWGLVRLLLAWLGGWGCMGRLAAPARAWAGSREHPGPALLRTRRSWVWNQFFVIEEYAGP EPVLIGKLHSDVDRGEGRTKYLLTGEGAGTVFVIDEATGNIHVTKSLDREEKAQYVLLAQ AVDRASNRPLEPPSEFIIKVQDINDNPPIFPLGPYHATVPEMSNVGTSVIQVTAHDADDP SYGNSAKLVYTVLDGLPFFSVDPQTGVVRTAIPNMDRETQEEFLVVIQAKDMGGHMGGLS GSTTVTVTLSDVNDNPPKFPQSLYQFSVVETAGPGTLVGRLRAQDPDLGDNALMAYSILD GEGSEAFSISTDLQGRDGLLTVRKPLDFESQRSYSFRVEATNTLIDPAYLRRGPFKDVAS VRVAVQDAPEPPAFTQAAYHLTVPENKAPGTLVGQISAADLDSPASPIRYSILPHSDPER CFSIQPEEGTIHTAAPLDREARAWHNLTVLATELDSSAQASRVQVAIQTLDENDNAPQLA EPYDTFVCDSAAPGQLIQVIRALDRDEVGNSSHVSFQGPLGPDANFTVQDNRDGSASLLL PSRPAPPRHAPYLVPIELWDWGQPALSSTATVTVSVCRCQPDGSVASCWPEAHLSAAGLS TGALLAIITCVGALLALVVLFVALRRQKQEALMVLEEEDVRENIITYDDEGGGEEDTEAF DITALQNPDGAAPPAPGPPARRDVLPRARVSRQPRPPGPADVAQLLALRLREADEDPGVP PYDSVQVYGYEGRGSSCGSLSSLGSGSEAGGAPGPAEPLDDWGPLFRTLAELYGAKEPPA P >gi568815584r:22926092_23134881|GENSCAN_predicted_CDS_7|2346_bp atgtggggcctggtgaggctcctgctggcctggctgggtggctggggctgcatggggcgt ctggcagccccagcccgggcctgggcagggtcccgggaacacccagggcctgctctgctg cggactcgaaggagctgggtctggaaccagttctttgtcattgaggaatatgctggtcca gagcctgttctcattggcaagctgcactcggatgttgaccggggagagggccgcaccaag tacctgttgaccggggagggggcaggcaccgtatttgtgattgatgaggccacaggcaat attcatgttaccaagagccttgaccgggaggaaaaggcgcaatatgtgctactggcccaa gccgtggaccgagcctccaaccggcccctggagcccccatcagagttcatcatcaaagtg caagacatcaacgacaatccacccatttttccccttgggccctaccatgccaccgtgccc gagatgtccaatgtcgggacatcagtgatccaggtgactgctcacgatgctgatgacccc agctatgggaacagtgccaagctggtgtacactgttctggatggactgcctttcttctct gtggacccccagactggagtggtgcgtacagccatccccaacatggaccgggagacacag gaggagttcttggtggtgatccaggccaaggacatgggcggccacatgggggggctgtca ggcagcactacggtgactgtcacgctcagcgatgtcaacgacaacccccccaagttccca cagagcctataccagttctccgtggtggagacagctggacctggcacactggtgggccgg ctccgggcccaggacccagacctgggggacaacgccctgatggcatacagcatcctggat ggggaggggtctgaggccttcagcatcagcacagacttgcagggtcgagacgggctcctc actgtccgcaagcccctagactttgagagccagcgctcctactccttccgtgtcgaggcc accaacacgctcattgacccagcctatctgcggcgagggcccttcaaggatgtggcctct gtgcgtgtggcagtgcaagatgccccagagccacctgccttcacccaggctgcctaccac ctgacagtgcctgagaacaaggccccggggaccctggtaggccagatctccgcggctgac ctggactcccctgccagcccaatcagatactccatcctcccccactcagatccggagcgt tgcttctctatccagcccgaggaaggcaccatccatacagcagcacccctggatcgcgag gctcgcgcctggcacaacctcactgtgctggctacagagctcgacagttctgcacaggcc tcgcgcgtgcaagtggccatccagaccctggatgagaatgacaatgctccccagctggct gagccctacgatacttttgtgtgtgactctgcagctcctggccagctgattcaggtcatc cgggccctggacagagatgaagttggcaacagtagccatgtctcctttcaaggtcctctg ggccctgatgccaactttactgtccaggacaaccgagatggctccgccagcctgctgctg ccctcccgccctgctccaccccgccatgccccctacttggttcccatagaactgtgggac tgggggcagccggcgctgagcagcactgccacagtgactgttagtgtgtgccgctgccag cctgacggctctgtggcatcctgctggcctgaggctcacctctcagctgctgggctcagc accggcgccctgcttgccatcatcacctgtgtgggtgccctgcttgccctggtggtgctc ttcgtggccctgcggcggcagaagcaagaagcactgatggtactggaggaggaggacgtc cgagagaacatcatcacctacgacgacgagggcggcggcgaggaggacaccgaggccttc gacatcacggccttgcagaacccggacggggcggcccccccggcgcccggccctcccgcg cgccgagacgtgttgccccgggcccgggtgtcgcgccagcccagaccccccggccccgcc gacgtggcgcagctcctggcgctgcggctccgcgaggcggacgaggaccccggcgtaccc ccgtacgactcggtgcaggtgtacggctacgagggccgcggctcctcttgcggctccctc agctccctgggctccggcagcgaagccggcggcgcccccggccccgcggagccgctggac gactggggtccgctcttccgcaccctggccgagctgtatggggccaaggagcccccggcc ccctga >gi568815584r:22926092_23134881|GENSCAN_predicted_peptide_8|1314_aa MAELEEVTLDGKPLQALRVTDLKAALEQRGLAKSGQKSALVKRLKGALMLENLQKHSTPH AAFQPNSQIGEEMSQNSFIKQYLEKQQELLRQRLEREAREAAELEEASAESEDEMIHPEG VASLLPPDFQSSLERPELELSRHSPRKSSSISEEKGDSDDEKPRKGERRSSRVRQARAAK LSEGSQPAEEEEDQETPSRNLRVRADRNLKTEEEEEEEEEEEEDDEEEEGDDEGQKSREA PILKEFKEEGEEIPRVKPEEMMDERPKTRSQEQEVLERGGRFTRSQEEARKSHLARQQQE KEMKTTSPLEEEEREIKSSQGLKEKSKSPSPPRLTEDRKKASLVALPEQTASEEETPPPL LTKEASSPPPHPQLHSEEEIEPMEGPAPAVLIQLSPPNTDADTRELLVSQHTVQLVGGLS PLSSPSDTKAESPAEKVPEESVLPLVQKSTLADYSAQKDLEPESDRSAQPLPLKIEELAL AKGITEECLKQPSLEQKEGRRASHTLLPSHRLKQSADSSSSRSSSSSSSSSRSRSRSPDS SGSRSHSPLRSKQRDVAQARTHANPRGRPKMGSRSTSESRSRSRSRSRSASSNSRKSLSP GVSRDSSTSYTETKDPSSGQEVATPPVPQLQVCEPKERTSTSSSSVQARRLSQPESAEKH VTQRLQPERGSPKKCEAEEAEPPAATQPQTSETQTSHLPESERIHHTVEEKEEVTMDTSE NRPENDVPEPPMPIADQVSNDDRPEGSVEDEEKKELESLRRCQPQLSEEKYSDLAAECLP GPGVFTYPQATFIRGSMIPLAATKGVPAGNSDTEGGQPGRKRRWGASTATTQKKPSISIT TESLKSLIPDIKPLAGQEAVVDLHADDSRISEDETERNGDDGTHDKGLKICRTVTQVVPA EGQENGQREEEEEEKEPEAEPPVPPQVSVEVALPPPAEHEVKKVTLGDTLTRRSISQQKS GVSITIDDPVRTAQVPSPPRGKISNIVHISNLVRPFTLGQLKELLGRTGTLVEEAFWIDK IKSHCFVTYSTVEEAVATRTALHGVKWPQSNPKFLCADYAEQDELDYHRGLLVDRPSETK TEEQGIPRPLHPPPPPPVQPPQHPRAEQREQERAVREQWAEREREMERRERTRSEREWDR DKVREGPRSRSRSRDRRRKERAKSKEKKSEKKEKAQEEPPAKLLDDLFRKTKAAPCIYWL PLTDSQIVQKEAERAERAKEREKRRKEQEEEEQKEREKEAERERNRQLEREKRREHSRER DRERERERERDRGDRDRDRERDRERGRERDRRDTKRHSRSRSRSTPVRDRGGRR >gi568815584r:22926092_23134881|GENSCAN_predicted_CDS_8|3945_bp atggcggagctggaggaggtgactctggacgggaagcctcttcaggcgctgcgggtgacc gacctgaaggccgcactggagcagcgaggcctagccaagagcgggcagaagagtgccctg gtcaagcggctcaaaggggctctaatgctagaaaatttacagaaacactcaacaccccat gctgcattccagccaaattcccagattggtgaggaaatgagccagaacagtttcataaaa cagtatctggaaaagcagcaggagctacttaggcagcgtctggaacgtgaagctcgagaa gctgcagaacttgaagaagcttcagctgagtcggaggacgagatgatccatcctgaggga gtggcttccctgctgcctcctgactttcagagcagcctggagagaccagagctggagctc agcagacattcgcccagaaaaagctcctcaatttctgaagagaaaggtgactctgatgat gagaaaccaaggaaaggagaaagacgatcatctagggtcagacaggcaagagcagctaaa ctgtctgagggcagccaacctgctgaggaggaagaggatcaagaaacaccttccagaaac ctaagggtcagagcagatcgaaatttgaaaacagaggaggaagaagaggaggaggaggag gaggaagaagatgatgaagaagaggaaggtgatgatgagggacaaaaatctagggaggca ccaatcctgaaagagtttaaggaagaaggggaagagatacctagagtaaaaccagaggag atgatggatgagagacccaaaacaagatcccaggaacaggaggtgttagagagaggaggg agatttacaagatcccaggaagaggctagaaaaagtcatctggccagacagcagcaggag aaggaaatgaaaacaacatctccccttgaggaggaagaaagagaaataaaatcttcacaa ggcttaaaggaaaaatcgaagtctccttcccctcctcgactgactgaagatcgaaagaag gcctcacttgtagcgctgccagagcaaactgccagcgaggaggagactcctccaccttta ctaacaaaggaagcatcttctccaccacctcatccacagctccatagcgaagaagaaata gagcccatggaaggcccagcccccgctgtcctcattcagttatctcctcctaatacagat gctgacaccagggagctattagtatctcagcatactgtccagttggtaggaggcctgtct cctttgtcaagtccttcagacaccaaagcagaatctccagcagagaaagtgccagaggag agtgtcctgcctctggttcagaaaagcacactggctgactactcagcccagaaggatctt gaacctgagtcagacagatctgctcagcccctccctctaaaaattgaggaattagcactg gccaaaggaatcactgaagaatgtctgaaacagccatctttggaacagaaggaaggcaga agagcttctcatacccttctcccaagccacagattgaaacagtcagctgattcatcctct agccggtcctcctcatcttcctcctccagttctagatcaagatctcgctctcctgacagt tcaggttctcggtctcattcaccgctcagatccaagcagagagatgtagcccaggcacgt actcatgccaaccctcgtggtagacccaagatgggctccagatcaacatcagagtccaga tcaaggtcacgttcacgttctcgttcagcatcaagcaacagcagaaaatctctgagccct ggagtctccagggacagcagcaccagctatactgaaaccaaagatccctcttctggtcag gaggttgcaactccaccagtgccacaactgcaggtctgtgagccaaaggagaggacttcc acctcctcatcctctgtccaagcaaggcgtctgagtcagcctgaatcagctgaaaagcat gtgacccagaggttacagcctgagcgggggagcccaaagaagtgtgaagctgaagaggca gagccaccagctgccacacagccccaaacctcagagactcagacctctcatctgccagaa tcagaaagaattcatcacactgttgaggagaaggaggaagtgaccatggacacaagtgaa aacagacctgaaaatgatgttccagaacctcccatgcctattgcagaccaagtcagcaat gatgaccgcccggagggcagtgttgaagatgaggagaagaaagagctggagtccctgagg agatgccagccacaactctccgaagagaagtattctgaccttgctgctgaatgtctgcca ggtcctggggtattcacctacccccaggctaccttcatcagaggctccatgatcccactc gcagctaccaagggggtgccagctggaaacagtgacacagaggggggccagcctggtcgg aaacgacgctggggagccagcacagccaccacacagaagaaaccttccatcagtatcacc actgaatcactaaagagcctcatccccgacatcaaacccctggcggggcaggaggctgtt gtggatcttcatgctgatgactctcgcatctctgaggatgagacagagcgtaatggcgat gatgggacccatgacaaggggctgaaaatatgccggacagtcactcaggtagtacctgca gagggccaggagaatgggcagagggaagaagaggaagaagagaaggaacctgaagcagaa cctcctgtacctccccaggtgtcagtagaggtggccttgcccccacctgcagagcatgaa gtaaagaaagtgactttaggagataccttaactcgacgttccattagccagcagaagtcc ggagtttccattaccattgatgacccagtccgaactgcccaggtgccctccccaccccgg ggcaagattagcaacattgtccatatctccaatttggtccgtcctttcactttaggccag ctaaaggagttgttggggcgcacaggaaccttggtggaagaggccttctggattgacaag atcaaatctcattgctttgtaacgtactcaacagtagaggaagctgttgccacccgcaca gctctgcacggggtcaaatggccccagtccaatcccaaattcctttgtgctgactatgcc gagcaagatgagctggattatcaccgaggcctcttggtggaccgtccctctgaaactaag acagaggagcagggaataccacggcccctgcaccccccacccccacccccggtccagcca ccacagcacccccgggcagagcagcgggagcaggaacgggcagtgcgggaacagtgggca gaacgggaacgggaaatggagcggcgggagcggactcgatcagagcgtgaatgggatcgg gacaaagttcgagaagggccccgttcccgatcaaggtcccgtgaccgccgccgcaaggaa cgtgcgaagtctaaagaaaagaagagtgagaagaaagagaaagcccaggaggaaccacct gccaagctgctggatgaccttttccgaaagaccaaggcagctccctgcatctattggctc ccactgactgacagccagatcgttcagaaagaggcagagcgggccgaacgggccaaggag cgggagaagcggcgaaaggagcaagaagaagaagagcaaaaggagcgggagaaggaagcc gagcgggaacggaaccgacagctggagcgagagaaacgtcgggagcacagtcgggagagg gacagggagagagagagagaaagggagcgggacaggggggaccgagatcgggatagggaa agggaccgagaacgaggcagggaaagggatcgcagggacaccaagcgccacagcagaagc cggagtcggagcacacctgtgcgggaccggggtgggcgccgctag >gi568815584r:22926092_23134881|GENSCAN_predicted_peptide_9|102_aa MKCILHWFANWSGPQRERFLEDLVAKAVPEKLQPLLDSLEQLSVSGADRPPSIFECQLHL WDQWFRGWAEQERNEFVRQLEFSEPDFVAKFYQAVAATAGKD >gi568815584r:22926092_23134881|GENSCAN_predicted_CDS_9|309_bp atgaagtgtattcttcactggtttgccaattggtcaggtccccagcgtgaacgtttccta gaggacctggtagctaaggcagtgccagaaaaattacaaccactgctggatagtctggag cagcttagtgtgtctggggcagaccgaccaccttctatctttgagtgccagctacatctt tgggatcagtggtttcgaggctgggctgagcaggagcgcaatgaatttgtcagacagctg gagttcagtgagccagacttcgtggcaaagttttaccaagcagtggctgctacagctggt aaggactga >gi568815584r:22926092_23134881|GENSCAN_predicted_peptide_10|405_aa MLLLLLLLLLLPPLVLRVAASRCLHDETQKSIPDTHLRGYALWPEQGPPQLVQPDGPGVQ NTDFLLYVRVAHTSKCHQEPSVIAYAACCQLDSEDRPLAGTIVYCAQHLTSPSLSHSDIV MEGLLSSHWEARLLQGSLMTATFDGAQRTRLDPITLAAFKDSGWYQVNHSAAEELLWGQG SGPEFGLVTTCGTGSSDFFCTGSGLGCHYLHLDKGSCSSDPMLEGCRMYKPLANGSECWK KENGFPAGVDNPHGEIYHPQSRCFFANLTSQLLPGDKPRHPSLTPHLKEAELMGRCYLHQ CTGRGAYKVQVEGSPWVPCLPGKVIQIPGYYGLLFCPRGRLCQTNEDINAVTSPPVSLST PDPLFQLSLELAGPPGHSLGKEQQEGLAEAVLEALASKGGTGRLK >gi568815584r:22926092_23134881|GENSCAN_predicted_CDS_10|1218_bp atgctgctgctgctgctgctgctgctgctgctgccgccactagtcctcagggttgctgca agccgatgtctacatgatgagacacagaagtctattcctgacacccatcttcgcggttat gccttgtggccggagcagggtcccccacaactggtccagccagatgggcctggggtccaa aacactgattttctcctgtatgtgcgagttgctcacacttccaagtgccaccaagagccc tctgtcatagcctatgctgcctgctgccagctggactcagaagacaggcccctcgctggt accattgtctactgtgcccaacatctcaccagccccagcctcagccacagtgacatcgtc atggagggccttctgtcctcgcactgggaggccagactactccagggttctttaatgact gctacctttgatggagcccagcgcactcgactcgacccaatcaccctcgctgccttcaaa gactcaggctggtaccaggtcaaccacagcgctgcagaggagctgttgtggggccaggga tctggcccagaatttggcttggtgaccacatgtgggactggctcctcagacttcttctgt actggcagtgggctgggctgccactacctgcacctggacaagggaagctgctcctcagac cccatgctggaaggctgccgcatgtacaagcccttagccaatgggagtgaatgctggaag aaggaaaacggattccctgctggggtggataatccccatggggagatctaccatccccag agccgttgcttctttgccaacctcacttcacagctgctccctggggataagcccaggcat ccttctcttaccccacacctcaaggaagcagagctcatgggccgctgctacttacatcaa tgcacagggaggggagcttacaaggtgcaggtggagggctcgccttgggtcccatgcctt cctggaaaggttatacagatacctgggtactatggtcttctcttctgtccccggggtcgg ctgtgtcagactaatgaagatatcaatgctgttacttccccacctgtgagtctttcaacc ccagatccactattccagctctctttagaattagctgggcctccaggacactctctgggg aaggaacagcaagaagggctagctgaagcagtactggaggctttggcgagcaaaggcggc actggcaggctcaagtga >gi568815584r:22926092_23134881|GENSCAN_predicted_peptide_11|281_aa MSHGTYYECEPRGGQQPLEFSGGRAGPGELGDMCEHEASIDLSAYIESGEEQLLSDLFAV KPAPEARGLKGPGTPAFPHYLPPDPRPFAYPPHTFGPDRKALGPGIYSSPGSYDPRAVAV KEEPRGPEGSRAASRGSYNPLQYQVAHCGQTAMHLPPTLAAPGQPLRVLKAPLATAAPPC SPLLKAPSPAGPLHKGKKAVNKDSLEYRLRRERNNIAVRKSRDKAKRRILETQQKVLEYM AENERLRSRVEQLTQELDTLRNLFRQIPEAANLIKGVGGCS >gi568815584r:22926092_23134881|GENSCAN_predicted_CDS_11|846_bp atgtcccacgggacctactacgagtgtgagccccggggtggccagcagccactcgagttc tcagggggccgagctgggcccggggagctaggggacatgtgtgagcatgaggcctccatt gacctctccgcctacatcgagtctggggaagagcagcttctctccgatctctttgccgtg aagccagcgcctgaggccagaggcctcaagggccccggaacccctgccttcccccactac ttgccgcctgaccctcggccctttgcctaccctccacataccttcggcccagacaggaag gcgctggggcctggcatctacagcagcccagggagctacgaccccagggctgtggcggtg aaggaggagccccgggggccagagggcagccgagctgccagccgaggcagctacaatccc ctgcagtaccaagtggcacactgtgggcagacagccatgcacctgcccccaactctggca gcacccggccagcctctgcgcgttctcaaggcccctttggccactgccgcacccccctgc agtcccctcctgaaggcgccctccccggctggccccttacacaagggcaagaaggcagtg aacaaagatagccttgagtaccggctgaggcgggagcgcaacaacatcgccgtgcgcaag agccgagacaaggccaagaggcgcattctggagacgcagcagaaggtgctggagtacatg gcagagaacgagcgcctccgcagccgcgtggagcagctcacccaggagctagacaccctc cgcaacctcttccgccagattcctgaggcggccaacctcatcaagggcgtggggggttgc agctga >gi568815584r:22926092_23134881|GENSCAN_predicted_peptide_12|197_aa XLFFAGAREGHLPSVLAMIHVKRCTPIPALLFTCISTLLMLVTSDMYTLINYVGFINYLF YGVTVAGQIVLRWKKPDIPRPIKINLLFPIIYLLFWAFLLVFSLWSEPVVCGIGLAIMLT GVPVYFLGVYWQHKPKCFSDFIELLTLVSQKMCVVVYPEVERGSGTEEANEDMEEQQQPM YQPTPTKDKDVAGQPQP >gi568815584r:22926092_23134881|GENSCAN_predicted_CDS_12|594_bp nngctgttcttcgctggagcccgagagggccaccttcccagtgtgttggccatgatccac gtgaagcgctgcaccccaatcccagccctgctcttcacatgcatctccaccctgctgatg ctggtcaccagcgacatgtacacactcatcaactatgtgggcttcatcaactacctcttc tatggggtcacggttgctggacagatagtccttcgctggaagaagcctgatatcccccgc cccatcaagatcaacctgctgttccccatcatctacttgctgttctgggccttcctgctg gtcttcagcctgtggtcagagccggtggtgtgtggcattggcctggccatcatgctgaca ggagtgcctgtctatttcctgggtgtttactggcaacacaagcccaagtgtttcagtgac ttcattgagctgctaaccctggtgagccagaagatgtgtgtggtcgtgtaccccgaggtg gagcggggctcagggacagaggaggctaatgaggacatggaggagcagcagcagcccatg taccaacccactcccacgaaggacaaggacgtggcggggcagccccagccctga