GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:06:37 Sequence gi568815595r:109228889_109437517 : 208629 bp : 41.96% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 726 882 157 2 1 70 38 118 0.638 5.12 1.02 Intr + 5479 5592 114 2 0 57 71 98 0.368 4.50 1.03 Term + 8026 8714 689 2 2 32 37 195 0.282 1.58 1.04 PlyA + 9904 9909 6 1.05 2.00 Prom + 13741 13780 40 -6.65 2.01 Init + 17282 17332 51 1 0 80 90 26 0.023 3.22 2.02 Term + 35597 35800 204 1 0 -33 55 445 0.880 25.49 2.03 PlyA + 36015 36020 6 1.05 3.16 PlyA - 41486 41481 6 1.05 3.15 Term - 42144 42058 87 0 0 16 43 343 0.823 19.28 3.14 Intr - 51092 50936 157 1 1 84 22 104 0.512 2.49 3.13 Intr - 75626 75587 40 0 1 97 76 21 0.049 -1.84 3.12 Intr - 75782 75705 78 0 0 107 71 12 0.115 0.00 3.11 Intr - 79405 79144 262 1 1 76 99 194 0.771 15.44 3.10 Intr - 80191 80108 84 1 0 88 98 72 0.985 7.30 3.09 Intr - 80442 80282 161 1 2 83 96 75 0.994 6.59 3.08 Intr - 83804 83597 208 2 1 73 69 205 0.558 14.83 3.07 Intr - 85819 85622 198 1 0 34 115 111 0.202 7.03 3.06 Intr - 100200 100088 113 1 2 87 70 49 0.562 2.18 3.05 Intr - 101924 101636 289 2 1 66 107 207 0.662 16.30 3.04 Intr - 102896 102846 51 2 0 73 88 35 0.532 0.19 3.03 Intr - 103143 102983 161 1 2 79 94 131 0.999 11.59 3.02 Intr - 105105 104982 124 0 1 90 116 84 0.920 10.64 3.01 Init - 108629 108576 54 2 0 67 78 56 0.614 3.83 3.00 Prom - 120038 119999 40 -4.15 4.03 PlyA - 121470 121465 6 1.05 4.02 Term - 135354 135150 205 2 1 -15 52 251 0.639 7.06 4.01 Init - 135533 135400 134 2 2 65 105 117 0.941 10.76 4.00 Prom - 137589 137550 40 -5.45 5.06 PlyA - 138040 138035 6 1.05 5.05 Term - 143976 143869 108 0 0 106 38 55 0.222 -0.17 5.04 Intr - 167764 167642 123 1 0 66 82 112 0.737 8.26 5.03 Intr - 170056 169976 81 1 0 45 87 63 0.481 0.82 5.02 Intr - 173994 173936 59 1 2 110 72 3 0.180 -1.42 5.01 Init - 178837 178063 775 1 1 70 28 220 0.159 9.22 5.00 Prom - 180193 180154 40 -5.35 6.04 PlyA - 180430 180425 6 -0.45 6.03 Term - 181181 180790 392 0 2 -4 48 428 0.885 23.76 6.02 Intr - 181304 181220 85 2 1 4 84 139 0.627 3.57 6.01 Init - 196953 196795 159 0 0 60 75 119 0.274 7.67 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:109228889_109437517|GENSCAN_predicted_peptide_1|319_aa MEISSEDEAKSHQTAETAVKQAISTTQNKVCADISIPLTYIKGINRYSHFKPKNCNPQIG RSHSGAQATRALSLNGSMQILNSHWVKTDLNRSMRQKINKDIQDFNSALDQMDLIDIYRT LHPKTTEYTFFSVPHGTYSKIDHIIGSTTLLSKCKRTEIITDNLSDHNAIKSELRIKKLI QNHTTTWKLNNLLLNDSWVNNEIKAEIKKFFETNKNKETIYQNLWNTAKAVLRGKFIALN AHIRKLERSQINTLASQLKELEEQEQKKKKKEKNPKARKRQEITKIRAELKEIETKKKSS KKINESSSWIFEKINKIDH >gi568815595r:109228889_109437517|GENSCAN_predicted_CDS_1|960_bp atggaaatctcttctgaggacgaagccaagagccatcagacagcagagacagctgtgaaa caggccatctctacaactcagaataaggtctgtgcagatatttctatacctctcacttac ataaaaggcataaaccggtactctcatttcaagcctaaaaactgcaacccacagatcgga agatcccactcgggagcccaggccaccagggccttgagtctcaatggaagcatgcagatt ctcaacagccattgggttaaaaccgacctaaacagatcaatgagacagaaaattaacaag gatattcaggacttcaactcagctctggatcaaatggacctaatagatatctacagaact ctccaccccaaaacaacagaatatacattcttctcagtgccacatggcacttactctaaa attgaccacataattggaagtacaacactcctcagcaaatgcaaaagaactgaaatcata acagacaatctctcagaccacaatgcaatcaaatcagaactcaggattaagaaactcatt caaaaccacacaactacatggaaattgaacaacctgctcctgaatgactcctgggtaaat aatgaaattaaagcagaaatcaagaagttctttgaaaccaataagaacaaagaaacaata tatcagaatctctggaacacagctaaagcagtgttaagagggaaatttatagcactaaat gcccacatcagaaagctagaaagatctcaaatcaacaccctagcatcacaattaaaagaa cttgaggagcaagagcaaaaaaaaaaaaaaaaagaaaaaaatccaaaagctaggaaaaga caagaaataactaagatcagagcagaactgaaagagatagagacaaaaaaaaaatcttcc aaaaaaatcaatgaatccagcagctggatttttgaaaaaattaataaaatagaccactag >gi568815595r:109228889_109437517|GENSCAN_predicted_peptide_2|84_aa MTSFDSRSYIQVTLMQEKKKEEEERRKEGKEGKEGKEGKEGEEGGEGEEGEEGGEGGEEG EEGGEGGEEGEGGEEGEGGEEGGE >gi568815595r:109228889_109437517|GENSCAN_predicted_CDS_2|255_bp atgacctcctttgactccaggtcttacatccaggtcacgctaatgcaagagaagaagaag gaggaggaggagaggaggaaggaagggaaggaagggaaggaagggaaggaagggaaggaa ggagaagaaggaggagaaggagaagaaggagaagaaggaggagaaggaggagaagaagga gaagaaggaggagaaggaggagaagaaggagaaggaggagaagaaggagaaggaggagaa gaaggaggagaatga >gi568815595r:109228889_109437517|GENSCAN_predicted_peptide_3|688_aa MLRGSASSTSMEKAKGKEWTSTEKSREEDQQASNQPNSIALPGTSAKRTKEKMSIKGSKV LCPKKKAEHTDNPRPQKKIPIPPLPSKLPPVNLIHRDILRAWCQQLKLSSKGQKLDAYKR LCAFAYPNQKDFPSTAKEAKIRKSLQKKLKVEKGETSLQSSETHPPEVALPPVGEPPALE NSTALLEGVNTVVVTTSAPEALLASWARISARARTPEAVESPQEASGVRWCVVHGKSLPA DTDGWVHLQFHAGQAWVPEKQEGRKMREGKKRRKGKQELIWVTGSRRKHLLIKGESRLRE VNYHILNIAVLAFPRVLLKMSDANLDSSKKNFLEGEVDDEESVILTLVPVKDDANMEQME PSVSSTSDVKLEKPKKYNPGMRTGIAQLVTPTTLGSPGKSHLLQTNEQFTAPQKARCKIP ALPLPTILPPINKVCRDTLRDWCQQLGLSTNGKKIEVYLRLHRHAYPEQRQVSNECVDGV KDMPEMSQETRLQRCSRKRKAVTKRARLQRSYEMNERAEETNTVEVITSAPGAMLASWAR IAARAVQPKALNSCSIPVSVEAFLMQASGVRWCVVHGRLLSADTKGWVRLQFHAGIEDNM LCPDCAKRYSEENTDVAVKQSHRQVLALSLTSFEKAGALLTLPQPVSSTGVFLIRLGRCF LLDDDDDDDDDGDDDDDDDGDDDDDVGL >gi568815595r:109228889_109437517|GENSCAN_predicted_CDS_3|2067_bp atgttgcgaggctccgcttcttctacaagtatggagaaggcaaaaggcaaggagtggacc tccacagagaagtcgagggaagaggatcagcaggcttctaatcaaccaaattcaattgct ttgccaggaacatcagcaaagagaaccaaagaaaaaatgtctatcaaaggcagtaaagtg ctctgccctaagaaaaaggcagagcacactgacaaccccagacctcagaagaagatacca atccctccattaccttctaaactgccacctgttaatctgattcaccgggacattctgcgg gcctggtgccaacaattgaagctgagctccaaaggccagaaattggatgcatataagcgc ctgtgtgcctttgcctacccaaatcaaaaggattttcctagcacagcaaaagaggccaaa atccggaaatcattgcaaaaaaaattaaaggtggaaaagggggaaacgtccctgcaaagt tctgagacacatcctcctgaagtggctcttcctcctgtgggggagccgcctgccctggaa aattccactgctctccttgagggagttaatacagttgtggtgacaacttctgccccagag gctttgctggcctcctgggcgagaatttcagccagggcgaggacaccagaggcagtggaa tctccacaagaggcctctggtgtcaggtggtgtgtggtccatgggaaaagtctccctgca gacacagatggttgggttcacctgcagtttcatgctggtcaagcctgggttccagaaaag caagaagggagaaaaatgagggagggcaaaaagaggaggaaaggaaaacaagagctcatc tgggtaacaggaagtagaaggaaacatttgcttataaaaggagaaagcaggttaagagaa gtaaactaccatatccttaacattgctgtccttgccttccccagggtgttgctgaaaatg tcagatgcaaatttggatagcagcaagaagaatttcttggagggggaagtagatgatgag gaaagtgtgattttgacactggtgccagttaaagatgacgcaaatatggaacaaatggaa ccaagcgtttcttcaacttctgatgtcaaactggagaagcctaagaaatacaatccaggt atgaggacagggattgctcagttagttactcctacaactctgggaagcccaggaaaaagt catctacttcaaacaaatgagcaatttacagctccacaaaaagctagatgcaaaatacca gcccttcccttgccgaccattttgcctcccattaataaggtgtgtcgggacactttgcgg gactggtgtcaacaactcggtttgagtactaatggcaagaaaatcgaagtttatctgagg cttcataggcatgcttaccctgaacaacggcaagtgagtaatgagtgtgttgatggtgtg aaggatatgcctgaaatgtcacaagagaccagattacagcgatgttcgaggaaacgcaag gcagtgaccaagagagcaaggcttcagagaagttatgagatgaatgagagagcagaagag accaatacagttgaagtgataacttcagcaccgggagccatgttggcatcatgggcaaga attgctgcaagagctgttcagcctaaggctttgaattcatgttccattcctgtttctgtt gaggcctttttgatgcaagcctctggcgtcaggtggtgtgtggtccatggcagacttctc tcggcagacacaaagggttgggtacgcctgcagtttcatgcaggcatagaagataatatg ttatgccccgactgtgctaagaggtatagtgaagagaatactgatgttgcagttaaacag agccatcgtcaagtcctggctttatcacttaccagctttgagaaagcaggggcgctcctc acgctccctcagcctgtttcctcaacaggggtgtttcttatcagattgggccgttgtttt ctcctagatgatgatgatgatgatgatgatgatggtgatgatgatgatgatgatgatggt gatgatgatgatgatgttggcctataa >gi568815595r:109228889_109437517|GENSCAN_predicted_peptide_4|112_aa MHSLDLAVNPRVPSGWALPTATLESSSQDFALPYGPWPPPGQPRSIQNPRDMIQEVCGSH PYELHAMELLKDSKHQTGPQVHQEKGGNTRPCQEAAGGAATQSGCYEEIAKN >gi568815595r:109228889_109437517|GENSCAN_predicted_CDS_4|339_bp atgcatagtttagatctagctgtgaatcctagagttccaagtggctgggcccttcccaca gctaccctggagagcagcagccaggactttgcgctaccctatggcccatggcctccacca gggcaaccaagaagcatacaaaatccaagggacatgatccaagaggtgtgtggctctcac ccttacgagctgcatgccatggagctgctcaaggactccaagcaccaaacgggccctcaa gttcaccaagaaaagggtgggaacacacgtccatgtcaagaagcagcaggaggagctgcg acccagtctggctgctatgaggaaattgccaagaactga >gi568815595r:109228889_109437517|GENSCAN_predicted_peptide_5|381_aa MDKFLDTYTLPRLNQEEVESLNRPITSSEIQAVINSLPTKKSPGLDRFTAEFYQRYKEEL VPFLLKLFQTKEKEGILSNSFYEASIILIPKPGRDTTKKENLKPISLMNINAKILNKILA NQIQQHIKKLIHHDQVGFIPGMQGWFDICKSTKVIHHINRTNDKNHMIISIDAEKAFDKI QHPFMLKTLNKLGIDGTYFKIIRAIYDRPTASIIPNGQKLHSFPLKTDTRQGCPLSPLLF NTVLEVLARAIRQEKEIKGAKFPKNCLAGAVGCMTLYKPHPPATGDQEKKSRKSGKQSLI PALAQKHTAFSATRSPGASWSAQQAVLEESAFVSSAKHTREESPEQKMLNSPTSGCATTL VNHLQDSGELDFPMSVDFCLF >gi568815595r:109228889_109437517|GENSCAN_predicted_CDS_5|1146_bp atggataagttcctggacacatacaccctcccaagactaaaccaggaagaagtcgaatcc ctgaatagaccaataacaagttctgaaattcaggcagtaattaatagcctaccaaccaaa aaaagtccaggactagacagattcacagccgaattctaccagaggtacaaagaggagctg gtaccattccttctgaaactattccaaacaaaagaaaaagagggaatcctctctaactca ttttatgaggccagcatcatcctgataccaaagcctggcagagacacaacaaaaaaagaa aacttgaagccaatatccctgatgaatatcaatgcgaaaatcctcaataaaatactggca aaccaaatccagcagcacatcaaaaagcttatccaccacgatcaagtcggcttcatccct gggatgcaaggctggtttgacatatgcaaatcaacaaaagtaatccatcacataaacaga accaatgacaaaaaccacatgattatctcaatagatgcagaaaaggccttcgataaaatt caacaccccttcatgctaaaaacactcaataaactaggtattgatggaacatatttcaaa ataataagagctatttatgacagacccacagccagtatcataccgaatgggcaaaagctg cactcattccctttgaaaaccgacacaagacagggatgccctctctcaccactcctattc aacacagtgttggaagttctggccagggcaatcaggcaagagaaagaaataaagggtgcc aaatttcccaagaattgtttagctggagcagttgggtgtatgacactctataagcctcac cctcctgccactggtgatcaagagaagaaaagcagaaagagtgggaagcagtcactaatt ccagcactggcccagaaacacacggcattttccgctacacggagcccaggcgcctcgtgg tcggcccagcaggcggtgctagaggagagcgcgtttgtgagcagcgccaagcacacgcgg gaagaaagccctgagcaaaaaatgctaaactctcctacctctggctgtgctacaacttta gtgaatcacttgcaggactctggtgaactagattttcccatgtctgtggacttttgtctt ttctga >gi568815595r:109228889_109437517|GENSCAN_predicted_peptide_6|211_aa MLNIISYQGSANQNHNAIPPYFCKNGHNQKNQKIIDVGVDVVNKENLYNANGNPPPHRHA AITLQRQRCLWLAELQLKDKGADCLQIDRKAARKQLGTKAAHKSAPSTGGVKKPHHYRPG TVALREIRRYQKATELLIRKLRFQRLVQEIAQDLKTDLRFQSAAIGALQEASEAYLVGLF EDSNLCAIHAKRLTIMPKEIQLARRIQGKRA >gi568815595r:109228889_109437517|GENSCAN_predicted_CDS_6|636_bp atgctcaacatcattagttatcagggaagtgcaaatcaaaaccacaatgcaataccacct tacttctgcaagaatggccataatcaaaaaaatcaaaaaataatagatgttggtgtggat gtggtgaacaaggaaaacctctacaatgctaatggtaatccgccgccgcaccgccatgcc gccatcactctccaacgccagcgctgcctctggctcgcagagctccagctgaaggataag ggagcggactgcctgcaaatcgaccgtaaagcagccaggaagcaactgggtacaaaagcc gctcacaagagtgcgccctctactggaggggtgaagaaacctcatcattacaggcctggt actgtggcactccgtgaaatcagacgttatcagaaggccactgaacttctgattcgcaaa cttcgctttcagcgtctggtgcaagaaattgctcaggacttgaaaacagatctacgcttc cagagtgcagctattggagctttgcaggaggcaagtgaggcctacctggttggccttttt gaagacagcaacctgtgtgctatccatgccaaacggctaactattatgccaaaagaaatc cagctagcacgccgcatacaaggaaaacgtgcttaa