GENSCAN 1.0 Date run: 5-Nov-116 Time: 15:23:56 Sequence gi568815597r:165301266_165544893 : 243628 bp : 43.07% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2418 2460 43 2 1 68 116 8 0.354 2.18 1.02 Intr + 8383 8498 116 2 2 40 99 96 0.773 6.07 1.03 Intr + 11812 11863 52 0 1 85 111 21 0.651 2.58 1.04 Term + 12624 12718 95 1 2 117 51 31 0.589 0.29 1.05 PlyA + 14248 14253 6 1.05 2.03 PlyA - 14855 14850 6 1.05 2.02 Term - 15365 15129 237 2 0 65 48 141 0.524 3.97 2.01 Init - 16577 16503 75 2 0 40 89 50 0.764 1.39 2.00 Prom - 21215 21176 40 -1.66 3.00 Prom + 22278 22317 40 -3.16 3.01 Init + 25833 25922 90 2 0 80 102 27 0.554 3.79 3.02 Intr + 42976 43120 145 0 1 66 42 116 0.797 4.66 3.03 Term + 43596 43888 293 1 2 127 33 96 0.911 3.51 3.04 PlyA + 44604 44609 6 1.05 4.05 PlyA - 45499 45494 6 1.05 4.04 Term - 51997 51807 191 2 2 78 42 400 0.274 32.01 4.03 Intr - 54316 54219 98 0 2 129 92 110 0.948 15.15 4.02 Intr - 54959 54805 155 2 2 95 94 14 0.716 1.57 4.01 Init - 55966 55661 306 1 0 59 77 157 0.210 9.00 4.00 Prom - 58057 58018 40 -7.36 5.00 Prom + 60073 60112 40 -7.86 5.01 Init + 62783 62818 36 1 0 79 115 39 0.831 5.71 5.02 Intr + 69266 69398 133 1 1 51 64 123 0.714 6.42 5.03 Intr + 75570 75678 109 1 1 113 70 59 0.445 5.94 5.04 Intr + 94753 94844 92 1 2 86 94 16 0.044 1.64 5.05 Term + 97823 98037 215 0 2 64 46 68 0.087 -2.41 5.06 PlyA + 98648 98653 6 1.05 6.14 PlyA - 99683 99678 6 1.05 6.13 Term - 100145 99998 148 1 1 123 52 232 0.918 20.47 6.12 Intr - 105652 105547 106 2 1 57 94 171 0.995 13.87 6.11 Intr - 107053 106962 92 0 2 89 91 79 0.997 7.94 6.10 Intr - 108425 108293 133 0 1 120 39 104 0.996 8.40 6.09 Intr - 109566 109437 130 0 1 109 103 106 0.999 14.47 6.08 Intr - 109844 109684 161 0 2 71 88 143 0.838 12.31 6.07 Intr - 115955 115776 180 0 0 45 100 257 0.985 22.44 6.06 Intr - 118749 118605 145 0 1 106 88 139 0.988 15.56 6.05 Intr - 127701 127454 248 1 2 125 111 115 0.425 14.68 6.04 Intr - 131994 131846 149 2 2 30 86 66 0.090 0.48 6.03 Intr - 135009 134846 164 0 2 119 37 32 0.074 -0.03 6.02 Intr - 138221 138130 92 0 2 65 39 104 0.387 2.91 6.01 Init - 143628 143580 49 0 1 70 107 43 0.152 5.51 6.00 Prom - 149065 149026 40 -6.46 7.08 PlyA - 150821 150816 6 1.05 7.07 Term - 151414 151334 81 1 0 108 33 77 0.540 1.89 7.06 Intr - 181098 181084 15 0 0 118 89 16 0.216 0.44 7.05 Intr - 183560 183469 92 0 2 133 36 50 0.324 4.01 7.04 Intr - 190641 190473 169 0 1 54 59 84 0.011 1.62 7.03 Intr - 206906 206815 92 0 2 81 100 47 0.693 4.91 7.02 Intr - 221822 221709 114 0 0 60 111 59 0.508 5.82 7.01 Init - 235950 235935 16 0 1 62 97 1 0.092 -1.39 7.00 Prom - 237847 237808 40 -4.26 8.00 Prom + 241134 241173 40 -3.16 8.01 Init + 243083 243489 407 1 2 41 67 320 0.349 21.26 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 159284 159345 62 1 2 86 91 53 0.833 6.12 S.002 Init + 170560 170662 103 0 1 80 80 83 0.802 7.10 S.003 Init - 186242 186237 6 2 0 118 88 0 0.947 3.87 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:165301266_165544893|GENSCAN_predicted_peptide_1|101_aa MGSRRVLDRNVKGQASPKSNAKTTGTRRCWMLKKTSEPPTFPMNFVSGTEGLLRFSHAQL MPIGNNIESTGNNNNKNSITGGTPWLTFPKLLSEIRAELGS >gi568815597r:165301266_165544893|GENSCAN_predicted_CDS_1|306_bp atgggaagtaggcgggtactggacaggaatgtaaagggccaagcctctccaaaaagcaat gcaaaaaccactgggaccagacgttgctggatgctcaagaagaccagtgaaccacccacc ttccccatgaactttgtgtctggaactgaaggcttgttgaggtttagtcatgctcaactt atgccaattggaaacaatatagagtctacaggtaacaacaacaacaaaaacagcatcaca ggaggaaccccttggctgacatttcccaaactgctttcagaaatcagggctgagctcggt agctga >gi568815597r:165301266_165544893|GENSCAN_predicted_peptide_2|103_aa MNYAEDKWWVINVALTLNPEDDVRSPPPRQSTCAYVSVPAGKPQFFLPSASLQYSLLVKQ LEGVISLTAKGAHHEVASVGTKEQVVSVPPGPPVLKVTVVLVP >gi568815597r:165301266_165544893|GENSCAN_predicted_CDS_2|312_bp atgaattacgcagaggataaatggtgggtcattaatgtagcgttgaccctgaacccagag gatgacgtgagatcgccacctccccgacaaagcacttgtgcgtatgtctcagtaccagcc gggaagccccagtttttcctgcccagcgcctccctgcagtattctctgcttgtgaagcag cttgagggagtcataagtcttactgccaagggtgcccatcatgaggttgctagtgtaggg accaaggagcaggtggtatcagtgcccccaggacctccagtcctcaaagtgaccgtggtg ctggttccatag >gi568815597r:165301266_165544893|GENSCAN_predicted_peptide_3|175_aa MDLVGLRSRSHPLETPAETGRVDKFIGWKKSLPHGTANWFSSLKSYACIPVKTVAVITRE HMHSRNSCHNVRRSKRRKRPHAQERHQPLPTLELGAPGQAGISSDSGSGCRVLKQRHIRT LLQRQGAYFWAQGELALICCTMNDNNAKNENMVWGDSPQEPNILVFKGTLMSLRG >gi568815597r:165301266_165544893|GENSCAN_predicted_CDS_3|528_bp atggatttggtaggtttgagatccagaagtcaccctctggagacgccagctgaaactggg agagtggacaagttcattggctggaagaagagtctacctcatggcactgccaactggttt tcatctctgaagagctatgcctgcatccctgtgaagacggtggctgtcattaccagagaa cacatgcacagtaggaattcctgccacaatgtgaggaggagcaagaggagaaagaggccc catgctcaggagagacatcaacctctccccaccctggagctgggggccccaggccaagct ggcatcagctccgactcaggctcaggctgtcgggtgctcaaacaaagacatattcggacc ctcctgcagagacagggggcctatttctgggcccagggggagctggccttaatctgctgc acaatgaatgacaacaatgctaagaatgaaaatatggtttggggggatagtccccaggag cccaacattttggtatttaaaggaactttaatgagcttaagaggctaa >gi568815597r:165301266_165544893|GENSCAN_predicted_peptide_4|249_aa MASVRPMSEYGSTLKEVGNWHEPSGDEGFEELGAAGPTSTGAKQLLSASRGPGKDCSAFI AAVGERRGNPRGGTAVQEAPTSPQRPSRGEGHWGATGPERPETFPALADWREDLPARPAH RLLRAPQQSAAPGVQKRAGRPPGPLQCPFSTWGRSSRPGPNMLDGLKMEENFQSAIDTSA SFSSLLGRAVSPKSVCEGCQRVILDRFLLRLNDSFWHEQCVQCASCKEPLETTCFYRDKK LYCKYDYEK >gi568815597r:165301266_165544893|GENSCAN_predicted_CDS_4|750_bp atggcctctgtacgccccatgagtgaatatggttcgaccttaaaggaagttggaaactgg catgagccctcgggtgatgaggggtttgaggagctaggagctgctgggcccacatctact ggcgctaaacagcttctcagcgcttcccggggcccgggtaaggattgcagcgcttttata gcagcagttggagagcggcgggggaatcctcgtggcggcaccgctgtccaggaggcgccg acatctccgcaaaggcccagtcggggtgaggggcactggggggcgaccgggccagagcgc cccgagacattcccggcactggccgactggcgggaggacctccccgcgcgccccgcacac cggctcctgcgcgcaccccaacagagcgcagcgccaggagtccagaagcgggcgggacgc cctccgggtcccttacagtgccccttctcgacctggggcaggtcctcccggcccggcccg aacatgctggacggcctaaagatggaggagaacttccaaagcgcgatcgacacctcggcc tccttctcctcgctgctgggcagagcggtgagccccaagtctgtctgcgagggctgtcag cgggtcatcttggacaggtttctgctgcggctcaacgacagcttctggcatgagcagtgc gtgcagtgcgcctcctgcaaagagcccctggagaccacctgcttctaccgggacaagaag ctgtactgcaagtatgactacgagaagtaa >gi568815597r:165301266_165544893|GENSCAN_predicted_peptide_5|194_aa MASAKASGEKVKQFKNGLIQSPLKKFRGDANSSQAPLSTRAVKCTVPVCEEPRSKPAPSP LRLPDPAIHSTAARIAFPFHGFDHVTSPLGRPEELGPGERMRRERSALKYVVLLMPSMWQ VSTGYKEDQLRKEVAFEALNLMGPAHAMKTAQLLPSFCSGLVRLPSSSSGDIAPDGCAAP CWALTTLCFAAGKS >gi568815597r:165301266_165544893|GENSCAN_predicted_CDS_5|585_bp atggcttcagcaaaggcatccggggagaaagtcaagcaattcaagaacggattaattcag tcccctctcaagaaatttcgaggggatgcaaatagttctcaagcacccctctccactcga gctgtcaagtgcacagtgccagtgtgtgaggagcccagatcaaagccagcaccctcccct ctgcggctccccgacccagccatccacagcacagctgccagaatcgcctttccgttccac ggttttgatcatgtaacatccccacttggaagaccagaggaattgggccctggagaaagg atgagaagagagagaagtgctcttaagtacgtggtgctgttgatgccttccatgtggcaa gtatcaacagggtataaggaggaccagctcaggaaggaggtggcctttgaggctctgaat ctcatgggaccagcccatgcaatgaagacagcccagctcctgccaagcttctgttcaggc ttagtgaggctccctagcagcagttcgggagacatcgcacctgatggatgtgcagcaccc tgctgggccctgaccactctgtgttttgctgcaggcaagagctag >gi568815597r:165301266_165544893|GENSCAN_predicted_peptide_6|598_aa MYGNYSHFMKFPAGYGVRTSWLAFRSRTGQFHLDSCSPQRRADELLRVAPRGLRETQFED HKQPQVMGKPYLLSPVSLLVTEGLNSMICRPFCVALTVNPALSGDLEVQDQGTSGFVSGE VHSLLPKMVPHMMEGMEKWTNCASSDAEGMKGSPGHTGSTSMSPSAALSTGKPMDSHPSY TDTPVSAPRTLSAVGTPLNALGSPYRVITSAMGPPSGALAAPPGINLVAPPSSQLNVVNS VSSSEDIKPLPGLPGIGNMNYPSTSPGSLVKHICAICGDRSSGKHYGVYSCEGCKGFFKR TIRKDLIYTCRDNKDCLIDKRQRNRCQYCRYQKCLVMGMKREAVQEERQRSRERAESEAE CATSGHEDMPVERILEAELAVEPKTESYGDMNMENSTNDPVTNICHAADKQLFTLVEWAK RIPHFSDLTLEDQVILLRAGWNELLIASFSHRSVSVQDGILLATGLHVHRSSAHSAGVGS IFDRVLTELVSKMKDMQMDKSELGCLRAIVLFNPDAKGLSNPSEVETLREKVYATLEAYT KQKYPEQPGRFAKLLLRLPALRSIGLKCLEHLFFFKLIGDTPIDTFLMEMLETPLQIT >gi568815597r:165301266_165544893|GENSCAN_predicted_CDS_6|1797_bp atgtatggaaattattctcacttcatgaagtttcccgcaggctatggagttcggaccagc tggctagcctttcgaagccgcacagggcagttccacctggattcatgtagcccccagaga agggctgatgagctgctcagggtggctccaaggggactcagggaaactcagtttgaagac cacaaacagccacaggtcatgggcaagccatacctcctttccccagtgtccttgcttgta accgaggggttgaactcgatgatctgcaggcccttctgtgtagctttgacagtcaatcca gctctttctggagacttggaagttcaggatcaaggcacgagcgggttcgtgtctggtgag gtccactctctgcttccaaagatggtgcctcacatgatggaagggatggaaaaatggaca aattgtgcatcctctgatgcagaagggatgaaaggctcccctggccacactggctctaca tccatgagcccatcagcagccttgtccacagggaagccaatggacagccaccccagctac acagataccccagtgagtgccccacggactctgagtgcagtggggacccccctcaatgcc ctgggctctccatatcgagtcatcacctctgccatgggcccaccctcaggagcacttgca gcgcctccaggaatcaacttggttgccccacccagctctcagctaaatgtggtcaacagt gtcagcagttcagaggacatcaagcccttaccagggcttcccgggattggaaacatgaac tacccatccaccagccccggatctctggttaaacacatctgtgccatctgtggagacaga tcctcaggaaagcactacggggtatacagttgtgaaggctgcaaagggttcttcaagagg acgataaggaaggacctcatctacacgtgtcgggataataaagactgcctcattgacaag cgtcagcgcaaccgctgccagtactgtcgctatcagaagtgccttgtcatgggcatgaag agggaagctgtgcaagaagaaagacagaggagccgagagcgagctgagagtgaggcagaa tgtgctaccagtggtcatgaagacatgcctgtggagaggattctagaagctgaacttgct gttgaaccaaagacagaatcctatggtgacatgaatatggagaactcgacaaatgaccct gttaccaacatatgtcatgctgctgacaagcagcttttcaccctcgttgaatgggccaag cgtattccccacttctctgacctcaccttggaggaccaggtcattttgcttcgggcaggg tggaatgaattgctgattgcctctttctcccaccgctcagtttccgtgcaggatggcatc cttctggccacgggtttacatgtccaccggagcagtgcccacagtgctggggtcggctcc atctttgacagagtcctaactgagctggtttccaaaatgaaagacatgcagatggacaag tcggaactgggatgcctgcgagccattgtactctttaacccagatgccaagggcctgtcc aacccctctgaggtggagactctgcgagagaaggtttatgccacccttgaggcctacacc aagcagaagtatccggaacagccaggcaggtttgccaagctgctgctgcgcctcccagct ctgcgttccattggcttgaaatgcctggagcacctcttcttcttcaagctcatcggggac acccccattgacaccttcctcatggagatgttggagaccccgctgcagatcacctga >gi568815597r:165301266_165544893|GENSCAN_predicted_peptide_7|192_aa MAPDFELRHVATSNSKGVWKTQFSCVKDIYKHLARSKGMDHSTAAWNLNVVSGASVVIQD CKCESHALRVVEQKAISMLAFANASALYHMPCFGVQFEMPKALTENLSQSTKYGSENPSH NGFSYSFIILGRMYVFYRNQNQRTVVKLLSSQSVTGTSIGGHSSYVGLEQIFLSSNADIY FFGWGSLDLDPV >gi568815597r:165301266_165544893|GENSCAN_predicted_CDS_7|579_bp atggctcctgattttgaactcaggcacgtggccacatctaacagcaagggagtctggaaa acacagttcagctgtgtgaaggatatttataaacatttagcaagatccaaaggaatggac catagcacagctgcctggaatttgaatgtggtatctggagcttcagtagtcatccaggac tgtaagtgtgaaagccatgcactgagagtggtggaacagaaggcaatatctatgttagcc tttgccaatgccagtgctttgtaccacatgccctgctttggtgtacaatttgaaatgccg aaagcactcactgaaaacttaagccaatccacaaaatatggctctgagaatccttcccat aatggattcagttatagtttcatcattctaggaaggatgtatgtcttttatagaaatcag aaccagcgcacagtggtgaagttgctgtcttcccaaagtgtcacaggcaccagcattgga ggacactcctcctacgtgggacttgaacagattttcctgagctctaatgctgacatctac ttctttggctggggttccttggatctggacccagtctag >gi568815597r:165301266_165544893|GENSCAN_predicted_peptide_8|136_aa MGLVSGSKCPNNCLCQAQEVICTGKQLTEYPLDIPLNTRRLFLNENRITSLPAMHLGLLS DLVYLDCQNNRIREVMDYTFIGVFKLIYLDLSSNNLTSISPFTFSVLSNLVQLNIANNPH LLSLHKFTFANTTSLS >gi568815597r:165301266_165544893|GENSCAN_predicted_CDS_8|408_bp atggggctggtatcagggtcaaagtgtccaaataattgtctgtgtcaagcccaagaagta atctgcacagggaagcagttaaccgaatacccccttgacatacccctgaacacccggagg ctgttcctgaacgagaacagaatcactagtttgccagcaatgcatctaggactcctcagt gaccttgtttatttggactgtcagaacaaccggattcgagaggtgatggattataccttc atcggggtcttcaaactcatctaccttgacctcagctccaacaacctaacctcgatctcc ccattcactttctcggtgctcagcaacctggtgcagctgaacattgccaacaaccctcac ctgttatcgcttcacaagttcacctttgccaacaccacctctttgagn