GENSCAN 1.0 Date run: 6-Nov-116 Time: 05:28:32 Sequence gi568815589r:96288240_96517974 : 229735 bp : 44.16% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 Intr - 1328 1239 90 2 0 84 59 40 0.003 0.89 1.02 Intr - 10223 10177 47 0 2 79 99 51 0.828 3.43 1.01 Init - 13865 13712 154 2 1 110 110 146 0.989 17.18 1.00 Prom - 15976 15937 40 -5.66 2.13 PlyA - 17603 17598 6 1.05 2.12 Term - 26660 26541 120 2 0 120 38 45 0.534 1.17 2.11 Intr - 31938 31846 93 0 0 120 13 94 0.670 5.26 2.10 Intr - 33841 33759 83 2 2 130 97 -49 0.204 -0.44 2.09 Intr - 35930 35852 79 2 1 84 113 28 0.164 4.02 2.08 Intr - 55170 55033 138 0 0 74 85 13 0.340 0.26 2.07 Intr - 55757 55665 93 2 0 89 99 52 0.651 6.56 2.06 Intr - 57162 57060 103 2 1 109 76 20 0.417 3.08 2.05 Intr - 62932 62864 69 0 0 82 86 89 0.390 6.40 2.04 Intr - 65754 65708 47 0 2 77 39 45 0.125 -4.29 2.03 Intr - 76311 76225 87 0 0 107 115 17 0.326 6.47 2.02 Intr - 80066 80033 34 1 1 91 109 7 0.201 1.33 2.01 Init - 95395 95238 158 1 2 70 76 418 0.649 36.18 2.00 Prom - 97637 97598 40 -4.46 3.05 PlyA - 97726 97721 6 1.05 3.04 Term - 100220 99998 223 1 1 78 33 299 0.999 19.79 3.03 Intr - 104319 104159 161 0 2 32 55 152 0.025 5.09 3.02 Intr - 106703 106584 120 2 0 72 64 91 0.025 5.79 3.01 Init - 129793 129374 420 1 0 67 105 737 0.908 69.69 3.00 Prom - 139570 139531 40 -3.76 4.00 Prom + 143872 143911 40 -3.86 4.01 Init + 162041 162389 349 1 1 92 75 649 0.977 59.35 4.02 Intr + 170140 170302 163 2 1 84 53 128 0.899 7.93 4.03 Intr + 177098 177259 162 2 0 90 101 27 0.751 3.29 4.04 Intr + 177471 177539 69 0 0 92 96 26 0.656 2.10 4.05 Intr + 196223 196394 172 2 1 59 69 175 0.417 12.65 4.06 Intr + 199760 200035 276 1 0 61 66 122 0.652 5.01 4.07 Intr + 200579 200748 170 1 2 106 36 128 0.418 8.14 4.08 Intr + 203512 203662 151 1 1 52 19 107 0.130 0.36 4.09 Intr + 204781 204892 112 0 1 70 49 34 0.039 -2.35 4.10 Intr + 208418 208632 215 0 2 28 46 186 0.569 6.83 4.11 Term + 216117 216320 204 2 0 114 40 119 0.830 7.07 4.12 PlyA + 218290 218295 6 1.05 5.04 PlyA - 219178 219173 6 1.05 5.03 Term - 220610 220349 262 1 1 96 37 50 0.252 -4.30 5.02 Intr - 221550 221434 117 2 0 72 100 21 0.244 1.28 5.01 Intr - 225361 225280 82 2 1 70 92 120 0.599 9.30 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 104297 104159 139 0 1 37 55 165 0.965 7.62 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:96288240_96517974|GENSCAN_predicted_peptide_1|97_aa MGDVLEQFFILTGLLVCLACLAKCVRFSRCVLLNYWKVLPKSFLRSMGQWAVITGAGDGI GKAYSFEEADEVNWETKVLNSLAILLMDSKTQKWRLE >gi568815589r:96288240_96517974|GENSCAN_predicted_CDS_1|291_bp atgggggacgtcctggaacagttcttcatcctcacagggctgctggtgtgcctggcctgc ctggcgaagtgcgtgagattctccagatgtgttttactgaactactggaaagttttgcca aagtctttcttgcggtcaatgggacagtgggcagtgatcactggagcaggcgatggaatt gggaaagcgtactcgttcgaggaggctgatgaagtgaattgggagacaaaggtgcttaat tctttagccatcctgttgatggattccaagactcaaaaatggaggctagag >gi568815589r:96288240_96517974|GENSCAN_predicted_peptide_2|367_aa MTAGGQAEAEGAGGEPGAARLPSRVARLLSALFYGTCSFLIVLVNKALLTTYGFPSPIFL GIGQMAATIMILYVSKLNKIIHFPDFDKKIPVKLLSGCSVVDDVKPVLGKQYSLNIILSV FAIILGAFIAAGSDLAFNLEGYIFVFLNDIFTAANGVYTKQKMDPKELGKYGVLFYNACF MIIPTLIISVSTGDLQQLFVPLEYFKTYPSHHAILPLNIPGSTFEFWEGVVLSAFALKPY LLLVSADVLHGSVQLLQFSPDDSSGWSHQECIRCLHWDINRWRLHFLFVKLCRVKYLVCS GAFTTGAFPHMLTYRQPLVYGKEAQMDRLDLVEVLPQVGIAKKGSFSSRFLGNWIQLGED HLNSHPI >gi568815589r:96288240_96517974|GENSCAN_predicted_CDS_2|1104_bp atgacggccggcggccaggccgaggccgagggcgctggcggggagcccggcgcggcgcgg ctgccctcgcgggtggcccggctgctgtcggcgctcttctacgggacctgctccttcctc atcgtgcttgtcaacaaggcgctgctgaccacctacggtttcccgtcaccaattttcctt ggaattggacagatggcagccaccataatgatactatatgtgtccaagctaaacaaaatc attcacttccctgattttgataagaaaattcctgtaaagctgctctctggatgcagtgtg gttgatgatgtgaagcctgtgctggggaagcagtattcactcaacatcatcctcagtgtc tttgccattattctcggggctttcatagcagctgggtctgaccttgcttttaacttagaa ggctatatttttgtattcctgaatgatatcttcacagcagcaaatggagtttataccaaa cagaaaatggacccaaaggagctagggaaatacggagtacttttctacaatgcctgcttc atgattatcccaactcttattattagtgtctccactggagacctgcaacagctttttgtt cctctggagtattttaaaacctatcccagccatcacgccattttacctctaaatattcca ggatccacatttgaattttgggaaggtgttgttctttcagcatttgccctgaaaccgtat ctgctgctggtttctgctgatgtactccacggttctgtgcagctattacaattcagccct gacgacagcagtggttggagccatcaagaatgtatccgttgcctacattgggatattaat cggtggagactacattttctctttgttaaactttgtagggttaaatatttggtttgttca ggagcttttaccacaggtgcatttcctcacatgctcacctaccgccaacccctggtgtat gggaaggaggcccaaatggataggctcgacttggtggaagttctgcctcaggtgggcata gccaaaaaggggtcattctcttctcggttcctggggaattggatccaactgggagaagac cacctgaattctcatcccatctaa >gi568815589r:96288240_96517974|GENSCAN_predicted_peptide_3|307_aa MIRGFEAPMAENPPPPPPPVIFCHDSPKRVLVSVIRTTPIKPTCGGGGEPEPPPPLIPTS PGFSDFMVYPWRWGENAHNVTLSPGAAGAAASAALPAAAAAEHSGLRGRGAPPPAASASA AASGGEDEEEASSPDSGHLKVRGPICVTIQTVEKPLFKVDSSKHISVFTPERNLLFVQKM VLTLCFEGCLSRFTHANRHCPKHPYARLKREEPTDTLSKHQAADNKAAAEWLARYWEMRE QRTPTLKGKLVQKADQEQQDPLEYLQSDEEDDEKRGAQRRLQEQRERLHGALALIELANL TGAPLRQ >gi568815589r:96288240_96517974|GENSCAN_predicted_CDS_3|924_bp atgatccggggcttcgaggcgcccatggcggagaacccgccgccgccgccgccgcccgtc atcttctgccacgactccccgaagcgggtgctggtgtcggtcatcaggacgaccccgatc aagccaacgtgcggcggtggaggggagccggagccgccgccgccgctcatccccaccagc cccggcttcagcgacttcatggtgtacccgtggcgctggggcgagaacgcacacaacgtg acgctcagccctggggccgcgggggccgccgcctcggccgccctgcctgcagccgcagcc gccgagcactcggggcttcgtggccggggcgcgcccccgcccgccgcctcggcctccgcc gccgcctcgggaggtgaggacgaggaggaagcgagcagcccagacagcggccacctcaag gtgagaggccctatctgtgtgactatccagactgtggaaaagcctttgttcaaagtggac agctcaaaacacatcagcgtcttcacaccggagagaaaccttttgtttgttcagaaaatg gtgttaaccttgtgttttgaaggctgcctgagcagattcacccatgcaaaccgccactgt ccgaagcacccctacgccaggctgaagagagaggagcccacggacacactcagcaaacat caggctgccgacaacaaggccgcggccgagtggctggcgaggtattgggaaatgagagag cagcgcacccccactttgaaaggcaagctggttcagaaggctgatcaggagcagcaggac cctctggaataccttcagtctgatgaagaggacgacgagaagagaggggcccagcgccgg ctgcaggagcagcgggagcgcctgcatggagccctcgcgctcatagagcttgccaacctg actggggcgccactccgacagtag >gi568815589r:96288240_96517974|GENSCAN_predicted_peptide_4|680_aa MKGALGSPVAAAGAAMQESFGCVVANRFHQLLDDESDPFDILREAERRRQQQLQRKRRDE AAAAAGAGPRGGRSPAGASGHRAGAGGRRESQKERKSLPAPVAQRPDSPGGGLQAPGQKR TPRRGEQQGWNDSRGPEGMLERAERRSYREYRPYETERQADFTAEKFPDEKPGDRFDRDR PLRGRGGPRGGMRGRGRGGPGNRVFDAFDQRGKREFERYGGNDKIAVRTEDNMGGCGVRT WGSGKDTRVPELEVEEETQVQEMTLDEWKNLQEQTRPKPEFNIRKPESTVPSKAVVIHKS KYRDDLLCSTLQPLSQMSVGMAVDLCVFDAVIVFQMVKDDYEDDSHVFRKPANDITSQLE INFGNLPRPGRGARGGTRGGRGRIRRAENYGPRAEVVVSLITSPRPIKVSEQRSFFSLKP DLRVRLYSHWCGYRSAAAFAHYRFTQCHAVPGMYCKNTDSSYPIELLSHVVEDSFGEPGA RSEPVMGYVCLPGLALADLVVCLDTSDHFLLLISKSMPFQHMGASREGDCCPLKLYCAVL AGAGKGDPAGSTAQATLKSPLEEVQRTRSRGARPAPSLREAPSSSSFLCPPAREHRGGFS VLCTAQDKRHWKRFTTLHFQVSNNFQKANKTKIRVRKTTSNLEGSSPDAAAAALFKEKRL NRSFRFQRSQQSTESKIQTS >gi568815589r:96288240_96517974|GENSCAN_predicted_CDS_4|2043_bp atgaagggcgctctggggagtcccgtggctgccgctggcgccgcgatgcaggagagtttc ggctgcgtggtggccaaccgcttccatcagctgctggacgacgagtcggacccgttcgac atcctgcgcgaggccgagcgccggcgccagcagcagctgcagcgcaagaggcgcgacgag gcggcggcggcggccggggccggtccccgcggcggcaggagcccagccggggcctcgggc cacagagccggcgcgggcggccggagggagtcgcagaaggagcgcaagagcctcccggcg cccgtcgctcagcggcccgatagccccgggggcggcctgcaggcgccgggccagaagcgg actcctagaagaggggagcagcaaggatggaatgacagccgtgggccggaggggatgctc gaaagagctgagcggagatcctacagggaataccgaccctatgagacagagaggcaggca gacttcacagctgagaagtttccagatgaaaaaccaggtgataggtttgatcgagacaga ccgttgagaggacgtggaggcccgagagggggtatgcgcggcagaggcagaggtggccct gggaacagagtttttgacgcttttgaccagagaggaaagcgagaatttgaaagatatggt gggaatgacaaaatagcagtcagaactgaagacaacatgggtggatgtggagttcgaacc tggggatcgggtaaagataccagagttcctgagttggaggtagaagaagaaacccaagtt caagagatgactttagatgagtggaaaaatcttcaagaacagaccagaccaaagcctgag tttaacatccggaaaccagaatccactgttccttccaaagccgtggtgattcacaagtca aaatacagagatgatctcctgtgtagcacactgcagccactaagccagatgagtgtgggg atggctgtggacttgtgcgtgtttgatgctgtaattgtgtttcagatggtaaaagatgac tatgaggacgattcccatgttttccggaaacccgccaatgacatcacatcccagctggag attaattttggtaacctccctcgtcctgggcgtggagccagaggaggcacccggggaggc cggggaaggatcaggagggcagagaactatggacccagagcagaagtggtggtttctttg atcacatctcctagaccgatcaaggtctcggagcaaaggagtttcttttcattgaagcct gatctccgtgtgcggctgtattctcactggtgtggctatcggtcagcggctgcttttgct cattaccggtttacgcagtgccacgcagtgcctggcatgtattgcaaaaacactgacagt tcctatcccatagagctgttgagtcatgtggttgaagacagttttggggaacccggtgca cgttctgaacctgtaatgggctatgtttgcttaccagggctagctctcgctgacttggtc gtgtgcctggatacttcagaccatttccttctgttgatcagcaagtccatgccattccag cacatgggagcttcacgggaaggagactgctgtcctctgaaactgtactgtgcggtgctg gcaggcgcaggcaagggagaccccgctggctccaccgcccaggccaccctgaagtccccg ctggaggaggtgcagcggacccgctcccggggagcccgtcccgcgccctccctccgagaa gccccttcctccagctccttcctctgcccgccggcccgagagcaccgaggcggcttctct gtcctgtgtactgcccaggacaagcggcattggaagcgattcacgacgctacactttcag gtcagtaacaactttcagaaagcaaacaaaaccaaaataagagtcagaaaaaccacttcc aatctggagggctcctcgcccgatgctgctgctgctgcgctcttcaaggagaagcgtttg aatcgatcatttcgttttcagagatcccaacagagcactgagtcaaagatccagaccagc tag >gi568815589r:96288240_96517974|GENSCAN_predicted_peptide_5|153_aa XFTISEISEYRGSKDAAEKKVNWNIGKRVILQSSVQSCKTSEPNISGSAGITKRTTRSAS RKSSVKSFPAMYIRGFGPRGLISVKLVPSAKQPVSQETLAELGVLCALPFEDSFSSAMSP LHCFVQPYCTLNTRMGMQQCPGIRIGGLSGTDS >gi568815589r:96288240_96517974|GENSCAN_predicted_CDS_5|462_bp natttcactatatctgaaatctcagagtaccgaggctccaaggatgctgcagagaagaag gtgaactggaatataggcaagagagtaattcttcaatccagtgttcagagctgtaaaaca tctgaacctaacatttctggcagtgcaggcattactaaaagaaccaccagatctgcttca aggaaaagcagtgttaaaagcttccctgccatgtatatccgaggctttgggcctaggggc cttatcagtgtgaaattagtccccagtgcaaagcagccagtctcccaagagaccttggca gagctgggagttctgtgtgctttgccttttgaagactcattcagctctgccatgtctcct ctacactgttttgtacaaccttactgcacacttaacactcgcatggggatgcagcagtgc cccggcataaggattggaggactgtcaggcactgactcatga