GENSCAN 1.0 Date run: 6-Nov-116 Time: 15:44:56 Sequence gi568815589r:96409673_96719378 : 309706 bp : 43.62% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 151 146 6 1.05 1.02 Term - 7535 7455 81 2 0 48 45 70 0.735 -3.61 1.01 Init - 8360 7941 420 2 0 67 105 737 0.907 69.69 1.00 Prom - 18137 18098 40 -3.76 2.00 Prom + 22439 22478 40 -3.86 2.01 Init + 40608 40956 349 2 1 92 75 649 0.977 59.35 2.02 Intr + 48707 48869 163 0 1 84 53 128 0.899 7.93 2.03 Intr + 55665 55826 162 0 0 90 101 27 0.751 3.29 2.04 Intr + 56038 56106 69 1 0 92 96 26 0.656 2.10 2.05 Intr + 74790 74961 172 0 1 59 69 175 0.417 12.65 2.06 Intr + 78327 78602 276 2 0 61 66 122 0.652 5.01 2.07 Intr + 79146 79315 170 2 2 106 36 128 0.418 8.14 2.08 Intr + 82079 82229 151 2 1 52 19 107 0.130 0.36 2.09 Intr + 83348 83459 112 1 1 70 49 34 0.039 -2.35 2.10 Intr + 86985 87199 215 1 2 28 46 186 0.568 6.83 2.11 Term + 94684 94887 204 0 0 114 40 119 0.818 7.07 2.12 PlyA + 96857 96862 6 1.05 3.15 PlyA - 97745 97740 6 1.05 3.14 Term - 99177 98916 262 2 1 96 37 50 0.211 -4.30 3.13 Intr - 100117 100001 117 0 0 72 100 21 0.205 1.28 3.12 Intr - 103928 103847 82 0 1 70 92 120 0.494 9.30 3.11 Intr - 105405 105398 8 2 2 126 98 0 0.413 -1.82 3.10 Intr - 112931 112834 98 2 2 49 94 66 0.367 2.11 3.09 Intr - 113748 113589 160 2 1 39 46 83 0.444 -0.81 3.08 Intr - 114053 113915 139 0 1 90 95 99 0.641 10.32 3.07 Intr - 124441 124255 187 1 1 17 95 224 0.365 15.26 3.06 Intr - 129468 129406 63 0 0 59 100 53 0.372 2.51 3.05 Intr - 168884 168796 89 0 2 63 48 91 0.022 2.29 3.04 Intr - 201413 201387 27 0 0 113 61 37 0.034 1.59 3.03 Intr - 208968 208823 146 2 2 105 68 101 0.866 9.73 3.02 Intr - 209297 209174 124 0 1 74 16 25 0.404 -6.46 3.01 Init - 209706 209547 160 0 1 79 105 297 0.993 28.49 3.00 Prom - 212030 211991 40 -2.86 4.07 PlyA - 215441 215436 6 1.05 4.06 Term - 232214 232087 128 0 2 143 38 -11 0.282 -2.16 4.05 Intr - 236316 236221 96 1 0 35 115 27 0.199 0.08 4.04 Intr - 241823 241718 106 2 1 98 110 64 0.985 9.39 4.03 Intr - 242040 241987 54 0 0 93 84 30 0.828 2.28 4.02 Intr - 245101 245033 69 1 0 100 89 32 0.934 3.88 4.01 Init - 245609 245418 192 2 0 95 88 506 0.999 48.27 4.00 Prom - 258380 258341 40 -3.16 5.00 Prom + 268121 268160 40 -2.46 5.01 Init + 269313 269416 104 2 2 94 54 59 0.224 2.81 5.02 Intr + 277526 277694 169 2 1 45 49 83 0.224 0.05 5.03 Intr + 289769 289970 202 1 1 70 107 20 0.074 0.96 5.04 Term + 291511 291650 140 2 2 18 55 123 0.395 0.13 5.05 PlyA + 292569 292574 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 150012 150077 66 2 0 73 106 35 0.898 4.88 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:96409673_96719378|GENSCAN_predicted_peptide_1|166_aa MIRGFEAPMAENPPPPPPPVIFCHDSPKRVLVSVIRTTPIKPTCGGGGEPEPPPPLIPTS PGFSDFMVYPWRWGENAHNVTLSPGAAGAAASAALPAAAAAEHSGLRGRGAPPPAASASA AASGGEDEEEASSPDSGHLKIALALLKDALQRQPKSKKGLLGLFCG >gi568815589r:96409673_96719378|GENSCAN_predicted_CDS_1|501_bp atgatccggggcttcgaggcgcccatggcggagaacccgccgccgccgccgccgcccgtc atcttctgccacgactccccgaagcgggtgctggtgtcggtcatcaggacgaccccgatc aagccaacgtgcggcggtggaggggagccggagccgccgccgccgctcatccccaccagc cccggcttcagcgacttcatggtgtacccgtggcgctggggcgagaacgcacacaacgtg acgctcagccctggggccgcgggggccgccgcctcggccgccctgcctgcagccgcagcc gccgagcactcggggcttcgtggccggggcgcgcccccgcccgccgcctcggcctccgcc gccgcctcgggaggtgaggacgaggaggaagcgagcagcccagacagcggccacctcaag atcgccctcgccctgttaaaagatgccctgcagagacagccgaagagcaagaaaggactt ctgggcctgttctgcggctga >gi568815589r:96409673_96719378|GENSCAN_predicted_peptide_2|680_aa MKGALGSPVAAAGAAMQESFGCVVANRFHQLLDDESDPFDILREAERRRQQQLQRKRRDE AAAAAGAGPRGGRSPAGASGHRAGAGGRRESQKERKSLPAPVAQRPDSPGGGLQAPGQKR TPRRGEQQGWNDSRGPEGMLERAERRSYREYRPYETERQADFTAEKFPDEKPGDRFDRDR PLRGRGGPRGGMRGRGRGGPGNRVFDAFDQRGKREFERYGGNDKIAVRTEDNMGGCGVRT WGSGKDTRVPELEVEEETQVQEMTLDEWKNLQEQTRPKPEFNIRKPESTVPSKAVVIHKS KYRDDLLCSTLQPLSQMSVGMAVDLCVFDAVIVFQMVKDDYEDDSHVFRKPANDITSQLE INFGNLPRPGRGARGGTRGGRGRIRRAENYGPRAEVVVSLITSPRPIKVSEQRSFFSLKP DLRVRLYSHWCGYRSAAAFAHYRFTQCHAVPGMYCKNTDSSYPIELLSHVVEDSFGEPGA RSEPVMGYVCLPGLALADLVVCLDTSDHFLLLISKSMPFQHMGASREGDCCPLKLYCAVL AGAGKGDPAGSTAQATLKSPLEEVQRTRSRGARPAPSLREAPSSSSFLCPPAREHRGGFS VLCTAQDKRHWKRFTTLHFQVSNNFQKANKTKIRVRKTTSNLEGSSPDAAAAALFKEKRL NRSFRFQRSQQSTESKIQTS >gi568815589r:96409673_96719378|GENSCAN_predicted_CDS_2|2043_bp atgaagggcgctctggggagtcccgtggctgccgctggcgccgcgatgcaggagagtttc ggctgcgtggtggccaaccgcttccatcagctgctggacgacgagtcggacccgttcgac atcctgcgcgaggccgagcgccggcgccagcagcagctgcagcgcaagaggcgcgacgag gcggcggcggcggccggggccggtccccgcggcggcaggagcccagccggggcctcgggc cacagagccggcgcgggcggccggagggagtcgcagaaggagcgcaagagcctcccggcg cccgtcgctcagcggcccgatagccccgggggcggcctgcaggcgccgggccagaagcgg actcctagaagaggggagcagcaaggatggaatgacagccgtgggccggaggggatgctc gaaagagctgagcggagatcctacagggaataccgaccctatgagacagagaggcaggca gacttcacagctgagaagtttccagatgaaaaaccaggtgataggtttgatcgagacaga ccgttgagaggacgtggaggcccgagagggggtatgcgcggcagaggcagaggtggccct gggaacagagtttttgacgcttttgaccagagaggaaagcgagaatttgaaagatatggt gggaatgacaaaatagcagtcagaactgaagacaacatgggtggatgtggagttcgaacc tggggatcgggtaaagataccagagttcctgagttggaggtagaagaagaaacccaagtt caagagatgactttagatgagtggaaaaatcttcaagaacagaccagaccaaagcctgag tttaacatccggaaaccagaatccactgttccttccaaagccgtggtgattcacaagtca aaatacagagatgatctcctgtgtagcacactgcagccactaagccagatgagtgtgggg atggctgtggacttgtgcgtgtttgatgctgtaattgtgtttcagatggtaaaagatgac tatgaggacgattcccatgttttccggaaacccgccaatgacatcacatcccagctggag attaattttggtaacctccctcgtcctgggcgtggagccagaggaggcacccggggaggc cggggaaggatcaggagggcagagaactatggacccagagcagaagtggtggtttctttg atcacatctcctagaccgatcaaggtctcggagcaaaggagtttcttttcattgaagcct gatctccgtgtgcggctgtattctcactggtgtggctatcggtcagcggctgcttttgct cattaccggtttacgcagtgccacgcagtgcctggcatgtattgcaaaaacactgacagt tcctatcccatagagctgttgagtcatgtggttgaagacagttttggggaacccggtgca cgttctgaacctgtaatgggctatgtttgcttaccagggctagctctcgctgacttggtc gtgtgcctggatacttcagaccatttccttctgttgatcagcaagtccatgccattccag cacatgggagcttcacgggaaggagactgctgtcctctgaaactgtactgtgcggtgctg gcaggcgcaggcaagggagaccccgctggctccaccgcccaggccaccctgaagtccccg ctggaggaggtgcagcggacccgctcccggggagcccgtcccgcgccctccctccgagaa gccccttcctccagctccttcctctgcccgccggcccgagagcaccgaggcggcttctct gtcctgtgtactgcccaggacaagcggcattggaagcgattcacgacgctacactttcag gtcagtaacaactttcagaaagcaaacaaaaccaaaataagagtcagaaaaaccacttcc aatctggagggctcctcgcccgatgctgctgctgctgcgctcttcaaggagaagcgtttg aatcgatcatttcgttttcagagatcccaacagagcactgagtcaaagatccagaccagc tag >gi568815589r:96409673_96719378|GENSCAN_predicted_peptide_3|553_aa MKRKSERRSSWAAAPPCSRRCSSTSPGVKKIRSSTQQDPRRRDPQDDVYLDITARWTEGH LSKLRASARRLEPAAPPRGLLLSSCGFRLQGLRARAPPERLRLPGSAPRRGLEKEGVTLV AFEPAGKTNVRVVTFEKSQPARAAFHGYCSNKDWIFGMNTTEGEVPFYHILSEVHEIHTT SRAMQYGFLNFNSFNLDEYEHYENHNVTTIIRLNKRMYDAKRFTDAGFDHHDLFFADGST PTDAIVKEFLDICENAEGAIAVHCKAGLGRTGTLIACYIMKHYRMTAAETIAWVRICRPG SVIGPQQQFLVMKQTNLWLEGDYFRQKLKGQENGQHRAAFSKLLSGVDDISINGVENQDQ QEPEPYSDDDEINGVTQGDRLRALKSRRQSKTNAIPLTEYNFTISEISEYRGSKDAAEKK VNWNIGKRVILQSSVQSCKTSEPNISGSAGITKRTTRSASRKSSVKSFPAMYIRGFGPRG LISVKLVPSAKQPVSQETLAELGVLCALPFEDSFSSAMSPLHCFVQPYCTLNTRMGMQQC PGIRIGGLSGTDS >gi568815589r:96409673_96719378|GENSCAN_predicted_CDS_3|1662_bp atgaagcggaaaagcgagcggcggtcgagctgggccgccgcgcccccctgctcgcggcgc tgctcgtcgacctcgccgggtgtgaagaagatccgcagctccacgcagcaagacccgcgc cgccgggacccccaggacgacgtgtacctggacatcaccgcaagatggaccgagggtcac ctttctaagttgcgggcgtcagcccggcgcctcgaacctgcagctcctccccgcgggctg cttctgagttcctgtggattccgccttcagggattgcgagcccgcgcgccccccgaacgc ctccgcctcccggggtccgctccccgccggggcctcgagaaggaaggtgtcacgttggtg gcattcgagccagccggcaagacgaatgtccgggttgtcacttttgagaaatcccagcca gccagagcagcttttcatggatactgctccaacaaagactggatttttggcatgaatacc acagaaggtgaagtgcccttctatcacatcttgtcagaggtacatgaaatccacacgaca tcacgggcaatgcagtatggcttccttaatttcaactcatttaaccttgatgaatatgaa cactatgaaaatcacaatgttactaccattattcgtctgaataaaaggatgtatgatgcc aaacgctttacggatgctggcttcgatcaccatgatcttttctttgcggatggcagcacc cctactgatgccattgtcaaagaattcctagatatctgtgaaaatgctgagggtgccatt gcagtacattgcaaagctggccttggtcgcacgggcactctgatagcctgctacatcatg aagcattacaggatgacagcagccgagaccattgcgtgggtcaggatctgcagacctggc tcggtgattgggcctcagcagcagtttttggtgatgaagcaaaccaacctctggctggaa ggggactattttcgtcagaagttaaaggggcaggagaatggacaacacagagcagccttc tccaaacttctctctggcgttgatgacatttccataaatggggtcgagaatcaagatcag caagaacccgaaccgtacagtgatgatgacgaaatcaatggagtgacacaaggtgataga cttcgggccttgaaaagcagaagacaatccaaaacaaacgctattcctctcacggaatac aatttcactatatctgaaatctcagagtaccgaggctccaaggatgctgcagagaagaag gtgaactggaatataggcaagagagtaattcttcaatccagtgttcagagctgtaaaaca tctgaacctaacatttctggcagtgcaggcattactaaaagaaccaccagatctgcttca aggaaaagcagtgttaaaagcttccctgccatgtatatccgaggctttgggcctaggggc cttatcagtgtgaaattagtccccagtgcaaagcagccagtctcccaagagaccttggca gagctgggagttctgtgtgctttgccttttgaagactcattcagctctgccatgtctcct ctacactgttttgtacaaccttactgcacacttaacactcgcatggggatgcagcagtgc cccggcataaggattggaggactgtcaggcactgactcatga >gi568815589r:96409673_96719378|GENSCAN_predicted_peptide_4|214_aa MAAPAPVTRQVSGAAALVPAPSGPDSGQPLAAAVAELPVLDARGQRVPFGALFRERRAVV VFVRHFLCYICKEYVEDLAKIPRSFLQEANVTLIVIGQSSYHHIEPFCKLTGYSHEIYVD PEREIYKRLGMKRGEEIASSGSLQSLWRAVTGPLFDFQGDPAQQGGTLILGPGNNIHFIH RDRNRLDHKPINSVLQLVGVQHVNFTNRPSVIHV >gi568815589r:96409673_96719378|GENSCAN_predicted_CDS_4|645_bp atggccgcgccggccccggtcacgcggcaggttagcggcgccgccgccctggtcccggcc ccgagcggccccgacagcgggcagcccctggcggccgccgtggccgagctgccggtgctg gacgcccgcgggcagcgggtaccgttcggcgcgctgttccgggagcgccgcgccgtggtg gtgttcgtgcggcatttcctgtgttacatctgcaaggaatacgtagaggatctggccaaa atccccaggagtttcttacaagaagcaaatgtcacccttatagtgattggacagtcatcc taccatcatattgagcctttttgcaagctgactggatattctcatgaaatctatgtcgat cctgagagagaaatttataaaagattgggaatgaaaagaggtgaagaaattgcttcctca ggaagccttcagagcctgtggcgggcagtgactggccctctctttgattttcaaggagac ccagctcagcaaggtggaaccctcattttaggtccaggtaacaacatccattttatacac cgcgataggaataggttggatcacaaacctatcaactctgttttacagcttgtaggagtt cagcatgtgaactttacaaacagaccttcagttatccatgtgtga >gi568815589r:96409673_96719378|GENSCAN_predicted_peptide_5|204_aa MDLNGHFSKEDLQMAYEHYLKSLIIREIPIECTMRTVRPLGRGESPTADVWVCECQLSGA SRTRHTQRGFEEGSAFQVERFLGSENGKFSKCWDYRHEPLCPAHNLSEEDIGENHDIGCA CDFLDMTGKTQQQQQQKIDKNFCASKDTVNRKMATQAGAGLTGCSECECGAHQVPAHPEL QLARKRCTQPRFPLVPLPPHLPAS >gi568815589r:96409673_96719378|GENSCAN_predicted_CDS_5|615_bp atggatcttaatggacatttctccaaagaagatctacaaatggcctatgagcactactta aaatcattaatcattagggaaatccctattgaatgcacaatgaggaccgtccgcccgctc ggaagaggcgaatctcccacggcagacgtctgggtgtgcgaatgccagttgagcggggcc tcgaggacgcgtcacacacagcgaggatttgaggagggaagcgccttccaggtagaacgg tttctgggttcagagaacggaaagttcagtaaatgctgggattataggcatgagccactg tgcccggcccataatctctctgaagaagacataggggaaaatcatgacattggatgtgcc tgtgatttcttggatatgacaggaaaaacacaacaacaacaacaacaaaaaatagacaaa aacttttgtgcatcaaaggatactgtcaacaggaaaatggcaacccaggcaggagcaggg ctgactggctgctccgagtgcgagtgcggggcccaccaagtccccgcccacccggaactg cagctggcccgcaagcgctgcacacagccccggttcccgctcgtgcctctccctccacac ctccctgcaagctga