GENSCAN 1.0	Date run: 6-Nov-116	Time: 11:02:26

Sequence gi568815596r:98694535_98946800 : 252266 bp : 45.18% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   4432   4574  143  2  2  107   85    19 0.423   3.57
 1.02 Intr +  10380  10451   72  2  0   83   61    47 0.235   1.10
 1.03 Term +  28734  28832   99  2  0   66   54    69 0.216  -0.57
 1.04 PlyA +  29733  29738    6                               1.05

 2.03 PlyA -  30077  30072    6                               1.05
 2.02 Term -  32033  32003   31  1  1   94   49    32 0.206  -2.77
 2.01 Init -  36683  36514  170  2  2   52   76   292 0.942  21.41
 2.00 Prom -  41280  41241   40                              -4.46

 3.03 PlyA -  41603  41598    6                               1.05
 3.02 Term -  43964  43863  102  2  0  121   45    40 0.213   1.28
 3.01 Init -  55154  55101   54  2  0   68   97    44 0.573   4.58
 3.00 Prom -  71614  71575   40                              -4.46

 4.00 Prom +  74508  74547   40                              -6.46
 4.01 Init +  80681  80771   91  1  1   57   78    74 0.171   3.95
 4.02 Intr +  82751  82818   68  0  2   82   94    25 0.138   1.12
 4.03 Intr +  91943  91996   54  1  0   95  115    69 0.270   9.48
 4.04 Term +  96097  96123   27  0  0  131   38    12 0.521  -1.43
 4.05 PlyA +  97872  97877    6                               1.05

 5.14 PlyA -  99331  99326    6                               1.05
 5.13 Term - 100140  99998  143  1  2   59   43    67 0.495  -2.61
 5.12 Intr - 101730 101586  145  0  1   71   51   141 0.942   8.56
 5.11 Intr - 103003 102744  260  2  2   81   67   160 0.721  10.38
 5.10 Intr - 107816 107719   98  1  2   48   46   116 0.066   3.05
 5.09 Intr - 114852 114720  133  1  1   88   61    24 0.016  -0.60
 5.08 Intr - 129003 127323 1681  0  1   73   99  1575 0.921 144.42
 5.07 Intr - 132635 132441  195  2  0   48   16   199 0.643   8.31
 5.06 Intr - 133693 133549  145  0  1   70   61    88 0.871   4.58
 5.05 Intr - 134144 134027  118  0  1  106   36    73 0.887   3.42
 5.04 Intr - 137978 137810  169  2  1   47   23   207 0.045   9.62
 5.03 Intr - 138463 138328  136  0  1   50   99    32 0.048   1.07
 5.02 Intr - 143753 143585  169  0  1   69   83   224 0.102  19.00
 5.01 Init - 152248 152197   52  1  1   69  110    23 0.207   4.10
 5.00 Prom - 162071 162032   40                              -3.46

 6.00 Prom + 173500 173539   40                              -0.96
 6.01 Init + 176924 176983   60  1  0   79   39    60 0.149   1.45
 6.02 Intr + 178183 178257   75  0  0    6   55   134 0.117   1.61
 6.03 Intr + 190384 190501  118  0  1   81   64    71 0.054   4.04
 6.04 Intr + 227245 227295   51  2  0   88   67    62 0.148   2.98
 6.05 Intr + 237614 237694   81  0  0   96   80    74 0.640   7.01
 6.06 Term + 246678 246766   89  1  2   67   48    76 0.076  -0.68
 6.07 PlyA + 248689 248694    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl + 141408 141668  261  2  0   74   36   155 0.813   4.07
S.002 Term - 143753 143581  173  0  2   69   38   235 0.893  14.59

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:98694535_98946800|GENSCAN_predicted_peptide_1|104_aa
XMSSLTAGLSGVPKTIPRFSDSLEGLTGLTIELQHREKTHGEKFGRNQLSKHASGEYTRG
ATDIGNKHPLRQAQSQGATCASSMRQQDSRSRRRAGRKTRTPED

>gi568815596r:98694535_98946800|GENSCAN_predicted_CDS_1|315_bp
nctatgagttccttaacagcaggtttgtcaggagttcctaagactatccccaggttcagt
gattcacttgaaggactcacaggactcaccattgagttgcagcacagggaaaagacacac
ggagagaagtttggaagaaaccagctgagcaaacatgcttctggagagtacaccagaggc
gcaactgacattgggaacaaacatcccttaaggcaggcccagtcccaaggtgcaacttgc
gccagcagcatgcgccagcaagatagcagaagcaggaggagagccggccggaagacacgt
acccctgaagattga

>gi568815596r:98694535_98946800|GENSCAN_predicted_peptide_2|66_aa
MAGGVGAARRGGAGRALQRRGRSRRGGRGGPQRRRQRRRRLQQLRRQPREHGDASGGKGK
GRGISV

>gi568815596r:98694535_98946800|GENSCAN_predicted_CDS_2|201_bp
atggctggcggggtcggggcggcccggcggggcggggcggggcgggcgctccagcggcgg
gggcggagccggcggggcgggcggggcggcccgcagcggcggcggcagcggcggcggcgg
ctacagcagctccgccggcagccgcgggagcacggcgacgccagcggcgggaagggaaaa
ggccgaggcatcagcgtgtga

>gi568815596r:98694535_98946800|GENSCAN_predicted_peptide_3|51_aa
MNVHGVFEEKRSRGAVGKVQIRHLFFKVQIKSVLSHKVLLHSVNEQVPGTY

>gi568815596r:98694535_98946800|GENSCAN_predicted_CDS_3|156_bp
atgaatgttcatggtgtttttgaggagaaacgttcaagaggtgctgttggtaaggtccaa
atccgacatctgttcttcaaggttcagatcaagtctgttctctcccacaaagtcctcctc
cattctgtcaatgaacaggtgcctgggacatattga

>gi568815596r:98694535_98946800|GENSCAN_predicted_peptide_4|79_aa
MCQKKTLCCELSEACELAPGNASPTVFGVGECILIVKQCMTAYEEIYYTVLVHGSLGSPH
WRKHLILTDEMTLLTTLDP

>gi568815596r:98694535_98946800|GENSCAN_predicted_CDS_4|240_bp
atgtgtcagaagaagacattatgctgtgagctctctgaagcttgtgagctggccccaggg
aatgcaagtccaactgtctttggtgttggagaatgtatcctcattgttaaacaatgcatg
actgcatatgaagagatttattatacggtattggttcacgggagcctgggcagcccgcac
tggcggaagcacctgatccttacagatgagatgacccttttgacaactctagatccctag

>gi568815596r:98694535_98946800|GENSCAN_predicted_peptide_5|1147_aa
MDIKLREAAEGLGEDSTGKKKSKFKTFKKFFGKKKRKESPSSTGSSTWKQSQTRNEVIAI
ESGPVGYDSEDELEESRGTLGSRALSHDSIFIPESGQDATRPVRVFSQENVCDRIKALQL
KIQCNVKMGPPPPPGGLPAKRGEDAGMSSEDDGLPRSPPEMSLLHDVGPGTTIKANKLTL
LLENRRDLHMNKKRLFLPYRSPSHQYDHPYGSLSGRSEFHPNLCLQGSPLRPHSVVELAK
FGGRDLRVLPSFFHMTASIPLSQVSVVSPDHVSDSTVSARISDNSLAPVADFSYPAESSS
CLDNSAAKHKLQVKPRNQRSSKMRRLSSRAQSESLSDLTCTPEEEENEEKPLLEVSPEER
PSSGQQDVAPDRGPEPGPPAPLPPPGGARARRARLQHSSALTASVEEGGVPGEDPSSRPA
TPELAEPESAPTLRVEPPSPPEGPPNPGPDGGKQDGEAPPAGPCAPATDKAEEVVCAPED
VASPFPTAIPEGDTTPPETDPAATSEAPSARDGPERSVPKEAEPTPPVLPDEEKGPPGPA
PEPEREAETEPERGAGTEPERIGTEPSTAPAPSPPAPKSCLKHRPAAASEGPAASPPLAA
AESPPVEPGPGSLDAEAAAPERPKAERAEAPPAGAERAAPERKAERGGAELRGAKKFSVS
SCRARPRPGVSRPLERASGRLPLARSGPVWRSEAALDDLQGLPEPQHAKPGPRKLAERGP
QDSGDRAASPAGPRKSPQEAAAAPGTREPCPAAQEPAPSEDRNPFPVKLRSTSLSLKYRD
GASQEVKGVKRYSAEVRLERSLTVLPKEEKCPLGTAPALRGTRAPSDQGKGKARPPEPLS
SKPPLPRKPLLQSFTLPHQPAPPDAGPGEREPRKEPRTAEKRPLRRGAEPHMGSQPRGQG
DNEKRSQPESQWWLGIPQCQFQVSARPRQDKTFVSCIRDGQARCNTQRVHGGESAAVDSG
TAVHTEKSLPPAATGPGADGQPAPPWITVTRQKRRGTLDQPPNQEDKPGARTLKSEPGKQ
AKVPERGQVLRANTSVLWVQEGAASWDWGEDPEPVKQADFVRSKSFLITPVKPAVDRKQG
AKLNFKEGLQRGISLSHQNLAAQSAVMMEKELHQLKRASYASTDQPSWMELARKKSQAWS
DMPQIIK

>gi568815596r:98694535_98946800|GENSCAN_predicted_CDS_5|3444_bp
atggacattaagcttcgggaggctgcagaaggccttggggaggacagcacaggaaagaaa
aaatctaaattcaaaacttttaagaagttttttgggaagaagaagagaaaagaatcgccg
tcgtccacaggaagtagcacctggaaacaaagtcagacgaggaatgaggtcattgccatc
gaatccgggccagtgggctacgactccgaggatgagctggaggagtcgaggggcacgctg
ggcagccgggccctttctcacgacagtattttcattcctgagtccggacaggacgctact
cggcctgtgcgggtgttttcccaagaaaatgtgtgtgatcggattaaagctctgcagtta
aaaatacagtgtaatgtgaaaatggggccaccacctcctccaggggggcttcctgccaag
cggggagaggatgccggcatgagctctgaggacgacgggctgcccaggagccccccagag
atgtctctgctgcacgacgtgggtcctggtaccaccataaaggcaaataaacttacactt
ttattggaaaataggagagaccttcacatgaacaagaagaggttgttcctcccgtatcga
agcccatctcaccagtacgatcatccttatggttccctgtccgggcgctccgagtttcat
cccaacttgtgtctgcaggggagcccacttcgacctcactcagtggtggagctggccaag
ttcgggggcagggatctgcgggtattgccttcattctttcacatgactgccagcatccct
ttgtcccaggtctctgtcgtgtctccagaccacgtgagcgacagcaccgtctctgcccgg
atctcagacaacagcctggcaccagtggctgacttcagttatcctgcagaatcctcctcc
tgcctggacaactctgcagctaagcacaagctccaggtcaagccccgcaaccagcggtcg
agtaagatgaggcggctctcatcgcgcgctcagtctgaatccctgagcgacctgacgtgc
accccagaggaggaggaaaacgaggagaagccacttttggaagtcagcccagaagagcgc
cccagctctgggcagcaggacgtggcgccagacagaggccctgagcctgggccaccggcg
cccttgccgccacccggaggggcccgtgccagacgcgcccgcctgcagcactcctccgcg
ctcacggccagcgtggaggaggggggcgtccccggggaggacccctcaagccgcccggcc
accccggagctcgccgagcccgagtcggcccccactctccgcgtggagcccccgtccccg
ccggagggccccccgaatcccggtcccgacggcggaaagcaggatggggaggcgcccccc
gcaggcccgtgtgccccggccacggacaaggcggaggaggtggtctgtgctcccgaagac
gtcgcgagcccgtttcccaccgccatccctgagggggacacgactccccccgagactgac
cccgccgccacctcagaggcgccctctgctcgcgacgggccagaacgcagtgtcccgaag
gaagcggagccgacgccgcccgtgctcccggatgaggagaaggggcccccagggccggcg
cctgagcccgagagagaagcggagacggagcccgagagaggagcggggaccgagcccgag
agaattgggaccgagccctccacggcgcccgccccgagcccgccggcgcccaagagctgc
ctgaaacaccggcccgcggccgccagcgagggccccgccgcgtccccgccgcttgccgct
gcggagtctcccccggtggagcccggccccggttccctcgacgcagaggccgccgccccg
gagcgccccaaggccgagcgagccgaggcgccaccggcgggcgccgagagggcggcgcca
gagcggaaggcggagaggggcggtgccgagctgcgaggcgcgaagaagttctcggtgtcc
tcgtgccgagcgcggcctcgtccgggcgtctcccgcccgctggaacgggccagcggccgc
ctgcccctcgcgaggagcggcccggtctggaggagcgaggcggctcttgacgacctccag
ggtctccccgagccccagcacgcgaaacctggccctcggaagctggcggagcgcggccct
caggactcgggggacagggcggccagcccggccgggccgcgcaagagccctcaggaggcg
gccgccgcgcccggcacgagagagccctgcccagccgcccaggagccggccccgagtgag
gacagaaaccccttccccgtcaagctccggtccacctccctctcgctcaaatacagggat
ggcgcctctcaggaggtgaagggtgtgaagaggtacagtgccgaggtccggttagaaagg
tcgctgaccgtgctcccgaaggaggagaagtgtcccctcgggacggcccccgcccttcga
ggcaccagggcccccagcgaccaaggaaaggggaaggcccggccccccgagccgctcagc
tccaagccgcccctgccccggaagccgcttctgcagagcttcacgctcccgcaccagccc
gcgccccccgacgccggcccgggagagcgggaacccaggaaggagcccaggacggcggag
aaaaggccgctgcgcaggggagctgagccccacatggggagccagccaagaggacaaggg
gacaatgagaagcggtcacaaccagagtcccagtggtggctgggaatcccccagtgtcag
tttcaagtgtctgccaggcccaggcaggacaaaacctttgtgtcctgcatccgggatggc
caggctcgctgtaacacacagcgtgtgcatggaggagagagcgccgccgtggatagcggg
actgcggtgcacactgaaaagagtctgccgcctgcagcaacagggcctggagctgatggg
cagcctgcgccaccctggatcaccgtcactcggcagaagcggagggggaccttggaccag
ccacccaaccaggaagacaagcctggggcacggaccctgaagtctgaaccaggaaagcaa
gccaaggtgcccgagagaggccaggtgcttagagcaaatacttctgttctgtgggtgcag
gagggagcagccagctgggactggggcgaggaccctgagcctgtgaagcaagctgacttt
gttcgcagcaagtctttcctgataacccctgtgaagcccgctgtggaccggaagcagggg
gcaaagctcaacttcaaggaggggctgcaaagaggaatctcattgtcccatcagaacttg
gcagctcagtctgcagtgatgatggagaaggaactgcatcagctgaagagagccagttat
gccagtacagatcagccatcctggatggaacttgccagaaagaaatctcaagcttggagt
gacatgccccagattataaaatag

>gi568815596r:98694535_98946800|GENSCAN_predicted_peptide_6|157_aa
MTEAVSGMSDPEIKACASASLPGDADDAARGPSFDEYDVYKMMYAAPEKQEDGLGPSREC
SQILCLLLQTLEQVTTESASAQKEAPPAPVHSYAKPPASPSCDTSSTGVCRYLDNRRDRR
DTSQHQATGDIQRGIPELMAMNELKALSLSVGRRLRF

>gi568815596r:98694535_98946800|GENSCAN_predicted_CDS_6|474_bp
atgactgaggcagtcagtggaatgtccgaccctgaaatcaaggcctgtgcctccgcaagt
ctcccaggtgatgctgacgatgctgcccgtggaccttcctttgatgagtatgacgtgtac
aaaatgatgtatgcagcacctgaaaaacaggaggatggacttgggccttccagagagtgc
agtcagatcctgtgcctgcttcttcaaaccttggaacaagtgaccactgagtcagcttca
gctcaaaaggaagctcctcctgcacccgtgcacagctacgccaagccaccggccagcccc
tcctgtgacaccagcagtaccggtgtttgcaggtatctagacaacagaagagatcggcgg
gacaccagccagcatcaagccacaggagatattcagagaggaatccctgagctgatggct
atgaatgagctgaaggctctgagtctcagcgtgggaaggcgtctaaggttctaa