GENSCAN 1.0 Date run: 4-Nov-116 Time: 18:28:14 Sequence gi568815596r:236065917_236267971 : 202055 bp : 48.11% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 16153 16159 7 0 1 80 116 0 0.063 3.09 1.02 Intr + 20572 20625 54 2 0 109 92 -3 0.004 1.15 1.03 Intr + 35657 35906 250 0 1 61 21 205 0.138 7.60 1.04 Intr + 43673 43803 131 2 2 -6 31 162 0.056 1.44 1.05 Intr + 47775 47949 175 1 1 12 88 122 0.380 3.60 1.06 Intr + 54276 54531 256 0 1 110 69 541 0.894 51.85 1.07 Term + 58003 58206 204 0 0 96 55 407 0.992 35.57 1.08 PlyA + 59075 59080 6 1.05 2.00 Prom + 62986 63025 40 -6.16 2.01 Init + 67528 67611 84 0 0 97 100 14 0.720 4.11 2.02 Intr + 68152 68309 158 0 2 28 45 119 0.593 0.41 2.03 Intr + 70728 70933 206 0 2 78 72 95 0.415 5.74 2.04 Intr + 76876 77004 129 2 0 59 58 94 0.265 4.17 2.05 Term + 78533 78618 86 0 2 63 42 93 0.433 0.02 2.06 PlyA + 78995 79000 6 1.05 3.05 PlyA - 81342 81337 6 1.05 3.04 Term - 91164 90967 198 0 0 79 42 100 0.272 1.90 3.03 Intr - 93806 93775 32 0 2 90 90 46 0.781 2.85 3.02 Intr - 94280 94191 90 0 0 98 0 80 0.383 0.17 3.01 Init - 98970 98349 622 0 1 31 105 219 0.593 13.41 3.00 Prom - 99775 99736 40 -6.46 4.03 PlyA - 99910 99905 6 -3.64 4.02 Term - 100521 99998 524 1 2 94 55 808 0.934 72.54 4.01 Init - 102055 101533 523 1 1 37 109 962 0.991 86.22 4.00 Prom - 115093 115054 40 -4.86 5.11 PlyA - 116051 116046 6 1.05 5.10 Term - 129141 128956 186 0 0 102 47 164 0.900 11.19 5.09 Intr - 129492 129282 211 2 1 59 74 86 0.174 3.22 5.08 Intr - 144708 144582 127 1 1 72 76 57 0.131 2.64 5.07 Intr - 145775 145571 205 2 1 102 3 162 0.246 7.87 5.06 Intr - 148950 148446 505 2 1 91 116 788 0.999 74.88 5.05 Intr - 172040 171773 268 0 1 146 90 449 0.864 47.49 5.04 Intr - 175631 175364 268 2 1 52 80 293 0.481 22.01 5.03 Intr - 191860 191779 82 0 1 64 85 65 0.115 3.44 5.02 Intr - 196484 196433 52 0 1 77 95 51 0.608 2.67 5.01 Init - 198429 198225 205 0 1 68 59 229 0.964 17.01 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 149682 149549 134 0 2 68 96 79 0.959 6.35 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:236065917_236267971|GENSCAN_predicted_peptide_1|358_aa MKETKRNTAHLKSLDSNLIAGRRCTYSEGLHPLYGIRSGGKGPFLFPYLFMMRYVPKPIT SAAVMVNGIHCDGGCTSRAAVGFMPRYAESRTASPRWLGLPLTGHGNVLVGSSVGVLDRQ PWERPSRRQRRCSGPAAVETFWIRASQGQVRQSPSGSNARVSVSELADPEKSRGFTAPAT YPVIPMSCAQCLAVPVDGFLSWPREREEKERWIRAKYEQKLFLAPLPCTELSLGQHLLRA TADEDLRTAILLLAHGSRDEVNETCGEGDGRTALHLACRKGNVVLAQLLIWYGVDVTARD AHGNTALAYARQASSQECIDVLLQYGCPDERFVLMATPNLSRRNNNRNNSSGRVPTII >gi568815596r:236065917_236267971|GENSCAN_predicted_CDS_1|1077_bp atgaaagaaactaaaagaaatactgcccatttaaagtctttggacagcaatttgattgca gggcgtcggtgcacgtacagcgagggcttacatccactttacgggatccggagcggagga aaagggccatttctcttcccttatctgtttatgatgcgatatgttccaaagccgatcaca tcagccgctgttatggtgaacggaattcactgtgatggcggctgcaccagcagagccgcc gtgggcttcatgccacgttacgcggagtctaggacggcctcaccccgctggctcgggctc cctctcactggccatgggaatgtcctagtcggcagcagtgtcggtgttctagaccggcag ccgtgggaacgtcctagtcggcggcagcgtcggtgctctggaccggcagccgtggaaacg ttctggatccgagcatcccagggtcaggtgcggcaaagcccgtcaggcagcaatgctaga gtgtctgtctccgagctggcagaccccgaaaaatccagagggttcacggctcctgcaacc tacccagtgattcctatgtcctgtgctcagtgcctggctgtccctgtggatggcttcctg tcatggcccagagagagggaagagaaggaacggtggatccgtgccaagtacgagcagaag ctcttcctggccccgctgccctgcacggagctgtccctgggccagcacctgctgcgggcc accgccgacgaggacctgcggacggccatcctgctgctggcacacggctcccgggacgag gtgaacgagacctgcggggagggagacggccgcacggcgctgcatctggcctgccgcaag gggaatgtggtcctggcgcagctcctgatctggtacggagtggacgtcacggcccgagat gcccacgggaacacagctctggcctacgcccggcaggcctccagccaggagtgcatcgac gtgctgctgcagtacggctgccccgacgagcgcttcgtgctcatggccacccctaacctg tccaggagaaacaataaccggaacaacagcagtgggagggtgcccaccatcatctga >gi568815596r:236065917_236267971|GENSCAN_predicted_peptide_2|220_aa MAPRCLLPLGTPERGISSICIPVRRREKCWQGQGEANPNTLLVEAVPLQSHMELCTKTLK SFPHDPGRQNLDHTFTTIEAYLGSHRAPHPTLATPGPARGDSEDDSAAVQWTKAWQSTPA ASGEELRSALLSPARSEKLLSRNLSPLESTLATQSCQVESSWTDHPCGLPSGCEEELTIA AAAAGFLCLQCPERAIPSNAMIHSTAMIHSKAMIHSKAMI >gi568815596r:236065917_236267971|GENSCAN_predicted_CDS_2|663_bp atggctcccaggtgcttgttacctctgggcacccccgagaggggcatctccagcatctgc attccagtgagaagaagagagaagtgctggcaaggccaaggtgaggcgaacccaaacacg ctgctggtggaagcagtgcccctgcaaagccatatggaattgtgcacaaagactctgaaa agcttccctcatgacccaggaaggcagaacctggaccacaccttcaccacgatagaggct tacctgggctcgcacagggctccccaccccaccctcgctacacccggacctgcccgtggt gacagcgaagatgacagtgcggctgtacaatggacaaaagcctggcagtccacgccagcc gcgagtggggaggagttacgatcagccctactgtctccagcacgttccgagaagctactc agtcgcaatttatcaccactagagtccaccctggccacgcagagttgccaggtggaaagc agctggacagaccatccgtgtggacttccgagtggctgcgaggaggaactcacaattgct gctgctgctgctggctttctctgcctgcagtgtcctgagcgtgccatcccctctaacgcc atgatccactctacagccatgatccactctaaagccatgatccactctaaagccatgatc taa >gi568815596r:236065917_236267971|GENSCAN_predicted_peptide_3|313_aa MLQIDYSQGGNVFPLQISPARKEAVLACRQSGGSINQAPGVIWRQEQKPSEPSRLPGSPQ SEAEAGWSAPGPGTSPSGRGRGGGTRAPPHPKDAPVVRTTKGENGRGYAGELGVGLQGTG SCAPPHLGGACLVPKPTPVPTPALRSEPCPLAAGPRSAPPDTAADRPRGVQRASECERGA TKPAARSAREAGASPASAFRVGEGGEAAHPLRAAVGYTLIPSLEQAPKDASVRPRAAPTG SLAAFVMPACSEGEATGGELGGLWQCPWSRVGGSHTEDRGLSSGYPHTTYRQGFTTGTVG TGTGQFFVVWDCP >gi568815596r:236065917_236267971|GENSCAN_predicted_CDS_3|942_bp atgctccaaatcgactacagccagggaggaaacgtctttcctttacaaatatctccggcc cgaaaagaagcggtgctcgcctgccggcagagtggagggtccataaatcaggctccgggc gttatctggcgacaggagcaaaaacccagcgagcccagccggcttcccggcagccctcag tcggaggcggaggctggctggagcgcccccggccccggcacctccccctccgggcggggt cgagggggcgggacccgggcaccgccccaccccaaggacgccccggtagtccgcaccaca aagggggaaaacgggagaggctacgcaggggagctgggagtcggtctgcaagggacgggg agctgcgcgcctccccatctcggtggggcgtgcctggtccccaagcccactcctgtccca accccggccctgcgctcagagccctgtccgctcgctgccggaccccgaagcgcgccgcca gatactgcggcggacaggcccaggggcgttcagcgggccagcgagtgtgagcgtggtgcc accaagccagcagcccgaagcgcgagagaagccggggcttcgccagcctcagcctttcga gtgggggaaggaggggaggccgcccacccgcttcgtgccgcggtcgggtacaccctcatc ccctctctggaacaggcaccgaaggacgccagtgtgcggcccagagctgcacccactggg tccctggcggcctttgtcatgccggcctgcagcgaaggagaggccaccgggggcgaactg ggtggactgtggcagtgcccttggtcccgagtgggtggcagccatacagaggatcggggg ctgtcctctggttacccccacaccacctacaggcagggttttaccaccgggactgtgggc actggcacaggacaattctttgtggtgtgggactgcccatga >gi568815596r:236065917_236267971|GENSCAN_predicted_peptide_4|348_aa MSAAFPPSLMMMQRPLGSSTAFSIDSLIGSPPQPSPGHFVYTGYPMFMPYRPVVLPPPPP PPPALPQAALQPALPPAHPHHQIPSLPTGFCSSLAQGMALTSTLMATLPGGFSASPQHQE AAAARKFAPQPLPGGGNFDKAEALQADAEDGKGFLAKEGSLLAFSAAETVQASLVGAVRG QGKDESKVEDDPKGKEESFSLESDVDYSSDDNLTGQAAHKEEDPGHALEETPPSSGAAGS TTSTGKNRRRRTAFTSEQLLELEKEFHCKKYLSLTERSQIAHALKLSEVQVKIWFQNRRA KWKRVKAGNANSKTGEPSRNPKIVVPIPVHVSRFAIRSQHQQLEQARP >gi568815596r:236065917_236267971|GENSCAN_predicted_CDS_4|1047_bp atgagcgcagcgttcccgccgtcgctgatgatgatgcagcgcccgctggggagtagcacc gccttcagcatagactcgctgatcggcagcccgccgcagcccagccccggccatttcgtc tacaccggctaccccatgttcatgccctaccggccggtagtgctgccgccgccgccgccg ccgccgcccgcgctgccccaggccgcgctgcagccagcgctgccgcccgcacaccctcac caccagatccccagcctgcccacaggcttctgctccagcctggcgcagggcatggcgctc acctctacgctcatggccacgctccccggcggcttctccgcgtcgccccagcaccaggag gcggcagcggcccgcaagttcgcgccgcagccgctgcccggcggcggtaacttcgacaag gcggaggcgctgcaggctgacgcggaggacggcaaaggcttcctggccaaagagggctcg ctgctcgccttctccgcggccgagacggtgcaggcttcgctcgtcggggctgtccgaggg caagggaaagacgagtcaaaggtggaagacgacccgaagggcaaggaggagagcttctcg ctggagagcgatgtggactacagctcggatgacaatctgactggccaggcagctcacaag gaggaagacccgggccacgcgctggaggagaccccgccgagcagcggcgccgcgggcagc accacgtctacgggcaagaaccggcggcggcggactgccttcaccagcgagcagctgctg gagctagagaaggagttccactgcaaaaagtacctctccttgaccgagcgctcgcagatc gcccacgccctcaaactcagcgaggtgcaggtgaaaatctggttccagaaccgacgggcc aagtggaaacgggtgaaggcaggcaatgccaattccaagacaggggagccctcccggaac cctaagatcgtcgtccccatccctgtccacgtcagcaggttcgctatcagaagtcagcat cagcagctagaacaggcccggccctga >gi568815596r:236065917_236267971|GENSCAN_predicted_peptide_5|702_aa MSNSDYLPDYPLNSDLVKRLKSALDAKDEERVRDLICTEITPVDAVIELANDDWMKDPSA QLPTGMLLALSGTGLGMAIRCGCCHLCVLQTGTLLLTCVHLLVNSQLGAEQEQNSSEIDG ISMERQPGPVRGKLQIFQKTEKDPQARAGSPVQEYHTALVAGDLDHLKPLMDQFFQDANV VFEINKDEMEWQVKSPATFGLSGLWTLEYKRELTTPLCIAAAHGHTACVRHLLGRGADPD ASPGGRGALHEACLGGHTACVRLLLQHRADPDLLSAEGLAPLHLCRTAASLGCAQALLEH GASVQRVGGTGRDTPLHVAAQRGLDEHARLYLGRGAHVDARNGRGETALSAACGAARRPD EHGRCLRLCALLLRRGAEADARDEDERSPLHKACGHASHSLARLLLRHGADAGALDYGGA SPLGRVLQTASCALQASPQRTVQALLNHGSPTVWPDAFPKGRHLIMCDIKATDAAASHKD AGQAPHGRHGEPSVESPALSQGKPSVGSSRTVNRCKETPMERRITEAKGLLFPVVQRPAA ARENAPAGDCKEFRLSPGRGDAMITKVVFSGREELVMTWGDEQGHPGAVGLLCKEDGAKR NSGRPPGASEVTPVRIPSHSAGTVLWESTQAPGVEGGSHKMMHKPFYQSLFALALTPRCL QHLCRCALRRLFGKRCFDLIPLLPLPKPLQNYLLLEPQGVLH >gi568815596r:236065917_236267971|GENSCAN_predicted_CDS_5|2109_bp atgtccaactcggattaccttcccgactacccactcaactcagatttagtgaagagatta aagtctgccctggatgccaaagatgaggagagagtgagggatttaatctgcactgaaatc acgcctgtggacgctgtgatagaactggccaatgacgactggatgaaagacccctcggct cagctgcccaccggcatgctgctagccctatcaggtacagggttaggaatggccatccgc tgtggctgctgccacctctgcgttctacagaccggaacactgctgctcacctgtgttcac ttgctggtcaacagccagttaggagcagagcaggagcagaactcttctgaaatcgacggc atctcaatggagagacagccagggccagtgagaggaaaacttcaaatatttcaaaagaca gagaaggatcctcaagctagagcagggtccccggtgcaggagtaccacactgccctggtc gcaggggacctcgaccatctgaagcccctcatggaccagttcttccaggatgccaacgtg gtgtttgagatcaataaggatgagatggaatggcaggtgaaatctccagccacgtttgga ctatcaggcctctggaccctggagtacaagcgtgagctcaccacgcccctgtgcatcgcc gcggcccacggccacaccgcctgcgtgcgacacctgctcggccgcggcgcagacccagac gccagccccggcggccgcggcgccctgcacgaggcctgcctcgggggccacaccgcctgc gtccgcctgctgctgcagcaccgcgccgaccccgacctgctcagcgccgagggcctggcg cctctgcacctctgccgcacggccgcctcgctcgggtgcgcgcaggcgctgctggagcac ggggcctcggtgcagcgcgtgggcggcacgggccgggacacgccgctgcacgtggcggcg cagcgcggcctggacgagcacgcgcgcctgtacctgggccgcggggcgcacgtggacgcg aggaacggccgcggagagacggctctgagcgcggcctgcggtgcggcgcggaggcccgac gagcacgggcgctgcctgcgcctgtgcgcgctgctgctgcggcgcggggcggaggcggac gcgcgcgacgaggacgagcgcagcccgctgcacaaagcctgcggccacgcgagccacagc ctggcgcgcctcctactgcggcacggcgccgacgcgggcgcgctcgactatggcggggcc tcgccgctgggccgcgtgctccagaccgcatcctgcgctctccaggcctcaccgcagcgc acggtgcaggcgctgctcaaccacggctctcccaccgtgtggcccgacgccttccccaag ggacggcatctaatcatgtgcgacatcaaagcaacagacgctgcagccagccacaaagat gctgggcaggcaccacacgggcgccatggcgagccgagcgtagagtcaccagcactgtca cagggcaagccgtccgtgggcagcagccgtactgtgaatagatgtaaagagacacccatg gagcgcagaattaccgaggcaaaaggccttctgtttcctgtggtgcagaggccagcagca gcccgggaaaacgcacctgcaggtgactgtaaagagttcagactctcacctggcagagga gatgccatgatcacaaaggtggttttctcaggaagggaagagctggtcatgacttgggga gatgaacagggccaccccggagctgtagggctgctgtgcaaagaggatggagcaaaacgg aattcggggagaccacctggagcatctgaagtcaccccagtcagaatcccctctcactcg gcaggaactgtgctttgggagagcacacaagcccctggggtagagggtggttcccacaag atgatgcacaagccgttctaccagtccctctttgccttggccctcaccccacgctgcctg cagcatctttgccgctgtgctcttcgcagactgtttggcaaaaggtgctttgacctcatc cccctgttacccttgccaaagcccctgcagaattacctacttttggagccacagggtgtt ttgcactga