GENSCAN 1.0    Date run: 4-Nov-116    Time: 03:22:48

Sequence gi568815587r:33759359_33969386 : 210028 bp : 43.01% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 PlyA -   2076   2071    6                               1.05
 1.04 Term -   7531   7479   53  2  2  108   41    15 0.173  -3.61
 1.03 Intr -   9609   9493  117  1  0   49   79    70 0.223   2.64
 1.02 Intr -  14820  14702  119  2  2   78   49    73 0.406   2.51
 1.01 Init -  15139  15036  104  1  2  110   34   161 0.587  12.47
 1.00 Prom -  21274  21235   40                              -3.76

 2.02 PlyA -  24052  24047    6                               1.05
 2.01 Sngl -  28437  28096  342  0  0   36   42   168 0.666   3.03
 2.00 Prom -  31264  31225   40                              -2.76

 3.00 Prom +  45377  45416   40                               0.14
 3.01 Init +  60311  60383   73  1  1   92   43   140 0.200  11.13
 3.02 Term +  69404  70062  659  0  2   26   42   193 0.029   2.62
 3.03 PlyA +  72421  72426    6                               1.05

 4.04 PlyA -  73005  73000    6                               1.05
 4.03 Term -  73337  73066  272  0  2   76   48   225 0.492  12.95
 4.02 Intr -  73798  73510  289  1  1   35  -28   243 0.555   4.22
 4.01 Init -  74009  73863  147  2  0   34   55    93 0.417   0.69
 4.00 Prom -  80855  80816   40                              -2.86

 5.10 PlyA -  82828  82823    6                               1.05
 5.09 Term - 100217  99998  220  1  1  100   47   238 0.999  17.41
 5.08 Intr - 105459 105244  216  2  0   89   80   402 0.875  37.12
 5.07 Intr - 109571 109479   93  1  0   38   76   112 0.004   4.08
 5.06 Intr - 110263 109988  276  0  0   78   89   196 0.005  15.33
 5.05 Intr - 111201 110987  215  0  2   46   34   100 0.001  -2.09
 5.04 Intr - 130100 129977  124  1  1   99   77    66 0.085   7.19
 5.03 Intr - 135339 135312   28  1  1  120   44    13 0.012  -2.73
 5.02 Intr - 143299 143210   90  2  0   86   68    64 0.075   4.17
 5.01 Init - 145730 145646   85  2  1   78   66    25 0.070   0.28
 5.00 Prom - 159621 159582   40                              -3.36

 6.05 PlyA - 164800 164795    6                               1.05
 6.04 Term - 182205 182068  138  0  0  120   52    94 0.668   6.96
 6.03 Intr - 189515 189466   50  0  2  103   84   -39 0.026  -4.40
 6.02 Intr - 195266 195201   66  0  0   78   82    77 0.627   4.98
 6.01 Init - 201860 201809   52  2  1   57   59    68 0.256   2.12

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  43240  43124  117  1  0   66   48   104 0.873   2.74
S.002 Intr - 105571 105495   77  1  2  109   81   -11 0.959  -1.39
S.003 Init - 105827 105756   72  2  0   67   72    93 0.955   6.67
S.004 Term + 109427 110350  924  1  0   59   37   443 0.940  28.48

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:33759359_33969386|GENSCAN_predicted_peptide_1|130_aa
MAAMETETAPLTLESLPTDPLLLILSFLDYRDLIKYEKRRDISLPGLGEAHLQEKEVVAL
VPHRRAPRTPAGPADTYSDVGRYIDHYAAIKKAWDDLKKYLEPRCPRMVLSLKVPVLWGF
AMSWNRIKAL
>gi568815587r:33759359_33969386|GENSCAN_predicted_CDS_1|393_bp
atggcggccatggagaccgagacggcgccgctgaccctagagtcgctgcccaccgatccc
ctgctcctcatcttatcctttttggactatcgggatctaatcaagtatgagaagcgacga
gatatctctctcccaggcctcggtgaggcccacttgcaagagaaggaggtagtggccctc
gtgccgcaccgccgcgctccgcggacccccgctggccccgcagatacttactctgatgta
ggaagatacattgaccattatgctgctattaaaaaggcctgggatgatctcaagaaatat
ttggagcccaggtgtcctcggatggttttatctctgaaagtgccagttctctggggcttt
gccatgagttggaataggataaaggctctttga
>gi568815587r:33759359_33969386|GENSCAN_predicted_peptide_2|113_aa
MTCVEQEKLGQAFEDAFEVLRQHSTGDLQYSPDYKNYLALINHRPHVKGNSSCYGVLPTE
EPVYNWRMVINSAADFYFEGNIHQSLQNITENQLVQPTVLQQKGKRQEEALTV
>gi568815587r:33759359_33969386|GENSCAN_predicted_CDS_2|342_bp
atgacatgtgttgaacaagagaagctgggtcaagcatttgaggatgcttttgaggttctg
aggcaacattcaactggagatcttcagtactcgccagattacaaaaattacctggcttta
atcaaccatcgtcctcatgtcaaaggaaattccagctgctatggagtgttgcctacagag
gagcctgtctataattggagaatggtaattaacagtgctgcggacttctattttgaagga
aatattcatcaatctctgcagaacataactgaaaaccagctggtacaacccactgttctc
cagcaaaaggggaaaaggcaggaagaagctctgactgtttga
>gi568815587r:33759359_33969386|GENSCAN_predicted_peptide_3|243_aa
MTLSLVYTIVLTIAKTPKPFSGLFHSVKSMISLRAENKPSLRCIWEGASLAVLRGWLDLV
LLSELEDPIPEQPLDETRPWDPARSPGGASKASLEPRSGAEENFGPGLGSPYPGHHLISP
PLSPGPHKGIKPGPSSLPPLRPPVPPPFLAFARDEVAGDVPKAGRRCLPLGAPARLAVTL
QPRAAARDSPHRASKEPGFESCPSLSARQAVPEPGPSASQGGGRGRPPHLDPRTPGRNPK
PQR
>gi568815587r:33759359_33969386|GENSCAN_predicted_CDS_3|732_bp
atgaccctctcgctggtgtacaccatcgtcctcaccattgccaagacccccaagcccttc
tctggattatttcatagcgtcaaatccatgatcagtctccgtgcagagaacaagccctct
ctgcgttgcatttgggaaggggcctcccttgctgttctgcggggctggctggatcttgtt
ctcctctccgagttggaggaccccattcctgagcagccgctggacgaaacccggccctgg
gatcccgcgaggtccccgggaggcgcctcaaaagccagcctggagccccgcagcggggcg
gaggagaattttgggcccgggcttgggtccccgtaccccgggcatcacctaatatcacca
cccctctccccgggtccccacaaaggcatcaagccaggcccctcgtcactgcctccactg
cgccctcctgtccctcctcctttcctcgccttcgcgcgggacgaagtggccggcgatgtc
ccgaaagcggggcgtcggtgcctccccctcggcgccccagctcggctcgcggtgaccctg
cagcccagggccgcagccagggactccccacaccgagcctccaaagagccaggcttcgaa
agctgcccgtccctgtcagcgcggcaggcggtcccggagccaggtcccagcgcctcccaa
ggcggggggagggggcggcccccacacctggacccgcggacacctgggcggaaccccaag
ccgcagcgatag
>gi568815587r:33759359_33969386|GENSCAN_predicted_peptide_4|235_aa
MFAYCCSVKIHTSTFELLESIFCILLVVEAFSLQKVVEMLEEVVVSRRECWLCDIRLGIV
VERNWALSVDQCRLQALQFLVQLIDLPSILLRCNGFSGIQKAIVDQTSSSPPNSDHDLFF
GANLALGSALELLLGPVTEVVIASCLPNNSRMVDAEFFGNFSCSCKRMSFSDGSQLVIVS
FRRLATTFLTFKALVSFAKLLAPPPHCMFVSSSWAKRTADVASCLRCLTTHFELK
>gi568815587r:33759359_33969386|GENSCAN_predicted_CDS_4|708_bp
atgtttgcttattgctgtagtgtaaaaatccatacttcaacatttgaactcttggaaagc
attttctgcatcctgctggttgtggaagcgttttccttgcaaaaagttgtcgagatgctt
gaagaagtggtagtcagtcgacgagagtgttggttgtgtgacatacggctgggcattgtc
gtggagaggaattgggctctttctgttgaccagtgccggctgcaggcattgcagtttttg
gtgcagctcatcgatttgccgagcatacttctcagatgtaatggtttctctgggattcag
aaagctatagtggatcagaccagcagcagcccaccaaacagtgaccatgacctttttttt
ggcgcaaatttggctttgggaagtgctttggagcttcttctgggtccagtcactgaggtg
gtcatcgccagttgtctgccgaacaacagtaggatggtcgacgctgagtttttcggcaac
ttctcatgtagttgtaagaggatgagcttcagtgatggctctcaattagtcattgtcagc
ttccgacggctggccactacattcctcaccttcaaggctcttgtctcctttgcaaaactt
cttgcaccaccaccgcactgtatgtttgttagcagttcctgggccaaacgcactgctgat
gttgcgagctgtctccgctgccttacaacccattttgaactcaaataa
>gi568815587r:33759359_33969386|GENSCAN_predicted_peptide_5|448_aa
MGTATEKYCSLNVHPWGQGKSGYGKKCAGGDIKAGYKASGKTWFSVSLVIALPTHPCPDV
AQELAELCVTALSGTTELLSDSVERMLLPGPYLAWRVWLRAHLCSGLAKPVQSRLAHHHR
LCRARPQPGARSPEPAAWGPQPVARSPQSRAVRGIPVHGVRGPGTSNLGRAENPLGTLAF
LPVIRSLSLAFEGSAVTVLERGGASSPAERRSKRRRRSGGDGGGGGGARAPEGVRAPAAG
QPRATKGAPPPPGTPPPSPMSSAIERKSLDPSEGTFQVAASGVPNGRLPGPQVEPLPALR
ITGGEPVDEVLQIPPSLLTCGGCQQNIGDRYFLKAIDQYWHEDCLSCDLCGCRLGEVGRR
LYYKLGRKLCRRDYLRLFGQDGLCASCDKRIRAYEMTMRVKDKVYHLECFKCAACQKHFC
VGDRYLLINSDIVCEQDIYEWTKINGMI
>gi568815587r:33759359_33969386|GENSCAN_predicted_CDS_5|1347_bp
atgggaaccgcgactgagaagtattgctctctaaatgttcacccctggggtcaaggcaaa
tctggctatggaaaaaaatgtgcaggtggagatattaaagcaggatataaggcatcagga
aaaacctggttttctgtatccctggtcattgctctgccaactcacccctgcccagatgtg
gctcaagaacttgcagagttatgtgtgactgctttgtctggcacaacagaactgctcagc
gattctgttgaacggatgctgctgcctggcccgtatctggcctggagagtctggctgcga
gctcatctgtgctctggtctggctaagcccgtgcagtcccggctggcgcaccaccatcgc
ctctgcagagcgcggccgcagcccggagcccgcagcccggagcccgcagcctggggcccg
cagcccgtagcccgcagcccgcagagccgcgctgtccggggcatcccagtgcacggcgtc
cgtggacctgggacctcgaacctgggacgggcggaaaatccactgggaacacttgcattt
ctcccggtgattcgctctctctctttggcgtttgaagggagcgcggtgactgtccttgag
cgcggaggggcgagctcgccggcggagcgccggagcaagcggaggcgcaggagcggcggc
gacggcggcggcggcggcggcgcccgagcacccgagggggtccgagccccggcagccggc
cagccccgcgccacaaagggagcgcccccgccgcccggcaccccgcctccctccccaatg
tcctcggccatcgaaaggaagagcctggacccttcagagggaacattccaagtggcggcc
tccggggtccccaacggccgccttcccggtccgcaggtggagccgctgccagccttgcgg
ataacgggcggggaaccagtggatgaggtgctgcagatccccccatccctgctgacatgc
ggcggctgccagcagaacattggggaccgctacttcctgaaggccatcgaccagtactgg
cacgaggactgcctgagctgcgacctctgtggctgccggctgggtgaggtggggcggcgc
ctctactacaaactgggccggaagctctgccggagagactatctcaggctttttgggcaa
gacggtctctgcgcatcctgtgacaagcggattcgtgcctatgagatgacaatgcgggtg
aaagacaaagtgtatcacctggaatgtttcaaatgcgccgcctgtcagaagcatttctgt
gtaggtgacagatacctcctcatcaactctgacatagtgtgcgaacaggacatctacgag
tggactaagatcaatgggatgatatag
>gi568815587r:33759359_33969386|GENSCAN_predicted_peptide_6|101_aa
MIKMIDNDDSNVDGEGQAVAIFASLGSTDLEEQQICTSECDLQSTCHSATSHSLVSVMMT
SNLVNDFSCKCVPNSYDSMCFGAGGTRLYVYWLCDFEKSIL
>gi568815587r:33759359_33969386|GENSCAN_predicted_CDS_6|306_bp
atgataaaaatgatcgataatgatgatagcaatgttgatggtgaaggtcaagcggtggcc
atttttgcttcgctgggctccacggatctggaggagcaacagatctgcacatctgaatgt
gatctccaatccacctgccactctgctacttcccacagtctagttagtgtcatgatgaca
agcaacctggttaatgatttcagttgtaaatgtgtccctaactcctacgacagcatgtgc
tttggagctggaggaaccaggctctatgtttactggctgtgtgactttgagaagtccatt
ctttga