GENSCAN 1.0  Date run: 4-Nov-116  Time: 21:04:34

Sequence gi568815586r:89249257_89452039 : 202783 bp : 40.17% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    108    162   55  2  1   89   75    38 0.661   4.10
 1.02 Intr +   4636   4687   52  2  1  115   75    32 0.748   1.65
 1.03 Intr +   5913   6099  187  0  1   19   66   148 0.339   4.47
 1.04 Term +   9069   9218  150  2  0   14   55   143 0.588   0.53
 1.05 PlyA +   9222   9227    6                               1.05

 2.00 Prom +  30501  30540   40                              -4.55
 2.01 Init +  47437  47659  223  0  1   64   66   224 0.381  16.56
 2.02 Intr +  60297  60386   90  1  0   92   10   111 0.022   2.75
 2.03 Intr +  74039  74290  252  0  0   95   77   102 0.801   6.28
 2.04 Term +  74353  74462  110  2  2   96   43    58 0.693  -0.21
 2.05 PlyA +  74695  74700    6                               1.05

 3.08 PlyA -  74942  74937    6                               1.05
 3.07 Term -  76354  76268   87  1  0   92   49    84 0.004   1.58
 3.06 Intr -  90466  90389   78  1  0   53   92    62 0.011   1.93
 3.05 Intr -  95527  95450   78  1  0  141   90    15 0.033   5.83
 3.04 Intr - 100305 100022  284  1  2  100   43   255 0.081  18.41
 3.03 Intr - 101769 101332  438  1  0  109   66   358 0.998  28.26
 3.02 Intr - 102793 102384  410  0  2   81  107   740 0.578  68.19
 3.01 Init - 105116 104965  152  2  2   89   44   102 0.543   3.54
 3.00 Prom - 110660 110621   40                              -7.25

 4.00 Prom + 116676 116715   40                              -0.25
 4.01 Init + 119595 119628   34  2  1   45   94    45 0.163   0.88
 4.02 Intr + 120394 120452   59  2  2   97   84   -14 0.033  -3.22
 4.03 Intr + 123544 123630   87  0  0   69  101    55 0.075   4.15
 4.04 Intr + 138086 138203  118  1  1   49   92   105 0.020   6.12
 4.05 Term + 142911 143080  170  1  2   84   52   113 0.960   4.36
 4.06 PlyA + 143202 143207    6                               1.05

 5.00 Prom + 143624 143663   40                              -3.75
 5.01 Init + 149463 149469    7  2  1   64  116     0 0.429   1.54
 5.02 Intr + 151280 151339   60  0  0   95   84   108 0.392   8.89
 5.03 Intr + 160440 160617  178  1  1   13   72   130 0.014   1.96
 5.04 Intr + 160933 161142  210  1  0   35   58   122 0.637   1.01
 5.05 Term + 162457 162556  100  1  1  100   34   103 0.584   2.72
 5.06 PlyA + 162676 162681    6                               1.05

 6.04 PlyA - 164250 164245    6                               1.05
 6.03 Term - 172001 171897  105  2  0  103   42    98 0.825   4.13
 6.02 Intr - 176038 175905  134  2  2   55   80   113 0.812   6.64
 6.01 Init - 180826 180679  148  1  1  113   15    84 0.757   3.90
 6.00 Prom - 180914 180875   40                              -4.55

 7.00 Prom + 182284 182323   40                              -8.15
 7.01 Sngl + 193763 194710  948  1  0   55   38   414 0.869  29.41
 7.02 PlyA + 195099 195104    6                               1.05

 8.02 PlyA - 197059 197054    6                               1.05
 8.01 Term - 202683 202556  128  1  2   71   37   113 0.763   1.96

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  78598  78670   73  0  1   69   88    75 0.958   6.88
S.002 Term +  79298  79380   83  0  2  131   42    58 0.961   2.38
S.003 Term - 100305  99998  308  1  2  100   48   284 0.916  19.89
S.004 Init + 141850 141952  103  0  1   58   78   111 0.918   7.55
S.005 Init + 160454 160617  164  1  2   70   72   106 0.922   6.45

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:89249257_89452039|GENSCAN_predicted_peptide_1|147_aa
MDIKMGTIDPGDYYREEGASTHQSSTETPGLFLKQRNTPDWVIDKEKGLIGLWFGRLHQK
LSGICFGEGLRKLTAMAEGEEAAGASHGKRGSKRESEEKRRLGHRYTQREDSMKTQEENG
HLQGRKEVSAEINPADTLILDFQPPEL
>gi568815586r:89249257_89452039|GENSCAN_predicted_CDS_1|444_bp
atggacataaagatgggaacaatagatcctggggactactacagggaggagggagcatca
acccatcaatcttccacagagacaccagggcttttcctgaaacagaggaataccccagac
tgggtaattgataaagaaaaaggtttaattggcttatggttcggcaggctgcaccaaaag
ctgagtggtatctgctttggtgaaggcctcaggaagcttacagccatggcagaaggtgaa
gaagcagcaggtgcatcacatggcaagagagggagtaagagagagagtgaggagaagagg
agattaggacacagatacacgcagagggaggacagtatgaagacacaggaagaaaatggc
catctacaaggaaggaaagaggtctcagcagaaataaaccctgctgacaccttgatcttg
gacttccagcctccagaactatga
>gi568815586r:89249257_89452039|GENSCAN_predicted_peptide_2|224_aa
MKNKHGSEEGPRSRKRQKSRKNNVAFRATLLLEICANSGESGCEAEQLSWDDFDNLPEQE
GQSLGPESTKEKKPVTKCSRERLLQQDDWKQSNSQSAKDVVEKICPSHNTSSLKGISGGS
SASPWPSYRLEVLKSGTNSEKEPSRYLESSLGVPVSSVSGTWSQKRDWPSDGFVPLTSQA
LSPLHPKEGQHRIDHSLFPRIIPDLSPRGLRDPSASLHTQTDGP
>gi568815586r:89249257_89452039|GENSCAN_predicted_CDS_2|675_bp
atgaagaataaacacggtagtgaagaaggaccaagatccaggaaaaggcagaaatccaga
aagaataacgttgcattcagggctacacttcttctggagatttgtgccaattctggagaa
agtggatgtgaagctgagcaactgagctgggatgattttgacaatctgccagagcaagaa
ggacaaagtctgggaccagagtccaccaaggagaagaaaccagtgaccaagtgttccagg
gagagactgctccagcaggatgactggaagcagagcaacagccaatcggcaaaggatgtg
gtggagaagatctgtccatcccacaacacaagcagcctcaagggcatatctgggggtagt
tcagccagcccttggccatcctacaggctagaagtgctcaaaagtgggacaaactcagag
aaggaacccagtcggtatctggaatcgagcctgggagttccagtgtcctcagtatctgga
acatggtctcaaaagagggattggccttcagatgggtttgtccccttaacctctcaggct
ctttcccccttgcatcccaaggaagggcaacacagaatagaccacagccttttccccaga
atcatcccagatcttagtccccggggactgagagatccctctgcatcccttcacactcag
acagatggcccatag
>gi568815586r:89249257_89452039|GENSCAN_predicted_peptide_3|508_aa
MPVGAGRRAKGDPATLGALAVFTVGAKRSKGHSPKPHPAGRLPPLPPLRQRSTPMIDTLR
PVPFASEMAISKTVAWLNEQLELGNERLLLMDCRPQELYESSHIESAINVAIPGIMLRRL
QKGNLPVRALFTRGEDRDRFTRRCGTDTVVLYDESSSDWNENTGGESVLGLLLKKLKDEG
CRAFYLEGGFSKFQAEFSLHCETNLDGSCSSSSPPLPVLGLGGLRISSDSSSDIESDLDR
DPNSATDSDGSPLSNSQPSFPVEILPFLYLGCAKDSTNLDVLEEFGIKYILNVTPNLPNL
FENAGEFKYKQIPISDHWSQNLSQFFPEAISFIDEARGKNCGVLVHCLAGISRSVTVTVA
YLMQKLNLSMNDAYDIVKMKKSNISPNFNFMGQLLDFERTLGLSSPCDNRVPAQQLYFTT
PSNQNVYQDDTTKLAPKKFTSSLLSLLEANWEREGMVGNDAIILFLDHVCTRSPWSSKHE
EWLHTSDGPVQKENVGPLVQIAGKTGYS
>gi568815586r:89249257_89452039|GENSCAN_predicted_CDS_3|1527_bp
atgcccgtgggtgcggggcgtcgggcgaagggagaccccgcgaccttaggcgccctcgcg
gtgttcacggtaggcgcaaagcgcagcaagggacactccccaaagccgcaccccgccgga
cggctgcccccgcttccgcctctcaggcagcgctcgacccccatgatagatacgctcaga
cccgtgcccttcgcgtcggaaatggcgatcagcaagacggtggcgtggctcaacgagcag
ctggagctgggcaacgagcggctgctgctgatggactgccggccgcaggagctatacgag
tcgtcgcacatcgagtcggccatcaacgtggccatcccgggcatcatgctgcggcgcctg
cagaagggtaacctgccggtgcgcgcgctcttcacgcgcggcgaggaccgggaccgcttc
acccggcgctgtggcaccgacacagtggtgctctacgacgagagcagcagcgactggaac
gagaatacgggcggcgagtcggtgctcgggctgctgctcaagaagctcaaggacgagggc
tgccgggcgttctacctggaaggtggcttcagtaagttccaagccgagttctccctgcat
tgcgagaccaatctagacggctcgtgtagcagcagctcgccgccgttgccagtgctgggg
ctcgggggcctgcggatcagctctgactcttcctcggacatcgagtctgaccttgaccga
gaccccaatagtgcaacagactcggatggtagtccgctgtccaacagccagccttccttc
ccagtggagatcttgcccttcctctacttgggctgtgccaaagactccaccaacttggac
gtgttggaggaattcggcatcaagtacatcttgaacgtcacccccaatttgccgaatctc
tttgagaacgcaggagagtttaaatacaagcaaatccccatctcggatcactggagccaa
aacctgtcccagtttttccctgaggccatttctttcatagatgaagcccggggcaagaac
tgtggtgtcttggtacattgcttggctggcattagccgctcagtcactgtgactgtggct
taccttatgcagaagctcaatctgtcgatgaacgatgcctatgacattgtcaaaatgaaa
aaatccaacatatcccctaacttcaacttcatgggtcagctgctggacttcgagaggacg
ctgggactcagcagcccatgtgacaacagggttccagcacagcagctgtattttaccacc
ccttccaaccagaatgtataccaggatgacacaactaaactggccccaaagaagttcact
tcctcactcttatccctcctagaggcaaactgggagagggagggaatggtcggcaatgat
gccatcatccttttcctggatcacgtctgcactcggagtccttggtcttctaagcacgag
gaatggctacatacctcggatggcccagtgcaaaaagaaaatgtggggcctcttgttcaa
attgctggaaaaacagggtacagttga
>gi568815586r:89249257_89452039|GENSCAN_predicted_peptide_4|155_aa
MSVATEVPVATGKSDWEVSASLTENALLPRSVRPPVGATESHLQLSLTPGAGPKSALRGN
QIKRAFELNGHTFIIVLIIHSLHYYHNSGTVKQSVIWAKRSGVEEGGEVEQQGKLKTINQ
KKTEKKHRASFTGSVAPKPSFQGYGFKIMEEAAFT
>gi568815586r:89249257_89452039|GENSCAN_predicted_CDS_4|468_bp
atgtctgtagccactgaggttcctgttgcaacagggaaatcagattgggaggttagtgca
tcattgacagagaatgccctccttccacgctctgttcgaccccctgtgggggccacagag
agtcacctgcagctcagtcttaccccaggtgctgggcccaagtctgctcttagaggaaat
caaatcaaacgagcattcgagttaaatggccacactttcattattgttcttattattcac
agtctgcattactaccacaacagtggcaccgttaagcaatccgtcatttgggccaaacgg
tcaggggtggaggaaggaggagaggtagaacaacagggtaaactgaaaacaatcaatcaa
aaaaagacagaaaaaaagcaccgagccagtttcacaggctccgttgctccaaagccatca
tttcaaggttatggctttaaaattatggaggaggcagctttcacctga
>gi568815586r:89249257_89452039|GENSCAN_predicted_peptide_5|184_aa
MKGLLPPESCVNRLGCRWETPKENLEEMDKFPDTYTLPRLNHEEVEYLNGPTTSSEIEAL
INSLPTKKSQGADRFTAEFHLRTSDKNHMVISIDAEKAFDKIQHPFMLKTLNKLGIDGTS
LKIIRDIYDKPTANIIPNGQKLEAFPLKTGTREGMINITSCETLQLSLAESGEEGGFNNF
ELFM
>gi568815586r:89249257_89452039|GENSCAN_predicted_CDS_5|555_bp
atgaaaggtttacttcctccagaaagctgcgtgaaccgcttaggatgccgatgggaaaca
cctaaagaaaatctagaagaaatggataaattcccggacacatacaccctcccaagatta
aaccatgaagaagtcgaatacctgaatggaccaacaacaagttctgaaattgaggcacta
attaatagcctaccaaccaagaaaagccaaggagcagacagattcacagctgaattccac
ctgagaaccagtgacaaaaaccacatggttatctcaatagatgcagaaaaagcctttgat
aaaattcaacatcccttcatgctaaaaacactcaataaactaggtattgacggaacgtct
ctcaaaataataagagatatttatgacaaacccacagccaatatcataccaaacgggcaa
aagctggaagcattccctttgaaaaccggcacaagagaaggaatgatcaacatcaccagt
tgtgagactctccaactctcacttgctgaaagcggtgaagaagggggatttaacaacttt
gaacttttcatgtga
>gi568815586r:89249257_89452039|GENSCAN_predicted_peptide_6|128_aa
MGLSKEELKNSDCPAILFDKVLPQSIVSSMKEDTMSDLFFAIALMPTIGECLPTTTKKKT
EDMSDLPCESQRSIPLAVTDALEHIMEQLNVLTQTVSILEQRLTLTEDKLKDCLENQQKL
FSAVQQKS
>gi568815586r:89249257_89452039|GENSCAN_predicted_CDS_6|387_bp
atggggttatccaaagaagaacttaagaatagtgattgccctgccatcctttttgataaa
gtacttccccagagtattgtaagctccatgaaggaagataccatgtctgacttgttcttt
gctatagctctaatgcctaccataggagaatgtttgccaacaaccacgaaaaagaaaaca
gaagacatgagtgacctcccctgtgaaagtcaaaggagcatacctctcgctgtgactgat
gctttagagcatattatggaacaactcaatgttttgacacagactgtttcaatcttggag
cagcgactgactttgacagaggataagctgaaagactgccttgaaaatcagcaaaagctt
ttcagtgctgtccaacagaaaagctga
>gi568815586r:89249257_89452039|GENSCAN_predicted_peptide_7|315_aa
MVKGSIQQEELTILNIYAPNAGAPRFIKQVLRDLQRDLDSHTIIMGDFNTPLSTLDSSTR
QKVNKDIQELNSALHQVDLMDIYRTLHPKSTEYTFFSAPHCTYSKIDHIVGSKAQLSKCK
RTEMITNCLSDHNAIKLELRIKKLTQNRSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNE
NKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQ
EITKIRAELKEIETQKTLQKINESRSWFFEKIHKIDRPLARLIKKKREKNQIDAIKNDKG
DITTNPTEIQTTIRK
>gi568815586r:89249257_89452039|GENSCAN_predicted_CDS_7|948_bp
atggtcaagggatcaatccaacaagaagagctaactatcctaaatatatatgcacccaat
gcaggagcacccagattcataaagcaagtccttagagacttacaaagagacttagattcc
cacacaataataatgggagactttaacaccccactgtcaacattagacagttcaacaaga
cagaaagttaacaaggatatccaggaattgaactcagctctgcaccaagtggacctaatg
gacatctatagaactctccaccccaaatcaacagaatatacatttttctcagcaccacat
tgcacttattccaaaattgaccatatagttggaagtaaagcacaactcagcaaatgtaaa
agaacagaaatgataacaaactgtctctcagaccacaatgcaatcaaactagaactcagg
attaagaaactcactcaaaaccgctcaactacatggaaactgaacaacctgctcctgaat
gactactgggtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaatgag
aacaaagacacaacataccagaatctctgggacacattcaaagcagtgtgtagagggaaa
tttatagcactaaatgcccacaagagaaagcaggaaagatctaaaattgacaccctaaca
tcacaattaaaagaactagagaagcaagagcaaacacattcaaaagctagcagaaggcaa
gaaataactaagatcagagcagaactgaaggagatagagacacaaaaaacccttcaaaaa
atcaatgaatccaggagctggttttttgaaaagatccacaaaattgatagaccactagca
agactaataaagaagaaaagagagaagaatcaaatagatgcaataaaaaatgacaaaggg
gatatcaccaccaatcccacagaaatacaaactaccatcagaaaataa
>gi568815586r:89249257_89452039|GENSCAN_predicted_peptide_8|42_aa
XWFTTELNAICGAVGASRKIVNLAFNCSAHLYPSGFELFGAA
>gi568815586r:89249257_89452039|GENSCAN_predicted_CDS_8|129_bp
ngatggtttactacggagctaaatgccatttgtggggctgttggagcttcacggaaaata
gtgaacttagctttcaactgttctgcacatctatacccatctggttttgagttatttggt
gctgcatag