GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:38:04 Sequence gi568815596f:41948797_42157685 : 208889 bp : 46.63% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 3815 3935 121 2 1 121 46 55 0.507 2.55 1.02 PlyA + 4523 4528 6 1.05 2.08 PlyA - 5227 5222 6 1.05 2.07 Term - 6300 6045 256 2 1 101 53 111 0.476 3.96 2.06 Intr - 9179 9086 94 0 1 67 63 76 0.179 2.02 2.05 Intr - 29604 29410 195 1 0 60 91 54 0.048 2.29 2.04 Intr - 45254 45125 130 2 1 47 -6 202 0.118 6.97 2.03 Intr - 52818 52764 55 2 1 -1 97 65 0.073 -2.52 2.02 Intr - 59555 59389 167 2 2 98 51 83 0.051 4.36 2.01 Init - 65421 65383 39 0 0 81 81 2 0.059 -1.01 2.00 Prom - 68958 68919 40 -3.56 3.02 PlyA - 69190 69185 6 1.05 3.01 Sngl - 81638 81462 177 2 0 78 52 166 0.237 6.95 3.00 Prom - 89584 89545 40 -1.56 4.00 Prom + 91984 92023 40 -8.56 4.01 Init + 99404 100042 639 1 0 84 96 1013 0.997 95.04 4.02 Intr + 104443 104565 123 0 0 115 46 205 0.959 19.78 4.03 Intr + 105240 105511 272 2 2 76 60 343 0.981 26.64 4.04 Intr + 106145 106224 80 2 2 102 99 91 0.807 10.79 4.05 Intr + 106490 106597 108 0 0 43 117 152 0.995 13.86 4.06 Intr + 108425 108598 174 0 0 76 99 221 0.992 21.91 4.07 Intr + 123510 123603 94 1 1 68 84 41 0.034 0.72 4.08 Intr + 123674 123892 219 2 0 1 75 126 0.092 0.02 4.09 Intr + 124231 124270 40 1 1 67 45 59 0.217 -2.27 4.10 Intr + 124391 124479 89 1 2 114 119 78 0.891 12.27 4.11 Intr + 126758 126840 83 2 2 52 127 26 0.736 2.18 4.12 Intr + 152155 152301 147 2 0 -16 94 97 0.130 0.11 4.13 Term + 152334 152971 638 1 2 67 44 215 0.576 9.41 4.14 PlyA + 153893 153898 6 1.05 5.00 Prom + 156705 156744 40 -2.76 5.01 Init + 161478 161494 17 2 2 56 91 27 0.469 -0.22 5.02 Intr + 163490 163553 64 2 1 90 115 27 0.655 4.32 5.03 Intr + 169328 169473 146 1 2 80 92 76 0.650 6.28 5.04 Term + 175390 175450 61 1 1 90 33 43 0.109 -3.72 5.05 PlyA + 175773 175778 6 1.05 6.04 PlyA - 175932 175927 6 1.05 6.03 Term - 177537 177406 132 0 0 45 42 136 0.161 2.79 6.02 Intr - 184462 184409 54 1 0 60 102 44 0.060 2.18 6.01 Init - 194329 194156 174 1 0 46 73 98 0.202 3.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 108807 108892 86 1 2 111 43 63 0.874 1.92 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:41948797_42157685|GENSCAN_predicted_peptide_1|40_aa XSLEFSALRTYKLPRGLPDQPQLLPASFRQHISAALGHRS >gi568815596f:41948797_42157685|GENSCAN_predicted_CDS_1|123_bp nnatccttggagttctccgcactcagaacctacaagctgcccaggggtcttccagaccag ccccagttgcttccagcctccttccgccagcacatctctgcagccttgggccaccgatcc taa >gi568815596f:41948797_42157685|GENSCAN_predicted_peptide_2|311_aa MTMAALWNRKGGKSLLIASAQMQLAVCHAREIPKLEDLGTRSSSGIYWQCGLKEVTLTLE ISAPSAAKGQYQLSGITVLQDRRTSAKKVGDHPHIIFSASIDHVCVDSVDFIVIANTSEY TIFTLYSFIAEFYLSFKALFKYLLHCGFFPDHPHAQLNAIFHLWSFLLVPLVFSNYCTVL CNVELCPLQEGKLLAEPRTQRLLKGKDDHEDLATMFHKDQGRKAVGSEWQLGLLAAGSSP EPVTRAQDELSRCICDNVAFIFITVDPVGGKPPFSHFSPHLSSQNSKLPPRRDCRFPGRG KLCRTSSPEHS >gi568815596f:41948797_42157685|GENSCAN_predicted_CDS_2|936_bp atgacaatggcggctttgtggaatagaaaggggggaaagagcctcctgattgcctcagcc cagatgcagctggcagtgtgtcatgcgagggagatccctaaactagaagacctgggaact agatctagctctggtatttactggcagtgtggacttaaggaagtcacattgactttagag atctctgctccctctgctgcaaagggacaataccagctgagtggcatcactgtcctgcag gacagacgtacaagtgccaagaaagtgggcgaccaccctcacatcatcttctctgcttct attgaccatgtctgcgtggactccgtggactttatcgtcatcgccaacacctctgaatat accatttttacattgtattcatttatcgctgaattctacctatccttcaaggctctgttc aagtacctgctccactgtggattctttccagatcatccccatgcccaactgaatgccatt ttccacttgtggagctttctgttggtgcctctggttttcagcaattactgcactgtactt tgtaatgtagaattatgtcccctacaagaaggcaagctccttgcagagcccaggacacag cgtttgctgaaaggcaaggatgaccatgaagacctagccaccatgttccacaaggaccag gggagaaaagctgtgggaagtgagtggcagcttggactcttggctgcaggcagttctcct gaaccggtcaccagggctcaggatgagctatccagatgcatctgtgacaatgtggctttc atcttcatcacagttgaccccgtgggaggaaagccccctttctcccacttcagcccacat ctctcttcccagaacagcaaactgccacccaggagggactgccgcttccctggtcgtggg aaactctgcagaacctcctctcctgagcactcctga >gi568815596f:41948797_42157685|GENSCAN_predicted_peptide_3|58_aa MAILLWPACSPDYFSLIERKECEECGVDCLRRAAAADPLGDQGIRILGPGMGELQMAR >gi568815596f:41948797_42157685|GENSCAN_predicted_CDS_3|177_bp atggccatcctgctgtggcctgcttgctcccctgattatttctccttaattgagcgtaaa gaatgtgaggaatgtggagtagactgcctgaggagggctgctgcagccgaccccctcggg gatcaggggatccggattctgggccctgggatgggagaactgcagatggcccgctga >gi568815596f:41948797_42157685|GENSCAN_predicted_peptide_4|901_aa MRRRRAAVAAGFCASFLLGSVLNVLFAPGSEPPRPGQSPEPSPAPGPGRRGGRGELARQI RARYEEVQRYSRGGPGPGAGRPERRRLMDLAPGGPGLPRPRPPWARPLSDGAPGWPPAPG PGSPGPGPRLGCAALRNVSGAQYMGSGYTKAVYRVRLPGGAAVALKAVDFSGHDLGSCVR EFGVRRGCYRLAAHKLLKEMVLLERLRHPNVLQLYGYCYQDSEDIPDTLTTITELGAPVE MIQLLQTSWEDRFRICLSLGRLLHHLAHSPLGSVTLLDFRPRQFVLVDGELKVTDLDDAR VEETPCAGSTDCILEFPARNFTLPCSAQGWCEGMNEKRNLYNAYRFFFTYLLPHSAPPSL RPLLDSIVNATGELAWGVDETLAQLEKVLHLYRSGQYLQNSTASSSTEYQCIPDSTIPQE DYRCWPSYHHGSCLLSVFNLAEAVDVCESHAQCRAFVVTNQTTWTEVQREDARLSGSECA MGPSRLPQLCIQPDVALCWQDGLSCSCWAAVQSDRQNAVYKGVETDNRWQLSTSWRLQRG VADYLAFGNRALLPAGGPQPWAVPEGSSEMMRVQKGKLEVIVGLLLTDEEIESTLKMKSP LYQNLNSPPTVIWQESDLYSAEFCRCPPLITLHKSKQYLHAETGTCGGLTVRSSVQVLSI LFAGPHACRAPPQKAGPDSDQQGLQGRSAGLRQRLGRRLGVAGEPLHVLPFSRVHGALEA QMLAKWRERSRDEDSGRGAPASARIPAHPPPRPAPAPTHWGSPALPEARAAPQEESQPFP PRCAVCGLPAANRSASRSPAAPGSCPGSAVLLGIPGRPPPRPGPPDRPFQRLQARAVPPD AGYTTRRCWNLSQPRVAAPGPGTPSPERPRRPLASELETEVDACSVTLGQIALPKPQFLH L >gi568815596f:41948797_42157685|GENSCAN_predicted_CDS_4|2706_bp atgcggcgccggcgggcggcagtggccgcgggtttctgcgcctccttcctgctgggctcc gtcctcaacgtgctcttcgctccgggctcggagcctccgaggccaggccagtcccctgag ccttcgccggccccgggtccgggccgtcgcgggggccgcggggagctggcccggcagatc cgggcgcgctacgaggaggtgcagcgctattcccgcgggggccccgggcccggggcgggc cggccggagcggcggcgcctgatggacctggctccgggcgggcccggcctgccgcgcccc cggcccccttgggcccggcccctgtccgacggcgccccaggctggcccccggctcccggc ccaggctcccccggcccgggcccgcgcctgggctgcgccgcgcttcgcaacgtgtccggc gcgcagtacatgggctcaggctacaccaaggccgtgtaccgggtccgcctgcccggcggt gccgcggtggcgctcaaggcggtggactttagcggccacgatctgggcagctgcgtgcgc gagttcggggtacggaggggctgctatcggctggcggcccacaagctgcttaaggagatg gtgctgctggagcggctgcggcaccccaacgtgctgcagctctatggctactgctaccag gacagcgaggacatcccagacaccctgaccaccatcacggagctgggcgcccctgtagaa atgatccagctgctgcaaacttcctgggaggatcgattccgaatctgcctgagcctgggc cgcctcctccaccacctggcccactccccactgggctccgtcactctgctggacttccgc cctcggcagtttgtgctggtggatggggagctcaaagtgacggacctggatgacgcacgt gtggaggagacgccgtgtgcaggcagcaccgactgcatactcgagtttccggccaggaac ttcaccctgccctgctcagcccagggctggtgcgagggcatgaacgagaagcggaacctc tataatgcctacaggtttttcttcacatacctcctgcctcacagtgccccgccttcactg cgtcctctgctggacagcatcgtcaacgccacaggagagctcgcctggggggtggacgag accctggcccagctggagaaggtgctgcacctgtaccggagcgggcagtatctgcagaac tccacggcaagcagcagtaccgagtaccagtgtatcccagacagcaccatcccccaggaa gactaccgctgctggccatcctaccaccacgggagctgcctcctttcagtgttcaacctg gctgaggctgtggatgtctgtgagagccatgcccagtgtcgggcctttgtggtcaccaac cagaccacctggacagaggtccagagagaggatgcccgcctctctgggagtgagtgtgcc atgggcccatccaggctaccccagctctgtattcaaccagatgtggctctgtgctggcag gacggcctcagctgcagctgctgggcagctgtccagagtgacaggcagaatgcagtgtac aagggtgtggaaacagacaacaggtggcagctcagcaccagttggcggctgcagagggga gtggcagattatctagcatttggtaacagggcgttgctaccagcaggtggaccacagcca tgggcagttccagaggggtcttctgaaatgatgagggtgcagaaagggaagttggaggtc atcgtggggctcctacttactgatgaagaaatagaatcaactctcaaaatgaaaagccca ctctatcagaacctaaacagccctccaacagtcatctggcaggagtcagatctgtatagt gcagagttctgcagatgccctcccctcatcacgctgcacaaatccaagcagtaccttcac gctgaaactgggacctgcggaggcctcactgtccggagcagcgtccaggtcctcagcatc ctcttcgctgggccccacgcgtgtcgtgcgccaccccagaaggcagggcctgattcggat cagcaggggctgcagggccgctctgcaggcctgaggcagagactgggcaggaggctgggc gtggccggggagccccttcacgtgctccccttctcgagggttcacggggctctggaggcc cagatgctggcaaagtggcgggagcgctccagggacgaggactcaggtcggggcgccccc gccagcgccaggatccccgctcaccctccgccgcgccccgcccccgccccgacacactgg gggagccccgccctccccgaggcccgcgcggcgccgcaggaggaatcccagccatttcct ccacgctgcgcggtatgtggcctgcccgccgccaaccgcagcgcgagccggtccccagcc gcgcctggcagctgccccggctccgccgtgctgctcgggattccgggaaggccgccccct cgtcccgggccaccagaccggcctttccagcggctccaggcccgtgcagtcccgccggac gccggctacaccacgcgccgctgctggaacctctcccagccccgcgtggccgcccccggc ccaggcaccccctccccggaacgcccccgacggcccctcgcgtccgagctggaaactgaa gttgacgcttgctccgtcaccctggggcaaattgctttgcctaagcctcagtttctccat ctgtga >gi568815596f:41948797_42157685|GENSCAN_predicted_peptide_5|95_aa MNNRIRYYPQQHSSRPADLDYFLGSSQCPRDERTLKGHVSCDPTSSISITGEKLKQKLES TPLLLTPNLHFYRILGSLHLTVLIIETSYFDLNTS >gi568815596f:41948797_42157685|GENSCAN_predicted_CDS_5|288_bp atgaacaaccgaatcaggtattatccccagcagcacagcagcagaccagctgacctggac tatttcttaggcagtagtcagtgtcccagggatgaaaggactttgaaaggtcatgtgtct tgtgacccaactagcagcatcagcatcactggggagaaactcaagcagaagcttgagtct actccactcttgctgacaccaaatctgcatttttacaggattcttggatctctccacctg actgtcctcattattgaaacgtcatattttgacctcaacacttcatag >gi568815596f:41948797_42157685|GENSCAN_predicted_peptide_6|119_aa MTSEKDLSGPRKGGRKVKEESKQELNLGVNATVIMTHSLMVLKAGVALEKPGHSCITKEA ASAELKTIRVDQRKCKERPAYPSSFTQSLEDFSEIYLYLVPSVAIAGQSCILAIWTWYS >gi568815596f:41948797_42157685|GENSCAN_predicted_CDS_6|360_bp atgacatcagaaaaggacctgagtggcccaagaaagggaggacggaaagtcaaggaggaa agtaagcaggagttgaatttaggagtaaatgctacggtgattatgactcacagcctcatg gtcctcaaggctggagtggcgttggagaaaccaggtcatagttgtataaccaaggaagct gcttctgcagaattaaaaaccatcagagttgatcaaaggaagtgcaaggaaagacccgca taccccagcagtttcactcagtccctcgaggacttcagtgaaatctacttgtacctggtg cccagtgtggcaattgcaggtcagtcctgcatcctggccatctggacgtggtattcttaa