GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:05:25 Sequence gi568815595f:5022466_5278710 : 256245 bp : 43.56% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 30999 31080 82 2 1 80 70 70 0.167 5.53 1.02 Term + 44230 44354 125 2 2 23 48 98 0.011 -2.15 1.03 PlyA + 46103 46108 6 1.05 2.05 PlyA - 48940 48935 6 1.05 2.04 Term - 58984 58692 293 2 2 55 38 444 0.613 31.61 2.03 Intr - 60350 60164 187 2 1 15 34 167 0.096 3.26 2.02 Intr - 61085 60977 109 1 1 46 31 176 0.176 7.99 2.01 Init - 64560 64442 119 0 2 32 75 58 0.285 -1.43 2.00 Prom - 78873 78834 40 -5.76 3.00 Prom + 88077 88116 40 -5.76 3.01 Init + 100001 100123 123 1 0 94 89 265 0.967 27.37 3.02 Term + 100507 100548 42 0 0 43 47 71 0.370 -4.34 3.03 PlyA + 101015 101020 6 1.05 4.02 PlyA - 101052 101047 6 1.05 4.01 Sngl - 122630 122136 495 2 0 52 50 242 0.861 12.85 4.00 Prom - 157204 157165 40 -1.86 5.00 Prom + 162681 162720 40 -6.36 5.01 Init + 165341 165849 509 1 2 88 86 813 0.925 73.53 5.02 Intr + 172744 172816 73 1 1 82 92 57 0.968 4.91 5.03 Intr + 177127 177230 104 0 2 94 93 168 0.996 16.87 5.04 Intr + 179318 179459 142 2 1 64 63 132 0.873 8.66 5.05 Intr + 180501 180684 184 2 1 106 36 27 0.801 -1.34 5.06 Intr + 182602 182776 175 2 1 107 75 112 0.999 10.80 5.07 Intr + 184688 184808 121 2 1 86 68 179 0.999 16.20 5.08 Intr + 185628 185798 171 2 0 115 79 129 0.995 14.84 5.09 Intr + 187710 187783 74 2 2 110 34 39 0.574 -1.10 5.10 Intr + 188655 188751 97 0 1 21 84 93 0.706 2.21 5.11 Intr + 190854 191057 204 2 0 91 92 136 0.999 13.70 5.12 Term + 193364 193453 90 1 0 114 39 113 0.917 6.72 5.13 PlyA + 193636 193641 6 1.05 6.04 PlyA - 193841 193836 6 1.05 6.03 Term - 200673 200558 116 1 2 6 54 189 0.733 6.03 6.02 Intr - 202278 202218 61 0 1 108 -18 103 0.325 0.01 6.01 Init - 217723 217655 69 1 0 73 92 25 0.368 2.46 6.00 Prom - 219189 219150 40 -5.16 7.00 Prom + 221695 221734 40 -4.46 7.01 Sngl + 221749 222084 336 0 0 76 44 153 0.377 5.73 7.02 PlyA + 222808 222813 6 1.05 8.02 PlyA - 223270 223265 6 1.05 8.01 Term - 247079 246859 221 0 2 61 38 157 0.496 5.20 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 57917 57747 171 2 0 110 54 249 0.997 18.53 S.002 Init - 155396 155336 61 2 1 60 94 64 0.840 5.61 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:5022466_5278710|GENSCAN_predicted_peptide_1|68_aa MDTEQIITPTTTVRDAKKEAPPTEQYKKEIVFYCPKAESSPWRNSSSTRTTLALCLSVEQ AFIVDFYK >gi568815595f:5022466_5278710|GENSCAN_predicted_CDS_1|207_bp atggacactgaacaaattattaccccaacaaccacagtgagagatgccaagaaggaggca cctccaacagagcagtataagaaagaaattgtcttttattgccccaaagcagaaagctcc ccatggagaaactcatcctcaacaaggaccacccttgccctctgtctttccgtggagcag gccttcattgtggacttctacaagtga >gi568815595f:5022466_5278710|GENSCAN_predicted_peptide_2|235_aa MYVDLQTLFFNFNEINFHIKQNEPLKIPTAKLLIEDKPGMKDHPNLIKNAKKPDIPKKLN SPTQQLWYTHRKKDFSCAELMANKKDVPSAERMICRVASSGSCSPGRRRTPMCDQETKDY EAELLRFSQETAPGGAAAVGKGQQLQEEQPRFLEIELACTLARRWSDLSEKAKYKTREQA RQGVPGTQQAAQVIKRVQIWQQSIISNYLACFKNDRVKASKAMDVTWNPKEENLM >gi568815595f:5022466_5278710|GENSCAN_predicted_CDS_2|708_bp atgtatgtggatttacagactctattcttcaattttaatgaaattaactttcatataaag caaaatgaacctctgaagattcccacagcaaagctgcttattgaagataaaccaggaatg aaggatcatcccaacttaatcaagaatgccaagaagccagacatccccaagaagctcaac tcccccacccagcagctgtggtacacccatcggaagaaggactttagttgcgcagagctc atggccaacaagaaggatgttcccagcgcggagcgcatgatatgccgtgtggccagcagt ggaagctgctctcccggaaggagaaggacgcctatgtgcgaccaggaaacaaaagattat gaggcggaactgctgcgtttttctcaggagactgccccaggaggagcagcagcagtggga aaggggcagcagctgcaggaggagcagcctaggttcttggagatcgagctggcctgcacg ctggcccgaaggtggagtgacttgtctgagaaggccaagtacaagacccgagagcaagcc aggcagggagtgccaggaactcagcaagccgcccaagtcatcaagagagtacagatctgg caacagagcatcatcagcaactacctggcctgcttcaagaacgaccgggtgaaggcctcg aaagccatggatgtgacctggaatcccaaggaggagaacctgatgtga >gi568815595f:5022466_5278710|GENSCAN_predicted_peptide_3|54_aa MLALISRLLDWFRSLFWKEEMELTLVGLQYSGKTTFVNVIAGGIWQIRQPYEKA >gi568815595f:5022466_5278710|GENSCAN_predicted_CDS_3|165_bp atgctggcgctcatctcccgcctgctggactggttccgttcgctcttctggaaggaagag atggagctgacgctcgtggggctgcagtactcgggcaagaccaccttcgtcaatgtcatc gcgggtgggatttggcaaatccgtcagccgtacgagaaagcgtaa >gi568815595f:5022466_5278710|GENSCAN_predicted_peptide_4|164_aa MAKRDNGYAITDGNKHSLCEKGRLPNGWLAQTCELYAFNQALKLLEGQEGTIFTNSKYAY GVVHTFGKIWTVQGLINSRGKELVHGELVKQVLESLQLPAEVAIVRINGHQKGNTIEAVG NKLADKAAMQASLEEEIRLFSLIPDIPKVVLRPQFTRKEKEELG >gi568815595f:5022466_5278710|GENSCAN_predicted_CDS_4|495_bp atggcaaagagagataatggttatgctatcactgatggaaataaacactccttatgtgag aaaggtagattacctaatggttggttggcccaaacctgtgaattatatgcttttaaccag gcgctaaagctccttgaaggccaagaaggcactatattcactaattctaaatatgcctat ggggtggtacacacttttggaaaaatctggacagtgcagggcctaataaatagcagggga aaagaattggtacatggggaactggtcaaacaggttttagaaagcctccagcttccagca gaggttgccatagttcgcataaatggtcatcagaaaggtaacactatagaagctgtagga aacaagcttgcagataaagctgctatgcaagcctccctggaggaagaaattagactattt agcctgatcccagacatccctaaggtagtattaaggccccagtttaccagaaaggagaag gaagaattaggatag >gi568815595f:5022466_5278710|GENSCAN_predicted_peptide_5|647_aa MQWRALVLGLVLLRLGLHGVLWLVFGLGPSMGFYQRFPLSFGFQRLRSPDGPASPTSGPV GRPGGVSGPSWLQPPGTGAAQSPRKAPRRPGPGMCGPANWGYVLGGRGRGPDEYEKRYSG AFPPQLRAQMRDLARGMFVFGYDNYMAHAFPQDELNPIHCRGRGPDRGDPSNLNINDVLG NYSLTLVDALDTLAIMGNSSEFQKAVKLVINTVSFDKDSTVQVFEATIRIITDSKQPFGD MTIKDYDNELLYMAHDLAVRLLPAFENTKTGIPYPRVNLKTGVPPDTNNETCTAGAGSLL VEFGILSRLLGDSTFEWVARRAVKALWNLRSNDTGLLGNVVNIQTGHWVGKQSGLGAGLD SFYEYLLKSYILFGEKEDLEMFNAAYQSIQNYLRRGREACNEGEGDPPLYVNVNMFSGQL MNTWIDSLQAFFPGLQVLIGDVEDAICLHAFYYAIWKRYGALPERYNWQLQAPDVLFYPL RPELVESTYLLYQATKNPFYLHVGMDILQSLEKYTKVKCGYATLHHVIDKSTEDRMESFF LSETCKYLYLLFDEDNPVHKSGTRYMFTTEGHIVSVDEHLRELPWKEFFSEEGGQDQGGK SVHRPKPHELKVINSSSNCNRVPDERRYSLPLKSIYMRQIDQMVGLI >gi568815595f:5022466_5278710|GENSCAN_predicted_CDS_5|1944_bp atgcaatggcgagcgctcgtcctggggctggtgctcctccggcttggcctccatggagta ttgtggctcgtcttcgggctggggcccagcatgggcttctaccagcgctttccgctcagc ttcggcttccagcgtctgaggagccccgacggccccgcgtcgcccacctcggggcccgtg ggccggcctgggggggtatccgggccgtcgtggctgcagccgccggggaccggggcagcg cagagcccgcgcaaggctccgcggcgtcctgggccggggatgtgcggcccagccaactgg ggctacgtgctgggcggccggggccgcggcccggacgagtacgagaagcgctacagcggc gccttccctccgcagctgcgtgcccagatgcgcgacctggcacggggcatgttcgtcttt ggctacgacaactacatggctcacgccttcccccaggacgagctcaaccccatccactgc cgcggccgtgggcccgaccgcggggacccttcaaatctgaacatcaatgatgtactaggg aactactcattgactcttgttgatgcattggatacacttgcaataatgggaaattcatcc gagttccagaaagccgtcaagttagtgatcaacacagtttcatttgacaaagattccacc gtccaagtctttgaggccacgataagaataataactgactccaagcagccctttggtgac atgacaattaaggactatgataatgagttgttatacatggcccatgacctggcggtgcgg ctcctccctgcttttgaaaacaccaagacagggattccatatcctcgggtgaatctaaag acaggagttcctcctgacaccaataatgagacatgcacagcgggagccggttccctcctg gtggaatttgggattctgagtcgactcctgggggactccacatttgagtgggtggccaga cgagcagtgaaagccctttggaacctccggagcaatgatacaggattactaggcaatgtc gtgaacattcagacgggccactgggttggaaagcagagtggcctgggtgccgggctggac tccttctatgaatacctcttgaaatcttacattctctttggagaaaaagaagacctagaa atgtttaatgctgcatatcagagtattcagaactacttaagaagagggcgggaagcctgc aatgaaggagaaggagaccctccactctatgtcaacgtgaacatgttcagtgggcagctg atgaacacctggattgactctctgcaggcctttttccctggactgcaggtgctgatagga gatgtggaagatgccatctgccttcatgccttctactatgccatatggaaacgatatggt gccctccctgagagatataactggcagctgcaggcccctgacgttctcttctacccactg agaccagagttagtggaatccacatatctcctctaccaggcaaccaagaatcccttctac ctccatgtaggaatggatattctgcagagtctggaaaagtacacaaaagtcaagtgtggg tacgccacgctgcatcacgtcattgacaagtccacagaagaccggatggagagcttcttt ctcagtgagacctgtaaatatttgtatctgctgtttgatgaagacaatccagtacacaag tctggaaccagatacatgttcacaacagagggacacattgtatctgtggatgagcatctt cgggaattgccatggaaggaattcttctctgaagagggagggcaggaccaagggggaaag tctgtgcacaggccgaaacctcatgagttaaaagtcatcaactccagctccaactgcaat cgtgtacctgatgagaggaggtactccctgcccttaaagagcatctacatgcgacagatt gaccagatggttggtttgatttga >gi568815595f:5022466_5278710|GENSCAN_predicted_peptide_6|81_aa MLIIPIFPVMMYYHDSVTAWFTKPTQREDEDEDLYDDPFPLMNSHFLREFDEPIIEGHPA VQQSVFKAPHDERLYLEKVDV >gi568815595f:5022466_5278710|GENSCAN_predicted_CDS_6|246_bp atgctgatcattcctatctttcctgttatgatgtactaccatgattcagtcactgcctgg ttcaccaagcctactcaacgtgaagatgaggatgaagacctctatgatgatccatttcca cttatgaatagtcattttcttcgagaatttgacgagcccatcattgaagggcacccagcc gtgcagcagagcgtcttcaaagctccccacgatgagagactttaccttgaaaaggtggat gtttga >gi568815595f:5022466_5278710|GENSCAN_predicted_peptide_7|111_aa MGKDFMTKTPKAMATKAKIEKSDLIKLKNFCTAKETIIRVNRQPAEWEKIFAIYPSDKGL ISRIYKELKQIHKKKKNNPIKKWAKDMNRRFSKEGIYVANKHEKKLIITGH >gi568815595f:5022466_5278710|GENSCAN_predicted_CDS_7|336_bp atgggcaaagactttatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgag aaatcggatctaattaaactaaagaacttctgcacagcaaaagaaactatcattagagtg aacagacaacctgcagaatgggagaaaatttttgcaatctatccatctgacaaagggcta atatccagaatctacaaggaacttaaacaaattcacaagaaaaaaaaaaacaaccccatt aaaaagtgggcaaaagatatgaacagacgcttctcaaaagaaggcatttatgtggccaac aaacatgaaaaaaagctcatcatcactggtcattag >gi568815595f:5022466_5278710|GENSCAN_predicted_peptide_8|73_aa XVEKGVGKSKVCGYWAWLVVAAMEEDPEIPSKELSDLSKKMPQNQTWTNKATTLATLSVP SNSDLIRAQTLTL >gi568815595f:5022466_5278710|GENSCAN_predicted_CDS_8|222_bp naagtggagaaaggagttggaaagtcaaaggtatgtggctactgggcctggcttgtggta gcagccatggaggaggacccagaaattccaagcaaggaattatctgacctaagcaagaag atgcctcaaaatcagacttggaccaataaagccaccacacttgccactctatcagtgcct tccaactcagacctcatacgtgctcaaactcttactctctaa