GENSCAN 1.0	Date run: 4-Nov-116	Time: 00:14:46

Sequence gi568815575r:8070085_8270643 : 200559 bp : 38.50% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1246   1285   40  0  1   53   94    48 0.087   2.31
 1.02 Intr +  51808  51995  188  2  2  114   44   173 0.127  13.99
 1.03 Term +  55633  55869  237  0  0   51   42   134 0.007   0.28
 1.04 PlyA +  56367  56372    6                               1.05

 2.00 Prom +  59277  59316   40                              -1.45
 2.01 Init +  60150  60290  141  2  0   59   44   113 0.594   4.18
 2.02 Intr +  61634  61679   46  1  1  111   87    49 0.865   4.16
 2.03 Intr +  69729  69819   91  1  1   33   78    73 0.003  -1.07
 2.04 Intr +  70991  71216  226  2  1   31  113    96 0.014   3.46
 2.05 Intr +  74295  74625  331  2  1  -42   22   292 0.108   3.88
 2.06 Term +  74768  74982  215  0  2   55   55   141 0.826   3.91
 2.07 PlyA +  76349  76354    6                               1.05

 3.04 PlyA -  76371  76366    6                               1.05
 3.03 Term -  83047  82883  165  1  0   14   48   189 0.843   4.43
 3.02 Intr -  84542  84485   58  1  1   92   94     2 0.308  -0.83
 3.01 Init -  87254  87184   71  2  2   66  116     4 0.254   1.57
 3.00 Prom -  90149  90110   40                              -3.75

 4.02 PlyA -  90409  90404    6                               1.05
 4.01 Sngl - 100559  99948  612  2  0   66   38   529 0.979  41.64
 4.00 Prom - 104982 104943   40                              -6.95

 5.03 PlyA - 105144 105139    6                               1.05
 5.02 Term - 106059 105839  221  1  2   60   37   224 0.998  10.72
 5.01 Init - 107876 107687  190  2  1   71   81   146 0.665  10.00
 5.00 Prom - 109998 109959   40                              -7.45

 6.04 PlyA - 110037 110032    6                               1.05
 6.03 Term - 111411 111271  141  0  0   55   43    88 0.007  -2.05
 6.02 Intr - 112464 112354  111  0  0  104   86    84 0.006   9.46
 6.01 Init - 118944 118882   63  0  0   55   99    34 0.042   2.40
 6.00 Prom - 120362 120323   40                              -4.35

 7.00 Prom + 143534 143573   40                              -5.25
 7.01 Init + 144176 144189   14  1  2  103   80     6 0.363   0.49
 7.02 Term + 148245 148365  121  0  1   80   41   139 0.537   5.37
 7.03 PlyA + 148535 148540    6                               1.05

 8.03 PlyA - 149190 149185    6                               1.05
 8.02 Term - 170012 169950   63  2  0  110   39   132 0.869   7.41
 8.01 Init - 196360 196331   30  1  0   95   59    36 0.212   1.19

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 114189 114126   64  0  1   57   97    52 0.821   4.36

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575r:8070085_8270643|GENSCAN_predicted_peptide_1|154_aa
MLEGKAGPSYMAAAIGSCTAGASQTTMSAAPCSTTPSPINRPRAEECGHRARDWQAAPPA
AQCQIHWVKPAGLLSLSLHKGKTGSHVVPRQCGRSEAAVSSTNAAACGARQVQALAQAMV
VTGWQLQAPRVLRQHQGPGLLHSCKVVVLVLRLP

>gi568815575r:8070085_8270643|GENSCAN_predicted_CDS_1|465_bp
atgttggaagggaaagcaggcccatcttacatggcagcagccatcggctcctgcacagct
ggagcctcccaaacgacgatgagcgctgctccctgctccacaacgcccagtcccatcaac
cgcccaagggctgaggagtgtgggcacagggcacgggactggcaggcagctccacctgcg
gcccagtgccagatccactgggtgaagccagctgggctcctgagtctgagcctccataag
ggaaagactggctcccacgtggtccccaggcagtgtggtcgcagtgaggcagcagtgtcg
agcacaaatgcagcagcctgtggggcacggcaggtgcaggctctggctcaagccatggtg
gtgacgggctggcagcttcaggctcccagagtcctgagacagcatcagggccctgggctc
ctgcattcctgcaaggtggtggtcttggttctaaggctgccatga

>gi568815575r:8070085_8270643|GENSCAN_predicted_peptide_2|349_aa
MNRDELLEGRGVGLEQGVSYEAATKTKVQLGTTLNATERESAFYLYEPCEEGVCFPFTYC
PDSKILKDLRRVAAQTFLVKGIEKDSNQLLRVPIMWLHFLQVDELEENAWVLSLIPVGYG
RSPKPGGTSILAGVQALDTIRRRNSRKSQKNSVKCGHLLHTKSTHRRKFLKTACPEFVPS
DVRMCLEFLPSGGVRGLAGSGVKLRTFAASVTALKVAHLELFIPPGGFVVLLASGVKLQT
FAVSVTAHKGSVDPKSEQQQDLLQTVKEQSFHSVEGDPTRHKGSPSPPPESGAQLASPGG
SCTGAAGGAACQSHAVRRHSSALGWSMGLSAVEQGVDAHPGGLGHTGAH

>gi568815575r:8070085_8270643|GENSCAN_predicted_CDS_2|1050_bp
atgaacagggatgagttgttggaaggaaggggtgtaggtttggaacagggagtgagttat
gaagctgcaacaaagactaaagtccaacttggaacaacgctgaatgcaacagaaagggag
tcagcattttatctttatgaaccttgtgaagaaggtgtctgcttccccttcacctactgc
cctgattcaaaaattctcaaggacttaaggagagttgctgcacaaactttccttgtgaaa
gggattgagaaagattccaaccaactcctccgggtgcctatcatgtggctgcactttttg
caagttgatgaactggaagaaaatgcctgggtgctgagtctgatacccgtaggctatggg
aggtccccaaaaccaggtgggacctcaatcctggccggtgtccaggctcttgataccatc
aggagaagaaattcaaggaagagtcaaaaaaatagtgttaagtgtggacatttattgcat
accaaaagtacacaccgaagaaagttcttaaagacggcgtgtccggagtttgttccttct
gatgttcggatgtgtttggagtttcttccttctggtggggttcgtggtctcgctggctca
ggagtgaagctgcggacctttgcggcgagtgttacagctcttaaggtggcgcatctggag
ttgttcattcctcctggtgggttcgtggtcttgctggcttcaggagtgaagctgcagacc
ttcgcagtgagtgttacagctcataaaggcagtgtggacccaaagagtgagcagcagcaa
gatttattgcaaacagtgaaagaacaaagcttccacagtgtggaaggggacccgactaga
cataaaggttctccaagtcccccaccagagtcaggagcccagctggcttcacccggtgga
tcctgcactggggccgcaggtggagctgcctgccagtcccacgcggtgcgccggcactcc
tcagcccttgggtggtcaatgggactcagtgccgtggagcagggggtcgatgctcatccg
ggaggcttgggccacacaggagcccactga

>gi568815575r:8070085_8270643|GENSCAN_predicted_peptide_3|97_aa
MNGSCSHSFNCPNSEMNNYLLAVRWIGRLRNNRHTQDSESWVQRQAGPSEGVVEKGNVIG
GHTSMHVIVPEDLPVGPYVKVEGSDIDDPDSSVGLGY

>gi568815575r:8070085_8270643|GENSCAN_predicted_CDS_3|294_bp
atgaatggttcatgttcacacagcttcaactgcccgaattctgaaatgaacaactattta
ttagctgtaaggtggatcggcaggttgagaaataatagacacacacaagacagtgaaagc
tgggtccagcggcaggcaggtccttcagaaggtgttgtagaaaaaggcaatgttattgga
ggtcacacctccatgcatgttattgtccctgaagaccttccagtgggaccatatgtgaag
gtggaaggcagtgatattgatgatcctgactctagtgtaggcctaggctactga

>gi568815575r:8070085_8270643|GENSCAN_predicted_peptide_4|203_aa
MSPKPRASGPPAKATEAGKRKSSSQPSPSDPKKKVSDPPKLLLVFPSPPSSQEASPVVTW
QNPPTRPPPLLRTRPCSQPPPSSSLNQSPSVISLLSFQTTKVAKKGKAVRRGRRGKKGAA
TKMAAVTAPEAESAPAAPGPSDQPSQELPQHELPPEEPVSEGTQHDPLSQESEVEEPLSQ
ESEVEEPLTVWMASFSPVSESTD

>gi568815575r:8070085_8270643|GENSCAN_predicted_CDS_4|612_bp
atgagtccaaagccgagagcctcgggacctccggccaaggccacggaggcaggaaagagg
aagtcctcctctcagccgagccccagtgacccgaagaagaaggtgagtgaccctcccaag
ctcctcctcgtcttcccctcgcctccttcctcacaagaagcctctcctgtcgtcacttgg
cagaaccccccaacccggcccccaccgcttctgaggacacgtccctgttcccagcctcct
ccatcctcgtccctaaaccagagcccttctgtgatctccctgttgtccttccagactacc
aaggtggccaagaagggaaaagcagttcgtagagggagacgcgggaagaaaggggctgcg
acaaagatggcggccgtgacggcacctgaggcggagagcgcgccagcggcacccggcccc
agcgaccagcccagccaggagctccctcagcacgagctgccgccggaggagccagtgagc
gaggggacccagcacgaccccctgagtcaggagagcgaggtggaagaaccactgagtcag
gagagcgaggtggaagaaccactgactgtgtggatggccagcttttcccctgtctccgag
agcaccgactaa

>gi568815575r:8070085_8270643|GENSCAN_predicted_peptide_5|136_aa
MGMAAMPLSASVCGSPGCGGLRANSGPRNNCLHPTSSNECPKLKGLSLDLTEPYKQISNT
LSTGPKATGASLDWTTPNSPYDPDSCSTQKPLHVDLLLQGKGGTSTRQRTHRLHLRTSWR
RLSIVQEGLKLRQYPP

>gi568815575r:8070085_8270643|GENSCAN_predicted_CDS_5|411_bp
atggggatggctgccatgcctctgagtgctagtgtgtgtggctctcctggctgtggaggc
ctccgagctaactctgggccaaggaataactgtctacaccccacatcaagtaatgaatgt
cctaaactcaaaggcctttcactggatctcacagagccatataagcaaatttccaacact
ctttctacaggaccaaaagccactggagccagtttggactggaccacaccaaattctcct
tatgaccccgacagctgttcaactcaaaagcctctccatgtggatttactgctccagggt
aaaggtggcaccagcactcgacaaagaacgcaccgcctacacctgcgaaccagttggaga
cgtttgtctattgttcaggagggactcaaactcaggcaatatccaccttaa

>gi568815575r:8070085_8270643|GENSCAN_predicted_peptide_6|104_aa
MPFHPEWSSQKNIWGIFGEHEEENHIVYLDHIGQLNPSNVTRMVESRKKIEDDKKTTRSA
TWTTDVILAPSPIPIGLIAKHQSSRETFSEPQHLAGRLTECDVF

>gi568815575r:8070085_8270643|GENSCAN_predicted_CDS_6|315_bp
atgccttttcacccagaatggtcatctcagaaaaatatatggggaattttcggggaacat
gaggaagaaaatcatattgtctatttggatcatattggacagctgaacccatcaaacgtt
accagaatggtggagtccaggaagaaaatagaagacgacaagaaaactacacgtagtgct
acctggacaactgacgtcatcctcgccccaagccccattcccattggcctgattgccaaa
catcaatccagcagagaaaccttttctgaaccccaacacctagcaggtcgtctcacagaa
tgtgatgtgttttaa

>gi568815575r:8070085_8270643|GENSCAN_predicted_peptide_7|44_aa
MASTSKKHTPGWRYKMLMRHAMYEQARTVTAHVDPEDYPEHAYS

>gi568815575r:8070085_8270643|GENSCAN_predicted_CDS_7|135_bp
atggcttcgaccagtaaaaagcatacccctgggtggagatataagatgctaatgagacat
gcaatgtatgaacaagcacgcacagttactgcgcatgtggacccagaggactatccagaa
catgcttattcgtaa

>gi568815575r:8070085_8270643|GENSCAN_predicted_peptide_8|30_aa
MALNAGVFGEPQQEDDEDEDLSDDPLPLNE

>gi568815575r:8070085_8270643|GENSCAN_predicted_CDS_8|93_bp
atggctttaaatgctggtgtctttggagagcctcaacaggaagatgatgaggatgaagac
ctttctgatgatccacttccactgaatgaatag