GENSCAN 1.0 Date run: 7-Nov-116 Time: 16:56:12 Sequence gi568815594f:188039559_188247369 : 207811 bp : 44.01% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.15 PlyA - 178 173 6 -0.45 1.14 Term - 2855 2795 61 1 1 72 49 85 0.124 0.28 1.13 Intr - 7983 7891 93 2 0 32 105 102 0.225 5.38 1.12 Intr - 24698 24629 70 0 1 89 105 38 0.458 3.84 1.11 Intr - 42047 41975 73 2 1 95 83 10 0.008 0.18 1.10 Intr - 50973 50941 33 0 0 82 95 37 0.016 2.22 1.09 Intr - 52383 51827 557 1 2 81 -9 285 0.013 10.66 1.08 Intr - 57603 57503 101 2 2 98 89 29 0.910 3.75 1.07 Intr - 57788 57766 23 2 2 100 113 1 0.966 0.14 1.06 Intr - 59617 59477 141 1 0 9 116 127 0.206 8.15 1.05 Intr - 65374 65279 96 1 0 64 82 30 0.175 0.21 1.04 Intr - 65871 65622 250 2 1 59 90 76 0.201 2.34 1.03 Intr - 66029 65927 103 0 1 75 36 137 0.509 6.43 1.02 Intr - 81706 81679 28 1 1 98 58 14 0.024 -2.91 1.01 Init - 86953 86870 84 1 0 56 109 60 0.491 5.72 1.00 Prom - 89921 89882 40 -5.56 2.00 Prom + 95335 95374 40 -5.56 2.01 Init + 99267 99277 11 2 2 84 92 9 0.268 0.91 2.02 Intr + 100126 100408 283 1 1 83 52 205 0.268 13.82 2.03 Intr + 102694 102924 231 0 0 76 78 162 0.856 11.97 2.04 Intr + 104280 104302 23 2 2 94 102 10 0.854 -0.66 2.05 Intr + 104478 104575 98 0 2 96 92 45 0.820 5.35 2.06 Term + 107264 107814 551 0 2 83 55 358 0.775 26.46 2.07 PlyA + 107914 107919 6 1.05 3.00 Prom + 119457 119496 40 -4.76 3.01 Sngl + 139260 139778 519 2 0 55 54 304 0.680 17.75 3.02 PlyA + 140213 140218 6 1.05 4.03 PlyA - 143387 143382 6 1.05 4.02 Term - 158229 158107 123 0 0 93 48 63 0.504 1.18 4.01 Init - 159643 159566 78 1 0 56 81 52 0.668 2.36 4.00 Prom - 168059 168020 40 -3.36 5.03 PlyA - 168427 168422 6 1.05 5.02 Term - 169241 169118 124 1 1 92 54 63 0.867 1.16 5.01 Init - 174339 174230 110 0 2 77 52 171 0.976 12.09 5.00 Prom - 197911 197872 40 -3.56 6.02 PlyA - 198435 198430 6 1.05 6.01 Term - 207089 206925 165 2 0 37 39 189 0.562 6.82 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 52383 51815 569 1 2 81 33 277 0.945 16.08 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:188039559_188247369|GENSCAN_predicted_peptide_1|570_aa MRPKKSGILYTRQWSHKRTTASAEKNEQVKFCVKAILGSCQKFVGFEESRAPTRSVCVDL FPWIFVDSEVANAGRYLSRSIQSSLQLGTPTRMSKRLSPQLQHNITEDAYCETHLEPTRL FCDVDQITLCSKCFQSQEHKHHMVCGIQEAAENYRKLFQEILNTSREKLEAAKSILTDEQ ERMAMIQRLGRVGRENMEKLKESEARASEQVRSLLKLIVELEKKCGEGTLALLKNAKYSL ERSKSLLLEHLEPAHITDLSLCHIRGLSSMFRVLQRHLTLDPETAHPCLALSEDLRTMRL RHGQQDGAGNPERLDFSAMVLAAESFTSGRHYWEVDVEKATRWQVGIYHGSADAKGSTAR ASGEKVLLTGSVMGTEWTLWVFPPLKRLFLEKKLDTVGVFLDCEHGQISFYNVTEMSLIY NFSHCAFQGALRPVFSLCIPNGDTSPDSLTILQHGPSCDATEQFDLLEEARQIGELCGHS RYSEIQENFVFEKEGGALPHLSWRALTRIVDAVTAPRTFRYFKDARTPRKLEKGIEEQYV LATQKLCLFSWGALCGSDPTAVSAVDVYGS >gi568815594f:188039559_188247369|GENSCAN_predicted_CDS_1|1713_bp atgaggcccaagaaatccggaattctctacacaaggcaatggtctcacaagcggacaact gcctcggcagagaaaaatgaacaggttaagttctgtgtcaaagctattctaggcagctgc cagaaatttgttggatttgaggagtccagagcgccgacccgctctgtctgcgtggactta tttccttggatcttcgtggactctgaagttgccaatgcagggagatacctcagtagaagc atacagtccagtcttcaactagggacccctaccaggatgtccaaaaggctcagccctcag ttacagcacaacatcacagaagatgcctattgtgaaacacacctggaaccaacacggctg ttctgtgatgttgaccaaatcacactctgcagcaaatgcttccagtcccaggagcacaaa catcacatggtgtgtgggatacaagaggctgctgagaattacaggaagttattccaggaa atattgaacacatcgagggagaaacttgaagcagctaaaagcatattgactgatgagcaa gaaagaatggcgatgattcagagactgggccgtgtggggagagagaacatggagaaactg aaggagagtgaagccagggcttctgaacaggtccgcagcctcctaaagctcatcgtggag cttgagaaaaagtgtggggaaggcaccttggcattgctcaagaatgcaaaatactcttta gaaaggagcaagtcactgctgcttgagcatctggagcccgctcatatcacagacctgagt ttatgccacataagaggactcagcagcatgttcagagtactccagagacatttaacattg gatcctgaaacagctcatccctgcctggcactatctgaggacctgagaactatgagattg agacatgggcagcaggatggggctggcaacccagaaagattggatttcagtgccatggtg ctggctgcggagagcttcacctcagggaggcactactgggaggtggacgtggaaaaggca accaggtggcaagtgggcatataccacggctctgcagacgcgaagggcagcacggccaga gcttccggagagaaagtcttgctcacggggtcggtgatggggaccgagtggactctctgg gtcttcccccctctgaaaaggctcttcctggaaaagaagttggacacagttggcgttttc cttgactgcgaacacgggcagatatcattctacaatgtgaccgagatgtccctcatttac aatttctcccattgcgccttccaaggagctctcaggcctgtgttttccctctgtatccca aatggagacacaagtccagactccctcaccatcttacaacatggtccttcttgtgatgct actgaacaatttgatttattagaagaagcccgacagataggagaactatgtggtcattcc agatactcagagatacaagagaactttgtatttgaaaaagagggtggagctctaccccac ctgtcctggagagctctgacccgaatcgtggacgccgtcactgcaccacggacattccga tattttaaggatgccaggacgccgcggaagctggagaaaggaattgaggagcagtacgtg ctggccacacaaaaactatgtttatttagctggggagctttgtgcggctccgaccccacg gcggtgtctgcggtggatgtttacggctcctga >gi568815594f:188039559_188247369|GENSCAN_predicted_peptide_2|398_aa MADGSWEEHNTPLSCPECWRTLEGPHFQSNERLGRLASIARQLRSQVLQSEDEQGSYGRM PTTAKALSDDEQGGSAFVAQSHGANRVHLSSEAEEHHREETKTCKQVVVSEYMKMHQFLK EEEQLQLQLLEQEEKENMRKLRNNEIKLTQQIRSLSKMIAQIESSSQSSAFESLEEVRGA LERSEPLLLQCPEATTTELSLCRITGMKEMLRKFSTEITLDPATANAYLVLSEDLKSVKY GGSRQQLPDNPERFDQSATVLGTQIFTSGRHYWEVEVGNKTEWEVGICKDSVSRKGNLPK PPGDLFSLIGLKIGDDYSLWVSSPLKGQHVREPVCKVGVFLDYESGHIAFYNGTDESLIY SFPQASFQEALRPIFSPCLPNEGTNTDPLTICSLNSHV >gi568815594f:188039559_188247369|GENSCAN_predicted_CDS_2|1197_bp atggcagatgggagctgggaggaacataacacacctttatcttgtcctgagtgctggagg accttggagggcccgcatttccagtcaaacgagcgtctggggaggctggccagcatcgcc aggcagctccggtcccaggtgctgcagagcgaggatgagcagggcagctacgggaggatg cccaccactgccaaggcgctctccgatgacgagcagggtggaagcgccttcgtagcccag agccatggtgcaaacagagtgcatctctccagcgaggctgaggagcatcacagagaagaa acaaagacttgtaaacaggttgttgtgtcagaatacatgaaaatgcaccagttcctgaag gaagaggagcagctgcaactccagctactagaacaggaagagaaagagaacatgaggaag ctgaggaacaatgagatcaaactgacccagcaaatcagaagcctaagcaaaatgatcgca cagattgagtcctcaagtcaaagctcggctttcgaatctcttgaggaagtgagaggagcc ctggaaaggagcgagccactcttgcttcagtgtccagaggccaccaccacagagctgagt ctgtgccgcatcacgggaatgaaggagatgctaagaaaattcagcacggagataacgctg gacccagccacagctaatgcctatctcgtgttgtcggaggatctgaagagtgtgaaatat gggggaagcagacagcagctacccgacaacccggaaagatttgaccagtctgcgactgtg ctgggtactcagatcttcaccagtgggagacactactgggaggtggaggtgggaaacaag accgagtgggaagtgggcatctgcaaggactctgtgagcagaaaggggaatctccccaag ccacctggggacctgttctcactaataggtttaaaaatcggagatgattacagcctctgg gtctcgtcacctttgaaaggtcagcacgtcagagagcctgtgtgtaaggttggtgtcttc ctggactatgaatctggacatatagcattctacaacgggacggatgaatccctcatctac agcttcccgcaggcttctttccaagaggccctcaggcctatcttttccccctgcctccca aatgaggggacaaacacagaccctctcaccatctgctcactgaacagccacgtctga >gi568815594f:188039559_188247369|GENSCAN_predicted_peptide_3|172_aa MCCLPSLALSASSASASALATLEEPFSRLLHCGSPFLGWPRLELVPSAYGEVWRKRRGRE PGLPAAPMGQRGLGGPCIQSSWQAPPAPGSEGLSTRASSCGRCAGYPSTAGPPRPHSNSC RASAASPQGRVQALQPAMPEFPLRCGLLRGPSLPYERCPLLHGAGSHRLPKG >gi568815594f:188039559_188247369|GENSCAN_predicted_CDS_3|519_bp atgtgctgcctgccctcgctcgctctcagtgcctcctcggcctcagcgtccgctctggcc acacttgaggagcccttcagccggctgctgcactgtgggagcccctttctgggatggcct agactggagctggttccctctgcttacggggaggtgtggaggaagaggcgagggcgggaa ccggggctgcccgcggcgcccatgggccagcgagggctcggcgggccctgcattcagagc agctggcaggcgccgcccgccccgggcagtgaggggcttagcacccgggccagcagctgc ggaaggtgcgccgggtaccccagcactgccggcccacccaggccgcactccaattcttgc cgggcctcagctgcctccccgcagggcagggttcaggccctacagcctgccatgcctgag tttcccctccgctgcgggctcctgcgcggcccgagcctgccctacgagcgctgtcccctg ctccatggcgccgggtcccatcgactgcccaagggctga >gi568815594f:188039559_188247369|GENSCAN_predicted_peptide_4|66_aa MNTEVISDDATGFRVEKKVTLAFKSPVSCSSARLDLLPYCKDRRLSVPWVLALVYQKNQI TRGLGE >gi568815594f:188039559_188247369|GENSCAN_predicted_CDS_4|201_bp atgaacactgaagtcatttcggatgatgcaacagggttcagggtggagaaaaaagtgacc ctggcttttaaatctccagtgtcgtgttcatcagctcgattagatcttctgccttattgc aaggacaggcggctttctgtaccctgggttcttgccttggtgtaccagaaaaatcagatc acacgtgggcttggagaatga >gi568815594f:188039559_188247369|GENSCAN_predicted_peptide_5|77_aa MEVDEEGVVTKLHQTSTVGQKNMDKSQTGSATLYYRETRNTDQMMRRTSDKFQQRASCRM IDQYSSKLSRSSKTRKI >gi568815594f:188039559_188247369|GENSCAN_predicted_CDS_5|234_bp atggaggtggacgaagaaggagttgtcaccaagctgcatcagacctccacagtgggccaa aagaatatggacaagtcacaaacaggatctgctacactctactacagagaaactcgtaat accgatcaaatgatgagaagaacatcagataaatttcaacagagggcatcctgcagaatg attgaccagtactcctcaaaactgtcaaggtcctcgaaaacaaggaaaatctga >gi568815594f:188039559_188247369|GENSCAN_predicted_peptide_6|54_aa PRTLLRHTEHLLCASIHYTADDDSDNTVNETDAKSSPCEASVQSINTSYGQNEY >gi568815594f:188039559_188247369|GENSCAN_predicted_CDS_6|165_bp ccacgcaccctgttaagacatactgagcacctactctgtgccagcatccattatacagct gatgatgacagtgacaacaccgtgaacgaaacagatgcaaaatcctctccttgtgaagct tccgtgcagtcaatcaacacgtcctacgggcaaaatgaatattag