GENSCAN 1.0 Date run: 3-Nov-116 Time: 20:59:05 Sequence gi568815593r:135474571_135678814 : 204244 bp : 45.89% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 103 98 6 1.05 1.03 Term - 7942 7790 153 1 0 101 41 117 0.747 6.22 1.02 Intr - 11873 11800 74 0 2 79 -6 52 0.103 -5.97 1.01 Init - 12450 12408 43 0 1 81 102 49 0.652 6.18 1.00 Prom - 15069 15030 40 -4.46 2.05 PlyA - 15445 15440 6 -0.45 2.04 Term - 16203 16026 178 2 1 66 49 186 0.931 9.66 2.03 Intr - 17902 17798 105 0 0 48 57 107 0.788 2.93 2.02 Intr - 29801 29717 85 0 1 85 87 4 0.068 -1.12 2.01 Init - 35546 35471 76 2 1 90 48 101 0.476 7.55 2.00 Prom - 36843 36804 40 -3.16 3.04 PlyA - 40143 40138 6 1.05 3.03 Term - 43012 42978 35 2 2 116 48 19 0.383 -1.55 3.02 Intr - 54289 54182 108 2 0 39 119 102 0.369 8.66 3.01 Init - 57538 57466 73 1 1 58 36 88 0.288 1.83 3.00 Prom - 59552 59513 40 -5.56 4.02 PlyA - 59578 59573 6 1.05 4.01 Sngl - 61120 60407 714 1 0 85 47 978 0.664 89.63 4.00 Prom - 66986 66947 40 -5.86 5.03 PlyA - 67523 67518 6 1.05 5.02 Term - 72963 72785 179 1 2 100 47 111 0.874 6.05 5.01 Init - 75901 75760 142 1 1 88 93 9 0.200 1.76 5.00 Prom - 80317 80278 40 -5.56 6.07 PlyA - 80327 80322 6 1.05 6.06 Term - 81203 81100 104 0 2 49 45 126 0.432 2.84 6.05 Intr - 84209 84129 81 0 0 64 98 49 0.177 3.11 6.04 Intr - 97841 97759 83 1 2 128 80 -30 0.309 -0.52 6.03 Intr - 100115 100002 114 1 0 104 101 241 0.981 26.56 6.02 Intr - 103969 103864 106 2 1 80 73 236 0.998 20.57 6.01 Init - 104208 104145 64 0 1 88 113 240 0.681 25.81 6.00 Prom - 107413 107374 40 -7.76 7.12 PlyA - 107661 107656 6 1.05 7.11 Term - 114079 113839 241 0 1 95 44 133 0.763 5.10 7.10 Intr - 117051 116971 81 2 0 68 94 51 0.352 2.45 7.09 Intr - 120301 120201 101 1 2 95 41 41 0.247 -0.99 7.08 Intr - 131089 130998 92 2 2 34 121 18 0.129 -0.59 7.07 Intr - 131684 131625 60 0 0 50 121 40 0.016 2.21 7.06 Intr - 152444 152363 82 2 1 108 98 35 0.109 5.71 7.05 Intr - 156542 156392 151 1 1 61 42 94 0.075 2.26 7.04 Intr - 185063 184909 155 2 2 4 58 136 0.022 1.07 7.03 Intr - 186296 186192 105 2 0 47 91 67 0.026 3.31 7.02 Intr - 194848 194709 140 2 2 104 69 52 0.895 5.08 7.01 Init - 196774 196684 91 1 1 120 41 15 0.722 0.65 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 65816 65998 183 1 0 68 54 128 0.839 4.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:135474571_135678814|GENSCAN_predicted_peptide_1|89_aa MGVPSAPPPFFTDEDQLPSFDPPRWDSSTVLEEPVALTQLSRDDDSDDSNSHIDLGSLPL HPALPLKISVSDQTTSSHPEELITIAGPK >gi568815593r:135474571_135678814|GENSCAN_predicted_CDS_1|270_bp atgggtgttccttcagcccctccacccttcttcacggatgaagaccagctgccttccttt gacccaccaagatgggactcctccacagtcctggaagagcctgtggccttgacccaactg agcagagatgatgacagtgatgatagtaacagccacattgacctaggcagtcttcccctc cacccggccttgcctctgaagatctcagtatctgatcagacaaccagttctcaccctgaa gagctgatcacaattgcaggccccaagtga >gi568815593r:135474571_135678814|GENSCAN_predicted_peptide_2|147_aa MTVDTAEFFPSDPISPSNKDDHTAEGYSFFHPPSTGLLQLLLWKCPDSLSTSQLSSSPNQ QQVLPEHYFMLQPYPQEPSFINHTTMPDGPPIASRAPAACGDGKGPGPELEVEAAWPQLA KTAPTLASFTGASSSLSLESRGPDIVF >gi568815593r:135474571_135678814|GENSCAN_predicted_CDS_2|444_bp atgacggtggacacagctgagttcttcccatcagatcctatcagccccagtaacaaggat gaccatactgcagaagggtatagctttttccacccaccctcaacagggcttctccagctg ctgctctggaagtgtcctgacagcctcagcacatcacagctgtccagtagccctaaccag caacaagtgttgcctgaacactacttcatgctccagccctacccgcaggagcccagtttc atcaaccacacaaccatgccagatgggccgcccatcgcgtccagagcacctgctgcgtgc ggcgacgggaaaggccccggccctgaactggaggtggaggcggcttggccacagctggct aagaccgcgcctacgctggcatcttttactggagcttcttcctcgctcagcctggagtct agaggtcccgacatcgtcttctga >gi568815593r:135474571_135678814|GENSCAN_predicted_peptide_3|71_aa MYDPEHLSRTQSTFPKRNVFNNIVENMYAVAPTSAKQGIKAAKFNTKETNGEDFSTEDAP GPSSKYSHIRS >gi568815593r:135474571_135678814|GENSCAN_predicted_CDS_3|216_bp atgtatgacccagagcacctttccaggacccagagcacctttccaaaaagaaacgtcttt aacaacatcgttgaaaatatgtatgcagttgctccaacctcagccaagcagggaatcaag gctgcaaagttcaacaccaaagagaccaatggtgaagattttagcacagaagatgctcct ggcccgagctccaaatacagccacattaggagttag >gi568815593r:135474571_135678814|GENSCAN_predicted_peptide_4|237_aa MPARLETCISDLDCASSSGSDLSGFLTDEEDCARLQQAASASGPPAPARRGAPNISRASE VPGAQDDEQERRRRRGRTRVRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVLP SFPDDTKLTKIETLRFAYNYIWALAETLRLADQGLPGGGARERLLPPQCVPCLPGPPSPA SDAESWGSGAAAASPLSDPSSPAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH >gi568815593r:135474571_135678814|GENSCAN_predicted_CDS_4|714_bp atgccagcccgccttgagacctgcatctccgacctcgactgcgccagcagcagcggcagt gacctatccggcttcctcaccgacgaggaagactgtgccagactccaacaggcagcctcc gcttcggggccgcccgcgccggcccgcaggggcgcgcccaatatctcccgggcgtctgag gttccaggggcacaggacgacgagcaggagaggcggcggcgccgcggccggacgcgggtc cgctccgaggcgctgctgcactcgctgcgcaggagccggcgcgtcaaggccaacgatcgc gagcgcaaccgcatgcacaacttgaacgcggccctggacgcactgcgcagcgtgctgccc tcgttccccgacgacaccaagctcaccaaaatcgagacgctgcgcttcgcctacaactac atctgggctctggccgagacactgcgcctggcggatcaagggctgcccggaggcggtgcc cgggagcgcctcctgccgccgcagtgcgtcccctgcctgcccggtcccccaagccccgcc agcgacgcggagtcctggggctcaggtgccgccgccgcctccccgctctctgaccccagt agcccagccgcctccgaagacttcacctaccgccccggcgaccctgttttctccttccca agcctgcccaaagacttgctccacacaacgccctgtttcattccttaccactag >gi568815593r:135474571_135678814|GENSCAN_predicted_peptide_5|106_aa MEVRSQVAFLKASSRPLAADAVTQVCPALSHCSVPSHSEEALLFPGPGEEELCMCTHVSM HVSTHAGMNTWCPCTNPFPPQISLERQLYSKDMNDDGDDDEVKKRD >gi568815593r:135474571_135678814|GENSCAN_predicted_CDS_5|321_bp atggaggtgcgctcacaagtggctttccttaaagccagttccaggcctcttgctgctgat gctgtgacccaggtgtgccccgcactcagccactgctctgtgccctctcactctgaggag gccctcctcttccctggccctggtgaggaagagctatgtatgtgcacgcatgtatccatg catgtcagcacccatgcaggcatgaacacctggtgcccatgcacaaatccattccctcca caaatatcccttgagcgtcaactatactcaaaggatatgaatgatgatggtgatgatgat gaagtcaagaagagggactga >gi568815593r:135474571_135678814|GENSCAN_predicted_peptide_6|183_aa MRLLAAALLLLLLALYTARVDGSKCKCSRKGPKIRYSDVKKLEMKPKYPHCEEKMVIITT KSVSRYRGQEHCLHPKLQSTKRFIKWYNAWNEKRRKQLLELAFFWRTCIIWAKYKEIYKP KSRVYPQGRLSVEEIDSCMEMFTEASFTTASYKIPDQYSSKLSQSSKTEKARNFDSQEEP EET >gi568815593r:135474571_135678814|GENSCAN_predicted_CDS_6|552_bp atgaggctcctggcggccgcgctgctcctgctgctgctggcgctgtacaccgcgcgtgtg gacgggtccaaatgcaagtgctcccggaagggacccaagatccgctacagcgacgtgaag aagctggaaatgaagccaaagtacccgcactgcgaggagaagatggttatcatcaccacc aagagcgtgtccaggtaccgaggtcaggagcactgcctgcaccccaagctgcagagcacc aagcgcttcatcaagtggtacaacgcctggaacgagaagcgcaggaagcagcttctagag ctagcattcttctggaggacatgcattatttgggcaaaatacaaagaaatatacaagcct aagtcaagagtctatcctcagggaagactttcagtggaagaaattgatagctgcatggag atgttcactgaagcatcatttaccacagcatcctacaaaatacctgaccagtactcctca aaactgtcacagtcatcaaaaacagagaaagccagaaacttcgacagccaagaggagcct gaggagacatga >gi568815593r:135474571_135678814|GENSCAN_predicted_peptide_7|432_aa MASLLESLLHKGKKASTVSTSLSVTNFPAPDRCFLASLPNKQLAHKSFSQSLLVGEPKKA APIQKCPFPPPGPRASPSVGCGPAAAAPGDLLDMQHLRPQLLNQKLHFNKTQQCENRLIQ ENWYQKGGIAIKIPENVKATLELDNEQRLEDFGGLRRRQEDEERKNVHASNSRLSAGSSG SMQEDKDSKALAYSMSSIWSNSVEMMLPASNKAQAPTSCAKPGHKWGYFYGSVARRQPAG QDSLATQTTSLTMQTFYVEEKEWPNSQQESQAVVEVVKKMLNVLISEKQVPKEDKKAYLS LCVKVLFMLSNAMHRRDLGTLQAVMPLFEEKQWELRRLEQALQKKAGYMEHSRSISTTCF MKAFPIPNLDGTALQPPIWNPRIRNSFEVFSQPAGKGLSGLQTSCRASGPQPPQPCNPGT EQALSKGSTWCL >gi568815593r:135474571_135678814|GENSCAN_predicted_CDS_7|1299_bp atggccagccttctagagagcctgctacataaaggaaaaaaggctagcacagtgtccacc agcttatcagtgacaaacttcccagctccagataggtgcttcctggcatcacttcccaat aagcaacttgcacacaaatccttctctcagagtctgctggtgggagaacccaagaaggca gcccctatccagaagtgtcctttcccacctccaggccccagagcatctcctagtgtgggc tgtggaccagcagcagcagcacccggggacttgttagacatgcagcatctcagaccccaa ctgctgaatcagaagctgcattttaacaagacccagcagtgtgagaacagactaatacaa gaaaattggtaccagaaagggggcattgctataaagatacctgaaaatgtcaaagcaact ttggaactggataatgagcagaggttggaagactttggagggctcagaagaagacaggaa gatgaggaaaggaaaaatgtacatgcatctaactcgaggctttctgctggaagttcaggg tcgatgcaggaagacaaggactcaaaggctttggcatattccatgtcttctatctggtcc aacagtgtagagatgatgctgccagcttcaaacaaggcccaggctcccacctcctgtgcc aagccaggccataagtggggctacttctatgggtctgttgcaagaaggcaaccagcaggc caagatagccttgcaacccagaccacctccctgacaatgcagacattttatgtggaagaa aaagaatggccaaattcccagcaagaaagtcaggcagtggtagaggtagtgaagaagatg ttaaatgttctcatttctgaaaagcaagtacctaaggaagacaaaaaggcttacctgagc ctgtgtgtgaaggtgctcttcatgctgtcaaatgctatgcataggagggatctgggcacc ctccaagcagtgatgccgctgtttgaggaaaagcaatgggagctgcgtaggctggagcaa gcgctgcagaagaaagcaggatacatggaacactccaggtctatttccactacctgcttc atgaaagcattcccaatccccaacctggatgggactgccttacagccccccatctggaat cctcgcatccgtaactcatttgaagtattctctcagcctgctggcaagggcctctctggc ctgcagacatcctgccgagcctctggcccgcagcccccgcagccctgcaaccccggcaca gagcaggcactcagcaagggctcaacttggtgtttatag