GENSCAN 1.0 Date run: 5-Nov-116 Time: 13:05:42 Sequence gi568815594r:177331711_177542375 : 210665 bp : 39.38% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2044 2240 197 0 2 44 -12 190 0.443 1.89 1.02 Intr + 3978 4112 135 0 0 82 98 43 0.738 3.46 1.03 Intr + 4398 4611 214 0 1 86 106 125 0.938 11.90 1.04 Term + 5142 5180 39 2 0 63 43 67 0.739 -4.09 1.05 PlyA + 6450 6455 6 1.05 2.00 Prom + 7262 7301 40 -2.05 2.01 Init + 13425 13560 136 2 1 54 75 103 0.603 5.95 2.02 Intr + 21598 22018 421 2 1 84 80 191 0.493 9.98 2.03 Intr + 28793 28973 181 2 1 69 38 132 0.377 5.25 2.04 Intr + 49300 49529 230 0 2 91 20 137 0.677 2.94 2.05 Intr + 50548 50645 98 1 2 107 -34 117 0.483 0.13 2.06 Term + 52333 52514 182 2 2 54 38 207 0.906 9.09 2.07 PlyA + 53003 53008 6 1.05 3.00 Prom + 57012 57051 40 -4.75 3.01 Init + 76282 76358 77 0 2 90 105 15 0.831 4.03 3.02 Term + 76526 76682 157 2 1 -4 48 203 0.680 3.52 3.03 PlyA + 80241 80246 6 1.05 4.02 PlyA - 80348 80343 6 1.05 4.01 Sngl - 83986 83798 189 1 0 53 44 159 0.416 2.86 4.00 Prom - 93999 93960 40 -3.55 5.10 PlyA - 94320 94315 6 1.05 5.09 Term - 100098 99998 101 1 2 87 36 45 0.388 -3.49 5.08 Intr - 101637 101504 134 2 2 53 127 106 0.997 10.37 5.07 Intr - 102779 102672 108 1 0 28 86 166 0.980 8.88 5.06 Intr - 105286 105196 91 2 1 60 42 83 0.861 -0.97 5.05 Intr - 105809 105695 115 2 1 66 19 140 0.551 4.00 5.04 Intr - 107147 107035 113 0 2 73 63 9 0.363 -3.92 5.03 Intr - 107978 107866 113 1 2 103 68 74 0.723 6.00 5.02 Intr - 108716 108563 154 0 1 21 86 253 0.990 16.61 5.01 Init - 110665 110539 127 1 1 78 80 207 0.954 17.27 5.00 Prom - 117027 116988 40 -3.55 6.03 PlyA - 117775 117770 6 1.05 6.02 Term - 132954 132825 130 2 1 91 54 75 0.135 1.07 6.01 Init - 139686 139427 260 0 2 104 53 278 0.259 22.56 6.00 Prom - 150947 150908 40 -5.25 7.00 Prom + 163683 163722 40 -3.65 7.01 Sngl + 164149 164382 234 0 0 97 36 206 0.285 9.24 7.02 PlyA + 164683 164688 6 1.05 8.05 PlyA - 166055 166050 6 1.05 8.04 Term - 168917 168502 416 0 2 82 46 234 0.555 13.04 8.03 Intr - 169242 169010 233 2 2 75 60 62 0.075 -1.41 8.02 Intr - 169754 169577 178 0 1 67 37 131 0.084 3.96 8.01 Init - 173171 172988 184 2 1 37 25 180 0.125 5.93 8.00 Prom - 182910 182871 40 -4.45 9.00 Prom + 190156 190195 40 -3.85 9.01 Init + 195062 195159 98 1 2 64 97 3 0.053 -1.37 9.02 Intr + 205088 205272 185 2 2 77 47 108 0.038 4.11 9.03 Intr + 208565 208861 297 0 0 42 101 185 0.122 11.12 9.04 Term + 210311 210372 62 0 2 74 48 96 0.702 1.19 9.05 PlyA + 210600 210605 6 -1.75 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 168972 168960 13 0 1 94 101 19 0.819 3.20 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:177331711_177542375|GENSCAN_predicted_peptide_1|194_aa DLLGSYVFKFIRKIDEPSLDTDGSSLGQVSISESTVSKAQDLRKRQRPVQVKPEQVNAHP WTRQLWIHFGMKGFIMINPLEYKYKNGASPVLEVQLTKDLICFFDSSVELRNSMESQQRI RMMKELDVCSPEFSFLRAESEVKKQKGRMLGDVLMDQNVLPGVGNIIKNEALFDSGLHPA VKLLVTLEAAVVEG >gi568815594r:177331711_177542375|GENSCAN_predicted_CDS_1|585_bp gatctccttgggtcctacgtctttaaattcatcagaaaaattgacgaacccagcttggat actgatggatctagcttgggacaggtgtccatttctgaatcaactgtatcaaaggcacag gatctgagaaaacgacagagacctgttcaggttaagcctgaacaggttaatgcccaccct tggaccaggcaactgtggattcatttcggaatgaaaggcttcatcatgattaatccactt gagtataaatataaaaatggagcttctcctgttttggaagtgcagctcaccaaagatttg atttgtttctttgactcatcagtagaactcagaaactcaatggaaagccaacagagaata agaatgatgaaagaattagatgtatgttcacctgaatttagtttcttgagagcagaaagt gaagttaaaaaacagaaaggccggatgctaggtgatgtgctaatggatcagaacgtattg cctggagtagggaacatcatcaaaaatgaagctctctttgacagtggtctccacccagct gttaaactgttggtgacattagaagctgctgttgtggaaggatag >gi568815594r:177331711_177542375|GENSCAN_predicted_peptide_2|415_aa MFILLAMEVWKKKHLSLPGKDKEVFLKEAVFELSIERNRNSSKTRDSVLKSEENSTVFSH LMKYPCNTFGKPHTEVKINRKTAFGTTTLVLTDFSNKSSTLERKTKQNQILDEEFQNSPP ASVCLNDIQHPSKKTTNDITQPSSKVNISPTISSESKLFSPAHKKPKTAQYSSPELKSCN PGYSNSELQINMTDGPRTLNPDSPRCSKHNRLCILRVVGKDGENKGRQFYACPLPREAQC GFFEVCPNGQWCVSSSISPLPCSSPRLRDYLGPATASGAWSSCPLLAKAEGHSVAAFPGT HVCWVLSSCPTFMKNEVMLTTEGSQCDGSGCPGRSAAAIISISESKLIRKHSQSHILETE KSEIKVPVDLVSGESLISATKMAPRYCILTRQKDKAALFSPFYKGINPIHEVGAL >gi568815594r:177331711_177542375|GENSCAN_predicted_CDS_2|1248_bp atgtttatcctactggcaatggaagtatggaagaagaaacacttaagtctacctgggaag gacaaggaagtctttctgaaagaggcagtctttgagcttagtatagaaagaaataggaat tcatcaaagacaagagattcagtgctcaagagtgaagaaaattctactgtctttagccac ttaatgaagtacccgtgtaatacttttggaaaacctcatacagaagtcaagatcaacaga aaaactgcatttggaactacaactcttgtcttgactgattttagcaataaatccagtact ttggaaagaaaaacaaagcaaaaccagatactagatgaggagtttcaaaactctcctcct gctagtgtttgtttgaatgatatacagcacccctccaagaagacaacaaacgatataact caaccatccagcaaagtaaacatatcacctacaatcagttcagaatctaaattatttagt ccagcacataaaaaaccgaaaacagcccaatactcatcaccagagcttaaaagctgcaac cctggatattctaacagtgaacttcaaattaatatgacagatggccctcgtaccttaaat cctgacagccctcgctgcagtaaacacaaccgcctctgcattctccgagttgtggggaag gatggggaaaacaagggcaggcagttttatgcctgtcctctacctagagaagcacaatgt ggattttttgaagtatgtcccaatggacagtggtgtgttagcagctcaatcagccccttg ccctgctctagtccacggctgcgggactacctcggccccgccactgcttccggtgcatgg agcagctgccctctgctggcaaaggcagagggccacagtgttgcagcctttccaggcacc catgtttgctgggtcctgagctcttgtcccacattcatgaagaatgaggtcatgctgaca actgaaggcagccagtgtgatggcagtggctgccctggacggtccgccgctgccatcatc agcatctctgaatctaaattgatcagaaaacactctcaatcacatattctggagactgag aagtctgagatcaaggttccagtagatttggtgtctggtgaaagcctgatctccgctacc aagatggcacctcgttactgcatcctcacacggcagaaggataaggcagctctgttcagc cccttttataagggtattaatcccattcatgaggttggagccttataa >gi568815594r:177331711_177542375|GENSCAN_predicted_peptide_3|77_aa MAQAFCSLTWGSWPHGFQGMERWAMRSTAGTGLDLEGGSQWRVCDGSNQQWRTVSESSAR ARTNTDQKSAQLQDIIE >gi568815594r:177331711_177542375|GENSCAN_predicted_CDS_3|234_bp atggcacaagccttttgttccctgacctggggttcttggcctcacggattccagggaatg gaacgttgggccatgaggagcacagcaggcaccgggctggatctggagggtggaagtcag tggcgggtctgcgacggcagcaatcagcagtggcggacggtgagtgaaagctcagctcga gccagaacaaacacggaccagaagagcgcgcagttgcaagatataatagagtga >gi568815594r:177331711_177542375|GENSCAN_predicted_peptide_4|62_aa MDDFEGFETPVGKVTADVVEIAKELELEVKPDDVTALLKFHDKTLTDEKLLLRDEQKSSF VR >gi568815594r:177331711_177542375|GENSCAN_predicted_CDS_4|189_bp atggatgactttgaggggtttgagactccagtggggaaagtaacagcagatgtggtagaa atagcaaaagaactagaattagaagtgaagcctgatgatgtgactgcattgctaaaattt catgataaaacattaacagatgagaagttgcttcttagggatgagcaaaaaagtagtttc gtgagatga >gi568815594r:177331711_177542375|GENSCAN_predicted_peptide_5|351_aa MARKSNLPVLLVPFLLCQALVRCSSPLPLVVNTWPFKNATEAAWRALASGGSALDAVESG CAMCEREQCDGSVGFGGSPDELGETTLDAMIMDGTTMDVGAVGDLRRIKNAIGVARKVLE HTTHTLLVGESATTFAQSMGFINEDLSTTASQALHSDWLARNCQPNYWRNVIPDPSKYCG PYKPPGILKQDIPIHKETEDDRGHDTIVLEPEKFKNRALASGKGHPKVEVKRMRQEIGRV GDSPIPGAGAYADDTAGAAAATGNGDILMRFLPSYQAVEYMRRGEDPTIACQKVISRIQK HFPEFFGAVICANVTGSYGAACNKLSTFTQFSFMVYNSEKNQPTEEKVDCI >gi568815594r:177331711_177542375|GENSCAN_predicted_CDS_5|1056_bp atggcgcggaagtcgaacttgcctgtgcttctcgtgccgtttctgctctgccaggcccta gtgcgctgctccagccctctgcccctggtcgtcaacacttggccctttaagaatgcaacc gaagcagcgtggagggcattagcatctggaggctctgccctggatgcagtggagagcggc tgtgccatgtgtgagagagagcagtgtgacggctctgtaggctttggaggaagtcctgat gaacttggagaaaccacactagatgccatgatcatggatggcactactatggatgtagga gcagtaggagatctcagacgaattaaaaatgctattggtgtggcacggaaagtactggaa catacaacacacacacttttagtaggagagtcagccaccacatttgctcaaagtatgggg tttatcaatgaagacttatctaccactgcttctcaagctcttcattcagattggcttgct cggaattgccagccaaattattggaggaatgttataccagatccctcaaaatactgcgga ccctacaaaccacctggtatcttaaagcaggatattcctatccataaagaaacagaagat gatcgtggtcatgacactattgttctggagcctgagaagttcaagaacagggcactggca tctggcaagggccatcccaaggtggaagtcaagagaatgagacaggaaatcggccgtgta ggagactcaccaatacctggagctggagcctatgctgacgatactgcaggggcagccgca gccactgggaatggtgatatattgatgcgcttcctgccaagctaccaagctgtagaatac atgagaagaggagaagatccaaccatagcttgccaaaaagtgatttcaagaatccagaag cattttccagaattctttggggctgttatatgtgccaatgtgactggaagttacggtgct gcttgcaataaactttcaacatttactcagtttagtttcatggtttataattccgaaaaa aatcagccaactgaggaaaaagtggactgcatctaa >gi568815594r:177331711_177542375|GENSCAN_predicted_peptide_6|129_aa MPPSWAYTLAGRHAGGLMSRGAHQQKKKLLDIVKNTSVEEDTSYFSTRLSRARQRESTPI GAATPAGHQPAGRGRVWRRQSKESRGHGMSLSNLHLQKIITVPVGMEDNRNKSKVEAGTP TTSRCGQCG >gi568815594r:177331711_177542375|GENSCAN_predicted_CDS_6|390_bp atgcccccatcctgggcttataccctagcaggtagacacgcaggcggcctgatgtcaaga ggagcacatcagcagaaaaagaagctgctggacatcgtgaagaacacatcggtggaagaa gacacaagctatttctcgacaaggctgtcgagagcacgccagcgggagagcacgccaata ggcgctgccacgccagcaggccatcaaccggcgggacgaggcagagtttggcggaggcag tcaaaggagagccggggccatggaatgtcattatctaacttacatttgcaaaaaataatt acagttccagtgggcatggaagacaatagaaacaagtcaaaagtagaagcaggaacacca acaacaagccgctgtggccaatgtgggtga >gi568815594r:177331711_177542375|GENSCAN_predicted_peptide_7|77_aa MPEPPTPSVGSCAARASRMSATPCSTVPSPIDHPRAEECGPMAQDWQAAPPAAPVRDPLG EASWAPEFGGDVENLYV >gi568815594r:177331711_177542375|GENSCAN_predicted_CDS_7|234_bp atgcctgagcctcccaccccctccgtgggctcctgtgcagcccgggcctcccggatgagc gccaccccctgctccacggtgcccagtcccatcgaccacccaagggctgaggagtgcggg cccatggcacaggactggcaggcagctccacctgcagcccctgtgcgggatccactgggg gaagccagctgggctcctgagtttggtggggacgtggagaatctttatgtctag >gi568815594r:177331711_177542375|GENSCAN_predicted_peptide_8|336_aa MRRNQKTNPEGSSTPPKNHTSSPAIDPNQEEIPDLPEKEFKRLVIKLIREGPEKGKAQAR HVLEVLARAIRQEKEIKGIQISKEKIRLSLFDDDVIAYLENPKDSSRKLLELIKEFSKVS RIRKNNSKIHREPKKSLHSQSKIKQKNKSGGITLPDFKLYYKATVTKTARYWHKNRHIDQ CNRIENTEINPNTYVSGIDVQMCPEFLPSGGSVVALTSGVKLQTFTVSVTALKGGTSGLF IPSSGFMVSLASGLKLQTFTVSVTVHKGGMSGVVCSTCPELFVPPSGFMVSLASGVKLQT FVVSVTAHKGDADPNSEQQQNLLRRVKQQTFHGRGR >gi568815594r:177331711_177542375|GENSCAN_predicted_CDS_8|1011_bp atgagaaggaaccagaaaaccaaccctgaaggctcttcaacaccccccaaaaatcatact agttcaccagcaatagacccaaaccaagaagaaatccctgatttacctgaaaaagaattc aagaggttagttattaagctaatcagggagggaccagagaaaggcaaagcccaggcccgg catgtactggaagtcctagccagagcaatcagacaagagaaagaaataaagggcatccaa atcagtaaagagaaaatcagactgtcactgtttgatgatgatgtgattgcttaccttgaa aaccctaaggactcctccagaaagctcctagaactgataaaagaattcagcaaagtttcc agaattagaaaaaacaattctaaaattcatagggaaccaaaaaagagcctgcatagccaa agcaagattaagcaaaagaacaaatctggaggcatcacactacctgattttaaactatac tataaggccacagtcaccaaaacagcacggtactggcataaaaataggcacatagaccaa tgtaacagaatagagaacacagaaataaacccaaatacttatgtgtccggaattgatgtt cagatgtgtccagagtttcttccttctggtgggtccgtggtcgcactgacttcaggagtg aagctgcagaccttcacggtgagtgttacagctcttaaaggcggcacgtctggattgttc attccttccagtgggttcatggtctcgctggcctcaggactgaagctgcagactttcaca gtgagtgttacagttcataaaggtggcatgtccggagtggtttgctccacctgtccggag ttgttcgtccctcccagtgggttcatggtctcgctggcttcaggagtgaagctgcagacc tttgttgtgagtgtaacagctcataaaggtgacgcagacccaaacagtgagcagcaacaa aatttattgcgaagagtgaaacaacaaacgttccacgggcgtggaaggtga >gi568815594r:177331711_177542375|GENSCAN_predicted_peptide_9|213_aa MQMTSFIYLDAVVNSLSWDPINVPVPTLFFKPKHALPREEESREAVWPQWLCQAVVGSAQ SELSRGFVYTVRGKTPTQASVIVDASPPPRWSVPEFIPSGGFLVSLTSRTKPRTRTMSVT ALKDGVCRVCTFRCSDVSGVPSGEFMVLLTSGVKLQTFAVSVTALKGGTSGVVCSSQWVR GLADFRNEAADPHGLFECGDAMWKLGLLEVGYD >gi568815594r:177331711_177542375|GENSCAN_predicted_CDS_9|642_bp atgcagatgacatcatttatctatcttgacgcagttgtaaattctctctcatgggatcca ataaatgtccctgtccccactctcttttttaaacctaaacatgccctgcccagagaggag gaatctagagaggcagtctggccacagtggctttgccaagctgtggtgggctctgcccag tccgaactttccagaggctttgtttacactgtgaggggaaaaacgcctactcaagcgtca gtaattgtggacgcctctcccccaccaagatggagtgtcccagaatttattccttctggt gggttcttggtctcgctgacttcaagaacgaagccacggacccgcacgatgagtgttaca gctcttaaagatggtgtgtgcagagtttgtaccttcagatgttcagatgtgtctggagtt ccttctggtgagttcatggtcttgctgacttcaggagtgaagctgcagaccttcgcagtg agtgttacagctcttaaaggtggcacgtctggagttgtttgttcctcccagtgggttcgt ggtctcgctgacttcaggaatgaagccgcagaccctcatgggctcttcgaatgtggtgat gccatgtggaagctggggcttttggaagttggttatgattga