GENSCAN 1.0 Date run: 8-Nov-116 Time: 06:22:54 Sequence gi568815596f:60660134_60897157 : 237024 bp : 43.22% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1629 1707 79 2 1 85 76 79 0.732 7.62 1.02 Term + 18980 19125 146 0 2 49 42 124 0.329 1.97 1.03 PlyA + 19405 19410 6 1.05 2.03 PlyA - 22572 22567 6 1.05 2.02 Term - 22882 22668 215 2 2 39 43 231 0.997 11.09 2.01 Init - 23111 22964 148 2 1 86 42 169 0.999 12.35 2.00 Prom - 31214 31175 40 -5.66 3.00 Prom + 32416 32455 40 -4.86 3.01 Init + 32688 32797 110 2 2 80 92 36 0.849 2.89 3.02 Intr + 34289 34451 163 2 1 109 59 122 0.887 11.38 3.03 Intr + 36433 36481 49 0 1 64 77 6 0.103 -4.65 3.04 Term + 51325 51677 353 2 2 45 41 189 0.014 4.55 3.05 PlyA + 51783 51788 6 1.05 4.03 PlyA - 51803 51798 6 1.05 4.02 Term - 52789 52685 105 1 0 103 48 68 0.118 2.71 4.01 Init - 57287 57234 54 2 0 83 53 73 0.137 2.58 4.00 Prom - 66731 66692 40 -5.06 5.03 PlyA - 67860 67855 6 1.05 5.02 Term - 75405 74757 649 2 1 50 35 239 0.928 8.35 5.01 Init - 75585 75422 164 0 2 80 26 178 0.950 10.00 5.00 Prom - 79567 79528 40 -0.96 6.00 Prom + 89667 89706 40 -1.86 6.01 Init + 96477 96631 155 2 2 86 36 123 0.927 4.36 6.02 Intr + 100001 100162 162 2 0 77 91 164 0.999 14.69 6.03 Intr + 101608 101674 67 1 1 80 116 61 0.998 7.01 6.04 Intr + 110325 110378 54 2 0 97 100 61 0.937 7.38 6.05 Intr + 120811 120864 54 0 0 98 66 21 0.435 0.08 6.06 Intr + 121752 121872 121 2 1 86 92 60 0.796 6.27 6.07 Intr + 122553 122637 85 1 1 98 84 1 0.837 -0.42 6.08 Intr + 126814 126933 120 1 0 86 89 38 0.691 3.31 6.09 Intr + 127378 127487 110 1 2 60 110 16 0.429 0.93 6.10 Intr + 133838 134058 221 0 2 120 98 -3 0.493 1.92 6.11 Intr + 134831 134887 57 1 0 104 87 48 0.976 5.38 6.12 Term + 136929 137027 99 2 0 83 42 146 0.996 7.63 6.13 PlyA + 138806 138811 6 1.05 7.00 Prom + 139858 139897 40 -4.76 7.01 Init + 144715 144762 48 0 0 74 80 109 0.711 7.70 7.02 Intr + 157894 158016 123 0 0 46 69 60 0.199 0.68 7.03 Term + 161723 161857 135 1 0 79 42 123 0.622 4.82 7.04 PlyA + 166945 166950 6 1.05 8.04 PlyA - 167330 167325 6 1.05 8.03 Term - 180929 180725 205 1 1 106 50 153 0.902 10.14 8.02 Intr - 181186 181045 142 2 1 85 47 50 0.635 0.01 8.01 Init - 201590 201536 55 2 1 81 64 39 0.245 2.25 8.00 Prom - 205552 205513 40 -5.46 9.00 Prom + 216704 216743 40 -3.96 9.01 Init + 221708 221717 10 1 1 109 117 11 0.642 5.72 9.02 Intr + 231550 231692 143 2 2 96 90 56 0.924 6.67 9.03 Intr + 234264 234412 149 2 2 79 81 98 0.295 7.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:60660134_60897157|GENSCAN_predicted_peptide_1|74_aa MGPLESTMRWVLKKEAEGGNRYAICRSPSQGKIPEKSWLLLKHKYTPEIGVWQKFLLPAA GVSPNSTNVQVPQC >gi568815596f:60660134_60897157|GENSCAN_predicted_CDS_1|225_bp atggggcctttggagagtaccatgcgatgggtgttgaagaaggaagcagaagggggcaat cgctatgcgatatgcagatccccttctcaaggaaagatcccagagaaatcctggctgctc ctcaagcacaagtacacccccgaaataggagtgtggcagaaattcctccttcccgcagct ggtgtctctccaaactcaaccaatgtccaagtcccccagtgctga >gi568815596f:60660134_60897157|GENSCAN_predicted_peptide_2|120_aa MNHAVQTVFTPANTSHPLNYEMLKEEHEVAVLRVPHNPVPPTSTVIHIHTFTYSVKSRDR KMVGDLTGAQAYASTAKCLNVCTLILGIRVTTTLIILFTSSSMIIFQAISQMIKNLQGQQ >gi568815596f:60660134_60897157|GENSCAN_predicted_CDS_2|363_bp atgaaccacgctgtccaaaccgtcttcactcccgccaacaccagccaccccctcaactat gaaatgctcaaggaggagcatgaggtggctgtgctgagggtgccccacaaccctgttccc ccgacatccaccgtgatccacatccacacattcacctactccgtgaagtctagggacagg aagatggttggcgacctgactggggcccaggcctacgcctccactgccaagtgcctgaac gtttgcaccctcatcctgggcatccgtgtgaccacaacactcatcatccttttcaccagt agttctatgataatattccaagcaatttctcaaatgataaagaatctccaaggccagcag tag >gi568815596f:60660134_60897157|GENSCAN_predicted_peptide_3|224_aa MNQFWNRQCWWLHNIVNVYNATQSYTQNGYSSKHFVMKIRDSKDAMLFRIKQERERTGGS SVKDYYGPGLEEAHITSSHPTGQNSAAWEAEGFTEKSKKVCFASLGKAITKGGAAKMKYN PFVTSDRSKNCKRHFNAPSHTRRKIMSSPLSKELRQKYNVRSKTIQKDDEVQVARGHCKG QQIGQIIQVYRKKYAIYIERVQWEKANSIAVPVGIHPARWLSLD >gi568815596f:60660134_60897157|GENSCAN_predicted_CDS_3|675_bp atgaaccagttctggaatagacagtgctggtggctgcacaacattgtgaatgtgtataac gccactcaatcatacactcaaaatggttacagtagtaaacattttgttatgaaaatcaga gattctaaggacgccatgctcttccgcatcaagcaggaaagagaaagaactggaggctca tctgtaaaagattattacggtccaggcctggaagaggcacacatcacgagttcacatccc actggccagaactcagctgcttgggaagctgagggatttactgagaaatccaagaaagtc tgttttgcatccttaggcaaagccatcaccaaaggtggagcagccaaaatgaagtacaat ccctttgtgacttctgaccgaagcaagaactgcaaaaggcatttcaatgcaccttcccac actcgcagaaagattatgtcttcccctctttccaaagagctgagacaaaagtacaatgtt cgatcgaagaccatccaaaaggatgacgaagttcaggttgcgcgaggacactgtaagggt cagcaaattggccaaataatccaggtttacaggaagaaatatgccatctacattgaacgg gtgcagtgggaaaaggctaacagcatagctgtccctgtgggcattcacccagcaaggtgg ttatcactagactaa >gi568815596f:60660134_60897157|GENSCAN_predicted_peptide_4|52_aa MGRMRWLTPVIPALSEAEVMLMQEVGSHGLGQLHPCGFAGIAPLLAAFTGWH >gi568815596f:60660134_60897157|GENSCAN_predicted_CDS_4|159_bp atgggccggatgcggtggctcacacctgtaatcccagcactttcggaggccgaggtcatg ctgatgcaagaggtgggctcccacggccttggacagctccacccctgtggctttgcaggt atagcccctctcctggcggctttcactggctggcattga >gi568815596f:60660134_60897157|GENSCAN_predicted_peptide_5|270_aa MKKESLNQSLAEWKLFIYNPTTGEFLWRTAKSWGLILLFYLIFYGFLAALFSFTILNDEV PKYRDQILNPGLMVFPKPVTALEYAFSRSDLTSYAGYTEDLKKFLKPYTLEEQKNLTVCP DGALFEQKGPVYVACQSPISLLQACRGMNDLDFGYSQRNPCILVKMNRIIGLKSEGVPRI DCVSKNEDIPNVAVYPHNGIIDLKYFPYYGEKLHVGYLQPLVAVQVSFAPNDTGKEVTVE CKIDGSANLKSQDDRDKFLGRVMFKITACA >gi568815596f:60660134_60897157|GENSCAN_predicted_CDS_5|813_bp atgaagaaggagtccctcaaccagagcctcgccgagtggaagctcttcatctacaacccg accaccggagagttcctgtggcgcaccgccaagagctggggtttgatcttgctcttctac ctaattttttatgggttcctggctgcactcttctcattcacgatactcaacgatgaggtt ccaaaataccgtgaccagattcttaacccaggactgatggtttttccaaaaccagtcact gcattggaatatgcattcagtaggtctgatctaacttcgtatgcagggtacactgaagac cttaagaagtttctaaaaccatatactttagaagaacagaaaaacctcacagtctgtcct gatggagcactttttgaacagaagggtccagtttatgttgcttgtcaatctcccatttcg ttacttcaagcatgccgtggtatgaatgatcttgatttcggctattctcaaagaaaccct tgtattcttgtgaaaatgaacagaataattggattaaagtctgaaggagtgccaaggata gattgtgtttcgaagaatgaagatataccaaatgtagcagtttatcctcataatggaatt atagacttaaaatatttcccatattatggggaaaaactgcatgtggggtatctacagcca ttggttgctgttcaggtcagctttgctcctaacgatactgggaaagaagtaacagttgaa tgcaagattgatggatcagccaaccttaaaagtcaggatgatcgtgacaagtttttggga cgagttatgttcaaaatcacagcatgtgcatag >gi568815596f:60660134_60897157|GENSCAN_predicted_peptide_6|434_aa MVSFPAPGRSARAPPLNPAAHLATPARTRGLRESPAVPPPRTPSEKDEEGKRNTVLDSQR QQKHYGITSPISLASPKEIDHIYTQKLIDAMKPFGVFEDEEELNHRLVVLGKLNNLVKEW ISDVSESKAVEDAFVPVIKFEFDGIEMIRITNADPNVMSLKIAHVNPSDRYHLMPIITPA YPQQNSTYNVSTSTRTVMVEEFKQGLAVTDEILQGKSDWSKLLEPPNFFQKYRVGLVESK IRVLVGNLERNEFITLAHVNPQSFPGNKEHHKDNNYVSMWFLGIIFRRVENAESVNIDLT YDIQSFTDTEVDSTVKTVSPPTVCTIPTVVGRNVIPRITTPHNPAQGQPHLNGMSNITKT VTPKRSHSPSIDGTPKRLKDVEKDAIGGESMPIPTIDTSRKKRLPSKELPDSSSPVPANN IRVIKNSIRLTLNR >gi568815596f:60660134_60897157|GENSCAN_predicted_CDS_6|1305_bp atggtctccttcccggctcccggccggtccgcaagggcacctcccctaaacccggcggct cacctggccacccctgcacggactcggggactccgggagtccccagctgtcccgcctccg aggacgccctcagaaaaggatgaggagggaaagagaaacaccgtgctggacagccagcgt caacaaaagcattatggaattacctccccaattagtttggcatctcctaaagaaattgat catatttacacacagaaattaattgacgccatgaaaccatttggagtgtttgaagatgag gaagaattgaaccacaggctggtggttcttggtaaattgaacaatttagtaaaagaatgg atttctgatgtcagcgagagtaaggctgtagaagatgcctttgtacctgttataaaattt gaatttgatggtattgaaatgattaggatcactaatgctgatccaaatgttatgtcatta aaaatcgcccatgtaaatccatcagataggtatcatctcatgcccataatcacccctgcc tacccacaacagaattctacgtataatgtgtccacatcaactcgaacagtaatggtagaa gaatttaaacaaggtcttgcagtcacagatgaaattcttcaaggaaagtcagattggtcc aaactacttgagccaccgaatttctttcaaaagtatagggttggattagtagaatctaaa atccgtgtacttgttggaaacttggaacggaatgaatttattactcttgcccatgtgaat ccccagtcattcccagggaataaggaacatcataaagacaacaattacgtatcaatgtgg ttccttgggataatttttcggagagtagaaaatgcagaaagtgtcaacatagacttgaca tatgatatacagtcatttactgatacagaagttgactctacagtaaaaactgtatcaccc cccactgtgtgtaccattcctaccgtagtaggacgaaatgtcattcctagaatcacaaca cctcacaaccctgcccagggacaaccgcatctgaatggaatgtcaaatataactaagact gttacacctaagagatcccattccccatccatagatgggactcctaagaggttgaaagac gtagaaaaggatgccattggaggagaatctatgcctattccaactattgatacatcacgc aaaaagagactacccagtaaagaactaccagattcatcatctccagttccagcaaacaac atccgtgtcatcaaaaattccattcgactgacccttaatcggtaa >gi568815596f:60660134_60897157|GENSCAN_predicted_peptide_7|101_aa MAHAALAAPSLLQPLQHSPINFLCTNCRRGNCFLDNATRDAQNNSFDGFPRRLKDLWVVA QENLKSPRQPKDYYKNARSSFIHNNHKLETIEMNFNKTINI >gi568815596f:60660134_60897157|GENSCAN_predicted_CDS_7|306_bp atggcacacgcagccctggccgcaccttccctgctgcagccgctccagcattccccaata aactttctgtgcacaaactgccgtcgtggaaactgtttcctggataatgcaacccgtgat gctcagaataattcattcgatggcttcccaaggaggttgaaggatctatgggtagttgcc caagaaaacctcaagtccccaagacagccaaaagactattacaagaatgctcgcagcagc tttattcataataaccacaaactggaaacaatcgaaatgaacttcaacaagacaatcaat atataa >gi568815596f:60660134_60897157|GENSCAN_predicted_peptide_8|133_aa MLLNRNVAEERSQSQNWPANPIHLAAIKPVPVFLNPGWGALKNACAQVSLQANLITPSVD GSRASRLCEAELGPLASASLDLKVFPEGQFGLACRYLCRPQQCTENTPSQHAHKSGSSAK HQTPTLLPMPDAK >gi568815596f:60660134_60897157|GENSCAN_predicted_CDS_8|402_bp atgctgctcaacaggaatgtggcagaggaaagaagccagagccaaaactggccagccaac ccgattcatcttgctgcaatcaaacctgtaccagtgtttctcaaccctggctggggagct cttaaaaatgcctgtgcccaggtgtctctccaggccaatctaatcacaccctctgtggat gggtcccgggcatccagactctgtgaagcagaactcgggcctctagcgtctgctagtcta gatctaaaggtgtttcctgagggacagtttggcctggcatgcaggtacctctgcagacca caacagtgcaccgaaaacaccccctcccagcacgcacacaagtctggctcctcagccaaa catcaaacaccaacactgctgcccatgccagatgccaagtga >gi568815596f:60660134_60897157|GENSCAN_predicted_peptide_9|101_aa MASGAYNPYIEIIEQPRQRGMRFRYKCEGRSAGSIPGEHSTDNNRTYPSIQIMNYYGKGK VRITLVTKNDPYKPHPHDLVGKDCRDGYYEAEFGQERRPLF >gi568815596f:60660134_60897157|GENSCAN_predicted_CDS_9|303_bp atggcctccggtgcgtataacccgtatatagagataattgaacaacccaggcagagggga atgcgttttagatacaaatgtgaagggcgatcagcaggcagcattccaggggagcacagc acagacaacaaccgaacatacccttctatccagattatgaactattatggaaaaggaaaa gtgagaattacattagtaacaaagaatgacccatataaacctcatcctcatgatttagtt ggaaaagactgcagagacggctactatgaagcagaatttggacaagaacgcagacctttg ttn