GENSCAN 1.0    Date run: 5-Nov-116    Time: 11:58:06

Sequence gi568815597f:192709072_192911593 : 202522 bp : 37.80% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5670   5764   95  2  2   96  110    53 0.879   8.20
 1.02 Term +   7096   7582  487  1  1   45   36   471 0.834  30.79
 1.03 PlyA +   7594   7599    6                               1.05

 2.06 PlyA -   7644   7639    6                               1.05
 2.05 Term -  17767  17615  153  1  0  118   53    93 0.635   5.74
 2.04 Intr -  20063  19916  148  1  1   40   78    51 0.285  -1.38
 2.03 Intr -  36634  36553   82  2  1   97   94    71 0.317   6.38
 2.02 Intr -  43127  42975  153  0  0   48   44   115 0.493   2.22
 2.01 Init -  43357  43243  115  1  1   83  113     9 0.706   3.38
 2.00 Prom -  48793  48754   40                              -4.95

 3.04 PlyA -  48962  48957    6                               1.05
 3.03 Term -  58224  57989  236  1  2  115   50    75 0.609   1.90
 3.02 Intr -  66190  66122   69  2  0   30  121    58 0.081   1.54
 3.01 Init -  74434  74227  208  1  1   30   88   162 0.798   9.43
 3.00 Prom -  77426  77387   40                              -6.95

 4.00 Prom +  82176  82215   40                              -3.65
 4.01 Init +  87425  87504   80  1  2   33   85    98 0.317   4.59
 4.02 Intr +  99976 100110  135  1  0    8   76   127 0.103   2.26
 4.03 Intr + 101095 101196  102  1  0   62  108    52 0.886   2.97
 4.04 Intr + 101299 101360   62  1  2   64  101    84 0.998   4.76
 4.05 Intr + 101910 102076  167  1  2  122   94    78 0.999  10.56
 4.06 Term + 102331 102525  195  0  0   97   44   150 0.994   7.93
 4.07 PlyA + 103175 103180    6                               1.05

 5.00 Prom + 110880 110919   40                              -5.15
 5.01 Init + 120413 120772  360  1  0   71   42   151 0.342   5.92
 5.02 Term + 128640 128753  114  2  0   46   43   154 0.537   4.29
 5.03 PlyA + 129163 129168    6                               1.05

 6.00 Prom + 130984 131023   40                              -5.25
 6.01 Init + 132524 132598   75  1  0   64   94    27 0.596   2.04
 6.02 Term + 133134 133274  141  2  0   81   41   129 0.970   4.45
 6.03 PlyA + 133996 134001    6                               1.05

 7.00 Prom + 135951 135990   40                              -6.95
 7.01 Init + 138713 138787   75  1  0   66  100    82 0.435   8.34
 7.02 Intr + 141462 141613  152  2  2  110   61    53 0.467   2.84
 7.03 Intr + 148823 148932  110  2  2   17   59   125 0.458   1.51
 7.04 Term + 154911 155020  110  1  2   97   47    87 0.744   3.19
 7.05 PlyA + 155187 155192    6                               1.05

 8.04 PlyA - 156442 156437    6                               1.05
 8.03 Term - 166537 166474   64  0  1  102   43    36 0.318  -3.12
 8.02 Intr - 167239 167135  105  0  0   60   87    92 0.634   4.71
 8.01 Init - 174285 174215   71  0  2   71  106    59 0.736   6.57
 8.00 Prom - 177911 177872   40                              -6.55

 9.03 PlyA - 178189 178184    6                               1.05
 9.02 Term - 182790 182132  659  1  2   31   54   260 0.006  10.02
 9.01 Intr - 199496 199367  130  2  1   85   84    32 0.081   1.85

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  71884  71724  161  2  2   31   54   124 0.946   0.42
S.002 Init + 177644 177747  104  1  2   68   89    67 0.804   4.57
S.003 Intr + 181916 182024  109  2  1   72   84    99 0.931   7.27

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:192709072_192911593|GENSCAN_predicted_peptide_1|193_aa
MQALRVPDSRTWPLDGISGLALCQKGAHCPEGGAATKMQIFVKTLMGKTITLEVELSDTI
DNVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGAKKRKKK
SYTTPRKNKHKRKKVKLALLKYYKVDENGKISCLHRECPSDECGAGVFMASHFDRHYCGK
CCLTYCFNKPEDK
>gi568815597f:192709072_192911593|GENSCAN_predicted_CDS_1|582_bp
atgcaggctcttagggttcctgattccaggacttggcccttggatggcatttctggactt
gccctgtgccagaagggagcccattgccctgaaggtggagccgccaccaaaatgcagatt
ttcgtgaaaacccttatggggaagaccatcaccctcgaggttgaactctcggatacaata
gataatgtaaaggccaagatccaggataaggaaggaattcctcctgatcagcagagactg
atctttgctggcaagcagttggaagatggacgtactttgtctgactacaatattcaaaag
gagtctactcttcatcttgtgttgagacttcgtggtggtgctaagaaaaggaagaagaag
tcttacaccactcccaggaagaataagcacaagagaaagaaggttaagctggctctcctg
aaatattataaggtggatgagaatggcaaaattagttgccttcatcgagagtgcccttct
gatgaatgtggtgctggggtgtttatggcaagccactttgacagacattattgtggcaaa
tgttgtctgacttactgtttcaacaaaccagaagacaagtaa
>gi568815597f:192709072_192911593|GENSCAN_predicted_peptide_2|216_aa
MSNSFSISHNSILFSWLCVRNGWVLGLTDFKNKAADLYAIKSSKSAAVHSSHPESFASPS
GFMVSLASGVKLQTFVVTVTAHKGGTNPKKTVKALELFFLEYEYSDKQAQCKISLYRSSS
KPNYFPKTPSQFSSHWELGLQLTDFCRDTIASTAVSEHKDLSPYTFVITQSSLLSLWEQL
LDLRIGPPQQNGDVLIPGWQENMVVHGKPVAGVVSI
>gi568815597f:192709072_192911593|GENSCAN_predicted_CDS_2|651_bp
atgagcaattccttttctatcagtcataactccatccttttctcatggttgtgtgtccgg
aatgggtgggttcttggtcttactgacttcaagaataaagctgcggacctttacgctatt
aaaagcagcaagtctgcagctgttcattcctcccatccagagtcgtttgcctctcccagt
gggttcatggtctcgctggcttcaggagtgaagctgcagacctttgtggtgactgttaca
gctcacaaaggcggcacgaacccaaagaaaacagtcaaagctcttgagctcttcttcctt
gagtacgagtacagtgacaaacaggcccaatgcaaaattagtctttaccggagctcatct
aaacctaattactttccaaagactccatcccagttctcatcacattgggaattagggctt
caacttacagatttttgcagggatacaattgcgtctacagctgtgtctgagcacaaagac
ctctctccctatacctttgttataactcagagctctctcctttctctgtgggagcaattg
ctggacctcaggattggacccccacagcaaaatggagatgtgctaattcctgggtggcag
gaaaatatggtagttcatgggaaacctgttgctggagtggtcagtatttga
>gi568815597f:192709072_192911593|GENSCAN_predicted_peptide_3|170_aa
MKEFEQQPPTLDLPFDRDYPNEKEPVLLPTLVIEQGSLTPPHPHKIKLVRQQWIQTKKKS
LIYLKNNSGDFYKQEHNLTLKALLKETPTFRQGPHSKLWNGASYFKSHSFPFVFTSILIK
VLTRSILVICSSAEACFHVLLTLKLKARQRTSSEQFQQNCHCSVTLLEAI
>gi568815597f:192709072_192911593|GENSCAN_predicted_CDS_3|513_bp
atgaaagaatttgaacaacagcctccaaccctagaccttccctttgacagagactaccca
aatgagaaggaaccagttctattaccaactctggtaatagaacaaggctctttaacaccc
ccacacccccacaaaatcaaactagttcgccagcaatggatccaaaccaagaagaaatcc
ctgatttacctgaaaaataattcaggagatttctacaaacaggaacacaatctcacccta
aaagcactgctaaaggagactccaacattcagacaaggaccacattcaaaactgtggaat
ggtgcatcatatttcaagtctcatagcttcccttttgtcttcacttccatcctcatcaag
gtccttactagaagcatcttggtcatttgttcctctgccgaggcttgttttcatgtgctg
ctgacactaaaacttaaggccaggcaaagaacctcctctgaacaattccagcagaactgc
cattgttccgtgaccctgctggaggcgatatag
>gi568815597f:192709072_192911593|GENSCAN_predicted_peptide_4|246_aa
MLAEILELLIAAGISGQDVMAYQLLASRGSSGRTIMQSAMFLAVQHDCRPMDKSAGSGHK
SEEKREKMKRTLLKDWKTRLSYFLQNSSTPGKPKTGKKSKQQAFIKPSPEEAQLWSEAFD
ELLASKYGLAAFRAFLKSEFCEENIEFWLACEDFKKTKSPQKLSSKARKIYTDFIEKEAP
KEINIDFQTKTLIAQNIQEATSGCFTTAQKRVYSLMENNSYPRFLESEFYQDLCKKPQIT
TEPHAT
>gi568815597f:192709072_192911593|GENSCAN_predicted_CDS_4|741_bp
atgttggctgaaattctggagctcttaattgctgctggtatttcaggacaagacgtaatg
gcttatcagcttttggcaagccggggctccagcgggagaacgataatgcaaagtgctatg
ttcttggctgttcaacacgactgcagacccatggacaagagcgcaggcagtggccacaag
agcgaggagaagcgagaaaagatgaaacggacccttttaaaagattggaagacccgtttg
agctacttcttacaaaattcctctactcctgggaagcccaaaaccggcaaaaaaagcaaa
cagcaagctttcatcaagccttctcctgaggaagcacagctgtggtcagaagcatttgac
gagctgctagccagcaaatatggtcttgctgcattcagggcttttttaaagtcggaattc
tgtgaagaaaatattgaattctggctggcctgtgaagacttcaaaaaaaccaaatcaccc
caaaagctgtcctcaaaagcaaggaaaatatatactgacttcatagaaaaggaagctcca
aaagagataaacatagattttcaaaccaaaactctgattgcccagaatatacaagaagct
acaagtggctgctttacaactgcccagaaaagggtatacagcttgatggagaacaactct
tatcctcgtttcttggagtcagaattctaccaggacttgtgtaaaaagccacaaatcacc
acagagcctcatgctacatga
>gi568815597f:192709072_192911593|GENSCAN_predicted_peptide_5|157_aa
MGKDFMTKTPKEMATMAKIDKWDLIKLKPFCTAKQTIIRVNRQRIEWEKIFAIYPSDKGL
ISRIHKELKQIYKKKTNNPIKKWAKDMNRHFSKQDIYVANNHMEKSSSSLVIREMQIKTT
DTCRLPVEQAQIGPKEYSREQLQPTTNGSSGINIPVG
>gi568815597f:192709072_192911593|GENSCAN_predicted_CDS_5|474_bp
atgggcaaagacttcatgactaaaacaccaaaagaaatggcaacaatggccaaaatagac
aaatgggatctaattaaactaaaacccttctgcacagcaaaacaaactatcatcagagtg
aacaggcaacgtatagaatgggaaaaaatttttgcaatctacccatctgacaaagggcta
atatccagaatccacaaggaacttaaacaaatttacaagaaaaaaacaaacaaccccatc
aaaaagtgggcaaaggatatgaacagacacttctcaaaacaagacatttatgtggccaac
aaccatatggaaaaaagctcatcatcattggtcattagagaaatgcaaatcaaaaccaca
gacacatgtcgtttgcctgtggagcaagcccaaattggtccaaaggaatactcccgggag
caacttcaacctacgactaatggaagttcaggtataaatatacccgttggatag
>gi568815597f:192709072_192911593|GENSCAN_predicted_peptide_6|71_aa
MAFEFGLELGVGVLPVEVREREKELRTYINRMEISSLAAVRKDAPERRDAKTKCRRLKNN
PGNMKSKKMYL
>gi568815597f:192709072_192911593|GENSCAN_predicted_CDS_6|216_bp
atggcatttgagtttggccttgaattaggggtaggagttctcccagtggaagtaagggag
agagaaaaagagctaaggacttacataaacagaatggaaatatcaagtctggcagcagtc
cgtaaggatgcaccggaaagaagagatgctaagactaaatgtagaagacttaagaacaat
cctggcaacatgaaatccaaaaaaatgtatttgtaa
>gi568815597f:192709072_192911593|GENSCAN_predicted_peptide_7|148_aa
MGGVQFHTGWLKKASQKGDTGDLKKFHPLLIILLSNPVKSNSRIWIGGGGRVHERWGEEG
CDYERELRNEERRVCNSLRTGHDDNGGFVEWKGGKGGERIERLDGCRVCVERDTEETKTS
QDHPMPGRDLSCMEIISQYDHPLLEEHL
>gi568815597f:192709072_192911593|GENSCAN_predicted_CDS_7|447_bp
atggggggagtacaatttcatactggatggttaaagaaggcctctcagaaaggagacact
ggagacttgaagaagttccacccactcctcatcattctcctcagcaatcctgtcaaaagt
aatagtagaatatggattggagggggtggaagagtacatgagaggtggggagaggaagga
tgtgattatgaaagggagctgagaaacgaggagagacgtgtttgcaactcattgagaacg
ggccatgatgacaatggcggttttgtggaatggaaaggggggaaaggtggggaaaggatt
gagagattggatggttgccgtgtctgtgtagagagagacactgaggagacgaagacaagc
caagatcatcctatgcctggaagagatctcagttgtatggaaataatcagccaatatgac
cacccacttctggaagagcatctctaa
>gi568815597f:192709072_192911593|GENSCAN_predicted_peptide_8|79_aa
MDQYWSVARRVGSPLLNPYKNSAKGKVLKEILIVKDPWSNGHIFQLGFLVGIQLLSVEKL
RSKAQFRSLILTQPSYYFD
>gi568815597f:192709072_192911593|GENSCAN_predicted_CDS_8|240_bp
atggaccagtactggtcagtggcccggagggtggggtcccctctgctaaatccttacaaa
aattctgcaaaagggaaagttcttaaagagattctaatagtcaaagatccatggagcaac
ggacacatctttcagttaggatttctggttggaatccagttgcttagtgttgaaaaatta
agatcaaaggcacagttcaggtcacttatcctcactcaaccgagctactattttgattag
>gi568815597f:192709072_192911593|GENSCAN_predicted_peptide_9|262_aa
GSDFKHLRLSTWAFLLAHAWSSPGKIGAMALRVDLSQQRKELESAATQTIAVDLGFLLQG
AGSSLTWAQLQLPTWLQAQVSLHSLEPGKAPMPLDAQKCLFPLPNFSLLWVPAGISQQRQ
GEPGCHEHLWEADRFLGRGGGPSKAPPSGQGGLEGWGLGCYSTVQSGDLWCLFQQPMAAY
GPIGMHFLPSEAHKSPGLSQSRAEDGDHRMTSCREELPSLLRAGMISCREELPSLLRASE
TCRGTGTTSCREEPPNPGPPLC
>gi568815597f:192709072_192911593|GENSCAN_predicted_CDS_9|789_bp
ggttctgacttcaagcacctgagactctctacctgggcctttcttttggcccatgcctgg
agcagtccaggaaagattggagctatggccttgagagtagacctcagccaacaaaggaag
gagttggagtctgcagccacccaaactatagctgtagacctgggcttcctgctccaggga
gcaggaagcagcctgacctgggcgcagctacagctgcccacatggctgcaggctcaggta
tctctgcactctttggagcccgggaaggccccaatgcctttggatgctcagaagtgcctc
ttcccactgcctaacttctccctcctgtgggtgcctgctgggatctcacagcaaagacag
ggggagcctgggtgtcatgaacatctgtgggaggcagacagattcctgggcagagggggt
ggtcccagtaaggccccaccttcaggccagggaggtcttgaaggctgggggttgggctgc
tattccactgtccagagtggggacttgtggtgccttttccagcagcccatggctgcctat
ggaccaatcggcatgcacttccttccctctgaggcccataaaagccctggtctcagccag
agcagggcagaggatggagaccacaggatgaccagctgcagagaggagctaccttctctt
ctaagagctgggatgatcagctgcagagaggaactaccttctctgctgagagcttcagag
acctgcagaggcactgggactaccagctgcagagaggagccaccaaatccagggcctcct
ctctgctga