GENSCAN 1.0 Date run: 7-Nov-116 Time: 02:01:53 Sequence gi568815594f:74345346_74552634 : 207289 bp : 37.14% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 19925 20030 106 0 1 94 106 136 0.467 14.35 1.02 Intr + 25498 25532 35 1 2 133 44 12 0.122 -1.75 1.03 Intr + 34103 34189 87 0 0 71 87 30 0.263 0.22 1.04 Intr + 35669 35792 124 0 1 75 90 88 0.602 6.42 1.05 Intr + 37300 37449 150 1 0 123 115 83 0.999 12.86 1.06 Term + 39382 39463 82 1 1 73 42 108 0.925 0.89 1.07 PlyA + 41082 41087 6 1.05 2.00 Prom + 43472 43511 40 -6.95 2.01 Init + 47870 47912 43 1 1 99 86 39 0.750 5.43 2.02 Intr + 53170 53285 116 2 2 43 -12 143 0.055 -0.95 2.03 Intr + 68009 68147 139 1 1 -22 19 299 0.473 11.12 2.04 Intr + 71188 71212 25 2 1 82 113 2 0.359 -1.73 2.05 Term + 76378 76621 244 1 1 61 48 231 0.461 10.79 2.06 PlyA + 77967 77972 6 1.05 3.02 PlyA - 78525 78520 6 1.05 3.01 Sngl - 93082 92738 345 1 0 40 49 212 0.294 8.29 3.00 Prom - 97151 97112 40 -5.15 4.00 Prom + 97516 97555 40 -7.95 4.01 Init + 99733 100033 301 0 1 88 -2 303 0.358 17.76 4.02 Intr + 101189 101437 249 0 0 120 105 323 0.999 33.69 4.03 Intr + 103702 103903 202 2 1 80 108 56 0.997 4.22 4.04 Intr + 105035 105187 153 2 0 52 100 175 0.989 13.37 4.05 Intr + 109414 109569 156 1 0 98 93 58 0.518 5.50 4.06 Intr + 132928 133096 169 1 1 47 107 90 0.058 5.83 4.07 Intr + 137951 138122 172 1 1 41 81 124 0.134 5.59 4.08 Intr + 154560 154685 126 1 0 -10 60 169 0.005 4.03 4.09 Intr + 154736 154909 174 0 0 12 63 140 0.038 2.89 4.10 Term + 160967 161346 380 0 2 38 37 183 0.201 2.17 4.11 PlyA + 161613 161618 6 1.05 5.00 Prom + 162129 162168 40 -4.75 5.01 Sngl + 164710 165315 606 0 0 74 49 211 0.799 11.74 5.02 PlyA + 167080 167085 6 1.05 6.07 PlyA - 167393 167388 6 1.05 6.06 Term - 180291 180231 61 2 1 68 49 59 0.140 -3.70 6.05 Intr - 185882 185796 87 1 0 79 97 30 0.180 1.17 6.04 Intr - 186007 185914 94 2 1 101 80 54 0.234 4.00 6.03 Intr - 194517 194363 155 2 2 70 20 120 0.269 2.19 6.02 Intr - 195255 195153 103 1 1 98 26 74 0.199 0.51 6.01 Intr - 205618 205547 72 2 0 100 106 26 0.349 4.06 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:74345346_74552634|GENSCAN_predicted_peptide_1|194_aa XALRQAPAPAPIADDRGEEDGDALCRQGPCAAALPGDKAAQLQGNKISFHLLQAVLSTTV IPSCIPGESSDNCTALVQTEDNPRVAQVSITKCSSDMNGYCLHGQCIYLVDMSQNYCRCE VGYTGVRCEHFFLTVHQPLSKEYVALTVILIILFLITVVGSTYYFCRWYRNRKSKEPKKE YERVTSGDPELPQV >gi568815594f:74345346_74552634|GENSCAN_predicted_CDS_1|585_bp nccgccctccgccaagccccagcgcccgctcccatcgccgatgaccgcggggaggaggat ggagatgctctgtgccggcagggtccctgcgctgctgctctgcctggagataaggcagct cagttgcagggaaacaaaataagtttccatcttctacaggcagtcctcagtacaactgtg attccatcatgtatcccaggagagtccagtgataactgcacagctttagttcagacagaa gacaatccacgtgtggctcaagtgtcaataacaaagtgtagctctgacatgaatggctat tgtttgcatggacagtgcatctatctggtggacatgagtcaaaactactgcaggtgtgaa gtgggttatactggtgtccgatgtgaacacttctttttaaccgtccaccaacctttaagc aaagaatatgtggctttgaccgtgattcttattattttgtttcttatcacagtcgtcggt tccacatattatttctgcagatggtacagaaatcgaaaaagtaaagaaccaaagaaggaa tatgagagagttacctcaggggatccagagttgccgcaagtctga >gi568815594f:74345346_74552634|GENSCAN_predicted_peptide_2|188_aa MAEAKESLQVKHSKGVKLQTLAVSVVAHKGSVDPKSEQQQDLLQTVKEQCFHSLHGDDDG DGGSDTDDYGDDDSGNGDNGDYNGGEGSDDDENKNGDDDGLLDKCVLSDLDLFYHRHYEE EGEEEKKGEGKEEGEEEGKGKEDGEEKRKKKKEEKEDANGEEEEKEEKEGRREEERKERR TKERKHIH >gi568815594f:74345346_74552634|GENSCAN_predicted_CDS_2|567_bp atggctgaagccaaagaatcacttcaagttaaacactccaaaggagtgaagctgcagacc ttggcggtaagtgttgtagctcataaaggcagtgtggacccaaagagcgagcagcagcaa gatttattgcaaacagtgaaagaacaatgcttccacagcttacatggtgatgatgatggt gatggtggtagtgatactgatgactatggtgatgatgacagtggcaatggtgataatggt gattataatggtggtgagggtagtgatgatgatgagaataagaatggtgatgatgatggc ttactagataagtgtgtgctaagtgatttagacttattttaccatcgtcactatgaagaa gaaggagaggaagaaaaaaaaggagaaggaaaagaggaaggggaagaggaggggaagggg aaggaagatggggaggaaaagagaaagaagaaaaaagaagaaaaggaggatgctaatggg gaggaggaggaaaaggaagagaaagagggaagaagagaggaagaaaggaaggaaagaagg acaaaagaaagaaaacacatacattaa >gi568815594f:74345346_74552634|GENSCAN_predicted_peptide_3|114_aa MVYNYRDLEEGKREQSKEKQQAKIMAAIIGDALNAQKASKGNPKGHKDNANRGSCFKCKE PGHWTKDYTKPLPKPGQKCEGASYDPWHWRCWLPPLPPRSSVRQNSSSAKRGIR >gi568815594f:74345346_74552634|GENSCAN_predicted_CDS_3|345_bp atggtgtataactaccgtgatctggaagaaggaaaaagggaacagagtaaagaaaaacag caagccaaaattatggcagccatcattggtgatgccctgaatgcccaaaaagcatccaag ggaaacccaaagggccacaaagataatgccaacaggggctcttgtttcaagtgcaaggaa cctgggcattggacaaaggactataccaagcctctgccaaaacccggccaaaaatgtgag ggtgccagttatgatccttggcactggaggtgttggctgcctccactcccaccaaggagc tcagtcaggcaaaactccagcagtgctaaaagaggaatcagatga >gi568815594f:74345346_74552634|GENSCAN_predicted_peptide_4|693_aa MGCGPLPAEPIKRQVRAALQTFAHLGASAPEVPGQPEAPRPPPRAPQAFESGAHSRSPLA LPTPARFGGSSCPRDRVAPETETPPLRRTNESPAATAGAGGHYAAGLDLNDTYSGKREPF SGDHSADGFEVTSRSEMSSGSEISPVSEMPSSSEPSSGADYDYSEEYDNEPQIPGYIVDD SVRVEQVVKPPQNKTESENTSDKPKRKKKGGKNGKNRRNRKKKNPCNAEFQNFCIHGECK YIEHLEAVTCKCQQEYFGERCGEKSMKTHSMIDSSLSKIALAAIAAFMSAVILTAVAVIT VQISHWSHCQVIAINDESVLFPVDHKTMDPFCYDGFKLSIVTFYAISVYKGARRYPLSTV QDTETTNISINNFSLTTENLLSTIGIMVKKVDKAPTLKKFTDDCRRNKTERTSYPGILGT EKSHEERVEESRDKNLLCPFYRGQKSSTLREHNAAIAQIAFLQVMLSGDSTGTALGKKSC SGGSDSMKQGTVGIQYMELIGRSSSSGSRVVAQALGVQVQQQQGPSNGKMPLWFGPWRAR NTAVMTVLPGEAEYLSSASAGDSTVPVGHQHQRPKVDKTTKMRRNQSRKAENSKKQSTSS PPRDHSSSPAREQNWTENEFDKVTEVGFRRSVITNFSELKKHVLNQHKEDKNLERRLDEW LTRINSVEKTLNYLMELKTMAQELRDTYTSINS >gi568815594f:74345346_74552634|GENSCAN_predicted_CDS_4|2082_bp atgggctgcggccccctcccggctgagcctataaagcggcaggtgcgcgccgccctacag acgttcgcacacctgggtgccagcgccccagaggtcccgggacagcccgaggcgccgcgc ccgccgccccgagctccccaagccttcgagagcggcgcacactcccggtctccactcgct cttccaacacccgctcgttttggcggcagctcgtgtcccagagaccgagttgccccagag accgagacgccgccgctgcgaaggaccaatgagagccccgctgctaccgccggcgccggt ggccattatgctgctggattggacctcaatgacacctactctgggaagcgtgaaccattt tctggggaccacagtgctgatggatttgaggttacctcaagaagtgagatgtcttcaggg agtgagatttcccctgtgagtgaaatgccttctagtagtgaaccgtcctcgggagccgac tatgactactcagaagagtatgataacgaaccacaaatacctggctatattgtcgatgat tcagtcagagttgaacaggtagttaagcccccccaaaacaagacggaaagtgaaaatact tcagataaacccaaaagaaagaaaaagggaggcaaaaatggaaaaaatagaagaaacaga aagaagaaaaatccatgtaatgcagaatttcaaaatttctgcattcacggagaatgcaaa tatatagagcacctggaagcagtaacatgcaaatgtcagcaagaatatttcggtgaacgg tgtggggaaaagtccatgaaaactcacagcatgattgacagtagtttatcaaaaattgca ttagcagccatagctgcctttatgtctgctgtgatcctcacagctgttgctgttattaca gtccagatatcacattggagtcactgccaagtcatagccataaatgatgagtcggtcctc tttccagtggatcataagacaatggaccctttttgttatgatggttttaaactttcaatt gtcactttttatgctatttctgtatataaaggtgcacgaaggtatccattgagcactgta caagatactgaaacaaccaacatttcaatcaacaacttttcattgaccactgaaaatctc ctaagtactataggtataatggtgaagaaggtagataaagcccctactcttaagaagttt acagacgattgtagaaggaataaaaccgagagaacaagttatccaggaattctaggtacc gaaaagagtcatgaagaaagggtggaggaaagtagagataagaatctcttatgtcctttt tatcggggacagaaatcttcaacattaagagagcacaatgctgctattgcacagatagcc tttctccaggtgatgttatctggggacagcactggtacagctctggggaagaagagctgc tctggaggctcagactccatgaagcagggcacagttggaattcagtacatggaactaata ggacgcagcagcagctcaggctccagggtggtagcccaggctctgggagttcaggtgcag cagcaacaaggccccagtaatggcaaaatgcctctgtggtttgggccctggagagcaagg aacactgcagtgatgactgtactccctggagaggcagagtacctcagcagcgcaagtgcc ggggatagtacagttccagtaggtcaccaacatcaaagaccaaaggtagataaaaccaca aagatgaggagaaaccagagcagaaaagctgaaaattccaaaaaacagagcacttcttct cctccaagggatcacagctcctcaccagcaagggaacaaaactggacggagaatgagttt gacaaggtgacagaagtaggcttcagaaggtcagtaataacaaacttctctgagctaaag aagcatgttctaaaccaacataaagaagataaaaaccttgaaagaaggttagatgaatgg ctaactagaataaacagtgtagagaagaccttaaattacctcatggagctgaaaaccatg gcacaagaacttcgtgacacatacacaagcatcaatagctga >gi568815594f:74345346_74552634|GENSCAN_predicted_peptide_5|201_aa MLPDFKLYYKGTVTKIACYWYQNRYIDQWNRTAASEITPHIYNHLIFDKPEKNKQWRKDS LFNKWCWENWLAICRKLKLDLFFTPYTKINSRWIKDLNVRPNTIKILEENLDNTVQDIGM GKDFMAKTPKAMATKVKIDKWDLIKLKSFCTAKETIITVNRQPTEWEKAIAIYPSDKGLI SRIYKELKQIYKKTTTPSKSQ >gi568815594f:74345346_74552634|GENSCAN_predicted_CDS_5|606_bp atgctacctgacttcaaactatactacaagggtacagtaaccaaaatagcatgctactgg taccaaaacagatatatagaccaatggaacagaacagcggcctcagaaataacaccacac atctacaaccatctgatctttgacaaacctgagaaaaacaagcagtggagaaaggattcc ctatttaataaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggat ctcttctttacaccatatacaaaaattaactctagatggattaaagacttaaatgtaaga cctaacaccataaaaatcctagaagaaaacctagacaataccgttcaggacataggcatg ggcaaagacttcatggctaaaacaccaaaagcaatggcaacaaaagtcaaaatagacaaa tgggatctaattaaactaaagagcttctgcacagcaaaagaaactatcatcactgtgaac aggcaacctacggaatgggagaaagccattgcaatctacccatctgacaaagggctaata tccagaatctacaaagaacttaaacaaatttacaaaaaaacaacaaccccatcaaaaagt cagtga >gi568815594f:74345346_74552634|GENSCAN_predicted_peptide_6|190_aa XCLSVKLALLEGSDFTHLSAPLVLVSHILMAFLQAPIPIKALDSGCPGNDEEMVKCSYLR WCTGIQQNPHGFLTTKLQIEMKKTASQAKVFWIPTQLDFIVKVKIAVKIEYFLVLFSKDG LSVGKGQIDICLMADEVHCIASSHEWHSPKVTTRKVRARQGTANNNNASDREDDEFTFEH LESIYLWNIK >gi568815594f:74345346_74552634|GENSCAN_predicted_CDS_6|573_bp ngatgtctatctgtgaagctggccctgctggaagggtctgattttactcatctctctgct cctttagtgctggtttctcatatacttatggcttttctgcaggccccaataccaattaaa gctttagacagtggatgccctggcaatgatgaagagatggtgaagtgctcttacctgagg tggtgtactggcatacagcagaatccacatggctttcttacaacaaaattacagattgaa atgaaaaaaactgcttctcaagcaaaagtgttctggattcccactcagttggacttcata gtcaaagtaaagattgccgttaaaattgaatactttctagtgctgttctccaaggatggc ctctctgttggaaagggtcaaatcgatatctgtctaatggccgatgaggtgcactgtatt gccagcagtcatgaatggcactcacccaaggttaccaccaggaaagtgagagccagacaa ggtacagctaacaataataatgcttcagacagggaagatgatgagtttacttttgaacac ctcgagtcgatttacctatggaacatcaagtga