GENSCAN 1.0 Date run: 6-Nov-116 Time: 06:20:59

Sequence gi568815580f:50563959_50829851 : 265893 bp : 44.93% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   9973  10097  125  0  2   47   58    89 0.063   1.18
 1.02 Term +  14158  14326  169  1  1  114   39    93 0.426   4.35
 1.03 PlyA +  21179  21184    6                               1.05

 2.00 Prom +  22375  22414   40                              -3.66
 2.01 Init +  26272  26289   18  0  0   63  116    44 0.305   3.20
 2.02 Intr +  61038  61146  109  2  1   47   44   124 0.023   3.76
 2.03 Term +  66556  66611   56  2  2  128   43    51 0.017   2.32
 2.04 PlyA +  67548  67553    6                               1.05

 3.04 PlyA -  70532  70527    6                               1.05
 3.03 Term -  77098  77088   11  2  2  146   44     5 0.223   0.26
 3.02 Intr -  81062  80958  105  0  0  128   59    15 0.420   2.79
 3.01 Init -  87327  86964  364  0  1   63   94    95 0.422   4.03
 3.00 Prom -  92553  92514   40                              -2.66

 4.00 Prom +  95106  95145   40                              -5.86
 4.01 Init + 100001 100546  546  1  0   95   95  1210 0.677 117.31
 4.02 Term + 105243 105251    9  2  0  127   39     0 0.257  -2.81
 4.03 PlyA + 109324 109329    6                               1.05

 5.08 PlyA - 109571 109566    6                               1.05
 5.07 Term - 113299 113145  155  2  2   42   33   103 0.286  -1.72
 5.06 Intr - 121944 121801  144  1  0   39   70   140 0.402   7.55
 5.05 Intr - 124179 124000  180  1  0   70   65    58 0.366   1.54
 5.04 Intr - 124921 124887   35  0  2  115   80    16 0.530   1.37
 5.03 Intr - 129789 129769   21  2  0  111  103    31 0.539   3.56
 5.02 Intr - 132945 132839  107  0  2  136   62    44 0.481   5.61
 5.01 Init - 137035 136973   63  1  0   70   85    11 0.444   0.18
 5.00 Prom - 138596 138557   40                              -2.76

 6.00 Prom + 140120 140159   40                              -9.46
 6.01 Init + 140840 140895   56  1  2   75   72    56 0.310   3.46
 6.02 Intr + 144724 144802   79  1  1   82   48    31 0.231  -1.95
 6.03 Intr + 151121 151265  145  1  1   67  105   135 0.885  12.96
 6.04 Intr + 157980 158141  162  1  0   94   77   243 0.992  23.75
 6.05 Intr + 162004 162217  214  2  1  108   47   341 0.824  29.77
 6.06 Intr + 165200 165892  693  2  0  121   65  1319 0.503 123.52
 6.07 Intr + 167359 167450   92  1  2   90   80    13 0.532   0.34
 6.08 Term + 173139 173212   74  1  2   59   52   108 0.717   2.37
 6.09 PlyA + 175969 175974    6                               1.05

 7.04 PlyA - 179704 179699    6                               1.05
 7.03 Term - 184330 184302   29  2  2   93   47    32 0.054  -2.16
 7.02 Intr - 188707 188603  105  2  0   95   83    22 0.063   2.59
 7.01 Init - 201959 201776  184  2  1   58  100   105 0.770   7.98
 7.00 Prom - 202269 202230   40                              -4.86

 8.07 PlyA - 208060 208055    6                               1.05
 8.06 Term - 223949 223917   33  2  0  101   42    60 0.223   0.29
 8.05 Intr - 230099 230031   69  2  0   92   95    30 0.376   3.48
 8.04 Intr - 236185 236078  108  1  0   64  105    91 0.984   8.88
 8.03 Intr - 241378 241196  183  1  0   59  105   218 0.999  20.58
 8.02 Intr - 242892 242746  147  0  0   54   97   152 0.974  13.13
 8.01 Intr - 255744 255623  122  1  2   95  110    52 0.456   8.31

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 152913 153133  221  1  2   65   54   104 0.825   1.90
S.002 Init + 157134 157242  109  2  1   65  105    45 0.870   4.28

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815580f:50563959_50829851|GENSCAN_predicted_peptide_1|97_aa
RGKLKLREDVWLVLDDIVTTCFPSIPGLEATSDSTSAFFYPWQFLFSPRRKGRGCISASG
PQVNAAVLNCERSAILEKGKWPESSLKLLIEWNLSRF

>gi568815580f:50563959_50829851|GENSCAN_predicted_CDS_1|294_bp
aggggaaaactgaagctacgggaagatgtgtggcttgtcttagatgatatagttactacc
tgcttccctagtattccaggactagaagccacttctgattccacatcggcgttcttttat
ccctggcaattcctgtttagtccaaggcgtaaggggaggggctgtatttctgcatctggc
ccccaggtgaatgcagctgttttaaactgcgagcgctcagctatcctggaaaaggggaag
tggccagagagttcattgaagctgctcattgaatggaatctgagccgcttctaa

>gi568815580f:50563959_50829851|GENSCAN_predicted_peptide_2|60_aa
MALVMKPVALNSTLIHSLGYLEDEDYLEMKDILDDWHLLQDTDQPSVAILGIQEYPGYRC

>gi568815580f:50563959_50829851|GENSCAN_predicted_CDS_2|183_bp
atggccttggtcatgaagcctgtggcactgaattccactctaattcactctctggggtac
ctggaagatgaggactatctggaaatgaaagacatccttgatgactggcacttgctccag
gacactgaccaaccttccgtagccattctgggcatccaggagtacccaggatataggtgc
tga

>gi568815580f:50563959_50829851|GENSCAN_predicted_peptide_3|159_aa
MRLHRLGAEWTRKQWSSWTQGDGQWGIHCKLSDETKNKSLNGNPSLPSSTCVPCWFDFML
KKVTIQPSGISQFILITAGTQATEIQKEERTLPRSLLFLEQAQPILAFLGPGANLTGHPT
SDGSRLQIPIPSIILHCKFEQCPLRIAMWQLLEEDGGDS

>gi568815580f:50563959_50829851|GENSCAN_predicted_CDS_3|480_bp
atgaggctccaccgactgggtgcggagtggacaagaaagcaatggagctcatggacccag
ggagatgggcagtggggaatccactgtaagctctcagatgagacaaagaacaagagcctc
aacgggaaccccagtctccccagcagcacctgcgttccttgctggtttgatttcatgctg
aaaaaggtgaccatccagccttcaggcattagtcagttcatactaattacagcaggaacc
caggcaactgaaatccagaaagaggaaaggactcttccaaggtcactcctgtttttggag
caggcccaacccattttggcatttcttgggcctggggctaaccttacagggcatcctacg
tcagatggctctcgcctgcaaatccccatcccatccattattctccactgcaagtttgag
cagtgtccccttagaatagccatgtggcagcttctagaagaggatggaggtgacagctga

>gi568815580f:50563959_50829851|GENSCAN_predicted_peptide_4|184_aa
MAEKGDCIASVYGYDLGGRFVDFQPLGFGVNGLVLSAVDSRACRKVAVKKIALSDARSMK
HALREIKIIRRLDHDNIVKVYEVLGPKGTDLQGELFKFSVAYIVQEYMETDLARLLEQGT
LAEEHAKLFMYQLLRGLKYIHSANVLHRDLKPANIFISTEDLVLKIGDFGLARIVDQHYS
HKAR

>gi568815580f:50563959_50829851|GENSCAN_predicted_CDS_4|555_bp
atggctgagaagggtgactgcatcgccagtgtctatgggtatgacctcggtgggcgcttt
gttgacttccaacccctgggcttcggtgtcaatggtttggtgctgtcggccgtggacagc
cgggcctgccggaaggtcgctgtgaagaagattgccctgagcgatgcccgcagcatgaag
cacgcgctccgagagatcaagatcattcggcgcctggaccacgacaacatcgtcaaagtg
tacgaggtgctcggtcccaagggcactgacctgcagggtgagctgttcaagttcagcgtg
gcgtacatcgtccaggagtacatggagaccgacctggcacgcctgctggagcagggcacg
ctggcagaagagcatgccaagctgttcatgtaccagctgctccgcgggctcaagtacatc
cactccgccaacgtgctgcacagggacctgaagcccgccaacatcttcatcagcacagag
gacctcgtgctcaagattggggatttcgggttggcaaggatcgttgatcagcattactcc
cacaaggcccgctaa

>gi568815580f:50563959_50829851|GENSCAN_predicted_peptide_5|234_aa
MAAGGAGVVLKGVSVLHLNKLVLKCRVSPPNVCKGPHTRKTPCMSQDLNPTAISWCWITT
GIQWFLGEQVVFGYMNCSLSTAGTQSLDVYTPEKCQHQAGDEGEAAAHDHLSAESLTPRA
GNGFSHLFSVPSLVPELSVKAADQCHRENSMLQRAACTECQYLVPFKKMAGFTTLDSLPK
PSEAQQSPTFLAPGISFVENNFSTDREEELGDGFGMKLLHSDHQALDSHKERAT

>gi568815580f:50563959_50829851|GENSCAN_predicted_CDS_5|705_bp
atggctgcagggggagctggtgtggttttgaaaggtgtttctgtccttcacctaaacaag
ctggtgctcaagtgccgtgtctctcctccaaatgtgtgcaagggaccccataccaggaag
actccctgtatgtcccaggacctcaaccccactgccatcagctggtgttggattaccacg
ggaatccaatggtttttgggagaacaggtggtatttggttatatgaactgtagcctctca
acggcagggactcagtcattagatgtctataccccagagaagtgccagcaccaagctggg
gatgagggcgaggcagctgcccatgatcacctttcagcagaatccctgaccccgagggca
gggaatgggtttagtcatctgttctcagtacccagcctcgtgcctgaactctctgtgaag
gcagcagatcagtgccaccgagaaaacagcatgctccagagagctgcgtgtaccgagtgt
cagtatctggtgccattcaagaagatggcaggctttacaacattggattctcttcccaaa
ccgtcggaagcccagcagtccccaacctttttggcaccagggatcagtttcgtggaaaac
aatttttccacggaccgggaagaggagctgggagatggttttgggatgaaactgttacac
tcagatcatcaggcattagattctcataaggagcgtgcaacctag

>gi568815580f:50563959_50829851|GENSCAN_predicted_peptide_6|504_aa
MTAAILGEHHDDQMEGNRGQPRPPQKALAALEVSLMCEAVEPSIQGYLSEGLVTKWYRSP
RLLLSPNNYTKAIDMWAAGCILAEMLTGRMLFAGAHELEQMQLILETIPVIREEDKDELL
RVMPSFVSSTWEVKRPLRKLLPEVNSEAIDFLEKILTFNPMDRLTAEMGLQHPYMSPYSC
PEDEPTSQHPFRIEDEIDDIVLMAANQSQLSNWDTCSSRYPVSLSSDLEWRPDRCQDASE
VQRDPRAGSAPLAEDVQVDPRKDSHSSSERFLEQSHSSMERAFEADYGRSCDYKVGSPSY
LDKLLWRDNKPHHYSEPKLILDLSHWKQAAGAPPTATGLADTGAREDEPASLFLEIAQWV
KSTQGGPEHASPPADDPERRLSASPPGRPAPVDGGASPQFDLDVFISRALKLCTKPEDLP
DNKLGDLNGACIPEHPGDLVQTEAFSKERWNILPFLPLVFSSFSSPNTPKDTHRKSSQFP
ELHEDVLNSSSNDVGSIVKPENVT

>gi568815580f:50563959_50829851|GENSCAN_predicted_CDS_6|1515_bp
atgacagcagccatcctgggagagcatcatgatgaccaaatggaaggaaacagaggacag
cccaggcccccacagaaagccctggcagccttagaagtctctcttatgtgcgaggcagtg
gagccctctattcagggttatctgtcagaagggttggtaacaaagtggtaccgttcccca
cgactgctcctttcccccaataactacaccaaagccatcgacatgtgggccgccggctgc
atcctggctgagatgcttacggggagaatgctctttgctggggcccatgagctggagcag
atgcaactcatcctggagaccatccctgtaatccgggaggaagacaaggacgagctgctc
agggtgatgccttcctttgtcagcagcacctgggaggtgaagaggcctctgcgcaagctg
ctccctgaagtgaacagtgaagccatcgactttctggagaagatcctgacctttaacccc
atggatcgcctaacagctgagatggggctgcaacacccctacatgagcccatactcgtgc
cctgaggacgagcccacctcacaacaccccttccgcattgaggatgagatcgacgacatc
gtgctgatggccgctaaccagagccagctgtccaactgggacacgtgcagttccaggtac
cctgtgagcctgtcgtcggacctggagtggcggcctgaccggtgccaggacgccagcgag
gtacagcgcgacccgcgcgcgggttcggcgccactggctgaggacgtgcaggtggacccg
cgcaaggactcgcacagcagctccgagcgcttcctagagcagtcgcactcgtccatggag
cgcgccttcgaggccgactacgggcgctcctgcgactacaaggtggggtcgccgtcctac
ctggacaagctgctgtggcgcgacaacaagccgcaccactactcggagcccaagctcatc
ctggacctgtcgcactggaagcaggcggccggcgcgccccccacggccacggggctggcg
gacacgggggcgcgcgaggacgagccggccagcctcttcctggagatcgcgcagtgggtc
aagagcacgcagggcggcccagagcacgccagcccgcccgccgacgaccccgagcgccgc
ttgtctgcctcgccccccggccgcccggccccggtggacggcggcgccagcccccagttc
gacctggacgtgttcatctcccgcgccctgaagctctgcaccaagcccgaggacctgccg
gacaataaactgggcgacctcaatggtgcgtgcatccccgagcaccctggcgacctcgtg
cagaccgaggccttctccaaagaaaggtggaacattctaccatttctgccactggtgttc
agctccttctcttcccccaacactcccaaagatacccacaggaagtccagccagtttcca
gaattgcatgaagatgtgcttaattcttccagcaacgatgtgggtagtattgtcaaacca
gaaaatgtcacctga

>gi568815580f:50563959_50829851|GENSCAN_predicted_peptide_7|105_aa
MDRGWNSLEGSEEDRKWWENSELPRDLLNSFAENTDSDMDNEVQVEVVSDGDVELVGNWN
KVSSLILMEVWLHGPSLHVRVLETRVDVSELMVQGRAPAVTAAAM

>gi568815580f:50563959_50829851|GENSCAN_predicted_CDS_7|318_bp
atggacagaggttggaacagtttggagggctcagaagaagacagaaagtggtgggaaaat
tcagaacttcctagagacctgttgaacagctttgctgaaaatactgatagtgatatggac
aatgaagtccaggttgaggtggtctcagatggagatgtggaacttgttgggaactggaat
aaagtttcttccctgattctgatggaagtttggctccatggaccaagtctgcatgtcagg
gtgttggagacaagagtggatgtttctgaactaatggtacagggaagagctcctgcagtc
acagcagctgccatgtga

>gi568815580f:50563959_50829851|GENSCAN_predicted_peptide_8|220_aa
XIQGWENLAESSHLATTRESSPPESGTGSGSSRGSRLQEPQVSWKLRFQKREPLKNVFFI
LAERARDPSAKKRHMAMRNLGTMAYEAPDKVRKYKKIVLDLLVYGLYDPVNLEVIHESMK
TLTVVLGKIQGKGLGSFFIDITLQTRTLLDDACKTTFQACSPYLKLKEEYSFQSEEDQRN
TKLYQQLGDAINHVPFKQMNFLQQFANVNQPPVEQQLFLE

>gi568815580f:50563959_50829851|GENSCAN_predicted_CDS_8|663_bp
ngaattcagggctgggaaaatttagccgagtcatcacatttggctactacccgggaaagc
agccctcccgagtcaggaacgggctccggttcatcacgtggcagccgcctgcaggagccg
caggtctcttggaaactgaggttccagaagcgggagcctctgaagaatgtgtttttcatc
ttggcagaaagagctcgggaccccagtgctaaaaagcgtcacatggcaatgagaaacttg
ggaaccatggcctatgaagcccctgacaaggtgagaaagtataagaaaattgtcctcgac
ctgctggtgtatggactgtatgaccctgtgaatttggaagtcatccatgagagtatgaag
actctgaccgtcgttctgggcaagatccaggggaaaggtttgggttccttcttcatagat
atcacccttcagaccaggactttattagatgacgcttgcaaaacaacatttcaagcctgt
tctccatatctgaaactaaaggaggaatacagcttccagagtgaagaagatcaaaggaac
actaagctctaccagcagctgggtgatgctattaatcatgttcctttcaagcagatgaat
ttcctacagcagtttgcaaatgttaaccagccgcccgtggagcagcagctgtttctagaa
tga