GENSCAN 1.0 Date run: 7-Nov-116 Time: 00:01:05 Sequence gi568815597f:116654493_116868780 : 214288 bp : 43.89% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 608 603 6 1.05 1.02 Term - 9780 9686 95 1 2 85 48 48 0.367 -1.51 1.01 Init - 11834 11792 43 2 1 107 106 103 0.721 12.58 1.00 Prom - 35446 35407 40 -4.76 2.05 PlyA - 36606 36601 6 1.05 2.04 Term - 40866 40787 80 1 2 84 48 103 0.191 3.83 2.03 Intr - 51999 51947 53 2 2 103 97 73 0.923 8.25 2.02 Intr - 59485 59345 141 0 0 42 46 215 0.569 12.17 2.01 Init - 60343 60066 278 1 2 91 71 334 0.583 28.46 2.00 Prom - 61106 61067 40 -8.86 3.00 Prom + 61704 61743 40 -7.26 3.01 Init + 63453 64579 1127 2 2 60 53 411 0.682 28.47 3.02 Term + 70453 70471 19 1 1 99 47 12 0.075 -4.01 3.03 PlyA + 70794 70799 6 1.05 4.09 PlyA - 73237 73232 6 1.05 4.08 Term - 73879 73727 153 1 0 84 32 101 0.389 2.02 4.07 Intr - 84811 84734 78 1 0 74 105 54 0.664 5.45 4.06 Intr - 86381 85398 984 2 0 -19 55 1171 0.536 94.19 4.05 Intr - 86807 86503 305 0 2 -63 57 443 0.857 22.41 4.04 Intr - 87182 87011 172 2 1 0 52 239 0.958 11.02 4.03 Intr - 87638 87372 267 2 0 5 61 272 0.777 13.93 4.02 Intr - 88132 87959 174 1 0 81 3 218 0.368 12.74 4.01 Init - 96906 96829 78 0 0 93 38 22 0.082 -1.24 4.00 Prom - 98218 98179 40 -5.76 5.00 Prom + 98287 98326 40 -5.86 5.01 Init + 100001 100061 61 1 1 63 121 -10 0.903 1.21 5.02 Intr + 100139 100459 321 0 0 120 115 183 0.987 20.03 5.03 Intr + 105910 106140 231 2 0 92 96 132 0.966 12.14 5.04 Intr + 109992 110114 123 1 0 126 103 64 0.994 12.26 5.05 Term + 110145 110191 47 1 2 126 41 38 0.879 0.27 5.06 PlyA + 110463 110468 6 1.05 6.02 PlyA - 116177 116172 6 1.05 6.01 Sngl - 121140 120928 213 0 0 44 33 218 0.394 7.02 6.00 Prom - 122192 122153 40 -2.46 7.00 Prom + 131053 131092 40 -2.36 7.01 Init + 135280 135605 326 0 2 71 50 119 0.117 3.20 7.02 Term + 150365 150461 97 2 1 97 53 98 0.879 4.64 7.03 PlyA + 150619 150624 6 1.05 8.08 PlyA - 152703 152698 6 1.05 8.07 Term - 159241 159161 81 1 0 101 54 94 0.512 4.99 8.06 Intr - 171988 171905 84 1 0 114 45 23 0.012 0.62 8.05 Intr - 193142 193116 27 2 0 127 65 10 0.218 0.91 8.04 Intr - 199410 199334 77 1 2 86 101 34 0.538 3.73 8.03 Intr - 200684 200467 218 1 2 63 94 73 0.478 3.55 8.02 Intr - 206853 206769 85 1 1 100 52 60 0.422 2.48 8.01 Init - 210999 210960 40 0 1 80 115 42 0.620 6.49 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 51634 51576 59 2 2 106 44 16 0.816 -3.15 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:116654493_116868780|GENSCAN_predicted_peptide_1|45_aa MKCFFPVLSCLAVLGAMEENEHQGGFKGAQWQTSRCRLKWEGLEL >gi568815597f:116654493_116868780|GENSCAN_predicted_CDS_1|138_bp atgaagtgctttttcccggtgctgagctgtctggctgtgctgggtgccatggaggaaaat gagcatcaaggtggctttaaaggggcacagtggcagacatccagatgcaggctgaagtgg gaaggtttggaactgtag >gi568815597f:116654493_116868780|GENSCAN_predicted_peptide_2|183_aa MVKVKPGVNGLGCIGCVVTRAAFNSGKVDIVAINDAFNDLNYMVYMFQYDFTHGEFHGTV KAENRKLVINRDPITIFQELDSTRIKWGDAGTGDTHSSTFDTGAGIALNDHFVKLISCYD NEFGYSNRVVDFMAHMASKENLSSTGLNEKATVTEEIDVYLDSGAPGKNPPVFTMAVPGA CRT >gi568815597f:116654493_116868780|GENSCAN_predicted_CDS_2|552_bp atggtgaaggtgaagcctggagtcaatggattaggctgtattgggtgcgtggtcaccagg gctgcttttaactctggtaaagtggatattgtcgccatcaatgatgccttcaatgacctc aactacatggtctacatgttccagtatgatttcacccatggcgaattccatggcactgtc aaagctgagaacaggaagcttgtcatcaatagagatcccatcaccatcttccaggagcta gattccaccagaatcaaatggggtgatgctggcactggtgacacccactcctccaccttt gatactggggctggcattgccctcaatgaccactttgtcaagcttatttcctgttatgac aatgaatttggctacagcaacagggtggtggacttcatggcccacatggcctccaaggag aacctgagctccacaggactcaatgaaaaagcaactgtaacagaagagatagatgtctac ctggattctggagctccaggaaagaacccccctgtgttcacaatggcggtccctggtgcc tgcagaacttag >gi568815597f:116654493_116868780|GENSCAN_predicted_peptide_3|381_aa MSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIV KMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLP DFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFN KWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGVGKD FMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTTWEKIFATYSSDKGLISRI YNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRYH LTPVRMAIIKKSGNNRGRLSL >gi568815597f:116654493_116868780|GENSCAN_predicted_CDS_3|1146_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaa gaggacacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtg aaaatggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcccgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacct gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatatagatcaatggaacagaacagagccctcagaaataatgccgcatatctacaac tatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaac aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaatcaattcaagatggattaaagatttaaacgttagacctaaaacc ataaaaaccctagaagaaaacctaggcattaccattcaggacataggcgtgggcaaggac ttcatgtccaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacct acaacatgggagaaaattttcgcaacctactcatctgacaaagggctaatatccagaatc tacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcg aaggacatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacacatgaag aaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccactatgagatatcat ctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacagggggcgtctaagt ctgtga >gi568815597f:116654493_116868780|GENSCAN_predicted_peptide_4|736_aa MELPVCEWLTHQLPSNSSVSRPGGEKAPAQTMMSFGGSEALLGAERQQQPPLHADPKGQH KQKALGCWILQRVPLMDTDVHELHSAAPGAGAPAPGLPHVHRCLHDEAQQHKEVKACAVG GFLQEAEAAGVELQKKAQALQEECSYLWHRQQEQVGELLGQMQGCGASQAQAQEEITEYQ RQLPAETTELEALKSTIDSLDRQCSELEDRHQADIASYQEATQQLDAGTPKKIKVVEKSE KETVIVEEQTEETQVTEKVTEEEEKEAKEEEVKEEEGGEEEEAKGGEEEAKSPLAEEATS PEKEAKSPVKEEAKSPAEAKSPEKEEAKSPAEAKSPAKEEAKSLAEAKSPKKAKSPVKEE AKSPEKAKTLDVKSPEANTPAKEEARPSADKSPKKAKSPVKEEVKFPEKVKSPLKEDAKA PEKEIPKKEEVKSPVKEEEKPQDVKVKEPAKKAEEEKATATPKTEKESKKEESSNKEAPK PEVEEKEAAIEKPKESKVEAKKEEAEDKKKAATLEKEAPAKVEVKEDAKPKEKTEMAKKE PDDAKAKEPRKPAEREEAAAAMEKKDTKEEKATEFKKPEEKPKTQSQRRPSQRSSVNLRR KRLKNPPAQTKKTAGLQRRPQKTRLPRGSKAGRKECLEQPKKLKRVLELKDQSNAIFTFY MITAEFQYQYQQQNQIPQTLSLPGKETCPRAIGGTSAATITLIDPETDTLIINRPCHVQL PQWRDVALSDNVALVF >gi568815597f:116654493_116868780|GENSCAN_predicted_CDS_4|2211_bp atggagcttcctgtttgtgagtggctgacccaccagctgccatccaactccagtgtgtca aggcctggtggagagaaggcacctgctcagaccatgatgagctttggtggctctgaggca ctgctgggtgctgaacggcagcagcagcctccactacacgctgaccctaaagggcagcac aagcagaaggcgctcggctgctggatcctccagcgggttccactcatggacacggatgtc catgagctccattctgctgcgcctggagcaggagcacctgctccaggacttccacatgtg caccggtgcctccatgatgaggcccagcagcacaaagaggtcaaggcttgtgctgttggc ggcttcttgcaggaggccgaggcggcgggcgtggaactgcagaagaaggcgcaggcactg caagaggagtgtagctacctgtggcaccgccagcaggagcaggtgggcgagctgctcggc cagatgcagggctgcggcgcctcgcaggcgcaggcgcaggaggagataactgagtatcag cgtcagcttccggccgagaccacagagctggaggcactgaagagcaccatcgactcactg gacaggcagtgttctgagctggaggaccgtcatcaggccgacattgcctcctaccaggaa gccactcagcagctggacgctggaacaccaaagaagatcaaagtggtagagaagtctgag aaagaaactgtgattgtggaggaacagacagaggagacccaagtgactgaaaaagtgact gaagaagaggagaaagaggccaaagaggaggaggtcaaggaggaagaagggggtgaagaa gaggaggcaaaagggggagaagaagaagcaaagtctcccctagcagaagaggccacatcc ccagagaaggaagccaagtccccagtgaaggaagaggcaaagtcaccggctgaggccaag tccccagagaaggaggaagcaaaatccccagctgaggccaagtccccagcaaaggaagag gcaaagtcactggctgaggccaagtctccaaagaaggccaagtccccagtgaaggaagaa gcaaagtcccctgagaaggccaagactcttgatgtgaagtctccagaagccaacactcca gcaaaggaggaagcaaggccctctgcagataaatcccccaaaaaggccaaaagccctgtc aaagaggaggtcaagttcccagagaaggtgaagtctcccctgaaagaggatgccaaggcc cccgagaaggagatcccaaagaaggaagaggtgaagtccccagtgaaggaggaggagaag ccccaggacgtaaaagtcaaagagcctgcaaagaaggcagaggaagagaaagctaccgcc acgccaaaaacagagaaggagagcaagaaagaggagtcatccaacaaggaggctccaaag cccgaggtggaggagaaggaagctgccatcgaaaagcccaaagaatccaaagttgaagcc aagaaggaagaggctgaagataagaaaaaagcagccaccttagagaaggaggctcctgcc aaggtggaggtgaaggaagacgctaaacccaaagagaagacagagatggccaagaaggaa ccagatgatgccaaggccaaggaacccagaaaaccagcagagagggaggaggcggcagca gcaatggagaaaaaagataccaaggaggagaaggccactgagttcaagaagcctgaggag aaacccaagacccaaagccaaagacgaccctctcaaaggagctcagtgaacctaaggcgg aaaaggctgaaaaatcctccagcacagaccaaaaagacagcaggcctccagagaaggcca cagaagacaaggctgccaaggggaagtaaggcagggagaaaggaatgtctggagcagcca aagaaactcaaaagggtcctggagctcaaggatcagagtaatgcaatttttactttttat atgatcacagcagaatttcagtatcagtatcagcagcaaaatcagattccacaaacattg tcactgccaggcaaagagacctgcccaagggccattggaggtaccagtgctgccaccatc accctgattgacccagaaactgatactttgataatcaacaggccatgtcatgtccagctc ccccaatggagggacgtggcactgtcagacaatgtggccttggtgttttaa >gi568815597f:116654493_116868780|GENSCAN_predicted_peptide_5|260_aa MSFPCKFVASFLLIFNVSSKGAVSKEITNALETWGALGQDINLDIPSFQMSDDIDDIKWE KTSDKKKIAQFRKEKETFKEKDTYKLFKNGTLKIKHLKTDDQDIYKVSIYDTKGKNVLEK IFDLKIQERVSKPKISWTCINTTLTCEVMNGTDPELNLYQDGKHLKLSQRVITHKWTTSL SAKFKCTAGNKVSKESSVEPVSCPEKGLDIYLIIGICGGGSLLMVFVALLVFYITKRKKQ RSRRNGPSKIPMKQLMGMTP >gi568815597f:116654493_116868780|GENSCAN_predicted_CDS_5|783_bp atgagctttccatgtaaatttgtagccagcttccttctgattttcaatgtttcttccaaa ggtgcagtctccaaagagattacgaatgccttggaaacctggggtgccttgggtcaggac atcaacttggacattcctagttttcaaatgagtgatgatattgacgatataaaatgggaa aaaacttcagacaagaaaaagattgcacaattcagaaaagagaaagagactttcaaggaa aaagatacatataagctatttaaaaatggaactctgaaaattaagcatctgaagaccgat gatcaggatatctacaaggtatcaatatatgatacaaaaggaaaaaatgtgttggaaaaa atatttgatttgaagattcaagagagggtctcaaaaccaaagatctcctggacttgtatc aacacaaccctgacctgtgaggtaatgaatggaactgaccccgaattaaacctgtatcaa gatgggaaacatctaaaactttctcagagggtcatcacacacaagtggaccaccagcctg agtgcaaaattcaagtgcacagcagggaacaaagtcagcaaggaatccagtgtcgagcct gtcagctgtccagagaaaggtctggacatctatctcatcattggcatatgtggaggaggc agcctcttgatggtctttgtggcactgctcgttttctatatcaccaaaaggaaaaaacag aggagtcggagaaatggccccagcaaaatccccatgaaacagctcatggggatgacacct tga >gi568815597f:116654493_116868780|GENSCAN_predicted_peptide_6|70_aa MFFADSHRGFLLHLCLEGTGAFPVPQRLAAQSQSFHLLTVRWDRPEAPGQPGAPASSIYL STSDAFTAIT >gi568815597f:116654493_116868780|GENSCAN_predicted_CDS_6|213_bp atgttcttcgccgacagtcaccgcggtttcctgcttcacctgtgcttggagggaactggc gcttttcccgtgccccagcggctggcggctcagagccagagcttccacctcctcaccgtg cgctgggaccgccctgaggcccccggtcaacctggcgctccagcctcctccatctacctg tccacatctgatgctttcaccgccataacgtag >gi568815597f:116654493_116868780|GENSCAN_predicted_peptide_7|140_aa MAILPKAIYKFNAIPIKIPPSFFAELEKKILKFMTQKRLIAKARLSKKNKSGGITLPDFK LYYETIVTKTAWHWYKNRHIDQWNRIENSEIKPNTYSQLIFNKGNKNIKMCPENLKQRGT QRKVYALQPTPKEEEDRRLS >gi568815597f:116654493_116868780|GENSCAN_predicted_CDS_7|423_bp atggccatactgccaaaagcaatctacaaattcaatgcaattcccatcaagataccacca tcattctttgcagaactagaaaaaaaaatcctaaaattcatgacccaaaaaagactcata gccaaagcaagactaagcaaaaagaacaaatctggaggcattacattacccgacttcaaa ctatactatgagaccatagtcaccaaaacagcatggcactggtataaaaacagacatata gaccaatggaacagaatagagaactcagaaataaagccaaatacttacagtcaactgatc ttcaacaaaggaaacaaaaacataaagatgtgccctgaaaaccttaagcaaagaggaaca cagagaaaagtgtacgccctgcagccaactcccaaagaggaagaggaccgcagacttagc tga >gi568815597f:116654493_116868780|GENSCAN_predicted_peptide_8|203_aa MGAVLEALWQYSPGGTGLPGKQVLHLHADLRVKEGKVSTLNKVDFWCQKPRTLTGTASVP ARFRKPSPVLKREKRLKKTEVEYRKEIKKQKKGAALWKRGEKEGVWGKKVEELAALELRL ELLSRKQVLLLVNYHDCVYMCQFNRFISLAHIGGPEDARTWVFEDQGLRGPGCARAWNIR FQNYISQAEGFLYGETKRKDLTL >gi568815597f:116654493_116868780|GENSCAN_predicted_CDS_8|612_bp atgggggctgtccttgaagccttgtggcagtacagcccagggggcactgggcttccaggc aaacaggtcctccacttacatgcagatctgcgtgtgaaagaagggaaggtttcgaccctg aacaaagtcgatttctggtgtcagaaaccaagaactctgactggtacagcatctgtccct gcaagattcaggaaaccaagtccagtgctcaagagggagaagagattaaagaaaacagaa gtagaatatagaaaggaaataaaaaagcagaaaaagggtgcagcattgtggaagcgtggg gagaaggaaggagtgtgggggaagaaagtagaggaattggcagctctggagctcagactg gaactgctcagcaggaaacaggtcttgctgctagtaaattaccatgactgtgtttacatg tgccaatttaatcggtttataagcctggcccacataggaggaccagaagatgcaaggacc tgggtgttcgaggaccagggtctgagaggaccagggtgtgcaagggcctggaacattcgc ttccagaattatatctcccaggcagaaggattcctctatggagaaaccaagagaaaagac ctgacactttga