GENSCAN 1.0    Date run: 5-Nov-116    Time: 22:33:51

Sequence gi568815588f:99433132_99635602 : 202471 bp : 43.99% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +  25407  25446   40                               0.14
 1.01 Init +  28483  28534   52  0  1   77   53    94 0.926   6.12
 1.02 Intr +  34152  34183   32  1  2   46   75    44 0.002  -3.25
 1.03 Term +  51557  51997  441  1  0   65   46   201 0.432   8.86
 1.04 PlyA +  53833  53838    6                               1.05

 2.09 PlyA -  55814  55809    6                               1.05
 2.08 Term -  82441  82341  101  2  2   60   34    87 0.112  -1.21
 2.07 Intr -  88605  88345  261  1  0  118   91   182 0.908  18.96
 2.06 Intr -  88899  88758  142  0  1   21   51   108 0.349   0.33
 2.05 Intr -  90499  90409   91  0  1   92   52    79 0.237   4.70
 2.04 Intr -  93695  93643   53  2  2   98   64    80 0.294   4.31
 2.03 Intr -  94379  94340   40  1  1   67   65    33 0.157  -2.87
 2.02 Intr -  97398  97268  131  0  2   77   82    90 0.266   6.79
 2.01 Init -  98348  98154  195  2  0   95   59    68 0.693   3.55
 2.00 Prom -  98662  98623   40                             -11.33

 3.00 Prom +  99391  99430   40                              -8.46
 3.01 Init + 100001 100358  358  1  1   57   87   413 0.758  35.37
 3.02 Intr + 101854 102586  733  2  1  109   58  1269 0.585 116.50
 3.03 Term + 103118 103187   70  2  1   81   50    99 0.263   2.81
 3.04 PlyA + 103368 103373    6                               1.05

 4.00 Prom + 117569 117608   40                              -4.76
 4.01 Init + 122624 122716   93  1  0   91   84    98 0.959  10.08
 4.02 Term + 145371 145502  132  2  0   49   47    82 0.031  -1.71
 4.03 PlyA + 146485 146490    6                               1.05

 5.03 PlyA - 146807 146802    6                               1.05
 5.02 Term - 151430 151227  204  2  0  107   33   129 0.644   6.67
 5.01 Init - 166808 166572  237  2  0   98   62    56 0.197   2.01
 5.00 Prom - 168798 168759   40                              -5.66

 6.05 PlyA - 169467 169462    6                               1.05
 6.04 Term - 178235 177718  518  0  2   32   45   436 0.509  28.18
 6.03 Intr - 179468 179412   57  0  0  108   90     5 0.242   1.56
 6.02 Intr - 180138 180027  112  0  1  103   63    19 0.132   0.85
 6.01 Init - 187204 186914  291  1  0   50  100   422 0.096  34.75
 6.00 Prom - 195010 194971   40                              -4.26

 7.03 PlyA - 195066 195061    6                               1.05
 7.02 Term - 198690 198589  102  0  0   79   42    85 0.541   1.28
 7.01 Intr - 201179 201141   39  2  0  128   93     2 0.496   3.12

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl - 187204 186833  372  1  0   50   54   493 0.903  36.23

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588f:99433132_99635602|GENSCAN_predicted_peptide_1|174_aa
MKKKSNKEEEEEEKRERRVNGPMDTNLQEGHSPVGKTDKSSHNCNTYDNGYRQRYNSDIS
FSKFAFIENLQATGTRSATVSMTDTIPAFMELMSDGIQVTRDTEEGPPIQIGVIRKGPPM
VPVLPSNLKTSYHPGPRREDSTPGRKHSTGKGTKVWKKTAHPENWQRQLRERTL

>gi568815588f:99433132_99635602|GENSCAN_predicted_CDS_1|525_bp
atgaagaagaagagcaacaaagaggaggaagaggaggagaagagggaaagaagagtgaac
ggccctatggacacaaacctccaggaaggtcatagtccagtgggaaagacagacaagtcc
tctcacaactgtaatacctacgataacggataccggcagagatataacagtgatatttca
ttcagtaaattcgcatttattgagaatttacaagctacaggcactaggagcgctacagtg
agcatgacagatacaatccctgctttcatggagctcatgtccgatgggatccaagtcaca
cgggacacagaagaggggcccccaatccaaattggagtgatcagaaagggtccccctatg
gtcccagtgcttccgtcgaaccttaaaacctcctatcatccaggcccaagaagagaggac
agcactccaggcagaaagcacagcacaggcaaaggcacgaaggtatggaagaagacggca
catccagagaactggcaacgccaactcagagagcgcaccctgtaa

>gi568815588f:99433132_99635602|GENSCAN_predicted_peptide_2|337_aa
MAELKETRLTWKGQVKGAGTVDIASERHKTPAGSGYEGGGLPRGSRNGDRSGCPRLKGER
TACRETLCSRRSPPGRLCELMQGAVSSPPAGPRVPLYPKFRRPSRSRARSESPVRYLQVA
ATMFDVKGIRAAQEGFAEVGALTVVHSPRKECAPVTGLTGSCLENKHCRTNCSPGSVYTK
RLAVPAALFGQRIILMDNELGGEKKARLETGLQDTAPGGYQTRGRLAWVSARQQVNKTGS
YQDGGRFDFLAVGDGLLTASASKSGLGPSPGRLAAGKVRAAFKGRPPSECAPPGAGDSLQ
RPAQEIATAMPAFSNNHLDQSKEPSMSRQKRLPAKRL

>gi568815588f:99433132_99635602|GENSCAN_predicted_CDS_2|1014_bp
atggcggagctgaaagagacccgtctcacctggaaggggcaggtgaaaggtgcagggacg
gtggacattgcctccgagcggcataagacaccggctgggagtgggtacgagggtggaggg
ctgccacgcgggtcccgcaacggcgacaggagtgggtgcccaaggttgaagggagaaaga
acagcctgtcgggagacgctctgcagccgccggagcccaccagggcgcctctgtgagctc
atgcagggggctgtctcctccccaccagccggcccacgggtgcccctctaccccaaattc
cgccgccccagccgctcccgggccaggtcggagtccccggttcggtacctccaggttgca
gcgactatgtttgatgtgaagggaatcagggcagcacaggagggcttcgccgaggtggga
gccctgacggtggttcacagcccccgaaaagaatgtgccccggtgacaggattgacggga
agctgccttgaaaataaacactgccggacgaattgttcccccgggtctgtttacaccaaa
cgtctggccgtcccggctgcactgtttggacagaggattatactaatggataacgagcta
ggaggggagaagaaggcgcggctggaaacggggctccaggacaccgcgccgggcggatac
caaactcgaggtcgtcttgcttgggtgagcgcgcgtcaacaagtaaacaagaccgggagc
taccaggatggcggccgctttgatttcctcgcagtcggtgacggcctgcttactgcttcg
gcctccaaatccggattagggccaagccccgggcgcctggcggccggcaaggttagagct
gccttcaaagggcggccgcccagcgagtgcgcacctccaggagctggagactccctgcag
cgcccggcgcaagaaatcgccacagccatgccagccttcagcaacaaccaccttgatcag
tcaaaagaaccatcaatgtcaaggcaaaagcgtctaccagcaaaaagattatga

>gi568815588f:99433132_99635602|GENSCAN_predicted_peptide_3|386_aa
MMLPSPVTSTPFSVKDILNLEQQHQHFHGAHLQADLEHHFHSAPCMLAAAEGTQFSDGGE
EDEEDEGEKLSYLNSLAAADGHGDSGLCPQGYVHTVLRDSCSEPKEHEEEPEVVRDRSQK
SCQLKKSLETAGDCKAAEESERPKPRSRRKPRVLFSQAQVFELERRFKQQRYLSAPEREH
LASSLKLTSTQVKIWFQNRRYKCKRQRQDKSLELGAHAPPPPPRRVAVPVLVRDGKPCVT
PSAQAYGAPYSVGASAYSYNSFPAYGYGNSAAAAAAAAAAAAAAAAYSSSYGCAYPAGGG
GGGGGTSAATTAMQPACSAAGGGPFVNVSNLGGFGSGGSAQPLHQGTAAGAACAQGTLQG
IRACRYLGPDLRIVGGRRTLRGKPAE

>gi568815588f:99433132_99635602|GENSCAN_predicted_CDS_3|1161_bp
atgatgttaccaagcccggtcacctccacccctttctcagtcaaagacattttgaatctg
gagcagcagcaccagcacttccatggtgcgcacttgcaggcggacttggagcaccacttc
cactctgcgccctgcatgctggccgccgctgaggggacgcaattttctgacggaggggag
gaggacgaggaagacgagggcgagaaattgtcctatttgaactcactagccgcagcagac
ggccacggggattcagggctgtgtccccagggctatgtccacacggtcctgcgagactcg
tgcagcgagcccaaggaacatgaagaggagcccgaggtcgtgagggaccggagccaaaaa
agctgccagctgaagaagtctctagagacggccggagactgcaaggcggcggaggagagc
gagaggccgaagccacgcagccgccggaagccccgggtcctcttctcgcaagcccaggtc
ttcgagctggaacgcaggttcaagcagcagcggtacctgtcggcacccgagcgcgagcac
ctcgccagcagcctgaagctcacatccactcaggtgaaaatctggttccagaatcgcagg
tacaagtgcaagagacagcggcaggacaagtctctggagcttggcgcacacgcgcccccg
ccgccgccgcgccgcgtggctgtcccggtgctggtgcgggacggcaagccgtgcgtcacg
cccagcgcgcaggcctacggcgcgccctacagcgtgggcgccagcgcctactcctacaac
agcttccccgcctacggctatgggaactcggccgcggccgccgccgccgccgccgccgcc
gccgcagcagcggcggcctacagcagcagctatggctgtgcgtacccggcgggcggcggc
ggcggcggcggcgggacctccgcggcgaccactgccatgcagcccgcctgcagcgcggcc
ggaggcggcccctttgtgaacgtgagcaacctaggaggcttcggcagcggcggcagcgca
cagccgttgcaccagggtactgcagccggggccgcgtgcgctcagggcaccttgcagggc
atccgggcctgcagatacctcggtccagatctccggattgtcgggggacgcaggactctt
cgaggaaaaccagccgaatga

>gi568815588f:99433132_99635602|GENSCAN_predicted_peptide_4|74_aa
MNRPGDPQPDRKKREYFSAVNRKHEINAQMETPRTWTPSTSNMGICPPCHAEDTMPNRLP
EYTFTAASYALSQF

>gi568815588f:99433132_99635602|GENSCAN_predicted_CDS_4|225_bp
atgaacaggcctggagatcctcagccggaccgcaagaaaagagaatacttctctgcagtc
aatcgaaagcatgaaattaatgcccaaatggagaccccaagaacctggacaccctccacc
agtaacatgggcatttgtcccccatgtcacgctgaagacacaatgccaaaccgcctccct
gagtacaccttcacggctgcctcatatgcactgtcccagttctaa

>gi568815588f:99433132_99635602|GENSCAN_predicted_peptide_5|146_aa
MAQGRSGGMEERRLDPESRKGRSCFKDQIGGDWSPVPVRGIVLQEEEIVCGADLRSKEDG
KKLLRSPGTRDGKIYLAGQRFAEGKEKWRGPGWWAASSTHHAGTPSSRHSCANCQHSPKA
QEGHPVTRLTGSLLRNVPDTRPYEIH

>gi568815588f:99433132_99635602|GENSCAN_predicted_CDS_5|441_bp
atggctcaaggaagaagcggaggtatggaggagaggcggcttgatcctgagagcagaaaa
gggaggtcttgcttcaaagaccaaattggaggggactggagtcctgtgcctgtaagaggc
attgttcttcaggaggaagagatcgtgtgtggggctgacttgaggagtaaggaggatggc
aagaagttactgaggtcccctgggaccagggatggaaaaatttacctggctggacagcgc
tttgccgagggcaaggagaagtggcgtggccctggctggtgggctgcaagctccactcac
cacgcgggaaccccgagctcccgacacagctgcgccaattgccagcattcccccaaagca
caagagggtcatccagtaactcgcctcacgggaagtcttctgaggaatgtcccagacacc
aggccttacgagattcattag

>gi568815588f:99433132_99635602|GENSCAN_predicted_peptide_6|325_aa
MELEGRGAGGVAGGPAAGPGRSPGESALLDGWLQRGVGRGAGGGEAGACRPPVRQDPDSG
PDYEALPAGATVTTHMVAGAVAGILEHCVMYPIDCVKDPNSPEAVCVENMTKPHDKMNGG
GAKELHEETRRLCGSAAGCVATLLHDAAMNPAEVVKQRMQMYNSPYHRVTDCVRAVWQNE
GAGAFYRSYTTQLTMNVPFQAIHFMTYEFLQEHFNPQRRYNPSSHVLSGACAGAVAAAAT
TPLDVCKTLLNTQESLALNSHITGHITGMASAFRTVYQVGGVTAYFRGVQARVIYQIPST
AIAWSVYEFFKYLITKRQEEWRAGK

>gi568815588f:99433132_99635602|GENSCAN_predicted_CDS_6|978_bp
atggagttggaggggcggggtgctggcggtgtggcgggggggccggcggcagggcccggg
cggagccccggggagtcggcgctgctggacgggtggctgcagcggggcgtgggccggggg
gccggcggcggggaggccggggcctgcaggcccccggtacgacaagatccggactccggc
ccggactacgaggcgctgccggctggagccactgtcaccacgcacatggtggcaggcgcc
gtggcagggatcctggagcactgcgtgatgtaccccatcgactgcgtcaaggatcccaac
tcaccagaggcagtttgtgttgagaacatgacaaagcctcatgacaaaatgaatgggggt
ggggccaaggaactgcatgaagaaaccagaaggttgtgtggaagtgcggccgggtgtgtg
gcaacattacttcatgatgcagccatgaaccctgcggaagtggtcaagcagaggatgcag
atgtacaactcaccataccaccgggtgacagactgtgtacgggcagtgtggcaaaatgaa
ggggccggggccttttaccgcagctacaccacccagctgaccatgaacgttcctttccaa
gccattcacttcatgacctatgaattcctgcaggagcactttaacccccagagacggtac
aacccaagctcccacgtcctctctggagcttgcgcaggagctgtagctgccgcagccaca
accccactggacgtttgcaaaacactgctcaacacccaggagtccttggctttgaactca
cacattacaggacatatcacaggcatggctagtgccttcaggacggtatatcaagtaggt
ggggtgaccgcctatttccgaggggtgcaggccagagtaatttaccagatcccctccaca
gccatcgcatggtctgtgtatgagttcttcaaatacctaatcactaaaaggcaagaagag
tggagggctggcaagtga

>gi568815588f:99433132_99635602|GENSCAN_predicted_peptide_7|46_aa
VNQGFLLGLDPLLLVESTNAEPMDTEGILCGLLEYYYCFTDEESEA

>gi568815588f:99433132_99635602|GENSCAN_predicted_CDS_7|141_bp
gtaaatcagggatttcttcttggtttggatccattgctgttggttgaatccacgaatgca
gaacccatggatacagagggcattctatgcggcctattagagtattattactgctttaca
gatgaggaaagtgaagcttag