GENSCAN 1.0 Date run: 24-Oct-119 Time: 21:30:31 Sequence gi568815597f:228635743_228843737 : 207995 bp : 44.54% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 14539 14754 216 0 0 67 8 381 0.784 24.69 1.02 Intr + 14767 14933 167 0 2 -4 50 123 0.753 -1.94 1.03 Term + 14935 15133 199 1 1 51 54 169 0.650 6.67 1.04 PlyA + 15947 15952 6 1.05 2.00 Prom + 16150 16189 40 -4.86 2.01 Sngl + 25322 25570 249 1 0 60 42 178 0.724 5.48 2.02 PlyA + 25852 25857 6 1.05 3.00 Prom + 25969 26008 40 -4.46 3.01 Init + 26023 26147 125 0 2 71 89 119 0.731 9.94 3.02 Term + 30856 30940 85 1 1 93 49 52 0.601 -1.07 3.03 PlyA + 31352 31357 6 1.05 4.07 PlyA - 32059 32054 6 1.05 4.06 Term - 32834 32769 66 2 0 97 42 74 0.632 1.64 4.05 Intr - 33147 33073 75 0 0 59 96 57 0.133 3.31 4.04 Intr - 37194 37164 31 2 1 52 46 61 0.018 -3.57 4.03 Intr - 40060 40042 19 2 1 130 58 11 0.008 -2.03 4.02 Intr - 44916 44409 508 0 1 31 110 249 0.471 13.95 4.01 Init - 45222 45145 78 0 0 56 80 79 0.530 4.96 4.00 Prom - 47983 47944 40 -8.36 5.00 Prom + 49192 49231 40 -6.06 5.01 Sngl + 51744 52085 342 2 0 30 46 338 0.455 19.84 5.02 PlyA + 53615 53620 6 1.05 6.00 Prom + 68509 68548 40 -1.66 6.01 Sngl + 71959 72288 330 0 0 90 41 225 0.578 14.02 6.02 PlyA + 72334 72339 6 1.05 7.00 Prom + 73372 73411 40 -4.96 7.01 Sngl + 73465 74547 1083 0 0 44 35 291 0.952 16.48 7.02 PlyA + 74668 74673 6 1.05 8.00 Prom + 76596 76635 40 -2.46 8.01 Init + 87735 87750 16 2 1 103 66 14 0.673 1.18 8.02 Term + 89609 89742 134 0 2 74 52 96 0.863 2.85 8.03 PlyA + 90017 90022 6 1.05 9.00 Prom + 96242 96281 40 -4.16 9.01 Init + 100001 100262 262 1 1 56 105 307 0.716 26.43 9.02 Intr + 101931 101989 59 1 2 62 82 32 0.797 -1.40 9.03 Intr + 104649 104717 69 2 0 44 99 71 0.759 3.18 9.04 Term + 107429 107998 570 1 0 70 44 657 0.609 53.94 9.05 PlyA + 109241 109246 6 1.05 10.03 PlyA - 109647 109642 6 1.05 10.02 Term - 133136 132965 172 1 1 70 35 180 0.383 8.20 10.01 Init - 148007 147829 179 2 2 94 74 76 0.275 5.53 10.00 Prom - 154940 154901 40 -2.96 11.00 Prom + 156924 156963 40 -3.56 11.01 Init + 182409 182516 108 2 0 94 39 104 0.864 4.36 11.02 Term + 183439 183561 123 0 0 82 42 167 0.870 9.88 11.03 PlyA + 185388 185393 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:228635743_228843737|GENSCAN_predicted_peptide_1|193_aa MLRKEAAAGWMVLGCRPYLAFTALSVPGSLNINLYSLVCASPGRLWGQRATCCQMPRSTL LLQEGSILVAVMGSCHGQKLCEESARRVVLTSLLACLPSCWRVYFLKGGYETFFQGCMDV KLVSLVSNERALISQCGKLVVLNINYRPAYDQGGPVEILSSLYLGGAYHASKCKFLIKLY ITAQLNVPLWTWL >gi568815597f:228635743_228843737|GENSCAN_predicted_CDS_1|582_bp atgctccgcaaggaggcagcggcgggctggatggtgcttggctgccggccctacctggcc tttaccgccttgagcgtgcctggctcactcaacatcaacctctattcactggtgtgcgcc agcccggggcggctgtggggtcagcgtgctacttgctgccagatgcctcggagcacgctg ctgctgcaggagggcagcatcctggtggccgtgatggggagctgccacgggcagaagctg tgcgaggagagtgcccggcgggttgtcctcacctcactgctcgcctgcctgccctcctgc tggcgggtctacttcctcaaagggggatatgagaccttcttccagggttgcatggatgta aaactcgtttcactcgtttcaaatgagagagctctcatcagccagtgtggaaagctagta gtgctaaacatcaactacaggccagcttatgatcagggtggtccagttgaaatcctttcc tccctctaccttggaggtgcctaccatgcatccaagtgcaagttcctcatcaagctatac atcacagcccagctgaatgtccccttgtggacctggctgtga >gi568815597f:228635743_228843737|GENSCAN_predicted_peptide_2|82_aa MSELPFTVATKRIKYLGIQLTRDVKDLFKENYKLLLNKIKEETNKWKNIPCSWIGSINIM KMAVLPKVIYRFNAIPIKLPMI >gi568815597f:228635743_228843737|GENSCAN_predicted_CDS_2|249_bp atgagtgaactcccattcacagttgctacaaagagaataaaatacctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaactactgctcaacaaaataaag gaggagacaaacaaatggaagaacattccatgctcatggataggaagcatcaatatcatg aaaatggccgtactgcccaaggtaatttatagattcaatgccatccccatcaagctacca atgatttaa >gi568815597f:228635743_228843737|GENSCAN_predicted_peptide_3|69_aa MGKDVMTETPKAMATKAKIDKWGLNKLKSFCTAKETTIRVNSLQCKSILAQGGVAAPVWK PAGNVHPCD >gi568815597f:228635743_228843737|GENSCAN_predicted_CDS_3|210_bp atgggcaaggacgtcatgactgaaacaccaaaagcaatggcaacaaaagccaaaattgac aaatggggtctaaataaattaaagagcttctgcacagccaaagaaactaccatcagagtg aacagcctccagtgtaagagtattcttgctcagggtggtgtggcagcccctgtttggaag ccagcagggaatgtgcacccctgtgactga >gi568815597f:228635743_228843737|GENSCAN_predicted_peptide_4|258_aa MSQTSRKALEQFPEKIPYGTIRQIPQSAPDIRKKLQKLESGLQTPQQELINLAFKVYNNR EEVARQQRISELQLLASAVRQPTTTSPAYKKTSEHPSHSSQRLLKNILVDLVSNAKSLAT GPQNAHGPGFLLSRALSVQAPTGGQTVRLTLLLLLKPLEPKPNVPWPTPSRSDAAGSPRK PPGPSQMLWVTLTVEGQSYKLLWKSGCCYLQGDRKSEEEKSDTMPEQAASLLAQAFQNTS DAKCVWWFALTDHFSDTS >gi568815597f:228635743_228843737|GENSCAN_predicted_CDS_4|777_bp atgagccagacctccaggaaggcattagagcagttccctgagaagatcccctatggaact atcaggcagattccccagtcagctcctgacattagaaaaaagcttcaaaaattggaatcc ggccttcaaaccccacaacaggaattaatcaacctcgccttcaaggtgtacaataataga gaggaggtggccagacagcaacgcatttctgagttacagctacttgcctccgctgtaaga caacccacaaccacatctccagcatacaaaaaaacttcagaacatccaagccacagctcc cagaggctccttaaaaacatcctcgtggaccttgtttcaaatgccaaaagcctggccact gggcctcagaatgcccacggcccgggattcctcctgagccgtgccctgtctgtgcaggcc cccactggaggtcagactgtccgactcacattgctgctgctcctgaagcccctggagccc aaacccaatgttccctggccaactccttccagatctgatgctgccggatcacctcggaag cctcctggaccatcacagatgctttgggtaactctcacagtggagggtcaatcatataaa ctcctgtggaaatcgggctgctgctacctgcagggagacagaaagtcagaagaggagaaa agtgacactatgccagaacaggcagctagtctgctggctcaagccttccagaacacttct gatgccaaatgtgtgtggtggtttgccctcaccgaccatttctccgacaccagctaa >gi568815597f:228635743_228843737|GENSCAN_predicted_peptide_5|113_aa MKLQNQRGGRIFLQDIKKPDCDDWESGLNAMECALHLEKSVNQSLLELHKLATDKNDPHL CDFIETHYLNKQVKAIKELGEHVTNLCKMGAPESGSAEYLLDKHTLGDSDNES >gi568815597f:228635743_228843737|GENSCAN_predicted_CDS_5|342_bp atgaagctgcagaaccaacgaggtggccgaatcttccttcaggatatcaagaaaccagac tgtgatgactgggagagcgggctgaatgcgatggagtgtgcattacatttggaaaaaagt gtgaatcagtcactactggaactgcacaaactggccactgacaaaaatgacccccatttg tgtgacttcattgagacacattacctgaataagcaggtgaaagccatcaaagaattgggt gagcacgtgaccaacttgtgcaagatgggagcgcccgaatctggctcggcggaatacctc ttagacaagcacaccctgggggacagtgataatgaaagctaa >gi568815597f:228635743_228843737|GENSCAN_predicted_peptide_6|109_aa MEKNQSRKTGNSKKQSASPPPKECSSSPATEQSWTENDFDKLREEGFRQSNYSELWEDIQ TKGKEVENFEKNLEEYITRITNTEKCLKELMELKTKARELHEECRSLRS >gi568815597f:228635743_228843737|GENSCAN_predicted_CDS_6|330_bp atggagaaaaaccagagcagaaaaactggaaactctaaaaagcagagtgcctctcctcct ccaaaggaatgcagttcctcaccagcaacggaacaaagctggacagagaatgactttgac aagctgagagaagaaggcttcagacaatcaaattactccgagctatgggaagacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatatataactagaatc accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta catgaagaatgcagaagcctcaggagctga >gi568815597f:228635743_228843737|GENSCAN_predicted_peptide_7|360_aa MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS KIDHIVGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQNHSTTWKLNNLLLNDYWV HNEMKAEIKMFFETNENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQTYSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIK KKREKNQIGAIKNDKGDITTNSTEIQTTIREYYKHLYANKLENLEEMDKFLDIYTLPGLN QEEVESLNRPIIGAEIVAIINSLPTKKSPGPDGFTAKFYHRYKEELVPFLLKLFHSIEKE >gi568815597f:228635743_228843737|GENSCAN_predicted_CDS_7|1083_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagtcaac aaggatacccaggaattgaactcagctctgcaccaagcagacctaatagacatctacaga actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt ataacaaactatctctcagaccacagtgcaatcaaactggaactcaggattaagaatctc actcaaaaccactcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca acataccagaatctctgggacacattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacatattcaaaagctagcagaaggcaagaaataactaag atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaattaacgaatcc aggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaag aaaaaaagagagaagaatcaaataggagcaataaaaaatgataaaggggatatcaccacc aattccacagaaatacaaactaccataagagaatactacaaacacctctatgcaaataaa ctagaaaatctagaagaaatggataaattccttgacatatacactctcccaggactaaac caggaagaagttgaatctctgaatagaccaataataggagctgaaattgtggcaataatc aatagcttaccaaccaaaaagagtccaggaccagatggattcacagccaaattctaccac aggtacaaggaggaactggtaccattccttttgaaactattccactcaatagaaaaagag taa >gi568815597f:228635743_228843737|GENSCAN_predicted_peptide_8|49_aa MMFYTQVQRSGIQFQNTDGRVDAAILGAVALSFCSLSKGLVMSNPEHLP >gi568815597f:228635743_228843737|GENSCAN_predicted_CDS_8|150_bp atgatgttctacacacaggtacagaggtctgggattcagttccagaacacggatggccgt gtggatgctgcaatcctgggggctgtggccctgagcttctgcagcctctcaaagggacta gtgatgagcaaccctgagcatctcccttga >gi568815597f:228635743_228843737|GENSCAN_predicted_peptide_9|319_aa MPPQQGDPAFPDRCEAPPVPPRRERGGRGGRGPGEPGGRGRAGGAEGRGVKCVLVGDGAV GKTSLVVSYTTNGYPTEYIPTAFDNFSAVVSVDGRPVRLQLCDTAGQTFRTNFKTFEEEP SSKTMKNKLLNQRVFYHLPDPFMKNVKVIFLLMRIHENITFGYFAWLQDEFDKLRPLCYT NTDIFLLCFSVVSPSSFQNVSEKWVPEIRCHCPKAPIILVGTQSDLREDVKVLIELDKCK EKPVPEEAAKLCAEEIKAASYIECSALTQKNLKEVFDAAIVAGIQYSDTQQQPKKSKSRT PDKMKNLSKSWWKKYCCFV >gi568815597f:228635743_228843737|GENSCAN_predicted_CDS_9|960_bp atgcccccgcagcagggggaccccgcgttccccgaccgctgcgaggcgcctccggtgccg ccgcgtcgggagcgcggtggacgcgggggacgcgggcctggggagccggggggccggggg cgtgcggggggtgccgaggggcgcggcgtcaagtgcgtgctggtcggcgacggcgcggtg ggcaagacgagcctggtggtgagctacaccaccaacggctaccccaccgagtacatccct actgccttcgacaacttctccgcggtggtgtctgtggatgggcggcccgtgagactccaa ctctgtgacactgccggacagacttttagaaccaacttcaaaacttttgaagaagaacca tcttcaaaaactatgaaaaataagctgctgaaccagcgggtcttctatcacctgccagac cctttcatgaaaaatgtgaaggtgatctttttactgatgagaattcatgaaaacataact tttgggtactttgcttggttgcaggatgaatttgacaagctgaggcctctctgctacacc aacacagacatcttcctgctctgcttcagtgtcgtgagcccctcatccttccagaacgtc agtgagaaatgggtgccggagattcgatgccactgtcccaaagcccccatcatcctagtt ggaacgcagtcggatctcagagaagatgtcaaagtcctcattgagttggacaaatgcaaa gaaaagccagtgcctgaagaggcggctaagctgtgcgccgaggaaatcaaagccgcctcc tacatcgagtgttcagccttgactcaaaaaaacctcaaagaggtctttgatgcagccatc gtcgctggcattcaatactcggacactcagcaacagccaaagaagtctaaaagcaggact ccagataaaatgaaaaacctctccaagtcctggtggaagaagtactgctgtttcgtatga >gi568815597f:228635743_228843737|GENSCAN_predicted_peptide_10|116_aa MDHNEAQRCICFSSGPDADGAPGLAGLLLGSRNTQHPVCVQWPRSVSNVAASSVQKWLQR RCVSRRLGLSGMKVLSAHEGFAKLSLPDAMMTMNSSSVAWIQREDNDTVMYNVNKK >gi568815597f:228635743_228843737|GENSCAN_predicted_CDS_10|351_bp atggaccataatgaggcccagagatgtatctgcttctccagtggaccagatgctgacgga gccccaggactggctggcttgctccttggttcccgcaatacccagcacccggtctgtgtg cagtggcctcggtcagtgtctaatgtggctgcatcgagcgtccaaaagtggctgcagagg aggtgtgtctctcggcgcttgggcctctctgggatgaaagttttaagtgcacacgaaggc tttgccaagctctctcttcctgatgccatgatgactatgaacagcagctccgtggcctgg atccagagggaggacaatgacaccgtgatgtataatgtgaacaagaaataa >gi568815597f:228635743_228843737|GENSCAN_predicted_peptide_11|76_aa MVAAAGRCGWSCTLHGAPQVLLQLPKPRLQTQASCSDSDKAAMATEVSGQKIDTPKILKH HDRISYSCNVQCKLIN >gi568815597f:228635743_228843737|GENSCAN_predicted_CDS_11|231_bp atggtggctgcagcagggaggtgtggatggagctgcacactccatggagctccccaggtg ctgctgcagctgcccaaaccacgactgcagacccaggcctcctgctctgactcagacaaa gccgccatggccacagaggtttccggccagaaaattgacacccccaagatcctgaaacac catgaccgtatctcttactcatgtaatgtccagtgtaagctgatcaactaa