GENSCAN 1.0    Date run: 4-Nov-116    Time: 12:01:19

Sequence gi568815595f:13381246_13604680 : 223435 bp : 50.96% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.07 Intr -    487    360  128  2  2   49   64    73 0.024   1.40
 1.06 Intr -   5162   5030  133  2  1   46   89   144 0.934  10.52
 1.05 Intr -   7208   7058  151  1  1   76   86   198 0.978  18.56
 1.04 Intr -  10062   9966   97  1  1  100   91   136 0.993  14.17
 1.03 Intr -  16243  16112  132  2  0   72  100   320 0.501  32.22
 1.02 Intr -  18616  18480  137  0  2  100   61   222 0.810  20.91
 1.01 Init -  38981  38815  167  2  2   96   81   450 0.963  42.21
 1.00 Prom -  55566  55527   40                              -4.76

 2.03 PlyA -  56086  56081    6                               1.05
 2.02 Term -  66094  65834  261  1  0   96   36   220 0.693  13.13
 2.01 Init -  67146  67144    3  0  0  108   27     0 0.128  -4.10
 2.00 Prom -  70733  70694   40                              -3.66

 3.00 Prom +  77857  77896   40                              -2.66
 3.01 Init +  78395  78533  139  1  1   65  100    49 0.259   4.11
 3.02 Intr +  92149  92278  130  2  1   78   75    40 0.092   1.45
 3.03 Intr + 100001 100149  149  2  2   56   86   176 0.431  14.08
 3.04 Intr + 102219 102319  101  1  2   25   87   153 0.410   8.73
 3.05 Intr + 115491 115607  117  2  0   90   78    89 0.966   8.76
 3.06 Intr + 117268 117310   43  0  1  105   92    92 0.999   9.11
 3.07 Intr + 119468 119544   77  0  2  106   93   124 0.999  13.93
 3.08 Intr + 120626 120688   63  1  0   96  109    99 0.995  11.71
 3.09 Intr + 121639 121735   97  0  1   58   68   238 0.954  18.38
 3.10 Intr + 122849 123027  179  0  2   87   88   286 0.694  28.14
 3.11 Term + 123223 123438  216  0  0  138   46   282 0.998  26.14
 3.12 PlyA + 124067 124072    6                               1.05

 4.08 PlyA - 128290 128285    6                               1.05
 4.07 Term - 138688 138491  198  1  0   87   39    83 0.026   0.70
 4.06 Intr - 154532 154319  214  1  1   60   61   132 0.403   6.42
 4.05 Intr - 158881 158751  131  1  2  123   75    73 0.748   8.99
 4.04 Intr - 167148 167045  104  1  2   81   60    92 0.671   5.59
 4.03 Intr - 169013 168998   16  2  1  109  103   -12 0.184  -2.48
 4.02 Intr - 172702 172625   78  1  0   87   70    50 0.253   2.85
 4.01 Init - 176404 176351   54  1  0   47   84    89 0.248   3.69
 4.00 Prom - 178271 178232   40                              -6.16

 5.00 Prom + 182868 182907   40                              -6.46
 5.01 Init + 189111 190416 1306  2  1   87  116  1432 0.875 135.81
 5.02 Term + 192754 192776   23  2  2  114   41    18 0.290  -1.83
 5.03 PlyA + 196936 196941    6                               1.05

 6.04 PlyA - 196952 196947    6                               1.05
 6.03 Term - 199125 199015  111  0  0   27   42   129 0.230   0.76
 6.02 Intr - 203300 203267   34  1  1   91   84     9 0.184  -0.97
 6.01 Init - 203906 203812   95  2  2   93   40   111 0.419   4.68
 6.00 Prom - 222476 222437   40                              -2.46

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  20143  20136    8  1  2   75   89    12 0.800  -0.35
S.002 Intr +  22209  22312  104  1  2   94   89    77 0.851   8.29

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:13381246_13604680|GENSCAN_predicted_peptide_1|315_aa
MAARGRGLLLLTLSVLLAAGPSAAAAKLNIPKVLLPFTRATRVNFTLEASEGCYRWLSTR
PEVASIEPLGLDEQQCSQKAVVQARLTQPARLTSIIFAEDITTGQVLRCDAIVDLIHDIQ
IVSTTRELYLEDSPLELKIQALDSEGNTFSTLAGLVFEWTIVKDSEADRFSDSHNALRIL
TFLESTYIPPSYISEMEKAAKQGDTILVSGMKTGSSKLKARIQEAVYKNVRPAEVRLLIL
ENILLNPAYDVYLMVGTSIHYKVQKIRQGKITDDSRDDLGPQERLSVVGPIIQRQELGSA
LNICEDFHIQCLLQG
>gi568815595f:13381246_13604680|GENSCAN_predicted_CDS_1|945_bp
atggcggcgcggggccgggggctgctgctgctgacgctgtcggtgctgttggcggcgggc
ccctccgccgctgcggccaagctcaacatccccaaagtgctgctgcccttcacgcgggcc
acgcgcgttaacttcacgctggaggcctcggagggctgctaccgctggttgtccacccgg
ccggaggtggccagcatcgagccgctgggcctggacgagcagcagtgctcccagaaggca
gtggtgcaggcccgcctgacccagcctgcccgcctcaccagcatcatcttcgcagaggac
atcaccacaggccaggtcctgcgctgtgatgccattgtggacctcatccatgacatccag
atcgtctccaccacccgcgagctctacctggaggactcccccctggagctgaagatccag
gccctggactccgaagggaacaccttcagcactctggctggactggtcttcgagtggacg
attgtgaaggactccgaggcggacaggttctcagactcccacaatgcgctgcgaatcctc
actttcttggagtctacgtacatccctccttcttacatctcagagatggagaaggctgcc
aagcaaggggacaccatcctggtgtctgggatgaagaccgggagctccaagctcaaggct
cgcatccaggaggctgtctacaagaatgtacgccctgcagaagtcaggctgctgattttg
gaaaacatccttctgaacccggcctatgacgtctacctgatggtgggaacctccattcac
tacaaggtgcagaagatcaggcaagggaaaattacagatgatagcagggatgatctgggg
cctcaggagcgcctttctgttgttggcccaataattcaaagacaggaactgggcagtgcc
cttaatatctgcgaagacttccacattcagtgtctcttacaagga
>gi568815595f:13381246_13604680|GENSCAN_predicted_peptide_2|87_aa
MGRVRDLQPAMPEPPTHSMGSCVARASPTSTTPCSTAPSPIDHPRAEECERTAQDWQAAP
PAAPVRDPLGEASWAPESGGAMENLYV
>gi568815595f:13381246_13604680|GENSCAN_predicted_CDS_2|264_bp
atgggcagggttcgggacctgcagcccgccatgcctgagcctcccacccactccatgggc
tcctgtgtggcccgagcctccccgacgagcaccaccccctgctccacagcgcccagtccc
atcgaccacccaagggctgaggaatgcgagcgcacggcgcaggactggcaggcagctcca
cctgcagccccagtgcgggatccactaggtgaagccagctgggctcctgagtctggtggg
gccatggagaacctttatgtctag
>gi568815595f:13381246_13604680|GENSCAN_predicted_peptide_3|436_aa
MLQTLRKAVRGCRVSQSTVIKGRSGGDRATDRETTKSKCRESGLLGKKATYYTIAFMRNV
QNRHIYRSKMWINGFQWMPGGRKWVMTAKRLHTTQLYQHVPETRWPIVYSPRYNITFMGL
EKLHPFDAGKWGKVINFLKEEKLLSDSMLVEAREASEEDLLVVHTRRYLNELKWSFAVAT
ITEIPPVIFLPNFLVQRKVLRPLRTQTGGTIMAGKLAVERGWAINVGGGFHHCSSDRGGG
FCAYADITLAIKFLFERVEGISRATIIDLDAHQGNGHERDFMDDKRVYIMDVYNRHIYPG
DRFAKQAIRRKVELEWGTEDDEYLDKVERNIKKSLQEHLPDVVVYNAGTDILEGDRLGGL
SISPAGIVKRDELVFRMVRGRRVPILMVTSGGYQKRTARIIADSILNLFGLGLIGPESPS
VSAQNSDTPLLPPAVP
>gi568815595f:13381246_13604680|GENSCAN_predicted_CDS_3|1311_bp
atgctacagacactgagaaaggctgtgaggggatgcagagtttcacaaagcacagtgatc
aaaggccgtagcggtggggatcgtgctaccgacagagaaactacgaaatccaaatgcagg
gagagtggccttttaggaaaaaaggccacatattatacaattgcatttatgcgaaacgtc
cagaataggcacatctatagatccaaaatgtggattaatggcttccagtggatgccggga
gggaggaaatgggtaatgactgctaaaaggctacacacaacccagctgtaccagcatgtg
ccagagacacgctggccaatcgtgtactcgccgcgctacaacatcaccttcatgggcctg
gagaagctgcatccctttgatgccggaaaatggggcaaagtgatcaatttcctaaaagaa
gagaagcttctgtctgacagcatgctggtggaggcgcgggaggcctcggaggaggacctg
ctggtggtgcacacgaggcgctatcttaatgagctcaagtggtcctttgctgttgctacc
atcacagaaatcccccccgttatcttcctccccaacttccttgtgcagaggaaggtgctg
aggccccttcggacccagacaggaggaaccataatggcggggaagctggctgtggagcga
ggctgggccatcaacgtggggggtggcttccaccactgctccagcgaccgtggcgggggc
ttctgtgcctatgcggacatcacgctcgccatcaagtttctgtttgagcgtgtggagggc
atctccagggctaccatcattgatcttgatgcccatcagggcaatgggcatgagcgagac
ttcatggacgacaagcgtgtgtacatcatggatgtctacaaccgccacatctacccaggg
gaccgctttgccaagcaggccatcaggcggaaggtggagctggagtggggcacagaggat
gatgagtacctggataaggtggagaggaacatcaagaaatccctccaggagcacctgccc
gacgtggtggtatacaatgcaggcaccgacatcctcgagggggaccgccttggggggctg
tccatcagcccagcgggcatcgtgaagcgggatgagctggtgttccggatggtccgtggc
cgccgggtgcccatccttatggtgacctcaggcgggtaccagaagcgcacagcccgcatc
attgctgactccatacttaatctgtttggcctggggctcattgggcctgagtcacccagc
gtctccgcacagaactcagacacaccgctgcttccccctgcagtgccctga
>gi568815595f:13381246_13604680|GENSCAN_predicted_peptide_4|264_aa
MLMLLGGGCLAPELLSSRALPGQQCQHSQGGAPDSQMSNQKPDKPPRSHETYGLVVTTSH
LSSSHGPKVCLPPPPGVSLQPRTGVLQPLLSPGLFTRALIREQMGDPHPLDHTGQLAKPL
EVEKTPARRVGTVQPLCSKPENGQGEMSPYGLPDNFKEEAALNPTANFTQPKSAPRSSRL
PLPPASLRRDFSLHPATHREALLGWNSSSGRKPLSFQSTEFAGLLKGGLEALGPDTWRLC
SQWTSLGGREADGLILVGARGLAS
>gi568815595f:13381246_13604680|GENSCAN_predicted_CDS_4|795_bp
atgctgatgctcctgggaggaggctgccttgctccagagcttctgagctcgagggctctt
cctgggcaacaatgtcagcatagccaggggggcgcccccgattcacagatgagcaaccag
aagcccgacaagccacccaggtcccatgagacgtatggcctggtggtcaccacctcacat
ctgagcagcagccatgggcccaaagtgtgcctgcccccacccccaggggtctccctgcag
cccaggacaggggtgcttcaacccctcctttctccaggcctcttcaccagggccctgata
agggagcagatgggagacccacatcctctggaccacactggccagctggccaagcccctg
gaagtagagaagacccccgcaagaagagtgggcacagtgcagcctctctgcagcaagccc
gagaatgggcagggagagatgtcaccatacggcctgccagacaacttcaaggaagaggcg
gctctcaaccccacagccaacttcacgcagccgaaatcagctccacgctcctccaggctc
cccctaccacctgcatccctccggagagacttcagtctccatcctgccacccacagggaa
gccctcttgggctggaatagctcctcaggccgaaagcccctgagctttcagagcactgag
tttgctgggctgctcaagggtgggctagaagctctgggtccagacacttggaggctctgc
agccagtggacctctctaggaggaagagaagctgatggcttgattcttgtcggagctaga
ggacttgcaagctag
>gi568815595f:13381246_13604680|GENSCAN_predicted_peptide_5|442_aa
MVLLWEPAGAWLALGLALALGPSVAAAAPRQDCTGVECPPLENCIEEALEPGACCATCVQ
QGCACEGYQYYDCLQGGFVRGRVPAGQSYFVDFGSTECSCPPGGGKISCQFMLCPELPPN
CIEAVVVADSCPQCGQVGCVHAGHKYAAGHTVHLPPCRACHCPDAGGELICYQLPGCHGN
FSDAEEGDPERHYEDPYSYDQEVAEVEAATALGGEVQAGAVQAGAGGPPAALGGGSQPLS
TIQAPPWPAVLPRPTAAAALGPPAPVQAKARRVTEDSEEEEEEEEEREEMAVTEQLAAGG
HRGLDGLPTTAPAGPSLPIQEERAEAGARAEAGARPEENLILDAQATSRSTGPEGVTHAP
SLGKAALVPTQAVPGSPRDPVKPSPHNILSTSLPDAAWIPPTREVPRKPQVLPHSHVEED
TDPNSVHSIPRSSPEGGGLSSL
>gi568815595f:13381246_13604680|GENSCAN_predicted_CDS_5|1329_bp
atggtgctgctctgggagcctgcaggagcctggcttgctctgggcctggccctggccctg
ggccccagcgtggccgcagctgcccctcggcaggactgcacgggcgtggagtgcccgccg
ctggagaactgcattgaggaggcgctggagccgggtgcctgctgtgccacgtgtgtgcag
cagggctgcgcctgcgagggctaccagtactatgactgcctacagggtggcttcgtgcgc
ggccgcgtgcccgccggtcagtcctattttgtggacttcgggagcactgagtgctcctgc
ccaccaggcggcggcaagatcagctgccagttcatgctgtgcccggagctgccgcccaac
tgcatcgaggctgtagtggtggctgacagctgcccacagtgcggccaggtgggctgcgtc
cacgcgggccacaagtacgccgctggccacactgttcacctgccgccctgccgggcctgc
cactgccctgacgccggtggagagctcatctgctaccagctccccggttgccacgggaac
ttctcagatgccgaggagggtgaccccgagcgacactacgaagacccctacagctatgac
caggaggtggccgaggtggaagcagcaacagccctggggggtgaggtccaggcgggtgca
gtccaggcaggcgcagggggccccccagctgctctgggaggtgggagtcagccactgtcc
accatccaggcacccccctggccagctgtcctccccaggcccacagcggctgctgccctg
ggtcccccagccccagtgcaggccaaagctaggagagtgaccgaggacagtgaggaggaa
gaagaggaggaggaggagagagaggaaatggctgtcactgagcagctggcagcaggtggc
cacagggggctggatgggctgcccactacagccccagctggacccagtcttcctatccag
gaggagagggcagaagctggggcaagggcagaagctggggcaaggcctgaagagaacctc
atcctggatgcccaagccacgtcccgcagcactgggccggagggcgtgacgcatgcaccg
agcctgggcaaggctgctctcgtcccaactcaggccgtgcctggctctcccagggaccca
gtcaagcccagcccccacaacatcctgtccacatcactgcctgatgcagcctggatccca
cccacccgagaagtgcccaggaagccgcaagttctgccccattcccacgtggaggaggac
acagaccccaactctgtccattctatccccagaagtagccctgaaggcggtggcctgagc
tccctgtag
>gi568815595f:13381246_13604680|GENSCAN_predicted_peptide_6|79_aa
MPPPAKQPLARPLASLLLHPGSRGHTVTDEELPGVELPLLLVQNEERDILMTAGVSKIQY
TLQGEQVNNVYQGYKNTSI
>gi568815595f:13381246_13604680|GENSCAN_predicted_CDS_6|240_bp
atgcccccaccagccaagcaacctctggcaagacccttggcctccctgctcctgcatcca
ggctccaggggccacacggtgacagatgaggaattgccaggggtggagctcccactgttg
ctggtccagaacgaggaaagagacattctcatgactgcaggtgtaagtaaaatccagtac
accctccagggcgaacaagtgaacaacgtctaccagggctataaaaatacgtccatataa