GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:28:18 Sequence gi568815584r:33826959_34050752 : 223794 bp : 41.57% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 2201 2758 558 1 0 70 42 180 0.645 7.49 1.02 PlyA + 2767 2772 6 1.05 2.00 Prom + 4472 4511 40 -3.65 2.01 Init + 11503 11567 65 0 2 71 115 26 0.196 4.27 2.02 Intr + 18718 18850 133 1 1 84 55 140 0.136 10.03 2.03 Intr + 21089 21203 115 1 1 53 110 66 0.544 4.40 2.04 Intr + 24423 24488 66 1 0 8 75 116 0.044 0.26 2.05 Intr + 25397 25532 136 0 1 -12 87 101 0.069 -1.29 2.06 Intr + 26479 26535 57 1 0 60 81 92 0.225 2.78 2.07 Intr + 29233 29401 169 1 1 -15 71 95 0.076 -3.47 2.08 Term + 32246 32419 174 1 0 66 42 115 0.282 1.48 2.09 PlyA + 34042 34047 6 1.05 3.05 PlyA - 35099 35094 6 1.05 3.04 Term - 35629 35463 167 2 2 111 48 99 0.665 5.30 3.03 Intr - 41316 41240 77 2 2 59 115 55 0.516 3.54 3.02 Intr - 43065 42999 67 1 1 35 86 66 0.512 -1.86 3.01 Init - 47877 47667 211 0 1 60 113 132 0.777 11.89 3.00 Prom - 49344 49305 40 -4.55 4.00 Prom + 52771 52810 40 -6.75 4.01 Init + 59731 59931 201 0 0 74 65 97 0.121 4.92 4.02 Term + 71186 71407 222 1 0 87 42 296 0.996 20.83 4.03 PlyA + 71594 71599 6 1.05 5.00 Prom + 74938 74977 40 -2.45 5.01 Init + 84425 84539 115 1 1 79 4 143 0.869 5.42 5.02 Intr + 85261 85296 36 2 0 104 81 57 0.777 3.92 5.03 Intr + 85994 86082 89 0 2 105 -4 50 0.236 -3.73 5.04 Intr + 88377 88456 80 2 2 60 80 83 0.189 2.13 5.05 Intr + 91376 91506 131 2 2 73 73 56 0.263 2.02 5.06 Term + 91859 92013 155 0 2 -21 44 208 0.497 2.60 5.07 PlyA + 94007 94012 6 1.05 6.08 PlyA - 94375 94370 6 1.05 6.07 Term - 97835 97727 109 1 1 51 53 79 0.155 -2.30 6.06 Intr - 102254 102118 137 2 2 98 90 66 0.911 6.25 6.05 Intr - 104257 104138 120 1 0 88 75 175 0.660 15.97 6.04 Intr - 109650 109546 105 0 0 126 16 57 0.225 1.79 6.03 Intr - 124202 123438 765 2 0 -43 97 723 0.375 50.84 6.02 Intr - 130587 130450 138 0 0 74 34 154 0.138 8.34 6.01 Init - 130841 130791 51 2 0 50 74 3 0.666 -3.69 6.00 Prom - 131077 131038 40 -8.95 7.00 Prom + 131614 131653 40 -8.05 7.01 Init + 133356 133424 69 2 0 113 64 33 0.905 2.50 7.02 Intr + 134023 134182 160 0 1 133 29 96 0.893 6.84 7.03 Intr + 137415 137519 105 1 0 20 74 114 0.643 2.47 7.04 Intr + 138675 138760 86 1 2 74 48 118 0.322 5.02 7.05 Term + 151806 152180 375 2 0 52 55 146 0.512 1.45 7.06 PlyA + 153265 153270 6 1.05 8.03 PlyA - 156596 156591 6 1.05 8.02 Term - 161119 160924 196 0 1 69 41 156 0.685 4.90 8.01 Init - 163664 163612 53 2 2 71 111 18 0.721 3.08 8.00 Prom - 164387 164348 40 -6.75 9.03 PlyA - 164594 164589 6 1.05 9.02 Term - 173521 173360 162 1 0 29 52 255 0.921 12.95 9.01 Init - 177026 176799 228 2 0 56 56 149 0.840 6.92 9.00 Prom - 178434 178395 40 -5.75 10.05 PlyA - 179438 179433 6 1.05 10.04 Term - 181540 181423 118 0 1 53 44 131 0.290 2.23 10.03 Intr - 203199 202899 301 1 1 39 44 212 0.320 6.77 10.02 Intr - 206971 206789 183 2 0 67 93 103 0.120 7.54 10.01 Init - 213750 213708 43 0 1 71 75 45 0.033 2.13 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 73824 73887 64 2 1 69 64 81 0.897 5.16 S.002 Term - 119893 119732 162 1 0 69 37 138 0.815 3.75 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:33826959_34050752|GENSCAN_predicted_peptide_1|185_aa MDKFLDTYTLPRLNQEEAESLNRSITDSEIEATINGLPTKKSPGADRFTARFYQWYKQEL VPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDPKILSKILT NQIQQHIKNLIHHDQAGFIPGIQDWFNIHKSINVIHHINRTKDKNHMIISIDAEKAFDEI QQSSC >gi568815584r:33826959_34050752|GENSCAN_predicted_CDS_1|558_bp atggataaattcctggacacatacaccctcccaagactaaaccaggaagaagctgaatct ctgaatagatcaataacagactctgaaattgaggcaacaattaatggcctaccaaccaaa aaaagtccaggagcagacagattcacagccagattctaccagtggtacaaacaggagctg gtaccattccttctgaaactattccagtcaatagaaaaagagggaatcctccctaactca ttttatgaggccagcatcatcctgataccaaagcctggcagagacacaacgaaaaaagag aattttagaccaatatccctgatgaacattgatccaaaaatcctcagtaaaatactgaca aaccaaatccagcagcacatcaaaaaccttatccaccatgatcaagctggcttcatccct gggattcaagactggttcaacatacacaaatcaataaatgtaatccatcatataaataga accaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacgaaatt caacaatcttcatgctaa >gi568815584r:33826959_34050752|GENSCAN_predicted_peptide_2|304_aa MSSVSQLRSKVTRYEADYREQRWMEHMPWYNIRKAKMQHAAITEDRTRPARRQPPWGSSG KNSRPWLVGPETARRPYVLETHSQKSKDCIASTPALTFYRAKATASDDRVDGGADFAGIE SERMSRVQRLGRDGRMGIGNMMTPVTQGKKGSPRRHGVQEGRRAQEQVMGFRGYGLQPTW GTEALESSSHHHQAHGDLLQQPRETNASTVKKIMEEERGAAVLNGNSQGECLRRWHEPSG LTAGSGQDLGIQLQSWHAEENALLETIRESSYVELLMDTKARALQQKYLHRASDKPVMTD EFVC >gi568815584r:33826959_34050752|GENSCAN_predicted_CDS_2|915_bp atgagctctgtttcccagctgaggtcaaaagtcaccagatatgaagcagattatagagaa caaaggtggatggagcacatgccgtggtacaacattcggaaagccaagatgcagcatgca gccatcactgaggacaggactaggccagcaaggaggcagccaccatggggaagctcaggg aagaactccaggccctggctggttggaccagagactgctcgccgaccatatgttttggaa acccattctcaaaaatcaaaagactgtatagcatcaacaccagcattgactttctatcga gccaaagctacagcctcagatgaccgagtggatggtggtgctgattttgctggaatagag tcagagaggatgagcagagttcagcgtttgggaagagatgggagaatggggataggaaac atgatgaccccagtgacacagggcaagaaaggaagcccacgaaggcatggagtccaggaa gggagaagagcacaagagcaggtgatgggtttcaggggctacggcctgcagcctacctgg ggcactgaagctctggagagttcttcccaccaccaccaagctcatggtgatttgttacag cagccacgggaaactaatgcaagtactgtgaagaaaataatggaggaagagagaggggca gctgtgttgaatgggaatagccagggagagtgtctgaggagatggcatgagccctcaggg ctaacggcggggagtggccaagacctgggcattcagttgcagtcctggcatgcagaggaa aatgccctgttggaaactatcagagaatcatcctatgtggagcttctcatggatacaaaa gccagagccttacaacagaaatacttacacagagcctcagacaaacctgtaatgacagat gaatttgtctgttaa >gi568815584r:33826959_34050752|GENSCAN_predicted_peptide_3|173_aa MWNQRWKWVTGRGWNSLEGSEEDRKMWESLELPRYLFNDFDKNHDSDMNNKGQAEVDSDG DEELVGNWSKDLLLPDFSHAKQPIFLASERYQRNMLQDRGPNPDPKRGFLDLMQERIQDS IISLNILQQHCMEPHRGSGTYYCLGEGACMAGRVQTRALGTGSRARAPGLKTA >gi568815584r:33826959_34050752|GENSCAN_predicted_CDS_3|522_bp atgtggaatcaacgttggaaatgggtaacaggcagaggctggaacagtttggagggctca gaagaagacaggaaaatgtgggaaagtttggaacttcctagatacttgttcaatgacttt gacaaaaatcatgatagtgatatgaacaataagggccaggctgaggtggactcagatgga gatgaggaacttgttgggaactggagcaaagatctcttattacctgactttagccatgcc aagcagccaatatttctagcttctgaacgttaccaaagaaatatgttacaggacaggggt cccaatccagaccccaagagagggttcttggatctcatgcaagaaagaattcaagattct attatctctctgaacatcttgcagcagcattgcatggaaccccacagaggctctggtaca tattattgtcttggggaaggagcatgcatggctggaagagtccagacaagagccttaggc actggctccagggccagagcccctggtctaaaaacagcatga >gi568815584r:33826959_34050752|GENSCAN_predicted_peptide_4|140_aa MGPRRPEPVTGSQVKAMSSGPRLVGFVCIIEAACLHVSSGSRAVGSAVQYVTNKAPSFPC KKQTKQSGCQEEREEEEREEEEREEEEREEEEREEEEREEEEREEEEREEEEREEEEREE REEEEEESCCVIWSQRNIMV >gi568815584r:33826959_34050752|GENSCAN_predicted_CDS_4|423_bp atgggaccaaggaggccggaacctgtcactggaagccaggtcaaagccatgagcagtggc ccacggcttgttgggttcgtctgtatcattgaagctgcctgtctgcatgtatcctctggc tcgcgtgctgttggctctgcagtacaatatgtaaccaataaagctccctcctttccatgt aaaaaacaaacaaaacaaagtggatgccaggaggagagggaggaggaggagagggaggag gaggagagggaggaggaggagagggaggaggaggagagggaggaggaggagagggaggag gaggagagggaggaggaggagagggaggaggaggagagggaggaggaggagagggaggag agggaggaggaggaggaggaaagctgttgtgttatttggagccaaagaaatattatggtt taa >gi568815584r:33826959_34050752|GENSCAN_predicted_peptide_5|201_aa MPNPEIYYLPMVNIRALGPSRGIWAVKGIEALSVGLNEATVQAAVEAAEPTFLVSNNHSS TLYFYELNGFLSATYEYEYLALLSGYALPVMSTRLSLHIPSEVHTSRHNNLAIRPINNPT MTSKGSSERNSHTSPTLGQKLVGEEGMSKVVQSGAASVDVEAAASSPEDLSKIIDKGDYT KQQCRSNSLLLEEDAMYDFHS >gi568815584r:33826959_34050752|GENSCAN_predicted_CDS_5|606_bp atgcccaacccggagatttattatttgcctatggtaaatatccgagccctgggtccatcc cgtggaatatgggctgtaaagggcattgaggcccttagtgttgggttaaatgaagccaca gtccaggcagctgtggaggcagcagagcccaccttcctagtctctaataaccacagttct acgctttacttctatgagctcaatggttttcttagcgctacgtatgaatatgaatatttg gccctgttgagcggttatgccctgccagtgatgtccacacgtcttagtttgcacattccc agtgaggttcatacaagcagacacaacaatcttgcaattaggccaattaataatcctaca atgacctctaagggttcaagtgaaaggaacagtcacacatctccaactttaggtcaaaag ttagttggtgaggaaggtatgtcgaaagttgtgcagagtggagcagcaagtgttgatgtt gaagctgcagcaagttctccagaagatctatctaagatcattgataaaggtgactacact aaacaacagtgtagatcaaacagccttctgttggaagaagatgccatgtacgactttcac agctag >gi568815584r:33826959_34050752|GENSCAN_predicted_peptide_6|474_aa MQIIRPHPRPTESETLGARSRKDWDQLPEPQRPLAVERGPHSNGASDLQPLTYTVEEGKK ARSGGPGGGPGAEPVTRGVGRLKGRTSASPGVWPQSRQWWLPIPKRRPPTPCAALLAGPV RKRVVELRTTPAGSRRQIPSPESCEKLSLVPDAAAARVPWQPQVSEPRATLPAPRLRARV DRSLSGCTLGIPDLDSAGEMPLGHIMRLDLEKIALEYIVPCLHEVGFCYLDNFLGEVVGD CVLERVKQLHCTGALRDGQLAGPRAGVSKRHLRGDQITWIGGNEEGCEAISFLLSLIDRL VLYCGSRLGKYYVKERSKMNYQMGPLHQQGALMKAGPKVCDHGDALLTGHSQGAMVACYP GNGTGYVRHVDNPNGDGRCITCIYYLNKNWDAKLHGGILRIFPEGKSFIADVEPIFDRLL FFWSDRRNPHEVQPSYATRSTVVGSMTRIMEFARTVEASTPAVSTDRTASITWY >gi568815584r:33826959_34050752|GENSCAN_predicted_CDS_6|1425_bp atgcaaattatcaggccccatcccagacctactgaatcagaaactctaggggccaggagc aggaaagattgggatcagctacctgaaccccagagacccctggctgtagaaagagggcct cacagcaacggtgccagcgaccttcagccactgacatatactgttgaggaagggaagaag gcacgttctggcggtcccggaggcggtcccggggcggagcccgtgacgcgcggagtcggc cggcttaagggccggacctcagcgtctcccggagtctggccgcagtcgcggcagtggtgg cttcccatccccaaaaggcgccctccgactccttgcgccgcactgctcgccgggccagtc cggaaacgggtcgtggagctccgcaccactcccgctggttcccgaaggcagatcccttct cccgagagttgcgagaaactttcccttgtccccgacgctgcagcggctcgggtaccgtgg cagccgcaggtttctgaaccccgggccacgctccccgcgcctcggcttcgcgctcgtgta gatcgttccctctctggttgcacgctggggatcccggacctcgattctgcgggcgagatg cccctgggacacatcatgaggctggacctggagaaaattgccctggagtacatcgtgccc tgtctgcacgaggtgggcttctgctacctggacaacttcctgggcgaggtggtgggcgac tgcgtcctggagcgcgtcaagcagctgcactgcaccggggccctgcgggacggccagctg gcggggccgcgcgccggcgtctccaagcgacacctgcggggcgaccagatcacgtggatc gggggcaacgaggagggctgcgaggccatcagcttcctcctgtccctcatcgacaggctg gtcctctactgcgggagccggctgggcaaatactacgtcaaggagaggtctaagatgaat taccaaatggggcccctacatcagcagggtgccttgatgaaggcaggacctaaagtatgt gatcatggggatgccctgctcactggccacagtcaaggcgcaatggtggcttgctatccg ggaaatggaacaggttatgttcgccacgtggacaaccccaacggtgatggtcgctgcatc acctgcatctactatctgaacaagaattgggatgccaagctacatggtgggatcctgcgg atatttccagaggggaaatcattcatagcagatgtggagcccatttttgacagactcctg ttcttctggtcagatcgtaggaacccacacgaagtgcagccctcttacgcaaccagatcc actgtggttggcagtatgaccagaatcatggaatttgctagaactgtggaagcttctact cctgcagtaagcacagatcgcactgcctcaataacttggtattga >gi568815584r:33826959_34050752|GENSCAN_predicted_peptide_7|264_aa MVKPCLGRARWLMPVIPALWEAELSPVLITIISHCNSLLSDIPTSAVAPRQSILYMEATA FFLQVGAKVIAVFAIEMRLGAIGDFLAVEDAGLHFYCLKGGVGRGVENGFGGPHGRAHSL LGLADAPTRVGFPARALSPDSKAPGSEHRTSLEGCGLPVVCVCTPGSTRRLPQDFLGTGN FKKKVFDIVNFHVYSFLKLINLRLELTVIGWSSLPNPSFQMVCSPSLQKKNMPLTIQNPK IAHVPRVQKPPGHQMTGQLENARS >gi568815584r:33826959_34050752|GENSCAN_predicted_CDS_7|795_bp atggtgaaaccctgtctcggccgggcgcggtggctcatgcctgtaatcccagcactttgg gaggctgagctatcaccagttctgatcaccatcatttctcactgcaacagccttctctca gatatccctacttctgcagttgcaccacgtcaatccattctctacatggaagccacagcc ttcttcctacaggttggtgcaaaagtaattgctgtttttgccattgaaatgagattagga gccattggagactttttagctgtggaggatgcaggacttcatttttattgtttaaaaggt ggcgttggccgtggtgtagagaatggatttgggggtcctcatggcagggcacacagcctt ctgggcttggcagatgcacctacccgtgtgggcttccctgctcgagctctgagtcctgat agcaaagctccggggtctgagcacagaacttcactagaaggctgtggtcttcctgtggtg tgtgtgtgtacccctggaagtacacgaagacttcctcaggacttcttgggcactggcaat tttaagaaaaaagtttttgacattgtcaacttccatgtgtattctttcctaaaattaatt aatttgcgtttagaattgactgtgattggctggtcctccttacccaacccgtcttttcaa atggtttgctctcctagtttacaaaagaaaaatatgcctctcaccatccagaatcctaaa atagcgcatgttcccagggtacaaaaacctccaggacaccaaatgacaggacaattggag aatgcacgctcctga >gi568815584r:33826959_34050752|GENSCAN_predicted_peptide_8|82_aa MEENGRGPDPHPQSQGTWCFQYAGNFSTTAFNQKDEMLLQEHHRSDDGKATGVVQRSSEP LRTLSLLFLLHITVNLASSSGV >gi568815584r:33826959_34050752|GENSCAN_predicted_CDS_8|249_bp atggaagaaaatggaagagggcctgacccacaccctcaaagtcagggaacttggtgtttc caatatgcaggtaatttttcaaccactgcctttaaccaaaaggatgaaatgttactacaa gagcatcatagatcagatgacggtaaggccacaggggtagttcaaagatcatcagaacca ctccgaacattgtctttactgttcctgctgcacatcacagtaaatttagcatcctcctct ggagtataa >gi568815584r:33826959_34050752|GENSCAN_predicted_peptide_9|129_aa MDQKVSTISSSQTLLVNWTHRLLGDYHTSAQETPRVSNAEHLASGFRPITVKTVAVMPPP MLPSSTPKCPASWAGRGRINAMEELKRGPMGPEHQDHSILLERQGKASRKKEEEEEEEEE EEEEEEECI >gi568815584r:33826959_34050752|GENSCAN_predicted_CDS_9|390_bp atggatcagaaggtgtccaccatatccagttcccaaaccttactggttaactggactcac aggcttttaggagattatcacacgtcagcacaggagactccaagggtgagtaatgctgag catctggcctctgggtttcgtcccataactgtgaaaacagttgcagtaatgcccccacca atgcttccttcctcaacgcccaagtgtccagcttcatgggctggcaggggccgtataaat gccatggaagaactcaaaagggggcccatgggccctgaacaccaggaccattcaattctt ttagaaagacaaggcaaggcaagcagaaaaaaagaagaagaagaagaagaagaagaagaa gaagaagaagaagaagaagaatgcatttga >gi568815584r:33826959_34050752|GENSCAN_predicted_peptide_10|214_aa MDVPVMVDCCDGFVAHNRCLLNILKGMKITDYLFKQKLGHVSPQSPKKQTSRQDQTWQDF SSVKVYEKVARELKEGGQSSGKEKLDKPPPRNWESTSPTSPTPPPSGIIPRSAGYNRCLL VLESAESNCCLCTAFWLLQFGAVPSRTKQIAIPFPAPPHPLEHLAVFADFRLSPLGNTDV AGGHYPKRINIETENQLSHVLTHKWELNLGYTQT >gi568815584r:33826959_34050752|GENSCAN_predicted_CDS_10|645_bp atggatgtgccggtgatggttgattgttgtgatggttttgtagcacataataggtgtttg ctaaatattcttaagggaatgaagattacagattatttattcaagcaaaaactggggcat gttagtccccagtcccccaagaagcagacatcaagacaagatcaaacatggcaggatttt agtagtgtgaaagtctacgagaaagtggcaagggagctaaaggaaggaggccaatcctca ggcaaagagaagcttgacaaacctccaccaaggaactgggaaagtacttcccccacctcc ccaaccccacctcccagtgggatcattccaaggtcagcaggctacaaccgctgcctcctt gtgctggaaagcgcagaaagtaactgctgtctttgcacagcattctggcttcttcagttt ggggctgtcccttctcgaaccaaacaaattgccatccccttcccagcccctcctcacccc ctggaacatttggcagtgtttgcggactttcggttgtcaccattgggcaacacggatgtg gctggaggccattatcctaagcgaattaacatagaaacagaaaaccaattatctcatgtc ctcactcataagtgggagctaaatcttggctacacgcagacataa