GENSCAN 1.0 Date run: 3-Nov-116 Time: 17:40:23

Sequence gi568815593r:149405482_149650964 : 245483 bp : 45.20% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1560   1597   38  1  2   71  121    38 0.296   3.38
 1.02 Intr +  15000  15122  123  2  0  105   37    91 0.716   6.48
 1.03 Term +  15200  15349  150  1  0  103   53   123 0.478   8.21
 1.04 PlyA +  20153  20158    6                               1.05

 2.13 PlyA -  20752  20747    6                               1.05
 2.12 Term -  26023  25974   50  2  2   84   54    62 0.133  -0.13
 2.11 Intr -  41875  41741  135  2  0  119   60    13 0.027   2.14
 2.10 Intr -  66613  66441  173  0  2   66   78   102 0.004   6.59
 2.09 Intr -  98638  98524  115  2  1   61   82    15 0.237  -2.29
 2.08 Intr - 100114  99966  149  0  2   70   96    65 0.903   5.38
 2.07 Intr - 101652 101546  107  0  2   57   82    60 0.895   1.31
 2.06 Intr - 104472 104398   75  0  0   95  115    34 0.901   6.51
 2.05 Intr - 106391 106313   79  1  1   90   92   -44 0.786  -4.25
 2.04 Intr - 107728 107589  140  1  2   57  107   103 0.949   8.36
 2.03 Intr - 114907 114809   99  1  0  133   96   -20 0.912   3.61
 2.02 Intr - 119690 119564  127  1  1  106   91    27 0.895   5.48
 2.01 Init - 123102 123074   29  0  2   96   60    23 0.597  -0.63
 2.00 Prom - 134751 134712   40                              -3.86

 3.07 PlyA - 138088 138083    6                               1.05
 3.06 Term - 140350 140081  270  1  0   39   38   217 0.709   7.28
 3.05 Intr - 144340 144233  108  1  0   77   50    97 0.324   5.28
 3.04 Intr - 144469 144457   13  0  1   47   95     7 0.498  -7.92
 3.03 Intr - 144700 144594  107  1  2  137  107    98 0.730  15.61
 3.02 Intr - 145484 145361  124  1  1  100   74   200 0.847  20.39
 3.01 Init - 157580 157426  155  2  2   90   75    36 0.461   1.96
 3.00 Prom - 161121 161082   40                              -3.16

 4.00 Prom + 161388 161427   40                              -5.76
 4.01 Init + 169107 169293  187  2  1   44   66   107 0.705   3.32
 4.02 Intr + 175976 176143  168  0  0   35  100    72 0.340   3.02
 4.03 Term + 176899 176966   68  2  2   39   29   138 0.408   1.10
 4.04 PlyA + 177090 177095    6                               1.05

 5.03 PlyA - 178165 178160    6                               1.05
 5.02 Term - 193048 192773  276  1  0  -52   47   519 0.548  29.26
 5.01 Init - 193806 193771   36  0  0   51   35    62 0.535  -4.06
 5.00 Prom - 194909 194870   40                              -3.56

 6.00 Prom + 194919 194958   40                             -10.74
 6.01 Init + 194978 194983    6  1  0   79   66     4 0.547  -1.96
 6.02 Intr + 195627 195750  124  2  1  110   99   111 0.905  14.56
 6.03 Intr + 204067 204214  148  2  1  129   59   270 0.743  27.49
 6.04 Intr + 211086 211285  200  0  2   76   82   267 0.587  23.89
 6.05 Intr + 212695 212825  131  2  2  108   84   204 0.996  22.41
 6.06 Intr + 213457 213561  105  0  0   48   94   157 0.991  12.71
 6.07 Intr + 214873 214983  111  0  0   43   59    82 0.721   1.38
 6.08 Intr + 216252 216581  330  2  0  108  110   353 0.981  35.33
 6.09 Intr + 218531 218659  129  1  0   70   82    64 0.848   4.89
 6.10 Intr + 221595 221790  196  2  1   80   71   256 0.965  22.09
 6.11 Intr + 223328 223485  158  0  2   87  109    60 0.996   7.73
 6.12 Intr + 226501 226639  139  0  1  108  -14   190 0.468  10.84
 6.13 Intr + 229628 229741  114  0  0   65   86    44 0.268   2.32
 6.14 Term + 230060 230094   35  0  2  110   37    22 0.360  -2.95
 6.15 PlyA + 230717 230722    6                               1.05

 7.04 PlyA - 230979 230974    6                               1.05
 7.03 Term - 233014 232679  336  1  0   61   49   134 0.668   1.37
 7.02 Intr - 235667 235518  150  2  0   53  119    66 0.896   6.56
 7.01 Intr - 239201 239108   94  1  1   42   65   100 0.602   3.07

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:149405482_149650964|GENSCAN_predicted_peptide_1|103_aa
XPKEPHLDQTPGKTCGGNPTALEPKESNGCNKCTGLSALRQGNRGQRGPSRKSWMWPIPS
PSCFLEMRKSRKGHRGFRNDRGDDSFDVWSSSRPLPTPKRKLD

>gi568815593r:149405482_149650964|GENSCAN_predicted_CDS_1|312_bp
ngccccaaggagccacacctggaccagaccccaggaaagacatgtggtggaaaccctacg
gccctggagcccaaggagagcaacggctgtaacaagtgcacaggtttgagtgcgctgaga
caggggaaccgtgggcaaagaggtccctccagaaagagctggatgtggcccattcccagc
ccctcgtgtttcttggaaatgagaaagtccaggaaagggcacaggggctttcgcaacgac
cgtggcgatgatagctttgacgtgtggtcatccagccggccactgcccacacccaagaga
aagctggattga

>gi568815593r:149405482_149650964|GENSCAN_predicted_peptide_2|425_aa
MEVKVETKNKWYGQEKDYNVLVMDLLGPSLEDLFNFCSRRFTMKTVLMLADQMISRIEYV
HTKNFIHRDIKPDNFLMGIGRHCNKLFLIDFGLAKKYRDNRTRQHIPYREDKNLTGTARY
ASINAHLGIEQSRRDDMESLGYVLMYFNRTSLPWQGLKAATKKQKYEKISEKKMSTPVEV
LCKGFPAEFAMYLNYCRGLRFEEAPDYMYLRQLFRILFRTLNHQYDYTFDWTMLKQKAAQ
QAASSSGQGQQAQTPTGKQTDKTKSNMKVGIVAPFECFNHRITNHSAKKFIMCYVANVTE
KQSYHFSLPPVNGHHPGRHPREGAMLQVKTALSCQSVVPGTCRSPSSPSPTVIGSLLGSW
VREGGGEHNAQACLIGVMVMNLDLFMVQFGPLRDSPGTFANLSGGAVGRDLRKEGFEDEP
ACGSK

>gi568815593r:149405482_149650964|GENSCAN_predicted_CDS_2|1278_bp
atggaggtgaaggtagaaactaagaataagtggtatggtcaggaaaaagactacaatgta
ctagtcatggatcttctgggacctagcctcgaagacctcttcaatttctgttcaagaagg
ttcacaatgaaaactgtacttatgttagctgaccagatgatcagtagaattgaatatgtg
catacaaagaattttatacacagagacattaaaccagataacttcctaatgggtattggg
cgtcactgtaataagttattccttattgattttggtttggccaaaaagtacagagacaac
aggacaaggcaacacataccatacagagaagataaaaacctcactggcactgcccgatat
gctagcatcaatgcacatcttggtattgagcagagtcgccgagatgacatggaatcatta
ggatatgttttgatgtattttaatagaaccagcctgccatggcaagggctaaaggctgca
acaaagaaacaaaaatatgaaaagattagtgaaaagaagatgtccacgcctgttgaagtt
ttatgtaaggggtttcctgcagaatttgcgatgtacttaaactattgtcgtgggctacgc
tttgaggaagccccagattacatgtatctgaggcagctattccgcattcttttcaggacc
ctgaaccatcaatatgactacacatttgattggacaatgttaaagcagaaagcagcacag
caggcagcctcttccagtgggcagggtcagcaggcccaaacccccacaggcaagcaaact
gacaaaaccaagagtaacatgaaagttgggatagttgctccttttgagtgttttaatcat
cgtataactaatcatagtgccaagaagttcataatgtgttatgtagctaatgtcactgaa
aaacagtcctaccattttagccttccccctgtgaatggacatcacccgggccggcacccc
cgagaaggtgctatgctgcaggtaaagactgctttgtcctgccagagcgtggttcctgga
acttgtcgcagtcccagtagtccttcccccacagtgatcggctccctcttgggttcctgg
gtgcgggaaggagggggtgagcacaatgctcaggcctgtctaattggagtcatggtgatg
aatctggatttgttcatggttcagtttgggccattaagagacagccctgggacttttgct
aacttaagtggtggggcggtggggagggatctacgtaaggagggctttgaggatgaacca
gcctgtggaagtaaatga

>gi568815593r:149405482_149650964|GENSCAN_predicted_peptide_3|258_aa
MKMGADSQNCGNSGTTGSQTLMEHQAAELRFIELQMNYVNEPRGEGDDLSQMMASSSGSK
AEFIVGGKYKLVRKIGSGSFGDIYLAINITNGEEVAVKLESQKARHPQLLYESKLYKILQ
GGVGIPHIRWWFPLCVQVIFDPYAEGLAAAFFTAALLMYLGLSSEVDLKALGEPRLKVQT
WPSPRTTPHTTSPENGTEMVGSRFLRNMHFAKKHKKGLKKMQANNAKAMSALAEASRALV
KPKEVKPKIPKGVSCELD

>gi568815593r:149405482_149650964|GENSCAN_predicted_CDS_3|777_bp
atgaagatgggtgctgactctcaaaactgtgggaattcaggcacaactggaagtcagacc
ctgatggaacaccaagcggcagaattgagatttattgagctgcagatgaactatgtaaat
gagccaaggggggagggggatgatttaagtcaaatgatggcgagtagcagcggctccaag
gctgaattcattgtcggagggaaatataaactggtacggaagatcgggtctggctccttc
ggggacatctatttggcgatcaacatcaccaacggcgaggaagtggcagtgaagctagaa
tctcagaaggccaggcatccccagttgctgtacgagagcaagctctataagattcttcaa
ggtggggttggcatcccccacatacgctggtggtttcctttatgcgtgcaggtcatcttt
gatccctacgcggaagggttggctgctgccttttttacggcggcacttctgatgtactta
ggtttgtccagtgaagtggatttgaaggcacttggagagccgcggcttaaggtacagaca
tggccaagtccacgaaccacaccacacacaaccagtcctgaaaatggcacagaaatggtg
ggctccaggttcctgaggaacatgcactttgccaagaagcacaagaagggcctaaagaag
atgcaggccaacaatgccaaggcaatgagtgcacttgctgaggctagcagggccctcgta
aagcccaaggaggttaagcccaagatcccaaagggtgtcagctgcgaactcgattga

>gi568815593r:149405482_149650964|GENSCAN_predicted_peptide_4|140_aa
MYNGYASDLLVYIASPNFPLSEPYIHLPVDTFTCIAQVSASPKFVSSIVHLTIQDRNPRV
SPGSCVGIAAPASALGGHSHPERRGCRELGDAPLLPRLRCCPGGRPPCVSAGRHLARAGR
QLVEEYLLCVQTASCVIQEA
>gi568815593r:149405482_149650964|GENSCAN_predicted_CDS_4|423_bp
atgtacaacggatatgccagtgacctcctagtctatattgccagtccaaacttccctctc
tcagaaccttacatccatcttcctgttgacaccttcacttgcatagcacaggtatcagct
agcccaaaatttgtctcatccatcgtccacctgactatccaagacagaaatcccagagtc
agcccaggcagctgcgtgggcatagccgcgcccgccagcgccctcggtgggcacagccac
ccggagcggcgcgggtgccgggagctgggcgacgcgcccctcctgccccggctgcgctgt
tgcccgggcggccgacctccgtgcgtgagcgccggcaggcaccttgcgcgcgcggggagg
cagctggtggaggagtacctgctgtgcgtccagacagccagctgtgtgatccaggaagca
tag

>gi568815593r:149405482_149650964|GENSCAN_predicted_peptide_5|103_aa
MTPVFGGLGAVGTLEGEAGKKEEEEEEEEEEKERRGRRKKRKEKKEKEKKEKRKRKERKR
KKRKRKKRKRKKKRKRKRKKEEEEEEEEEEEEEEEEEEEEESV

>gi568815593r:149405482_149650964|GENSCAN_predicted_CDS_5|312_bp
atgacacctgtcttcgggggcctgggggctgtgggcacccttgaaggggaagcaggaaaa
aaagaagaagaggaggaagaagaagaagaggaaaaagaaagaagaggaagaagaaagaag
aggaaggagaagaaggagaaggagaagaaggagaagaggaagaggaaggagaggaagagg
aagaagaggaagaggaagaagaggaagaggaagaagaagaggaagaggaagaggaagaaa
gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa
gagtcagtttaa

>gi568815593r:149405482_149650964|GENSCAN_predicted_peptide_6|641_aa
MVLPQGDLDVLFSNIDDIIKVNSRFLHDLQETASKEEEQVQLVGNIFLEFQEELEQVYKV
YCASYDQALLLVDTYRKEPELQRHIQGIVEAVVPQAGSSGLSFLLVIPLQRITRYPLLLQ
KILENTVPDASAYPVLQRAVSALQDVNTNINEYKMRKEVASKYTKVEQLTLRERLARINT
HTLSKKTTRLSQLLKQEAGLIPRTEDKEFDDLEERFQWVSLCVTELKNNVAAYLDNLQAF
LYFRPHEYNLDIPEGPAVQYCNLARDLHLEAFLKFKQRLEGLVWQPLCSLAKALLGPQNL
IKKRLDKLLDFERVEEKLLEVGSVTYQEEAARHTYQALNSLLVAELPQFNQLVMQWLGQI
MCTFVTLQRDLAKQVLQRAEGSMAQLPHHHVPEPAFRKLVEDALGRTSNQLRSFQETFEK
VQPPPTTQPLLPGSERQVQALLSRYGPGKLYQVTSNISGTGTLDLTLPRGQIVAILQNKD
TKGNSGRWLVDTGGHRGYVPAGKLQLYHVVPSAEELRRQAGLNKDPRCLTPEPSPALVPS
IPTMNQVIAAYPFVARSSHEVSLQAGQPVTILEAQDKKGNPEWSLVEVNGQRGSWPELLG
DTERNQTDEARSSRSQSLRQADEAPINYVPCNCMLLEAENF

>gi568815593r:149405482_149650964|GENSCAN_predicted_CDS_6|1926_bp
atggtgttgccgcagggagatctggatgtcctgttctcaaacattgatgatatcatcaaa
gtgaacagcagattcctccatgatctgcaggagacagcctccaaggaagaggaacaagtg
cagctagttggtaacatatttctggaattccaagaggagttggagcaagtctataaggtc
tactgtgccagctacgaccaggccttgctactggtggacacgtaccggaaggagccggag
ctgcagcggcacatccagggcatcgttgaggcggtggtgccgcaagctggatcttcaggc
ctcagtttcttgctggtaattcctctgcagaggatcaccaggtacccactgctgctgcag
aaaatcctggagaacacagtccctgatgccagtgcctatcctgtccttcagagggctgtc
tctgccctccaggacgtgaacaccaatatcaatgagtacaagatgcgcaaggaagtggcc
tccaagtacaccaaggtagagcagctgaccctccgggagcggctggcccgcatcaacaca
cacaccctctccaagaagaccacccggctgagccagctgctgaagcaggaggcggggctg
atccccaggacagaagacaaggaatttgatgatttagaagagaggttccagtgggtgtct
ctgtgtgtgactgagctgaagaacaacgtggctgcttacctggacaatctgcaggctttc
ctctacttcaggccgcacgaatacaatctggacatccccgaggggcctgcagtgcagtat
tgcaatttggcaagagaccttcaccttgaggccttcctgaaatttaagcaacggctagaa
ggcctggtgtggcagccactgtgcagcctggccaaagccctgcttggccctcagaacctg
atcaagaagcgtctggacaagctactggactttgagcgggtggaagagaagctgctggag
gtgggcagtgtgacctaccaggaggaggccgcccggcacacataccaggcactcaactcg
ctgctagtggctgagctcccacagtttaaccagctggtcatgcagtggctgggccagatc
atgtgcacattcgtgaccctccagagggaccttgcaaagcaagtgctgcagagggcagag
ggaagcatggcccagctgccccaccaccacgtcccagagcctgccttcaggaagctggtg
gaggacgcactgggccggacgagtaaccagcttcgctcctttcaagagacctttgagaaa
gtgcagccacctcccaccacacaaccgctccttccagggtctgaacgccaggtgcaggct
ctcctgagcaggtatggccctgggaagctgtaccaggtgacaagcaacatcagtgggact
gggactctggacctgactctgcctcggggccaaatcgtggccatccttcaaaacaaggac
accaaaggcaacagcggccgctggctggtggacaccgggggacatcgtgggtatgtgccg
gctgggaaactacagctgtaccatgtggtccccagtgcagaggagctcagaaggcaggcg
gggctgaacaaagacccccgatgtctaacaccggagcccagcccagctctagtgccctct
attcccaccatgaaccaggtcatagccgcgtacccttttgtggccagaagcagccatgaa
gtgagcctgcaggcaggccagcctgtgaccatcctggaggcccaggacaagaaggggaac
cctgagtggagcctggtggaagtgaatggacagagggggagctggcctgagctgctgggg
gatacagaaagaaatcagacagatgaggcccggtcttccagatcccagtctctaaggcag
gcagatgaggcacccatcaattacgtgccgtgcaactgcatgctccttgaggctgagaac
ttctga

>gi568815593r:149405482_149650964|GENSCAN_predicted_peptide_7|193_aa
XLIINHQGSLSTGFMPDPVLGVRDEQKIMKMSSAKVSRDVSCVHIPTTPSAGLFPGLPTR
RKVRKVRRQRDEATCLKAATQLLLLKPPADSWAPKLLRPLETPKRMLLAAKRPEPITTGT
APSVSSVPAYCLVTCPACPCADAVDARTPYSWREAGKPSGACAYEHCPSLAAPGMGEEGK
PQECLWGVTAAPE

>gi568815593r:149405482_149650964|GENSCAN_predicted_CDS_7|582_bp
nntctcatcatcaaccatcaaggtagcctaagcactggctttatgccagatcctgtgctg
ggtgtcagagatgaacagaagatcatgaaaatgtctagtgccaaagtctcccgtgatgta
tcctgtgtccacatcccaacaaccccatcggcaggactctttcctggtttgcccacaagg
aggaaagtgaggaaagtgaggcgtcagagagatgaagcaacttgtttgaaggcggccaca
cagctgcttctactgaagcctccagcagactcttgggccccaaagctgctgcgtccccta
gaaacacctaaaaggatgctcctggctgcaaagcgccctgagcctataaccacaggaact
gctccatcagtatccagtgtgcctgcttattgcctggttacctgcccggcttgtccttgt
gcagatgcagtggatgccagaacaccctacagctggagagaggcggggaaaccctcaggt
gcttgtgcctacgagcactgccccagcctggcagcaccagggatgggagaggaaggaaag
ccccaggaatgcctgtggggtgtgacagcagcccctgagtga