GENSCAN 1.0 Date run: 4-Nov-116 Time: 04:45:27 Sequence gi568815585r:20088904_20289581 : 200678 bp : 44.96% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 Intr - 4289 4131 159 2 0 69 89 71 0.162 5.48 1.02 Intr - 10599 10406 194 1 2 53 7 103 0.086 -2.09 1.01 Init - 13034 12890 145 2 1 65 32 147 0.526 5.31 1.00 Prom - 18723 18684 40 -3.76 2.06 PlyA - 19273 19268 6 1.05 2.05 Term - 24368 24288 81 2 0 125 48 67 0.289 4.09 2.04 Intr - 29720 29525 196 1 1 78 56 101 0.355 5.32 2.03 Intr - 30327 30245 83 0 2 105 60 20 0.104 -0.66 2.02 Intr - 38772 38673 100 2 1 97 34 63 0.137 1.91 2.01 Init - 40046 39862 185 2 2 82 80 89 0.273 6.35 2.00 Prom - 46210 46171 40 -4.96 3.00 Prom + 47611 47650 40 -3.36 3.01 Init + 47832 47942 111 2 0 58 81 106 0.708 5.77 3.02 Term + 48655 48750 96 0 0 105 44 35 0.782 -1.23 3.03 PlyA + 49930 49935 6 1.05 4.09 PlyA - 50166 50161 6 1.05 4.08 Term - 54397 53078 1320 1 0 20 39 2355 0.877 215.37 4.07 Intr - 54564 54403 162 0 0 45 66 147 0.787 8.37 4.06 Intr - 55729 55601 129 1 0 100 91 12 0.682 3.59 4.05 Intr - 85224 85109 116 1 2 100 68 2 0.050 -0.43 4.04 Intr - 87518 87492 27 0 0 99 96 15 0.729 1.49 4.03 Intr - 88428 88288 141 1 0 96 71 163 0.951 15.72 4.02 Intr - 88910 88781 130 2 1 50 -11 68 0.101 -6.63 4.01 Init - 90559 90191 369 1 0 92 35 267 0.144 18.79 4.00 Prom - 90958 90919 40 -5.56 5.03 PlyA - 91696 91691 6 1.05 5.02 Term - 92835 92632 204 0 0 61 55 145 0.965 5.87 5.01 Init - 95741 95688 54 2 0 90 59 82 0.926 4.85 5.00 Prom - 98553 98514 40 -6.66 6.02 PlyA - 98660 98655 6 1.05 6.01 Sngl - 100678 99998 681 1 0 62 42 1047 0.992 93.69 6.00 Prom - 132412 132373 40 0.94 7.02 PlyA - 133090 133085 6 1.05 7.01 Sngl - 134577 133792 786 0 0 80 41 655 0.958 55.95 7.00 Prom - 137635 137596 40 -6.86 8.00 Prom + 137826 137865 40 -6.56 8.01 Init + 138952 139039 88 0 1 76 59 20 0.491 -1.30 8.02 Term + 142501 142769 269 2 2 142 41 147 0.690 11.06 8.03 PlyA + 146377 146382 6 1.05 9.04 PlyA - 147753 147748 6 1.05 9.03 Term - 148937 148821 117 2 0 67 48 73 0.453 -0.26 9.02 Intr - 152111 151887 225 2 0 66 58 132 0.608 6.18 9.01 Init - 158770 158582 189 1 0 68 70 134 0.880 8.61 9.00 Prom - 162170 162131 40 -5.56 10.09 PlyA - 163288 163283 6 1.05 10.08 Term - 164964 164729 236 1 2 130 53 29 0.526 0.08 10.07 Intr - 166546 166441 106 1 1 24 80 74 0.520 -0.01 10.06 Intr - 168161 167945 217 1 1 82 113 -25 0.017 -2.09 10.05 Intr - 175874 175720 155 2 2 91 48 87 0.317 3.87 10.04 Intr - 176453 176336 118 1 1 20 105 38 0.108 -0.83 10.03 Intr - 181502 181392 111 1 0 60 89 84 0.240 5.19 10.02 Intr - 196260 196088 173 0 2 65 96 38 0.011 1.04 10.01 Intr - 200235 200009 227 1 2 10 71 130 0.015 1.20 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 79802 79633 170 0 2 54 37 154 0.902 4.94 S.002 Init - 130263 130191 73 0 1 81 98 80 0.974 7.54 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:20088904_20289581|GENSCAN_predicted_peptide_1|166_aa MGQVFKGWWAASASPWFLWLYDKPSLAFEDLIPALCSAGPKLLSTKLRGKLPPGVLLAAS PRKQEKQGLSPKKTLPSCNVFPCPLLTRLAIATAGKGEILTGSSSSSTKNGRRYLQNILN AASWCPRPKILTLWAFTEKADQPLSVKNGEVFMCGRKGMGQQSSCQ >gi568815585r:20088904_20289581|GENSCAN_predicted_CDS_1|498_bp atggggcaggtgttcaaaggatggtgggcagcctcagcttcaccctggttcctgtggctg tacgacaaaccgtcactcgcctttgaagacctcatccccgcgctctgtagtgctggcccc aagctcctctctaccaagctgagaggaaaactgccccctggggttctgctggcagcctct ccacgaaaacaagaaaaacaaggcctgagccccaagaaaacccttccctcttgcaatgtc ttcccgtgccctttactgacaaggcttgccattgcgacggccggcaaaggggaaatactt acagggtccagctccagtagcacaaagaacggcagaaggtacttacagaatatcctcaat gctgcctcttggtgtccaagacctaaaatactgactctctgggcctttacagaaaaagct gaccaacccctgagtgtaaagaatggagaagtgttcatgtgtggccggaaaggtatggga cagcaatccagctgccag >gi568815585r:20088904_20289581|GENSCAN_predicted_peptide_2|214_aa MGNRAAARWPPPGKGPQLEGVEWSRGCFNKIPQMRWFTRRTVQLHSHRVLGVRNRRHLAA VRARLLSNFPSPMDTRGGILMFQLGGTRENQDLVQKKMRRPQSCRRLLRKALCTQEWKRC GSGPPGTNAKSLEAQLEKTGGPPGSLPGAPGSTRPSAHSEQRDGDPAEDPGHLGRPGAQD AARAVPSKLSVNSSAPIIRVEMQCGRVQEHGLRT >gi568815585r:20088904_20289581|GENSCAN_predicted_CDS_2|645_bp atggggaaccgggctgctgccaggtggccgccgccgggtaaaggaccacagctggagggg gtggagtggtcccgtggctgctttaacaaaatcccacagatgcggtggtttacacgccgc acagtacagctgcattcccacagagttctgggcgtccggaatcggcgccacctggctgca gtcagggcaaggttgctcagcaacttccccagccccatggacacccggggcggcatcctg atgtttcagttgggaggaaccagagagaatcaagacttggtccagaaaaaaatgcgccgg ccccagagctgccgccgactgttaaggaaggccctttgcacccaagaatggaagcggtgt gggtcgggcccgccaggcacaaacgcgaagtccctggaagcgcagctcgaaaagacaggc ggccctccaggctccctccctggcgctccgggctccacgcggccctcggcgcactcggag cagagggatggggaccccgccgaggatccaggccacctgggcaggccgggtgctcaggac gcagcccgggctgtcccatcgaagctttctgtcaactcttctgctcccatcatcagagtt gagatgcagtgcggtagagtccaggaacatgggcttcggacctga >gi568815585r:20088904_20289581|GENSCAN_predicted_peptide_3|68_aa MLWEKRGAGFSHLSMTTTAHLDSGVELSTVQDRYGFLAATVFPKLSIRTGVCTCRQYGKL PPICVWKS >gi568815585r:20088904_20289581|GENSCAN_predicted_CDS_3|207_bp atgctgtgggagaagagaggtgcaggattcagccacttgtccatgaccacaacggctcac ctggactcgggcgtggaactgtccaccgtacaggatcgttacgggttcctggccgcgact gtcttccccaaactgtccatcaggacaggtgtttgcacttgtcggcaatatgggaaactc ccaccaatttgtgtttggaagagctga >gi568815585r:20088904_20289581|GENSCAN_predicted_peptide_4|797_aa MVNSAMFYDIAEPLSHISSELAADKVPNIAGNIHAVRSGENGFGHKGSCFHRIIPGLMCQ GGDFTRHHDTGSKSIYGQKFDGENFILKHSGPGILCMANAGPDTNGSQVFICTAKTEWWA SSQLERRARPCRQARAGRADQAQRRSPTEGQALLGSEAWAPVTGPWAWDFAAGTVLHGSK MCLCFRHVPARGGHIGACLQTGGLRGEGGNQVARSLPVYQVSVWHCLKAWGWWGVLVHPW LQPKLPGAGCATKKEASSCEQAERGSCLPGDVYLTLAELGLGVGLQLVPPPPWAAFFGVT LLGQCLLVQVLGSCDACPVEKLPISPIPVPSRPLPPPMGADPALSRCSPVFMSIFLLQES EAMGDWSFLGRLLENAQEHSTVIGKVWLTVLFIFRILVLGAAAEDVWGDEQSDFTCNTQQ PGCENVCYDRAFPISHIRFWALQIIFVSTPTLIYLGHVLHIVRMEEKKKEREEEEQLKRE SPSPKEPPQDNPSSRDDRGRVRMAGALLRTYVFNIIFKTLFEVGFIAGQYFLYGFELKPL YRCDRWPCPNTVDCFISRPTEKTIFIIFMLAVACASLLLNMLEIYHLGWKKLKQGVTSRL GPDASEAPLGTADPPPLPPSSRPPAVAIGFPPYYAHTAAPLGQARAVGYPGAPPPAADFK LLALTEARGKGQSAKLYNGHHHLLMTEQNWANQAAERQPPALKAYPAASTPAAPSPVGSS SPPLAHEAEAGAAPLLLDGSGSSLEGSALAGTPEEEEQAVTTAAQMHQPPLPLGDPGRAS KASRASSGRARPEDLAI >gi568815585r:20088904_20289581|GENSCAN_predicted_CDS_4|2394_bp atggtcaattctgccatgttttatgacattgctgagcccttaagccacatctcttctgag ctagctgcagacaaagttccaaacatagcaggaaacattcatgctgtgaggtctggagag aatggatttggccataagggctcctgctttcacagaattattccagggcttatgtgccag ggtggtgacttcacacgccatcatgacactggcagcaagtccatctatgggcagaaattt gatggtgagaacttcatcctgaagcattcaggtcctggcatcttgtgcatggcaaatgct ggacccgacacaaatggttcccaggttttcatctgtactgccaaaactgagtggtgggcc agcagccagctggagcgcagggcgcggccgtgtcgtcaggcccgggctggcagggccgac caggctcaaaggcgcagccccacggaagggcaggcgctgctgggcagcgaggcctgggca ccggtcaccgggccttgggcctgggactttgccgccggcaccgtcctccacggctccaag atgtgtctctgcttccggcacgtgcccgcgagagggggccacattggggcgtgtctccag acagggggtctccgaggggagggcggcaaccaggtggcaagaagcctccctgtgtaccag gtctcagtgtggcactgcctgaaggcctgggggtggtggggtgtcctggttcacccttgg ctgcagccgaagctgcctggggcaggttgtgccaccaagaaagaggccagcagctgtgag caggctgagagaggaagctgcctgcctggggatgtttacctaacacttgctgaattgggc ctgggggtgggcctgcagctggtaccaccccctccttgggctgccttcttcggagtcaca cttctggggcagtgcctgctcgtccaggtcctggggagctgcgatgcctgtcctgtggag aagctgcccatcagccccatcccagtaccatccaggccgctgccgccgcccatgggtgcg gacccggcactcagccgttgcagcccggtgttcatgagcattttcctcttacaggaatct gaagcaatgggcgactggagctttctgggaagactcttagaaaatgcacaggagcactcc acggtcatcggcaaggtttggctgaccgtgctgttcatcttccgcatcttggtgctgggg gccgcggcggaggacgtgtggggcgatgagcagtcagacttcacctgcaacacccagcag ccgggctgcgagaacgtctgctacgacagggccttccccatctcccacatccgcttctgg gcgctgcagatcatcttcgtgtccacgcccaccctcatctacctgggccacgtgctgcac atcgtgcgcatggaagagaagaagaaagagagggaggaggaggagcagctgaagagagag agccccagccccaaggagccaccgcaggacaatccctcgtcgcgggacgaccgcggcagg gtgcgcatggccggggcgctgctgcggacctacgtcttcaacatcatcttcaagacgctg ttcgaggtgggcttcatcgccggccagtactttctgtacggcttcgagctgaagccgctc taccgctgcgaccgctggccctgccccaacacggtggactgcttcatctccaggcccacg gagaagaccatcttcatcatcttcatgctggcggtggcctgcgcgtccctgctgctcaac atgctggagatctaccacctgggctggaagaagctcaagcagggcgtgaccagccgcctc ggcccggacgcctccgaggccccgctggggacagccgatcccccgcccctgccccccagc tcccggccgcccgccgttgccatcgggttcccaccctactatgcgcacaccgctgcgccc ctgggacaggcccgcgccgtgggctaccccggggccccgccaccagccgcggacttcaaa ctgctagccctgaccgaggcgcgcggaaagggccagtccgccaagctctacaacggccac caccacctgctgatgactgagcagaactgggccaaccaggcggccgagcggcagcccccg gcgctcaaggcttacccggcagcgtccacgcctgcagcccccagccccgtcggcagcagc tccccgccactcgcgcacgaggctgaggcgggcgcggcgcccctgctgctggatgggagc ggcagcagtctggaggggagcgccctggcagggacccccgaggaggaggagcaggccgtg accaccgcggcccagatgcaccagccgcccttgcccctcggagacccaggtcgggccagc aaggccagcagggccagcagcgggcgggccagaccggaggacttggccatctag >gi568815585r:20088904_20289581|GENSCAN_predicted_peptide_5|85_aa MGPRRAWLGQMAQATQAAERDLNVNRYLYIHVHCSSVHNSQEVEGAQVPISEWTEKQNVV YACNGMLFSLKKEGISDTCYNMDEP >gi568815585r:20088904_20289581|GENSCAN_predicted_CDS_5|258_bp atggggccaaggagggcgtggctgggacagatggcgcaggccacgcaggccgctgagaga gatctcaatgtgaacagatatttgtacatccatgttcattgcagctctgttcacaacagc caagaggtggaaggagcccaagtgcccatcagcgaatggacagagaagcaaaatgtagtc tatgcatgcaatggaatgttattcagccttaaaaaggaaggaatttctgacacatgctac aacatggatgaaccttga >gi568815585r:20088904_20289581|GENSCAN_predicted_peptide_6|226_aa MDWGTLQTILGGVNKHSTSIGKIWLTVLFIFRIMILVVAAKEVWGDEQADFVCNTLQPGC KNVCYDHYFPISHIRLWALQLIFVSTPALLVAMHVAYRRHEKKRKFIKGEIKSEFKDIEE IKTQKVRIEGSLWWTYTSSIFFRVIFEAAFMYVFYVMYDGFSMQRLVKCNAWPCPNTVDC FVSRPTEKTVFTVFMIAVSGICILLNVTELCYLLIRYCSGKSKKPV >gi568815585r:20088904_20289581|GENSCAN_predicted_CDS_6|681_bp atggattggggcacgctgcagacgatcctggggggtgtgaacaaacactccaccagcatt ggaaagatctggctcaccgtcctcttcatttttcgcattatgatcctcgttgtggctgca aaggaggtgtggggagatgagcaggccgactttgtctgcaacaccctgcagccaggctgc aagaacgtgtgctacgatcactacttccccatctcccacatccggctatgggccctgcag ctgatcttcgtgtccacgccagcgctcctagtggccatgcacgtggcctaccggagacat gagaagaagaggaagttcatcaagggggagataaagagtgaatttaaggacatcgaggag atcaaaacccagaaggtccgcatcgaaggctccctgtggtggacctacacaagcagcatc ttcttccgggtcatcttcgaagccgccttcatgtacgtcttctatgtcatgtacgacggc ttctccatgcagcggctggtgaagtgcaacgcctggccttgtcccaacactgtggactgc tttgtgtcccggcccacggagaagactgtcttcacagtgttcatgattgcagtgtctgga atttgcatcctgctgaatgtcactgaattgtgttatttgctaattagatattgttctggg aagtcaaaaaagccagtttaa >gi568815585r:20088904_20289581|GENSCAN_predicted_peptide_7|261_aa MDWGTLHTFIGGVNKHSTSIGKVWITVIFIFRVMILVVAAQEVWGDEQEDFVCNTLQPGC KNVCYDHFFPVSHIRLWALQLIFVSTPALLVAMHVAYYRHETTRKFRRGEKRNDFKDIED IKKQKVRIEGSLWWTYTSSIFFRIIFEAAFMYVFYFLYNGYHLPWVLKCGIDPCPNLVDC FISRPTEKTVFTIFMISASVICMLLNVAELCYLLLKVCFRRSKRAQTQKNHPNHALKESK QNEMNELISDSGQNAITGFPS >gi568815585r:20088904_20289581|GENSCAN_predicted_CDS_7|786_bp atggattgggggacgctgcacactttcatcgggggtgtcaacaaacactccaccagcatc gggaaggtgtggatcacagtcatctttattttccgagtcatgatcctcgtggtggctgcc caggaagtgtggggtgacgagcaagaggacttcgtctgcaacacactgcaaccgggatgc aaaaatgtgtgctatgaccactttttcccggtgtcccacatccggctgtgggccctccag ctgatcttcgtctccaccccagcgctgctggtggccatgcatgtggcctactacaggcac gaaaccactcgcaagttcaggcgaggagagaagaggaatgatttcaaagacatagaggac attaaaaagcagaaggttcggatagaggggtcgctgtggtggacgtacaccagcagcatc tttttccgaatcatctttgaagcagcctttatgtatgtgttttacttcctttacaatggg taccacctgccctgggtgttgaaatgtgggattgacccctgccccaaccttgttgactgc tttatttctaggccaacagagaagaccgtgtttaccatttttatgatttctgcgtctgtg atttgcatgctgcttaacgtggcagagttgtgctacctgctgctgaaagtgtgttttagg agatcaaagagagcacagacgcaaaaaaatcaccccaatcatgccctaaaggagagtaag cagaatgaaatgaatgagctgatttcagatagtggtcaaaatgcaatcacaggtttccca agctaa >gi568815585r:20088904_20289581|GENSCAN_predicted_peptide_8|118_aa MAVNAGDSASFKEDSTPLTAPVLTLVGGQGREVSPVCKTQDARDTGDRSSSPTGSAFWCL HVLTKMCRQLLLITVVHAIVISISQVSRKPITQPIPCGSPGTYASAQVEGLSELKPAS >gi568815585r:20088904_20289581|GENSCAN_predicted_CDS_8|357_bp atggcagtaaacgcaggagactcagcaagttttaaggaggactcaactccactgactgct cctgtcctgactttggtgggtggacagggaagggaggtcagccccgtttgcaaaacacag gatgcccgtgacaccggagacaggtcttcttcaccgacaggaagtgccttctggtgcctg cacgttttaactaagatgtgtcgccaattacttttaattactgtcgtccacgctattgtc atcagcatttcacaagtttctcggaagcccatcacgcagcccataccctgcggttctccg gggacttatgcatcggcccaagttgagggtttgtctgaactgaaacccgcatcctag >gi568815585r:20088904_20289581|GENSCAN_predicted_peptide_9|176_aa MTVTPHVGTGDPGSCVNGSYGEQGFRAAPTPANDVVSLYPSHSPHAESPAPDGVGRPADT CRKAPNKIAMALFSSSPSQITVFGNMPGLRLPYHKTCILETGPGKKVNNNNNNNKKPSQK LMAVKAAFVVNEWDAYQQSRGDFSQGTQRGKGKEAPDSGETQATPPQQGVQAEHQQ >gi568815585r:20088904_20289581|GENSCAN_predicted_CDS_9|531_bp atgacagtcacgccccacgtggggactggggaccctggaagctgcgtcaacggctcctat ggggagcagggcttcagggcagcccccacacctgccaatgacgtcgtctctttgtacccg agccactcacctcacgcagagagtccagcccctgatggggttggcagacctgcagacacc tgcaggaaggcccccaacaaaatcgcaatggctctgttcagctcaagccctagccagata actgtctttggtaatatgcctgggctaagactcccgtaccataagacttgtattctggaa actggtccaggaaaaaaagtaaacaacaacaacaacaacaacaaaaagccatctcagaag ctcatggccgtaaaggctgcatttgtagttaatgaatgggatgcttatcagcagagcagg ggtgacttctcccaaggaacacagcgtggaaaggggaaagaggcccctgacagtggagag acccaagccacaccacctcagcaaggggtccaagctgagcatcagcagtga >gi568815585r:20088904_20289581|GENSCAN_predicted_peptide_10|447_aa XGGPGLQWQVRQLQLHLGEQILTVPSSPKSTGKLGSTAAVWPAVAPSRSSMKYAAPVEPP CCSWHNVSAIIPPSEENHPALLDNNSSLDHQSQGKEPWDPPALLHHLKDQPWNQRPVSWE QSTLQWSTFAAVNSTQPDGAMLCGATGASASASAAVPHPSRTPESSWQKQRFKFSTGLPS ENKRYSVASPGHHCAFKGSCQRTLRTSQPPSCQHLPEHLRLMLQGTSFPLIFDILPVLFL FLKAYPHPAASRVSSADPHTLRLPKYLLFFCTWPPTLCSPPILSGQLLTPSAMHSPADEM LRKLYVCDRMSGCWSSSSCPDRSKMELVHSEQNKTFINNFMELMSKKSILEPSLPDHGFQ IKNCLHDDSRFFPPEGQCTACFPRGLKTQRKIVHRHLHLLKGLSPGGSLPLHLSVNHATC GCHIYICPLHTSIRPVLTLTDSHWHQQ >gi568815585r:20088904_20289581|GENSCAN_predicted_CDS_10|1344_bp nggggaggcccaggtctgcagtggcaggttcggcagctgcagctgcacctgggagagcag atcctgactgttcccagctcccccaagagcacagggaagcttggatccacagctgcagtt tggccagctgtagccccatccaggagctccatgaagtatgcagccccagtcgagcctccg tgctgcagctggcataatgtcagtgccatcattcccccctctgaagagaatcacccagcg ctcctggataacaatagcagtctggatcaccaaagccagggaaaggaaccctgggaccca ccagccctgttgcaccatttgaaagaccagccctggaaccagcgacctgtgtcttgggag cagtccaccctccagtggtccacatttgcagcagtcaacagcacccagcctgatggagcc atgctctgtggggccaccggggccagtgccagtgccagtgctgctgttccccacccaagt agaacacccgagagcagctggcagaagcagcgcttcaaattttccaccgggctcccttct gagaacaaacgctattcagtggcgagccccggacaccactgcgctttcaaaggcagctgc cagaggacactcaggacttcacagccgccgagctgccagcaccttcctgagcacctgcga ctcatgctccagggaacctcattcccacttatctttgacattttgcctgtgcttttcctc ttcctcaaagcctatccacatcccgcagccagcagggtctcctctgctgatcctcacacg ctgagacttcccaagtacctgctgttcttctgcacttggccacccaccttgtgttctcca cccatcttgagcggtcagctcctgacaccatctgccatgcacagtcctgctgatgagatg ctaaggaaattatatgtgtgtgatagaatgtcaggatgctggtcaagctctagttgccca gacaggagcaagatggagctggttcacagtgagcagaacaagacttttataaacaatttc atggagctcatgagtaagaagtccatccttgagccttcactgccagaccatggattccag atcaagaattgcttgcatgatgactcaagattcttccctccagagggccaatgcacagca tgtttccccagaggtctgaaaacacagagaaagatcgtccacagacacctccacctgctt aaaggtctgagtcctgggggctcactgcctctccatctaagtgtcaaccacgcaacttgt ggctgtcacatctacatctgtcccctgcacacgtccatcaggccagtgctgactcttact gactctcactggcaccagcagtga