GENSCAN 1.0 Date run: 4-Nov-116 Time: 04:51:14 Sequence gi568815585r:20122698_20323480 : 200783 bp : 44.73% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4946 5094 149 1 2 67 78 143 0.361 8.76 1.02 Intr + 9611 9754 144 2 0 65 83 38 0.091 0.40 1.03 Intr + 13983 14148 166 0 1 70 81 77 0.153 5.16 1.04 Term + 14861 14956 96 1 0 105 44 35 0.778 -1.23 1.05 PlyA + 16136 16141 6 1.05 2.09 PlyA - 16372 16367 6 1.05 2.08 Term - 20603 19284 1320 2 0 20 39 2355 0.877 215.37 2.07 Intr - 20770 20609 162 1 0 45 66 147 0.787 8.37 2.06 Intr - 21935 21807 129 2 0 100 91 12 0.682 3.59 2.05 Intr - 51430 51315 116 2 2 100 68 2 0.050 -0.43 2.04 Intr - 53724 53698 27 1 0 99 96 15 0.729 1.49 2.03 Intr - 54634 54494 141 2 0 96 71 163 0.951 15.72 2.02 Intr - 55116 54987 130 0 1 50 -11 68 0.101 -6.63 2.01 Init - 56765 56397 369 2 0 92 35 267 0.144 18.79 2.00 Prom - 57164 57125 40 -5.56 3.03 PlyA - 57902 57897 6 1.05 3.02 Term - 59041 58838 204 1 0 61 55 145 0.965 5.87 3.01 Init - 61947 61894 54 0 0 90 59 82 0.926 4.85 3.00 Prom - 64759 64720 40 -6.66 4.02 PlyA - 64866 64861 6 1.05 4.01 Sngl - 66884 66204 681 2 0 62 42 1047 0.992 93.69 4.00 Prom - 98618 98579 40 0.94 5.02 PlyA - 99296 99291 6 1.05 5.01 Sngl - 100783 99998 786 1 0 80 41 655 0.958 55.95 5.00 Prom - 103841 103802 40 -6.86 6.00 Prom + 104032 104071 40 -6.56 6.01 Init + 105158 105245 88 1 1 76 59 20 0.491 -1.30 6.02 Term + 108707 108975 269 0 2 142 41 147 0.690 11.06 6.03 PlyA + 112583 112588 6 1.05 7.04 PlyA - 113959 113954 6 1.05 7.03 Term - 115143 115027 117 0 0 67 48 73 0.453 -0.26 7.02 Intr - 118317 118093 225 0 0 66 58 132 0.608 6.18 7.01 Init - 124976 124788 189 2 0 68 70 134 0.880 8.61 7.00 Prom - 128376 128337 40 -5.56 8.10 PlyA - 129494 129489 6 1.05 8.09 Term - 131170 130935 236 2 2 130 53 29 0.526 0.08 8.08 Intr - 132752 132647 106 2 1 24 80 74 0.520 -0.01 8.07 Intr - 134367 134151 217 2 1 82 113 -25 0.017 -2.09 8.06 Intr - 142080 141926 155 0 2 91 48 87 0.317 3.87 8.05 Intr - 142659 142542 118 2 1 20 105 38 0.108 -0.83 8.04 Intr - 147708 147598 111 2 0 60 89 84 0.239 5.19 8.03 Intr - 162466 162294 173 1 2 65 96 38 0.007 1.04 8.02 Intr - 166441 166215 227 2 2 10 71 130 0.019 1.20 8.01 Init - 169987 169912 76 1 1 88 68 127 0.906 9.98 8.00 Prom - 173271 173232 40 -2.56 9.02 PlyA - 173285 173280 6 1.05 9.01 Sngl - 179347 178679 669 1 0 55 35 201 0.862 7.69 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 46008 45839 170 1 2 54 37 154 0.902 4.94 S.002 Init - 96469 96397 73 1 1 81 98 80 0.974 7.54 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:20122698_20323480|GENSCAN_predicted_peptide_1|184_aa MGLGKLLSNLALRERRRACPSGAYPEWAAGPARASLRVNAGALRVRSAVSVPHVATKAAL SKPLIQSSFQALGTISYCPYEVFYRIHREGIPAGLPRRQENSSSIKSKALDNFPKVMLWE KRGAGFSHLSMTTTAHLDSGVELSTVQDRYGFLAATVFPKLSIRTGVCTCRQYGKLPPIC VWKS >gi568815585r:20122698_20323480|GENSCAN_predicted_CDS_1|555_bp atggggctggggaagttgctgagcaaccttgccctgcgagagcgaaggcgggcgtgtccc tcgggggcgtaccccgagtgggcagcagggccagcccgcgcgtccctcagggtcaatgct ggggccctgagggtgaggtcagctgtgagcgtacctcatgtggctacaaaagctgccctc agcaagcctctaatccagagctcattccaagcattgggtacaatttcttactgcccctat gaggtgttctacagaatccacagagagggaatcccggcagggctgccaaggagacaagaa aactcatcatctataaagtcaaaggcgttagataactttcccaaagtcatgctgtgggag aagagaggtgcaggattcagccacttgtccatgaccacaacggctcacctggactcgggc gtggaactgtccaccgtacaggatcgttacgggttcctggccgcgactgtcttccccaaa ctgtccatcaggacaggtgtttgcacttgtcggcaatatgggaaactcccaccaatttgt gtttggaagagctga >gi568815585r:20122698_20323480|GENSCAN_predicted_peptide_2|797_aa MVNSAMFYDIAEPLSHISSELAADKVPNIAGNIHAVRSGENGFGHKGSCFHRIIPGLMCQ GGDFTRHHDTGSKSIYGQKFDGENFILKHSGPGILCMANAGPDTNGSQVFICTAKTEWWA SSQLERRARPCRQARAGRADQAQRRSPTEGQALLGSEAWAPVTGPWAWDFAAGTVLHGSK MCLCFRHVPARGGHIGACLQTGGLRGEGGNQVARSLPVYQVSVWHCLKAWGWWGVLVHPW LQPKLPGAGCATKKEASSCEQAERGSCLPGDVYLTLAELGLGVGLQLVPPPPWAAFFGVT LLGQCLLVQVLGSCDACPVEKLPISPIPVPSRPLPPPMGADPALSRCSPVFMSIFLLQES EAMGDWSFLGRLLENAQEHSTVIGKVWLTVLFIFRILVLGAAAEDVWGDEQSDFTCNTQQ PGCENVCYDRAFPISHIRFWALQIIFVSTPTLIYLGHVLHIVRMEEKKKEREEEEQLKRE SPSPKEPPQDNPSSRDDRGRVRMAGALLRTYVFNIIFKTLFEVGFIAGQYFLYGFELKPL YRCDRWPCPNTVDCFISRPTEKTIFIIFMLAVACASLLLNMLEIYHLGWKKLKQGVTSRL GPDASEAPLGTADPPPLPPSSRPPAVAIGFPPYYAHTAAPLGQARAVGYPGAPPPAADFK LLALTEARGKGQSAKLYNGHHHLLMTEQNWANQAAERQPPALKAYPAASTPAAPSPVGSS SPPLAHEAEAGAAPLLLDGSGSSLEGSALAGTPEEEEQAVTTAAQMHQPPLPLGDPGRAS KASRASSGRARPEDLAI >gi568815585r:20122698_20323480|GENSCAN_predicted_CDS_2|2394_bp atggtcaattctgccatgttttatgacattgctgagcccttaagccacatctcttctgag ctagctgcagacaaagttccaaacatagcaggaaacattcatgctgtgaggtctggagag aatggatttggccataagggctcctgctttcacagaattattccagggcttatgtgccag ggtggtgacttcacacgccatcatgacactggcagcaagtccatctatgggcagaaattt gatggtgagaacttcatcctgaagcattcaggtcctggcatcttgtgcatggcaaatgct ggacccgacacaaatggttcccaggttttcatctgtactgccaaaactgagtggtgggcc agcagccagctggagcgcagggcgcggccgtgtcgtcaggcccgggctggcagggccgac caggctcaaaggcgcagccccacggaagggcaggcgctgctgggcagcgaggcctgggca ccggtcaccgggccttgggcctgggactttgccgccggcaccgtcctccacggctccaag atgtgtctctgcttccggcacgtgcccgcgagagggggccacattggggcgtgtctccag acagggggtctccgaggggagggcggcaaccaggtggcaagaagcctccctgtgtaccag gtctcagtgtggcactgcctgaaggcctgggggtggtggggtgtcctggttcacccttgg ctgcagccgaagctgcctggggcaggttgtgccaccaagaaagaggccagcagctgtgag caggctgagagaggaagctgcctgcctggggatgtttacctaacacttgctgaattgggc ctgggggtgggcctgcagctggtaccaccccctccttgggctgccttcttcggagtcaca cttctggggcagtgcctgctcgtccaggtcctggggagctgcgatgcctgtcctgtggag aagctgcccatcagccccatcccagtaccatccaggccgctgccgccgcccatgggtgcg gacccggcactcagccgttgcagcccggtgttcatgagcattttcctcttacaggaatct gaagcaatgggcgactggagctttctgggaagactcttagaaaatgcacaggagcactcc acggtcatcggcaaggtttggctgaccgtgctgttcatcttccgcatcttggtgctgggg gccgcggcggaggacgtgtggggcgatgagcagtcagacttcacctgcaacacccagcag ccgggctgcgagaacgtctgctacgacagggccttccccatctcccacatccgcttctgg gcgctgcagatcatcttcgtgtccacgcccaccctcatctacctgggccacgtgctgcac atcgtgcgcatggaagagaagaagaaagagagggaggaggaggagcagctgaagagagag agccccagccccaaggagccaccgcaggacaatccctcgtcgcgggacgaccgcggcagg gtgcgcatggccggggcgctgctgcggacctacgtcttcaacatcatcttcaagacgctg ttcgaggtgggcttcatcgccggccagtactttctgtacggcttcgagctgaagccgctc taccgctgcgaccgctggccctgccccaacacggtggactgcttcatctccaggcccacg gagaagaccatcttcatcatcttcatgctggcggtggcctgcgcgtccctgctgctcaac atgctggagatctaccacctgggctggaagaagctcaagcagggcgtgaccagccgcctc ggcccggacgcctccgaggccccgctggggacagccgatcccccgcccctgccccccagc tcccggccgcccgccgttgccatcgggttcccaccctactatgcgcacaccgctgcgccc ctgggacaggcccgcgccgtgggctaccccggggccccgccaccagccgcggacttcaaa ctgctagccctgaccgaggcgcgcggaaagggccagtccgccaagctctacaacggccac caccacctgctgatgactgagcagaactgggccaaccaggcggccgagcggcagcccccg gcgctcaaggcttacccggcagcgtccacgcctgcagcccccagccccgtcggcagcagc tccccgccactcgcgcacgaggctgaggcgggcgcggcgcccctgctgctggatgggagc ggcagcagtctggaggggagcgccctggcagggacccccgaggaggaggagcaggccgtg accaccgcggcccagatgcaccagccgcccttgcccctcggagacccaggtcgggccagc aaggccagcagggccagcagcgggcgggccagaccggaggacttggccatctag >gi568815585r:20122698_20323480|GENSCAN_predicted_peptide_3|85_aa MGPRRAWLGQMAQATQAAERDLNVNRYLYIHVHCSSVHNSQEVEGAQVPISEWTEKQNVV YACNGMLFSLKKEGISDTCYNMDEP >gi568815585r:20122698_20323480|GENSCAN_predicted_CDS_3|258_bp atggggccaaggagggcgtggctgggacagatggcgcaggccacgcaggccgctgagaga gatctcaatgtgaacagatatttgtacatccatgttcattgcagctctgttcacaacagc caagaggtggaaggagcccaagtgcccatcagcgaatggacagagaagcaaaatgtagtc tatgcatgcaatggaatgttattcagccttaaaaaggaaggaatttctgacacatgctac aacatggatgaaccttga >gi568815585r:20122698_20323480|GENSCAN_predicted_peptide_4|226_aa MDWGTLQTILGGVNKHSTSIGKIWLTVLFIFRIMILVVAAKEVWGDEQADFVCNTLQPGC KNVCYDHYFPISHIRLWALQLIFVSTPALLVAMHVAYRRHEKKRKFIKGEIKSEFKDIEE IKTQKVRIEGSLWWTYTSSIFFRVIFEAAFMYVFYVMYDGFSMQRLVKCNAWPCPNTVDC FVSRPTEKTVFTVFMIAVSGICILLNVTELCYLLIRYCSGKSKKPV >gi568815585r:20122698_20323480|GENSCAN_predicted_CDS_4|681_bp atggattggggcacgctgcagacgatcctggggggtgtgaacaaacactccaccagcatt ggaaagatctggctcaccgtcctcttcatttttcgcattatgatcctcgttgtggctgca aaggaggtgtggggagatgagcaggccgactttgtctgcaacaccctgcagccaggctgc aagaacgtgtgctacgatcactacttccccatctcccacatccggctatgggccctgcag ctgatcttcgtgtccacgccagcgctcctagtggccatgcacgtggcctaccggagacat gagaagaagaggaagttcatcaagggggagataaagagtgaatttaaggacatcgaggag atcaaaacccagaaggtccgcatcgaaggctccctgtggtggacctacacaagcagcatc ttcttccgggtcatcttcgaagccgccttcatgtacgtcttctatgtcatgtacgacggc ttctccatgcagcggctggtgaagtgcaacgcctggccttgtcccaacactgtggactgc tttgtgtcccggcccacggagaagactgtcttcacagtgttcatgattgcagtgtctgga atttgcatcctgctgaatgtcactgaattgtgttatttgctaattagatattgttctggg aagtcaaaaaagccagtttaa >gi568815585r:20122698_20323480|GENSCAN_predicted_peptide_5|261_aa MDWGTLHTFIGGVNKHSTSIGKVWITVIFIFRVMILVVAAQEVWGDEQEDFVCNTLQPGC KNVCYDHFFPVSHIRLWALQLIFVSTPALLVAMHVAYYRHETTRKFRRGEKRNDFKDIED IKKQKVRIEGSLWWTYTSSIFFRIIFEAAFMYVFYFLYNGYHLPWVLKCGIDPCPNLVDC FISRPTEKTVFTIFMISASVICMLLNVAELCYLLLKVCFRRSKRAQTQKNHPNHALKESK QNEMNELISDSGQNAITGFPS >gi568815585r:20122698_20323480|GENSCAN_predicted_CDS_5|786_bp atggattgggggacgctgcacactttcatcgggggtgtcaacaaacactccaccagcatc gggaaggtgtggatcacagtcatctttattttccgagtcatgatcctcgtggtggctgcc caggaagtgtggggtgacgagcaagaggacttcgtctgcaacacactgcaaccgggatgc aaaaatgtgtgctatgaccactttttcccggtgtcccacatccggctgtgggccctccag ctgatcttcgtctccaccccagcgctgctggtggccatgcatgtggcctactacaggcac gaaaccactcgcaagttcaggcgaggagagaagaggaatgatttcaaagacatagaggac attaaaaagcagaaggttcggatagaggggtcgctgtggtggacgtacaccagcagcatc tttttccgaatcatctttgaagcagcctttatgtatgtgttttacttcctttacaatggg taccacctgccctgggtgttgaaatgtgggattgacccctgccccaaccttgttgactgc tttatttctaggccaacagagaagaccgtgtttaccatttttatgatttctgcgtctgtg atttgcatgctgcttaacgtggcagagttgtgctacctgctgctgaaagtgtgttttagg agatcaaagagagcacagacgcaaaaaaatcaccccaatcatgccctaaaggagagtaag cagaatgaaatgaatgagctgatttcagatagtggtcaaaatgcaatcacaggtttccca agctaa >gi568815585r:20122698_20323480|GENSCAN_predicted_peptide_6|118_aa MAVNAGDSASFKEDSTPLTAPVLTLVGGQGREVSPVCKTQDARDTGDRSSSPTGSAFWCL HVLTKMCRQLLLITVVHAIVISISQVSRKPITQPIPCGSPGTYASAQVEGLSELKPAS >gi568815585r:20122698_20323480|GENSCAN_predicted_CDS_6|357_bp atggcagtaaacgcaggagactcagcaagttttaaggaggactcaactccactgactgct cctgtcctgactttggtgggtggacagggaagggaggtcagccccgtttgcaaaacacag gatgcccgtgacaccggagacaggtcttcttcaccgacaggaagtgccttctggtgcctg cacgttttaactaagatgtgtcgccaattacttttaattactgtcgtccacgctattgtc atcagcatttcacaagtttctcggaagcccatcacgcagcccataccctgcggttctccg gggacttatgcatcggcccaagttgagggtttgtctgaactgaaacccgcatcctag >gi568815585r:20122698_20323480|GENSCAN_predicted_peptide_7|176_aa MTVTPHVGTGDPGSCVNGSYGEQGFRAAPTPANDVVSLYPSHSPHAESPAPDGVGRPADT CRKAPNKIAMALFSSSPSQITVFGNMPGLRLPYHKTCILETGPGKKVNNNNNNNKKPSQK LMAVKAAFVVNEWDAYQQSRGDFSQGTQRGKGKEAPDSGETQATPPQQGVQAEHQQ >gi568815585r:20122698_20323480|GENSCAN_predicted_CDS_7|531_bp atgacagtcacgccccacgtggggactggggaccctggaagctgcgtcaacggctcctat ggggagcagggcttcagggcagcccccacacctgccaatgacgtcgtctctttgtacccg agccactcacctcacgcagagagtccagcccctgatggggttggcagacctgcagacacc tgcaggaaggcccccaacaaaatcgcaatggctctgttcagctcaagccctagccagata actgtctttggtaatatgcctgggctaagactcccgtaccataagacttgtattctggaa actggtccaggaaaaaaagtaaacaacaacaacaacaacaacaaaaagccatctcagaag ctcatggccgtaaaggctgcatttgtagttaatgaatgggatgcttatcagcagagcagg ggtgacttctcccaaggaacacagcgtggaaaggggaaagaggcccctgacagtggagag acccaagccacaccacctcagcaaggggtccaagctgagcatcagcagtga >gi568815585r:20122698_20323480|GENSCAN_predicted_peptide_8|472_aa MAGCRSRALPGREAAKALREIYRSAGGGPGLQWQVRQLQLHLGEQILTVPSSPKSTGKLG STAAVWPAVAPSRSSMKYAAPVEPPCCSWHNVSAIIPPSEENHPALLDNNSSLDHQSQGK EPWDPPALLHHLKDQPWNQRPVSWEQSTLQWSTFAAVNSTQPDGAMLCGATGASASASAA VPHPSRTPESSWQKQRFKFSTGLPSENKRYSVASPGHHCAFKGSCQRTLRTSQPPSCQHL PEHLRLMLQGTSFPLIFDILPVLFLFLKAYPHPAASRVSSADPHTLRLPKYLLFFCTWPP TLCSPPILSGQLLTPSAMHSPADEMLRKLYVCDRMSGCWSSSSCPDRSKMELVHSEQNKT FINNFMELMSKKSILEPSLPDHGFQIKNCLHDDSRFFPPEGQCTACFPRGLKTQRKIVHR HLHLLKGLSPGGSLPLHLSVNHATCGCHIYICPLHTSIRPVLTLTDSHWHQQ >gi568815585r:20122698_20323480|GENSCAN_predicted_CDS_8|1419_bp atggcgggctgcaggtcccgagccctgcccggcagggaggcagctaaggccctgcgagaa atctaccgcagcgccgggggaggcccaggtctgcagtggcaggttcggcagctgcagctg cacctgggagagcagatcctgactgttcccagctcccccaagagcacagggaagcttgga tccacagctgcagtttggccagctgtagccccatccaggagctccatgaagtatgcagcc ccagtcgagcctccgtgctgcagctggcataatgtcagtgccatcattcccccctctgaa gagaatcacccagcgctcctggataacaatagcagtctggatcaccaaagccagggaaag gaaccctgggacccaccagccctgttgcaccatttgaaagaccagccctggaaccagcga cctgtgtcttgggagcagtccaccctccagtggtccacatttgcagcagtcaacagcacc cagcctgatggagccatgctctgtggggccaccggggccagtgccagtgccagtgctgct gttccccacccaagtagaacacccgagagcagctggcagaagcagcgcttcaaattttcc accgggctcccttctgagaacaaacgctattcagtggcgagccccggacaccactgcgct ttcaaaggcagctgccagaggacactcaggacttcacagccgccgagctgccagcacctt cctgagcacctgcgactcatgctccagggaacctcattcccacttatctttgacattttg cctgtgcttttcctcttcctcaaagcctatccacatcccgcagccagcagggtctcctct gctgatcctcacacgctgagacttcccaagtacctgctgttcttctgcacttggccaccc accttgtgttctccacccatcttgagcggtcagctcctgacaccatctgccatgcacagt cctgctgatgagatgctaaggaaattatatgtgtgtgatagaatgtcaggatgctggtca agctctagttgcccagacaggagcaagatggagctggttcacagtgagcagaacaagact tttataaacaatttcatggagctcatgagtaagaagtccatccttgagccttcactgcca gaccatggattccagatcaagaattgcttgcatgatgactcaagattcttccctccagag ggccaatgcacagcatgtttccccagaggtctgaaaacacagagaaagatcgtccacaga cacctccacctgcttaaaggtctgagtcctgggggctcactgcctctccatctaagtgtc aaccacgcaacttgtggctgtcacatctacatctgtcccctgcacacgtccatcaggcca gtgctgactcttactgactctcactggcaccagcagtga >gi568815585r:20122698_20323480|GENSCAN_predicted_peptide_9|222_aa MSEERLGKGLGSGEARKPRRSAAGGRVRPGQRRAGHQRVCGPTPGDRRPFAASERIKGRC VGEGASARPRGPREAVPGRGGWPPASGERRLVPGCSERGAHGTRGGAAGGAGEKEKSQDR LRKPAPALHPTAFTAIQADSPGARLLPFQRARPRAWGLVVNTLHQSPIQASNITKRGFWE ALRSRRNRTSPRGMSPGPAPVNASRGNTPGRPETWLDPRLLR >gi568815585r:20122698_20323480|GENSCAN_predicted_CDS_9|669_bp atgagtgaggaacggctggggaaaggtctcggcagcggggaagcgcggaagcccaggaga agtgcagcgggaggccgagtgaggccgggtcagcggcgggcaggccaccagagggtctgc ggtcctacccccggagacagaaggccctttgcagcgtctgagcggataaagggacgatgt gtgggtgagggcgccagtgcgaggccaagaggcccgcgggaggctgttccggggagaggc gggtggcccccagcgagcggcgagcggcgcctggttccgggctgttctgagcgcggagcc catgggacccgcggaggggctgctggtggagcaggagagaaagagaaaagccaagaccgg ctgagaaaaccagcccctgccctgcaccccaccgcattcacggcgattcaggcagattcg ccgggagcgcgcctgctgccgttccagagggcaagacccagggcgtggggcctggtggta aacaccctccaccagagcccgatccaggcctccaacatcaccaagcgcggtttctgggaa gcgctgcgcagccggcgcaatcggaccagcccccgcgggatgagccctgggcctgctccc gtgaatgcttcccggggaaacacaccaggccgccctgaaacctggctggacccccgcctc ctccgctaa