GENSCAN 1.0 Date run: 6-Nov-116 Time: 22:28:01 Sequence gi568815597r:221602415_221839744 : 237330 bp : 39.37% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2282 2287 6 1 0 96 93 0 0.322 2.21 1.02 Term + 18804 18992 189 2 0 13 39 187 0.354 2.77 1.03 PlyA + 19565 19570 6 1.05 2.00 Prom + 20164 20203 40 -5.25 2.01 Sngl + 20453 20776 324 1 0 52 45 194 0.815 7.25 2.02 PlyA + 20962 20967 6 1.05 3.00 Prom + 25601 25640 40 -6.85 3.01 Sngl + 33781 34020 240 0 0 65 43 223 0.388 10.33 3.02 PlyA + 35596 35601 6 1.05 4.00 Prom + 36336 36375 40 -5.75 4.01 Init + 61405 61682 278 0 2 48 31 179 0.886 4.70 4.02 Intr + 65354 65502 149 2 2 73 103 44 0.099 3.36 4.03 Intr + 69069 69159 91 1 1 55 70 129 0.020 5.93 4.04 Intr + 94014 94401 388 0 1 73 106 137 0.029 7.87 4.05 Term + 98680 98790 111 0 0 39 49 96 0.211 -1.62 4.06 PlyA + 99032 99037 6 1.05 5.10 PlyA - 99751 99746 6 1.05 5.09 Term - 100263 99998 266 1 2 118 48 342 0.958 27.89 5.08 Intr - 104052 103681 372 1 0 129 21 412 0.919 32.71 5.07 Intr - 125882 125733 150 0 0 59 82 59 0.016 1.61 5.06 Intr - 137218 136520 699 2 0 63 90 526 0.067 40.82 5.05 Intr - 138928 138760 169 1 1 62 98 52 0.072 2.20 5.04 Intr - 139153 139033 121 0 1 74 20 81 0.065 -0.52 5.03 Intr - 141144 140834 311 0 2 101 63 132 0.055 6.19 5.02 Intr - 141424 141329 96 1 0 57 110 31 0.685 1.49 5.01 Init - 141687 141550 138 0 0 77 50 96 0.831 4.89 5.00 Prom - 146635 146596 40 -5.35 6.03 PlyA - 146706 146701 6 1.05 6.02 Term - 147525 147306 220 2 1 17 47 218 0.253 6.13 6.01 Init - 157663 157545 119 1 2 79 76 123 0.830 9.93 6.00 Prom - 157930 157891 40 -5.15 7.12 PlyA - 158609 158604 6 1.05 7.11 Term - 172249 171940 310 0 1 74 40 231 0.289 10.45 7.10 Intr - 177940 177845 96 0 0 39 100 106 0.256 5.11 7.09 Intr - 179005 178873 133 2 1 91 -16 113 0.183 -0.02 7.08 Intr - 179621 179570 52 2 1 76 97 42 0.255 1.46 7.07 Intr - 179939 179795 145 1 1 86 32 144 0.266 7.96 7.06 Intr - 180143 180049 95 2 2 56 25 45 0.447 -7.16 7.05 Intr - 180317 180238 80 0 2 137 93 1 0.636 3.85 7.04 Intr - 182958 182813 146 2 2 71 76 79 0.075 4.01 7.03 Intr - 221009 220878 132 1 0 55 32 136 0.004 3.54 7.02 Intr - 223637 223531 107 2 2 86 94 49 0.247 3.39 7.01 Init - 233736 233644 93 0 0 94 73 33 0.079 1.38 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 54643 54533 111 1 0 96 46 67 0.815 0.88 S.002 Intr + 137077 137237 161 0 2 72 88 120 0.825 8.26 S.003 Term - 139810 139689 122 2 2 88 43 149 0.873 8.06 S.004 Sngl + 213841 214098 258 0 0 42 32 244 0.909 9.08 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:221602415_221839744|GENSCAN_predicted_peptide_1|64_aa MTALKNPVPEWGGECYKITLSAKGEAASSLDASSAFESAQYGVMITVKGLVEETDFAESL LDGG >gi568815597r:221602415_221839744|GENSCAN_predicted_CDS_1|195_bp atgactgcgctgaaaaaccctgtgcctgagtggggaggagagtgttacaaaataacattg tcagccaagggtgaagctgcatcaagcttggatgccagctctgcatttgagagtgcgcag tatggagtaatgataactgttaaaggtttggtggaagagacagactttgcagagagcttg ttggatgggggttaa >gi568815597r:221602415_221839744|GENSCAN_predicted_peptide_2|107_aa METLGGSRKRGLIIELLKEKMGGNLKSISRRSLCPEFLSVLEVVQSMEIVVGRRVKGEVM GQRDEETILMLILFLSGGLQTGGISLSIEIQDLKNILSNSYTKALRF >gi568815597r:221602415_221839744|GENSCAN_predicted_CDS_2|324_bp atggagacactgggtggcagcagaaaaagaggtttaatcatagagctgttgaaagagaag atgggaggaaacctcaaatccatctctcgaaggagtttgtgtccagagtttttaagtgtt ttggaagtggtccaaagtatggagatagttgttggtcgaagagtgaagggtgaagtcatg ggacaaagagatgaagaaactattctcatgctgattctgttcctctctgggggtcttcaa actggtggcatcagcctttccattgaaattcaggatctgaaaaacatcttaagcaattct tacacaaaagccttaagattctaa >gi568815597r:221602415_221839744|GENSCAN_predicted_peptide_3|79_aa MTFKKRTEGSRRTNHAEMQDEEDGSINIIIIIIIIIINMQCKAPKGLGPQGWCTPFESSD RLSSHFLAFYLYQSILTLL >gi568815597r:221602415_221839744|GENSCAN_predicted_CDS_3|240_bp atgacattcaagaagcgaactgaaggaagcaggagaacaaaccatgcagagatgcaggat gaagaggatggctccatcaacatcatcatcatcatcatcatcatcatcatcaacatgcag tgcaaagcccccaaggggctggggcctcagggttggtgtactcccttcgaaagctcagac aggctttcttctcatttcttagcattctatctgtatcagtccattctcacactactataa >gi568815597r:221602415_221839744|GENSCAN_predicted_peptide_4|338_aa MIPQQLREDPTLNDWCHGVIIRKPCDDTGVHRGSPCEDKSGDWSDTATNQETPKELWQSP EARSKHEIDHSSLPSEVTNPVNTLTSDVWTLEFCLQQKSKSNISYSIVAISGRFWKQCFL NCFMSPHCQTYKLLIALLEGLNEQAWPKTTQSLDVLPVELGREGTEAGILLRRFHRSRNQ GGKMERAPLTVTHSDPQQNLFFLFSTLDSAGLEVLVPKGEMLLLGDTAMIPLRWKLILPA ADFGLLMLKNQQAKSDIGVLAGMTGEIGLLPDYRSKKKCAWNSRSYLILSSLVITFVAVT NKGFGHQGAKADGIPLMQAFPDAKGMIFAGEDDSIVRI >gi568815597r:221602415_221839744|GENSCAN_predicted_CDS_4|1017_bp atgataccacaacagcttagggaggaccctacgttgaatgactggtgtcatggtgtcatt ataaggaagccatgtgatgacacaggagtacacagaggaagtccatgtgaagacaaaagt ggagattggagtgatacagccacaaaccaggaaacaccaaaagaattatggcaatcacca gaagcaaggagcaagcatgaaatagatcattcctcattgccttcagaggtgacaaaccct gtcaacaccttgacttcagatgtctggactctagaattttgtttacagcaaaagagcaaa tctaatatcagttattccatagtggctataagtggaagattttggaaacagtgcttcttg aattgtttcatgtctccgcactgtcagacatacaaattactgatagccttgttggaagga ttgaatgagcaagcttggcccaagactactcagtctctggatgtgttgcccgtggagctt ggacgagaaggcacagaagcaggaatcttgctcagaagatttcatagatccaggaatcaa gggggaaaaatggaaagggctcctcttactgttacccatagtgatccacagcaaaattta ttctttctattctcaactttagactctgctggtctagaagttctggtccccaagggagag atgcttctactaggagatacagcaatgattccattaagatggaagttgatactccctgct gccgactttgggctccttatgctaaagaatcaacaagcaaagtcagatattggtgtattg gctgggatgactggggaaattggattactaccagattataggagtaagaaaaagtgtgcc tggaattcaagatcatatcttatactctcaagtcttgtaattacttttgttgctgtaacg aacaaaggatttggtcaccaaggcgcaaaggcagatgggatccccttaatgcaagccttc cccgatgcaaaaggaatgatatttgcaggagaggatgacagcattgtgagaatataa >gi568815597r:221602415_221839744|GENSCAN_predicted_peptide_5|773_aa MVTHIWLRINLFKYFTEFDYFRQQTLHKILSYVRLSDVYVMPVVTLICPENNYMNMPLLL SILLGLESEVDLQTPSNQGNFPAVSLMRIFSGLGKGRLAPPPTLLPDQELRPATWRRPKS GERGEAASPPPRSSGPIQSSYLHSALALFRQLRLFQKRRAAGPGLGRGRAGPRGGGRGMW GGSCLVVEGRGDSSVFLWEIFWARLDGFTGSMYARPSAAGMQKKWYVDPGRVLPGNTIAK TFTAWSNRETVIASGIELALQKKYYAREGGLTPSLPSKGSNSHPPVIATTVVSLKAANLT YMPSSSGSARSLNCGCSSASCCTVATYDKDNQAQTQAIAAGTTTTAIGTSTTCPANQMVN NNENTGSLSPSSGVGSPVSGTPKQLASIKIIYPNDLAKKMTKCSKSHLPSQGPVIIDCRP FMEYNKSHIQGAVHINCADKISRRRLQQGKITVLDLISCREGKDSFKRIFSKEIIVYDEN TNEPSRVMPSQPLHIVLESLKREGKEPLVLKDINYKISSQSSFYAAKTEYLRLSNLQRTI GYLAYGSGSCEVQEQGANIWQGGLSSFKQNHENLCDNSLQLQECREVGGGASAASSLLPQ PIPTTPDIENAELTPILPFLFLGNEQDAQDLDTMQRLNIGYVINVTTHLPLYHYEKGLFN YKRLPATDSNKQNLRQYFEEAFEFIEEAHQCGKGLLIHCQAGVSRSATIVIAYLMKHTRM TMTDAYKFVKGKRPIISPNLNFMGQLLEFEEDLNNGVTPRILTPKLMGVETVV >gi568815597r:221602415_221839744|GENSCAN_predicted_CDS_5|2322_bp atggtcactcatatttggctcagaataaatctcttcaaatattttacagagtttgactat tttcgacaacagacgttacacaaaatcctctcctatgtaagactatcggatgtgtacgtg atgcccgtcgtaacactgatctgccctgaaaacaattacatgaatatgcccctgctactc agtatactactaggcttggagtctgaagtagacctccaaaccccatccaatcagggcaac tttcctgcagtttccctaatgagaatattttccggactaggcaaggggcggcttgccccg ccccctactcttctccctgatcaggagctgcggccagctacctggcgccggccaaagtca ggggagcggggagaggcggcgtcacctcctccacgctcctccggccccattcaatctagc tatttgcactcggctctggctcttttccgccagctgcggctgttccagaagcgccgggct gccggtcctggcctcggcaggggccgtgcgggaccgaggggtggcggccgaggcatgtgg ggcggctcttgcctagtggtggaagggagaggggatagctcagtatttctctgggagatt ttctgggccagactggatggtttcacgggcagtatgtacgcccgaccctctgctgcaggg atgcagaagaaatggtatgttgatcctggacgtgtcctaccaggtaacacaatagcaaag actttcaccgcttggtccaaccgggaaacagttatagcttctggaatagaactcgctctt cagaaaaaatactacgcaagggaaggggggttgactccttcgctcccatccaaaggcagt aacagccaccctcctgtcatcgccaccaccgttgtgtccctcaaggctgcgaatctgacg tatatgccctcatccagcggctctgcccgctcgctgaattgtggatgcagcagtgccagc tgctgcactgtggcaacctacgacaaggacaatcaggcccaaacccaagccattgccgct ggcaccaccaccactgccatcggaacctctaccacctgccctgctaaccagatggtcaac aataatgagaatacaggctctctaagtccatcaagtggggtgggcagccctgtgtcaggg acccccaagcagctagccagcatcaaaataatctaccccaatgacttggcaaagaagatg accaaatgcagcaagagtcacctgccgagtcagggccctgtcatcattgactgcaggccc ttcatggagtacaacaagagtcacatccaaggagctgtccacattaactgtgccgataag atcagccggcggagactgcagcagggcaagatcactgtcctagacttgatttcctgtagg gaaggcaaggactctttcaagaggatcttttccaaagaaattatagtttatgatgagaat accaatgaaccaagccgagtgatgccctcccagccacttcacatagtcctcgagtccctg aagagagaaggcaaagaacctctggtgttgaaagatatcaactataagatttcgtcacaa agttcattttatgctgctaaaacagaatacctgagattgagtaatttacaaagaaccata ggttatttggcttatggttctggaagctgtgaagtgcaagagcagggtgccaacatctgg caaggtggacttagtagttttaagcagaaccatgaaaacctctgtgacaactccctccag ctccaagagtgccgggaggtggggggcggcgcatccgcggcctcgagcttgctacctcag cccatccccaccacccctgacatcgagaacgctgagctcacccccatcttgcccttcctg ttccttggcaatgagcaggatgctcaggacctggacaccatgcagcggctgaacatcggc tacgtcatcaacgtcaccactcatcttcccctctaccactatgagaaaggcctgttcaac tacaagcggctgccagccactgacagcaacaagcagaacctgcggcagtactttgaagag gcttttgagttcattgaggaagctcaccagtgtgggaaggggcttctcatccactgccag gctggggtgtcccgctccgccaccatcgtcatcgcttacttgatgaagcacactcggatg accatgactgatgcttataaatttgtcaaaggcaaacgaccaattatctccccaaacctt aacttcatggggcagttgctagagttcgaggaagacctaaacaacggtgtgacaccgaga atccttacaccaaagctgatgggcgtggagacggttgtgtga >gi568815597r:221602415_221839744|GENSCAN_predicted_peptide_6|112_aa MESHGRKDLMHRAGGLLVHTPQEGDGKVQAVLGGSDHLCGKLFIGISHMLEKINTGQSVC QGDKHQEMHAFENYKKEDVEREKEDTFILLKLSEQEEEFFLNIEDNHNCKEH >gi568815597r:221602415_221839744|GENSCAN_predicted_CDS_6|339_bp atggagagccatggtaggaaggatctgatgcatcgggctggaggcctgcttgttcatact ccgcaagaaggagatggcaaagtccaggctgtcctaggtggcagcgaccacctgtgcgga aagctatttattggcatcagtcacatgctagagaaaataaacactggacaatccgtctgt cagggagataaacaccaggaaatgcatgcctttgagaattacaagaaggaagatgttgaa cgagaaaaggaagataccttcatcctcctgaaactcagtgaacaagaagaggaatttttt ctaaacattgaagacaatcataattgcaaagagcattga >gi568815597r:221602415_221839744|GENSCAN_predicted_peptide_7|462_aa MPHALGGCLLCSKTTGSLASGFWFAEDWQSEEWEHADGQVQEPGQALLGSGSMVASRGGC LQLPKPSGFSRPVVLTQSVVPGPAASASPESLLEMQILKPTQDLLNQKLSGWLANYHKKV DMWLSVQKSTTAMIHYHYAVTCTVNTARKSIPVIEKENAGIMTLFFKEEWLVWPLRGFLE CSSTFLPAESRSVTETLGASVSLLVNEDISHSSIINLLWSENTDARSCRQDMFLSKTPMN QCCGQGMFDTWTVLQDLNKYWPPPEKGEKGKPVLEKRLEVGQVEGPAASGDNPSATVQPG GTGQFAVGQRPDDIMQFHFSFPKVNWVKGCTLSRQLHATVVTQVTPSECESKLLKGMASS CKAEGKAEESWTLSFVQIKLLFTQDLLKEYKEMAEIALSSHSSSGKRPFIVLRLVPAVTG PIGFEHKTTEFGTSEEFWIWYKKIQVQIPVVLKFRYTMIIAF >gi568815597r:221602415_221839744|GENSCAN_predicted_CDS_7|1389_bp atgccccatgccctagggggctgcctcctgtgtagcaaaactacaggctctcttgcctct ggcttctggtttgcagaagattggcaaagtgaggagtgggagcatgcagacgggcaggtg caagagccaggacaagcacttttgggctcaggctccatggtagcatctaggggtgggtgc ctgcaactcccaaagcccagtggtttctctcgtcccgtggttctcactcaaagcgtggtc cctggaccagcagcatcagcatcacctgaaagtttgctagaaatgcaaattctcaagcca acccaagacctactgaatcagaaactctcgggatggcttgccaactatcacaaaaaggta gatatgtggctaagtgttcagaaatcaacaacagccatgatccactatcactatgctgtc acatgcaccgtaaatacagcaagaaaatctatcccagttattgaaaaggagaatgcaggg ataatgacactgtttttcaaagaggagtggcttgtgtggcctctcagaggtttcctggaa tgtagctccacatttctgccagctgaatctaggtcagttactgaaactcttggagcctca gtttccttgcttgtaaatgaggatatcagtcattcttccatcatcaatttgttgtggtca gagaacacagatgcccgttcttgtagacaagatatgtttctttcaaaaacgccaatgaat cagtgttgtggccaaggtatgtttgacacctggacagtgctgcaggacctcaacaaatac tggcccccaccagagaagggggagaaaggaaaaccggttctggagaagaggctagaggtg gggcaggttgagggtccagctgcatcaggtgacaatccctctgcaactgtgcaacctgga gggacagggcagtttgcagttgggcaacggccagatgatatcatgcagtttcacttcagt ttccccaaagtgaactgggtcaaaggttgtacattgtcacgtcagctgcatgctactgtg gtcacacaggtcaccccgagtgaatgtgagagcaaactgctcaagggtatggcttcaagc tgcaaagctgaaggcaaagctgaagaatcctggacactctcctttgttcaaataaaactt ctatttacacaggatttgctaaaggagtacaaggaaatggcagagatagcattaagctca cacagcagttccggaaagcgccccttcattgttctcaggctcgtacctgcagtaacaggg cctataggctttgaacacaagacaactgagttcggaacctcagaagaattctggatatgg tacaaaaaaattcaggttcaaattcctgttgtgctcaagtttagatatacgatgatcatt gctttttaa