GENSCAN 1.0    Date run: 4-Nov-116    Time: 21:30:03

Sequence gi568815592f:125649777_125859799 : 210023 bp : 38.92% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   3352   3369   18  0  0   72   97    39 0.042   2.08
 1.02 Intr +  24636  24699   64  2  1   39  110    79 0.019   2.57
 1.03 Intr +  32495  32704  210  0  0   62  100   113 0.684   7.86
 1.04 Term +  33866  33927   62  0  2   62   54    64 0.534  -2.61
 1.05 PlyA +  34549  34554    6                               1.05

 2.02 PlyA -  36258  36253    6                               1.05
 2.01 Sngl -  39448  39263  186  1  0   77   48   210 0.876  10.73
 2.00 Prom -  44251  44212   40                              -5.25

 3.04 PlyA -  48013  48008    6                               1.05
 3.03 Term -  55556  55386  171  2  0   63   42   154 0.572   5.14
 3.02 Intr -  57591  57390  202  2  1    4  115    69 0.755  -0.43
 3.01 Init -  61360  61146  215  1  2   71   64   203 0.802  14.46
 3.00 Prom -  72221  72182   40                              -3.05

 4.00 Prom +  77431  77470   40                              -4.45
 4.01 Init +  82727  82939  213  1  0   39   47   158 0.915   5.59
 4.02 Intr +  85970  86122  153  1  0   79   57    50 0.393   0.35
 4.03 Term +  87658  87861  204  0  0   -9   39   194 0.608   1.19
 4.04 PlyA +  88056  88061    6                               1.05

 5.00 Prom +  92655  92694   40                              -4.65
 5.01 Init + 100001 100083   83  1  2   87   72   121 0.980  10.89
 5.02 Intr + 102025 102103   79  1  1  106   63    28 0.963   0.73
 5.03 Intr + 102231 102314   84  2  0   76   82   116 0.979   8.90
 5.04 Intr + 104689 104770   82  0  1    7  115    55 0.861  -1.61
 5.05 Term + 109341 110026  686  1  2   57   32   526 0.969  36.70
 5.06 PlyA + 110140 110145    6                               1.05

 6.02 PlyA - 110930 110925    6                               1.05
 6.01 Sngl - 120500 120243  258  2  0   97   36   210 0.672  11.58
 6.00 Prom - 123922 123883   40                              -6.05

 7.00 Prom + 124435 124474   40                              -4.35
 7.01 Init + 126278 126358   81  1  0   46   98    54 0.488   3.22
 7.02 Term + 127512 127607   96  2  0   78   42   114 0.867   2.79
 7.03 PlyA + 129896 129901    6                               1.05

 8.08 PlyA - 130088 130083    6                               1.05
 8.07 Term - 131671 131540  132  1  0   66   53    90 0.574   0.41
 8.06 Intr - 132134 131991  144  2  0   29   64    92 0.334   0.46
 8.05 Intr - 140462 140228  235  1  1  107   50   148 0.630   9.67
 8.04 Intr - 156888 156754  135  2  0   93   64    42 0.011   1.06
 8.03 Intr - 169108 169034   75  0  0  108  108    71 0.329   8.81
 8.02 Intr - 171082 170955  128  1  2   83   38    93 0.301   2.36
 8.01 Init - 183143 183063   81  2  0   80   34    90 0.017   3.82
 8.00 Prom - 190151 190112   40                              -8.05

 9.00 Prom + 193412 193451   40                              -6.25
 9.01 Init + 193478 193581  104  1  2   74  105    97 0.969   9.76
 9.02 Term + 197882 198359  478  2  1    4   42   205 0.195   0.83
 9.03 PlyA + 199020 199025    6                               1.05

10.03 PlyA - 199061 199056    6                               1.05
10.02 Term - 203179 202875  305  2  2   60   43   159 0.012   2.95
10.01 Intr - 209181 209072  110  2  2  101   71   105 0.033   9.11

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  16873  16798   76  1  1   66   95    86 0.905   8.40
S.002 Sngl - 203252 202875  378  2  0   77   43   196 0.971   9.91
S.003 Term - 209181 209034  148  2  1  101   54   152 0.955   9.49

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:125649777_125859799|GENSCAN_predicted_peptide_1|117_aa
MSRRAPSHRGMFAGVRQRMEHMGHADPDRYIAHETIRLAGPNLVYQPNVSQDLLKILPDY
FPRWTSYKLSEELKPDCEYTILFSHLLREPSLIFFLKGSYKWSPEGGQSQVSHSGKN
>gi568815592f:125649777_125859799|GENSCAN_predicted_CDS_1|354_bp
atgagccgccgtgccccgagccatcgcggaatgttcgctggagtgaggcagcgtatggag
cacatgggacatgcagacccagataggtacatagcccatgaaacaatccggcttgctggc
ccaaatctggtataccagcccaatgtttcccaagatttgctcaagatcttgcctgattac
ttccccagatggacatcttacaaactttcagaggagctgaagccagattgtgagtataca
attctgttcagtcatcttctccgagaaccctctctcatcttttttctcaaaggttcttac
aaatggtcaccagaaggaggacaatcacaagtaagtcactcaggcaaaaactga
>gi568815592f:125649777_125859799|GENSCAN_predicted_peptide_2|61_aa
MASSAVGPRRNSKTLPKANLAPKKVIVIVWWSAANLIHYSFLNPGKTIISENYAQEVYEM
H
>gi568815592f:125649777_125859799|GENSCAN_predicted_CDS_2|186_bp
atggccagctcagcagttggaccgagaagaaactccaaaacacttcccaaagccaacctt
gcaccaaaaaaggttatcgtcattgtttggtggtctgctgccaatcttatccactacagc
tttctgaatcccggcaaaaccattatatctgaaaattatgctcaggaagtctatgaaatg
cactga
>gi568815592f:125649777_125859799|GENSCAN_predicted_peptide_3|195_aa
MVNPKFFASEKITQEHNDKDFFRHTKLKEVTNSRSTLHLTPAHLRVSLKAHTVYYLATTV
DYSGPKDSSVSSHVAHTKPVWWYLHTDAHEIWCRDLDRGTSLGRSIPCPPVLCSVRKIHL
RPQVLRPTSPRNISPILNQELATCARNLATGPRNARSPGFLLSRIPSVWDPTENRTVQLT
WQPLPEPLELWPKAL
>gi568815592f:125649777_125859799|GENSCAN_predicted_CDS_3|588_bp
atggtcaaccccaaattctttgccagtgaaaagattactcaagaacacaatgataaagac
tttttcagacatacaaagttaaaagaagttactaacagcagatctacactacacttgacg
ccagcacatcttagagtctcactcaaggcccatactgtgtactacctggctaccactgtt
gattattcaggacccaaggactcttcagtcagcagccatgttgctcacacaaagcctgtt
tggtggtatcttcacacggacgcgcatgaaatttggtgccgtgacttggatcgggggacc
tcccttgggagatcaatcccctgtcctcctgttctttgctccgtgagaaagatccaccta
cgacctcaggtcctcagacccaccagcccaaggaacatctcaccaattttaaatcaggag
cttgctacatgtgccagaaatctggccactgggccaaggaatgcccgtagcccgggattc
ctcctaagccgcatcccatctgtgtgggaccccactgaaaatcggacggttcaactcacc
tggcagccacttccagagcccctggaactctggcccaaggctctctga
>gi568815592f:125649777_125859799|GENSCAN_predicted_peptide_4|189_aa
MTVKAPKGHKGDITSILLVQTLAQSCHAVRRPKLVSSERASGEALKLHNYRVLSCTSPLL
FQLQPLLDYNHMILSNLAPDVRVPLSMQYADLIIKINTFSIQAAHITHKFLFNKERHAFH
TRGQFGQIVSSQYLYEINCTEGMPIFTRRTKVEVNNFEAWGSFRGGEVRGSGTRLGLGQD
KNTQYEKPE
>gi568815592f:125649777_125859799|GENSCAN_predicted_CDS_4|570_bp
atgactgtcaaggctcctaaaggtcataaaggtgacataacttctatactgttagttcaa
acacttgctcagagctgccatgctgtgaggaggcccaagctagtcagctcagagagagca
tctggagaggctctgaagctacacaactatagagtcctcagctgcacaagccccctgctg
ttccagctccaaccactgctagactacaaccatatgatactgagtaacttagccccagac
gtcagggtgccactgagtatgcagtatgctgacttaatcataaaaattaacacctttagt
attcaagcagctcatatcactcacaaatttctctttaacaaagaaaggcatgcatttcat
acacggggacaattcggtcagattgtttcttcccaatacctctatgagatcaattgcact
gaaggaatgcctatttttactagaagaacgaaggtggaagtcaataattttgaagcatgg
ggtagcttcagaggaggagaggttcggggatcgggtacaagacttggcttgggccaggat
aaaaatactcagtatgaaaaacctgagtag
>gi568815592f:125649777_125859799|GENSCAN_predicted_peptide_5|337_aa
MKRPCEETTSESDMDETIDVGSENNYSGQSTSSVIRLNSPTTTSQIMARKKRRGIIEKRR
RDRINNSLSELRRLVPTAFEKQGSAKLEKAEILQMTVDHLKMLQATGGKGYFDAHALAMD
FMSIGFRECLTEVARYLSSVEGLDSSDPLRVRLVSHLSTCATQREAAAMTSSMAHHHHPL
HPHHWAAAFHHLPAALLQPNGLHASESTPCRLSTTSEVPPAHGSALLTATFAHADSALRM
PSTGSVAPCVPPLSTSLLSLSATVHAAAAAATAAAHSFPLSFAGAFPMLPPNAAAAVAAA
TAISPPLSVSATSSPQQTSSGTNNKPYRPWGTEVGAF
>gi568815592f:125649777_125859799|GENSCAN_predicted_CDS_5|1014_bp
atgaagcgcccctgcgaggagacgacctccgagagcgacatggacgagaccatcgacgtg
gggagcgagaacaattactcggggcaaagtactagctctgtgattagattgaattctcca
acaacaacatctcagattatggcaagaaagaaaaggagagggattatagagaaaaggcgt
cgggatcggataaataacagtttatctgagttgagaagacttgtgccaactgcttttgaa
aaacaaggatctgcaaagttagaaaaagctgaaatattgcaaatgacagtggatcatttg
aagatgcttcaggcaacagggggtaaaggctactttgacgcacacgctcttgccatggac
ttcatgagcataggattccgagagtgcctaacagaagttgcgcggtacctgagctccgtg
gaaggcctggactcctcggatccgctgcgggtgcggcttgtgtctcatctcagcacttgc
gccacccagcgggaggcggcggccatgacatcctccatggcccaccaccatcatccgctc
cacccgcatcactgggccgccgccttccaccacctgcccgcagccctgctccagcccaac
ggcctccatgcctcagagtcaaccccttgtcgcctctccacaacttcagaagtgcctcct
gcccacggctctgctctcctcacggccacgtttgcccatgcggattcagccctccgaatg
ccatccacgggcagcgtcgccccctgcgtgccacctctctccacctctctcttgtccctc
tctgccaccgtccacgccgcagccgcagcagccaccgcggctgcacacagcttccctctg
tccttcgcgggggcattccccatgcttcccccaaacgcagcagcagcagtggccgcggcc
acagccatcagcccgcccttgtcagtatcagccacgtccagtcctcagcagaccagcagt
ggaacaaacaataaaccttaccgaccctgggggacagaagttggagctttttaa
>gi568815592f:125649777_125859799|GENSCAN_predicted_peptide_6|85_aa
MAAVQSSFGSAFEGDRGKRGRGRPWGEGDHGERERERAATCSHKWELNDENTWTHGVEQH
TLAFSEVQKFQVELSEVVLKAGRKW
>gi568815592f:125649777_125859799|GENSCAN_predicted_CDS_6|258_bp
atggcagcagtacagtccagcttcggctcggcatttgagggagaccgtggaaagagaggg
agagggagaccgtggggagagggagaccatggggagagggagagggagagggctgccaca
tgttctcataagtgggagctgaatgatgagaacacatggacacatggggttgaacaacac
acactcgctttttctgaagtacaaaagttccaggttgaactttctgaagttgttctgaaa
gcaggtcgaaaatggtaa
>gi568815592f:125649777_125859799|GENSCAN_predicted_peptide_7|58_aa
MWQGHGIIVERRSADKHVNKALCIIDKDPEAQRAGSLNPDNAADIGTVRVNSQSLALE
>gi568815592f:125649777_125859799|GENSCAN_predicted_CDS_7|177_bp
atgtggcagggtcatgggataatagtggagagaaggtcagcagataaacacgtgaacaaa
gctctctgcatcatagacaaggatccagaggctcagagagctgggtcacttaacccagat
aatgcagctgatattggcacagtcagggttaattcacagtctctggctctggaataa
>gi568815592f:125649777_125859799|GENSCAN_predicted_peptide_8|309_aa
MPCCDTTVWRCPTDLKSGAAEAAENNKNLTNCAYSKVSLGNDMVPIWIVVKCSFFPDLNN
RSSSWAESGRDLAWTALKYAAPENSALHCQKPAFKAFGTSACPQLNASSALPTTTPTKMP
QERVPQPMRITIKRKIKPHGSGPMATINSSCVPSGVAFNRGAHHLLPPPVPGTPSWMLPN
CSHYSNKNIAVGPSSRLLAETRENLGGGRFASAAGPVPLQLSVALTCFETKKLEENEPDM
CCMCIDYFWEIQKKLVTLVTSGKKKGIKYINDLNSISELLKKKKRDSSTGSPNFVVMDDD
CKSQWRRML
>gi568815592f:125649777_125859799|GENSCAN_predicted_CDS_8|930_bp
atgccatgctgtgacacaacagtgtggcgatgtcccacagacctgaaatctggagcagcc
gaggctgcagaaaataacaagaacctaactaactgtgcttatagcaaagtatctttggga
aatgatatggttccaatttggattgtcgttaagtgttccttcttccctgatctgaacaac
aggagcagctcatgggctgaaagtgggagagatcttgcttggactgcccttaagtacgct
gctccagagaactctgctctacattgtcagaaaccagctttcaaggcatttggcacctct
gcttgcccccaactaaatgccagtagcgcccttcccacaaccacgccaaccaaaatgccc
caggaaagggtaccacaaccaatgagaatcactattaagaggaaaatcaagccccacggc
tcaggaccgatggccacaataaattcatcctgtgtcccttctggcgttgcgtttaacaga
ggggcccaccatctccttccccctcccgttcccggtacaccatcctggatgctgccaaat
tgcagtcattacagcaataagaacatagcagttgggccctccagccgactcttggcagag
actcgagagaaccttggagggggaaggtttgcatcagctgcggggcctgtgcctttgcaa
ctttctgttgctttgacatgtttcgaaaccaaaaagttggaggaaaatgaaccagatatg
tgttgtatgtgcatagattatttctgggagatacagaagaagctggtaacactggttact
tctgggaagaagaaaggaataaagtatatcaatgatttaaactccatctctgagctgctg
aaaaaaaagaaacgagattcaagcacaggctcacctaattttgtggtaatggatgatgac
tgcaagtcacagtggcggcggatgttgtga
>gi568815592f:125649777_125859799|GENSCAN_predicted_peptide_9|193_aa
MEEGATSNIALEAGKSKKRTSNGEDGPVDALISAWWIKDLNVRPKTIKTLEENLGNAIQD
RGMGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETIIRVNRKPTEWEKNFAIYPSDK
GLISRIYKELKQIYKKKTTPSKSGQRNEQTLLKRRHLCSQQTHEKMLIITGHQRNANQNH
NEIPSYASYNGDH
>gi568815592f:125649777_125859799|GENSCAN_predicted_CDS_9|582_bp
atggaggaaggggctaccagtaatatagctctagaagctggaaaaagcaagaaaagaacc
tctaatggggaggatggccctgtggacgccttgatttcagcctggtggattaaagactta
aatgtcagacctaaaaccataaaaaccctagaagaaaacctaggcaatgccattcaggac
agaggcatgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaa
attgacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactatcatc
agagtgaacaggaaacctacagaatgggagaaaaattttgcaatctacccatctgacaaa
gggctaatatccagaatctacaaagaactcaaacaaatttacaagaaaaaaacaacccca
tcaaaaagtgggcaaaggaatgaacagacacttctcaaaagaagacatttatgcagccaa
cagacacatgaaaaaatgctcatcatcactggtcatcagagaaatgcaaatcaaaatcac
aatgagataccatcttacgccagttataatggcgatcattaa
>gi568815592f:125649777_125859799|GENSCAN_predicted_peptide_10|138_aa
XLEPPKAAVNWDALDLRSDAPSSGREHRLGMGLRNTPELEKTILKLIWNQKRAQIAKEIL
SKKNKSEGITLPDITLPNFKLYYKAIVTKTAWYLYKCKCIDIAMEQNKEPRNKAKYSQPT
DLRQSIQKHKLGKGNSIQ
>gi568815592f:125649777_125859799|GENSCAN_predicted_CDS_10|417_bp
nnactagagccaccaaaagcagcagtgaactgggatgctcttgacttgaggtctgatgct
ccatctagcggtagagaacatcgccttgggatgggactgagaaacacgccagaattagaa
aaaacaatcttaaaactcatatggaaccaaaaaagagcccaaatagccaaagaaatccta
agcaaaaagaataaatctgaaggcatcacattacccgacatcacattacccaacttcaaa
ttatactacaaagctatagttaccaaaacagcatggtacctgtataaatgtaaatgtata
gacattgcaatggaacagaataaagaacccagaaataaagccaaatactcacaaccaact
gatcttcgacagagcatacaaaaacataaactggggaaaggaaactctattcaataa