GENSCAN 1.0 Date run: 6-Nov-116 Time: 20:13:46 Sequence gi568815590f:37930574_38157264 : 226691 bp : 46.80% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.11 PlyA - 1026 1021 6 1.05 1.10 Term - 3913 3720 194 2 2 26 34 194 0.999 5.38 1.09 Intr - 4642 4500 143 0 2 93 65 136 0.989 11.80 1.08 Intr - 5296 5131 166 2 1 62 91 104 0.925 7.12 1.07 Intr - 6297 6147 151 0 1 126 56 46 0.916 4.94 1.06 Intr - 6813 6704 110 1 2 60 65 105 0.846 5.40 1.05 Intr - 7176 7065 112 0 1 83 96 53 0.881 5.55 1.04 Intr - 8261 8127 135 2 0 57 76 110 0.132 7.46 1.03 Intr - 18215 18113 103 1 1 86 46 47 0.038 0.48 1.02 Intr - 34011 33880 132 2 0 80 62 75 0.730 3.86 1.01 Init - 35896 34692 1205 1 2 75 77 1603 0.921 148.44 1.00 Prom - 43068 43029 40 -3.76 2.00 Prom + 45149 45188 40 -2.86 2.01 Init + 45983 46042 60 1 0 52 53 102 0.665 4.35 2.02 Term + 51723 51890 168 2 0 28 47 141 0.594 1.88 2.03 PlyA + 51950 51955 6 1.05 3.04 PlyA - 52966 52961 6 1.05 3.03 Term - 53274 53138 137 1 2 49 44 151 0.490 4.98 3.02 Intr - 70833 70717 117 1 0 62 92 38 0.122 2.04 3.01 Init - 71344 71293 52 1 1 63 110 34 0.189 4.44 3.00 Prom - 89339 89300 40 -1.96 4.00 Prom + 94519 94558 40 -4.26 4.01 Init + 100001 100145 145 1 1 79 80 211 0.648 19.54 4.02 Intr + 126508 126687 180 2 0 147 101 228 0.999 29.84 4.03 Intr + 132558 132571 14 1 2 74 99 0 0.002 -5.80 4.04 Intr + 141986 142090 105 1 0 116 45 107 0.392 9.61 4.05 Intr + 142475 142523 49 1 1 82 101 56 0.352 4.55 4.06 Term + 148384 148502 119 2 2 -24 45 123 0.050 -4.50 4.07 PlyA + 148661 148666 6 1.05 5.06 PlyA - 148700 148695 6 1.05 5.05 Term - 154428 154267 162 0 0 26 33 137 0.067 -0.06 5.04 Intr - 157483 157302 182 2 2 41 69 124 0.201 5.39 5.03 Intr - 159162 158762 401 2 2 83 81 170 0.222 9.95 5.02 Intr - 167687 167497 191 2 2 29 110 47 0.001 -0.62 5.01 Init - 170538 170536 3 0 0 97 81 0 0.003 0.20 5.00 Prom - 171837 171798 40 -6.16 6.00 Prom + 172636 172675 40 -4.96 6.01 Init + 174978 175165 188 2 2 104 94 111 0.875 9.84 6.02 Intr + 175805 175871 67 2 1 70 94 57 0.931 3.41 6.03 Intr + 176448 176593 146 2 2 57 80 132 0.876 8.38 6.04 Intr + 179806 179894 89 1 2 70 71 47 0.885 0.81 6.05 Intr + 180166 180260 95 2 2 93 75 72 0.929 6.08 6.06 Intr + 184332 184427 96 2 0 61 88 88 0.987 6.31 6.07 Intr + 186077 186152 76 1 1 80 115 43 0.999 5.29 6.08 Intr + 188697 188790 94 1 1 67 59 140 0.985 8.02 6.09 Intr + 190359 190576 218 0 2 106 87 115 0.997 11.35 6.10 Intr + 197718 197885 168 1 0 87 87 152 0.979 14.92 6.11 Intr + 198185 198378 194 0 2 74 113 144 0.998 14.71 6.12 Intr + 202881 202973 93 2 0 73 105 48 0.960 5.16 6.13 Intr + 205095 205193 99 2 0 71 100 26 0.822 2.41 6.14 Term + 208391 208498 108 1 0 107 47 61 0.964 2.41 6.15 PlyA + 209108 209113 6 1.05 7.08 PlyA - 211195 211190 6 1.05 7.07 Term - 213813 213700 114 0 0 64 49 162 0.951 8.47 7.06 Intr - 214742 214649 94 1 1 90 67 27 0.460 0.77 7.05 Intr - 215574 215390 185 0 2 50 90 236 0.995 18.59 7.04 Intr - 215874 215716 159 0 0 104 105 313 0.849 34.78 7.03 Intr - 217754 217627 128 0 2 58 111 143 0.998 14.00 7.02 Intr - 218181 218068 114 1 0 72 123 80 0.350 10.32 7.01 Init - 220245 220182 64 0 1 68 84 55 0.272 4.41 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 163182 163317 136 2 1 59 38 121 0.811 4.50 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:37930574_38157264|GENSCAN_predicted_peptide_1|816_aa MAPWPHENSSLAPWPDLPTLAPNTANTSGLPGVPWEAALAGALLALAVLATVGGNLLVIV AIAWTPRLQTMTNVFVTSLAAADLVMGLLVVPPAATLALTGHWPLGATGCELWTSVDVLC VTASIETLCALAVDRYLAVTNPLRYGALVTKRCARTAVVLVWVVSAAVSFAPIMSQWWRV GADAEAQRCHSNPRCCAFASNMPYVLLSSSVSFYLPLLVMLFVYARVFVVATRQLRLLRG ELGRFPPEESPPAPSRSLAPAPVGTCAPPEGVPACGRRPARLLPLREHRALCTLGLIMGT FTLCWLPFFLANVLRALGGPSLVPGPAFLALNWLGYANSAFNPLIYCRSPDFRSAFRRLL CRCGRRLPPEPCAAARPALFPSGVPAARSSPAQPRLCQRLDGPGSDLKAGLVPPHPPPKP SASVLGFLKGLTAVEVRIHFRYEVQLLLALYTTGGHGPIKTHLAATHRGSSYTEAIPFGL KTRLQISQDPSLNYEYLPTMGLKSFIQASLALLFGKHSQAIVENRVGGVHTVGDSGAFQL GVQFLRAWHKDARIVYIISSQKELHGLVFQDMGFTVYEYSVWDPKKLCMDPDILLNVVES KQIFPFFDIPCQGLYTSDLEEDTRILQYFVSQGFEFFCSQSLSKNFGIYDEGVGMLVVVA VNNQQLLCVLSQLEGLAQALWLNPPNTGARVITSILCNPALLGEWKQSLKEVVENIMLTK EKVKEKLQLLGTPGSWGHITEQSGTHGYLGLNSQQVEYLVRKKHIYIPKNGQINFSCINA NNINYITEGINEAVLLTESSEMCLPKEKKTLIGIKL >gi568815590f:37930574_38157264|GENSCAN_predicted_CDS_1|2451_bp atggctccgtggcctcacgagaacagctctcttgccccatggccggacctccccaccctg gcgcccaataccgccaacaccagtgggctgccaggggttccgtgggaggcggccctagcc ggggccctgctggcgctggcggtgctggccaccgtgggaggcaacctgctggtcatcgtg gccatcgcctggactccgagactccagaccatgaccaacgtgttcgtgacttcgctggcc gcagccgacctggtgatgggactcctggtggtgccgccggcggccaccttggcgctgact ggccactggccgttgggcgccactggctgcgagctgtggacctcggtggacgtgctgtgt gtgaccgccagcatcgaaaccctgtgcgccctggccgtggaccgctacctggctgtgacc aacccgctgcgttacggcgcactggtcaccaagcgctgcgcccggacagctgtggtcctg gtgtgggtcgtgtcggccgcggtgtcgtttgcgcccatcatgagccagtggtggcgcgta ggggccgacgccgaggcgcagcgctgccactccaacccgcgctgctgtgccttcgcctcc aacatgccctacgtgctgctgtcctcctccgtctccttctaccttcctcttctcgtgatg ctcttcgtctacgcgcgggttttcgtggtggctacgcgccagctgcgcttgctgcgcggg gagctgggccgctttccgcccgaggagtctccgccggcgccgtcgcgctctctggccccg gccccggtggggacgtgcgctccgcccgaaggggtgcccgcctgcggccggcggcccgcg cgcctcctgcctctccgggaacaccgggccctgtgcaccttgggtctcatcatgggcacc ttcactctctgctggttgcccttctttctggccaacgtgctgcgcgccctggggggcccc tctctagtcccgggcccggctttccttgccctgaactggctaggttatgccaattctgcc ttcaacccgctcatctactgccgcagcccggactttcgcagcgccttccgccgtcttctg tgccgctgcggccgtcgcctgcctccggagccctgcgccgccgcccgcccggccctcttc ccctcgggcgttcctgcggcccggagcagcccagcgcagcccaggctttgccaacggctc gacggacctggctcggacttgaaggcagggctagtgcccccccacccgccccccaagccc tcggcctcagttctgggttttctcaaaggtttgacagctgtggaggtgagaatccacttc cggtatgaagtacagttactgttggctctgtacaccactggaggacatgggcccatcaaa acacacctggctgcaacacacagaggcagtagctacacagaagccataccttttggcctg aagactcgactacagatttcacaggatccctccctgaattatgagtacttgcccaccatg ggcctgaaatcattcatccaggcctctctagcactcctctttggaaagcacagccaagcc attgtggagaacagggtagggggtgtacacactgttggtgacagtggtgccttccagctt ggcgtccagtttctcagagcttggcataaggatgctcgtatagtttacatcatctcttct caaaaagaactgcatggactcgtcttccaggacatgggctttacagtttatgaatactct gtctgggaccccaagaagctatgcatggaccccgacatactcctcaatgtggtggagagc aagcagatattcccattttttgatattccctgtcaaggtttatacaccagtgacttggaa gaagatactagaatcttacaatactttgtgtctcaaggctttgagttcttctgcagccag tctctgtccaagaattttggcatttatgatgaaggagtggggatgctagtggtggtggca gtcaacaaccagcagctgctgtgtgtcctctcccagctggaaggattagcccaggccctg tggctaaacccccccaacacgggtgcacgtgtcatcacctccatcctctgcaaccctgct ctgctgggagaatggaagcagagtctaaaagaagttgtagagaacatcatgctaaccaag gaaaaagtgaaggagaaactccagctcctgggaacccctgggtcctggggtcacatcacc gagcagagtgggacccacggctatcttggactcaactcccagcaggtggaatacctggtc aggaagaagcacatctatatccccaagaacggtcagattaacttcagctgtatcaatgcc aacaacataaattacatcactgagggcatcaatgaggctgtcctcctcacagagagctca gagatgtgtcttccaaaggaaaaaaaaacactgattggaataaaactttag >gi568815590f:37930574_38157264|GENSCAN_predicted_peptide_2|75_aa MKRQTPWLQKAPTLVQKRNQLTIKQPEAGPSGGIPEEGIAITGDGSSMRVIAPKDLPVGQ DVEVEDSDIEDPDPG >gi568815590f:37930574_38157264|GENSCAN_predicted_CDS_2|228_bp atgaagcgccagaccccatggctgcagaaagctccaaccctggtgcagaagcggaaccag ttaactataaaacagcctgaggcaggtccttcaggaggtattccagaagaaggcattgct atcacaggagacggtagctccatgcgtgttatcgctcctaaagaccttccagtaggacaa gatgtggaggtggaagacagtgatattgaggatcctgaccctgggtag >gi568815590f:37930574_38157264|GENSCAN_predicted_peptide_3|101_aa METSWTITDALVTPTVEGFKGTKACYHSPATAWASKAYKLSLQLPFYLSKNRTSLTAEHS SALPLKVIYDVMRVAAINIAATDWAGPKISSTVPESMLAKD >gi568815590f:37930574_38157264|GENSCAN_predicted_CDS_3|306_bp atggaaacctcctggaccatcacagatgctttggtaactcctacagtggagggatttaag gggactaaagcctgttatcactcgcctgctacagcatgggcttctaaagcctataaactc tccttacaactcccattttacctgtccaaaaaccggacaagtcttacagctgagcattcc tccgcactaccgttaaaagtcatctatgatgtcatgagggtggcggccatcaacattgca gccacagactgggcaggccccaaaatctcttcaacggttccagagagtatgctggctaaa gattga >gi568815590f:37930574_38157264|GENSCAN_predicted_peptide_4|203_aa MSGGSSCSQTPSRAIPATRRVVLGDGVQLPPGDYSTTPGGTLFSTTPGGTRIIYDRKFLM ECRNSPVTKTPPRDLPTIPGVTSPSSDEPPMEASQSHLRNSPEDKRAGGSKRQIFGFLVL HSFIHTLSASTMIHRLQVPAEKPTYHILHSAGTLTRDIRYGLTHEMKAQKLQQRLNDCSI EMNYRIIIFIMTLTNTESFVLLN >gi568815590f:37930574_38157264|GENSCAN_predicted_CDS_4|612_bp atgtccgggggcagcagctgcagccagaccccaagccgggccatccccgccactcgccgg gtggtgctcggcgacggcgtgcagctcccgcccggggactacagcacgacccccggcggc acgctcttcagcaccaccccgggaggtaccaggatcatctatgaccggaaattcctgatg gagtgtcggaactcacctgtgaccaaaacacccccaagggatctgcccaccattccgggg gtcaccagcccttccagtgatgagccccccatggaagccagccagagccacctgcgcaat agcccagaagataagcgggcgggcgggtctaagagacagatatttgggtttcttgtacta cacagcttcattcatactctaagtgccagcaccatgattcatcgcctgcaggttccagca gaaaaacccacctaccacatcctgcattctgcagggactctcacaagagacattcgctat ggcctaacccatgaaatgaaggcccagaagcttcagcagagactgaatgactgctccatt gagatgaactaccgcataattatcttcatcatgactttaacaaacacagaaagctttgtt ttgcttaattga >gi568815590f:37930574_38157264|GENSCAN_predicted_peptide_5|312_aa MCYYREQETKGKYWTLEGLAELGGCTPAGAWISVFTLCSICTKLRKSHSKDERKHASEVL VLPGRHFQLEEHTLSYMEKPQALIDLMQSIFLTHNPTWADCKQLLLSLFNTEEHRRVIQA AHQRLEKNAPVGTGDVRQCARQALPIETDPGWDPNQAQDLLKLLRYQEALIQGIKTEGKK ATNTGKVSEVYQKPDESPMLKPSGDYQPVQDLRAVNQVAAILHAIVPNPYTVLGQIPASA AWFTCLDIKDAFFCILLAPSTKTCSLSSRPMDESARSRSSNSTDLAEEPRRSRKRQLPCY DHTRGWSVNARL >gi568815590f:37930574_38157264|GENSCAN_predicted_CDS_5|939_bp atgtgttattatagggaacaggagaccaagggaaaatactggacactggaaggcctggct gagcttggaggatgcacacctgctggggcatggatttctgtcttcacactttgctccatc tgcacaaagctacgtaagagtcacagcaaagatgaacggaaacatgcatctgaggttttg gttttgccaggcagacattttcaactggaagagcatactctctcctatatggaaaaaccc caggctcttatcgacctaatgcagtccatcttcttaactcacaacccaacctgggctgac tgcaaacagctccttctgtcactgttcaatacagaagaacaccgcagagtaatacaagcg gctcatcagaggctagaaaaaaatgccccagtaggtacaggagatgtcagacagtgtgct cggcaggctttgccaatagaaactgacccaggctgggacccaaatcaggcccaagatctg ctgaagttgctgagataccaagaggctctaatacaaggaataaagactgaagggaagaag gcaacaaacactggaaaggtttcagaagtctatcagaaaccagatgaaagccccatgtta aaaccttctggtgactaccagcctgtacaagatttaagggcagtcaaccaggtagctgct atactgcatgctattgtgcctaacccgtacactgtgcttggacaaatacctgctagtgct gcttggttcacatgcttggacattaaagatgcattcttctgcatcctgttagccccttca actaaaacctgcagcctcagttcaagaccaatggacgagtcagcaagatccagatcatcc aactcgactgatcttgcagaggaaccaaggcgcagcaggaaaagacaactgccctgctac gaccacaccagaggctggtcggtcaacgcacggctgtag >gi568815590f:37930574_38157264|GENSCAN_predicted_peptide_6|576_aa MAAAGAGPGQEAGAGPGPGAVANATGAEEGEMKPVAAGAAAPPGEGISAAPTVEPSSGEA EGGEANLVDVSGGLETESSNGKDTLEGAGDTSEVMDTQAGSVDEENGRQLGEVELQCGIC TKWFTADTFGIDTSSCLPFMTNYSFHCNVCHHSGNTYFLRKQANLKEMCLSALANLTWQS RTQDEHPKTMFSKDKSKERDVFLVKEHPDPGSKDPEEDYPKFGLLDQDLSNIGPAYDNQK QSSAVSTSGNLNGGIAAGSSGKGRGAKRKQQDGGTTGTTKKARSDPLFSAQRLPPHGYPL EHPFNKDGYRYILAEPDPHAPDPEKLELDCWAGKPIPGDLYRACLYERVLLALHDRAPQL KISDDRLTVVGEKGYSMVRASHGVRKGAWYFEITVDEMPPDTAARLGWSQPLGNLQAPLG YDKFSYSWRSKKGTKFHQSIGKHYSSGYGQGDVLGFYINLPEDTETAKSLPDTYKDKALI KFKSYLYFEEKDFVDKAEKSLKQTPHSEIIFYKNGVNQGVAYKDIFEGVYFPAISLYKSC TMSDMGWGAVVEHTLADVLYHVETEVDGRRSPPWEP >gi568815590f:37930574_38157264|GENSCAN_predicted_CDS_6|1731_bp atggcggcggcaggagcaggacctggccaggaagcgggtgccgggcctggcccaggagcg gtcgcaaatgcaacaggggcagaagagggggagatgaagccggtggcagcgggagcagcc gctcctcctggagaggggatctctgctgctccgacagttgagcccagttccggggaggct gaaggcggggaggcaaacttggtcgatgtaagcggtggcttggagacagaatcatctaat ggaaaagatacactagaaggtgctggggatacatcagaggtgatggatactcaggcgggc tccgtggatgaagagaatggccgacagttgggtgaggtagagctgcaatgtgggatttgt acaaaatggttcacggctgacacatttggcatagatacctcatcctgtctacctttcatg accaactacagttttcattgcaacgtctgccatcacagtgggaatacctatttcctccgg aagcaagcaaacttgaaggaaatgtgccttagtgctttggccaacctgacatggcagtcc cgaacacaggatgaacatccgaagacaatgttctccaaagataagagtaaagaaagagat gtattcttggtaaaggaacacccagatccaggcagtaaagatccagaagaagattacccc aaatttggacttttggatcaggaccttagtaacattggtcctgcttatgacaaccaaaaa cagagcagtgctgtgtctactagtgggaatttaaatgggggaattgcagcaggaagcagc ggaaaaggacgaggagccaagcgcaaacagcaggatggagggaccacagggaccaccaag aaggcccggagtgaccctttgttttctgctcagcgccttccccctcatggctacccattg gaacacccgtttaacaaagatggctatcggtatattctagctgagcctgatccgcacgcc cctgaccccgagaagctggaacttgactgctgggcaggaaaacctattcctggagacctc tacagagcctgcttgtatgaacgggttttgttagccctacatgatcgagctccccagtta aagatctcagatgaccggctgactgtggttggagagaagggctactctatggtgagggcc tctcatggagtacggaaaggtgcctggtattttgaaatcactgtggatgagatgccacca gataccgctgccagactgggttggtcccagcccctaggaaaccttcaagctcctttaggt tatgataaatttagctattcttggcggagcaaaaagggaaccaagttccaccagtccatt ggcaaacactactcttctggctatggacagggagacgtcctgggattttatattaatctt cctgaagacacagagacagccaagtcattgccagacacatacaaagataaggctttgata aaattcaagagttatttgtattttgaggaaaaagactttgtggataaagcagagaagagc ctgaagcagactccccatagtgagataatattttataaaaatggtgtcaatcaaggtgtg gcttacaaagatatttttgagggggtttacttcccagccatctcactgtacaagagctgc acgatgagtgacatgggctggggcgccgtggtagagcacaccctggctgacgtcttgtat cacgtggagacagaagtggatgggaggcgcagtcccccatgggaaccctga >gi568815590f:37930574_38157264|GENSCAN_predicted_peptide_7|285_aa MLLATFKLCAGSSYRHMRNMKGLRQQAVMAISQELNRRALGGPTPSTWINQVRRRSSLLG SRLEETLYSDQELAYLQQGEEAMQKALGILSNQEGWKKESQQDNGDKVMSKVVPDVGKVF RLEVVVDQPMERLYEELVERMEAMGEWNPNVKEIKVLQKIGKDTFITHELAAEAAGNLVG PRDFVSVRCAKRRGSTCVLAGMATDFGNMPEQKGVIRAEHGPTCMVLHPLAGSPSKTKLT WLLSIDLKGWLPKSIINQVLSQTQVDFANHLRKRLESHPASEARC >gi568815590f:37930574_38157264|GENSCAN_predicted_CDS_7|858_bp atgctgctagcgacattcaagctgtgcgctgggagctcctacagacacatgcgcaacatg aaggggctgaggcaacaggctgtgatggccatcagccaggagctgaaccggagggccctg gggggccccacccctagcacgtggattaaccaggttcggcggcggagctctctactcggt tctcggctggaagagactctctacagtgaccaggagctggcctatctccagcagggggag gaggccatgcagaaggccttgggcatccttagcaaccaagagggctggaagaaggagagt cagcaggacaatggggacaaagtgatgagtaaagtggtcccagatgtgggcaaggtgttc cggctggaggtcgtggtggaccagcccatggagaggctctatgaagagctcgtggagcgc atggaagcaatgggggagtggaaccccaatgtcaaggagatcaaggtcctgcagaagatc ggaaaagatacattcattactcacgagctggctgccgaggcagcaggaaacctggtgggg ccccgtgactttgtgagcgtgcgctgtgccaagcgccgaggctccacctgtgtgctggct ggcatggccacagacttcgggaacatgcctgagcagaagggtgtcatcagggcggagcac ggtcccacttgcatggtgcttcacccgttggctggaagtccctctaagaccaaacttacg tggctactcagcatcgacctcaaggggtggctgcccaagagcatcatcaaccaggtcctg tcccagacccaggtggattttgccaaccacctgcgcaagcgcctggagtcccaccctgcc tctgaagccaggtgttga