GENSCAN 1.0 Date run: 3-Nov-116 Time: 01:20:04 Sequence gi568815594r:55335418_55582785 : 247368 bp : 39.62% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 707 702 6 1.05 1.02 Term - 4171 3703 469 0 1 96 44 217 0.250 11.56 1.01 Init - 6997 6990 8 1 2 45 93 0 0.460 -3.40 1.00 Prom - 8282 8243 40 -5.25 2.00 Prom + 10095 10134 40 -6.55 2.01 Init + 10920 11140 221 2 2 94 76 194 0.895 14.95 2.02 Intr + 16631 16790 160 2 1 46 79 74 0.014 1.37 2.03 Intr + 17030 17080 51 1 0 43 81 87 0.021 1.69 2.04 Intr + 23929 24071 143 0 2 110 86 41 0.095 4.33 2.05 Intr + 28712 28854 143 2 2 22 101 130 0.051 6.78 2.06 Term + 30046 30152 107 2 2 86 42 35 0.064 -3.71 2.07 PlyA + 30692 30697 6 1.05 3.06 PlyA - 30922 30917 6 -1.95 3.05 Term - 31470 31156 315 0 0 56 42 267 0.508 12.86 3.04 Intr - 46396 46308 89 2 2 42 84 59 0.543 -0.33 3.03 Intr - 51006 50892 115 0 1 88 91 119 0.966 11.30 3.02 Intr - 51473 51286 188 0 2 82 82 73 0.899 4.59 3.01 Init - 51942 51855 88 0 1 58 62 88 0.901 4.05 3.00 Prom - 56617 56578 40 -4.25 4.00 Prom + 56947 56986 40 -7.25 4.01 Init + 60773 60979 207 1 0 97 83 345 0.991 31.77 4.02 Intr + 76197 76422 226 2 1 73 109 172 0.626 14.44 4.03 Intr + 81655 81830 176 2 2 67 74 172 0.778 12.44 4.04 Intr + 82386 82568 183 2 0 64 105 61 0.764 4.46 4.05 Intr + 89121 89226 106 2 1 64 119 99 0.545 9.47 4.06 Term + 96491 96555 65 0 2 58 47 53 0.029 -4.73 4.07 PlyA + 97039 97044 6 1.05 5.14 PlyA - 98196 98191 6 1.05 5.13 Term - 100108 99998 111 1 0 78 42 113 0.624 3.28 5.12 Intr - 103120 102865 256 0 1 112 91 247 0.995 23.92 5.11 Intr - 107217 107015 203 0 2 74 63 49 0.880 -1.84 5.10 Intr - 108479 108270 210 2 0 49 95 102 0.890 5.19 5.09 Intr - 109362 109216 147 0 0 2 91 197 0.115 10.91 5.08 Intr - 114079 113979 101 2 2 72 115 20 0.460 2.01 5.07 Intr - 114815 114674 142 2 1 56 76 95 0.636 4.11 5.06 Intr - 117712 117637 76 0 1 84 53 129 0.928 7.60 5.05 Intr - 118407 118260 148 1 1 62 85 -1 0.614 -4.63 5.04 Intr - 120586 120480 107 0 2 94 93 61 0.778 6.14 5.03 Intr - 120883 120801 83 1 2 66 91 46 0.954 0.22 5.02 Intr - 123593 123475 119 0 2 83 95 60 0.879 5.46 5.01 Init - 126057 125820 238 0 1 64 78 77 0.280 2.52 5.00 Prom - 127630 127591 40 -3.25 6.00 Prom + 127975 128014 40 -3.95 6.01 Init + 134074 134156 83 0 2 61 20 139 0.045 4.89 6.02 Intr + 155363 155484 122 2 2 79 25 73 0.001 -0.68 6.03 Intr + 160129 160269 141 2 0 81 81 28 0.036 0.80 6.04 Intr + 174151 174225 75 2 0 84 115 8 0.179 1.67 6.05 Term + 207942 208075 134 1 2 94 42 126 0.886 5.87 6.06 PlyA + 208513 208518 6 1.05 7.02 PlyA - 209660 209655 6 1.05 7.01 Sngl - 210682 210278 405 1 0 61 44 400 0.985 26.93 7.00 Prom - 219550 219511 40 -5.45 8.06 PlyA - 219904 219899 6 1.05 8.05 Term - 221294 221140 155 0 2 60 44 116 0.851 1.50 8.04 Intr - 227195 226987 209 1 2 77 90 70 0.821 3.90 8.03 Intr - 234444 234301 144 2 0 36 86 146 0.926 7.68 8.02 Intr - 245494 245404 91 2 1 63 60 162 0.854 9.03 8.01 Intr - 246820 246700 121 1 1 109 47 183 0.653 15.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 18274 18159 116 2 2 89 49 120 0.844 5.95 S.002 Term + 61494 61583 90 2 0 54 41 115 0.860 0.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:55335418_55582785|GENSCAN_predicted_peptide_1|158_aa MPRSLAHFPLWPLPRGRIRPLLVLACRYKSQRRIRPKPYEVARGPRRGPTNSVQQQDLSP STQTTLQTGLSVIIHTHTHSALRNFTIKEVLYRLPWLLLRWSVHRVVAMVDEDPLPQVSG QFLSVLLRVQIYSSHRVGLSPLPLRPPQEAAWRLMRED >gi568815594r:55335418_55582785|GENSCAN_predicted_CDS_1|477_bp atgccaaggtcccttgcacacttcccactctggccgcttccccgagggagaattaggccc ctcttagtgttggcatgccggtataaatcccaacgcaggatccgccctaagccatatgag gtagctaggggaccgcggagaggacccactaactccgtccagcagcaggacttgtcacca tccacacaaacaacactgcaaacagggttgtctgtgatcattcacacacatacacattca gccctccggaatttcaccatcaaggaagtactttatcgactcccgtggcttctccttcgt tggtctgtgcacagagtcgtcgccatggtagatgaggatcctttaccccaggttagtggc cagtttctttccgtgttgctgagagtccagatttattcatcacaccgggtgggtctcagc cccttacccctaaggccaccacaagaggcggcatggcgcctcatgagagaggactag >gi568815594r:55335418_55582785|GENSCAN_predicted_peptide_2|274_aa MAPWAEAEHSALNPLRAVWLTLTAAFLLTLLLQLLPPGLLPGCAIFQDLIRYGKTKCGEP SRPAACRAFDVPKRATPYFCSNFSISGITQSVIQGIHLVYRQSIIDCTKSGLAFMEKLIK VGINEDVKGCSLHETAEGVHDGEMIFFPLLYHLSAVEWLPALVPYSISVPGSTFSKLASW FAQNSRGGTVPGLRRLFECLYVSVFSNVMIHVVQYCFGLVYYVLVGLTVLSQVPMDGRNG MSVVAFCQPVPWKGLSFLVGSAGSPPKGTLQSAW >gi568815594r:55335418_55582785|GENSCAN_predicted_CDS_2|825_bp atggctccctgggcggaggccgagcactcggcgctgaacccgctgcgcgcggtgtggctc acgctgaccgccgccttcctgctgaccctactgctgcagctcctgccgcccggcctgctc ccgggctgcgcgatcttccaggacctgatccgctatgggaaaaccaagtgtggggagccg tcgcgccccgccgcctgccgagcctttgatgtccccaagagagccactccatacttctgc tccaatttctcaatttcaggcattacacaatcagttatacaaggaatacacttggtatac agacagtccatcattgactgcactaagtctggtttggcttttatggaaaagttgataaag gttggtatcaatgaggatgtgaagggctgtagccttcacgaaactgccgaaggggtccat gatggagaaatgatatttttcccacttttatatcatctcagtgctgtggaatggcttcct gctttggtgccttactcaatctctgttcctgggagcaccttttccaagctggcttcatgg tttgctcagaattctcggggcggcacagttccaggcttacgaagactcttcgagtgcctc tacgtcagtgtcttctccaatgtcatgattcacgtcgtgcagtactgttttggacttgtc tattatgtccttgttggcctaactgtgctgagccaagtgccaatggatggcaggaatggc atgtctgtggtggcattctgccagccagtaccatggaaggggctgtcattcctggtgggc agtgctggttctccccctaaaggaactttgcagagcgcttggtga >gi568815594r:55335418_55582785|GENSCAN_predicted_peptide_3|264_aa MKLQRKFNKQKKKALHSGEGTWRRVAAFTVSSLVDYTSVEMFLLKMMQGERQSYKVGIHN FSILPHFPIKMKEKKLVRSPSLSEAGIHYPAQGVTFIFMAQCDCKSSSIQVPSRRMEDWL EKQGDTGYASEPFPVFGAAAIAVLTVTPAARHHHHCNWAQEEGKNTDLLRTCGANHWLQK SKQGLPSAQQEAVSLVGGLTEETNVSTDNGQEPEALEPRGNPKKLIQVREPRIMQFHMAT SKQESVNDSKGLMKIHRSTCVSLF >gi568815594r:55335418_55582785|GENSCAN_predicted_CDS_3|795_bp atgaagttgcagcgaaagtttaataagcaaaagaagaaagctctccacagtggagagggg acctggaggagggttgctgcttttacagtatcttcccttgtggactacaccagtgtcgaa atgttcttgctaaaaatgatgcagggagaaagacaaagttacaaagttggcatccacaat ttttccatcttgccccacttcccaattaaaatgaaggaaaaaaaattggtgaggtctcca tccctgtctgaagcaggcatccattatcctgcccagggggtgacctttatcttcatggcc cagtgtgactgcaagagctctagcatccaagttcccagcagaaggatggaggactggctg gaaaaacaaggagacacagggtatgcatcagagccctttcctgtctttggagccgcagcc attgctgtcctgactgttactcctgctgcccgccaccatcaccactgcaactgggcccag gaagaagggaaaaacacagaccttctgaggacctgcggagcaaaccactggcttcagaaa agcaaacagggacttccctctgcgcagcaggaagcagttagcctagtcggaggcttgact gaagaaacaaatgtatccactgataatgggcaagagcctgaggctctggagccaaggggg aaccctaaaaagctgattcaagtaagagaaccaaggatcatgcaattccacatggccacc tcgaagcaggaatccgttaatgatagtaaaggactgatgaaaatccaccggtctacctgt gtcagtttattttaa >gi568815594r:55335418_55582785|GENSCAN_predicted_peptide_4|320_aa MAAAAPGNGRASAPRLLLLFLVPLLWAPAAVRAGPDEDLSHRNKEPPAPAQQLQPQPVAV QGPEPARVEKIFTPAAPVHTNKEDPATQTNLGFIHAFVAAISVIIVSELGDKTFFIAAIM AMRYNRLTVLAGAMLALGLMTCLSVLFGYATTVIPRVYTYYVSTVLFAIFGIRMLREGLK MSPDEGQEELEEVQAELKKKDEEFQRTKLLNGPGDVETGTSITVPQKKWLHFISPIFVQA LTLTFLAEWGDRSQLTTIVLAAREDPYGVAVGGTVGHCLCTGLAVIGGRMIAQKISVRTV LLMELDCFSAPSVKQTALRQ >gi568815594r:55335418_55582785|GENSCAN_predicted_CDS_4|963_bp atggcggccgcggctccagggaacggccgcgcatcggcgccccggctgcttctgctcttt ctggttccgctgctgtgggccccggctgcggtccgggccggcccagatgaagaccttagc caccggaacaaagaaccgccggcgccggcccagcagctgcagccgcagcctgtggctgtg cagggccccgagccggcccgggtcgagaaaatatttacaccagcagctccagttcatacc aataaagaagatcctgctacccaaactaatttgggatttatccatgcatttgtcgctgcc atatcagttattattgtatctgaattgggtgataagacattttttatagcagccatcatg gcaatgcgctataaccgcctgaccgtgctggctggtgcaatgcttgccttgggactaatg acatgcttgtcagttttgtttggctatgccaccacagtcatccccagggtctatacatac tatgtttcaactgtattatttgccatttttggcattagaatgcttcgggaaggcttaaag atgagccctgatgagggtcaagaggaactggaagaagttcaagctgaattaaagaagaaa gatgaagaatttcaacgaaccaaacttttaaatggaccgggagatgttgaaacgggtaca agcataacagtacctcagaaaaagtggttgcattttatttcacccatttttgttcaagct cttacattaacattcttagcagaatggggtgatcgctctcaactaactacaattgtattg gcagctagagaggacccctatggtgtagccgtgggtggaactgtggggcactgcctgtgc acgggattggcagtaattggaggaagaatgatagcacagaaaatctctgtcagaactgtg ctcttgatggagctggactgcttcagtgctcctagcgtcaagcagactgctttaaggcaa tga >gi568815594r:55335418_55582785|GENSCAN_predicted_peptide_5|646_aa MTTFPVTAVLPHGGPGLHCNESIFHQHLWPRKSSSTLLIKVAATFVIWFGRLKKITQVEN FEKDQTGRQKGKPYNPSNKISSSAHNGFEGTIQRTHRPSYEDRVCFVATVRLATPQFIKE MCTVEEPNEEFTSRHSLEWKFLFLDHRAPPIIGYLPFEVLGTSGYDYYHVDDLENLAKCH EHLMQYGKGKSCYYRFLTKGQQWIWLQTHYYITYHQWNSRPEFIVCTHTVVSYAEVRAER RRELGIEESLPETAADKSQDSGSDNRINTVSLKEALERFDHSPTPSASSRSSRKSSHTAV SDPSSTPTKIPTDTSTPPRQHLPAHEKMVQRRSSFSSQFSAQLGAMQHLKDQLEQRTRMI EANIHRQQEELRKIQEQLQMVHGQGLQMFLQQSNPGLNFGSVQLSSGNSSNIQQLAPINM QGQVVPTNQIQSGMNTGHIGTTQHMIQQQTLQSTSTQSQQNVLSGHSQQTSLPSQTQSTL TAPLYNTMVISQPAAGSMVQIPSSMPQNSTQSAAVTTFTQDRQIRFSQGQQLVTKLVTAP VACGAVMVPSTMLMGQVVTAYPTFATQQQQSQTLSVTQQQQQQSSQEQQLTSVQQPSQAQ LTQPPQQFLQSTFPQSHHQQHQSQQQQQLSRHRTDSLPDPSKVQPQ >gi568815594r:55335418_55582785|GENSCAN_predicted_CDS_5|1941_bp atgactacttttcctgtaacagctgttttgcctcatggaggtccaggtcttcattgcaat gaaagcatctttcaccagcatctctggcctagaaagagcagcagtaccttgcttataaaa gttgctgccactttcgtcatctggtttgggagattaaaaaaaatcactcaggtagaaaat tttgaaaaggaccagactgggagacaaaaaggaaaaccatataatcctagcaataaaata tcctcttcagcacacaatggttttgaaggaactatacaacgcacacataggccatcttat gaagatagagtttgttttgtagctactgtcaggttagctacacctcagttcatcaaggaa atgtgcactgttgaagaacccaatgaagagtttacatctagacatagtttagaatggaag tttctgtttctagatcacagggcaccacccataatagggtatttgccatttgaagttctg ggaacatcaggctatgattactatcatgtggatgacctagaaaatttggcaaaatgtcat gagcacttaatgcaatatgggaaaggcaaatcatgttattataggttcctgactaagggg caacagtggatttggcttcagactcattattatatcacttaccatcagtggaattcaagg ccagagtttattgtttgtactcacactgtagtaagttatgcagaagttagggctgaaaga cgacgagaacttggcattgaagagtctcttcctgagacagctgctgacaaaagccaagat tctgggtcagataatcgtataaacacagtcagtctcaaggaagcattggaaaggtttgat cacagcccaaccccttctgcctcttctcggagttcaagaaaatcatctcacacggccgtc tcagacccttcctcaacaccaaccaagatcccgacggatacgagcactccacccaggcag catttaccagctcatgagaagatggtgcaaagaaggtcatcatttagtagtcagttttca gctcaattaggagccatgcaacatctgaaagaccaattggaacaacggacacgcatgata gaagcaaatattcatcggcaacaagaagaactaagaaaaattcaagaacaacttcagatg gtccatggtcaggggctgcagatgtttttgcaacaatcaaatcctgggttgaattttggt tccgttcaactttcttctggaaattcatctaatatccagcaacttgcacctataaatatg caaggccaagttgttcctactaaccagattcaaagtggaatgaatactggacacattggc acaactcagcacatgatacaacaacagactttacagagtacatcaactcagagtcaacaa aatgtactgagtgggcacagtcagcaaacatctctacccagtcagacacagagcactctt acagccccactgtataacactatggtgatttctcagcctgcagccggaagcatggtccag attccatctagtatgccacaaaacagcacccagagtgctgcagtaactacattcactcag gacaggcagataagattttctcaaggtcaacaacttgtgaccaaattagtgactgctcct gtagcttgtggggcagtcatggtacctagtactatgcttatgggccaggtggtgactgca tatcctacttttgctacacaacagcaacagtcacagacattgtcagtaacgcagcagcag cagcagcagagctcccaggagcagcagctcacttcagttcagcaaccatctcaggctcag ctgacccagccaccgcaacaatttttacagagcaccttccctcagtcacatcaccagcaa catcagtctcagcaacagcagcaactcagccggcacaggactgacagcttgcccgaccct tccaaggttcaaccacagtag >gi568815594r:55335418_55582785|GENSCAN_predicted_peptide_6|184_aa MPIVSPEDPPMGQDVEVEDSDVDDPDPAWLNPQTWNLQMQKANCIDQMDLTDIHKTFHPT TAEYTLFSNYISYYPLLIVVIHKPQDKPLHSTKGVTLSNTSFEFQCTQNDPPKLATKAWQ EIILTRLSVICKSSSLNSGKGNNRAHTGGKMAKQMIIQDDSVFKAEKNIFANAQKKIYKI IQWC >gi568815594r:55335418_55582785|GENSCAN_predicted_CDS_6|555_bp atgcctattgtttctcctgaagaccctccaatgggacaagatgtggaggtggaggacagt gatgttgatgatcctgaccccgcttggttgaatccacagacgtggaaccttcagatgcag aaggccaactgtatagaccaaatggacctaacagacatacacaaaacattccacccaaca acagcagaatacacattattctcaaactacatcagctattatcccctccttattgtagtc attcacaaaccccaggataagccccttcactccacgaagggtgtgaccctctccaatacc tcttttgaatttcaatgtacacaaaatgatcccccaaaactggccactaaggcctggcag gaaataattcttactcgactttcagtcatttgcaaatcatcttccttaaactcaggaaaa ggtaataacagagcccatactggaggcaaaatggcaaagcagatgatcattcaagatgat tccgttttcaaggctgaaaagaacatatttgcaaatgcacagaaaaagatctacaagatc atacagtggtgttaa >gi568815594r:55335418_55582785|GENSCAN_predicted_peptide_7|134_aa MGRRGGQLLRGTPALPAPGPRAAVRSPNGRRCRPRAGSVSGGGSGVPFLGSARLLASDSS DRARVRGLAARRSVSLLPSTRASRSPSPPLAFVDFFPPVKLGFLELALALWPLECNFLHA APQSLYSLKVFVAL >gi568815594r:55335418_55582785|GENSCAN_predicted_CDS_7|405_bp atggggcgccggggcggccagctcctccggggaacccccgccctcccggcgcccggcccg cgtgccgcagtccgcagtccgaacggccgccgttgccggccgcgggctggttccgttagt ggtggtggttccggggttccgttcctaggcagcgcgcggctattagcgtctgactccagc gaccgcgcgcgggttcgagggttggcggcgaggcgctcggtttctcttcttccgtccacc cgcgcttcccgttccccgtcaccgccgctggcgtttgtagatttctttccgccagtgaag ctgggttttctggagttggctctggcgctctggcccctggagtgtaatttcctacacgca gcgccgcagagtttatattctttgaaagtgtttgtagctttgtag >gi568815594r:55335418_55582785|GENSCAN_predicted_peptide_8|239_aa DPNEDTEWNDILRDFGILPPKEESKDEIEEMVLRLQKEAMVKPFEKMTLAQLKEAEDEFD EEDMQAVETYRKKRLQEWKALKKKQKFGELREISGNQYVNEVTNAEEDVWVIIHLYRSSI PMCLLVNQHLSLLARKFPETKFVKAIVNSCIQHYHDNCLPTIFVYKNGQIEAKFIGIIEC GGINLKLEELEWKLAEVGAIQTDLEENPRKDMVDMMVSSIRNTSIHDDSDSSNSDNDTK >gi568815594r:55335418_55582785|GENSCAN_predicted_CDS_8|720_bp gatcccaatgaagatacagaatggaatgacattttaagagatttcggcattcttcctcct aaagaagagtcaaaagatgaaattgaagaaatggttttacgtttacagaaagaagcaatg gtgaaaccatttgaaaagatgactcttgcacagctaaaggaagctgaagatgaatttgat gaagaagatatgcaggctgttgaaacatatagaaagaagcggttacaggaatggaaagct cttaagaaaaaacaaaaatttggagaattaagagaaatttctggaaatcagtatgtgaat gaagtcacaaatgcagaagaagatgtgtgggttataattcatctatacagatcaagcatc ccaatgtgtttgttggttaaccagcatcttagtcttctagcaagaaagtttccagaaact aaatttgttaaagccatcgtgaatagctgtattcaacactaccatgacaattgtttacca acaatttttgtgtataaaaatggtcagatagaagccaaattcattggaattatagaatgt ggagggataaatctcaagctggaagaacttgaatggaagctagcagaagttggagcaata cagactgatttggaagaaaaccccagaaaagacatggtagatatgatggtatcttcaatt agaaacacttctattcatgatgacagtgatagctccaacagtgataatgataccaaatag