GENSCAN 1.0 Date run: 6-Nov-116 Time: 09:06:34 Sequence gi568815589r:36239755_36476129 : 236375 bp : 42.63% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 103 98 6 1.05 1.05 Term - 738 341 398 1 2 31 38 156 0.009 -0.85 1.04 Intr - 6728 6277 452 1 2 64 84 503 0.858 39.52 1.03 Intr - 9643 9438 206 1 2 70 86 182 0.840 13.28 1.02 Intr - 19258 18828 431 2 2 92 81 307 0.070 22.91 1.01 Init - 25469 25220 250 2 1 63 39 340 0.071 24.27 1.00 Prom - 45947 45908 40 -4.15 2.05 PlyA - 45973 45968 6 -0.45 2.04 Term - 46744 46448 297 1 0 5 48 240 0.423 5.98 2.03 Intr - 52972 52875 98 2 2 42 81 150 0.851 8.51 2.02 Intr - 54397 54256 142 1 1 72 70 66 0.664 2.21 2.01 Init - 54886 54788 99 1 0 87 53 66 0.693 3.32 2.00 Prom - 60556 60517 40 -5.35 3.02 PlyA - 62008 62003 6 1.05 3.01 Sngl - 64216 63743 474 1 0 52 42 645 0.824 52.16 3.00 Prom - 73317 73278 40 -2.05 4.03 PlyA - 74576 74571 6 -0.45 4.02 Term - 78550 78308 243 1 0 88 49 161 0.462 7.12 4.01 Init - 79257 79192 66 0 0 81 116 -12 0.265 2.02 4.00 Prom - 79842 79803 40 -8.25 5.00 Prom + 81575 81614 40 -1.35 5.01 Init + 85369 85490 122 0 2 69 41 192 0.672 10.33 5.02 Term + 87105 87549 445 0 1 20 48 269 0.897 9.82 5.03 PlyA + 89052 89057 6 1.05 6.13 PlyA - 90324 90319 6 1.05 6.12 Term - 100060 99931 130 0 1 90 43 37 0.057 -3.93 6.11 Intr - 105199 105078 122 1 2 72 78 160 0.267 11.77 6.10 Intr - 111445 111361 85 0 1 76 88 91 0.981 6.80 6.09 Intr - 113094 112988 107 0 2 89 81 24 0.986 -0.11 6.08 Intr - 113577 113416 162 0 0 92 86 75 0.983 6.95 6.07 Intr - 116719 116549 171 1 0 107 86 23 0.895 3.22 6.06 Intr - 118188 118021 168 0 0 77 100 11 0.544 0.42 6.05 Intr - 130178 129965 214 1 1 67 87 191 0.591 14.70 6.04 Intr - 136373 136180 194 2 2 68 96 114 0.450 7.57 6.03 Intr - 151161 151059 103 2 1 41 41 101 0.002 -0.04 6.02 Intr - 156863 156704 160 0 1 81 56 100 0.002 4.22 6.01 Init - 167992 167866 127 1 1 66 44 105 0.176 4.27 6.00 Prom - 173654 173615 40 -6.75 7.06 PlyA - 175053 175048 6 1.05 7.05 Term - 176085 175861 225 0 0 38 36 221 0.741 7.70 7.04 Intr - 184929 184859 71 1 2 139 100 58 0.710 10.08 7.03 Intr - 198636 198507 130 0 1 17 47 156 0.191 3.75 7.02 Intr - 230617 230467 151 0 1 80 65 176 0.611 13.74 7.01 Init - 231039 230969 71 0 2 74 71 76 0.567 5.07 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 470 153 318 2 0 71 38 209 0.850 9.92 S.002 Sngl - 25469 25170 300 2 0 63 48 384 0.878 27.44 S.003 Term + 160736 161031 296 0 2 44 48 251 0.889 11.18 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:36239755_36476129|GENSCAN_predicted_peptide_1|578_aa MEENRRTQRLVFISIRTNPGTCRAGTTASLQPDREQQWAPRWIRSTADTLPDPEGWKSAA GLRLRQTAVVDNERKLSLSGNKHGLGTGWEFPVVEAAAELIIKMREALPCLGHQFTLHQC LLAVTARVNVRDRCGVAALHDSIVAVLVYEITEGTRGARNPSLGARPKQLPSGKTRASVR VSPVEEDASGREHKLAGQPRSSSSAAEAMEDGPLVSTLPAPQNTAGKELYFKNLSKRNKQ IMEKNGNNRKLRVCVATCNRADYSKLAPIMFGIKTEPEFFELDVVVLGSHLIDDYGNTYR MIEQDDFDINTRLHTIVRGEDEAAMVESVGLALVKLPDVLNRLKPDIMIVHGDRFDALAL ATSAALMNIRILHIEGGEVSGTIDDSIRHAITKLAHYHVCCTRSAEQHLISMCEDHDRIL LAGCPSYDKLLSAKNKDYMSIIRMWLGMQGWFNVYKSVNVMDHINRIKNHMIISVDAEKA LNKIQHPFMIKTLSKISIQGTYLNVIKAIYDKPTANVILNGEKLKAFPLRIGTRQGCPLS PLLFNMVLEVLARAIRQEKKRASKSVKRKSNRHSLLTI >gi568815589r:36239755_36476129|GENSCAN_predicted_CDS_1|1737_bp atggaagagaaccgtagaacccagcgactagtgttcatctcaattaggacgaacccaggc acttgccgtgcaggaacaacggcaagccttcagcccgatcgggagcagcagtgggcgcct cgctggatcaggagcacagcagacaccctgccagatccggaggggtggaagtcagcggcg ggtctgcgactgcggcaaacagcagtggtggacaacgagcgaaagcttagcttgagcggt aacaaacacggcctcgggactggctgggagttccctgtagtggaggccgccgctgaactg attataaagatgagagaggctctgccatgccttggtcatcaattcacactccaccagtgt cttctagcagtcacggcaagggttaacgtcagggaccgctgtggggtggccgcgctacac gacagtatagttgcggtcctggtttatgaaataactgagggaacaagaggcgcaagaaat ccctccttgggtgcaagaccaaaacaactacccagcgggaagactcgggcttcagtgcgt gtgtcgccagtggaggaggacgcttcggggcgggagcacaagctggcaggacagccccgc agcagctccagcgcggcagaggccatggaagatggtccgctggtcagcaccctgcctgcg cctcaaaataccgccgggaaggaactctattttaagaacctctcaaaacgaaacaagcaa atcatggagaagaatggaaataaccgaaagctgcgggtttgtgttgctacttgtaaccgt gcagattattctaaacttgccccgatcatgtttggcattaaaaccgaacctgagttcttt gaacttgatgttgtggtacttggctctcacctgatagatgactatggaaatacatatcga atgattgaacaagatgactttgacattaacaccaggctacacacaattgtgaggggagaa gatgaggcagccatggtggagtcagtaggcctggccctagtgaagctgccagatgtcctt aatcgcctgaagcctgatatcatgattgttcatggagacaggtttgatgccctggctctg gccacatctgctgccttgatgaacatccgaatccttcacattgaaggtggggaagtcagt gggaccattgatgactctatcagacatgccataacaaaactggctcattatcatgtgtgc tgcacccgcagtgcagagcagcacctgatatccatgtgtgaggaccatgatcgcatcctt ttggcaggctgcccttcctatgacaaacttctctcagccaagaacaaagactacatgagc atcattcgcatgtggctagggatgcagggatggtttaatgtatacaagtcagtaaatgtg atggaccacataaacagaattaaaaatcacatgatcatttcagtagatgcagaaaaagca ttaaacaaaatccagcatccctttatgattaaaactctcagcaaaatcagcatacaaggg acataccttaatgtaataaaagccatctatgacaaacccacagctaatgtaatactgaat ggggaaaagttgaaagcattccctctgagaattggaacaagacaaggatgcccactctca ccactcctcttcaacatggtactggaagtcttagccagagcaatcagacaagagaagaaa agggcatccaaatcagtaaagaggaagtcaaaccgtcactctttgctgacaatatga >gi568815589r:36239755_36476129|GENSCAN_predicted_peptide_2|211_aa MPGHQLLLTTPQGPCAALASPVMGSTYQHPASLGCPGHLKESGPPAALEGGSGKASFTLQ GACLLNRKAGGFLRLQSEPAEGKPREERKTLNMQKVKEIIDRAKSPLKEEDAKPKPNPDQ GHNSSILWRLREVRLSIEENLEANRGWFIRLKERSQLYNIKVQGEAASADVEAAASYPED LAKITDEGGCAKQQIFNVDKTNSFLLDEEAM >gi568815589r:36239755_36476129|GENSCAN_predicted_CDS_2|636_bp atgcccggtcaccagctgctcttaaccacccctcagggaccttgtgctgctcttgcttct ccagtcatgggctctacctaccagcatccggcctccctgggctgcccaggtcatttgaag gaatctgggcctccagcagcacttgagggtggctctgggaaagcatccttcactctgcag ggcgcctgcttgctgaacaggaaagccgggggctttctgaggctgcagtcagaacctgct gaggggaagcctcgggaagaaaggaaaacactgaacatgcaaaaggtgaaggaaataatt gaccgagccaagtcccccttgaaggaggaggatgcgaagccaaagcctaatccagaccaa ggccataactcttcaattctgtggaggctgagagaggtgaggttatctatagaagaaaac ttggaagctaacagaggctggttcatcagattgaaggaaagaagccagctctataacata aaagtgcaaggtgaagcagcaagtgctgatgtagaagctgcagcaagttatccagaagat ctggctaagatcactgacgaaggcggctgcgctaaacaacagattttcaatgtagataaa acaaacagctttctcttggatgaagaggccatgtag >gi568815589r:36239755_36476129|GENSCAN_predicted_peptide_3|157_aa MSSKEKFKFGEMAKADEVCYDREMKDYGPAKGGKKKDPNAPKRPPSGFFLFCSEFRPKIK STNPGISIGDVAKKLGEMWNNLNDSEKQPYVTKAAKLKEKYEKDVVDCKLKGKFYGAKRP AKVARKKVEEEDEEDEEEEEKEKDEEDEQRNCLSVSL >gi568815589r:36239755_36476129|GENSCAN_predicted_CDS_3|474_bp atgtccagtaaagagaaatttaaatttggtgaaatggcaaaggcggatgaagtgtgctat gatcgggaaatgaaggattatggaccagctaagggaggcaagaagaaggatcctaatgcc cccaaaaggccaccatctggattcttcctgttctgttcagaattccgccccaagatcaaa tccacaaaccccggcatctctattggagacgtggcaaaaaagttgggtgagatgtggaat aacttaaatgacagtgaaaagcagccttacgtcactaaggcggcaaagctgaaggagaag tatgagaaggatgttgttgactgtaagttgaaaggaaagttttatggcgcaaagcgtcct gctaaagttgcccggaaaaaggtggaagaggaagatgaagaagacgaggaggaagaagag aaggagaaggacgaggaggatgaacaaagaaactgtttatctgtctccttgtga >gi568815589r:36239755_36476129|GENSCAN_predicted_peptide_4|102_aa MLRQKQTNKQTNQAFLTKSNQKQHWILGKMKNEVHASLTMPLCNKHQLYRRRNLRLCPHC SYLPTLSISPYSTDWLGETEGYIPGAVNLMVKPSRSSRAASC >gi568815589r:36239755_36476129|GENSCAN_predicted_CDS_4|309_bp atgctcaggcaaaaacaaacaaacaaacaaacaaaccaggcttttctcacaaaatcaaat cagaagcagcactggattctaggaaaaatgaaaaatgaagtgcatgcttcacttactatg cccctgtgtaataaacatcagctttaccgaagaagaaacctgaggctgtgtccacactgc agttatcttcctacactctccatttctccttattctacagattggctgggggaaactgaa ggatatattcctggtgcagtaaacctgatggtcaaaccttcccggagctccagggcagcc agctgctag >gi568815589r:36239755_36476129|GENSCAN_predicted_peptide_5|188_aa MDADLAPLPRASADRVIMSCADSPEAELALQISSRERAPVRLVESTPFHPGKGKNENLAS SQQEEGVERLAPWLIRFRKGFLEEGIPELSQGRRQRREEEDFQAEGNVGPSVSRGTGNRL TLPECSMPGKGVLAAKAGAIIRSRLRRLLLRRGASNKIGLCPVAGESHLRVWSYGWESGV KGPRSGCI >gi568815589r:36239755_36476129|GENSCAN_predicted_CDS_5|567_bp atggacgctgacctggccccactcccgagggcctcagcggatagagtcattatgagctgc gccgatagcccggaagcggaactagcgttgcagatcagcagcagggagcgagcccctgtg aggttagtggaaagcacacctttccacccagggaagggaaaaaatgagaacttggcaagc tcacaacaggaggagggtgtagagcgtcttgctccctggcttataagattcaggaaaggt ttcctggaggaggggataccagaattgagtcaggggagaagacagaggagggaggaagag gacttccaggcagaggggaatgtggggcccagtgtgtctcggggaactgggaacaggctg acgctgccagagtgtagtatgccgggcaagggagtcctggcagccaaggctggagccatc atcaggagcaggttaagaaggctcttgttgcgaaggggagcatctaataagattggactt tgtcctgtggcaggggagagccatttaagggtttggagctatgggtgggagtcgggggta aaggggccacgatcaggttgtatttga >gi568815589r:36239755_36476129|GENSCAN_predicted_peptide_6|580_aa MKLLEHTVELKVLSVLRNRTSKSVASKSIGYSRPSSGGLLPQVFSPGPALKNAIRDFEYP CSTLSKSFQRTLLPENKSQGNRFVLGPVIELFSKERISELTPAEKVTNRYLGNRKDVVFL GCAVLPPTVFSEDSPSPKRQRLSHSVFDYTSASPAPSPPMRPWEMTSNRQPPSVRPSQHH FSGERCNTPARNRRSPPVRRQRGRRDRLSRHNSISQDENYHHLPYAQQQAIEEPRAFHPP NVSPRLLHPAAHPPQQNAVMVDIHDQLHQGTVPVSYTVTTVAPHGIPLCTGQHIPACSTQ QVPGCSVVFSGQHLPVCSVPPPMLQACSVQHLPVPYAAFPPLISSDPFLIHPPHLSPHHP PHLPPPGQFVPFQTQQSRSPLQRIENEVELLGEHLPVGGFTYPPSAHPPTLPPSAPLQFL THDPLHQEVSFGVPYPPFMPRRLTGRSRYRSQQPIPPPPYHPSLLPYVLSMLPVPPAVGP TFSFELDVEDGEVENYEALLNLAERLGEAKPRGLTKADIEQLPSYRFNPNNHQSEQTLQI VLAQFAELMLQKCIGIQNDQPKKHKFSLGVPHHMYIRTIH >gi568815589r:36239755_36476129|GENSCAN_predicted_CDS_6|1743_bp atgaagctgctggagcatacagttgagctcaaggtgctgtctgttctccggaacagaaca agcaagagtgtggcttcaaagagtataggttacagcaggccatctagtggaggcctgttg cctcaggtgttctctcctggaccagctttgaagaatgccatccgggactttgagtacccc tgctccacactgtcaaaatcctttcagagaactctgctcccagaaaacaaaagccaagga aacagatttgttttaggcccagtgatagaactcttctcaaaggaaaggatttcggaatta acaccagcagaaaaagttaccaataggtatttgggaaacaggaaggacgtggtgttcttg ggctgtgctgttttgcctccaacagtttttagtgaagatagtccaagtcctaagagacag cgcctctctcattcagtctttgattatacatcagcatcaccagctccctcaccaccaatg cgaccatgggagatgacatcaaataggcagcccccttcagttcgaccaagccaacatcac ttctcaggggaacgatgcaacacacctgcacgcaacagaagaagtcctcctgtcaggcgc cagagaggaagaagggatcgtctgtctcgacataattccattagtcaagatgaaaactat caccatctcccttacgcacagcagcaagcaatagaggagcctcgagccttccaccctccg aatgtatctccccgtctgctacatcctgctgctcatccaccccagcagaatgcagtcatg gttgacatacatgatcagctccatcaaggaacagtccctgtttcttacacagtaacaaca gtggcaccacatgggattccactctgcacaggccagcacatccctgcttgtagtacacag caggtcccaggatgctctgtggttttcagtggacagcacctccctgtctgtagtgtgcct cctccaatgcttcaggcatgttcagttcagcacttaccagtaccatatgctgcattccca ccccttatttctagtgatccatttcttatacatcctcctcacctttctccccatcatcct cctcatttgccaccaccaggccagtttgtccctttccaaacacagcaatcacgatcgcct ctgcaaaggatagaaaatgaagtggaactcttaggagaacatcttccagtaggaggtttt acttaccctccatcagcccaccccccaacattacctccatcagctcccttgcagttctta acacatgatcctttgcatcaggaggtgtcctttggagtaccttatcctccatttatgcct cggaggcttacaggacgtagtagataccgatcccagcagccaataccacctcccccttat catcccagcttactgccatatgtgttatcaatgcttccagtgccacctgcagtgggccca actttcagctttgaattagatgtagaagatggagaagtagaaaattacgaggccctgtta aacctggcagagcgactgggagaggcaaagcctcgtggactgactaaagcagatattgaa caacttccttcttatcggttcaatcctaacaaccaccagtcagaacagactttgcaaatc gtacttgcccaatttgccgagctgatgcttcagaagtgcatcgggattcagaatgaccaa cctaagaagcacaaatttagtttgggtgttcctcatcacatgtatatacggactatccat tga >gi568815589r:36239755_36476129|GENSCAN_predicted_peptide_7|215_aa MGAAAWKWGLDVGEGVLVRAEKVRNHSDTVWMGAKHYGLGLMVCGPEVDQGDLVKMPISG KASGDGEESFEYQEKVYEFVLGCIQSLPGLHAAHGSRVGQAYLEYRIYLLRILDYFEGSV GASFQCQSIHHDHICHDDSDKLTRGSESVHVTTSLPAQPAFEKTSALNKTTKDPHRVHFI PLLPPSEQMPQLRDRDTDHITGLFADTPQYQPGPQ >gi568815589r:36239755_36476129|GENSCAN_predicted_CDS_7|648_bp atgggggctgctgcatggaagtgggggctagatgttggtgaaggtgtccttgttcgagca gagaaggtaagaaaccactcagacactgtgtggatgggtgcgaagcattatggattgggg cttatggtttgtggccctgaggtagaccagggggatcttgttaaaatgccgatttctggc aaggcatcaggtgatggtgaagaaagttttgagtaccaggagaaagtttatgaatttgtg ttgggctgcattcaaagccttcctgggctgcatgcagcccatgggtcacgggttggacaa gcttatttagagtatcggatctaccttttgagaatcttggactactttgaaggtagtgtt ggtgcttcttttcagtgccagtccatccatcacgaccacatttgtcatgatgacagtgac aagctaaccagaggttctgagtctgtccatgttacaacttcactgccagcacaaccagca tttgagaaaaccagtgcactaaacaaaactaccaaggaccctcacagggtccacttcatt cccctgctacctccatcggagcagatgccacagctgagagaccgggatactgatcacatt acaggactctttgcagacactccccagtaccagcccggaccccagtag