GENSCAN 1.0 Date run: 6-Nov-116 Time: 12:46:59 Sequence gi568815575f:79071008_79272006 : 200999 bp : 36.51% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1226 1371 146 1 2 79 113 130 0.995 14.24 1.02 Intr + 39762 39979 218 0 2 14 32 185 0.015 2.72 1.03 Term + 41133 41248 116 1 2 97 50 68 0.613 1.65 1.04 PlyA + 41515 41520 6 1.05 2.00 Prom + 41524 41563 40 -3.25 2.01 Init + 47375 47435 61 1 1 66 93 58 0.426 5.56 2.02 Term + 57766 57914 149 2 2 12 44 191 0.779 4.18 2.03 PlyA + 58620 58625 6 1.05 3.00 Prom + 75107 75146 40 -4.95 3.01 Init + 100001 100957 957 1 0 74 59 430 0.230 32.17 3.02 Intr + 105076 105156 81 0 0 80 94 65 0.302 5.22 3.03 Intr + 115163 115192 30 1 0 95 89 20 0.067 0.21 3.04 Term + 131273 131365 93 1 0 26 42 123 0.003 -1.65 3.05 PlyA + 131451 131456 6 1.05 4.02 PlyA - 133307 133302 6 1.05 4.01 Sngl - 138304 137429 876 1 0 49 37 305 0.535 17.22 4.00 Prom - 138397 138358 40 -6.15 5.03 PlyA - 138565 138560 6 1.05 5.02 Term - 140034 139895 140 1 2 37 43 152 0.499 2.74 5.01 Init - 143637 143490 148 0 1 83 47 91 0.788 4.80 5.00 Prom - 143813 143774 40 -3.65 6.02 PlyA - 144175 144170 6 1.05 6.01 Sngl - 148394 147774 621 2 0 58 47 208 0.962 9.64 6.00 Prom - 153388 153349 40 -3.65 7.06 PlyA - 154969 154964 6 1.05 7.05 Term - 166124 165811 314 0 2 31 54 251 0.350 10.18 7.04 Intr - 167938 167813 126 2 0 59 96 112 0.853 8.83 7.03 Intr - 168218 168086 133 2 1 15 69 114 0.320 1.50 7.02 Intr - 172189 172101 89 2 2 97 94 17 0.249 1.97 7.01 Init - 181080 180915 166 0 1 86 75 84 0.818 6.74 7.00 Prom - 181210 181171 40 -2.75 8.00 Prom + 183188 183227 40 -5.15 8.01 Init + 187935 188120 186 2 0 66 99 131 0.684 11.11 8.02 Intr + 194471 194601 131 1 2 15 65 134 0.052 2.37 8.03 Intr + 199561 199630 70 1 1 63 110 39 0.182 1.87 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:79071008_79272006|GENSCAN_predicted_peptide_1|159_aa MRKNQHKNAENFKNQNAFSPPNNRNSSPTRAQNWTENEIDDLTAVGFRRKSVQFKEEGAD LKVYIRFYAPACKSSSGMGGPMKTCKSKEREKFSAVGEMQGDASTLEFCNTGAKFAEVVA QYNSSRPPSGTSILSEQLIILTYNMVWITGQPYYTCIAH >gi568815575f:79071008_79272006|GENSCAN_predicted_CDS_1|480_bp atgaggaaaaaccagcacaaaaatgctgaaaatttcaaaaaccagaatgccttttctcct ccaaataatcgcaactcctctccaacaagggcacaaaactggacagagaatgagattgat gatttgacagcagtaggcttcagaaggaagtctgtacagtttaaggaggaaggtgctgat ctgaaggtatatattaggttttatgccccagcatgtaaatcctcaagtggaatgggaggg cctatgaagacttgcaaaagtaaagaaagggagaagttctcagctgtgggagaaatgcag ggtgatgcatcaacgctggaattttgcaacactggggccaaatttgctgaggttgtggcc caatataatagctctcggcccccaagtggaacttctattctgtcagagcagctgattata ctaacttataacatggtttggatcactggtcaaccctactatacctgcatagcccactga >gi568815575f:79071008_79272006|GENSCAN_predicted_peptide_2|69_aa MKKRKKSSTSHIMVKMLKAKGTSGGIPEEDIVIIGDDSSLHVIVPKDLPVGQEVEVEDND INDLDLLEV >gi568815575f:79071008_79272006|GENSCAN_predicted_CDS_2|210_bp atgaagaaaagaaagaagtcctcaaccagccacatcatggtcaaaatgttgaaagccaaa ggtacttcaggaggtattccagaagaagacattgttatcataggagatgacagctccctg catgttattgtccctaaagatcttccagtgggacaagaggtggaggtggaagacaatgat attaatgatctcgaccttttggaggtctag >gi568815575f:79071008_79272006|GENSCAN_predicted_peptide_3|386_aa MPANYTCTRPDGDNTDFRYFIYAVTYTVILVPGLIGNILALWVFYGYMKETKRAVIFMIN LAIADLLQVLSLPLRIFYYLNHDWPFGPGLCMFCFYLKYVNMYASIYFLVCISVRRFWFL MYPFRFHDCKQKYDLYISIAGWLIICLACVLFPLLRTSDDTSGNRTKCFVDLPTRNVNLA QSVVMMTIGELIGFVTPLLIVLYCTWKTVLSLQDKYPMAQDLGEKQKALKMILTCAGVFL ICFAPYHFSFPLDFLVKSNEIKSCLARRVILIFHSVALCLASLNSCLDPVIYYFSTNEFR RRLSRQDLHDSIQLHAKSFRPVLCPPKVRPETTSLSLPPYFDGVVEDHCGVDGINSDAVV VRHMRINQCDTSHQQKEGQKPYDDFN >gi568815575f:79071008_79272006|GENSCAN_predicted_CDS_3|1161_bp atgcctgctaattacacgtgtaccaggccagatggagacaatacagattttcgatacttt atttatgcagtgacatacactgtcattcttgtgccaggtctcatagggaatatattagcc ctgtgggtattctatggttatatgaaagaaacaaaacgagctgtgatatttatgataaac ttagccattgctgacttactacaagttctttccttgccactgaggatcttctactacttg aatcatgactggccatttgggcctggtctctgcatgttctgtttctacctgaagtatgtc aacatgtatgcaagcatctacttcttggtctgcatcagtgtgcgacgattttggtttctc atgtacccctttcgcttccatgactgcaaacagaaatatgacctgtacatcagcattgct ggctggctgatcatctgccttgcctgtgtactctttccactcctcagaaccagtgatgat acctctggcaataggaccaaatgctttgtggatcttcctaccaggaatgtcaacctggcc cagtccgttgttatgatgaccattggcgagttgattgggtttgtaactccgcttctgatt gtcctatattgtacctggaagacggttttatcactgcaagataaatatcccatggcccaa gatcttggagagaaacagaaagccttgaagatgattctaacctgtgcaggggtattccta atttgctttgcaccttatcatttcagttttcctttagatttcctggtgaagtccaatgaa attaaaagctgcctagccagaagggtgattctaatatttcattctgtggcattgtgtctt gctagtctgaattcatgtcttgacccagtcatatactacttttccactaatgagttccga agacggctttcaagacaagatttgcatgacagcatccaactccatgcaaaatcctttaga cctgttctgtgccctcctaaagtccgtccagaaacaacttcactgtctttaccaccatat tttgatggagtggtggaggaccattgtggggtagatggcatcaattcggatgcagtggtg gttcgacatatgcgaatcaatcagtgtgatacatcacatcaacagaaagaaggacaaaaa ccatatgatgacttcaactga >gi568815575f:79071008_79272006|GENSCAN_predicted_peptide_4|291_aa MGDFNTPLSTLDRSMRQKVNKDTQELNSALHQVDLIHIYRTLHPKSTEYTFFQHHTTPIP KLTTELEALLSKCKRTEIITNCLSDHSAIKLELRIKKLTQNHSTTWKLNNLLLNDYWVHN KMKAEIKMFFETNEKKDTTHQNLCDAFKALCRGKFLALNAHKRKQERSKIDTLTSQLKEL EKQEQTHSKASRRQEITKIRAELKEIETKKPFEKINESRSWFFERINKIDRPLASLIKKK REKNQIDVIKNDKGDITTNPTEIQITIREYYKPLYANKLENLEEMDKFLDT >gi568815575f:79071008_79272006|GENSCAN_predicted_CDS_4|876_bp atgggagactttaacaccccactgtcaacattagacagatcaatgagacagaaagttaac aaggatacccaggaattgaactcagctctgcaccaagtggacctaatacacatctacaga actctccacccaaaatcaacagaatatacattttttcagcaccacaccacacctattcca aaattgaccacagagttggaagctctcctcagcaaatgtaaaagaacagaaattataaca aactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaaactcactcaa aaccactcaactacatggaaactgaacaacctgctcctgaatgactactgggtacataac aaaatgaaggcagaaataaagatgttctttgaaaccaatgagaaaaaagacacaacacac cagaatctctgtgacgcattcaaagcactgtgtagagggaaatttctagcactaaatgcc cacaagagaaagcaggaaagatccaaaattgacactctaacatcacaattaaaagaacta gaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaaatcaga gcagaactgaaggaaatagagacaaaaaaaccctttgaaaaaattaatgaatccaggagc tggttttttgaaaggatcaacaaaattgatagaccgctagcaagtctaataaagaaaaaa agagagaagaatcaaatagatgtaataaaaaatgataaaggggatatcaccaccaatccc acagaaatacaaataaccatcagagaatactacaaacccctctatgcaaataaactagaa aatctagaagaaatggataaattcctggacacataa >gi568815575f:79071008_79272006|GENSCAN_predicted_peptide_5|95_aa MDEAGNHHYQQNITGTEKHTSHVLTHKWELNNENTWTQGGEHHTPGPLGGTIRQQHSRFT KIRCSADTADDTQANRVWSGPLENSNRPAAEVPVC >gi568815575f:79071008_79272006|GENSCAN_predicted_CDS_5|288_bp atggatgaagctggaaaccatcattatcagcaaaatatcacaggaacagaaaaacataca tcgcatgttctcactcataagtgggagttgaacaacgagaatacatggacacagggaggg gaacatcacacaccagggccattggggggaacgatcagacagcagcattcgcggttcacg aaaatccgctgttctgcagacactgctgatgatacccaggcaaacagggtctggagtgga cctctagaaaactccaacagacctgcagctgaggttcctgtctgttag >gi568815575f:79071008_79272006|GENSCAN_predicted_peptide_6|206_aa MELKNTARDLPETYTIINSQIDQAEERISDIKDQLNEIKCEEKIRQKKIMKRSEQILQEI WDGVRRPNLSFIGIPESEGWKGTKLETFQNIIQENFHNLARQANIQIQEIQRTPKRYSSR RATPRHIIIRFTKVEMKEKMLREAREKGQVTYKGKPIRSRADLSAETLKARRYWGPIFNM LKEKNFQSRISYPAKLSFISEGEIKS >gi568815575f:79071008_79272006|GENSCAN_predicted_CDS_6|621_bp atggagctgaaaaacacagcacgagatcttcctgaaacatacacaattatcaatagccaa attgatcaagcggaagaaaggatatcagacattaaagatcaacttaatgaaataaagtgt gaagaaaagattagacaaaaaaaaataatgaaaaggagtgaacaaatcctccaagaaata tgggacggtgtgagaagaccaaacctaagtttcattggtatacctgaaagtgaggggtgg aagggaaccaagttggaaacatttcaaaatattatccaggagaacttccacaacctagca agacaggccaacattcaaattcaggaaatacagagaacaccaaaaagatactcctcgaga agagcaactccaagacacataatcatcagattcaccaaggttgaaatgaaggaaaaaatg ttaagggaagccagagagaaaggccaggttacctacaaagggaagcccatcagatcaaga gcagatctctctgcagaaaccctaaaagccaggagatactgggggccaatatttaacatg cttaaagaaaagaattttcaatccagaatttcatatccagccaaattaagcttcataagc gaaggagaaataaaatcctga >gi568815575f:79071008_79272006|GENSCAN_predicted_peptide_7|275_aa MDEAGSHHSQQTNTGTENQTPHILTHKWELNNENTWTQGGEHYTLGPFGEWRAREECFCL RKKKMNYLAFPMETACGMLERIMSKKGGTHGLDGKGQIMAGFISCWLKNPWAMNNQQQYT DSIYTHPEPGSSSAMEGKELSGILAFLSPGLDSWTAYLDLPWPIEKHTALKAVSAPAVAK SGQDTAQAIASEGASPSASQLLHGFGPASAQNTTIEVWEPPPRFQGMYGNSWMSSRSLVQ GWSPHEEPLLGPYRREIWSWSPHTDSPLWHCPVEL >gi568815575f:79071008_79272006|GENSCAN_predicted_CDS_7|828_bp atggatgaagctggaagccatcattctcagcaaactaacacaggaacagaaaaccagaca ccacatattctcactcataagtgggagttgaataatgagaacacatggacgcagggaggg gaacattacacgctggggccttttggggagtggagggcaagggaagaatgtttctgtctc agaaaaaagaaaatgaactaccttgcatttccaatggaaacagcttgtggaatgttagaa agaataatgagtaagaagggtggaacccacggccttgatgggaagggccaaatcatggca ggattcatcagctgctggctaaagaacccttgggccatgaataaccagcagcaatacaca gatagtatatatactcatccggagccaggttccagttcagccatggaaggtaaagaatta agtgggatcttggcattcctgagtccaggcctagactcttggacagcatatctggacctc ccgtggcccatagagaaacacactgcccttaaggctgtttcagctccagctgtggctaaa agtggccaagatacagctcaggccattgcttcagaaggtgcaagccccagtgcttcacag ctgctacatggttttgggcctgccagtgcacagaatacaacaattgaggtttgggaacct ccacctagatttcaggggatgtatggaaattcctggatgtccagcagaagtctggtgcag gggtggagccctcatgaagaacctctgttagggccgtatagaagggaaatttggagttgg agcccccacacagactctccactttggcactgcccagtggaactgtga >gi568815575f:79071008_79272006|GENSCAN_predicted_peptide_8|129_aa MVKGIVSLWREVGFWSLQPVKSAYSPEHSELNYSTKGKAVTQQLSVFEGKFEGGPPSLYC HLEDTLSFGRRITLVENDLGRGDAFGTSKDSVLKDIALSGGWRRRRVSTSLTEALLKPVA PSSDPSPNR >gi568815575f:79071008_79272006|GENSCAN_predicted_CDS_8|387_bp atggtcaaaggcattgtgtctttgtggagagaggtaggtttctggagcctgcagccagta aaatctgcttatagccctgagcactcagaactaaactattccaccaagggaaaagcagta acacagcagctaagtgtgtttgaaggaaaatttgaaggtggccctcctagtctgtattgc cacttggaagacactctttcatttggaaggcgaataactcttgttgaaaatgacttaggc agaggtgacgcttttggaacaagtaaggactcagtactaaaagacatagcactgagtgga ggatggagaagaaggagggttagcaccagcttgacagaggcgctcctaaagcctgtggcg ccctcttctgacccctcacccaacagg