GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:41:42 Sequence gi568815578r:10305250_10513514 : 208265 bp : 39.72% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 3407 3402 6 1.05 1.03 Term - 3755 3577 179 0 2 5 44 262 0.941 10.37 1.02 Intr - 14927 14748 180 0 0 64 80 73 0.077 3.02 1.01 Init - 17367 17361 7 0 1 86 98 0 0.135 1.89 1.00 Prom - 19641 19602 40 -6.05 2.00 Prom + 26156 26195 40 -6.65 2.01 Init + 27133 27247 115 0 1 48 111 60 0.750 4.72 2.02 Intr + 36866 37027 162 0 0 -9 56 153 0.334 1.43 2.03 Term + 37182 37309 128 1 2 91 45 69 0.383 0.36 2.04 PlyA + 38617 38622 6 1.05 3.07 PlyA - 39535 39530 6 1.05 3.06 Term - 44213 43908 306 2 0 -2 37 384 0.886 18.43 3.05 Intr - 47493 47372 122 1 2 110 23 60 0.359 0.99 3.04 Intr - 55704 55552 153 1 0 96 37 105 0.004 5.32 3.03 Intr - 58491 58325 167 2 2 58 55 100 0.008 2.38 3.02 Intr - 59139 59005 135 2 0 43 68 143 0.004 6.56 3.01 Init - 70126 70065 62 1 2 77 65 75 0.070 4.87 3.00 Prom - 72945 72906 40 -5.95 4.00 Prom + 81039 81078 40 -3.95 4.01 Init + 81136 81228 93 0 0 84 36 65 0.793 1.33 4.02 Intr + 81529 82033 505 0 1 59 23 644 0.656 46.62 4.03 Term + 82070 82245 176 0 2 -11 32 259 0.839 7.34 4.04 PlyA + 82429 82434 6 1.05 5.00 Prom + 83235 83274 40 -3.45 5.01 Init + 85252 85343 92 0 2 90 69 77 0.835 6.01 5.02 Term + 93608 93755 148 2 1 83 49 90 0.787 0.99 5.03 PlyA + 94355 94360 6 1.05 6.05 PlyA - 94690 94685 6 1.05 6.04 Term - 100438 99998 441 1 0 81 48 375 0.991 27.07 6.03 Intr - 102477 102367 111 0 0 62 95 36 0.691 1.36 6.02 Intr - 103554 103379 176 1 2 41 100 17 0.803 -3.06 6.01 Init - 108265 107281 985 1 1 47 98 280 0.950 18.38 6.00 Prom - 114319 114280 40 -4.75 7.00 Prom + 117885 117924 40 -6.05 7.01 Init + 123080 123122 43 1 1 46 84 34 0.388 -0.57 7.02 Intr + 128297 128681 385 0 1 7 63 224 0.490 4.78 7.03 Term + 128887 129154 268 1 1 60 38 270 0.903 13.18 7.04 PlyA + 129560 129565 6 -0.45 8.06 PlyA - 129603 129598 6 1.05 8.05 Term - 130033 129786 248 2 2 93 43 137 0.372 4.67 8.04 Intr - 138502 138208 295 1 1 24 7 213 0.296 2.46 8.03 Intr - 164494 164372 123 1 0 55 99 26 0.031 0.26 8.02 Intr - 164885 164676 210 2 0 40 33 142 0.061 2.09 8.01 Init - 170859 170713 147 0 0 62 93 132 0.909 11.24 8.00 Prom - 188609 188570 40 -2.45 9.03 PlyA - 188774 188769 6 1.05 9.02 Term - 204346 204055 292 0 1 50 43 168 0.424 2.33 9.01 Init - 205831 205719 113 1 2 43 87 105 0.197 5.63 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 58641 59192 552 2 0 49 39 313 0.929 18.36 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:10305250_10513514|GENSCAN_predicted_peptide_1|121_aa MPDMAMGKEGAEERDIRRWHRQDAVTDLMRGWFLASVPRQRSLEGGGGEHVMVVHDEFEV PVERGGGGGRGDGSDDDDVGRGDGWSGGGGDGGSGSGGGDGGDDGDYNGGGNKGGGNGHA G >gi568815578r:10305250_10513514|GENSCAN_predicted_CDS_1|366_bp atgccagatatggccatgggaaaggagggggcagaggaaagagatattcggaggtggcat agacaagacgcggtgactgatttaatgagggggtggttcctggcttcagtcccaaggcag agaagtttagaagggggcggtggtgagcatgtcatggtggttcatgatgagtttgaagtg cctgtggagagaggtggaggaggtggcagaggtgatggaagtgatgatgatgatgttggc agaggagatggatggtcaggtggtggtggtgatggtggcagtgggagcggtggtggtgat ggaggtgatgatggtgattataatggtggtggtaataaaggtggaggtaatgggcatgct ggttaa >gi568815578r:10305250_10513514|GENSCAN_predicted_peptide_2|134_aa MSTRNGLEKAQKRGREETSGTMSTKGRETQSFRMEKKVGEREERRAAALLEAQTKSSQNQ GYDTQFRALPFLVSPRFWAPTGSPVPAVEATCGVGSRLGAKAECSLPGQVGGTSPVGPSK TQAKVPLATDVPSW >gi568815578r:10305250_10513514|GENSCAN_predicted_CDS_2|405_bp atgagcacaaggaacggcctggaaaaggcacagaaaagaggcagagaggaaacttccggc actatgtccacaaaaggcagggaaacacaaagcttcagaatggaaaaaaaggttggagaa agagaggagagaagagctgctgcccttctggaagcccagactaagagctcccagaaccag ggctatgacacccaatttagggctctgccattcctggtgtctccaagattctgggcacca acaggttccccagtgcctgcagtggaagccacttgtggtgtgggatccaggctaggagca aaagccgaatgcagcctgccaggccaagtgggtggaacaagtccagtgggcccaagcaaa actcaggcaaaggtgccattggccacagatgttcccagctggtga >gi568815578r:10305250_10513514|GENSCAN_predicted_peptide_3|314_aa MWKKKIYGQRKESEVRKTEVSIKIHASGFDKLLESMFCILLVVEAFSLQKVVKMLEEVVA GWREVRKVDVELFANFPCNCKRISFEDCSQLVVVNFRWSATMLLIFKALISFAKLFEPPL YCLITLINYHLLKGSLEHRTSPGLRHQPVLSLSLAGSVASEFPAKVLKMFENFPLITKPT PLAGDMPTNPKALVTQRKQPFFRAFITCNHAFYRQPKYPRKSTPRRNKLGHYVIIKFLLT TESPMKKTEDNSTPVFIVDVKANKHQIKQAVKKLCDIDVAKVNTLVRTDREKKAHILLGP DYDALDVANKIGII >gi568815578r:10305250_10513514|GENSCAN_predicted_CDS_3|945_bp atgtggaagaagaagatttatggacagagaaaagaaagtgaggtacggaaaacagaagtg agcataaaaatccatgcttcgggatttgacaaactcttggaaagcatgttctgcatcctg ctggtagtggaagcattttccctgcaaaaagttgtcaagatgctggaagaagtggtagct ggttggcgagaggtcagaaaggtcgatgttgagttatttgccaacttcccgtgtaattgt aagaggatcagctttgaagattgctctcaattggttgttgtaaatttccgatggtcagcc actatgctcctcatcttcaaggctctcatctcctttgcaaagctttttgaaccaccactg tactgtctgataactttaataaactaccatcttcttaaaggctctctggagcacaggact tcacctgggctccggcatcaaccagtattaagtttatccttagcaggttcagtggcctct gagtttcctgctaaggtgctgaaaatgtttgaaaactttccactgatcacaaaacccaca ccactcgctggtgatatgcccacgaatcccaaggctttagtcacacaaagaaaacagccg ttcttccgcgctttcataacatgtaaccatgccttttacaggcagcccaaatatcctcga aagagcacccccaggagaaacaagcttggccactatgtcatcatcaagtttctgctgacc actgagtctcccatgaagaagacagaagacaacagcacacctgtgttcattgtggatgtt aaagccaacaagcaccagatcaaacaggctgtgaaaaagctctgtgacattgatgtggcc aaggtcaacaccctggttaggactgatagagagaagaaggcacatattcttctgggtcct gattatgatgctttggatgttgccaacaaaattgggatcatctaa >gi568815578r:10305250_10513514|GENSCAN_predicted_peptide_4|257_aa MSLDVMIELYRGNIWNDAKTVNVITTAHFSKVSAAPPKRSNKDPPFAAQASHHLVPPEII QSLLMTVANNLVTDKNSGEVMTVGINAIKEKTARCSLAMTEELLQDLAQYKTHKDKNVMM CARTLIQLFRTLNPQMLQKKFQGKPTEASIEARVQEYGELDAKDYIPGAEVLEGEKEENA ENDEDGWENTSLSEEQDADEQQEISKKLNRMPMEEWKAKAAAISTSQVLTQEDFQKIRMA QLRKELDAAPGKSQKRK >gi568815578r:10305250_10513514|GENSCAN_predicted_CDS_4|774_bp atgtctttagatgtaatgattgaactctacagagggaacatctggaatgatgccaaaact gtcaatgttatcacaactgcacatttctctaaggtttctgcagccccaccaaagagaagt aacaaagatcctccgtttgctgcacaagcatctcatcacctagtacccccagagattatt caatcattgcttatgactgtggcaaacaatttggttaccgacaagaattctggagaagtc atgacagtaggaatcaatgctataaaagagaaaacagctcgatgttctctggccatgact gaagaacttctccaagacctggctcagtataaaacacacaaggataagaatgtaatgatg tgtgctagaactttgattcagctcttccgaacactgaatcctcagatgctgcagaagaaa ttccagggtaagcctactgaggcctccatagaagcaagagtacaagaatatggagaatta gatgctaaagattacattccaggagcagaagttctggaaggtgagaaagaagagaatgct gaaaatgatgaagatggatgggaaaataccagtctcagtgaggagcaggatgctgatgaa cagcaagaaatctccaagaagctgaacaggatgcccatggaggagtggaaagccaaagct gcagccatcagcaccagccaagttttaactcaggaagacttccagaaaatccgcatggcc caactgagaaaagaacttgatgctgcccctgggaaatcccagaagaggaaataa >gi568815578r:10305250_10513514|GENSCAN_predicted_peptide_5|79_aa MAPASAPGERLRLLLLMAEGKGKLACSGITCPGEGFTSWQFFTICGELNDVPPKDMSKDL SPATCECDLIWKQGLCKCN >gi568815578r:10305250_10513514|GENSCAN_predicted_CDS_5|240_bp atggcgccagcatctgctcctggtgagagactcaggctgcttctactcatggcagaaggc aaagggaagctggcctgttcaggaatcacgtgccctggagaaggatttacatcttggcag ttttttaccatctgtggtgagttgaatgatgtccccccaaaagatatgtccaaagaccta agtcctgctacctgtgaatgtgaccttatttggaaacagggtctttgcaagtgcaattaa >gi568815578r:10305250_10513514|GENSCAN_predicted_peptide_6|570_aa MSRLEAKKPSLCKSEPLTTERVRTTLSVLKRIVTSCYGPSGRLKQLHNGFGGYVCTTSQS SALLSHLLVTHPILKILTASIQNHVSSFSDCGLFTAILCCNLIENVQRLGLTPTTVIRLN KHLLSLCISYLKSETCGCRIPVDFSSTQILLCLVRSILTSKPACMLTRKETEHVSALILR AFLLTIPENAEGHIILGKSLIVPLKGQRVIDSTVLPGILIEMSEVQLMRLLPIKKSTALK VALFCTTLSGDTSDTGEGTVVVSYGVSLENAVLDQLLNLGRQLISDHVDLVLCQKVIHPS LKQFLNMHRIIAIDRIGVTLMEPLTKMTGTQPIGSLGSICPNSYGSVKDVCTAKFGSKHF FHLIPNEATICSLLLCNRNDTAWDELKLTCQTALHVLQLTLKEPWALLGGGCTETHLAAY IRHKTHNDPESILKDDECTQTELQLIAEAFCSALESVVGSLEHDGGEILTDMKYGHLWSV QADSPCVANWPDLLSQCGCGLYNSQEELNWSFLRSTRRPFVPQSCLPHEAVGSASNLTLD CLTAKLSGLQVAVETANLILDLSYVIEDKN >gi568815578r:10305250_10513514|GENSCAN_predicted_CDS_6|1713_bp atgtctcgtttggaagctaagaagccatcattgtgtaagagtgaaccactgacaactgag agagtcaggaccacactttctgtcttgaaaagaattgtaacatcatgctatggcccctca ggtaggctgaagcagctgcacaatggctttggaggttacgtgtgtacaacctcacagtcc tcagctctgctcagtcaccttttggtcacacatcccattttaaagatcctgacagcctcc atacagaatcatgtgtcaagcttcagtgattgtggcttattcacagctattctttgctgc aacctgattgaaaatgttcagagattaggcttgacacccaccactgtcattagattaaat aaacatcttttgagtctttgcatcagttatctcaagtctgagacctgtggttgtcgaatc ccagtggactttagtagtactcagatcctcctttgtttggtgcgtagtatattaacaagt aaacctgcctgtatgctcaccagaaaggaaacagagcatgtcagtgctttgatcctgaga gcctttttgcttacaattccagaaaatgctgaaggccacatcattttaggaaagagttta attgtacctttaaaaggtcaaagagttatagattccactgtattacctgggatactcatt gaaatgtcagaagttcaattaatgaggctattacctatcaaaaaatcaactgccctcaag gtggcactcttttgtacaactttatccggagacacttctgacactggagaaggaactgtg gtggtcagttatggggtttctcttgaaaatgcagtcttggaccagctgcttaacctagga aggcagctaatcagtgaccacgtagatcttgtcctgtgccaaaaagttatacatccatct ttgaagcagtttctcaatatgcatcgtattattgccatagacagaattggagtgactctg atggaacccctgactaaaatgacaggaacacagcctattggatccctaggctcaatatgt cctaatagttatggaagtgtgaaagatgtgtgcactgcaaaatttggctccaaacatttt tttcatcttattcctaatgaagcaacaatctgcagcttgcttctctgcaacagaaatgac actgcctgggatgagctgaagctcacgtgtcagacggcactgcatgtcctgcagttaaca ctcaaggaaccatgggctttgttgggaggtggctgtactgaaactcatttggctgcatat atcagacacaagactcacaacgacccagaaagcattctcaaagatgatgaatgtactcaa acagaacttcaattaattgctgaagcattttgcagtgccctagaatctgttgttggctct ttagaacatgatggaggtgaaattctcactgacatgaagtatggacacctttggtcagtt caggcagattctccctgtgttgctaactggccagatttgctttcacagtgtggctgtgga ttatacaatagccaggaagaactcaactggtctttcttaagaagcacacgtcgtccattt gtgccacaaagctgccttccacatgaagctgtgggctcagccagcaacctgaccttggac tgtttgactgcaaagcttagtggcctacaggtggctgtagagacagccaatttgattttg gatctttcatatgttattgaagataaaaactaa >gi568815578r:10305250_10513514|GENSCAN_predicted_peptide_7|231_aa MNGPLVLQQTLYVPGFGYGLIGEEDEHDLRGSSLQGPQPCKPRGCFHVAGRATLHNVSNA VGGRPPPPDSLGYTVHPPDSQGHTGPLPAREGTGSHRFGVEGGSAGGSVVRRHSSSSPHR REDLLAARIRADRNIYHRQLFQRPPPPEAPETDLKQPLLPRIPTTFASREGTEHAQLCGR RRAAPTPLLRVFREAPANSGRHRKKRGAKFIQRIFQQIVETEILKERIKFK >gi568815578r:10305250_10513514|GENSCAN_predicted_CDS_7|696_bp atgaatgggcctttagtgctacagcaaacactttatgttcctggatttgggtatggtctt atcggggaggaagacgagcatgaccttagagggtcatccctacaaggtccacagccctgc aagccccgcggctgctttcacgtagctggcagggccacgctgcacaacgtctctaacgcc gtcggagggcggcccccacctcccgacagcctggggtacaccgtgcaccctccagacagt caggggcacacggggcccctcccagcgagggaggggacggggagtcacaggttcggcgta gaaggcggcagcgcaggtgggagcgtggtgaggcgccactcctccagctctccccaccgg cgggaagacctcctcgccgcccgcatccgtgccgaccgcaacatttaccatcggcagcta tttcagaggccgccaccgccagaggccccagaaacagatctcaagcagccgctgctgccg cggatcccgacaaccttcgcgtcgcgcgagggcacggagcacgcgcagctctgcggtcga cgccgggccgcccccactccgcttctccgggttttccgtgaagctccagccaacagcggc cgtcaccggaaaaaacggggagcgaagtttattcagcgtatattccagcaaattgttgaa actgaaattttaaaagaacgcataaaatttaaataa >gi568815578r:10305250_10513514|GENSCAN_predicted_peptide_8|340_aa MHTDNKTHCQVVSKAVNYDHRHPQEPREASQGSDYSSYLLDAPEASLQTPLLLGPGRNWG PKAFKGQRGSLLSDHHLQVPESGKEVLKRQSCVTLRMLSLGPLWIVFVVGTVFPRCLVVI TAGRYFIKHWNNDAPYSPQRESKVLTEFIYLVRDPEFTFLAPPPTLGVTIQHEICSGTQI RTISVPLPKPPSYGTGNFTPVMPPHNQLLAHVLEHAAPALPAFTHACRTVECPLLKRVEL SSETQLRCHLFLGAFYEPRSPRAFGNSAAAQTQNDLGSFAAGLRPQPGFKMAAKHFDGPW NLPKRKMASVQNIAAFKVGPKQKWRRTQAIFKGWDPNDSH >gi568815578r:10305250_10513514|GENSCAN_predicted_CDS_8|1023_bp atgcacaccgataataaaacacattgccaagttgtaagtaaagcagttaattatgaccat cgtcatccccaggaaccaagagaagcttctcaagggtcagattattccagctacctcttg gatgcccccgaggcctctctacaaactcccttactgctgggaccagggcggaactggggt cccaaagctttcaaaggacaaagaggatccttactctccgaccatcatctccaggttcct gaatctggaaaggaagtactgaagaggcagtcttgtgtcactctgagaatgctttccctg gggccactttggatagttttcgtcgttgggactgtcttccctcggtgtttggtggtgatc acagcgggaagatactttataaaacattggaacaatgatgccccctacagccctcagaga gaatccaaggtgcttactgagttcatctatttggtaagagatcctgagtttacttttctg gcccctcctccaacactgggggttacaattcaacatgagatttgctcaggaacacaaatc cgaacaatatcagtgcccctgcccaaacctccatcttatggcacaggcaacttcacacct gtaatgccaccacataaccaactcctggcacatgtccttgaacatgcagctccagctctc cctgccttcacccatgcatgccgcactgtggaatgtcctctcctaaagcgagttgagtta tcctctgagacccaactcagatgccacctgtttcttggagccttctatgaaccaaggtct ccgcgcgcgtttgggaactcagccgccgcccaaacacaaaatgatctgggcagcttcgct gcaggattgaggccacagccaggatttaagatggcagcgaaacacttcgatggcccgtgg aatctgcccaaaagaaaaatggcctctgttcagaatatcgcggctttcaaagtgggccca aaacaaaaatggcgccggacgcaggctatttttaaaggctgggaccccaatgactctcat tag >gi568815578r:10305250_10513514|GENSCAN_predicted_peptide_9|134_aa MEGQEHGTKARFSDKVTREAALSPTTGPVVLAGRVLQSESAANYFSLTGRESGLYQGDIL PHLGNEVQESDPLLYSVAKAVQLTGSDKLQPKIRAISPMVVTVPQWGRCSAGDYQLGDLS HFPFRALVSSTAKL >gi568815578r:10305250_10513514|GENSCAN_predicted_CDS_9|405_bp atggagggacaagaacacggcaccaaagccaggttctctgataaagtcacccgggaagcg gccctgtctccaacaacaggcccagtggttttggctggcagggtccttcaaagtgaatca gctgcaaactacttttctttaacaggaagagaatctggtctgtatcaaggcgatatccta ccacacttgggaaatgaggtacaggagagcgaccctctactttatagtgttgccaaggct gtccaacttactggttcagataaattacagcccaaaatcagagcaatttcacccatggtg gtcactgttccacagtggggacggtgctctgctggtgactatcaactcggtgatctcagt cattttccattcagggctttagtttcttcaactgcaaaactataa