GENSCAN 1.0 Date run: 3-Nov-116 Time: 12:09:42 Sequence gi568815595r:58328037_58533809 : 205773 bp : 44.15% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4929 5054 126 2 0 98 74 260 0.028 26.48 1.02 Intr + 37838 37888 51 1 0 101 115 29 0.199 6.00 1.03 Intr + 41395 41442 48 0 0 104 99 10 0.120 2.58 1.04 Intr + 54478 54664 187 0 1 88 107 42 0.060 5.36 1.05 Intr + 62546 62623 78 0 0 82 66 58 0.683 2.52 1.06 Intr + 63111 63184 74 1 2 73 94 13 0.802 -0.47 1.07 Intr + 63737 63811 75 1 0 111 97 -3 0.761 2.61 1.08 Intr + 66962 67066 105 1 0 77 77 78 0.983 6.01 1.09 Intr + 67622 67723 102 1 0 86 94 52 0.984 5.97 1.10 Intr + 69003 69164 162 2 0 65 101 90 0.994 8.17 1.11 Intr + 69569 69686 118 1 1 102 52 96 0.569 7.44 1.12 Intr + 71206 71341 136 2 1 9 97 41 0.129 -3.27 1.13 Intr + 75826 75874 49 1 1 90 98 39 0.400 3.78 1.14 Intr + 80888 80965 78 1 0 113 99 39 0.365 7.25 1.15 Intr + 82054 82123 70 0 1 85 93 54 0.307 4.35 1.16 Term + 96716 96924 209 0 2 139 48 128 0.973 11.40 1.17 PlyA + 97097 97102 6 1.05 2.11 PlyA - 97143 97138 6 1.05 2.10 Term - 100143 99998 146 1 2 104 29 115 0.935 5.27 2.09 Intr - 100578 100437 142 0 1 65 63 114 0.978 6.53 2.08 Intr - 101763 101672 92 1 2 115 105 -1 0.999 4.01 2.07 Intr - 102202 102092 111 2 0 56 116 43 0.935 4.25 2.06 Intr - 102906 102621 286 0 1 84 76 170 0.984 12.31 2.05 Intr - 103592 103557 36 2 0 82 80 33 0.558 0.36 2.04 Intr - 103757 103695 63 2 0 50 99 51 0.713 1.31 2.03 Intr - 103948 103841 108 1 0 74 88 126 0.985 11.68 2.02 Intr - 105648 105595 54 0 0 115 92 68 0.970 9.08 2.01 Init - 105773 105732 42 2 0 104 80 40 0.704 4.61 2.00 Prom - 109789 109750 40 -1.66 3.00 Prom + 111576 111615 40 -6.06 3.01 Init + 112106 112209 104 1 2 85 115 60 0.457 8.12 3.02 Intr + 133182 133304 123 0 0 44 113 62 0.027 4.00 3.03 Intr + 144281 144424 144 2 0 75 75 60 0.026 2.80 3.04 Intr + 170677 170746 70 1 1 50 101 70 0.333 3.68 3.05 Term + 172910 173596 687 1 0 73 39 497 0.739 36.91 3.06 PlyA + 174029 174034 6 1.05 4.15 PlyA - 174337 174332 6 1.05 4.14 Term - 174739 174734 6 1 0 97 37 0 0.007 -6.33 4.13 Intr - 180989 180857 133 1 1 88 68 156 0.803 14.25 4.12 Intr - 189387 189170 218 0 2 58 117 212 0.953 18.40 4.11 Intr - 194565 194460 106 2 1 89 105 22 0.982 4.22 4.10 Intr - 196569 196390 180 2 0 131 121 60 0.996 12.58 4.09 Intr - 198620 198430 191 2 2 99 110 194 0.998 21.08 4.08 Intr - 200920 200758 163 0 1 98 67 141 0.969 12.98 4.07 Intr - 200997 200957 41 0 2 52 72 19 0.562 -6.28 4.06 Intr - 201652 201521 132 1 0 42 115 20 0.402 0.94 4.05 Intr - 202602 202423 180 0 0 103 1 277 0.850 20.56 4.04 Intr - 203330 203215 116 0 2 65 31 184 0.923 10.57 4.03 Intr - 203776 203657 120 2 0 89 115 38 0.985 7.07 4.02 Intr - 205516 205409 108 2 0 64 74 102 0.986 6.66 4.01 Intr - 205675 205567 109 1 1 34 110 38 0.288 0.46 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 4815 4487 329 1 2 2 47 279 0.861 10.07 S.002 Intr - 5121 4916 206 2 2 58 75 119 0.922 6.44 S.003 Intr - 5603 5374 230 2 2 52 105 161 0.942 10.87 S.004 Init + 93780 93816 37 2 1 52 121 39 0.888 3.80 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:58328037_58533809|GENSCAN_predicted_peptide_1|555_aa AAAAGRPGMAFMEKPPAGKVLLDDTVPLTAAIEASQSLQSHTEYIIRVQRGISVENSWQI VRRYSDFDLLNNSLQIAGLSLPLPPKKLIGNMDREFIAERQKGLQNYLNVITTNHILSNC ELVKKFLDPNNYSANYTEIALQQVSMFFRSEPKWEVVEPLKDIGWRIRKKYFLMKIKNQP KERLVLSWADLGPDKYLSDKDFQCLIKLLPSCLHPYIYRVTFATANESSALLIRMFNEKG TLKDLIYKAKPKDPFLKKYCNPKKIQGLELQQIKTYGRQILEVLKFLHDKGFPYGHLHAS NVMLDGDTCRLLDLENSLLGLPSFYRSYFSQFRKINTLESVDVHCFGHLLYEMTYGRPPD SVPVDSFPPAPSMAVASGFASVFCGTKMCICSFQVAVLESTLSCEACKNGMPTISRLLQM PLFSDVLLTTSEKPQFKIPTKLKEALRIAKECIEKRLIEEQKQKSKRSALENSEEHSAKY SNSNNSGISALPPPPPPPPPPAAPLPPASTEAPAQLSSQAVNGMSRGALLSSIQNFQKGT LRKAKTCDHSAPKIG >gi568815595r:58328037_58533809|GENSCAN_predicted_CDS_1|1668_bp gcggcggcggccgggcgtcccgggatggccttcatggagaagccgccagccggcaaggtg ctgctggacgacacggtgccgctgacagcagccatcgaggcgagccagagcctgcagtcc cacacggaatatattattcgagtgcaaagaggaatttctgtggaaaacagctggcagatt gttagaagatacagtgactttgatttgcttaacaacagcttacagattgcaggcctaagt ctacctcttcctcccaaaaaattgattggtaacatggatcgtgaattcatagctgaaagg cagaaaggtcttcagaactatctcaacgtgatcacaacaaatcatatcttgtctaattgt gagctggttaagaagtttttagatccaaacaactattccgcaaactatactgagattgcc ttgcaacaggtttccatgttcttccgatcagaaccaaagtgggaggtggtggaacctttg aaagacataggttggagaataaggaagaaatatttcttgatgaagattaaaaatcagcca aaggaacggctagtgttaagctgggctgaccttggcccagacaagtatttgtcagataaa gattttcagtgtctaatcaaacttctgccttcttgtttgcacccttacatctatcgggtt acctttgccacagctaatgaatcctcagcgttgctaattaggatgtttaacgaaaaggga acattgaaggatctgatctacaaggcaaaaccaaaagacccatttctaaagaagtactgc aaccctaagaagattcagggcctggaactccagcaaataaaaacatatggacggcaaata ttagaggtactgaagtttcttcatgacaagggattcccttatgggcatcttcacgcctcc aatgtgatgctcgatggggacacttgccggctgctggaccttgagaattccttattgggc ctgccttccttctaccgatcttatttttcacaattcaggaaaatcaatacattggaaagt gtggatgtccactgctttggccacttactgtatgaaatgacttatggacgaccgccagac tcggtgcctgtggactccttccctcctgccccgtccatggctgtggcaagtggatttgcc agcgtgttctgtggaactaaaatgtgtatctgttcatttcaagtggccgtgttggagtct acgctgtcttgtgaagcctgtaaaaatggcatgcctaccatctcccggctcttacagatg ccattattcagcgatgttttactaaccacttctgaaaaaccacagtttaagatccctaca aagttaaaagaggcattgagaattgccaaagaatgtatagagaagagactaattgaggaa cagaaacagaagtcaaaacgatctgctcttgaaaatagtgaagagcattcagcgaagtac agcaactccaataattcagggatatctgcattacctccacctcctccacctccaccacca ccagcagctcccttgcctcctgcgagcaccgaggcacctgcccagctctcgtctcaggct gtgaatggcatgagccgaggggccttgctcagctccatccagaatttccaaaaaggaact ttgaggaaagccaaaacctgtgatcacagtgctccgaagatcggctga >gi568815595r:58328037_58533809|GENSCAN_predicted_peptide_2|359_aa MAAVSGLVRRPLREVSGLLKRRFHWTAPAALQVTVRDAINQGMDEELERDEKVFLLGEEV AQYDGAYKVSRGLWKKYGDKRIIDTPISEMGFAGIAVGAAMAGLRPICEFMTFNFSMQAI DQVINSAAKTYYMSGGLQPVPIVFRGPNGASAGVAAQHSQCFAAWYGHCPGLKVVSPWNS EDAKGLIKSAIRDNNPVVVLENELMYGVPFEFPPEAQSKDFLIPIGKAKIERQGTHITVV SHSRPVGHCLEAAAVLSKEGVECEVINMRTIRPMDMETIEASVMKTNHLVTVEGGWPQFG VGAEICARIMEGPAFNFLDAPAVRVTGADVPMPYAKILEDNSIPQVKDIIFAIKKTLNI >gi568815595r:58328037_58533809|GENSCAN_predicted_CDS_2|1080_bp atggcggcggtgtctggcttggtgcggagaccccttcgggaggtctccgggctgctgaag aggcgctttcactggaccgcgccggctgcgctgcaggtgacagttcgtgatgctataaat cagggtatggatgaggagctggaaagagatgagaaggtatttctgcttggagaagaagtt gcccagtatgatggggcatacaaggttagtcgagggctgtggaagaaatatggagacaag aggattattgacactcccatatcagagatgggctttgctggaattgctgtaggtgcagct atggctgggttgcggcccatttgtgaatttatgaccttcaatttctccatgcaagccatt gaccaggttataaactcagctgccaagacctactacatgtctggtggccttcagcctgtg cctatagtcttcagagggcccaatggtgcctcagcaggtgtagctgcccagcactcacag tgctttgctgcctggtatgggcactgcccaggcttaaaggtggtcagtccctggaattca gaggatgctaaaggacttattaaatcagccattcgggataacaatccagtggtggtgcta gagaatgaattgatgtatggggttccttttgaatttcctccggaagctcagtcaaaagat tttctgattcctattggaaaagccaaaatagaaaggcaaggaacacatataactgtggtt tcccattcaagacctgtgggccactgcttagaagctgcagcagtgctatctaaagaagga gttgaatgtgaggtgataaatatgcgtaccattagaccaatggacatggaaaccatagaa gccagtgtcatgaagacaaatcatcttgtaactgtggaaggaggctggccacagtttgga gtaggagctgaaatctgtgccaggatcatggaaggtcctgcgttcaatttcctggatgct cctgctgttcgtgtcactggtgctgatgtccctatgccttatgcaaagattctagaggac aactctatacctcaggtcaaagacatcatatttgcaataaagaaaacattaaatatttag >gi568815595r:58328037_58533809|GENSCAN_predicted_peptide_3|375_aa MGASLDQEHSGHPAGSRGVEVSSGSATVANSSGGRSIFCPNEVTLGGFLDGGRSPERPSH DEKFGAFSSTPYPLGRLPDSGRYSQSQLPSELGEREKGKYKKTQPCGLTPYSMIQAHSQL CVSSFPETWALEDASLEQMDNGDWGYMMTDPVTLNVGGHLYTTSLTTLTRYPDSMLGAMF GGDFPTARDPQGNYFIDRDGPLFRYVLNFLRTSELTLPLDFKEFDLLRKEADFYQIEPLI QCLNDPKPLYPMDTFEEVVELSSTRKLSKYSNPVAVIITQLTITTKVHSLLEGISNYFTK WNKHMMDTRDCQVSFTFGPCDYHQEVSLRVHLMEYITKQGFTIRNTRVHHMSERANENTV EHNWTFCRLARKTDD >gi568815595r:58328037_58533809|GENSCAN_predicted_CDS_3|1128_bp atgggtgcctcgctggatcaggagcacagcggacaccctgccggatccagaggggtggaa gtcagcagcgggtctgcaacggtggcaaacagcagtggtggacggagcatcttttgtcct aatgaggtgactcttggtgggttcctggatgggggccggtctccagaaagaccaagccat gatgagaagtttggagcttttagctccactccctatcctctgggaaggcttccagactct ggacggtattcccagagtcagctcccttctgaattgggagaaagagagaaaggaaaatac aagaagacccaaccatgtggattaacaccctatagcatgatccaggcccacagccagctc tgtgtttccagtttccctgaaacctgggctcttgaagacgcatcactggagcagatggat aatggagactggggctatatgatgactgacccagtcacattaaatgtaggtggacacttg tatacaacgtctctcaccacattgacgcgttacccggattccatgcttggagctatgttt gggggggacttccccacagctcgagaccctcaaggcaattactttattgatcgagatgga cctcttttccgatatgtcctcaacttcttaagaacttcagaattgaccttaccgttggat tttaaggaatttgatctgcttcggaaagaagcagatttttaccagattgagcccttgatt cagtgtctcaatgatcctaagcctttgtatcccatggatacttttgaagaagttgtggag ctgtctagtactcggaagctttctaagtactccaacccagtggctgtcatcataacgcaa ctaaccatcaccactaaggtccattccttactagaaggcatctcaaattattttaccaag tggaataagcacatgatggacaccagagactgccaggtttcctttacttttggaccctgt gattatcaccaggaagtttctcttagggtccacctgatggaatacattacaaaacaaggt ttcacgatccgcaacacccgggtgcatcacatgagtgagcgggccaatgaaaacacagtg gagcacaactggactttctgtaggctagcccggaagacagacgactga >gi568815595r:58328037_58533809|GENSCAN_predicted_peptide_4|600_aa PTDERGELPASTAGSWAPLHTALMPKSSDPPRSSFTGTYLQGLETEATYDAATQEFVIHS PTLTATKWWPGDLGRSATHALVQAQLICSGARRGMHAFIVPIRSLQDHTPLPGIIIGDIG PKMDFDQTDNGFLQLNHVRVPRENMLSRFAQVLPDGTYVKLGTAQSNYLPMVVVRVELLS GEILPILQKACVIAMRYSVIRRQSRLRPRQGNLGCKSSLEDRATGKKGLLLLSCPRPIYL GCRSVPRVVIRKMRQVFMVSGIGEGGHNSDPEAKVLDYQTQQQKLFPQLAISYAFHFLAV SLLEFFQHSYTAILNQDFSFLPELHALSTGMKAMMSEFCTQGAEMCRRACGGHGYSKLSG LPSLVTKLSASCTYEGENTVLYLQVARFLVKSYLQTQMSPGSTPQRSLSPSVAYLTAPDL ARCPAQRAADFLCPELYTTAWAHVAVRLIKDSVQHLQTLTQSGADQHEAWNQTTVIHLQA AKVHCYYVTVKGFTEALEKLENEPAIQQVLKRLCDLHAIHGILTNSGDFLHDAFLSGAQV DMARTAYLDLLRLIRKDAILLTDAFDFTDQCLNSALGCYDGNVYERLFQWAQKSPTNTQK >gi568815595r:58328037_58533809|GENSCAN_predicted_CDS_4|1803_bp cccacagatgaaaggggagaactgcctgcaagtacagctgggagctgggccccactccat actgcgctgatgcctaagagttcagacccacctagaagcagcttcacagggacatatctt cagggcctggagactgaagccacctatgacgcagccacccaggagtttgtgatacacagc cccacgctgactgccaccaaatggtggcctggagacttgggacggtcagccacccatgcc ctggtccaggcccagctgatctgctcaggagccaggcggggcatgcacgcttttattgtg ccaatccggagtcttcaggaccacaccccactgccaggaatcatcattggggacatcgga cccaagatggactttgatcaaacagacaatggcttcctgcagctgaaccatgtgcgggtc cccagggagaacatgctgagtcgctttgcacaggtcttgccagatggcacctacgtcaaa ctcggtacagcacagagcaactaccttcccatggtggtggtgcgggtggagctgctgtca ggggagatcctccctatactgcagaaggcctgtgtcatcgccatgcgctactcggtcatc cgccgccaatcccggctccggcccaggcaagggaatctgggctgcaagagttctctggag gacagggccactgggaagaaggggcttctgctgctgtcctgcccgaggcccatttatctg ggctgcagaagtgtccccagggtggttataagaaagatgcgccaggtttttatggtgtcg gggataggggaaggtggacacaacagtgacccagaggcaaaggtcctggactaccagaca caacagcagaaactctttcctcagctggccatcagttatgccttccatttcctggcagtc agcctcttggagttcttccagcactcctacactgccattctgaaccaagacttcagcttc ctgcctgagctccacgcactgagcacgggcatgaaggccatgatgtcagaattctgcacc cagggagctgagatgtgccgcagggcctgtggcggacatggctactcaaagctgagtggc ctgccatcactggtcaccaaattgtcggcctcctgtacctacgagggtgagaacacagtg ctctacctgcaggtggccaggttcctggtgaagagctacctgcagactcagatgtcccct ggctccacgccacagagatctctctctccatctgtcgcatatctcaccgcacctgacctg gccaggtgtccagcccagagggcagccgacttcctctgcccggagctctacaccacggcc tgggcacatgtggcagtaaggctcataaaggactcagtgcagcatttacagaccctgacg caatccggagctgaccagcacgaggcttggaaccagaccactgtcatacacctccaggct gctaaggtgcactgctactatgtcactgtgaagggttttacagaagctctggagaaacta gaaaatgaaccagcgattcagcaggtgctcaagcgcctctgtgacctccatgccatacat ggaatcttgactaactcgggtgactttctccatgacgccttcctgtctggtgcccaagtg gacatggcaagaacagcctacctggacctgctccgcctgatccggaaggatgccatcctg ttaactgatgcttttgacttcaccgatcagtgtttaaattcagcacttggctgttatgat ggaaacgtctacgaacgcctgttccagtgggctcagaagtcaccaaccaatactcagaaa taa