GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:34:17

Sequence gi568815595r:141852516_142095142 : 242627 bp : 41.16% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5506   5521   16  0  1  109   77    11 0.854   2.51
 1.02 Intr +   7229   7425  197  0  2   43  116    74 0.961   3.91
 1.03 Term +   8369   8443   75  1  0   96   53    91 0.931   3.26
 1.04 PlyA +   9788   9793    6                               1.05

 2.02 PlyA -  10283  10278    6                              -0.45
 2.01 Sngl -  12764  12492  273  2  0  114   45   428 0.991  36.18
 2.00 Prom -  15828  15789   40                              -5.25

 3.05 PlyA -  15927  15922    6                               1.05
 3.04 Term -  23566  23475   92  2  2   76   54    73 0.512  -0.40
 3.03 Intr -  24456  24178  279  1  0   79   39   244 0.519  15.13
 3.02 Intr -  29417  29295  123  0  0  121   86   -13 0.658   1.44
 3.01 Init -  33494  33410   85  2  1   74   49    69 0.484   2.63
 3.00 Prom -  36272  36233   40                              -4.25

 4.03 PlyA -  36432  36427    6                               1.05
 4.02 Term -  42244  42115  130  0  1   16   42   245 0.635   9.37
 4.01 Init -  44671  44640   32  1  2   33   99    16 0.455  -3.46
 4.00 Prom -  44973  44934   40                              -6.65

 5.00 Prom +  46383  46422   40                              -5.65
 5.01 Init +  51167  51233   67  1  1   62   91    86 0.738   7.59
 5.02 Intr +  54652  54759  108  2  0  109   92    63 0.829   8.14
 5.03 Intr +  61137  61321  185  1  2   74   51   121 0.945   5.59
 5.04 Intr +  69462  69548   87  2  0   62  103    46 0.765   2.75
 5.05 Intr +  73016  73185  170  1  2   77    2   168 0.071   4.92
 5.06 Intr +  76221  76488  268  0  1   -6   88   200 0.070   7.21
 5.07 Intr +  81669  81899  231  2  0   61   94    73 0.381   2.25
 5.08 Term +  85452  85802  351  2  0   28   54   228 0.416   6.90
 5.09 PlyA +  86244  86249    6                               1.05

 6.04 PlyA -  86284  86279    6                              -0.45
 6.03 Term -  88164  87941  224  1  2   64   49   259 0.714  15.70
 6.02 Intr -  89708  89594  115  2  1   92   22   110 0.397   3.90
 6.01 Init -  91759  91757    3  1  0  113   81     0 0.604   1.85
 6.00 Prom -  96187  96148   40                              -5.35

 7.13 PlyA -  97298  97293    6                              -0.45
 7.12 Term - 100181  99998  184  1  1   61   43   194 0.974   8.23
 7.11 Intr - 100501 100396  106  2  1   99  102    63 0.977   7.15
 7.10 Intr - 107325 107159  167  2  2   77   91   204 0.998  18.28
 7.09 Intr - 111448 111297  152  1  2   63   89    96 0.994   5.24
 7.08 Intr - 117626 117558   69  2  0   25  103    83 0.439   1.96
 7.07 Intr - 121676 121533  144  2  0   41  107    91 0.822   5.86
 7.06 Intr - 126167 126005  163  1  1   67   88   180 0.679  14.96
 7.05 Intr - 141070 141023   48  0  0  103  113    -8 0.059   0.08
 7.04 Intr - 147839 147737  103  0  1   56   53    81 0.011  -0.29
 7.03 Intr - 148939 148685  255  2  0   95   43   120 0.019   4.69
 7.02 Intr - 181047 180921  127  0  1   85   76    63 0.773   4.13
 7.01 Init - 181439 181395   45  2  0   65  100     4 0.729   0.03
 7.00 Prom - 182511 182472   40                              -2.75

 8.03 PlyA - 182561 182556    6                               1.05
 8.02 Term - 191685 191108  578  1  2   65   42   690 0.127  55.74
 8.01 Init - 200956 200842  115  1  1   90   35   103 0.304   5.62
 8.00 Prom - 211626 211587   40                              -4.25

 9.04 PlyA - 211960 211955    6                               1.05
 9.03 Term - 225413 225306  108  2  0   78   53    98 0.046   2.83
 9.02 Intr - 225932 225782  151  1  1   94   64    15 0.019  -1.06
 9.01 Intr - 240612 240546   67  1  1   63  116    47 0.020   2.04

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  73016  73186  171  1  0   77   35   167 0.842   7.14
S.002 Sngl - 191683 191108  576  1  0   93   42   691 0.863  60.92

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:141852516_142095142|GENSCAN_predicted_peptide_1|95_aa
MEESQGEWTSDSMPRTSEIYGNDSGIGVLDSRSGESSYRRGQRTTASLRNWEGTVITRDL
GTEQGSARNRKVCLPSTRDTGASTKAYTERIRTSC

>gi568815595r:141852516_142095142|GENSCAN_predicted_CDS_1|288_bp
atggaggaatcacaaggcgagtggacaagtgattctatgccaaggacatctgagatatat
gggaatgactcaggcataggagttttggactccaggtcaggcgaatctagttaccggagg
ggacagagaacaaccgcctcactgagaaactgggaggggactgtaattaccagggactta
ggaacagagcaggggtcagccaggaacaggaaggtgtgcctgccctcaacccgggacact
ggagctagcaccaaagcctacacggagaggatcaggacatcctgttga

>gi568815595r:141852516_142095142|GENSCAN_predicted_peptide_2|90_aa
MPKRKAEGDTKRDKAKVKDEPQRRSVRLSAKPAPPKPEPKPKKAPAKKGEKVPKGKKGKA
DAGEEGNNPAENGVAKTDQAQKAESPGDAK

>gi568815595r:141852516_142095142|GENSCAN_predicted_CDS_2|273_bp
atgcccaagagaaaggctgaaggggatactaaaagagataaagccaaggtgaaggacgaa
ccacagagaagatccgtgaggttgtctgctaagcctgctcctccaaagccagagcccaag
cctaaaaaggcccctgcaaaaaagggagagaaggtacccaaagggaaaaagggaaaagct
gatgctggcgaggaggggaataaccctgcagaaaatggagttgccaaaacagaccaggca
cagaaagctgaaagtcctggagatgccaagtga

>gi568815595r:141852516_142095142|GENSCAN_predicted_peptide_3|192_aa
MILTDSLDLTGDAPVVKTHFPQGQQHYRGLLPGQSRKYFLLLDNGSVPALLFCCCHHMYH
WPAFIACTQDRARRPRDRGRPRTPSCCPLTPALGGAPQEFSGGRVVDEELPLGQALVEGL
LLVLRHGVRADGERSCGRGDGGGEGCGGASRLRRRSPETAVKGRSLGSVTFLQDALRGIA
RGSHNVHGALLF

>gi568815595r:141852516_142095142|GENSCAN_predicted_CDS_3|579_bp
atgattctcactgattctttggatcttactggagatgcaccagtagttaagacccacttt
ccccagggtcaacaacactacaggggactacttcctggacaatccaggaagtactttctc
ctgctagataatggctcagtacctgcccttctcttctgctgctgtcaccacatgtaccat
tggcctgcgtttatagcctgcacacaggaccgcgcccggcgcccccgggaccgaggccgg
ccccggacgcccagctgctgcccgctcaccccagctcttggcggtgcgccccaggaattc
tccggtggtcgggttgtagatgaagagcttccactcggccaggctctggttgagggactt
cttctcgttcttcgtcatggtgtgcgcgcggacggcgagaggagctgcggccgcggggat
ggaggcggcgagggctgcggaggcgcgtcccggctccggcggcgcagcccggagacggca
gtgaagggacgctctctgggctctgtgacgttcctacaggacgctctgcgggggatcgcg
cggggctcccacaatgtacacggtgcactgctgttctga

>gi568815595r:141852516_142095142|GENSCAN_predicted_peptide_4|53_aa
MRWSTTAGAVRRYPEEGIVITEDSPMRVAAPEDLPVGQEMKVEDGDTDDSDSV

>gi568815595r:141852516_142095142|GENSCAN_predicted_CDS_4|162_bp
atgagatggagtaccacagcaggagcagtcaggaggtatccagaagaaggcattgtcatc
acagaggacagccccatgcgtgttgctgcccctgaagaccttccagtggggcaagagatg
aaggtggaagacggtgatactgatgattctgactctgtgtag

>gi568815595r:141852516_142095142|GENSCAN_predicted_peptide_5|488_aa
MWVMLQTLNDEVPKYRDQIPSPGLMVFPKPVTALEYTFSRSDPTSYAGYIEDLKKFLKPY
TLEEQKNLTVCPDGALFEQKGPVYVACQFPISLLQACSGMNDPDFGYSQGNPCILVKMNR
NEDIPNVAVYPHNGMIDLKYFPYYGKKLHVGYLQPLVAVQVSFAPNNTGKEVTVECKIDG
SANLKSQDDRDKFLGRVMFKITARAYVDGGRWLEGTKQPELGRRNPVGCGWRRWPKLQLE
HFRAPFRGDSSRTGPVADEVNVVGRIAAPMTFTFWGHAHGYALSRGKRGFADEIKKTVHL
AGGERKGAACGDLGWWEKQSGYCSCATKDEEFSPQDPREELAETLGPGLLPKSPFSPWIN
LLYCFYRPMTSEFMCRQVSRCGPSRQVTAALAGVILNQAVSEMFLEKLFPEVPEPVLPID
KPCRSQKTDSGGLHPETRLRTEARGFQAAAFSVLQQGASQPPWLREQGTPAMLCPLPPFP
ECPSVQRG

>gi568815595r:141852516_142095142|GENSCAN_predicted_CDS_5|1467_bp
atgtgggttatgcttcagactctcaacgatgaggttccaaaataccgtgaccagattcct
agcccaggactcatggtttttccaaaaccagtgaccgcattggaatatacattcagtagg
tctgatccaacttcgtatgcagggtacattgaagaccttaagaagtttctaaaaccatat
actttagaagaacagaagaacctcacagtctgtcctgatggagcactttttgaacagaag
ggtccagtttatgttgcatgtcagtttcctatttcattacttcaagcatgcagtggtatg
aatgatcctgattttggctattctcaaggaaacccttgtattcttgtgaaaatgaacaga
aatgaagatataccaaatgtagcagtttatcctcataatggaatgatagacttaaaatat
ttcccatattatgggaaaaaactgcatgttgggtatctacagccattggttgctgttcag
gtcagctttgctcctaacaacactgggaaagaagtaacagttgagtgcaagattgatgga
tcagccaacctaaaaagtcaggatgatcgtgacaagtttttgggacgagttatgttcaaa
atcacagcacgtgcatatgtggacggtgggcgatggctagaggggacgaaacagccagag
ctggggaggaggaacccagtggggtgtggctggcggagatggccaaaacttcagctggag
cacttcagagcacccttcagaggcgattcatcccgaactggacctgttgctgatgaagtc
aatgtggtaggcagaattgcagcccccatgaccttcacgttctggggccatgcccatggt
tatgctctatcacgtggcaagaggggctttgcagatgaaattaagaaaacagttcacctt
gctggtggggagagaaagggtgcagcgtgcggggacctgggctggtgggaaaagcagtcg
ggttattgttcgtgtgctacaaaggatgaggaattcagtccccaggatccacgggaggag
ctggctgagaccctggggcccggccttcttcccaagagtccattttccccttggattaac
ttactttattgtttctaccgtcccatgacttctgagtttatgtgtagacaagtctcccgg
tgcgggcccagcaggcaggtgacagccgcccttgcgggagtcatccttaaccaagcagtt
tcagaaatgtttcttgagaaactgtttccggaagtcccggagcctgtgttgcccattgac
aagccttgtcggtctcagaagacagactccgggggtctacatccagagacccggctgagg
actgaagcccggggtttccaagcggctgccttctccgtcctgcagcagggtgcgagtcaa
ccgccctggctaagggagcaaggaacccccgccatgctgtgtcccctccccccattcccc
gaatgcccgtctgttcaaagagggtga

>gi568815595r:141852516_142095142|GENSCAN_predicted_peptide_6|113_aa
MGEAGTLDSLSPMFQQVEHVDNQKAQKGVEQFRGQQVDPECWHFSLKLAPDDTASFPSFL
PDEGSICRSYCGRLPNARGLPDHCQLSLLSERPFQPLAETCFQESISVENAQW

>gi568815595r:141852516_142095142|GENSCAN_predicted_CDS_6|342_bp
atgggagaggccggaaccctagatagcttatctcctatgttccaacaagttgagcacgtg
gacaatcagaaggcacagaagggtgtggagcagttcaggggccagcaagtagacccagag
tgctggcacttctcactgaagcttgcacctgacgacaccgccagcttccccagtttcttg
ccagatgaaggctccatctgccggtcttactgtggacggctgccaaatgcccgggggctc
cctgatcactgccagctcagccttctctctgagaggcctttccagcctcttgctgaaact
tgctttcaggagagcatctcagtggaaaatgcacagtggtga

>gi568815595r:141852516_142095142|GENSCAN_predicted_peptide_7|520_aa
MNSVGREILKFLKREIPRHVARVGLPYLCEALNLILWAPLSCDKACIGVLVEDNVNTGPE
CQGMGDRTIKRGASGANETLAIPSWHCVKSAPRILAQCSLAIPGVAPVGPSIVAAPLEGT
GGEPWQCLSDAISSGCRVYKLRGCNQKASSLKQRINPHQALNLPALSFWTSQQPPEHDRK
RARKFIDSDFSESKRSKKGDKNGKGLRHFSMKVCEKVQRKGTTSYNEVADELVSEFTNSN
NHLAADSAYDQKNIRRRVYDALNVLMAMNIISKEKKEIKWIGLPTNSAQECQNLEIEKQR
RIERIKQKRAQLQELLLQQIAFKNLVQRNRQNEQQNQGPPALNSTIQLPFIIINTSRKTV
IDCSISSDKFEYLFNFDNTFEIHDDIEVLKRMGMSFGLESGKCSLEDLKLAKSLVPKALE
GYITDISTGPSWLNQGLLLNSTQSVSNLDLTTGATLPQSSVNQGLCLDAEVALATGQFLA
PNSHQSSSAASHCSESRGETPCSFNDEDEEDDEEDSSSPE

>gi568815595r:141852516_142095142|GENSCAN_predicted_CDS_7|1563_bp
atgaactcagtgggaagagaaatattaaagtttctgaagagggaaatccctcgccacgta
gctcgtgtaggcctcccttatctttgtgaagccttaaacttaattctgtgggccccacta
agttgtgataaagcatgcataggggttttggtagaagataacgttaacacaggcccagag
tgccaaggcatgggggacagaactatcaaaagaggggcttcaggcgctaatgaaaccctg
gcaatcccttcctggcactgtgtcaagtctgctccccgaattttggcacagtgctcctta
gccattccaggtgtggctccagtaggcccaagtatagtggctgctcctctagagggcaca
ggtggtgaaccttggcagtgtctgagtgatgccatctccagtgggtgcagagtgtacaag
ctgcggggatgcaaccagaaggcatcatctttgaagcagagaataaaccctcaccaggca
ctgaatctgccggcactttcattttggacctctcagcaacctccagaacatgatagaaaa
cgggctagaaaatttatagactctgatttttcagaaagtaaacgaagcaaaaaaggagat
aaaaatgggaaaggcttgagacacttttcaatgaaagtgtgtgagaaagttcaacgaaaa
ggtacaacatcgtacaatgaagtcgctgatgagctggtgtcagagttcaccaattcaaat
aaccatttggctgctgattcggcttatgatcagaagaacattaggcgaagagtttatgat
gctttaaatgtgctaatggcaatgaacataatttcaaaggaaaaaaaagaaatcaagtgg
attggcctgcctaccaattctgctcaggaatgtcagaatctggagatagagaagcagagg
cggatagaacggataaagcagaagcgggcccagctgcaagaacttctcctacagcaaatc
gctttcaaaaacctggtacagagaaatcgacaaaatgagcagcaaaaccagggcccgccg
gctctgaactctaccattcagctgccattcataatcatcaatacaagcagaaaaacagtc
atagattgcagcatctccagtgacaagtttgagtatcttttcaattttgacaacaccttt
gagatccatgatgacatagaagtactaaagcggatgggaatgtcgtttggcctggagtca
ggcaaatgctctctggaggatctgaaacttgcgaaatccctggtgccaaaggctttagaa
ggttatatcacagatatctccacaggaccttcttggttaaatcagggactacttctgaac
tctacccaatcagtttcaaatttagacctgaccactggtgccaccttaccccagtcaagt
gtaaaccaagggttatgcttggatgcagaagtggccttagcaactgggcagttcctggcc
ccaaacagtcaccagtccagcagtgcggcctctcactgctccgagtcccgaggcgagacc
ccctgttcgttcaatgatgaagatgaggaagatgatgaggaggattcctcctccccagaa
taa

>gi568815595r:141852516_142095142|GENSCAN_predicted_peptide_8|230_aa
MVPASTSGEGLGKSTVIAEGKGESASHVAREGAREMEGAMSMLRLQKRLASSVLCCGKKK
VWLDPNETSEIANANSRQQMRKLIKDELIISKPVDSPFLGLMPEKDLGPPEGRHSGIGKR
KGTANARMPEKVMWILRRLLRRYCESKKIDRHMYHSLYLKVKGNVFKNKRILMEHIHKLK
ADKARKKLLADQAEAPRSKTKEARKHGEERLQAKKEEIIKTLSKEEETKK

>gi568815595r:141852516_142095142|GENSCAN_predicted_CDS_8|693_bp
atggtgccagcatctacttccggtgagggccttgggaagtctacagtcatagcagaaggc
aaaggagagtcagcaagtcacgtggcaagagagggagcaagagagatggagggagccatg
agtatgctcaggcttcagaagaggctcgcctcttctgtcctctgctgtggcaagaagaag
gtctggttggacccgaatgagaccagtgaaattgccaatgccaactcccgtcagcagatg
cggaagctgatcaaagatgagctgatcatcagcaagcctgttgacagtccattcctgggc
ttgatgccggaaaaagaccttggcccgccggaaggcaggcacagtggcataggtaagcgg
aagggtacagccaatgcgcgaatgccagagaaggtcatgtggattctgcgccggctgctt
agaagatactgtgaatctaagaagattgatcgccatatgtatcacagcctgtacctgaag
gtgaagggaaatgtgttcaaaaacaagcggattctcatggagcacatccacaagctgaag
gcagacaaggcccgcaagaagctcctggctgaccaggctgaggcccccaggtctaagacc
aaggaagcacgcaagcacggtgaagaacgcctgcaggccaagaaggaggagatcatcaag
actttgtctaaggaggaagagaccaagaaataa

>gi568815595r:141852516_142095142|GENSCAN_predicted_peptide_9|108_aa
XWFDFHKCRSKRIYRSESQSNKRVSLSVLSCLEIGEGLHKYPCGHHHWDCTGSDLKPAQY
WVLLKALGLYNQQSRSRNAVEEPRLGIRDPKSPLGALPQLVAQLVPKL

>gi568815595r:141852516_142095142|GENSCAN_predicted_CDS_9|327_bp
ngttggtttgacttccacaaatgcagaagtaagaggatttatagatcagaatctcagtcc
aacaaaagagtgtctctgtctgtgctgagctgcctggagataggggaggggttacataag
tatccctgtggccaccaccactgggactgcactggttcagacctgaagccagcacaatac
tgggtcttgctcaaggccctggggctttacaatcagcagagtaggtccagaaatgctgtt
gaagagcccaggcttggaatcagggaccccaagagcccacttggtgctctacctcagctg
gtggctcagctggtacctaaactgtga