GENSCAN 1.0 Date run: 6-Nov-116 Time: 14:39:00 Sequence gi568815585f:30635606_30864103 : 228498 bp : 44.79% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 7023 7152 130 1 1 78 92 13 0.032 0.45 1.02 Intr + 11327 11452 126 2 0 77 87 21 0.013 0.69 1.03 Intr + 17543 17700 158 2 2 134 115 -5 0.030 6.45 1.04 Term + 21869 23751 1883 0 2 114 38 677 0.035 50.66 1.05 PlyA + 25129 25134 6 1.05 2.07 PlyA - 25681 25676 6 1.05 2.06 Term - 27665 27649 17 0 2 117 44 26 0.033 -0.50 2.05 Intr - 35957 35869 89 1 2 57 100 43 0.081 2.01 2.04 Intr - 36204 36096 109 1 1 81 79 23 0.009 0.04 2.03 Intr - 41463 41429 35 2 2 94 86 29 0.013 1.17 2.02 Intr - 51258 51131 128 0 2 87 56 65 0.036 2.68 2.01 Init - 61783 61736 48 1 0 100 -26 111 0.517 1.85 2.00 Prom - 63348 63309 40 -2.66 3.00 Prom + 64743 64782 40 -3.96 3.01 Init + 70077 70094 18 2 0 91 62 37 0.270 0.36 3.02 Intr + 70583 70734 152 1 2 96 42 101 0.045 5.26 3.03 Intr + 81697 81741 45 1 0 69 70 90 0.009 2.82 3.04 Intr + 99946 100070 125 1 2 82 91 140 0.450 13.93 3.05 Intr + 108213 108302 90 1 0 98 42 60 0.816 2.37 3.06 Intr + 108455 108554 100 0 1 78 89 107 0.731 8.97 3.07 Intr + 116278 116377 100 1 1 101 91 43 0.623 6.01 3.08 Intr + 120287 120420 134 1 2 37 92 20 0.086 -3.26 3.09 Term + 128339 128501 163 2 1 95 35 147 0.907 7.51 3.10 PlyA + 128792 128797 6 1.05 4.00 Prom + 135552 135591 40 -4.76 4.01 Init + 147686 147758 73 1 1 110 103 62 0.936 9.18 4.02 Intr + 161818 161840 23 2 2 98 113 8 0.102 1.56 4.03 Intr + 162370 162509 140 0 2 50 72 66 0.083 0.46 4.04 Intr + 162714 162850 137 0 2 70 67 38 0.058 0.11 4.05 Term + 167573 167655 83 0 2 69 43 73 0.027 -1.24 4.06 PlyA + 170029 170034 6 1.05 5.10 PlyA - 171536 171531 6 1.05 5.09 Term - 173014 172869 146 2 2 111 42 81 0.685 3.87 5.08 Intr - 183925 183868 58 1 1 72 95 35 0.304 1.06 5.07 Intr - 196594 196442 153 1 0 104 60 49 0.785 3.97 5.06 Intr - 197881 197452 430 0 1 13 64 212 0.045 5.02 5.05 Intr - 201201 201174 28 1 1 81 105 2 0.021 -1.63 5.04 Intr - 201445 201281 165 2 0 49 76 81 0.217 2.93 5.03 Intr - 216768 216405 364 0 1 45 50 398 0.243 26.46 5.02 Intr - 220324 220271 54 1 0 74 93 57 0.927 3.98 5.01 Intr - 225169 225116 54 1 0 61 107 26 0.215 0.98 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 14404 14304 101 2 2 81 48 98 0.838 3.39 S.002 Sngl + 22459 23751 1293 0 0 89 38 602 0.929 51.30 S.003 Term + 40303 40421 119 2 2 86 48 81 0.826 2.60 S.004 Init + 105991 106039 49 0 1 86 58 39 0.827 -0.19 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:30635606_30864103|GENSCAN_predicted_peptide_1|765_aa XDMESPVFAFPLLLKLETHIEKLFLYSFSWDFECSQCGHQYQNRHMKSLVTFTNVIPEWH PLNAAHFGPCNNCNSKSQIRKMVLEKVSPIFMLHFVEGLPQNDLQHYAFHFEGCLYQITS VIQYRANNHFITWILDADGSWLECDDLKGPCSERHKKFEVPASEIHIVIWERKISQVTDK EAACLPLKKTNDQHALSNEKPVSLTSCSVGDAASAETASVTHPKDISVAPRTLSQDTAVT HGDHLLSGPKGLVDNILPLTLEETIQKTASVSQLNSEAFLLENKPVAENTGILKTNTLLS QESLMASSVSAPCNEKLIQDQFVDISFPSQVVNTNMQSVQLNTEDTVNTKSVNNTDATGL IQGVKSVEIEKDAQLKQFLTPKTEQLKPERVTSQVSNLKKKETTADSQTTTSKSLQNQSL KENQKKPFVGSWVKGLISRGASFMPLCVSAHNRNTITDLQPSVKGVNNFGGFKTKGINQK ASHVSKKARKSASKPPPISKPPAGPPSSNGTAAHPHAHAASEVLEKSGSTSCGAQLNHSS YGNGISSANHEDLVEGQIHKLRLKLRKKLKAEKKKLAALMSSPQSRTVRSENLEQVPQDG SPNDCESIEDLLNELPYPIDIASESACTTVPGVSLYSSQTHEEILAELLSPTPVSTELSE NGEGDFRYLGMGDSHIPPPVPSEFNDVSQNTHLRQDHNYCSPTKKNPCEVQPDSLTNNAC VRTLNLESPMKTDIFDEFFSSSALNALANDTLDLPHFDEYLFENY >gi568815585f:30635606_30864103|GENSCAN_predicted_CDS_1|2298_bp ngtgatatggaaagccctgtgtttgcatttcccctgctcttaaaactagaaacccacatt gaaaagctcttcctatattctttttcttgggactttgaatgttcgcagtgtggacaccaa tatcaaaacaggcatatgaagagtctggtcacctttacaaatgtcatccctgagtggcac ccacttaatgctgcccattttggtccatgtaacaattgcaacagtaaatcacaaataaga aaaatggtattagaaaaagtatctcccatattcatgttgcactttgtagaaggcttacca cagaatgacttgcagcactatgcatttcattttgaaggctgtctttatcagataacttct gtaattcagtatcgagcaaataatcattttataacatggattttagatgctgatggaagt tggctggaatgtgatgacttaaaaggcccatgttctgaaaggcacaagaaatttgaagtt cctgcttcagagatacatattgttatttgggaaagaaaaatatcccaagtgacagataaa gaagctgcctgccttccacttaaaaagactaatgaccaacacgctctcagtaatgagaaa ccagtatctttaacatcgtgttctgtgggtgatgctgcctcagctgaaacagcctcagta actcaccctaaagatatatcagttgcccctcgtactctttcacaggacacagctgtaact catggagatcatttactttcaggtccaaaaggtttggttgacaatattttacctctgaca cttgaagaaactatccagaaaacagcctcagtttcacagttaaattctgaagctttcctg ttagaaaataaacctgtagcagaaaatacaggaattctcaaaaccaatactttgctatca caagaatcactaatggcttcttcagtatcagctccatgtaatgaaaagcttattcaagac caatttgtggacataagttttccatcccaagttgtaaatacaaacatgcagtcagtacag ctgaatacagaagatactgtaaatactaaatctgtgaataatactgatgctactggtctt atacagggagtgaagtcagtagaaattgagaaggacgctcagttaaaacaattccttaca ccaaaaactgaacaattaaaaccagaacgtgtcacatctcaggtatctaatttgaagaaa aaagaaactacagcagattctcaaaccacaacatctaagtcattacagaatcagtctctg aaagaaaatcagaagaagccatttgtgggaagttgggttaaaggcttaataagcaggggt gcttcttttatgccactctgtgtttcagctcataatagaaacactataactgatttacaa ccttcagttaaaggggtaaataattttggtggctttaaaactaaaggtataaaccagaag gccagccacgtatccaagaaagctcgtaagagtgcaagtaagcctcctcccatcagtaag ccaccagcaggccctccatcgtctaatggcacagctgcccacccacatgctcatgctgct tcagaagttttggaaaagtctggaagcacctcatgtggagctcaactcaaccacagttct tatgggaatggtatttcttcagcaaaccatgaagacttggtggaaggtcagattcataaa cttcgtctaaaacttcgtaaaaagctaaaggcagaaaagaagaaattagctgctcttatg tcttccccgcaaagcagaacagttcgaagtgaaaatctagaacaggtgccccaggatggg tctccaaatgattgtgaatcaatagaggacttgttaaatgagctaccatatccaattgat attgccagtgagtctgcatgcaccactgttcctggtgtttccctgtacagtagtcaaact catgaagaaattttagcggaattattgtctcctacacctgtttcaacagagctgtcagaa aatggggaaggtgactttaggtatttgggaatgggagatagtcatatcccaccaccagta ccaagtgaattcaatgatgtttcccagaacacacatctgagacaggaccataattattgt agccccaccaagaaaaatccatgtgaagttcagccagactctctgacaaataatgcctgc gttagaacattaaacttggagagtccgatgaagactgatattttcgatgagtttttttcc tcctcagcattaaatgctttagcaaatgacacattagacctacctcatttcgatgaatat ctgtttgagaattattga >gi568815585f:30635606_30864103|GENSCAN_predicted_peptide_2|141_aa MVGSRFADKEDFLEEASSECEAYSVETPCLKIDNAVETLTSSPLLPSYLEANNWGLPDVL DMKSGEMKYKAGRHPLRHHLLREPPLMLLRSNIDTPSPSSHHNPRKQVWFTDGQPDKNAS DGVSASQPKKNVPRPRTFSAI >gi568815585f:30635606_30864103|GENSCAN_predicted_CDS_2|426_bp atggtgggcagccgatttgcagacaaggaggacttcctggaggaggcaagctctgaatgt gaagcctactcagtggagacaccttgcctgaaaatagacaatgcagtggaaaccctcacc agctccccactcctcccgtcttacctagaagcaaacaactggggtttaccagatgttctg gacatgaagtctggagaaatgaagtataaagctgggagacatccgctgaggcatcacctt ctcagggaacctcccctgatgctgcttcgatctaatatagacaccccttctcccagcagt caccacaacccccggaaacaggtatggttcacagatggacaaccagataaaaatgcatct gatggagtgtcagcaagtcagccaaagaaaaatgttcccaggccacgaacatttagtgcc atctga >gi568815585f:30635606_30864103|GENSCAN_predicted_peptide_3|308_aa MAKATLEIGGSIKKELKRSSTILWNTIITIITISTTNQHHHYQKHSPGVCLTNCKLHSIT TIINAEDFCEQRAGCAAGGRAVLSGEPEANMDQETVGNVVLLAIVTLISVVQNDCKILES GGRALLMTVFLTPNTVFSREQLLRFFAHKVEHESRTQNGRSFQRTGTLAFERVYTAKWQL AAPSYWVDVSGITNGIYVSCTMVIINFNFQHVLVATKLTQVFSFLHLVPAAFAGLMYLFV RQKYFVGYLGERTQSTPGYIFGKRIILFLFLMSVAGIFNYYLIFFFGSDFENYIKTISTT ISPLLLIP >gi568815585f:30635606_30864103|GENSCAN_predicted_CDS_3|927_bp atggccaaggccaccttggagattggtggctccatcaaaaaggagttaaaaagaagcagc actattttgtggaatacaatcatcaccattatcaccatcagcaccaccaaccagcaccac cattatcaaaagcattcacctggtgtctgccttacaaactgcaaactgcacagcataaca acaatcatcaatgcagaagacttctgcgagcaaagggcaggttgtgcagctggaggcaga gcagtcctctctggggagcctgaagcaaacatggatcaagaaactgtaggcaatgttgtc ctgttggccatcgtcaccctcatcagcgtggtccagaatgactgtaagatcctagaaagt gggggccgtgccttgctcatgactgtgtttctaacaccaaacacagtgttcagtagagag cagctgctgagattctttgcccataaagtggagcacgaaagcaggacccagaatgggagg agcttccagaggaccggaacacttgcctttgagcgggtctacactgccaagtggcagctg gctgccccatcttattgggtagatgtaagtggaattacgaatgggatttatgtttcatgc acgatggtgattattaacttcaactttcagcatgttttggtggctacgaaactgacacag gtgttttcatttctccacttagttcctgctgcgtttgctggactgatgtacttgtttgtg aggcaaaagtactttgtcggttacctaggagagagaacgcagagcacccctggctacata tttgggaaacgcatcatactcttcctgttcctcatgtccgttgctggcatattcaactat tacctcatcttctttttcggaagtgactttgaaaactacataaagacgatctccaccacc atctcccctctacttctcattccctaa >gi568815585f:30635606_30864103|GENSCAN_predicted_peptide_4|151_aa MADLLLSMLLLTLSLCWCRARLYTYRETSSEKEGYPGSPRQPLAAPLLWNPGFLHLKGES SGTTPGAQEGCTCQGPTDSFPCCLLGLQQRFLQPRGISGNSLEDEATPKSRGVFAGLTVE LWRASWMSLSVRVTGKAPGLVTVSSVGKKYG >gi568815585f:30635606_30864103|GENSCAN_predicted_CDS_4|456_bp atggctgatcttctactttccatgctcctgctcacactgtctctgtgctggtgtagagca aggctctacacatatagggaaaccagctcagagaaggagggatatcctgggagtcccagg cagccactggcagcacccctgctgtggaacccaggattcctgcacctgaaaggggagtct tctgggaccactcctggagcccaggagggctgcacgtgtcaggggcccacagacagcttc ccttgctgtctccttggactgcagcagcgtttcttacagccacgtggcatctcagggaat tctctggaagatgaggctacaccgaagtcaagaggcgtctttgctgggctcactgttgag ctgtggagggcctcgtggatgagcctttccgtgagggtgaccggaaaggcgcctgggctg gtgacggtgtcaagtgtaggcaagaaatacggctga >gi568815585f:30635606_30864103|GENSCAN_predicted_peptide_5|483_aa VFRLLGGYFKNYSDIVKTFDEVDYVASSELHAWNLLDQDLEKPDGSKVVLTQDQDLEKPD GSKVVLTQDQDLEKPDGSKVVLTQDQDLEKPDGSKVVLTQDQDLEKPDGSKVVLTQDQDL EKPDGSKVVLTQDQDLEKPDGSKVVLTQDQDLEQPDGLEIPNVSDNGEWAWKVLQEAKGV YPMLGLTLDREKGTSTDAERKIRTQKKCNGSQESFLFPARERGCQCQVMGNRVLWEGRKK GAATGFSPMAQGKPTDSPLCLEPLTPELTMLSVQVCRGQGGDGPLEPEHQGAAPRLAQRV SIWHQKAAAFKNQQAKEMKNTRPGPPRGTQTLAGLRGLSGKGMTNVVIAWLRASSAVTAL LSRSQMGPALKGSSLLSSPWVPTMHSLHIIFVPTDRDPVTLPPMPVGLQEALMQVHLTIL DYSDELGETSKVLSVGSITPTHSPSASPTGFISKPSHTYFMDSATPRPSSCHVLPELLQN PSS >gi568815585f:30635606_30864103|GENSCAN_predicted_CDS_5|1452_bp gtatttcgccttctcggcggttactttaaaaattattctgacattgtaaaaacgtttgat gaagtagactatgtggctagctctgagctgcatgcttggaatctgctggaccaagatctg gagaaaccagacgggtctaaggtggtcctaactcaggaccaagatctggagaaaccagac gggtctaaggtggtcctaactcaggaccaagatctggagaaaccagacgggtctaaggtg gtcctaactcaggaccaagatctggagaaaccagacgggtctaaggtggtcctaactcag gaccaagatctggagaaaccagacgggtctaaggtggtcctaactcaggaccaagatctg gagaaaccagacgggtctaaggtggtcctaactcaggaccaagatctggagaaaccagac gggtctaaggtggtcctaactcaggaccaagatctggagcaaccagatgggttagaaatt cctaacgtatctgataatggggagtgggcgtggaaggtgttgcaggaagccaagggggtc tatcccatgcttgggttgaccctggacagagaaaaaggaacatctacagatgcagaaaga aaaataagaactcaaaagaaatgcaacgggagtcaggaatcattcctgttccctgccagg gaaaggggttgccagtgccaggtgatgggcaacagggtgctctgggaaggaaggaaaaaa ggagctgccacagggttttctccaatggcccaggggaaacccacagattcgcctttgtgt ctggagccgttgaccccagaactcacaatgctcagtgttcaggtctgcaggggccaaggt ggtgatggtcccctcgagccagagcaccagggagcagccccaaggctggcccagagggtc tccatttggcaccaaaaagcagctgcatttaaaaatcaacaggcaaaggaaatgaaaaac acccgcccagggccacctaggggcacccagacccttgcaggtttacgaggcctttctgga aaaggcatgactaatgttgttattgcctggctgcgcgccagcagtgctgtcaccgccttg ctgagccgcagtcagatgggccctgccctcaagggcagctcactcctaagcagtccatgg gtgcccactatgcacagcctccatatcatcttcgttccaacagatagggatcccgtgacc ctgccacccatgccagttggattacaggaagccctgatgcaggttcacctgactattttg gattattctgatgagttgggagagacatcaaaggtcctttctgtaggctccatcacccca acacacagtccatcagcaagtccaactggcttcatctccaaaccatcccacacctacttc atggactctgctacccccagaccaagctcctgccatgtgttgcctgaactattgcaaaat ccttctagctga