GENSCAN 1.0 Date run: 7-Nov-116 Time: 21:22:09

Sequence gi568815593r:16802297_17002641 : 200345 bp : 44.60% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   3264   3396  133  0  1   31   36   181 0.612   4.76
 1.02 PlyA +   4663   4668    6                               1.05

 2.09 PlyA -   4818   4813    6                               1.05
 2.08 Term -  11581  11573    9  1  0  115   41     0 0.156  -3.81
 2.07 Intr -  15871  15713  159  1  0  133   92   184 0.957  23.48
 2.06 Intr -  27070  27021   50  2  2   88   80    53 0.023   2.90
 2.05 Intr -  44369  44140  230  1  2   55   36   166 0.090   5.61
 2.04 Intr -  45667  45654   14  1  2  112   84    -9 0.003  -5.32
 2.03 Intr -  57841  57812   30  1  0   93   62    51 0.013   1.33
 2.02 Intr -  75411  75313   99  0  0  116   94   108 0.957  14.51
 2.01 Init -  76340  76335    6  2  0   49   86    10 0.619  -2.79
 2.00 Prom -  84084  84045   40                              -5.26

 3.00 Prom +  84263  84302   40                              -6.26
 3.01 Init +  90161  90163    3  1  0  108   81     0 0.654   1.30
 3.02 Intr +  91927  92004   78  0  0  109   66    32 0.436   2.85
 3.03 Intr +  93249  93276   28  2  1  103   95   -15 0.418  -1.61
 3.04 Intr +  98893  99030  138  2  0   83   50   107 0.512   6.84
 3.05 Term +  99083  99225  143  0  2   73   42    72 0.542  -0.81
 3.06 PlyA +  99341  99346    6                               1.05

 4.10 PlyA -  99746  99741    6                               1.05
 4.09 Term - 100345 100046  300  1  0   36   34   370 0.182  21.62
 4.08 Intr - 103301 103218   84  2  0   84   96     3 0.012   0.72
 4.07 Intr - 118975 118922   54  1  0   68   94    42 0.085   1.98
 4.06 Intr - 131991 131876  116  1  2   59   97    58 0.124   3.97
 4.05 Intr - 151289 151211   79  2  1   29   68    59 0.002  -2.88
 4.04 Intr - 155695 155609   87  1  0   95   52    34 0.041   0.67
 4.03 Intr - 169632 169522  111  0  0   72   61    81 0.474   4.38
 4.02 Intr - 169811 169732   80  0  2  121   80    12 0.186   2.97
 4.01 Init - 188800 188596  205  1  1   92   44    91 0.003   4.16

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  34156  34294  139  1  1   74   48   177 0.951   9.74
S.002 Term + 184204 184320  117  0  0  109   48   124 0.992   9.04

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:16802297_17002641|GENSCAN_predicted_peptide_1|44_aa
XYCYRSTNGHDDAMQFKTVDTCSSQTRGGATYRFPHVGFQIPQE

>gi568815593r:16802297_17002641|GENSCAN_predicted_CDS_1|135_bp
nnatattgttaccgctccacaaacggccatgatgatgccatgcagtttaagactgtggac
acatgcagctcccaaacacgaggtggggccacgtaccggtttcctcatgttggatttcaa
attccacaagaataa

>gi568815593r:16802297_17002641|GENSCAN_predicted_peptide_2|198_aa
MKGTRVWLRENGQHFPSTVNSCAEGIVVFRTDYGQNPVIGRFFHVAITCMQSLHNDDDDL
SHLLLPNMLIMDLFVATLLLNRFQLFLTAKNQDNIAFESESVKPALFHGDQNLGEFLFMI
VDFAGPESLGAVFDVSVAHDDEQVFTYKQSTITHQKVTAMHPTNEEGVDDMASLTELHGG
SIMYNLFQRYKRNQIYCQ

>gi568815593r:16802297_17002641|GENSCAN_predicted_CDS_2|597_bp
atgaagggaacacgggtctggctgagagaaaatggccagcattttccaagtactgtaaat
tcctgtgcagaaggcatcgtcgtcttccggacagactatggtcagaacccagtgattggc
cgcttcttccacgttgctatcacttgcatgcagtccctgcacaacgatgatgatgacctt
tcccaccttttgctaccaaacatgctaattatggacctgtttgtggccaccctccttctc
aaccggtttcaactatttttgacagcaaagaatcaagataatatagcctttgaaagtgaa
tcagtgaagcctgcgctctttcatggtgaccaaaatctgggagaatttctttttatgatt
gtagactttgcgggcccagaaagtctcggtgctgtgtttgatgttagtgttgcccatgat
gatgagcaggtattcacttacaagcagagcacaattacccaccagaaggtgactgctatg
caccccacgaacgaggagggcgtggatgacatggcgtccttgacagagctccatggcggc
tccatcatgtataacttattccagcggtataagagaaatcaaatatattgccagtaa

>gi568815593r:16802297_17002641|GENSCAN_predicted_peptide_3|129_aa
MDPFWDIGCNRHAPSLIALTVLRNAGQEASHLLRTNACCFASNTNASSGFGIHIHYLLSE
ALAGLLGAPPPSVSLLHVPLSPAHKVVPSKEGRKEDFQQPKNLVNFSINSTHLETLPEER
ICLLPALPY

>gi568815593r:16802297_17002641|GENSCAN_predicted_CDS_3|390_bp
atggaccccttctgggacattggatgtaatcgtcatgctccttctttgatagccttgacg
gttttgagaaatgctggtcaggaagcctctcacctcctgaggaccaatgcctgttgcttt
gccagcaacaccaatgcatcctctggcttcggcattcacatacattacctcctctcagag
gccctcgctggcctgctgggcgcccctcctccgtcagtcagcttgctgcatgtacctttg
tccccagcacacaaggtggtgcccagcaaagaaggcaggaaggaggatttccaacaaccc
aagaacctggtgaacttcagtataaactccacccatctggaaactttgccagaagagcgt
atttgtctcttaccagccctgccctattaa

>gi568815593r:16802297_17002641|GENSCAN_predicted_peptide_4|371_aa
MPEPPFSSVGSCVAGASRRNAAPCSTVPSPIDHPRAEECWHRAQDWQAAPPAAPVRDPLG
EASWAPQSGTFTNWLTNHEHLFNIARGPPTPVNCQRSILSSNILDRCSPSVCAYIQCYQG
SLSWHSAGLVLIVSRILPPTPMPPECRGERNTANIGKLNLKIPAKLIRTFPIYIILCAYS
NASAKTTACLSTTGVRGSPLGRTLPPSASGGNIRITRYSVLISCYPEKGRILSQSFEGMS
SYVKHSSVSGGLRWTHCPGWLAGWWLFLQDGRMTKKRRNNGRAKKGRGHVQPIRCTNCAR
CVPKDKAIKKFVIRNIVEAAAVRDMSEASVFDAYVLPKLYVKLHYCLSCAVLSKVVRNRS
REARKDRTPPP

>gi568815593r:16802297_17002641|GENSCAN_predicted_CDS_4|1116_bp
atgcctgagcctcccttctcctccgtgggctcctgtgtggccggagcctcccggaggaat
gccgccccctgctccaccgtgcccagtcccatcgaccacccaagggctgaggagtgctgg
cacagggcgcaggactggcaggcagctccacctgcggccccagtgcgggatccactgggt
gaagccagctgggctcctcagtctggcacctttaccaattggttgaccaatcatgagcac
ctttttaatatagcccgaggccctcccacacctgtgaactgtcagcgcagcattctctct
agcaacatcctcgaccgctgctcacccagcgtctgcgcctatatccagtgctaccagggt
agcctgtcctggcactcagcgggcttagttcttattgtctcaagaatcctcccgcctact
ccaatgcccccagaatgcaggggggagcgtaacacagctaacattggaaagctgaatctt
aagattcctgcaaaactcatcaggacttttcccatttacatcattctctgcgcttactcg
aatgccagtgccaagaccacagcctgtctgagcaccacaggggtccgtggctcccccttg
gggagaactctgccccctagtgcaagtggggggaacatcaggataacgcgctacagcgtc
ctgatctcctgttacccggaaaaaggtcgtattctctcgcaaagctttgagggcatgtca
agttatgtcaagcactcatcggtcagtggaggtctgagatggactcattgccctggctgg
ctggctggctggtggctgttcctgcaggacgggaggatgacaaagaaaagaaggaacaat
ggtcgtgccaaaaagggccgcggccacgtgcagcctattcgctgcactaactgtgcccga
tgcgtgcccaaggacaaggccatcaagaaattcgtcattcgaaacatagtggaggccgca
gcagtcagggacatgtctgaagcgagcgtcttcgatgcctatgtgcttcccaagctgtat
gtgaagttacattactgtctgagttgtgcagttctcagcaaagtagtcaggaatcgatct
cgtgaagcccgcaaggaccgaacacccccaccctga