GENSCAN 1.0 Date run: 6-Nov-116 Time: 08:47:21 Sequence gi568815586r:15009113_15317489 : 308377 bp : 36.85% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 4558 4730 173 2 2 41 55 133 0.345 2.31 1.02 PlyA + 5930 5935 6 1.05 2.04 PlyA - 7452 7447 6 1.05 2.03 Term - 28145 28012 134 0 2 30 32 181 0.730 3.97 2.02 Intr - 29348 29268 81 0 0 71 91 57 0.064 2.99 2.01 Init - 56551 56410 142 1 1 76 87 86 0.251 5.99 2.00 Prom - 65271 65232 40 -6.35 3.00 Prom + 67499 67538 40 -3.65 3.01 Init + 68688 68903 216 2 0 79 63 61 0.137 1.48 3.02 Term + 86230 86460 231 0 0 9 42 245 0.236 7.49 3.03 PlyA + 87021 87026 6 1.05 4.02 PlyA - 87251 87246 6 -3.84 4.01 Sngl - 87741 87262 480 0 0 67 39 275 0.970 16.33 4.00 Prom - 98790 98751 40 -6.25 5.04 PlyA - 98963 98958 6 1.05 5.03 Term - 100405 99998 408 1 0 106 48 346 0.748 26.63 5.02 Intr - 119975 119811 165 2 0 77 86 148 0.579 12.74 5.01 Init - 127958 127854 105 2 0 28 75 87 0.669 1.67 5.00 Prom - 129060 129021 40 -6.15 6.00 Prom + 131106 131145 40 -6.15 6.01 Init + 133270 133279 10 0 1 90 49 0 0.660 -3.09 6.02 Intr + 134877 135007 131 1 2 22 76 178 0.723 9.49 6.03 Intr + 136420 136588 169 0 1 6 62 157 0.575 3.50 6.04 Intr + 157255 157467 213 2 0 53 42 206 0.121 10.26 6.05 Intr + 160395 160516 122 1 2 90 58 76 0.144 4.09 6.06 Intr + 166349 166422 74 1 2 83 103 -17 0.078 -3.41 6.07 Intr + 172614 172753 140 0 2 88 70 122 0.450 9.59 6.08 Intr + 186980 187097 118 0 1 113 90 52 0.048 6.50 6.09 Intr + 197411 197498 88 2 1 29 59 66 0.004 -3.15 6.10 Intr + 201708 201740 33 2 0 138 91 12 0.184 4.00 6.11 Intr + 211895 212219 325 1 1 58 35 260 0.130 12.12 6.12 Intr + 217338 217421 84 1 0 93 47 55 0.230 0.77 6.13 Intr + 234539 234651 113 0 2 65 55 103 0.032 3.88 6.14 Intr + 242846 242927 82 1 1 68 77 45 0.005 -0.31 6.15 Term + 244196 244329 134 0 2 79 37 104 0.775 1.67 6.16 PlyA + 245660 245665 6 1.05 7.00 Prom + 256580 256619 40 -5.25 7.01 Init + 263480 263637 158 1 2 69 81 134 0.440 10.23 7.02 Intr + 270933 271050 118 0 1 75 53 137 0.156 8.45 7.03 Intr + 272111 272245 135 1 0 62 93 33 0.287 1.04 7.04 Term + 286892 286900 9 1 0 154 45 0 0.176 -0.48 7.05 PlyA + 287637 287642 6 1.05 8.04 PlyA - 289104 289099 6 1.05 8.03 Term - 291073 290755 319 0 1 83 44 154 0.146 3.87 8.02 Intr - 291591 291165 427 1 1 24 60 208 0.085 3.52 8.01 Init - 296885 296597 289 2 1 90 44 228 0.100 15.92 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 183216 183059 158 1 2 17 54 210 0.873 7.61 S.002 Sngl - 296885 296448 438 2 0 90 43 265 0.894 18.21 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:15009113_15317489|GENSCAN_predicted_peptide_1|57_aa XPYKLNLQQSSKALREREVSEAQENLQLREHQKMARKGRSELTVTGQGLSESHKEIG >gi568815586r:15009113_15317489|GENSCAN_predicted_CDS_1|174_bp nctccttataaattgaacttgcagcagtctagcaaggcattgagagaacgagaggtgtct gaagctcaggaaaatctgcagctcagagagcatcagaaaatggctagaaagggtaggtct gaactgactgtcacagggcagggactttcagagagtcacaaggagattggatga >gi568815586r:15009113_15317489|GENSCAN_predicted_peptide_2|118_aa MPNVQRLEDLLTCLAHHCHYWHLSKLPECPRISMQGPANPVSAYTTSAQSQMPTGVLPGG CCHRSFTTRPHLTVAGKVNALGGYKFGVSAGKALVATCLLFGGPRLQRWIQKPAQVAA >gi568815586r:15009113_15317489|GENSCAN_predicted_CDS_2|357_bp atgcctaatgtccagaggctggaggacttgctcacctgcctggctcaccactgccactac tggcacctgagcaagctccctgaatgcccgagaatcagcatgcaaggacctgctaatcca gtgtcagcatacaccacttcagcccagtctcagatgcccactggggtgcttcctggtggc tgctgccataggtcttttactaccagacctcacctaactgttgcaggcaaggtgaacgca ctgggtggttacaaatttggcgtgtcagccgggaaagctcttgtggctacctgcctgttg ttcggtggcccccgactccagcgatggatccagaagccagcccaagtggctgcctag >gi568815586r:15009113_15317489|GENSCAN_predicted_peptide_3|148_aa MKIRKFFELNYYSVTTYQSLWETAKAVLRGKCIALNAYIRKSERAQTGNLKSHLKELEKQ EPTKPKPRRRKEEQWEAFSRIGSSNGRLTGSGAQQTPCRIQRDGSQWRVCDGGKQQWWTA SESSARAVTNTDQKSAVARFNRVKTELL >gi568815586r:15009113_15317489|GENSCAN_predicted_CDS_3|447_bp atgaagattagaaaattctttgaactgaactactatagtgtcacaacctatcaaagcctc tgggagacagcaaaggcggtgctaagaggaaagtgcatagccctaaacgcctacatcaga aagtctgaaagagcacaaacaggcaatctaaagtcacacctcaaggaactagagaaacaa gaaccaaccaaacccaaacccaggagaagaaaggaagaacaatgggaagcctttagccgg atagggagcagcaatgggcgccttactggatcaggagcacagcagacaccctgccggatc cagagggatggaagtcagtggcgggtctgcgacggtggcaaacagcagtggtggacggct agtgaaagctcagctcgagctgtaacaaacacggaccagaagagtgcagttgcaagattt aatagagtgaaaacagagctcttataa >gi568815586r:15009113_15317489|GENSCAN_predicted_peptide_4|159_aa MNSRWIKDLHVRPKTIKTLEENQGNAIQAIGMGKGFMSKTPKAMATKAKIDKWDLIKLKS SCTAKETTIRVNGQPIEWEKIFAIYSSDKGLISRICKELKQISKEKTNQLHQKVGKGYEQ PLLKRRRLCSQKTHEKMLIITGHQRNANQNRNEIPSHTS >gi568815586r:15009113_15317489|GENSCAN_predicted_CDS_4|480_bp atgaattcaagatggattaaagacttacatgttagacctaaaaccataaaaaccctagaa gaaaaccaaggcaatgccattcaggccataggcatgggcaagggcttcatgtctaaaaca ccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagc tcctgcacagcaaaagaaactaccatcagagtgaacgggcaacctatagaatgggagaaa atttttgcaatctactcatctgacaaagggctaatatccagaatctgcaaagaactcaaa caaatttccaaggaaaaaacaaaccaactccatcaaaaagtgggcaaaggatatgaacag ccacttctcaaaagaagacgtttatgcagccaaaaaacacatgaaaaaatgctcatcatc actggccatcagagaaatgcaaatcaaaaccgcaatgagataccatctcacaccagttag >gi568815586r:15009113_15317489|GENSCAN_predicted_peptide_5|225_aa MCEVKPDGTLRRNRRIRHYSWRLQHLAIRNGEIQQAGTHSVLGSHTSHLLLRHNDPYFLE ETRTVTEKRVRTAVKKNMNLRVGDRRPRNKEDTIQREGHMRWGEGFVLVYDITDRGSFEE VLPLKNILDEIKKPKNVTLILVGNKADLDHSRQVSTEEGEKLATELACAFYECSACTGEG NITEIFYELCREVRRRRMVQGKTRRRSSTTHVKQAINKMLTKISS >gi568815586r:15009113_15317489|GENSCAN_predicted_CDS_5|678_bp atgtgtgaggtaaaacctgatggaactttaaggagaaatagacgaatccgccattatagt tggagacttcaacaccttgctatcagaaatggagagatccagcaggccgggactcactcg gtcctgggatctcatacttcacatctccttctgaggcacaatgacccatatttcttggag gagaccagaacggtcacagaaaagagagtaaggactgctgtaaaaaagaacatgaacttg agggttggtgatagaagacccaggaacaaggaagataccattcagagggaggggcacatg cgatggggggaaggctttgtgctggtctacgacattactgaccgaggaagttttgaggaa gtgctgccacttaagaacatcctagatgagatcaaaaagcccaagaatgtgactctcatc ttggttggaaacaaagctgacttggaccactccaggcaggttagcacagaagaaggagag aagctggccacagaattggcttgtgctttttacgagtgctctgcctgcactggagaaggg aacatcacagagatattctatgaattgtgtcgagaggtgcgtcgccggaggatggtgcag ggcaagacgaggcgacgcagctccaccacgcatgtcaagcaagccattaacaagatgctc accaaaatcagtagttag >gi568815586r:15009113_15317489|GENSCAN_predicted_peptide_6|611_aa MGTGTGGHQCGWNGEHKVGQVEEEEVSEEVKKGVEYERKRGVKDDSKLPGSESVAGVPSV TGPVHVLQSDKGISTAGVLRCLSGWQHRGAAVPDQMMKTSSARRVMQKEWENKNFSVSFL ILGLWTSHQGTLSFSLMIIDDDDDDDDDGDGGDGGDGGGGGGSGVGGGVGGGGGGLRHRL ESTPSVDLSWASSLQTAGLQNQISRSVEPHSLCHHGFKISQRLETGAVSGLYWAYTQPCI GRRQTTKACWGGYVYIDEERVKNGEKKVRSEVTHRTKWKIPAGHCKGHLKSEKGISIPLQ VSMGKLCLVNNLLNWIHVKSNEKAVRLAASASLKNLLEIQILWLRCRPTESEPVEVKEDT NPVKPQLGEPCQDRLLVFTSPSQRTPKPPWEENPVTKRTDSEAERAQGSEWVCRKSSRKS CLDTCVTLTCSHSPGQESYGPLQVRNGSPKIAKPSHTAQPRLQVAFASCWAAVVGTWMEL EIIILSELSQERKTKHRIFPLIDLFRICSAQVTLVVHMALNRRALFLIGSNLSRSGDEEE LKWLVSKRLGIMDAGKDGESEEPLYTVEEEENKEEEKEEENCEDKEESYTERQLAIFFFT RFREAGTFFIS >gi568815586r:15009113_15317489|GENSCAN_predicted_CDS_6|1836_bp atgggtacaggaacaggaggccaccagtgtggctggaatggagagcacaaagtagggcaa gtggaagaagaagaggtgagtgaggaggtgaagaagggtgtggagtatgagaggaaaaga ggagtgaaggatgactccaagctccctggctctgagagtgtcgcaggtgtgccatcagtc acgggacctgtgcacgtgctgcaatctgacaaggggatttcaacagcaggtgtgctcagg tgcctcagtggatggcagcacagaggggcagctgtgcctgaccaaatgatgaagacttcc agtgctaggcgtgttatgcaaaaagaatgggaaaataagaatttcagtgtctccttttta attttgggactctggacaagtcaccaaggcactctgagcttcagtctcatgataattgat gatgatgatgatgatgatgatgatggggatggtggtgatggtggtgatggtggtggtggt ggtggtagtggtgttggtggtggtgttggtggtggtggtggtggcctcagacacagactg gaatctacaccatctgtggacctttcctgggcctccagcttgcagacagcaggtctgcaa aatcagatcagcagatctgtggaaccacacagcctctgccatcatggcttcaagatcagc caaagactagagactggggccgtctcaggtctttactgggcatacacacaaccttgcata ggaagaaggcaaacgactaaagcctgctggggtggttacgtttatattgatgaggagaga gtaaagaatggagaaaagaaagtgagaagtgaggtcacacatagaaccaagtggaaaata cctgcagggcattgcaaaggccatttaaaatctgagaaaggcatttctattcctcttcaa gtttccatgggtaagctctgcctggtcaacaatttgttgaactggattcatgttaagtca aatgagaaagcagtcagactagcagcatcagcatcactgaagaacttgttagaaatacaa attctctggcttcggtgcagacctactgagtcagaacctgtggaggtaaaggaagataca aaccctgtaaagccacagctaggagagccctgtcaagataggctgctggtgttcacatct ccctcgcagcgaaccccaaagccgccctgggaggagaacccggtgaccaagaggaccgac tcagaggccgagcgcgctcagggctcggaatgggtctgcaggaagtcgtcccgaaagagc tgcctggacacgtgcgtgacactcacctgttcacactctccaggccaggagagctacggt cctctgcaagttcggaatgggtcccccaagatcgcgaagccctcccacactgcgcagccc aggcttcaggttgcgtttgcgagttgctgggctgcggtcgtggggacatggatggaattg gaaatcatcattctcagtgaactatcgcaagaacgaaaaaccaaacaccgcatattccca ctcatagatcttttccgcatctgctctgcccaggtgactcttgtagttcacatggccctg aacagaagggctctgtttcttattggctctaacctttctagatctggggatgaagaagag ttaaaatggcttgtatcaaaaagactgggcataatggatgctggaaaggatggggagtca gaggaacccttgtacactgttgaggaggaggagaacaaagaagaagaaaaggaagaagaa aactgtgaggacaaagaggaaagctacacagaaagacagctcgcaattttttttttcaca agatttagagaagctggaacattctttatctcctag >gi568815586r:15009113_15317489|GENSCAN_predicted_peptide_7|139_aa MFRRLQASYWTVWKQKQNKIRGNRSEKAGTNNKQKLPETSSRSAASILYKGVWPVHSSTS RCDCSRGLPPYQDMTGKIVLIPQVSYGLWQTEKRKSHWQPLWQVLDHTGMLDIAHFPPSC SNELNRMTLTKSISMWKAM >gi568815586r:15009113_15317489|GENSCAN_predicted_CDS_7|420_bp atgttcagaagactacaagcctcttattggacagtctggaaacagaagcaaaataaaatt cggggaaatagaagtgaaaaagctggcacaaataacaagcaaaagcttccagaaacatca tcccgttcagctgcatccatcctgtacaaaggagtctggccagtgcactcatcaaccagt cgctgtgactgttccagaggattgccaccttaccaggacatgactggaaaaattgtacta atcccacaagtgtcctatggcctgtggcagactgaaaaaaggaagagccattggcagcca ttatggcaggttttggaccatacaggaatgcttgacattgcacattttcctccaagttgt agcaatgagttaaaccgcatgaccctcacaaagagtatttcaatgtggaaggccatgtag >gi568815586r:15009113_15317489|GENSCAN_predicted_peptide_8|344_aa MTLPSELNKASGTDPGKTEICDCSERECKIAVLRKSKEIQDNTEKEFRILLDNLNKEIET VKKNQAEILELKNVIGILKSASQSFNKTTDQVEERLSRSWNSLEDSEEDRKIWESLETPR DLLNGFNQNADSDMDNEVQAEVVSGDKKLAGDWSKCHSCYALAKRLEAFFPCPRDLWNFE LKRDDLEYLVGEISKKQSLEIQEVTEHKSLENLQPDNAVENKNPIFWGEIQAGCRNLDKP GGLGGKNGFLGCPLPCCVQPQSLALCAPAAPAMAKQGQGTAQAVASEGASLKPWQLSCGV GPTGAQRTRIEVWDPLPRFQMMYGNTWKSRLNSSARAEPLVENL >gi568815586r:15009113_15317489|GENSCAN_predicted_CDS_8|1035_bp atgaccttaccaagtgaactaaataaagcatcagggactgatcctggaaaaacagagata tgtgactgttcagaaagggaatgcaaaatagctgtgttgaggaaatctaaagaaattcag gataacacagagaaggaattcagaattctattagataatttaaacaaagaaattgaaaca gtgaaaaagaatcaagctgaaattctggagctgaaaaatgtaattggcatactgaaaagt gcatcacagtcctttaataaaacaacggatcaagtagaagaaagactcagcagaagttgg aacagtttggaggactcagaagaagacaggaaaatatgggaaagtttggaaactcctaga gacttattgaatggctttaaccaaaatgctgatagtgatatggacaatgaagtccaggct gaggtggtctcaggagacaagaaacttgctggggactggagtaaatgtcattcttgctat gctttagcaaagagactggaggcttttttcccctgccctagagatctgtggaactttgaa cttaagagagatgatttagaatatctggtgggagaaatttctaagaagcaaagcttagaa attcaagaggtgacagagcataaaagtttagaaaacttgcagcctgacaatgcagtagaa aacaaaaatcccattttctggggagaaattcaagctggctgcagaaatttggataagcct ggaggcctaggagggaaaaatggttttctgggatgccctctcccctgttgtgtacagcct cagagcttggcgctctgtgccccagctgctccagccatggctaaacagggccaaggtaca gctcaggctgttgcttcagagggtgcaagcctcaagccttggcagctttcatgtggtgtt ggtcctacaggtgcacagaggacaagaattgaggtttgggatcctctgcctagatttcag atgatgtatggaaacacctggaagtccaggctgaactcttctgcaagggcagagcccctt gtggagaacctctga