GENSCAN 1.0 Date run: 6-Nov-116 Time: 10:58:19 Sequence gi568815593f:139500370_139726798 : 226429 bp : 49.64% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1135 1137 3 0 0 108 81 0 0.461 1.30 1.02 Intr + 3251 3444 194 1 2 0 89 135 0.504 3.09 1.03 Intr + 4401 4438 38 0 2 65 87 18 0.172 -2.69 1.04 Intr + 7884 8072 189 1 0 90 80 61 0.608 5.06 1.05 Term + 10582 10730 149 2 2 101 55 37 0.296 -0.24 1.06 PlyA + 11473 11478 6 1.05 2.00 Prom + 17005 17044 40 -6.66 2.01 Sngl + 17682 18161 480 2 0 83 37 207 0.907 11.32 2.02 PlyA + 19664 19669 6 1.05 3.06 PlyA - 23043 23038 6 1.05 3.05 Term - 23236 23154 83 2 2 111 37 14 0.219 -3.54 3.04 Intr - 23415 23292 124 0 1 100 53 76 0.384 5.46 3.03 Intr - 43282 43157 126 1 0 33 74 102 0.370 4.18 3.02 Intr - 61235 60936 300 2 0 58 52 147 0.424 5.03 3.01 Init - 61844 61422 423 2 0 79 44 329 0.982 21.95 3.00 Prom - 91208 91169 40 -3.46 4.00 Prom + 93968 94007 40 -4.96 4.01 Init + 95196 95244 49 2 1 89 99 63 0.901 6.61 4.02 Intr + 122999 123092 94 0 1 99 106 32 0.060 5.12 4.03 Term + 126387 126432 46 0 1 85 32 41 0.029 -5.12 4.04 PlyA + 126461 126466 6 1.05 5.10 PlyA - 126895 126890 6 1.05 5.09 Term - 129355 129346 10 0 1 142 32 -3 0.070 -2.83 5.08 Intr - 135215 135081 135 1 0 93 116 5 0.077 3.48 5.07 Intr - 138006 137810 197 0 2 79 65 123 0.243 7.31 5.06 Intr - 140811 140641 171 0 0 71 80 31 0.129 0.74 5.05 Intr - 142958 142892 67 1 1 112 85 4 0.159 1.41 5.04 Intr - 147599 147541 59 2 2 91 85 43 0.237 1.98 5.03 Intr - 154765 154551 215 2 2 90 41 79 0.169 1.83 5.02 Intr - 157079 156951 129 0 0 65 56 57 0.161 0.87 5.01 Init - 160727 160505 223 2 1 86 55 149 0.231 8.12 5.00 Prom - 166983 166944 40 -4.56 6.00 Prom + 175435 175474 40 -2.46 6.01 Init + 180155 181078 924 1 0 96 82 1198 0.832 113.93 6.02 Intr + 184959 185063 105 2 0 70 103 44 0.663 4.51 6.03 Intr + 185700 185768 69 2 0 104 80 36 0.884 3.78 6.04 Intr + 188398 188496 99 0 0 62 41 72 0.503 0.21 6.05 Intr + 189294 189502 209 2 2 99 48 94 0.268 4.38 6.06 Intr + 197039 197171 133 2 1 62 44 58 0.086 -0.55 6.07 Intr + 205310 205331 22 1 1 125 61 22 0.166 0.32 6.08 Intr + 210872 211039 168 0 0 76 85 104 0.385 8.82 6.09 Intr + 211663 211762 100 2 1 41 23 113 0.103 -0.73 6.10 Term + 220180 220417 238 1 1 25 43 170 0.374 1.84 6.11 PlyA + 222178 222183 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 171319 171105 215 1 2 85 23 212 0.803 10.64 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:139500370_139726798|GENSCAN_predicted_peptide_1|190_aa MKEETPDISEHLKEQTLDTPSLRTVTLTVRVCGFILEVSETKNPPEGTNSRHILSTQMGL SPIAKWRTGNGAGFQECIEDGEPIKHDCQQIIVQTYAARADLLEVPLANPDLNLYTNGSS FVENGIRRAGYAIVSDVTVLESPMAAILLLLAFRPCIFNLVKFVSSRIEAIKLQMVLQME PQMSSNNNFY >gi568815593f:139500370_139726798|GENSCAN_predicted_CDS_1|573_bp atgaaggaagaaactccggacatatctgaacatctgaaggaacaaactctggacacacca tctttaagaactgtaacactcaccgtgagggtctgcggcttcattcttgaagtcagtgag accaagaacccaccagaaggaaccaattccagacacattttgtctacccagatgggacta tcgcctatcgccaagtggaggacaggcaacggtgcaggttttcaagaatgcattgaggat ggggaaccaatcaagcatgactgccaacaaattatagtccagacttatgccgcccgagct gatctcttagaagtccccttagctaatcctgaccttaacctatataccaatggaagttca tttgtggagaatgggatacgaagggcaggttatgccatagttagtgatgtaactgtactt gaaagtcccatggcagccatcttgctattactcgccttcaggccctgtatttttaacctt gtcaaatttgtttcctctaggatcgaagccatcaagctacagatggtcttacaaatggaa ccccaaatgagctcaaataacaacttctactga >gi568815593f:139500370_139726798|GENSCAN_predicted_peptide_2|159_aa MPPAGDFIRPGAKARPPLEGAALSSRAARMDSRQSRTREAGGGCSVRLPRLSGGRSCLGP GTTGWRLWLTSPRLCPLLPGARVRNLCFTLESAAILPLNQTGMASGCPSCLLIVSEPFPD PYPLSDQQDDDRKRIQDQLLAVTLRKQSHTPVSLTMQWK >gi568815593f:139500370_139726798|GENSCAN_predicted_CDS_2|480_bp atgcctccggccggtgacttcatccggcccggggccaaggcccggcctccgctagagggc gctgctctcagcagccgcgccgcccggatggactcgcgccagagtaggacaagagaggca ggcggcggctgctccgtgcggctcccacgcctctcgggcggcaggtcctgcctcggacct gggaccactggctggcgcctgtggctcacatctccacgcctttgcccgctcctcccagga gcccgagtcagaaacctgtgcttcaccctcgagagcgctgccatcctccccctcaaccag acgggcatggcgtctggttgtccgtcctgtcttctcatcgtctcagagcccttccctgac ccctaccctctctcagaccaacaggatgatgacaggaagaggatccaggaccaactccta gcggtgaccttgagaaaacaatctcataccccagtttcccttacaatgcaatggaaataa >gi568815593f:139500370_139726798|GENSCAN_predicted_peptide_3|351_aa MGREALSTSVEGAAQRPSAPLGPNPAILARQWPSSAAAVPDWREALRVHTMRAWKGSRGG TLRPREHLRPKRGLRAGASTGAITGSGVWRPPRGASTGSKIQGGREPDGDYGGKCRPQPA QLASSAADLRPLTLWILFRAMRSPHHPAAAPLFAAALAATAAAAVAASSSQDQRRLLNRR RSLRQKAKAQASGSAGLSWSDSRRDCLSSSLLQVDRGTRGRAGCAAGVGPSEHGVGPTEG QAGFERPESLVRKLPLQPGPSLAARGSVRSPVSSPGQLAPSGGVPLLATPPRDQRLLVLR KGSVHAFLIPGSAAGPGQNVALREGWPGLVDLGNPTSGAEGGTKSSGSSTC >gi568815593f:139500370_139726798|GENSCAN_predicted_CDS_3|1056_bp atggggcgagaagccctttcgacctctgtcgaaggggccgcccaacgccccagcgccccc ctcggccccaaccccgccatcctggcccgacaatggccctcaagtgcagcggcagtccca gactggagagaagccctaagagtccacaccatgcgggcctggaagggcagccggggagga acgctgaggccgcgagagcacctccggcctaagcgcgggctgcgcgccggggcctccacg ggcgccatcactgggagtggggtatggaggccgccccgaggggccagcaccggcagcaag atccaaggaggccgcgagcccgacggagactacggcgggaagtgcaggccgcagcctgct cagctggccagcagcgcagccgacctccggccgcttaccttgtggattctcttcagagcc atgcggtcaccgcatcacccggcggcggcccctttattcgccgccgccctagccgccacc gccgccgccgccgtagctgcctcttcctcccaggatcagcgccgcctcctaaaccgccgc agatccctccgccagaaagcaaaagcccaagcctcgggctcggccggcctctcttggtcg gacagccggagggattgcctgtcttcctccctgctgcaggtggaccgagggacgcgcggg cgggcgggctgcgccgccggggtggggccgtcggagcacggggtgggcccgaccgagggg caggctggttttgagaggcccgagtcgctagtgagaaagctgcctctgcagccaggcccc tccctggcggcccgaggctccgtccggtcccccgtgtcgtcgccggggcagctggcgccc tccggcggggtgccgctgctagccaccccacctagagatcaaaggttactagtgctgaga aagggctccgtccacgccttcctcatcccagggtcagctgctggccctggccagaatgtg gctcttcgagaagggtggccaggtctggtcgacctaggcaacccaaccagcggggctgag ggtggaacaaaaagctctggctcctccacttgctag >gi568815593f:139500370_139726798|GENSCAN_predicted_peptide_4|62_aa MGFRHVGQAGLELLTSVLLSICSLLCDPNPDDPLVPEIARIYKTDREKYNRIAREWTQKY AM >gi568815593f:139500370_139726798|GENSCAN_predicted_CDS_4|189_bp atggggtttcgccatgttggccaggctggtctcgaactcctgacctcagtactcttgtcc atctgttctctgttgtgtgatcccaatccagatgatcctttagtgcctgagattgctcgg atctacaaaacagatagagaaaagtacaacagaatagctcgggaatggactcagaagtat gcgatgtaa >gi568815593f:139500370_139726798|GENSCAN_predicted_peptide_5|401_aa MGKSMRAEVGEAVLVPQTPAALAPSAQAAAPGFLFHRNLSWEKPIGSSCLSRRPPGPGKL PLRGQARSLASQPVAYQPISQHIQPIAKGYGGLKEERARPVPSKHLRSPFCVPDSVEGLI WVQPAFQSCQPPYQESQEACQHSRCHGTSWGRSGRHPASPSQEKSAKPPLSTETCGEGRL GLTAQLSQQSGARSFEDREGEFGRAPQLRLQGMGCSDSAWGWARDELLFELGLVPVGAKC LGKASGCGWTKSVPCRGEGHRKKRDGPGPAVGALVATPGFLSLDPSAWTAAQSHVSTSNR GVWKLAQAPEREGDRGLRVLRRDGFQRVTHSAEQGMGRHDLGPSFYEPQFPVLGAGKQLG EIWPPSTAYPSSVAGGKSSPNPRLLFTRQDNWIPFCPPRPS >gi568815593f:139500370_139726798|GENSCAN_predicted_CDS_5|1206_bp atggggaagagcatgagggcggaggtgggggaggctgtcctggtcccccagactccggct gccctggccccctcggcccaggctgcagctccggggttcctgtttcatcgaaacctgagc tgggaaaagcctatcgggtcatcttgcctgtcccgccggccaccaggcccaggcaagctg cccctcaggggacaggcccgcagcctggcctcccagccagtggcctatcagcccatctcc caacacatccagcccattgccaagggttatgggggcctcaaagaggaaagagcaaggcct gtgcccagcaagcatttaaggagtcccttctgtgtgccagatagtgtggaagggctgatc tgggtacagccggccttccagagctgccagccaccctatcaagaaagccaggaggcctgt caacacagccgctgccacggcacctcctggggaaggagcgggagacatccagcttctcct tcccaggaaaaatcagccaagccaccactttccacagagacctgtggggagggacggctg ggcctcacagctcagctgagccagcagagcggggccagaagcttcgaggaccgggaggga gaatttggccgcgccccccagctcagactccaaggaatggggtgctcagacagtgcttgg gggtgggcccgggatgaactgctctttgaactgggcctggtcccagttggtgctaaatgt ttagggaaggcatcgggatgtggctggaccaagtctgttccatgtcggggggaagggcac aggaaaaagagagatggaccagggcctgctgtgggggctctagtggccacacctggattc ctttccttagatccatctgcctggacagctgcccagagccacgtcagcacttccaaccgg ggagtctggaaactggctcaggcccctgagcgagaaggggacagggggctgcgggtcctc cgacgggatggttttcagagggtgacccactcagctgagcagggaatgggtcggcatgac cttgggccgagcttctatgaacctcagtttcccgtgctcggagctgggaaacagctgggg gagatctggcctccttccacagcatacccctcctccgtggctggaggcaaatcctccccc aaccccaggcttttattcaccaggcaggacaactggattcccttttgtcccccaagacct tcctaa >gi568815593f:139500370_139726798|GENSCAN_predicted_peptide_6|688_aa MSSLGGGSQDAGGSSSSSTNGSGGSGSSGPKAGAADKSAVVAAAAPASVADDTPPPERRN KSGIISEPLNKSLRRSRPLSHYSSFGSSGGSGGGSMMGGESADKATAAAAAASLLANGHD LAAAMAVDKSNPTSKHKSGAVASLLSKAERATELAAEGQLTLQQFAQSTEMLKRVVQEHL PLMSEAGAGLPDMEAVAGAEALNGQSDFPYLGAFPINPGLFIMTPAGVFLAESALHMAGL AEYPMQGELASAISSGKKKRKRCGMCAPCRRRINCEQCSSCRNRKTGHQICKFRKCEELK KKPSAALECANPQPSVSPEPTPQHTAETQSSSPVPASHCFVEMVPSPLLCYSWETDFLVT QEPFLQPITTLVILGDITWPQGYGLQLSKGSAGTQHMEGAPSPSSSFEACPGGAPPVAVL ALQQWVHRQLLGVLGSGCKGEGEQWPLPLIRRRKPQERGRSEAMETRLRLSLALRLSEGR GRIGTSGGAAASPAASDPTPSGPPLLPDLWAWLTKQPNQKGARSPARAGAPGAGGRSHLQ LWRNESGAAAAAAAAAERVWHPESPGIENNLGKAVGGRNVTAAALSTLGGNSVWSEELAL SPAKLPAAGGPQNGRSTNSLHHAPGKAADTQRQPVKAARREAVPCKMTGVELPKTMGTYL LHQRDLDVRPGVKGDHFGALKFDCPAGF >gi568815593f:139500370_139726798|GENSCAN_predicted_CDS_6|2067_bp atgtcgagcctcggcggtggctcccaggatgccggcggcagtagcagcagcagcaccaat ggcagcggtggcagtggcagcagtggcccaaaggcaggagcagcagacaagagtgcagtg gtggctgccgccgcaccagcctcagtggcagatgacacaccaccccccgagcgtcggaac aagagcggtatcatcagtgagcccctcaacaagagcctgcgccgctcccgcccgctctcc cactactcttcttttggcagcagtggtggtagtggcggtggcagcatgatgggcggagag tctgctgacaaggccactgcggctgcagccgctgcctccctgttggccaatgggcatgac ctggcggcggccatggcggtggacaaaagcaaccctacctcaaagcacaaaagtggtgct gtggccagcctgctgagcaaggcagagcgggccacggagctggcagccgagggacagctg acgctgcagcagtttgcgcagtccacagagatgctgaagcgcgtggtgcaggagcatctc ccgctgatgagcgaggcgggtgctggcctgcctgacatggaggctgtggcaggtgccgaa gccctcaatggccagtccgacttcccctacctgggcgctttccccatcaacccaggcctc ttcattatgaccccggcaggtgtgttcctggccgagagcgcgctgcacatggcgggcctg gctgagtaccccatgcagggagagctggcctctgccatcagctccggcaagaagaagcgg aaacgctgcggcatgtgcgcgccctgccggcggcgcatcaactgcgagcagtgcagcagt tgtaggaatcgaaagactggccatcagatttgcaaattcagaaaatgtgaggaactcaaa aagaagccttccgctgctctggagtgtgccaacccacagcccagcgtgtctcctgagccc actccccagcacacggccgaaactcagagctcctcacctgttccagcttcacactgcttt gtggaaatggtgccctcacctctcctctgctacagttgggaaactgacttcctggtgacc caggagcctttcctccagcccatcacaacactggtcattcttggtgacatcacctggcct cagggctatggtctgcagctcagtaagggcagcgctggcacccaacacatggaaggggct cccagcccctcttccagctttgaggcctgcccgggaggggcaccccctgtggctgttttg gctctgcagcagtgggtccacaggcagctgctgggagtcttggggagcggctgcaagggt gagggcgagcagtggccgctgccgttgataaggcgacggaagcctcaggagcgaggacgc agtgaagccatggaaacacggctgaggctttccctcgcgctgcggctgtcggagggccgc ggccggatagggacaagcggcggagcagcagcaagtccggccgcctccgatcctacgcct tctggccctcccctccttcccgacctctgggcctggctgacaaagcagcccaatcagaag ggcgcgcgcagcccagcgcgggccggggcaccgggcgcaggcgggaggtcgcatctgcag ctttggcgcaatgaatcaggcgcggccgccgccgccgccgccgccgccgagcgcgtctgg catccggagtctcccgggatcgaaaataatttgggcaaagcggtgggaggcagaaatgtc actgccgctgcattaagcacgctcggcgggaactcggtttggagcgaggagctggcgctc agccctgccaagctgcctgccgcagggggaccccagaatggtagatccaccaacagcttg caccatgcacctggaaaagctgcagacactcaacgccagcccgtgaaagcagccaggagg gaggctgtaccctgcaaaatgacaggggtagagctgcccaagaccatgggaacctacctc ttgcatcagcgtgacctggatgtgagacctggagtcaaaggagatcattttggagcttta aaatttgactgccccgctggattttga