GENSCAN 1.0	Date run: 3-Nov-116	Time: 07:16:49

Sequence gi568815585r:76785174_76986091 : 200918 bp : 40.74% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   4761   4900  140  1  2   97   38   116 0.224   4.64
 1.02 PlyA +   5049   5054    6                               1.05

 2.03 PlyA -   5619   5614    6                               1.05
 2.02 Term -  11904  11815   90  0  0   94   44   113 0.939   4.24
 2.01 Init -  13540  13460   81  1  0   60   89    26 0.529   0.92
 2.00 Prom -  19158  19119   40                              -4.65

 3.00 Prom +  26158  26197   40                              -2.65
 3.01 Init +  30074  30126   53  1  2   76   79    18 0.295   0.38
 3.02 Term +  31064  31373  310  2  1   64   43   237 0.631  10.35
 3.03 PlyA +  31734  31739    6                               1.05

 4.07 PlyA -  32018  32013    6                               1.05
 4.06 Term -  34006  33928   79  0  1   87   49    86 0.556   0.86
 4.05 Intr -  36885  36794   92  0  2    9  110   114 0.698   3.57
 4.04 Intr -  41687  41603   85  1  1   77   99    47 0.128   3.60
 4.03 Intr -  45683  45540  144  1  0   77   39   134 0.563   5.88
 4.02 Intr -  45971  45831  141  1  0   54   53   115 0.212   3.15
 4.01 Init -  52083  52004   80  0  2   78   80    54 0.914   4.18
 4.00 Prom -  60192  60153   40                              -5.25

 5.03 PlyA -  60478  60473    6                               1.05
 5.02 Term -  66324  66220  105  0  0   55   55   113 0.090   2.13
 5.01 Init -  68724  68620  105  0  0   60   89    59 0.075   3.47
 5.00 Prom -  96414  96375   40                              -3.95

 6.02 PlyA -  98278  98273    6                               1.05
 6.01 Sngl - 100975  99998  978  1  0   69   49  1390 0.954 129.73
 6.00 Prom - 109286 109247   40                              -7.35

 7.00 Prom + 111351 111390   40                              -0.75
 7.01 Init + 114192 114241   50  2  2   83  103    63 0.647   7.77
 7.02 Intr + 122481 122575   95  0  2  109   78    16 0.075   1.39
 7.03 Intr + 136022 136135  114  0  0   56   68   104 0.701   4.70
 7.04 Intr + 138752 138874  123  0  0   38   80   124 0.821   6.24
 7.05 Intr + 142888 142979   92  2  2   69   32   123 0.589   3.59
 7.06 Term + 143328 143528  201  2  0   24   42   192 0.433   4.61
 7.07 PlyA + 143763 143768    6                               1.05

 8.00 Prom + 144400 144439   40                              -5.75
 8.01 Init + 149537 149587   51  1  0   89  105    15 0.343   4.51
 8.02 Intr + 150609 150724  116  2  2   23   99    31 0.290  -3.97
 8.03 Intr + 152638 152778  141  1  0   19  113    95 0.466   3.65
 8.04 Intr + 156454 156618  165  1  0  -30   76   159 0.478   1.05
 8.05 Intr + 164177 164496  320  2  2   54   32   186 0.261   4.28
 8.06 Intr + 167326 167477  152  2  2   50  116   104 0.660   8.36
 8.07 Intr + 168427 168516   90  0  0   77   59    58 0.660   1.07
 8.08 Intr + 170146 170351  206  0  2   70   64   203 0.892  13.18
 8.09 Intr + 171837 172761  925  0  1  107   55   773 0.305  66.20
 8.10 Term + 174502 174609  108  0  0   87   48    68 0.187   0.23
 8.11 PlyA + 176486 176491    6                               1.05

 9.00 Prom + 178514 178553   40                              -3.45
 9.01 Init + 179915 179963   49  1  1   86   58    63 0.798   2.26
 9.02 Term + 189202 189410  209  2  2   56   37   146 0.083   2.82
 9.03 PlyA + 189893 189898    6                               1.05

10.04 PlyA - 190236 190231    6                               1.05
10.03 Term - 191513 191376  138  2  0   73   43   141 0.962   5.08
10.02 Intr - 195251 195048  204  2  0   90   63   191 0.854  15.27
10.01 Init - 196524 196363  162  0  0   82    3   191 0.865   9.78

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815585r:76785174_76986091|GENSCAN_predicted_peptide_1|46_aa
XSAGGPPADVARVSEYHQSIPQMKDTVPKLRSPTGAINQPRASIKI

>gi568815585r:76785174_76986091|GENSCAN_predicted_CDS_1|141_bp
naatctgcaggtggccctcctgcagatgtggccagagtatcagagtaccaccaatcgatt
ccacaaatgaaggataccgtgcctaaattaagatctcccactggcgctatcaatcagcca
agagcctcaattaaaatttga

>gi568815585r:76785174_76986091|GENSCAN_predicted_peptide_2|56_aa
MAKQRLENCSVLSYLKKALDIHVINPRYFTISHDAASPDEAEIEMSHRQDQLDVVE

>gi568815585r:76785174_76986091|GENSCAN_predicted_CDS_2|171_bp
atggcaaagcagagattagaaaattgttctgtgcttagttacctaaaaaaggctcttgat
attcatgtgattaatcccagatattttaccatctcacatgacgctgcttctccagatgaa
gccgagattgaaatgtcacaccggcaggatcaattagatgttgtagaatag

>gi568815585r:76785174_76986091|GENSCAN_predicted_peptide_3|120_aa
MNKGVCVSSWWNNLLSSGEDDADDNVDDGDGNGDNNGWIMACRALSYKSIQSALLKMFMK
IDQQAAVENLVDMSGRVSRYWFSLVVYRRVYCLLTPTALNDMAEEAQKLQEATASGRSTC

>gi568815585r:76785174_76986091|GENSCAN_predicted_CDS_3|363_bp
atgaacaaaggagtgtgcgtgtcttcttggtggaacaatttactttcctctggtgaagat gatgctgatgacaatgttgatgatggtgacggtaatggcgataataatggttggataatg gcctgcagggctctttcctataaaagcatccagagtgctttactgaaaatgttcatgaag attgaccaacaagctgcagtagagaacttagtagacatgagtggcagagtctctaggtac tggttttctctggtagtgtatagacgtgtttactgcctcctcactcccactgcactgaat gacatggcagaagaggctcagaagctgcaggaggcaactgcttcaggaagaagcacctgt tag >gi568815585r:76785174_76986091|GENSCAN_predicted_peptide_4|206_aa MNVPTPGDTEKESCDVCVGVFSSTSNRYCGLDRSETVGLVGAESQSVILEAESPGPAFKR FWEEAANRRRWYRRLSLFLQHRVLELPSGEGSSEPKSPLKWTGFGGYRSKDCDQKATRNW SLKAVKGMLPAPPGRETRATIQCQGPTWDLCLKQSGKRERSRDQGWEARAGAEREVGITD SLILDPITLQGVSPSKTSAHRILSAS >gi568815585r:76785174_76986091|GENSCAN_predicted_CDS_4|621_bp atgaatgtccctactcctggtgacactgaaaaggaatcttgtgatgtttgtgttggggta tttagtagcaccagtaacagatactgtgggcttgacaggagtgaaactgtgggtttggtt ggtgctgagagtcagtctgtgatcctggaggcagaatcaccgggacccgctttcaagagg ttctgggaggaagctgcaaaccgacggaggtggtatcggagactgagcttgttcttgcag cacagggtcttagaactgccgtccggggaaggcagttcagagcccaagagccctttgaaa tggacaggctttggtggctaccgaagcaaggactgtgaccagaaggcaacacgaaactgg agtctcaaagcagttaaaggaatgcttccagctccaccagggagagaaacgagggccaca attcagtgccaggggcctacttgggacttgtgcctgaagcagagtggaaagcgagaaagg tcccgtgatcaaggatgggaagcaagagcaggagcagagagggaagtgggaatcacagac agcttaatccttgatcccataacccttcaaggtgtttctcccagtaaaacttctgcacat cgaatcctgtctgcttcttga >gi568815585r:76785174_76986091|GENSCAN_predicted_peptide_5|69_aa MDRKKTVLNKIGTKIEFEKYREVKKRHKNGVALKMGSSQPRELKLKDCSEKSHGVVLGHE GHNRPNETF >gi568815585r:76785174_76986091|GENSCAN_predicted_CDS_5|210_bp atggacaggaagaagactgtgctaaacaaaattggcactaaaattgaatttgagaagtac agagaggttaaaaaaaggcataaaaatggagtagctttgaaaatgggaagcagccagcca cgagagctgaagctgaaagattgctcagagaaaagccatggagtagttttaggccatgag ggtcacaacagaccaaatgaaactttctga >gi568815585r:76785174_76986091|GENSCAN_predicted_peptide_6|325_aa MALADSTRGLPNGGGGGGGSGSSSSSAEPPLFPDIVELNVGGQVYVTRRCTVVSVPDSLL WRMFTQQQPQELARDSKGRFFLDRDGFLFRYILDYLRDLQLVLPDYFPERSRLQREAEYF 
ELPELVRRLGAPQQPGPGPPPSRRGVHKEGSLGDELLPLGYSEPEQQEGASAGAPSPTLE LASRSPSGGAAGPLLTPSQSLDGSRRSGYITIGYRGSYTIGRDAQADAKFRRVARITVCG KTSLAKEVFGDTLNESRDPDRPPERYTSRYYLKFNFLEQAFDKLSESGFHMVACSSTGTC AFASSTDQSEDKIWTSYTEYVFCRE >gi568815585r:76785174_76986091|GENSCAN_predicted_CDS_6|978_bp atggctctggcggacagcacacgtggattacccaacgggggcggcggcgggggcggcagt ggctcctcgtcgtcctccgcggagccaccgctcttccccgacatcgtggagctgaacgtg gggggccaggtgtacgtgacccggcgctgcacggtggtgtcggtgcccgactcgctgctc tggcgcatgttcacgcagcagcagccgcaggagctggcccgggacagcaaaggccgcttc tttctggaccgggacggcttcctcttccgctacatcctggattacctgcgggacttgcag ctcgtgctgcccgactacttccccgagcgcagccggctgcagcgcgaggccgagtacttc gagctgccagagctcgtgcgccgcctcggggcgccccagcagcccggcccggggccgccg ccctcgcggcgcggggtgcacaaggagggctcgctgggtgacgagctgctgccgcttggc tactcggagcccgaacagcaggagggcgcctctgccggggcgccgtcgcccacgctggag ctggctagccgcagtccgtccgggggcgcggcgggcccgctgctcacgccgtcccagtcg ctggacggcagccggcgctcgggctacatcaccatcggctaccgcggctcctacaccatc gggcgggacgcgcaggcggacgccaagttccggcgagtggcgcgcatcaccgtttgcgga aagacgtcgctggccaaggaggtgtttggggacaccctgaacgaaagccgggaccccgac cgtcccccggagcgctacacctcgcgctattacctcaagttcaacttcctggagcaggcc ttcgacaagctgtccgagtcgggcttccacatggtggcgtgcagctccacgggcacctgc gcctttgccagcagcaccgaccagagcgaggacaagatctggaccagctacaccgagtac gtcttctgcagggagtga >gi568815585r:76785174_76986091|GENSCAN_predicted_peptide_7|224_aa MAEGKEEQVTSYMDGSRISHSGSLCHSKMSLGCSPDSPFQRPLKTKSVKIATVTSTFNSH HPDQSAAIDTVASPSTRKNITLTEGPGPYSQSSFSRRKCDNSDGGDPFLLLTSADLDQVF TLPPDEPDKMKETIMNQKLTKRQAEVHTGRKGTAHRKKKLGVNNIPGIEEVNMFTHQGTV IHFNNPEVQASLAANTFTMTGHAETKQLTEMLLSIDHKPVLQMV >gi568815585r:76785174_76986091|GENSCAN_predicted_CDS_7|675_bp atggcagaaggcaaggaggagcaagtcacatcttacatggatggcagcagaatctctcac agtgggtcgctgtgtcattcaaaaatgtcactggggtgctcgccagattcaccctttcag aggcccctaaagactaagtctgtaaaaattgccacagtcacctcaaccttcaacagccac caccctgatcagtcagcagccatcgacactgtggcaagtccctccaccagaaaaaatatt acacttactgaaggtccaggtccatacagtcaaagcagcttttcaagaaggaagtgtgac aactctgacggtggtgacccttttctcctgctaaccagtgctgatttagaccaagtgttc 
actcttccacctgatgagccagataagatgaaagaaacaatcatgaaccaaaaactcacc aaacggcaagcagaagtgcacactggtcggaaaggaactgctcacaggaaaaagaaatta ggggtaaacaatatccctggtattgaagaggtgaatatgtttacacaccaaggaacagtg attcactttaacaaccctgaagttcaggcatcgctggcagcaaacactttcaccatgaca ggccatgctgagacaaagcagctgacagaaatgctactcagcatcgatcataaaccagtg ctgcagatggtctga >gi568815585r:76785174_76986091|GENSCAN_predicted_peptide_8|757_aa MAKLTAGKEHLTNLPTLASEFQNLLKVSLLVSRVECLASFKFVSIDPGSCDGPNCSTKAG SSHGHAADDMKAFSQEGHLASLNSPSLLPDSNWSHPRFHPILRAKPNNAWHATVPSENEV TTEVPRDSSSPDHSRLDPQQTRNILTFSVIQNEMTEAGDANAMRRSSRSEDILEIDERKD HRQLSSCVFIPLTFRSFRGQGQPCVLWHTGCCSGLPEDSRAPPSCITSSTVSYGWMVQVP PGLHVTTPPPPLLPSLTFLHKTPLESFATAIHGLKVGHLTDRVIQRSKRMILDTLGAGFL GTTTEVFHIASQYSKIYSSNISSTVWGQPDIRLPPTYAAFVNGVAIHSMDFDDTWHPATH PSGAVLPVLTALAEALPRSPKFSGLDLLLAFNVGIEVQGRLLHFAKEANDMPKRFHPPSV VGTLGSAAAASKFLGLSSTKCREALAIAVSHAGAPMANAATQTKPLHIGNAAKHGIEAAF LAMLGLQGNKQVLDLEAGFGAFYANYSPKVLPSIASYSWLLDQQDVAFKRFPAHLSTHWV ADAAASVRKHLVAERALLPTDYIKRIVLRIPNVQYVNRPFPVSEHEARHSFQYVACAMLL DGGITVPSFHECQINRPQVRELLSKVELEYPPDNLPSFNILYCEISVTLKDGATFTDRSD TFYGHWRKPLSQEDLEEKFRANASKMLSWDTVESLIKIVKNLEDLEDCSVLTTLLKGPSP PEAGIECPQLFQAHIASRRWIYHSGVWRTVALFSQLH >gi568815585r:76785174_76986091|GENSCAN_predicted_CDS_8|2274_bp atggctaagttaactgcaggaaaggagcacttaacaaatctgcccacactggcatcagag ttccaaaatctcctcaaagtgagtttattagtgtccagggttgagtgtcttgcatctttt aagtttgtcagcattgatcctggaagttgtgatgggcccaactgcagtactaaagctgga tccagccatggacatgcagctgacgacatgaaggcattttcccaagaaggccacctggcc tctctaaactccccctcattactccctgatagcaactggtcccaccccaggtttcatcct atcctaagagccaagcccaataatgcgtggcatgccaccgtcccctctgaaaacgaagtg accacggaagtcccacgagactccagcagccctgaccacagccgattggacccacagcag acacggaatattctaacgtttagtgtaatccaaaatgagatgacagaggcaggagatgca aatgccatgagaaggtcaagcagatcagaagatattttggaaatagatgaacgcaaggac cacagacagctgagcagctgtgtgtttattcccctcacgtttcgttccttcaggggccag ggacagccatgtgttctttggcacacagggtgctgttcaggacttccagaggattccagg gctccaccttcctgtatcaccagcagcacggtctcatatggatggatggtgcaagtacct 
ccagggctccatgtcaccaccccacctcccccactgctccccagtctcacattcctccat aagacacccctggaaagctttgccacagcaatccatggcttgaaagtgggacacctgaca gatcgtgttattcagaggagcaagaggatgattctagacactctgggtgctgggttcctg ggaaccactacggaagtgtttcacatagccagccaatatagcaagatctacagttccaac atatccagcactgtttggggtcagccagacatcaggctcccgcccacatatgctgctttt gtgaacggtgtggctattcactccatggattttgatgacacgtggcaccctgccacccac ccttctggggctgtccttcctgtcctcacagctttagcagaagccctgccaaggagtcca aagttttctggccttgacctgctgctggctttcaatgttggtattgaagtgcaaggccga ttactgcatttcgccaaggaggccaatgacatgccaaagagattccatcccccttccgtg gtaggaacgttgggtagtgctgctgctgcatccaagtttttaggacttagctcgacaaag tgccgagaagctctggccattgctgtttcccatgctggggcacccatggccaatgctgcc acccagaccaagcccctccacattggcaatgctgccaagcatgggatagaagctgcattt ttggcaatgttgggtctccaaggaaacaagcaggtcttggacttggaggcaggatttggg gccttttatgccaactattccccaaaagtccttccaagcatagcttcctacagttggctg ctggaccagcaggacgtggcctttaagcgttttcctgcacatttatctacccactgggtg gcagacgcagctgcatctgtgagaaagcaccttgtagcagagagagccctgcttccaact gactacattaagagaattgtgctcaggataccaaatgtccagtatgtaaacaggcccttt ccagtttcggagcatgaagcccgtcattcattccagtatgtggcctgtgccatgctgctt gatggtggcatcactgtcccctcattccatgaatgccagatcaacaggccacaggtgaga gagctgctcagtaaggtggagctggagtaccctccggacaacttgccaagcttcaacata ctgtactgtgaaataagtgtcaccctcaaggatggagccaccttcacagatcgctctgat accttctatgggcactggagaaaaccactgagccaggaggacctagaggaaaagttcaga gccaatgcctccaagatgctgtcctgggacacagtggaaagccttataaagatagtcaaa aatctagaagacctagaagactgttctgtgttaactacacttctcaaaggaccctctcca ccagaggctggcattgagtgtccacagcttttccaggcacatattgcaagccgtcggtgg atctaccattctggagtctggaggacagtggccctcttctcacagctccactag >gi568815585r:76785174_76986091|GENSCAN_predicted_peptide_9|85_aa MGFRHVGQAGLELLTSESLFWGMPNSHGKSLTTLRPTSYAEAQDCHREWSHGCMASLQLF SHPSPGTRHESEDTVVDIQPSQAFR >gi568815585r:76785174_76986091|GENSCAN_predicted_CDS_9|258_bp atggggtttcgccatgttggccaggctggtctcgaactcctgacctcagaatccttgttc tggggaatgccaaattcgcatggaaaaagcctaaccaccctgaggcctacgagttatgcg gaagcccaggattgccacagggaatggtcacatggctgcatggccagccttcagctcttc 
agccatcctagcccaggcaccagacatgagagtgaagacactgttgtggatatccagcca agccaagcctttagatag >gi568815585r:76785174_76986091|GENSCAN_predicted_peptide_10|167_aa MASKAGNFYVPAEPKLAFVIRIRGISGVSPKVRKVLQLLRLRQIFSGNFVKLNKKGQNLC SLRRGRDAQVQRTDLSPTWDPIANAPALQWGYAPERNPPQGGARVQPEVGASGSARSFGG AGVKVFAPNVEQMLIVNHLKGSPTERREMCWRKVPGEPGGTSKTWQT >gi568815585r:76785174_76986091|GENSCAN_predicted_CDS_10|504_bp atggcaagcaaagccggcaacttctatgtacctgcagaacccaaattggcatttgtcatc aggatcagaggtatcagtggtgtgagcccaaaggtccgaaaggtgttgcagcttcttcgc cttcgtcaaatcttcagtggaaactttgtgaagcttaacaagaaagggcagaatctctgc agcctccgcaggggccgggatgcgcaggttcagcggacagatctctctcccacctgggac cccatcgccaacgcgccagcgctccagtggggctatgcgccagagcggaatccgccgcaa gggggcgcgcgggtgcagccggaggtgggagcgagcggctctgcacgcagcttcggaggt gctggggtgaaggtatttgcgcctaatgtcgagcagatgctcatcgttaatcacttgaaa gggagcccaacagaaagaagagaaatgtgctggaggaaagtgccaggagagcccggaggg acgtccaagacttggcagacctga