GENSCAN 1.0 Date run: 3-Nov-116 Time: 20:50:58 Sequence gi568815597r:204517854_204719992 : 202139 bp : 47.35% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 1990 1985 6 1.05 1.02 Term - 11312 10992 321 2 0 46 46 327 0.991 19.12 1.01 Init - 11734 11570 165 1 0 68 49 150 0.346 7.12 1.00 Prom - 14135 14096 40 -7.36 2.00 Prom + 14862 14901 40 -5.96 2.01 Init + 19456 19463 8 0 2 119 12 0 0.124 -4.15 2.02 Intr + 20356 20455 100 1 1 102 95 29 0.636 5.11 2.03 Intr + 26682 26816 135 2 0 82 106 103 0.680 12.26 2.04 Intr + 28944 29024 81 2 0 100 64 119 0.866 10.53 2.05 Intr + 31260 31674 415 2 1 103 36 98 0.132 -0.22 2.06 Intr + 42891 42992 102 1 0 17 77 99 0.065 1.85 2.07 Intr + 45653 45762 110 0 2 63 106 55 0.258 4.80 2.08 Intr + 55779 55890 112 2 1 96 109 -16 0.452 1.25 2.09 Intr + 56952 57034 83 1 2 102 64 36 0.247 1.96 2.10 Intr + 67577 67730 154 1 1 103 67 131 0.847 12.15 2.11 Term + 69743 69771 29 0 2 89 42 14 0.141 -4.86 2.12 PlyA + 74051 74056 6 1.05 3.05 PlyA - 75255 75250 6 1.05 3.04 Term - 75707 75494 214 1 1 89 41 111 0.112 3.20 3.03 Intr - 83579 83389 191 2 2 36 94 77 0.106 1.48 3.02 Intr - 87920 87772 149 0 2 73 106 21 0.071 2.35 3.01 Init - 102139 100168 1972 1 1 96 37 2451 0.252 226.33 3.00 Prom - 110677 110638 40 -2.66 4.00 Prom + 113434 113473 40 -7.26 4.01 Init + 113797 113812 16 0 1 73 68 21 0.131 -0.99 4.02 Term + 117913 118361 449 2 2 62 44 206 0.630 8.98 4.03 PlyA + 118916 118921 6 1.05 5.00 Prom + 123621 123660 40 -7.06 5.01 Init + 131685 131775 91 2 1 70 78 40 0.307 1.85 5.02 Intr + 134500 134530 31 2 1 92 110 4 0.348 0.19 5.03 Term + 136757 136919 163 2 1 23 54 182 0.841 5.71 5.04 PlyA + 139112 139117 6 1.05 6.00 Prom + 139478 139517 40 -4.16 6.01 Init + 141563 141611 49 1 1 64 98 39 0.525 3.61 6.02 Term + 146455 146579 125 2 2 113 43 59 0.278 2.45 6.03 PlyA + 148287 148292 6 1.05 7.00 Prom + 148442 148481 40 -5.46 7.01 Init + 156561 156613 53 2 2 89 89 93 0.500 9.13 7.02 Intr + 156843 156983 141 0 0 88 55 40 0.187 0.17 7.03 Intr + 166657 166784 128 1 2 78 52 31 0.030 -1.18 7.04 Intr + 167311 167441 131 2 2 56 76 83 0.057 4.31 7.05 Intr + 171864 172007 144 2 0 69 98 22 0.075 1.78 7.06 Term + 174853 174879 27 0 0 117 47 15 0.059 -1.63 7.07 PlyA + 177118 177123 6 1.05 8.03 PlyA - 177261 177256 6 1.05 8.02 Term - 181210 181066 145 0 1 79 44 126 0.701 4.68 8.01 Init - 186369 186287 83 0 2 69 67 89 0.780 5.34 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:204517854_204719992|GENSCAN_predicted_peptide_1|161_aa MAACYPLVVAMGVLPLEGLMDHQLVEGPMDTSILGCSPLELQEDQMMVQLQGAPMPCGNS SSSGSSSSSSYDWDNSGSISYTELQQALSQMGYNLSPQFTQLLVSSYCPRSVNPARQLDC FIQVCTQLQMPTEAFREKDTAVQGNIRLSFKDVVTMTARML >gi568815597r:204517854_204719992|GENSCAN_predicted_CDS_1|486_bp atggcagcgtgctaccccctggtggtggctatgggggtcctgcccctggagggccttatg gaccaccagctggtagagggccctatggacacctcaatcctgggatgttcccctctggaa ctccaggaggaccaaatgatggtacagctccagggggcccctatgccctgtggaaattca tccagcagtggaagcagctcttccagcagttatgactgggacaactcaggctccattagc tacacagagctgcagcaagctctgtcccaaatgggctacaacctgagcccccagttcacc cagctactggtctccagctactgcccacgctctgtcaatcctgccagacagcttgattgc ttcatccaggtgtgcacccagctgcagatgccgacagaggccttccgggagaaggacaca gctgtacaaggcaacattcggctcagcttcaaggacgtcgtcaccatgacagctcggatg ctatga >gi568815597r:204517854_204719992|GENSCAN_predicted_peptide_2|442_aa MENKVQRKVPLPEKELQKTISPHCLPQSINAYILEKDVGTAIVSDTTDDLWFLNESVSEQ LGVGIKVEAADTEQTSEEVGKVIEVGKNDDLEDSKSLSDDTDVEVTSEDEWQCTECKKFN SPSKRYCFRCWALRKDWYSDCSKLTHSLSTSDITAIPEKENEGNDVPDCRRTISAPVVRP KDAYIKKENSKLFDPCNSVEFLDLAHSSESQETISSMGEQLDNLSEQRTDTENMEDCQNL LKPCSLCLNVILKVVESYKGLKQESDIISLAFLKDPSGCIGLGKDCTSCTTGSGLANGMQ VEMNCAILSRGLGTIVWVIYLEMIVPRISPCALCTDSDQVLHVISLWNSNQGQTVSDSQI LSTCRCSLGRTAQGDAALEQQQDVYGTRIPSICTRSHSHSHSQRYVPLPFTFIPATYTHT CVGYLRRSSLSHLASFAKQFFI >gi568815597r:204517854_204719992|GENSCAN_predicted_CDS_2|1329_bp atggagaacaaagtgcagaggaaagttccacttccagaaaaagaactacagaagacgata tccccacactgcctacctcagagcataaatgcatacattctagagaaggatgtgggtact gccattgtttcagatactacagatgacttgtggtttttgaatgagtcagtatcagagcag ttaggtgttggaataaaagttgaagctgctgatactgaacaaacaagtgaagaagtaggg aaagtgattgaagtgggaaaaaatgatgacctggaggactctaagtccttaagtgatgat accgatgtagaggttacctctgaggatgagtggcagtgtactgaatgcaagaaatttaac tctccaagcaagaggtactgttttcgttgttgggccttgaggaaggattggtattcagat tgttcaaagttaacccattctctctccacgtctgatatcactgccatacctgaaaaggaa aatgaaggaaatgatgtccctgattgtcgaagaaccatttcggctcctgtcgttagacct aaagatgcgtatataaagaaagaaaactccaaactttttgatccctgcaactcagtggaa ttcttggatttggctcacagttctgaaagccaagagaccatctcaagcatgggagaacag ttagataacctttctgaacagagaacagatacagaaaacatggaggattgccagaatctc ttgaagccatgtagcttatgtttgaatgttatcctgaaggtagtggagagctacaaaggt cttaagcaggagagtgacatcatctcattggcatttttgaaagatccttctggctgcatt ggccttggaaaagattgcacttcctgcaccactggcagcgggctggccaatgggatgcaa gtggaaatgaactgtgctattctgagcagaggcttagggaccattgtgtgggtaatttac ttagaaatgattgtccctagaatctctccatgtgctttgtgtacagatagtgaccaggtt ctgcatgtgatttccctgtggaattccaaccaaggacagacagtgagtgatagccaaatc ctctccacttgcagatgtagccttgggaggacagcacaaggggatgcagccctggagcag cagcaggatgtgtacggcacccgcatcccctccatctgcacccgcagccacagccacagc cacagccagcgctatgtcccgttgccattcacattcatacctgcaacgtacactcataca tgtgtcggatatttgcgaagaagctccctttcccacctagcatcttttgcaaagcagttt ttcatttga >gi568815597r:204517854_204719992|GENSCAN_predicted_peptide_3|841_aa MRLLVAPLLLAWVAGATAAVPVVPWHVPCPPQCACQIRPWYTPRSSYREATTVDCNDLFL TAVPPALPAGTQTLLLQSNSIVRVDQSELGYLANLTELDLSQNSFSDARDCDFHALPQLL SLHLEENQLTRLEDHSFAGLASLQELYLNHNQLYRIAPRAFSGLSNLLRLHLNSNLLRAI DSRWFEMLPNLEILMIGGNKVDAILDMNFRPLANLRSLVLAGMNLREISDYALEGLQSLE SLSFYDNQLARVPRRALEQVPGLKFLDLNKNPLQRVGPGDFANMLHLKELGLNNMEELVS IDKFALVNLPELTKLDITNNPRLSFIHPRAFHHLPQMETLMLNNNALSALHQQTVESLPN LQEVGLHGNPIRCDCVIRWANATGTRVRFIEPQSTLCAEPPDLQRLPVREVPFREMTDHC LPLISPRSFPPSLQVASGESMVLHCRALAEPEPEIYWVTPAGLRLTPAHAGRRYRVYPEG TLELRRVTAEEAGLYTCVAQNLVGADTKTVSVVVGRALLQPGRDEGQGLELRVQETHPYH ILLSWVTPPNTVSTNLTWSSASSLRGQGATALARLPRGTHSYNITRLLQATEYWACLQVA FADAHTQLACVWARTKEATSCHRALGDRPGLIAILALAVLLLAAGLAAHLGTGQPRKVGP QRGKTGGRNRKADLPGYGTQSNTVFPPLSLPTIFIEKKQAKVNAKGKSTFTARNILCAVV KSFNLTFRQRNRGPESLCDLPSARSRIQTRVYRIPEPKPSLPSSTNLSHGSLEWPAVQCT LMFQHAQVHHLEPLPAASPDLALDNSSSPSGYSPDETVNHKSLVFILLSPGQRVRAFVPG L >gi568815597r:204517854_204719992|GENSCAN_predicted_CDS_3|2526_bp atgaggcttctcgtggccccactcttgctagcttgggtggctggtgccactgccgctgtg cccgtggtaccctggcatgttccctgcccccctcagtgtgcctgccagatccggccctgg tatacgccccgctcgtcctaccgcgaggctaccactgtggactgcaatgacctattcctg acggcagtccccccggcactccccgcaggcacacagaccctgctcctgcagagcaacagc attgtccgtgtggaccagagtgagctgggctacctggccaatctcacagagctggacctg tcccagaacagcttttcggatgcccgagactgtgatttccatgccctgccccagctgctg agcctgcacctagaggagaaccagctgacccggctggaggaccacagctttgcagggctg gccagcctacaggaactctatctcaaccacaaccagctctaccgcatcgcccccagggcc ttttctggcctcagcaacttgctgcggctgcacctcaactccaacctcctgagggccatt gacagccgctggtttgaaatgctgcccaacttggagatactcatgattggcggcaacaag gtagatgccatcctggacatgaacttccggcccctggccaacctgcgtagcctggtgcta gcaggcatgaacctgcgggagatctccgactatgccctggaggggctgcaaagcctggag agcctctccttctatgacaaccagctggcccgggtgcccaggcgggcactggaacaggtg cccgggctcaagttcctagacctcaacaagaacccgctccagcgggtagggccgggggac tttgccaacatgctgcaccttaaggagctgggactgaacaacatggaggagctggtctcc atcgacaagtttgccctggtgaacctccccgagctgaccaagctggacatcaccaataac ccacggctgtccttcatccacccccgcgccttccaccacctgccccagatggagaccctc atgctcaacaacaacgctctcagtgccttgcaccagcagacggtggagtccctgcccaac ctgcaggaggtaggtctccacggcaaccccatccgctgtgactgtgtcatccgctgggcc aatgccacgggcacccgtgtccgcttcatcgagccgcaatccaccctgtgtgcggagcct ccggacctccagcgcctcccggtccgtgaggtgcccttccgggagatgacggaccactgt ttgcccctcatctccccacgaagcttccccccaagcctccaggtagccagtggagagagc atggtgctgcattgccgggcactggccgaacccgaacccgagatctactgggtcactcca gctgggcttcgactgacacctgcccatgcaggcaggaggtaccgggtgtaccccgagggg accctggagctgcggagggtgacagcagaagaggcagggctatacacctgtgtggcccag aacctggtgggggctgacactaagacggttagtgtggttgtgggccgtgctctcctccag ccaggcagggacgaaggacaggggctggagctccgggtgcaggagacccacccctatcac atcctgctatcttgggtcaccccacccaacacagtgtccaccaacctcacctggtccagt gcctcctccctccggggccagggggccacagctctggcccgcctgcctcggggaacccac agctacaacattacccgcctccttcaggccacggagtactgggcctgcctgcaagtggcc tttgctgatgcccacacccagttggcttgtgtatgggccaggaccaaagaggccacttct tgccacagagccttaggggaccgtcctgggctcattgccatcctggctctcgctgtcctt ctcctggcagctgggctagcggcccaccttggcacaggccaacccaggaaggtgggaccc cagagaggcaaaacaggtggcaggaataggaaggcagaccttcctgggtacgggacccaa agcaacactgtgtttccacctctcagcctccccactatttttatagagaagaaacaggct aaggtgaatgccaaggggaaaagtactttcactgcccgaaacatcctctgtgccgtggtc aagagttttaacctcacatttagacaaagaaacagaggtccagaaagtttatgtgattta cccagtgccaggtccaggattcaaaccagagtctatcgtattcccgagcccaagccctct ctcccctctagcaccaacctctcccatggcagcttagaatggcctgcagtacagtgcaca ctgatgtttcagcatgcccaggtccatcacctggagccccttcctgcggcatcgccagat cttgctttggataactcctcctcccccagtggatacagtcctgatgaaactgtcaatcac aagtccctggtcttcatcctgctctcccctggccaaagagtgagggcctttgtccctgga ctgtga >gi568815597r:204517854_204719992|GENSCAN_predicted_peptide_4|154_aa MAEFDGRRFSGGLRILEGAAPCSSDNPKWKDQPSCRVTTNILPGRASLQGMSICWHIALT GTGNSSSITRNINSLSQCHREIYLHHHYGQHTCSRPSLLMSPAPSLPYESPARCHDHTHL IPSTSIVISSSHCHNPGHHSYPDHHQNMRMASLL >gi568815597r:204517854_204719992|GENSCAN_predicted_CDS_4|465_bp atggctgaatttgatggtagaaggttctcaggtggtctcagaattttggaaggagctgca ccttgcagcagtgacaatcccaaatggaaggaccaaccttcctgccgtgtaacaaccaac attcttccaggacgagcatcgctgcagggaatgagcatctgttggcacatcgccctcaca gggacaggcaacagcagcagcatcaccaggaatatcaactcattatcacaatgccatcgt gagatttacctccaccatcactatggtcagcacacctgcagccggccatctctgttaatg tcaccagccccatctttgccatatgaatcccctgctcgctgccatgaccacactcactta ataccttccacgagcatcgtcatctcctccagccactgtcacaatcctggtcatcatagc taccctgatcaccatcagaacatgcgtatggcctcattgctgtga >gi568815597r:204517854_204719992|GENSCAN_predicted_peptide_5|94_aa MPGIVQAREESSIQERRQTIKQAIRRNMKGPGHLPALFRVSICGYFHPSLDQKLFDSIQT SKSSNFVLMQKWHLVVTSIIGINQCVLEPACPSP >gi568815597r:204517854_204719992|GENSCAN_predicted_CDS_5|285_bp atgccaggcattgtgcaggcaagagaggaaagcagtatccaggaaagaagacagacaatt aagcaagcaataagaagaaacatgaagggccctggtcacctcccagcgctgtttcgagtc agcatatgtggctacttccatccttccctggaccagaagctctttgattccatccaaaca tccaagtcatccaactttgtgctcatgcagaagtggcacttggtagttacatccatcatt ggcattaaccagtgcgtgctggaaccagcttgccccagtccatga >gi568815597r:204517854_204719992|GENSCAN_predicted_peptide_6|57_aa MEPGLKCSSSEEEEGEAAPAPLHSPFLNLPLTVLPTCTPSSEQIALPDLLRPTEMVM >gi568815597r:204517854_204719992|GENSCAN_predicted_CDS_6|174_bp atggagcccggcctgaagtgctccagctcagaggaggaggagggagaggctgccccggca cctctgcacagtccttttctgaatctgcccctcaccgtgctccccacctgcacgccgtcc tcagaacagattgccttgcctgacctcctgaggccgacggagatggtgatgtga >gi568815597r:204517854_204719992|GENSCAN_predicted_peptide_7|207_aa METNTGELAALGAGRWLRAVSPLRAGGCEKAHLVTSSRTKNTDRSEQPGLAADVWRKLRP RKGLRWSSVPSPRGSSPAGQPEPAPPFASRTDGPPVAGQPILSPGSCGGAAVPGQEPGAL RRLLRPLPPRTPRPRQRPRCPRFPGGGGAAQGAPRIHRHPWNTENRKGRETHVADGRAVW NCRESSNLSWVIFLEKALQEEVNRKNY >gi568815597r:204517854_204719992|GENSCAN_predicted_CDS_7|624_bp atggagaccaacaccggggagctggcagccctgggggccggccggtggctcagagctgtg tctccactcagagcaggtggctgtgagaaagcccacctggtgaccagttccaggacaaag aacacggacagatctgagcaacctgggctggctgcggatgtttggagaaagctgaggccc aggaaggggctgagatggagctcagttccctcaccccgcggctcctcgccggctgggcag ccagagcctgcgccaccctttgccagcaggacagacggcccacctgtggctggccagccc atcctgtctcctgggtcctgcggcggagctgcggtgccggggcaggagccaggagccctg aggcggctgctccggccgctcccgccgcgtacgcctcggccacgccagcgacctcggtgt ccacgattccccggcggcggcggcgctgcccagggagctcccaggatccacagacaccct tggaacacagaaaaccgcaaaggcagagagacacacgttgcagatggcagagctgtgtgg aactgcagggagtcttcaaatttatcttgggttatattcctggaaaaggcactgcaggaa gaagtgaatcgtaaaaactattga >gi568815597r:204517854_204719992|GENSCAN_predicted_peptide_8|75_aa MEHYRMDALDPMGSDPGSEAPAGPGAERGNPSEYKLSASKCQSIERQSGSQKAKDVFFLS FSYPSTPPEELAHGK >gi568815597r:204517854_204719992|GENSCAN_predicted_CDS_8|228_bp atggagcactatcgaatggatgccttagaccccatgggctctgacccagggtccgaggca cctgcggggcctggagcagaaaggggtaatccttcagagtataaactcagtgcttcaaaa tgccagagcatcgaaagacaaagtggctcccagaaggccaaggacgtctttttcttgagc ttctcctacccctcaacccctcctgaagaattggcccatgggaagtag