GENSCAN 1.0	Date run: 3-Nov-116	Time: 21:08:45

Sequence gi568815592r:143995240_144195497 : 200258 bp : 40.44% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +  12551  13019  469  2  1    7   54   336 0.528  15.56
 1.02 PlyA +  13744  13749    6                               1.05

 2.04 PlyA -  14015  14010    6                               1.05
 2.03 Term -  22704  22520  185  1  2   22   45   137 0.069  -0.48
 2.02 Intr -  45832  45705  128  0  2   84   86     6 0.154  -0.60
 2.01 Init -  47013  46889  125  0  2   48   91   120 0.245   7.99
 2.00 Prom -  47303  47264   40                              -3.25

 3.03 PlyA -  47606  47601    6                               1.05
 3.02 Term -  48891  48373  519  0  0  -10   45   219 0.017   1.11
 3.01 Init -  51072  50683  390  0  0   88   44   290 0.177  21.42
 3.00 Prom -  52366  52327   40                              -7.65

 4.03 PlyA -  52873  52868    6                               1.05
 4.02 Term -  53185  52881  305  2  2    8   53   218 0.325   4.65
 4.01 Init -  54101  53891  211  2  1   58   37   144 0.483   5.30
 4.00 Prom -  59162  59123   40                              -5.15

 5.00 Prom +  61992  62031   40                              -4.85
 5.01 Init +  62893  62919   27  0  0   79   81    43 0.239   1.40
 5.02 Intr +  63939  64037   99  2  0   83   54   101 0.428   5.59
 5.03 Intr +  68938  69099  162  0  0   86   18   128 0.184   4.85
 5.04 Intr +  81301  81403  103  0  1  104   78    10 0.052   0.43
 5.05 Intr +  96632  96675   44  0  2   54   91    57 0.000  -0.36
 5.06 Intr +  99809  99984  176  1  2   33   79   161 0.005   7.52
 5.07 Intr + 100036 100298  263  1  2   14   92   381 0.846  27.31
 5.08 Term + 100977 101191  215  1  2  -18   49   229 0.691   4.81
 5.09 PlyA + 102423 102428    6                               1.05

 6.05 PlyA - 102573 102568    6                               1.05
 6.04 Term - 121663 121471  193  0  1   76   48   165 0.559   7.21
 6.03 Intr - 122260 121990  271  2  1   55   21   193 0.471   4.98
 6.02 Intr - 123743 122898  846  0  0    2   37   350 0.675  11.71
 6.01 Init - 130627 130549   79  1  1   50   82    64 0.283   3.27
 6.00 Prom - 130775 130736   40                              -5.85

 7.00 Prom + 132616 132655   40                              -6.05
 7.01 Sngl + 134438 134770  333  1  0   83   37   279 0.565  18.07
 7.02 PlyA + 134859 134864    6                               1.05

 8.00 Prom + 136186 136225   40                              -7.05
 8.01 Init + 145498 145570   73  0  1   83   78   116 0.991  11.38
 8.02 Intr + 149620 149703   84  2  0  112   75    64 0.782   6.37
 8.03 Intr + 154827 155174  348  1  0   47   39   236 0.034   9.00
 8.04 Term + 162951 163024   74  1  2   37   40   125 0.164  -0.31
 8.05 PlyA + 164077 164082    6                               1.05

 9.03 PlyA - 165187 165182    6                               1.05
 9.02 Term - 173310 173121  190  2  1   22   45   216 0.623   6.64
 9.01 Init - 174161 173917  245  2  2   52  -49   278 0.573   7.75
 9.00 Prom - 175557 175518   40                              -5.35

10.00 Prom + 184504 184543   40                              -3.15
10.01 Sngl + 191389 192252  864  0  0   74   43  1455 0.973 135.22
10.02 PlyA + 192589 192594    6                               1.05

11.00 Prom + 194772 194811   40                              -6.25
11.01 Init + 199277 199341   65  1  2   85  113     8 0.572   3.68
11.02 Term + 199880 200072  193  2  1   30   48   233 0.578   9.41
11.03 PlyA + 200209 200214    6                              -3.74

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 183837 184079  243  2  0   75   31   174 0.801   8.18
S.002 Term + 184293 184517  225  2  0   81   46   101 0.832   1.00

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:143995240_144195497|GENSCAN_predicted_peptide_1|156_aa XLERQVVRIGSVDLIPDRRERLWQRRPGGPAALAVGLPGLRVTICGVPHPAPRSPDPHVG GAAKAHPPRGREPSAVLAARQCTATAAPSPPAAAASTQTPTLRGDDPRSSGEAARACHLR GHDGDPGKRPARQGPALLSCEHGCPVRPSVQHPPGE >gi568815592r:143995240_144195497|GENSCAN_predicted_CDS_1|471_bp nngttagagcgccaggttgtaagaatcgggtctgtggacctcataccagataggcgcgaa cgcctctggcagcggcgtccagggggtccggcggcactcgcggtggggctgcctgggttg cgggtgacgatctgcggggtcccgcacccggccccgcggagcccggacccgcacgtaggc ggcgcggcaaaggcacaccctcctcgcggccgcgaacccagcgccgtcctcgcagcgcgg caatgcacggccaccgctgcccccagcccgcccgccgcagccgcgagcacccaaacacct accctgcggggcgacgacccccggagctcaggcgaggccgctcgggcgtgccacctccgc ggccatgacggcgacccggggaagcgccccgcgcgccaaggccccgcgctgctgagctgt gagcacggctgccccgtccgtccgtccgtccagcacccgcccggagagtga >gi568815592r:143995240_144195497|GENSCAN_predicted_peptide_2|145_aa MGKDFMTKTPKAMATKAKIDKWDLIKRKRFCTAKETTIRVNSSYVCKTFPVLEKNKAELS LDHRSLSNYYILSFSHLRVLKIIQGWLGKEWKQGGNPVKETGESYCSQSKHPVSERVRNQ EEQEVMIRGGNFGVEDFQGRADPGC
>gi568815592r:143995240_144195497|GENSCAN_predicted_CDS_2|438_bp atgggcaaggacttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaatagac aaatgggatctaattaaacgaaagaggttctgcacagcaaaagaaactaccatcagagtg aacagctcatatgtctgcaaaacatttcctgttctggaaaagaataaagccgaactgtct cttgaccacagatccctttctaactactatattctctctttttcccacctaagggtttta aagattattcaaggctggctagggaaggagtggaaacaaggagggaatccagtgaaggag actggagagtcttactgcagtcagagtaaacatcctgtgagtgagagagtgagaaaccag gaagagcaggaagttatgatcagaggtgggaattttggagttgaagatttccaggggaga gcagatcctggttgttga >gi568815592r:143995240_144195497|GENSCAN_predicted_peptide_3|302_aa MGRNQSRKAENSKNQSTFSPPKDHSSLPATEQSWTENDFEELTEVGFRRLAISNFSELKE DVQTHCKEARNLEKRLDEWLTRTNSVEKNLNDLMELKTMAQELCDACTSFSSQFNQVEER VSVIEDQMNEDLVFEKIDKIDRPLARLIKKKREKNQIDTIKNDKGDITNDPTEIQTSIRE YYNHLYANKLENLEEMDKFLDTYTLPRLNQEEVESLNRPITGPEIEAIINSLPTKKSPGP DGFTAKFYQSYKEELVPFLLKQFQSIEKEGILPNSFYEASIILIPKPSRDTTKKKRILDQ YP >gi568815592r:143995240_144195497|GENSCAN_predicted_CDS_3|909_bp atggggagaaaccagagcagaaaagctgaaaattctaaaaaccagagcaccttttctcct ccaaaggatcacagctccttgccagcaacagaacaaagctggacagagaatgactttgag gagttgacagaagtaggcttcagaagattggcaatatcaaacttctctgagctgaaggag gatgttcaaacccattgcaaggaagccagaaaccttgaaaaaagattagatgaatggctc actagaacaaacagtgtagagaagaacttaaatgacctgatggagctgaaaaccatggca caagaactatgtgacgcatgcacaagcttcagtagccaattcaatcaagtggaagaaagg gtatcagtgattgaagatcaaatgaatgaagacctggtttttgaaaagatcgacaaaatt gatagaccactagcaaggctaataaagaagaaaagagagaagaatcaaatagacacaata aaaaatgataaaggggatatcaccaatgatcccacggaaatacaaacttctatcagagaa tactataaccacctctatgcaaataaactagaaaacctagaagaaatggataaattcctg gacacatacaccctgccaaggctaaaccaggaagaagttgaatccctgaatagaccaata acaggccctgaaattgaggcaataattaacagcctaccaaccaaaaaaagtccaggacca gacggattcacagccaaattctaccagagttacaaagaggagctggtaccattccttctg aaacaattccaatcaatagaaaaagagggaatcctccctaactcattttatgaggccagc atcatcctgataccaaagcctagcagagacacaacaaaaaaaaagagaattttagaccaa tatccctga >gi568815592r:143995240_144195497|GENSCAN_predicted_peptide_4|171_aa 
MRKQLWNWVTGRGWNRLEGPEEDRKLWESLELSRDLLNGFDQNAVNDVDNEVQAEVVSDE DKESLGNWSKGVELSWRTSASVVQEENVGLEPPHRVPTGALPNGAVRRGPLSSRSQNARS TDSLHCAPGKATNTQCQPMKAAKREAVPCKATEAELPKTVGAHFLHQHTLM >gi568815592r:143995240_144195497|GENSCAN_predicted_CDS_4|516_bp atgaggaagcaactttggaactgggtaacaggcagaggttggaacagattggagggccca gaagaggacaggaagttgtgggaaagtttggaactttctagagacttgttgaatggtttt gaccaaaatgcagttaatgatgtggacaatgaagtacaggctgaggtggtctcagatgaa gataaggaaagtcttgggaactggagcaaaggggtggagctctcatggagaacctctgct agcgtagtccaggaggaaaatgtggggttggagcccccacacagagtccccactggggca ctgcctaatggagctgtgagaagagggccactgtcctccagatcccagaatgctagatcc actgacagcttgcactgtgcacctggaaaagccaccaacactcaatgccagcccatgaaa gcagccaagagggaggccgtaccctgcaaagccacagaggcagagctgcccaagaccgtg ggagcccacttcttgcatcagcataccctgatgtga >gi568815592r:143995240_144195497|GENSCAN_predicted_peptide_5|362_aa MVLTYAQLLEAAKAYGTLHSPKPQPELYLGPFEPRLELEQWNRRSEELQTTEKREARPKP HKGGRRCPKHRPPSPAGRRQEPRPGYLAWPADSARHDSAESPHKQEGSYQMQPLGFELLS LHNYKKYFFLSSPAAGATGARRYAQCCSKGPHEGRSGGTPRASTSVRGKHEAQEPWRAGP LETGILKRIGKPEKRWERCPALRWLKHLFHQVEADSRFALILRNGEVVEKVEVAHVGAVR VAVLVHQPLPLGGVGVARADVLGLQVLQLAMDGVAVSHLAAFPFALRSEDAVEGDVWRWG ASVGGLGLIGNGKPKNDWTTSSPENKLLVYFGAVVAARPSLHNRGWITQKSAGASSGPNY VS >gi568815592r:143995240_144195497|GENSCAN_predicted_CDS_5|1089_bp atggtgctgacatatgctcagcttctggaagctgctaaggcttatggcaccttgcactcc ccaaagccacagccggagctctacctggggccctttgaaccaagactggaactggagcag tggaatcgcagaagcgaagagctgcagacgacggagaaaagggaagcgcgccccaagccg cacaaaggtggccgccggtgtcccaagcaccgaccgccatccccggcaggacgcaggcag gagcctcggccaggctacctcgcctggcccgcggactccgcgcggcacgactctgcagag agtccccacaagcaagaaggttcttaccaaatgcagccccttggctttgaacttctcagc ctccataactataagaaatatttctttctttcctccccagcagctggggctacaggtgca cgccgctacgcccagtgctgttctaagggtccacatgaaggcaggtcaggcgggactccc cgggcaagcacttctgtacgcggaaagcacgaggcgcaggagccatggagagccggtcct ttggagacaggaatacttaagaggattggcaaaccggagaaacgctgggagaggtgcccc gcactgcggtggctgaagcatcttttccatcaagttgaagcggactcgcgctttgctctc attctccgcaatggcgaagtagttgagaaggtcgaagtggcccatgtaggagcagtacga 
gtcgcggtgttggttcaccagccactcccacttggtggtgtcggcgtggcccgtgccgat gtacttggactgcaggtgctccagctggctatggatggtgtagcggtcagtcatctcgcc gctttccccttcgctctcaggtcagaggacgcagtggagggcgatgtctggaggtgggga gcatccgtgggtggcttggggctgataggaaatgggaaacccaagaatgactggactacg agcagccccgaaaataaactcttggtgtatttcggagccgtcgtggcggcacggccgtcc cttcacaaccggggatggattacccagaagtcagcaggcgccagttctggtcccaactat gtctcttag >gi568815592r:143995240_144195497|GENSCAN_predicted_peptide_6|462_aa MNKEMSGQTFVGKQNSVRMPKIISGLGVQKPNRQWRLVQDLRIINEAVVPLYQAVRNPYT LLSQIPEETGWFTVLDLKDALFCIAVHPDSQFLLAFEDPLNPTSQLTWTVLPQGFRDSPH LFGQALAQDLSQFSYLDTLVLRYVDDLLLAAPSETLCHQATQVLLNFLATCGYKVSKLKA QICSQQVKYLGLKLSKGTRALSEERIQPILAYPHPKTRKQLRGLLGITGFCQIWIPRYSE IARPLHTLIKKTQKANTHLVRWTPEAEAAFQVLKKALTQAPVLSLPTGQDFFLICHRKNR NSSRSPYTALWRGKEFLTSDGIPIKHQEAIRRLLLAVQKPKEVAVLHCQGHQKGKEREIE GNRQVDIEAEKAATQDPPLELLIEGPLVWGNPLWETKPQIEAVRLQIVLQMEPQMQSMTK IYHGPLDWPARPCSNVNDMEGTPPEEISTARPLLCLNSAGSS >gi568815592r:143995240_144195497|GENSCAN_predicted_CDS_6|1389_bp atgaacaaagaaatgagtggacaaacttttgtaggaaaacaaaactctgtgagaatgccg aagatcatttcaggactaggagtacagaaacccaacagacagtggaggttagtgcaagat ctcaggatcatcaatgaggctgttgtccctctataccaagctgtacgtaacccttatact ctgctttcccaaataccagaggaaacagggtggtttacagtcctggaccttaaggatgcc cttttctgcatcgctgtacatcctgactctcaattcttgcttgcctttgaagatcctttg aacccaacgtctcaactcacctggactgttttaccccaagggttcagggatagcccccat ctatttggccaggcattagcccaagacttgagccagttctcatacctggacactcttgtc cttcggtacgtggatgatttacttttagccgccccttcagaaaccttgtgccatcaagcc acccaagtactcttaaatttcctcgccacctgtggctacaaggtttccaaactaaaggct cagatctgctcacagcaggttaaatacttagggctaaaactatccaaaggcaccagggcc ctcagtgaggaacgtatccagcctatattggcttatcctcatcctaaaacccgaaaacaa ctaagagggctccttggcatcacaggcttctgccaaatatggattcccaggtacagcgaa atagccaggccattacatacactaattaagaaaactcagaaagccaatacccatttagta agatggacacctgaagcagaagcagcttttcaggtcttaaagaaggccctaacccaagcc ccagtgttaagcttgccaacggggcaagactttttccttatatgtcacagaaaaaacagg aatagctctaggagtccttacacagcactatggagaggaaaggaattcctgacttccgac 
ggaatacctatcaaacatcaggaagccattaggagattattattggctgtacagaaacct aaagaggtggcagtcttacactgccagggtcatcagaaaggaaaggaaagggaaatagaa gggaaccgccaagtggatattgaagccgaaaaagctgcaacgcaggaccctccattagaa ctgcttatagaaggacccctagtatggggtaatcccctctgggaaaccaaaccccaaatc gaagctgtaagactacaaattgttcttcaaatggagccccagatgcagtccatgactaag atctaccacggacccctggactggcctgctagaccgtgctccaatgttaatgacatggaa ggcacccctcctgaggaaatctcaactgcacgacccctactatgcctcaattcagcagga agcagttaa >gi568815592r:143995240_144195497|GENSCAN_predicted_peptide_7|110_aa MAAKEDKESKNKLELTEQTGTMSAPVASGLGGVGVLQNLRFFMKKLNTRLGQQSEKAKED LGEGGAVEGLAAVSHQQEIQMINDNVYELQKCLLLHFLLISHKNLPCDPS >gi568815592r:143995240_144195497|GENSCAN_predicted_CDS_7|333_bp atggctgccaaggaggataaggagagcaagaataagctagagctcacagagcaaactgga accatgtcagctcccgttgcctcaggtcttggtggtgtgggtgtcctgcagaatctgaga ttcttcatgaagaagctaaacacacgcctgggccagcagtcagaaaaggcgaaggaggac cttggggaaggtggagcagttgaaggcctggctgctgtctcccaccaacaagagatccag atgataaatgacaatgtgtatgagctacaaaaatgtctgctgcttcatttcctcctgatc tcacataagaatctcccctgtgacccttcctaa >gi568815592r:143995240_144195497|GENSCAN_predicted_peptide_8|192_aa MEQPQGDEMDEGEEEEKKARSQPQRYEIEIKDSLRGISVGKEVETLEPLGIAGSSPASSR VPGMQDLPQLPQPHPRSETASSVLAAATHRTANVTSAAGIPGGSPARSLPWGPEDERRKC LGRQTSHSPRRSAQPAGTPLGLREGLERVPGSGRSRAHGLWVGREPALYYPEVMGTVGQQ RSTGALVPAGLV >gi568815592r:143995240_144195497|GENSCAN_predicted_CDS_8|579_bp atggagcagcctcaaggggatgaaatggatgaaggggaggaagaagagaagaaagccaga tctcagccccaaaggtatgaaatagaaatcaaagatagtctaaggggaataagtgttggc aaggaggtggagacactggaacccttgggcattgcaggctcctcgcccgcgtcctcgagg gtcccagggatgcaggacctgccgcagcttccgcagccacacccacgctcagagacggct tccagcgtcctcgccgcggcaacgcaccggactgcaaatgtcaccagcgcagcggggatt cccgggggaagtccagcgcgctccctgccctggggaccagaggacgagcggaggaagtgt cttgggcggcaaacatcccactccccgcgccggtccgcgcagcccgcgggaacgcccctg ggcttgcgggaggggctggagcgcgtccccggctccgggcggtcccgagcgcatggactc tgggtggggcgggaacccgcgctctattatccagaagtcatgggaaccgtgggacagcag aggagcaccggagccttagtcccagcaggcctggtatag 
>gi568815592r:143995240_144195497|GENSCAN_predicted_peptide_9|144_aa MCGDEASADHKAAEKLIDEFAKVIADENLTPQQVCNADKTSPFWCYCPGKTLTRAETAST GIKDAKDRTTVQGCADAAGMHKVSKLEDINEVFFNINKEATVLHSLTADEKAEMVVNEND CDNSDKEDTINTAEKVPIDDIVDV >gi568815592r:143995240_144195497|GENSCAN_predicted_CDS_9|435_bp atgtgtggtgatgaagcatctgctgatcacaaagcagcagagaaattaattgatgaattt gccaaggtcattgctgatgaaaatttgactccacaacaagtttgcaatgctgataaaaca tcaccgttttggtgttattgccccggaaagacactgactagagctgagacagcctctaca ggaattaaggatgccaaggacagaacaactgtgcagggatgtgctgatgcagcaggcatg cataaagtcagtaagctggaagacatcaatgaagttttttttaacatcaacaaagaggct acagttcttcattcattgactgctgatgaaaaagctgaaatggttgtcaatgaaaatgat tgtgataatagtgacaaggaagataccattaacactgcagaaaaagtgcctatagatgac atagtggatgtgtga >gi568815592r:143995240_144195497|GENSCAN_predicted_peptide_10|287_aa MKDRLAELLDLSKQYDQQFPDGDDEFDSPHEDIVFETDHILESLYRDIRDIQDENQLLVA DVKRLGKQNARFLTSMRRLSSIKRDTNSIAKAIKARGEVIHCKLRAMKELSEAAEAQHGP HSAVARISRAQYNALTLTFQRAMHDYNQAEMKQRDNCKIRIQRQLEIMGKEVSGDQIEDM FEQGKWDVFSENLLADVKGARAALNEIESRHRELLRLESRIRDVHELFLQMAVLVEKQAD TLNVIELNVQKTVDYTGQAKAQVRKAVQYEEKNPCRTLCCFCCPCLK >gi568815592r:143995240_144195497|GENSCAN_predicted_CDS_10|864_bp atgaaagaccggctagcagaacttctggacttgtccaagcaatatgaccagcagttccca gacggggacgatgagtttgactcgccccacgaggacatcgtgttcgagacggaccacatc ctggagtccctgtaccgagacatccgggacattcaggatgaaaaccagctgctggtggcc gacgtgaagcggctgggaaagcagaacgcccgcttcctcacgtccatgcggcgcctcagc agcatcaagcgcgacaccaactccatcgccaaggccatcaaggcccggggcgaggtcatc cactgcaagctgcgcgccatgaaggagctgagcgaggcggctgaggcccagcacggcccg cactcggcagtggcgcgcatttcgcgggcgcagtacaacgcgctcaccctcaccttccag cgcgccatgcacgactacaaccaggccgagatgaagcagcgcgacaactgcaagatccgc atccagcgccagctggagatcatgggcaaggaagtctcgggcgaccagatcgaggacatg ttcgagcagggtaagtgggacgtgttttccgagaacttgctggccgacgtgaagggcgcg cgggccgccctcaacgagatcgagagccgccaccgcgaactgctgcgcctggagagccgc atccgcgacgtacacgagctcttcttgcagatggcggtgctggtggagaagcaggccgac accctgaacgtcatcgagctcaacgtacaaaagacggtcgactacaccggccaggccaag 
gcgcaggtgcggaaggccgtgcagtacgaggagaagaacccctgccggaccctctgctgc ttctgctgtccctgcctcaagtag >gi568815592r:143995240_144195497|GENSCAN_predicted_peptide_11|85_aa MGLEKDCLGRSRSCRRTVSLKSLWIVQNSNSKGLLFLQALKTKERMEDPKLPLLGAASSL PAEADPAGRLLLGPEVPLSHQQPTL >gi568815592r:143995240_144195497|GENSCAN_predicted_CDS_11|258_bp atgggactggagaaagactgcttgggaagaagcagatcatgcaggagaacggtcagtctg aaaagcctgtggattgtgcagaacagtaacagcaagggtcttcttttcctacaagcgttg aagaccaaggagagaatggaggaccctaaactgcccttgcttggagctgcaagttctctc cctgctgaagctgaccctgcaggtcgactgctgcttgggcctgaagtgcctctcagccac cagcagccaactctttga