GENSCAN 1.0 Date run: 7-Nov-116 Time: 14:33:47 Sequence gi568815588r:70154960_70366392 : 211433 bp : 45.86% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 4005 4044 40 -2.06 1.01 Sngl + 8877 9095 219 2 0 60 36 230 0.959 10.16 1.02 PlyA + 10836 10841 6 1.05 2.00 Prom + 11612 11651 40 -5.06 2.01 Sngl + 15148 15876 729 0 0 93 44 200 0.871 12.33 2.02 PlyA + 16610 16615 6 1.05 3.13 PlyA - 17583 17578 6 1.05 3.12 Term - 20142 19990 153 0 0 56 42 107 0.609 0.82 3.11 Intr - 38590 38465 126 1 0 49 79 51 0.120 1.18 3.10 Intr - 51374 51305 70 1 1 85 34 68 0.265 0.28 3.09 Intr - 54331 54246 86 1 2 56 111 48 0.938 2.52 3.08 Intr - 54726 54599 128 1 2 117 85 167 0.999 19.70 3.07 Intr - 58630 58504 127 1 1 74 95 147 0.999 14.25 3.06 Intr - 59627 59541 87 2 0 61 90 59 0.921 3.57 3.05 Intr - 62972 62853 120 2 0 45 91 104 0.905 7.09 3.04 Intr - 63858 63805 54 0 0 136 75 4 0.659 3.08 3.03 Intr - 75440 75382 59 0 2 95 116 13 0.717 3.40 3.02 Intr - 78466 78305 162 2 0 53 70 178 0.664 12.45 3.01 Init - 83865 83790 76 0 1 68 78 78 0.552 4.21 3.00 Prom - 93260 93221 40 -5.76 4.07 PlyA - 93366 93361 6 1.05 4.06 Term - 100868 99998 871 1 1 84 55 1587 0.772 147.11 4.05 Intr - 105780 105681 100 1 1 101 100 159 0.999 17.57 4.04 Intr - 111432 111118 315 1 0 96 105 425 0.616 41.04 4.03 Intr - 112197 112154 44 2 2 53 37 99 0.232 -0.82 4.02 Intr - 117740 117637 104 2 2 112 31 50 0.409 0.67 4.01 Init - 119175 119137 39 0 0 110 82 42 0.923 6.01 4.00 Prom - 120231 120192 40 -5.26 5.05 PlyA - 120256 120251 6 1.05 5.04 Term - 123789 123656 134 1 2 50 48 87 0.678 -0.85 5.03 Intr - 128950 128711 240 2 0 -38 96 155 0.318 1.22 5.02 Intr - 134933 134895 39 0 0 120 110 49 0.994 8.50 5.01 Init - 135140 135041 100 2 1 82 89 74 0.993 7.41 5.00 Prom - 138285 138246 40 -6.06 6.05 PlyA - 139751 139746 6 1.05 6.04 Term - 146549 146395 155 0 2 111 47 307 0.992 26.98 6.03 Intr - 169071 168904 168 1 0 75 95 301 0.714 29.42 6.02 Intr - 185743 185594 150 2 0 93 80 231 0.931 22.93 6.01 Intr - 205511 205388 124 2 1 15 64 157 0.404 6.16 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 93090 93211 122 1 2 88 38 88 0.949 2.44 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:70154960_70366392|GENSCAN_predicted_peptide_1|72_aa MINEVDADGNGTIDFPEFLTMMARKMKDTDSEEEIRETFCVFDKDGNGYISGVELHHVMT NLGVKLTDEEVD >gi568815588r:70154960_70366392|GENSCAN_predicted_CDS_1|219_bp atgattaatgaagtagatgctgatggtaatggcacaattgacttccctgaatttctgaca atgatggcaagaaaaatgaaagacacagacagtgaagaagaaattagagaaacattctgt gtgtttgataaggatggcaatggctatattagtggtgtagaacttcaccatgtgatgaca aaccttggagtgaagttaacagatgaagaagttgattaa >gi568815588r:70154960_70366392|GENSCAN_predicted_peptide_2|242_aa MEPFRKPERTVQTRNCYPAQQLRDPRAECKWTPGKEDSRGRSRSGQVVPSPPRISPAAPQ NRLRVTSPPPNPARASLRSDLPAFRPSPSSPNSPPRRDPQTHGLRGSSGKTAAGSDPPSE HTNQQHPGPPATRRMYVTPLPGTSGRVMGPAPSQRYASLPRCRPTSDSGRGGAGIRRDKA RSGSGPASAPPAPVWLADRARRGSAGTLGPGLGPGVPERPGTLGIAASHSRRTRKVRSGY RR >gi568815588r:70154960_70366392|GENSCAN_predicted_CDS_2|729_bp atggagccatttcgtaaaccggagcgcacggtacagacaagaaactgttatccggcccag cagctgcgagacccccgagcggaatgcaagtggaccccaggcaaagaggacagcagggga cggagcagatctggccaagttgtcccgtccccgcccagaatcagccccgcggccccacag aaccgcctgcgtgtcacttcccccccacccaaccccgccagggccagcctccgctccgac ctccccgccttccgtcccagcccctcgagccccaactcccctccccgacgcgaccctcag actcacggcctgaggggctcctccggcaaaacagcggctggctcggaccctccctcagag cacactaaccagcagcacccgggaccgccagctactcgccggatgtacgtcacacccctc cccgggacttccgggcgcgtaatgggccccgccccctcacagcgttacgcctctctgccc cggtgccgtcccaccagcgactcgggccgcggaggggcgggcataaggcgtgacaaagcg cgctcggggtctggccccgcctcggccccgcctgctcccgtctggctagctgaccgcgcg agacgtggcagcgccggaaccctgggtccggggctgggtcctggagtccctgagcggcct ggtacactcgggatcgcggcttcccattccagacgcaccaggaaagtcagaagtgggtac cgacggtga >gi568815588r:70154960_70366392|GENSCAN_predicted_peptide_3|415_aa MGNASARPPRLGGEERLCPAALHLGVLAVRRRGALSLSVGAACGLVALWQRRRQDSGTMS GFSTEERAAPFSLEYRVFLKNEKGQYISPFHDIPIYADKDVFHMVVEVPRWSNAKMEIAT KDPLNPIKQDVKKGKLRYVANLFPYKGYIWNYGAIPQTWEDPGHNDKHTGCCGDNDPIDV CEIGSKVCARGEIIGVKVLGILAMIDEGETDWKVIAINVDDPDAANYNDINDVKRLKPGY LEATVDWFRRYKVPDGKPENEFAFNAEFKDKDFAIDIIKSTHDHWKALVTKKTNGKGISC MNTTLSESPFKCDPDAARAIVDAPLGGRKWWSLALVIRCFCPEVTHITDISLARASRMTM PDLKVAVRIPHLHCIKPKKKAEGDAKEVKAKVKDKPRRRSAKLSLKLLLHTRAKA >gi568815588r:70154960_70366392|GENSCAN_predicted_CDS_3|1248_bp atggggaacgcctctgcccggccgccccgtctgggaggtgaggagcgcctctgcccggcc gcccttcatctgggagtgctcgcagtgcgcaggcgtggggctctctccttgtcagtcggc gccgcgtgcgggctggtggctctgtggcagcggcggcggcaggactccggcactatgagc ggcttcagcaccgaggagcgcgccgcgcccttctccctggagtaccgagtcttcctcaaa aatgagaaaggacaatatatatctccatttcatgatattccaatttatgcagataaggat gtgtttcacatggtagttgaagtaccacgctggtctaatgcaaaaatggagattgctaca aaggaccctttaaaccctattaaacaagatgtgaaaaaaggaaaacttcgctatgttgcg aatttgttcccgtataaaggatatatctggaactatggtgccatccctcagacttgggaa gacccagggcacaatgataaacatactggctgttgtggtgacaatgacccaattgatgtg tgtgaaattggaagcaaggtatgtgcaagaggtgaaataattggcgtgaaagttctaggc atattggctatgattgacgaaggggaaaccgactggaaagtcattgccattaatgtggat gatcctgatgcagccaattataatgatatcaatgatgtcaaacggctgaaacctggctac ttagaagctactgtggactggtttagaaggtataaggttcctgatggaaaaccagaaaat gagtttgcgtttaatgcagaatttaaagataaggactttgccattgatattattaaaagc actcatgaccattggaaagcattagtgactaagaaaacgaatggaaaaggaatcagttgc atgaatacaactttgtctgagagccccttcaagtgtgatcctgatgctgccagagccatt gtggatgctcctctcggaggaagaaagtggtggagccttgcactggtgattaggtgcttc tgcccagaagtgacgcacatcacagacatttcactggctagagcaagccgaatgaccatg cctgacctcaaggtggcagtgcgtattccccacctccactgcatcaagcccaagaaaaag gctgaaggggatgctaaggaagttaaagccaaggtgaaggacaaaccacggagaagatct gcaaagttatcactaaaactgctcctccacaccagagccaaagcctaa >gi568815588r:70154960_70366392|GENSCAN_predicted_peptide_4|490_aa MGPALKATLSDSQGTSMLWPLLNNSVIADLFDEKFNRTIQAKICLMKSSMEEMKGNDIKE FEGEPSQPPNSSWPLSQNGTNTEATPATNLTFSSYYQHTSPVAAMFIVAYALIFLLCMVG NTLVCFIVLKNRHMHTVTNMFILNLAVSDLLVGIFCMPTTLVDNLITGWPFDNATCKMSG LVQGMSVSASVFTLVAIAVERFRCIVHPFREKLTLRKALVTIAVIWALALLIMCPSAVTL TVTREEHHFMVDARNRSYPLYSCWEAWPEKGMRRVYTTVLFSHIYLAPLALIVVMYARIA RKLCQAPGPAPGGEEAADPRASRRRARVVHMLVMVALFFTLSWLPLWALLLLIDYGQLSA PQLHLVTVYAFPFAHWLAFFNSSANPIIYGYFNENFRRGFQAAFRARLCPRPSGSHKEAY SERPGGLLHRRVFVVVRPSDSGLPSESGPSSGAPRPGRLPLRNGRVAHHGLPREGPGCSH LPLTIPAWDI >gi568815588r:70154960_70366392|GENSCAN_predicted_CDS_4|1473_bp atggggccagctctcaaggccacgctttctgattcccagggcacatccatgctctggccc ttattgaataattcagtcattgccgatctgtttgacgaaaagtttaaccgaacaatccaa gcaaaaatctgtttgatgaaaagttccatggaggagatgaaggggaacgacatcaaagag tttgagggggagccctcccagcctcccaacagcagttggcccctaagtcagaatgggact aacactgaggccaccccggctacaaacctcaccttctcctcctactatcagcacacctcc cctgtggcggccatgttcattgtggcctatgcgctcatcttcctgctctgcatggtgggc aacaccctggtctgtttcatcgtgctcaagaaccggcacatgcatactgtcaccaacatg ttcatcctcaacctggctgtcagtgacctgctggtgggcatcttctgcatgcccaccacc cttgtggacaacctcatcactgggtggcccttcgacaatgccacatgcaagatgagcggc ttggtgcagggcatgtctgtgtcggcttccgttttcacactggtggccattgctgtggaa aggttccgctgcatcgtgcaccctttccgcgagaagctgaccctgcggaaggcgctcgtc accatcgccgtcatctgggccctggcgctgctcatcatgtgtccctcggccgtcacgctg accgtcacccgtgaggagcaccacttcatggtggacgcccgcaaccgctcctacccgctc tactcctgctgggaggcctggcccgagaagggcatgcgcagggtctacaccactgtgctc ttctcgcacatctacctggcgccgctggcgctcatcgtggtcatgtacgcccgcatcgcg cgcaagctctgccaggccccgggcccggcccccgggggcgaggaggctgcggacccgcga gcatcgcggcgcagagcgcgcgtggtgcacatgctggtcatggtggcgctgttcttcacg ctgtcctggctgccgctctgggcgctgctgctgctcatcgactacgggcagctcagcgcg ccgcagctgcacctggtcaccgtctacgccttccccttcgcgcactggctggccttcttc aacagcagcgccaaccccatcatctacggctacttcaacgagaacttccgccgcggcttc caggccgccttccgcgcccgcctctgcccgcgcccgtcggggagccacaaggaggcctac tccgagcggcccggcgggcttctgcacaggcgggtcttcgtggtggtgcggcccagcgac tccgggctgccctctgagtcgggccctagcagtggggcccccaggcccggccgcctcccg ctgcggaatgggcgggtggctcaccacggcttgcccagggaagggcctggctgctcccac ctgcccctcaccattccagcctgggatatctga >gi568815588r:70154960_70366392|GENSCAN_predicted_peptide_5|170_aa MDKDWAVGILGGLWKRLGLTRKYLEVCRGMEMDVDHKLYETRTLAAAALRGSQSARGEGN RPGPGAWERTPPAHLCPDRPDLRSAEAGAQPATKCPETVPLPAGPKGPAASRHRRPPEPA TGGIMEEWVSERKNDEYESSSGHCQDLSEKTRELIERRRRDTGGPTMGAL >gi568815588r:70154960_70366392|GENSCAN_predicted_CDS_5|513_bp atggacaaggactgggccgtgggcatcttgggtggcctgtggaagagattgggtttaacc aggaagtacctggaagtctgcaggggaatggaaatggacgttgaccacaagctctatgag accaggacccttgcggcagcagcgctgcgcgggagccagtcagcccgaggggagggaaac cggccagggcccggggcctgggagcgcacgccccctgctcacctctgcccggaccgaccg gacctgcggagcgcggaggcaggggcgcagcccgcaaccaagtgcccggagaccgtccct ctgcccgctggcccgaaggggcccgctgcctcccgccatcggagaccgccggagcccgca actgggggcatcatggaggaatgggtttctgagaggaaaaatgatgaatacgaaagttca tcaggacactgccaagatctctcagaaaaaactcgtgaattgattgagagacgtaggagg gatactggaggaccaacaatgggggctttatga >gi568815588r:70154960_70366392|GENSCAN_predicted_peptide_6|198_aa SETLSQKKRNEEQEKEEEEEEGEEEEGDGEEEEEEESNEEPDLAECKLVSFPIGIYKVLR NVSGQIHLITLANNELKSLTSKFMTTFSQLRELHLEGNFLHRLPSEVSALQHLKAIDLSR NQFQDFPEQLTALPALETINLEENEIVDVPVEKLAAMPALRSINLRFNPLNAEVRVIAPP LIKFDMLMSPEGARAPLP >gi568815588r:70154960_70366392|GENSCAN_predicted_CDS_6|597_bp agtgagaccctgtctcaaaagaaaagaaatgaagagcaagagaaggaggaagaagaggag gagggggaggaggaggagggagatggggaggaggaggaggaggaggaaagcaatgaggaa ccagacctggccgagtgcaagctggtctcctttcccattggcatctacaaggtcctgcgg aatgtctctggccagatccacctcatcaccctggctaacaacgagcttaagtccctcacc agcaagttcatgaccacattcagtcagctccgagagctccacctggaggggaacttccta caccgcctccccagcgaggtcagtgccctgcagcacctcaaggccattgacctgtcccgg aaccagttccaggacttccctgagcagcttaccgccctgccggcgctggagaccatcaac ctggaggagaacgagatcgtagatgtgcccgtggagaagctggccgccatgccagccttg cgcagcatcaacctccgcttcaacccactcaacgccgaggtgcgcgtgatcgccccgccg ctcatcaagtttgacatgctcatgtctccggaaggcgcaagagcccccctaccttag