GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:28:08 Sequence gi568815597r:45411332_45619043 : 207712 bp : 44.46% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3725 3839 115 1 1 52 -17 143 0.476 0.22 1.02 Term + 3856 4013 158 2 2 48 38 175 0.980 6.60 1.03 PlyA + 4204 4209 6 1.05 2.10 PlyA - 6612 6607 6 1.05 2.09 Term - 10515 10390 126 0 0 105 39 58 0.399 0.88 2.08 Intr - 77273 77243 31 1 1 89 91 40 0.086 2.53 2.07 Intr - 79633 79521 113 1 2 68 97 36 0.058 1.68 2.06 Intr - 85292 85225 68 0 2 113 97 65 0.794 8.52 2.05 Intr - 86052 85968 85 0 1 36 80 60 0.620 -0.61 2.04 Intr - 88112 88014 99 2 0 76 111 35 0.528 4.91 2.03 Intr - 88335 88201 135 0 0 62 100 133 0.567 12.66 2.02 Intr - 88750 88551 200 2 2 70 23 146 0.804 5.37 2.01 Init - 89364 89316 49 0 1 86 58 27 0.380 -1.39 2.00 Prom - 90170 90131 40 -7.06 3.00 Prom + 91269 91308 40 -7.56 3.01 Init + 94765 94879 115 0 1 44 39 126 0.644 3.67 3.02 Intr + 96077 96219 143 0 2 61 78 146 0.612 10.97 3.03 Intr + 96881 97033 153 1 0 88 94 209 0.988 21.77 3.04 Term + 97465 97884 420 0 0 119 34 201 0.804 13.09 3.05 PlyA + 98785 98790 6 1.05 4.07 PlyA - 99737 99732 6 -1.95 4.06 Term - 100083 99998 86 1 2 93 53 89 0.717 3.72 4.05 Intr - 103306 103176 131 0 2 95 121 97 0.999 14.04 4.04 Intr - 103664 103542 123 1 0 79 91 96 0.997 8.70 4.03 Intr - 104476 104323 154 2 1 122 54 99 0.999 9.03 4.02 Intr - 107723 107607 117 0 0 90 100 99 0.967 11.74 4.01 Init - 116522 116420 103 2 1 49 93 74 0.475 4.41 4.00 Prom - 121246 121207 40 -4.86 5.00 Prom + 132950 132989 40 -7.06 5.01 Init + 133235 133287 53 1 2 93 81 -27 0.225 -1.99 5.02 Intr + 136097 136223 127 2 1 69 115 74 0.401 8.88 5.03 Intr + 139312 139545 234 0 0 -9 64 154 0.251 1.19 5.04 Intr + 150458 150547 90 1 0 95 94 20 0.319 3.49 5.05 Intr + 155238 155357 120 2 0 83 116 214 0.999 24.39 5.06 Intr + 155538 155689 152 2 2 59 35 207 0.993 11.46 5.07 Intr + 156651 156846 196 0 1 50 110 248 0.969 22.62 5.08 Intr + 157154 157353 200 1 2 50 80 84 0.978 1.95 5.09 Intr + 157596 157668 73 0 1 86 75 78 0.993 5.71 5.10 Intr + 157812 157898 87 2 0 102 89 55 0.971 7.17 5.11 Intr + 172777 172874 98 0 2 101 82 93 0.001 8.81 5.12 Intr + 179892 179939 48 0 0 84 94 72 0.326 5.20 5.13 Intr + 190924 191034 111 1 0 64 78 79 0.445 3.99 5.14 Intr + 193605 193685 81 0 0 62 86 57 0.721 1.65 5.15 Intr + 195151 195260 110 1 2 82 80 86 0.953 7.13 5.16 Intr + 195990 197006 1017 1 0 45 86 746 0.162 60.73 5.17 Intr + 201838 201917 80 2 2 35 95 99 0.238 4.57 5.18 Intr + 202765 202850 86 0 2 30 93 87 0.986 2.02 5.19 Intr + 202962 203035 74 0 2 89 55 21 0.521 -2.05 5.20 Intr + 203682 203870 189 1 0 80 92 283 0.999 27.46 5.21 Intr + 203974 204140 167 2 2 76 16 161 0.900 7.38 5.22 Term + 206132 206272 141 1 0 26 47 186 0.489 6.23 5.23 PlyA + 206898 206903 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 79593 79521 73 0 1 44 97 113 0.815 7.03 S.002 Term + 158560 158625 66 0 0 123 53 26 0.939 0.54 S.003 Sngl - 173109 172780 330 0 0 110 43 236 0.943 16.05 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:45411332_45619043|GENSCAN_predicted_peptide_1|90_aa SNSEPLGHVSFELFADKFPKTTENFCALTTGEKGFGYKELFQGLCVRLDGKHVVFGKVKE GMNIVEAMGHFGSGNGKTSKEITIADCGQL >gi568815597r:45411332_45619043|GENSCAN_predicted_CDS_1|273_bp tcaaacagcgagcccttgggccacgtctccttcgagctgtttgcagacaagtttccaaag acaacagaaaacttttgtgctctgaccactggagagaaaggatttggttataaggaatta ttccagggtttatgtgtccggttggatggcaagcatgtggtctttggcaaggtgaaagaa ggcatgaatattgtggaggccatggggcactttgggtctgggaatggcaagaccagcaag gagatcaccattgctgactgtggacaactctaa >gi568815597r:45411332_45619043|GENSCAN_predicted_peptide_2|301_aa MGFHHVGQAGLELLTSEKGSSTASIAGCVRKRRPQFPRWRCGSQSKVGLSKPTRSFEPGG GSCHWHRFGVVAPVYYFWGGLNQESKHKTPGSQTPAAPLPQDMNTSLSWFEQLDVLLNAT DGNVVRNKQWLYPLGVSTELIGLCICFFCSSGCIFLGSPPQNSTAVTPAVLWEESEIMQK ELKLLQYQLSQHQELLLKQLAEGRQAQVGSWKAKEPLPPTSCEGYYAPSLPPSPPLARAG TPLAAQSCLSGEYCGQGFEQVRHRASGQVMALKMNTLSSNRANMLKEVQLMNRLSHPNIL R >gi568815597r:45411332_45619043|GENSCAN_predicted_CDS_2|906_bp atggggtttcaccatgttggccaggctggtctcgaactcctcacctcagagaaaggtagt tccaccgcctccatcgccggttgtgttcgcaagaggcgacctcagtttcccaggtggcgc tgcggcagccagtccaaagtcggactttcaaaaccaacccgaagttttgaacccggcggc ggcagttgtcactggcacaggtttggcgtggtggcgccagtgtactacttctggggaggt ctcaaccaggagagcaagcacaagacaccagggagccagacccctgccgctcccctgcca caggatatgaatacgagcctcagctggtttgagcagctggatgtgcttctcaacgctact gatggaaatgtggtccggaataagcagtggctgtatcctcttggggtctccacagagctc attggcctgtgcatctgtttcttctgtagcagtggctgtatcttcttggggtctccaccg cagaatagcactgctgtcactcctgcagtgctgtgggaggagtcagagattatgcagaag gaattgaagttgctgcagtaccagttgagccagcaccaggagctgctgctgaaacagctg gctgagggacgacaggctcaggttggcagttggaaggctaaggagccgctgccaccaacg agctgtgagggttactatgctccctctttgccgccgtctcctcctcttgcccgcgcaggc acccctctggctgctcagtcctgcctcagtggagagtactgtggacaaggatttgagcag gtacgacaccgagcttctggtcaggtgatggctcttaagatgaacacattgagcagtaac cgggcaaacatgctgaaagaagtacagctcatgaatagactctcccatcccaacatcctt aggtaa >gi568815597r:45411332_45619043|GENSCAN_predicted_peptide_3|276_aa MTYQKDKEVLVGKRLVEPHRFVSFEDELDCGMNGIEGSGPTLAFLVLSTPAMFDRALKPF LQSCHLRMLTDPVDQCVAYHLGRVRESLPELQIEIIADYEVHPNRRPKILAQTAAHVAGA AYYYQRQDVEADPWGNQRISGVCIHPRFGGWFAIRGVVLLPGIEVPDLPPRKPHDCVPTR ADRIALLEGFNFHWRDWTYRDAVTPQERYSEEQKAYFSTPPAQRLALLGLAQPSEKPSSP SPDLPFTTPAPKKPGNPSRARSWLSPRVSPPASPGP >gi568815597r:45411332_45619043|GENSCAN_predicted_CDS_3|831_bp atgacatatcaaaaggataaagaagtgctggttgggaagagactggtggaaccacatcgt tttgtgtcatttgaggatgagttggactgtggcatgaacggcattgagggttcaggacct accctggccttcctggtactcagcacgcctgccatgtttgaccgggccctcaagcccttc ttgcagagctgccacctccgaatgctgactgacccagtggaccagtgtgtggcctaccat ctgggccgtgttagagagagcctcccagagctgcagatagaaatcattgctgactacgag gtgcaccccaaccgacgccccaagatcctggcccagacagcagcccatgtagctggggct gcttactactaccaacgacaagatgtggaggctgacccatgggggaaccagcgcatatca ggtgtgtgcatacacccccgatttgggggctggtttgccatccgaggggtagtgctgctg ccagggatagaggtgccagatctgccacccagaaaacctcatgactgtgtacctacaaga gctgaccgtatcgccctactcgaaggcttcaatttccactggcgtgattggacttaccgg gatgctgtgacaccccaggagcgctactcagaagagcagaaggcctacttctccactcca cctgcccaacgattggccctattgggcttggctcagccctcagagaagcctagttctccc tccccggaccttccctttaccacacccgcccccaagaagcctgggaatcccagcagagcc cggagctggctcagccccagggtctcaccacctgcatcccctggcccttga >gi568815597r:45411332_45619043|GENSCAN_predicted_peptide_4|237_aa MIAVLGHGLYTYIELPYLLHLVGCEEETMAVDESADRKMSSGNAKIGHPAPNFKATAVMP DGQFKDISLSDYKGKYVVFFFYPLDFTFVCPTEIIAFSDRAEEFKKLNCQVIGASVDSHF CHLAWVNTPKKQGGLGPMNIPLVSDPKRTIAQDYGVLKADEGISFRGLFIIDDKGILRQI TVNDLPVGRSVDETLRLVQAFQFTDKHGEVCPAGWKPGSDTIKPDVQKSKEYFSKQK >gi568815597r:45411332_45619043|GENSCAN_predicted_CDS_4|714_bp atgattgctgtcctgggacatgggctttatacttacattgaattgccttaccttttacat ttagttggctgtgaagaagagacaatggctgttgatgagtcagctgataggaagatgtct tcaggaaatgctaaaattgggcaccctgcccccaacttcaaagccacagctgttatgcca gatggtcagtttaaagatatcagcctgtctgactacaaaggaaaatatgttgtgttcttc ttttaccctcttgacttcacctttgtgtgccccacggagatcattgctttcagtgatagg gcagaagaatttaagaaactcaactgccaagtgattggtgcttctgtggattctcacttc tgtcatctagcatgggtcaatacacctaagaaacaaggaggactgggacccatgaacatt cctttggtatcagacccgaagcgcaccattgctcaggattatggggtcttaaaggctgat gaaggcatctcgttcaggggcctttttatcattgatgataagggtattcttcggcagatc actgtaaatgacctccctgttggccgctctgtggatgagactttgagactagttcaggcc ttccagttcactgacaaacatggggaagtgtgcccagctggctggaaacctggcagtgat accatcaagcctgatgtccaaaagagcaaagaatatttctccaagcagaagtga >gi568815597r:45411332_45619043|GENSCAN_predicted_peptide_5|1177_aa MASLVAHAYNCSTFGRPRSSGSGLTIPLKELSTLVTNDLHVHQIQKFLLWISDNMNIKTM GEYGATDGIVNPEGRHQKNLQRGICYLTSPGKAPAAAPRRAVQCGPAKGEAGPAPCTAHV ASATCLIVPRSSPNPRCGGAMAASCVLLHTGQKMPLIGLGTWKSEPGQVKAAVKYALSVG YRHIDCAAIYGNEPEIGEALKEDVGPGKAVPREELFVTSKLWNTKHHPEDVEPALRKTLA DLQLEYLDLYLMHWPYAFERGDNPFPKNADGTICYDSTHYKETWKALEALVAKGLVQALG LSNFNSRQIDDILSVASVRPAVLQVECHPYLAQNELIAHCQARGLEVTAYSPLGSSDRAW RDPDEPVLLEEPVVLALAEKYGRSPAQILLRWQVQRKVICIPKSITPSRILQNIKVFDFT FSPEEMKQLNALNKNWRYIVPMLTALFRSLVRHLRGTMAMESTATAAVAAELVSADKIED VPAPSTSADKVESLDVDSEAKKLLGLGQKHLVMGDIPAAVNAFQEAASLLGKKYGETANE CGEAFFFYGKSLLELARMENGVLGNALEGVHVEEEEGEKTEDESLVENNDNIDEEAREEL REQVYDAMGEKEEAKKTEDKSLAKPETDKEQDSEMEKGGREDMDISKSAEEPQEKVDLTL DWLTETSEEAKGGAAPEGPNEAEVTSGKPEQEVPDAEEEKSVSGTDVQEECREKGGQEKQ GEVIVSIEEKPKEVSEEQPVVTLEKQGTAVEVEAESLDPTVKPVDVGGDEPEEKVVTSEN EAGKAVLEQLVGQEVPPAEESPEVTTEAAEASAVEAGSEVSEKPGQEAPVLPKDGAVNGP SVVGDQTPIEPQTSIERLTETKDGSGLEEKVRAKLVPSQEETKLSVEESEAAGDGVDTKV AQGATEKSPEDKVQIAANEETQEREEQMKEGEETEGSEEDDKENDKTEEMPNDSVLENKS LQENEEEEIGNLELAWDMLDLAKIIFKRQETKEAQLYAAQAHLKLGEVSVESENYVQAVE EFQSCLNLQEQYLEAHDRLLAETHYQLGLAYGYNSQYDEAVAQFSKSIEVIENRMAVLNE QVKEAEGSSAEYKKEIEELKELLPEIREKIEDAKESQRSGNVAELALKATLRKPEEESPR KDDAKKAKQEPEVNGGSGDAVPSGNEVSENMEEEVGS >gi568815597r:45411332_45619043|GENSCAN_predicted_CDS_5|3534_bp atggccagcttggtggctcatgcctataactgcagcacttttgggaggccaaggtcatct ggctcaggattaaccattccactgaaggagctctccacattggtcaccaatgatcttcat gtccaccaaatccaaaagtttctgctgtggatatcagacaacatgaatataaagactatg ggagagtatggagccacggatggcattgtgaatccggagggccgacaccagaagaacctg caacgtggcatctgctaccttacttcccccggaaaagcgcctgcggcggcgcctaggcgc gcggtgcaatgtgggccagcaaaaggcgaggctggccccgccccttgcaccgcccacgtg gccagcgccacctgcctcattgtgcccaggagttctccaaacccgcgctgcggaggggca atggcggcttcctgtgttctactgcacactgggcagaagatgcctctgattggtctgggt acctggaagagtgagcctggtcaggtaaaagcagctgttaagtatgcccttagcgtaggc taccgccacattgattgtgctgctatctacggcaatgagcctgagattggggaggccctg aaggaggacgtgggaccaggcaaggcggtgcctcgggaggagctgtttgtgacatccaag ctgtggaacaccaagcaccaccccgaggatgtggagcctgccctccggaagactctggct gacctccagctggagtatctggacctgtacctgatgcactggccttatgcctttgagcgg ggagacaaccccttccccaagaatgctgatgggactatatgctacgactccacccactac aaggagacttggaaggctctggaggcactggtggctaaggggctggtgcaggcgctgggc ctgtccaacttcaacagtcggcagattgatgacatactcagtgtggcctccgtgcgtcca gctgtcttgcaggtggaatgccacccatacttggctcaaaatgagctaattgcccactgc caagcacgtggcctggaggtaactgcttatagccctttgggctcctctgatcgtgcatgg cgtgatcctgatgagcctgtcctgctggaggaaccagtagtcctggcattggctgaaaag tatggccgatctccagctcagatcttgctcaggtggcaggtccagcggaaagtgatctgc atccccaaaagtatcactccttctcgaatccttcagaacatcaaggtgtttgacttcacc tttagcccagaagagatgaagcagctaaatgccctgaacaaaaattggagatatattgtg cctatgcttacggctctattccgttcgctggttcgccacctcaggggaacgatggccatg gagtccacagccactgccgccgtcgccgcggagctggtttctgccgacaaaattgaagat gttcctgctccttctacatctgcagataaagtggagagtctggatgtggatagtgaagct aagaaactattgggtttaggacagaaacatctggtgatgggggatattccagcagctgtc aatgcattccaggaagcagctagtcttttaggtaagaagtatggagagacagctaatgag tgtggagaagccttctttttctatgggaaatcacttctggagttggcaagaatggagaat ggtgtgttgggaaacgccttggaaggtgtgcatgtggaagaggaagaaggagaaaaaaca gaagatgaatctctggtagaaaataatgataacatagatgaggaagcaagggaagagttg agagaacaggtttatgacgccatgggagaaaaagaagaagccaaaaaaacagaagacaag tctttggcaaagcctgaaactgataaagaacaggacagtgaaatggagaagggtggaaga gaagatatggatataagtaaatctgcagaggagccacaggaaaaagttgacttgactcta gattggttaactgaaacctctgaagaggcaaaaggaggagcagcaccagaaggaccgaat gaagctgaggtcacttctgggaagccagaacaggaagtaccagatgctgaggaagaaaaa tcagtttctggaactgatgtccaagaagagtgcagagaaaaaggaggtcaggagaagcag ggagaggtaattgtgagcatagaggagaagccaaaagaagtttcagaagagcagcctgtg gtgactctagaaaagcagggcactgcagtggaggtagaagcagagtctttagacccgaca gtcaagccagtggatgtgggtggggacgagccagaggagaaggtagttacctctgaaaac gaggcaggaaaggcggttcttgaacaactggtaggtcaagaagtaccacctgctgaagag tcaccagaggtgacaacagaggctgcagaggcctcagctgtagaggctggatcagaagtc tctgaaaagcctgggcaggaggctccagttctccctaaggatggtgcagtcaatggaccg tcagttgtaggagatcagactcctattgaaccacagacttctatagaaagactgacagaa acaaaagatggctcaggactagaggagaaggtcagggcaaagctggttcctagtcaggag gagactaagctgtctgtagaagagtctgaggcagctggagatggggttgataccaaggta gcccagggagctactgagaaatcacctgaagacaaagttcagatagctgctaatgaagag acacaagagagagaagaacagatgaaagagggtgaagaaactgaaggctcagaagaggat gataaagaaaatgataagaccgaagaaatgccaaatgattcagtccttgaaaacaagtct cttcaagaaaatgaggaggaggagattgggaacctagagcttgcctgggatatgctggat ttagcaaagatcatttttaaaaggcaagaaacaaaagaagcacagctttatgctgcccag gcacatcttaaactcggagaagttagtgttgaatctgaaaactatgtgcaagctgtggag gagttccagtcctgccttaacctgcaggaacagtacctggaagcccacgaccgtctcctt gcagagacccactaccagctgggcttggcttatgggtacaactctcagtatgatgaggca gtggcacagttcagcaaatctattgaagtcattgagaacagaatggctgtactaaacgag caggtgaaggaggctgaaggatcgtctgctgaatacaagaaagaaattgaggaactaaag gaactgctacccgaaattagagagaagatagaagatgcaaaggagtctcagcgtagtggg aatgtagctgaactggctctgaaagctactctgaggaaaccagaggaagagagtccccgg aaagatgatgcaaagaaagccaaacaagagccggaggtgaacggaggcagtggggatgct gtccccagtggaaatgaagtttcggaaaacatggaggaggaggtgggcagttaa