GENSCAN 1.0 Date run: 4-Nov-116 Time: 18:46:25 Sequence gi568815597f:91862755_92114271 : 251517 bp : 40.93% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4553 4622 70 1 1 51 86 75 0.691 4.90 1.02 Intr + 12373 12519 147 2 0 20 63 149 0.135 4.89 1.03 Intr + 22866 23017 152 1 2 -21 131 154 0.012 7.76 1.04 Intr + 23161 23366 206 0 2 -10 -11 230 0.003 0.38 1.05 Term + 34457 34583 127 2 1 93 43 117 0.962 4.47 1.06 PlyA + 35139 35144 6 1.05 2.02 PlyA - 35862 35857 6 1.05 2.01 Sngl - 51251 50946 306 2 0 76 44 139 0.599 3.93 2.00 Prom - 54173 54134 40 -2.85 3.00 Prom + 62861 62900 40 -4.85 3.01 Init + 70441 70498 58 0 1 64 82 29 0.096 1.42 3.02 Intr + 85640 85834 195 0 0 44 94 62 0.113 0.86 3.03 Intr + 89654 89726 73 0 1 85 39 75 0.085 -0.15 3.04 Intr + 99964 100192 229 1 1 79 42 118 0.691 3.25 3.05 Intr + 105392 105506 115 1 1 25 116 125 0.856 8.10 3.06 Intr + 113512 113684 173 2 2 72 108 126 0.994 11.74 3.07 Intr + 114289 114639 351 0 0 77 94 250 0.999 19.09 3.08 Intr + 115414 115542 129 0 0 53 99 89 0.988 6.47 3.09 Intr + 116815 117003 189 0 0 80 68 166 0.952 12.66 3.10 Intr + 117889 118061 173 0 2 67 106 119 0.994 9.42 3.11 Intr + 118135 118424 290 1 2 18 84 243 0.978 12.87 3.12 Intr + 118514 118627 114 0 0 78 51 136 0.977 8.40 3.13 Intr + 118864 119001 138 2 0 101 71 213 0.596 20.41 3.14 Intr + 128430 128491 62 1 2 71 115 31 0.432 1.63 3.15 Intr + 131329 131500 172 0 1 86 110 0 0.762 0.59 3.16 Intr + 139295 139395 101 0 2 44 93 91 0.844 4.11 3.17 Intr + 141660 141865 206 2 2 27 58 200 0.989 7.98 3.18 Intr + 142365 142545 181 0 1 80 92 118 0.966 10.35 3.19 Term + 151452 151520 69 2 0 94 41 38 0.202 -3.34 3.20 PlyA + 153357 153362 6 1.05 4.00 Prom + 156737 156776 40 -7.65 4.01 Init + 167326 167556 231 0 0 64 67 207 0.590 12.51 4.02 Intr + 169751 169836 86 1 2 91 107 70 0.596 6.90 4.03 Intr + 180069 180226 158 0 2 135 82 126 0.991 15.43 4.04 Intr + 182778 182906 129 1 0 119 109 30 0.988 7.95 4.05 Term + 188835 188857 23 1 2 133 38 20 0.531 -1.00 4.06 PlyA + 189570 189575 6 1.05 5.00 Prom + 196880 196919 40 -6.45 5.01 Sngl + 204242 204790 549 1 0 78 38 351 0.906 22.96 5.02 PlyA + 205726 205731 6 1.05 6.02 PlyA - 205839 205834 6 1.05 6.01 Sngl - 212657 211779 879 2 0 86 38 1313 0.877 121.84 6.00 Prom - 213284 213245 40 -5.25 7.00 Prom + 216945 216984 40 -8.95 7.01 Init + 217818 217966 149 2 2 85 91 177 0.986 15.48 7.02 Intr + 225944 226141 198 2 0 111 98 85 0.595 9.34 7.03 Intr + 239719 239915 197 1 2 100 77 89 0.647 7.14 7.04 Intr + 245130 245247 118 1 1 71 99 97 0.575 7.70 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 23352 23124 229 1 1 66 94 207 0.964 15.11 S.002 Init - 25462 25453 10 1 1 83 116 7 0.891 3.82 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:91862755_92114271|GENSCAN_predicted_peptide_1|233_aa MVRGEQRSLTPLCLFCAGCLQRQEYQDGKTQSCSGESAEGKTTFSRGSKRSTCQEEPHEA PLEELAAVRAWKTLRIAGGTACPVRPLPTHRWPPVPAPRRVNTAPPAPPSQGRRSWRLER PWPQVGGKRRQVSPPSARRRRAPAAAAPRQNYAIRTRWGLSPPAGSSGNRAGKVAGAREP PTALLHSLGRGKYCTVKCSLRGYQAAFGYLCGKGTASTYFGRCRLDVGKRNIR >gi568815597f:91862755_92114271|GENSCAN_predicted_CDS_1|702_bp atggtacgaggagagcagagaagcctgactccactctgcctgttctgtgccggctgcctt cagaggcaagaataccaagatggtaagactcaatcctgctctggggaatctgccgaagga aaaacaacattctcacgaggcagcaagagaagcacctgccaggaggagccacatgaagcc ccactggaggagctggcagcagtgagagcctggaagacactgcgcatcgccgggggcact gcctgccccgtgcggccgctgcccacgcaccgctggccaccggtgcctgctccccggagg gtaaacaccgccccaccagcgcctccctcccaaggccggaggtcctggcggctggagcga ccctggccgcaagttggaggaaagcggcggcaagtttccccgcccagcgctcggcggcgg cgagctccggcagctgctgcgccgcggcaaaactacgccatccggacccgctggggactc tcacctcctgcagggagctccgggaatcgcgcagggaaagtggccggggcgcgagagccg ccgactgccctccttcactcgctgggaagaggaaagtattgtactgtcaaatgtagcctg cgggggtatcaggcagcatttgggtatttatgtggcaaagggacagcatcgacatatttt gggagatgccgacttgacgtggggaagaggaatattcgttag >gi568815597f:91862755_92114271|GENSCAN_predicted_peptide_2|101_aa MANIISLALAYCDSPELLQQPPTSGQIPCNPPPTPDLPASSEREGAFLRRKYASHSSVLL GQKNHKALFDLTIVLYFQLYLSPPSPVPMPMGTWLSEIQLH >gi568815597f:91862755_92114271|GENSCAN_predicted_CDS_2|306_bp atggctaatatcatcagcctagcactagcctattgtgactctccggaactactacagcag ccccctacatctggccagatcccctgcaaccctccacctacccccgaccttcctgctagc tctgagagagagggagcatttttaagacgcaaatatgcttcccattcttcagtgctcctt ggacaaaaaaaccataaagctttgtttgatctaactattgtcctatatttccagctttat ctctcccctccctctcccgtcccaatgcccatggggacttggctctctgagatccagctg cactga >gi568815597f:91862755_92114271|GENSCAN_predicted_peptide_3|1005_aa MSQKQKQIFDISVLHFGHSEFHVVALAGLETLGLSEPSALTSPVTGIAGACSHTINTTLN SKNKNTKIKSKQKVLANPGIYVTSIWQVHWPERGSSAECYESASVREERTCTCRQDSKQL RMSLPSRQTAIIVNPPPPEYINTKKNGRLTNQLQYLQKVVLKDLWKHSFSWPFQRPVDAV KLQLPPGDDIVLMAQALEKLFMQKLSQMPQEEQVVGVKERIKKGTQQNIAVSSAKEKSSP SATEKVFKQQEIPSVFPKTSISPLNVVQGASVNSSSQTAAQVTKGVKRKADTTTPATSAV KASSEFSPTFTEKSVALPPIKENMPKNVLPDSQQQYNVVKTVKVTEQLRHCSEILKEMLA KKHFSYAWPFYNPVDVNALGLHNYYDVVKNPMDLGTIKEKMDNQEYKDAYKFAADVRLMF MNCYKYNPPDHEVVTMARMLQDVFETHFSKIPIEPVESMPLCYIKTDITETTGRENTNEA SSEGNSSDDSEDERVKRLAKLQEQLKAVHQQLQVLSQVPFRKLNKKKEKSKKEKKKEKVN NSNENPRKMCEQMRLKEKSKRNQPKKRKQQFIGLKSEDEDNAKPMNYDEKRQLSLNINKL PGDKLGRVVHIIQSREPSLSNSNPDEIEIDFETLKASTLRELEKYVSACLRKRPLKPPAK KIMMSKEELHSQKKQELEKRLLDVNNQLNSRKRQTKSDKTQPSKAVENVSRLSESSSSSS SSSESESSSSDLSSSDSSDSESEMFPKFTEVKPNDSPSKENVKIGYCVQDTTSANTTLVH QTTPSHVMPPNHHQLAFNYQELEHLQTVKNISPLQILPPSGDSEQLSNGITVMHPSGDSD TTMLESECQAPVQKDIKIKNADSWKSLGKPVKPSGVMKSSDELFNQFRKAAIEKEVKART QELIRKHLEQNTKELKASQENQRDLGNGLTVESFSNKIQNKCSGEEQKEHQQSSEAQDKS KLWLLKDRDLARQKEQERRRREAMVGTIDMTLQSDIMTMFENNFD >gi568815597f:91862755_92114271|GENSCAN_predicted_CDS_3|3018_bp atgtctcaaaaacaaaaacaaatatttgatatcagcgtcttacatttcggccattctgag tttcacgttgttgctctggctggccttgaaaccttgggattaagtgagccttccgcttta acctccccagtaacgggcattgcaggcgcctgcagccacaccattaatactactttaaac agcaaaaacaagaacacgaaaattaaatctaaacaaaaagtgcttgccaatccaggtatt tatgtgacgtctatctggcaggtgcactggccagaaaggggctcttcggctgaatgctat gagtccgcttcagtaagagaagaaaggacatgtacttgtagacaggattcaaagcagtta agaatgtctctgccaagtcgacaaacagctattattgttaaccctcctccaccagaatat ataaatactaagaaaaatgggcgattgacaaatcaacttcagtatctacaaaaagttgtc ctaaaggatttatggaagcatagtttttcatggccctttcaacgtcctgtggatgctgtg aaactacagttgcctcctggagatgacattgttcttatggcacaagctctagagaagctg tttatgcagaaattatctcagatgccacaagaagagcaagttgtgggtgttaaggaaaga atcaagaaaggcactcaacagaatatagctgtttcttctgctaaagaaaaatcatcaccc agcgcaacagaaaaagtatttaagcagcaagaaattccttctgtatttcctaagacatct atttctcccttgaacgtggtacagggagcttcagtcaactccagttcacaaactgcggcc caagttacaaaaggtgtgaagaggaaagcagatacaacaactcctgcaacttcagcagtt aaagcaagtagtgaattttctccaacattcacagaaaaatcagtggcactgccacctata aaagaaaatatgccaaagaatgttttgccagattctcagcaacaatataatgttgtgaag actgttaaagtaactgaacaattaaggcactgtagtgagattcttaaagaaatgcttgca aagaaacatttttcatatgcatggcccttttataatcctgttgacgttaatgctttggga ctccataactactatgacgttgtcaaaaatccgatggatcttggaactattaaggagaaa atggataaccaagaatataaggatgcatacaaatttgcggcagatgttagattaatgttc atgaattgctacaagtacaatcctccagatcacgaagttgtgacaatggcaagaatgctt caggatgttttcgaaacgcatttttcaaagatcccgattgaacctgttgagagtatgcct ttatgttacatcaaaacagatatcacagaaaccactggtagagagaacactaatgaagcc tcctctgaagggaactcttctgatgattctgaagatgagcgagttaagcgtcttgcaaag cttcaggagcagcttaaagctgtacatcaacagctccaggttttgtcccaagtacctttc cgtaagctaaataaaaagaaagagaagtctaaaaaggaaaagaaaaaagaaaaggttaat aacagcaatgaaaatccaagaaaaatgtgtgagcaaatgaggctaaaggaaaagtccaag agaaatcagccaaagaaaaggaaacaacagttcattggtctaaaatctgaagatgaagat aatgctaaacctatgaactatgatgagaaaaggcagttaagtctgaatataaacaaactc cctggagataaacttgggcgagtagttcacataatacaatcaagagagccttctctgagc aattccaatcctgatgagatagagatagactttgaaacactgaaagcatcaacactaaga gaattagaaaaatatgtttcggcatgtctaagaaagagaccattaaaacctcctgctaag aaaataatgatgtccaaagaagaacttcactcacagaaaaaacaggaattggaaaagcgg ttactggatgttaataatcagttaaattctagaaaacgtcaaacaaaatctgataaaacg caaccatccaaagctgttgaaaatgtttcccgactgagtgagagcagcagcagcagcagc agctcatcagagtctgaaagtagcagcagtgacttaagctcttcagacagcagtgattct gaatcagaaatgttccctaagtttacagaagtaaaaccaaatgattctccttctaaagag aatgtaaagataggatattgtgtgcaagacacaacctctgccaatactacccttgttcat cagaccacaccttcacatgtaatgccaccaaatcaccaccaattagcatttaattatcaa gaattagaacatttacagactgtgaaaaacatttcacctttacaaattctgcctccctca ggtgattctgaacagctctcaaatggcataactgtgatgcatccatctggtgatagtgac acaacgatgttagaatctgaatgtcaagctcctgtacagaaggatataaagattaagaat gcagattcatggaaaagtttaggcaaaccagtgaaaccatcaggtgtaatgaaatcctca gatgagctcttcaaccaatttagaaaagcagccatagaaaaggaagtaaaagctcggaca caggaactcatacggaagcatttggaacaaaatacaaaggaactaaaagcatctcaagaa aatcagagggatcttgggaatggattgactgtagaatctttttcaaataaaatacaaaac aagtgctctggagaagagcagaaagaacatcagcagtcatcagaagctcaagataaatcc aaactctggcttctcaaagaccgtgatttagcaaggcagaaagaacaagagaggaggagg agagaagcaatggtgggtaccattgatatgacccttcaaagtgacattatgacaatgttt gaaaacaactttgattaa >gi568815597f:91862755_92114271|GENSCAN_predicted_peptide_4|208_aa MARLRDCLPRLMLTLRSLLFWSLVYCYCGLCASIHLLKLLWSLGKGPAQTFRRPAREHPP ACLSDPSLGTHCYVRIKDSGLRFHYVAAGERGKPLMLLLHGFPEFWYSWRYQLREFKSEY RVVALDLRGYGETDAPIHRQNYKLDCLITDIKDILDSLGYSKCVLIGHDWGGMIAWLIAI CYPEMVMKLIVINFPHPNVFTGIGDLEA >gi568815597f:91862755_92114271|GENSCAN_predicted_CDS_4|627_bp atggcgaggctgcgggattgcctgccccgcctgatgctcacgctccggtccctgctcttc tggtccctggtctactgctactgcgggctctgcgcctccatccacctgctcaaacttttg tggagcctcggcaaggggccggcgcagaccttccggcggcccgcccgggagcaccctccc gcgtgcctgagcgacccctccttgggcacccactgctacgtgcggatcaaggattcaggg ttaagatttcactatgttgctgctggagaaagaggcaaaccacttatgctgctgcttcat ggatttccagaattctggtattcttggcgttaccaactgagagaatttaaaagtgaatat cgagttgtagcactggatttgagaggttatggagaaacagatgctcccattcatcgacag aattataaattggattgtctaattacagatataaaggatattttagattctttagggtat agcaaatgtgttcttattggccatgactgggggggcatgattgcttggctaattgccatc tgttatcctgaaatggtgatgaagcttattgttattaacttccctcatccaaatgtattt acaggtattggtgatctagaggcttga >gi568815597f:91862755_92114271|GENSCAN_predicted_peptide_5|182_aa MAIWTWQGFTALQACMLTLSQLFTRVEVEFMPVYIPNDEEKKDPILFANTAHINMANALG VPGTGHTYEGCRVMISAGNLQLPVEAGLVEFTNISQKLKLDRDNIHQHLDEYAAIAVASK GGKTGIEEFSSYLKLPISEPLRQPPAHFGRNNDGNIDFREYVIGLKRVCKCPLSFLILMR MV >gi568815597f:91862755_92114271|GENSCAN_predicted_CDS_5|549_bp atggcaatctggacctggcagggattcacagccttacaggcctgtatgttgacactcagt caactcttcaccagggtagaagttgagtttatgcctgtttatatcccaaatgatgaagaa aaaaaagaccccatcctttttgccaatacagcacacatcaacatggcaaatgctctaggg gtgcctgggacagggcacacttacgaaggctgcagagtgatgatctccgcaggtaacctt caactacccgtagaagctggtttggtggaatttacaaacattagccagaagttgaagtta gacagggataacattcatcagcatttggatgaatatgctgcaattgcagttgcctcaaaa ggagggaagacaggaattgaagaattttcaagttatttaaaactcccaatttcagagccc ttgagacaaccccctgcccacttcggcaggaataatgatggcaacatagacttcagagag tatgtaataggtctgaaaagagtctgcaaatgtcctttaagctttttgatcttgatgcgg atggtttaa >gi568815597f:91862755_92114271|GENSCAN_predicted_peptide_6|292_aa MAPKRQSPLPLQKKKPRPPPALGLEETSASAGLPKKGEKEQQEAIEHIDEVQNEIDRLNE QDSEEILKVEQKYNKLRQPFFQKRSELIAKIPNFGVTTFVNHPQVSSLLGEEDEEALHYL TKVEVTEFEDIKSGYRIDFYFDENPYFENKVFSKEFHLNESGDPSSKSTKIKWKSGKDVT KRSSQTQNKASRKRQHEEPESFFTWFTDHSDAGADELEEVIKDDIWPNPLQYYLVPDMDD EEGGEDDDDDDDDGDEGEEELEDIDEGDEDEGEEDEDDDEGEEGEEDEGEDD >gi568815597f:91862755_92114271|GENSCAN_predicted_CDS_6|879_bp atggcccctaaacgccagtctccactcccacttcaaaagaagaaaccaagaccacctcct gctctgggactggaggagacatcggcctctgcaggcttgccgaagaagggagaaaaagaa cagcaagaagcaattgaacacattgatgaagtacaaaatgaaatagacagacttaatgaa caagacagtgaggagattttgaaagtagaacagaaatataacaaactccgccaaccattt tttcagaagaggtcagaattgatcgccaaaatcccaaattttggggtaacaacatttgtc aaccatccacaagtgtcttcactgcttggggaggaggacgaagaggcactgcattatttg actaaagttgaagtgacagaatttgaagatattaaatcaggttacagaatagatttttat tttgatgaaaatccttactttgaaaataaagttttctccaaagaatttcatctgaatgag agtggtgatccatcttcaaagtccaccaaaatcaaatggaaatctggaaaggatgtgacg aaacgttcaagtcaaacgcagaataaagccagcaggaagaggcagcatgaggaaccagag agcttctttacctggtttactgatcattctgatgcaggtgctgatgagttagaagaggtc atcaaagatgatatttggccaaacccattacagtattacttggttcctgatatggatgat gaagaaggaggagaagatgatgatgatgatgatgatgatggtgatgaaggggaggaagaa ttagaagatattgatgaaggggatgaggatgaaggtgaagaagatgaagatgatgatgaa ggggaggaaggagaggaggatgaaggagaagatgactaa >gi568815597f:91862755_92114271|GENSCAN_predicted_peptide_7|221_aa MARCGEGSAAPMVLLGSAGVCSKGLQRKGPCERRRLKATVSEQLSQDLLRLLREEFHTDV TFSVGCTLFKAHKAVLLARVPDFYFHTIGQTSNSLTNQEPIAVENVEALEFRTFLQIIYS SNRNIKNYEEEILRKKIMEIGISQKQLDISFPKCENSSDCSLQKHEIPEDISDRDDDFIS NDNYDLEPASELGEDLLKLYVKPCCPDIDIFVDGKRFKAHS >gi568815597f:91862755_92114271|GENSCAN_predicted_CDS_7|663_bp atggctcgctgtggggaaggcagtgcggcccccatggtacttctggggtccgctggagtt tgcagtaaggggttgcaaaggaaggggccgtgtgagcggcgccggctgaaggcgacggtg tcggagcagctcagccaggatttgctcaggcttctaagggaagaattccatacagatgtt accttctctgtgggttgtactttgttcaaagcacacaaagcagtccttttagcaagagtt cctgacttctattttcatactattggacagacatcaaatagtttaacaaatcaggagcct attgctgtggagaatgttgaagctttagaatttagaacgtttttacagattatatattca tcaaacagaaacataaaaaactatgaagaggaaattcttaggaaaaagataatggagatt gggatatcacaaaagcaacttgacatcagttttccaaagtgtgaaaactcatctgattgt tctcttcagaagcatgaaattccagaggatatcagtgacagagatgatgatttcatttcc aatgataattatgacttggagcctgcatctgaattaggagaagatttattgaagctttat gtgaaaccttgttgcccagatattgatatttttgttgatggaaaacgttttaaagctcac agn