GENSCAN 1.0	Date run: 4-Nov-116	Time: 00:08:51

Sequence gi568815587r:77969990_78179744 : 209755 bp : 45.16% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 Intr -   9113   9007  107  0  2   80  116    59 0.886   7.83
 1.03 Intr -  11587  11470  118  1  1  110  106    77 0.980  11.74
 1.02 Intr -  21310  21119  192  1  0   96  116    19 0.942   5.19
 1.01 Init -  24654  24601   54  0  0   94  115    68 0.986  11.38
 1.00 Prom -  33071  33032   40                               -4.56

 2.03 PlyA -  33775  33770    6                                1.05
 2.02 Term -  47281  46604  678  1  0  102   42   810 0.964  71.49
 2.01 Init -  53221  53171   51  1  0   91  127    30 0.932   6.50
 2.00 Prom -  55141  55102   40                               -5.36

 3.00 Prom +  55155  55194   40                               -2.56
 3.01 Init +  56596  56598    3  0  0  108  101     0 0.346   3.30
 3.02 Intr +  64964  65083  120  1  0   38   41   129 0.229   3.89
 3.03 Intr +  74365  74443   79  0  1   66   48    61 0.000  -0.98
 3.04 Intr +  92164  92217   54  2  0  148   82    42 0.251   8.55
 3.05 Term +  93882  94333  452  1  2   29   41   685 0.233  53.25
 3.06 PlyA +  95812  95817    6                                1.05

 4.04 PlyA -  96347  96342    6                               -0.45
 4.03 Term -  97723  97659   65  2  2   30   49    99 0.196  -1.75
 4.02 Intr - 103152 103009  144  1  0   54   63    61 0.233   0.45
 4.01 Init - 109755 109590  166  0  1   94  105   288 0.972  29.19
 4.00 Prom - 120701 120662   40                               -4.06

 5.00 Prom + 132080 132119   40                               -4.96
 5.01 Init + 166724 166760   37  1  1   69  101    37 0.900   3.27
 5.02 Intr + 169481 169585  105  0  0   59   42    99 0.765   2.59
 5.03 Intr + 171606 171713  108  1  0   42   93   215 0.947  17.66
 5.04 Term + 177937 177947   11  2  2  106   54     0 0.213  -3.24
 5.05 PlyA + 183010 183015    6                                1.05

 6.04 PlyA - 183491 183486    6                                1.05
 6.03 Term - 199242 199104  139  2  1   93   43    43 0.215  -2.26
 6.02 Intr - 200624 200535   90  1  0   54  100    42 0.570   1.11
 6.01 Init - 204565 203787  779  1  2   68   98  1312 0.836 123.17
 6.00 Prom - 205496 205457   40                               -1.86

 7.00 Prom + 206106 206145   40                               -6.26
 7.01 Init + 206821 206882   62  0  2  116   37   105 0.448   8.92
 7.02 Intr + 208463 208562  100  2  1   55   67    60 0.121   0.71

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  82770  82676   95  0  2   71   80    53 0.938   2.65
S.002 Init + 113507 113543   37  1  1   92  110    49 0.861   7.70

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:77969990_78179744|GENSCAN_predicted_peptide_1|157_aa
MAAHLKKRVYEEFTKVVQPQEEIATKKLRLTKPSKSAALHIDLCKATSPADALQYLLQFA
RKPVEAESVEGVVRILLEHYYKENDPSVRLKIASLLGLLSKTAGFSPDCIMDDAINILQN
EKSHQVLAQLLDTLLAIGTKLPENQAIQMRLVDVACK
>gi568815587r:77969990_78179744|GENSCAN_predicted_CDS_1|471_bp
atggcggcgcaccttaagaagcgggtttatgaggaattcacgaaagtggttcagccacag
gaggaaattgctactaagaaactccgactaacaaaaccaagtaaatctgcagcactccac
atagatctgtgtaaagctacctccccagcagatgctttgcaatacttgctccagtttgcc
aggaagcctgtcgaggcggaaagcgtagagggagtagtcaggattctcttggaacattat
tacaaggagaatgatccatctgtgagactgaaaattgcatcattgttgggtttattatca
aagacagcaggattttcaccagactgcattatggatgatgccatcaacatcctgcagaat
gaaaagtctcatcaagtcctagctcaactgctggatactttgcttgcaattggcactaag
ctaccagagaatcaagctatccaaatgcgattagttgatgtggcctgcaag
>gi568815587r:77969990_78179744|GENSCAN_predicted_peptide_2|242_aa
MTSQTPLPQSPRPRRPTMSTVVELNVGGEFHTTTLGTLRKFPGSKLAEMFSSLAKASTDA
EGRFFIDRPSTYFRPILDYLRTGQVPTQHIPEVYREAQFYEIKPLVKLLEDMPQIFGEQV
SRKQFLLQVPGYSENLELMVRLARAEAITARKSSVLVCLVETEEQDAYYSEVLCFLQDKK
MFKSVVKFGPWKAVLDNSDLMHCLEMDIKAQGYKVFSKFYLTYPTKRNEFHFNIYSFTFT
WW
>gi568815587r:77969990_78179744|GENSCAN_predicted_CDS_2|729_bp
atgacgagccagacccctctgccccagtccccccggcccaggcggccaacgatgtctact
gttgtggagctgaacgtcgggggtgagttccacaccaccaccctgggtaccctgaggaag
tttccgggctcaaagctggcagagatgttctctagcttagccaaggcctccacggacgcg
gagggccgcttcttcatcgaccgccccagcacctatttcagacccatcctggactacctg
cgcactgggcaagtgcccacacagcacatccctgaagtgtaccgtgaggctcagttctac
gaaatcaagcctttggtcaagctgctggaggacatgccacagatctttggtgagcaggtg
tctcggaagcagtttttgctgcaagtgccgggctacagcgagaacctggagctcatggtg
cgcctggcacgtgcagaagccataacagcacggaagtccagcgtgcttgtgtgcctggtg
gaaactgaggagcaggatgcatattattcagaggtcctgtgttttctgcaggataagaag
atgttcaagtctgttgtcaagtttgggccctggaaggcggtcctagacaacagcgacctc
atgcactgcctggagatggacattaaggcccaggggtacaaggtattctccaagttctac
ctgacgtaccccaccaaaagaaacgaattccattttaacatttattcattcaccttcacc
tggtggtga
>gi568815587r:77969990_78179744|GENSCAN_predicted_peptide_3|235_aa
MPEIEVNLQRNIELRVDENLIAGYHLSTDEALPKALPLEFPKNLPEKDLVEESFYLGVRG
LRQNETGGVGGIIIQQISPEAVEEAEEATMQVLTKRYPKNCLLTVMDRYAAEVHNMEQVV
MIPSLLRDVQLSGPGGQAQAEAPDLYTYFTMLKAICVDVDHGLLPREEWQAKVAGSEENG
TAETEEVEDESASGELDLEAQFHLHFSSLHHILMHLTEKAQEVTRKYQEMTGQVW
>gi568815587r:77969990_78179744|GENSCAN_predicted_CDS_3|708_bp
atgcctgagattgaagttaacctacaaaggaacatagaactaagagttgatgagaacctg
attgccggctatcatttgagcactgatgaagctctgcctaaggccctacccctggagttt
ccgaaaaatcttcctgaaaaggacctggtggaagagagcttctatctaggggtgagggga
ctccggcagaatgagactggaggcgtgggaggtatcatcattcagcagatttcaccagag
gcagtggaggaggcagaggaagcaaccatgcaggtgctaaccaagcgttaccccaagaac
tgcctgctgaccgtcatggaccggtatgcagccgaggtgcacaacatggagcaggtggtg
atgatccccagccttctgcgggacgtgcagctgagtgggcctgggggccaggcccaggct
gaggcccctgatctctacacctacttcaccatgctcaaggccatctgtgtggatgtggac
catgggctgctgccgcgggaggagtggcaggccaaggtggcaggcagcgaagagaatgga
accgcagagacagaggaagtcgaggacgagagtgcctcaggagagctggacctggaagcc
cagttccacctgcacttctccagcctccatcacatcctcatgcacctcaccgagaaagcc
caggaggtgacaaggaaataccaggaaatgacgggacaagtttggtag
>gi568815587r:77969990_78179744|GENSCAN_predicted_peptide_4|124_aa
MIARRNPEPLRFLPDEARSLPPPKLTDPRLLYIGFLGYCSGLIDNLIRRRPIATAGLHRQ
LLYITAFFFAGYYLVKREDYLYAVRDREMFGYMKLHPEDFPEEVNCSADLIKESVFYHVK
GSVE
>gi568815587r:77969990_78179744|GENSCAN_predicted_CDS_4|375_bp
atgatcgcacggcggaacccagaacccttacggtttctgccggatgaggcccggagcctg
cccccgcccaagctgaccgacccgcggctcctctacatcggcttcttgggctactgctcc
ggcctgattgataacctaatccggcggaggccgatcgcgacggctggtttgcatcgccag
cttctatatattacggcctttttttttgctggatattatcttgtaaaacgtgaagactac
ctgtatgctgtgagggaccgtgaaatgtttggatatatgaaattacatccagaggatttt
cctgaagaagtgaactgctccgccgacctcatcaaagagtctgtcttctatcatgtgaag
ggatctgtggagtga
>gi568815587r:77969990_78179744|GENSCAN_predicted_peptide_5|86_aa
MAQLDVIDGIFQSPGRASPLLCGDEKAFEKSHPERQSRKPIASTRGNSTVKWAAEDDDDD
DLDTEKQKTNEDDQTAKKDKLKEGEK
>gi568815587r:77969990_78179744|GENSCAN_predicted_CDS_5|261_bp
atggctcaactagatgtcatagatggcatcttccaatcccctggccgtgcgagtccctta
ctatgtggggatgagaaggcatttgagaagagtcaccccgagcgccaaagccgaaaacca
attgccagtacccgtggcaattctacagtcaaatgggcagctgaagatgatgatgatgat
gatcttgacaccgagaagcagaagaccaatgaagatgaccagacagcaaaaaaggataag
ttaaaagaaggtgaaaaatga
>gi568815587r:77969990_78179744|GENSCAN_predicted_peptide_6|335_aa
MSDPITLNVGGKLYTTSLATLTSFPDSMLGAMFSGKMPTKRDSQGNCFIDRDGKVFRYIL
NFLRTSHLDLPEDFQEMGLLRREADFYQVQPLIEALQEKEVELSKAEKNAMLNITLNQRV
QTVHFTVREAPQIYSLSSSSMEVFNANIFSTSCLFLKLLGSKLFYCSNGNLSSITSHLQD
PNHLTLDWVANVEGLPEEEYTKQNLKRLWVVPANKQINSFQVFVEEVLKIALSDGFCIDS
SHPHALDFMNNKIIRLIRYRLGELSLLLGKSMRKTENKRSAVMAAPSTDGGLWADGGGWL
GNVACDQLQEQNNPDNRPIKGRDESLAALSAQTAW
>gi568815587r:77969990_78179744|GENSCAN_predicted_CDS_6|1008_bp
atgtccgaccccatcacgctgaacgtcggggggaagctctatacaacctcactggcgacc
ctgaccagcttccctgactccatgctaggcgccatgttcagcgggaagatgcccaccaag
agggacagccagggcaactgcttcattgaccgtgacggcaaagtgttccgctatatcctc
aacttcctgcggacctcccaccttgacctgcctgaggacttccaggagatggggctgctc
cgcagggaggccgacttctaccaggtgcagcccctgattgaggccctgcaggagaaggaa
gtggagctctccaaggccgagaagaatgccatgctcaacatcacactgaaccagcgtgtg
cagacggtccacttcactgtgcgcgaggcaccccagatctacagcctctcctcttccagc
atggaggtcttcaacgccaacatcttcagcacctcctgcctcttcctcaagctccttggc
tctaagctcttctactgctccaatggcaatctctcctccatcaccagccacttgcaggac
cccaaccacctgactctggactgggtggccaatgtggagggcctgccagaggaggagtac
accaagcagaacctcaagaggctctgggtggtgcccgccaacaagcagatcaacagcttc
caggtcttcgtggaagaggtactgaaaatcgctctgagcgatggcttctgcatcgattct
tctcacccacatgctctggattttatgaacaataagattattcgattaatacggtacagg
ttaggtgaactcagccttctgcttggcaaaagcatgagaaagacggagaacaaacgttca
gcagtgatggcagcaccatcgacagacggggggctctgggctgatgggggtggctggctg
ggaaacgtggcctgtgatcagctccaggagcaaaacaaccctgataacaggcccatcaaa
gggagggacgagagtctggcagctctgtctgcccagacagcctggtga
>gi568815587r:77969990_78179744|GENSCAN_predicted_peptide_7|54_aa
MTKSCLYNNNNNNNNNNNKAGATVSIITNPRPCYKRQNDMTRGEFHESAEVCSQ
>gi568815587r:77969990_78179744|GENSCAN_predicted_CDS_7|162_bp
atgacgaaatcctgtctctacaacaacaacaacaacaacaacaacaacaacaacaaagca
ggggccacagtgtccatcatcacaaatccacggccctgctacaaaaggcaaaatgacatg
acacgaggggagtttcatgagtcggcggaggtttgcagtcag