GENSCAN 1.0	Date run: 5-Nov-116	Time: 13:21:43

Sequence gi568815587r:63474948_63698163 : 223216 bp : 44.51% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   6761   6893  133  1  1   78   47    67 0.340   1.90
 1.02 Term +   9615   9955  341  1  2   27   36   303 0.475  13.70
 1.03 PlyA +  10520  10525    6                               1.05

 2.03 PlyA -  12119  12114    6                               1.05
 2.02 Term -  13319  13309   11  0  2   94   40    10 0.050  -4.84
 2.01 Init -  16087  15940  148  1  1   85   76   120 0.893  10.94
 2.00 Prom -  22484  22445   40                              -3.56

 3.00 Prom +  27067  27106   40                              -2.86
 3.01 Init +  31512  31580   69  2  0   44  117    61 0.926   5.65
 3.02 Intr +  33606  33694   89  2  2   72  110    91 0.995   8.47
 3.03 Intr +  33831  34044  214  0  1   24   67   241 0.952  14.32
 3.04 Intr +  34831  34950  120  0  0  113  101   175 0.999  21.99
 3.05 Intr +  35516  35554   39  1  0   93   84    40 0.514   2.52
 3.06 Intr +  36132  36158   27  2  0  102  105    10 0.502   2.41
 3.07 Term +  41300  41446  147  1  0  104   55   197 0.813  15.90
 3.08 PlyA +  41803  41808    6                               1.05

 4.00 Prom +  43392  43431   40                              -6.16
 4.01 Init +  50763  51048  286  2  1   86   22   173 0.373   7.75
 4.02 Term +  53993  54006   14  0  2  148   42    11 0.346   0.86
 4.03 PlyA +  54419  54424    6                               1.05

 5.00 Prom +  60224  60263   40                              -4.66
 5.01 Init +  61922  61930    9  1  0   78  123     5 0.570   3.29
 5.02 Intr +  64569  64677  109  2  1   89   80   119 0.994  11.06
 5.03 Intr +  69653  69942  290  0  2   80  110   270 0.615  25.36
 5.04 Term +  71202  71309  108  2  0   88   48    20 0.322  -3.49
 5.05 PlyA +  71488  71493    6                               1.05

 6.06 PlyA -  73619  73614    6                               1.05
 6.05 Term -  78118  78017  102  1  0  130   35   107 0.963   7.88
 6.04 Intr -  83713  83445  269  2  2   75   57   329 0.992  25.75
 6.03 Intr -  85246  85138  109  1  1  131   80   107 0.911  14.06
 6.02 Intr -  87740  87669   72  2  0   74  119    50 0.987   6.30
 6.01 Init -  88377  88369    9  0  0   84  111    11 0.744   2.87
 6.00 Prom -  98236  98197   40                              -3.06

 7.05 PlyA -  98373  98368    6                               1.05
 7.04 Term - 100099  99998  102  1  0  140   36    84 0.897   6.68
 7.03 Intr - 115421 115153  269  0  2   73   96   468 0.705  43.35
 7.02 Intr - 123216 123114  103  0  1  102   71    65 0.754   5.95
 7.01 Init - 132342 132274   69  0  0   78   75   104 0.536   7.16
 7.00 Prom - 142168 142129   40                              -1.56

 8.11 PlyA - 142645 142640    6                               1.05
 8.10 Term - 147720 147652   69  0  0  109   35    22 0.158  -3.06
 8.09 Intr - 156524 156093  432  2  0  117  111   391 0.704  38.14
 8.08 Intr - 158150 158079   72  2  0   60   94    57 0.890   3.10
 8.07 Intr - 161387 161260  128  0  2   79   94    33 0.925   3.40
 8.06 Intr - 168548 168410  139  2  1   81   93    96 0.992   9.44
 8.05 Intr - 169314 169222   93  0  0   80  111    17 0.879   3.36
 8.04 Intr - 177039 176989   51  0  0   67   76    43 0.494   0.10
 8.03 Intr - 177628 177524  105  1  0  109   98    54 0.991   8.91
 8.02 Intr - 183957 183814  144  0  0   51  116    33 0.857   2.88
 8.01 Init - 184297 184091  207  1  0   85  110   210 0.595  21.72
 8.00 Prom - 188921 188882   40                              -4.16

 9.00 Prom + 189427 189466   40                              -9.55
 9.01 Init + 189988 190086   99  0  0   43   41   111 0.584   2.16
 9.02 Intr + 193303 193354   52  0  1   68   89    43 0.860   0.88
 9.03 Term + 195975 196207  233  1  2   29   46   213 0.881   7.84
 9.04 PlyA + 198739 198744    6                               1.05

10.04 PlyA - 201845 201840    6                               1.05
10.03 Term - 206856 206713  144  0  0  117   48   113 0.148   8.11
10.02 Intr - 216932 216769  164  0  2   90   70    92 0.069   7.29
10.01 Init - 218367 218319   49  0  1   94   58    50 0.883   1.71

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 206690 206831  142  1  1  108   90    75 0.824  10.02
S.002 Term - 216932 216637  296  0  2   90   54   144 0.818   6.67

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:63474948_63698163|GENSCAN_predicted_peptide_1|157_aa
MEYYAAIKNDEFMSFVGTWMKLETIILSKLSQGQKTKHRMFSLIDTEIAFDKIQHPFMIK
NLNKISIKGTNLKAIKAIYDKPTANIIVIGEKLKAFPQITGKRRGCPLSPHLFDIVLEVL
ARAIRQEKEIKGIQISKKEVKLLLFADMIIDLENPKA
>gi568815587r:63474948_63698163|GENSCAN_predicted_CDS_1|474_bp
atggaatactatgcagccataaaaaatgatgagttcatgtcctttgtagggacatggatg
aagctggaaaccatcattctcagcaaactatcgcaaggacaaaaaaccaaacaccgcatg
ttctcactcatagacacagaaatagcatttgacaagatccagcatccctttatgattaaa
aacctcaacaaaatcagcataaaagggacaaacctcaaggcaataaaagccatctacgac
aaacccacagccaacattatagtgattggggaaaagttgaaagcattcccccagataact
ggaaaaagacgaggatgcccactttcaccacatttattcgacatagtgctagaagtccta
gccagagcaatcagacaagagaaagaaataaagggcatccaaatcagtaaaaaggaagtc
aaactgttgctgtttgctgatatgatcatagacctggaaaaccctaaagcctaa
>gi568815587r:63474948_63698163|GENSCAN_predicted_peptide_2|52_aa
MGLSPGAEGEYALRLPRIPPPLPKPASRTASTGPKDQPPALRRSAVPHSGNN
>gi568815587r:63474948_63698163|GENSCAN_predicted_CDS_2|159_bp
atgggcctgagcccgggcgccgagggggagtacgcgctccgcctccctaggattccccca
cccctccccaaacccgcctcgcgaaccgccagtaccgggcccaaggaccagccgcctgcg
ctcagacgttcagctgtgccccactcaggcaacaactga
>gi568815587r:63474948_63698163|GENSCAN_predicted_peptide_3|234_aa
MSPGEKLDPIPDSFILQPPVFHPVVPYVTTIFGGLHAGKMVMLQGVVPLDAHRFQVDFQC
GCSLCPRPDIAFHFNPRFHTTKPHVICNTLHGGRWQREARWPHLALRRGSSFLILFLFGN
EEVKVSVNGQHFLHFRYRLPLSHVDTLGIFGDILVEAVGFLNINPFVEGSREYPAGHPFL
LMSPRLVLLLFQEGGLKLALNGQGLGATSMNQQALEQLRELRISGSVQLYCVHS
>gi568815587r:63474948_63698163|GENSCAN_predicted_CDS_3|705_bp
atgtcacctggagaaaaactggacccaattcctgacagcttcattctgcaaccaccagtc
ttccacccggtggttccttatgtcacgacgatttttggaggcctgcatgcaggcaagatg
gtcatgctgcaaggagtggtccctctagatgcacacaggtttcaggtggacttccagtgt
ggctgcagcctgtgtccccggccagatatcgccttccacttcaaccctcgcttccatacc
accaagccccatgtcatctgcaacaccctgcatggtggacgctggcaaagggaggcccgg
tggccccacctggccctgcgaagaggctccagcttcctcatcctctttctcttcgggaat
gaggaagtgaaggtgagtgtgaatggacagcactttctccacttccgctaccggctccca
ctgtctcatgtggacacgctgggtatatttggtgacatcctggtagaggctgttggattc
ctgaacatcaatccatttgtggagggcagcagagagtacccagctggacatcctttcctg
ctgatgagccccaggctggtgctgctcctgttccaggagggagggctgaagctggcgctc
aatgggcaggggctgggggccaccagcatgaaccagcaggccctggagcagctgcgggag
ctccggatcagtggaagtgtccagctctactgtgtccactcctga
>gi568815587r:63474948_63698163|GENSCAN_predicted_peptide_4|99_aa
MAIRPKVIYRFNAIPIMLPMTFFTELEKTTLKFIRNQKGAHIAKSILSQKNKAGGIMLPD
FKLYYKTTVTKTAWYWYQNRDIDQWNRTEPSEIMPRKCK
>gi568815587r:63474948_63698163|GENSCAN_predicted_CDS_4|300_bp
atggccatacggcccaaggtaatttatagattcaatgccatccccatcatgctaccaatg
actttcttcacagaattggaaaaaactactttaaagttcatacggaaccaaaaaggagcc
cacatcgccaagtcaatcctaagccaaaagaacaaagccggaggcatcatgctacctgac
ttcaaactatactacaagactacagtaaccaaaacagcatggtactggtaccaaaacaga
gatatagatcaatggaacagaacagagccctcagaaataatgccacgtaagtgtaagtaa
>gi568815587r:63474948_63698163|GENSCAN_predicted_peptide_5|171_aa
MASPHQEPKPGDLIEIFRLGYEHWALYIGDGYVIHLAPPMRVPLIAGEYPGAGSSSVFSV
LSNSAEVKRERLEDVVGGCCYRVNNSLDHEYQPRPVEVIISSAKEMVGQKMKYSIVSRNC
EHFVTQLRYGKSRCKQVEKAKVEVGVATALGILVVAGCSFAIRRYQKKATA
>gi568815587r:63474948_63698163|GENSCAN_predicted_CDS_5|516_bp
atggcttcgccacaccaagagcccaaacctggagacctgattgagattttccgccttggc
tatgagcactgggccctgtatataggagatggctacgtgatccatctggctcctccaatg
agagtgcctctgattgcaggtgagtaccccggggctggctcctccagtgtcttctcagtc
ctgagcaacagtgcagaggtgaaacgggagcgcctggaagatgtggtgggaggctgttgc
tatcgggtcaacaacagcttggaccatgagtaccaaccacggcccgtggaggtgatcatc
agttctgcgaaggagatggttggtcagaagatgaagtacagtattgtgagcaggaactgt
gagcactttgtcacccagctgagatatggcaagtcccgctgtaaacaggtggaaaaggcc
aaggttgaagtcggtgtggccacggcgcttggaatcctggttgttgctggatgctctttt
gcgattaggagataccaaaaaaaagcgacagcctga
>gi568815587r:63474948_63698163|GENSCAN_predicted_peptide_6|186_aa
MALVVNKQSYEVGIITANFMDRRNDILARPRPRLGDLIEISRFGYAHWAIYVGDGYVVHL
APASEIAGAGAASVLSALTNKAIVKKELLSVVAGGDNYRVNNKHDDRYTPLPSNKIVKRA
EELVGQELPYSLTSDNCEHFVNHLRYGVSRSDQVTGAVTTVGVAAGLLAAASLVGILLAR
SKRERQ
>gi568815587r:63474948_63698163|GENSCAN_predicted_CDS_6|561_bp
atggctttggtagttaataagcaatcctatgaagtaggcataattactgccaacttcatg
gataggaggaatgatatactggccagaccaagaccgagacttggagacctgattgagatt
tctcgctttggctatgcacactgggccatctacgtgggagatggctatgtggtccatctg
gctccggcaagtgaaattgctggagctggtgcggccagtgtcctgtctgccctgaccaac
aaagccatagtgaagaaggaactgctgtctgtggtggctgggggagacaactacagggtc
aataacaagcacgatgacagatacacaccactgccttccaacaaaatcgtcaagcgggca
gaggagttggtggggcaggagttgccttattcgctgaccagtgacaactgcgagcacttc
gtgaaccatctgcgctatggcgtctcccgcagtgaccaggtcactggtgcagtcacgaca
gtaggtgtggcagcaggcctgctggctgccgcaagccttgtggggatcctgctggccaga
agcaagcgggaaaggcaataa
>gi568815587r:63474948_63698163|GENSCAN_predicted_peptide_7|180_aa
MTPSSILMCALCPRLSLGYPLPSPEPKPGDLIEIFRPFYRHWAIYVGDGYVVHLAPPSEV
AGAGAASVMSALTDKAIVKKELLYDVAGSDKYQVNNKHDDKYSPLPCSKIIQRAEELVGQ
EVLYKLTSENCEHFVNELRYGVARSDQVRDVIIAASVAGMGLAAMSLIGVMFSRNKRQKQ
>gi568815587r:63474948_63698163|GENSCAN_predicted_CDS_7|543_bp
atgacccctagctccatactgatgtgtgcgctgtgtccccgcctctcgttgggatacccg
ctgcccagcccagagcctaagcctggagacctgattgagatttttcgccctttctacaga
cactgggccatctatgttggcgatggatatgtggttcatctggcccctccaagtgaggtc
gcaggagctggtgcagccagtgtcatgtccgccctgactgacaaggccatcgtgaagaag
gaattgctgtatgatgtggccgggagtgacaagtaccaggtcaacaacaaacatgatgac
aagtactcgccgctgccctgcagcaaaatcatccagcgggcggaggagctggtggggcag
gaggtgctctacaagctgaccagtgagaactgcgagcactttgtgaatgagctgcgctat
ggagtcgcccgcagtgaccaggtcagagatgtcatcatcgctgcaagcgttgcaggaatg
ggcttggcagccatgagccttattggagtcatgttctcaagaaacaagcgacaaaagcaa
taa
>gi568815587r:63474948_63698163|GENSCAN_predicted_peptide_8|479_aa
MESSKPGPVQVVLVQKDQHSFELDEKALASILLQDHIRDLDVVVVSVAGAFRKGKSFILD
FMLRYLYSQKESGHSNWLGDPEEPLTGFSWRGGSDPETTGIQIWSEVFTVEKPGGKKVAV
VLMDTQGAFDSQSTVKDCATIFALSTMTSSVQIYNLSQNIQEDDLQQLQTLMFLVRDWSF
PYEYSYGLQGGMAFLDKRLQVKEHQHEEIQNVRNHIHSCFSDVTCFLLPHPGLQVATSPD
FDGKLKDIAGEFKEQLQALIPYVLNPSKLMEKEINGSKVTCRGLLEYFKATAEANNLAAA
ASAKDIYYNNMEEVCGGEKPYLSPDILEEKHCEFKQLALDHFKKTKKMGGKDFSFRYQQE
LEEEIKELYENFCKHNGSKNVFSTFRTPAVLFTGIVALYIASGLTGFIGLEVVAQLFNCM
VGLLLIALLTWGYIRYSGQYRELGGAIDFGAAYVLEQIGFLHAPGEDRHQWFQVYVIPL
>gi568815587r:63474948_63698163|GENSCAN_predicted_CDS_8|1440_bp
atggagagcagcaagcctggtccagtgcaggttgttttggttcagaaagatcaacattcc
tttgagctagatgagaaagccttggccagcatcctcttgcaggaccacatccgagatctt
gatgtggtggtggtttcagtggctggtgccttccgaaagggcaagtccttcattctggat
tttatgctacgatacttatattctcagaaggaaagtggccattcaaattggttgggtgac
ccagaagaaccgttaacaggattttcctggagagggggatctgatccagaaaccactggg
attcaaatctggagtgaagttttcactgtggagaagccaggtgggaagaaggttgcagtt
gttctgatggatacccagggggcatttgacagccagtcaactgtgaaagactgtgctacc
atctttgctctaagcactatgactagttctgttcagatttataatttatctcagaacatt
caagaagatgatcttcaacagctgcagacactgatgtttttggttagagattggagtttc
ccttatgaatatagctatggactccaaggaggaatggcatttttggataagcgtttacag
gtgaaggaacatcaacatgaagaaattcagaatgttcgaaatcacattcactcatgtttc
tccgatgtcacctgctttctcttaccacatccaggactccaggtggccacaagccctgac
tttgatgggaaattaaaagatattgctggtgaattcaaagagcagttacaggcactgata
ccgtatgtattaaacccatctaagttaatggaaaaggagatcaatggctcaaaggtcacc
tgtcggggactactggagtattttaaggccactgctgaagccaacaacttagcagctgca
gcctctgccaaggacatttattataacaacatggaagaggtttgtgggggagagaaacct
tatttgtctccagacattctagaggagaagcactgtgaattcaaacaacttgctctggac
cattttaagaagaccaagaagatgggtgggaaggatttcagctttcgttaccagcaggag
ctggaggaggaaatcaaggaattatatgagaacttctgcaagcacaatggtagcaagaac
gtcttcagcaccttccgaacccctgcagtgctgttcacgggcattgtagctttgtacata
gcctcaggcctcactggcttcataggtcttgaggttgtagcccagttgttcaactgtatg
gttggactactgttaatagcactcctcacctggggctacatcaggtattctggtcaatat
cgtgagctgggcggagctattgattttggtgccgcatatgtgttggagcagataggtttc
ctccatgccccaggggaggacaggcaccagtggttccaggtttatgtcatcccactttaa
>gi568815587r:63474948_63698163|GENSCAN_predicted_peptide_9|127_aa
MYHNVVEGYNPKDPSGIVRAVDMEDKVTWLQDGSLRFLTDKARENIITGKRRGSAGPKLR
TPPTGTTGPWAAQGQLQPRGGAGGAPGGTRARALTAAAPGRKVRKVPPSLDQLGPKRGPP
TTNSETE
>gi568815587r:63474948_63698163|GENSCAN_predicted_CDS_9|384_bp
atgtaccataatgtggttgaaggctacaacccaaaggatccaagtggcattgtcagagca
gtagacatggaagataaagttacgtggttacaagatgggtctttaaggtttctaactgac
aaagccagagaaaacatcatcacaggcaagaggcggggctcggcggggccgaagctccgg
acgccgcccacggggaccaccggcccttgggccgcccaggggcagctgcagcccagggga
ggagctggaggagcccccggcggcaccagggcccgagccctgacggcggccgcccccggc
cgaaaagtaaggaaggtgccgccctcgctggaccagctgggcccgaagaggggccccccc
accacaaattcggaaaccgagtga
>gi568815587r:63474948_63698163|GENSCAN_predicted_peptide_10|118_aa
MAFLHVGQAGLELLTSGLYQLLCGKWTEERQRRKQGDQIGGIYNNPRELAWTRVVAMEID
RIGQTLDIFLKGGGSPRALPAQEELQLFVPRAGQAPGLPPPPGADGSAPKDDEEMEWD
>gi568815587r:63474948_63698163|GENSCAN_predicted_CDS_10|357_bp
atggcgtttctccatgttggtcaggctggtctcgaactcctgacctcaggcttataccag
ctgctgtgtggaaaatggactgaagagaggcaaaggagaaagcagggagaccagataggc
ggcatctacaataacccaagagagctagcctggaccagggtggtagcaatggagattgac
agaattggtcagactttggatatctttttgaagggcgggggctccccgcgcgccttaccc
gcacaggaggagctgcagctcttcgtccccagggcggggcaggctcctgggctcccgccg
ccgccgggcgcggacggctcggctccgaaggacgacgaggagatggaatgggactga