GENSCAN 1.0 Date run: 3-Nov-116 Time: 19:53:53

Sequence gi568815587r:63452967_63660194 : 207228 bp : 44.22% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 PlyA -   1952   1947    6                               1.05
 1.04 Term -  10629  10537   93  0  0   95   49    71 0.732   1.73
 1.03 Intr -  13406  13144  263  0  2  114   64   352 0.849  32.61
 1.02 Intr -  15499  15391  109  1  1   95   71   128 0.999  11.66
 1.01 Init -  21473  21465    9  2  0   57  106     0 0.306  -0.82
 1.00 Prom -  24515  24476   40                              -6.06

 2.00 Prom +  27518  27557   40                              -4.66
 2.01 Init +  28742  28874  133  1  1   78   47    67 0.500   1.90
 2.02 Term +  31596  31936  341  1  2   27   36   303 0.568  13.70
 2.03 PlyA +  32501  32506    6                               1.05

 3.03 PlyA -  34100  34095    6                               1.05
 3.02 Term -  35300  35290   11  0  2   94   40    10 0.050  -4.84
 3.01 Init -  38068  37921  148  1  1   85   76   120 0.893  10.94
 3.00 Prom -  44465  44426   40                              -3.56

 4.00 Prom +  49048  49087   40                              -2.86
 4.01 Init +  53493  53561   69  2  0   44  117    61 0.926   5.65
 4.02 Intr +  55587  55675   89  2  2   72  110    91 0.995   8.47
 4.03 Intr +  55812  56025  214  0  1   24   67   241 0.952  14.32
 4.04 Intr +  56812  56931  120  0  0  113  101   175 0.999  21.99
 4.05 Intr +  57497  57535   39  1  0   93   84    40 0.514   2.52
 4.06 Intr +  58113  58139   27  2  0  102  105    10 0.502   2.41
 4.07 Term +  63281  63427  147  1  0  104   55   197 0.813  15.90
 4.08 PlyA +  63784  63789    6                               1.05

 5.00 Prom +  65373  65412   40                              -6.16
 5.01 Init +  72744  73029  286  2  1   86   22   173 0.373   7.75
 5.02 Term +  75974  75987   14  0  2  148   42    11 0.346   0.86
 5.03 PlyA +  76400  76405    6                               1.05

 6.00 Prom +  82205  82244   40                              -4.66
 6.01 Init +  83903  83911    9  1  0   78  123     5 0.570   3.29
 6.02 Intr +  86550  86658  109  2  1   89   80   119 0.994  11.06
 6.03 Intr +  91634  91923  290  0  2   80  110   270 0.615  25.36
 6.04 Term +  93183  93290  108  2  0   88   48    20 0.322  -3.49
 6.05 PlyA +  93469  93474    6                               1.05

 7.06 PlyA -  95600  95595    6                               1.05
 7.05 Term - 100099  99998  102  1  0  130   35   107 0.963   7.88
 7.04 Intr - 105694 105426  269  2  2   75   57   329 0.992  25.75
 7.03 Intr - 107227 107119  109  1  1  131   80   107 0.911  14.06
 7.02 Intr - 109721 109650   72  2  0   74  119    50 0.987   6.30
 7.01 Init - 110358 110350    9  0  0   84  111    11 0.744   2.87
 7.00 Prom - 120217 120178   40                              -3.06

 8.05 PlyA - 120354 120349    6                               1.05
 8.04 Term - 122080 121979  102  1  0  140   36    84 0.897   6.68
 8.03 Intr - 137402 137134  269  0  2   73   96   468 0.705  43.35
 8.02 Intr - 145197 145095  103  0  1  102   71    65 0.754   5.95
 8.01 Init - 154323 154255   69  0  0   78   75   104 0.536   7.16
 8.00 Prom - 164149 164110   40                              -1.56

 9.11 PlyA - 164626 164621    6                               1.05
 9.10 Term - 169701 169633   69  0  0  109   35    22 0.158  -3.06
 9.09 Intr - 178505 178074  432  2  0  117  111   391 0.704  38.14
 9.08 Intr - 180131 180060   72  2  0   60   94    57 0.890   3.10
 9.07 Intr - 183368 183241  128  0  2   79   94    33 0.925   3.40
 9.06 Intr - 190529 190391  139  2  1   81   93    96 0.992   9.44
 9.05 Intr - 191295 191203   93  0  0   80  111    17 0.879   3.36
 9.04 Intr - 199020 198970   51  0  0   67   76    43 0.494   0.10
 9.03 Intr - 199609 199505  105  1  0  109   98    54 0.991   8.91
 9.02 Intr - 205938 205795  144  0  0   51  116    33 0.857   2.88
 9.01 Init - 206278 206072  207  1  0   85  110   210 0.517  21.72

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:63452967_63660194|GENSCAN_predicted_peptide_1|157_aa
MNWGKPRPRPGDLIEIFRIGYEHWAIYVEDDCVVHLAPPSEEFEVGSITSIFSNRAVVKY
SRLEDVLHGCSWKVNNKLDGTYLPLPVDKIIQRTKKMVNKIVQYSLIEGNCEHFVNGLRY
GVPRSQQVEHALMEGAKAAGAVISAVVDSIKPKPITA

>gi568815587r:63452967_63660194|GENSCAN_predicted_CDS_1|474_bp
atgaattggggaaaaccaagacccagacctggagacctgattgagatttttcgaattggc
tatgagcactgggccatctatgtagaagatgattgcgtggtccatctggctcccccaagt
gaggagtttgaggtgggcagcattacttccatctttagcaatcgggccgtggtgaaatac
agtcgtctggaggatgtgctgcatggctgctcctggaaggtcaataacaagctagatggg
acgtacctgcccttgccggtggacaagatcatccagcgtacaaaaaagatggtcaacaag
atcgtgcagtacagcctgattgaagggaactgtgagcactttgtcaatggcctcagatat
ggcgtaccccggagccagcaggtagagcacgccctgatggaaggagcgaaggctgctgga
gcagttatttcagctgtagtggatagcataaagcccaaaccaataactgcctga

>gi568815587r:63452967_63660194|GENSCAN_predicted_peptide_2|157_aa
MEYYAAIKNDEFMSFVGTWMKLETIILSKLSQGQKTKHRMFSLIDTEIAFDKIQHPFMIK
NLNKISIKGTNLKAIKAIYDKPTANIIVIGEKLKAFPQITGKRRGCPLSPHLFDIVLEVL
ARAIRQEKEIKGIQISKKEVKLLLFADMIIDLENPKA

>gi568815587r:63452967_63660194|GENSCAN_predicted_CDS_2|474_bp
atggaatactatgcagccataaaaaatgatgagttcatgtcctttgtagggacatggatg
aagctggaaaccatcattctcagcaaactatcgcaaggacaaaaaaccaaacaccgcatg
ttctcactcatagacacagaaatagcatttgacaagatccagcatccctttatgattaaa
aacctcaacaaaatcagcataaaagggacaaacctcaaggcaataaaagccatctacgac
aaacccacagccaacattatagtgattggggaaaagttgaaagcattcccccagataact
ggaaaaagacgaggatgcccactttcaccacatttattcgacatagtgctagaagtccta
gccagagcaatcagacaagagaaagaaataaagggcatccaaatcagtaaaaaggaagtc
aaactgttgctgtttgctgatatgatcatagacctggaaaaccctaaagcctaa

>gi568815587r:63452967_63660194|GENSCAN_predicted_peptide_3|52_aa
MGLSPGAEGEYALRLPRIPPPLPKPASRTASTGPKDQPPALRRSAVPHSGNN

>gi568815587r:63452967_63660194|GENSCAN_predicted_CDS_3|159_bp
atgggcctgagcccgggcgccgagggggagtacgcgctccgcctccctaggattccccca
cccctccccaaacccgcctcgcgaaccgccagtaccgggcccaaggaccagccgcctgcg
ctcagacgttcagctgtgccccactcaggcaacaactga

>gi568815587r:63452967_63660194|GENSCAN_predicted_peptide_4|234_aa
MSPGEKLDPIPDSFILQPPVFHPVVPYVTTIFGGLHAGKMVMLQGVVPLDAHRFQVDFQC
GCSLCPRPDIAFHFNPRFHTTKPHVICNTLHGGRWQREARWPHLALRRGSSFLILFLFGN
EEVKVSVNGQHFLHFRYRLPLSHVDTLGIFGDILVEAVGFLNINPFVEGSREYPAGHPFL
LMSPRLVLLLFQEGGLKLALNGQGLGATSMNQQALEQLRELRISGSVQLYCVHS

>gi568815587r:63452967_63660194|GENSCAN_predicted_CDS_4|705_bp
atgtcacctggagaaaaactggacccaattcctgacagcttcattctgcaaccaccagtc
ttccacccggtggttccttatgtcacgacgatttttggaggcctgcatgcaggcaagatg
gtcatgctgcaaggagtggtccctctagatgcacacaggtttcaggtggacttccagtgt
ggctgcagcctgtgtccccggccagatatcgccttccacttcaaccctcgcttccatacc
accaagccccatgtcatctgcaacaccctgcatggtggacgctggcaaagggaggcccgg
tggccccacctggccctgcgaagaggctccagcttcctcatcctctttctcttcgggaat
gaggaagtgaaggtgagtgtgaatggacagcactttctccacttccgctaccggctccca
ctgtctcatgtggacacgctgggtatatttggtgacatcctggtagaggctgttggattc
ctgaacatcaatccatttgtggagggcagcagagagtacccagctggacatcctttcctg
ctgatgagccccaggctggtgctgctcctgttccaggagggagggctgaagctggcgctc
aatgggcaggggctgggggccaccagcatgaaccagcaggccctggagcagctgcgggag
ctccggatcagtggaagtgtccagctctactgtgtccactcctga

>gi568815587r:63452967_63660194|GENSCAN_predicted_peptide_5|99_aa
MAIRPKVIYRFNAIPIMLPMTFFTELEKTTLKFIRNQKGAHIAKSILSQKNKAGGIMLPD
FKLYYKTTVTKTAWYWYQNRDIDQWNRTEPSEIMPRKCK

>gi568815587r:63452967_63660194|GENSCAN_predicted_CDS_5|300_bp
atggccatacggcccaaggtaatttatagattcaatgccatccccatcatgctaccaatg
actttcttcacagaattggaaaaaactactttaaagttcatacggaaccaaaaaggagcc
cacatcgccaagtcaatcctaagccaaaagaacaaagccggaggcatcatgctacctgac
ttcaaactatactacaagactacagtaaccaaaacagcatggtactggtaccaaaacaga
gatatagatcaatggaacagaacagagccctcagaaataatgccacgtaagtgtaagtaa

>gi568815587r:63452967_63660194|GENSCAN_predicted_peptide_6|171_aa
MASPHQEPKPGDLIEIFRLGYEHWALYIGDGYVIHLAPPMRVPLIAGEYPGAGSSSVFSV
LSNSAEVKRERLEDVVGGCCYRVNNSLDHEYQPRPVEVIISSAKEMVGQKMKYSIVSRNC
EHFVTQLRYGKSRCKQVEKAKVEVGVATALGILVVAGCSFAIRRYQKKATA

>gi568815587r:63452967_63660194|GENSCAN_predicted_CDS_6|516_bp
atggcttcgccacaccaagagcccaaacctggagacctgattgagattttccgccttggc
tatgagcactgggccctgtatataggagatggctacgtgatccatctggctcctccaatg
agagtgcctctgattgcaggtgagtaccccggggctggctcctccagtgtcttctcagtc
ctgagcaacagtgcagaggtgaaacgggagcgcctggaagatgtggtgggaggctgttgc
tatcgggtcaacaacagcttggaccatgagtaccaaccacggcccgtggaggtgatcatc
agttctgcgaaggagatggttggtcagaagatgaagtacagtattgtgagcaggaactgt
gagcactttgtcacccagctgagatatggcaagtcccgctgtaaacaggtggaaaaggcc
aaggttgaagtcggtgtggccacggcgcttggaatcctggttgttgctggatgctctttt
gcgattaggagataccaaaaaaaagcgacagcctga

>gi568815587r:63452967_63660194|GENSCAN_predicted_peptide_7|186_aa
MALVVNKQSYEVGIITANFMDRRNDILARPRPRLGDLIEISRFGYAHWAIYVGDGYVVHL
APASEIAGAGAASVLSALTNKAIVKKELLSVVAGGDNYRVNNKHDDRYTPLPSNKIVKRA
EELVGQELPYSLTSDNCEHFVNHLRYGVSRSDQVTGAVTTVGVAAGLLAAASLVGILLAR
SKRERQ

>gi568815587r:63452967_63660194|GENSCAN_predicted_CDS_7|561_bp
atggctttggtagttaataagcaatcctatgaagtaggcataattactgccaacttcatg
gataggaggaatgatatactggccagaccaagaccgagacttggagacctgattgagatt
tctcgctttggctatgcacactgggccatctacgtgggagatggctatgtggtccatctg
gctccggcaagtgaaattgctggagctggtgcggccagtgtcctgtctgccctgaccaac
aaagccatagtgaagaaggaactgctgtctgtggtggctgggggagacaactacagggtc
aataacaagcacgatgacagatacacaccactgccttccaacaaaatcgtcaagcgggca
gaggagttggtggggcaggagttgccttattcgctgaccagtgacaactgcgagcacttc
gtgaaccatctgcgctatggcgtctcccgcagtgaccaggtcactggtgcagtcacgaca
gtaggtgtggcagcaggcctgctggctgccgcaagccttgtggggatcctgctggccaga
agcaagcgggaaaggcaataa

>gi568815587r:63452967_63660194|GENSCAN_predicted_peptide_8|180_aa
MTPSSILMCALCPRLSLGYPLPSPEPKPGDLIEIFRPFYRHWAIYVGDGYVVHLAPPSEV
AGAGAASVMSALTDKAIVKKELLYDVAGSDKYQVNNKHDDKYSPLPCSKIIQRAEELVGQ
EVLYKLTSENCEHFVNELRYGVARSDQVRDVIIAASVAGMGLAAMSLIGVMFSRNKRQKQ

>gi568815587r:63452967_63660194|GENSCAN_predicted_CDS_8|543_bp
atgacccctagctccatactgatgtgtgcgctgtgtccccgcctctcgttgggatacccg
ctgcccagcccagagcctaagcctggagacctgattgagatttttcgccctttctacaga
cactgggccatctatgttggcgatggatatgtggttcatctggcccctccaagtgaggtc
gcaggagctggtgcagccagtgtcatgtccgccctgactgacaaggccatcgtgaagaag
gaattgctgtatgatgtggccgggagtgacaagtaccaggtcaacaacaaacatgatgac
aagtactcgccgctgccctgcagcaaaatcatccagcgggcggaggagctggtggggcag
gaggtgctctacaagctgaccagtgagaactgcgagcactttgtgaatgagctgcgctat
ggagtcgcccgcagtgaccaggtcagagatgtcatcatcgctgcaagcgttgcaggaatg
ggcttggcagccatgagccttattggagtcatgttctcaagaaacaagcgacaaaagcaa
taa

>gi568815587r:63452967_63660194|GENSCAN_predicted_peptide_9|479_aa
MESSKPGPVQVVLVQKDQHSFELDEKALASILLQDHIRDLDVVVVSVAGAFRKGKSFILD
FMLRYLYSQKESGHSNWLGDPEEPLTGFSWRGGSDPETTGIQIWSEVFTVEKPGGKKVAV
VLMDTQGAFDSQSTVKDCATIFALSTMTSSVQIYNLSQNIQEDDLQQLQTLMFLVRDWSF
PYEYSYGLQGGMAFLDKRLQVKEHQHEEIQNVRNHIHSCFSDVTCFLLPHPGLQVATSPD
FDGKLKDIAGEFKEQLQALIPYVLNPSKLMEKEINGSKVTCRGLLEYFKATAEANNLAAA
ASAKDIYYNNMEEVCGGEKPYLSPDILEEKHCEFKQLALDHFKKTKKMGGKDFSFRYQQE
LEEEIKELYENFCKHNGSKNVFSTFRTPAVLFTGIVALYIASGLTGFIGLEVVAQLFNCM
VGLLLIALLTWGYIRYSGQYRELGGAIDFGAAYVLEQIGFLHAPGEDRHQWFQVYVIPL

>gi568815587r:63452967_63660194|GENSCAN_predicted_CDS_9|1440_bp
atggagagcagcaagcctggtccagtgcaggttgttttggttcagaaagatcaacattcc
tttgagctagatgagaaagccttggccagcatcctcttgcaggaccacatccgagatctt
gatgtggtggtggtttcagtggctggtgccttccgaaagggcaagtccttcattctggat
tttatgctacgatacttatattctcagaaggaaagtggccattcaaattggttgggtgac
ccagaagaaccgttaacaggattttcctggagagggggatctgatccagaaaccactggg
attcaaatctggagtgaagttttcactgtggagaagccaggtgggaagaaggttgcagtt
gttctgatggatacccagggggcatttgacagccagtcaactgtgaaagactgtgctacc
atctttgctctaagcactatgactagttctgttcagatttataatttatctcagaacatt
caagaagatgatcttcaacagctgcagacactgatgtttttggttagagattggagtttc
ccttatgaatatagctatggactccaaggaggaatggcatttttggataagcgtttacag
gtgaaggaacatcaacatgaagaaattcagaatgttcgaaatcacattcactcatgtttc
tccgatgtcacctgctttctcttaccacatccaggactccaggtggccacaagccctgac
tttgatgggaaattaaaagatattgctggtgaattcaaagagcagttacaggcactgata
ccgtatgtattaaacccatctaagttaatggaaaaggagatcaatggctcaaaggtcacc
tgtcggggactactggagtattttaaggccactgctgaagccaacaacttagcagctgca
gcctctgccaaggacatttattataacaacatggaagaggtttgtgggggagagaaacct
tatttgtctccagacattctagaggagaagcactgtgaattcaaacaacttgctctggac
cattttaagaagaccaagaagatgggtgggaaggatttcagctttcgttaccagcaggag
ctggaggaggaaatcaaggaattatatgagaacttctgcaagcacaatggtagcaagaac
gtcttcagcaccttccgaacccctgcagtgctgttcacgggcattgtagctttgtacata
gcctcaggcctcactggcttcataggtcttgaggttgtagcccagttgttcaactgtatg
gttggactactgttaatagcactcctcacctggggctacatcaggtattctggtcaatat
cgtgagctgggcggagctattgattttggtgccgcatatgtgttggagcagataggtttc
ctccatgccccaggggaggacaggcaccagtggttccaggtttatgtcatcccactttaa