GENSCAN 1.0 Date run: 7-Nov-116 Time: 02:03:20 Sequence gi568815587f:63439515_63646236 : 206722 bp : 44.09% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 26 805 780 1 0 60 32 209 0.954 8.50 1.02 PlyA + 1796 1801 6 1.05 2.05 PlyA - 2532 2527 6 1.05 2.04 Term - 24081 23989 93 0 0 95 49 71 0.717 1.73 2.03 Intr - 26858 26596 263 0 2 114 64 352 0.833 32.61 2.02 Intr - 28951 28843 109 1 1 95 71 128 0.999 11.66 2.01 Init - 34925 34917 9 2 0 57 106 0 0.306 -0.82 2.00 Prom - 37967 37928 40 -6.06 3.00 Prom + 40970 41009 40 -4.66 3.01 Init + 42194 42326 133 1 1 78 47 67 0.500 1.90 3.02 Term + 45048 45388 341 1 2 27 36 303 0.568 13.70 3.03 PlyA + 45953 45958 6 1.05 4.03 PlyA - 47552 47547 6 1.05 4.02 Term - 48752 48742 11 0 2 94 40 10 0.050 -4.84 4.01 Init - 51520 51373 148 1 1 85 76 120 0.893 10.94 4.00 Prom - 57917 57878 40 -3.56 5.00 Prom + 62500 62539 40 -2.86 5.01 Init + 66945 67013 69 2 0 44 117 61 0.926 5.65 5.02 Intr + 69039 69127 89 2 2 72 110 91 0.995 8.47 5.03 Intr + 69264 69477 214 0 1 24 67 241 0.952 14.32 5.04 Intr + 70264 70383 120 0 0 113 101 175 0.999 21.99 5.05 Intr + 70949 70987 39 1 0 93 84 40 0.514 2.52 5.06 Intr + 71565 71591 27 2 0 102 105 10 0.502 2.41 5.07 Term + 76733 76879 147 1 0 104 55 197 0.813 15.90 5.08 PlyA + 77236 77241 6 1.05 6.00 Prom + 78825 78864 40 -6.16 6.01 Init + 86196 86481 286 2 1 86 22 173 0.373 7.75 6.02 Term + 89426 89439 14 0 2 148 42 11 0.346 0.86 6.03 PlyA + 89852 89857 6 1.05 7.00 Prom + 95657 95696 40 -4.66 7.01 Init + 97355 97363 9 1 0 78 123 5 0.570 3.29 7.02 Intr + 100002 100110 109 2 1 89 80 119 0.994 11.06 7.03 Intr + 105086 105375 290 0 2 80 110 270 0.615 25.36 7.04 Term + 106635 106742 108 2 0 88 48 20 0.322 -3.49 7.05 PlyA + 106921 106926 6 1.05 8.06 PlyA - 109052 109047 6 1.05 8.05 Term - 113551 113450 102 1 0 130 35 107 0.963 7.88 8.04 Intr - 119146 118878 269 2 2 75 57 329 0.992 25.75 8.03 Intr - 120679 120571 109 1 1 131 80 107 0.911 14.06 8.02 Intr - 123173 123102 72 2 0 74 119 50 0.987 6.30 8.01 Init - 123810 123802 9 0 0 84 111 11 0.744 2.87 8.00 Prom - 133669 133630 40 -3.06 9.05 PlyA - 133806 133801 6 1.05 9.04 Term - 135532 135431 102 1 0 140 36 84 0.897 6.68 9.03 Intr - 150854 150586 269 0 2 73 96 468 0.705 43.35 9.02 Intr - 158649 158547 103 0 1 102 71 65 0.754 5.95 9.01 Init - 167775 167707 69 0 0 78 75 104 0.536 7.16 9.00 Prom - 177601 177562 40 -1.56 10.07 PlyA - 178078 178073 6 1.05 10.06 Term - 183153 183085 69 0 0 109 35 22 0.158 -3.06 10.05 Intr - 191957 191526 432 2 0 117 111 391 0.704 38.14 10.04 Intr - 193583 193512 72 2 0 60 94 57 0.890 3.10 10.03 Intr - 196820 196693 128 0 2 79 94 33 0.925 3.40 10.02 Intr - 203981 203843 139 2 1 81 93 96 0.991 9.44 10.01 Intr - 204747 204655 93 0 0 80 111 17 0.611 3.36 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:63439515_63646236|GENSCAN_predicted_peptide_1|259_aa MSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIV KMAILPKVIYRFNAIPIKLPMTFFTQLEKTTLKFIWNQKRAHIAKSILSQKNKAGGITLP DFKLYYKATVTKTAWHWYQNRDIDQWNRTEPSEITLHIYNYLIFDKPDKNKQWGKDSLFN KWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLHVRPKTIKTQEENLGNTTQDIGMGKD FMSKTPKQWQQKTKLTNGI >gi568815587f:63439515_63646236|GENSCAN_predicted_CDS_1|780_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaatcaaa gaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtg aaaatggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctacca atgactttcttcacacaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcccacattgccaagtcaatcctaagccaaaagaacaaagccggaggcatcacactacct gacttcaaactatactacaaggctacagtaaccaaaacagcatggcactggtaccaaaac agagatatagatcaatggaacagaacagagccctcagaaataacgctacatatctacaac tatctgatctttgacaaacctgacaaaaacaagcaatggggaaaggattccctatttaat aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaatcaattcaagatggattaaagacttacatgttagacctaaaacc ataaaaacccaagaagaaaacctaggcaataccactcaggacataggcatgggcaaggac ttcatgtctaaaacaccaaagcaatggcaacaaaagacaaaattgacaaatgggatctaa >gi568815587f:63439515_63646236|GENSCAN_predicted_peptide_2|157_aa MNWGKPRPRPGDLIEIFRIGYEHWAIYVEDDCVVHLAPPSEEFEVGSITSIFSNRAVVKY SRLEDVLHGCSWKVNNKLDGTYLPLPVDKIIQRTKKMVNKIVQYSLIEGNCEHFVNGLRY GVPRSQQVEHALMEGAKAAGAVISAVVDSIKPKPITA >gi568815587f:63439515_63646236|GENSCAN_predicted_CDS_2|474_bp atgaattggggaaaaccaagacccagacctggagacctgattgagatttttcgaattggc tatgagcactgggccatctatgtagaagatgattgcgtggtccatctggctcccccaagt gaggagtttgaggtgggcagcattacttccatctttagcaatcgggccgtggtgaaatac agtcgtctggaggatgtgctgcatggctgctcctggaaggtcaataacaagctagatggg acgtacctgcccttgccggtggacaagatcatccagcgtacaaaaaagatggtcaacaag atcgtgcagtacagcctgattgaagggaactgtgagcactttgtcaatggcctcagatat ggcgtaccccggagccagcaggtagagcacgccctgatggaaggagcgaaggctgctgga gcagttatttcagctgtagtggatagcataaagcccaaaccaataactgcctga >gi568815587f:63439515_63646236|GENSCAN_predicted_peptide_3|157_aa MEYYAAIKNDEFMSFVGTWMKLETIILSKLSQGQKTKHRMFSLIDTEIAFDKIQHPFMIK NLNKISIKGTNLKAIKAIYDKPTANIIVIGEKLKAFPQITGKRRGCPLSPHLFDIVLEVL ARAIRQEKEIKGIQISKKEVKLLLFADMIIDLENPKA >gi568815587f:63439515_63646236|GENSCAN_predicted_CDS_3|474_bp atggaatactatgcagccataaaaaatgatgagttcatgtcctttgtagggacatggatg aagctggaaaccatcattctcagcaaactatcgcaaggacaaaaaaccaaacaccgcatg ttctcactcatagacacagaaatagcatttgacaagatccagcatccctttatgattaaa aacctcaacaaaatcagcataaaagggacaaacctcaaggcaataaaagccatctacgac aaacccacagccaacattatagtgattggggaaaagttgaaagcattcccccagataact ggaaaaagacgaggatgcccactttcaccacatttattcgacatagtgctagaagtccta gccagagcaatcagacaagagaaagaaataaagggcatccaaatcagtaaaaaggaagtc aaactgttgctgtttgctgatatgatcatagacctggaaaaccctaaagcctaa >gi568815587f:63439515_63646236|GENSCAN_predicted_peptide_4|52_aa MGLSPGAEGEYALRLPRIPPPLPKPASRTASTGPKDQPPALRRSAVPHSGNN >gi568815587f:63439515_63646236|GENSCAN_predicted_CDS_4|159_bp atgggcctgagcccgggcgccgagggggagtacgcgctccgcctccctaggattccccca cccctccccaaacccgcctcgcgaaccgccagtaccgggcccaaggaccagccgcctgcg ctcagacgttcagctgtgccccactcaggcaacaactga >gi568815587f:63439515_63646236|GENSCAN_predicted_peptide_5|234_aa MSPGEKLDPIPDSFILQPPVFHPVVPYVTTIFGGLHAGKMVMLQGVVPLDAHRFQVDFQC GCSLCPRPDIAFHFNPRFHTTKPHVICNTLHGGRWQREARWPHLALRRGSSFLILFLFGN EEVKVSVNGQHFLHFRYRLPLSHVDTLGIFGDILVEAVGFLNINPFVEGSREYPAGHPFL LMSPRLVLLLFQEGGLKLALNGQGLGATSMNQQALEQLRELRISGSVQLYCVHS >gi568815587f:63439515_63646236|GENSCAN_predicted_CDS_5|705_bp atgtcacctggagaaaaactggacccaattcctgacagcttcattctgcaaccaccagtc ttccacccggtggttccttatgtcacgacgatttttggaggcctgcatgcaggcaagatg gtcatgctgcaaggagtggtccctctagatgcacacaggtttcaggtggacttccagtgt ggctgcagcctgtgtccccggccagatatcgccttccacttcaaccctcgcttccatacc accaagccccatgtcatctgcaacaccctgcatggtggacgctggcaaagggaggcccgg tggccccacctggccctgcgaagaggctccagcttcctcatcctctttctcttcgggaat gaggaagtgaaggtgagtgtgaatggacagcactttctccacttccgctaccggctccca ctgtctcatgtggacacgctgggtatatttggtgacatcctggtagaggctgttggattc ctgaacatcaatccatttgtggagggcagcagagagtacccagctggacatcctttcctg ctgatgagccccaggctggtgctgctcctgttccaggagggagggctgaagctggcgctc aatgggcaggggctgggggccaccagcatgaaccagcaggccctggagcagctgcgggag ctccggatcagtggaagtgtccagctctactgtgtccactcctga >gi568815587f:63439515_63646236|GENSCAN_predicted_peptide_6|99_aa MAIRPKVIYRFNAIPIMLPMTFFTELEKTTLKFIRNQKGAHIAKSILSQKNKAGGIMLPD FKLYYKTTVTKTAWYWYQNRDIDQWNRTEPSEIMPRKCK >gi568815587f:63439515_63646236|GENSCAN_predicted_CDS_6|300_bp atggccatacggcccaaggtaatttatagattcaatgccatccccatcatgctaccaatg actttcttcacagaattggaaaaaactactttaaagttcatacggaaccaaaaaggagcc cacatcgccaagtcaatcctaagccaaaagaacaaagccggaggcatcatgctacctgac ttcaaactatactacaagactacagtaaccaaaacagcatggtactggtaccaaaacaga gatatagatcaatggaacagaacagagccctcagaaataatgccacgtaagtgtaagtaa >gi568815587f:63439515_63646236|GENSCAN_predicted_peptide_7|171_aa MASPHQEPKPGDLIEIFRLGYEHWALYIGDGYVIHLAPPMRVPLIAGEYPGAGSSSVFSV LSNSAEVKRERLEDVVGGCCYRVNNSLDHEYQPRPVEVIISSAKEMVGQKMKYSIVSRNC EHFVTQLRYGKSRCKQVEKAKVEVGVATALGILVVAGCSFAIRRYQKKATA >gi568815587f:63439515_63646236|GENSCAN_predicted_CDS_7|516_bp atggcttcgccacaccaagagcccaaacctggagacctgattgagattttccgccttggc tatgagcactgggccctgtatataggagatggctacgtgatccatctggctcctccaatg agagtgcctctgattgcaggtgagtaccccggggctggctcctccagtgtcttctcagtc ctgagcaacagtgcagaggtgaaacgggagcgcctggaagatgtggtgggaggctgttgc tatcgggtcaacaacagcttggaccatgagtaccaaccacggcccgtggaggtgatcatc agttctgcgaaggagatggttggtcagaagatgaagtacagtattgtgagcaggaactgt gagcactttgtcacccagctgagatatggcaagtcccgctgtaaacaggtggaaaaggcc aaggttgaagtcggtgtggccacggcgcttggaatcctggttgttgctggatgctctttt gcgattaggagataccaaaaaaaagcgacagcctga >gi568815587f:63439515_63646236|GENSCAN_predicted_peptide_8|186_aa MALVVNKQSYEVGIITANFMDRRNDILARPRPRLGDLIEISRFGYAHWAIYVGDGYVVHL APASEIAGAGAASVLSALTNKAIVKKELLSVVAGGDNYRVNNKHDDRYTPLPSNKIVKRA EELVGQELPYSLTSDNCEHFVNHLRYGVSRSDQVTGAVTTVGVAAGLLAAASLVGILLAR SKRERQ >gi568815587f:63439515_63646236|GENSCAN_predicted_CDS_8|561_bp atggctttggtagttaataagcaatcctatgaagtaggcataattactgccaacttcatg gataggaggaatgatatactggccagaccaagaccgagacttggagacctgattgagatt tctcgctttggctatgcacactgggccatctacgtgggagatggctatgtggtccatctg gctccggcaagtgaaattgctggagctggtgcggccagtgtcctgtctgccctgaccaac aaagccatagtgaagaaggaactgctgtctgtggtggctgggggagacaactacagggtc aataacaagcacgatgacagatacacaccactgccttccaacaaaatcgtcaagcgggca gaggagttggtggggcaggagttgccttattcgctgaccagtgacaactgcgagcacttc gtgaaccatctgcgctatggcgtctcccgcagtgaccaggtcactggtgcagtcacgaca gtaggtgtggcagcaggcctgctggctgccgcaagccttgtggggatcctgctggccaga agcaagcgggaaaggcaataa >gi568815587f:63439515_63646236|GENSCAN_predicted_peptide_9|180_aa MTPSSILMCALCPRLSLGYPLPSPEPKPGDLIEIFRPFYRHWAIYVGDGYVVHLAPPSEV AGAGAASVMSALTDKAIVKKELLYDVAGSDKYQVNNKHDDKYSPLPCSKIIQRAEELVGQ EVLYKLTSENCEHFVNELRYGVARSDQVRDVIIAASVAGMGLAAMSLIGVMFSRNKRQKQ >gi568815587f:63439515_63646236|GENSCAN_predicted_CDS_9|543_bp atgacccctagctccatactgatgtgtgcgctgtgtccccgcctctcgttgggatacccg ctgcccagcccagagcctaagcctggagacctgattgagatttttcgccctttctacaga cactgggccatctatgttggcgatggatatgtggttcatctggcccctccaagtgaggtc gcaggagctggtgcagccagtgtcatgtccgccctgactgacaaggccatcgtgaagaag gaattgctgtatgatgtggccgggagtgacaagtaccaggtcaacaacaaacatgatgac aagtactcgccgctgccctgcagcaaaatcatccagcgggcggaggagctggtggggcag gaggtgctctacaagctgaccagtgagaactgcgagcactttgtgaatgagctgcgctat ggagtcgcccgcagtgaccaggtcagagatgtcatcatcgctgcaagcgttgcaggaatg ggcttggcagccatgagccttattggagtcatgttctcaagaaacaagcgacaaaagcaa taa >gi568815587f:63439515_63646236|GENSCAN_predicted_peptide_10|310_aa TLMFLVRDWSFPYEYSYGLQGGMAFLDKRLQVKEHQHEEIQNVRNHIHSCFSDVTCFLLP HPGLQVATSPDFDGKLKDIAGEFKEQLQALIPYVLNPSKLMEKEINGSKVTCRGLLEYFK ATAEANNLAAAASAKDIYYNNMEEVCGGEKPYLSPDILEEKHCEFKQLALDHFKKTKKMG GKDFSFRYQQELEEEIKELYENFCKHNGSKNVFSTFRTPAVLFTGIVALYIASGLTGFIG LEVVAQLFNCMVGLLLIALLTWGYIRYSGQYRELGGAIDFGAAYVLEQIGFLHAPGEDRH QWFQVYVIPL >gi568815587f:63439515_63646236|GENSCAN_predicted_CDS_10|933_bp acactgatgtttttggttagagattggagtttcccttatgaatatagctatggactccaa ggaggaatggcatttttggataagcgtttacaggtgaaggaacatcaacatgaagaaatt cagaatgttcgaaatcacattcactcatgtttctccgatgtcacctgctttctcttacca catccaggactccaggtggccacaagccctgactttgatgggaaattaaaagatattgct ggtgaattcaaagagcagttacaggcactgataccgtatgtattaaacccatctaagtta atggaaaaggagatcaatggctcaaaggtcacctgtcggggactactggagtattttaag gccactgctgaagccaacaacttagcagctgcagcctctgccaaggacatttattataac aacatggaagaggtttgtgggggagagaaaccttatttgtctccagacattctagaggag aagcactgtgaattcaaacaacttgctctggaccattttaagaagaccaagaagatgggt gggaaggatttcagctttcgttaccagcaggagctggaggaggaaatcaaggaattatat gagaacttctgcaagcacaatggtagcaagaacgtcttcagcaccttccgaacccctgca gtgctgttcacgggcattgtagctttgtacatagcctcaggcctcactggcttcataggt cttgaggttgtagcccagttgttcaactgtatggttggactactgttaatagcactcctc acctggggctacatcaggtattctggtcaatatcgtgagctgggcggagctattgatttt ggtgccgcatatgtgttggagcagataggtttcctccatgccccaggggaggacaggcac cagtggttccaggtttatgtcatcccactttaa