GENSCAN 1.0	Date run: 8-Nov-116	Time: 15:22:00

Sequence gi568815589r:94503384_94739310 : 235927 bp : 44.54% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -   3065   3060    6                               1.05
 1.03 Term -  11844  11795   50  1  2  132   43     6 0.070  -2.03
 1.02 Intr -  38953  38828  126  2  0   24  109    87 0.398   5.05
 1.01 Init -  39815  39776   40  2  1   68   91    53 0.410   3.95
 1.00 Prom -  41407  41368   40                              -4.86

 2.00 Prom +  48198  48237   40                              -2.56
 2.01 Init +  49306  49359   54  0  0   60   96    43 0.730   3.58
 2.02 Intr +  49729  49910  182  0  2   37   62   101 0.467   1.07
 2.03 Term +  50798  50978  181  2  1   50   36   133 0.346   1.38
 2.04 PlyA +  51994  51999    6                              -0.45

 3.08 PlyA -  53459  53454    6                               1.05
 3.07 Term -  55749  55555  195  0  0  117   44   190 0.996  14.91
 3.06 Intr -  60078  59959  120  0  0  112   86   147 0.961  17.59
 3.05 Intr -  64024  63869  156  1  0   70   94   232 0.626  22.21
 3.04 Intr -  68219  68079  141  2  0  103   94   224 0.997  25.05
 3.03 Intr -  81286  81194   93  1  0   67  113   125 0.997  13.06
 3.02 Intr -  84086  83924  163  1  1  120   73   209 0.984  22.58
 3.01 Init -  90343  90174  170  1  2  100   76   283 0.318  27.31
 3.00 Prom -  96895  96856   40                              -8.06

 4.11 PlyA -  99776  99771    6                               1.05
 4.10 Term - 100189  99998  192  1  0   61   53   336 0.933  24.82
 4.09 Intr - 102193 102074  120  1  0   93  116   142 0.999  18.19
 4.08 Intr - 103569 103432  138  0  0   76  100   264 0.376  27.06
 4.07 Intr - 106678 106538  141  1  0   67   97   264 0.997  25.75
 4.06 Intr - 114477 114385   93  0  0   81   93    67 0.989   6.66
 4.05 Intr - 117108 116946  163  2  1   98   59   255 0.998  23.58
 4.04 Intr - 118857 118825   33  2  0  106   84    49 0.906   3.64
 4.03 Intr - 136031 135758  274  0  1   69   76   412 0.345  34.70
 4.02 Intr - 138206 138012  195  0  0   83   18   127 0.365   4.59
 4.01 Init - 142516 142378  139  1  1   47   27   142 0.347   4.30
 4.00 Prom - 143227 143188   40                              -6.76

 5.00 Prom + 143363 143402   40                              -4.86
 5.01 Init + 150734 150927  194  1  2   60   53   144 0.693   6.54
 5.02 Intr + 153624 153769  146  0  2   35   90    59 0.475   0.73
 5.03 Intr + 154169 154287  119  0  2   68   76    55 0.627   2.48
 5.04 Term + 160735 160848  114  0  0   57   51    91 0.647   0.87
 5.05 PlyA + 162603 162608    6                               1.05

 6.03 PlyA - 163060 163055    6                               1.05
 6.02 Term - 165740 165634  107  0  2  122   40    97 0.936   6.87
 6.01 Init - 175072 175000   73  1  1   45   72    56 0.662   0.93
 6.00 Prom - 176181 176142   40                              -5.06

 7.05 PlyA - 176358 176353    6                               1.05
 7.04 Term - 178123 177981  143  2  2   43   43   195 0.959   8.59
 7.03 Intr - 189123 188997  127  0  1  111   63    16 0.064   1.65
 7.02 Intr - 210856 210671  186  1  0   53   57   115 0.042   4.79
 7.01 Intr - 233461 233355  107  2  2   55  115    33 0.227   2.63

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  87125  87287  163  1  1   60   44   182 0.851   8.90
S.002 Term +  87675  88165  491  1  2   56   55   245 0.915  12.82

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589r:94503384_94739310|GENSCAN_predicted_peptide_1|71_aa
MGSVCEALRQYSPASVMVYGTSEVIGQRQSLAAKPRRSQSESLGPEFQGLWEWLPGMALP
PKRQRYGNQKL

>gi568815589r:94503384_94739310|GENSCAN_predicted_CDS_1|216_bp
atggggtctgtctgtgaagccttgcggcagtacagcccagcgtccgtgatggtctacggg
acttccgaggtgattgggcagcgtcagtctttagccgctaagccaagaaggagtcagtca
gagagccttgggccagagttccaggggctctgggagtggctgccagggatggctctgcct
cctaaacgacagagatatggtaaccaaaaactctga

>gi568815589r:94503384_94739310|GENSCAN_predicted_peptide_2|138_aa
MKRNKKKKIQTGRKEKEQRRCQYNRCCPQCTSPPYKAPITWPPRVPSYHSGQAAVSVAAT
NLMRQAAEPRHLQAPAYHSAAPDSAPPVTGSAAPDHALKPPPTASIVAPDNSTQPAPATD
SAAEDNAPKPSTATGSTR

>gi568815589r:94503384_94739310|GENSCAN_predicted_CDS_2|417_bp
atgaagaggaacaagaagaagaaaatacagactggaagaaaggaaaaagaacagcgccga
tgccagtacaaccgctgttgccctcaatgcaccagcccaccctacaaggctcctatcact
tggcccccacgggtgccctcctaccactctggtcaagctgcagtctctgttgctgccacc
aacctcatgaggcaagctgcagagccacgccatctacaggctccagcataccacagtgca
gctcctgatagcgcacccccagtcacaggcagtgcagcaccggaccacgcccttaaacca
ccccccactgccagcattgtagccccagataactccacccaacccgcccctgccacggac
agtgcagcagaagacaacgcccctaagccatccaccgccactggcagtacacgttag

>gi568815589r:94503384_94739310|GENSCAN_predicted_peptide_3|345_aa
MTDRSPFETDMLTLTRYVMEKGRQAKGTGELTQLLNSMLTAIKAISSAVRKAGLAHLYGI
AGSVNVTGDEVKKLDVLSNSLVINMVQSSYSTCVLVSEENKDAIITAKEKRGKYVVCFDP
LDGSSNIDCLASIGTIFAIYRKTSEDEPSEKDALQCGRNIVAAGYALYGSATLVALSTGQ
GVDLFMLDPALGEFVLVEKDVKIKKKGKIYSLNEGYAKYFDAATTEYVQKKKFPEVSEES
QDGSAPYGARYVGSMVADVHRTLVYGGIFLYPANQKSPKGKLRLLYECNPVAYIIEQAGG
LATTGTQPVLDVKPEAIHQRVPLILGSPEDVQEYLTCVQKNQAGS

>gi568815589r:94503384_94739310|GENSCAN_predicted_CDS_3|1038_bp
atgacggacagaagccccttcgaaaccgacatgctcaccctgacccgctacgttatggaa
aaggggcgtcaggccaaagggactggggagctcacccagctgctgaactcaatgctgacg
gccatcaaagccatctcctcggctgtgcgcaaggccggtctggcccacctgtatggaatc
gcaggaagcgttaacgtgacgggagatgaggtgaagaaactggatgtgctatccaattcc
ctggtgatcaacatggtccaatcctcctatagtacctgcgtcctggtctcagaagagaat
aaggacgccatcatcaccgccaaggagaagcgggggaaatacgtggtctgctttgaccca
ctggatggatcttccaatattgactgcctggcctccatcggaaccatctttgccatctat
agaaagacctcagaggatgagccttctgaaaaggatgccctgcagtgtggccgcaatatt
gtggccgcaggttatgcgctgtacggtagtgcaaccctggtggctctctccacagggcaa
ggcgtggacctcttcatgcttgacccggctcttggtgaatttgtcctggtggaaaaagat
gtcaagattaagaagaaaggaaagatttacagcctgaatgagggctatgccaagtatttt
gatgcggccaccactgaatatgtgcagaaaaagaaattccctgaggtgagtgaagaaagc
caggatggcagtgctccctatggggccaggtatgtgggctccatggtggctgacgtgcac
cgcaccctggtctatggaggaatcttcctgtacccagccaaccagaagagccctaagggc
aagctccggctcctgtatgaatgcaatcccgtggcctacatcattgagcaggcaggaggc
ttggcgaccacggggacccagcctgtactggacgtgaagcccgaggcaattcaccagcga
gtccccctcattctggggtcaccagaggatgtgcaggaatatctcacctgtgtgcagaaa
aatcaggcaggcagctag

>gi568815589r:94503384_94739310|GENSCAN_predicted_peptide_4|495_aa
MGGIVEIEHGVIRSLKGPPGPEQEGKNFKILPPEEQLEELKIFNWRRFFLKSCLRNKELN
KTLKGSTLFARHLSPLWGFEETVAVFPGGTTTFLHSTPPHLTTLGTGICTPAPHLRPVPT
ALSCRPHLQPRTCRLHLQPRALPGSSMADQAPFDTDVNTLTRFVMEEGRKARGTGELTQL
LNSLCTAVKAISSAVRKAGIAHLATPGYDNLPLLYGIAGSTNVTGDQVKKLDVLSNDLVM
NMLKSSFATCVLVSEEDKHAIIVEPEKRGKYVVCFDPLDGSSNIDCLVSVGTIFGIYRKK
STDEPSEKDALQPGRNLVAAGYALYGSATMLVLAMDCGVNCFMLDPAIGEFILVDKDVKI
KKKGKIYSLNEGYARDFDPAVTEYIQRKKFPPDNSAPYGARYVGSMVADVHRTLVYGGIF
LYPANKKSPNGKLRLLYECNPMAYVMEKAGGMATTGKEAVLDVIPTDIHQRAPVILGSPD
DVLEFLKVYEKHSAQ

>gi568815589r:94503384_94739310|GENSCAN_predicted_CDS_4|1488_bp
atgggtggaatcgtggaaatagagcatggggttataaggagcctcaaggggccaccaggt
ccagaacaggaagggaagaacttcaaaatcttgccacctgaggagcagctggaggagctg
aaaatatttaactggagaaggtttttcctgaaaagttgcctgaggaacaaagagttaaat
aagacccttaaaggatcaactctctttgccaggcatctaagtcccctctggggctttgag
gagacagtcgctgtctttcctggtggtaccaccaccttccttcattcaacgccgccgcat
ctgacaaccctgggcacaggaatctgcaccccagctccgcacctgcggccagtgcctact
gccctctcttgccgcccgcacctgcagccccgcacctgccgcttgcacctgcagccccgc
gctctacccggttcaagcatggctgaccaggcgcccttcgacacggacgtcaacaccctg
acccgcttcgtcatggaggagggcaggaaggcccgcggcacgggcgagttgacccagctg
ctcaactcgctctgcacagcagtcaaagccatctcttcggcggtgcgcaaggcgggcatc
gcgcacctggcaaccccaggctatgacaacctgccgcttctctatggcattgctggttct
accaacgtgacaggtgatcaagttaagaagctggacgtcctctccaacgacctggttatg
aacatgttaaagtcatcctttgccacgtgtgttctcgtgtcagaagaagataaacacgcc
atcatagtggaaccggagaaaaggggtaaatatgtggtctgttttgatccccttgatgga
tcttccaacatcgattgccttgtgtccgttggaaccatttttggcatctatagaaagaaa
tcaactgatgagccttctgagaaggatgctctgcaaccaggccggaacctggtggcagcc
ggctacgcactgtatggcagtgccaccatgctggtccttgccatggactgtggggtcaac
tgcttcatgctggacccggccatcggggagttcattttggtggacaaggatgtgaagata
aaaaagaaaggtaaaatctacagccttaacgagggctacgccagggactttgaccctgcc
gtcactgagtacatccagaggaagaagttccccccagataattcagctccttatggggcc
cggtatgtgggctccatggtggctgatgttcatcgcactctggtctacggagggatattt
ctgtaccccgctaacaagaagagccccaatggaaagctgagactgctgtacgaatgcaac
cccatggcctacgtcatggagaaggctgggggaatggccaccactgggaaggaggccgtg
ttagacgtcattcccacagacattcaccagagggcgccggtgatcttgggatcccccgac
gacgtgctcgagttcctgaaggtgtatgagaagcactctgcccagtga

>gi568815589r:94503384_94739310|GENSCAN_predicted_peptide_5|190_aa
MENGPGGANNQHKSQSTVLRAGAPLQGPSRAISIVLQEDCCPCTAFLEQKSEKSEFFSMR
VLTSGFSKHAESQANPQRSHANLVWGPSTPGCLVECLIDVRQGAPHESPNSSEAAVGESQ
EVRNPKKIFLFQNSFIKVQLTDKKLHTLKIHTLSSVAIVVWKQPSTEHKRIRTAWQSQSV
DFIVEPYGPQ

>gi568815589r:94503384_94739310|GENSCAN_predicted_CDS_5|573_bp
atggaaaatggacctggaggtgcaaacaaccagcacaagtcccagagcacagtcctcaga
gctggagcccctctccaaggcccatcaagagccatctccatagtcctccaagaggactgc
tgtccatgtaccgccttcttggaacagaaatcagagaagtcagagttcttctccatgcga
gtactgacgagtggattctccaaacacgcagaaagccaagctaatcctcagcgctcccat
gccaatctggtgtggggaccctccactccaggctgccttgtagaatgccttattgatgtg
cggcaaggggcaccccatgaaagtcccaacagctctgaagctgctgtgggagagtcacag
gaagtgaggaatcctaagaagatcttcctttttcaaaacagcttcatcaaggtgcaatta
acagacaagaaactgcacacactgaaaatacacactttgtctagtgttgccattgtggtg
tggaagcagccatcgacagaacataaaagaatacggacggcatggcagtcacagtcagtt
gacttcattgtagagccttatgggccacaatga

>gi568815589r:94503384_94739310|GENSCAN_predicted_peptide_6|59_aa
MHHGQKQWPVVLTFPPGVSQCVIPVALIWGDFASLGMFGNNIWRHFGSHNWRAVDATGT

>gi568815589r:94503384_94739310|GENSCAN_predicted_CDS_6|180_bp
atgcaccacgggcagaagcaatggcctgttgtcctcacgtttcctcctggtgtttctcaa
tgcgtcattccagtggctctcatctggggtgactttgcctccctggggatgtttggcaat
aatatctggagacattttggttctcacaactggagggccgtggatgctactggcacctag

>gi568815589r:94503384_94739310|GENSCAN_predicted_peptide_7|187_aa
XWWAQILSTTVEIRYEESRCKKDLNSMKCFAHCGPKNFWVTGGISYGDMRMLADGEDSDK
SRQLDSITASFPLPHPWKNLKSADYQMGPQVKCLHTPWDQGSELPLPSEQKVSHQLYLRP
MPWGVEPPLSVESPPEPSALADDGVDDGDGDGDEKEEEDGGGGDEKQDSGVYSGSHSFES
TSHKEYS

>gi568815589r:94503384_94739310|GENSCAN_predicted_CDS_7|564_bp
nagtggtgggcccagattttgagtaccacagttgaaataaggtatgaggagagcagatgc
aaaaaggaccttaattccatgaagtgctttgcgcactgtgggcccaagaatttctgggta
actggtggcatctcctacggcgatatgagaatgctggctgatggagaggacagcgataaa
tctcggcagcttgacagtatcactgcctccttccctctcccacatccctggaaaaattta
aagtctgcagattaccagatgggaccccaagttaaatgcctgcatactccctgggaccaa
ggatccgaactccctttgccctcagaacagaaggttagccaccaactctacctccgccct
atgccctggggagtggagcctccattgtcggtggaatctcccccagagccttctgctcta
gctgatgatggtgttgatgatggtgatggtgatggtgatgagaaggaggaggaggatggt
ggcggtggtgatgaaaaacaggattctggagtttacagtggttctcactcttttgaaagc
acatcccataaagagtacagctga