GENSCAN 1.0 Date run: 3-Nov-116 Time: 02:29:55 Sequence gi568815592f:37254165_37481354 : 227190 bp : 43.56% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3754 3809 56 0 2 82 83 35 0.008 3.16 1.02 Intr + 25140 25447 308 0 2 33 94 300 0.919 21.19 1.03 Intr + 28021 28200 180 2 0 57 95 64 0.915 3.84 1.04 Intr + 30185 30300 116 0 2 9 92 147 0.839 7.37 1.05 Intr + 37079 37193 115 1 1 108 86 88 0.972 10.62 1.06 Intr + 58754 58860 107 0 2 97 99 148 0.997 16.73 1.07 Intr + 59652 59727 76 2 1 99 78 161 0.963 15.29 1.08 Intr + 62539 62666 128 2 2 92 91 150 0.849 16.10 1.09 Intr + 62947 63042 96 0 0 90 109 79 0.989 10.41 1.10 Term + 68907 69050 144 2 0 7 38 146 0.742 -0.59 1.11 PlyA + 69485 69490 6 1.05 2.06 PlyA - 69938 69933 6 1.05 2.05 Term - 70155 70128 28 2 1 87 48 44 0.050 -2.15 2.04 Intr - 77229 76908 322 1 1 110 82 194 0.304 15.92 2.03 Intr - 81002 80884 119 1 2 49 65 46 0.078 -1.49 2.02 Intr - 89089 89035 55 2 1 129 81 28 0.163 4.24 2.01 Init - 91445 91313 133 2 1 82 63 68 0.669 4.00 2.00 Prom - 91569 91530 40 -2.46 3.00 Prom + 95301 95340 40 -5.46 3.01 Init + 100001 100111 111 1 0 69 80 93 0.789 6.83 3.02 Intr + 106282 106410 129 0 0 104 87 77 0.940 10.09 3.03 Intr + 114320 115054 735 1 0 81 81 284 0.766 18.64 3.04 Intr + 117348 117410 63 2 0 78 78 100 0.982 6.91 3.05 Intr + 120456 120545 90 2 0 80 79 118 0.999 10.29 3.06 Intr + 122762 122869 108 1 0 79 76 65 0.889 4.88 3.07 Intr + 126986 127190 205 1 1 69 113 142 0.798 13.57 3.08 Intr + 141060 141107 48 1 0 55 121 14 0.001 0.05 3.09 Intr + 178184 178295 112 0 1 115 98 29 0.040 6.04 3.10 Intr + 178928 179213 286 2 1 72 75 47 0.039 -0.86 3.11 Intr + 181460 181598 139 1 1 57 102 66 0.216 4.94 3.12 Intr + 189835 189986 152 2 2 123 80 97 0.999 12.28 3.13 Intr + 192127 192285 159 0 0 124 107 183 0.991 23.98 3.14 Intr + 196087 196179 93 0 0 98 102 19 0.932 4.46 3.15 Intr + 197642 197713 72 1 0 103 116 20 0.981 5.90 3.16 Intr + 198883 198977 95 0 2 58 41 93 0.644 0.36 3.17 Intr + 199076 199148 73 2 1 77 64 44 0.565 0.31 3.18 Intr + 204448 204707 260 0 2 100 71 258 0.555 21.56 3.19 Intr + 204751 204836 86 1 2 87 48 21 0.553 -2.54 3.20 Intr + 205402 205520 119 2 2 104 61 90 0.999 8.08 3.21 Intr + 207385 207481 97 0 1 106 61 177 0.785 16.38 3.22 Intr + 207806 207938 133 0 1 85 57 136 0.998 9.90 3.23 Intr + 208665 208844 180 0 0 -2 99 315 0.972 22.58 3.24 Intr + 216857 216913 57 2 0 75 94 40 0.596 1.30 3.25 Intr + 217683 217740 58 0 1 68 84 -10 0.634 -4.51 3.26 Intr + 218255 218323 69 1 0 79 110 48 0.772 5.48 3.27 Intr + 219306 219437 132 2 0 70 98 255 0.793 25.54 3.28 Intr + 220360 220482 123 0 0 97 55 162 0.777 14.58 3.29 Intr + 221157 221267 111 2 0 71 49 180 0.916 12.98 3.30 Intr + 221942 222030 89 1 2 34 99 86 0.985 3.07 3.31 Intr + 223428 223475 48 0 0 88 109 38 0.908 3.70 3.32 Intr + 224245 224357 113 1 2 15 64 152 0.928 5.52 3.33 Intr + 224983 225091 109 2 1 96 89 103 0.984 10.54 3.34 Term + 225849 225981 133 0 1 127 48 182 0.993 15.66 3.35 PlyA + 226467 226472 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 13517 13535 19 1 1 73 83 34 0.856 1.58 S.002 Init - 154117 154064 54 1 0 83 70 82 0.899 7.18 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:37254165_37481354|GENSCAN_predicted_peptide_1|441_aa MAAENSKQFWKRSAKLPGSFIKERSKVNTVPLKNKKASSFHEFARNTSDAWDIGDDEEED FSSPSFQTLNSKVALATAAQVLENHSKLRVKPERSQSTTSDVPANYKVIKSSSDAQLSRN SSDTCLRNPLHKQQSLPLRPIIPLVARISDQNASGAPPMTVREKTRLEKFRQLLSSQNTD LANTERRKLTLQRKREEYFGFIEQYYDSRNEEHHQDTYRQIFERILFIWAIRHPASGYVQ GINDLVTPFFVVFLSEYVEEDVENFDVTNLSQDMLRSIEADSFWCMSKLLDGIQDNYTFA QPGIQKKVKALEELVSRIDEQVHNHFRRYEVEYLQFAFRWMNNLLMRELPLRCTIRLWDT YQSEPEGFSHFHLYVCAAFLIKWRKEILDEEDFQSSRESSSSSSSSEAQNMGAQEDVSRR QQGLRGLRMEFEKHHAGHSED >gi568815592f:37254165_37481354|GENSCAN_predicted_CDS_1|1326_bp atggccgctgagaacagcaagcagttttggaagaggagcgctaagctgccggggagtttc attaaagaacgatcaaaagtcaacacagttcctctgaagaataagaaggcctccagtttt catgagtttgcacggaataccagtgatgcttgggacattggcgatgatgaggaagaggac ttttcctcaccttctttccaaactctgaactcaaaagttgctttggcaactgcagcccaa gttctagaaaaccacagcaagctgagagtaaaaccagaacggtcccagtcaacgacatcg gacgtccctgccaactacaaggtcataaagtccagcagtgatgcccagctgtccagaaac tctagtgatacatgcctgaggaacccactccacaaacagcaatcactccctctccggccc atcatccccctcgttgcccggatctcggatcagaacgcttctggggcccccccaatgact gtccgggagaaaacccgcctagaaaaattccgtcaacttctctccagccagaacactgac ttagcaaacactgagaggaggaagttgaccctgcagcggaagcgggaggaatattttggc ttcattgaacagtattatgactctcgaaacgaggaacatcaccaggatacctacagacag atctttgaaagaattctatttatttgggccatccgccaccctgccagtgggtatgtccag ggaattaatgacctggtcactccattctttgtcgtcttcctctcagaatatgtggaagag gatgtggagaactttgacgtgaccaacttgtctcaagacatgctgcgaagcattgaggct gacagcttttggtgcatgagcaagctgctggatggaatccaggataactacacctttgca caaccaggaatccagaagaaggtgaaggcactggaagagcttgtcagccggattgatgag caggtacataatcacttcaggaggtacgaggtagaatacctgcagtttgccttccgctgg atgaacaacctgcttatgcgggagcttcctcttcgctgcaccatccgcctgtgggacaca tatcagtctgaaccagaagggttctcccactttcatctctacgtgtgtgcagccttcttg atcaagtggaggaaagagatcttggatgaggaggattttcagtcatctcgggaaagcagc agcagcagcagcagcagcgaagcccagaacatgggagcccaggaagatgtcagcaggcgc cagcaaggcctcagagggctcaggatggaatttgagaagcaccatgcagggcactctgag gactaa >gi568815592f:37254165_37481354|GENSCAN_predicted_peptide_2|218_aa MGYYAAIKKDEFMSFVGTWMKLETIILSKLSQGQKSKRRMFSLIGLGVLCIEKAKQGKRE WGRLILWEAHLFHHESKIITLASMAASKTVSNQEISVGHPERELGPSTRHGILTVVQVTH PETWSGGCQKVKTWAVGVSFVQQEWFLAGVHGSHSGHSDQTAIRDEGSLGPRRRQHLSAV VIWGIGKHVLESVCLGEKQPNFFVAPVYCSHQCDPAGD >gi568815592f:37254165_37481354|GENSCAN_predicted_CDS_2|657_bp atgggatactatgcagccataaaaaaggatgagttcatgtcttttgtagggacatggatg aagctggaaaccatcattctcagcaaactatcgcaaggacaaaaaagcaaacgccgcatg ttctcactcatagggcttggagtcctctgcatcgagaaagcaaaacaagggaagagagaa tggggaaggctgatactgtgggaagctcatctttttcatcatgaatctaaaatcatcacc ttggcatctatggcagcttctaaaactgtctccaaccaggaaattagtgttggacatcct gagagagaactgggcccatccacgcggcatggaatactgacagtggtccaagtgacacat ccagagacatggagtggtggatgccagaaagttaagacctgggcggtgggtgtgagcttt gtacaacaggagtggttcctggccggggtccacggctcgcacagtggccacagtgatcag actgccatcagagatgaaggcagtctgggtccccggaggagacagcacctatcggcggta gtgatttggggcatcggcaaacatgtacttgagtctgtatgcctcggcgagaagcagccc aatttcttcgttgccccagtgtattgtagccaccagtgtgacccagctggtgactaa >gi568815592f:37254165_37481354|GENSCAN_predicted_peptide_3|1528_aa MGEPGFFVTGDRAGGRSWCLRRVGMSAGWLLLEDGCEVTVGRGFGVTYQLVSKICPLMIS RNHCVLKQNPEGQWTIMDNKSLNGVWLNRARLEPLRVYSIHQGDYIQLGVPLENKENAEY EYEVTEEDWETIYPCLSPKNDQMIEKNKELRTKRKFSLDELAGPGAEGPSNLKSKINKVS CESGQPVKSQGKGEVASTPSDNLDPKLTALEPSKTTGAPIYPGFPKVTEVHHEQKASNSS ASQRSLQMFKVTMSRILRLKIQMQEKHEAVMNVKKQTQKGNSKKVVQMEQELQDLQSQLC AEQAQQQARVEQLEKTFQEEEQHLQGLEIAQGEKDLKQQLAQALQEHWALMEELNRSKKD FEAIIQAKNKELEQTKEEKEKMQAQKEEVLSHMNDVLENELQCIICSEYFIEAVTLNCAH SFCSYCINEWMKRKIECPICRKDIKSKTYSLVLDNCINKMVNNLSSEVKERRIVLIRERK GRHPGDLCGSLRYFSQGVENYKNHFGFVKNALEGSQLETVTGVWSYWRNSAERGRPWTSQ VERWSGPNLSCLSRGRDPAGGGPRRGAVRDVIRLRCPQYWTRARRCGWRTRAGLWGNETE GGGGGSGSGGDSVGLTPPPLPGSSLSPLSLTMKRRTDPECTAPIKKQKKRVAELALSLSS TSDDEPPSSVSHGAKASTTSLSGSDSETEGKQHSSDSFDDAFKADSLVEGTSSRYSMYNS VSQKLMAKMGFREGEGLGKYSQGRKDIVEASSQKGRRGLGLTLRGFDQELNVDWRDEPEP SACEQVSWFPECTTEIPDTQEMSDWMVVGKRKMIIEDETEFCGEELLHSVLQCKSVFDVL DGEEMRRARTRANPYEMIRGVFFLNRAAMKMANMDFVFDRMFTNPRDSYGKPLVKDREAE LLYFADVCAGPGGFSEYVLWRKKWHAKGFGMTLKGPNDFKLEDFYSASSELFEPYYGRDI EEGTRRYEGQPLYGDFRSLTLGFTVHVRILVPSGLSLFTLGLQNTGEGGIDGDGDITRPE NISAFRNFVLDNTDRKGVHFLMADGGFSVEGQENLQEILSKQLLLCQFLMALSIVRTGGH FICKTFDLFTPFSVGLVYLLYCCFERVCLFKPITSRPANSERYVVCKGLKVGIDDVRDYL FAVNIKLNQLRNTDSDVNLVVPLEVIKGDHEFTDYMIRSNESHCSLQIKALAKIHAFVQD TTLSEPRQAEIRKECLRLWGIPDQARVAPSSSDPKSKFFELIQGTEIDIFSYKPTLLTSK TLEKIRPVFDYRCMVSGSEQKFLIGLGKSQIYTWDGRQSDRWIKLDLKTELPRDTLLSVE IVHELKGEGKAQRKISAIHILDVLVLNGTDVREQHFNQRSDLGPRRDCLDCRIQLAEKFV KAVSKPSRPDMNPIRVKEVYRLEEMEKIFVRLEMKIIKGSSGTPKLSYTGRDDRHFVPMG LYIVRTVNEPWTMGFSKSFKKKFFYNKKTKDSTFDLPADSIAPFHICYYGRLFWEWGDGI RVHDSQKPQDQDKLSKEDVLSFIQMHRA >gi568815592f:37254165_37481354|GENSCAN_predicted_CDS_3|4587_bp atgggggagcccggcttcttcgtcacaggagaccgcgccggtggccggagctggtgcctg cggcgggtggggatgagcgccgggtggctgctgctggaagatgggtgcgaggtgactgta ggacgaggatttggtgtcacataccaactggtatcaaaaatctgccccctgatgatttct cgaaaccactgtgttttgaagcagaatcctgagggccaatggacaattatggacaacaag agtctaaatggtgtttggctgaacagagcgcgtctggaacctttaagggtctattccatt catcagggagactacatccaacttggagtgcctctggaaaataaggagaatgcggagtat gaatatgaagttactgaagaagactgggagacaatatatccttgtctttccccaaagaat gaccaaatgatagaaaaaaataaggaattgagaactaaaaggaaattcagtttggatgaa ttagcaggtcctggagctgaaggcccctcaaatttgaaatccaaaataaataaagtgtct tgtgaatctggtcagccagtgaaatcacaggggaaaggtgaagtggccagtacaccctct gacaatttggatcctaagttgactgcccttgagccaagtaagaccacaggggctcccatt taccctggcttccccaaagtcacagaggttcatcatgagcagaaagcctcaaactcttca gcatctcagagaagcttacagatgtttaaggtgaccatgtccaggattctgaggctcaaa atacagatgcaggaaaaacatgaagccgttatgaatgtgaaaaagcagacccaaaagggg aactcaaagaaagttgtgcaaatggagcaggaacttcaggacttacagtcccagctgtgt gcagagcaggctcagcagcaggcaagagtggagcaactagagaagactttccaggaagag gaacagcatcttcagggtttggagatagcccaaggagaaaaggacctgaagcaacagctg gcccaggctctgcaggagcattgggctctaatggaagagctaaatcgcagcaagaaggac tttgaagcaatcattcaagccaagaacaaagaattagagcagaccaaggaagagaaggag aagatgcaagcacagaaggaagaagttcttagccacatgaatgatgtgctagagaatgag ctccaatgtattatttgttcagaatacttcattgaggctgtcaccttgaactgtgcccac agtttctgctcctactgtatcaatgaatggatgaagcggaagatagaatgccccatttgt cggaaggacattaagtccaaaacgtactctttggttctggacaattgcattaataagatg gtaaataatctgagctcagaagtgaaagaacgacgaattgttctcattagggaacgaaaa ggcaggcacccaggagacctctgtggaagcctcaggtacttcagccaaggtgttgaaaat tataaaaatcactttggttttgtgaagaacgcattagagggatcacaactggagacagtc acgggagtttggagttactggagaaattctgccgagagggggcgcccctggacctcccag gttgagaggtggagcgggccaaacctcagctgcctttcccggggccgggacccggccggg ggaggaccgaggcgcggcgctgtccgtgacgtcatcaggctgcgctgcccgcagtactgg acccgagcgcgacggtgcggctggcggacccgggctggcttgtggggaaacgaaactgag ggaggaggcggcggctctggcagcggcggcgacagtgtcggcctgaccccccctccgctc cccggcagctcgctctctcccctcagcttaacgatgaagaggagaactgacccagaatgc actgcccccatcaagaaacagaaaaaaagagttgcagagcttgccctgagcctcagctcc acgtccgatgatgaacctccctcctctgtcagtcatggagcaaaagcatctactacaagc cttagtgggtctgatagtgagaccgaggggaaacaacacagctctgactcttttgacgat gcattcaaagcagactctcttgtggaaggaacttcttctcgctattccatgtataatagc gtctcccagaagcttatggccaagatgggcttcagggaaggtgaaggattgggtaaatac agccagggtcggaaggacatcgttgaggcttccagtcagaaaggtcgaagaggcttgggt ctgacactccggggctttgaccaggagctgaacgtggactggcgagatgagccagagccc agtgcttgtgagcaggtgtcatggtttccagaatgtaccactgaaattcctgacactcag gaaatgagcgattggatggtggtgggaaagagaaagatgattattgaagatgaaacagag ttttgtggggaagagctgcttcacagtgtgttgcagtgtaagagcgtgtttgatgtcttg gatggggaagagatgcggcgagctcggactcgggccaatccctatgagatgatccgagga gtcttctttctaaacagggcagcaatgaagatggctaacatggattttgtatttgatcgc atgttcacaaatccgcgggactcttatgggaagccactggtgaaggaccgggaagctgag cttctgtactttgctgatgtctgcgcaggcccaggtggcttctcagagtatgtgctgtgg aggaagaagtggcatgcaaagggctttggaatgactttgaagggccctaatgacttcaag ctggaggacttctactctgcttccagtgaactcttcgaaccctactatggtagggacatt gaggagggtactaggaggtatgagggacagcccctctatggggacttcaggtctcttacc ctgggcttcacagttcatgtcagaatcttggttccctctggactatccctttttaccctg ggcctgcagaatacaggtgagggtgggattgatggagatggagatatcacccgcccagag aacatctctgcttttcggaattttgtcctggataacacagatcgcaagggtgtccatttt ctgatggctgatgggggtttctcggtggaggggcaggagaacctgcaggagatcctcagc aagcagctgcttctgtgtcagttcctcatggcgctgtccattgtccggacaggaggccac ttcatctgtaaaacctttgacctgttcacaccgtttagtgtggggcttgtctacctgctg tactgctgctttgaacgagtttgtctcttcaagcctattaccagccgtcctgccaactca gagaggtatgtggtgtgcaagggcctgaaggtgggcatagatgatgttcgggattacctc ttcgcagtgaatattaaactcaatcagctgcggaacacggattccgacgtcaacttggtg gtccccctggaggtgatcaagggagaccatgaatttactgactacatgatacggtccaat gagagccactgtagtctgcagatcaaagctctggcgaaaatccatgcctttgttcaagac acgacactgagtgagcctcgacaggcagagatacggaaggagtgcctccgactctggggg atcccagaccaggctcgtgtggctccttcttcctccgaccctaaatcgaagttctttgag ctaatccagggcactgagattgacatcttcagctacaagcccacactgctcacctctaaa accctggagaagatccgccctgtgtttgactaccgctgcatggtatctggcagtgagcag aagttcctcatcggcctggggaaatcccagatctacacatgggatggccgccagtcagac cgctggatcaagctagacctgaagacagagctgccccgggacactctgctatctgtggaa attgtgcatgagctgaaaggggaggggaaggcccagaggaagatcagtgccatccacatc ctcgatgtccttgtgctgaatggcaccgacgttcgggagcagcactttaaccagcggtct gacctgggccccaggagagactgtttggattgtagaattcagcttgccgagaaatttgtg aaagccgtttccaagcctagtcggcccgacatgaatcccatcagggtgaaggaggtgtac agactggaagagatggagaagatttttgtcaggttggagatgaagatcatcaagggctcc agtggcaccccaaagctcagctacacagggcgtgatgaccggcactttgtacccatgggc ctctacatcgtcaggacagtgaatgagccctggactatgggattcagcaaaagcttcaag aagaagttcttctacaacaagaaaaccaaggactctacttttgacctccctgcagactcc attgccccatttcacatttgctactatggccggctcttctgggagtggggggatggcatt cgtgtgcatgactcccagaagccccaggaccaggacaagctgtccaaggaggacgtcctc tccttcatccagatgcacagggcctaa