GENSCAN 1.0 Date run: 2-Nov-116 Time: 18:00:08 Sequence gi568815592f:37335630_37580142 : 244513 bp : 46.74% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 7624 7570 55 2 1 129 81 28 0.150 4.24 1.01 Init - 9980 9848 133 2 1 82 63 68 0.658 4.00 1.00 Prom - 10104 10065 40 -2.46 2.00 Prom + 13836 13875 40 -5.46 2.01 Init + 18536 18646 111 1 0 69 80 93 0.789 6.83 2.02 Intr + 24817 24945 129 0 0 104 87 77 0.940 10.09 2.03 Intr + 32855 33589 735 1 0 81 81 284 0.766 18.64 2.04 Intr + 35883 35945 63 2 0 78 78 100 0.982 6.91 2.05 Intr + 38991 39080 90 2 0 80 79 118 0.999 10.29 2.06 Intr + 41297 41404 108 1 0 79 76 65 0.889 4.88 2.07 Intr + 45521 45725 205 1 1 69 113 142 0.798 13.57 2.08 Intr + 59595 59642 48 1 0 55 121 14 0.001 0.05 2.09 Intr + 96719 96830 112 0 1 115 98 29 0.040 6.04 2.10 Intr + 97463 97748 286 2 1 72 75 47 0.039 -0.86 2.11 Intr + 99995 100133 139 1 1 57 102 66 0.216 4.94 2.12 Intr + 108370 108521 152 2 2 123 80 97 0.999 12.28 2.13 Intr + 110662 110820 159 0 0 124 107 183 0.991 23.98 2.14 Intr + 114622 114714 93 0 0 98 102 19 0.932 4.46 2.15 Intr + 116177 116248 72 1 0 103 116 20 0.981 5.90 2.16 Intr + 117418 117512 95 0 2 58 41 93 0.644 0.36 2.17 Intr + 117611 117683 73 2 1 77 64 44 0.565 0.31 2.18 Intr + 122983 123242 260 0 2 100 71 258 0.555 21.56 2.19 Intr + 123286 123371 86 1 2 87 48 21 0.553 -2.54 2.20 Intr + 123937 124055 119 2 2 104 61 90 0.999 8.08 2.21 Intr + 125920 126016 97 0 1 106 61 177 0.785 16.38 2.22 Intr + 126341 126473 133 0 1 85 57 136 0.998 9.90 2.23 Intr + 127200 127379 180 0 0 -2 99 315 0.972 22.58 2.24 Intr + 135392 135448 57 2 0 75 94 40 0.596 1.30 2.25 Intr + 136218 136275 58 0 1 68 84 -10 0.634 -4.51 2.26 Intr + 136790 136858 69 1 0 79 110 48 0.772 5.48 2.27 Intr + 137841 137972 132 2 0 70 98 255 0.793 25.54 2.28 Intr + 138895 139017 123 0 0 97 55 162 0.777 14.58 2.29 Intr + 139692 139802 111 2 0 71 49 180 0.916 12.98 2.30 Intr + 140477 140565 89 1 2 34 99 86 0.985 3.07 2.31 Intr + 141963 142010 48 0 0 88 109 38 0.908 3.70 2.32 Intr + 142780 142892 113 1 2 15 64 152 0.928 5.52 2.33 Intr + 143518 143626 109 2 1 96 89 103 0.984 10.54 2.34 Term + 144384 144516 133 0 1 127 48 182 0.994 15.66 2.35 PlyA + 145002 145007 6 -0.45 3.08 PlyA - 147328 147323 6 1.05 3.07 Term - 147660 147557 104 1 2 99 52 199 0.995 15.84 3.06 Intr - 149233 149181 53 0 2 72 107 73 0.913 6.15 3.05 Intr - 149565 149471 95 0 2 98 95 97 0.999 10.16 3.04 Intr - 154287 154243 45 0 0 87 97 11 0.231 0.51 3.03 Intr - 155597 155476 122 0 2 81 53 40 0.301 0.01 3.02 Intr - 177292 177218 75 2 0 91 64 80 0.271 5.39 3.01 Init - 179223 179118 106 0 1 71 34 106 0.278 3.88 3.00 Prom - 188228 188189 40 -5.36 4.06 PlyA - 188758 188753 6 1.05 4.05 Term - 200300 200093 208 1 1 70 34 99 0.425 -0.49 4.04 Intr - 203995 203822 174 0 0 139 53 101 0.547 10.75 4.03 Intr - 210169 210118 52 2 1 98 95 35 0.624 3.17 4.02 Intr - 212555 212221 335 1 2 101 53 74 0.575 0.41 4.01 Init - 213117 212891 227 0 2 92 107 64 0.588 6.64 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 72652 72599 54 1 0 83 70 82 0.899 7.18 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:37335630_37580142|GENSCAN_predicted_peptide_1|63_aa MGYYAAIKKDEFMSFVGTWMKLETIILSKLSQGQKSKRRMFSLIGLGVLCIEKAKQGKRE WGS >gi568815592f:37335630_37580142|GENSCAN_predicted_CDS_1|189_bp atgggatactatgcagccataaaaaaggatgagttcatgtcttttgtagggacatggatg aagctggaaaccatcattctcagcaaactatcgcaaggacaaaaaagcaaacgccgcatg ttctcactcatagggcttggagtcctctgcatcgagaaagcaaaacaagggaagagagaa tggggaagn >gi568815592f:37335630_37580142|GENSCAN_predicted_peptide_2|1528_aa MGEPGFFVTGDRAGGRSWCLRRVGMSAGWLLLEDGCEVTVGRGFGVTYQLVSKICPLMIS RNHCVLKQNPEGQWTIMDNKSLNGVWLNRARLEPLRVYSIHQGDYIQLGVPLENKENAEY EYEVTEEDWETIYPCLSPKNDQMIEKNKELRTKRKFSLDELAGPGAEGPSNLKSKINKVS CESGQPVKSQGKGEVASTPSDNLDPKLTALEPSKTTGAPIYPGFPKVTEVHHEQKASNSS ASQRSLQMFKVTMSRILRLKIQMQEKHEAVMNVKKQTQKGNSKKVVQMEQELQDLQSQLC AEQAQQQARVEQLEKTFQEEEQHLQGLEIAQGEKDLKQQLAQALQEHWALMEELNRSKKD FEAIIQAKNKELEQTKEEKEKMQAQKEEVLSHMNDVLENELQCIICSEYFIEAVTLNCAH SFCSYCINEWMKRKIECPICRKDIKSKTYSLVLDNCINKMVNNLSSEVKERRIVLIRERK GRHPGDLCGSLRYFSQGVENYKNHFGFVKNALEGSQLETVTGVWSYWRNSAERGRPWTSQ VERWSGPNLSCLSRGRDPAGGGPRRGAVRDVIRLRCPQYWTRARRCGWRTRAGLWGNETE GGGGGSGSGGDSVGLTPPPLPGSSLSPLSLTMKRRTDPECTAPIKKQKKRVAELALSLSS TSDDEPPSSVSHGAKASTTSLSGSDSETEGKQHSSDSFDDAFKADSLVEGTSSRYSMYNS VSQKLMAKMGFREGEGLGKYSQGRKDIVEASSQKGRRGLGLTLRGFDQELNVDWRDEPEP SACEQVSWFPECTTEIPDTQEMSDWMVVGKRKMIIEDETEFCGEELLHSVLQCKSVFDVL DGEEMRRARTRANPYEMIRGVFFLNRAAMKMANMDFVFDRMFTNPRDSYGKPLVKDREAE LLYFADVCAGPGGFSEYVLWRKKWHAKGFGMTLKGPNDFKLEDFYSASSELFEPYYGRDI EEGTRRYEGQPLYGDFRSLTLGFTVHVRILVPSGLSLFTLGLQNTGEGGIDGDGDITRPE NISAFRNFVLDNTDRKGVHFLMADGGFSVEGQENLQEILSKQLLLCQFLMALSIVRTGGH FICKTFDLFTPFSVGLVYLLYCCFERVCLFKPITSRPANSERYVVCKGLKVGIDDVRDYL FAVNIKLNQLRNTDSDVNLVVPLEVIKGDHEFTDYMIRSNESHCSLQIKALAKIHAFVQD TTLSEPRQAEIRKECLRLWGIPDQARVAPSSSDPKSKFFELIQGTEIDIFSYKPTLLTSK TLEKIRPVFDYRCMVSGSEQKFLIGLGKSQIYTWDGRQSDRWIKLDLKTELPRDTLLSVE IVHELKGEGKAQRKISAIHILDVLVLNGTDVREQHFNQRSDLGPRRDCLDCRIQLAEKFV KAVSKPSRPDMNPIRVKEVYRLEEMEKIFVRLEMKIIKGSSGTPKLSYTGRDDRHFVPMG LYIVRTVNEPWTMGFSKSFKKKFFYNKKTKDSTFDLPADSIAPFHICYYGRLFWEWGDGI RVHDSQKPQDQDKLSKEDVLSFIQMHRA >gi568815592f:37335630_37580142|GENSCAN_predicted_CDS_2|4587_bp atgggggagcccggcttcttcgtcacaggagaccgcgccggtggccggagctggtgcctg cggcgggtggggatgagcgccgggtggctgctgctggaagatgggtgcgaggtgactgta ggacgaggatttggtgtcacataccaactggtatcaaaaatctgccccctgatgatttct cgaaaccactgtgttttgaagcagaatcctgagggccaatggacaattatggacaacaag agtctaaatggtgtttggctgaacagagcgcgtctggaacctttaagggtctattccatt catcagggagactacatccaacttggagtgcctctggaaaataaggagaatgcggagtat gaatatgaagttactgaagaagactgggagacaatatatccttgtctttccccaaagaat gaccaaatgatagaaaaaaataaggaattgagaactaaaaggaaattcagtttggatgaa ttagcaggtcctggagctgaaggcccctcaaatttgaaatccaaaataaataaagtgtct tgtgaatctggtcagccagtgaaatcacaggggaaaggtgaagtggccagtacaccctct gacaatttggatcctaagttgactgcccttgagccaagtaagaccacaggggctcccatt taccctggcttccccaaagtcacagaggttcatcatgagcagaaagcctcaaactcttca gcatctcagagaagcttacagatgtttaaggtgaccatgtccaggattctgaggctcaaa atacagatgcaggaaaaacatgaagccgttatgaatgtgaaaaagcagacccaaaagggg aactcaaagaaagttgtgcaaatggagcaggaacttcaggacttacagtcccagctgtgt gcagagcaggctcagcagcaggcaagagtggagcaactagagaagactttccaggaagag gaacagcatcttcagggtttggagatagcccaaggagaaaaggacctgaagcaacagctg gcccaggctctgcaggagcattgggctctaatggaagagctaaatcgcagcaagaaggac tttgaagcaatcattcaagccaagaacaaagaattagagcagaccaaggaagagaaggag aagatgcaagcacagaaggaagaagttcttagccacatgaatgatgtgctagagaatgag ctccaatgtattatttgttcagaatacttcattgaggctgtcaccttgaactgtgcccac agtttctgctcctactgtatcaatgaatggatgaagcggaagatagaatgccccatttgt cggaaggacattaagtccaaaacgtactctttggttctggacaattgcattaataagatg gtaaataatctgagctcagaagtgaaagaacgacgaattgttctcattagggaacgaaaa ggcaggcacccaggagacctctgtggaagcctcaggtacttcagccaaggtgttgaaaat tataaaaatcactttggttttgtgaagaacgcattagagggatcacaactggagacagtc acgggagtttggagttactggagaaattctgccgagagggggcgcccctggacctcccag gttgagaggtggagcgggccaaacctcagctgcctttcccggggccgggacccggccggg ggaggaccgaggcgcggcgctgtccgtgacgtcatcaggctgcgctgcccgcagtactgg acccgagcgcgacggtgcggctggcggacccgggctggcttgtggggaaacgaaactgag ggaggaggcggcggctctggcagcggcggcgacagtgtcggcctgaccccccctccgctc cccggcagctcgctctctcccctcagcttaacgatgaagaggagaactgacccagaatgc actgcccccatcaagaaacagaaaaaaagagttgcagagcttgccctgagcctcagctcc acgtccgatgatgaacctccctcctctgtcagtcatggagcaaaagcatctactacaagc cttagtgggtctgatagtgagaccgaggggaaacaacacagctctgactcttttgacgat gcattcaaagcagactctcttgtggaaggaacttcttctcgctattccatgtataatagc gtctcccagaagcttatggccaagatgggcttcagggaaggtgaaggattgggtaaatac agccagggtcggaaggacatcgttgaggcttccagtcagaaaggtcgaagaggcttgggt ctgacactccggggctttgaccaggagctgaacgtggactggcgagatgagccagagccc agtgcttgtgagcaggtgtcatggtttccagaatgtaccactgaaattcctgacactcag gaaatgagcgattggatggtggtgggaaagagaaagatgattattgaagatgaaacagag ttttgtggggaagagctgcttcacagtgtgttgcagtgtaagagcgtgtttgatgtcttg gatggggaagagatgcggcgagctcggactcgggccaatccctatgagatgatccgagga gtcttctttctaaacagggcagcaatgaagatggctaacatggattttgtatttgatcgc atgttcacaaatccgcgggactcttatgggaagccactggtgaaggaccgggaagctgag cttctgtactttgctgatgtctgcgcaggcccaggtggcttctcagagtatgtgctgtgg aggaagaagtggcatgcaaagggctttggaatgactttgaagggccctaatgacttcaag ctggaggacttctactctgcttccagtgaactcttcgaaccctactatggtagggacatt gaggagggtactaggaggtatgagggacagcccctctatggggacttcaggtctcttacc ctgggcttcacagttcatgtcagaatcttggttccctctggactatccctttttaccctg ggcctgcagaatacaggtgagggtgggattgatggagatggagatatcacccgcccagag aacatctctgcttttcggaattttgtcctggataacacagatcgcaagggtgtccatttt ctgatggctgatgggggtttctcggtggaggggcaggagaacctgcaggagatcctcagc aagcagctgcttctgtgtcagttcctcatggcgctgtccattgtccggacaggaggccac ttcatctgtaaaacctttgacctgttcacaccgtttagtgtggggcttgtctacctgctg tactgctgctttgaacgagtttgtctcttcaagcctattaccagccgtcctgccaactca gagaggtatgtggtgtgcaagggcctgaaggtgggcatagatgatgttcgggattacctc ttcgcagtgaatattaaactcaatcagctgcggaacacggattccgacgtcaacttggtg gtccccctggaggtgatcaagggagaccatgaatttactgactacatgatacggtccaat gagagccactgtagtctgcagatcaaagctctggcgaaaatccatgcctttgttcaagac acgacactgagtgagcctcgacaggcagagatacggaaggagtgcctccgactctggggg atcccagaccaggctcgtgtggctccttcttcctccgaccctaaatcgaagttctttgag ctaatccagggcactgagattgacatcttcagctacaagcccacactgctcacctctaaa accctggagaagatccgccctgtgtttgactaccgctgcatggtatctggcagtgagcag aagttcctcatcggcctggggaaatcccagatctacacatgggatggccgccagtcagac cgctggatcaagctagacctgaagacagagctgccccgggacactctgctatctgtggaa attgtgcatgagctgaaaggggaggggaaggcccagaggaagatcagtgccatccacatc ctcgatgtccttgtgctgaatggcaccgacgttcgggagcagcactttaaccagcggtct gacctgggccccaggagagactgtttggattgtagaattcagcttgccgagaaatttgtg aaagccgtttccaagcctagtcggcccgacatgaatcccatcagggtgaaggaggtgtac agactggaagagatggagaagatttttgtcaggttggagatgaagatcatcaagggctcc agtggcaccccaaagctcagctacacagggcgtgatgaccggcactttgtacccatgggc ctctacatcgtcaggacagtgaatgagccctggactatgggattcagcaaaagcttcaag aagaagttcttctacaacaagaaaaccaaggactctacttttgacctccctgcagactcc attgccccatttcacatttgctactatggccggctcttctgggagtggggggatggcatt cgtgtgcatgactcccagaagccccaggaccaggacaagctgtccaaggaggacgtcctc tccttcatccagatgcacagggcctaa >gi568815592f:37335630_37580142|GENSCAN_predicted_peptide_3|199_aa MTEKKEIGFKGTGPECNFLSGPGEDKKLHNVDWSTAASQRTREPLDAEKSAFQAKKEGEP EKLHMRPPTAASRGAGTPSADLARMLGQLSVHELARNSLPPVSALAVSTLPVTVTAIDGL EEKLSQCRRDLEAVNSRLHSRELSPEARRSLEKEKNSLMNKASNYEKELKFLRQENRKNM LLSVAIFILLTLVYAYWTM >gi568815592f:37335630_37580142|GENSCAN_predicted_CDS_3|600_bp atgacagagaaaaaagagatcggcttcaagggaacaggccctgagtgtaactttctttct ggccctggggaagacaagaagctgcacaatgtggattggagcacagctgcaagccagagg acaagggagcccttggatgcagagaagtcagcattccaggccaagaaggagggggagcca gaaaagctgcatatgaggcccccaacagcagctagccgtggagctggcaccccatcagct gacctggccaggatgctggggcagttatctgtgcatgagctggcacgaaattccctgccc ccagtctctgccttggctgtctctactttgccggtcacagtcacagccatcgatgggcta gaggagaagctgtcccagtgtcggagagacctggaggccgtgaactccagactccacagc cgggagctgagcccagaggccaggaggtccctggagaaggagaaaaacagcctaatgaac aaagcctccaactacgagaaggaactgaagtttcttcggcaagagaaccggaagaacatg ctgctctctgtggccatctttatcctcctgacgctcgtctatgcctactggaccatgtga >gi568815592f:37335630_37580142|GENSCAN_predicted_peptide_4|331_aa MPGCFRHLQTQMQTSEDTPQPVPEGSKMGSRVQSWTQSQDPKLSALFPRGPGDSREKPEA QSQLRELARQGCGMQRRGAEGRLTLAPDAAELDFSALIVGCHLCKAGWTHQPQRVVNLQP VLLLVHTRPEQPQPPTPQSPAEEYANTPQNKFQLEFLPSCFRQLHPSKPWPPLSTSSNWQ TSKERQDSGPQDRALKGLVETGKTWPGILIISISDVTGLGGEEATPLQGKLKTEIFSHAA GKWGEQRKGAQEPLRGSVLEAFQWPRRQGDVSAAPCQELGVFIRTWAAPAAVSWRIRASH GLLSNHLPGPPGPDCAVAAPPPSSCAGKLEY >gi568815592f:37335630_37580142|GENSCAN_predicted_CDS_4|996_bp atgccaggctgtttcaggcacctgcagacccagatgcagacatctgaggacacaccacaa ccagtccctgaggggagcaagatggggagcagggtgcagagctggacacaaagccaggac cccaaactcagcgctctgttcccacgtgggccaggagactcaagagaaaaacctgaggcc caatcccagctgcgtgaactggctcgacaaggctgcgggatgcagaggagaggggcagaa gggaggctaaccctggctccagatgcagctgagctagatttttctgccctgatcgtgggc tgccatctgtgcaaggctggttggacccaccagcctcagagggtggttaatttacaacct gtgctgctgcttgtccacaccaggccagagcagcctcaacccccaacccctcagtctcca gcagaagaatatgcgaacactcctcagaacaagttccagctcgagttccttccttcctgc ttccgtcaactccatccctccaagccttggcctcccctcagcacctcatccaactggcaa acttccaaggagaggcaggacagtgggccacaggatagagcattaaagggcctggtagaa actggaaagacctggcccgggatattgatcatctccatcagtgatgtcacaggactgggt ggggaggaggccactcccctgcaggggaagctgaagacagagatcttcagccatgcagcc gggaagtggggagagcagaggaagggggcccaggagcccctccgagggtctgtgctggag gctttccagtggccgagaaggcagggcgatgtgtcggctgcgccctgccaggagctaggt gttttcattaggacttgggcggcccctgccgccgtgagctggcggatccgtgcctcccac gggctgctcagcaaccaccttccagggcccccaggaccggactgcgccgtggccgctcct cctccctcctcttgtgctgggaagttggagtattaa