GENSCAN 1.0	Date run: 4-Nov-116	Time: 06:30:28

Sequence gi568815596f:85654190_85864183 : 209994 bp : 48.38% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.11 PlyA -    238    233    6                               1.05
 1.10 Term -   7128   7039   90  0  0   82   48    68 0.417  -0.08
 1.09 Intr -   7920   7840   81  0  0   94   86    81 0.990   8.33
 1.08 Intr -   9302   9157  146  0  2   79   86    78 0.702   6.70
 1.07 Intr -   9658   9475  184  1  1   87   63   358 0.999  32.56
 1.06 Intr -  11189  11100   90  2  0  109  100    29 0.982   6.39
 1.05 Intr -  11605  11417  189  1  0  112   94   131 0.986  15.88
 1.04 Intr -  12553  12428  126  1  0   95   46   243 0.999  21.68
 1.03 Intr -  12988  12917   72  1  0  108   95   103 0.882  12.60
 1.02 Intr -  13617  13490  128  1  2   34   78   -29 0.810  -8.90
 1.01 Init -  13994  13928   67  2  1   92  105   147 0.884  16.10
 1.00 Prom -  22612  22573   40                              -4.06

 2.00 Prom +  32425  32464   40                              -4.36
 2.01 Init +  40230  40281   52  2  1  111  110   119 0.999  15.74
 2.02 Intr +  40727  40807   81  0  0  111   93    35 0.984   5.91
 2.03 Intr +  41131  41234  104  2  2   71   98   106 0.983   9.79
 2.04 Intr +  41769  41867   99  2  0   81   94    98 0.982  10.01
 2.05 Intr +  43317  43488  172  2  1   65  109    25 0.869   1.82
 2.06 Intr +  48863  48911   49  0  1   85  105    -7 0.027  -1.56
 2.07 Intr +  56985  57101  117  0  0   96   48   112 0.167   7.58
 2.08 Term +  58859  59018  160  2  1   91   39    49 0.089  -2.29
 2.09 PlyA +  61645  61650    6                               1.05

 3.00 Prom +  63147  63186   40                              -3.56
 3.01 Init +  66824  66909   86  1  2   91   83    55 0.328   5.59
 3.02 Intr +  76171  76220   50  1  2   44   89    57 0.396  -0.28
 3.03 Term +  77449  77567  119  2  2   95   53    96 0.829   5.50
 3.04 PlyA +  80778  80783    6                               1.05

 4.00 Prom +  82580  82619   40                              -5.16
 4.01 Init + 100001 100768  768  1  0   96   80   773 0.961  72.00
 4.02 Intr + 109802 109993  192  1  0  136   91   264 0.861  31.19
 4.03 Intr + 113416 113451   36  0  0  100   87    20 0.452   1.56
 4.04 Intr + 117389 117573  185  1  2   84   89    50 0.330   3.29
 4.05 Intr + 117776 117905  130  2  1   -8   66    97 0.129  -1.40
 4.06 Intr + 124945 125103  159  0  0   84   72   141 0.807  12.28
 4.07 Intr + 141796 141923  128  0  2  119   36    33 0.265   0.68
 4.08 Intr + 146229 146352  124  0  1   96   87   102 0.741  11.49
 4.09 Intr + 156092 156146   55  1  1   79   73    -8 0.015  -4.65
 4.10 Intr + 156516 156656  141  1  0   99   39    70 0.152   3.52
 4.11 Intr + 157615 157766  152  2  2   77  115    37 0.179   5.18
 4.12 Term + 159373 159480  108  0  0   51   51    65 0.102  -2.39
 4.13 PlyA + 160962 160967    6                               1.05

 5.00 Prom + 161363 161402   40                              -7.86
 5.01 Init + 161650 161786  137  0  2   61  121   109 0.899  11.13
 5.02 Intr + 163505 163598   94  2  1    8   89    66 0.626  -1.33
 5.03 Intr + 164119 164207   89  0  2   68  105    35 0.411   1.97
 5.04 Term + 169807 169912  106  1  1  108   55    71 0.576   3.68
 5.05 PlyA + 172577 172582    6                               1.05

 6.00 Prom + 177333 177372   40                              -2.46
 6.01 Init + 184410 184608  199  2  1   57   69   110 0.866   3.80
 6.02 Term + 185354 185562  209  0  2   55   51   129 0.816   3.40
 6.03 PlyA + 185780 185785    6                               1.05

 7.07 PlyA - 185820 185815    6                              -0.45
 7.06 Term - 186203 185955  249  2  0   90   47   191 0.989  10.90
 7.05 Intr - 190365 190207  159  0  0   82   63    80 0.922   5.08
 7.04 Intr - 192374 192188  187  1  1   82  107   -24 0.865  -1.41
 7.03 Intr - 194015 193672  344  2  2   41   83   381 0.876  27.13
 7.02 Intr - 207103 206992  112  0  1   89   72    48 0.693   3.68
 7.01 Intr - 209296 209173  124  2  1  104  108    -2 0.625   3.04

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 198855 198761 95 1 2 -12 44 181 0.826 3.38 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:85654190_85864183|GENSCAN_predicted_peptide_1|390_aa MAESHLLQWLLLLLPTLCGPGTAAWTTSSLACAQGPEFWCQSLEQALQCRALGHCLQEVW GHVGADDLCQECEDIVHILNKMAKEAIFQDTMRKFLEQECNVLPLKLLMPQCNQVLDDYF PLVIDYFQNQTDSNGICMHLGLCKSRQPEPEQEPGMSDPLPKPLRDPLPDPLLDKLVLPV LPGALQARPGPHTQDLSEQQFPIPLPYCWLCRALIKRIQAMIPKGALAVAVAQVCRVVPL VAGGICQCLAERYSVILLDTLLGRMLPQLVCRLVLRCSMDDSAGPRSPTGEWLPRDSECH LCMSVTTQAGNSSEQAIPQAMLQACVGSWLDREKCKQFVEQHTPQLLTLVPRGWDAHTTC QLEAPEGFWLAIRKTPFPDPEHCPAQNPSH >gi568815596f:85654190_85864183|GENSCAN_predicted_CDS_1|1173_bp atggctgagtcacacctgctgcagtggctgctgctgctgctgcccacgctctgtggccca ggcactgctgcctggaccacctcatccttggcctgtgcccagggccctgagttctggtgc caaagcctggagcaagcattgcagtgcagagccctagggcattgcctacaggaagtctgg ggacatgtgggagccgatgacctatgccaagagtgtgaggacatcgtccacatccttaac aagatggccaaggaggccattttccaggacacgatgaggaagttcctggagcaggagtgc aacgtcctccccttgaagctgctcatgccccagtgcaaccaagtgcttgacgactacttc cccctggtcatcgactacttccagaaccagactgactcaaacggcatctgtatgcacctg ggcctgtgcaaatcccggcagccagagccagagcaggagccagggatgtcagaccccctg cccaaacctctgcgggaccctctgccagaccctctgctggacaagctcgtcctccctgtg ctgcccggggccctccaggcgaggcctgggcctcacacacaggatctctccgagcagcaa ttccccattcctctcccctattgctggctctgcagggctctgatcaagcggatccaagcc atgattcccaagggtgcgctagctgtggcagtggcccaggtgtgccgcgtggtacctctg gtggcgggcggcatctgccagtgcctggctgagcgctactccgtcatcctgctcgacacg ctgctgggccgcatgctgccccagctggtctgccgcctcgtcctccggtgctccatggat gacagcgctggcccaaggtcgccgacaggagaatggctgccgcgagactctgagtgccac ctctgcatgtccgtgaccacccaggccgggaacagcagcgagcaggccataccacaggca atgctccaggcctgtgttggctcctggctggacagggaaaagtgcaagcaatttgtggag cagcacacgccccagctgctgaccctggtgcccaggggctgggatgcccacaccacctgc cagctggaggctcctgagggcttctggctggccatcaggaaaacaccctttccggacccc gagcactgccccgcccagaaccccagtcactga >gi568815596f:85654190_85864183|GENSCAN_predicted_peptide_2|277_aa 
MATWALLLLAAMLLGNPGLEVSVSPKGKNTSGRESGFGWAIWMEGLVFSRLSPEYYDLAR AHLRDEEKSCPCLAQEGPQGDLLTKTQELGRDYRTCLTIVQKLKKMVDKPTQRSVSNAAT RVCRTGRSRWRDVCRNFMRRYQSRVTQGLVAGETAQQICEDLRLCIPSTGIYSYKFPFWH SLSYNPARVGSSGDHGGGHEILTGDLKLLYKPSNPDRSPTHNQQRSSDVWRLPIEVNVAE LALFPGPWALGTGISEPHGPRSCVATPVPIGEATQHP >gi568815596f:85654190_85864183|GENSCAN_predicted_CDS_2|834_bp atggctacctgggccctcctgctccttgcagccatgctcctgggcaacccaggccttgag gtcagtgtgagccccaagggcaagaacacttctggaagggagagtggatttggctgggcc atctggatggaaggtctggtcttctctcgtctgagccctgagtactacgacctggcaaga gcccacctgcgtgatgaggagaaatcctgcccgtgcctggcccaggagggcccccagggt gacctgttgaccaaaacacaggagctgggccgtgactacaggacctgtctgacgatagtc caaaaactgaagaagatggtggataagcccacccagagaagtgtttccaatgctgcgacc cgggtgtgtaggacggggaggtcacgatggcgcgacgtctgcagaaatttcatgaggagg tatcagtctagagttacccagggcctcgtggccggagaaactgcccagcagatctgtgag gacctcaggttgtgtataccttctacaggcatttacagctacaaatttcccttttggcac agcttaagctacaacccggccagagtggggtcaagtggggaccatggaggaggacatgag atcctaacaggcgacctcaaactgctgtacaagcccagcaaccctgaccgctccccaacc cacaaccagcaaaggagctcagatgtttggagattgccaatagaagtcaatgtggcagag cttgctctgtttcctgggccgtgggctctgggaactggcatctcagagcctcatggcccg agaagctgtgtagctacccctgttcctattggtgaagccacccagcacccttag >gi568815596f:85654190_85864183|GENSCAN_predicted_peptide_3|84_aa MPQTQKNCVTLMTTTDQESEDREKQMNPSSLAVFTCAHPTGIVDTEHSGILIQAVDNFME FLKAFVTDTSFLELEMRERRDGAA >gi568815596f:85654190_85864183|GENSCAN_predicted_CDS_3|255_bp atgccacaaacccagaaaaactgtgtcacactaatgaccaccacagaccaggagagtgaa gacagagagaagcaaatgaatccaagcagcctggcggtcttcacttgtgctcaccccaca ggcattgtggacacagaacattcagggatcctcattcaagcagttgacaatttcatggag ttcctcaaggcctttgtaactgacaccagttttctagaactggaaatgagagaaaggaga gatggggctgcatga >gi568815596f:85654190_85864183|GENSCAN_predicted_peptide_4|725_aa MKHIPVLEDGPWKTVCVKELNGLKKLKRKGKEPARRANGYKTFRLDLEAPEPRAVATNGL RDRTHRLQPVPVPVPVPVPVAPAVPPRGGTDTAGERGGSRAPEVSDARKRCFALGAVGPG LPTPPPPPPPAPQSQAPGGPEAQPFREPGLRPRILLCAPPARPAPSAPPAPPAPPESTVR PAPPTRPGESSYSSISHVIYNNHQDSSASPRKRPGEATAASSEIKALQQTRRLLANARER 
TRVHTISAAFEALRKQVPCYSYGQKLSKLAILRIACNYILSLARLADLDYSADHSNLSFS ECVQRCTRTLQAEGRAKKRKLLSTLYVPDTVQHFSSNPWSCTLSIGCSLQPIQKSSDVKV EGVIPRNMHNMWSYNNANRHCSRPLPPPSTSTMCSELFSPEASPLPEARTLPSTVACAQE APSKCLVPSLAQSNSSYCGHGPFELLASGRAVKPSAMTGQHRRMEADGKPQQLPAASPGP ARPSQCPTLGRPPLSEPALSRLSPLYPGPFCLSCFPQCVDSSGSNICWALSGPNRPPISS CVDSHYNEDNVSQLPMGLCMAVGLSSSQRVQAEEDVGQTLSSNTFNVINIHKYLLDPETH SSRYWLVVGVVVSQPLEKIPQPPPTFVPSREHGHLICSPERCSCRMLDPQSLGLGVRQAW VGTRLPTWQHYDCTVGKGLYFLICKMGVIMPKPPVPTPKRHDSVMEITWPLESRDLGVGP DTAVP >gi568815596f:85654190_85864183|GENSCAN_predicted_CDS_4|2178_bp atgaagcacatcccggtcctcgaggacgggccgtggaagaccgtgtgcgtgaaggagctg aacggccttaagaagctcaagcggaaaggcaaggagccggcgcggcgcgcgaacggctat aaaactttccgactggacttggaagcgcccgagccccgcgccgtagccaccaacgggctg cgggacaggacccatcggctgcagccggtcccggtaccggtgccggtgccagtcccagtg gcgccggccgttcccccaagagggggcacggacacagccggggagcgcgggggctctcgg gcgcccgaggtctccgacgcgcggaaacgctgcttcgccctaggcgcagtggggccagga ctccccacgccgccgccgccgccgcctcctgcgccccagagccaggcacctgggggccca gaggcacagcctttccgggagccgggtctgcgtcctcgcatcttgctgtgcgcaccgccc gcgcgccccgcgccgtcagcacccccagcaccgccagcgcccccggagtccactgtgcgc cctgcgcccccgacgcgccccggggaaagttcctactcgtcaatttcacacgtaatttac aataaccaccaggattcctccgcgtcgcctaggaaacgaccgggcgaagcgactgccgcc tcctccgagatcaaagccctgcagcagacccggaggctcctggcgaacgccagggagcgg acgcgggtgcacaccatcagcgcagccttcgaggcgctcaggaagcaggtgccgtgctac tcatatgggcagaagctgtccaaactggccatcctgaggatcgcctgtaactacatcctg tccctggcgcggctggctgaccttgactacagtgccgaccacagcaacctcagcttctcc gagtgtgtgcagcgctgcacccgcaccctgcaggccgagggacgtgccaagaagcgcaag ctattgagtaccctctatgtacctgacacagtgcagcacttcagctcaaacccctggtcc tgcacccttagcatagggtgcagcctgcagcccatccagaaatcctcggatgtcaaagta gagggagtcataccaagaaacatgcataacatgtggtcatataataatgctaatcggcac tgcagcagaccactgccaccaccaagcacatcaaccatgtgttctgagctcttcagtcct gaggccagcccgctgcctgaagctcgcacactgcccagcacagtggcctgtgcccaggag gcccccagcaagtgcctggtgccgtcccttgcccagagcaacagctcttattgcggccat gggccttttgagctgctggcatctggaagagcagtgaagccaagtgcgatgaccggacag 
catcgcaggatggaagcagatgggaagccgcagcagctgcctgcagcttcgcctgggcca gctcgaccctcacaatgccccacgctggggaggccacccctctcagagccagccctctct cggctctccccgctttatcctggcccattctgcctcagctgctttcctcagtgcgtggat tccagtggcagcaacatctgctgggcactgtctggccctaatagaccaccaatttctagt tgtgtggacagccactacaatgaagacaatgtatctcagcttcccatggggctctgtatg gctgtgggactgagttctagtcaacgggtgcaagcagaagaggatgtggggcagaccctg tcatcaaacacatttaatgtaattaatattcacaaatatctgctggaccctgaaacacac tcctcccgttactggctggttgtgggtgtcgtggtgtcacagcccctggagaaaatcccc caacccccaccaacctttgtcccctccagagagcatgggcaccttatctgcagcccagaa agatgcagctgtcggatgctggatcctcagagcctgggcttgggagttaggcaggcctgg gtgggaacccggctccccacttggcagcactatgactgtactgtgggcaagggcctctat ttcctcatctgtaaaatgggcgtgatcatgcccaaacccccagtgcccacacccaagagg cacgacagtgtcatggaaataacatggcccttggaatccagagatctgggtgtgggtcct gatactgcagttccatga >gi568815596f:85654190_85864183|GENSCAN_predicted_peptide_5|141_aa MKLPAQCHQLPKGDTLVLLEMVHLLLREPFSNLGKLEFKHEDSEISQHSHLLNHFGAQQK TWADPGCLDLNPKVRPQVLLTVGRGHLAASLPSSNGSSGGSNSSSSWGVSLQLQSEQQQL TRLEDISTVFPGSCDVLCGGS >gi568815596f:85654190_85864183|GENSCAN_predicted_CDS_5|426_bp atgaaactgccagcccagtgtcaccagttacccaagggtgacaccctcgtcctgctggag atggtgcatctgctgcttcgagagcccttttctaacctgggaaagcttgagtttaaacat gaagatagtgaaataagtcagcactcccacctgctcaaccactttggtgctcaacagaag acgtgggcagatcctggctgcctagatttgaaccccaaagtacggccacaggttctgcta acagtggggaggggacatctggcagcatctctcccaagcagcaacggtagcagtggtggc agcaacagcagcagcagctggggagtgtccctccaacttcagagtgagcagcagcagctg accagattggaggacatttccactgtcttccctggcagctgtgatgtgttgtgcggtggc agctga >gi568815596f:85654190_85864183|GENSCAN_predicted_peptide_6|135_aa MTFALLATLSWIPDPRRQKGLCFECYDLTPGPAYGEEEGDGKTFEWAGAKAALGLHKQLQ EGKVEAGTSRTASLVGSRVFQQKADQQRRVVTFSNQPRPHAAASQTQYQQQSYGARHPGS GSSVGHTKQRSREPE >gi568815596f:85654190_85864183|GENSCAN_predicted_CDS_6|408_bp atgacttttgctctcctggccactctgagctggatcccagaccccaggagacagaaggga ctctgctttgaatgctatgacctgactcctggccctgcatatggtgaggaggagggagat gggaagacgttcgaatgggctggggccaaggctgcactggggctgcacaagcagttgcag 
gaagggaaagtggaggcagggacctccaggacagcttccctggttggttctcgagtcttt cagcagaaggcagaccaacagagaagggttgtgaccttctccaaccagcccaggcctcac gccgctgcatcgcagacccagtatcagcagcagagctacggagcacgtcatcctgggagt ggatcctccgtgggtcacaccaagcagcgcagcagggaaccagaatga >gi568815596f:85654190_85864183|GENSCAN_predicted_peptide_7|391_aa XMPSEYTYVKLRSDCSRPSLQWYTRAQSKMRRPSLLLKDILKCTLLVFGVWILYILKLNY TTEECDMKKMHYVDPDHVKRAQKYAQQVLQKECRPKFAKTSMALLFEHRYSVDLLPFVQK APKDSEAESKYDPPFGFRKFSSKVQTLLELLPEHDLPEHLKAKTCRRCVVIGSGGILHGL ELGHTLNQFDVVIRLNSAPVEGYSEHVGNKTTIRMTYPEGAPLSDLEYYSNDLFVAVLFK SVDFNWLQAMVKKETLPFWVRLFFWKQVAEKIPLQPKHFRILNPVIIKETAFDILQYSEP QSRFWGRDKNVPTIGVIAVVLATHLCDEVSLAGFGYDLNQPRTPLHYFDSQCMAAMNFQT MHNVTTETKFLLKLVKEGVVKDLSGGIDREF >gi568815596f:85654190_85864183|GENSCAN_predicted_CDS_7|1176_bp ncaatgccaagtgagtacacctatgtgaaactgagaagtgattgctcgaggccttccctg caatggtacacccgagctcaaagcaagatgagaaggcccagcttgttattaaaagacatc ctcaaatgtacattgcttgtgtttggagtgtggatcctttatatcctcaagttaaattat actactgaagaatgtgacatgaaaaaaatgcattatgtggaccctgaccatgtaaagaga gctcagaaatatgctcagcaagtcttgcagaaggaatgtcgtcccaagtttgccaagaca tcaatggcgctgttatttgagcacaggtatagcgtggacttactcccttttgtgcagaag gcccccaaagacagtgaagctgagtccaagtacgatcctccttttgggttccggaagttc tccagtaaagtccagaccctcttggaactcttgccagagcacgacctccctgaacacttg aaagccaagacctgtcggcgctgtgtggttattggaagcggaggaatactgcacggatta gaactgggccacaccctgaaccagttcgatgttgtgataaggttaaacagtgcaccagtt gagggatattcagaacatgttggaaataaaactactataaggatgacttatccagagggc gcaccactgtctgaccttgaatattattccaatgacttatttgttgctgttttatttaag agtgttgatttcaactggcttcaagcaatggtaaaaaaggaaaccctgccattctgggta cgactcttcttttggaagcaggtggcagaaaaaatcccactgcagccaaaacatttcagg attttgaatccagttatcatcaaagagactgcctttgacatccttcagtactcagagcct cagtcaaggttctggggccgagataagaacgtccccacaatcggtgtcattgccgttgtc ttagccacacatctgtgcgatgaagtcagtttggcgggttttggatatgacctcaatcaa cccagaacacctttgcactacttcgacagtcaatgcatggctgctatgaactttcagacc atgcataatgtgacaacggaaaccaagttcctcttaaagctggtcaaagagggagtggtg aaagatctcagtggaggcattgatcgtgaattttga