GENSCAN 1.0 Date run: 6-Nov-116 Time: 13:49:41

Sequence gi568815581f:46772516_46976715 : 204200 bp : 50.37% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.11 Intr -   1394   1153  242  1  2  115   79   394 0.875  38.39
 1.10 Intr -   9980   9902   79  0  1  127   49    27 0.001   1.31
 1.09 Intr -  28746  28699   48  1  0   68   94    30 0.029   0.25
 1.08 Intr -  32588  32540   49  2  1   47  101    44 0.071  -0.15
 1.07 Intr -  43424  43370   55  1  1  103   51    74 0.298   4.18
 1.06 Intr -  46290  46003  288  2  0   74   83   111 0.235   5.56
 1.05 Intr -  47488  47406   83  1  2  106   37    49 0.134   0.04
 1.04 Intr -  47796  47649  148  2  1  110   47    74 0.199   5.74
 1.03 Intr -  53209  53047  163  2  1   70   67   106 0.609   5.73
 1.02 Intr -  59047  58933  115  1  1  103   92    78 0.988   9.72
 1.01 Init -  70351  70316   36  1  0   73   40    79 0.476   1.61
 1.00 Prom -  72235  72196   40                              -4.16

 2.00 Prom +  77180  77219   40                              -4.96
 2.01 Init +  79124  79200   77  1  2   90   60   285 0.999  24.46
 2.02 Intr +  79529  79684  156  2  0   78   35   123 0.139   5.13
 2.03 Intr +  91419  91655  237  0  0   50   63   149 0.344   5.33
 2.04 Intr + 100002 100258  257  0  2  113   59   209 0.977  17.59
 2.05 Intr + 102586 102851  266  2  2   66  105   353 0.994  32.03
 2.06 Intr + 103730 104033  304  1  1  140   91   280 0.226  29.66
 2.07 Intr + 104753 104960  208  0  1    6  110    90 0.090   1.14
 2.08 Intr + 112009 112103   95  1  2   84   81    58 0.232   4.31
 2.09 Intr + 113378 113448   71  0  2   85   59    57 0.243   1.40
 2.10 Intr + 131021 131159  139  1  1  111   80    77 0.614   9.24
 2.11 Intr + 132893 132989   97  0  1   50   91    34 0.444  -1.03
 2.12 Intr + 134778 134927  150  0  0   50   66    88 0.410   2.08
 2.13 Intr + 135804 135908  105  0  0   53   86    57 0.238   1.33
 2.14 Intr + 157005 157069   65  0  2  123   79    81 0.989   9.06
 2.15 Intr + 158584 158692  109  2  1   69   93    73 0.984   5.24
 2.16 Intr + 159552 159684  133  0  1   66   85    88 0.998   6.95
 2.17 Intr + 162514 162654  141  0  0   99   81    78 0.984   8.75
 2.18 Term + 166084 166245  162  0  0  112   52   283 0.995  25.04
 2.19 PlyA + 168261 168266    6                               1.05

 3.06 PlyA - 168632 168627    6                               1.05
 3.05 Term - 170006 169843  164  0  2   96   54   108 0.726   6.30
 3.04 Intr - 171172 171068  105  2  0   40   77    79 0.532   2.19
 3.03 Intr - 176999 176873  127  2  1   67   20    75 0.518  -1.15
 3.02 Intr - 177808 177717   92  2  2   86   60    70 0.606   3.71
 3.01 Init - 181597 181357  241  1  1   71   13   152 0.256   4.04
 3.00 Prom - 181651 181612   40                              -4.46

 4.02 PlyA - 181768 181763    6                               1.05
 4.01 Sngl - 182938 182288  651  1  0   37   43   205 0.906   7.08

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581f:46772516_46976715|GENSCAN_predicted_peptide_1|436_aa
MNVDIFQCDSEETLKKASDGPRTEKVTQDLAQPFWTTGRQLRFVLHLSLQRNIYVFGPLM
NDCPQQMASSLQAKTWAAFAHQCSAAKGQLHTELLHEDALKKWTSPAMGWERSRSHDKPR
RLSRPLVPPRPFPRAPCAGSSRVRRGLADQKGQQFPTQRSLLPTGSASFTPDRGCAESWC
LRPRALIGCSLTSSNPAAPRWAREGGGCGWRCASDKPESHFQSQVDFVPTIGGVAPPLHG
RGQTSSSAPLLMEPHLLGLLLGLLLGGTRVLAGYPIWWCWAQGCVAVNIFADIYLWWVRG
LTDFKNEATYLCAWPLPPCTRSSPNPNMEVSVEGGALLGPGTSWASVQYCPGASRSLALG
QQYTSLGSQPLLCGSIPGLVPKQLRFCRNYIEIMPSVAEGVKLGIQECQHQFRGRRWNCT
TIDDSLAIFGPVLDKX

>gi568815581f:46772516_46976715|GENSCAN_predicted_CDS_1|1308_bp
atgaatgtggacatcttccagtgtgattcagaagagaccctgaagaaagcctctgatggg
cctagaacagagaaggtgacccaggaccttgcccagcctttctggacaactggaagacag
ttacgcttcgtcctccacctctccctccaacgaaacatttacgtatttggtcctctgatg
aatgactgtccccagcagatggcaagttccttgcaggccaaaacctgggctgcttttgct
catcagtgctcagctgccaaggggcagctgcacacagagttgctccatgaagatgcgttg
aagaaatggaccagtccagcaatgggctgggaacgcagcaggagccatgacaagcccagg
cggctctcccgacccttggtgcccccgaggccatttccccgcgctccctgtgccggcagc
agccgcgtgcggagagggctcgccgaccagaaggggcagcagttccctacacagcggtcc
ctgctccccaccggcagtgcttccttcaccccagaccggggctgcgcagagtcctggtgc
ctcaggccgcgggcgctgattggctgctcgctgacatcctcaaacccggctgctccgcgc
tgggctcgggaggggggcggctgcgggtggaggtgcgcttctgacaagcccgaaagtcat
ttccaatctcaagtggactttgttccaactattgggggcgtcgctccccctcttcatggt
cgcgggcaaacttcctcctcggcgcctcttctaatggagccccacctgctcgggctgctc
ctcggcctcctgctcggtggcaccagggtcctcgctggctacccaatttggtggtgctgg
gcccagggctgtgtggctgtgaatatctttgcagacatctacctgtggtgggttcgtggt
ctcactgacttcaagaatgaagccacgtacctttgcgcctggcccctgccgccctgcacc
cgctcctctcccaaccccaacatggaagtttccgtggagggtggagcccttctgggccca
ggaacaagttgggcctctgtccagtactgcccaggagccagcaggtccctggccctgggc
cagcagtacacatctctgggctcacagcccctgctctgcggctccatcccaggcctggtc
cccaagcaactgcgcttctgccgcaattacatcgagatcatgcccagcgtggccgagggc
gtgaagctgggcatccaggagtgccagcaccagttccggggccgccgctggaactgcacc
accatagatgacagcctggccatctttgggcccgtcctcgacaaagnn

>gi568815581f:46772516_46976715|GENSCAN_predicted_peptide_2|923_aa
MRPPPALALAGLCLLALPAAAASYFGRQSRSLMPSRDPHAGALSAAILASGEPSLPRTPS
PPSAFALRSGDREGWVDGWAKGVGHPWGLQENKWNLQHGTSLPRACANNGTVLIIEGVTH
KAFPRGLALPGPKPSLSLPVRGKRVIGAQGAFKNLSSLTGREVLTPFPGLGTAAAPAQGG
AHLKQCDLLKLSRRQKQLCRREPGLAETLRDAAHLGLLECQFQFRHERWNCSLEGRMGLL
KRGFKETAFLYAVSSAALTHTLARACSAGRMERCTCDDSPGLESRQAWQWGVCGDNLKYS
TKFLSNFLGSKRGNKDLRARADAHNTHVGIKAVKSGLRTTCKCHGVSGSCAVRTCWKQLS
PFRETGQVLKLRYDSAVKVSSATNEALGRLELWAPARQGSLTKGLAPRSGDLVYMEDSPS
FCRPSKYSPGTADGEKAQVVEAIRSEKGELSVLEWTRVRSVPWSPVSLIWLFKYASAEAS
FLSRELGYAHSPSRAQNYHQCSLMTEAQGLVNEVAALPVLTTATVVVLRRLHTESLAAST
TRTKLCRSLNLELLQDEVGLPGTCILIDAVTLTGKQLEGKNHGFHLSGFQYPTPNLAYYR
CSTEFVVPFKDDAWNNPLPARVDSSKKTLFHPSAESRGFLWRKKKRKCHALDPPGKERLT
VAVSTRGTWGQLTLVLQTQVLGSVGRGVLSPYASGSWALPFQSCKMIAINAACLLEALKI
GQVHEIQSCMGRLETADKQSVHIVENEIQASIDQIFSRLERLEILSSKEPPNKRQNARLR
VDQLKYDVQHLQTALRNFQHRRHAREQQERQREELLSRTFTTNDSDTTIPMDESLQFNSS
LQKVHNGMDDLILDGHNILDGLRTQRLTLKGTQKKILDIANMLGLSNTVMRLIEKRAFQD
KYFMIGGMLLTCVVMFLVVQYLT

>gi568815581f:46772516_46976715|GENSCAN_predicted_CDS_2|2772_bp
atgcgccccccgcccgcgctggccctggccgggctctgcctgctggcgctgcccgccgcc
gccgcctcctacttcgggcgtcagtcccgctctctgatgccctctcgggatcctcacgcc
ggtgccctgtctgccgccatcctggcctccggcgagccgtccttgcctcggactccttcc
ccaccctccgccttcgccctgcggagcggagaccgagagggctgggtggatggctgggcg
aagggtgtgggccatccgtggggcctgcaggagaacaagtggaatctgcagcatgggaca
tctctgcctagagcctgtgcaaacaatggcactgtcctcatcattgagggggtcacgcac
aaggcattccccagaggcctggcccttccagggcccaagcccagcctgagcctgcctgtg
cgtgggaagagggtgatcggagcccagggtgcattcaagaacctgtcaagcctgaccggg
cgggaagtcctgacgcccttcccaggattgggcactgcggcagccccggcacagggcggg
gcccacctgaagcagtgtgacctgctgaagctgtcccggcggcagaagcagctctgccgg
agggagcccggcctggctgagaccctgagggatgctgcgcacctcggcctgcttgagtgc
cagtttcagttccggcatgagcgctggaactgtagcctggagggcaggatgggcctgctc
aagagaggcttcaaagagacagctttcctgtacgcggtgtcctctgccgccctcacccac
accctggcccgggcctgcagcgctgggcgcatggagcgctgcacctgtgatgactctccg
gggctggagagccggcaggcctggcagtggggcgtgtgcggtgacaacctcaagtacagc
accaagtttctgagcaacttcctggggtccaagagaggaaacaaggacctgcgggcacgg
gcagacgcccacaatacccacgtgggcatcaaggctgtgaagagtggcctcaggaccacg
tgtaagtgccatggcgtatcaggctcctgtgccgtgcgcacctgctggaagcagctctcc
ccgttccgtgagacgggccaggtgctgaaactgcgctatgactcggctgtcaaggtgtcc
agtgccaccaatgaggccttgggccgcctagagctgtgggcccctgccaggcagggcagc
ctcaccaaaggcctggccccaaggtctggggacctggtgtacatggaggactcacccagc
ttctgccggcccagcaagtactcacctggcacagcagatggggagaaagcacaggtggta
gaagccatccgttcagagaaaggagagctttctgtgcttgagtggacccgagtgagatct
gtgccctggagccctgtgtccttgatttggcttttcaaatatgcctccgctgaggcctca
ttcttgtctcgagagctgggttatgcacactcacccagccgtgctcaaaactaccaccag
tgcagcctgatgacagaggcccagggtttggtaaatgaagttgctgctctgccagtgctg
acaacagccacagtagttgttttgcgcaggcttcatacagaatccctggctgcctcaacc
acaagaacaaagctgtgcagaagcctgaacctggagcttctgcaagatgaggtagggctt
ccaggtacttgtatcctgattgatgctgtcactctcactggaaaacaattggaaggtaag
aaccatggcttccacctctctggattccagtaccctacaccaaatctggcttattatagg
tgctcaacagaatttgtagttccctttaaagatgatgcttggaataacccacttccagca
agagtggacagcagcaagaagactttgtttcatccttcagctgagtcgcggggcttcctt
tggaggaagaagaagcggaaatgtcatgctctggacccacctggtaaagagaggttgacc
gtggcagtgagtactcggggcacctggggccagctgacccttgtgcttcagacacaggtg
ttgggctcggtgggaagaggggtcctctcaccctatgcctcagggtcctgggcgcttccc
tttcaaagctgcaaaatgatcgccattaatgcagcttgtctcctggaggcgctgaaaata
gggcaggtccacgagatccagtcttgcatgggacgcctggagacggcagacaagcagtct
gtgcacatagtagaaaacgaaatccaagcaagcatagaccagatattcagccgtctagaa
cgtctggagattttgtccagcaaggagccccctaacaaaaggcaaaatgccagacttcgg
gttgaccagttaaagtatgatgtccagcacctgcagactgcgctcagaaacttccagcat
cggcgccatgcaagggagcagcaggagagacagcgagaagagcttctgtctcgaaccttc
accactaacgactctgacaccaccataccaatggacgaatcactgcagtttaactcctcc
ctccagaaagttcacaacggcatggatgacctcattttagatgggcacaatattttagat
ggactgaggacccagagactgaccttgaaggggactcagaagaagatccttgacattgcc
aacatgctgggcttgtccaacacagtgatgcggctcatcgagaagcgggctttccaggac
aagtactttatgataggtgggatgctgctgacctgtgtggtcatgttcctcgtggtgcag
tacctgacatga

>gi568815581f:46772516_46976715|GENSCAN_predicted_peptide_3|242_aa
MGKDFMSKTPKAMATKAKIDKWDLIKLKSFRTAKETTISVNRQPTEWEKIFAIYSSDKGL
ISRIYNELKQIYKKKTTPSTTSRLLLTYKRQRVHTLWGPVWREEDPRIQLGNVTHILPTE
RQLGACDHMACDHHGQRSMAEQRPYDFRGLVIETLLALHGGWAQVLSCSRLPDGACAYYV
PGMHKGPCDPSRAPHEKCGRLSGPMVPSWMRGQLLSDSCEAFTVTTAQPLVCIMRINDGE
HK

>gi568815581f:46772516_46976715|GENSCAN_predicted_CDS_3|729_bp
atgggcaaggacttcatgtctaaaacaccaaaagcgatggcaacaaaagccaaaattgac
aaatgggatctaatcaaactaaagagcttccgcacagcaaaagaaactaccatcagcgtg
aacaggcaacctacagaatgggagaaaatttttgcaatctactcatctgacaaagggcta
atatcaagaatctacaatgaactcaaacaaatttacaagaaaaaaacaaccccatcaaca
accagccgcctcctcctcacctacaagaggcagagagtccacactctctggggaccagta
tggagggaagaagacccaagaatccagctggggaatgtgacccacattcttcccactgag
aggcagctgggggcttgtgaccatatggcttgtgatcaccacggccaaaggagtatggca
gaacagaggccctatgacttccgaggcttggtcatagaaactttgctggccctgcatggg
ggatgggctcaagtgctctcctgcagccgcttacccgatggcgcctgcgcctactacgtg
cccgggatgcacaaaggcccctgcgaccccagcagagcaccccacgagaagtgtgggaga
ttaagtgggcccatggttccaagctggatgaggggccagctcctctcggactcctgtgag
gccttcacagtcaccacagctcagccccttgtctgcataatgaggataaatgatggagaa
cataagtga

>gi568815581f:46772516_46976715|GENSCAN_predicted_peptide_4|216_aa
MNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRTKDKNHM
IISIDAEKAFDKIQQPFMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLKT
GTRQGCPLSPLLFNIVLEVLARAIRQKEIKGIQLGKEEVKLSLFADDMIVYLENPTVSAQ
NLFKLISNFSKVSGYKINVQKSQAFLYTNNRQPNHE

>gi568815581f:46772516_46976715|GENSCAN_predicted_CDS_4|651_bp
atgaacattgatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatc
aaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaac
atacgaaaatcaataaacgtaatccagcatataaacagaaccaaagacaaaaaccacatg
attatctcaatagatgcagaaaaggcctttgacaaaattcaacaacccttcatgctaaaa
actctcaataaattaggtattgatgggacgtatctcaaaataataagagctatctatgac
aaacccacagccaatatcatactgaatggacaaaaactggaagcattccctttgaaaact
ggcacaagacagggatgcccgctctcaccactcctattcaacatagtgttggaagttctg
gccagggcaatcaggcagaaggaaataaagggtattcaattaggaaaagaggaagtcaaa
ttgtccctgtttgcagatgacatgattgtatatctagaaaaccccactgtctcagcccaa
aatctctttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtgcaa
aaatcacaagcattcttatacaccaataacagacagccaaatcatgagtga