GENSCAN 1.0 Date run: 4-Nov-116 Time: 11:10:29 Sequence gi568815577r:32476788_32684304 : 207517 bp : 43.35% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 9683 9722 40 0 1 92 53 63 0.050 0.48 1.02 Intr + 16926 16975 50 0 2 72 102 23 0.663 0.42 1.03 Intr + 18240 18383 144 1 0 91 83 81 0.664 8.15 1.04 Intr + 22497 22720 224 1 2 79 57 108 0.067 4.65 1.05 Intr + 28526 28580 55 1 1 97 65 38 0.104 0.95 1.06 Term + 38027 38403 377 0 2 137 48 464 0.668 42.20 1.07 PlyA + 38579 38584 6 1.05 2.00 Prom + 38790 38829 40 -3.36 2.01 Init + 40096 40119 24 0 0 33 110 47 0.207 1.23 2.02 Term + 42459 42560 102 2 0 66 49 119 0.204 4.08 2.03 PlyA + 43528 43533 6 1.05 3.00 Prom + 57916 57955 40 -3.86 3.01 Init + 60386 60493 108 1 0 84 66 60 0.584 3.62 3.02 Intr + 69145 69258 114 0 0 96 113 -12 0.571 2.74 3.03 Intr + 70936 70989 54 0 0 69 72 87 0.754 4.38 3.04 Term + 72284 72289 6 1 0 119 50 0 0.426 -2.83 3.05 PlyA + 72997 73002 6 1.05 4.35 PlyA - 74130 74125 6 1.05 4.34 Term - 79823 79557 267 2 0 97 40 110 0.506 2.59 4.33 Intr - 91292 91206 87 2 0 23 60 107 0.257 1.57 4.32 Intr - 92938 92901 38 2 2 63 83 53 0.101 0.28 4.31 Intr - 96581 96497 85 2 1 80 109 77 0.984 8.39 4.30 Intr - 97032 96844 189 0 0 67 72 137 0.916 9.78 4.29 Intr - 97889 97789 101 0 2 80 96 33 0.728 3.13 4.28 Intr - 99536 99425 112 2 1 58 93 62 0.462 3.65 4.27 Intr - 102044 101907 138 2 0 70 55 83 0.569 3.86 4.26 Intr - 107772 107374 399 0 0 -13 81 248 0.010 8.70 4.25 Intr - 115631 115491 141 2 0 90 47 104 0.116 7.05 4.24 Intr - 125580 125485 96 0 0 110 64 95 0.760 9.51 4.23 Intr - 126505 126374 132 1 0 17 108 199 0.771 15.64 4.22 Intr - 127496 127338 159 2 0 65 98 165 0.979 15.38 4.21 Intr - 130929 130862 68 1 2 116 58 53 0.982 3.72 4.20 Intr - 133218 133051 168 1 0 73 115 102 0.823 11.32 4.19 Intr - 135592 135318 275 0 2 28 80 347 0.687 25.08 4.18 Intr - 137156 137046 111 1 0 42 61 87 0.319 0.89 4.17 Intr - 141504 141414 91 1 1 111 76 80 0.644 8.15 4.16 Intr - 154594 154357 238 1 1 101 82 134 0.128 11.29 4.15 Intr - 155014 154823 192 1 0 139 84 -2 0.482 4.19 4.14 Intr - 162259 162121 139 0 1 55 108 14 0.674 0.57 4.13 Intr - 165179 165109 71 2 2 90 94 91 0.755 7.88 4.12 Intr - 169002 168911 92 1 2 92 -25 124 0.890 1.21 4.11 Intr - 173559 173397 163 0 1 65 76 76 0.646 3.65 4.10 Intr - 179071 178960 112 0 1 96 66 -45 0.240 -5.52 4.09 Intr - 180115 179900 216 0 0 65 116 145 0.771 12.62 4.08 Intr - 180333 180216 118 1 1 81 14 78 0.505 -0.78 4.07 Intr - 181085 180929 157 2 1 79 47 57 0.439 0.28 4.06 Intr - 189348 189156 193 2 1 44 103 109 0.677 7.49 4.05 Intr - 189786 189646 141 2 0 76 82 55 0.603 3.17 4.04 Intr - 193585 193501 85 2 1 32 121 54 0.893 1.98 4.03 Intr - 196744 196553 192 2 0 49 58 117 0.861 4.26 4.02 Intr - 202023 201858 166 0 1 99 68 158 0.999 14.43 4.01 Intr - 204861 204709 153 0 0 76 71 120 0.947 9.37 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 183448 183928 481 0 1 98 116 246 0.914 23.92 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577r:32476788_32684304|GENSCAN_predicted_peptide_1|296_aa XFTRYYCFLYAPVSSIANPVKGCFHLGKKQDCLSYSALQVLSRRCYGKQRCKIIVNNHHF GSPCLPGVKKYLTVTYACGPRNECEVIKAESNLQQLSSALRSCQGPRISGVPQGQPRDIP QGPGNHQPHPATSPGFPGKSPLLLMGMKAPGPPPPSHFHVLLFLIIVNVKNSHPERAALL FVSSVCIGLALTLCALVIRESCAKDFRDLQLGREQLVPGSDKVEEDSEDEEEEEDPSESD FPGELSGFCRTSYPIYSSIEAAELAERIERREQIIQEIWMNSGLDTSLPRNMGQFY >gi568815577r:32476788_32684304|GENSCAN_predicted_CDS_1|891_bp ntttttaccaggtattactgcttcctctatgcccctgtgagctctattgcaaatcctgtg aaaggatgctttcatctgggtaaaaagcaagattgcttgtcttactcagctttgcaagtc ctatcccgaaggtgctatgggaagcagagatgcaaaatcatcgtcaacaatcaccatttt ggaagcccctgtttgccaggcgtgaaaaaatacctcactgtgacctacgcatgtgggccc agaaatgaatgcgaggtaattaaagcagaaagcaatttgcagcagctgagctcagctctc cggagctgccagggtcccaggatcagtggtgtgccccaaggacagccaagggacattcct caaggccctggaaatcatcagccacatccagcaacttcaccgggcttccctggaaagtct cccttgctgctgatgggcatgaaggcgccagggcctcctccaccttctcattttcatgtc ttgctgttcctgatcattgtcaatgttaagaactcccacccggagagagctgccctgctg ttcgtgtccagtgtctgcatcggcctggccctcacactgtgcgccctggtcatcagagag tcctgtgccaaggacttccgcgacttgcagctggggagggagcagctggtgccaggaagt gacaaggtcgaggaggacagcgaggatgaagaagaggaggaggacccctctgagtctgat ttcccaggggaactgtcggggttctgtaggacttcatatcctatatacagttccatagaa gctgcagagctcgcagaaaggattgagcgcagggagcaaatcattcaggaaatatggatg aacagtggtttggacacctcgctcccaagaaacatgggccagttctactga >gi568815577r:32476788_32684304|GENSCAN_predicted_peptide_2|41_aa MRKLKAARFEGSIFPVPMNTNAVQLVADDRLLGAADVHVTS >gi568815577r:32476788_32684304|GENSCAN_predicted_CDS_2|126_bp atgaggaagctgaaggccgccaggtttgaaggcagcatcttccctgtccccatgaacaca aatgctgtccagctggtggctgatgaccgtctgcttggggctgcagatgttcacgttacc agttga >gi568815577r:32476788_32684304|GENSCAN_predicted_peptide_3|93_aa MASEGLFNKTAFEQKPGGIKGASLKKGGKRAPGKGMPSFKFSPFLAFVYLLNLLTNIYLT NICYDPGTELSDTKSSEICLQNYVFIGPPRSEC >gi568815577r:32476788_32684304|GENSCAN_predicted_CDS_3|282_bp atggccagtgaaggtctctttaataagacagcatttgagcagaaaccaggaggaatcaaa ggagcaagccttaagaagggtggaaagagagctccaggaaaagggatgccttctttcaaa ttctcacctttcctggcatttgtttacttgctcaaccttctaacaaacatttatttaaca aacatctgctatgatccaggcactgaactaagtgacacaaagagctcagaaatatgcctt cagaactacgtcttcattggaccaccccggtctgagtgctga >gi568815577r:32476788_32684304|GENSCAN_predicted_peptide_4|1694_aa MLAKQLEALGLAEKPQLVTRFQEVFRSMWSVNGDSISKIYAGTGALEGKAKAGKLKDGAR SVTRTIQNNFFDSSKQEAIDVLLLGNTLNSDLADKARALLTTGSLRASSKVLKSMCENFY KYSKPKKIRVCVGTWNVNGGKQFRSIAFKNQTLTDWLLDAPKLAGIQEFQDKRSKPTDIF AIGFEEMVELNAGNIVSASTTNQKLWAVELQKTISRDNKYVLLASEQLVGVCLFVFIRPQ HAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFAAGQSQVKERNEDFI EIARKLSFPMVFRGFLEGKVTFAPTYKYDLFSDDYDTSEKCRTPAWTDRVLWRRRKWPFD RSAEDLDLLNASFQDESKILYTWTPGTLLHYGRAELKTSDHRPVVALIDIDIFEVEAEER QNIYKEVIAVQGPPDGTVLVSIKSSLPENNFFDDALIDELLQQFASFGEVILIRTRCLFP CFKNVLFQLYSSFERLNDVHVCMCLDFIGKELLNRTITIALKSPDWIKNLEEEMSLEKIS IALPSSTSSTLLGEDAEVAADFDMEGSPIDAQPATPLPQKDPAQPLEPKRPPPPRPDAVS LHLKQDLQAQDLLDTVQPDRLQEPLVPVAAPMPQSGPQPNLETPPQPPPRSRSSHSLPSE ASSQPQVKTNGISDGKRESPLKIDPFEDLSFNLLAVSKAQLSVQTSPVPTPDPKRLIQLP SATQSNVLSSPLGHNKSRASSSLDGFKDSFDLQGQSTLKISNPKGWVTFEEEEDFGVKGK SKSACSDLLGNQPSSFSGSNLTLNDDWNKGRDHGEQVHCPPCVVHDGGSDSVLKAGKKTG EEKNDITPNSEGNVYPRCDIFPSIRGGEDHITSNIAGERGIPDCERLLVRRSCWRSGDPR PAGPAGHAAGAFSTPQYLGGTAMVLLHVKRGDESQFLLQAPGSTELEELTVQVARVYNGR LKVQRLCSEMEELAEHGIFLPPNMQGLTDDQIEELKLKDEWGEKCVPSGGAVFKKDDIGR RNGQAPNEKMKQVLKKTIEEAKAIISKKQVEAGVCVTMEMVKDALDQLRGAVMIVYPMGL PPYDPIRMEFENKEDLSGTQAGLNVIKEAEAQLWWAAKELRRTKKLSDYVGKNEKTKIIA KIQQRGQGAPAREPIISSEEQKQLMLYYHRRQEELKIFATFQVVVLCPPELGPRSLGPLD QNPNQLRNRLLADEPGGSGARDAADLLGSKAFSTEDGGEPGVDSSVPLSVVSFHLVSFGP GNVHMTQGEKERLRVLLTLSLGLGPSAHTDGVGGLHSSVPIALPFPTRMLAGQLEARDPK EGTHPEDPCPGAGAVMEKTAVAAEVLTEDCNTGEMPALEPAFGKISPLSADEETIPKYAG HKNQSATLLGQRSSSNNSAPPKAMHPSSRSLQNLSGRKSPVQASQAAMLQEQMAAAGGAD GSSSVLESSEGGFLSHVQPDEFTASSPNIAELQVLNVAVVTVATLTPEQTYKWEITTLIK HKTWSKFLVLTVRGSAVPTDLEAEAAKCQRKVHFSQVYYYANAQTTHTTYPDGLEVVQFL NKQTARQSLDLVCRGIVIFVKKMLNVDVEKMQTLRLKREEAGNSVRLYDDYKCSICYISL EYTWLMQAFSLKEPVISNLNKAYGQGTMTRDPSIFCALAMGEGKKRLTRHLRMSKRDKLA QAENDEENEHQGSS >gi568815577r:32476788_32684304|GENSCAN_predicted_CDS_4|5085_bp atgctagctaaacagttggaagctcttggtttagctgaaaagcctcagttggtgactcgc tttcaagaagtttttcggtcaatgtggtccgtgaatggtgattcaatcagtaagatatat gcaggaactggagctcttgaagggaaagcgaaggctggaaagttaaaagatggtgctcgc tctgttacccgaacaattcagaataacttctttgacagctccaagcaagaggccattgat gttttgctactgggaaatactctgaatagtgatttagctgacaaagctcgagcactttta actactggaagtttgcgtgcatcttctaaagtactaaagagcatgtgtgagaatttctac aaatattcaaagcctaagaaaattcgagtatgtgtcggaacctggaatgtgaatggtggg aagcaatttcgcagcatagcttttaagaatcagacactcactgactggcttcttgatgca cccaagttagctggcatccaggagtttcaagataaaagaagtaagccaactgatatattt gcaattggttttgaagaaatggtagaattgaatgctggaaacattgtgagtgcaagcaca acaaatcagaagctctgggctgtagaacttcagaagacaatctccagagacaacaagtat gtgctgctggcttctgaacagttggtgggcgtctgtttgtttgtttttatcagaccacag catgctccttttatcagggatgttgcagttgatactgtgaagactggaatgggaggtgca actggaaataagggagcagttgcaatccgaatgctcttccatacaaccagcctttgcttc gtctgtagccactttgctgcagggcagtcacaagtcaaagaaagaaatgaagattttata gaaatagcacgaaaattgagttttcctatggtttttagaggatttttagaaggaaaggta acctttgctccgacatataagtatgacttgttttctgacgactatgacaccagtgaaaag tgccgcacccctgcctggacagaccgtgtcctttggagaaggaggaaatggccttttgat agatcagctgaagatctagatcttctaaatgctagttttcaagatgaaagcaaaattctg tacacgtggactccaggcactttgctgcactatggaagagctgagctgaagacttctgac cacaggcctgtcgttgccctgattgatatagatatatttgaagttgaagctgaagagagg caaaacatttataaagaagtaattgcagttcagggtccaccagatggtacagtattggtc tcaatcaaaagttctttaccagaaaataatttttttgatgatgccttgattgatgagctt ctgcagcagtttgcaagttttggtgaagttatacttataagaaccagatgcctgtttcct tgttttaaaaatgttctcttccaactatacagcagttttgagagactgaatgatgtgcat gtgtgcatgtgtttggatttcatagggaaagagttattgaatcggactataactattgct ttaaaaagtccagactggatcaaaaatttggaagaagaaatgagtttagagaaaattagc attgcattgccatcatcaacaagctctaccctgcttggtgaagatgcagaggttgcagca gattttgatatggaaggttctcctattgacgcgcagccagcaacgccgctgccgcagaaa gaccccgcccagcccttggagcccaagcggccgccgccgccccgcccggacgcagtcagc cttcacctcaagcaggacttgcaggcccaggacctgctggatacagtacagccagaccgg ttgcaagagcctcttgtccctgtggcagcacctatgcctcagtctggcccccagccaaat ttggaaaccccaccacaaccaccacctcgaagcaggtcatcccatagcttgccttcagaa gcttcctcacaaccgcaagtaaaaacaaatggaatctctgatggcaaaagagaatcacca ttaaagattgacccatttgaagatctgtcatttaatctgcttgctgtatcaaaggctcag ctatctgttcaaacgtcacctgttcccaccccagacccaaagaggttgattcagttgcct tctgcaacgcaaagtaatgttttgagttctcctcttggtcataacaaaagcagggcttca tcttcacttgatggctttaaggacagttttgatctacagggccagtctacattaaaaatt agcaacccgaaaggatgggtaaccttcgaggaagaagaggattttggtgtgaaagggaag tcaaagtcagcttgttcagacttactgggtaatcagccaagttcattttctggctccaac ctgacattgaatgatgactggaataaaggccgtgaccacggagaacaggtccactgtcct ccctgcgtggtgcatgatggaggctcagactccgtcctcaaggctggcaagaagacagga gaggaaaagaatgatattactcccaacagcgaaggaaatgtatacccgcgctgtgatatt tttcccagtatccgggggggagaggatcatattacttccaatatcgcaggggagcgtggg attccggactgtgagcggctgttagtgcgtcgcagctgctggcgatccggcgaccctcgg ccggcaggacccgcgggccacgcagccggggccttctcaacgcctcagtacctcggcggg accgccatggttctgctgcacgtgaagcggggcgacgagagccagttcctgctgcaggcg cctgggagtaccgagctggaggagctcacggtgcaggtggcccgggtctataatgggcgg ctcaaggtgcagcgcctctgctcagaaatggaagaattagccgaacatggcatatttctc cctcctaatatgcaaggactgaccgatgatcagattgaagaattgaaattgaaggatgaa tggggtgaaaaatgcgtacccagcggaggtgcagtgtttaaaaaggatgatattggacga aggaatgggcaagctccaaatgagaagatgaagcaagtgttaaagaagactatagaagaa gccaaagcaataatatctaagaaacaagtggaagccggtgtctgtgttaccatggagatg gtgaaagatgccttggaccagcttcgaggcgcggtgatgattgtttaccccatggggttg ccaccgtatgatcccatccgcatggagtttgaaaataaggaagacttgtcgggaacacag gcagggctcaacgtcattaaagaggcagaggcgcagctgtggtgggcagccaaggagctg agaagaacgaagaagctttcagactacgtggggaagaatgaaaaaaccaaaattatcgcc aagattcagcaaaggggacagggagctccagcccgagagcctattattagcagtgaggag cagaagcagctgatgctgtactatcacagaagacaagaggagctcaagatttttgccact ttccaggtggtcgttctttgtcctcctgagcttgggcctcgctccttgggacccctggac cagaaccccaatcagctcaggaacaggcttctggcagatgagcccggcggcagtggagcg agagatgcagctgatcttctaggatctaaagcattttccacggaggatggtggagagcca ggtgtggacagctcagtccctctgtctgtggtgtcctttcacctggtcagcttcgggcct gggaatgtgcacatgacccagggggagaaggagcggctgcgggtcctgctcacccttagt ttggggctggggccttcagcccacactgacggggtaggagggctgcattcaagtgtcccc atagcccttcccttcccaaccaggatgctggcaggtcaactcgaggccagggaccccaaa gagggcacccacccagaggacccgtgcccaggagctggggctgtcatggagaagacagct gtggcagccgaggttctcacggaggactgcaacactggggagatgccagcattggaacct gcttttggaaaaatttcacctctgtcagctgatgaagagacaatacccaaatacgctggc cacaagaatcagagtgccactctcctgggacaaagatcgtcatctaacaattcagctcct ccaaaggctatgcatccatcttcccgaagtctgcaaaacttgagtggcagaaaatcacca gtgcaggcttcccaggccgccatgctgcaggagcagatggcagcagccggaggagctgac ggaagctcctcagtcttagagagttctgaaggtggatttctcagccacgttcaacctgat gagttcactgcttcttccccaaacattgcagagctgcaggtgttgaatgtggcggttgtc actgtggccaccctgaccccagagcaaacatataagtgggagataaccactttaataaaa cacaaaacatggtccaaattcctggtgctgactgtccggggctcagccgtgcccaccgac ttagaggcagaggctgcaaaatgccagcgcaaggtgcatttctcccaggtttactattat gcaaatgctcagacgacacacacgacctaccctgatggtttggaagttgtgcagtttctt aacaagcagacagcgcggcagtccctggacctggtgtgcagaggaatcgtgatctttgta aagaagatgctgaatgtggatgtagaaaagatgcagaccctgaggctgaagagggaggaa gctggaaactccgtgcggctttatgatgattacaaatgtagcatttgctacatatcttta gaatatacttggttaatgcaggcattctctttgaaagaaccagttatttctaatctcaac aaagcctatggtcagggaacaatgacccgagatccaagtattttttgtgcactggctatg ggagagggtaaaaagaggttaacaaggcaccttcgtatgtccaagagagacaagctggcc caggctgaaaatgatgaggaaaatgaacaccaagggtcatcataa