GENSCAN 1.0 Date run: 6-Nov-116 Time: 05:40:45 Sequence gi568815592r:53028209_53245632 : 217424 bp : 42.09% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3360 3415 56 2 2 42 99 94 0.880 6.71 1.02 Term + 4703 4802 100 2 1 70 33 130 0.873 2.32 1.03 PlyA + 6534 6539 6 1.05 2.00 Prom + 7168 7207 40 -8.75 2.01 Init + 7363 7472 110 0 2 91 89 76 0.417 7.74 2.02 Intr + 7524 7649 126 0 0 -9 91 130 0.385 2.47 2.03 Intr + 13044 13142 99 0 0 53 61 87 0.196 0.81 2.04 Intr + 19972 20126 155 1 2 50 44 133 0.239 3.89 2.05 Intr + 21926 22086 161 0 2 59 105 27 0.105 0.29 2.06 Intr + 26073 26205 133 2 1 56 82 80 0.122 3.50 2.07 Term + 33814 33947 134 2 2 1 34 158 0.035 -1.03 2.08 PlyA + 34054 34059 6 1.05 3.03 PlyA - 34606 34601 6 1.05 3.02 Term - 37471 37136 336 1 0 92 40 338 0.656 22.99 3.01 Init - 40002 39991 12 0 0 77 121 23 0.422 3.64 3.00 Prom - 40275 40236 40 -6.15 4.00 Prom + 40341 40380 40 -6.45 4.01 Init + 45285 45430 146 2 2 27 -24 186 0.296 0.94 4.02 Intr + 46033 46162 130 1 1 108 80 -33 0.478 -2.32 4.03 Intr + 48278 48335 58 1 1 71 93 87 0.880 5.04 4.04 Intr + 50591 50690 100 0 1 87 37 131 0.993 6.15 4.05 Intr + 52760 52890 131 2 2 102 64 132 0.994 11.62 4.06 Intr + 54296 54410 115 0 1 67 79 107 0.892 6.29 4.07 Intr + 64221 64339 119 0 2 90 115 36 0.826 5.69 4.08 Intr + 64526 64616 91 0 1 70 87 47 0.989 0.93 4.09 Intr + 65258 65353 96 2 0 89 69 65 0.880 2.91 4.10 Intr + 67305 67456 152 0 2 109 58 94 0.028 7.39 4.11 Intr + 81964 82016 53 2 2 88 73 59 0.132 2.11 4.12 Term + 91618 91836 219 0 0 80 48 130 0.568 4.26 4.13 PlyA + 93498 93503 6 1.05 5.07 PlyA - 93558 93553 6 1.05 5.06 Term - 100588 99998 591 1 0 80 46 383 0.618 26.84 5.05 Intr - 102723 102595 129 0 0 61 51 88 0.752 2.37 5.04 Intr - 103911 103799 113 1 2 101 47 117 0.959 8.08 5.03 Intr - 106116 105864 253 0 1 61 75 215 0.969 13.58 5.02 Intr - 117554 117350 205 1 1 56 91 134 0.490 8.78 5.01 Init - 121992 121907 86 0 2 77 84 81 0.924 5.16 5.00 Prom - 123138 123099 40 -7.45 6.00 Prom + 132626 132665 40 -5.25 6.01 Init + 148154 148300 147 1 0 73 9 216 0.126 12.34 6.02 Intr + 154990 155171 182 0 2 61 67 106 0.009 3.54 6.03 Term + 172168 172366 199 1 1 68 48 160 0.093 5.89 6.04 PlyA + 172506 172511 6 -0.45 7.03 PlyA - 175296 175291 6 1.05 7.02 Term - 175510 175339 172 0 1 68 44 182 0.850 8.12 7.01 Init - 177583 177438 146 1 2 62 101 92 0.674 7.54 7.00 Prom - 178297 178258 40 -11.34 8.05 PlyA - 178734 178729 6 -0.45 8.04 Term - 179625 179342 284 1 2 -67 48 284 0.850 3.50 8.03 Intr - 180147 179977 171 1 0 76 97 64 0.533 5.09 8.02 Intr - 207615 207468 148 0 1 -86 72 241 0.538 3.99 8.01 Init - 209180 209079 102 2 0 68 94 154 0.749 14.29 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 69514 69622 109 1 1 109 43 96 0.881 4.20 S.002 Term - 168738 168541 198 0 0 18 33 219 0.826 5.82 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:53028209_53245632|GENSCAN_predicted_peptide_1|51_aa MCPRGSIRFTFEVQNYRRSTRQCTKAILGPPPPDDGIQAVEEARGESIKEA >gi568815592r:53028209_53245632|GENSCAN_predicted_CDS_1|156_bp atgtgcccaagaggctccatacggttcacctttgaagttcaaaattaccgtcgcagcacc aggcaatgtaccaaggctattttgggcccaccccctccagatgatggaattcaggctgtg gaagaagccagaggggaatcaataaaagaagcttag >gi568815592r:53028209_53245632|GENSCAN_predicted_peptide_2|305_aa MAVRCRISLKSTGSSWADMQMYRQAAMRRKEEGTVACWQDMRNDSDKRPSVTFLSLPLLS QQAQKKDLIKEEGMRNLVRPDTHLTAEVVQFCSGSSPPQSVTRFFDGSTSTRGQTSREIL KNIVRHLNESTRRVDEETPDAGSKGDLESHDNCSKCTKDIREKESVLQRKKQANIKNLIL DKTSQFQPEISEYMTLAYKDFTLCRPETTYWQHNISPPLPRGNDLQVYPLLQPRHSPRLL HLCLLTPLAVASPQRLFEGPKCDLRLRTNASLQIGSINHQDLEMEAVDGRSEHIKLKIIV VKTFF >gi568815592r:53028209_53245632|GENSCAN_predicted_CDS_2|918_bp atggccgtgaggtgcagaatatccctgaagtccacagggtcttcatgggcagacatgcaa atgtacaggcaggcagcaatgaggagaaaggaagaaggaacagtggcgtgctggcaggac atgagaaacgacagtgataagaggccttcagtgaccttcctttcccttccccttctgagt cagcaggcacaaaagaaggacctcatcaaggaggaagggatgaggaatcttgtaaggccg gacactcatctcacggccgaagtggttcagttctgcagtggtagcagtcctccccagagt gtaaccagatttttcgatggcagcaccagcacaagaggccaaacttcaagagaaatatta aagaacatagtgaggcatctaaatgaaagtacaagaagggtagatgaagaaacaccggat gctggaagcaaaggagacttggaaagccatgataattgttccaagtgtacaaaggatatc agagagaaggagtcagtgcttcaaagaaagaagcaggcaaatatcaaaaatttgatactt gacaagacatcacaattccagcctgaaatctctgagtatatgacattagcatacaaagat ttcacactgtgtagaccagagaccacatactggcagcataatatttcacccccccttccc cgtggcaatgacctgcaggtgtaccctctgcttcagcctcgacactctccaagactgctt catctttgtcttctcactccacttgcagtagcaagtcctcaaagattgtttgaagggccc aaatgcgatctccgactccgaacgaacgcctcgttgcagatcggctcgatcaaccatcaa gacttagaaatggaggctgtcgatggacgttctgagcacattaaattaaagataatcgtt gtgaaaacatttttctga >gi568815592r:53028209_53245632|GENSCAN_predicted_peptide_3|115_aa MAFKRSLAEFSVGEAHQAAAITAGPAYGSHLRGAGGQIGARWGGGGCRGLGEWRDARCQP RRGIVRAPERQPYWRPRTPPRSCPLSCGNLRRRAEHAQCRCCPCGRPGHPSPRAA >gi568815592r:53028209_53245632|GENSCAN_predicted_CDS_3|348_bp atggccttcaagcggagcttagcagagttctcggtcggagaagcgcaccaggcagcggca ataactgcgggcccggcgtacggcagccatcttcgcggggcagggggccagatcggggcg cgctggggtggcggcggctgccggggactcggggagtggagggacgcccgttgccagccg aggcgggggattgtgcgagcaccggagcgtcagccctactggagaccccggacgccgccg cggagctgccccctcagctgcggaaacctgcgccgacgcgccgagcatgcgcagtgccgg tgctgcccgtgtgggcggcccggccacccatccccgcgggctgcctag >gi568815592r:53028209_53245632|GENSCAN_predicted_peptide_4|469_aa MFRAQWMFELAPGVSSSNLENRPCRAARGSLQKTSADTKGKQEQAKEEKGCHLFTIRSKF VFIISHQIIGITMMTYLLSPLNFYFEIITGPQARELFLKAVEEEQNGALYEAIKFYRRAM QLVPDIEFKITYTRSPDGDGVGNSYIEDNDDDSKMADLLSYFQQQLTFQESVLKLCQPEL ESSQIHISVLPMEVLMYIFRWVVSSDLDLRSLEQLSLVCRGFYICARDPEIWRLACLKVW GRSCIKLVPYTSWREMFLERPRVRFDGVYISKTTYIRQGEQSLDGFYRAWHQVEYYRYIR FFPDGHVMMLTTPEEPQSIVPRLRTRNTRNHLTINTDIFVVSLYKKQIRVFMWGYSYVPV VTRGSTNSSGYIILVTLLTRIWTQVMSSEFLEDGVTREVGRTHLATKTAFPPTNPAQMSK SSLAVTPPCGLHGRDRWTKAMTVKVPSGVKATYSLAAPQPIAFTPILLH >gi568815592r:53028209_53245632|GENSCAN_predicted_CDS_4|1410_bp atgttccgagctcagtggatgtttgaacttgctccaggtgtaagctctagcaatttagaa aatcgaccttgcagagcagcaagaggctctctccagaaaacatcggcagataccaaagga aaacaagaacaggcaaaagaagaaaaaggctgccatctttttaccatcagatccaagttt gtgtttataattagtcatcaaataattggcattaccatgatgacttaccttctatctcct cttaatttttattttgaaataattacaggtccacaggctcgagaactcttcctaaaagca gtagaagaagaacaaaatggagctctctatgaagccatcaagttttatcgtagggctatg caacttgtacctgatatagagttcaagattacttatacccggtctccagatggtgatggc gttggaaacagctacattgaagataatgatgatgacagcaaaatggcagatctcttgtcc tacttccagcagcaactcacatttcaggagtctgtgcttaaactgtgtcagcctgagctt gagagcagtcagattcacatatcagtgctgccaatggaggtcctgatgtacatcttccga tgggtggtgtctagtgacttggacctcagatcattggagcagttgtcgctggtgtgcaga ggattctacatctgtgccagagaccctgaaatatggcgtctggcctgcttgaaagtttgg ggcagaagctgtattaaacttgttccgtacacgtcctggagagagatgtttttagaacgg cctcgtgttcggtttgatggcgtgtatatcagtaaaaccacatatattcgtcaaggggaa cagtctcttgatggtttctatagagcctggcaccaagtggaatattacaggtacataaga ttctttcctgatggccatgtgatgatgttgacaacccctgaagagcctcagtccattgtt ccacgtttaagaactaggaataccagaaaccacttgactataaatacagatattttcgtc gtgtccctgtacaagaagcagatcagagttttcatgtggggctacagctatgttccagtg gtcaccagaggttcaacaaactcatctggatacatcattcttgtcacattacttacaagg atctggacacaggtcatgagctcagagtttttagaagatggtgtcactagggaggttgga aggacccatttggccacaaagacagcatttccacctaccaacccagcccagatgtctaag tccagcttagctgtgacccctccttgtggactccatggcagagacaggtggacaaaagca atgactgttaaggtgccatctggggtcaaggccacatacagtctggcagcacctcagcca attgccttcacccctatcctactccattga >gi568815592r:53028209_53245632|GENSCAN_predicted_peptide_5|458_aa MRLSKPFAFSPSHLCLLSLILESLDLPDRKQSPGPRCLSGPIQLYQEPLRTKFSSISEGS RIGENLGWPDLIMEPDDFDSEDKEILSWDINDVKLPQNVKKTDWFQEWPDSYAKHIYSSE DKNAQRHLSSWAMRNTNNHNSRILKKSCLGVVVCGRDCLAEEGRKIYLRPAICDKARQKQ QRKRCPNCDGPLKLIPCRGHGGFPVTNFWRHDGRFIFFQSKGEHDHPKPETKLEAEARRA MKKVNTAPSSVSLSLKGSTETRSYGLGGITDLTDQTSTVDPMKLYEKRKLSSSRTYSSGD LLPPSASGVYSDHGDLQAWSKNAALGRNHLADNCYSNYPFPLTSWPCSFSPSQNSSEPFY QQLPLEPPAAKTGCPPLWPNPAGNLYEEKVHVDFNSYVQSPAYHSPQEDPFLFTYASHPH QQYSLPSKSSKWDFEEEMTYLGLDHCNNDMLLNLCPLR >gi568815592r:53028209_53245632|GENSCAN_predicted_CDS_5|1377_bp atgaggctttccaagccttttgccttctcgccttctcatctctgcctcctaagcctcatc cttgaaagtctggaccttcctgacagaaaacaatctcctggtccaaggtgcttgagtggg ccgatccagctatatcaagaacctttgagaacaaaattctcaagcatttctgaggggagt cgaataggtgaaaaccttggctggcctgaccttatcatggaacctgacgactttgattct gaagacaaagagatattaagctgggatattaatgatgtgaaactgccacagaacgtgaaa aaaaccgactggttccaggagtggccagattcctatgccaaacacatctacagctcggag gacaagaatgcgcagcggcacctgagcagctgggccatgcgcaataccaacaaccacaac tcccgcatcctcaagaagtcctgcctgggtgtggtggtgtgcggccgcgactgtctcgca gaggaggggcgcaagatctacctgagacctgccatctgtgacaaggcccggcagaagcag cagcggaaacgctgtcccaactgtgacgggcctctgaagctcatcccttgccgaggtcat gggggcttcccggtcaccaacttctggaggcacgacggacgctttatatttttccagtca aagggagagcatgatcatccaaaaccagaaaccaagttagaagctgaggcaagaagagcc atgaagaaagtgaacacagcaccttcctccgtctcattgagcctgaaggggagcacagag accaggagttatggtctgggaggaatcacagatctgactgaccagacttccactgtggac cccatgaagctctatgaaaagcgcaaattgtccagtagcagaacctacagtagtggagac ctgcttcctccttctgcctccggagtctactctgatcatggcgatctacaagcgtggagt aaaaatgctgctttggggagaaatcatcttgctgacaactgttattccaattatcctttt cctctgaccagctggccttgcagcttctctccttcccaaaactcttcagaacccttttac cagcagcttccattggagccacctgcagccaaaactggctgtcccccattatggccaaat ccagcgggtaatctttatgaagagaaagtacatgtggattttaacagctacgtccagtct cctgcataccattcacctcaagaagacccctttctcttcacctacgcctctcatcctcat cagcaatattcactgccaagcaagagcagcaaatgggattttgaggaagaaatgacatac ttgggtttggatcactgcaacaatgatatgcttctgaacctgtgtcctttgagatga >gi568815592r:53028209_53245632|GENSCAN_predicted_peptide_6|175_aa MSGLEGGKKKPLKQPKKQAKEMDEEQEASKREKKGEAEETQELKVKAKAALRGVCRAQGS CVREDVLDSEPGEASVGVLTNSSSKRGGSGSFQPRIEASMASGNWVSRPSRGGKFQQRSH DRQNLTYLPSDALQRKFAIVVRRTGFWRQWSYLEPEFSSKKMSVITTPSTGCYAN >gi568815592r:53028209_53245632|GENSCAN_predicted_CDS_6|528_bp atgtccggcctcgaaggcgggaagaagaagcccctgaaacaaccaaagaagcaagccaag gagatggacgaggaacaagaggcctccaaacgggaaaaaaaaggagaagcagaagaaacc caggagctaaaagtgaaggccaaggctgccctgaggggcgtgtgcagagctcagggctcc tgtgtgagggaggacgttcttgactctgagccaggtgaagccagtgtgggggtgcttaca aattcgagcagtaagagaggaggttctgggagcttccagcccaggatagaagcctccatg gcttctggcaactgggtgtccagacccagcagaggtgggaagtttcagcaaagatctcat gaccgacaaaacctaacgtatttaccatccgacgctttacagagaaagtttgccatcgta gttaggaggacaggtttttggagacaatggagttacctggagcctgaattttcttctaag aaaatgagtgtaataacaacgccctcaacaggttgctatgcgaattaa >gi568815592r:53028209_53245632|GENSCAN_predicted_peptide_7|105_aa MGESQENDCPVTQQGAGAYRPFFIGEGEMGGAGENDFQGERMDPVLREWEHLLSARHCGE GQSKAMPENLESGDDVSDVKAQSACVASSKGSSNLYKSKKERRVD >gi568815592r:53028209_53245632|GENSCAN_predicted_CDS_7|318_bp atgggggaatcacaagagaatgattgcccagtgacccaacagggtgcaggtgcttatagg cccttcttcataggggaaggggagatgggaggtgcaggagaaaatgattttcagggggaa agaatggacccagtgctccgagaatgggaacacctgctgtctgccaggcactgtggggaa gggcagagtaaagccatgcctgagaacctggagtccggtgatgatgtaagtgatgtcaag gcccaaagtgcctgtgtggctagttcaaagggctcttcaaatctgtacaaatccaagaag gagagaagagttgattga >gi568815592r:53028209_53245632|GENSCAN_predicted_peptide_8|234_aa MATHWRGTSLGVSKETVQESLQEEVMSELNVEGQEKKAAKPKEKCEKGLAAYRAKGKPDA AKKGVVKAEKKQEKEEEAEENEEGITELQFGAHTQMGWSSVCPKDKEKVRSFIRKGNITY GFERKLVGTGEDFGSWQAPIGGCRGKGRISGGLRDGPRKSGTAREVDTGVSRRQSYSGPH RRWPCLPLRLLPEQLQCDQAVRSHEEEHDDCASAAGHAHGEVGLRKWPLGDADS >gi568815592r:53028209_53245632|GENSCAN_predicted_CDS_8|705_bp atggccacacactggcgaggcaccagtcttggtgtgagcaaggagacagtgcaggaaagc cttcaggaggaggtgatgtcggaactaaatgtggaaggacaggagaagaaggctgcaaag ccgaaggaaaaatgcgaaaagggtcttgctgcatatcgagctaaaggaaagcctgatgca gcaaaaaagggagttgtcaaggctgaaaaaaagcaagaaaaagaggaggaagctgaggaa aatgaagagggaatcacagaattgcaatttggtgcacacacacagatggggtggtcttca gtgtgtccgaaggacaaagagaaggttaggagttttattagaaagggaaatattacatat ggttttgaaagaaagctcgttggcactggagaagattttgggagctggcaagctccgatt ggagggtgccgtgggaagggacgcatcagtggaggtctcagagatggtccccggaaaagt ggaactgcaagagaagttgatacaggtgtcagcaggagacagtcatatagtggtcctcac agaagatggccatgtcttcctctgaggctgcttccagaacaattacagtgtgatcaggct gttagatcccatgaagaagagcatgatgattgtgcaagtgcagctggacatgcccacggt gaagttggcctcaggaaatggccacttggtgatgctgacagctga