GENSCAN 1.0 Date run: 6-Nov-116 Time: 13:07:12 Sequence gi568815578r:23776191_23979676 : 203486 bp : 43.65% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 7368 8048 681 2 0 76 40 1099 0.838 100.09 1.02 PlyA + 8092 8097 6 -0.45 2.02 PlyA - 8199 8194 6 1.05 2.01 Sngl - 12225 11647 579 0 0 74 32 217 0.841 10.88 2.00 Prom - 16139 16100 40 -5.66 3.00 Prom + 17640 17679 40 -3.26 3.01 Init + 23445 23551 107 2 2 98 -1 63 0.300 -3.79 3.02 Term + 24289 24751 463 1 1 41 37 677 0.991 52.43 3.03 PlyA + 25275 25280 6 1.05 4.00 Prom + 27122 27161 40 -4.06 4.01 Init + 36017 36129 113 1 2 92 73 132 0.990 11.78 4.02 Term + 37876 38443 568 1 1 51 47 185 0.547 4.59 4.03 PlyA + 39863 39868 6 1.05 5.04 PlyA - 42253 42248 6 1.05 5.03 Term - 47913 47830 84 0 0 107 50 23 0.504 -1.95 5.02 Intr - 49133 49020 114 2 0 62 102 146 0.987 14.04 5.01 Init - 50470 50243 228 1 0 102 94 557 0.999 54.07 5.00 Prom - 54497 54458 40 -5.86 6.00 Prom + 58363 58402 40 -7.86 6.01 Init + 60588 60971 384 2 0 70 41 201 0.034 10.44 6.02 Intr + 63688 63855 168 0 0 -1 94 119 0.038 3.74 6.03 Term + 67001 67099 99 1 0 114 48 26 0.520 -0.67 6.04 PlyA + 69411 69416 6 1.05 7.03 PlyA - 70736 70731 6 1.05 7.02 Term - 79202 79119 84 2 0 114 49 21 0.675 -1.55 7.01 Init - 80628 80527 102 0 0 70 87 89 0.588 7.24 7.00 Prom - 86051 86012 40 -2.96 8.04 PlyA - 86347 86342 6 1.05 8.03 Term - 100081 99998 84 1 0 115 50 58 0.854 2.35 8.02 Intr - 101428 101315 114 1 0 68 87 128 0.997 11.34 8.01 Init - 103486 103256 231 1 0 65 96 446 0.593 39.36 8.00 Prom - 110276 110237 40 -4.56 9.00 Prom + 113154 113193 40 -7.36 9.01 Sngl + 114686 115162 477 1 0 76 53 715 0.946 62.90 9.02 PlyA + 115412 115417 6 -0.45 10.11 PlyA - 115515 115510 6 1.05 10.10 Term - 120498 120439 60 0 0 51 50 79 0.127 -1.80 10.09 Intr - 122411 121620 792 2 0 15 -33 375 0.005 9.77 10.08 Intr - 123488 123241 248 0 2 46 41 219 0.003 10.18 10.07 Intr - 136169 136002 168 0 0 126 27 72 0.029 4.82 10.06 Intr - 141415 141342 74 0 2 57 45 27 0.053 -5.65 10.05 Intr - 142664 142597 68 2 2 113 64 45 0.284 2.30 10.04 Intr - 143986 143874 113 2 2 95 87 38 0.609 4.50 10.03 Intr - 145302 145140 163 0 1 52 20 80 0.540 -2.85 10.02 Intr - 145922 145758 165 2 0 16 94 149 0.597 8.46 10.01 Init - 151103 151056 48 2 0 95 73 18 0.232 1.95 10.00 Prom - 159026 158987 40 -1.46 11.00 Prom + 162352 162391 40 -0.36 11.01 Init + 162586 162834 249 0 0 39 50 123 0.526 1.23 11.02 Term + 166316 166738 423 1 0 43 40 553 0.451 41.30 11.03 PlyA + 167606 167611 6 1.05 12.02 PlyA - 168821 168816 6 1.05 12.01 Term - 176212 175995 218 2 2 82 42 148 0.812 6.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:23776191_23979676|GENSCAN_predicted_peptide_1|226_aa MVITIITTIIIIITTITITIIITVTITATTKTITINTSTTIIPTITTIITTNTIINIITI ITTTIIINTTISAIITITTTTITTITITNFTTITITTTITTRATIITTIITTTTTTNIIT IITITITINNTSITAVINITISITTITISILIIIIHIMSTTIIINTITIITITSITTIPS SSSSKPPPPSPFSSPAYHSHHHHHLQHHCQVSVSSCSALDIVPHPL >gi568815578r:23776191_23979676|GENSCAN_predicted_CDS_1|681_bp atggtcatcaccatcatcaccaccatcatcataatcatcactaccatcaccatcactatt atcatcaccgtcaccattactgccaccaccaagaccatcaccatcaacaccagtaccacc ataatacccaccatcactaccatcattaccaccaacactattatcaacatcatcactatc atcaccaccaccatcattatcaataccaccatctctgccatcatcaccatcaccaccacc actattactaccatcactattaccaacttcaccactattaccatcaccactactattact acaagagctaccatcatcaccaccattataaccaccaccaccaccaccaacatcatcact attatcacaatcaccatcaccattaacaataccagcattactgctgtcatcaacatcacc atctccattaccaccatcaccatcagtatactcatcatcattatacacatcatgagcacc actatcatcatcaacaccatcaccatcattaccattaccagcattaccaccataccatca tcatcatcttcaaaaccaccaccaccatcaccattttcatccccagcatatcattctcat caccatcatcatcttcagcaccactgccaggtgtctgtcagcagctgttctgcattggac attgtgccacaccctttataa >gi568815578r:23776191_23979676|GENSCAN_predicted_peptide_2|192_aa MVILPKVIYRFNAILIKLPMTFFTELEKTTLKFIWNQKRAHIAKSILSQKNKVGGITLPD FKLYYKATVTKTAWYRYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPDKNKKWGKDSLFNK FCWENWLAIGRKLKLDPFLTPYTKINSRRIKDLNVKPKTIKTLEENLGITIQDIGMSKNF MSKTPKAMATEA >gi568815578r:23776191_23979676|GENSCAN_predicted_CDS_2|579_bp atggtcatactgcccaaggtaatttatagattcaatgccatcctcatcaaactaccaatg actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc cacattgccaagtcaatcctaagccaaaagaacaaagttggaggcatcacgctacctgac ttcaaactatactacaaggctacagtaaccaaaacagcatggtaccggtaccaaaacaga gatatagaccaatggaacagaacagagccctcagaaataatgccacatatctacaactat ctgatctttgacaaacctgacaaaaacaagaaatggggaaaggattccctatttaacaaa ttttgctgggaaaactggctagccataggtagaaagctgaaactggatcccttccttaca ccttatacaaaaattaattcaagacggattaaagacttaaatgttaaacctaaaaccata aaaaccctagaagaaaacctaggcattaccattcaggacataggcatgagcaagaacttc atgtcaaaaacaccaaaagcaatggcaacagaagcgtaa >gi568815578r:23776191_23979676|GENSCAN_predicted_peptide_3|189_aa MASRWHSLLFLQLSPQPLLDRPRMDPLRRQGLSQGGHHHYHHDNHHCPITITVTTTVVNI ISSVTIIISNITIITLSTISTFIPIIINTPSPLLLSLSTTITIIINTIAIAISITTNITI TTITTTTTITITIIITTITNTIAITITTTITMVILISNTIIIMTTITNINMTTITTIITT SLSSLPSSP >gi568815578r:23776191_23979676|GENSCAN_predicted_CDS_3|570_bp atggccagtagatggcatagcctcctgttcctgcagctgtccccacagcccctgctggac aggcccagaatggaccctctcagaagacaaggactctctcagggtggccatcaccattat catcatgacaatcaccactgtcccatcaccatcaccgttaccaccactgttgtaaacatc atctctagtgtcaccattattatctccaacatcaccatcatcactctcagtaccatcagc acatttatccccattatcatcaacaccccatcaccactattactatcactatcaaccacc atcaccatcattatcaacaccattgccattgccatcagtattactactaacatcaccatc accactatcaccacaaccaccaccattaccatcaccatcatcatcactaccatcaccaac accattgccatcaccattaccaccacaatcaccatggtaatcctcatcagtaacaccatc atcatcatgactaccattaccaatatcaacatgaccaccataaccaccatcatcaccaca tcactgtcttcattaccatcatcaccataa >gi568815578r:23776191_23979676|GENSCAN_predicted_peptide_4|226_aa MRVHGFILEVSKTKNPPERTNSGHILVIMKALLPMTKQQIKTDLGKFSDDPDSYIDVLQG LRQTFNLTWRDVMLLLDQILAFNEKNAALSSAKEFGDTWYLTQVNDRMTAEERDKFPTAQ QAIPSRAPHWDLNSDHGDWSHKHLLTCVLEGVRRMRKKPMNYSVMSTMSGKGRKCFCLPQ AATGGLKKIYSPVTQLTGGSIDPKREVYYPISCRHQEKAPKVSHGP >gi568815578r:23776191_23979676|GENSCAN_predicted_CDS_4|681_bp atgagggtccacggcttcattcttgaagtcagcaagaccaagaacccaccagaaagaacc aattctggacacattttggtgatcatgaaggcactattgcctatgaccaagcagcagatc aagacagacctggggaagttttcagatgatcctgatagttacatagatgtcctacagggc ctaaggcaaaccttcaatctcacttggagagatgtcatgctattattagatcaaatcctg gcctttaatgaaaagaatgcagctttatcttcagccaaagagtttggagatacctggtat cttactcaagtaaatgatagaatgacagctgaagaaagggacaaattccctactgctcag caagccatccccagtagggctccccactgggacctcaactcagatcatggagactggagt cacaaacatctgttaacctgtgttctagaaggagtaaggagaatgaggaaaaagcccatg aattattcagtgatgtccaccatgtcagggaaaggaagaaaatgcttctgcctccctcaa gcagctacgggaggccttaagaaaatatactcccctgtcacccaactcactggagggtca attgatcctaaaagagaagtttattacccaatcagctgcagacatcaggagaaagctcca aaagtaagccatgggccctga >gi568815578r:23776191_23979676|GENSCAN_predicted_peptide_5|141_aa MAWPLCTLLLLLATQAVALAWSPQEEDRIIEGGIYDADLNDERVQRALHFVISEYNKATE DEYYRRLLRVLRAREQIVGGVNYFFDIEVGRTICTKSQPNLDTCAFHEQPELQKKQLCSF QIYEVPWEDRMSLVNSRCQEA >gi568815578r:23776191_23979676|GENSCAN_predicted_CDS_5|426_bp atggcctggcccctgtgcaccctgctgctcctgctggccacccaggctgtggccctggcc tggagcccccaggaggaggacaggataatcgagggtggcatctatgatgcagacctcaat gatgagcgggtacagcgtgcccttcactttgtcatcagcgagtataacaaggccactgaa gatgagtactacagacgcctgctgcgggtgctacgagccagggagcagatcgtgggcggg gtgaattacttcttcgacatagaggtgggccgaaccatatgtaccaagtcccagcccaac ttggacacctgtgccttccatgaacagccagaactgcagaagaaacagttgtgctctttc cagatctacgaagttccctgggaggacagaatgtccctggtgaattccaggtgtcaagaa gcctag >gi568815578r:23776191_23979676|GENSCAN_predicted_peptide_6|216_aa MELKAMARDIHDACTSFSSQIDQVEERISVIEDQINEMKQEEKFREKRVKTNEQSLQEIC GYVKRPNLRLIGVPEGDGENGIKLENTLQDIIQENFPDLARQANIQIQEIQRTPQRYTLR RATPRHIMRIKYLGIQLTRDVKDLFKENYKLLLNEINEDTNKWKNIPRSWIGRINILKMA IMPKCHSPFCLTPYLCSKEQYRVWMLVKRVHSNQPA >gi568815578r:23776191_23979676|GENSCAN_predicted_CDS_6|651_bp atggagctgaaagccatggcacgagacatacatgatgcatgcacaagcttcagtagccaa attgatcaagtggaagaaaggatatcagtgattgaagatcaaattaatgaaatgaagcaa gaagagaagtttagagaaaaaagagtaaaaacaaatgaacaaagcctccaagaaatatgt ggctatgtgaaaagaccaaatctacgtttgattggtgtgcctgaaggtgatggggagaat ggaatcaagctggaaaatactcttcaggatattatccaggagaacttcccagacctagca aggcaggccaacattcaaattcaggaaatacagagaacaccacaaagatacaccttgaga agagcaaccccaagacacataatgagaataaaatacctaggaatccaacttacaagggat gtgaaggatctcttcaaggagaactacaaattactgctcaatgaaataaacgaggacaca aacaaatggaagaatattccacgctcatggataggaagaatcaatatcctgaaaatggcc ataatgcccaagtgccacagtcccttctgcctcaccccctatctctgctctaaggaacag tacagagtgtggatgctggtgaaacgtgttcatagtaaccagccagcctga >gi568815578r:23776191_23979676|GENSCAN_predicted_peptide_7|61_aa MNYFFDVEMGQITCAKSQPNLDNCSFHHQRKLQEKQFCSFQICEVPWDDRISLVKSTCQN V >gi568815578r:23776191_23979676|GENSCAN_predicted_CDS_7|186_bp atgaactacttctttgatgtggagatggggcaaatcacatgtgccaagtcccagcccaac ttggacaactgttccttccatcaccagcgaaaacttcaggagaaacagttctgctctttc cagatctgtgaagttccttgggatgacagaatatctttggtgaaatccacatgtcagaat gtctag >gi568815578r:23776191_23979676|GENSCAN_predicted_peptide_8|142_aa MMWPMHTPLLLLTALMVAVAGSASAQSRTLAGGIHATDLNDKSVQCALDFAISEYNKVIN KDEYYSRPLQVMAAYQQIVGGVNYYFNVKFGRTTCTKSQPNLDNCPFNDQPKLKEEEFCS FQINEVPWEDKISILNYKCRKV >gi568815578r:23776191_23979676|GENSCAN_predicted_CDS_8|429_bp atgatgtggcccatgcacaccccactgctgctgctgactgccttgatggtggccgtggcc gggagtgcctcggcccaatctaggaccttggcaggtggcatccatgccacagacctcaat gacaagagtgtgcagtgtgccctggactttgccatcagcgagtacaacaaggtcattaat aaggatgagtactacagccgccctctgcaggtgatggctgcctaccagcagatcgtgggt ggggtgaactactacttcaatgtgaagttcggtcgaaccacatgcaccaagtcccagccc aacttggacaactgtcccttcaatgaccagccaaaactgaaagaggaagagttctgctct ttccagatcaatgaagttccctgggaggataaaatttccattctgaactacaagtgccgg aaagtctag >gi568815578r:23776191_23979676|GENSCAN_predicted_peptide_9|158_aa MVITIITTLTIIITTITITIIITVTITATTKTITIKTSTTIIPTITTIFTTITIITIIIT IITTIIIIINTTVSAVITITTTTITTITITNFTTITITTIIITRATIITTIITTTTTTII TIITITSPLSIPTLLLSSTSPSPLPPSPSVYSPSLYSS >gi568815578r:23776191_23979676|GENSCAN_predicted_CDS_9|477_bp atggtcatcaccatcatcaccaccctcaccataatcatcactaccatcaccatcactatt atcatcaccgtcaccattactgccaccaccaagaccatcaccatcaaaaccagtaccacc ataatccccaccatcactaccatctttaccaccatcaccattatcaccatcatcatcact atcatcaccaccatcatcatcattatcaatacgaccgtctctgccgtcatcaccatcacc accaccactattactaccatcactatcaccaacttcaccactattaccatcaccactatt atcattacaagagctaccatcatcaccaccattataaccaccaccaccaccaccatcatc actattatcacaatcacatcaccattatcaataccaacattactgctgtcatcaacatca ccatcaccactaccaccatcaccatcagtatactcaccatcattatactcatcatga >gi568815578r:23776191_23979676|GENSCAN_predicted_peptide_10|632_aa MGKDGATNTDGDTSRRSPPALARKIIQSDICDADLNDDRWQHALDFAISEYNKERMMNTT VTPQVLPGQQQGEWPSPSGWLTTPDSPGQLSSFWGGARLTLLLPADIASDLNQMRTAVTQ HSGGGDVGWVSYFFDMKIDLTTCTKSQPDLDNCLFSDQPQLKEKQFCSFQIYEVPLKDRM SLVKSRQRSAALTASASGAGLSNIFHAYSCRCSLGQDALGYGVHLSLMSHVLKNKSQRNN GEDGRKPPGPLLRTLQQLCEHHPTTFEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLP RLNQEEVEFLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELRTIYLGIQLTR DMKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKVAILPKVIYRFNAIPIKLPMT FFTELEKTTLKFIWNQKRARITKSILSQKNKAGGITLADFKVYYEATVTKTAWYWYQNRD IDQWNRTEPSEITLLIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICGKLKLDPFLTP YTKINLGWIKDLNVRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAMATKAIIDKWDLMK LKSFCTAKETTFRVFGEQVVFGYINKLFNGDF >gi568815578r:23776191_23979676|GENSCAN_predicted_CDS_10|1899_bp atgggcaaggatggggctactaacacagatggggacactagtcgaaggagtcccccagct ctggctagaaagatcattcaaagtgacatctgtgatgcagaccttaatgatgatagatgg cagcatgccctggacttcgccatcagtgagtacaacaaggagagaatgatgaacactaca gtcaccccgcaggtgctgccaggccaacagcagggcgagtggccttcaccctcaggctgg ctgaccacccccgacagcccagggcagctgagttccttctggggtggagcacgcctgacc ctgcttttgccagctgacatagcatcagacctcaaccagatgaggacagcagtcacccag catagtggaggaggggatgtgggctgggtgagctacttctttgatatgaagatagaccta accacatgcactaagtctcagccagacttggacaactgtctcttcagtgaccagccacag ttgaaggagaaacagttctgctctttccagatctacgaagttcccttgaaggacagaatg tccctggtgaaatccagacagaggagtgcagccctcacggccagtgcctctggggcaggg ctctcaaatatcttccatgcctacagctgcaggtgtagtttgggacaggatgccttgggc tatggagtccacctttcacttatgagccatgtgctcaagaacaagagccaaaggaacaat ggagaagatggaaggaaaccacctgggcccctgctgaggacactgcagcaactctgtgaa caccatcccacgacctttgaaatacaaactaccatcagagaatactacaaacacctctac gcaaataaactagaaaatctagaagaaatggataaattcctcgacacatacactctccca agactaaaccaggaagaagttgaatttctgaatagaccaataacaggatctgaaattgtg gcaataatcaatagcttaccaaccaaaaagagtccaggaccagatggattcacagccgaa ttctaccagaggtacaaggaggaactgagaacaatatacttaggaatccaacttacaagg gacatgaaggacctcttcaaggagaactacaaaccactgcttaatgaaataaaagaggat acaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaagtg gccatactgcccaaagtaatttatagattcaatgccatccccatcaagctaccaatgact ttcttcacagaattggagaaaactactttaaagttcatatggaaccaaaaaagagcccgc atcaccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgctagctgacttc aaagtatactacgaggctacagtaaccaaaacagcatggtactggtaccaaaacagagat atagatcaatggaacagaacagagccctcagaaataacgctgcttatctacaactatctg atctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatgg tgctgggaaaactggctagccatatgtggaaagctgaaactggatcccttccttacacct tatacaaaaatcaatttgggatggattaaagacttaaacgttagacctaaaaccataaaa accctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggacttcatg tccaaaacaccaaaagcaatggcaacaaaagccataattgacaaatgggatctaatgaaa ctaaagagcttctgcacagcaaaagaaactaccttcagagtttttggcgaacaggtggtg tttggttacatcaacaagttgtttaatggtgatttctga >gi568815578r:23776191_23979676|GENSCAN_predicted_peptide_11|223_aa MKRLSTLLTLHLYAYPIPPGRGTGTQDLPNGETARAVRAETPPNSPHCGDEERKAVAILG ARAVICFNTLFGALWFLASPSFQHHHHYHNHSDHYHHHSTNANIITITIIIITTTTSISI INTAAISITSTTIITITTIAITNTIIITTNNINTITISTINTISITTILTTITTIITTIT PMITIIIASHHYHHQHHQHHHYSHRHHYHKRSHHDHHHHHHHH >gi568815578r:23776191_23979676|GENSCAN_predicted_CDS_11|672_bp atgaagcgcctctccaccttgctcaccctccacttgtacgcataccccattcctcctgga cgtgggacaggaactcaggacctgccaaatggcgagactgcaagagctgtaagggctgaa acaccccccaactcaccacactgtggtgatgaggagagaaaagctgtggccattctggga gccagggctgtgatatgctttaacaccctgtttggggctctgtggttcctggcatctcca agctttcagcaccaccaccactaccataaccattctgatcattatcaccatcattctacc aatgctaacatcatcaccatcaccatcatcatcatcactaccaccaccagcatctccatt atcaacactgccgccatcagcatcacctccactactatcatcacaattaccaccatcgca atcacaaacaccatcatcatcaccacaaataacataaacaccatcaccatcagtaccatc aacaccatcagcatcacaaccatcctcaccaccatcaccactatcatcactaccatcacc cccatgattaccattattattgccagtcaccactaccatcaccaacaccaccagcaccat cattattctcaccgtcaccattatcacaagcgttctcaccatgaccatcatcaccatcac caccatcactga >gi568815578r:23776191_23979676|GENSCAN_predicted_peptide_12|72_aa XLTLGHCTAGQDALGYGVHLSLMSNVPKNKSPRNGVEDGRKPPGPLLRTLHQLYELTTPT AVGKWMSSSWEV >gi568815578r:23776191_23979676|GENSCAN_predicted_CDS_12|219_bp ncccttacccttggacactgtactgcgggacaggatgccttgggctatggtgttcacctt tcactcatgagcaatgtgcccaagaacaagagcccaaggaatggtgtagaagatggaagg aaaccacctgggcccctgctgaggacactgcaccaactctatgaattgaccacccctaca gctgttggtaaatggatgtcgtcttcatgggaggtgtag