GENSCAN 1.0 Date run: 8-Nov-116 Time: 09:35:30

Sequence gi568815596r:46805466_47009171 : 203706 bp : 48.13% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    153    348  196  2  1   25   13   189 0.181   4.09
 1.02 Term +   8723   8946  224  0  2  127   48   141 0.952  11.08
 1.03 PlyA +  10280  10285    6                               -0.45

 2.08 PlyA -  10653  10648    6                               -0.45
 2.07 Term -  14570  14070  501  2  0  -42   49   213 0.057  -1.12
 2.06 Intr -  15757  15401  357  1  0  -19   41   243 0.155   4.35
 2.05 Intr -  16443  16342  102  0  0   52   66    62 0.506   0.77
 2.04 Intr -  20455  20276  180  1  0   57  105    47 0.809   3.36
 2.03 Intr -  22598  22423  176  0  2  105   64    96 0.834   8.56
 2.02 Intr -  23642  23590   53  1  2   55   81    54 0.408  -0.05
 2.01 Init -  30693  30665   29  0  2   60  110    30 0.272   1.51
 2.00 Prom -  39339  39300   40                               -3.06

 3.00 Prom +  40681  40720   40                               -2.36
 3.01 Init +  51121  51243  123  0  0   47   78    73 0.168   2.38
 3.02 Intr +  58055  58148   94  1  1   50   92    79 0.578   4.04
 3.03 Intr +  60494  60665  172  0  1   48   64    66 0.171  -0.80
 3.04 Intr +  63756  63816   61  0  1   96   77    19 0.485   0.34
 3.05 Term +  65536  65676  141  0  0  129   39    80 0.536   5.13
 3.06 PlyA +  65864  65869    6                               -1.75

 4.03 PlyA -  66154  66149    6                                1.05
 4.02 Term -  66708  66549  160  2  1   32   49   132 0.281   1.11
 4.01 Init -  75993  75857  137  0  2   75   72   147 0.788  11.41
 4.00 Prom -  91960  91921   40                               -2.66

 5.00 Prom +  92125  92164   40                               -5.76
 5.01 Init +  92888  92936   49  1  1   58  107    49 0.731   4.91
 5.02 Term +  95424  95497   74  1  2  108   45    41 0.646  -0.13
 5.03 PlyA +  96574  96579    6                                1.05

 6.04 PlyA -  99149  99144    6                                1.05
 6.03 Term - 100129  99998  132  1  0   82   37   120 0.965   4.39
 6.02 Intr - 102504 102345  160  2  1   57   92   174 0.997  14.69
 6.01 Init - 103700 103558  143  2  2   67   51   186 0.918  10.41
 6.00 Prom - 116594 116555   40                               -5.26

 7.00 Prom + 125441 125480   40                               -3.26
 7.01 Init + 135497 135555   59  1  2   59   61    91 0.574   2.28
 7.02 Intr + 135557 136260  704  2  2  -40  117   572 0.245  38.41
 7.03 Intr + 144898 145061  164  2  2  112  123    71 0.998  12.69
 7.04 Intr + 151374 151542  169  2  1   67   65   302 0.804  25.32
 7.05 Intr + 153030 153087   58  1  1  117   94    -2 0.310   1.24
 7.06 Intr + 155620 155731  112  1  1   87   61    14 0.105  -1.02
 7.07 Intr + 162465 162531   67  2  1   85   69    43 0.360   0.58
 7.08 Intr + 169379 169480  102  0  0  120   41    28 0.700   1.45
 7.09 Intr + 169508 169638  131  0  2  120  105   106 0.968  15.91
 7.10 Intr + 173327 173442  116  1  2  105   61   119 0.085  10.15
 7.11 Intr + 187985 188063   79  2  1   90   40   123 0.889   7.25
 7.12 Intr + 188892 189049  158  2  2   72  108   147 0.961  13.91
 7.13 Intr + 189671 189734   64  2  1   62  103   138 0.511  11.42
 7.14 Intr + 198638 198802  165  1  0   82   96    26 0.546   2.96
 7.15 Intr + 200457 200594  138  2  0   78   80   197 0.966  18.56
 7.16 Intr + 201176 201259   84  1  0  101  116    70 0.997  11.12

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 110703 110962  260  1  2   88   44   160 0.817   7.31
S.002 Term + 173327 173446  120  1  0  105   42   115 0.910   7.07
S.003 Term - 176368 176132  237  1  0  111   44   153 0.870   9.37

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:46805466_47009171|GENSCAN_predicted_peptide_1|139_aa
ELSLEPMVPASFPSGHSVLSPGLCLSAGTLNTCSGAVSGSPTSYFLRYLQAFMWVLLLDP
AYLEKGFGKGCFTRGLSWSWKQCEFILARQRGPDSPSLDASVGRGLENVAQAAVSSRYSP
QHVHGRVTAITCSSEKEMR

>gi568815596r:46805466_47009171|GENSCAN_predicted_CDS_1|420_bp
gagctgtcgctggagcccatggtccctgccagctttcctagtggtcactcagtcctgtca
ccaggcctctgcctttctgctggcactctcaacacctgctcaggtgctgtctcaggctcc
cctacctcctacttcctgcgctacctccaggccttcatgtgggtgcttctcctggaccca
gcgtacctggagaaggggtttggcaaaggctgcttcactcgaggcctctcatggagctgg
aaacaatgcgagtttattttggcacggcaaagagggcccgattccccttctctggacgcc
tctgtgggaagaggactagagaatgtggcccaggccgctgtcagttcccgttactctccc
cagcacgtccacggccgagtcacagccatcacatgtagctctgagaaggaaatgagataa

>gi568815596r:46805466_47009171|GENSCAN_predicted_peptide_2|465_aa
MLEEAACQRRVLETVKSKIMVLANSVSAQHLITSELEDQGLSEEDCRTPRYQLTGWLISW
VQYGQAYRLGMRTSEAPRSPQAAWTATPGQGILETERQGPEFINHLQTCRSGLGAQIPFG
ASWARALTRRTLEKEGRPRGEAQGWPDPSPEPQSDAVDLLSSSLQSSWVCETIEESQQRQ
INKIDRPLTRLIKKKREKNQIDAIKNDKGDITTDPTEILTTIREYYKHFYANKLENLEEM
DKFFNTYTLPRLNKEEVESLNRPIIGSEIEAIINSLPTKKSPGPDGFTAEFYQRYKEELN
KIPRNPTYKGSEGPLQGDYKPLLSEIKGDTNKRKNIPCSWIGRINIVKMAILPKVIYRFN
AIPIRLPMPFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPVFKLYYKATVTKT
AWYWYQNRDIEQWNRTEPSEIIPHIYNYLIFETLNVLKKECLRIQ

>gi568815596r:46805466_47009171|GENSCAN_predicted_CDS_2|1398_bp
atgctggaggaggcagcatgtcagcgcagggttctggagactgtgaagtccaagatcatg
gtgctggccaattcagtttctgcccaacacctcatcacctccgagttagaggaccagggc
ttgagcgaggaggactgtaggaccccgaggtaccagctgacagggtggctgatctcatgg
gtccagtatggccaggcctacaggctggggatgaggacctctgaagcccctcgtagcccc
caggctgcctggacggcaaccccaggccaggggatcctggaaacagaacggcagggccct
gagttcatcaaccacctgcaaacctgccggtctgggttgggtgcccaaatcccgtttggg
gccagctgggccagggccctcacacggagaacactcgagaaggaagggcggcccagagga
gaggcccaagggtggccggacccgagccctgaaccacagtctgatgccgtggacctgctt
tcaagcagcttacagtctagttgggtctgtgagaccattgaagaatcccagcagaggcag
atcaacaaaattgatagaccgctaacaagattaataaagaagaaaagagagaagaatcaa
atagacgcaataaaaaatgataaaggagatatcaccaccgatcccacagaaatactaact
accatcagagaatactataaacacttctacgcaaataaactagaaaatctagaagaaatg
gataaattcttcaacacatacaccctcccaagactaaacaaggaagaagttgaatccctg
aatagaccaataataggctctgaaattgaggcaataattaatagcttaccaaccaaaaaa
agtccaggaccagacggattcacagctgaattctaccagaggtacaaggaggagctgaat
aaaatacctaggaatccaacttacaagggatctgaaggacctcttcaaggagactataaa
ccactgctcagtgaaataaaaggggatacaaacaaacggaagaacattccatgctcatgg
ataggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttatagattcaat
gctatccccatcaggctaccaatgcctttcttcacagaattggagaaaactactttaaag
ttcatatggaaccaaaaaagagcccgcattgccaagtcaatcctaagtcaaaagaacaaa
gctggaggcatcacgctacctgtcttcaaactatactacaaggctacagtaaccaaaaca
gcatggtactggtaccaaaacagagatatagagcaatggaacagaacagagccctcagaa
ataataccacacatctacaactatctgatctttgaaactctcaatgtgctcaaaaaggag
tgtttgaggatccagtag

>gi568815596r:46805466_47009171|GENSCAN_predicted_peptide_3|196_aa
MHLLWNANPPGKKTLVSAHTSGGSTGDWVEVQRTGPGPQGEGLQMAAAGLKPEAGWEKAE
ARIKSVSVGEEEVARDMSFSHLLLPLLQAHSRSDSVNLSLFPGRPPGPRRPTHRGTCCFL
VLAFAHLRLRPSSLCASWTVSLQTPESSLKELAPRKTTHHQGSTGRPLVGKVGSLVLNHP
QGLDIKPSNLQALAFE

>gi568815596r:46805466_47009171|GENSCAN_predicted_CDS_3|591_bp
atgcatctcctgtggaatgcaaatccccctggcaagaaaaccctcgtgtcagcccatacc
tcggggggatctactggagactgggtggaggtgcagagaacaggtcctgggccccaaggg
gaggggctacagatggccgcagctgggctaaagcccgaggctggctgggagaaggcggaa
gcccgaataaaatcagtctctgttggtgaggaagaagtagcccgagacatgtctttctcg
caccttcttcttcccctgctgcaagcccactcccgttcagacagcgtgaacctgagtctt
tttcctggcaggccacctgggccacgaaggcccacacatcgaggcacctgctgcttctta
gtgctggcctttgctcacctaagactcaggccaagctccctgtgtgcctcctggacagtg
tccctgcagactcctgagtcctcactgaaggagctggccccacggaagactacccatcac
cagggcagtactgggcggccacttgtagggaaagtggggagtctagtccttaaccatcct
cagggccttgacatcaagccgtccaacctgcaggccctggcttttgaatga

>gi568815596r:46805466_47009171|GENSCAN_predicted_peptide_4|98_aa
MGQQGTSRDSIVHLRASGVGPGQVYCSNAVQTPTDPEKNGQLLTNEAHGVQQIPGTDSID
LRNSEKGDSIMARFREIFLEEVKSEVNFEGQCSFRPIR

>gi568815596r:46805466_47009171|GENSCAN_predicted_CDS_4|297_bp
atggggcagcagggcacctcccgagacagcatcgtgcacctacgagcatctggagttggt
ccaggccaggtctactgctccaatgcagtacagacacctactgacccagaaaaaaacggc
caattgctgacaaacgaagcacatggcgtgcagcagattcctggtacagactctattgac
ctcaggaattccgagaagggagacagcatcatggctagattcagggaaatcttcctggag
gaggtgaagtctgaggttaactttgaaggacagtgcagctttagacctattagataa

>gi568815596r:46805466_47009171|GENSCAN_predicted_peptide_5|40_aa
MVLDGIVSENQSLFLGGNCYYSYILDEKLKYRNFNAKKQA

>gi568815596r:46805466_47009171|GENSCAN_predicted_CDS_5|123_bp
atggttttagatgggatcgtgtctgaaaaccagtctttgttcctgggaggtaactgttat
tatagttatattttagatgagaaactgaagtatagaaacttcaatgctaagaaacaggcc
tag

>gi568815596r:46805466_47009171|GENSCAN_predicted_peptide_6|144_aa
MRSLLRTPFLCGLLWAFCAPGARAEEPAASFSQPGSMGLDKNTVHDQEHIMEHLEGVINK
PEAEMSPQELQLHYFKMHDYDGNNLLDGLELSTAITHVHKEEGSEQAPLMSEDELINIID
GVLRDDDKNNDGYIDYAEFAKSLQ

>gi568815596r:46805466_47009171|GENSCAN_predicted_CDS_6|435_bp
atgagatccctgctcagaacccccttcctgtgtggcctgctctgggccttttgtgcccca
ggcgccagggctgaggagcctgcagccagcttctcccaacccggcagcatgggcctggat
aagaacacagtgcacgaccaagagcatatcatggagcatctagaaggtgtcatcaacaaa
ccagaggcggagatgtcgccacaagaattgcagctccattacttcaaaatgcatgattat
gatggcaataatttgcttgatggcttagaactctccacagccatcactcatgtccataag
gaggaagggagtgaacaggcaccactaatgagtgaagatgaactgattaacataatagat
ggtgttttgagagatgatgacaagaacaatgatggatacattgactatgctgaatttgca
aaatcactgcagtag

>gi568815596r:46805466_47009171|GENSCAN_predicted_peptide_7|790_aa
MGRGREARRPRAPGLRAGLNELRLRGGGGGGGRVPGRPRAPFARLAAVAALAASEPGSPR
PPHAPAGVEPRPGAGPQLAAPGPGAEAVAAAAAAAAAAAPGAATAEAEVAPVRVRAARPE
PQPGGAGRCAGSCWCCCCCCCPPSAARAPAAARAPAAVCAPVDPARECAPARTPPPAGSP
LLGRTFHDSAREKMAAKGAHGSYLKVESELERCRAEGHWDRMPELVRQLQTLSMPGGGGN
RRGSPSAAFTFPDTDDFGKLLLAEALLEQCLKENHAKIKDSMPLLEKNEPKMSEAKNYLS
SILNHGRLSPQYMCEAMLILGKLHYVEGSYRDAISMYARAGIDDMSMENKPLYQMRLLSE
AFVIKACLRLLGGTQSRWSEVLQGRPCHGPFVTSASLRTPDQTCQIRGRPWKGNVPSRFQ
VQGTQVWDSSSVGAAAVLVPFDRLDLRLLLAAPECLPPRMSPPSAHGEGLESGGPSRTGL
SLERLPNSIASRFRLTEREEEVITCFERASWIAQVFLQELEKTTNNSTSRHLKGCHPLDY
ELTYFLEAALQSAYVKNLKKGNIVKGMRELREVLRTVETKATQNFKVMAAKHLAGVLLHS
LSEECYWSPLSHPLPEFMGKEESSFATQALRKPHLYEGDNLYCPKDNIEEALLLLLISES
MADSRAQPCRDFRASRHPLLGHLSPWGHGQHSPTVATGQPSEPLATSLSVTELSPRATRD
VVLSRVPEQEEDRTVSLQNAAAIYDLLSITLGRRGQYVMLSECLERAMKFAFGEFHLWYQ
VALSMVACGK

>gi568815596r:46805466_47009171|GENSCAN_predicted_CDS_7|2370_bp
atggggagggggcgggaagcgaggcgcccccgggcgccggggctccgcgcgggattaaat
gagctccgactccgcggcgggggcggcggcggggggcgggtacccgggcggccgcgagcg
cccttcgcgcgcctggctgctgtggccgcgctggcagccagcgagcccggctcgccgagg
cctccccacgcccccgcgggggtggagccgcggccaggggcgggtccgcagctggcggcg
ccggggcccggggcggaggctgtggcagcagctgcagcggcggcggcggcggcagcgcca
ggagctgctacagcagaggcggaggttgctcctgtacgcgtacgggccgctcggccggag
ccgcagcccggaggcgccgggcggtgcgctgggagctgctggtgctgctgctgctgctgc
tgcccaccctccgccgcccgggcccccgctgccgcccgggccccggctgccgtctgcgcc
cccgtcgaccccgcccgcgagtgcgccccagccaggacgccgcccccggccgggtctcca
cttcttggccgcaccttccatgacagcgcccgcgagaagatggctgcgaagggcgcgcac
ggctcctacctgaaggtggagagcgagctggagcgctgccgcgccgagggccactgggac
cgcatgccggagctggtccggcagctgcagacgctgagcatgcccggcggcggaggtaac
aggcgaggcagcccgagcgcagcgttcacctttccggacaccgatgactttgggaaattg
ctgctggctgaggccctcctggagcagtgtttgaaggagaaccatgccaaaataaaagac
tccatgcctttgctggagaagaatgagccgaagatgagcgaagccaaaaattatctaagc
agtatccttaaccatgggaggctctcgccacagtacatgtgtgaggccatgctgatcctg
ggcaaactgcattacgtggagggctcataccgagatgccatcagcatgtacgcacgggcc
gggattgatgacatgtccatggagaacaagcccctgtatcagatgcggctgctgtcggag
gcttttgtcatcaaagcatgcctccggcttcttgggggaacacagtcccggtggtcagag
gtcctgcaggggaggccctgccatggcccctttgtaacttccgccagcctacggacccct
gaccaaacatgtcagatcaggggaagaccctggaaaggaaatgttcccagccgattccag
gtgcagggtacccaagtctgggattcatcttccgtgggtgctgcagctgtcctcgtgccc
tttgacaggctggacctccgcctcctcctggctgcaccagagtgtctgccccctcggatg
agcccaccttcggcccatggggagggcctggagtcagggggcccgagcaggacaggcctc
tctctggaacgcctacccaactccatcgcctcccgcttccgcctgacagagagggaggag
gaagtgatcacctgttttgagagggcctcctggatcgctcaggtgttcctgcaggaattg
gagaagaccacaaataacagcacgtcgaggcatctgaaaggctgtcacccgcttgactat
gagctcacctacttcctggaagctgccctccagagcgcctatgtgaaaaacctgaagaag
gggaacatcgtgaagggcatgagagagctccgggaggtgctgcggactgtggagaccaaa
gcaactcagaacttcaaagtgatggcggccaagcacctggcgggggtcctgctgcactcc
ctgagtgaggagtgctactggagccccctgtcccaccctctgcctgagttcatgggcaag
gaggagagttctttcgccactcaggccctgcggaaacctcacctctatgaaggagacaac
ctctactgccccaaggacaacatcgaggaagccctcctgctcctcctcatcagcgaatcc
atggctgactccagagctcagccttgccgggatttccgggcatcccggcaccctctcctg
ggccatctaagtccatgggggcatgggcagcactcaccaactgtggcaacagggcagcca
tcagaacccctggccaccagtctctcagtgactgagctgtcccccagggcaactcgagat
gtggtgctgagccgggtgccggagcaggaggaggaccggacagtgagcttgcagaatgcc
gcagccatctatgacctcctgagcatcacgttgggcagaaggggacagtacgtcatgctc
tcggagtgcctggagcgagccatgaagtttgcgtttggagaatttcacctttggtaccag
gtggccctctccatggtggcttgtgggaag