GENSCAN 1.0 Date run: 8-Nov-116 Time: 06:23:36 Sequence gi568815585r:37464518_37698726 : 234209 bp : 35.88% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 7177 7330 154 0 1 60 73 82 0.502 4.10 1.02 Intr + 7411 7673 263 2 2 36 53 160 0.132 3.58 1.03 Term + 20607 20756 150 2 0 81 43 85 0.090 0.23 1.04 PlyA + 21497 21502 6 1.05 2.05 PlyA - 21571 21566 6 1.05 2.04 Term - 23308 23116 193 0 1 96 48 141 0.437 6.81 2.03 Intr - 27199 27157 43 2 1 76 102 -10 0.239 -4.42 2.02 Intr - 29563 29380 184 1 1 73 25 152 0.219 5.84 2.01 Init - 30159 30139 21 0 0 99 84 25 0.516 2.99 2.00 Prom - 31402 31363 40 -3.95 3.00 Prom + 38218 38257 40 -1.45 3.01 Init + 41620 41694 75 0 0 68 74 3 0.334 -1.96 3.02 Intr + 41953 42026 74 0 2 97 88 95 0.644 7.69 3.03 Intr + 47931 48012 82 0 1 71 56 54 0.420 -0.78 3.04 Intr + 55039 55117 79 0 1 72 69 83 0.301 3.01 3.05 Intr + 78972 79222 251 1 2 7 1 245 0.004 3.93 3.06 Intr + 82959 83239 281 2 2 31 77 145 0.095 3.05 3.07 Intr + 83317 83387 71 1 2 115 63 70 0.390 5.11 3.08 Term + 85480 85844 365 2 2 8 32 224 0.112 2.54 3.09 PlyA + 86831 86836 6 1.05 4.00 Prom + 93261 93300 40 -1.35 4.01 Sngl + 95916 96140 225 2 0 73 55 191 0.961 9.19 4.02 PlyA + 96852 96857 6 1.05 5.22 PlyA - 98220 98215 6 1.05 5.21 Term - 102254 102136 119 0 2 45 44 135 0.913 2.52 5.20 Intr - 104866 104783 84 2 0 42 88 113 0.977 5.57 5.19 Intr - 105304 105227 78 2 0 87 82 60 0.937 3.90 5.18 Intr - 106152 106063 90 1 0 83 68 100 0.989 6.55 5.17 Intr - 106941 106852 90 1 0 57 92 84 0.956 4.75 5.16 Intr - 110135 110055 81 0 0 76 93 37 0.758 1.69 5.15 Intr - 114604 114502 103 1 1 79 72 124 0.983 8.73 5.14 Intr - 114842 114712 131 0 2 60 100 41 0.962 1.99 5.13 Intr - 115474 115344 131 0 2 50 102 106 0.522 7.62 5.12 Intr - 116180 116044 137 2 2 114 87 121 0.999 13.05 5.11 Intr - 117997 117849 149 2 2 82 98 144 0.999 13.83 5.10 Intr - 119586 119452 135 1 0 78 84 140 0.999 12.22 5.09 Intr - 120411 120199 213 1 0 86 107 139 0.995 13.36 5.08 Intr - 121763 121622 142 2 1 63 63 173 0.984 11.31 5.07 Intr - 122411 122265 147 2 0 40 89 156 0.998 10.41 5.06 Intr - 123469 123305 165 1 0 61 98 88 0.956 6.34 5.05 Intr - 125541 125522 20 1 2 57 115 -9 0.097 -5.89 5.04 Intr - 126012 125884 129 1 0 53 1 128 0.058 0.35 5.03 Intr - 127647 127583 65 2 2 96 98 -17 0.104 -2.46 5.02 Intr - 132765 132667 99 2 0 116 111 89 0.849 12.31 5.01 Init - 134209 134091 119 1 2 66 110 25 0.591 2.22 5.00 Prom - 138900 138861 40 -3.35 6.00 Prom + 140396 140435 40 -5.05 6.01 Init + 142884 143341 458 2 2 75 53 173 0.069 7.82 6.02 Intr + 157088 157157 70 2 1 97 50 54 0.059 0.77 6.03 Term + 165215 165310 96 1 0 70 39 111 0.264 1.39 6.04 PlyA + 166632 166637 6 1.05 7.09 PlyA - 167301 167296 6 1.05 7.08 Term - 173108 172386 723 2 0 43 43 612 0.998 44.69 7.07 Intr - 174612 174523 90 0 0 38 86 154 0.491 9.47 7.06 Intr - 174782 174741 42 2 0 73 108 30 0.329 1.02 7.05 Intr - 186942 186748 195 0 0 82 78 140 0.888 11.09 7.04 Intr - 190766 190571 196 1 1 79 87 36 0.453 1.10 7.03 Intr - 199212 198899 314 0 2 98 111 127 0.789 10.06 7.02 Intr - 209897 209711 187 1 1 71 88 52 0.009 2.27 7.01 Intr - 227560 227482 79 2 1 76 34 118 0.017 2.89 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 142063 142605 543 0 0 79 41 187 0.815 8.94 S.002 Term + 216005 216128 124 2 1 70 44 124 0.886 3.08 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:37464518_37698726|GENSCAN_predicted_peptide_1|188_aa MKPQTLVVSVTVLKDGVSGVCSFRCSDVSRVSPFWWVHGLADFRSEATDLCRVKLQTFAV SVIAHKGGASGVVRSSDGFMVLLTSGVKPQTFAVSVTALKGGASRVICSSQWVGGLADFR SEAADPAVLQLIKVVQTQTNNAAKLYRMESPQRQCLWREIQQLTGMGPPSSGQPIFVENH TAYICGKP >gi568815585r:37464518_37698726|GENSCAN_predicted_CDS_1|567_bp atgaagccacagacactcgtggtgagtgttacagttcttaaagatggtgtgtccggcgtt tgttccttcagatgttcagacgtgtccagagtttctcccttctggtgggttcatggtctt gctgacttcagaagtgaagccacagacctctgcagagtgaagctgcagaccttcgcggtg agtgttatagctcataaaggtggcgcgtctggagttgttcgttcctctgatgggttcatg gtcttgctgacttcaggagtgaagccacagacctttgcagtgagtgttacagctcttaaa ggtggtgcatccagagttatttgttcctcccagtgggttggtggtctcgctgacttcagg agtgaagctgcagaccctgcggtgttacagctcataaaggtagtgcagacccaaacaaat aatgcagcaaaattatacaggatggaatcccctcaaagacaatgcctctggagagagatc cagcaacttacaggaatgggtccacctagttcagggcagcctatatttgtggaaaaccat acagcctatatttgtggtaaaccataa >gi568815585r:37464518_37698726|GENSCAN_predicted_peptide_2|146_aa MVMEEGSKRVAQKEPHRTCSDTPNLNSEIIHLEKGIRGGQETKSVNSGDNSLHGKMKPEK WRLSPNKPKASQGPVHHVLARQRQSLDTQEQRGPLEALGLPEFAIRKEESGPFSFIFHLT GVMENHDWFPKWSLLGPVTVDGGWML >gi568815585r:37464518_37698726|GENSCAN_predicted_CDS_2|441_bp atggtgatggaagaaggaagtaaaagagtagctcagaaagagccgcacaggacatgctca gatactcctaatcttaacagtgaaataattcatttggagaaaggaataagaggtgggcaa gagaccaagagcgtcaactctggggacaattcactgcatggaaaaatgaaaccagagaag tggagactttcccctaataagccaaaggcttcccagggccctgtccatcatgtcctggca aggcaaaggcagtctcttgatacacaagagcaaagagggcctctggaagcactgggttta ccagagtttgcaatcagaaaagaagaaagtggccctttctccttcatcttccacttgaca ggtgtgatggagaaccatgactggttcccaaagtggtcacttcttggaccagtgactgta gatgggggatggatgctgtga >gi568815585r:37464518_37698726|GENSCAN_predicted_peptide_3|425_aa MAERPIDGSHHRTLCRQPPIPAWSQNHTGSPAMDPNQEKIPDLPEKEFRRKIANQEAFHI RSTKYHSHKSVYNMICKKLRFDCQLHAPSESDNSIVDLGTEPVDVVMEKNQPLSVDQCWL QAIEFSVHLIDLLSILHRCNGFNGIQKAVEDQTGSRPPNSDHDLFLVQAWLWEVLSLELL IRPTTELDPGFSSFPVGMGENLSSSTARAAAANSGMSRESRLQGLCMCLSGSSAQTPHSS LCWSGAPGWELTGHLLSSGLQRSMEELWVSRDSYSFIIFPWSSCFLDESQCVYRDVPVED LVIKVQKGNVGWEPPHSVPIGVLPCGAVRRRPPSSRSRNGRSIESLNCAPKEATDTQCQP RKAARREAVPCKATGAELSEAVGAHLLYQHDLDVKHGVKGDHFGALRFDCPTGFWTCMGP AAPLF >gi568815585r:37464518_37698726|GENSCAN_predicted_CDS_3|1278_bp atggctgagagacccatagatggttcacatcacaggactctgtgcagacaacccccaata ccagcctggagccagaatcacactggttcaccagcaatggatccaaaccaagaaaaaatt cctgatttacctgaaaaagaattcaggagaaaaattgctaaccaagaagctttccacatt agaagcacaaagtaccattcacataagtcagtctataacatgatctgcaaaaagttacgt tttgattgccagcttcatgccccatctgaaagtgataacagtattgtagacctaggcact gaaccagtggatgttgtcatggaaaagaatcagcccctttctgttgaccagtgctggctg caggcgatagagttttcagtgcatctcatcgatttactgagcatacttcacagatgtaat ggtttcaatgggattcagaaagctgtagaagatcagactggcagcagaccaccaaacagt gaccatgacctctttttggtgcaagcttggctttgggaagttctttctttggagcttctt atccgtccaaccactgagctggaccctggatttagttcctttcctgtgggcatgggagag aacctgtcctcctccacagccagagctgctgctgctaattctgggatgtccagggaatca aggctccagggactctgcatgtgcctgagcggcagctctgcccagactccccatagctct ctgtgttggtctggagcccctggttgggagctcactgggcatctcctgagttcagggttg caaaggtccatggaagaattgtgggtctccagggactcttactcattcatcattttccca tggtcgagttgtttccttgatgaatcccaatgtgtctaccgagatgttccagttgaagat ctagtaattaaagtgcagaagggaaatgtagggtgggagcctccacacagtgtccctatt ggggtacttccttgtggagctgtgagaagaaggccaccatcctccagatcccggaatggt agatccattgaaagcttgaactgtgcacctaaagaagccacagacactcaatgccagccc aggaaagcggccaggagggaggctgtaccctgcaaagccacaggggcagagctgtccgag gctgtgggagcccacctcctgtatcagcatgacctggatgtgaaacatggagtcaaagga gatcattttggagctttaagatttgactgccccactggattttggacttgcatgggtcct gcagcccctttgttttaa >gi568815585r:37464518_37698726|GENSCAN_predicted_peptide_4|74_aa MAPSYWETRRGPGPGHFNEGSPRPGHNSISGVSFRRLIDSLSPGTPRSTELDLPVRERVA LVGGDKPTKIHTDI >gi568815585r:37464518_37698726|GENSCAN_predicted_CDS_4|225_bp atggcaccctcctattgggaaacgagaagggggcctggacccggacacttcaatgagggc agtccacgtccaggccacaactccatttcaggggtgagtttccggaggctcattgattcc ctctccccaggaacacctagaagcacagaattagatctcccagtcagagaacgggttgca ttagttggaggagacaaacccaccaagattcacactgacatttga >gi568815585r:37464518_37698726|GENSCAN_predicted_peptide_5|808_aa MIPFLPMFSLLLLLIVNPINANNHYDKILAHSRIRGRDQGPNVCALQQILGTKKKYFSTC KNWYKKSICGQKTTVLYECCPGYMRMEGMKGCPAVLPIDHVYGTLGIVGATTTQRYSDAS KLREEIEGKGSFTYFAPNNMSNRKDIRRGLESNVNVELLNALHSHMINKRMLTKDLKNGM IIPSMYNNLGLFINHYPNGVVTVNCARIIHGNQIATNGVVHVIDRVLTQIGTSIQDFIEA EDDLSSFRAAAITSDILEALGRDGHFTLFAPTNEAFEKLPRGVLERIMGDKVASEALMKY HILNTLQCSESIMGGAVFETLEGNTIEIGCDGDSITVNGIKMVNKKDIVTNNGVIHLIDQ VLIPDSAKQVIELAGKQQTTFTDLVAQLGLASALRPDGEYTLLAPVNNAFSDDTLSMDQR LLKLILQNHILKVKVGLNELYNGQILETIGGKQLRVFVYRTAVCIENSCMEKGSKQGRNG AIHIFREIIKPAEKSLHEKLKQDKRFSTFLSLLEAADLKELLTQPGDWTLFVPTNDAFKG MTSEEKEILIRDKNALQNIILYHLTPGVFIGKGFEPGVTNILKTTQGSKIFLKEVNDTLL VNELKSKESDIMTTNGVIHVVDKLLYPAATKIITKVVEPKIKVIEGSLQPIIKTEGPTLT KVKIEGEPEFRLIKEGETITEVIHGEPIIKKYTKIIDGVPVEITEKETREERIITGPEIK YTRISTGGGETEETLKKLLQEEVTKVTKFIEGGDGHLFEDEEIKRLLQGELCILKRLKQL KENNRLCRKEQQQKVIFEGFRKSANIRG >gi568815585r:37464518_37698726|GENSCAN_predicted_CDS_5|2427_bp atgattccctttttacccatgttttctctactattgctgcttattgttaaccctataaac gccaacaatcattatgacaagatcttggctcatagtcgtatcaggggtcgggaccaaggc ccaaatgtctgtgcccttcaacagattttgggcaccaaaaagaaatacttcagcacttgt aagaactggtataaaaagtccatctgtggacagaaaacgactgtgttatatgaatgttgc cctggttatatgagaatggaaggaatgaaaggctgcccagcagttttgcccattgaccat gtttatggcactctgggcatcgtgggagccaccacaacgcagcgctattctgacgcctca aaactgagggaggagatcgagggaaagggatccttcacttactttgcaccgaataacatg tctaacagaaaggatatccgtagaggtttggagagcaacgtgaatgttgaattactgaat gctttacatagtcacatgattaataagagaatgttgaccaaggacttaaaaaatggcatg attattccttcaatgtataacaatttggggcttttcattaaccattatcctaatggggtt gtcactgttaattgtgctcgaatcatccatgggaaccagattgcaacaaatggtgttgtc catgtcattgaccgtgtgcttacacaaattggtacctcaattcaagacttcattgaagca gaagatgacctttcatcttttagagcagctgccatcacatcggacatattggaggccctt ggaagagacggtcacttcacactctttgctcccaccaatgaggcttttgagaaacttcca cgaggtgtcctagaaaggatcatgggagacaaagtggcttccgaagctcttatgaagtac cacatcttaaatactctccagtgttctgagtctattatgggaggagcagtctttgagacg ctggaaggaaatacaattgagataggatgtgacggtgacagtataacagtaaatggaatc aaaatggtgaacaaaaaggatattgtgacaaataatggtgtgatccatttgattgatcag gtcctaattcctgattctgccaaacaagttattgagctggctggaaaacagcaaaccacc ttcacggatcttgtggcccaattaggcttggcatctgctctgaggccagatggagaatac actttgctggcacctgtgaataatgcattttctgatgatactctcagcatggatcagcgc ctccttaaattaattctgcagaatcacatattgaaagtaaaagttggccttaatgagctt tacaacgggcaaatactggaaaccatcggaggcaaacagctcagagtcttcgtatatcgt acagctgtctgcattgaaaattcatgcatggagaaagggagtaagcaagggagaaacggt gcgattcacatattccgcgagatcatcaagccagcagagaaatccctccatgaaaagtta aaacaagataagcgctttagcaccttcctcagcctacttgaagctgcagacttgaaagag ctcctgacacaacctggagactggacattatttgtgccaaccaatgatgcttttaaggga atgactagtgaagaaaaagaaattctgatacgggacaaaaatgctcttcaaaacatcatt ctttatcacctgacaccaggagttttcattggaaaaggatttgaacctggtgttactaac attttaaagaccacacaaggaagcaaaatctttctgaaagaagtaaatgatacacttctg gtgaatgaattgaaatcaaaagaatctgacatcatgacaacaaatggtgtaattcatgtt gtagataaactcctctatccagcagcaactaaaattataaccaaagttgtggaaccaaaa attaaagtgattgaaggcagtcttcagcctattatcaaaactgaaggacccacactaaca aaagtcaaaattgaaggtgaacctgaattcagactgattaaagaaggtgaaacaataact gaagtgatccatggagagccaattattaaaaaatacaccaaaatcattgatggagtgcct gtggaaataactgaaaaagagacacgagaagaacgaatcattacaggtcctgaaataaaa tacactaggatttctactggaggtggagaaacagaagaaactctgaagaaattgttacaa gaagaggtcaccaaggtcaccaaattcattgaaggtggtgatggtcatttatttgaagat gaagaaattaaaagactgcttcagggagaattatgcattctgaaacgactcaagcaatta aaggagaataacagactatgtcgaaaggaacaacaacaaaaagtcatttttgaaggtttt cgcaaatctgccaacataagaggttaa >gi568815585r:37464518_37698726|GENSCAN_predicted_peptide_6|207_aa MEENLGNTIQDIDMGKDLMTKAPKAMATKAKIDKWDLIKLKSFCTAKETIIRVNRQPTEW EKSFAIYPSDKGLIFRTYKELKQINKKKTNNPIKKWVKDMNRHFSKEDIYAANKHMKKSP SSLLIRETQIKTTMRYHLRPVRMAIVKKSGNNRIPTSKKALARCNPLTLDFSASIITEST PGEDAVRIVKITTKDLDDYLNLVAEAA >gi568815585r:37464518_37698726|GENSCAN_predicted_CDS_6|624_bp atggaagaaaacctaggcaataccattcaggacatagatatgggcaaagacctcatgact aaagcaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaacta aagagcttctgcacagcaaaagaaactatcatcagagtgaacaggcaacctacagaatgg gaaaaaagttttgccatctatccatctgacaaagggctaatatttagaacctacaaggaa cttaaacaaattaacaagaaaaaaacaaacaaccccatcaaaaagtgggtgaaggatatg aacagacacttttcaaaagaagacatttatgcggccaacaaacatatgaaaaaaagccca tcatcactgctcattagagaaacgcaaatcaaaaccacaatgagataccatctcaggcca gttagaatggcgattgttaaaaagtcaggaaacaacagaatccccaccagcaaaaaggct ctcgccagatgcaaccccttgaccttggacttctcagcttccataattacagaatctact cctggtgaagatgctgtgaggattgttaaaataactacaaaggatttagatgattattta aacttagttgctgaagctgcgtag >gi568815585r:37464518_37698726|GENSCAN_predicted_peptide_7|608_aa XHRQVRLEQARSTTNHRRVDDITVGPGDSIEIILTMNLSIIAGFIWGEIKQMWDGGLQDY IHDWWNLMDFVMNSLYLATISLKIVAFVKYSALNPRESWDMWHPTLVAEALFAIANIFSS LRLISLFTANSHLGPLQISLGRMLLDILKFLFIYCLVLLAFANGLNQLYFYYEETKGLTC KGIRCEKQNNAFSTLFETLQSLFWSIFGLINLYVTNVKAQHEFTEFVGATMFGTYNVISL VVLLNMLIAMMNNSYQLIADHADIEWKFARTKLWMSYFEEGGTLPTPFNVIPSPKSLWYL IKWIWTHLCKKKMRRKPESFGTIGRRAADNLRRHHQYQEVMRNLVKRYVAAMIRDAKTEE GLTEENFKELKQDISSFRFEVLGLLRGSKLSTIQSANASKESSNSADSDEKSDSEGNSKD KKKNFSLFDLTTLIHPRSAAIASERHNISNGSALVVQEPPREKQRKVNFVTDIKNFGLFH RRSKQNAAEQNANQIFSVSEEVARQQAAGPLERNIQLESRGLASRGDLSIPGLSEQCVLV DHRERNTDTLGLQVGKRVCPFKSEKVVVEDTVPIIPKEKHAKEEDSSIDYDLNLPDTVTH EDYVTTRL >gi568815585r:37464518_37698726|GENSCAN_predicted_CDS_7|1827_bp ncacatcgacaggtcagacttgaacaggcaaggtccaccaccaaccatcgtcgagtggat gatattaccgtgggtcctggtgattctattgaaatcatacttacaatgaatctttctata attgcaggcttcatatggggagaaattaaacagatgtgggatggcggacttcaggactac atccatgattggtggaatctaatggactttgtaatgaactccttatatttagcaacaatc tccttgaaaattgttgcatttgtaaagtacagtgcccttaatccacgagaatcatgggac atgtggcatcccactctggtggcagaggctttatttgctattgcaaacatcttcagttct ctgcgtctgatctcactgtttactgcaaattctcacctgggacctctgcaaatatctctg ggaagaatgctcctggacattttgaagtttctattcatatactgccttgtgttgctagca tttgcaaatggcctaaatcaattgtacttctattatgaagaaacgaaagggttaacctgc aaaggcataagatgtgaaaagcagaataatgcattttcaacgttatttgagacactgcag tccctgttttggtcaatatttgggctcatcaatttatatgtgaccaatgtcaaagcacag catgaatttactgagtttgttggtgccaccatgtttgggacatacaatgtcatctctctg gttgttctactcaacatgttaatagctatgatgaataattcttaccaactgattgctgac catgcagatatagaatggaaatttgcacgaacaaagctttggatgagttattttgaagaa ggaggtactctgcctactcccttcaatgtcatcccgagccccaagtctctctggtacctg atcaaatggatctggacacacttgtgcaagaaaaagatgagaagaaagccagaaagtttt ggaacaatagggaggcgagctgctgataacttgagaagacatcaccaataccaagaagtt atgaggaacctggtgaagcgatacgttgctgcaatgattagagatgctaaaactgaagaa ggcctgaccgaagagaactttaaggaactaaagcaagacatttctagtttccgctttgaa gtcctgggattactaagaggaagcaaactttccacaatacaatctgcgaatgcctcgaag gagtcttcaaattcggcagactcagatgaaaagagtgatagcgaaggtaatagcaaggac aagaaaaagaatttcagcctttttgatttaaccaccctgattcatccgagatcagcagca attgcctctgaaagacataacataagcaatggctctgccctggtggttcaggagccgccc agggagaagcagagaaaagtgaattttgtgaccgatatcaaaaactttgggttatttcat agacgatcaaaacaaaatgctgctgagcaaaatgcaaaccaaatcttctctgtttcagaa gaagttgctcgtcaacaggctgcaggaccacttgagagaaatattcaactggaatctcga ggattagcttcacggggtgacctgagcattcccggtctcagtgaacaatgtgtgttagta gaccatagagaaaggaatacggacacactggggttacaggtaggaaagagagtgtgtcca ttcaagtcagagaaggtggtggtggaggacacggttcctataataccaaaggagaaacat gcaaaagaagaggactctagtatagactatgatctaaacctcccagacacagtcacccac gaagattacgtgaccacaagattgtga