GENSCAN 1.0    Date run: 4-Nov-116    Time: 09:43:45

Sequence gi568815580r:32312217_32513176 : 200960 bp : 38.81% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 PlyA -    456    451    6                               1.05
 1.05 Term -   6572   6417  156  2  0   48   54    89 0.003  -1.55
 1.04 Intr -  18282  18199   84  0  0   70  103    25 0.041   1.30
 1.03 Intr -  19500  19410   91  2  1   94   89    38 0.061   3.58
 1.02 Intr -  20647  20459  189  0  0  109   40    89 0.061   3.98
 1.01 Init -  29070  28922  149  0  2   83   74    40 0.155   1.71
 1.00 Prom -  32503  32464   40                              -5.45

 2.00 Prom +  35865  35904   40                              -2.95
 2.01 Init +  40110  40192   83  2  2   70   61   149 0.993  10.89
 2.02 Intr +  40468  40567  100  1  1  140  -24    38 0.030  -3.01
 2.03 Intr +  51478  51577  100  0  1  126   84    61 0.915   8.26
 2.04 Intr +  55467  55619  153  1  0    8   94   191 0.907  10.82
 2.05 Intr +  65073  65244  172  1  1   30   57   122 0.114   1.38
 2.06 Term +  71079  71193  115  0  1   75   38   100 0.286   0.76
 2.07 PlyA +  72978  72983    6                               1.05

 3.00 Prom +  83381  83420   40                              -4.75
 3.01 Init +  84268  84694  427  0  1   81   58   335 0.674  26.31
 3.02 Term +  87087  88507 1421  1  2   78   45   443 0.628  29.14
 3.03 PlyA +  89110  89115    6                               1.05

 4.02 PlyA -  89763  89758    6                               1.05
 4.01 Sngl - 100960  99998  963  1  0   83   32  1135 0.724 103.82
 4.00 Prom - 107949 107910   40                              -4.55

 5.00 Prom + 112528 112567   40                              -3.25
 5.01 Init + 123957 124024   68  2  2   67   58    63 0.347   1.80
 5.02 Term + 128460 128631  172  0  1   73   42   184 0.978   8.62
 5.03 PlyA + 128845 128850    6                               1.05

 6.04 PlyA - 129450 129445    6                               1.05
 6.03 Term - 132817 132718  100  0  1   68   49    83 0.009  -0.98
 6.02 Intr - 153290 153086  205  0  1   71  100    59 0.221   2.74
 6.01 Init - 158212 158092  121  1  1   91  113   115 0.426  14.98
 6.00 Prom - 160676 160637   40                              -4.15

 7.03 PlyA - 160846 160841    6                               1.05
 7.02 Term - 172683 172618   66  0  0  108   39   119 0.837   5.96
 7.01 Init - 185109 184729  381  0  0   59   -3   197 0.032   4.62
 7.00 Prom - 187573 187534   40                              -4.05

 8.00 Prom + 193176 193215   40                              -5.45
 8.01 Init + 199976 200316  341  1  2   30   66   267 0.960  15.48
 8.02 Intr + 200341 200450  110  1  2  -36  -48   234 0.043  -3.59
 8.03 Intr + 200628 200815  188  1  2   82   87   268 0.101  24.59

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  40468  40573  106  1  1  140   49    34 0.893   1.60
S.002 Init + 140886 140984   99  2  0   86   85    72 0.806   7.01

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815580r:32312217_32513176|GENSCAN_predicted_peptide_1|222_aa
MDEAGNHHSQQTIARMENQTLHALTHRWELNNENTWTQGGEHHTPGPVMGKESTQASGAQ
TGGTSFKSLETTGSISEASSAKHCTELFSRFTTFNPDSVPSDGVVGDTAGTVWPGVLKGE
PCHLGTCYRCVLDPHPTPSESDTIQGIHVQVCYMSTLHDADVCDTNDPVTHMDTVGLSCH
SISFYLIHKVDCHAEGMADSKTTAAKEGPPQVMLTATGDGEQ
>gi568815580r:32312217_32513176|GENSCAN_predicted_CDS_1|669_bp
atggatgaagctggaaaccatcattctcagcaaactatcgcaaggatggaaaaccaaaca
ctgcatgctctcactcacaggtgggaattgaacaatgagaacacttggacacagggtggg
gaacatcacacaccggggcctgtcatggggaaagagtccacacaggcttctggggcacag
acaggagggaccagctttaagagcttagaaacaacaggttccatctctgaagcctcaagt
gccaagcactgcactgaactcttctctaggtttactacgtttaatcccgacagtgtcccg
tcagatggtgtggtaggagacacagcagggacagtgtggccaggagttctcaaaggggaa
ccttgtcacctgggaacttgttatagatgcgttcttgatccccatccgacaccttctgaa
tcagacacgattcaggggatacatgtgcaggtttgttatatgagtacattgcatgatgct
gacgtttgtgatacaaatgatcctgtcacccatatggacactgtgggcctcagctgtcat
tccatttctttttatttgatacacaaagtagactgccatgctgaagggatggcagacagt
aaaactacagctgcaaaagagggcccaccgcaggtcatgctgactgccactggtgatggc
gaacagtga
>gi568815580r:32312217_32513176|GENSCAN_predicted_peptide_2|240_aa
MHPVASDTWGGKSILPRWPEQGILFLSLTRLRRSQLGTPVITPVKTTPVIMREGKNLHGR
REHWNTVNFLRQSLTAYDFQSSTCKRKGYFEEAQGIQGWQTRFSPNSKQARETHTNHDDH
TKQQGDKEEKPSTFTGGMAVCLREPAIRKKGMNLSLRQASQRTCSKVVPTTWTSQSLCHP
RSSILIGRVRHYPAAKREDSGVSTLNNYRNGSGLTESQHKTSLDSDSQRQVIPSLRNGAE
>gi568815580r:32312217_32513176|GENSCAN_predicted_CDS_2|723_bp
atgcatcccgtggccagcgacacctggggcggcaagagcatcctgcctcgctggcctgag
cagggaatcctgttcctttcattgaccagattaagaagatcacagctgggaacacctgtg
atcacacctgtgaagaccacacctgtgattatgagagaaggaaagaatctccatggaaga
agggaacactggaacactgtgaactttttgagacaaagccttacagcttatgattttcag
tcatcaacatgtaaaaggaaaggttattttgaagaggctcaagggatacaaggatggcag
acacggttcagtcccaattctaagcaagccagggagacacatacgaaccatgatgaccat
acaaagcaacagggagataaggaggagaagccaagcaccttcacaggagggatggctgtc
tgcctacgggagccagccataaggaagaaggggatgaacttgagtctccgccaggcatca
cagagaacgtgttccaaggtggttcctaccacctggacatctcagagcctgtgtcatcca
aggtctagcattcttattgggcgagtgaggcactacccagctgccaaacgtgaagacagt
ggtgtaagtacactcaacaactacagaaatgggtcagggttaacagaaagccagcacaag
acttccctggatagtgactcacaaaggcaagtgattccaagcctaagaaatggagcagag
tga
>gi568815580r:32312217_32513176|GENSCAN_predicted_peptide_3|615_aa
MARELRDECTSFSSRCDQLEERVSVIEDQMNEMKREEKFREKRVKRNEQSLQEIWDYVKR
PNLRLIGVPESDRENGTKVENTVQDIIQENFPNLARPANIQIQEIQRMPQRYSSRRATPR
HITVRFTKAEMKEKMLRAAREKVLEVLARAIRQEKEIKGIQLGKEEVKLSVFTDDMIVYL
ENPIVSAQNLLKLIGNFSKVSGYKINVQKSQAFLYNSNRQTESQLMSEHPFTIASKRIKY
LGIQLRREVKDLFKENYKPLLNEIKQDTNKWKNIPCSWIGRINIVKMAILPKVIYRFNAI
PIKLPMTFFTELEKTTLKFIWNQKRACTVKSILSKKNKAGGITLPDFKLYYKATVTKTAW
YWYQNRDIDQWNRTEPSEIIPHIYNHLLFDKPNKNKQWGKDSLFNKWCWENWLAICRKLK
LDLFLTPYTKINSRWIKDLHVRPKTIKTLEENLGNTIQDTGMGKGFMSKTPKAMATKAKI
DKWDLIKLKSFCTAKETTIRVNRQPTEWEKILTIYSSDKGLISRIYKELKQIYKKKTNNP
INKWVKDMNRHFSKEDIYAANRHMKKCSSSLAIRDMQIKTTIRYHLTPVRMAIIKKSGNN
RCWRGCGEIGTLLHC
>gi568815580r:32312217_32513176|GENSCAN_predicted_CDS_3|1848_bp
atggcacgagaactacgtgacgaatgcacaagcttcagtagccgatgcgatcaactggaa
gaaagggtatcagtgattgaagatcaaatgaatgaaatgaagcgagaagagaagtttaga
gaaaaaagagtaaaaagaaacgaacaaagcctccaagaaatatgggactatgtgaaaaga
ccaaatctacgtctgattggtgtacctgaaagtgacagggagaatggaaccaaggtggaa
aacactgtgcaggatattatccaggagaacttccccaatctagcaaggccggccaacatt
caaatacaggaaatacagagaatgccacaaagatactcctcgagaagagcaactccaaga
cacataactgtcagattcaccaaagctgaaatgaaggaaaaaatgttaagggcagccaga
gagaaagtgttggaagttctggccagggcaatcaggcaggagaaagaaataaagggtatt
caattaggaaaagaggaagtcaaattgtccgtgtttacagatgacatgattgtatattta
gaaaaccccatcgtctcagcccaaaatctccttaagctgataggcaacttcagcaaagtc
tcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatacaacagtaacagacaa
acagagagtcaactcatgagtgaacacccattcacaattgcttcaaagagaataaaatac
ctaggaatccaacttagaagggaagtgaaggacctcttcaaggagaactacaaaccactg
ctcaacgaaataaaacaggacacgaacaaatggaagaacattccatgctcatggatagga
agaatcaatatcgtgaaaatggccatactgcccaaggtaatttatagattcaatgccatc
cccatcaagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcata
tggaaccaaaaaagagcctgcactgtcaagtcaatcctaagcaaaaagaacaaagctgga
ggcatcacgctacctgacttcaaactatactacaaggctacagtaaccaaaacagcatgg
tactggtaccaaaacagagatatagaccaatggaacagaacagagccctcagaaataata
ccacacatctataaccatctgctctttgacaaacctaacaaaaacaagcaatggggaaag
gattccctatttaataaatggtgctgggaaaactggctagccatatgtagaaagctgaaa
ctggatctcttccttacaccttatacaaaaattaattcaagatggattaaagacttacat
gttagacctaaaaccataaaaaccctagaagaaaacctaggcaataccattcaggacaca
ggcatgggcaagggcttcatgtctaaaacaccaaaagcaatggcaactaaagccaaaatt
gacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcaga
gtgaataggcaacctacagaatgggagaaaattcttacaatctactcatctgacaaaggg
ctaatatccagaatctacaaagaactcaaacaaatttacaagaaaaaaacaaacaacccc
atcaacaagtgggtgaaggatatgaacagacatttctcaaaagaagacatttatgcagcc
aacagacacatgaaaaaatgctcatcatcactggccatcagagacatgcaaatcaaaacc
acaataagataccatctcacaccagttagaatggcgatcattaaaaagtcaggaaacaac
aggtgctggagaggatgtggagaaataggaacacttttacactgttga
>gi568815580r:32312217_32513176|GENSCAN_predicted_peptide_4|320_aa
MSKSESPKEPEQLRKLFIGGLSFETTDESLRSHFEQWGTLTDCVVMRDPNTKRSRGFGFV
TYATVEEVDAAMNARPHKVDGRVVEPKRAVSREDSQRPGAHLTVKKIFVGGIKEDTEEHH
LRDYFEQYGKIEVIEIMTDRGSGKKRGFAFVTFDDHDSVDKIVIQKYHTVNGHNCEVRKA
LSKQEMASASSSQRGRSGSGNFGGGRGGGFGGNDNFGHGGNFSGRGGFGGSRAGGRYGGS
GDGSNGFGNDGSNFGGGGSYNDFGNYNNQSSNFGPMKGGNFGGRNSGPYGGGGQYFAKPR
NQGGYGGSSSSSSCVSGRRF
>gi568815580r:32312217_32513176|GENSCAN_predicted_CDS_4|963_bp
atgtctaagtcagagtctcctaaagagcccgaacagctgaggaagctcttcattggaggg
ttgagctttgaaacaactgatgagagcctgaggagccattttgagcaatggggaacgctc
acggactgtgtggtaatgagagatccaaacaccaagcgctccaggggctttgggtttgtc
acatatgccactgtggaggaggtggatgcagctatgaatgcaaggccacacaaggtggat
ggaagagttgtggaaccaaagagagctgtctccagagaagattctcaaagaccaggtgcc
cacttaactgtgaaaaagatatttgttggtggcattaaagaagacactgaagaacatcac
ctaagagattattttgaacagtatggaaaaattgaagtgattgaaatcatgactgaccga
ggcagtggcaagaaaaggggctttgcctttgtaacctttgacgaccatgactccgtggat
aagattgtcattcagaaataccatactgtgaatggccacaactgtgaagttagaaaagcc
ctgtcaaagcaagagatggctagtgcttcatccagccaaagaggtcgaagtggttctgga
aactttggtggtggtcgtggaggtggtttcggtgggaatgacaacttcggtcatggagga
aacttcagtggtcgtggtggctttggtggcagccgtgctggtggtagatatggtggcagt
ggggatggctctaatggatttggtaatgatggaagcaattttggaggtggtggaagctac
aatgattttggcaattacaacaatcagtcttcaaattttggacccatgaagggaggaaat
tttggaggcagaaactctggcccctatggcggtggaggccagtactttgcaaaaccacga
aaccaaggtggctatggcggttccagcagcagcagtagctgtgtcagtggcagaagattt
taa
>gi568815580r:32312217_32513176|GENSCAN_predicted_peptide_5|79_aa
MEEVLDERSPHKNFRVGEIRKGITYRMSVSSLEENSVDNSGIPKKCCINNALDGTEDHIV
WKYMDICSCVEKQRRRTQL
>gi568815580r:32312217_32513176|GENSCAN_predicted_CDS_5|240_bp
atggaagaggttttagatgagcgctctcctcacaagaatttcagagttggggaaatacga
aaagggataacttacagaatgagtgtcagtagcttggaagaaaattctgtagacaacagt
ggcattcctaagaaatgctgtatcaacaatgctcttgatggcacagaggaccatattgtg
tggaaatacatggacatctgcagctgcgtggaaaagcaacgcagaagaactcaactgtga
>gi568815580r:32312217_32513176|GENSCAN_predicted_peptide_6|141_aa
MDPAPSLGCSLKDVKWSSVAVPLDLLVSTYRLPQIARLDNGIRSDSKSRMCWDCGKFGPY
FVLRGFCSFIIYWPLGAKTWTLRPQGGSKWLFVNIRGLSRIHVLSLKLKCSGISGHGQIH
TDGWEGRQDTLQEGSIAYAES
>gi568815580r:32312217_32513176|GENSCAN_predicted_CDS_6|426_bp
atggacccggcgccctcgctgggctgcagcctcaaggatgtgaagtggagctcggtggcc
gtgccgctcgacctcctggtcagcacttaccggctgccccagatcgcgcgcctggacaac
ggaatcagatcagattctaagtcccggatgtgttgggactgtgggaaatttggtccttac
tttgtgctcagaggcttctgcagctttatcatttactggcctctaggagcaaaaacatgg
acacttaggccccagggaggctccaagtggctgtttgtaaacatcagaggtctttctcgg
atccatgttttatcattgaaattgaagtgttctgggatctcaggtcatggacagatccat
actgatggttgggaagggaggcaggatacgctacaagaaggaagtattgcctatgctgag
tcttga
>gi568815580r:32312217_32513176|GENSCAN_predicted_peptide_7|148_aa
MIISIDTEKAFNKIQHCFMIKTLSKIGIEGTYLSVIKAIYDKLTANIILNGEKLKAFPVR
MGTRQRCPLSSLLFNIVLEVLARAIRQEKKIKGIQIGKEEIKLSLIADDMIIYLENAKDF
SRRLLELPTQCEDDEDKGFYDDPLPLNE
>gi568815580r:32312217_32513176|GENSCAN_predicted_CDS_7|447_bp
atgatcatctcaatagatacagaaaaagcattcaacaaaatccagcattgctttatgatt
aaaactctcagcaaaattggcatagaagggacatacctcagtgtaataaaagccatatat
gacaaactcacagccaacataatactgaatggggaaaaattgaaagcattccctgtgaga
atgggaacaagacaaagatgcccactctcatcactcctcttcaacatagtactggaagtc
ctagccagagcaatcagacaagagaaaaaaataaaaggcatccaaatcggtaaagaggaa
atcaaactgtcactgattgctgatgatatgatcatttaccttgaaaacgctaaagacttc
tccagaaggctcctggaactacctactcaatgtgaagatgatgaggacaaaggcttttat
gatgatccacttccacttaatgaatag
>gi568815580r:32312217_32513176|GENSCAN_predicted_peptide_8|213_aa
MNRRGLNQYLNAVENAQHVEVESIPLPDMPHASSNILIQEIPLPGAQPPSILKKPSACGP
PTQAVSILLFLGRGVPHLPPGRKPLGPPPGPPPPQVVQMYGRKVGVALDLPPPSPEIAQR
GHDDDVFSTSEDDGYPEDVDQDKHDDSTDDSQEIPEEGREVEEFSEDNDEDDSDDSKAEK
QSQKHNQEELHSDGTSTASSQQQAPRQSVPPSQ
>gi568815580r:32312217_32513176|GENSCAN_predicted_CDS_8|639_bp
atgaacagaagaggactcaaccaatatttgaatgctgtcgagaatgcccagcacgtggaa
gtggagagtattcctttaccagatatgccacatgcttcttctaacattttgatccaggaa
attccacttcctggtgcccagccaccctccatccttaagaaaccctcagcctgtggacct
ccaactcaggcagtttctatccttctcttcttgggacgtggtgttccacatttgcctcct
ggcagaaaacctcttggccctccccctggtccaccccctcctcaagtcgtgcagatgtat
ggccgtaaagtgggtgttgccctagatcttccccctcctagtcctgaaattgcccagcga
ggtcatgatgatgatgtttttagcaccagtgaagatgatggctatcctgaggacgtggat
caagataagcatgatgacagtactgatgacagtcaagaaatccctgaggagggacgggaa
gtagaggaattttcagaggacaatgatgaagatgattctgatgactctaaagcagaaaaa
caatcacaaaaacacaatcaagaggaactgcattctgatggcacatccactgcttcttca
cagcagcaggctccccggcagtctgttcctccttctcag