GENSCAN 1.0 Date run: 6-Nov-116 Time: 07:02:38 Sequence gi568815597f:9972026_10279953 : 307928 bp : 44.56% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 49 163 115 0 1 67 91 79 0.518 6.49 1.02 Intr + 3567 3750 184 1 1 53 63 244 0.998 17.25 1.03 Intr + 9006 9145 140 0 2 66 76 42 0.994 0.91 1.04 Term + 10276 10676 401 2 2 68 49 313 0.969 20.68 1.05 PlyA + 12519 12524 6 1.05 2.00 Prom + 24943 24982 40 -4.56 2.01 Init + 25234 25306 73 0 1 95 82 183 0.776 19.63 2.02 Intr + 35545 35723 179 2 2 41 93 244 0.784 19.84 2.03 Term + 77187 77309 123 2 0 47 36 164 0.709 5.48 2.04 PlyA + 77555 77560 6 -0.45 3.08 PlyA - 78531 78526 6 1.05 3.07 Term - 82603 82415 189 1 0 -36 48 183 0.093 -0.65 3.06 Intr - 86962 86702 261 1 0 -11 82 156 0.248 2.78 3.05 Intr - 87609 87316 294 0 0 43 49 147 0.409 3.51 3.04 Intr - 88371 88272 100 2 1 81 103 7 0.692 1.61 3.03 Intr - 88560 88418 143 0 2 34 111 84 0.947 4.45 3.02 Intr - 90623 90537 87 2 0 106 86 25 0.839 4.27 3.01 Init - 92065 91916 150 1 0 56 110 87 0.903 7.74 3.00 Prom - 93058 93019 40 -9.16 4.00 Prom + 93114 93153 40 -4.46 4.01 Init + 98066 98068 3 1 0 108 81 0 0.563 1.30 4.02 Intr + 100078 100189 112 0 1 99 115 3 0.563 4.05 4.03 Intr + 123436 123571 136 2 1 88 93 49 0.404 5.03 4.04 Intr + 129083 129170 88 2 1 31 81 132 0.400 6.77 4.05 Intr + 130923 131067 145 2 1 88 97 86 0.985 9.36 4.06 Intr + 133491 133719 229 1 1 76 68 141 0.992 7.83 4.07 Intr + 134295 134558 264 0 0 88 36 87 0.494 0.13 4.08 Intr + 135197 135349 153 2 0 62 69 282 0.850 22.89 4.09 Intr + 145434 145575 142 0 1 13 85 167 0.497 9.26 4.10 Intr + 147488 147588 101 1 2 124 80 67 0.971 8.41 4.11 Intr + 149937 150051 115 0 1 73 96 120 0.993 11.75 4.12 Intr + 154769 154852 84 1 0 110 53 45 0.903 3.22 4.13 Intr + 157367 157423 57 1 0 85 99 5 0.565 0.38 4.14 Intr + 158475 158591 117 2 0 104 95 8 0.931 3.76 4.15 Intr + 158690 158788 99 1 0 145 110 64 0.995 14.61 4.16 Intr + 160344 160457 114 2 0 22 91 60 0.542 0.34 4.17 Intr + 162963 163161 199 2 1 80 65 105 0.529 6.32 4.18 Intr + 165042 165180 139 1 1 55 77 71 0.701 2.22 4.19 Intr + 172915 173014 100 1 1 60 76 55 0.883 1.61 4.20 Intr + 174938 175065 128 1 2 57 72 179 0.992 12.68 4.21 Intr + 177159 177257 99 0 0 92 98 9 0.734 1.53 4.22 Intr + 179301 179536 236 0 2 105 96 166 0.986 16.43 4.23 Intr + 186331 186457 127 2 1 115 89 51 0.995 7.64 4.24 Intr + 189117 189261 145 0 1 74 76 163 0.816 13.98 4.25 Intr + 196111 196245 135 0 0 73 117 156 0.998 17.76 4.26 Intr + 199113 199304 192 2 0 58 82 340 0.860 30.09 4.27 Intr + 206619 206805 187 2 1 109 -70 273 0.729 12.86 4.28 Intr + 207391 207537 147 2 0 80 80 339 0.618 32.51 4.29 Term + 207959 208077 119 0 2 56 53 118 0.687 3.80 4.30 PlyA + 208183 208188 6 1.05 5.00 Prom + 234886 234925 40 -0.66 5.01 Init + 238715 238768 54 1 0 75 92 192 0.524 17.58 5.02 Intr + 245852 245877 26 1 2 125 68 11 0.021 -0.38 5.03 Intr + 260225 260409 185 2 2 84 81 38 0.290 2.13 5.04 Intr + 284222 284298 77 0 2 102 70 62 0.955 5.03 5.05 Intr + 286468 286647 180 0 0 109 67 86 0.996 8.66 5.06 Intr + 289880 289945 66 1 0 90 105 67 0.833 7.70 5.07 Intr + 295355 295533 179 1 2 104 80 196 0.998 19.12 5.08 Intr + 296127 296238 112 0 1 73 97 43 0.998 4.08 5.09 Intr + 299477 299554 78 1 0 86 63 98 0.989 6.85 5.10 Intr + 300216 300281 66 2 0 65 103 17 0.499 0.00 5.11 Intr + 300411 300525 115 2 1 126 42 26 0.724 1.82 5.12 Intr + 304296 304374 79 1 1 75 65 79 0.986 2.91 5.13 Intr + 305961 306103 143 0 2 33 92 196 0.586 14.50 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:9972026_10279953|GENSCAN_predicted_peptide_1|279_aa MENSEKTEVVLLACGSFNPITNMHLRLFELAKDYMNGTGRYTVVKGIISPVGDAYKKKGL IPAYHRVIMAELATKNSKWVEVDTWESLQKEWKETLKVLRHHQEKLEASDCDHQQNSPTL ERPGRKRKWTETQDSSQKKSLEPKTKAVPKVKLLCGADLLESFAVPNLWKSEDITQIVAN YGLICVTRAGNDAQKFIYESDVLWKHRSNIHVVNEWIANDISSTKIRRALRRGQSIRYLV PDLVQEYIEKHNLYSSESEDRNAGVILAPLQRNTAEAKT >gi568815597f:9972026_10279953|GENSCAN_predicted_CDS_1|840_bp atggaaaattccgagaagactgaagtggttctccttgcttgtggttcattcaatcccatc accaacatgcacctcaggttgtttgagctggccaaggactacatgaatggaacaggaagg tacacagttgtcaaaggcatcatctctcctgttggtgatgcctacaagaagaaaggactc attcctgcctatcaccgggtcatcatggcagaacttgctaccaagaattctaaatgggtg gaagttgatacatgggaaagtcttcagaaggagtggaaagagactctgaaggtgctaaga caccatcaagagaaattggaggctagtgactgtgatcaccagcagaactcacctactcta gaaaggcctggaaggaagaggaagtggactgaaacacaagattctagtcaaaagaaatcc ctagagccaaaaacaaaagctgtgccaaaggtcaagctgctgtgtggggcagatttattg gagtcctttgctgttcccaatttgtggaagagtgaagacatcacccaaatcgtggccaac tatgggctcatatgtgttactcgggctggaaatgatgctcagaagtttatctatgaatcg gatgtgctgtggaaacaccggagcaacattcacgtggtgaatgaatggatcgctaatgac atctcatccacaaaaatccggagagccctcagaaggggccagagcattcgctacttggta ccagatcttgtccaagaatacattgaaaagcataatttgtacagctctgagagtgaagac aggaatgctggggtcatcctggcccctttgcagagaaacactgcagaagctaagacatag >gi568815597f:9972026_10279953|GENSCAN_predicted_peptide_2|124_aa MPADLSGTWTLLSSDNFEGYMLALGIDFATRKIAKLLKPQKVIEQNGDSFTIHTNSSLRN YFVKFKVGEEFDEDNRGLDNRKCKVLRADYVEMLLEYDEDLVFTFSDLRLQLGTQERNQL DFKC >gi568815597f:9972026_10279953|GENSCAN_predicted_CDS_2|375_bp atgcccgccgacctcagcggtacttggaccctgctcagcagcgacaacttcgagggctac atgctggccctaggtattgactttgccactcgtaaaatagccaagttgctgaagccacag aaagtgattgagcagaatggggattcttttaccatccacacgaacagcagcctaaggaac tactttgtgaaatttaaagttggagaagaatttgatgaagataacagaggcctggacaac agaaaatgcaaggtactgagggctgactatgtggagatgctgctcgaatatgacgaagac ctggtctttaccttcagtgacctcaggctgcagctggggacacaggaacgcaaccagcta gacttcaagtgctaa >gi568815597f:9972026_10279953|GENSCAN_predicted_peptide_3|407_aa MSVYGHGEAQTRKTSCNKLMTSCEYHPKTLNLFVKARKTSFEGFKQGNNQERFKVINKKC IKAAGTAAYACSPNTLGAKENAIHKHSGLIRTFWNQLSWGSQRGNTVMGCITTFQSMSDH QILLVVRYTKTHCYCTTAHRIQYSYMQYRFVAQEQYLGHTPRFHGFMHSLFVSWELHELN GLQAPPVQFIDNGNTSGQAQLHDGPFRESLPRLDCARKAAARSLNQHLFPLLDLGNYFFI PKGQSSDNSVLQTLTEEQAQATAPSRHADLYLALMVLDNSFCFVEASQTLIVPLQEVHQV ITTGSHIWSVASNTVQGVGIAVFRPEVTQMSKSQPAFLSASQKAQRINMHIEHIKRSKGQ HHFLKLMKENDQKKKKAKEKGTREAHCVRTNGKKPELLEPITYEFMA >gi568815597f:9972026_10279953|GENSCAN_predicted_CDS_3|1224_bp atgagtgtgtatggacatggtgaggcacagacaagaaagacaagctgcaacaagctcatg acgagctgtgaataccatcctaagacactgaacttgttcgtgaaggcaagaaagacatct tttgaaggatttaagcaggggaataaccaggaaagatttaaagtgattaacaaaaaatgc attaaggcggccggtacagcggcttacgcctgcagccccaacactttgggagccaaggaa aatgccatacataaacattcaggactcatccgaaccttctggaaccagctctcctgggga agtcagagaggaaatacagtcatgggctgcataacgacatttcagtcaatgtcagatcac cagatcttactggtggtccgatacacaaaaactcactgttattgtacaactgcccacagg attcagtatagttacatgcagtacaggtttgtagcccaggagcaatacctaggccatacc ccacgcttccatggctttatgcacagcctcttcgtctcctgggaactgcatgagctcaat gggcttcaagctcctcctgttcaattcatagacaatgggaataccagtgggcaggctcag ctccatgatggccccttccgggagagcctcccaaggctcgactgtgccagaaaagctgct gccaggagcctcaaccaacacctgtttcccctcctggatctggggaattatttcttcatt ccaaaagggcagagctctgacaatagtgtccttcagactcttacagaagagcaggcacaa gctaccgcgcccagccggcatgcagatctttatctggccctcatggttcttgacaacagc ttctgctttgttgaggccagtcagacccttatagtgccactccaggaggtgcaccaagtc atcactactgggagccacatctggtcagtggcatccaatactgtccagggggtcggcatt gctgtcttccgccctgaggtgacacagatgtcaaaatcacagccagcatttctcagtgcc tcccagaaggctcagagaattaatatgcatattgagcacattaagcgctctaaaggccag catcatttcctaaaactcatgaaggaaaatgatcagaaaaagaagaaagccaaagagaaa ggtaccagagaagcacactgtgtgagaaccaatgggaagaagcctgagctgctggaacct attacctatgaattcatggcataa >gi568815597f:9972026_10279953|GENSCAN_predicted_peptide_4|1303_aa MRENPPGPPIAASAPGPSQSLGLNVHNMTPATSPIGASGVAHRSQSSEGVSSLSSSPSNS LETQSQSLSRSQSMDIDGVSCEKSMSQVDVDSGIENMEVDENDRREKRSLSDKEPSSGPE VSEEQALQLVCKIFRVSWKDRDRDVIFLSSLSAQFKQNPKEVFSDFKDLIGQILMEVLMM STQTRDENPFASLTATSQPIAAAARSPDRNLLLNTGSNPGTSPMFCSVASFGASSLSSPH SAASGTAAGSQPSSPRYRPYTVTHPWASSGVSILSSSPSPPALASSPQAVPASSSRQRPS STGPPLPPASPSATSRRPSSLRISPSMYDNPFSFLFLALSGDSSDEEDEEEDDDDGDGDD EGGGGGDDFSCVQFGSSLGASGGASNWDSYSDHFTIETCKETDMLNYLIECFDRVGIEEK KAPKMCSQPAVSQLLSNIRSQCISHTALVLQGSLTQPRSLQQPSFLVPYMLCRNLPYGFI QELVRTTHQDEEVFKQIFIPILQGLALAAKECSLDSDYFKYPLMALGELCETKFGKTHPV CNLVASLRLWLPKSLSPGCGRELQRLSYLGAFFSFSVFAEDDVKVVEKYFSGPAITLENT RVVSQSLQHYLELGRQELFKILHSILLNGETREAALSYMAAVVNANMKKAQMQTDDRLVS TDGFMLNFLWVLQQLSTKIKLETVDPTYIFHPRCRITLPNDETRVNATMEDVNDWLTELY GDQPPFSEPKFPTECFFLTLHAHHLSILPSCRRYIRRLRAIRELNRTVEDLKNNESQWKD SPLATRHREMLKRCKTQLKKLVRCKACADAGLLDESFLRRCLNFYGLLIQLLLRILDPAY PDITLPLNSDVPKVFAALPEFYVEDVAEFLFFIVQYSPQALYEPCTQDIVMFLVVMLCNQ NYIRNPYLVAKLVEVMFMTNPAVQPRTQKFFEMIENHPLSTKLLVPSLMKFYTDVEHTGA TSEFYDKFTIRYHISTIFKSLWQNIAHHGTFMEEFNSGKQFVRYINMLINDTTFLLDESL ESLKRIHEVQEEMKNKEQWDQLPRDQQQARQSQLAQDERVSRSYLALATETVDMFHILTK QVQKPFLRPELGPRLAAMLNFNLQQLCGPKCRDLKVENPEKYGFEPKKLLDQLTDIYLQL DCARFAKAIADDQRSYSKELFEEVISKMRKAGIKSTIAIEKFKLLAEKVEEIVAKNARAE IDYSDAPDEFRGKWTHPLMDTLMTDPVRLPSGTIMDRSIILRHLLNSPTDPFNRQTLTES MLEPDTAKANEASRSSGRSEAAVHVLEAKCGKPTPGPPRASKR >gi568815597f:9972026_10279953|GENSCAN_predicted_CDS_4|3912_bp atgagggagaaccctccggggcctcccatagcggcatcagccccaggaccctctcagagt cttggtctcaatgtccacaacatgaccccagctacctccccaataggtgcatcaggagta gcccatcgaagccagagcagtgaaggagtcagttctctcagcagctcgccctctaatagc cttgaaacgcaatctcagtctctctcacgttcccagagcatggatatcgatggtgtctca tgtgagaaaagcatgtcccaggtggatgtggattcaggaattgaaaacatggaggttgat gaaaatgatcgaagagaaaagcggagcctcagtgataaggagccttcctcgggccctgaa gtgtctgaagagcaggccttacagctggtctgtaagatcttccgtgtctcttggaaggac cgggacagagatgtcatctttctttcttctctttctgcacagtttaagcagaacccaaaa gaagtattctccgattttaaggacttgattggccagattttaatggaagtgctaatgatg tccactcagaccagagatgaaaacccatttgccagtctgacagccacatcacagccaatt gctgcagcagcacggtcaccagacagaaatctcttgctaaacactggctccaatccagga acaagccccatgttctgcagcgtggcttcctttggtgccagctctttgtctagtcctcac agtgcagcctctggaactgctgcgggaagccagccttcatccccgcggtatcgcccctac actgtcactcacccatgggcgtcctcaggcgtctccattctgtcgagctccccaagtccc cctgccctcgccagtagcccccaagcagtgcccgccagcagttccagacagaggcccagc agcacgggtccacccctaccacccgcctcacccagtgccacgagcagacgcccctcctcc ctgaggatctctcctagtatgtacgacaatcctttctccttcctcttcctcgcactttct ggggacagtagtgatgaagaagatgaagaagaagatgatgatgatggtgatggtgatgat gaaggtggtggtggtggtgatgatttttcttgtgtccagtttgggtccagtttgggagcc tctggtggagcaagtaattgggattcctacagtgaccatttcaccattgaaacctgcaaa gagacagatatgctgaactacctcatcgagtgtttcgaccgagttggaatagaggaaaaa aaagcaccaaagatgtgcagccagccagcagtcagccagcttctgagcaacatccgctca cagtgcatatcccatactgctttagtactacaaggctccctaacacagcccaggtccttg cagcagccgtccttcctagtgccgtatatgctgtgtaggaatctcccatatggcttcatt caggaactggtgagaaccactcaccaggatgaagaagtgttcaagcagatatttatcccc attttacaaggcctggctcttgctgccaaagagtgctccctcgacagtgactactttaaa taccccctcatggcactaggtgagctctgtgaaaccaagtttgggaagacacaccctgtg tgcaatttggttgcttctttgcggttgtggttgccgaaatccttaagtcctggctgtggg cgggagctgcagagactctcttacttaggggctttctttagcttctcagtctttgcagaa gatgatgttaaagtggttgaaaaatacttctcagggcctgccattaccctggaaaacact cgtgtggttagccaatcattgcagcattacttagagctcggaaggcaagagctttttaag attctgcatagtattttgttaaatggcgaaacccgtgaggctgctctcagttacatggcg gctgtcgtcaatgccaatatgaagaaagcacagatgcagacagatgatagattggtgtct acagatggatttatgctgaatttcctttgggtactgcagcagctaagtacaaaaatcaag ttagaaacagttgatcccacgtatatttttcacccaagatgtcggattactcttcccaat gatgagacgcgtgtgaatgcaacgatggaagatgtgaatgactggctgactgaactctat ggcgatcagcctccattttctgagccgaaattccctacggagtgcttctttctcaccctg catgctcaccacctctctattctgcctagttgccgtcgctatatccgcagactccgggct atccgggagctcaatagaactgtagaagatttgaaaaataatgaaagccaatggaaagat tccccactggcaactagacaccgcgaaatgctgaagcgctgtaaaactcagcttaagaaa ctggtacggtgcaaggcctgtgctgatgctggcctacttgacgagagcttcctgagaaga tgtctgaatttttatggccttctcattcagctgctgctccgcatcctggaccccgcatat cccgatataacactgcctttaaattcagatgtccccaaggtatttgcagcgttgcctgag ttttatgtagaagatgttgcagaatttttattttttattgtacaatactctccccaggcg ctttatgagccctgtactcaggatattgtgatgttccttgttgtgatgttgtgcaaccag aactacatccgaaacccatatttggtggccaaactggtagaagtcatgtttatgaccaac cctgctgttcagccacgaacccagaagttttttgaaatgattgagaaccatcctctctcc accaagttgttggtaccttccctgatgaagttttatacagatgttgagcataccggagcc accagtgagttttatgacaagttcacaattcgctatcatattagcaccatttttaaaagc ctttggcaaaacatagctcaccatggcacctttatggaggagttcaactccgggaagcag tttgttcgctatataaacatgttgataaacgacacgacgtttttgctcgatgaaagtctg gagtctctgaagcgaatccatgaagtgcaggaagagatgaagaacaaagaacagtgggac cagttgccccgggatcagcagcaggctcgtcagtctcagcttgctcaggatgagcgtgtg tcccgctcttacctcgccctggccaccgaaaccgtggacatgttccacatcctcacgaag caggtccagaagcccttcctcagaccggagcttggaccccgattggctgcaatgctgaac tttaatcttcagcaactttgtggccccaagtgccgtgacctgaaagttgaaaaccctgag aaatacggctttgaaccaaagaagctgttggaccaactgacggatatttacttacagctg gactgtgctcggttcgcgaaagccattgctgacgaccagagatcctacagtaaggaattg tttgaagaagttatttcaaagatgcggaaggcagggatcaaatccacaatagcaatagaa aaatttaagctgctcgccgagaaagtggaggagatagtggccaagaacgcacgcgcagaa atcgactacagcgacgctcctgatgagttcagaggcaagtggactcaccctctgatggac accctcatgacagaccccgtgcggctgccctctggcaccatcatggaccgctccatcatc ctgcggcacctgctcaactcccccacggaccccttcaaccggcagacgctgacagagagc atgctggaaccagacacagccaaggccaacgaggcaagcagaagcagcggccgcagcgaa gctgccgttcatgtgttggaggccaaatgtggcaaaccaaccccaggcccacccagagcg agcaaacgctga >gi568815597f:9972026_10279953|GENSCAN_predicted_peptide_5|454_aa MGLRSGALRLAAAAARAENVNSSMTRRKLGCNFKRRFDSLFLDCIYIYNKAIKMSGASVK VAVRVRPFNSRETSKESKCIIQMQGNSTSIINPKNPKEAPKSFSFDYSYWSHTSPEDPCF ASQNRVYNDIGKEMLLHAFEGYNVCIFAYGQTGAGKSYTMMGKQEESQAGIIPQLCEELF EKINDNCNEEMSYSVEVSYMEIYCERVRDLLNPKNKGNLRVREHPLLGPYVEDLSKLAVT SYTDIADLMDAGNKARTVAATNMNETSSRSHAVFTIVFTQKKHDNETNLSTEKVSKISLV DLAGSERADSTGAKGTRLKEGANINKSLTTLGKVISALAEVSFSDKILGITQFDFVLKVK DTKENKSKYRSVKTRANTEGGNSRTAMVAALSPADINYDETLSTLRYADRAKQIKCNAVI NEDPNAKLVRELKEEVTRLKDLLRAQGLGDIIDX >gi568815597f:9972026_10279953|GENSCAN_predicted_CDS_5|1362_bp atggggctgcggagcggcgccctgcggctcgcggcggccgctgctcgcgctgagaatgta aattcctccatgacaaggaggaaacttggctgtaacttcaaaagaagatttgattcttta tttctggactgcatatatatatataacaaggccattaaaatgtcgggagcctcagtgaag gtggctgtccgggtaaggcccttcaattctcgagagaccagcaaggaatccaaatgcatc attcagatgcaaggcaactcgaccagtattattaacccaaagaatccaaaggaagctcca aagtccttcagcttcgactattcctactggtctcatacctcacccgaagatccctgtttt gcatctcaaaaccgtgtgtacaatgacattggcaaggaaatgctcttacacgcctttgag ggatataatgtctgtatttttgcctatgggcagactggtgctggaaaatcttatacaatg atgggtaaacaagaagaaagccaggctggcatcattccacagttatgtgaagaacttttt gagaaaatcaatgacaactgtaatgaagaaatgtcttactctgtagaggtgagctacatg gaaatttactgtgaaagagtacgagatttgctgaatccaaaaaacaagggtaatttgcgt gtgcgtgaacacccacttcttggaccctatgtggaggatctgtccaagttggcagttact tcctacacagacattgctgacctcatggatgctgggaacaaagccaggacagtggcagct acaaacatgaatgaaacaagtagccgttcccacgctgtgtttacgattgttttcacccag aagaaacacgataatgagaccaacctttccactgagaaggtcagtaaaatcagcttggtg gatctagcaggaagtgaacgagctgattcaactggtgccaaagggactcgattaaaggaa ggagcaaatattaataagtctcttacaactttgggcaaagtcatttcagccttggccgag gtgagcttttcagataagatactaggtataacacagtttgactttgtactgaaagtaaaa gatactaaggagaacaaaagtaaatatagatctgtaaaaactagagcaaacactgaaggt ggcaattctcggactgcaatggttgctgctctgagccccgcggatatcaactacgatgag actttgagcactctgagatatgcagatcgtgcaaaacaaattaaatgcaatgctgttatc aatgaggaccccaatgccaaactggttcgtgaattaaaggaggaggtgacacggctgaag gaccttcttcgtgctcagggcctgggagatattattgatann