GENSCAN 1.0 Date run: 6-Nov-116 Time: 15:05:45 Sequence gi568815590r:78660145_78862431 : 202287 bp : 35.49% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1425 1479 55 2 1 57 80 64 0.044 4.00 1.02 Intr + 3293 3448 156 0 0 19 86 87 0.003 0.66 1.03 Intr + 5882 5922 41 0 2 102 64 52 0.005 1.22 1.04 Intr + 6491 6649 159 1 0 90 -51 184 0.001 3.96 1.05 Intr + 18419 18535 117 1 0 60 55 81 0.042 1.74 1.06 Intr + 26323 26464 142 0 1 65 28 194 0.079 10.11 1.07 Intr + 29078 29229 152 0 2 82 103 50 0.075 4.86 1.08 Intr + 37263 37362 100 2 1 70 63 73 0.037 1.76 1.09 Intr + 38282 38369 88 0 1 65 110 52 0.019 3.21 1.10 Intr + 45766 46027 262 1 1 53 96 157 0.033 9.57 1.11 Term + 46159 46332 174 0 0 44 46 136 0.943 1.78 1.12 PlyA + 47872 47877 6 1.05 2.00 Prom + 51787 51826 40 -5.65 2.01 Init + 51850 51962 113 0 2 51 108 66 0.988 4.63 2.02 Intr + 55077 55184 108 0 0 98 91 43 0.940 4.08 2.03 Term + 57184 57349 166 1 1 99 42 142 0.999 7.11 2.04 PlyA + 57537 57542 6 1.05 3.05 PlyA - 58806 58801 6 1.05 3.04 Term - 63910 63779 132 1 0 46 51 121 0.248 1.31 3.03 Intr - 78491 78360 132 2 0 79 115 21 0.274 3.82 3.02 Intr - 83775 83629 147 0 0 67 71 48 0.018 0.51 3.01 Init - 84737 84222 516 2 0 2 -11 316 0.118 8.03 3.00 Prom - 92119 92080 40 -5.15 4.06 PlyA - 92469 92464 6 1.05 4.05 Term - 92762 92668 95 0 2 108 47 35 0.090 -1.59 4.04 Intr - 98323 98149 175 1 1 14 86 118 0.110 2.79 4.03 Intr - 102214 100237 1978 0 1 52 29 1425 0.072 119.37 4.02 Intr - 119601 119427 175 1 1 -13 67 142 0.253 0.08 4.01 Init - 119850 119688 163 0 1 70 72 127 0.742 9.24 4.00 Prom - 124068 124029 40 -5.25 5.04 PlyA - 125130 125125 6 1.05 5.03 Term - 134039 133753 287 0 2 32 48 218 0.531 6.78 5.02 Intr - 138054 137984 71 2 2 61 98 59 0.352 2.11 5.01 Init - 141393 141371 23 0 2 98 107 30 0.835 5.15 5.00 Prom - 142230 142191 40 -8.95 6.04 PlyA - 142809 142804 6 1.05 6.03 Term - 144548 144363 186 2 0 62 37 168 0.768 5.61 6.02 Intr - 146682 146506 177 0 0 80 81 163 0.892 13.99 6.01 Init - 177787 177695 93 1 0 58 109 65 0.305 6.03 6.00 Prom - 180037 179998 40 -6.75 7.00 Prom + 180955 180994 40 -3.05 7.01 Init + 183265 183340 76 0 1 76 82 51 0.211 4.60 7.02 Intr + 191314 191490 177 2 0 51 75 113 0.137 5.27 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 6773 6596 178 2 1 107 56 196 0.958 16.87 S.002 Term + 26323 26511 189 0 0 65 39 220 0.850 11.27 S.003 Init + 45806 46027 222 1 0 74 96 142 0.818 12.20 S.004 Sngl + 108846 109673 828 2 0 33 43 309 0.930 16.38 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:78660145_78862431|GENSCAN_predicted_peptide_1|481_aa MFIVAAVRTNSVPITTEMRDLNFVWDLRDWSFETRSYMQLAQLFGMCRIDVDLLKGMVVE SNIMQSGKNQVAEPMGARNVSGWEVTGFLHFDSIFCRRDVRASPKEVSLASFRAGTIDYP EDLMTYSRPLCEEEPVLKKHGPICQKTATKKRKTFDSSRQRAEGTDIPTVKPLKPRPEPP KKPSNWRRKHEEFIATIRAAKGLDQALKEGGKLPPPPPPSYDPDYIQCPYCQRRFNENAA DRHINFCKEQAARISNKGKFSTDTKGKPTSRTQVYKPPALKKSNSPGTASSGSSRLPQPS GAGKTVVGKVSSSSSSLGNKLQTLSPSHKGIAAPHAGWLWLSVGQRIPAGMVGGPSCEVL LCDEEWIGNLLNQQSDHILVKKSCYAGESSLPLVGLYSPKPAEQLSHPNSKDGSLSLPLG VPSQGMYGGLTSCFAGVAVTFAGKFGVPGSLPVPEQLLCQDSPYLYVRLKALVEWLHKGV S >gi568815590r:78660145_78862431|GENSCAN_predicted_CDS_1|1446_bp atgttcattgtagcagcagttagaaccaactcagtgcccatcactacagaaatgagggac ttgaacttcgtgtgggatttaagagactggagctttgaaacaagaagctatatgcagctt gctcagctttttgggatgtgcagaatagatgtggatctgcttaagggaatggtggttgaa agcaatataatgcaaagtggaaagaatcaagttgcggaaccaatgggagctcggaatgtt agcgggtgggaggtaaccggctttttacatttcgacagcatcttctgtcggagggatgtc agagcatctcctaaggaagtgtcactggccagctttagagccggtacaatagattacccg gaggatttgatgacttattcacggccgctgtgtgaggaagagcctgtattgaaaaaacat ggacccatttgccagaagactgcaactaaaaaacggaagacttttgattcaagcagacag agagctgaaggaactgatattccaacagtaaaacctctcaaaccgaggccagaaccacca aagaaaccatctaattggagaaggaaacatgaagaattcattgctaccataagagcagct aaaggccttgatcaggccctcaaagagggtggcaaacttcctcctcctcctccaccttct tatgatcctgattatattcaatgtccatattgtcagaggagattcaatgaaaatgcagct gatagacatataaatttctgtaaagaacaggcagcacgtattagtaataaagggaaattt tctacagataccaaaggaaaaccaacttctcggacacaggtgtataagccacccgcactt aaaaagtcaaattctcctggaactgcatcatcaggatcttcacgattaccgcagccaagt ggcgctggcaaaactgttgtaggtaaagtgtcttcaagtagcagctctttgggaaacaaa cttcagaccttatctccctctcataaagggatagcagcccctcatgcaggttggctttgg ctctctgtgggccagagaataccagcagggatggttggaggtcccagttgtgaggtcctg ctctgtgatgaggaatggattgggaacctgcttaaccagcagtctgaccacattttggta aagaagtcatgctatgctggggaatcctctctgcctctggttggtttgtactctccaaag cctgcagaacagctgagtcacccaaacagcaaagatggcagcctgtcccttcccctggga gttccatcccagggtatgtacgggggtctaacatcctgctttgctggagttgcagttact tttgctgggaagtttggagttcctgggtctttgcctgtgccggagcagctgctctgccaa gactccccatatctgtatgtcagactgaaggctctagtggagtggcttcacaagggcgtc tcttga >gi568815590r:78660145_78862431|GENSCAN_predicted_peptide_2|128_aa MDKLGPPLRTGRLVQRLYDSDTKSDSAIKRHELLPIYKANVKPRNSTPPSLARNPAPGVL TNKRKTYTESYIARPDGDCASSLNGGNIKGIEGHSPGNLPKFCHECGTKYPVEWAKFCCE CGIRRMIL >gi568815590r:78660145_78862431|GENSCAN_predicted_CDS_2|387_bp atggacaagttaggccctccacttcgaacgggaagacttgtgcaaaggttgtatgacagt gatacaaaatcagacagtgccatcaaaagacatgagttattacctatttacaaagctaat gtcaaaccccgaaattccacaccacctagtttggcaagaaatcctgccccaggtgtgctt acaaacaaaagaaaaacatatactgagagctacatagccaggccagatggggactgtgca tcttcccttaatggtggaaatattaaaggcattgaaggacattcacctggaaacttacca aaattctgccatgagtgtgggactaaataccctgtagaatgggccaaattttgctgtgaa tgtggcattcgaagaatgattctatga >gi568815590r:78660145_78862431|GENSCAN_predicted_peptide_3|308_aa MCDPGNHTLPTNLCNLELGDPLLNPLHQGFQSNTQRYGSLGREAAQAHVGEPGTVDIPPS GFPAKVTATPEKQEIRSLCIPLGKRLNPVGQAAMVCRPYFHGASKDKTHWLGIPASHQQQ CCAYLGQSSQGEGKATIFTVWASHLFQPADFEESKPIGQKGSPNTAQLLYQHPSLVIPSA TGKSKATRDWSRPPANHSSPMENWPNCARGKKKGRQRRTLKEGMFLFRAARKLRQFLKMN STGDFDLHLLKVSEGTTILLNCTGQHLSHATSEFGLASSEQEQGPNGRPAGESGDADQRC SIFINQIS >gi568815590r:78660145_78862431|GENSCAN_predicted_CDS_3|927_bp atgtgtgaccctggaaaccacactcttcccacgaatctttgcaacctcgagttgggagat cccctcttgaacccactccatcagggctttcagtctaatacacagagatacgggagtctt ggcagagaagctgctcaggcacatgttggagaaccaggaactgtagatattccaccttca ggcttcccggcaaaagtaactgcaactccagaaaagcaggagattagatccttgtgcata cccttaggaaagaggctgaatccagtgggccaagcagcgatggtctgtaggccctacttc catggtgcctcaaaggataagacacattggcttggaattccagccagccaccagcagcag tgttgtgcctacctgggacagagttcccagggagaggggaaggccaccatcttcactgtt tgggcaagtcacctttttcagcctgcagactttgaagagtccaaaccgatcgggcagaag ggatcccccaacacagcacaattgctctaccaacacccttcactggtgataccttcagct accggaaaatccaaggcaactagggactggagtagacccccagcaaaccacagcagccct atggaaaattggccaaattgtgccagggggaaaaaaaaaggtaggcaacgtcgaacattg aaggaaggtatgtttttattccgtgctgctcgcaagttgaggcaatttcttaaaatgaat agcactggtgattttgatctccacttattaaaagtttcagaaggcacaacaatactgttg aactgcactggccagcatctttcccacgcaaccagtgagtttgggctggccagttctgag caggaacaagggccaaatggtagacctgctggagaaagtggagatgcagatcagagatgc agcattttcattaaccaaatttcttga >gi568815590r:78660145_78862431|GENSCAN_predicted_peptide_4|861_aa MDKFLDRYILPRLNQEEVESLNRRITSSEIEAVINSLPTKKTRDQTDLELNSTRASIILT PKPGRDTTKKENFRPTSLMNVNTKIFGKILPNQIQEHIEKLIHHDQVGFISGIFPRDPAR CQKWVENCRRADLEDKTPDQLNKHYRLCAKHFETSMICRTGPYRTVLRDNAIPTIFDLNS HLNNPHSRHRKRIKELSEDEIRTLKQKKIDETSEQEQKHKETNNSNAQNPSEEEGEGQDE DILPLTLEEKENKEYLKYLLEILILMGRQNIPLDGHEADEIPEGLFTPDNFQALLECRIN SGEEVLRKRFETTAVNTLFCSKTQQRQMLEICESCIREETLREVRDSHVFSIITDDVVDI AGEEHLPVLVRFVDESHNLREEFIGFLPYEADAEILAVKFHTMITEKWGLNMEYCRGQAY IVSSGFSSKMKVVASRLLEKYPQAIYTLCSFCALNMWLAKSVPVMGVSVALGTIEEVCSF FHRSPQLLLELDNVISVLFQNSKERGKELKEICHSQWTGRHDAFEILVELLQALVLCLDG INSDTNIRWNNCIAGRAFVLCSAVTDFDFIVTIVVLKNVLSFTRAFGKNLQGQTSDVFFA AGSLTAVLHSLNEVMENIEVYNEFWFEEATNLATKLDIQMKLPGKFRRAHQGNLESQLTF ESYYKETLSVPTVEHIIQELKDIFSEQHLKALKCLSLVPSVMGQLKFNTLEEHHADMYRS DLPNPDTLSAELHCWGIKWKHRGKDIELPSTIYEALQLPDIKFFPNVYALLKKAQERIKL TGRADTLMKARKKSKLVTTENDQTVKINIKRGKRNKGYTEHSENSYQNDSRTWMKQETII LSKLTQEQKTKDRTFSLLSGS >gi568815590r:78660145_78862431|GENSCAN_predicted_CDS_4|2586_bp atggataaattcctggacagatatatactcccaagactgaaccaggaagaagttgaatcc ttgaataggcgaataacaagttctgaaattgaggcagtaataaatagcctgccaaccaag aaaacccgcgaccagacagatttagagctgaattctaccagagccagcatcattctcaca ccaaaacctggcagagatactacaaaaaaagaaaacttcaggccaacatctctgatgaac gtcaatacaaaaatcttcggtaaaatactgccaaaccaaatccaggagcacatcgaaaag cttatccaccatgatcaagttggcttcatctctgggatcttcccgcgggaccctgccaga tgccagaagtgggtggagaactgtaggagagcagacttagaagataaaacacctgatcag ctaaataaacattatcgattatgtgccaaacattttgagacctctatgatctgtagaact ggtccttataggacagttcttcgagataatgcaataccaacaatatttgatcttaacagt catttgaacaacccacatagtagacacagaaaacgaataaaagaactgagtgaagatgaa atcaggacactgaaacagaaaaaaattgatgaaacttctgagcaggaacaaaaacataaa gaaaccaacaatagcaatgctcagaaccccagcgaagaagagggtgaagggcaagatgag gacattttacctctaacccttgaagagaaggaaaacaaagaatacctcaaatatctactt gaaatcttgattctgatgggaaggcaaaacatacctctggacggacatgaggctgatgaa atcccagaaggtctctttactccagataactttcaggcactactggagtgtcggataaat tctggtgaagaggttctgagaaagcggtttgagacaacagcagttaacacgttgttttgt tcaaaaacacagcagaggcagatgctagagatctgtgagagctgtattcgagaagaaact ctcagggaagtgagagactcacacgtcttttccattatcactgacgatgtagtggacata gcaggggaagagcacctacctgtgttggtgaggtttgttgatgaatctcataacctaaga gaggaatttataggcttcctgccttatgaagctgatgcagaaattttggctgtgaaattt cacactatgataactgagaagtggggattaaatatggagtattgtcgtggccaggcttac attgtctctagtggattttcttccaaaatgaaagttgttgcttctagacttttagagaaa tatccccaagctatctacacactctgctctttctgtgccttaaatatgtggttggcaaaa tcagtacctgttatgggagtatctgttgcattaggaacaatcgaggaagtttgttctttt ttccatcgatcaccacaactgcttttagaacttgacaacgtaatttctgttctttttcag aacagtaaagaaaggggtaaagaactgaaggaaatctgccattctcagtggacagggagg catgatgcttttgaaattttagtggaactcctgcaagcacttgttttatgtttagatggt ataaatagtgacacaaatattagatggaataactgtatagctggccgagcatttgtactc tgcagtgcagtaacagattttgatttcattgttactattgttgttcttaaaaatgtccta tcttttacaagagcctttgggaaaaacctccaggggcaaacctctgatgtcttctttgca gccggtagcttgactgcagtactgcattcactcaacgaagtgatggaaaatattgaagtt tataatgaattttggtttgaggaagccacaaatttggcaaccaaacttgatattcaaatg aaactccctgggaaattccgcagagctcaccagggtaacttggaatctcagctaaccttt gagagttactataaagaaaccctaagtgtcccaacagtggagcacattattcaggaactt aaagatatattctcagaacagcacctcaaagctcttaaatgcttatctctggtaccctca gtcatgggacaactcaaattcaatactttggaggaacaccatgctgacatgtatagaagt gacttacccaatcctgacacgctgtcagctgagcttcattgttggggaatcaaatggaaa cacagggggaaagatatagagcttccgtccaccatctatgaagccctccaactgcctgac atcaagttttttcctaatgtgtatgcattgctgaagaaagcacaagaaagaattaaactc acgggtagagcagatacactaatgaaagcaagaaagaaatcaaagcttgtcactacagaa aatgaccaaactgtaaagataaatattaaaagaggaaagagaaacaaaggatatacagaa cattcagaaaacagctatcaaaatgacagtaggacatggatgaagcaggaaaccatcatt ctcagcaaactaacacaggaacagaaaaccaaagaccgcacgttctcactcctaagtggg agttga >gi568815590r:78660145_78862431|GENSCAN_predicted_peptide_5|126_aa MAYGHGSRYIFGLPPLILVLLPVASSDCDIEDVLKNICDSWEEVKISTLTGVRKKLIPTV IDDFEGFKTSIEKVNADMVEIARELELEVEPEDVTEMLKSQDKTGMDDELLLRHEQRKWF LEMILL >gi568815590r:78660145_78862431|GENSCAN_predicted_CDS_5|381_bp atggcttacggccatggctcacggtatatctttggacttcctcccctgatccttgttctg ttgccagtagcatcatctgattgtgatattgaagatgtccttaaaaacatttgtgattca tgggaggaggtcaaaatatccacattaacaggagttcggaagaagttgattccaactgtc atagatgactttgaggggttcaagacttcaatagagaaagtaaatgcagatatggtagaa atagcaagagaattagaattagaagtggagcctgaagatgtgactgaaatgctgaaatct caggataaaactggaatggatgatgagttgcttctcaggcatgagcaaagaaagtggttt cttgaaatgattttactttga >gi568815590r:78660145_78862431|GENSCAN_predicted_peptide_6|151_aa MRELLENSQAVLTEAILYSQHPSNKAADCPQYLLNASHGLKTIVSGRWAGRERPLWLEDS DVVPRNVREGSGAQITRALRAWYPVGSDQKAASNSPQRLLSVLRGNAAASAPLSGGSPQP VSEAAKGSLSGALPVLLAAPSYSRRRPGFGE >gi568815590r:78660145_78862431|GENSCAN_predicted_CDS_6|456_bp atgagagaacttctagagaacagccaagcagtcctaactgaggccatcttatactcccag cacccatccaacaaggcagctgactgtccacagtacttactgaatgccagccatgggcta aagaccatagtaagtggacgctgggcaggaagagaaaggccattgtggctggaggatagt gatgtggtaccacgaaacgtcagagagggcagtggggcccagatcaccagggcactgagg gcctggtacccagtgggaagtgaccagaaggctgcgagtaacagtccccaacgcctgctt tctgtcctgagagggaacgctgcagcctccgcgccgctcagcggtggcagcccacagccg gtctcagaagcagccaaaggctctctgtctggcgcccttcccgtgctcctggccgcccca agttactcacgcaggcggcccgggttcggcgagtag >gi568815590r:78660145_78862431|GENSCAN_predicted_peptide_7|85_aa MEVTEDFDLAMVQQSDRSRRKGGYEGTRPRVQKSLQILAGNKVLGKGTGQTMEWKSAFQE KIRSCAYRINHQGLAMAKHFLASLX >gi568815590r:78660145_78862431|GENSCAN_predicted_CDS_7|255_bp atggaggtcactgaagactttgatctagccatggttcagcagagtgacaggagtaggaga aagggaggatatgaaggcacacgacccagagttcagaaaagccttcaaatcctagcggga aacaaagtgttaggcaaagggacagggcagactatggagtggaagtcagcatttcaggag aaaatcaggtcctgtgcttacaggatcaatcaccagggcttggccatggcaaagcacttc ctggcatcattgtnn