GENSCAN 1.0 Date run: 6-Nov-116 Time: 15:29:54 Sequence gi568815597r:110748080_110948936 : 200857 bp : 38.37% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 PlyA - 30 25 6 1.05 1.08 Term - 2919 2713 207 0 0 112 48 109 0.080 5.66 1.07 Intr - 29390 29229 162 2 0 27 65 97 0.003 0.55 1.06 Intr - 30281 30075 207 2 0 105 36 77 0.014 2.55 1.05 Intr - 35183 35041 143 0 2 65 71 69 0.121 2.05 1.04 Intr - 35474 35360 115 2 1 46 61 109 0.084 3.10 1.03 Intr - 40545 40416 130 2 1 40 54 64 0.018 -2.02 1.02 Intr - 54180 54091 90 2 0 84 113 39 0.931 4.19 1.01 Init - 59665 59262 404 1 2 60 39 231 0.818 11.55 1.00 Prom - 73397 73358 40 -3.65 2.03 PlyA - 74988 74983 6 1.05 2.02 Term - 75197 75024 174 2 0 12 40 150 0.143 -0.62 2.01 Init - 94718 94635 84 2 0 60 94 63 0.183 4.97 2.00 Prom - 98057 98018 40 -3.45 3.05 PlyA - 98569 98564 6 1.05 3.04 Term - 100644 99998 647 1 2 27 38 588 0.998 41.00 3.03 Intr - 101890 101715 176 0 2 -40 4 203 0.220 -2.24 3.02 Intr - 105609 105468 142 1 1 89 105 65 0.619 6.79 3.01 Init - 106426 106369 58 1 1 80 96 26 0.610 4.12 3.00 Prom - 110096 110057 40 -3.75 4.00 Prom + 125864 125903 40 -4.05 4.01 Init + 139921 139995 75 0 0 55 94 65 0.576 4.94 4.02 Intr + 144266 144454 189 1 0 67 109 135 0.965 12.36 4.03 Intr + 145507 145652 146 0 2 54 74 43 0.265 -2.34 4.04 Intr + 151237 151420 184 1 1 130 28 134 0.095 10.47 4.05 Intr + 163106 163212 107 1 2 95 13 75 0.115 -1.21 4.06 Intr + 163298 163564 267 2 0 37 84 189 0.939 9.12 4.07 Term + 165778 165964 187 1 1 52 38 129 0.899 0.28 4.08 PlyA + 166226 166231 6 1.05 5.00 Prom + 177572 177611 40 -6.05 5.01 Init + 181636 181788 153 0 0 87 97 41 0.430 4.93 5.02 Term + 186896 187048 153 1 0 33 49 156 0.693 3.14 5.03 PlyA + 187167 187172 6 1.05 6.02 PlyA - 188394 188389 6 1.05 6.01 Sngl - 200278 199880 399 1 0 67 38 300 0.726 18.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 66029 66150 122 0 2 59 44 167 0.970 7.06 S.002 Sngl + 156693 157028 336 2 0 90 42 161 0.919 7.48 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:110748080_110948936|GENSCAN_predicted_peptide_1|485_aa MSEFPFTIASKRIKYLGIHLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPRSWIGRINIV KMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRAHIAKSILSQKNKAGGITLP DFKLYYKATITKTAWWRQEKGFLPRKLDDGESGCSPQTHVFVKKRARICDSEETPRSLDP ESLVLAVSLPLCGGYIQLQGRKGALEFRVFTCDDQKGKAPALPRIHHMRATESNQTKQQQ EKGSDEGFLGEKQKRPNFPLRLSGVIDLDMCMLSSLLSDSNKGSGSMAQWSGAQMTVCTL CLLKPQEESQSRQLCKGLPSVDSGKNTVYSSPLHLSNASIPDLEKSVLARTGERYAQIHL SFESPCFNYCAVVWHTKIPALLLTQSGKHKCLLADDNTDHHAYSAACQTVGQGLPLLDTL QNDPGDTEGLCSILANSENSAATRRIVLRLLTGHKTPTHCPLATYSLSDSALPLTLHCLL EQSSL >gi568815597r:110748080_110948936|GENSCAN_predicted_CDS_1|1458_bp atgagtgaattcccattcacaattgcttcaaagagaataaaatacctaggaatccacctt acaagggacgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaattaaa gaggatacaaacaaatggaagaacattccacgctcatggataggaagaatcaatatcgtg aaaatggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcccacatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacct gacttcaaactatactacaaggctacaataaccaaaacagcatggtggaggcaggaaaaa ggtttcttgccaaggaaacttgatgatggggagtctggttgttcacctcaaactcacgta tttgtgaaaaaacgagccaggatctgtgactctgaggaaacccctagaagcttggaccca gagagtttggttttagctgtgagtctacccctgtgtggagggtatatccaactccaggga agaaaaggagctctggagtttagggtttttacctgtgatgatcaaaagggaaaagctcct gcacttccccgtatacaccacatgagggcgacagagtcaaaccagaccaagcagcagcag gaaaagggcagtgatgaaggttttctaggagaaaagcagaaaaggcctaacttccctcta agattgtctggagtaatagatctagatatgtgtatgctctccagtctcctttccgactct aacaaaggctctggcagtatggcacagtggtcaggagcccagatgacagtatgcacattg tgcctactgaagccccaggaggagtcacaaagcagacagctctgcaagggcctgccctct gtggactcggggaagaacactgtgtactcttcccctctccatctcagcaatgcatcaatc ccagacttggaaaaatcagtgttggcgaggacaggagaaagatatgctcagatacatttg tcatttgagagcccgtgctttaactactgtgctgttgtgtggcacaccaagatccctgca ctcctactcacccaatctggcaaacacaagtgcctgcttgcagatgacaacacagatcat catgcctactcagctgcttgccagacagttggtcagggtctcccattgctggacacatta caaaatgacccaggagacactgagggcctctgctccatccttgccaattctgagaatagt gcagccacaaggaggattgttctgaggcttctcacagggcacaaaacccccacgcactgt cctctggccacctacagcctctctgactctgcactgcctctgaccctgcactgcctgctt gagcaaagcagtctttga >gi568815597r:110748080_110948936|GENSCAN_predicted_peptide_2|85_aa MAIIKKIKITNAGEDMGKEELLHTVGGKVNVKEKTLKEAKEKEKVVYKGNPIGLTVKLSA ETLQARRDWGPIFSILKEKKFQPRI >gi568815597r:110748080_110948936|GENSCAN_predicted_CDS_2|258_bp atggctattatcaaaaagataaaaataacaaatgctggcgaggatatggggaaagaggaa ctcttacacactgttggtggaaaggtcaatgtgaaagaaaaaaccttaaaggaagctaaa gagaaggagaaagtcgtttacaaagggaaccccatcgggctaacagtgaaactgtcagca gaaaccttacaagccagaagagattgggggcctatattcagcatcctgaaagaaaagaaa tttcaaccaagaatttga >gi568815597r:110748080_110948936|GENSCAN_predicted_peptide_3|340_aa MNQRGEVTKEVISFIGSTEGAMSVFEENGNRGMQRKMEAGMQRGRRTKKERKNHLLYALP ACTVFICFYPEIRHVVHLTKTMSELSLRMNVGGHLSLGLISQPSFFRINLSPSVGKAIIS PDTGAQRWKRAQREERLKAQQNTDKDVAAHFQASHKPSAEDAEGQSPLSQKYSPSTEKCL PEIQGIFDRDPDTLLYLLQQKSEPEEPCIGSKAPKDDKTIIEEQATKIADLKRHVEFLVA ENKRLRKENKQLKAEKARLLKGPIEKELDVDADFVETSELWSLPPHSETATASSTWKKFA ANTGKAKDIPIPNLPPLDFPSPELPLMELSEDILKGFMNN >gi568815597r:110748080_110948936|GENSCAN_predicted_CDS_3|1023_bp atgaaccagaggggtgaagtcacaaaagaagtgatcagtttcatagggtccacagaagga gcaatgtcagtctttgaggaaaacggaaacagaggaatgcagagaaaaatggaagcagga atgcaaaggggaaggaggacaaagaaagagaggaagaatcacctactatatgcccttcct gcatgtactgtgttcatctgcttttatccagaaatacgccacgtggtgcacttgacaaaa accatgtcagagctgtctctccgcatgaacgtaggaggacacttgagtttgggcctcatt tctcaaccttcttttttccgtataaacttgagtccctctgtgggcaaagccatcatctcc cctgatactggtgctcagagatggaaaagggcccagcgtgaagaaagattgaaagcccag cagaacacagacaaggatgtagctgcccattttcaggcatctcacaaaccctctgcagag gatgcagagggccagagtcccctttctcagaagtacagcccttccacagagaaatgcctg cctgagattcaggggatctttgacagggatccagacacactactttatttacttcagcaa aagagtgagccagaagagccatgtattggaagcaaagcccccaaagatgataaaacaatt atagaggagcaggcaaccaaaattgcagatttgaagaggcatgtggaattccttgtggct gagaataaaagattaaggaaagaaaataaacaactgaaggctgaaaaggccagacttcta aaaggtccaatagaaaaggagctggatgtagatgctgattttgtagaaacgtcagagtta tggagcttgccaccacattcagaaactgctacagcctcctcaacctggaagaagtttgca gcaaacaccgggaaagccaaggacattccaatccccaatcttcctcccttggattttcca tctccagaacttcctcttatggagctctctgaggatattctgaaaggatttatgaataat taa >gi568815597r:110748080_110948936|GENSCAN_predicted_peptide_4|384_aa MKQEIRVESSRCDDKSKKLHDARKEICGCCILGFGIYLLIHNNFGVLFHNLPSLTLGNVF VIVGSIIMVVAFLGCMGSIKENKCLLMSCWDYRRKPLRPVCMGQLLNWSQEKEFFKKSLF SDSCQTNGKGWKIADMGSLSYTTREVGVGQAHPISGSKTIFHSLTAAAMSLKVVKLISEH LLDKRGKDKLDLMAQHQRGKKKPGDYYEQFYAHKLENLEEMDEFLEAYNLPRLKKSPGPE RFTAEFFQICKEEVVPILLKLLKNIEEEGLFPNSFYKASIILILKPGKDKKKKENLRPIY LMNTNTKSSTKYEQIESSSKSKSTIHNSKDMESTWMPINSGLDKENVVLIHYGIQHNHKK EQNYVLCSDMDASGVHYPKQTCLG >gi568815597r:110748080_110948936|GENSCAN_predicted_CDS_4|1155_bp atgaagcaggagataagagtggaaagtagcagatgtgatgacaaaagcaagaagttgcac gatgcaagaaaggagatctgtggctgctgcattttgggctttgggatctacctgctgatc cacaacaacttcggagtgctcttccataacctcccctccctcacgctgggcaatgtgttt gtcatcgtgggctctattatcatggtagttgccttcctgggctgcatgggctctatcaag gaaaacaagtgtctgcttatgtcgtgttgggattataggcgtaagccactgcgcccagtc tgtatgggtcaactcttaaactggagccaagagaaagaattttttaaaaagtccctcttc tcagatagttgtcagactaatggcaaaggatggaagatagcagacatggggtccctgtct tatacaaccagagaagtgggtgttggccaggcacatcccatctcaggcagcaagacaatc tttcactcactgacggcagcagccatgtctctcaaagtggtgaaactaatatctgagcat cttttagacaagagaggcaaagacaaactggatttaatggcccaacatcaaaggggaaaa aaaaaacctggagactattatgaacagttctatgcacacaaactggaaaacctagaagaa atggatgaattcttagaagcatacaacctcccaagactgaaaaaaagccctggaccagaa agattcacagctgaattcttccagatatgtaaagaagaggtggtaccaattctactaaaa ttactcaaaaatattgaggaggaggggctttttcctaactcattctacaaggccagcatc attctcatactaaaacctggcaaagataaaaagaaaaaagaaaacttaaggccaatatac ctgatgaacacaaacacaaaatcctcaacaaaatacgagcaaattgaatccagcagcaaa tcaaaaagcactattcacaatagcaaagacatggaatcaacctggatgcccatcaacagt ggactagataaagaaaatgtggtacttatacactatggaatacaacacaaccataaaaaa gaacaaaattatgtcctttgcagcgacatggatgcttctggagtccattatcctaagcag acttgcttaggataa >gi568815597r:110748080_110948936|GENSCAN_predicted_peptide_5|101_aa MDGARGHYPQQTNAGTENQIPHVLTYKWELNDEITWKQRGNNRYWGLPGEKVVQDREKHS ASLEESEEREQEPLPSNPENSGFVQDRQGSTSMSLQEPQNY >gi568815597r:110748080_110948936|GENSCAN_predicted_CDS_5|306_bp atggatggagctagaggccattatcctcagcaaactaacgcaggaacagaaaaccaaata ccacatgttctcacttataaatgggagctaaatgatgagatcacgtggaaacaaagagga aacaacagatactggggcctaccaggagagaaggtggttcaggacagagagaaacactct gcttctttggaagaaagtgaggaaagagaacaggagcctctgcctagtaatccagagaat tctggatttgtccaagaccgtcaaggcagtacctctatgagtctgcaagaaccacagaat tactga >gi568815597r:110748080_110948936|GENSCAN_predicted_peptide_6|132_aa MDHIKKRKTENAYNAIINGEANVTGSQLLSSILPTSDVSQHNILTSHSKTRQEKRTEMEY YTHEKQEKGTLNSNAAYEQSHFFNKNYTEDIFPVTPPELEETIRDEKIRRLKQVLREKEA ALEEMRKKMHQK >gi568815597r:110748080_110948936|GENSCAN_predicted_CDS_6|399_bp atggatcacataaagaagagaaaaacagagaatgcttataacgcaatcataaatggggaa gctaatgtcaccggttcccaactcctaagcagtattttaccaacttcagatgtgtcacaa cataacattctcacgagtcacagcaaaaccagacaagaaaagagaactgagatggaatac tatacccatgagaagcaagagaaaggcactttgaattcaaatgcagcttatgaacaaagt catttcttcaataaaaattataccgaagatattttcccagtgacaccaccggagttagaa gaaaccattcgagatgaaaaaataagaagacttaagcaggtgctgagagagaaagaagca gctcttgaagaaatgcgtaagaagatgcaccaaaaataa