GENSCAN 1.0	Date run: 3-Nov-116	Time: 06:04:00

Sequence gi568815597f:232705450_232908164 : 202715 bp : 41.22% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 Intr -    584    390  195  0  0    8   46   177 0.002   3.96
 1.01 Init -  12006  11901  106  0  1  100   71    53 0.123   5.23
 1.00 Prom -  20905  20866   40                              -5.45

 2.03 PlyA -  21074  21069    6                               1.05
 2.02 Term -  23186  22911  276  2  0   11   54   276 0.700  10.98
 2.01 Init -  25209  25126   84  0  0   58  110    29 0.654   2.97
 2.00 Prom -  26384  26345   40                              -6.85

 3.00 Prom +  28014  28053   40                              -7.85
 3.01 Init +  28225  28242   18  0  0   90  116     8 0.362   4.01
 3.02 Intr +  39668  39833  166  1  1   40   26   154 0.179   3.01
 3.03 Intr +  42201  42327  127  1  1   57   60   129 0.109   5.82
 3.04 Intr +  43729  43847  119  1  2   47   23    56 0.444  -5.81
 3.05 Intr +  44527  44677  151  2  1   37   57   188 0.072   8.90
 3.06 Intr +  57806  57894   89  2  2   49   67    33 0.002  -4.00
 3.07 Intr +  67948  68505  558  2  0   17   90   361 0.635  21.07
 3.08 Intr +  69009  69095   87  1  0   32   33   147 0.266   2.62
 3.09 Intr +  81459  81574  116  1  2   10   31   125 0.047  -1.75
 3.10 Term +  83135  83263  129  1  0  152   33    69 0.926   5.00
 3.11 PlyA +  84301  84306    6                               1.05

 4.04 PlyA -  85093  85088    6                               1.05
 4.03 Term -  88676  88526  151  1  1   74   38   112 0.369   1.20
 4.02 Intr -  91460  91276  185  2  2    5   92    68 0.205  -3.44
 4.01 Init -  93665  93564  102  2  0  103   63   172 0.342  16.49
 4.00 Prom -  96795  96756   40                              -8.55

 5.00 Prom +  98373  98412   40                              -6.05
 5.01 Sngl + 100001 102718 2718  1  0   91   42  1675 0.923 155.62
 5.02 PlyA + 102937 102942    6                               1.05

 6.04 PlyA - 103740 103735    6                               1.05
 6.03 Term - 111496 111369  128  2  2   74   48    74 0.132  -0.54
 6.02 Intr - 125953 125845  109  1  1   79   95   100 0.922   8.74
 6.01 Init - 127683 127675    9  0  0   72  119    21 0.923   2.96
 6.00 Prom - 135509 135470   40                              -3.35

 7.04 PlyA - 136903 136898    6                               1.05
 7.03 Term - 138043 137934  110  2  2   99   36    81 0.579   1.69
 7.02 Intr - 146020 145865  156  2  0   31  101    86 0.580   3.26
 7.01 Init - 147337 147253   85  1  1   64  100    84 0.655   8.23
 7.00 Prom - 148786 148747   40                              -4.55

 8.03 PlyA - 150227 150222    6                               1.05
 8.02 Term - 161630 161557   74  0  2  109   42    51 0.247  -0.31
 8.01 Init - 166111 165907  205  1  1   97   81   214 0.530  20.66
 8.00 Prom - 170230 170191   40                              -5.55

 9.03 PlyA - 174791 174786    6                               1.05
 9.02 Term - 175237 175023  215  2  2   99   32   185 0.885  10.41
 9.01 Init - 176382 176301   82  0  1   75   32    76 0.233   1.88
 9.00 Prom - 177914 177875   40                              -2.85

10.00 Prom + 183092 183131   40                              -4.55
10.01 Init + 190290 190408  119  2  2   77   92    73 0.772   6.32
10.02 Term + 191833 191962  130  1  1   40   47   159 0.606   3.67
10.03 PlyA + 191970 191975    6                              -0.45

11.04 PlyA - 192319 192314    6                              -0.45
11.03 Term - 194764 194557  208  0  1  100   50   117 0.796   4.93
11.02 Intr - 195693 195449  245  0  2    7   35   181 0.294   0.07
11.01 Intr - 198349 198205  145  0  1   60   90    82 0.222   4.96

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:232705450_232908164|GENSCAN_predicted_peptide_1|101_aa
MALPFLEKGNTEQATCSVLDVSSVKCLRAKQMGCAVPHNKEVIRRLGLDCAYEGGKSEDT
LASGEYVVSTQAPGSSSEGRSKRLLLVQRQLVSGSQTWKAX

>gi568815597f:232705450_232908164|GENSCAN_predicted_CDS_1|303_bp
atggcattaccatttttggagaaagggaacacagaacaagcaacatgttcagttttggat
gtttcgagtgtgaaatgtttgagggccaagcaaatgggatgcgcagtgccccataacaaa
gaagtaataagacgactaggtctggactgtgcctatgaaggtggaaagtctgaagacaca
cttgcctctggagagtatgtagtcagtactcaggcaccgggtagctcctcagaaggaagg
agcaagaggctcttactggttcagaggcagcttgtctcgggatctcagacgtggaaggct
gnn

>gi568815597f:232705450_232908164|GENSCAN_predicted_peptide_2|119_aa
MQTQSQEEQEMGLTREAGVRSHGNCHLQNKNHIQKSSDNFKEFEESLLVVDPSGVGGFLT
MTLWIQKSYKMQFNKYWKSNEHSAEKQLLPYIIFWNIPDLEEVTGMSIHRELYLELRDD

>gi568815597f:232705450_232908164|GENSCAN_predicted_CDS_2|360_bp
atgcagacacaaagccaggaggaacaagagatggggctgacaagagaggcaggagtcaga
tctcatgggaactgtcacctgcagaataaaaatcacatacagaaaagttcagacaacttt
aaggagtttgaagaatccctgctagtcgtggacccatctggggttggtggattcctcacc
atgactctttggattcagaagtcatataagatgcagtttaacaaatattggaagtcaaat
gagcattctgcggaaaaacagcttcttccatacatcatcttctggaatatcccagatctg
gaagaggtcacaggaatgtccattcatagggagctctacctggagctcagagatgattga

>gi568815597f:232705450_232908164|GENSCAN_predicted_peptide_3|519_aa
MEPTKKLSLVTPGLAQGVPEHGSHSGKDGCDRYMGQEHGLQFTRSALMLSVPKMTDLQQA
LSSMVVTVAKMDAIGMWADSMVPVHQICLDAVSAENERSAASASQCGQDTKAGTNHIGDK
NPFPPLFRVLLQWPQVQAASFCRKASGKSFRRNPEEGIVITGDDNSMRVIVPEDLPVGQD
VEAEDSDINDPDLGSSNFSCSAIKLMRYFQRKKIKTLAILFEREADEEDKTNGRTARQRR
SEKKEHPEEFGWGQSENRLLDGQTAGEDHLPTPSPVQLPFHPTVNHLHQSIKPPHSSFKS
MCDWDSVQSSGYRKLSHWPPVPAKKAKGPLNWFTLKPPGDGKSKRVHCKMHRLGLWESQT
PTPECCLGAGAQKHLPGSCTCLSACIPSRKGIELQAAKPTATPLSHVLRARPPPPPQPTQ
CEDTQDEDLDDDPLPLNEKRASGAQQPVSASSSIELTQACKANSCEATEKLPPPEVEVLV
SNKHLAPVTPLQPLLPESPAAAPTLTQYWLCFPGCPTAC

>gi568815597f:232705450_232908164|GENSCAN_predicted_CDS_3|1560_bp
atggagcctactaagaagctctctttggtcacaccaggacttgcacaaggggtccctgag
catggtagtcacagtggtaaagatggatgtgacagatacatgggccaagagcatgggctc
cagttcaccagatctgccttgatgctctcagtgccaaaaatgacagacctgcagcaagcc
ttatctagcatggtagttacagtggcaaagatggatgcaataggtatgtgggccgacagc
atggttccagttcaccagatctgcctggatgctgtcagtgccgaaaacgagagatctgca
gcaagcgctagccaatgtggacaggacaccaaagcagggactaaccacattggggataaa
aaccccttccctcctttgttccgtgtgctcttgcagtggccacaagtgcaagcagcatcc
ttctgcagaaaagcctcaggcaagtccttcaggaggaatccagaagaaggcattgttatc
acaggagatgacaactccatgcgtgtcattgtccctgaagaccttccagtgggacaagat
gtggaggcggaagacagtgatattaatgatcctgaccttggctctagtaacttttcgtgt
tctgccatcaagctgatgaggtatttccagaggaaaaaaattaaaacacttgcaatactc
ttcgagagagaagcagacgaggaagacaagacaaacggcagaacagcgcggcagagaagg
agcgaaaagaaggaacatccggaggagttcggctggggacagtcagagaatcggctgctg
gacggccaaactgcaggggaagatcatcttcccactccatcccctgtccagctcccattc
catcccactgtgaaccacctccaccaatcaataaaacccccacattcatccttcaagtcc
atgtgtgactgggactctgtacagagctcagggtacagaaagttatcacactggccccct
gtccctgcaaaaaaggcaaagggtccactgaactggttcacccttaagccacccggggac
ggcaaatctaagagagtgcactgtaagatgcaccgacttgggctttgggagtcgcagaca
cccacccctgaatgctgccttggcgccggagcccaaaagcacttgcctggttcctgcacc
tgcctgtctgcatgcatcccctcccgtaaggggattgagctccaggcagccaaacctaca
gccacacccctgtcacatgtcctgcgagcaagaccacctcctcctcctcagcctactcaa
tgtgaagacacccaggatgaagaccttgatgatgatccacttccacttaatgaaaagaga
gcctccggggcgcagcagccagtgtcagccagttcttctattgagctcacgcaggcctgc
aaagccaactcttgtgaagctactgaaaagcttcctcctccggaggtggaagtgttggtc
tctaataaacatcttgcacctgtaactccacttcagcctctgctcccggagagcccagct
gcagcaccaactttgacccaatactggctgtgctttcctggctgccctacagcatgctaa

>gi568815597f:232705450_232908164|GENSCAN_predicted_peptide_4|145_aa
MPRDADVAATTAALRERQLPIKYSEEASESPQRKPKEAGYILEWDLTGGFKDFLTGSWLK
ELCSISGLEVSGKRCLNYDKGDCEDEGSCYVDEDSRFPFCASTGSEHSLKIAQCPEDSRR
HEIPIATTAEVVNDTVQLEALPPFK

>gi568815597f:232705450_232908164|GENSCAN_predicted_CDS_4|438_bp
atgccaagagatgccgatgtagcagcaacaacagcagccttaagagagagacagcttccc
ataaagtactcagaagaagcttcagaaagcccccagaggaagcctaaagaggcaggatat
atcttggaatgggaccttacaggtggattcaaagattttctgactggcagttggttgaaa
gagttatgctctatctcaggacttgaagtcagtggaaagagatgcttgaattatgataag
ggggattgtgaagatgaaggttcttgttatgtagatgaagactccaggtttcctttctgt
gcaagcaccggaagtgaacattctctgaagattgctcagtgcccagaagacagcaggagg
catgagatccctattgcaaccacggctgaggttgtaaatgacaccgtgcaattggaagct
ctgcctccttttaaatga

>gi568815597f:232705450_232908164|GENSCAN_predicted_peptide_5|905_aa
MAASLSERLFSLELLVDWVRLEARLLPSPAAAVEQEEEEEEKEQGEASSPRGLCPAVAFR
LLDFPTLLVYPPDGPGAPAAEPWPGVIRFGRGKSCLFRLQPATLHCRLLRTPLATLLLQL
PPGRPTPTPQLLGACDISLATAAHRVVGPAASGCSHRHRGRFPLHNRVGERTGDIALAYR
LTDLGSRLLSQLERPLTFTRTGGGAEVSPQTQQERQQLQQPASQPSPKEADKPLGELEIP
EAQKDLKEMVKSKAECDNVGSVENGKTNSVVTCSGAGNGRNVSSLNEEVTELDMETNIFC
PPPLYYTNLTQEKPPPAQAKITIEPQMNAPEEMDDASPEKKRVNPPAHRSCLKHPSSAAH
EHPPMLVNPPHIQNIGATNQTCQTEQNRINTIRQLPLLNALLVELSLLYDQPVTSPAHIH
PHLAWLYRTEDKKSPESSAKSTCRSEAKKDKRSVGGCEKSVSLQYKKNQIENYKEDKYSE
KSSGALHKRVPKGRLLYGLTNTLRLRLKLTNPDMLVVHEKRELYRKRQSQMLGTKFRIPS
SKVKLLSSAEQSQKPQLPEDKYLDSDASFTENSDTSRQISGVFDEPSTSKETKLKYATEK
KTVDCSKNRINNVSLEEVVSPANSIIPERLTPTNILGGNVEMKIQSPCVFQQDAVVDRIV
DKEIDIRQVKTTDNDILMADISDKRTGKNSCYENISELKYSDDLSSPCYSEDFCTSEDTS
RSFKAHDSSSRTENPKHSQYTSKSSDTGVSKKKNSSDRSSILSPPFSAGSPVHSYRKFHI
SKTQDKSLEEASSISASDLSSTHWTEQKENQIDQNSMHNSEITKRAQDISVKTRSSWKSL
EKSQSPQTSQVSSYLPSNVSELNVLDSSTSDHFEEGNDDVGSLNISKQCKDICELVINKL
PGYTM

>gi568815597f:232705450_232908164|GENSCAN_predicted_CDS_5|2718_bp
atggcggcctcgctgtccgagcggctcttctcgctggagctgctggtggactgggtgcgt
ttggaagcccggctgctgccgtcccccgctgccgcagtggagcaggaggaggaagaggag
gaaaaggagcagggggaggcctcgtcgccgcgcggtctgtgccccgccgtggccttccgc
ctgctggacttccccacgctgttggtttaccctcctgacggccccggcgctcccgccgcc
gaaccgtggcccggtgtcatccgcttcggtcgcggcaagtcctgcctcttccgcctgcag
cctgctaccctgcactgccggctcctgcggaccccgcttgccaccttgctgctgcagctg
ccccctgggcgcccgacgcccaccccacagctcctgggggcctgcgacatttcgctggcc
accgcagcgcacagggtcgtggggccggccgcctccggatgctcccaccgtcaccgggga
cgtttccccctgcataatcgagtgggcgagcggactggggacattgcactggcctaccgc
ctgactgacctgggaagccgcctgctgagccaacttgagcggcccctcaccttcacccgc
acaggaggaggagcggaggtcagtccccaaacccagcaggaaagacagcagctgcagcag
ccagcctcacagccaagcccaaaagaggctgataagccgctgggggagttagaaatccca
gaggcacagaaggatttgaaggaaatggttaaaagtaaggccgaatgtgataatgtgggt
tctgtggagaatggcaaaaccaattctgttgttacatgttcaggtgctggcaatgggaga
aatgttagctccctaaatgaggaagtcacagaattggacatggagaccaatatattttgc
cctcctcctttgtattacactaacttgacccaagaaaaaccgccccctgcacaggctaaa
atcaccattgagcctcaaatgaatgcacctgaggaaatggatgatgcttctcctgaaaaa
aagcgtgtaaatcccccagcacacaggagttgtctaaagcatccaagttctgcagcacac
gaacatcctccaatgcttgtaaatcctccacatattcagaatataggagcaactaatcaa
acatgtcaaactgaacaaaatcgaattaatacaataaggcagttgcctttgttaaatgct
ttgttagttgagttgtccttgttatatgaccaacctgtgacaagtcctgctcatatacat
cctcacctagcctggttatataggactgaggataagaagtcacccgaatcttctgccaaa
tccacatgccggtctgaagccaagaaggataagcgttctgtggggggatgtgaaaagtca
gtgagtcttcagtataaaaagaaccaaattgaaaactataaggaagataaatattctgaa
aagagcagtggtgccctccataaaagagttccaaaagggaggctactttatggcttaaca
aatacactaagactacgtttaaagctgacaaatcctgatatgttggtggtacatgaaaaa
agagaactatatagaaaaagacaatcacaaatgttgggtacaaaattcagaattccgtca
tccaaagttaaactattaagctctgcagaacaaagtcagaagccacaactgcctgaagat
aagtatttagattcagatgcatctttcactgaaaatagtgatacctcaagacaaatcagt
ggagtttttgatgagcccagcacaagtaaagaaactaaactgaaatatgcaactgaaaaa
aagacagttgattgtagtaaaaatagaatcaataatgtttcattggaagaagttgtgagt
cctgcaaattccattattccagaaaggcttacccctacaaatattctgggaggaaatgtg
gaaatgaaaatccaaagtccatgtgttttccaacaggatgctgttgttgacagaattgta
gataaggaaatagatattagacaggtcaaaaccacagataatgacattcttatggctgat
ataagtgacaagagaacaggtaaaaatagttgctatgaaaacatctcagaactgaagtat
tcagatgatttgtctagcccttgctattctgaagatttctgtaccagtgaggacaccagc
agaagtttcaaagctcatgatagcagttcaaggacagaaaatccaaaacatagtcaatat
acaagcaagtctagtgacacaggagtgtccaaaaagaaaaatagtagtgacaggagttct
atccttagcccacctttttcagccgggtcacctgtacactcatacagaaaatttcatatt
tcaaagactcaggataaaagtttggaggaagcatctagtatctctgctagtgatttatct
tcaacacattggactgaacaaaaagaaaaccagatagatcaaaatagtatgcacaattct
gaaattacaaagagagctcaagacatctctgttaaaacaagaagtagttggaaatcttta
gaaaaaagccagtcaccacaaacatcccaggtgagttcttacctgccttcaaatgtgtcc
gaacttaatgtcctggatagcagtacatcagatcactttgaagaaggcaatgatgatgtt
ggttcactaaatatttccaagcaatgcaaagatatttgtgaattagtaataaataaactt
ccaggatacacaatgtaa

>gi568815597f:232705450_232908164|GENSCAN_predicted_peptide_6|81_aa
MLLSCASKLSPVDGSSRDSTVNEDSDSTGLRICIFSKTPGHSIHPLQRIQKHGTILELEI
GSSPDTEPAGARILDSQPSEL

>gi568815597f:232705450_232908164|GENSCAN_predicted_CDS_6|246_bp
atgttgctgagctgtgcttcaaaactttcacctgtggatggatcatcccgggactcaact
gtaaatgaagattctgattcaacaggtctgagaatctgcatttttagcaagactccagga
cacagtattcatcccctccagaggatacagaaacatggtactatcttggaattggagatt
gggtcctcaccagacactgaacctgctggtgccaggatattagactcccagccttcagaa
ctgtaa

>gi568815597f:232705450_232908164|GENSCAN_predicted_peptide_7|116_aa
MKKKINGKDGGSEETTEGVENPHQKSWRKPSPQPSIHKNQQMVNQDIHENQKTSSHWRGN
NGLLSQEEIGNLNRSRTSKDANSLQGSRISDPPPPFTMHGLAMGHIWSSSHFPAPF

>gi568815597f:232705450_232908164|GENSCAN_predicted_CDS_7|351_bp
atgaagaagaaaataaatggaaaagatgggggaagtgaagagaccacagaaggtgtggaa
aatccacaccagaagtcttggagaaaaccctctccccagccctctatccataaaaaccaa
caaatggtcaatcaagatattcatgaaaaccagaagaccagtagtcactggagagggaac
aatgggcttctatctcaagaagaaataggaaatctaaatagatctagaacaagtaaagat
gcaaactctctgcagggctcccggatctcagaccctcctcctcccttcacaatgcacggc
cttgccatgggccacatctggtcatcgtcccatttccctgcccccttctag

>gi568815597f:232705450_232908164|GENSCAN_predicted_peptide_8|92_aa
MPESPTHSMGSCVARASPTSTTPCSTAPSPIDHPRAEERERTAQDWQAAPPAAPVRDPLG
EASWAPDSVLAHGLYRRLHLPACQDPAGSKPL

>gi568815597f:232705450_232908164|GENSCAN_predicted_CDS_8|279_bp
atgcctgagtctcccactcactccatgggctcctgtgtggcccgagcctccccgacgagc
accaccccctgctccacggcacccagtcccatcgaccacccaagggctgaggaacgcgag
cgcacggcgcaggactggcaggcagctccacctgcagccccggtgcgggatccactgggt
gaagccagctgggctcctgactctgtgcttgctcatggcctctaccggaggctacacctc
ccagcctgtcaggaccctgcaggctctaagcctttataa

>gi568815597f:232705450_232908164|GENSCAN_predicted_peptide_9|98_aa
MTKKQEGKLMEQMEELSFQIKAYTIHSMERRNGLQGAMRNKKALVSLQVQVHGVWEKQSK
LQFKRAEEEAESSGGAVFQLQQETGERWRCNIVNHGHL

>gi568815597f:232705450_232908164|GENSCAN_predicted_CDS_9|297_bp
atgaccaagaaacaggaagggaaacttatggaacaaatggaggagttgtcttttcagata
aaagcatacaccattcactcaatggagagaagaaatggtctgcaaggagcaatgaggaac
aagaaggcacttgtgtcactccaggtccaagtgcatggagtatgggagaaacaaagcaag
ctacagtttaagagggctgaagaggaagcggagtcctctgggggcgctgtgtttcagttg
cagcaagaaacaggagagagatggcgatgcaatattgttaaccatgggcatctatag

>gi568815597f:232705450_232908164|GENSCAN_predicted_peptide_10|82_aa
MDKLSLPIRFWVVPGDHPKFEESWQWKKEHLIGAVSEVLGGREHAVQAQMLALDTPRCQC
EKKIKFRDPKLNTPKGKGKLGS

>gi568815597f:232705450_232908164|GENSCAN_predicted_CDS_10|249_bp
atggacaagctgtcactacccattagattttgggtggtcccaggggaccatcccaagttt
gaggaaagttggcaatggaaaaaggagcatctgataggagcagtcagtgaagtccttgga
ggcagagagcatgctgttcaggcgcagatgctggcgttggacacacccagatgtcagtgt
gaaaagaaaataaaattccgggaccccaaactcaatacaccaaagggaaaaggaaagctt
ggaagctga

>gi568815597f:232705450_232908164|GENSCAN_predicted_peptide_11|199_aa
XDPDLFYFVTLSALGLDPKGAAWSNVAAPAPVIASPAHFRQQEQGKENKGTKKDTVANKE
IDQISKFSSDFAFTQRFPERQASVPVSAARVLSFEVHVLRIARLAAHTICRWVIETLPIP
ASFLLSNHEMGYWAKCFKDTPQKGFDRNVVPGLGDAVQNGFSHAMAHSLAFLHHNGGARR
TAFPSFLAAGDDHVTDSDQ

>gi568815597f:232705450_232908164|GENSCAN_predicted_CDS_11|600_bp
nnggacccagacttgttctattttgtcactttatcagctctgggccttgatcccaaaggc
gctgcatggtccaatgtggctgctccagctccagtcatcgcatcaccagctcatttcaga
cagcaagagcaaggaaaggagaataagggaactaaaaaggacactgtcgcaaataaggaa
atagatcaaatttctaagtttagcagcgacttcgccttcacacagagattcccagaacgc
caagcctcggtgccagtttctgcggcccgtgtcctctcctttgaagttcatgtactacgc
atagcaaggctggcagctcacactatttgtagatgggtgatagaaacattacctatccct
gcttccttcttgctttccaaccacgaaatggggtactgggctaaatgcttcaaagacact
cctcaaaagggctttgacagaaatgttgttcctggcttgggagatgcagtgcagaatggc
ttttcacatgccatggcccattcccttgccttcctccaccataatggaggagcaaggaga
acagccttcccatccttccttgcagctggggatgaccatgtgacagattctgaccaatga