GENSCAN 1.0 Date run: 2-Nov-116 Time: 23:12:16 Sequence gi568815585f:40962644_41182911 : 220268 bp : 39.49% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 54751 54790 40 -1.15 1.01 Init + 58092 58161 70 2 1 56 91 35 0.232 1.86 1.02 Term + 64246 64388 143 2 2 84 44 140 0.960 6.31 1.03 PlyA + 65808 65813 6 1.05 2.02 PlyA - 66096 66091 6 1.05 2.01 Sngl - 84318 83746 573 0 0 43 44 251 0.903 12.11 2.00 Prom - 87711 87672 40 -7.85 3.00 Prom + 88559 88598 40 -3.35 3.01 Init + 92111 92188 78 1 0 61 -16 65 0.196 -5.49 3.02 Intr + 92400 92531 132 2 0 73 110 46 0.464 5.22 3.03 Intr + 98264 98518 255 1 0 35 49 315 0.885 19.02 3.04 Intr + 98870 99032 163 1 1 13 84 298 0.977 20.53 3.05 Term + 99176 99372 197 0 2 50 46 144 0.590 2.99 3.06 PlyA + 99864 99869 6 1.05 4.00 Prom + 101156 101195 40 -6.85 4.01 Init + 101663 101701 39 1 0 64 63 6 0.526 -4.06 4.02 Intr + 102521 102644 124 1 1 77 98 148 0.906 13.94 4.03 Intr + 105918 106094 177 1 0 98 107 108 0.997 12.67 4.04 Intr + 108884 108930 47 0 2 22 86 55 0.429 -4.09 4.05 Intr + 110139 110214 76 2 1 69 107 64 0.611 4.57 4.06 Intr + 113401 113594 194 2 2 40 100 147 0.972 9.39 4.07 Intr + 118003 118166 164 0 2 87 89 126 0.994 10.45 4.08 Term + 120061 120271 211 1 1 41 33 182 0.835 3.78 4.09 PlyA + 120406 120411 6 1.05 5.02 PlyA - 120434 120429 6 1.05 5.01 Sngl - 130931 130764 168 2 0 94 43 112 0.613 1.81 5.00 Prom - 134649 134610 40 -1.65 6.05 PlyA - 134766 134761 6 1.05 6.04 Term - 145091 144402 690 2 0 93 43 174 0.215 6.00 6.03 Intr - 170008 167865 2144 2 2 12 65 2000 0.013 176.47 6.02 Intr - 171473 171388 86 1 2 42 105 79 0.009 3.54 6.01 Init - 208398 208127 272 0 2 49 91 282 0.418 21.09 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 19411 19340 72 1 0 35 80 141 0.828 9.12 S.002 Sngl - 169868 167844 2025 2 0 96 44 1936 0.951 183.89 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:40962644_41182911|GENSCAN_predicted_peptide_1|70_aa MHQETRSAEQSREIHLFPGKFPPDCTSGLMRNILQSADRALFTDGSARHASTTQKRTAAV LQPFFWNIQS >gi568815585f:40962644_41182911|GENSCAN_predicted_CDS_1|213_bp atgcaccaagaaacaaggtccgctgagcagtcacgggagattcacttgttccctggaaag ttccccccagattgtacctctggcctcatgaggaatattctacaatcagctgacagagcc ttgtttacagatggttctgcacgacatgcgagcaccacccaaaagcggacagctgcagtg ctacagccctttttctggaacatccagtcatga >gi568815585f:40962644_41182911|GENSCAN_predicted_peptide_2|190_aa MWDYVKRPNLRLIGVPESDGENGTKLENTLQDIIQENSPNLARQANIQIWEIQRTPQRYS LRRATLRHIIVRFTKVEMKEKLLRAAREKGRVTHRGKPIRLTADLSAETLQARREWGPIF NIIKEKSFQPRISYPAKLSFISEGEIKSFTDKQMLRDFVTTRPALQELLKEALNMERNNW YQPLQKHAKL >gi568815585f:40962644_41182911|GENSCAN_predicted_CDS_2|573_bp atgtgggactatgtgaaaagaccaaatctgcgtctcattggtgtacctgaaagtgacggg gagaatggaaccaagttggaaaacactcttcaggatataatccaggagaactcccccaat ctggcaaggcaggccaacattcaaatttgggaaatacagagaacaccacaaagatactcc ttgagaagagcaactctaagacacataatcgtcagattcaccaaagttgaaatgaaggaa aaactgttaagggcagccagagaaaaaggtcgggttacccacagagggaagcccatcaga ctaacagcagatctctcagcagaaactctacaagccagaagagagtgggggccgatattc aacattattaaagaaaagagttttcaacccagaatttcatatccagccaaactaagcttc ataagtgaaggagaaataaaatcctttacagacaagcaaatgctgagagattttgtcacc accaggcctgcgctgcaagagctcctgaaggaagcactaaacatggaaaggaacaactgg taccagccactgcaaaaacatgccaaattgtag >gi568815585f:40962644_41182911|GENSCAN_predicted_peptide_3|274_aa MEWEQERDSGDGEHRHNPSIHQFNLHVPSPASSLMIQRETSQSPCSKVTLLTVVCEREED MSPDTSSSLQLLLPPPPPPPPPPLLLPTRSRAREQAPSERVGVVGERRGGAAPLQQQSLR RSLATRSAGPAVSPLFFLTPPEDPDPQVPVAPLVKLGTAERWSRGLSKVSGSEGGSGGHR AAGRARLVPLGKRESEWNPGRLAERLAQSYLQGRFSGRGRNVTPLNALGYGARALGGGRR PVQGAGWTAAGEATQCSLYDAVGGRMHPLGFRSP >gi568815585f:40962644_41182911|GENSCAN_predicted_CDS_3|825_bp atggagtgggaacaggaaagggacagtggagatggggagcacagacataatccttccatc caccagttcaacctgcacgtgccaagccctgcttcaagtctgatgatacagagggaaaca agtcaaagcccctgctccaaagttacactcctgacagtggtctgtgagcgggaggaggat atgtccccagacacttccagctctctgcagctgctgctgccgccgccgccgccgccgccg ccgccgccgctgctgctgcccacacgctcccgagctagggaacaagccccatccgagcgc gtgggcgtcgtaggggagaggaggggcggggccgctccgctgcaacagcaaagtttgagg cgaagcctagcgactcgcagcgccggtcccgcagtttcacctcttttttttttaactccg ccagaggaccccgacccacaggtcccggttgctccgctggtcaaattgggaacagcggaa cgctggtcccggggactgagtaaggtgtctggatcggagggaggttcgggtgggcatcgg gcggctggaagagctcgactcgtcccgctgggaaagcgcgagtctgagtggaaccctgga cgacttgcagagcggctggcgcagtcatacctgcagggccggttctcagggcggggtaga aatgttaccccgctgaacgccctggggtatggggcacgggctctagggggaggccggcgg ccggttcagggggctggttggacggccgcgggtgaagcgacgcagtgctctctctacgac gctgtcgggggtcgcatgcaccccttgggtttccgaagcccctga >gi568815585f:40962644_41182911|GENSCAN_predicted_peptide_4|343_aa MDTVLLLQTVLQSIKQKSLDKAKEEEKASKEFAAMEAAALKAYQEDLKRLGLESEILEPS ITPVTSTIPPTSTSNQQKEKKEKKKRKKDPSKGRWVEGITSEGYHYYYDLISGASQWEKP EGFQGDLKKTAVKTVWVEGLSEDGFTYYYNTETGESRWEKPDDFIPHTSDLPSSKVNENS LGTLDESKSSDSHSDSDGEQEAEEGGVSTETEKPKIKFKEKNKNSDGGSDPETQKEKSIQ KQNSLGSNEEKSKTLKKSNPYGEWQEIKQEVESHEEVDLELPSTENEYVSTSEADGGGEP KVVFKEKTVTSLGVMADGVAPVFKKRRTENGKSRNLRQRGDDQ >gi568815585f:40962644_41182911|GENSCAN_predicted_CDS_4|1032_bp atggataccgtattgttattgcaaacagtgctgcagtcaattaaacagaaaagcctggat aaggcaaaggaagaagaaaaggcatcaaaggagtttgctgcaatggaggcagctgccctg aaagcataccaagaggatttgaaaagacttggcttagagtcagaaattttggagccaagc ataacaccagtaaccagcactatcccacctacctcgacatcaaatcaacagaaagaaaag aaagaaaagaagaaaagaaaaaaagatccttcaaagggcagatgggtagaaggcataacc tctgagggttaccattactattatgatcttatctcaggagcatctcagtgggagaaacct gaaggatttcaaggagacttaaaaaagacagcagtgaagaccgtttgggtagaaggttta agtgaagatggttttacctattactataatacagaaacaggagaatccagatgggagaaa cctgatgatttcattccacacactagtgatctgccttctagtaaggtcaatgaaaattca cttggcaccctagatgaatccaaatcatcagattcgcatagtgattctgatggggaacag gaagcagaagaaggaggggtctctacagagacagaaaagccaaaaataaagtttaaggaa aaaaataaaaatagtgatggaggaagtgacccagaaacacagaaagaaaaaagtattcag aaacagaattcattaggttcaaatgaagaaaaatcgaaaactcttaagaaatcaaaccca tatggagaatggcaagaaattaaacaagaggttgagtctcatgaggaggtagatttggaa cttccaagcactgaaaatgagtatgtatcaacttcagaagctgatggtggcggagaaccc aaagtggtatttaaagaaaaaacagtcacttctcttggagttatggcagatggagtggcc ccagtcttcaaaaagagaagaactgaaaatggaaaatctagaaatttaaggcaacgaggt gatgatcaatag >gi568815585f:40962644_41182911|GENSCAN_predicted_peptide_5|55_aa MAGEASGNLQSLWKAKGKQAHLTWLEQEEEIEEEVLHTSKQADPMATHSLAQEQQ >gi568815585f:40962644_41182911|GENSCAN_predicted_CDS_5|168_bp atggctggggaagcctcgggaaatttacaatcattgtggaaggcaaaagggaagcaggca catcttacatggttagagcaggaggaagagatagaagaggaggtgctacatacttctaaa caagcagatcctatggcaactcactcactagcacaagaacagcaataa >gi568815585f:40962644_41182911|GENSCAN_predicted_peptide_6|1063_aa MRSLGQNPTEAELQDMINEVDADGNGTVDFPEFLTMMARKMKDTDSEEEIRDAFCVFDKD GNGYISATELHHVMTNLGENLTDDEVDEMIRGNGKKSGLIVLTTVDSDERGRQWQGFTGG LNSVNGLVLLSLRRRCYLSVSEGRLRRSQSRVLQRFSPSAPVAISTMQSREDAPRSRRLA SPRGGKRPKKIHKPTVSAFFTGPEELKDTAHSAALLAQLKSFYDARLLCDVTIEVVTPGS GPGTGRLFPCNRNVLAAACPYFKSMFTGGMYESQQASVTMHDVDAESFEVLVDYCYTGRV SLSEANVERLYAASDMLQLEYVREACASFLARRLDLTNCTAILKFADAFGHRKLRSQAQS YIAQNFKQLSHMGSIREETLADLTLAQLLAVLRLDSLDVESEQTVCHVAVQWLEAAPKER GPSAAEVFKCVRWMHFTEEDQDYLEGLLTKPIVKKYCLDVIEGALQMRYGDLLYKSLVPV PNSSSSSSSSNSLVSAAENPPQRLGMCAKEMVIFFGHPRDPFLCCDPYSGDLYKVPSPLT CLAHTRTVTTLAVCISPDHDIYLAAQPRTDLWVYKPAQNSWQQLADRLLCREGMDVAYLN GYIYILGGRDPITGVKLKEVECYNVKRNQWALVAPLPHSFLSFDLMVIRDYLYALNSKRM FCYDPSHNMWLKCVSLKRNDFQEACVFNEEIYCICDIPVMKVYNPVRAEWRQMNNIPLVS ETNNYRIIKHGQKLLLITSRTPQWKKNRVTVYEYDIRGDQWINIGTTLGLLQFDSNFFCL SARVYPSCLEPGQSFLTEEEEIPSESSTEWDLGGFSEPDSESGSSSSLSDDDFWDSIFLL LPFMVLGLGSNPAPRSEWVWGVERGQAVRADTPEPAGTGGGGWWGVGMCRVGLPLVPGSC LFLEPEAQTCSCGCCSCSYTWKGRSYLFPALPRVQGGSDPLLHFGRLQPCPGGWGFCLLG GVEQEAWICRCVFGSCSCTGSSHPNLEGAGLPLAPWSVQPQLHFPAAAGVMVAAPAITTV HMRSLTRLDLGLPSVTCGISDYAPIHLPFHYLFYTRDTVDVLY >gi568815585f:40962644_41182911|GENSCAN_predicted_CDS_6|3192_bp atgaggtctcttgggcagaatcccacagaagcagagttacaggacatgattaatgaagta gatgctgatggtaatggcacagttgacttccctgaatttctaacaatgatggcaagaaaa atgaaagacacagacagtgaagaagaaattagagatgcattctgtgtgtttgataaggat ggcaatggctatattagtgcaacagaacttcaccatgtgatgacaaaccttggagagaat ttaacagatgacgaggttgatgaaatgatcagagggaatgggaagaagagtggcttgatt gttttgactacggtggacagtgatgagagaggaagacagtggcagggtttcacaggaggg ttgaattcagtaaatgggctcgtgctgctgtctcttcggagacgctgctatcttagcgtc agcgagggaaggttgaggaggagccagagccgggtcctgcagcgtttctcgccatcagcg cccgtcgccatctccaccatgcagtcccgggaagacgccccgcgctctcgccgcctagcc agtccccgtggtgggaagcggcccaagaagattcacaaacccacagtttcggcctttttc acgggtccagaggaattaaaggacacggcccattctgcagccctgctggcacagctcaag tccttctacgatgcgcggctgctgtgtgatgtgaccatcgaggtggtgacgcctggcagc gggcctggcacgggtcgcctgttcccctgcaaccgcaatgtgctggccgcggcatgtccc tacttcaagagcatgttcacaggtggcatgtacgagagccagcaggccagcgtgaccatg cacgatgtggacgccgagtccttcgaggtgttggtcgactactgctacacgggtcgtgtg tctctcagtgaggccaacgtggagcgcctgtacgcggcctccgacatgctacagctggaa tatgtgcgggaagcctgtgcctccttcttagcccgacgtcttgacctgaccaactgcacc gccatcctcaagtttgcagatgcctttggccatcgcaagctgcgatcccaggcccagtcc tatatagctcagaacttcaagcaactcagccacatgggttcaattcgggaggagactcta gcagatctgaccctggcccagctgctggctgtcctgcgcttggatagtctggacgtggag agtgagcagacagtgtgccatgtggcagtgcagtggctggaggctgctcccaaagagcgg ggtcccagtgctgcagaagtcttcaagtgcgtgcgctggatgcacttcactgaagaagat caggactacttagaagggctgctgaccaagcccatcgtgaagaagtactgcctggacgtt attgaaggggccctgcagatgcgctatggtgacctgttgtacaagtctctggtgccagtg ccaaacagcagcagcagcagtagcagcagcaactctcttgtatctgcagcagaaaatcca ccccagagactgggtatgtgtgccaaggagatggtgatcttctttggacaccccagagat ccctttctctgctgtgatccatactcgggggacctttacaaagtgccgtcacctttgacc tgtctggctcacactaggactgtcaccactttagctgtctgtatctctcctgaccatgac atctatctagctgctcagcccaggacagacctctgggtgtataaaccagctcagaatagt tggcagcaacttgcagatcgcttgctgtgtcgtgagggcatggatgtggcatatctcaat ggctatatctacattttgggggggcgagaccctattactggagttaagttgaaggaagtg gaatgctacaatgttaagagaaaccagtgggcattggtggctccactgccccattctttt ttatcctttgacctaatggtaattcgagactatctctatgctctcaacagtaagcgcatg ttctgttatgatcctagccacaatatgtggctgaagtgcgtttctctgaagcgcaatgac tttcaggaagcctgcgtcttcaatgaggagatctattgtatctgtgatatcccagtcatg aaggtctacaacccagttagggcagaatggaggcaaatgaataatattcccttggtctca gagaccaacaactacagaattatcaagcatggccaaaaattgttgctcatcacctctcgc accccacagtggaaaaagaaccgggtgactgtgtatgaatatgatattaggggagaccaa tggattaatataggtaccacattaggcctcttgcagtttgattctaactttttttgcctc tctgctcgtgtttatccttcctgccttgaacctggtcagagtttcctcactgaagaagaa gaaataccaagtgagtctagcactgaatgggacttaggtggattcagtgagccagactct gagtcaggaagttcaagttctctttctgatgatgatttttgggactctatcttcctcctg ctgccattcatggtcctggggcttggttccaaccctgctccgagatcagagtgggtgtgg ggagtggagagaggccaggcagtgagagcagacacccctgagcctgcagggacaggtggt gggggttggtggggggtagggatgtgtcgggtggggctcccgcttgtccctggctcctgc ctgttcctggagccggaggcccagacctgcagctgcgggtgctgcagctgcagctacacc tggaagggcagatcctacttgttcccagctctcccaagagtacagggaggctcggatcca ttgctgcattttgggcggctgcagccctgcccaggagggtggggcttctgcctgctcggt ggagtagagcaggaggcctggatctgcaggtgcgttttcggcagctgcagctgcacaggg agctcccatcccaacttagaaggggccgggctcccccttgctccatggagtgtgcaaccc cagctgcacttccctgctgcagccggcgtgatggtagcagcccctgccatcaccaccgta cacatgaggtcattgacccgtctggatttaggtttaccatctgttacttgcggaatttca gactatgctccaatccacctaccttttcattatttgttttacacaagggatactgttgat gtcctctactag