GENSCAN 1.0 Date run: 5-Nov-116 Time: 20:58:21

Sequence gi568815596r:80202254_80403819 : 201566 bp : 40.12% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   1389   1639  251  1  2   89   44   146 0.682   5.18
 1.02 PlyA +   3099   3104    6                               1.05

 2.02 PlyA -   3219   3214    6                               1.05
 2.01 Sngl -  11576  10683  894  2  0   49   42   323 0.870  19.68
 2.00 Prom -  19611  19572   40                              -2.65

 3.03 PlyA -  20856  20851    6                               1.05
 3.02 Term -  40707  40402  306  0  0  101   55   136 0.772   5.73
 3.01 Init -  41753  41628  126  2  0   77   47   106 0.833   5.70
 3.00 Prom -  43228  43189   40                              -5.75

 4.03 PlyA -  43498  43493    6                               1.05
 4.02 Term -  46891  46774  118  0  1  105   41   142 0.196   8.23
 4.01 Init -  63754  63588  167  1  2   69   95   123 0.408  10.25
 4.00 Prom -  69567  69528   40                              -5.35

 5.00 Prom +  74226  74265   40                              -8.55
 5.01 Sngl +  74284  74490  207  0  0   79   48   264 0.980  16.44
 5.02 PlyA +  78440  78445    6                               1.05

 6.00 Prom +  81245  81284   40                              -3.75
 6.01 Init +  82845  82955  111  2  0   19   81   141 0.506   6.76
 6.02 Intr +  89453  89660  208  1  1   30   70   140 0.252   4.13
 6.03 Term +  89766  89917  152  1  2  110   43    97 0.899   4.49
 6.04 PlyA +  90780  90785    6                               1.05

 7.04 PlyA -  92489  92484    6                               1.05
 7.03 Term -  98655  98476  180  0  0  100   48   121 0.855   5.93
 7.02 Intr - 101456 100004 1453  1  1   70   48  1743 0.870 157.23
 7.01 Init - 102314 102043  272  2  2   46   50   208 0.614   9.30
 7.00 Prom - 107003 106964   40                              -3.95

 8.00 Prom + 110782 110821   40                              -5.15
 8.01 Init + 112390 112515  126  0  0   75   64   119 0.834   8.41
 8.02 Term + 120297 120503  207  2  0   69   54   180 0.128   9.06
 8.03 PlyA + 122413 122418    6                               1.05

 9.04 PlyA - 122519 122514    6                               1.05
 9.03 Term - 124350 124252   99  0  0   93   43    65 0.161  -0.35
 9.02 Intr - 131380 131309   72  1  0   68   98    37 0.334   1.38
 9.01 Init - 143112 142981  132  0  0   59   87   136 0.934  10.79
 9.00 Prom - 166939 166900   40                              -6.15

10.00 Prom + 173829 173868   40                              -4.45
10.01 Init + 178825 179045  221  0  2   70   77   138 0.489   9.05
10.02 Term + 200210 200324  115  2  1   52   49   107 0.002   0.26
10.03 PlyA + 200769 200774    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:80202254_80403819|GENSCAN_predicted_peptide_1|83_aa
XWFWVSAAFPGAQCKLLVELPLWFLEDGGPLLTAPLGGAPVGTLCGGSNSTFPFRTALAE
VLYEGPTPATNFCLGIQAFSYIL
>gi568815596r:80202254_80403819|GENSCAN_predicted_CDS_1|252_bp
ngctggttttgggtgtctgcagcttttccaggtgcacagtgcaagctgttggtggagtta
ccactctggtttctggaggatggtggccctcttctcacagctccactaggtggtgcccca
gtagggactctgtgtgggggctccaactccacatttcccttccgcactgccttagcagag
gttctctatgagggtcccacccctgcaacaaacttctgcctgggcatccaggcattttca
tacatcctctga
>gi568815596r:80202254_80403819|GENSCAN_predicted_peptide_2|297_aa
MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSAEYTFFSAPHHTYS
KIDHTVGSKALLSKCKRTEIITNCLSDHSAIKLELRIKKLTQNCSTTWKLNNVLLNDYWV
HNEMKAEIKMFFETNKNKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK
ELEKQEQTHSKTSRRQEITKIRGEVKEIETQKTLQKINESSSWFFEKINKIDGPLARQIK
KKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLCANKLENLEEMDKFLDTYTLP
>gi568815596r:80202254_80403819|GENSCAN_predicted_CDS_2|894_bp
atgggagactttaacaccccactgtcaacattagacagatcaacaagacagaaagttaac
aaggatacccaggaattgaactcagctctgcaccaagcagacctaatagacatctataga
actctccaccccaaatcagcagaatatacattcttttcagcaccacaccacacttattcc
aaaattgaccacacagttggaagtaaagcactcctcagcaaatgtaaaagaacagaaatt
ataacaaactgtctctcagaccacagtgcaatcaaattagaactcaggattaagaaactc
actcaaaactgctcaactacgtggaaactgaacaatgtgctcctgaatgactactgggta
cataatgaaatgaaggcagaaataaaaatgttctttgaaaccaataagaacaaagacaca
acataccagaatctctgggacacatttaaagcagtgtgtagagggaaatttatagcacta
aatgcccacaagagaaagcaggaaagatctaaaattgacaccctaacatcacaattaaaa
gaactagagaagcaagagcaaacacattcaaaaactagcagaaggcaagaaataactaag
atcagaggagaagtgaaggagatagagacacaaaaaacccttcaaaaaatcaatgaatcc
agtagctggttttttgaaaagatcaacaaaattgatgggccgctagcaagacaaataaag
aagaaaagagagaagaatcaaatagacacaataaaaaatgataaaggggatatcaccact
gatcccacagagatacaaactaccatcagagaatactataaacacctctgtgcaaataaa
ctagaaaatctagaagaaatggataaattcctcgacacatacaccctcccatga
>gi568815596r:80202254_80403819|GENSCAN_predicted_peptide_3|143_aa
MRPRLMQMALQAAALMSPSRNCSKSQPGCGGIGTLSQWEYKMKLFLALCCSLRLSPPVVL
VKTHSLDPGGSIAILLHLAGSLQNEENTPPRVTIKKREQTQAGRGTARARLLFSCPIWIR
EASGEMEPNSEWEMPLFELTLRQ
>gi568815596r:80202254_80403819|GENSCAN_predicted_CDS_3|432_bp
atgaggccacgtttgatgcagatggccctgcaggctgctgcattgatgtcccctagtaga
aactgcagcaaaagccagccaggatgtggaggaattgggactctcagtcagtgggaatat
aaaatgaaactgtttctggccctttgctgctccctccgcctctctcctccagtggtccta
gtgaaaacccatagcctggaccctggtggcagcatagctattctgctccatctagcagga
tccctgcagaatgaagaaaacacaccacccagagttaccatcaagaagagagaacagacc
caggctggcagaggaacagctcgggccagactgctgttcagctgccctatctggataagg
gaggcatctggggagatggagccaaactctgagtgggaaatgccattgtttgagttgaca
cttagacagtga
>gi568815596r:80202254_80403819|GENSCAN_predicted_peptide_4|94_aa
MTKSQNQKLCRQFNTSKWPPESAEVKERTFLDLILELLNQMFHVGTKESLVFQVPRLLSV
EIKACCGNETTSSPFLLEEPADDGGEKGTAGSFY
>gi568815596r:80202254_80403819|GENSCAN_predicted_CDS_4|285_bp
atgaccaaaagtcagaatcagaaactctgccgacagtttaacacatccaaatggccacca
gaatcagctgaagttaaagaacgtacattcctggatctcattctagaactacttaatcag
atgttccatgttgggacaaaagaatctttagttttccaagttcccaggttactctctgta
gaaataaaagcatgctgtggaaatgaaaccacatcatcaccttttcttctggaagagcct
gctgatgatggaggagagaagggcactgcaggctctttctactaa
>gi568815596r:80202254_80403819|GENSCAN_predicted_peptide_5|68_aa
MVPAPASDEGFNVLSLIAEDEGEKASHGKGKEMREREEEKEEEEEEEEEEEEEEGCARLF
LTTRSHRN
>gi568815596r:80202254_80403819|GENSCAN_predicted_CDS_5|207_bp
atggtgccagcacctgcttctgatgaaggcttcaatgtactttcactaatagcagaagat
gaaggagagaaggcatcacatggcaaggggaaggaaatgagagaaagagaggaggagaag
gaggaggaggaggaggaagaagaagaagaggaggaggaggagggatgtgccagactcttt
ttaacaaccagatctcacaggaactaa
>gi568815596r:80202254_80403819|GENSCAN_predicted_peptide_6|156_aa
MSVQLVSHDLASWKLAKPDHDDFSLYHGSRLPSWMKKELTIDSKQLRCGNSFSTSPEGPC
SWLLAFSSYENWKMCVHMATCIPGSGSPGLPDHSGKGKPLKNQNKVSPVLSGIQVFESHP
ITEENGDSLALQDDNPGYTLIQPLPGYVTLFCSSFI
>gi568815596r:80202254_80403819|GENSCAN_predicted_CDS_6|471_bp
atgtccgtccagctggtcagccatgacctggcttcatggaaactggcaaaaccagaccat
gatgattttagcctttatcatggcagcagactgccatcatggatgaagaaggaattgact
atagatagcaaacagctcagatgtggaaacagtttttccacatcaccagaaggcccttgt
agctggctgctagccttcagctcctatgaaaactggaagatgtgtgtgcacatggccacg
tgtattccaggatcaggttcaccagggctgccagaccacagtggtaaaggcaagccacta
aagaaccagaacaaagtcagcccagtgctttcaggaattcaggtttttgagtctcatcct
atcacagaagaaaatggtgacagcctagctttgcaggatgacaatcctggttacaccctg
attcaaccccttcctggctatgtgaccttgttttgttcatcttttatttag
>gi568815596r:80202254_80403819|GENSCAN_predicted_peptide_7|634_aa
MAGCFRGVKGGAGQWSFPMTDINQTVRSWGVASPEFGVFSPHNVTVRTAEGKEGGRKAKL
GLRHVVGKLAGPRSRLPALPAALAAPSRAANGCPQLCRCEGRLLYCEALNLTEAPHNLSG
LLGLSLRYNSLSELRAGQFTGLMQLTWLYLDHNHICSVQGDAFQKLRRVKELTLSSNQIT
QLPNTTFRPMPNLRSVDLSYNKLQALAPDLFHGLRKLTTLHMRANAIQFVPVRIFQDCRS
LKFLDIGYNQLKSLARNSFAGLFKLTELHLEHNDLVKVNFAHFPRLISLHSLCLRRNKVA
IVVSSLDWVWNLEKMDLSGNEIEYMEPHVFETVPHLQSLQLDSNRLTYIEPRILNSWKSL
TSITLAGNLWDCGRNVCALASWLNNFQGRYDGNLQCASPEYAQGEDVLDAVYAFHLCEDG
AEPTSGHLLSAVTNRSDLGPPASSATTLADGGEGQHDGTFEPATVALPGGEHAENAVQIH
KVVTGTMALIFSFLIVVLVLYVSWKCFPASLRQLRQCFVTQRRKQKQKQTMHQMAAMSAQ
EYYVDYKPNHIEGALVIINEYGSCTCHQQPARECEVSRSIIEISVSIELKSEEVVVVAVG
GCREGKGKAVCIGRRAPSSDDNLSKCMIEINCHG
>gi568815596r:80202254_80403819|GENSCAN_predicted_CDS_7|1905_bp
atggcaggctgtttccgcggagtaaaaggtggcgccggtcagtggtcgtttccaatgacg
gacattaaccagactgtcagatcctggggagtcgcgagccccgagtttggagttttttcc
ccccacaacgtcacagtccgaactgcagagggaaaggaaggcggcaggaaggcgaagctc
gggctccggcacgtagttgggaaacttgcgggtcctagaagtcgcctccccgccttgccg
gccgcccttgcagccccgagccgagcagcaaacgggtgcccgcagctgtgccggtgcgag
gggcggctgctgtactgcgaggcgctcaacctcaccgaggcgccccacaacctgtccggc
ctgctgggcttgtccctgcgctacaacagcctctcggagctgcgcgccggccagttcacg
gggttaatgcagctcacgtggctctatctggatcacaatcacatctgctccgtgcagggg
gacgcctttcagaaactgcgccgagttaaggaactcacgctgagttccaaccagatcacc
caactgcccaacaccaccttccggcccatgcccaacctgcgcagcgtggacctctcgtac
aacaagctgcaggcgctcgcgcccgacctcttccacgggctgcggaagctcaccacgctg
catatgcgggccaacgccatccagtttgtgcccgtgcgcatcttccaggactgccgcagc
ctcaagtttctcgacatcggatacaatcagctcaagagtctggcgcgcaactctttcgcc
ggcttgtttaagctcaccgagctgcacctcgagcacaacgacttggtcaaggtgaacttc
gcccacttcccgcgcctcatctccctgcactcgctctgcctgcggaggaacaaggtggcc
attgtggtcagctcgctggactgggtttggaacctggagaaaatggacttgtcgggcaac
gagatcgagtacatggagccccatgtgttcgagaccgtgccgcacctgcagtccctgcag
ctggactccaaccgcctcacctacatcgagccccggatcctcaactcttggaagtccctg
acaagcatcaccctggccgggaacctgtgggattgcgggcgcaacgtgtgtgccctagcc
tcgtggctcaacaacttccaggggcgctacgatggcaacttgcagtgcgccagcccggag
tacgcacagggcgaggacgtcctggacgccgtgtacgccttccacctgtgcgaggatggg
gccgagcccaccagcggccacctgctctcggccgtcaccaaccgcagtgatctggggccc
cctgccagctcggccaccacgctcgcggacggcggggaggggcagcacgacggcacattc
gagcctgccaccgtggctcttccaggcggcgagcacgccgagaacgccgtgcagatccac
aaggtggtcacgggcaccatggccctcatcttctccttcctcatcgtggtcctggtgctc
tacgtgtcctggaagtgtttcccagccagcctcaggcagctcagacagtgctttgtcacg
cagcgcaggaagcaaaagcagaaacagaccatgcatcagatggctgccatgtctgcccag
gaatactacgttgattacaaaccgaaccacattgagggagccctggtgatcatcaacgag
tatggctcgtgtacctgccaccagcagcccgcgagggaatgcgaggtttccaggtccatc
attgagatttcagtttctattgaactaaagagtgaggaggtggtggtagtggcagttggt
gggtgcagagaagggaaggggaaggctgtgtgcattggacggagggctccttcatcagat
gacaatttaagcaaatgtatgattgaaataaactgtcatggatga
>gi568815596r:80202254_80403819|GENSCAN_predicted_peptide_8|110_aa
MSEAAKETCKDDAVCLMVLCQCLAGVECIISEEGLPLAYHINLRDGARRRRPRGPKRQPG
PGADALPRPQPSRRVCCQDDRKRVCGEMGASVALSIIQNPVAFAGSSLSM
>gi568815596r:80202254_80403819|GENSCAN_predicted_CDS_8|333_bp
atgagtgaagctgcaaaggaaacgtgtaaggatgatgctgtctgcctcatggttctatgt
caatgtctagctggtgttgagtgtatcatttctgaagagggtttgccattggcatatcac
atcaatctccgggacggcgcgcggcggcggaggccgcggggccccaagcgccagcctggc
cccggcgcagatgcgctgccccggccgcagcccagccgccgcgtgtgttgtcaggacgat
cggaaacgcgtgtgtggggagatgggtgccagtgtcgccttgtccattatccaaaaccca
gtcgcttttgctggttcctcactcagcatgtga
>gi568815596r:80202254_80403819|GENSCAN_predicted_peptide_9|100_aa
MNVSGARAFQTEGPADVNGSEAGTLLRMAMGLVCQELNEEEQWKKKEEKKGGGWLSGLSN
SLTNGIMQAHLDWPSQSDANGVDSIGRGMDPLTHITLSST
>gi568815596r:80202254_80403819|GENSCAN_predicted_CDS_9|303_bp
atgaatgtcagtggggcaagggccttccagacagaaggacctgcagatgtaaatggctct
gaagcaggaacattgctgagaatggcaatggggctagtgtgccaggagctgaatgaagaa
gagcagtggaagaaaaaagaagaaaaaaaaggtggtggctggctttctggtctgagcaac
tcactgaccaatggtattatgcaggcacacttagattggccttcacagtcagatgctaat
ggagttgattccattggcagagggatggatccacttacccatatcaccctgagcagtaca
taa
>gi568815596r:80202254_80403819|GENSCAN_predicted_peptide_10|111_aa
MLSEVPDSPKPEECQWLAPSCSGGGSLLGVGAGVGGSLAFPQLGDQFEKSRKSTELDKLH
PKQAKDSISGLTISLDPGAIQEPIKNCLIRIKDAVTQEISVALEAVSNVTT
>gi568815596r:80202254_80403819|GENSCAN_predicted_CDS_10|336_bp
atgctcagtgaggtccctgattcacctaagcctgaggagtgccagtggcttgcaccttcg
tgttcaggaggtggtagtctcctgggtgttggtgctggggttgggggcagcctggccttc
ccccagcttggagatcagtttgagaagtcgaggaagagcacagagcttgataagcttcac
ccaaagcaagcaaaagacagcataagtgggctaacaataagcctggatcctggggctatc
caggagcccatcaaaaattgcctcattagaataaaagatgctgtcacccaagaaatttca
gtggctttagaagctgtttccaatgttaccacctag