GENSCAN 1.0    Date run: 4-Nov-116    Time: 02:29:36

Sequence gi568815583f:59668352_59869143 : 200792 bp : 40.55% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 Intr -   4388   4286  103  0  1  107   97    58 0.087   6.91
 1.03 Intr -   9736   9560  177  2  0   63   78   166 0.879  12.07
 1.02 Intr -  11417  11241  177  0  0   87   94   161 0.998  15.57
 1.01 Init -  11974  11890   85  1  1   36   99   123 0.567   9.23
 1.00 Prom -  12606  12567   40                              -6.55

 2.00 Prom +  14032  14071   40                              -8.25
 2.01 Init +  15638  15765  128  1  2   68   92    52 0.188   3.28
 2.02 Intr +  20470  20660  191  1  2   32   55   132 0.231   2.61
 2.03 Intr +  20726  20967  242  0  2   31   42   321 0.387  18.15
 2.04 Intr +  33506  33550   45  1  0   99   87    34 0.427   2.09
 2.05 Intr +  37766  37955  190  1  1   54   53   155 0.639   6.84
 2.06 Term +  41825  41892   68  0  2  101   44    47 0.601  -1.28
 2.07 PlyA +  42497  42502    6                               1.05

 3.00 Prom +  47010  47049   40                              -7.65
 3.01 Init +  48379  48438   60  0  0   47  100    39 0.419   1.88
 3.02 Intr +  54304  54453  150  0  0  104   80    81 0.628   8.34
 3.03 Intr +  57078  57178  101  2  2   40   80    67 0.493  -0.91
 3.04 Term +  60100  60406  307  1  1   90   39   125 0.495   1.40
 3.05 PlyA +  60513  60518    6                               1.05

 4.00 Prom +  61205  61244   40                              -1.55
 4.01 Init +  63359  63406   48  1  0   61   82    50 0.530   2.93
 4.02 Term +  67719  67826  108  2  0  109   54    48 0.522   1.03
 4.03 PlyA +  69262  69267    6                               1.05

 5.04 PlyA -  71358  71353    6                               1.05
 5.03 Term -  73720  73547  174  1  0   57   40    75 0.083  -3.62
 5.02 Intr -  78674  78431  244  1  1   74  103   197 0.974  16.38
 5.01 Init -  89649  89459  191  0  2   46   98   109 0.964   6.23
 5.00 Prom -  92552  92513   40                              -6.95

 6.00 Prom +  98329  98368   40                              -6.45
 6.01 Sngl + 100001 100795  795  1  0  105   39  1030 0.998  95.42
 6.02 PlyA + 100819 100824    6                               1.05

 7.04 PlyA - 101084 101079    6                               1.05
 7.03 Term - 110650 110596   55  0  1  124   43    61 0.328   1.25
 7.02 Intr - 113144 112913  232  0  1   46   72   150 0.065   5.11
 7.01 Init - 115331 114986  346  2  1   43  -33   391 0.222  19.92
 7.00 Prom - 121741 121702   40                              -5.45

 8.00 Prom + 121858 121897   40                              -7.25
 8.01 Init + 123308 123796  489  1  0   79   97   232 0.569  18.44
 8.02 Term + 123965 124531  567  1  0   63   37   217 0.956   7.53
 8.03 PlyA + 124670 124675    6                               1.05

 9.00 Prom + 129187 129226   40                              -5.95
 9.01 Init + 136386 136737  352  2  1   50   77   222 0.527  14.67
 9.02 Intr + 138322 138824  503  2  2   46  -14   259 0.163   3.27
 9.03 Intr + 139347 139562  216  2  0   12   86   171 0.187   7.08
 9.04 Intr + 146603 146702  100  1  1   75  100    -3 0.000  -1.64
 9.05 Intr + 160671 160789  119  1  2   82   49    69 0.108   1.66
 9.06 Intr + 165419 165511   93  1  0   59   91    91 0.919   5.74
 9.07 Term + 166095 166448  354  2  0   39   41   199 0.857   3.81
 9.08 PlyA + 166754 166759    6                               1.05

10.05 PlyA - 167741 167736    6                               1.05
10.04 Term - 170936 170609  328  1  1   74   38   169 0.375   3.70
10.03 Intr - 181887 181802   86  0  2   77   84   125 0.342   8.70
10.02 Intr - 185120 184868  253  1  1   68   54   118 0.244   2.91
10.01 Init - 196669 196587   83  1  2   40   96   104 0.189   6.89

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr - 146407 146374   34  0  1   85  115     3 0.847   0.01
S.002 Init - 151760 151711   50  2  2   69  105    50 0.843   5.27

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583f:59668352_59869143|GENSCAN_predicted_peptide_1|181_aa
MPVSSRPLPEDDSIEADILAITGPEDQPGSLEVNGNKVRKKLMAPDISLTLDPSDGSVLS
DDLDESGEIDLDGLDTPSENSNEFEWEDDLPKPKTTEVIRKGSITEYTAAEEKEDGRRWR
MFRIGEQDHRVDMKAIEPYKKVISHGGYYGDGLNAIVVFAVCFMPESSQPNYRYLMDNLF
N

>gi568815583f:59668352_59869143|GENSCAN_predicted_CDS_1|543_bp
atgcctgtgtcttctagacctttaccagaagatgatagtattgaagcagatatactagct
ataactggaccagaggaccagcctggctcactagaagttaatggaaataaagtgagaaag
aaactaatggctccagacattagcctgacactggatcctagtgatggctctgtattgtca
gatgatttggatgaaagtggggagattgacttagatggcttagacacaccgtcagagaat
agtaatgagtttgagtgggaagatgatcttccaaaacccaagactactgaagtaattagg
aaaggctcaattactgaatacacagcagcagaggaaaaagaagatggacgacgctggcgt
atgttcaggattggagaacaggaccacagggttgatatgaaggcaattgaaccctataaa
aaagttatcagccatgggggatattatggggatggattaaatgccattgttgtgtttgct
gtctgtttcatgcctgaaagtagtcagcctaactatagatacctgatggacaatcttttt
aan

>gi568815583f:59668352_59869143|GENSCAN_predicted_peptide_2|287_aa
MSYSYLICYVNIQKMHKVTKKKALCSVMTKLQWKYPEVLDFTRVKGWGSGGGARFHPEHN
YQKRKSDGVGGQTAKYKQDPSQGRGPKQGGDPGSTSRAGSMAPPTKGRFPVQLRSHRAEF
SQAALTPEEALAPSSSSPLQAGDVGLTARSQKAGPSGARSPRSAVETPAQSPGRSGTASA
AAADPDTDYSTIHWINCVDPTKDLSSGCYMASKRVEEKRMLSVGAKTKLEILVHICINML
SGLGSPSYRRPLHHPEHRLLKYFWPDLGCIPQLQPHPLQGSEQIGLI

>gi568815583f:59668352_59869143|GENSCAN_predicted_CDS_2|864_bp
atgtcctatagctatcttatctgctacgtcaacattcaaaagatgcacaaagttacaaag
aaaaaagccttatgttccgtaatgacaaaattacaatggaaatatcctgaagtgttagat
ttcacaagggtaaaagggtgggggagcggaggaggggcaaggttccatcccgagcacaac
taccagaaacggaaaagcgacggggtggggggccaaactgccaaatacaaacaggaccca
agccaagggcggggacctaagcagggcggggatccaggaagcacctcccgagcaggttct
atggctccccctaccaagggccggttcccagtccagctccggagccaccgtgccgagttc
tcccaggccgcactcaccccggaggaagccttggccccctcgtcctcttcgcccctccag
gccggcgacgtggggctgacggccaggtcgcaaaaagcagggccgagcggagcccgctcc
cctcggtcggcggtggagaccccggcccaatcccccggccgcagcggtacggcgtcggcg
gcagcagctgacccggacacagactatagtaccattcactggatcaactgtgttgatcct
acaaaggatttaagcagcggttgctacatggcatcaaagagagtggaggagaaaaggatg
ctgtcagttggtgccaaaaccaaactggaaatcctggtgcatatttgcataaatatgctg
tctgggcttggaagcccttcttaccgcagacctcttcatcaccctgaacatcgccttttg
aaatatttctggccagatctagggtgtattccacagctgcagccccatccattgcaaggt
tctgagcagattggcttgatatga

>gi568815583f:59668352_59869143|GENSCAN_predicted_peptide_3|205_aa
MSERHLPALVLAAKALACDPQLSSLTMRAELTRHFKSCYQLNGSLVKKSTSSFSLSILII
ADGLSIISSWTLPVKYTDKNRTTPGAPFLGNDANCFKLDSGLSRAGERSWASAVEGLSSL
QLSVEADDEASTGIAAVVTLYGDMCLGTNINIFSCFKGFSKGKSPVMMTSLRLEYRFLRE
IFKFCIFTTVAFNKHASGIGGIMKM

>gi568815583f:59668352_59869143|GENSCAN_predicted_CDS_3|618_bp
atgtcagaaagacacctgccagcacttgtcctggcagctaaggcattggcctgtgaccca
cagctgagctccctgaccatgcgagctgagctgacacggcatttcaagtcctgttaccag
ctcaacggttctcttgtaaaaaaaagcacctcttcatttagcctctccatcttgatcatt
gctgatgggctatcaataatttcaagttggaccttgcctgtgaaatacacagataaaaat
cgtaccacacctggtgccccctttttggggaatgatgcaaattgtttcaaattggacagt
ggactctccagggcaggggaaagatcttgggcaagtgctgtagaaggtctttcttccctg
cagctctctgttgaggctgacgatgaagcaagcactggaattgcagccgttgttacactg
tatggcgatatgtgtttgggtaccaacatcaacattttttcatgctttaaaggtttttct
aaagggaaatcgccagtcatgatgacctcactacgactagaatacaggtttttaagggaa
atttttaaattctgtatttttactactgtggctttcaataaacatgcttctggcataggt
ggcatcatgaaaatgtaa

>gi568815583f:59668352_59869143|GENSCAN_predicted_peptide_4|51_aa
MPKGKLLWLKQGEGLQLTPIMLKATDHLFLASPTEASICQFPTYRAAKMPA

>gi568815583f:59668352_59869143|GENSCAN_predicted_CDS_4|156_bp
atgcctaaaggaaagctgctctggctgaagcagggtgaagggctgcagctgacacctatc
atgttgaaggctactgatcatcttttcctagcctcaccaacagaagcatcaatatgccaa
tttccaacctaccgtgcagcaaagatgccggcctga

>gi568815583f:59668352_59869143|GENSCAN_predicted_peptide_5|202_aa
MSCYVPRRRCRVSVKATKNQHRDLSVCCYHRIPVLTTIAGLIERYSSYLIISQCAPIGPQ
ASEGSECQVDSPGNPTDDTAVEKPGQEAKRLTLQARGEAPSGCICGALLKSAGNWAPQLW
LAQEASLRRPEVALKRCSIHSACHTLADTATLRDIPLKEGQAQPCGRALSHLSLPSPPPT
NLDSERNGSLHPWTVSTCICID

>gi568815583f:59668352_59869143|GENSCAN_predicted_CDS_5|609_bp
atgtcgtgctatgtgccaaggagaagatgcagagtatcagtgaaagccactaagaaccag
cacagagatctttcagtttgctgctatcacagaatcccagtacttaccacaattgcagga
cttattgaacggtattccagttacctgattattagtcagtgtgcccccataggaccccaa
gcttctgaagggtctgagtgccaagtggactcccctgggaatccaacggatgacacagct
gtagagaagccagggcaggaagcaaagaggctcacgctgcaggccagaggcgaagcccca
tcaggctgcatctgtggagctttgttgaagtctgcagggaactgggcaccacagctgtgg
ctggcacaagaggcaagtctgagaagacctgaagtggcgctcaagaggtgttcaatacac
agtgcatgtcacacgttggcagatacagcaactttaagagacatccctctgaaagaagga
caagctcaaccctgtggcagagctctgtcccacctcagcctccccagcccaccccccacc
aacctcgactcagagcgcaatggctctttgcatccttggacagtttctacttgcatctgc
atcgattaa

>gi568815583f:59668352_59869143|GENSCAN_predicted_peptide_6|264_aa
MAVGKNKRLTKGGKKGAKKKVVDPFSKKDWYDVKAPAMFNIRNIGKTLVTRTQGTKIASD
GLKGRVFEVSLADLQNDEVAFRKFKLITEDVQGKNCLTNFHGMDLTRDKMCSMVKKWQTM
IEAHVDVKTTDGYLLRLFCVGFTKKRKNQIRKTSYAQHQQVRQIRKKMMEIMIREVQTND
LKEVVNKLIPDSIGKDIEKACQSIYPLHDVFVRKVKMLKKPKFELGKLMELHGEGSSSGK
ATGDETGAKVERADGYEPPVQESV

>gi568815583f:59668352_59869143|GENSCAN_predicted_CDS_6|795_bp
atggctgttggcaagaacaagcgccttacgaaaggcggcaaaaagggagccaagaagaaa
gtggttgatccattttctaagaaagattggtatgatgtgaaagcacctgctatgttcaat
ataagaaatattggaaagacgctcgtcaccaggacccaaggaaccaaaattgcgtctgat
ggtctcaagggtcgtgtgtttgaagtgagtcttgctgatttgcagaatgatgaagttgca
tttagaaaattcaagctgattactgaagatgttcagggtaaaaactgcctgactaacttc
catggcatggatcttacccgtgacaaaatgtgttccatggtcaaaaaatggcagacaatg
attgaagctcacgttgatgtcaagactaccgatggttacttgcttcgtctgttctgtgtt
ggttttactaaaaaacgcaaaaatcagatacggaagacctcttatgctcagcaccaacag
gtccgccaaatccggaagaagatgatggaaatcatgatccgagaggtgcagacaaatgac
ttgaaagaagtggtcaataaattgattccagacagcattggaaaagacatagaaaaggct
tgccaatctatttatcctctccatgatgtcttcgttagaaaagtaaaaatgctgaagaag
cccaagtttgaattgggaaagctcatggagcttcatggtgaaggcagtagttctggaaaa
gccactggggacgagacaggtgctaaagttgaacgagctgatggatatgaaccaccagtc
caagaatctgtttaa

>gi568815583f:59668352_59869143|GENSCAN_predicted_peptide_7|210_aa
MTDKLSEADFRRSVITNFSKLKEHVLTHHKEAKNLEKGVDEWLTRINSVGNNLNDLMELK
TMVREVRDACTSFNSQFDQVEERISVIEDQINEIKRNDKFREKRVKGNKASKKYGKIQTT
IREYYKRLYANKLENLEEMDKFLDTETLPRLNQEEVESLNRPITGSEIEAIINSLPTKKS
PGPDGFTAKFYQRTGGDDLPASVPVQGLGP

>gi568815583f:59668352_59869143|GENSCAN_predicted_CDS_7|633_bp
atgactgacaagttgtcagaagcagacttcagaaggtcagtaataacaaacttctccaag
ctaaaggagcatgttctaacccatcacaaggaagctaaaaaccttgaaaaaggggtagac
gaatggctaactagaataaacagcgtagggaataacttaaatgacctgatggagctgaaa
accatggtacgagaagttcgtgatgcatgcacaagcttcaatagccaatttgatcaagtg
gaagaaaggatatcagtgattgaagatcaaattaatgaaataaagcgaaacgacaagttt
agagaaaaaagagtaaaaggaaacaaagcctccaagaaatatgggaaaatacaaactacc
atcagagaatactataaacgcctctatgcaaataaactagaaaatctagaagaaatggat
aaattcctagacacagagaccctcccaagactaaaccaggaagaagttgaatctctgaat
agaccaataacaggctctgaaattgaggcaataattaatagcttaccaaccaaaaaaagt
ccaggaccagatggattcacagccaaattctaccagagaacaggtggagatgatcttccg
gcatcagtcccagtccaaggtttgggcccctga
>gi568815583f:59668352_59869143|GENSCAN_predicted_peptide_8|351_aa
MKKSFIGDRAAQRRPAVGSSEICFGKLRSLLQPGCPDECSAPSREETLEWEAPLCRQVVL
SSLQLSAERTPWSGWLLSAAGRPDVCKSQQRGGLEWVASLCSWSSGCLQFWLSPGLLWVS
EGRKCILIGPWVAMGGPRKGITSSYSWGLAAQPPAIRPSLALKGVKAAGGWCVRTALSVC
TPGKAVTVPGLGPEIRVGADIREKLGSWSRYFQPSRAGREVSGEAFRVPKSKEMSGSVAA
TCAAAAVPGRVGLLPAPRSQEHRKSHVCSRGLGGSSGTQKAPTPTWEGRGSHLSLALGGS
MESAAPAVPPCYSRRDGSGHSRRPAAAIIIRMKSIQWEAMENEDRKGEDKY

>gi568815583f:59668352_59869143|GENSCAN_predicted_CDS_8|1056_bp
atgaagaagagctttatcggcgatagagcagctcagaggagacccgcagtgggaagctca
gagatctgctttgggaagctccgttcccttctgcagccagggtgtcctgatgagtgttca
gctcctagcagagaggaaaccctggagtgggaagctcctctctgcaggcaggttgtcctg
tcatctctgcagctgtcagcagaaaggacgccctggagtgggtggctcctctctgcagct
ggtcgtcctgatgtctgcaaatctcagcagagaggaggcctggagtgggtggcttctctc
tgcagctggtcctccggatgtctgcagttctggctgagcccagggcttttatgggtctca
gaggggaggaagtgcatactgattggtccatgggtggccatgggtggtcctagaaaaggc
atcacaagttcctactcctggggactggcagcccagcccccagccattaggccctccctg
gccttgaagggggtcaaggcagcagggggctggtgtgtcagaactgccttgagtgtgtgc
acacctggaaaggctgtgacggtgcctgggcttggccctgagatcagagtgggcgctgac
atcagggagaagctaggcagctggagcaggtacttccagccttcaagggcggggagggag
gtgtcgggggaggctttccgggtccccaagagtaaagagatgtctgggtctgtagccgca
acttgcgccgctgcagctgtgcccgggagggtggggctcctgcctgctcccaggtcccaa
gagcacaggaagtctcatgtctgcagccgtggcttgggtggctccagcggcacccagaaa
gctcccaccccgacttgggaggggaggggctcccacttgtccctggctctgggtggctcc
atggagagtgcagcccctgccgtgcctccctgctacagccggcgtgatggcagtggccac
tccagacggcctgccgctgccatcattatccggatgaaaagtatacaatgggaagcaatg
gaaaatgaggatagaaaaggagaggataaatattaa

>gi568815583f:59668352_59869143|GENSCAN_predicted_peptide_9|578_aa
MKDEMNEMKREEKFREKRIKRNEQSLQEIWDYVKRPNLRLIGIPESDGENGTKLENTLQD
IIQENFPNLARQANIQIQEIQRTPQRYSPRRASLRHIIVRFTKVETKEKMLRAAREKEIQ
TTIREYYKHLYTNKLENLEEMDKFLDTETLPRLNQEEVESLNRPTTGSEIEAIINSLPTK
KSPGPDRFTAELYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKE
NFRPIFLMNTDAKILNKILANRVQQHIKKLIHHDQVGFIPGDARLTESQIMSERPFTIAS
KRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDINKWKNIPRSWIGRINIVKMAILPKNTF
TETPGPVFEVDTTKNSIITSFPFCHMLFSRSKIISESSRPASSGLATPARRELFYSYGNA
QSFKTDSRPRSFIWEFEKFRSQRGANRKAKNIDDFRLLLTQDIKMQIKQKLSKQIMSLSN
GYINKGMNRGTTLLPSPEERFQRPTCKPQACNVLEILSGQSIYGQGFYPTPVAELFDAIL
SPPFNADLALQMCRFPKVLMFFAGVVYKVTKGNTNPSE

>gi568815583f:59668352_59869143|GENSCAN_predicted_CDS_9|1737_bp
atgaaagatgaaatgaatgaaatgaagcgagaagagaagtttagagaaaaaagaataaaa
agaaatgaacaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg
attggcatacctgaaagtgacggggagaatggaaccaagttggaaaacactctgcaggat
attatccaggagaacttccccaatctagcaaggcaggccaacattcaaattcaggaaata
cagagaacgccacaaagatactccccaagaagagcaagtctaagacacataattgtcaga
ttcaccaaagttgaaacgaaggaaaaaatgttaagggcagccagagagaaagaaatacaa
actaccatcagagaatactataaacacctctacacaaataaactagaaaatctagaagaa
atggataaattcctagacacagagaccctcccaagactaaaccaagaagaagtagaatct
ctgaatagaccaacaacagggtctgaaattgaggcaataattaatagcttaccaaccaaa
aagagtccaggaccagatagattcacagccgagctataccagaggtacaaagaggagctg
gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccccaactca
ttttatgaggccagcatcatcctgataccaaagcctggcagagacacaaccaaaaaagag
aactttagaccaatattcctgatgaacactgatgcaaaaatccttaataaaatactggca
aaccgagtccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct
ggggatgcaaggctgacagagagccaaatcatgagtgaacgcccattcacaattgcttca
aagagaataaaatacctaggaatccaacttacaagggacgtgaaggacctcttcaaggag
aactacaaaccactgctcaatgaaataaaagaggatataaacaaatggaagaacattcca
cgctcatggataggaagaatcaatatcgtgaaaatggccatactgcccaagaataccttc
acagaaacacctggaccagtgtttgaagttgacacaacaaaaaattccatcataacatcc
tttccgttttgccatatgttattcagtagaagcaagataatttctgaaagttccagacct
gcatcctctggcttagcaactccagccagaagagagctcttctattcatacgggaacgcc
cagagtttcaagactgactcacggccccggagcttcatttgggagtttgagaagttcaga
agccaaagaggtgctaacaggaaagcgaagaacattgatgattttaggctcctactgact
caggacattaagatgcaaattaaacagaaacttagtaagcaaatcatgtcactttctaat
ggatacatcaataaaggcatgaacagaggcaccacactgttgccaagccccgaggaaaga
tttcaaaggcccacctgcaagccccaggcatgcaatgtgttagaaattctatctggccaa
agtatttatggtcagggcttttaccccaccccagtagctgaactttttgatgctatcctt
tcaccccctttcaatgctgacttggctctgcagatgtgcaggttcccaaaggtgttgatg
ttctttgccggtgtggtttacaaggttacaaaggggaacacaaacccctcggagtaa

>gi568815583f:59668352_59869143|GENSCAN_predicted_peptide_10|249_aa
MWPHTPNIITKDCNKGYGYQPETVDKDRWSYCNVPVHILIAVTWGTDICVSVEYISRSGN
TGSKGVADTNSNHILLAFTSTVRKWLAENTGISLLRDFYLAIVEYLAHTSPEGSGGSQTE
AQSQAESRGEQLPPDTTGDLCWLNVKLVKERVWSEHSRPNRRKELPFLFMASEPAEQCEG
SWALLSSVSCVHKLCQSAPSSKCLHYPRTIAMDQLPSAASGEFLQHLESCSYSPLLAHNF
WQGSLPPSG

>gi568815583f:59668352_59869143|GENSCAN_predicted_CDS_10|750_bp
atgtggcctcatacacccaacattataacaaaagactgcaataagggctatggctatcag
ccagaaaccgtggacaaagaccgttggagctattgtaatgtccctgtgcatattcttata
gctgttacttggggcacagatatatgcgtttctgttgagtatatatctagaagtggaaat
accgggtcgaagggtgtagcagacacgaactctaaccatatactcttggcctttacctca
acggtccggaagtggcttgctgaaaacacaggcatctctctgctgagagacttttattta
gccattgttgaatatttggcccacacgagcccagaggggagtggaggaagccagacagag
gcacaatctcaggcagagtcccgtggagagcagcttccgcctgataccactggggacctc
tgctggcttaatgttaagcttgttaaagagagggtctggagtgagcattcaaggccaaac
agaaggaaggagttacccttcctgttcatggcctcggagccagcagagcagtgtgaagga
agctgggcactgctcagctcagtatcttgtgtgcacaagctctgccagtctgcaccctcc
agcaaatgtcttcactatccacggaccattgctatggaccagctcccatctgcagcctct
ggtgagtttctccagcacctggaaagctgctcctacagcccacttctggcccacaacttc
tggcaagggtctttaccaccgtcaggctga