GENSCAN 1.0 Date run: 6-Nov-116 Time: 05:46:38 Sequence gi568815594f:138915982_139145471 : 229490 bp : 41.30% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 585 704 120 2 0 111 90 65 0.884 9.24 1.02 Intr + 9186 9288 103 2 1 103 80 46 0.638 4.13 1.03 Intr + 12173 12402 230 0 2 71 115 66 0.385 4.27 1.04 Intr + 12522 12728 207 2 0 97 19 156 0.895 7.95 1.05 Term + 23747 23884 138 1 0 80 53 91 0.011 1.78 1.06 PlyA + 25091 25096 6 1.05 2.03 PlyA - 25270 25265 6 1.05 2.02 Term - 28497 28360 138 0 0 29 47 160 0.872 2.98 2.01 Init - 28620 28534 87 0 0 93 72 44 0.763 3.99 2.00 Prom - 60398 60359 40 -2.55 3.00 Prom + 71894 71933 40 -3.75 3.01 Init + 76502 76562 61 1 1 74 34 127 0.018 7.36 3.02 Intr + 80331 80391 61 1 1 50 87 56 0.002 -1.43 3.03 Intr + 83626 83789 164 1 2 7 74 147 0.009 3.90 3.04 Term + 97804 97910 107 2 2 69 47 105 0.039 2.09 3.05 PlyA + 98099 98104 6 1.05 4.00 Prom + 98197 98236 40 -6.95 4.01 Init + 100001 100190 190 1 1 59 109 155 0.002 11.92 4.02 Intr + 127093 127362 270 2 0 79 91 239 0.764 19.89 4.03 Term + 128658 129493 836 1 2 80 36 638 0.998 49.96 4.04 PlyA + 130168 130173 6 1.05 5.08 PlyA - 130293 130288 6 1.05 5.07 Term - 143626 143002 625 0 1 89 38 524 0.979 40.37 5.06 Intr - 144693 144343 351 2 0 115 115 348 0.999 33.71 5.05 Intr - 146076 145884 193 1 1 82 83 78 0.924 4.33 5.04 Intr - 151789 151703 87 2 0 61 86 51 0.669 1.22 5.03 Intr - 156058 155885 174 2 0 76 75 164 0.884 12.89 5.02 Intr - 157586 157473 114 0 0 69 84 99 0.967 7.10 5.01 Init - 161076 161007 70 0 1 55 59 100 0.927 5.06 5.00 Prom - 166070 166031 40 -6.25 6.00 Prom + 166529 166568 40 -8.55 6.01 Init + 167626 168253 628 0 1 45 58 433 0.957 31.55 6.02 Term + 172622 172677 56 0 2 120 42 60 0.466 1.34 6.03 PlyA + 173175 173180 6 1.05 7.04 PlyA - 173241 173236 6 1.05 7.03 Term - 199331 198949 383 0 2 35 55 557 0.482 41.12 7.02 Intr - 209348 209183 166 2 1 70 74 261 0.997 21.51 7.01 Init - 221720 221649 72 2 0 60 50 83 0.387 2.82 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 80107 79931 177 2 0 73 115 45 0.825 4.67 S.002 Sngl - 100127 99630 498 2 0 69 44 291 0.981 16.59 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:138915982_139145471|GENSCAN_predicted_peptide_1|265_aa MASCFFKASKGEKEAPTRWALWVTPSLQLQAATIRPVETKGSIKDTGNGKSSSVCDECQL LLHLLSSSAKALGIAHLHPGEVNSFIAHTNPAWWSFHTDVHEIWCRDSDRGTSLGRSILC PPVLCLVRKIHLRPQVLRPTSPRNISPILNQRQRRQVLSVDPKLRCRSRTREGSLPLVFN HCRDASLIIHPGFRGVRPRRDTCLGPSPLAASPAFLGKGQNPPREHEAKRKCWCTVTYEN QNKASQNTVKVMAEPRHPLLTSISE >gi568815594f:138915982_139145471|GENSCAN_predicted_CDS_1|798_bp atggcctcttgcttcttcaaagctagcaagggagaaaaagaggctcccacaagatgggcg ctctgggtgaccccatccctacagctccaggcagccaccatccggcctgttgaaaccaag ggatctattaaggacacaggaaatggaaagagcagctcagtgtgtgatgaatgtcagtta ctattacatttattaagctcttctgcaaaggccttgggtatagcccacctgcacccaggt gaagtaaacagctttattgctcacacaaatcctgcttggtggtcttttcacacagacgtg catgaaatttggtgccgtgactcggatcgggggacctcccttgggagatcaatcctctgt cctcctgttctttgcttggtgagaaagatccacctacgacctcaggtcctcagaccaacc agcccaagaaacatctcaccaattttaaatcagagacaaaggagacaggttttatctgtg gacccaaaactccggtgccggtcacggactagggaaggcagccttcccttggtgtttaat cattgcagggacgcctctctgattattcacccaggtttcagaggtgtcagaccacgcagg gacacctgccttggtccttcacccttagcggcaagtcccgcttttctggggaaggggcaa aacccaccaagggagcatgaagccaagcggaaatgctggtgtacagtcacctatgagaac caaaataaggccagccagaacactgtcaaggtcatggcagagcccaggcatcccttgtta acatccatctctgagtga >gi568815594f:138915982_139145471|GENSCAN_predicted_peptide_2|74_aa MEDWGGGSLNTESSLEGYACGSGPPVMVTEHLWGIPENIHEEEARERTGLPVTQALSTES IKTQARYSNVEHLY >gi568815594f:138915982_139145471|GENSCAN_predicted_CDS_2|225_bp atggaggattggggtggtggatctctcaatacggagtcttctcttgagggttatgcatgc ggttctggaccaccagtcatggtgacagaacacctctgggggattccagaaaatatacat gaggaggaagccagggaaaggactggacttcctgttactcaagcactgagtactgagagc atcaagactcaggctcgttattcaaatgtggagcacttatactag >gi568815594f:138915982_139145471|GENSCAN_predicted_peptide_3|130_aa MKEKKKRKKREEEEEEGGGGGTTASGSFSPVVLDKLLPRKLRQQNTPKKAEGLRAYLTEK TVASRASQSAKPGVRVVSSLQVSPAGRRPSGTLENSQDEVEVPLSGYNPNPPKGTIVFSS PPCKTKNVTT >gi568815594f:138915982_139145471|GENSCAN_predicted_CDS_3|393_bp atgaaggagaagaagaagaggaagaagagggaggaggaggaggaagaaggaggaggagga ggcactacagccagcggtagcttctccccagttgttctagacaagctccttccaaggaag ctgagacagcaaaacactcctaagaaggctgaagggctgagagcctaccttacagagaaa acagtggctagcagagcttcacaaagcgccaaaccaggagttcgggtcgtgtcgtctttg caggtgtcgccagctggaagaagacctagtggcactcttgaaaactcacaggatgaagtt gaagttcctttatctggctacaatccgaacccaccaaaaggaacaattgttttttcttcc cctccttgtaagaccaaaaatgtaaccacctga >gi568815594f:138915982_139145471|GENSCAN_predicted_peptide_4|431_aa MFHSPRRLCSALLQRDAPGLRRLPAPGLRRPLSPPAAVPRPASPRLLAAASAASGAARSC SRTVCSMGTGTSRLYSALAKTLNSSAASQHPEYLVSPDPEHLEPIDPKELLEECRAVLHT RPPRFQRDFVDLRTDCPSTHPPIRVMQWNILAQALGEGKDNFVQCPVEALKWEERKCLIL EEILAYQPDILCLQEVDHYFDTFQPLLSRLGYQGTFFPKPWSPCLDVEHNNGPDGCALFF LQNRFKLVNSANIRLTAMTLKTNQVAIAQTLECKESGRQFCIAVTHLKARTGWERFRSAQ GCDLLQNLQNITQGAKIPLIVCGDFNAEPTEEVYKHFASSSLNLNSAYKLLSADGQSEPP YTTWKIRTSGECRHTLDYIWYSKHALNVRSALDLLTEEQIGPNRLPSFNYPSDHLSLVCD FSFTEESDGLS >gi568815594f:138915982_139145471|GENSCAN_predicted_CDS_4|1296_bp atgtttcatagtccgcggcggctctgctcggccctgctgcagagggacgcgcccggcctg cgccgcctgcccgccccagggctgcgccgcccgttgtccccgccggctgctgttcccagg cccgcatccccccggctgctggcggcggcctcggcggcctcgggcgccgcgaggtcgtgt tcccgaacagtgtgttccatgggaaccggtacaagcagactctatagtgctctcgccaag acactgaacagcagcgctgcctcccagcacccagagtatttggtgtcacctgacccagag catctggagcccattgatcctaaagagcttcttgaggaatgcagggccgtcctgcacacc cgacctccccggttccagagggattttgtggatctgaggacagattgccctagtacccac ccacctatcagggttatgcaatggaacatcctcgcccaagctcttggagaaggcaaagac aactttgtacagtgccctgttgaagcactcaaatgggaagaaaggaaatgtctcatcctg gaagaaatcctggcctaccagcctgatatattgtgcctccaagaggtggaccactatttt gacaccttccagccactcctcagtagactaggctatcaaggcacgtttttccccaaaccc tggtcaccttgtctagatgtagaacacaacaatggaccagatggttgtgccttatttttt cttcaaaaccgattcaagctagtcaacagtgccaatattaggctgacagccatgacattg aaaaccaaccaggtggccattgcacagaccctggagtgcaaggagtcaggccgacagttc tgcatcgctgttacccatctaaaagcacgcactggctgggagcggtttcgatcagctcaa ggctgtgacctccttcagaacctgcaaaacatcacccaaggagccaagattccccttatt gtgtgtggggacttcaatgcagagccaacagaagaggtctacaaacactttgcttcctcc agcctcaacctgaacagcgcctacaagctgctgagtgctgatgggcagtcagaaccccca tacactacctggaagatccggacctcaggggagtgcaggcacaccctggattacatctgg tattctaaacatgctctaaatgtaaggtcagctctcgatctgctcactgaagaacagatt ggacccaacaggttaccttccttcaattatccttcagaccacctgtctctagtgtgtgac ttcagctttactgaggaatctgatggactttcataa >gi568815594f:138915982_139145471|GENSCAN_predicted_peptide_5|537_aa MVNVSNDLSVVRRVLDLLADNFIVEASVHSSNAHCTDKTIEAAEALLHMESPTCLRDSRS PVEVFVPPCVSTPEFIHAAMRPDVITETVVEVSTEESEPMDTSPIPTSPDSHEPMKKKKV GRKPKTQQSPISNGSPELGIKKKPREGKGNTTYLWEFLLDLLQDKNTCPRYIKWTQREKG IFKLVDSKAVSKLWGKHKNKPDMNYETMGRALRYYYQRGILAKVEGQRLVYQFKDMPKNI VVIDDDKSETCNEDLAGTTDEKSLERVSLSAESLLKAASSVRSGKNSSPINCSRAEKGVA RVVNITSPGHDASSRSPTTTASVSATAAPRTVRVAMQVPVVMTSLGQKISTVAVQSVNAG APLITSTSPTTATSPKVVIQTIPTVMPASTENGDKITMQPAKIITIPATQLAQCQLQTKS NLTGSGSINIVGTPLAVRALTPVSIAHGTPVMRLSMPTQQASGQTPPRVISAVIKGPEVK SEAVAKKQEHDVKTLQLVEEKPADGNKTVTHVVVVSAPSAIALPVTMKTEGLVTCEK >gi568815594f:138915982_139145471|GENSCAN_predicted_CDS_5|1614_bp atggtaaatgtcagcaacgatttgtcagttgtacgtagagtattagatcttctggctgat aacttcatagtggaagcatcagttcacagcagtaatgcacactgtacagataagacaatt gaagctgctgaagccctgcttcatatggaatctcctacctgcttgagggattcaagaagt cctgtggaagtgtttgttcctccttgtgtatcaactccagaattcatccatgctgctatg aggccagatgtcattacagaaactgtagtggaggtgtcaactgaagagtctgaacccatg gatacctctcctattccaacatcaccagatagccatgaaccaatgaaaaagaaaaaagtt ggccgtaaaccaaagacccagcaatcaccaatttccaatgggtctcctgagttaggtata aagaagaaaccaagagaaggaaaaggaaacacaacctatttgtgggagtttcttttagat ctacttcaagataaaaatacttgtcccaggtatattaaatggactcagagagaaaaaggc atattcaagctggtggattcaaaggctgtctctaagctttggggaaagcataagaacaaa ccagacatgaactatgaaaccatgggacgagctttgagatactactaccaaaggggaatt cttgcaaaggttgaaggacagaggcttgtatatcagttcaaggatatgccgaaaaacata gtggtcatagatgatgacaaaagtgaaacctgtaatgaagatttagcaggaactactgat gaaaaatcattagaacgagtgtcactgtctgcagaaagtctcctgaaagcagcatcctct gttcgcagtggaaaaaattcatcccctataaactgctccagagcagagaagggtgtagct agagttgtgaatatcacttcccctgggcacgatgcttcatccaggtctcctactaccact gcatctgtgtcagcaacagcagctccaaggacagttcgtgtggcaatgcaggtacctgtt gtaatgacatcattgggtcagaaaatttcaactgtggcagttcagtcagttaatgcaggt gcaccattaataaccagcactagtccaacaacagcgacctctccaaaggtagtcattcag acaatccctactgtgatgccagcttctactgaaaatggagacaaaatcaccatgcagcct gccaaaattattaccatcccagctacacagcttgcacagtgtcaactgcagacaaagtca aatctgactggatcaggaagcattaacattgttggaaccccattggctgtgagagcactt acccctgtttcaatagcccatggtacacctgtaatgagactatcaatgcctactcagcag gcatctggccagactcctcctcgagttatcagtgcagtcataaaggggccagaggttaaa tcggaagcagtggcaaaaaagcaagaacatgatgtgaaaactttgcagctagtagaagaa aaaccagcagatggaaataagacagtgacccacgtagtggttgtcagtgcgccttcagct attgcccttcctgtaactatgaaaacagaaggactagtgacatgtgagaaataa >gi568815594f:138915982_139145471|GENSCAN_predicted_peptide_6|227_aa MRKSWNCGRSLKGEPSGGGERSPEKPAAPERGQQRPLRPDSVAGAAERQQPHSLPGSSAT GVKRTTRPPPSAQLNRGSVLQHPLPALSPAAAAAARRTHPEATAPDSRQVTLQGDHFLIR RLHYFIHSPLSPSRPSDCEVDTVPPAQTATGESPRLLCTPSPAAPMHGRKVPLSLPARMS RSSWFVGPSCRDVAMFIPGLELLLHIDLHLVFTHFYTTVKCDCLPKV >gi568815594f:138915982_139145471|GENSCAN_predicted_CDS_6|684_bp atgcggaagtcatggaattgtggacgaagtctgaagggagaaccgagcgggggtggcgag cgttcgccggagaagcccgccgcccccgaacgcgggcagcagcggcctctccggccagac agcgtggcaggcgcagccgagagacaacagcctcactcacttcctggttcctcagcaact ggggtgaagcgcacaactcgtccgcccccgagtgcccaacttaaccgcggctccgttctc cagcacccgctgccagcgctcagcccggcagctgctgctgctgctcgccgaacccaccct gaagcgacagcgccggattcgaggcaggttactctccaaggtgatcacttcctgattcgt cgcctccattacttcattcactcccccctttcccccagccgccccagtgactgtgaggtg gacactgtacccccagcacagactgctacaggggagtcaccccgccttctgtgcaccccc tctcctgctgcaccgatgcacgggagaaaagttcccctgtccttaccggcccggatgagc agatccagctggttcgtgggtccctcatgcagagacgtcgccatgtttatcccggggctg gagctgctgcttcacattgacttacaccttgtcttcacccacttctacacaaccgttaaa tgtgactgtttgcctaaagtatga >gi568815594f:138915982_139145471|GENSCAN_predicted_peptide_7|206_aa MTSAVVDSGGTILELSSNGVENQEESEKVSEYPAVIVEPVPSARLEQGYAAQVLVYDDET YMMQDVAEEQEVETENVETVAAQDHASTFRAPPGTAGEGPGGADDEGPVRRQGKVTVKYD RKELRKRLNLEEWILEQLTRLYDCQEEEIPELEIDVDELLDMESDDARAATVKELLVDCY KPTEAFISGLLDKIRGMQKLSTPQKK >gi568815594f:138915982_139145471|GENSCAN_predicted_CDS_7|621_bp atgacatcagcagtggttgacagtggaggtactattttggagctttccagcaatggagta gaaaatcaagaggaaagtgaaaaggtttctgaatatccagcagtgattgtggagccagtt ccaagtgccagattagagcagggctatgcagcccaggttctggtttatgatgatgagact tatatgatgcaagatgtggcagaagaacaagaagttgagaccgagaatgtggaaacagtg gcggcccaggaccacgcgtctactttcagagccccccccgggaccgcaggagagggcccg ggcggcgcggacgatgagggcccagtgaggcgccaagggaaggtcaccgtcaagtatgac cgcaaggagctacggaagcgcctcaacctagaggagtggatcctggagcagctcacgcgc ctctacgactgccaggaagaggagatcccagaactggagattgacgtggatgagctcctg gacatggagagtgacgatgcccgggctgccacggtcaaggagctgctggttgactgttac aaacccacagaggccttcatttctggtctgctggacaagatccggggcatgcagaagctg agcacaccccagaagaagtga