GENSCAN 1.0 Date run: 8-Nov-116 Time: 05:13:06 Sequence gi568815594r:138958986_139184190 : 225205 bp : 41.25% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2256 2277 22 2 1 92 63 34 0.142 1.39 1.02 Intr + 14688 14770 83 1 2 34 78 90 0.197 1.04 1.03 Intr + 33444 33558 115 2 1 12 34 216 0.008 7.70 1.04 Intr + 37327 37387 61 2 1 50 87 56 0.002 -1.43 1.05 Intr + 40622 40785 164 2 2 7 74 147 0.009 3.90 1.06 Term + 54800 54906 107 0 2 69 47 105 0.039 2.09 1.07 PlyA + 55095 55100 6 1.05 2.00 Prom + 55193 55232 40 -6.95 2.01 Init + 56997 57186 190 2 1 59 109 155 0.002 11.92 2.02 Intr + 84089 84358 270 0 0 79 91 239 0.764 19.89 2.03 Term + 85654 86489 836 2 2 80 36 638 0.998 49.96 2.04 PlyA + 87164 87169 6 1.05 3.08 PlyA - 87289 87284 6 1.05 3.07 Term - 100622 99998 625 1 1 89 38 524 0.979 40.37 3.06 Intr - 101689 101339 351 0 0 115 115 348 0.999 33.71 3.05 Intr - 103072 102880 193 2 1 82 83 78 0.924 4.33 3.04 Intr - 108785 108699 87 0 0 61 86 51 0.669 1.22 3.03 Intr - 113054 112881 174 0 0 76 75 164 0.884 12.89 3.02 Intr - 114582 114469 114 1 0 69 84 99 0.967 7.10 3.01 Init - 118072 118003 70 1 1 55 59 100 0.927 5.06 3.00 Prom - 123066 123027 40 -6.25 4.00 Prom + 123525 123564 40 -8.55 4.01 Init + 124622 125249 628 1 1 45 58 433 0.957 31.55 4.02 Term + 129618 129673 56 1 2 120 42 60 0.466 1.34 4.03 PlyA + 130171 130176 6 1.05 5.04 PlyA - 130237 130232 6 1.05 5.03 Term - 156327 155945 383 1 2 35 55 557 0.482 41.12 5.02 Intr - 166344 166179 166 0 1 70 74 261 0.997 21.51 5.01 Init - 168133 168128 6 1 0 50 121 10 0.780 0.86 5.00 Prom - 172438 172399 40 -5.65 6.00 Prom + 174836 174875 40 -6.05 6.01 Init + 185089 185167 79 0 1 71 98 69 0.558 7.47 6.02 Intr + 186201 186341 141 1 0 31 46 119 0.049 1.40 6.03 Intr + 217124 217363 240 0 0 115 121 183 0.977 20.90 6.04 Intr + 218564 218670 107 0 2 91 39 39 0.841 -1.69 6.05 Intr + 218801 218928 128 1 2 54 81 70 0.860 1.46 6.06 Intr + 219087 219487 401 0 2 60 -12 340 0.381 14.42 6.07 Term + 219656 219792 137 0 2 102 44 101 0.544 4.30 6.08 PlyA + 220792 220797 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 37103 36927 177 0 0 73 115 45 0.825 4.67 S.002 Sngl - 57123 56626 498 0 0 69 44 291 0.981 16.59 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:138958986_139184190|GENSCAN_predicted_peptide_1|183_aa MTPVPRKSHILVLLVRSVHDLIAVHGYFPDNRSPEEKEEERRRRKEGEEGEEEMKEKKKR KKREEEEEEGGGGGTTASGSFSPVVLDKLLPRKLRQQNTPKKAEGLRAYLTEKTVASRAS QSAKPGVRVVSSLQVSPAGRRPSGTLENSQDEVEVPLSGYNPNPPKGTIVFSSPPCKTKN VTT >gi568815594r:138958986_139184190|GENSCAN_predicted_CDS_1|552_bp atgaccccggtccctcgcaagagtcacatcctggtgttgctagtcagatcggtgcacgat cttattgctgttcatggttatttccctgataatcgctccccagaggaaaaagaagaagaa agaagaagaagaaaagaaggggaagaaggggaggaggagatgaaggagaagaagaagagg aagaagagggaggaggaggaggaagaaggaggaggaggaggcactacagccagcggtagc ttctccccagttgttctagacaagctccttccaaggaagctgagacagcaaaacactcct aagaaggctgaagggctgagagcctaccttacagagaaaacagtggctagcagagcttca caaagcgccaaaccaggagttcgggtcgtgtcgtctttgcaggtgtcgccagctggaaga agacctagtggcactcttgaaaactcacaggatgaagttgaagttcctttatctggctac aatccgaacccaccaaaaggaacaattgttttttcttcccctccttgtaagaccaaaaat gtaaccacctga >gi568815594r:138958986_139184190|GENSCAN_predicted_peptide_2|431_aa MFHSPRRLCSALLQRDAPGLRRLPAPGLRRPLSPPAAVPRPASPRLLAAASAASGAARSC SRTVCSMGTGTSRLYSALAKTLNSSAASQHPEYLVSPDPEHLEPIDPKELLEECRAVLHT RPPRFQRDFVDLRTDCPSTHPPIRVMQWNILAQALGEGKDNFVQCPVEALKWEERKCLIL EEILAYQPDILCLQEVDHYFDTFQPLLSRLGYQGTFFPKPWSPCLDVEHNNGPDGCALFF LQNRFKLVNSANIRLTAMTLKTNQVAIAQTLECKESGRQFCIAVTHLKARTGWERFRSAQ GCDLLQNLQNITQGAKIPLIVCGDFNAEPTEEVYKHFASSSLNLNSAYKLLSADGQSEPP YTTWKIRTSGECRHTLDYIWYSKHALNVRSALDLLTEEQIGPNRLPSFNYPSDHLSLVCD FSFTEESDGLS >gi568815594r:138958986_139184190|GENSCAN_predicted_CDS_2|1296_bp atgtttcatagtccgcggcggctctgctcggccctgctgcagagggacgcgcccggcctg cgccgcctgcccgccccagggctgcgccgcccgttgtccccgccggctgctgttcccagg cccgcatccccccggctgctggcggcggcctcggcggcctcgggcgccgcgaggtcgtgt tcccgaacagtgtgttccatgggaaccggtacaagcagactctatagtgctctcgccaag acactgaacagcagcgctgcctcccagcacccagagtatttggtgtcacctgacccagag catctggagcccattgatcctaaagagcttcttgaggaatgcagggccgtcctgcacacc cgacctccccggttccagagggattttgtggatctgaggacagattgccctagtacccac ccacctatcagggttatgcaatggaacatcctcgcccaagctcttggagaaggcaaagac aactttgtacagtgccctgttgaagcactcaaatgggaagaaaggaaatgtctcatcctg gaagaaatcctggcctaccagcctgatatattgtgcctccaagaggtggaccactatttt gacaccttccagccactcctcagtagactaggctatcaaggcacgtttttccccaaaccc tggtcaccttgtctagatgtagaacacaacaatggaccagatggttgtgccttatttttt cttcaaaaccgattcaagctagtcaacagtgccaatattaggctgacagccatgacattg aaaaccaaccaggtggccattgcacagaccctggagtgcaaggagtcaggccgacagttc tgcatcgctgttacccatctaaaagcacgcactggctgggagcggtttcgatcagctcaa ggctgtgacctccttcagaacctgcaaaacatcacccaaggagccaagattccccttatt gtgtgtggggacttcaatgcagagccaacagaagaggtctacaaacactttgcttcctcc agcctcaacctgaacagcgcctacaagctgctgagtgctgatgggcagtcagaaccccca tacactacctggaagatccggacctcaggggagtgcaggcacaccctggattacatctgg tattctaaacatgctctaaatgtaaggtcagctctcgatctgctcactgaagaacagatt ggacccaacaggttaccttccttcaattatccttcagaccacctgtctctagtgtgtgac ttcagctttactgaggaatctgatggactttcataa >gi568815594r:138958986_139184190|GENSCAN_predicted_peptide_3|537_aa MVNVSNDLSVVRRVLDLLADNFIVEASVHSSNAHCTDKTIEAAEALLHMESPTCLRDSRS PVEVFVPPCVSTPEFIHAAMRPDVITETVVEVSTEESEPMDTSPIPTSPDSHEPMKKKKV GRKPKTQQSPISNGSPELGIKKKPREGKGNTTYLWEFLLDLLQDKNTCPRYIKWTQREKG IFKLVDSKAVSKLWGKHKNKPDMNYETMGRALRYYYQRGILAKVEGQRLVYQFKDMPKNI VVIDDDKSETCNEDLAGTTDEKSLERVSLSAESLLKAASSVRSGKNSSPINCSRAEKGVA RVVNITSPGHDASSRSPTTTASVSATAAPRTVRVAMQVPVVMTSLGQKISTVAVQSVNAG APLITSTSPTTATSPKVVIQTIPTVMPASTENGDKITMQPAKIITIPATQLAQCQLQTKS NLTGSGSINIVGTPLAVRALTPVSIAHGTPVMRLSMPTQQASGQTPPRVISAVIKGPEVK SEAVAKKQEHDVKTLQLVEEKPADGNKTVTHVVVVSAPSAIALPVTMKTEGLVTCEK >gi568815594r:138958986_139184190|GENSCAN_predicted_CDS_3|1614_bp atggtaaatgtcagcaacgatttgtcagttgtacgtagagtattagatcttctggctgat aacttcatagtggaagcatcagttcacagcagtaatgcacactgtacagataagacaatt gaagctgctgaagccctgcttcatatggaatctcctacctgcttgagggattcaagaagt cctgtggaagtgtttgttcctccttgtgtatcaactccagaattcatccatgctgctatg aggccagatgtcattacagaaactgtagtggaggtgtcaactgaagagtctgaacccatg gatacctctcctattccaacatcaccagatagccatgaaccaatgaaaaagaaaaaagtt ggccgtaaaccaaagacccagcaatcaccaatttccaatgggtctcctgagttaggtata aagaagaaaccaagagaaggaaaaggaaacacaacctatttgtgggagtttcttttagat ctacttcaagataaaaatacttgtcccaggtatattaaatggactcagagagaaaaaggc atattcaagctggtggattcaaaggctgtctctaagctttggggaaagcataagaacaaa ccagacatgaactatgaaaccatgggacgagctttgagatactactaccaaaggggaatt cttgcaaaggttgaaggacagaggcttgtatatcagttcaaggatatgccgaaaaacata gtggtcatagatgatgacaaaagtgaaacctgtaatgaagatttagcaggaactactgat gaaaaatcattagaacgagtgtcactgtctgcagaaagtctcctgaaagcagcatcctct gttcgcagtggaaaaaattcatcccctataaactgctccagagcagagaagggtgtagct agagttgtgaatatcacttcccctgggcacgatgcttcatccaggtctcctactaccact gcatctgtgtcagcaacagcagctccaaggacagttcgtgtggcaatgcaggtacctgtt gtaatgacatcattgggtcagaaaatttcaactgtggcagttcagtcagttaatgcaggt gcaccattaataaccagcactagtccaacaacagcgacctctccaaaggtagtcattcag acaatccctactgtgatgccagcttctactgaaaatggagacaaaatcaccatgcagcct gccaaaattattaccatcccagctacacagcttgcacagtgtcaactgcagacaaagtca aatctgactggatcaggaagcattaacattgttggaaccccattggctgtgagagcactt acccctgtttcaatagcccatggtacacctgtaatgagactatcaatgcctactcagcag gcatctggccagactcctcctcgagttatcagtgcagtcataaaggggccagaggttaaa tcggaagcagtggcaaaaaagcaagaacatgatgtgaaaactttgcagctagtagaagaa aaaccagcagatggaaataagacagtgacccacgtagtggttgtcagtgcgccttcagct attgcccttcctgtaactatgaaaacagaaggactagtgacatgtgagaaataa >gi568815594r:138958986_139184190|GENSCAN_predicted_peptide_4|227_aa MRKSWNCGRSLKGEPSGGGERSPEKPAAPERGQQRPLRPDSVAGAAERQQPHSLPGSSAT GVKRTTRPPPSAQLNRGSVLQHPLPALSPAAAAAARRTHPEATAPDSRQVTLQGDHFLIR RLHYFIHSPLSPSRPSDCEVDTVPPAQTATGESPRLLCTPSPAAPMHGRKVPLSLPARMS RSSWFVGPSCRDVAMFIPGLELLLHIDLHLVFTHFYTTVKCDCLPKV >gi568815594r:138958986_139184190|GENSCAN_predicted_CDS_4|684_bp atgcggaagtcatggaattgtggacgaagtctgaagggagaaccgagcgggggtggcgag cgttcgccggagaagcccgccgcccccgaacgcgggcagcagcggcctctccggccagac agcgtggcaggcgcagccgagagacaacagcctcactcacttcctggttcctcagcaact ggggtgaagcgcacaactcgtccgcccccgagtgcccaacttaaccgcggctccgttctc cagcacccgctgccagcgctcagcccggcagctgctgctgctgctcgccgaacccaccct gaagcgacagcgccggattcgaggcaggttactctccaaggtgatcacttcctgattcgt cgcctccattacttcattcactcccccctttcccccagccgccccagtgactgtgaggtg gacactgtacccccagcacagactgctacaggggagtcaccccgccttctgtgcaccccc tctcctgctgcaccgatgcacgggagaaaagttcccctgtccttaccggcccggatgagc agatccagctggttcgtgggtccctcatgcagagacgtcgccatgtttatcccggggctg gagctgctgcttcacattgacttacaccttgtcttcacccacttctacacaaccgttaaa tgtgactgtttgcctaaagtatga >gi568815594r:138958986_139184190|GENSCAN_predicted_peptide_5|184_aa MKESEKVSEYPAVIVEPVPSARLEQGYAAQVLVYDDETYMMQDVAEEQEVETENVETVAA QDHASTFRAPPGTAGEGPGGADDEGPVRRQGKVTVKYDRKELRKRLNLEEWILEQLTRLY DCQEEEIPELEIDVDELLDMESDDARAATVKELLVDCYKPTEAFISGLLDKIRGMQKLST PQKK >gi568815594r:138958986_139184190|GENSCAN_predicted_CDS_5|555_bp atgaaggaaagtgaaaaggtttctgaatatccagcagtgattgtggagccagttccaagt gccagattagagcagggctatgcagcccaggttctggtttatgatgatgagacttatatg atgcaagatgtggcagaagaacaagaagttgagaccgagaatgtggaaacagtggcggcc caggaccacgcgtctactttcagagccccccccgggaccgcaggagagggcccgggcggc gcggacgatgagggcccagtgaggcgccaagggaaggtcaccgtcaagtatgaccgcaag gagctacggaagcgcctcaacctagaggagtggatcctggagcagctcacgcgcctctac gactgccaggaagaggagatcccagaactggagattgacgtggatgagctcctggacatg gagagtgacgatgcccgggctgccacggtcaaggagctgctggttgactgttacaaaccc acagaggccttcatttctggtctgctggacaagatccggggcatgcagaagctgagcaca ccccagaagaagtga >gi568815594r:138958986_139184190|GENSCAN_predicted_peptide_6|410_aa MQELNGKTETTDPLKEGAGCSLHHEPGCPESVHMTSLLILQPAFKKANALRLFTTKESHN VCHSPVTPMRAGADHPVARSLRTSFARIQKAVTGRACYEPVFVLLKKQIPIRAKPPYLPG NRGRWQHGAGVESENKRTAEYNLSCKLAAPGRKGKDARHLSGSLPRAPRSRSPLQPRIGA GGSAARGRRDSRPSPETAPQSGSLTVTGGRGFAKTPSPVFPVPTPQPWLSERNYHPPNSP AARRLVPAFAGCRVAGPSAQLRSAAGGPLELPGELRALAPSHCRSLRGPQRRSNSSLSVV GARMRECSGVGVFWPKCLRIRKRKPSSGCTLLVSTLSHKGEPSSGALSRAFKCTKANDTV PWVALWLLQPRRDGPGTWRGIHQLISGLQRYPFYGFTTSLEQRRNTEEYK >gi568815594r:138958986_139184190|GENSCAN_predicted_CDS_6|1233_bp atgcaggaacttaatggaaaaacggaaaccacagatcctttgaaagaaggggcaggctgc agtctacaccatgagccaggatgtcctgagtctgtccacatgaccagtttattaatacta caaccagcatttaagaaagccaacgcactacggctatttacaaccaaggaatctcacaac gtatgtcattcccctgttacccctatgagagctggtgctgaccatcccgtcgcgcggtcc ctgaggaccagcttcgcgaggatccagaaggcggtcactggacgtgcctgttacgagccc gtttttgttttgcttaagaaacagattccgatccgagcaaagcctccctacctcccgggg aatagggggagatggcagcacggtgccggcgtggagagcgagaataaaagaactgccgag tacaacctgtcgtgcaagctagcagcgccgggaaggaaaggtaaagacgctcgtcacttg tccggctccttgcccagggctccccgatctcgctctccactccagccccggattggtgca gggggctctgccgcacggggcaggcgtgactctcgcccttctccggagacggccccacag tcgggcagtctgacagttacaggggggcgaggttttgcaaaaactcctagcccggtcttt ccagttcccactcctcagccttggcttagtgaaagaaattaccacccgcccaacagcccg gccgcgcggcggttggtgccggcgtttgctggctgtcgggtggctgggcccagcgcgcag ctgcggagcgcggcgggcggccccttggagctgcccggggagctgcgggcactcgcacct agccattgtcgctccttgcgcggtccgcaacggagaagtaactcaagtctgagtgttgtt ggtgcgagaatgagagagtgttctggtgtgggagtattttggcccaagtgcctgcgaatt cgaaaacgaaaacccagcagtggctgcacgcttctggtttccacactttcacacaaaggg gagccttcgagcggggcactctcacgcgccttcaagtgcacaaaagccaatgacacggtg ccgtgggttgccctgtggctcctgcagcctcgccgcgatggacctggcacctggcgcggt attcaccagctgatttctgggctccagcgttaccctttttacggcttcacaaccagtctg gagcaaagaagaaacactgaggaatataagtga