GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:17:51 Sequence gi568815592r:107605203_107849795 : 244593 bp : 44.08% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2631 2712 82 1 1 63 105 28 0.096 0.70 1.02 Intr + 6068 6213 146 2 2 58 46 101 0.343 2.83 1.03 Term + 11446 11618 173 2 2 61 42 87 0.238 -0.61 1.04 PlyA + 13309 13314 6 1.05 2.00 Prom + 24583 24622 40 -4.16 2.01 Sngl + 28420 30264 1845 0 0 69 42 2307 0.752 218.33 2.02 PlyA + 32641 32646 6 1.05 3.03 PlyA - 32999 32994 6 1.05 3.02 Term - 35611 35571 41 2 2 89 48 77 0.813 1.15 3.01 Init - 36722 36650 73 2 1 86 110 21 0.805 3.44 3.00 Prom - 39428 39389 40 -3.96 4.06 PlyA - 40149 40144 6 1.05 4.05 Term - 41574 41231 344 1 2 85 48 165 0.992 6.87 4.04 Intr - 42454 42275 180 2 0 98 70 76 0.978 6.64 4.03 Intr - 48284 48149 136 2 1 73 68 87 0.668 5.34 4.02 Intr - 54254 54158 97 1 1 71 7 83 0.304 -1.49 4.01 Init - 55189 55116 74 1 2 68 71 64 0.275 1.27 4.00 Prom - 58135 58096 40 -3.76 5.00 Prom + 60896 60935 40 -3.96 5.01 Init + 63944 64062 119 1 2 82 9 94 0.535 0.57 5.02 Intr + 70522 70589 68 1 2 112 49 106 0.794 7.65 5.03 Intr + 71035 71159 125 2 2 38 101 62 0.619 2.80 5.04 Intr + 74914 74931 18 0 0 119 115 -9 0.327 1.71 5.05 Intr + 75216 75308 93 2 0 67 75 42 0.186 0.96 5.06 Intr + 83842 83947 106 0 1 64 59 69 0.616 1.39 5.07 Intr + 86473 86625 153 2 0 61 93 80 0.823 5.84 5.08 Intr + 93722 93826 105 0 0 103 56 47 0.706 3.19 5.09 Term + 94627 94790 164 2 2 19 49 148 0.666 2.10 5.10 PlyA + 95428 95433 6 -0.45 6.07 PlyA - 96978 96973 6 1.05 6.06 Term - 100123 99998 126 1 0 59 38 199 0.998 10.28 6.05 Intr - 102809 102664 146 0 2 74 34 145 0.402 7.70 6.04 Intr - 115791 115501 291 1 0 44 40 254 0.059 13.31 6.03 Intr - 139941 139747 195 1 0 85 96 205 0.998 20.39 6.02 Intr - 141687 141487 201 1 0 95 110 386 0.844 40.76 6.01 Init - 144782 144482 301 2 1 57 96 156 0.520 10.71 6.00 Prom - 145463 145424 40 -6.06 7.00 Prom + 145818 145857 40 -7.06 7.01 Init + 147194 147349 156 1 0 81 110 78 0.647 9.21 7.02 Term + 149506 149532 27 0 0 51 41 95 0.915 -0.83 7.03 PlyA + 150231 150236 6 1.05 8.03 PlyA - 151413 151408 6 1.05 8.02 Term - 166425 166352 74 1 2 55 43 107 0.408 0.97 8.01 Init - 191108 190988 121 2 1 56 91 64 0.603 3.85 8.00 Prom - 195887 195848 40 -3.16 9.00 Prom + 198955 198994 40 -6.26 9.01 Init + 204247 204382 136 0 1 105 74 58 0.327 6.41 9.02 Intr + 211240 211368 129 2 0 34 97 120 0.263 8.17 9.03 Intr + 225082 225223 142 2 1 37 95 67 0.205 1.71 9.04 Term + 233146 233293 148 1 1 119 37 68 0.095 2.17 9.05 PlyA + 233317 233322 6 1.05 10.05 PlyA - 233666 233661 6 1.05 10.04 Term - 235952 235894 59 0 2 95 44 32 0.127 -2.65 10.03 Intr - 240746 240620 127 2 1 94 83 26 0.422 2.95 10.02 Intr - 243283 243170 114 1 0 92 41 70 0.643 3.34 10.01 Intr - 243535 243440 96 1 0 33 96 73 0.789 2.81 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:107605203_107849795|GENSCAN_predicted_peptide_1|133_aa XMRHTVSIQNLNGCWSYGPLYGASSREREGKKEGAPVEPKNSNLSGRKEKASNSNKNFLQ MWNLKESRESKPIFSRRNKILKLKANRRGFYGALCEGPSVDSGLMITLTAETVGAVISEA ADTHSDGWPSPLI >gi568815592r:107605203_107849795|GENSCAN_predicted_CDS_1|402_bp ngtatgagacacactgtgtcaatccagaatcttaatggctgctggagctatgggcctctt tacggggccagctctagagagagagaggggaagaaagagggggccccggtggagccaaag aacagtaatctttcaggaagaaaggaaaaggccagcaacagcaacaaaaacttcctgcag atgtggaatttgaaggaatccagagaaagcaagcctatttttagcaggaggaataaaatt cttaaattgaaagcaaatcggagagggttctatggcgctctctgcgaggggccgagcgtg gattcgggtctgatgattactttgactgcagaaacagtgggggctgtaatctccgaagca gcggacactcacagcgacggttggcccagtcctttgatttag >gi568815592r:107605203_107849795|GENSCAN_predicted_peptide_2|614_aa MDIFYKETQANLPAGLCSTLHPPMENKAEGTGVQLLTPDSWNIPLTDARRKAPSPVATAG QSQGPGPSASTTVSPSDTANCSVTKIPTPVPKSIPISETPNIPPVSVQPPASIGPPLGVP PRSPPMVMTNRGPVPLPIFMEQQIMQQIRPPFIRGPPHHASNPNSPLSNPMLPGIGPPPG GPRNLGPTSSPMHRPMLSPHIHPPSTPTMPGNPPGLLPPPPPGAPLPSLPFPPVSMMPNG PMPVPQMMNFGLPSLAPLVPPPTLLVPYPVIVPLPVPIPIPIPIPHVSDSKPPNGFSSNG ENFIPNAPGDSAAAGGKPSGHSLSPRDSKQGSSKSADSPPGCSGQALSLAPTPAEHGRSE VVDLTRRAGSPPGPPGAGGQLGFPGVLQGPQDGVIDLTVGHRARLHNVIHRALHAHVKAE REPSAAERRTCGGCRDGHCSPPAAGDPGPGAPAGPEAAAACNVIVNGTRGAAAEGAKSAE PPPEQPPPPPPPAPPKKLLSPEEPAVSELESVKENNCASNCHLDGEAAKKLMGEEALAGG DKSDPNLNNPADEDHAYALRMLPKTGCVIQPVPKPAEKAAMAPCIISSPMLSAGPEDLEP PLKRRCLRIRNQNK >gi568815592r:107605203_107849795|GENSCAN_predicted_CDS_2|1845_bp atggacattttctacaaagagacccaggccaatcttccagctgggctgtgcagcacatta caccctcccatggaaaataaagcagaaggcaccggggtgcagctgctcactccagactct tggaatatcccgctaacagatgctcggaggaaggccccctccccggtggctacagctggc caaagccagggccctggcccgtcggcgtccaccaccgtctctccatctgacactgccaac tgctctgtcactaaaatccccacgccagtgcccaagtccatccccatcagcgagactcca aatatccctcctgtctccgtccagccacctgctagcatcgggcctccccttggcgtcccg cctcggagccctcccatggtgatgaccaaccgcggcccggtgccgctgcccatcttcatg gagcagcagatcatgcagcagatccgcccgcccttcatccgcgggcctccgcaccatgcc tccaaccccaacagccccctgtccaaccccatgcttcccggcatcgggcccccgcccggt ggccccagaaacctgggccccacttccagccccatgcaccggcccatgctatcgccccac atccaccccccgagcacccccaccatgcccgggaaccccccaggcctgctgcccccgccg cctccgggcgccccgctgccgagtcttcccttcccgccagtgagcatgatgccaaatggc ccgatgccggtgccccagatgatgaatttcgggctgccgtcgcttgccccgctggtgccg cccccgaccctgctcgtgccgtaccccgtgatcgtgcccctaccggtgcccatccccatc cccatccctatccctcacgtcagcgactccaagccccccaacgggttctccagcaacggg gagaacttcattccgaacgcccctggcgactccgcggcggcgggcggcaagccaagcgga cactccctgtccccccgggactccaagcagggctcgtccaagtccgcggactcgcccccc ggctgctcgggccaggccctgagcctggcgcccacgcccgccgagcatggccggagcgag gtggtggacctgacgcggcgcgccggcagccccccgggccccccgggcgcgggcggccag ctcggcttcccaggcgtgctgcagggcccgcaggacggcgtcatcgacctgaccgtgggc caccgagcccggctgcacaacgtgatccaccgcgcgctgcacgcgcacgtcaaggcggag cgcgagccgagcgccgcggagcgcaggacctgcggcggctgcagggacggccactgcagc ccgcccgccgccggcgacccaggcccgggcgccccggcgggccccgaggcggccgcggcc tgcaacgtcatcgtgaacggcacgcgcggcgccgccgccgagggcgctaagagcgcggag ccgcctcccgagcagccgccgccgccgccgccgcccgcgccccccaagaagctgctgtcg cctgaggaaccggcggtgagcgagctagagtcggtcaaggagaataactgtgcttccaac tgccacctggacggggaggcggccaaaaagctgatgggcgaggaggccctggcggggggc gacaagtcagacccgaaccttaataaccccgcggacgaggaccatgcctatgctctgcgg atgctgcccaagaccggctgcgtgatccagcctgtgccaaaacccgcggagaaggctgcc atggcaccgtgcatcatctcctcgcccatgctcagcgccgggcctgaggacctggagccg ccgctcaaaaggaggtgcctccgaattagaaatcagaataagtaa >gi568815592r:107605203_107849795|GENSCAN_predicted_peptide_3|37_aa MATPVPLARSLRQESLEKAGRSGEVPVLLKMEGYGYI >gi568815592r:107605203_107849795|GENSCAN_predicted_CDS_3|114_bp atggccacccccgtccctttggcacgatctcttagacaggaaagtctggaaaaggcgggg cgttcgggtgaggtcccagtcctgctgaagatggaaggttatgggtacatctga >gi568815592r:107605203_107849795|GENSCAN_predicted_peptide_4|276_aa MGMVRKQALERTEWALVCRQAWSSRMGYSTTVSPTQCQYTCSRGLLDSQLQLYKCTQALP EIHSLGHVCPSLITVKYSHYVLAAATLFIWDNNESLRTGLLSVSTCKSGTRDFMRISGMV SLTSEVDNVEVFHYFPLHIQGKKFWAVGAIMTIFVISIYYEPGAHWHHMWKLLKVYSLES QKREEDISKRLPNFLKSSRRPWEDLILMLRKSPEMSKEYRASPLSLSAALRSKDRQSRHR CVTSTFHIREVRCCKSQGLPLCMHETAAHICPSLLA >gi568815592r:107605203_107849795|GENSCAN_predicted_CDS_4|831_bp atgggcatggttaggaagcaggccctggaacggacggagtgggctcttgtctgcagacaa gcgtggtctagcagaatgggatactccacaactgtctcaccaactcagtgccagtacaca tgctctagaggacttctggactcgcagctacaactgtacaagtgcacacaagccctacca gagatccactcactcgggcacgtctgcccatcacttatcacagttaaatacagccattac gtgctggctgcggcaacgctgtttatctgggacaataacgagtccctgaggactggattg ctctcagtctcgacttgtaaaagtggaaccagagattttatgagaatttctggaatggtt tccctaacctctgaggttgacaacgtggaagtgtttcattattttccattacacatccaa gggaagaagttctgggctgtaggggctatcatgacaatttttgtaataagcatctactac gagccaggagcccactggcaccacatgtggaagcttctgaaggtttacagcctggaaagt caaaaaagagaagaggacatatccaaaaggttgccaaactttcttaaaagtagccgcagg ccatgggaggatctgattctgatgctgagaaaatctccggagatgtctaaggaatataga gcaagccccctgagcctttctgctgcccttcggtctaaggacagacagtctaggcatcgc tgtgtaacctccacattccacattcgtgaagtccgctgctgcaagagccagggcctgcct ctgtgcatgcatgaaacagcagcccacatctgcccttccctgctggcttaa >gi568815592r:107605203_107849795|GENSCAN_predicted_peptide_5|316_aa MWHPDRGSWHRNEQEGKDGYRGTNLEVPCCSTATPTSTPLWVKLPPPDTSTKPELQLEEL DADSSWMPLHGSCPTGKLAVDARSSLFTEPAFPYWKKMKHTFQNILSWSKFTNLTYSVTQ LWLLVVMAQTSYPHSLVGRKQVILHSSCLYSLDNKTSLAGDDHDDDDHDDNDSSPAEKGP PPITKSLDKADEQSHLRLAVLELGFYKPVNLLLVAMTGCLLQLMMKKEQCGTHGGEGSTL EGSRPDVPQDLFSPTLGFYGSTSLSADLGENVILKLYLPSFPLQESEPSAEHGENLWALY VGFEERRSEASLAKAP >gi568815592r:107605203_107849795|GENSCAN_predicted_CDS_5|951_bp atgtggcaccccgaccgtggttcctggcacaggaacgagcaggagggtaaggacggctac cgtggaaccaacttggaagtgccgtgctgctcaacagccacccccacctccacaccgctt tgggtaaaattgccgccgccggacacctcaaccaagcctgaactgcagcttgaagagctg gatgcagattcttcctggatgcctttgcatggctcgtgtccaactggaaagctggcagtg gatgctcgctcttccctcttcactgagcccgcttttccatactggaagaagatgaaacac acatttcagaatatcctgagctggagcaagtttactaatctcacctacagcgtgacacag ctgtggctgttggtagtgatggctcaaaccagctatcctcattcactggtgggcaggaag caggtaattctacactcatcttgcctgtactctctggataataaaacaagtttagccggt gatgatcatgatgacgatgaccatgatgataatgattcatcacctgcagagaagggccct ccccccatcactaaatcactggacaaggctgatgagcagagccacctgcggttggcagta cttgagttaggcttttacaagcctgtgaatttactcttggtggccatgacagggtgtctt ctgcagctgatgatgaaaaaagagcaatgtggaacccacgggggagaaggttctactctg gaggggagccgtccagatgtcccccaagacctcttttccccaactctaggattctacggt tctacatcactctcggccgacctgggggagaacgtcattcttaagctctacctgccatcg tttcctctgcaggaaagcgagcccagtgctgagcatggggagaacctctgggctctttat gtaggatttgaagagcgcagatctgaggcctcacttgcgaaagcaccgtga >gi568815592r:107605203_107849795|GENSCAN_predicted_peptide_6|419_aa MNPEHELESAAQAPKLQDGKVWKHDRRAMVNCRLGQNLEECRENCQILIDQMGSLFQIKS RVLMTPLALSPPRSTPEPDLSSIPQDAATVPSLAAPQALTVCLYINKQANAGPYLERKKV QQLPEHFGPERPSAVLQQAVQACIDCAHQQKLVFSLVKQGYGGEMVSVSASFDGKQHLRS LPVVNSIGYVLRFLAKLCRSLLCDDLFSHQPFPRGCSASEKVQEKEEGRMESVKTVTTEE YLVNPVGMNRYSVDTSASTFNHRGSLHPSSSLYCKRQNSGDSHLGGGPAATAGGPRTSPM SSGGPSAPGLRPPASSPKRNTTSLEGNRCASSPSQDAQDARRPRSRNPSAWTVEDVVWFV KDADPQALGPHVELFRKHEIDGNALLLLKSDMVMKYLGLKLGPALKLCYHIDKLKQAKF >gi568815592r:107605203_107849795|GENSCAN_predicted_CDS_6|1260_bp atgaatcctgagcatgaacttgaatctgcagctcaggccccgaaactgcaggatggaaag gtgtggaagcatgatagaagagcaatggttaactgccgtctaggacagaatctggaagag tgtagagagaattgccagattctcattgaccaaatgggctctctcttccagatcaagtct cgggttctcatgactcccttagccctctcacctccgcggagtaccccagagcccgacctc agctccatccctcaggacgcagccacggtccccagcttggcggccccacaggctctcaca gtctgcctctacatcaacaagcaggccaatgcggggccctatctggagaggaagaaggtg cagcagctcccggagcattttgggcccgagcggccatcggcggtgctgcagcaggccgtc caagcctgcatcgactgcgcccaccagcagaagctggtcttctccctggtcaagcagggc tatggtggtgagatggtgtcagtctcggcttcctttgatggcaaacagcacctgcggagc ctgcctgtggtgaacagcatcggctatgtcctccgcttcctcgccaagctgtgccgaagc ctcctgtgcgatgacctcttcagccaccagcccttccccaggggctgcagtgcctctgag aaagtccaggagaaagaggaagggaggatggaatcagtcaagacagtcaccaccgaagag tacctggtgaaccctgtgggcatgaaccgctacagcgtggacacctccgcctccaccttt aaccacaggggctccttgcacccctcctcctcgctgtactgcaagaggcagaactctgga gacagccaccttgggggtggtcctgctgccaccgctggtggtccccgcactagccccatg tcttctggtggcccctcggcacctgggctgaggcctccagcctccagccccaagagaaac acgacctctcttgaaggaaacagatgtgcctcaagcccttctcaggatgcgcaggatgcc aggcggccacggagcaggaacccctccgcctggactgtggaggacgtggtgtggtttgtg aaggacgccgacccacaggctctggggcctcacgtggagctcttcagaaagcacgagatt gatggcaacgctctgctgttgctgaagagtgacatggtcatgaagtacctgggcctgaag ctgggacctgcactgaaactctgctaccacattgacaaactgaagcaagccaagttctga >gi568815592r:107605203_107849795|GENSCAN_predicted_peptide_7|60_aa MNIELVCQTIIMEDIIREVNVDGEEKDPNTEPCKIPTRRAWRNEKGASKRNQKNDDDDDD >gi568815592r:107605203_107849795|GENSCAN_predicted_CDS_7|183_bp atgaatatagagttggtatgtcagaccataataatggaagatatcatcagagaagtgaat gtagatggagaagagaaggatccaaacactgagccctgcaagattccaacacgaagagcc tggagaaatgagaaaggagcaagcaaaagaaatcagaaaaatgacgatgatgatgatgac taa >gi568815592r:107605203_107849795|GENSCAN_predicted_peptide_8|64_aa MEQLALSYIAGSENGITIGKLAVLYKVKHNLPYEPTIPLPGADVEFGYKEAYQAKLHSCD PYKN >gi568815592r:107605203_107849795|GENSCAN_predicted_CDS_8|195_bp atggagcaactggccctctcttacattgctgggagtgaaaatggtataaccattggaaaa ttggcagttctttataaagttaaacataacctaccctacgaaccaacaattccactccca ggtgctgatgtcgagtttggctacaaggaggcatatcaggcaaaattacattcctgtgac ccttataagaattag >gi568815592r:107605203_107849795|GENSCAN_predicted_peptide_9|184_aa MVVPVKGANESLRKGHQGVDGERENAENFFESNIHQDLFPNWKAWGHTGGHENGFSELES VRFHLDGFCAALKTQVGSQRRKKLLEKKDRAFSLCGSGPVTLADTQGLTEWKHFSQGLAV LVEAQPRDPYPRGSVWQKTPGIAKETRCRALDGAFVKLKHRNLSRIDRRVVRPDFRPPPL SPKA >gi568815592r:107605203_107849795|GENSCAN_predicted_CDS_9|555_bp atggtggtccctgtgaagggtgctaatgagagtctgaggaagggccaccaaggggtagat ggagaaagggagaatgcagaaaacttttttgaaagtaacatccatcaggacctgtttccc aattggaaggcatggggtcacactggtggccatgaaaatggcttttctgaactggaatcc gtgcggttccatttagatggattctgtgcagctctgaagacacaagtaggctctcagagg aggaagaagctactggagaagaaagacagggccttctctctttgtggctctgggcctgtg accctggctgatactcaaggactgactgagtggaaacatttttctcaaggtctggcagtt ttggtggaagcacagcccagagacccttatcccaggggcagcgtctggcagaaaacacca gggattgcaaaggaaaccagatgccgagctctggatggggctttcgtgaagctaaagcac aggaacttgagccgcattgacaggcgcgttgtcaggccagacttccggcctcccccattg tccccgaaggcttga >gi568815592r:107605203_107849795|GENSCAN_predicted_peptide_10|131_aa RGVGKGVLRSDGGSPLGTSTGGFVLPPEFGEQALSPRSPDSSGSRRNERAASPSVATRGR EVADRGPARLVGTGPSLEHSPQICPTRVAPEFNGEPGICRGKKQGPSIWKIRAVYKLLLL VNTNDPSFYIV >gi568815592r:107605203_107849795|GENSCAN_predicted_CDS_10|396_bp aggggagtcggcaaaggtgttctcaggagcgacgggggatctcccctggggacgtcaacc ggaggcttcgttctgccaccagaatttggtgagcaggcgctcagcccccggagtccggat tccagcggcagcaggagaaatgagcgggcggcttctccctctgtggccaccagggggcgc gaggtggccgaccggggcccggccaggctggtggggacaggaccttcactggaacattcc ccacagatttgcccaacacgtgtggcacctgagttcaatggggagccaggcatttgtcga ggaaagaaacaagggcccagcatctggaagatacgagcagtttataagttactgctgttg gtcaataccaatgacccttcattttatattgtatga