GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:56:25 Sequence gi568815597r:110503286_110704782 : 201497 bp : 44.75% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4073 4101 29 1 2 99 94 44 0.412 3.40 1.02 Intr + 5466 5587 122 0 2 76 52 68 0.274 2.14 1.03 Term + 6548 6570 23 0 2 110 38 33 0.293 -1.03 1.04 PlyA + 9731 9736 6 -0.45 2.13 PlyA - 10137 10132 6 1.05 2.12 Term - 15503 13967 1537 1 1 99 43 1872 0.069 173.75 2.11 Intr - 27040 26921 120 0 0 76 47 73 0.019 1.61 2.10 Intr - 30171 30111 61 1 1 85 64 67 0.022 1.79 2.09 Intr - 45606 45532 75 1 0 26 95 127 0.159 6.69 2.08 Intr - 46149 46071 79 0 1 45 75 49 0.087 -1.58 2.07 Intr - 47940 47887 54 0 0 97 75 30 0.004 1.78 2.06 Intr - 56307 56203 105 0 0 68 53 61 0.002 1.01 2.05 Intr - 64082 63897 186 2 0 55 47 106 0.075 3.09 2.04 Intr - 69422 69319 104 0 2 4 75 143 0.063 4.49 2.03 Intr - 79069 78967 103 1 1 93 81 55 0.604 5.05 2.02 Intr - 81289 81112 178 0 1 66 75 93 0.800 5.72 2.01 Init - 81446 81418 29 2 2 78 106 8 0.636 0.70 2.00 Prom - 90373 90334 40 -4.86 3.03 PlyA - 91716 91711 6 1.05 3.02 Term - 101437 99998 1440 1 0 87 38 1152 0.562 100.92 3.01 Init - 103993 103679 315 1 0 94 32 172 0.097 9.55 3.00 Prom - 107351 107312 40 -6.46 4.05 PlyA - 108167 108162 6 1.05 4.04 Term - 112459 112321 139 0 1 110 39 70 0.019 1.74 4.03 Intr - 117255 117176 80 0 2 84 94 16 0.004 0.15 4.02 Intr - 125376 125239 138 0 0 66 60 44 0.479 0.06 4.01 Init - 125757 125698 60 0 0 69 83 69 0.923 5.75 4.00 Prom - 139287 139248 40 -1.26 5.02 PlyA - 139307 139302 6 1.05 5.01 Sngl - 171524 169797 1728 2 0 61 34 3232 0.998 308.00 5.00 Prom - 180293 180254 40 -2.86 6.00 Prom + 184854 184893 40 0.44 6.01 Init + 200326 200434 109 0 1 86 33 174 0.921 12.08 6.02 Term + 201066 201286 221 1 2 66 44 107 0.474 1.30 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 124997 124881 117 2 0 49 36 97 0.867 -0.86 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:110503286_110704782|GENSCAN_predicted_peptide_1|57_aa MALAGFTAFRRAMVTSPDAFWSWQAKCQGHQREGKEDGDLESSLCYCGHPDARMPTA >gi568815597r:110503286_110704782|GENSCAN_predicted_CDS_1|174_bp atggctctggcagggttcacagccttcaggagggcaatggtcacaagcccagatgccttc tggagctggcaggccaagtgtcagggacaccagagggagggcaaagaggatggtgacctg gagagcagcctctgctattgtggtcatccagatgcacggatgcccacagcgtaa >gi568815597r:110503286_110704782|GENSCAN_predicted_peptide_2|876_aa MERNLSKSTNLMELRSFSVLPACQQFPQASHDLLKWSHPIYKTVWGFAYTQQGPRKCDRP MFFPVLQMQHSFKQGLNVQPSDPHASWTTLAQGEKSGLQGYLEEQLINTDSPCEISAILR EEETTVNPYYIYKLCTFQSEGQLQVLCSKVQYKVNSCNPKEKVSQNPWSISTTLTQGMRI SPPQDVKRCQAQKVMDSPEYAEEGQIFPEQEFPKPDADLHSAMRSGVTDPPPSHKIPCRG EHISGSPVSHIKQTTGGRVKAKAFKGVQNSSSNVTTIGRGQLVLKALTKAVHEKKTRPGY CQTPAGTDTTVPTSGSSAALASGASRWAEAHRKLFSARQGPVTEETRKAMPDLISQSRKV SYEQGMDVCGWKEMEVALVNFDNSDEIQEEPGYATDFDSTSPKGRPGGSSFSNGKILISE STNHETAFSKLPGDYADPPGPEPVVLNEGNQRVIINIAGLRFETQLRTLSQFPETLLGDR EKRMQFFDSMRNEYFFDRNRPSFDGILYYYQSGGKIRRPANVPIDIFADEISFYELGSEA MDQFREDEGFIKDPETLLPTNDIHRQFWLLFEYPESSSAARAVAVVSVLVVVISITIFCL ETLPEFREDRELKVVRDPNLNMSKTVLSQTMFTDPFFMVESTCIVWFTFELVLRFVVCPS KTDFFRNIMNIIDIISIIPYFATLITELVQETEPSAQQNMSLAILRIIRLVRVFRIFKLS RHSKGLQILGQTLKASMRELGLLIFFLFIGVILFSSAVYFAEVDEPESHFSSIPDGFWWA VVTMTTVGYGDMCPTTPGGKIVGTLCAIAGVLTIALPVPVIVSNFNYFYHRETENEEKQN IPGEIERILNSVGSRMGSTDSLNKTNGGCSTEKSRK >gi568815597r:110503286_110704782|GENSCAN_predicted_CDS_2|2631_bp atggaaaggaacttgtccaagagcacaaacctcatggagctgaggagcttctcagtgcta cccgcctgccagcagttccctcaagcttcccacgatcttctgaaatggagccatcctatt tacaaaacggtgtggggattcgcctacacgcagcagggaccccgaaagtgtgatcgcccc atgttctttccagttctacagatgcagcactcatttaagcaaggattgaatgttcagccc tcagatcctcatgcttcctggaccaccttagctcaaggggagaagtctggcctccaaggc taccttgaagagcaactaatcaacactgactcgccctgtgaaatctcagccatcctcaga gaggaagaaacaactgtgaacccctattacatctacaagctgtgcaccttccagtcagaa ggacagcttcaggtcctttgcagcaaagtccagtacaaagtgaactcatgcaaccccaaa gagaaggtctcccagaacccctggtcaatttccactaccctcacccaagggatgagaatt tctccaccacaagatgttaaaagatgtcaggctcagaaagtcatggattcaccagagtat gcagaagaaggccagatttttccagaacaagagttccctaaacctgatgcagatctgcac tcagccatgcgctctggggtgacagatcctccaccctctcacaagatcccctgccgagga gagcacatctcaggatcacctgtcagccacatcaagcagacaacaggaggcagggtaaag gctaaggccttcaagggggtccagaacagcagcagcaatgtaacgactatagggagaggg cagcttgtgctgaaggctcttaccaaggctgtccacgagaagaagacccgtccaggctac tgtcagaccccagctggaactgacactacagtgcccaccagcggcagctcagctgccctg gcatctggagccagcaggtgggcggaggcccatcgtaaactcttcagtgccaggcagggg ccagtcactgaggagaccaggaaagcgatgcctgacttgatcagccagtcacgcaaggtc tcctatgagcagggaatggatgtgtgtggctggaaagaaatggaggttgcgctggtcaat tttgataattcagatgaaatccaagaagagccaggctatgccacagacttcgactcaacc agcccaaaaggccggcctgggggcagctccttctccaacgggaagatcctcatcagcgaa agcaccaaccatgagacggccttctccaagcttccgggagactatgctgaccccccaggg cctgagccagtggtcctaaatgaaggaaaccagcgggtgatcatcaacattgctgggctg agatttgagacccagctcagaacccttagtcagttcccagagactctcctgggagaccgg gaaaaaaggatgcagttctttgactccatgagaaatgagtatttctttgatcggaaccgg cccagttttgatggaatcctatattattaccaatctggtgggaaaattcggcgcccagcc aatgttcccattgatatctttgctgatgaaatctccttctatgagctgggtagtgaggcc atggaccagttccgggaggatgaaggcttcatcaaagaccctgaaacactgctacccacc aatgacatccaccgtcagttctggctcctctttgagtaccctgaaagttccagcgctgcc cgtgctgtggccgtggtctcggtgttggttgtggtcatctccatcaccatcttctgcctg gagacactgccagagttccgggaggatagggagctaaaggtggtcagagaccccaatctc aacatgagcaagacagtcctctcccagaccatgttcaccgaccctttcttcatggtggag tctacctgcatcgtgtggttcaccttcgagctggtgctccggttcgtggtctgccccagc aagactgacttcttcaggaacatcatgaacatcattgacatcatctccattatcccctac tttgcaactctcatcacagagctagtccaggagacagagccgagtgcccaacagaacatg tccctggccatcctgaggatcatccgcctggtgagggtcttccgcatcttcaagctctcg cgccactccaaggggctgcagatcctcgggcaaacactgaaggcgtccatgcgggagttg gggttgctcatcttctttctcttcattggagtcatcctcttctccagtgcagtctacttt gctgaggtggatgagccagagtcccatttctctagcattcctgatggcttctggtgggca gtggtcaccatgacaactgtaggctatggggacatgtgcccgaccaccccaggggggaag attgtgggcactctgtgtgccattgcaggggtcctcaccattgccctccctgtgcctgtc attgtctccaacttcaattacttctaccaccgggagactgagaatgaagaaaagcagaac atcccaggagaaattgaaagaatcctcaacagtgtaggctcaagaatgggcagcacagac tctcttaataagaccaatggtggctgttccacagagaagtctaggaaatga >gi568815597r:110503286_110704782|GENSCAN_predicted_peptide_3|584_aa MEAPVPIRKGKSVSGRCAARAAFPARLRRPEGLRGTRRRSLQRSRLGLSPRRQALADHCR LLGRSQTGEIKGNLHPAQEECTEWARDPLGFAWRGAEGTERSWWVDTYDPEADHECCERV VINISGLRFETQLKTLAQFPETLLGDPKKRMRYFDPLRNEYFFDRNRPSFDAILYYYQSG GRLRRPVNVPLDIFSEEIRFYELGEEAMEMFREDEGYIKEEERPLPENEFQRQVWLLFEY PESSGPARIIAIVSVMVILISIVSFCLETLPIFRDENEDMHGSGVTFHTYSNSTIGYQQS TSFTDPFFIVETLCIIWFSFEFLVRFFACPSKAGFFTNIMNIIDIVAIIPYFITLGTELA EKPEDAQQGQQAMSLAILRVIRLVRVFRIFKLSRHSKGLQILGQTLKASMRELGLLIFFL FIGVILFSSAVYFAEADERESQFPSIPDAFWWAVVSMTTVGYGDMVPTTIGGKIVGSLCA IAGVLTIALPVPVIVSNFNYFYHRETEGEEQAQYLQVTSCPKIPSSPDLKKSRSASTISK SDYMEIQEGVNNSNEDFREENLKTANCTLANTNYVNITKMLTDV >gi568815597r:110503286_110704782|GENSCAN_predicted_CDS_3|1755_bp atggaggccccggtgcccatccggaagggtaaatccgtgtcggggcgttgcgcggcgcgt gctgccttccctgcccggctgcggcgcccagaggggctgcggggcacccggcggcgcagc ctgcagcgcagcaggcttgggctgagtccgcggcggcaagcccttgccgaccactgccgt ctgctcggcaggtcccagaccggggagatcaaggggaacttgcaccctgctcaagaagag tgcacagaatgggctcgggaccctctggggtttgcgtggcgcggggcggagggcacggag aggagctggtgggtggacacctatgacccagaggcagaccacgagtgctgtgagagggtg gtgatcaacatctcagggctgcggtttgagacccagctaaagaccttagcccagtttcca gagaccctcttaggggacccaaagaaacgaatgaggtactttgaccccctccgaaatgag tactttttcgatcggaaccgccctagctttgatgccattttgtactactaccagtcaggg ggccgattgaggcgacctgtgaatgtgcccttagatatattctctgaagaaattcggttt tatgagctgggagaagaagcgatggagatgtttcgggaagatgaaggctacatcaaggag gaagagcgtcctctgcctgaaaatgagtttcagagacaagtgtggcttctctttgaatac ccagagagctcagggcctgccaggattatagctattgtgtctgtcatggtgattctgatc tcaattgtcagcttctgtctggaaacattgcccatcttccgggatgagaatgaagacatg catggtagtggggtgaccttccacacctattccaacagcaccatcgggtaccagcagtcc acttccttcacagaccctttcttcattgtagagacactctgcatcatctggttctccttt gaattcttggtgaggttctttgcctgtcccagcaaagccggcttcttcaccaacatcatg aacatcattgacattgtggccatcatcccctacttcatcaccctggggacagagttggct gagaagccagaggacgctcagcaaggccagcaggccatgtcactggccatcctccgtgtc atccggttggtaagagtctttaggattttcaagttgtccagacactccaaaggtctccag attctaggtcagaccctcaaagccagcatgagagaattgggcctcctgatattctttctc ttcataggggtcatccttttctctagtgctgtgtattttgcagaggccgatgagcgagag tcccagttccccagcatcccagatgccttctggtgggcagtcgtctccatgacaactgta ggctatggagacatggttccgactaccattgggggaaagatagtgggttccctatgtgcg attgcaggtgtgttaactattgccttaccggtccctgtcattgtgtccaatttcaactac ttctaccaccgggagacagagggagaggaacaggcccaatacttgcaagtgacaagctgt ccaaagatcccatcctcccctgacctaaagaaaagtagaagtgcctctaccattagtaag tctgattacatggagatccaggagggtgtaaataacagtaatgaggactttagagaggaa aacttgaaaacagccaactgtaccttggctaacacaaactatgtgaatattaccaaaatg ttaactgatgtctga >gi568815597r:110503286_110704782|GENSCAN_predicted_peptide_4|138_aa MNDIKCKFEENTSGLAAKKERRLNQQIFMGHLFKSGTRCRTLDKAPIRQYRDGHNGSLLS RSSQSGPSSLAFCQVGFLTAQPSPPRRRNGKDRYTLVLQHQECQDDLATSSLVYLSLPCF KDLGRSKHQSITVADTNK >gi568815597r:110503286_110704782|GENSCAN_predicted_CDS_4|417_bp atgaatgatatcaaatgcaaatttgaggagaacacatcaggtcttgctgccaagaaagag agacgcctcaaccaacaaatatttatggggcacctgtttaagtcaggcaccagatgccgg acactggacaaggcccccatcaggcagtacagagatggacacaatgggtctctgctctca aggagttcccagtctgggccctcctctctggccttctgccaagtggggttcttaacagca cagccttcacctccgagaaggcgcaatgggaaagacagatacacgttggttctgcaacac caggaatgccaggatgatttagccacctcctcacttgtctacctttccctcccctgcttc aaagacttgggtcgatcgaagcaccaaagcatcactgttgctgacactaacaagtag >gi568815597r:110503286_110704782|GENSCAN_predicted_peptide_5|575_aa MDERLSLLRSPPPPSARHRAHPPQRPASSGGAHTLVNHGYAEPAAGRELPPDMTVVPGDH LLEPEVADGGGAPPQGGCGGGGCDRYEPLPPSLPAAGEQDCCGERVVINISGLRFETQLK TLCQFPETLLGDPKRRMRYFDPLRNEYFFDRNRPSFDAILYYYQSGGRIRRPVNVPIDIF SEEIRFYQLGEEAMEKFREDEGFLREEERPLPRRDFQRQVWLLFEYPESSGPARGIAIVS VLVILISIVIFCLETLPEFRDEKDYPASTSQDSFEAAGNSTSGSRAGASSFSDPFFVVET LCIIWFSFELLVRFFACPSKATFSRNIMNLIDIVAIIPYFITLGTELAERQGNGQQAMSL AILRVIRLVRVFRIFKLSRHSKGLQILGQTLKASMRELGLLIFFLFIGVILFSSAVYFAE ADDPTSGFSSIPDAFWWAVVTMTTVGYGDMHPVTIGGKIVGSLCAIAGVLTIALPVPVIV SNFNYFYHRETEGEEQSQYMHVGSCQHLSSSAEELRKARSNSTLSKSEYMVIEEGGMNHS AFPQTPFKTGNSTATCTTNNNPNSCVNIKKIFTDV >gi568815597r:110503286_110704782|GENSCAN_predicted_CDS_5|1728_bp atggacgagcgcctcagccttctgcgctcgccgccgccgccctcagcccgccaccgcgcc caccctcctcagcgcccagcgagcagcggcggtgcccacacgctggtgaaccacggctac gcggagcccgccgcaggccgcgagctgccgcccgacatgaccgtggtgcccggggaccac ctgctggagccggaggtggccgatggtggaggggccccgcctcaaggcggctgtggcggc ggcggctgcgaccgctacgagccgctgccgccctcactgccggccgcgggcgagcaggac tgctgcggggagcgcgtggtcatcaacatctccgggctgcgcttcgagacgcagctgaag accctttgccagttccccgagacgctgctgggcgaccccaagcggcgcatgaggtacttc gacccgctccgcaacgagtacttcttcgaccgcaaccggcccagcttcgacgccatcctc tactactatcagtccgggggccgcatccgccggccggtcaacgtgcccatcgacattttc tccgaggagatccgcttctaccagctgggcgaggaggccatggagaagttccgcgaggac gagggcttcctgcgggaggaggagcggcccttgccccgccgcgacttccagcgccaggtg tggctgctcttcgagtaccccgagagctccgggccggcccggggcatcgccatcgtgtcc gtgctggtcatcctcatctccattgtcatcttctgcctggagacgctgccggagttccgc gacgagaaggactaccccgcctcgacgtcgcaggactcattcgaagcagccggcaacagc acgtcggggtcccgcgcaggagcctccagcttctccgatcccttcttcgtggtggagacg ctgtgcatcatctggttctccttcgaactgctggtgcggttcttcgcttgtcctagcaaa gccaccttctcgcgaaacatcatgaacctgatcgacattgtggccatcattccttatttt atcactctgggtaccgagctggccgaacgacagggcaatggacagcaggccatgtctctg gccatcctgagggtcatccgcctggtaagggtcttccgcatcttcaagctgtcgcgccac tccaaggggctgcagatcctcgggcaaacgctgaaggcgtccatgcgggagctgggattg ctcatcttcttcctctttattggggtcatccttttctccagcgcggtctactttgccgag gcagacgaccccacttcaggtttcagcagcatcccggatgccttctggtgggcagtggta accatgacaacagtgggttacggcgatatgcacccagtgaccatagggggcaagattgtg ggatctctctgtgccatcgccggtgtcttgaccatcgcattgccagttcccgtgattgtt tccaacttcaattacttctaccaccgggagacagaaggggaagagcaatcccagtacatg cacgtgggaagttgccagcacctctcctcttcagccgaggagctccgaaaagcaaggagt aactcgactctgagtaagtcggagtatatggtgatcgaagaggggggtatgaaccatagc gctttcccccagacccctttcaaaacgggcaattccactgccacctgcaccacgaacaat aatcccaactcttgtgtcaacatcaaaaagatattcaccgatgtttaa >gi568815597r:110503286_110704782|GENSCAN_predicted_peptide_6|109_aa MVEGKEEHVLSYMNGSRQRENEKAVKVETPDKTINSCAQCKLSVDLPFWSLEDGGPLLTA PLGGAPVGTLCGGSDPTFPFHTVLAEVLYERPTPAANFCLGIQVFPYIF >gi568815597r:110503286_110704782|GENSCAN_predicted_CDS_6|330_bp atggtggaaggcaaggaggagcacgtcctgtcttacatgaatggcagcaggcaaagagag aatgagaaagctgtaaaagtggaaacacctgataaaaccatcaactcttgtgcacagtgc aagctgtctgtggatctaccattctggagtctggaggacggtggccctcttctcacagct ccactaggtggtgccccagtagggactctgtgtgggggctctgaccccacatttcccttc cacactgtcctagcagaggttctgtatgagaggcccacccctgcagcaaacttctgcctg ggcatccaggtgtttccttacatcttctga