GENSCAN 1.0	Date run:  7-Nov-116	Time: 17:02:43

Sequence gi568815597r:110417255_110618787 : 201533 bp : 48.53% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  33963  34034   72  2  0   78   98    37 0.630   3.32
 1.02 Intr +  36707  36832  126  1  0  130   73   146 0.991  18.18
 1.03 Term +  38978  39097  120  1  0  107   48   123 0.996   8.67
 1.04 PlyA +  40082  40087    6                               1.05

 2.06 PlyA -  40791  40786    6                              -0.45
 2.05 Term -  41440  41135  306  1  0   66   43   131 0.484   1.52
 2.04 Intr -  46469  46307  163  1  1   81   99     2 0.203   0.58
 2.03 Intr -  47114  46946  169  0  1   64   69   137 0.489   8.40
 2.02 Intr -  49115  49017   99  0  0   54  109    78 0.715   6.58
 2.01 Init -  51440  51383   58  2  1   84   95    53 0.984   7.07
 2.00 Prom -  59369  59330   40                              -1.96

 3.00 Prom +  63451  63490   40                              -3.96
 3.01 Init +  63512  63573   62  1  2   72  109    33 0.984   4.68
 3.02 Intr +  65521  65671  151  1  1   92   76   152 0.995  14.56
 3.03 Intr +  66660  66777  118  2  1   97   72   144 0.976  13.74
 3.04 Intr +  67452  67690  239  1  2   -2  103   151 0.434   4.93
 3.05 Intr +  69602  69756  155  1  2   99   12   217 0.905  14.07
 3.06 Intr +  71248  71380  133  1  1   42   94   100 0.198   6.65
 3.07 Intr +  72120  72317  198  2  0  -85   78   317 0.837  12.95
 3.08 Intr +  72918  73016   99  2  0   19   80   189 0.999  11.51
 3.09 Term +  73780  73917  138  0  0  107   38   132 0.886   8.06
 3.10 PlyA +  73996  74001    6                               1.05

 4.04 PlyA -  76294  76289    6                              -0.45
 4.03 Term -  76986  76855  132  0  0   98   35    73 0.137   1.09
 4.02 Intr -  84561  84367  195  0  0   78   47    94 0.206   3.91
 4.01 Init -  87173  87072  102  2  0   64   50    56 0.085  -0.36
 4.00 Prom -  92947  92908   40                              -4.16

 5.13 PlyA -  94078  94073    6                               1.05
 5.12 Term - 101534  99998 1537  1  1   99   43  1872 0.069 173.75
 5.11 Intr - 113071 112952  120  0  0   76   47    73 0.019   1.61
 5.10 Intr - 116202 116142   61  1  1   85   64    67 0.022   1.79
 5.09 Intr - 131637 131563   75  1  0   26   95   127 0.159   6.69
 5.08 Intr - 132180 132102   79  0  1   45   75    49 0.087  -1.58
 5.07 Intr - 133971 133918   54  0  0   97   75    30 0.004   1.78
 5.06 Intr - 142338 142234  105  0  0   68   53    61 0.002   1.01
 5.05 Intr - 150113 149928  186  2  0   55   47   106 0.075   3.09
 5.04 Intr - 155453 155350  104  0  2    4   75   143 0.063   4.49
 5.03 Intr - 165100 164998  103  1  1   93   81    55 0.604   5.05
 5.02 Intr - 167320 167143  178  0  1   66   75    93 0.800   5.72
 5.01 Init - 167477 167449   29  2  2   78  106     8 0.636   0.70
 5.00 Prom - 176404 176365   40                              -4.86

 6.03 PlyA - 177747 177742    6                               1.05
 6.02 Term - 187468 186029 1440  1  0   87   38  1152 0.628 100.92
 6.01 Init - 190024 189710  315  1  0   94   32   172 0.097   9.55

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:110417255_110618787|GENSCAN_predicted_peptide_1|105_aa
MRGATRVSIMLLLVTVSDCAVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGREGEEC
HPGSHKVPFFRKRKHHTCPCLPNLLCSRFPDGRYRCSMDLKNINF

>gi568815597r:110417255_110618787|GENSCAN_predicted_CDS_1|318_bp
atgagaggtgccacgcgagtctcaatcatgctcctcctagtaactgtgtctgactgtgct
gtgatcacaggggcctgtgagcgggatgtccagtgtggggcaggcacctgctgtgccatc
agcctgtggcttcgagggctgcggatgtgcaccccgctggggcgggaaggcgaggagtgc
caccccggcagccacaaggtccccttcttcaggaaacgcaagcaccacacctgtccttgc
ttgcccaacctgctgtgctccaggttcccggacggcaggtaccgctgctccatggacttg
aagaacatcaatttttag

>gi568815597r:110417255_110618787|GENSCAN_predicted_peptide_2|264_aa
MRLSDIGKGESPDDHRGQAAAKTLACQTSSPEIQIQWLWDAAQDPGYFTSTPASGELKHI
REKIVRSFLDILLVYLTEEIPPGNASVVCWLQHVGAIFIIAIVQMGKLRYTETNMSVTDA
QQSLSSRQGAETGKDLLDKRTINHILPVTWTKWSSQDPSTAPQGGSQFRKGSTKVHQAAV
NGTSRKAMNVKGQAMVKMESWTWKGTLYPGPEPETKGESDSRAARRSQVTTAGDNVRKRR
QKQKQKQTKKGPCGGLAWLKWVLE

>gi568815597r:110417255_110618787|GENSCAN_predicted_CDS_2|795_bp
atgaggctcagtgacattggcaagggtgaaagcccagatgaccacaggggccaggcagct
gctaaaactctagcttgccagacttcctctccagaaatccagatccagtggctctgggat
gcggcacaggaccctgggtacttcacaagcaccccagcttctggggaattgaagcacatc
cgagaaaagattgttcgttccttcctggacatcttattggtttacctcactgaagagatc
cctccaggtaatgctagtgtggtgtgctggttacaacacgtaggtgctatttttattatt
gccattgtacagatggggaaactgaggtacacagaaacaaatatgtctgtgactgacgcg
cagcagagtctatcgagtaggcagggggctgagacaggaaaggatcttttggacaaaagg
accataaatcatatccttcctgtgacctggaccaagtggtcttcccaagacccaagcact
gcccctcagggagggtcacaatttagaaagggctccacaaaagtccaccaggcagcagtc
aatggtacctcgagaaaagctatgaatgtcaaaggacaagccatggtcaaaatggaatcc
tggacgtggaagggaactttatatccaggacctgagcctgaaaccaagggagaaagcgac
tcaagggctgcaagaaggtcccaagtcaccacagcaggtgacaatgtccgtaaaaggaga
caaaaacaaaaacaaaaacaaacaaagaaaggaccctgtgggggcctggcctggcttaag
tgggtcctggagtaa

>gi568815597r:110417255_110618787|GENSCAN_predicted_peptide_3|430_aa
MRGLVVFLAVFALSEVNAITRVPLHKGKSLRRALKERRLLEDFLRNHHYAVSRKHSSSGV
VASESLTNYLDCQYFGKIYIGTLPQKFTLVFDTGSPDIWVPSVYCNSDACQRIASIEAIF
LQGLLRGSALAGTVSGEKILRAPLTLGLCFRKPPTLRSVQVLHPQNMGKSLSIQYGTGSM
RGLLGYDTVTDLVEVMILLQVSNIVDPHQTVGLSTQEPGDVFTYSEFDGILGLAYPSLAS
EYALRLGFRNDQGSMLTLRAIDLSYYTGSLHWIPMTARILAVHCGQEGPGEGGLDEAILH
TFGSVIIDGVVVACDGGCQAILDTGTSLLVGPGGNILNIQQAIGATAGQYNEFDIDCGRL
SSIPTAVFEIHGKKYPLPPSAYTSQDQGFCTSGFQGDYSSQQWILGNVFIWEYYSVFDRT
NNRVGLAKAV

>gi568815597r:110417255_110618787|GENSCAN_predicted_CDS_3|1293_bp
atgaggggccttgtggtattccttgcagtctttgctctctctgaggtcaatgccatcacc
agggttcctctgcacaaagggaagtcgctgaggagggccctgaaggagcgcaggctcctg
gaggacttcctgaggaatcaccattatgcagtcagcaggaagcactccagctctggggtg
gtggccagcgagtctctgaccaactacctggattgtcagtactttgggaagatctacatc
gggacccttccccagaagttcaccttggtgtttgatacaggctccccggatatctgggtg
ccctctgtctactgcaacagtgatgcctgtcagagaattgccagtattgaggccatattc
ttacaaggcctgctgaggggttcggccctggctggcaccgtgagtggtgagaagattcta
agggccccactaactctgggtctgtgtttcagaaaaccaccaacgcttcgatccgtccaa
gtcctccacccacagaacatgggcaagtccctgtccatccagtatggcacaggcagcatg
cggggcttgctgggctatgacactgtcaccgacctagtggaggtcatgattctcttgcag
gtctccaacattgtggacccccaccagactgtgggtctgagcacccaggaacctggcgac
gtcttcacctactccgagtttgatgggatcctggggctggcctatccctctcttgcctct
gagtacgcgctgcgccttggtttcaggaatgaccaggggagcatgctcacgctgagggcc
attgatctgtcgtactacacaggctccctgcactggatacccatgactgcaagaatactg
gcagttcactgtggacaggaaggacctggggagggagggctggatgaggccatcttgcat
acctttggaagtgtcatcattgacggcgtggtggtggcctgtgacggtggctgtcaggcc
atcctggacaccggcacctccctgctggtggggcctggcggcaacatcctcaacatccag
caggccattggagccactgcgggccagtacaatgagtttgacatcgactgcgggcgcctg
agcagcattcccacggctgtcttcgagatccacggcaagaagtaccccctgccaccctcc
gcctataccagccaggaccagggcttctgcaccagtggtttccagggtgactatagttcc
cagcagtggatcctggggaatgtcttcatctgggagtattacagtgtctttgacaggacc
aataaccgtgtggggctggcgaaggctgtctga

>gi568815597r:110417255_110618787|GENSCAN_predicted_peptide_4|142_aa
MFPGLAAATPQWNPPPPMGIEGTPPQSCPMTVAEQARESTSKTESTGFANQTHCFGVFYL
LQASHKIQSTLKTRGLHKGMLTRKQGLLEAILEGYQPQGVGPLHKQTTDVLVAPSQSLLI
HHLAQPSEGLTTETNVDPVPEP

>gi568815597r:110417255_110618787|GENSCAN_predicted_CDS_4|429_bp
atgttccctggcttagcagcagcgactcctcagtggaacccaccaccgcccatgggcata
gagggcaccccaccacagagctgtcctatgacggtggcagagcaagcaagagagagtaca
agcaagacagaatccacgggctttgcaaaccaaacgcattgctttggtgtcttctatttg
ttacaagcaagtcacaaaatccagtccacactcaagacgagagggttacacaagggcatg
ctgaccaggaagcagggattactggaggccattttggaaggttatcaaccacagggggta
ggccctcttcataagcaaaccacggatgtgctggtggccccttcccaaagcctcctcatt
caccacctggcacagccctcagaaggcctgaccacagagaccaatgtggaccccgtccca
gaaccataa

>gi568815597r:110417255_110618787|GENSCAN_predicted_peptide_5|876_aa
MERNLSKSTNLMELRSFSVLPACQQFPQASHDLLKWSHPIYKTVWGFAYTQQGPRKCDRP
MFFPVLQMQHSFKQGLNVQPSDPHASWTTLAQGEKSGLQGYLEEQLINTDSPCEISAILR
EEETTVNPYYIYKLCTFQSEGQLQVLCSKVQYKVNSCNPKEKVSQNPWSISTTLTQGMRI
SPPQDVKRCQAQKVMDSPEYAEEGQIFPEQEFPKPDADLHSAMRSGVTDPPPSHKIPCRG
EHISGSPVSHIKQTTGGRVKAKAFKGVQNSSSNVTTIGRGQLVLKALTKAVHEKKTRPGY
CQTPAGTDTTVPTSGSSAALASGASRWAEAHRKLFSARQGPVTEETRKAMPDLISQSRKV
SYEQGMDVCGWKEMEVALVNFDNSDEIQEEPGYATDFDSTSPKGRPGGSSFSNGKILISE
STNHETAFSKLPGDYADPPGPEPVVLNEGNQRVIINIAGLRFETQLRTLSQFPETLLGDR
EKRMQFFDSMRNEYFFDRNRPSFDGILYYYQSGGKIRRPANVPIDIFADEISFYELGSEA
MDQFREDEGFIKDPETLLPTNDIHRQFWLLFEYPESSSAARAVAVVSVLVVVISITIFCL
ETLPEFREDRELKVVRDPNLNMSKTVLSQTMFTDPFFMVESTCIVWFTFELVLRFVVCPS
KTDFFRNIMNIIDIISIIPYFATLITELVQETEPSAQQNMSLAILRIIRLVRVFRIFKLS
RHSKGLQILGQTLKASMRELGLLIFFLFIGVILFSSAVYFAEVDEPESHFSSIPDGFWWA
VVTMTTVGYGDMCPTTPGGKIVGTLCAIAGVLTIALPVPVIVSNFNYFYHRETENEEKQN
IPGEIERILNSVGSRMGSTDSLNKTNGGCSTEKSRK

>gi568815597r:110417255_110618787|GENSCAN_predicted_CDS_5|2631_bp
atggaaaggaacttgtccaagagcacaaacctcatggagctgaggagcttctcagtgcta
cccgcctgccagcagttccctcaagcttcccacgatcttctgaaatggagccatcctatt
tacaaaacggtgtggggattcgcctacacgcagcagggaccccgaaagtgtgatcgcccc
atgttctttccagttctacagatgcagcactcatttaagcaaggattgaatgttcagccc
tcagatcctcatgcttcctggaccaccttagctcaaggggagaagtctggcctccaaggc
taccttgaagagcaactaatcaacactgactcgccctgtgaaatctcagccatcctcaga
gaggaagaaacaactgtgaacccctattacatctacaagctgtgcaccttccagtcagaa
ggacagcttcaggtcctttgcagcaaagtccagtacaaagtgaactcatgcaaccccaaa
gagaaggtctcccagaacccctggtcaatttccactaccctcacccaagggatgagaatt
tctccaccacaagatgttaaaagatgtcaggctcagaaagtcatggattcaccagagtat
gcagaagaaggccagatttttccagaacaagagttccctaaacctgatgcagatctgcac
tcagccatgcgctctggggtgacagatcctccaccctctcacaagatcccctgccgagga
gagcacatctcaggatcacctgtcagccacatcaagcagacaacaggaggcagggtaaag
gctaaggccttcaagggggtccagaacagcagcagcaatgtaacgactatagggagaggg
cagcttgtgctgaaggctcttaccaaggctgtccacgagaagaagacccgtccaggctac
tgtcagaccccagctggaactgacactacagtgcccaccagcggcagctcagctgccctg
gcatctggagccagcaggtgggcggaggcccatcgtaaactcttcagtgccaggcagggg
ccagtcactgaggagaccaggaaagcgatgcctgacttgatcagccagtcacgcaaggtc
tcctatgagcagggaatggatgtgtgtggctggaaagaaatggaggttgcgctggtcaat
tttgataattcagatgaaatccaagaagagccaggctatgccacagacttcgactcaacc
agcccaaaaggccggcctgggggcagctccttctccaacgggaagatcctcatcagcgaa
agcaccaaccatgagacggccttctccaagcttccgggagactatgctgaccccccaggg
cctgagccagtggtcctaaatgaaggaaaccagcgggtgatcatcaacattgctgggctg
agatttgagacccagctcagaacccttagtcagttcccagagactctcctgggagaccgg
gaaaaaaggatgcagttctttgactccatgagaaatgagtatttctttgatcggaaccgg
cccagttttgatggaatcctatattattaccaatctggtgggaaaattcggcgcccagcc
aatgttcccattgatatctttgctgatgaaatctccttctatgagctgggtagtgaggcc
atggaccagttccgggaggatgaaggcttcatcaaagaccctgaaacactgctacccacc
aatgacatccaccgtcagttctggctcctctttgagtaccctgaaagttccagcgctgcc
cgtgctgtggccgtggtctcggtgttggttgtggtcatctccatcaccatcttctgcctg
gagacactgccagagttccgggaggatagggagctaaaggtggtcagagaccccaatctc
aacatgagcaagacagtcctctcccagaccatgttcaccgaccctttcttcatggtggag
tctacctgcatcgtgtggttcaccttcgagctggtgctccggttcgtggtctgccccagc
aagactgacttcttcaggaacatcatgaacatcattgacatcatctccattatcccctac
tttgcaactctcatcacagagctagtccaggagacagagccgagtgcccaacagaacatg
tccctggccatcctgaggatcatccgcctggtgagggtcttccgcatcttcaagctctcg
cgccactccaaggggctgcagatcctcgggcaaacactgaaggcgtccatgcgggagttg
gggttgctcatcttctttctcttcattggagtcatcctcttctccagtgcagtctacttt
gctgaggtggatgagccagagtcccatttctctagcattcctgatggcttctggtgggca
gtggtcaccatgacaactgtaggctatggggacatgtgcccgaccaccccaggggggaag
attgtgggcactctgtgtgccattgcaggggtcctcaccattgccctccctgtgcctgtc
attgtctccaacttcaattacttctaccaccgggagactgagaatgaagaaaagcagaac
atcccaggagaaattgaaagaatcctcaacagtgtaggctcaagaatgggcagcacagac
tctcttaataagaccaatggtggctgttccacagagaagtctaggaaatga

>gi568815597r:110417255_110618787|GENSCAN_predicted_peptide_6|584_aa
MEAPVPIRKGKSVSGRCAARAAFPARLRRPEGLRGTRRRSLQRSRLGLSPRRQALADHCR
LLGRSQTGEIKGNLHPAQEECTEWARDPLGFAWRGAEGTERSWWVDTYDPEADHECCERV
VINISGLRFETQLKTLAQFPETLLGDPKKRMRYFDPLRNEYFFDRNRPSFDAILYYYQSG
GRLRRPVNVPLDIFSEEIRFYELGEEAMEMFREDEGYIKEEERPLPENEFQRQVWLLFEY
PESSGPARIIAIVSVMVILISIVSFCLETLPIFRDENEDMHGSGVTFHTYSNSTIGYQQS
TSFTDPFFIVETLCIIWFSFEFLVRFFACPSKAGFFTNIMNIIDIVAIIPYFITLGTELA
EKPEDAQQGQQAMSLAILRVIRLVRVFRIFKLSRHSKGLQILGQTLKASMRELGLLIFFL
FIGVILFSSAVYFAEADERESQFPSIPDAFWWAVVSMTTVGYGDMVPTTIGGKIVGSLCA
IAGVLTIALPVPVIVSNFNYFYHRETEGEEQAQYLQVTSCPKIPSSPDLKKSRSASTISK
SDYMEIQEGVNNSNEDFREENLKTANCTLANTNYVNITKMLTDV

>gi568815597r:110417255_110618787|GENSCAN_predicted_CDS_6|1755_bp
atggaggccccggtgcccatccggaagggtaaatccgtgtcggggcgttgcgcggcgcgt
gctgccttccctgcccggctgcggcgcccagaggggctgcggggcacccggcggcgcagc
ctgcagcgcagcaggcttgggctgagtccgcggcggcaagcccttgccgaccactgccgt
ctgctcggcaggtcccagaccggggagatcaaggggaacttgcaccctgctcaagaagag
tgcacagaatgggctcgggaccctctggggtttgcgtggcgcggggcggagggcacggag
aggagctggtgggtggacacctatgacccagaggcagaccacgagtgctgtgagagggtg
gtgatcaacatctcagggctgcggtttgagacccagctaaagaccttagcccagtttcca
gagaccctcttaggggacccaaagaaacgaatgaggtactttgaccccctccgaaatgag
tactttttcgatcggaaccgccctagctttgatgccattttgtactactaccagtcaggg
ggccgattgaggcgacctgtgaatgtgcccttagatatattctctgaagaaattcggttt
tatgagctgggagaagaagcgatggagatgtttcgggaagatgaaggctacatcaaggag
gaagagcgtcctctgcctgaaaatgagtttcagagacaagtgtggcttctctttgaatac
ccagagagctcagggcctgccaggattatagctattgtgtctgtcatggtgattctgatc
tcaattgtcagcttctgtctggaaacattgcccatcttccgggatgagaatgaagacatg
catggtagtggggtgaccttccacacctattccaacagcaccatcgggtaccagcagtcc
acttccttcacagaccctttcttcattgtagagacactctgcatcatctggttctccttt
gaattcttggtgaggttctttgcctgtcccagcaaagccggcttcttcaccaacatcatg
aacatcattgacattgtggccatcatcccctacttcatcaccctggggacagagttggct
gagaagccagaggacgctcagcaaggccagcaggccatgtcactggccatcctccgtgtc
atccggttggtaagagtctttaggattttcaagttgtccagacactccaaaggtctccag
attctaggtcagaccctcaaagccagcatgagagaattgggcctcctgatattctttctc
ttcataggggtcatccttttctctagtgctgtgtattttgcagaggccgatgagcgagag
tcccagttccccagcatcccagatgccttctggtgggcagtcgtctccatgacaactgta
ggctatggagacatggttccgactaccattgggggaaagatagtgggttccctatgtgcg
attgcaggtgtgttaactattgccttaccggtccctgtcattgtgtccaatttcaactac
ttctaccaccgggagacagagggagaggaacaggcccaatacttgcaagtgacaagctgt
ccaaagatcccatcctcccctgacctaaagaaaagtagaagtgcctctaccattagtaag
tctgattacatggagatccaggagggtgtaaataacagtaatgaggactttagagaggaa
aacttgaaaacagccaactgtaccttggctaacacaaactatgtgaatattaccaaaatg
ttaactgatgtctga