GENSCAN 1.0	Date run: 6-Nov-116	Time: 12:28:51

Sequence gi568815586r:79492610_79790244 : 297635 bp : 38.80% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    708    759   52  2  1   61   85    63 0.348   4.67
 1.02 Intr +   7036   7176  141  2  0   30   19   202 0.201   6.90
 1.03 Intr +  10006  10101   96  2  0   85   81    36 0.084   1.66
 1.04 Intr +  10363  10487  125  2  2   31   89    76 0.913   1.38
 1.05 Intr +  12081  12206  126  2  0   84   -1   147 0.027   5.36
 1.06 Intr +  15092  15134   43  1  1   46   62    34 0.009  -6.51
 1.07 Intr +  16943  17090  148  0  1   81   88   124 0.343  10.07
 1.08 Term +  24811  24940  130  1  1   23   40   163 0.626   1.67
 1.09 PlyA +  25409  25414    6                               1.05

 2.00 Prom +  36668  36707   40                              -2.05
 2.01 Init +  49765  49801   37  0  1   96  115    51 0.595   8.83
 2.02 Intr +  50016  50149  134  1  2   84   14    31 0.059  -5.26
 2.03 Intr +  53457  53526   70  2  1   57   98   123 0.083   8.04
 2.04 Intr +  60436  60573  138  2  0   38  113    99 0.073   6.91
 2.05 Term +  66123  66562  440  1  2   53   39   490 0.919  35.05
 2.06 PlyA +  66639  66644    6                               1.05

 3.00 Prom +  67433  67472   40                              -7.85
 3.01 Init +  69208  69409  202  0  1   52   75   214 0.312  14.44
 3.02 Intr +  72927  73069  143  1  2   41   36    83 0.169  -2.45
 3.03 Intr +  73322  73498  177  1  0  -16   26   241 0.195   6.69
 3.04 Intr +  77239  77352  114  0  0   99   77    67 0.962   6.42
 3.05 Term +  82162  82503  342  0  0    0   41   333 0.482  13.33
 3.06 PlyA +  82574  82579    6                               1.05

 4.09 PlyA -  88212  88207    6                               1.05
 4.08 Term - 100084  99998   87  1  0   82   44   111 0.605   2.78
 4.07 Intr - 101824 101720  105  1  0   67   89    70 0.937   4.49
 4.06 Intr - 104049 103902  148  2  1   67   86    99 0.757   6.92
 4.05 Intr - 121000 120966   35  1  2   88  116    -1 0.149  -1.20
 4.04 Intr - 128598 128467  132  0  0   77  109   241 0.871  25.02
 4.03 Intr - 197606 197120  487  1  1  -28   83   425 0.348  22.49
 4.02 Intr - 197782 197665  118  2  1  104   78   162 0.923  15.40
 4.01 Init - 198031 197947   85  1  1   75   82   163 0.896  13.43
 4.00 Prom - 205526 205487   40                              -7.95

 5.03 PlyA - 207177 207172    6                               1.05
 5.02 Term - 218308 218121  188  2  2   62   36   200 0.791   8.87
 5.01 Init - 219737 219581  157  2  1   40  115   150 0.985  13.02
 5.00 Prom - 222289 222250   40                              -5.65

 6.00 Prom + 236083 236122   40                              -7.35
 6.01 Sngl + 237032 237250  219  1  0   49   48   212 0.946   8.21
 6.02 PlyA + 237437 237442    6                               1.05

 7.00 Prom + 243002 243041   40                              -3.05
 7.01 Init + 252857 252973  117  1  0   87   75    82 0.597   7.05
 7.02 Term + 270817 270933  117  0  0   81   38    56 0.025  -2.54
 7.03 PlyA + 274214 274219    6                               1.05

 8.06 PlyA - 275021 275016    6                               1.05
 8.05 Term - 283406 283320   87  2  0  108   42   123 0.995   6.38
 8.04 Intr - 285991 285941   51  1  0   55  106    70 0.925   3.69
 8.03 Intr - 289253 289206   48  2  0   93  110    27 0.908   3.36
 8.02 Intr - 293869 293765  105  1  0   64   72   127 0.993   8.19
 8.01 Intr - 296174 296039  136  1  1   11   75   137 0.887   4.35

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  10272  10326   55  2  1  107   84    76 0.875  10.60
S.002 Term +  12081  12215  135  2  0   84   43   147 0.952   6.84
S.003 Term +  77480  77698  219  1  0  -13   42   204 0.824   1.76

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:79492610_79790244|GENSCAN_predicted_peptide_1|286_aa
MEKKAFAKSMAAYQLPGVLTHMITRPHNGLLLQAEEQGEPVRVPKLKNLESDARGQEASS
MGERYCAGSMIPASASVNNLKKLPIMEDGEGEHILHGMCNGINQGYNSTHTEWRKARQGD
HPLGSSVEQGEPTLSREVAWESLQAPETDTDSQHKATASQKSGQTVLHADPSPYFSSLSR
NLDVNLYGPGLFLVDTDSLAGMRDKIEFYKSTQEWKGNNPNHSPSSIEDELLCSYGKLPC
DFQRTKRRFRKFPERKGEEEEETEEEEEDKEDSEDEEKNVVTIQRI

>gi568815586r:79492610_79790244|GENSCAN_predicted_CDS_1|861_bp
atggagaaaaaggcatttgccaagtcaatggctgcatatcagttaccaggagtattaact
cacatgatcacaaggccccacaatggcttgctcctgcaggctgaggagcaaggagagcca
gtccgagttccaaaactgaagaatttggagtctgatgctcgagggcaggaagcatccagc
atgggagaaagatactgtgcaggaagcatgataccagcctctgcttctgttaacaacctc
aagaagcttccaatcatggaggacggggaaggggagcacatattacatggtatgtgcaat
gggattaatcaaggatacaactcaacccatacagaatggagaaaagcaaggcagggtgac
cacccacttgggagcagtgtggagcagggggaacctacactgtccagggaagtggcttgg
gagagtctgcaggcaccagagactgatacggactcccagcacaaagcaaccgcctctcag
aaaagtggccagactgttctccatgcagatcccagtccttacttctcctcactgagcagg
aatttggatgtgaatctgtatggtcctggactttttttggttgacacagattcactggct
gggatgagagataaaattgaattctataagtctacacaagaatggaaaggcaataatcca
aaccactctccaagttctattgaagatgagttgctttgcagttacggcaaattgccttgc
gattttcaaaggaccaagagaagattccgtaagtttccagagagaaagggagaggaagag
gaggagacggaggaggaggaagaggacaaagaggacagtgaggatgaggaaaagaatgtg
gtgaccatacaaagaatctga

>gi568815586r:79492610_79790244|GENSCAN_predicted_peptide_2|272_aa
MDAELQVTLTVEGRDTLTKLSASLTIPGLQLYLIAALLPNPKPPLCPPLVSPHLNPQVVF
METKGGEGQSREVDATVEGNRVKLQTFTVSVTALKAVRLELFVPLVWSCSFLPVRSSSCW
PPADLHAYVTNVHNVFGPQNHPDSSSTPERCEETSEKPKKKKKQKPQEVPQEKTEDPSIS
FSKPKKKKSFSKEELMSSDLEETAGSTSLLKRKKSSPKTETVNDPEEAGNRSVSKKKTKF
SKEEPVSSGPEEAAGSKSSSKKKMFHKEAQED

>gi568815586r:79492610_79790244|GENSCAN_predicted_CDS_2|819_bp
atggacgccgagcttcaggtaactctcacagtggaaggccgagacactttaactaaatta
tctgcttccctgactattcctggactacagctatatctcattgccgcccttctgcccaat
ccaaagcctcctttgtgtcctcctcttgtatccccccaccttaacccacaagtggttttc
atggagacaaaaggtggagaaggacagtcccgtgaagtggatgcaacagttgaaggtaac
agagtgaagctgcagaccttcacggtgagtgttacagctcttaaggcggtccgtctagag
ttgttcgttcctctcgtttggagttgttcattcctcccggtgcgttcatcgtcttgctgg
cctcctgcagaccttcacgcttatgtgacaaacgttcacaatgtttttggtcctcaaaat
caccctgacagcagtagtactccagagaggtgtgaggagacgagtgaaaaacccaaaaag
aagaaaaagcaaaagccccaggaggttcctcaggagaaaacagaagacccatctatctct
ttctccaaacccaagaaaaagaaatctttttccaaggaggagttgatgagtagtgatctt
gaagagactgctggcagcaccagtcttctcaagaggaagaagtcttcacccaagacggaa
acagttaatgaccccgaagaggcaggcaacagaagtgtctccaagaaaaagacgaaattc
tccaaagaggagcctgtcagcagtggacctgaagaggctgctggcagcaagagcagctcc
aagaagaaaatgttccataaagaagcccaggaagattag

>gi568815586r:79492610_79790244|GENSCAN_predicted_peptide_3|325_aa
MPPAFPTVGSCMARASPMGAAPCSAAPGPIDHPRAENCRHTAWDWQAAPPVALVRDPLGE
ASWARESANLVGTWKTFVSVSGIVNTLISTLSKRTNQLSVKQTNGLPVKWTNQQDSCLTL
TTKVCSFTPEPVRPRTHQKEEIPNTSEHQKEQTLDTLPLRTVTLAVRVHGFILEDWAMSW
MQSLPANPGCGHWGQLRPLQWQGVRAAATTNHVCSFTPEASETTNLPGGTNNSSHAALRA
VTLTVKVCSFTPEPARPRTHQKKETANTSEHQKERNPDTPPLRTVTLTARVRSFILEVSE
TKNPPIPDTLALKKQFESDGKRYST

>gi568815586r:79492610_79790244|GENSCAN_predicted_CDS_3|978_bp
atgccccctgctttccccaccgtgggctcctgcatggcccgagcctccccgatgggtgcc
gccccctgctccgcagcgcctggtcccatcgaccacccaagggctgagaattgcaggcac
acggcatgggactggcaggcagctccacccgtggccctggtgcgggatccactaggcgaa
gccagctgggctcgtgaatcagctaatctagtggggacgtggaaaacttttgtgtctgtc
tcagggattgtaaacacactaatcagcaccctgtcaaaacggaccaatcagctctctgta
aaacagaccaatgggctccctgtaaaatggaccaatcagcaggacagctgtttaacactc
accacgaaggtctgcagcttcactcctgagccagtgagaccacgaacccaccagaaggaa
gaaattccgaacacatccgaacatcaaaaggaacaaactctggacacgctgcctttaaga
actgtaacactcgccgtgagggtccacggcttcattcttgaagactgggccatgagttgg
atgcagtctctacctgccaacccaggctgtggccactggggccagcttcgccctctccag
tggcagggtgtcagggcagctgctaccactaatcatgtctgcagcttcactcctgaagcc
agtgagaccacgaacctgccgggaggaacgaacaactctagccatgccgccttaagagct
gtaacactcaccgtgaaggtctgcagcttcactcctgagccagcgagaccacgaacccac
cagaagaaagaaactgcaaacacatccgaacatcagaaggaacgaaatccggacacacca
cctttaagaactgtgacgctcaccgcgagggtccgcagcttcattcttgaagtcagtgag
acgaagaacccaccaattccggacacattagcactaaagaaacagtttgaaagtgatgga
aaaagatattctacataa

>gi568815586r:79492610_79790244|GENSCAN_predicted_peptide_4|398_aa
MGRAGRLGGGVSPLSSLLGAAIVGVRTVAPPAAVGAAAVPLLEVVSWHPRGRRKEEEAAA
GALVGGLSGLGGSTTDFLEEWKAKREKMRAKQNPPGPAPPGGGSSDAAGKPPAGALGTPA
AAAANELNNNLPGGAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPP
PQRDEEEPDGVPEKGKSSGPSARKGKGQIEKRKLREKRRSTGVVNIPAAECLDEYEDDEA
GQKERKREDAITQQNTIQNEAVNLLDPGSSYLLQEPPRTVSGRYKSTTSVSEEDVSSRYS
RTDRSGFPRYNRDANVSGTLVSSSTLEKKIEDLEKEVVRERQENLRLVRLMQDKEEMIGK
LKEEIDLLNRDLDDIEDENEQLKQENKTLLKVVGQLTR

>gi568815586r:79492610_79790244|GENSCAN_predicted_CDS_4|1197_bp
atggggcgggcgggccgacttgggggtggggtcagtcctctctcctcccttctaggggcg
gcgatcgtcggggtccgtactgtagccccgccggctgctgtgggagcggcggccgtccct
ctcctggaggtcgtctcctggcatcctcggggccgcaggaaggaagaggaggcagcggcc
ggagccctggtgggcggcctgagcggcctcggcggcagcaccacagacttcctggaggag
tggaaggcgaaacgcgagaagatgcgcgccaagcagaaccccccgggcccggcccccccg
ggagggggcagcagcgacgccgctgggaagccccccgcgggggctctgggcaccccggcg
gccgccgctgccaacgagctcaacaacaacctcccgggcggcgcgccggccgcacctgcc
gtccccggtcccgggggcgtgaactgcgcggtcggctccgccatgctgacgcgggcggcc
cccggcccgcggcggtcggaggacgagcccccagccgcctctgcctcggctgcaccgccg
ccccagcgtgacgaggaggagccggacggcgtcccagagaagggcaagagctcgggcccc
agtgccaggaaaggcaaggggcagatcgagaagaggaagctgcgggagaagcggcgctcc
accggcgtggtcaacatccctgccgcagagtgcttagatgagtacgaagatgatgaagca
gggcagaaagagcggaaacgagaagatgcaattacacaacagaacactattcagaatgaa
gctgtaaacttactagatccaggcagttcctatctgctacaggagccacctagaacagtt
tcaggcagatataaaagcacaaccagtgtctctgaagaagatgtctcaagtagatattct
cgaacagatagaagtgggttccctagatataacagggatgcaaatgtttcaggtactctg
gtttcaagtagcacactggaaaagaaaattgaagatcttgaaaaggaagtagtaagagaa
agacaagaaaacctaagacttgtgagactgatgcaagataaagaggaaatgattggaaaa
ctcaaagaagaaattgatttattaaatagagacctagatgacatagaagatgaaaatgaa
cagctaaagcaggaaaataaaactcttttgaaagttgtgggtcagctgaccaggtag

>gi568815586r:79492610_79790244|GENSCAN_predicted_peptide_5|114_aa
MAEGEGETEYVLHGDRRQSEQEKLPVLKPSYRFWNWWVLGLTDFKNEAADPSAEGAGSGL
SQPRGGLSRSSGGLKGSLSGLKGSLSAARVGSEAKEAPRVSEGCKGSQHAVASQ

>gi568815586r:79492610_79790244|GENSCAN_predicted_CDS_5|345_bp
atggcagaaggtgaaggggaaacagagtacgtcttacatggagacaggagacagagtgag
caggaaaaactgccagttttaaaaccatcatatcgtttctggaattggtgggttcttggt
ctcactgacttcaagaatgaagccgcggaccctagcgccgagggagcgggctccggcctc
agccagcccagaggagggctctctcggtccagcggcgggctgaagggctccttaagcggg
ctgaagggctccttaagcgcggccagagtgggctccgaggccaaggaggcaccaagagtc
agcgagggctgcaagggcagccagcatgctgtcgcctctcaatag

>gi568815586r:79492610_79790244|GENSCAN_predicted_peptide_6|72_aa
MSIHQEDLAILDDYALYNKAAKYVKKKLIDQKEEIGKFMIIVVENTTPFSIMDRKTGQKI
SKDIEELNNSID

>gi568815586r:79492610_79790244|GENSCAN_predicted_CDS_6|219_bp
atgtcaattcaccaagaagacttagcaatcctagatgattatgcactctataacaaagct
gcaaaatatgtgaagaaaaaactgatagatcagaaagaagaaataggcaaattcatgatt
atagttgtggagaatacaacacccttctcaataatggatagaaaaactggacagaaaatc
agcaaagatatagaagaactcaacaacagcattgactga

>gi568815586r:79492610_79790244|GENSCAN_predicted_peptide_7|77_aa
MKSISEQGNRMCKRKCEEIREVVYGGKLKFSVFQMLTIQLTLSDPYFFSFFHNCVKSVAI
NSLFHNTHTGSASLIEL

>gi568815586r:79492610_79790244|GENSCAN_predicted_CDS_7|234_bp
atgaagagtatctcggaacaaggaaacagaatgtgcaaaaggaagtgtgaggaaatcaga
gaagttgtatatggcggaaagctgaagttctcagtcttccagatgcttacaatccagctc
accctttcagatccttacttcttcagtttcttccacaattgtgtcaaatctgttgctata
aattccttatttcataatactcacactggttctgcttccctgattgaactttaa

>gi568815586r:79492610_79790244|GENSCAN_predicted_peptide_8|142_aa
XYETSSTSAGDRYDSLLGRSGSYSYLEERKPYSSRLEKDDSTDFKKLYEQILAENEKLKA
QLHDTNMELTDLKLQLEKATQRQERFADRSLLEMEKRERRALERRISEMEEELKMLPDLK
ADNQRLKDENGALIRVISKLSK

>gi568815586r:79492610_79790244|GENSCAN_predicted_CDS_8|429_bp
nnatatgaaaccagttctacatcagctggtgatcgatatgattccttgctgggtcgctct
ggatcatacagttacttagaagaaagaaaaccttacagcagcaggctagaaaaggatgac
tcaactgactttaaaaagctttatgaacaaattctagctgaaaatgaaaagctgaaggca
cagctacatgatacaaatatggaactaacagatcttaaattacagttggaaaaggccacc
cagagacaagaaagatttgctgatagatcactgttggaaatggaaaaaagggaacgaaga
gctctagaaagaagaatatctgaaatggaagaagagctcaaaatgttaccagacctaaaa
gcagacaaccagaggctaaaggatgaaaatggggccttgatcagagttataagcaaactt
tccaaataa