GENSCAN 1.0	Date run: 4-Nov-116	Time: 23:27:12

Sequence gi568815592r:88043859_88245274 : 201416 bp : 38.53% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3859   4255  397  0  1   61   77   484 0.393  37.83
 1.02 Intr +  10088  10144   57  0  0   59  103    58 0.795   2.34
 1.03 Intr +  13754  13855  102  0  0   51   68   164 0.946   9.93
 1.04 Intr +  14858  14964  107  0  2   85   89    22 0.967   1.01
 1.05 Intr +  15595  15730  136  0  1   61  103   131 0.993  11.12
 1.06 Intr +  20667  20760   94  1  1   52  119    -5 0.698  -2.90
 1.07 Term +  22324  22477  154  1  1  106   41   146 0.828   8.11
 1.08 PlyA +  22539  22544    6                               1.05

 2.02 PlyA -  24948  24943    6                               1.05
 2.01 Sngl -  45196  44891  306  1  0   40   55   207 0.965   8.22
 2.00 Prom -  45458  45419   40                              -4.35

 3.06 PlyA -  48759  48754    6                               1.05
 3.05 Term -  49646  49479  168  2  0   79   33   124 0.822   2.90
 3.04 Intr -  55013  54980   34  1  1   31   59    37 0.007  -7.59
 3.03 Intr -  56921  56639  283  0  1   96  101   106 0.379   8.25
 3.02 Intr -  71597  71453  145  2  1   49   98    68 0.204   2.83
 3.01 Init -  72322  72257   66  1  0   59   86    57 0.438   3.72
 3.00 Prom -  83464  83425   40                              -3.35

 4.02 PlyA -  84079  84074    6                               1.05
 4.01 Sngl - 101416  99998 1419  1  0   49   52  1302 0.977 117.45
 4.00 Prom - 103629 103590   40                              -6.45

 5.04 PlyA - 103687 103682    6                               1.05
 5.03 Term - 103884 103743  142  2  1   43   41   184 0.210   5.62
 5.02 Intr - 122896 122718  179  1  2   11   50   228 0.124   9.10
 5.01 Init - 123799 123707   93  1  0   45   44   146 0.134   6.37
 5.00 Prom - 125922 125883   40                              -6.25

 6.03 PlyA - 126035 126030    6                               1.05
 6.02 Term - 126852 126783   70  2  1   71   39   134 0.082   3.23
 6.01 Init - 141489 141359  131  0  2   47   58   171 0.342   9.67
 6.00 Prom - 155741 155702   40                              -4.25

 7.00 Prom + 161727 161766   40                              -7.85
 7.01 Init + 166773 166923  151  2  1   74   74   112 0.926   8.65
 7.02 Intr + 167815 167920  106  2  1   35   80    79 0.667   0.15
 7.03 Intr + 169286 169368   83  2  2   76   30    88 0.141   0.16
 7.04 Term + 185163 185374  212  1  2   60   42   129 0.781   1.97
 7.05 PlyA + 185512 185517    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:88043859_88245274|GENSCAN_predicted_peptide_1|348_aa
PSSARVPGAQHRSSAPATIEASRFVTGFGAGRLRAGCRSSLRRTCPQEPRRRLRLGRPSG
PRTMSPRGTGCSAGLLMTVGWLLLAGLQSARGTNVTAAVQDAGLAHEGEGEEETENNDSE
TAENYAPPETEDVSNRNVVKEVEFGMCTVTCGIGVREVILTNGCPGGESKCVVRVEECRG
PTDCGWGKPISESLESVRLACIHTSPLNRFKYMWKLLRQDQQSIILVNDSAILEVRKESH
PLAFECDTLDNNEIVATIKFTVYTSSVVVLSTLKLGYCNLLIHLYDNIYNTYMSSTPWAA
VKAFWGAKASTPEVQSEQSSVRYKDSTSLDQLPTEMPGEDDALSEWNE

>gi568815592r:88043859_88245274|GENSCAN_predicted_CDS_1|1047_bp
cctagctccgcgcgagttccaggcgctcagcaccggagcagcgccccggcaaccattgag
gcgtcacggttcgtcacgggattcggagccgggcggctacgggcggggtgtcgcagctct
cttcgacgtacctgtcctcaggagccgcggcggcgactgcgcctcggacggccgtcgggg
ccgagaaccatgagccccaggggcacgggctgctccgccgggctgctgatgactgtcggc
tggctgcttctggcgggcctccagtccgcgcgcgggaccaacgtcaccgctgccgtccag
gatgccggcctggcccacgaaggcgagggcgaggaggagaccgaaaacaacgacagcgag
accgcggagaactacgctccgcctgaaaccgaggatgtttcaaataggaatgtcgtcaaa
gaagtagaattcggaatgtgcaccgttacatgtggtattggtgttagagaagttatatta
acaaatggatgccctggtggtgaatccaagtgtgttgtacgggtagaagaatgccgtgga
ccaacagattgtggctggggtaaaccaatttcagaaagtcttgaaagtgttagattggca
tgtattcacacatctcccttaaatcgtttcaaatatatgtggaaacttctaagacaagac
caacaatccattatacttgtaaatgattcagcaatcctagaagtacgcaaggaaagtcac
cccttggctttcgagtgtgacacactggataataatgaaatagtagcaactattaaattc
acagtctatacgagcagtgttgtggttctctcaactctaaaacttggttattgtaaccta
ctaattcatttgtatgacaatatttataatacctacatgagtagtacaccctgggcagca
gtcaaggctttctggggggcaaaagcctctacacctgaggtacaatccgagcagagttct
gtgagatacaaagattcaacttctcttgaccaattaccaacagaaatgcctggtgaagat
gatgctttaagtgaatggaatgaatga

>gi568815592r:88043859_88245274|GENSCAN_predicted_peptide_2|101_aa
MKTACLSFDEKPSSCLGKGIFQNTGGIRVVVGRDSDTQAEERWKRHLSGNHPYRFHQKLQ
WKQSPFPAAALKEAPPRIRNRQTRRLRASSGDKNWKRKIEG

>gi568815592r:88043859_88245274|GENSCAN_predicted_CDS_2|306_bp
atgaagacggcttgcttgtcctttgatgaaaaacccagcagctgtctgggaaaggggatc
ttccagaacacaggggggattcgggtggtggtagggagagactctgacacacaggcagag
gaaagatggaaacggcatctttcagggaaccatccttatcgcttccatcagaagctgcag
tggaaacaatcaccttttccggcagcagctctgaaggaggcaccccccagaatcagaaac
aggcagactaggcgtttgagagcatcctctggagacaagaactggaaaaggaaaatagag
ggatga

>gi568815592r:88043859_88245274|GENSCAN_predicted_peptide_3|231_aa
MSGVQKKDAGRWESLQEQVDTKGSAPPCGPALGCSLVKTRNINVSLLQSNAQDENVKEEA
MKNEAEIGKSGHTPSNKCETFGGYFPRIFCNRTLEPLALARSHPVERAFIFNTSLLLLLH
SFLALFMHFIQFFFQNTKNLDTLHKQQYEVIAAQNKQTDTKKGRILKVRVDVNTREVLVQ
HPGRIRSHTDLKDECRCFIDRWRWFSVGWMGSWKKEVAGRRSSPGVWPSSS

>gi568815592r:88043859_88245274|GENSCAN_predicted_CDS_3|696_bp
atgagtggggtgcagaaaaaagatgcaggcagatgggagagtctgcaggagcaagtggac
acaaagggatctgcacctccttgtgggccagccttggggtgttctctagtcaaaacaaga
aacataaatgtatccctgctgcaatctaatgctcaagatgaaaatgtaaaggaggaggct
atgaaaaatgaagctgagattgggaagtcagggcatacaccaagtaacaaatgtgaaacc
ttcggaggatacttccccagaatattctgtaaccggacccttgagccgcttgctctggcc
cgttctcaccctgtggagcgtgctttcattttcaatacatctctgctcttgttgcttcat
tctttccttgccttgtttatgcattttatccaattcttttttcaaaacactaagaaccta
gacacgctccacaagcaacaatatgaagttattgcagcccaaaataagcaaacagacaca
aagaagggaagaatacttaaggtccgtgttgatgttaataccagagaggtccttgtccag
catccaggaagaatcaggtcacacacagacttgaaggatgaatgccggtgttttattgac
cggtggaggtggttctcagtgggatggatgggaagctggaagaaggaagtagcgggaaga
cgatcttcccctggagtttggccatccagcagctaa

>gi568815592r:88043859_88245274|GENSCAN_predicted_peptide_4|472_aa
MKSILDGLADTTFRTITTDLLYVGSNDIQYEDIKGDMASKLGYFPQKFPLTSFRGSPFQE
KMTAGDNPQLVPADQVNITEFYNKSLSSFKENEENIQCGENFMDIECFMVLNPSQQLAIA
VLSLTLGTFTVLENLLVLCVILHSRSLRCRPSYHFIGSLAVADLLGSVIFVYSFIDFHVF
HRKDSRNVFLFKLGGVTASFTASVGSLFLTAIDRYISIHRPLAYKRIVTRPKAVVAFCLM
WTIAIVIAVLPLLGWNCEKLQSVCSDIFPHIDETYLMFWIGVTSVLLLFIVYAYMYILWK
AHSHAVRMIQRGTQKSIIIHTSEDGKVQVTRPDQARMDIRLAKTLVLILVVLIICWGPLL
AIMVYDVFGKMNKLIKTVFAFCSMLCLLNSTVNPIIYALRSKDLRHAFRSMFPSCEGTAQ
PLDNSMGDSDCLHKHANNAASVHRAAESCIKSTVKIAKVTMSVSTDTSAEAL

>gi568815592r:88043859_88245274|GENSCAN_predicted_CDS_4|1419_bp
atgaagtcgatcctagatggccttgcagataccaccttccgcaccatcaccactgacctc
ctgtacgtgggctcaaatgacattcagtacgaagacatcaaaggtgacatggcatccaaa
ttagggtacttcccacagaaattccctttaacttcctttaggggaagtcccttccaagag
aagatgactgcgggagacaacccccagctagtcccagcagaccaggtgaacattacagaa
ttttacaacaagtctctctcgtccttcaaggagaatgaggagaacatccagtgtggggag
aacttcatggacatagagtgtttcatggtcctgaaccccagccagcagctggccattgca
gtcctgtccctcacgctgggcaccttcacggtcctggagaacctcctggtgctgtgcgtc
atcctccactcccgcagcctccgctgcaggccttcctaccacttcatcggcagcctggcg
gtggcagacctcctggggagtgtcatttttgtctacagcttcattgacttccacgtgttc
caccgcaaagatagccgcaacgtgtttctgttcaaactgggtggggtcacggcctccttc
actgcctccgtgggcagcctgttcctcacagccatcgacaggtacatatccattcacagg
cccctggcctataagaggattgtcaccaggcccaaggccgtggtggcgttttgcctgatg
tggaccatagccattgtgatcgccgtgctgcctctcctgggctggaactgcgagaaactg
caatctgtttgctcagacattttcccacacattgatgaaacctacctgatgttctggatc
ggggtcaccagcgtactgcttctgttcatcgtgtatgcgtacatgtatattctctggaag
gctcacagccacgccgtccgcatgattcagcgtggcacccagaagagcatcatcatccac
acgtctgaggatgggaaggtacaggtgacccggccagaccaagcccgcatggacattagg
ttagccaagaccctggtcctgatcctggtggtgttgatcatctgctggggccctctgctt
gcaatcatggtgtatgatgtctttgggaagatgaacaagctcattaagacggtgtttgca
ttctgcagtatgctctgcctgctgaactccaccgtgaaccccatcatctatgctctgagg
agtaaggacctgcgacacgctttccggagcatgtttccctcttgtgaaggcactgcgcag
cctctggataacagcatgggggactcggactgcctgcacaaacacgcaaacaatgcagcc
agtgttcacagggccgcagaaagctgcatcaagagcacggtcaagattgccaaggtaacc
atgtctgtgtccacagacacgtctgccgaggctctgtga

>gi568815592r:88043859_88245274|GENSCAN_predicted_peptide_5|137_aa
MNAKAATRGRPGTEPSAQHGALVADVTWLRQVLAMRAARERVINGPEVVGGACTSPFLAW
SGCEGIREHSEVMAAGASGSAAAEEEEEEENCVASDQKGTGPEKSRKTGQKAGALTLVSS
DEILEVEVAECGKVGHI

>gi568815592r:88043859_88245274|GENSCAN_predicted_CDS_5|414_bp
atgaacgccaaggcggccacaagaggacggccgggaaccgaacccagcgcccagcacggg
gcccttgtggccgacgtgacgtggctccgtcaggtgttggcgatgcgcgctgcccgggag
cgtgtgattaatggccccgaggtcgtgggcggcgcatgcaccagccccttcttggcctgg
tcagggtgtgagggtatccgggagcacagcgaggtcatggcagcgggggcctccgggagc
gcggcggcggaggaggaggaggaagaggagaactgtgtggcaagtgatcaaaaaggaaca
ggaccagagaagagcaggaaaactggtcagaaagcaggcgccctaaccctggtgagcagt
gatgagatcctggaagtggaggtggcagaatgtgggaaggtgggccacatttaa

>gi568815592r:88043859_88245274|GENSCAN_predicted_peptide_6|66_aa
MLLSTAMELTLHKEQLPDAWTASRLQTQPDMDSECHTDNDTLLRWDVKDLDLMKEGEKVN
DNNKKP

>gi568815592r:88043859_88245274|GENSCAN_predicted_CDS_6|201_bp
atgctgctcagcactgcaatggaactaaccttacacaaagagcagctgccagatgcctgg
actgcaagccgcctgcaaacccaacctgacatggacagcgaatgccacactgataatgac
accttgctccgctgggatgttaaagatctggatttaatgaaagaaggtgagaaggtgaac
gacaacaacaaaaagccctaa

>gi568815592r:88043859_88245274|GENSCAN_predicted_peptide_7|183_aa
MRKICICSSYKGIVKIEIEIEIEKVHEKRGKLGDDTVFRELLPLFELPKTGFCGGILKKG
YSLSWTLSSDKARCAGNINSLRLASRKRNSEFEKLPYLIPDSVTPKEESAGRLSLEADRS
QQPQASSPPQCHIACSEPGLQACSSTVSPSASRLQVHPNTAQALAAPGFGTMPGTLTRIS
GQS

>gi568815592r:88043859_88245274|GENSCAN_predicted_CDS_7|552_bp
atgagaaaaatctgcatctgctcaagttacaaagggattgtgaagattgagattgagata
gaaatagagaaagtgcatgaaaaaaggggaaaacttggtgatgacactgttttcagagag
ctgttaccactctttgagcttcctaagacaggcttctgtgggggaatcctgaaaaaagga
tatagcttgtcatggactctttcttctgataaagctcgctgtgctggaaacattaatagc
ctacgacttgcttccaggaagagaaatagtgaatttgagaaattaccatacctcattcct
gactcagtgactccaaaagaggaatcagcaggaagactgtccctagaggcagacaggtct
cagcagccccaggcttccagtccaccccagtgccatattgcctgcagtgaacctgggctt
caggcttgctccagcactgtcagccccagtgcttccaggcttcaagtacaccccaacact
gcacaagccttagcagctccaggttttggaaccatgccaggcaccctgaccagaatctct
ggacaaagctga