GENSCAN 1.0    Date run: 4-Nov-116    Time: 15:50:53

Sequence gi568815575f:41595634_41796776 : 201143 bp : 39.25% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 PlyA -    742    737    6                               1.05
 1.05 Term -  12137  12021  117  2  0   51   55    58 0.592  -3.64
 1.04 Intr -  14392  14271  122  2  2  109   95    39 0.835   5.99
 1.03 Intr -  31070  30971  100  2  1  107   91    73 0.924   8.26
 1.02 Intr -  41028  40945   84  0  0   65   94   100 0.964   7.40
 1.01 Init -  48578  47724  855  2  0   43   25   390 0.012  22.87
 1.00 Prom -  51810  51771   40                              -7.75

 2.00 Prom +  51968  52007   40                              -3.05
 2.01 Init +  59808  60172  365  2  2   39  116   314 0.585  25.87
 2.02 Term +  61229  61283   55  2  1   97   41    34 0.598  -4.35
 2.03 PlyA +  61543  61548    6                               1.05

 3.04 PlyA -  64486  64481    6                               1.05
 3.03 Term -  64928  64752  177  2  0  110   48   140 0.867   8.90
 3.02 Intr -  69819  69644  176  1  2   36   90   114 0.962   5.14
 3.01 Init -  71488  71365  124  1  1   77   82    77 0.985   6.38
 3.00 Prom -  77576  77537   40                              -6.15

 4.02 PlyA -  78242  78237    6                               1.05
 4.01 Sngl -  80861  80124  738  2  0   82   41   695 0.879  60.16
 4.00 Prom -  96632  96593   40                              -5.75

 5.00 Prom +  96696  96735   40                              -6.15
 5.01 Sngl + 100022 101146 1125  1  0   75   51   141 0.527   5.50
 5.02 PlyA + 101619 101624    6                               1.05

 6.00 Prom + 103961 104000   40                              -4.15
 6.01 Init + 111665 111728   64  1  1   65   94    43 0.132   3.99
 6.02 Intr + 116245 116355  111  2  0   54   87    41 0.010   0.03
 6.03 Intr + 119306 119468  163  0  1   83  116   138 0.069  14.21
 6.04 Intr + 123664 123700   37  1  1   60   67    54 0.757  -2.15
 6.05 Term + 123957 124205  249  2  0   74   49   245 0.926  13.92
 6.06 PlyA + 124361 124366    6                               1.05

 7.07 PlyA - 124396 124391    6                               1.05
 7.06 Term - 126737 126588  150  2  0   38   47   134 0.008   1.23
 7.05 Intr - 143823 143751   73  2  1   82  115    54 0.881   5.99
 7.04 Intr - 149968 149891   78  0  0   69   91   108 0.169   6.95
 7.03 Intr - 153105 152995  111  2  0  118   78     9 0.038   1.48
 7.02 Intr - 191650 191545  106  2  1   96   68    75 0.474   4.65
 7.01 Intr - 193798 193697  102  2  0  103   81     9 0.495   0.93

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  48578  47649  930  2  0   43   39   412 0.849  27.98

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575f:41595634_41796776|GENSCAN_predicted_peptide_1|425_aa
MNINAKILNKILVNRIQQHIKKLIHHDQVSFIPGMQGWFNICKSINIIHHINRTNNKNHM
ITSIDAEKAFDKIQQRFMLKTLNKLGIDGTYLKIVRAIYDKPTANIILNGQKLEAFPLKT
GTRQECPLSPLLFNIVLEVLARAIRQEKEIKGIRLGKEEVKLSLFADDMIVYLENPIISA
QNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQIESQIMSELPFTIASKRIKYLGIQLTK
DVKDLFKESYKPLLNEINEDTNKWKNIPCLWIGRINIMKMAILPKERDRYAYKIHLPETV
EQLRKFNARRKLKGAVLAAVSSHKFNSFYGDPPEELPDFSEDPTSSGAVSQVLDSLEEIH
ALTDCSEKDLDFLHSVFQDQHLHTLLDPANERRENMENLAGSSLEMGLSHILWQLLVTCS
PGCES

>gi568815575f:41595634_41796776|GENSCAN_predicted_CDS_1|1278_bp
atgaacatcaatgcaaaaatcctcaataaaatactggtaaaccgaatacagcagcacatc
aaaaagcttatccaccacgatcaagtcagcttcatccctgggatgcaaggctggttcaac
atatgcaaatcaataaacataatccatcatataaacagaaccaacaacaaaaaccacatg
attacctcaatagatgcagaaaaggcctttgacaaaattcaacagcgcttcatgctaaaa
actctcaataaactaggtattgatggaacttatctcaaaatagtaagagctatttatgac
aaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaact
ggcacaagacaggaatgccctctctcaccactcctattcaacatagtgttggaagttctg
gccagggcaatcaggcaagagaaagaaataaaaggtattcgattaggaaaagaggaagtc
aaattgtccctgtttgcagatgacatgattgtatatttagaaaaccccatcatctcagcc
caaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtg
caaaaatcacaagcattcctatacaccaataacagacaaatagagagccaaatcatgagt
gaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaaag
gatgtgaaggacctcttcaaggagagctacaaaccactgctcaatgaaataaatgaggac
acaaacaaatggaagaatattccatgcttatggataggaagaatcaatatcatgaaaatg
gccatactgcccaaagagcgggatcgttacgcctacaagattcatcttccagaaacagta
gagcagctgaggaaattcaatgcaaggaggaaactaaagggtgcagtactagccgctgtg
tcaagtcacaaattcaactcattctatggggatccccctgaagagttaccagatttctcc
gaagaccctacctcctcaggagcagtctcacaggtgctggacagcctggaagagattcat
gcgcttacagactgcagtgaaaaggacctagattttctacacagtgttttccaggatcag
catcttcacacactactagatccagcaaatgagagaagagagaacatggagaatcttgca
gggtcaagcttggaaatgggtttatctcacattctttggcaactgctagtcacgtgctca
cctggatgtgagagctga

>gi568815575f:41595634_41796776|GENSCAN_predicted_peptide_2|139_aa
MTEGHKMILTPEIPMLSHMMSEKHSNGDGSAQNSSIIKWKWFMQEHPLREVQGGNILKQG
ASFPLGLTLELCEELWDSTDTWTGPREQLSTDQRRAAWCVDDSSKVNRQHPVWKDATLDQ
RSGSRTRAITTSVRTRDDS

>gi568815575f:41595634_41796776|GENSCAN_predicted_CDS_2|420_bp
atgactgaaggacataaaatgatcttgacacctgaaatacccatgctttctcacatgatg
tcagagaaacattcaaatggggatggcagcgcccagaatagttccataataaaatggaaa
tggtttatgcaggaacatcctctccgggaagtgcaaggaggaaacatcctcaagcaggga
gcctcttttcccctaggactgactctggaactgtgtgaggaactgtgggattctactgac
acttggacagggccccgtgaacagctctcaactgaccaacgaagagctgcttggtgtgtg
gatgacagttccaaggtgaatcgacagcatcctgtttggaaggatgccactcttgatcaa
agaagtggttctaggaccagagccataactaccagtgttagaaccagggatgattcctaa

>gi568815575f:41595634_41796776|GENSCAN_predicted_peptide_3|158_aa
MAGTGHGRSDYSQVKAPILSGLAQEVVGVKLSFKASSKLSAGRVGTPHFMAPEVVKREPY
GKPVDVWGCGVILFILLSGCLPFYGTKERLFEGIIKGKYKMNPRQWSHISESAKDLVRRM
LMLDPAERITVYEALNHPWLKVHEHSQVTFYTLSEHFQ

>gi568815575f:41595634_41796776|GENSCAN_predicted_CDS_3|477_bp
atggctgggactggccatggtcgctctgattattcccaagttaaggcccccattttgagt
ggcttggcccaggaagtggtaggagtcaagttgagtttcaaagcatcatccaaactgtca
gcaggacgtgttggaacacctcattttatggcaccagaagtggtcaaaagagagccttac
ggaaagcctgtagacgtctgggggtgcggtgtgatcctttttatcctgctcagtggttgt
ttgcctttttacggaaccaaggaaagattgtttgaaggcattattaaaggaaaatataag
atgaatccaaggcagtggagccatatctctgaaagtgccaaagacctagtacgtcgcatg
ctgatgctggatccagctgaaaggatcactgtttatgaagcactgaatcacccatggctt
aaggtacatgagcattcacaggtcaccttctacactttgtctgaacacttccagtga

>gi568815575f:41595634_41796776|GENSCAN_predicted_peptide_4|245_aa
MDKNELVQKAKLAEQAERYDDMAACMKSVTEQGAELSNEERNLLSVAYKNVLGARRSSWR
VVSSIEQKTEGAEKKQQMAREYREKVDTELRDICNDVLFLLEKFLIPSASQAESKVFSLK
MKGDYYHYLAEVATGDDKKGIVDQSQQAYQEAFEISKKEMQPTHPIRPGLALNFSVFYYE
ILNSPEKACSLAKTAFDEAIAELDTLIEESYKDSMLIMQLLRDNLTLWTSDTQGDEAEAG
EGGEN

>gi568815575f:41595634_41796776|GENSCAN_predicted_CDS_4|738_bp
atggataaaaatgagctggttcagaaggccaaactggccgagcaggctgagcgatatgat
gacatggcagcctgcatgaagtctgtaactgagcaaggagctgaattatccaatgaggag
aggaatcttctctcagttgcttataaaaatgttttaggagcccgtaggtcatcttggagg
gtcgtctcaagtattgaacaaaagacggaaggtgctgagaaaaaacagcagatggctcga
gaatacagagagaaagttgatacggagctaagagatatctgcaatgatgtactgtttctt
ttggaaaagttcttgatccccagtgcttcacaagcagagagcaaagtcttctctttgaaa
atgaaaggagattactaccattacttggcagaggttgccactggtgatgacaagaaaggg
attgtggatcagtcacaacaagcataccaagaagcttttgaaatcagcaaaaaggaaatg
caaccaacacatcctatcagaccaggtctggcccttaacttctctgtgttctattatgag
attctgaactccccagagaaagcctgctctcttgcaaagacagcttttgatgaagccatt
gctgaacttgatacattaattgaagagtcatacaaagacagcatgctaataatgcaatta
ctgagagacaacttgacattgtggacatcagatacccaaggagacgaagctgaagcagga
gaaggaggggaaaattaa

>gi568815575f:41595634_41796776|GENSCAN_predicted_peptide_5|374_aa
MTTTSVSSWPYSSHRMRFITNHSDQPPQNFSATPNVTTCPMDEKLLSTVLTTSYSVIFIV
GLVGNIIALYVFLGIHRKRNSIQIYLLNVAIADLLLIFCLPFRIMYHINQNKWTLGVILC
KVVGTLFYMNMYISIILLGFISLDRYIKINRSIQQRKAITTKQSIYVCCIVWMLALGGFL
TMIILTLKKGGHNSTMCFHYRDKHNAKGEAIFNFILVVMFWLIFLLIILSYIKIGKNLLR
ISKRRSKFPNSGKYATTARNSFIVLIIFTICFVPYHAFRFIYISSQLNVSSCYWKEIVHK
TNEIMLVLSSFNSCLDPVMYFLMSSNIRKIMCQLLFRRFQGEPSRSESTSEFKPGYSLHD
TSVAVKIQSSSKST

>gi568815575f:41595634_41796776|GENSCAN_predicted_CDS_5|1125_bp
atgacgacaacttcagtcagcagctggccttactcctcccacagaatgcgctttataacc
aatcatagcgaccaaccgccacaaaacttctcagcaacaccaaatgttactacctgtccc
atggatgaaaaattgctatctactgtgttaaccacatcctactctgttattttcatcgtg
ggactggttgggaacataatcgccctctatgtatttctgggtattcaccgtaaaagaaat
tccattcaaatttatctacttaacgtagccattgcagacctcctactcatcttctgcctc
cctttccgaataatgtatcatattaaccaaaacaagtggacactaggtgtgattctgtgc
aaggttgtgggaacactgttttatatgaacatgtacattagcattattttgcttggattc
atcagtttggatcgctatataaaaattaatcggtctatacagcaacggaaggcaataaca
accaaacaaagtatttatgtctgttgtatagtatggatgcttgctcttggtggattccta
actatgattattttaacacttaagaaaggagggcataattccacaatgtgtttccattac
agagataagcataacgcaaaaggagaagccatttttaacttcattcttgtggtaatgttc
tggctaattttcttactaataatcctttcatatattaagattgggaagaatctattgagg
atttctaaaaggaggtcaaaatttcctaattctggtaaatatgccactacagctcgtaac
tcctttattgtacttatcatttttactatatgttttgttccctatcatgcctttcgattc
atctacatttcttcacagctaaatgtatcatcttgctactggaaagaaattgttcacaaa
accaatgagatcatgctggttctctcatctttcaatagttgcttagatccagtcatgtat
ttcctgatgtccagtaacattcgcaaaataatgtgccaacttctttttagacgatttcaa
ggtgaaccaagtaggagtgaaagcacttcagaatttaaaccaggatactccctgcatgat
acatctgtggcagtgaaaatacagtctagttctaaaagtacttga

>gi568815575f:41595634_41796776|GENSCAN_predicted_peptide_6|207_aa
MKDHMERGAQPAQLFQVMLQTPVLLLNKILPYPSSPFKLSAYPHSSWIRTGAWELLNAGL
TLELCEELRDSTETWTGPRKQLSTDQQRVAWCVNDSSKVNNLVWKDATLDQRSLEQQLCC
NRLLKKGYTRQQFCHKSTLNKGDRVIYKLTCPPYCCVRFPLAGTGPRILYLSRLASNLEL
LKKGKGRGEQKKEEVTCGMLRKVKTCK

>gi568815575f:41595634_41796776|GENSCAN_predicted_CDS_6|624_bp
atgaaagatcacatggagagaggggcacagcctgctcaactcttccaggtgatgctccag
acacctgttctgctgctcaataaaattcttccctacccatcctcacccttcaaattgtca
gcgtatcctcattcttcgtggataaggacaggagcttgggaactgctgaacgcaggactg
actctggaactgtgtgaggaactgcgggattctactgagacttggacagggccccgtaaa
cagctctcaactgaccaacaaagagttgcttggtgtgtaaatgacagttccaaggtgaac
aatcttgtttggaaggatgccactcttgatcaaagaagtttggagcagcagctgtgctgc
aatcggttactgaagaaagggtacactcgccagcagttttgccacaagagtacactgaac
aaaggagacagggtcatttataagctgacgtgtccaccctactgctgtgtccggtttcca
ttggctggaacgggacctcgcattctgtatctgtcccgattggctagcaacttagaactt
cttaaaaaaggcaaaggcagaggagaacaaaagaaggaggaagtaacttgtggaatgctg
agaaaagtaaaaacctgcaaataa

>gi568815575f:41595634_41796776|GENSCAN_predicted_peptide_7|206_aa
XNTEPPALPCWAPNIQLLIRILTLPSPTDLVDSADLKREASICHMLKHPHIVELLETYSS
DGMLYMVFELLRLFPHPFWVPPESPLALRTSPKFFLETPESIEAFYFMDGADLCFEIVKR
ADAGFVYSEAVASHYMRQILEALRYCHDNNIIHRDVKPLKSTVSLLQYSIGHTEPALHQC
GTGKYQEVMITGSHLGNWLSQMSINR

>gi568815575f:41595634_41796776|GENSCAN_predicted_CDS_7|621_bp
ntaaacactgagcccccagccctgccatgctgggctccaaacatccagcttctcattaga
atcttaacactgccctctccaactgatttggtagactctgcagatctaaagcgggaagcc
agtatctgtcatatgctgaaacatccacacattgtagagttattggagacatatagctca
gatggaatgctttacatggttttcgaattgctccgtcttttccctcatcccttctgggtc
cctccagaaagtcctctagctctcagaacatctcccaaattctttttggaaacaccagag
tccattgaggcattttattttatggatggagcagatctgtgttttgaaatcgtaaagcga
gctgacgctggttttgtgtacagtgaagctgtagccagccattatatgagacagatactg
gaagctctacgctactgccatgataataacataattcacagggatgtgaagcctctgaag
tcaacagtgtcacttctgcagtattctattggtcacacagagccagccctgcatcagtgt
gggacgggtaaatatcaggaggtgatgatcactgggagccatcttggaaactggctatca
cagatgtcaatcaaccgttga