GENSCAN 1.0 Date run: 5-Nov-116 Time: 01:31:04 Sequence gi568815575r:41575760_41776494 : 200735 bp : 39.36% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 1956 2062 107 1 2 89 48 136 0.800 7.29 1.02 PlyA + 2147 2152 6 -1.75 2.09 PlyA - 2338 2333 6 -0.45 2.08 Term - 2414 2400 15 2 0 108 47 5 0.512 -4.34 2.07 Intr - 2769 2581 189 0 0 52 115 167 0.442 14.66 2.06 Intr - 11228 11148 81 2 0 99 115 19 0.955 4.62 2.05 Intr - 13833 13756 78 0 0 84 107 54 0.975 5.73 2.04 Intr - 34266 34145 122 1 2 109 95 39 0.951 5.99 2.03 Intr - 50944 50845 100 1 1 107 91 73 0.924 8.26 2.02 Intr - 60902 60819 84 2 0 65 94 100 0.964 7.40 2.01 Init - 68452 67598 855 1 0 43 25 390 0.012 22.87 2.00 Prom - 71684 71645 40 -7.75 3.00 Prom + 71842 71881 40 -3.05 3.01 Init + 79682 80046 365 1 2 39 116 314 0.585 25.87 3.02 Term + 81103 81157 55 1 1 97 41 34 0.598 -4.35 3.03 PlyA + 81417 81422 6 1.05 4.04 PlyA - 84360 84355 6 1.05 4.03 Term - 84802 84626 177 1 0 110 48 140 0.867 8.90 4.02 Intr - 89693 89518 176 0 2 36 90 114 0.962 5.14 4.01 Init - 91362 91239 124 0 1 77 82 77 0.985 6.38 4.00 Prom - 97450 97411 40 -6.15 5.02 PlyA - 98116 98111 6 1.05 5.01 Sngl - 100735 99998 738 1 0 82 41 695 0.879 60.16 5.00 Prom - 116506 116467 40 -5.75 6.00 Prom + 116570 116609 40 -6.15 6.01 Sngl + 119896 121020 1125 0 0 75 51 141 0.527 5.50 6.02 PlyA + 121493 121498 6 1.05 7.00 Prom + 123835 123874 40 -4.15 7.01 Init + 131539 131602 64 0 1 65 94 43 0.132 3.99 7.02 Intr + 136119 136229 111 1 0 54 87 41 0.010 0.03 7.03 Intr + 139180 139342 163 2 1 83 116 138 0.069 14.21 7.04 Intr + 143538 143574 37 0 1 60 67 54 0.757 -2.15 7.05 Term + 143831 144079 249 1 0 74 49 245 0.926 13.92 7.06 PlyA + 144235 144240 6 1.05 8.05 PlyA - 144270 144265 6 1.05 8.04 Term - 146611 146462 150 1 0 38 47 134 0.008 1.23 8.03 Intr - 163697 163625 73 1 1 82 115 54 0.882 5.99 8.02 Intr - 169842 169765 78 2 0 69 91 108 0.172 6.95 8.01 Intr - 172979 172869 111 1 0 118 78 9 0.040 1.48 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 68452 67523 930 1 0 43 39 412 0.849 27.98 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:41575760_41776494|GENSCAN_predicted_peptide_1|35_aa XNIKEPFLDLLSLREFYENGGKVQHQFSSSSTPKH >gi568815575r:41575760_41776494|GENSCAN_predicted_CDS_1|108_bp nttaacatcaaggagcctttccttgatcttctcagtctacgagaattctatgagaatgga ggcaaggttcagcatcaattctcttccagctctactcccaagcactga >gi568815575r:41575760_41776494|GENSCAN_predicted_peptide_2|507_aa MNINAKILNKILVNRIQQHIKKLIHHDQVSFIPGMQGWFNICKSINIIHHINRTNNKNHM ITSIDAEKAFDKIQQRFMLKTLNKLGIDGTYLKIVRAIYDKPTANIILNGQKLEAFPLKT GTRQECPLSPLLFNIVLEVLARAIRQEKEIKGIRLGKEEVKLSLFADDMIVYLENPIISA QNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQIESQIMSELPFTIASKRIKYLGIQLTK DVKDLFKESYKPLLNEINEDTNKWKNIPCLWIGRINIMKMAILPKERDRYAYKIHLPETV EQLRKFNARRKLKGAVLAAVSSHKFNSFYGDPPEELPDFSEDPTSSGAVSQVLDSLEEIH ALTDCSEKDLDFLHSVFQDQHLHTLLDLYDKINTKSSPQIRNPPSDAVQRAKEVLEEISC YPENNDAKELKRILTQPHFMALLQTHDVVAHEVYSDEALRVTPPPTSPYLNGDSPESANG DMDMENVTRVRLVQFQKNTDEPMNCSA >gi568815575r:41575760_41776494|GENSCAN_predicted_CDS_2|1524_bp atgaacatcaatgcaaaaatcctcaataaaatactggtaaaccgaatacagcagcacatc aaaaagcttatccaccacgatcaagtcagcttcatccctgggatgcaaggctggttcaac atatgcaaatcaataaacataatccatcatataaacagaaccaacaacaaaaaccacatg attacctcaatagatgcagaaaaggcctttgacaaaattcaacagcgcttcatgctaaaa actctcaataaactaggtattgatggaacttatctcaaaatagtaagagctatttatgac aaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaact ggcacaagacaggaatgccctctctcaccactcctattcaacatagtgttggaagttctg gccagggcaatcaggcaagagaaagaaataaaaggtattcgattaggaaaagaggaagtc aaattgtccctgtttgcagatgacatgattgtatatttagaaaaccccatcatctcagcc caaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtg caaaaatcacaagcattcctatacaccaataacagacaaatagagagccaaatcatgagt gaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaaag gatgtgaaggacctcttcaaggagagctacaaaccactgctcaatgaaataaatgaggac acaaacaaatggaagaatattccatgcttatggataggaagaatcaatatcatgaaaatg gccatactgcccaaagagcgggatcgttacgcctacaagattcatcttccagaaacagta gagcagctgaggaaattcaatgcaaggaggaaactaaagggtgcagtactagccgctgtg tcaagtcacaaattcaactcattctatggggatccccctgaagagttaccagatttctcc gaagaccctacctcctcaggagcagtctcacaggtgctggacagcctggaagagattcat gcgcttacagactgcagtgaaaaggacctagattttctacacagtgttttccaggatcag catcttcacacactactagatctgtatgacaaaattaacacaaagtcttcaccacaaatc aggaatcctccaagcgatgcagtacagagagccaaagaggtattggaagaaatttcatgt taccctgagaataacgacgcaaaggaactaaagcgtattttaacacaacctcatttcatg gccttacttcagactcacgacgtagtggcacatgaagtttacagtgatgaagcattgagg gtcacacctcctcccacctctccctatttaaacggcgattctccagaaagtgctaacgga gacatggatatggagaatgtgaccagagttcggctggtacagtttcaaaagaacacagat gaaccaatgaattgttctgcctga >gi568815575r:41575760_41776494|GENSCAN_predicted_peptide_3|139_aa MTEGHKMILTPEIPMLSHMMSEKHSNGDGSAQNSSIIKWKWFMQEHPLREVQGGNILKQG ASFPLGLTLELCEELWDSTDTWTGPREQLSTDQRRAAWCVDDSSKVNRQHPVWKDATLDQ RSGSRTRAITTSVRTRDDS >gi568815575r:41575760_41776494|GENSCAN_predicted_CDS_3|420_bp atgactgaaggacataaaatgatcttgacacctgaaatacccatgctttctcacatgatg tcagagaaacattcaaatggggatggcagcgcccagaatagttccataataaaatggaaa tggtttatgcaggaacatcctctccgggaagtgcaaggaggaaacatcctcaagcaggga gcctcttttcccctaggactgactctggaactgtgtgaggaactgtgggattctactgac acttggacagggccccgtgaacagctctcaactgaccaacgaagagctgcttggtgtgtg gatgacagttccaaggtgaatcgacagcatcctgtttggaaggatgccactcttgatcaa agaagtggttctaggaccagagccataactaccagtgttagaaccagggatgattcctaa >gi568815575r:41575760_41776494|GENSCAN_predicted_peptide_4|158_aa MAGTGHGRSDYSQVKAPILSGLAQEVVGVKLSFKASSKLSAGRVGTPHFMAPEVVKREPY GKPVDVWGCGVILFILLSGCLPFYGTKERLFEGIIKGKYKMNPRQWSHISESAKDLVRRM LMLDPAERITVYEALNHPWLKVHEHSQVTFYTLSEHFQ >gi568815575r:41575760_41776494|GENSCAN_predicted_CDS_4|477_bp atggctgggactggccatggtcgctctgattattcccaagttaaggcccccattttgagt ggcttggcccaggaagtggtaggagtcaagttgagtttcaaagcatcatccaaactgtca gcaggacgtgttggaacacctcattttatggcaccagaagtggtcaaaagagagccttac ggaaagcctgtagacgtctgggggtgcggtgtgatcctttttatcctgctcagtggttgt ttgcctttttacggaaccaaggaaagattgtttgaaggcattattaaaggaaaatataag atgaatccaaggcagtggagccatatctctgaaagtgccaaagacctagtacgtcgcatg ctgatgctggatccagctgaaaggatcactgtttatgaagcactgaatcacccatggctt aaggtacatgagcattcacaggtcaccttctacactttgtctgaacacttccagtga >gi568815575r:41575760_41776494|GENSCAN_predicted_peptide_5|245_aa MDKNELVQKAKLAEQAERYDDMAACMKSVTEQGAELSNEERNLLSVAYKNVLGARRSSWR VVSSIEQKTEGAEKKQQMAREYREKVDTELRDICNDVLFLLEKFLIPSASQAESKVFSLK MKGDYYHYLAEVATGDDKKGIVDQSQQAYQEAFEISKKEMQPTHPIRPGLALNFSVFYYE ILNSPEKACSLAKTAFDEAIAELDTLIEESYKDSMLIMQLLRDNLTLWTSDTQGDEAEAG EGGEN >gi568815575r:41575760_41776494|GENSCAN_predicted_CDS_5|738_bp atggataaaaatgagctggttcagaaggccaaactggccgagcaggctgagcgatatgat gacatggcagcctgcatgaagtctgtaactgagcaaggagctgaattatccaatgaggag aggaatcttctctcagttgcttataaaaatgttttaggagcccgtaggtcatcttggagg gtcgtctcaagtattgaacaaaagacggaaggtgctgagaaaaaacagcagatggctcga gaatacagagagaaagttgatacggagctaagagatatctgcaatgatgtactgtttctt ttggaaaagttcttgatccccagtgcttcacaagcagagagcaaagtcttctctttgaaa atgaaaggagattactaccattacttggcagaggttgccactggtgatgacaagaaaggg attgtggatcagtcacaacaagcataccaagaagcttttgaaatcagcaaaaaggaaatg caaccaacacatcctatcagaccaggtctggcccttaacttctctgtgttctattatgag attctgaactccccagagaaagcctgctctcttgcaaagacagcttttgatgaagccatt gctgaacttgatacattaattgaagagtcatacaaagacagcatgctaataatgcaatta ctgagagacaacttgacattgtggacatcagatacccaaggagacgaagctgaagcagga gaaggaggggaaaattaa >gi568815575r:41575760_41776494|GENSCAN_predicted_peptide_6|374_aa MTTTSVSSWPYSSHRMRFITNHSDQPPQNFSATPNVTTCPMDEKLLSTVLTTSYSVIFIV GLVGNIIALYVFLGIHRKRNSIQIYLLNVAIADLLLIFCLPFRIMYHINQNKWTLGVILC KVVGTLFYMNMYISIILLGFISLDRYIKINRSIQQRKAITTKQSIYVCCIVWMLALGGFL TMIILTLKKGGHNSTMCFHYRDKHNAKGEAIFNFILVVMFWLIFLLIILSYIKIGKNLLR ISKRRSKFPNSGKYATTARNSFIVLIIFTICFVPYHAFRFIYISSQLNVSSCYWKEIVHK TNEIMLVLSSFNSCLDPVMYFLMSSNIRKIMCQLLFRRFQGEPSRSESTSEFKPGYSLHD TSVAVKIQSSSKST >gi568815575r:41575760_41776494|GENSCAN_predicted_CDS_6|1125_bp atgacgacaacttcagtcagcagctggccttactcctcccacagaatgcgctttataacc aatcatagcgaccaaccgccacaaaacttctcagcaacaccaaatgttactacctgtccc atggatgaaaaattgctatctactgtgttaaccacatcctactctgttattttcatcgtg ggactggttgggaacataatcgccctctatgtatttctgggtattcaccgtaaaagaaat tccattcaaatttatctacttaacgtagccattgcagacctcctactcatcttctgcctc cctttccgaataatgtatcatattaaccaaaacaagtggacactaggtgtgattctgtgc aaggttgtgggaacactgttttatatgaacatgtacattagcattattttgcttggattc atcagtttggatcgctatataaaaattaatcggtctatacagcaacggaaggcaataaca accaaacaaagtatttatgtctgttgtatagtatggatgcttgctcttggtggattccta actatgattattttaacacttaagaaaggagggcataattccacaatgtgtttccattac agagataagcataacgcaaaaggagaagccatttttaacttcattcttgtggtaatgttc tggctaattttcttactaataatcctttcatatattaagattgggaagaatctattgagg atttctaaaaggaggtcaaaatttcctaattctggtaaatatgccactacagctcgtaac tcctttattgtacttatcatttttactatatgttttgttccctatcatgcctttcgattc atctacatttcttcacagctaaatgtatcatcttgctactggaaagaaattgttcacaaa accaatgagatcatgctggttctctcatctttcaatagttgcttagatccagtcatgtat ttcctgatgtccagtaacattcgcaaaataatgtgccaacttctttttagacgatttcaa ggtgaaccaagtaggagtgaaagcacttcagaatttaaaccaggatactccctgcatgat acatctgtggcagtgaaaatacagtctagttctaaaagtacttga >gi568815575r:41575760_41776494|GENSCAN_predicted_peptide_7|207_aa MKDHMERGAQPAQLFQVMLQTPVLLLNKILPYPSSPFKLSAYPHSSWIRTGAWELLNAGL TLELCEELRDSTETWTGPRKQLSTDQQRVAWCVNDSSKVNNLVWKDATLDQRSLEQQLCC NRLLKKGYTRQQFCHKSTLNKGDRVIYKLTCPPYCCVRFPLAGTGPRILYLSRLASNLEL LKKGKGRGEQKKEEVTCGMLRKVKTCK >gi568815575r:41575760_41776494|GENSCAN_predicted_CDS_7|624_bp atgaaagatcacatggagagaggggcacagcctgctcaactcttccaggtgatgctccag acacctgttctgctgctcaataaaattcttccctacccatcctcacccttcaaattgtca gcgtatcctcattcttcgtggataaggacaggagcttgggaactgctgaacgcaggactg actctggaactgtgtgaggaactgcgggattctactgagacttggacagggccccgtaaa cagctctcaactgaccaacaaagagttgcttggtgtgtaaatgacagttccaaggtgaac aatcttgtttggaaggatgccactcttgatcaaagaagtttggagcagcagctgtgctgc aatcggttactgaagaaagggtacactcgccagcagttttgccacaagagtacactgaac aaaggagacagggtcatttataagctgacgtgtccaccctactgctgtgtccggtttcca ttggctggaacgggacctcgcattctgtatctgtcccgattggctagcaacttagaactt cttaaaaaaggcaaaggcagaggagaacaaaagaaggaggaagtaacttgtggaatgctg agaaaagtaaaaacctgcaaataa >gi568815575r:41575760_41776494|GENSCAN_predicted_peptide_8|137_aa XLRLFPHPFWVPPESPLALRTSPKFFLETPESIEAFYFMDGADLCFEIVKRADAGFVYSE AVASHYMRQILEALRYCHDNNIIHRDVKPLKSTVSLLQYSIGHTEPALHQCGTGKYQEVM ITGSHLGNWLSQMSINR >gi568815575r:41575760_41776494|GENSCAN_predicted_CDS_8|414_bp nngctccgtcttttccctcatcccttctgggtccctccagaaagtcctctagctctcaga acatctcccaaattctttttggaaacaccagagtccattgaggcattttattttatggat ggagcagatctgtgttttgaaatcgtaaagcgagctgacgctggttttgtgtacagtgaa gctgtagccagccattatatgagacagatactggaagctctacgctactgccatgataat aacataattcacagggatgtgaagcctctgaagtcaacagtgtcacttctgcagtattct attggtcacacagagccagccctgcatcagtgtgggacgggtaaatatcaggaggtgatg atcactgggagccatcttggaaactggctatcacagatgtcaatcaaccgttga