GENSCAN 1.0 Date run: 3-Nov-116 Time: 06:22:54 Sequence gi568815590f:12912791_13122041 : 209251 bp : 41.99% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 6853 6925 73 0 1 45 49 106 0.050 3.68 1.02 Term + 18745 19049 305 2 2 72 42 133 0.199 1.45 1.03 PlyA + 19176 19181 6 1.05 2.00 Prom + 31535 31574 40 -5.65 2.01 Init + 33631 33890 260 0 2 64 40 252 0.849 14.66 2.02 Term + 39327 39474 148 0 1 -12 36 247 0.262 5.89 2.03 PlyA + 41272 41277 6 1.05 3.03 PlyA - 42611 42606 6 1.05 3.02 Term - 43234 42765 470 2 2 44 44 272 0.648 12.55 3.01 Init - 47333 47207 127 2 1 78 60 94 0.662 5.97 3.00 Prom - 56130 56091 40 -7.25 4.00 Prom + 56560 56599 40 -3.95 4.01 Init + 63936 64124 189 2 0 69 42 114 0.661 3.97 4.02 Term + 64338 64598 261 2 0 66 47 213 0.738 9.54 4.03 PlyA + 65062 65067 6 -0.45 5.00 Prom + 65509 65548 40 -5.35 5.01 Init + 67664 67732 69 1 0 55 81 62 0.517 3.30 5.02 Intr + 68502 68621 120 2 0 72 68 110 0.379 7.17 5.03 Intr + 78044 78210 167 1 2 87 76 81 0.230 4.64 5.04 Intr + 93412 93566 155 1 2 90 59 272 0.303 23.29 5.05 Intr + 99894 100067 174 1 0 63 94 181 0.700 15.19 5.06 Term + 108503 109254 752 0 2 70 38 560 0.020 41.82 5.07 PlyA + 110213 110218 6 1.05 6.00 Prom + 116263 116302 40 -6.55 6.01 Init + 117209 117284 76 1 1 73 39 124 0.879 7.30 6.02 Intr + 119504 119574 71 0 2 52 78 70 0.410 0.38 6.03 Intr + 120913 120977 65 0 2 89 100 44 0.139 2.30 6.04 Term + 129641 129806 166 2 1 78 41 236 0.912 14.31 6.05 PlyA + 130254 130259 6 1.05 7.07 PlyA - 132675 132670 6 1.05 7.06 Term - 134884 134781 104 2 2 114 47 70 0.557 2.96 7.05 Intr - 137899 137767 133 1 1 87 70 66 0.397 4.00 7.04 Intr - 145653 145533 121 2 1 19 60 105 0.015 0.38 7.03 Intr - 154921 154747 175 2 1 62 -5 139 0.000 0.08 7.02 Intr - 156400 156349 52 1 1 37 28 139 0.013 0.36 7.01 Init - 161181 161110 72 0 0 95 90 28 0.762 4.82 7.00 Prom - 161801 161762 40 -3.95 8.14 PlyA - 161858 161853 6 1.05 8.13 Term - 173141 173021 121 1 1 109 43 74 0.921 1.97 8.12 Intr - 173673 173500 174 2 0 104 95 72 0.980 7.63 8.11 Intr - 175914 175697 218 0 2 78 105 213 0.999 18.38 8.10 Intr - 177680 177462 219 2 0 107 116 237 0.984 25.98 8.09 Intr - 178642 178528 115 0 1 96 110 60 0.999 8.53 8.08 Intr - 180035 179822 214 0 1 12 96 337 0.990 23.75 8.07 Intr - 182167 181969 199 1 1 78 82 372 0.998 33.60 8.06 Intr - 182455 182296 160 0 1 101 94 309 0.985 31.87 8.05 Intr - 185785 185609 177 0 0 135 99 91 0.999 13.01 8.04 Intr - 187980 186557 1424 0 2 100 110 1913 0.738 182.02 8.03 Intr - 190063 190000 64 0 1 97 72 49 0.410 2.00 8.02 Intr - 198033 197952 82 1 1 71 99 49 0.424 2.08 8.01 Intr - 202867 202796 72 2 0 116 121 62 0.941 10.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 128595 128659 65 2 2 85 46 78 0.830 3.97 S.002 Term + 154775 154913 139 2 1 112 49 176 0.959 12.55 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:12912791_13122041|GENSCAN_predicted_peptide_1|125_aa MSVSVMKDAEMSAIESLPRRSSQLAAGPGAKPLTARGRQCRPAAPSAGPAEPAPTRNSRW PSSAARSPSSLPRLSLHTSPQAGAGSGLGQPKVGPTQRSGGLKGSSSAARVDTEAEEAPR ASEGC >gi568815590f:12912791_13122041|GENSCAN_predicted_CDS_1|378_bp atgtccgtgtccgtcatgaaggatgccgagatgagtgcaatagagtctcttccccgaagg agctcacagttggctgctggcccgggtgctaagcccctcactgcccggggccggcagtgc cggccggccgctccgagtgcgggacctgccgagcccgcgcctacccggaactcgcgctgg ccctcgagcgccgcgcgcagccccagttccctcccgcgcctctccctccacacctccccg caagcaggagccggctccggcctgggccagcccaaagtggggcccacacagcgcagtggc gggctgaagggctcctcaagcgcggccagagtggacaccgaggccgaggaggcgccgaga gcgagtgagggctgctag >gi568815590f:12912791_13122041|GENSCAN_predicted_peptide_2|135_aa MGYEGQEEAESKNVPGDWEDGDCRRKKRAAALGARRAKDLRFECVDVTLVEMSSKMRAWD LQVYEKTNSEKQNFEKSTQRFSLQLESFCPWLRDSAVTTRNGRAAASRHTVRKAPATGAL TMVCIGERREASVAP >gi568815590f:12912791_13122041|GENSCAN_predicted_CDS_2|408_bp atgggctatgaagggcaagaagaagcagagtctaagaatgtgccaggtgactgggaggac ggagattgtagaagaaagaaaagggcagctgctttaggtgcacggagggcgaaggacctc aggttcgaatgtgttgatgtcaccctggtggagatgtcaagcaagatgagagcatgggat ctgcaggtttatgagaagaccaattcagagaaacagaattttgagaaatctacacagcgt ttttccttgcagttagaaagcttttgcccctggctgcgggacagcgctgtgactactcgc aacgggagagctgctgccagtcgccacaccgtgcggaaagcgccggcgaccggagcactg acaatggtctgcataggggagcggagagaagcttctgttgcgccctag >gi568815590f:12912791_13122041|GENSCAN_predicted_peptide_3|198_aa MDITFQDCKAPVEIVLLKTSNEELQFDVSTSIKGLSELRSAAPMFSHQLPRVERHFAAAS LINGTQPKRCTLDVSSSSGGSDSQPPGCGNDSSNPGGPSSLPPGSQRMRLRPGPLPYFQL QPSFQPSCEHLILIFNPFLLEIVRIVSIICNLTEKKEKVEMSTFDTAKWDVTPQKNHGSG KGCAYLTFAVTPLRTRSS >gi568815590f:12912791_13122041|GENSCAN_predicted_CDS_3|597_bp atggacattactttccaggactgtaaagctccagtggaaattgtgttacttaaaacttcc aatgaagagttgcaatttgatgtctccacttccataaagggattgtctgaattgagatca gctgcaccaatgttcagtcaccagcttcctagggtggagaggcattttgcagcagcctct ctgatcaatggaacacagcccaaaaggtgcacccttgatgttagcagttctagcggtggt tctgattcccagcccccaggctgtggcaatgacagcagcaaccctgggggcccatcaagc ttgccgcccgggagccaacgcatgaggctcaggccaggtcccctcccctactttcagttg caaccttcattccaaccttcctgtgagcatctaattctcatattcaaccccttcctgctc gaaatagttaggatagtttccattatctgcaacctaacagagaagaaagagaaggtagaa atgagcacattcgatacagcaaaatgggacgtaacaccccagaagaaccatggaagtggc aaaggttgtgcctatttgacattcgcagttacacctttaagaacaagaagttcatag >gi568815590f:12912791_13122041|GENSCAN_predicted_peptide_4|149_aa MAKHPGVSTDRITILSLKGKAERLDSRTWHDFKPRKELPSISSDLQAKDKAIVQPRPIWE GVGLLPSPGWKVTPDNQPAVRQMADSLEALHNAMGSGFITNAGRWSTQQCQSQDQPGEER NVLLCTQITVNLFTTPVEQALECQARKSP >gi568815590f:12912791_13122041|GENSCAN_predicted_CDS_4|450_bp atggcaaagcatcctggggttagcactgacaggattaccatccttagcctgaaagggaaa gcggaaagactggattccagaacttggcatgactttaaacccaggaaagagctgcccagc atcagctctgacctgcaggcaaaggacaaagccattgttcaacccagacctatctgggag ggagttgggttgctgccctctccaggttggaaggtgaccccagacaatcagcctgctgtc agacagatggctgattcccttgaagctttgcataatgccatgggctcagggtttattact aatgctggtcgatggagcactcagcagtgccaatcacaagatcagcctggagaggagagg aatgtgttactgtgcacacagataaccgtcaacctgttcacgaccccagtggagcaagcc ctagagtgtcaagcaaggaagtcaccatga >gi568815590f:12912791_13122041|GENSCAN_predicted_peptide_5|478_aa MSWCDYTISRDGEWAKLPAPVSMRSHWLATSNGAGRAQGVNWGWDIMSKDADQIVGGFRR HAQGLCFLQRLTQEVYHEKDRTISFHSYKFSFTLHIEKVMRSNCHSLEVETAVIHKDKRM DHEAAQLEKQHVHNVYESTAPYFSDLQSKAWPRVRQFLQEQKPGSLIADIGCGTGKYLKV NSQVHTVGCDYCGPLVEIARNRGCEAMVCDNLNLPFRDEGFDAIISIGEQCGSKRSHSVG YEPAMARTCFANISKEGEEEYGFYSTLGKSFRSWFFSRSLDESTLRKQIERVRPLKNTEV WASSTVTVQPSRHSSLDFDHQEPFSTKGQSLDEEVFVESSSGKHLEWLRAPGTLKHLNGD HQGEMRRNGGGNFLDSTNTGVNCVDAGNIEDDNPSASKILRRISAVDSTDFNPDDTMSVE DPQTDVLDSTAFMRYYHVFREGELCSLLKENVSELRILSSGNDHGNWCIIAEKKRGCD >gi568815590f:12912791_13122041|GENSCAN_predicted_CDS_5|1437_bp atgtcatggtgtgactacaccatctcacgggacggggagtgggcaaagctccctgctccc gtgtctatgagaagtcattggctggctacttcaaatggtgctggcagagcacagggtgtg aattgggggtgggacatcatgagcaaagatgcagatcagattgtggggggctttaggcga catgctcagggcctgtgcttccttcagagactcacacaagaggtttatcatgagaaggac cgcactatttcatttcactcctacaagttttcatttacgttacacattgagaaagttatg agaagcaactgtcactctctggaggtggagactgccgtgattcacaaagacaagaggatg gatcatgaagccgcccagctggagaagcagcatgtgcacaatgtgtacgagagcacagcc ccttacttcagcgacctgcagagcaaagcctggcctcgtgtccgccagttcctgcaagag cagaagccaggcagcctcatcgctgacataggttgtgggactggaaaatatcttaaagtg aacagccaggtacataccgtgggctgtgactactgtgggccactggtagagattgcccgg aatagaggatgtgaagccatggtatgtgacaaccttaatctcccctttagggatgagggc ttcgatgccatcatctccataggagagcagtgtggttcaaaacggtcccacagcgtgggc tatgaacctgctatggcaagaacctgttttgcaaatatttctaaggaaggcgaggaagaa tatggattttacagcacattaggaaaatcgtttcgttcctggtttttctccagatctttg gatgaatcgactctgaggaagcaaattgaaagagtaagacccttgaaaaacacagaagtt tgggccagtagcactgtaacagtccagccttccagacactctagtttagactttgatcac caagagccattttcaacaaaagggcaaagtttagatgaggaagtgtttgtggaatcttct tctggaaaacacttggagtggctgagagcaccaggcactctgaaacatttaaatggagac catcaaggggaaatgaggagaaatggagggggaaattttctggatagcactaatactggt gtgaattgtgtggatgcaggcaacatagaagatgataatccttctgctagtaaaatattg agaaggatttctgcagttgattccacagatttcaacccagatgatacaatgtctgtcgaa gatccacagactgatgttttggactccacagcctttatgcgctactaccatgtgtttcga gaaggggagctctgcagtctgctcaaggagaatgtgtcagagctccgtatcctgagttct gggaatgatcatggtaactggtgtatcattgcagagaaaaagagaggttgtgattga >gi568815590f:12912791_13122041|GENSCAN_predicted_peptide_6|125_aa MGQLDGSSPVMEEDTPSPRASDWPSEGNFSVMDKVTCTPHTGRTLLLQGASEPLNVFEPM RWKNSSGKERRFSSLSTVLSVHQAAKHQRILPPDGTRPTCGGSGGAISVGLEADEKREVS FPGSI >gi568815590f:12912791_13122041|GENSCAN_predicted_CDS_6|378_bp atggggcagcttgacggctcctcacctgtgatggaagaagatactccatctccccgagca tcagattggccctcagaagggaacttcagtgtgatggataaagtcacctgcacacctcac acaggaagaactctcctgcttcagggggctagcgagcctttgaacgtttttgagcccatg cgttggaaaaattcctccggaaaagagagaaggttttcttcattaagcacggtgttgtct gtccatcaggcagcaaagcatcagcgcatcctaccaccagatggcactagacccacttgt ggaggctctgggggcgccatcagcgtgggactagaagctgatgaaaaacgagaagtttca tttcccggttccatttaa >gi568815590f:12912791_13122041|GENSCAN_predicted_peptide_7|218_aa MRKEIHRASKSRKMPSFGTVEKDRHIGDDECRLDLFHEEAGGLLLDNSVFEGKHRLLPFS VPNTNIVPGIFPGLQNHMDQKTICFTIRRASEDKRMKRKHPAGAHPRTARGIVSGQSRSR ESPQARPGRSAPPEDCMTPEVKNVPVIEMQDQLLKFMRESVFRRQNVSKIHVKVEERRQN NNDEAFQGYCNSMVDRFSLESSVIPLMMYCYARPDTQT >gi568815590f:12912791_13122041|GENSCAN_predicted_CDS_7|657_bp atgaggaaagaaatacacagagcatcaaagtcacgaaagatgccttcctttggaacagta gagaaagacaggcatattggagacgatgaatgccgtttggatctcttccacgaagaagct ggaggtctgctcctagacaatagcgtctttgaaggcaaacatcgccttcttcccttctct gtcccgaacactaacatagtacctggcatatttccagggctccaaaatcatatggaccag aagacaatctgctttacaatcagacgtgcttctgaagacaagaggatgaagagaaagcac ccagcaggggcacatcccaggacagcaaggggcatcgtgagtgggcagtcacgtagccgc gagtctccacaagccaggccaggtcgttctgctccacctgaggactgcatgactcccgag gtgaaaaatgttcctgtgatagagatgcaggaccaactgttaaagtttatgagagaatct gtttttaggagacaaaatgtatccaagatacatgtaaaagttgaagagcgaagacaaaac aacaatgatgaagctttccagggctattgcaattctatggtagacagattctcactggaa tcttcagtcatcccacttatgatgtactgttatgcacggccggatacacaaacgtga >gi568815590f:12912791_13122041|GENSCAN_predicted_peptide_8|1079_aa XIEAKEACDWLRATGFPQYAQLYEDFLFPIDISLVKREHDFLDRDAIEALCRRLNTLNKC AVMKLEISPHRKRSDDSDEDEPCAISGKWTFQRDSKRWSRLEEFDVFSPKQDLVPGSPDD SHPKDGPSPGGTLMDLSERQEVSSVRSLSSTGSLPSHAPPSEDAATPRTNSVISVCSSSN LAGNDDSFGSLPSPKELSSFSFSMKGHEKTAKSKTRSLLKRMESLKLKSSHHSKHKAPSK LGLIISGPILQEGMDEEKLKQLNCVEISALNGNRINVPMVRKRSVSNSTQTSSSSSQSET SSAVSTPSPVTRTRSLSACNKRVGMYLEGFDPFNQSTFNNVVEQNFKNRESYPEDTVFYI PEDHKPGTFPKALTNGSFSPSGNNGSVNWRTGSFHGPGHISLRRENSSDSPKELKRRNSS SSMSSRLSIYDNVPGSILYSSSGDLADLENEDIFPELDDILYHVKGMQRIVNQWSEKFSD EGDSDSALDSVSPCPSSPKQIHLDVDNDRTTPSDLDSTGNSLNEPEEPSEIPERRDSGVG ASLTRSNRHRLRWHSFQSSHRPSLNSVSLQINCQSVAQMNLLQKYSLLKLTALLEKYTPS NKHGFSWAVPKFMKRIKVPDYKDRSVFGVPLTVNVQRTGQPLPQSIQQAMRYLRNHCLDQ VGLFRKSGVKSRIQALRQMNEGAIDCVNYEGQSAYDVADMLKQYFRDLPEPLMTNKLSET FLQIYQYVPKDQRLQAIKAAIMLLPDENREVLQTLLYFLSDVTAAVKENQMTPTNLAVCL APSLFHLNTLKRENSSPRVMQRKQSLGKPDQKDLNENLAATQGLAHMIAECKKLFQVPEE MSRCRNSYTEQELKPLTLEALGHLGNDDSADYQHFLQDCVDGLFKEVKEKFKGWVSYSTS EQAELSYKKVSEGPPLRLWRSVIEVPAVPEEILKRLLKEQHLWDVDLLDSKVIEILDSQT EIYQYVQNSMAPHPARDYVVLRTWRTNLPKGACALLLTSVDHDRAPVVGVRVNVLLSRYL IEPCGPGKSKLTYMCRVDLRGHMPEWYTKSFGHLCAAEVVKIRDSFSNQNTETKDTKSR >gi568815590f:12912791_13122041|GENSCAN_predicted_CDS_8|3240_bp naaattgaagccaaggaagcttgtgattggctacgggcaactggtttcccccagtatgca cagctttatgaagatttcctgttccccatcgatatttccttggtcaagagagagcatgat tttttggacagagatgccattgaggctctatgcaggcgtctaaatactttaaacaaatgt gcggtgatgaagctagaaattagtcctcatcggaaacgaagtgacgattcagacgaggat gagccttgtgccatcagtggcaaatggactttccaaagggacagcaagaggtggtcccgg cttgaagagtttgatgtcttttctccaaaacaagacctggtccctgggtccccagacgac tcccacccgaaggacggccccagccccggaggcacgctgatggacctcagcgagcgccag gaggtgtcttccgtccgcagcctcagcagcactggcagcctccccagccacgcgcccccc agcgaggatgctgccaccccccggactaactccgtcatcagcgtttgctcctccagcaac ttggcaggcaatgacgactctttcggcagcctgccctctcccaaggaactgtccagcttc agcttcagcatgaaaggccacgaaaaaactgccaagtccaagacgcgcagtctgctgaaa cggatggagagcctgaagctcaagagctcccatcacagcaagcacaaagcgccctcaaag ctggggttgatcatcagcgggcccatcttgcaagaggggatggatgaggagaagctgaag cagctcaactgcgtggagatctccgccctcaatggcaaccgcatcaacgtccccatggta cgaaagaggagcgtttccaactccacgcagaccagcagcagcagcagccagtcggagacc agcagcgcggtcagcacgcccagccctgttacgaggacccggagcctcagtgcgtgcaac aagcgggtgggcatgtacttagagggcttcgatcctttcaatcagtcaacatttaacaac gtggtggagcagaactttaagaaccgcgagagctacccagaggacacggtgttctacatc cctgaagatcacaagcctggcactttccccaaagctctcaccaatggcagtttctccccc tcggggaataacggctctgtgaactggaggacgggaagcttccacggccctggccacatc agcctcaggagggaaaacagtagcgacagccccaaggaactgaagagacgcaattcttcc agctccatgagcagccgcctgagcatctacgacaacgtgccgggctccatcctctactcc agttcaggggacctggcggatctggagaacgaggacatcttccccgagctggacgacatc ctctaccacgtgaaggggatgcagcggatagtcaatcagtggtcggagaagttttctgat gagggagattcggactcagccctggactcggtctctccctgcccgtcctctccaaaacag atacacctggatgtggacaacgaccgaaccacacccagcgacctggacagcacaggcaac tccctgaatgaaccggaagagccctccgagatcccggaaagaagggattctggggttggg gcttccctaaccaggtccaacaggcaccgactgagatggcacagtttccagagctcacat cggccaagcctcaactctgtatcactacagattaactgccagtctgtggcccagatgaac ctgctgcagaaatactcactcctaaagctaacggccctgctggagaaatacacaccttct aacaagcatggttttagctgggccgtgcccaagttcatgaagaggatcaaggttccagac tacaaggaccggagtgtgtttggggtcccactgacggtcaacgtgcagcgcacaggacaa ccgttgcctcagagcatccagcaggccatgcgatacctccggaaccattgtttggatcag gttgggctcttcagaaaatcgggggtcaagtcccggattcaggctctgcgccagatgaat gaaggtgccatagactgtgtcaactacgaaggacagtctgcttatgacgtggcagacatg ctgaagcagtattttcgagatcttcctgagccactaatgacgaacaaactctcggaaacc tttctacagatctaccaatatgtgcccaaggaccagcgcctgcaggccatcaaggctgcc atcatgctgctgcctgacgagaaccgggaggttctgcagaccctgctttatttcctgagc gatgtcacagcagccgtaaaagaaaaccagatgaccccaaccaacctggccgtgtgctta gcgccttccctcttccatctcaacaccctgaagagagagaattcctctcccagggtaatg caaagaaaacaaagtttgggcaaaccagatcagaaagatttgaatgaaaacctagctgcc actcaagggctggcccatatgatcgccgagtgcaagaagcttttccaggttcccgaggaa atgagccgatgtcgtaattcctataccgaacaagagctgaagcccctcactctggaagca ctcgggcacctgggtaatgatgactcagctgactaccaacacttcctccaggactgtgtg gatggcctgtttaaagaagtcaaagagaagtttaaaggctgggtcagctactccacttcg gagcaggctgagctgtcctataagaaggtgagcgaaggaccccctctgaggctttggagg tcagtcattgaagtccctgctgtgccagaggaaatcttaaagcgcctacttaaagaacag cacctctgggatgtagacctgttggattcaaaagtgatcgaaattctggacagccaaact gaaatttaccagtatgtccaaaacagtatggcacctcatcctgctcgagactacgttgtt ttaagaacctggaggactaatttacccaaaggagcctgtgcccttttactaacctctgtg gatcacgatcgcgcacctgtggtgggtgtgagggttaatgtgctcttgtccaggtatttg attgaaccctgtgggccaggaaaatccaaactcacctacatgtgcagagttgacttaagg ggccacatgccagaatggtacacaaaatcttttggacatttgtgtgcagctgaagttgta aagatccgggattccttcagtaaccagaacactgaaaccaaagacaccaaatctaggtga