GENSCAN 1.0 Date run: 3-Nov-116 Time: 12:35:06 Sequence gi568815592r:37537371_37796811 : 259441 bp : 50.43% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 741 736 6 1.05 1.04 Term - 2254 2077 178 0 1 139 54 111 0.758 9.96 1.03 Intr - 8428 8377 52 2 1 98 95 35 0.613 3.17 1.02 Intr - 10814 10480 335 1 2 101 53 74 0.567 0.41 1.01 Init - 11376 11150 227 0 2 92 107 64 0.580 6.64 1.00 Prom - 18085 18046 40 -4.26 2.28 PlyA - 23644 23639 6 1.05 2.27 Term - 30660 30355 306 0 0 84 55 110 0.012 2.42 2.26 Intr - 46205 46098 108 2 0 46 50 93 0.072 1.78 2.25 Intr - 48722 48632 91 1 1 44 78 101 0.086 4.70 2.24 Intr - 54953 54824 130 0 1 45 -12 356 0.140 21.05 2.23 Intr - 56360 56291 70 2 1 93 115 -36 0.412 -1.65 2.22 Intr - 61483 61404 80 2 2 73 99 47 0.390 3.57 2.21 Intr - 67669 67562 108 2 0 48 106 57 0.006 3.76 2.20 Intr - 90338 90158 181 2 1 39 93 42 0.002 -0.76 2.19 Intr - 96830 96804 27 2 0 118 53 38 0.155 1.61 2.18 Intr - 97505 97403 103 1 1 108 44 84 0.810 6.18 2.17 Intr - 98436 98349 88 1 1 8 73 111 0.859 0.63 2.16 Intr - 100943 100817 127 2 1 75 59 138 0.788 9.85 2.15 Intr - 101297 101167 131 0 2 101 109 139 0.999 17.71 2.14 Intr - 106573 106439 135 2 0 93 109 164 0.991 19.54 2.13 Intr - 107279 107127 153 0 0 45 72 237 0.999 17.84 2.12 Intr - 108706 108563 144 2 0 96 94 3 0.660 1.95 2.11 Intr - 109005 108828 178 0 1 37 44 310 0.969 20.99 2.10 Intr - 109954 109803 152 2 2 87 80 306 0.987 29.58 2.09 Intr - 111896 111612 285 0 0 76 105 557 0.999 53.51 2.08 Intr - 113035 112739 297 2 0 88 65 490 0.972 43.55 2.07 Intr - 114970 114641 330 2 0 74 80 510 0.970 44.40 2.06 Intr - 117173 116904 270 0 0 105 57 286 0.988 24.71 2.05 Intr - 117562 117430 133 1 1 45 59 278 0.927 20.82 2.04 Intr - 118526 118330 197 0 2 94 71 309 0.999 28.93 2.03 Intr - 121049 120875 175 2 1 67 65 442 0.987 39.31 2.02 Intr - 126736 126597 140 2 2 107 115 167 0.974 21.48 2.01 Init - 158989 158779 211 1 1 36 -24 304 0.581 10.95 2.00 Prom - 165836 165797 40 -6.46 3.00 Prom + 174291 174330 40 -1.76 3.01 Init + 184205 184315 111 1 0 67 89 38 0.176 2.01 3.02 Intr + 187543 187761 219 0 0 61 92 68 0.174 3.00 3.03 Intr + 191753 191884 132 1 0 27 98 51 0.112 0.84 3.04 Term + 194520 194591 72 2 0 99 51 63 0.870 1.61 3.05 PlyA + 195892 195897 6 1.05 4.00 Prom + 203037 203076 40 -1.46 4.01 Init + 207190 207296 107 0 2 71 41 113 0.271 4.59 4.02 Term + 218363 218579 217 2 1 95 49 95 0.038 2.92 4.03 PlyA + 223838 223843 6 1.05 5.04 PlyA - 228443 228438 6 1.05 5.03 Term - 240851 240798 54 2 0 122 52 47 0.504 2.06 5.02 Intr - 246635 246526 110 0 2 67 88 86 0.533 6.50 5.01 Init - 254710 254656 55 1 1 78 77 35 0.416 2.85 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 47940 47856 85 0 1 50 81 135 0.868 7.98 S.002 Term - 54953 54820 134 0 2 45 37 338 0.855 22.65 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:37537371_37796811|GENSCAN_predicted_peptide_1|263_aa MPGCFRHLQTQMQTSEDTPQPVPEGSKMGSRVQSWTQSQDPKLSALFPRGPGDSREKPEA QSQLRELARQGCGMQRRGAEGRLTLAPDAAELDFSALIVGCHLCKAGWTHQPQRVVNLQP VLLLVHTRPEQPQPPTPQSPAEEYANTPQNKFQLEFLPSCFRQLHPSKPWPPLSTSSNWQ TSKERQDSGPQDRALKGLVETGKTWPGILIISISDVTGLGGEEATPLQGKLKTEIFSHAA GKWGEQRKGAQEPLRGSVLEAFQ >gi568815592r:37537371_37796811|GENSCAN_predicted_CDS_1|792_bp atgccaggctgtttcaggcacctgcagacccagatgcagacatctgaggacacaccacaa ccagtccctgaggggagcaagatggggagcagggtgcagagctggacacaaagccaggac cccaaactcagcgctctgttcccacgtgggccaggagactcaagagaaaaacctgaggcc caatcccagctgcgtgaactggctcgacaaggctgcgggatgcagaggagaggggcagaa gggaggctaaccctggctccagatgcagctgagctagatttttctgccctgatcgtgggc tgccatctgtgcaaggctggttggacccaccagcctcagagggtggttaatttacaacct gtgctgctgcttgtccacaccaggccagagcagcctcaacccccaacccctcagtctcca gcagaagaatatgcgaacactcctcagaacaagttccagctcgagttccttccttcctgc ttccgtcaactccatccctccaagccttggcctcccctcagcacctcatccaactggcaa acttccaaggagaggcaggacagtgggccacaggatagagcattaaagggcctggtagaa actggaaagacctggcccgggatattgatcatctccatcagtgatgtcacaggactgggt ggggaggaggccactcccctgcaggggaagctgaagacagagatcttcagccatgcagcc gggaagtggggagagcagaggaagggggcccaggagcccctccgagggtctgtgctggag gctttccagtga >gi568815592r:37537371_37796811|GENSCAN_predicted_peptide_2|1449_aa MSRSCRAACGRAGCQHQGRAAGSPSPLLLRLRFFYSGLDTLIRMPASPGPGAFDPGIPFR TFPEPVLPVPAPAQAQIVHAGQACVVKEDNISERVYTIREGDTLMLQCLVTGHPRPQVRW TKTAGSASDKFQETSVFNETLRIERIARTQGGRYYCKAENGVGVPAIKSIRVDVQYLDEP MLTVHQTVSDVRGNFYQEKTVFLRCTVNSNPPARFIWKRGSDTLSHSQDNGVDIYEPLYT QGETKVLKLKNLRPQDYASYTCQVSVRNVCGIPDKAITFRLTNTTAPPALKLSVNETLVV NPGENVTVQCLLTGGDPLPQLQWSHGPGPLPLGALAQGGTLSIPSVQARDSGYYNCTATN NVGNPAKKTVNLLVRSMKNATFQITPDVIKESENIQLGQDLKLSCHVDAVPQEKVTYQWF KNGKPARMSKRLLVTRNDPELPAVTSSLELIDLHFSDYGTYLCMASFPGAPVPDLSVEVN ISSETVPPTISVPKGRAVVTVREGSPAELQCEVRGKPRPPVLWSRVDKEAALLPSGLPLE ETPDGKLRLERVSRDMSGTYRCQTARYNGFNVRPREAQVQLNVQFPPEVEPSSQDVRQAL GRPVLLRCSLLRGSPQRIASAVWRFKGQLLPPPPVVPAAAEAPDHAELRLDAVTRDSSGS YECSVSNDVGSAACLFQVSAKAYSPEFYFDTPNPTRSHKLSKNYSYVLQWTQREPDAVDP VLNYRLSIRQLNQHNAVVKAIPVRRVEKGQLLEYILTDLRVPHSYEVRLTPYTTFGAGDM ASRIIHYTERQIRWPPVLALRTLSSGPKQGILCRAPHLSSDLVSPLAFSAINSPNLSDNT CHFEDEKICGYTQDLTDNFDWTRQNALTQNPKRSPNTGPPTDISGTPEGYYMFIETSRPR ELGDRARLVSPLYNASAKFYCVSFFYHMYGKHIGSLNLLVRSRNKGALDTHAWSLSGNKG NVWQQAHVPISPSGPFQIIFEGVRGPGYLGDIAIDDVTLKKGECPRKQTDPNKGARREGA ACDGLKFHLSSPMDDGELTDDPIECKHLWIHRVDSKGAQYMLAELNCIHVAPRFLVFMDE GHKVGEKDSGGQPFQVYTDFSFYPHAIPEKMSIMPTLQVGKLRLREEEESAYGHTAAKAR AGTRSWSLNRPPELKSFSLWTNMTQSVREPITGPLQCQTYTHSNTQHLGTWMKLETIILS KLTQEQKTKHHMFSLTTQSAPPSLFSPELPLSSNSRGHASMSLDDDDDDDDDDDDDDDDD DNDDDDGRKTYRIPPGSDLSACGWVNDHMSQGDSTGPPDEKHSYAKEETQAKLTGPGISY PGKPSYRLHPSGYTPLEQFREKATGMLTGRTVSEVSALYLASYKASFLLAVVLIIGIQKW TRQSHAQCVPRPFCFCASMSRAWGEKGAVERGFHSGNGMWGNQRHKGATVYQFLQETVLR GNYTISATT >gi568815592r:37537371_37796811|GENSCAN_predicted_CDS_2|4350_bp atgagccggagttgccgcgcggcttgcgggcgagccggctgtcagcaccaaggccgggcg gcgggctcgccttccccgctgttgctgcggcttcgcttcttctactcagggcttgacacc ctgatcaggatgccggcatcccctggccctggcgccttcgaccccggcatcccttttcgg accttccctgaaccagttcttcctgtccccgctccagcccaggcgcagatcgtgcatgcg ggccaggcatgtgtggtgaaagaggacaatatcagcgagcgtgtctacaccatccgggag ggggacaccctcatgctgcagtgccttgtaacagggcaccctcgaccccaggtacggtgg accaagacggcaggtagcgcctcggacaagttccaggagacatcggtgttcaacgagacg ctgcgcatcgagcgtattgcacgcacgcagggcggccgctactactgcaaggctgagaac ggcgtgggggtgccggccatcaagtccatccgcgtggacgtgcagtacctggatgagcca atgctgacggtgcaccagacggtgagcgatgtgcgaggcaacttctaccaggagaagacg gtgttcctgcgctgtactgtcaactccaacccgcctgcccgcttcatctggaagcggggt tccgataccctatcccacagccaggacaatggggttgacatctatgagcccctctacact cagggggagaccaaggtcctgaagctgaagaacctgcggccccaggactatgccagctac acctgccaggtgtctgtgcgtaacgtgtgcggcatcccagacaaggccatcaccttccgg ctcaccaacaccacggcaccaccagccctgaagctgtctgtgaacgaaactctggtggtg aaccctggggagaatgtgacggtgcagtgtctgctgacaggcggtgatcccctcccccag ctgcagtggtcccatgggcctggcccactgcccctgggtgctctggcccagggtggcacc ctcagcatcccttcagtgcaggcccgggactctggctactacaactgcacagccaccaac aatgtgggcaaccctgccaagaagactgtcaacctgctggtgcgatccatgaagaacgct acattccagatcactcctgacgtgatcaaagagagtgagaacatccagctgggccaggac ctgaagctatcgtgccacgtggatgcagtgccccaggagaaggtgacctaccagtggttc aagaatggcaagccggcacgcatgtccaagcggctgctggtgacccgcaatgatcctgag ctgcccgcagtcaccagcagcctagagctcattgacctgcacttcagtgactatggcacc tacctgtgcatggcttctttcccaggggcacccgtgcccgacctcagcgtcgaggtcaac atctcctctgagacagtgccgcccaccatcagtgtgcccaagggtagggccgtggtgacc gtgcgcgagggatcgcctgccgagctgcaatgcgaggtgcggggcaagccgcggccgcca gtgctctggtcccgcgtggacaaggaggctgcactgctgccctcggggctgcccctggag gagactccggacgggaagctgcggctggagcgagtgagccgagacatgagcgggacctac cgctgccagacggcccgctataatggcttcaacgtgcgcccccgtgaggcccaggtgcag ctgaacgtgcagttcccgccggaggtggagcccagttcccaggacgtgcgccaggcgctg ggccggcccgtgctcctgcgctgctcgctgctgcgaggcagcccccagcgcatcgcctcg gctgtgtggcgtttcaaagggcagctgctgccgccgccgcctgttgttcccgccgccgcc gaggcgccggatcacgcggagctgcgcctcgacgccgtaactcgcgacagcagcggcagc tacgagtgcagcgtctccaacgatgtgggctcggctgcctgcctcttccaggtctccgcc aaagcctacagcccggagttttacttcgacacccccaaccccacccgcagccacaagctg tccaagaactactcctacgtgctgcagtggactcagagggagcccgacgctgtcgaccct gtgctcaactacagactcagcatccgccagttgaaccagcacaatgcggtggtcaaggcc atcccggtccggcgtgtggagaaggggcagctgctggagtacatcctgaccgatctccgt gtgccccacagctatgaggtccgcctcacaccctataccaccttcggggctggtgacatg gcctcccgcatcatccactacacagagcgccagatccgctggcccccagtcctggctctg aggaccctgtcctctggtcccaagcagggtatcctctgcagagccccacacctcagttct gacttggtttccccgcttgctttctcagccatcaactctccgaacctttcagacaacacc tgccactttgaggatgagaagatctgtggctatacccaggacctgacagacaactttgac tggacgcggcagaatgccctcacccagaaccccaaacgctcccccaacactggtcccccc accgacataagtggcacccctgagggctactacatgttcatcgagacatcgaggcctcgg gagctgggggaccgtgcaaggttagtgagtcccctctacaatgccagcgccaagttctac tgtgtctccttcttctaccacatgtacgggaaacacatcggctccctcaacctcctggtg cggtcccggaacaaaggggctctggacacgcacgcctggtctctcagtggcaataagggc aatgtgtggcagcaggcccatgtgcccatcagccccagtgggcccttccagattattttt gagggggttcgaggcccgggctacctgggggatattgccatagatgacgtcacactgaag aagggggagtgtccccggaagcagacggatcccaataaaggtgcaagacgggaaggagct gcctgcgatggcctgaaattccacctttcatcccctatggatgacggagagcttacagat gaccctattgaatgcaagcacctttggatccatagagtggacagtaaaggtgctcagtac atgttggctgagctgaactgcatacatgtggcccccaggttcctggtctttatggacgaa gggcacaaggttggtgaaaaggactccgggggccagcccttccaagtttacactgatttc tccttttaccctcatgctatccctgagaagatgtcaataatgcccacgttacaggtggga aaactgaggcttagagaggaggaggaatctgcctacggtcacacagctgcaaaggctaga gctgggaccaggagctggtctcttaaccgaccacctgagctcaagagcttttctctctgg accaacatgacccaaagtgtgcgcgagcctatcacaggtcccctgcaatgccaaacatac acgcacagcaatacacaacacctggggacatggatgaagctggaaaccatcattctcagc aaactgacacaagaacagaaaaccaaacaccacatgttctcactcaccacccagtctgcc ccgccctctctcttctcacctgaacttcccctctcctcaaactctcgaggccacgcctct atgtccttggatgatgatgatgacgacgacgacgatgatgatgatgatgatgacgacgat gacaatgatgatgatgatggaaggaagacctacagaatccctccaggctctgacctcagt gcttgtgggtgggtgaatgaccacatgtcgcagggagactccacaggtcctcccgatgag aagcactcttatgccaaagaggagactcaggccaaactgacaggaccaggaattagctac cctggtaaacccagctatcgactgcacccgagcggctacacaccactggagcagttcagg gagaaagccaccggcatgctcaccggcagaactgtgagtgaagtttccgccctgtatctg gcctcatacaaggcctccttcctgctggctgttgtgctaataattgggatacagaaatgg acacgacagagccatgcccagtgtgtccccaggcccttctgcttttgtgcgagcatgagc agggcatggggagagaagggggccgtggagagaggtttccactcaggaaacgggatgtgg ggaaaccagaggcacaaaggagccacggtctaccagttcctccaggaaaccgtgctcagg ggaaactatacaatatcagcaacaacatga >gi568815592r:37537371_37796811|GENSCAN_predicted_peptide_3|177_aa MGTVPVAHQVYQMFYRALCKYYPFQPGEVGVINPILVLDSGGVGKRRKKEKTGFVSSKDS QPAGGDSIPGELRLPAGSHSNSAGPSWDGAAMGKQKPEPHFPTPEAGGKQAAPKGVHNFQ EAMGPRIQGSSPKMEQFCVIISQYPQQWHKHPTQRKLLALDNSSRKKTTPRTLTRKA >gi568815592r:37537371_37796811|GENSCAN_predicted_CDS_3|534_bp atgggtacagttccagtggcccaccaggtataccagatgttttaccgggcactttgcaaa tattacccttttcagcctggtgaagtaggtgttattaaccccattttagtgctggattca ggtggagtagggaaaagacggaagaaagagaagacaggattcgtgtcctcaaaggattcc cagcctgctgggggagacagcatccctggggagctgagattaccagcaggaagccacagc aattctgctggcccaagttgggatggagctgcaatggggaaacagaagcctgagcctcat tttcctaccccggaagctggaggaaagcaggccgcccccaagggggtgcataacttccag gaagcgatgggtcccaggatccagggcagttctccaaagatggagcagttctgcgttatc atcagtcaatatccacagcagtggcacaagcaccccacccagagaaagctcctggccctg gataatagctccagaaagaagacaactccacggaccctgacaagaaaggcctga >gi568815592r:37537371_37796811|GENSCAN_predicted_peptide_4|107_aa MSAQGEEQMSGSPCPGGAPGLVRKVDAEEINHRTAQTRDAGQGGHCLQDPSAEVPHGAEV GSRGRNSCENSDLFVSLSQAKVDNEAKLAITFLCVIMVVVVMLTAAG >gi568815592r:37537371_37796811|GENSCAN_predicted_CDS_4|324_bp atgagtgcccaaggagaggaacaaatgagtggcagtccctgtcctggaggagcccctggt ctagtgcggaaagtggatgcagaagaaataaaccaccggaccgcacaaaccagggacgcc ggacaaggaggacactgccttcaggacccttctgcggaggtccctcatggtgcagaagtt gggagtagaggcagaaacagctgtgaaaattctgatttgtttgtttctctgagccaagct aaagtggataatgaagccaagctggcgattaccttcctgtgcgtgataatggtggtagtg gtgatgctgactgcagcaggatga >gi568815592r:37537371_37796811|GENSCAN_predicted_peptide_5|72_aa MPLILKSILKGDCEGYRVGGQKRLRALRRPTLSAISTARPGAARTLAFRNKKNDFEEKPG GKEEELGDVGAL >gi568815592r:37537371_37796811|GENSCAN_predicted_CDS_5|219_bp atgcccctcattctcaagtccatcctaaaaggtgattgtgagggctacagggttggagga cagaagaggctgcgcgccctccgcaggcccacgctgtccgccattagcacagcccggccc ggcgctgctcgcacattagccttcaggaataaaaagaatgatttcgaagagaagccagga ggcaaagaggaggagctgggagatgtgggggctctctga