GENSCAN 1.0 Date run: 5-Nov-116 Time: 10:54:36 Sequence gi568815578f:33506956_33744643 : 237688 bp : 46.48% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 Intr - 3177 3094 84 1 0 95 75 45 0.017 3.69 1.02 Intr - 14670 14587 84 1 0 84 76 31 0.145 1.29 1.01 Init - 24490 24442 49 1 1 65 65 44 0.237 0.91 1.00 Prom - 30600 30561 40 -1.66 2.00 Prom + 60974 61013 40 -2.86 2.01 Init + 75350 75352 3 1 0 113 81 0 0.064 1.80 2.02 Intr + 104198 104380 183 1 0 77 103 188 0.560 19.18 2.03 Intr + 116160 116341 182 2 2 74 83 202 0.915 16.97 2.04 Intr + 117809 118062 254 2 2 99 99 110 0.907 10.28 2.05 Intr + 121395 121480 86 1 2 57 66 71 0.956 1.34 2.06 Intr + 122764 122959 196 0 1 79 91 93 0.969 7.69 2.07 Intr + 129685 129714 30 2 0 124 92 11 0.652 3.20 2.08 Intr + 133386 133576 191 1 2 83 55 280 0.949 23.50 2.09 Term + 137392 137691 300 0 0 101 52 236 0.856 16.52 2.10 PlyA + 138280 138285 6 1.05 3.13 PlyA - 140320 140315 6 1.05 3.12 Term - 141280 141266 15 1 0 78 49 0 0.210 -6.66 3.11 Intr - 142498 142427 72 1 0 56 66 105 0.830 4.70 3.10 Intr - 143301 143143 159 0 0 96 72 97 0.943 9.08 3.09 Intr - 147595 147555 41 2 2 62 109 29 0.781 0.34 3.08 Intr - 151078 150987 92 0 2 45 85 204 0.997 15.44 3.07 Intr - 151599 151522 78 2 0 87 86 78 0.986 6.17 3.06 Intr - 151879 151767 113 1 2 82 73 143 0.681 11.38 3.05 Intr - 152595 152542 54 0 0 93 85 62 0.924 5.58 3.04 Intr - 152777 152698 80 0 2 78 100 133 0.944 12.77 3.03 Intr - 153048 152930 119 2 2 110 85 82 0.999 10.21 3.02 Intr - 153440 153304 137 2 2 110 80 213 0.988 22.07 3.01 Init - 153708 153664 45 0 0 95 76 -15 0.760 -1.08 3.00 Prom - 153944 153905 40 -15.35 4.00 Prom + 154271 154310 40 -7.36 4.01 Init + 155391 155516 126 2 0 72 40 160 0.845 9.76 4.02 Intr + 156577 156884 308 0 2 99 7 368 0.897 25.05 4.03 Intr + 159546 159564 19 0 1 97 94 11 0.723 -0.79 4.04 Term + 160042 161280 1239 0 0 2 47 1796 0.817 158.50 4.05 PlyA + 161538 161543 6 1.05 5.07 PlyA - 161880 161875 6 -0.45 5.06 Term - 162152 162141 12 2 0 74 35 11 0.466 -7.40 5.05 Intr - 162517 162420 98 2 2 93 105 133 0.715 15.23 5.04 Intr - 162757 162732 26 0 2 102 89 25 0.995 1.67 5.03 Intr - 163837 163729 109 2 1 114 72 122 0.998 12.54 5.02 Intr - 165467 165443 25 2 1 127 101 36 0.992 6.40 5.01 Init - 167397 167269 129 0 0 121 86 205 0.964 21.75 5.00 Prom - 168372 168333 40 -13.87 6.09 PlyA - 168748 168743 6 1.05 6.08 Term - 170024 169777 248 0 2 110 48 471 0.999 41.25 6.07 Intr - 170375 170150 226 2 1 89 98 81 0.916 6.76 6.06 Intr - 170563 170471 93 1 0 43 78 86 0.656 3.26 6.05 Intr - 171398 171224 175 1 1 111 37 275 0.995 24.64 6.04 Intr - 173019 172800 220 1 1 70 37 458 0.997 36.26 6.03 Intr - 173461 173371 91 1 1 122 77 60 0.987 7.87 6.02 Intr - 178824 178699 126 0 0 85 42 68 0.712 2.78 6.01 Init - 179309 179049 261 2 0 94 80 484 0.998 43.26 6.00 Prom - 182104 182065 40 -3.76 7.05 PlyA - 182912 182907 6 1.05 7.04 Term - 201014 200751 264 2 0 103 38 314 0.992 23.41 7.03 Intr - 203798 203600 199 1 1 29 103 235 0.942 18.45 7.02 Intr - 207781 207719 63 0 0 106 86 98 0.863 9.23 7.01 Init - 213252 213140 113 0 2 72 56 213 0.500 16.32 7.00 Prom - 213687 213648 40 -4.86 8.03 PlyA - 215177 215172 6 1.05 8.02 Term - 225173 224772 402 2 0 77 35 303 0.985 19.05 8.01 Intr - 235905 235759 147 0 0 68 59 62 0.146 1.73 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:33506956_33744643|GENSCAN_predicted_peptide_1|73_aa MHFKILNTDEEDIQQQYGTAAVKVKHLDSMQFIRTMLAVDQASPGTWTQLETIILSKLSQ GQKTKHRMFSLTX >gi568815578f:33506956_33744643|GENSCAN_predicted_CDS_1|219_bp atgcatttcaaaatcctaaacacagatgaagaggatatacagcagcagtatggtacagct gcagttaaagtaaaacacctggactccatgcagtttattagaacaatgttagctgtagat caggcaagcccagggacatggacgcagctggaaaccatcattctgagcaaactatcacaa ggacagaaaaccaaacaccgcatgttctcactcacagnn >gi568815578f:33506956_33744643|GENSCAN_predicted_peptide_2|474_aa MRFSNGPASSTSSALTNQQLPATCGARQLSKLKRFLTTLQQFGNDISPEIGEKVRTLVLA LVANLPLLQRELLHCARAAKQTPSQYLAQHEHLLLNTSIASPADSSELLMEVHGNGKRPS PERREENSFDRDTIAPEPPAKRVCTISPAPRHSPALTVPLMNPGGQFHPTPPPLQHYTLE DIATSHLYREPNKMLEHREVRDRHHSLGLNGGYQDELVDHRLTEREWADEWKHLDHALNC IMEMVEKTRRSMAVLRRCQESDREELNYWKRRYNENTELRKTGTELVSRQHSPGSADSLS NDSQREFNSRPEEAVNKVKIQAMSEVQKAVAEAEQKAFEVIATERARMEQTIADVKRQAA EDAFLVINEQEESTENCWNCGRKASETCSGCNIARYCGSFCQHKDWERHHRLCGQNLHGQ SPHGQGRPLLPVGRGSSARSADCSVPSPALDKTSATTSRSSTPASVTAIDTNGL >gi568815578f:33506956_33744643|GENSCAN_predicted_CDS_2|1425_bp atgagattcagcaatggtcctgcctcctccacatcatctgcactcacaaatcagcaattg ccagccacttgtggtgctcgacaactcagcaagttgaaacgctttcttaccactctgcaa cagtttggcaatgacatctcccctgagattggggagaaggtgcggactcttgttcttgca ctggtggccaacctgcccctgctgcagcgggaactgctgcactgcgctcgggcggccaag cagaccccatcccagtacctggctcagcacgaacaccttctgctcaacacaagcattgca tcgcctgctgactcgtcagagttgctcatggaggtgcacggaaatgggaagaggcccagt ccagagaggagagaagagaatagttttgatagagacacaattgctcctgagcctcctgcc aagagagtatgtaccatcagccctgctcctcggcacagtcctgctctcactgtgcccctc atgaatcccgggggccaattccatcctacccctccacctcttcagcattacaccttagag gatattgcaacttctcacctgtatcgggaacccaacaagatgctagagcatcgagaagtt cgtgatagacaccacagtcttggtctaaatggaggctatcaagatgagttggtagatcat cgtttgacagaaagggaatgggctgatgaatggaaacatcttgaccatgcgctgaattgc attatggaaatggtagagaaaacaaggcgctctatggcagttctgcggcgctgtcaggaa tcagatcgtgaagaactcaactactggaaaagacggtacaatgaaaacacagagctgagg aaaacggggaccgagttggtctccaggcagcacagccctgggagtgcagattctctcagc aatgattctcagagagagttcaacagcaggccagaagaagctgtgaataaggtgaaaatt caggccatgtcagaagtacagaaggccgtcgctgaggcagagcagaaagcctttgaagtg attgcaacagagagagcacgaatggagcaaaccatagcggatgtcaagcggcaggccgca gaggatgctttcctcgtcatcaatgagcaagaggagtccacggagaactgctggaactgt ggccgcaaagccagcgagacatgcagtggctgcaatatcgcgcgatactgtggctctttc tgccagcacaaggactgggagcggcaccaccgcctctgtggtcagaacctgcatggccag agcccccacggccagggccggccgctgcttcctgtaggcaggggctcctctgccaggtcc gccgactgcagcgtgcccagcccagccctcgacaagacctcggcaaccacatcgcgttcc tcaacacctgcttctgtgacagctatcgacaccaacggactctga >gi568815578f:33506956_33744643|GENSCAN_predicted_peptide_3|334_aa MRSREARHFAGQMPPEYERASKVDQFVTRFLLRETVSQLQALQSSLEGASDTLEAQAHGW RSDAESVEAQSRLCGSRRAGRRALRSVSRSSTWSPGSSDTGRSSEAEMQWRLQVNRLQEL IDQLECKAPRLEPLREEDLAKGPDLHILMAQRQVQVAEEGLQDFHRALRCYVDFTGAQSH CLHVSAQKMLDGASFTLYEFWQDEASWRRHQQSPGSKAFQRILIDHLRAPDTLTTVFFPA TRPPQKRFHCKEQGTDQLVQLAFQRGQFQAHGVQTPPASCPWMLFRPRLEPVEAPLSLFL KDLGIRLTMEKVTCTIIFGNEHVDARGKRQFLGF >gi568815578f:33506956_33744643|GENSCAN_predicted_CDS_3|1005_bp atgaggtccagagaggccaggcacttcgctgggcagatgccccctgagtacgagagggcc tccaaagtggaccagtttgtgacgcgcttcctgctgcgggagacggtgagccagctgcaa gcccttcagagctcgctggagggggcgtcagataccctggaggcccaggcccatggctgg cggtcagatgcagagagcgtggaggcgcagagcaggctctgcggcagccggcgggcagga cgccgagccctgaggagtgtcagccggtcatccacctggtcccccggctcttctgacaca gggcgcagctcagaggccgagatgcagtggcggctccaggtgaaccgcctccaggagctc atcgaccagctcgagtgcaaggccccccggctggaacccctgcgtgaagaggacctggcc aaggggcctgacttgcacatcctcatggcccagaggcaggtccaggtggcagaggaaggc ctgcaggacttccaccgagccctgcgctgctatgtggacttcacaggggcccagagccat tgtctgcatgtgtccgcccagaagatgctggacggtgcctccttcaccctgtatgagttc tggcaggatgaggcctcctggagaaggcaccagcagtcgcctggcagcaaggccttccag cgcatcctcatcgaccacctgcgggccccggacaccctcaccactgtgttcttcccagcc acccgtccgccacagaaacgcttccactgtaaggagcagggcactgaccagttggtgcag ctggctttccagagaggccagttccaggcacatggggtgcagacaccccctgcttcatgc ccttggatgctcttcaggccacgactagaaccagttgaggcccctctctccctcttcctc aaggatttgggaattaggctcacgatggaaaaggtcacctgcaccatcatctttggaaat gagcacgtcgatgcccgaggaaagaggcagtttctaggtttctga >gi568815578f:33506956_33744643|GENSCAN_predicted_peptide_4|563_aa MGNYSSHKRTKAPKQARKERPADMDKAWWKSFLNHLTRKKPATRIVLILPLDKRQPLANA GQRIDYASGAGLGSPAAPRLRGAGEGSEREPRMPVLLLLRRQEARRPEEGGARAALSWPR LLSRFRSPGKAPREAGPAEEQPRKRNFRQAQDRGREVSGRRSRVARRSGAFSPAAGPSPK VASVSGSRRVHRPSSLGRIAVVVDQGSGFTKAGFAGENQPRIVLKSSSLVPSWDRPVLPG APGCELAGGVARAHPIKHGVVADWEALEGLWERLLVGGLRVCPEQWPVLVSDSPLAPPAG RERVAELLFETLAVPACHMASTALLALCSTGAFSGLAVEAGAGVCHATPIYAGHSWHQAT FRLNVAGSTLSRYLRDLLVAANPDLLQQALPRKAITHLKKRSCYVSLDFEGDLRDPARHH PASFSVGNGCCVCLSSERFRCPEPIFQPGLLGQAEQGLPALAFRALQKMPKTLRTRLADT VVLAGGSTLFPGFAERLDKELEAQCRRHGYAALRPHLVAKHGRGMAVWTGGSMVASLHSF QRRWITRAMYQECGSRLLYDVFN >gi568815578f:33506956_33744643|GENSCAN_predicted_CDS_4|1692_bp atgggaaactatagttcccacaaaaggaccaaagcacccaagcaggcccgcaaggagagg ccggctgacatggacaaggcctggtggaaatcgttcctcaaccacctcactcggaagaag ccggctaccaggatcgtgctgattctccccctggacaagcggcagccgctggccaacgct gggcaacggattgactacgcgtccggcgctgggctgggctccccggcggcacccagattg cgcggagcgggcgaaggtagcgagcgcgagccgaggatgccggtactgctgctgctgcgg cggcaagaggcgcggcggccggaagagggcggggccagggcagctctgagctggccgcgg ctgctctcgcgcttccggtccccggggaaggctccccgcgaagccggccccgccgaggag cagccgcgcaaacggaacttccgtcaagcccaggacagaggccgggaggtatcgggccga cgaagccgagtggcgcggaggagcggagccttcagccccgcggctgggccgagcccgaag gtggcgtcggtgtcggggagccgccgcgtgcaccggccgtcctccctgggccgcatcgcg gtggtggtggaccagggctcgggcttcaccaaggcgggcttcgcgggcgagaaccagccg cgcatagtgctgaagagctctagcttggtgcccagctgggaccggccggtgctgcccgga gcgccgggctgcgagctggcgggcggcgtggcgcgggcgcaccccatcaagcacggcgtg gtggcggactgggaggcgctggaagggctgtgggagcgcctgctggtgggcggcctgcgg gtgtgcccagagcagtggcccgtgctggtgagcgactcgccgttggcgccgcccgcgggc cgcgaaagggtggcggagctgctgttcgagaccctggcagtgcccgcgtgccacatggct agcaccgcgttgctggcgctctgctccaccggcgcgttcagcgggctggccgtggaggcg ggcgcgggcgtgtgccacgccacgcccatctacgcgggtcactcgtggcaccaggccacc ttccgactgaacgtggcaggcagcaccctgtcgcgctacctgcgggatctgctggtggcg gcgaaccctgacctcttgcagcaggccctgccccgcaaggccatcacacatctcaagaag cgcagctgctacgtgtccctggacttcgagggcgacctccgcgaccccgcccgccaccat ccggccagtttcagcgtgggtaacgggtgctgcgtctgcctcagcagtgagcgcttccgc tgccccgaacccatcttccagccgggcctgctgggccaggctgagcaggggctgcccgcg ctggccttccgggcgctgcagaagatgcccaaaacgctgcggacacgcctggcagacacc gtggtgctagccggcggctccacactgtttcctggcttcgccgagcgcctggacaaggag ctggaggcgcagtgccggcggcacggctacgcggccctgcggccccacctggtggccaag catgggcgtggcatggctgtgtggaccggcggctccatggtggcctccctgcactccttc cagcgccgctggataactcgggccatgtaccaggagtgtggctccaggctgctgtacgat gtgttcaactga >gi568815578f:33506956_33744643|GENSCAN_predicted_peptide_5|132_aa MACAGLLTVCLLRPPAPQPQPQTPRHPQLAPDPGPAGHTLFQDVFRRADKNDDGKLSFEE FQNYFADGVLSLGELQELFSGIDGHLTDNLETEKLCDYFSEHLGVYRPVLAALESLNRAV LAAMDATKLHPE >gi568815578f:33506956_33744643|GENSCAN_predicted_CDS_5|399_bp atggcgtgcgcggggctgctcaccgtgtgcctgctccggccgcccgcgccccagccccag ccccagaccccgcggcacccccagctcgcgcccgacccggggcccgccggacacacgctc ttccaggacgttttccgcagagcagacaagaatgatgatgggaagctctcatttgaggaa ttccagaattactttgccgatggggttctcagcctgggggagctgcaggaactgttcagc ggcattgatgggcatctcaccgacaatttagaaacagaaaaactgtgtgactacttctca gagcacctgggtgtctaccggccggtgctggctgcattggaatcgctgaaccgtgcagtg ctcgctgccatggatgccaccaagctgcacccagaataa >gi568815578f:33506956_33744643|GENSCAN_predicted_peptide_6|479_aa MALAGAPAGGPCAPALEALLGAGALRLLDSSQIVIISAAQDASAPPAPTGPAAPAAGPCD PDLLLFATPQAPRPTPSAPRPALGRPPLPHRSLPSVGLSDNNSPRVFNWRLLHARPLGDF KPAKTHNPAVKRRLDLETDHQYLAESSGPARGRGRHPGKGVKSPGEKSRYETSLNLTTKR FLELLSHSADGVVDLNWAAEVLKVQKRRIYDITNVLEGIQLIAKKSKNHIQWLGSHTTVG VGGRLEGLTQDLRQLQESEQQLDHLMNICTTQLRLLSEDTDSQRYPWIGRRDLRSIADPA EQMVMVIKAPPETQLQAVDSSENFQISLKSKQGPIDVFLCPEETVGGISPGKTPSQEVTS EEENRATDSATIVSPPPSSPPSSLTTDPSQSLLSLEQEPLLSRMGSLRAPVDEDRLSPLV AADSLLEHVREDFSGLLPEEFISLSPPHEALDYHFGLEEGEGIRDLFDCDFGDLTPLDF >gi568815578f:33506956_33744643|GENSCAN_predicted_CDS_6|1440_bp atggccttggccggggcccctgcgggcggcccatgcgcgccggcgctggaggccctgctc ggggccggcgcgctgcggctgctcgactcctcgcagatcgtcatcatctccgccgcgcag gacgccagcgccccgccggctcccaccggccccgcggcgcccgccgccggcccctgcgac cctgacctgctgctcttcgccacaccgcaggcgccccggcccacacccagtgcgccgcgg cccgcgctcggccgcccgccgcttcctcaccggagcctccccagcgtcggactcagtgat aataatagcccacgtgtattcaactggcgtttactgcatgccaggcctttgggcgacttc aagccagctaaaactcacaaccccgctgtgaagcggaggctggacctggaaactgaccat cagtacctggccgagagcagtgggccagctcggggcagaggccgccatccaggaaaaggt gtgaaatccccgggggagaagtcacgctatgagacctcactgaatctgaccaccaagcgc ttcctggagctgctgagccactcggctgacggtgtcgtcgacctgaactgggctgccgag gtgctgaaggtgcagaagcggcgcatctatgacatcaccaacgtccttgagggcatccag ctcattgccaagaagtccaagaaccacatccagtggctgggcagccacaccacagtgggc gtcggcggacggcttgaggggttgacccaggacctccgacagctgcaggagagcgagcag cagctggaccacctgatgaatatctgtactacgcagctgcgcctgctctccgaggacact gacagccagcgatatccttggattggccgtagggaccttcgtagcattgcagaccctgca gagcagatggttatggtgatcaaagcccctcctgagacccagctccaagccgtggactct tcggagaactttcagatctcccttaagagcaaacaaggcccgatcgatgttttcctgtgc cctgaggagaccgtaggtgggatcagccctgggaagaccccatcccaggaggtcacttct gaggaggagaacagggccactgactctgccaccatagtgtcaccaccaccatcatctccc ccctcatccctcaccacagatcccagccagtctctactcagcctggagcaagaaccgctg ttgtcccggatgggcagcctgcgggctcccgtggacgaggaccgcctgtccccgctggtg gcggccgactcgctcctggagcatgtgcgggaggacttctccggcctcctccctgaggag ttcatcagcctttccccaccccacgaggccctcgactaccacttcggcctcgaggagggc gagggcatcagagacctcttcgactgtgactttggggacctcacccccctggatttctga >gi568815578f:33506956_33744643|GENSCAN_predicted_peptide_7|212_aa MAAPPQLRALLVVVNALLRKRRYHAALAVLKGFRNGAVYGAKIRAPHALVMTFLFRNGSL QEKLWAILQATYIHSWNLARFVFTYKGLRALQSYIQGKTYPAHAFLAAFLGGILVFGENN NINSQINMYLLSRVLFALSRLAVEKGYIPEPRWDPFPLLTAVVWGLVLWLFEYHRSTLQP SLQSSMTYLYEDSNVWHDISDFLVYNKSRPSN >gi568815578f:33506956_33744643|GENSCAN_predicted_CDS_7|639_bp atggcagccccgccgcagctaagggctctgctcgtagtcgtcaacgcactgctgcgcaag cgccgctaccacgctgcgttggccgtgcttaagggcttccggaacggggctgtctatgga gccaaaatccgggcccctcacgcgctggtcatgacctttctcttccggaatggcagcctc caggagaagctgtgggccatactgcaggccacatatatccactcctggaacctggcacgg tttgtgttcacctacaagggtctccgtgccctgcagtcctacatacaaggcaagacctac ccagcacacgcattcctggcggccttcctcgggggtatcctggtgtttggagaaaacaat aacatcaacagccagatcaacatgtacctgttgtcacgcgtcctgtttgccctgagccgc ctggctgtagagaagggctacatccctgaacccaggtgggacccgttcccgctgctcact gcggtggtgtgggggctggtgctgtggctctttgagtatcaccgatccaccctgcagccc tcgctgcagtcctccatgacctacctctatgaggacagcaatgtatggcacgacatctca gacttcctcgtctataacaagagccgtccctccaattaa >gi568815578f:33506956_33744643|GENSCAN_predicted_peptide_8|182_aa LGMSCDRILTNGTTIELYRICNFLIKGKGKGMDRALWLMPAIPVHWQAEAGRCEGAAGGG AARNSRLRRPRRRSPSRASKIACAILEPPPSPPQEPKPGYASIVGAVAVESGPPGGAQEP PPSPQPWKPNLGSRDPSISGLPWLHSITQSPGHRVWTPPHLNDQVYTDLISAAHAELPIP II >gi568815578f:33506956_33744643|GENSCAN_predicted_CDS_8|549_bp ctaggaatgtcatgtgacagaattctcaccaatgggacaacaatagaactgtaccgtatc tgcaacttcctcataaaaggaaaaggaaagggcatggacagggcactgtggctcatgcct gcaatcccagtacattggcaggctgaggccgggcgctgcgagggcgcggcgggagggggc gcagcgcggaacagccgcctccgccggccccgccgccgctcaccctccagggcctcaaag atcgcctgcgccatcttggagccgccgccgtcgccgccacaggaaccgaagcccggctac gcgagcattgtgggagccgtggcggttgagtcggggccgccagggggcgcccaggagccg ccgccgagcccccagccctggaaacccaacctaggctcgcgggaccccagcatctctggg ctgccctggctccactcgataacccagtcaccgggccaccgagtgtggaccccgcctcac ttgaacgaccaagtctacaccgaccttatatccgccgcacacgcggaacttcccatccct attatctaa