GENSCAN 1.0 Date run: 8-Nov-116 Time: 03:43:35 Sequence gi568815583r:74855987_75056583 : 200597 bp : 49.07% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 3932 3927 6 1.05 1.02 Term - 14939 14874 66 2 0 71 44 57 0.422 -2.46 1.01 Init - 17509 17213 297 1 0 75 80 262 0.916 19.32 1.00 Prom - 20868 20829 40 -6.26 2.00 Prom + 24059 24098 40 -4.36 2.01 Init + 34088 34232 145 1 1 102 48 103 0.672 6.30 2.02 Intr + 34541 34668 128 0 2 55 83 125 0.580 9.10 2.03 Intr + 35393 35593 201 1 0 96 98 176 0.996 18.88 2.04 Intr + 36675 36816 142 2 1 88 89 171 0.999 17.13 2.05 Intr + 37152 37334 183 1 0 73 38 184 0.948 11.66 2.06 Intr + 40166 40339 174 0 0 62 83 262 0.926 23.01 2.07 Intr + 41025 41233 209 1 2 79 105 123 0.982 11.90 2.08 Term + 41526 41744 219 2 0 98 43 153 0.976 8.84 2.09 PlyA + 43432 43437 6 1.05 3.08 PlyA - 43653 43648 6 1.05 3.07 Term - 45542 45433 110 0 2 62 48 61 0.534 -1.83 3.06 Intr - 46077 45964 114 1 0 120 100 23 0.946 7.12 3.05 Intr - 46704 46618 87 1 0 50 86 77 0.892 3.64 3.04 Intr - 48726 48524 203 2 2 94 65 65 0.800 3.73 3.03 Intr - 49245 49168 78 2 0 123 109 104 0.999 14.67 3.02 Intr - 50379 50292 88 1 1 81 14 100 0.993 0.93 3.01 Init - 50814 50601 214 0 1 107 66 134 0.962 10.03 3.00 Prom - 51882 51843 40 -5.96 4.07 PlyA - 52550 52545 6 1.05 4.06 Term - 67784 67671 114 2 0 71 40 114 0.845 3.47 4.05 Intr - 70901 70780 122 0 2 96 86 65 0.992 7.31 4.04 Intr - 73246 73130 117 2 0 35 86 130 0.929 7.94 4.03 Intr - 81621 81463 159 1 0 41 45 94 0.416 0.36 4.02 Intr - 82050 81929 122 2 2 75 82 143 0.643 12.54 4.01 Init - 87101 86953 149 2 2 76 43 52 0.179 -0.94 4.00 Prom - 96111 96072 40 -5.86 5.05 PlyA - 98454 98449 6 1.05 5.04 Term - 100730 99998 733 1 1 -6 47 892 0.774 68.64 5.03 Intr - 105770 105701 70 0 1 71 100 41 0.890 1.84 5.02 Intr - 108306 108254 53 2 2 80 119 27 0.904 3.55 5.01 Init - 109200 109193 8 0 2 95 81 0 0.794 0.52 5.00 Prom - 110510 110471 40 -6.16 6.00 Prom + 117725 117764 40 -2.46 6.01 Init + 120345 120503 159 2 0 84 63 50 0.284 1.96 6.02 Intr + 139589 139687 99 1 0 34 109 75 0.003 4.51 6.03 Intr + 146270 146338 69 1 0 40 92 57 0.123 0.68 6.04 Intr + 155806 155860 55 0 1 127 110 25 0.727 7.15 6.05 Intr + 156691 156819 129 2 0 61 56 152 0.971 9.97 6.06 Intr + 160607 160763 157 0 1 59 108 177 0.999 15.87 6.07 Intr + 161884 161985 102 1 0 105 99 102 0.999 12.29 6.08 Intr + 162408 162549 142 0 1 85 88 186 0.518 18.66 6.09 Term + 162803 162997 195 1 0 140 46 138 0.985 12.21 6.10 PlyA + 165484 165489 6 1.05 7.00 Prom + 175937 175976 40 -6.16 7.01 Init + 181312 181344 33 0 0 79 84 25 0.526 1.07 7.02 Intr + 187455 187550 96 2 0 98 78 139 0.997 14.11 7.03 Intr + 188400 188630 231 2 0 113 30 250 0.619 19.67 7.04 Intr + 192567 192735 169 2 1 81 98 308 0.939 30.62 7.05 Term + 193164 193249 86 1 2 49 43 131 0.748 2.52 7.06 PlyA + 193735 193740 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:74855987_75056583|GENSCAN_predicted_peptide_1|120_aa MWTEAECPPTALLWAGRGGGGGGRSDPGAVIGPRPRGRASRLRQNRCLAPGSGGSTRSAA GSGCPEAAAFAEFARWPPITMSAFDTNPFADPVDVNPFQLLSFLQVSEKSLKESFLSFTA >gi568815583r:74855987_75056583|GENSCAN_predicted_CDS_1|363_bp atgtggacggaagccgaatgcccacccaccgcactattgtgggctgggcggggggggggg gggggggggcggagcgaccccggtgccgtcatagggccacgccctcgggggagggcatcg cgcctgcgccagaaccgctgcctcgctcccggaagtggagggtctacacgaagcgccgct gggtctgggtgcccggaggcagcagcgttcgcggagttcgcccgctggcccccgatcacc atgtcggctttcgacaccaaccccttcgcggacccagtggatgtaaaccccttccagctt ttaagtttcctgcaggtgtcagagaagtccttgaaggagtccttcctgtcattcacagca tga >gi568815583r:74855987_75056583|GENSCAN_predicted_peptide_2|466_aa MAAPRGEPLAGVSASVFVERVLGTTPGIIGQVESAPGIPRKDGWVPGRVFPLSCAVQQYA WGKMGSNSEVARLLASSDPLAQIAEDKPYAELWMGTHPRGDAKILDNRISQKTLSQWIAE NQDSLGSKVKDTFNGNLPFLFKVLSVETPLSIQAHPNKELAEKLHLQAPQHYPDANHKPE MAIALTPFQGLCGFRPVEEIVTFLKKVPEFQFLIGDEAATHLKQTMSHDSQAVASSLQSC FSHLMKSEKKVVVEQLNLLVKRISQQAAAGNNMEDIFGELLLQLHQQYPGDIGCFAIYFL NLLTLKPGEAMFLEANVPHAYLKGDCVECMACSDNTVRAGLTPKFIDVPTLCEMLSYTPS SSKDRLFLPTRSQEDPYLSIYDPPVPDFTIMKTEVPGSVTEYKVLALDSASILLMVQGTV IASTPTTQTPIPLQRGGVLFIGANESVSLKLTEPKDLLIFRACCLL >gi568815583r:74855987_75056583|GENSCAN_predicted_CDS_2|1401_bp atggccgctccgcgaggtgagccattggctggggtgtcggcgagtgtgttcgtggagcgc gtcctggggacgactcccggcattatcggccaggttgagtccgctcctggcattccgcgg aaagatgggtgggtgccgggcagagtattcccactttcctgtgcggtgcagcagtatgcc tgggggaagatgggttccaacagcgaagtggcgcggctgttggccagcagtgatccactg gcccagatcgcagaggacaagccttatgcagagttgtggatggggactcacccccgaggg gatgccaagatccttgacaaccgcatctcacagaagaccctaagccagtggattgctgag aaccaggacagcttgggctcaaaggtcaaggacacctttaatggcaacctgcccttcctc ttcaaagtgctctcagttgaaacacccctgtccatccaggcacaccctaacaaggagctg gcagagaagctgcacctccaggctccgcagcactaccccgatgccaaccacaagccagag atggccattgccctcacccccttccagggcttgtgtggcttccggccagttgaggagatt gtaacctttctaaagaaggtgcctgagtttcagttcctgattggagatgaggcagcaaca cacctgaagcagaccatgagccatgactcccaggctgtggcctcctctctgcagagctgt ttctcccacctgatgaagagtgagaagaaggtggtggtggaacagctcaacctgttggtg aagcggatctcccagcaagcggctgccggaaacaacatggaggacatctttggggagctt ttgctacagctgcaccagcagtacccaggtgatatcggctgctttgccatctacttcctg aacctgcttaccctgaagcctggggaggccatgtttctggaggccaacgtaccccatgcc tacctgaaaggagactgcgtggagtgcatggcgtgttcagacaacacagttcgtgctggc ctgacacccaagttcattgatgtgccaaccctgtgtgaaatgctcagctatacccctagc tccagcaaggacaggctctttctcccaacacggagtcaggaagacccctacctctcaatc tatgacccccctgtaccagacttcaccattatgaagacggaggtccctggctctgtcact gaatacaaggtcttggcactggactctgccagcatcctcctgatggtacaggggacagta atagccagcacacccacaacccagacaccaatccctctgcaacgtggtggcgtgctcttc attggggccaatgagagtgtctcactgaagcttactgagccgaaggacctgctgatattc cgtgcctgctgtctgctgtaa >gi568815583r:74855987_75056583|GENSCAN_predicted_peptide_3|297_aa MATAEPSGRALRLSTPGPRPSGARDRAPGAAGPPSGQIGNRALRLGERTPAAVEKRGPYM VTRAPSIQAKLQKHRDLAKAVLRRKGMLGASPNRPDSSGKRSVKFNKGYTALSQSPDENL VSLDSDSDGELGSRYSSGYSSAEASPEHLSVQGQAAGSITTTAKCLFGARCEHSTVPDFW SVLRSSTLPSSPGGAHGLFNMLLLLVLSWGLFFLYPPVDITLQGCTCLTMTGIIAGCLMI QQDPGFQNPWTWGIPNWSNRPDSSGHSMGLTGYSGLGFKQTSGCDTASFSPQPKQQK >gi568815583r:74855987_75056583|GENSCAN_predicted_CDS_3|894_bp atggcgaccgcggagcccagcgggcgcgcgttgcggttgtctaccccgggaccccggccc agcggggctcgggaccgcgcgccgggagctgcggggccaccctccgggcagatcggtaat agagccctccgtctgggggagcgcacccccgcggccgtggaaaagcgggggccatacatg gtgacgcgcgcaccctccattcaagccaagctgcagaagcaccgggacctggccaaggcc gttctgcggagaaaaggcatgctgggggcctcgccgaaccgcccagactcttcagggaaa aggtcagtgaagtttaacaagggctatactgctcttagccagagtccagatgaaaacctg gtgtccctcgactctgacagtgatggggagctgggatccagatactcctccgggtattca tctgcagaggcaagtccagaacacctttctgtgcagggccaggctgcaggcagcattacc accactgccaaatgcttgtttggtgcccgctgtgagcacagcactgtgccagacttctgg tctgttctacgatcttccactctgccctcttcccccggtggagcccatggcctcttcaac atgctcctgctgctggtgctgtcttggggactcttcttcctgtaccctccagtagacatt accctccaaggttgcacctgtctaactatgacagggatcatagcaggttgcttaatgata cagcaggatcctggcttccagaatccctggacctggggcatccccaactggtccaaccgt ccggattcttctggtcacagcatgggccttactggatactcaggattgggctttaagcag acctctgggtgtgacacagcctctttcagcccccagcccaaacagcagaaatag >gi568815583r:74855987_75056583|GENSCAN_predicted_peptide_4|260_aa MTFLSHQKSSCCVLKKEEEAKVQVWLLQRDSEGFQGRKWQEEIKDPEDLDPRAAIAVMLG AALRRCAVAATTRADPRGLLHSARTPGPAVAPTLRPPDREWPPPACSASAKVTLSLRARR LGSEDTIYFRMGHTIILFNQAFLAIQSVRCYSHGSQETDEEFDARWVTYFNKPDIDAWEL RKGINTLVTYDMVPEPKIIDAALRACRRLNDFASTVRILEVVKDKAGPHKEIYPYVIQEL RPTLNELGISTPEELGLDKV >gi568815583r:74855987_75056583|GENSCAN_predicted_CDS_4|783_bp atgacatttctgagccatcagaagtccagctgttgtgttttaaaaaaagaagaagaagca aaagtccaggtgtggcttctccagagggacagcgaaggatttcaaggtaggaaatggcag gaagagataaaagaccctgaagaccttgacccgcgcgccgccatcgccgtcatgctgggc gccgctctccgccgctgcgctgtggccgcaaccacccgggccgaccctcgaggcctcctg cactccgcccggacccccggccccgccgtggcgccgaccctgcggccacctgaccgagag tggccgccgccggcctgcagcgcctcggcgaaggtgacattgagcctccgggcacgccgc ttggggtccgaggacacaatatacttccgtatgggacacacgattatacttttcaatcag gcatttctcgctatccagtcagttcgctgctattcccatgggtcacaggagacagatgag gagtttgatgctcgctgggtaacatacttcaacaagccagatatagatgcctgggaattg cgtaaagggataaacacacttgttacctatgatatggttccagagcccaaaatcattgat gctgctttgcgggcatgcagacggttaaatgattttgctagtacagttcgtatcctagag gttgttaaggacaaagcaggacctcataaggaaatctacccctatgtcatccaggaactt agaccaactttaaatgaactgggaatctccactccggaggaactgggccttgacaaagtg taa >gi568815583r:74855987_75056583|GENSCAN_predicted_peptide_5|287_aa MGRERNRGTETFHSLTKGTAGEASCQVMIRPVEKQRPLTNRYVNPPAGVCAAPAPLPLLA LARRDRRPCSPGAEAAPWQSRRSRRRRRMENFRKVRSEEAPAGCGAEGGGPGSGPFADLA PGAVHMRVKEGSKIRNLMAFATASMAQPATRAIVFSGCGRATTKTVTCAEILKRRLAGLH QVTRLRYRSVREVWQSLPPGPTQGQTPGEPAASLSVLKNVPGLAILLSKDALDPRQPGYQ PPNPHPGPSSPPAAPASKRSLGEPAAGEGSAKRSQPEPGVADEDQTA >gi568815583r:74855987_75056583|GENSCAN_predicted_CDS_5|864_bp atgggaagggaaagaaatcgaggcacagagacgttccatagcttgaccaagggcacagct ggggaagccagctgccaggtcatgatcagaccagtggagaaacagaggcctctgaccaac agatatgtgaacccgcccgccggcgtctgcgctgctccggcgcccttacccctgctggcc cttgcaaggcgcgaccggcggccatgcagccccggggctgaggccgccccatggcaaagc cggcggtcccggcgacgacggcgcatggagaacttccgtaaggtgcgctccgaagaggcg ccagcggggtgcggggccgagggaggcggcccgggctccggccccttcgcagacctggcg ccgggcgcggtgcacatgcgggtcaaggaaggcagcaagatccggaacctgatggccttc gccaccgccagcatggcgcagccagccacgcgcgccatcgtcttcagcggctgcggccgg gccaccaccaaaaccgtcacgtgcgccgagatcctcaagcgccgcctggcgggcctgcac caggtcacgcggctgcgctaccggagcgtacgcgaggtgtggcagagcctcccgcctggg cccacgcagggtcagacgcctggcgagccggccgctagtctcagcgtacttaagaacgtg cccggcctcgccatcctactttccaaggacgcgctggatccgcgacagcccggctaccag cccccgaatccccatcctggtccctcgtccccgccagccgcgccagcgtccaagaggagc ctaggggaacccgcagctggagaaggctccgcgaagcgatcgcaacccgagccaggggtt gcggacgaggatcagacggcctga >gi568815583r:74855987_75056583|GENSCAN_predicted_peptide_6|368_aa MALSACRCQPMHFPVLVDWFGDPLGVKISSDSFMEWINEDNIKKCLCGIFINPPRSGSRP APRRIARVQRSSAQRCGLSAAERPRQDQGLSTSLHWCPCVSLFTGFVYQFRTKRCGQATG PAGNIMAEKVNNFPPLPKFIPLKPCFYQDFEADIPPQHVSMTKRLYYLWMLNSVTLAVNL VGCLAWLIGGGGATNFGLAFLWLILFTPCSYVCWFRPIYKAFKTDSSFSFMAFFFTFMAQ LVISIIQAVGIPGWGVCPTLASSCSGWIATISFFGTNIGSAVVMLIPTVMFTVMAVFSFI ALSMVHKFYRGSGGSFSKAQEEWTTGAWKNPHVQQAAQNAAMGAAQGAMNQPQTQYSATP NYTYSNEM >gi568815583r:74855987_75056583|GENSCAN_predicted_CDS_6|1107_bp atggctctttcagcctgcaggtgtcagcccatgcatttccctgtgcttgtggactggttt ggtgatccactgggtgtcaagatttcttctgatagctttatggaatggatcaatgaggat aacatcaaaaaatgtttatgtggaatcttcatcaacccaccccggagcggctcgcggccg gctccgcgccgcatcgctcgggtgcagcgcagctcagcgcagcgctgcggcctttcggca gccgaacggccgcggcaggatcaaggcttgtccacctccttgcactggtgcccctgtgtc tccctcttcacgggctttgtctatcagttcaggacaaagaggtgtgggcaggccactggg ccagctggtaacatcatggcagagaaagtgaacaacttcccaccattgcccaaattcatc ccgctgaagccatgtttctaccaagacttcgaggcagatattcctccccagcatgtcagc atgaccaagcgcctctactacctctggatgttgaacagcgtcacgctggccgtgaacctg gtgggctgtctcgcgtggctgatcggaggcgggggagccaccaactttggcctcgccttt ctctggctcatcctcttcacaccctgctcctacgtctgctggtttcggcccatttacaag gccttcaagactgacagctccttcagtttcatggcattcttctttaccttcatggctcag ttggtcatcagcatcatccaggccgtgggcatcccaggctggggcgtctgccccacactg gcctcttcctgcagcggctggattgctaccatctccttcttcggaacgaacattggctcg gcggtggtgatgctaattcccactgtcatgttcacagtgatggccgtcttttccttcatc gccctcagcatggttcataaattttaccggggaagtggggggagtttcagcaaagctcag gaggagtggaccacaggggcctggaagaatccacatgtgcagcaggcagcccagaacgca gccatgggggcagcccagggtgccatgaatcagcctcagactcagtattccgccaccccc aattacacgtactccaatgagatgtga >gi568815583r:74855987_75056583|GENSCAN_predicted_peptide_7|204_aa MKEECGPEPPRLEVAVVTTERAKHFYSPQDIPVTLYSDADEWEIWKSRSDPVLHIDLRRW ADLLLVAPLDANTLGKVASGICDNLLVSDVLVPSSVPGPHTQFAELQTSLYKETCCCGAP TCVMRAWDRSKPLLFCPAMNTAMWEHPITAQQVDQLKAFGYVEIPCVAKKLVCGDEGLGA MAEVGTIVDKVKEVLFQHSGFQQS >gi568815583r:74855987_75056583|GENSCAN_predicted_CDS_7|615_bp atgaaggaagaatgtggtccagagccccccaggctggaagtagcagtggtcacaactgag agagccaaacatttctacagcccccaggacattcctgtcaccctctacagcgacgctgat gaatgggagatatggaagagccgctctgacccagttctgcacattgacctgcggaggtgg gcagacctcctgctggtggctcctcttgatgccaacactctggggaaggtggccagtggc atctgtgacaacttgcttgtgagtgatgtcctggtgccctcgtccgtccctgggcctcac acccagtttgctgagctgcagacatccttgtacaaggagacctgctgctgtggggccccg acctgcgtcatgcgggcctgggaccgcagcaagcccctgctcttctgcccggccatgaac accgccatgtgggagcacccgatcacagcgcagcaggtagaccagctcaaggcctttggc tatgtcgagatcccctgtgtggccaagaagctggtgtgcggagatgaaggtctcggggcc atggctgaagtggggaccatcgtggacaaagtgaaagaagtcctcttccagcacagtggc ttccagcagagttga