GENSCAN 1.0 Date run: 3-Nov-116 Time: 23:45:27 Sequence gi568815583f:74912674_75118980 : 206307 bp : 47.98% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 PlyA - 1224 1219 6 1.05 1.06 Term - 11097 10984 114 0 0 71 40 114 0.895 3.47 1.05 Intr - 14214 14093 122 1 2 96 86 65 0.992 7.31 1.04 Intr - 16559 16443 117 0 0 35 86 130 0.929 7.94 1.03 Intr - 24934 24776 159 2 0 41 45 94 0.416 0.36 1.02 Intr - 25363 25242 122 0 2 75 82 143 0.643 12.54 1.01 Init - 30414 30266 149 0 2 76 43 52 0.179 -0.94 1.00 Prom - 39424 39385 40 -5.86 2.05 PlyA - 41767 41762 6 1.05 2.04 Term - 44043 43311 733 2 1 -6 47 892 0.774 68.64 2.03 Intr - 49083 49014 70 1 1 71 100 41 0.890 1.84 2.02 Intr - 51619 51567 53 0 2 80 119 27 0.904 3.55 2.01 Init - 52513 52506 8 1 2 95 81 0 0.794 0.52 2.00 Prom - 53823 53784 40 -6.16 3.00 Prom + 61038 61077 40 -2.46 3.01 Init + 63658 63816 159 0 0 84 63 50 0.284 1.96 3.02 Intr + 82902 83000 99 2 0 34 109 75 0.003 4.51 3.03 Intr + 89583 89651 69 2 0 40 92 57 0.123 0.68 3.04 Intr + 99119 99173 55 1 1 127 110 25 0.727 7.15 3.05 Intr + 100004 100132 129 0 0 61 56 152 0.971 9.97 3.06 Intr + 103920 104076 157 1 1 59 108 177 0.999 15.87 3.07 Intr + 105197 105298 102 2 0 105 99 102 0.999 12.29 3.08 Intr + 105721 105862 142 1 1 85 88 186 0.518 18.66 3.09 Term + 106116 106310 195 2 0 140 46 138 0.985 12.21 3.10 PlyA + 108797 108802 6 1.05 4.00 Prom + 119250 119289 40 -6.16 4.01 Init + 124625 124657 33 1 0 79 84 25 0.526 1.07 4.02 Intr + 130768 130863 96 0 0 98 78 139 0.997 14.11 4.03 Intr + 131713 131943 231 0 0 113 30 250 0.619 19.67 4.04 Intr + 135880 136048 169 0 1 81 98 308 0.939 30.62 4.05 Term + 136477 136562 86 2 2 49 43 131 0.715 2.52 4.06 PlyA + 137048 137053 6 1.05 5.04 PlyA - 138234 138229 6 1.05 5.03 Term - 146093 145993 101 0 2 130 42 13 0.271 -0.81 5.02 Intr - 156414 156228 187 0 1 75 20 95 0.235 0.66 5.01 Init - 164047 163970 78 1 0 62 93 63 0.216 5.26 5.00 Prom - 166448 166409 40 -4.96 6.05 PlyA - 166653 166648 6 1.05 6.04 Term - 174370 173994 377 2 2 37 37 270 0.553 11.70 6.03 Intr - 197922 197790 133 0 1 73 75 107 0.751 8.12 6.02 Intr - 202748 202450 299 0 2 26 10 197 0.008 2.29 6.01 Intr - 203118 202916 203 2 2 56 26 138 0.027 3.33 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 201588 201803 216 2 0 76 86 226 0.860 18.06 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:74912674_75118980|GENSCAN_predicted_peptide_1|260_aa MTFLSHQKSSCCVLKKEEEAKVQVWLLQRDSEGFQGRKWQEEIKDPEDLDPRAAIAVMLG AALRRCAVAATTRADPRGLLHSARTPGPAVAPTLRPPDREWPPPACSASAKVTLSLRARR LGSEDTIYFRMGHTIILFNQAFLAIQSVRCYSHGSQETDEEFDARWVTYFNKPDIDAWEL RKGINTLVTYDMVPEPKIIDAALRACRRLNDFASTVRILEVVKDKAGPHKEIYPYVIQEL RPTLNELGISTPEELGLDKV >gi568815583f:74912674_75118980|GENSCAN_predicted_CDS_1|783_bp atgacatttctgagccatcagaagtccagctgttgtgttttaaaaaaagaagaagaagca aaagtccaggtgtggcttctccagagggacagcgaaggatttcaaggtaggaaatggcag gaagagataaaagaccctgaagaccttgacccgcgcgccgccatcgccgtcatgctgggc gccgctctccgccgctgcgctgtggccgcaaccacccgggccgaccctcgaggcctcctg cactccgcccggacccccggccccgccgtggcgccgaccctgcggccacctgaccgagag tggccgccgccggcctgcagcgcctcggcgaaggtgacattgagcctccgggcacgccgc ttggggtccgaggacacaatatacttccgtatgggacacacgattatacttttcaatcag gcatttctcgctatccagtcagttcgctgctattcccatgggtcacaggagacagatgag gagtttgatgctcgctgggtaacatacttcaacaagccagatatagatgcctgggaattg cgtaaagggataaacacacttgttacctatgatatggttccagagcccaaaatcattgat gctgctttgcgggcatgcagacggttaaatgattttgctagtacagttcgtatcctagag gttgttaaggacaaagcaggacctcataaggaaatctacccctatgtcatccaggaactt agaccaactttaaatgaactgggaatctccactccggaggaactgggccttgacaaagtg taa >gi568815583f:74912674_75118980|GENSCAN_predicted_peptide_2|287_aa MGRERNRGTETFHSLTKGTAGEASCQVMIRPVEKQRPLTNRYVNPPAGVCAAPAPLPLLA LARRDRRPCSPGAEAAPWQSRRSRRRRRMENFRKVRSEEAPAGCGAEGGGPGSGPFADLA PGAVHMRVKEGSKIRNLMAFATASMAQPATRAIVFSGCGRATTKTVTCAEILKRRLAGLH QVTRLRYRSVREVWQSLPPGPTQGQTPGEPAASLSVLKNVPGLAILLSKDALDPRQPGYQ PPNPHPGPSSPPAAPASKRSLGEPAAGEGSAKRSQPEPGVADEDQTA >gi568815583f:74912674_75118980|GENSCAN_predicted_CDS_2|864_bp atgggaagggaaagaaatcgaggcacagagacgttccatagcttgaccaagggcacagct ggggaagccagctgccaggtcatgatcagaccagtggagaaacagaggcctctgaccaac agatatgtgaacccgcccgccggcgtctgcgctgctccggcgcccttacccctgctggcc cttgcaaggcgcgaccggcggccatgcagccccggggctgaggccgccccatggcaaagc cggcggtcccggcgacgacggcgcatggagaacttccgtaaggtgcgctccgaagaggcg ccagcggggtgcggggccgagggaggcggcccgggctccggccccttcgcagacctggcg ccgggcgcggtgcacatgcgggtcaaggaaggcagcaagatccggaacctgatggccttc gccaccgccagcatggcgcagccagccacgcgcgccatcgtcttcagcggctgcggccgg gccaccaccaaaaccgtcacgtgcgccgagatcctcaagcgccgcctggcgggcctgcac caggtcacgcggctgcgctaccggagcgtacgcgaggtgtggcagagcctcccgcctggg cccacgcagggtcagacgcctggcgagccggccgctagtctcagcgtacttaagaacgtg cccggcctcgccatcctactttccaaggacgcgctggatccgcgacagcccggctaccag cccccgaatccccatcctggtccctcgtccccgccagccgcgccagcgtccaagaggagc ctaggggaacccgcagctggagaaggctccgcgaagcgatcgcaacccgagccaggggtt gcggacgaggatcagacggcctga >gi568815583f:74912674_75118980|GENSCAN_predicted_peptide_3|368_aa MALSACRCQPMHFPVLVDWFGDPLGVKISSDSFMEWINEDNIKKCLCGIFINPPRSGSRP APRRIARVQRSSAQRCGLSAAERPRQDQGLSTSLHWCPCVSLFTGFVYQFRTKRCGQATG PAGNIMAEKVNNFPPLPKFIPLKPCFYQDFEADIPPQHVSMTKRLYYLWMLNSVTLAVNL VGCLAWLIGGGGATNFGLAFLWLILFTPCSYVCWFRPIYKAFKTDSSFSFMAFFFTFMAQ LVISIIQAVGIPGWGVCPTLASSCSGWIATISFFGTNIGSAVVMLIPTVMFTVMAVFSFI ALSMVHKFYRGSGGSFSKAQEEWTTGAWKNPHVQQAAQNAAMGAAQGAMNQPQTQYSATP NYTYSNEM >gi568815583f:74912674_75118980|GENSCAN_predicted_CDS_3|1107_bp atggctctttcagcctgcaggtgtcagcccatgcatttccctgtgcttgtggactggttt ggtgatccactgggtgtcaagatttcttctgatagctttatggaatggatcaatgaggat aacatcaaaaaatgtttatgtggaatcttcatcaacccaccccggagcggctcgcggccg gctccgcgccgcatcgctcgggtgcagcgcagctcagcgcagcgctgcggcctttcggca gccgaacggccgcggcaggatcaaggcttgtccacctccttgcactggtgcccctgtgtc tccctcttcacgggctttgtctatcagttcaggacaaagaggtgtgggcaggccactggg ccagctggtaacatcatggcagagaaagtgaacaacttcccaccattgcccaaattcatc ccgctgaagccatgtttctaccaagacttcgaggcagatattcctccccagcatgtcagc atgaccaagcgcctctactacctctggatgttgaacagcgtcacgctggccgtgaacctg gtgggctgtctcgcgtggctgatcggaggcgggggagccaccaactttggcctcgccttt ctctggctcatcctcttcacaccctgctcctacgtctgctggtttcggcccatttacaag gccttcaagactgacagctccttcagtttcatggcattcttctttaccttcatggctcag ttggtcatcagcatcatccaggccgtgggcatcccaggctggggcgtctgccccacactg gcctcttcctgcagcggctggattgctaccatctccttcttcggaacgaacattggctcg gcggtggtgatgctaattcccactgtcatgttcacagtgatggccgtcttttccttcatc gccctcagcatggttcataaattttaccggggaagtggggggagtttcagcaaagctcag gaggagtggaccacaggggcctggaagaatccacatgtgcagcaggcagcccagaacgca gccatgggggcagcccagggtgccatgaatcagcctcagactcagtattccgccaccccc aattacacgtactccaatgagatgtga >gi568815583f:74912674_75118980|GENSCAN_predicted_peptide_4|204_aa MKEECGPEPPRLEVAVVTTERAKHFYSPQDIPVTLYSDADEWEIWKSRSDPVLHIDLRRW ADLLLVAPLDANTLGKVASGICDNLLVSDVLVPSSVPGPHTQFAELQTSLYKETCCCGAP TCVMRAWDRSKPLLFCPAMNTAMWEHPITAQQVDQLKAFGYVEIPCVAKKLVCGDEGLGA MAEVGTIVDKVKEVLFQHSGFQQS >gi568815583f:74912674_75118980|GENSCAN_predicted_CDS_4|615_bp atgaaggaagaatgtggtccagagccccccaggctggaagtagcagtggtcacaactgag agagccaaacatttctacagcccccaggacattcctgtcaccctctacagcgacgctgat gaatgggagatatggaagagccgctctgacccagttctgcacattgacctgcggaggtgg gcagacctcctgctggtggctcctcttgatgccaacactctggggaaggtggccagtggc atctgtgacaacttgcttgtgagtgatgtcctggtgccctcgtccgtccctgggcctcac acccagtttgctgagctgcagacatccttgtacaaggagacctgctgctgtggggccccg acctgcgtcatgcgggcctgggaccgcagcaagcccctgctcttctgcccggccatgaac accgccatgtgggagcacccgatcacagcgcagcaggtagaccagctcaaggcctttggc tatgtcgagatcccctgtgtggccaagaagctggtgtgcggagatgaaggtctcggggcc atggctgaagtggggaccatcgtggacaaagtgaaagaagtcctcttccagcacagtggc ttccagcagagttga >gi568815583f:74912674_75118980|GENSCAN_predicted_peptide_5|121_aa MLGPSPDTMNSSRKWTNSWAEENRSPPGFCKAFSMLALLLQMLGSRGPKRTTILQMQSEP EGGKITSFDLHAIPLLMQPKLLAKEATCRIQRPERESAGPPVPKDLLSRLAGKASPCNLS R >gi568815583f:74912674_75118980|GENSCAN_predicted_CDS_5|366_bp atgttgggaccaagccccgacaccatgaacagcagcaggaagtggaccaattcctgggca gaagaaaacaggtccccgcctggcttttgcaaagccttcagcatgctggctcttctcctg cagatgttgggcagcaggggccccaaaaggaccacaatccttcagatgcagtctgagcca gaaggaggaaaaatcacctcctttgacctgcatgctataccactattaatgcagcccaag ctattagctaaggaggctacatgcagaatccagaggcctgaaagggagtctgctgggcct ccagtccctaaggacctgctgtctaggctagcaggcaaagcatcaccatgtaatctcagc agataa >gi568815583f:74912674_75118980|GENSCAN_predicted_peptide_6|337_aa XVCECTSGHSVSGYSGRYRVDLENLCGHTLYLANLVGTWRTFVSSSGIVNAPISALSKQT TGLYHSAGSKVCSFTPEANETTNPPGGTNNSRRTALRAVTLTAKVRSFTPEPVRPGTHQK EETPNTSEHQKEQTPDTPPLRTVTLTARVRGFILEVSETKNPPIPDTPTLTLTPCQASIL QYDFQELPECWLVTAAALLLVTSISFEQSSVSDESQWFQDGRIGTAPLYSSQHKQHRRRV ISAFPTEEHSSSQATEQSWMENYFDELREEGFRRSVITNFSELKKDVRTHCKEAKNLEKR LDEWLTRINSVEKTLNYLMELKTMAREHHDACTSISS >gi568815583f:74912674_75118980|GENSCAN_predicted_CDS_6|1014_bp nnggtttgtgaatgcaccagtggacactctgtatctggctactctggtagatacagagtg gacttggagaacctttgtggccacactctgtatctagctaatctggtggggacgtggaga acctttgtgtctagctcagggattgtaaacgcaccaatcagcgccctgtcaaaacagacc actggactctaccactcagcaggatcaaaggtctgcagcttcactcctgaagccaacgag accacgaacccaccgggaggaacgaacaactccagacgcaccgccttaagagctgtaaca ctcacggcaaaggtccgcagcttcactcctgagccagtgagaccaggaacccaccagaag gaagaaactccaaacacatccgaacatcagaaggaacaaactccggacacaccgccttta agaactgtaacactcaccgcaagggtccgtggcttcattcttgaagtcagtgagaccaag aacccaccaattccggacacaccaacccttaccctcaccccttgccaggccagcattctg caatatgatttccaggagcttccagaatgctggcttgttacagctgctgccctgctctta gtgaccagcatctccttcgagcagtcctctgtgtcagatgagagtcagtggttccaagat ggccgaataggaacagctccactctacagctcccagcataagcaacacagaagacgggtg atttctgcattcccaactgaggaacacagctcctcgcaagcaacggaacaaagctggatg gagaattactttgatgagttgagagaagaaggcttcagacgatcagtaataacaaacttc tccgagctaaagaaggatgttcgaacccattgcaaagaagctaaaaaccttgaaaaaaga ttagacgaatggctaactagaataaacagcgtagagaagaccttaaattacctgatggag ctgaaaaccatggcacgagaacaccatgatgcatgcacaagcatcagtagctga