GENSCAN 1.0 Date run: 8-Nov-116 Time: 10:16:28 Sequence gi568815583f:74928319_75149232 : 220914 bp : 47.91% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 Intr - 914 798 117 0 0 35 86 130 0.066 7.94 1.03 Intr - 9289 9131 159 2 0 41 45 94 0.044 0.36 1.02 Intr - 9718 9597 122 0 2 75 82 143 0.642 12.54 1.01 Init - 14769 14621 149 0 2 76 43 52 0.179 -0.94 1.00 Prom - 23779 23740 40 -5.86 2.05 PlyA - 26122 26117 6 1.05 2.04 Term - 28398 27666 733 2 1 -6 47 892 0.774 68.64 2.03 Intr - 33438 33369 70 1 1 71 100 41 0.890 1.84 2.02 Intr - 35974 35922 53 0 2 80 119 27 0.904 3.55 2.01 Init - 36868 36861 8 1 2 95 81 0 0.794 0.52 2.00 Prom - 38178 38139 40 -6.16 3.00 Prom + 45393 45432 40 -2.46 3.01 Init + 48013 48171 159 0 0 84 63 50 0.284 1.96 3.02 Intr + 67257 67355 99 2 0 34 109 75 0.003 4.51 3.03 Intr + 73938 74006 69 2 0 40 92 57 0.123 0.68 3.04 Intr + 83474 83528 55 1 1 127 110 25 0.727 7.15 3.05 Intr + 84359 84487 129 0 0 61 56 152 0.971 9.97 3.06 Intr + 88275 88431 157 1 1 59 108 177 0.999 15.87 3.07 Intr + 89552 89653 102 2 0 105 99 102 0.999 12.29 3.08 Intr + 90076 90217 142 1 1 85 88 186 0.518 18.66 3.09 Term + 90471 90665 195 2 0 140 46 138 0.985 12.21 3.10 PlyA + 93152 93157 6 1.05 4.00 Prom + 103605 103644 40 -6.16 4.01 Init + 108980 109012 33 1 0 79 84 25 0.526 1.07 4.02 Intr + 115123 115218 96 0 0 98 78 139 0.997 14.11 4.03 Intr + 116068 116298 231 0 0 113 30 250 0.619 19.67 4.04 Intr + 120235 120403 169 0 1 81 98 308 0.939 30.62 4.05 Term + 120832 120917 86 2 2 49 43 131 0.715 2.52 4.06 PlyA + 121403 121408 6 1.05 5.04 PlyA - 122589 122584 6 1.05 5.03 Term - 130448 130348 101 0 2 130 42 13 0.271 -0.81 5.02 Intr - 140769 140583 187 0 1 75 20 95 0.235 0.66 5.01 Init - 148402 148325 78 1 0 62 93 63 0.216 5.26 5.00 Prom - 150803 150764 40 -4.96 6.03 PlyA - 151008 151003 6 1.05 6.02 Term - 158725 158349 377 2 2 37 37 270 0.553 11.70 6.01 Init - 178194 178146 49 0 1 45 57 64 0.063 0.11 6.00 Prom - 180654 180615 40 -5.06 7.00 Prom + 180893 180932 40 -4.16 7.01 Init + 185943 186158 216 2 0 76 86 226 0.897 18.06 7.02 Intr + 186268 186355 88 0 1 38 77 68 0.648 0.24 7.03 Intr + 186740 186871 132 0 0 63 101 11 0.508 0.52 7.04 Intr + 186981 187145 165 1 0 29 44 123 0.161 1.93 7.05 Intr + 202808 202868 61 0 1 120 103 41 0.216 6.59 7.06 Term + 212168 212288 121 2 1 127 43 38 0.324 1.15 7.07 PlyA + 212974 212979 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:74928319_75149232|GENSCAN_predicted_peptide_1|183_aa MTFLSHQKSSCCVLKKEEEAKVQVWLLQRDSEGFQGRKWQEEIKDPEDLDPRAAIAVMLG AALRRCAVAATTRADPRGLLHSARTPGPAVAPTLRPPDREWPPPACSASAKVTLSLRARR LGSEDTIYFRMGHTIILFNQAFLAIQSVRCYSHGSQETDEEFDARWVTYFNKPDIDAWEL RKX >gi568815583f:74928319_75149232|GENSCAN_predicted_CDS_1|549_bp atgacatttctgagccatcagaagtccagctgttgtgttttaaaaaaagaagaagaagca aaagtccaggtgtggcttctccagagggacagcgaaggatttcaaggtaggaaatggcag gaagagataaaagaccctgaagaccttgacccgcgcgccgccatcgccgtcatgctgggc gccgctctccgccgctgcgctgtggccgcaaccacccgggccgaccctcgaggcctcctg cactccgcccggacccccggccccgccgtggcgccgaccctgcggccacctgaccgagag tggccgccgccggcctgcagcgcctcggcgaaggtgacattgagcctccgggcacgccgc ttggggtccgaggacacaatatacttccgtatgggacacacgattatacttttcaatcag gcatttctcgctatccagtcagttcgctgctattcccatgggtcacaggagacagatgag gagtttgatgctcgctgggtaacatacttcaacaagccagatatagatgcctgggaattg cgtaaagnn >gi568815583f:74928319_75149232|GENSCAN_predicted_peptide_2|287_aa MGRERNRGTETFHSLTKGTAGEASCQVMIRPVEKQRPLTNRYVNPPAGVCAAPAPLPLLA LARRDRRPCSPGAEAAPWQSRRSRRRRRMENFRKVRSEEAPAGCGAEGGGPGSGPFADLA PGAVHMRVKEGSKIRNLMAFATASMAQPATRAIVFSGCGRATTKTVTCAEILKRRLAGLH QVTRLRYRSVREVWQSLPPGPTQGQTPGEPAASLSVLKNVPGLAILLSKDALDPRQPGYQ PPNPHPGPSSPPAAPASKRSLGEPAAGEGSAKRSQPEPGVADEDQTA >gi568815583f:74928319_75149232|GENSCAN_predicted_CDS_2|864_bp atgggaagggaaagaaatcgaggcacagagacgttccatagcttgaccaagggcacagct ggggaagccagctgccaggtcatgatcagaccagtggagaaacagaggcctctgaccaac agatatgtgaacccgcccgccggcgtctgcgctgctccggcgcccttacccctgctggcc cttgcaaggcgcgaccggcggccatgcagccccggggctgaggccgccccatggcaaagc cggcggtcccggcgacgacggcgcatggagaacttccgtaaggtgcgctccgaagaggcg ccagcggggtgcggggccgagggaggcggcccgggctccggccccttcgcagacctggcg ccgggcgcggtgcacatgcgggtcaaggaaggcagcaagatccggaacctgatggccttc gccaccgccagcatggcgcagccagccacgcgcgccatcgtcttcagcggctgcggccgg gccaccaccaaaaccgtcacgtgcgccgagatcctcaagcgccgcctggcgggcctgcac caggtcacgcggctgcgctaccggagcgtacgcgaggtgtggcagagcctcccgcctggg cccacgcagggtcagacgcctggcgagccggccgctagtctcagcgtacttaagaacgtg cccggcctcgccatcctactttccaaggacgcgctggatccgcgacagcccggctaccag cccccgaatccccatcctggtccctcgtccccgccagccgcgccagcgtccaagaggagc ctaggggaacccgcagctggagaaggctccgcgaagcgatcgcaacccgagccaggggtt gcggacgaggatcagacggcctga >gi568815583f:74928319_75149232|GENSCAN_predicted_peptide_3|368_aa MALSACRCQPMHFPVLVDWFGDPLGVKISSDSFMEWINEDNIKKCLCGIFINPPRSGSRP APRRIARVQRSSAQRCGLSAAERPRQDQGLSTSLHWCPCVSLFTGFVYQFRTKRCGQATG PAGNIMAEKVNNFPPLPKFIPLKPCFYQDFEADIPPQHVSMTKRLYYLWMLNSVTLAVNL VGCLAWLIGGGGATNFGLAFLWLILFTPCSYVCWFRPIYKAFKTDSSFSFMAFFFTFMAQ LVISIIQAVGIPGWGVCPTLASSCSGWIATISFFGTNIGSAVVMLIPTVMFTVMAVFSFI ALSMVHKFYRGSGGSFSKAQEEWTTGAWKNPHVQQAAQNAAMGAAQGAMNQPQTQYSATP NYTYSNEM >gi568815583f:74928319_75149232|GENSCAN_predicted_CDS_3|1107_bp atggctctttcagcctgcaggtgtcagcccatgcatttccctgtgcttgtggactggttt ggtgatccactgggtgtcaagatttcttctgatagctttatggaatggatcaatgaggat aacatcaaaaaatgtttatgtggaatcttcatcaacccaccccggagcggctcgcggccg gctccgcgccgcatcgctcgggtgcagcgcagctcagcgcagcgctgcggcctttcggca gccgaacggccgcggcaggatcaaggcttgtccacctccttgcactggtgcccctgtgtc tccctcttcacgggctttgtctatcagttcaggacaaagaggtgtgggcaggccactggg ccagctggtaacatcatggcagagaaagtgaacaacttcccaccattgcccaaattcatc ccgctgaagccatgtttctaccaagacttcgaggcagatattcctccccagcatgtcagc atgaccaagcgcctctactacctctggatgttgaacagcgtcacgctggccgtgaacctg gtgggctgtctcgcgtggctgatcggaggcgggggagccaccaactttggcctcgccttt ctctggctcatcctcttcacaccctgctcctacgtctgctggtttcggcccatttacaag gccttcaagactgacagctccttcagtttcatggcattcttctttaccttcatggctcag ttggtcatcagcatcatccaggccgtgggcatcccaggctggggcgtctgccccacactg gcctcttcctgcagcggctggattgctaccatctccttcttcggaacgaacattggctcg gcggtggtgatgctaattcccactgtcatgttcacagtgatggccgtcttttccttcatc gccctcagcatggttcataaattttaccggggaagtggggggagtttcagcaaagctcag gaggagtggaccacaggggcctggaagaatccacatgtgcagcaggcagcccagaacgca gccatgggggcagcccagggtgccatgaatcagcctcagactcagtattccgccaccccc aattacacgtactccaatgagatgtga >gi568815583f:74928319_75149232|GENSCAN_predicted_peptide_4|204_aa MKEECGPEPPRLEVAVVTTERAKHFYSPQDIPVTLYSDADEWEIWKSRSDPVLHIDLRRW ADLLLVAPLDANTLGKVASGICDNLLVSDVLVPSSVPGPHTQFAELQTSLYKETCCCGAP TCVMRAWDRSKPLLFCPAMNTAMWEHPITAQQVDQLKAFGYVEIPCVAKKLVCGDEGLGA MAEVGTIVDKVKEVLFQHSGFQQS >gi568815583f:74928319_75149232|GENSCAN_predicted_CDS_4|615_bp atgaaggaagaatgtggtccagagccccccaggctggaagtagcagtggtcacaactgag agagccaaacatttctacagcccccaggacattcctgtcaccctctacagcgacgctgat gaatgggagatatggaagagccgctctgacccagttctgcacattgacctgcggaggtgg gcagacctcctgctggtggctcctcttgatgccaacactctggggaaggtggccagtggc atctgtgacaacttgcttgtgagtgatgtcctggtgccctcgtccgtccctgggcctcac acccagtttgctgagctgcagacatccttgtacaaggagacctgctgctgtggggccccg acctgcgtcatgcgggcctgggaccgcagcaagcccctgctcttctgcccggccatgaac accgccatgtgggagcacccgatcacagcgcagcaggtagaccagctcaaggcctttggc tatgtcgagatcccctgtgtggccaagaagctggtgtgcggagatgaaggtctcggggcc atggctgaagtggggaccatcgtggacaaagtgaaagaagtcctcttccagcacagtggc ttccagcagagttga >gi568815583f:74928319_75149232|GENSCAN_predicted_peptide_5|121_aa MLGPSPDTMNSSRKWTNSWAEENRSPPGFCKAFSMLALLLQMLGSRGPKRTTILQMQSEP EGGKITSFDLHAIPLLMQPKLLAKEATCRIQRPERESAGPPVPKDLLSRLAGKASPCNLS R >gi568815583f:74928319_75149232|GENSCAN_predicted_CDS_5|366_bp atgttgggaccaagccccgacaccatgaacagcagcaggaagtggaccaattcctgggca gaagaaaacaggtccccgcctggcttttgcaaagccttcagcatgctggctcttctcctg cagatgttgggcagcaggggccccaaaaggaccacaatccttcagatgcagtctgagcca gaaggaggaaaaatcacctcctttgacctgcatgctataccactattaatgcagcccaag ctattagctaaggaggctacatgcagaatccagaggcctgaaagggagtctgctgggcct ccagtccctaaggacctgctgtctaggctagcaggcaaagcatcaccatgtaatctcagc agataa >gi568815583f:74928319_75149232|GENSCAN_predicted_peptide_6|141_aa MQINASEDDIEDPGTPDESQWFQDGRIGTAPLYSSQHKQHRRRVISAFPTEEHSSSQATE QSWMENYFDELREEGFRRSVITNFSELKKDVRTHCKEAKNLEKRLDEWLTRINSVEKTLN YLMELKTMAREHHDACTSISS >gi568815583f:74928319_75149232|GENSCAN_predicted_CDS_6|426_bp atgcaaataaatgcaagtgaagacgatattgaggaccctggtactcctgatgagagtcag tggttccaagatggccgaataggaacagctccactctacagctcccagcataagcaacac agaagacgggtgatttctgcattcccaactgaggaacacagctcctcgcaagcaacggaa caaagctggatggagaattactttgatgagttgagagaagaaggcttcagacgatcagta ataacaaacttctccgagctaaagaaggatgttcgaacccattgcaaagaagctaaaaac cttgaaaaaagattagacgaatggctaactagaataaacagcgtagagaagaccttaaat tacctgatggagctgaaaaccatggcacgagaacaccatgatgcatgcacaagcatcagt agctga >gi568815583f:74928319_75149232|GENSCAN_predicted_peptide_7|260_aa MAVVPLLLSGGLWNTVGASNLAVTRGSMVKLLEMHYSVHLQSHDVLYGSGTGQQSVTSVT SMDDSNSYWRIWEVGAFGEEGEGSHLDDWTILCNGPYWVRDDLGQYFLSRGLAGDRVDAF YSYWCVRNWWVLGLTDFKNEATDPCGVKLRTFAVSVTALKAVRLELFVPPGGFVVSLASG VKLQTFAVSVTAHKSKVDPKTPDNHYSTFCFYEFDYYRQLMDREPEVEGATTLPEPSLPA VVSWAPTPSAWNLPALIKAG >gi568815583f:74928319_75149232|GENSCAN_predicted_CDS_7|783_bp atggctgtggttccgctactgctgtctgggggtttgtggaacaccgtgggagcgtccaat ctggctgttactcgcggctctatggtgaagctgctcgaaatgcactacagtgtccatctg cagtcacacgatgtgctctatgggtcaggtactgggcagcagtcagtgacaagtgtaacc tccatggatgacagcaacagttactggagaatatgggaagtgggtgcttttggtgaggaa ggtgaaggtagtcacctggatgactggaccatactctgtaacgggccctactgggtgaga gatgacttgggtcagtactttctgagtagaggacttgctggtgacagggtggatgctttt tattcatactggtgtgtccggaattggtgggttcttggtctcactgacttcaagaatgaa gccacggacccttgcggagtgaagctgcggacctttgccgtgagtgttacagctcttaag gcggtgcgtctggagttgttcgttcctcccggtgggttcgtggtctcgttggcttcagga gtgaagctgcagacctttgctgtgagtgttacagctcataaaagcaaggtggacccaaag acccctgacaaccactattctactttctgtttctatgaatttgactactatagacaactc atggacagagagccggaggtggaaggggccacaactctaccagagccttcccttccagct gttgtctcctgggcgcccacccccagtgcctggaacttgccagctctcatcaaggctggc tag