GENSCAN 1.0 Date run: 4-Nov-116 Time: 09:50:14 Sequence gi568815585f:97334471_97564439 : 229969 bp : 39.21% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1980 2156 177 2 0 63 -46 195 0.334 2.91 1.02 Intr + 2726 2842 117 1 0 56 52 102 0.351 3.14 1.03 Intr + 8546 8746 201 1 0 109 65 91 0.976 7.46 1.04 Intr + 12334 12597 264 0 0 81 115 235 0.979 22.29 1.05 Intr + 23012 23165 154 1 1 84 91 182 0.979 16.82 1.06 Term + 32575 32723 149 2 2 45 38 108 0.185 -1.42 1.07 PlyA + 32982 32987 6 1.05 2.08 PlyA - 33627 33622 6 1.05 2.07 Term - 37160 36780 381 2 0 77 32 156 0.682 2.75 2.06 Intr - 37614 37525 90 0 0 47 84 67 0.686 1.47 2.05 Intr - 40803 40663 141 0 0 38 111 86 0.861 5.53 2.04 Intr - 41159 41098 62 0 2 48 94 28 0.262 -3.07 2.03 Intr - 46556 46424 133 2 1 38 115 65 0.277 3.50 2.02 Intr - 51807 51720 88 2 1 43 83 58 0.308 -0.15 2.01 Init - 53287 53070 218 1 2 93 50 155 0.397 10.42 2.00 Prom - 64939 64900 40 -1.95 3.00 Prom + 67457 67496 40 -6.15 3.01 Init + 71229 71310 82 2 1 75 42 64 0.953 1.68 3.02 Term + 72789 72937 149 1 2 78 33 178 0.990 8.38 3.03 PlyA + 73200 73205 6 1.05 4.00 Prom + 77098 77137 40 -7.65 4.01 Init + 77940 77993 54 2 0 67 64 69 0.300 3.73 4.02 Intr + 78270 78332 63 2 0 88 94 38 0.297 2.40 4.03 Intr + 84565 84735 171 0 0 77 75 167 0.201 13.52 4.04 Intr + 95911 95989 79 0 1 60 48 80 0.008 -0.59 4.05 Intr + 98350 98625 276 2 0 55 64 225 0.619 13.37 4.06 Intr + 98790 98920 131 1 2 18 -3 166 0.414 -0.01 4.07 Intr + 99956 100314 359 1 2 -14 89 716 0.281 54.63 4.08 Term + 129735 129972 238 0 1 97 39 297 0.908 20.36 4.09 PlyA + 131203 131208 6 1.05 5.00 Prom + 133650 133689 40 -6.35 5.01 Init + 140400 140668 269 2 2 59 80 246 0.709 17.40 5.02 Intr + 141285 141572 288 0 0 9 27 189 0.388 0.34 5.03 Term + 141688 141949 262 1 1 -4 53 277 0.763 9.01 5.04 PlyA + 142327 142332 6 1.05 6.00 Prom + 147791 147830 40 -6.55 6.01 Init + 161082 161301 220 2 1 59 49 133 0.666 5.34 6.02 Intr + 163822 163945 124 2 1 82 83 109 0.923 8.52 6.03 Term + 167240 167495 256 2 1 52 48 145 0.527 0.97 6.04 PlyA + 168862 168867 6 1.05 7.00 Prom + 173897 173936 40 -3.45 7.01 Init + 179192 179260 69 1 0 47 115 24 0.395 2.12 7.02 Intr + 180470 180562 93 1 0 87 105 56 0.461 6.44 7.03 Intr + 181430 181632 203 1 2 43 91 82 0.057 1.16 7.04 Intr + 190700 190783 84 2 0 93 80 53 0.060 3.02 7.05 Intr + 197079 197158 80 0 2 134 52 42 0.022 3.48 7.06 Intr + 199009 199202 194 2 2 65 -40 150 0.274 -1.81 7.07 Term + 202077 202193 117 2 0 121 41 109 0.481 7.06 7.08 PlyA + 203284 203289 6 1.05 8.05 PlyA - 204455 204450 6 1.05 8.04 Term - 205667 205588 80 0 2 14 47 184 0.052 3.85 8.03 Intr - 212881 212720 162 2 0 89 72 77 0.015 5.23 8.02 Intr - 223670 223571 100 2 1 103 48 76 0.014 3.86 8.01 Init - 224394 224332 63 0 0 43 77 69 0.509 2.50 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 174302 174227 76 2 1 85 89 71 0.886 8.20 S.002 Term - 223670 223524 147 2 0 103 38 131 0.973 6.52 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:97334471_97564439|GENSCAN_predicted_peptide_1|353_aa MGRPSWIECSCKRPYKRRSYKREGELALTEEEQGSVTTEADWIDVATAEECQQPPEDGRQ EGTVYEPGSWSSSDIKSAGTLIWGVPASKTVRNSIQLFPTFPVGPAIGTNTAISFAPYLA PVTPGVGLVPTEILPTTPVIVPGSPPVTVPGSTATQKLLRTDKLEVCREFQRGNCARGET DCRFAHPADSTMIDTSDNTVTVCMDYIKGRCMREKCKYFHPPAHLQAKIKAAQHQANQAA VAAQAAAAAATVMAFPPGALHPLPKRQALEKSNGTSAVFNPSVLHYQQALTSAQLQQHAA FIPTGSAKLTWFPPYNLLQYWPKSQTIRNEESWVPLNSQRLDITDEEHLTLLV >gi568815585f:97334471_97564439|GENSCAN_predicted_CDS_1|1062_bp atggggagaccctcctggattgaatgcagttgcaagcgtccttataagagacgctcatat aagagagagggagagttagcactgacagaagaggagcaaggcagtgtgaccacagaggca gattggattgatgtggccacagcagaggaatgccagcaaccaccagaagatggaagacaa gaaggcactgtctatgaaccagggagctggtcctcatcagacatcaaatctgcgggcacc ttgatttggggggttccagcttccaaaactgtgagaaattcaattcagttgtttcccact ttccctgtaggtcccgcgatagggacaaatacggctattagctttgctccttacctagca cctgtaacccctggagttgggttggtcccaacggaaattctgcccaccacgcctgttatt gttcccggaagtccaccggtcactgtcccgggctcaactgcaactcagaaacttctcagg actgacaaactggaggtatgcagggagttccagcgaggaaactgtgcccggggagagacc gactgccgctttgcacaccccgcagacagcaccatgatcgacacaagtgacaacaccgta accgtttgtatggattacataaaggggcgttgcatgagggagaaatgcaaatattttcac cctcctgcacacttgcaggccaaaatcaaagctgcgcagcaccaagccaaccaagctgcg gtggccgcccaggcagccgcggccgcggccacagtcatggcctttccccctggtgctctt catcctttaccaaagagacaagcacttgaaaaaagcaatggtaccagcgcggtctttaac cccagcgtcttgcactaccagcaggctctcaccagcgcacagttgcagcaacacgccgcg ttcattccaacaggttctgccaaactcacttggttcccaccttacaatctgctccaatac tggcctaagagccaaaccatccggaatgaagagtcatgggtgccgctgaactcacaaagg ctggatataactgatgaggagcatttaactctgttggtttaa >gi568815585f:97334471_97564439|GENSCAN_predicted_peptide_2|370_aa MEKMTCIILTGLHNVQSACTNQHEFGDVTLLIGEGIPSRAWGRSEIGRRDTEVPKSMAVP HGACFRGCLEEPLQYPLYSLSLDGFLSWHQPTLQATSWQAAVVQRKNQKEYWRGISGGLS VPPVRFVPFRALVPHLACITLAPLQCGKQSPGGFNIMACGKQMAGAWSGQQTQSSLTLGV TASSNVTPEECQKGLRDQSKCQCVFDVWQHLMCLSSDLETWTICIDSTPEGTLSFKQKPV VGGRSTVSITILLENPSVAKSGPNPLTWHQGTTQSIRIKPTLQPLPQTDALINLNGSYSP AELDTSPAFFVDLHLSFFLFNDGALRNVQSPCRIQRTGKVLLAGGGGERCSGKDSEYKLY FNCAFKNVIK >gi568815585f:97334471_97564439|GENSCAN_predicted_CDS_2|1113_bp atggaaaagatgacgtgcataattttaactggcttgcacaatgtacaaagtgcttgcaca aaccagcatgagtttggagatgtcactctgctcattggtgaaggcatcccctccagggca tggggacggtctgagatagggcgacgggacacagaggttccaaagagcatggctgtgcct catggagcctgcttcagagggtgcttagaagaacccctgcaatatccactatattcacta agtctggatggcttcctgagctggcatcagccaacgctacaggcaacatcatggcaggca gcagtggtgcaaaggaaaaatcaaaaagagtactggaggggaatttcaggaggcttgtct gtgccaccagtcagatttgtgccctttcgagccttagttcctcacttggcttgtatcacg ctggcacctctgcagtgcgggaaacagagtcccggtgggtttaacataatggcgtgtgga aagcagatggctggtgcatggagtggtcagcagacccagtccagtctgaccttgggagtc actgcctccagcaatgtcaccccagaggaatgccagaaggggctgagagaccagagcaaa tgccagtgtgtctttgacgtgtggcagcatcttatgtgtttgagctctgatttggagaca tggacaatatgcattgactctacacctgaggggactctttcctttaagcagaagcctgtg gttggaggtaggagtactgtttccatcactattctgcttgaaaatccttcagtggccaag tcagggccgaaccctttaacttggcatcaaggtaccacacaatcaatcagaatcaaacct acgctgcagccgctcccacaaacggatgctctgatcaatctgaatgggagttattctcca gctgaacttgatacttccccagctttctttgtagacttgcatttgtctttctttctattt aatgatggtgccttaaggaatgtgcaaagtccttgtagaattcagaggacagggaaagta cttcttgctgggggaggaggagagagatgttcagggaaagattcggagtataaattgtat tttaattgtgcctttaagaatgtgattaagtaa >gi568815585f:97334471_97564439|GENSCAN_predicted_peptide_3|76_aa MGLYSTKSLVGPLMEMPSCSTCDFQSYFLPSTNEDKKIEHLASAVMMMNPVRISLEMRMK VQGAGVPARTSPTQDT >gi568815585f:97334471_97564439|GENSCAN_predicted_CDS_3|231_bp atggggctctactccaccaagtcacttgtgggcccactgatggagatgccatcctgttca acttgtgacttccaaagttattttttaccatctacaaatgaggacaagaagattgaacac ttagcgagtgctgtcatgatgatgaatcctgtgagaatttcactggagatgaggatgaaa gtacaaggagctggggtccccgctagaacatcaccaactcaggacacttag >gi568815585f:97334471_97564439|GENSCAN_predicted_peptide_4|456_aa MTLSHTLSPEEEHMQDCKSRIPAVLLNHTKQKLTKTQIKASSVPARAQPYSDCLVSSLIP REPAIQNCKGKQIQENPTQVSECLYTNHILTEATVKILNCIRSLTGQETIVCSPYTCQQR FTGRMPRRRREKWNSSFWLATHLVLLPEQGTATGSSISAPPKPQQPVPVSIPPASVARGS RALVGLAEASVTRQLTYDWLAERARRAPPGKALGAGCCYEESRAFRLPFGIPLSRFSTHP WLTGRGKSSTSVRHRKGRVYVAGGGGGGGRGGTMREYKVVVLGSGGVGKSALTVQFVTGT FIEKYDPTIEDFYRKEIEVDSSPSVLEILDTAGTEQFASMRDLYIKNGQGFILVYSLVNQ QSFQDIKPMRDQIIRVKRYEKVPVILVGNKVDLESEREVSSSEGRALAEEWGCPFMETSA KSKTMVDELFAEIVRQMNYAAQPDKDDPCCSACNIQ >gi568815585f:97334471_97564439|GENSCAN_predicted_CDS_4|1371_bp atgacactgagtcacactttgagtcctgaagaagagcacatgcaggactgcaagagtaga attccagctgtcttactaaaccacactaaacaaaaacttactaaaactcagatcaaggcc tcatcagtccctgccagagcacagccttactctgactgcctggtgtcatcactgatccca agagaaccagccatccaaaactgcaaaggaaaacaaatccaggaaaaccctacacaggtc agtgaatgcctctacaccaatcacatcttgactgaagctaccgtgaagattctcaactgt attaggagcctcacaggtcaagaaaccatcgtctgctctccttatacttgccagcagcgt ttcacaggcagaatgccacggaggcgacgtgaaaagtggaattcatcattctggcttgca acgcatcttgttcttttgccagaacagggcacagcaactggctcctcaatctctgcccct cccaaaccccagcagccggtccctgtctccatcccgccggcctcggtggcaaggggctcg cgagctctggttggcttggccgaagcttccgtcactcgccagctcacctacgattggttg gcagagcgtgcgcgaagagccccgcctggcaaggcactgggagctggctgttgctatgag gagtccagggccttccggctgccgttcgggattccgctctccaggttttcaactcacccg tggctgacagggcgtgggaagagctcgacttcagttcggcaccgaaaggggcgggtctat gtcgcgggcggcggcggcggcggcggccgcggagggacgatgcgcgagtacaaagtggtg gtgctgggctcgggcggggtaggcaaatccgccctgaccgtgcagttcgtgaccggcacc ttcatcgagaaatacgaccccaccatcgaggacttctaccgcaaggagatcgaggtggat tcgtcgccgtcggtgctggagatcctggacacggcgggcaccgagcagttcgcgtccatg cgggacctgtacatcaagaacggccagggcttcatcctcgtctacagcctcgtcaaccag cagagcttccaggacatcaagcccatgcgggaccagatcatccgcgtgaagcggtatgag aaagtgccagtcatcttggttgggaacaaagtggacctggaaagtgagagagaagtatcg tccagcgaaggcagagcccttgctgaagagtggggctgcccctttatggaaacttccgct aagagtaaaacaatggtggacgaactctttgcagaaattgtgaggcagatgaactatgct gctcagcctgacaaagatgacccatgctgttctgcatgtaacatacaatag >gi568815585f:97334471_97564439|GENSCAN_predicted_peptide_5|272_aa MHKLFSFTHTNSIRCVIFSNDRLLSKYTQQNWSALKLRGQWLSLFAIRSVNSFVEPLKAA HGHSLGPGVGVEKSWQKWLRTGTRLENQGSTGFDRHLIIFSPKVQLYQVQYAFKAINQGG LTSVAVRKKDCAVIVTHKIVPDKLLDSRTHLFKMTKNTGRVITGVTADSRSREQRAQYKA ANCKYNIFEELGPRVYKCDAAGYYCGFKATAAGVKQAESTSFLEVKKKFAWTFEHSGNRN YMPFYCAINDLKSSETEIRVVTVEIPEFRILT >gi568815585f:97334471_97564439|GENSCAN_predicted_CDS_5|819_bp atgcacaaactcttctcattcactcataccaatagtattcgctgtgtaattttttcaaat gaccgacttctttccaaatatacacaacaaaactggtcagcattaaaactgcgaggacag tggttgtcactgtttgccattagatctgtcaactcctttgtggagcctctcaaagctgcc catggccatagccttgggcctggggttggagtggagaagagctggcagaaatggctgcgg actggaacccggctggagaaccagggcagcactggttttgaccgccaccttataattttt tcacccaaggtccagctctaccaagtacaatatgcttttaaggctattaaccagggtggc cttacatcagtagctgtcagaaagaaagattgtgcagtaattgtcacacacaagatagta cctgacaaattattggattccaggactcacttattcaagatgaccaaaaacactggccgt gtgatcacaggagtgacagctgacagcagatcccgggaacagagggcgcaatataaggca gctaattgcaaatacaatatatttgaagaactaggccctcgggtgtacaagtgtgatgct gcaggttactactgtgggtttaaagccactgcagcaggagttaaacaagccgagtcaacc agcttccttgaagtgaagaagaaatttgcctggacatttgaacacagtggcaaccgcaat tacatgcctttctactgtgctatcaatgatctaaaatcttcagagacagaaattcgagta gttacagttgaaattcctgaattcaggattcttacatga >gi568815585f:97334471_97564439|GENSCAN_predicted_peptide_6|199_aa MVLFSSSLSSSSLTLSALFRLLSSDVPRHSCVASSPFRVSSLILSLPEHKPVSGSPRPSI CKTDSFDFLSFSEQLREQREQVAVASRRQLRPVGCFERSTGLQSGMQPAPGKSGEHFPNR RKDPCFRNSDRNIWVATYESIPSCLPMPTQQCQTLQESRGLGRGREFETYCECIENHEKN LLNLIGLANEIESFAKPLL >gi568815585f:97334471_97564439|GENSCAN_predicted_CDS_6|600_bp atggttttattcagcagctctcttagcagcagctctcttacactgtctgcgctgttccgg ctgctttcttcagacgttcccaggcacagctgcgtggccagctctccctttagggtcagc agcttaattctttctctccctgagcacaagccagtttctggctccccccgaccatccatc tgcaagacagacagctttgattttctctctttctctgagcagctgagggaacaaagagaa caagtggccgtggcatctagaaggcagctcaggcctgttggttgctttgagaggagcact ggccttcaatcagggatgcagccagccccaggaaaatcaggggagcattttccaaacaga aggaaggacccatgtttcagaaattctgatcggaatatttgggtggccacatatgaaagt attccttcgtgcctgccaatgcccactcagcagtgccaaaccttgcaagaaagcaggggc ttggggagagggcgtgagtttgaaacatactgcgaatgcattgaaaaccatgaaaaaaat cttctcaacttaattggcttggccaatgagattgaatcatttgcaaaaccacttttatga >gi568815585f:97334471_97564439|GENSCAN_predicted_peptide_7|279_aa MEKALVHHNLYAGRVEQLRLSIQHLQATFQVSFRASEKESAPPSPWVTHIPSGELRAALT LTPDFTQDLGEEKQGARCRWMTCCASYRNSQFKVFNKWTTESGLLQAHDRNHLNLTPRRV LRDMVEAGNHRYQQTIAKTKNQTLQVLTHRSAGNARSSPKQADAGLLWKDTLWPAPGHTF IEFHGFVPIHAPTAYGYGCALGQCFSAAPRAHVSNTWMSPLDRAQEAMWYTGKEQGFWTQ ATCRIPLKPFTLHNIPHSVSILSVGKRYTQAAIATLPMR >gi568815585f:97334471_97564439|GENSCAN_predicted_CDS_7|840_bp atggaaaaggccttggtgcaccacaatctatatgcaggaagggtggagcagctcaggctg tcaatccagcatctccaagccactttccaagttagcttcagggcttcagagaaagaaagt gctcctccctcgccctgggtcactcacatccccagtggagagctgagagctgccttgaca cttacaccagattttactcaggacttgggagaagagaaacagggagcaagatgcaggtgg atgacatgctgtgcaagttacaggaactcacagtttaaagtcttcaataaatggactaca gagtcaggtctcctgcaggcccacgacagaaatcatcttaacctgacacccaggagggtc ctcagggacatggttgaagctggaaaccatcgttatcagcaaaccatcgcaaagacaaaa aaccaaacactgcaggttctcactcacaggtctgctggaaatgctagatcatcccctaag caggcagatgcaggactcctttggaaagatacactctggccagcccctggccataccttc atcgagttccatggttttgttccaatccatgcgccaacggcatatggatatggctgtgca ctggggcaatgcttttctgctgcaccccgggcccatgtatccaacacctggatgtctcct ctggatcgtgcacaggaggccatgtggtatacagggaaagagcagggattctggacgcag gctacttgcaggattccgctcaagcccttcacactccacaatattccccactcagtcagc attctcagtgtaggtaaacgatacactcaggcagctattgcaactcttccaatgagatga >gi568815585f:97334471_97564439|GENSCAN_predicted_peptide_8|134_aa MQRRKNDTVDFGDSGAKDGMANLPPHLLLDVGPIIKVQHNPAVDVLGLRDCTGEEGKQKD ALVPWKAALRDILLSGIQTEHGEAPRLRLSPALLFPAALDTPLFGLLQVIMGNDDDDVDN CNDNNGNNDKNCGR >gi568815585f:97334471_97564439|GENSCAN_predicted_CDS_8|405_bp atgcaaaggcgtaagaatgacacagtggactttggggactcaggggcaaaagatgggatg gcgaatctgcctccacatctccttttggatgtggggccaataattaaggtgcagcacaat ccagctgttgatgtgctgggcctaagggactgtactggagaagaaggaaaacaaaaagat gccttagttccatggaaggctgctctgagggatattctgctaagtggcatccagacagaa cacggagaagctcctcgattgcgtctctcccctgccttattgtttcctgctgcccttgac actccactgtttgggctattgcaagtaatcatgggaaatgatgatgatgatgttgataac tgcaatgacaacaacggcaataatgacaaaaactgtggaagataa