GENSCAN 1.0    Date run:  8-Nov-116    Time: 01:47:32

Sequence gi568815585f:97176236_97466553 : 290318 bp : 39.64% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    249    366  118  2  1   71   59    97 0.703   5.51
 1.02 Term +  13882  14021  140  2  2   41   36   160 0.406   3.24
 1.03 PlyA +  15174  15179    6                               1.05

 2.03 PlyA -  15448  15443    6                               1.05
 2.02 Term -  23994  23882  113  1  2   86   44    75 0.321   0.64
 2.01 Init -  25071  24999   73  0  1   65  116    62 0.593   7.98
 2.00 Prom -  26031  25992   40                              -6.85

 3.10 PlyA -  26696  26691    6                               1.05
 3.09 Term -  26796  26711   86  1  2   76   53   109 0.089   3.04
 3.08 Intr -  32247  32163   85  0  1   53   87    32 0.002  -1.93
 3.07 Intr -  48085  47823  263  2  2   38   87   127 0.032   3.88
 3.06 Intr -  54356  54267   90  0  0   37   89    84 0.067   2.45
 3.05 Intr -  57636  57566   71  2  2   83  100    70 0.089   5.61
 3.04 Intr -  62286  62156  131  0  2   72   44    65 0.036  -0.93
 3.03 Intr -  66396  66264  133  2  1   91   92    49 0.329   5.33
 3.02 Intr -  66670  66525  146  1  2   75  108    31 0.024   1.96
 3.01 Init -  80450  80376   75  2  0   17   95    75 0.115   2.24
 3.00 Prom -  83040  83001   40                              -6.25

 4.02 PlyA -  83657  83652    6                               1.05
 4.01 Sngl -  85653  85222  432  0  0   87   42   225 0.502  13.83
 4.00 Prom -  91050  91011   40                              -3.05

 5.00 Prom +  95806  95845   40                              -3.45
 5.01 Init + 100001 100174  174  1  0  100  116    73 0.894  10.69
 5.02 Term + 107993 108106  114  1  0  101   43    54 0.058  -0.21
 5.03 PlyA + 108327 108332    6                               1.05

 6.05 PlyA - 108661 108656    6                               1.05
 6.04 Term - 114852 114745  108  0  0  107   41    34 0.364  -1.87
 6.03 Intr - 121136 120991  146  0  2   52   50   180 0.469   9.68
 6.02 Intr - 122636 122521  116  1  2   50   84    59 0.369   0.87
 6.01 Init - 124454 124405   50  2  2   62   78    57 0.449   2.57
 6.00 Prom - 128727 128688   40                              -5.85

 7.00 Prom + 130368 130407   40                              -6.65
 7.01 Init + 132725 132880  156  1  0  101   63    55 0.148   4.26
 7.02 Intr + 139487 139614  128  1  2   20   68   115 0.324   1.26
 7.03 Intr + 142687 142796  110  1  2   77   96    86 0.516   7.31
 7.04 Intr + 143283 143314   32  1  2   76   80    53 0.247   0.23
 7.05 Intr + 158041 158205  165  0  0   69   89    62 0.113   3.64
 7.06 Intr + 159310 159432  123  0  0   75   69    38 0.041   0.46
 7.07 Intr + 160961 161077  117  1  0   56   52   102 0.306   3.14
 7.08 Intr + 166781 166981  201  1  0  109   65    91 0.978   7.46
 7.09 Intr + 170569 170832  264  0  0   81  115   235 0.979  22.29
 7.10 Intr + 181247 181400  154  1  1   84   91   182 0.979  16.82
 7.11 Term + 190810 190958  149  2  2   45   38   108 0.185  -1.42
 7.12 PlyA + 191217 191222    6                               1.05

 8.08 PlyA - 191862 191857    6                               1.05
 8.07 Term - 195395 195015  381  2  0   77   32   156 0.682   2.75
 8.06 Intr - 195849 195760   90  0  0   47   84    67 0.686   1.47
 8.05 Intr - 199038 198898  141  0  0   38  111    86 0.861   5.53
 8.04 Intr - 199394 199333   62  0  2   48   94    28 0.262  -3.07
 8.03 Intr - 204791 204659  133  2  1   38  115    65 0.277   3.50
 8.02 Intr - 210042 209955   88  2  1   43   83    58 0.308  -0.15
 8.01 Init - 211522 211305  218  1  2   93   50   155 0.397  10.42
 8.00 Prom - 223174 223135   40                              -1.95

 9.00 Prom + 225692 225731   40                              -6.15
 9.01 Init + 229464 229545   82  2  1   75   42    64 0.953   1.68
 9.02 Term + 231024 231172  149  1  2   78   33   178 0.990   8.38
 9.03 PlyA + 231435 231440    6                               1.05

10.00 Prom + 235333 235372   40                              -7.65
10.01 Init + 236175 236228   54  2  0   67   64    69 0.300   3.73
10.02 Intr + 236505 236567   63  2  0   88   94    38 0.297   2.40
10.03 Intr + 242800 242970  171  0  0   77   75   167 0.201  13.52
10.04 Intr + 254146 254224   79  0  1   60   48    80 0.008  -0.59
10.05 Intr + 256585 256860  276  2  0   55   64   225 0.619  13.37
10.06 Intr + 257025 257155  131  1  2   18   -3   166 0.414  -0.01
10.07 Intr + 258191 258549  359  1  2  -14   89   716 0.281  54.63
10.08 Term + 287970 288207  238  0  1   97   39   297 0.965  20.36
10.09 PlyA + 289438 289443    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl - 107098 106904  195  1  0  106   49   155 0.808   8.31

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815585f:97176236_97466553|GENSCAN_predicted_peptide_1|85_aa
MEQCSPAHVNLKASPEHYTPLEGSEDAILTWTSTNVLQEAWQKADSNQPQQVTGAEAQPF
LKATHALEDADGDLSTFSSSLVLCK

>gi568815585f:97176236_97466553|GENSCAN_predicted_CDS_1|258_bp
atggagcagtgcagccctgcccatgtgaacctcaaagcctctccggagcattatactcct
ctagagggttcagaggatgcaatactcacctggacttccacgaatgtattgcaagaggca
tggcagaaagcggattccaaccagccacagcaggtgacaggagctgaagctcagcccttc
ctgaaagcaacccatgccctcgaggatgctgacggtgatctgagtacattttccagctcc
cttgttctgtgtaaataa

>gi568815585f:97176236_97466553|GENSCAN_predicted_peptide_2|61_aa
MKEKEKEIIGNGNANQTGGLCVSKDLLLTPLQGLEWVYVQLCAERYQHLQQVAHSMEAGG
G

>gi568815585f:97176236_97466553|GENSCAN_predicted_CDS_2|186_bp
atgaaggaaaaagaaaaagaaataattggaaacgggaatgcgaatcagacaggaggacta
tgtgtttctaaagatctgctccttacacctctccaaggcctggagtgggtgtacgtgcag
ctctgtgctgaaaggtaccagcatctccagcaagtggcccacagcatggaggctggaggt
ggctag

>gi568815585f:97176236_97466553|GENSCAN_predicted_peptide_3|359_aa
MEDLEDLKQNTRIRHYHSWRSYQAMHQFQRMMSLDNDSIIHASYSSIPTDPSLTVLKLSI
KQRTGGGKKKKRNKPYISRGLGMFLCSTWGRGQVSDTTLCPAKSQVQGQDYILGRRRQDL
PPFPILTWFQEALIYDTVNSMILCHRSKGNERGYEAHTPVPSKPPFWYLSTQENKEAARV
ARIVGNSSFVIAYLCSRCLLSRGVSTNGFNYGFNTGHDDFSMKAAVLTITTSLELGGTTW
ILNSGSGATGNRAPGTIHPGGSAGLHQGGVRPEKKLCPAPCCFLPAQPEALQKTCADRPD
MWQELSSSIVQSGPSCGFHKRLKFQPYKGIRGRKRSSEVEGDLKEEEHKTRSGAKGSQH

>gi568815585f:97176236_97466553|GENSCAN_predicted_CDS_3|1080_bp
atggaagatttggaggacttaaaacagaacactcgtataaggcattatcattcctggagg
tcttaccaagccatgcaccaatttcagaggatgatgtcattggacaacgactccataata
catgccagttatagtagcatccccacagatccaagcttaactgttctcaaactgtcaatc
aagcaacgtacaggaggggggaaaaaaaaaaaaaggaacaaaccctacatttctcgggga
ctggggatgtttctgtgttccacatggggccgaggacaagtctcagacaccactctgtgc
cctgcaaaatcacaagttcagggtcaggattatattttgggtagaagacggcaggatctg
ccaccatttcccattctcacctggtttcaggaagccttaatctatgacacagtgaatagc
atgatcctttgtcacagatccaaaggcaatgagagaggttatgaggcccatacacctgtt ccaagcaaaccaccattctggtacttgagtacccaggagaacaaagaagcagctagagta gcacgcattgtgggaaattcatcctttgtcattgcctatctctgttcacgctgccttctt tcacgaggtgtgagcaccaatggcttcaactacggcttcaacacaggacatgatgatttt tctatgaaagcagctgttctcaccatcacaaccagccttgaacttggtggcacaacctgg atcctaaatagtggctctggagcaactgggaatagggccccggggaccatccacccaggt ggcagcgctgggctgcaccaagggggagtcaggccagaaaaaaagctttgccctgctccc tgctgctttttgccagcccagccagaagccctgcagaaaacctgtgctgaccggcccgac atgtggcaggaactatccagttctattgtacaatctgggccaagctgtgggttccacaaa aggctcaagttccagccctacaaaggaatcagaggaagaaagagaagctcagaggtggaa ggagacctgaaggaggaggagcacaaaacaagaagtggagccaaggggagtcaacactga >gi568815585f:97176236_97466553|GENSCAN_predicted_peptide_4|143_aa MEAQPQRGNETLISHSQDPQKVGSEGCSLARQQEMKPGQLQSCSLYRLSQIRAVLPAAKV VLGPSSQCLKCGKSLVGSYRRWQGKRKAGNLKWPRRHNFSERALIYSEMNGYAVSIYPWP VESRGRKRVLPNFKGQGLGDGTI >gi568815585f:97176236_97466553|GENSCAN_predicted_CDS_4|432_bp atggaggcccagcctcagcgaggaaacgagacactcatttcacactcacaggacccacag aaggtgggaagtgagggatgcagcctggctagacagcaggaaatgaaaccagggcagctt cagtcctgcagcctttatcgtctcagtcaaatcagagctgttttacctgctgcaaaagtg gtgctggggccttccagccagtgcctgaagtgtgggaaatctttggtgggcagttacaga agatggcagggaaagagaaaggcagggaatctcaagtggcccaggaggcacaacttcagc gaaagagcgttaatttattctgagatgaatggttatgctgtctccatctacccttggcca gtggaaagcagaggcagaaaaagggtgctgccaaatttcaaaggtcaaggtcttggagat gggacaatttaa >gi568815585f:97176236_97466553|GENSCAN_predicted_peptide_5|95_aa MALNVAPVRDTKWLTLEVCRQFQRGTCSRSDEECKFAHPPKSCQVENGRVIACFDSLKIV GKYNTFKGHAPSEAPGVGEVGGEGWKPVLASFGFW >gi568815585f:97176236_97466553|GENSCAN_predicted_CDS_5|288_bp atggctttgaacgttgccccagtcagagatacaaaatggctgacattagaagtctgcaga cagtttcaaagaggaacatgctcacgctctgatgaagaatgcaaatttgctcatcccccc aaaagttgtcaggttgaaaatggaagagtaattgcctgctttgattccctaaagattgta gggaagtataatacctttaagggccacgctccctctgaagcccctggggttggggaagtt gggggagaggggtggaaaccagttcttgcctctttcggcttttggtag >gi568815585f:97176236_97466553|GENSCAN_predicted_peptide_6|139_aa 
MLKVKKKMKRKAMLQESRRPTAAARDKQSLMPCIPLVLFRHKQKNKTSPSFPTLIGHMET LPGTYHTVRQLADTHLDASPDYQQFEGSSVVFTAEPSMSKTEPHADWPSHKTDQSLSSLY SNLSAEHNIQTPTQFLSYS >gi568815585f:97176236_97466553|GENSCAN_predicted_CDS_6|420_bp atgttgaaggtgaaaaagaagatgaagagaaaggcaatgttgcaagagagcagaaggccc actgcagctgcaagggacaagcaatccttaatgccgtgtattcctctagtacttttccgg cacaaacagaagaataaaacttccccatcattccctactttaataggacacatggagacc cttcctggtacttatcatactgtacggcagttggctgatactcatcttgatgcctctcct gactatcagcaatttgaaggcagtagtgttgtcttcactgctgaaccctctatgtcaaag acagaacctcatgctgactggccttcccataagacagatcagagcctgtcatccctgtac tcaaatcttagtgctgagcataacattcaaacacctacacagtttctgtcttactcatga >gi568815585f:97176236_97466553|GENSCAN_predicted_peptide_7|532_aa MEKGHQRWACGGEVRGQSVQKEPMSIGAEAEKCGCVWNLRVAWLDCSIKNHGHEKKAGLG LERKLTLWQCGRPCSWVRMWKALLVGEDVEGLTRGYVLEKHAAAGILRKISRYLLRVDHS EEKPWMGTLRKVPAKMTPPVSRGRCSRENCKYLHPPTHLKTQLEINGRNNLIQQKTAAAM LAQQMQFMFPGTPLHPVASVQSSQEPALPRVKEMPGAEGSKINKIQCFGDRQTQEKGKQE GTVYEPGSWSSSDIKSAGTLIWGVPASKTVRNSIQLFPTFPVGPAIGTNTAISFAPYLAP VTPGVGLVPTEILPTTPVIVPGSPPVTVPGSTATQKLLRTDKLEVCREFQRGNCARGETD CRFAHPADSTMIDTSDNTVTVCMDYIKGRCMREKCKYFHPPAHLQAKIKAAQHQANQAAV AAQAAAAAATVMAFPPGALHPLPKRQALEKSNGTSAVFNPSVLHYQQALTSAQLQQHAAF IPTGSAKLTWFPPYNLLQYWPKSQTIRNEESWVPLNSQRLDITDEEHLTLLV >gi568815585f:97176236_97466553|GENSCAN_predicted_CDS_7|1599_bp atggagaaagggcatcagagatgggcctgtgggggtgaagtaagagggcagtctgtgcag aaagagcccatgagcataggcgcagaggcagaaaagtgtgggtgtgtttggaatctgaga gtagcatggttggactgcagcataaaaaaccatgggcatgaaaagaaagcagggcttgga ctggaaaggaagctcactctctggcaatgtggaaggccttgctcatgggtgaggatgtgg aaggccttgctcgtgggtgaggatgtggaaggccttactcgtgggtacgtactggagaaa catgcagcagcaggaattctgagaaaaattagcaggtatctactgagggtggaccacagt gaagaaaaaccctggatggggaccctcagaaaggtgccagcgaagatgaccccacccgtg tcacggggccgttgttcgagagagaactgcaagtatcttcaccctccgacacacttaaaa actcaactagaaattaatggaaggaacaatttgattcagcaaaaaactgcagcagcaatg cttgcccagcagatgcaatttatgtttccaggaacaccacttcatccagtggcttctgta cagagtagccaggaaccagctctgcccagagtaaaagaaatgccaggtgcagagggctcc 
aagataaataagatacagtgttttggggatagacagacacaagaaaagggcaaacaagaa ggcactgtctatgaaccagggagctggtcctcatcagacatcaaatctgcgggcaccttg atttggggggttccagcttccaaaactgtgagaaattcaattcagttgtttcccactttc cctgtaggtcccgcgatagggacaaatacggctattagctttgctccttacctagcacct gtaacccctggagttgggttggtcccaacggaaattctgcccaccacgcctgttattgtt cccggaagtccaccggtcactgtcccgggctcaactgcaactcagaaacttctcaggact gacaaactggaggtatgcagggagttccagcgaggaaactgtgcccggggagagaccgac tgccgctttgcacaccccgcagacagcaccatgatcgacacaagtgacaacaccgtaacc gtttgtatggattacataaaggggcgttgcatgagggagaaatgcaaatattttcaccct cctgcacacttgcaggccaaaatcaaagctgcgcagcaccaagccaaccaagctgcggtg gccgcccaggcagccgcggccgcggccacagtcatggcctttccccctggtgctcttcat cctttaccaaagagacaagcacttgaaaaaagcaatggtaccagcgcggtctttaacccc agcgtcttgcactaccagcaggctctcaccagcgcacagttgcagcaacacgccgcgttc attccaacaggttctgccaaactcacttggttcccaccttacaatctgctccaatactgg cctaagagccaaaccatccggaatgaagagtcatgggtgccgctgaactcacaaaggctg gatataactgatgaggagcatttaactctgttggtttaa >gi568815585f:97176236_97466553|GENSCAN_predicted_peptide_8|370_aa MEKMTCIILTGLHNVQSACTNQHEFGDVTLLIGEGIPSRAWGRSEIGRRDTEVPKSMAVP HGACFRGCLEEPLQYPLYSLSLDGFLSWHQPTLQATSWQAAVVQRKNQKEYWRGISGGLS VPPVRFVPFRALVPHLACITLAPLQCGKQSPGGFNIMACGKQMAGAWSGQQTQSSLTLGV TASSNVTPEECQKGLRDQSKCQCVFDVWQHLMCLSSDLETWTICIDSTPEGTLSFKQKPV VGGRSTVSITILLENPSVAKSGPNPLTWHQGTTQSIRIKPTLQPLPQTDALINLNGSYSP AELDTSPAFFVDLHLSFFLFNDGALRNVQSPCRIQRTGKVLLAGGGGERCSGKDSEYKLY FNCAFKNVIK >gi568815585f:97176236_97466553|GENSCAN_predicted_CDS_8|1113_bp atggaaaagatgacgtgcataattttaactggcttgcacaatgtacaaagtgcttgcaca aaccagcatgagtttggagatgtcactctgctcattggtgaaggcatcccctccagggca tggggacggtctgagatagggcgacgggacacagaggttccaaagagcatggctgtgcct catggagcctgcttcagagggtgcttagaagaacccctgcaatatccactatattcacta agtctggatggcttcctgagctggcatcagccaacgctacaggcaacatcatggcaggca gcagtggtgcaaaggaaaaatcaaaaagagtactggaggggaatttcaggaggcttgtct gtgccaccagtcagatttgtgccctttcgagccttagttcctcacttggcttgtatcacg ctggcacctctgcagtgcgggaaacagagtcccggtgggtttaacataatggcgtgtgga 
aagcagatggctggtgcatggagtggtcagcagacccagtccagtctgaccttgggagtc actgcctccagcaatgtcaccccagaggaatgccagaaggggctgagagaccagagcaaa tgccagtgtgtctttgacgtgtggcagcatcttatgtgtttgagctctgatttggagaca tggacaatatgcattgactctacacctgaggggactctttcctttaagcagaagcctgtg gttggaggtaggagtactgtttccatcactattctgcttgaaaatccttcagtggccaag tcagggccgaaccctttaacttggcatcaaggtaccacacaatcaatcagaatcaaacct acgctgcagccgctcccacaaacggatgctctgatcaatctgaatgggagttattctcca gctgaacttgatacttccccagctttctttgtagacttgcatttgtctttctttctattt aatgatggtgccttaaggaatgtgcaaagtccttgtagaattcagaggacagggaaagta cttcttgctgggggaggaggagagagatgttcagggaaagattcggagtataaattgtat tttaattgtgcctttaagaatgtgattaagtaa >gi568815585f:97176236_97466553|GENSCAN_predicted_peptide_9|76_aa MGLYSTKSLVGPLMEMPSCSTCDFQSYFLPSTNEDKKIEHLASAVMMMNPVRISLEMRMK VQGAGVPARTSPTQDT >gi568815585f:97176236_97466553|GENSCAN_predicted_CDS_9|231_bp atggggctctactccaccaagtcacttgtgggcccactgatggagatgccatcctgttca acttgtgacttccaaagttattttttaccatctacaaatgaggacaagaagattgaacac ttagcgagtgctgtcatgatgatgaatcctgtgagaatttcactggagatgaggatgaaa gtacaaggagctggggtccccgctagaacatcaccaactcaggacacttag >gi568815585f:97176236_97466553|GENSCAN_predicted_peptide_10|456_aa MTLSHTLSPEEEHMQDCKSRIPAVLLNHTKQKLTKTQIKASSVPARAQPYSDCLVSSLIP REPAIQNCKGKQIQENPTQVSECLYTNHILTEATVKILNCIRSLTGQETIVCSPYTCQQR FTGRMPRRRREKWNSSFWLATHLVLLPEQGTATGSSISAPPKPQQPVPVSIPPASVARGS RALVGLAEASVTRQLTYDWLAERARRAPPGKALGAGCCYEESRAFRLPFGIPLSRFSTHP WLTGRGKSSTSVRHRKGRVYVAGGGGGGGRGGTMREYKVVVLGSGGVGKSALTVQFVTGT FIEKYDPTIEDFYRKEIEVDSSPSVLEILDTAGTEQFASMRDLYIKNGQGFILVYSLVNQ QSFQDIKPMRDQIIRVKRYEKVPVILVGNKVDLESEREVSSSEGRALAEEWGCPFMETSA KSKTMVDELFAEIVRQMNYAAQPDKDDPCCSACNIQ >gi568815585f:97176236_97466553|GENSCAN_predicted_CDS_10|1371_bp atgacactgagtcacactttgagtcctgaagaagagcacatgcaggactgcaagagtaga attccagctgtcttactaaaccacactaaacaaaaacttactaaaactcagatcaaggcc tcatcagtccctgccagagcacagccttactctgactgcctggtgtcatcactgatccca agagaaccagccatccaaaactgcaaaggaaaacaaatccaggaaaaccctacacaggtc agtgaatgcctctacaccaatcacatcttgactgaagctaccgtgaagattctcaactgt 
attaggagcctcacaggtcaagaaaccatcgtctgctctccttatacttgccagcagcgt ttcacaggcagaatgccacggaggcgacgtgaaaagtggaattcatcattctggcttgca acgcatcttgttcttttgccagaacagggcacagcaactggctcctcaatctctgcccct cccaaaccccagcagccggtccctgtctccatcccgccggcctcggtggcaaggggctcg cgagctctggttggcttggccgaagcttccgtcactcgccagctcacctacgattggttg gcagagcgtgcgcgaagagccccgcctggcaaggcactgggagctggctgttgctatgag gagtccagggccttccggctgccgttcgggattccgctctccaggttttcaactcacccg tggctgacagggcgtgggaagagctcgacttcagttcggcaccgaaaggggcgggtctat gtcgcgggcggcggcggcggcggcggccgcggagggacgatgcgcgagtacaaagtggtg gtgctgggctcgggcggggtaggcaaatccgccctgaccgtgcagttcgtgaccggcacc ttcatcgagaaatacgaccccaccatcgaggacttctaccgcaaggagatcgaggtggat tcgtcgccgtcggtgctggagatcctggacacggcgggcaccgagcagttcgcgtccatg cgggacctgtacatcaagaacggccagggcttcatcctcgtctacagcctcgtcaaccag cagagcttccaggacatcaagcccatgcgggaccagatcatccgcgtgaagcggtatgag aaagtgccagtcatcttggttgggaacaaagtggacctggaaagtgagagagaagtatcg tccagcgaaggcagagcccttgctgaagagtggggctgcccctttatggaaacttccgct aagagtaaaacaatggtggacgaactctttgcagaaattgtgaggcagatgaactatgct gctcagcctgacaaagatgacccatgctgttctgcatgtaacatacaatag