GENSCAN 1.0	Date run:  4-Nov-116	Time: 01:56:02

Sequence gi568815586f:116475943_116676317 : 200375 bp : 46.32% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -   1091   1086    6                               1.05
 1.03 Term -   6765   6691   75  0  0   88   49    79 0.721   1.84
 1.02 Intr -  19488  19378  111  0  0    6   84   104 0.146   2.38
 1.01 Init -  21738  21700   39  0  0   66   70    65 0.202   2.70
 1.00 Prom -  22999  22960   40                              -5.56

 2.04 PlyA -  24112  24107    6                               1.05
 2.03 Term -  24292  24162  131  2  2   70   47   107 0.358   3.14
 2.02 Intr -  30971  30856  116  1  2   90   98    29 0.432   4.19
 2.01 Init -  32628  32552   77  0  2   53  -45   181 0.412  -0.14
 2.00 Prom -  38163  38124   40                              -5.36

 3.00 Prom +  46573  46612   40                              -3.16
 3.01 Init +  56662  56800  139  0  1   40   53   141 0.739   6.10
 3.02 Term +  58150  58184   35  2  2  152   42    53 0.991   4.85
 3.03 PlyA +  59360  59365    6                               1.05

 4.00 Prom +  62533  62572   40                              -8.56
 4.01 Init +  64163  64191   29  1  2   95  111     8 0.540   2.96
 4.02 Intr +  68553  68750  198  0  0   78   37    79 0.311   0.27
 4.03 Intr +  75173  75237   65  2  2  106   52    50 0.478   1.56
 4.04 Intr +  77581  77704  124  2  1   66   24   128 0.009   3.84
 4.05 Intr +  83316  83335   20  0  2   92   46    53 0.007  -2.25
 4.06 Term +  99900 100378  479  1  2   37   41   638 0.775  49.00
 4.07 PlyA + 100630 100635    6                               1.05

 5.00 Prom + 105165 105204   40                              -5.96
 5.01 Init + 105231 105327   97  2  1   72  115    72 0.720   8.78
 5.02 Term + 124103 124176   74  0  2   92   48    60 0.074   0.47
 5.03 PlyA + 125442 125447    6                               1.05

 6.04 PlyA - 125488 125483    6                               1.05
 6.03 Term - 130504 130370  135  1  0   75   53   106 0.112   3.82
 6.02 Intr - 138488 138422   67  1  1   72    4   134 0.026   2.31
 6.01 Init - 157719 157622   98  0  2   77   94    50 0.199   4.28
 6.00 Prom - 175083 175044   40                              -4.86

 7.00 Prom + 175198 175237   40                              -6.06
 7.01 Init + 185006 185209  204  1  0   97   44   234 0.850  16.76
 7.02 Intr + 188733 188814   82  2  1   67   67    26 0.205  -2.39
 7.03 Term + 189264 189433  170  1  2   11   54   158 0.288   2.74
 7.04 PlyA + 189609 189614    6                               1.05

 8.02 PlyA - 190277 190272    6                               1.05
 8.01 Sngl - 192688 192482  207  1  0   62   41   143 0.587   2.19
 8.00 Prom - 193708 193669   40                              -1.96

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  77581  77720  140  2  2   66   36   149 0.913   5.63

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:116475943_116676317|GENSCAN_predicted_peptide_1|74_aa
MASGKGPVAATSLSFGSLILGEASCQGMRSLRQPCGNVPVEELRPPANSHEDIDQVEGIR
RRAGHMNGGLQEWG

>gi568815586f:116475943_116676317|GENSCAN_predicted_CDS_1|225_bp
atggcaagtggcaaaggtcctgtggctgctacgagcctgtcttttggatcactcattctg
ggagaagccagctgccagggaatgcggagccttcgacagccctgtgggaatgtccctgtg
gaggaactgaggcctccagccaacagccatgaggatattgaccaagtggaagggattcga
agaagagcaggacacatgaatgggggcctgcaggagtgggggtga

>gi568815586f:116475943_116676317|GENSCAN_predicted_peptide_2|107_aa
MQEAARAAAARRARWGRAAQQPPDAAVLGSLTTPQTTYFLGQRALLVFGGIETERSKERP
PPFQGGIVGFLFSPSEKQLHSYMQSSDFSEYLIGARDYVSFEGDKDE

>gi568815586f:116475943_116676317|GENSCAN_predicted_CDS_2|324_bp
atgcaggaggcagccagagccgcggcggcgcggcgggcccgctggggccgagcagcgcag
cagccccccgacgccgctgtccttggaagtcttacaactccacagaccacgtacttcctg
ggccaaagagcactcctagtctttggggggatagaaactgagagaagcaaagagagacct
ccaccattccaaggtggtattgttgggttcctattcagtccatcagagaagcagctccat
tcatacatgcagtcatctgacttcagtgagtacctgatcggtgccagggactacgttagc
ttcgaaggagacaaagatgaatga

>gi568815586f:116475943_116676317|GENSCAN_predicted_peptide_3|57_aa
MVTIIDHHGCWWVVDGSENSIKRKASDEAGTVVQEEDAGGLDQGAHSFPECIYSGYN

>gi568815586f:116475943_116676317|GENSCAN_predicted_CDS_3|174_bp
atggtgaccatcattgatcatcatggctgctggtgggtagtggatgggagtgagaacagc
attaagaggaaggccagtgatgaggcggggacagtggtccaggaggaagatgcaggtggc
ctggaccagggggcacacagttttcccgaatgtatctactccggttacaactag

>gi568815586f:116475943_116676317|GENSCAN_predicted_peptide_4|304_aa
MAPKVPTEHRFDAQPEETGEDLATFCFLLPVFLAPKVHQGGGFHGQTNLANAALNQVTQM
SLLQGQLMCECSSKNSDEFHEDKDLFILFCSSTEQVLSICPKELKAGTLHEDKYIYMHVP
SSAVHDHQEVETTQMSVIINSDEARALHGHSRIRCRSSRHPQEPPGPSRRRRRGPDPHTM
PSEKTFKQRRTFEQRVEDVRLIREQHPTKIPVIIERYKGEKQLPVLDKTKFLVPDHVNMS
ELIKIIRRRLQLNANQAFFLLVNGHSMVSVSTPISEVYESEKDEDGFLYMVCASQETFGM
KLSV

>gi568815586f:116475943_116676317|GENSCAN_predicted_CDS_4|915_bp
atggctcctaaggtgccaactgaacacaggtttgacgcacagcctgaggaaactggagag
gacctggccaccttttgcttccttctccccgtgtttctggctcccaaagtacaccaagga
ggaggatttcatgggcaaacaaatttggccaatgctgcattaaaccaagttacccagatg
tctctacttcagggtcagttgatgtgtgaatgttcatccaagaacagtgatgagttccat
gaagacaaagacctgtttatcttgttctgctccagcaccgagcaggtactcagtatatgc
ccaaaagaattgaaagcaggtactctacatgaagacaagtacatctacatgcatgttcct
agcagcgcagttcatgatcaccaggaggtggaaacaacccaaatgtctgtcatcatcaac
agcgacgaggcccgagccttacacggccacagtcggattcgctgccgcagcagccgccac
ccccaggagccgccgggaccctcgcgtcgtcgccgccgcggcccagatccccacaccatg
ccgtcggagaagaccttcaagcagcggcgcaccttcgaacaaagagtagaagatgtccga
cttattcgagagcagcatccaaccaaaatcccggtgataatagaacgatacaagggtgag
aagcagcttcctgttctggataaaacaaagttccttgtacctgaccatgtcaacatgagt
gagctcatcaagataattagaaggcgcttacagctcaatgctaatcaggccttcttcctg
ttggtgaacggacacagcatggtcagcgtctccacaccaatctcagaggtgtatgagagt
gagaaagatgaagatggattcctgtacatggtctgtgcctcccaggagacgttcgggatg
aaattgtcagtgtaa

>gi568815586f:116475943_116676317|GENSCAN_predicted_peptide_5|56_aa
MPAPPLPSAMIGSFLRSPQKQMLLRFLYSLQNGEPELAGRKGKSDGITIQGQVSTP

>gi568815586f:116475943_116676317|GENSCAN_predicted_CDS_5|171_bp
atgcctgctccccctttgccttctgccatgattggaagcttcctgagatctccccagaag
caaatgctgctacgcttcctatatagcctgcagaacggagagccagagctggctggcaga
aaaggcaagtcagatggcatcaccatccaaggccaggtctccactccctga

>gi568815586f:116475943_116676317|GENSCAN_predicted_peptide_6|99_aa
MTLHSQKDPGLDEKLYSCAIYGESCKLVSSESRKDIIIIITIIIIVNECILDDLLRGNQG
SERRSDLDNFRVTQLDSLSDLSDLKVSVVQKKRHKWKNQ

>gi568815586f:116475943_116676317|GENSCAN_predicted_CDS_6|300_bp
atgaccctgcactctcagaaggaccctgggttggatgagaagttatatagttgtgcaatt
tatggagaaagttgtaaactggtgtcatccgagagcagaaaggacatcatcatcatcatc
accatcatcatcatagttaatgagtgcatcctagatgatctcttaagaggaaatcaaggc
tcagagagacgatctgacttggacaattttcgagtgacacagctggattcactctcagat
ctctctgacttgaaagtatctgttgtccagaaaaaaagacataagtggaaaaatcaatga

>gi568815586f:116475943_116676317|GENSCAN_predicted_peptide_7|151_aa
MPEPPLPAVGSCAARASPTSAAPCSTGPSPIDHPRAEECGHTAQDWQAAPPAAQYGIHWV
KPTGLLSLNIQHFSNSFDFGTLLDLSFGEDLAVNVEPDLVNDTEEYEHEEMRCHLNLTQS
RKISLSGLPAANVRSPIKLTSQNVEVMDRNI

>gi568815586f:116475943_116676317|GENSCAN_predicted_CDS_7|456_bp
atgcctgagcctcccctcccggccgtgggctcctgcgcagcccgagcctccccgacgagc
gctgccccctgctccacggggcccagtcccatcgaccacccaagggctgaggagtgtggg
cacacggcgcaggactggcaagcagctccacctgcggcccagtacgggatccactgggtg
aagccaactgggctcctgagtctgaatatacagcatttttcaaactcatttgactttggg
acccttctggatctctcttttggggaggatttggcagtaaatgtagagccagatcttgtg
aatgacactgaagaatatgaacatgaagaaatgcgatgccacctcaacctcacacagtcc
agaaaaatcagcctatcaggcctccctgctgcaaatgtcagaagccccatcaaactgaca
agccagaatgtggaagttatggacagaaacatctga

>gi568815586f:116475943_116676317|GENSCAN_predicted_peptide_8|68_aa
MNNSEIGQPPVPRYAQRPQRSHVVDDWWTEKGKRGTENGCEVQTQPDWLQLGVCLIRTQF
KQLATFIG

>gi568815586f:116475943_116676317|GENSCAN_predicted_CDS_8|207_bp
atgaacaattcagaaattgggcagcctcctgtgccaaggtacgctcagagaccccagcgc
agccatgtggtggatgattggtggactgaaaaaggaaagcgaggtacagaaaatggatgt
gaggtacagacacagccggactggttacagctcggcgtttgccttattcgaacacagttc
aaacagttggctacatttattggctaa