GENSCAN 1.0 Date run: 4-Nov-116 Time: 18:39:08 Sequence gi568815581r:57874476_58083395 : 208920 bp : 46.79% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 Intr - 206 92 115 2 1 67 54 72 0.018 1.72 1.07 Intr - 2189 2108 82 1 1 22 89 53 0.027 -1.56 1.06 Intr - 5263 5136 128 1 2 68 97 128 0.104 11.18 1.05 Intr - 11101 10754 348 1 0 -5 75 711 0.569 56.05 1.04 Intr - 11404 11267 138 1 0 143 -13 131 0.705 9.16 1.03 Intr - 14909 14832 78 2 0 76 48 61 0.395 0.65 1.02 Intr - 16199 16096 104 0 2 55 64 59 0.324 0.09 1.01 Init - 16805 16622 184 2 1 90 21 115 0.301 4.28 1.00 Prom - 21897 21858 40 -4.36 2.00 Prom + 26286 26325 40 -6.56 2.01 Init + 28806 28878 73 2 1 88 96 47 0.283 4.98 2.02 Intr + 37567 37761 195 2 0 64 77 93 0.551 5.19 2.03 Intr + 39003 39083 81 1 0 28 108 54 0.538 1.01 2.04 Term + 39210 39274 65 1 2 73 37 58 0.630 -2.75 2.05 PlyA + 40399 40404 6 1.05 3.00 Prom + 43944 43983 40 -1.56 3.01 Init + 45105 45157 53 2 2 88 92 36 0.926 4.63 3.02 Intr + 51750 51860 111 0 0 45 86 84 0.303 3.39 3.03 Intr + 59502 59549 48 0 0 93 81 27 0.034 0.30 3.04 Intr + 66018 66133 116 0 2 102 -7 91 0.054 1.09 3.05 Intr + 79370 79572 203 0 2 87 81 42 0.875 2.40 3.06 Intr + 79695 79766 72 2 0 76 75 95 0.944 6.60 3.07 Intr + 82918 82998 81 0 0 99 76 6 0.527 0.33 3.08 Term + 85470 85691 222 2 0 65 44 145 0.906 4.72 3.09 PlyA + 86203 86208 6 1.05 4.07 PlyA - 87124 87119 6 1.05 4.06 Term - 88357 88293 65 2 2 93 47 50 0.619 -0.55 4.05 Intr - 90953 90846 108 0 0 49 94 68 0.448 3.76 4.04 Intr - 104838 104677 162 1 0 46 116 198 0.986 18.35 4.03 Intr - 106311 106128 184 0 1 69 56 74 0.907 1.66 4.02 Intr - 108918 108292 627 0 0 60 56 485 0.900 34.91 4.01 Init - 113636 113604 33 2 0 112 84 43 0.006 5.44 4.00 Prom - 114905 114866 40 -4.66 5.00 Prom + 127267 127306 40 -0.96 5.01 Sngl + 130900 131103 204 0 0 79 40 320 0.978 21.49 5.02 PlyA + 131193 131198 6 -3.74 6.04 PlyA - 131231 131226 6 -5.80 6.03 Term - 131498 131272 227 0 2 26 42 132 0.234 -0.66 6.02 Intr - 132052 131868 185 0 2 86 105 188 0.710 19.73 6.01 Init - 132662 132469 194 2 2 92 77 408 0.975 38.54 6.00 Prom - 163697 163658 40 -0.56 7.07 PlyA - 168964 168959 6 1.05 7.06 Term - 173513 173403 111 2 0 108 50 59 0.749 2.66 7.05 Intr - 178910 178734 177 2 0 22 81 95 0.368 2.32 7.04 Intr - 183289 183205 85 0 1 57 70 28 0.074 -2.28 7.03 Intr - 185083 184985 99 0 0 119 99 14 0.407 4.83 7.02 Intr - 186329 186148 182 2 2 113 81 39 0.418 4.37 7.01 Intr - 193412 193293 120 2 0 128 90 6 0.752 5.49 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 5263 5132 132 1 0 68 54 134 0.891 6.09 S.002 Init + 78032 78080 49 1 1 72 95 28 0.834 3.10 S.003 Init - 110784 110779 6 0 0 68 92 5 0.969 -0.44 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:57874476_58083395|GENSCAN_predicted_peptide_1|393_aa MEKAGEKQVGVGGNHRSDGSMLLELLLDVQVEAARRLYECGVEGSAHERLEHMRASVQYA DGEEVGCLPPLPGLDSFTGEVEGHCDPGFLGPGGGQRAERIPGIDIQVLGKTPLESPYPI PPVGPSVKPVPGCTGGVDDRPPSPSTARSRMPPPREDRSPRAAWGPAPRLRKMTSLFRRS SSGSGGGGTAGARGGGGGTAAPQELNNSRPARQVRRLEFNQAMDDFKTMFPNMDYDIIEC VLRANSGAVDATIDQLLQMNLEGGGSSGGVYEDSSDSEDSIPPEILERTLEPDSSDEEPP PVYSPPAYHMHVFDRPYPLAPPTPPPRPSIQDIHFQHLLIACHTQVLSHVYDEVCAVFLI QRNTCPAVATLQAAMELWGPPGRAPSCPQAQGX >gi568815581r:57874476_58083395|GENSCAN_predicted_CDS_1|1179_bp atggagaaggcaggggagaagcaggttggggtcggagggaatcacaggtcggatggatcc atgttgttggagctgctgttggatgtccaagtggaggcagcaagaaggctgtatgagtgt ggagttgaggggagtgcccatgagaggttggagcacatgagagccagtgttcagtatgca gatggagaggaggttggctgtttgccgcccctgccgggcctggattccttcacaggggag gtggagggccactgtgatcctggcttcctgggacctggaggaggacagagggctgaaaga attccagggattgacatccaggtgctggggaagacccccctagaatctccctaccccatc ccccctgtgggaccatccgtgaaaccggtgccaggatgcaccgggggtgtggatgaccgg ccgcccagccccagcacggcccggtcccgcatgccaccgccccgcgaggaccgctctcca agggctgcctggggccccgcgccccggctccgcaaaatgaccagcctgttccgccggagc agcagcggcagcggcgggggtggcaccgccggggcacgcgggggcgggggaggcacggcc gccccccaggagctcaacaacagccggcctgcccgccaggtgcgccgcctggagttcaac caggccatggacgacttcaagaccatgttccccaacatggattacgacatcatcgaatgc gtgctgcgcgccaacagcggcgctgtggacgccaccatcgaccagctgctgcagatgaac ctggagggcggtggcagcagcggcggcgtctatgaggacagctccgactcggaggacagc atccccccggagatcttggaaaggactttggaacctgatagctcggatgaagagccccca cctgtgtactccccgccagcctaccacatgcacgtgttcgaccggccctaccctctggct cccccgactccgcctccccgtccctccatccaggacattcatttccagcatttactgatt gcctgccatacgcaggtgctgagccatgtgtatgatgaggtgtgtgccgttttcctgatt cagcggaacacgtgtcctgcggttgccacgctgcaggctgccatggagctgtggggacct cctggccgggctccgagctgccctcaggcccagggcgnn >gi568815581r:57874476_58083395|GENSCAN_predicted_peptide_2|137_aa MASLSWSGFSSSCISMAATMGSSQEAKNYNSMNGRSLQKVLCEDPSQPQPNAPTPGPGDR YTHTTLAPVAGAGACQQAHTVAVTPSCNHEKGPCVFVNSTESEFPEVHPNPIPKIPRISS EEISSFSSELTSPPGTC >gi568815581r:57874476_58083395|GENSCAN_predicted_CDS_2|414_bp atggcctccctgtcctggtctgggttctcctcgagttgtatttccatggcagccacaatg ggcagcagccaggaggccaagaattacaacagcatgaatggcagaagcctacagaaggtg ctctgtgaggatccctcccaaccgcagcccaacgcccccacccctggccctggagaccgc tacacacacacaaccttggccccagttgcaggtgctggtgcctgccaacaagcacacaca gttgcagtcacaccctcgtgtaaccatgaaaaaggcccatgtgtctttgtcaattctact gaatcagaatttcctgaggtgcacccaaaccccatccccaagatcccaaggatttctagc gaggaaatatcatccttctcgtctgaactgaccagtcctccagggacctgctga >gi568815581r:57874476_58083395|GENSCAN_predicted_peptide_3|301_aa MIKGLSSHTDKPTKPCTYKELKFTITKGASAWPPAAGILLRHWRGPKKAHTKPKRQGKEE CSVKKEYLKPSNLEGPIFQMMWLQMEVHCLTYKTFHEQKINLDGVKPLGSAESGQRVSHP ALLHTLQLPGDPSQVSPSSTSPSQRWHHIQTEVPVALTPNNREDPASHLIQQGPNHMVAQ VLRDHPGKAHSSRYKTLTADQTPPPTPTTASEPPGLQPLTPAAYRGPPTECEGDGSLLLS LWLYEAQIAGAYGIIYCENEADGEPLGSGLEMVSSDRTVGKTGHFPSEGPPAQGKCQGMN G >gi568815581r:57874476_58083395|GENSCAN_predicted_CDS_3|906_bp atgatcaaaggcttaagctcccacaccgacaaacccactaagccatgcacatacaaggag ctcaagttcaccatcaccaaaggtgcctctgcatggcctcctgctgctggcattttgctg agacactggagaggaccgaaaaaggcccacaccaagccaaaaagacaaggcaaagaggaa tgcagtgtcaagaaagaatatttaaagccaagtaaccttgaaggccctatattccagatg atgtggctacagatggaggtgcactgcctgacctacaagactttccacgagcaaaaaata aacctagatggtgtcaagcccctgggatctgcagaaagtggacagcgggtttcacaccca gccctgctgcacacgctccagcttcctggagacccgagccaggtttctcccagcagcacc tctccatcccagagatggcaccatattcaaactgaggtaccagtggcactgacccccaac aacagagaggacccagcttcacacttgatccagcaggggcccaaccacatggtggctcag gtgctgcgggatcacccagggaaggctcattcctctcgctacaagacgctgactgcggat cagacacctccacccactcccaccactgcctctgagcccccagggctccagcccctcaca ccagccgcctacagagggccacctactgaatgtgaaggtgatggaagccttttgctaagt ctgtggctgtatgaagctcagatcgctggagcatatggcattatctactgtgagaatgag gctgatggagagccactgggctcaggcttggaaatggtgagctctgatcggaccgtgggg aaaacaggacatttcccttctgaaggtcccccagcccagggcaaatgccagggcatgaat ggttga >gi568815581r:57874476_58083395|GENSCAN_predicted_peptide_4|392_aa MEANWTAFLFQAHEASHHQQQAAQNSLLPLLSSAVEPPDQKPLLPIPITQKPQGAPETLK DAIGIKKEKPKTSFVCTYCSKAFRDSYHLRRHESCHTGIKLVSRPKKTPTTVVPLISTIA GDSSRTSLVSTIAGILSTVTTSSSGTNPSSSASTTAMPVTQSVKKPSKPVKKNHACEMCG KAFRDVYHLNRHKLSHSDEKPFECPICNQRFKRKDRMTYHTCTAAFATKDRLRTHMVRHE GKVSCNICGKLLSAAYITSHLKTHGQSQSINCNTCKQGISKTCMSEETSNQKQQQQQQQQ QQQQQQQQQQHVTSWPGKQVETLRLWEEAVKARKKGMAESVRLLYRISKSQLYESMGPGL WELWGLRFAYPVGDVPLYATISCCASQPATIM >gi568815581r:57874476_58083395|GENSCAN_predicted_CDS_4|1179_bp atggaggccaactggaccgcgttcctgttccaggcccatgaagcttcccatcaccaacag caggcagcacagaacagcttgctgcccctcctgagctctgccgtggagccccctgatcag aaaccattgcttccaataccaataactcagaaacctcagggtgcaccagaaacattaaag gatgccattgggattaaaaaagaaaaacccaaaacttcatttgtgtgcacttactgcagt aaagctttcagggacagctatcacctgaggcgccacgaatcctgccacacagggatcaag ttggtgtcccggccaaagaaaacccccaccacggtggttccccttatctctaccatcgct ggggacagcagccgaacttcgttggtctcgaccattgcaggcatcttgtcaacagtcact acatcttcctcgggcaccaaccccagtagcagtgccagcaccacagctatgccagtgacc cagtctgtcaagaaacccagtaagcctgtcaagaagaaccatgcttgtgagatgtgtggg aaggccttccgagatgtgtaccatctcaatcgacacaagctctcccattcagatgagaaa ccctttgagtgtcctatttgtaatcagcgcttcaagaggaaggaccggatgacttaccat acgtgcactgctgcctttgccaccaaagacagactgcggacacacatggtgcgccatgaa ggcaaggtatcatgtaacatctgtgggaagctcctgagtgcagcatacatcaccagccac ttaaagactcatgggcagagccaaagtatcaactgtaatacatgtaaacaaggcatcagt aaaacatgcatgagtgaagagaccagtaaccaaaagcagcagcagcagcagcagcagcag cagcagcagcagcaacaacaacaacaacaacatgtgacaagctggccagggaagcaagta gaaacactgagactgtgggaagaagctgttaaagcaaggaagaaagggatggctgaatct gtccgtttattgtatcgcatatccaaatcacagctgtacgagtcaatgggaccaggctta tgggaactgtgggggctgcgttttgcatacccagtgggagatgtgcccctctatgctacc atctcctgctgtgcttctcaacctgccacgataatgtga >gi568815581r:57874476_58083395|GENSCAN_predicted_peptide_5|67_aa MGSTKSVTNHLMYESEICYDGENSVVILCFSLGSNCDSCCCFCYGFCYDYGFEIEIFHNL DFWAHQL >gi568815581r:57874476_58083395|GENSCAN_predicted_CDS_5|204_bp atgggttctacaaaaagtgtcaccaatcatcttatgtacgagagcgagatctgctatgac ggggagaatagcgtggtgatcctctgcttctccttggggagtaactgcgactcctgctgt tgcttctgctacggcttctgctacgactacggcttcgagatcgagatcttccataacttg gacttctgggcccatcaactttaa >gi568815581r:57874476_58083395|GENSCAN_predicted_peptide_6|201_aa MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRDIDLKNRRGGPPFAFVE FEDPRDAEDAVYGRDGYDYDGYRLRVEFPRSGRGTGRGGGGGGGGGAPRGRYGPPSRRSE NRVVVSGLPPSGSWQDLKDHMREAGDVCYADVYRDGTGVVEFVRKEDMTYAVRKLDNTKF RSHEVGYTRILFFDQNWIQWS >gi568815581r:57874476_58083395|GENSCAN_predicted_CDS_6|606_bp atgtcgggaggtggtgtgattcgtggccccgcagggaacaacgattgccgcatctacgtg ggtaacttacctccagacatccgaaccaaggacattgaggacgtgttctacaaatacggc gctatccgcgacatcgacctcaagaatcgccgcgggggaccgcccttcgccttcgttgag ttcgaggacccgcgagacgcggaagacgcggtgtatggtcgcgacggctatgattacgat gggtaccgtctgcgggtggagtttcctcgaagcggccgtggaacaggccgaggcggcggc gggggtggaggtggcggagctccccgaggtcgctatggccccccatccaggcggtctgaa aacagagtggttgtctctggactgcctccaagtggaagttggcaggatttaaaggatcac atgcgtgaagcaggtgatgtatgttatgctgatgtttaccgagatggcactggtgtcgtg gagtttgtacggaaagaagatatgacctatgcagttcgaaaactggataacactaagttt agatctcatgaggtaggttatacacgtattcttttctttgaccagaattggatacagtgg tcttaa >gi568815581r:57874476_58083395|GENSCAN_predicted_peptide_7|257_aa EGQTALALLPGKHPGVYPTFSARAILRILAGDEHLKSGVECTVALFSCQLGSSSLLLLST PPREKGEESALIAWRTVQQIHVRASLGMSPKLPRLLVRRKRSLSPRCETDPKTPCNFLLK ISTKKHSHSHSSSWSSRTSGKEHLPTWIWQPKPSKPPEHHVQYADGNDPVEMENLMLQMR EWIIAGVMLFNKQEKKGSSVREEGLASDRHMDDSFIMRGGKQNEDQMFSGVAPSSSFIRS TNMHHAIMLYCSGQSFC >gi568815581r:57874476_58083395|GENSCAN_predicted_CDS_7|774_bp gaaggacaaacagcactggctctcctcccaggaaagcaccctggcgtttaccccacattt tcagcaagagccatattgaggattcttgctggagatgagcatcttaagagtggggtggag tgcacagtcgctctgttctcctgccaacttggcagctcatcgctgctgttactatcaaca cctccaagggaaaagggagaggagtcagctctgatcgcctggcgcactgtgcagcaaatt catgtcagagccagccttggaatgtctcccaagctcccaaggctcctggtgaggaggaaa aggagcctttctcccagatgtgaaactgatcctaagaccccctgtaatttcctgctgaaa atctccaccaagaaacacagccacagccacagttcttcctggagttcaagaacctcaggc aaagaacacctgcccacatggatctggcagcccaagccctctaagcccccagagcatcat gttcaatatgctgatgggaatgatccagttgagatggaaaatttgatgctgcagatgaga gagtggataattgctggagtgatgctttttaacaagcaagagaagaagggatccagtgtg cgagaggaggggctggcctcagacagacacatggacgattcattcatcatgagaggagga aagcaaaatgaggaccagatgttctctggtgttgctcccagctcttcattcattcgctcc acaaatatgcaccatgctatcatgctctactgctctgggcagtcattctgctag