GENSCAN 1.0    Date run: 6-Nov-116    Time: 21:15:14

Sequence gi568815596f:203836815_204057883 : 221069 bp : 40.23% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   4939   4978   40                              -2.25
 1.01 Init +   7477   7589  113  0  2   76   49    83 0.080   2.93
 1.02 Intr +  12252  12378  127  0  1   62   47   119 0.162   4.96
 1.03 Intr +  12616  12781  166  0  1   29   68   106 0.218   1.31
 1.04 Term +  15819  15892   74  1  2   64   47    77 0.118  -1.71
 1.05 PlyA +  16837  16842    6                               1.05

 2.05 PlyA -  17001  16996    6                               1.05
 2.04 Term -  17362  17186  177  1  0    0   41   168 0.568   0.00
 2.03 Intr -  17615  17438  178  1  1   37   86    93 0.709   3.00
 2.02 Intr -  21023  20885  139  0  1   10  108    65 0.468  -0.70
 2.01 Init -  22017  21954   64  0  1   49   38   129 0.604   5.36
 2.00 Prom -  27985  27946   40                              -4.55

 3.00 Prom +  31016  31055   40                              -5.35
 3.01 Init +  31129  31237  109  0  1   85  113     8 0.392   3.68
 3.02 Intr +  33772  34119  348  2  0   60   85   402 0.370  31.50
 3.03 Intr +  34564  34673  110  2  2  105  101     6 0.898   2.68
 3.04 Term +  35894  35998  105  1  0   89   54    79 0.916   2.03
 3.05 PlyA +  36279  36284    6                              -1.75

 4.02 PlyA -  36420  36415    6                               1.05
 4.01 Sngl -  39032  38733  300  2  0   61   48   291 0.739  17.94
 4.00 Prom -  39606  39567   40                              -7.15

 5.15 PlyA -  39868  39863    6                               1.05
 5.14 Term -  41773  41525  249  1  0   27   37   160 0.026  -0.48
 5.13 Intr -  48150  48079   72  0  0   69   53    92 0.103   2.48
 5.12 Intr -  52742  52677   66  2  0   83   42    76 0.061   0.68
 5.11 Intr -  55608  55358  251  1  2   85    0   162 0.111   3.33
 5.10 Intr -  69498  69459   40  0  1  105   48    75 0.042   2.08
 5.09 Intr -  74691  74488  204  0  0   59   53   114 0.589   3.47
 5.08 Intr -  78731  78520  212  0  2   21   79   213 0.444  11.41
 5.07 Intr -  80594  80447  148  2  1   78   35    68 0.744  -0.61
 5.06 Intr -  81693  81544  150  0  0   84  107    70 0.914   7.94
 5.05 Intr -  98979  98929   51  0  0   92   92    44 0.489   3.39
 5.04 Intr - 103799 103703   97  1  1   23   93    89 0.501   1.99
 5.03 Intr - 106470 106320  151  1  1   64   88    99 0.009   5.80
 5.02 Intr - 107669 107561  109  2  1   -2  105    92 0.012   0.84
 5.01 Init - 119914 119828   87  1  0   82   95    41 0.082   4.89
 5.00 Prom - 125265 125226   40                              -6.15

 6.04 PlyA - 125939 125934    6                               1.05
 6.03 Term - 128569 128372  198  1  0   31   43   191 0.680   5.32
 6.02 Intr - 130185 129863  323  1  2   78   19   161 0.429   2.85
 6.01 Init - 132750 132648  103  0  1   87   78   109 0.881  10.25
 6.00 Prom - 142864 142825   40                              -4.55

 7.00 Prom + 145302 145341   40                              -6.95
 7.01 Init + 146736 146832   97  2  1   77   68    66 0.845   4.02
 7.02 Term + 148464 148570  107  1  2  110   48    76 0.957   3.39
 7.03 PlyA + 148779 148784    6                               1.05

 8.00 Prom + 154567 154606   40                              -5.35
 8.01 Init + 162620 162737  118  1  1   82   92   115 0.994  11.71
 8.02 Term + 164980 165032   53  2  2  121   48    72 0.959   3.11
 8.03 PlyA + 166476 166481    6                               1.05

 9.06 PlyA - 168076 168071    6                               1.05
 9.05 Term - 178124 178007  118  1  1   92   49   118 0.366   5.33
 9.04 Intr - 205066 204994   73  2  1   60   80    46 0.086  -1.45
 9.03 Intr - 206991 206909   83  2  2  124   90    17 0.654   3.86
 9.02 Intr - 210912 210804  109  1  1   46  105    47 0.482   0.62
 9.01 Intr - 214226 214020  207  0  0   19   83   234 0.594  14.13

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:203836815_204057883|GENSCAN_predicted_peptide_1|159_aa
MREKEEGKDCMEKVDLHQVNEKQGSSGRRRKSKFQRASSMFLPPSVMVSLKELDCLNTPQ
NITLIYYIDNITMIGQVEQEMIWKAASFEYCSEQERAPQQVQAVVQHPVLFGSCNLADPI
VSKASVVEKMQCGVYELSGIDPNVRCHTKKNDVSEKHSE

>gi568815596f:203836815_204057883|GENSCAN_predicted_CDS_1|480_bp
atgagggaaaaggaagaggggaaagactgcatggaaaaagtagatcttcatcaggttaat
gaaaaacagggcagctcaggcagaaggagaaaatcaaagtttcagagagcctcgtctatg
tttctgccaccctctgtgatggtgagtctgaaggaactggactgcctgaacaccccacag
aacatcactctgatctattacattgacaacatcacgatgattggacaagttgagcaagag
atgatatggaaggctgccagctttgagtactgttcagagcaggaaagggcaccacagcag
gtccaggctgtggtgcagcaccctgtgctatttgggtcatgtaatctggcagatcctatt
gtgtcaaaggcatcagtggtagaaaagatgcagtgtggagtatatgagctctctgggatt
gatccaaatgtcagatgtcacaccaagaaaaatgatgtttctgagaagcattcagaataa

>gi568815596f:203836815_204057883|GENSCAN_predicted_peptide_2|185_aa
MTAWRYEISFSGNENALEQHSEMLTSRKHQKHVFPVLLKYKQVKKLPSNASPAFHPLDKI
DNCEHENKTAAEYQAPLGDQSRCDDKLLPGIIRITALEPKEKITHTGRHIEHKKETRQAA
GYVFIGKRKELVRARQPREPKQKLSPLRSKQKGPWKSWALPDYGGSPVAISKTRSHVRAQ
RCEMH

>gi568815596f:203836815_204057883|GENSCAN_predicted_CDS_2|558_bp
atgactgcttggcggtatgagatttccttttcgggtaatgaaaacgctctggagcagcat
agtgaaatgctgacaagtagaaaacaccagaaacatgtgtttcctgtgctcttaaaatat
aagcaggtcaagaagcttccatcaaatgcatctcctgcatttcatcctttggataaaata
gacaattgtgagcatgagaacaagactgcagccgagtatcaggcccccctgggggatcag
agcaggtgtgatgacaaacttttaccaggcataattcgaataaccgcactagaacctaaa
gaaaagataacgcatacaggcaggcatatagagcacaagaaggaaaccaggcaggcagct
ggatacgtcttcattgggaagcgaaaagaactggtcagagcaaggcagccaagagagcca
aagcagaaactcagtcctctcagatccaagcaaaagggaccgtggaaaagctgggcctta
cctgactatggaggcagtccagtggccatttccaagaccaggagccatgtgagagctcag
cgatgtgaaatgcactga

>gi568815596f:203836815_204057883|GENSCAN_predicted_peptide_3|223_aa
MACLGFQRHKAQLNLATRTWPCTLLFFLLFIPVFCKAMHVAQPAVVLASSRGIASFVCEY
ASPGKATEVRVTVLRQADSQVTEVCAATYMMGNELTFLDDSICTGTSSGNQVNLTIQGLR
AMDTGLYICKVELMYPPPYYLGIGNGTQIYVIDPEPCPDSDFLLWILAAVSSGLFFYSFL
LTAVSLSKMLKKRSPLTTGVYVKMPPTEPECEKQFQPYFIPIN

>gi568815596f:203836815_204057883|GENSCAN_predicted_CDS_3|672_bp
atggcttgccttggatttcagcggcacaaggctcagctgaacctggctaccaggacctgg
ccctgcactctcctgttttttcttctcttcatccctgtcttctgcaaagcaatgcacgtg
gcccagcctgctgtggtactggccagcagccgaggcatcgccagctttgtgtgtgagtat
gcatctccaggcaaagccactgaggtccgggtgacagtgcttcggcaggctgacagccag
gtgactgaagtctgtgcggcaacctacatgatggggaatgagttgaccttcctagatgat
tccatctgcacgggcacctccagtggaaatcaagtgaacctcactatccaaggactgagg
gccatggacacgggactctacatctgcaaggtggagctcatgtacccaccgccatactac
ctgggcataggcaacggaacccagatttatgtaattgatccagaaccgtgcccagattct
gacttcctcctctggatccttgcagcagttagttcggggttgtttttttatagctttctc
ctcacagctgtttctttgagcaaaatgctaaagaaaagaagccctcttacaacaggggtc
tatgtgaaaatgcccccaacagagccagaatgtgaaaagcaatttcagccttattttatt
cccatcaattga

>gi568815596f:203836815_204057883|GENSCAN_predicted_peptide_4|99_aa
MGSSRNASTMLKKRHEQRRVLGAMLEEELTLEPLATLSGSLVIHSLACDKWFQGGGLQRP
GSPCMVGQAPFGFMDYLGLELPQDPDPRNGDDDEGWNPM

>gi568815596f:203836815_204057883|GENSCAN_predicted_CDS_4|300_bp
atgggaagtagcaggaatgcgagtaccatgctgaagaagagacatgagcagaggcgtgtg
ctaggagccatgttggaggaggaactgactctggagcccctggccactctatccggctcc
cttgtcatccacagtcttgcctgtgacaagtggttccaaggaggagggttgcaacgccca
ggctccccctgcatggtaggccaagccccttttgggttcatggattatttgggcttggag
ctgccccaggaccctgatcccagaaatggagatgatgatgaaggttggaatcccatgtga

>gi568815596f:203836815_204057883|GENSCAN_predicted_peptide_5|628_aa
MQTTTKAAHPMGNQNFSWQQSCDSGLEKKSKRKDSRERETLLSQTHTLTGEAERLFVEKF
PTLSGAPTQMRRNQKTNSGNMTKQGSSTPPKNHTSSPAMDPNQEEILHLPEKEFRRSQQT
VEEYLQPLRRKAAIQDSYSQERYQSLIKMRKTEAQTGNEPGPSHILLEFWLETEGREESE
AGVFVPKATFLQNHSSFQGAFSHSYGDPCSSAWVLNKLDYPQYGLNPVPLKIPMLKLQVP
VPQNVTVFGDKGFKEIMKVKCDHMDPVPRNSSSEIQLDFKIEKLDSLHRLFAWLQTVEAT
VLPSERRVVQCRDEELAAETGLSGLNQASQSLELQISSGIRSHRSRNPIVNCTCEESRLC
APYENPVLDDLRQNSFFLKLSLLQPLTFVENLSSMKPVPGAKKFRLFQNVVSLEAHRHLI
IAHSTLGPFNPNQSFIRAAHLYAMQSSGGTTEDAQLCPDTPETLSVPYESDSPFLLIPLT
VSPCCHFDEHMRLYPLRLLRDTVRRSSPDTGPSILDFLVFRIQPAPAETPTSTRSKTKRE
NNHQSPQGEEKSYQWDACCTANMTRWKQEENDKQEEKGDDVLQESAAHEKSSVLSSKICD
HPLKYISYNSSQRKLLLFVQIDSITLCD

>gi568815596f:203836815_204057883|GENSCAN_predicted_CDS_5|1887_bp
atgcagactacaacaaaggctgcacatcctatgggtaaccagaacttcagctggcaacaa
agttgtgattctggattggaaaaaaagtcaaaaagaaaagactcacgggaaagggagacc
ctcctctcccaaacacacactctgaccggagaagctgagcgtctgtttgtggagaagttt
ccgactttatctggagcacctacccaaatgagaaggaaccagaaaaccaactctggtaat
atgacaaaacaaggctcttcaacacccccaaaaaatcacactagttcaccagcaatggat
ccaaaccaagaagaaattcttcatctacctgaaaaagaattcaggagaagccagcagaca
gtggaggaatatctgcaaccactaagaagaaaggctgccatccaggactcctattcccag
gaaagatatcagtcactcatcaagatgaggaaaactgaggcacagacaggtaacgaacct
ggcccaagtcatatactgctggagttctggctggagacggaagggcgggaggaaagtgag
gctggagtgtttgttcccaaggcgaccttcctgcaaaaccacagctcctttcagggagcc
ttctcgcacagctacggcgacccttgcagctctgcctgggttctgaacaaattagattac
ccacaatatgggctgaatcctgtccccctcaaaattcctatgttgaaattacaagtccca
gtacctcagaatgtgactgtatttggagataagggctttaaagagataatgaaggtaaaa
tgcgatcatatggacccagtgcccaggaattcttccagtgaaatccaacttgatttcaaa
atagaaaaactggattcccttcacagactctttgcttggctacaaactgttgaagcaacc
gtgcttccttcagagaggcgggttgtccagtgcagagatgaagagctggctgcagagaca
ggcctttcgggtttgaatcaggcttcgcagtctctcgagctgcagatcagcagcggcatt
agatcccataggagcaggaaccctattgtgaactgcacgtgcgaggaatctaggttgtgc
gctccttatgagaatccagtgcttgatgatctgaggcagaacagtttcttcctaaaacta
tccctcctccaaccccttacattcgtggaaaatttgtcttccatgaaaccggtccctggt
gccaaaaagtttcgccttttccagaatgtcgtatctttggaagcacacagacatttgatc
attgcccactctaccctggggccattcaatcccaaccagtcatttataagagctgcacac
ttgtatgccatgcaatcttcaggaggcaccactgaggacgcacagctatgtcctgacaca
ccagagaccctctcagtcccatatgagtcggattcacccttccttctaattcctttgacc
gtctccccttgctgccattttgatgagcacatgcgtttatatccattacggttgttgagg
gacacagtaagaaggtcctcaccagataccggtccttctatcttggacttcttagtcttc
cgaattcaacctgcaccagcagaaactcctacttctacaaggtctaagacgaaacgggaa
aacaaccaccaaagcccacaaggtgaagagaaaagctaccagtgggatgcatgctgcacg
gctaacatgacaaggtggaagcaggaagaaaatgacaagcaggaagaaaaaggtgatgat
gtccttcaggaaagtgctgctcatgagaagtcatctgtgttatcttctaaaatttgcgat
catcccctgaaatatatttcttacaattcctctcagaggaagctgcttctttttgtccaa
atagatagtattactttgtgtgattaa

>gi568815596f:203836815_204057883|GENSCAN_predicted_peptide_6|207_aa
MVLSCDECGLHEEKDITRQRTLVVRIQAKTPTEGETKETHFIRGPKTPAPVTDWEGSLPL
VFNHCRDTSLIIHPCFRGVRPRRDACLGPSCLAASPTFLGEGQVPQPLLCLYPFSAFLGG
KKPPTTSPSPLAASPTFLGEEQELATSARNLATRPRNACSPGFLLSRVPSVRDPTGNRTV
PLTLAATPRAPGTLAQGSLTPSQTFLA

>gi568815596f:203836815_204057883|GENSCAN_predicted_CDS_6|624_bp
atggtcctctcttgtgacgaatgtggacttcatgaagaaaaagacataacaaggcagaga
acattagttgtcagaattcaagctaagactcctacagaaggagagacaaaggagacacat
tttatccgtggacccaaaactccggcaccggtcacggactgggaaggcagccttcccttg
gtgtttaatcattgcagggacacctctctgattattcacccatgtttcagaggtgtcaga
ccacgcagggatgcctgccttggtccttcatgcttagcggcaagtcccacttttctgggg
gaggggcaagtaccccaaccccttctgtgtctctaccccttctccgcctttctggggggc
aagaaacccccgaccacttctccttcacccttagcggcaagtcccacttttctaggggag
gagcaagagcttgctacaagtgccagaaatctggccaccaggccaaggaatgcctgcagc
ccaggattcctcctaagccgcgtcccatctgtgcgggaccccactggaaatcggactgtc
ccactcaccttggcagccactcccagagcccctggaactctggcccaaggctctctgact
ccttcccagaccttcttggcttag

>gi568815596f:203836815_204057883|GENSCAN_predicted_peptide_7|67_aa
MHSDFQLVLTFLKCSCDPDASETPAGWYPLLRGVISKAFLITILNTKLPQADFMENPTCN
ASSAAYF

>gi568815596f:203836815_204057883|GENSCAN_predicted_CDS_7|204_bp
atgcattctgatttccagttggttcttacattcctgaaatgcagctgtgatccagatgcc
tcagaaacaccagctggttggtatcctctcctcagaggtgtgatctcaaaggccttccta
ataaccatcctgaatactaagctgccccaagcagacttcatggagaacccaacctgtaac
gccagctcagcggcttacttctga

>gi568815596f:203836815_204057883|GENSCAN_predicted_peptide_8|56_aa
MVPDLKNKYVYLNVIDGGAQGMSESSREGDIREEKLEKEGKEPVVLQLVSLECELD

>gi568815596f:203836815_204057883|GENSCAN_predicted_CDS_8|171_bp
atggtgcctgacctgaagaacaaatatgtgtacctaaatgtcatagatggaggtgcacag
ggaatgagtgaatcttccagagagggagatatcagggaggaaaagcttgaaaaggaaggc
aaggagccagttgtactacagctggtgtctcttgagtgtgagctagattga

>gi568815596f:203836815_204057883|GENSCAN_predicted_peptide_9|196_aa
XHNELDGLGEQVVETTQIGTPRGDVHMIGSEKKHKAAQRYYFSVWHPHLKQLQNKPQLGN
VARFQQRGGERQELHGQILCKVLSKVADMPGLECIVSRGRAQRRKRPTIISSSTRAITPS
PRVTASPSKLRSTGRLLGRQGPNLFSHHRIPSAYHGAWELKLQNKLPPRNLDSQGIQRDI
TKICFSGTEAAEALNW

>gi568815596f:203836815_204057883|GENSCAN_predicted_CDS_9|591_bp
ngacataatgagctggatggattgggagagcaggtcgttgaaactacacagattgggaca
cccaggggggatgtccacatgatcgggtcagagaaaaagcacaaggcagctcagaggtat
tacttctcagtatggcatccccatttgaaacagctgcaaaacaagccacagcttgggaat
gtggcaaggtttcagcagcgtggaggtgaaaggcaagagttgcatggacagattctctgc
aaggtgctgagcaaggtagcagacatgccaggtcttgagtgcattgtgtctagaggtaga
gcacagaggaggaaaaggcctaccataatcagttcttccactagggctataaccccaagc
cccagagttactgcttctccatcaaaattgagatccacaggaagattattaggaaggcaa
ggaccaaatctgttttctcaccatcgaatccccagtgcctatcacggtgcatgggaactg
aagttgcagaacaaactgccaccaagaaatctggacagtcagggcatccagagagacata
accaagatctgcttctctggaacagaagctgctgaagccctgaattggtag