GENSCAN 1.0    Date run: 24-Oct-119    Time: 21:38:39

Sequence gi568815596f:203767943_203972809 : 204867 bp : 40.82% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 PlyA -    111    106    6                               1.05
 1.01 Sngl -   5369   4992  378  2  0   80   33   661 0.982  55.71
 1.00 Prom -   7719   7680   40                              -4.35

 2.08 PlyA -   9124   9119    6                               1.05
 2.07 Term -  17662  17514  149  2  2   54   32   187 0.007   6.78
 2.06 Intr -  20794  20684  111  2  0  113   95   -15 0.001   1.13
 2.05 Intr -  42542  42387  156  0  0  115   26    95 0.074   5.06
 2.04 Intr -  44780  44630  151  2  1   77   97     0 0.116  -1.39
 2.03 Intr -  45390  45275  116  1  2   46   68    96 0.421   2.65
 2.02 Intr -  51662  51476  187  2  1   71   35   107 0.292   2.04
 2.01 Init -  54581  54357  225  2  0   56   44   193 0.262  10.22
 2.00 Prom -  54637  54598   40                              -2.45

 3.05 PlyA -  54876  54871    6                               1.05
 3.04 Term -  62772  62379  394  2  1   16   42   205 0.404   2.22
 3.03 Intr -  65848  65763   86  1  2   58   98    73 0.416   3.00
 3.02 Intr -  66517  66487   31  0  1   63   89    48 0.469  -0.39
 3.01 Init -  72851  72712  140  2  2   79   16   156 0.851   7.16
 3.00 Prom -  73493  73454   40                              -7.55

 4.00 Prom +  73811  73850   40                              -2.25
 4.01 Init +  76349  76461  113  1  2   76   49    83 0.081   2.93
 4.02 Intr +  81124  81250  127  1  1   62   47   119 0.158   4.96
 4.03 Intr +  81488  81653  166  1  1   29   68   106 0.212   1.31
 4.04 Term +  84691  84764   74  2  2   64   47    77 0.115  -1.71
 4.05 PlyA +  85709  85714    6                               1.05

 5.05 PlyA -  85873  85868    6                               1.05
 5.04 Term -  86234  86058  177  2  0    0   41   168 0.570   0.00
 5.03 Intr -  86487  86310  178  2  1   37   86    93 0.713   3.00
 5.02 Intr -  89895  89757  139  1  1   10  108    65 0.469  -0.70
 5.01 Init -  90889  90826   64  1  1   49   38   129 0.606   5.36
 5.00 Prom -  96857  96818   40                              -4.55

 6.00 Prom +  99888  99927   40                              -5.35
 6.01 Init + 100001 100109  109  1  1   85  113     8 0.392   3.68
 6.02 Intr + 102644 102991  348  0  0   60   85   402 0.370  31.50
 6.03 Intr + 103436 103545  110  0  2  105  101     6 0.898   2.68
 6.04 Term + 104766 104870  105  2  0   89   54    79 0.916   2.03
 6.05 PlyA + 105151 105156    6                              -1.75

 7.02 PlyA - 105292 105287    6                               1.05
 7.01 Sngl - 107904 107605  300  0  0   61   48   291 0.739  17.94
 7.00 Prom - 108478 108439   40                              -7.15

 8.15 PlyA - 108740 108735    6                               1.05
 8.14 Term - 110645 110397  249  2  0   27   37   160 0.026  -0.48
 8.13 Intr - 117022 116951   72  1  0   69   53    92 0.103   2.48
 8.12 Intr - 121614 121549   66  0  0   83   42    76 0.061   0.68
 8.11 Intr - 124480 124230  251  2  2   85    0   162 0.111   3.33
 8.10 Intr - 138370 138331   40  1  1  105   48    75 0.042   2.08
 8.09 Intr - 143563 143360  204  1  0   59   53   114 0.589   3.47
 8.08 Intr - 147603 147392  212  1  2   21   79   213 0.444  11.41
 8.07 Intr - 149466 149319  148  0  1   78   35    68 0.744  -0.61
 8.06 Intr - 150565 150416  150  1  0   84  107    70 0.914   7.94
 8.05 Intr - 167851 167801   51  1  0   92   92    44 0.489   3.39
 8.04 Intr - 172671 172575   97  2  1   23   93    89 0.501   1.99
 8.03 Intr - 175342 175192  151  2  1   64   88    99 0.009   5.80
 8.02 Intr - 176541 176433  109  0  1   -2  105    92 0.012   0.84
 8.01 Init - 188786 188700   87  2  0   82   95    41 0.082   4.89
 8.00 Prom - 194137 194098   40                              -6.15

 9.04 PlyA - 194811 194806    6                               1.05
 9.03 Term - 197441 197244  198  2  0   31   43   191 0.680   5.32
 9.02 Intr - 199057 198735  323  2  2   78   19   161 0.425   2.85
 9.01 Init - 201622 201520  103  1  1   87   78   109 0.860  10.25

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:203767943_203972809|GENSCAN_predicted_peptide_1|125_aa
MNDEGSPIKVTLATLKMSVQPTASLGGFEITPPVVLQLKCGSGPVHISRQCLVAVEEDAE
SEDEEEEDVNLLSISGKRSASGGGSKVPQKKGKLAADEDDDDDDDDDDDENDNDFDDEEA
EEKHE

>gi568815596f:203767943_203972809|GENSCAN_predicted_CDS_1|378_bp
atgaatgatgaaggcagtccaattaaagtcacactggcaactttgaaaatgtctgtacag
ccaactgcttcccttgggggctttgaaatcacaccaccagtggtcttacagttgaagtgt
ggttcagggccagtgcatattagtagacagtgcttagtagctgtggaggaagatgcagag
tcagaagatgaagaggaggaggatgtgaacctcttaagcatatctggaaagcggtctgcc
tctggaggtggtagcaaggttccacagaaaaaaggaaaacttgctgctgatgaagatgat
gatgatgatgatgatgatgatgatgatgaaaatgataatgattttgatgatgaggaagct
gaagaaaagcacgagtaa

>gi568815596f:203767943_203972809|GENSCAN_predicted_peptide_2|364_aa
MGTGSRIHQSEAPTLESESDASDTKKQRLRPFLEVAAVAARLLFHGSSGCGAVPSLHLEQ
TFYCDSNRIQHLDRPLPTSTQGPATKIFWQNSPKTYLVNNDVFLRFPSPVFAQKVTPQAP
PHWYKRTTSRLDMIWRKGIYTKDLKAWAQTDTGTPTFLAAYPQEPKAGNNPNVYRQNLDL
VATAMSTAKNLFPRLPADRGGHMTISCPARCKLPVLEWGFWDSSLKGLIFQWALVDWISS
SPPTLCLWPFSSHQSHEQLSTHEHPRADQPWLMGIGYTALWSMCNMMFFKEINAFLFPIM
AGPFFRPILSLKINEVPSTSQGLRSAGGRHLDWQAAPPAAPVRDPLGEASWAPESGGEVE
NLYV

>gi568815596f:203767943_203972809|GENSCAN_predicted_CDS_2|1095_bp
atgggcacaggatccaggatccaccaatcagaagcacccactctggagtctgaatcagat
gctagtgacacaaagaaacagcgactcagaccctttctagaggtggcggcagtggcagca
agattgcttttccacggcagcagtggctgcggtgctgtccccagcctgcacctggagcag
acattctactgtgatagcaacagaatacagcatctggacaggccactgcccacgtctaca
cagggccccgcaaccaagatattctggcagaactctccgaaaacttatttagtcaataac
gatgtatttttaaggtttccaagcccagtttttgctcagaaagtaaccccacaggctcct
ccacactggtacaagaggaccacctcaagattggacatgatctggagaaagggtatatac
acaaaagacttgaaagcatgggctcaaacagatactggtacaccaacattcctggcagcg
tatccacaagagccaaaagctggaaataacccgaatgtctaccgacagaaccttgatttg
gtggcgacagcaatgagcacggctaaaaatctgtttcccaggctccctgcagataggggt
ggccacatgactataagctgtccagcgagatgtaagcttccagtgcttgagtggggtttc
tgggacagttcattaaaaggccttatctttcagtgggcactggtggactggatttccagc
agcccaccaactctttgtctgtggcctttctctagccaccagagtcatgagcagctctcc
acccatgagcaccccagagcagaccaaccatggctgatgggaattgggtacactgccctt
tggtcaatgtgcaacatgatgttcttcaaagagattaatgcttttctatttccaataatg
gcaggcccgtttttcagaccaatcctttcactgaaaataaatgaagtcccatcgaccagc
caagggctgaggagtgccggcggacgacacctggactggcaggcagctccacctgcagcc
ccggtgcgggatccactgggtgaagccagctgggctcctgaatctggtggggaagtggag
aacctttatgtctag

>gi568815596f:203767943_203972809|GENSCAN_predicted_peptide_3|216_aa
MELESCSSRSRNQPALAAIISPQNGRSSPLIDVFASYESPHLFSRGWLTSSPQQQQQLMW
FLSSAWTLADTGRYCSSETNGDKGGRIPDVSHNGIPTIEMLWDSEAWAWSGHVTSYNWEM
TIVLTGSFSPHDREWMSQHPPLLVLPSGPPGNEMSPPTPNPKPMRGAAFLALFIPNSALW
KQDCWLAKKPEITFPLLVEANAERHLERLLSSRFST

>gi568815596f:203767943_203972809|GENSCAN_predicted_CDS_3|651_bp
atggagctggaatcatgttctagcagaagcagaaatcagccagctctggctgccatcatc
tctccccagaatggccgcagcagccctctgattgatgtctttgcctcctatgagtcccct
cacctcttctccagaggctggctcacttcttcaccccagcagcagcagcagttgatgtgg
tttctgtcttctgcctggaccctagcagatacaggtcgctactgcagctctgaaacaaac
ggggataaaggtggcagaataccagatgtcagtcataatggaatccccaccatcgaaatg
ctctgggacagtgaagcgtgggcatggtcagggcatgtgacctcatacaactgggaaatg
actattgttctcaccgggtcattcagtccccatgatcgtgaatggatgagccagcaccct
cctctcctagtgttgccgagtgggcctccaggaaatgaaatgtctccaccaacaccaaac
cccaaaccaatgagaggagcagcatttctggccttattcatacccaactcagctctgtgg
aaacaagactgctggcttgccaagaagccagaaataacatttcctctactggttgaggca
aatgctgagagacaccttgagaggttactgagttcacgcttttctacctag

>gi568815596f:203767943_203972809|GENSCAN_predicted_peptide_4|159_aa
MREKEEGKDCMEKVDLHQVNEKQGSSGRRRKSKFQRASSMFLPPSVMVSLKELDCLNTPQ
NITLIYYIDNITMIGQVEQEMIWKAASFEYCSEQERAPQQVQAVVQHPVLFGSCNLADPI
VSKASVVEKMQCGVYELSGIDPNVRCHTKKNDVSEKHSE

>gi568815596f:203767943_203972809|GENSCAN_predicted_CDS_4|480_bp
atgagggaaaaggaagaggggaaagactgcatggaaaaagtagatcttcatcaggttaat
gaaaaacagggcagctcaggcagaaggagaaaatcaaagtttcagagagcctcgtctatg
tttctgccaccctctgtgatggtgagtctgaaggaactggactgcctgaacaccccacag
aacatcactctgatctattacattgacaacatcacgatgattggacaagttgagcaagag
atgatatggaaggctgccagctttgagtactgttcagagcaggaaagggcaccacagcag
gtccaggctgtggtgcagcaccctgtgctatttgggtcatgtaatctggcagatcctatt
gtgtcaaaggcatcagtggtagaaaagatgcagtgtggagtatatgagctctctgggatt
gatccaaatgtcagatgtcacaccaagaaaaatgatgtttctgagaagcattcagaataa

>gi568815596f:203767943_203972809|GENSCAN_predicted_peptide_5|185_aa
MTAWRYEISFSGNENALEQHSEMLTSRKHQKHVFPVLLKYKQVKKLPSNASPAFHPLDKI
DNCEHENKTAAEYQAPLGDQSRCDDKLLPGIIRITALEPKEKITHTGRHIEHKKETRQAA
GYVFIGKRKELVRARQPREPKQKLSPLRSKQKGPWKSWALPDYGGSPVAISKTRSHVRAQ
RCEMH

>gi568815596f:203767943_203972809|GENSCAN_predicted_CDS_5|558_bp
atgactgcttggcggtatgagatttccttttcgggtaatgaaaacgctctggagcagcat
agtgaaatgctgacaagtagaaaacaccagaaacatgtgtttcctgtgctcttaaaatat
aagcaggtcaagaagcttccatcaaatgcatctcctgcatttcatcctttggataaaata
gacaattgtgagcatgagaacaagactgcagccgagtatcaggcccccctgggggatcag
agcaggtgtgatgacaaacttttaccaggcataattcgaataaccgcactagaacctaaa
gaaaagataacgcatacaggcaggcatatagagcacaagaaggaaaccaggcaggcagct
ggatacgtcttcattgggaagcgaaaagaactggtcagagcaaggcagccaagagagcca
aagcagaaactcagtcctctcagatccaagcaaaagggaccgtggaaaagctgggcctta
cctgactatggaggcagtccagtggccatttccaagaccaggagccatgtgagagctcag
cgatgtgaaatgcactga

>gi568815596f:203767943_203972809|GENSCAN_predicted_peptide_6|223_aa
MACLGFQRHKAQLNLATRTWPCTLLFFLLFIPVFCKAMHVAQPAVVLASSRGIASFVCEY
ASPGKATEVRVTVLRQADSQVTEVCAATYMMGNELTFLDDSICTGTSSGNQVNLTIQGLR
AMDTGLYICKVELMYPPPYYLGIGNGTQIYVIDPEPCPDSDFLLWILAAVSSGLFFYSFL
LTAVSLSKMLKKRSPLTTGVYVKMPPTEPECEKQFQPYFIPIN

>gi568815596f:203767943_203972809|GENSCAN_predicted_CDS_6|672_bp
atggcttgccttggatttcagcggcacaaggctcagctgaacctggctaccaggacctgg
ccctgcactctcctgttttttcttctcttcatccctgtcttctgcaaagcaatgcacgtg
gcccagcctgctgtggtactggccagcagccgaggcatcgccagctttgtgtgtgagtat
gcatctccaggcaaagccactgaggtccgggtgacagtgcttcggcaggctgacagccag
gtgactgaagtctgtgcggcaacctacatgatggggaatgagttgaccttcctagatgat
tccatctgcacgggcacctccagtggaaatcaagtgaacctcactatccaaggactgagg
gccatggacacgggactctacatctgcaaggtggagctcatgtacccaccgccatactac
ctgggcataggcaacggaacccagatttatgtaattgatccagaaccgtgcccagattct
gacttcctcctctggatccttgcagcagttagttcggggttgtttttttatagctttctc
ctcacagctgtttctttgagcaaaatgctaaagaaaagaagccctcttacaacaggggtc
tatgtgaaaatgcccccaacagagccagaatgtgaaaagcaatttcagccttattttatt
cccatcaattga

>gi568815596f:203767943_203972809|GENSCAN_predicted_peptide_7|99_aa
MGSSRNASTMLKKRHEQRRVLGAMLEEELTLEPLATLSGSLVIHSLACDKWFQGGGLQRP
GSPCMVGQAPFGFMDYLGLELPQDPDPRNGDDDEGWNPM

>gi568815596f:203767943_203972809|GENSCAN_predicted_CDS_7|300_bp
atgggaagtagcaggaatgcgagtaccatgctgaagaagagacatgagcagaggcgtgtg
ctaggagccatgttggaggaggaactgactctggagcccctggccactctatccggctcc
cttgtcatccacagtcttgcctgtgacaagtggttccaaggaggagggttgcaacgccca
ggctccccctgcatggtaggccaagccccttttgggttcatggattatttgggcttggag
ctgccccaggaccctgatcccagaaatggagatgatgatgaaggttggaatcccatgtga

>gi568815596f:203767943_203972809|GENSCAN_predicted_peptide_8|628_aa
MQTTTKAAHPMGNQNFSWQQSCDSGLEKKSKRKDSRERETLLSQTHTLTGEAERLFVEKF
PTLSGAPTQMRRNQKTNSGNMTKQGSSTPPKNHTSSPAMDPNQEEILHLPEKEFRRSQQT
VEEYLQPLRRKAAIQDSYSQERYQSLIKMRKTEAQTGNEPGPSHILLEFWLETEGREESE
AGVFVPKATFLQNHSSFQGAFSHSYGDPCSSAWVLNKLDYPQYGLNPVPLKIPMLKLQVP
VPQNVTVFGDKGFKEIMKVKCDHMDPVPRNSSSEIQLDFKIEKLDSLHRLFAWLQTVEAT
VLPSERRVVQCRDEELAAETGLSGLNQASQSLELQISSGIRSHRSRNPIVNCTCEESRLC
APYENPVLDDLRQNSFFLKLSLLQPLTFVENLSSMKPVPGAKKFRLFQNVVSLEAHRHLI
IAHSTLGPFNPNQSFIRAAHLYAMQSSGGTTEDAQLCPDTPETLSVPYESDSPFLLIPLT
VSPCCHFDEHMRLYPLRLLRDTVRRSSPDTGPSILDFLVFRIQPAPAETPTSTRSKTKRE
NNHQSPQGEEKSYQWDACCTANMTRWKQEENDKQEEKGDDVLQESAAHEKSSVLSSKICD
HPLKYISYNSSQRKLLLFVQIDSITLCD

>gi568815596f:203767943_203972809|GENSCAN_predicted_CDS_8|1887_bp
atgcagactacaacaaaggctgcacatcctatgggtaaccagaacttcagctggcaacaa
agttgtgattctggattggaaaaaaagtcaaaaagaaaagactcacgggaaagggagacc
ctcctctcccaaacacacactctgaccggagaagctgagcgtctgtttgtggagaagttt
ccgactttatctggagcacctacccaaatgagaaggaaccagaaaaccaactctggtaat
atgacaaaacaaggctcttcaacacccccaaaaaatcacactagttcaccagcaatggat
ccaaaccaagaagaaattcttcatctacctgaaaaagaattcaggagaagccagcagaca
gtggaggaatatctgcaaccactaagaagaaaggctgccatccaggactcctattcccag
gaaagatatcagtcactcatcaagatgaggaaaactgaggcacagacaggtaacgaacct
ggcccaagtcatatactgctggagttctggctggagacggaagggcgggaggaaagtgag
gctggagtgtttgttcccaaggcgaccttcctgcaaaaccacagctcctttcagggagcc
ttctcgcacagctacggcgacccttgcagctctgcctgggttctgaacaaattagattac
ccacaatatgggctgaatcctgtccccctcaaaattcctatgttgaaattacaagtccca
gtacctcagaatgtgactgtatttggagataagggctttaaagagataatgaaggtaaaa
tgcgatcatatggacccagtgcccaggaattcttccagtgaaatccaacttgatttcaaa
atagaaaaactggattcccttcacagactctttgcttggctacaaactgttgaagcaacc
gtgcttccttcagagaggcgggttgtccagtgcagagatgaagagctggctgcagagaca
ggcctttcgggtttgaatcaggcttcgcagtctctcgagctgcagatcagcagcggcatt
agatcccataggagcaggaaccctattgtgaactgcacgtgcgaggaatctaggttgtgc
gctccttatgagaatccagtgcttgatgatctgaggcagaacagtttcttcctaaaacta
tccctcctccaaccccttacattcgtggaaaatttgtcttccatgaaaccggtccctggt
gccaaaaagtttcgccttttccagaatgtcgtatctttggaagcacacagacatttgatc
attgcccactctaccctggggccattcaatcccaaccagtcatttataagagctgcacac
ttgtatgccatgcaatcttcaggaggcaccactgaggacgcacagctatgtcctgacaca
ccagagaccctctcagtcccatatgagtcggattcacccttccttctaattcctttgacc
gtctccccttgctgccattttgatgagcacatgcgtttatatccattacggttgttgagg
gacacagtaagaaggtcctcaccagataccggtccttctatcttggacttcttagtcttc
cgaattcaacctgcaccagcagaaactcctacttctacaaggtctaagacgaaacgggaa
aacaaccaccaaagcccacaaggtgaagagaaaagctaccagtgggatgcatgctgcacg
gctaacatgacaaggtggaagcaggaagaaaatgacaagcaggaagaaaaaggtgatgat
gtccttcaggaaagtgctgctcatgagaagtcatctgtgttatcttctaaaatttgcgat
catcccctgaaatatatttcttacaattcctctcagaggaagctgcttctttttgtccaa
atagatagtattactttgtgtgattaa

>gi568815596f:203767943_203972809|GENSCAN_predicted_peptide_9|207_aa
MVLSCDECGLHEEKDITRQRTLVVRIQAKTPTEGETKETHFIRGPKTPAPVTDWEGSLPL
VFNHCRDTSLIIHPCFRGVRPRRDACLGPSCLAASPTFLGEGQVPQPLLCLYPFSAFLGG
KKPPTTSPSPLAASPTFLGEEQELATSARNLATRPRNACSPGFLLSRVPSVRDPTGNRTV
PLTLAATPRAPGTLAQGSLTPSQTFLA

>gi568815596f:203767943_203972809|GENSCAN_predicted_CDS_9|624_bp
atggtcctctcttgtgacgaatgtggacttcatgaagaaaaagacataacaaggcagaga
acattagttgtcagaattcaagctaagactcctacagaaggagagacaaaggagacacat
tttatccgtggacccaaaactccggcaccggtcacggactgggaaggcagccttcccttg
gtgtttaatcattgcagggacacctctctgattattcacccatgtttcagaggtgtcaga
ccacgcagggatgcctgccttggtccttcatgcttagcggcaagtcccacttttctgggg
gaggggcaagtaccccaaccccttctgtgtctctaccccttctccgcctttctggggggc
aagaaacccccgaccacttctccttcacccttagcggcaagtcccacttttctaggggag
gagcaagagcttgctacaagtgccagaaatctggccaccaggccaaggaatgcctgcagc
ccaggattcctcctaagccgcgtcccatctgtgcgggaccccactggaaatcggactgtc
ccactcaccttggcagccactcccagagcccctggaactctggcccaaggctctctgact
ccttcccagaccttcttggcttag