GENSCAN 1.0 Date run: 6-Nov-116 Time: 02:06:20 Sequence gi568815579f:54639506_54756112 : 116607 bp : 45.45% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3986 4025 40 1 1 65 105 25 0.705 2.25 1.02 Intr + 4415 4585 171 0 0 56 48 105 0.589 3.21 1.03 Term + 12580 12725 146 2 2 109 41 100 0.638 5.47 1.04 PlyA + 13936 13941 6 1.05 2.00 Prom + 21500 21539 40 -8.26 2.01 Init + 23529 23562 34 2 1 87 89 82 0.757 6.30 2.02 Intr + 24249 24533 285 1 0 55 94 152 0.824 9.81 2.03 Intr + 24681 24980 300 1 0 118 94 115 0.688 11.81 2.04 Intr + 25294 25344 51 2 0 128 105 -38 0.493 0.78 2.05 Intr + 26894 26931 38 0 2 126 65 25 0.254 1.98 2.06 Intr + 27797 27944 148 1 1 -25 77 138 0.194 1.21 2.07 Intr + 31926 32142 217 1 1 124 49 115 0.213 8.76 2.08 Intr + 32541 32582 42 0 0 74 94 48 0.117 1.36 2.09 Intr + 57288 57559 272 0 2 104 94 28 0.032 2.19 2.10 Intr + 57707 58009 303 0 0 127 94 79 0.603 8.96 2.11 Intr + 58201 58440 240 2 0 -2 94 244 0.608 13.52 2.12 Intr + 58690 58992 303 2 0 93 94 146 0.611 12.26 2.13 Intr + 61931 62006 76 0 1 92 68 35 0.330 0.47 2.14 Intr + 64448 64491 44 2 2 84 92 24 0.112 0.28 2.15 Intr + 68883 68933 51 1 0 96 109 -13 0.160 0.48 2.16 Intr + 69097 69381 285 2 0 84 110 111 0.308 10.21 2.17 Intr + 69531 69797 267 1 0 104 33 88 0.633 2.40 2.18 Intr + 70050 70346 297 1 0 117 94 259 0.981 26.25 2.19 Intr + 70602 70904 303 1 0 117 94 177 0.997 17.76 2.20 Term + 78786 78946 161 1 2 46 37 154 0.472 4.20 2.21 PlyA + 78952 78957 6 1.05 3.00 Prom + 81894 81933 40 -1.36 3.01 Init + 84992 85025 34 1 1 90 97 56 0.902 5.89 3.02 Intr + 86548 86832 285 2 0 129 71 73 0.990 7.01 3.03 Intr + 88142 88405 264 0 0 63 66 213 0.459 14.08 3.04 Intr + 88880 88915 36 0 0 65 94 40 0.429 0.53 3.05 Intr + 95748 95852 105 1 0 107 89 40 0.956 6.19 3.06 Intr + 96315 96367 53 1 2 109 92 21 0.922 3.23 3.07 Term + 96466 96591 126 0 0 111 41 68 0.894 2.68 3.08 PlyA + 97529 97534 6 1.05 4.00 Prom + 100526 100565 40 -5.96 4.01 Init + 101602 101654 53 0 2 74 53 42 0.117 -0.07 4.02 Intr + 102590 102774 185 2 2 56 94 198 0.359 16.63 4.03 Intr + 104290 104583 294 2 0 77 94 194 0.892 15.88 4.04 Intr + 107830 107880 51 2 0 141 117 -20 0.945 5.08 4.05 Intr + 112144 112248 105 2 0 113 106 31 0.994 7.59 4.06 Intr + 112711 112763 53 2 2 114 92 21 0.999 3.73 4.07 Term + 112862 113014 153 1 0 109 42 152 0.975 10.62 4.08 PlyA + 113528 113533 6 1.05 5.03 PlyA - 113937 113932 6 1.05 5.02 Term - 115558 115316 243 1 0 131 39 87 0.582 4.00 5.01 Intr - 116522 116244 279 2 0 23 67 150 0.247 4.07 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 48994 48864 131 1 2 48 89 107 0.852 6.42 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579f:54639506_54756112|GENSCAN_predicted_peptide_1|118_aa MGLTAQGHARRDEAWTCNFPEASPGLDAMVRIPQCCVDGRAEGGKETPRGGSEKKKKPPV TLTWTGQTQKDPCCRGPLEVTYAQLDHQAFTWRTAQAVSPESMVPMAESSMYTALARH >gi568815579f:54639506_54756112|GENSCAN_predicted_CDS_1|357_bp atgggacttactgcccaaggtcatgcacgcagggatgaagcatggacctgcaactttcct gaagcatctccagggctggatgccatggtaaggatcccgcaatgctgtgttgatggacgg gctgaaggagggaaggagaccccacggggaggctctgagaagaagaaaaagcccccagtc actctcacttggacaggacagactcagaaagacccctgctgcaggggacccctggaggtg acatatgctcagctggaccaccaggccttcacttggaggacagcccaggctgtgtcccca gagtccatggtgcccatggctgagtctagcatgtacacagccctcgccaggcactga >gi568815579f:54639506_54756112|GENSCAN_predicted_peptide_2|1238_aa MIPTFTALLCLGPLPKPTLWAEPGSVISWGNSVTIWCQGTLEAREYRLDKEESPAPWDRQ NPLEPKNKARFSIPSMTEDYAGRYRCYYRSPVGWSQPSDPLELVMTGAYSKPTLSALPSP LVTSGKSVTLLCQSRSPMDTFLLIKERAAHPLLHLRSEHGAQQHQAEFPMSPVTSVHGGT YRCFSSHGFSHYLLSHPSDPLELIVSGSLEGPRPSPTRSVSTAGPAQLLTSREKTSLSSE KPGRRGPRDPLMDEPLQAEETAVQRPRGSSELLQEGRVRLQPNGQGFYSKSTLSAQPSPV LTSGGKVTPLCPSRLGFDRFILTVEGEHKLFCLLDPEKQSDGQFQALILADPMTSSHRWS APSDPPGHPDRRPTLWAKPGSVISWRSPMTMWCQGTLEAQEYHLYKEGSTEPWDRTNPLE TRNKARYSIPSMTQHHAVRYQCYYLSPAGWSEPSDPLELVMTGFYSKPTLSALPSPVVAS GGKVTLRCGSQKGYHHFVLMKEGEHQLPRTLDSQQLHSGGFQALFPVGPVTPSHRWRFTC YYYYMNTPQVWSHPSDPLEILPSETLTLQCGSDVGYDRFTLYKEGECDFLQRPGQQPQAG LSQANFTLGPVRGSHGGQYRCSGAHNLSSEWSAPSDPLDILIAGQIPGRPSLSVQLWPTV ASGENVTLLCQSQEWMHTFLLTKEGAAHPLLCLRSKYGAHKYQAEFPMSPVTSAHTGTYR CYGSLSSDPYLLSHPSGPVELVVSDAICVVSPEISPPGINTLAFEVTTQWRLQETFLLSS TVPCRNSSPWLSLGQKAQALAGTLPKPSLWAEPGSVITWESPMTLWCQGTLDTQGYYLTK EGNPMTWYQQSPPEPRNKTNFFIPSMREHHAGRYHCHYLSPAGWSERSEPLELVVTGAHR KPTLSALPSPVVTSGENVTIQCSSRVGFHRFILIEEGENKLSWMLDSQELSKGLSLVPGP VPCGPCGCQSPVDVQMLWALHELPLGVSRKPSLLTLQGPVVAPGENLTLQCGSDVGYDKF TLYKEGGHDLVQGSGRQPQAGLSQANFTLGPVRVSHGGQYRCYGAHNLSSEWSAPSDPLS ILIAGQIRGRPSLSVQPGPTVASGENVTLLCQSREQLDTFLLTKEGAAHHPLRLRSEHQA QQHQAEFPMSPVTSAHAGTYRCYSSRRFFPYLLSHPSDPLELVVSEIEKIVKDYYEHLYT NKLENLEEMDKFLVTQNLSYFNQEESENLNRPITSSEI >gi568815579f:54639506_54756112|GENSCAN_predicted_CDS_2|3717_bp atgatccccaccttcacggctctgctctgcctcgggcccctccccaaacccaccctctgg gctgagccaggctctgtgatcagctgggggaactctgtgaccatctggtgtcaggggacc ctggaggctcgggagtaccgtctggataaagaggaaagcccagcaccctgggacagacag aacccactggagcccaagaacaaggccagattctccatcccatccatgacagaggactat gcagggagataccgctgttactatcgcagccctgtaggctggtcacagcccagtgacccc ctggagctggtgatgacaggagcctacagtaaacccaccctttcagccctgccgagtcct cttgtgacctcaggaaagagcgtgaccctgctgtgtcagtcacggagcccaatggacact tttcttctgatcaaggagcgggcagcccatcccctactgcatctgagatcagagcacgga gctcagcagcaccaggctgaattccccatgagtcctgtgacctcagtgcacggggggacc tacaggtgcttcagctcacacggcttctcccactacctgctgtcacaccccagtgacccc ctggagctcatagtctcaggatccttggagggtcccaggccctcacccacaaggtccgtc tcaacagctggtccagcccagctgctgacgtccagggagaaaacttctctgtcatctgag aagcctggacggagagggccacgtgatcctctaatggacgagcccctgcaggcagaggaa acagccgtgcaaaggccccgaggcagcagcgagctcttgcaggaaggccgcgtgaggctg cagccaaatgggcaaggattctacagcaaatccaccctctcagctcagcccagccctgtc ctgacctcaggaggtaaggtgacaccattatgtccttcacggctgggatttgacaggttc attctgaccgtggaaggtgaacacaagctattctgtctcctggaccccgaaaaacagtcc gatggacagttccaagccctgatcctggcagaccccatgacttctagccacaggtggtca gcccccagtgacccccctggacatcctgatcgcaggcccaccctctgggctaagccaggc tctgtgatcagctggagaagccccatgaccatgtggtgtcaggggaccctggaagcccag gagtaccatctatataaagagggaagcacagagccctgggacagaacgaatccactggag accaggaacaaggccagatactccatcccatccatgacacagcaccatgcagtgagatat cagtgttactatctcagccctgcgggctggtcagagcccagtgaccccctggagctggtg atgacaggattctacagcaaacccaccctctcagccctgcccagccctgtggtggcctca ggagggaaagtgaccctccgatgtggctcacagaagggatatcaccattttgttctgatg aaggaaggagaacaccagctcccccggaccctggactcacagcagctccacagtgggggg ttccaggccctgttccctgtgggccccgtgacccccagccacaggtggaggttcacatgc tattactattatatgaacaccccccaggtgtggtcccaccccagtgaccccctggagatt ctgccctcagagaccctgaccctccagtgtggctctgatgtcggctacgacagattcact ctgtacaaggagggggaatgtgacttcctccagcgccctggccagcagccccaggctggg ctctcccaggccaacttcaccctgggccctgtgaggggctcccacgggggccagtacaga tgctccggtgcacacaacctctcctccgagtggtcggcccccagtgaccccctggacatc ctgatcgcaggacagatccctggcagaccctccctctcagtgcagttgtggcccacagtg gcctcaggagagaacgtgaccctgctgtgtcaatcacaagagtggatgcacactttcctt ctgaccaaggagggggcagcccatcccctgctgtgtctgagatcaaagtacggagctcat aagtaccaggctgaattccccatgagtcctgtgacctcagcccacacggggacctacagg tgctacggctcactcagctccgacccctacctgctgtctcaccccagtggccccgtggag ctcgtggtctcagatgctatctgtgtagtttctcctgaaatatcaccacctggaatcaac acactggcatttgaagtcacgacccaatggcgtctacaagagaccttccttctcagctca actgtgccctgcagaaactcttctccatggctgagtctgggccagaaagcccaagcactt gcagggaccctccccaaacccagcctctgggctgagccaggctctgtgattacctgggag agccccatgaccctctggtgccaggggaccctggatacccagggttactatctcaccaag gaaggaaaccccatgacctggtaccaacagagcccaccagagcccaggaacaagaccaac ttcttcatcccatccatgagagagcaccatgcagggagataccactgtcactatctcagc cctgcaggctggtcagagcgcagcgagcccctggagctggtggtgacaggagcccacaga aaacccactctctcagccctgccgagccctgtggtgacctcaggagagaacgtgaccatc cagtgtagctcaagggtgggatttcacaggttcattttgattgaggaaggagaaaacaag ctctcctggatgctggactcacaggaactctccaaggggctgtcccttgtccctggccct gttccctgtgggccgtgtggctgccagtcaccggtggatgttcagatgctatgggcatta cacgaacttcccctgggcgtgtctaggaagccctccctcctgaccctgcagggccctgtc gtggcccctggggagaatctgaccctccagtgtggctctgatgtcggctatgacaaattc actctgtacaaggaggggggacatgacctcgtccagggctctggccggcagccccaggct gggctctcccaggccaacttcaccctgggccctgtgagggtctcccacgggggccagtac agatgctacggtgcacacaacctctcctccgagtggtcggcccccagtgaccccctgagc atcctgatcgcaggacagatccgtggcagaccctccctctcggtgcagccgggccccacg gtggcctcaggagagaacgtgaccctgctgtgtcagtcacgggagcagttggacactttc cttctgaccaaggagggggcagcccatcacccactgcgtctgagatcagagcaccaagct cagcagcaccaggctgaattccccatgagtcctgtgacctcagcccacgcggggacctac aggtgctacagctcacgcagattcttcccctacctgctgtctcaccccagtgaccccctg gagctcgtggtctcagaaatagaaaagatcgtcaaagactattatgaacacctctataca aacaagctagaaaacctagaagaaatggataaattcctggtaacacaaaatttatcatat ttcaaccaggaagaaagtgaaaacctgaacagaccaataacaagttcagaaatttaa >gi568815579f:54639506_54756112|GENSCAN_predicted_peptide_3|300_aa MSLMVVSMACVGGQDKPFLSAWPGTVVSEGQHVTLQCRSRLGFNEFSLSKEDGMPVPELY NRIFRNSFLMGPVTPAHAGTYRCCSSHPHSPTGWSAPSNPVVIMVTGPLVKSGETVILQC WSDVRFERFLLHREGITEDPLRLVGQLHDAGSQVNYSMGPMTPALAGTYRCFGSVTHLPY ELSAPSDPLDIVVVVELSSSQDTMAPGNSRNLHVLIGTSVVIIPFAILLFFLLHRWCANK KNAVVMDQEPAGNRTVNREDSDEQDPQEVTYAQLNHCVFTQRKITRPSQRPKTPPTDTSV >gi568815579f:54639506_54756112|GENSCAN_predicted_CDS_3|903_bp atgtcgctcatggtcgtcagcatggcgtgtgttggtggtcaggacaagcccttcctctct gcctggcccggcactgtggtgtctgaaggacaacatgtgactcttcagtgtcgctctcgt cttgggtttaatgaattcagtctgtccaaagaagacgggatgcctgtccctgagctctac aacagaatattccggaacagctttctcatgggccctgtgaccccagcacatgcagggacc tacagatgttgcagttcacacccacactcccccactgggtggtcggcacccagcaaccct gtggtgatcatggtcacaggtcccctggtgaaatcaggagagacggtcatcctgcaatgt tggtcagatgtcaggtttgagcgcttccttctgcacagagaggggatcactgaggacccc ttgcgcctcgttggacagctccacgatgcgggttcccaggtcaactattccatgggtccc atgacacctgcccttgcagggacctacagatgctttggttctgtcactcacttaccctat gagttgtcggctcccagtgaccctctggacatcgtggtcgtagtggagctgtcatcgtcc caggacaccatggccccaggtaactccagaaacctgcacgttctgattgggacctcagtg gtcatcatcccctttgctatcctcctcttctttctccttcatcgctggtgtgccaacaaa aagaatgctgttgtaatggaccaagagcctgcagggaacagaacagtgaacagggaggac tctgatgaacaagaccctcaggaggtgacatacgcacagttgaatcactgcgttttcaca cagagaaaaatcactcgcccttctcagaggcccaagacacccccaacagataccagcgtg taa >gi568815579f:54639506_54756112|GENSCAN_predicted_peptide_4|297_aa MLLIIFYSSNRKPTLEPREGKFKDTLHLIGEHHDGVSKANFSIGPMMQDLAGTYRCYGSV THSPYQLSAPSDPLDIVITGLYEKPSLSAQPGPTVLAGESVTLSCSSRSSYDMYHLSREG EAHERRFSAGPKVNGTFQADFPLGPATHGGTYRCFGSFRDSPYEWSNSSDPLLVSVTGNP SNSWPSPTEPSSETGNPRHLHVLIGTSVVIILFILLLFFLLHRWCCNKKNAVVMDQEPAG NRTVNREDSDEQDPQEVTYAQLNHCVFTQRKITRPSQRPKTPPTDIIVYTELPNAEP >gi568815579f:54639506_54756112|GENSCAN_predicted_CDS_4|894_bp atgcttctgataattttctacagcagcaacaggaaaccaacactggaacccagagaaggg aagtttaaggacactttgcacctcattggagagcaccatgatggggtctccaaggccaac ttctccatcggtcccatgatgcaagaccttgcagggacctacagatgctacggttctgtt actcactccccctatcagttgtcagctcccagtgaccctctggacatcgtcatcacaggt ctatatgagaaaccttctctctcagcccagccgggccccacggttctggcaggagagagc gtgaccttgtcctgcagctcccggagctcctatgacatgtaccatctatccagggagggg gaggcccatgaacgtaggttctctgcagggcccaaggtcaacggaacattccaggccgac tttcctctgggccctgccacccacggaggaacctacagatgcttcggctctttccgtgac tctccatacgagtggtcaaactcgagtgacccactgcttgtttctgtcacaggaaaccct tcaaatagttggccttcacccactgaaccaagctccgaaaccggtaaccccagacacctg catgttctgattgggacctcagtggtcatcatcctcttcatcctcctcctcttctttctc cttcatcgctggtgctgcaacaaaaaaaatgctgttgtaatggaccaagagcctgcaggg aacagaacagtgaacagggaggactctgatgaacaagaccctcaggaggtgacatatgca cagttgaatcactgcgttttcacacagagaaaaatcactcgcccttctcagaggcccaag acacccccaacagatatcatcgtgtacacggaacttccaaatgctgagccctga >gi568815579f:54639506_54756112|GENSCAN_predicted_peptide_5|173_aa AGPSWTEVKLTLSAYLHPRTGLSAVQRPSLQAQIPTTSPYPHHKPISPLQANISTLGLYL HSRPISPLQADISIIGPYRQSRPISPIQAKISTDSPTHAMLTTVSDMVLPVQTGGRAPAQ LSSAQDVIWRPAHAVYMLTTSWEGDVRRLILPCMRPSGCSLKSGTRLPGNCSH >gi568815579f:54639506_54756112|GENSCAN_predicted_CDS_5|522_bp gcaggcccttcctggactgaagttaaactcaccctcagtgcctacctgcacccaagaaca gggctgtcggctgtgcagagacccagcctccaagcccagatccccaccacaagcccatat ccccaccacaagcccatatctccactccaggccaatatttccaccctaggcctgtatctc cactccaggcccatatctccactccaggccgatatttccatcataggcccatatcgccaa tccaggcccatatcgccaatccaggccaagatctccactgactcaccaacacacgccatg ctgacgaccgtgagcgacatggtgctgccggtgcagacaggcggccgtgccccagctcag ctcagcagcgcacaggatgttatttggcgccctgcccatgcagtttacatgttgaccaca tcatgggagggtgacgtacgcaggctcattctaccttgcatgaggcccagtgggtgctcg ctcaagagcggaacacggcttcctggaaattgttctcactag