GENSCAN 1.0    Date run: 5-Nov-116    Time: 05:40:24

Sequence gi568815586f:93369286_93601218 : 231933 bp : 41.48% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1791   1946  156  2  0   62   67   104 0.358   4.99
 1.02 Intr +   8842   9136  295  0  1   17   75   225 0.027   9.76
 1.03 Intr +   9323   9408   86  0  2   68   96    29 0.031   0.32
 1.04 Intr +  25231  25434  204  0  0   55   78   166 0.394  10.77
 1.05 Intr +  29483  29570   88  1  1   38  111    65 0.517   2.42
 1.06 Term +  29892  30094  203  1  2   98   44    85 0.893   1.67
 1.07 PlyA +  30257  30262    6                              -0.45

 2.08 PlyA -  30420  30415    6                               1.05
 2.07 Term -  30500  30448   53  0  2   96   43    56 0.008  -1.49
 2.06 Intr -  41589  41449  141  1  0   82   70   142 0.758  11.20
 2.05 Intr -  42014  41768  247  2  1   98  115   242 0.999  24.01
 2.04 Intr -  72518  72416  103  1  1   91   20    76 0.009   0.36
 2.03 Intr -  74018  73960   59  2  2   86   96    25 0.010  -0.14
 2.02 Intr -  79962  79882   81  0  0  127   59    32 0.294   3.12
 2.01 Init -  84550  84545    6  1  0   72  131     9 0.702   4.03
 2.00 Prom -  86210  86171   40                              -7.15

 3.04 PlyA -  86638  86633    6                               1.05
 3.03 Term -  98686  98265  422  2  2    3   49   223 0.225   4.37
 3.02 Intr - 101488 101402   87  2  0   64   82    53 0.253   1.32
 3.01 Init - 107432 107315  118  2  1   35   69   157 0.932   8.91
 3.00 Prom - 116178 116139   40                              -4.35

 4.00 Prom + 124722 124761   40                              -2.85
 4.01 Sngl + 127766 128071  306  1  0   49   43   238 0.992  11.02
 4.02 PlyA + 128148 128153    6                               1.05

 5.00 Prom + 135340 135379   40                              -5.15
 5.01 Init + 136830 136922   93  2  0   86   84    23 0.234   0.13
 5.02 Intr + 140618 140736  119  1  2   71   82    27 0.275  -1.26
 5.03 Intr + 147978 148100  123  0  0   64  103    57 0.530   3.58
 5.04 Term + 152168 152546  379  2  1   81   52   162 0.622   5.18
 5.05 PlyA + 152617 152622    6                               1.05

 6.02 PlyA - 154176 154171    6                               1.05
 6.01 Sngl - 160588 160139  450  1  0   16   49   277 0.741  12.56
 6.00 Prom - 162733 162694   40                              -5.25

 7.00 Prom + 163263 163302   40                              -6.05
 7.01 Init + 165965 166037   73  1  1   86   33    83 0.071   3.88
 7.02 Intr + 195232 195321   90  2  0  111   98    35 0.638   5.85
 7.03 Intr + 201113 201257  145  0  1  100   21   188 0.134  11.72
 7.04 Intr + 202288 202342   55  1  1  127   27    55 0.056   1.36
 7.05 Intr + 203511 203751  241  2  1   81   91    83 0.007   4.10
 7.06 Term + 205437 205894  458  1  2   30   36   385 0.014  21.80
 7.07 PlyA + 206143 206148    6                               1.05

 8.00 Prom + 208714 208753   40                              -5.95
 8.01 Init + 222109 222529  421  0  1   67   -9   313 0.442  14.19

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +   9564   9633   70  0  1   78   47   129 0.878   4.23
S.002 Term +  60311  60451  141  1  0    1   47   234 0.870   7.55
S.003 Term +  72863  73135  273  1  0   37   51   231 0.838   8.79
S.004 Term +  88645  88776  132  0  0  131   49    70 0.800   4.51
S.005 Term - 114883 114683  201  1  0   61   38   138 0.835   2.51
S.006 Term + 203400 203602  203  1  2  123   46   158 0.875  11.67

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:93369286_93601218|GENSCAN_predicted_peptide_1|343_aa
LSGSSLNPVCLRNLHSVSIPSPRVWGGTLSGMRVLGITIGKVEKDYSPTLEQRPVPPASE
RPPAPGLTGLAAPPAPTSAPGRSCARLALAPTPRRGGGGRAPTAAAGAAAAAAGARLYDE
VQAQPDADLRPRGLQEAGGVPVLPERAGGRDETKGARPAPVSPSGHSRRRLVPGKVPGRN
RVITLQRFENVVYIRSASQAGSIQLWFTLQVLLVSSSRYPDQWIVPGGGMEPEEEPGGAA
VREVYEEQNQDRKHRTYVYVLTVTEILEDWEDSVNIGRKREWFKVEDAIKVLQCHKPVHA
EYLEKLKLGCSPANGNSTVPSLPDNNALFVTAAQTSGLPSSVR

>gi568815586f:93369286_93601218|GENSCAN_predicted_CDS_1|1032_bp
ctatctggaagctctctgaacccagtgtgtttacggaatcttcatagtgttagcattcct
tcccccagggtatggggcggaaccctctctggaatgagggtcttaggaatcacaatcgga
aaggtggagaaagattacagtcctaccttggagcagcgtcccgtcccgcccgcgtcggag
cggccgccggccccgggactgaccggcctcgccgcacctcccgcaccgactagcgctccc
gggcgctcctgcgcccgactcgccctcgcccccactccccggcggggtggcggcggccgg
gcccccacggcggcggccggagcagcagcagcagcagcaggagcccgcctctatgatgaa
gttcaagcccaaccagacgcggacctacgaccgcgagggcttcaagaagcgggcggcgtg
cctgtgcttccggagcgagcaggaggacgagatgagacaaagggggctcgcccagcccca
gtttctccatctgggcactcgagaaggcgcctggtccccgggaaggtcccgggccgcaat
agagttataaccttgcagaggtttgagaatgttgtttatatacgaagtgcttctcaggct
ggaagtatacagttatggttcaccttacaggtgctgctggtgagtagcagccggtaccca
gaccagtggattgtcccaggaggaggaatggaacccgaggaggaacctggcggtgctgcc
gtgagggaagtttatgaggagcagaaccaagaccgaaagcacagaacatatgtttatgtt
ctaacagtcactgaaatattagaagattgggaagattctgttaatattggaaggaagaga
gagtggttcaaagtagaagatgctatcaaagttctccagtgtcataaacctgtacatgca
gagtatctggaaaagctaaagctgggttgttccccagccaatggaaattctacagtccct
tcccttccggataataatgccttgtttgtaaccgctgcacagacctctgggttgccatct
agtgtaagatag

>gi568815586f:93369286_93601218|GENSCAN_predicted_peptide_2|229_aa
MPIASDIIEKHGRNQSDLRQTGDGEGGRKLNAISQAISYAWHTLPYKVSWAGSAASRLLR
PAARSGRPEAARRGVGRGGEGSRETQRLLAEPVPGIKAEPDESNARYFHVVIAGPQDSPF
EGGTFKLELFLPEEYPMAAPKVRFMTKIYHPNVDKLGRICLDILKDKWSPALQIRTVLLS
IQALLSAPNPDDPLANDVAEQWKTNEAQAIETDGFAEYANAVNEKTILG

>gi568815586f:93369286_93601218|GENSCAN_predicted_CDS_2|690_bp
atgccgatagcaagtgacataatagagaaacatgggagaaatcaatcagacttgagacag
acaggtgatggtgaaggaggaagaaagctcaatgccatttcacaggccatttcctatgct
tggcatactcttccctacaaggtcagctgggcaggcagcgcggcctctcgcctccttcgg
cccgcggcccgctctgggaggcccgaggcggcgcggagaggggttggccgcggcggcgag
ggaagtcgggaaacccagcgtttgctggcagaaccagttcctggcatcaaagccgaacca
gatgagagcaacgcccgttattttcatgtggtcattgctggccctcaggattcccccttt
gagggagggacttttaaacttgaactattccttccagaagaatacccaatggcagcccct
aaagtacgtttcatgaccaaaatttatcatcctaatgtagacaagttgggaagaatatgt
ttagatattttgaaagataagtggtccccagcactgcagatccgcacagttctgctatcg
atccaggccttgttaagtgctcccaatccagatgatccattagcaaatgatgtagcggag
cagtggaagaccaacgaagcccaagccatagaaacagatggatttgctgaatatgctaat
gctgtgaatgagaaaacaattttggggtag

>gi568815586f:93369286_93601218|GENSCAN_predicted_peptide_3|208_aa
MVNIGKDGPHRKATKQIVQVQESLFSGDDVQGALKNEKITTQMQLEAIILSELMQKQPNT
TCSNLQVGAQRKARPAGSQIIHPRKIISEIKSLSPEFNPKTKATEPLRVRGLRNRLPRAP
ELFDLKESLEPQAASPPHGRPPGNKGGAIRNAQPHAGSSAGRTKAGDAGNLALPGTTGGR
PLQLPTNTVGLPSFEEVLDRRSRGKRLP

>gi568815586f:93369286_93601218|GENSCAN_predicted_CDS_3|627_bp
atggtaaacattggcaaagatgggccacacagaaaggctaccaaacaaattgtgcaggtt
caagaaagccttttcagtggagatgatgtccaaggtgctctgaagaatgagaaaataaca
acacagatgcagctggaggccattatcctaagtgaattaatgcaaaaacagccaaatact
acatgttctaacttacaagtaggagcccagagaaaagcacgccctgctgggagtcagatt
attcaccccaggaagatcatttcagagatcaagtccctcagcccggaattcaaccccaaa
acaaaagctacagagcctctgagggtcagaggtttgaggaacaggctaccaagagcacca
gaactcttcgacctaaaggagtccctagagccacaagctgcatcacctccccacgggcgt
ccgccagggaacaaagggggcgctataagaaacgcgcagccacatgccggctcttctgca
ggaaggacaaaggctggggatgcaggcaacctcgccctgccgggaacgacaggcgggaga
cctctacaactgccgaccaacactgtgggactgcccagtttcgaggaagtcctcgacagg
aggagtagagggaagcggttaccttga

>gi568815586f:93369286_93601218|GENSCAN_predicted_peptide_4|101_aa
MNSETESVIKKQQPKSSGPDRSTAEFYQMCKEPVPILLILFPQQQQQKIEEEGLLPNSFY
KNGIILIPKSDRDITEKQNYGPVSLINMDAKILNKIRANQI

>gi568815586f:93369286_93601218|GENSCAN_predicted_CDS_4|306_bp
atgaattctgaaactgaatcagtaataaaaaaacaacaaccaaaaagctctggaccagac
agatccacagctgaattctaccagatgtgcaaagagccagtaccaatcctactgatacta
ttcccccaacaacaacaacaaaaaatcgaggaggaaggactcctccccaactcattctac
aaaaacggcatcatcctgatacccaaatctgacagagacataacagaaaaacaaaactat
gggccagtatccctgataaacatggatgctaaaatcctcaacaaaatacgagcaaaccaa
atctag

>gi568815586f:93369286_93601218|GENSCAN_predicted_peptide_5|237_aa
MGFHHVGQAGLELLTSGDLPTLAPQNTGITRQSGTLVTIDKPTLTHHYHPKPIVSIRVHS
WCCTIYGFEQSGNWYLETKIWCSIPFKKLVCEGLWLYDHTWDWKYRGDTWLSLTLASEWG
EMCMLHSAIQGYSLLPSGSGGFPIPWGRGLLFWIEGTQPAGKEERMWKTEDHALAFNGQP
WGGGLKSLPLTSHWSEQSFMQAHRCNCKTAGNYALAVCRGRRETQTLVNVSLLGFTL

>gi568815586f:93369286_93601218|GENSCAN_predicted_CDS_5|714_bp
atggggtttcaccacgttggccaggctggtcttgaactcctgacctcaggtgatctgccc
accttggccccccaaaatactgggattacaaggcagagtggcacattggttacaattgat
aaacctacactgacacatcattatcacccaaagcctattgtttccattagggttcactct
tggtgctgtacaatctatgggtttgaacaaagtggaaactggtatctagaaaccaagatc
tggtgtagtattcctttcaagaagcttgtctgtgaagggttatggctgtatgatcacacg
tgggactggaagtacagaggggatacctggctcagtctaactctggccagtgaatgggga
gaaatgtgtatgctccacagtgccattcagggatacagcctccttccaagtggtagtggt
ggcttccccattccttggggccgtgggctcctcttctggattgagggcacccagccagct
ggcaaggaagaaagaatgtggaaaacagaggatcatgcgctagctttcaatggccagccc
tggggggggggactcaaatcacttccactcacatcccattggtcagaacaaagttttatg
caagctcaccgctgcaactgcaagacagctgggaactatgctctagcggtgtgccgtgga
agaagagaaactcaaacactggtgaatgttagcttgcttggcttcactctctga

>gi568815586f:93369286_93601218|GENSCAN_predicted_peptide_6|149_aa
MKDGKIKSRDFALITFESPADAKNAANDMKGKSLDGKAIQVEQANEPSFESGVRWRPPSP
SRNRGPPRCLRCGRKGSGGAKGHPSGGGHMDDGGYTLNANMSSSRTGVPSLLGTRGQFHE
RQFFYRPGVGGDGFGMKLFHLRSSGIRFS

>gi568815586f:93369286_93601218|GENSCAN_predicted_CDS_6|450_bp
atgaaggatgggaaaatcaagtccagagactttgcattgattacttttgagagccctgca
gatgctaagaatgctgccaacgatatgaagggaaagtctttggatggaaaagcaattcaa
gtagagcaagccaacgaaccatcttttgaaagtggtgttaggtggagaccaccatctcct
tcgagaaacagaggccctccaagatgtctgagatgtggaagaaaaggtagtggaggagca
aaagggcatccctcaggtggaggacacatggatgatggcggatacactcttaatgccaac
atgagttcttctaggacaggggtccccagccttttgggcaccaggggccagtttcatgaa
agacaatttttctatagaccaggggtcgggggggatggtttcgggatgaaactgttccac
ctcagatcatcaggtattagattctcataa

>gi568815586f:93369286_93601218|GENSCAN_predicted_peptide_7|353_aa
MVEGEGEAGMSFMAREGGGDRERGGTIVLGPGSWRLSRNFKKAKLFLVAHQSLKGPDQGD
RLRSPAVGTGTSTASPRHRCVLEPESLVSEDPQSLLHAHPPQKVAAGGKPGSSPRVEVSQ
PGLVLGFALTSRKDANPSLTPARAATCLCRGDPSLMTLRCLEPSGNGGEGTRSQWGTAGS
AEEPSPQAARLAKALRELGQTGWYWGSMTVNEAKEKLKEAPEGTFLIRDSSHSDYLLTIS
VKTSAGPTNLRIEYQDGKFRLDSIICVKSKLKQFDSVVHLIDYYVQMCKDKRTGPEAPRN
GTVHLYLTKPLYTSAPSLQHLCRLTINKCTGAIWGLPLPTRLKDYLEEYKFQV

>gi568815586f:93369286_93601218|GENSCAN_predicted_CDS_7|1062_bp
atggtggaaggtgaaggagaagcaggcatgtctttcatggccagagaaggaggaggagat
agagaaaggggaggtactattgttttggggccaggctcatggagattaagcaggaacttc
aagaaagccaagctcttcttggtagcacaccagagtctgaagggacctgatcaaggggac
cgcctccggtccccggccgtgggcaccgggacgagcacggcgtccccacgccatcgatgt
gtcttagagccggagagtctggtttccgaggacccacagtcgctcctgcacgcccacccc
ccgcaaaaggtcgcagccggagggaaacccggcagcagtccgagagtggaggtgtcccag
cccggactcgttttgggattcgcactgacttcaaggaaggacgcgaacccttctctgacc
ccagctcgggcggccacctgtctttgccgcggtgacccttctctcatgaccctgcggtgc
cttgagccctccgggaatggcggggaagggacgcggagccagtgggggaccgcggggtcg
gcggaggagccatccccgcaggcggcgcgtctggcgaaggccctgcgggagctcggtcag
acaggatggtactggggaagtatgactgttaatgaagccaaagagaaattaaaagaggca
ccagaaggaactttcttgattagagatagctcgcattcagactacctactaacaatatct
gttaaaacatcagctggaccaactaatcttcgaatcgaataccaagacggaaaattcaga
ttggactctatcatatgtgtcaaatccaagcttaaacaatttgacagtgtggttcatctg
atcgactactatgttcagatgtgcaaggataagcggacaggtccagaagccccccggaac
ggcactgttcacctttatctgaccaaaccgctctacacgtcagcaccatctctgcagcat
ctctgtaggctcaccattaacaaatgtaccggtgccatctggggactgcctttaccaaca
agactaaaagattacttggaagaatataaattccaggtataa

>gi568815586f:93369286_93601218|GENSCAN_predicted_peptide_8|141_aa
MGARRKRVCVRVWLSVCLSAARGISEFRLFLLEEFSNGVPAGAGSSVYDVHTLSLKREVI
LAPRGDAAEVLFVCGQGRVRSPTELWRGGSSQTRPGRRSGLPGPYVTQRRQQGAGHRLTN
VKAFHFGSAPDECGIESRRLX

>gi568815586f:93369286_93601218|GENSCAN_predicted_CDS_8|423_bp
atgggagcgaggagaaagcgtgtgtgcgttcgtgtgtggctgtctgtctgtctgtctgcg
gcaagaggaatctcagagtttcggttatttctccttgaagaattttcaaacggagttcct
gcaggggctggctctagcgtctacgatgtgcacacactgtctctcaagagggaagtgatc
ctggcgccacggggtgatgcagcagaagttctttttgtctgtggccagggtagagtgcgg
agccccacggagctgtggcgtggcggctcctcccagacgcgtccggggcgccgctcgggt
ctcccaggaccttatgtaacccagcgtcggcagcaaggagccggtcacaggctgaccaac
gtcaaggcgtttcactttggaagcgcaccagatgagtgtggtattgagagcagacggctg
gnn