GENSCAN 1.0    Date run: 5-Nov-116    Time: 00:50:42

Sequence gi568815590f:116838244_117042698 : 204455 bp : 38.46% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +    483    754  272  1  2   89   47   177 0.924   8.36
 1.02 PlyA +    812    817    6                               1.05

 2.13 PlyA -   4462   4457    6                               1.05
 2.12 Term -   9448   9257  192  1  0   76   49   256 0.775  16.94
 2.11 Intr -  10786  10703   84  1  0   52   72    76 0.721   1.50
 2.10 Intr -  12524  12375  150  2  0   32   82   210 0.713  14.24
 2.09 Intr -  13853  13705  149  0  2   49   95   177 0.985  13.53
 2.08 Intr -  14465  14306  160  2  1   88   22   195 0.982  11.54
 2.07 Intr -  16225  16002  224  2  2   74   85   185 0.987  13.72
 2.06 Intr -  18045  17923  123  1  0  120  107   170 0.999  21.74
 2.05 Intr -  18528  18403  126  1  0   90   82   102 0.906   9.53
 2.04 Intr -  19230  19024  207  1  0   39   77   194 0.232  11.53
 2.03 Intr -  25016  24887  130  2  1   51   72   104 0.396   4.45
 2.02 Intr -  28498  28343  156  1  0   75   95    77 0.475   6.39
 2.01 Init -  35956  35876   81  1  0   83   49   158 0.509  10.42
 2.00 Prom -  37507  37468   40                              -9.35

 3.03 PlyA -  38397  38392    6                               1.05
 3.02 Term -  39426  39172  255  0  0   87   48   144 0.849   4.90
 3.01 Init -  47155  47090   66  1  0   90   72    64 0.593   6.12
 3.00 Prom -  51301  51262   40                              -7.35

 4.00 Prom +  58522  58561   40                              -4.15
 4.01 Sngl +  60206  61384 1179  1  0   60   45   702 0.922  58.82
 4.02 PlyA +  62443  62448    6                               1.05

 5.03 PlyA -  62992  62987    6                               1.05
 5.02 Term -  74781  74363  419  1  2    9   42   265 0.175   8.45
 5.01 Init -  85834  85777   58  1  1   68   64    17 0.099  -3.17
 5.00 Prom -  86167  86128   40                              -1.85

 6.00 Prom +  99135  99174   40                              -6.25
 6.01 Init + 100001 100210  210  1  0   88    3   222 0.140  12.58
 6.02 Term + 104315 104458  144  1  0  111   43   157 0.702  10.43
 6.03 PlyA + 105734 105739    6                               1.05

 7.04 PlyA - 106507 106502    6                               1.05
 7.03 Term - 108056 107857  200  0  2    4   32   215 0.971   4.08
 7.02 Intr - 108321 108142  180  1  0   44   80   100 0.906   3.72
 7.01 Init - 112042 111883  160  1  1   78   48   245 0.801  17.59
 7.00 Prom - 118883 118844   40                              -6.25

 8.00 Prom + 121309 121348   40                              -5.95
 8.01 Init + 135174 135577  404  2  2   60   27   208 0.126   8.05
 8.02 Intr + 150078 150187  110  0  2   13   76    74 0.068  -2.29
 8.03 Term + 154664 154770  107  0  2   70   53   104 0.268   2.69
 8.04 PlyA + 155422 155427    6                               1.05

 9.00 Prom + 158543 158582   40                              -4.75
 9.01 Init + 162007 162133  127  0  1   88   80    98 0.386   9.37
 9.02 Intr + 175342 175478  137  2  2   54   57    78 0.351   0.67
 9.03 Intr + 187745 187916  172  1  1  111   50    96 0.178   6.69
 9.04 Term + 197203 197378  176  2  2   57   55   180 0.660   8.54
 9.05 PlyA + 199435 199440    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590f:116838244_117042698|GENSCAN_predicted_peptide_1|90_aa
XRPYFSALCGGKQLSKLVHPLNHHPYCPFKPELCLPLSFPNSWASFKAQYLSLRPEMFPD
SPIRVPNTCCIAQSGIYKIVSIAFLFICVS
>gi568815590f:116838244_117042698|GENSCAN_predicted_CDS_1|273_bp
naaaggccctatttttctgcactctgtggagggaaacagctcagcaagcttgttcacccc
ctgaaccatcacccatactgtccctttaaaccagaactctgcttgccactctcctttcct
aactcttgggcatccttcaaggctcagtatttgtcacttcgtccagaaatgtttcctgac
tccccaatccgagtccctaatacctgctgcatagcccaatccggcatctataaaattgtt
tcaattgcattcttattcatctgtgtctcctaa
>gi568815590f:116838244_117042698|GENSCAN_predicted_peptide_2|593_aa
MAVWEEAVAGPAPAVPAPGAATFRRTRPARTMFYAHFVLSKRGPLAKIWLAAHWDKKLTK
AHVFECNLESSVESIISPKVKMALRTSGHLLLGVVRIYHRKAKYLLADCNEAFIKIKMAF
RPGDFGMDDREIMREGSAFEDDDMLVSTTTSNLLLESEQSTSNLNEKINHLEYEDQYKDD
NFGEGNDGGILDDKLISNNDGGIFDDPPALSEAGVMLPEQPAHDDMDEDDNVSMGGPDSP
DSVDPVEPMPTMTDQTTLVPNEEEAFALEPIDITVKETKAKRKRKLIVDSVKELDSKTIR
AQLSDYSDIVTTLDLAPPTKKLMMWKETGGVEKLFSLPAQPLWNNRLLKLFTRCLTPLVP
EDLRKRRKGGEADNLDEFLKEFENPEVPREDQQQQHQQRDVIDEPIIEEPSRLQESVMEA
SRTNIDESAMPPPPPQGVKRKAGQIDPEPVMPPQQVEQMEIPPVELPPEEPPNICQLIPE
LELLPEKEKEKEKEKEDDEEEEDEDASGGDQDQEERRWNKRTQQMLHGLQRALAKTGAES
ISLLELCRNTNRKQAAAKFYSFLVLKKQQAIELTQEEPYSDIIATPGPRFHII
>gi568815590f:116838244_117042698|GENSCAN_predicted_CDS_2|1782_bp
atggctgtctgggaggaggcggtggcgggcccggcgcccgcggtgccagccccgggagca
gcaaccttccgccgaactcggccagccagaacaatgttctacgcacattttgttctcagt
aaaagagggcctctggccaaaatttggctagcggcccattgggataagaagctaaccaaa
gcccatgtgttcgagtgtaatttagagagcagcgtggagagtatcatctcaccaaaggtg
aaaatggcattacggacatcaggacatctcttactgggagtagttcgaatctatcacagg
aaagccaaataccttcttgcagactgtaatgaagcattcattaagataaagatggctttt
cggccaggtgattttggaatggatgatcgtgagataatgagagaaggcagtgcttttgag
gatgacgacatgttagtaagcactactacttctaacctcctattagagtctgaacagagc
accagcaatctgaatgagaaaattaaccatttagaatatgaagatcaatataaggatgat
aattttggagaaggaaatgatggtggaatattagatgacaaacttattagtaataatgat
ggcggtatctttgatgatccccctgccctctctgaggcaggggtgatgttgccagagcag
cctgcacatgacgatatggatgaggatgataatgtatcaatgggtgggcctgatagtcct
gattcagtggatcccgttgaaccaatgccaaccatgactgatcaaacaacacttgttcca
aatgaggaagaagcatttgcattggagcctattgatataactgttaaagaaacaaaagcc
aagaggaagaggaagctaattgttgacagtgtcaaagagttggatagcaagacaattaga
gcccaacttagtgattattcagatattgttactactttggatctggcaccgcccaccaag
aaattgatgatgtggaaagagacaggaggagtagaaaaactgttttctttacctgctcag
cctttgtggaataacagactactgaagctctttacacgctgtcttacaccgcttgtacca
gaagaccttagaaaaaggaggaaaggaggagaggcagataatttggatgaattcctcaaa
gaatttgaaaatccagaggttcctagagaggaccagcaacagcagcatcagcagcgtgat
gttatcgatgagcccattattgaagagccaagccgcctccaggagtcagtgatggaggcc
agcagaacaaacatagatgagtcagctatgcctccaccaccacctcagggagttaagcga
aaagctggacaaattgacccagagcctgtgatgcctcctcagcaggtagagcagatggaa
ataccacctgtagagcttcccccagaagaacctccaaatatctgtcagctaataccagag
ttagaacttctgccagaaaaagagaaggagaaagagaaggaaaaagaagatgatgaagag
gaagaggatgaagatgcatcagggggcgatcaagatcaggaagaaagaagatggaacaaa
aggactcagcagatgcttcatggtcttcagcgtgctcttgctaaaactggagctgaatct
atcagtttgcttgagttatgtcgaaatacgaacagaaaacaagctgccgcaaagttctac
agcttcttggttcttaaaaagcagcaagctattgagctgacacaggaagaaccgtacagt
gacatcatcgcaacacctggaccaaggttccatattatataa
>gi568815590f:116838244_117042698|GENSCAN_predicted_peptide_3|106_aa
MAEGDGEAGTSYVAGAGGREQRLGILTTILNANFLKSPLKTNQKNIKYLANYNRNSLDKA
LSCGQYGHLANGLQPVRRTLPQPKNSGPTVSGAQDCPLSYPSPISA
>gi568815590f:116838244_117042698|GENSCAN_predicted_CDS_3|321_bp
atggcggaaggtgatggggaagcaggcacatcttacgtggctggagcaggaggaagagag
caaaggcttgggattttaacaactatccttaatgccaatttcttaaagtctccattaaaa
acaaaccaaaagaacataaaatatttggctaactacaacagaaattcacttgataaagct
ttgagctgtggccagtatggccacttagctaatggactccagccagtgagacgaaccttg
ccccaacccaaaaacagtggacccacagtgagtggagcacaagactgtccactttcttat
ccatctcctatatcagcataa
>gi568815590f:116838244_117042698|GENSCAN_predicted_peptide_4|392_aa
MWKRLRNWVTSRGWNSLEGSEKDRKMWESLELPRDLLNGFDENADSDTKNKVQAEVVSDG
DEELVGNWSQGDSCYVLAKRLAGFCPCPKDLWNFELEKDDLGYLAEKMSKQQSIQDVTWA
LLKAFSFERETEHRSLENFQPDNVIEKKNPFSGEKFKPATEICISSKEPNANPQDHGENV
SRSCHRPSWQPLPSQAQKPRRKKCFCGLGPGLPSCVQPRDLVPCVSAALVMAERGQCTAW
AVGSEGGSPKPWQPPHDVEPVGAEKSRIEVWEPLPRFQKMYGNVWMPRQKFAAGAGPSWR
TSARSVWKGNVGLGSSHRVPTGALTSGAVGRGPPSSRSQNGRSTDSMHHAPGKATDTRRH
PMKAARREAVPCKATGVELPKTTGTHFLHQLD
>gi568815590f:116838244_117042698|GENSCAN_predicted_CDS_4|1179_bp
atgtggaagcgacttaggaactgggtaacaagcagaggttggaacagtttggagggctca
gaaaaagacaggaaaatgtgggaaagtttggaacttcctagagacttgctgaatggtttt
gatgaaaatgctgatagtgacacgaaaaataaggtccaggctgaggttgtctcagatgga
gatgaggaacttgttgggaactggagccaaggtgactcctgttatgttttagcaaagaga
ctggcagggttttgcccctgccctaaagatttgtggaactttgaacttgagaaagatgat
ttagggtatctggcggaaaaaatgtctaagcagcaaagcattcaagatgtgacttgggcg
ctgttaaaagcattcagttttgaaagggaaacagagcatagaagtttggaaaatttccag
cctgacaatgtgatagaaaagaaaaacccattttctggagagaaattcaagccagctaca
gaaatttgcataagtagcaaggagcctaatgctaatccccaagaccatggggaaaacgtc
tccaggtcatgtcacagaccttcatggcagcctctcccatcacaggcccagaagcctagg
agaaagaagtgtttttgtgggctgggcccagggctgccaagctgtgtgcagcctagagac
ttggtgccctgtgtctccgctgctctagtcatggctgaaagaggccaatgtacagcttgg
gctgtgggttcagagggtggaagtcccaagccttggcagcctccacatgatgttgagcct
gtgggtgcagagaagtcaagaattgaggtttgggaacctctgcctagatttcagaagatg
tatggaaatgtctggatgcccaggcaaaagtttgctgcaggggcagggccctcatggaga
acctctgctaggtcagtgtggaagggaaatgtggggttggggtcctcacacagagtccct
actggggcactgactagtggagctgtgggaagagggccaccatcctccaggtcccagaat
ggtagatccactgacagcatgcaccatgcacctggaaaagccacagacactcgacgccat
cccatgaaagcagccagaagggaggctgtaccctgcaaagccacaggggtggagctgccc
aagaccacgggaacccacttcttgcatcagcttgactag
>gi568815590f:116838244_117042698|GENSCAN_predicted_peptide_5|158_aa
MTRMGVVAHACNHSTLGGREPLAEGVAIVSAGSADSVFPPAGSEESGQSGRVGFPPAQHT
RLSKGQSECFLKRILDPMPPDWMRSPSRGRQTLYRGAFPLAQVGASLGGSSQRKEQAAIS
AVLQPPLISPGKGGTQVNRAWSGPPANHSSPTEEGPDC
>gi568815590f:116838244_117042698|GENSCAN_predicted_CDS_5|477_bp
atgaccaggatgggcgtggtggctcatgcctgtaatcacagcactttgggaggccgagag
ccccttgcagaaggagtggccatagtctccgcaggatcagctgactcagtctttccccct
gctggctctgaggaatccgggcagtccggaagagtgggattcccaccagcacagcacacc
cgcttgtccaaggggcagtcagagtgcttccttaaacggatcctggatcccatgcctcct
gactggatgagatcccccagtagaggtcgccagacactttataggggagcgttcccactg
gctcaggtaggtgcttctctgggtgggagctcccagaggaaggagcaggcagccatctct
gctgttctgcagcctccactgatatctccagggaagggaggcacccaggtgaatagggcc
tggagtggacccccagcaaaccacagcagccctacggaagaggggcctgactgttaa
>gi568815590f:116838244_117042698|GENSCAN_predicted_peptide_6|117_aa
MGPGDFRRCRERISQGLQGLPGRAELWFPPRPACDFFGDGRSTDIQEEALAASPLLEDLR
RRLTRAFQWAVEMHFQNHQLARTLLDLNMKVQQLKKEYELEITSDSQSPKDDAANPE
>gi568815590f:116838244_117042698|GENSCAN_predicted_CDS_6|354_bp
atgggccccggggacttccgccgctgcagagagagaatttcccaggggctccagggactc
ccaggtagagcggagctttggttcccacctcgtcccgcgtgcgacttcttcggggacggc
aggagcacggacatccaggaggaggccctcgccgccagcccgctgctggaggacctcaga
cgacggctgacgcgcgccttccagtgggcggtggaaatgcatttccaaaaccaccagctg
gctagaactttactggacctaaacatgaaagtgcagcaattgaaaaaggagtatgaactg
gaaattacatcagactcccaaagcccaaaagatgatgctgcgaatccggaataa
>gi568815590f:116838244_117042698|GENSCAN_predicted_peptide_7|179_aa
MLAALAALARSQRLLGLGVRSGHAREALQPDTALWGPLSGLANAGASSLCLWGGRGEALS
PPICINVRIRKQMRGTQPPEKQCLTQATVIGVHPSESYCPALSSDGTRGCQTRGNRGCGN
GSSSNSESNQERRQLALEGIVPGSRHRGILAVITAVNRADGCAGTLGPARKGAGEMRFL
>gi568815590f:116838244_117042698|GENSCAN_predicted_CDS_7|540_bp
atgctggcagccctcgcagcccttgctcgctctcagcgcctcctcggcctcggcgtccgc
tctgggcatgctcgagaagcccttcagcccgacactgcgctctgggggcccctctctggg
ctggccaatgcgggagccagctccctctgcttgtggggaggaagaggtgaagccctctca
cccccaatatgcatcaatgtcagaatcaggaagcagatgagaggtacacagccccctgag
aagcagtgcttgacacaggctactgtaataggtgtccacccatctgaatcctactgccca
gcactcagctcagatggaacaagaggctgccagacgagaggaaacagaggctgcggcaat
ggcagcagcagcaacagtgaaagcaaccaagagagaaggcagctggcactggagggcatc
gtgccaggttccaggcacagagggattctggcagtgattacagcggtaaatagagctgat
ggttgtgccgggaccctaggccctgccaggaagggagcaggggagatgagattcttataa
>gi568815590f:116838244_117042698|GENSCAN_predicted_peptide_8|206_aa
MSELPFTITSKRIKYLGIQLTRDVKDLFRENYKPLLNEIKEDTNKWKNIPCSWVGRINIV
KIAILPKVIYRFSAIPIKLPMTSFTESEETTLKFIWNQKRAHIAKTILSQKNKAGCIKLP
DFKLYYKATVTKTACQLIPLSTVKLRPSSFRLLASNPVPLAWTTPFPLMVPEFSNLQHKN
TEPTQRIVAMSSKEDIEIKVSECHGG
>gi568815590f:116838244_117042698|GENSCAN_predicted_CDS_8|621_bp
atgagtgaactcccattcacaattacttcaaagagaataaaatacctaggaatccaactt
acaagggatgtgaaggacctcttcagggagaactacaaaccgctgctcaacgaaataaaa
gaggacacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatattgtg
aaaatagccatactgcccaaggtgatttatagattcagtgccatccccatcaagctacca
atgacttccttcacagaatcggaagaaactactttaaagttcatatggaaccaaaaaaga
gcccacattgccaagacaatcctaagccaaaagaacaaagctggatgcatcaagctacct
gacttcaaactatattacaaggctacagtaaccaaaacagcatgtcaactgataccgtta
agtaccgtgaagctcaggccttcttctttccgtctgctggctagcaatccagtccccctt
gcatggacaactccttttcctctcatggtaccagagttttccaatttgcagcacaagaat
acagagcctacgcagagaatcgtggccatgtcgagtaaagaagatatagagattaaagtt
tcagagtgccatggtggctga
>gi568815590f:116838244_117042698|GENSCAN_predicted_peptide_9|203_aa
MGEWAQIPMTELSGPGRGQTFKSCVKIFSQSTLTVGPDASSPSILAPYSLKMTTIYIPGL
GGVKLWDEDGNPSYGSLKGPKNYLGGTEGDGAFVRSMELHFRSYEGKAEGRCRGEKRREN
CLSLHSKEQSVPLVNLVDCNPLADFGHADARRGLPQPWAAPSRSDIECLKLFQVQGASCQ
WIYHSGMWRTVALFSQFHEAVPQ
>gi568815590f:116838244_117042698|GENSCAN_predicted_CDS_9|612_bp
atgggggagtgggcgcaaattcctatgaccgagttgtctggacctgggcgagggcaaaca
tttaaatcttgtgtgaagatttttagtcagtccaccctcactgtgggccctgacgcttcc
agtccaagtatattagcaccttacagcttaaaaatgacaaccatatatatacctggactt
ggtggtgtcaagctttgggatgaagatggaaatccttcttatgggagtctaaaggggcca
aagaattatctaggagggacagagggtgatggtgcatttgttcgtagcatggagctgcac
tttaggagttatgaaggcaaggcagaagggagatgcaggggggagaaaaggagagagaac
tgcctgtcccttcacagcaaagaacaaagtgttcctttagtgaatcttgttgattgcaat
cccctggctgactttgggcatgctgatgcaaggcgtgggctcccacagccttgggcagct
ccttcacggtctgacattgagtgcctgaagcttttccaggtgcagggtgcaagctgtcag
tggatctaccattctgggatgtggaggacagtggccctcttctcacagttccacgaagca
gtgccccagtga