GENSCAN 1.0 Date run: 4-Nov-116 Time: 20:47:17 Sequence gi568815589r:23592560_23862234 : 269675 bp : 38.70% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 3064 3059 6 1.05 1.02 Term - 10886 10818 69 2 0 54 34 146 0.527 2.76 1.01 Init - 24344 24288 57 2 0 82 88 40 0.251 4.76 1.00 Prom - 32782 32743 40 -3.65 2.00 Prom + 34417 34456 40 -3.65 2.01 Init + 39401 39406 6 1 0 59 110 0 0.482 0.33 2.02 Intr + 39490 39609 120 0 0 58 40 113 0.245 3.27 2.03 Intr + 40751 40842 92 1 2 28 71 78 0.058 -2.03 2.04 Intr + 48843 48950 108 0 0 14 68 111 0.087 0.18 2.05 Intr + 51468 51645 178 0 1 42 115 99 0.236 7.00 2.06 Intr + 54139 54215 77 0 2 43 96 94 0.831 3.09 2.07 Term + 56241 56370 130 0 1 23 44 142 0.778 -0.03 2.08 PlyA + 56865 56870 6 1.05 3.07 PlyA - 58588 58583 6 1.05 3.06 Term - 58896 58801 96 0 0 110 43 114 0.628 6.09 3.05 Intr - 71107 70951 157 0 1 41 75 140 0.127 7.09 3.04 Intr - 73797 73706 92 0 2 44 94 34 0.103 -2.53 3.03 Intr - 76829 76725 105 2 0 27 82 103 0.121 3.09 3.02 Intr - 78866 78841 26 0 2 86 86 31 0.429 -0.47 3.01 Init - 79841 79730 112 2 1 83 83 73 0.351 6.72 3.00 Prom - 82274 82235 40 -3.45 4.00 Prom + 82355 82394 40 -5.85 4.01 Init + 89338 89648 311 0 2 43 -34 291 0.689 9.23 4.02 Term + 89714 90239 526 2 1 36 38 457 0.884 28.25 4.03 PlyA + 90902 90907 6 1.05 5.08 PlyA - 92388 92383 6 1.05 5.07 Term - 100325 99998 328 1 1 73 39 366 0.896 23.40 5.06 Intr - 109045 108820 226 2 1 52 97 237 0.953 17.12 5.05 Intr - 112512 112359 154 0 1 103 119 154 0.984 18.72 5.04 Intr - 128996 128921 76 1 1 69 99 21 0.006 -0.10 5.03 Intr - 136474 136374 101 1 2 90 57 58 0.004 0.89 5.02 Intr - 138566 138463 104 0 2 93 53 111 0.042 7.07 5.01 Init - 144321 144267 55 0 1 64 29 52 0.030 -1.60 5.00 Prom - 145815 145776 40 -6.85 6.00 Prom + 148199 148238 40 -5.15 6.01 Init + 148628 148691 64 1 1 94 83 56 0.979 7.06 6.02 Term + 150354 150586 233 1 2 40 41 161 0.739 2.25 6.03 PlyA + 151890 151895 6 1.05 7.06 PlyA - 151928 151923 6 1.05 7.05 Term - 161264 161118 147 2 0 67 55 94 0.636 0.92 7.04 Intr - 164495 164299 197 0 2 42 73 95 0.672 1.61 7.03 Intr - 169687 169447 241 1 1 99 115 219 0.423 21.90 7.02 Intr - 193163 193056 108 2 0 61 51 94 0.001 2.56 7.01 Init - 201428 201348 81 2 0 40 86 66 0.028 2.62 7.00 Prom - 206340 206301 40 -3.25 8.00 Prom + 227391 227430 40 -4.95 8.01 Init + 228990 229141 152 2 2 83 -15 188 0.632 5.61 8.02 Intr + 229292 229597 306 2 0 40 14 370 0.501 19.44 8.03 Intr + 232384 232532 149 1 2 16 49 116 0.067 -0.54 8.04 Term + 238879 239009 131 2 2 80 54 110 0.665 4.16 8.05 PlyA + 239389 239394 6 1.05 9.03 PlyA - 240188 240183 6 1.05 9.02 Term - 259047 258519 529 2 1 50 39 258 0.591 9.84 9.01 Init - 267843 267773 71 0 2 75 105 1 0.501 1.07 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 191113 191193 81 0 0 87 119 25 0.808 6.52 S.002 Term + 192575 192799 225 1 0 36 48 167 0.826 3.30 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:23592560_23862234|GENSCAN_predicted_peptide_1|41_aa MTNCKFQVANMMTLNSEREKPLATRGDRARNVARVMEELKL >gi568815589r:23592560_23862234|GENSCAN_predicted_CDS_1|126_bp atgaccaattgcaagtttcaggttgccaacatgatgacactgaattcagaaagagagaag ccactggccacacgtggcgaccgagctcgaaatgtggccagagtgatggaagagctgaag ttgtaa >gi568815589r:23592560_23862234|GENSCAN_predicted_peptide_2|236_aa MQVKTRYLLCEVAAEEILAPTEANEKPNEGMKTQNHHCITLKAEGKQERSIFDSKQRNYI AISVFTIRYKNYSISQQTTGEPICETVTRDGGGHLTESTDLLTQEALLSTTQASRGTVAM DPKRQYSLGTVVVLGYPEMTKQAVNLNGGKRSRSAHEVGEQNDDRGHMYKNQKFNDITEV TKSQIRQTTDTKTREAEGNVVTQEKACGHRDWSDAATSQELGQPPEAGRGKEQILP >gi568815589r:23592560_23862234|GENSCAN_predicted_CDS_2|711_bp atgcaggtgaagactcgatacctcctttgtgaggtggcagctgaggagattctggcaccc actgaggccaatgaaaagcccaacgaaggaatgaagactcagaaccaccactgcattact ttgaaggcagaaggaaaacaagagcggagcatctttgactcgaaacaacgaaactacatt gcaatatccgttttcaccatccgatacaaaaattacagcatttcccagcagacaacaggg gagccaatttgtgaaacggtaaccagagatggtggaggtcacctcacagagagcactgac ctcttaacccaagaagctctgctcagtaccactcaagcctctagggggacagtggctatg gatcccaaaagacagtactctcttggcacagtggtagtactggggtaccctgagatgact aaacaggcagtgaatttaaatggtggcaagagaagtaggagtgcacatgaagtgggagag caaaatgatgatagaggccatatgtataaaaaccaaaagtttaatgacattactgaagtc accaaatctcagattcgacagaccaccgacactaaaacaagggaagcagaaggcaacgtg gtgacacaagagaaggcctgtggtcacagagactggagtgatgcagccacaagccaagaa cttgggcagccaccagaagctggaagagggaaggaacagattctcccttag >gi568815589r:23592560_23862234|GENSCAN_predicted_peptide_3|195_aa MRESELRCNLNPWNLEKEASAERSGVDLDLLSGSCHGDEDSEIIQVCSVSIRSCSDLAVT LSTQDDTKRSHADAGLPVEHQKTTWLSITKHPDFSLILRCQDPSENFVEVPYLFRKVWPC EENYSQRESAQMSVIVKSRIFYERCWRDFTTDSPVHLCEQKVGQGPQSSDTLRLLRELIP VFGNAPALSVLQEST >gi568815589r:23592560_23862234|GENSCAN_predicted_CDS_3|588_bp atgcgtgaaagtgagctcagatgtaacctaaacccatggaacttggaaaaagaagcatct gccgagaggagcggggttgatcttgatttgctttcagggagttgccacggagatgaggac tctgagattatccaggtatgttcagtctctattcgcagttgtagtgatttagcagtaacg ttatcaacgcaagatgacaccaaaagatctcatgctgatgctggactcccagtggaacac cagaagaccacctggctcagtatcacaaagcaccctgatttctccctcatcttgagatgt caggatccctcagaaaacttcgtagaggtgccctatttgttccgcaaggtgtggccttgt gaagagaattatagtcaacgggaaagtgcgcagatgtcagtcattgttaaatcaaggatc ttttacgagagatgttggcgagatttcaccactgatagtcctgttcacctgtgtgaacag aaagttgggcagggcccacaatcatcggacaccttgcggcttttgcgtgagcttattcct gtgtttggaaacgcccctgccttgtctgtacttcaagaatcaacttaa >gi568815589r:23592560_23862234|GENSCAN_predicted_peptide_4|278_aa MTKGNTPKYGLVFHSTFTGQAAARNKGRISQYLANKCSIASQIDCFAEVLMSIFGEKLQE QVEERLSFYETGEILSKKMDVMKTEMVQAEKTAAEIIRKLEKQETPEECAKTSVKPKKKK KQKPQEAPQENGIKDPSNSFSKPNKRKSFSKEKLVSSHREETAGSISLPNRKTSSPKEEA VNDPKEAGNRSVTKKKTGNSPSKRKELVSSGPEEAAGSKSSSKKKKKLHKVPQEDESANG RPPSGGAHPTPRRHFPFRVLHPNKKFSHTKHIVNPKAV >gi568815589r:23592560_23862234|GENSCAN_predicted_CDS_4|837_bp atgacaaagggtaatacaccaaaatatggactcgttttccactccaccttcactggccaa gcagctgccaggaataaaggtcgcatctcccaatacctggcaaacaaatgcagtattgcc tcgcaaatcgactgcttcgctgaggtgctcatgagcatatttggggagaagcttcaagaa caagttgaagagagactgtctttctatgagactggagagatcctatcaaagaaaatggat gtcatgaaaacagaaatggttcaggcagagaaaacggctgctgagattattaggaagctg gagaaacaggaaactccagaggagtgtgcaaagacaagtgtaaaacccaaaaagaagaaa aagcaaaagccccaggaggctcctcaggagaatggaatcaaagacccatctaactctttc tccaaacctaataaaaggaaatctttttccaaggagaaattggttagcagccatcgagaa gagaccgctggcagcatcagtcttcccaacaggaagacgtcttcacccaaggaggaagca gttaatgatcccaaagaggcaggcaacaggagtgtcactaagaaaaaaacaggaaattcc ccttcaaagaggaaggagctggtcagcagtggacctgaagaggctgctggcagcaagagc agctccaagaagaagaaaaagttacataaagtaccccaggaagatgagagtgcaaacgga cgtcccccgtcaggcggggcacatcctaccccaaggcgacatttcccattccgtgtccta caccccaataaaaaattctcacacacgaagcatattgtgaatcctaaggcagtatga >gi568815589r:23592560_23862234|GENSCAN_predicted_peptide_5|347_aa MSESRSEQKTDSDWKPEYGQSLGYGFVNYIDPKDAEKAINTLNGLRLQTKTIKGPAAWAE YVEESTVKLCGFYSVTSVSDFLRCGSGTLLNAAIGLGGVSLGFVFCSWFLDQVSYARPSS ASIRDANLYVSGLPKTMTQKELEQLFSQYGRIITSRILVDQVTGISRGVGFIRFDKRIEA EEAIKGLNGQKPPGATEPITVKFANNPSQKTNQAILSQLYQSPNRRYPGPLAQQAQRFRF SPMTIDGMTSLAGINIPGHPGTGWCIFVYNLAPDADESILWQMFGPFGAVTNVKVIRDFN TNKCKGFGFVTMTNYDEAAMAIASLNGYRLGDRVLQVSFKTNKTHKA >gi568815589r:23592560_23862234|GENSCAN_predicted_CDS_5|1044_bp atgagcgaaagtagaagtgagcaaaaaactgacagtgactggaaaccagagtatgggcag agcttgggatatggctttgtgaactacattgaccccaaggatgcagagaaagctatcaac accctgaatggattgagacttcaaaccaaaacaataaaaggcccagctgcttgggctgaa tatgtggaagaaagcacagtaaagctttgtggcttttatagtgtgacatcagtttctgat tttctgagatgtgggtcaggaactcttttgaatgctgcaataggacttggaggtgtaagt cttggatttgtgttctgttcttggtttcttgatcaggtttcctatgctcgcccaagttca gcttctatcagagatgcaaatttatatgtcagcggacttccaaaaacaatgacccagaag gagttggaacagcttttttcacaatatggacgcattattacttctcgtattcttgtcgac caggtcactggcatatcaaggggtgtagggtttattcgatttgacaagcgaattgaggca gaagaagctatcaaaggcctaaatggccagaaacctcccggtgccacggagccaatcact gtaaagtttgctaataacccaagccaaaaaaccaatcaggccatcctttcccagctgtac cagtctccaaacagaaggtatccaggaccgctagctcagcaggcacagcgttttaggttt tctccaatgaccattgacggaatgaccagtttggctggaattaatatccctgggcaccct ggaacagggtggtgtatatttgtgtacaacctggctcctgacgcagatgagagtatcctg tggcaaatgtttgggccttttggagctgtcaccaatgtgaaggtcatccgtgactttaac accaataaatgcaaaggttttggatttgtgactatgacaaactatgatgaggctgccatg gcgatagctagcctcaatggataccgtctgggagacagagtactgcaggtctcctttaag acaaacaaaacgcacaaagcctaa >gi568815589r:23592560_23862234|GENSCAN_predicted_peptide_6|98_aa MEKNKFVKKYKRDKPKAYYAERKIPKQTDGVWAGGGINAEYKGNLFKYYRHSFTNLKNTL SEAIKKRNPNGTQSPEEIELYMVKVVKAMGQMGTMATN >gi568815589r:23592560_23862234|GENSCAN_predicted_CDS_6|297_bp atggagaagaacaaatttgtaaagaagtataaaagagacaagccaaaagcatactacgct gaacgaaaaattccaaaacagactgatggtgtttgggctggtgggggtataaatgctgag tataaaggaaacctatttaaatactacagacacagcttcacaaacctcaagaacactctc tctgaagcaataaagaaaaggaatcctaatggaacacagtccccagaggaaattgagctg tacatggttaaagttgtaaaggctatgggtcagatgggaaccatggctacgaattaa >gi568815589r:23592560_23862234|GENSCAN_predicted_peptide_7|257_aa MGPYKWLYDIVHAKLPGKSSWRTVFKGIDADLEQLLIDRIGLDFYVVTDSYSYWEFWVNR RETVIAAMETQLSNGPTCNNTANGPTTINNNCSSPVDSGNTEDSKTNLIVNYLPQNMTQE ELKSLFGSIGEIESCKLVRDKITGSFGICCQRASQVILKPKKVLGIRLCLMKYGHRIPAL ESLVMSARNARSWALSRPTESESVVGTWKHKSARVLFAIPDLFWGVVFVYFASGDWWYRR DTILNLSACRRNDGKCR >gi568815589r:23592560_23862234|GENSCAN_predicted_CDS_7|774_bp atggggccttacaaatggctgtatgatatagtgcatgctaagttaccaggaaagagctct tggaggactgtttttaaagggattgatgcagacctggagcagctgctcattgacagaata ggtctagatttctatgtcgtgactgattcgtactcatactgggagttttgggttaacaga cgtgagacagtaattgctgccatggaaacacaactgtctaatgggccaacttgcaataac acagccaatggtccaaccaccataaacaacaactgttcgtcaccagttgactctgggaac acagaagacagcaagaccaacttaatagtcaactaccttcctcagaacatgacacaggag gaactaaagagtctctttgggagcattggtgaaatagagtcctgtaagcttgtaagagac aaaataacagggagttttggaatctgttgtcagagagcttcccaggtgatcttgaagccc aaaaaagttttgggaatcagactgtgtctcatgaagtatggtcacaggattcccgcattg gaatcacttgtaatgagtgctagaaatgcacggtcctgggctctgtccaggcctactgaa tcagaatctgtagtggggacctggaagcataagagtgcacgtgtgctgtttgctattcca gaccttttctggggggtagtctttgtttactttgcatcaggagattggtggtatagaaga gacaccattctcaatttaagcgcttgtcgtaggaatgatggaaaatgtcgatga >gi568815589r:23592560_23862234|GENSCAN_predicted_peptide_8|245_aa MDQGQGADAVPAPTPPALACCDSKLQLLRLVPLAVSTAAEESLGASSAERRGAGRRGLLG RAPGGGAGGEFAAPRPPPARALPDRPSGPCRQGDAGLQQLSSVPPDFRVSPGFPDKVSLD SVESVERPEEADSRCKFANPEARGANGESEQLTRSQPRRTKDRAWNPLPSAQHSEAPKQP ALHRHSSKDLIYAFLNKGNLISEQVPAPGHGDKGGVCIRRRLRPGETQRLEKQLSYQPAA EMEMA >gi568815589r:23592560_23862234|GENSCAN_predicted_CDS_8|738_bp atggaccaggggcaaggcgcggacgccgtccccgcccccacgcccccagcacttgcctgc tgtgactcgaagctgcagctactacggctcgtgccgctggccgtcagcactgccgcggag gagtcgctcggggcgagctccgcggagaggcgcggggcgggaaggaggggcttgcttgga agagcgccaggaggcggggccggcggcgagttcgcggcgccgcggccgccgccggcgcgt gccctgcccgaccgcccttcgggtccgtgccggcagggagatgccgggcttcagcagctc agctccgtccctccggacttccgagtctcaccaggcttccccgacaaggtttctctggac tcggtggagagcgttgagcgcccagaggaggctgattccaggtgcaagtttgcaaatccc gaggcccggggcgctaacggggagtcagagcagctaacgcgaagtcagccgcgtaggacc aaggacagggcttggaatccacttcccagcgcccagcacagcgaggctcccaagcagccg gcacttcatagacacagctccaaggatcttatttacgcttttctgaacaaagggaatctc atatccgagcaagtgccggcaccgggccacggtgacaaaggcggagtttgtatcagaaga agactgaggcccggggagacacagcggctagagaaacagctgtcttatcagcccgctgca gagatggagatggcttga >gi568815589r:23592560_23862234|GENSCAN_predicted_peptide_9|199_aa MQEGKRFIIQRIEHSREGFNLFSCLASTGKTQSGVRSEEDEGSGEKGKRERRGLVPNVVT SLRTCGADALKRDGGVGRRGVPNGQGQEPSPDLGVSDPDTRSSILAALPRGSDFLSVRLQ SSSPTGCALPRCRSYSCSLNPSQGLSGWFYFALLAVQPAPDTVPGTGRSAAHVEAASRAS GAALSRLRVLTGSCKLLAS >gi568815589r:23592560_23862234|GENSCAN_predicted_CDS_9|600_bp atgcaagagggcaagaggttcattatccaaaggatagaacattccagggaaggtttcaat cttttttcctgcctggccagcacaggcaagactcagagtggagtgagaagtgaagaggat gaagggtcgggagagaaagggaagagagagaggcgggggctagtccccaacgtggtgact tcgctgcgcacctgtggggccgacgctctgaagcgggacgggggtgttggtcggaggggg gtacccaatggccaggggcaggagccctcgcccgacttgggagtctcggaccctgacact cgctccagcatcctggcagccctcccgagagggtcggactttctttcggtcaggttgcag agttcttccccgactggctgcgcgctgcctcggtgcagaagttactcctgcagtctcaac cccagccagggcctctcgggctggttttattttgcccttctcgccgtgcagcccgccccc gacaccgtcccgggcactgggaggagcgcggctcacgtggaggcggcgtcccgcgcctcc ggagcagcgctgagtaggctgcgcgtgttgaccggcagttgcaaactattggccagttaa