GENSCAN 1.0 Date run: 8-Nov-116 Time: 01:54:15 Sequence gi568815592f:42464320_42717507 : 253188 bp : 40.94% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 398 393 6 1.05 1.03 Term - 2696 2602 95 0 2 85 52 69 0.246 0.01 1.02 Intr - 19508 19431 78 0 0 102 95 49 0.815 5.60 1.01 Init - 19761 19656 106 0 1 68 64 73 0.594 3.33 1.00 Prom - 26745 26706 40 -4.65 2.00 Prom + 27806 27845 40 -5.75 2.01 Init + 34717 34726 10 0 1 73 93 2 0.454 0.01 2.02 Term + 35359 35699 341 2 2 27 40 275 0.946 10.31 2.03 PlyA + 36746 36751 6 1.05 3.00 Prom + 43246 43285 40 -6.75 3.01 Init + 46659 46689 31 2 1 85 107 30 0.545 4.55 3.02 Term + 50609 50724 116 0 2 23 45 184 0.958 5.35 3.03 PlyA + 50946 50951 6 1.05 4.00 Prom + 76740 76779 40 -3.05 4.01 Sngl + 83136 83702 567 2 0 50 34 384 0.490 25.10 4.02 PlyA + 84410 84415 6 1.05 5.00 Prom + 87349 87388 40 -4.55 5.01 Init + 100001 100078 78 1 0 92 84 166 0.863 17.71 5.02 Intr + 100120 100212 93 0 0 76 42 92 0.733 2.64 5.03 Intr + 109415 109674 260 1 2 94 98 128 0.635 9.74 5.04 Intr + 127832 127910 79 2 1 98 44 49 0.110 0.13 5.05 Intr + 129872 129985 114 1 0 36 83 109 0.213 4.92 5.06 Intr + 139269 139399 131 2 2 116 78 101 0.996 10.47 5.07 Intr + 141402 141540 139 0 1 43 95 134 0.993 9.15 5.08 Intr + 142270 142332 63 0 0 67 103 36 0.659 1.00 5.09 Intr + 152921 153019 99 1 0 56 105 55 0.910 3.39 5.10 Intr + 153090 153188 99 2 0 85 89 121 0.980 11.29 5.11 Intr + 168233 168396 164 1 2 74 108 112 0.913 9.65 5.12 Intr + 168486 168585 100 0 1 104 91 22 0.985 3.29 5.13 Intr + 171099 171227 129 2 0 51 76 96 0.593 4.67 5.14 Intr + 178097 178162 66 1 0 74 95 32 0.106 0.68 5.15 Intr + 179964 180017 54 2 0 46 116 32 0.032 0.06 5.16 Intr + 180154 180217 64 0 1 98 111 30 0.997 3.67 5.17 Intr + 181147 181271 125 2 2 70 95 79 0.973 6.18 5.18 Intr + 183799 183851 53 0 2 59 89 44 0.549 -1.61 5.19 Intr + 185965 186067 103 1 1 106 86 60 0.532 6.86 5.20 Intr + 193921 194001 81 0 0 80 60 106 0.931 5.92 5.21 Intr + 194327 194505 179 1 2 -36 66 215 0.957 4.70 5.22 Intr + 198939 199100 162 0 0 53 108 103 0.472 6.97 5.23 Intr + 201090 201193 104 0 2 83 94 23 0.363 1.30 5.24 Intr + 201848 201926 79 0 1 45 78 97 0.623 2.09 5.25 Intr + 205773 205921 149 0 2 73 82 88 0.958 5.66 5.26 Intr + 206341 206396 56 2 2 66 119 41 0.689 2.78 5.27 Intr + 211737 211872 136 2 1 83 93 24 0.897 1.62 5.28 Intr + 212464 212554 91 2 1 48 113 104 0.977 7.03 5.29 Intr + 220475 220552 78 2 0 106 99 14 0.753 2.05 5.30 Intr + 223897 224067 171 1 0 27 94 88 0.660 1.44 5.31 Intr + 225250 225351 102 1 0 84 110 27 0.868 2.87 5.32 Term + 226713 226854 142 0 1 116 32 103 0.921 3.92 5.33 PlyA + 226940 226945 6 1.05 6.07 PlyA - 226954 226949 6 1.05 6.06 Term - 233224 232760 465 1 0 5 47 227 0.535 4.33 6.05 Intr - 234188 234024 165 2 0 100 44 216 0.890 17.64 6.04 Intr - 240292 240046 247 0 1 51 80 267 0.510 18.74 6.03 Intr - 240858 240702 157 1 1 64 67 67 0.151 0.35 6.02 Intr - 244097 243842 256 2 1 75 40 189 0.318 8.89 6.01 Init - 249124 248951 174 1 0 36 64 131 0.554 3.09 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 32978 32637 342 2 0 63 36 195 0.852 7.58 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:42464320_42717507|GENSCAN_predicted_peptide_1|92_aa MDERRSGFDYIKQQWLKHKRLIHLQKRSLEEIVRECSRSVKAGEDKMSGLPEESAPLKQS SYTATIRFLNLFPTLPPFLFHKTAIAIMTRSQ >gi568815592f:42464320_42717507|GENSCAN_predicted_CDS_1|279_bp atggacgagagaagaagtgggtttgactatattaaacaacaatggcttaaacataaaagg cttattcatttacagaaaagaagtctagaggagatagtccgggaatgcagcaggagtgtg aaagcaggagaggacaaaatgtccggccttcctgaagaatctgctccccttaagcaatct tcctacacagcaaccatccgatttctcaatcttttccccacccttcccccttttctattc cacaaaaccgccattgccatcatgacccgttctcaatga >gi568815592f:42464320_42717507|GENSCAN_predicted_peptide_2|116_aa MVEAFCANNAPANIVSVPKTQWIFCKKCGKHQPHKVTQYKKGKNCLYAQGKQPYDGKHSG YDGQTKRIFQKKAKTTKKTVLRLERVEPNCISKRMLAIKICEHFELGGDKKKRGAK >gi568815592f:42464320_42717507|GENSCAN_predicted_CDS_2|351_bp atggtggaagctttctgtgccaataatgctcctgcaaacattgtgagtgttcctaaaact cagtggattttctgtaagaagtgtggcaagcaccaaccccacaaagtgacacagtacaag aagggcaagaattgtctatatgcccagggaaaacagccttatgatgggaagcatagtggc tatgatggacagactaagcggattttccaaaaaaaggctaaaactacaaagaagacagtg ctaaggcttgagcgcgttgagcccaactgcatatctaagagaatgctggctattaagata tgcgagcattttgaactgggaggagataagaagaaaaggggggccaagtga >gi568815592f:42464320_42717507|GENSCAN_predicted_peptide_3|48_aa MDGGRPPDLGGSKEDEEEPANETETEHPEIQEENQVTMVSSKPDEGSY >gi568815592f:42464320_42717507|GENSCAN_predicted_CDS_3|147_bp atggatggaggaagacctcctgatttgggaggaagtaaggaagatgaagaggaaccagca aatgagactgaaacagaacacccagaaatacaggaagaaaaccaggtgacaatggtgtct tcaaagccagatgaaggaagttattaa >gi568815592f:42464320_42717507|GENSCAN_predicted_peptide_4|188_aa MKCVSLTADDRSLLGIPEGGEGLSLWLLVLRMLPLNVSYQLLMEAKTRDIKYFGVPSCIT GIGHVQTKASSSAEEELQRSKSSAPRRLVPPPALRRPAWARAPPAGSRHRCRRSTVDSEL PPVLWVALPPSLLKPPAVAVPLYPAWGGYLERRMRGFFPVGRTKGSTDDLKLGSREQTRL SCIQIRRQ >gi568815592f:42464320_42717507|GENSCAN_predicted_CDS_4|567_bp atgaagtgtgtctccttaactgctgatgaccggagccttctgggaattcctgagggaggg gaaggacttagcctgtggcttctagtcctgaggatgctccctttgaatgtcagttaccag ctgcttatggaagctaagacaagggacatcaagtacttcggggtcccaagctgtataaca ggaatcggtcacgttcaaacaaaagctagctccagtgctgaagaggaactccagagatcg aaaagctcagccccacggaggttggtccctcctccggcactgcggaggccagcctgggca agggcgccccctgcaggctcccggcatcggtgccggagaagcacggtggactctgagctg ccgccggttttgtgggtggctttgcctccctcacttctaaaacctccagcggtggcggtc cccttataccctgcctgggggggatacttggagaggagaatgaggggcttcttcccagtg ggcagaaccaaaggcagtaccgatgacttgaagctgggctctagagagcaaacaagactt tcctgtattcaaattcgcaggcagtga >gi568815592f:42464320_42717507|GENSCAN_predicted_peptide_5|1180_aa MASELEPEVQAIDRSLLECSAEEIAGAAPVGRNRLLDVLEQPDPDLGKQLCRPSPCGKWL QATDLTREVYQHLAHYVPKIYCRGPNPFPQKEDMLAQHVLLGPMEWYLCGEDPAFGFPKL EQANKPSHLCGRVFKVGEPTYSCRDCAVDPTCVLCMECFLGSIHRDHRYRMTTSGGGGFC DCGDTEAWKEGPYCQKHELNTSEIEEEEDPLVHLSEDVIARTYNIFAITFRYAVEILTWE KESELPADLEMVEKSDTYYCMLFNDEVHTYEQVIYTLQKAVNCTQKEAIGFATTVDRDGR RSVRYGDFQYCEQAKSVIVNYQQLQRDFMEDDHERAVSVTALSVQFFTAPTLNYERLQSD YVTDDHDREFSVADLSVQIFTVPSLARMLITEENLMSIIIKTFMDHLRHRDAQGRFQFER YTALQAFKFRRVQSLILDLKYVLISKPTEWSDELRQKFLEGFDAFLELLKCMQGMDPITR QVGQHIEMEPEWEAAFTLQMKLTHVISMMQDWCASDIYYYHNVKCRREMFDKDVVMLQIF STPDYGKRFSSEITHKDVVQQNNTLIEEMLYLIIMLVGERFSPGVGQVNATDEIKREIIH QLSIKPMAHSELVKSLPEDENKETGMESVIEAVAHFKKPGLTGRGMYELKPECAKEFNLY FYHFSRAEQSKTFNAVKKMRESSPTSPVAETEGTIMEESSRDKDKAERKRKAEIARLRRE KIMAQMSEMQRHFIDENKELFQQTLELDASTSAVLDHRYFDSVQAKEQRRQQRLRLHTSY DVENGEFLCPLCECLSNTVIPLLLPPRNIFNNRLNFSDQPNLTQWIRTISQQIKALQFLR KEESTPNNASTKNSENVDELQLPEGFRPDFRPKIPYSESIKEMLTTFGTATYKVGLKVHP NEEDPRVPIMCWGSCAYTIQSIERILSDEDKPLFGPLPCRLVGLVLAFPALQCQDFSGIS LGTGDLHIFHLVTMAHIIQILLTSCTEENGMDQENPPCEEESAVLALYKTLHQYTGRYPR ESNKLINLPEDYSSLINQASNFSCPKSGGDKSRAPTLCLVCGSLLCSQSYCCQTELEGED VGACTAHTYSCGSGVGIFLRVRECQVLFLAGKTKGCFYSPPYLDDYGETDQGLRRGNPLH LCKERFKKIQKLWHQHSVTEEIGHAQEANQTLVGIDWQHL >gi568815592f:42464320_42717507|GENSCAN_predicted_CDS_5|3543_bp atggcgtcggagctagagccagaggtgcaggccatcgaccggagcttgctggaatgttcg gccgaggagattgcgggggccgcgcctgtggggaggaaccgactcctggacgtcctggag cagcccgacccagacttggggaaacagctgtgccggccctcgccttgtgggaaatggctg caagcaactgacctcactagagaagtgtaccagcatttagcccactatgtacccaaaatc tactgcaggggtcccaacccttttccacagaaagaagacatgctggcacagcatgttttg ttgggaccaatggaatggtacctttgtggtgaagatcctgcatttggatttccaaaactt gagcaagcaaacaaaccttctcatctttgtggtcgtgtttttaaagtaggagagcctaca tattcttgcagagactgtgcagttgatccaacttgtgttttgtgcatggagtgctttttg ggaagtattcacagagatcatcgatataggatgacaacatcaggaggtggaggtttctgt gactgtggtgatactgaagcctggaaagagggtccttactgtcaaaaacatgaacttaac acctctgaaattgaggaagaagaggatcctcttgttcatttatcagaagatgtgatagca agaacttataacatttttgctattacgtttcggtatgcagtagaaatattaacctgggaa aaagaaagtgaattgccagcagatttagagatggtagagaagagtgacacctactattgc atgctgtttaatgatgaggttcacacctatgaacaagttatttatactcttcagaaagct gttaactgtacacaaaaagaagctattggttttgcaactacagtagatcgagatgggcgt aggtctgttcgatatggagattttcagtattgtgagcaagcaaaatcagtaattgtgaat taccagcagttgcagagagattttatggaggatgatcacgagcgagcagtgtcggtgact gctctatctgtccagttcttcaccgcacctactctgaactatgagcgtttgcagagtgat tatgtgacagatgaccacgacagagagttttcagtcgcagacctctcggttcagatattc acggttccttcacttgctcgaatgctcatcacagaagaaaacttaatgagcattatcatt aagacttttatggatcatttgagacatcgagatgcccagggcagatttcagtttgaacga tacactgctttacaagccttcaaatttaggagagtacagagccttattttagatctcaag tatgtgttaattagcaaaccaactgaatggtcagatgagctgaggcagaagttcctagaa gggtttgatgcctttttggaattactaaaatgtatgcagggaatggatccaattacacgt caagtaggacaacatattgaaatggaaccagagtgggaagcagccttcacactacaaatg aaattaacacatgtcatttcaatgatgcaggactggtgtgcttcagatatttattactac cataatgtgaaatgcagacgtgagatgtttgacaaggatgtagtaatgcttcagattttc agtactccagactatggaaaaagatttagttctgagattacccataaggatgttgttcag cagaacaatactctaatagaagaaatgctatacctcattataatgcttgttggagagaga tttagtcctggagttggacaggtaaatgctacagatgaaatcaagcgagagattatccat cagttgagtatcaagcctatggctcatagtgaattggtaaagtctttacctgaagatgag aacaaggagactggcatggagagtgtaatcgaagcagttgcccatttcaagaaacctgga ttaacaggacgaggcatgtatgaactgaaaccagaatgtgccaaagagttcaacttgtat ttctatcacttttcaagggcagaacagtccaagacttttaatgctgttaaaaagatgagg gagagttcacctaccagtcccgtggcagagacagaaggaaccataatggaagagagttca agggacaaagacaaagctgagaggaagagaaaagcagagattgccagactgcgcagagaa aagatcatggctcagatgtctgaaatgcagcggcattttattgatgaaaacaaagaactc tttcagcagacattagaactggatgcctcaacctctgctgttcttgatcataggtatttt gattccgttcaagctaaagaacagcgaaggcaacagagattacgcttacatacgagctat gatgtagaaaacggagaattcctttgccccctttgtgaatgcttgagtaatactgttatt cctctgctgcttcctccaagaaatatttttaacaacaggttaaatttttcagaccaacca aatctgactcagtggattagaacaatatctcagcaaataaaagcattacagtttcttagg aaagaagaaagtactcctaataatgcctctacaaagaattcagaaaatgtggatgaatta cagctccctgaagggttcaggcctgattttcgtcctaagatcccttattctgagagcata aaagaaatgctaacgacatttggaactgctacctacaaggtgggactaaaggttcatccc aatgaagaggatcctcgtgttcccataatgtgttggggtagctgcgcgtacaccatccaa agcatagaaagaattttgagtgatgaagataaaccattgtttggtcctttaccttgcaga ctggtgggcttggtgcttgcatttcctgcgttgcagtgtcaggatttttcagggatcagc cttggcactggagaccttcacattttccatctggttactatggcacacatcatacagatc ttacttacctcatgtacagaagagaatggcatggatcaagaaaatcccccttgtgaagaa gaatcagcagttcttgctttgtataaaacacttcaccagtatacgggaagatatccaaga gaatctaacaaattaataaaccttccagaggattacagcagcctcattaatcaagcatcc aatttctcgtgcccgaaatcaggtggtgataagagcagagccccaactctgtgccttgtg tgcggatctctgctgtgctcccagagttactgctgccagactgaactggaaggggaggat gtaggagcctgcacagctcacacctactcctgtggctctggagtgggcatcttcctgaga gtacgggaatgtcaggtgctatttttagctggcaaaaccaaaggctgtttttattctcct ccttaccttgatgactatggggagaccgaccagggactcagacggggaaatcctttacat ttatgcaaagagcgattcaagaagattcagaagctctggcaccaacacagtgtcacagag gaaattggacatgcacaggaagccaatcagacactggttggcattgactggcaacattta taa >gi568815592f:42464320_42717507|GENSCAN_predicted_peptide_6|487_aa MGLSLGFNLPHLPPLAWWSWVGQEEGDTLPATLGGGSQREEPSPGMLSPSLTLSFQSKPG AQRGAAEGQLSKTQFLPSPRTWLEWGFEWWAGSASGQVLRVRPTMEGGPHDPALKELTAS LPQLVEPARPTPQGGLSHDMIWRGESGAPTWLGVESSEGKDCACLGSPSPASAQALGLSR PSGNTWLERALATSDLRIKSNVDGRYLVDGVPFSCCNPSSPRPCIQYQITNNSAHYSYDH QTEELNLWVRGCRAALLSYYSSLMNSMGVVTLLIWLFEVTITIGLRYLQTSLDGVSNPEE SESESQGWLLERSVPETWKAFLESVKKLGKGNQHHTTPKVPGDLLWLIGGSSGVGCCPAV EEEQQSDMSPPFGDQASALGGPRDTHTGGHHSPMDNTNCPQQRARNPLGASSVSSPQIPI LKNTISGMLLHQRRFFEIAHLPELANTGVFTFFRKEKLSGEGENDFADFPSPGDQTKAGP SIIPPVG >gi568815592f:42464320_42717507|GENSCAN_predicted_CDS_6|1464_bp atggggctgtcactgggcttcaacctcccacatctccctcccctggcctggtggtcctgg gtgggccaggaagagggggacacacttcctgctacacttggaggaggctctcagagggag gagccaagtccaggaatgctgtctccctcactgactctgtcttttcagagcaagcccggt gcccagagaggggctgcagaaggacagctgagcaagacacagtttctgccttccccacgc acatggctggagtggggctttgagtggtgggcaggcagtgccagcgggcaggtgctccgt gtcaggcccaccatggagggtggcccgcacgatcctgccctcaaggagctgacagcttct ttgccacagctggtagaacctgcacggccaactccccagggagggctctcccatgatatg atatggcgtggtgagtcaggagctcccacctggctgggggtggagtcctccgagggcaag gactgtgcctgcctcgggtctccatcccctgcctctgcacaggccctgggactgagcagg ccttctggaaacacctggctggaaagggccttggccaccagtgaccttcgaatcaagagc aacgtggatgggcggtacctggtggacggcgtccctttcagctgctgcaatcctagctcg ccacggccctgcatccagtatcagatcaccaacaactcagcacactacagttacgaccac cagacggaggagctcaacctgtgggtgcgtggctgcagggctgccctgctgagctactac agcagcctcatgaactccatgggtgtcgtcacgctcctcatttggctcttcgaggtgacc attacaattgggctgcgctacctacagacgtcgctggatggtgtgtccaaccccgaggaa tctgagagcgagagccagggctggctgctggagaggagcgtgccggagacctggaaggcc tttctggagagtgtgaagaagctgggcaagggcaaccagcaccacaccacccctaaagtg ccaggtgatctcctgtggctcatcggtggaagcagtggggtaggctgctgccctgctgtg gaagaggagcaacaatcagacatgagtccaccctttggagaccaggcctcagctcttggt gggcccagggacacccacacaggtggccatcacagccccatggacaacactaattgtcca cagcaaagggcaaggaatcctctgggagcttcttccgtttcttccccccagatacccatc ttgaaaaacactatttctggaatgcttctgcatcaaaggagattctttgagatagcccat cttcctgagctagcaaatacaggagttttcactttctttaggaaagagaagctttcaggg gaaggagagaatgattttgctgacttcccaagccctggtgaccagaccaaggcagggccc agcataattcctccagttggatga