GENSCAN 1.0 Date run: 4-Nov-116 Time: 08:17:20

Sequence gi568815590r:73846633_74056055 : 209423 bp : 41.81% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -   2169   2164    6                               1.05
 1.02 Term -  32780  32457  324  2  0   15   43   409 0.142  22.88
 1.01 Init -  46427  46365   63  2  0   77   62    53 0.132   2.82
 1.00 Prom -  48051  48012   40                              -8.05

 2.00 Prom +  48157  48196   40                              -6.75
 2.01 Init +  50433  50591  159  2  0   67   95    56 0.230   4.07
 2.02 Intr +  60158  60265  108  1  0  102   52    47 0.421   2.06
 2.03 Intr +  61511  61689  179  1  2   91   44    83 0.438   2.00
 2.04 Intr +  62682  62766   85  0  1   54   68   113 0.222   4.80
 2.05 Intr +  63716  64028  313  1  1   57   69   158 0.614   5.63
 2.06 Intr +  64493  65119  627  0  0   55   48   355 0.545  19.45
 2.07 Term +  66262  66899  638  2  2   44   43   193 0.226   3.82
 2.08 PlyA +  67187  67192    6                               1.05

 3.04 PlyA -  67258  67253    6                               1.05
 3.03 Term -  68097  67948  150  0  0   40   36   136 0.633   0.53
 3.02 Intr -  68504  68253  252  2  0   19   41   198 0.067   4.91
 3.01 Init -  86097  85960  138  0  0   47   84   137 0.413   9.39
 3.00 Prom -  87443  87404   40                              -5.85

 4.00 Prom +  91518  91557   40                              -5.55
 4.01 Sngl +  92912  93154  243  1  0  100   45   166 0.969   8.33
 4.02 PlyA +  95296  95301    6                               1.05

 5.04 PlyA -  96210  96205    6                               1.05
 5.03 Term - 100188  99998  191  1  2  108   36   132 0.997   6.63
 5.02 Intr - 109422 109279  144  1  0   79  115   108 0.947  11.93
 5.01 Init - 113136 113133    4  0  1   68   94     0 0.548  -1.19
 5.00 Prom - 116161 116122   40                              -7.95

 6.00 Prom + 117601 117640   40                              -6.85
 6.01 Init + 118299 118326   28  2  1   68   86    21 0.392  -0.28
 6.02 Intr + 122878 122997  120  2  0   65   98   110 0.964   9.25
 6.03 Intr + 124937 125018   82  0  1   49   92    81 0.914   2.38
 6.04 Intr + 125433 125615  183  0  0   73   81   193 0.914  15.08
 6.05 Intr + 129576 129859  284  0  2   31   91   184 0.025   9.14
 6.06 Intr + 132143 132229   87  0  0   26  111    52 0.331   0.32
 6.07 Term + 134523 134989  467  1  2  110   52   280 0.975  20.79
 6.08 PlyA + 135042 135047    6                               1.05

 7.00 Prom + 135338 135377   40                              -9.55
 7.01 Init + 135493 135594  102  0  0  110   93    86 0.910  11.59
 7.02 Term + 136683 136703   21  2  0  132   54     7 0.631  -0.97
 7.03 PlyA + 137291 137296    6                               1.05

 8.00 Prom + 137650 137689   40                              -8.05
 8.01 Sngl + 137861 138172  312  1  0  102   48   232 0.588  16.28
 8.02 PlyA + 138698 138703    6                               1.05

 9.00 Prom + 139720 139759   40                              -3.55
 9.01 Init + 146033 146047   15  1  0  101   68    36 0.058   1.60
 9.02 Intr + 161865 162042  178  2  1   18   68   142 0.121   3.77
 9.03 Intr + 163369 163497  129  2  0   72  107    69 0.209   6.95
 9.04 Term + 170267 170763  497  0  2   23   43   397 0.016  22.44
 9.05 PlyA + 171190 171195    6                               1.05

10.00 Prom + 171428 171467   40                              -4.75
10.01 Sngl + 171839 172408  570  1  0   49   48   224 0.766  10.40
10.02 PlyA + 172468 172473    6                              -0.45

11.05 PlyA - 172512 172507    6                              -0.45
11.04 Term - 173123 173011  113  0  2   24   49   124 0.421  -0.16
11.03 Intr - 176602 176488  115  1  1   62   98   109 0.611   8.40
11.02 Intr - 180990 180922   69  0  0   60   98    66 0.628   3.26
11.01 Init - 184554 184351  204  0  0   50   80   176 0.255  11.90

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  32735  32457  279  2  0    9   43   366 0.826  19.38
S.002 Sngl + 170308 170763  456  0  0   88   43   424 0.952  33.83

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590r:73846633_74056055|GENSCAN_predicted_peptide_1|128_aa
MELIKANQIFVARTIPGAAELCRSALKSSSRCTRKPMSLFPKLRIREAFQVCKYQKAAAH
EVLWGPKALRVGLEVWKTHPKGKICEPHNCGWKRNRKKTARNRTSSAFRGTHEENNRSRV
KTGAVSPQ

>gi568815590r:73846633_74056055|GENSCAN_predicted_CDS_1|387_bp
atggagctgattaaagccaatcagatctttgtagctagaactatccctggggctgctgag
ctgtgtcgttctgctcttaaatcctcatcccgttgtacaaggaaacctatgtctctcttc
cccaaattaagaatacgagaagccttccaggtctgcaaataccagaaagcagctgcgcat
gaagttttatggggccccaaggctctgagagtgggcttggaggtatggaaaacacatccc
aagggaaaaatctgtgaaccgcacaactgtggttggaaacgaaataggaaaaaaactgcg
cggaaccggacgtcatcagcatttcgcggtacacacgaagaaaacaaccgtagtcgggtc
aaaactggagccgtgtctcctcagtaa

>gi568815590r:73846633_74056055|GENSCAN_predicted_peptide_2|702_aa
MGIRELPNFHPYAGKMAHPNSTGTKAPVLWAIPDLAIWLFICILCNKLVNIIKAELRVNR
EESHSALGSQEEVGCPCSPEQSSQSYVTQKETSKEISKGPQKPLGYWLCALQAVGGGEFC
PTRVHVPFSLSDLKQIKVDLGKFSDDPDSLLSVNLLTRTAVLTVRYHLRNPGTVCNQTLK
QLRGFLGSTGFCQLWIPGYNEMARPLYTLIKETQRANTHLGEWEPEAETAFKTLNQAPVQ
TPALSFPTGKNFSLYITERAGIALGVLTQTSGTTPQPVACLRTYVQLAELVVLTRALELG
KKKRINVYTDSKYAYLILHAHAAIWKEREFVTSGGIPIKYHKDIMELLHAVQKPKEVAVL
QCQSHQKGEEEKAEGNCWADAEAKIAAKRNLPLEIPTEGPLVWNNPLQEIKLQYSPTKTE
WGISWGHSFLPSGWLMTEEGKVFIPEASQWKILKSLHQTFHMGIENTHQRAKSLIIGPNL
LQTIQQVVKAWLGMATATGTRIASLSTSLSYYHTRSKDFSDSLQEITKSILTLQSQIDSL
AAVTLQNHQGLYLLAAEKGGLCTFLGEECCFYTNQSGIVQDAAWHLQEKASEIRQCLSNS
YTNLWSWATWLLPFLGPVTAFLLLLAFGPCIFNLLVKFVSSRIEAIKLQMVLQMEPQMSS
THNFYRGPLDQPTGPLTGLESSPLEDTTTAGSLLCPYLAGSS

>gi568815590r:73846633_74056055|GENSCAN_predicted_CDS_2|2109_bp
atggggatcagagaacttccaaattttcatccatatgctgggaagatggcacaccccaac
tccacaggaacaaaggctcctgttctctgggccattccggaccttgccatctggctgttc
atttgtatactttgtaataaattggtaaacataattaaggctgagctgagggtcaacaga
gaggaaagccattcagctctggggtcccaagaagaagttggttgtccctgcagccctgag
cagagctctcaaagttacgtcacccaaaaggaaacaagcaaagaaatctccaagggacca
caaaaacccctgggctattggttatgtgcccttcaagctgtagggggaggggaattttgc
ccaacccgggtacacgtccccttctccctctctgatttaaagcagatcaaggtagatctg
gggaagttttcagatgatcctgatagccttctcagtgttaatctcctgacccggacagct
gtcctcacagtccgttaccatctgaggaatcctgggacagtctgtaaccagacattaaaa
cagttgcgggggttccttggaagcaccggcttttgccaactatggatccctggatacaat
gagatggccaggccactctatactctaatcaaggagacccagagggcaaatactcatcta
ggagaatgggaaccagaggcagaaacagccttcaaaaccttaaatcaggccccagtacaa
actccagccttaagctttcccacaggaaaaaacttctctttatacatcacagagagagca
gggatagctcttggagtccttactcagactagtgggacaaccccacaaccagtggcatgc
ctaaggacctatgtccagttagcagaactagtggtgcttacccgagccttagaactggga
aagaaaaaaagaataaatgtgtatacagatagcaagtatgcttatctaatcctacatgcc
catgctgcaatatggaaagaaagggagttcgtaacctctgggggaatccccattaaatac
cacaaggatatcatggagttattgcacgcagtgcaaaaacccaaggaggtggctgtctta
cagtgccaaagccatcaaaaaggtgaagaagaaaaggcagaaggaaactgttgggcagac
gctgaggccaaaattgctgccaagcggaacctcccattagaaatacctacggaaggaccc
ttggtatggaacaaccctctccaagagattaagctgcagtattccccgaccaaaacagaa
tggggaatttcatggggccatagttttctcccctctgggtggttaatgacagaagaggga
aaggtattcatacctgaagccagccagtggaaaatacttaagtccctccaccaaactttt
catatgggtattgagaatactcatcaaagggccaaatccctaattatagggccaaatctc
ctccagaccatccagcaagtagtcaaagcctggttaggaatggccactgctacaggaacc
agaatagccagtttatctacttcactatcctactaccacacacgctcaaaggatttctca
gacagtttgcaagaaataacaaaatctatccttactctacaatcccaaatagactctttg
gcagcagtgactctccaaaaccaccaaggcctatacctcctcgctgctgagaaaggagga
ctttgcaccttcttaggggaagagtgttgtttttacactaaccagtcagggatagtgcaa
gacgctgcctggcatttacaggaaaaggcttctgaaatcagacaatgcctttcaaactct
tataccaacctctggagttgggcaacatggcttctcccctttctaggtcccgtgacagcc
ttcttgctattactcgcctttgggccctgtatttttaacctccttgtcaaatttgtttcc
tccaggattgaggccatcaagctacagatggtcttacaaatggaaccccaaatgagctca
actcacaacttctaccgaggacccctggatcaacccactggccctttgactggcctagag
agttcccctctggaggacactacaactgcagggtcccttctttgcccctatctagcagga
agtagctag

>gi568815590r:73846633_74056055|GENSCAN_predicted_peptide_3|179_aa
MEKRVGTSKVKQDSFSQPNDSPQRLNLLPSSWSEEAGTPLPGVTCRLNPCRVLKDSPAVP
PRCPCSLRAALAPSQPAAAPAACTPSSTMTDQAFVTLTTNDAYTKGVLALGSSLKQHRTT
KRLVILTTPQYSKCVFMDADTLVLANIDDLFEREELSAGPDPRWPDCFNSEVFVYQPSV

>gi568815590r:73846633_74056055|GENSCAN_predicted_CDS_3|540_bp
atggagaaaagagtgggaacctcaaaagttaaacaggattcattttcccaacccaatgac
tctcctcaacgactaaatctgctgccatcttcttggagtgaagaagcaggaacaccctta
cctggagttacctgcaggttgaacccctgcagagttttaaaggatagtcccgctgtgcct
cctcgctgcccttgctccctccgtgctgcccttgctccctcccaacctgcggctgccccg
gctgcctgcacccccagcagcaccatgacagatcaggcctttgtgacactgaccacgaat
gatgcctacaccaaaggtgtcctggccctggggtcatctctgaaacagcacaggaccacc
aagagactggtcatactcaccacccctcagtattcaaaatgtgtatttatggatgcggat
actctggtcctagcaaatattgatgatctttttgagagagaagaattgtcagcaggacca
gacccaaggtggcctgactgcttcaattccgaagtcttcgtttatcagccttcagtttaa

>gi568815590r:73846633_74056055|GENSCAN_predicted_peptide_4|80_aa
MAERGQHRVWAVASEGASLNPWQLSNGVEPVSAQMSRIEVWEPLPRFQKIYRNAWMSRQK
FAAGQGSHGDPLIGQCGRKM

>gi568815590r:73846633_74056055|GENSCAN_predicted_CDS_4|243_bp
atggctgaaaggggccaacatagagtttgggccgtggcttcagagggtgcaagcctcaat
ccttggcagctttcaaatggtgttgagcctgtcagtgcacagatgtcaagaattgaggtt
tgggaacctctgcctagatttcagaagatatatcgaaatgcctggatgtccaggcagaag
tttgctgcagggcagggctctcatggggaccctctgatagggcagtgtggaagaaaaatg
tga

>gi568815590r:73846633_74056055|GENSCAN_predicted_peptide_5|112_aa
MDGEEKTYGGCEGPDAMYVKLISSDGHEFIVKREHALTSGTIKAMLSGPGQFAENETNEV
NFREIPSHVLSKVCMYFTYKVRYTNSSTEIPEFPIAPEIALELLMAANFLDC

>gi568815590r:73846633_74056055|GENSCAN_predicted_CDS_5|339_bp
atggatggagaggagaaaacctatggtggctgtgaaggacctgatgccatgtatgtcaaa
ttgatatcatctgatggccatgaatttattgtaaaaagagaacatgcattaacatcaggc
acgataaaagccatgttgagtggcccaggtcagtttgctgagaacgaaaccaatgaggtc
aattttagagagataccttcacatgtgctatcgaaagtatgcatgtattttacgtacaag
gttcgctacactaacagctccaccgagattcctgaattcccaattgcacctgaaattgca
ctggaactgctgatggctgcgaacttcttagattgttaa

>gi568815590r:73846633_74056055|GENSCAN_predicted_peptide_6|416_aa
MGESPEPEEVTLAFLVFLEIGQVHSYLKVFAFVIAVVRKAFKSSSFRSPDKTGRPVDARY
PQLMKSSIESPNKLPRSPRLTRQSRSHRSRVPAYCHSPYPRAAPPPPAACSGGFRGSRKN
YFRWVRAAFGDAREIRTCRVGSRVSQSWTRAAGASAAARHPRDAVSGVGQPVGGRTASLR
KEDCIVCGRRAPRSPGLCLPGVLQQRAFGAGSRLEYGAFGSRAPSPASGSSAGYVRFLNT
PSDKSEDGRLIYTGNMARAVFGVKCFSYSTSLIGLTFLPYIFTQNNAISESVPLPIQIIF
YGIMGSFTVITPVLLHFITKGYVIRLYHEATTDTYKAITYNAMLAETSTVFHQNDVKIPD
AKHVFTTFYAKTKSLLVNPVLFPNREDYIHLMGYDKEEFILYMEETSEEKRHKDDK

>gi568815590r:73846633_74056055|GENSCAN_predicted_CDS_6|1251_bp
atgggagaatcacctgagcccgaggaggtcacactggcctttttggtattcctcgaaatc
ggtcaagttcactcttacctcaaggtctttgcatttgtaattgctgttgtccgaaaagct
tttaaatcctcatcatttaggtctccagacaagactggacggccagtagatgcacgatac
ccccaactcatgaagtcgagcatagaatcacccaacaagttacccagaagcccgagactc
actcgtcagtcccgcagccaccgcagccgggtccccgcgtactgccacagcccctatccc
agggccgcccccccacccccagctgcctgctccggtgggttccggggcagtcgaaagaat
tacttccgctgggtcagagccgctttcggcgacgcgcgggagatccgtacctgtcgggtg
ggaagccgtgtctcgcagtcgtggactcgtgcagctggggcgtccgcagccgctcgtcac
ccgcgtgatgctgtttctggcgttgggcagcccgtgggcggtcgaactgcctctctgcgg
aaggaggactgcattgtgtgcggccgccgcgctccgaggtccccgggcctctgtctcccg
ggcgtcctccagcagcgggccttcggggccggtagccggctggagtacggggccttcggg
agccgcgcgccttctccggcgtccgggtcgagcgcaggatatgttcgattcttaaatacg
ccatctgacaaatcagaagatggaaggctaatttatactggcaatatggcccgagcagtg
tttggtgtgaaatgtttctcttattctacgagtctgattggccttacatttctgccatac
atttttacacaaaataatgctatttctgaaagtgtgcctctgcctattcaaatcatattc
tatggcatcatgggaagctttacggtgatcaccccagtgctgcttcactttattacaaaa
ggctatgtcattcgattgtaccatgaggccacaacagacacttataaagccattacctac
aatgctatgcttgcagaaacgagtacagtgtttcaccagaatgatgtgaagattccagat
gctaaacatgtatttaccacattttatgctaaaacaaaatcactgttagttaatccagtg
ctctttccaaaccgtgaagactatatccatctaatgggttatgacaaagaagaatttatt
ttgtatatggaagaaaccagtgaagagaaacggcataaagatgacaaatga

>gi568815590r:73846633_74056055|GENSCAN_predicted_peptide_7|40_aa
MAFKHTRKTPVESEVAIHRIHITLTSCNVKSLDKMYPSNE

>gi568815590r:73846633_74056055|GENSCAN_predicted_CDS_7|123_bp
atggcttttaaacataccagaaaaacacccgtggagtcagaggtggcaattcaccgaatt
catatcactctaacgagttgcaacgtaaaatccctggataagatgtatcctagcaatgaa
tga

>gi568815590r:73846633_74056055|GENSCAN_predicted_peptide_8|103_aa
MASGVQVADEVCRIFYDMKVRKCSTPEEIKKRKKAVIFCLGADEKCIIVEEGKEILVGDV
GVTITGPFKHFVGMLPEKDCRCALYDASFETKESSTEELIFFL

>gi568815590r:73846633_74056055|GENSCAN_predicted_CDS_8|312_bp
atggcctcaggagtgcaagtagctgatgaagtatgtcgcattttttatgacatgaaagtt
cgtaaatgctccacaccagaagaaatcaagaaaagaaagaaggctgtcattttttgtctc
ggtgcagacgaaaagtgcatcattgtagaagaaggcaaagagatcttggttggagatgtt
ggtgtaaccataactggtcctttcaagcattttgtgggaatgcttcctgaaaaagattgt
cgctgtgctttgtatgatgcaagctttgaaacaaaagaatccagtacagaagagttgatt
ttttttttgtag

>gi568815590r:73846633_74056055|GENSCAN_predicted_peptide_9|272_aa
MALLLLRSEDKLGKLVVLQVQSNEGLHEGSAVMGMKRRLERFTGTGNSSANLVMKRRKSL
LDLLGRDLKQLYFNLYITVNTMNLPKRKEVICRGSDDDYSFCRALKGGHHHQRPKVGKTT
KMGSNQSRKAENSKNQSVSSPKDRSSSPATEQSWMENDFDELTEEGFRRSVITNFAELKE
DVRTHRKEAKNLEKGLDEWLTRINSVEKTLNDLMEMKSVARELHDTCRSFSSQFDQVEER
VSVIEDQISEMKQEEKFREKRVVRNEQSLQET

>gi568815590r:73846633_74056055|GENSCAN_predicted_CDS_9|819_bp
atggcccttctcctgttgagaagtgaagacaaattaggaaagttggtggtgcttcaggtt
cagagcaatgagggcctacatgaaggcagcgcggtaatggggatgaaaaggaggttagaa
agatttactgggactgggaattctagtgccaacttggtgatgaagaggaggaaatcactt
ttagatctgttggggagagatttaaagcaattatatttcaatctctatataactgtcaac
accatgaatcttccaaagcgcaaagaagttatttgccgaggatctgatgacgattactct
ttttgcagagctctgaagggaggtcaccatcatcaaagaccaaaggtaggtaaaaccaca
aagatggggagcaaccagagcagaaaagctgaaaattctaaaaaccagagtgtctcttct
ccaaaggatcgcagctcctcgccagcaacggaacaaagctggatggagaatgactttgat
gagttgacagaagaaggctttagaaggtcagtaataacaaacttcgctgagctaaaggag
gatgttcgaacccatcgcaaggaagctaaaaaccttgaaaaaggattggatgaatggcta
actagaataaacagtgtagagaagaccttaaatgacctgatggagatgaaaagcgtggca
cgagaactacatgacacatgcagaagcttcagtagccaattcgatcaagtggaagaaagg
gtatcagtgattgaagatcaaattagtgaaatgaagcaagaagagaagtttagagaaaaa
agagtagtaagaaatgaacaaagcctccaagaaacatga

>gi568815590r:73846633_74056055|GENSCAN_predicted_peptide_10|189_aa
MGYFNTPLSILDRSMRQKVNKDIQDLNTVLHQADLIDIYRTLHPKSTEYTFFSAPHCTYS
KIDHIVGSKALLSKCKRTEITTNCLSDHNAIKLELRIKKLTKNYTTTWKLNNLLLSDYWV
HNEMKAEIKIFFETNENKDITYQNLWDTFKSVCRGKFIALNAHKRKQKRSKIDTLTSQLK
ELEKQEQTH

>gi568815590r:73846633_74056055|GENSCAN_predicted_CDS_10|570_bp
atgggatactttaacaccccactgtcaatattagacagatcaatgagacagaaggttaac
aaggatatccaggacttgaacacagttctgcaccaagcagacctaatagacatctacaga
actctccaccccaaatcaacagaatatacattcttctcagcaccacattgcacttattcc
aaaattgaccacatagttggaagtaaagcactcctcagcaaatgtaaaagaacagaaatc
acaacaaactgtctttcagaccacaatgcaatcaaattagaactcaggattaagaaactc
actaaaaactacacaactacatggaaactgaacaacctgctcctaagtgactactgggta
cataacgaaatgaaggcagaaataaagatattctttgaaaccaatgagaacaaagacata
acgtaccagaatctctgggacacatttaaatcagtgtgtagagggaaatttatagcacta
aatgcccacaagagaaagcagaaaagatctaaaatcgacaccctaacatcacaattaaaa
gaactagagaagcaagagcaaacacattga

>gi568815590r:73846633_74056055|GENSCAN_predicted_peptide_11|166_aa
MRDMVVMDRKGGNYNRKGGNPITDTLSGGRKLGSVQKPLDNTGVAPARNPQLLQNLFQPH
ATAKSSLKFICILLLGCEKWQPDPQFGPGTKPGFHQEQRVAGCTGQKRGGDPMVPAGKWA
AGTKNRSREALHPKDEANLILAHKLFDVLLDLVCQYFIEDFLIDVH

>gi568815590r:73846633_74056055|GENSCAN_predicted_CDS_11|501_bp
atgagagatatggtggtcatggacaggaaaggaggaaattacaataggaaaggtggcaat
cctattactgacaccctatcaggtggtcggaagctggggtcagtccagaagcctttggat
aacacgggggtagccccagccagaaatcctcagttgctccaaaacctcttccagccccat
gcaacagctaagtcctctctgaaattcatctgcatcttgctattgggctgcgagaaatgg
cagcccgaccctcagtttggtccaggaacaaagccaggatttcaccaggagcagagggta
gcaggttgcactggacagaagaggggaggggatcccatggtgccagcagggaagtgggca
gcgggaaccaagaaccggtccagggaagccttgcatcccaaggatgaagccaacttgatc
ttggcgcataagctttttgatgtgttgctggatttggtttgccagtattttattgaggat
tttctcatcgatgttcattag