GENSCAN 1.0    Date run: 6-Nov-116    Time: 02:18:52

Sequence gi568815590r:73705651_73930474 : 224824 bp : 40.46% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 Intr -   3512   3382  131  0  2   63  116    78 0.162   7.59
 1.01 Init -  22383  22284  100  0  1   59   61    88 0.310   3.67
 1.00 Prom -  25915  25876   40                              -4.95

 2.00 Prom +  34175  34214   40                              -5.25
 2.01 Init +  40972  41124  153  0  0   43   30   189 0.839   6.63
 2.02 Intr +  41433  41624  192  2  0   32   47   159 0.762   4.97
 2.03 Term +  41746  41832   87  0  0  111   44   144 0.864   8.98
 2.04 PlyA +  44377  44382    6                               1.05

 3.03 PlyA -  45141  45136    6                               1.05
 3.02 Term -  50650  50540  111  1  0   93   47    87 0.717   2.68
 3.01 Init -  51939  51751  189  0  0   63   37    92 0.294   0.66
 3.00 Prom -  73018  72979   40                              -3.55

 4.07 PlyA -  73179  73174    6                               1.05
 4.06 Term -  88465  88364  102  1  0  100   42    91 0.478   3.00
 4.05 Intr - 103452 103390   63  0  0   41  100    68 0.653   1.30
 4.04 Intr - 104979 104824  156  0  0  101  116    11 0.867   4.49
 4.03 Intr - 119599 119497  103  0  1  108   91    72 0.775   8.76
 4.02 Intr - 124822 124731   92  1  2   72   98    99 0.752   7.17
 4.01 Init - 152302 152285   18  1  0  114   61     6 0.083   0.57
 4.00 Prom - 164876 164837   40                              -5.25

 5.03 PlyA - 164968 164963    6                               1.05
 5.02 Term - 173762 173439  324  2  0   15   43   409 0.142  22.88
 5.01 Init - 187409 187347   63  2  0   77   62    53 0.132   2.82
 5.00 Prom - 189033 188994   40                              -8.05

 6.00 Prom + 189139 189178   40                              -6.75
 6.01 Init + 191415 191573  159  2  0   67   95    56 0.230   4.07
 6.02 Intr + 201140 201247  108  1  0  102   52    47 0.421   2.06
 6.03 Intr + 202493 202671  179  1  2   91   44    83 0.438   2.00
 6.04 Intr + 203664 203748   85  0  1   54   68   113 0.222   4.80
 6.05 Intr + 204698 205010  313  1  1   57   69   158 0.614   5.63
 6.06 Intr + 205475 206101  627  0  0   55   48   355 0.544  19.45
 6.07 Term + 207244 207881  638  2  2   44   43   193 0.226   3.82
 6.08 PlyA + 208169 208174    6                               1.05

 7.03 PlyA - 208240 208235    6                               1.05
 7.02 Term - 209079 208930  150  0  0   40   36   136 0.640   0.53
 7.01 Intr - 209508 209235  274  2  1   47   41   213 0.136   9.12

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl - 173717 173439  279  2  0    9   43   366 0.826  19.38

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590r:73705651_73930474|GENSCAN_predicted_peptide_1|77_aa
MGKGLGIDSYSEKINTSLILNIISDQGDANENHTSLQDKMANPKEKTAMCLVNELARFNR
VQPQYKLLNERGPAHSK

>gi568815590r:73705651_73930474|GENSCAN_predicted_CDS_1|231_bp
atgggcaaaggacttggaatagacagttactctgagaagataaacacatcattaatactc
aacatcattagtgatcagggagatgcaaatgaaaaccacacttctctccaagataaaatg
gcaaacccaaaagagaaaactgcaatgtgtctggtaaatgagttagcccgtttcaataga
gtccaaccccagtataaacttctgaatgaaagagggcctgctcattcaaag

>gi568815590r:73705651_73930474|GENSCAN_predicted_peptide_2|143_aa
MWEGRRGGGEAAKGAAFLRVPRAAELQGSPAPSAFFAGGAEVGVSRTPPMRLPPPGGCRA
PAAAGSAGLAAASWAPGRGPFPGGLGTRASDCARSAPGPQSPRPRRDSPPPRLPRALRLP
LCAVQGTRDHPASAPAAAVQRTV

>gi568815590r:73705651_73930474|GENSCAN_predicted_CDS_2|432_bp
atgtgggagggaaggcgcgggggaggggaggccgcgaagggggctgcttttctccgggtt
ccccgtgctgcggaactgcagggctccccagcgccctctgccttcttcgccggcggcgcg
gaagtcggggtctcccggacgcccccgatgaggctcccgccccctggcggctgccgggcc
ccagcagctgcaggctctgcggggctagcggcggcgagctgggcccctgggcgagggcca
ttcccggggggcttgggcacgcgggcgagcgactgcgcaaggagcgcgcccggtccgcag
tctcctcgtccccggcgcgactccccgccccctcgtctgccaagggctctccggctgccc
ctctgtgctgtgcaggggacgcgcgaccatcccgcgtctgctccggccgccgctgtgcaa
cgcacggtttag

>gi568815590r:73705651_73930474|GENSCAN_predicted_peptide_3|99_aa
MQKATIPGTLMTGQTSRLATHSGSDTERLETISEHTSRLHCHWQACLFNEKMNEGLRPAD
DAGPYPSNICRYIEKPHVGVLVDSPSCGPSGKPSSTIDE

>gi568815590r:73705651_73930474|GENSCAN_predicted_CDS_3|300_bp
atgcagaaagcaacaattccaggaactttgatgacggggcagacaagtagattggctaca
cacagtggcagtgatacagagagactagagacaatatctgaacacaccagcaggctgcac
tgccactggcaggcatgtttgttcaatgagaaaatgaatgaagggctcaggcctgcagat
gatgctgggccctaccccagcaacatctgcaggtacatagaaaagccacatgtgggtgtt
ctggttgacagccccagctgcggtcccagtggaaaaccatcatcaactatagatgagtga

>gi568815590r:73705651_73930474|GENSCAN_predicted_peptide_4|177_aa
MPSRLKKRLQKELLALQNDPPPGMTLNEKSVQNSITQWIVDMEGAPGTLYEGEKFQLLFK
FSSRYPFDSPQVMFTGENIPVHPHVYSNGHICLSILTEDWSPALSVQSVCLSIISMLSSC
KEKGKAVCGYKAFYVPVTLKDYRKMILVDATVIILLAEDSPTEKMSTLIIQSLNFNL

>gi568815590r:73705651_73930474|GENSCAN_predicted_CDS_4|534_bp
atgcccagccgcttaaagaaacgactacagaaagaactgttggctttgcaaaatgaccca
cctcctggaatgaccttaaatgagaagagtgttcaaaattcaattacacagtggattgta
gacatggaaggtgcaccaggtaccttatatgaaggggaaaaatttcaacttctatttaaa
tttagtagtcgatatccttttgactctcctcaggtcatgtttactggtgaaaatattcct
gttcatcctcatgtttatagcaatggtcatatctgtttatccattctaacagaagactgg
tccccagcgctctcagtccaatcagtttgtcttagcattattagcatgctttccagctgc
aaggaaaagggcaaagcagtttgtgggtacaaggccttctatgttcctgttacgcttaaa
gactatagaaagatgatacttgttgatgccactgttatcatcctcctagcagaagatagt
cctactgagaaaatgagcactttgatcattcagtctttgaactttaacctttga

>gi568815590r:73705651_73930474|GENSCAN_predicted_peptide_5|128_aa
MELIKANQIFVARTIPGAAELCRSALKSSSRCTRKPMSLFPKLRIREAFQVCKYQKAAAH
EVLWGPKALRVGLEVWKTHPKGKICEPHNCGWKRNRKKTARNRTSSAFRGTHEENNRSRV
KTGAVSPQ

>gi568815590r:73705651_73930474|GENSCAN_predicted_CDS_5|387_bp
atggagctgattaaagccaatcagatctttgtagctagaactatccctggggctgctgag
ctgtgtcgttctgctcttaaatcctcatcccgttgtacaaggaaacctatgtctctcttc
cccaaattaagaatacgagaagccttccaggtctgcaaataccagaaagcagctgcgcat
gaagttttatggggccccaaggctctgagagtgggcttggaggtatggaaaacacatccc
aagggaaaaatctgtgaaccgcacaactgtggttggaaacgaaataggaaaaaaactgcg
cggaaccggacgtcatcagcatttcgcggtacacacgaagaaaacaaccgtagtcgggtc
aaaactggagccgtgtctcctcagtaa

>gi568815590r:73705651_73930474|GENSCAN_predicted_peptide_6|702_aa
MGIRELPNFHPYAGKMAHPNSTGTKAPVLWAIPDLAIWLFICILCNKLVNIIKAELRVNR
EESHSALGSQEEVGCPCSPEQSSQSYVTQKETSKEISKGPQKPLGYWLCALQAVGGGEFC
PTRVHVPFSLSDLKQIKVDLGKFSDDPDSLLSVNLLTRTAVLTVRYHLRNPGTVCNQTLK
QLRGFLGSTGFCQLWIPGYNEMARPLYTLIKETQRANTHLGEWEPEAETAFKTLNQAPVQ
TPALSFPTGKNFSLYITERAGIALGVLTQTSGTTPQPVACLRTYVQLAELVVLTRALELG
KKKRINVYTDSKYAYLILHAHAAIWKEREFVTSGGIPIKYHKDIMELLHAVQKPKEVAVL
QCQSHQKGEEEKAEGNCWADAEAKIAAKRNLPLEIPTEGPLVWNNPLQEIKLQYSPTKTE
WGISWGHSFLPSGWLMTEEGKVFIPEASQWKILKSLHQTFHMGIENTHQRAKSLIIGPNL
LQTIQQVVKAWLGMATATGTRIASLSTSLSYYHTRSKDFSDSLQEITKSILTLQSQIDSL
AAVTLQNHQGLYLLAAEKGGLCTFLGEECCFYTNQSGIVQDAAWHLQEKASEIRQCLSNS
YTNLWSWATWLLPFLGPVTAFLLLLAFGPCIFNLLVKFVSSRIEAIKLQMVLQMEPQMSS
THNFYRGPLDQPTGPLTGLESSPLEDTTTAGSLLCPYLAGSS

>gi568815590r:73705651_73930474|GENSCAN_predicted_CDS_6|2109_bp
atggggatcagagaacttccaaattttcatccatatgctgggaagatggcacaccccaac
tccacaggaacaaaggctcctgttctctgggccattccggaccttgccatctggctgttc
atttgtatactttgtaataaattggtaaacataattaaggctgagctgagggtcaacaga
gaggaaagccattcagctctggggtcccaagaagaagttggttgtccctgcagccctgag
cagagctctcaaagttacgtcacccaaaaggaaacaagcaaagaaatctccaagggacca
caaaaacccctgggctattggttatgtgcccttcaagctgtagggggaggggaattttgc
ccaacccgggtacacgtccccttctccctctctgatttaaagcagatcaaggtagatctg
gggaagttttcagatgatcctgatagccttctcagtgttaatctcctgacccggacagct
gtcctcacagtccgttaccatctgaggaatcctgggacagtctgtaaccagacattaaaa
cagttgcgggggttccttggaagcaccggcttttgccaactatggatccctggatacaat
gagatggccaggccactctatactctaatcaaggagacccagagggcaaatactcatcta
ggagaatgggaaccagaggcagaaacagccttcaaaaccttaaatcaggccccagtacaa
actccagccttaagctttcccacaggaaaaaacttctctttatacatcacagagagagca
gggatagctcttggagtccttactcagactagtgggacaaccccacaaccagtggcatgc
ctaaggacctatgtccagttagcagaactagtggtgcttacccgagccttagaactggga
aagaaaaaaagaataaatgtgtatacagatagcaagtatgcttatctaatcctacatgcc
catgctgcaatatggaaagaaagggagttcgtaacctctgggggaatccccattaaatac
cacaaggatatcatggagttattgcacgcagtgcaaaaacccaaggaggtggctgtctta
cagtgccaaagccatcaaaaaggtgaagaagaaaaggcagaaggaaactgttgggcagac
gctgaggccaaaattgctgccaagcggaacctcccattagaaatacctacggaaggaccc
ttggtatggaacaaccctctccaagagattaagctgcagtattccccgaccaaaacagaa
tggggaatttcatggggccatagttttctcccctctgggtggttaatgacagaagaggga
aaggtattcatacctgaagccagccagtggaaaatacttaagtccctccaccaaactttt
catatgggtattgagaatactcatcaaagggccaaatccctaattatagggccaaatctc
ctccagaccatccagcaagtagtcaaagcctggttaggaatggccactgctacaggaacc
agaatagccagtttatctacttcactatcctactaccacacacgctcaaaggatttctca
gacagtttgcaagaaataacaaaatctatccttactctacaatcccaaatagactctttg
gcagcagtgactctccaaaaccaccaaggcctatacctcctcgctgctgagaaaggagga
ctttgcaccttcttaggggaagagtgttgtttttacactaaccagtcagggatagtgcaa
gacgctgcctggcatttacaggaaaaggcttctgaaatcagacaatgcctttcaaactct
tataccaacctctggagttgggcaacatggcttctcccctttctaggtcccgtgacagcc
ttcttgctattactcgcctttgggccctgtatttttaacctccttgtcaaatttgtttcc
tccaggattgaggccatcaagctacagatggtcttacaaatggaaccccaaatgagctca
actcacaacttctaccgaggacccctggatcaacccactggccctttgactggcctagag
agttcccctctggaggacactacaactgcagggtcccttctttgcccctatctagcagga
agtagctag

>gi568815590r:73705651_73930474|GENSCAN_predicted_peptide_7|141_aa
XVNFGTFQLNPCRVLKDSPAVPPRCPCSLRAALAPSQPAAAPAACTPSSTMTDQAFVTLT
TNDAYTKGVLALGSSLKQHRTTKRLVILTTPQYSKCVFMDADTLVLANIDDLFEREELSA
GPDPRWPDCFNSEVFVYQPSV

>gi568815590r:73705651_73930474|GENSCAN_predicted_CDS_7|426_bp
nnagttaactttggcaccttccagttgaacccctgcagagttttaaaggatagtcccgct
gtgcctcctcgctgcccttgctccctccgtgctgcccttgctccctcccaacctgcggct
gccccggctgcctgcacccccagcagcaccatgacagatcaggcctttgtgacactgacc
acgaatgatgcctacaccaaaggtgtcctggccctggggtcatctctgaaacagcacagg
accaccaagagactggtcatactcaccacccctcagtattcaaaatgtgtatttatggat
gcggatactctggtcctagcaaatattgatgatctttttgagagagaagaattgtcagca
ggaccagacccaaggtggcctgactgcttcaattccgaagtcttcgtttatcagccttca
gtttaa