GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:16:47 Sequence gi568815595r:15275376_15475645 : 200270 bp : 45.10% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 PlyA - 1450 1445 6 1.05 1.07 Term - 16528 16458 71 2 2 99 36 81 0.623 2.10 1.06 Intr - 17997 17848 150 1 0 99 78 33 0.428 3.53 1.05 Intr - 25421 25394 28 2 1 126 72 -15 0.318 -1.61 1.04 Intr - 28856 28728 129 2 0 94 110 157 0.971 19.39 1.03 Intr - 30648 30615 34 2 1 134 65 4 0.965 0.93 1.02 Intr - 31902 31752 151 1 1 62 88 71 0.092 3.72 1.01 Init - 36018 35970 49 0 1 86 64 35 0.062 0.01 1.00 Prom - 37260 37221 40 -4.96 2.00 Prom + 37908 37947 40 -4.06 2.01 Init + 44087 44161 75 1 0 91 80 -2 0.363 0.39 2.02 Intr + 49042 49119 78 0 0 73 77 86 0.217 5.75 2.03 Term + 54316 54360 45 0 0 115 54 21 0.093 -1.39 2.04 PlyA + 54640 54645 6 1.05 3.03 PlyA - 54991 54986 6 1.05 3.02 Term - 55191 55093 99 0 0 111 41 121 0.576 7.83 3.01 Init - 57033 56896 138 0 0 70 91 286 0.269 25.24 3.00 Prom - 66044 66005 40 -4.26 4.00 Prom + 72760 72799 40 -3.36 4.01 Init + 76609 76656 48 0 0 77 78 57 0.504 4.72 4.02 Term + 97351 97491 141 0 0 66 44 152 0.826 6.53 4.03 PlyA + 97508 97513 6 -0.45 5.03 PlyA - 98727 98722 6 1.05 5.02 Term - 100309 99998 312 1 0 42 45 409 0.310 27.00 5.01 Init - 111624 111622 3 0 0 108 81 0 0.405 1.30 5.00 Prom - 116608 116569 40 -2.46 6.02 PlyA - 117369 117364 6 1.05 6.01 Sngl - 121235 120846 390 2 0 88 54 284 0.951 21.22 6.00 Prom - 131050 131011 40 -4.96 7.05 PlyA - 131648 131643 6 1.05 7.04 Term - 136062 135881 182 1 2 71 43 107 0.804 2.27 7.03 Intr - 138787 138646 142 1 1 79 95 22 0.889 1.93 7.02 Intr - 140567 140397 171 2 0 95 96 101 0.982 11.74 7.01 Init - 148637 148572 66 2 0 79 80 25 0.239 -0.13 7.00 Prom - 150187 150148 40 -5.56 8.00 Prom + 151033 151072 40 -4.36 8.01 Init + 152138 152267 130 1 1 64 63 48 0.453 -0.40 8.02 Intr + 152301 152507 207 1 0 91 81 154 0.527 14.05 8.03 Intr + 154538 154632 95 0 2 114 76 53 0.997 6.38 8.04 Intr + 156712 156848 137 0 2 39 95 120 0.966 7.17 8.05 Intr + 158973 159163 191 0 2 121 65 172 0.999 17.43 8.06 Intr + 160967 161200 234 0 0 80 89 209 0.993 17.86 8.07 Term + 165188 165216 29 0 2 115 43 -7 0.240 -4.26 8.08 PlyA + 167214 167219 6 1.05 9.12 PlyA - 168019 168014 6 1.05 9.11 Term - 176338 176269 70 0 1 81 55 170 0.888 10.41 9.10 Intr - 178556 178454 103 0 1 106 45 143 0.998 11.03 9.09 Intr - 180644 180524 121 2 1 67 75 96 0.954 6.27 9.08 Intr - 181204 181085 120 1 0 49 80 152 0.878 11.19 9.07 Intr - 182950 182811 140 2 2 74 79 50 0.648 2.88 9.06 Intr - 191062 190966 97 1 1 91 110 34 0.852 5.48 9.05 Intr - 195241 195161 81 1 0 133 110 63 0.993 12.83 9.04 Intr - 198660 198625 36 0 0 91 106 6 0.656 1.16 9.03 Intr - 198897 198853 45 0 0 106 90 37 0.926 4.31 9.02 Intr - 199576 199550 27 1 0 109 96 -1 0.665 1.11 9.01 Intr - 200112 200050 63 0 0 83 86 48 0.707 3.01 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:15275376_15475645|GENSCAN_predicted_peptide_1|203_aa MGFCHVGQAGLELLTSEAAADDPKPGLIREKAEEIRMMMMAQNQGKGGKACVGHGIQARE LRCSHGRCGFVRQMPEGEDARQKFRSVLVEATVKLDELVKKIGKAVEDSKPYWEARRVAR QVGRKKQGRKEPLACLGAVLATGLWVRRSGSVTFFPVALVTFPLGGHRPVPGNCVIPAAQ TLPVVTSMKIYPEESDEIIHAEK >gi568815595r:15275376_15475645|GENSCAN_predicted_CDS_1|612_bp atggggttttgccatgttggccaggctggtctcgaactcctgacctcagaagctgcagcg gatgaccctaagcctggtctgataagggagaaggcagaggaaatcaggatgatgatgatg gcacagaaccaggggaaagggggcaaagcctgtgttggacatgggatccaggctcgggag ctgaggtgctcccatgggaggtgtggctttgtcaggcagatgcctgagggagaggatgct cgtcagaagttccgctctgttctggttgaagcaacggtgaaactggatgaactggtgaag aaaattggcaaagctgtggaagactccaagccctactgggaggcacggagggtggcgagg caggtgggcaggaagaagcagggtagaaaagagccacttgcctgcctgggggcagttctt gccacgggactttgggtgaggaggtctggctcagtgacgttcttccctgtggccctggtc acattccccctgggtggccacaggccagttcctgggaactgcgtcattcctgctgcccaa acccttccagttgtgacttctatgaaaatctaccctgaagaatctgatgaaattattcat gctgagaagtaa >gi568815595r:15275376_15475645|GENSCAN_predicted_peptide_2|65_aa MRVAVFKYGLQFFGTAPGSRRIRMQCPTQDLLQEGLQANVYFNQISPVYKKKQTLPTAPC IGKLR >gi568815595r:15275376_15475645|GENSCAN_predicted_CDS_2|198_bp atgagggttgcagttttcaaatatggcctacaattctttggcactgccccaggatcaaga agaattagaatgcagtgccccacacaggatttgctgcaggaaggcctccaggccaatgtt tacttcaaccagatctcacctgtgtacaagaagaaacagactttgccaacagctccctgc attgggaaactgcgctga >gi568815595r:15275376_15475645|GENSCAN_predicted_peptide_3|78_aa MDAALKRSRSEEPAEILPPARDEEEEEEEGMEQGLEEEEEVDPRIQGELEKLNQSTDDIN RRETELEVQRAEFLFLVK >gi568815595r:15275376_15475645|GENSCAN_predicted_CDS_3|237_bp atggacgcggcactgaagcggagccgctcggaggagccagccgaaatcctgccgcctgcc cgggacgaggaggaggaggaggaagaggggatggagcaggggctggaggaggaagaagag gtggatccccggatccagggagaactggagaagttaaatcagtccacggatgatatcaac agacgggagactgaacttgaggtacagagagctgagttcctgttcttggtgaagtga >gi568815595r:15275376_15475645|GENSCAN_predicted_peptide_4|62_aa MVDALAACTLYLEKPQAAKSPPVHAIETATRAIGALTYPEPPLLPSPQYPTIARPTIAYY TY >gi568815595r:15275376_15475645|GENSCAN_predicted_CDS_4|189_bp atggtggatgcactggcagcttgcaccctgtacctggaaaagccacaggcagccaagagc ccacctgttcatgctattgaaacagccaccagagccattggggcccttacttacccagag ccacccctgctgccctcgccccagtaccctaccattgcaaggcccacaattgcctactat acctactga >gi568815595r:15275376_15475645|GENSCAN_predicted_peptide_5|104_aa MHLRPAAVAIAAATMPKRKAEGDAKGDKAKVKDEPQRRSARLSAKPAPPKPEPKPKKASA KKGEKVPKGKKGKADAGKEGNNPAENGDAKTDQSQKAEGAGDAK >gi568815595r:15275376_15475645|GENSCAN_predicted_CDS_5|315_bp atgcacctacgtcccgccgccgtcgccatcgccgcggccaccatgcccaagagaaaggct gaaggagatgctaaaggagataaagccaaggtgaaggacgaaccgcagagaagatccgct aggttgtctgctaaacctgctcctccaaagccagagcccaagcctaaaaaggcctctgca aagaagggagagaaggtacccaaagggaaaaagggaaaggctgatgctggcaaggagggg aataaccctgcagaaaatggagatgccaaaacagaccagtcacagaaagctgaaggtgct ggagatgccaagtga >gi568815595r:15275376_15475645|GENSCAN_predicted_peptide_6|129_aa MGKKQSRKTENSKNQSASPPPKERSSSPAMEQSWTENDFDELREEGFRQSNYSELKEEVR THGKEVKNLEKRLDEWLTRITSVEKPLNDLMELKTMAQELRDKCTSLSSRFNQLEERVSV MEDQMNEMK >gi568815595r:15275376_15475645|GENSCAN_predicted_CDS_6|390_bp atggggaaaaaacagagcagaaaaactgaaaattctaaaaatcagagcgcctctcctcct ccaaaggaacgcagctcctcaccagcaatggaacaaagctggacggagaatgactttgat gagttgagagaagaaggcttcagacaatcaaactactctgagctaaaggaggaagttcga acccatggcaaagaagttaaaaaccttgaaaaaagattagacgaatggctaactagaata accagcgtagagaagcccttaaatgacctgatggagctgaaaaccatggcacaagaacta cgtgacaaatgcacaagcctcagtagccgattcaatcaactggaagaaagggtatcagtg atggaagatcaaatgaatgaaatgaaatga >gi568815595r:15275376_15475645|GENSCAN_predicted_peptide_7|186_aa MVSNSWAQVINLPQPPKMLGLQQNPLYDTERCKVFQCDLTKDDLLDHVPPESVDVVMLIF VLSAVHPDKMHLVLQNIYKVLKPGKSVLFRDYGLYDHAMLRFKASSKLGENFYVRQDGTR SYFFTDDFLAQLFMDTGYEEVVNEYVFRETVNKKEGLCVPRVFLQSKFLKPPKNPSPVVL GLDPKS >gi568815595r:15275376_15475645|GENSCAN_predicted_CDS_7|561_bp atggtctcaaactcctgggctcaggtgatcaacctgcctcagcctcccaaaatgctggga ttacagcaaaatcctttatatgatacagaaagatgcaaggtattccagtgtgatctgact aaagatgatcttctggatcatgtaccgccagagtctgtggatgttgttatgttgatattt gtgctgtcagctgttcatcctgataagatgcaccttgtcttacaaaacatttacaaggta ttaaaaccaggcaaaagtgtcttgtttcgtgactacggactgtatgatcatgccatgctt aggtttaaagccagcagcaaacttggagaaaacttttatgttagacaagatgggaccaga tcatatttttttactgatgacttcctggctcagctctttatggacacaggttatgaagaa gtggtaaacgagtatgtgtttcgagagacggtgaataaaaaagaaggcctgtgtgtgcca agagttttccttcagagcaaatttctaaagcctcctaagaacccatctcctgtggtcctg ggcctggatcctaagtcctga >gi568815595r:15275376_15475645|GENSCAN_predicted_peptide_8|340_aa MAPPPPLPLGTRKFLPDAGSPTALPPQIPLSPPRRGENLLLDPERWPGSTVLPDASARSS DRGCTGRAPIWVRGRCGAMNGTANPLLDREEHCLRLGESFEKRPRASFHTIRYDFKPASI DTSCEGELQVGKGDEVTITLPHIPGSTPPMTVFKGNKRPYQKDCVLIINHDTGEYVLEKL SSSIQVKKTRAEGSSKIQARMEQQPTRPPQTSQPPPPPPPMPFRAPTKPPVGPKTSPLKD NPSPEPQLDDIKRELRAEVDIIEQMSSSSGSSSSDSESSSGSDDDSSSSGGEDNGPASPP QPSHQQPYNSRPAVANGTSRPQGSNQLMNTLTSGGEAVWT >gi568815595r:15275376_15475645|GENSCAN_predicted_CDS_8|1023_bp atggcaccgcccccgccacttccgctaggaacccggaagttcctacccgacgccggaagt cccacggccttgcctcctcagattcctctctcacccccacgcagaggagagaacttgctt ctggacccggagcggtggcccggaagcacagtcctcccagacgccagcgccagaagctcg gatcgcggctgcaccgggagagcgccgatctgggtgcgaggcaggtgcggggccatgaat gggaccgcaaacccgctgctggaccgcgaggaacattgcctgaggctcggggagagcttc gagaagcggccgcgggcctccttccacactattcgttatgattttaaaccagcatctata gacacttcctgtgaaggagagcttcaagttggcaaaggagatgaagtcacaattacactg ccacatatccctggatccacaccacccatgactgtgttcaaggggaacaaacggccttac cagaaagactgtgtgcttattattaatcatgacactggtgaatatgtgctggaaaaactc agtagcagcattcaggtgaagaaaacaagagctgagggcagcagtaaaatccaggcccga atggaacagcagcccactcgtcctccacagacgtcacagccaccaccacctccaccacct atgccattcagagctccaacgaagcctccagttggacccaaaacttctcccttgaaagat aacccctcacctgaacctcagttggatgacatcaaaagagagctgagggctgaagttgac attattgaacaaatgagcagcagcagtgggagcagctcttcagactctgagagctcttcg ggaagtgatgacgatagctccagcagtggaggcgaggacaatggcccagcctctcctccg cagccttcacaccagcagccctacaacagtaggcctgccgttgccaatggaaccagccgg ccacaaggaagcaaccagctcatgaacaccctcactagtggaggggaagcagtctggact tag >gi568815595r:15275376_15475645|GENSCAN_predicted_peptide_9|300_aa GEKGDLGMMGLPGSRGPMGSKGYPGSRGEKGSRGEKGDLGPKGEKGFPGFPGMLGQKGEM GPKGEPGIAGHRGPTGRPGKRGKQGQKGDSGVMGPPGKPGPSGQPGRPGPPGPPPAGQLI MGPKGERGFPGPPGRCLCGPTMNVNNPSYGESVYGPSSPRVPVIFVVNNQEELERLNTQN AIAFRRDQRSLYFKDSLGWLPIQLTPFYPVDYTADQHGTCGDGLLQPGEECDDGNSDVGD DCIRCHRAYCGDGHRHEGVEDCDGSDFGYLTCETYLPGSYGDLQCTQYCYIDSTPCRYFT >gi568815595r:15275376_15475645|GENSCAN_predicted_CDS_9|903_bp ggtgaaaaaggtgacctgggtatgatgggcttgccagggtcaagaggaccaatgggctcc aagggctaccctggatccagaggggaaaagggatccagaggtgaaaagggtgacctgggt cccaaaggagaaaagggtttcccaggatttcctggaatgttggggcagaaaggtgaaatg ggtccaaaaggtgaacctgggatagcaggacaccgaggacccacaggaagaccaggaaaa cgaggcaagcagggacagaaaggggatagtggagttatgggcccaccaggcaagcctggg ccttctggtcaacctggccgtccggggcccccaggccccccacctgcaggacaacttata atgggacccaaaggggaaagaggatttcccgggcctccaggaagatgtctttgtggaccc actatgaatgtgaataacccttcctacggggaatctgtgtatgggcccagttccccgcga gttcctgtgatttttgtggtcaacaaccaggaggagcttgagaggctgaacacccaaaac gccattgccttccgcagagaccagagatctctgtacttcaaggacagccttggctggctc cccatccagctgacccctttctaccctgtggattacactgcagaccagcacggcacctgt ggggatgggctcctgcagcctggggaggagtgtgacgacggtaacagcgatgtgggtgac gactgcatccgctgtcaccgtgcctactgtggagatggtcaccggcatgagggtgtggag gactgtgacggctctgactttggctacctgacatgcgagacctatctccctgggtcatat ggagacctgcaatgcacccagtactgctacatcgactccacgccctgccgctacttcacc tga