GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:47:18 Sequence gi568815595r:15311259_15526511 : 215253 bp : 44.90% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 13159 13236 78 0 0 73 77 86 0.234 5.75 1.02 Term + 18433 18477 45 0 0 115 54 21 0.101 -1.39 1.03 PlyA + 18757 18762 6 1.05 2.03 PlyA - 19108 19103 6 1.05 2.02 Term - 19308 19210 99 0 0 111 41 121 0.581 7.83 2.01 Init - 21150 21013 138 0 0 70 91 286 0.268 25.24 2.00 Prom - 30161 30122 40 -4.26 3.00 Prom + 36877 36916 40 -3.36 3.01 Init + 40726 40773 48 0 0 77 78 57 0.504 4.72 3.02 Term + 61468 61608 141 0 0 66 44 152 0.826 6.53 3.03 PlyA + 61625 61630 6 -0.45 4.03 PlyA - 62844 62839 6 1.05 4.02 Term - 64426 64115 312 1 0 42 45 409 0.310 27.00 4.01 Init - 75741 75739 3 0 0 108 81 0 0.405 1.30 4.00 Prom - 80725 80686 40 -2.46 5.02 PlyA - 81486 81481 6 1.05 5.01 Sngl - 85352 84963 390 2 0 88 54 284 0.951 21.22 5.00 Prom - 95167 95128 40 -4.96 6.05 PlyA - 95765 95760 6 1.05 6.04 Term - 100179 99998 182 1 2 71 43 107 0.804 2.27 6.03 Intr - 102904 102763 142 1 1 79 95 22 0.889 1.93 6.02 Intr - 104684 104514 171 2 0 95 96 101 0.982 11.74 6.01 Init - 112754 112689 66 2 0 79 80 25 0.239 -0.13 6.00 Prom - 114304 114265 40 -5.56 7.00 Prom + 115150 115189 40 -4.36 7.01 Init + 116255 116384 130 1 1 64 63 48 0.453 -0.40 7.02 Intr + 116418 116624 207 1 0 91 81 154 0.527 14.05 7.03 Intr + 118655 118749 95 0 2 114 76 53 0.997 6.38 7.04 Intr + 120829 120965 137 0 2 39 95 120 0.966 7.17 7.05 Intr + 123090 123280 191 0 2 121 65 172 0.999 17.43 7.06 Intr + 125084 125317 234 0 0 80 89 209 0.993 17.86 7.07 Term + 129305 129333 29 0 2 115 43 -7 0.240 -4.26 7.08 PlyA + 131331 131336 6 1.05 8.15 PlyA - 132136 132131 6 1.05 8.14 Term - 140455 140386 70 0 1 81 55 170 0.888 10.41 8.13 Intr - 142673 142571 103 0 1 106 45 143 0.998 11.03 8.12 Intr - 144761 144641 121 2 1 67 75 96 0.954 6.27 8.11 Intr - 145321 145202 120 1 0 49 80 152 0.878 11.19 8.10 Intr - 147067 146928 140 2 2 74 79 50 0.648 2.88 8.09 Intr - 155179 155083 97 1 1 91 110 34 0.852 5.48 8.08 Intr - 159358 159278 81 1 0 133 110 63 0.993 12.83 8.07 Intr - 162777 162742 36 0 0 91 106 6 0.653 1.16 8.06 Intr - 163014 162970 45 0 0 106 90 37 0.922 4.31 8.05 Intr - 163693 163667 27 1 0 109 96 -1 0.662 1.11 8.04 Intr - 164229 164167 63 0 0 83 86 48 0.831 3.01 8.03 Intr - 165939 165868 72 0 0 70 110 1 0.472 0.10 8.02 Intr - 167745 167700 46 2 1 111 65 43 0.866 2.71 8.01 Init - 170740 170337 404 1 2 60 39 173 0.501 5.70 8.00 Prom - 172663 172624 40 -4.96 9.08 PlyA - 172789 172784 6 -0.45 9.07 Term - 173489 173002 488 0 2 17 38 297 0.980 12.46 9.06 Intr - 173942 173780 163 2 1 80 71 47 0.974 1.75 9.05 Intr - 176512 176354 159 1 0 51 85 45 0.458 0.68 9.04 Intr - 177049 176948 102 1 0 48 77 57 0.171 0.97 9.03 Intr - 178379 178267 113 0 2 34 117 54 0.235 3.00 9.02 Intr - 182338 182168 171 2 0 0 100 82 0.137 0.51 9.01 Init - 187329 187254 76 0 1 84 92 59 0.508 5.16 9.00 Prom - 196580 196541 40 0.04 10.03 PlyA - 196654 196649 6 1.05 10.02 Term - 201163 201003 161 2 2 82 48 45 0.048 -2.00 10.01 Init - 210367 210262 106 1 1 95 99 36 0.491 5.37 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:15311259_15526511|GENSCAN_predicted_peptide_1|40_aa CPTQDLLQEGLQANVYFNQISPVYKKKQTLPTAPCIGKLR >gi568815595r:15311259_15526511|GENSCAN_predicted_CDS_1|123_bp tgccccacacaggatttgctgcaggaaggcctccaggccaatgtttacttcaaccagatc tcacctgtgtacaagaagaaacagactttgccaacagctccctgcattgggaaactgcgc tga >gi568815595r:15311259_15526511|GENSCAN_predicted_peptide_2|78_aa MDAALKRSRSEEPAEILPPARDEEEEEEEGMEQGLEEEEEVDPRIQGELEKLNQSTDDIN RRETELEVQRAEFLFLVK >gi568815595r:15311259_15526511|GENSCAN_predicted_CDS_2|237_bp atggacgcggcactgaagcggagccgctcggaggagccagccgaaatcctgccgcctgcc cgggacgaggaggaggaggaggaagaggggatggagcaggggctggaggaggaagaagag gtggatccccggatccagggagaactggagaagttaaatcagtccacggatgatatcaac agacgggagactgaacttgaggtacagagagctgagttcctgttcttggtgaagtga >gi568815595r:15311259_15526511|GENSCAN_predicted_peptide_3|62_aa MVDALAACTLYLEKPQAAKSPPVHAIETATRAIGALTYPEPPLLPSPQYPTIARPTIAYY TY >gi568815595r:15311259_15526511|GENSCAN_predicted_CDS_3|189_bp atggtggatgcactggcagcttgcaccctgtacctggaaaagccacaggcagccaagagc ccacctgttcatgctattgaaacagccaccagagccattggggcccttacttacccagag ccacccctgctgccctcgccccagtaccctaccattgcaaggcccacaattgcctactat acctactga >gi568815595r:15311259_15526511|GENSCAN_predicted_peptide_4|104_aa MHLRPAAVAIAAATMPKRKAEGDAKGDKAKVKDEPQRRSARLSAKPAPPKPEPKPKKASA KKGEKVPKGKKGKADAGKEGNNPAENGDAKTDQSQKAEGAGDAK >gi568815595r:15311259_15526511|GENSCAN_predicted_CDS_4|315_bp atgcacctacgtcccgccgccgtcgccatcgccgcggccaccatgcccaagagaaaggct gaaggagatgctaaaggagataaagccaaggtgaaggacgaaccgcagagaagatccgct aggttgtctgctaaacctgctcctccaaagccagagcccaagcctaaaaaggcctctgca aagaagggagagaaggtacccaaagggaaaaagggaaaggctgatgctggcaaggagggg aataaccctgcagaaaatggagatgccaaaacagaccagtcacagaaagctgaaggtgct ggagatgccaagtga >gi568815595r:15311259_15526511|GENSCAN_predicted_peptide_5|129_aa MGKKQSRKTENSKNQSASPPPKERSSSPAMEQSWTENDFDELREEGFRQSNYSELKEEVR THGKEVKNLEKRLDEWLTRITSVEKPLNDLMELKTMAQELRDKCTSLSSRFNQLEERVSV MEDQMNEMK >gi568815595r:15311259_15526511|GENSCAN_predicted_CDS_5|390_bp atggggaaaaaacagagcagaaaaactgaaaattctaaaaatcagagcgcctctcctcct ccaaaggaacgcagctcctcaccagcaatggaacaaagctggacggagaatgactttgat gagttgagagaagaaggcttcagacaatcaaactactctgagctaaaggaggaagttcga acccatggcaaagaagttaaaaaccttgaaaaaagattagacgaatggctaactagaata accagcgtagagaagcccttaaatgacctgatggagctgaaaaccatggcacaagaacta cgtgacaaatgcacaagcctcagtagccgattcaatcaactggaagaaagggtatcagtg atggaagatcaaatgaatgaaatgaaatga >gi568815595r:15311259_15526511|GENSCAN_predicted_peptide_6|186_aa MVSNSWAQVINLPQPPKMLGLQQNPLYDTERCKVFQCDLTKDDLLDHVPPESVDVVMLIF VLSAVHPDKMHLVLQNIYKVLKPGKSVLFRDYGLYDHAMLRFKASSKLGENFYVRQDGTR SYFFTDDFLAQLFMDTGYEEVVNEYVFRETVNKKEGLCVPRVFLQSKFLKPPKNPSPVVL GLDPKS >gi568815595r:15311259_15526511|GENSCAN_predicted_CDS_6|561_bp atggtctcaaactcctgggctcaggtgatcaacctgcctcagcctcccaaaatgctggga ttacagcaaaatcctttatatgatacagaaagatgcaaggtattccagtgtgatctgact aaagatgatcttctggatcatgtaccgccagagtctgtggatgttgttatgttgatattt gtgctgtcagctgttcatcctgataagatgcaccttgtcttacaaaacatttacaaggta ttaaaaccaggcaaaagtgtcttgtttcgtgactacggactgtatgatcatgccatgctt aggtttaaagccagcagcaaacttggagaaaacttttatgttagacaagatgggaccaga tcatatttttttactgatgacttcctggctcagctctttatggacacaggttatgaagaa gtggtaaacgagtatgtgtttcgagagacggtgaataaaaaagaaggcctgtgtgtgcca agagttttccttcagagcaaatttctaaagcctcctaagaacccatctcctgtggtcctg ggcctggatcctaagtcctga >gi568815595r:15311259_15526511|GENSCAN_predicted_peptide_7|340_aa MAPPPPLPLGTRKFLPDAGSPTALPPQIPLSPPRRGENLLLDPERWPGSTVLPDASARSS DRGCTGRAPIWVRGRCGAMNGTANPLLDREEHCLRLGESFEKRPRASFHTIRYDFKPASI DTSCEGELQVGKGDEVTITLPHIPGSTPPMTVFKGNKRPYQKDCVLIINHDTGEYVLEKL SSSIQVKKTRAEGSSKIQARMEQQPTRPPQTSQPPPPPPPMPFRAPTKPPVGPKTSPLKD NPSPEPQLDDIKRELRAEVDIIEQMSSSSGSSSSDSESSSGSDDDSSSSGGEDNGPASPP QPSHQQPYNSRPAVANGTSRPQGSNQLMNTLTSGGEAVWT >gi568815595r:15311259_15526511|GENSCAN_predicted_CDS_7|1023_bp atggcaccgcccccgccacttccgctaggaacccggaagttcctacccgacgccggaagt cccacggccttgcctcctcagattcctctctcacccccacgcagaggagagaacttgctt ctggacccggagcggtggcccggaagcacagtcctcccagacgccagcgccagaagctcg gatcgcggctgcaccgggagagcgccgatctgggtgcgaggcaggtgcggggccatgaat gggaccgcaaacccgctgctggaccgcgaggaacattgcctgaggctcggggagagcttc gagaagcggccgcgggcctccttccacactattcgttatgattttaaaccagcatctata gacacttcctgtgaaggagagcttcaagttggcaaaggagatgaagtcacaattacactg ccacatatccctggatccacaccacccatgactgtgttcaaggggaacaaacggccttac cagaaagactgtgtgcttattattaatcatgacactggtgaatatgtgctggaaaaactc agtagcagcattcaggtgaagaaaacaagagctgagggcagcagtaaaatccaggcccga atggaacagcagcccactcgtcctccacagacgtcacagccaccaccacctccaccacct atgccattcagagctccaacgaagcctccagttggacccaaaacttctcccttgaaagat aacccctcacctgaacctcagttggatgacatcaaaagagagctgagggctgaagttgac attattgaacaaatgagcagcagcagtgggagcagctcttcagactctgagagctcttcg ggaagtgatgacgatagctccagcagtggaggcgaggacaatggcccagcctctcctccg cagccttcacaccagcagccctacaacagtaggcctgccgttgccaatggaaccagccgg ccacaaggaagcaaccagctcatgaacaccctcactagtggaggggaagcagtctggact tag >gi568815595r:15311259_15526511|GENSCAN_predicted_peptide_8|474_aa MSELPFTIASKRIKYLGIKLTRDVKDLFKENCKPLLNEMKENRNKWKNIPCSWIGRINIV KMAILPKVIYRFSAIPIKLPMTFFTELEKTTLNFIWNQKRDHIAKTILSQKKKAGGITLP DFKLYYKATVTKTAWGSLADQEGRYGLRCSGRPGPPGVPGMPGPIGWPGPEGPRGEKGDL GMMGLPGSRGPMGSKGYPGSRGEKGSRGEKGDLGPKGEKGFPGFPGMLGQKGEMGPKGEP GIAGHRGPTGRPGKRGKQGQKGDSGVMGPPGKPGPSGQPGRPGPPGPPPAGQLIMGPKGE RGFPGPPGRCLCGPTMNVNNPSYGESVYGPSSPRVPVIFVVNNQEELERLNTQNAIAFRR DQRSLYFKDSLGWLPIQLTPFYPVDYTADQHGTCGDGLLQPGEECDDGNSDVGDDCIRCH RAYCGDGHRHEGVEDCDGSDFGYLTCETYLPGSYGDLQCTQYCYIDSTPCRYFT >gi568815595r:15311259_15526511|GENSCAN_predicted_CDS_8|1425_bp atgagtgagctcccattcacaattgcttcaaagagaataaaatacctaggaatcaaactt acaagggatgtgaaggacctcttcaaggagaactgcaaaccactgctcaatgaaatgaaa gagaacagaaacaaatggaagaacattccatgctcatggataggaagaatcaatattgtg aaaatggccatactgcccaaggtaatttatagattcagtgccatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttaaacttcatatggaaccaaaaaaga gaccacattgccaagacaatcttaagccaaaagaagaaagctggaggcatcacactacct gacttcaaactatactacaaggctacagtaaccaagacagcatgggggagcttggccgac caggaaggaaggtatggtctgcgctgttctggtagacctggccccccaggtgttcctggc atgcctgggcccatcggttggccaggccctgaaggacccaggggtgaaaaaggtgacctg ggtatgatgggcttgccagggtcaagaggaccaatgggctccaagggctaccctggatcc agaggggaaaagggatccagaggtgaaaagggtgacctgggtcccaaaggagaaaagggt ttcccaggatttcctggaatgttggggcagaaaggtgaaatgggtccaaaaggtgaacct gggatagcaggacaccgaggacccacaggaagaccaggaaaacgaggcaagcagggacag aaaggggatagtggagttatgggcccaccaggcaagcctgggccttctggtcaacctggc cgtccggggcccccaggccccccacctgcaggacaacttataatgggacccaaaggggaa agaggatttcccgggcctccaggaagatgtctttgtggacccactatgaatgtgaataac ccttcctacggggaatctgtgtatgggcccagttccccgcgagttcctgtgatttttgtg gtcaacaaccaggaggagcttgagaggctgaacacccaaaacgccattgccttccgcaga gaccagagatctctgtacttcaaggacagccttggctggctccccatccagctgacccct ttctaccctgtggattacactgcagaccagcacggcacctgtggggatgggctcctgcag cctggggaggagtgtgacgacggtaacagcgatgtgggtgacgactgcatccgctgtcac cgtgcctactgtggagatggtcaccggcatgagggtgtggaggactgtgacggctctgac tttggctacctgacatgcgagacctatctccctgggtcatatggagacctgcaatgcacc cagtactgctacatcgactccacgccctgccgctacttcacctga >gi568815595r:15311259_15526511|GENSCAN_predicted_peptide_9|423_aa MTGSSFSLAHLLIISGLLCYSAGCLDPQGDVPDKADGGGKRGEARLGQPSWRDGREDAKE DGMDQMHRKKHKRGDGGQGIEEALPSLDQKKRGGHKACCLLTPPPPPLFPPPFFRGGRSP LLSPDMKNLMLELETSQSPCMQGSLGSPGPPGPQVLLPSSPPLPTKAKGETVKMLQKGFL DKQEYPPETPPHRVLAPLACLQEGAPSPPLVIPRQTGSGVDLQQTPTDLQQRVLTVRRKT NKQKGHLHQNPICTSPSSKTKARELHNTCTSFSSQFNHVEERVSVIEDQMNEMKREEKFR EKRVRRNEQSLQEIWDYVKRPNLRLIGVPESDGENGIKLENTLQDIIQEYFPNLARQANI QIEEIQRTPQRYSSRRACLSRVPEGSTKHGKEQLVPATTKSYQTVKTINTMKKLHQLMSK ITS >gi568815595r:15311259_15526511|GENSCAN_predicted_CDS_9|1272_bp atgacgggctcatcattcagcctcgctcatctgctcatcatttcaggactgctctgttac tcggcaggctgcttggacccacaaggggatgttccagacaaagcagatgggggaggtaaa agaggtgaggcaagactgggtcagccgtcatggagagatgggcgtgaagacgccaaagag gatggaatggatcagatgcacagaaagaagcataaaaggggagatgggggacaggggatc gaggaagcccttcccagcctggatcagaagaagcgtggtggccacaaagcatgctgcctg ctgacgcctcctccaccaccactgttcccaccaccattcttcagaggtggccgaagtccg cttctctccccagacatgaagaatctcatgctggaactggagacctcgcagtccccgtgc atgcaaggctcgctaggctcccctgggcctcccggcccccaggtattgctaccaagcagc ccacccctccccaccaaagctaaaggggaaacagtcaaaatgctgcagaaaggatttcta gacaaacaagagtaccctccagagaccccaccccacagagtcctggcccctctggcctgc ctccaagagggtgctccatcgcctccactggtgatacccaggcaaacagggtctggagtg gacctccagcaaactccaacagacctgcagcagagggtcctgactgttagaaggaaaact aacaaacagaaaggacatctacaccaaaatcccatctgtacatcaccatcatcaaagacc aaagcacgagaactacacaacacatgcacaagcttcagtagccaattcaatcatgtggaa gaaagggtatcagtgattgaagatcaaatgaatgaaatgaagcgagaagagaagtttaga gaaaaaagagtaagaagaaatgaacaaagcctccaagaaatatgggactatgtgaaaaga ccaaatctacgtctgattggtgtacctgaaagtgacggggagaatggaatcaagttggaa aacactcttcaggatattatccaggagtacttccccaacctagcaaggcaggccaacatt caaattgaggaaatacagagaacgccacaaagatactcctcaagaagagcctgcctttca agagttcctgaaggaagcactaaacatggaaaggaacaactggtaccagccaccacaaaa tcataccaaacggtaaagaccatcaacaccatgaagaaactgcatcaactaatgagcaaa ataacaagctaa >gi568815595r:15311259_15526511|GENSCAN_predicted_peptide_10|88_aa MVVLNPMTLGIYLQLFFLSIVSQPTFINSVLPISAAQEHLAPFKPALISHGNQHKHRHGA RVWELVTAEGEASWLLQALHHQPAAGCK >gi568815595r:15311259_15526511|GENSCAN_predicted_CDS_10|267_bp atggttgtcctgaatccaatgactttgggaatttatcttcagcttttcttcctctctatc gtgtctcagccgactttcatcaacagcgttcttccaatctcagcagcccaggagcattta gctccattcaagcccgcacttatctctcatggcaaccagcataaacacaggcatggggcg cgtgtttgggagctggtgacagctgaaggagaggcctcgtggctgctacaggctttgcac catcaaccagctgcgggctgcaaatga