GENSCAN 1.0 Date run: 7-Nov-116 Time: 17:45:37

Sequence gi568815595r:15311346_15526511 : 215166 bp : 44.90% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  13072  13149   78  0  0   73   77    86 0.234   5.75
 1.02 Term +  18346  18390   45  0  0  115   54    21 0.101  -1.39
 1.03 PlyA +  18670  18675    6                               1.05

 2.03 PlyA -  19021  19016    6                               1.05
 2.02 Term -  19221  19123   99  0  0  111   41   121 0.581   7.83
 2.01 Init -  21063  20926  138  0  0   70   91   286 0.268  25.24
 2.00 Prom -  30074  30035   40                              -4.26

 3.00 Prom +  36790  36829   40                              -3.36
 3.01 Init +  40639  40686   48  0  0   77   78    57 0.504   4.72
 3.02 Term +  61381  61521  141  0  0   66   44   152 0.826   6.53
 3.03 PlyA +  61538  61543    6                              -0.45

 4.03 PlyA -  62757  62752    6                               1.05
 4.02 Term -  64339  64028  312  1  0   42   45   409 0.310  27.00
 4.01 Init -  75654  75652    3  0  0  108   81     0 0.405   1.30
 4.00 Prom -  80638  80599   40                              -2.46

 5.02 PlyA -  81399  81394    6                               1.05
 5.01 Sngl -  85265  84876  390  2  0   88   54   284 0.951  21.22
 5.00 Prom -  95080  95041   40                              -4.96

 6.05 PlyA -  95678  95673    6                               1.05
 6.04 Term - 100092  99911  182  1  2   71   43   107 0.804   2.27
 6.03 Intr - 102817 102676  142  1  1   79   95    22 0.889   1.93
 6.02 Intr - 104597 104427  171  2  0   95   96   101 0.982  11.74
 6.01 Init - 112667 112602   66  2  0   79   80    25 0.239  -0.13
 6.00 Prom - 114217 114178   40                              -5.56

 7.00 Prom + 115063 115102   40                              -4.36
 7.01 Init + 116168 116297  130  1  1   64   63    48 0.453  -0.40
 7.02 Intr + 116331 116537  207  1  0   91   81   154 0.527  14.05
 7.03 Intr + 118568 118662   95  0  2  114   76    53 0.997   6.38
 7.04 Intr + 120742 120878  137  0  2   39   95   120 0.966   7.17
 7.05 Intr + 123003 123193  191  0  2  121   65   172 0.999  17.43
 7.06 Intr + 124997 125230  234  0  0   80   89   209 0.993  17.86
 7.07 Term + 129218 129246   29  0  2  115   43    -7 0.240  -4.26
 7.08 PlyA + 131244 131249    6                               1.05

 8.15 PlyA - 132049 132044    6                               1.05
 8.14 Term - 140368 140299   70  0  1   81   55   170 0.888  10.41
 8.13 Intr - 142586 142484  103  0  1  106   45   143 0.998  11.03
 8.12 Intr - 144674 144554  121  2  1   67   75    96 0.954   6.27
 8.11 Intr - 145234 145115  120  1  0   49   80   152 0.878  11.19
 8.10 Intr - 146980 146841  140  2  2   74   79    50 0.648   2.88
 8.09 Intr - 155092 154996   97  1  1   91  110    34 0.852   5.48
 8.08 Intr - 159271 159191   81  1  0  133  110    63 0.993  12.83
 8.07 Intr - 162690 162655   36  0  0   91  106     6 0.653   1.16
 8.06 Intr - 162927 162883   45  0  0  106   90    37 0.922   4.31
 8.05 Intr - 163606 163580   27  1  0  109   96    -1 0.662   1.11
 8.04 Intr - 164142 164080   63  0  0   83   86    48 0.831   3.01
 8.03 Intr - 165852 165781   72  0  0   70  110     1 0.472   0.10
 8.02 Intr - 167658 167613   46  2  1  111   65    43 0.866   2.71
 8.01 Init - 170653 170250  404  1  2   60   39   173 0.501   5.70
 8.00 Prom - 172576 172537   40                              -4.96

 9.08 PlyA - 172702 172697    6                              -0.45
 9.07 Term - 173402 172915  488  0  2   17   38   297 0.980  12.46
 9.06 Intr - 173855 173693  163  2  1   80   71    47 0.974   1.75
 9.05 Intr - 176425 176267  159  1  0   51   85    45 0.458   0.68
 9.04 Intr - 176962 176861  102  1  0   48   77    57 0.171   0.97
 9.03 Intr - 178292 178180  113  0  2   34  117    54 0.235   3.00
 9.02 Intr - 182251 182081  171  2  0    0  100    82 0.137   0.51
 9.01 Init - 187242 187167   76  0  1   84   92    59 0.508   5.16
 9.00 Prom - 196493 196454   40                               0.04

10.03 PlyA - 196567 196562    6                               1.05
10.02 Term - 201076 200916  161  2  2   82   48    45 0.048  -2.00
10.01 Init - 210280 210175  106  1  1   95   99    36 0.491   5.37

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:15311346_15526511|GENSCAN_predicted_peptide_1|40_aa
CPTQDLLQEGLQANVYFNQISPVYKKKQTLPTAPCIGKLR

>gi568815595r:15311346_15526511|GENSCAN_predicted_CDS_1|123_bp
tgccccacacaggatttgctgcaggaaggcctccaggccaatgtttacttcaaccagatc
tcacctgtgtacaagaagaaacagactttgccaacagctccctgcattgggaaactgcgc
tga

>gi568815595r:15311346_15526511|GENSCAN_predicted_peptide_2|78_aa
MDAALKRSRSEEPAEILPPARDEEEEEEEGMEQGLEEEEEVDPRIQGELEKLNQSTDDIN
RRETELEVQRAEFLFLVK

>gi568815595r:15311346_15526511|GENSCAN_predicted_CDS_2|237_bp
atggacgcggcactgaagcggagccgctcggaggagccagccgaaatcctgccgcctgcc
cgggacgaggaggaggaggaggaagaggggatggagcaggggctggaggaggaagaagag
gtggatccccggatccagggagaactggagaagttaaatcagtccacggatgatatcaac
agacgggagactgaacttgaggtacagagagctgagttcctgttcttggtgaagtga

>gi568815595r:15311346_15526511|GENSCAN_predicted_peptide_3|62_aa
MVDALAACTLYLEKPQAAKSPPVHAIETATRAIGALTYPEPPLLPSPQYPTIARPTIAYY
TY

>gi568815595r:15311346_15526511|GENSCAN_predicted_CDS_3|189_bp
atggtggatgcactggcagcttgcaccctgtacctggaaaagccacaggcagccaagagc
ccacctgttcatgctattgaaacagccaccagagccattggggcccttacttacccagag
ccacccctgctgccctcgccccagtaccctaccattgcaaggcccacaattgcctactat
acctactga

>gi568815595r:15311346_15526511|GENSCAN_predicted_peptide_4|104_aa
MHLRPAAVAIAAATMPKRKAEGDAKGDKAKVKDEPQRRSARLSAKPAPPKPEPKPKKASA
KKGEKVPKGKKGKADAGKEGNNPAENGDAKTDQSQKAEGAGDAK

>gi568815595r:15311346_15526511|GENSCAN_predicted_CDS_4|315_bp
atgcacctacgtcccgccgccgtcgccatcgccgcggccaccatgcccaagagaaaggct
gaaggagatgctaaaggagataaagccaaggtgaaggacgaaccgcagagaagatccgct
aggttgtctgctaaacctgctcctccaaagccagagcccaagcctaaaaaggcctctgca
aagaagggagagaaggtacccaaagggaaaaagggaaaggctgatgctggcaaggagggg
aataaccctgcagaaaatggagatgccaaaacagaccagtcacagaaagctgaaggtgct
ggagatgccaagtga

>gi568815595r:15311346_15526511|GENSCAN_predicted_peptide_5|129_aa
MGKKQSRKTENSKNQSASPPPKERSSSPAMEQSWTENDFDELREEGFRQSNYSELKEEVR
THGKEVKNLEKRLDEWLTRITSVEKPLNDLMELKTMAQELRDKCTSLSSRFNQLEERVSV
MEDQMNEMK

>gi568815595r:15311346_15526511|GENSCAN_predicted_CDS_5|390_bp
atggggaaaaaacagagcagaaaaactgaaaattctaaaaatcagagcgcctctcctcct
ccaaaggaacgcagctcctcaccagcaatggaacaaagctggacggagaatgactttgat
gagttgagagaagaaggcttcagacaatcaaactactctgagctaaaggaggaagttcga
acccatggcaaagaagttaaaaaccttgaaaaaagattagacgaatggctaactagaata
accagcgtagagaagcccttaaatgacctgatggagctgaaaaccatggcacaagaacta
cgtgacaaatgcacaagcctcagtagccgattcaatcaactggaagaaagggtatcagtg
atggaagatcaaatgaatgaaatgaaatga

>gi568815595r:15311346_15526511|GENSCAN_predicted_peptide_6|186_aa
MVSNSWAQVINLPQPPKMLGLQQNPLYDTERCKVFQCDLTKDDLLDHVPPESVDVVMLIF
VLSAVHPDKMHLVLQNIYKVLKPGKSVLFRDYGLYDHAMLRFKASSKLGENFYVRQDGTR
SYFFTDDFLAQLFMDTGYEEVVNEYVFRETVNKKEGLCVPRVFLQSKFLKPPKNPSPVVL
GLDPKS

>gi568815595r:15311346_15526511|GENSCAN_predicted_CDS_6|561_bp
atggtctcaaactcctgggctcaggtgatcaacctgcctcagcctcccaaaatgctggga
ttacagcaaaatcctttatatgatacagaaagatgcaaggtattccagtgtgatctgact
aaagatgatcttctggatcatgtaccgccagagtctgtggatgttgttatgttgatattt
gtgctgtcagctgttcatcctgataagatgcaccttgtcttacaaaacatttacaaggta
ttaaaaccaggcaaaagtgtcttgtttcgtgactacggactgtatgatcatgccatgctt
aggtttaaagccagcagcaaacttggagaaaacttttatgttagacaagatgggaccaga
tcatatttttttactgatgacttcctggctcagctctttatggacacaggttatgaagaa
gtggtaaacgagtatgtgtttcgagagacggtgaataaaaaagaaggcctgtgtgtgcca
agagttttccttcagagcaaatttctaaagcctcctaagaacccatctcctgtggtcctg
ggcctggatcctaagtcctga

>gi568815595r:15311346_15526511|GENSCAN_predicted_peptide_7|340_aa
MAPPPPLPLGTRKFLPDAGSPTALPPQIPLSPPRRGENLLLDPERWPGSTVLPDASARSS
DRGCTGRAPIWVRGRCGAMNGTANPLLDREEHCLRLGESFEKRPRASFHTIRYDFKPASI
DTSCEGELQVGKGDEVTITLPHIPGSTPPMTVFKGNKRPYQKDCVLIINHDTGEYVLEKL
SSSIQVKKTRAEGSSKIQARMEQQPTRPPQTSQPPPPPPPMPFRAPTKPPVGPKTSPLKD
NPSPEPQLDDIKRELRAEVDIIEQMSSSSGSSSSDSESSSGSDDDSSSSGGEDNGPASPP
QPSHQQPYNSRPAVANGTSRPQGSNQLMNTLTSGGEAVWT

>gi568815595r:15311346_15526511|GENSCAN_predicted_CDS_7|1023_bp
atggcaccgcccccgccacttccgctaggaacccggaagttcctacccgacgccggaagt
cccacggccttgcctcctcagattcctctctcacccccacgcagaggagagaacttgctt
ctggacccggagcggtggcccggaagcacagtcctcccagacgccagcgccagaagctcg
gatcgcggctgcaccgggagagcgccgatctgggtgcgaggcaggtgcggggccatgaat
gggaccgcaaacccgctgctggaccgcgaggaacattgcctgaggctcggggagagcttc
gagaagcggccgcgggcctccttccacactattcgttatgattttaaaccagcatctata
gacacttcctgtgaaggagagcttcaagttggcaaaggagatgaagtcacaattacactg
ccacatatccctggatccacaccacccatgactgtgttcaaggggaacaaacggccttac
cagaaagactgtgtgcttattattaatcatgacactggtgaatatgtgctggaaaaactc
agtagcagcattcaggtgaagaaaacaagagctgagggcagcagtaaaatccaggcccga
atggaacagcagcccactcgtcctccacagacgtcacagccaccaccacctccaccacct
atgccattcagagctccaacgaagcctccagttggacccaaaacttctcccttgaaagat
aacccctcacctgaacctcagttggatgacatcaaaagagagctgagggctgaagttgac
attattgaacaaatgagcagcagcagtgggagcagctcttcagactctgagagctcttcg
ggaagtgatgacgatagctccagcagtggaggcgaggacaatggcccagcctctcctccg
cagccttcacaccagcagccctacaacagtaggcctgccgttgccaatggaaccagccgg
ccacaaggaagcaaccagctcatgaacaccctcactagtggaggggaagcagtctggact
tag

>gi568815595r:15311346_15526511|GENSCAN_predicted_peptide_8|474_aa
MSELPFTIASKRIKYLGIKLTRDVKDLFKENCKPLLNEMKENRNKWKNIPCSWIGRINIV
KMAILPKVIYRFSAIPIKLPMTFFTELEKTTLNFIWNQKRDHIAKTILSQKKKAGGITLP
DFKLYYKATVTKTAWGSLADQEGRYGLRCSGRPGPPGVPGMPGPIGWPGPEGPRGEKGDL
GMMGLPGSRGPMGSKGYPGSRGEKGSRGEKGDLGPKGEKGFPGFPGMLGQKGEMGPKGEP
GIAGHRGPTGRPGKRGKQGQKGDSGVMGPPGKPGPSGQPGRPGPPGPPPAGQLIMGPKGE
RGFPGPPGRCLCGPTMNVNNPSYGESVYGPSSPRVPVIFVVNNQEELERLNTQNAIAFRR
DQRSLYFKDSLGWLPIQLTPFYPVDYTADQHGTCGDGLLQPGEECDDGNSDVGDDCIRCH
RAYCGDGHRHEGVEDCDGSDFGYLTCETYLPGSYGDLQCTQYCYIDSTPCRYFT

>gi568815595r:15311346_15526511|GENSCAN_predicted_CDS_8|1425_bp
atgagtgagctcccattcacaattgcttcaaagagaataaaatacctaggaatcaaactt
acaagggatgtgaaggacctcttcaaggagaactgcaaaccactgctcaatgaaatgaaa
gagaacagaaacaaatggaagaacattccatgctcatggataggaagaatcaatattgtg
aaaatggccatactgcccaaggtaatttatagattcagtgccatccccatcaagctacca
atgactttcttcacagaattggaaaaaactactttaaacttcatatggaaccaaaaaaga
gaccacattgccaagacaatcttaagccaaaagaagaaagctggaggcatcacactacct
gacttcaaactatactacaaggctacagtaaccaagacagcatgggggagcttggccgac
caggaaggaaggtatggtctgcgctgttctggtagacctggccccccaggtgttcctggc
atgcctgggcccatcggttggccaggccctgaaggacccaggggtgaaaaaggtgacctg
ggtatgatgggcttgccagggtcaagaggaccaatgggctccaagggctaccctggatcc
agaggggaaaagggatccagaggtgaaaagggtgacctgggtcccaaaggagaaaagggt
ttcccaggatttcctggaatgttggggcagaaaggtgaaatgggtccaaaaggtgaacct
gggatagcaggacaccgaggacccacaggaagaccaggaaaacgaggcaagcagggacag
aaaggggatagtggagttatgggcccaccaggcaagcctgggccttctggtcaacctggc
cgtccggggcccccaggccccccacctgcaggacaacttataatgggacccaaaggggaa
agaggatttcccgggcctccaggaagatgtctttgtggacccactatgaatgtgaataac
ccttcctacggggaatctgtgtatgggcccagttccccgcgagttcctgtgatttttgtg
gtcaacaaccaggaggagcttgagaggctgaacacccaaaacgccattgccttccgcaga
gaccagagatctctgtacttcaaggacagccttggctggctccccatccagctgacccct
ttctaccctgtggattacactgcagaccagcacggcacctgtggggatgggctcctgcag
cctggggaggagtgtgacgacggtaacagcgatgtgggtgacgactgcatccgctgtcac
cgtgcctactgtggagatggtcaccggcatgagggtgtggaggactgtgacggctctgac
tttggctacctgacatgcgagacctatctccctgggtcatatggagacctgcaatgcacc
cagtactgctacatcgactccacgccctgccgctacttcacctga

>gi568815595r:15311346_15526511|GENSCAN_predicted_peptide_9|423_aa
MTGSSFSLAHLLIISGLLCYSAGCLDPQGDVPDKADGGGKRGEARLGQPSWRDGREDAKE
DGMDQMHRKKHKRGDGGQGIEEALPSLDQKKRGGHKACCLLTPPPPPLFPPPFFRGGRSP
LLSPDMKNLMLELETSQSPCMQGSLGSPGPPGPQVLLPSSPPLPTKAKGETVKMLQKGFL
DKQEYPPETPPHRVLAPLACLQEGAPSPPLVIPRQTGSGVDLQQTPTDLQQRVLTVRRKT
NKQKGHLHQNPICTSPSSKTKARELHNTCTSFSSQFNHVEERVSVIEDQMNEMKREEKFR
EKRVRRNEQSLQEIWDYVKRPNLRLIGVPESDGENGIKLENTLQDIIQEYFPNLARQANI
QIEEIQRTPQRYSSRRACLSRVPEGSTKHGKEQLVPATTKSYQTVKTINTMKKLHQLMSK
ITS

>gi568815595r:15311346_15526511|GENSCAN_predicted_CDS_9|1272_bp
atgacgggctcatcattcagcctcgctcatctgctcatcatttcaggactgctctgttac
tcggcaggctgcttggacccacaaggggatgttccagacaaagcagatgggggaggtaaa
agaggtgaggcaagactgggtcagccgtcatggagagatgggcgtgaagacgccaaagag
gatggaatggatcagatgcacagaaagaagcataaaaggggagatgggggacaggggatc
gaggaagcccttcccagcctggatcagaagaagcgtggtggccacaaagcatgctgcctg
ctgacgcctcctccaccaccactgttcccaccaccattcttcagaggtggccgaagtccg
cttctctccccagacatgaagaatctcatgctggaactggagacctcgcagtccccgtgc
atgcaaggctcgctaggctcccctgggcctcccggcccccaggtattgctaccaagcagc
ccacccctccccaccaaagctaaaggggaaacagtcaaaatgctgcagaaaggatttcta
gacaaacaagagtaccctccagagaccccaccccacagagtcctggcccctctggcctgc
ctccaagagggtgctccatcgcctccactggtgatacccaggcaaacagggtctggagtg
gacctccagcaaactccaacagacctgcagcagagggtcctgactgttagaaggaaaact
aacaaacagaaaggacatctacaccaaaatcccatctgtacatcaccatcatcaaagacc
aaagcacgagaactacacaacacatgcacaagcttcagtagccaattcaatcatgtggaa
gaaagggtatcagtgattgaagatcaaatgaatgaaatgaagcgagaagagaagtttaga
gaaaaaagagtaagaagaaatgaacaaagcctccaagaaatatgggactatgtgaaaaga
ccaaatctacgtctgattggtgtacctgaaagtgacggggagaatggaatcaagttggaa
aacactcttcaggatattatccaggagtacttccccaacctagcaaggcaggccaacatt
caaattgaggaaatacagagaacgccacaaagatactcctcaagaagagcctgcctttca
agagttcctgaaggaagcactaaacatggaaaggaacaactggtaccagccaccacaaaa
tcataccaaacggtaaagaccatcaacaccatgaagaaactgcatcaactaatgagcaaa
ataacaagctaa

>gi568815595r:15311346_15526511|GENSCAN_predicted_peptide_10|88_aa
MVVLNPMTLGIYLQLFFLSIVSQPTFINSVLPISAAQEHLAPFKPALISHGNQHKHRHGA
RVWELVTAEGEASWLLQALHHQPAAGCK

>gi568815595r:15311346_15526511|GENSCAN_predicted_CDS_10|267_bp
atggttgtcctgaatccaatgactttgggaatttatcttcagcttttcttcctctctatc
gtgtctcagccgactttcatcaacagcgttcttccaatctcagcagcccaggagcattta
gctccattcaagcccgcacttatctctcatggcaaccagcataaacacaggcatggggcg
cgtgtttgggagctggtgacagctgaaggagaggcctcgtggctgctacaggctttgcac
catcaaccagctgcgggctgcaaatga