GENSCAN 1.0 Date run: 8-Nov-116 Time: 10:23:30 Sequence gi568815595f:15327780_15539152 : 211373 bp : 44.83% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 2587 2582 6 1.05 1.02 Term - 2787 2689 99 0 0 111 41 121 0.649 7.83 1.01 Init - 4629 4492 138 0 0 70 91 286 0.257 25.24 1.00 Prom - 13640 13601 40 -4.26 2.00 Prom + 20356 20395 40 -3.36 2.01 Init + 24205 24252 48 0 0 77 78 57 0.504 4.72 2.02 Term + 44947 45087 141 0 0 66 44 152 0.826 6.53 2.03 PlyA + 45104 45109 6 -0.45 3.03 PlyA - 46323 46318 6 1.05 3.02 Term - 47905 47594 312 1 0 42 45 409 0.310 27.00 3.01 Init - 59220 59218 3 0 0 108 81 0 0.405 1.30 3.00 Prom - 64204 64165 40 -2.46 4.02 PlyA - 64965 64960 6 1.05 4.01 Sngl - 68831 68442 390 2 0 88 54 284 0.951 21.22 4.00 Prom - 78646 78607 40 -4.96 5.05 PlyA - 79244 79239 6 1.05 5.04 Term - 83658 83477 182 1 2 71 43 107 0.804 2.27 5.03 Intr - 86383 86242 142 1 1 79 95 22 0.889 1.93 5.02 Intr - 88163 87993 171 2 0 95 96 101 0.982 11.74 5.01 Init - 96233 96168 66 2 0 79 80 25 0.239 -0.13 5.00 Prom - 97783 97744 40 -5.56 6.00 Prom + 98629 98668 40 -4.36 6.01 Init + 99734 99863 130 1 1 64 63 48 0.453 -0.40 6.02 Intr + 99897 100103 207 1 0 91 81 154 0.527 14.05 6.03 Intr + 102134 102228 95 0 2 114 76 53 0.997 6.38 6.04 Intr + 104308 104444 137 0 2 39 95 120 0.966 7.17 6.05 Intr + 106569 106759 191 0 2 121 65 172 0.999 17.43 6.06 Intr + 108563 108796 234 0 0 80 89 209 0.993 17.86 6.07 Term + 112784 112812 29 0 2 115 43 -7 0.240 -4.26 6.08 PlyA + 114810 114815 6 1.05 7.15 PlyA - 115615 115610 6 1.05 7.14 Term - 123934 123865 70 0 1 81 55 170 0.888 10.41 7.13 Intr - 126152 126050 103 0 1 106 45 143 0.998 11.03 7.12 Intr - 128240 128120 121 2 1 67 75 96 0.954 6.27 7.11 Intr - 128800 128681 120 1 0 49 80 152 0.878 11.19 7.10 Intr - 130546 130407 140 2 2 74 79 50 0.648 2.88 7.09 Intr - 138658 138562 97 1 1 91 110 34 0.852 5.48 7.08 Intr - 142837 142757 81 1 0 133 110 63 0.993 12.83 7.07 Intr - 146256 146221 36 0 0 91 106 6 0.653 1.16 7.06 Intr - 146493 146449 45 0 0 106 90 37 0.922 4.31 7.05 Intr - 147172 147146 27 1 0 109 96 -1 0.662 1.11 7.04 Intr - 147708 147646 63 0 0 83 86 48 0.831 3.01 7.03 Intr - 149418 149347 72 0 0 70 110 1 0.472 0.10 7.02 Intr - 151224 151179 46 2 1 111 65 43 0.866 2.71 7.01 Init - 154219 153816 404 1 2 60 39 173 0.501 5.70 7.00 Prom - 156142 156103 40 -4.96 8.12 PlyA - 156268 156263 6 -0.45 8.11 Term - 156968 156481 488 0 2 17 38 297 0.980 12.46 8.10 Intr - 157421 157259 163 2 1 80 71 47 0.974 1.75 8.09 Intr - 159991 159833 159 1 0 51 85 45 0.458 0.68 8.08 Intr - 160528 160427 102 1 0 48 77 57 0.171 0.97 8.07 Intr - 161858 161746 113 0 2 34 117 54 0.235 3.00 8.06 Intr - 165817 165647 171 2 0 0 100 82 0.138 0.51 8.05 Intr - 171363 171250 114 1 0 56 48 83 0.093 1.52 8.04 Intr - 173228 173105 124 2 1 62 79 39 0.059 0.56 8.03 Intr - 184057 183916 142 0 1 74 94 47 0.685 4.26 8.02 Intr - 185489 185296 194 2 2 35 63 104 0.330 0.89 8.01 Init - 208276 208217 60 1 0 80 93 22 0.484 3.25 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:15327780_15539152|GENSCAN_predicted_peptide_1|78_aa MDAALKRSRSEEPAEILPPARDEEEEEEEGMEQGLEEEEEVDPRIQGELEKLNQSTDDIN RRETELEVQRAEFLFLVK >gi568815595f:15327780_15539152|GENSCAN_predicted_CDS_1|237_bp atggacgcggcactgaagcggagccgctcggaggagccagccgaaatcctgccgcctgcc cgggacgaggaggaggaggaggaagaggggatggagcaggggctggaggaggaagaagag gtggatccccggatccagggagaactggagaagttaaatcagtccacggatgatatcaac agacgggagactgaacttgaggtacagagagctgagttcctgttcttggtgaagtga >gi568815595f:15327780_15539152|GENSCAN_predicted_peptide_2|62_aa MVDALAACTLYLEKPQAAKSPPVHAIETATRAIGALTYPEPPLLPSPQYPTIARPTIAYY TY >gi568815595f:15327780_15539152|GENSCAN_predicted_CDS_2|189_bp atggtggatgcactggcagcttgcaccctgtacctggaaaagccacaggcagccaagagc ccacctgttcatgctattgaaacagccaccagagccattggggcccttacttacccagag ccacccctgctgccctcgccccagtaccctaccattgcaaggcccacaattgcctactat acctactga >gi568815595f:15327780_15539152|GENSCAN_predicted_peptide_3|104_aa MHLRPAAVAIAAATMPKRKAEGDAKGDKAKVKDEPQRRSARLSAKPAPPKPEPKPKKASA KKGEKVPKGKKGKADAGKEGNNPAENGDAKTDQSQKAEGAGDAK >gi568815595f:15327780_15539152|GENSCAN_predicted_CDS_3|315_bp atgcacctacgtcccgccgccgtcgccatcgccgcggccaccatgcccaagagaaaggct gaaggagatgctaaaggagataaagccaaggtgaaggacgaaccgcagagaagatccgct aggttgtctgctaaacctgctcctccaaagccagagcccaagcctaaaaaggcctctgca aagaagggagagaaggtacccaaagggaaaaagggaaaggctgatgctggcaaggagggg aataaccctgcagaaaatggagatgccaaaacagaccagtcacagaaagctgaaggtgct ggagatgccaagtga >gi568815595f:15327780_15539152|GENSCAN_predicted_peptide_4|129_aa MGKKQSRKTENSKNQSASPPPKERSSSPAMEQSWTENDFDELREEGFRQSNYSELKEEVR THGKEVKNLEKRLDEWLTRITSVEKPLNDLMELKTMAQELRDKCTSLSSRFNQLEERVSV MEDQMNEMK >gi568815595f:15327780_15539152|GENSCAN_predicted_CDS_4|390_bp atggggaaaaaacagagcagaaaaactgaaaattctaaaaatcagagcgcctctcctcct ccaaaggaacgcagctcctcaccagcaatggaacaaagctggacggagaatgactttgat gagttgagagaagaaggcttcagacaatcaaactactctgagctaaaggaggaagttcga acccatggcaaagaagttaaaaaccttgaaaaaagattagacgaatggctaactagaata accagcgtagagaagcccttaaatgacctgatggagctgaaaaccatggcacaagaacta cgtgacaaatgcacaagcctcagtagccgattcaatcaactggaagaaagggtatcagtg atggaagatcaaatgaatgaaatgaaatga >gi568815595f:15327780_15539152|GENSCAN_predicted_peptide_5|186_aa MVSNSWAQVINLPQPPKMLGLQQNPLYDTERCKVFQCDLTKDDLLDHVPPESVDVVMLIF VLSAVHPDKMHLVLQNIYKVLKPGKSVLFRDYGLYDHAMLRFKASSKLGENFYVRQDGTR SYFFTDDFLAQLFMDTGYEEVVNEYVFRETVNKKEGLCVPRVFLQSKFLKPPKNPSPVVL GLDPKS >gi568815595f:15327780_15539152|GENSCAN_predicted_CDS_5|561_bp atggtctcaaactcctgggctcaggtgatcaacctgcctcagcctcccaaaatgctggga ttacagcaaaatcctttatatgatacagaaagatgcaaggtattccagtgtgatctgact aaagatgatcttctggatcatgtaccgccagagtctgtggatgttgttatgttgatattt gtgctgtcagctgttcatcctgataagatgcaccttgtcttacaaaacatttacaaggta ttaaaaccaggcaaaagtgtcttgtttcgtgactacggactgtatgatcatgccatgctt aggtttaaagccagcagcaaacttggagaaaacttttatgttagacaagatgggaccaga tcatatttttttactgatgacttcctggctcagctctttatggacacaggttatgaagaa gtggtaaacgagtatgtgtttcgagagacggtgaataaaaaagaaggcctgtgtgtgcca agagttttccttcagagcaaatttctaaagcctcctaagaacccatctcctgtggtcctg ggcctggatcctaagtcctga >gi568815595f:15327780_15539152|GENSCAN_predicted_peptide_6|340_aa MAPPPPLPLGTRKFLPDAGSPTALPPQIPLSPPRRGENLLLDPERWPGSTVLPDASARSS DRGCTGRAPIWVRGRCGAMNGTANPLLDREEHCLRLGESFEKRPRASFHTIRYDFKPASI DTSCEGELQVGKGDEVTITLPHIPGSTPPMTVFKGNKRPYQKDCVLIINHDTGEYVLEKL SSSIQVKKTRAEGSSKIQARMEQQPTRPPQTSQPPPPPPPMPFRAPTKPPVGPKTSPLKD NPSPEPQLDDIKRELRAEVDIIEQMSSSSGSSSSDSESSSGSDDDSSSSGGEDNGPASPP QPSHQQPYNSRPAVANGTSRPQGSNQLMNTLTSGGEAVWT >gi568815595f:15327780_15539152|GENSCAN_predicted_CDS_6|1023_bp atggcaccgcccccgccacttccgctaggaacccggaagttcctacccgacgccggaagt cccacggccttgcctcctcagattcctctctcacccccacgcagaggagagaacttgctt ctggacccggagcggtggcccggaagcacagtcctcccagacgccagcgccagaagctcg gatcgcggctgcaccgggagagcgccgatctgggtgcgaggcaggtgcggggccatgaat gggaccgcaaacccgctgctggaccgcgaggaacattgcctgaggctcggggagagcttc gagaagcggccgcgggcctccttccacactattcgttatgattttaaaccagcatctata gacacttcctgtgaaggagagcttcaagttggcaaaggagatgaagtcacaattacactg ccacatatccctggatccacaccacccatgactgtgttcaaggggaacaaacggccttac cagaaagactgtgtgcttattattaatcatgacactggtgaatatgtgctggaaaaactc agtagcagcattcaggtgaagaaaacaagagctgagggcagcagtaaaatccaggcccga atggaacagcagcccactcgtcctccacagacgtcacagccaccaccacctccaccacct atgccattcagagctccaacgaagcctccagttggacccaaaacttctcccttgaaagat aacccctcacctgaacctcagttggatgacatcaaaagagagctgagggctgaagttgac attattgaacaaatgagcagcagcagtgggagcagctcttcagactctgagagctcttcg ggaagtgatgacgatagctccagcagtggaggcgaggacaatggcccagcctctcctccg cagccttcacaccagcagccctacaacagtaggcctgccgttgccaatggaaccagccgg ccacaaggaagcaaccagctcatgaacaccctcactagtggaggggaagcagtctggact tag >gi568815595f:15327780_15539152|GENSCAN_predicted_peptide_7|474_aa MSELPFTIASKRIKYLGIKLTRDVKDLFKENCKPLLNEMKENRNKWKNIPCSWIGRINIV KMAILPKVIYRFSAIPIKLPMTFFTELEKTTLNFIWNQKRDHIAKTILSQKKKAGGITLP DFKLYYKATVTKTAWGSLADQEGRYGLRCSGRPGPPGVPGMPGPIGWPGPEGPRGEKGDL GMMGLPGSRGPMGSKGYPGSRGEKGSRGEKGDLGPKGEKGFPGFPGMLGQKGEMGPKGEP GIAGHRGPTGRPGKRGKQGQKGDSGVMGPPGKPGPSGQPGRPGPPGPPPAGQLIMGPKGE RGFPGPPGRCLCGPTMNVNNPSYGESVYGPSSPRVPVIFVVNNQEELERLNTQNAIAFRR DQRSLYFKDSLGWLPIQLTPFYPVDYTADQHGTCGDGLLQPGEECDDGNSDVGDDCIRCH RAYCGDGHRHEGVEDCDGSDFGYLTCETYLPGSYGDLQCTQYCYIDSTPCRYFT >gi568815595f:15327780_15539152|GENSCAN_predicted_CDS_7|1425_bp atgagtgagctcccattcacaattgcttcaaagagaataaaatacctaggaatcaaactt acaagggatgtgaaggacctcttcaaggagaactgcaaaccactgctcaatgaaatgaaa gagaacagaaacaaatggaagaacattccatgctcatggataggaagaatcaatattgtg aaaatggccatactgcccaaggtaatttatagattcagtgccatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttaaacttcatatggaaccaaaaaaga gaccacattgccaagacaatcttaagccaaaagaagaaagctggaggcatcacactacct gacttcaaactatactacaaggctacagtaaccaagacagcatgggggagcttggccgac caggaaggaaggtatggtctgcgctgttctggtagacctggccccccaggtgttcctggc atgcctgggcccatcggttggccaggccctgaaggacccaggggtgaaaaaggtgacctg ggtatgatgggcttgccagggtcaagaggaccaatgggctccaagggctaccctggatcc agaggggaaaagggatccagaggtgaaaagggtgacctgggtcccaaaggagaaaagggt ttcccaggatttcctggaatgttggggcagaaaggtgaaatgggtccaaaaggtgaacct gggatagcaggacaccgaggacccacaggaagaccaggaaaacgaggcaagcagggacag aaaggggatagtggagttatgggcccaccaggcaagcctgggccttctggtcaacctggc cgtccggggcccccaggccccccacctgcaggacaacttataatgggacccaaaggggaa agaggatttcccgggcctccaggaagatgtctttgtggacccactatgaatgtgaataac ccttcctacggggaatctgtgtatgggcccagttccccgcgagttcctgtgatttttgtg gtcaacaaccaggaggagcttgagaggctgaacacccaaaacgccattgccttccgcaga gaccagagatctctgtacttcaaggacagccttggctggctccccatccagctgacccct ttctaccctgtggattacactgcagaccagcacggcacctgtggggatgggctcctgcag cctggggaggagtgtgacgacggtaacagcgatgtgggtgacgactgcatccgctgtcac cgtgcctactgtggagatggtcaccggcatgagggtgtggaggactgtgacggctctgac tttggctacctgacatgcgagacctatctccctgggtcatatggagacctgcaatgcacc cagtactgctacatcgactccacgccctgccgctacttcacctga >gi568815595f:15327780_15539152|GENSCAN_predicted_peptide_8|609_aa MKPYKEVQVSEILQPQLTLQFLASIASLDSNSYSSAALPCRDAEVGIQRAMEPPLKTLKL PFIEYFLVTRLLTHIIFNLHNNVTRASLASETVLTGVDLMEWPPKGFQILCGHLEMKFST TEHSVQELGRLQESQTLRDAKVGKGLELIWSKAFIVHLEKSRPTGEMGCPRSHITFSFVS NTLAQTGLELRPLAMKVFPEAEEGRLPLDGLDPQGDVPDKADGGGKRGEARLGQPSWRDG REDAKEDGMDQMHRKKHKRGDGGQGIEEALPSLDQKKRGGHKACCLLTPPPPPLFPPPFF RGGRSPLLSPDMKNLMLELETSQSPCMQGSLGSPGPPGPQVLLPSSPPLPTKAKGETVKM LQKGFLDKQEYPPETPPHRVLAPLACLQEGAPSPPLVIPRQTGSGVDLQQTPTDLQQRVL TVRRKTNKQKGHLHQNPICTSPSSKTKARELHNTCTSFSSQFNHVEERVSVIEDQMNEMK REEKFREKRVRRNEQSLQEIWDYVKRPNLRLIGVPESDGENGIKLENTLQDIIQEYFPNL ARQANIQIEEIQRTPQRYSSRRACLSRVPEGSTKHGKEQLVPATTKSYQTVKTINTMKKL HQLMSKITS >gi568815595f:15327780_15539152|GENSCAN_predicted_CDS_8|1830_bp atgaagccctataaagaagtccaggtttctgaaatcctccagccccagctaactcttcag ttccttgcctccatcgcctctctggacagtaacagttatagctcagcagcccttccctgt cgggatgcagaagttgggatccagcgggccatggaaccaccactaaaaacactaaagtta cctttcattgagtattttctggtcaccagactcttgacacatattatatttaatcttcac aacaatgtcacaagggccagcttggcttctgagaccgtccttactggggtggacctcatg gagtggcccccaaaaggtttccagattctctgtggtcatctggaaatgaaattctcaacc actgagcacagtgtccaggaattagggaggctccaggaatcacagaccctgagagatgcc aaagttggaaagggcttagagctcatctggtccaaggctttcattgtacatctagaaaaa tccagacccacaggggaaatgggttgtcctaggtcacacattacgtttagctttgtcagc aacactttggcccagacaggattagagctgagacccctggccatgaaggtcttccctgaa gcagaggagggcaggctccccttggacggtctcgacccacaaggggatgttccagacaaa gcagatgggggaggtaaaagaggtgaggcaagactgggtcagccgtcatggagagatggg cgtgaagacgccaaagaggatggaatggatcagatgcacagaaagaagcataaaagggga gatgggggacaggggatcgaggaagcccttcccagcctggatcagaagaagcgtggtggc cacaaagcatgctgcctgctgacgcctcctccaccaccactgttcccaccaccattcttc agaggtggccgaagtccgcttctctccccagacatgaagaatctcatgctggaactggag acctcgcagtccccgtgcatgcaaggctcgctaggctcccctgggcctcccggcccccag gtattgctaccaagcagcccacccctccccaccaaagctaaaggggaaacagtcaaaatg ctgcagaaaggatttctagacaaacaagagtaccctccagagaccccaccccacagagtc ctggcccctctggcctgcctccaagagggtgctccatcgcctccactggtgatacccagg caaacagggtctggagtggacctccagcaaactccaacagacctgcagcagagggtcctg actgttagaaggaaaactaacaaacagaaaggacatctacaccaaaatcccatctgtaca tcaccatcatcaaagaccaaagcacgagaactacacaacacatgcacaagcttcagtagc caattcaatcatgtggaagaaagggtatcagtgattgaagatcaaatgaatgaaatgaag cgagaagagaagtttagagaaaaaagagtaagaagaaatgaacaaagcctccaagaaata tgggactatgtgaaaagaccaaatctacgtctgattggtgtacctgaaagtgacggggag aatggaatcaagttggaaaacactcttcaggatattatccaggagtacttccccaaccta gcaaggcaggccaacattcaaattgaggaaatacagagaacgccacaaagatactcctca agaagagcctgcctttcaagagttcctgaaggaagcactaaacatggaaaggaacaactg gtaccagccaccacaaaatcataccaaacggtaaagaccatcaacaccatgaagaaactg catcaactaatgagcaaaataacaagctaa