GENSCAN 1.0 Date run: 6-Nov-116 Time: 03:34:24 Sequence gi568815592r:53551601_53755272 : 203672 bp : 39.76% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 591 875 285 2 0 53 42 197 0.750 6.89 1.02 PlyA + 2616 2621 6 1.05 2.07 PlyA - 3489 3484 6 1.05 2.06 Term - 5775 5650 126 0 0 41 41 77 0.058 -4.40 2.05 Intr - 10506 10473 34 2 1 83 86 45 0.266 1.11 2.04 Intr - 12654 12535 120 2 0 77 75 98 0.725 6.09 2.03 Intr - 13289 13198 92 2 2 91 59 95 0.861 4.77 2.02 Intr - 22059 21782 278 1 2 12 41 223 0.391 6.31 2.01 Init - 28035 27987 49 0 1 76 61 44 0.200 1.68 2.00 Prom - 35116 35077 40 -6.75 3.06 PlyA - 35160 35155 6 1.05 3.05 Term - 37608 37491 118 2 1 120 40 87 0.854 4.13 3.04 Intr - 38524 38398 127 2 1 52 56 97 0.732 1.72 3.03 Intr - 60892 60857 36 2 0 73 98 42 0.035 1.02 3.02 Intr - 65153 65005 149 1 2 59 60 46 0.100 -2.14 3.01 Init - 66851 66697 155 2 2 54 71 189 0.957 13.31 3.00 Prom - 70622 70583 40 -4.85 4.00 Prom + 77283 77322 40 -3.75 4.01 Init + 78494 78578 85 1 1 95 36 41 0.283 0.63 4.02 Intr + 91135 91243 109 2 1 114 63 44 0.648 2.92 4.03 Intr + 94165 94279 115 1 1 37 103 124 0.660 8.33 4.04 Term + 94786 94812 27 0 0 120 48 16 0.571 -2.10 4.05 PlyA + 96453 96458 6 1.05 5.03 PlyA - 96498 96493 6 1.05 5.02 Term - 100730 99998 733 1 1 89 52 762 0.917 64.65 5.01 Init - 103672 102501 1172 1 2 110 80 821 0.956 76.54 5.00 Prom - 106569 106530 40 -7.85 6.11 PlyA - 106691 106686 6 1.05 6.10 Term - 110209 110122 88 0 1 96 54 101 0.960 3.65 6.09 Intr - 110765 110674 92 2 2 98 87 76 0.980 6.37 6.08 Intr - 114072 114001 72 0 0 122 86 43 0.888 6.18 6.07 Intr - 114330 114129 202 2 1 49 39 104 0.390 -0.03 6.06 Intr - 139581 139418 164 0 2 67 69 136 0.775 7.45 6.05 Intr - 140931 140811 121 2 1 0 87 84 0.861 -0.92 6.04 Intr - 141071 140974 98 2 2 -12 87 152 0.145 2.99 6.03 Intr - 150279 150231 49 2 1 108 75 12 0.231 -0.34 6.02 Intr - 150966 150854 113 0 2 74 67 105 0.212 5.26 6.01 Init - 153915 153595 321 0 0 72 46 165 0.832 8.16 6.00 Prom - 162131 162092 40 -5.15 7.00 Prom + 163120 163159 40 -8.55 7.01 Init + 165157 165434 278 0 2 55 75 170 0.698 8.90 7.02 Intr + 166126 166244 119 1 2 118 92 6 0.303 3.19 7.03 Intr + 178497 178519 23 1 2 110 116 26 0.375 3.94 7.04 Intr + 181140 181172 33 2 0 98 83 28 0.231 0.80 7.05 Term + 192739 192873 135 0 0 55 39 132 0.273 2.04 7.06 PlyA + 193136 193141 6 1.05 8.02 PlyA - 194137 194132 6 1.05 8.01 Term - 203020 202901 120 1 0 67 45 150 0.889 6.09 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:53551601_53755272|GENSCAN_predicted_peptide_1|94_aa MWESLELPRDLLNGFAQHADMLTIMTADNNVHAEVVSDGDEKLVGNWSKSDSCYVLAKRL VASCPCPRDLWNFELEKDDLGYLVEEISKQQSIQ >gi568815592r:53551601_53755272|GENSCAN_predicted_CDS_1|285_bp atgtgggaaagtttggagcttcctagagacttgttgaatggctttgcccaacatgctgac atgttgacaataatgacagctgacaataatgtccatgctgaggtggtctcagatggagat gagaaacttgttgggaactggagtaaaagtgactcttgttacgttttagcaaagagactg gtggcatcgtgcccctgccctagagatttgtggaactttgaacttgagaaagatgattta ggatacctggtggaagaaatttctaagcagcaaagcattcaataa >gi568815592r:53551601_53755272|GENSCAN_predicted_peptide_2|232_aa MERIPHEWFSTIPLVMKLLLDEGEQALGTRLKSPRTRLLNSAEGTASGDCMQPDPGMSHV ILLQATGSAVGVARNRAGELAAGTSRGGAGYKPGVRHAVQDSQKYRTRIQCEESGMNLDF EGKLTWNQILALLFASVIFGWGLPALKHDSTTPLPKAMEVAADEQAVLYLIQVSAWRGRR SGTPFRDCADKRVQEKGETAEVNEENRIQPTEKKKSFAQKKRGSPRRKTNKP >gi568815592r:53551601_53755272|GENSCAN_predicted_CDS_2|699_bp atggagcggatacctcatgaatggtttagcacgatcccgttggtgatgaagctgctcctg gacgaaggagagcaagccttggggactagactgaagtcgccgagaacccgactactaaac agtgctgaggggacagcctctggggactgcatgcagccagaccccgggatgtctcatgtg atcctccttcaggccaccggaagtgcagtgggcgtggccaggaacagagcgggagagcta gcagcaggtacctccaggggaggagcgggttataaacctggtgtaaggcacgccgtccag gactcccagaaataccgaacgagaatccaatgtgaggaaagtggaatgaacttggacttt gaaggcaaactaacctggaatcaaatactggctctgctgtttgcaagtgtgatctttggc tggggcctgccagccttgaagcatgactccacaacgcccctacccaaggccatggaggtt gctgctgatgagcaagctgttttatatttaatccaggtttctgcttggagaggaagaagg tcaggaacaccgttcagggactgtgcagacaagagggtgcaagagaaaggagagacagca gaagtaaatgaagaaaacagaattcagccaactgagaagaaaaaatcttttgctcaaaaa aagagaggaagtcctaggagaaaaacaaacaaaccatga >gi568815592r:53551601_53755272|GENSCAN_predicted_peptide_3|194_aa MKEKYENSPLAQGQKESVGGFRESDVFTGGEAEVRWQQVQVEVDDEEMEVERRGCWIHPA ALWHSGHCAAVKMPLTPPHPSEIIIIDCCSCIPAVTTTSTTGASLLQDITRVEVKAGQKG WKKGVSNHQRASVIDKQVLEGKDVNQEPEKNEILLWPLQLNRHKIKLIISPLSYSAATLI SRCLISKSTDYQAV >gi568815592r:53551601_53755272|GENSCAN_predicted_CDS_3|585_bp atgaaagagaagtatgaaaacagcccactggctcaaggtcagaaggaaagcgtgggaggc tttagagaaagtgatgtgttcacaggtggtgaagctgaagtcaggtggcagcaagttcag gtggaagtggatgatgaggaaatggaggtggaaaggagaggatgctggatccatcctgct gctctttggcattctgggcactgtgctgctgtgaagatgcctttaacacctccccaccct agtgagatcatcatcattgattgctgcagttgcatccctgctgtcaccaccacttccacc acaggggcatccttgctgcaggatatcacaagggtagaagtgaaggcagggcagaaaggc tggaaaaagggagtttctaaccaccaaagagcatcagttattgataaacaagtacttgaa ggaaaagatgttaaccaggaaccagagaaaaatgaaattttgctttggcctctccaactt aacaggcataaaatcaaactcatcatctccccactctcatattctgctgcgactcttatt tctcgatgtctcatctctaagtccactgactaccaagctgtttaa >gi568815592r:53551601_53755272|GENSCAN_predicted_peptide_4|111_aa MEIWKQFSSFLEDDAIDDNKHLGESLQPELLGNVILGFREGEHFHPNHPLLLSKVVLHSS ETPLRHDSRHRRYICERDKIFTFKDDILKEMAEHKSTGKYPEMEYSSANGD >gi568815592r:53551601_53755272|GENSCAN_predicted_CDS_4|336_bp atggagatatggaagcaattttcttctttcctagaagatgatgcaatagatgacaataag catttgggtgaatctctccagccagagcttcttgggaatgtcattttaggcttccgcgaa ggtgagcattttcatccaaatcacccattattgctgtctaaagttgttctacattcttct gagacgcctttaaggcatgactctagacaccggaggtacatctgtgaacgagacaaaatt ttcacctttaaggatgatattctaaaagagatggcagaacataaatcaacaggaaaatat ccagagatggaatactcttctgctaatggggattaa >gi568815592r:53551601_53755272|GENSCAN_predicted_peptide_5|634_aa MAPKKKIVKKNKGDINEMTIIVEDSPLNKLNALNGLLEGGNGLSCISSELTDASYGPNLL EGLSKMRQENFLCDLVIGTKTKSFDVHKSVMASCSEYFYNILKKDPSIQRVDLNDISPLG LATVIAYAYTGKLTLSLYTIGSIISAAVYLQIHTLVKMCSDFLIREMSVENCMYVVNIAE TYSLKNAKAAAQKFIRDNFLEFAESDQFMKLTFEQINELLIDDDLQLPSEIVAFQIAMKW LEFDQKRVKYAADLLSNIRFGTISAQDLVNYVQSVPRMMQDADCHRLLVDAMNYHLLPYH QNTLQSRRTRIRGGCRVLVTVGGRPGLTEKSLSRDILYRDPENGWSKLTEMPAKSFNQCV AVMDGFLYVAGGEDQNDARNQAKHAVSNFCRYDPRFNTWIHLASMNQKRTHFSLSVFNGL VYAAGGRNAEGSLASLECYVPSTNQWQPKTPLEVARCCHASAVADGRVLVTGGYIANAYS RSVCAYDPASDSWQELPNLSTPRGWHCAVTLSDRVYVMGGSQLGPRGERVDVLTVECYSP ATGQWSYAAPLQVGVSTAGVSALHGRAYLVGGWNEGEKKYKKCIQCFSPELNEWTEDDEL PEATVGVSCCTLSMPNNVTRESRASSVSSVPVSI >gi568815592r:53551601_53755272|GENSCAN_predicted_CDS_5|1905_bp atggcacccaaaaagaagattgtcaaaaagaacaaaggagatatcaatgagatgactata atcgtagaagatagccccctaaacaaactgaatgctttgaatgggctcctagagggaggc aatggccttagctgcatttcttctgaactaacagatgcttcttatggccccaacctcttg gaaggtttaagtaaaatgcggcaggagaacttcctatgtgacttagtcattggtaccaaa accaaatcctttgatgttcataagtcagtcatggcttcatgcagtgagtatttttacaac atcctaaaaaaagacccgtcaattcagagggtggatctcaatgatatctcaccactaggc ctggccactgtcattgcatatgcctacactggaaagctcactctctccttgtatacaata ggaagcattatttctgctgctgtttatcttcagatccatactcttgtaaagatgtgcagt gattttctgatacgggagatgagtgttgagaattgcatgtatgttgttaatattgctgaa acatactccctaaaaaatgcaaaagcagcagcccagaaatttattcgggataacttcctt gaatttgcagaatcggatcagtttatgaaacttacatttgaacaaattaatgaacttctt atagatgatgacttacagttgccttctgagatagtagcattccagattgcaatgaaatgg ttagaatttgaccaaaagagagtaaaatacgctgcagatcttttgagcaatattcgcttt ggtaccatctctgcacaagacctggtcaattatgttcaatccgtaccaagaatgatgcaa gatgctgattgtcacagacttctcgtagatgctatgaactaccacttgcttccatatcat caaaacacattgcaatctaggcgaacaagaatccgaggtggctgccgagtcctcgtcact gttgggggacgcccaggccttactgagaagtcccttagcagagacatcttgtatagagac cctgaaaatggatggagcaagcttacggaaatgccagccaaaagttttaatcagtgtgtg gctgtgatggatggatttctttatgtagccggtggtgaagaccagaatgatgcaagaaat caagccaagcatgcagtcagcaatttctgcagatacgatccccgcttcaacacctggata cacctggccagcatgaaccagaagcgcacgcacttcagcctgagcgtgttcaacgggctc gtgtacgccgcgggcggccgcaacgcagaaggaagcctggcctcgctggagtgctacgtg ccctccaccaatcagtggcagccgaagacgcccctggaggtggcgcgctgctgccacgct agcgcggtcgccgacggccgcgtgctggtgaccggaggctacatagccaacgcctactcg cgctctgtgtgcgcctacgacccggccagcgactcgtggcaggagctgccgaacctcagc acaccccggggctggcactgcgcggtcacgctgagcgacagagtgtacgtgatgggcggc agccagctggggccgcgcggggagcgcgtggacgtgctcaccgtggagtgctacagcccc gcgaccggccagtggagctacgcggcgccgctgcaggtgggagtgagcactgcgggcgtc tcggcgctgcatggccgcgcctacctggtggggggctggaacgagggcgagaagaagtac aagaagtgcatccagtgcttcagccccgagctcaacgagtggacggaggacgacgagcta cccgaggccactgtcggcgtgtcctgctgcaccctctcgatgcccaacaacgtgactcgg gaatcccgggccagttcggtatcttctgtgccagtcagtatctga >gi568815592r:53551601_53755272|GENSCAN_predicted_peptide_6|439_aa MIHFPSLPPGTGAWSQRPWRLCKHSPLSSSEHLVNPTGVECGRDSQRVPPKKRNRDQGPE ERAWGLMHLLVWPQCSPPRSLCQPINQTPKDDLEDFRGNHQSLGSYEQQPQQAACASEVY PQDPRLFWALPPNGHNQQCQFEFTRACHLFWNINIHAGLPLTRQEKAQKEVSKGGRDRIM KGYADPDNRTTGGSACEWQQGVWGSGVGENRQICYILEVDLAGLAERLLGWGKKRILLRL KEVEKRHYYTLHLAAEASHRSDRVTPIEEPVFVPQPTHPNPNHYTKTISWSVQETQPLST NRSPAPCPRLAGELKAKDGEEGARCSAPGPKRRLGLPYPVGRQRVTLKIAAAGSCQIPSQ EESPAKTSSSQVDLPLHPRRNSYWQEQCEKCKPPTVLAEACKVLPSFDPKSHILTVTVTI IDDDLEVHSEHLYHRRIKQ >gi568815592r:53551601_53755272|GENSCAN_predicted_CDS_6|1320_bp atgatccactttccaagcctccctcctggaactggggcctggtcccagcgaccctggaga ctctgtaagcacagtcctctgagctcctcagagcatctagtcaaccccacaggtgtagag tgtgggagagacagtcagagagtgcccccaaagaagagaaacagagaccagggcccagag gaaagagcctggggcctcatgcacctcctggtctggcctcagtgcagcccacccagaagt ctctgccaacccatcaatcagacaccaaaggatgacttggaagattttcgaggaaatcac caaagcttgggctcctatgagcagcagccacagcaggccgcctgtgcctctgaagtctat ccacaagatccacgactcttctgggcacttcccccaaatggccacaaccagcagtgccag tttgagtttacaagagcctgccatctgttctggaatatcaacattcatgcagggttaccc ctaacaagacaggagaaggcacaaaaggaggtgagcaaaggaggcagagatcggatcatg aagggctatgcggaccctgacaataggaccactggaggaagtgcgtgtgaatggcagcag ggagtctggggcagtggagttggagaaaataggcaaatttgttatattttggaggtagac ctggcaggacttgctgaaagattgttgggttgggggaagaaaagaattttgctaaggctc aaagaggtggaaaaaaggcactactacaccctacatctagcagctgaagccagccacaga tctgatagagtaacaccaattgaagaaccagtttttgtgccccaacccacccaccccaat cccaatcattatacgaagacaatcagctggtcagttcaagaaacccaacccctctccacc aaccggagccccgccccctgcccccggctggctggagagttgaaggccaaagacggagag gagggcgctcgctgctctgctcccgggccaaagcgcaggctcggcctgccttatccagtg ggacggcaaagggtcactctaaaaattgctgcagccgggtcttgtcagatcccttcccaa gaggagtctcctgctaaaacttcatcatctcaagttgacctgccacttcacccaaggagg aattcctactggcaggaacagtgtgaaaagtgcaagccaccaacagtcctggcggaagct tgtaaagttcttccttcctttgatcccaaaagccacattttgactgtcactgtgactatc attgatgatgacctagaagtgcacagtgaacatttgtatcatcgtagaataaaacagtga >gi568815592r:53551601_53755272|GENSCAN_predicted_peptide_7|195_aa MTQSRFGTESTEAKRKDGEFRFSEFNASVRKCPKMRWTVGHQPMRRTWGIGDTNLEVINA QEIGPGKRHLENYFRHKKIKESLQRSLGRGGQRSEGGVKSSNPLITWLASLVTNPHLEAK GLVMSHLFSINSDRGYVMRKIVSKDGTNLEENQKIIDDQNWKTGTKFLVKIAPLKRHNGD GGIVPKPKYRKRHVL >gi568815592r:53551601_53755272|GENSCAN_predicted_CDS_7|588_bp atgacccaaagcaggtttggaacagagagcacagaagcaaagcgaaaagatggagagttt cgttttagtgagttcaatgcatctgtgaggaaatgtccaaaaatgcgttggactgttggt catcagcctatgagaaggacttgggggattggggatacaaatttagaagtcatcaatgca caggagattggacctgggaaaagacacctggagaactattttaggcacaaaaagataaag gagagcctgcaaaggagcctgggaagaggtggtcagaggtcagagggtggagttaaaagt tctaaccctctaatcacatggctggcttctctggtgaccaacccccatctggaagctaag ggccttgtcatgagtcacctctttagcataaactcagataggggctatgtgatgcggaag attgtgtcaaaggatggaacaaacctggaagagaaccagaaaatcatcgatgatcaaaac tggaaaacaggtacaaaattcttagtgaaaattgctccactgaaacgtcataatggtgat ggagggattgtcccaaaaccaaaataccggaaaagacatgtactctaa >gi568815592r:53551601_53755272|GENSCAN_predicted_peptide_8|39_aa CCAIWPFGGSPEADSIALKCGRSSKPRVEWGEVGWSGVE >gi568815592r:53551601_53755272|GENSCAN_predicted_CDS_8|120_bp tgttgtgccatctggccctttggaggctccccagaagctgactccattgccctgaaatgt ggtcgttcatccaaacccagagtggagtggggtgaagtggggtggagtggagtggagtga