GENSCAN 1.0	Date run: 7-Jul-118	Time: 15:57:53

Sequence gi568815597r:12468343_12717348 : 249006 bp : 47.88% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 PlyA -   2233   2228    6                               1.05
 1.04 Term -   6144   6023  122  1  2   95   41    37 0.041  -1.66
 1.03 Intr -  17613  17521   93  1  0   95   52    74 0.214   4.44
 1.02 Intr -  20465  20433   33  0  0   58  107    41 0.223   1.19
 1.01 Init -  28581  28422  160  0  1   65   61   272 0.745  22.19
 1.00 Prom -  29769  29730   40                              -2.36

 2.00 Prom +  32321  32360   40                              -5.66
 2.01 Init +  34617  34636   20  2  2   76   50    28 0.134  -3.90
 2.02 Intr +  38511  38751  241  0  1  124  101   345 0.984  36.95
 2.03 Term +  40551  40682  132  2  0  134   48   154 0.999  14.09
 2.04 PlyA +  41664  41669    6                               1.05

 3.15 PlyA -  41997  41992    6                               1.05
 3.14 Term -  45035  44884  152  0  2   -4   42   154 0.020  -0.33
 3.13 Intr -  52927  52874   54  2  0   94   89    27 0.573   2.35
 3.12 Intr -  59161  59101   61  1  1   60   84    40 0.039  -0.89
 3.11 Intr -  64383  64102  282  0  0   35   67   135 0.085   3.72
 3.10 Intr -  81186  81151   36  0  0   98   78    26 0.054   1.06
 3.09 Intr -  85727  85678   50  0  2  117   73    22 0.221   2.00
 3.08 Intr - 102088 101888  201  2  0   55  105   103 0.696   7.96
 3.07 Intr - 102725 102634   92  1  2   86   89   -63 0.175  -6.76
 3.06 Intr - 104511 104386  126  2  0   88   86    74 0.844   6.99
 3.05 Intr - 110614 110376  239  1  2   97   82   442 0.990  40.91
 3.04 Intr - 111070 110951  120  1  0  131   82   235 0.999  27.89
 3.03 Intr - 112324 112181  144  1  0   89   66   253 0.968  23.68
 3.02 Intr - 127749 127644  106  2  1   70   76    21 0.050  -0.68
 3.01 Init - 135262 135183   80  1  2   71   97    26 0.183   2.35
 3.00 Prom - 139124 139085   40                              -4.26

 4.09 PlyA - 139847 139842    6                               1.05
 4.08 Term - 139970 139849  122  0  2   83   48   138 0.902   7.94
 4.07 Intr - 140712 140613  100  0  1   35   63    65 0.500  -1.62
 4.06 Intr - 148961 148812  150  2  0   58   76   277 0.334  23.86
 4.05 Intr - 150489 150387  103  2  1   64  100    77 0.292   6.68
 4.04 Intr - 151184 151120   65  2  2  113   66    22 0.344   0.02
 4.03 Intr - 151632 151585   48  0  0   82   35    86 0.334   1.58
 4.02 Intr - 162683 162600   84  2  0   42  116    40 0.004   2.22
 4.01 Init - 167968 167933   36  1  0   83   97    -8 0.022  -0.38
 4.00 Prom - 172156 172117   40                              -4.26

 5.00 Prom + 174838 174877   40                              -6.36
 5.01 Init + 176205 176372  168  2  0   92   97   151 0.972  13.94
 5.02 Intr + 182781 182997  217  2  1   89   82   120 0.511   9.58
 5.03 Intr + 193449 193512   64  1  1  114   92    36 0.375   4.38
 5.04 Term + 197619 198393  775  0  1  104   44   551 0.845  45.16
 5.05 PlyA + 198644 198649    6                               1.05

 6.00 Prom + 214236 214275   40                              -3.56
 6.01 Init + 215097 215172   76  2  1   62   78    43 0.368   2.06
 6.02 Intr + 217045 217182  138  2  0   67   82    84 0.672   6.14
 6.03 Intr + 222879 222897   19  1  1   87   94    12 0.290  -2.63
 6.04 Term + 224561 224711  151  2  1   41   38   172 0.324   4.88
 6.05 PlyA + 225882 225887    6                               1.05

 7.03 PlyA - 227678 227673    6                               1.05
 7.02 Term - 238779 238676  104  1  2   88   33    82 0.426   1.14
 7.01 Init - 244754 244673   82  2  1   81  116    20 0.538   5.24

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:12468343_12717348|GENSCAN_predicted_peptide_1|135_aa
MRGEECPLDLDLDLDLDLDLDLAQATGWDSAEALTRATQLSAARRNLLCYGFQDISQGPK
YTISGRRQDTSWQLGGTLQCLYKGYISAWCCDGQPDLGDPDLHNNIVVRLSWWESDRSAL
DSSKSPTEKESDSTI

>gi568815597r:12468343_12717348|GENSCAN_predicted_CDS_1|408_bp
atgcgaggggaagaatgccccctggacctggacctggacctggacctggacctggacctg
gacctggcccaggccacaggctgggattctgcagaggctctaactcgggccactcagctg
tcggcggcccggaggaatctgctttgttatggctttcaggatatttcacaaggaccgaaa
tacaccatctctggaaggaggcaggacacgagctggcagcttggtggtacccttcagtgc
ctctacaaggggtacatttctgcctggtgctgcgacggtcaacctgacctaggggacccc
gacctacataataatattgttgtgagattgtcctggtgggagtcagacagatcagccctc
gacagttccaagagccccactgaaaaggaatctgattctacaatttaa

>gi568815597r:12468343_12717348|GENSCAN_predicted_peptide_2|130_aa
MGKLWANFIAVENIDSYCVLISSKAVYFLKSGDYVDREAIFLEVKYDDLYHCLVSKDHGK
VYVQVTKKAVSTSSGVSIPGPSHQKPMVHVKSEVLAVKLSQEINYAKSLYYEQQLMLRLS
ENREQLELDS

>gi568815597r:12468343_12717348|GENSCAN_predicted_CDS_2|393_bp
atggggaagctctgggcaaacttcatcgctgtggagaacattgacagctactgcgtgctc
atctcctccaaagctgtttacttcctgaaaagtggagactacgtggatcgagaagccatt
ttcctagaagtcaaatacgatgacctctaccactgccttgtctccaaagaccatgggaag
gtgtatgtgcaggtgaccaagaaagccgtgagcacgagcagtggagtgtccatccccggc
ccctcccaccagaagcccatggtccatgtgaaatctgaggtccttgctgtcaagttgtca
caagaaataaactacgcaaagagcctctactatgaacagcagcttatgttaagactcagc
gaaaaccgagagcagctggagctggactcctga

>gi568815597r:12468343_12717348|GENSCAN_predicted_peptide_3|580_aa
MRTSAPAEVQGPGFQTGFRSTLEAPRSWGNLGPAEERVWLVGPAAGGCWHLASHLQLRVQ
PQIVLWGRTEKCLKETTEEIRQMGTECHYFICDVGNREEVYQTAKAVREKVGDITILVNN
AAVVHGKSLMDSDDDALLKSQHINTLGQFWTTKAFLPRMLELQNGHIVCLNSVLALSAIP
GAIDYCTSKASAFAFMESLTLGLLDCPGVSATTVLPFHTSTEMFQGMRVRFPNLFPPLKP
ETVARRTVEAVQLNQALLLLPWTMHALVILKRKVSPQVPGWPLLSCNMHSQESQLLPWLL
GHEWARTLQFESPNSLRALSVPGPCEGLGIRDEIPAMKAPTGLTSKRVGTTPGLAGEASE
LRLYEEKPEAKEGISPWRHTDYKLERPCRICQRERETEDKSRQRNVEEPDWLSFPAFIFL
PCWMLPALEHWTTSSSAFGLLDLYQWFDRDSRAIGHRQKATLSASLLLRFGDSDWPPCSS
ACRRPIVELRLRPPATAAMEAMEMRLFGVELFCWICTLSSTQAPTPTDRPSVQLHVAGQE
YSQAARFHCTFTVINVKWVLNVARQTYHASVAYSLSRYPT

>gi568815597r:12468343_12717348|GENSCAN_predicted_CDS_3|1743_bp
atgcggacctcagcgccagctgaggttcaaggtccaggttttcaaactgggttccggagc
accctagaggccccaagaagctggggtaacctcgggccggcggaggagcgcgtgtggctc
gtaggccctgcggccggaggctgctggcaccttgcatctcatttacagctccgggttcag
ccacagattgttctctggggccggactgagaaatgcctgaaggagacgacggaggagatc
cggcagatgggcactgagtgccattacttcatctgtgatgtgggcaaccgggaggaggtg
taccagacggccaaggccgtccgggagaaggtgggtgacatcaccatcctggtgaacaat
gccgccgtggtccatgggaagagcctaatggacagtgatgatgatgccctcctcaagtcc
caacacatcaacaccctgggccagttctggaccaccaaggccttcctgccgcgtatgctg
gagctgcagaatggccacatcgtgtgcctcaactccgtgctggcactgtctgccatcccc
ggtgccatcgactactgcacatccaaagcgtcagccttcgccttcatggagagcctgacc
ctggggctgctggactgtccgggagtcagcgccaccacagtgctgcccttccacaccagc
accgagatgttccagggcatgagagtcaggtttcccaacctctttcccccactgaagccg
gagacggtggcccggaggacagtggaagctgtgcagctcaaccaggccctcctcctcctc
ccatggacaatgcatgccctcgttatcttgaaaaggaaggtaagcccccaagtgccaggc
tggcccctgctctcatgcaacatgcactcccaggaaagccaacttctgccctggctgctg
ggacatgagtgggccaggacccttcagtttgagtcaccaaacagcctgagggccctctct
gtgccgggcccatgtgagggcctggggatcagagatgaaatacccgccatgaaggccccc
acaggcttgacaagtaaacgggtggggactacaccagggcttgctggggaagcgtcggaa
ctccggttgtacgaggagaagcctgaagccaaggagggaatcagcccgtggagacacaca
gactacaagttggagaggccttgccgcatctgtcaaagggaaagggagaccgaggataaa
agcaggcagaggaacgtggaagaaccagactggctgagttttccagccttcatctttctc
ccgtgctggatgcttcctgcccttgaacattggactacaagttcttcagcttttggactc
ttggacttataccagtggtttgacagggactctcgggccattggccacagacagaaggct
acactgtcggcttccctacttttgaggtttggggactcggactggcctccttgctcctcg
gcttgtagacggcctattgtggaacttcgccttcgtccaccagctacagcggcgatggag
gcgatggaaatgcgcctatttggagttgagctgttctgctggatctgcactctctccagc
acgcaggctcctactcccaccgaccgcccttctgtgcagctacatgtggccggacaagaa
tacagccaggctgcccgattccactgtacattcacggtcatcaacgtcaaatgggtcctt
aatgtcgcccggcaaacataccatgcttctgtggcctattccctttcccgttaccctaca
tga

>gi568815597r:12468343_12717348|GENSCAN_predicted_peptide_4|235_aa
MDTVPILQQETKVGPEKAPQVPALQNWQPSPQFSDSPSLKIFPGFFAGLADTSRCWWLFT
QKEDPPLQNLSPELGLCSKVLLPSGFPLSIVRHLGQAFKSPLYVNTEEVRAKMIYLVVKA
AVGLVLPAKLRDLSRENVLITGGGRGIGRQLAREFAERGARKNEGDAFQSYFVKRLLGYC
VHVGQHFDRALDQLEVIFMTVVLYMSVVPYTIGNIRDCGNIQGDFKKVVENGIKG

>gi568815597r:12468343_12717348|GENSCAN_predicted_CDS_4|708_bp
atggacactgtacccattttacagcaggaaaccaaggtgggcccggaaaaggcaccacaa
gttcctgctctgcagaactggcagcccagcccccagttttcagactctcctagcctgaag
atctttccaggcttcttcgcaggcctggcggacaccagccgctgctggtggcttttcacc
cagaaggaagatcctcccctgcagaacctctccccagaactaggtctttgcagcaaagtc
ctgctccccagcggcttccccctaagcatcgtgaggcacttgggtcaggcttttaaaagt
ccactctacgtgaacacagaagaggtccgggcaaagatgatctatctggtggtgaaagca
gccgtcggactggtgctgcccgccaagctgcgggacctgtcgcgggagaacgtcctcatc
accggcggcgggagaggcatcgggcgtcagctcgcccgcgagttcgcggagcgcggcgcc
agaaagaatgaaggagatgcttttcagtcctactttgtcaaacgacttctggggtattgt
gtccatgtgggccagcactttgacagggcccttgaccaactggaagtaatattcatgacc
gtggtactctacatgtctgtggtaccatacacgattggtaatatacgtgactgtggtaat
atacaaggggacttcaaaaaggtggtggaaaatggaattaaaggatga

>gi568815597r:12468343_12717348|GENSCAN_predicted_peptide_5|407_aa
MAVPWLVLLLALPIFFLGVFVWAVFEHFLTTDIPATLQHPAKLRFLHCIFLYLVTLGNIF
EKLGICSMPKFIRFLHDSVRIKKDPELVVTDLRFGTIPVRLFQPKAASSRPRRGIIFYHG
GATVFGSLDCYHGLCNYLARETESVLLMIGYRKLPDHHSPALFQDCMNASIHFLKALETY
GVDPSRVVVCGESVGGAAVAAITQALVGRSDLPRIRAQVLIYPVVQAFCLQLPSFQQNQN
VPLLSRKFMVTSLCNYLAIDLSWRDAILNGTCVPPDVWRKYEKWLSPDNIPKKFKNRGYQ
PWSPGPFNEAAYLEAKHMLDVENSPLIADDEVIAQLPEAFLVSCENDILRDDSLLYKKRL
EDQGVRVTWYHLYDGFHGSIIFFDKKALSFPCSLKIVNAVVSYIKGI

>gi568815597r:12468343_12717348|GENSCAN_predicted_CDS_5|1224_bp
atggctgtcccctggctagtgctactcttggcattgcccatctttttcctgggggtcttt
gtctgggctgtctttgagcacttcctcaccacggatatccctgctaccttgcagcatcct
gccaagttgagattcctgcattgcatattcctctacctggtcactttggggaatatattt
gagaagctgggaatttgctccatgcccaaatttattcgttttttacatgatagcgtgaga
attaaaaaggaccctgaacttgtggtgaccgacctgcgttttgggacgatacccgtgagg
ctgttccagccgaaggcagcatcctccagaccccggcgaggcatcatcttctaccatgga
ggggccacagtatttgggagcctggattgttaccatggcctgtgcaattatctggcccgg
gagactgaatctgtacttctgatgattgggtaccgcaagcttcctgaccaccattcccct
gcccttttccaagactgcatgaatgcctccattcacttcctgaaggccctggaaacctat
ggggtggacccctccagggttgtggtctgtggagaaagcgtcggaggtgcagcggtggcc
gccatcacccaggccttggtgggcagatcagatcttccccggatccgggctcaggttctg
atttatccagttgtccaggcattctgtttgcagttgccatcctttcagcagaaccaaaat
gtcccattactttcccggaagttcatggtgacttctctgtgtaactatctggccattgac
ctctcctggcgtgacgccatcttgaacggcacttgtgtacccccagacgtctggaggaag
tacgagaagtggctcagccctgacaacatccccaagaaatttaagaacagaggctaccaa
ccctggtctcccggcccttttaatgaagctgcctatctagaagccaaacatatgctggat
gtagaaaattcacccctgatagcagatgatgaggtcatcgctcagcttcctgaggccttc
ctggtgagctgtgagaatgacatactccgtgatgacagcttgctctataagaagcgcttg
gaggaccagggggtccgcgtgacatggtaccacctgtatgatggttttcacggatccatt
atcttttttgataagaaggctctctctttcccatgttccctgaagattgtgaatgctgta
gtcagttatataaagggcatatga

>gi568815597r:12468343_12717348|GENSCAN_predicted_peptide_6|127_aa
MVMPIQEESWGPCLIKVTLDSTAGGDDSVKIKKGSKLVVISLHFRMTLMRLFQLKGDIIF
YHRVGREFGSLGRLLGNSLAIIKFPLTTKSAMKKTEDNNTLVFSVDVKANKYQIAGMSHC
AWLNILK

>gi568815597r:12468343_12717348|GENSCAN_predicted_CDS_6|384_bp
atggtaatgcctatccaggaggagtcgtggggaccctgcctcatcaaagtgacccttgac
tcaactgctggaggagatgacagcgtgaaaataaagaagggctctaaactggtggtgatc
agcctgcattttaggatgacactcatgaggctgttccagctcaagggagacatcatcttc
taccacagagtgggcagggagtttggaagcctgggaaggcttctggggaacagccttgct
atcattaagttcccgctgaccaccaagtctgccatgaagaagacagaagacaacaacaca
cttgtgttctctgtggatgttaaagccaacaagtatcagattgcaggcatgagccactgt
gcctggctgaatattcttaaataa

>gi568815597r:12468343_12717348|GENSCAN_predicted_peptide_7|61_aa
MATKVCTWTEEKTLQELVKYFLVGWDQDWVLVNWDHLIRGEFDKDPSVNQESCPQCGAFM
L

>gi568815597r:12468343_12717348|GENSCAN_predicted_CDS_7|186_bp
atggccactaaagtctgcacatggaccgaggaaaagacactgcaggagctggtaaagtat
ttccttgttggttgggaccaagactgggtattagtgaattgggaccatttaatccgggga
gaatttgataaagaccccagtgtcaaccaggaatcttgcccccaatgtggagcttttatg
ctgtag