GENSCAN 1.0 Date run: 4-Nov-116 Time: 09:28:05 Sequence gi568815589f:105199218_105485503 : 286286 bp : 39.29% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 372 367 6 1.05 1.02 Term - 8866 8784 83 2 2 84 49 66 0.076 -0.82 1.01 Init - 25176 25077 100 0 1 77 70 79 0.581 5.47 1.00 Prom - 33417 33378 40 -4.65 2.00 Prom + 38467 38506 40 -4.65 2.01 Init + 45722 45909 188 1 2 57 11 257 0.782 11.48 2.02 Intr + 86994 87096 103 0 1 92 48 60 0.024 1.66 2.03 Intr + 100003 100092 90 0 0 70 91 100 0.618 7.77 2.04 Intr + 110507 110649 143 1 2 93 81 73 0.154 5.33 2.05 Intr + 136346 136482 137 2 2 81 81 85 0.125 6.39 2.06 Intr + 149141 149234 94 0 1 107 84 -9 0.076 -1.30 2.07 Intr + 156995 157164 170 2 2 89 59 55 0.269 1.37 2.08 Intr + 161974 162113 140 2 2 84 79 89 0.989 6.86 2.09 Intr + 163604 163790 187 1 1 111 111 23 0.996 5.24 2.10 Intr + 165338 165503 166 0 1 76 82 90 0.991 5.30 2.11 Intr + 166266 166422 157 0 1 66 116 42 0.963 3.89 2.12 Intr + 175381 175518 138 0 0 72 93 98 0.883 8.44 2.13 Intr + 183906 184142 237 2 0 116 115 93 0.937 11.69 2.14 Intr + 186205 186486 282 0 0 55 80 145 0.834 7.09 2.15 Term + 189345 189503 159 2 0 54 49 90 0.540 -1.34 2.16 PlyA + 190706 190711 6 1.05 3.04 PlyA - 190812 190807 6 1.05 3.03 Term - 197277 197224 54 0 0 84 39 37 0.053 -5.02 3.02 Intr - 199650 199543 108 0 0 67 69 88 0.231 4.36 3.01 Init - 214199 214077 123 2 0 78 113 81 0.597 9.82 3.00 Prom - 222900 222861 40 -5.05 4.00 Prom + 226932 226971 40 -4.85 4.01 Init + 240193 240255 63 0 0 18 78 95 0.505 1.75 4.02 Intr + 242999 243073 75 1 0 63 92 67 0.326 3.39 4.03 Intr + 244049 244156 108 1 0 33 52 115 0.500 1.96 4.04 Intr + 246406 246442 37 0 1 60 105 44 0.657 0.22 4.05 Term + 248415 248842 428 1 2 68 42 178 0.569 5.68 4.06 PlyA + 252394 252399 6 1.05 5.00 Prom + 256025 256064 40 -4.85 5.01 Init + 258602 259075 474 1 0 61 85 230 0.623 15.52 5.02 Intr + 265019 265114 96 1 0 72 80 50 0.773 1.89 5.03 Intr + 268976 269107 132 1 0 84 94 145 0.999 14.62 5.04 Intr + 271884 272054 171 2 0 62 50 80 0.637 0.82 5.05 Intr + 272687 272788 102 1 0 124 81 74 0.999 9.75 5.06 Term + 279666 279782 117 2 0 52 49 126 0.986 2.66 5.07 PlyA + 279859 279864 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 223736 223620 117 2 0 90 43 124 0.822 5.66 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:105199218_105485503|GENSCAN_predicted_peptide_1|60_aa MAQMSDVMDKVFKAGIISMFKELKKTMLNEVKKGNAKSEPCWLVIGSDGKQLSSREGSAG >gi568815589f:105199218_105485503|GENSCAN_predicted_CDS_1|183_bp atggcccagatgtcagatgtgatggacaaagtgttcaaagcaggcattataagtatgttc aaagaactaaagaaaaccatgcttaatgaagtaaagaaagggaatgccaagtctgagccc tgctggcttgttattggttcagatggcaaacagctatcaagcagggaagggtctgctgga tag >gi568815589f:105199218_105485503|GENSCAN_predicted_peptide_2|796_aa MPPVRRVARRAPAARYLSLSPASPERTVQSKDPHSAFPSAGSRRCPCAGSAAADSKAPLR GAYSFHSVYTGYANHSKQSAAKCHGSWPNCLVHIPAVSSKREWKPLEDRSCTDIPWLLLF ILFCIGMGFICGFSIATGAAARLVSGYDSYGNICGQKNTKLEAIPNSGMDHTQRKYVFFL DPCNLDLINRKIKSVALCVAACPRQELKTLSDVQKFAEINGSALCSYNLKPSEYTTSPKS SVLCPKLPVPASAPIPFFHRCAPVNISCYAKFAEALITFVSDNSVLHRLISGVMTSKEII LGLCLLSLGGTGVLWWLYAKQRRSPKETVTPEQLQIAEDNLRALLIYAISATVFTVILFL IMLVMRKRVALTIALFHVAGKVFIHLPLLVFQPFWTFFALVLFWVYWIMTLLFLGTTGSP VQNEQGFVEFKISGPLQYMWWYHVVGLIWISEFILACQQMTVAGAVVTYYFTRDKRNLPF TPILASVNRLIRYHLGTVAKGSFIITLVKIPRMILMYIHSQLKGKNAYTATAINSTNFCT SAKDAFVILVENALRVATINTVGDFMLFLGKVLIVCSTGLAGIMLLNYQQDYTVWVLPLI IVCLFAFLVAHCFLSIYEMVVDVLFLCFAIDTKYNDGSPGREFYMDKVLMEFVENSRKAM KEAGKGGVADSRELKPMVGGDEEVAALQEFHFHFLSLSVFTDCTSSGEAFVICITQDMLL FLFVCLPITWMAEFLSQLRPPSIKQIIELNMRTSGFVLELFPERGSLVKETLKIYEFFYD SLKVILEKCTKSDEEV >gi568815589f:105199218_105485503|GENSCAN_predicted_CDS_2|2391_bp atgcctcccgtgcgccgcgtcgcgcggcgggcgcccgccgcccgatacctctcgctgtcc ccagcctcccctgagcgcaccgtgcagtccaaagacccccactccgcctttccgtccgcc gggtcccgccgatgcccgtgcgctggctccgcggctgccgactcaaaggcgcctctgcgc ggcgcctattcattccacagtgtctacacaggatatgctaaccacagtaaacagagcgca gcaaagtgtcatgggtcctggcccaactgtctggttcacattcctgctgtaagctccaaa cgagaatggaagccgctggaggaccgtagctgcacagacataccatggctgctgctcttc atcctcttctgcattgggatgggatttatttgtggcttttcaatagcaacaggtgcagca gcaagactagtgtcaggatacgacagctatggaaatatctgtgggcagaaaaatacaaag ttggaagcaataccaaacagtggcatggaccacacccagcggaagtatgtattctttttg gatccatgcaacctggacttgataaaccggaagattaagtctgtagcactgtgtgtagca gcgtgtccaaggcaagaactgaaaactctgagtgatgttcagaagtttgcagagataaat ggttcagccctatgtagctacaacctaaagccttctgaatacactacatctccaaaatct tctgttctctgccccaaactaccagttccagcgagtgcacctattccattcttccatcgc tgtgctcctgtgaacatttcctgctatgccaagtttgcagaggccctgatcacctttgtc agtgacaatagtgtcttacacaggctgattagtggagtaatgaccagcaaagaaattata ttgggactttgcttgttatcactaggaggcacaggtgtactatggtggctgtatgcaaag caaagaaggtctcccaaagaaactgttactcctgagcagcttcagatagctgaagacaat cttcgggccctcctcatttatgccatttcagctacagtgttcacagtgatcttattcctg ataatgttggttatgcgcaaacgtgttgctcttaccatcgccttgttccacgtagctggc aaggtcttcattcacttgccactgctagtcttccaacccttctggactttctttgctctt gtcttgttttgggtgtactggatcatgacacttctttttcttggcactaccggcagtcct gttcagaatgagcaaggctttgtggagttcaaaatttctgggcctctgcagtacatgtgg tggtaccatgtggtgggcctgatttggatcagtgaatttattctagcatgtcagcagatg acagtggcaggagctgtggtaacatactattttactagggataaaaggaatttgccattt acacctattttggcatcagtaaatcgccttattcgttaccacctaggtacggtggcaaaa ggatctttcattatcacattagtcaaaattccgcgaatgatccttatgtatattcacagt cagctcaaaggaaagaatgcatacacagccacagctatcaacagcaccaacttctgcacc tcagcaaaggatgcctttgtcattctggtggagaatgctttgcgagtggctaccatcaac acagtaggagattttatgttattccttggcaaggtgctgatagtctgcagcacaggttta gctgggattatgctgctcaactaccagcaggactacacagtatgggtgctgcctctgatc atcgtctgcctctttgctttcctagtcgctcattgcttcctgtctatttatgaaatggta gtggatgtattattcttgtgttttgccattgatacaaaatacaatgatgggagccctggc agagaattctatatggataaagtgctgatggagtttgtggaaaacagtaggaaagcaatg aaagaagctggtaagggaggcgtcgctgattccagagagctaaagccgatggtaggtgga gatgaggaggtggccgccctccaagaatttcactttcacttcctctctctctctgtcttc actgactgcacttcttcaggagaagcttttgttatctgtatcacgcaggacatgctgctc tttctgtttgtgtgcttacccatcacttggatggcagaattcttgtcacaactgagacca ccttctataaaacagatcattgaactaaacatgagaacttccggttttgttcttgaactt ttccctgagagaggctcgctagttaaggaaactttaaagatttatgaatttttttatgat tctctgaaagtgattttggaaaaatgtaccaaaagtgatgaagaggtttag >gi568815589f:105199218_105485503|GENSCAN_predicted_peptide_3|94_aa MGGTYIGKAKFSWKLPVFFSLYLIGQNYVIRPLQASGKYGKCGPGKPKDWTPLVEAETHW VSSTASDQRYEAKEPLEELYSKLKEFRKAVPASK >gi568815589f:105199218_105485503|GENSCAN_predicted_CDS_3|285_bp atgggcggcacctatattgggaaggcgaaattttcttggaaattgccagtattcttcagc ttatatctcattgggcagaactatgtcatacgaccacttcaagcttcagggaagtatgga aagtgtggcccaggaaagccaaaagattggacacccctggtagaggcagaaactcactgg gttagttccacagcatctgatcaaagatatgaggccaaagagcctctcgaggagctctat tctaaactcaaggagttcagaaaggctgtgccagccagtaagtaa >gi568815589f:105199218_105485503|GENSCAN_predicted_peptide_4|236_aa MVVVVVVVVVVVVVVVVMEWEEEVGLLAMDHMGVTVGRGDSESLPNRPHNHGRRRKAHPT LAADERQLSSAGKLIFIKPSALWQAGAEPGSTQSGLFPRFRYIPTPALFLPLDRWVRSSL AAPRRIVWGGASDACWVEGGLIRLQHQILFPSRALRTGRLLDAQWPISGPERPLSLEEKG WGKGVQPAEALAVDRRAALPTLHGYFPGAVSRCAQARCAPQPLPSAGRPFTLATAA >gi568815589f:105199218_105485503|GENSCAN_predicted_CDS_4|711_bp atggtggtggtggtggtggtggtggtggtggtggtggtggtggtggtggtgatggagtgg gaggaagaagtggggcttcttgcaatggaccacatgggagtgacggtggggagaggagat tcagagagccttccaaacaggcctcacaatcatggccgaaggcgaaaggcacatcctaca ttggcagcagatgaaagacagctttcatctgcagggaaactcattttcataaaaccatca gctctctggcaagcaggggcagagccaggaagcacgcagtctggtcttttcccacgtttt cgctacatccccaccccagcccttttcctccctttggaccgctgggtccgctcaagcctg gctgcccctcgaaggatagtgtggggaggggcttcagatgcgtgctgggtggagggtggg ctaatccgtctccagcaccagatactctttccttcccgggccctccgcaccggccggctg ctggatgcccagtggcctatttcagggccggagagaccgctatcgctggaggaaaaagga tgggggaagggagtgcagcctgcagaggcactagcggtagacagacgggcggcgctcccc accctccacggctacttcccgggagcagtcagccgctgcgcgcaggcgcggtgcgctcct cagcccctcccttctgcgggccgccctttcaccctggcaaccgcggcgtga >gi568815589f:105199218_105485503|GENSCAN_predicted_peptide_5|363_aa MLAAVAIRAAPGTGTVTGSMRGCSWTKHTASGFLCGCLRLDEVNTVVPENSEMPVTAEPQ GVGGRGATPLLFLPPAAWPVGVAGHVSACSVLPPCLGPWLPGWPNPVVTCCHVGQPPDAG GSWEGYSVTALAQGIPRSRPLEVLLLFTPAVHVTVCSLEALQRIISTLANKNDEIQNFID TLHHTLKGVQENSSNILSELDEEFDSLYSILDEVKESMINCIKQEQARKSQELQLILHIK PATKVAMGHVFSLAFPSSHQVPKSLLHIECALCILINPLAIPGMSDVPVQLSQISQCNNA LENSEELLEFATRSLDIKEPEEFSKRHIEKTWNNDYEVVKEISEEKLESKFDMYKKTMKE VVP >gi568815589f:105199218_105485503|GENSCAN_predicted_CDS_5|1092_bp atgctggctgctgtggccatacgggcagctccgggcactggcacagtcactggctccatg cgaggctgcagctggaccaagcatactgcaagtggcttcctctgtgggtgcctgcgtctg gatgaagtgaacacagtggtgcctgaaaactcagagatgccagtaactgcagagccccaa ggggtcgggggcaggggtgctacacctctcttgttcctgccacctgcagcttggccagtg ggagtggcagggcatgtttcagcctgttcagtcctgccaccttgcttgggcccgtggctc ccaggctggcccaatcccgttgtcacttgctgtcatgtggggcagccgcctgatgctggt ggaagttgggagggctatagtgttacagctctggctcagggaatcccgaggtctaggccc ttagaagtgttgctactcttcactcctgcagttcatgtcactgtctgcagcttggaagct ctacagaggatcatttcaactctggcaaataaaaatgatgaaattcagaactttattgat acactacatcatacactaaaaggagttcaggaaaattcgtccaacatactctcagagtta gatgaagaatttgatagtttatactctatactggatgaagtaaaagaaagtatgattaac tgtatcaagcaggaacaagctcgtaaatcccaagagttacagctcatactgcatattaaa cctgccaccaaagtggccatgggtcatgtcttttctcttgcctttccttcttcccatcaa gtaccaaaaagtcttttgcatattgagtgtgcattatgcatcttgatcaaccccctagcc atccctggcatgagtgatgttccagtgcaactgagtcagattagtcaatgtaataatgcc ctggagaactctgaagaactattagaatttgcaacaaggtcattagatataaaggaacct gaagaattttcaaagaggcatatagagaaaacttggaacaacgattatgaagtagtgaaa gagatctctgaagaaaaattagagagtaaatttgacatgtacaagaaaaccatgaaggaa gttgtcccatga