GENSCAN 1.0 Date run: 7-Nov-116 Time: 23:58:32

Sequence gi568815594f:188638371_188839156 : 200786 bp : 37.85% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  13016  13085   70  1  1   82   63    78 0.667   5.96
 1.02 Intr +  19129  19202   74  2  2   62   98    14 0.458  -1.99
 1.03 Intr +  19254  19397  144  2  0   98  110    56 0.512   8.36
 1.04 Term +  21138  21230   93  2  0  101   53   100 0.443   4.65
 1.05 PlyA +  21648  21653    6                               1.05

 2.08 PlyA -  23036  23031    6                               1.05
 2.07 Term -  24966  24821  146  1  2   71   48   139 0.630   5.29
 2.06 Intr -  25280  25100  181  2  1   39   44   130 0.614   2.22
 2.05 Intr -  34573  34222  352  0  1   23   43   163 0.001  -0.30
 2.04 Intr -  35011  34858  154  2  1   93   37    88 0.001   2.41
 2.03 Intr -  59813  59667  147  0  0   49  101   145 0.555  11.19
 2.02 Intr -  64400  64271  130  2  1   20   75    91 0.838   0.35
 2.01 Init -  65220  65143   78  0  0   52   70   107 0.360   6.41
 2.00 Prom -  74083  74044   40                              -5.75

 3.00 Prom +  83354  83393   40                              -4.25
 3.01 Init +  95607  95760  154  2  1   60   78    52 0.114   1.60
 3.02 Term + 100046 101124 1079  0  2   21   38   905 0.911  70.08
 3.03 PlyA + 101346 101351    6                               1.05

 4.00 Prom + 101355 101394   40                             -10.25
 4.01 Init + 101838 102052  215  2  2   48  107   136 0.527   9.76
 4.02 Intr + 115996 116178  183  1  0   32   75   157 0.125   6.78
 4.03 Intr + 116368 116594  227  1  2   74   94   113 0.383   7.21
 4.04 Intr + 123292 123489  198  2  0   52   86   127 0.809   7.30
 4.05 Term + 123563 123588   26  0  2  133   51    13 0.929  -0.49
 4.06 PlyA + 123845 123850    6                               1.05

 5.04 PlyA - 123919 123914    6                               1.05
 5.03 Term - 133866 133724  143  1  2   47   42   116 0.837   0.01
 5.02 Intr - 136685 136472  214  2  1  -20   91   160 0.687   2.87
 5.01 Init - 138551 138183  369  2  0   60   68   219 0.586  14.14
 5.00 Prom - 151143 151104   40                              -4.95

 6.03 PlyA - 151223 151218    6                               1.05
 6.02 Term - 154287 154157  131  1  2   96   37    69 0.795  -0.04
 6.01 Init - 164079 163953  127  0  1   62  106   186 0.974  18.17
 6.00 Prom - 177893 177854   40                              -4.55

 7.04 PlyA - 178429 178424    6                               1.05
 7.03 Term - 188662 188528  135  1  0   47   48   159 0.930   4.84
 7.02 Intr - 190818 190726   93  0  0   68   75    67 0.504   2.64
 7.01 Init - 192578 192384  195  2  0   64    8   169 0.907   5.48

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 161337 161455  119  1  2   42   43   170 0.853   5.62

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594f:188638371_188839156|GENSCAN_predicted_peptide_1|126_aa
MEIPVIDSDVGAKHISSICLRDPARHTQSPSINQSPVLSSSHWSQSAQISGPLVVLLSFL
VGSRKVVNSPFIWGFAVVVRLEVTLISSVFVENTQQCAPPLERRSFLQVRASSGLTLRAK
RLQTAH

>gi568815594f:188638371_188839156|GENSCAN_predicted_CDS_1|381_bp
atggagattcccgtgattgacagtgatgttggagcaaagcacatttccagcatctgcttg
cgtgaccctgcaaggcacactcagtccccatcaatcaaccagtcacctgtgctgagctct
tcgcactggtctcaatctgcccagatttcaggaccgttggttgtcctgctcagctttctg
gtgggttccagaaaagttgttaactcaccgtttatttggggttttgctgttgttgtaagg
ctagaagtgacactcatttcttctgtcttcgtggaaaacactcaacagtgcgcacccccg
ctggagcgcaggagcttcctgcaggttcgcgccagctcgggcctgacactcagggcaaag
cggctccaaactgcccactga

>gi568815594f:188638371_188839156|GENSCAN_predicted_peptide_2|395_aa
MVTESKDYLRFNMQAFLSNPLSDVYKPATWRLPLYDKSLGFITPFTLTQADSNFAGRAYP
FQPIANQEIQSSQKKLTRMKLELRVPVVLDVFTEMATLLTIITRLVNKWVDLWPRQENGL
LEFAGGLLQTLFTWVSTMEAAEQQRLLPAPSSGSFVPEGHRFEASWSSPLCLPSEEKSGE
AVWPQLLHCTVVNSAQSKPPSLLSTVRGKSPPKVSVMADAPPPTKLHHPRSTPDCCAGSE
NFKPVVLSLLGSMGVGPAEQEHLAPWLQPPFQESKWFCLAGVPGTTGSCECLLIPTLRTV
PCVNVNLDIIAQIIGCDVPGILAPSGELPLVIHTDDVHLFPMSPGSGAALAFADHFPYTV
FPGNRERHGIDSPSESPGGTNPADTLTLNIYLPEL

>gi568815594f:188638371_188839156|GENSCAN_predicted_CDS_2|1188_bp
atggttacggaatcaaaagactatcttcgatttaacatgcaggctttcctgtcaaatcca
ctttcagatgtttacaagcctgcgacctggaggctgcctctctatgacaagagccttggc
ttcataaccccctttaccttaactcaagctgactccaactttgcaggcagagcttatccc
tttcaaccaattgccaatcaggaaatccaaagttctcaaaagaaactgacaagaatgaag
ctggagctccgcgttccagttgttcttgatgtcttcacagagatggctacacttcttaca
attattactagacttgttaataaatgggttgatctgtggcctcgacaagaaaatggtctg
ctggagtttgctggaggtctactccagaccctgttcacctgggtgtcaacaatggaggct
gctgaacagcaaagattgctgcctgctccttcttctggaagctttgtcccagaggggcac
cggtttgaggccagttggagctctcctctatgccttcccagtgaggagaaatctggagaa
gcagtctggccgcagctgcttcactgcactgtggtgaattctgcccaatccaaacctccc
agtctccttagcactgtcagaggaaaatcgcctcctaaagtctcagtaatggcagatgcc
cctcccccgaccaagctccatcatcccaggtcgactccagactgctgtgctggcagtgag
aatttcaagccagtggttcttagcttgctgggctccatgggagtgggacctgctgagcaa
gaacacttggctccctggcttcagcctcctttccaggagagtaaatggttctgtcttgct
ggggttccaggcaccactgggtcttgtgaatgcctcctgattcctactttacgtactgtc
ccttgtgtaaatgtcaaccttgacattattgcgcaaatcataggctgtgatgtacctggc
attcttgccccttctggagagctgcccttagttattcatacagacgatgtacaccttttc
cctatgagccctgggtctggggctgctttggcatttgcagaccactttccatatacagtc
tttccagggaacagagagagacatggaatagattctccgtcagagtctccaggaggaacc
aaccctgctgacaccttgactctgaacatctatctgcccgaactgtga

>gi568815594f:188638371_188839156|GENSCAN_predicted_peptide_3|410_aa
MKPRTLAVSVTVPKDGVSRVFSSRCSNVPGVSSFWWVRGLADFRSKAADLRPRRPLSSGS
PPLEKLFARGGPLRTFLERQAGSEAHLKVRRPELLAVIKLLNEKEQELRETEHLLHDENE
GLRKLAENEITLCQKEITQLKHQIILLLVPSEETDENDLILEVTAGVGGQEAMLFTSEIF
DMYQQYAAFKRWHFETLEYFPSEVGGLRHASASIGGSEAYRHMKFEGGVHRVQRVPKTEK
QGCVHTSTMTVAILPQPTEINLVINPKDLRIDTKRASGAGEQHVNTTDSAVRIVHLPTGV
VSECQQERSQLKNKELAMTKLRAKLYSMHVEEEINKRQNARKIQIGSKGRSEKIRTYNFP
QNRVTDHRINKTLHDLETFMQGDYLLDELVQSLKEYANYESLVEIISQKV

>gi568815594f:188638371_188839156|GENSCAN_predicted_CDS_3|1233_bp
atgaagcctcggacccttgcagtgagtgttacagttcctaaagacggtgtgtccagagtt
ttttcctccagatgttcaaatgtgcctggagtttcttccttctggtgggttcgtggtctc
gctgacttcaggagtaaagctgcagaccttcgcccccgccggcccctgagctccggtagc
ccgccgctggagaagctgttcgcccggggcgggcccttgcggaccttcctcgagcgccag
gcggggtctgaagcccatttgaaggtcaggaggcccgagttgctggcggtgatcaaactg
ctgaacgagaaggagcaggagctgcgggagactgagcacttgctgcacgatgagaatgaa
ggtttaaggaaacttgcagagaatgaaatcactttgtgtcaaaaagaaataactcagctg
aagcatcagattatcttacttttggttccctcagaagaaacagatgaaaatgatttgatc
ctggaagtaactgcaggagttggaggtcaggaggcaatgttgtttacatcagagatattt
gatatgtatcagcaatatgctgcatttaaaagatggcattttgaaaccctggaatatttt
ccaagtgaagtaggtggccttagacatgcatctgccagcattgggggttcagaagcctat
aggcacatgaaatttgaaggaggtgttcacagagtacaaagagtgccaaagacagaaaag
caaggctgcgtccatactagcaccatgactgtagcaatattaccccagcctactgagatt
aatctggtgattaatccgaaagatttgagaattgacactaagcgagccagtggagctggg
gagcagcatgtaaataccacggacagtgctgtccggatagttcatcttccaacaggtgtt
gtttctgaatgtcaacaagagagatctcagctgaaaaataaagagctggctatgacaaag
ttacgtgcaaaactgtacagcatgcatgtagaagaagaaataaataaaagacagaatgct
agaaaaattcagattggaagtaaaggaagatcagagaaaataagaacatataattttcca
cagaaccgggtcacagatcacagaataaacaagacgctgcatgatcttgaaacttttatg
caaggagattatctactggatgaacttgtacagtcattgaaggaatacgccaattatgaa
tctttagtagaaattatttcccaaaaagtttaa

>gi568815594f:188638371_188839156|GENSCAN_predicted_peptide_4|282_aa
MKESYDVLGLRVPQERIPLKGSPLPHGEVRPNGESGYSLLSDWEIPCLALGQSWCTAGGS
GLTSKEPHTLNCTSSALSPPLAALEERFNRPLHCGSPFLGWPTLEPSPSASGEVWKGEAL
AGTRAAHGAHGPAAELRTCSPPCLSPPTMGSRVAQASPMCATPSSMAPGPINRPRAEECR
HVARDWRTAPPTALAWDPLGKASWAPESGSLLEEAQFTGALVLPLWPHHCICTKPRHFPS
SSQSATEIAKGATMHQSGAAFQWATFVWETPVDLGASPPSWL

>gi568815594f:188638371_188839156|GENSCAN_predicted_CDS_4|849_bp
atgaaagagagctatgatgtcctaggactgagagtaccccaggaaagaatacccttgaaa
gggtctccccttccccatggtgaagtcaggcctaatggagagagtggctacagcctactc
agtgattgggaaattccctgtcttgccctgggccagagctggtgtaccgctggtggatca
ggtcttacaagcaaagaacctcacactctcaactgcacctcctcagccttgtcgccccct
ctggccgcgcttgaggagcgcttcaaccggccgctgcactgtgggagcccctttctgggc
tggccgacgctggagccctctccttctgcttctggggaggtgtggaagggagaggcgctg
gcgggaaccagggctgcacatggcgctcatgggccagcggcagagctcaggacctgcagc
ccgccatgcctgagcccccccacaatgggctcccgtgtggcccaagcctccccaatgtgt
gccaccccctcctccatggcaccgggtcccatcaaccgcccaagggctgaggagtgcagg
catgtggcacgggactggcggacagctccacccacagccctggcatgggatccactaggc
aaagccagctgggctcctgagtcaggcagcctgctggaagaagcacagtttactggagcc
ctggtgcttcctctctggccccatcactgcatttgcacaaagcccagacacttccccagc
agttcccagtcagcaactgaaatagccaagggtgctactatgcaccagtctggagctgct
ttccagtgggccacttttgtttgggaaaccccagttgacctgggtgccagccctccttct
tggctgtga

>gi568815594f:188638371_188839156|GENSCAN_predicted_peptide_5|241_aa
MPGTVVDQEKLLRKKWWKGAQTNLQLPLRQTEPPMKTGIMNFCSKNYHRNIPEKPGESTD
PSEGGGLPPQAPRDSQGTVSWLCFLSWRLVALLTGCLKIHSVLLRGHGGSETGPSGYRMR
GSWEKEKSESLTDIFKEIIEEKFPGLARDLDIQIQEAQRTPGKFITKRSTPNYTVIKLLK
DKGKNFKSCEAKASVLEVLTRTIRQEKEIKGIQISKEEVKLLLFADDMIVYPEIPEDSKN
S

>gi568815594f:188638371_188839156|GENSCAN_predicted_CDS_5|726_bp
atgcctggcacagtagtggatcaagaaaaattattgagaaaaaaatggtggaaaggagcc
cagactaacctgcagctcccactcagacagacagagccgcctatgaagactggcatcatg
aacttctgctccaagaactaccacaggaacataccagaaaagccaggagaatccacagac
ccttctgaaggaggtggtttgccaccgcaggctccgcgggacagccaaggaactgtgagt
tggctttgctttctcagctggaggcttgtagccctactcaccggctgcctgaaaatacac
tccgtgctgttgcggggacatggtggaagtgagactggcccttcaggctaccggatgcgt
gggagctgggaaaaagagaaatctgaaagtttgacagacatattcaaggaaataattgag
gaaaagttccctggccttgctagagatttagacattcaaatacaagaagctcaaagaaca
cctggaaaattcatcacaaaaagatcaacacctaactacacagtcatcaagttacttaaa
gacaaaggaaagaattttaagagctgtgaggcaaaagcatcagtactggaagtcctaacc
agaacaatcagacaagagaaagaaataaagggcatccaaatcagtaaagaggaagtcaaa
ctgctactgtttgctgatgacatgattgtatacccagaaatccctgaagactcaaaaaat
tcctag

>gi568815594f:188638371_188839156|GENSCAN_predicted_peptide_6|85_aa
MLAKCDREEGVDTQQEVEREGSVRVMAKNNVINSDKNNVSAVELLTSLNAGLNATSSFGM
NICFLTAISITLIGEQQDLSSVIKL

>gi568815594f:188638371_188839156|GENSCAN_predicted_CDS_6|258_bp
atgctggctaagtgtgacagagaagagggtgtggacacgcagcaagaagttgaaagggaa
ggaagtgtccgggttatggccaaaaataacgtgattaatagtgataagaataatgtttct
gcagtggaacttttaacttctctgaatgcaggcttgaatgcaacctctagttttggcatg
aatatctgtttcttaactgccatttctattacattgattggtgaacaacaagacctcagt
tcagtgataaagctgtag

>gi568815594f:188638371_188839156|GENSCAN_predicted_peptide_7|140_aa
MPAGYEKWRDSALCMFERTDLDQRSQKPGGGQWSYEGENSYGPSLGQWPKLRWNRVHIRE
AHGAQVSSSSSLVAEAKLSCGVQKRSLENIALSGGEKAVWSTLKMCCLAQPPPSMIRARS
SGQPAAAFTSALAASPHTFM

>gi568815594f:188638371_188839156|GENSCAN_predicted_CDS_7|423_bp
atgccagcaggatatgagaaatggagggatagcgccctgtgcatgtttgagaggacagat
ttggaccagagatctcaaaagccaggaggtggccagtggtcttatgaaggagaaaatagc
tatggcccctccctgggccaatggccaaaacttagatggaatcgagtacacatccgagaa
gcccacggggcacaggtcagctcaagcagttccctagtcgcagaagcaaagctcagctgt
ggggtacagaagaggtcccttgaaaacattgccctttctggtggtgagaaggctgtttgg
tctacactgaagatgtgttgtttagcacagccacctccatcaatgatccgagctagatct
tctggacaacctgctgcagcttttacatcagcacttgctgcttcacctcacacttttatg
tga