GENSCAN 1.0 Date run: 4-Nov-116 Time: 09:52:50 Sequence gi568815577f:46536952_46763556 : 173032 bp : 44.88% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 495 588 94 2 1 98 92 162 0.722 17.14 1.02 Intr + 1532 1651 120 0 0 94 67 126 0.954 11.57 1.03 Intr + 2926 3040 115 2 1 73 91 123 0.685 10.61 1.04 Intr + 4805 4944 140 2 2 112 94 109 0.901 14.01 1.05 Intr + 8186 8322 137 0 2 101 68 53 0.987 4.89 1.06 Intr + 8930 9010 81 1 0 95 76 82 0.712 7.53 1.07 Intr + 9964 10091 128 0 2 68 64 260 0.901 21.08 1.08 Intr + 12820 12934 115 1 1 99 80 233 0.995 24.05 1.09 Intr + 13592 13793 202 1 1 83 92 107 0.992 9.46 1.10 Intr + 14683 14792 110 2 2 60 56 55 0.484 -0.50 1.11 Intr + 14873 14953 81 1 0 55 100 85 0.756 6.23 1.12 Intr + 17218 17341 124 0 1 35 80 34 0.612 -2.54 1.13 Intr + 17624 17745 122 0 2 106 69 201 0.607 20.21 1.14 Intr + 17871 17982 112 2 1 94 96 130 0.992 14.35 1.15 Intr + 18036 18070 35 1 2 118 66 -12 0.485 -2.46 1.16 Intr + 18928 19140 213 0 0 87 100 78 0.786 7.81 1.17 Intr + 19988 20118 131 1 2 59 109 123 0.998 10.99 1.18 Intr + 20634 20802 169 0 1 80 96 262 0.999 26.15 1.19 Intr + 21272 21442 171 1 0 109 94 219 0.992 24.74 1.20 Intr + 23771 23832 62 1 2 102 95 130 0.997 12.63 1.21 Intr + 25492 25633 142 1 1 56 64 34 0.591 -1.84 1.22 Intr + 26907 26981 75 2 0 79 83 39 0.534 2.21 1.23 Intr + 28762 28936 175 0 1 46 78 189 0.997 13.21 1.24 Intr + 29609 29732 124 0 1 51 83 195 0.997 14.94 1.25 Intr + 30419 30667 249 2 0 114 58 587 0.449 54.85 1.26 Intr + 30821 30947 127 2 1 97 40 35 0.026 0.28 1.27 Intr + 45768 45896 129 2 0 73 98 31 0.006 3.49 1.28 Intr + 49040 49185 146 1 2 38 47 133 0.007 3.28 1.29 Term + 61858 61987 130 1 1 42 43 150 0.075 3.55 1.30 PlyA + 62005 62010 6 -3.24 2.03 PlyA - 62033 62028 6 -0.45 2.02 Term - 62552 62412 141 2 0 69 50 229 0.920 15.13 2.01 Init - 65464 65327 138 1 0 43 105 186 0.721 15.94 2.00 Prom - 77462 77423 40 -1.56 3.00 Prom + 83843 83882 40 -3.46 3.01 Init + 87618 87682 65 2 2 80 116 52 0.690 5.96 3.02 Intr + 88665 88824 160 0 1 101 47 24 0.147 -0.41 3.03 Intr + 106584 106688 105 2 0 65 68 157 0.761 11.81 3.04 Intr + 107355 107537 183 2 0 86 32 126 0.937 6.78 3.05 Intr + 111507 111668 162 2 0 64 101 158 0.904 14.87 3.06 Intr + 112624 112788 165 0 0 108 81 372 0.975 38.66 3.07 Intr + 121794 121969 176 2 2 72 100 242 0.972 22.54 3.08 Intr + 123882 124011 130 0 1 96 78 34 0.994 3.90 3.09 Intr + 124849 124985 137 0 2 47 86 233 0.943 18.37 3.10 Intr + 126432 126603 172 0 1 87 69 67 0.810 4.65 3.11 Term + 126956 127042 87 1 0 89 38 63 0.692 -0.94 3.12 PlyA + 127142 127147 6 1.05 4.00 Prom + 130138 130177 40 -6.76 4.01 Init + 130267 130556 290 0 2 82 105 247 0.494 20.39 4.02 Term + 153850 154294 445 1 1 -16 35 477 0.102 26.71 4.03 PlyA + 160630 160635 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 42841 42485 357 1 0 67 48 166 0.987 6.56 S.002 Sngl + 153932 154294 363 1 0 27 35 431 0.847 27.78 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577f:46536952_46763556|GENSCAN_predicted_peptide_1|1252_aa SVMNRMHVVSVPYALMKANPLSWIQKVCFYKARAALVKSRDMHWSLLAQRGQRDVSLSSL RMLIVADGANPWSISSCDAFLNVFQSRGLRPEVICPCASSPEALTVAIRRPPDLGGPPPR KAVLSMNGLSYGVIRVDTEEKLSVLTVQDVGQVMPGANVCVVKLEGTPYLCKTDEVGEIC VSSSATGTAYYGLLGITKNVFEAVPVTTGGAPIFDRPFTRTGLLGFIGPDNLVFIVGKLD GLMVTGVRRHNADDVVATALAVEPMKFVYRGRIAVFSVTVLHDDRIVLVAEQRPDASEED SFQWMSRVLQAIDSIHQVGVYCLALVPANTLPKAPLGGIHISETKQRFLEGTLHPCNVLM CPHTCVTNLPKPRQKQPEVGPASMIVGNLVAGKRIAQASGRELAHLEDSDQARKFLFLAD VLQWRAHTTPDHPLFLLLNAKGTVTSTATCVQLHKRAERVAAALMEKGRLSVGDHVALVY PPGVDLIAAFYGCLYCGCVPVTVRPPHPQNLGTTLPTVKMIVEVSKSACVLTTQAVTRLL RSKEAAAAVDIRTWPTILDTGVVWPGCRPKTHDSQGQIMCRLLPVKAQSVQNTCGNTNVA GVSCLTDDIPKKKIASVFRPPSPDVLAYLDFSVSTTGILAGVKMSHAATSALCRSIKLQC ELYPSRQIAICLDPYCGLGFALWCLCSVYSGHQSVLVPPLELESNVSLWLSAVSQYKARV TFCSYSVMEMCTKGLGAQTGVLRMKGVNLSCVRTCMVVAEERPRIALTQSFSKLFKDLGL PARAVSTTFGCRVNVAICLQGTAGPDPTTVYVDMRALRHDRGKGRSLATLTPFTAMATAT CGESERGPERDPAKPCDVQDTDSCRAAGILPGVKVIIAHTETKGPLGDSHLGEIWVSSPH NATGYYTVYGEEALHADHFSARLSFGDTQTIWARTGYLGFLRRTELTDASGGRHDALYVV GSLDETLELRGMRYHPIDIETSVIRAHRSIAECAVFTWTNLLVVVVELDGLEQDALDLVA LVTNVVLEEHYLVVGVVVIVDPGVIPINSRGEKQRMHLRDGFLADQLDPIYVAYNIKHFL NYLKKYFESAKYIYKNTDAGILTDGETRKGKERPGMGIWEPPMAAASSRPSFHFAGNFFN NQVNMPNGNLDPQKKQREPVKETTSLGKGGEYYIKGISHGTKESEQQPSALDLPSERGYP SEKEPENQLRSAASAERSPGDQAHPGKQAEATTPLQGHTGQPAVICMDEERI >gi568815577f:46536952_46763556|GENSCAN_predicted_CDS_1|3759_bp agcgtcatgaacaggatgcacgtggtcagcgtcccctacgcgctgatgaaggcgaaccca ctctcctggatccagaaagtgtgcttctataaagctcgggccgcgctggtgaagtcgcga gacatgcactggtctctcctagctcagcggggccagagggacgtcagcctcagctcactg cgcatgctgattgtggccgatggtgccaacccgtggtcgatctcctcctgtgacgccttc ctcaacgtcttccagtccagaggtctgaggccagaggtcatctgtccttgtgcaagttct cctgaggcgctgactgtcgccatccgcaggccacctgatctgggaggaccacctccaaga aaagcagtcctgtcgatgaacggtctaagttatggtgttatcagagtggatactgaagaa aagttgtcagtccttactgttcaggacgttggtcaggtgatgcctggagctaatgtatgt gttgtgaagttagaaggtaccccttatctttgtaaaactgatgaagtgggagaaatatgc gtcagttccagtgcaactggcacagcgtactatggattgcttggaatcacgaagaatgtg tttgaggcagttccggtcaccacaggaggagcacccatctttgacaggccattcaccagg acaggcctgctgggcttcatcgggcctgacaacctggtcttcatcgtgggcaaactggac gggctgatggtcactggagttcgcagacacaatgcagatgacgttgtggccaccgcactg gccgtggagcccatgaagtttgtctacagaggcaggatcgctgtgttctctgtgaccgtg ctgcacgacgaccggattgtcctggtggctgagcagcggccggatgcctcggaggaggac agcttccagtggatgagccgtgtgctgcaggccattgatagcatccaccaggtgggcgtg tactgtctggccctggttcctgccaacaccttgcccaaggctcctctcggagggattcac atttctgaaaccaaacagcgctttctggaagggacgctgcacccgtgtaatgtgctgatg tgccctcacacctgtgttaccaacctccccaaacctcgtcagaaacaaccagaggttgga ccagcctcaatgatcgtggggaacctggttgctgggaagagaatcgctcaggcttccggg agagagctcgcccacctggaggacagcgaccaggcacggaagttcctgttcctggctgac gtgctgcagtggcgtgcccacaccactcctgaccacccgctgttcttgctgctgaacgcc aagggcaccgtcacaagcactgcaacctgtgtccagctgcacaaaagggctgagagagtg gccgcggctctgatggagaagggaagactgagtgttggggaccatgtggctctggtctac ccaccaggggtggacctcattgccgcgttctatggctgcttgtactgtggctgcgtgcct gtcaccgtgcggcccccgcaccctcagaacctcggcaccacactgcccaccgtcaagatg atcgtggaggtcagcaagtctgcatgcgtcctcaccacgcaggctgtcacacggctgctc aggtccaaggaggctgctgctgccgtggacatcaggacctggcccaccatcctagacaca ggtgtggtgtggcctggctgccgtccaaaaacacacgactcccaaggacaaatcatgtgt cgcctcttgcctgtgaaagcccagagcgttcagaatacatgtgggaacactaatgttgct ggtgtctcctgtttaacagatgacatcccaaaaaagaagatagcaagcgttttcaggccc ccctcccccgatgtcctcgcatacttggacttcagcgtgtcaaccactgggatattagcg ggagtgaagatgtcgcacgcggccacaagcgccttatgccgctccataaagctgcagtgt gagctgtacccctcgcggcagatcgccatctgcctcgacccctactgtggccttggtttt gccctgtggtgtctgtgcagtgtctactcgggacaccaatcagtgctggtgcccccgctg gagctggagagcaacgtgtccctgtggctgtcggccgtcagccagtacaaggcccgcgtc accttctgctcctactctgtgatggagatgtgcaccaagggcctaggcgcacagacgggt gtcctcaggatgaagggggtgaacctgtcatgtgtgcgcacgtgcatggtggtcgccgag gagcggcccaggattgcgctgacccagtccttctccaagctcttcaaggacctgggcctg ccggcccgcgccgtaagcaccacgttcgggtgcagggtcaacgtggccatctgcctccag ggcacagctggcccggaccccacaaccgtctacgtggacatgcgggcactgcgccatgac agaggcaagggaaggagcttggccaccctgactccgttcactgccatggccacagccacc tgtggggagtcagagaggggtccagagagggacccagccaagccatgtgatgtccaggat acagactcctgcagggcagcagggatcctccccggcgtgaaggtcatcatcgcacacacc gagaccaaaggacccttgggagactcacacctgggagagatctgggtaagcagcccccac aatgccaccgggtactacaccgtttacggggaggaggcgcttcatgccgaccacttcagt gcccggctgagttttggagacacacagaccatctgggcaaggaccggctaccttggcttc cttcggcgaacagagctcactgatgccagtggagggcggcacgatgcactgtatgtggtt gggtctctggatgaaactctggagctcagaggcatgcggtaccaccccatcgacattgag acctctgtcatccgagcacacaggagcatcgctgagtgtgccgtattcacctggaccaac ctgctggtggtggtggtggagctggatgggctagagcaggatgccctggacctggtggcc ctggtgaccaacgtggtgctggaggagcactacctggtcgtgggagtggtggtcatcgtg gacccaggggtgatccctatcaactctcggggtgagaagcagcgcatgcacctgcgggac ggcttcctggctgaccagctggaccccatctatgtcgcctacaacataaagcacttcctg aattatttaaagaaatattttgaatctgccaagtacatttacaaaaacacggatgctggt attttaacagatggagagacaaggaaaggaaaggaaaggcctggcatgggcatttgggag ccaccaatggcagctgcttctagtaggccatctttccactttgctggtaatttttttaac aaccaagtgaatatgccaaatggtaacttggatccacagaaaaagcaaagagaaccagta aaggaaaccacatcattaggaaaagggggagagtactacatcaagggaatatcccatggg acaaaagaatctgaacaacagccttcagctctagaccttccctctgagagaggctaccca agtgagaaggaaccagaaaaccaactccggagcgctgccagcgctgagcgttcacctggt gatcaggcgcatcccgggaagcaggccgaagccaccacgcccttgcagggccacaccggc caaccagctgttatctgcatggatgaggaacgcatttaa >gi568815577f:46536952_46763556|GENSCAN_predicted_peptide_2|92_aa MSELEKAMVALIDVFHQYSGREGDKHKLKKSELKELINNELSHFLEEIKEQEVVDKVMET LDNDGDGECDFQEFMAFVAMVTTACHEFFEHE >gi568815577f:46536952_46763556|GENSCAN_predicted_CDS_2|279_bp atgtctgagctggagaaggccatggtggccctcatcgacgttttccaccaatattctgga agggagggagacaagcacaagctgaagaaatccgaactgaaggagctcatcaacaatgag ctttcccatttcttagaggaaatcaaagagcaggaggttgtggacaaagtcatggaaaca ctggacaatgatggagacggcgaatgtgacttccaggaattcatggcctttgttgccatg gttactactgcctgccacgagttctttgaacatgagtga >gi568815577f:46536952_46763556|GENSCAN_predicted_peptide_3|513_aa MLAVRCPARPCVFSVLWFLGGRLVTSRAKHLAQPLHPMSNGDGDRTDTLAPATQKGETEA KSRQSAEILKSNLTQGEEPAECSEAGLLQEGVQPEEFVAIADYAATDETQLSFLRGEKIL ILRQTTADWWWGERAGCCGYIPANHVGKHVDEYDPEDTWQDEEYFGSYGTLKLHLEMLAD QPRTTKYHSVILQNKESLTDKVILDVGCGTGIISLFCAHYARPRAVYAVEASEMAQHTGQ LVLQNGFADIITVYQQKVEDVVLPEKVDVLVSEWMGTCLLFEFMIESILYARDAWLKEDG VIWPTMAALHLVPCSADKDYRSKVLFWDNAYEFNLSALKSLAVKEFFSKPKYNHILKPED CLSEPCTILQLDMRTVQISDLETLRGELRFDIRKAGTLHGFTAWFSVHFQSLQEGQPPQV LSTGPFHPTTHWKQTLFMMDDPVPVHTGDVVTGSVVLQRNPVWRRHMSVALSWAVTSRQD PTSQKASVASQAAVSLGVSSVMRVLTGSLRIRV >gi568815577f:46536952_46763556|GENSCAN_predicted_CDS_3|1542_bp atgttggccgtcaggtgccctgccaggccctgcgtgttcagtgtgctctggttcctggga ggaagacttgtgacctcaagagccaagcacctggcccaacctcttcaccctatgtctaat ggggatggggataggacagacaccttggcccctgccactcagaagggggaaactgaggca aaaagcaggcagtctgcagagattctgaaatccaacctgacccagggagaagagcctgct gagtgcagtgaggccggtctcctgcaggagggagtacagccagaggagtttgtggccatc gcggactacgctgccaccgatgagacccagctcagttttttgagaggagaaaaaattctt atcctgagacaaaccactgcagattggtggtggggtgagcgtgcgggctgctgtgggtac attccggcaaaccatgtggggaagcacgtggatgagtacgaccccgaggacacgtggcag gatgaagagtacttcggcagctatggaactctgaaactccacttggagatgttggcagac cagccacgaacaactaaataccacagtgtcatcctgcagaataaagaatccctgacggat aaagtcatcctggacgtgggctgtgggactgggatcatcagtctcttctgtgcacactat gcgcggcctagagcggtgtacgcggtggaggccagtgagatggcacagcacacggggcag ctggtcctgcagaacggctttgctgacatcatcaccgtgtaccagcagaaggtggaggat gtggtgctgcccgagaaggtggacgtgctggtgtctgagtggatggggacctgcctgctg tttgagttcatgatcgagtccatcctgtatgcccgggatgcctggctgaaggaggacggg gtcatttggcccaccatggctgcgttgcaccttgtgccctgcagtgctgataaggattat cgtagcaaggtgctcttctgggacaacgcgtacgagttcaacctcagcgctctgaaatct ttagcagttaaggagtttttttcaaagcccaagtataaccacattttgaaaccagaagac tgtctctctgaaccgtgcactatattgcagttggacatgagaaccgtgcaaatttctgat ctagagaccctgaggggcgagctgcgcttcgacatcaggaaggcggggaccctgcacggc ttcacggcctggtttagcgtccacttccagagcctgcaggaggggcagccgccgcaggtg ctcagcaccgggcccttccaccccaccacacactggaagcagacgctgttcatgatggac gacccagtccctgtccatacaggagacgtggtcacgggttcagttgtgttgcagagaaac ccagtgtggagaaggcacatgtctgtggctctgagctgggctgtcacttccagacaagac cccacatctcaaaaagcctctgtggcttcccaggcagcagttagtctaggcgtgtcgtct gtgatgcgtgttctgactggcagccttcggatccgtgtgtga >gi568815577f:46536952_46763556|GENSCAN_predicted_peptide_4|244_aa MQRALCAWRALFACWALCACWALCAREGAATLTRKAASGSRCAQLPALGFQRPASSYSNQ RDLPSPGAPRGVCARFFPKREGAGGSAARGTGSGSVRRSQSEGLKGQEGSVERCPQPHAK KKIRMSLTFRRPKTLRLRRQPRYPRKSTPRRNKLGHYAIIKFPLTTESAVKKIEENNTLV FTVDVKANKHQIRQAVKKLYDSDVAKVTTLICPDKEKAYVRLAPDYDAFDVVTKLGSPKL SPAG >gi568815577f:46536952_46763556|GENSCAN_predicted_CDS_4|735_bp atgcagagggctctgtgcgcctggcgggctctgtttgcctgctgggctctgtgcgcctgc tgggctctgtgcgcccgggaaggtgcggccaccctcacgcggaaggcggccagcggatcc cggtgcgcgcagctcccagcgctggggttccagcgccccgcctcttcctatagcaaccag cgggacctgccgtcccccggggcaccccgaggggtctgcgcccgcttctttccgaaacgg gaaggcgctgggggctcggcagccagagggacgggttcagggagcgtccgccgaagccaa agcgaaggccttaaaggccaagaaggcagtgttgaaaggtgtccgcagccacacgcaaaa aagaagatccgcatgtcactcaccttcaggcggcccaagacactgcgactccggaggcag cccagatatcctcggaagagcacccccaggagaaacaagcttggccactatgctatcatc aagtttccgctgaccactgagtcggccgtgaagaagatagaagaaaacaacacgcttgtg ttcactgtggatgttaaagccaacaagcaccagatcagacaggctgtgaagaagctctat gacagtgatgtggccaaggtcaccaccctgatttgtcctgataaagagaaggcatatgtt cgacttgctcctgattatgatgctttcgatgttgtaacaaaattgggatcacctaaactg agtccagctggctaa