GENSCAN 1.0 Date run: 3-Oct-119 Time: 17:24:16 Sequence gi568815584r:100046764_100259587 : 212824 bp : 49.95% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 2888 2927 40 -3.26 1.01 Init + 18738 18748 11 2 2 98 113 3 0.022 3.93 1.02 Intr + 37924 38092 169 1 1 89 82 206 0.651 20.05 1.03 Intr + 50718 50895 178 2 1 121 107 218 0.994 26.49 1.04 Intr + 60177 60345 169 1 1 76 84 44 0.284 1.80 1.05 Intr + 61796 61862 67 2 1 133 29 36 0.503 1.11 1.06 Intr + 71141 71236 96 1 0 72 115 19 0.407 3.21 1.07 Intr + 72247 72305 59 0 2 95 77 58 0.476 3.08 1.08 Intr + 73471 73511 41 1 2 51 96 13 0.275 -3.73 1.09 Intr + 76776 76839 64 1 1 132 86 72 0.986 9.18 1.10 Intr + 78022 78106 85 1 1 100 29 71 0.872 2.22 1.11 Intr + 79797 80008 212 2 2 67 94 79 0.893 4.11 1.12 Intr + 80054 80428 375 2 0 63 28 154 0.214 1.03 1.13 Intr + 81814 81985 172 1 1 38 94 91 0.417 4.65 1.14 Intr + 82800 82921 122 2 2 82 90 107 0.889 9.59 1.15 Intr + 84701 84828 128 2 2 53 60 61 0.575 0.12 1.16 Intr + 85747 85866 120 2 0 71 84 50 0.076 3.37 1.17 Intr + 88002 88104 103 1 1 45 59 73 0.033 -0.67 1.18 Intr + 89142 89205 64 0 1 116 94 25 0.774 4.62 1.19 Intr + 89961 90056 96 2 0 91 81 52 0.840 5.01 1.20 Intr + 90563 90615 53 1 2 30 71 33 0.077 -6.49 1.21 Intr + 94417 94483 67 1 1 100 99 146 0.431 15.81 1.22 Intr + 94973 95030 58 1 1 108 113 112 0.183 14.16 1.23 Term + 97022 97098 77 0 2 33 47 88 0.017 -2.80 1.24 PlyA + 97452 97457 6 1.05 2.09 PlyA - 99679 99674 6 -1.95 2.08 Term - 100144 99998 147 1 0 119 52 194 0.052 16.80 2.07 Intr - 102947 102205 743 0 2 124 84 1725 0.979 166.87 2.06 Intr - 104764 104711 54 2 0 86 94 23 0.674 1.65 2.05 Intr - 112906 112743 164 0 2 46 80 206 0.475 15.22 2.04 Intr - 114012 113827 186 2 0 88 36 110 0.238 4.60 2.03 Intr - 119934 119842 93 2 0 110 107 50 0.514 8.18 2.02 Intr - 131696 131534 163 0 1 89 75 72 0.051 5.03 2.01 Init - 145681 145672 10 1 1 104 79 0 0.038 1.52 2.00 Prom - 156356 156317 40 -6.76 3.00 Prom + 156621 156660 40 -2.56 3.01 Init + 161532 161854 323 2 2 78 52 197 0.098 10.00 3.02 Intr + 167466 167797 332 0 2 72 40 139 0.005 2.77 3.03 Intr + 191666 191945 280 0 1 112 -4 181 0.771 7.84 3.04 Intr + 192436 193160 725 1 2 87 99 1694 0.940 161.48 3.05 Term + 193803 193975 173 1 2 82 40 147 0.977 7.29 3.06 PlyA + 194111 194116 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 27851 27754 98 2 2 103 107 63 0.906 9.48 S.002 Intr + 94973 95065 93 1 0 108 64 136 0.811 13.36 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:100046764_100259587|GENSCAN_predicted_peptide_1|861_aa MATSEQSICQARASVMVYDDTSKKWVPIKPGQQGFSRINIYHNTASNTFRVVGVKLQDQQ VVINYSIVKGLKYNQATPTFHQWRDARQVYGLNFASKEEATTFSNAMLFALNIMNSQEGV PGSVVGAAWSSRKSACPGAGLRWVCVTLSKSPPPSAGGMTPQARGSSHDTKMPGAWAPDV VSGRAFVRLHANSYPSRTARPVAGDAQFAWGQSSTYLTNMYCIDYRSKTMLFVFREEVSQ GTYGPSCYQCQFSMTPSPELFPKGPSSQRQVQNGPSPDEMDIQRSAEFCTPGPNPSCVGS LAVREWAYRLAVGWRQPWHTVALVERWRNLTLKPGPAGTAQAQSVVVSRVEESDCPAVIT FQTSDGAAPAAASGISRKKNLGHRWQPLHVDEAQGHTGRSVSSYGEPRSQSATTTSDGEG SKRAEQRLGESAQLGLATTCQPSPEWTFSKHLLCLRHRAKGWGHNYDCRCSSCPQEARCP LKETVNPPLSCHVVNTVASDVQVLKAYLGGPPPPPPPPVPPPPTGATPPPPPPLPAGGAQ GSSHDESSMSGLAAAIAGAKLRRVQRPEDASGGSSPSGTSKSDANRASSGGGGGGLMEEM NKLLAKRLSPASEGMQLSLPTVEGRREMNVDRKLLFSSTFTDTHPAYTPRHDHTEGLDIC LGLAPHQTGPRQAASFTPSFSHTGLSEDRELALGQWSYCSEDQFHHLKTCTQAWPPHRLC DCQRKILVPPPLRGPEQPASHLTPQIFAWCLDLPGCIGSWLMNVINTPALFASSNSPAHI PERPGTVKGSEPDHRMKPAGSVNDMALDAFDLDRMKQEILEEVVRELHKVKEEIIDEDSQ KPSQPQTPVHQSTHRSLGALL >gi568815584r:100046764_100259587|GENSCAN_predicted_CDS_1|2586_bp atggccacaagtgaacagagtatctgccaagcccgggcttccgtgatggtctacgatgac accagtaagaaatgggtaccaatcaaacctggccagcagggattcagccggatcaacatc taccacaacactgccagcaacaccttcagagtcgttggagtcaagttgcaggatcagcag gttgtgatcaattattcaatcgtgaaagggctgaagtacaatcaggccacgccaaccttc caccagtggcgagatgcccgccaggtctacggcttaaactttgcaagtaaagaagaggca accacgttctccaatgcaatgctgtttgccctgaacatcatgaattcccaagaaggagtg cctggaagtgtggtgggagcagcttggtctagcaggaagagtgcatgtcctggagccggg ctacgttgggtctgtgtgaccctgagcaagtcacctcctccatcagcaggtggaatgaca ccacaggcccgtggcagttcacatgacacgaagatgcccggtgcctgggctccagatgtt gtatcaggcagggcctttgtacgcctccatgccaacagctacccaagtcgcaccgctcgc ccagtggctggagatgctcagtttgcctggggccagtcatccacgtacttaacaaatatg tattgcattgactataggtcaaaaactatgctctttgtgttcagagaagaggtcagccag ggaacctatggcccgtcctgctaccagtgtcagttcagcatgactccttcacctgaactt ttccccaagggcccctccagccagcgtcaggtgcagaatggcccctctcctgatgagatg gacatccagagaagtgctgaattctgtaccccagggcccaatccaagctgtgtggggtca ctggccgtccgggaatgggcataccgcctggcggtcgggtggaggcagccgtggcacaca gtggcgcttgtggagcgctggcgaaacctgactctgaagccggggccggcgggtacagca caggcacagagtgtggtggtctccagagtggaggagtcagactgccctgctgtgattact ttccagacaagtgatggagcagcaccagcagcagcgtcaggaatctctagaaagaagaac ctcggccacaggtggcagcccctccacgtagatgaggcccagggacacacggggcgctct gtgagcagctatggggagcctaggtctcaaagtgcaaccaccacttcagatggtgagggt agcaagagagcagagcagaggcttggggagtctgctcagctgggtctggccactacctgc caaccttctccggagtggacattcagcaagcacctcctgtgtctcaggcaccgtgctaag ggctggggacacaactatgactgcagatgcagttcctgccctcaggaagcccgctgtcca ctcaaggagacagtgaacccacccttgtcatgtcacgtagtgaacactgtggcaagtgac gtgcaggtgctgaaggcctaccttggtgggcctccaccgccccccccacccccagtccca cctccacccactggggctaccccacctcccccacccccactgccagccggaggagcccag gggtccagccacgacgagagctccatgtcaggactggccgctgccatagctggggccaag ctgagaagagtccaacggccagaagacgcatctggaggctccagtcccagtgggacctca aagtccgatgccaaccgggcaagcagcgggggtggcggaggaggcctcatggaggaaatg aacaaactgctggccaagaggttgtctcccgcctctgagggcatgcagttgagtctccca acagtggagggcaggagggaaatgaacgtggacagaaagctcctgttcagcagcactttc acagacactcatcccgcctacactccccgccatgaccacacggagggcctggacatttgc ctgggcctggctcctcaccagacggggcctagacaagccgcctccttcacgccatcgttt tcccacacaggtttatctgaggaccgggagctggccctgggccaatggagttactgctcc gaggaccagttccaccacctgaaaacctgcacccaggcatggccgcctcacagactgtgc gactgccagaggaagatcctagtacctccccctctccggggacccgagcagccagccagc cacctaactcctcagatttttgcctggtgtcttgatttgccaggatgcataggttcctgg ctgatgaacgtgatcaatacccctgccctgtttgcttcttccaacagcccagctcacatt ccagagagacctggcacagtgaagggctcagaaccagatcacaggatgaagcctgctggg agcgtgaatgacatggccctggatgccttcgacttggaccggatgaagcaggagatccta gaggaggtggtgagagagctccacaaggtgaaggaggagatcatcgacgaggacagccag aagcccagccagccccagactccagtgcaccagagcacgcacaggagcctgggcgcgctg ctgtga >gi568815584r:100046764_100259587|GENSCAN_predicted_peptide_2|519_aa MAPGPQGLNSHFLDALWPFCLGTPSLGTRPFTLCGIMQPLYRALNSTGKGPVAPPPPRWR LTRPPRALQPLRENDPVGIFEAALSAAPWTAFGPELRKACQVYQTPGHGLPWDAQKHALT MCEPVAALADVSVSSQATASRGQALAHRHSRAGSAPRPPRQVQQSRAGRAPNGASRPTMG NSASRSDFEWVYTDQPHTQRRKEILAPWGSLSHVLQSSRCLQAAKYPAIKALMRPDPRLK WAVLVLVLVQMLACWLVRGLAWRWLLFWAYAFGGCVNHSLTLAIHDISHNAAFGTGRAAR NRWLAVFANLPVGVPYAASFKKYHVDHHRYLGGDGLDVDVPTRLEGWFFCTPARKLLWLV LQPFFYSLRPLCVHPKAVTRMEVLNTLVQLAADLAIFALWGLKPVVYLLASSFLGLGLHP ISGHFVAEHYMFLKGHETYSYYGPLNWITFNVGYHVEHHDFPSIPGYNLPLVRKIAPEYY DHLPQHHSWVKVLWDFVFEDSLGPYARVKRVYRLAKDGL >gi568815584r:100046764_100259587|GENSCAN_predicted_CDS_2|1560_bp atggccccaggccctcaaggcctgaactcacacttcttggatgccctgtggcccttctgt ctgggcactcccagccttggcaccaggccattcaccctctgtggcatcatgcagcctttg taccgagccctgaactcgaccgggaaggggcctgttgctccccctcctcccaggtggcgt cttacccgacctccgagggctctgcagccgctgcgggagaatgaccctgtcggtattttt gaggctgctttgagcgcggccccctggactgcctttggccctgagctaaggaaggcctgc caagtgtaccaaaccccgggccatggacttccatgggacgcacagaagcacgccctcact atgtgcgagcctgtggcagcgctggctgatgtgtcagtcagctcccaggcaactgccagc cgaggccaggctctggcccacaggcacagcagagctggttccgcgccgcggccgccgcga caggtgcagcagagccgagccggccgcgctccgaacggcgcctcccgccccaccatgggc aacagcgcgagccgcagcgacttcgagtgggtctacaccgaccagccgcacacgcagcgg cgcaaggagatactggctccttgggggtccctcagccacgtgctgcagtccagcaggtgc ctgcaggcagccaagtacccggccatcaaggccctgatgcggccagacccgcgcctcaag tgggcggtgctggtgctggtgctggtgcagatgctggcctgctggctggtgcgcgggctg gcctggcgctggctgctgttctgggcctacgcctttggtggctgcgtgaaccactcgctg acgctggccatccacgacatctcgcacaacgcggccttcggcacgggccgtgcggcacgc aaccgctggctggccgtgttcgccaacctgcccgtgggtgtgccctacgccgcctccttc aagaagtaccacgtggaccaccaccgctacctgggcggcgacgggctggacgtggacgtg cccacgcgtctggagggctggttcttctgcacacccgcccgcaagctgctctggctggtg ctgcagcccttcttctactcactacggccgctctgcgtccaccccaaggccgtgacccgc atggaggtgctcaacacgctggtgcagctggcggccgacctggccatctttgccctttgg gggctcaagcccgtggtctacctgctggccagctccttcctgggcctgggcctgcacccc atctcgggccacttcgtggccgagcactacatgttcctcaagggccacgagacctactcc tactatgggcctctcaactggatcaccttcaatgtgggctaccacgtggagcaccacgac ttccccagcatcccgggctacaacctgccgctggtgcggaagatcgcgcccgagtactac gaccacctgccgcagcaccactcctgggtgaaggtgctctgggattttgtgtttgaggac tccctggggccctatgccagggtgaagcgggtgtacaggctggcaaaagatggtctgtga >gi568815584r:100046764_100259587|GENSCAN_predicted_peptide_3|610_aa MAGCRSRALPRGEAAKARQEIERPALLGDPVRPPQLLAQVLSPSLPAPAGRSECGARQAH AHPELYLARKRGVQPRFQPMPLPPHLPANQGSRLWPRPAQRRAPTVQRAARKLRQEEGRR CPGRLRRPAMAPLLSPRTRKAASQAVAAFGARFLSGRAFAGRLPVQVSPGSVTATLPAER VSQEAGARSPRRSRAAAALGAEGGGYDRDTGGAAASSRGSRPRGARGRADRSPAPKAAAV AATPRSRVRPRPEAAGFVAVAPRRAAAARHREAGGGGGGGGALTSRAAGQPGRVRAAPPP VPSAPIREEPGDRGAEAAAAVAAEPSAMASGDTLYIATDGSEMPAEIVELHEIEVETIPV ETIETTVVGEEEEEDDDDEDGGGGDHGGGGGHGHAGHHHHHHHHHHHPPMIALQPLVTDD PTQVHHHQEVILVQTREEVVGGDDSDGLRAEDGFEDQILIPVPAPAGGDDDYIEQTLVTV AAAGKSGGGGSSSSGGGRVKKGGGKKSGKKSYLSGGAGAAGGGGADPGNKKWEQKQVQIK TLEGEFSVTMWSSAPGWTLPRRSRSPKSSFAAGRDLGLHVGLACAGSERQSEPVTAVASE FLAFSPSCGT >gi568815584r:100046764_100259587|GENSCAN_predicted_CDS_3|1833_bp atggcgggctgcaggtcccgagccctgccccgcggggaggcagctaaggcccggcaagaa atcgagcggccggcactgctgggggacccggtgcgccctccgcagctgctggcccaggtg ctaagcccctcactgcctgcgccggctggccgctccgagtgtggggcccgccaagcccac gcccacccggaactctatctggcccgcaagcgcggcgtgcagccccggttccagcccatg cctctccctccacacctcccggcaaaccaaggaagccggctctggcctcggccagcccag agaagggctcccacggtgcagcgtgccgcgaggaagctgcggcaggaagaagggcggagg tgccccgggcgcctgagacgcccggcaatggccccgctactctcgccgcgcacacgcaaa gccgcttcccaggcagttgcggcgttcggggcccgcttcctctcgggccgggccttcgcc ggccgcctgcccgtacaggtgtcccccggctccgtgactgcgactctcccagcggagagg gtctcccaggaagcaggggcgaggagccctcgaaggtctcgggcggcagcggctctcggc gcagagggcggagggtacgaccgcgacacaggcggggcggcagctagctcgcgaggctcc cgcccccgtggtgcccggggccgcgcggaccgctcaccggctcccaaggcagcggctgta gcggcgacgccccgttcccgagtgcggccccggcccgaggcggcgggttttgtggctgtt gcaccgcgaagggcggcagccgcgcgacaccgggaagcgggaggcggtggcggcggcggc ggcgcgctgacgtcacgcgccgcgggccagccagggcgcgtgcgagccgccccgcccccg gtcccatcggccccaatccgggaggagcccggcgaccgaggagccgaggccgccgcggcc gtggcggcggagccctcagccatggcctcgggcgacaccctctacatcgccacggacggc tcggagatgccggccgagatcgtggagctgcacgagatcgaggtggagaccatcccggtg gagaccatcgagaccacagtggtgggcgaggaggaggaggaggacgacgacgacgaggac ggcggcggtggcgaccacggcggcgggggcggccacgggcacgccggccaccaccaccac caccatcaccaccaccaccacccgcccatgatcgctctgcagccgctggtcaccgacgac ccgacccaggtgcaccaccaccaggaggtgatcctggtgcagacgcgcgaggaggtggtg ggcggcgacgactcggacgggctgcgcgccgaggacggcttcgaggatcagattctcatc ccggtgcccgcgccggccggcggcgacgacgactacattgaacaaacgctggtcaccgtg gcggcggccggcaagagcggcggcggcggctcgtcgtcgtcgggaggcggccgcgtcaag aagggcggcggcaagaagagcggcaagaagagttacctcagcggcggggccggcgcggcg ggcggcggcggcgccgacccgggcaacaagaagtgggagcagaagcaggtgcagatcaag accctggagggcgagttctcggtcaccatgtggtcctcagcgccgggctggaccctgccc cggcggtcacgctcgcccaagtcgtcgtttgctgcggggcgggacttggggctgcacgta gggctcgcgtgtgcgggctccgagcgtcagtcggagcctgtcaccgccgttgccagcgaa ttcctggccttttcgccttcctgcggtacctag