GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:36:10 Sequence gi568815584f:99984685_100241793 : 257109 bp : 48.25% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 21440 21565 126 0 0 31 91 48 0.002 0.05 1.02 Intr + 48294 48461 168 1 0 94 31 78 0.050 2.62 1.03 Intr + 55766 55887 122 0 2 91 1 65 0.032 -1.69 1.04 Intr + 61061 61193 133 1 1 87 87 60 0.089 6.02 1.05 Intr + 82464 82523 60 1 0 76 82 82 0.027 5.11 1.06 Intr + 89756 89777 22 0 1 63 93 14 0.001 -4.00 1.07 Intr + 100003 100171 169 1 1 89 82 206 0.650 20.05 1.08 Intr + 112797 112974 178 2 1 121 107 218 0.994 26.49 1.09 Intr + 122256 122424 169 1 1 76 84 44 0.284 1.80 1.10 Intr + 123875 123941 67 2 1 133 29 36 0.503 1.11 1.11 Intr + 133220 133315 96 1 0 72 115 19 0.407 3.21 1.12 Intr + 134326 134384 59 0 2 95 77 58 0.476 3.08 1.13 Intr + 135550 135590 41 1 2 51 96 13 0.275 -3.73 1.14 Intr + 138855 138918 64 1 1 132 86 72 0.986 9.18 1.15 Intr + 140101 140185 85 1 1 100 29 71 0.872 2.22 1.16 Intr + 141876 142087 212 2 2 67 94 79 0.893 4.11 1.17 Intr + 142133 142507 375 2 0 63 28 154 0.214 1.03 1.18 Intr + 143893 144064 172 1 1 38 94 91 0.417 4.65 1.19 Intr + 144879 145000 122 2 2 82 90 107 0.889 9.59 1.20 Intr + 146780 146907 128 2 2 53 60 61 0.575 0.12 1.21 Intr + 147826 147945 120 2 0 71 84 50 0.076 3.37 1.22 Intr + 150081 150183 103 1 1 45 59 73 0.033 -0.67 1.23 Intr + 151221 151284 64 0 1 116 94 25 0.774 4.62 1.24 Intr + 152040 152135 96 2 0 91 81 52 0.840 5.01 1.25 Intr + 152642 152694 53 1 2 30 71 33 0.077 -6.49 1.26 Intr + 156496 156562 67 1 1 100 99 146 0.431 15.81 1.27 Intr + 157052 157109 58 1 1 108 113 112 0.183 14.16 1.28 Term + 159101 159177 77 0 2 33 47 88 0.017 -2.80 1.29 PlyA + 159531 159536 6 1.05 2.09 PlyA - 161758 161753 6 -1.95 2.08 Term - 162223 162077 147 1 0 119 52 194 0.052 16.80 2.07 Intr - 165026 164284 743 0 2 124 84 1725 0.979 166.87 2.06 Intr - 166843 166790 54 2 0 86 94 23 0.674 1.65 2.05 Intr - 174985 174822 164 0 2 46 80 206 0.475 15.22 2.04 Intr - 176091 175906 186 2 0 88 36 110 0.238 4.60 2.03 Intr - 182013 181921 93 2 0 110 107 50 0.514 8.18 2.02 Intr - 193775 193613 163 0 1 89 75 72 0.051 5.03 2.01 Init - 207760 207751 10 1 1 104 79 0 0.038 1.52 2.00 Prom - 218435 218396 40 -6.76 3.00 Prom + 218700 218739 40 -2.56 3.01 Init + 223611 223933 323 2 2 78 52 197 0.098 10.00 3.02 Intr + 229545 229876 332 0 2 72 40 139 0.005 2.77 3.03 Intr + 253745 254024 280 0 1 112 -4 181 0.771 7.84 3.04 Intr + 254515 255239 725 1 2 87 99 1694 0.940 161.48 3.05 Term + 255882 256054 173 1 2 82 40 147 0.962 7.29 3.06 PlyA + 256190 256195 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 89930 89833 98 2 2 103 107 63 0.911 9.48 S.002 Intr + 157052 157144 93 1 0 108 64 136 0.811 13.36 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:99984685_100241793|GENSCAN_predicted_peptide_1|1068_aa XRIELMKPNGKKTKTTTKTETKTKQSSKTNDGTTYTITECFNGPRRGNGSKEDSCCLFDV CSFLSMKAFGDFDGSAESSEQHLFVDGKDQPERLKGPNAWEQREGRVWLLVCPSESSHLG RTERSLTASQQTQFNLRKEISKAKSASGVCSPWVSVFVPNGQNELWNESMNSWNALRPVG TDDCPWPGTIVTVLVEAAFVELQALFDLQQSEQSICQARASVMVYDDTSKKWVPIKPGQQ GFSRINIYHNTASNTFRVVGVKLQDQQVVINYSIVKGLKYNQATPTFHQWRDARQVYGLN FASKEEATTFSNAMLFALNIMNSQEGVPGSVVGAAWSSRKSACPGAGLRWVCVTLSKSPP PSAGGMTPQARGSSHDTKMPGAWAPDVVSGRAFVRLHANSYPSRTARPVAGDAQFAWGQS STYLTNMYCIDYRSKTMLFVFREEVSQGTYGPSCYQCQFSMTPSPELFPKGPSSQRQVQN GPSPDEMDIQRSAEFCTPGPNPSCVGSLAVREWAYRLAVGWRQPWHTVALVERWRNLTLK PGPAGTAQAQSVVVSRVEESDCPAVITFQTSDGAAPAAASGISRKKNLGHRWQPLHVDEA QGHTGRSVSSYGEPRSQSATTTSDGEGSKRAEQRLGESAQLGLATTCQPSPEWTFSKHLL CLRHRAKGWGHNYDCRCSSCPQEARCPLKETVNPPLSCHVVNTVASDVQVLKAYLGGPPP PPPPPVPPPPTGATPPPPPPLPAGGAQGSSHDESSMSGLAAAIAGAKLRRVQRPEDASGG SSPSGTSKSDANRASSGGGGGGLMEEMNKLLAKRLSPASEGMQLSLPTVEGRREMNVDRK LLFSSTFTDTHPAYTPRHDHTEGLDICLGLAPHQTGPRQAASFTPSFSHTGLSEDRELAL GQWSYCSEDQFHHLKTCTQAWPPHRLCDCQRKILVPPPLRGPEQPASHLTPQIFAWCLDL PGCIGSWLMNVINTPALFASSNSPAHIPERPGTVKGSEPDHRMKPAGSVNDMALDAFDLD RMKQEILEEVVRELHKVKEEIIDEDSQKPSQPQTPVHQSTHRSLGALL >gi568815584f:99984685_100241793|GENSCAN_predicted_CDS_1|3207_bp ngtcggatagagttgatgaaacccaatgggaaaaagaccaaaacaacaacaaaaacagaa acaaagacaaaacagtcaagcaaaacaaacgatggcacaacttatacaattactgaatgc tttaatggccctagaagaggaaatggcagtaaagaggacagttgctgcctttttgatgtc tgttcatttctaagcatgaaggcttttggggattttgatggaagtgctgaatccagtgag caacatctgtttgtggatgggaaagatcagccggaaagactaaaagggccgaatgcatgg gagcagagagaaggtagagtgtggctgctcgtctgcccctcagaaagcagccatctgggc aggacagaacggagcctgactgctagtcagcaaacgcagttcaacctgagaaaggagatc agtaaagctaaatcagccagtggtgtttgttctccctgggtgtctgtgtttgtgcctaat gggcagaatgaattatggaatgaatcaatgaattcttggaatgctctgaggcctgtgggc actgatgattgtccctggcctggcaccatagtgacagtcctggtggaagcagcgttcgtg gagctgcaagccttatttgatctccagcaaagtgaacagagtatctgccaagcccgggct tccgtgatggtctacgatgacaccagtaagaaatgggtaccaatcaaacctggccagcag ggattcagccggatcaacatctaccacaacactgccagcaacaccttcagagtcgttgga gtcaagttgcaggatcagcaggttgtgatcaattattcaatcgtgaaagggctgaagtac aatcaggccacgccaaccttccaccagtggcgagatgcccgccaggtctacggcttaaac tttgcaagtaaagaagaggcaaccacgttctccaatgcaatgctgtttgccctgaacatc atgaattcccaagaaggagtgcctggaagtgtggtgggagcagcttggtctagcaggaag agtgcatgtcctggagccgggctacgttgggtctgtgtgaccctgagcaagtcacctcct ccatcagcaggtggaatgacaccacaggcccgtggcagttcacatgacacgaagatgccc ggtgcctgggctccagatgttgtatcaggcagggcctttgtacgcctccatgccaacagc tacccaagtcgcaccgctcgcccagtggctggagatgctcagtttgcctggggccagtca tccacgtacttaacaaatatgtattgcattgactataggtcaaaaactatgctctttgtg ttcagagaagaggtcagccagggaacctatggcccgtcctgctaccagtgtcagttcagc atgactccttcacctgaacttttccccaagggcccctccagccagcgtcaggtgcagaat ggcccctctcctgatgagatggacatccagagaagtgctgaattctgtaccccagggccc aatccaagctgtgtggggtcactggccgtccgggaatgggcataccgcctggcggtcggg tggaggcagccgtggcacacagtggcgcttgtggagcgctggcgaaacctgactctgaag ccggggccggcgggtacagcacaggcacagagtgtggtggtctccagagtggaggagtca gactgccctgctgtgattactttccagacaagtgatggagcagcaccagcagcagcgtca ggaatctctagaaagaagaacctcggccacaggtggcagcccctccacgtagatgaggcc cagggacacacggggcgctctgtgagcagctatggggagcctaggtctcaaagtgcaacc accacttcagatggtgagggtagcaagagagcagagcagaggcttggggagtctgctcag ctgggtctggccactacctgccaaccttctccggagtggacattcagcaagcacctcctg tgtctcaggcaccgtgctaagggctggggacacaactatgactgcagatgcagttcctgc cctcaggaagcccgctgtccactcaaggagacagtgaacccacccttgtcatgtcacgta gtgaacactgtggcaagtgacgtgcaggtgctgaaggcctaccttggtgggcctccaccg ccccccccacccccagtcccacctccacccactggggctaccccacctcccccaccccca ctgccagccggaggagcccaggggtccagccacgacgagagctccatgtcaggactggcc gctgccatagctggggccaagctgagaagagtccaacggccagaagacgcatctggaggc tccagtcccagtgggacctcaaagtccgatgccaaccgggcaagcagcgggggtggcgga ggaggcctcatggaggaaatgaacaaactgctggccaagaggttgtctcccgcctctgag ggcatgcagttgagtctcccaacagtggagggcaggagggaaatgaacgtggacagaaag ctcctgttcagcagcactttcacagacactcatcccgcctacactccccgccatgaccac acggagggcctggacatttgcctgggcctggctcctcaccagacggggcctagacaagcc gcctccttcacgccatcgttttcccacacaggtttatctgaggaccgggagctggccctg ggccaatggagttactgctccgaggaccagttccaccacctgaaaacctgcacccaggca tggccgcctcacagactgtgcgactgccagaggaagatcctagtacctccccctctccgg ggacccgagcagccagccagccacctaactcctcagatttttgcctggtgtcttgatttg ccaggatgcataggttcctggctgatgaacgtgatcaatacccctgccctgtttgcttct tccaacagcccagctcacattccagagagacctggcacagtgaagggctcagaaccagat cacaggatgaagcctgctgggagcgtgaatgacatggccctggatgccttcgacttggac cggatgaagcaggagatcctagaggaggtggtgagagagctccacaaggtgaaggaggag atcatcgacgaggacagccagaagcccagccagccccagactccagtgcaccagagcacg cacaggagcctgggcgcgctgctgtga >gi568815584f:99984685_100241793|GENSCAN_predicted_peptide_2|519_aa MAPGPQGLNSHFLDALWPFCLGTPSLGTRPFTLCGIMQPLYRALNSTGKGPVAPPPPRWR LTRPPRALQPLRENDPVGIFEAALSAAPWTAFGPELRKACQVYQTPGHGLPWDAQKHALT MCEPVAALADVSVSSQATASRGQALAHRHSRAGSAPRPPRQVQQSRAGRAPNGASRPTMG NSASRSDFEWVYTDQPHTQRRKEILAPWGSLSHVLQSSRCLQAAKYPAIKALMRPDPRLK WAVLVLVLVQMLACWLVRGLAWRWLLFWAYAFGGCVNHSLTLAIHDISHNAAFGTGRAAR NRWLAVFANLPVGVPYAASFKKYHVDHHRYLGGDGLDVDVPTRLEGWFFCTPARKLLWLV LQPFFYSLRPLCVHPKAVTRMEVLNTLVQLAADLAIFALWGLKPVVYLLASSFLGLGLHP ISGHFVAEHYMFLKGHETYSYYGPLNWITFNVGYHVEHHDFPSIPGYNLPLVRKIAPEYY DHLPQHHSWVKVLWDFVFEDSLGPYARVKRVYRLAKDGL >gi568815584f:99984685_100241793|GENSCAN_predicted_CDS_2|1560_bp atggccccaggccctcaaggcctgaactcacacttcttggatgccctgtggcccttctgt ctgggcactcccagccttggcaccaggccattcaccctctgtggcatcatgcagcctttg taccgagccctgaactcgaccgggaaggggcctgttgctccccctcctcccaggtggcgt cttacccgacctccgagggctctgcagccgctgcgggagaatgaccctgtcggtattttt gaggctgctttgagcgcggccccctggactgcctttggccctgagctaaggaaggcctgc caagtgtaccaaaccccgggccatggacttccatgggacgcacagaagcacgccctcact atgtgcgagcctgtggcagcgctggctgatgtgtcagtcagctcccaggcaactgccagc cgaggccaggctctggcccacaggcacagcagagctggttccgcgccgcggccgccgcga caggtgcagcagagccgagccggccgcgctccgaacggcgcctcccgccccaccatgggc aacagcgcgagccgcagcgacttcgagtgggtctacaccgaccagccgcacacgcagcgg cgcaaggagatactggctccttgggggtccctcagccacgtgctgcagtccagcaggtgc ctgcaggcagccaagtacccggccatcaaggccctgatgcggccagacccgcgcctcaag tgggcggtgctggtgctggtgctggtgcagatgctggcctgctggctggtgcgcgggctg gcctggcgctggctgctgttctgggcctacgcctttggtggctgcgtgaaccactcgctg acgctggccatccacgacatctcgcacaacgcggccttcggcacgggccgtgcggcacgc aaccgctggctggccgtgttcgccaacctgcccgtgggtgtgccctacgccgcctccttc aagaagtaccacgtggaccaccaccgctacctgggcggcgacgggctggacgtggacgtg cccacgcgtctggagggctggttcttctgcacacccgcccgcaagctgctctggctggtg ctgcagcccttcttctactcactacggccgctctgcgtccaccccaaggccgtgacccgc atggaggtgctcaacacgctggtgcagctggcggccgacctggccatctttgccctttgg gggctcaagcccgtggtctacctgctggccagctccttcctgggcctgggcctgcacccc atctcgggccacttcgtggccgagcactacatgttcctcaagggccacgagacctactcc tactatgggcctctcaactggatcaccttcaatgtgggctaccacgtggagcaccacgac ttccccagcatcccgggctacaacctgccgctggtgcggaagatcgcgcccgagtactac gaccacctgccgcagcaccactcctgggtgaaggtgctctgggattttgtgtttgaggac tccctggggccctatgccagggtgaagcgggtgtacaggctggcaaaagatggtctgtga >gi568815584f:99984685_100241793|GENSCAN_predicted_peptide_3|610_aa MAGCRSRALPRGEAAKARQEIERPALLGDPVRPPQLLAQVLSPSLPAPAGRSECGARQAH AHPELYLARKRGVQPRFQPMPLPPHLPANQGSRLWPRPAQRRAPTVQRAARKLRQEEGRR CPGRLRRPAMAPLLSPRTRKAASQAVAAFGARFLSGRAFAGRLPVQVSPGSVTATLPAER VSQEAGARSPRRSRAAAALGAEGGGYDRDTGGAAASSRGSRPRGARGRADRSPAPKAAAV AATPRSRVRPRPEAAGFVAVAPRRAAAARHREAGGGGGGGGALTSRAAGQPGRVRAAPPP VPSAPIREEPGDRGAEAAAAVAAEPSAMASGDTLYIATDGSEMPAEIVELHEIEVETIPV ETIETTVVGEEEEEDDDDEDGGGGDHGGGGGHGHAGHHHHHHHHHHHPPMIALQPLVTDD PTQVHHHQEVILVQTREEVVGGDDSDGLRAEDGFEDQILIPVPAPAGGDDDYIEQTLVTV AAAGKSGGGGSSSSGGGRVKKGGGKKSGKKSYLSGGAGAAGGGGADPGNKKWEQKQVQIK TLEGEFSVTMWSSAPGWTLPRRSRSPKSSFAAGRDLGLHVGLACAGSERQSEPVTAVASE FLAFSPSCGT >gi568815584f:99984685_100241793|GENSCAN_predicted_CDS_3|1833_bp atggcgggctgcaggtcccgagccctgccccgcggggaggcagctaaggcccggcaagaa atcgagcggccggcactgctgggggacccggtgcgccctccgcagctgctggcccaggtg ctaagcccctcactgcctgcgccggctggccgctccgagtgtggggcccgccaagcccac gcccacccggaactctatctggcccgcaagcgcggcgtgcagccccggttccagcccatg cctctccctccacacctcccggcaaaccaaggaagccggctctggcctcggccagcccag agaagggctcccacggtgcagcgtgccgcgaggaagctgcggcaggaagaagggcggagg tgccccgggcgcctgagacgcccggcaatggccccgctactctcgccgcgcacacgcaaa gccgcttcccaggcagttgcggcgttcggggcccgcttcctctcgggccgggccttcgcc ggccgcctgcccgtacaggtgtcccccggctccgtgactgcgactctcccagcggagagg gtctcccaggaagcaggggcgaggagccctcgaaggtctcgggcggcagcggctctcggc gcagagggcggagggtacgaccgcgacacaggcggggcggcagctagctcgcgaggctcc cgcccccgtggtgcccggggccgcgcggaccgctcaccggctcccaaggcagcggctgta gcggcgacgccccgttcccgagtgcggccccggcccgaggcggcgggttttgtggctgtt gcaccgcgaagggcggcagccgcgcgacaccgggaagcgggaggcggtggcggcggcggc ggcgcgctgacgtcacgcgccgcgggccagccagggcgcgtgcgagccgccccgcccccg gtcccatcggccccaatccgggaggagcccggcgaccgaggagccgaggccgccgcggcc gtggcggcggagccctcagccatggcctcgggcgacaccctctacatcgccacggacggc tcggagatgccggccgagatcgtggagctgcacgagatcgaggtggagaccatcccggtg gagaccatcgagaccacagtggtgggcgaggaggaggaggaggacgacgacgacgaggac ggcggcggtggcgaccacggcggcgggggcggccacgggcacgccggccaccaccaccac caccatcaccaccaccaccacccgcccatgatcgctctgcagccgctggtcaccgacgac ccgacccaggtgcaccaccaccaggaggtgatcctggtgcagacgcgcgaggaggtggtg ggcggcgacgactcggacgggctgcgcgccgaggacggcttcgaggatcagattctcatc ccggtgcccgcgccggccggcggcgacgacgactacattgaacaaacgctggtcaccgtg gcggcggccggcaagagcggcggcggcggctcgtcgtcgtcgggaggcggccgcgtcaag aagggcggcggcaagaagagcggcaagaagagttacctcagcggcggggccggcgcggcg ggcggcggcggcgccgacccgggcaacaagaagtgggagcagaagcaggtgcagatcaag accctggagggcgagttctcggtcaccatgtggtcctcagcgccgggctggaccctgccc cggcggtcacgctcgcccaagtcgtcgtttgctgcggggcgggacttggggctgcacgta gggctcgcgtgtgcgggctccgagcgtcagtcggagcctgtcaccgccgttgccagcgaa ttcctggccttttcgccttcctgcggtacctag