GENSCAN 1.0 Date run: 16-Jul-119 Time: 16:05:09 Sequence gi568815587r:128739128_128940265 : 201138 bp : 47.53% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4989 5141 153 2 0 83 103 121 0.797 13.08 1.02 Intr + 18988 19199 212 0 2 106 115 297 0.989 31.91 1.03 Intr + 22118 22292 175 2 1 47 90 39 0.493 -0.06 1.04 Intr + 25804 25907 104 0 2 48 89 72 0.213 2.27 1.05 Intr + 28991 29145 155 2 2 103 91 281 0.999 29.62 1.06 Intr + 30021 30146 126 1 0 40 75 105 0.845 5.05 1.07 Intr + 33655 33858 204 2 0 105 106 176 0.547 20.27 1.08 Intr + 42831 42896 66 1 0 117 115 23 0.734 6.78 1.09 Intr + 55638 55672 35 1 2 89 64 31 0.045 -1.26 1.10 Term + 61907 62065 159 1 0 73 43 170 0.173 8.94 1.11 PlyA + 63280 63285 6 1.05 2.00 Prom + 65908 65947 40 -6.56 2.01 Init + 67127 67367 241 1 1 77 116 160 0.974 15.75 2.02 Intr + 68053 68112 60 2 0 134 82 5 0.888 3.21 2.03 Intr + 70030 70077 48 2 0 77 105 59 0.973 5.15 2.04 Intr + 71332 71693 362 2 2 71 61 518 0.427 42.34 2.05 Term + 81484 81504 21 0 0 123 34 24 0.118 -1.19 2.06 PlyA + 82252 82257 6 1.05 3.03 PlyA - 84731 84726 6 1.05 3.02 Term - 85777 85412 366 1 0 108 52 101 0.473 3.10 3.01 Init - 88917 88876 42 0 0 49 94 61 0.569 3.28 3.00 Prom - 90532 90493 40 -4.56 4.03 PlyA - 90930 90925 6 1.05 4.02 Term - 101137 99998 1140 1 0 77 42 611 0.881 47.36 4.01 Init - 103290 103255 36 0 0 85 111 55 0.987 7.51 4.00 Prom - 113009 112970 40 -1.96 5.06 PlyA - 113175 113170 6 -1.95 5.05 Term - 113688 113417 272 1 2 108 47 177 0.994 11.25 5.04 Intr - 115036 114940 97 1 1 85 61 88 0.730 5.38 5.03 Intr - 120910 120886 25 0 1 65 107 52 0.236 2.83 5.02 Intr - 121534 121480 55 2 1 78 58 45 0.554 -1.46 5.01 Init - 125876 125813 64 2 1 104 78 41 0.522 6.01 5.00 Prom - 128667 128628 40 -7.26 6.00 Prom + 130976 131015 40 -5.76 6.01 Init + 131986 132000 15 0 0 62 115 -5 0.028 0.01 6.02 Intr + 137184 137429 246 2 0 67 55 129 0.006 5.16 6.03 Intr + 141234 141369 136 2 1 25 44 98 0.003 -0.76 6.04 Intr + 151320 151397 78 1 0 58 91 88 0.930 5.62 6.05 Term + 151513 151856 344 2 2 117 38 109 0.354 3.47 6.06 PlyA + 154317 154322 6 1.05 7.07 PlyA - 154512 154507 6 -0.45 7.06 Term - 155443 155316 128 2 2 103 47 106 0.983 6.44 7.05 Intr - 161582 161435 148 2 1 40 113 85 0.706 6.01 7.04 Intr - 162758 162687 72 2 0 51 94 62 0.416 2.70 7.03 Intr - 163102 163042 61 0 1 6 53 60 0.163 -6.96 7.02 Intr - 164327 164240 88 0 1 56 105 54 0.487 2.93 7.01 Init - 165439 165304 136 1 1 61 95 131 0.820 9.71 7.00 Prom - 169224 169185 40 -3.06 8.00 Prom + 169505 169544 40 -4.66 8.01 Init + 172147 173083 937 0 1 73 110 1543 0.708 148.91 8.02 Intr + 177282 177572 291 1 0 96 55 281 0.858 22.71 8.03 Intr + 193377 193492 116 1 2 3 47 144 0.441 1.97 8.04 Intr + 197607 197712 106 2 1 64 86 50 0.298 2.19 8.05 Intr + 197810 197900 91 0 1 93 34 63 0.210 0.45 8.06 Intr + 197927 198083 157 2 1 41 83 122 0.235 7.01 8.07 Intr + 198303 198448 146 2 2 70 105 47 0.529 3.68 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 62041 61915 127 1 1 65 22 112 0.805 2.84 S.002 Init + 105126 105150 25 2 1 95 82 43 0.851 4.09 S.003 Term - 137455 137106 350 2 2 31 39 243 0.850 8.35 S.004 Init + 150106 150115 10 0 1 39 116 9 0.923 -0.58 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:128739128_128940265|GENSCAN_predicted_peptide_1|462_aa MTINGAMKLMNTELSELTRRQSGHIMHQQKWDNFERDKHFLEQQHKAELFQEALSVVSDD QSLFDSAYGAAAHLPKADMTASGSPDYGQPHKINPLPPQQEWINQPVRVNVKREYDHMNG SSHMTLSLTSFGFLYLNLTFSEASPGHLIETSASSLFTLLRVSVLVLITSLTDTWAGPET VLVQKPGTEDRAIFQRRNTLAGERPNRSPVQLESRESPVDCSVSKCSKLVGGGESNPMNY NSYMDEKNGPPPPNMTTNERRVIVPAENKIIVQSFSPLAGHSTPASTSKLSCLHKKGVYK LPRVELVPDPTLWTQEHVRQWLEWAIKEYSLMEIDTSFFQNMDGKELCKMNKEDFLRATT LYNTEVLLSHLSYLRESSLLAYNTTSHTDQSSRLSVKEGICKIEKFNMGQKQTKIINWYT GADGHIFTHWAFGCHLKFVSSQEADSRHHIREGLHARSGFFS >gi568815587r:128739128_128940265|GENSCAN_predicted_CDS_1|1389_bp atgacaatcaatggggccatgaagctcatgaacactgagttatctgagcttactcgtaga cagagtggccatataatgcatcaacaaaagtgggataattttgagcgggataagcatttc ctggaacaacagcataaagcggaactgttccaggaggctctgtcggtggtgagcgacgac cagtccctctttgactcagcgtacggagcggcagcccatctccccaaggccgacatgact gcctcggggagtcctgactacgggcagccccacaagatcaaccccctcccaccacagcag gagtggatcaatcagccagtgagggtcaacgtcaagcgggagtatgaccacatgaatgga tccagtcatatgactctctccctcacgagttttgggtttctctacctgaacctcaccttt tcagaggcctcccctggccacctgatcgaaaccagcgcctcctcactctttaccctgctt agagtgtctgtcctagttcttatcacctccctcactgacacgtgggcagggcctgaaacg gtcctagtgcagaagcctgggaccgaggacagggccattttccagaggagaaacaccctg gcgggcgagcggccaaaccgatcacctgtccagctggagagcagggagtctccggtggac tgcagcgttagcaaatgcagcaagctggtgggcggaggcgagtccaaccccatgaactac aacagctatatggacgagaagaatggcccccctcctcccaacatgaccaccaacgagagg agagtcatcgtccccgcagaaaataaaatcatcgttcagtccttcagtccattggctgga cactccaccccagctagcaccagcaagctcagctgcctgcacaagaagggtgtttacaaa ctgccacgtgtagaactggtgccagaccccacactgtggacacaggagcatgtgaggcaa tggctggagtgggccataaaggagtacagcttgatggagatcgacacatcctttttccag aacatggatggcaaggaactgtgtaaaatgaacaaggaggacttcctccgcgccaccacc ctctacaacacggaagtgctgttgtcacacctcagttacctcagggaaagttcactgctg gcctataatacaacctcccacaccgaccaatcctcacgattgagtgtcaaagaagggatc tgtaagatcgagaagtttaacatgggtcagaaacagaccaagatcatcaactggtacact ggtgccgacggccacatttttacacactgggcctttggctgccacctcaagtttgtcagt tctcaggaggcagactccaggcaccacattcgagagggcctccatgcacgctcaggcttc ttctcatag >gi568815587r:128739128_128940265|GENSCAN_predicted_peptide_2|243_aa MHWSHKREREIKHTQNSFLLVGIKEGFVDLVALELALEGEAGFGPVAIWAGEDMSLACLT VSEDLRFLELRMCDAEEVWKGPPLGGAQTISKNTEQRPQPDPYQILGPTSSRLANPGSGQ IQLWQFLLELLSDSANASCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALR YYYDKNIMTKVHGKRYAYKFDFHGIAQALQPHPTESSMYKYPSDISYMPSYHAHQQKEHQ IIQ >gi568815587r:128739128_128940265|GENSCAN_predicted_CDS_2|732_bp atgcactggtcccataagcgggagagagaaatcaagcacacacagaactcctttcttctg gttggtatcaaggaaggctttgtggacctggtagcacttgagttagctctggaaggagag gcaggatttggacctgtggcgatatgggctggggaagacatgagcttggcatgtctgaca gtcagtgaggacctcaggttcctggagcttaggatgtgtgatgcagaagaagtctggaaa ggtcctccccttggaggggcacaaacgatcagtaagaatacagagcaacggccccagcca gatccgtatcagatcctgggcccgaccagcagtcgcctagccaaccctggaagcgggcag atccagctgtggcaattcctcctggagctgctctccgacagcgccaacgccagctgtatc acctgggaggggaccaacggggagttcaaaatgacggaccccgatgaggtggccaggcgc tggggcgagcggaaaagcaagcccaacatgaattacgacaagctgagccgggccctccgt tattactatgataaaaacattatgaccaaagtgcacggcaaaagatatgcttacaaattt gacttccacggcattgcccaggctctgcagccacatccgaccgagtcgtccatgtacaag tacccttctgacatctcctacatgccttcctaccatgcccaccagcagaaggaacatcaa atcatccagtga >gi568815587r:128739128_128940265|GENSCAN_predicted_peptide_3|135_aa MNPIILQLHELELQPSAPCGSQHPRPPPPELPPASPNIPSPKAAPNPQAAPRALGHPHPA APELSPVSPGPQRPRPPPTRSCPLSSPHLPVPKAAPHPELLPEYPAPRALGCPSPLSCYP RTPPPAPSAAPHPQG >gi568815587r:128739128_128940265|GENSCAN_predicted_CDS_3|408_bp atgaatcctatcatcctacaactccacgaactggagctgcagccctcggccccctgtggc tcccagcaccctcggccacccccacccgagctgccccctgcgtcccccaacattccctcg cccaaggctgcccccaacccccaagctgctccccgcgccctcggccacccccacccggct gctcccgagctgtcccctgtatcccccggcccccaacgccctcggccaccccccacccgg agctgccccctgagttccccccacctccctgtgcccaaggctgccccccacccggagctg ctcccggagtaccccgccccgcgcgctctcggctgcccctcacccttgagctgctacccg cgtaccccgccccccgcgccctcggctgccccccacccccagggctga >gi568815587r:128739128_128940265|GENSCAN_predicted_peptide_4|391_aa MNASSRNVFDTLIRVLTESMFKHLRKWVVTRFFGHSRQRARLVSKDGRCNIEFGNVEAQS RFIFFVDIWTTVLDLKWRYKMTIFITAFLGSWFFFGLLWYAVAYIHKDLPEFHPSANHTP CVENINGLTSAFLFSLETQVTIGYGFRCVTEQCATAIFLLIFQSILGVIINSFMCGAILA KISRPKKRAKTITFSKNAVISKRGGKLCLLIRVANLRKSLLIGSHIYGKLLKTTVTPEGE TIILDQININFVVDAGNENLFFISPLTIYHVIDHNSPFFHMAAETLLQQDFELVVFLDGT VESTSATCQVRTSYVPEEVLWGYRFAPIVSKTKEGKYRVDFHNFSKTVEVETPHCAMCLY NEKDVRARMKRGYDNPNFILSEVNETDDTKM >gi568815587r:128739128_128940265|GENSCAN_predicted_CDS_4|1176_bp atgaatgcttccagtcggaatgtgtttgacacgttgatcagggtgttgacagaaagtatg ttcaaacatcttcggaaatgggtcgtcactcgcttttttgggcattctcggcaaagagca aggctagtctccaaagatggaaggtgcaacatagaatttggcaatgtggaggcacagtca aggtttatattctttgtggacatctggacaacggtacttgacctcaagtggagatacaaa atgaccattttcatcacagccttcttggggagttggtttttctttggtctcctgtggtat gcagtagcgtacattcacaaagacctcccggaattccatccttctgccaatcacactccc tgtgtggagaatattaatggcttgacctcagcttttctgttttctctggagactcaagtg accattggatatggattcaggtgtgtgacagaacagtgtgccactgccatttttctgctt atctttcagtctatacttggagttataatcaattctttcatgtgtggggccatcttagcc aagatctccaggcccaaaaaacgtgccaagaccattacgttcagcaagaacgcagtgatc agcaaacggggagggaagctttgcctcctaatccgagtggctaatctcaggaagagcctt cttattggcagtcacatttatggaaagcttctgaagaccacagtcactcctgaaggagag accattattttggaccagatcaatatcaactttgtagttgacgctgggaatgaaaattta ttcttcatctccccattgacaatttaccatgtcattgatcacaacagccctttcttccac atggcagcggagacccttctccagcaggactttgaattagtggtgtttttagatggcaca gtggagtccaccagtgctacctgccaagtccggacatcctatgtcccagaggaggtgctt tggggctaccgttttgctcccatagtatccaagacaaaggaagggaaataccgagtggat ttccataactttagcaagacagtggaagtggagacccctcactgtgccatgtgcctttat aatgagaaagatgttagagccaggatgaagagaggctatgacaaccccaacttcatcttg tcagaagtcaatgaaacagatgacaccaaaatgtaa >gi568815587r:128739128_128940265|GENSCAN_predicted_peptide_5|170_aa MKDLVCDAEELGQCPEGGENRGYCVSRPLGPADVGYKAHPRQGALLLLTCPLALGGVRDF TLELPTQDRNQTPFSDLQSPEADTRIGQEQFKTFAGGRSWEVSRAVSCSERAPGWAVPGN EDGGMLNASKKSPSEKTPAFAEGQLYGMEQRVTEAAGSAPSGSRAFPEST >gi568815587r:128739128_128940265|GENSCAN_predicted_CDS_5|513_bp atgaaggacctagtatgtgatgctgaggaacttgggcagtgtccggaaggtggtgagaac cggggctattgcgtctccaggccccttggtccagctgatgtgggttacaaggcccacccg cgccagggggcgctgctgctgctgacctgccctctggcactgggaggtgtgagggacttc accctggagctgccaacccaggatcgtaaccaaacaccgttctcagacctgcagagccca gaggcagacacgaggattggtcaagagcagttcaagacatttgctggaggacgcagctgg gaagtctcccgggcagtgtcctgttctgagagggctcctgggtgggctgtgcccggtaat gaggatgggggcatgctaaatgccagcaagaagagccccagtgagaagactccagcattt gctgaagggcagctctatggcatggaacaaagggtcaccgaagctgctggctctgctccc tctggctccagggctttcccagaaagcacttga >gi568815587r:128739128_128940265|GENSCAN_predicted_peptide_6|272_aa MCLPQVDRVERPVDSIALSWEQNLPGLCTFNPAGSQLSAVRRKAFTLQFPGDCSQDAWVS TTDSSAEGLLRGNFPPEKHRNETAMSPSHTGKGSSLREVELLDKRAEALLVMSTSHEDQA GSGAWGHQKGEGACPGSCLSYGFHELKNTLWSSSVLKAGRVSSHRRPAAAARRGEIVRGE QRARGQLAWVLSLDLLPLQLADQAPGPSLHGGNQSRRAGDAAPRLWLPRLPKGIPPSAPS LSGNGEMLPTVRLAHRGATEQPRRGSEPRFLP >gi568815587r:128739128_128940265|GENSCAN_predicted_CDS_6|819_bp atgtgtctcccccaggtagaccgcgtggagcgcccagtggactcgatagcgctgtcctgg gaacagaaccttcccggactgtgcacttttaaccctgcaggttctcagctgtctgccgtc agaaggaaagcattcacactccagttcccaggagactgcagccaggacgcttgggtttcc accacggattcctctgctgaggggctgctccgcggtaactttcctcccgagaagcacagg aatgaaacggcgatgagcccctctcacacagggaagggctccagcctgcgggaggtggag ctgctggataaaagagctgaggcccttctggtgatgtccaccagccacgaggatcaggcg gggtcaggggcctggggacaccagaaaggggagggtgcctgccctggcagctgcctttcc tacggcttccacgaattaaagaacacactctggtcatcatctgtcctcaaggcagggcgc gtttccagccatcgccgcccagccgccgccgcacgccggggagagatcgtccggggtgag cagcgagcccggggacagctagcctgggtcctgtccctggacctactccccctgcagctg gcggatcaggcccccggaccctcactccacggcgggaatcagagccgtcgggcgggggat gcagcgcccagactgtggctcccccgccttcccaaaggaatccccccatctgctccatcc ctctcgggaaacggcgaaatgctgccaaccgtccgcttagcgcaccgcggagcgactgag cagccccgccgaggctccgagccaagatttctgccctaa >gi568815587r:128739128_128940265|GENSCAN_predicted_peptide_7|210_aa MLTRLVLSAHLSSTTSPPWTHAAISWELDNVLMPSPRIWPQVTPTAGQDVHAIVTRTCES VLSSAVYTHGCGCVSGEGIVEGTAEVCEMSGMRPQRSLIAKLLSTLRKGAAGKADEASQD SGIVDTETPTVKFYVIQTLGDLAIGLPGGPGFWLHNKGGWLVLNGATEENKNKTQKDSFV ADDYLNSYLMTDPHLETLPALNDLSIAHTQ >gi568815587r:128739128_128940265|GENSCAN_predicted_CDS_7|633_bp atgttgacgcggctggtcctcagtgcacacctgagtagcacgacctctccgccctggacg cacgctgccatcagctgggagctggacaacgtgctgatgcctagtcccagaatctggccc caggtgactccaacagctgggcaggatgtgcatgccatagtaaccagaacctgtgagtct gtgctgagctctgccgtctacacccacggctgtggctgtgtgagcggtgagggcatcgtg gaagggactgcagaggtgtgcgagatgagtggaatgaggccccagcgttctctgattgcc aagctgttaagcactctgcggaagggagctgctgggaaggcggacgaggcatctcaggat tctggcattgtagacacagaaacaccgactgtgaagttttatgtaatacaaacactggga gatttagcaataggcctgccaggcggccctggcttctggctgcacaacaaaggggggtgg ctggtgctgaatggggcaacagaagaaaataaaaacaaaacacagaaagattccttcgtg gctgacgactatcttaacagttatctgatgacagatccgcacttggaaacactgccagcc ctcaatgacctgtctatagctcatacacagtga >gi568815587r:128739128_128940265|GENSCAN_predicted_peptide_8|615_aa MAGDSRNAMNQDMEIGVTPWDPKKIPKQARDYVPIATDRTRLLAEGKKPRQRYMEKSGKC NVHHGNVQETYRYLSDLFTTLVDLKWRFNLLVFTMVYTVTWLFFGFIWWLIAYIRGDLDH VGDQEWIPCVENLSGFVSAFLFSIETETTIGYGFRVITEKCPEGIILLLVQAILGSIVNA FMVGCMFVKISQPKKRAETLMFSNNAVISMRDEKLCLMFRVGDLRNSHIVEASIRAKLIK SRQTKEGEFIPLNQTDINVGFDTGDDRLFLVSPLIISHEINQKSPFWEMSQAQLHQEEFE VVVILEGMVEATGMTCQARSSYMDTEVLWGHRFTPVLTLEKGFYEVDYNTFHDTYETNTP SCCAKELAEMKREGRLLQYLPSPPLLGGCAEAGLDAEAEQNEEDEPKGLDDPLTPFTADI RSLLVELLSEFGKCECDLENAGCRREDPVFMASRGISPQQSDDAATQVPMETKLGQEEQK APQARGFVTPAHLGPAPQTLRVSFWGEKCGPPGRLLAQAWPVSASGAMRCLCWDECMAPA SLPPPQSPQASECVIFIIGHRKRLLGKPRIPRRRRAGLPLGTIDQGCQFPALSNALNWGQ SACSSEDPDAVTGSX >gi568815587r:128739128_128940265|GENSCAN_predicted_CDS_8|1845_bp atggctggcgattctaggaatgccatgaaccaggacatggagattggagtcactccctgg gaccccaagaagattccaaaacaggcccgcgattatgtccccattgccacagaccgtacg cgcctgctggccgagggcaagaagccacgccagcgctacatggagaagagtggcaagtgc aacgtgcaccacggcaacgtccaggagacctaccggtacctgagtgacctcttcaccacc ctggtggacctcaagtggcgcttcaacttgctcgtcttcaccatggtttacactgtcacc tggctgttcttcggcttcatttggtggctcattgcttatatccggggtgacctggaccat gttggcgaccaagagtggattccttgtgttgaaaacctcagtggcttcgtgtccgctttc ctgttctccattgagaccgaaacaaccattgggtatggcttccgagtcatcacagagaag tgtccagaggggattatactcctcttggtccaggccatcctgggctccatcgtcaatgcc ttcatggtggggtgcatgtttgtcaagatcagccagcccaagaagagagcggagaccctc atgttttccaacaacgcagtcatctccatgcgggacgagaagctgtgcctcatgttccgg gtgggcgacctccgcaactcccacatcgtggaggcctccatccgggccaagctcatcaag tcccggcagaccaaagagggggagttcatccccctgaaccagacagacatcaacgtgggc tttgacacgggcgacgaccgcctcttccttgtgtctcctctgatcatctcccatgagatc aaccagaagagccctttctgggagatgtctcaggctcagctgcatcaggaagagtttgaa gttgtggtcattctagaagggatggtggaagccacaggcatgacctgccaagcccggagc tcctacatggatacagaggtgctctggggccaccgattcacaccagtcctcaccttggaa aagggcttctatgaggtggactacaacaccttccatgatacctatgagaccaacacaccc agctgctgtgccaaggagctggcagaaatgaagagggaaggccggctcctccagtacctc cccagccccccactgctggggggctgtgctgaggcagggctggatgcagaggctgagcag aatgaagaagatgagcccaaggggctggatgaccccttaacacccttcactgctgacatc cggagcctccttgtggagctgttgtcagagtttggaaagtgtgagtgtgacctagagaac gctgggtgcagacgtgaagatccagtattcatggcatccaggggcatttcacctcagcaa agtgatgacgctgccactcaggttcccatggaaaccaaactgggacaggaggaacaaaag gctcctcaagctcggggctttgtcaccccagcacacctgggccctgcacctcagacgctg cgtgtgagcttctggggagagaagtgtgggcctccagggagactcctggcccaggcctgg cctgtgtctgcttccggagccatgcgctgcctgtgctgggatgaatgcatggcccctgca tccttaccccctcctcagagcccacaagcttccgagtgcgtcatcttcattattggccac aggaaacgacttcttggaaaacccaggattccgaggaggaggagggctggccttcctctt gggactattgatcagggctgtcagttcccagctctgtccaatgctctgaactggggacag tccgcatgcagctctgaggacccagatgctgtcactgggtcctgn