GENSCAN 1.0 Date run: 5-Nov-116 Time: 23:34:54 Sequence gi568815587f:120146446_120418389 : 271944 bp : 44.90% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1033 1194 162 1 0 78 115 103 0.924 11.09 1.02 Intr + 22526 22703 178 2 1 6 107 65 0.348 0.12 1.03 Intr + 23326 23748 423 0 0 69 72 182 0.330 8.76 1.04 Intr + 24014 24145 132 1 0 62 77 39 0.350 1.04 1.05 Term + 29077 29256 180 0 0 91 47 96 0.489 3.41 1.06 PlyA + 29751 29756 6 1.05 2.07 PlyA - 31037 31032 6 1.05 2.06 Term - 35425 35262 164 2 2 74 42 83 0.179 0.40 2.05 Intr - 41201 41077 125 1 2 72 58 55 0.168 1.13 2.04 Intr - 42728 42679 50 2 2 91 87 43 0.345 1.98 2.03 Intr - 43557 43373 185 1 2 25 99 56 0.144 -0.09 2.02 Intr - 46897 46848 50 0 2 69 113 12 0.169 0.12 2.01 Init - 47171 47095 77 2 2 60 50 93 0.423 3.29 2.00 Prom - 52107 52068 40 -4.26 3.00 Prom + 55647 55686 40 -5.96 3.01 Init + 64835 65065 231 1 0 60 100 708 0.999 65.46 3.02 Intr + 79216 79350 135 0 0 105 111 204 0.993 25.16 3.03 Intr + 80371 80551 181 0 1 38 82 273 0.993 21.14 3.04 Term + 82423 82697 275 2 2 158 46 437 0.999 42.13 3.05 PlyA + 83471 83476 6 1.05 4.09 PlyA - 85888 85883 6 1.05 4.08 Term - 87295 87279 17 2 2 98 48 38 0.312 -0.80 4.07 Intr - 93070 92951 120 2 0 79 50 113 0.262 7.07 4.06 Intr - 93841 93689 153 2 0 62 99 110 0.318 9.54 4.05 Intr - 96424 96344 81 2 0 78 94 11 0.098 0.31 4.04 Intr - 99467 99339 129 0 0 67 60 78 0.064 3.57 4.03 Intr - 99578 99547 32 1 2 64 116 17 0.059 -0.13 4.02 Intr - 107209 107139 71 1 2 84 70 68 0.019 2.58 4.01 Init - 119513 119421 93 2 0 63 114 82 0.968 8.68 4.00 Prom - 121348 121309 40 -4.76 5.00 Prom + 132574 132613 40 -1.36 5.01 Init + 147664 147684 21 0 0 66 100 0 0.175 -1.14 5.02 Intr + 151820 151945 126 1 0 116 92 80 0.989 12.08 5.03 Intr + 158585 158767 183 1 0 57 36 144 0.633 6.08 5.04 Intr + 159199 159340 142 0 1 69 96 263 0.999 25.13 5.05 Intr + 161034 161215 182 1 2 113 91 113 0.977 13.69 5.06 Intr + 162980 163141 162 1 0 77 105 220 0.989 22.77 5.07 Intr + 166194 166327 134 2 2 58 80 61 0.348 1.74 5.08 Intr + 180996 181195 200 0 2 93 96 114 0.688 11.69 5.09 Term + 183532 184070 539 2 2 107 42 275 0.951 19.21 5.10 PlyA + 184172 184177 6 1.05 6.04 PlyA - 184758 184753 6 1.05 6.03 Term - 190228 189730 499 0 1 80 47 185 0.129 7.60 6.02 Intr - 237070 237013 58 2 1 113 101 30 0.339 4.74 6.01 Init - 243391 243367 25 1 1 76 115 13 0.398 2.49 6.00 Prom - 244811 244772 40 -2.86 7.00 Prom + 245077 245116 40 -4.76 7.01 Init + 249176 249189 14 1 2 83 61 9 0.071 -2.50 7.02 Intr + 259673 259696 24 2 0 106 115 -16 0.432 0.04 7.03 Intr + 261293 261378 86 2 2 48 98 85 0.868 4.96 7.04 Intr + 262949 263005 57 0 0 95 103 27 0.851 3.76 7.05 Term + 264355 264371 17 2 2 112 44 33 0.841 -0.30 7.06 PlyA + 267378 267383 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 99737 99752 16 1 1 84 100 14 0.844 2.58 S.002 Intr + 100004 100072 69 0 0 112 110 75 0.851 11.25 S.003 Init - 143291 143106 186 2 0 61 88 124 0.925 8.76 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:120146446_120418389|GENSCAN_predicted_peptide_1|358_aa XLVGISSKNGCLTSPPQSRLSDPSVCRYSFTQFPRAHLAHTALKASVYTTSTSRRLGSAS REPAQAEALHELCLPSLRRECGQDASKRPGKEGAEGAQPGRWADPEGTWRGTRKVPSAVG ITISTILMMDLESTGVPCPGSCSLWKAAGAGMPNGLNELLGDSLSLVNAEPTLTVRPNPT ANNRAALGGSGLPVPGTGSGRGGCRCGSSARSAKLLCLGGRGHWATCSRGCPQAGLPGFL TARLAGNDAALWHSEPAPTAHRNLQPPSLPQLSLCGPHYPTFCRAQLPYPLYRLSGAMVK MWKLKGPKMGEEGTELKGRAGEQVQTEAKEPSLAYLLLWKQFRGKELGILEPPECQLG >gi568815587f:120146446_120418389|GENSCAN_predicted_CDS_1|1077_bp nncttagtcggcatttcttccaagaatggctgcctgacctccccgccccagtccaggctg agtgacccctccgtgtgccgctacagcttcacacagttccccagggcccaccttgcccac actgcattgaaagcatctgtttacaccacttccaccagcaggcgcctaggaagcgcatcg agggagcccgctcaagctgaagcgctgcacgagctgtgtcttccttctctccgccgcgaa tgcgggcaggatgcgagcaaaagacccggcaaagagggcgccgagggggcgcagccgggg cgctgggcggacccagaggggacatggagaggaacgcggaaggtgccctctgcggtgggc attactatttccaccatcctcatgatggatctggaatccacgggggtcccctgcccagga tcgtgcagcctgtggaaggcagcaggcgccgggatgccgaatggattgaatgagcttctc ggggattccctgagcttggtgaacgcagagcccaccctgacggtcaggcctaaccccacc gccaataacagagcagccctaggtggaagtgggctgcctgtgcctggcactgggagcggc agaggcggctgccgctgtggaagtagcgcccgcagtgccaagctcctgtgcctgggagga cgcgggcactgggcgacctgcagtcggggctgcccccaggcagggctccctggctttctc acagcgcggctcgcgggaaacgacgctgccctatggcacagtgaacctgccccaactgct catcggaacctgcagcccccttccctgccccaactgtccctctgcggaccccattaccct actttctgtcgagcccaactgccctatcctttatacaggctgtcgggggcaatggtgaag atgtggaagctgaaaggacccaagatgggggaagagggtacagaactcaagggaagggct ggggagcaggttcagacagaggcaaaagaacccagccttgcttacctgctcctctggaaa cagtttcgtggcaaagaattaggaattctggagccgcccgaatgccagctgggatga >gi568815587f:120146446_120418389|GENSCAN_predicted_peptide_2|216_aa MWQPLLAKEAEVPLVEEDLGIGQQQPGGLAPACLWGRSGGSQVKEILQHKWLAQGPLACM HHCPAHKPHGSPLANVESTFLNLYSRPAIIFFLPDWPSGPVPWQGVEWLAAQSMVILAAG LSPLTVVPLFRSPSLKPHPHSLSYYQEQTPKYFCVSSGKILGASCLYRLPVQHRRLSAQW LQMVPLYCVGKDQDFFVHSPGPGSEWALAVNYMSEC >gi568815587f:120146446_120418389|GENSCAN_predicted_CDS_2|651_bp atgtggcaacctttgctggccaaggaggcagaggtgcctctggtggaagaagaccttggg atcggccagcagcagcccggaggactggcccctgcctgcctctggggccgctctggtggt agtcaagtcaaggagattctgcagcataaatggctggcccagggaccactggcctgtatg catcactgccctgctcataaacctcatggctccccactggcaaatgttgagtccaccttc ctcaacctgtattcaaggcctgccatcatcttcttcctgccagactggccttcaggtcct gtgccttggcagggtgtggagtggcttgcagcacagtccatggtcatccttgctgctggg ctctcccctctgactgtggtccctctgtttcgaagccctagcctcaagcctcatccccac tctctttcctactaccaagagcagacaccaaaatatttctgcgtttcctctggaaaaatc ctgggagcatcctgcctgtaccgactcccagttcagcacaggcgactttcagcccagtgg ctgcagatggttcctctctattgtgtgggcaaggaccaggacttctttgtccatagccca ggacctggctcagagtgggctctggctgttaattacatgtccgaatgctga >gi568815587f:120146446_120418389|GENSCAN_predicted_peptide_3|273_aa MRLPGVPLARPALLLLLPLLAPLLGTGAPAELRVRVRLPDGQVTEESLQADSDADSISLE LRKPDGTLVSFTADFKKDVKVFRALILGELEKGQSQFQALCFVTQLQHNEIIPSEAMAKL RQKNPRAVRQAEEVRGLEHLHMDVAVNFSQGALLSPHLHNVCAEAVDAIYTRQEDVRFWL EQGVDSSVFEALPKASEQAELPRCRQVGDHGKPCVCRYGLSLAWYPCMLKYCHSRDRPTP YKCGIRSCQKSYSFDFYVPQRQLCLWDEDPYPG >gi568815587f:120146446_120418389|GENSCAN_predicted_CDS_3|822_bp atgcgccttcccggggtacccctggcgcgccctgcgctgctgctgctgctgccgctgctc gcgccgctgctgggaacgggtgcgccggccgagctgcgggtccgcgtgcggctgccggac ggccaggtgaccgaggagagcctgcaggcggacagcgacgcggacagcatcagcctcgag ctgcgcaagcccgacggcaccctcgtctccttcaccgccgacttcaagaaggatgtgaag gtcttccgggccctgatcctgggggagctggagaaggggcagagtcagttccaggccctc tgctttgtcacccagctgcagcacaatgagatcatccccagtgaggccatggccaagctc cggcagaaaaatccccgggcagtgcggcaggcggaggaggttcggggtctggagcatctg cacatggatgtcgctgtcaacttcagccagggggccctgctgagcccccatctccacaac gtgtgtgccgaggccgtggatgccatctacacccgccaggaggatgtccggttctggctg gagcaaggtgtggacagttctgtgttcgaggctctgcccaaggcctcagagcaggcggag ctgcctcgctgcaggcaggtgggggaccacgggaagccctgcgtctgccgctatggcctg agcctggcctggtacccctgcatgctcaagtactgccacagccgcgaccggcccacgccc tacaagtgtggcatccgcagctgccagaagagctacagcttcgacttctacgtgccccag aggcagctgtgtctctgggatgaggatccctacccaggctag >gi568815587f:120146446_120418389|GENSCAN_predicted_peptide_4|231_aa MRKKKGSSLDDEEAPAKETEKEWAGKWKKDRVESPRNDQSWSVHCGGPGPATDKDSVSLN NPILEEVSPQSGDQVLAPVLSSLEILSFGDPYLLSYSFNWLSEKGMAAVREQGAPPCCRA PVSGDPLPAQVFVLPAGAPAKPGSPSSSSRPPPPREPLPEPSPPLSRAPLLRPGQHSAPP GLPHSPVSRPRSQVAVRLYEMMDVEVTSAQAFGDSHLPERGSGLLKVLLIG >gi568815587f:120146446_120418389|GENSCAN_predicted_CDS_4|696_bp atgaggaagaaaaaagggtcaagtttagatgatgaggaggcaccagcaaaagagactgag aaagagtgggcaggcaaatggaagaaagaccgggtagaaagtccccgcaacgaccaaagc tggtctgtgcactgtggaggtccaggaccagctacggacaaagactctgtttcccttaat aaccccatcctggaagaggtcagcccacagtctggggatcaagtcctagctcctgtgttg tcctccttggagatcctgtcctttggggatccttaccttctctcctactccttcaactgg ctcagtgagaaagggatggcagcggtcagggagcagggagcccctccttgctgcagagcc cctgtgtctggtgaccctctgcctgctcaggtctttgtgcttccagccggggcccctgcg aagccagggtctccttcctccagttcccggccgccgccgccgcgggagccgctgccggag ccgtcgccgccactgtcgcgcgctccgctgctccggcccgggcaacactcggcgccgcca ggattgccacactccccagtctcccgccccaggtcccaggttgctgtgaggctctacgag atgatggatgtggaagttaccagtgcccaggcttttggtgacagccaccttccagagcgg ggaagcggcttgctcaaagtgcttctcatcggctga >gi568815587f:120146446_120418389|GENSCAN_predicted_peptide_5|562_aa MRLLLSEIKTEDLSDSLQQTLSHRPCHLSQGPAMMSGNQMSGLNASPCQAFGHPGLPGSS LEPHLEASQHLPVPKHLPSSGGADEPSDLEELEKFAKTFKQRRIKLGFTQGDVGLAMGKL YGNDFSQTTISRFEALNLSFKNMCKLKPLLEKWLNDAESSPSDPSVSTPSSYPSLSEVFG RKRKKRTSIETNIRLTLEKRFQDVSSTFGWARVGYLTQNPKPSSEEISMIAEQLSMEKEV VRVWFCNRRQKEKRINCPVATPIKPPVYNSRLHSLSTCYMFGTGAPVVNRTRDPCLQCSR KTDTKQMLFVMKASEKRMALALCLQVLCSLCGWLSLYISFCHLNKHRSYEWSCRLVTFTH GVLSIGLSAYIGFIDGPWPFTHPGSPNTPLQVHVLCLTLGYFIFDLGWCVYFQSEGALML AHHTLSILGIIMALVLGESGTEVNAVLFGSELTNPLLQMRWFLRETGHYHSFTGDVVDFL FVALFTGVRIGVGACLLFCEMVSPTPKWFVKAGGVAMYAVSWCFMFSIWRFAWRKSIKKY HAWRSRRSEERQLKHNGHLKIH >gi568815587f:120146446_120418389|GENSCAN_predicted_CDS_5|1689_bp atgaggctactgctttctgaaattaaaaccgaagatctcagtgactccctgcagcagacc ctctcccatcggccatgccacctgagtcaaggacctgccatgatgtccggaaaccaaatg tctgggctaaatgccagcccatgtcaggcatttgggcaccctgggctgccaggatcctct ttagaaccccacctggaagcatcccagcatctcccagtgcccaagcatctacccagctct ggaggggccgatgagcccagtgacctcgaggagctggagaagtttgccaagaccttcaag cagaggcgcattaagctgggcttcacacagggagatgtggggctggcgatgggaaagctg tatggcaacgacttcagccagaccaccatctcacgatttgaggccctcaacctgagcttc aagaacatgtgcaagctcaagcccctgctggagaagtggctgaatgatgcagagtcctct ccgtcagacccctcagtgagcacgcccagctcctaccccagcctcagtgaagtatttggt aggaagagaaagaaacggaccagcatcgagaccaacatccgcctgactctggagaagagg tttcaagatgtgagcagcaccttcgggtgggcacgggtgggctacctcacacagaaccca aaacccagctcggaggagatctccatgattgcagagcagttgtccatggagaaggaggtg gtgagggtctggttctgcaaccgacgccaaaaggagaagcgaatcaactgccctgtggcc acacccatcaaaccacctgtctacaactcccggctgcattcactgagcacctgctatatg tttggtactggggctccagtggtgaaccgcacacgtgacccctgcttacagtgtagcagg aaaacagatactaaacaaatgctatttgtgatgaaggcttcagagaagaggatggcatta gctctgtgtctgcaggtgctgtgcagcctgtgtggctggctctcgctctatatttctttc tgccacctgaataagcaccgaagctatgagtggagctgccgcctggtcaccttcacccat ggagtcctctctataggcctctccgcttatattggcttcattgatggcccatggcctttt acccacccaggctcacccaatacacctctccaagttcatgtcctgtgtctcaccttgggc tacttcatcttcgacttgggctggtgcgtctactttcagtctgagggtgccttgatgctg gctcatcacacattgagtatcttgggcattatcatggcccttgtgcttggggagtctggc acagaggtcaatgcagtcctctttggaagtgagcttaccaaccccttgctacagatgcgc tggtttctccgggaaacagggcactatcacagtttcactggagatgtagtggacttcctc tttgtggctctgttcacaggagtgaggattggtgtgggagcttgcctccttttctgtgaa atggtctcccccacgcctaagtggtttgtgaaggctgggggagtagcgatgtatgctgtg tcttggtgtttcatgtttagcatctggcgctttgcatggaggaagagcatcaagaagtac catgcttggagaagcaggcggagtgaggaacggcagctgaaacacaacggacatctcaaa atacactag >gi568815587f:120146446_120418389|GENSCAN_predicted_peptide_6|193_aa MDGNGRHYDGTVKLQENKLRVPTDSTLWLPVLGRAAGAGGEVPRPEVTGTRVSQRRGLLP IPLRPGLFRNRQRRRLLPDAPREGREMPVRPGLEALVREAEAAASPGAGPARRVLEALAG ARADSGGGRRPWGPRNRGRARGFRFRFWFPGPGRVRRELALPVPRRLYGPLPSDFGKPPH PGSLRRSGPRSRP >gi568815587f:120146446_120418389|GENSCAN_predicted_CDS_6|582_bp atggatggaaatggacgtcattatgatgggaccgtcaagttgcaagaaaacaaactcagg gttcccactgattctacattatggctcccggtgctgggacgagccgctggcgccggcgga gaggtcccgcggcccgaggtgacagggacccgggtctctcagcgccgcggactgctcccc atccctttgaggcccggactcttccgcaatcgccagaggcggcggctcctccccgacgct ccccgcgaggggcgggagatgcccgtccggcccggcctggaagccctggtccgggaggcg gaggcggcggcgtctcccggcgcagggccggctcgccgggttctggaggccctcgcgggc gcgcgcgcagacagcggaggaggccggcggccgtgggggccgcgcaaccgcgggcgcgcg cgcgggttccggttccggttctggtttcccggccctggccgcgtccggcgcgagctcgcg cttcccgtcccccgccgactttatggccccctcccctctgacttcgggaagccgcctcac ccgggatccctccgccgctcagggccgcgctctcgcccttga >gi568815587f:120146446_120418389|GENSCAN_predicted_peptide_7|65_aa MAEGKFPLKKPIRHGSILNRESPTDKKQKVERIASHDFDPTDSSSKKTKSSSEESRSEIY EYFGY >gi568815587f:120146446_120418389|GENSCAN_predicted_CDS_7|198_bp atggcagaaggcaagtttcccctcaaaaaacctataaggcatggaagtattttgaaccga gagtcaccaacagataagaagcagaaagttgagcgcattgcatcacatgattttgacccc acagatagctcctccaagaagacaaagtctagttcagaggagagtagatccgagatatat gagtattttggctactga