GENSCAN 1.0 Date run: 4-Nov-116 Time: 22:05:20 Sequence gi568815593f:136053761_136277477 : 223717 bp : 43.09% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 183 320 138 1 0 110 87 62 0.974 8.74 1.02 Intr + 956 1101 146 0 2 73 101 69 0.999 6.70 1.03 Intr + 1920 2056 137 2 2 103 59 181 0.998 16.07 1.04 Intr + 2905 3035 131 1 2 82 38 171 0.999 11.84 1.05 Intr + 5330 5454 125 0 2 81 111 193 0.999 21.20 1.06 Intr + 7074 7176 103 2 1 107 111 128 0.999 16.75 1.07 Intr + 7740 7819 80 1 2 118 90 35 0.978 5.97 1.08 Intr + 8903 8927 25 1 1 105 95 7 0.589 0.70 1.09 Term + 9691 9812 122 2 2 49 41 91 0.503 -0.86 1.10 PlyA + 10039 10044 6 1.05 2.06 PlyA - 10245 10240 6 1.05 2.05 Term - 15686 15538 149 0 2 98 48 36 0.598 -1.34 2.04 Intr - 16346 16169 178 2 1 104 7 146 0.715 7.59 2.03 Intr - 30567 30467 101 1 2 115 61 5 0.012 0.33 2.02 Intr - 52229 52136 94 2 1 87 99 34 0.099 3.94 2.01 Init - 85835 85707 129 2 0 83 64 108 0.216 8.05 2.00 Prom - 90621 90582 40 -5.86 3.00 Prom + 92394 92433 40 -2.96 3.01 Init + 100010 100403 394 1 1 54 96 166 0.506 10.92 3.02 Intr + 107096 107347 252 0 0 103 110 42 0.877 5.31 3.03 Intr + 109512 109631 120 1 0 92 78 4 0.523 0.27 3.04 Intr + 118674 118895 222 1 0 101 84 36 0.707 2.60 3.05 Intr + 120616 120872 257 2 2 110 113 86 0.986 10.46 3.06 Term + 122138 122200 63 1 0 96 42 41 0.872 -1.81 3.07 PlyA + 122379 122384 6 1.05 4.07 PlyA - 122803 122798 6 1.05 4.06 Term - 131624 131534 91 1 1 104 38 86 0.652 2.39 4.05 Intr - 133807 133755 53 1 2 101 103 -24 0.574 -1.89 4.04 Intr - 137523 137329 195 0 0 49 41 152 0.439 6.21 4.03 Intr - 138808 138459 350 2 2 6 6 625 0.877 41.18 4.02 Intr - 139478 138877 602 1 2 74 69 262 0.713 15.06 4.01 Init - 139863 139730 134 0 2 59 69 166 0.994 9.63 4.00 Prom - 142358 142319 40 -9.06 5.00 Prom + 142418 142457 40 -2.76 5.01 Init + 144819 144890 72 2 0 41 93 59 0.225 0.82 5.02 Intr + 147038 147161 124 1 1 62 94 77 0.294 5.86 5.03 Intr + 149868 149993 126 1 0 107 109 -10 0.278 3.65 5.04 Term + 151219 151286 68 2 2 95 49 44 0.169 -0.70 5.05 PlyA + 158207 158212 6 -0.45 6.10 PlyA - 159005 159000 6 1.05 6.09 Term - 159844 159675 170 2 2 115 43 218 0.866 18.04 6.08 Intr - 162515 162440 76 2 1 88 94 87 0.973 8.39 6.07 Intr - 171594 171514 81 0 0 55 91 73 0.922 4.13 6.06 Intr - 172495 172274 222 1 0 109 100 61 0.949 7.72 6.05 Intr - 177789 177594 196 2 1 94 94 210 0.950 21.62 6.04 Intr - 193975 193711 265 2 1 66 92 281 0.354 22.87 6.03 Intr - 198122 197889 234 0 0 103 113 389 0.372 40.56 6.02 Intr - 212676 212460 217 0 1 87 87 56 0.207 3.48 6.01 Intr - 221077 220913 165 1 0 87 87 47 0.620 4.66 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:136053761_136277477|GENSCAN_predicted_peptide_1|335_aa XKTLFELAAESDVSTAIDLFRQAGLGNHLSGSERLTLLAPLNSVFKDGTPPIDAHTRNLL RNHIIKDQLASKYLYHGQTLETLGGKKLRVFVYRNSLCIENSCIAAHDKRGRYGTLFTMD RVLTPPMGTVMDVLKGDNRFSMLVAAIQSAGLTETLNREGVYTVFAPTNEAFRALPPRER SRLLGDAKELANILKYHIGDEILVSGGIGALVRLKSLQGDKLEVSLKNNVVSVNKEPVAE PDIMATNGVVHVITNVLQPPANRPQERGDELADSALEIFKQASAFSRASQRSVRLETWMS LPDIHFQRGPIPNVELTAYAKSLEKELQYCGAHKT >gi568815593f:136053761_136277477|GENSCAN_predicted_CDS_1|1008_bp nccaagacactatttgaattggctgcagagtctgatgtgtccacagccattgaccttttc agacaagccggcctcggcaatcatctctctggaagtgagcggttgaccctcctggctccc ctgaattctgtattcaaagatggaacccctccaattgatgcccatacaaggaatttgctt cggaaccacataattaaagaccagctggcctctaagtatctgtaccatggacagaccctg gaaactctgggcggcaaaaaactgagagtttttgtttatcgtaatagcctctgcattgag aacagctgcatcgcggcccacgacaagagggggaggtacgggaccctgttcacgatggac cgggtgctgacccccccaatggggactgtcatggatgtcctgaagggagacaatcgcttt agcatgctggtagctgccatccagtctgcaggactgacggagaccctcaaccgggaagga gtctacacagtctttgctcccacaaatgaagccttccgagccctgccaccaagagaacgg agcagactcttgggagatgccaaggaacttgccaacatcctgaaataccacattggtgat gaaatcctggttagcggaggcatcggggccctggtgcggctaaagtctctccaaggtgac aagctggaagtcagcttgaaaaacaatgtggtgagtgtcaacaaggagcctgttgccgag cctgacatcatggccacaaatggcgtggtccatgtcatcaccaatgttctgcagcctcca gccaacagacctcaggaaagaggggatgaacttgcagactctgcgcttgagatcttcaaa caagcatcagcgttttccagggcttcccagaggtctgtgcgactagaaacttggatgtca ctgcctgacattcacttccagagaggacctatcccaaatgtggaattgactgcctatgcc aagtccctggaaaaggagcttcagtattgtggggctcataaaacatga >gi568815593f:136053761_136277477|GENSCAN_predicted_peptide_2|216_aa MTMAYNQGDSSGVVSSRHMKDSKLPDEGDHGVYENKGVNNDSKVRKLGHREGQKLAPSYT SGISGMMIYTSEDRAQLPERFEHAHCVPCPIHYSAIKKLTLSLTTPAKPRVSPLISPENA LAVVPDDMHMAKPIGHINNSSELMLVLKRLSFLNFQDSIHILSDPFAVPPPWSSIMSCLD DCNGLLIGFPAATPTLHFNPLSTQKPKQSFKNIFNN >gi568815593f:136053761_136277477|GENSCAN_predicted_CDS_2|651_bp atgacgatggcttacaaccagggtgatagcagtggggtggtgagcagcagacatatgaaa gacagtaagcttcctgatgaaggagatcatggggtgtatgagaacaaaggagttaacaat gattccaaggtgaggaaactggggcacagagaggggcagaaacttgccccaagctacaca tctgggattagtggaatgatgatttacaccagtgaagacagagcccaacttccagaaaga tttgaacatgcacactgtgtcccatgtcccattcattactcagccatcaaaaagctgact ctcagtctcaccactcctgccaagcccagagtcagtcctctcatctcccccgagaatgct cttgccgtggttcctgatgacatgcacatggctaaacccatcggacacatcaataattca tctgagttgatgcttgtccttaaaaggctatccttcctcaacttccaggattccatacac atattatcagacccctttgcagtgccaccaccctggtcctccatcatgtcttgcctggat gactgtaacggcctcctaattggcttcccagctgctactcctaccctccacttcaaccca ctttccacacaaaaaccaaaacaatcatttaaaaacatatttaacaattaa >gi568815593f:136053761_136277477|GENSCAN_predicted_peptide_3|435_aa MASLFSFTSPAVKRLLGWKQGDEEEKWAEKAVDALVKKLKKKKGAMEELEKALSSPGQPS KCVTIPRSLDGRLQVSHRKGLPHVIYCRVWRWPDLQSHHELKPLDICEFPFGSKQKEVCI NPYHYKRVESPVLPPVLVPRHNEFNPQHSLLVQFRNLSHNEPHMPQNATFPDSFHQPNNT PFPLSPNSPYPPSPASSTYPNSPASSGPGSPFQLPADTPPPAYMPPDDQMGQDNSQPMDT SNNMIPQIMPSISSRDVQPVAYEEPKHWCSIVYYELNNRVGEAFHASSTSVLVDGFTDPS NNKSRFCLGLLSNVNRNSTIENTRRHIGKGVHLYYVGGEVYAECLSDSSIFVQSRNCNFH HGFHPTTVCKIPSSCSLKIFNNQEFAQLLAQSVNHGFEAVYELTKMCTIRMSFVKFSWAK TTISSKPENLINPGK >gi568815593f:136053761_136277477|GENSCAN_predicted_CDS_3|1308_bp atggccagcttgttttcttttactagtccagcagtaaagcgattgttgggctggaaacaa ggtgatgaggaggagaaatgggcagaaaaggcagttgatgctttggtgaagaaactaaaa aagaaaaagggtgccatggaggaactggagaaagccttgagcagtccaggacagccgagt aaatgtgtcactattcccagatctttagatggacgcctgcaggtttctcacagaaaaggc ttaccccatgttatatattgtcgtgtttggcgctggccggatttgcagagtcatcatgag ctaaagccgttggatatttgtgaatttccttttggatctaagcaaaaagaagtttgtatc aacccataccactataagagagtggagagtccagtcttacctccagtattagtgcctcgt cataatgaattcaatccacaacacagccttctggttcagtttaggaacctgagccacaat gaaccacacatgccacaaaatgccacgtttccagattctttccaccagcccaacaacact ccttttcccttatctccaaacagcccttatcccccttctcctgctagcagcacatatccc aactccccagcaagttctggaccaggaagtccatttcagctcccagctgatacgcctcct cctgcctatatgccacctgatgatcagatgggtcaagataattcccagcctatggataca agcaataatatgattcctcagattatgcccagtatatccagcagggatgttcagcctgtt gcctatgaagagcctaaacattggtgttcaatagtctactatgaattaaacaatcgtgtt ggagaagcttttcatgcatcttctactagtgtgttagtagatggattcacagatccttca aataacaaaagtagattctgcttgggtttgttgtcaaatgttaatcgtaattcgacaatt gaaaacactaggcgacatattggaaaaggtgttcatctgtactatgttggtggagaggtg tatgcggaatgcctcagtgacagcagcatatttgtacagagtaggaactgcaactttcat catggctttcatcccaccactgtctgtaagattcccagcagctgcagcctcaaaattttt aacaatcaggagtttgctcagcttctggctcaatctgtcaaccatgggtttgaggcagta tatgagctcaccaaaatgtgtaccattcggatgagttttgtcaagttttcctgggccaag actaccataagcagtaaaccagagaacctgataaatccaggaaaataa >gi568815593f:136053761_136277477|GENSCAN_predicted_peptide_4|474_aa MLVWERVCTPLEVVCPACATQQPVGTRNKPGHAWREPVKEDNERSRSPGQPPPVPPLPRT PARDTSALRAARKLSLLLKTGAQQPEGVGTSSLNTGRLDRDQSLQDCSETLRWRRRRRTR RTPWPPALRALCHRALSRDPGLRLHLLSASPRRRSPEAPSTWSSLELRVRAERAVVKPKG ESVTGTSVLVATERNSEPEGRGHRTVQLRELGSASRRGKQLSAEASERTRPGVFAVPAGP KAPTLGQVKRSGWGRRDPRAMYGDIFNATGGPEAAVGSALAPGATVKAEGALPLELATAR GMRDGAATKPDLPTYLLLFFLLLLSVALVVLFIGCQLRHSAFAALPHDRSLRDARAPWKT RPDGLQGKLLPDLLNKFPLTGLLGTLSAGQFLPPGIFPFALSFPGPEAPEEPYLDQNRLV KEEPEDQLLNNCIISSPMLLPTSNMFYKQQFQEFDNFGAKVQKFKEKGYECFLS >gi568815593f:136053761_136277477|GENSCAN_predicted_CDS_4|1425_bp atgcttgtctgggagcgcgtctgcacacccttggaggtggtgtgcccggcctgcgctacc cagcagccagtgggcactcggaacaaaccaggccatgcctggagggagcctgtcaaggag gataatgaacgcagccgctcccccgggcagccgccgccagtcccgcccctcccccgaacc cccgcccgggacacgagcgcgctcagagccgccagaaagctcagtctcctgctcaaaaca ggagcgcagcagccggagggagtggggacaagctcgctgaacactgggcgtctcgacagg gaccagagcctgcaggactgttccgagacgctgcggtggcggcggcgccggaggacgcgg aggactccctggccacccgccctccgtgcgctctgccaccgggcgctcagcagagacccc gggctgcggctgcacttgctgagtgcctctccccgccgccgcagcccagaggcgcccagc acctggagctcactggagctgagggtccgagctgagcgggctgttgtaaagccgaaaggc gagtcagtgactgggacctctgtcctggtggcgacggagagaaacagtgagcccgagggg cgcgggcacagaactgtccagctccgagaactggggtccgccagccggcggggaaagcag ctgagcgcggaggcgagtgagcggacgcgtcccggggtctttgcagtccccgccggcccc aaggcaccgactctcggccaagtgaagcggtccggctggggtcggcgggacccgcgggcc atgtacggcgacatattcaacgccacgggcggccccgaggcggcggtaggcagcgcgctg gccccaggagccacggtcaaggcagaaggcgctttgccgctggagctggccactgcgcgc ggtatgagggacggcgcggccacaaagcccgacctgcccacctacctgctgctcttcttc ctgctgctgctctcggtggcgctcgtcgtcctcttcatcggttgccagctgcgccattcg gccttcgccgcgctgccccacgaccgctcgctgcgcgacgcccgcgcgccctggaagacg cggccggatggtctccagggaaaattgctgcctgacctgctgaataaattcccattaaca gggctcctggggaccctttctgcagggcagttccttccacctggcatcttcccttttgct ctgagctttcctgggcctgaagcccctgaagagccctatctggaccagaaccgtctggtg aaggaagagcctgaagatcagctactaaataactgcataattagtagcccaatgcttctt cccactagtaacatgttctacaagcaacaattccaagaatttgataactttggagcaaaa gtccaaaagttcaaagagaaaggatatgagtgctttcttagctga >gi568815593f:136053761_136277477|GENSCAN_predicted_peptide_5|129_aa MKLVALLPLLAKHGVVFPLLRLGLEAAFELPVDGRSRLDGLESSGPFLEKAFKMTLLPEV VLGFTKPAGMKWDEALSAPLWGSNGSAPWKPAVLGRFSPSALCCLTSVKLQTPELESTGS FQLIAHYDT >gi568815593f:136053761_136277477|GENSCAN_predicted_CDS_5|390_bp atgaagctggtggcactgttgcccttgctagccaagcatggagtcgtctttcctctgctg aggttggggctggaggctgcttttgagctgcctgtggatgggagaagtagactggatggc ttggagtcatctggccccttcttggagaaagcctttaagatgacactcctcccagaagtg gtgctgggcttcactaaacctgctggcatgaagtgggatgaggccctgtccgccccactc tggggctctaatggcagcgcaccctggaagcctgctgtcctaggcaggttttccccctca gcgctctgctgtctgacatcagtgaaactgcaaactcctgagttggagtccactggatcc ttccagctcatcgcacattacgatacatga >gi568815593f:136053761_136277477|GENSCAN_predicted_peptide_6|541_aa FVAHPNCQQQLLTMWYENLSGLRQQSIAVKFLAVFGVSIGLPFLAIAYWIAPCSKLGRTL RSPFMKFVAHAVSFTIFLGLLVVNASDRFEGVKTLPNETFTDYPKQIFRVKTTQFSWTEM LIMKWVLGMIWSECKEIWEEGPREYVLHLWNLLDFGMLSIFVASFTARFMAFLKATEAQL YVDQHVQDDTLHNVSLPPEVAYFTYARDKWWPSDPQIISEGLYAIAVVLSFSRIAYILPA NESFGPLQISLGRTVKDIFKFMVIFIMVFVAFMIGMFNLYSYYRGAKYNPAFTTVEESFK TLFWSIFGLSEVISVVLKYDHKFIENIGYVLYGVYNVTMVVVLLNMLIAMINNSYQEIEE DADVEWKFARAKLWLSYFDEGRTLPAPFNLVPSPKSFYYLIMRIKMCLIKLCKSKAKSCE NDLEMGMLNSKFKKTRYQAGMRNSENLTANNTLSKPTRYQKIMKRLIKRYVLKAQVDREN DEVNEGELKEIKQDISSLRYELLEEKSQATGELADLIQQLSEKFGKNLNKDHLRVNKGKD I >gi568815593f:136053761_136277477|GENSCAN_predicted_CDS_6|1626_bp ttcgttgctcatcctaactgtcagcagcaattgcttaccatgtggtatgaaaatctctca ggcttacgtcaacagtctatcgctgtgaaattcctggctgtctttggagtctccataggc ctcccttttctcgccatagcctattggattgctccgtgcagcaagctaggacgaaccctg aggagccctttcatgaagtttgtagctcatgcagtttcttttacaatcttcttgggatta ttagttgtgaatgcatctgaccgatttgaaggtgttaaaaccctgccaaacgaaaccttc acagactacccaaaacaaatcttcagagtgaaaaccacacagttctcctggacagaaatg ctcattatgaagtgggtcttaggaatgatttggtccgaatgcaaggaaatctgggaggag gggccacgggagtacgtgctgcacttgtggaacctgctagatttcgggatgctgtccatc ttcgtggcctccttcacagcacgcttcatggccttcctgaaggccacggaggcacagctg tacgtggaccagcacgtgcaggacgacacgctgcacaatgtctcgcttccgccggaagtg gcatacttcacctacgccagggacaagtggtggccttcagaccctcagatcatatcggaa gggctctacgcgatagccgtcgtgctgagcttctctcgcattgcatacattctgccagcc aacgagagttttgggcccctgcagatctcgctagggagaactgtgaaagatatcttcaag ttcatggtcattttcatcatggtatttgtggccttcatgattgggatgttcaacctgtac tcttactaccgaggtgccaaatacaacccagcgtttacaacggttgaagaaagttttaaa actttgttttggtccatattcggcttatctgaagtaatctcagtggtgctgaaatacgac cacaaattcatcgagaacattggctacgttctctacggcgtttataacgtcaccatggtg gtagtgttgctcaacatgctaatagccatgataaacaactcctatcaggaaattgaggag gatgcagatgtggaatggaagttcgcccgagcaaaactctggctgtcttactttgatgaa ggaagaactctacctgctccttttaatctagtgccaagtcctaaatcattttattatctc ataatgagaatcaagatgtgcctcataaaactctgcaaatctaaggccaaaagctgtgaa aatgaccttgaaatgggcatgctgaattccaaattcaagaagactcgctaccaggctggc atgaggaattctgaaaatctgacagcaaataacactttgagcaagcccaccagataccag aaaatcatgaaacggctcataaaaagatacgtcctgaaagcccaggtggacagagaaaat gacgaagtcaatgaaggcgagctgaaggaaatcaagcaagatatctccagcctgcgctat gagcttcttgaggaaaaatctcaagctactggtgagctggcagacctgattcaacaactc agcgagaagtttggaaagaacttaaacaaagaccacctgagggtgaacaagggcaaagac atttag