GENSCAN 1.0    Date run: 4-Nov-116    Time: 02:43:55

Sequence gi568815577f:25641599_25869229 : 227631 bp : 39.68% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init -   2389   2253  137  1  2   61   68   116 0.095   6.56
 1.00 Prom -   8687   8648   40                              -1.85

 2.00 Prom +  22006  22045   40                              -4.55
 2.01 Init +  23281  23353   73  0  1   87   14    93 0.149   3.08
 2.02 Intr +  42285  42350   66  1  0  100   87    49 0.090   3.96
 2.03 Intr +  48268  48375  108  2  0   49  121    17 0.341   0.44
 2.04 Intr +  52158  52325  168  1  0   81   20   172 0.449   8.70
 2.05 Intr +  57079  57281  203  2  2   37   95   151 0.635   8.78
 2.06 Intr +  60572  60671  100  1  1    8   87   102 0.400   0.86
 2.07 Intr +  64381  64488  108  2  0   43  108    48 0.302   1.64
 2.08 Intr +  73579  73714  136  2  1   59   70    74 0.024   1.41
 2.09 Term +  77441  77546  106  2  1   83   33    67 0.019  -2.40
 2.10 PlyA +  78254  78259    6                               1.05

 3.05 PlyA -  78761  78756    6                               1.05
 3.04 Term -  83079  83042   38  1  2  118   49    62 0.982   1.82
 3.03 Intr -  83752  83628  125  0  2  101  102   145 0.999  16.51
 3.02 Intr -  88203  88033  171  2  0  100   98   202 0.836  20.54
 3.01 Init -  88362  88328   35  0  2   61   80    34 0.807  -0.71
 3.00 Prom -  88690  88651   40                              -9.35

 4.00 Prom +  89658  89697   40                             -10.25
 4.01 Init +  90764  90772    9  1  0   55   61     0 0.228  -5.36
 4.02 Intr +  92181  92315  135  2  0   90  103    40 0.543   5.54
 4.03 Intr +  92595  92661   67  2  1   67   61    69 0.713  -0.34
 4.04 Intr +  93043  93171  129  2  0   93  100    82 0.777   9.65
 4.05 Intr +  93266  93472  207  0  0   13  110   174 0.549  10.23
 4.06 Intr +  93843  93980  138  1  0    8   92    92 0.256   1.11
 4.07 Intr +  99999 100077   79  1  1   13   81   117 0.026   1.19
 4.08 Intr + 103612 103756  145  1  1   36   91   134 0.999   7.86
 4.09 Intr + 107438 107522   85  1  1   45  115    57 0.997   2.57
 4.10 Intr + 110391 110636  246  1  0   47   45   278 0.453  15.91
 4.11 Intr + 116412 116606  195  1  0   71   11   122 0.533   1.26
 4.12 Intr + 120714 120767   54  1  0   88   92    79 0.982   6.33
 4.13 Intr + 122612 122752  141  0  0   81   68    72 0.923   3.90
 4.14 Intr + 122997 123189  193  1  1   72  121   237 0.998  23.03
 4.15 Term + 127406 127634  229  2  1   65   52   274 0.999  16.62
 4.16 PlyA + 127965 127970    6                               1.05

 5.00 Prom + 128614 128653   40                              -6.85
 5.01 Init + 137790 138008  219  2  0   71   73   296 0.972  25.08
 5.02 Intr + 152290 152394  105  0  0   51   98    48 0.014   1.59
 5.03 Intr + 168304 168342   39  0  0  138   61    45 0.027   4.30
 5.04 Intr + 170738 170882  145  1  1   -6   98   125 0.022   3.03
 5.05 Term + 181525 181637  113  2  2  102   32    81 0.861   1.64
 5.06 PlyA + 182698 182703    6                               1.05

 6.05 PlyA - 183328 183323    6                               1.05
 6.04 Term - 192532 192254  279  1  0   -7   48   289 0.692   9.86
 6.03 Intr - 195133 195059   75  1  0   65   72    55 0.410   0.39
 6.02 Intr - 201627 201471  157  2  1   37   44   154 0.363   4.99
 6.01 Init - 202238 202201   38  2  2   22   77    84 0.382   0.23
 6.00 Prom - 203126 203087   40                             -10.05

 7.00 Prom + 206163 206202   40                              -6.45
 7.01 Init + 206744 206924  181  1  1   83   61    91 0.725   5.29
 7.02 Term + 207090 207658  569  1  2   61   32   309 0.747  16.19
 7.03 PlyA + 207841 207846    6                               1.05

 8.04 PlyA - 208625 208620    6                               1.05
 8.03 Term - 214512 214313  200  1  2   17   49   143 0.495  -0.12
 8.02 Intr - 215359 215145  215  0  2   33   60   127 0.063   1.84
 8.01 Init - 220683 220613   71  0  2   89   23    78 0.036   1.98

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  41250  40891  360  0  0   46   99   181 0.835  11.23
S.002 Init + 100001 100077   77  1  2   91   81   115 0.969  11.71
S.003 Term + 138061 138117   57  0  0   95   39    59 0.823  -1.59

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815577f:25641599_25869229|GENSCAN_predicted_peptide_1|46_aa
MAFIASWWSDFNKASSSYLYIPIHLLYEEGPQEETNAMETMGGEHP

>gi568815577f:25641599_25869229|GENSCAN_predicted_CDS_1|138_bp
atggcctttatagcttcctggtggtctgacttcaataaagctagcagctcttacctctat
attccaatacatcttctctatgaggaaggcccacaggaggaaaccaatgcaatggaaacc
atgggtggtgaacatccn

>gi568815577f:25641599_25869229|GENSCAN_predicted_peptide_2|355_aa
MESVSLCETLNVNGQSEGEDAGKEHHKAYGFSAPKDQQVVTAVEYQEAILACKTPKKTVS
SRLEWKKLGRSVSFVYYQQTLQGDFKNRAEMIDFNIRIKNVTRSDAGKYRCEVSAPSEQG
QNLEEDTVTLEVLGDVHVLAPAVPSCEVPSSALSGTVVELRCQDKEGNPAPEYTWFKDGI
RLLENPRLGSQSTNSSYTMNTKTGTLQFNTVSKLDTGEYSCEARNSVGYRRCPGKRMQVD
DLNISGIIAAVVVVALVISVCGLGVCYAQRKGYFSKRRKRDGSYAQGFALCVYFHSWELK
ATSKEMPNGWSPQVCTTGFAWLSIASQRLSPGTSTLQYPLSHNVNLSSSKGSFSL

>gi568815577f:25641599_25869229|GENSCAN_predicted_CDS_2|1068_bp
atggaatctgtatccctttgtgaaacactaaatgtcaatggccagtctgagggagaagat
gcaggaaaagaacatcataaggcctatgggttttctgccccaaaagaccaacaagtagtc
acagcagtagagtaccaagaggctattttagcctgcaaaaccccaaagaagactgtttcc
tccagattagagtggaagaaactgggtcggagtgtctcctttgtctactatcaacagact
cttcaaggtgattttaaaaatcgagctgagatgatagatttcaatatccggatcaaaaat
gtgacaagaagtgatgcggggaaatatcgttgtgaagttagtgccccatctgagcaaggc
caaaacctggaagaggatacagtcactctggaagtattaggtgatgtgcatgtattggct
ccagcagttccatcatgtgaagtaccctcttctgctctgagtggaactgtggtagagcta
cgatgtcaagacaaagaagggaatccagctcctgaatacacatggtttaaggatggcatc
cgtttgctagaaaatcccagacttggctcccaaagcaccaacagctcatacacaatgaat
acaaaaactggaactctgcaatttaatactgtttccaaactggacactggagaatattcc
tgtgaagcccgcaattctgttggatatcgcaggtgtcctgggaaacgaatgcaagtagat
gatctcaacataagtggcatcatagcagccgtagtagttgtggccttagtgatttccgtt
tgtggccttggtgtatgctatgctcagaggaaaggctacttttcaaaaaggaggaagcgg
gatgggtcttatgcccaaggatttgcactttgtgtttacttccattcctgggaactaaaa
gcaacatcaaaagaaatgcctaatggctggagcccacaggtgtgcaccacaggatttgcc
tggctttcaatagcatctcaacgtttatctccgggaacttccaccctccagtaccctctc
agtcacaacgtcaatctctccagttccaaaggctccttttccctctaa

>gi568815577f:25641599_25869229|GENSCAN_predicted_peptide_3|122_aa
MGNKEEKGTVVRISMILQRLFRFSSVIRSAVSVHLRRNIGVTAVAFNKELDPIQKLFVDK
IREYKSKRQTSGGPVDASSEYQQELERELFKLKQMFGNADMNTFPTFKFEDPKFEVIEKP
QA

>gi568815577f:25641599_25869229|GENSCAN_predicted_CDS_3|369_bp
atggggaataaggaggagaagggaactgtggtcagaatcagcatgattcttcagaggctc
ttcaggttctcctctgtcattcggtcagccgtctcagtccatttgcggaggaacattggt
gttacagcagtggcatttaataaggaacttgatcctatacagaaactctttgtggacaag
attagagaatacaaatctaagcgacagacatctggaggacctgttgatgctagttcagag
tatcagcaagagctggagagggagctttttaagctcaagcaaatgtttggtaatgcagac
atgaatacatttcccaccttcaaatttgaagatcccaaatttgaagtcatcgaaaaaccc
caggcctga

>gi568815577f:25641599_25869229|GENSCAN_predicted_peptide_4|683_aa
MSQMIKLYLRKIKLHVFSQLICSKSDCIDVTPSATRVKSTFIVSIIHHYHRVPREHMIAS
DMIRKTGEGNKRVSILFLGLRQTPRSTNVEVSCDPGPGREGPQDTELLSPPSKVPSCQSL
RRHHLRSTSGPGSAPTRLPAIAMHYGPPFQSVDAHRTGSVSETVCDRTGLGETEAKQEEE
VEGPWDLTLLVAGAAGLTRRDAARGALAAAVLSRLWSAGGGDRADSGVAMTKREAEELIE
IEIDGTEKAECTEESIVEQTYAPAECVSQAIDINEPIGNLKKLLEPRLQCSLDAHEICLQ
DIQLDPERSLFDQGVKTDGTVQLSVQVISYQGIEPKLNILEIVKPADTVEVVIDPDAHHA
ESEAHLVEEAQVITLDGTKHITTISDETSEQVTRWAAALEGYRKEQERLGIPYDPIQWST
DQVLHWVVWVMKEFSMTDIDLTTLNISGRELCSLNQEDFFQRVPRGEILWSHLELLRKYV
LASQEQQMNEIVTIDQPVQIIPASVQSATPTTIKVINSSAKAAKVQRAPRISGEDRSSPG
NRTGNNGQIQLWQFLLELLTDKDARDCISWVGDEGEFKLNQPELVAQKWGQRKNKPTMNY
EKLSRALRYYYDGDMICKVQGKRFVYKFVCDLKTLIGYSAAELNRLVTECEQKKLAKMQL
HGIAQPVTAVALATASLQTEKDN

>gi568815577f:25641599_25869229|GENSCAN_predicted_CDS_4|2052_bp
atgagccagatgataaaactgtatctaagaaagattaagctgcacgtattctcacagtta
atttgctctaagtctgactgcatagatgttactccttctgcaacacgggtaaaatccact
ttcattgtttcaattatccaccattaccatcgggtgccacgagagcatatgatagcgtct
gacatgatcagaaagactggggaaggcaacaagagggtatcgatcttatttctgggtcta
cggcaaactccaaggtctacaaacgtagaggtcagctgtgaccccgggccaggccgtgaa
ggtccccaggacacagagctgctctctcctcctagtaaagtcccgagctgccaaagcctc
cgccgccaccacctccgctctacttccggccctggctccgcccccacacgcctacccgcc
atcgcaatgcattatgggccgccgtttcagtcggtcgacgctcaccggacaggaagcgtc
tcggagacagtctgcgaccggacgggtctaggtgagacagaagccaaacaggaggaggaa
gtggaggggccctgggacctcacacttctagtcgcgggagctgcaggtcttacccggaga
gacgctgcacgtggagccctcgccgctgccgttctcagccggctctggagtgcgggcggg
ggcgacagggccgattccggagtggccatgactaaaagagaagcagaggagctgatagaa
attgagattgatggaacagagaaagcagagtgcacagaagaaagcattgtagaacaaacc
tacgcgccagctgaatgtgtaagccaggccatagacatcaatgaaccaataggcaattta
aagaaactgctagaaccaagactacagtgttctttggatgctcatgaaatttgtctgcaa
gatatccagctggatccagaacgaagtttatttgaccaaggagtaaaaacagatggaact
gtacagcttagtgtacaggtaatttcttaccaaggaattgaaccaaagttaaacatcctt
gaaattgttaaacctgcggacactgttgaggttgttattgatccagatgcccaccatgct
gaatcagaagcacatcttgttgaagaagctcaagtgataactcttgatggcacaaaacac
atcacaaccatttcagatgaaacttcagaacaagtgacaagatgggctgctgcactggaa
ggctataggaaagaacaagaacgccttgggataccctatgatcccatacagtggtccaca
gaccaagtcctgcattgggtggtttgggtaatgaaggaattcagcatgaccgatatagac
ctcaccacactcaacatttcggggagagaattatgtagtctcaaccaagaagattttttt
cagcgggttcctcggggagaaattctctggagtcatctggaacttctccgaaaatatgta
ttggcaagtcaagaacaacagatgaatgaaatagttacaattgatcaacctgtgcaaatt
attccagcatcagtgcaatctgctacacctactaccattaaagttataaatagtagtgcg
aaagcagccaaagtacaaagagcgccgaggatttcaggagaagatagaagctcacctggg
aacagaacaggaaacaatggccaaatccaactatggcagtttttgctagaacttcttact
gataaggacgctcgagactgcatttcttgggttggtgatgaaggtgaatttaagctaaat
cagcctgaactggttgcacagaaatggggacagcgtaaaaataagcctacgatgaactat
gagaaactcagtcgtgcattaagatattattacgatggggacatgatttgtaaagttcaa
ggcaagagatttgtgtacaagtttgtctgtgacttgaagactcttattggatacagtgca
gcggagttgaaccgtttggtcacagaatgtgaacagaagaaacttgcaaagatgcagctc
catggaattgcccagccagtcacagcagtagctctggctactgcttctctgcaaacggaa
aaggataattga

>gi568815577f:25641599_25869229|GENSCAN_predicted_peptide_5|206_aa
MNTPEGRNSEHIRTSEGTNSGHAAFKNCNTARVHGFMLEVGKTKNPPIPDTFWRPRRDFC
LSLSGETIAYRQACWGYMHEPPHLAMSSSFCCLSVSEVPPVIKDTSQKCDAPDPPSPSIM
INLHPEYTKNSNSMKSRQNIRTNIPIRRYKNDRLGVVAHGCNPRTCKGQGVFVRCGQAGL
RLGLRLPQQFYFCQLGYKKLCCNLVV

>gi568815577f:25641599_25869229|GENSCAN_predicted_CDS_5|621_bp
atgaacacaccggaaggaagaaactcggaacacatccgaacatcagaaggaacaaactcc
ggacacgccgcctttaagaactgtaacaccgccagggtccacggcttcatgcttgaagtt
ggtaagaccaagaacccaccaattccggacacgttttggcgaccacgaagggacttttgc
ctgtcgctgagcggtgagaccatcgcctatcgccaagcatgctggggttatatgcatgag
ccgccacacttggccatgtcatcttccttctgttgcttgtctgtatcggaagttcctccc
gttattaaggataccagtcaaaagtgtgatgctcccgatcctccttcaccttccatcatg
attaacttgcatccagaatatacaaagaactcgaactcaatgaaaagtcgacaaaatatt
cgaacaaacattccaataagacgatataaaaatgaccggctgggcgtggttgctcatggc
tgcaatcccagaacttgtaaaggccaaggtgtatttgtcaggtgtggacaggcaggtctg
cggttgggcttgcgtcttcctcaacagttctacttctgccaactgggttataaaaagctc
tgctgcaatctggtggtttaa

>gi568815577f:25641599_25869229|GENSCAN_predicted_peptide_6|182_aa
MNLTDPCKEQEQCNDPKTQLCSHYQGTFDRKTLRGSFSPEIHQAAERECGGETEGERVSA
DRVTKKSPQKKLLLLLSVKVELSYVIADKPLATLNPLWEEASRQANTGSGQLLLGASRSK
LYAGPMAASKWGYLRPLKPKRTCYSAISALPATFGLSVNSSVGPLPHHMRRLPSTTNGKG
PV

>gi568815577f:25641599_25869229|GENSCAN_predicted_CDS_6|549_bp
atgaacctgacggacccctgcaaggaacaagagcagtgcaacgacccaaaaactcagctc
tgttctcattaccagggaacatttgacaggaaaaccttgagaggtagtttttccccagag
attcatcaggcagcagaaagggaatgtgggggagaaactgaaggagaacgagtgtcagca
gacagagttaccaagaaaagtcctcagaagaaattattattgttactctcagtgaaagtg
gagctgtcttatgtaattgctgacaagccacttgccacactcaaccccttgtgggaggaa
gcaagtaggcaagcaaatacgggatctggccagctgcttttgggtgccagcaggagcaaa
ctctatgcaggccccatggcagcatccaagtggggctacctaagacccctgaagcccaag
aggacatgttacagtgctatttcagctctgccagccacgttcggcttaagtgttaacagt
tcagtgggccctttgcctcatcacatgaggcggctgccctccaccaccaacggcaaaggg
ccagtgtga

>gi568815577f:25641599_25869229|GENSCAN_predicted_peptide_7|249_aa
MRLKRKTHFLGKDSSPSQLQKFANHQDNGENISEEFQRSSEQPLLLQAQRPRREKWFNGP
GAEKTRVELWKPPLRFQRIYGNAWIFRQNSVAGEESSWRTSTRAMQRENVGLEPPHRVPT
GALPSGAVRRGPPSSRPQNGKSTDSLHCAPGKATGTQCQPVKAAVGVEPCKATGVELPKA
MGAHLLHQHALNVRHGVKEYYFGTLRFNECPVRFWTCMGPVAPLILAISPIWNGNICTMP
VPPLCLGNN
>gi568815577f:25641599_25869229|GENSCAN_predicted_CDS_7|750_bp
atgaggttgaaaagaaaaacccattttctggggaaggattcaagtccaagccagctgcag
aaatttgccaatcaccaagacaatggggaaaatatctctgaggaatttcagagatcttca
gagcagcccctcctattacaggcccagaggcctaggagggaaaaatggtttaatgggcca
ggtgcagagaagacaagagttgagctttggaaacctccacttagatttcagaggatatat
ggaaatgcctggatatttaggcagaactctgttgcaggggaagagtcctcatggagaact
tctactagggcaatgcagagggaaaatgtggggttggagcccccacatagagtccccact
ggggcactgcctagtggagccgtgagaagaggaccaccgtcctccagaccccagaatggt
aaatccactgacagcttgcactgtgcacctggaaaagccacaggcactcaatgccagcct
gtgaaagcagctgtgggggtggaaccctgcaaagccacaggggtggagctgcccaaggcc
atgggagcccacctcttgcatcagcatgccctgaatgtgcgacatggagtcaaagaatat
tactttggaactttaagatttaatgagtgccctgtcaggttttggacttgcatggggcct
gtggcccctttgattttggcaatttctcccatttggaatgggaacatctgcacaatgcct
gtacccccattgtgtcttggaaataactaa

>gi568815577f:25641599_25869229|GENSCAN_predicted_peptide_8|161_aa
MTSPDTEFAGALILAVPDLQICKKVKNDFMTYEMDLEEWIEFAQEQMNACAGQGGLYFLE
YRKLAKYKCGEVPKRGKENRKKSTLHFEVVRICSGEESLEKLQTVVTEVEMPAIPVPSAI
VEAESPKSTSKGISGMTLFCRVGVMRGTGSCSFHLQAAFWG

>gi568815577f:25641599_25869229|GENSCAN_predicted_CDS_8|486_bp
atgacctcaccagacactgaatttgctggtgccttgatcttggcggtcccagacctccag
atctgtaagaaggtaaagaatgatttcatgacatatgagatggatcttgaagaatggatt
gaatttgcacaggaacaaatgaatgcctgtgcagggcagggaggactatatttcttggaa
tacagaaaattagcaaagtataaatgtggggaagtgcccaagagagggaaagagaacagg
aagaaatccactctgcactttgaggttgtcagaatctgttcaggagaggagtcattagag
aagctgcagactgttgtgacagaggttgagatgcctgctattccagttccaagtgcaata
gtggaggcagaaagtcctaaatccacaagtaaagggattagtgggatgactcttttttgt
cgtgttggtgtcatgagggggactggcagttgttcgttccatctgcaagctgcattttgg
ggataa