GENSCAN 1.0 Date run: 6-Nov-116 Time: 21:24:42 Sequence gi568815583f:96232106_96437619 : 205514 bp : 43.25% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 20021 20100 80 1 2 103 84 7 0.011 0.15 1.02 Intr + 30918 30973 56 0 2 86 60 120 0.021 7.62 1.03 Intr + 42192 42253 62 1 2 63 77 16 0.036 -3.55 1.04 Term + 48580 48822 243 0 0 31 43 585 0.217 44.20 1.05 PlyA + 49189 49194 6 -0.45 2.05 PlyA - 49818 49813 6 1.05 2.04 Term - 50884 50792 93 1 0 85 45 71 0.112 0.33 2.03 Intr - 53354 53221 134 0 2 67 57 78 0.086 2.96 2.02 Intr - 58678 58525 154 1 1 43 93 68 0.471 2.45 2.01 Init - 59426 59355 72 2 0 18 100 47 0.554 -0.03 2.00 Prom - 60284 60245 40 -2.56 3.02 PlyA - 60605 60600 6 1.05 3.01 Sngl - 99242 98889 354 2 0 64 44 537 0.999 40.95 3.00 Prom - 99599 99560 40 -12.87 4.00 Prom + 99807 99846 40 -12.11 4.01 Init + 100007 100442 436 1 1 53 69 645 0.556 55.43 4.02 Intr + 101971 102498 528 2 0 110 91 836 0.914 79.01 4.03 Term + 105243 105517 275 1 2 85 36 115 0.427 1.63 4.04 PlyA + 105521 105526 6 1.05 5.00 Prom + 106085 106124 40 -4.66 5.01 Init + 109147 109200 54 0 0 125 57 83 0.535 8.18 5.02 Term + 113815 113967 153 0 0 23 49 177 0.425 5.22 5.03 PlyA + 114905 114910 6 1.05 6.04 PlyA - 115355 115350 6 1.05 6.03 Term - 125083 124811 273 1 0 64 49 168 0.281 5.97 6.02 Intr - 137983 137873 111 1 0 105 78 12 0.107 2.48 6.01 Init - 146531 146475 57 2 0 52 110 11 0.092 1.01 6.00 Prom - 147854 147815 40 -2.06 7.00 Prom + 150979 151018 40 -2.86 7.01 Init + 153615 153674 60 2 0 59 85 10 0.013 -0.89 7.02 Intr + 156252 156477 226 2 1 47 95 205 0.013 14.66 7.03 Intr + 158179 158356 178 2 1 109 99 1 0.012 2.28 7.04 Intr + 184197 184230 34 0 1 98 103 -4 0.253 0.33 7.05 Term + 184703 185362 660 1 0 6 35 247 0.924 5.11 7.06 PlyA + 186675 186680 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:96232106_96437619|GENSCAN_predicted_peptide_1|146_aa KILEPPLMLTEVVLPPEIRDSAPCWRGNSQYGDKDGYHSYGHLEPLNHKMRKTTLSENLA YHQYSLPRQQEQNSMKEEEEEEEEEEEEEEEKEEGEGEGEEEEEEEEEEEEEEEEEEEEE EEEEEEEEEEEEEEEGGGGGRGGCGK >gi568815583f:96232106_96437619|GENSCAN_predicted_CDS_1|441_bp aaaatcctagaacctcctttgatgttgaccgaggttgtcttgcctccagaaataagagat tctgctccatgctggcggggaaatagccagtacggggacaaagatggctaccacagctat ggccatctggagcccctcaatcacaagatgcgtaaaaccacattgtctgagaatcttgca tatcatcaatattcattgcctaggcaacaagagcaaaactccatgaaagaagaagaagaa gaagaagaagaagaagaagaagaagaagaagaaaaagaagaaggagaaggagaaggagaa gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaggaagaggaagag gaagaggaagaggaagaagaagaagaagaggaagaagaagaagaaggaggaggaggaggc aggggaggatgtggaaaatag >gi568815583f:96232106_96437619|GENSCAN_predicted_peptide_2|150_aa MNFGASPGTVLKERGGQPDIGVEKVPVPLGIVLYATSSQEWQAVESLAFVGPLDIWQRTR KPRFPSVSPSAGKLQAPHAAAVYRPPGLPGEVSRSSCSRGVEQLHEDPKVAKNNSAFQMR EPVVQHSVMGQWALVPQCPASPPDTALRSS >gi568815583f:96232106_96437619|GENSCAN_predicted_CDS_2|453_bp atgaattttggggcctccccaggaactgttttaaaagaaagagggggccagccagacatt ggtgtggagaaggtcccagttcctctgggaatcgtcctgtatgcaacatcttcacaagag tggcaggcagttgaatctctggccttcgtgggaccactggacatctggcagagaaccagg aaaccacggttcccttctgtgagtccctcagccggaaaactacaagctcctcatgcagct gctgtctacagaccccctggactgcctggggaagtgagtcggagctcctgcagcagaggg gtggaacagctacacgaagatccgaaggtggccaaaaataacagcgctttccagatgagg gagccagttgtccagcacagtgtcatgggacagtgggccctggtgccacaatgcccggca tctccaccagatacagctttgagatcttcatga >gi568815583f:96232106_96437619|GENSCAN_predicted_peptide_3|117_aa MKVVRVGRCAASSWRWAAAAAACPGRPARASLSPAAAARSRWVEAARRSLRAARAAAAAG AAAEAAGSGPEPLRLAPPPPLKSLLPLLPPPEPLLLLLPPPPHTHRERVNSPAAGEK >gi568815583f:96232106_96437619|GENSCAN_predicted_CDS_3|354_bp atgaaagtggtccgcgtcgggcgctgcgcggcgtcgtcctggcgctgggccgccgccgcc gccgcctgccccgggcgtccggcgcgcgcctcgctctctccggccgccgccgcccgcagt cgctgggtggaggccgctcgccgttccctgcgggccgcccgggccgccgctgccgccgga gccgccgccgaagccgccgggtcgggcccggagccgctgcgtctagcgccgccgccgccg ctgaagtcgctgctgccgctgctgccgccgccggagccgctgctgctgctgctgccgccg ccgcctcacacacatagggaaagagtcaactcgccggcggcgggggagaaatga >gi568815583f:96232106_96437619|GENSCAN_predicted_peptide_4|412_aa MVVSTWRDPQDEVPGSQGSQASQAPPVPGPPPGAPHTPQTPGQGGPASTPAQTAAGGQGG PGGPGSDKQQQQQHIECVVCGDKSSGKHYGQFTCEGCKSFFKRSVRRNLSYTCRANRNCP IDQHHRNQCQYCRLKKCLKVGMRREAVQRGRMPPTQPTHGQFALTNGDPLNCHSYLSGYI SLLLRAEPYPTSRFGSQCMQPNNIMGIENICELAARMLFSAVEWARNIPFFPDLQITDQV ALLRLTWSELFVLNAAQCSMPLHVAPLLAAAGLHASPMSADRVVAFMDHIRIFQEQVEKL KALHVDSAEYSCLKAIVLFTSDACGLSDVAHVESLQEKSQCALEEYVRSQYPNQPTRFGK LLLRLPSLRTVSSSVIEQLFFVRLVGKTPIETLIRDMLLSGSSFNWPYMAIQ >gi568815583f:96232106_96437619|GENSCAN_predicted_CDS_4|1239_bp atggtagtcagcacgtggcgcgacccccaggacgaggtgcccggctcacagggcagccag gcctcgcaggcgccgcccgtgcccggcccgccgcccggcgccccgcacacgccacagacg cccggccaagggggcccagccagcacgccagcccagacggcggccggtggccagggcggc cctggcggcccgggtagcgacaagcagcagcagcagcaacacatcgagtgcgtggtgtgc ggagacaagtcgagcggcaagcactacggccagttcacgtgcgagggctgcaagagcttc ttcaagcgcagcgtgcggaggaacctgagctacacgtgccgcgccaaccggaactgtccc atcgaccagcaccatcgcaaccagtgccagtactgccgcctcaaaaagtgcctcaaagtg ggcatgagacgggaagcggtgcagaggggcaggatgccgccgacccagccgacccacggg cagttcgcgctgaccaacggggatcccctcaactgccactcgtacctgtccggatatatt tccctgctgttgcgcgcggagccctatcccacgtcgcgcttcggcagccaatgcatgcag cccaacaacatcatgggtatcgagaacatttgcgaactggccgcgaggatgctcttcagc gccgtcgagtgggcccggaacatccccttcttccccgacctgcagatcacggaccaggtg gccctgcttcgcctcacctggagcgagctgtttgtgttgaatgcggcgcagtgctccatg cccctccacgtcgccccgctcctggccgccgccggcctgcatgcttcgcccatgtccgcc gaccgggtggtcgcctttatggaccacatacggatcttccaagagcaagtggagaagctc aaggcgctgcacgttgactcagccgagtacagctgcctcaaggccatagtcctgttcacc tcagatgcctgtggtctctctgatgtagcccatgtggaaagcttgcaggaaaagtctcag tgtgctttggaagaatacgttaggagccagtaccccaaccagccgacgagattcggaaag cttttgcttcgcctcccttccctccgcaccgtctcctcctcagtcatagagcaattgttt ttcgtccgtttggtaggtaaaacccccatcgaaaccctcatccgggatatgttactgtcc ggcagcagttttaactggccgtatatggcaattcaataa >gi568815583f:96232106_96437619|GENSCAN_predicted_peptide_5|68_aa MASPTAPRAPGCQARQGRRQISDQIAGVAGTLRFCSIQLAEGEPRVVLDPNSNWGSGITE DNGKGGAA >gi568815583f:96232106_96437619|GENSCAN_predicted_CDS_5|207_bp atggcgtctcccactgcaccgcgagcgccgggctgccaggctcggcagggccggcggcag atctccgaccaaatcgcaggcgtggccgggactcttcgcttctgcagcatacagctggcc gagggggagccgcgggtggttttggaccccaacagcaactgggggtcagggataacggag gacaatgggaaagggggagctgcctag >gi568815583f:96232106_96437619|GENSCAN_predicted_peptide_6|146_aa MVVQKGMVVPASQQRLPSQEAQEAMEGPDSGESSHTPSSGTQERNRTLFCRKVDVGVGLS APDFPLRGFRNPELTLLGGRCLAAWTRRRMRSVAKTRARCRTLERHLAAETGMTPPSLPA CGSLIVRHPDFFRNRNPQRSRNAQVF >gi568815583f:96232106_96437619|GENSCAN_predicted_CDS_6|441_bp atggttgtacaaaagggtatggtagtgccggcttctcaacaaaggctcccaagtcaggaa gcccaggaagccatggaaggaccagacagcggggaatcctcccacaccccatcttccggc acacaggaaaggaacagaactttgttttgcagaaaagtagatgtaggtgtgggtctgagc gctcccgattttccgctcagaggctttaggaaccccgagttaacattgctcggtggccgc tgcctcgctgcgtggacgcggcggcggatgcgctcggtggcgaaaacccgggcgcgctgc cgtaccttggagcgccacctcgctgcagaaactggaatgacgccgccaagcctcccagcc tgcggcagcctaattgtacgtcacccggactttttccgtaatcgaaaccctcagagaagc cgcaacgcacaggtcttctga >gi568815583f:96232106_96437619|GENSCAN_predicted_peptide_7|385_aa MGYEKYSLLCAWKEEESQTLVSTGYHIGRLIIDANLEASGTPIHKVDAVVGPEGDNASID IFGNIWYIRQQVHVFTMYFMVQHHGTTGSKPCIYHGGNYCFLKPGVYGYGTATSSFKSHP LICVGKVHHYHLLIIFGLGGTDFQLSQVKYSLQTRTPESACGCQFQLCPGVDLRETGAEA RGDLGGAALTGAGDAQIGSVPSSRRPRGGHDALTNVAVNTSSCPGELFQQTGREPYYLSG RRRLSPLSVFMLHRRHLGELSFRCGINLHLLRDSLSAFTRLSHSRRARPSPSARPPPSWS SQRRAEPHCRGEEPGVKLGQGERAADARRERKVKVKSSAWASTQDGPGTNAFHFCLFASG ESGVLNHSEELFLSHKAHGRKHPRC >gi568815583f:96232106_96437619|GENSCAN_predicted_CDS_7|1158_bp atggggtatgaaaagtatagtcttctgtgtgcttggaaggaagaggagagccagacattg gtctccaccgggtaccacattgggaggctgataattgatgccaaccttgaagccagtggg acaccaatccacaaagtggatgctgtggttggtcctgagggtgacaatgccagcattgac atatttgggaacatctggtacatcaggcagcaagtacatgtatttaccatgtacttcatg gtacaacatcatggtacaacaggcagcaagccatgtatttaccatggtggcaattactgt ttcctcaagcctggagtgtatggttatgggacagccacttcctccttcaaatcacatcca ttaatctgtgtgggcaaagtccatcactatcatctattgatcatatttggcttgggtgga acagattttcaactgtcacaagtaaaatactccctacagaccaggaccccggagagtgct tgcgggtgtcagtttcagctctgccccggcgtggatcttcgggaaaccggagcggaggct cgtggagacctgggaggagctgcgctgactggggcaggggacgcccagattggatcggtt ccgagctctcgtcgcccccgaggagggcacgacgcgctgacaaacgtggccgtcaacacc tcgagctgccccggcgagttatttcaacagactgggagagagccgtactatttgagtggc aggcgacgcctttccccactctcggtatttatgttgcaccgacggcacttaggagagctt agttttagatgcggcattaatcttcacttgttaagagatagtctgtcagcctttaccagg ctctcccactcccgacgtgcaaggccctcccccagcgcgcggccacccccaagctggtcc tcgcaaaggcgcgcggagccgcactgccgaggagaggagcccggggttaaactggggcaa ggcgagcgagccgcggacgccaggcgcgagcgcaaggttaaagtcaaaagctctgcctgg gccagcactcaagacggtcctggcaccaacgcctttcatttttgcctgtttgcaagcggg gaaagtggagtcctcaatcatagcgaggagcttttcttaagccacaaagcacacgggcgc aaacatcctcgctgctaa