GENSCAN 1.0    Date run: 2-Nov-116    Time: 22:43:54

Sequence gi568815587r:123626000_123830888 : 204889 bp : 40.70% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   1023   1201  179  1  2   77   47   175 0.833   9.17
 1.02 PlyA +   1770   1775    6                               1.05

 2.07 PlyA -   1792   1787    6                               1.05
 2.06 Term -   8207   8144   64  1  1  122   42    95 0.917   4.68
 2.05 Intr -  12325  12187  139  2  1   98   96   110 0.844  11.40
 2.04 Intr -  15039  14982   58  0  1   89   81    37 0.503   0.64
 2.03 Intr -  15694  15564  131  2  2   47   49    84 0.598  -0.11
 2.02 Intr -  16672  16447  226  1  1  126   72   379 0.997  36.74
 2.01 Init -  19011  18892  120  0  0   31   20   144 0.176   0.19
 2.00 Prom -  20214  20175   40                              -6.15

 3.00 Prom +  20680  20719   40                              -6.55
 3.01 Init +  25552  25600   49  0  1   96   58    60 0.709   2.96
 3.02 Intr +  27630  28153  524  1  2   79   98   279 0.793  19.74
 3.03 Intr +  28242  28480  239  2  2   97   47   146 0.403   6.89
 3.04 Intr +  36262  36334   73  1  1  104   89    35 0.084   3.69
 3.05 Term +  36791  36916  126  1  0   99   39   126 0.093   6.10
 3.06 PlyA +  39022  39027    6                               1.05

 4.06 PlyA -  39039  39034    6                               1.05
 4.05 Term -  39306  39137  170  1  2  100   36    98 0.290   2.86
 4.04 Intr -  43668  43620   49  0  1  108   83    37 0.089   2.53
 4.03 Intr -  46013  45935   79  1  1   70   87    44 0.090   1.13
 4.02 Intr -  52299  52208   92  0  2   53   73   116 0.064   4.47
 4.01 Init -  52423  52352   72  1  0   70   58    51 0.719   1.42
 4.00 Prom -  73663  73624   40                              -3.05

 5.04 PlyA -  73780  73775    6                               1.05
 5.03 Term -  76397  76291  107  0  2   55   49   142 0.953   4.59
 5.02 Intr -  78054  77872  183  1  0   12   89   137 0.755   5.04
 5.01 Init -  87549  87309  241  0  1   71   11   126 0.057   1.28
 5.00 Prom -  88110  88071   40                              -4.05

 6.08 PlyA -  88523  88518    6                               1.05
 6.07 Term -  95751  95575  177  0  0  100   55   167 0.808  11.30
 6.06 Intr - 100992 100121  872  1  2  107   -3   740 0.170  56.92
 6.05 Intr - 101596 101477  120  2  0   41   94   104 0.876   5.85
 6.04 Intr - 102323 102134  190  2  1   30   47   234 0.681  11.74
 6.03 Intr - 103215 103055  161  1  2  108   72   134 0.820  12.59
 6.02 Intr - 103910 103616  295  2  1   54   44   314 0.762  19.26
 6.01 Init - 104889 104488  402  0  0   66   68   435 0.955  35.97
 6.00 Prom - 107743 107704   40                              -5.75

 7.00 Prom + 116783 116822   40                              -4.45
 7.01 Init + 117878 117901   24  1  0   93   67    48 0.043   2.78
 7.02 Term + 127935 128282  348  2  0    3   43   484 0.101  28.90
 7.03 PlyA + 128439 128444    6                               1.05

 8.00 Prom + 143644 143683   40                              -3.25
 8.01 Init + 171913 171993   81  0  0   69  115    38 0.478   5.62
 8.02 Intr + 174773 174824   52  1  1   83  105    37 0.325   2.46
 8.03 Term + 176187 176278   92  1  2   54   42    64 0.240  -4.70
 8.04 PlyA + 176408 176413    6                               1.05

 9.04 PlyA - 177459 177454    6                               1.05
 9.03 Term - 180380 179409  972  2  0  123   47   520 0.468  42.46
 9.02 Intr - 201759 201341  419  1  2   42   20   177 0.013  -0.88
 9.01 Init - 203028 202602  427  0  1   81   53   246 0.758  16.91

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  52299  52129  171  0  0   53   44   165 0.841   5.44

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:123626000_123830888|GENSCAN_predicted_peptide_1|59_aa
XPGSCDWRSSTGPGARTGVLAADEQHLAGQEELMLTISPTSEGLKNIVVLQPSLLLSIQ

>gi568815587r:123626000_123830888|GENSCAN_predicted_CDS_1|180_bp
nctccggggtcttgtgactggagatcctcaacaggccctggagccaggactggagtcttg
gcagctgatgagcagcaccttgccggccaggaggagctgatgctgacgatctccccaaca
tctgaaggcttaaagaacattgtcgttcttcagccctccttgcttctctcaatacaataa

>gi568815587r:123626000_123830888|GENSCAN_predicted_peptide_2|245_aa
MIWKRRDSLLAELGVFQILLLHHSQCQFWVVGASGKVVVYIYEYRNGHQEVESPFQGRLQ
WNGSKDLQDVSITVLNVTLNDSGLYTCNVSREFEFEAHRPFVKTTRLIPLRVTEEASSVF
VSGKAFQQPNSGVSVATGVLHPEGGVPAFLSLSPALSLLTPSCIAGGKCGKAELVRLPAG
EDFTSVVSEIMMYILLVFLTLWLLIEMIYCYRKVSKAEEAAQENASDYLAIPSENKENSA
VPVEE

>gi568815587r:123626000_123830888|GENSCAN_predicted_CDS_2|738_bp
atgatttggaaacgaagagattcgctcctggcagagctgggagtcttccagattctgctc
ctgcaccatagccagtgtcagttctgggtggttggtgccagtggcaaagtggttgtatat
atttacgagtatcggaatggccaccaggaggtggagagcccctttcaggggcgcctgcag
tggaatggcagcaaggacctgcaggacgtgtccatcactgtgctcaacgtcactctgaac
gactctggcctctacacctgcaatgtgtcccgggagtttgagtttgaggcgcatcggccc
tttgtgaagacgacgcggctgatccccctaagagtcaccgaggaggctagttccgtgttc
gtgagtgggaaggcctttcagcagcctaactcaggtgtttctgttgcaaccggagtcctg
catcctgaggggggagttccagcctttctcagtctatctccagcattatcactgctaaca
ccctcgtgcatagctggtgggaagtgtggcaaggctgagctagttcgtctgccagctgga
gaggacttcacctctgtggtctcagaaatcatgatgtacatccttctggtcttcctcacc
ttgtggctgctcatcgagatgatatattgctacagaaaggtctcaaaagccgaagaggca
gcccaagaaaacgcgtctgactaccttgccatcccatctgagaacaaggagaactctgcg
gtaccagtggaggaatag

>gi568815587r:123626000_123830888|GENSCAN_predicted_peptide_3|336_aa
MGFRRVGQAGLELLTSDTHRGKPARALFCHQRHQCVIIVSIVTVTCIRHGEVLVLTPVDK
HERSQGKQSIEGRHLLGLAASKATQRDSLGQGLRPLRFFRGNPLGRTAPRGRLSDRRKGR
APVPGEALGAFGAGWGLWALSDPTGPPQFEGADQPLRARSRRRSAPRTVASPPTPELQLP
LAPLPRSFAPRDRLRSDWNPPSPAAPASSALTASPAGRDPTPEAAGAARLGYRMRSTSVP
STPALGSLGRCSGAGKEAAAPVRPAMSEVGGCTVQAVSGSTILGSGGWWPCLPGQAQHHV
EAAKAWGWHPMKPQPELYISPFQPQLEQLEHRTPSP

>gi568815587r:123626000_123830888|GENSCAN_predicted_CDS_3|1011_bp
atggggtttcgccgtgttggccaggctggtcttgaactcctgacctcagatacgcacaga
ggcaagccagccagagctcttttctgtcaccaacgacatcaatgtgttattattgttagc
attgttactgttacctgcatccggcatggcgaggtgctggtacttaccccagtagataag
cacgagagaagccaggggaaacaatctattgaaggcaggcatcttctggggctggcggct
tccaaggctacacagagagattccctcggtcaaggactgcgccctctcagattctttcga
ggaaaccctttgggacgaactgcccccagggggcgactttctgaccgaaggaagggccga
gcaccggtgcctggggaggccctgggagcttttggagccgggtgggggctttgggcccta
agcgaccccactggacctccccagttcgagggagccgatcagccgctccgcgcccgctct
cgccggcgctcagcaccacggacagtcgcctccccgcccaccccggaactccagcttcca
ctcgcgcccctgcctcgctcgtttgcgcccagggatcggttgcgttccgactggaatcct
cccagccccgcggctcctgcttcgtccgccctaactgcttctccagctgggcgtgacccc
actcccgaagccgccggggccgcccgtctgggctacagaatgcgttccacgtcggtcccg
tccacgcccgccctcggctccctggggaggtgcagcggggccgggaaggaagcggcagct
cccgttcggccggcgatgtcagaggttggggggtgcacagtgcaagctgtcagtggatct
accattctggggtctggaggatggtggccttgtcttccaggacaggctcaacaccatgtg
gaagctgccaaggcttggggctggcaccccatgaagccacagcctgagctctacatcagc
ccctttcagccacagctagagcagctggaacacaggacaccaagtccctag

>gi568815587r:123626000_123830888|GENSCAN_predicted_peptide_4|153_aa
MQEIVRDNHVQQLKWGHKRETCGEPGAAVGHRGPQSGGQAECRAEEGQNHLRPTRGSQQA
DKNSNSQIKVKMTPSLWAEGQRTAKHPITFNPVQQNIHSLVKEALLFPLFILGLVYKAFP
GHTAAGKSGFMRVVTERNQRIENEEWVALPHIS

>gi568815587r:123626000_123830888|GENSCAN_predicted_CDS_4|462_bp
atgcaggaaattgtgagagacaatcatgtccagcagctaaagtggggacacaaaagagaa
acttgcggagagcctggagctgctgttggtcatcggggcccacagtctggaggacaagct
gaatgcagagcagaggagggccagaaccacctaagacccacaagagggagccaacaggct
gacaagaacagcaattcccagatcaaggtcaaaatgacaccatcgctgtgggcagaaggg
cagagaactgccaaacatccaataactttcaatccagtccagcaaaacatacatagccta
gttaaagaggcccttctctttcctcttttcatcctggggttggtgtacaaggcctttcct
ggccatacggctgctgggaaatctggtttcatgagagtggtgactgagagaaatcaaaga
atagaaaatgaagaatgggtggctctgccacatataagttaa

>gi568815587r:123626000_123830888|GENSCAN_predicted_peptide_5|176_aa
MGKDFMTKTPKAMATKAKIDKWDLIKLKSFCTAKETIIRVNRQPTEWEKIFAIYPSDKGL
ISRIYKEHKHIYKKKTTPSKNNSSSNVAQGSQKFGHPCSRGSKGEADPCLFQLLVAAGVA
FPGCGCIAPNPSFTFTWHPASAERQPEAATAAAPISAFRPPPRGRGQGGHVAGKAT

>gi568815587r:123626000_123830888|GENSCAN_predicted_CDS_5|531_bp
atgggcaaagacttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgac
aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactatcatcagagtg
aacagacaacctacagaatgggagaaaatttttgcaatctatccatctgacaaagggcta
atatccagaatctacaaggaacataagcacatttacaagaaaaaaacaaccccatcaaaa
aacaattcttcttccaatgtggcccagggaagccaaaagtttggacacccctgttctagg
ggctcaaagggagaagctgatccttgcctcttccagcttctggtggctgctggtgtggca
ttccccggctgtggctgcatcgctccaaacccttccttcacctttacgtggcatcctgct
tcagctgaacgtcagccagaggcagcaactgcagccgcccccatctcagctttccggcca
cctcccaggggacgtgggcagggaggccacgtggcaggaaaagcgacctag

>gi568815587r:123626000_123830888|GENSCAN_predicted_peptide_6|738_aa
MATAVEPEDQDLWEEEGILMVKLEDDFTCRPESVLQRDDPVLETSHQNFRRFRYQEAASP
REALIRLRELCHQWLRPERRTKEQILELLVLEQFLTVLPGELQSWVRGQRPESGEEAVTL
VEGLQKQPRRPRRWASSPKISSRDNQELPPDSMVTGSWNYSQVTVHVHGQEVLSEETVHL
GVEPESPNELQDPVQSSTPEQSPEETTQSPDLGAPAEQRPHQEEELQTLQESEVPVPEDP
DLPAERSSGDSEMVALLTALSQVCPSYLCTTENLFEEPLGISHTKQGQNTRLPVASRFDE
ILMWFQGLVTFKDVAVCFSQDQWSDLDPTQKEFYGEYVLEEDCGIVVSLSFPIPRPDEIS
QVREEEPWVPDIQEPQETQEPEILSFTYTGDRSKDEEECLEQEDLSLEDIHRPVLGEPEI
HQTPDWEIVFEDNPGRLNERRFGTNISQVNSFVNLRETTPVHPLLGRHHDCSVCGKSFTC
NSHLVRHLRTHTGEKPYKCMECGKSYTRSSHLARHQKVHKMNAPYKYPLNRKNLEETSPV
TQAERTPSVEKPYRCDDCGKHFRWTSDLVRHQRTHTGEKPFFCTICGKSFSQKSVLTTHQ
RIHLGGKPYLCGECGEDFSEHRRYLAHRKTHAAEELYLCSECGRCFTHSAAFAKHLRGHA
SVRPCRCNECGKSFSRRDHLLYISPRQVQGTTNEKISGRGLQQLLGAWVHLRNLGNRSTE
KQFHGKRAVGQLNNPSKQ

>gi568815587r:123626000_123830888|GENSCAN_predicted_CDS_6|2217_bp
atggctacagccgtggaaccagaggaccaggatctttgggaagaagagggaattctgatg
gtgaaactggaagatgatttcacctgtcggccagagtctgtcttacagagggatgacccg
gtgctggaaacctcccaccagaacttccgacgcttccgctaccaggaggcagcaagccct
agagaagctctcatcagactccgagaactttgtcaccagtggctgagaccagagaggcgg
acaaaggagcagatcctagagctgcttgtgctggaacaatttcttaccgtcctacctgga
gaactacagagctgggtgcggggccaacggccagaaagtggcgaggaggcagtgacgctg
gtggagggtttgcagaaacaacccaggagaccaaggcggtgggcatcttctcctaaaata
agctcccgtgacaaccaagaacttcctcctgactccatggtgactggaagttggaattat
tcccaggtgactgtccatgttcacggccaggaagtcctgtcagaggagacggtgcattta
ggagtggagcctgagtcacctaatgagctgcaggatcctgtgcaaagctcgacccccgag
cagtctcctgaggaaaccacacagagcccagatctgggggcaccggcagagcagcgtcca
caccaggaagaggagctccagaccctgcaggagagcgaggtcccagtgcccgaggaccca
gaccttcctgcagagaggagctctggagactcagagatggttgctcttcttactgctctg
tcacaggtgtgccctagttacctctgtaccacagagaatttgtttgaagaaccactgggc
ataagccatactaaacagggacaaaatacgaggctacccgtagcatcacgttttgatgaa
atccttatgtggtttcagggactggtaacgttcaaggatgtggccgtatgcttttcccag
gaccagtggagtgatctggacccaacacagaaagagttctatggagaatatgtcttggaa
gaagactgtggaattgttgtctctctgtcatttccaatccccagacctgatgagatctcc
caggttagagaggaagagccttgggtcccagatatccaagagcctcaggagactcaagag
ccagaaatcctgagttttacctacacaggagataggagtaaagatgaggaagagtgtctg
gagcaggaagatctgagtttggaggatatacacaggcctgttttgggagaaccagaaatt
caccagactccagattgggaaatagtctttgaggacaatccaggtagacttaatgaaaga
agatttggtactaatatttctcaagtgaatagttttgtgaaccttcgggaaactacaccc
gtccaccccctgttagggaggcatcatgactgttctgtgtgtggaaagagcttcacttgt
aactcccaccttgttagacacctgaggactcacacaggagagaaaccctataaatgtatg
gaatgtggaaaaagttacacacgaagctcacatcttgccaggcaccaaaaggttcacaag
atgaacgcgccttacaaatatcccctaaaccggaagaatttggaagagacctcccctgtg
acacaggctgagagaactccatcagtggagaaaccctatagatgtgatgattgcggaaag
cacttccgctggacttcagaccttgtcagacatcagaggacacatactggagaaaaaccc
ttcttttgtactatttgtggcaaaagcttcagccagaaatctgtgttaacaacacaccaa
agaatccacctgggaggcaaaccctacttgtgtggagagtgtggtgaggacttcagtgaa
cacaggcggtacctggcgcaccggaagacgcacgctgctgaggaactctacctctgcagc
gagtgcgggcgctgcttcacccacagcgcagcgttcgccaagcacttgagaggacacgcc
tcagtgaggccctgccgatgcaacgaatgtgggaagagcttcagtcgcagggaccacctc
ttgtacatctctcctagacaagtccaaggaactactaacgagaagatttcaggaagaggc
ctacagcaattgcttggtgcttgggttcatttgcggaatcttggcaacaggtctacagag
aagcagttccacggcaaaagagctgtggggcagttgaataatccatccaaacaatga

>gi568815587r:123626000_123830888|GENSCAN_predicted_peptide_7|123_aa
MEDEDTDQEFQNAGVYAGGFQTGPNITVEMTDNIIATEWQLDEQHRLTKDNGEAHHPGAQ
GQLQAEFAGHDGGVVKGIADGEVAVKRHDSEDQELGGAHEEVEEGLQQAAGHADYCSCHY
KGS

>gi568815587r:123626000_123830888|GENSCAN_predicted_CDS_7|372_bp
atggaggatgaggacactgaccaggagttccaaaatgctggtgtctatgcaggcggcttt
caaactgggcccaacatcacagtagaaatgactgataacattattgccacagaatggcaa
ctggatgagcagcatcgtctgacaaaagacaatggtgaagcccaccacccaggagctcag
ggccagctgcaggcagagtttgctggtcatgatggtggggtggtgaaggggattgcagat
ggtgaggtagcggtcaaaagacatgatagtgaggatcaagaactcggtggtgcccacgaa
gaagtggaagaaggcctgcagcaggcagcaggacatgcagattactgttcttgccactac
aaaggttcctag

>gi568815587r:123626000_123830888|GENSCAN_predicted_peptide_8|74_aa
MRKFLKGTKQGHSGSSCFRPALPDGLQLCNTVNLLIASPLIGKGDWCGQEEIRSHLLPVI
LCAKERRMNENYLL

>gi568815587r:123626000_123830888|GENSCAN_predicted_CDS_8|225_bp
atgaggaaatttttgaagggcaccaagcaaggacattcaggcagctcatgcttcaggcct
gcactccctgatggcttacagttgtgcaacacagtgaacttgctaattgcttctcctctg
attggaaaaggagattggtgtggccaagaagaaattcggtcacatctcctacctgttatc
ctgtgcgcaaaggaaagaagaatgaatgaaaactatcttttatag

>gi568815587r:123626000_123830888|GENSCAN_predicted_peptide_9|605_aa
MARELRDACTSFNSRCNQVEEKVSVIEHQINEIKQEDKVREKRVKRNEQSLQEIWDCVKR
PNLCLIGVPESDGENGTKLENTLQDIIQENFPNLARQANIQIQEIQRTPQRYSSRREIPR
HITVRFTKVEIKEKMLRAAREKDRSMRQKVNKDIQDLNSALHQADLIDIYRTLHPKSSEY
TFFSAPHCTYSKIGHIIGSEALLSKCKRTEITTNRLSDHSAIILELRTKKLTQNRTTTWK
LNNLLLNDYWVNNEMKAGIKIFFETNENKDTTYQNLWDTFKAGNPAHHIVVVMGNWSTVT
EITLIAFPALLEIRISLFVVLVVTYTLTATGNITIISLIWIDHRLQTPMYFFLSNLSFLD
ILYTTVITPKLLACLLGEEKTISFAGCMIQTYFYFFLGTVEFILLAVMSFDRYMAICDPL
HYTVIMNSRACLLLVLGCWVGAFLSVLFPTIVVTRLPYCRKEINHFFCDIAPLLQVACIN
THLIEKINFLLSALVILSSLAFTTGSYVYIISTILRIPSTQGRQKAFSTCASHITVVSIA
HGSNIFVYVRPNQNSSLDYDKVAAVLITVVTPLLNPFIYSLRNEKVQEVLRETVNRIMTL
IQRKT

>gi568815587r:123626000_123830888|GENSCAN_predicted_CDS_9|1818_bp
atggcacgagaacttcgtgacgcatgcacaagcttcaatagccgatgcaatcaagtggaa
gaaaaggtatcagtgattgaacatcaaattaatgaaataaagcaagaagataaagttaga
gaaaaaagagtaaaaagaaatgaacaaagcctccaagaaatatgggactgtgtgaaaaga
ccaaatctatgtttgattggtgtacctgaaagtgatggggagaatggaaccaagttggaa
aacactcttcaggatattatccaggagaacttccccaacctagcaaggcaggccaacatt
caaattcaggaaatacagagaacaccacaaagatactcctcgagaagagaaatcccaaga
cacataactgtcagattcaccaaggttgaaatcaaggaaaaaatgttaagggcagccaga
gagaaagacagatcgatgagacagaaggttaataaggatatccaggacttgaactcagct
ctgcaccaagcagacctaatagacatctatagaactctccaccccaaatcatcagaatat
acattcttctcagcaccacattgcacctattctaaaattggccacataattggaagtgaa
gcactcctcagcaaatgtaaaagaacagaaatcacaacaaaccgtctctcagaccacagt
gcaatcatattagaactcaggactaagaaactcactcaaaaccgcacaactacatggaaa
ctgaacaacctgctcctgaatgactactgggtgaataatgaaatgaaggcaggaataaag
attttctttgaaaccaatgaaaacaaagacacaacataccagaatctctgggacacattt
aaagcaggaaaccctgcccaccatatagtagttgtcatgggaaactggagcactgtgact
gaaatcaccctaattgccttcccagctctcctggagattcgaatatctctcttcgtggtt
cttgtggtaacttacacattaacagcaacaggaaacatcaccatcatctccctgatatgg
attgatcatcgcctgcaaactccaatgtacttcttcctcagtaatttgtcctttctggat
atcttatacaccactgtcattaccccaaagttgttggcctgcctcctaggagaagagaaa
accatatcttttgctggttgcatgatccaaacatatttctacttctttctggggacggtg
gagtttatcctcttggcggtgatgtcctttgaccgctacatggctatctgcgacccactg
cactacacggtcatcatgaacagcagggcctgccttctgctggttctgggatgctgggtg
ggagccttcctgtctgtgttgtttccaaccattgtagtgacaaggctaccttactgtagg
aaagaaattaatcatttcttctgtgacattgcccctcttcttcaggtggcctgtataaat
actcacctcattgagaagataaactttctcctctctgcccttgtcatcctgagctccctg
gcattcactactgggtcctacgtgtacataatttctaccatcctgcgtatcccctccacc
cagggccgtcagaaagctttttctacctgtgcttctcacatcactgttgtctccattgcc
cacgggagcaacatctttgtgtatgtgagacccaatcagaactcctcactggattatgac
aaggtggccgctgtcctcatcacagtggtgacccctctcctgaacccttttatctacagc
ttgaggaatgagaaggtacaggaagtgttgagagagacagtgaacagaatcatgaccttg
atacaaaggaaaacttga