GENSCAN 1.0 Date run: 5-Nov-116 Time: 23:21:56 Sequence gi568815591r_77919318 : 200120 bp : 36.62% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 607847 607842 6 1.05 1.03 Term - 620717 620433 285 0 0 12 49 197 0.128 2.62 1.02 Intr - 635354 635250 105 0 0 42 116 30 0.012 0.69 1.01 Init - 654267 654103 165 1 0 37 23 182 0.068 6.30 1.00 Prom - 660286 660247 40 -3.95 2.03 PlyA - 660419 660414 6 1.05 2.02 Term - 671198 670895 304 2 1 44 48 283 0.419 13.56 2.01 Init - 695810 695668 143 0 2 77 74 73 0.517 4.45 2.00 Prom - 699343 699304 40 -2.95 3.04 PlyA - 699431 699426 6 -4.04 3.03 Term - 699686 699637 50 1 2 107 38 59 0.318 -0.71 3.02 Intr - 707922 707803 120 2 0 83 121 102 0.663 12.55 3.01 Init - 710261 710228 34 0 1 98 61 9 0.400 -0.72 3.00 Prom - 723484 723445 40 -2.75 4.00 Prom + 738589 738628 40 -5.45 4.01 Sngl + 749538 750410 873 0 0 30 43 506 0.976 35.99 4.02 PlyA + 750462 750467 6 1.05 5.00 Prom + 750771 750810 40 -11.34 5.01 Sngl + 750983 751447 465 2 0 60 39 232 0.840 11.39 5.02 PlyA + 753019 753024 6 1.05 6.00 Prom + 760172 760211 40 -4.75 6.01 Init + 766982 767030 49 2 1 65 113 21 0.313 3.46 6.02 Intr + 774898 775007 110 0 2 108 94 -18 0.050 -0.12 6.03 Intr + 786271 786345 75 1 0 63 23 119 0.036 1.69 6.04 Intr + 798960 799078 119 0 2 1 105 123 0.059 3.64 6.05 Term + 799166 799343 178 0 1 38 37 129 0.539 -1.02 6.06 PlyA + 799400 799405 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r_77919318|GENSCAN_predicted_peptide_1|184_aa MKGFKEIVLLRPIYQATRKAAIFEWGPEEEKALQQVQAAALPLGPYDPADQMVLEGGYLL NEGPMTCFRGRLENSSWDLWPPSGEMGEKKRHLLAGGQPTQAITATHKRITLLQERRKQQ LIPPPVTSWRTRGPESVHVKPSLSAQSAFKKTSTVNKTTKDPHSPLHSLATSARAGVGIH GRET >gi568815591r_77919318|GENSCAN_predicted_CDS_1|555_bp atgaagggcttcaaagaaattgtgttactccggcccatttatcaagcgacccgaaaggct gctatttttgaatggggcccagaggaggagaaggctctgcaacaggtccaggctgctgct ctgccacttgggccatatgacccagcagatcaaatggtgcttgagggagggtacctcttg aatgagggtcctatgacctgcttcaggggaaggttagaaaattcttcctgggatttatgg ccgccttccggggagatgggcgagaagaagcgccacctcctggctggaggccaaccaact caagccattacagcaactcataaaagaataaccctgctccaagaaaggagaaaacagcag ctaattccaccaccagtaacatcctggcgaacccgaggccctgagtctgtccacgtgaaa ccttcactgtcagcacaatcagcattcaagaaaaccagcacagtaaataaaacaactaag gaccctcacagtcctcttcactcccttgctacctctgccagagcaggtgttggtatccat ggccgagagacctga >gi568815591r_77919318|GENSCAN_predicted_peptide_2|148_aa METPGDRHIDSLTVKEDREKVQKVDRGMRKLSHPWSFLQWKCLSVWQSLFTASTPTLLFR KCLKLNVKNFPNVLGSPRDKAVEFVKMTQAPGVMPVILDETGDQLQDQTVLLKPSTPEVT RSQSPSMEKKAELAFQGTQLLSDLCLDG >gi568815591r_77919318|GENSCAN_predicted_CDS_2|447_bp atggagactcctggagacaggcacattgacagcctgacagtgaaggaggacagagagaaa gtacagaaagttgacagaggaatgcgaaaattgagtcatccatggtcattcttacagtgg aagtgcttatcagtctggcagagtttattcacagccagcacacctacattgctattccgc aaatgcctgaagctaaatgtgaaaaattttcccaatgtcctgggatcaccaagggacaaa gctgtggaatttgtaaagatgacccaagctcctggagttatgccagtcatccttgatgag actggagatcagctccaggaccaaactgttcttttgaaaccctccacaccagaggtcaca agaagccaatctcctagcatggagaagaaggcagaattagcgttccagggcactcagctc ctcagtgacttgtgcttggatggctga >gi568815591r_77919318|GENSCAN_predicted_peptide_3|67_aa MGVNYEWGNHEGTTRPHKEGEVPGVDYIFITVEDFMELEKSGALLESGTYEAILEYAIHD DYSHHAE >gi568815591r_77919318|GENSCAN_predicted_CDS_3|204_bp atgggggttaattatgagtggggtaatcatgaaggcaccacaaggccacataaggagggt gaggtccctggagtggattatattttcatcactgttgaagattttatggaattggagaaa agtggtgctctcctagaaagtgggacttatgaagcgattttggagtatgcaatccatgat gactatagtcaccatgctgagtaa >gi568815591r_77919318|GENSCAN_predicted_peptide_4|290_aa MFFETNENKDTTYQNLWDAFKAVRRGKFIALNAHKRKKERSKIDTLTSQLKELEKQGQTH SKANRRQEITKIRAELKQIETQKTFQKINESRSWFFERINKIDRPIARLIKKKRVKNQID EIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLNTYTLPRLNQEEVESLNR PITGSEIVAIINSLPAKKSPGPDGFTAKFYQRYKEELVPLLLKLFQSIEKEGILPNSFYE ASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIQKAYPP >gi568815591r_77919318|GENSCAN_predicted_CDS_4|873_bp atgttctttgaaaccaatgagaacaaagacacaacataccagaatctctgggacgcattc aaagcagtgcgcagagggaaatttatagcactaaatgcccacaagagaaagaaggaaaga tccaaaattgataccctaacatcacaattaaaagaactagaaaagcaagggcaaacacat tcaaaagctaacagaaggcaagaaataactaagatccgagcagaactgaagcaaatagag acacaaaaaacctttcaaaaaattaatgaatccaggagctggttttttgaaaggatcaac aaaattgatagaccgatagcaagactaataaagaagaaaagagtgaagaatcaaatagat gaaataaaaaatgataaaggggatatcaccaccgatcccacagaaatacaaactaccatc agagaatactacaaacacctctatgcaaataaactagaaaatctagaagaaatggataaa ttcctcaacacatacactctcccaagactaaaccaggaagaagttgaatctctgaataga ccaataacaggctctgaaattgtggcaataatcaatagcttaccagccaaaaagagtcca ggaccagatggattcacagccaaattctaccagaggtacaaggaggagctggtaccatta cttctgaaactattccaatcaatagaaaaagagggaatcctccctaactcattttatgag gccagcatcatcctgataccaaagccgggcagagacacaacaaaaaaagagaattttaga ccaatatccttgatgaatattgatgcaaaaatcctcaataaaatactggcaaaccgaatc cagcagcacattcaaaaagcttatccaccatga >gi568815591r_77919318|GENSCAN_predicted_peptide_5|154_aa MSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLCNEIKDDTNKWKNVQCSWAGRINIM KMAILPKVIYRFNAIPIKLPMTFFTELENITLKFIWNQKRARIAKSILSQKNKAGGITLP DFKLNYKARVTKTAWYRYQNRDIDQWNRTEPSES >gi568815591r_77919318|GENSCAN_predicted_CDS_5|465_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactgtgcaatgaaataaaa gacgatacaaacaaatggaagaacgttcaatgctcatgggcaggaagaatcaatatcatg aaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctacca atgactttcttcacagaattggaaaacattactttaaagttcatatggaaccaaaaaaga gcccgcattgccaagtcaatcctaagccaaaagaacaaagccggaggcatcacgctacct gacttcaaactcaactacaaggctagagtaaccaaaacagcatggtaccggtaccaaaac agagatatagatcaatggaacagaacagagccctcagaaagttga >gi568815591r_77919318|GENSCAN_predicted_peptide_6|176_aa MIFSDNSKWLLTLSRSRPGLPKNHSQFLLAKSYSSFQPNYTPTHLTHMYNHGKWTEKKLI PVDCEEADSHFVCKPGVWSHQVGAHTVHLSTLSSRPGVLNPWAMDWYHPIKNWATRQRSV AALDSHRSRNPILNSECEGCRLSAPYENLMPDDVSLSPITPRWDHLVAGKQAQGSP >gi568815591r_77919318|GENSCAN_predicted_CDS_6|531_bp atgattttctctgacaactccaaatggcttctgacactttcccgttcacgtccaggtctc cctaaaaatcattctcagtttttgttagcaaagtcatatagttcttttcagccaaattat acacctacacacctaacacatatgtacaatcatggaaagtggactgagaagaagctgatt ccagtggactgtgaagaagctgattctcattttgtttgcaaacctggtgtttggtcacac caagtaggagcacataccgtccatctcagcactttgtcctctagaccaggggtcctcaac ccctgggccatggactggtaccatcctattaagaactgggccacacggcagagatcagtg gcagcattagactctcatagaagcaggaaccctatcttgaacagtgaatgtgaggggtgt aggttgtccgctccttatgagaatctaatgcctgatgatgtgtcactgtcccccatcacc cccagatgggaccatctagttgcaggaaaacaagctcagggctccccttga GENSCAN 1.0 Date run: 5-Nov-116 Time: 23:21:58 Sequence gi568815591r_77919318 : 200302 bp : 35.41% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1434037 1434167 131 1 2 82 70 106 0.177 7.69 1.02 Intr + 1434212 1434329 118 0 1 79 72 115 0.325 8.12 1.03 Term + 1457195 1457265 71 2 2 27 39 125 0.054 -1.38 1.04 PlyA + 1457400 1457405 6 1.05 2.00 Prom + 1465118 1465157 40 -2.85 2.01 Sngl + 1474532 1474927 396 0 0 65 42 186 0.869 7.70 2.02 PlyA + 1475196 1475201 6 1.05 3.08 PlyA - 1476342 1476337 6 1.05 3.07 Term - 1513365 1513150 216 2 0 75 48 117 0.356 2.56 3.06 Intr - 1533430 1533252 179 1 2 54 72 112 0.449 4.92 3.05 Intr - 1534037 1533667 371 0 2 65 44 459 0.557 33.02 3.04 Intr - 1534922 1534757 166 2 1 93 68 214 0.560 18.00 3.03 Intr - 1558281 1558189 93 0 0 48 49 111 0.091 2.22 3.02 Intr - 1559018 1558901 118 1 1 89 66 33 0.293 0.32 3.01 Init - 1562251 1562207 45 0 0 78 121 31 0.575 6.13 3.00 Prom - 1567168 1567129 40 -4.75 4.03 PlyA - 1567700 1567695 6 1.05 4.02 Term - 1584401 1584333 69 1 0 112 36 85 0.872 2.66 4.01 Init - 1590915 1590859 57 2 0 76 100 58 0.788 7.16 4.00 Prom - 1592683 1592644 40 -3.25 5.04 PlyA - 1595081 1595076 6 1.05 5.03 Term - 1599787 1599656 132 0 0 99 33 77 0.179 0.41 5.02 Intr - 1600778 1600723 56 2 2 79 92 48 0.177 2.08 5.01 Init - 1605313 1605256 58 0 1 49 100 76 0.170 6.42 5.00 Prom - 1607532 1607493 40 -4.35 6.04 PlyA - 1608031 1608026 6 1.05 6.03 Term - 1610193 1610036 158 0 2 89 48 54 0.080 -1.39 6.02 Intr - 1619444 1619316 129 2 0 -14 89 138 0.173 3.45 6.01 Intr - 1626395 1626285 111 2 0 71 80 79 0.554 4.83 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r_77919318|GENSCAN_predicted_peptide_1|106_aa XMEEALLSLAVASSPRSLNQQLPELPDPRYILEPQDNPELAPALLWPPLLQGSDVSLRPA VWSSGDLPALSQGAGAGDTAEIGGNPRSSSHEDVDGWISGEGDVND >gi568815591r_77919318|GENSCAN_predicted_CDS_1|321_bp ntcatggaggaagccttgctgtctctggctgtggcatcttctcccagatcccttaatcaa caactcccagagcttccagacccaagatacattttggaaccccaggacaatcctgaactg gcacccgccctgctatggccaccactgcttcaaggcagtgatgtttctctcaggcctgct gtctggagctctggtgatcttcctgctctttcacaaggagcaggtgctggagatacagct gagattgggggtaatcctagatcatcaagccatgaagatgtcgatggttggatttcagga gaaggtgatgtcaatgactaa >gi568815591r_77919318|GENSCAN_predicted_peptide_2|131_aa MTQMSKIQLLGSALLKAQEKQSQVWVSIELLGSLALRWNCLCVQLEELMIINSSTSTHHV LLELLPSHWCTRVLLRLGATSLSPNSGSQMEVARHLTWFYADSNPGIRPKTLDVDSDMSW ATEVNSPEHVT >gi568815591r_77919318|GENSCAN_predicted_CDS_2|396_bp atgacccagatgtccaaaatacaacttttaggctccgctctgcttaaagcccaagaaaaa cagagccaggtctgggtgtctatagagctactagggagcctagccttaagatggaattgt ttgtgtgtccagcttgaagagttaatgatcatcaactcatcaacatcaactcatcatgta ctgttagaactgctgccaagtcactggtgtacaagagttcttctaagacttggagccaca agcttgtctcccaattcaggatcccagatggaagtagcccggcacctcacctggttctat gcagactctaaccctggtattagacctaagactctagacgttgattctgacatgtcttgg gcaactgaagtgaactctcctgaacatgtgacctaa >gi568815591r_77919318|GENSCAN_predicted_peptide_3|395_aa MRTIKDLVYDMEFSKDGSVVSGSATGSWCPNSFYSTDHNVWVLPAAKYFSLHTAPSRAQL PARCRYPQNFIAYGMSSGPHKNLNQGSRARTLAEPEMWHQARHKSRSRYFCSIGAQVGGS QRVLLAQEPGARLGRTEAEERSPGTEATRPTAMSKSLKKKSHWTSKVHESVIGRNPEGQL GFELKGGAENGQFPYLGEVKPGKVAYESGSKLVSEELLLEVNETPVAGLTIRDVLAVIKH CKDPLRLKCVKQGESSGLLSVLPGELLGQIECASVASPEADEELLGKGGRKKRKKEWEEG RERRKEQLQGAAASPAGLGREGGGGLSEAVPLTQQAALLGSIPSRELVLVFSEKQGKPQA SLTYHVVEMQPSHFFPKYYEQLHSGSHLQNSSAKK >gi568815591r_77919318|GENSCAN_predicted_CDS_3|1188_bp atgaggacaataaaagaccttgtatatgacatggaatttagcaaggatgggagtgtggtt tctgggtctgctactggctcttggtgccctaatagcttttattccacagaccataatgta tgggttctgccagctgctaagtatttctctttgcatactgcacctagcagagcacagctg ccagcaagatgtcggtaccctcaaaacttcattgcttatggaatgtcgtctggaccacac aagaatcttaatcagggatcccgcgccaggacgctcgcagagcccgagatgtggcaccag gcgcgccacaagtccaggtcccgctatttctgctccatcggagcccaggtcggagggagt cagcgcgtcctcctcgcacaggagcctggcgcgcggctcggtcgcacagaggctgaagaa aggagcccaggaactgaggcgactcgccccactgccatgtccaaaagcttgaaaaagaaa agccactggactagcaaagtccatgagagtgtcattggcaggaacccggagggccagctg ggctttgaactgaaggggggcgccgagaatggacagttcccctacctgggggaggtgaag cccggcaaggtggcctatgagagcggcagcaaattggtgtcggaggagctgctgctggag gtgaacgagacccccgtggcggggctcaccatcagggacgtgctggccgtgatcaaacac tgcaaggaccccctccggctcaagtgtgtcaagcaaggtgagagcagcggcttgctcagt gttttgccgggcgagctgctggggcaaattgaatgtgcgtcagtggcatcacccgaagcg gatgaagagcttttggggaagggaggaaggaagaaaagaaagaaagaatgggaggaaggg agggaaagaaggaaggagcaacttcagggtgcagcggcatcaccagcgggtctgggaaga gaaggtggtgggggtctttctgaggctgtccccttaacccaacaggcagccctcctgggc agcatcccttcaagagaacttgtacttgttttttctgaaaagcaaggaaaacctcaggca tccctgacctaccatgtagtggaaatgcagccatctcacttctttcccaaatattatgag cagctccactcaggttcccaccttcagaattcttctgccaagaagtag >gi568815591r_77919318|GENSCAN_predicted_peptide_4|41_aa MKCCIATEKVLDIFEDLQKSTQCEDDEDEDLYDGLLLFHEL >gi568815591r_77919318|GENSCAN_predicted_CDS_4|126_bp atgaagtgttgcattgctacagagaaggtcttggacatctttgaagatctgcagaagtct actcaatgtgaagatgatgaagatgaagacctttatgatggtctacttttatttcatgaa ttgtaa >gi568815591r_77919318|GENSCAN_predicted_peptide_5|81_aa MERIADKIDIPETPGAMEKDYKPEHSHLAAKTIPAVYYAQGKGLMRIQGSCLQAREEPSP ETETYFVMAVQIDQYRPQGPK >gi568815591r_77919318|GENSCAN_predicted_CDS_5|246_bp atggaaaggattgctgataagattgacatacctgagacaccaggagcaatggagaaagat tacaagcctgagcatagtcatttagcagctaaaacaataccagcagtttactatgcacag gggaaaggccttatgaggatacaaggtagctgtctgcaagccagggaagagccctcacca gaaactgaaacctattttgttatggccgtccaaatagaccaatacaggccacaaggccct aaatag >gi568815591r_77919318|GENSCAN_predicted_peptide_6|132_aa XYKYCQYYHNHLPLNSVFCVRNKKVVSGTMRHGIPVAAQLVPAPCKQEPETNSRGVEVEF IRTKVYEKNQGDYAKYKGITSFSETLPLRSTNCDYLNEGKAIPPSSCLPAPLSSAPDRTL QASFHSALKVAA >gi568815591r_77919318|GENSCAN_predicted_CDS_6|399_bp ncatataaatattgtcagtattatcacaaccacttaccactgaattcagtgttttgtgtc aggaataaaaaggttgtatctggtacaatgcgacatggtatacccgtggcagcacagttg gtacctgctccttgcaaacaagaaccagagactaactctagaggtgtagaggtagaattt atccgaacaaaagtgtatgagaaaaatcagggggattatgcaaaatataaaggaataaca agtttctcagagaccctcccactgagatctactaactgtgactacctaaatgagggcaaa gccatcccaccctccagctgcttacctgcccctctcagctctgcacccgaccgcactctg caggcatccttccattcagctctcaaggttgccgcatga