GENSCAN 1.0 Date run: 24-Oct-119 Time: 21:54:39 Sequence gi568815581f:16281908_16482594 : 200687 bp : 48.86% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4703 4901 199 0 1 -16 36 168 0.086 -0.39 1.02 Intr + 17981 18071 91 2 1 68 95 71 0.109 5.80 1.03 Intr + 31640 31707 68 1 2 112 64 51 0.221 2.80 1.04 Intr + 34774 34805 32 1 2 129 116 -5 0.700 4.17 1.05 Intr + 35868 36001 134 1 2 102 100 203 0.995 23.26 1.06 Term + 43893 43991 99 2 0 94 48 81 0.979 2.83 1.07 PlyA + 45896 45901 6 1.05 2.06 PlyA - 49510 49505 6 1.05 2.05 Term - 61034 60910 125 0 2 78 49 178 0.954 11.45 2.04 Intr - 62804 62690 115 2 1 58 109 80 0.994 7.12 2.03 Intr - 66778 66709 70 0 1 58 86 65 0.729 2.48 2.02 Intr - 68122 68024 99 0 0 59 89 52 0.712 1.63 2.01 Init - 71529 71120 410 0 2 94 70 552 0.984 50.03 2.00 Prom - 77296 77257 40 -3.86 3.00 Prom + 92721 92760 40 -4.66 3.01 Init + 97175 97207 33 1 0 107 77 75 0.141 6.23 3.02 Term + 99995 100690 696 1 0 67 31 1342 0.168 119.95 3.03 PlyA + 100811 100816 6 1.05 4.04 PlyA - 101575 101570 6 1.05 4.03 Term - 109706 109588 119 0 2 105 47 168 0.962 13.10 4.02 Intr - 123422 123372 51 0 0 108 63 20 0.287 0.38 4.01 Init - 125186 125108 79 2 1 58 89 38 0.404 0.78 4.00 Prom - 127685 127646 40 -8.66 5.00 Prom + 128767 128806 40 -4.96 5.01 Init + 132552 132555 4 2 1 97 68 0 0.276 -0.94 5.02 Intr + 133865 133934 70 0 1 103 60 43 0.694 1.24 5.03 Intr + 135794 135961 168 2 0 62 94 174 0.700 14.46 5.04 Intr + 138208 138341 134 1 2 53 74 233 0.999 18.69 5.05 Intr + 140692 140982 291 2 0 117 109 323 0.997 34.41 5.06 Intr + 141562 141860 299 2 2 93 83 323 0.992 28.89 5.07 Intr + 144192 144362 171 2 0 30 105 226 0.969 18.64 5.08 Intr + 144815 144978 164 1 2 81 41 202 0.014 13.57 5.09 Intr + 146910 147075 166 0 1 62 100 209 0.982 19.46 5.10 Intr + 149877 149943 67 2 1 68 71 99 0.911 4.68 5.11 Intr + 150059 150393 335 0 2 92 50 681 0.810 59.99 5.12 Intr + 151667 151791 125 1 2 106 109 150 0.999 18.28 5.13 Intr + 154324 154409 86 1 2 66 55 122 0.724 6.16 5.14 Intr + 157420 157507 88 2 1 69 106 45 0.603 3.43 5.15 Intr + 158278 158346 69 1 0 65 99 59 0.377 3.00 5.16 Term + 159167 159173 7 2 1 97 54 0 0.337 -5.16 5.17 PlyA + 159271 159276 6 1.05 6.04 PlyA - 159665 159660 6 -0.45 6.03 Term - 162224 161681 544 1 1 111 44 704 0.996 62.04 6.02 Intr - 166053 165938 116 0 2 93 105 129 0.868 14.35 6.01 Init - 166364 166362 3 2 0 91 101 0 0.846 1.60 6.00 Prom - 170847 170808 40 -4.86 7.00 Prom + 172179 172218 40 -5.66 7.01 Init + 173226 173236 11 2 2 41 77 15 0.037 -4.89 7.02 Term + 174054 174633 580 0 1 63 35 486 0.259 34.76 7.03 PlyA + 176984 176989 6 1.05 8.04 PlyA - 178851 178846 6 -0.45 8.03 Term - 180479 180210 270 2 0 135 51 293 0.510 25.78 8.02 Intr - 181700 181607 94 1 1 69 43 75 0.840 1.07 8.01 Init - 182759 182698 62 2 2 57 100 64 0.883 5.22 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 100001 100690 690 1 0 82 31 1336 0.821 123.50 S.002 Term + 144815 144982 168 1 0 81 51 196 0.980 13.08 S.003 Init + 145581 145640 60 2 0 46 109 129 0.980 10.54 S.004 Term + 154882 154982 101 2 2 71 45 71 0.885 -0.61 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:16281908_16482594|GENSCAN_predicted_peptide_1|207_aa XRDFEFGAFSRGINTVFAKYTPGKESSGFYESHLKDWQELASSVVFLASKIVVSPQSQTR NAIMLMRDFPDDPGMQWDTEHVARVLLQHIEVNGINLVVTFDAGGVSGHSNHIALYAAVR ALHSEGKLPKGCSVLTLQSVNVLRKYISLLDLPLSLLHTQDVLFVLNSKEVAQAKKAMSC HRSQLLWFRRLYIIFSRYMRINSLSFL >gi568815581f:16281908_16482594|GENSCAN_predicted_CDS_1|624_bp nggcgggacttcgagtttggggccttttccagaggtattaacactgtgtttgctaaatac accccagggaaggagtcgtcgggcttctacgagtcacacttaaaagactggcaagagttg gcttcttctgttgtgttcttagcttccaaaatagtagtctctcctcagagccagacgcgc aatgccatcatgttaatgagggatttcccagatgacccaggcatgcagtgggacacagag cacgtggccagagtcctccttcagcacatagaagtgaatggcatcaatctggtggtgact ttcgatgcagggggagtaagtggccacagcaatcacattgctctgtatgcagctgtgagg gccctgcactcagaagggaagttacctaaagggtgctctgtgctcacgcttcagtctgtg aatgtgctgcgcaagtacatctcccttctggatctgcccttgtctctgcttcatacgcag gatgtcctcttcgtgctcaacagcaaagaagtggcacaggccaagaaagccatgtcctgc caccgcagccagctcctctggttccgccgcctctacattatcttctcccggtacatgaga atcaactcactgagcttcctctga >gi568815581f:16281908_16482594|GENSCAN_predicted_peptide_2|272_aa MRRSRSSAAAKLRGQKRSGASAAPAASAAAALAPSATRTRRSASQAGSKSQAVEKPPSEK PRLRRSSPRAQEEGPGEPPPPELALLPPPPPPPPTPATPTSSASNLDLGEQRERWETFQK RQKLTSEGAAKLLLDTFEYQGLVKHTGGCHCGAVRFEVWASADLHIFDCNCSICKKKQNR HFIVPASRFKLLKGAEHITTYTFNTHKAQHTFCKRCGVQSFYTPRSNPGGFGIAPHCLDE GTVRSMVTEEFNGSDWEKAMKEHKTIKNMSKE >gi568815581f:16281908_16482594|GENSCAN_predicted_CDS_2|819_bp atgcggcgatcgaggagctctgcggccgccaagctgcgcgggcagaagcggtccggggcc tccgcggcccccgcggcctccgcggccgctgccttggcacccagcgccacccgcacacgg cgctccgctagccaggccgggagcaagagccaggcggtggagaagccgccgtcggagaag ccgcggctgaggcgctcgtcgccgcgggcccaggaggagggcccgggggagccgccgccg cctgagctggcgttgctcccgccaccgccgccgccgccgccgactcccgcgaccccgacg tcctcggcgtccaacctggacctgggcgagcagcgggagcgctgggagacgttccagaag cggcagaagcttacctccgagggtgccgccaagctcctgctagacacctttgaataccag ggcctggtgaagcacacaggaggctgccactgtggagcagttcgttttgaagtttgggcc tcagcagacttgcatatatttgactgcaattgcagcatttgcaagaagaagcagaataga cacttcattgttccagcttctcgcttcaagctcctgaagggagctgagcacataacgact tacacgttcaatactcacaaagcccagcataccttctgtaagagatgtggcgttcagagc ttctatactccacgatcaaaccccggaggcttcggaattgccccccactgcctggatgag ggcactgtgcggagtatggtcactgaggaattcaatggcagcgattgggagaaggccatg aaagagcacaagaccatcaagaacatgtctaaagagtga >gi568815581f:16281908_16482594|GENSCAN_predicted_peptide_3|242_aa MGSSLASLGLVVKMQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAG KQLEDGRTLSDYNIQKESTLHLVLRLRGGMQIFVKTLTGKTITLEVEPSDTIENVKAKIQ DKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGMQIFVKTLTGKTITL EVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRG GC >gi568815581f:16281908_16482594|GENSCAN_predicted_CDS_3|729_bp atgggctccagcctcgcctctctgggcctggtggtcaaaatgcagatcttcgtgaaaacc cttaccggcaagaccatcacccttgaggtggagcccagtgacaccatcgaaaatgtgaag gccaagatccaggataaggaaggcattccccccgaccagcagaggctcatctttgcaggc aagcagctggaagatggccgtactctttctgactacaacatccagaaggagtcgaccctg cacctggtcctgcgtctgagaggtggtatgcagatcttcgtgaagaccctgaccggcaag accatcaccctggaagtggagcccagtgacaccatcgaaaatgtgaaggccaagatccag gataaagaaggcatccctcccgaccagcagaggctcatctttgcaggcaagcagctggaa gatggccgcactctttctgactacaacatccagaaggagtcgaccctgcacctggtcctg cgtctgagaggtggtatgcagatcttcgtgaagaccctgaccggcaagaccatcactctg gaggtggagcccagtgacaccatcgaaaatgtgaaggccaagatccaagataaagaaggc atcccccccgaccagcagaggctcatctttgcaggcaagcagctggaagatggccgcact ctttctgactacaacatccagaaagagtcgaccctgcacctggtcctgcgcctgaggggt ggctgttaa >gi568815581f:16281908_16482594|GENSCAN_predicted_peptide_4|82_aa MQADGGTTAFPSGLEGGGRPGSALWSVYKTGEKGESKKLERNRSPAEERAAMLALHLPET LGQFTFQSHIVTAEVEAQRDAS >gi568815581f:16281908_16482594|GENSCAN_predicted_CDS_4|249_bp atgcaggcggacggcgggaccacagcatttccctccggcttggagggcggaggccgaccc gggagcgcgctctggagcgtttataagacaggagaaaagggagaaagcaaaaagttggaa agaaacagaagtcctgcagaagagcgggctgccatgctggctttgcatcttccagagacg cttggccagttcaccttccagagccacatcgtcacggctgaagtagaagcccagagagat gccagttga >gi568815581f:16281908_16482594|GENSCAN_predicted_peptide_5|747_aa MAPATEKLRDPSSRHALASACGVSRLETLDGGQEDGSEADRGKLDFGSGLPPMESQFQGE DRKFAPQIRVNLNYRKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLT DSEYTEGSTGKTCLMKAVLNLKDGVNACILPLLQIDRDSGNPQPLVNAQCTDDYYRGHSA LHIAIEKRSLQCVKLLVENGANVHARACGRFFQKGQGTCFYFGELPLSLAACTKQWDVVS YLLENPHQPASLQATDSQGNTVLHALVMISDNSAENIALVTSMYDGLLQAGARLCPTVQL EDIRNLQDLTPLKLAAKEGKIEIFRHILQREFSGLSHLSRKFTEWCYGPVRVSLYDLASV DSCEENSVLEIIAFHCKSPHRHRMVVLEPLNKLLQAKWDLLIPKFFLNFLCNLIYMFIFT AVAYHQPTLKKARALFQALLTVVSQVLCFLAIEWYLPLLVSALVLGWLNLLYYTRGFQHT GIYSVMIQKVILRDLLRFLLIYLVFLFGFAVALVSLSQEAWRPEAPTGPNATESVQPMEG QEDEGNGAQYRGILEASLELFKFTIGMGELAFQEQLHFRGMVLLLLLAYVLLTYILLLNM LIALMSETVNSVATDSWSIWKLQKAISVLEMENGYWWCRKKQRAGVMLTVGTKPDGSPDE RWCFSPANVDKNHMQPTLHNASHYLEGRTAQNGGPHLPEPGVCEGRDPVPRPTVGVCKPE RTGLQIREESASCLAAEYWSQEPAMRF >gi568815581f:16281908_16482594|GENSCAN_predicted_CDS_5|2244_bp atggcccctgctactgagaagctccgggatcccagcagccgccacgccctggcctcagcc tgcggggtaagtcggttggagacattagatggaggccaagaagatggctctgaggcggac agaggaaagctggattttgggagcgggctgcctcccatggagtcacagttccagggcgag gaccggaaattcgcccctcagataagagtcaacctcaactaccgaaagggaacaggtgcc agtcagccggatccaaaccgatttgaccgagatcggctcttcaatgcggtctcccggggt gtccccgaggatctggctggacttccagagtacctgagcaagaccagcaagtacctcacc gactcggaatacacagagggctccacaggtaagacgtgcctgatgaaggctgtgctgaac cttaaggacggagtcaatgcctgcattctgccactgctgcagatcgacagggactctggc aatcctcagcccctggtaaatgcccagtgcacagatgactattaccgaggccacagcgct ctgcacatcgccattgagaagaggagtctgcagtgtgtgaagctcctggtggagaatggg gccaatgtgcatgcccgggcctgcggccgcttcttccagaagggccaagggacttgcttt tatttcggtgagctacccctctctttggccgcttgcaccaagcagtgggatgtggtaagc tacctcctggagaacccacaccagcccgccagcctgcaggccactgactcccagggcaac acagtcctgcatgccctagtgatgatctcggacaactcagctgagaacattgcactggtg accagcatgtatgatgggctcctccaagctggggcccgcctctgccctaccgtgcagctt gaggacatccgcaacctgcaggatctcacgcctctgaagctggccgccaaggagggcaag atcgagattttcaggcacatcctgcagcgggagttttcaggactgagccacctttcccga aagttcaccgagtggtgctatgggcctgtccgggtgtcgctgtatgacctggcttctgtg gacagctgtgaggagaactcagtgctggagatcattgcctttcattgcaagagcccgcac cgacaccgaatggtcgttttggagcccctgaacaaactgctgcaggcgaaatgggatctg ctcatccccaagttcttcttaaacttcctgtgtaatctgatctacatgttcatcttcacc gctgttgcctaccatcagcctaccctgaagaaggcaagggccctgttccaggccctgctc acagtggtgtcccaggtgctgtgtttcctggccatcgagtggtacctgcccctgcttgtg tctgcgctggtgctgggctggctgaacctgctttactatacacgtggcttccagcacaca ggcatctacagtgtcatgatccagaaggtcatcctgcgggacctgctgcgcttccttctg atctacttagtcttccttttcggcttcgctgtagccctggtgagcctgagccaggaggct tggcgccccgaagctcctacaggccccaatgccacagagtcagtgcagcccatggaggga caggaggacgagggcaacggggcccagtacaggggtatcctggaagcctccttggagctc ttcaaattcaccatcggcatgggcgagctggccttccaggagcagctgcacttccgcggc atggtgctgctgctgctgctggcctacgtgctgctcacctacatcctgctgctcaacatg ctcatcgccctcatgagcgagaccgtcaacagtgtcgccactgacagctggagcatctgg aagctgcagaaagccatctctgtcctggagatggagaatggctattggtggtgcaggaag aagcagcgggcaggtgtgatgctgaccgttggcactaagccagatggcagccccgatgag cgctggtgcttcagtcctgctaatgtggacaaaaaccacatgcagcccacgctacacaat gcctcccactacctggaaggccgcactgctcagaatggagggccacatctgccagagcct ggagtctgcgaaggccgggacccggttccccggcccacagtgggggtgtgcaaacccgag agaactgggttgcaaattcgtgaagaatcagcatcatgtttggcagctgagtattggagc caggagcctgccatgaggttttga >gi568815581f:16281908_16482594|GENSCAN_predicted_peptide_6|220_aa MEGDALGAMEKLCRQLTYHLSPHSQWRRHRGLVKRKPQACLKAVLAGSPPDNTVDLSGIP LTSRDLERVTSYLQRCGEQVDSVELGFTGLTDDMVLQLLPALSTLPRLTTLALNGNRLTR AVLRDLTDILKDPSKFPNVTWIDLGNNVDIFSLPQPFLLSLRKRSPKQGHLPTILELGEG PGSGEEVREGTVGQEDPGGGPVAPAEDHHEGKETVAAAQT >gi568815581f:16281908_16482594|GENSCAN_predicted_CDS_6|663_bp atggaaggcgatgcactgggcgccatggagaagctgtgccggcagctgacataccacctc agcccccactcccagtggaggcggcaccgggggctggtgaaaaggaagccacaggcctgc ctcaaggctgtcctggccggaagccccccagacaacacagtggacctgtcgggaatccca ctgacctcccgagacctggagcgggtgaccagctacctacagcgctgtggggagcaggta gacagcgtggagctgggcttcacaggcctcacggacgacatggtcctgcagctgctgcca gcactcagcaccctgccccgcctcaccacactggcactcaatggcaaccggttgacccgg gccgtgctgcgcgacctcactgacatccttaaggatcccagcaagttccccaatgtcacg tggattgacctgggcaacaacgtggacatcttctccttgccccagcccttcctgctcagc ctgcgcaagcgctccccaaagcagggccacctacccaccatcctggagctgggtgagggc ccaggcagtggggaggaggtccgggaagggacagtaggccaggaggaccctggagggggc cctgtggcacctgccgaagaccaccatgagggcaaggagactgtagctgcagctcagacg tga >gi568815581f:16281908_16482594|GENSCAN_predicted_peptide_7|196_aa MLARTPPGLPPAENRTRQHWPRSGAVRRWGTGGGRGGEGGRGRGRGGGGRGGGGGGVAVV GGGGRGGRGGGRGGVGGGRGGRRRKRRRKRRKKRRKEEKKEEDQEEEEEEEEEEEKEEEE EEEEKKEEEEEEKEEEEEEKEEEEEEKKEKEEEEEEEEEQKKEEEEEEEEEEKEKEEEEE EEEEEEAGLDTETHIP >gi568815581f:16281908_16482594|GENSCAN_predicted_CDS_7|591_bp atgttggccagaacccctccagggctccctcctgcagagaacaggactaggcagcactgg cccagaagtggagcagtaagaaggtggggaacggggggagggagaggaggagaaggagga agaggaagagggagaggaggaggaggaaggggagggggaggaggaggagtagcagtagta ggaggaggaggaagaggagggagaggaggaggaagaggaggggtaggaggaggaagagga ggaaggaggaggaagaggaggaggaagaggaggaagaagaggagaaaggaggagaagaag gaagaggatcaggaagaggaggaggaggaagaggaggaggaagagaaggaagaggaggag gaggaagaggagaagaaggaagaggaggaggaagagaaggaagaggaggaggaagagaag gaagaggaggaggaagagaagaaggaaaaggaggaagaggaggaggaggaagaggagcag aagaaggaagaggaggaggaagaggaagaagaggagaaggagaaagaagaggaggaggaa gaggaagaggaggaggaagctggacttgacactgagacacacataccctaa >gi568815581f:16281908_16482594|GENSCAN_predicted_peptide_8|141_aa MAFDTFSIGSVVQEIENPCRRALDPSHLGWEPFSGLCWGSCGSPQNEDSLQWDLGMESTS LDDVLYRYASFRNLVDPITHDLIISLARYIHCPKPVGWLWSEPARSRGVRWVPFWAEQLY AQGGLADRPKEDKPVLQAPRA >gi568815581f:16281908_16482594|GENSCAN_predicted_CDS_8|426_bp atggcttttgacacattctcaattgggtctgtggtccaggagatcgagaacccctgcaga agggcactggaccccagccacctgggctgggagcctttctctgggctgtgctggggcagc tgtgggagcccccagaatgaggacagcctgcagtgggacctgggcatggagtcgacctcg ctagacgacgttctgtatcgctacgccagcttccggaacctggtggaccccatcacacac gacctcatcatcagcctggcacgctacatccactgtcccaagccggtagggtggctgtgg tccgagccagccaggtcccggggtgtgcggtgggtgcctttctgggcagagcagctgtat gcccaaggaggactggcagacaggcccaaggaggacaagcccgtcctccaggcaccaaga gcttga