GENSCAN 1.0 Date run: 6-Nov-116 Time: 01:53:17 Sequence gi568815578r:50904042_51110338 : 206297 bp : 45.67% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 9919 10198 280 0 1 79 54 164 0.191 9.15 1.02 Term + 17474 17685 212 0 2 49 55 66 0.340 -3.14 1.03 PlyA + 19921 19926 6 1.05 2.08 PlyA - 20944 20939 6 1.05 2.07 Term - 26701 26525 177 1 0 13 52 121 0.547 -1.31 2.06 Intr - 26955 26785 171 0 0 51 121 146 0.760 14.34 2.05 Intr - 32221 32107 115 0 1 62 69 187 0.909 14.65 2.04 Intr - 38101 37990 112 2 1 60 96 30 0.040 0.44 2.03 Intr - 44621 44588 34 2 1 84 97 24 0.087 0.70 2.02 Intr - 51244 51145 100 0 1 47 87 106 0.912 6.51 2.01 Init - 54482 54322 161 2 2 106 59 255 0.995 23.70 2.00 Prom - 54650 54611 40 -6.36 3.00 Prom + 54676 54715 40 -16.02 3.01 Sngl + 54802 56184 1383 0 0 108 51 1639 0.999 158.30 3.02 PlyA + 57022 57027 6 1.05 4.05 PlyA - 57241 57236 6 1.05 4.04 Term - 60808 60710 99 1 0 109 36 40 0.183 -0.97 4.03 Intr - 65506 65430 77 2 2 89 45 28 0.055 -2.17 4.02 Intr - 72284 72141 144 0 0 22 63 90 0.048 0.15 4.01 Init - 77471 77423 49 2 1 72 105 68 0.543 8.13 4.00 Prom - 96349 96310 40 -5.16 5.03 PlyA - 98379 98374 6 1.05 5.02 Term - 100765 99998 768 1 0 115 53 1623 0.930 154.91 5.01 Init - 106297 105524 774 1 0 86 110 1957 0.994 192.40 5.00 Prom - 107073 107034 40 -6.66 6.00 Prom + 114546 114585 40 -4.26 6.01 Sngl + 118523 119137 615 1 0 79 53 238 0.056 14.93 6.02 PlyA + 120616 120621 6 1.05 7.13 PlyA - 122729 122724 6 1.05 7.12 Term - 126797 126697 101 0 2 112 50 70 0.641 3.89 7.11 Intr - 131577 131555 23 2 2 105 119 6 0.525 2.59 7.10 Intr - 136267 136254 14 1 2 105 95 16 0.306 -2.42 7.09 Intr - 136632 136520 113 1 2 39 116 73 0.792 5.30 7.08 Intr - 136897 136741 157 1 1 71 20 85 0.437 -0.42 7.07 Intr - 147847 147755 93 1 0 82 81 79 0.710 6.76 7.06 Intr - 150492 150307 186 0 0 -15 86 117 0.248 1.09 7.05 Intr - 164149 164034 116 2 2 82 88 29 0.048 2.47 7.04 Intr - 168890 168723 168 0 0 106 111 -30 0.045 1.02 7.03 Intr - 174521 174438 84 0 0 71 74 69 0.123 3.59 7.02 Intr - 178795 178734 62 0 2 71 94 -5 0.067 -3.22 7.01 Init - 179187 179090 98 0 2 55 88 76 0.159 4.08 7.00 Prom - 179493 179454 40 -2.56 8.04 PlyA - 179733 179728 6 1.05 8.03 Term - 187264 187187 78 1 0 83 50 90 0.881 2.46 8.02 Intr - 188483 188451 33 2 0 108 116 23 0.801 5.52 8.01 Init - 189897 189814 84 0 0 99 50 30 0.788 1.12 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 38085 37990 96 2 0 82 96 36 0.844 2.92 S.002 Init - 43128 43058 71 0 2 81 70 99 0.888 5.92 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:50904042_51110338|GENSCAN_predicted_peptide_1|163_aa TAVCAGQNETVKALLGKGAQVNAVNQNGSTPLHHAASKNRHEIALMLLEGGANPDGKDHY EATAKHQATAKGNFKMIHILLYYKASTIIQDTEGRFSDYKTGAHNWKTATVLLLILILSV FRFQARILAVTTRISSTPVEPVVFRLNSSKSLRYVYKQREDDA >gi568815578r:50904042_51110338|GENSCAN_predicted_CDS_1|492_bp actgctgtttgtgctggccagaatgagactgtaaaagcccttctgggaaaaggtgctcaa gtgaatgctgtcaatcaaaatggctctactcccctacatcatgcagcttccaaaaacagg catgagattgctctcatgttactagaaggtggggctaatccagatggtaaggatcattat gaggctacagcaaagcaccaggccacagccaagggtaacttcaagatgattcatatcctt ctgtactacaaagcatccacaatcatccaagacactgagggaagattttcagactacaag actggtgcccacaactggaaaactgctactgtgttgctgcttatcctgatactttccgta ttcagatttcaggcaagaattctagcagtgaccaccagaatttccagtacaccagtagaa ccagttgttttcaggctgaactcctctaagagtttaagatatgtatacaaacagagggaa gatgatgcctga >gi568815578r:50904042_51110338|GENSCAN_predicted_peptide_2|289_aa MASLEVSRSPRRSRRELEVRSPRQNKYSVLLPTYNERENLPLIVWLLVKSFSESGINYEI IIIDDGSPDGTRDVAEQLEKIYGSDRILLRPREKKLGLANLFYRKQKEGNFDIVSGTRYK GNGGVYGWDLKRKIIRLYRKEVLEKLIEKCVSKGYVFQMEMIVRARQLNYTIGEHPRPRR EPESAEPERDEAPGAPSPLPPPPCRRHPPAAAAAVRPPSTPAPRAPRGRVKPGPGRGRIV VRIAALQPHTRSFAHPGADGSPVPARTATRLCSGGPGASSCSPTPFSAL >gi568815578r:50904042_51110338|GENSCAN_predicted_CDS_2|870_bp atggcctccttggaagtcagtcgtagtcctcgcaggtctcggcgggagctggaagtgcgc agtccacgacagaacaaatattcggtgcttttacctacctacaacgagcgcgagaacctg ccgctcatcgtgtggctgctggtgaaaagcttctccgagagtggaatcaactatgaaatt ataatcatagatgatggaagcccagatggaacaagggatgttgctgaacagttggagaag atctatgggtcagacagaattcttctaagaccacgagagaaaaagttgggactagctaat ttattctacaggaagcaaaaggagggtaattttgatattgtctctggaactcgctacaaa ggaaatggaggtgtatatggctgggatttgaaaagaaaaataatcagattataccgaaaa gaagttctagagaaattaatagaaaaatgtgtttctaaaggctacgtcttccagatggag atgattgttcgggcaagacagttgaattatactattggcgagcacccgcggccgcggcgc gagccggagtccgccgagccggagcgcgacgaggccccgggcgcgccctccccgctgccg ccaccgccgtgccgccgccatccgcccgccgccgccgccgctgtccggcccccgagcacg ccggccccgcgcgcgcctcgaggccgagtcaagcctggccccgggcgcgggcgcattgtt gttcgcatcgccgccctccagccgcacacacgctcctttgcacaccccggcgccgacggg tcccccgtcccggcacggaccgctacccgactctgctccggaggtccaggcgcctcctcc tgcagcccgacccccttctccgccctgtga >gi568815578r:50904042_51110338|GENSCAN_predicted_peptide_3|460_aa MASREEVLALQAEVAQREEELNSLKQKLASALLAEQEPQPERLVPVSPLPPKAALSRDEI LRYSRQLVLPELGVHGQLRLGTACVLIVGCGGLGCPLAQYLAAAGVGRLGLVDYDVVEMS NLARQVLHGEALAGQAKAFSAAASLRRLNSAVECVPYTQALTPATALDLVRRYDVVADCS DNVPTRYLVNDACVLAGRPLVSASALRFEGQITVYHYDGGPCYRCIFPQPPPAETVTNCA DGGVLGVVTGVLGCLQALEVLKIAAGLGPSYSGSLLLFDALRGHFRSIRLRSRRLDCAAC GERPTVTDLLDYEAFCGSSATDKCRSLQLLSPEERVSVTDYKRLLDSGAFHLLLDVRPQV EVDICRLPHALHIPLKHLERRDAESLKLLKEAIWEEKQGTQEGAAVPIYVICKLGNDSQK AVKILQSLSAAQELDPLTVRDVVGGLMAWAAKIDGTFPQY >gi568815578r:50904042_51110338|GENSCAN_predicted_CDS_3|1383_bp atggcttcccgggaggaggtactcgccttacaagctgaagttgcccaacgtgaggaggaa ttgaattcgctgaagcagaagctggcgtcggctcttttggctgagcaggaaccgcagcca gaacggctggttccggtgtcgccgctgccgccgaaggccgctctgtcccgagatgagatt ctgcgctatagccggcagctagtgctgcccgagctgggcgtgcacggacagctgcgcctg gggaccgcgtgcgtgctaatcgtgggctgcggtgggctcggctgtccactagcgcagtac ttggcagcggccggcgtgggccgccttggccttgtggactatgacgtggtagagatgagc aacctggcccgccaagtgctgcatggcgaggcactggctggccaggccaaggccttttcg gccgccgcctcgctgcgccgcctcaattcggcagtggaatgcgtgccgtacactcaggcc cttacgccagccactgccctagacctggtccgccgatatgatgtggtggctgactgctcg gacaacgtgcccactcgctacctggttaatgacgcatgtgtgctggcgggtcggcccctc gtgtctgccagtgccttgcgcttcgagggccaaatcacagtctaccattatgacggtggc ccttgctatcgctgcatattcccccaaccacccccagcggagacagtgaccaactgcgcg gacggcggggtgctcggtgtcgttaccggggtcctgggctgcctgcaggccttggaagtg ctgaaaatcgctgcgggtctgggcccctcttacagtggcagcttgttgctctttgatgcc ctgagagggcatttccgctctattcggctgcggagccgcaggctcgactgtgcagcttgc ggggaacggcccactgtgactgatctgctggactatgaagccttctgtggctcctcagcc actgataaatgccgctccctgcaactactgagcccagaggagcgtgtttctgtcaccgac tataagcgactgctggattctggggcattccacctgttgctggacgtcaggcctcaggtg gaggtggacatttgtcgtttgcctcatgccctacacatccctctgaaacatttggaacgc agggatgcggagagcctgaaactcttaaaagaagcaatctgggaagagaagcagggcaca caagaaggggctgctgtccccatttatgtgatttgcaaactgggaaatgactcacagaaa gccgtgaagatcctccagtccttatcagcagctcaagagttagaccctttaacagttcgg gatgttgtggggggcctcatggcctgggctgccaaaatcgatggaacatttccacagtac tga >gi568815578r:50904042_51110338|GENSCAN_predicted_peptide_4|122_aa MARPEKCIPAVLKELTEPPLPNLQAMGLDGADPIYPTPSPDPGMGLWLRPGQSEYPSPAR YSVPNWPGGGGLSAEQQGGSVPLVDLQTGTDVRLYGQDAQSVLFVVVSLCVRHQYLTICN EH >gi568815578r:50904042_51110338|GENSCAN_predicted_CDS_4|369_bp atggcccgacctgagaaatgcatccctgccgtgctgaaggagctcaccgaaccccctctc cccaacttgcaggccatgggactggatggtgctgaccccatttacccaacccccagtcca gatccagggatgggcctatggctgaggcctggccaatcagaataccccagccctgctcgc tacagtgtgcccaattggcctgggggtggggggttgtctgctgagcagcagggcggctct gtccctctggtggacctgcagactgggacagatgtaaggttgtatggacaagacgcccag tctgtcttgttcgtggttgtgtccctctgtgtcaggcatcagtacctgaccatttgtaat gagcattaa >gi568815578r:50904042_51110338|GENSCAN_predicted_peptide_5|513_aa MTLLPGDNSDYDYSALSCTSDASFHPAFLPQRQAIKGAFYRRAQRLRPQDEPRQGCQPED RRRRIIINVGGIKYSLPWTTLDEFPLTRLGQLKACTNFDDILNVCDDYDVTCNEFFFDRN PGAFGTILTFLRAGKLRLLREMCALSFQEELLYWGIAEDHLDGCCKRRYLQKIEEFAEMV EREEEDDALDSEGRDSEGPAEGEGRLGRCMRRLRDMVERPHSGLPGKVFACLSVLFVTVT AVNLSVSTLPSLREEEEQGHCSQMCHNVFIVESVCVGWFSLEFLLRLIQAPSKFAFLRSP LTLIDLVAILPYYITLLVDGAAAGRRKPGAGNSYLDKVGLVLRVLRALRILYVMRLARHS LGLQTLGLTARRCTREFGLLLLFLCVAIALFAPLLYVIENEMADSPEFTSIPACYWWAVI TMTTVGYGDMVPRSTPGQVVALSSILSGILLMAFPVTSIFHTFSRSYLELKQEQERVMFR RAQFLIKTKSQLSVSQDSDILFGSASSDTRDNN >gi568815578r:50904042_51110338|GENSCAN_predicted_CDS_5|1542_bp atgaccctcttaccgggagacaattctgactacgactacagcgcgctgagctgcacctcg gacgcctccttccacccggccttcctcccgcagcgccaggccatcaagggcgcgttctac cgccgggcgcagcggctgcggccgcaggatgagccccgccagggctgtcagcccgaggac cgccgccgtcggatcatcatcaacgtaggcggcatcaagtactcgctgccctggaccacg ctggacgagttcccgctgacgcgcctgggccagctcaaggcctgcaccaacttcgacgac atcctcaacgtgtgcgatgactacgacgtcacctgcaacgagttcttcttcgaccgcaac ccgggggccttcggcactatcctgaccttcctgcgcgcgggcaagctgcggctgctgcgc gagatgtgcgcgctgtccttccaggaggagctgctgtactggggcatcgcggaggaccac ctggacggctgctgcaagcgccgctacctgcagaagattgaggagttcgcggagatggtg gagcgggaggaagaggacgacgcgctggacagcgagggccgcgacagcgagggcccggcc gagggcgagggccgcctggggcgctgcatgcggcgactgcgcgacatggtggagaggccg cactcggggctgcctggcaaggtgttcgcctgcctgtcggtgctcttcgtgaccgtcacc gccgtcaacctctccgtcagcaccttgcccagcctgagggaggaggaggagcagggccac tgttcccagatgtgccacaacgtcttcatcgtggagtcggtgtgcgtgggctggttctcc ctggagttcctcctgcggctcattcaggcgcccagcaagttcgccttcctgcggagcccg ctgacgctgatcgacctggtggccatcctgccctactacatcacgctgctggtggacggc gccgccgcaggccgtcgcaagcccggcgcgggcaacagctacctggacaaggtggggctg gtgctgcgcgtgctgcgggcgctgcgcatcctgtacgtgatgcgcctggcgcgccactcc ctggggctgcagacgctggggctcacggcccgccgctgcacccgcgagttcgggctcctg ctgctcttcctctgcgtggccatcgccctcttcgcgcccctgctctacgtcatcgagaac gagatggccgacagccccgagttcaccagcatccctgcctgctactggtgggctgtcatc accatgacgacggtgggctatggcgacatggtccccaggagcaccccgggccaggtagtg gccctgagcagcatcctgagcggcatcctgctcatggccttcccagtcacctccatcttc cacaccttctcccgctcctacctggagctcaagcaggagcaagagagggtgatgttccgg agggcgcagttcctcatcaaaaccaagtcgcagctgagcgtgtcccaggacagtgacatc ttgttcggaagtgcctcctcggacaccagagacaataactga >gi568815578r:50904042_51110338|GENSCAN_predicted_peptide_6|204_aa MRPRRGHRGGERPGLRIPVTPWDEPKPDPHLGRNKRAPRRHAPAVTPESSVALASPQAGF EAPAGSSGPRPRAAAPEPEFAGSGSGSRFGGARRGCSAQSSHLKFPSRRPRLGVGPAAAS VPPGCPRVRGEDARGAWKSPGEPALPPGAAAKLRSRGPAQLPPSEPRPPRLCLPVPAGLR VRRERGAAAAEGGSEAGGGTEGGD >gi568815578r:50904042_51110338|GENSCAN_predicted_CDS_6|615_bp atgcgcccccgccggggtcaccgcggcggcgaacgtcccggacttcgcatcccggtgacc ccttgggatgaaccgaagccggacccgcaccttggccgtaacaagcgcgctccacgccgc cacgcaccagcggtgactcccgagtcctctgtcgccctggcctccccgcaggcgggtttc gaggcccccgcagggagttcagggccacggccacgtgcagccgcccccgagcccgagttt gcaggctcggggtccggttcgcgcttcggtggcgcccggcgaggctgcagcgcgcaaagt tcccacctcaagttcccgtcgcggcgtccccgcctcggagttggcccggccgcggcttcc gttccccccggctgcccacgggtccggggcgaggacgcgagaggggcctggaagtcgccc ggggagcccgctctgcctcccggagccgccgccaaacttcggtcccggggcccggctcag ctcccgccctcggagccacggccgccccgtctctgcctcccggtccccgccgggctccga gtgcgccgggagcgcggcgcagcagccgccgagggagggagcgaggcaggaggcgggacg gagggaggggactga >gi568815578r:50904042_51110338|GENSCAN_predicted_peptide_7|404_aa MFLHQGHSFSKCEHPPQARQEVLETGYRDEQDRSPTCACLSFPLLELMEPHTTEQVNPRA GLTCGGKDGMIYQRNTQAQISGQAPALLPRRSSVEENTVVTVTYEAPPLYPPSGHVSIHG PSQQRVARLPLLIHQAPDNNDKLGGEGQNYMKVLESQQKQASSNEELILGRKERQENTRK GKSAALEKGIDLTGSYRHCQAVYDGVTKRTVQTASKANAGHSEPDCVSLEPALPLDSKLL ICKAPGFPQCINKPDESSWTKALSEKEAKTLFSLGTAINSWISSRFGFCWKNSEKGEGGK RVGARRGMAVGVELALKMGPLVHQTWPVYWIEQSSPKEIDFSAFECDLIGNKVFADLTKV YERSLPVSLFPGTKQGGKIPEHSTQLSTALRLAAASEINGPVSC >gi568815578r:50904042_51110338|GENSCAN_predicted_CDS_7|1215_bp atgtttctgcatcagggccactctttcagcaagtgtgagcacccaccacaggccaggcaa gaggtgctggagacaggataccgtgatgaacaggacaggagccccacctgtgcctgtttg tccttcccccttttggaactgatggagcctcacaccacagaacaagtcaaccctagagct gggttgacttgtgggggaaaggacggcatgatctaccagcgcaacacgcaggctcagatt tcaggccaggcccctgccctattacccaggaggtcttcagtggaagagaatactgttgtt actgtgacatatgaagcaccccctctttacccaccctctggacacgtttcaatccatggt cccagccagcagagggtagcccggcttcctctactcatccatcaggctccagataacaat gataaactcgggggtgaggggcaaaactacatgaaggtactggaaagccaacaaaagcag gcaagctctaacgaagagctgatacttggaagaaaggaaaggcaagagaacaccaggaag ggaaagagcgcagctcttgaaaagggtatcgacctcactgggagttacaggcactgccag gcggtttatgatggtgtgacaaagcgaacggttcagacggccagcaaggcaaacgcagga cactcggagcctgactgcgtgagcttggaaccagcactgccacttgacagcaagcttctt atctgcaaagctcctggcttccctcaatgtatcaacaagcctgatgaatccagctggact aaggcactctctgaaaaagaggccaagacactgttcagcttaggcacagctataaacagc tggatttcatcaaggtttgggttttgctggaagaattcagaaaaaggagaagggggcaag agagttggtgcacgcaggggcatggctgtgggtgtggaactagcgttgaagatgggcccg cttgttcatcagacctggcctgtgtactggattgaacagagttcccccaaagagatagat ttcagtgcctttgaatgtgaccttattggaaacaaggtctttgcagatttaaccaaggtc tatgagagaagccttcccgtgtcactctttcctggaacgaaacagggtggcaaaatccca gagcattccactcagctgtcgacagccctgcgactggctgcagcctcggaaataaatggt cccgtcagctgctag >gi568815578r:50904042_51110338|GENSCAN_predicted_peptide_8|64_aa MKRGVLDLVQERIQGKSIEYSASKFIRNPPPLNKEKVWKGCCEDECDAVCRSPTQGKCSL SSAE >gi568815578r:50904042_51110338|GENSCAN_predicted_CDS_8|195_bp atgaagagaggggtcttggaccttgtgcaagaaagaattcagggcaagtccatagagtac agtgcaagcaagtttattaggaacccaccaccactgaacaaagaaaaagtctggaagggc tgctgtgaggacgagtgtgatgctgtttgtaggagcccgacacaagggaagtgctcgctg agctctgcagaatga