GENSCAN 1.0	Date run: 2-Jun-117	Time: 11:47:18

Sequence gi568815581f:56494224_56694919 : 200696 bp : 43.90% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4695   4875  181  2  1   73   91    78 0.328   5.95
 1.02 Term +  16250  17046  797  0  2   68   50   750 0.251  62.65
 1.03 PlyA +  19622  19627    6                               1.05

 2.03 PlyA -  19771  19766    6                               1.05
 2.02 Term -  38857  38741  117  1  0  -90   42   287 0.911   4.84
 2.01 Init -  43115  42993  123  2  0   70   64   135 0.959   9.47
 2.00 Prom -  74120  74081   40                              -2.46

 3.00 Prom +  90907  90946   40                              -2.76
 3.01 Sngl + 100001 100699  699  1  0   82   42  1198 0.994 108.81
 3.02 PlyA + 105456 105461    6                               1.05

 4.00 Prom + 111567 111606   40                              -4.06
 4.01 Init + 126496 126724  229  0  1   91   69    82 0.695   5.14
 4.02 Intr + 133011 133085   75  1  0   37  115    32 0.489   0.29
 4.03 Intr + 134985 135108  124  1  1  110   97    27 0.814   5.44
 4.04 Intr + 139553 139626   74  2  2   24   93    45 0.493  -2.35
 4.05 Term + 141806 141897   92  0  2   93   48    87 0.638   3.08
 4.06 PlyA + 144034 144039    6                               1.05

 5.04 PlyA - 144067 144062    6                               1.05
 5.03 Term - 154733 154677   57  2  0   99   39    84 0.768   2.29
 5.02 Intr - 169531 169366  166  0  1   72   24    96 0.013   1.56
 5.01 Init - 176100 176021   80  0  2   58   76    59 0.086   2.23
 5.00 Prom - 179069 179030   40                              -1.96

 6.02 PlyA - 179571 179566    6                               1.05
 6.01 Sngl - 180868 180608  261  1  0   42   40   361 0.872  21.86

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581f:56494224_56694919|GENSCAN_predicted_peptide_1|325_aa
MDALQYARYKQPVSGLPITKLIDPSDEQSLKKINSTSSSHIDCLPSPPPSPEMHRRKTVS
DSQPCSDEEACSEVFLPTNSDYDSSDALSPRDLDLVYLSSHDIAQQTLSGLSGSAPDVLQ
VHDVKTPLGPGQDPQGEGPNPDHSCAEFLHSLTLTGFTPKNHAKTVSGGRPPLGFLGKRK
PGKHPHYGGFSRHHRWLRIHSETQSLSLSEGIYTQHLSQACGLAQEPKEAKRAGPALDDP
RGLTLAHAASLPEERNSSLQDARPSVRRLYVEPYAAAVVAQDEKPWASLSPPSGGRITLP
SPTGPDVSQEGPTASPMSEILSSML

>gi568815581f:56494224_56694919|GENSCAN_predicted_CDS_1|978_bp
atggatgctctacagtatgcaagatacaaacaaccagtttctggcttgcccatcactaag
ctgatagacccctcagatgagcagagcctaaagaagatcaattctacatcatcatcacat
atagactgtcttccatccccacccccatccccagagatgcacagaagaaagacagtgagt
gattcacagccctgctctgatgaagaagcctgctcagaagtcttcctccccaccaacagt
gactacgactccagcgatgccctgagccccagagacctggacctggtctacctatcatct
cacgacattgcgcagcagacccttagcggcctaagcggcagcgcccccgacgtcctgcaa
gtgcacgacgtgaaaacccctctggggccgggccaggatccccagggcgagggcccaaat
cccgatcactcatgtgccgagtttctccatagcctgaccctcacggggttcacacccaag
aaccacgccaagactgtgtccggtgggcggcccccgctaggcttcctgggaaagcggaag
ccaggcaagcacccccactatggcggcttcagccgccatcatcgctggttgcgcatccac
agcgagacccagtcgctatcgctctctgagggcatttatacacagcacctgtcccaggcc
tgtggtctggcccaggagcccaaggaggccaagcgggccggccctgcccttgatgatccc
aggggcctaactctggcccacgctgccagccttcctgaggagcggaacagcagtctccag
gacgcgaggccttccgtccgccgcctctacgtggagccctacgcagcggccgtggtggcc
caggacgaaaaaccatgggcaagcttgagcccgccctctggaggccgcatcaccctgccc
agccccactggccccgatgtgagtcaggagggccccaccgcctctcccatgtcagaaata
ctcagcagcatgctttag

>gi568815581f:56494224_56694919|GENSCAN_predicted_peptide_2|79_aa
MVSKGNSGGCYQKTVRMNARKARTTNVHYTSSPSSNNQLLESEANSNYKKKRKKKKEEEE
EEEKNKKKEEEEKEKEKNR

>gi568815581f:56494224_56694919|GENSCAN_predicted_CDS_2|240_bp
atggtttccaaaggaaattcagggggctgttaccagaagacggtacgaatgaatgctcga
aaggcaagaacaacaaatgtccactacacttcatccccaagttctaacaaccaactactg
gagtctgaagccaacagcaattacaagaagaagaggaagaagaagaaggaagaggaggag
gaggaggagaagaacaagaagaaggaggaggaggagaaggagaaggagaagaatcgttaa

>gi568815581f:56494224_56694919|GENSCAN_predicted_peptide_3|232_aa
MERCPSLGVTLYALVVVLGLRATPAGGQHYLHIRPAPSDNLPLVDLIEHPDPIFDPKEKD
LNETLLRSLLGGHYDPGFMATSPPEDRPGGGGGAAGGAEDLAELDQLLRQRPSGAMPSEI
KGLEFSEGLAQGKKQRLSKKLRRKLQMWLWSQTFCPVLYAWNDLGSRFWPRYVKVGSCFS
KRSCSVPEGMVCKPSKSVHLTVLRWRCQRRGGQRCGWIPIQYPIISECKCSC

>gi568815581f:56494224_56694919|GENSCAN_predicted_CDS_3|699_bp
atggagcgctgccccagcctaggggtcaccctctacgccctggtggtggtcctggggctg
cgggcgacaccggccggcggccagcactatctccacatccgcccggcacccagcgacaac
ctgcccctggtggacctcatcgaacacccagaccctatctttgaccccaaggaaaaggat
ctgaacgagacgctgctgcgctcgctgctcgggggccactacgacccaggcttcatggcc
acctcgccccccgaggaccggcccggcgggggcgggggtgcagctgggggcgcggaggac
ctggcggagctggaccagctgctgcggcagcggccgtcgggggccatgccgagcgagatc
aaagggctagagttctccgagggcttggcccagggcaagaagcagcgcctaagcaagaag
ctgcggaggaagttacagatgtggctgtggtcgcagacattctgccccgtgctgtacgcg
tggaacgacctgggcagccgcttttggccgcgctacgtgaaggtgggcagctgcttcagt
aagcgctcgtgctccgtgcccgagggcatggtgtgcaagccgtccaagtccgtgcacctc
acggtgctgcggtggcgctgtcagcggcgcgggggccagcgctgcggctggattcccatc
cagtaccccatcatttccgagtgcaagtgctcgtgctag

>gi568815581f:56494224_56694919|GENSCAN_predicted_peptide_4|197_aa
MALSDPPFWVHMLVQSSPIAKRTDLNSQKDIVEVTACDLQGEIITDIATSTLLSRSLPMG
EACWYDTQVALWRGSCGSYNAYLKALPKRFSEIMQVQLSSTVSKPSPKVLGGQKMGWEAA
IKPDPPLRGVLAIIGLVTEHWGRATDSETLVPCMPEGGDDLGVADGEDNNSTEPNSFNKL
RKLKEHQVTQEVTSFIL

>gi568815581f:56494224_56694919|GENSCAN_predicted_CDS_4|594_bp
atggctctcagtgatccccccttctgggttcatatgctagtacagtcttctcccatagcg
aagaggacagacctgaatagccaaaaggatattgtggaagtgacagcatgtgacctccaa
ggcgagatcataacagacatcgctacttctaccttgctctctcgatcactccccatgggg
gaagcctgttggtacgacactcaagtagccctatggagaggttcttgtgggagctataat
gcctacctcaaagctttgccaaaaagattcagtgaaataatgcaagtacaattatctagc
acagtttctaaaccttcccctaaagtcctcgggggccagaagatggggtgggaagcagct
ataaaacctgacccacctctcagaggtgtgctggcaattatagggctggtgactgagcac
tggggaagagccacagactccgagacccttgttccatgtatgccagaaggtggagatgac
ttgggtgtggctgatggagaagataacaacagcacagagcccaactccttcaacaaactc
aggaaactaaaagaacaccaggtcacccaagaagtaactagcttcatcctataa

>gi568815581f:56494224_56694919|GENSCAN_predicted_peptide_5|100_aa
MERPHVGVVANGSTKILADLQHQLPDLAACGSHLRRQQAVRRTGFGPPVILLGIQSGPSG
PWSSGELDTLGTSQRNERVLAGVFGDGSIQMPAQPFECSL

>gi568815581f:56494224_56694919|GENSCAN_predicted_CDS_5|303_bp
atggagaggccacatgtgggtgttgtggccaatgggtccactaagatcttagctgacctg
cagcatcaactgcctgaccttgctgcctgtggatcccacctgaggaggcagcaggctgtg
cggaggacaggatttggcccccccgtcattctcctgggcatccagtctggtccatcaggt
ccatggagcagcggggagctggacaccctggggacaagccagagaaatgagagagtcctt
gctggggtgtttggcgatggtagcatccagatgccagcccagccctttgaatgcagcctg
tga

>gi568815581f:56494224_56694919|GENSCAN_predicted_peptide_6|86_aa
MVEREREREGRRGRRRREEEEEEEEEEEEEEEEEENKNENKNKNKKNKNENKNKNRNNKK
RRKLELPVSGQRKSTVFALDAACCTP

>gi568815581f:56494224_56694919|GENSCAN_predicted_CDS_6|261_bp
atggtggagagagagagagagagagaaggaagaagaggaagaagaagaagagaagaagaa
gaggaggaggaggaggaagaagaagaagaagaggaggaggaggagaacaagaatgagaac
aagaacaagaacaagaagaacaagaacgagaacaagaacaagaacaggaacaacaagaag
aggagaaagttagaattgcctgtctctgggcagaggaagagtacagtttttgctcttgat
gcagcctgttgtaccccatga