GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:35:39 Sequence gi568815591r:132934610_133135920 : 201311 bp : 39.74% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 4010 4304 295 2 1 40 43 309 0.533 15.39 1.02 PlyA + 5151 5156 6 1.05 2.00 Prom + 10763 10802 40 -6.55 2.01 Sngl + 19954 20205 252 0 0 117 42 165 0.126 9.64 2.02 PlyA + 20978 20983 6 1.05 3.04 PlyA - 21066 21061 6 1.05 3.03 Term - 39346 39248 99 1 0 82 43 80 0.912 0.05 3.02 Intr - 40677 40560 118 2 1 69 105 142 0.981 13.55 3.01 Init - 49530 49316 215 0 2 97 63 96 0.642 6.26 3.00 Prom - 56606 56567 40 -6.65 4.00 Prom + 56761 56800 40 -3.35 4.01 Init + 59763 59865 103 2 1 74 82 72 0.490 5.65 4.02 Term + 62055 62311 257 1 2 34 39 203 0.479 4.76 4.03 PlyA + 62357 62362 6 1.05 5.00 Prom + 67999 68038 40 -4.75 5.01 Init + 72591 72645 55 2 1 48 116 21 0.410 2.40 5.02 Term + 76417 76544 128 2 2 39 54 156 0.947 4.76 5.03 PlyA + 77179 77184 6 1.05 6.00 Prom + 77597 77636 40 -4.55 6.01 Init + 81910 81957 48 0 0 56 74 32 0.621 -0.40 6.02 Term + 82066 82239 174 0 0 71 48 146 0.928 5.68 6.03 PlyA + 82769 82774 6 1.05 7.00 Prom + 83570 83609 40 -6.25 7.01 Init + 85338 85419 82 2 1 58 84 39 0.807 1.68 7.02 Term + 86031 86140 110 1 2 32 49 156 0.765 3.79 7.03 PlyA + 86957 86962 6 1.05 8.02 PlyA - 88178 88173 6 1.05 8.01 Sngl - 101311 99998 1314 1 0 108 47 1538 0.999 147.50 8.00 Prom - 118546 118507 40 -4.75 9.00 Prom + 121231 121270 40 -5.25 9.01 Init + 125765 125930 166 1 1 71 110 76 0.531 7.94 9.02 Term + 133957 134108 152 2 2 88 44 96 0.450 2.29 9.03 PlyA + 134251 134256 6 1.05 10.09 PlyA - 135061 135056 6 1.05 10.08 Term - 135620 135516 105 2 0 84 28 184 0.770 9.43 10.07 Intr - 147355 147248 108 1 0 65 100 144 0.117 12.86 10.06 Intr - 153547 153490 58 0 1 132 85 -2 0.004 1.87 10.05 Intr - 162097 162070 28 2 1 97 87 36 0.050 0.66 10.04 Intr - 164236 164203 34 1 1 78 64 46 0.007 -1.92 10.03 Intr - 183155 182884 272 0 2 58 57 189 0.687 9.14 10.02 Intr - 183600 183450 151 0 1 -11 91 159 0.414 5.11 10.01 Intr - 198335 198103 233 0 2 117 27 126 0.100 5.97 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 155319 155191 129 0 0 86 11 135 0.950 5.80 S.002 Term - 198335 198079 257 0 2 117 49 143 0.830 8.06 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:132934610_133135920|GENSCAN_predicted_peptide_1|98_aa XKSAYRMEIGMQESYPHKPQSKLTALLHWAHTLNLQCVPDTGNKQTGKEKVPRTFRSLSP GEMRRRVLQEEMRLRGLHNPLTPLGVKLSACLGAGDTW >gi568815591r:132934610_133135920|GENSCAN_predicted_CDS_1|297_bp nngaaaagtgcctatcgaatggaaattggaatgcaagaaagctatccccataagccacag tccaaactgacagctctcctccactgggctcacactttaaacctgcagtgtgtccccgac accggcaacaagcaaacaggaaaggagaaggttcccaggacgtttagatctctgagccct ggggagatgaggagaagagtgctccaagaggagatgcggctgcgaggtctacacaaccca ctgaccccactgggtgtgaagttgagtgcatgtttgggagcaggggacacttggtag >gi568815591r:132934610_133135920|GENSCAN_predicted_peptide_2|83_aa MGPPQPSTWAVFSISTKDGLYNVQRQGEPTAAGPIPQMVQFPTFHKILPRKATRTAAVPL RGQVEQISPFCMHSTSLAICLYL >gi568815591r:132934610_133135920|GENSCAN_predicted_CDS_2|252_bp atggggccaccacagccctcaacctgggctgtcttcagcatctcaacaaaagacggactc tacaatgttcagagacaaggggagcccacagcagcaggcccaattcctcagatggtacaa ttcccaacattccacaaaatcctgcccagaaaagccaccaggacagcagcagtcccactg aggggccaagtagagcagatatctccattctgtatgcactcgacgtccctcgccatctgc ctgtacctctag >gi568815591r:132934610_133135920|GENSCAN_predicted_peptide_3|143_aa MAAVQSSFGSASGGDRGKRGRGRPWGEGDHGERETVGRGRPWGEGDRGERERRERERRSL SEKLYFQYKIKRLKQAKELDRERAAANEQLTRAILRERICSEEERAKAKHLKPRCYSSEP LVLIGAQLGTHWTGKMGPWNGEL >gi568815591r:132934610_133135920|GENSCAN_predicted_CDS_3|432_bp atggcagcagtacagtccagctttggctcggcatcagggggagaccgtggaaagagaggg agagggagaccgtggggagagggagaccatggggagagggagaccgtggggagagggaga ccgtggggagagggagaccgtggggagagggagaggagggagagggagaggaggagcctt tccgagaagttgtattttcaatacaagattaaaagactaaagcaagccaaagagctggac cgagagagggctgctgccaatgagcagttaaccagagccatccttcgggagaggatatgt agcgaggaggaacgcgctaaggcaaagcacctgaaacccagatgttactcctcggaacct ttagtcctcattggagcacagttgggaactcactggacagggaaaatgggaccatggaat ggagaactttga >gi568815591r:132934610_133135920|GENSCAN_predicted_peptide_4|119_aa MALSKHYTEVSSRLNFMASTYSMMLEKDTANCCKELAILPSPVANQQPNVEEISKGKRQR CDAEGVCSPGFRNGPFTTEQPFKKKNTVGWDAKQGEGTARLTIVKNIFGILPTCYTNVS >gi568815591r:132934610_133135920|GENSCAN_predicted_CDS_4|360_bp atggctttgagtaaacattacacagaagtaagttcaagattaaacttcatggccagcacc tattctatgatgttagagaaagacaccgccaattgctgtaaagaattggcaatccttcca agccctgtcgctaatcagcaacctaatgtggaagagataagcaaagggaagaggcagcgg tgcgatgctgagggagtgtgctcaccaggctttcgaaacggaccatttactacagagcag cctttcaagaagaaaaacaccgtggggtgggacgcaaagcaaggggaagggactgcaaga ttaactatcgttaaaaacatcttcggcatactgcctacatgctacacaaatgtttcataa >gi568815591r:132934610_133135920|GENSCAN_predicted_peptide_5|60_aa MIKHRRADQANKELTMAQGLFGKNCDSKRESAMQNMKCKYSGQGAIGTNRFGSRCGRSVQ >gi568815591r:132934610_133135920|GENSCAN_predicted_CDS_5|183_bp atgataaaacacagaagagcagaccaggctaataaagaattgacaatggcacaagggtta tttgggaaaaactgtgatagtaagagggagtcagccatgcaaaatatgaagtgcaaatat tctggacaaggagctataggcacaaacaggtttggcagccgatgtggccgaagtgttcaa tga >gi568815591r:132934610_133135920|GENSCAN_predicted_peptide_6|73_aa MGYSCLTVCSNQEKKGLEAILAKLWRSGGETGTLSEYSPVFPDPAQKDLTLFSSVPNTAL PTGSTDGFQKAQD >gi568815591r:132934610_133135920|GENSCAN_predicted_CDS_6|222_bp atgggctactcctgtttaacagtttgctccaaccaagaaaagaaggggctcgaggctata ttggcaaagctatggaggtcaggaggggaaacagggactctctctgaatattctccagta tttccagatcctgcacagaaggacctcacactcttctccagtgttcctaacacagcatta ccaacaggatcaacagatggcttccagaaggcccaggactaa >gi568815591r:132934610_133135920|GENSCAN_predicted_peptide_7|63_aa MVSYNPNTPLSITAKTDCRQRHLLWILGGSPAAVVIACTEASKFAELLSAARKNSRNKSG AIF >gi568815591r:132934610_133135920|GENSCAN_predicted_CDS_7|192_bp atggtcagctataatccaaacacaccattgagtatcacagcaaaaacagactgccgacag aggcatttactctggattctcggcggcagtcctgctgctgttgtcattgcttgcacagaa gcatcaaaatttgcagaactgctgtcagcagctcggaaaaattccaggaataagagtgga gcaatcttttaa >gi568815591r:132934610_133135920|GENSCAN_predicted_peptide_8|437_aa MAAGTLYTYPENWRAFKALIAAQYSGAQIRVLSAPPHFHFGQTNRTSEFLRKFPAGKVPA FEGDDGFCVFESNAIAYYVSNEELRGSTPEAAAQVVQWVSFADSDIVPPASTWVFPTLGI MHHNKQATENAKEEVRRILGLLDAYLKTRTFLVGERVTLADITVVCTLLWLYKQVLEPSF RRAFRNTNRWFLTCINQPQFRAVLGELKLCEKMAQFDAKKFAETQPKKDTPRKEKGSREE KQKPQAERKEEKKAAAPAPEEEMDECEQALAAEPKAKDPFAHLPKSTFVLDEFKRKYSNE DTLSVALPYFWEHFDKDGWSLWYSEYRFPEELTQTFMSCNLITGMFQRLDKLRKNAFASV ILFGTNNSSSISGVWVFRGQELAFPLSPDWQVDYESYTWRKLDPGREETQTLVREYFSWE GAFQHVGKAFNHGKIFK >gi568815591r:132934610_133135920|GENSCAN_predicted_CDS_8|1314_bp atggcggctgggaccctgtacacgtatcctgaaaactggagggccttcaaggctctcatc gctgctcagtacagcggggctcagatccgcgtgctctccgcaccaccccacttccatttt ggccaaaccaaccgcacctctgaatttcttcgcaaatttcctgccggcaaggtcccagca tttgagggtgatgatggattctgtgtgtttgagagcaacgccattgcctactatgtgagc aatgaggagctgcggggaagtactccagaggcagcagcccaggtggtgcagtgggtgagc tttgctgattccgatatagtgcccccagccagtacctgggtgttccccaccttgggcatc atgcaccacaacaaacaggccactgagaatgcaaaggaggaagtgaggcgaattctgggg ctgctggatgcttacttgaagacgaggacttttctggtgggcgaacgagtgacattggct gacatcacagttgtctgcaccctgttgtggctctataagcaggttctagagccttctttc cgccgggcctttcgcaataccaaccgctggttcctcacctgcattaaccagccccagttc cgggctgtcttgggggaactgaaactgtgtgagaagatggcccagtttgatgctaaaaag tttgcagagacccagcctaaaaaggacacaccacggaaagagaagggttcacgggaagag aagcagaagccccaggctgagcggaaggaggagaaaaaggcggctgcccctgctcctgag gaggagatggatgaatgtgagcaggcgctggctgctgagcccaaggccaaggaccccttc gctcacctgcccaagagtacctttgtgttggatgaatttaagcgcaagtactccaatgag gacacactctctgtggcactgccatatttctgggagcactttgataaggacggctggtcc ctgtggtactcagagtatcgcttccctgaagaactcactcagaccttcatgagctgcaat ctcatcactggaatgttccagcgactggacaagctgaggaagaatgccttcgccagtgtc atcctttttggaaccaacaatagcagctccatttctggagtctgggtcttccgaggccag gagcttgcctttccgctgagtccagattggcaggtggactacgagtcatacacatggcgg aaactggatcctggcagagaggagacccagacgctggttcgagagtacttttcctgggag ggggccttccagcatgtgggcaaagccttcaatcacggcaagatcttcaagtga >gi568815591r:132934610_133135920|GENSCAN_predicted_peptide_9|105_aa MAEPRDVCYERRAQVPQAHTGGASHQDWVSLKVSWGKWFLGRDLNNEQEFARQKADCEDF CVHSTGGKSEAGSQLRDFHRRRQLSIWIGDIAEETDAKGTYLDMF >gi568815591r:132934610_133135920|GENSCAN_predicted_CDS_9|318_bp atggcagaaccacgtgacgtgtgctacgagagacgtgcacaggtaccacaggcccacaca ggtggggcatctcaccaggactgggtgtcattaaaagtttcctggggtaagtggtttcta ggcagggatctgaacaatgagcaggagtttgctagacaaaaggcagattgtgaggacttc tgtgtgcacagcacagggggaaagagtgaagcagggagccagttaagagacttccacaga agacgacagttatctatctggattggagatatagcagaggagacagatgcaaaaggaaca tatttagacatgttttag >gi568815591r:132934610_133135920|GENSCAN_predicted_peptide_10|329_aa XMPYTVKQFVHCTAPGLYLHRPQCDRLSNPSHASSGRSFLILSQSTGLSLQWTLKDATRR ASDPSLQKGPRPPEEFSLWKGLGKGVNAKKCESLRPFRILTIKVEVKQGWMKTQDKEECT SGDWHGDTAHGFPGKTFLLNRYTVYCVMQGIVANATLAGTGAGGYWHVILQKGHLAIVTD GFVCWHVVESTGPLSMKVISWEGISELIPNPRTFSGIVEAALSIEEALGPGSISVENPDS IHCKDLTWWKREKEQRSWKVSAGAGTGIMGGTTSTRRVTFEADENENITVVKGIRLSENV IDRMKESSPSGSKSQRYSGAYGASGIAEF >gi568815591r:132934610_133135920|GENSCAN_predicted_CDS_10|990_bp nggatgccgtatacagttaaacagtttgtgcactgcacagctccagggctgtatttacac agaccacaatgtgatcggctatctaatccctcacatgcttcctctgggaggtcttttctg atcctctcacagtccacgggactctcccttcagtggacactcaaagatgctaccagacgc gcctctgaccccagtctccagaaaggaccacgccctccagaagaattctcactgtggaag ggattaggcaagggtgtgaatgccaagaagtgtgaatccctgaggccatttcggatcctt accataaaagtggaagtgaaacaaggttggatgaagacacaggacaaggaggaatgtacc agtggggactggcatggggatacagctcatggcttccctgggaagaccttcttgctgaat cggtacacagtttattgtgtcatgcagggcattgttgctaatgccacattggctggcact ggggctggtggctattggcatgtcatcctacagaaagggcaccttgccattgtgacagat ggattcgtgtgctggcatgttgtggaatcaacaggtccattaagtatgaaagtgataagc tgggaagggatttcagaactaataccgaatcctaggacatttagcggtattgttgaggct gccctctctatagaagaggccctggggccaggttcaatttctgtggagaaccctgacagc attcattgcaaagatttgacgtggtggaaaagggaaaaagaacaaaggagttggaaggtc tccgctggggcaggaaccggaatcatgggtgggaccaccagcacccgccgggtcaccttc gaggcggacgagaatgagaacatcaccgtggtgaagggcatccggctttcggaaaatgtg attgatcgaatgaaggaatcctctccatctggttcgaagtctcagcggtattctggtgct tatggtgcctcaggtattgctgaattttaa