GENSCAN 1.0    Date run: 8-Nov-116    Time: 10:09:50

Sequence gi568815596r:175078291_175281393 : 203103 bp : 37.08% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Sngl +   4768   5673  906  0  0   86   39   217 0.821  12.59
 1.02 PlyA +   6863   6868    6                               1.05

 2.06 PlyA -   7624   7619    6                               1.05
 2.05 Term -   9705   9605  101  1  2   18   54   118 0.177  -1.29
 2.04 Intr -  10357  10321   37  1  1   40   92    32 0.134  -4.28
 2.03 Intr -  14977  14771  207  1  0   66   80   234 0.775  18.75
 2.02 Intr -  19303  19154  150  1  0   50   96   119 0.937   8.34
 2.01 Init -  29352  29224  129  0  0  104   27    83 0.856   4.00
 2.00 Prom -  31116  31077   40                              -2.65

 3.18 PlyA -  31191  31186    6                               1.05
 3.17 Term -  33003  32959   45  0  0  109   49    35 0.836  -2.07
 3.16 Intr -  33364  33278   87  1  0   40  103    49 0.202   0.85
 3.15 Intr -  35818  35704  115  0  1   91   63    38 0.321   1.13
 3.14 Intr -  36578  36400  179  2  2   71   75   106 0.482   5.40
 3.13 Intr -  39828  39700  129  0  0   65   72   115 0.985   7.57
 3.12 Intr -  40079  39961  119  0  2   71   53    92 0.973   3.26
 3.11 Intr -  43250  43154   97  2  1   66   53   121 0.878   5.06
 3.10 Intr -  51917  51848   70  1  1   75   90    46 0.280   1.77
 3.09 Intr -  89618  89531   88  0  1   62   59    66 0.100  -0.79
 3.08 Intr -  90148  89951  198  2  0   68   81   167 0.566  12.40
 3.07 Intr -  91921  91794  128  0  2   32   50    68 0.131  -3.20
 3.06 Intr - 100960 100767  194  1  2  122   89   189 0.864  19.77
 3.05 Intr - 101888 101808   81  2  0   79   75    47 0.727   1.42
 3.04 Intr - 103176 103065  112  2  1  113   78    43 0.966   5.26
 3.03 Intr - 103783 103692   92  1  2   79   45    96 0.508   2.27
 3.02 Intr - 106039 105862  178  0  1   49   34   229 0.518  12.60
 3.01 Init - 126038 125962   77  2  2   58   94    34 0.086   1.63
 3.00 Prom - 135321 135282   40                              -3.75

 4.00 Prom + 139442 139481   40                              -4.05
 4.01 Init + 161909 162094  186  1  0   65   31   167 0.123   7.80
 4.02 Intr + 175148 175195   48  1  0   87  119    40 0.891   4.96
 4.03 Term + 176971 177105  135  0  0   86   48   143 0.978   7.14
 4.04 PlyA + 177583 177588    6                               1.05

 5.02 PlyA - 178169 178164    6                               1.05
 5.01 Sngl - 178828 178628  201  1  0  101   47   171 0.203   7.41

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl + 111843 112550  708  2  0   70   47   228 0.958  12.87

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:175078291_175281393|GENSCAN_predicted_peptide_1|301_aa
MAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRACIAKSILSQKNKAGGITLPD
FKLYYKATVTKTAWYWYQNRDIDQWNRIEPSEITPHMYNYLIVDKPEKNKQWGKDSLFNK
WCWENWLAICRKLKLDPFLTYTKINSRWIKDFKVRPKTIKTLEENLGITIQDIGMGKDFM
SKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTKWKKIFATYLSDKGLISRIYN
ELKQIYKKKTKNPIQNWAKDMNRHFSKEDIYAAKRHMKKMLITGHRRNANQNHNEIPSHT
N

>gi568815596r:175078291_175281393|GENSCAN_predicted_CDS_1|906_bp
atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg
actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc
tgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgctacctgac
ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga
gatatagatcaatggaacagaatagagccctcagaaataacgccgcatatgtacaactat
ctgatcgttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaa
tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttact
tatacaaaaattaactcaagatggattaaagactttaaggttagacctaaaaccataaaa
accctagaagaaaacctaggcattaccattcaggacataggcatgggcaaagacttcatg
tctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaa
ctaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctacaaaa
tggaagaaaattttcgcaacctacttatctgacaaagggctaatatcgagaatctacaat
gaactcaaacaaatttacaagaaaaaaacaaagaaccccatccaaaactgggcaaaggat
atgaacagacacttctcaaaagaagacatttatgcagccaaaagacacatgaaaaaaatg
ctcatcactggccatcggagaaatgcaaatcaaaaccacaatgagataccatctcacacc
aattag

>gi568815596r:175078291_175281393|GENSCAN_predicted_peptide_2|207_aa
MAAAQSSFGSASEGDRGKRGRGRPWGEGEGDRGEREREREPQNRLKAALTQQHPPVTNGD
TVKGHGSGLVRTQSEESRPQSLQQPATSTTETPASPAHTTPQTQSTSGRRRRAANEDPDE
KRRKFLERNRAAASRCRQKRKVWVQSLEKKAEDLSSLNGQLQVGANRIAAFAIKGILLDV
DNSEKDIVFALTEFVAEETDMKQVIKV

>gi568815596r:175078291_175281393|GENSCAN_predicted_CDS_2|624_bp
atggcggcagcacagtccagctttggctcggcatcagagggagaccgtggaaagagaggg
agagggagaccatggggagagggagagggagaccgtggggagagggagagggagagggag
ccacaaaacagattaaaagctgctttgacccagcaacatcctccagttaccaatggtgat
actgtcaaaggtcatggtagcggattggttaggactcagtcagaggaatctcgaccgcag
tcattacaacagccagccacatccactacagaaactccggcttctccagctcacacaact
ccacagacccaaagtacaagtggtcgtcggagaagagcagctaacgaagatcctgatgaa
aaaaggagaaagtttttagagcgaaatagagcagcagcttcaagatgccgacaaaaaagg
aaagtctgggttcagtctttagagaagaaagctgaagacttgagttcattaaatggtcag
ctgcaggttggtgcaaacagaatcgcagcttttgccattaaaggcatattattagatgtg
gataacagcgaaaaagacattgtctttgcacttacagaatttgtggcagaggagacagac
atgaaacaagtaattaaagtgtga

>gi568815596r:175078291_175281393|GENSCAN_predicted_peptide_3|662_aa
MAGLEITSAAALEIANTCPYQRACFSEKPDDDFGAQPNKNLGFASSFGIVDALESIRQDI
HVHHCGGTKRWRKEPNDNPFIALDTPQSYFTEEENDQQVNFYYREAATDKRPNLTREEAG
EEPTSPVTQYLQPRSPEECKMFACAKLACTPSLIRAGSRVAYRPISASVLSRPEASRTGE
GSTVFNGAQNGVSQLIQREFQTSAISRDIDTAAKFIGAGAATVGVAGSGAGIGTVFGSLI
IGYARTKLGRCIMEQDRQRRGLQEVSCVCKEKTVKNFRTLGDIEECKKLLPALRAPCPIM
SARVLFDARPLYPPCRRPAFFSLSLTESPDSEALPQKEGKKGRVGEGEGRQKEALGAIVA
VLPPRRPARCGLWEPDWLGRTVRQYKDLWNMSDDKPFLCTAPGCGQRFTNEDHLAVHKHK
HEMTLKFGPARNDSVIVADQTPTPTRFLKNCEEVGLFNELASPFENEFKKASEDDIKKMP
LDLSPLATPIIRSKIEEPSVVETTHQDSPLPHPESTTSDEKEVPLAQTAQPTSAIVRPAS
LQVPNVLLTSSDSSVIIQQAVPSPTSSTVITQAPSSNRPIVPVPGPFPLLLHLPNGQTMP
VAIPASITSSNVHVPAAVPLVRPVTMVPSVPGIPGPSSPQPVQSEAKMWKIGVLHLKLDV
II

>gi568815596r:175078291_175281393|GENSCAN_predicted_CDS_3|1989_bp
atggctggcctggagattacatcagcagcagcactggagatcgccaacacatgtccctac
caaagagcatgtttcagtgagaaaccagatgatgattttggagcacagcctaataagaac
ctggggtttgcctcttctttcggcattgttgatgctcttgagagcatcaggcaggacatt
catgtgcaccattgtggcggcacgaaaagatggcggaaagagccaaacgacaaccctttc
attgctcttgacaccccacagtcctatttcacagaggaagagaatgaccagcaagtcaac
ttctactacagagaagcagcaactgataaaaggccaaatcttacaagagaggaagcggga
gaggagcccacgtcgcctgtcacccaatatctccagccgcgcagtcccgaagagtgtaag
atgttcgcctgcgccaagctcgcctgcaccccctctctgatccgagctggatccagagtt
gcatacagaccaatttctgcatcagtgttatctcgaccagaggctagtaggactggagag
ggctctacggtatttaatggggcccagaatggtgtgtctcagctaatccaaagggagttt
cagaccagtgcaatcagcagagacattgatactgctgccaaatttattggtgcaggtgct
gcaacagtaggagtggctggttctggtgctggtattggaacagtctttggcagccttatc
attggttatgccagaactaagttgggtagatgcatcatggaacaagacaggcaaagaaga
ggtctacaagaagtttcctgcgtttgtaaagagaagactgtaaagaattttaggactctt
ggggatattgaggaatgcaaaaaactgcttcccgcccttcgggctccttgtccaatcatg
agcgcccgagtgctctttgatgcccgtcccctctacccgccctgccgaagacccgccttc
ttctccttaagcctgacggaatcacctgactcggaggcgctccctcagaaggaaggcaag
aaggggcgtgtgggtgaaggggaggggcgccagaaggaagcccttggggcgattgtggct
gtgctgccacctcggcggcccgcgcggtgtgggctttgggaacctgactggctgggccgg
accgttaggcaatacaaggacctgtggaatatgagtgatgacaaaccctttctatgtact
gcgcctggatgtggccagcgttttaccaacgaggatcatttggctgtccataaacataaa
catgagatgacactgaaatttggtccagcacgtaatgacagtgtcattgtggctgatcag
accccaacaccaacaagattcttgaaaaactgtgaagaagtgggtttgtttaatgagttg
gcgagtccatttgagaatgaattcaagaaagcttcagaagatgacattaaaaaaatgcct
ctagatttatcccctcttgcaacacctatcataagaagcaaaattgaggagccttctgtt
gtagaaacaactcaccaggatagtcctttacctcacccagagtctactaccagtgatgag
aaggaagtaccattggcacaaactgcacagcccacatcagctattgttcgtccagcatca
ttacaggttcccaatgtgctgcttacaagttctgactcaagtgtaattattcagcaggca
gtaccttcaccaacctcaagtactgtaatcacccaggcaccatcctctaacaggccaatt
gtccctgtaccaggcccatttcctcttctgttacatcttcctaatggacaaaccatgcct
gttgctattcctgcatcaattacaagttctaatgtgcatgttccagctgcagtcccactc
gttcgaccagtcaccatggtgcctagtgttccaggaatcccaggtccttcctctccccaa
ccagtacagtcagaagcaaaaatgtggaaaataggagtattacatctgaaactagatgtg
atcatctga

>gi568815596r:175078291_175281393|GENSCAN_predicted_peptide_4|122_aa
MAEGKEEQVMSYMDGSKQRENHQILGDLLTVTRTAWERPAPMIQLPPTRSLPQPVGIPDE
IWYRDLAQKSSLLQVTGLQYPRVLLCDIIASREPTDHEQQQGLQSELKLNSKHFRRAFGD
TY

>gi568815596r:175078291_175281393|GENSCAN_predicted_CDS_4|369_bp
atggcggaaggcaaggaggagcaagtcatgtcctacatggatggcagcaagcaaagagag
aaccatcagatcttgggagacttactcactgtcacaagaacagcatgggaaagacctgcc
cccatgattcaattacctcccaccaggtccctcccacaacccgtgggaattccagatgag
atttggtacagagatcttgcacagaaaagtagcttgcttcaggtcacagggctgcaatat
cctagggttttactttgtgacatcattgcttctcgagaaccaactgaccacgaacagcaa
caaggtcttcaatcagaattaaaactcaattcaaaacactttcgaagagcatttggagat
acatattaa

>gi568815596r:175078291_175281393|GENSCAN_predicted_peptide_5|66_aa
MPKPRPPWAPAPPEPDPPLRAPPPALQHPVPSTAQGLRSAGARSGTGGQLACRPSAGSTR
RSQLSS

>gi568815596r:175078291_175281393|GENSCAN_predicted_CDS_5|201_bp
atgcccaagccccgccccccatgggctcccgcgccgcccgagcccgatcctcccctacgg
gcgccgccccctgctctgcagcacccggtcccatcgaccgcccaagggctgaggagtgcg
ggtgcgcggagcgggactggcgggcagctcgcctgccgccccagcgcgggatccactagg
cgaagccagctgagctcctga