GENSCAN 1.0 Date run: 6-Nov-116 Time: 22:16:58 Sequence gi568815585r:108108536_108311268 : 202733 bp : 37.04% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 7551 7630 80 2 2 41 105 63 0.327 3.88 1.02 Intr + 17825 17969 145 2 1 68 92 98 0.679 7.56 1.03 Term + 19728 19856 129 2 0 76 38 69 0.576 -2.10 1.04 PlyA + 20413 20418 6 1.05 2.10 PlyA - 20534 20529 6 1.05 2.09 Term - 21871 21695 177 1 0 -31 42 211 0.274 1.30 2.08 Intr - 25964 25746 219 2 0 59 65 144 0.323 6.78 2.07 Intr - 30529 30386 144 1 0 -9 101 98 0.035 0.96 2.06 Intr - 50014 49963 52 0 1 107 57 8 0.041 -2.41 2.05 Intr - 50623 50377 247 2 1 70 59 143 0.102 5.20 2.04 Intr - 70029 69904 126 1 0 40 101 96 0.277 5.83 2.03 Intr - 71528 71504 25 2 1 91 94 20 0.043 -0.42 2.02 Intr - 82528 82312 217 0 1 102 87 65 0.104 5.38 2.01 Init - 89350 89280 71 1 2 70 68 68 0.050 3.57 2.00 Prom - 95226 95187 40 -6.55 3.02 PlyA - 95331 95326 6 1.05 3.01 Sngl - 102733 99998 2736 1 0 43 42 1663 0.616 149.82 3.00 Prom - 107502 107463 40 -7.85 4.00 Prom + 107912 107951 40 -2.25 4.01 Init + 109644 109698 55 2 1 80 63 91 0.199 6.63 4.02 Intr + 110008 110124 117 2 0 30 80 146 0.359 7.52 4.03 Term + 120664 121697 1034 2 2 128 35 263 0.331 16.29 4.04 PlyA + 122352 122357 6 1.05 5.00 Prom + 123411 123450 40 -2.65 5.01 Sngl + 133188 133319 132 2 0 90 28 156 0.524 2.82 5.02 PlyA + 135807 135812 6 1.05 6.03 PlyA - 136041 136036 6 1.05 6.02 Term - 147516 146638 879 0 0 60 32 298 0.850 12.95 6.01 Init - 151099 151025 75 1 0 99 81 57 0.867 7.24 6.00 Prom - 156297 156258 40 -4.85 7.00 Prom + 159490 159529 40 -2.85 7.01 Init + 161361 161699 339 2 0 44 87 287 0.546 21.50 7.02 Intr + 161805 161889 85 2 1 88 91 75 0.755 6.27 7.03 Intr + 168949 169036 88 2 1 10 65 116 0.059 -0.39 7.04 Intr + 194241 194358 118 0 1 112 22 62 0.487 1.55 7.05 Intr + 194919 195069 151 2 1 93 59 76 0.658 4.01 7.06 Term + 198291 198403 113 1 2 51 43 114 0.732 0.94 7.07 PlyA + 199791 199796 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:108108536_108311268|GENSCAN_predicted_peptide_1|117_aa MLEPQLKELVGVPFPVRYKSANQGLQSYHLSDIDNMLSNYCFSSLMYKTEVVVLSFSGGH TEDSMGNPCKVLNKKTEGRTVVFPIFEVLNLGLASWLLSLQTAYCGKSPCDPMSQFF >gi568815585r:108108536_108311268|GENSCAN_predicted_CDS_1|354_bp atgttagagccccagctgaaggagttagtcggagtcccatttcctgtaaggtacaagagt gcaaatcagggcctacagagttaccatctgagtgacattgacaatatgcttagcaactat tgcttcagttctcttatgtacaaaacagaggtggtggtgctgtctttctccgggggtcac acagaagattcgatggggaatccttgtaaagtgcttaacaagaagactgaaggccgcact gtcgtcttccctatttttgaggttttgaatctcggactggcttcctggctcctcagcttg cagacggcgtattgtgggaagtcaccttgtgatcccatgagtcaattcttctaa >gi568815585r:108108536_108311268|GENSCAN_predicted_peptide_2|425_aa MTIKKYPDVATCRGVKTGGATSIYISVMLNGRHVPPQIPLNMRLCPSLRISEIPNDVGFH SPALNAPLTSKLSNVPSNSKHCFQILHLQWGLLFLKVLGESKDSGVKPQTFAVSVTALKG GASGVVCSSWWVCGPADFRNEASDPRAIRVFYARFHGHKEVEAFRYCKEFRQTWGQESNT KSGPLYENGCEGEGKLLGGPALGRMLFNQSFLREMTREHVISLPQRRTLNLLPCCGVTSK PGDTAWCPDKRGLGSLFTPVTTRGHREGTIYENNPSPDTLSPGALILDFSASITKTQKNL VTLSGIFFQSEKAIWNPAECPRSPGSVGVSFRSLESGVRTTGVEACKDVYLLSDLKENKF QKAVSESMEQKTQRQTDLDLIREQKQEELMSVADCGMWDERMNENETEGNVGEKGTQEAP QQSQR >gi568815585r:108108536_108311268|GENSCAN_predicted_CDS_2|1278_bp atgaccatcaaaaagtatccagatgttgccacctgcaggggtgtcaagactggtggtgct actagcatctacatttctgttatgttaaatgggagacatgttcctccacagatccccctc aatatgaggctctgtccttctctgagaatttcagaaatccctaatgacgtaggcttccat agcccagccctaaatgctccactgacatccaagctttcaaatgtcccttcaaattctaag cactgttttcagatccttcaccttcagtggggactcctctttttgaaagtcttgggtgaa tccaaagattcaggagtgaagccacagaccttcgcagtgagtgttacagctcttaaaggt ggtgcttccggagttgtttgctcctcctggtgggtttgtggtcccgctgacttcaggaat gaagcctcagaccctcgcgccatccgagtcttctatgcccgttttcatggacacaaggag gttgaagctttcaggtattgtaaagagttcagacaaacgtggggacaagagtcaaacacc aaatctggaccattatatgagaatggctgtgaaggagaaggcaagcttctgggtggccca gctctgggtcgcatgctatttaaccagtccttccttagggagatgacgagggagcacgtg ataagcctcccacagagacgcacgctaaacctccttccctgctgtggagtcacctcaaag ccaggtgatacggcttggtgccctgataaaagaggcctagggagcttgttcaccccagtc accacacgaggacacagagaagggaccatctatgagaacaatccctcaccagacacccta tctcctggtgctttgatcttggatttttcagcctccataacgaaaacccaaaagaatctg gttacattatctggcattttcttccagagcgagaaagcaatatggaatccggctgaatgt ccaagatcaccaggcagtgtgggtgtatctttcaggagcctcgagtctggggtcagaaca accggtgttgaggcttgcaaggatgtgtatttattgagtgaccttaaagaaaataagttc cagaaggctgtttctgaatctatggaacagaaaactcaaagacagacagacctggacctc attagagaacaaaagcaagaggaactgatgtcagtagcagactgtgggatgtgggatgag aggatgaatgaaaatgaaacagaagggaatgttggtgaaaagggtacccaagaggcccca cagcagagccagcgctaa >gi568815585r:108108536_108311268|GENSCAN_predicted_peptide_3|911_aa MAASQTSQTVASHVPFADLCSTLERIQKSKGRAEKIRHFREFLDSWRKFHDALHKNHKDV TDSFYPAMRLILPQLERERMAYGIKETMLAKLYIELLNLPRDGKDALKLLNYRTPTGTHG DAGDFAMIAYFVLKPRCLQKGSLTIQQVNDLLDSIASNNSAKRKDLIKKSLLQLITQSSA LEQKWLIRMIIKDLKLGVSQQTIFSVFHNDAAELHNVTTDLEKVCRQLHDPSVGLSDISI TLFSAFKPMLAAIADIEHIEKDMKHQSFYIETKLDGERMQMHKDGDVYKYFSRNGYNYTD QFGASPTEGSLTPFIHNAFKADIQICILDGEMMAYNPNTQTFMQKGTKFDIKRMVEDSDL QTCYCVFDVLMVNNKKLGHETLRKRYEILSSIFTPIPGRIEIVQKTQAHTKNEVIDALNE AIDKREEGIMVKQPLSIYKPDKRGEGWLKIKPEYVSGLMDELDILIVGGYWGKGSRGGMM SHFLCAVAEKPPPGEKPSVFHTLSRVGSGCTMKELYDLGLKLAKYWKPFHRKAPPSSILC GTEKPEVYIEPCNSVIVQIKAAEIVPSDMYKTGCTLRFPRIEKIRDDKEWHECMTLDDLE QLRGKASGKLASKHLYIGGDDEPQEKKRKAAPKMKKVIGIIEHLKAPNLTNVNKISNIFE DVEFCVMSGTDSQPKPDLENRIAEFGGYIVQNPGPDTYCVIAGSENIRVKNIILSNKHDV VKPAWLLECFKTKSFVPWQPRFMIHMCPSTKEHFAREYDCYGDSYFIDTDLNQLKEVFSG IKNSNEQTPEEMASLIADLEYRYSWDCSPLSMFRRHTVYLDSYAVINDLSTKNEGTRLAI KALELRFHGAKVVSCLAEGVSHVIIGEDHSRVADFKAFRRTFKRKFKILKESWVTDSIDK CELQEENQYLI >gi568815585r:108108536_108311268|GENSCAN_predicted_CDS_3|2736_bp atggctgcctcacaaacttcacaaactgttgcatctcacgttccttttgcagatttgtgt tcaactttagaacgaatacagaaaagtaaaggacgtgcagaaaaaatcagacacttcagg gaatttttagattcttggagaaaatttcatgatgctcttcataagaaccacaaagatgtc acagactctttttatccagcaatgagactaattcttcctcagctagaaagagagagaatg gcctatggaattaaagaaactatgcttgctaagctttatattgagttgcttaatttacct agagatggaaaagatgccctcaaacttttaaactacagaacacccactggaactcatgga gatgctggagactttgcaatgattgcatattttgtgttgaagccaagatgtttacagaaa ggaagtttaaccatacagcaagtaaacgaccttttagactcaattgccagcaataattct gctaaaagaaaagacctaataaaaaagagccttcttcaacttataactcagagttcagca cttgagcaaaagtggcttatacggatgatcataaaggatttaaagcttggtgttagtcag caaactatcttttctgtttttcataatgatgctgctgagttgcataatgtcactacagat ctggaaaaagtctgtaggcaactgcatgatccttctgtaggactcagtgatatttctatc actttattttctgcatttaaaccaatgctagctgctattgcagatattgagcacattgag aaggatatgaaacatcagagtttctacatagaaaccaagctagatggtgaacgtatgcaa atgcacaaagatggagatgtatataaatacttctctcgaaatggatataactacactgat cagtttggtgcttctcctactgaaggttctcttaccccattcattcataatgcattcaaa gcagatatacaaatctgtattcttgatggtgagatgatggcctataatcctaatacacaa actttcatgcaaaagggaactaagtttgatattaaaagaatggtagaggattctgatctg caaacttgttattgtgtttttgatgtattgatggttaataataaaaagctagggcatgag actctgagaaagaggtatgagattcttagtagtatttttacaccaattccaggtagaata gaaatagtgcagaaaacacaagctcatactaagaatgaagtaattgatgcattgaatgaa gcaatagataaaagagaagagggaattatggtaaaacaacctctatccatctacaagcca gacaaaagaggtgaagggtggttaaaaattaaaccagagtatgtcagtggactaatggat gaattggacattttaattgttggaggatattggggtaaaggatcacggggtggaatgatg tctcattttctgtgtgcagtagcagagaagccccctcctggtgagaagccatctgtgttt catactctctctcgtgttgggtctggctgcaccatgaaagaactgtatgatctgggtttg aaattggccaagtattggaagccttttcatagaaaagctccaccaagcagcattttatgt ggaacagagaagccagaagtatacattgaaccttgtaattctgtcattgttcagattaaa gcagcagagatcgtacccagtgatatgtataaaactggctgcaccttgcgttttccacga attgaaaagataagagatgacaaggagtggcatgagtgcatgaccctggacgacctagaa caacttagggggaaggcatctggtaagctcgcatctaaacacctttatataggtggtgat gatgaaccacaagaaaaaaagcggaaagctgccccaaagatgaagaaagttattggaatt attgagcacttaaaagcacctaaccttactaacgttaacaaaatttctaatatatttgaa gatgtagagttttgtgttatgagtggaacagatagccagccaaagcctgacctggagaac agaattgcagaatttggtggttatatagtacaaaatccaggcccagacacgtactgtgta attgcagggtctgagaacatcagagtgaaaaacataattttgtcaaataaacatgatgtt gtcaagcctgcatggcttttagaatgttttaagaccaaaagctttgtaccatggcagcct cgctttatgattcatatgtgcccatcaaccaaagaacattttgcccgtgaatatgattgc tatggtgatagttatttcattgatacagacttgaaccaactgaaggaagtattctcagga attaaaaattctaacgagcagactcctgaagaaatggcttctctgattgctgatttagaa tatcggtattcctgggattgctctcctctcagtatgtttcgacgccacaccgtttatttg gactcgtatgctgttattaatgacctgagtaccaaaaatgaggggacaaggttagctatt aaagccttggagcttcggtttcatggagcaaaagtagtttcttgtttagctgagggagtg tctcatgtaataattggggaagatcatagtcgtgttgcagattttaaagcttttagaaga acttttaagagaaagtttaaaatcctaaaagaaagttgggtaactgattcaatagacaag tgtgaattacaagaagaaaaccagtatttgatttaa >gi568815585r:108108536_108311268|GENSCAN_predicted_peptide_4|401_aa MVDSRAHRVRHLFTTEASAAPGLEEDDEERRKRRGGTLLRGARFRARGRLRSLSASLRYL QRATMEKSWMLWNFVERWLIALASWSWALCRISLLPLIVTFHLYGGIILLLLIFISIAGI LYKFQDVLLYFPEQPSSSRLYVPMPTGIPHENIFIRTKDGIRLNLILIRYTGDNSPYSPT IIYFHGNAGNIGHRLPNALLMLVNLKVNLLLVDYRGYGKSEGEASEEGLYLDSEAVLDYV MTRPDLDKTKIFLFGRSLGGAVAIHLASENSHRISAIMVENTFLSIPHMASTLFSFFPMR YLPLWCYKNKFLSYRKISQCRMPSLFISGLSDQLIPPVMMKQLYELSPSRTKRLAIFPDG THNDTWQCQGYFTALEQFIKEVVKSHSPEEMAKTSSNVTII >gi568815585r:108108536_108311268|GENSCAN_predicted_CDS_4|1206_bp atggtggactcccgagctcatcgggtccggcacctcttcaccacggaggcctcggcggcc cctgggctggaggaggatgatgaggagcgacggaagcgacgcgggggtacgctgctgcgc ggcgcccggtttcgtgcccgcggccgactgcgcagcctgtccgcgagtctgagatactta cagagagctacaatggaaaagtcctggatgctgtggaactttgttgaaagatggctaata gccttggcttcatggtcttgggctctctgccgtatttctcttttacctttaatagtgact tttcatctgtatggaggcattatcttacttttgttaatattcatatcaatagcaggtatt ctgtataaattccaggatgtattgctttattttccagaacagccatcctcttcacgtctt tatgttcccatgcccactggcattccacatgaaaacattttcatcagaaccaaagatgga atacgtctgaatcttattttgatacgatacactggagacaattcaccctattccccaact ataatttattttcatgggaatgcaggcaacataggtcacaggttgccaaatgcattactt atgttggttaacctcaaagttaaccttttgctggttgattatcgaggatatggaaaaagt gaaggagaagcaagtgaagaaggactctacttagattctgaagctgtgttagactacgtg atgactagacctgaccttgataaaacaaaaatttttctttttggccgttccttgggtgga gcagtggctattcatttggcttctgaaaattcacataggatttcagccattatggtggag aacacatttttaagcataccacatatggccagcactttattttcattctttccgatgcgt taccttcctttatggtgctacaaaaataaatttttgtcctacagaaaaatctctcagtgt agaatgccttcacttttcatctctggactctcagatcaattaattccaccagtaatgatg aaacaactttatgaactctccccatctcggactaagagattagccatttttccagatggg actcacaatgacacatggcagtgccaaggctatttcactgcacttgaacagttcatcaaa gaagtcgtaaagagccattctcctgaagaaatggcaaaaacttcatctaatgtaacaatt atataa >gi568815585r:108108536_108311268|GENSCAN_predicted_peptide_5|43_aa MEVETAVMRSRDKEAEEHQQPQKLEARKDSSPEAFGGGGPAGA >gi568815585r:108108536_108311268|GENSCAN_predicted_CDS_5|132_bp atggaggtggagactgcagtgatgagatcacgagacaaggaagccgaggaacaccagcag ccacagaaactagaagcaaggaaagactcctccccagaagcctttggagggggtggtcct gctggtgcttaa >gi568815585r:108108536_108311268|GENSCAN_predicted_peptide_6|317_aa MTDSLQRLEELLSGGMPVCNVTDNKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMS ELPFTIASKRIKYLGIQLKRDVKDLFKENYKPLLNEIKEDTKKWKNIPCSWVGRINIVKM AILPKIIYRFNAIPIKLPMTFFTELEKNYFKVHTEPKKSSHRQVNPKPKQQSRRHHATDF KLYYKATVTKTAWYWYQNRDIDQWNRTKPSEITPHIYNHLIFDKPDKNKQWGKDSLFNKW CWENWLAICRKLKLDPFLTPYTKINSRWIKDLHFRSKTIKTLEENLGITIQDTGMGKDFT SKTQKQWQQKPKWTNGI >gi568815585r:108108536_108311268|GENSCAN_predicted_CDS_6|954_bp atgacggatagtttgcaaagactggaagaactactctctggtggcatgcctgtttgcaat gtgactgataataagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgta caaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatgagt gaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttaaaagg gatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggat acaaagaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatg gccatactgcccaagataatttatagattcaatgccatccccatcaagctaccaatgact ttcttcacagaattggaaaaaaactactttaaagttcatacggaaccaaaaaagagctcg catcgccaagtcaatcctaagccaaaacaacaaagccggaggcatcacgctactgacttc aaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagat atagatcaatggaacagaacaaagccctcagaaataacaccgcatatctacaaccatctg atctttgacaaacctgacaaaaacaagcaatggggaaaggattccctatttaataaatgg tgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttacacct tatacaaaaattaattcaagatggattaaagacttacattttagatctaaaaccataaaa accctagaagaaaacctaggcattaccattcaggacacaggcatgggcaaggacttcacg tctaaaacacaaaagcaatggcaacaaaagccaaaatggacaaatgggatctaa >gi568815585r:108108536_108311268|GENSCAN_predicted_peptide_7|297_aa MDDSTEREQSRLTSCLKKREEMKLKECVSILPRKESPSVRSSKDGKLLAATLLLALLSCC LTVVSFYQVAALQGDLASLRAELQGHHAEKLPAGAGAPKAGLEEAPAVTAGLKIFEPPAP GEGNSSQNSRNKRAVQGPEETEMLQERSPDPDPRRGFLDLTPEKIRGESTGLPSVATFGP RNGEISSSGNILPNSSDTLSSLGIKGKSLLVLYTDKTYAMGHLIQRKKVHVFGDELSLVT LFRCIQNMPETLPNNSCYSAGIAKLEEGDELQLAIPRENAQISLDGDVTFFGALKLL >gi568815585r:108108536_108311268|GENSCAN_predicted_CDS_7|894_bp atggatgactccacagaaagggagcagtcacgccttacttcttgccttaagaaaagagaa gaaatgaaactgaaggagtgtgtttccatcctcccacggaaggaaagcccctctgtccga tcctccaaagacggaaagctgctggctgcaaccttgctgctggcactgctgtcttgctgc ctcacggtggtgtctttctaccaggtggccgccctgcaaggggacctggccagcctccgg gcagagctgcagggccaccacgcggagaagctgccagcaggagcaggagcccccaaggcc ggcctggaggaagctccagctgtcaccgcgggactgaaaatctttgaaccaccagctcca ggagaaggcaactccagtcagaacagcagaaataagcgtgccgttcagggtccagaagaa acagaaatgttacaggaaaggagtcccgatccagacccccggagagggttcttggatctg acgccagaaaaaattcggggcgaatccacagggctcccttctgttgccacatttgggcca aggaatggagagatttcttcgtctggaaacattttgccaaactcttcagatactctttcc tctctgggaatcaaaggaaaatctctactagttttatatactgataagacctacgccatg ggacatctaattcagaggaagaaggtccatgtctttggggatgaattgagtctggtgact ttgtttcgatgtattcaaaatatgcctgaaacactacccaataattcctgctattcagct ggcattgcaaaactggaagaaggagatgaactccaacttgcaataccaagagaaaatgca caaatatcactggatggagatgtcacattttttggtgcattgaaactgctgtga