GENSCAN 1.0    Date run: 8-Nov-116    Time: 05:02:56

Sequence gi568815597f:28638257_28843987 : 205731 bp : 46.95% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   6444   6536   93  2  0   81   80    86 0.601   7.48
 1.02 Term +   9937   9957   21  0  0   62   54    32 0.245  -4.49
 1.03 PlyA +  10257  10262    6                               1.05

 2.04 PlyA -  13257  13252    6                               1.05
 2.03 Term -  16600  16439  162  1  0   72   52   146 0.914   7.34
 2.02 Intr -  17958  17783  176  1  2    4   55   115 0.235  -0.54
 2.01 Init -  18613  18364  250  1  1   45   36   143 0.833   2.43
 2.00 Prom -  19066  19027   40                              -6.66

 3.00 Prom +  26301  26340   40                              -5.46
 3.01 Init +  30528  30583   56  2  2   30   89   151 0.652   8.16
 3.02 Intr +  51818  51930  113  2  2   31   87   124 0.952   6.62
 3.03 Intr +  53329  53453  125  2  2  124   86    68 0.970  10.50
 3.04 Intr +  54686  54789  104  1  2   83  115   100 0.998  11.17
 3.05 Intr +  58671  58828  158  0  2   72   54   114 0.958   6.05
 3.06 Intr +  64182  64313  132  1  0  102   66   133 0.987  13.12
 3.07 Intr +  65936  66073  138  0  0   81  111    52 0.976   7.24
 3.08 Intr +  72264  72386  123  1  0   61   91    69 0.927   5.06
 3.09 Term +  75817  76517  701  2  2   97   36   637 0.884  53.10
 3.10 PlyA +  79932  79937    6                               1.05

 4.00 Prom +  83892  83931   40                              -6.46
 4.01 Init +  98865  98891   27  2  0   79   92    28 0.670   2.06
 4.02 Intr +  99402  99426   25  2  1  105  115    16 0.785   3.60
 4.03 Intr + 100003 100082   80  2  2   80   86    23 0.784   0.57
 4.04 Intr + 104147 105730 1584  1  0  130   86   830 0.743  76.04
 4.05 Term + 111351 111368   18  2  0   96   42    26 0.302  -2.88
 4.06 PlyA + 112678 112683    6                               1.05

 5.00 Prom + 120609 120648   40                              -5.06
 5.01 Init + 137231 137288   58  1  1   83   99    74 0.646   8.05
 5.02 Term + 150654 150739   86  1  2   75   53    56 0.118  -1.38
 5.03 PlyA + 150918 150923    6                               1.05

 6.00 Prom + 165709 165748   40                              -6.26
 6.01 Init + 174128 174354  227  1  2   97  117   411 0.995  42.84
 6.02 Intr + 174471 174541   71  0  2  101   22    88 0.515   2.33
 6.03 Term + 176643 176686   44  1  2   64   55    64 0.283  -1.98
 6.04 PlyA + 177982 177987    6                               1.05

 7.04 PlyA - 182839 182834    6                               1.05
 7.03 Term - 187759 187562  198  1  0   79   37   105 0.896   1.90
 7.02 Intr - 188500 188436   65  2  2   54   94    94 0.962   5.04
 7.01 Init - 189428 189251  178  2  1   74  -29   174 0.533   3.72
 7.00 Prom - 190171 190132   40                              -4.26

 8.04 PlyA - 193461 193456    6                               1.05
 8.03 Term - 196970 196870  101  0  2   79   48    72 0.632   0.59
 8.02 Intr - 198844 198787   58  1  1  115  113   -14 0.709   2.26
 8.01 Init - 200295 200290    6  0  0  121   64     0 0.647   1.95

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:28638257_28843987|GENSCAN_predicted_peptide_1|37_aa
MVFSKENTYSRTKMAKVAGALVTGKDNEFGESDTYSR

>gi568815597f:28638257_28843987|GENSCAN_predicted_CDS_1|114_bp
atggtcttcagtaaggaaaacacttactcaaggaccaagatggccaaggtggctggagct
ttggtaaccgggaaagacaatgagtttggagagtcagatacctacagccgctga

>gi568815597f:28638257_28843987|GENSCAN_predicted_peptide_2|195_aa
MTVDYCKLNQVVTPVAAAEPDVALLLEQINTSPGTCYAATDLANAFFSIPVHKAHERQFA
FSLQGRQYTFTILPQGYINSHKAVTRKTASFEWSPKQDKALQQVQAVVQAALPFGPYDPA
DPMVLEVSVADRVAVWCLWQAPQPLSSPNGPMNKVAMVAGMKVTNGLSNMDFHSPRLTWL
RPRLSAQFSSSRDQH

>gi568815597f:28638257_28843987|GENSCAN_predicted_CDS_2|588_bp
atgacagtggattattgtaagcttaaccaagtggtgactccagttgcagctgctgaacca
gatgtggctttattgcttgagcaaattaacacatctcctggtacctgttatgcagctact
gatttggcaaatgcctttttctccattcctgtccacaaggcccacgagaggcaattcgcc
ttcagcttgcaaggccggcaatatactttcactatcttacctcaggggtatatcaactct
cacaaagctgtgacccgaaagactgccagttttgagtggagtccaaaacaggacaaggct
ctgcaacaggtccaggctgttgtgcaagctgctctgccatttggaccatatgacccagca
gatccaatggtgcttgaggtgtcagtggcagatagggttgctgtttggtgcctttggcag
gccccccaacccctgtcatcgcccaatgggcccatgaacaaagtggccatggtggcaggg
atgaaggttactaatgggctcagcaacatggacttccactcaccaaggctgacctggcta
cggccacgactgagtgcccaattttccagcagcagagaccaacactga

>gi568815597f:28638257_28843987|GENSCAN_predicted_peptide_3|549_aa
MLRRLPALAARRPPARRRRLFIDGHFYNRIYEAGSENNTAVVAVETHTIHKIEEGIDTGT
IEANEDMEIAYPITCGESKAILLWKKFVCPGINVKCVKFNDQLISPKHFVHLAGKSTLKD
WKRAIRLGGIMLRKMMDSGQIDFYQHDKVCSNTCRSTKFDLLISSARAPVPGQQTSVVQT
PTSADGSITQIAISEESMEEAGLEWNSALTAAVTMATEEGVKKDSEEISEDTLMFWKGIA
DVGLMEEVVCNIQKEIEELLRGVQQRLIQAPFQVTDAAVLNNVAHTFGLMDTVKKVLDNR
RNQVEQGEEQFLYTLTDLERQLEEQKKQGQDHRLKSQTVQNVVLMPVSTPKPPKRPRLQR
PASTTVLSPSPPVQQPQFTVISPITITPVGQSFSMGNIPVATLSQGSSPVTVHTLPSGPQ
LFRYATVVSSAKSSSPDTVTIHPSSSLALLSSTAMQDGSTLGNMTTMVSPVELVAMESGL
TSAIQAVESTSEDGQTIIEIDPAPDPEAEDTEGKAVILETELRTEEKVVAEMEEHQHQVH
NVEIVVLED

>gi568815597f:28638257_28843987|GENSCAN_predicted_CDS_3|1650_bp
atgctccgtcgcctgcccgccctggccgctcgccgcccgcccgcccgacggagacgtttg
tttatcgatggacacttttacaacaggatttatgaagctgggtcggagaacaacacggca
gttgtagcagtagaaactcacacgatacacaaaattgaagaagggattgatacaggcact
atagaagcaaatgaggatatggaaattgcttaccccataacttgtggggagagcaaagcc
atcctcctctggaagaagtttgtatgtccaggaataaacgtgaagtgtgtcaagttcaat
gatcagttgatcagccccaagcactttgttcatctggctggcaagtccactctgaaggac
tggaagagagctattcgtctgggtgggatcatgctcaggaaaatgatggactccggacag
attgatttttaccaacatgacaaagtttgctccaatacctgcagaagcaccaaatttgat
cttctgatcagcagtgcaagagctccagtgccaggacagcagacaagtgtggtgcagaca
cccacttcggctgatggtagcatcacgcagattgccatctcagaagagagcatggaagag
gcagggctggaatggaactcagctctcaccgctgctgtcaccatggccacggaggagggt
gtaaagaaagactcagaggaaatttcagaggacactttgatgttctggaaaggaatagct
gatgtagggctgatggaagaggttgtctgcaatatacagaaggaaatagaggagctactc
aggggagttcagcagcggctcatccaggctcccttccaagtcacagatgctgctgttctc
aacaatgtagcacacacatttggcctaatggacacagtcaagaaggttttagacaacaga
aggaaccaagtagagcagggagaagaacagtttctctatactctgacagacttggaacgc
cagttggaggagcagaagaagcaaggccaggatcacaggctgaaatctcagacagttcaa
aatgtggtactgatgcctgtgagcactcctaagcctccaaaaaggccccggctccagcgg
ccagcctccaccactgtcttgagcccttctcctcctgtccagcagcctcagttcacagtc
atctcacccatcaccatcaccccagtgggtcagtcattttccatgggcaatattccagtg
gccaccctcagccagggctccagtcctgtgactgtccacacactgccttctggccctcag
ctcttccgctatgccacagtggtctcctctgccaagagcagctcaccagacacagtgacc
atccacccttcatctagcttggcgctgctgagctctactgccatgcaggatgggagtaca
ctgggcaacatgaccaccatggttagccctgtggaattggtggccatggagtccggccta
acctcggcaattcaggctgttgaaagcacctcagaggatgggcagaccatcattgagatt
gatccagccccggacccagaagctgaagatactgagggcaaagcagtcatcttggagaca
gagctgaggactgaggagaaagttgtggctgagatggaagaacaccagcatcaagttcac
aatgtggagattgtggtcttagaggattaa

>gi568815597f:28638257_28843987|GENSCAN_predicted_peptide_4|577_aa
MSASSLLEQRPKGQGNKVQNGSVHQKDGLNDDDFEPYLSPQARPNNAYTAMSDSYLPSYY
SPSIGFSYSLGEAAWSTGGDTAMPYLTSYGQLSNGEPHFLPDAMFGQPGALGSTPFLGQH
GFNFFPSGIDFSAWGNNSSQGQSTQSSGYSSNYAYAPSSLGGAMIDGQSAFANETLNKAP
GMNTIDQGMAALKLGSTEVASNVPKVVGSAVGSGSITSNIVASNSLPPATIAPPKPASWA
DIASKPAKQQPKLKTKNGIAGSSLPPPPIKHNMDIGTWDNKGPVAKAPSQALVQNIGQPT
QGSPQPVGQQANNSPPVAQASVGQQTQPLPPPPPQPAQLSVQQQAAQPTRWVAPRNRGSG
FGHNGVDGNGVGQSQAGSGSTPSEPHPVLEKLRSINNYNPKDFDWNLKHGRVFIIKSYSE
DDIHRSIKYNIWCSTEHGNKRLDAAYRSMNGKGPVYLLFSVNGSGHFCGVAEMKSAVDYN
TCAGVWSQDKWKGRFDVRWIFVKDVPNSQLRHIRLENNENKPVTNSRDTQEVPLEKAKQV
LKIIASYKHTTSIFDDFSHYEKRQEEEESVKKPFNYK

>gi568815597f:28638257_28843987|GENSCAN_predicted_CDS_4|1734_bp
atgtcggccagcagcctcttggagcagagaccaaaaggtcaaggaaacaaagtacaaaat
ggatctgtacatcaaaaggatggattaaacgatgatgattttgaaccttacttgagtcca
caggcaaggcccaataatgcatatactgccatgtcagattcctacttacccagttactac
agtccctccattggcttctcctattctttgggtgaagctgcttggtctacggggggtgac
acagccatgccctacttaacttcttatggacagctgagcaacggagagccccacttccta
ccagatgcaatgtttgggcaaccaggagccctaggtagcactccatttcttggtcagcat
ggttttaatttctttcccagtgggattgacttctcagcatggggaaataacagttctcag
ggacagtctactcagagctctggatatagtagcaattatgcttatgcacctagctcctta
ggtggagccatgattgatggacagtcagcttttgccaatgagaccctcaataaggctcct
ggcatgaatactatagaccaagggatggcagcactgaagttgggtagcacagaagttgca
agcaatgttccaaaagttgtaggttctgctgttggtagcgggtccattactagtaacatc
gtggcttccaatagtttgcctccagccaccattgctcctccaaaaccagcatcttgggct
gatattgctagcaagcctgcaaaacagcaacctaaactgaagaccaagaatggcattgca
gggtcaagtcttccgccacccccgataaagcataacatggatattggaacttgggataac
aagggtcccgttgcaaaagccccctcacaggctttggttcagaatataggtcagccaacc
caggggtctcctcagcctgtaggtcagcaggctaacaatagcccaccagtggctcaggca
tcagtagggcaacagacacagccattgcctccacctccaccacagcctgcccagctttca
gtccagcaacaggcagctcagccaacccgctgggtagcacctcggaaccgtggcagtggg
ttcggtcataatggggtggatggtaatggagtaggacagtctcaggctggttctggatct
actccttcagaaccccacccagtgttggagaagcttcggtccattaataactataacccc
aaagattttgactggaatctgaaacatggccgggttttcatcattaagagctactctgag
gacgatattcaccgttccattaagtataatatttggtgcagcacagagcatggtaacaag
agactggatgctgcttatcgttccatgaacgggaaaggccccgtttacttacttttcagt
gtcaacggcagtggacacttctgtggcgtggcagaaatgaaatctgctgtggactacaac
acatgtgcaggtgtgtggtcccaggacaaatggaagggtcgttttgatgtcaggtggatt
tttgtgaaggacgttcccaatagccaactgcgacacattcgcctagagaacaacgagaat
aaaccagtgaccaactctagggacactcaggaagtgcctctggaaaaggctaagcaggtg
ttgaaaattatagccagctacaagcacaccacttccatttttgatgacttctcacactat
gagaaacgccaagaggaagaagaaagtgttaaaaagccctttaactacaagtaa

>gi568815597f:28638257_28843987|GENSCAN_predicted_peptide_5|47_aa
MVSALASAPVRKRPALTCAGSLSWWMSASSNSLLKPEACASHSRLES

>gi568815597f:28638257_28843987|GENSCAN_predicted_CDS_5|144_bp
atggtgtccgcccttgcttctgcgcctgtgcggaagcgcccggccctcacctgcgcaggg
tccctgtcatggtggatgtctgcatcatccaactcgttgcttaagccagaggcctgtgcg
agtcactctcgactcgagtcctga

>gi568815597f:28638257_28843987|GENSCAN_predicted_peptide_6|113_aa
MEPAPSAGAELQPPLFANASDAYPSACPSAGANASGPPGARSASSLALAIAITALYSAVC
AVGLLGNVLVMFGIVRGTWGPARGEATYIEGNTGMCVIVCIRIIISTRILQIK

>gi568815597f:28638257_28843987|GENSCAN_predicted_CDS_6|342_bp
atggaaccggccccctccgccggcgccgagctgcagcccccgctcttcgccaacgcctcg
gacgcctaccctagcgcctgccccagcgctggcgccaatgcgtcggggccgccaggcgcg
cggagcgcctcgtccctcgccctggcaatcgccatcaccgcgctctactcggccgtgtgc
gccgtggggctgctgggcaacgtgcttgtcatgttcggcatcgtccggggcacctggggc
ccagcgagaggcgaggccacttacatcgaggggaacacaggaatgtgtgtcatcgtgtgc
attagaatcatcatcagtacccgcatcctgcagataaaatga

>gi568815597f:28638257_28843987|GENSCAN_predicted_peptide_7|146_aa
MDDFEGFEASVEEVTADVVEIARELELEVGPEDVTELLLLQSHEKTLMDEKLLLIDEQEK
IATATPTFGNHYPQPAAINIETVSSMRARTVSSSLLHPKYLELCLVLQGGQPNDVEFFYH
PHFTDEKTDSTLSSTVDSQQEAGFEP

>gi568815597f:28638257_28843987|GENSCAN_predicted_CDS_7|441_bp
atggatgactttgaggggttcgaggcttcagtggaggaagtaactgcagatgtagtagaa
atagcaagagaactagaattagaagtggggcctgaagatgtgactgaattgctgctgctg
caatctcatgaaaaaactttaatggatgagaagttgcttcttatagatgagcaagaaaaa
attgccacagccaccccaaccttcggcaaccactacccccaaccagcagccatcaatatt
gagactgtcagctccatgagggcaaggactgtgtcttcatcactgttgcatcccaagtac
ctagaactgtgcctggtacttcaggggggacaacccaatgatgtagagtttttctatcat
ccccatttcacagatgagaaaacagacagcacgctgagttcaacagtggacagccagcaa
gaagctggatttgaaccctag

>gi568815597f:28638257_28843987|GENSCAN_predicted_peptide_8|54_aa
MGMGPPSCRKTSSGAPLILPYGVAGPGMGQQAQWGPKNADSHKDVAYQTLDLWK

>gi568815597f:28638257_28843987|GENSCAN_predicted_CDS_8|165_bp
atggggatgggaccacctagttgcaggaaaacaagctcaggagccccactgattctacct
tacggggtggctggaccagggatggggcagcaggcacaatgggggcccaagaacgcagac
tcccacaaggatgtggcatatcagaccctggacctctggaaatag