GENSCAN 1.0    Date run: 4-Nov-116    Time: 10:15:46

Sequence gi568815597f:28583613_28814770 : 231158 bp : 46.66% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   2128   2167   40                              -0.36
 1.01 Init +   8900   9132  233  1  2  111   80   400 0.985  39.23
 1.02 Term +  10082  10505  424  2  1  134   41   508 0.954  45.47
 1.03 PlyA +  11812  11817    6                               1.05

 2.07 PlyA -  12555  12550    6                               1.05
 2.06 Term -  19962  19927   36  0  0  120   41    39 0.652  -0.26
 2.05 Intr -  21848  21760   89  0  2  104   94    71 0.909   8.99
 2.04 Intr -  29749  29635  115  1  1  111   77   163 0.994  17.52
 2.03 Intr -  34418  34341   78  2  0   48   94   123 0.994   8.65
 2.02 Intr -  38553  38359  195  0  0   82   48   183 0.663  13.31
 2.01 Init -  40405  40334   72  1  0   66   87    42 0.683   2.97
 2.00 Prom -  41605  41566   40                              -2.56

 3.05 PlyA -  45165  45160    6                               1.05
 3.04 Term -  59358  59141  218  1  2   43   55   157 0.195   5.21
 3.03 Intr -  70999  70390  610  1  1   97   55   130 0.002   2.59
 3.02 Intr -  72602  72427  176  0  2    4   55   115 0.235  -0.54
 3.01 Init -  73257  73008  250  0  1   45   36   143 0.833   2.43
 3.00 Prom -  73710  73671   40                              -6.66

 4.00 Prom +  80945  80984   40                              -5.46
 4.01 Init +  85172  85227   56  1  2   30   89   151 0.652   8.16
 4.02 Intr + 106462 106574  113  1  2   31   87   124 0.952   6.62
 4.03 Intr + 107973 108097  125  1  2  124   86    68 0.970  10.50
 4.04 Intr + 109330 109433  104  0  2   83  115   100 0.998  11.17
 4.05 Intr + 113315 113472  158  2  2   72   54   114 0.958   6.05
 4.06 Intr + 118826 118957  132  0  0  102   66   133 0.987  13.12
 4.07 Intr + 120580 120717  138  2  0   81  111    52 0.976   7.24
 4.08 Intr + 126908 127030  123  0  0   61   91    69 0.927   5.06
 4.09 Term + 130461 131161  701  1  2   97   36   637 0.884  53.10
 4.10 PlyA + 134576 134581    6                               1.05

 5.00 Prom + 138536 138575   40                              -6.46
 5.01 Init + 153509 153535   27  1  0   79   92    28 0.670   2.06
 5.02 Intr + 154046 154070   25  1  1  105  115    16 0.785   3.60
 5.03 Intr + 154647 154726   80  1  2   80   86    23 0.784   0.57
 5.04 Intr + 158791 160374 1584  0  0  130   86   830 0.743  76.04
 5.05 Term + 165995 166012   18  1  0   96   42    26 0.302  -2.88
 5.06 PlyA + 167322 167327    6                               1.05

 6.00 Prom + 175253 175292   40                              -5.06
 6.01 Init + 191875 191932   58  0  1   83   99    74 0.646   8.05
 6.02 Term + 205298 205383   86  0  2   75   53    56 0.118  -1.38
 6.03 PlyA + 205562 205567    6                               1.05

 7.00 Prom + 220353 220392   40                              -6.26
 7.01 Init + 228772 228998  227  0  2   97  117   411 0.995  42.84
 7.02 Intr + 229115 229185   71  2  2  101   22    88 0.438   2.33

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  71244  71083  162  0  0   72   52   146 0.911   7.34

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:28583613_28814770|GENSCAN_predicted_peptide_1|218_aa
MEAEGCRYQFRVALLGDAAVGKTSLLRSYVAGAPGAPEPEPEPEPTVGAECYRRALQLRA
GPRVKLQLWDTAGHERFRCITRSFYRNVVGVLLVFDVTNRKSFEHIQDWHQEVMATQGPD
KVIFLLVGHKSDLQSTRCVSAQEAEELAASLGMAFVETSVKNNCNVDLAFDTLADAIQQA
LQQGDIKLEEGWGGVRLIHKTQIPRSPSRKQHSGPCQC
>gi568815597f:28583613_28814770|GENSCAN_predicted_CDS_1|657_bp
atggaggccgagggctgccgctaccaatttcgggtcgcgctgctgggggacgcggcggtg
ggcaagacgtcgctgctgcggagctacgtggcaggcgcgcctggcgccccggagccggag
cccgagcccgagcccacggtgggcgccgagtgctaccgccgcgcgctgcagctgcgggcc
gggccgcgggtcaagctgcaactctgggacaccgcgggccacgagcgcttcaggtgcatc
accaggtccttttaccggaatgtggtgggtgtcctgctggtctttgatgtgacaaacagg
aagtcctttgaacacatccaagactggcaccaggaggtcatggccactcagggcccggac
aaggtcatcttcctgctggttggccacaagagtgacctgcagagcacccgctgtgtctca
gcccaggaggccgaggagctagctgcctccctgggcatggccttcgtggagacctcggtt
aaaaacaactgcaatgtggacctggcctttgacaccctcgctgatgctatccagcaggcc
ctgcagcagggggacatcaagctagaagagggctgggggggtgtccggctcatccacaag
acccaaatccccaggtcccccagcaggaagcagcactcaggcccatgccagtgttga
>gi568815597f:28583613_28814770|GENSCAN_predicted_peptide_2|194_aa
MAASHFTGLTAVADVIKDLDTQIALIGLGPHSSKKKQDLDKLYELKSKARQIMNQFGPSA
LINLSNFSSIKPEPASTPPQGSMANSTAVVLTKKKLQDLVREVDPNEQLDEDVEEMLLQI
ADDFIESVVTAACQLARHRKSSTLEVKDVQLHLERQWNMWIPGFGSEEIRPYKKACTTEA
HKQRMALIRKTTKK
>gi568815597f:28583613_28814770|GENSCAN_predicted_CDS_2|585_bp
atggctgcctctcatttcaccgggctcacagctgttgctgatgtaattaaagatctagac
actcagatagctttaattggccttggtcctcacagctccaaaaagaaacaggatctcgat
aagctctatgagctgaagtccaaagctcggcagattatgaaccagtttggcccctcagcc
ctaatcaacctctccaatttctcatccataaaaccggaaccagccagcacccctccacaa
ggctccatggccaatagtactgcagtggtattgaccaagaagaaattacaggacttagta
agagaagtggatcctaatgagcagttggatgaagatgtggaggagatgctgctgcagatt
gctgatgattttatcgagagtgtggtgacagcagcctgtcagcttgcgcggcatcgcaag
tctagcaccctggaggtgaaagatgtccagctgcatttagagcgccagtggaacatgtgg
atcccaggatttggctctgaagaaatccgaccctacaaaaaagcttgcaccacagaagct
cacaaacagagaatggcattgatccggaaaacaaccaagaaataa
>gi568815597f:28583613_28814770|GENSCAN_predicted_peptide_3|417_aa
MTVDYCKLNQVVTPVAAAEPDVALLLEQINTSPGTCYAATDLANAFFSIPVHKAHERQFA
FSLQGRQYTFTILPQGYINSHKAVTRKTASFEWSPKQDKALQQVQAVVQAALPFGPYDPA
DPMVLEVSVADRVAVWCLWQAPKGQRFILAGIDTYSKYGFAYLACNASAKTIIRGLTECL
IHHHGIPHSTASDHGTDFTAKQVQQWALAHGIHLSYHVPHYPEAAGLIEQRNGLLKSQLQ
RQLGDNSLQGWGKVLQKAVYALNQHPIYGTVSPITRIQGSRNQEVEMEVAPLTITLSDPL
ANFLLPVPMALCSADLEVLVLEGGMLPPGGTRIPLNWKLRLPPRPVEEQEREEGVRTQTY
SGSVFEGPGCGRDGAVRIQALGKGLTERIRLGEDAAVGTRALRRDLGNGDGCEFSGI
>gi568815597f:28583613_28814770|GENSCAN_predicted_CDS_3|1254_bp
atgacagtggattattgtaagcttaaccaagtggtgactccagttgcagctgctgaacca
gatgtggctttattgcttgagcaaattaacacatctcctggtacctgttatgcagctact
gatttggcaaatgcctttttctccattcctgtccacaaggcccacgagaggcaattcgcc
ttcagcttgcaaggccggcaatatactttcactatcttacctcaggggtatatcaactct
cacaaagctgtgacccgaaagactgccagttttgagtggagtccaaaacaggacaaggct
ctgcaacaggtccaggctgttgtgcaagctgctctgccatttggaccatatgacccagca
gatccaatggtgcttgaggtgtcagtggcagatagggttgctgtttggtgcctttggcag
gcccccaaagggcagaggtttatcctcgctggaatagacacttactctaaatatgggttt
gcctatcttgcatgcaatgcttctgccaagactatcatccgtggacttacagaatgcctt
atccaccatcacggtattccacacagcactgcctctgaccacggcactgactttacagct
aaacaagtgcaacagtgggctcttgctcatggaattcacttgtcttaccatgttccccat
tatcctgaagcagctggattgatagaacagcggaatggccttttgaagtcacaattacaa
cgccaactaggtgacaattctttgcagggctggggaaaagttctccagaaggctgtgtat
gctctgaatcagcatccaatatatggtactgtttctcccataaccaggattcaagggtcc
aggaatcaagaggtggaaatggaagtggctccactcaccatcacccttagtgatccactg
gcaaacttcttgcttcctgttcccatggcattatgttctgctgacctagaggtcttagtt
ctagagggaggaatgctgccaccaggaggcacaaggattccattaaactggaagttaaga
ttgcctcctaggccagttgaggaacaagagcgggaggagggagtcaggacccagacctac
agtgggtcagtgttcgaaggaccaggttgtggacgagacggggctgtgaggattcaggct
cttgggaagggactgactgagaggatcaggctcggagaagacgcggctgtggggacccgg
gctctaagacgggacttagggaacggggacggatgcgagttctccggcatctga
>gi568815597f:28583613_28814770|GENSCAN_predicted_peptide_4|549_aa
MLRRLPALAARRPPARRRRLFIDGHFYNRIYEAGSENNTAVVAVETHTIHKIEEGIDTGT
IEANEDMEIAYPITCGESKAILLWKKFVCPGINVKCVKFNDQLISPKHFVHLAGKSTLKD
WKRAIRLGGIMLRKMMDSGQIDFYQHDKVCSNTCRSTKFDLLISSARAPVPGQQTSVVQT
PTSADGSITQIAISEESMEEAGLEWNSALTAAVTMATEEGVKKDSEEISEDTLMFWKGIA
DVGLMEEVVCNIQKEIEELLRGVQQRLIQAPFQVTDAAVLNNVAHTFGLMDTVKKVLDNR
RNQVEQGEEQFLYTLTDLERQLEEQKKQGQDHRLKSQTVQNVVLMPVSTPKPPKRPRLQR
PASTTVLSPSPPVQQPQFTVISPITITPVGQSFSMGNIPVATLSQGSSPVTVHTLPSGPQ
LFRYATVVSSAKSSSPDTVTIHPSSSLALLSSTAMQDGSTLGNMTTMVSPVELVAMESGL
TSAIQAVESTSEDGQTIIEIDPAPDPEAEDTEGKAVILETELRTEEKVVAEMEEHQHQVH
NVEIVVLED
>gi568815597f:28583613_28814770|GENSCAN_predicted_CDS_4|1650_bp
atgctccgtcgcctgcccgccctggccgctcgccgcccgcccgcccgacggagacgtttg
tttatcgatggacacttttacaacaggatttatgaagctgggtcggagaacaacacggca
gttgtagcagtagaaactcacacgatacacaaaattgaagaagggattgatacaggcact
atagaagcaaatgaggatatggaaattgcttaccccataacttgtggggagagcaaagcc
atcctcctctggaagaagtttgtatgtccaggaataaacgtgaagtgtgtcaagttcaat
gatcagttgatcagccccaagcactttgttcatctggctggcaagtccactctgaaggac
tggaagagagctattcgtctgggtgggatcatgctcaggaaaatgatggactccggacag
attgatttttaccaacatgacaaagtttgctccaatacctgcagaagcaccaaatttgat
cttctgatcagcagtgcaagagctccagtgccaggacagcagacaagtgtggtgcagaca
cccacttcggctgatggtagcatcacgcagattgccatctcagaagagagcatggaagag
gcagggctggaatggaactcagctctcaccgctgctgtcaccatggccacggaggagggt
gtaaagaaagactcagaggaaatttcagaggacactttgatgttctggaaaggaatagct
gatgtagggctgatggaagaggttgtctgcaatatacagaaggaaatagaggagctactc
aggggagttcagcagcggctcatccaggctcccttccaagtcacagatgctgctgttctc
aacaatgtagcacacacatttggcctaatggacacagtcaagaaggttttagacaacaga
aggaaccaagtagagcagggagaagaacagtttctctatactctgacagacttggaacgc
cagttggaggagcagaagaagcaaggccaggatcacaggctgaaatctcagacagttcaa
aatgtggtactgatgcctgtgagcactcctaagcctccaaaaaggccccggctccagcgg
ccagcctccaccactgtcttgagcccttctcctcctgtccagcagcctcagttcacagtc
atctcacccatcaccatcaccccagtgggtcagtcattttccatgggcaatattccagtg
gccaccctcagccagggctccagtcctgtgactgtccacacactgccttctggccctcag
ctcttccgctatgccacagtggtctcctctgccaagagcagctcaccagacacagtgacc
atccacccttcatctagcttggcgctgctgagctctactgccatgcaggatgggagtaca
ctgggcaacatgaccaccatggttagccctgtggaattggtggccatggagtccggccta
acctcggcaattcaggctgttgaaagcacctcagaggatgggcagaccatcattgagatt
gatccagccccggacccagaagctgaagatactgagggcaaagcagtcatcttggagaca
gagctgaggactgaggagaaagttgtggctgagatggaagaacaccagcatcaagttcac
aatgtggagattgtggtcttagaggattaa
>gi568815597f:28583613_28814770|GENSCAN_predicted_peptide_5|577_aa
MSASSLLEQRPKGQGNKVQNGSVHQKDGLNDDDFEPYLSPQARPNNAYTAMSDSYLPSYY
SPSIGFSYSLGEAAWSTGGDTAMPYLTSYGQLSNGEPHFLPDAMFGQPGALGSTPFLGQH
GFNFFPSGIDFSAWGNNSSQGQSTQSSGYSSNYAYAPSSLGGAMIDGQSAFANETLNKAP
GMNTIDQGMAALKLGSTEVASNVPKVVGSAVGSGSITSNIVASNSLPPATIAPPKPASWA
DIASKPAKQQPKLKTKNGIAGSSLPPPPIKHNMDIGTWDNKGPVAKAPSQALVQNIGQPT
QGSPQPVGQQANNSPPVAQASVGQQTQPLPPPPPQPAQLSVQQQAAQPTRWVAPRNRGSG
FGHNGVDGNGVGQSQAGSGSTPSEPHPVLEKLRSINNYNPKDFDWNLKHGRVFIIKSYSE
DDIHRSIKYNIWCSTEHGNKRLDAAYRSMNGKGPVYLLFSVNGSGHFCGVAEMKSAVDYN
TCAGVWSQDKWKGRFDVRWIFVKDVPNSQLRHIRLENNENKPVTNSRDTQEVPLEKAKQV
LKIIASYKHTTSIFDDFSHYEKRQEEEESVKKPFNYK
>gi568815597f:28583613_28814770|GENSCAN_predicted_CDS_5|1734_bp
atgtcggccagcagcctcttggagcagagaccaaaaggtcaaggaaacaaagtacaaaat
ggatctgtacatcaaaaggatggattaaacgatgatgattttgaaccttacttgagtcca
caggcaaggcccaataatgcatatactgccatgtcagattcctacttacccagttactac
agtccctccattggcttctcctattctttgggtgaagctgcttggtctacggggggtgac
acagccatgccctacttaacttcttatggacagctgagcaacggagagccccacttccta
ccagatgcaatgtttgggcaaccaggagccctaggtagcactccatttcttggtcagcat
ggttttaatttctttcccagtgggattgacttctcagcatggggaaataacagttctcag
ggacagtctactcagagctctggatatagtagcaattatgcttatgcacctagctcctta
ggtggagccatgattgatggacagtcagcttttgccaatgagaccctcaataaggctcct
ggcatgaatactatagaccaagggatggcagcactgaagttgggtagcacagaagttgca
agcaatgttccaaaagttgtaggttctgctgttggtagcgggtccattactagtaacatc
gtggcttccaatagtttgcctccagccaccattgctcctccaaaaccagcatcttgggct
gatattgctagcaagcctgcaaaacagcaacctaaactgaagaccaagaatggcattgca
gggtcaagtcttccgccacccccgataaagcataacatggatattggaacttgggataac
aagggtcccgttgcaaaagccccctcacaggctttggttcagaatataggtcagccaacc
caggggtctcctcagcctgtaggtcagcaggctaacaatagcccaccagtggctcaggca
tcagtagggcaacagacacagccattgcctccacctccaccacagcctgcccagctttca
gtccagcaacaggcagctcagccaacccgctgggtagcacctcggaaccgtggcagtggg
ttcggtcataatggggtggatggtaatggagtaggacagtctcaggctggttctggatct
actccttcagaaccccacccagtgttggagaagcttcggtccattaataactataacccc
aaagattttgactggaatctgaaacatggccgggttttcatcattaagagctactctgag
gacgatattcaccgttccattaagtataatatttggtgcagcacagagcatggtaacaag
agactggatgctgcttatcgttccatgaacgggaaaggccccgtttacttacttttcagt
gtcaacggcagtggacacttctgtggcgtggcagaaatgaaatctgctgtggactacaac
acatgtgcaggtgtgtggtcccaggacaaatggaagggtcgttttgatgtcaggtggatt
tttgtgaaggacgttcccaatagccaactgcgacacattcgcctagagaacaacgagaat
aaaccagtgaccaactctagggacactcaggaagtgcctctggaaaaggctaagcaggtg
ttgaaaattatagccagctacaagcacaccacttccatttttgatgacttctcacactat
gagaaacgccaagaggaagaagaaagtgttaaaaagccctttaactacaagtaa
>gi568815597f:28583613_28814770|GENSCAN_predicted_peptide_6|47_aa
MVSALASAPVRKRPALTCAGSLSWWMSASSNSLLKPEACASHSRLES
>gi568815597f:28583613_28814770|GENSCAN_predicted_CDS_6|144_bp
atggtgtccgcccttgcttctgcgcctgtgcggaagcgcccggccctcacctgcgcaggg
tccctgtcatggtggatgtctgcatcatccaactcgttgcttaagccagaggcctgtgcg
agtcactctcgactcgagtcctga
>gi568815597f:28583613_28814770|GENSCAN_predicted_peptide_7|100_aa
MEPAPSAGAELQPPLFANASDAYPSACPSAGANASGPPGARSASSLALAIAITALYSAVC
AVGLLGNVLVMFGIVRGTWGPARGEATYIEGNTGMCVIVX
>gi568815597f:28583613_28814770|GENSCAN_predicted_CDS_7|300_bp
atggaaccggccccctccgccggcgccgagctgcagcccccgctcttcgccaacgcctcg
gacgcctaccctagcgcctgccccagcgctggcgccaatgcgtcggggccgccaggcgcg
cggagcgcctcgtccctcgccctggcaatcgccatcaccgcgctctactcggccgtgtgc
gccgtggggctgctgggcaacgtgcttgtcatgttcggcatcgtccggggcacctggggc
ccagcgagaggcgaggccacttacatcgaggggaacacaggaatgtgtgtcatcgtgtnn