GENSCAN 1.0 Date run: 4-Nov-116 Time: 22:36:50 Sequence gi568815596f:60791682_61022532 : 230851 bp : 41.04% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 121 201 81 0 0 43 113 59 0.509 4.92 1.02 Intr + 448 608 161 0 2 61 66 129 0.998 5.86 1.03 Intr + 1946 2034 89 2 2 38 58 113 0.983 2.00 1.04 Intr + 2290 2510 221 2 2 120 98 90 0.958 10.30 1.05 Intr + 3283 3339 57 0 0 104 87 62 0.983 5.86 1.06 Term + 5381 5479 99 1 0 83 42 154 0.998 7.45 1.07 PlyA + 7258 7263 6 1.05 2.11 PlyA - 7328 7323 6 1.05 2.10 Term - 13243 13035 209 2 2 32 50 216 0.023 8.72 2.09 Intr - 16864 16819 46 1 1 63 75 86 0.015 1.86 2.08 Intr - 19972 19890 83 2 2 83 44 56 0.053 -0.86 2.07 Intr - 23028 22951 78 1 0 58 56 86 0.048 1.00 2.06 Intr - 25518 25477 42 1 0 96 83 29 0.031 0.49 2.05 Intr - 26500 26328 173 0 2 73 -5 102 0.045 -1.94 2.04 Intr - 26732 26587 146 2 2 101 27 128 0.115 6.16 2.03 Intr - 26852 26809 44 0 2 146 27 3 0.060 -2.96 2.02 Intr - 28439 28405 35 1 2 87 99 30 0.070 0.95 2.01 Init - 28866 28547 320 0 2 64 -4 185 0.044 3.75 2.00 Prom - 31304 31265 40 -2.35 3.00 Prom + 38042 38081 40 -4.85 3.01 Init + 38333 38419 87 1 0 74 53 29 0.308 -1.30 3.02 Intr + 39973 40023 51 0 0 111 80 5 0.433 0.19 3.03 Term + 43643 44194 552 1 0 48 51 226 0.944 8.22 3.04 PlyA + 45798 45803 6 1.05 4.05 PlyA - 45839 45834 6 1.05 4.04 Term - 49381 49177 205 0 1 106 50 150 0.968 8.86 4.03 Intr - 49638 49497 142 1 1 85 47 83 0.834 2.39 4.02 Intr - 51405 51284 122 2 2 78 59 72 0.718 2.52 4.01 Init - 70042 69963 80 1 2 81 84 47 0.147 4.19 4.00 Prom - 74004 73965 40 -6.65 5.00 Prom + 85156 85195 40 -5.15 5.01 Init + 90160 90169 10 0 1 109 117 11 0.440 5.88 5.02 Intr + 100002 100144 143 1 2 96 90 152 0.996 15.35 5.03 Intr + 102716 102864 149 1 2 79 81 167 0.968 13.21 5.04 Intr + 109311 109402 92 0 2 65 115 83 0.934 7.42 5.05 Intr + 125196 125336 141 1 0 79 103 74 0.961 7.40 5.06 Intr + 126713 126925 213 0 0 69 86 205 0.999 16.16 5.07 Intr + 128360 128428 69 0 0 72 107 75 0.985 6.04 5.08 Term + 130082 130854 773 0 2 97 32 391 0.857 26.96 5.09 PlyA + 133834 133839 6 1.05 6.03 PlyA - 134860 134855 6 1.05 6.02 Term - 145697 145458 240 2 0 -8 49 333 0.888 14.84 6.01 Init - 146922 146653 270 0 0 90 74 271 0.974 22.91 6.00 Prom - 149129 149090 40 -6.75 7.15 PlyA - 150305 150300 6 1.05 7.14 Term - 150752 150714 39 2 0 121 33 65 0.353 0.51 7.13 Intr - 153427 153328 100 0 1 32 82 88 0.302 1.79 7.12 Intr - 156504 156302 203 0 2 112 -15 223 0.168 11.56 7.11 Intr - 161433 161316 118 2 1 32 63 63 0.339 -2.25 7.10 Intr - 162307 162252 56 1 2 100 116 44 0.511 5.26 7.09 Intr - 162477 162401 77 1 2 31 97 87 0.409 2.22 7.08 Intr - 163393 163337 57 2 0 85 95 37 0.762 2.04 7.07 Intr - 168836 168711 126 0 0 82 89 60 0.934 5.23 7.06 Intr - 169867 169782 86 0 2 70 58 83 0.967 2.04 7.05 Intr - 171195 171145 51 2 0 57 97 59 0.054 0.80 7.04 Intr - 173803 173742 62 1 2 90 95 24 0.065 -0.09 7.03 Intr - 217334 217080 255 2 0 82 94 151 0.526 11.82 7.02 Intr - 225520 225477 44 2 2 60 100 36 0.316 -0.96 7.01 Init - 226659 226458 202 0 1 34 78 156 0.277 8.29 7.00 Prom - 228233 228194 40 -7.45 8.00 Prom + 229366 229405 40 -8.25 8.01 Sngl + 230383 230721 339 0 0 88 37 383 0.997 28.98 8.02 PlyA + 230772 230777 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 136554 137336 783 2 0 86 47 165 0.837 7.97 S.002 Term + 204447 204629 183 2 0 109 41 90 0.807 2.96 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:60791682_61022532|GENSCAN_predicted_peptide_1|235_aa MKIEATHVKKKQLHHYLPAEILQKKKKQSLSDVNRSSGGLQSKRLSLDSSCLDSSRDTDN GTPFNSPASKSDSPSVGETERNSAEPAAVIVEKPLSVPPAQGLSIPVIGAKVDSTVKTVS PPTVCTIPTVVGRNVIPRITTPHNPAQGQPHLNGMSNITKTVTPKRSHSPSIDGTPKRLK DVEKDAIGGESMPIPTIDTSRKKRLPSKELPDSSSPVPANNIRVIKNSIRLTLNR >gi568815596f:60791682_61022532|GENSCAN_predicted_CDS_1|708_bp atgaaaattgaagcaactcatgtaaagaaaaaacaacttcaccactaccttcctgcagaa attcttcaaaagaagaaaaagcaaagtctctctgatgtcaatcgaagctcgggcggactt caatccaaaagattgtctctggatagcagttgtctggatagctccagagacactgataat ggaacaccttttaattctccagcgtccaagtctgatagcccttctgtaggagaaacagaa aggaatagtgctgagcctgctgctgtaattgtggagaagccactgagtgtaccaccagcc caaggactttccattccagtgattggcgcaaaagttgactctacagtaaaaactgtatca ccccccactgtgtgtaccattcctaccgtagtaggacgaaatgtcattcctagaatcaca acacctcacaaccctgcccagggacaaccgcatctgaatggaatgtcaaatataactaag actgttacacctaagagatcccattccccatccatagatgggactcctaagaggttgaaa gacgtagaaaaggatgccattggaggagaatctatgcctattccaactattgatacatca cgcaaaaagagactacccagtaaagaactaccagattcatcatctccagttccagcaaac aacatccgtgtcatcaaaaattccattcgactgacccttaatcggtaa >gi568815596f:60791682_61022532|GENSCAN_predicted_peptide_2|391_aa MEGAAEKDRRGETSERREELGCWWWERSSALDGPAPGEDCLPTPSPASVSLSIPAESHLH YSVKFSHLSFKFVCDWFFWDAGQELRIQKAVTLALCPCGKAEGPRSWALSNGGGGDPTAA RLPKGMITDFGVGGEVTVQTGSPSSGSEMELGQEMELFSHKGCFRAVAVTVDPAHHILFL PMGPAYLFCVHLPIDPSTSLGSHRMNYSEHHGLHYPGNSFHDGSLCTESLLGNAVKINTC AVQGTEGTAGNKRGMHLPMEKTENPIAFLRMYGQQQGCAGRWCQCQNKTELEDIQLVSTA ELTARLLGKAQSGPEFLQQVWGAKPLMAAVAHLERLQQGRCGQGCVCHGASRSLEQAGPL PTSELEGQEPDVPGCSCSHPVTTADPGIPSF >gi568815596f:60791682_61022532|GENSCAN_predicted_CDS_2|1176_bp atggagggcgcggcagagaaagacagaagaggagaaacatctgaacgccgagaggagttg ggctgttggtggtgggagaggagttcagctctggacggcccggctccaggggaagactgt cttcccactccatcccctgcttctgtctctctatccatccccgctgagagccacctccac tactcagtgaaattctcacatttatccttcaagttcgtgtgtgactggtttttctgggat gctggacaagagcttaggatacagaaagctgtcacattggccctctgcccttgcggaaag gcagagggtccacggagctgggctttgagcaacggtggcggtggcgacccaacagcagcc aggctcccaaaggggatgatcactgactttggggtggggggtgaggtcacagtacagaca gggagcccaagctcagggtctgaaatggagcttggccaagaaatggagctgttttcacac aagggatgcttcagagccgtggcggtaactgtggaccctgcacatcacattctcttcctt cccatgggcccagcttatttgttctgtgtccacctacccatagatccttcaacctccttg ggaagccatcgaatgaattattctgagcatcacgggttgcattatccaggaaacagtttc cacgacggcagtttgtgcacagaaagtttattggggaatgctgtcaagattaacacctgt gctgttcaaggcactgagggtacagcagggaataaaagagggatgcaccttccaatggag aaaacagaaaaccccatcgccttcctgagaatgtacggccaacagcaaggctgtgcaggt agatggtgtcagtgtcagaacaaaactgaattagaagacatccagctggtgtccactgca gaactgactgctcgcttgctgggcaaagcccagagcggcccggagttcctgcagcaggtc tggggagccaagccactgatggcagcagtggcccacctggagcggctgcagcagggaagg tgcggccagggctgcgtgtgccatggagccagcaggagcctggaacaggcgggacccctg cccacttctgaattggaagggcaggagcctgatgttcctgggtgcagctgcagccaccca gtcacgactgcggacccaggcatccctagcttttag >gi568815596f:60791682_61022532|GENSCAN_predicted_peptide_3|229_aa MASAGRARGQCDERGRRVQGGQVPDKAGRRQGLTMLPMLEGSDYSQLGHKARLHLKKKKK KKERKKERKKERKKERKKERKKERKKEKINPGINIKTDFSGERREVSIGSSYWSQAVLGP SACIVTWRTKCLHCDLEEVASLNPGFIICKTRLLILASQMKCSYGASLVLMAAAYMVAAV ITTVTHLREAGGVGIHRRVTGKGWFRGFGEMSRHGTITEIVRRGLEGRP >gi568815596f:60791682_61022532|GENSCAN_predicted_CDS_3|690_bp atggcaagtgcagggagagcaagagggcagtgtgacgagagagggagaagggtgcagggg ggccaggttccagataaggctggaaggagacagggtcttactatgttgcccatgctggag ggcagtgactattcacagctgggccacaaagcaagactccatctcaaaaagaagaagaaa aagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaaga aagaaagaaagaaagaaagaaaaaataaatccagggatcaatataaaaacagatttttct ggggagaggagagaagtaagcatagggagttcctactggagccaggcagtcctgggacca agtgcctgcattgtgacttggaggaccaagtgcctgcattgtgacttggaggaagttgcc tctttaaacccgggattcatcatctgcaagacaaggttattgatactcgcctcgcagatg aagtgtagttatggcgccagcctagtgcttatggctgctgcatacatggtagcagctgtg attactactgtaactcatctcagggaagcaggaggtgtaggcatccacagaagggtaaca gggaaaggatggtttaggggctttggggaaatgagccgccatgggactattacagagata gtcagaaggggactagaaggaaggccatga >gi568815596f:60791682_61022532|GENSCAN_predicted_peptide_4|182_aa MLLNRNVAEERSQSQNWPGVSVGCGGRGKIEDCVWVLLKVPLGGGGTGLSSGGKEGNRNG PHLHTSETNPIHLAAIKPVPVFLNPGWGALKNACAQVSLQANLITPSVDGSRASRLCEAE LGPLASASLDLKVFPEGQFGLACRYLCRPQQCTENTPSQHAHKSGSSAKHQTPTLLPMPD AK >gi568815596f:60791682_61022532|GENSCAN_predicted_CDS_4|549_bp atgctgctcaacaggaatgtggcagaggaaagaagccagagccaaaactggccaggtgtg agtgtgggctgtggggggagggggaagatcgaggactgtgtctgggtgctgctgaaagtg cctctaggtggtggggggacagggctgtcttcgggagggaaggaaggcaacaggaatggc ccacacctccacacctctgaaaccaacccgattcatcttgctgcaatcaaacctgtacca gtgtttctcaaccctggctggggagctcttaaaaatgcctgtgcccaggtgtctctccag gccaatctaatcacaccctctgtggatgggtcccgggcatccagactctgtgaagcagaa ctcgggcctctagcgtctgctagtctagatctaaaggtgtttcctgagggacagtttggc ctggcatgcaggtacctctgcagaccacaacagtgcaccgaaaacaccccctcccagcac gcacacaagtctggctcctcagccaaacatcaaacaccaacactgctgcccatgccagat gccaagtga >gi568815596f:60791682_61022532|GENSCAN_predicted_peptide_5|529_aa MASGAYNPYIEIIEQPRQRGMRFRYKCEGRSAGSIPGEHSTDNNRTYPSIQIMNYYGKGK VRITLVTKNDPYKPHPHDLVGKDCRDGYYEAEFGQERRPLFFQNLGIRCVKKKEVKEAII TRIKAGINPFNVPEKQLNDIEDCDLNVVRLCFQVFLPDEHGNLTTALPPVVSNPIYDNHD IEVRFVLNDWEAKGIFSQADVHRQVAIVFKTPPYCKAITEPVTVKMQLRRPSDQEVSESM DFRYLPDEKDTYGNKAKKQKTTLLFQKLCQDHEPNLFSHDAVVREMPTGVSSQAESYYPS PGPISSGLSHHASMAPLPSSSWSSVAHPTPRSGNTNPLSSFSTRTLPSNSQGIPPFLRIP VGNDLNASNACIYNNADDIVGMEASSMPSADLYGISDPNMLSNCSVNMMTTSSDSMGETD NPRLLSMNLENPSCNSVLDPRDLRQLHQMSSSSMSAGANSNTTVFVSQSDAFEGSDFSCA DNSMINESGPSNSTNPNSHGFVQDSQYSGIGSMQNEQLSDSFPYEFFQV >gi568815596f:60791682_61022532|GENSCAN_predicted_CDS_5|1590_bp atggcctccggtgcgtataacccgtatatagagataattgaacaacccaggcagagggga atgcgttttagatacaaatgtgaagggcgatcagcaggcagcattccaggggagcacagc acagacaacaaccgaacatacccttctatccagattatgaactattatggaaaaggaaaa gtgagaattacattagtaacaaagaatgacccatataaacctcatcctcatgatttagtt ggaaaagactgcagagacggctactatgaagcagaatttggacaagaacgcagacctttg tttttccaaaatttgggtattcgatgtgtgaagaaaaaagaagtaaaagaagctattatt acaagaataaaggcaggaatcaatccattcaatgtccctgaaaaacagctgaatgatatt gaagattgtgacctcaatgtggtgagactgtgttttcaagtttttctccctgatgaacat ggtaatttgacgactgctcttcctcctgttgtctcgaacccaatttatgacaaccatgac atagaagttcgttttgtgttgaacgattgggaagcaaaaggcatcttttcacaagctgat gtacaccgtcaagtagccattgttttcaaaactccaccatattgcaaagctatcacagaa cccgtaacagtaaaaatgcagttgcggagaccttctgaccaggaagttagtgaatctatg gattttagatatctgccagatgaaaaagatacttacggcaataaagcaaagaaacaaaag acaactctgcttttccagaaactgtgccaggatcacgaaccaaacttgttttctcatgat gcagttgtgagagaaatgcctacaggggtttcaagtcaagcagaatcctactatccctca cctgggcccatctcaagtggattgtcacatcatgcctcaatggcacctctgccttcttca agctggtcatcagtggcccaccccaccccacgctcaggcaatacaaacccactgagtagt ttttcaacaaggacacttccttctaattcgcaaggtatcccaccattcctgagaatacct gttgggaatgatttaaatgcttctaatgcttgcatttacaacaatgccgatgacatagtc ggaatggaagcgtcatccatgccatcagcagatttatatggtatttctgatcccaacatg ctgtctaattgttctgtgaatatgatgacaaccagcagtgacagcatgggagagactgat aatccaagacttctgagcatgaatcttgaaaacccctcatgtaattcagtgttagaccca agagacttgagacagctccatcagatgtcctcttccagtatgtcagcaggcgccaattcc aatactactgtttttgtttcacaatcagatgcatttgagggatctgacttcagttgtgca gataacagcatgataaatgagtcgggaccatcaaacagtactaatccaaacagtcatggt tttgttcaagatagtcagtattcaggtattggcagtatgcaaaatgagcaattgagtgac tcctttccatatgaattttttcaagtataa >gi568815596f:60791682_61022532|GENSCAN_predicted_peptide_6|169_aa MAEEGSAAGGVMDINTVLQEVLKTALIHDGLAYEICKAAKASDKCQAHLCVLCVLASNCD EPMYVKLVEALCAEHQINLIKVDDQKLGESRVLIEMEKQQQDEVDCNIKEAHEKLEMEME VARLHQCQVMLMRQDLMRCQEELWRMEKLNNQEMQKRRQLEPMREECRH >gi568815596f:60791682_61022532|GENSCAN_predicted_CDS_6|510_bp atggccgaggaaggcagtgctgctggaggtgtaatggacattaatactgttttacaggag gtgctgaagaccgccctcatccatgatggcctagcatatgaaatttgcaaagctgccaaa gcctcagacaagtgccaagcccatctttgtgtgctgtgtgtgcttgcatccaactgtgat gagcctatgtatgtcaagttggtggaggccctttgtgctgaacaccaaatcaacctaatt aaggttgatgaccagaaactaggggaatcgagggtactcattgagatggagaagcagcag caggacgaagtggactgcaatatcaaggaggctcatgagaagctggagatggagatggag gttgctcgccttcatcaatgccaggtcatgctaatgaggcaggatttgatgaggtgtcaa gaagagctgtggaggatggaaaagctgaacaaccaagagatgcaaaaacgacggcaactg gagcccatgcgagaggagtgcaggcactag >gi568815596f:60791682_61022532|GENSCAN_predicted_peptide_7|491_aa MQSVRGLSSCCVKPFRLSATSREVNTFKERLCSLSMPWLDKCRSQLLQTKVYALKIEKPF CQSENRSGPEGVKSQAPASGEKELLNELQKFLETEKDELILEVMNPPPKKIRLQELEDSI DNLSQNGEGRISVSHVGSTASKNSNLNVCNVCLGILQEFCEKDFIKKSLFEVSVVFAHPE TVEDCHFLMAVMKALNKIKEEDFLKQFPCPPNSPKAVCAVLEIECAHGAVFVAGRYNKYS RNLPQTPWIIDGERKLESSVEELISDHLLAVFKAESFNFSSSGREDVDVRTLGNGRPFAI ELVNPHRVHFTSQEIKELQQKINNSSNKIQVRDLQLVTREAIGHMKEGEEEKTKTYSALI WTNKAIQKKDIEFLNDIKDLKIDQKTPLRVLHRRPLAVRARVIHFMETQYVDEHHFRLHL KTQAGTYPSTCCTAAIEPELYWKSSCYIKEFVHGDFGRTKPNIGSLMNVTADILELDVES VDVDWPPALDD >gi568815596f:60791682_61022532|GENSCAN_predicted_CDS_7|1476_bp atgcaaagtgtacggggcttgagttcctgttgcgtaaaacctttccggctttcagcaact tctcgagaggttaacacctttaaggagagactttgctccctgtcgatgccgtggctggac aaatgccgctctcaactgctacagaccaaagtctacgctttaaagatagagaagccattc tgtcaaagtgaaaacagaagcgggccggaaggggttaaatctcaggctccggcgtctggg gagaaggagttgctcaatgaactacagaaatttctggaaactgaaaaagatgaattaatt ttggaagttatgaacccacctcccaagaaaattcgactgcaagaactggaagatagtatt gataatctaagtcaaaatggagagggaaggatctctgttagtcatgttggaagcactgct tccaagaactcaaatttaaatgtatgtaatgtatgcctaggaattcttcaagaattctgt gagaaagatttcattaaaaagagcttgtttgaagtgagtgtggtctttgctcacccagaa acagttgaggattgccacttcctaatggcagttatgaaagccttgaataagataaaggaa gaggatttccttaagcagtttccttgtcctccaaactcaccaaaggctgtatgcgctgtt cttgaaattgaatgtgctcatggtgctgtttttgtggctgggagatataataaatactcc aggaatctaccacaaactccttggataattgatggagaaaggaagctggaatcttcagtg gaagaattaatttcagatcatctgttggcagtatttaaagcagagagttttaatttttca tcctctggaagagaagatgtagatgtgagaacattaggaaatggaaggccctttgcaatt gagctggtgaatcctcatagagtacatttcacttcacaagaaattaaggaacttcagcag aaaattaataactcatctaacaaaatccaagtacgtgacttgcagcttgtcacaagagag gcaataggacatatgaaagaaggtgaagaagaaaagacaaagacctacagtgccttaatt tggacaaataaagcgatacagaagaaagacattgaattcctaaatgacataaaggactta aaaatcgaccagaaaacacctttgcgcgtccttcaccgaaggcccctggctgtgcgagct cgcgtcattcacttcatggagacacagtacgtggatgagcaccacttccgcctccacttg aaaactcaggctggcacgtatccttccacctgctgcactgccgccatcgaaccagaactc tattggaaaagttcatgctacattaaagagtttgtacatggagacttcgggagaaccaag ccaaacattgggtccctgatgaatgtgactgcagacattctggagctggatgttgagtct gtagatgttgactggccacctgctctggatgactag >gi568815596f:60791682_61022532|GENSCAN_predicted_peptide_8|112_aa MGRNQSRKAENSKNQSTSSPPKDRSSLPATEQSWTENDFDELTEVGFRRSAITNFSELKE HVLTRRKEAKNLEKRLDEWLTRINSVEKTLNDLMELKTMAQELRDARTSFNS >gi568815596f:60791682_61022532|GENSCAN_predicted_CDS_8|339_bp atggggagaaaccagagcagaaaagctgaaaattctaaaaatcagagcacctcatctcct ccaaaggatcgcagctccttgccagcaacggaacaaagctggacagagaatgactttgac gagttgacagaagtaggcttcagaaggtcggcaataacaaacttctccgagctaaaggag catgttctaacccgtcgcaaagaagctaaaaaccttgaaaaaaggttagatgaatggctt actagaataaacagtgtagagaagaccttaaatgacctgatggagctgaaaaccatggca caagaacttcgtgatgcacgcacaagcttcaatagctga