GENSCAN 1.0    Date run: 7-Nov-116    Time: 04:14:32

Sequence gi568815583r:39991790_40206086 : 214297 bp : 44.59% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    386    440   55  1  1   57   97    40 0.512   0.35
 1.02 Intr +    980   1059   80  0  2  103  110    26 0.782   5.57
 1.03 Intr +   2927   2980   54  1  0   46   72    68 0.463   0.18
 1.04 Intr +   5175   5276  102  2  0  103   42    61 0.907   3.37
 1.05 Intr +   6942   6995   54  2  0   49   91    72 0.839   2.78
 1.06 Intr +   9199   9435  237  0  0   37   86   428 0.985  35.31
 1.07 Intr +  10924  10999   76  0  1   96   80   101 0.900   9.19
 1.08 Intr +  11404  11525  122  2  2   69   76   100 0.900   7.11
 1.09 Intr +  15227  15276   50  1  2  120  107   -30 0.931  -0.42
 1.10 Intr +  16238  16406  169  2  1   83   21   119 0.826   4.65
 1.11 Intr +  24713  24883  171  1  0  119   36   152 0.994  13.24
 1.12 Intr +  25319  25453  135  1  0   94   81   159 0.999  16.56
 1.13 Intr +  27304  27411  108  0  0  129   84   102 0.997  14.38
 1.14 Intr +  29110  29238  129  0  0  105   95   117 0.833  14.99
 1.15 Intr +  30730  30816   87  0  0   80   98   100 0.999  10.37
 1.16 Intr +  34188  34300  113  2  2   60   93   117 0.993   8.58
 1.17 Intr +  37617  37675   59  0  2   94   75    -4 0.952  -2.57
 1.18 Intr +  38570  38667   98  0  2   84   86    98 0.984   8.93
 1.19 Intr +  40380  40448   69  2  0   86   87    41 0.893   3.18
 1.20 Intr +  42537  42655  119  2  2   79  106    96 0.958   9.76
 1.21 Term +  43238  43295   58  2  1  129   40     4 0.878  -3.14
 1.22 PlyA +  43785  43790    6                                1.05

 2.06 PlyA -  43969  43964    6                                1.05
 2.05 Term -  44711  44544  168  2  0   99   43   160 0.999  10.48
 2.04 Intr -  45160  45131   30  1  0   95   86    21 0.580   0.93
 2.03 Intr -  45229  45197   33  1  0  147  101    24 0.882   8.02
 2.02 Intr -  46605  46493  113  1  2   85   77   124 0.980  11.10
 2.01 Init -  47327  47087  241  2  1  104   58   164 0.649  11.16
 2.00 Prom -  48323  48284   40                               -8.36

 3.00 Prom +  58041  58080   40                               -3.56
 3.01 Init +  69169  69264   96  0  0   67   11   116 0.043   2.11
 3.02 Intr +  73641  73800  160  2  1   68   72    36 0.092  -0.44
 3.03 Intr +  82103  82225  123  0  0   27  101    68 0.479   2.56
 3.04 Intr +  82899  83174  276  1  0   56   68   152 0.133   7.49
 3.05 Intr +  90543  90685  143  1  2  119   85    16 0.163   4.47
 3.06 Term +  95910  96026  117  2  0  101   54    95 0.694   5.94
 3.07 PlyA +  96087  96092    6                                1.05

 4.09 PlyA -  96121  96116    6                                1.05
 4.08 Term -  96404  96317   88  1  1  116   54    62 0.572   2.73
 4.07 Intr - 100099 100002   98  1  2  103   66    62 0.797   4.31
 4.06 Intr - 101117 100944  174  2  0   63   93    83 0.946   6.44
 4.05 Intr - 101417 101370   48  2  0   87   77    25 0.488   0.18
 4.04 Intr - 112551 112391  161  1  2   84   32   109 0.195   4.61
 4.03 Intr - 114302 114006  297  0  0   88  101   171 0.955  15.25
 4.02 Intr - 117330 116984  347  2  2   48  105   132 0.634   5.94
 4.01 Init - 147771 146624 1148  0  2   58   80   227 0.188  12.30
 4.00 Prom - 150592 150553   40                               -4.66

 5.00 Prom + 150746 150785   40                               -4.26
 5.01 Init + 164891 165017  127  1  1  102   66    87 0.976   8.28
 5.02 Intr + 169340 169466  127  0  1   49   92    94 0.902   5.64
 5.03 Intr + 173264 173407  144  2  0   81   78    88 0.976   6.50
 5.04 Intr + 178273 178332   60  1  0   93   80    84 0.982   5.95
 5.05 Intr + 178748 178892  145  2  1  106   83    73 0.993   8.88
 5.06 Intr + 184688 184884  197  1  2   66   54    55 0.973  -1.89
 5.07 Intr + 191925 192094  170  0  2   74  110   165 0.992  16.89
 5.08 Intr + 193376 193590  215  0  2   49   86    61 0.764   0.43
 5.09 Intr + 193762 193853   92  0  2   92   73    22 0.700  -0.11
 5.10 Intr + 204756 204985  230  0  2   80   93   131 0.842  10.31
 5.11 Intr + 207826 207938  113  2  2   63   91    87 0.786   6.60

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:39991790_40206086|GENSCAN_predicted_peptide_1|714_aa ADSKQDDQTGDLIKSDPSGHLTGMVGTALYVSPEVQGSTKSAYNQGALDNNIKLKQTSKY LVRKVDLFSLGIIFFEMSYHPMVTASERIFVLNQLRDPTSPKFPEDFDDGEHAKQKSVIS WLLNHDPAKRPTATELLKSELLPPPQMEESELHEVLHHTLTNVDGKAYRTMMAQIFSQRI SPAIDYTYDSDILKGNFSIRTAKMQQHVCETIIRIFKRHGAVQLCTPLLLPRNRQIYEHN EAALFMDHSGMLVMLPFDLRIPFARYVARNNILNLKRYCIERVFRPRKLDRFHPKELLEC AFDIVTSTTNSFLPTAEIIYTIYEIIQEFPALQLCRLYKFIEQKGDLQDLMPTINSLIKQ KTGIAQLVKYGLKDLEEVVGLLKKLGIKLQVLINLGLVYKVQQHNGIIFQFVAFIKRRQR AVPEILAAGGRYDLLIPQFRGPQALGPVPTAIGVSIAIDKISAAVLNMEESVTISSCDLL VVSVGQMSMSRAINLTQKLWTAGITAEIMYDWSQSQEELQEYCRHHEITYVALVSDKEGS HVKVKSFEKERQTEKRVLETELVDHVLQKLRTKVTDERNGREASDNLAVQNLKGSFSNAS GLFEIHGATVVPIVSVLAPEKLSASTRRRYETQVQTRLQTSLANLHQKSSEIEILAWDAD EQAFNTTVKQLLSRLPKQRYLKLVCDEIYNIKVEKKVSVLFLYSYRDDYYRILF >gi568815583r:39991790_40206086|GENSCAN_predicted_CDS_1|2145_bp gctgacagcaaacaagacgatcagacaggagacttgattaagtcagacccttcaggtcac ttaactgggatggttggcactgctctctatgtaagcccagaggtccaaggaagcaccaaa tctgcatacaaccagggagcattggacaacaacataaagttgaaacagacatctaagtac ctggtgaggaaagtggatctcttcagcctgggaattatcttctttgagatgtcctatcac cccatggtcacggcttcagaaaggatctttgttctcaaccaactcagagatcccacttcg cctaagtttccagaagactttgacgatggagagcatgcaaagcagaaatcagtcatctcc tggctgttgaaccacgatccagcaaaacggcccacagccacagaactgctcaagagtgag ctgctgcccccaccccagatggaggagtcagagctgcatgaagtgctgcaccacacgctg accaacgtggatgggaaggcctaccgcaccatgatggcccagatcttctcgcagcgcatc tcccctgccatcgattacacctatgacagcgacatactgaagggcaacttctcaatccgt acagccaagatgcagcagcatgtgtgtgaaaccatcatccgcatctttaaaagacatgga gctgttcagttgtgtactccactactgcttccccgaaacagacaaatatatgagcacaac gaagctgccctattcatggaccacagcgggatgctggtgatgcttccttttgacctgcgg atcccttttgcaagatatgtggcaagaaataatatattgaatttaaaacgatactgcata gaacgtgtgttcaggccgcgcaagttagatcgatttcatcccaaagaacttctggagtgt gcatttgatattgtcacttctaccaccaacagctttctgcccactgctgaaattatctac 
actatctatgaaatcatccaagagtttccagcacttcagctgtgtcgactctacaagttt attgaacagaagggagatttgcaagatcttatgccaacaataaattcattaataaaacag aaaacaggtattgcacagttggtgaagtatggcttaaaagacctagaggaggttgttgga ctgttgaagaaactcggcatcaagttacaggtcttgatcaatttgggcttggtttacaag gtgcagcagcacaatggaatcatcttccagtttgtggctttcatcaaacgaaggcaaagg gctgtacctgaaatcctcgcagctggaggcagatatgacctgctgattccccagtttaga gggccacaagctctggggccagttcccactgccattggggtcagcatagctatagacaag atatctgctgctgtcctcaacatggaggaatctgttacaataagctcttgtgacctcctg gttgtaagtgttggccagatgtctatgtccagggccatcaacctaacccagaaactctgg acagcaggcatcacagcagaaatcatgtacgactggtcacagtcccaagaggaattacaa gagtactgcagacatcatgaaatcacctatgtggcccttgtctcggataaagaaggaagc catgtcaaggttaagtctttcgagaaggaaaggcagacagagaagcgtgtgctggagact gaacttgtggaccatgtactgcagaaactgaggactaaagtcactgatgaaaggaatggc agagaagcttccgataatcttgcagtgcaaaatctgaaggggtcattttctaatgcttca ggtttgtttgaaatccatggagcaacagtggttcccattgtgagtgtgctagccccggag aagctgtcagccagcactaggaggcgctatgaaactcaggtacaaactcgacttcagacc tcccttgccaacttacatcagaaaagcagtgaaattgaaattctggcttgggatgctgat gaacaggcatttaacacaactgtgaagcagctgctgtcacgcctgccaaagcaaagatac ctcaaattagtctgtgatgaaatttataacatcaaagtagaaaaaaaggtgtctgtgcta tttctgtacagctatagagatgactactacagaatcttattttaa >gi568815583r:39991790_40206086|GENSCAN_predicted_peptide_2|194_aa MVLLESEQVWLGLAGRGKGAAQALLETCASCRRTRALTRQDEDLGLANGAGPVVFQFLTE LTRLFQKCRTSGSVYITLKKYDGRTKPIPKKGTVEGFEPADNKCLLRATDGKKKISTVVS SKEVNKFQMGKSQYDSEHWAYSNLLRANMDGLKKRDKKNKTKKTKAAAAAAAAAPAAAAT APTTAATTAATAAQ >gi568815583r:39991790_40206086|GENSCAN_predicted_CDS_2|585_bp atggtgttgttggagagcgagcaggtatggctaggcctggccggccgagggaagggggcg gctcaggcgttactcgagacctgtgcctcctgcaggaggaccagggccttgacgcggcag gacgaggatttggggctggctaacggggctgggcctgttgttttccagttcctgacggag ctgaccagacttttccagaagtgccggacgtcgggcagcgtctatatcaccttgaagaag tatgacggtcgaaccaaacccattccaaagaagggtactgtggagggctttgagcccgca gacaacaagtgtctgttaagagctaccgatgggaagaagaagatcagcactgtggtgagc tccaaggaagtgaataagtttcagatgggcaagagtcagtatgatagtgaacactgggct 
tattcaaacctccttagagctaacatggatgggctgaagaagagagacaaaaagaacaaa actaagaagaccaaagcagcagcagcagcagcagcagcagcacctgccgcagcagcaaca gcaccaacaacagcagcaacaacagcagcaacagcagcacagtaa >gi568815583r:39991790_40206086|GENSCAN_predicted_peptide_3|304_aa MEVKIRRSSQQGQPLQGGVRKREEEVEQKLKRWHQLPVATEHFNCGCRDYVKLYLIIINL IVNNYMWLVVSVLGTAGAHFLLAMDTLKAARNQPCATPDLRNKSKCVRNWWVLSFTDFKN EAADPRDSGAQLVSPSGSRNGAAGGAAYQSCAVRSHSSALGWSMGLGAVEKGAAFVGEAR AAQEPTEGVGGSGMAGCRSRALPRGKAAKAQREIEHSTGAGRLVVGRTLPEGSCHATDSA HPGHTLRHQSPLVSSFTKVLRNHGHLGLCCAAYRVVVGITLLNSKRGKEITAKAERLQVT QAGS >gi568815583r:39991790_40206086|GENSCAN_predicted_CDS_3|915_bp atggaggtgaagattcggagaagtagccagcagggacaaccacttcaaggaggggtgcgt aaaagggaggaggaagtggagcagaagctgaagaggtggcaccagctgcctgtggctact gagcacttcaactgtggctgcagagactatgttaaattgtatttaatcataattaattta attgtaaacaactacatgtggctagtggtttctgtactgggcactgcaggtgcacatttc ctgctggccatggacacactaaaggcagccagaaatcaaccgtgcgccacccccgacctg agaaacaagtctaaatgtgtccggaattggtgggttcttagtttcactgacttcaagaat gaagccgcggaccctcgcgactcaggagcccagctggtttcacctagtggatcccgcaat ggggctgcaggtggagctgcctaccagtcctgcgccgtgcgctcgcattcctcagccctt gggtggtcgatgggactgggcgccgtggagaagggggcggcattcgtcggggaggctcgg gccgcacaggagcctacggagggggtgggaggctcaggcatggcgggctgcaggtcccga gccctgccccgcgggaaggcagctaaggcccagcgagaaatcgagcacagcaccggagca gggaggctggtagttggcaggaccctgccagagggaagctgccatgccactgattcagcc caccctggccacaccctcaggcaccagagcccactggtgtcatccttcaccaaggtgctt cggaaccatggacatttgggcctctgctgtgctgcctatcgcgtggtggtcggaatcact cttctcaactccaagaggggaaaggaaatcacggcgaaagctgagcgccttcaggtcact caagctggcagctga >gi568815583r:39991790_40206086|GENSCAN_predicted_peptide_4|786_aa MIVYLENPIVSAPNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIAS KRIKYLGIQLTRDVRDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIY RFNAIPIKLPMTFFTELEKATLKFIWNQKRACIAKSILSQKNKAGGITLPDFKLYYKATV TKTAWYWYQNRHIDQWNRTEPSEIMPHIYNYLIFDKPDKNKQWGKDSLFNKWCWENCLAI CRKLKLDPFLTPYTKINSRWIKDLHVRPETIKTLEENLGNTIQDIGMGKDFMSKTPKAMA TKAKIDKSDLIKLKSFCTAKETTIRVNRQPTGWEKIFATYSSDKGLISRLYNELKQIYKK 
KQTTPPKSGRREVPFIHPSRDVEACVRDRIRTGDGAPTALLATVGILLGPLQCFHGKVRT FVTVPGSGPARDLALHSPLVSPRRDAQGGGASSAVCGMPRAGVFWKQYRTVRSGLLPPRP VPAAAAAPACASRLLPQPGEMEPSQCVEELEDDVFQPEDGEPVTQPGSLLSADLFAQSLL DCPLSRLQLFPLTHCCGPGLRPTSQEDKATQTLSPASPSQGVMLPCGVTEEPQRLFYGNA GYRLPLPASFPAVLPIGEQPPEGQWQHQAEVQIARKLQCIADQFHRLHVQQGSVKGKPAT CMVALGKVFIPLSSENPVERPPPSPPALADKERTKTASSSTPQFPPPQASPGKWYLCGQG TSPNLHQQNQNRVWWQILLFLHNLALNGEENRNGAGPRWPVPLPALGALLAFYPIPGSTG AIEETQ >gi568815583r:39991790_40206086|GENSCAN_predicted_CDS_4|2361_bp atgattgtatatctagaaaaccccatcgtctcagccccaaatctccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatac accaataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttca aagagaataaaatacttaggaatccaacttacaagggatgtgagggacctcttcaaggag aactacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaacattcca tgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttat agattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaagct actttaaagttcatatggaaccaaaaaagagcctgcattgccaagtcaattctaagccaa aagaacaaagctggaggcatcacgctacctgacttcaaactatactacaaggctacagta accaaaacagcatggtactggtaccaaaacagacatatagaccaatggaacagaacagag ccctcagaaataatgccgcatatctacaactatctgatctttgacaaacctgacaaaaac aagcaatgggggaaggattccctatttaataaatggtgctgggaaaactgtctagccata tgtagaaagctgaaactggatcccttccttacaccttatacaaaaattaattcaagatgg attaaagacttacatgttagacctgaaaccataaaaaccctagaagaaaacctaggcaat accattcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaagcaatggca acaaaagccaaaattgacaaaagcgatctaattaaactaaagagcttctgcacagcaaaa gaaactaccatcagagtgaacaggcaacctacaggatgggagaaaatttttgcaacctac tcatctgacaaagggctaatatccagactctacaatgaactcaaacaaatttacaagaaa aaacaaacaaccccaccaaaaagtgggcgaagggaagtaccctttattcatccgagtaga gatgttgaagcctgtgttagggacagaatccgcactggcgacggcgctccgactgcgctt ctggcgacggtcggaattttgctcggccccttgcaatgtttccatgggaaggttcgtaca ttcgtgaccgtccctggcagcggcccagcccgggacttggcgcttcactcgccattggtc agtcctcggcgtgacgcgcaggggggcggggcctcatcagctgtttgcgggatgccccga gcaggcgtattttggaaacaataccgcaccgtgcggagtggcctcctcccgccccggcct 
gtgcccgccgccgccgccgcccctgcctgcgcctcccgcctcctgccgcagcccggagag atggagccatctcagtgtgtggaggagctggaggatgatgtgttccaaccagaggatggg gagccggtgacccaacccgggagcttgctctctgctgacctgtttgcccagagcctactg gactgccccctcagccgacttcagctcttccctctcacccactgctgtggccctggcctt cgacccaccagccaggaagacaaagctacccagactctcagcccagcctcccccagccaa ggtgtcatgctgccttgtggggtgactgaggaaccccagcgactcttttatggcaatgct ggctatcggcttcctctccctgccagtttcccagcagtcttgcccattggggagcagccc cccgaagggcagtggcaacatcaagcagaggtacagattgcccgaaagcttcagtgcatt gcagaccagttccaccggcttcatgtgcagcaaggcagcgtgaaggggaagccagcaacc tgcatggtggccctgggaaaggtttttattcccctgtccagtgagaaccctgtggagagg cccccgcccagtcctcctgccctggcagacaaagagagaacaaagaccgcctcttccagc accccccagtttccacccccccaggcctctcctgggaaatggtacctgtgtgggcagggc accagccccaatctgcaccagcagaaccaaaatcgtgtgtggtggcagatcctcctcttc ctgcacaaccttgctttgaatggagaagagaacaggaacggggcaggccctaggtggcca gtgcccctccctgctctgggagcattgctagccttctaccccatccctggatccacaggg gctatcgaggagacccagtga >gi568815583r:39991790_40206086|GENSCAN_predicted_peptide_5|540_aa MAPASAWLLVTEATRSFYFWWKMKKEQERERGVPHTFNDQISAPRAGCGRKPRRSVAQRK GLQQDEDLSQECRMAAVKKEGGALSEAMSLEGDEWELSKENVQPLRQGRIMSTLQGALAQ ESACNNTLQQQKRAFEYEIRFYTGNDPLDVWDRYISWTEQNYPQGGKESNMSTLLERAVE ALQGEKRYYSDPRFLNLWLKLGRLCNEPLDMYSYLHNQGIGVSLAQFYISWAEEYEAREN FRKADAIFQEGIQQKAEPLERLQSQHRQFQARVSRQTLLALEKEEEEEVFESSVPQRSTL AELKSKGKKTARAPIIRVGGALKAPSQNRGLQNPFPQQMQNNSRITVFDENADEASTAEL SKPTVQPWIAPPMPRAKENELQAGPWNTGRSLEHRPRGNTASLIAVPAVLPSFTPYVEET ARQPVMTPCKIEPSINHILSTRKPGKEEGDPLQRVQSHQQASEEKKEKMMYCKEKIYAGV GEFSFEEIRAEVFRKKLKEQREAELLTSAEKRAEMQKQIEEMEKKLKEIQTTQQERTGDQ >gi568815583r:39991790_40206086|GENSCAN_predicted_CDS_5|1620_bp atggcgccagcatctgcttggcttctggtgacggaggccacaaggagcttttacttctgg tggaagatgaagaaggagcaagagagagaaaggggcgtaccacatacttttaacgaccag atctcagctccgagggcaggttgcggaagaaagcccaggcggtctgtggcccagaggaaa ggcctgcagcaggacgaggacctgagccaggaatgcaggatggcggcggtgaagaaggaa gggggtgctctgagtgaagccatgtccctggagggagatgaatgggaactgagtaaagaa aatgtacaacctttaaggcaagggcggatcatgtccacgcttcagggagcactggcacaa 
gaatctgcctgtaacaatactcttcagcagcagaaacgggcatttgaatatgaaattcga ttttacactggaaatgaccctctggatgtttgggataggtatatcagctggacagagcag aactatcctcaaggtgggaaggagagtaatatgtcaacgttattagaaagagctgtagaa gcactacaaggagaaaaacgatattatagtgatcctcgatttctcaatctctggcttaaa ttagggcgtttatgcaatgagcctttggatatgtacagttacttgcacaaccaagggatt ggtgtttcacttgctcagttctatatctcatgggcagaagaatatgaagctagagaaaac tttaggaaagcagatgcgatatttcaggaagggattcaacagaaggctgaaccactagaa agactacagtcccagcaccgacaattccaagctcgagtgtctcggcaaactctgttggca cttgagaaagaagaagaggaggaagtttttgagtcttctgtaccacaacgaagcacacta gctgaactaaagagcaaagggaaaaagacagcaagagctccaatcatccgtgtaggaggt gctctcaaggctccaagccagaacagaggactccaaaatccatttcctcaacagatgcaa aataatagtagaattactgtttttgatgaaaatgctgatgaggcttctacagcagagttg tctaagcctacagtccagccatggatagcaccccccatgcccagggccaaagagaatgag ctgcaagcaggcccttggaacacaggcaggtccttggaacacaggcctcgtggcaataca gcttcactgatagctgtacccgctgtgcttcccagtttcactccatatgtggaagagact gcacgacagccagttatgacaccatgtaaaattgaacctagtataaaccacatcctaagc accagaaagcctggaaaggaagaaggagatcctctacaaagggttcagagccatcagcaa gcgtctgaggagaagaaagagaagatgatgtattgtaaggagaagatttatgcaggagta ggggaattctcctttgaagaaattcgggctgaagttttccggaagaaattaaaagagcaa agggaagccgagctattgaccagtgcagagaagagagcagaaatgcagaaacagattgaa gagatggagaagaagctaaaagaaatccaaactactcagcaagaaagaacaggtgatcag