GENSCAN 1.0	Date run: 7-Nov-116	Time: 01:31:18

Sequence gi568815595f:186684429_186889266 : 204838 bp : 43.78% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2901   2982   82  2  1  107   48    49 0.521   2.01
 1.02 Intr +   5498   5643  146  0  2   91    0   162 0.590   7.70
 1.03 Intr +   7337   7427   91  1  1   86   75     0 0.599  -1.93
 1.04 Intr +   7645   7893  249  2  0    3   77   335 0.370  21.31
 1.05 Intr +   9067   9087   21  2  0  118  111   -14 0.404   1.42
 1.06 Intr +  11817  11927  111  1  0   82  100    -3 0.116   0.65
 1.07 Intr +  17663  17808  146  0  2   91  -19   178 0.155   7.40
 1.08 Intr +  25873  25937   65  0  2   92   64    52 0.239   0.72
 1.09 Intr +  33150  33309  160  0  1  102  113   145 0.947  18.39
 1.10 Intr +  35677  35787  111  0  0   87   92   114 0.997  12.28
 1.11 Intr +  38009  38143  135  1  0  130   39    97 0.782   9.76
 1.12 Intr +  38286  38490  205  2  1   81   94    35 0.632   2.17
 1.13 Intr +  40660  40832  173  2  2   45   72   158 0.924   9.56
 1.14 Intr +  47117  47201   85  1  1   55  110    87 0.993   6.99
 1.15 Intr +  48074  48246  173  0  2   78   89   149 0.997  13.66
 1.16 Intr +  54671  54778  108  1  0   72   69    87 0.977   5.68
 1.17 Term +  57094  57903  810  0  0   66   28   406 0.985  25.59
 1.18 PlyA +  58259  58264    6                               1.05

 2.00 Prom +  59050  59089   40                              -4.96
 2.01 Init +  63983  64030   48  1  0   94  119    -8 0.793   3.85
 2.02 Intr +  68913  68945   33  2  0  127   72     3 0.644   1.02
 2.03 Intr +  69623  69695   73  1  1  105  115    18 0.975   5.18
 2.04 Intr +  77012  77185  174  0  0   40   54   149 0.471   6.61
 2.05 Intr +  79766  79790   25  0  1  116   52    -7 0.417  -4.42
 2.06 Intr +  84812  84875   64  2  1   83   96    29 0.569   1.92
 2.07 Term +  87996  88421  426  2  0   93   44   253 0.945  16.70
 2.08 PlyA +  94612  94617    6                               1.05

 3.00 Prom +  99116  99155   40                               0.74
 3.01 Init + 100026 100049   24  2  0   63   94    37 0.882   1.34
 3.02 Intr + 100136 100268  133  1  1   74   93   125 0.998  11.82
 3.03 Intr + 100534 100673  140  2  2   86   72    88 0.824   7.18
 3.04 Intr + 102074 102217  144  1  0   62   78   103 0.965   7.18
 3.05 Intr + 102699 102836  138  2  0   72   99   135 0.999  13.66
 3.06 Intr + 103067 103156   90  1  0   48  111    59 0.952   4.39
 3.07 Intr + 103375 103454   80  0  2   75   94     6 0.966  -1.75
 3.08 Term + 104697 104841  145  0  1   89   31   122 0.975   3.98
 3.09 PlyA + 105316 105321    6                               1.05

 4.10 PlyA - 105490 105485    6                               1.05
 4.09 Term - 105636 105541   96  0  0   75   39    74 0.959  -0.83
 4.08 Intr - 105827 105714  114  2  0   55   97    96 0.955   7.84
 4.07 Intr - 105978 105898   81  0  0   54   98    76 0.966   5.03
 4.06 Intr - 107422 107297  126  1  0   99   78    62 0.990   7.18
 4.05 Intr - 108182 108062  121  1  1  100   76    89 0.983   9.40
 4.04 Intr - 108519 108376  144  2  0   53   88    85 0.800   4.40
 4.03 Intr - 110353 110230  124  2  1   12   89   104 0.445   2.54
 4.02 Intr - 122556 122320  237  1  0   41  115   139 0.887   9.49
 4.01 Init - 126381 126312   70  0  1   81  105    61 0.912   8.31
 4.00 Prom - 131086 131047   40                              -5.06

 5.00 Prom + 131398 131437   40                              -4.16
 5.01 Init + 133204 133262   59  0  2   51   77    69 0.394   2.88
 5.02 Intr + 140553 140782  230  0  2   72   38   107 0.291   1.61
 5.03 Intr + 142124 142194   71  0  2   71  105   120 0.151  10.90
 5.04 Intr + 142619 142716   98  1  2   48   92    64 0.712   1.61
 5.05 Term + 142789 143134  346  1  1   88   39   167 0.889   5.77
 5.06 PlyA + 144837 144842    6                               1.05

 6.00 Prom + 166227 166266   40                              -5.96
 6.01 Init + 168610 168844  235  0  1   46  110   203 0.749  14.60
 6.02 Term + 169756 170276  521  2  2   80   43   645 0.999  53.66
 6.03 PlyA + 172428 172433    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:186684429_186889266|GENSCAN_predicted_peptide_1|956_aa ENATVYYLALDVQESDCRVLSRKHWNDFSSALANTKDSPVLLDSLKDTELYRKQANKALV KYKGENDDFPSFGVDQTCTIMNCVQPHLTYPLHPGGREHSPIAKPPLSFILMTIIPMDTI SKDPVPMHTTTMAVIALTMDPVTHHPIAKVPKVTIAIATAHHLGTQKDEVKVKDTVPSIA NCICVRTPSPKDSLVSGMEQDPSVMQILGPAIRQSRLEDLFYGQLLHRKAWMKVSSALAN TKDSPVLLDSLKDTELYRKQANKALVKYKGENDDFPSFGVDQTFYVIMAKKAMNLCKYSV DFHRLLLSLTQESQSEEIDCNDKDLFKAVDAALKKYNSQNQSNNQFVLYRITEATKTVGS DTFYSFKYEIKEGDCPVQSGKTWQDCEYKDAAKAATGECTATVGKRSSTKFSVATQTCQI TPGGWFILWHCPGGKLTISDYPNSSRVLHCTQCVQGLNRRTRRHKTCPPLQAASQIEERI GFEGRPGLESFRAPCLLASPFPCLEKGAEGPVVTAQYDCLGCVHPISTQSPDLEPILRHG IQYFNNNTQHSSLFMLNEVKRAQRQDTGECTDNAYIDIQLRIASFSQNCDIYPGKDFVQP PTKICVGCPRDIPTNSPELEETLTHTITKLNAENNATFYFKIDNVKKARVQVVAGKKYFI DFVARETTCSKESNEELTESCETKKLGISLMKRPPGFSPFRSSRIGEIKEETTVSPPHTS MAPAQDEERDSGKEQGHTRRHDWGHEKQRKHNLGHGHKHERDQGHGHQRGHGLGHGHEQQ HGLGHGHKFKLDDDLEHQGGHVLDHGHKHKHGHGHGKHKNKGKKNGKHNGWKTEHLASSS EDSTTPSAQTQEKTEGPTPIPSLAKPGVTVTFSDFQDSDLIATMMPPISPAPIQSDDDWI PDIQIDPNGLSFNPISDFPDTTSPKCPGRPWKSVSEINPTTQMKESYYFDLTDGLS >gi568815595f:186684429_186889266|GENSCAN_predicted_CDS_1|2871_bp gaaaatgcaactgtgtattatttagccttagatgttcaagaatctgactgccgggtccta tccaggaaacactggaatgacttctcttcagcattggccaataccaaggatagtcccgtc ctcttagattccctcaaggataccgagctctacagaaaacaagccaacaaagcccttgtg aagtataaaggagagaatgatgactttccctctttcggagtggaccaaacatgtacgatc atgaattgcgtgcaaccccatttgacctatccgctccatcctggtgggcgtgaacattct ccgattgccaagcctccattgagcttcatcctcatgaccatcatccccatggacaccatc tccaaggaccctgttcctatgcacaccaccaccatggccgtgattgctttgacgatggac cctgtaacccaccaccctatagccaaggtccccaaggtcaccattgccattgccacggcc caccacctgggcactcagaaggacgaggtcaaggtaaaggacactgtcccttccattgca aactgcatctgtgtacggactccctcccctaaggactctctagtctcaggtatggagcag gacccatctgtcatgcagatcttaggacctgctatcagacagagtaggttagaggattta ttttatggccagctcctacacagaaaggcctggatgaaagtctcttcagcattggccaat 
accaaggatagtcccgtcctcttagattccctcaaggataccgagctctacagaaaacaa gccaacaaagcccttgtgaagtacaaaggagagaatgatgactttccctctttcggagtg gaccaaactttttatgtcatcatggctaaaaaggctatgaatctttgcaaatactctgta gactttcacaggctgctactaagtttaacccaggaatcacagtccgaggaaattgactgc aatgacaaggatttatttaaagctgtggatgctgctctgaagaaatataacagtcaaaac caaagtaacaaccagtttgtattgtaccgcataactgaagccactaagacggttggctct gacacgttttattccttcaagtacgaaatcaaggagggggattgtcctgttcaaagtggc aaaacctggcaggactgtgagtacaaggatgctgcaaaagcagccactggagaatgcacg gcaaccgtggggaagaggagcagtacgaaattctccgtggctacccagacctgccagatt actccaggtggctggtttatcctctggcactgccctggtgggaaattaaccatttctgac tatcccaactcttcccgtgtgctgcattgtacccagtgtgtgcaagggctcaacagaagg accagaagacataagacctgccctccattgcaggcagcatcccagattgaggagcgcata ggctttgagggcagacctggcctggaatctttcagagctccttgccttcttgcctctcct tttccttgtctagaaaagggagccgagggccctgtggtgacagcccagtacgactgcctc ggctgtgtgcatcctatatcaacgcagagcccagacctggagcccattctgagacacggc attcagtactttaacaacaacactcaacattcctccctcttcatgcttaatgaagtaaaa cgggcccaaagacaggataccggtgaatgtacagataatgcatacatcgatattcagcta cgaattgcttccttctcacagaactgtgacatttatccagggaaggattttgtacaacca cctaccaagatttgcgtgggctgccccagagatatacccaccaacagcccagagctggag gagacactgactcacaccatcacaaagcttaatgcagagaataacgcaactttctatttc aagattgacaatgtgaaaaaagcaagagtacaggtggtggctggcaagaaatattttatt gacttcgtggccagggaaaccacatgttccaaggaaagtaatgaagagttgaccgaaagc tgtgagaccaaaaaacttggcatctcactgatgaaaaggcctccaggtttttcacctttc cgatcatcacgaataggggaaataaaagaagaaacaactgtaagtccaccccacacttcc atggcacctgcacaagatgaagagcgggattcaggaaaagaacaagggcatactcgtaga catgactggggccatgaaaaacaaagaaaacataatcttggccatggccataaacatgaa cgtgaccaagggcatgggcaccaaagaggacatggccttggccatggacacgaacaacag catggtcttggtcatggacataagttcaaacttgatgatgatcttgaacaccaagggggc catgtccttgaccatggacataagcataagcatggtcatggccacggaaaacataaaaat aaaggcaaaaagaatggaaagcacaatggttggaaaacagagcatttggcaagctcttct gaagacagtactacaccttctgcacagacacaagagaagacagaagggccaacacccatc ccttccctagccaagccaggtgtaacagttaccttttctgactttcaggactctgatctc 
attgcaactatgatgcctcctatatcaccagctcccatacagagtgatgacgattggatc cctgatatccagatagacccaaatggcctttcatttaacccaatatcagattttccagac acgacctccccaaaatgtcctggacgcccctggaagtcagttagtgaaattaatccaacc acacaaatgaaagaatcttattatttcgatctcactgatggcctttcttaa >gi568815595f:186684429_186889266|GENSCAN_predicted_peptide_2|280_aa MVGEESDVLLPRCSDLGTAASQRINAQIVFPLSEIHNHEPGCLLLVLTRRHASKNRNEIV VKLLEGGVDPHAEDHCEATAMHRAAAKGNLKMIPILLYYRTSTNIQDTEGKTQRPSRRKG VPIQTPREDSWISHKKDFRGLSKAEVFLCARRSHSCQLGWRRVPPALVGGGQSWPGGRVP WGVAQTVCPERPRRERRLRGEREEEAADKVMARRWRKSRRRKRKEMEERMSPEETEGTNF DFAGEAVGQAERKGPAFSAAEESFHEEEKRRRKEQSDLTF >gi568815595f:186684429_186889266|GENSCAN_predicted_CDS_2|843_bp atggtaggtgaggagtctgatgtcttattgcctagatgcagtgatctgggaactgctgcc tcccagaggataaatgcgcagatagtctttccattatccgaaattcacaaccatgaacct ggctgtttgttgttggtgctgactcgcaggcatgcttccaaaaacaggaatgagattgtt gtcaagttactagaaggcggggttgatccacatgctgaggaccattgtgaggctacagca atgcaccgggcagcagccaagggtaacttgaagatgattcctatccttctatactacaga acatccacgaacatccaagacactgaggggaagactcaaagaccttctaggaggaagggg gtcccgatccagaccccaagagaggattcttggatctcgcacaagaaagatttcaggggt ctttcaaaagccgaggttttcctgtgcgctcggaggagccatagctgccagctgggctgg cggcgggtgcccccagcccttgtgggcggcgggcagagctggcctgggggccgggtcccg tggggggtcgcgcagacagtgtgtccggagcgcccccggcgggagcgcaggctgcggggc gagagggaagaggaggcggcggataaggtgatggcgagaagatggaggaagagcagacga aggaagagaaaggagatggaagagagaatgtcaccagaggagaccgaaggaacaaatttt gactttgcaggagaagctgtgggccaggcagaaagaaaaggaccagctttttctgcagct gaagaaagtttccatgaggaagaaaaacggaggcgaaaggagcagagcgacctgaccttc tga >gi568815595f:186684429_186889266|GENSCAN_predicted_peptide_3|297_aa MDPDGVIESNWNEIVDNFDDMNLKESLLRGIYAYGFEKPSAIQQRAIIPCIKGYDVIAQA QSGTGKTATFAISILQQLEIEFKETQALVLAPTRELAQQVVLLSATMPTDVLEVTKKFMR DPIRILVKKEELTLEGIKQFYINVEREEWKLDTLCDLYETLTITQAVIFLNTRRKVDWLT EKMHARDFTVSALHGDMDQKERDVIMREFRSGSSRVLITTDLLARGIDVQQVSLVINYDL PTNRENYIHRIGRGGRFGRKGVAINFVTEEDKRILRDIETFYNTTVEEMPMNVADLI >gi568815595f:186684429_186889266|GENSCAN_predicted_CDS_3|894_bp 
atggaccccgatggtgtcatcgagagcaactggaatgagattgttgataactttgatgat atgaatttaaaggagtctctccttcgtggcatctatgcttacggttttgagaagccttcc gctattcagcagagagctattattccctgtattaaagggtatgatgtgattgctcaagct cagtcaggtactggcaagacagccacatttgctatttccatcctgcaacagttggagatt gagttcaaggagacccaagcactagtattggcccccaccagagaactggctcaacaggtt gtgttgctttctgccacaatgccaactgatgtgttggaagtgaccaaaaaattcatgaga gatccaattcgaattctggtgaaaaaggaagaattgacccttgaaggaatcaaacagttt tatattaatgttgagagagaggaatggaagttggatacactttgtgacttgtacgagaca ctgaccattacacaggctgttatttttctcaatacgaggcgcaaggtggactggctgact gagaagatgcatgccagagacttcacagtttctgctctgcatggtgacatggaccagaag gagagagatgttatcatgagggaattccggtcagggtcaagtcgtgttctgatcactact gacttgttggctcgcgggattgatgtgcaacaagtgtctttggttataaattatgatcta cctaccaatcgtgaaaactatattcacagaattggcagagggggtcgatttgggaggaaa ggtgtggctataaactttgttactgaagaagacaagaggattcttcgtgacattgagact ttctacaatactacagtggaggagatgcccatgaatgtggctgaccttatttaa >gi568815595f:186684429_186889266|GENSCAN_predicted_peptide_4|370_aa MAPGCSASGEGLMLPYDMEEKQKDTIRSKSHVSSRGQGKASSDVSALKAHAFCSATKQDK EDRDLHLRRPAPTKIAGSGPSVDRQGSRGPRAAAPPSLLSLPDRPELFRLRVLELNASDE RGIQVVREKVKNFAQLTVSGSRSDGKPCPPFKIVILDEADSMTSAAQAALRRTMEKESKT TRFCLICNYVSRIIEPLTSRCSKFRFKPLSDKIQQQRLLDIAKKENVKISDEGIAYLVKV SEGDLRKAITFLQSATRLTGGKEITEKVITDIAGVIPAEKIDGVFAACQSGSFDKLEAVV KDLIDEGHAATQLVNQLHDVVVENNLSDKQKSIITEKLAEVDKCLADGADEHLQLISLCA TVMQQLSQNC >gi568815595f:186684429_186889266|GENSCAN_predicted_CDS_4|1113_bp atggcgccaggctgttcagcttctggtgagggcctcatgctgccttatgacatggaggag aagcagaaagacactattagaagtaaaagccacgtgtcctcaagagggcaaggcaaagca tcttcagatgtctctgcccttaaagcacacgcgttctgctctgcgacgaagcaggacaag gaggacagggacctgcacctccggaggcccgcacctacgaagatagcgggctcgggacct tcggtggaccggcagggttccagaggcccgcgcgccgccgccccgccctcattgctgagc ctgccagataggcctgaacttttccgattaagagttcttgagttaaatgcatctgatgaa cgtggaatacaagtagttcgagagaaagtgaaaaattttgctcaattaactgtgtcagga agtcgctcagatgggaagccgtgtccgccttttaagattgtgattctggatgaagcagat tctatgacctcagctgctcaggcagctttaagacgtaccatggagaaggagtcgaaaacc 
acccgattctgtcttatctgtaactatgtcagtcgaataattgaacccctgacctctaga tgttcaaaattccgcttcaagcctctgtcagataaaattcaacagcagcgattactagac attgccaagaaggaaaatgtcaaaattagtgatgagggaatagcttatcttgttaaagtg tcagaaggagacttaagaaaagccattacatttcttcaaagcgctactcgattaacaggt ggaaaggagatcacagagaaagtgattacagacattgccggggtaataccagctgagaaa attgatggagtatttgctgcctgtcagagtggctcttttgacaaactagaagctgtggtc aaggatttaatagatgagggtcatgcagcaactcagctcgtcaatcaactccatgatgtg gttgtagaaaataacttatctgataaacagaagtctattatcacagaaaaacttgccgaa gttgacaaatgcctagcagatggtgctgatgaacatttgcaactcatcagcctttgtgca actgtgatgcagcagttatctcagaattgttaa >gi568815595f:186684429_186889266|GENSCAN_predicted_peptide_5|267_aa MVSDANSEGSKNSAGIYSGCPGIIRSTEAMDTQGMVKVVAPSVLPFLPHRAFSKIDGHSS VLTAPGIRGPQGWMTSLKRNLSGFPAAAPKSRLEDEGCSSREFSSMDTFEGYEGAEDMEK LDDSHETCSSKSAFLERRESTQGEKEERDEEATATGSTKKERAGASTPADSCGGDGWFPG EGAAPEDLHPGELPSLRSPRLPGSRAPAFPREAMWGPGPCPATQKAAGAAAAAERGRPGP PPGSPRGALRARLAPAARRRTSRDGDP >gi568815595f:186684429_186889266|GENSCAN_predicted_CDS_5|804_bp atggtgagtgatgcaaacagcgagggctctaagaattcagcagggatctacagtggatgc cccggcatcatccggagcacagaggccatggacactcaggggatggtcaaggtcgtagct ccctccgttcttccgttcctccctcacagagctttcagcaagatagatggtcattcttct gtcctcactgctcccggcataaggggaccccagggctggatgacttccctaaagaggaat ctctcagggtttcctgcagccgctcccaagtcccgtctggaggatgaaggctgctccagc agggagtttagctccatggacacctttgaaggctatgaaggtgcagaagacatggaaaag ttggatgacagtcacgaaacctgttcctccaaatccgctttcctggagaggagagaaagc acccagggagaaaaagaagaacgcgatgaggaagcaacggcgaccggcagcaccaagaag gagcgtgctggggcgtccacgccggctgactcctgcgggggcgacggctggtttccaggc gagggcgcggcgcccgaggacctccaccccggagagctgccctccctgcggtcgccccgg ctccccggctccagagcgcccgcattcccgagagaggcgatgtgggggcccgggccctgc ccagctacgcagaaagcagccggggccgcggcggcggcagaaaggggacgcccagggccg cctcccgggagcccgaggggtgccctgcgtgctcgtctagctcccgccgcccggcgaagg acctcgcgggacggggacccctag >gi568815595f:186684429_186889266|GENSCAN_predicted_peptide_6|251_aa MWIPGLRMLLLGAVLLLLALPGHDQETTTQGPGVLLPLPKGACTGWMAGIPGHPGHNGAP GRDGRDGTPGEKGEKGDPGLIGPKGDIGETGVPGAEGPRGFPGIQGRKGEPGEGAYVYRS 
AFSVGLETYVTIPNMPIRFTKIFYNQQNHYDGSTGKFHCNIPGLYYFAYHITVYMKDVKV SLFKKDKAMLFTYDQYQENNVDQASGSVLLHLEVGDQVWLQVYGEGERNGLYADNDNDST FTGFLLYHDTN >gi568815595f:186684429_186889266|GENSCAN_predicted_CDS_6|756_bp atgtggattccagggctcaggatgctgttgctgggagctgttctactgctattagctctg cccggtcatgaccaggaaaccacgactcaagggcccggagtcctgcttcccctgcccaag ggggcctgcacaggttggatggcgggcatcccagggcatccgggccataatggggcccca ggccgtgatggcagagatggcacccctggtgagaagggtgagaaaggagatccaggtctt attggtcctaagggagacatcggtgaaaccggagtacccggggctgaaggtccccgaggc tttccgggaatccaaggcaggaaaggagaacctggagaaggtgcctatgtataccgctca gcattcagtgtgggattggagacttacgttactatccccaacatgcccattcgctttacc aagatcttctacaatcagcaaaaccactatgatggctccactggtaaattccactgcaac attcctgggctgtactactttgcctaccacatcacagtctatatgaaggatgtgaaggtc agcctcttcaagaaggacaaggctatgctcttcacctatgatcagtaccaggaaaataat gtggaccaggcctccggctctgtgctcctgcatctggaggtgggcgaccaagtctggctc caggtgtatggggaaggagagcgtaatggactctatgctgataatgacaatgactccacc ttcacaggctttcttctctaccatgacaccaactga