GENSCAN 1.0	Date run: 2-Nov-116	Time: 21:18:05

Sequence gi568815591r:32769348_32979604 : 210257 bp : 42.87% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1535   1673  139  2  1   45   60   127 0.501   5.15
 1.02 Intr +   5855   5932   78  1  0   50   61    82 0.242   0.53
 1.03 Intr +  13725  13857  133  2  1   56   38    77 0.029  -1.20
 1.04 Term +  16486  16967  482  2  2   79   37   130 0.183   0.97
 1.05 PlyA +  18819  18824    6                               1.05

 2.00 Prom +  29700  29739   40                              -5.35
 2.01 Init +  32839  32932   94  0  1   75   11   155 0.785   5.13
 2.02 Intr +  32981  33111  131  0  2   50   68    97 0.544   3.39
 2.03 Intr +  43080  43200  121  2  1   33   97    60 0.380   0.55
 2.04 Intr +  48130  48244  115  2  1   83   33    99 0.583   2.49
 2.05 Intr +  51370  51394   25  1  1   90   88    35 0.545   0.81
 2.06 Intr +  53703  53874  172  2  1   54   98    92 0.820   5.39
 2.07 Intr +  55729  56380  652  2  1   78   39   272 0.460  11.34
 2.08 Intr +  58047  58170  124  0  1   57   89    48 0.331   1.47
 2.09 Intr +  65494  65572   79  0  1   56  110    81 0.238   5.31
 2.10 Intr +  71799  71905  107  1  2   55  109   136 0.921  11.41
 2.11 Term +  73483  73590  108  0  0  107   43    83 0.787   3.23
 2.12 PlyA +  75032  75037    6                               1.05

 3.04 PlyA -  75234  75229    6                               1.05
 3.03 Term -  76823  76786   38  0  2  131   42    31 0.019  -0.68
 3.02 Intr -  86734  86616  119  0  2   61   54   104 0.716   3.49
 3.01 Init -  87141  87014  128  0  2   90   72   111 0.855   9.38
 3.00 Prom -  88796  88757   40                              -3.05

 4.00 Prom +  97204  97243   40                              -5.15
 4.01 Init +  97264  97413  150  0  0  104   58   106 0.551   9.29
 4.02 Term +  97809  97871   63  2  0   48   38    90 0.553  -3.09
 4.03 PlyA +  98386  98391    6                              -0.45

 5.04 PlyA -  98844  98839    6                               1.05
 5.03 Term - 101533  99998 1536  1  0  139   40  1134 0.998 103.17
 5.02 Intr - 105810 105645  166  2  1  103  115    91 0.998  12.34
 5.01 Init - 110257 110088  170  1  2   68  111    81 0.990   7.45
 5.00 Prom - 115613 115574   40                              -6.05

 6.03 PlyA - 116912 116907    6                               1.05
 6.02 Term - 123377 123041  337  1  1    9   42   271 0.216   7.66
 6.01 Init - 138419 138406   14  2  2  108   75     7 0.052   1.26
 6.00 Prom - 143572 143533   40                              -5.15

 7.02 PlyA - 144042 144037    6                               1.05
 7.01 Sngl - 145447 145235  213  1  0   79   52   239 0.344  14.33
 7.00 Prom - 147398 147359   40                             -10.55

 8.10 PlyA - 147491 147486    6                               1.05
 8.09 Term - 148195 147997  199  0  1   46   48   253 0.990  12.99
 8.08 Intr - 149025 148964   62  0  2   96  100    84 0.945   7.01
 8.07 Intr - 152083 151992   92  2  2  100   86    88 0.922   8.59
 8.06 Intr - 159344 159209  136  2  1   77   -5    93 0.742  -1.88
 8.05 Intr - 160422 160270  153  0  0   77   61   152 0.816  10.75
 8.04 Intr - 168624 168428  197  1  2   70   64   136 0.856   7.61
 8.03 Intr - 170079 169902  178  0  1   77   24   139 0.668   4.97
 8.02 Intr - 173201 173033  169  1  1  -82   38   222 0.355  -0.77
 8.01 Init - 173674 173523  152  1  2   93   70   141 0.990  10.36
 8.00 Prom - 177856 177817   40                              -6.75

 9.00 Prom + 186928 186967   40                              -9.15
 9.01 Init + 188227 188447  221  0  2   90   86   431 0.918  39.25
 9.02 Intr + 197699 197746   48  2  0   76   91    49 0.029   0.88
 9.03 Intr + 205270 205415  146  1  2   64   52   215 0.960  14.51
 9.04 Intr + 205835 206024  190  0  1   91  107   212 0.965  21.12
 9.05 Intr + 207007 207152  146  1  2   71  115   185 0.998  18.51
 9.06 Term + 210076 210221  146  2  2   36   40   145 0.317   1.59
 9.07 PlyA + 210225 210230    6                              -3.64

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 197706 197746   41  2  2  105   91    30 0.897   4.71

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:32769348_32979604|GENSCAN_predicted_peptide_1|277_aa
XAGKSANYSQCRLRLRSRQQTPQQVHLVIKPTYAGGKNLMPYLHWGQEVNCSGDPEMQKK
LLTLWTAPPMKKRHFGHNHLASLQVANFSSSSCLLLKLRNSSNLCPLPSSNAGSTFLETL
WLLGHTARPICLSSRKRVESYSKLCQGLGGMVRQGEMKGQKTGGTGGDSHRPQADQGRMQ
TAEFANVVLQLGREERKKGAGKYPLLSHIMGQDVRKPKFKTLAPHTGYVTKAFCLCFLDL
LFLVCNMQMLILAVCGDMPMKLTEMKHRKVLSTVLDT

>gi568815591r:32769348_32979604|GENSCAN_predicted_CDS_1|834_bp
nnagctggtaaaagtgctaactatagccaatgccgattgaggcttcgttccaggcaacaa
acacctcagcaggttcacctggtgattaaacccacctatgctgggggcaagaacctgatg
ccctacttacactggggacaggaagtgaactgctcaggtgatccagagatgcagaagaaa
cttctgaccctatggacagctccccccatgaaaaagagacattttggtcacaaccattta
gccagtctacaagttgcaaacttttcttcatcttcctgtcttcttctgaaacttcgaaat
tcttccaatctctgcccattacccagttccaatgctggatccacatttttagaaacactg
tggctccttggacacactgcaaggcctatttgtctctcctcaaggaaaagggtggaaagc
tacagcaaactttgccagggactgggagggatggtgaggcagggagagatgaaggggcaa
aagacaggtgggacaggaggtgacagccatagaccacaagcggatcagggaaggatgcag
acggcagaatttgcaaatgttgtactgcagcttggaagagaggagagaaagaagggggca
gggaaatatccactgctgtcccacatcatggggcaggatgtcaggaagcccaagttcaaa
acccttgccccacatactggctatgtgactaaggcattttgcttgtgctttctggatctc
cttttcctggtctgcaacatgcagatgctgatattagccgtatgtggtgatatgccaatg
aaattaactgaaatgaagcataggaaagtgcttagcacagttcttgacacataa

>gi568815591r:32769348_32979604|GENSCAN_predicted_peptide_2|575_aa
MALGSPAPVALQSTALQLFSWADVEWLFQVHAPLDSAPVGSLYGGSSPTFAFHTALGEVL
HEGSASAADFCLDIQPRAAGRLQQGRCCAACFTRFSTVSWENPREKAFGMSMSVEEHSTS
LIHPSDVIQYNLTVALHIIYTRRTQFYTSVQQGKKHQFDHGELFGWLFCKVHPGAVVFAI
LAAMSIPGSANLQTQWNTVGEFSNLPQEFIEWIKYSTKPVLEVLARAIRQEKEIKGIQLG
KEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTES
QIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRIN
TVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGIT
LSDFKLYYKATVTKTACARTKIVYSMYSQKAAEEVKRELIVKSELLHSRKVMVCKKIQLK
TDTARSPQKPIGPSQMLWVTLTVEGLLRRSGSMSLQPLSREGVSQRALGQSSRGSGSGCQ
KMKGIEIKRRERLKCSTKIERRKRLTDSEGSWRRE

>gi568815591r:32769348_32979604|GENSCAN_predicted_CDS_2|1728_bp
atggccttgggcagccctgcccctgtggctctgcagagtacagccctgcagctgttctca
tgggctgatgttgagtggcttttccaggttcatgctccactagacagtgccccagtgggg
agtctgtatgggggctccagccccacatttgccttccacactgccttaggagaggttctc
catgagggctctgcctctgcagcagacttctgcctggacatccagcccagagctgctggt
aggctccagcaagggagatgctgtgcggcctgtttcaccaggtttagcacagtaagctgg
gaaaacccaagggaaaaagcctttgggatgtcgatgagtgtggaagaacacagcacatca
cttattcatccttcagatgttatccagtacaatctgacagtagccctgcacataatctac
actcgaaggacacagttttatacttcagttcagcagggaaaaaaacaccagtttgatcat
ggagagctatttggatggctcttttgcaaagtacatcctggtgctgttgtttttgctata
ttagcagcaatgtcaataccaggttcagcaaatctgcaaacccagtggaatactgtaggg
gagttcagcaacttgccacaagaatttatagaatggatcaaatatagtactaaaccagtg
ttggaagttctggccagggcaattaggcaggagaaggaaataaagggtattcaattagga
aaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatctagaaaacccc
attgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatac
aaaatcaatgtacaaaaatcacaagcattcttatacaccaataacagacaaacagagagc
caaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatc
caacttacaagggacgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaa
ataaaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaat
accgtgaaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaag
ctaccaatgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaa
aaaagggcccgcatcgccaagtcaatcttaagccaaaagaacaaagctggaggcatcaca
ctatctgacttcaaactatactacaaggctacagtaaccaaaacagcatgtgccagaaca
aaaatagtatactcaatgtatagtcagaaagcagctgaagaagtgaagcgagaactgata
gttaaaagtgaactattacattctagaaaagtcatggtgtgtaagaagatccaactgaag
actgacactgcccgatcgcctcagaagcctataggaccttcacagatgctctgggtaacc
ctcacagtggaggggcttctgaggcgatcgggcagcatgagtcttcaaccgctaagccga
gaaggagtcagtcagagagccttgggccagagttccaggggctccgggagtggctgccag
aaaatgaaaggaattgaaattaagagaagggagagattgaagtgtagcaccaagattgaa
aggagaaagaggttgacggatagtgaaggaagttggagaagagagtaa

>gi568815591r:32769348_32979604|GENSCAN_predicted_peptide_3|94_aa
MMAVTVTQCFQLSQTSVQVCDMLRLELAVFGGLKGKQAACGAGAWQSQAAYPARTHSKTR
ALNQSAFDKTCMSLNLTVDLLDGLPSDSNQFMAI

>gi568815591r:32769348_32979604|GENSCAN_predicted_CDS_3|285_bp
atgatggcagtaactgtgacccagtgcttccagttgtcccaaacctcagtgcaagtctgt
gacatgctgcgtcttgagttggcagtgtttggagggcttaaaggaaaacaggctgcctgc
ggtgcaggagcatggcaaagccaggctgcctaccctgctcggacccactccaagacacgt
gctcttaaccagtctgcttttgataaaacttgtatgtccttgaacttaactgtggatctc
cttgatggactgccatctgactcaaaccagttcatggctatctga

>gi568815591r:32769348_32979604|GENSCAN_predicted_peptide_4|70_aa
MIQSPPTRPYLQHWELQSDMILGGDTEPNRINGHAHVPVSVVVGNAWCSLGDAAVNKTIN
TDHFVEKHED

>gi568815591r:32769348_32979604|GENSCAN_predicted_CDS_4|213_bp
atgatccaatcacctcccaccaggccctacctccaacactgggaattacaatccgacatg
attttgggtggggacacagagccaaaccgtatcaatggccatgctcatgttccagtttct
gtggttgtaggcaatgcttggtgttccctgggggatgcagccgtaaataagacaatcaac
actgatcattttgtagaaaagcatgaagactaa

>gi568815591r:32769348_32979604|GENSCAN_predicted_peptide_5|623_aa
MSTQDERQINTEYAVSLLEQLKLFYEQQLFTDIVLIVEGTEFPCHKMVLATCSSYFRAMF
MSGLSESKQTHVHLRNVDAATLQIIITYAYTGNLAMNDSTVEQLYETACFLQVEDVLQRC
REYLIKKINAENCVRLLSFADLFSCEELKQSAKRMVEHKFTAVYHQDAFMQLSHDLLIDI
LSSDNLNVEKEETVREAAMLWLEYNTESRSQYLSSVLSQIRIDALSEVTQRAWFQGLPPN
DKSVVVQGLYKSMPKFFKPRLGMTKEEMMIFIEASSENPCSLYSSVCYSPQAEKVYKLCS
PPADLHKVGTVVTPDNDIYIAGGQVPLKNTKTNHSKTSKLQTAFRTVNCFYWFDAQQNTW
FPKTPMLFVRIKPSLVCCEGYIYAIGGDSVGGELNRRTVERYDTEKDEWTMVSPLPCAWQ
WSAAVVVHDCIYVMTLNLMYCYFPRSDSWVEMAMRQTSRSFASAAAFGDKIFYIGGLHIA
TNSGIRLPSGTVDGSSVTVEIYDVNKNEWKMAANIPAKRYSDPCVRAVVISNSLCVFMRE
THLNERAKYVTYQYDLELDRWSLRQHISERVLWDLGRDFRCTVGKLYPSCLEESPWKPPT
YLFSTDGTEEFELDGEMVALPPV

>gi568815591r:32769348_32979604|GENSCAN_predicted_CDS_5|1872_bp
atgtccactcaagacgagaggcagatcaatactgaatatgctgtgtcattgttggaacag
ttgaaactgttttatgaacagcagttgtttactgacatagtgttaattgttgagggcact
gaattcccttgtcataagatggttcttgcaacatgtagctcttatttcagggccatgttt
atgagtggactaagtgaaagcaaacaaacccatgtacacctgaggaatgtcgatgctgcc
accttacagataataataacttatgcatacacgggtaacttggcaatgaatgacagcact
gtagaacagctttatgaaacagcttgcttcctacaggtagaagatgtgttacaacgttgt
cgagaatatttaattaaaaaaataaatgcagagaattgtgtacgattgttgagttttgct
gatctcttcagttgtgaggaattaaaacagagtgctaaaagaatggtggagcacaagttc
actgctgtgtatcatcaggacgcgttcatgcagctgtcacatgacctactgatagatatt
ctcagtagtgacaatttaaatgtagaaaaggaagaaaccgttcgagaagctgctatgctg
tggctagagtataacacagaatcacgatcccagtatttgtcttctgttcttagccaaatc
agaattgatgcactttcagaagtaacacagagagcttggtttcaaggtctgccacccaat
gataagtcagtggtggttcaaggtctgtataagtccatgcccaagtttttcaaaccaaga
cttgggatgactaaagaggaaatgatgattttcattgaagcatcttcagaaaatccttgt
agtctttactcttctgtctgttacagcccccaagcagaaaaagtttacaagttatgtagc
ccaccagctgatttgcataaggttgggaccgttgtaactcctgataatgatatctacata
gcagggggtcaagttcctctgaaaaacacaaaaacaaatcacagtaaaacaagcaaactt
cagactgccttcagaactgtgaattgcttttattggtttgatgcacagcaaaatacctgg
tttccaaagaccccaatgctttttgtccgcataaagccatctttggtttgctgtgaaggc
tatatctatgcaattggaggagatagcgtaggtggagaacttaatcggaggaccgtagaa
agatacgacactgagaaagatgagtggacgatggtaagccctttaccttgtgcttggcaa
tggagtgcagcagttgtggttcatgactgcatttatgtgatgacactgaacctcatgtac
tgttattttccaaggtctgactcatgggtagaaatggccatgagacagactagtaggtcc
tttgcttcagctgcagcttttggtgataaaattttctatattggagggttgcatattgct
accaattccggcataagactcccctctggcactgtagatgggtcttcagtaactgtggaa
atttatgatgtgaataaaaatgagtggaaaatggcagccaacatccctgctaagaggtac
tctgacccctgtgttagagctgttgtgatctcaaattctctatgtgtgtttatgcgagaa
acccacttaaatgagcgagctaaatacgtcacctaccaatatgacctggaacttgaccgg
tggtctctgcggcagcatatatctgaacgtgtactgtgggacttggggagagattttcga
tgcactgtggggaaactctatccatcctgccttgaagagtctccatggaaaccaccaact
tatcttttttcaacggatgggacagaagagtttgaactggatggagaaatggttgcacta
ccacctgtatag

>gi568815591r:32769348_32979604|GENSCAN_predicted_peptide_6|116_aa
MVPASTQRKASDAGAAVRTQMSLSPFSPNLPQLRALLGAALTSSLALGKAEAALERGGHG
LEKRCCGNKATRPTPPQLSPAPGLVLFSPTHSHFPSQPAKICISGEIPRDKTPPSA

>gi568815591r:32769348_32979604|GENSCAN_predicted_CDS_6|351_bp
atggtgccagccagcacacaacggaaggcttcagacgctggagctgccgtccgcacccag
atgtctctctctcccttttcccctaacctcccccaactgcgggcccttttgggtgctgct
ctgacttcttccttggccttgggaaaagccgaagcggctctggagagaggagggcatggt
ttggagaagaggtgctgcggcaacaaagcgacccgcccaacgcctcctcaactttctcct
gcccccggtctggtattgtttagtcctactcattcccactttccgagccagcctgcaaaa
atctgtatttcaggggaaattccaagggataaaacacctccttcagcataa

>gi568815591r:32769348_32979604|GENSCAN_predicted_peptide_7|70_aa
MVRDAANSHQRFLTNSQQGNGALSPTAEKIRILPPSMVLDAYSYPEPPDGSPPLLTSSVG
VCEMQGREPS

>gi568815591r:32769348_32979604|GENSCAN_predicted_CDS_7|213_bp
atggtgagggatgcagcaaactctcatcaaaggttcctgactaacagccagcaaggaaat
ggagctctcagtcctacagctgaaaagatccgcattctgccaccctcaatggttctggat
gcatattcttacccagagcctccagatggaagcccacctctgctgacatcttcggttggg
gtttgtgagatgcaaggtcgagaacccagttga

>gi568815591r:32769348_32979604|GENSCAN_predicted_peptide_8|445_aa
MSSWPGREDARAGGAWWPREPLDQELQRAREQKRRRHDAQQLQQLKRLESLVIGACSAFV
DRTGKFNRARCPRAPPLSSLGHCDMEKPRSFHPLLRRRDEDTGPAVLESSSVQSTPFKSK
QAKKQVASFSSDHSLQQLSATSKTAAFASDLPAITHENVKFSTTIQKTLNPLKVTEDSQM
QDDQPLCTLTGRVRGYRCSHRGNPVRPLNVMDKVLAPRQVSDYLEFPNGPEGPYLAQSLS
PSCVGFLSISSSESSQLRGDPTAHPVQLPGGGILPAAPGYGTREDEAKPEDSIPDIPGNE
YAREFLAHAPTKGLWLPLGKEVRVMQCECWHCKWYGHRTGYKECPFFIKDNQKLQQFRVA
HEDFMYDIIRDNKQHEKNVRIQQLKQLLEDSTSGEDRSSSSSSEGKEKHKKKKKKEKHKK
RKKEKKKKKKRKHKSSKSNEGSDSE

>gi568815591r:32769348_32979604|GENSCAN_predicted_CDS_8|1338_bp
atgtcgtcctggccggggcgcgaggatgcgagggctgggggcgcgtggtggccgcgggag
ccgctggatcaggaactgcagcgggcccgggagcagaagcggcggcggcacgacgcccag
cagctgcagcagctcaagcgcctggagtccttagttattggcgcttgctcggcatttgtg
gaccgaacaggcaagttcaaccgagcacgctgcccccgcgctccacctctgagttccctg
ggccactgcgacatggagaaaccccgttcttttcatcctctgctacggagaagggacgag
gacacgggtcctgcagtgttggagtcttcaagtgttcaaagcacaccttttaaatctaaa
caggcaaagaaacaagttgcctccttcagtagtgaccattcactgcaacagctgtcagcc
acttccaaaactgcagcttttgccagtgacttgccagccatcacacatgaaaatgtcaag
ttctccaccacaatacaaaaaacgctgaatcccctgaaggtcacagaggactcacaaatg
caagatgaccagcctttatgcacactcactggcagggtgagagggtacagatgttctcac
agaggaaaccctgtaaggccactgaatgttatggacaaagtcctggcacctcgccaagtc
tctgattatttagagttccctaatgggccagagggaccttacttggctcagtccctcagt
ccatcctgtgttggtttcttgagtatcagcagctcagagagcagtcaactccgaggagac
ccgacagctcatccagtccagctccccggtggaggcatccttcctgcagctcctggctac
ggcaccagggaagatgaggctaagccagaagatagcataccagatataccaggcaatgaa
tacgccagagaatttctggctcatgcaccaactaaaggactttggttgccactggggaaa
gaagtcagagttatgcagtgtgaatgttggcattgcaaatggtatggtcaccgaacaggc
tacaaagaatgccctttctttatcaaagacaaccaaaagttacaacagttcagagtagca
catgaggatttcatgtatgacatcatacgagacaataaacaacatgaaaagaatgtaagg
atacagcagttaaaacagttactggaggattctacctcaggtgaagataggagcagctcc
agttcctctgaaggtaaagagaaacacaagaaaaagaagaagaaagaaaagcataagaaa
aggaagaaagaaaagaaaaagaagaaaaaacggaagcacaaatcttccaagtcaaatgag
ggttctgactcagagtga

>gi568815591r:32769348_32979604|GENSCAN_predicted_peptide_9|298_aa
MAFRGWRPPPPPLLLLLLWVTGQAAPVAGLGSDAELQIERRFVPDECPRTVRSGDFVRYH
YVGTFPDGQKFDSRVTMDCGPKLKGPVPVSYDRDSTFNVFVGKGQLITGMDQALVGMCVN
ERRFVKIPPKLAYGNEGVSGVIPPNSVLHFDVLLMDIWNSEDQVQIHTYFKPPSCPRTIQ
VSDFVRYHYNGTFLDGTLFDSSHNRMKTYDTYVGIGWLIPGMDKGLLGMCVGEKRIITIP
PFLAYGEDGDGADSAAGFQQISSFLGWEGDAGWALPKASGRSSGLAASGLLVWAYSFA

>gi568815591r:32769348_32979604|GENSCAN_predicted_CDS_9|897_bp
atggcgttccggggctggaggcccccgccgccaccgctgctcctgctgctgctctgggtg
accgggcaggcagcgcccgtggcgggcctgggctccgacgcggagctgcagatcgagcgg
cgcttcgtgcccgacgagtgcccgcgcaccgtgcgcagcggcgacttcgtgcgctaccac
tacgtggggacgttccccgacggccagaagttcgactccagggtcaccatggattgtggc
cctaaactaaagggcccggttcctgtcagctatgacagagactccactttcaatgtgttt
gtgggaaaaggacagctgatcacagggatggaccaggctcttgttgggatgtgcgtaaac
gagagacgtttcgtgaagattcccccaaagcttgcctacggaaatgaaggagtttctggt
gtgatcccccccaattcagtgcttcattttgatgtacttctgatggatatttggaattct
gaagaccaggttcagattcacacctatttcaagcccccgagttgccctcggaccatccag
gtgtctgattttgtgaggtaccactacaacgggacgttcctggacggaactctgtttgat
tcgagtcacaatcgcatgaaaacatatgacacgtatgtgggaattggctggctgattcct
ggaatggataaagggctgctggggatgtgtgtgggtgagaagcgcatcatcaccattcct
ccttttctggcctatggagaggatggagatggggctgactcagcagccggctttcagcag
atttcatcattccttggatgggaaggagatgcagggtgggctttgccaaaggccagtgga
aggagctcaggcttggcggcatctggtttgcttgtctgggcctactcctttgcatga