GENSCAN 1.0 Date run: 3-Nov-116 Time: 15:58:59 Sequence gi568815581f:32071043_32324730 : 253688 bp : 45.69% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 Intr - 6239 6145 95 1 2 24 74 102 0.051 2.01 1.05 Intr - 9001 8824 178 2 1 41 22 151 0.120 2.78 1.04 Intr - 15924 15889 36 1 0 34 111 53 0.010 0.43 1.03 Intr - 35820 35659 162 1 0 36 94 104 0.605 5.75 1.02 Intr - 36420 36280 141 1 0 101 34 143 0.348 10.52 1.01 Init - 36919 36892 28 1 1 30 99 -2 0.603 -5.03 1.00 Prom - 37313 37274 40 -5.96 2.12 PlyA - 37416 37411 6 1.05 2.11 Term - 40962 40516 447 0 0 29 41 343 0.828 18.92 2.10 Intr - 43080 42945 136 2 1 115 12 109 0.203 6.57 2.09 Intr - 47100 46959 142 1 1 69 82 27 0.408 -0.39 2.08 Intr - 47930 47865 66 0 0 87 30 93 0.249 2.28 2.07 Intr - 56733 56549 185 2 2 73 14 121 0.070 2.63 2.06 Intr - 65719 65677 43 2 1 94 99 5 0.016 -0.50 2.05 Intr - 71521 71336 186 2 0 80 -15 241 0.066 12.66 2.04 Intr - 79972 79822 151 1 1 89 -47 140 0.033 0.34 2.03 Intr - 88987 88836 152 2 2 82 81 34 0.779 1.98 2.02 Intr - 89410 89274 137 0 2 67 79 27 0.491 -0.09 2.01 Init - 89738 89506 233 2 2 99 41 115 0.550 3.65 2.00 Prom - 96265 96226 40 -3.36 3.00 Prom + 100503 100542 40 -3.56 3.01 Init + 102783 102870 88 2 1 24 95 46 0.751 -0.28 3.02 Intr + 104277 104320 44 1 2 70 99 42 0.870 1.46 3.03 Intr + 104920 104973 54 0 0 86 100 43 0.956 4.48 3.04 Intr + 107579 107694 116 1 2 79 78 113 0.994 8.65 3.05 Intr + 111715 111823 109 1 1 81 69 44 0.991 2.09 3.06 Intr + 112129 112230 102 0 0 54 77 94 0.972 5.27 3.07 Intr + 121159 121257 99 0 0 51 95 65 0.894 3.81 3.08 Intr + 132848 132931 84 1 0 68 91 63 0.893 4.62 3.09 Intr + 137065 137267 203 0 2 76 94 92 0.212 6.68 3.10 Intr + 151717 151860 144 1 0 31 92 118 0.038 5.90 3.11 Term + 153574 153691 118 1 1 84 40 89 0.992 1.71 3.12 PlyA + 154392 154397 6 1.05 4.00 Prom + 154773 154812 40 -1.86 4.01 Init + 187765 187889 125 0 2 63 53 238 0.953 15.47 4.02 Intr + 188541 188577 37 0 1 108 86 30 0.985 3.06 4.03 Intr + 191511 191721 211 2 1 82 65 111 0.181 6.69 4.04 Intr + 195110 195258 149 0 2 -77 86 289 0.462 12.15 4.05 Intr + 196860 196883 24 2 0 81 91 23 0.229 0.12 4.06 Intr + 206732 206917 186 1 0 46 81 57 0.054 0.69 4.07 Intr + 211113 211208 96 2 0 49 81 68 0.726 2.41 4.08 Intr + 213617 213775 159 1 0 110 75 220 0.992 23.08 4.09 Intr + 217750 217986 237 0 0 114 35 377 0.995 32.81 4.10 Intr + 223252 223400 149 0 2 100 96 82 0.873 9.23 4.11 Intr + 227050 227162 113 1 2 93 96 165 0.945 17.82 4.12 Intr + 234299 234399 101 0 2 85 89 78 0.122 7.43 4.13 Intr + 243859 243951 93 0 0 89 65 32 0.027 1.16 4.14 Intr + 245190 245250 61 2 1 100 86 74 0.043 6.71 4.15 Term + 249916 250187 272 2 2 141 55 422 0.999 39.85 4.16 PlyA + 250866 250871 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 151748 151860 113 1 2 77 92 94 0.956 8.38 S.002 Term + 234299 234468 170 0 2 85 36 134 0.808 5.94 S.003 Init + 245202 245250 49 2 1 73 86 119 0.955 9.57 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:32071043_32324730|GENSCAN_predicted_peptide_1|214_aa MAFIKKLNSEMIKAAEADSDCNFHHFLNQQCNQYHLSSTEGYGGPHKWLSYNWNQNATST AAASATTIASTGQTFQITGNPVTMAGKVITKLPLPANSKTVTVNVPATQGGSWIREPKIL SSARCEGQWLILKFLSYSLIRFLPKLWFQLTPLSEAASEFSCYGNGAAEIPRNLLVRLIA SPSCFTFCHELKLPEIFPEAKQMPALCLYSPQNX >gi568815581f:32071043_32324730|GENSCAN_predicted_CDS_1|642_bp atggcctttatcaagaaattaaattcagaaatgattaaagcagcagaagctgacagtgat tgcaacttccaccacttcctcaaccaacagtgcaaccagtaccatctctccagcacagaa ggttatggtggcccccataagtggctcagttacaactggaaccaaaatgccaccagcaca gctgcggcatctgctacaaccattgccagcacaggtcagacgttccaaattacaggcaat ccagtcactatggcaggaaaagtaattaccaaactgccacttcctgcaaacagcaagact gtcactgtaaatgtgccagcaacacaaggaggttcttggattcgagagcctaagatcctg agttcagctcggtgtgagggccagtggttgatcttgaagtttctttcctactccctcatc cgtttcctgcccaagctgtggttccagctgactcccctatcagaggctgcgagtgagttc agttgttatggcaacggggctgcggagatccccaggaacctgctggtccgcctcattgcc tcaccttcctgcttcaccttctgccatgagttaaagctccctgagatctttccagaagcc aagcagatgccagcactgtgcttgtacagcccgcagaactnn >gi568815581f:32071043_32324730|GENSCAN_predicted_peptide_2|625_aa MGSDLCAQPGTPAAAVGWAAPDASLGTISLRGCNWTRRITSSFHSWHWRTQWCPEAWRLQ EPQGPKEGVTALAKGAPSSFSPTIQQVPSSCPASRKNEVTDKWRLSKTNRSFTERHNSSE ETYKSGRLFMGLRGEEECADWSMGSHGRTRVTSSHTGLRDQQPSPRASGPPSLKVWFRNR RFKLKQQQQQQQQSAKQPNQIPSIQEECAHLPQNIPQCLCFFSRVASPQESRDSASSASS GPAGCGGGGGGGGGGGGGGGGGDDGTRRGTRAFGTRTRSAPEQRSRCPLWWSGKVQCFDT RVEREQASGKHLPSAKPSRCLIEEVHLPVFLLISLNLSLAKAHVGSGTGVMRRTRKGSRA SGDAPSWDGPTFMDLNVFAGNMGLAAIINDLAAVIVVLLQKYGKSQATSSMPRDLFPSTL KACSIPQQRRDRVEKEKAQAVEQQAKVSEQKKAEDVKAQMEALSKQQHLTSHQRGELGTR IATASEKGTLIGIFDTSSGHLIQELRRGSQAANIYCINFNQDASLICVSSDHGTVHIFAA EDLKRNKQSSLASASFLPKYFSYKWSFSKFQVPSGSPCICAFGTQPNAVIAICADCSYYK LLFNPKGECIRDVYAQFLEMTDDKL >gi568815581f:32071043_32324730|GENSCAN_predicted_CDS_2|1878_bp atggggtctgacctctgtgcacagccaggcacgccggctgctgctgtggggtgggcagct ccagatgccagcttgggcaccatctccctgcgaggctgcaactggaccaggcgcatcaca agcagcttccacagctggcactggagaacacagtggtgcccggaagcttggcgactccag gaaccacagggccccaaagagggagtcacagccctggctaagggagctcccagctctttt agccccaccattcagcaggtcccgagttcttgtcctgcatccaggaagaatgaggtaaca gacaagtggaggctgagcaagacaaacaggagctttactgagcgacacaacagctcagag gaaacctacaagtccgggaggctttttatgggcctcagaggggaagaagagtgtgccgat tggtccatgggcagccatgggaggacccgcgtcacaagttcccacactggtctgcgggat cagcaacccagcccccgggcttcaggccctcccagcctgaaggtttggtttaggaaccgg cgattcaaattgaagcagcagcagcagcagcagcagcaatcagcaaagcaaccaaaccag ataccttccatccaagaagaatgtgcccacctcccccagaacatcccccagtgcttatgc tttttctcccgtgttgcttccccgcaagagagccgggacagcgccagctccgcctcctcc ggcccagcgggctgtggcggcggcggcggcggcggcggcggcggcggcggcggcggcggc ggcggggacgacggcacgcggcgagggacacgcgctttcggaacgcgaacgcgctctgcg ccggaacagcgctcccgctgcccactctggtggagtggaaaagtccaatgctttgatacc agggtggagcgtgaacaggcttccgggaaacatcttccatcagccaaaccgtcccgctgc cttattgaagaagtccatcttccagtttttctgctcatatctctgaatctctccttggca aaggctcacgtgggctccggcacgggcgtcatgcggaggacaaggaaggggagccgcgcg tctggggatgccccctcatgggatggacctactttcatggacctcaacgtctttgcaggg aacatgggcctagcggccatcataaatgacctggctgcagtgatagttgtgctgttacag aaatatggaaaatcccaggccacgtcctccatgcctcgggacctgtttccttccacacta aaggcttgctcaataccccagcaaaggagggatagagtggagaaagaaaaggcacaagca gttgagcaacaggccaaggttagtgaacagaagaaggcagaggatgtcaaggcccaaatg gaggctctttcaaaacagcagcacttgacttcccatcagcggggggagttgggaacaaga attgcaactgcatccgagaaagggacccttataggaatatttgatacttcctcagggcat ttaattcaggaactgcgaagaggatctcaagcagccaatatttactgcattaacttcaat caggatgcgtccctcatctgtgtatccagcgaccatggcacagtgcatatttttgcagct gaagatctaaaaaggaataaacagtccagtttggcctcagccagtttccttccaaaatac ttcagttacaagtggagtttctccaagtttcaggttccctcaggctctccgtgcatttgt gcctttggaacacagccaaacgccgtcattgcaatttgtgcagactgcagctactacaaa ctcctgttcaaccccaagggggagtgcatccgagatgtctacgcgcagtttctagagatg actgatgacaagttgtga >gi568815581f:32071043_32324730|GENSCAN_predicted_peptide_3|386_aa MKVPPRAEEITIPADVTPERVPTHIVDYSEAEQSDEQLHQEISQANVICIVYAVNNKHSI DKEVRSISARLPIVWDVRSASARLPRMGCEERLCPAALSGRLPLILVGNKSDLVEYSSME TILPIMNQYTEIETCVECSAKNLKNISELFYYAQKAVLHPTGPLYCPEEKEMKPACIKAL TRIFKISDQDNDGTLNDAELNFFQRQKKIREDHKSYYAINTVYVYGQEKYLLQHFMDSRI PCLIVAAKSDLHEVKQEYSISPTDFCRKHKMPPPQAFTCNTADAPSKDIFVKLTTMAMYP LRARIAEWGAMLLADDEGSITHQVHAVSSVEDSIFMESSHGPLFLEARHVTQADLKSSTF WLRASFGATVFAVLGFAMYKALLKQR >gi568815581f:32071043_32324730|GENSCAN_predicted_CDS_3|1161_bp atgaaggttcctccccgggcagaagaaatcaccattccagctgatgtcaccccagagaga gttccaacacacattgtagattactcagaagcagaacagagtgatgaacaacttcatcaa gaaatatctcaggctaatgtcatctgtatagtgtatgccgttaacaacaagcattctatt gataaggaagtgaggagcatctctgcccggctgcccatcgtctgggatgtgaggagcgcc tctgcccggctgccccgaatgggatgtgaggagcgcctctgcccagccgccctgtctggg aggctgcctttaatattggttgggaacaaatctgatctggtggaatatagtagtatggag accatccttcctattatgaaccagtatacagaaatagaaacctgtgtggagtgttcagcg aaaaacctgaagaacatatcagagctcttttattacgcacagaaagctgttcttcatcct acagggcccctgtactgcccagaggagaaggagatgaaaccagcttgtataaaagccctt actcgtatatttaaaatatctgatcaagataatgatggtactctcaatgatgctgaactc aacttctttcagaggcagaagaaaattcgtgaagatcataaatcctactatgcgattaac actgtttatgtatatggacaagagaaatacttgttgcaacactttatggacagcagaata ccttgcttaatcgtagctgcaaagtcagacctgcatgaagttaaacaagaatacagtatt tcacctactgatttctgcaggaaacacaaaatgcctccaccacaagccttcacttgcaat actgctgatgcccccagtaaggatatctttgttaaattgacaacaatggccatgtatccg ttaagagcccggatagctgagtggggtgccatgctgttagcagacgatgagggcagtatt actcaccaggtgcatgctgtcagctctgtagaagattccatcttcatggagtcatctcat ggtccactttttcttgaagccaggcacgtgacacaagctgacctcaagagctccacgttt tggcttcgagcaagttttggtgctactgtttttgcagttttgggctttgctatgtacaaa gcattattgaaacagcgatga >gi568815581f:32071043_32324730|GENSCAN_predicted_peptide_4|670_aa MELQDVLLLLLLMLMLLLRPHHQKACLVDPTSFFAAAPHWSSLLDTLVPGSVQWPFILLC VDFKCLVRNLKRGHGVAHLQNEEQGRSANRPSSVSLLISAKGSPREGPLHGTPPGIDECL PAAAAPQPPPPPDPVSAMGEHPSPGPAVAACAEAERIEELEPEAEERLPAAPEDHWKVLF DQSVPSSVTVKKYYDQRSNKFEECWAEQNQTGFSSLKPCRVLDMVRYDPPRGFSTCRTTE PLFQIFTEQQVGEFPEILGVVNQTDVIHETWHFGLKFDPGNTGYISTGKFRSLLESHSSK LDPHKREVLLALADSHADGQIGYQDFVSLMSNKRSNSFRQAILQGNRRLSSKALLEEKGL SLSQRLIRHVAYETLPREIDRKWYYDSYTCCPPPWFMITVTLLEARTRVAFFLYNGVSLG QFVLQVTHPRYLKNSLVYHPQLRAQVWRYLTYIFMHAGIEHLGLNVVLQLLVGVPLEMVH GATRIGLVYVAGVVAGSLAVSVADMTAPVVGSSGGVYALVSAHLANIVMHGTCRQAKTGF PPKAGPQVAHTSLSVKKGLLNWSGMKCQFKLLRMAVALICMSMEFGRAVWLRFHPSAYPP CPHPSFVAHLGGVAVGITLGVVVLRNYEQRLQDQSLWWIFVAMYTVFVLFAVFWNIFAYT LLDLKLPPPP >gi568815581f:32071043_32324730|GENSCAN_predicted_CDS_4|2013_bp atggagctccaggatgtcttgctgctgctgctgctgatgctgatgctgctgctgcggcct caccatcaaaaagcctgcctcgtggaccccacgtcattcttcgctgcagcccctcactgg agcagtctgttggacactctggtgcctggttctgtgcagtggcccttcatcctgctctgc gttgacttcaagtgtcttgttaggaaccttaaaagaggccatggtgtagcgcacttgcaa aatgaagagcaaggaagaagtgccaataggcccagttccgtgtcacttctcattagcgcc aaaggttctcccagggaggggcctctgcatggcacacctccaggcatcgacgagtgcctg cctgctgcagctgccccgcagccgccgccgcccccggaccccgtctcggccatgggcgag caccccagcccgggccccgcggtggccgcctgcgccgaggcggagcgcatcgaggagctg gaacccgaggccgaggagcggctgcccgcggcgccggaggaccactggaaagtcctgttt gatcagagtgttccatcaagtgttactgtgaaaaaatactacgaccaaaggtcaaataag tttgaagaatgctgggctgaacaaaatcagacaggtttctccagcctaaaaccttgcaga gtccttgatatggtgcgttatgaccctccaagaggtttttccacttgtaggaccacagaa cccttatttcagatctttactgagcaacaagtaggtgagtttcctgaaatacttggtgtt gtgaaccaaacagatgtcatccatgaaacctggcattttggcttgaagtttgaccctggg aacacaggctacattagcacaggcaagttccggagtcttctggagagccacagctccaag ctggacccgcacaaaagggaggtcctcctggctcttgccgacagccacgcggatgggcag atcggctaccaggattttgtcagcctaatgagcaacaagcgttccaacagcttccgccaa gccatcctgcagggcaaccgcaggctaagcagcaaggccctgctggaggagaaggggctg agcctctcgcagcgacttatccgccatgtggcctatgagaccctgccccgggaaattgac cgcaagtggtactatgacagctacacctgctgccccccaccctggttcatgatcacagtc acgctgctggaggcaaggacaagggttgcctttttcctctacaatggggtgtcactaggt caatttgtactgcaggtaactcatccacgttacttgaagaactccctggtttaccaccca cagctgcgagcacaggtttggcgctacctgacatacatcttcatgcatgcagggatagaa cacctgggactcaatgtggtgctgcagctgctggtgggggtgcccctggagatggtgcat ggagccacccgaattgggcttgtctacgtggccggtgttgtggcagggtccttggcagtg tctgtggctgacatgaccgctccagtcgtgggctcttctggaggggtgtatgctctcgtc tctgcccatctggccaacattgtcatgcatggaacatgtcggcaagcgaagacaggcttt ccacctaaagcgggaccgcaggtggcccacacttcactttctgtgaagaaggggttgctg aactggtcaggcatgaagtgccagttcaagctgctgcggatggctgtggcccttatctgt atgagcatggagtttgggcgggccgtgtggctccgcttccacccgtcggcctatcccccg tgccctcacccaagctttgtggcgcacttgggtggcgtggccgtgggcatcaccctgggc gtggtggtcctgaggaactacgagcagaggctccaggaccagtcactgtggtggattttt gtggccatgtacaccgtcttcgtgctgttcgctgtcttctggaacatctttgcctacacc ctgctggacttaaagctgccgcctcccccctga