GENSCAN 1.0 Date run: 8-Nov-116 Time: 16:28:38 Sequence gi568815597f:149967858_150244793 : 276936 bp : 41.73% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 1034 1029 6 1.05 1.04 Term - 3394 3168 227 2 2 90 49 256 0.989 17.86 1.03 Intr - 9678 9569 110 2 2 54 100 125 0.518 9.31 1.02 Intr - 26162 25951 212 2 2 41 91 131 0.125 5.49 1.01 Init - 28253 28251 3 2 0 85 101 0 0.184 1.05 1.00 Prom - 28575 28536 40 -8.25 2.00 Prom + 30491 30530 40 -5.65 2.01 Init + 42705 42926 222 2 0 90 6 178 0.189 8.51 2.02 Intr + 51052 51127 76 0 1 66 75 39 0.103 -1.43 2.03 Term + 64762 64886 125 2 2 4 48 243 0.476 9.47 2.04 PlyA + 65290 65295 6 1.05 3.00 Prom + 75249 75288 40 -3.65 3.01 Sngl + 86007 86426 420 2 0 79 52 363 0.648 27.85 3.02 PlyA + 88177 88182 6 1.05 4.00 Prom + 90219 90258 40 -6.85 4.01 Init + 100001 100093 93 1 0 107 113 100 0.995 14.83 4.02 Intr + 100773 100907 135 2 0 78 79 112 0.987 9.14 4.03 Intr + 104309 104369 61 1 1 105 78 14 0.961 -0.51 4.04 Intr + 108376 108455 80 2 2 66 97 117 0.990 8.75 4.05 Intr + 109812 109922 111 2 0 90 87 118 0.954 11.56 4.06 Intr + 113485 113619 135 0 0 52 51 195 0.642 12.04 4.07 Intr + 114027 114140 114 2 0 83 94 108 0.998 10.62 4.08 Intr + 114859 115026 168 0 0 76 82 149 0.854 12.32 4.09 Intr + 124080 124238 159 2 0 65 86 141 0.993 10.86 4.10 Intr + 124445 124552 108 1 0 80 75 102 0.987 7.66 4.11 Intr + 125670 125791 122 2 2 90 115 34 0.931 4.67 4.12 Intr + 142639 142770 132 1 0 94 93 87 0.744 8.64 4.13 Intr + 176862 177060 199 0 1 -13 68 109 0.000 -2.77 4.14 Intr + 183055 183201 147 0 0 81 90 182 0.999 17.21 4.15 Intr + 188209 188349 141 0 0 120 92 151 0.996 18.33 4.16 Intr + 189054 189158 105 2 0 82 55 116 0.985 7.19 4.17 Intr + 189528 189629 102 2 0 66 68 48 0.471 0.05 4.18 Term + 190962 191666 705 2 0 135 55 850 0.987 78.92 4.19 PlyA + 195260 195265 6 1.05 5.00 Prom + 196816 196855 40 -6.65 5.01 Init + 201210 201421 212 2 2 76 86 180 0.464 14.91 5.02 Intr + 205318 205447 130 1 1 47 48 65 0.059 -1.82 5.03 Intr + 205873 205957 85 0 1 82 115 69 0.123 7.47 5.04 Term + 221893 222074 182 2 2 52 47 200 0.524 9.09 5.05 PlyA + 222333 222338 6 1.05 6.02 PlyA - 222350 222345 6 1.05 6.01 Sngl - 231975 231682 294 0 0 53 37 279 0.924 14.75 6.00 Prom - 245397 245358 40 -6.65 7.08 PlyA - 246088 246083 6 1.05 7.07 Term - 252904 252834 71 2 2 50 36 194 0.995 7.52 7.06 Intr - 255383 255329 55 2 1 18 96 158 0.868 7.23 7.05 Intr - 258938 258751 188 0 2 48 68 448 0.993 37.29 7.04 Intr - 261380 261215 166 2 1 58 51 147 0.991 6.61 7.03 Intr - 262836 262714 123 0 0 25 119 75 0.942 4.16 7.02 Intr - 264069 263920 150 0 0 92 106 80 0.977 9.64 7.01 Init - 267929 267876 54 2 0 93 78 86 0.856 9.43 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 182404 182430 27 0 0 79 100 37 0.806 3.71 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:149967858_150244793|GENSCAN_predicted_peptide_1|183_aa MFLWKGEGKTETFTENLAYVIVVIVKHCPYYLYPLSMGQESLNRIYMFDPRGLEYILLEA SVCTGSNSSQFCWLKVLEDHMTLDMDAVLSDFVRSTGAEPGLARDLLEGKNWDVNAALSD FEQLRQVHAGNLPPSFSEGSGGSRTPEKGFSDREPTRPPRPILQRQDDIVQGTGVTGMFP YME >gi568815597f:149967858_150244793|GENSCAN_predicted_CDS_1|552_bp atgtttttgtggaaaggagaaggaaaaacagaaaccttcactgaaaacttggcctatgta attgtggtcattgtgaagcattgcccatactacttgtacccactcagtatggggcaagaa agtttaaacaggatttacatgtttgaccctagaggcttagagtacattctattggaagca tctgtctgcacaggaagcaactcatcccagttttgctggctgaaagtacttgaggatcac atgaccctggacatggatgctgttctgtcagattttgtccgttccacaggagcagagcca gggctagcgcgagatctcctagaaggaaagaattgggatgtgaatgccgccctcagtgat tttgaacagctacgtcaagtccatgctggaaacctacccccatcctttagtgaggggagt ggtggctccaggacccctgaaaaagggttttctgacagagagcctactcgccctccccga cccatcctccagcggcaggatgacatcgttcaaggtactggggtgactggaatgtttccg tatatggagtag >gi568815597f:149967858_150244793|GENSCAN_predicted_peptide_2|140_aa MEVEAVVETATAAVAPHGIWSWSQKGLIGTGLPQWLFAHSAPGRHSGSGGWFRIGGFLLS GRKQAPVSGRSGLVAFMQQTASNTQQDSMVATPKIKAQSKQNPVKRRKEEKKRKKRRKRN KKKKKEEEEEEEEEEEESIP >gi568815597f:149967858_150244793|GENSCAN_predicted_CDS_2|423_bp atggaggtggaggcggtggtggagactgcgaccgcagctgttgctccgcacgggatctgg agctggagccagaaaggactgatcgggactggcctcccccagtggctttttgcccactcc gcacccggtcgccactccggctccggcggctggttcaggatcggtggctttctgctgtcc ggccggaaacaggcgccagtttccggccggagcgggttggtggcctttatgcagcaaaca gcttctaacactcagcaagacagtatggtggctactcctaaaattaaagcacagtctaag caaaatcctgttaaaagaagaaaagaagaaaagaagaggaagaagaggaggaagaggaat aagaagaagaagaaggaggaggaggaggaggaggaagaagaagaagaagaaagtatacca tga >gi568815597f:149967858_150244793|GENSCAN_predicted_peptide_3|139_aa MEGEQVEKPDTKEKKPEVKKADAGGKVKKGNLKAKKPKNRKPHCSQNPVIVRGIGRYSPS AMYSRKATCKRKYSAVKSKVEKQKEKFPATITKPAGGGKNGGTQVVKLCKMPTYYLTEDV LSKLLSQGKKPFSQHMRKL >gi568815597f:149967858_150244793|GENSCAN_predicted_CDS_3|420_bp atggagggtgaacaagttgagaagccagatactaaagagaagaaacctgaagtcaagaag gctgatgctggtggcaaggtgaaaaagggtaacctcaaggctaagaagcccaagaacagg aagccccattgcagtcaaaatcctgtcattgtcagagggattggcagatattctccatct gctatgtattccagaaaggccacgtgcaagaggaagtactcagctgtgaaatccaaggtt gaaaagcaaaaggagaagtttcctgcaactattacaaaaccagctggtggtggcaagaac ggtggtacgcaggtggttaaactttgcaaaatgcctacatattatcttactgaagatgtg ctttcaaagctgttgagccaaggaaaaaaacccttcagtcagcacatgagaaaactgtga >gi568815597f:149967858_150244793|GENSCAN_predicted_peptide_4|938_aa MNVVFAVKQYISKMIEDSGPGMKVLLMDKETTGIVSMVYTQSEILQKEVYLFERIDSQNR EIMKHLKAICFLRPTKENVDYIIQELRRPKYTIYFIYFSNVISKSDVKSLAEADEQEVVA EVQQVITKEYELFEFRRTEVPPLLLILDRCDDAITPLLNQWTYQAMVHELLGINNNRIDL SRVPGISKDLREVVLSAENDEFYANNMYLNFAEIGSNIKNLMEDFQKKKPKEQQKLESIA DMKAFVENYPQFKKMSGTVSKHVTVVGELSRLVSERNLLEVSEVEQELACQNDHSSALQN IKRLLQNPKVTEFDAARLVMLYALHYERHSSNSLPGLMMDLRNKGVSEKYRKLVSAVVEY GGKRVRGSDLFSPKDAVAITKQFLKGLKGVENVYTQHQPFLHETLDHLIKGRLKENLYPY LGPSTLRDRPQDIIVFVIGGATYEEALTVYNLNRTTPGVRIVLGGTTVHNTKRKFWLLDC TAEARRALKSHQGQRAEDETVVGGRAQLPLLSPLQVFPTKQRCWRAALGSVLVVRTHLQG PQDGNQQPAPPEKVGWVRKFCGKGIFREIWKNRYVVLKGDQLYISEKEVKDEKNIQEVFD LSDYEKCEELRKSKSRSKKNHSKFTLAHSKQPGNTAPNLIFLAVSPEEKESWINALNSAI TRAKNRILDEVTVEEDSYLAHPTRDRAKIQHSRRPPTRGHLMAVASTSTSDGMLTLDLIQ EEDPSPEEPTSCAESFRVDLDKSVAQLAGSRRRADSDRIQPSADRASSLSRPWEKTDKGA TYTPQAPKKLTPTEKGRCASLEEILSQRDAASARTLQLRAEEPPTPALPNPGQLSRIQDL VARKLEETQELLAEVQGLGDGKRKAKDPPRSPPDSESEQLLLETERLLGEASSNWSQAKR VLQEVRELRDLYRQMDLQTPDSHLRQTTPHSQYRKSLM >gi568815597f:149967858_150244793|GENSCAN_predicted_CDS_4|2817_bp atgaacgtggtttttgctgtgaagcagtacatttccaaaatgatagaggacagcgggcct ggtatgaaagtacttctcatggataaagagacgactggcatagtgagtatggtatacaca caatcggagattctacagaaggaagtgtacctctttgaacgcattgattctcaaaatcga gagatcatgaaacacctgaaggcaatttgttttcttcgacctacaaaggagaatgtggat tatattattcaggagctccgaagacccaaatacactatatatttcatttatttcagtaat gtgatcagcaagagtgacgtgaagtcattggctgaagctgatgaacaggaagttgtggct gaggttcagcaagtgataactaaagaatatgaactgtttgaattccgtcggacagaggtt cctccattgctccttattttagatcgctgtgatgatgccatcaccccattgctaaaccag tggacatatcaggccatggtccacgaactactaggcataaacaacaatcggattgatctt tccagagtgccgggaatcagtaaagacttaagagaagtggtcctatctgctgaaaatgat gaattctatgctaataatatgtacctgaactttgctgagattggtagcaatataaagaat ctcatggaagattttcagaagaagaaaccaaaagaacagcaaaaactagaatcaatagca gacatgaaggcgtttgttgagaattatccacagttcaagaaaatgtctgggactgtttca aagcatgtgacagtggttggagaactgtctcgattggtcagtgaacggaatctgctggag gtttcagaggttgagcaagaactggcctgtcaaaatgaccattctagtgctctccagaat ataaaaaggcttctgcagaaccccaaagtgacagagtttgatgctgcccgcctggtgatg ctttatgctttacattatgagcgacacagcagcaatagcctgccaggactaatgatggac ctcaggaataaaggtgtttctgagaagtatcgaaagctcgtgtctgcagttgttgaatat ggtggtaaacgagtcagaggaagtgacctcttcagccccaaagatgctgtggctatcacc aaacaattcctcaaaggactgaagggagtagaaaatgtatatacacagcatcaacctttc ctacatgaaaccctggatcatctcatcaaaggaaggcttaaggaaaacctatatccttat ttaggccccagcacactcagagacagacctcaggatatcattgtgtttgtaattggagga gccacctatgaagaggctctaacagtttataacctgaaccgcaccactcctggagtgagg attgtcctgggaggcaccacagtgcacaacacgaaaaggaagttctggcttctggactgc acagccgaagcaaggagagctctcaagtcacatcaaggtcagcgagcagaagatgaaacg gtggttgggggaagggcacagcttcctctcttgtccccactacaggttttccctactaaa caaaggtgttggagagcagctttgggttctgtgctggttgttagaactcatctccaggga cctcaggatggaaaccagcagcctgcaccgcccgagaaggtcggctgggtccggaaattc tgcgggaaagggattttcagggagatttggaaaaaccgctatgtggtgctgaaaggggac cagctctacatctctgagaaggaggtaaaagatgagaaaaatattcaagaggtatttgac ctgagtgactatgagaagtgtgaagagctccggaagtccaagagcaggagcaagaaaaat catagcaagtttactcttgcccactccaaacagcccggtaacacggcacccaacctgatc ttcctggcagtgagtccagaagagaaggaatcgtggatcaatgccctcaactctgccatc acccgagccaagaaccgtatcttggatgaggtcaccgttgaggaggacagctatcttgcc catcccactcgagacagggcaaaaatccagcactcccgccgccccccaacaaggggacac ctaatggctgtggcttccacctctacctcggatgggatgctgaccttggacttgatccaa gaggaagacccttcccctgaggaaccaacctcttgtgctgagagctttcgggttgacctg gacaagtctgtggcccagctggcagggagccggcggagagcggactcagaccgcatccag ccctccgcagaccgggcaagcagtctctcccgaccttgggaaaaaacagacaaaggggcc acctacaccccccaggcacccaagaagttgacgcccacagagaaaggccgctgcgcctcc ctggaggagatcctatctcagcgggatgctgcctctgcccgcaccctccagctgcgggct gaggaacccccaacccctgccctccccaacccggggcagctgtcccggatccaggacctg gtagcaaggaaactggaggagactcaggagcttctggcagaggttcagggactgggagat gggaagcgaaaggccaaggacccccctcggtctccgccggattctgagtcagagcagctg ctgctggagacggaacggctgctgggagaggcatcatcgaattggagccaggcaaagagg gtgctgcaggaggtcagggagctgagagacctgtacagacagatggacctgcagaccccg gactcccacctcagacagaccaccccgcacagtcagtaccggaagagcctgatgtga >gi568815597f:149967858_150244793|GENSCAN_predicted_peptide_5|202_aa MAPVTMMGYRSGKMGILADVQLQVGPPGPWLHLVVIAPVPECITGIGIFSSWGSPDVGPL LYDIRAIMWGSKRVQIRTSQEKRYRGQSLGRATREALGTFSCGVPDITAPPCYELASGHM LLWRTEKGEQTPIMTPNSKRTEVKLKVIELYNRQALISRPLEWKSVLATVTVNTECQLDW IEGRKVLILGVSVRVLQKEINI >gi568815597f:149967858_150244793|GENSCAN_predicted_CDS_5|609_bp atggcccctgttacaatgatgggctataggtcaggtaaaatggggattctggctgatgtc cagcttcaagtgggtccaccggggccgtggctccacctggtggtcattgccccagtccct gagtgtataactgggattggtatattcagcagttggggaagccccgatgtggggcccctg ctttatgatataagagctatcatgtggggaagcaaaagggtacaaatcagaaccagccaa gagaagagatacagagggcaaagtctgggaagagctacacgggaagctttgggcaccttc tcctgtggagttccagacatcactgcccctccctgctacgagttggcttcaggtcacatg ttgctctggagaaccgagaagggtgagcagacaccaatcatgaccccaaacagcaagagg acagaagtgaaattgaaagtcatagagctctacaatcgtcaagctcttattagcagaccg ttggaatggaaatctgttttagctactgtaacggttaatactgagtgtcaacttgattgg attgaaggacgcaaagtattgatcctgggtgtgtccgtgagggtgttgcaaaaagagatt aacatttga >gi568815597f:149967858_150244793|GENSCAN_predicted_peptide_6|97_aa MDDFEEFKTSVEEGTADVVEIARGLDLEVELEDESELPQSHDKTLTDVELLLMDKQRKRF LEMESTCEDSVNIVEMTTKDLKYYNNLVVKAVAGFED >gi568815597f:149967858_150244793|GENSCAN_predicted_CDS_6|294_bp atggatgactttgaggagtttaagacatcagtggaggaaggaactgcagatgtggtagaa atagcaaggggactagatttagaagtggagcttgaagatgaatctgaattgcctcaatct catgataaaactttaacagatgtggagttgcttcttatggataagcaaagaaagagattt cttgagatggaatctacttgtgaagattctgtgaacatagtggaaatgacaacaaaggat ttaaaatattacaacaacttagttgttaaagcagtggcagggtttgaggattga >gi568815597f:149967858_150244793|GENSCAN_predicted_peptide_7|268_aa MEMKKKINLELRNRSPEEVTELVLDNCLCVNGEIEGLNDTFKELEFLSMANVELSSLARL PSLNKLRKLELSDNIISGGLEVLAEKCPNLTYLNLSGNKIKDLSTVEALQNLKNLKSLDL FNCEITNLEDYRESIFELLQQITYLDGFDQEDNEAPDSEEEDDEDGDEDDEEEEENEAGP PEGYEEEEEEEEEEDEDEDEDEDEAGSELGEGEEEVGLSYLMKEEIQDEEDDDDYVEEGE EEEEEEEGGLRGEKRKRDAEDDGEEEDD >gi568815597f:149967858_150244793|GENSCAN_predicted_CDS_7|807_bp atggagatgaagaagaagattaacctggagttaaggaacagatccccggaggaggtgaca gagttagtccttgataattgcctgtgtgtcaatggggaaattgaaggcctgaatgatact ttcaaagaactagaatttctgagtatggctaatgtggaactaagttcgctggcccggctt cccagcttaaataaacttcgaaaattggagcttagtgataatataatttctggaggcttg gaagtcctggcagagaaatgtccaaatcttacctacctcaatctgagtggaaacaaaata aaagatctcagtacagtagaagctctgcaaaatcttaaaaatttgaaaagtcttgacctg tttaactgtgagatcacaaacctggaagattatagagaaagtatttttgaactactgcag caaatcacatacttagatggatttgatcaggaggataatgaagcgccggactctgaagag gaggatgatgaggatggcgatgaagatgatgaagaggaagaggaaaatgaagctggtcca ccggaaggatatgaggaagaggaggaggaagaggaagaggaggatgaggatgaggatgaa gatgaagatgaagcaggttcagagttgggagagggagaagaggaagtgggcctctcatac ttaatgaaagaagaaattcaggatgaagaagatgatgatgactatgttgaagaaggggaa gaagaggaagaagaggaagaaggaggtcttcgaggggagaagaggaaacgagatgctgaa gacgatggagaggaagaagatgactag