GENSCAN 1.0 Date run: 4-Nov-116 Time: 00:25:07 Sequence gi568815590f:54035284_54247716 : 212433 bp : 42.67% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 296 291 6 1.05 1.05 Term - 12835 12782 54 1 0 81 45 63 0.807 -2.12 1.04 Intr - 15905 15729 177 2 0 122 96 112 0.922 14.59 1.03 Intr - 17473 17372 102 1 0 124 93 6 0.946 4.15 1.02 Intr - 19850 19777 74 0 2 82 110 31 0.957 2.91 1.01 Init - 27115 26971 145 1 1 85 92 73 0.769 7.73 1.00 Prom - 27549 27510 40 -11.24 2.00 Prom + 33229 33268 40 -5.45 2.01 Init + 34680 34727 48 2 0 65 81 12 0.269 -0.80 2.02 Intr + 37846 38038 193 0 1 54 48 201 0.117 10.84 2.03 Intr + 40163 40314 152 0 2 12 65 140 0.046 3.06 2.04 Term + 56759 57226 468 1 0 30 54 420 0.513 26.79 2.05 PlyA + 59207 59212 6 1.05 3.03 PlyA - 60257 60252 6 1.05 3.02 Term - 65656 65541 116 2 2 112 49 18 0.309 -1.95 3.01 Init - 66336 66162 175 0 1 35 68 188 0.333 9.06 3.00 Prom - 70928 70889 40 -7.05 4.02 PlyA - 71475 71470 6 1.05 4.01 Sngl - 81191 80595 597 2 0 58 37 675 0.520 55.34 4.00 Prom - 84546 84507 40 -4.55 5.00 Prom + 92255 92294 40 -5.95 5.01 Init + 92582 92729 148 1 1 57 56 59 0.271 -0.10 5.02 Intr + 99713 100125 413 0 2 52 50 287 0.930 14.28 5.03 Intr + 101228 101382 155 1 2 121 82 85 0.991 9.15 5.04 Intr + 101985 102150 166 0 1 65 107 145 0.950 13.14 5.05 Intr + 107380 107503 124 0 1 55 106 174 0.929 15.14 5.06 Term + 112099 112436 338 2 2 58 38 223 0.837 8.05 5.07 PlyA + 114247 114252 6 1.05 6.02 PlyA - 115151 115146 6 1.05 6.01 Sngl - 119096 118875 222 2 0 30 44 217 0.676 6.40 6.00 Prom - 120302 120263 40 -5.95 7.00 Prom + 120589 120628 40 -4.95 7.01 Init + 123696 123782 87 2 0 61 93 74 0.408 5.89 7.02 Term + 125728 126429 702 0 0 61 33 304 0.411 14.73 7.03 PlyA + 127400 127405 6 1.05 8.10 PlyA - 127631 127626 6 1.05 8.09 Term - 130744 130541 204 1 0 22 38 160 0.169 0.79 8.08 Intr - 135294 135197 98 1 2 49 72 64 0.674 -0.29 8.07 Intr - 139392 139234 159 1 0 95 117 131 0.934 15.74 8.06 Intr - 140018 139933 86 1 2 71 28 115 0.874 2.34 8.05 Intr - 142554 142436 119 0 2 36 47 81 0.050 -2.86 8.04 Intr - 143356 143275 82 0 1 101 47 127 0.116 8.62 8.03 Intr - 143934 143825 110 0 2 99 -23 73 0.253 -4.54 8.02 Intr - 149335 149238 98 2 2 61 81 70 0.166 2.41 8.01 Init - 151774 151705 70 1 1 64 86 42 0.346 2.86 8.00 Prom - 155592 155553 40 -5.45 9.02 PlyA - 155964 155959 6 1.05 9.01 Sngl - 156753 156253 501 0 0 41 41 482 0.810 34.59 9.00 Prom - 157310 157271 40 -4.05 10.00 Prom + 158080 158119 40 -7.85 10.01 Init + 175332 175387 56 2 2 94 42 38 0.544 0.61 10.02 Intr + 176530 176653 124 1 1 55 92 97 0.617 6.47 10.03 Intr + 182762 183055 294 1 0 38 36 287 0.054 14.78 10.04 Intr + 192389 192496 108 1 0 84 -4 103 0.017 0.26 10.05 Term + 200958 201131 174 2 0 59 40 198 0.773 8.88 10.06 PlyA + 201511 201516 6 1.05 11.04 PlyA - 204279 204274 6 1.05 11.03 Term - 208466 208304 163 1 1 81 50 97 0.411 1.63 11.02 Intr - 208730 208670 61 0 1 120 60 36 0.481 0.87 11.01 Init - 209773 209614 160 1 1 71 70 162 0.733 12.74 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 90978 90858 121 0 1 52 74 77 0.821 3.10 S.002 Init - 168275 168198 78 2 0 110 75 34 0.834 5.41 S.003 Sngl - 186132 185911 222 0 0 64 43 210 0.894 9.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:54035284_54247716|GENSCAN_predicted_peptide_1|183_aa MNVVKYTYYNNRFFLESKDFALFYRFDIIGLSPDSQEDESGIKQAAENIKALIDQEVKNG IPSNRIILGGFSQGGALSLYTALTTQQKLAGVTALSCWLPLRASFPQGPIGGANRDISIL QCHGDCDPLVPLMFGSLTVEKLKTLVNPANVTFKTYEGMMHSSCQQEMMDVKQFIDKLLP PID >gi568815590f:54035284_54247716|GENSCAN_predicted_CDS_1|552_bp atgaatgtagtaaaatatacttactacaataatagatttttcttagaatcaaaagatttt gccttattttacaggtttgatattattgggctttcaccagattcacaggaggatgaatct gggattaaacaggcagcagaaaatataaaagctttgattgatcaagaagtgaagaatggc attccttctaacagaattattttgggagggttttctcagggaggagctttatctttatat actgcccttaccacacagcagaaactggcaggtgtcactgcactcagttgctggcttcca cttcgggcttcctttccacagggtcctatcggtggtgctaatagagatatttctattctc cagtgccacggggattgtgaccctttggttcccctgatgtttggttctcttacggtggaa aaactaaaaacattggtgaatccagccaatgtgacctttaaaacctatgaaggtatgatg cacagttcgtgtcaacaggaaatgatggatgtcaagcaattcattgataaactcctacct ccaattgattga >gi568815590f:54035284_54247716|GENSCAN_predicted_peptide_2|286_aa MVIVKMTNNSQVLKGMGDLAFRDDSIQPQEEPAIHPRSSQLVPPREIQDSKELNRTCCLN GGTCMLGSFCACPLSFYGRNSMSRRTLRRIPTVMSNTRTEPPTWGQIKKLSQTAEENLRK AGQPVTMNNLKPGQQSERPCRSKEEEEEKEKEEGEGEAAAAEEMEEAEAAAEENPVEVRG RRRRRKTEEKEEEEEKEKEGREGEGGEGERRKRRRGKRRWRRRRRRRQREKEEEEEKEKE EEEERRKKKRRHADKFLGLIPDPLNQKLAVGPKNQVFQVILMHAIV >gi568815590f:54035284_54247716|GENSCAN_predicted_CDS_2|861_bp atggttatcgtaaaaatgacaaataatagccaagtgttaaagggtatgggagacctggcc ttcagagatgacagcattcagccccaggaggagcctgcaattcatcctcggtcttcccaa cttgtgccgcccagggagatacaggacagtaaggaactaaacagaacctgctgtctgaat gggggaacctgcatgctggggtccttctgtgcctgccccctctccttctatggacggaac tccatgtcgagacggacgctgaggaggatcccaactgtcatgagcaacacccgaacagag ccacccacctggggacagatcaagaagctgtcacagacggcagaagaaaacctgaggaaa gcgggacaaccagtcacaatgaataatttaaagcctgggcaacagagtgagagaccctgt cgaagcaaggaggaggaggaggaaaaggagaaggaagagggggagggggaggcagcggcg gcagaggagatggaagaggcagaggcggcggcggaggagaaccctgtcgaagtcaggggg aggaggaggaggagaaagacggaagagaaggaggaggaggaggagaaggagaaagaagga agagaaggagaaggaggagaaggagaaagaaggaagagaaggagggggaagagaaggtgg aggaggaggagaaggaggagacagagagagaaggaggaagaggaggagaaggagaaggaa gaggaggaagaaagaagaaagaagaagaggaggcatgcagacaaattccttggtctcatt ccagacccactcaatcagaaactggcagtggggcccaagaatcaagtctttcaagtgatt ttgatgcatgctatagtatga >gi568815590f:54035284_54247716|GENSCAN_predicted_peptide_3|96_aa MVRAAGSAGGGGSRGVTAGARASPQLLPALRRKCGPGTPAGRPRARRQPLQRVSAWCVGD FPAWIGRYWVGTIIINSNSNETGSCNFCLRVQRFNV >gi568815590f:54035284_54247716|GENSCAN_predicted_CDS_3|291_bp atggtccgcgcggccgggtccgcggggggcgggggcagccggggggtcacagccggcgcg agggccagtccccagctcctccctgcattgcggaggaagtgtggccccgggacccccgcg ggccggcctagggcacggcgtcagcctctgcagcgtgtttctgcctggtgtgtgggtgat tttcctgcatggattgggagatactgggtaggtaccattattattaatagtaacagtaat gaaactgggtcatgcaacttttgtttacgcgttcaaaggtttaatgtttaa >gi568815590f:54035284_54247716|GENSCAN_predicted_peptide_4|198_aa MMFYKKEIQTIFLFKFKIGRKAVETTCNINNAFGPGTANEHTVQWWFKKFCKGDESREDE EHSGRPSEVDNNQLRVIIEADPLTSTQEVAEELNVNHSAVMRHLKQIEKVKKLDKWVPQE LSENKKSHLSEVSSSRVAGNNNEPFLNWFVMGDEKWILYNNQQRLSQWLDPEEAPKHFSK PDLHQKRSWSLFGGLLSV >gi568815590f:54035284_54247716|GENSCAN_predicted_CDS_4|597_bp atgatgttctacaaaaaggaaattcaaacgattttcttattcaagttcaaaataggtcgt aaagcagtggagacaacttgcaacatcaacaacgcatttggtccaggaactgctaacgaa catacagtgcagtggtggttcaagaagttttgcaaaggagatgagagtcgtgaagatgag gagcatagtggaaggccatcagaagttgacaacaaccaattgagagtaatcattgaagct gatcctcttacaagtacacaagaagttgccgaagaactcaacgtcaaccattctgcagtc atgcggcacttgaagcaaatcgaaaaggtgaaaaagctcgataagtgggtgccacaagag ctgagtgaaaataaaaaaagtcatctttctgaagtgtcatcttctcgtgttgcaggcaac aacaatgaaccatttctcaattggtttgtgatgggtgacgaaaagtggattttgtacaac aatcagcaacgactaagtcagtggctggacccagaagaagctccaaagcacttctcaaag ccagacttgcaccaaaaaaggtcatggtcactgtttggtggtctgctgtcggtctga >gi568815590f:54035284_54247716|GENSCAN_predicted_peptide_5|447_aa MPLHSSLDDKSKTRSQKNKIKTFKGQTGVEEPPKVTEKDQLGMEEENQKLFLVHALVWRK RRFYVQNIKRDPEDKAGSRKIRSHPQTTKRTTETRLQTPTPLPDSRACARGCSATGAGPR PQSSPRDHVASEQLRAPLKVLGSAGYGRSLAGRWGPGPGPTPGPAACEPGQLKAESRLQE TGKWVVAERRPRGRRRGRKCGRGHKGERQRGTRPRLGFEGGQTPFYIRIPKYGFNEGHSF RRQYKPLSLNRLQYLIDLGRVDPSQPIDLTQLVNGRGVTIQPLKRDYGVQLVEEGADTFT AKVNIEVQLASELAIAAIEKNGGVVTTAFYDPRSLDIVCKPVPFFLRGQPIPKRMLPPEE LVPYYTDAKNRGYLADPAKFPEARLELARKYGYILPDITKDELFKMLCTRKDPRQIFFGL APGWVVNMADKKILKPTDENLLKYYTS >gi568815590f:54035284_54247716|GENSCAN_predicted_CDS_5|1344_bp atgccattgcactccagcctagatgacaagagcaaaactcgatctcaaaaaaataaaata aaaacatttaaaggtcagacaggtgtggaggagcccccaaaagtaacggagaaagatcag ctagggatggaagaagaaaaccagaaattgtttttagtgcatgctctcgtttggcgaaag agaaggttttacgttcagaatattaaaagggaccccgaggacaaagcaggatctcggaaa ataagatcacatccccaaacaactaaaaggacaaccgagacaaggttgcagactcccacg cctctgcccgactcccgggcttgcgcgcgcggctgcagcgcgacaggggcggggccacgg ccacaaagcagcccccgagaccacgtggcctccgagcagctcagggcgcccttgaaagtt cttggatctgcgggttatggccggtcccttgcagggcggtggggcccgggccctggacct actccggggcctgccgcgtgtgagcctggccaacttaaagccgaatcccggctccaagaa accggtaaatgggtggtggcggagagaagaccaagaggtcggagaagaggtagaaaatgt ggcagaggccataaaggagaaaggcaaagaggaacccggccccgcttgggctttgaggga ggccagactccattttacatccgaatcccaaaatacgggtttaacgaaggacatagtttc agacgccagtataagcctttgagtctcaatagactgcagtatcttattgatttgggtcgt gttgatcctagtcaacctattgacttaacccagcttgtcaatgggagaggtgtgaccatc cagccacttaaaagggattatggtgtccagctggttgaggagggtgctgacacctttacg gcaaaagttaatattgaagtacagttggcttcagaactagctattgctgccattgaaaaa aatggtggtgttgttactacagccttctatgatccaagaagtctggacattgtatgcaaa cctgttccattctttcttcgtggacaacccattccaaaaagaatgcttccaccagaagaa ctggtaccatattacactgatgcaaagaaccgtgggtacctggcggatcctgccaaattt cctgaagcacgacttgaactcgccaggaagtatggttatatcttacctgatatcactaaa gatgaactcttcaaaatgctctgtactaggaaggatccaaggcagattttctttggtctt gctccaggatgggtggtgaatatggccgataagaaaatcctaaaacctacagatgaaaat ctccttaagtattatacctcatga >gi568815590f:54035284_54247716|GENSCAN_predicted_peptide_6|73_aa MRAQQLCDVPEVTQLLSEPGFKSRFGDMEAVLLQPCYVFSVKHQNAAILGSVRTSESALF SLTRTKLGYTAVI >gi568815590f:54035284_54247716|GENSCAN_predicted_CDS_6|222_bp atgagagcacagcagttatgtgatgtgcccgaggtcacacagctgctgtcagagcctgga ttcaaatccaggtttggtgacatggaagctgtgctacttcaaccatgctatgtcttttca gtaaagcatcaaaatgcagccattctgggatctgtgcgaacatcagagagcgcacttttt tctctgacacgaactaaattgggttacactgctgttatctga >gi568815590f:54035284_54247716|GENSCAN_predicted_peptide_7|262_aa MLCLKEALRYRDLKDLFGTKKQRKAVGNKPRDLVPCVPVAPAVTERGQRRARAVASEGGS SKPWQLPRGVEPTGAQKSRTEVWEPLPRFQKMFGNAWMPRQKFAAGVGPLWRTSAGAVQK ENVGSEPRQRVHTGALPSGTVRRQPPSSRPQNGRSTNSLHHVPGKDAGTQCQPMKAARRE AVPCKATGRELSKTMGTHLLHQRDLDARPGVKGDHFGALKCDCPAGFQTCMGPVTPLFWP ISPIWNGCIYPIPVPPLYLGSN >gi568815590f:54035284_54247716|GENSCAN_predicted_CDS_7|789_bp atgctgtgcctgaaggaagcattgcgttacagagatctcaaagacctttttggaacaaaa aagcaaagaaaggctgtaggaaacaagcctagggacttggtgccctgtgtcccagtcgct ccagctgtgactgaaaggggccaacgtagagctcgggctgtggcttcagagggtggaagc tctaagccttggcagcttccacgtggtgttgagcctacaggtgcacagaagtcaagaact gaggtttgggaacctctgcctagatttcagaagatgtttggaaatgcctggatgcccagg caaaagtttgctgcaggagtggggcccttatggagaacctctgctggagcagtgcagaag gaaaatgtgggatcggagccccgacagagagtccatactggggcactgcctagtggaact gtgagaagacagccaccgtcctccagaccccagaatggtagatccaccaacagcttgcac catgtgcctggaaaagatgcaggtactcaatgccagcccatgaaggcagccaggagggag gctgtaccctgcaaagccacagggagggagctgtccaagaccatgggaacccacctcttg catcagcgtgacctggatgcgagacctggagtcaaaggagatcattttggagctttaaaa tgtgactgccctgctggatttcagacttgcatgggccctgtaacccctttgttttggccc atttctcccatttggaatggctgtatttacccaatacctgtacccccattgtatctagga agtaactag >gi568815590f:54035284_54247716|GENSCAN_predicted_peptide_8|341_aa MGVEKAGDKGESPFATSSRFISKGCIRGMAPASASGEDSGNVQTWPKAKGEPMYHMVETW KNVECQDLKSVRVKIQEIPLFTDKTTKAQRIKGCTQKTAFVDHQEFPVIEYQKATGDSEE HMDFSVTGPRGKPDALSPLLLQQPLQKAPFGRKSLKREARMFRKDLDKGTQAVPGPEGRN WEPEWSNPAAEKLTLENLSRKVFPAFLKQPGKTRELRSENCKKCGSPGGRSPAKITAEAE AGRGDKSASRNAYCVSPFMKRVSGLSLCGDKARRAISMQAVMADTRPESEATAKLRQQGP AASHEGNIMAASLTPDNIQQQLTLSHSPVNAIKLQRCYIRY >gi568815590f:54035284_54247716|GENSCAN_predicted_CDS_8|1026_bp atgggtgtggagaaggcaggagacaaaggagagtccccctttgctacttccagcagattt atcagcaagggctgtataagaggcatggcaccagcatctgcttctggtgaggactcagga aacgtacaaacatggccgaaggcaaagggggagcccatgtatcacatggtagagacatgg aaaaacgtagaatgccaggatcttaaatctgtaagagtaaagattcaggaaattccctta tttacagataagacaactaaggcccagcgtattaagggctgcactcagaagactgctttt gtggaccaccaggaatttcctgtgatagaatatcagaaagctactggtgactcggaggag cacatggacttctcagtcacaggaccccgcgggaagccagatgcactcagccctttactt ctccaacagccgttacaaaaggcaccatttggaagaaagtcattgaagagagaagccaga atgttcaggaaggatctagacaaggggacccaggcagtccccgggcctgaaggacgcaac tgggaaccggagtggagtaacccggcagcagaaaaactgaccctggagaatttaagcaga aaagtattccctgcatttctcaaacaacctggaaagacgcgagaactgcgctcagaaaac tgcaagaaatgcggcagcccaggtgggaggagtccagccaaaatcacagcagaagcggaa gccggcagaggggataaaagtgcttcaaggaacgcttactgtgtttcgcccttcatgaaa agagtcagtgggctgtcactgtgtggagacaaagccaggagggctatcagcatgcaggca gtgatggcagatacacgccctgagagtgaagccacagcgaagcttcgacagcaggggcca gcggcgagtcatgaggggaatataatggcagcttctctaactccggacaatatccagcag cagctaacactttctcattctccagttaatgcaataaaattgcagagatgctacatacgg tattaa >gi568815590f:54035284_54247716|GENSCAN_predicted_peptide_9|166_aa MPRGSRSHTSYRAPPTSQAPQMRATPRLAPAAQPPAVAPPSAVGSPAPAPRQPSLMAQMA TTAAGVAVGSAVGHTLGYALTGSFSGGSNGEPAKLDITYQQHQGTWPAQQQQPCFHEIKQ FLECAQNQGDTNKLCGAFKRFSNSADLQTDRPNQEVQIGEMENQLS >gi568815590f:54035284_54247716|GENSCAN_predicted_CDS_9|501_bp atgccacgtggaagccgaagccacacctcctacagggcccctcccaccagccaggcacct cagatgagagctacacccaggctagcaccagcagctcagccgccagcagtggcaccccca tctgcagttggctctcctgctcctgcaccccggcagccaagtctgatggcccaaatggca accactgcagctggcgtggctgtaggctctgctgtgggtcacacactgggttatgccctc actgggagcttcagtggaggaagtaatggtgagcctgcaaaacttgacatcacttaccag cagcatcagggaacctggccagcacagcagcagcagccttgtttccatgagatcaaacag tttttggagtgtgcccagaaccagggtgacaccaacaagctctgtggggctttcaagagg ttctcaaacagtgcagacttgcaaacagataggcctaatcaagaagttcaaattggagaa atggaaaatcagctctcctaa >gi568815590f:54035284_54247716|GENSCAN_predicted_peptide_10|251_aa MNLGKSSTSTSAPAIQNHSRISFPPLNTPPSLSQVPAPRTLVLGIKQWRCMPEEKELGPE RWKQKFREVRYSLKNAETKSGTEVSQLLVKCRQQTQWANGITFKRKDLGTGLLGCGPPWS PPLALWALNIPYVNWTAPPLLLRQPPRVPRGAEGLGEEGLLSNCLQSWLFNPGADNANTK MSCDPEIRVQQETLKVTVTRISILEMKPPMPSEWKELPQSHEWQSHIDAGIGPASALIQG PWAFDNMDQRD >gi568815590f:54035284_54247716|GENSCAN_predicted_CDS_10|756_bp atgaacttgggcaagtcctcaacatctacaagtgcaccggcaatacagaaccacagcagg atctcctttcccccacttaacacccctccatcactatcacaagttcctgctccccgcaca cttgtcttgggaatcaagcagtggagatgtatgccggaagagaaggagcttgggcctgag aggtggaagcagaagtttagagaggtcaggtattcactaaagaatgctgaaaccaagagt ggaaccgaagtttctcaacttctagtaaaatgccgacagcagacccagtgggccaatggc atcacatttaagaggaaggatttagggacaggacttctgggttgcggtcccccttggtca cctcccttagctctgtgggccttaaatatcccatatgtgaattggacagcaccaccactt ctgctgcgccagcctcccagggtccctcgaggtgcagagggactcggtgaagagggcctt cttagtaactgtctccaatcctggttattcaatcctggtgcagacaatgccaatacaaag atgtcatgtgacccagaaatccgtgtgcagcaagaaaccttgaaggtgacagtcaccagg atctccattttagagatgaagccaccgatgcccagtgagtggaaggagctgccacagagc catgagtggcagagccatatagacgcaggcattggcccagctagtgctctcatccagggg ccatgggccttcgacaatatggatcagcgggactag >gi568815590f:54035284_54247716|GENSCAN_predicted_peptide_11|127_aa MAAKCTRPGKERRRAVGLEQGSGESLALNAAPLRGPVGAVGDPWHTFSDITHLTLSQLKS KGGKTRDNYLKMERVLLSKTGQRRPCQLSESDSPGEPEFCPRGHRMRKTVTEDTEPPQSP QDHSQES >gi568815590f:54035284_54247716|GENSCAN_predicted_CDS_11|384_bp atggcagccaagtgtacacggccgggtaaggagaggcgaagagcagtgggccttgagcag gggagcggggagtccctggcactcaacgctgcgccgctgagaggccccgtgggagctgtc ggggatccatggcacactttctctgacataacacatctcaccctctctcaacttaaatca aagggaggaaaaactcgagataattatctgaagatggagagggtcctgcttagtaagaca ggacagcgaaggccatgccaactcagtgagtctgactccccgggagaaccagagttttgc cctagaggtcatcgcatgaggaagactgtaactgaagacactgagcctccccagagcccg caagaccacagccaggaaagctag