GENSCAN 1.0 Date run: 4-Nov-116 Time: 10:44:51

Sequence gi568815574f:13186638_13387375 : 200738 bp : 37.91% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  13225  13303   79  0  1   43   48    86 0.427   1.37
 1.02 Intr +  18400  18501  102  2  0   -2  108    79 0.337   0.13
 1.03 Intr +  21988  22220  233  2  2   10   64   722 0.813  58.57
 1.04 Term +  41538  41642  105  2  0   87   35    68 0.140  -1.17
 1.05 PlyA +  44813  44818    6                               1.05

 2.03 PlyA -  45017  45012    6                               1.05
 2.02 Term -  47736  47581  156  0  0   69   42   179 0.819   8.35
 2.01 Init -  48171  47824  348  0  0   72   70   122 0.720   6.04
 2.00 Prom -  58140  58101   40                              -4.25

 3.00 Prom +  59944  59983   40                              -6.75
 3.01 Init +  67278  67334   57  2  0   67   75    28 0.337   0.76
 3.02 Intr +  69818  70190  373  1  1   96   -1   249 0.676  10.51
 3.03 Term +  70613  70962  350  0  2   79   49   211 0.821   9.96
 3.04 PlyA +  71474  71479    6                               1.05

 4.00 Prom +  73964  74003   40                              -2.55
 4.01 Init +  78699  78828  130  2  1   79   89    71 0.741   6.67
 4.02 Term +  82156  82313  158  2  2   93   44   130 0.802   6.21
 4.03 PlyA +  83448  83453    6                               1.05

 5.00 Prom +  86551  86590   40                              -7.15
 5.01 Init +  92769  92775    7  2  1   83   91     0 0.180   0.89
 5.02 Intr +  99615  99649   35  1  2   96   94    25 0.223   0.92
 5.03 Term + 100160 100741  582  1  0   48   41   514 0.922  36.21
 5.04 PlyA + 100883 100888    6                               1.05

 6.10 PlyA - 104421 104416    6                               1.05
 6.09 Term - 110431 110011  421  0  1   65   50   138 0.596   1.48
 6.08 Intr - 111211 111070  142  2  1  133   97    94 0.995  13.29
 6.07 Intr - 112507 112320  188  0  2  115   66   102 0.939   9.11
 6.06 Intr - 116354 116240  115  0  1  119   56    50 0.993   3.49
 6.05 Intr - 117515 117464   52  2  1   97  100    33 0.184   2.96
 6.04 Intr - 137123 136918  206  0  2   51   89   138 0.075   8.20
 6.03 Intr - 138069 137964  106  0  1   73   87    15 0.060  -1.23
 6.02 Intr - 139713 139584  130  2  1   82   83    37 0.055   2.38
 6.01 Init - 149485 148926  560  1  2   42   92   394 0.015  29.71
 6.00 Prom - 149970 149931   40                              -7.15

 7.00 Prom + 154878 154917   40                              -5.65
 7.01 Sngl + 155074 155430  357  0  0   63   43   317 0.894  20.51
 7.02 PlyA + 155526 155531    6                              -3.94

 8.00 Prom + 155939 155978   40                              -7.65
 8.01 Init + 156388 156862  475  0  1   85   28   255 0.016  14.93
 8.02 Intr + 157120 157239  120  2  0   32   64    94 0.008   0.95
 8.03 Intr + 157772 157901  130  0  1   33   98    80 0.011   2.33
 8.04 Intr + 158099 158371  273  2  0  -28   98   152 0.469   0.33
 8.05 Intr + 158533 158699  167  1  2   85   65   255 0.996  21.58
 8.06 Intr + 159248 159533  286  0  1   52   57   183 0.715   7.08
 8.07 Intr + 159689 159860  172  2  1   79   40    86 0.434   1.92
 8.08 Term + 164135 164281  147  1  0   77   52    77 0.202  -0.08
 8.09 PlyA + 164360 164365    6                               1.05

 9.07 PlyA - 165069 165064    6                               1.05
 9.06 Term - 168761 168327  435  2  0   37   39   174 0.555   1.70
 9.05 Intr - 169381 169286   96  1  0  101   55    59 0.897   3.19
 9.04 Intr - 171982 171827  156  1  0   66   71    93 0.947   4.69
 9.03 Intr - 173349 173130  220  2  1   93   98   108 0.998   9.68
 9.02 Intr - 179756 179630  127  0  1   65  100    61 0.511   3.82
 9.01 Intr - 182712 182619   94  0  1   99   56    69 0.448   3.42

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl + 156388 156981  594  0  0   85   48   294 0.979  21.04
S.002 Init + 157780 157901  122  0  2  121   98    78 0.861  11.81

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815574f:13186638_13387375|GENSCAN_predicted_peptide_1|172_aa
MLEEEKANQCGLIYMNNRQGNSYGNKVQKLPYIGNGNITFGGTYCDCVCSVPNHAQKGLS
KKKEEEEEEEEEEEEEEEEGGEEGGEEGGEEGGEEGGEEEEEEEEEEEEEEEEEEEEEEE
EEEEEEEEEEEESQERSQFYSGVFVSALDLEKCLELFLVPLSSSLPLNLSTG

>gi568815574f:13186638_13387375|GENSCAN_predicted_CDS_1|519_bp
atgttggaggaagagaaagccaaccaatgtggcctgatatacatgaacaacagacaagga
aatagttatggtaataaagttcaaaagctcccatatattggaaatggcaatattactttt
gggggaacttactgcgactgcgtgtgcagcgtgcctaaccatgctcaaaaaggtctttca
aaaaagaaagaagaagaagaagaagaagaagaagaggaagaagaggaagaagaagaaggg
ggagaagaagggggagaagaagggggagaagaagggggagaagaagggggagaagaagaa
gaagaagaggaagaggaagaggaagaggaagaagaagaagaagaagaagaagaagaagaa
gaagaagaagaagaagaagaagaagaagaagaagagagccaggaaaggagccagttctac
agcggtgtcttcgtctcagccttggatctggaaaaatgcctagagcttttcctggtgcct
ctttcttcctctctgcctctcaacctctccactggttaa

>gi568815574f:13186638_13387375|GENSCAN_predicted_peptide_2|167_aa
MDQSAFTSSLLRPIKALGSGRAEQTSGDQLQKGATHSRASSLLRAAEMTRRPASREELPD
PGLFCHSIKLLFVLFSFHLSTFLILPCQRRRTWELLDGGAESCKTNRANTPIAHYLDLVV
SGVSKLPVVTAAVEVACGVPASTAPSQRAGAYAGTYSCPSLCSSQHA

>gi568815574f:13186638_13387375|GENSCAN_predicted_CDS_2|504_bp
atggaccagtcggcattcacttcctcccttctgcggcccataaaagccctgggttcaggc
agagctgagcagacatcaggtgaccagctgcagaaaggagctacccactccagggcctca
tctctgctgagagctgcagagatgaccagaagacctgcatccagagaggagcttcctgat
ccagggctattctgtcactcaataaagcttctctttgtcttgttctccttccatttgtcc
acgttcctcattcttccctgccaaaggagaagaacttgggagctgctggatggtggggct
gaaagctgtaaaacaaacagggctaacacgcccattgctcactaccttgaccttgtggtt
tcgggagtctccaagcttcctgttgtcactgcagctgtggaagttgcttgtggtgtgcct
gcttcaactgcaccctcacagagagctggtgcctatgccggcacctacagctgcccatcc
ttgtgcagcagccagcatgcttga

>gi568815574f:13186638_13387375|GENSCAN_predicted_peptide_3|259_aa
MPQNSLEECALELGKKSLQENVNNFPKTKLFQFLKLTNWILPKITKFKPIEGAENVFTDG
SSNGKASYFGLKGKVFQTPYTAAQKVELVAVIEVLTAFDMPVSMISDSTYVVHSTQLTEN
AQLRLHTDEQLMTLFSQLQTAVRMFCGDGHSSFYYNNNAPGYTSQALATFFSMWNIKRIT
GIPYNSQGQAIVERMNLSLKQQLQKQTEGDREYGTPQMQLNLALLTLNFLSLPKGQMLSA
AEQHLQKPAVKTEAEQLIW

>gi568815574f:13186638_13387375|GENSCAN_predicted_CDS_3|780_bp
atgccacagaacagcttagaggagtgtgcattagagcttgggaaaaaatcacttcaggaa
aatgtcaataattttcctaaaacaaagctgtttcagtttttgaaattaactaattggatt
ctccctaaaataactaaatttaaaccaattgaaggtgctgagaatgttttcacagatggg
tctagtaacggtaaagcttcttattttggattaaaaggtaaagttttccagacgccctat
actgcagctcaaaaagtggagctggtagcggtaattgaggtattgactgcttttgatatg
cctgttagtatgatttctgattctacatacgtggttcattccacacagttaactgaaaat
gctcagttacgacttcatacagatgaacaactgatgactttattttctcaactgcaaaca
gcagttagaatgttttgcggtgatgggcattccagcttctactacaacaataatgcccca
ggttatactagccaagctctagctacatttttctctatgtggaatattaaacgcattact
ggtatcccatacaattctcaaggacaagccatagtggaaagaatgaatctctccctaaaa
cagcagttgcaaaagcagactgagggagacagagaatatggaaccccacagatgcaactg
aatctagcattattaactttaaattttttgagcctgcccaaaggccagatgttatcagca
gctgaacagcatctacagaaaccagctgtaaagacagaagcagaacaactgatttggtga

>gi568815574f:13186638_13387375|GENSCAN_predicted_peptide_4|95_aa
MAIFSMLILPIHEHRMFFHLFVSSLIPLSTGFVVLLEEVLHIPCLLEFAGGPLQTLFTWV
SPAKAAEQQRLLPAPSSGSFVPERHSSDASWSSAL

>gi568815574f:13186638_13387375|GENSCAN_predicted_CDS_4|288_bp
atggccattttctcaatgttgattcttcctatccatgagcatcgaatgtttttccatttg
tttgtgtcctctcttattccgttaagcactggttttgtagttctacttgaagaggtcctt
cacatcccttgtttgctggagtttgctggaggtccactccagaccttgttcacctgggta
tcaccagcgaaggctgcagaacagcaaagactgctgcctgctccttcctctggaagcttt
gtcccagagcggcactcatcagatgccagctggagctctgctctatga

>gi568815574f:13186638_13387375|GENSCAN_predicted_peptide_5|207_aa
MPGDCLQCIHQATEKKVPDKLLDSSTVTPLFKITENIGCVMTGMTADSRSQVQRARYEAA
NWKYKYGYEIPVDMLCKRIADISQVYTRNAEMRPLGCCMILIGIDEEQGPQVCKCDPAGY
YCGCKATAVGVKQTESTSFLEKKVKKKFDWTFEQTLETAITCLSTVLSIDFKPSEIEVGV
VTVENPKFRILTEAEIDAHLVALAERD

>gi568815574f:13186638_13387375|GENSCAN_predicted_CDS_5|624_bp
atgccaggcgactgccttcagtgcatacatcaggccactgagaagaaagtacctgacaaa
ttattggattccagcacagtgactcccttattcaagataactgaaaacattggttgtgtt
atgaccggaatgacagctgacagcagatcccaggtacagagggcacgctatgaggcagct
aactggaaatacaagtatggctatgaaattcctgtggacatgctgtgtaaaagaattgcc
gatatttctcaggtctacacacggaatgctgaaatgaggcctcttggttgttgtatgatt
ttaattggtatagatgaagagcaaggccctcaggtatgtaagtgtgatcctgcaggttac
tattgtgggtgtaaagccactgcagtgggagttaaacaaactgagtcaaccagcttccta
gaaaaaaaagtgaagaagaaatttgactggacatttgaacagactttggaaactgcaatt
acatgcctgtctactgttctatcaattgatttcaaaccttcagaaatagaagttggagta
gtgacagttgaaaatcctaaattcaggattcttacagaagcagagattgatgctcatctt
gttgctctagcagagagagactaa

>gi568815574f:13186638_13387375|GENSCAN_predicted_peptide_6|639_aa
MFTKESKPSKNRSLVPETSRHTGDTSNGCADVKGLSNHVHQLIADAVSSPNHGDSPNLLI
ADNPQLSALLIGKANGNVGTGTCDKVNNIHPAVHTKTDHSVASSPSSAISTATPSPKSTE
QRSINSVTSLNSPHSGLHTVNGEGLGKSQSSTKVDLPLASHRSTSQILPSMSVSICPSST
EVLKACRNPGKNGLSNSCILLDKCPPPRPPTSPYPPLPKDKLNPPTPSIYLENKRDAFFP
PLHQFCTNPKNPVTVIRGLAGALKLDLGLFSTKTLVEANNEHMVEVRTQLLQPADENWDP
TGTKKIWRCESNRSHTTIAKYAQYQASSFQESLRVELIKKCFEACSKISVASHQENNNFC
SVNINIGPGDCEWFVVPEDYWGVLNDFCEKNNLNFLMSSWWPNLEDLYEANVPVYRFIQR
PGDLVWINAGTVHWVQAVGWCNNIAWNVGPLTACQYKLAVERYEWNKLKSVKSPVPMVHL
SWNMARNIKVSDPKLFEMINNIKWTETENHYLKMGYHFLPNIAENAEADLQLGNRQRLKE
LKDQARKSLYCNEWVILMRAHKSRRQGKVWSFLGIGKLFLAKMLIEKVKDLLMRSQMEMM
NLLGTGAKVTLVTLQQRTWLCFHCPRAWWEAELNGDDLG

>gi568815574f:13186638_13387375|GENSCAN_predicted_CDS_6|1920_bp
atgtttaccaaagagagcaagccttcaaaaaatagatccttggtgcctgaaacaagcagg
catactggagacacatctaatggctgtgctgatgtcaagggactttctaatcatgttcat
cagttgatagcagatgctgtttccagtcctaaccatggagattcaccaaatttattaatt
gcagacaatcctcagctctctgctttgttgattggaaaagccaatggcaatgtgggtact
ggaacctgtgacaaagtgaataatattcacccagctgttcatacaaagactgatcattct
gttgcctcttcaccctcttcagccatttccacagcaacaccttctcctaaatccactgag
cagagaagcataaacagtgttaccagccttaacagtcctcacagtggattacacacagtc
aatggagaggggctggggaagtcacagagctctacaaaagtagacctgcctttagctagc
cacagatctacttctcagatcttaccatcaatgtcagtgtctatatgccccagttcaaca
gaagttctgaaagcatgcaggaatccaggtaaaaatggcttgtctaatagctgcattttg
ttagataaatgtccacctccaagaccaccaacttcaccatacccacccttgccaaaggac
aagttgaatccacccacacctagtatttacttggaaaataaacgtgatgctttctttcct
ccattacatcaattttgtacaaatccaaaaaaccctgttacagtaatacgtggccttgct
ggagctcttaaattagatcttggacttttctctaccaaaactttggtagaagctaacaat
gaacatatggtagaagtgaggacacagttgctgcaaccagcagatgaaaactgggatccc
actggaacaaagaaaatctggcgttgtgaaagcaatagatctcatactacaattgccaaa
tacgcacaataccaggcttcctccttccaggaatcattgagagttgaacttatcaagaaa
tgttttgaagcctgcagcaaaatttccgtggccagtcaccaagaaaataacaacttctgc
tctgttaacataaatattggtccaggagattgtgaatggtttgttgtacctgaagattat
tggggtgttctgaatgacttctgtgaaaaaaataatttgaattttttaatgagttcttgg
tggcccaaccttgaagatctttatgaagcaaatgtccctgtgtatagatttattcagcga
cctggagatttggtctggataaatgcaggcactgtgcattgggttcaagctgttggctgg
tgcaataacattgcctggaatgttggtccacttacagcctgccagtataaattggcagtg
gaacggtatgaatggaacaaattgaaaagtgtgaagtcaccagtacccatggtgcatctt
tcctggaatatggcacgaaatatcaaagtctcagatccaaagctttttgaaatgattaac
aacattaaatggactgagacagaaaatcattatttaaaaatgggataccactttttgcca
aatatagctgaaaatgcagaagcagatttgcaactgggtaataggcagaggttgaaagag
ttgaaggaccaggctaggaaaagtttgtattgtaatgaatgggtgattctgatgagggct
cacaagtccagaagacaagggaaagtttggagcttcttggggattggtaaactgtttctc
gccaaaatgctgatagagaaagtaaaagacttgctgatgagatctcagatggaaatgatg
aacttactgggaactggagcaaaagtcacccttgttactctgcagcagagaacctggctg
tgtttccattgccctagggcttggtgggaagctgaacttaacggtgatgacctagggtag

>gi568815574f:13186638_13387375|GENSCAN_predicted_peptide_7|118_aa
MHFVVDTGVEHSVGTQAVGPLSKNYVNINGAAGVTEKRPYFKSKRCVTGGQEVQNEFLYL
PNSLVPLLGRDLLQKLQAQISFTREGDMTLNLGQRKAMIMTLTIPTTEEWRLREMQNL

>gi568815574f:13186638_13387375|GENSCAN_predicted_CDS_7|357_bp
atgcactttgtagtcgatactggtgtggagcattcggtaggaactcaagcagttggacca
ctgtctaaaaattatgtcaatataaatggggctgcaggagtaacagaaaagaggccttac
ttcaaatctaaaagatgtgtgactggaggacaggaagtccaaaatgaatttttatatttg
ccaaatagtctggtgcccttgttaggaagagacttgctccagaaattgcaagcacaaatc
tcctttacccgagaaggggacatgactttaaacctaggtcaaagaaaagccatgataatg
acccttaccatccccacaacagaggaatggagactacgagagatgcaaaatttgtaa

>gi568815574f:13186638_13387375|GENSCAN_predicted_peptide_8|589_aa
MAFKEIKKALIHALALGLPDMTKPFYVYIYERKGISIRVLVQTLGSWYWPVACLSQRLDL
VAMGWPPCFKAVAATALLAEDANKLTFGQRLIIQVPHTIVTLMGQRGHGWLSNPRMLRYQ
ELLCGNPYIILVTLNTLDPRTLLPKEWAEHRKPPLCCPAQRVELTALTRASLVAKGKSVN
IYTDPRCAFAILHSHGAIWANMAPRDSRNWTTPCENLPVDFIELPQVGGCRYMLVFVCTF
SSSVKVERMKETLKQLLKVLPGNLSKVKSGAAHGPSVSQRHPTKLTGYSPYEMVFGSPPT
IITQIKGDLKEIGELTLRRQMQALGEAMQEIQRYHTLGSSQPAETSSDHNSGPAASQQNP
DYPKQLILRQNYATADKDNCPALTIPEAEIRVSKPVNKKEVFPHSHKGPVSIHFDACQAS
HLSKLNTIGTIRKNLGQERVSSRATKAVTGKSKKECPDCDNQWTTHEFISTYTQEGLLYL
PAKRKTQRTQVQVSPMQHFRFYTSFNEHFNPEVSKNQIPPISAENLFAQLAESTANNLRV
TIPDIVWKSTVKIPVLFCSILITGAYSPLSCTAACSIDHDPLTWTLLKL

>gi568815574f:13186638_13387375|GENSCAN_predicted_CDS_8|1770_bp
atggccttcaaggaaatcaagaaggctttgatccacgccctggcattaggactgccagac
atgacaaagcctttttacgtgtatatctatgaaagaaaaggaatatctataagagtcttg
gtacaaacactagggtcatggtattggcccgtggcatgtttgtcccagcgactagacttg
gtggctatgggatggcctccctgtttcaaggcagtggcagccactgccctgttagcagaa
gatgctaacaagctcacatttggacagaggttgataattcaggtgccccatacgatcgtc
accctgatggggcagagggggcatggctggctttctaaccctaggatgttaagatatcaa
gaacttttgtgtggaaacccctacataatcctggtgactctgaatactctagatccacgc
acactgctgccaaaagagtgggcagagcacagaaagcccccgttgtgttgcccagcccaa
agagttgagctaacagccctcactagagcatcgctcgtggccaaaggaaagtcagtaaac
atctatactgacccaaggtgtgcttttgccattttgcattcccatggagccatatgggcc
aacatggcccccagggattcaagaaactggactacaccctgtgaaaacctgcctgtggac
tttatcgagctgcctcaagttggaggctgccggtacatgctagtgtttgtctgcactttc
tcaagttctgtaaaagttgaaagaatgaaggagacactcaaacagctgttaaaggttttg
ccaggaaacttatctaaggtgaaatcaggagctgcccatggtccttctgtgagtcagagg
caccctactaaattaactgggtattcaccctatgagatggtgtttggctcaccacccaca
atcataactcagataaaaggggatttaaaagaaattggggaattaaccttaagaaggcaa
atgcaagccttaggtgaggccatgcaggaaatacaaaggtatcacaccttgggttcatca
cagccagctgaaaccagcagcgaccacaactcaggacccgcggccagtcaacaaaaccca
gactacccaaaacagttgatcctgcggcaaaactatgccactgctgacaaggataactgc
cctgctctgaccataccggaggctgaaatcagggtcagtaaacctgtaaacaaaaaagaa
gtattccctcactcgcataaaggccctgtctccatacattttgatgcctgccaagcttca
catctcagtaaactaaatactattgggaccatccgtaaaaatctaggacaagaaagagtc
agcagcagagctaccaaggctgtaacaggaaaatccaaaaaggagtgccctgattgtgat
aatcagtggaccacacatgaatttatcagcacctatacacaggaagggttgctctatttg
ccagccaagaggaaaacccaaaggacccaagttcaagttagcccaatgcaacactttagg
ttttatacatctttcaatgaacactttaatcctgaagtatcaaaaaatcaaattcctcct
atatcagctgaaaacctgtttgcccagctagccgaaagtactgctaacaatttaagagtt
actattccagacattgtatggaaaagcactgtgaaaatccctgtcctgttctgttccatt
ctgattactggtgcatacagccccctgtcatgtaccgctgcttgctcaatcgatcacgac
cctctcacatggaccctcttaaagttgtga

>gi568815574f:13186638_13387375|GENSCAN_predicted_peptide_9|375_aa
RKYHSAKEAYEQLLQTENLPAQVKATVLQQLGWMHHNMDLVGDKATKESYAIQYLQKSLE
ADPNSGQSWYFLGSVLYQQQNQPMDALQAYICAVQLDHGHAAAWMDLGTLYESCNQPQDA
IKCYLNAARSKRCSNTSTLAARIKFLQAYRAHDPNTEHVLNHSQTPILQQSLSLHMITSS
QVEGLSSPAKKKRTSSPTKHLEQLRANRDNLNPAQKHQLEQLESQFVLMQQMRHKEVAQV
RTTGIHNGAITDSSLPTNSVSNRQPHGALTRVSSVSQPGVRPACVEKLLSSGAFSAGCIP
CGTSKILGSTDTILLGSNCIAGSESNGNVPYLQQNTHTLPHNHTDLNSSTEEPWRKQLSN
SAQVKKGLAAFLALI

>gi568815574f:13186638_13387375|GENSCAN_predicted_CDS_9|1128_bp
aggaagtatcattctgcaaaggaggcatatgaacaacttttgcagacagaaaaccttcct
gcacaagtaaaagcaactgtattgcaacagttaggttggatgcatcataatatggatcta
gtaggagacaaagccacaaaggaaagctatgctattcagtatctccaaaagtctttggag
gcagatcctaattctggccaatcgtggtattttcttggaagtgtgttgtatcagcagcaa
aatcagcctatggatgctttacaggcatatatttgtgctgtacaattggaccatgggcat
gccgcagcctggatggacctaggtactctctatgaatcctgcaatcaacctcaagatgcc
attaaatgctacctaaatgcagctagaagcaaacgttgtagtaatacctctacgcttgct
gcaagaattaaatttctacaggcttatagagctcatgatccaaatactgaacatgtatta
aaccacagtcaaacaccaattttacagcaatccttgtcactacacatgattacttctagc
caagtagaaggcctgtccagtcctgccaagaagaaaagaacatctagtccaacaaagcac
ttggaacaactgcgagcaaatagagataatttaaatccagcacagaagcatcagctggaa
cagttagaaagtcagtttgtcttaatgcagcaaatgagacacaaagaagttgctcaggta
cgaactactggaattcataacggggccataactgattcatcactgcctacaaactctgtc
tctaatcgacaaccacatggtgctctgaccagagtatctagcgtctctcagcctggagtt
cgccctgcttgtgttgaaaaacttttgtccagtggagctttttctgcaggctgtattcct
tgtggcacatcaaaaattctaggaagtacagacactatcttgctaggcagtaattgtata
gcaggaagtgaaagtaatggaaatgtgccttacctgcagcaaaatacacacactctacct
cataatcatacagacctgaacagcagcacagaagagccatggagaaaacagctatctaac
tccgctcaggtaaaaaaaggactagctgctttcttggctcttatatag