GENSCAN 1.0 Date run: 7-Nov-116 Time: 18:41:45 Sequence gi568815583f:66603259_66881532 : 278274 bp : 49.60% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 6459 6498 40 2.14 1.01 Init + 7569 7576 8 2 2 103 91 0 0.460 2.30 1.02 Intr + 16162 16275 114 1 0 59 85 53 0.344 1.66 1.03 Intr + 18703 18789 87 1 0 88 41 63 0.256 0.69 1.04 Intr + 19110 19203 94 0 1 70 49 60 0.328 0.27 1.05 Intr + 26540 26624 85 1 1 105 23 96 0.400 4.19 1.06 Intr + 32855 32954 100 0 1 93 98 34 0.489 4.07 1.07 Term + 38653 38788 136 1 1 66 46 80 0.131 -0.91 1.08 PlyA + 39946 39951 6 -0.45 2.08 PlyA - 40315 40310 6 1.05 2.07 Term - 40878 40713 166 2 1 77 42 110 0.201 2.69 2.06 Intr - 49368 49226 143 0 2 76 32 149 0.443 7.25 2.05 Intr - 53021 52892 130 1 1 120 47 -14 0.018 -1.60 2.04 Intr - 65714 65464 251 2 2 77 35 125 0.050 2.34 2.03 Intr - 67273 67209 65 2 2 91 42 63 0.452 0.44 2.02 Intr - 74450 74274 177 0 0 56 84 56 0.398 1.89 2.01 Init - 77104 77095 10 1 1 80 115 -2 0.473 2.32 2.00 Prom - 80038 79999 40 -2.86 3.03 PlyA - 81159 81154 6 -0.45 3.02 Term - 81636 81511 126 0 0 110 47 75 0.795 3.88 3.01 Init - 81859 81809 51 1 0 66 90 16 0.382 1.03 3.00 Prom - 93218 93179 40 -2.56 4.00 Prom + 96050 96089 40 -2.86 4.01 Init + 97479 97569 91 2 1 47 103 28 0.562 0.88 4.02 Intr + 99993 100817 825 1 0 38 105 1153 0.925 103.39 4.03 Intr + 108410 108466 57 0 0 116 106 69 0.495 10.36 4.04 Intr + 113163 113240 78 1 0 129 76 47 0.255 7.12 4.05 Intr + 116142 116267 126 1 0 57 57 92 0.210 3.65 4.06 Term + 163472 163686 215 0 2 79 48 118 0.062 4.29 4.07 PlyA + 166476 166481 6 1.05 5.00 Prom + 166495 166534 40 -6.46 5.01 Sngl + 177747 178277 531 2 0 84 40 1263 0.980 117.17 5.02 PlyA + 178711 178716 6 1.05 6.00 Prom + 202409 202448 40 -4.66 6.01 Sngl + 205420 205998 579 0 0 72 42 453 0.952 35.28 6.02 PlyA + 206004 206009 6 -4.04 7.00 Prom + 206021 206060 40 -11.43 7.01 Init + 206145 206276 132 2 0 92 56 99 0.978 7.25 7.02 Intr + 206412 206466 55 2 1 86 84 63 0.912 4.25 7.03 Term + 206649 207292 644 1 2 57 49 690 0.949 56.33 7.04 PlyA + 207440 207445 6 1.05 8.05 PlyA - 207452 207447 6 1.05 8.04 Term - 209791 209399 393 1 0 57 48 310 0.020 18.93 8.03 Intr - 236109 235968 142 2 1 67 93 47 0.358 3.46 8.02 Intr - 238974 238844 131 0 2 107 56 54 0.361 3.59 8.01 Init - 241804 241799 6 1 0 100 94 0 0.221 2.77 8.00 Prom - 243448 243409 40 -6.16 9.00 Prom + 247857 247896 40 -4.16 9.01 Init + 259375 259432 58 0 1 77 64 71 0.656 5.07 9.02 Intr + 260611 260755 145 2 1 97 15 99 0.279 2.84 9.03 Term + 262703 262916 214 2 1 73 42 122 0.535 2.80 9.04 PlyA + 263628 263633 6 1.05 10.04 PlyA - 263731 263726 6 1.05 10.03 Term - 274226 274188 39 2 0 136 44 13 0.553 -1.11 10.02 Intr - 274912 274852 61 0 1 108 97 33 0.772 4.94 10.01 Intr - 277779 277694 86 0 2 88 49 56 0.291 0.32 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 144243 144074 170 0 2 85 110 79 0.817 8.81 S.002 Intr - 174416 174366 51 0 0 100 95 35 0.841 4.28 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:66603259_66881532|GENSCAN_predicted_peptide_1|207_aa MPSLGSQPGKPSPGGAATCTRFEARTLRFPGVVTLVEQGPWASTPRLRKGQLCPHRAPRG LAERNRVTERAAVATGRSRLIGRPCEPAADPAWPSPGCPSQSWSPVLTAESILNCAAKHR GMGVVELLREAYLRITEEGYCLDSQQSVFSECKIPQTPDSSQQAWDGFSPLLTSGQGHPG VGKPVVEDIQRQCGLEFALFPAQPSRP >gi568815583f:66603259_66881532|GENSCAN_predicted_CDS_1|624_bp atgcccagcctcgggtcacagccagggaaacccagccccggaggcgcagcaacctgcacc cgtttcgaggcccgcactttgcgcttcccaggggttgttaccctggtcgagcaaggtccc tgggcgtcaactccccgtctgcggaaagggcagctttgtccccatcgcgcccctcgaggc ctggctgagaggaaccgggtgaccgagagggccgccgtggcgacgggccgctcccgcctt atcggccgcccctgcgagcccgcggccgaccccgcctggccctccccgggctgcccctcc cagagctggtcccctgtgctgacagcagagagtattctcaactgtgcagcaaagcaccgg ggcatgggggtggtggagctgctccgggaggcctatttgagaattactgaggaagggtac tgcttggactcacagcagagtgtgttctccgaatgtaagataccccaaactcctgacagc tctcaacaggcttgggatggcttttctcctctgctgacctctggccagggccaccctggt gtgggcaaaccagtggtggaggacatccagagacaatgtggcctggagtttgccttgttc cctgctcagccctccaggccctaa >gi568815583f:66603259_66881532|GENSCAN_predicted_peptide_2|313_aa MPHDSTADPHHRCPFSPRLPPQTSSHPSRIERYVCGLATIYPKHPPYYHHSHKYLLHAYY VPVCGPLTFCTSLLVTVIRIVLQIPLAGLLPNGPMGGIEAWNKSGLLMGWPATLSDHTQS CRASQPACPDGASVDLKPDNWMKAYGTQAHEIPKSSGASSTGRELAGRTPQCLLQSYPPC SAAPPSDTQPLDTTVGTAEWLHEIWAQIHLWSPDKLQILQEEMHMLNSDFLPASLKVNPQ DTASKPNHLVGSQGYSVGGDPRGHRPNAEGLSEAPGTHILALNRSWQPGESKAVLMEPDE QPIPHCIRLLGLP >gi568815583f:66603259_66881532|GENSCAN_predicted_CDS_2|942_bp atgccacatgattccacagcagacccacatcacaggtgccccttcagtccacggctgcca cctcagacctccagccacccctccagaatagagaggtatgtctgtggccttgctaccatc taccccaaacacccaccctattatcaccatagccacaaatacctcctgcatgcctactat gtgccagtttgtggccctctgacattttgcacgtctttgttggtcactgttatccggatt gttcttcagattccactggcaggtctgctcccaaatgggcccatgggtggaattgaggcc tggaacaagtctggcctcctgatgggctggccagcaactctttctgatcacacacagtcc tgcagggcctcacagccagcatgcccagatggagcatcagtggacctgaagcctgacaac tggatgaaagcctatggaacccaagcccatgaaatccccaaatcttctggggcttccagc acagggagagagctggcagggcggactccacaatgcctgctccaatcttaccctccctgc tcagcagccccaccctctgacacacagccactggataccactgtggggactgctgagtgg ctgcatgaaatctgggctcagattcacctttggagtcctgataaactacagatcctccaa gaagagatgcacatgctgaacagtgacttcttgcctgcctccttaaaggtcaacccccaa gatacagcatccaaacccaatcacctggtgggcagccagggctatagcgtgggaggagac cccaggggccacagacccaatgctgaaggactctcagaagcaccaggaacacacatcctt gccctcaacagatcctggcaacctggtgaatccaaagccgtccttatggaaccagatgaa caacctataccccactgtatccgtctgcttgggctgccataa >gi568815583f:66603259_66881532|GENSCAN_predicted_peptide_3|58_aa MKAQNLGLCECTTPLREIEKGFEILTESDAVSANLWSAEETGRLDLHSVEAKRACTGA >gi568815583f:66603259_66881532|GENSCAN_predicted_CDS_3|177_bp atgaaggctcagaacctgggtctctgtgaatgcaccacacctctcagggagattgaaaaa ggatttgaaattctcactgaatctgatgcagttagtgcaaatctgtggtcagcagaagaa acaggaaggcttgacctgcactcagtggaagctaagagagcttgcacaggggcttga >gi568815583f:66603259_66881532|GENSCAN_predicted_peptide_4|463_aa MGVRKCLSPIFLSGSNGHSQRAQREQYLVPGYRMFRSKRSGLVRRLWRSRVVPDREEGGS GGGGGGDEDGSLGSRAEPAPRAREGGGCGRSEVRPVAPRRPRDAVGQRGAQGAGRRRRAG GPPRPMSEPGAGAGSSLLDVAEPGGPGWLPESDCETVTCCLFSERDAAGAPRDASDPLAG AALEPAGGGRSREARSRLLLLEQELKTVTYSLLKRLKERSLDTLLEAVESRGGVPGGCVL VPRADLRLGGQPAPPQLLLGRLFRWPDLQHAVELKPLCGCHSFAAAADGPTVCCNPYHFS RLCGPESPPPPYSRLSPRDEYKPLDLSDSTLSYTETEATNSLITAPGEFSGHYWTNLGDH PEITPLAGLAFHLPEFWGSADPESLVTLSVLGCPCSCKQPSGGGNSSRRHLGIAIIEFSP DVSLGLLSFLEEIQNYELDHGKHSMPIFTEKLTLAIKDQIEAG >gi568815583f:66603259_66881532|GENSCAN_predicted_CDS_4|1392_bp atgggtgtgaggaagtgtctcagtccgatctttctttcggggagcaacggccactctcag agagcgcagcgcgaacagtacctggttccaggatatcgtatgttcaggtccaaacgctcg gggctggtgcggcgactttggcgaagtcgtgtggtccccgaccgggaggaaggcggcagc ggcggcggcggtggcggcgacgaggatgggagcttgggcagccgagctgagccggccccg cgggcaagagagggcggaggctgcggccgctccgaagtccgcccggtagccccgcggcgg ccccgggacgcagtgggacagcgaggcgcccagggcgcggggaggcgccggcgcgcaggg ggccccccgaggcccatgtcggagccaggggccggcgctgggagctccctgctggacgtg gcggagccgggaggcccgggctggctgcccgagagtgactgcgagacggtgacctgctgt ctcttttcggagcgggacgccgccggcgcgccccgggacgccagcgaccccctggccggg gcggccctggagccggcgggcggcgggcggagtcgcgaagcgcgctcgcggctgctgctg ctggagcaggaactcaaaaccgtcacgtactcgctgctgaagcggctcaaggagcgctcg ctggacacgctgctggaggcggtggagtcccgcggcggcgtgccgggcggctgcgtgctg gtgccgcgcgccgacctccgcctgggcggccagcccgcgccgccgcagctgctgctcggc cgcctctttcgctggcccgacctgcagcacgccgtggagctgaagcccctgtgcggctgc cacagcttcgccgccgccgccgacggccctaccgtgtgctgcaacccctaccacttcagc cggctctgcgggcccgaatctccgccacctccctactctcggctgtctcctcgcgacgag tacaagccactggatctgtccgattccacattgtcttacactgaaacggaggctaccaac tccctcatcactgctccgggtgaattctcaggccactactggacaaaccttggggaccac ccagagatcacaccactcgctggcctggccttccatctcccggagttctgggggtctgcc gatcccgagagccttgttaccctcagtgtcctgggatgtccctgtagctgcaaacagccc tccggagggggaaacagttctagaaggcacttgggcattgccatcatcgagttcagccca gatgtctccctgggactcttgtcttttctagaagaaatccagaattatgaactggatcat ggaaaacatagcatgcccatcttcactgaaaagttaaccttggcaataaaagaccaaata gaagccggctga >gi568815583f:66603259_66881532|GENSCAN_predicted_peptide_5|176_aa MSPDATKPSHWCSVAYWEHRTRVGRLYAVYDQAVSIFYDLPQGSGFCLGQLNLEQRSESV RRTRSKIGFGILLSKEPDGVWAYNRGEHPIFVNSPTLDAPGGRALVVRKVPPGYSIKVFD FERSGLQHAPEPDAADGPYDPNSVRISFAKGWGPCYSRQFITSCPCWLEILLNNPR >gi568815583f:66603259_66881532|GENSCAN_predicted_CDS_5|531_bp atgtctccggacgccaccaagccgagccactggtgcagcgtggcgtactgggagcaccgg acgcgcgtgggccgcctctatgcggtgtacgaccaggccgtcagcatcttctacgaccta cctcagggcagcggcttctgcctgggccagctcaacctggagcagcgcagcgagtcggtg cggcgaacgcgcagcaagatcggcttcggcatcctgctcagcaaggagcccgacggcgtg tgggcctacaaccgcggcgagcaccccatcttcgtcaactccccgacgctggacgcgccc ggcggccgcgccctggtcgtgcgcaaggtgccccccggctactccatcaaggtgttcgac ttcgagcgctcgggcctgcagcacgcgcccgagcccgacgccgccgacggcccctacgac cccaacagcgtccgcatcagcttcgccaagggctgggggccctgctactcccggcagttc atcacctcctgcccctgctggctggagatcctcctcaacaaccccagatag >gi568815583f:66603259_66881532|GENSCAN_predicted_peptide_6|192_aa MGPQDLPARCWHSSPHLGDYSIVKSHNDKNKKLEEGSPGYNPPEEAPVKSVGRRVLDGLK HCYHSFSLLWMDTRTATHSLWGILSGHMLNHREHRQFVRVRADLFCLVPLLIFAVVLFME LLLPIIVKIFPNMLLSTFETQSIKEERPKKQLWGSWNWLHFSRTPSRRKETKGRTAKAFS MFFQKIRETGKT >gi568815583f:66603259_66881532|GENSCAN_predicted_CDS_6|579_bp atgggacctcaggaccttcctgcacgctgctggcactcatctccccaccttggcgactac tccatagtcaagtcccacaacgacaagaacaagaagctggaggaaggcagccccgggtac aacccccctgaagaggcgccagtgaagtctgtggggcggagggtcctggacgggctgaag cactgttaccacagcttcagccttctctggatggacaccaggacagccacacactcgctc tgggggattctcagtggccatatgctgaaccaccgggagcacaggcagtttgtccgtgtc cgtgctgacctcttttgcctggtcccgctcctcatctttgcagtggtgcttttcatggag ttgctgttgcccatcatagtgaaaatctttcccaacatgttgctgtccacatttgagacc cagtccatcaaggaggagagaccgaagaagcagctatgggggagctggaactggctacat ttctccaggacaccatcgaggagaaaagaaaccaaggggaggactgccaaagccttctcc atgtttttccagaagatccgagagacaggaaagacctaa >gi568815583f:66603259_66881532|GENSCAN_predicted_peptide_7|276_aa MRLRSIKANNELIAEEGVDSLNIKELLSTCRTRGMWALGVPENHTLPETVVKGAQVKVDK VKEALKDTAPVLEILKEEKITKEEMGVLSDACSKLKEQNKSLTKEKEELELLKEDVQDDS DDLQEIKKEVSKTSKEKYVEKSKASKTLTKRVQQMIWWIDSLIAQLEMDQRPGKLGQAED SGGAGEKVIGITELISTMKQIKNIPENKLISLASTLDKSKNGKVNIDKLIKVIELVDKDV HVSTSQVAEIVAMLEKEERWRRRRRLRERLRRRLQK >gi568815583f:66603259_66881532|GENSCAN_predicted_CDS_7|831_bp atgaggttgaggtccataaaggccaacaatgagttgattgctgaggaaggggtggacagc ctgaatatcaaggaattgctgtcgacatgtagaactcgagggatgtgggccctcggggtc ccagaaaaccatactcttccagagacagtggtgaagggagcccaggtgaaagtggacaag gttaaggaggccctgaaggacactgcccctgtgctggagatcttgaaggaggagaaaata actaaagaagaaatgggtgtacttagcgatgcgtgttctaaactgaaggagcaaaataag tccctaaccaaggagaaggaggagctggagctgctgaaggaggatgtccaggacgatagt gatgacttgcaagagatcaagaaggaagtttcaaagaccagcaaagaaaaatatgtggaa aaatctaaagccagcaagacactgacaaagagagttcagcagatgatctggtggatagac agcttgatcgcacagctggagatggatcagaggcctggcaagctgggccaggcagaggac tcagggggagcgggagagaaagtcatcggtatcactgagctcatcagtaccatgaagcaa atcaagaacattccagaaaacaagctgatcagcttggcctcaacactggataaaagtaag aatggcaaggtcaacattgacaagctcattaaggtgattgagcttgtggacaaagatgtt cacgtctctaccagtcaggtggctgagattgtagcaatgctggagaaggaggagaggtgg aggagaagaagaaggctaagggaaaggctgaggaggaggctgcagaagtga >gi568815583f:66603259_66881532|GENSCAN_predicted_peptide_8|223_aa MQAFAVHLLCTKFDARRWEFSSEQKRRKDLVKKIKRQRGSQAVTTRSSFVSASAAVFVSK ATSIYESISTSIEPPLTQYPKHQFKNRSFSAEKDRSSLPAMEQSRMENDFDELTEVGFRK SVITNFSKLKEDVRTHRKEAKNLEKRLDKRLTRKNSVKKTLNDLMELKTMAQELHDSCTS FSSQLDQVEERVSVIEDQINEMKREEKFREKKVKRNEQSLQEI >gi568815583f:66603259_66881532|GENSCAN_predicted_CDS_8|672_bp atgcaggcgtttgctgtgcacctcttgtgcaccaaattcgatgctaggcgctgggaattc agcagtgagcaaaagagaaggaaggacctggtgaaaaagataaaaagacagagaggcagt caggcggtgaccacacgatcctcatttgtctctgcatctgcagctgtatttgtatccaag gccacatccatatatgaatctatatctacatccatagagccaccacttacccagtatcca aaacatcagtttaagaatagaagctttagtgccgaaaaggatcgcagctccttgccagca atggaacaaagcaggatggagaatgactttgacgagttgacagaagtaggcttcagaaag tcggtaataacaaacttctccaagctgaaggaggatgttcgaacccatcgcaaggaagct aaaaaccttgaaaaaagattagacaaacggctaactagaaaaaacagtgtaaagaagacc ttaaatgacctgatggagctgaaaaccatggcacaagaactacatgactcatgcacaagc ttcagtagccaactcgatcaagtggaagaaagggtatcagtgattgaagatcaaataaat gaaatgaagagagaagagaagtttagagaaaaaaaagtaaaaagaaacgaacaaagcctc caagaaatatag >gi568815583f:66603259_66881532|GENSCAN_predicted_peptide_9|138_aa MDFNRKELASTGGNADIVAVGAAVMQGGLAVEEEDMSLGPDSYWIPTPPAPSSADKLVPS TKAPAAELMAGAVLNAWEDGPNVPPQGIASHISTARDGSCCANTLCRLVGTLAEMSGHLD FNPVFAIHSLCGPPPQTK >gi568815583f:66603259_66881532|GENSCAN_predicted_CDS_9|417_bp atggattttaatcgaaaagagttagcctctactggtggaaatgcggacattgtggcagta ggggctgctgtcatgcaaggaggtctggcagtggaggaagaagacatgagccttgggcca gactcttactggattcccaccccacctgccccgtcttcagcagacaagctggtacccagc acaaaggctccagctgctgagctaatggcaggtgctgtgctgaatgcctgggaagatggc cccaatgttcctccccaagggattgccagtcacatcagcacagcacgggatggatcctgc tgtgccaacactctctgcaggcttgtggggaccctggctgagatgtctggacacctggat ttcaatccggtctttgccatccactccctgtgtggccctcccccgcaaactaaataa >gi568815583f:66603259_66881532|GENSCAN_predicted_peptide_10|61_aa SPQRQYSEQEKGPPSQAQWEKQRSHHQGRWMFQRDLSLGLLPGICWKEKDFKVEVFAEPL K >gi568815583f:66603259_66881532|GENSCAN_predicted_CDS_10|186_bp tctccacagagacaatacagtgaacaagagaagggcccacccagccaggcccagtgggag aagcagcggtcccaccaccagggcagatggatgttccagcgcgacctcagcctggggtta cttcctggaatctgctggaaggaaaaggattttaaagtggaagtttttgctgagccccta aaatga