GENSCAN 1.0	Date run: 8-Nov-116	Time: 04:57:20

Sequence gi568815586f:123502149_123720137 : 217989 bp : 46.36% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 PlyA -    643    638    6                               1.05
 1.05 Term -  12155  12133   23  0  2  146   33    21 0.439   0.87
 1.04 Intr -  17879  17721  159  0  0   67   68   135 0.875   9.36
 1.03 Intr -  21497  21347  151  2  1  104  115   221 0.999  26.14
 1.02 Intr -  31704  31026  679  2  1  102   86   994 0.594  92.22
 1.01 Init -  42701  42643   59  2  2   38   84    51 0.070   0.48
 1.00 Prom -  45797  45758   40                              -4.86

 2.00 Prom +  46490  46529   40                              -5.36
 2.01 Init +  52392  52419   28  2  1   68   89    12 0.321  -0.93
 2.02 Intr +  54140  54335  196  0  1   79   56   148 0.933   9.17
 2.03 Term +  55010  55193  184  2  1   86   48   142 0.994   7.02
 2.04 PlyA +  57022  57027    6                               1.05

 3.00 Prom +  62886  62925   40                              -5.16
 3.01 Init +  82489  82668  180  0  0  107   64   490 0.904  45.68
 3.02 Intr +  84599  84791  193  1  1  108  100   115 0.980  13.77
 3.03 Intr +  88194  88301  108  1  0   62  100   136 0.890  12.46
 3.04 Intr +  89833  89933  101  2  2   39   93     0 0.037  -4.57
 3.05 Intr +  99896 100108  213  1  0   51   74   234 0.055  17.21
 3.06 Intr + 103783 103833   51  0  0   94  101    56 0.990   6.60
 3.07 Intr + 103880 104011  132  1  0  110   94    38 0.976   7.44
 3.08 Intr + 105284 105375   92  1  2   87   68   111 0.999   7.79
 3.09 Intr + 106532 106681  150  2  0   26   79   197 0.996  11.88
 3.10 Intr + 107701 107980  280  1  1   55  103   220 0.824  17.68
 3.11 Intr + 111022 111104   83  0  2  118   78    77 0.934   8.14
 3.12 Intr + 113037 113168  132  0  0   99   98   160 0.995  17.86
 3.13 Intr + 114363 114455   93  0  0   80   86    82 0.932   6.28
 3.14 Intr + 115610 115724  115  2  1   85   80   117 0.873  11.05
 3.15 Intr + 116533 116689  157  0  1   66  110   147 0.900  14.28
 3.16 Intr + 117284 117576  293  0  2   46   88   210 0.997  13.65
 3.17 Term + 117816 117992  177  2  0   56   37   240 0.999  13.39
 3.18 PlyA + 118770 118775    6                               1.05

 4.09 PlyA - 118898 118893    6                               1.05
 4.08 Term - 119874 119608  267  0  0   68   37   229 0.542  11.29
 4.07 Intr - 122714 122639   76  1  1   81  116    94 0.998  11.02
 4.06 Intr - 124345 124277   69  0  0   62  100    88 0.137   5.70
 4.05 Intr - 125008 124902  107  1  2   98  -24    72 0.111  -3.99
 4.04 Intr - 128137 128021  117  1  0  124  101    44 0.989   9.96
 4.03 Intr - 128385 128249  137  1  2   70   94    87 0.998   7.79
 4.02 Intr - 130298 130197  102  0  0   84   71   112 0.964   9.25
 4.01 Init - 131409 131397   13  0  1  102   61    26 0.792   1.74
 4.00 Prom - 133775 133736   40                              -4.56

 5.00 Prom + 138838 138877   40                              -5.26
 5.01 Init + 143337 143413   77  2  2   92   86     7 0.451   1.46
 5.02 Intr + 145815 145978  164  0  2   84   26   125 0.471   5.52
 5.03 Intr + 150559 150587   29  2  2  108   91    -6 0.395  -0.47
 5.04 Intr + 157368 157436   69  2  0   63   95    52 0.359   2.78
 5.05 Intr + 157647 157782  136  2  1  101  108   -29 0.343   0.54
 5.06 Term + 158047 158087   41  2  2   80   55    54 0.419  -1.35
 5.07 PlyA + 158166 158171    6                               1.05

 6.00 Prom + 168529 168568   40                              -1.66
 6.01 Init + 169093 169174   82  0  1   62   57    97 0.989   3.13
 6.02 Intr + 169359 169466  108  1  0   96   70   184 0.895  17.66
 6.03 Intr + 169908 169979   72  1  0  105   66    41 0.372   2.98
 6.04 Intr + 177041 177141  101  0  2   81   91    47 0.438   4.13
 6.05 Intr + 184688 184887  200  1  2  115   75     4 0.372  -0.05
 6.06 Intr + 185903 186029  127  2  1   67   89    92 0.854   7.88
 6.07 Intr + 188385 188526  142  2  1  116   87    28 0.983   5.43
 6.08 Intr + 192694 192828  135  2  0   99   88    -1 0.718   1.54
 6.09 Intr + 193072 193149   78  2  0   92  110     3 0.838   2.42
 6.10 Intr + 194267 194347   81  0  0   91   56    58 0.684   2.51
 6.11 Intr + 202384 202540  157  2  1   39   86   193 0.971  13.27
 6.12 Intr + 204578 204703  126  2  0  120   73    76 0.820   9.09
 6.13 Term + 206305 206335   31  1  1   55   48    38 0.120  -6.17
 6.14 PlyA + 208651 208656    6                               1.05

 7.02 PlyA - 208829 208824    6                               1.05
 7.01 Sngl - 210419 209958  462  2  0   86   39   467 0.751  35.76

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 100001 100108  108  1  0  105   74   180 0.922  18.60
S.002 Init - 124338 124277   62  0  2   71  100    86 0.856   8.85

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:123502149_123720137|GENSCAN_predicted_peptide_1|356_aa
MFIDLGKLLNFSGPLQLPFLPAPPRGARPGSIERNSDPNLGGAAGLRTRAPSPLPPPPPP
PPPPTREQLRAEPGAQTPGPHTPRRRPCPRPRPPPAARCALAPAPAERARPRPRLGPAGP
ALDSGKFGSCTSLRGGGQARRVAMEEERGSALAAESALEKNVAELTVMDVYDIASLVGHE
FERVIDQHGCEAIARLMPKVVRVLEILEVLVSRHHVAPELDELRLELDRLRLERMDRIEK
ERKHQKELELVEDVWRGEAQDLLSQIAQLQEENKQLMTNLSHKDVNFSEEEFQKHEGRLE
ALDPQGLDLRALSRFPVHQCTHARSALHPGNFASTAFQIPAGLVMNQPTRTTTGSL

>gi568815586f:123502149_123720137|GENSCAN_predicted_CDS_1|1071_bp
atgttcatcgacttgggcaagttacttaacttctccgggcctctgcagcttccttttctg
cccgcaccgccccgaggcgcccggcccggctccattgagcgcaactcggatcccaacttg
ggaggcgccgcgggcctgcgcacgcgcgcgccctccccgctgccgccgccgccgccgccg
ccgccgccgcccactcgggagcagctccgggccgagccgggcgcccagaccccgggcccg
cacaccccgcgccgccgcccgtgcccgcgcccgcgcccgccgcccgctgcccgctgcgcc
ctcgccccagcgcccgctgagcgcgcccgcccgcggcccaggctgggcccggccggcccg
gccctcgacagcggcaagtttgggagttgcacgagtttgcggggcgggggacaggccagg
agggtggccatggaggaggagcgggggtcggcgctggcggccgagtcggcgctggagaag
aacgtggccgagctgaccgtcatggacgtgtacgacatcgcgtcgcttgtgggccacgag
ttcgagcgggtcattgaccagcacggctgcgaggccatcgcgcgcctcatgcccaaggtc
gtgcgcgtcctggagatcctggaggtgctggtcagccgccaccacgtcgcgcccgagctg
gacgagctgcgcctggagctggaccgcctgcgcctggagaggatggaccgcatcgagaag
gagcgcaagcaccagaaggagctggagctggtggaggatgtgtggcgaggggaggcgcag
gacctcctctcccagatcgcccagctgcaggaggagaacaagcagctcatgaccaacctc
tcccacaaggatgtcaatttctcagaggaggagttccagaagcatgaagggcgcctggag
gccttggacccacagggcctggacctgagggctctgagccgcttccctgtgcatcagtgc
acgcatgctcgcagcgccctccacccagggaactttgcttccacagccttccagatccct
gctgggcttgtcatgaatcagcccacacgtacaaccactggcagcctctaa

>gi568815586f:123502149_123720137|GENSCAN_predicted_peptide_2|135_aa
MTSKGEVCSDTSWKGRYPVILSTPTVVKVAGLESWIHHTRVKPWIVPEEPENPGDIASYS
CEPLEDLRLLFKRQPIEAVKLQMVLQMEPQMQSVTKIYHRPLDRPPSLCSDVDDIKGNHP
KEISTARPLLHPNSA

>gi568815586f:123502149_123720137|GENSCAN_predicted_CDS_2|408_bp
atgacttcgaaaggcgaggtatgcagtgatacatcctggaaaggacgctacccagtcatt
ttatctaccccaaccgtggttaaagtggctggattggaatcttggatacatcacactcga
gtcaaaccctggatagtgccagaggaacccgaaaatccaggagacattgctagctattcc
tgtgaacctctagaggatctgcgcctgctcttcaagagacaaccaatcgaagctgtaaaa
ctacaaatggttcttcaaatggagccccagatgcaatccgtgactaagatctaccacaga
cccctggaccggcctcctagcctatgctctgatgttgatgacatcaaaggcaaccatccc
aaggaaatatcaactgcacgacccctgctacaccccaattcagcatga

>gi568815586f:123502149_123720137|GENSCAN_predicted_peptide_3|849_aa
MVTLAELLVLLAALLATVSGYFVSIDAHAEECFFERVTSGTKMGLIFEVAEGGFLDIDVE
ITGPDNKGIYKGDRESSGKYTFAAHMDGTYKFCFSNRMSTMTPKIVMFTIDIGEAPKGQD
METEAHQNKLEEMINELAVAMTAVKHEQEYMEVRERIHRANVLSVFPTLKSVLATYGNKS
SGSRAQTSACVGRKRSSRPAAGAAGSGSARWGCTRRVRAAATDAAKERAMEHVTEGSWES
LPVPLHPQVLGALRELGFPYMTPVQSATIPLFMRNKDVAAEAMNSEEERETKTLCLQVTG
SGKTLAFVIPILEILLRREEKLKKSQVGAIIITPTRELAIQIDEVLSHFTKHFPEFSGNI
IVATPGRLEDMFRRKAEGLDLASCVRSLDVLVLDEADRLLDMGFEASLFEYLFSIVKHCV
LILLASERLSVSPLFISINTILEFLPKQRRTGLFSATQTQEVENLVRAGLRNPVRVSVKE
KGVAASSAQKTPSRLENYYMVCKADEKFNQLVHFLRNHKQEKHLVFFSTCACVEYYGKAL
EVLVKGVKIMCIHGKMKYKRNKIFMEFRKLQSGILVCTDVMARGIDIPEVNWVLQYDPPS
NASAFVHRCGRTARIGHGGSALVFLLPMEESYINFLAINQKEMKPQRNTADLLPKLKSMA
LADRAVFEKGMKAFVSYVQAYAKHECNLIFRLKDLDFASLARGFALLRMPKMPELRGKQF
PDFVPVDVNTDTIPFKDKIREKQRQKLLEQQRREKTENEGRRKFIKNKAWSKQKAKKEKK
KKMNEKRKREEGSDIEDEDMEELLNDTRLLKKLKKGKITEEEFEKGLLTTGKRTIKTVDL
GISDLEDDC

>gi568815586f:123502149_123720137|GENSCAN_predicted_CDS_3|2550_bp
atggtgacgcttgctgaactgctggtgcttctggccgctctcctggccacggtctcgggc
tatttcgttagcatcgacgcccatgctgaagagtgcttctttgagcgggtcacctcgggc
accaagatgggcctcatcttcgaggtggcggagggcggcttcctggacatcgacgtggag
attacaggaccagataacaaaggaatttacaaaggagacagagaatccagtgggaaatac
acatttgctgctcacatggatggaacatacaaattttgttttagtaaccggatgtccacc
atgactccaaaaatagtgatgttcaccattgatattggggaggctccaaaaggacaagat
atggaaacagaagctcaccagaacaagctagaagaaatgatcaatgagctagcagtggcg
atgacagctgtaaagcacgaacaggaatacatggaagtccgggagagaatacacagagcc
aatgtgctttctgtttttcctacccttaaatctgttcttgctacttatggaaacaaatct
tcaggatctagagcacagacatcagcttgtgttggcaggaagcgaagttcccggccggcc
gctggagctgcgggaagcggaagtgctcgttgggggtgcacaaggcgcgttcgagcagcg
gcgaccgacgcggcgaaggagcgcgccatggagcatgtgacagagggctcctgggagtcg
ctgcctgtgccgctgcacccgcaggtgctgggcgcgctgcgggagctgggcttcccgtac
atgacgccggtgcagtccgcaaccatccctctgttcatgcgaaacaaagatgtcgctgca
gaagcgatgaactctgaggaagaaagggaaaccaaaactctctgtttgcaggtcacaggt
agtggcaaaacactcgcttttgtcatccccatcctggaaattcttctgagaagagaagag
aagttaaaaaagagtcaggttggagccataatcatcacccccactcgagagctggccatt
caaatagacgaggtcctgtcgcatttcacgaagcacttccccgagttcagtgggaacatc
attgtggccactccaggccgcttggaggacatgttccggaggaaggccgaaggcttggat
ctggccagctgtgtgcgatccctggatgtcctggtgttggatgaggcagacagacttctg
gacatggggtttgaggcaagtctttttgaatacctgtttagtatcgtgaagcactgtgtg
ttgattctgctggcaagtgagcggttaagtgtgagccctctttttatcagcatcaacacc
attctggagtttttgccaaagcagaggagaacaggccttttctctgccactcagacgcag
gaagtggagaacctggtgagagcgggcctccggaaccctgtccgggtctcagtgaaggag
aagggcgtggcagccagcagtgcccagaagaccccctcccgcctggaaaactactacatg
gtatgcaaggcagatgagaaatttaatcagctggtccattttcttcgcaatcataagcag
gagaaacacctggtcttcttcagcacctgtgcctgtgtggaatactatgggaaggctctg
gaagtgctggtgaagggcgtgaagattatgtgcattcacggaaagatgaaatataaacgc
aataagatcttcatggagttccgcaaattgcaaagtgggattttagtgtgcactgatgtg
atggcccggggaattgatattcctgaagtcaactgggttttgcagtatgaccctcccagc
aatgcaagtgccttcgtgcatcgctgcggtcgcacagctcgcattggccacgggggcagc
gctctggtgttcctcctgcccatggaagagtcatacatcaatttccttgcaattaaccaa
aaagagatgaagccccagagaaacacagcggaccttctgccaaaactcaagtccatggcc
ctggctgacagagctgtgtttgaaaagggcatgaaagcttttgtgtcatatgtccaagct
tatgcaaagcatgaatgcaacctgattttcagattaaaggatcttgattttgccagcctt
gctcgaggttttgccctgctgaggatgcccaagatgccagaattgagaggaaagcagttt
ccagattttgtgcccgtggacgttaataccgacacgattccatttaaagataaaatcaga
gaaaagcagaggcagaaactcctggagcaacaaagaagagagaaaacagaaaatgaaggg
agaagaaaattcataaaaaataaagcttggtcaaagcagaaggccaaaaaagaaaagaag
aaaaaaatgaatgagaaaaggaaaagggaagagggttctgatattgaagatgaggacatg
gaagaacttcttaatgacacaagactcttgaaaaaacttaagaaaggcaaaataactgaa
gaagaatttgagaagggcttgttgacaactggcaaaagaacaatcaagacagtggattta
gggatctcagatttggaagatgactgctga

>gi568815586f:123502149_123720137|GENSCAN_predicted_peptide_4|295_aa
MDDKELIEYFKSQMKEDPDMASAVAAIRTLLEFLKRDKGETIQGLRANLTSAIETLCGVD
SSVAVSSGGELFLRFISLASLEYSDYSKCKKIMIERGELFLRRISLSRNKIADLCHTFIK
DGATILTHAYSRVVLRVLEAAVAAKKRFSVYVTESQPDFKKMAKALCHLNVPVTVVLDAA
VGYIMEKADLVIVGAEGVVENGGIINKRAVSGSSTLITHTKALPGQSTKYSVPTNVYFSP
QYKADTLKVAQTGQDLKEEHPWVDYTAPSLITLLFTDLGVLTPSAVSDELIKLYL

>gi568815586f:123502149_123720137|GENSCAN_predicted_CDS_4|888_bp
atggacgacaaggagttaattgaatactttaagtctcagatgaaagaagatcctgacatg
gcctcagcagtggctgccatccggacgttgctggagttcttgaagagagataaaggggag
acaatccagggtctgagggcgaatctcaccagtgccatagaaaccctgtgtggtgtggac
tcctctgtggcagtgtcctctggcggggagctcttcctccgcttcatcagtcttgcctcc
ctggaatactccgattactccaaatgtaaaaagatcatgattgagcggggagaacttttt
ctcaggagaatatcactgtcaagaaacaaaattgcagatctgtgccatactttcatcaaa
gatggagcgacaatattgactcacgcctactccagagtggtcctgagagtcctggaagca
gccgtggcggccaagaagcgatttagtgtatacgtcacagagtcacagcctgattttaag
aaaatggccaaagccctctgccacctcaacgtccctgtcactgtggtgctagatgctgct
gtcggctacatcatggagaaagcagatcttgtcatagttggtgctgaaggagttgttgaa
aacggaggaattattaacaagagagcagtctcaggaagtagcacactgatcactcacacc
aaggccctaccaggtcagagcactaaatattcagtgccgactaatgtgtacttctctcct
cagtataaggcagacactctcaaggtcgcgcagactggacaagacctcaaagaggagcat
ccgtgggtcgactacactgccccttccttaatcactctgctgtttacagacctgggcgtg
ctgacaccctcagcagtcagcgatgagctcatcaagctctatctgtaa

>gi568815586f:123502149_123720137|GENSCAN_predicted_peptide_5|171_aa
MVLGNSHLFMNRSNKLAVIASHIQESRFLYPGKNGRLGDFFGDPGNPPEFNPSGSKDGKY
ELLTSANEVIVEEIKDLMTKNNQEMKSRILACDITGGLYLKVPQMPSLLQYLLWVFLPDQ
DQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSMLKAKKKKLKVSA

>gi568815586f:123502149_123720137|GENSCAN_predicted_CDS_5|516_bp
atggtgctgggaaattcgcatttattcatgaatcgttccaacaaacttgctgtgatagca
agtcacattcaagaaagccgattcttatatcctggaaagaatggcagacttggagacttc
ttcggagaccctggcaaccctcctgaatttaatccctctgggagtaaagatggaaaatac
gaacttttaacctcagcaaatgaagttattgttgaagagattaaagatctaatgaccaaa
aacaatcaggaaatgaaatcaaggatattggcttgtgacatcacgggaggactgtacctg
aaggtgcctcagatgccttctcttctgcagtatttgctgtgggtgtttcttcccgatcaa
gatcagagatctcagttaatcctcccacccccagttcatgttgactacagggctgcttgc
ttctgtcatcgaaatctcattgaaattggttatgtctgttctgtgtgtttgtcaatgctg
aaagccaagaaaaagaaactgaaagtgtctgcctga

>gi568815586f:123502149_123720137|GENSCAN_predicted_peptide_6|479_aa
MGFQPPAALLLRLFLLQGILRLLWGDLAFIPPFIRMSGPAVSASLVGDTEGVTVSLAVLQ
DEAGILPIPTCGVLNNETEDWSVTVIPENVTVIPNQVYQPLGPCPCNLTAGACDVRCCCD
QECSSNLTTLFRRSCFTGVFGGDVNPPFDQLCSAGTTTRGVPDWFPFLCVQSPLANTPFL
GYFYHGAVSPKQDSSFEVYVDTDAKDFADFGYKQGDPIMTVKKAYFTIPQVSLAGQCMQN
APVAFLHNFDVKCVTNLELYQERDGIINAKIKNVALGETPLNNGSTPRIVNVEEHYIFKW
NNNTISEINVKIFRAEINAHQKGIMTQRFVVKFLSYNSGNEEELSGNPGYQLGKPVRALN
INRMNNVTTLHLWQSGVDAPDPGADPLASSVNGMCLDIPAHLSIRILISDAGAVEGITQQ
EILGVETRFSSVNWQYQCGLTCEHKADLLPISASVQFIKIPAQLPHPLTRMNILIAGLN

>gi568815586f:123502149_123720137|GENSCAN_predicted_CDS_6|1440_bp
atgggcttccagcctccggccgctcttcttttgaggcttttccttctgcagggcatcctg
aggcttctgtggggggacctggctttcatccctccttttatccgaatgtccggccctgcg
gtcagcgcgtccctggtcggagacaccgagggtgtgaccgtgtccctggcagtgctgcag
gacgaggcgggaatattgccaattccgacgtgtggagtgctgaacaatgagacggaagac
tggagcgtgactgtgatccccgagaacgtgactgtcattcctaaccaggtgtatcagccc
cttggcccttgtccttgtaatttaacagctggagcctgtgatgttcgctgctgctgtgac
caggaatgctcatcaaatttaacaacgctgttcagacggtcctgcttcaccggcgtgttt
ggaggagacgtcaatcctccttttgatcagctctgctctgctgggacgacgacacgtggt
gtccccgattggtttccctttctgtgtgtgcagtccccccttgccaacacacccttcctt
ggttacttctatcatggtgctgtttcccccaaacaggactcttcctttgaagtatatgtg
gatactgacgcaaaagactttgcagactttggttacaaacaaggagatcccattatgact
gtaaagaaggcatattttactattccgcaggtgtccctggctgggcagtgtatgcagaac
gccccagtggcatttcttcacaattttgatgttaaatgcgttactaatttggaactatac
caagaacgagatggtattatcaatgcgaagataaagaatgttgccttaggagaaactcct
ttaaataacggatcaacccctagaattgtgaatgtggaagaacattatattttcaaatgg
aataataataccatcagtgaaataaatgttaaaatttttagggcagagattaatgcccac
cagaaagggataatgacacagagatttgtagtaaaatttttaagctataatagtggtaat
gaagaagaattatctggaaatccaggttaccaacttggcaagcctgtccgagctctaaat
atcaacaggatgaataatgtcacgactttacatctttggcaatcgggtgtagatgcccct
gatccaggtgcagacccgctggctagcagtgtgaacggcatgtgcctggatattcctgct
cacctgagcatccgcatcctcatctcggatgctggcgcggtggaagggattactcagcag
gagatactcggtgtagagacaaggttctcctcagtgaactggcagtaccagtgtgggctt
acctgtgagcacaaggccgaccttctccctatcagtgcatccgtccagtttattaaaatt
cctgcacagttaccccaccccctgacaagaatgaatattctcatcgctggcctgaactga

>gi568815586f:123502149_123720137|GENSCAN_predicted_peptide_7|153_aa
MAGRPAPGGARWAAAAPSRAGPHTQPVRPGPRALPVAWPRPRTMGLQTAAAAPGSATAAA
APAAATAVREVPPPSHCRFRFRPRVPPPPPADQWTPEVVAARRGGRANQREPGGYDHCSW
RSAGLDAELNPARLPGFPGGVSPSVHAAKLTRS

>gi568815586f:123502149_123720137|GENSCAN_predicted_CDS_7|462_bp
atggcgggccgacccgcgcccggaggggctcgatgggcggcggcggctccgagccgcgcg
ggcccgcacactcagccggtgcggcccgggcctcgagctcttcctgtggcctggccgcgg
ccccgcactatggggctccagactgcagcggccgcccccggttctgccaccgcggccgcc
gctccagctgccgccacagcagtccgcgaggtcccgcccccaagccactgccggttccgg
ttccgcccccgcgtcccgccccccccgccggccgaccaatggacgcccgaggttgtcgcc
gcacgacgtggcggcagggccaaccagcgcgaacccggcgggtacgaccactgcagctgg
cgctccgccggactggacgcagagctaaatcccgcgcggctccccggctttcctggagga
gtctcgccgagtgtgcacgccgcgaagctcacccgttcctag