GENSCAN 1.0 Date run: 6-Nov-116 Time: 06:45:46 Sequence gi568815588f:86554784_86765748 : 210965 bp : 48.46% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 PlyA - 283 278 6 1.05 1.07 Term - 11936 11804 133 1 1 54 48 143 0.133 4.46 1.06 Intr - 43632 43534 99 2 0 88 84 16 0.106 0.43 1.05 Intr - 57439 57299 141 0 0 -15 101 123 0.109 2.77 1.04 Intr - 64110 63018 1093 1 1 15 53 379 0.215 16.17 1.03 Intr - 65026 65004 23 0 2 104 97 44 0.663 4.09 1.02 Intr - 68423 68373 51 1 0 41 81 74 0.314 0.02 1.01 Init - 71689 71616 74 1 2 69 78 29 0.313 0.54 1.00 Prom - 75947 75908 40 -7.06 2.02 PlyA - 75968 75963 6 1.05 2.01 Sngl - 76824 76042 783 0 0 94 36 895 0.987 80.87 2.00 Prom - 89215 89176 40 -3.86 3.00 Prom + 89334 89373 40 -7.96 3.01 Init + 93802 93916 115 0 1 55 78 120 0.812 6.37 3.02 Intr + 98259 98353 95 1 2 88 92 17 0.644 1.78 3.03 Intr + 100436 100636 201 1 0 81 53 91 0.923 4.38 3.04 Intr + 101372 101517 146 1 2 132 51 63 0.989 6.08 3.05 Intr + 103249 103382 134 1 2 68 76 152 0.990 12.29 3.06 Intr + 103701 103904 204 1 0 54 83 320 0.821 27.27 3.07 Intr + 104514 104685 172 1 1 42 107 260 0.999 22.30 3.08 Intr + 105112 105276 165 1 0 65 94 333 0.738 30.68 3.09 Intr + 106498 106605 108 1 0 82 64 189 0.948 15.30 3.10 Intr + 107469 107649 181 0 1 77 79 152 0.946 13.07 3.11 Intr + 108876 109019 144 2 0 130 57 88 0.990 10.38 3.12 Intr + 111037 111211 175 0 1 70 63 99 0.980 5.11 3.13 Intr + 113886 114001 116 1 2 128 100 98 0.999 15.17 3.14 Intr + 124584 124735 152 2 2 80 50 248 0.612 19.16 3.15 Intr + 125299 125374 76 1 1 96 100 49 0.940 6.42 3.16 Intr + 126653 127020 368 1 2 79 73 153 0.971 6.84 3.17 Intr + 132286 132489 204 1 0 106 94 393 0.995 40.02 3.18 Term + 132759 132891 133 0 1 70 54 103 0.937 2.66 3.19 PlyA + 133519 133524 6 -1.95 4.00 Prom + 133594 133633 40 -4.66 4.01 Init + 134436 134491 56 2 2 87 88 77 0.918 6.36 4.02 Intr + 137113 137392 280 1 1 97 86 207 0.475 18.88 4.03 Intr + 137752 137788 37 0 1 78 115 57 0.495 5.24 4.04 Intr + 144495 144663 169 1 1 143 58 6 0.154 2.10 4.05 Intr + 151748 151936 189 2 0 75 88 189 0.100 16.30 4.06 Intr + 155122 155267 146 1 2 91 110 126 0.992 15.03 4.07 Intr + 155414 155461 48 0 0 121 99 14 0.955 4.45 4.08 Intr + 156799 156939 141 2 0 117 64 160 0.963 16.82 4.09 Intr + 161685 162030 346 1 1 83 16 230 0.455 9.75 4.10 Intr + 163181 163361 181 2 1 88 98 166 0.999 17.47 4.11 Intr + 163944 164064 121 2 1 70 97 123 0.984 11.47 4.12 Intr + 171354 171469 116 1 2 86 39 128 0.076 7.87 4.13 Term + 178104 178193 90 2 0 87 48 98 0.003 3.42 4.14 PlyA + 181259 181264 6 1.05 5.00 Prom + 195113 195152 40 -3.46 5.01 Init + 201003 201668 666 2 0 -1 14 387 0.748 15.63 5.02 Term + 202381 202485 105 0 0 84 40 39 0.632 -2.89 5.03 PlyA + 203280 203285 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 26417 26689 273 1 0 114 48 104 0.828 4.33 S.002 Term + 148052 148199 148 2 1 103 49 128 0.805 7.77 S.003 Init + 151680 151936 257 2 2 83 88 218 0.835 16.29 S.004 Term - 171498 171350 149 1 2 104 44 157 0.897 10.96 S.005 Intr - 179379 179259 121 1 1 69 106 52 0.867 4.65 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:86554784_86765748|GENSCAN_predicted_peptide_1|537_aa MRLSKEDLCKVGGQHIRISEQFGMSSGGLEVQDEGISKFGVWKKEPIMLENEIPGIQLTS DVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIGKMAILPKVIYRFNAIPIKLPMI FFTELEKTTLKFIWNQKRARIAKSMLSQKNKAGGIMLPDFKLYYKATVTKTAWYWYQNRN IDQWNRTEPSEIMPCIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTP YTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDFMSKTPKAMATKAKIDKWDVIK LNSFCTAKETTIRVNRQATEWEKIFAIYSSDKGIISRIYNELQQIYKKKTNNPIKKWVKD MNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRNNGKPL ARLGAVMDALLDQKHSGHPAGTGGMEVSVRPAKAANSRGGRTLTKAKPKSLWIYITQKSH HWGCLDDASISEFLKISPLCNKSKVQKPESKETVCQASELKLSHHIPCDLHVYIQMA >gi568815588f:86554784_86765748|GENSCAN_predicted_CDS_1|1614_bp atgaggctaagtaaagaagatctgtgcaaggttggaggccagcatatcagaatcagtgaa caatttggtatgagttctggaggcttggaagtccaagatgaaggcatcagtaaatttggc gtctggaaaaaggagcccatcatgcttgagaatgaaataccaggaattcaacttacaagc gatgtgaaggacctcttcaaggagaactataaaccactgctcaatgaaataaaagaggat acaaacaaatggaagaacattccatgctcatgggtaggaagaatcaacatcgggaaaatg gccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatgatt ttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgc atcgccaagtcaatgctaagccaaaagaacaaagctggaggcatcatgctacctgacttc aaactatactacaaggccacagtaaccaaaacagcatggtactggtaccaaaacagaaat atagaccaatggaacagaacagagccctcagaaattatgccgtgtatctacaactatctg atctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatgg tgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttacacct tatacaaaaattaattcgagatggattaaagacttaaatgttagacctaaaactataaaa accctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttcatg tctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatgtaattaaa ctaaatagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaagctacagaa tgggagaaaatttttgcaatctactcatctgacaaagggataatatccagaatctacaat gaactccaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggtgaaggat atgaacagacacttctcaaaagaagacatttatgcagccaaaaaacacatgaaaaaatgc tcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctcaca ccagttagaatggcaatcattaaaaagtcaggaaacaacaggaacaatggcaagccttta gccagattgggagcagtaatggacgccttgctggatcagaagcacagcggacaccctgcc ggaaccggagggatggaagtcagcgtcaggcctgcaaaggcggcaaacagccgtggtgga cggaccctgactaaagcaaagcctaaaagtttgtggatttacatcacccagaaatctcat cattggggatgtcttgatgatgccagcatttctgaatttctgaaaatatcacctctgtgc aacaagtctaaagttcagaaacctgaatccaaggaaactgtatgtcaggcctctgagctc aagctaagccatcatatcccctgtgacctgcacgtgtacatccagatggcctga >gi568815588f:86554784_86765748|GENSCAN_predicted_peptide_2|260_aa MPKGKKAKGKKVAPAPAVVKKQEAKKVVNPPFEKRPKNFVIGRDIQPKRDLTRFVKWPRY IRLQRQRAILCKQLKVPSAINQFTQALDHQTSTHKYRPETKQEKKQRLLALAEKKAAGKG DVPTKKPSVFQAGVNTVTTLVENKKAQPMVIAQDMDPIEVVVFLPALCRKMGVPYSIIKG KARLGHLVHRKTCTTVAFTQVNSEGKGALAKLVETIRTNYNDRYDEIRHHWGGNVLGPKS VVRIAKLKKAKAKEFAIKLG >gi568815588f:86554784_86765748|GENSCAN_predicted_CDS_2|783_bp atgccgaaaggaaagaaggccaaagggaagaaggtggctccggcccctgctgtcgtgaag aagcaggaggccaagaaagtggtgaatcccccgtttgagaaaaggcctaagaattttgtc attggacgggacatccagcccaaaagagacctcacccgctttgtgaaatggccccgctac atcaggttgcagaggcagagagccatcctctgtaagcagctgaaagtgccttctgcgatt aaccagttcacccaggccctggaccaccaaacatctactcacaagtacagaccagagaca aagcaagagaagaagcagaggctgttggccctggctgagaagaaagctgctggcaaaggg gacgtccccactaagaaaccatctgtctttcaagcaggagttaataccgtcaccaccttg gtggagaacaagaaagcccagccgatggtgattgcacaagacatggatcccattgaggtg gttgtcttcttgcctgccctgtgtcgtaaaatgggggtcccttactccattatcaagggg aaggcaagactgggccatctagtccacaggaagacctgcaccactgtcgccttcacacag gttaactcggaaggcaaaggcgctttggctaagctggtggaaactatcaggaccaattac aacgacagatacgatgagatccgccatcactggggcggcaatgtcctgggtcccaagtct gtggttcgcatcgccaagctcaaaaaggcaaaggctaaagaatttgccattaaactgggt taa >gi568815588f:86554784_86765748|GENSCAN_predicted_peptide_3|962_aa MIHANLAGRALGTALSLRDQHPRPGDVGAISEKSGGSTAPAASSKTGQGAGPGSPPLLTS SQLGSGLQAWRLLGCPMEPSREEGTPWASGSAKPTVKPWVPSATSIILAQAQGHIPSTPL LPGEGRGHPQDLGMLAQAPGTWAAAWVPLPTVDVPDHAHYTLGTVILLVGLTGMLGNLTV IYTFCRSRSLRTPANMFIINLAVSDFLMSFTQAPVFFTSSLYKQWLFGETGCEFYAFCGA LFGISSMITLTAIALDRYLVITRPLATFGVASKRRAAFVLLGVWLYALAWSLPPFFGWSA YVPEGLLTSCSWDYMSFTPAVRAYTMLLCCFVFFLPLLIIIYCYIFIFRAIRETGRALQT FGACKGNGESLWQRQRLQSECKMAKIMLLVILLFVLSWAPYSAVALVAFAGYAHVLTPYM SSVPAVIAKASAIHNPIIYAITHPKYRVAIAQHLPCLGVLLGVSRRHSRPYPSYRSTHRS TLTSHTSNLSWISIRRRQESLGSESEVGWTHMEAAAVWGAAQQANGRSLYGQGLEDLEAK APPRPQGHEAETPGKTQDYAVSLQALEVALSPVLHGIHSPSPMAPLHTSKLLPHNVLRIH FPAQQPHPRLSLREAAADSTSMSYSVTLTGPGPWGFRLQGGKDFNMPLTISRITPGSKAA QSQLSQGDLVVAIDGVNTDTMTHLEAQNKIKSASYNLSLTLQKSKRPIPISTTAPPVQTP LPVIPHQKDPALDTNGSLVAPSPSPEARASPGTPGTPELRPTFSPAFSRPSAFSSLAEAS DPGPPRASLRAKTSPEGARDLLGPKALPGSSQPRQYNNPIGLYSAETLREMAQMYQMSLR GKASGVGLPGGADYQERFNPSALKDSALSTHKPIEVKGLGGKATIIHAQYNTPISMYSQD AIMDAIAGQAQAQGSDFSGLYEEAGRRDFYSPLNKGTILEAPRPLTAKLSFKEGEVTRFF QQ >gi568815588f:86554784_86765748|GENSCAN_predicted_CDS_3|2889_bp atgatacatgcaaacttggcaggccgggcactggggacagccctgagcctccgggatcag caccccaggccaggagatgtgggtgccatcagcgagaaatcaggtggtagcacagctcct gctgcctcctctaagacagggcaaggggcaggcccggggtcccctccacttctgacatcc agtcaacttggatcaggcctgcaggcctggaggttgctcggatgccccatggagccctcc agggaggagggcacaccctgggcctctggatctgccaagcccacagtgaagccttgggtg ccttctgccacctccatcatcctagcccaggcccagggccatattccatcaacaccactg ctacctggggaaggccgagggcaccctcaggacctggggatgctggcccaggcacctggg acttgggctgctgcctgggtccccctccccacggttgatgttccagaccatgcccactat accctgggcacagtgatcttgctggtgggactcacggggatgctgggcaacctgacggtc atctataccttctgcaggagcagaagcctccggacacctgccaacatgttcattatcaac ctcgcggtcagcgacttcctcatgtccttcacccaggcccctgtcttcttcaccagtagc ctctataagcagtggctctttggggagacaggctgcgagttctatgccttctgtggagct ctctttggcatttcctccatgatcaccctgacggccatcgccctggaccgctacctggta atcacacgcccgctggccacctttggtgtggcgtccaagaggcgtgcggcatttgtcctg ctgggcgtttggctctatgccctggcctggagtctgccacccttcttcggctggagcgcc tacgtgcccgaggggttgctgacatcctgctcctgggactacatgagcttcacgccggcc gtgcgtgcctacaccatgcttctctgctgcttcgtgttcttcctccctctgcttatcatc atctactgctacatcttcatcttcagggccatccgggagacaggacgggctctccagacc ttcggggcctgcaagggcaatggcgagtccctgtggcagcggcagcggctgcagagcgag tgcaagatggccaagatcatgctgctggtcatcctcctcttcgtgctctcctgggctccc tattccgctgtggccctggtggcctttgctgggtacgcacacgtcctgacaccctacatg agctcggtgccagccgtcatcgccaaggcctctgcaatccacaaccccatcatttacgcc atcacccaccccaagtacagggtggccattgcccagcacctgccctgcctgggggtgctg ctgggtgtatcacgccggcacagtcgcccctaccccagctaccgctccacccaccgctcc acgctgaccagccacacctccaacctcagctggatctccatacggaggcgccaggagtcc ctgggctcggagagtgaggtgggctggacacacatggaggcagcagctgtgtggggagct gcccagcaagcaaatgggcggtccctctacggtcagggtctggaggacttggaagccaag gcaccccccagaccccagggacacgaagcagagactccagggaagacccaggattatgct gtgagcctgcaggctttggaagtggccctgtcacccgtgctgcacgggattcacagcccc agccccatggcccctctccacacctcaaaactcctgccccataacgtcctccgcatccac tttccagctcagcagccgcacccgaggctcagcctgagggaggcggccgctgacagcacc agcatgtcttacagtgtgaccctgactgggcccgggccctggggcttccgtctgcagggg ggcaaggacttcaacatgcccctcactatctcccggatcacaccaggcagcaaggcagcc cagtcccagctcagccagggtgacctcgtggtggccattgacggcgtcaacacagacacc atgacccacctggaagcccagaacaagatcaagtctgccagctacaacttgagcctcacc ctgcagaaatcaaagcgtcccattcccatctccacgacagcacctccagtccagacccct ctgccggtgatccctcaccagaaggaccccgctctggacacgaacggcagcctggtggca cccagccccagccctgaggcgagggccagcccaggcaccccaggcaccccggagctcagg cccacctttagccctgccttctcccggccctccgccttctcctcactcgccgaggcctct gaccctggccctccgcgggccagcctgagggccaagaccagcccagagggggcccgggac ctactcggcccaaaagccctgccgggctcgagccagccgaggcaatataacaaccccatt ggcctgtactcggcagagaccctgagggagatggctcagatgtaccagatgagcctccga gggaaggcctcgggtgtcggactcccaggaggcgccgactaccaggaacgcttcaacccc agtgccctgaaggactcggccctgtccacccacaagcccatcgaggtgaaggggctgggc ggcaaggccaccatcatccatgcgcagtacaacacgcccatcagcatgtattcccaggat gccatcatggatgccatcgctgggcaggcccaagcccaaggcagtgacttcagtgggctc tatgaggaggctggaagaagagacttctacagtcctcttaacaaggggaccattctggag gcccccaggcctctcacagcgaagctgtcctttaaggagggggaagttaccaggttcttc cagcaatga >gi568815588f:86554784_86765748|GENSCAN_predicted_peptide_4|639_aa MDAHSRLGFPLLGRASPLASLPIKDLAVDSASPVYQAVIKSQNKPEDEADEWARRSSNLQ SRSFRILAQMTGTEFSECRLSGWLQRREMLRGHQGPGPSTAEARESCYLPLKCKTLMKKL CEGQGKGLKRNVTAHVLPNCATGTMAFQPKSLMLKAKRLPGIPPPQQAGLPPSLPPHRSG ITPIEHAPVCTSQATTPLLPASAQPPAAASPSAASPPLATAAAHTAIASASTTAPASSPA DSPRPQASSYSPAVAASSAPATHTSYSEGPAAPAPKPRVVTTASIRPSVYQPAVGQNPME PASGLGETGSSDPGVREGGEPARDPERRGRSRSSPRGLLRAEGGVRRPDLIAGARAPAYT PSPAPNYNPAPSVAYSGGPAEPASRPPWVTDDSFSQKFAPGKSTTSISKQTLPRGGPAYT PAGPQVPPLARGTVQRAERFPASSRTPLCGHCNNVIRYGPAVPLHWGTGRAGPFLVAMGR SWHPEEFTCAYCKTSLADVCFVEEQNNVYCERCYEQFFAPLCAKCNTKIMGEVMHALRQT WHTTCFVCAACKKPFGNSLFHMEDGEPYCEKDYINLFSTKCHGCDFPVEAGDKFIEALGH TWHDTCFICAVCHVNLEGQPFYSKKDRPLCKKHAHTINL >gi568815588f:86554784_86765748|GENSCAN_predicted_CDS_4|1920_bp atggacgcgcactcacggctgggttttcctctgcttggcagggcgtcaccgctggcgagc ctccctattaaggaccttgccgtagacagcgcctctcccgtctaccaggctgtgattaag agccagaacaagccagaagatgaggctgacgagtgggcacgccgttcctccaacctgcag tctcgctccttccgcatcctggcccagatgacggggacagaattcagtgagtgcaggctc tcagggtggctgcagaggagggagatgctgaggggccaccagggacctgggcccagcact gcagaagccagggagtcctgctacctgcccttgaagtgcaagaccctgatgaagaagctc tgcgaaggtcaagggaaaggtttgaaacggaacgtaacagcccacgttttgccaaattgc gcaactggcaccatggcctttcagcccaaatccttaatgttaaaagctaaaaggctgcct ggaatccccccaccccaacaggctggactccctccatccttacccccacacagatctggc atcacccctattgagcatgcgccggtgtgcaccagccaggccaccaccccgctgctgccc gcttctgcccagccacctgctgctgcctctcccagtgcggcttcgccacccctggccaca gctgctgcccacactgccatcgcctccgcctccaccacagcccctgcttcaagtcctgcc gacagcccaaggccccaggcctcttcctacagccccgcagtggccgcctcttcagcacct gccacccacaccagctacagtgagggccccgccgcccctgcacccaagccccgggttgtc accactgccagcatccggccttctgtctaccagccagctgtgggtcagaatcctatggag cctgcatctggtctgggggagacaggctcttcggatccgggcgtcagagagggcggggag cccgcgcgggacccggagcgccgcggccgcagccgcagctccccccggggcctcctccgg gccgagggcggcgttcgcaggccggatcttatcgctggcgcccgagcaccagcctatacc ccctcacctgcccccaactataaccctgcaccctcggtggcctacagcgggggccctgcg gagcctgccagccgtccaccctgggtgacagatgatagcttctcccagaagtttgccccg ggcaagagcaccacctccatcagcaagcagaccctgccccggggaggcccagcctacacc ccagcgggtcctcaggtgccaccacttgccagggggaccgtccagagggctgagcgattc ccagccagcagccggactccactctgcggtcactgcaacaatgtcatccggtatggtcca gctgtgcccctgcactggggcactggaagggcgggcccatttctggtagccatgggccgt tcttggcaccctgaagagttcacctgtgcctactgcaagacttccctggcagatgtgtgc tttgtggaagagcagaacaacgtttactgtgagcgatgttatgagcaattctttgccccg ctgtgtgccaagtgcaacaccaaaattatgggggaagtaatgcatgccttgagacagaca tggcacaccacctgcttcgtctgtgcggcctgcaagaagccttttgggaacagcctcttc cacatggaagacggggagccctactgcgagaaagactacatcaatctgttcagcaccaag tgccatggctgcgatttccccgtggaggctggcgacaagtttatcgaagccctgggccac acttggcacgacacctgcttcatttgcgcagtctgccatgtgaatctggaggggcagccg ttctactccaagaaggacagacccctgtgcaagaagcacgcacacaccatcaacttgtag >gi568815588f:86554784_86765748|GENSCAN_predicted_peptide_5|256_aa MPLEFAGSQVTGARARRAWRSRIDEGSRSPRQILPAGEPPSPCLRLGTLARSPGRDAGSG GWARNKCNVTYIFLCRNYHDLAIPHLGDLVPSQRPYFTPADWRSPQCECRPEEGSTRKSP REWGYPAPKAGAGVQGADGRSARQPCPRSPPPRSIACYLRRLREATRAAAVGLACGEGPA GPRGGAAPTPARTSSSAPPCRRLALLPGNWAPGSANSSGASAAEFGDAEWAKVAPGTLLE SLNGKGAHLPAPERRG >gi568815588f:86554784_86765748|GENSCAN_predicted_CDS_5|771_bp atgccgctggaattcgcgggctcgcaggtcacaggagcccgggcgcgccgggcctggcgc tcccgaatcgatgagggaagccgctctccccggcagatcctcccggccggggagcctcca tcaccctgcctgcgcctcggcacgctggcaaggagcccgggaagagacgccgggagcggt gggtgggcgagaaacaaatgcaatgtgacttatatttttctttgtcgcaactaccacgac ctggccatcccccaccttggtgacttggtgccttcccagcgaccttatttcacgccagct gactggcgttcaccgcagtgtgagtgtcggccggaggaggggtccacgcgaaagagcccg cgagagtggggatatccggccccgaaggccggagcgggcgtccagggcgccgatggccgc agcgcccggcagccgtgtccacgctcccctccgccgcgttccatcgcgtgctacctacgg cgtctgcgggaagctacccgggcggcagctgtggggctggcttgtggggaggggccggcg ggcccgcgcggaggagcagccccgactcccgcccgcaccagcagctcggcgcccccttgc cggcggctggccctcctccccggcaactgggcgccgggctccgcgaactcttcgggcgct tccgccgccgagtttggggatgcggagtgggcgaaagtcgctccggggacgcttctggaa tccttaaatgggaaaggcgctcacctccccgccccggagcggcgaggctga