GENSCAN 1.0 Date run: 8-Nov-116 Time: 09:34:58 Sequence gi568815587r:5249805_5354728 : 104924 bp : 38.50% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 84 79 6 1.05 1.03 Term - 3601 3473 129 1 0 83 49 153 0.988 8.10 1.02 Intr - 4710 4488 223 2 1 105 113 224 0.996 23.81 1.01 Init - 4924 4833 92 1 2 96 102 131 0.999 15.31 1.00 Prom - 5014 4975 40 -6.05 2.00 Prom + 13098 13137 40 -3.15 2.01 Init + 13643 13719 77 1 2 95 37 120 0.960 8.21 2.02 Term + 14038 14665 628 1 1 14 41 195 0.802 0.24 2.03 PlyA + 15008 15013 6 1.05 3.04 PlyA - 16364 16359 6 1.05 3.03 Term - 18793 18665 129 1 0 96 44 116 0.861 5.20 3.02 Intr - 19872 19650 223 2 1 56 113 170 0.810 13.51 3.01 Init - 20086 19995 92 1 2 100 115 100 0.991 13.91 3.00 Prom - 25492 25453 40 -6.35 4.05 PlyA - 26051 26046 6 1.05 4.04 Term - 31944 31843 102 0 0 32 47 120 0.893 -0.40 4.03 Intr - 33386 33208 179 0 2 68 98 160 0.970 13.72 4.02 Intr - 34612 34472 141 2 0 14 33 172 0.607 3.70 4.01 Init - 36190 35986 205 1 1 67 11 121 0.541 1.36 4.00 Prom - 38578 38539 40 -8.15 5.00 Prom + 41052 41091 40 -6.85 5.01 Init + 42591 42597 7 2 1 73 60 1 0.407 -3.02 5.02 Intr + 43629 43793 165 1 0 57 53 151 0.651 7.51 5.03 Term + 44526 45157 632 1 2 40 37 299 0.673 13.49 5.04 PlyA + 45749 45754 6 1.05 6.02 PlyA - 46613 46608 6 1.05 6.01 Sngl - 52142 51210 933 2 0 67 52 445 0.995 35.00 6.00 Prom - 54947 54908 40 -4.85 7.00 Prom + 63855 63894 40 -3.95 7.01 Init + 65587 65763 177 0 0 48 65 170 0.033 10.01 7.02 Intr + 74248 74506 259 0 1 87 60 146 0.121 7.81 7.03 Term + 81309 81553 245 1 2 -13 44 200 0.090 0.58 7.04 PlyA + 81980 81985 6 1.05 8.02 PlyA - 83072 83067 6 1.05 8.01 Sngl - 93534 92782 753 0 0 74 39 357 0.931 23.30 8.00 Prom - 93737 93698 40 -1.55 9.00 Prom + 98389 98428 40 -7.25 9.01 Sngl + 101704 102642 939 0 0 73 44 314 0.790 21.75 9.02 PlyA + 104069 104074 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 65587 65853 267 0 0 48 48 268 0.877 13.79 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:5249805_5354728|GENSCAN_predicted_peptide_1|147_aa MGHFTEEDKATITSLWGKVNVEDAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGNPK VKAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIHFG KEFTPEVQASWQKMVTGVASALSSRYH >gi568815587r:5249805_5354728|GENSCAN_predicted_CDS_1|444_bp atgggtcatttcacagaggaggacaaggctactatcacaagcctgtggggcaaggtgaat gtggaagatgctggaggagaaaccctgggaaggctcctggttgtctacccatggacccag aggttctttgacagctttggcaacctgtcctctgcctctgccatcatgggcaaccccaaa gtcaaggcacatggcaagaaggtgctgacttccttgggagatgccataaagcacctggat gatctcaagggcacctttgcccagctgagtgaactgcactgtgacaagctgcatgtggat cctgagaacttcaagctcctgggaaatgtgctggtgaccgttttggcaatccatttcggc aaagaattcacccctgaggtgcaggcttcctggcagaagatggtgactggagtggccagt gccctgtcctccagataccactga >gi568815587r:5249805_5354728|GENSCAN_predicted_peptide_2|234_aa MVLKGKFTALNVDVKESERGQTDNIRKPKGDVRIPGKIQGSSLSQEELDTLNRPVISSKT EMIIKIYQQQKSPGPDRFTAEFYQTFKEELIPILLTVFYKIEKERILPKSFYEASITLIP KLGKNITKKKKHYRSISLMNIDANILDKILANSIRQHIKTHDQVGFMRGMHGWFNIRKSI SVIHNIKRIGNKNHVIISIATETAFNKIQHHFMTKTQQNRFTKDIPQCNKSQLW >gi568815587r:5249805_5354728|GENSCAN_predicted_CDS_2|705_bp atggtgctaaaaggaaagttcacagccctaaatgtcgatgtcaaagagtctgaaagagga cagacagacaatataagaaaacctaaaggagacgttcgaattcctggaaagatacagggc tctagcttaagtcaggaagaattagataccctgaacagaccagtaataagcagcaagact gaaatgataataaaaatttaccaacaacaaaaaagtccaggaccagatagattcacagct gaattctaccagacattcaaagaagaattgataccaatcctattgacagtattctacaag atagagaaagagagaatcctccctaaatcattctatgaagccagtatcaccttaatacca aaactgggaaagaacataaccaaaaaaaaaaaacactacagatcaatatccctgatgaac atagatgccaacatccttgacaaaatactagctaatagcatccgacagcatatcaaaacc catgatcaagtgggtttcatgcgagggatgcacggatggtttaacatacgcaagtcaata agtgtgatacacaacataaagagaattggaaacaaaaatcatgtgattatctcaatagcc acagaaactgcattcaacaaaattcaacatcactttatgactaaaactcagcaaaatcgg tttacaaaggacatacctcaatgtaataaaagccagctatggtaa >gi568815587r:5249805_5354728|GENSCAN_predicted_peptide_3|147_aa MVHFTAEEKAAVTSLWSKMNVEEAGGEALGRLLVVYPWTQRFFDSFGNLSSPSAILGNPK VKAHGKKVLTSFGDAIKNMDNLKPAFAKLSELHCDKLHVDPENFKLLGNVMVIILATHFG KEFTPEVQAAWQKLVSAVAIALAHKYH >gi568815587r:5249805_5354728|GENSCAN_predicted_CDS_3|444_bp atggtgcattttactgctgaggagaaggctgccgtcactagcctgtggagcaagatgaat gtggaagaggctggaggtgaagccttgggcagactcctcgttgtttacccctggacccag agattttttgacagctttggaaacctgtcgtctccctctgccatcctgggcaaccccaag gtcaaggcccatggcaagaaggtgctgacttcctttggagatgctattaaaaacatggac aacctcaagcccgcctttgctaagctgagtgagctgcactgtgacaagctgcatgtggat cctgagaacttcaagctcctgggtaacgtgatggtgattattctggctactcactttggc aaggagttcacccctgaagtgcaggctgcctggcagaagctggtgtctgctgtcgccatt gccctggcccataagtaccactga >gi568815587r:5249805_5354728|GENSCAN_predicted_peptide_4|208_aa MTEGSLIILGCMNQIINRNVETNYGEAGYERALNEMETAMLPAHSGLQHVEIWGFVKTGL KSEAPLDKSKFQKIASTTVRDILPVSEPDVNLAMGWDPPPVEPSNQLLQSKSNAAAKHKL RYCGCEKLEVDIPALWPLLLTFTSCRLEVVVQATVADHTSSTIIAFLQESLREKKTYEIS DEKGERRKAELTVMSDEGAFSSGYQWGL >gi568815587r:5249805_5354728|GENSCAN_predicted_CDS_4|627_bp atgactgaaggctctctcataattcttggttgcatgaatcagattatcaacagaaatgtt gagacaaactatggggaagcagggtatgaaagagctctgaatgaaatggaaaccgcaatg cttcctgcccattcagggctccagcatgtagaaatctggggctttgtgaagactggctta aaatcagaagccccattggataagagcaagttccagaagatagcatcaaccactgttaga gatatactgccagtctcagagcctgatgttaatttagcaatgggctgggaccctcctcca gtagaaccttctaaccagctgctgcagtcaaagtcgaatgcagctgcaaaacacaaactc agatattgtggatgcgagaaattagaagtagatattcctgccctgtggcccttgcttctt acttttacttcttgtcgattggaagttgtggtccaagccacagttgcagaccatacttcc tcaaccataattgcatttcttcaggaaagtttgagggagaaaaagacttatgaaatttct gatgagaaaggagagaggagaaaggcagagctgactgtgatgagtgatgaaggtgccttc tcatctgggtaccagtggggcctctaa >gi568815587r:5249805_5354728|GENSCAN_predicted_peptide_5|267_aa MTGVKLQTFTVNVPALKAAPLELFVPPGGFVVSLASRVKLQTFAVSVTAHKGSVDPNNIR VLRIPSKLRSLAGFTQWIPHLGCRGSCLPVPCRAPALFSPWVVDGTGCPGAGGGARRRGS GRTGAHEGGEAQAWRAAGPEPCPAGRQLRPGEELSTAAAGPGAKPLTARGRWGRLAAPSA GSSEPMPTRNSLKHGAQPGSCPRLSLHTSWQAEGAGSGLGQPRKGLPQCSGRLKGSSSAT NVGAQAEEAPRASEGCEDCQHAVTSQY >gi568815587r:5249805_5354728|GENSCAN_predicted_CDS_5|804_bp atgaccggagtgaagctgcagaccttcacagtgaacgttccagctcttaaggcggcgcct ctggagttgttcgttcctcccggtgggttcgtggtctcgctggcttcaagagtgaagctg cagaccttcgcggtgagcgttacagctcataaaggcagtgtagacccaaacaacataagg gttcttcgcatccccagcaaactcaggagcctagccggcttcacccagtggatcccgcac ctgggctgcagggggagctgcctgccagtcccctgccgtgcgcccgcactcttcagccct tgggtggtcgatgggactgggtgccctggagcagggggcggcgctcgtcggcgaggctcg ggccgcacaggagcccatgaaggtggggaggctcaggcatggcgcgctgcaggtcccgag ccctgccccgcaggaaggcagctaaggcccggcgaggaattgagcacagcagctgctggc ccaggtgctaagcccctcactgcccgcggccggtggggccggctggcggctccgagtgcg gggtcctcggagcccatgcccacccggaactccctcaagcacggcgcacagcccggttcc tgcccgcgtctgtccctccacacctcctggcaagctgagggagccggctctggccttggc cagcccagaaaagggctcccacagtgcagcggccggctgaagggctcctcaagtgccacc aatgtgggagcccaggcagaggaggcgccaagagcgagcgagggctgtgaggactgccag cacgctgtcacctctcaatactaa >gi568815587r:5249805_5354728|GENSCAN_predicted_peptide_6|310_aa MWYNNSAGPFLLTGFLGSEAVHYRISMSFFVIYFSVLFGNGTLLVLIWNDHSLHEPMYYF LAMLADTDLGMTFTTMPTVLGVLLLDQREIAHAACFTQSFIHSLAIVESGILLVLAYDCF IAIRTPLRYNCILTNSRVMNIGLGVLMRGFMSILPIILSLYCYPYCGSRALLHTFCLHQD VIKLACADITFNHIYPIIQTSLTVFLDALIIIFSYILILKTVMGIASGQEEAKSLNTCVS HISCVLVFHITVMGLSFIHRFGKHAPHVVPITMSYVHFLFPPFVNPIIYSIKTKQIQRSI IRLFSGQSRA >gi568815587r:5249805_5354728|GENSCAN_predicted_CDS_6|933_bp atgtggtataacaacagtgctggccccttcttgctgactggcttcttgggctcagaggca gttcactaccggatctctatgtccttctttgtcatctacttctccgtcctttttggaaat ggcactcttcttgtcctcatttggaatgatcacagcctccatgagcccatgtactacttc ctggctatgctggcagacacggaccttgggatgacattcactacaatgcccacagtcctg ggtgtcctgctgctagaccagagggagattgcccatgctgcctgtttcacccaatccttc attcattcactggccattgtagaatcaggtatcttgcttgttttggcctatgactgtttc attgccatccgcacaccactgaggtacaactgcattcttaccaattcccgagtgatgaac ataggactgggggtactgatgagaggttttatgtccattttgcccataattctttcactc tactgctacccatattgtggttcccgtgccctcttgcacacattttgcctccatcaagat gtcataaaactcgcctgtgctgatatcacgtttaatcacatatatccaattattcagact tctttgactgtctttttagatgctctaatcatcatcttttcttatatactaatcctcaag acagtgatgggcattgcgtctggacaagaggaagctaaatctctcaacacttgtgtctcc catattagctgtgtcctagtatttcacatcactgtgatgggactgtcattcattcacagg tttgggaaacatgcacctcatgtggtccccattaccatgagctatgtccattttctcttt cctccattcgtgaatcctatcatttatagcatcaagaccaagcagattcaaagaagcatt attcgcctattttctgggcagagtagggcttga >gi568815587r:5249805_5354728|GENSCAN_predicted_peptide_7|226_aa MGVDNVNLQEAGPMDNLFVIHPQDAHYSGHHGQFHHEVLPQCGKETVHGFVKTVVMLDED AHYSRHRGQCHHEVCACQHGEEVVHGLMKTVVMLDEVEEHAIAQKDAHINSKEGDGDPVM SCLQPWKASQQKRGCSNIGPHSVSTENTKDSGHCCQSYTKVYGCQYSQRVVHGLMKSAVF SDKEEEAAIAKDCRDIDDKEGDRTPRMKSLQTWKASEEKRGSSNGG >gi568815587r:5249805_5354728|GENSCAN_predicted_CDS_7|681_bp atgggagtggataatgtaaacctgcaagaagcaggccccatggataacctctttgtgatt catccacaagacgcccattacagtgggcatcatggtcaatttcaccatgaggtcttgcct cagtgtggcaaggaaacagtacatgggttcgtgaagactgtggtcatgcttgatgaggat gcccattacagtaggcatcgtggtcaatgtcaccatgaggtctgtgcctgccagcatggt gaggaagtagtacatgggctcatgaagactgtggtcatgcttgatgaggtagaggagcat gccattgcccagaaggatgcacacataaacagcaaagaaggggatggagatccagtgatg agctgcctccagccctggaaagccagtcagcaaaaaaggggctgcagtaatattgggcca catagtgtatccaccgagaacaccaaggacagtggacattgttgtcaaagttacaccaag gtctatggctgccagtatagtcagagagtagtacacgggctcatgaagagtgcagtcttc tctgataaagaggaggaggctgccattgccaaggactgtagagacatagatgataaagaa ggagatagaactccaaggatgaaatctctccagacctggaaagccagtgaggagaaaagg ggcagcagtaatggtgggtga >gi568815587r:5249805_5354728|GENSCAN_predicted_peptide_8|250_aa MLAATDLGLALTTMPTVLGVLWLDHREIGSAACFSQAYFIHSLSFLESGILLAMAYDRFI AICNPLRYTSVLTNTRVVKIGLGVLMRGFVSVVPPIRPLYFFLYCHSHVLSHAFCLHQDV IKLACADTTFNRLYPAVLVVFIFVLDYLIIFISYVLILKTVLSIASREERAKALITCVSH ICCVLVFYVTVIGLSLIHRFGKQVPHIVHLIMSYAYFLFPPLMNPITYSVKTKQIQNAIL HLFTTHRIGT >gi568815587r:5249805_5354728|GENSCAN_predicted_CDS_8|753_bp atgctggctgccacagacctggggctggccctgaccacaatgcccacggtgctgggagtc ctctggctggatcacagggagattggaagtgcggcctgcttttcccaggcctactttata cactcactttcctttctcgagtctggcattctgcttgccatggcctatgaccgttttatt gccatctgcaaccctcttagatatacctctgtacttactaatactcgagtagtgaagatt gggctgggagttctgatgaggggatttgtatccgttgttcccccaatcaggcccctctat ttttttctgtattgtcactcccatgttctttcacatgcattctgccttcaccaggatgtc attaaactcgcctgtgctgataccaccttcaaccgactgtacccagctgtgcttgtagtc tttatatttgtgctggattatctgattatcttcatctcctatgtgttgatactcaagact gtcctgagcattgcctccagagaggagagggccaaggctctcattacctgtgtctcccat atctgctgtgtcctggttttttatgtcacagtgattggattgtctctgattcatcgtttt ggaaagcaggttccacatattgttcacctcattatgagctatgcctattttctgttccct ccactaatgaatcctataacatatagtgtcaagaccaagcagattcagaatgccattctt cacctttttactacccatagaattggaacctga >gi568815587r:5249805_5354728|GENSCAN_predicted_peptide_9|312_aa MGLNKSASTFQLTGFPGMEKAHHWIFIPLLAAYISILLGNGTLLFLIRNDHNLHEPMYYF LAMLAATDLGVTLTTMPTVLGVLWLDHREIGHGACFSQAYFIHTLSVMESGVLLAMAYDC FITIRSPLRYTSILTNTQVMKIGVRVLTRAGLSIMPIVVRLHWFPYCRSHVLSHAFCLHQ DVIKLACADITFNRLYPVVVLFAMVLLDFLIIFFSYILILKTVMGIGSGGERAKALNTCV SHICCILVFYVTVVCLTFIHRFGKHVPHVVHITMSYIHFLFPPFMNPFIYSIKTKQIQSG ILRLFSLPHSRA >gi568815587r:5249805_5354728|GENSCAN_predicted_CDS_9|939_bp atggggctcaataagtctgcttccaccttccagcttactggcttcccaggcatggagaag gcacatcactggatattcatcccattattggcagcctacatctccatacttcttggcaat ggcactcttctctttctcatcaggaatgatcataacctccatgagcccatgtactatttc ttagctatgttggcagctacagacctcggagtgacattgaccacaatgcccacagtgcta ggtgttctgtggttagatcacagggagattggccatggagcctgcttctctcaggcctat tttatccatactctttctgtcatggagtcaggtgtcttgcttgccatggcttatgactgt ttcattaccatccgcagccccttaagatatacctctatcctgaccaacacccaggtaatg aagattggtgtgcgggtattgacaagggctggtctgtccattatgccaatagttgttcgc ctacactggtttccctactgtcgatcccatgtactctcccatgctttctgtctacaccaa gatgtcatcaagctagcctgtgctgacatcaccttcaaccgtctctatccagttgtagtt ttatttgcaatggtcttgttggactttctcatcatctttttctcctacattttgattctc aagactgtcatgggcattggttctggaggagaaagggccaaggccctcaacacatgtgtc tctcatatctgctgcatcctggtcttctatgtcactgtagtttgtctgacatttattcat aggtttggaaagcatgttcctcatgtcgttcacatcacaatgagctacatccacttcctt ttcccaccttttatgaacccatttatctatagcattaaaactaagcagattcagagtggc atacttcgtttattctctctgcctcactctagagcatga