GENSCAN 1.0    Date run: 8-Nov-116    Time: 00:21:07

Sequence gi568815587r:58322335_58523261 : 200927 bp : 36.55% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  21480  21613  134  1  2   97   78    63 0.188   5.64
 1.02 Intr +  26627  26718   92  1  2  106   87    65 0.573   6.07
 1.03 Term +  26932  27040  109  1  1   46   43   107 0.561  -1.00
 1.04 PlyA +  27189  27194    6                              -0.45

 2.02 PlyA -  27675  27670    6                               1.05
 2.01 Sngl -  32165  31827  339  2  0   68   37   234 0.202  12.11
 2.00 Prom -  34621  34582   40                              -6.25

 3.03 PlyA -  34766  34761    6                               1.05
 3.02 Term -  36763  35791  973  0  1   79   39   379 0.805  22.63
 3.01 Init -  38223  38042  182  0  2   60   93   101 0.779   6.50
 3.00 Prom -  41318  41279   40                              -3.65

 4.00 Prom +  42050  42089   40                              -3.75
 4.01 Sngl +  48641  49522  882  1  0   42   43   378 0.980  24.47
 4.02 PlyA +  50891  50896    6                               1.05

 5.00 Prom +  51701  51740   40                              -3.85
 5.01 Init +  68588  68771  184  1  1   69   40   152 0.055   5.84
 5.02 Term +  77425  77708  284  2  2   66   43   155 0.151   3.40
 5.03 PlyA +  78434  78439    6                               1.05

 6.07 PlyA -  78864  78859    6                               1.05
 6.06 Term -  80339  80131  209  0  2   79   38    96 0.116   0.22
 6.05 Intr -  80890  80577  314  0  2   56   84   211 0.043  12.40
 6.04 Intr -  86946  86810  137  0  2   92   14   116 0.009   3.05
 6.03 Intr - 100742 100517  226  1  1   72   34   209 0.137  10.96
 6.02 Intr - 105772 105685   88  2  1   85  116    12 0.647   1.81
 6.01 Init - 106121 105968  154  2  1   94   49   112 0.748   8.09
 6.00 Prom - 110989 110950   40                              -5.55

 7.02 PlyA - 113061 113056    6                               1.05
 7.01 Sngl - 117817 116873  945  1  0   52   32   311 0.970  18.19
 7.00 Prom - 131279 131240   40                              -3.65

 8.00 Prom + 135354 135393   40                              -5.05
 8.01 Init + 135537 135612   76  2  1   83   99    54 0.723   7.30
 8.02 Intr + 144192 144280   89  1  2   53   40    53 0.032  -4.23
 8.03 Term + 161436 161657  222  2  0   91   43   165 0.709   8.23
 8.04 PlyA + 162553 162558    6                               1.05

 9.05 PlyA - 163027 163022    6                               1.05
 9.04 Term - 171420 171302  119  1  2   76   47    46 0.022  -2.98
 9.03 Intr - 173521 173441   81  2  0   46   86    85 0.025   2.79
 9.02 Intr - 183299 183135  165  0  0  120   86    27 0.264   4.71
 9.01 Init - 185771 184886  886  2  1  107  -19   480 0.281  33.82
 9.00 Prom - 186493 186454   40                              -6.45

10.03 PlyA - 188921 188916    6                               1.05
10.02 Term - 189521 189484   38  0  2  117   50    39 0.825  -0.48
10.01 Intr - 192325 192217  109  1  1   53   58   166 0.925   9.04

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:58322335_58523261|GENSCAN_predicted_peptide_1|111_aa
XHMVHHGEDNSIKHSCNFVCVNGMTGAGLYMHEDNSPIEDGNCSEAVEHKVQHGDEHSIE
QGCHFICVHGMAGARLARHSHDWWNITEKVMDKIGLTERETEYVPNAEGGI

>gi568815587r:58322335_58523261|GENSCAN_predicted_CDS_1|336_bp
naccacatggttcaccatggggaagacaatagtataaaacacagctgcaattttgtctgt
gtcaatggaatgactggagctgggctgtatatgcatgaagataatagtcccatagaagat
ggcaactgcagtgaggctgtagaacacaaggttcagcatggagatgaacacagcatagaa
caaggatgccatttcatctgtgtccatggaatggctggagctaggctagcaagacacagc
catgactggtggaacatcacagaaaaggtgatggacaagattggacttacagaaagagag
actgaatatgtccccaatgcggagggaggcatttag

>gi568815587r:58322335_58523261|GENSCAN_predicted_peptide_2|112_aa
MDENCFVLSLYSHKGLIISGANTDVVVWDPEGTKIISASTLVQGGNFNLNENMCYYRMPL
LTIDPGGAVYENGIFMCTEGTGQTVSCGLSQLNHRKNTLNIKGVTVLPTWGM

>gi568815587r:58322335_58523261|GENSCAN_predicted_CDS_2|339_bp
atggatgagaactgttttgttctcagcctgtattcccacaagggcctcatcatctctgga
gccaatactgatgtggtggtgtgggaccctgaaggcacaaagatcatctcagccagcacc
ttggtgcagggaggaaatttcaatctcaatgagaacatgtgctactaccgcatgcctctg
ctcaccatcgaccctgggggtgctgtgtatgagaatggcatcttcatgtgcactgagggc
actggacaaactgtctcctgtggtctttctcagctgaaccacagaaagaacaccttaaac
attaagggagtgactgtactccctacctggggtatgtag

>gi568815587r:58322335_58523261|GENSCAN_predicted_peptide_3|384_aa
MNVLLTDSNSNKKIVHKHICSLQSAPKTTNLQPSISDILLSVESNDRKNVSKIKGDCFNT
RVSCDSKITSMENNTEVSEFILLGLTNAPELQVPLFIMFTLIYLITLTGNLGMIILILLD
SHLHTPMYFFLSNLSLAGIGYSSAVTPKVLTGLLIEDKAISYSACAAQMFFCAVFATVEN
YLLSSMAYDRYAAVCNPLHYTTTMTTRVCACLAIGCYVIGFLNASIQIGDTFRLSFCMSN
VIHHFFCDKPAVITLTCSEKHISELILVLISSFNVFFALLVTLISYLFILITILKRHTGK
GYQKPLSTCGSHLIAIFLFYITVIIMYIRPSSSHSMDTDKIASVFYTMIIPMLSPIVYTL
RNKDVKNAFMKVVEKAKYSLDSVF

>gi568815587r:58322335_58523261|GENSCAN_predicted_CDS_3|1155_bp
atgaatgtccttctgacagattcaaattcaaataaaaagattgtgcataaacacatctgc
agcctacagtcagcccccaagactacgaacctccaaccctcaatatctgatatcctgcta
agtgttgagagtaatgacaggaagaatgtgtctaagataaaaggggattgtttcaacaca
agagtatcttgtgattctaaaataacatccatggagaataatacagaggtgagtgaattc
atcctgcttggtctaaccaatgccccagaactacaggttcccctctttatcatgtttacc
ctcatctacctcatcactctgactgggaacctggggatgatcatattaatcctgctggac
tctcatctccacactcccatgtacttttttctcagtaacctgtctcttgcaggcattggt
tactcctcagctgtcactccaaaggttttaactgggttgcttatagaagacaaagccatc
tcctacagtgcctgtgctgctcagatgttcttttgtgcagtctttgccactgtggaaaat
tacctcttgtcctcaatggcctatgaccgctacgcagcagtgtgtaaccccctacattat
accaccaccatgacaacacgtgtgtgtgcttgtctggctataggctgttatgtcattggt
tttctgaatgcttctatccaaattggagatacatttcgcctctctttctgcatgtccaat
gtgattcatcactttttctgtgacaaaccagcagtcattactctgacctgctctgagaaa
cacattagtgagttgattcttgttcttatatcaagttttaatgtcttttttgcacttctt
gttaccttgatttcctatctgttcatattgatcaccattcttaagaggcacacaggtaag
ggataccagaagcctttatctacctgtggttctcacctcattgccattttcttattttat
ataactgtcatcatcatgtacatacgaccaagttccagtcattccatggacacagacaaa
attgcatctgtgttctacactatgatcatccccatgctcagtcctatagtctataccctg
aggaacaaagacgtgaagaatgcattcatgaaggttgttgagaaggcaaaatattctcta
gattcagtcttttaa

>gi568815587r:58322335_58523261|GENSCAN_predicted_peptide_4|293_aa
MIISIDAEKAFDKIQQRFMLKTRNKLGIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLK
TGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFAGDMIVYLENPIIS
AQILLKLMSNFSKVSAYKINVQKSQAFLYINNRQTESQIMSELPFTIASKRIKYLGIQLT
RDVKELFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPR
TFFTELEKTTLKFIWNQKRACIAKSILSQKNKAGGITLPDFKLYYKATVTKTA

>gi568815587r:58322335_58523261|GENSCAN_predicted_CDS_4|882_bp
atgattatctcaatagatgcagaaaaggcctttgacaaaattcaacaacgcttcatgcta
aaaactcgcaataaattaggtattgatgggacgtatttcaaaataataagagctatctat
gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa
actggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagtt
ctggccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagaggaa
gtcaaattgtccctgtttgcaggtgacatgattgtatatctagaaaaccccatcatctca
gcccaaattctccttaagctgatgagcaacttcagcaaagtctcagcatacaaaatcaat
gtacaaaaatcacaagcattcttatacatcaacaacagacaaacagagagccaaatcatg
agtgaactcccattcacaattgcttcaaagagaataaaatatctaggaatccaacttaca
agggacgtgaaggaactcttcaaggagaactacaaaccactgctcaaggaaataaaagag
gatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaa
atggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctaccaagg
actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc
tgcattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacctgac
ttcaaactatactacaaggctacagtaaccaaaacagcatga

>gi568815587r:58322335_58523261|GENSCAN_predicted_peptide_5|155_aa
MCSSSSLLCLSLHTSLQAEGAGSGLDQPRKGLPQCSGRLKGSSSTARVGTKAEEVPRASK
GCKSRLTEYWLFGFVLMWPQALKGFAYYWGSVGESVMAFVSSPSGNHQNVLTFGTKEASV
YTDYQFVPRLFTGSSKNRIEKGALYVAEDLFFCIP

>gi568815587r:58322335_58523261|GENSCAN_predicted_CDS_5|468_bp
atgtgcagctccagttccctcctgtgcctctccctccacacctccctgcaagctgaggga
gccggctctggcctcgaccagcccaggaaggggctcccacagtgcagcggccggctgaag
ggctcctcaagcacggccagagtgggcaccaaggccgaggaggtgccgagagcgagcaag
ggctgtaagtcacgcttaactgagtactggctgtttggctttgtgctcatgtggccccag
gccctgaagggctttgcatattattggggctctgtaggtgaatcagttatggcctttgtc
tcaagtccaagtgggaatcaccagaatgtcttaacttttggcacaaaagaggcttcagtt
tatacagactatcagtttgtaccccgactattcacaggatccagcaagaataggatagag
aaaggtgccttatatgtggctgaagacttatttttctgtattccttga

>gi568815587r:58322335_58523261|GENSCAN_predicted_peptide_6|375_aa
MACTSAGPYKVQRAILRPLKKRSSNKQSNCCYANDPMRERVHPRLHGLSRGSGFPIPLST
AEAHAILASPFWPCGFLLQSSNLSLVDFGYSSAVTPKVMAGFLRGDKVISYNACAVQMFF
FVALATVENYLLASMAYDRYAAVCKPLHYTTTMTASVVWLSEKLYVVTQRHLLMKVSSSV
SAPAGTLSLLIIIAGDEISGEYNLSLVDFCYSSAVTPIVMAGFLIEDKVISYNACAAQMY
IFVAFATVENYLLASMAYDRYAAVCKPLHYTTTMTTTVCARLAIGSYLCGFLNASIHTGD
TFSLSFFGIFYGTIIFMYLQPSSSHSMDTDKMAPVFYTMVIPMLNPLVYSLRNKEVKSAF
KKVVEKAKLSVGWSV

>gi568815587r:58322335_58523261|GENSCAN_predicted_CDS_6|1128_bp
atggcatgtacaagtgctggcccttacaaggttcagagagcaatcctcagacctctgaag
aaacgctccagcaataagcagagcaactgctgctatgccaatgaccccatgagggaaaga
gtacatcctaggctccatggcctatcaaggggtagtggattcccaattccattgtccaca
gctgaagcccatgctattcttgcctctccattctggccatgtggattcctcctccaatca
agtaacctgtctctggtggactttggatactcctcagctgtcactcccaaggtcatggct
gggttccttagaggagacaaggtcatctcctacaatgcatgtgctgttcagatgttcttc
tttgtagccttggccacggtggaaaattacttgttggcctcaatggcctatgaccgctat
gcagcagtgtgcaaacccctacactacaccaccaccatgacggccagtgttgtctggcta
agtgaaaagctgtatgtggtcactcaaagacacctgctgatgaaggtatcatcatctgtt
tctgcacctgctggaacattaagccttttgattatcatagcaggagatgagataagtgga
gagtataacttgtctctagtggacttttgctactcttcagctgtcactcccatcgtcatg
gctggattccttatagaagacaaggtcatctcttacaatgcatgtgctgctcaaatgtat
atctttgtagcttttgccactgtggaaaattacctcttggcctcaatggcctatgaccgc
tatgcagcagtgtgcaaacccctacattacaccacaaccatgacaacaactgtgtgtgct
cgtctggccataggctcctacctctgtggtttcctgaatgcctccatccacactggggac
acatttagtctctctttcttcggcatcttctatgggactattatcttcatgtacttacaa
cccagctccagtcactccatggacacagacaaaatggcacctgtgttctatacaatggtc
atccccatgctgaaccctctggtctatagtctgaggaacaaggaagtgaagagtgcattc
aagaaagttgttgagaaggcaaaattgtctgtaggatggtcagtttaa

>gi568815587r:58322335_58523261|GENSCAN_predicted_peptide_7|314_aa
MENNTEVTEFILVGLTDDPELQIPLFIVFLFIYLITLVGNLGMIELILLDSCLHTPMYFF
LSNLSLVDFGYSSAVTPKVMVGFLTGDKFILYNACATQFFFFVAFITAESFLLASMAYDR
YAALCKPLHYTTTMTTNVCACLAIGSYICGFLNASIHTGNTFRLSFCRSNVVEHFFCDAP
PLLTLSCSDNYISEMVIFFVVGFNDLFSILVILISYLFIFITIMKMRSPEGRQKAFSTCA
SHLTAVSIFYGTGIFMYLRPNSSHFMGTDKMASVFYAIVIPMLNPLVYSLRNKEVKSAFK
KTVGKAKASIGFIF

>gi568815587r:58322335_58523261|GENSCAN_predicted_CDS_7|945_bp
atggagaacaacacagaggtgactgaattcatccttgtggggttaactgatgacccagaa
ctgcagatcccactcttcatagtcttccttttcatctacctcatcactctggttgggaac
ctggggatgattgaattgattctactggactcctgtctccacacccccatgtacttcttc
ctcagtaacctctccctggtggactttggttattcctcagctgtcactcccaaggtgatg
gtggggtttctcacaggagacaaattcatattatataatgcttgtgccacacaattcttc
ttctttgtagcctttatcactgcagaaagtttcctcctggcatcaatggcctatgaccgc
tatgcagcattgtgtaaacccctgcattacaccaccaccatgacaacaaatgtatgtgct
tgcctggccataggctcctacatctgtggtttcctgaatgcatccattcatactgggaac
actttcaggctctccttctgtagatccaatgtagttgaacactttttctgtgatgctcct
cctctcttgactctctcatgttcagacaactacatcagtgagatggttattttttttgtg
gtgggattcaatgacctcttttctatcctggtaatcttgatctcctacttatttatattt
atcaccatcatgaagatgcgctcacctgaaggacgccagaaggccttttctacttgtgct
tcccaccttactgcagtttccatcttttatgggacaggaatctttatgtacttacgacct
aactccagccatttcatgggcacagacaaaatggcatctgtgttctatgccatagtcatt
cccatgttgaatccactggtctacagcctgaggaacaaagaggttaagagtgcctttaaa
aagactgtagggaaggcaaaggcctctataggattcatattttaa

>gi568815587r:58322335_58523261|GENSCAN_predicted_peptide_8|128_aa
MDEAGNPHSQQTNTETENQTQHVLTPAAGAPQSLNTIIIRRLTYLSREKSNSWELGLISF
YQGNCEIVLILEEQKHQCVTSVRVKGYGDQVINGALAQFCLTVGPMGPQTHPVVISPVLE
CVVGIDSS

>gi568815587r:58322335_58523261|GENSCAN_predicted_CDS_8|387_bp
atggatgaagctggaaaccctcattctcagcaaactaacacagaaacagaaaaccaaaca
cagcatgttctcactccagcagctggagcacctcaaagcctgaacactataatcatcaga
aggctcacctacctgagtcgagaaaaatcaaacagctgggaactgggacttatttccttt
taccagggtaactgtgaaattgtactaattctggaagaacaaaaacatcagtgtgtcaca
tcagtcagagtaaagggttatggagatcaggtgatcaatggagctttagctcagttttgt
ctcacagtgggcccaatgggtccccaaacacatccagttgtcatttccccagttctggaa
tgcgtagttggaattgatagcagctag

>gi568815587r:58322335_58523261|GENSCAN_predicted_peptide_9|416_aa
MENSTEVTEFILLGLTDDPNLQIPLLLAFLFIYLITLLGNGGMMVIIHSDSHLHTPMYFF
LSNLSLVDLGYSSAVAPKTVAALRSGDKAISYDGCAAQFFFFVGFATVECYLLASMAYDR
HAAVCRPLHYTTTMTAGVCALLATGSYVSGFLNASIHAAGTFRLSFCGSNEINHFFCDIP
PLLALSCSDTRISKLVVFVAGFNVFFTLLVILISYFFICITIQRMHSAEGQKKVFSTCAS
HLTALSIFYGTIIFMYLQPNSSQSVDTDKIASVFYTVVIPMLNPLIYSLRNKEVKTMRPI
QQHTGRTAFTGQDPLHIGIFLIEIYRAQEYLHLEKSLGSSYPSTGCIPISDEFRDLMLFA
VPAEFILEGTDSRTRQQGGTFISQDLLKSFRTGLSCLWSVLCPTYTAQVKRRERNH

>gi568815587r:58322335_58523261|GENSCAN_predicted_CDS_9|1251_bp
atggagaatagcacagaagtgacagagtttatcctcttgggattaacagatgaccccaat
cttcagatacccctcctcctggcatttttattcatctacctcatcaccctgcttgggaat
gggggaatgatggtgatcatccactcagactcccatctccacactccaatgtactttttc
ctcagtaacctctcccttgtagacttgggttactcatcagctgtagcccccaaaacggtg
gctgcattgcggtcaggggacaaggccatctcctacgatggatgtgcagctcagttcttc
ttctttgtggggtttgccactgttgagtgctacctcctggcctccatggcctatgatcgc
catgcagcggtatgtaggcctcttcattacaccaccaccatgacagcaggtgtgtgtgcc
ctccttgctactggttcctatgtctctggcttcctcaatgcctctatccatgcagcaggc
accttcagactctccttctgtggttctaatgagattaatcatttcttctgtgacattccc
ccactcctggctctctcatgctctgacacacgcatcagcaagttggtggtctttgtggca
ggcttcaacgtctttttcaccctcctggtcatccttatttcttacttcttcatatgcatc
accattcagaggatgcattctgctgaagggcagaagaaagtcttctccacctgtgcttcc
catctcactgctttgtccatcttctatggcacaatcatcttcatgtacttacagcccaac
tccagccagtccgtggacacagacaaaatagcctctgtgttttacacagtggtgattccc
atgctgaatcccttgatatacagccttaggaacaaagaagtgaaaacaatgaggcctatt
cagcaacacacaggaagaacagcctttacaggccaggatccactgcatataggaatcttt
ctcattgaaatctacagagctcaagagtatctgcatttggagaagagcctgggaagcagc
tatccaagcactgggtgcattccaatctcagatgaatttcgtgaccttatgctctttgct
gtacctgcagaattcattttggagggcacagactctagaaccagacagcagggagggacc
ttcatatcccaggacctgctgaagtcattccgaacaggtttaagttgcctctggtctgtc
ctctgtcccacctacacagcacaagtaaaaaggagagagagaaaccactga

>gi568815587r:58322335_58523261|GENSCAN_predicted_peptide_10|48_aa
EIATTTSTFSKHHPDQSAAIKAPHRQKKKLQLTEGSDFKKSSVEAVLN

>gi568815587r:58322335_58523261|GENSCAN_predicted_CDS_10|147_bp
gagattgccacaaccacctcaaccttcagcaagcatcatcctgatcagtcagcagccatc
aaggcccctcaccggcaaaagaaaaaattacaactcactgaaggctcagacttcaaaaag
tccagtgttgaagcagttctgaattga