GENSCAN 1.0 Date run: 8-Nov-116 Time: 08:37:18 Sequence gi568815587r:58302468_58503409 : 200942 bp : 36.49% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 187 182 6 1.05 1.01 Sngl - 3506 2919 588 2 0 10 44 290 0.719 12.73 1.00 Prom - 3754 3715 40 -9.65 2.02 PlyA - 3855 3850 6 1.05 2.01 Sngl - 5280 3976 1305 0 0 49 48 536 0.629 41.47 2.00 Prom - 5670 5631 40 -4.75 3.05 PlyA - 5908 5903 6 1.05 3.04 Term - 6725 6574 152 0 2 54 43 209 0.994 10.09 3.03 Intr - 7863 7705 159 1 0 23 42 145 0.177 2.44 3.02 Intr - 8721 8615 107 2 2 64 29 134 0.206 4.04 3.01 Init - 22793 22693 101 2 2 26 97 78 0.448 2.28 3.00 Prom - 24034 23995 40 -2.55 4.03 PlyA - 24155 24150 6 1.05 4.02 Term - 46729 46422 308 2 2 88 34 122 0.169 1.09 4.01 Init - 52032 51702 331 0 1 68 63 235 0.520 16.55 4.00 Prom - 54488 54449 40 -6.25 5.03 PlyA - 54633 54628 6 1.05 5.02 Term - 56630 55658 973 1 1 79 39 379 0.805 22.63 5.01 Init - 58090 57909 182 1 2 60 93 101 0.779 6.50 5.00 Prom - 61185 61146 40 -3.65 6.00 Prom + 61917 61956 40 -3.75 6.01 Sngl + 68508 69389 882 2 0 42 43 378 0.980 24.47 6.02 PlyA + 70758 70763 6 1.05 7.00 Prom + 71568 71607 40 -3.85 7.01 Init + 88455 88638 184 2 1 69 40 152 0.055 5.84 7.02 Term + 97292 97575 284 0 2 66 43 155 0.151 3.40 7.03 PlyA + 98301 98306 6 1.05 8.07 PlyA - 98731 98726 6 1.05 8.06 Term - 100206 99998 209 1 2 79 38 96 0.116 0.22 8.05 Intr - 100757 100444 314 1 2 56 84 211 0.043 12.40 8.04 Intr - 106813 106677 137 1 2 92 14 116 0.009 3.05 8.03 Intr - 120609 120384 226 2 1 72 34 209 0.137 10.96 8.02 Intr - 125639 125552 88 0 1 85 116 12 0.647 1.81 8.01 Init - 125988 125835 154 0 1 94 49 112 0.748 8.09 8.00 Prom - 130856 130817 40 -5.55 9.02 PlyA - 132928 132923 6 1.05 9.01 Sngl - 137684 136740 945 2 0 52 32 311 0.970 18.19 9.00 Prom - 151146 151107 40 -3.65 10.00 Prom + 155221 155260 40 -5.05 10.01 Init + 155404 155479 76 0 1 83 99 54 0.723 7.30 10.02 Intr + 164059 164147 89 2 2 53 40 53 0.032 -4.23 10.03 Term + 181303 181524 222 0 0 91 43 165 0.710 8.23 10.04 PlyA + 182420 182425 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:58302468_58503409|GENSCAN_predicted_peptide_1|195_aa MCKNRKHSYTPITDKQKLPFTIASKGIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTKKW KNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTEMEKTTLTFIWNQKRARIAKS ILSQKNKAGGITLPDFKLYYKATVTKTSWYWYQNRDIDQWNRTKPSEIIPHIYNYLIFDK TDKNKQWGKVSLFNK >gi568815587r:58302468_58503409|GENSCAN_predicted_CDS_1|588_bp atgtgcaaaaatcgcaagcattcttatacaccaataacagacaaacagaaactcccattc acaattgcttcaaagggaataaaatacctaggaatccaacttacaagggatgtgaaggac ctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggatacaaagaaatgg aagaacattccatgctcatgggtaggaagaatcaatattgtgaaaatggctatactgccc aaggtcatttatagattcaatgccatccccatcaagctaccaatgactttcttcacagaa atggaaaaaactactttaacgttcatatggaaccaaaaaagagcccgcattgccaagtca atcctaagccaaaagaacaaagctggcggcatcacactaccggacttcaaactatactac aaggctacagtaaccaaaacatcatggtactggtaccaaaacagagatatagaccaatgg aacagaacaaagccctcagaaataattccgcatatctacaactatctgatctttgataaa actgacaaaaacaagcaatggggaaaggtttccctgtttaataaatga >gi568815587r:58302468_58503409|GENSCAN_predicted_peptide_2|434_aa MGDFNTPLSTLDRSTRQKVNKDIQELNSALHQVDLIDIYRTLHPKSTEYTFSLAAHHTYS KIEHIVGSKALLSKSKRTEVVTNCLSDHSAIKLELRIKKHTQNHSTTRKLNNLLLNDYWV HNKMKAEIKMFFETNENRDTTYQNLWDTFKAVCTGKFMALTAHKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKIRAELKEIETKNTFQKISESRSWFFEKINKIDRPLAGLIK KKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDTFLDTYTLPRLN QEEVESLNRPITGSEIEAIINSLPTKKSPGPDGFTAEFYQRYKEELVPLLLKLFQSIEKE GILPNSFYEARIILIPKPGRDTTKKENFRPISLINIDAKILNKILANQIQQHIKKLTHHD QVGFIPGMQGWFNI >gi568815587r:58302468_58503409|GENSCAN_predicted_CDS_2|1305_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagttaac aaggatatccaggaattgaactcagctctgcaccaagtggacctaatagacatctacaga actctccaccccaaatcaacagaatatacattctccttagcagcacaccacacttattcc aaaattgagcacatagttggaagtaaagctctcctcagcaaatctaaaagaacagaagtt gtaacaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaaacac actcaaaaccactcaactacacggaaactgaacaacctgctcctgaatgactactgggta cataacaaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacagagacaca acatatcagaatctctgggacacattcaaggcagtgtgtacagggaaatttatggcacta actgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagagaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaag atcagagcagaactgaaggaaatagagacaaaaaacacctttcaaaaaatcagtgaatcc aggagctggttttttgaaaagatcaacaaaattgatagaccgctagcaggactaataaag aagaaaagagagaagaatcaaatagacacaataaaaaatgacaaaggggacattaccact gatcccacagaaatacaaactaccatcagagaatactataaacacctctatgcaaataaa ctagaaaatctagaagaaatggatacattcctcgacacatacaccctcccaagactaaac caggaagaagttgaatctctgaatagaccaataacaggctctgaaattgaggcaataatt aatagcttaccaaccaaaaaaagtccaggaccagatggattcacagccgaattctatcag aggtacaaagaggagctggtaccattacttctgaaactattccaatcaatagaaaaagag ggaatcctccctaactcattttatgaggccagaatcatcctgataccaaagcctggcaga gacacaaccaaaaaagagaattttagaccaatatccttgataaacattgatgcaaaaatc ctcaataaaatactggcaaaccaaatccagcagcacatcaaaaagcttactcaccatgat caagtgggcttcatccctgggatgcaaggctggttcaacatatga >gi568815587r:58302468_58503409|GENSCAN_predicted_peptide_3|172_aa MPTLKRNKSKIIKLILRHEKLEKEEKAKPKASKRRKTSVSAAASVNATILISKVQGPWRV LGQERDLLPCKTLAFNEKYVALAAAQEFGHTWYLSQVNDRMTAEKRDKFPTGQQAIPVWI LTECSSSPAMEQSWTENDFDELREEGFRRSNYSKLKEEVRMHGKEVETLKKN >gi568815587r:58302468_58503409|GENSCAN_predicted_CDS_3|519_bp atgcctacattaaaaagaaataagtctaaaatcattaaactaattttgcgccatgagaaa ctagaaaaagaagagaaagctaaacccaaagctagcaaaaggaggaaaactagtgtttct gctgctgcatcggtgaacgcaactattctgatcagcaaggtccagggaccgtggcgggtt cttggacaagagagggatctgctgccatgtaaaaccctggcctttaatgaaaagtatgtg gctttagctgcagcccaagagtttggacatacctggtatcttagtcaagtaaatgataga atgacagctgaaaaaagggacaaattccctactggtcagcaagccatcccagtatggatc ctcactgaatgcagctcctcaccagcaatggaacaaagctggacagagaacgactttgac gagttgagagaagaaggcttcagacgatcaaactactccaagctaaaggaggaagttcga atgcatggcaaagaagttgaaaccttgaaaaaaaattag >gi568815587r:58302468_58503409|GENSCAN_predicted_peptide_4|212_aa MDENCFVLSLYSHKGLIISGANTDVVVWDPEGTKIISASTLVQGGNFNLNENMCYYRMPL LTIDPGGAVYENGIFMCTEGTGQTVSCGLSQLNHRKNTLNIKGVTVLPTWVILTSYLFIF ITILKMHSAQGHLKALSTCASHLIAVSIFYGTTIFMHLQPSSSHSMDTDEMASLFYAVFI SMLNLVFYSLRSKEVKNAFKKAVEKAKFFLEL >gi568815587r:58302468_58503409|GENSCAN_predicted_CDS_4|639_bp atggatgagaactgttttgttctcagcctgtattcccacaagggcctcatcatctctgga gccaatactgatgtggtggtgtgggaccctgaaggcacaaagatcatctcagccagcacc ttggtgcagggaggaaatttcaatctcaatgagaacatgtgctactaccgcatgcctctg ctcaccatcgaccctgggggtgctgtgtatgagaatggcatcttcatgtgcactgagggc actggacaaactgtctcctgtggtctttctcagctgaaccacagaaagaacaccttaaac attaagggagtgactgtactccctacctgggttatcttgacctcctacctgttcatattc atcaccatcttgaagatgcactcagctcagggacacttaaaagctttgtccacctgtgcc tctcacctcattgcagtctccatcttctatggaactactatctttatgcacttacagcct agctccagccattccatggacacagatgaaatggcatccttgttctatgctgtgttcatc tccatgctgaaccttgtgttctacagcctgaggagcaaagaagtcaagaatgcattcaaa aaggcggttgagaaggcaaaatttttcttagaactgtga >gi568815587r:58302468_58503409|GENSCAN_predicted_peptide_5|384_aa MNVLLTDSNSNKKIVHKHICSLQSAPKTTNLQPSISDILLSVESNDRKNVSKIKGDCFNT RVSCDSKITSMENNTEVSEFILLGLTNAPELQVPLFIMFTLIYLITLTGNLGMIILILLD SHLHTPMYFFLSNLSLAGIGYSSAVTPKVLTGLLIEDKAISYSACAAQMFFCAVFATVEN YLLSSMAYDRYAAVCNPLHYTTTMTTRVCACLAIGCYVIGFLNASIQIGDTFRLSFCMSN VIHHFFCDKPAVITLTCSEKHISELILVLISSFNVFFALLVTLISYLFILITILKRHTGK GYQKPLSTCGSHLIAIFLFYITVIIMYIRPSSSHSMDTDKIASVFYTMIIPMLSPIVYTL RNKDVKNAFMKVVEKAKYSLDSVF >gi568815587r:58302468_58503409|GENSCAN_predicted_CDS_5|1155_bp atgaatgtccttctgacagattcaaattcaaataaaaagattgtgcataaacacatctgc agcctacagtcagcccccaagactacgaacctccaaccctcaatatctgatatcctgcta agtgttgagagtaatgacaggaagaatgtgtctaagataaaaggggattgtttcaacaca agagtatcttgtgattctaaaataacatccatggagaataatacagaggtgagtgaattc atcctgcttggtctaaccaatgccccagaactacaggttcccctctttatcatgtttacc ctcatctacctcatcactctgactgggaacctggggatgatcatattaatcctgctggac tctcatctccacactcccatgtacttttttctcagtaacctgtctcttgcaggcattggt tactcctcagctgtcactccaaaggttttaactgggttgcttatagaagacaaagccatc tcctacagtgcctgtgctgctcagatgttcttttgtgcagtctttgccactgtggaaaat tacctcttgtcctcaatggcctatgaccgctacgcagcagtgtgtaaccccctacattat accaccaccatgacaacacgtgtgtgtgcttgtctggctataggctgttatgtcattggt tttctgaatgcttctatccaaattggagatacatttcgcctctctttctgcatgtccaat gtgattcatcactttttctgtgacaaaccagcagtcattactctgacctgctctgagaaa cacattagtgagttgattcttgttcttatatcaagttttaatgtcttttttgcacttctt gttaccttgatttcctatctgttcatattgatcaccattcttaagaggcacacaggtaag ggataccagaagcctttatctacctgtggttctcacctcattgccattttcttattttat ataactgtcatcatcatgtacatacgaccaagttccagtcattccatggacacagacaaa attgcatctgtgttctacactatgatcatccccatgctcagtcctatagtctataccctg aggaacaaagacgtgaagaatgcattcatgaaggttgttgagaaggcaaaatattctcta gattcagtcttttaa >gi568815587r:58302468_58503409|GENSCAN_predicted_peptide_6|293_aa MIISIDAEKAFDKIQQRFMLKTRNKLGIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLK TGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFAGDMIVYLENPIIS AQILLKLMSNFSKVSAYKINVQKSQAFLYINNRQTESQIMSELPFTIASKRIKYLGIQLT RDVKELFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPR TFFTELEKTTLKFIWNQKRACIAKSILSQKNKAGGITLPDFKLYYKATVTKTA >gi568815587r:58302468_58503409|GENSCAN_predicted_CDS_6|882_bp atgattatctcaatagatgcagaaaaggcctttgacaaaattcaacaacgcttcatgcta aaaactcgcaataaattaggtattgatgggacgtatttcaaaataataagagctatctat gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa actggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagtt ctggccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagaggaa gtcaaattgtccctgtttgcaggtgacatgattgtatatctagaaaaccccatcatctca gcccaaattctccttaagctgatgagcaacttcagcaaagtctcagcatacaaaatcaat gtacaaaaatcacaagcattcttatacatcaacaacagacaaacagagagccaaatcatg agtgaactcccattcacaattgcttcaaagagaataaaatatctaggaatccaacttaca agggacgtgaaggaactcttcaaggagaactacaaaccactgctcaaggaaataaaagag gatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaa atggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctaccaagg actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc tgcattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacctgac ttcaaactatactacaaggctacagtaaccaaaacagcatga >gi568815587r:58302468_58503409|GENSCAN_predicted_peptide_7|155_aa MCSSSSLLCLSLHTSLQAEGAGSGLDQPRKGLPQCSGRLKGSSSTARVGTKAEEVPRASK GCKSRLTEYWLFGFVLMWPQALKGFAYYWGSVGESVMAFVSSPSGNHQNVLTFGTKEASV YTDYQFVPRLFTGSSKNRIEKGALYVAEDLFFCIP >gi568815587r:58302468_58503409|GENSCAN_predicted_CDS_7|468_bp atgtgcagctccagttccctcctgtgcctctccctccacacctccctgcaagctgaggga gccggctctggcctcgaccagcccaggaaggggctcccacagtgcagcggccggctgaag ggctcctcaagcacggccagagtgggcaccaaggccgaggaggtgccgagagcgagcaag ggctgtaagtcacgcttaactgagtactggctgtttggctttgtgctcatgtggccccag gccctgaagggctttgcatattattggggctctgtaggtgaatcagttatggcctttgtc tcaagtccaagtgggaatcaccagaatgtcttaacttttggcacaaaagaggcttcagtt tatacagactatcagtttgtaccccgactattcacaggatccagcaagaataggatagag aaaggtgccttatatgtggctgaagacttatttttctgtattccttga >gi568815587r:58302468_58503409|GENSCAN_predicted_peptide_8|375_aa MACTSAGPYKVQRAILRPLKKRSSNKQSNCCYANDPMRERVHPRLHGLSRGSGFPIPLST AEAHAILASPFWPCGFLLQSSNLSLVDFGYSSAVTPKVMAGFLRGDKVISYNACAVQMFF FVALATVENYLLASMAYDRYAAVCKPLHYTTTMTASVVWLSEKLYVVTQRHLLMKVSSSV SAPAGTLSLLIIIAGDEISGEYNLSLVDFCYSSAVTPIVMAGFLIEDKVISYNACAAQMY IFVAFATVENYLLASMAYDRYAAVCKPLHYTTTMTTTVCARLAIGSYLCGFLNASIHTGD TFSLSFFGIFYGTIIFMYLQPSSSHSMDTDKMAPVFYTMVIPMLNPLVYSLRNKEVKSAF KKVVEKAKLSVGWSV >gi568815587r:58302468_58503409|GENSCAN_predicted_CDS_8|1128_bp atggcatgtacaagtgctggcccttacaaggttcagagagcaatcctcagacctctgaag aaacgctccagcaataagcagagcaactgctgctatgccaatgaccccatgagggaaaga gtacatcctaggctccatggcctatcaaggggtagtggattcccaattccattgtccaca gctgaagcccatgctattcttgcctctccattctggccatgtggattcctcctccaatca agtaacctgtctctggtggactttggatactcctcagctgtcactcccaaggtcatggct gggttccttagaggagacaaggtcatctcctacaatgcatgtgctgttcagatgttcttc tttgtagccttggccacggtggaaaattacttgttggcctcaatggcctatgaccgctat gcagcagtgtgcaaacccctacactacaccaccaccatgacggccagtgttgtctggcta agtgaaaagctgtatgtggtcactcaaagacacctgctgatgaaggtatcatcatctgtt tctgcacctgctggaacattaagccttttgattatcatagcaggagatgagataagtgga gagtataacttgtctctagtggacttttgctactcttcagctgtcactcccatcgtcatg gctggattccttatagaagacaaggtcatctcttacaatgcatgtgctgctcaaatgtat atctttgtagcttttgccactgtggaaaattacctcttggcctcaatggcctatgaccgc tatgcagcagtgtgcaaacccctacattacaccacaaccatgacaacaactgtgtgtgct cgtctggccataggctcctacctctgtggtttcctgaatgcctccatccacactggggac acatttagtctctctttcttcggcatcttctatgggactattatcttcatgtacttacaa cccagctccagtcactccatggacacagacaaaatggcacctgtgttctatacaatggtc atccccatgctgaaccctctggtctatagtctgaggaacaaggaagtgaagagtgcattc aagaaagttgttgagaaggcaaaattgtctgtaggatggtcagtttaa >gi568815587r:58302468_58503409|GENSCAN_predicted_peptide_9|314_aa MENNTEVTEFILVGLTDDPELQIPLFIVFLFIYLITLVGNLGMIELILLDSCLHTPMYFF LSNLSLVDFGYSSAVTPKVMVGFLTGDKFILYNACATQFFFFVAFITAESFLLASMAYDR YAALCKPLHYTTTMTTNVCACLAIGSYICGFLNASIHTGNTFRLSFCRSNVVEHFFCDAP PLLTLSCSDNYISEMVIFFVVGFNDLFSILVILISYLFIFITIMKMRSPEGRQKAFSTCA SHLTAVSIFYGTGIFMYLRPNSSHFMGTDKMASVFYAIVIPMLNPLVYSLRNKEVKSAFK KTVGKAKASIGFIF >gi568815587r:58302468_58503409|GENSCAN_predicted_CDS_9|945_bp atggagaacaacacagaggtgactgaattcatccttgtggggttaactgatgacccagaa ctgcagatcccactcttcatagtcttccttttcatctacctcatcactctggttgggaac ctggggatgattgaattgattctactggactcctgtctccacacccccatgtacttcttc ctcagtaacctctccctggtggactttggttattcctcagctgtcactcccaaggtgatg gtggggtttctcacaggagacaaattcatattatataatgcttgtgccacacaattcttc ttctttgtagcctttatcactgcagaaagtttcctcctggcatcaatggcctatgaccgc tatgcagcattgtgtaaacccctgcattacaccaccaccatgacaacaaatgtatgtgct tgcctggccataggctcctacatctgtggtttcctgaatgcatccattcatactgggaac actttcaggctctccttctgtagatccaatgtagttgaacactttttctgtgatgctcct cctctcttgactctctcatgttcagacaactacatcagtgagatggttattttttttgtg gtgggattcaatgacctcttttctatcctggtaatcttgatctcctacttatttatattt atcaccatcatgaagatgcgctcacctgaaggacgccagaaggccttttctacttgtgct tcccaccttactgcagtttccatcttttatgggacaggaatctttatgtacttacgacct aactccagccatttcatgggcacagacaaaatggcatctgtgttctatgccatagtcatt cccatgttgaatccactggtctacagcctgaggaacaaagaggttaagagtgcctttaaa aagactgtagggaaggcaaaggcctctataggattcatattttaa >gi568815587r:58302468_58503409|GENSCAN_predicted_peptide_10|128_aa MDEAGNPHSQQTNTETENQTQHVLTPAAGAPQSLNTIIIRRLTYLSREKSNSWELGLISF YQGNCEIVLILEEQKHQCVTSVRVKGYGDQVINGALAQFCLTVGPMGPQTHPVVISPVLE CVVGIDSS >gi568815587r:58302468_58503409|GENSCAN_predicted_CDS_10|387_bp atggatgaagctggaaaccctcattctcagcaaactaacacagaaacagaaaaccaaaca cagcatgttctcactccagcagctggagcacctcaaagcctgaacactataatcatcaga aggctcacctacctgagtcgagaaaaatcaaacagctgggaactgggacttatttccttt taccagggtaactgtgaaattgtactaattctggaagaacaaaaacatcagtgtgtcaca tcagtcagagtaaagggttatggagatcaggtgatcaatggagctttagctcagttttgt ctcacagtgggcccaatgggtccccaaacacatccagttgtcatttccccagttctggaa tgcgtagttggaattgatagcagctag