GENSCAN 1.0 Date run: 7-Nov-116 Time: 22:49:17 Sequence gi568815587f:47465623_47671626 : 206004 bp : 45.72% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.16 PlyA - 2797 2792 6 1.05 1.15 Term - 6735 6608 128 1 2 43 53 217 0.959 12.14 1.14 Intr - 7609 7466 144 2 0 42 76 168 0.943 11.25 1.13 Intr - 9899 9714 186 0 0 61 113 188 0.964 18.26 1.12 Intr - 11334 11224 111 1 0 63 110 41 0.588 4.15 1.11 Intr - 11803 11675 129 2 0 72 115 52 0.990 6.97 1.10 Intr - 13330 13255 76 1 1 58 107 36 0.681 1.59 1.09 Intr - 17234 17073 162 2 0 91 73 111 0.962 10.07 1.08 Intr - 17910 17831 80 1 2 111 75 70 0.999 7.27 1.07 Intr - 18901 18767 135 2 0 53 92 158 0.685 13.24 1.06 Intr - 21176 21128 49 2 1 112 103 45 0.983 6.65 1.05 Intr - 21619 21537 83 2 2 72 78 39 0.953 0.66 1.04 Intr - 23402 23215 188 1 2 34 72 228 0.035 15.13 1.03 Intr - 84037 83979 59 1 2 94 103 17 0.076 1.48 1.02 Intr - 87535 87370 166 0 1 79 105 123 0.187 13.06 1.01 Init - 92196 92189 8 0 2 51 95 0 0.092 -2.57 1.00 Prom - 92820 92781 40 -5.56 2.00 Prom + 98073 98112 40 -5.16 2.01 Init + 100001 100174 174 1 0 102 99 392 0.999 39.05 2.02 Intr + 100284 100364 81 2 0 85 94 170 0.998 17.13 2.03 Intr + 104078 104269 192 1 0 54 81 116 0.981 7.19 2.04 Term + 105849 106007 159 2 0 103 41 173 0.999 12.04 2.05 PlyA + 106337 106342 6 -0.45 3.04 PlyA - 106594 106589 6 1.05 3.03 Term - 108168 107308 861 0 0 62 48 728 0.947 58.93 3.02 Intr - 110077 109971 107 2 2 109 116 18 0.846 6.63 3.01 Init - 112377 111789 589 0 1 73 114 745 0.628 70.99 3.00 Prom - 112718 112679 40 -10.45 4.00 Prom + 112770 112809 40 -6.76 4.01 Init + 113470 113536 67 0 1 88 72 150 0.996 12.64 4.02 Intr + 113647 113712 66 2 0 92 78 34 0.759 1.68 4.03 Intr + 114903 115000 98 1 2 96 90 124 0.995 13.13 4.04 Intr + 115213 115362 150 0 0 98 78 98 0.999 10.16 4.05 Intr + 116466 116591 126 2 0 120 105 233 0.998 29.08 4.06 Intr + 116727 116846 120 2 0 69 75 100 0.990 7.49 4.07 Term + 118692 118859 168 2 0 91 36 243 0.999 17.28 4.08 PlyA + 118919 118924 6 1.05 5.00 Prom + 119618 119657 40 -15.35 5.01 Init + 121147 121231 85 0 1 72 81 101 0.716 6.96 5.02 Intr + 122129 122227 99 0 0 107 35 61 0.804 2.78 5.03 Term + 122382 122812 431 1 2 120 46 294 0.867 23.86 5.04 PlyA + 123553 123558 6 -3.44 6.02 PlyA - 124085 124080 6 1.05 6.01 Sngl - 125188 124199 990 1 0 98 52 2258 0.999 218.07 6.00 Prom - 127314 127275 40 -6.76 7.08 PlyA - 127570 127565 6 -0.45 7.07 Term - 127666 127646 21 1 0 74 54 6 0.166 -5.89 7.06 Intr - 128709 128526 184 2 1 54 94 127 0.109 9.69 7.05 Intr - 133731 133678 54 2 0 109 93 11 0.131 1.79 7.04 Intr - 160119 160052 68 0 2 84 94 44 0.387 2.30 7.03 Intr - 169112 169050 63 2 0 51 94 44 0.347 0.21 7.02 Intr - 169949 169923 27 2 0 110 87 38 0.755 4.21 7.01 Init - 176843 176757 87 2 0 117 80 200 0.999 20.74 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 23392 23215 178 1 1 56 72 220 0.959 16.62 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:47465623_47671626|GENSCAN_predicted_peptide_1|567_aa MCLCRKCRHFGVFCSGGSGSGGGTRRLPRDSASAARRRRRLRRRRQQLRQRQAQVQPLPL VTTIISVSTNLTTLDNIISKKMNGTLDHPDQPDLDAIKMFVGQVPRTWSEKDLRELFEQY GAVYEINVLRDRSQNPPQSKGCCFVTFYTRKAALEAQNALHNMKVLPGMHHPIQMKPADS EKNNAVEDRKLFIGMISKKCTENDIRVMFSSFGQIEECRILRGPDGLSRGCAFVTFTTRA MAQTAIKAMHQAQTMEGCSSPMVVKFADTQKDKEQKRMAQQLQQQMQQISAASVWGNLAG LNTLGPQYLALYLQLLQQTASSGNLNTLSSLHPMGGLNAMQLQNLAALAAAASAAQNTPS GTNALTTSSSPLSVLTSSGSSPSSSSSNSVNPIASLGALQTLAGATAGLNVGSLAGMAAL NGGLGSSGLSNGTGSTMEALTQAYSGIQQYAAAALPTLYNQNLLTQQSIGAAGSQKEGPE GANLFIYHLPQEFGDQDLLQMFMPFGNVVSAKVFIDKQTNLSKCFGFVSYDNPVSAQAAI QSMNGFQIGMKRLKVQLKRSKNDSKPY >gi568815587f:47465623_47671626|GENSCAN_predicted_CDS_1|1704_bp atgtgcctgtgccggaagtgccgccattttggggtgttctgctctggcggcagcggcagc ggcggcgggacgcggaggctcccccgggattcggcctcagcagcgaggcggcggcggcgg ctgcggaggcgcaggcagcaactgaggcagcggcaggctcaggtgcagccgctgcccctg gtaaccaccattatttctgtttctaccaatttgactactttagataacatcatctcaaag aaaatgaacggcaccctggaccacccagaccaaccagatcttgatgctatcaagatgttt gtgggccaggttccaaggacctggtctgaaaaggacttgcgggaactcttcgaacagtat ggtgctgtgtatgaaatcaacgtcctaagggataggagccaaaacccgcctcagagcaaa gggtgctgttttgttacattttacacccgtaaagctgcattagaagctcagaatgctctt cacaacatgaaagtcctcccagggatgcatcaccctatacagatgaaacctgctgacagt gagaagaacaatgcagtggaagacaggaagctgtttattggtatgatttccaagaagtgc actgaaaatgacatccgagtcatgttctcttcgtttggacagattgaagaatgccggata ttgcggggacctgatggcctgagccgaggttgtgcatttgtgacttttacaacaagagcc atggcacagacggctatcaaggcaatgcaccaagcacagaccatggagggttgctcatca cccatggtggtaaaatttgctgatacacagaaggacaaagaacagaagagaatggcccag cagctccagcagcagatgcagcaaatcagcgcagcatctgtgtggggaaaccttgctggt ctaaatactcttggaccccagtatttagcactttatttgcagctccttcagcagactgcc tcctctgggaacctcaacaccctgagcagcctccacccaatgggagggttgaatgcaatg cagttacagaatttggctgcactagctgctgcagctagtgcagctcagaacacaccaagt ggtaccaatgctctcactacatccagcagtcccctcagcgtgctcactagttcagggtcc tcacctagctctagcagcagtaattctgtcaaccccatagcctcacttggagccctgcag acattagctggagcaacggctggcctcaatgttggctctttggcaggaatggctgcttta aatggtggcctgggcagcagtggcctttccaatggcaccgggagcaccatggaggccctc actcaggcctactcgggtatccagcaatatgctgctgctgcgctccccactctgtacaac cagaatcttctgacacagcagagtattggtgctgctggaagccagaaggaaggtccagag ggagccaacctgttcatctaccacctgccccaggagtttggtgatcaggacctgctgcag atgtttatgccctttgggaatgtcgtgtctgccaaggttttcatagacaagcagacaaac ctgagcaagtgttttggttttgtaagttacgacaatcctgtttcggcccaagctgccatc cagtccatgaacggctttcagattggcatgaagcggcttaaagtgcagctcaaacgttcg aagaatgacagcaagccctactga >gi568815587f:47465623_47671626|GENSCAN_predicted_peptide_2|201_aa MAATALLEAGLARVLFYPTLLYTLFRGKVPGRAHRDWYHRIDPTVLLGALPLRSLTRQLV QDENVRGVITMNEEYETRFLCNSSQEWKRLGVEQLRLSTVDMTGIPTLDNLQKGVQFALK YQSLGQCVYVHCKAGRSRSATMVAAYLIQVHKWSPEEAVRAIAKIRSYIHIRPGQLDVLK EFHKQITARATKDGTFVISKT >gi568815587f:47465623_47671626|GENSCAN_predicted_CDS_2|606_bp atggcggccaccgcgctgctggaggccggcctggcgcgggtgctcttctacccgacgctg ctctacaccctgttccgcgggaaggtgccgggtcgggcgcaccgggactggtaccaccgc atcgaccccaccgtgctgctgggcgcgctgccgttgcggagcttgacgcgccagctggta caggacgagaacgtgcgcggggtgatcaccatgaacgaggagtacgagacgaggttcctg tgcaactcttcacaggagtggaagagactaggagtcgagcagctgcggctcagcacagta gacatgactgggatccccaccttggacaacctccagaagggagtccaatttgctctcaag taccagtcgctgggccagtgtgtttacgtgcattgtaaggctgggcgctccaggagtgcc actatggtggcagcatacctgattcaggtgcacaaatggagtccagaggaggctgtaaga gccatcgccaagatccggtcatacatccacatcaggcctggccagctggatgttcttaaa gagttccacaagcagattactgcacgggcaacaaaggatgggacttttgtcatttcaaag acatga >gi568815587f:47465623_47671626|GENSCAN_predicted_peptide_3|518_aa MESPEEPGASMDENYFVNYTFKDRSHSGRVAQGIMKLCLEEELFADVTISVEGREFQLHR LVLSAQSCFFRSMFTSNLKEAHNRVIVLQDVSESVFQLLVDYIYHGTVKLRAEELQEIYE VSDMYQLTSLFEECSRFLARTVQVGNCLQVMWLADRHSDPELYTAAKHCAKTHLAQLQNT EEFLHLPHRLLTDIISDGVPCSQNPTEAIEAWINFNKEEREAFAESLRTSLKEIGENVHI YLIGKESSRTHSLAVSLHCAEDDSISVSGQNSLCHQITAACKHGGDLYVVGGSIPRRMWK CNNATVDWEWCAPLPRDRLQHTLVSVPGKDAIYSLGGKTLQDTLSNAVIYYRVGDNVWTE TTQLEVAVSGAAGANLNGIIYLLGGEENDLDFFTKPSRLIQCFDTETDKCHVKPYVLPFA GRMHAAVHKDLVFIVAEGDSLVCYNPLLDSFTRLCLPEAWSSAPSLWKIASCNGSIYVFR DRYKKGDANTYKLDPATSAVTVTRGIKVLLTNLQFVLA >gi568815587f:47465623_47671626|GENSCAN_predicted_CDS_3|1557_bp atggaatcaccagaggagcctggagcatccatggatgagaactactttgtgaactacact ttcaaagatcggtcacattcaggccgtgtggctcaaggcatcatgaaactgtgtctagag gaggagctctttgctgatgtcaccatttcggtggaaggccgggagtttcagctccatcgg ctggtcctctcagctcagagctgcttcttccgatccatgttcacttccaacctgaaggag gcccacaaccgggtgattgtgctgcaggatgtcagcgagtctgttttccagctcctggtt gattatatctaccatgggactgtgaaacttcgagctgaggagttgcaggaaatttatgag gtgtcagacatgtatcagctgacatctctctttgaggaatgctctcggtttttggcccgc acagtgcaagtgggaaactgccttcaggtgatgtggctggcagatcggcacagtgatcct gagctctatacggctgccaagcactgtgccaagacccacctggcccagctgcagaataca gaggaatttctccacttgccccaccgcttactcacagatatcatctcggatggagttccg tgttctcagaacccaacagaggcaatagaagcctggatcaactttaataaagaggaaaga gaggcttttgcagagtcactcaggacaagcttgaaggaaattggggagaatgtgcacatt tacctgattgggaaagagtcatctcgtacccactcgttggctgtgtccttgcactgtgca gaagatgactccatcagtgtaagtggccaaaacagtttgtgccaccagatcactgcggcc tgcaagcatggtggagacttgtatgtggtgggagggtccatcccacggcgcatgtggaag tgcaacaatgccaccgttgactgggagtggtgtgctcctttgcctcgggaccggctccag cacaccctggtgtctgtgcccgggaaagatgccatatattcactgggtggcaagacactg caagataccctctccaacgcagtcatttattatcgcgtaggtgataatgtgtggacagag acaactcagctagaggtggctgtgtcaggggctgctggtgccaacctcaacgggatcatc tacttactagggggggaggagaatgatctggacttctttaccaaaccttcccgactcatc cagtgctttgacacagagacagacaaatgccatgtgaagccctatgtgctgccctttgca ggccgcatgcacgcagctgtgcataaagatctggtgttcatcgtggctgaaggggactcc ctggtgtgctacaatcccttgctagacagcttcacccggctttgccttcctgaggcctgg agctctgccccatccctctggaagattgccagctgtaacgggagcatctatgtcttccgg gaccgatataaaaagggggatgccaacacctacaagcttgaccctgccacttcagccgta actgtcacaagaggtattaaggtgctgcttaccaatttgcagtttgtgttggcctaa >gi568815587f:47465623_47671626|GENSCAN_predicted_peptide_4|264_aa MAAAAVARLWWRGILGASALTRGTGRPSVLLLPVRRESAGADTRPTVRPRNDVAHKQLSA FGEYVAEILPKYVQQVQVSCFNELEVCIHPDGVIPVLTFLRDHTNAQFKSLVDLTAVDVP TRQNRFEIVYNLLSLRFNSRIRVKTYTDELTPIESAVSVFKAANWYEREIWDMFGVFFAN HPDLRRILTDYGFEGHPFRKDFPLSGYVELRYDDEVKRVVAEPVELAQEFRKFDLNSPWE AFPVYRQPPESLKLEAGDKKPDAK >gi568815587f:47465623_47671626|GENSCAN_predicted_CDS_4|795_bp atggcggcggcggcggtagccaggctgtggtggcgcgggatcttgggggcctcggcgctg accagggggactgggcgaccctccgttctgttgctgccggtgaggcgggagagcgccggg gccgacacgcgccccactgtcagaccacggaatgatgtggcccacaagcagctctcagct tttggagagtatgtggctgaaatcttgcccaagtatgtccaacaagttcaggtgtcctgc ttcaatgagttagaggtctgtatccatcctgatggcgtcatcccagtgctgactttcctc agggatcacaccaatgcacagttcaaatctctggttgacttgacagcagtggacgtccca actcggcaaaaccgttttgagattgtctacaacctgttgtctctgcgcttcaactcacgg atccgtgtgaagacctacacagatgagctgacgcccattgagtctgctgtctctgtgttc aaggcagccaactggtatgaaagggagatctgggacatgtttggagtcttctttgctaac caccctgatctaagaaggatcctgacagattatggcttcgagggacatcctttccggaaa gactttcctctatctggctatgttgagttacgttatgatgatgaagtgaagcgggtggtg gcagagccggtggagttggcccaagagttccgcaaatttgacctgaacagcccctgggag gctttcccagtctatcgccaacccccggagagtctcaagcttgaagccggagacaagaag cctgatgccaagtag >gi568815587f:47465623_47671626|GENSCAN_predicted_peptide_5|204_aa MAATLQFLVCLVVAICLLSGVTTTQPHAGQPMDSTSVGGGLQEPEAPEVMFEVFPPSLGH GGHSPPLTWLPLQLLWAGLELDVMGQLHIQDEELASTHPGRRLRLLLQHHVPSDLEGTEQ WLQQLQDLRKGPPLSTWDFEHLLLTGLSCVYRLHAASEAEERGRWAQVFALLAQETLWDL CKGFCPQDRPPSLGSWASILDPFP >gi568815587f:47465623_47671626|GENSCAN_predicted_CDS_5|615_bp atggctgcgaccctgcagttcctggtttgcctggtggtagccatttgtctcctctctggt gtgactacaacccagccccatgcagggcagcccatggacagcaccagcgtgggaggtggc ctgcaggagccagaggccccggaagtgatgtttgaggtctttcctcccagcctgggacat gggggccacagcccaccccttacatggctccccttgcagctgctctgggctgggctggag ctggatgtcatggggcagctgcacatccaggatgaggaactagcgtccacacacccaggc cgccgactcagactcctcctgcagcaccacgtgcccagtgacttggagggcactgagcag tggctgcagcagctccaggacctgcggaaggggcctcctcttagcacttgggactttgaa catctgctcctcacaggcctgtcctgcgtctaccggctccacgcagctagtgaggctgag gaacggggccgctgggcccaggtcttcgctctcctggcacaggaaacactctgggacctg tgcaaaggtttctgcccccaggaccggcccccttccctggggtcctgggcctccatcctt gaccccttcccctga >gi568815587f:47465623_47671626|GENSCAN_predicted_peptide_6|329_aa MLPLLLGLLGPAACWALGPTPGPGSSELRSAFSAARTTPLEGTSEMAVTFDKVYVNIGGD FDVATGQFRCRVPGAYFFSFTAGKAPHKSLSVMLVRNRDEVQALAFDEQRRPGARRAASQ SAMLQLDYGDTVWLRLHGAPQYALGAPGATFSGYLVYADADADAPARGPPAPPEPRSAFS AARTRSLVGSDAGPGPRHQPLAFDTEFVNIGGDFDAAAGVFRCRLPGAYFFSFTLGKLPR KTLSVKLMKNRDEVQAMIYDDGASRRREMQSQSVMLALRRGDAVWLLSHDHDGYGAYSNH GKYITFSGFLVYPDLAPAAPPGLGASELL >gi568815587f:47465623_47671626|GENSCAN_predicted_CDS_6|990_bp atgctgccgcttctgctgggcctgctgggcccagcggcctgctgggccctgggcccgacc cccggcccgggatcctctgagctgcgctcggccttctcggcggcacgcaccacccccctg gagggcacgtcggagatggcggtgaccttcgacaaggtgtacgtgaacatcgggggcgac ttcgatgtggccaccggccagtttcgctgccgcgtgcccggcgcctacttcttctccttc acggctggcaaggccccgcacaagagcctgtcggtgatgctggtgcgaaaccgcgacgag gtgcaggcgctggccttcgacgagcagcggcggccaggcgcgcggcgcgcagccagccag agcgccatgctgcagctcgactacggcgacacagtgtggctgcggctgcatggcgccccg cagtacgcgctaggcgcgcccggcgccaccttcagcggctacctagtctacgccgacgcc gacgctgacgcgcctgcgcgcgggccgcccgcgccccccgagccgcgctcggccttctcg gcggcgcgcacgcgcagcttggtgggctcggacgctggccccgggccgcggcaccaacca ctcgccttcgacaccgagttcgtcaacattggcggcgacttcgacgcggcggccggcgtg ttccgctgccgtctgcccggcgcctacttcttctccttcacgctgggcaagctgccgcgt aagacgctgtcggttaagctgatgaagaaccgcgacgaggtgcaggccatgatttacgac gacggcgcgtcgcggcgccgcgagatgcagagccagagcgtgatgctggccctgcggcgc ggcgacgccgtctggctgctcagccacgaccacgacggctacggcgcctacagcaaccac ggcaagtacatcaccttctccggcttcctggtgtaccccgacctcgcccccgccgccccg ccgggcctcggggcctcggagctactgtga >gi568815587f:47465623_47671626|GENSCAN_predicted_peptide_7|167_aa MADAASQVLLGSGLTILSQPLMYVKVLIQHYQESDKGEELGPGNVQKEVSSSFDHVIKEF FASMLTYPFVLVSNLMAVNNCGVIFLKYSFEHTFFSSTVSTLSAAASEPGKAAGAPAAPA AARAVSAQPGTRSLQPAARSPQPGARSRAQTEPDSTAAPSQFCKKVN >gi568815587f:47465623_47671626|GENSCAN_predicted_CDS_7|504_bp atggcggacgcggccagtcaggtgctcctgggctccggtctcaccatcctgtcccagccg ctcatgtacgtgaaagtgctcatccagcattaccaggagagtgacaagggtgaggagtta ggacctggaaatgtacagaaagaagtctcatcttcctttgaccacgttatcaaggagttt tttgcgagtatgttgacctatccctttgtgcttgtctccaatcttatggctgtcaacaac tgtggagtaatcttcttaaaatacagctttgagcacaccttcttctcctctactgtaagc accctgtccgctgccgcctcagagccgggaaaagcagccggagcccccgccgcccctgcc gcagcgcgggcggtcagcgcgcagcccggcacccgcagcctgcagcctgcagcccgcagc ccgcagcccggagccagatcgcgggctcagaccgaacccgactcgaccgccgcccccagc cagttttgcaaaaaggtaaactga