GENSCAN 1.0	Date run: 8-Nov-116	Time: 11:10:55

Sequence gi568815587f:47479092_47684478 : 205387 bp : 45.74% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.09 Intr -   3765   3604  162  0  0   91   73   111 0.959  10.07
 1.08 Intr -   4441   4362   80  2  2  111   75    70 0.999   7.27
 1.07 Intr -   5432   5298  135  0  0   53   92   158 0.685  13.24
 1.06 Intr -   7707   7659   49  0  1  112  103    45 0.983   6.65
 1.05 Intr -   8150   8068   83  0  2   72   78    39 0.953   0.66
 1.04 Intr -   9933   9746  188  2  2   34   72   228 0.035  15.13
 1.03 Intr -  70568  70510   59  2  2   94  103    17 0.076   1.48
 1.02 Intr -  74066  73901  166  1  1   79  105   123 0.187  13.06
 1.01 Init -  78727  78720    8  1  2   51   95     0 0.092  -2.57
 1.00 Prom -  79351  79312   40                              -5.56

 2.00 Prom +  84604  84643   40                              -5.16
 2.01 Init +  86532  86705  174  2  0  102   99   392 0.999  39.05
 2.02 Intr +  86815  86895   81  0  0   85   94   170 0.998  17.13
 2.03 Intr +  90609  90800  192  2  0   54   81   116 0.981   7.19
 2.04 Term +  92380  92538  159  0  0  103   41   173 0.999  12.04
 2.05 PlyA +  92868  92873    6                              -0.45

 3.04 PlyA -  93125  93120    6                               1.05
 3.03 Term -  94699  93839  861  1  0   62   48   728 0.947  58.93
 3.02 Intr -  96608  96502  107  0  2  109  116    18 0.846   6.63
 3.01 Init -  98908  98320  589  1  1   73  114   745 0.628  70.99
 3.00 Prom -  99249  99210   40                             -10.45

 4.00 Prom +  99301  99340   40                              -6.76
 4.01 Init + 100001 100067   67  1  1   88   72   150 0.996  12.64
 4.02 Intr + 100178 100243   66  0  0   92   78    34 0.759   1.68
 4.03 Intr + 101434 101531   98  2  2   96   90   124 0.995  13.13
 4.04 Intr + 101744 101893  150  1  0   98   78    98 0.999  10.16
 4.05 Intr + 102997 103122  126  0  0  120  105   233 0.998  29.08
 4.06 Intr + 103258 103377  120  0  0   69   75   100 0.990   7.49
 4.07 Term + 105223 105390  168  0  0   91   36   243 0.999  17.28
 4.08 PlyA + 105450 105455    6                               1.05

 5.00 Prom + 106149 106188   40                             -15.35
 5.01 Init + 107678 107762   85  1  1   72   81   101 0.716   6.96
 5.02 Intr + 108660 108758   99  1  0  107   35    61 0.804   2.78
 5.03 Term + 108913 109343  431  2  2  120   46   294 0.867  23.86
 5.04 PlyA + 110084 110089    6                              -3.44

 6.02 PlyA - 110616 110611    6                               1.05
 6.01 Sngl - 111719 110730  990  2  0   98   52  2258 0.999 218.07
 6.00 Prom - 113845 113806   40                              -6.76

 7.08 PlyA - 114101 114096    6                              -0.45
 7.07 Term - 114197 114177   21  2  0   74   54     6 0.166  -5.89
 7.06 Intr - 115240 115057  184  0  1   54   94   127 0.109   9.69
 7.05 Intr - 120262 120209   54  0  0  109   93    11 0.131   1.79
 7.04 Intr - 146650 146583   68  1  2   84   94    44 0.387   2.30
 7.03 Intr - 155643 155581   63  0  0   51   94    44 0.347   0.21
 7.02 Intr - 156480 156454   27  0  0  110   87    38 0.755   4.21
 7.01 Init - 163374 163288   87  0  0  117   80   200 0.999  20.74
 7.00 Prom - 174855 174816   40                              -4.16

 8.05 PlyA - 176439 176434    6                               1.05
 8.04 Term - 177560 177324  237  2  0   89   54    30 0.050  -4.13
 8.03 Intr - 189816 189750   67  2  1   69   99    24 0.254   0.51
 8.02 Intr - 198310 198180  131  1  2   59   92   125 0.846   9.49
 8.01 Intr - 200982 200882  101  1  2  101   93    64 0.993   8.03

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -   9923   9746  178  2  1   56   72   220 0.959  16.62

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587f:47479092_47684478|GENSCAN_predicted_peptide_1|310_aa
MCLCRKCRHFGVFCSGGSGSGGGTRRLPRDSASAARRRRRLRRRRQQLRQRQAQVQPLPL
VTTIISVSTNLTTLDNIISKKMNGTLDHPDQPDLDAIKMFVGQVPRTWSEKDLRELFEQY
GAVYEINVLRDRSQNPPQSKGCCFVTFYTRKAALEAQNALHNMKVLPGMHHPIQMKPADS
EKNNAVEDRKLFIGMISKKCTENDIRVMFSSFGQIEECRILRGPDGLSRGCAFVTFTTRA
MAQTAIKAMHQAQTMEGCSSPMVVKFADTQKDKEQKRMAQQLQQQMQQISAASVWGNLAG
LNTLGPQYLA

>gi568815587f:47479092_47684478|GENSCAN_predicted_CDS_1|930_bp
atgtgcctgtgccggaagtgccgccattttggggtgttctgctctggcggcagcggcagc
ggcggcgggacgcggaggctcccccgggattcggcctcagcagcgaggcggcggcggcgg
ctgcggaggcgcaggcagcaactgaggcagcggcaggctcaggtgcagccgctgcccctg
gtaaccaccattatttctgtttctaccaatttgactactttagataacatcatctcaaag
aaaatgaacggcaccctggaccacccagaccaaccagatcttgatgctatcaagatgttt
gtgggccaggttccaaggacctggtctgaaaaggacttgcgggaactcttcgaacagtat
ggtgctgtgtatgaaatcaacgtcctaagggataggagccaaaacccgcctcagagcaaa
gggtgctgttttgttacattttacacccgtaaagctgcattagaagctcagaatgctctt
cacaacatgaaagtcctcccagggatgcatcaccctatacagatgaaacctgctgacagt
gagaagaacaatgcagtggaagacaggaagctgtttattggtatgatttccaagaagtgc
actgaaaatgacatccgagtcatgttctcttcgtttggacagattgaagaatgccggata
ttgcggggacctgatggcctgagccgaggttgtgcatttgtgacttttacaacaagagcc
atggcacagacggctatcaaggcaatgcaccaagcacagaccatggagggttgctcatca
cccatggtggtaaaatttgctgatacacagaaggacaaagaacagaagagaatggcccag
cagctccagcagcagatgcagcaaatcagcgcagcatctgtgtggggaaaccttgctggt
ctaaatactcttggaccccagtatttagca

>gi568815587f:47479092_47684478|GENSCAN_predicted_peptide_2|201_aa
MAATALLEAGLARVLFYPTLLYTLFRGKVPGRAHRDWYHRIDPTVLLGALPLRSLTRQLV
QDENVRGVITMNEEYETRFLCNSSQEWKRLGVEQLRLSTVDMTGIPTLDNLQKGVQFALK
YQSLGQCVYVHCKAGRSRSATMVAAYLIQVHKWSPEEAVRAIAKIRSYIHIRPGQLDVLK
EFHKQITARATKDGTFVISKT

>gi568815587f:47479092_47684478|GENSCAN_predicted_CDS_2|606_bp
atggcggccaccgcgctgctggaggccggcctggcgcgggtgctcttctacccgacgctg
ctctacaccctgttccgcgggaaggtgccgggtcgggcgcaccgggactggtaccaccgc
atcgaccccaccgtgctgctgggcgcgctgccgttgcggagcttgacgcgccagctggta
caggacgagaacgtgcgcggggtgatcaccatgaacgaggagtacgagacgaggttcctg
tgcaactcttcacaggagtggaagagactaggagtcgagcagctgcggctcagcacagta
gacatgactgggatccccaccttggacaacctccagaagggagtccaatttgctctcaag
taccagtcgctgggccagtgtgtttacgtgcattgtaaggctgggcgctccaggagtgcc
actatggtggcagcatacctgattcaggtgcacaaatggagtccagaggaggctgtaaga
gccatcgccaagatccggtcatacatccacatcaggcctggccagctggatgttcttaaa
gagttccacaagcagattactgcacgggcaacaaaggatgggacttttgtcatttcaaag
acatga

>gi568815587f:47479092_47684478|GENSCAN_predicted_peptide_3|518_aa
MESPEEPGASMDENYFVNYTFKDRSHSGRVAQGIMKLCLEEELFADVTISVEGREFQLHR
LVLSAQSCFFRSMFTSNLKEAHNRVIVLQDVSESVFQLLVDYIYHGTVKLRAEELQEIYE
VSDMYQLTSLFEECSRFLARTVQVGNCLQVMWLADRHSDPELYTAAKHCAKTHLAQLQNT
EEFLHLPHRLLTDIISDGVPCSQNPTEAIEAWINFNKEEREAFAESLRTSLKEIGENVHI
YLIGKESSRTHSLAVSLHCAEDDSISVSGQNSLCHQITAACKHGGDLYVVGGSIPRRMWK
CNNATVDWEWCAPLPRDRLQHTLVSVPGKDAIYSLGGKTLQDTLSNAVIYYRVGDNVWTE
TTQLEVAVSGAAGANLNGIIYLLGGEENDLDFFTKPSRLIQCFDTETDKCHVKPYVLPFA
GRMHAAVHKDLVFIVAEGDSLVCYNPLLDSFTRLCLPEAWSSAPSLWKIASCNGSIYVFR
DRYKKGDANTYKLDPATSAVTVTRGIKVLLTNLQFVLA

>gi568815587f:47479092_47684478|GENSCAN_predicted_CDS_3|1557_bp
atggaatcaccagaggagcctggagcatccatggatgagaactactttgtgaactacact
ttcaaagatcggtcacattcaggccgtgtggctcaaggcatcatgaaactgtgtctagag
gaggagctctttgctgatgtcaccatttcggtggaaggccgggagtttcagctccatcgg
ctggtcctctcagctcagagctgcttcttccgatccatgttcacttccaacctgaaggag
gcccacaaccgggtgattgtgctgcaggatgtcagcgagtctgttttccagctcctggtt
gattatatctaccatgggactgtgaaacttcgagctgaggagttgcaggaaatttatgag
gtgtcagacatgtatcagctgacatctctctttgaggaatgctctcggtttttggcccgc
acagtgcaagtgggaaactgccttcaggtgatgtggctggcagatcggcacagtgatcct
gagctctatacggctgccaagcactgtgccaagacccacctggcccagctgcagaataca
gaggaatttctccacttgccccaccgcttactcacagatatcatctcggatggagttccg
tgttctcagaacccaacagaggcaatagaagcctggatcaactttaataaagaggaaaga
gaggcttttgcagagtcactcaggacaagcttgaaggaaattggggagaatgtgcacatt
tacctgattgggaaagagtcatctcgtacccactcgttggctgtgtccttgcactgtgca
gaagatgactccatcagtgtaagtggccaaaacagtttgtgccaccagatcactgcggcc
tgcaagcatggtggagacttgtatgtggtgggagggtccatcccacggcgcatgtggaag
tgcaacaatgccaccgttgactgggagtggtgtgctcctttgcctcgggaccggctccag
cacaccctggtgtctgtgcccgggaaagatgccatatattcactgggtggcaagacactg
caagataccctctccaacgcagtcatttattatcgcgtaggtgataatgtgtggacagag
acaactcagctagaggtggctgtgtcaggggctgctggtgccaacctcaacgggatcatc
tacttactagggggggaggagaatgatctggacttctttaccaaaccttcccgactcatc
cagtgctttgacacagagacagacaaatgccatgtgaagccctatgtgctgccctttgca
ggccgcatgcacgcagctgtgcataaagatctggtgttcatcgtggctgaaggggactcc
ctggtgtgctacaatcccttgctagacagcttcacccggctttgccttcctgaggcctgg
agctctgccccatccctctggaagattgccagctgtaacgggagcatctatgtcttccgg
gaccgatataaaaagggggatgccaacacctacaagcttgaccctgccacttcagccgta
actgtcacaagaggtattaaggtgctgcttaccaatttgcagtttgtgttggcctaa

>gi568815587f:47479092_47684478|GENSCAN_predicted_peptide_4|264_aa
MAAAAVARLWWRGILGASALTRGTGRPSVLLLPVRRESAGADTRPTVRPRNDVAHKQLSA
FGEYVAEILPKYVQQVQVSCFNELEVCIHPDGVIPVLTFLRDHTNAQFKSLVDLTAVDVP
TRQNRFEIVYNLLSLRFNSRIRVKTYTDELTPIESAVSVFKAANWYEREIWDMFGVFFAN
HPDLRRILTDYGFEGHPFRKDFPLSGYVELRYDDEVKRVVAEPVELAQEFRKFDLNSPWE
AFPVYRQPPESLKLEAGDKKPDAK

>gi568815587f:47479092_47684478|GENSCAN_predicted_CDS_4|795_bp
atggcggcggcggcggtagccaggctgtggtggcgcgggatcttgggggcctcggcgctg
accagggggactgggcgaccctccgttctgttgctgccggtgaggcgggagagcgccggg
gccgacacgcgccccactgtcagaccacggaatgatgtggcccacaagcagctctcagct
tttggagagtatgtggctgaaatcttgcccaagtatgtccaacaagttcaggtgtcctgc
ttcaatgagttagaggtctgtatccatcctgatggcgtcatcccagtgctgactttcctc
agggatcacaccaatgcacagttcaaatctctggttgacttgacagcagtggacgtccca
actcggcaaaaccgttttgagattgtctacaacctgttgtctctgcgcttcaactcacgg
atccgtgtgaagacctacacagatgagctgacgcccattgagtctgctgtctctgtgttc
aaggcagccaactggtatgaaagggagatctgggacatgtttggagtcttctttgctaac
caccctgatctaagaaggatcctgacagattatggcttcgagggacatcctttccggaaa
gactttcctctatctggctatgttgagttacgttatgatgatgaagtgaagcgggtggtg
gcagagccggtggagttggcccaagagttccgcaaatttgacctgaacagcccctgggag
gctttcccagtctatcgccaacccccggagagtctcaagcttgaagccggagacaagaag
cctgatgccaagtag

>gi568815587f:47479092_47684478|GENSCAN_predicted_peptide_5|204_aa
MAATLQFLVCLVVAICLLSGVTTTQPHAGQPMDSTSVGGGLQEPEAPEVMFEVFPPSLGH
GGHSPPLTWLPLQLLWAGLELDVMGQLHIQDEELASTHPGRRLRLLLQHHVPSDLEGTEQ
WLQQLQDLRKGPPLSTWDFEHLLLTGLSCVYRLHAASEAEERGRWAQVFALLAQETLWDL
CKGFCPQDRPPSLGSWASILDPFP

>gi568815587f:47479092_47684478|GENSCAN_predicted_CDS_5|615_bp
atggctgcgaccctgcagttcctggtttgcctggtggtagccatttgtctcctctctggt
gtgactacaacccagccccatgcagggcagcccatggacagcaccagcgtgggaggtggc
ctgcaggagccagaggccccggaagtgatgtttgaggtctttcctcccagcctgggacat
gggggccacagcccaccccttacatggctccccttgcagctgctctgggctgggctggag
ctggatgtcatggggcagctgcacatccaggatgaggaactagcgtccacacacccaggc
cgccgactcagactcctcctgcagcaccacgtgcccagtgacttggagggcactgagcag
tggctgcagcagctccaggacctgcggaaggggcctcctcttagcacttgggactttgaa
catctgctcctcacaggcctgtcctgcgtctaccggctccacgcagctagtgaggctgag
gaacggggccgctgggcccaggtcttcgctctcctggcacaggaaacactctgggacctg
tgcaaaggtttctgcccccaggaccggcccccttccctggggtcctgggcctccatcctt
gaccccttcccctga

>gi568815587f:47479092_47684478|GENSCAN_predicted_peptide_6|329_aa
MLPLLLGLLGPAACWALGPTPGPGSSELRSAFSAARTTPLEGTSEMAVTFDKVYVNIGGD
FDVATGQFRCRVPGAYFFSFTAGKAPHKSLSVMLVRNRDEVQALAFDEQRRPGARRAASQ
SAMLQLDYGDTVWLRLHGAPQYALGAPGATFSGYLVYADADADAPARGPPAPPEPRSAFS
AARTRSLVGSDAGPGPRHQPLAFDTEFVNIGGDFDAAAGVFRCRLPGAYFFSFTLGKLPR
KTLSVKLMKNRDEVQAMIYDDGASRRREMQSQSVMLALRRGDAVWLLSHDHDGYGAYSNH
GKYITFSGFLVYPDLAPAAPPGLGASELL

>gi568815587f:47479092_47684478|GENSCAN_predicted_CDS_6|990_bp
atgctgccgcttctgctgggcctgctgggcccagcggcctgctgggccctgggcccgacc
cccggcccgggatcctctgagctgcgctcggccttctcggcggcacgcaccacccccctg
gagggcacgtcggagatggcggtgaccttcgacaaggtgtacgtgaacatcgggggcgac
ttcgatgtggccaccggccagtttcgctgccgcgtgcccggcgcctacttcttctccttc
acggctggcaaggccccgcacaagagcctgtcggtgatgctggtgcgaaaccgcgacgag
gtgcaggcgctggccttcgacgagcagcggcggccaggcgcgcggcgcgcagccagccag
agcgccatgctgcagctcgactacggcgacacagtgtggctgcggctgcatggcgccccg
cagtacgcgctaggcgcgcccggcgccaccttcagcggctacctagtctacgccgacgcc
gacgctgacgcgcctgcgcgcgggccgcccgcgccccccgagccgcgctcggccttctcg
gcggcgcgcacgcgcagcttggtgggctcggacgctggccccgggccgcggcaccaacca
ctcgccttcgacaccgagttcgtcaacattggcggcgacttcgacgcggcggccggcgtg
ttccgctgccgtctgcccggcgcctacttcttctccttcacgctgggcaagctgccgcgt
aagacgctgtcggttaagctgatgaagaaccgcgacgaggtgcaggccatgatttacgac
gacggcgcgtcgcggcgccgcgagatgcagagccagagcgtgatgctggccctgcggcgc
ggcgacgccgtctggctgctcagccacgaccacgacggctacggcgcctacagcaaccac
ggcaagtacatcaccttctccggcttcctggtgtaccccgacctcgcccccgccgccccg
ccgggcctcggggcctcggagctactgtga

>gi568815587f:47479092_47684478|GENSCAN_predicted_peptide_7|167_aa
MADAASQVLLGSGLTILSQPLMYVKVLIQHYQESDKGEELGPGNVQKEVSSSFDHVIKEF
FASMLTYPFVLVSNLMAVNNCGVIFLKYSFEHTFFSSTVSTLSAAASEPGKAAGAPAAPA
AARAVSAQPGTRSLQPAARSPQPGARSRAQTEPDSTAAPSQFCKKVN

>gi568815587f:47479092_47684478|GENSCAN_predicted_CDS_7|504_bp
atggcggacgcggccagtcaggtgctcctgggctccggtctcaccatcctgtcccagccg
ctcatgtacgtgaaagtgctcatccagcattaccaggagagtgacaagggtgaggagtta
ggacctggaaatgtacagaaagaagtctcatcttcctttgaccacgttatcaaggagttt
tttgcgagtatgttgacctatccctttgtgcttgtctccaatcttatggctgtcaacaac
tgtggagtaatcttcttaaaatacagctttgagcacaccttcttctcctctactgtaagc
accctgtccgctgccgcctcagagccgggaaaagcagccggagcccccgccgcccctgcc
gcagcgcgggcggtcagcgcgcagcccggcacccgcagcctgcagcctgcagcccgcagc
ccgcagcccggagccagatcgcgggctcagaccgaacccgactcgaccgccgcccccagc
cagttttgcaaaaaggtaaactga

>gi568815587f:47479092_47684478|GENSCAN_predicted_peptide_8|178_aa
XNKRDTHFTIEDLKSLGYHVCDTLLDFCDPDQMKFTQCLAELKELLRQEIHKKFHELGQD
VDLEGSWSDISLSDIESSTSGSDSSLSDGLPVHLANIADEAAKMASGKYAIKWSWASTQV
SPSALEIYLLLLKLKFPHLSPSVAISHLHFKSLAPSLLPPSSEQTAILVKTGSEARLP

>gi568815587f:47479092_47684478|GENSCAN_predicted_CDS_8|537_bp
ngtaataaaagagacacccactttaccatcgaagatctgaagtccttaggttatcatgtc
tgtgacacccttctggacttttgtgatcctgaccaaatgaagttcactcagtgtctagca
gagcttaaggagcttttacgacaggaaatccacaagaaattccatgaacttggacaagat
gtagatttagaaggaagttggagtgacatctctttgtctgacattgaatccagcaccagt
ggctctgacagttctctctcagatggtcttcctgttcacctagcaaacatagcagatgag
gctgccaagatggcttcgggaaaatatgccatcaagtggtcctgggccagcacacaagta
tctccttcagccttggaaatttatcttctgcttctgaaactgaagttcccgcacctttcc
ccctccgtggctatttcacacctccatttcaagtctcttgctccctctctgctgccacct
tcctccgagcaaacagccatactagtcaagactggttcagaagctaggctcccgtga