GENSCAN 1.0 Date run: 6-Nov-116 Time: 01:33:59 Sequence gi568815587r:47472933_47678074 : 205142 bp : 45.73% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.14 Intr - 299 156 144 0 0 42 76 168 0.946 11.25 1.13 Intr - 2589 2404 186 1 0 61 113 188 0.964 18.26 1.12 Intr - 4024 3914 111 2 0 63 110 41 0.588 4.15 1.11 Intr - 4493 4365 129 0 0 72 115 52 0.990 6.97 1.10 Intr - 6020 5945 76 2 1 58 107 36 0.681 1.59 1.09 Intr - 9924 9763 162 0 0 91 73 111 0.962 10.07 1.08 Intr - 10600 10521 80 2 2 111 75 70 0.999 7.27 1.07 Intr - 11591 11457 135 0 0 53 92 158 0.685 13.24 1.06 Intr - 13866 13818 49 0 1 112 103 45 0.983 6.65 1.05 Intr - 14309 14227 83 0 2 72 78 39 0.953 0.66 1.04 Intr - 16092 15905 188 2 2 34 72 228 0.035 15.13 1.03 Intr - 76727 76669 59 2 2 94 103 17 0.076 1.48 1.02 Intr - 80225 80060 166 1 1 79 105 123 0.187 13.06 1.01 Init - 84886 84879 8 1 2 51 95 0 0.092 -2.57 1.00 Prom - 85510 85471 40 -5.56 2.00 Prom + 90763 90802 40 -5.16 2.01 Init + 92691 92864 174 2 0 102 99 392 0.999 39.05 2.02 Intr + 92974 93054 81 0 0 85 94 170 0.998 17.13 2.03 Intr + 96768 96959 192 2 0 54 81 116 0.981 7.19 2.04 Term + 98539 98697 159 0 0 103 41 173 0.999 12.04 2.05 PlyA + 99027 99032 6 -0.45 3.04 PlyA - 99284 99279 6 1.05 3.03 Term - 100858 99998 861 1 0 62 48 728 0.947 58.93 3.02 Intr - 102767 102661 107 0 2 109 116 18 0.846 6.63 3.01 Init - 105067 104479 589 1 1 73 114 745 0.628 70.99 3.00 Prom - 105408 105369 40 -10.45 4.00 Prom + 105460 105499 40 -6.76 4.01 Init + 106160 106226 67 1 1 88 72 150 0.996 12.64 4.02 Intr + 106337 106402 66 0 0 92 78 34 0.759 1.68 4.03 Intr + 107593 107690 98 2 2 96 90 124 0.995 13.13 4.04 Intr + 107903 108052 150 1 0 98 78 98 0.999 10.16 4.05 Intr + 109156 109281 126 0 0 120 105 233 0.998 29.08 4.06 Intr + 109417 109536 120 0 0 69 75 100 0.990 7.49 4.07 Term + 111382 111549 168 0 0 91 36 243 0.999 17.28 4.08 PlyA + 111609 111614 6 1.05 5.00 Prom + 112308 112347 40 -15.35 5.01 Init + 113837 113921 85 1 1 72 81 101 0.716 6.96 5.02 Intr + 114819 114917 99 1 0 107 35 61 0.804 2.78 5.03 Term + 115072 115502 431 2 2 120 46 294 0.867 23.86 5.04 PlyA + 116243 116248 6 -3.44 6.02 PlyA - 116775 116770 6 1.05 6.01 Sngl - 117878 116889 990 2 0 98 52 2258 0.999 218.07 6.00 Prom - 120004 119965 40 -6.76 7.08 PlyA - 120260 120255 6 -0.45 7.07 Term - 120356 120336 21 2 0 74 54 6 0.166 -5.89 7.06 Intr - 121399 121216 184 0 1 54 94 127 0.109 9.69 7.05 Intr - 126421 126368 54 0 0 109 93 11 0.131 1.79 7.04 Intr - 152809 152742 68 1 2 84 94 44 0.387 2.30 7.03 Intr - 161802 161740 63 0 0 51 94 44 0.347 0.21 7.02 Intr - 162639 162613 27 0 0 110 87 38 0.755 4.21 7.01 Init - 169533 169447 87 0 0 117 80 200 0.999 20.74 7.00 Prom - 181014 180975 40 -4.16 8.04 PlyA - 182598 182593 6 1.05 8.03 Term - 183719 183483 237 2 0 89 54 30 0.049 -4.13 8.02 Intr - 195975 195909 67 2 1 69 99 24 0.246 0.51 8.01 Intr - 204469 204339 131 1 2 59 92 125 0.765 9.49 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 16082 15905 178 2 1 56 72 220 0.959 16.62 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:47472933_47678074|GENSCAN_predicted_peptide_1|526_aa MCLCRKCRHFGVFCSGGSGSGGGTRRLPRDSASAARRRRRLRRRRQQLRQRQAQVQPLPL VTTIISVSTNLTTLDNIISKKMNGTLDHPDQPDLDAIKMFVGQVPRTWSEKDLRELFEQY GAVYEINVLRDRSQNPPQSKGCCFVTFYTRKAALEAQNALHNMKVLPGMHHPIQMKPADS EKNNAVEDRKLFIGMISKKCTENDIRVMFSSFGQIEECRILRGPDGLSRGCAFVTFTTRA MAQTAIKAMHQAQTMEGCSSPMVVKFADTQKDKEQKRMAQQLQQQMQQISAASVWGNLAG LNTLGPQYLALYLQLLQQTASSGNLNTLSSLHPMGGLNAMQLQNLAALAAAASAAQNTPS GTNALTTSSSPLSVLTSSGSSPSSSSSNSVNPIASLGALQTLAGATAGLNVGSLAGMAAL NGGLGSSGLSNGTGSTMEALTQAYSGIQQYAAAALPTLYNQNLLTQQSIGAAGSQKEGPE GANLFIYHLPQEFGDQDLLQMFMPFGNVVSAKVFIDKQTNLSKCFX >gi568815587r:47472933_47678074|GENSCAN_predicted_CDS_1|1578_bp atgtgcctgtgccggaagtgccgccattttggggtgttctgctctggcggcagcggcagc ggcggcgggacgcggaggctcccccgggattcggcctcagcagcgaggcggcggcggcgg ctgcggaggcgcaggcagcaactgaggcagcggcaggctcaggtgcagccgctgcccctg gtaaccaccattatttctgtttctaccaatttgactactttagataacatcatctcaaag aaaatgaacggcaccctggaccacccagaccaaccagatcttgatgctatcaagatgttt gtgggccaggttccaaggacctggtctgaaaaggacttgcgggaactcttcgaacagtat ggtgctgtgtatgaaatcaacgtcctaagggataggagccaaaacccgcctcagagcaaa gggtgctgttttgttacattttacacccgtaaagctgcattagaagctcagaatgctctt cacaacatgaaagtcctcccagggatgcatcaccctatacagatgaaacctgctgacagt gagaagaacaatgcagtggaagacaggaagctgtttattggtatgatttccaagaagtgc actgaaaatgacatccgagtcatgttctcttcgtttggacagattgaagaatgccggata ttgcggggacctgatggcctgagccgaggttgtgcatttgtgacttttacaacaagagcc atggcacagacggctatcaaggcaatgcaccaagcacagaccatggagggttgctcatca cccatggtggtaaaatttgctgatacacagaaggacaaagaacagaagagaatggcccag cagctccagcagcagatgcagcaaatcagcgcagcatctgtgtggggaaaccttgctggt ctaaatactcttggaccccagtatttagcactttatttgcagctccttcagcagactgcc tcctctgggaacctcaacaccctgagcagcctccacccaatgggagggttgaatgcaatg cagttacagaatttggctgcactagctgctgcagctagtgcagctcagaacacaccaagt ggtaccaatgctctcactacatccagcagtcccctcagcgtgctcactagttcagggtcc tcacctagctctagcagcagtaattctgtcaaccccatagcctcacttggagccctgcag acattagctggagcaacggctggcctcaatgttggctctttggcaggaatggctgcttta aatggtggcctgggcagcagtggcctttccaatggcaccgggagcaccatggaggccctc actcaggcctactcgggtatccagcaatatgctgctgctgcgctccccactctgtacaac cagaatcttctgacacagcagagtattggtgctgctggaagccagaaggaaggtccagag ggagccaacctgttcatctaccacctgccccaggagtttggtgatcaggacctgctgcag atgtttatgccctttgggaatgtcgtgtctgccaaggttttcatagacaagcagacaaac ctgagcaagtgttttgnn >gi568815587r:47472933_47678074|GENSCAN_predicted_peptide_2|201_aa MAATALLEAGLARVLFYPTLLYTLFRGKVPGRAHRDWYHRIDPTVLLGALPLRSLTRQLV QDENVRGVITMNEEYETRFLCNSSQEWKRLGVEQLRLSTVDMTGIPTLDNLQKGVQFALK YQSLGQCVYVHCKAGRSRSATMVAAYLIQVHKWSPEEAVRAIAKIRSYIHIRPGQLDVLK EFHKQITARATKDGTFVISKT >gi568815587r:47472933_47678074|GENSCAN_predicted_CDS_2|606_bp atggcggccaccgcgctgctggaggccggcctggcgcgggtgctcttctacccgacgctg ctctacaccctgttccgcgggaaggtgccgggtcgggcgcaccgggactggtaccaccgc atcgaccccaccgtgctgctgggcgcgctgccgttgcggagcttgacgcgccagctggta caggacgagaacgtgcgcggggtgatcaccatgaacgaggagtacgagacgaggttcctg tgcaactcttcacaggagtggaagagactaggagtcgagcagctgcggctcagcacagta gacatgactgggatccccaccttggacaacctccagaagggagtccaatttgctctcaag taccagtcgctgggccagtgtgtttacgtgcattgtaaggctgggcgctccaggagtgcc actatggtggcagcatacctgattcaggtgcacaaatggagtccagaggaggctgtaaga gccatcgccaagatccggtcatacatccacatcaggcctggccagctggatgttcttaaa gagttccacaagcagattactgcacgggcaacaaaggatgggacttttgtcatttcaaag acatga >gi568815587r:47472933_47678074|GENSCAN_predicted_peptide_3|518_aa MESPEEPGASMDENYFVNYTFKDRSHSGRVAQGIMKLCLEEELFADVTISVEGREFQLHR LVLSAQSCFFRSMFTSNLKEAHNRVIVLQDVSESVFQLLVDYIYHGTVKLRAEELQEIYE VSDMYQLTSLFEECSRFLARTVQVGNCLQVMWLADRHSDPELYTAAKHCAKTHLAQLQNT EEFLHLPHRLLTDIISDGVPCSQNPTEAIEAWINFNKEEREAFAESLRTSLKEIGENVHI YLIGKESSRTHSLAVSLHCAEDDSISVSGQNSLCHQITAACKHGGDLYVVGGSIPRRMWK CNNATVDWEWCAPLPRDRLQHTLVSVPGKDAIYSLGGKTLQDTLSNAVIYYRVGDNVWTE TTQLEVAVSGAAGANLNGIIYLLGGEENDLDFFTKPSRLIQCFDTETDKCHVKPYVLPFA GRMHAAVHKDLVFIVAEGDSLVCYNPLLDSFTRLCLPEAWSSAPSLWKIASCNGSIYVFR DRYKKGDANTYKLDPATSAVTVTRGIKVLLTNLQFVLA >gi568815587r:47472933_47678074|GENSCAN_predicted_CDS_3|1557_bp atggaatcaccagaggagcctggagcatccatggatgagaactactttgtgaactacact ttcaaagatcggtcacattcaggccgtgtggctcaaggcatcatgaaactgtgtctagag gaggagctctttgctgatgtcaccatttcggtggaaggccgggagtttcagctccatcgg ctggtcctctcagctcagagctgcttcttccgatccatgttcacttccaacctgaaggag gcccacaaccgggtgattgtgctgcaggatgtcagcgagtctgttttccagctcctggtt gattatatctaccatgggactgtgaaacttcgagctgaggagttgcaggaaatttatgag gtgtcagacatgtatcagctgacatctctctttgaggaatgctctcggtttttggcccgc acagtgcaagtgggaaactgccttcaggtgatgtggctggcagatcggcacagtgatcct gagctctatacggctgccaagcactgtgccaagacccacctggcccagctgcagaataca gaggaatttctccacttgccccaccgcttactcacagatatcatctcggatggagttccg tgttctcagaacccaacagaggcaatagaagcctggatcaactttaataaagaggaaaga gaggcttttgcagagtcactcaggacaagcttgaaggaaattggggagaatgtgcacatt tacctgattgggaaagagtcatctcgtacccactcgttggctgtgtccttgcactgtgca gaagatgactccatcagtgtaagtggccaaaacagtttgtgccaccagatcactgcggcc tgcaagcatggtggagacttgtatgtggtgggagggtccatcccacggcgcatgtggaag tgcaacaatgccaccgttgactgggagtggtgtgctcctttgcctcgggaccggctccag cacaccctggtgtctgtgcccgggaaagatgccatatattcactgggtggcaagacactg caagataccctctccaacgcagtcatttattatcgcgtaggtgataatgtgtggacagag acaactcagctagaggtggctgtgtcaggggctgctggtgccaacctcaacgggatcatc tacttactagggggggaggagaatgatctggacttctttaccaaaccttcccgactcatc cagtgctttgacacagagacagacaaatgccatgtgaagccctatgtgctgccctttgca ggccgcatgcacgcagctgtgcataaagatctggtgttcatcgtggctgaaggggactcc ctggtgtgctacaatcccttgctagacagcttcacccggctttgccttcctgaggcctgg agctctgccccatccctctggaagattgccagctgtaacgggagcatctatgtcttccgg gaccgatataaaaagggggatgccaacacctacaagcttgaccctgccacttcagccgta actgtcacaagaggtattaaggtgctgcttaccaatttgcagtttgtgttggcctaa >gi568815587r:47472933_47678074|GENSCAN_predicted_peptide_4|264_aa MAAAAVARLWWRGILGASALTRGTGRPSVLLLPVRRESAGADTRPTVRPRNDVAHKQLSA FGEYVAEILPKYVQQVQVSCFNELEVCIHPDGVIPVLTFLRDHTNAQFKSLVDLTAVDVP TRQNRFEIVYNLLSLRFNSRIRVKTYTDELTPIESAVSVFKAANWYEREIWDMFGVFFAN HPDLRRILTDYGFEGHPFRKDFPLSGYVELRYDDEVKRVVAEPVELAQEFRKFDLNSPWE AFPVYRQPPESLKLEAGDKKPDAK >gi568815587r:47472933_47678074|GENSCAN_predicted_CDS_4|795_bp atggcggcggcggcggtagccaggctgtggtggcgcgggatcttgggggcctcggcgctg accagggggactgggcgaccctccgttctgttgctgccggtgaggcgggagagcgccggg gccgacacgcgccccactgtcagaccacggaatgatgtggcccacaagcagctctcagct tttggagagtatgtggctgaaatcttgcccaagtatgtccaacaagttcaggtgtcctgc ttcaatgagttagaggtctgtatccatcctgatggcgtcatcccagtgctgactttcctc agggatcacaccaatgcacagttcaaatctctggttgacttgacagcagtggacgtccca actcggcaaaaccgttttgagattgtctacaacctgttgtctctgcgcttcaactcacgg atccgtgtgaagacctacacagatgagctgacgcccattgagtctgctgtctctgtgttc aaggcagccaactggtatgaaagggagatctgggacatgtttggagtcttctttgctaac caccctgatctaagaaggatcctgacagattatggcttcgagggacatcctttccggaaa gactttcctctatctggctatgttgagttacgttatgatgatgaagtgaagcgggtggtg gcagagccggtggagttggcccaagagttccgcaaatttgacctgaacagcccctgggag gctttcccagtctatcgccaacccccggagagtctcaagcttgaagccggagacaagaag cctgatgccaagtag >gi568815587r:47472933_47678074|GENSCAN_predicted_peptide_5|204_aa MAATLQFLVCLVVAICLLSGVTTTQPHAGQPMDSTSVGGGLQEPEAPEVMFEVFPPSLGH GGHSPPLTWLPLQLLWAGLELDVMGQLHIQDEELASTHPGRRLRLLLQHHVPSDLEGTEQ WLQQLQDLRKGPPLSTWDFEHLLLTGLSCVYRLHAASEAEERGRWAQVFALLAQETLWDL CKGFCPQDRPPSLGSWASILDPFP >gi568815587r:47472933_47678074|GENSCAN_predicted_CDS_5|615_bp atggctgcgaccctgcagttcctggtttgcctggtggtagccatttgtctcctctctggt gtgactacaacccagccccatgcagggcagcccatggacagcaccagcgtgggaggtggc ctgcaggagccagaggccccggaagtgatgtttgaggtctttcctcccagcctgggacat gggggccacagcccaccccttacatggctccccttgcagctgctctgggctgggctggag ctggatgtcatggggcagctgcacatccaggatgaggaactagcgtccacacacccaggc cgccgactcagactcctcctgcagcaccacgtgcccagtgacttggagggcactgagcag tggctgcagcagctccaggacctgcggaaggggcctcctcttagcacttgggactttgaa catctgctcctcacaggcctgtcctgcgtctaccggctccacgcagctagtgaggctgag gaacggggccgctgggcccaggtcttcgctctcctggcacaggaaacactctgggacctg tgcaaaggtttctgcccccaggaccggcccccttccctggggtcctgggcctccatcctt gaccccttcccctga >gi568815587r:47472933_47678074|GENSCAN_predicted_peptide_6|329_aa MLPLLLGLLGPAACWALGPTPGPGSSELRSAFSAARTTPLEGTSEMAVTFDKVYVNIGGD FDVATGQFRCRVPGAYFFSFTAGKAPHKSLSVMLVRNRDEVQALAFDEQRRPGARRAASQ SAMLQLDYGDTVWLRLHGAPQYALGAPGATFSGYLVYADADADAPARGPPAPPEPRSAFS AARTRSLVGSDAGPGPRHQPLAFDTEFVNIGGDFDAAAGVFRCRLPGAYFFSFTLGKLPR KTLSVKLMKNRDEVQAMIYDDGASRRREMQSQSVMLALRRGDAVWLLSHDHDGYGAYSNH GKYITFSGFLVYPDLAPAAPPGLGASELL >gi568815587r:47472933_47678074|GENSCAN_predicted_CDS_6|990_bp atgctgccgcttctgctgggcctgctgggcccagcggcctgctgggccctgggcccgacc cccggcccgggatcctctgagctgcgctcggccttctcggcggcacgcaccacccccctg gagggcacgtcggagatggcggtgaccttcgacaaggtgtacgtgaacatcgggggcgac ttcgatgtggccaccggccagtttcgctgccgcgtgcccggcgcctacttcttctccttc acggctggcaaggccccgcacaagagcctgtcggtgatgctggtgcgaaaccgcgacgag gtgcaggcgctggccttcgacgagcagcggcggccaggcgcgcggcgcgcagccagccag agcgccatgctgcagctcgactacggcgacacagtgtggctgcggctgcatggcgccccg cagtacgcgctaggcgcgcccggcgccaccttcagcggctacctagtctacgccgacgcc gacgctgacgcgcctgcgcgcgggccgcccgcgccccccgagccgcgctcggccttctcg gcggcgcgcacgcgcagcttggtgggctcggacgctggccccgggccgcggcaccaacca ctcgccttcgacaccgagttcgtcaacattggcggcgacttcgacgcggcggccggcgtg ttccgctgccgtctgcccggcgcctacttcttctccttcacgctgggcaagctgccgcgt aagacgctgtcggttaagctgatgaagaaccgcgacgaggtgcaggccatgatttacgac gacggcgcgtcgcggcgccgcgagatgcagagccagagcgtgatgctggccctgcggcgc ggcgacgccgtctggctgctcagccacgaccacgacggctacggcgcctacagcaaccac ggcaagtacatcaccttctccggcttcctggtgtaccccgacctcgcccccgccgccccg ccgggcctcggggcctcggagctactgtga >gi568815587r:47472933_47678074|GENSCAN_predicted_peptide_7|167_aa MADAASQVLLGSGLTILSQPLMYVKVLIQHYQESDKGEELGPGNVQKEVSSSFDHVIKEF FASMLTYPFVLVSNLMAVNNCGVIFLKYSFEHTFFSSTVSTLSAAASEPGKAAGAPAAPA AARAVSAQPGTRSLQPAARSPQPGARSRAQTEPDSTAAPSQFCKKVN >gi568815587r:47472933_47678074|GENSCAN_predicted_CDS_7|504_bp atggcggacgcggccagtcaggtgctcctgggctccggtctcaccatcctgtcccagccg ctcatgtacgtgaaagtgctcatccagcattaccaggagagtgacaagggtgaggagtta ggacctggaaatgtacagaaagaagtctcatcttcctttgaccacgttatcaaggagttt tttgcgagtatgttgacctatccctttgtgcttgtctccaatcttatggctgtcaacaac tgtggagtaatcttcttaaaatacagctttgagcacaccttcttctcctctactgtaagc accctgtccgctgccgcctcagagccgggaaaagcagccggagcccccgccgcccctgcc gcagcgcgggcggtcagcgcgcagcccggcacccgcagcctgcagcctgcagcccgcagc ccgcagcccggagccagatcgcgggctcagaccgaacccgactcgaccgccgcccccagc cagttttgcaaaaaggtaaactga >gi568815587r:47472933_47678074|GENSCAN_predicted_peptide_8|144_aa FTQCLAELKELLRQEIHKKFHELGQDVDLEGSWSDISLSDIESSTSGSDSSLSDGLPVHL ANIADEAAKMASGKYAIKWSWASTQVSPSALEIYLLLLKLKFPHLSPSVAISHLHFKSLA PSLLPPSSEQTAILVKTGSEARLP >gi568815587r:47472933_47678074|GENSCAN_predicted_CDS_8|435_bp ttcactcagtgtctagcagagcttaaggagcttttacgacaggaaatccacaagaaattc catgaacttggacaagatgtagatttagaaggaagttggagtgacatctctttgtctgac attgaatccagcaccagtggctctgacagttctctctcagatggtcttcctgttcaccta gcaaacatagcagatgaggctgccaagatggcttcgggaaaatatgccatcaagtggtcc tgggccagcacacaagtatctccttcagccttggaaatttatcttctgcttctgaaactg aagttcccgcacctttccccctccgtggctatttcacacctccatttcaagtctcttgct ccctctctgctgccaccttcctccgagcaaacagccatactagtcaagactggttcagaa gctaggctcccgtga