GENSCAN 1.0 Date run: 4-Nov-116 Time: 16:40:45 Sequence gi568815581f:15845249_16075339 : 230091 bp : 43.84% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 15681 15757 77 2 2 76 69 43 0.620 1.76 1.02 Intr + 18807 18909 103 0 1 89 91 94 0.661 9.98 1.03 Intr + 30877 30904 28 0 1 100 93 9 0.046 0.29 1.04 Term + 38148 38278 131 1 2 95 45 50 0.096 -0.26 1.05 PlyA + 40126 40131 6 1.05 2.04 PlyA - 40536 40531 6 1.05 2.03 Term - 49186 48947 240 1 0 68 44 104 0.879 0.03 2.02 Intr - 51527 51441 87 2 0 141 89 43 0.925 9.87 2.01 Init - 67878 67765 114 0 0 77 96 51 0.726 5.01 2.00 Prom - 69129 69090 40 -5.36 3.00 Prom + 70515 70554 40 -8.16 3.01 Init + 70865 70931 67 1 1 74 91 26 0.292 2.73 3.02 Intr + 82449 82627 179 1 2 5 113 119 0.465 5.74 3.03 Intr + 83261 83319 59 1 2 72 80 68 0.074 2.08 3.04 Intr + 97737 97831 95 0 2 78 60 84 0.104 4.21 3.05 Intr + 99972 100335 364 1 1 39 94 794 0.142 69.34 3.06 Intr + 101754 101884 131 0 2 59 64 50 0.429 0.04 3.07 Term + 121653 121774 122 1 2 80 55 134 0.554 7.94 3.08 PlyA + 124207 124212 6 1.05 4.00 Prom + 125225 125264 40 -6.86 4.01 Init + 126945 127050 106 2 1 48 103 16 0.332 -0.51 4.02 Intr + 128327 128485 159 0 0 55 53 108 0.436 3.96 4.03 Term + 129388 130094 707 2 2 15 39 468 0.405 28.28 4.04 PlyA + 130471 130476 6 1.05 5.00 Prom + 150413 150452 40 -3.96 5.01 Init + 154601 154784 184 1 1 62 94 297 0.970 24.98 5.02 Intr + 154870 155114 245 2 2 68 65 242 0.665 17.12 5.03 Intr + 156667 156777 111 0 0 96 21 105 0.774 5.18 5.04 Intr + 158583 158639 57 2 0 72 110 15 0.555 1.18 5.05 Intr + 158953 159014 62 0 2 123 111 49 0.997 8.23 5.06 Intr + 161226 161320 95 0 2 75 91 83 0.846 6.91 5.07 Intr + 179769 179923 155 1 2 72 110 112 0.884 11.59 5.08 Intr + 181292 181454 163 1 1 53 92 80 0.982 4.45 5.09 Term + 182126 182274 149 0 2 75 35 139 0.956 5.36 5.10 PlyA + 182881 182886 6 1.05 6.13 PlyA - 183929 183924 6 1.05 6.12 Term - 187235 187048 188 0 2 74 43 165 0.996 8.25 6.11 Intr - 194406 194227 180 1 0 68 69 107 0.038 6.64 6.10 Intr - 199635 199417 219 1 0 85 42 152 0.026 8.57 6.09 Intr - 201845 201703 143 1 2 78 106 25 0.093 3.30 6.08 Intr - 203670 203597 74 0 2 94 107 25 0.129 3.20 6.07 Intr - 212816 212659 158 0 2 89 75 60 0.180 4.53 6.06 Intr - 216646 216153 494 0 2 82 34 292 0.917 15.84 6.05 Intr - 217022 216857 166 0 1 112 29 31 0.635 -1.38 6.04 Intr - 218939 218820 120 0 0 74 82 211 0.770 19.57 6.03 Intr - 220423 220237 187 1 1 60 85 33 0.428 -0.54 6.02 Intr - 225277 225077 201 1 0 129 91 190 0.749 22.88 6.01 Intr - 226417 226161 257 2 2 62 93 183 0.994 13.36 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 199563 199709 147 2 0 11 42 198 0.843 5.40 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:15845249_16075339|GENSCAN_predicted_peptide_1|112_aa MRLLSTSTRGKVEVAPRKREPRSSFSVERRNPLVALAREYGGFKCNAVLKWYQKKTEGYM YLPENGFIEGQEYGSNSTGYRVAKGFQAMVAGGLSARTVVLEERRKPTTPEL >gi568815581f:15845249_16075339|GENSCAN_predicted_CDS_1|339_bp atgagactgctgagcacctccaccagagggaaagtggaagtggccccaagaaagagggaa ccacgaagcagtttcagtgtggaaagaagaaaccctctggtggccttggcccgggaatat ggcggtttcaagtgcaatgctgtactgaaatggtaccagaagaagacagaaggttacatg tatctccctgagaatgggttcatagaaggccaggaatatgggtctaactccactggctac agagttgcaaagggttttcaagccatggtggcaggtggtctctcagccagaactgtggtt ctggaggagaggaggaaacccacaactcctgaattatga >gi568815581f:15845249_16075339|GENSCAN_predicted_peptide_2|146_aa MGFWAAETIATDSATTDASGGTLGASHLTPHLIRPLSRVEVMSKVWQLKGTSHTGTSMGT KANRLKMLLLRPGSALLPPPPSDTDFDITATVSPHLNVIPVTDSHSTEFLSGLFDDLSLF INNFTCYKVYRLLVCVVQEGMTVQSS >gi568815581f:15845249_16075339|GENSCAN_predicted_CDS_2|441_bp atgggcttctgggcagcagagaccattgccacagacagtgcaacaacagatgcctcaggg ggcacgttgggagcctctcacctgacacctcatttaatacgaccactttctagggtggaa gtgatgtctaaagtttggcagctaaaaggaaccagccacacaggaacaagtatggggaca aaagccaacagattaaagatgctactccttcgccctggctctgcactgctgcctcctcca ccttctgacaccgattttgacatcactgccactgtctcccctcatctaaatgtgatccct gtcactgactctcacagcactgagtttctctctgggctctttgacgacttatccttgttc attaataactttacttgttataaagtctaccgcttgcttgtctgtgtagttcaggagggc atgacagttcagagttcatga >gi568815581f:15845249_16075339|GENSCAN_predicted_peptide_3|338_aa MAACLRTGEDSCSPELSPSKGSVVQVRDEGTQPSKGRADGAAVENAGRRGIRDAPQCLLL LTWRRLGSRVRRKVAFNIQVEKPVQLYKGMGLDVSQTRLGDRKCYWRSCLISLAFCVAAE VCVNCCGNVIGGAGGARGPAGPAMLLETQDALYVALELVIAALSVAGNVLVCAAVGTANT LQTPTNYFLVSLAAADVAVGLFAIPFAITISLGFCTDFYGCLFLACFVLVLTQSSIFSLL AVAVDRYLAICVPLRLFRGSQQEQEPPAWEGCEPYVGVWPLHVLLPVLLDQEALCLPSVL SDLCSRAVTATGWRTEGRKDVAHALQECFGEMDKRMQT >gi568815581f:15845249_16075339|GENSCAN_predicted_CDS_3|1017_bp atggctgcctgcttacgaacaggcgaagactcttgcagcccagagctgtctccctccaaa ggcagtgtagttcaggtgagagatgaggggactcagcccagcaagggcagagcagatgga gcggctgtggagaatgcaggaaggagaggaatccgggacgcccctcagtgcctgctgctg ctgacatggagaagactcggatcaagagtcaggcgcaaggtggctttcaacatccaagtg gagaagccggtgcagctgtacaagggcatgggactggatgtcagccagacaaggctgggg gacaggaagtgctattggaggagctgcctgatctctctggccttctgtgtggcagcagaa gtttgtgtcaactgctgtgggaacgtgattggtggagcagggggcgcccggggcccagct ggcccggccatgctgctggagacacaggacgcgctgtacgtggcgctggagctggtcatc gccgcgctttcggtggcgggcaacgtgctggtgtgcgccgcggtgggcacggcgaacact ctgcagacgcccaccaactacttcctggtgtccctggctgcggccgacgtggccgtgggg ctcttcgccatcccctttgccatcaccatcagcctgggcttctgcactgacttctacggc tgcctcttcctcgcctgcttcgtgctggtgctcacgcagagctccatcttcagccttctg gccgtggcagtcgacagatacctggccatctgtgtcccgctcaggttgtttcgtgggtcc cagcaggaacaggagcctcctgcttgggagggctgtgagccttatgtgggtgtctggcca ctccatgtgctgctgcccgtgctgttggaccaggaggccttatgtctcccatcagtgctc agtgacctctgcagcagagcagtgactgccacgggctggcgaacggaaggacggaaggac gtggcccatgcccttcaggaatgttttggggagatggacaaacgcatgcagacctga >gi568815581f:15845249_16075339|GENSCAN_predicted_peptide_4|323_aa MSQREPLVKNGRHWTCGAEAGGFCGHAGKYTSYTADQQHSILIGANPIVNCARRGSTLCA PYENLMHHPPPLVCGNIVFHEAGPWCQKERYKLTVTDGILLNRYKSLVTGTRARGVIAVL WVLAFGIGLTPFLGWNSKDSATNNCTEPWDGTTNESCCLVKCLFENVVPMSYMVYFNFFG CVLPPLLIMLVIYIKIFLVACRQLQRTELMDHSRTTLQREIHAAKSLAMIVGIFALCWLP VHAVNCVTLFQPAQGKNKPKWAMNMAILLSHANSVVNPIVYAYRNRDFRYTFHKIISRYL LCQADVKSGNGQAGVQPALGVGL >gi568815581f:15845249_16075339|GENSCAN_predicted_CDS_4|972_bp atgagccagagggagccattggttaagaatggcaggcactggacgtgtggagccgaggct ggggggttctgcggccatgctgggaagtatacgtcttacactgcagatcagcagcattcg attctcataggagctaaccctattgtgaactgtgcacgcaggggatctacgctgtgcgct ccttatgagaatctaatgcatcaccccccgcccctggtctgtggaaacattgtcttccat gaagctggtccctggtgccagaaagagcgctataaactgaccgtaacagatggtattctt ttaaacaggtataaaagtttggtcacggggacccgagcaagaggggtcattgctgtcctc tgggtccttgcctttggcatcggattgactccattcctggggtggaacagtaaagacagt gccaccaacaactgcacagaaccctgggatggaaccacgaatgaaagctgctgccttgtg aagtgtctctttgagaatgtggtccccatgagctacatggtatatttcaatttctttggg tgtgttctgcccccactgcttataatgctggtgatctacattaagatcttcctggtggcc tgcaggcagcttcagcgcactgagctgatggaccactcgaggaccaccctccagcgggag atccatgcagccaagtcactggccatgattgtggggatttttgccctgtgctggttacct gtgcatgctgttaactgtgtcactcttttccagccagctcagggtaaaaataagcccaag tgggcaatgaatatggccattcttctgtcacatgccaattcagttgtcaatcccattgtc tatgcttaccggaaccgagacttccgctacacttttcacaaaattatctccaggtatctt ctctgccaagcagatgtcaagagtgggaatggtcaggctggggtacagcctgctctcggt gtgggcctatga >gi568815581f:15845249_16075339|GENSCAN_predicted_peptide_5|406_aa MFRLLSWSLGRGFLRAAGRRCRGCSARLLPGLAGGPGPEVQVPPSRVAPHGRGPGLLPLL AALAWFSRPAAAEEEEQQGADGAAAEDGADEAEAEIIQLLKRAKVRRLRALRPAERGSLC ERGLGVAAHYSLRLAEEWMEAPRLSIMKDEPEEAELILHDALRLAYQTDNKKAITYTYDL AEQLFKATMSYLLGGGMKQEDNAIIEISLKLASIYAAQNRQEFAVAGYEFCISTLEEKIE REKELAEDIMSVEEKANTHLLLGMCLDACARYLLFSKQPSQAQRMYEKALQISEEIQGER HPQTIVLMSDLATTLDAQGRFDEAYIYMQRASDLARQINHPELHMVLSNLAAVLMHRERY TQAKEIYQEALKQAKLKKDEISVQHIREELAELSKKSRPLTNSVKL >gi568815581f:15845249_16075339|GENSCAN_predicted_CDS_5|1221_bp atgttccggctcctgagctggagcctgggccgaggcttcctgcgggccgcggggcggcgg tgccggggctgctccgcgcgcctgctcccggggctggcaggaggtccggggcccgaggtg caggtgccgccatcccgagtcgcgccgcacggccggggcccaggcctgctgccgctgctg gcagcgctcgcctggttctcgaggcccgctgcggcagaggaggaggagcagcagggagcc gacggggccgctgccgaggacggggcggacgaggccgaggcagagatcatccagctgctg aagcgagccaaggtgaggcggctccgggccctgcgcccggccgagcgcggtagcctttgt gagcggggtcttggcgttgctgcgcattactctctccgcctcgccgaagagtggatggag gctccgcggttgagcattatgaaagatgagccagaagaggctgagttaattttgcatgac gctcttcgtctcgcctatcagactgataacaagaaggccatcacttacacttatgatttg gctgaacaactttttaaagcaacaatgagttacctccttggagggggcatgaagcaggag gacaatgcaataattgaaatttccctaaagctggccagtatctatgctgcgcagaacaga caggaatttgctgttgctggctatgaattctgcatttcaactctagaggaaaaaattgaa agagaaaaggaattagcagaagacattatgtcagtggaagagaaagccaatacccacctc ctcttgggcatgtgcttagacgcctgtgctcgctaccttctgttctccaagcagccgtca caggcacaaaggatgtatgaaaaagctctgcagatttctgaagaaatacaaggagaaaga cacccacagaccattgtgctgatgagtgacctggctactaccctggatgcacagggccgc tttgatgaggcctatatttatatgcaaagggcatcagatctggcaagacagataaatcat cctgagctacacatggtactcagtaatctagctgcagttttgatgcacagagaacgatat acacaagcaaaagagatctaccaggaagcactgaagcaagcaaagctgaaaaaagatgaa atttctgtacaacacatcagggaagagttggctgagctgtcaaagaaaagtagacctttg acaaattctgtcaagctctaa >gi568815581f:15845249_16075339|GENSCAN_predicted_peptide_6|795_aa XTPRATTESFEDGLKYPKQIKRESPPIRAFEGAITKGKPYDGITTIKEMGRSIHEIPRQD ILTQESRKTPEVVQSTRPIIEGSISQGTPIKFDNNSGQSAIKHNVKSLITGPSKLSRGMP PLEIVPENIKVVERGKYEDVKAGETVRSRHTSVRQLSPTPGYPSQYQLYAMENTRQTILN DYITSQQMQVNLRPDVARGLSPREQPLGLPYPATRGHPTHLAAAASAEREREREREKERE RERIAAASSDLYLRPGSEQPGRPGSHGYVRSPSPSVRTQETMLQQRPSVFQGTNGTSVIT PLDPTAQLRIMPLPAGGPSISQGLPASRYNTAADALAALVDAAASAPQMDVSKTKESKHE AARLEENLRSRSAAVSEQQQLEQKTLEVEKRSVQCLYTSSAFPSGKPQPHSSVVYSEAGK DKGPPPKSRYEEELRTRGKTTITAANFIDVIITRQIASDKDARERGSQSSDSSSSYDPTR QYEGPLHHYRPQQESPSPQQQLPPSSQAEGMGQVPRTHRLITLADHICVPVVHEKQDSLL LLSQRGAEPAEQRNDARSPGSISYLPSFFTKLENTSPMVKSKKQEIFRKLNSSGGGDSDM AGAAAVEWAGSHWCSTSCWSRSTSPYIVDEAPNVDIGQGHCKQAGPKRFNIYTSCFNEGI DLILRDGHLIVMQSSVSSRGHSFADPASNLGLEDIIRKALMGSFDDKVEDHGVVMSQPMG VVPGTANTSVVTSGSTQFPYNPLTMRMLSSTPPTPIACAPSAVNQAAPHQQNRIWEREPA PLLSAQYETLSDSDD >gi568815581f:15845249_16075339|GENSCAN_predicted_CDS_6|2388_bp nggacaccaagagcaacaactgaaagctttgaagatggccttaaatatcccaaacaaatt aaaagggaaagtcctcccatacgagcatttgaaggtgccattaccaaaggaaaaccatat gatggcatcaccaccatcaaagaaatggggcgttccattcatgagattccaaggcaagat attttaactcaggaaagtcggaaaactccagaagtggtccagagcacacggccgataatt gagggttccatttcccagggcacaccaataaagtttgacaacaactcaggtcaatctgcc atcaaacacaatgtcaaatccttaatcacggggcctagcaaactatcccgtggaatgcct ccgctggaaattgtgccagagaacataaaagtggtagaacggggaaaatatgaggatgtg aaagcaggcgagaccgtgcgttcccggcacacgtcagtgagacagctttcaccaactcca ggttacccaagtcagtatcagctttacgcaatggagaacacaagacagacaatcttaaat gattacattacctcacaacagatgcaagtgaacttgcgtccagatgtggccagaggactc tccccaagagagcagccactgggtctcccatacccagcaacgagaggacacccaacacac cttgcagctgctgcaagtgctgagagggaacgggaacgggagcgggagaaggagcgggag cgggaacggattgctgcagcttcctccgacctctacctgcggccaggctcagaacagcct ggccgacctggcagtcatggatatgttcgctccccttccccttcagtaagaactcaggag accatgttgcaacagagacccagtgttttccaaggaaccaatggaaccagtgtaatcaca cctttggatccaactgctcagctacgaatcatgccactgcctgctgggggcccttcaata agccaaggcctgccagcctcccgttacaacactgctgcggatgccctggctgctcttgtg gatgctgcagcttctgcaccccagatggatgtgtccaaaacaaaagagagtaagcatgaa gctgccaggttagaagaaaatttgagaagcaggtcagcagcagttagtgaacagcagcag ctagagcagaaaaccctggaggtggagaagagatctgttcagtgtttatacacttcttca gcctttccaagtggcaagccccagcctcattcttcagtagtttattctgaggctgggaaa gataaagggcctcctccaaaatccagatatgaggaagagctaaggaccagagggaagact accattactgcagctaacttcatagacgtgatcatcacccggcaaattgcctcggacaag gatgcgagggaacgtggctctcaaagttcagactcttctagtagctatgatcctaccaga caatatgaaggaccattacatcactatcgaccacagcaggaatcaccatctccccaacaa cagctgcccccttcttcacaggcagagggaatggggcaagtgcccaggacccatcggctg atcacacttgctgatcacatctgtgttccggttgtgcatgagaaacaggacagcttgctg ctcttgtctcagaggggcgcagagcctgcagagcagaggaatgatgcccgctcaccaggg agtataagctacttgccttcattcttcaccaagcttgaaaatacatcacccatggttaaa tcaaagaagcaggagatttttcgtaagttgaactcctctggtggaggtgactctgatatg gctggagcagcagcagtggagtgggcaggatcccactggtgcagcaccagctgctggagc aggtccaccagcccctacattgtagatgaggctcccaatgttgacattggccagggccat tgcaaacaagctggaccaaaaaggttcaacatttacaccagctgctttaacgagggcatt gatcttatcctccgtgatggtcacctcattgtcatgcagagctcagttagctctagaggc cattcttttgctgatcctgccagtaatcttgggctggaagacattatcaggaaggctctc atgggaagctttgatgacaaagttgaggatcatggagttgtcatgtcccagcctatggga gtagtgcctggtactgccaacacctcagttgtgaccagtggctcaactcagtttccttat aaccctctgactatgcggatgctcagcagtactccaccaacaccgattgcatgtgctccc tctgcggtgaaccaagcagctcctcaccaacagaacaggatctgggagcgagagcctgcc ccactgctctcagcacagtacgagaccctgtcggatagtgatgactga