GENSCAN 1.0 Date run: 5-Nov-116 Time: 11:12:03 Sequence gi568815584r:34463125_34730152 : 267028 bp : 44.65% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 6370 6281 90 1 0 95 115 54 0.789 8.99 1.01 Init - 7362 7300 63 0 0 82 102 12 0.779 1.38 1.00 Prom - 11293 11254 40 -3.46 2.00 Prom + 14171 14210 40 -6.06 2.01 Sngl + 22855 23328 474 0 0 91 47 491 0.997 41.41 2.02 PlyA + 25576 25581 6 1.05 3.22 PlyA - 27557 27552 6 1.05 3.21 Term - 53462 53186 277 1 1 90 41 141 0.325 4.53 3.20 Intr - 66351 66234 118 1 1 5 78 94 0.447 -0.38 3.19 Intr - 70415 70320 96 0 0 45 111 82 0.900 6.18 3.18 Intr - 73151 72970 182 1 2 21 57 175 0.944 7.21 3.17 Intr - 76520 76431 90 1 0 7 80 206 0.028 10.81 3.16 Intr - 78463 78292 172 2 1 23 29 136 0.006 0.20 3.15 Intr - 83758 83598 161 0 2 64 111 44 0.027 3.93 3.14 Intr - 93642 93490 153 2 0 127 9 64 0.016 1.59 3.13 Intr - 94464 94280 185 0 2 32 14 163 0.089 1.89 3.12 Intr - 101371 101240 132 1 0 59 52 108 0.341 5.14 3.11 Intr - 104647 104562 86 2 2 98 67 28 0.231 1.24 3.10 Intr - 104889 104730 160 0 1 57 79 54 0.147 0.96 3.09 Intr - 106025 105592 434 0 2 67 38 385 0.001 24.77 3.08 Intr - 130026 129921 106 0 1 75 69 75 0.930 4.09 3.07 Intr - 134521 134426 96 1 0 62 116 27 0.858 3.11 3.06 Intr - 140347 140224 124 0 1 62 76 60 0.980 2.79 3.05 Intr - 142593 142472 122 0 2 75 94 107 0.996 9.29 3.04 Intr - 145016 144906 111 2 0 64 97 75 0.989 6.58 3.03 Intr - 146618 146514 105 2 0 70 94 83 0.982 7.51 3.02 Intr - 166830 166783 48 0 0 58 83 118 0.794 7.18 3.01 Init - 167028 166987 42 0 0 64 64 83 0.587 2.02 3.00 Prom - 206389 206350 40 -4.76 4.00 Prom + 215476 215515 40 -4.26 4.01 Init + 223073 223202 130 1 1 59 32 130 0.671 4.81 4.02 Term + 231202 231467 266 2 2 -11 42 435 0.530 24.67 4.03 PlyA + 231684 231689 6 1.05 5.06 PlyA - 233793 233788 6 1.05 5.05 Term - 246311 246295 17 0 2 104 54 16 0.468 -1.80 5.04 Intr - 250012 249936 77 0 2 100 87 32 0.985 3.46 5.03 Intr - 250437 250130 308 0 2 90 70 222 0.989 15.85 5.02 Intr - 251467 251414 54 1 0 116 95 60 0.971 8.68 5.01 Init - 251778 251689 90 0 0 53 55 102 0.839 2.03 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 76504 76431 74 1 2 110 80 161 0.965 18.04 S.002 Intr - 112718 112632 87 2 0 123 86 60 0.871 9.47 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:34463125_34730152|GENSCAN_predicted_peptide_1|51_aa MNSSRAGSVLPALSLPQPLAKVGNTFSIEIFLDLKVRDDPRLINRDSQRQQ >gi568815584r:34463125_34730152|GENSCAN_predicted_CDS_1|153_bp atgaactcctcaagggctggctctgtgctacctgccctatcactcccacagcccctggca aaggttggaaatacttttagtatagaaattttcttggatctgaaagtacgagatgatcca cggttaataaacagggatagccagcggcagcag >gi568815584r:34463125_34730152|GENSCAN_predicted_peptide_2|157_aa MALKAKKEALAPPKAEAKAKALKAKKAVMKDVHIHKKRRPARHPPSRGPGHCDSRPPKYP RKSAPRRNKLDRYAIIKFLWTTESAMKKIEDNTTYVFRVDVKANKQQIKQAVKKLPDNDA AKVNTLIRPDGEKKAYVQLAPDYDVLDVAKKKWDHLN >gi568815584r:34463125_34730152|GENSCAN_predicted_CDS_2|474_bp atggcgttgaaagcaaagaaggaagcgcttgcccctcctaaagccgaagccaaagcgaag gctttaaaggccaagaaggcagtgatgaaagatgtccacatccacaaaaaaagaagaccc gcacgtcacccaccttccagaggcccaggacactgcgactcgaggccacccaaatatcct cggaagagcgcccccaggagaaacaagcttgaccgttatgccatcatcaagtttctgtgg accactgagtccgccatgaagaagatagaagacaacaccacatatgtgttcagggtggat gttaaagccaacaagcagcagatcaaacaggctgtgaagaagctccctgacaatgatgcg gccaaggtcaacaccctgattcggcccgatggagagaagaaggcatatgttcaactggct ccagattacgatgttttggatgttgccaaaaaaaaatgggatcatctaaactga >gi568815584r:34463125_34730152|GENSCAN_predicted_peptide_3|999_aa MRACAGPRLGAAMMEGLDDGPDFLSEEDRGLKAINVDLQSDAALQVDISDALSERDKVKF TVHTKSSLPNFKQNEFSVVRQHEEFIWLHDSFVENEDYAGYIIPPAPPRPDFDASREKLQ KLGEGEGSMTKEEFTKMKQELEAEYLAIFKKTVAMHEVFLCRVAAHPILRRDLNFHVFLE YNQDLSVRGKNKKEKLEDFFKNMVKSADGVIVSGVKDVDDFFEHERTFLLEYHNRVKDAS AKSDRMTRSHKMHRPGRMMPGVTVKDVNQQEFIRALAAFLRKSGKLKVPEWVDTVKLLAK HKELAPYDENWLYTRAASTAWHLYLWGGAGVGSMTKIYGGCQRSGGMPGHFSRGSKNVAH RVLQALEGLKMVEKDQDGGRKLTRQGQRDLDRIAGQDLLYRRSRSLVDYENANKALDKAR AKNKDVLQAETSQQLCCQKFEKISESAKQELIDFKTRRVAAFRKNLVELAELELKHAKIS DNCSINSSTQPEVILPPRGHFAISGDICGYHNWVFYGYYQYMPGQQERNSVSHKKKKKKK KKKKKKKKKKKKNTIIIIVYLQIFPSVQKWFVVQRYTLNSTGARPPFQVPDSRPRAHLRP TSPGDTQACPTPSVGPRSGSGRDRSAGDDPGKGVAKAGESANAVPHYHKLCSRVSHIWGN RRGQHIRSAMDKPRPGKTTFVIMVSPLPEQQMKGLDLDPEARALGPAFIHWWTSHGLIFT SEGYVKDKTHSASNGNSTVQTALGNMPLQATMNRLPDDYDPYAVEEPSDEEPALSSSEDE VDVLLHGTPDQKRKLIRECLTGESESSSEDEFEKEMEAELNSTMKTMEDKLSSLGTGSSS GNGKVATAPTRYYDDIYFDSDSEDEDRAVQVTKKKKKKQHKIPTNDELLYDPEKDNRDQA WVDAQRRGHESYKTQYRAMFVMNCSINKEEVLRYKASENRKKRRVHKKMRSNREDAAEKA ETDVEEIYHPVMCTECSTEVAVYDKDEVFHFFNVLASHS >gi568815584r:34463125_34730152|GENSCAN_predicted_CDS_3|3000_bp atgcgcgcctgcgccggccctcgcctcggagcagccatgatggaaggcctggacgacggc ccggacttcctctcagaagaggaccgcggacttaaagcaataaatgtagatcttcaaagt gatgctgctctgcaggtggacatttctgatgctcttagtgagcgggataaagtaaaattc actgttcacacaaagagttcattgccaaattttaaacaaaacgagttttcagttgttcgg caacatgaggaatttatctggcttcatgattcctttgttgaaaatgaagactatgcaggt tatatcattccaccagcaccaccaagacctgattttgatgcttcaagggaaaaactacag aagcttggtgaaggagaagggtcaatgacgaaggaagaattcacaaagatgaaacaggaa ctggaagctgaatatttggcaatattcaagaagacagttgcgatgcatgaagtgttcctg tgtcgtgtggcagcacatcctattttgagaagagatttaaatttccatgtcttcttggaa tataatcaagatttgagtgtgcgaggaaaaaataaaaaagagaaacttgaagacttcttt aaaaacatggttaaatcagcagatggagtaatcgtttcaggagtaaaggatgtagatgat ttctttgagcacgaacgaacatttcttttggagtatcataaccgagttaaggatgcatct gctaaatctgatagaatgacaagatcccacaaaatgcacaggccgggccgcatgatgcct ggagttactgtaaaagatgtgaaccagcaggagttcatcagagctctggcagccttcctc agaaagtctgggaagctgaaagtccccgaatgggtggacaccgtcaagctgctggccaag cacaaagagcttgctccctacgatgagaactggttgtacacgcgagctgcttccacagcg tggcacctgtacctctggggtggcgctggggttggctccatgaccaagatctatggggga tgtcagagaagtggcggcatgcctggccactttagccgaggctccaagaatgtggcccac cgggtcctccaagccctggaggggctaaaaatggtggaaaaggaccaagatgggggccgc aaactgacacgtcagggacagagagatctggacagaatcgctggacaggatctcctgtat cgaaggtctaggtcactagtggattatgaaaatgctaataaagcactggataaagcaaga gcaaaaaataaagatgttctacaggccgaaacttcccaacaattatgttgtcagaaattt gaaaaaatatctgagtctgcaaaacaagaacttatagattttaagacaagaagagttgct gcattcagaaaaaatttagtggaactggcagagttagaactgaagcatgcaaagatttct gataactgttctattaacagtagtactcaaccagaggtgattctgccccccagaggacat tttgcaatatctggagacatttgtggttatcacaactgggtgttctatggttactaccag tacatgcctggacaacaagagcgaaactccgtctcacataagaagaagaagaagaagaag aagaagaagaagaagaagaagaagaagaagaagaagaatacaataataataatagtgtat ttacagattttcccaagtgttcaaaaatggtttgttgtccagcgttatacacttaactcg accggtgcacggcctccctttcaggtcccagactcccggccgcgcgcccacttgcgccca accagccccggagacacccaggcctgtcccacgccgtcggtaggtccccggtccgggagc gggagagaccggagcgccggggacgaccccggcaagggcgtggctaaggcaggggaaagc gcgaacgcagtcccccactaccacaaattatgcagtcgagtttcccacatttggggaaat cgcaggggtcagcacatccggagtgcaatggataagcctcgccctgggaaaaccaccttc gtgatcatggtatctcccctgccagagcagcagatgaagggcctagacctggatccagaa gctagggctctcggtccagcattcatccactggtggacatcacatgggcttatttttacc agcgaaggttacgtgaaggacaaaacgcactcagccagcaacggaaactcaacagttcaa acagcactggggaacatgccgctgcaggccaccatgaaccggcttccggatgactacgac ccctacgcggttgaagagcctagcgacgaggagccggctttgagcagctctgaggatgaa gtggatgtgcttttacatggaactcctgaccaaaaacgaaaactcatcagagaatgtctt accggagaaagtgaatcatctagtgaagatgaatttgaaaaggagatggaagctgaatta aattctaccatgaaaacaatggaggacaagttatcctctctgggaactggatcttcctca ggaaatggaaaagttgcaacagctccgacaaggtactacgatgatatatattttgattct gattccgaggatgaagacagagcagtacaggtgaccaagaaaaaaaagaagaaacaacac aagattccaacaaatgacgaattactgtatgatcctgaaaaagataacagagatcaggcc tgggttgatgcacagagaagggggcatgaatcatacaaaactcaatatagagcaatgttt gtaatgaattgttctattaacaaagaggaggttctaagatataaagcctcagagaacagg aagaaaaggcgggtccataagaagatgaggtctaaccgggaagatgctgccgagaaggca gagacagatgtggaagaaatctatcacccagtcatgtgcactgaatgttccactgaagtg gcagtctacgacaaggatgaagtctttcattttttcaatgttttagcaagccattcctaa >gi568815584r:34463125_34730152|GENSCAN_predicted_peptide_4|131_aa MSSELGKDFSAYSVENKQEVGKTIRTVTALVRDDSSPDQGDEKRHCDSRPPKYPRNSAPR RNKLDRYAIIKFLWTTESAMKKIEDNTTLVFTVDVKANKQQIKQAVKKLCDIDAAKVNTL IRPGGEKKAYV >gi568815584r:34463125_34730152|GENSCAN_predicted_CDS_4|396_bp atgagctctgagcttgggaaagatttctctgcctacagtgtggagaacaagcaggaggtg ggcaagactataagaaccgtaacagccctggtgagagatgacagcagcccggaccagggg gatgaaaaaagacactgcgactcgaggccacccaaatatcctcggaatagcgcccccagg agaaacaagcttgaccgttatgccatcatcaagtttctgtggaccactgagtccgccatg aagaagatagaagacaacaccacacttgtgttcacggtggatgttaaagccaacaagcag cagatcaaacaggctgtgaagaagctctgtgacattgatgcggccaaggtcaacaccctg attcggcctggtggagagaagaaggcatatgtttga >gi568815584r:34463125_34730152|GENSCAN_predicted_peptide_5|181_aa MLGGWRRLCQAPGLRYHRVAPPGLQGSRSECHRAEARAAAAATAEGTMASGVTVNDEVIK VFNDMKVRKSSTQEEIKKRKKAVLFCLSDDKRQIIVEEAKQILVGDIGDTVEDPYTSFVK LLPLNDCRYALYDATYETKESKKEDLVFIFWAPESAPLKSKMIYASSKDAIKKKFTAHKE V >gi568815584r:34463125_34730152|GENSCAN_predicted_CDS_5|546_bp atgctcgggggctggaggcggctctgccaggctcccgggctccgctaccaccgcgtcgcc ccccccggcctgcagggctcgcgctctgagtgccacagagccgaagcccgagctgccgcc gcagccacagccgagggcactatggcttctggagttacagtgaatgatgaagtcatcaaa gtttttaatgatatgaaagtaaggaaatcttctacacaagaggagatcaaaaagagaaag aaagcagttctcttctgtttaagcgatgacaaaagacaaataattgtagaggaagcaaag cagatcttggtgggtgacattggtgatactgtagaggacccctacacatcttttgtgaag ttgctacctctgaatgattgccgatatgctttgtacgatgccacatacgaaacaaaagag tctaagaaagaagacctagtatttatattctgggctcctgaaagtgcacctttaaaaagc aagatgatttatgctagctctaaagatgccattaaaaagaaatttacagcccataaagaa gtctga