GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:51:56 Sequence gi568815588r:69472402_69673043 : 200642 bp : 47.80% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4450 4500 51 0 0 59 59 61 0.616 1.46 1.02 Intr + 4606 4674 69 0 0 100 91 11 0.687 1.98 1.03 Intr + 11290 11475 186 0 0 123 73 249 0.711 26.79 1.04 Intr + 12740 12814 75 1 0 78 80 122 0.538 10.11 1.05 Intr + 14222 14338 117 1 0 71 75 35 0.531 1.16 1.06 Intr + 23193 23288 96 2 0 48 105 230 0.999 20.91 1.07 Intr + 25879 26041 163 0 1 96 36 130 0.702 8.15 1.08 Intr + 30147 30232 86 1 2 115 97 37 0.946 6.84 1.09 Intr + 32037 32084 48 2 0 62 80 70 0.832 2.48 1.10 Intr + 33723 33839 117 2 0 88 80 238 0.969 23.66 1.11 Term + 34428 34577 150 2 0 88 49 207 0.922 14.71 1.12 PlyA + 35239 35244 6 1.05 2.00 Prom + 36959 36998 40 -7.06 2.01 Init + 39394 39469 76 0 1 83 99 38 0.420 5.65 2.02 Term + 66911 67191 281 0 2 15 49 643 0.980 48.71 2.03 PlyA + 67827 67832 6 -0.45 3.02 PlyA - 67974 67969 6 1.05 3.01 Sngl - 69945 69406 540 0 0 51 42 184 0.678 6.19 3.00 Prom - 76486 76447 40 -3.96 4.02 PlyA - 79794 79789 6 1.05 4.01 Sngl - 100642 99998 645 1 0 60 48 656 0.998 54.98 4.00 Prom - 101915 101876 40 -12.11 5.06 PlyA - 102082 102077 6 1.05 5.05 Term - 103525 103363 163 0 1 5 42 368 0.748 21.31 5.04 Intr - 105247 105156 92 1 2 87 93 38 0.682 2.99 5.03 Intr - 106486 106277 210 1 0 61 28 127 0.323 3.11 5.02 Intr - 113288 113185 104 0 2 111 91 13 0.476 3.79 5.01 Init - 114775 114766 10 1 1 83 67 5 0.215 -1.30 5.00 Prom - 124887 124848 40 -3.56 6.00 Prom + 134686 134725 40 -3.26 6.01 Init + 139063 139169 107 0 2 62 85 53 0.550 2.09 6.02 Intr + 141991 142138 148 1 1 74 80 41 0.825 2.14 6.03 Intr + 144576 144717 142 2 1 87 35 66 0.727 1.13 6.04 Intr + 146748 146812 65 1 2 82 64 54 0.402 0.84 6.05 Term + 148209 148316 108 2 0 147 47 25 0.840 2.81 6.06 PlyA + 148838 148843 6 1.05 7.00 Prom + 153349 153388 40 -5.06 7.01 Init + 159343 159438 96 0 0 79 96 113 0.976 11.51 7.02 Term + 160389 160658 270 2 0 88 43 275 0.964 18.48 7.03 PlyA + 161174 161179 6 1.05 8.06 PlyA - 164218 164213 6 1.05 8.05 Term - 167746 167728 19 0 1 95 48 23 0.320 -3.21 8.04 Intr - 171609 171440 170 0 2 119 97 104 0.904 13.14 8.03 Intr - 173450 173349 102 2 0 104 86 27 0.883 4.47 8.02 Intr - 183942 183812 131 1 2 80 100 15 0.047 2.31 8.01 Init - 188428 188257 172 1 1 107 32 93 0.029 5.20 8.00 Prom - 189909 189870 40 -5.86 9.00 Prom + 193889 193928 40 -2.96 9.01 Init + 194858 194988 131 1 2 86 109 104 0.944 10.05 9.02 Intr + 197799 197901 103 0 1 123 98 -12 0.490 3.48 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 22259 22420 162 1 0 108 53 91 0.838 5.88 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:69472402_69673043|GENSCAN_predicted_peptide_1|385_aa MEVQCPQNLEYRLVVLEERGVGQQEPSCPGDVGEKTVFSRLIGALVLSVGIYAEVERQKY KTLESAFLAPAIILILLGVVMFMVSFIGVLASLRDNLYLLQAFMYILGICLIMELIGGVV ALTFRNQQLLLRQQVLSHTLGCADLSDGPGSGPVKMFMGVPVIPAQTIDFLNDNIRRGIE NYYDDLDFKNIMDFVQKKFKCCGGEDYRDWSKNQYHDCSAPGPLACGVPYTCCIRNTVDT APVGTGGLSGTPGWKRLLDPGLGACELQQSLLQNIDSMVRQTEVVNTMCGYKTIDKERFS VQDVIYVRGCTNAVIIWFMDNYTIMAGILLGILLPQFLGVLLTLLYITRVEDIIMEHSVT DGLLGPGAKPSVEAAGTGCCLCYPN >gi568815588r:69472402_69673043|GENSCAN_predicted_CDS_1|1158_bp atggaggttcagtgtccccagaacttagagtaccgcctggtggtcctggaggaaagggga gtgggacagcaggaacccagctgccctggggatgtgggagagaagactgttttctcccga ctgattggggccctggtcctgtctgtgggcatctatgcagaggttgagcggcagaaatat aaaacccttgaaagtgccttcctggctccagccatcatcctcatcctcctgggcgtcgtc atgttcatggtctccttcattggtgtgctggcgtccctccgtgacaacctgtaccttctc caagcattcatgtacatccttgggatctgcctcatcatggagctcattggtggcgtggtg gccttgaccttccggaaccagcagttgctcttaaggcagcaggtgctgtcccacaccctg ggttgtgcagacctctcagatggcccaggatctggccctgtgaaaatgttcatgggagtg cctgtgattccagcccagaccattgacttcctgaacgacaacattcgaagaggaattgag aactactatgatgatctggacttcaaaaacatcatggactttgttcagaaaaagttcaag tgctgtggcggggaggactaccgagattggagcaagaatcagtaccacgactgcagtgcc cctggacccctggcctgtggggtgccctacacctgctgcatcaggaacacggtagacact gctcctgtggggactggggggctgtcggggaccccaggatggaagaggctactggaccct ggtcttggggcctgtgaacttcagcagagcctcttgcagaatatcgattctatggtcagg cagacagaagttgtcaacaccatgtgtggctacaaaactatcgacaaggagcgtttcagt gtgcaggatgtcatctacgtgcggggctgcaccaacgccgtgatcatctggttcatggac aactacaccatcatggcgggcatcctcctgggcatcctgcttccccagttcctgggggtg ctgctgacgctgctgtacatcacccgggtggaggacatcatcatggagcactctgtcact gatgggctcctggggcccggtgccaagcccagcgtggaggcggcaggcacgggatgctgc ttgtgctaccccaattag >gi568815588r:69472402_69673043|GENSCAN_predicted_peptide_2|118_aa MDEAGSHHSRQTDTGTENQTPHVLTRHGYTKISETTTKELIHVTKKIEEEEEEEEEEEEE EEEERRRRRKKKKKKKEKEKEKEKKKKKKKKKKKKKKKKKKEKEKEKEKEKKTQPLQD >gi568815588r:69472402_69673043|GENSCAN_predicted_CDS_2|357_bp atggatgaagctggaagccatcattctcggcaaactgacacaggaacagaaaaccaaaca ccgcatgttctcactcgtcatggctacaccaaaatctcagaaactaccactaaagaactc attcatgtaacaaaaaaaatagaagaagaagaggaagaggaagaagaggaagaggaagag gaagaagaagaaagaagaagaagaaggaagaagaagaagaagaagaaggagaaggagaag gagaaggagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaag aaggagaaggagaaggagaaggagaaggagaagaagactcagcccctgcaggattaa >gi568815588r:69472402_69673043|GENSCAN_predicted_peptide_3|179_aa MWESLELPRDLLNGFDQNADSDVDNEVQAEVVSDGDEELVGNWSKGHPCYAKRLVAFSPC PRDLWKFEYERENLGYLAEEISKHQSVQEEAEHKSLENLHPGNAIEKKNSFLRGKFKPIA EICISNEEPNVNHQDNGENVSRECLRTSQQPLPSEAQGLRREKWFGGPVHPSCFSSTCG >gi568815588r:69472402_69673043|GENSCAN_predicted_CDS_3|540_bp atgtgggaaagtttggaacttcctagagacctgttgaatggctttgaccaaaatgctgat agtgatgtggacaatgaagtccaggctgaggtggtctcagatggagatgaggaacttgtt ggaaactggagtaaaggccacccttgctatgcaaagagactggtggcattttccccctgc cctagagatctgtggaaatttgaatatgagagagagaatttagggtatctggcagaagaa atttctaagcaccaaagcgttcaagaggaagcagagcataaaagtttagaaaatttgcac cctggcaatgcgatagaaaagaaaaactcatttcttagggggaaattcaagcccattgca gaaatttgcataagtaacgaggaaccgaatgttaatcaccaagacaatggggaaaatgtc tccagggaatgtctgagaacttcacagcagcccctcccatcagaggcccaggggcttagg agggaaaaatggtttgggggaccagtgcatcccagttgcttcagttccacttgtggctaa >gi568815588r:69472402_69673043|GENSCAN_predicted_peptide_4|214_aa MTPQPSGAPTVQVTRETERSFPRASEDEVTCPTSAPPSPTRTRGNCAEAEEGGCRGAPRK LRARRGGRSRPKSELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKLT KIETLRFAHNYIWALTQTLRIADHSLYALEPPAPHCGELGSPGGSPGDWGSLYSPVSQAG SLSPAASLEERPGLLGATFSACLSPGSLAFSDFL >gi568815588r:69472402_69673043|GENSCAN_predicted_CDS_4|645_bp atgacgcctcaaccctcgggtgcgcccactgtccaagtgacccgtgagacggagcggtcc ttccccagagcctcggaagacgaagtgacctgccccacgtccgccccgcccagccccact cgcacacgggggaactgcgcagaggcggaagagggaggctgccgaggggccccgaggaag ctccgggcacggcgcgggggacgcagccggcctaagagcgagttggcactgagcaagcag cgacggagtcggcgaaagaaggccaacgaccgcgagcgcaatcgaatgcacaacctcaac tcggcactggacgccctgcgcggtgtcctgcccaccttcccagacgacgcgaagctcacc aagatcgagacgctgcgcttcgcccacaactacatctgggcgctgactcaaacgctgcgc atagcggaccacagcttgtacgcgctggagccgccggcgccgcactgcggggagctgggc agcccaggcggttcccccggggactgggggtccctctactccccagtctcccaggctggc agcctgagtcccgccgcgtcgctggaggagcgacccgggctgctgggggccaccttttcc gcctgcttgagcccaggcagtctggctttctcagattttctgtga >gi568815588r:69472402_69673043|GENSCAN_predicted_peptide_5|192_aa MAEGHKEWTIKDKLKNKFKDEFDIVSLPCTLGSHPQPQVDANSWESQEDGRVFTIREFKS PGVRRCPASTPRLSNPLRSVQRPLSQPPFGKPRSSSAALRSPGCQHCELAQQPLGGELRA GQHQPAEALQLLADEIWVWQGWPMRQTSVKKEKEEEKEKEKEEEGGEEEEEEEEEEEEEE EEEEEAAAAAAA >gi568815588r:69472402_69673043|GENSCAN_predicted_CDS_5|579_bp atggcagaagggcataaagaatggaccattaaggacaaactaaaaaacaaattcaaggat gaatttgatatagtctccctgccttgcactttaggttctcacccccaaccccaggtggac gccaactcctgggaatcccaggaggacgggagagtcttcaccatccgagagttcaagagc ccaggggtcaggcgatgcccagcgagcacccccagactttccaaccctctccgctccgtc cagcggcccctgtcccagccgccttttggcaagccgcgctcttcctccgctgccctgcgc tctcctggctgccagcactgtgagctggcgcagcagccgttgggtggggagctgcgcgct gggcagcaccagcctgcggaagcgctccagctcctggcagatgaaatatgggtctggcaa ggctggcctatgaggcagacaagtgtgaagaaggagaaggaggaggagaaggagaaggag aaagaagaagaaggaggagaagaagaagaagaagaagaagaggaggaggaggaggaggag gaggaggaggaggaagcagcagcagcagcagcagcttga >gi568815588r:69472402_69673043|GENSCAN_predicted_peptide_6|189_aa MRYCYTSIRMVKIQKTDNSKYWQGRIGTTGILIHHWTLVTVDYNCQSYKTQALFKKEFLG KSKDNRGGGKSNKDTTGNGSLWRLQGSCNADSLRLFECELDTSNSHIYILQMLRNMCQTL YVVFLKLFNLAGVPAGSKTSMASYGLQCKSQTKNTIGPSTIEISVPNLSQTLNICSTLKG TMKQASRLN >gi568815588r:69472402_69673043|GENSCAN_predicted_CDS_6|570_bp atgagatactgctatacatctattagaatggtcaaaatccagaaaactgacaactctaaa tactggcaaggaagaattggaacaacagggattctcattcatcactggactctagtaaca gtggattataactgccagagttataagacacaggctctatttaagaaggagttcttaggg aaatccaaagataacaggggagggggaaaatccaacaaagacactacaggaaatggaagc ttatggcgcctacagggcagctgcaatgcagactctcttagacttttcgaatgtgaattg gacacatccaattcacacatctacatccttcaaatgctaaggaacatgtgtcaaactctc tatgtggttttcctgaaactctttaatttggctggagtacctgcaggctccaaaacttcc atggcttcctatggcctacagtgcaagtcacagactaagaataccataggtcccagcacc attgaaatttcagtcccaaacctcagccagaccctgaacatatgttccaccctgaaagga acaatgaagcaagcctccaggcttaattaa >gi568815588r:69472402_69673043|GENSCAN_predicted_peptide_7|121_aa MVRILANGEIVQDDDPRVRTTTQPPRGSIPRQSFFNRGHGAPPGGPGPRQQQAGARLGAA QSPFNDLNRQLVNMGFPQWHLGNHAVEPVTSILLLFLLMMLGVRGLLLVGLVYLVSHLSQ R >gi568815588r:69472402_69673043|GENSCAN_predicted_CDS_7|366_bp atggtgcggatcttggccaatggggaaatcgtgcaggatgacgacccccgagtgaggacc actacccagccaccaagaggtagcattcctcgacagagcttcttcaacaggggccatggt gctcccccagggggtcctggcccccgccagcagcaggcaggtgccaggctgggtgctgct cagtcccccttcaatgacctcaaccggcagctggtgaacatgggctttccgcagtggcat cttggcaaccatgctgtggagccggtgacctccatcctgctcctcttcctgctcatgatg cttggtgttcgtggcctcctcctggttggccttgtctacctggtgtcccacctgagtcag cggtga >gi568815588r:69472402_69673043|GENSCAN_predicted_peptide_8|197_aa MEHHDCRCKGLPHMLVAQKQQLTVKDGNTLCSRWIKWLQRQAKDPAFSPSQESRVEKGRV VEGSDHRVHEARLPGNHSRSRGVDSAPPTLAHAGLLLTEEKTLKENGRRAEALRVGVHWR HGTSQGPGRTLWAEKVKPHFRTRGQVYDASSAPVTEHTATELPAGNQGSQPGLQEGGLEG TVVTLVSWKVTSEKDKS >gi568815588r:69472402_69673043|GENSCAN_predicted_CDS_8|594_bp atggagcaccatgactgcagatgtaagggtctcccacacatgctggtggctcagaagcaa cagctgacagtaaaagatggaaacaccctttgcagccgctggatcaaatggcttcagcgc caggccaaagatccagcttttagcccaagccaggaaagcagagtagagaaggggagagtt gttgaggggtcagaccatcgggttcacgaggccaggcttcctggcaaccactcccggagc aggggagtggattctgctccaccaaccctggctcacgctggcctgctcttgacggaagag aagacccttaaggaaaatggccgccgagcagaagctctgagggttggcgtgcattggcgg catggaacaagtcaggggcctggaaggacactgtgggcagaaaaggtgaagccccacttc aggacaaggggccaagtgtacgatgcttcctcagcacctgtgactgagcacacggctact gagctgcctgctggcaatcaagggagccagccagggctgcaggaggggggtctagagggg actgtggtcaccctggtgtcctggaaagtcactagtgaaaaagacaagagctag >gi568815588r:69472402_69673043|GENSCAN_predicted_peptide_9|78_aa MPGTTRLWLLCPCGWDAVVGPELDEMPSWDSIRWLVTFPAEGIRYQRGPPESSICDSPPL SYSLTKHLALLLQSTHPK >gi568815588r:69472402_69673043|GENSCAN_predicted_CDS_9|234_bp atgccgggcaccaccaggctctggctgctgtgtccatgtggttgggatgctgtggtgggc ccagagctggatgagatgccgagctgggactctatcaggtggctggtcaccttccctgct gagggcatcaggtatcaaagaggccctccagaatcctcaatctgtgatagtcccccactc agttattctctcacaaagcatctggctcttctccttcagagcactcatcccaag