GENSCAN 1.0 Date run: 6-Nov-116 Time: 21:35:16 Sequence gi568815582r:8469776_8689838 : 220063 bp : 46.09% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 15881 15920 40 -1.86 1.01 Init + 21017 21028 12 1 0 89 100 6 0.553 2.15 1.02 Intr + 21699 21866 168 2 0 109 81 10 0.500 2.54 1.03 Term + 23042 23155 114 1 0 89 39 67 0.591 0.47 1.04 PlyA + 24627 24632 6 1.05 2.00 Prom + 38133 38172 40 -4.96 2.01 Init + 38589 38734 146 2 2 82 80 110 0.647 9.23 2.02 Intr + 45344 45493 150 2 0 18 63 135 0.305 3.28 2.03 Intr + 59177 59360 184 2 1 37 76 107 0.009 4.19 2.04 Term + 75660 75737 78 2 0 61 41 186 0.916 8.96 2.05 PlyA + 76047 76052 6 1.05 3.00 Prom + 80052 80091 40 -3.06 3.01 Init + 84923 85040 118 1 1 52 61 99 0.676 3.96 3.02 Intr + 85293 85339 47 1 2 128 84 27 0.681 4.43 3.03 Intr + 85428 85447 20 2 2 62 110 17 0.229 -3.29 3.04 Intr + 87754 87914 161 1 2 33 91 70 0.278 1.43 3.05 Intr + 87962 88034 73 0 1 42 66 116 0.270 3.26 3.06 Term + 90869 91040 172 2 1 58 54 75 0.199 -1.60 3.07 PlyA + 91897 91902 6 1.05 4.03 PlyA - 99748 99743 6 1.05 4.02 Term - 100230 99998 233 1 2 105 52 343 0.998 29.04 4.01 Init - 102387 102312 76 0 1 79 92 135 0.953 12.25 4.00 Prom - 102844 102805 40 -9.36 5.00 Prom + 104883 104922 40 -5.16 5.01 Init + 104992 105063 72 0 0 103 86 52 0.253 7.57 5.02 Intr + 109510 109655 146 0 2 32 95 75 0.238 1.68 5.03 Intr + 119349 119500 152 0 2 97 54 44 0.000 1.71 5.04 Intr + 119852 120051 200 0 2 85 55 221 0.000 17.57 5.05 Intr + 121047 121507 461 2 2 32 -54 1179 0.001 90.08 5.06 Term + 121635 122190 556 0 1 46 50 584 0.995 44.20 5.07 PlyA + 124089 124094 6 1.05 6.00 Prom + 125635 125674 40 -2.66 6.01 Init + 126734 126897 164 1 2 57 55 83 0.475 0.26 6.02 Intr + 131366 131553 188 2 2 63 102 101 0.777 8.33 6.03 Term + 137704 137804 101 2 2 131 38 24 0.268 -0.01 6.04 PlyA + 139419 139424 6 1.05 7.00 Prom + 142986 143025 40 -4.46 7.01 Init + 155891 156023 133 1 1 69 110 187 0.674 19.32 7.02 Intr + 158955 159335 381 1 0 82 67 153 0.764 7.48 7.03 Intr + 165264 165304 41 1 2 78 94 64 0.183 3.94 7.04 Intr + 165393 165537 145 2 1 120 95 179 0.997 21.66 7.05 Intr + 169316 169387 72 0 0 94 78 65 0.958 5.48 7.06 Intr + 171356 171409 54 0 0 98 89 66 0.988 6.65 7.07 Intr + 172352 172432 81 0 0 80 116 72 0.997 8.81 7.08 Intr + 172688 172790 103 0 1 110 83 143 0.999 15.23 7.09 Intr + 174782 174950 169 2 1 59 82 306 0.933 27.05 7.10 Intr + 183233 183380 148 1 1 42 37 67 0.007 -3.19 7.11 Term + 190972 191141 170 2 2 41 48 404 0.916 29.74 7.12 PlyA + 191870 191875 6 -0.45 8.05 PlyA - 191943 191938 6 1.05 8.04 Term - 192879 192837 43 2 1 105 48 30 0.044 -2.67 8.03 Intr - 193102 193067 36 0 0 62 100 45 0.029 0.48 8.02 Intr - 196365 196327 39 2 0 114 96 0 0.082 0.74 8.01 Init - 204880 204744 137 1 2 58 53 154 0.461 6.53 8.00 Prom - 211536 211497 40 -2.96 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 120063 119844 220 0 1 72 74 485 0.972 42.29 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582r:8469776_8689838|GENSCAN_predicted_peptide_1|97_aa MEMKLLTLRHTRLQESGAQQRSGDWNNLPGNPRIKARQSPQGILGFCSAHLPEERGAYTK LLVSTPLLLDSFDEHHNDDDDDNLMFISQTYVNTFHA >gi568815582r:8469776_8689838|GENSCAN_predicted_CDS_1|294_bp atggagatgaagctcctcactctgagacacactcgcttgcaagagtcaggagcacaacag aggtcgggagactggaacaatctcccaggtaacccaagaataaaagctcgtcagtctcca cagggcatcctgggattctgttcagctcacctgccagaagaaagaggtgcctacaccaag ctactggtttccacccctctattactggattcctttgatgaacaccataatgatgatgat gatgacaatttaatgttcatatctcaaacatatgtcaacacttttcatgcatga >gi568815582r:8469776_8689838|GENSCAN_predicted_peptide_2|185_aa MLATQDVAQRCKELGITALHIKLWATGGNRNKSPRPGAQMSLRAPCLLRSWLDHQYVDSK EARRSRQGPSCHAVTVERQYILNMQALLDSQSWQDALNRNPWQTLDLVYFMTAEHRTIDF EGPSCKVGKALSEEGFRESFGEDSKILSILATWKRLQLARKIINIIIIIITIITIIITYC YQVTF >gi568815582r:8469776_8689838|GENSCAN_predicted_CDS_2|558_bp atgctggccacccaggatgtggcccagaggtgcaaggagctgggcatcactgccctacac atcaaactctgggccacaggaggaaacaggaacaagtcccctagacctggggcccagatg tccctcagagccccttgcctgctcaggtcatggttagaccaccaatatgttgattcaaag gaagcacgacggtcacgccaggggccatcatgccacgctgtgactgtggagcgtcaatat atcctcaacatgcaggcactgctggacagccaaagctggcaggatgccctgaataggaat ccctggcagactctagatcttgtctatttcatgacagctgagcacaggactattgacttt gaaggtccttcctgcaaggtgggaaaggcactctcagaagagggctttagagaatcgttc ggagaggacagcaaaatcctcagcatcttggctacctggaagagattgcagctggctagg aaaataatcaacatcatcatcatcatcatcaccatcatcaccatcatcatcacctattgc tatcaagttacattttga >gi568815582r:8469776_8689838|GENSCAN_predicted_peptide_3|196_aa MSQIRILKAADTGLALTDEKSCSTYFACCNLCNGHNNPEGLSFTVTFVVFSGFAHVCTTD AGDHKQYTRVTSFGHYILGNLLGSYEEMYVALKNTGAGQQPWSLFQNQLTQGMNSDNNST YFTDIVGRLKKIKRHHHLVTLYQSKSSPSVGLRLLSVIQGMRTCLASFQGALTMSGIPHQ PQRSPPGEGTKVSIGD >gi568815582r:8469776_8689838|GENSCAN_predicted_CDS_3|591_bp atgagtcaaataagaatcctaaaagcagctgacactggcttagcactcacagatgagaag tcctgttctacgtactttgcctgctgtaacttatgcaacggccacaacaaccctgagggt ctctccttcacagtaacctttgtggttttctctggttttgcccatgtatgcactactgat gcaggagaccacaagcaatatacaagggtcactagttttggccactacatcttgggtaat ttattaggcagctacgaagaaatgtatgtggcccttaagaacacaggtgctggtcagcag ccttggtcccttttccagaaccaacttacccagggcatgaactcagataataacagcacc tacttcacagacatcgtgggaagactgaagaagataaagcgccatcatcatctggtaacc ctctaccagagcaagtcttctccttctgtgggcctcaggctcctgtctgtcattcagggc atgagaacatgcttggcaagtttccagggggccctcactatgtctgggatcccccatcag ccccaacgttcacctccaggagaaggcaccaaagtcagcattggggactga >gi568815582r:8469776_8689838|GENSCAN_predicted_peptide_4|102_aa MTGFLSFLLQAYLLLLLTGILFLFGAMVTLAGISVYIAYSAAAFREALCLLEEKALLDQV DISFGWSLALGWISFIAELLTGAAFLAAARELSLRRRQDQAI >gi568815582r:8469776_8689838|GENSCAN_predicted_CDS_4|309_bp atgacggggtttctgagcttcctcctccaagcctacctcctcctcctgctcactggaatt ctcttcctctttggagccatggtgaccctcgctgggatcagcgtctacatagcgtattca gccgccgccttccgggaggcgctgtgtctcttggaggagaaggccctcctggaccaggtg gacatcagcttcggctggtccctggccctgggctggatcagcttcatcgccgagctgctc accggggcagccttcctggcagcagcccgcgagctcagcctgagacggaggcaggaccag gccatatga >gi568815582r:8469776_8689838|GENSCAN_predicted_peptide_5|528_aa MAAPCEFPSRAASCLDVHIRLNAWEHQKLWQKKKVPGSTYYKEEISPPLFPNLVSVTLRL LEELRAMQPCYAWPRSPAVLGKAPELTARTGALPPSSPSPDSLRSCRLDSLTVTFSSLKG FISGPPEAGVGAQRLGLAAIDGPQQVLRPRASPLQPLGVNDIPEVRADGGRQEHKAERPG QRSRAGQPAQSFFIIIATSIATIITTTTITIINITTNNNTIITTITIIITTITSSSITNS TTITIIIITNNNTVITTITIIIITTIITSSSITNTTTITIIIITTTTITITTDNTTITTI TIITTNTAIITTITTSSSPPSPPPPPPSPPSPPPPSSLSPPLPSQHHHHHHHHHHHHHHH HHHHHHHHHHYHHHCHRHHHHHHHHHHHQPPPSPPTITTTIITTITIIITTIITTITTIT TIIITTIITTITTITTTVITITTTMTITIIITTTTSTSTTLLFIITITLINSTIFFTTIK PFTPSRNTSGTFYVSDAIVYVKNIGMNNPKTSPALKKYRIQASQTFTK >gi568815582r:8469776_8689838|GENSCAN_predicted_CDS_5|1587_bp atggcagccccatgtgagtttcccagtcgtgctgcaagctgccttgatgtccacattcga ttaaacgcctgggaacatcaaaagctctggcaaaagaagaaggtcccaggatccacttac tacaaggaagaaatttcaccacctttatttcctaaccttgtcagtgtgactttgagactt ctggaagagctcagagctatgcagccctgttatgcctggccccgaagcccggctgttttg ggaaaggctccagaactcacggctaggaccggggcgctccctccatcctcaccctccccg gactcactgagaagttgccggctcgattcgctgactgtcacgttctccagcctgaagggg ttcatcagcggtccgccagaggccggagtgggagctcagaggctcgggctggctgcgatt gatggaccccagcaggtcctgcgcccccgggccagtcctctccagccgctcggtgtcaat gatataccagaagtccgtgccgatggcggccgccaggagcacaaagctgagcgccccggt cagcgcagccgcgccggccagcccgcccagtctttcttcattatcatagccaccagcatt gccaccatcatcaccaccaccaccatcaccatcatcaacatcaccaccaacaacaacaca atcatcactaccatcaccatcatcatcaccaccatcaccagcagcagcatcaccaacagc accaccatcaccatcatcatcatcaccaacaacaacaccgtcatcactaccatcaccatc atcatcatcaccaccatcatcaccagcagcagcatcaccaacaccaccaccatcaccatc atcatcatcaccaccaccaccatcaccatcaccacggacaacaccaccatcaccaccatc accatcatcaccaccaacaccgccatcatcactaccatcaccacatcatcttcaccacca tcaccaccaccaccaccaccatcaccaccatcaccaccaccaccatcatcgctatcacca ccattgccatcacaacaccaccatcaccaccatcaccaccaccaccaccatcaccaccat caccaccatcaccaccaccaccatcatcactatcaccaccattgccatcgtcatcaccat catcatcaccaccaccatcaccaccaaccaccaccatcaccaccaaccattaccaccacc atcatcaccaccatcaccatcatcatcaccaccatcatcaccaccattaccactatcacc accatcatcatcaccaccatcatcaccaccattaccactatcaccaccacggtcattacc atcaccaccaccatgaccatcaccatcatcatcactaccaccaccagcaccagcaccact ctcctctttatcatcaccatcactcttatcaattccaccatcttcttcaccaccataaaa ccttttactccttcaagaaatacttccggtaccttctatgtgtctgatgctatagtctat gttaagaatataggcatgaacaatccaaaaacaagtcctgcccttaagaaatataggatt caggcttcccagacttttacaaagtag >gi568815582r:8469776_8689838|GENSCAN_predicted_peptide_6|150_aa MLGQFQTPPLTGLGASIFNLLEVGNHIRMEIPGDHPSVRSPGHEERLEDEMPESGIVGFM KDDGCNQCLTHSRHSRLVSGAVVMTITVVNPNIVIAPSSSSPSFLSSSHPVPSCSGLQLS SVALSAHIFPDEKEKVENFVFVQAALFACG >gi568815582r:8469776_8689838|GENSCAN_predicted_CDS_6|453_bp atgctgggccagttccagacaccacccttaactggcctgggagcctccattttcaatctc ttggaagttggcaaccatataaggatggaaattcctggagaccacccttctgtgagaagc ccaggccatgaggaaaggctggaggatgagatgccagagagtgggattgttgggttcatg aaagatgatggctgcaaccagtgtctgacacatagcaggcattcacgtctagtaagtggt gctgttgtgatgactatcactgttgtcaaccccaatattgtcattgcaccatcatcatcg tcaccttcattcctgagctcctctcatcctgttccctcttgttcaggcctacagctcagc agcgttgctctgtctgcacacattttcccagatgagaaagaaaaagtcgaaaattttgtt ttcgtacaagctgccctttttgcctgtggctaa >gi568815582r:8469776_8689838|GENSCAN_predicted_peptide_7|498_aa MVQLAPAAAMDEVTFRSDTVLSDVHLYTPNHRHLMVRLNSVGQPVFLSQFKLLWSQDSWT DSGAKGGSHRDVHTKEPPSAETGSTGSPPGSGHGNEGFSLQAGTDTTGQEVAEAQLDEDG DLDVVRRPRAASDSNPAGPLRDKVHPMILAQEEDDVLGEEAQGSPHDIIRIEHTMATPLE DVGKQVWRGALLLADYILFRQDLFRGCTALELGAGTGLASIIAATMARTVYCTDVGADLL SMCQRNIALNSHLAATGGGIVRVKELDWLKDDLCTDPKVPFSWSQEEISDLYDHTTILFA AEVFYDDDLTDAVFKTLSRLAHRLKNACTAILSVEKRLNFTLRHLDVTCEAYDHFRSCLH ALEQLADGKLRFVVEPVEASFPQLLVYERLQQLVTASSNPLLLLNQCMRRANGGAADIGS RQWIPASTREAFQTVLFLTWQCGGEIGPASGQVLEEEEEEEEEEEEEEEEGEEGEEGEEE EEEEEEEEEEEEEEEELC >gi568815582r:8469776_8689838|GENSCAN_predicted_CDS_7|1497_bp atggtacagctggctcctgcggcagccatggacgaggtcacctttaggagcgacactgtg ctgtcagatgtccacctctataccccgaaccatagacatctcatggtacggctgaacagc gtggggcagccagttttcctgtcccaattcaagcttctatggagccaagactcttggaca gattcaggagccaagggtggcagtcacagagatgttcacacaaaggagcctccttctgct gagacaggcagcacagggtcccctccaggaagtggccatggtaatgagggtttctccctc caggccgggactgacaccactggccaggaagtggctgaagctcagctggatgaggatggg gatttggacgtggtgagaagaccacgagccgcctctgattccaacccagcagggcctctg agagacaaggtacatcccatgattctagcacaggaagaagacgacgtcctgggagaggaa gcacaaggcagcccgcacgatatcatcagaatagagcacaccatggccacgcccctggag gatgttggcaagcaggtgtggcggggcgccctgctcctggcagactacatcctgttccga caggacctcttccgaggatgtacagcgctggagctcggggccggcacggggctcgctagc atcatcgcagccaccatggcacggaccgtttattgtacagatgtcggtgcagatctcttg tccatgtgccagcgaaacattgccctcaacagccacctggctgccactggaggtggtata gttagggtcaaagaactggactggctgaaggacgacctctgcacagatcccaaggtcccc ttcagttggtcacaagaggaaatttctgacttgtacgatcacaccaccatcctgtttgca gccgaagtgttttacgacgacgacttgactgatgctgtgtttaaaacgctctcccgactc gcccacagattgaaaaatgcctgcacagccatactgtcggtggagaagaggctcaacttc acattgagacacttggacgtcacatgtgaagcctacgatcacttccgctcctgcctgcac gcgctggagcagctcgcagatggcaagctgcgcttcgtggtggagcccgtggaggcctcc ttcccacagctcctggtttacgagcgcctccagcaactggtgactgccagttcaaaccca cttctgctgctcaaccaatgcatgcgccgggcaaacggtggtgctgctgatattggaagc aggcagtggatcccagcttcaaccagggaggccttccagactgtgctgtttcttacatgg cagtgtggtggggagattgggccggcaagtgggcaagtcttggaggaggaggaggaggag gaggaggaggaggaggaggaggaggaggagggggaggagggggaggagggggaggaggaa gaggaggaggaggaggaagaggaggaggaggaggaggaggaggaggagctgtgctga >gi568815582r:8469776_8689838|GENSCAN_predicted_peptide_8|84_aa MRAPSSGSAGCLLDFPQRLAGLPGSAERPGQSAPPAPALAFATANGLSRLSGNQRTPSGC LTDITKSNFEKTQTISFMLFLGEL >gi568815582r:8469776_8689838|GENSCAN_predicted_CDS_8|255_bp atgcgcgccccgagttccggcagcgctgggtgtctgctcgacttcccccagcgactcgcc ggactccctggctccgccgagcgtcccggccaatcagcgccgccagccccggccctcgct ttcgccacagccaatgggttgagcagactctcaggcaaccaaagaacaccctcagggtgc ttgacagacattaccaagtccaactttgagaagacccaaaccatcagcttcatgctcttt cttggtgagctctga