GENSCAN 1.0	Date run: 4-Nov-116	Time: 02:56:11

Sequence gi568815586r:88405199_88645865 : 240667 bp : 36.88% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  22355  22529  175  2  1   25   44   182 0.051   6.52
 1.02 Intr +  48905  49044  140  1  2    4   77   178 0.779   6.64
 1.03 Term +  51369  51513  145  0  1   84   54    56 0.437  -1.80
 1.04 PlyA +  52002  52007    6                               1.05

 2.13 PlyA -  52704  52699    6                               1.05
 2.12 Term -  61085  60762  324  2  0   37   54   342 0.472  19.48
 2.11 Intr -  75744  75577  168  0  0  100   20   108 0.039   4.42
 2.10 Intr -  84325  84227   99  1  0   48   93    70 0.218   2.89
 2.09 Intr -  85111  85027   85  0  1  102   16    80 0.121   1.10
 2.08 Intr - 101180 100996  185  2  2  101   81    35 0.247   1.76
 2.07 Intr - 101939 101830  110  0  2   66  115    44 0.626   3.98
 2.06 Intr - 110419 110336   84  2  0   82   92    58 0.947   4.47
 2.05 Intr - 111292 111136  157  1  1   97  116    55 0.957   7.86
 2.04 Intr - 113669 113499  171  2  0   89   88    90 0.113   8.32
 2.03 Intr - 127305 127243   63  0  0   86   76    57 0.171   2.30
 2.02 Intr - 140667 140554  114  0  0   70  111    66 0.443   6.82
 2.01 Init - 142173 142168    6  0  0   78   79     4 0.454  -0.71
 2.00 Prom - 142413 142374   40                              -5.65

 3.00 Prom + 147681 147720   40                              -6.65
 3.01 Init + 151697 151862  166  1  1   67   70   111 0.513   7.05
 3.02 Intr + 163952 164068  117  0  0  101   65    47 0.193   3.22
 3.03 Intr + 175239 175412  174  1  0  114   51    31 0.002   0.99
 3.04 Intr + 199852 200149  298  2  1   12   83   270 0.199  13.91
 3.05 Term + 201810 202320  511  0  1   31   48   186 0.466   1.76
 3.06 PlyA + 202401 202406    6                               1.05

 4.04 PlyA - 204920 204915    6                               1.05
 4.03 Term - 210862 210777   86  2  2  119   43    89 0.740   4.34
 4.02 Intr - 215001 214877  125  2  2  104   25   167 0.354  11.31
 4.01 Intr - 240038 239930  109  0  1   53  106    97 0.746   6.42

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  75744  75496  249  0  0  100   47   162 0.882   8.02
S.002 Init - 113645 113499  147  2  0   69   88   121 0.886  10.34

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:88405199_88645865|GENSCAN_predicted_peptide_1|153_aa
XTCKKEILQICGGTTQQKGAGEQLLELAKAQEKLVFPTASVENVAIHSTSGNQKCIALMA
LCKLEIHDAGSVAQSKSECPGTSNAVTQFEAKDPRTRVWRSTEVQRSKIKVLAGLCFSLE
TLGQNPLLLIRSVSRIQSCSSMTEVMVFFLAVS

>gi568815586r:88405199_88645865|GENSCAN_predicted_CDS_1|462_bp
nncacatgtaagaaggaaattcttcaaatttgtggaggcaccacccaacaaaaaggagca
ggggaacaactcttggaacttgcaaaggcccaggaaaagcttgtcttcccaacagccagt
gtggaaaatgtcgcaattcacagcacatcaggaaatcagaagtgtattgccttaatggcc
ctctgcaagctggagatccacgatgctggtagcgtggctcagtccaagtctgaatgccct
ggaacaagcaatgctgtaactcagtttgaggccaaagacccaagaacccgagtatggaga
agcactgaagtccaaaggtccaaaatcaaggtgttggcagggctgtgtttctctcttgag
accctaggacagaatcctctcttacttattcggagtgttagcagaattcagtcttgcagt
tctatgactgaagtaatggttttcttcctggctgtcagctga

>gi568815586r:88405199_88645865|GENSCAN_predicted_peptide_2|521_aa
MVTWILTCIYLQLLLFNPLVKTEGICRNRVTNNVKDVTKLVANLPKDYMITLKYVPGMDV
LPSHCWISEMVVQLSDSLTDLLDKFSNISEGLSNYSIIDKLVNIVDDLVECVKENSSKDL
KKSFKSPEPRLFTPEEFFRIFNRSIDAFKDFVVASETSDCVVSSTLSPEKDSRVSVTKPF
MLPPVAASSLRNDSSSSNRKAKNPPGDSSLHWAAMALPALFSLIIGFAFGALYWKKRQPS
LTRAVENIQINEEDNEIRYFVLLNVCPIKHDIAISHTVYLPIMSLRSPSLMTVAPNQKLA
FNTILTDRSHGLPTHLAVAINCNPFRLLFISGCSRVGMAFSNTPTDHGNSSPGHHDPKTE
EPLPMTTTTKGPREVLPGYCQCSFKSQGLLSQLVVNAARPETHLSGSWGPLWPRERSSSP
ATEQSWTKNDFDGLREEGFRQSNYSELQEEIQTKGNEVKNFEKNLDEYVTRITNTEKCLR
ELMELKAKARELHEECRSLRSRCDQLEERVSVMEDETNEMK

>gi568815586r:88405199_88645865|GENSCAN_predicted_CDS_2|1566_bp
atggtgacttggattctcacttgcatttatcttcagctgctcctatttaatcctctcgtc
aaaactgaagggatctgcaggaatcgtgtgactaataatgtaaaagacgtcactaaattg
gtggcaaatcttccaaaagactacatgataaccctcaaatatgtccccgggatggatgtt
ttgccaagtcattgttggataagcgagatggtagtacaattgtcagacagcttgactgat
cttctggacaagttttcaaatatttctgaaggcttgagtaattattccatcatagacaaa
cttgtgaatatagtggatgaccttgtggagtgcgtgaaagaaaactcatctaaggatcta
aaaaaatcattcaagagcccagaacccaggctctttactcctgaagaattctttagaatt
tttaatagatccattgatgccttcaaggactttgtagtggcatctgaaactagtgattgt
gtggtttcttcaacattaagtcctgagaaagattccagagtcagtgtcacaaaaccattt
atgttaccccctgttgcagccagctcccttaggaatgacagcagtagcagtaataggaag
gccaaaaatccccctggagactccagcctacactgggcagccatggcattgccagcattg
ttttctcttataattggctttgcttttggagccttatactggaagaagagacagccaagt
cttacaagggcagttgaaaatatacaaattaatgaagaggataatgagataaggtatttt
gttttgctaaatgtgtgcccaatcaagcatgacattgccatttcacacactgtgtacctg
cccataatgtctttaagaagtccttcactcatgacagtagctcctaaccagaaacttgcc
ttcaacactattctaaccgatcggtctcatggtcttcctacacaccttgctgttgccatt
aactgcaacccattccgacttctatttatcagtggctgttctcgtgtgggaatggctttc
agcaacactcctactgatcatggcaacagctcaccagggcatcatgacccaaagacagag
gagcctcttcccatgaccaccaccaccaaaggcccaagggaagtactgccaggctactgt
cagtgttcatttaagtcccaagggctcctcagtcagcttgtggtgaatgctgccaggcct
gagactcacctttcagggtcatggggtcccctatggcccagagaacgcagttcctcacca
gcaacggaacaaagctggacgaagaatgactttgacgggttgagagaagaaggcttcaga
caatcaaattactctgagctacaggaggaaattcaaaccaaaggcaacgaagttaaaaac
tttgaaaaaaatttagacgaatatgtaactagaataaccaacacagagaagtgcttaagg
gagctgatggagctgaaagccaaggctcgagaactgcatgaagaatgtagaagcctcagg
agccgatgcgatcaactggaagaaagggtatcagtgatggaagatgaaacgaatgaaatg
aagtga

>gi568815586r:88405199_88645865|GENSCAN_predicted_peptide_3|421_aa
MMFTKEYLMELTLMRAVAIELMELKVNHSGWTTGVNEYEGISRVYYYSLPKKKEESKAKS
LGMRALNPNLFHKPATTYWCRDLGHMIERSNFALALLVSARGGERSPAAEPPALLYSSRP
PAPDTPPSLPEPGVGEKQARARREESGGRPEHGVDRHLVQESSSWYLVDAAVGRSFQRKK
QAAIITDLQPLLVIPRQTGSGVALQQNPADQQQRGLTVRGKTNKQKGIASTSTTRTSTQN
PHLNVTNGKEQRTLHPKSTEYTFFSAPHNTYSKTDHIIGCKTLLSKRKRTEIITNSLSDQ
SVIKLEIRIKNLPQNCPTTWKLNNLLLNDDWVNNEIKAEINKFIENNENKDTACWNLWDT
PEAMFRGKFVALNAHRGKQETSKIDTLTSQLKELEEQEQTNSKTRRQEIIKIRAELKEIE
T

>gi568815586r:88405199_88645865|GENSCAN_predicted_CDS_3|1266_bp
atgatgtttactaaagagtacttaatggagctgaccttaatgagggcagttgccattgag
ttgatggaactaaaagtcaatcacagtggatggactacaggagtgaatgagtatgaggga
atttcaagggtatattactacagtttgccaaaaaagaaagaagagagtaaagcaaagagc
ttgggaatgcgagctctgaatccgaatctgtttcataagccagccaccacatattggtgc
cgcgatcttgggcacatgatagaaagaagtaactttgctctagcgcttctagtctcggcg
cgaggcggcgagcgaagcccggctgctgagccgccggcgcttttatacagttcccgcccg
cctgctccggacacgcctccttcccttccggagcccggggtaggcgagaagcaggcaagg
gcgcggagggaagaatcgggagggcggccagagcacggggttgacagacacctcgtacag
gagagctccagctggtacttggtggatgctgctgtgggacgaagtttccagaggaagaaa
caggcagcaattattactgatctgcagcctctgctggtgatacccaggcaaacagggtct
ggagtggcccttcagcaaaatccagcagaccagcagcagaggggcctgactgttagaggg
aaaactaataaacagaaaggaatagcatcaacatcaacaacaaggacgtccacacaaaat
ccccatctgaacgtcaccaatggcaaagaacaaagaaccctccaccccaaatcaacagaa
tatacattcttctcagcaccacataacacttattctaaaactgaccacataattggatgt
aaaacactcctcagcaaacgtaaaagaacagaaatcataacaaacagtctctcagaccaa
agtgtaatcaaattagaaatcaggattaagaacctccctcaaaactgcccaactacatgg
aaactgaacaacctgctcctgaatgacgactgggtaaataacgaaattaaggcagaaata
aataagttcattgaaaacaatgagaacaaggacacagcatgctggaatctctgggacaca
cctgaagccatgtttagagggaaatttgtagcactaaatgcccacaggggaaagcaggaa
acatctaaaatcgataccctgacatcacaattaaaagaactagaggagcaggagcaaaca
aattcaaaaactagaagacaagaaataattaagatcagagcagaattgaaggagatagag
acatga

>gi568815586r:88405199_88645865|GENSCAN_predicted_peptide_4|106_aa
XSPWVTLGSGIVKKKSTPIDKTNRKVGNETCDSEQCGESCLGEWDLGICTATEFPVIRAT
LGKWKGHLAAEEGIDFAPGAGLGMKLQIKEKKQAPDNDPETQHITQ

>gi568815586r:88405199_88645865|GENSCAN_predicted_CDS_4|321_bp
ngcagtccctgggttacactgggtagtggaattgtaaagaagaaatccactcctatagac
aaaaccaatcggaaggtaggaaatgaaacctgtgactcagagcaatgtggagaatcctgt
ttgggagagtgggacctgggaatctgtactgctactgagttcccagtgattcgagcaaca
ctggggaagtggaaagggcatctggctgctgaagagggcatcgattttgctcccggtgca
ggtttgggaatgaagcttcagatcaaagagaagaagcaagcccctgacaatgatccagaa
acccagcatatcactcagtag