GENSCAN 1.0	Date run: 3-Nov-116	Time: 18:04:56

Sequence gi568815596r:127842009_128127163 : 285155 bp : 44.14% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -   1565   1560    6                               1.05
 1.03 Term -   6177   6099   79  2  1   80   33   146 0.628   5.54
 1.02 Intr -  11097  10917  181  1  1   29   95   225 0.789  16.23
 1.01 Init -  16092  16020   73  0  1  107   53    35 0.758   3.18
 1.00 Prom -  20818  20779   40                              -6.86

 2.08 PlyA -  21229  21224    6                               1.05
 2.07 Term -  23197  23086  112  0  1  117   47    81 0.973   4.93
 2.06 Intr -  24988  24892   97  2  1  131  115    69 0.999  12.97
 2.05 Intr -  25271  25137  135  0  0   66   98    26 0.739   1.94
 2.04 Intr -  27536  27446   91  2  1   86  113    99 0.946  11.77
 2.03 Intr -  28920  28806  115  2  1  103   78    97 0.999  10.65
 2.02 Intr -  29351  29241  111  1  0   51   95    97 0.986   6.19
 2.01 Init -  32226  31820  407  0  2   61   75   371 0.999  29.16
 2.00 Prom -  57065  57026   40                              -1.86

 3.03 PlyA -  57562  57557    6                               1.05
 3.02 Term -  64204  64136   69  1  0   95   42    38 0.269  -2.16
 3.01 Init -  74033  73830  204  2  0   41   12   202 0.511   6.75
 3.00 Prom -  91376  91337   40                              -3.56

 4.20 PlyA -  92158  92153    6                               1.05
 4.19 Term - 100156  99998  159  1  0  118   41   118 0.993   8.04
 4.18 Intr - 100529 100416  114  2  0  112   36    38 0.756   1.64
 4.17 Intr - 103551 103448  104  1  2   56   82    62 0.993   2.29
 4.16 Intr - 107986 107861  126  2  0   89   96    50 0.987   6.55
 4.15 Intr - 108400 108152  249  2  0   55   23   244 0.746  12.01
 4.14 Intr - 113336 112978  359  1  2   89  119   248 0.844  22.90
 4.13 Intr - 136081 135977  105  0  0  104   84    64 0.521   6.93
 4.12 Intr - 144954 144777  178  1  1   90   92   130 0.745  12.58
 4.11 Intr - 147858 147706  153  1  0   51   30   118 0.360   2.34
 4.10 Intr - 151300 151179  122  0  2   98   80    51 0.996   5.44
 4.09 Intr - 154483 154342  142  2  1   34   91   129 0.985   7.21
 4.08 Intr - 157837 157733  105  2  0   88  123    18 0.970   5.49
 4.07 Intr - 158138 158048   91  2  1   96  119    64 0.993   9.87
 4.06 Intr - 158446 158299  148  0  1   78   98    -8 0.239  -0.56
 4.05 Intr - 171146 171005  142  0  1  103    9    70 0.124   0.01
 4.04 Intr - 172906 172795  112  1  1  131   82    44 0.903   8.05
 4.03 Intr - 175907 175807  101  0  2   68   79    77 0.534   4.63
 4.02 Intr - 184290 184173  118  0  1   87   94    13 0.900   1.84
 4.01 Init - 185155 185084   72  1  0  100  103    71 0.861   8.92
 4.00 Prom - 187989 187950   40                              -4.56

 5.00 Prom + 197376 197415   40                              -5.26
 5.01 Init + 197854 197907   54  0  0   80  100    23 0.635   3.98
 5.02 Intr + 202420 202602  183  0  0    0   62   219 0.360  10.48
 5.03 Intr + 220611 220703   93  2  0   93   95    45 0.192   5.86
 5.04 Term + 221577 221831  255  2  0   54   52    71 0.104  -4.31
 5.05 PlyA + 224483 224488    6                               1.05

 6.00 Prom + 232134 232173   40                              -5.66
 6.01 Init + 249350 249407   58  1  1   81   80   134 0.728  11.37
 6.02 Intr + 255421 255556  136  2  1  112  110   -16 0.079   2.63
 6.03 Intr + 271076 271250  175  2  1   75   78    41 0.224   1.74
 6.04 Intr + 273116 273212   97  1  1   78   63   106 0.874   6.68
 6.05 Intr + 274257 274335   79  1  1  109   95    51 0.990   6.51
 6.06 Intr + 278348 278448  101  2  2   60   90    72 0.934   4.35
 6.07 Intr + 279191 279290  100  0  1   62   95    42 0.392   1.47

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:127842009_128127163|GENSCAN_predicted_peptide_1|110_aa MAAGGSDPRAGDVEEDASQLIFPKEFETAETLLNSEVHMLLEHRKQQNESAEDEQELSEV FMKTLNYTARFSRFKNRETIASVRSLEGRFEDEELQQILDDIQTKRSFQY >gi568815596r:127842009_128127163|GENSCAN_predicted_CDS_1|333_bp atggcggcgggtggcagcgatccgcgggctggcgacgtagaggaggacgcctcacagctc atctttcctaaagagtttgaaacagctgagacacttctaaattcagaagttcatatgctt ctggaacatcgaaagcagcagaatgagagtgcagaggacgaacaggagctctcagaagtc ttcatgaaaacattaaactacacagcccgtttcagtcgtttcaaaaacagagagaccatt gccagtgttcgtagcttggagggacggtttgaagatgaggagctgcagcagattcttgat gatatccagacaaagcgcagctttcagtattaa >gi568815596r:127842009_128127163|GENSCAN_predicted_peptide_2|355_aa MGKRRCVPPLEPKLAAGCCGVKKPKLSGSGTHSHGNQSTTVPGSSSGPLQNHQHVDSSSG RENVSDLTLGPGNSPITRMNPASGALSPLPRPNGTANTTKNLVVTAEMCCYCFDVLYCHL YGFPQPRLPRFTNDPYPLFVTWKTGRDKRLRGCIGTFSAMNLHSGLREYTLTSALKDSRF PPLTREELPKLFCSVSLLTNFEDASDYLDWEVGVHGIRIEFINEKGVKRTATYLPEVAKE QDNYTGVTSDRKDAGAGSLGTAIMACVGLSSHVSESPRDWQTDWAPDWDQIQTIDSLLRK GGFKAPITSEFRKTIKLTRYRSEKVTISYAEYIASRQHCFQNGTLHAPPLYNHYS >gi568815596r:127842009_128127163|GENSCAN_predicted_CDS_2|1068_bp atgggaaaaagacgttgtgttcctccactcgagcccaagttggcagcaggctgttgtggg gtcaagaagcccaaattatctggaagtggaacgcacagtcacgggaatcagtccacaact gtccccggctctagttcaggacctcttcaaaaccaccagcatgtggacagcagcagtgga cgggagaatgtgtcagacttaactctgggacctggaaactctcccatcacacgaatgaat cccgcatcgggagcgctgagccctcttccccggcctaatggaactgccaacaccactaag aatctggtggtgactgcagagatgtgctgctactgcttcgacgtactctactgtcacctc tatggcttcccacagccacgacttcctagattcaccaatgacccctatccgctctttgtg acgtggaagacagggcgggacaagcggcttcgtggctgcattgggaccttctcagccatg aatcttcattcaggactcagggaatacacgttaaccagtgcacttaaggacagccgattt ccccccctgacccgagaggagctgcctaaacttttctgctctgtctccctccttactaac tttgaggatgccagtgattacctggactgggaggtaggggtccatgggattcgaattgaa ttcattaatgaaaaaggtgtcaaacgcacagccacatatttacctgaggttgctaaggaa 
caagataattatactggggtaacatctgacaggaaggacgcaggagcagggagcctgggg acagccattatggcatgtgtcgggctatcatctcatgtctcagagtctccacgggactgg cagacagactgggcgccagactgggatcagatccagacaatagactccttgctcaggaaa ggtggctttaaagctccaattaccagtgaattcagaaaaacgatcaaactcaccaggtac cgaagtgagaaggtgacaatcagttacgcagagtatattgcttcccgacagcactgtttc cagaacggcactcttcatgccccgcccctctacaatcattactcctga >gi568815596r:127842009_128127163|GENSCAN_predicted_peptide_3|90_aa MGEGQNSNTDRILEVVDFGSSRTLMGDFERFKTSVEKVIADVVEIARELELEVEPENVTE LLQSHYKTDQPHPESSAFSIHSGRFKGAHE >gi568815596r:127842009_128127163|GENSCAN_predicted_CDS_3|273_bp atgggagaaggtcaaaatagcaacactgacaggattttggaagtagttgattttggaagt agtagaaccctcatgggtgacttcgagaggttcaagacttcagtggagaaagtaattgct gatgtggtggaaatagcaagagaactagaattagaagtggagccggaaaatgtgactgaa ttgctgcagtctcattataaaactgaccagccccatcctgaatcatctgccttcagcata cactcaggtcgtttcaaaggggcccacgaataa >gi568815596r:127842009_128127163|GENSCAN_predicted_peptide_4|899_aa MGPPRHPQAGEIEAGGAGGGRRLQVEMSSQQFPRLGAPSTGLSQAPSQIANSGSAGLINP AATVNDESGRDSEVSAREHMSSSSSLQSREEKQEPVVGHPSNLHHIMTTNVQMSIIRSNA PGPPLHIGASHLPRGAAAAAVMSSSKVTTVLRPTSQLPNAATAQPAVQHIIHQPIQARRS LGRPTLSIQHPPSAAISIQRPAQSRDVTTRITLPSHPALGTPKQQLHTMAQKTIFSTGTP VAAATVAPILATNTIPSATTAGSVSHTQAPTSTIVTMTVPSHSSHATAVTTSNIPVAKVV PQQITHTSPRIQPDYPAERSSLIPISGHRASPNPVAMETRSDNRPSVPVQFQYFLPTYPP SAYPLAAHTYTPITSSVSTIRQYPVSAQAPNSAITAQTGVGVASTVHLNPMQLMTVDASH ARHIQGIQPAPISTQAVVLADGATIVANPISNPFSAAPAATTVVQTHSQSASTNAPAQGS SPRPSILRKKPATDGMAVRKTLIPPQPPDVASPRVESSMRSTSGSPRPAGAKPKSEIHVS MATPVTVSMETVSNQNNDQPTIAVPPTAQQPPPTIPTMIAAASPPSQPAVALSTIPGAVP ITPPITTIAAAPPPSVTVGGSLSSVLGPPVPEIKVKEEVEPMDIMRPVSAVPPLATNTVS PSLALLANNLSMPTSDLPPGASPRKKPRKQQHVISTEEGDMMETNSTDDEKSTAKSLLVK AEKRKSPPKEYIDEEGVRYVPVRPRPPITLLRHYRNPWKAAYHHFQRYSDVRVKEEKKAM LQEIANQKGVSCRAQGWKVHLCAAQLLQLTNLEHDVYERLTNLQEGIIPKKKAATDDDLH RINELIQGNMQRCKLVMDQISEARDSMLKVLDHKDRVLKLLNKNGTVKKVSKLKRKEKV >gi568815596r:127842009_128127163|GENSCAN_predicted_CDS_4|2700_bp atgggccctccgcggcacccccaggccggcgagatagaagcgggcggtgcgggcggcggg 
cggcggctacaggtggaaatgagttctcaacagtttcctcggttaggagccccttctacc gggctgagccaggccccttctcagattgcaaacagtggttctgctggattgataaaccca gctgctacagtcaatgatgaatctggtcgagattctgaagtcagtgccagggagcacatg agttccagcagctccctccagtcccgggaggagaagcaagagcctgttgtgggccatccc agtaacctgcatcacatcatgactacaaatgtgcaaatgtctatcatccgcagcaatgct cctgggccccctcttcacattggagcttctcatttacctcgaggtgcagctgctgctgct gtgatgtccagttctaaagtaaccacagtcctgaggccgacctcacagctgccaaatgct gctactgctcagccagcagtacagcacatcattcaccaaccaatccaggcacgtcggagc ctgggtaggccaaccttgtctatccagcatcctccatctgcagcaatcagtattcagcgt cctgcccagtcacgagatgtcacaacaagaatcacactaccatctcaccctgcattaggg acgccaaaacagcagcttcatacaatggctcagaaaacaatcttcagtactggcacgcca gtggctgcagccacagtagcacctattttggcaaccaacaccattccttcagcgaccaca gctggatctgtgtcacacacgcaagctcccacaagtaccattgttaccatgacagtaccc tcccattcctcccatgctactgctgtgaccacctcaaacatcccagtcgccaaggtggtg ccccagcagatcacgcacacttctcctcggatccagccagactaccctgccgagaggagt agcctgattcccatctccggacatcgggcctctcccaatcctgtggccatggaaacccga agtgacaacagaccgtctgttcccgttcagttccaatattttttgccaacttacccccct tctgcatacccactggcggcacatacctacaccccaatcaccagttccgtgtccactatc cgacagtatccagtttcagctcaggctccaaactctgccatcacagctcagactggtgtt ggggtagcgtctaccgtccacctaaaccccatgcagttgatgacagtggatgcatcgcat gctcgacatattcaagggatccagccagcacccatcagtacccaggcagtggtgttggca gatggagccacaattgtggccaaccctattagcaatccattcagtgctgctccagcagca acaaccgtggtgcagacccacagccagagtgctagcaccaacgctcccgcccagggctca tcgccacggccaagcatactccggaagaaacctgccacagatggaatggcagttcggaaa accctcattcctcctcagcctcctgatgttgctagtcctcgagtggaaagctctatgcgg agtacgtctgggtcacctaggcctgcaggtgccaaacccaagtctgaaatccacgtgtct atggccactccggtcactgtgtccatggagactgtatccaatcaaaataatgatcagcct accattgccgtccctccaactgcccagcagcccccaccgaccattccaactatgattgca gcagccagtcccccgtcacaaccagccgttgccctttcaaccattcctggagcggtcccc atcactccacccatcaccaccattgcagctgcaccacctccatcagtcactgtgggtggc agtctttcctccgtcttgggccctcccgttcctgaaattaaagtgaaagaagaagtagaa ccaatggatatcatgaggccagtttctgcagttcctccactggctaccaacactgtgtct 
ccatctcttgcattgctggcaaacaacttgtccatgcctacaagtgacctaccacctggt gcctccccaaggaaaaagcctcgaaagcaacagcatgtgatctcaacagaagaaggtgac atgatggagacaaacagcactgatgatgagaagtccactgccaagagtcttctggtgaag gctgagaagcgcaagtctcctcccaaggagtatattgatgaggaaggtgtgagatatgtc ccagtgcgtccaagaccccccattactttgcttcgtcactatcggaacccctggaaagct gcttaccaccactttcagaggtacagtgacgtccgggtcaaagaggagaagaaagctatg ctgcaggaaatagctaatcagaaaggagtatcctgtcgtgctcaaggctggaaagtccac ctctgtgctgcccagttactacagctgacgaatctagaacatgatgtctatgaaagactt actaacctgcaggaagggattatcccaaagaaaaaagcagcaacagatgatgatctccac cgaataaacgaactgatacagggaaatatgcagaggtgtaaacttgtgatggatcaaatc agtgaagccagagactccatgcttaaggttttagatcataaagaccgtgtcctgaagctg cttaacaagaacgggactgtcaaaaaagtgtccaaattgaagcgaaaggaaaaagtctag >gi568815596r:127842009_128127163|GENSCAN_predicted_peptide_5|194_aa MGGDITPRNENMDLKKEPKFCKGDENVEDEEHSGRSSEVDNDQLRPLIEADSLITTRKVA EELNIEHSMVVRHLKQIGKCSSRVVVLLWHLAQAMIHSAMTSGLPCTSQKWHTLASRGIS KNEEEKEAPHGHRMGVQELKSSAVLEKGAHVLTVSPNALMSFRILWASTSYLIYPAWSVF SLWLLATFIYDCLG >gi568815596r:127842009_128127163|GENSCAN_predicted_CDS_5|585_bp atggggggagacataacacccaggaatgaaaatatggatttaaagaaggaaccaaagttt tgcaaaggagatgagaacgttgaagatgaggagcatagtggccggtcatcggaagttgac aatgaccagttaagaccactcattgaagctgattctctcataactactcgaaaagttgct gaagaactcaacatcgaacattctatggttgttaggcatttgaagcaaattggaaagtgc agctcaagggtagtggtcctgctctggcacctggcccaggcgatgatccacagtgccatg acctcaggactgccatgcaccagccagaagtggcacactctggcctctagagggatctcc aagaacgaggaagagaaggaggctccacacggtcacagaatgggggtccaggagcttaaa tcatcagctgtcctggaaaagggggctcatgtcttgaccgtatctccaaatgcactgatg tcctttcgtatcctttgggcaagcacgtcctacttgatttaccctgcctggtctgttttt tctctctggttactggcaacattcatctatgactgcttaggctga >gi568815596r:127842009_128127163|GENSCAN_predicted_peptide_6|249_aa MGCKGDASGACAAGALPVTGVCYKMGVLVVLTVLWLFSSVKADSKAITTSLTTKWFSTPL LLEARPKPLLFKGDHRYPSSNPESPVVIFYSEIGSEEFSNFHRQLISKSNAGKINYVFRH YIFNPRKEPVYLSGYGVELAIKSTEYKAKDDTQVKGTEVNTTVIGENDPIDEVQGFLFGK LRDLHPDLEGQLKELRKHLVESTNEMAPLKVWQLQDLSFQTAARILASPVELALVVMKDL SQNFPTKAS 
>gi568815596r:127842009_128127163|GENSCAN_predicted_CDS_6|747_bp atgggctgcaagggagacgcgagcggtgcgtgtgccgcgggtgcgctgccggtgacagga gtttgctataaaatgggagttctggttgtactcactgttctgtggctgttctcctcagta aaggccgactcaaaagccattacaacctctcttacaacaaaatggttttccactccattg ttgttagaagccagacccaaacctttattgttcaaaggagatcacagatatccctcgtct aatcctgaaagccctgtggtgattttctactctgagattggctctgaggaattttccaat tttcaccgccagcttatatcaaaaagcaatgcaggcaaaatcaattatgtattcagacat tatatatttaatcccaggaaggagcctgtttacctctctggctatggcgtggaattggcc attaagagcactgagtacaaggccaaggatgatactcaggtgaaaggaactgaggtaaac accacagtgattggtgaaaatgatcctattgatgaggttcaggggttcctctttggaaaa ttaagagatctgcaccccgacctggagggacagttgaaagaactcagaaagcatcttgta gagagcaccaatgaaatggcacctttaaaggtttggcagttgcaagatctcagtttccag actgctgctcgaatcttggcttctcctgttgagttggctttggttgtcatgaaggatctt agtcagaattttcctaccaaagccagn