GENSCAN 1.0 Date run: 4-Nov-116 Time: 03:08:47 Sequence gi568815589f:111561635_111766937 : 205303 bp : 40.16% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.13 PlyA - 966 961 6 1.05 1.12 Term - 1597 1487 111 1 0 66 48 146 0.991 5.98 1.11 Intr - 8575 8457 119 2 2 118 94 165 0.999 19.36 1.10 Intr - 13208 13100 109 2 1 81 95 67 0.989 5.64 1.09 Intr - 17317 17162 156 1 0 100 101 183 0.999 20.09 1.08 Intr - 21955 21838 118 0 1 78 115 149 0.999 16.15 1.07 Intr - 24531 24364 168 2 0 93 96 98 0.969 9.24 1.06 Intr - 31348 31292 57 0 0 108 30 76 0.516 0.88 1.05 Intr - 32675 32588 88 0 1 77 52 67 0.781 0.11 1.04 Intr - 35798 35683 116 1 2 94 107 71 0.764 8.77 1.03 Intr - 38874 38846 29 0 2 72 115 13 0.074 -1.60 1.02 Intr - 45338 45260 79 1 1 106 99 6 0.859 2.13 1.01 Init - 47831 47380 452 2 2 41 78 384 0.582 28.13 1.00 Prom - 61439 61400 40 -7.25 2.00 Prom + 63573 63612 40 -6.75 2.01 Init + 69774 70109 336 2 0 65 100 463 0.015 40.52 2.02 Intr + 71160 71277 118 2 1 81 65 -15 0.002 -5.38 2.03 Intr + 81205 81360 156 2 0 57 14 149 0.113 3.46 2.04 Intr + 85411 85625 215 2 2 55 91 223 0.686 16.81 2.05 Intr + 87819 88011 193 2 1 75 32 231 0.557 14.34 2.06 Intr + 88111 88289 179 2 2 23 105 143 0.543 8.22 2.07 Term + 91466 91588 123 1 0 68 49 113 0.901 2.80 2.08 PlyA + 92692 92697 6 1.05 3.00 Prom + 94899 94938 40 -4.25 3.01 Init + 97379 97424 46 1 1 54 49 49 0.529 -1.50 3.02 Intr + 99719 99875 157 0 1 88 51 165 0.951 10.95 3.03 Intr + 100512 100830 319 0 1 66 2 206 0.195 4.84 3.04 Term + 105181 105306 126 0 0 102 49 143 0.763 9.10 3.05 PlyA + 108571 108576 6 1.05 4.15 PlyA - 109071 109066 6 1.05 4.14 Term - 125236 125136 101 2 2 114 49 17 0.368 -2.29 4.13 Intr - 130799 129917 883 2 1 78 92 240 0.520 13.36 4.12 Intr - 132728 132597 132 2 0 14 61 118 0.188 1.62 4.11 Intr - 135869 135707 163 1 1 72 59 61 0.082 0.66 4.10 Intr - 143730 143613 118 1 1 44 108 84 0.704 4.60 4.09 Intr - 160951 160775 177 2 0 91 113 58 0.913 7.57 4.08 Intr - 162277 162158 120 2 0 15 91 110 0.912 3.55 4.07 Intr - 166415 165999 417 0 0 86 115 308 0.991 26.37 4.06 Intr - 169493 169192 302 1 2 -10 13 288 0.602 6.95 4.05 Intr - 175404 175311 94 1 1 46 87 25 0.120 -3.70 4.04 Intr - 176888 176646 243 0 0 85 101 63 0.252 3.85 4.03 Intr - 186565 186458 108 2 0 101 75 104 0.986 9.74 4.02 Intr - 194844 194691 154 0 1 65 78 111 0.213 6.52 4.01 Intr - 197214 197183 32 1 2 109 78 21 0.103 0.13 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 69774 70313 540 2 0 65 38 557 0.818 42.23 S.002 Term + 202019 202162 144 1 0 62 42 183 0.901 8.03 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:111561635_111766937|GENSCAN_predicted_peptide_1|533_aa MLLCQKAPSLKTTYNHPPAADSAGTALNLETTVKQTRETQLEYNNVGTDLSPEPKSFNYP LLSSSGDQFEIQLNQQLWSLIPNNDVRRLVSHVIRTLKTDCTETHLQLACAKLISRTGLL MKLLSEQQELRTVSMTAWKPRMNRKSRSRMRQSHFASHAGRWWHNHSTLQPQSPKLQMAE LSEARRRSFRMVRTKTWTLKKHFVGYPTNSDFELKTAELPPLKNGGLEFLIAYGMLYFVE VLLEALFLTVDPYMRVAAKRLKEGDTMMGQQVAKVVESKNVALPKGTIVLASPGWTTHSI SDGKDLEKLLTEWPDTIPLSLALGTVGMPGLTAYFGLLEICGVKGGETVMVNAAAGAVGS VVGQIAKLKGCKVVGAVGSDEKVAYLQKLGFDVVFNYKTVESLEETLKKASPDGYDCYFD NVGGEFSNTVIGQMKKFGRIAICGAISTYNRTGPLPPGPPPEIVIYQELRMEAFVVYRWQ GDARQKALKDLLKWVLEGKIQYKEYIIEGFENMPAAFMGMLKGDNLGKTIVKA >gi568815589f:111561635_111766937|GENSCAN_predicted_CDS_1|1602_bp atgctactatgccagaaggcaccatctctgaaaacaacctacaatcatcctcctgcggca gattccgctgggactgcattaaacttagagacgactgttaaacaaaccagggaaacacag ttggaatacaacaacgtgggcactgacctgtcccccgaacccaaaagcttcaattaccca ttgctctcatcctcaggtgaccagtttgaaattcagctaaaccagcagctatggtccctc atccccaacaacgatgtgagaaggcttgtttctcatgttatccggaccttgaagacggac tgcactgagacccatttgcaactggcctgtgccaagctcatctctaggacaggcctccta atgaagcttctcagtgagcagcaagaattgagaactgtatcaatgacagcatggaagccc agaatgaacagaaagagcagaagtcgaatgagacagtctcactttgccagccatgctgga aggtggtggcacaatcatagcacactgcagccacaatctcccaagcttcagatggctgaa ctgagtgaggcaaggagaaggagcttcaggatggttcgtactaagacatggaccctgaag aagcactttgttggctatcctactaatagtgactttgagttgaagacagctgagctccca cccttaaaaaatggaggccttgagtttctaattgcctatggaatgctttattttgtagag gtcctgcttgaagctttgttcctcaccgtggatccctacatgagagtggcagccaaaaga ttgaaggaaggtgatacaatgatggggcagcaagtggccaaagttgtggaaagtaaaaat gtagccctaccaaaaggaactattgtactggcttctccaggctggacaacgcactccatt tctgatgggaaagatctggaaaagctgctgacagagtggccagacacaataccactgtct ttggctctggggacagttggcatgccaggcctgactgcctactttggcctacttgaaatc tgtggtgtgaagggtggagaaacagtgatggttaatgcagcagctggagctgtgggctca gtcgtggggcagattgcaaagctcaagggctgcaaagttgttggagcagtagggtctgat gaaaaggttgcctaccttcaaaagcttggatttgatgtcgtctttaactacaagacggta gagtctttggaagaaaccttgaagaaagcgtctcctgatggttatgattgttattttgat aatgtaggtggagagttttcaaacactgttatcggccagatgaagaaatttggaaggatt gccatatgtggagccatctctacatataacagaaccggcccacttcccccaggcccaccc ccagagattgttatctatcaggagcttcgcatggaagcttttgtcgtctaccgctggcaa ggagatgcccgccaaaaagctctgaaggacttgctgaaatgggtcttagagggtaaaatc cagtacaaggaatatatcattgaaggatttgaaaacatgccagctgcatttatgggaatg ctgaaaggagataatttggggaagacaatagtgaaagcatga >gi568815589f:111561635_111766937|GENSCAN_predicted_peptide_2|439_aa MGAPLLSPGWGAGAAGRRWWMLLAPLLPALLLVRPAGALVEGLYCGTRDCYEVLGVSRSA GKAEIARAYRQLARRYHPDRYRPQPGDEGPGRTPQSAEEAFLLVATAYETLKIVFPELKI VKELTTLLMPSRAVWIIWSHKMSGGGAGPELDIGYVPVLLHMPKRIGSGMDTSRSTTVIP VEIRNKELIALLVKAQDRQEEETNYKLRPFNGLSYEVGYLVILQDEETRKDYDYMLDHPE EYYSHYYHYYSRRLAPKVDVRVVILVSVCAISVFQFFSWWNSYNKAISYLATVPKYRIQA TEIAKQQGLLKKAKEKGKNKKSKEEIRDEEENIIKNIIKIWYCRWIYNFNIKGKEYGEEE RLYIIRKSMKMSKSQFDSLEDHQKETFLKRELWIKENYEVYKQEQEEELKKKLANDPRWK RYRRWMKNEGPGRLTFVDD >gi568815589f:111561635_111766937|GENSCAN_predicted_CDS_2|1320_bp atgggggcgccgctgctctctcccggctggggagccggggctgccggccggcgctggtgg atgctgctggcgcccctgctgccggcgctgctgctggtgcggcccgcgggggccctggtg gaggggctctactgcggcacgcgggactgctacgaggtgctgggcgtgagccgctcggcg ggcaaggcggagatcgcgcgggcctaccgccagctggcccggcgctaccaccctgaccgc taccggccccagcccggagacgagggccccgggcggacgccgcagagcgccgaggaggct ttcctgctggtggcaaccgcctacgagacactcaagattgtttttccagagctgaaaata gtcaaggagctaacaactttattgatgccttccagagcagtttggataatttggagtcat aaaatgagtggtggaggggcgggccctgaactagatattggttatgttccagtcctcctg catatgcccaagaggattgggagtggaatggacaccagcagaagcaccacagttattcct gtagagattagaaacaaggagcttattgccctgcttgtcaaggctcaggacagacaggag gaagaaacaaattataaactaaggccatttaatggactgtcctatgaagttggatatctt gtcattttacaggatgaagaaacacgaaaagattatgattacatgctggatcatccagaa gagtactacagccattactaccactactatagcaggcgcttggcccctaaggtggatgtt agagtagtgattttggtcagcgtgtgtgctatttcggtgtttcagtttttcagctggtgg aatagctacaataaggcaatcagctacctagccacagtgcccaagtaccgtatccaagct acagagattgccaagcagcagggactgctcaaaaaagccaaagaaaaaggcaaaaacaaa aagtccaaagaagaaattcgtgacgaggaggagaacatcataaagaacattataaaaatt tggtattgtcggtggatctataattttaacatcaaaggcaaagaatatggagaggaagag agattatacattatacgtaaatctatgaagatgtcaaagtctcaatttgatagtctagaa gatcatcagaaagaaacttttcttaaacgagagctctggatcaaggagaattatgaggtc tacaagcaagaacaagaggaagaattaaagaaaaagttggcaaatgaccccagatggaag agatacaggagatggatgaagaatgaagggcctgggcggttaacatttgtggatgactga >gi568815589f:111561635_111766937|GENSCAN_predicted_peptide_3|215_aa MDEKSCGESVEHHTDGAPASLEDAALCGSPPRFQAQLQVPPRFQALRSGEHCQLAGVPAV SGKRLRAGVDGAATRRGREEGEQVWGNDEFGCGPGDSGESSGRYLSSWPEAQRPGLKEVI QVHWGEKASSMKSRAQSLGPGDQPHRWPVAERRGNKRQQGFRARPREEEIRDSKVSQAAA ELQQYCMQNACKDALLVGVPAGSNPFREPRSCALL >gi568815589f:111561635_111766937|GENSCAN_predicted_CDS_3|648_bp atggatgagaagtcttgtggggaaagtgtggagcatcacacagatggggcgccagcttcg ctggaggacgccgcgctctgcgggtccccgcctcggttccaggcccagctgcaggtgcca cccaggtttcaggcgctgaggagcggagagcactgccagctcgctggggtccccgctgtc tctggcaagcggctccgcgccggggtggacggcgctgccacccgccgaggtagagaggag ggggaacaggtttggggaaatgatgagttcggctgcggacctggggactccggggagtca tccgggaggtatttgtcctcatggcctgaagctcagcgcccagggctgaaagaggtcatt caagtgcattggggcgaaaaagcctcttctatgaagagcagagctcagagcctgggaccc ggggaccagccacaccgatggccggtggccgagagaagaggaaataagagacagcaaggt ttccgagcaaggcctagagaagaggaaataagagacagcaaggtctctcaggcagctgca gagcttcaacagtactgtatgcagaatgcctgcaaggatgccctgctggtgggtgttcca gctggaagtaaccccttccgggagcctagatcctgtgctttactctga >gi568815589f:111561635_111766937|GENSCAN_predicted_peptide_4|1014_aa XLFIDDKGILFPSSFLIEYEFLIPPSLKPEIDIPSLSELKELLNPVPEIINYVDEKEKLF ERDLTNKHGIEDIGDIKFSSTEILTIQSQSEPEECSKPVPRIQEPHSQYSVTDLKKIFSV KEESLVINLEKAEWWKQAGLNLKMMETLEHLNTYLCHDNLSSNDTKIEIFLPTKVLQLEL AILNDMKWYLIVALICISLMIGDVEHFFHTFQSPVQCKALTPFDSMKAEKGKELQKFEAN RDWFMKFKEKSHLRNIKVQGEATSANVEAAASYPEDLANIIDEDGYTKQQIFNVDETAFY LKKVLSRTFIATCLEHKSHSSPIALIDEKSTNAHLSLPQKSPSLAKEVPDLCFSDDYFSD KGAAKEEKPKNDQEPVNRIIQKKENNDHFELDCTGPSIKSPSSSIIKKASFEHGKKQEND LDLLSDFIMLRNKYKTCTSKTEVTNSDEKHDKEACSLTLQEESPIVHINKTLEEINQERG TDSVIEIQASDSQCQAFCLLEAAASPILKNLVSLCTLPTANWKFATVIFDQTRFLLKEQE KVVSDAVRQGSTLLDRFGGFLLEIQIPYVFFASEGLLNTPDILQLLESKDMDEAGNHHSE QTVTRTENQTLHVLTHRWELNNENTWTQDGEHHTPGPVVGWRELIIAPGVEATALIIRQI ADHSLMTSKRDPHEWLDKSWLKVSPSEENRNQISTLSSQSSASDLDSVIQEHNEYYQYLG LGETVQEDKTTILNDNSSIMELKEISSFLPPVTSYNQTSYWKDSSCKSNIGQNTPFLINI ESRRPAYNSFLNHSDSESDVFSLGLTQMNCETIKSPTDTQKRVSVVPRFINSQKRRTHEA KGFINKDVSDPIFSLEGTQSPLHWNFKKNIWEQENHPFNLQYGAQQTACNKLYSQKGNLF TDQQKCLSDESEGLTCESSKDETFWRELPSVPSLDLFRASDSNANQKEFNSLYFYQRAGK SLGQKRHHESSFNSGDKESLTGFMCSQLPQFKKRRLAYEKVPGRVDGQTRLRFF >gi568815589f:111561635_111766937|GENSCAN_predicted_CDS_4|3045_bp natttatttattgatgataaaggaatactttttccgtcaagttttcttattgagtatgaa ttcttaatacctccaagcctcaaaccagaaattgatattccatcactctcagaactgaag gagttattaaacccagtgccagaaataataaactatgtagatgaaaaggaaaagcttttt gaaagagatcttactaacaagcatggaattgaggatatcggggatataaaattcagctcc acagagattttgaccattcaaagccagagtgaaccagaagagtgcagtaaaccagtgcca agaattcaagagccccacagccaatattcagttacagatttgaaaaagatattttctgtt aaagaagaaagccttgtgattaatctggaaaaggcagagtggtggaaacaagcaggacta aatctgaaaatgatggaaacattggaacatctgaatacatatttatgtcatgataatttg tcttctaatgacactaaaattgagatatttttgcctacgaaagtgcttcaattagaatta gccattctgaatgatatgaaatggtacctcattgtagctttgatttgcatttctctgatg attggtgatgttgagcacttttttcatacattccaaagcccagtacaatgcaaggctcta actccttttgattctatgaaggctgagaaaggtaaggagctacagaagtttgaagctaac agagattggttcatgaagtttaaggaaaaaagtcatctccgtaacataaaagtacaaggt gaagcaacaagtgcaaatgtagaagctgcagcaagttatcctgaagatctggccaatatt attgatgaagatggctacactaaacaacagatattcaatgtagatgaaacagcgttctat ttgaagaaagtgttatctaggacttttatagccacatgtctagaacataaaagtcattct tcacctattgcacttattgatgaaaaatctacaaatgctcatttatcacttccacaaaag agtccatctctggcaaaagaagtaccagatctatgtttttctgatgactatttctctgat aaaggagcagcaaaagaagaaaaaccaaagaatgaccaagaaccagtaaacagaataatc caaaagaaagaaaataacgatcactttgaacttgactgcacaggaccatctattaaatca ccttcctcttcaataattaaaaaagcatcttttgaacatggcaaaaaacaagagaatgat ttggaccttttgagcgactttattatgctgcgaaataaatataagacttgcacctcaaag actgaagtcacaaacagtgatgaaaaacatgataaagaagcatgttctttgacacttcaa gaagaaagtcctattgttcatattaataaaaccctggaggaaataaatcaggaaagggga acagatagtgtcattgaaattcaagcgtcagatagccagtgccaagcattttgcctcctc gaagcagcagcttctcctatcttaaaaaaccttgtatccttgtgtaccctccctactgct aattggaaatttgccactgttatttttgaccaaacaaggtttctcttaaaggaacaagaa aaagtagtaagtgatgctgttcgccaaggaagcaccttgctggatagatttggaggtttt cttttggaaattcagattccatatgtgttttttgcatctgaaggacttcttaatactcca gacatacttcagctgctagaatccaaggacatggatgaagctggaaaccatcattctgag caaactgtcacaaggacagaaaaccaaacactgcatgttctcactcataggtgggaattg aacaatgagaacacttggacacaggatggggaacatcacacaccggggcctgtcgtgggg tggcgggagcttataattgccccaggagtagaagcaactgccttgataattcgacaaatt gctgaccacagtttaatgacctcaaagagagatcctcatgaatggttggataaatcctgg cttaaagtttcaccatctgaggaaaataggaatcagattagtaccttgtcttctcaaagt tcagcttctgatttagactctgtcattcaagaacataatgaatattatcagtatttagga ttaggagagacagtgcaggaagacaaaaccaccattttgaatgacaactcttccattatg gaactaaaagaaatctcaagttttttaccacctgtgacttcatacaatcagaccagctac tggaaagactccagctgtaaatctaatatagggcagaatactccttttctaattaatata gaatcaaggagaccggcttataactcctttctaaaccacagtgattcagagtcagatgtc ttttctttgggtctaacacaaatgaactgtgaaactataaaatcaccaactgacactcag aagagagtgtcagttgtcccccgttttataaattctcagaaaaggagaacacatgaagca aaaggtttcataaataaagatgtatcggaccctatcttttcactagagggcactcaatct cctcttcattggaactttaagaaaaatatatgggaacaagagaatcacccgttcaactta caatatggtgcacagcagactgcatgtaacaaattgtactctcagaaaggtaatttattc actgatcagcaaaaatgtctatcagatgagtctgaaggcctcacatgtgaaagttcaaaa gatgagactttctggagagaattaccatctgtccccagtttggatttatttcgtgcttct gattctaatgcaaatcaaaaagaattcaacagcctttatttctaccaaagagctggaaaa agtttaggacagaaaaggcaccatgaatcttcatttaactcaggagacaaggaatcatta acaggttttatgtgctcacaactaccacaattcaaaaaacgacgtctagcatatgaaaaa gtccctggtagagttgatgggcagactcggctgaggtttttttga