GENSCAN 1.0 Date run: 3-Nov-116 Time: 17:41:11 Sequence gi568815596f:205583291_205895055 : 311765 bp : 44.00% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 2219 2214 6 1.05 1.02 Term - 6166 6081 86 2 2 85 50 72 0.382 0.92 1.01 Init - 17575 17443 133 1 1 78 110 86 0.781 10.10 1.00 Prom - 18339 18300 40 -1.86 2.05 PlyA - 18455 18450 6 1.05 2.04 Term - 24921 24865 57 0 0 95 50 35 0.271 -1.91 2.03 Intr - 32216 32078 139 1 1 69 38 148 0.861 8.37 2.02 Intr - 39865 39780 86 1 2 72 98 60 0.165 4.02 2.01 Init - 71560 71522 39 1 0 62 95 53 0.114 3.59 2.00 Prom - 73063 73024 40 -5.16 3.00 Prom + 85652 85691 40 -2.56 3.01 Init + 96940 97002 63 0 0 56 100 21 0.212 1.25 3.02 Intr + 97225 97311 87 0 0 107 105 6 0.259 4.37 3.03 Intr + 100051 100073 23 0 2 65 115 -2 0.251 -3.46 3.04 Intr + 102953 103111 159 2 0 106 57 114 0.532 9.20 3.05 Intr + 111118 111197 80 1 2 110 61 22 0.070 0.89 3.06 Intr + 114254 114431 178 0 1 75 82 279 0.809 24.98 3.07 Intr + 116126 116250 125 2 2 39 68 57 0.516 -0.97 3.08 Intr + 122788 122963 176 2 2 113 84 131 0.888 14.86 3.09 Intr + 123547 123662 116 0 2 26 31 99 0.141 -2.75 3.10 Intr + 132903 133084 182 0 2 73 82 363 0.752 33.71 3.11 Intr + 139188 139418 231 1 0 55 101 296 0.986 25.34 3.12 Intr + 140495 140650 156 0 0 61 70 183 0.999 13.78 3.13 Intr + 142623 142792 170 1 2 98 80 124 0.822 12.27 3.14 Intr + 144601 144756 156 0 0 99 94 205 0.797 22.41 3.15 Intr + 157229 157373 145 1 1 73 94 173 0.947 16.26 3.16 Intr + 159913 160262 350 2 2 138 102 312 0.971 32.68 3.17 Intr + 162456 162600 145 2 1 71 110 185 0.996 18.86 3.18 Intr + 163752 163838 87 1 0 129 63 27 0.948 4.24 3.19 Intr + 166435 166551 117 2 0 86 103 138 0.888 15.54 3.20 Intr + 169545 169685 141 1 0 85 115 126 0.982 15.32 3.21 Intr + 179385 179507 123 1 0 64 18 92 0.323 0.36 3.22 Intr + 180384 180646 263 1 2 53 75 439 0.810 36.31 3.23 Intr + 182184 182280 97 2 1 84 71 123 0.940 9.78 3.24 Term + 192956 193236 281 0 2 82 55 466 0.930 38.31 3.25 PlyA + 196398 196403 6 1.05 4.00 Prom + 199691 199730 40 -4.06 4.01 Init + 201214 201217 4 0 1 109 57 0 0.275 -0.84 4.02 Intr + 208945 209019 75 2 0 78 89 72 0.700 5.79 4.03 Term + 211464 211768 305 1 2 80 49 425 0.931 33.13 4.04 PlyA + 212256 212261 6 1.05 5.00 Prom + 220135 220174 40 -3.06 5.01 Init + 221496 221563 68 2 2 59 91 76 0.405 5.54 5.02 Intr + 223823 223848 26 2 2 101 51 21 0.086 -2.63 5.03 Term + 238706 238812 107 0 2 87 44 73 0.026 1.37 5.04 PlyA + 240602 240607 6 1.05 6.06 PlyA - 242509 242504 6 1.05 6.05 Term - 251284 251174 111 1 0 105 44 54 0.171 1.26 6.04 Intr - 265244 265187 58 1 1 100 42 78 0.061 3.29 6.03 Intr - 269053 268916 138 0 0 122 74 -9 0.493 0.68 6.02 Intr - 274318 274161 158 1 2 81 58 129 0.902 8.01 6.01 Init - 277277 277191 87 2 0 85 65 2 0.332 -1.66 6.00 Prom - 279802 279763 40 -2.16 7.00 Prom + 282435 282474 40 -4.86 7.01 Init + 289983 290127 145 2 1 76 85 73 0.535 6.08 7.02 Intr + 290725 291078 354 2 0 -32 115 232 0.576 9.16 7.03 Term + 300765 300826 62 1 2 28 49 110 0.067 -0.93 7.04 PlyA + 301174 301179 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 239103 239225 123 2 0 8 47 156 0.803 1.88 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:205583291_205895055|GENSCAN_predicted_peptide_1|72_aa MEYYAAIKRNEIMSFAGTWMELEAISLGKLMQEQKTKHRIFSLTETAFEMGLDLELPWEQ RFGEPDFGEILW >gi568815596f:205583291_205895055|GENSCAN_predicted_CDS_1|219_bp atggaatactatgcagccataaaaaggaatgagatcatgtcctttgcagggacgtggatg gagctggaagccattagcctcggcaaactgatgcaggaacagaaaaccaaacaccgtata ttctcacttacagagacagcatttgagatgggactggatctggagttgccgtgggaacag aggtttggagagcctgattttggtgagatcctgtggtga >gi568815596f:205583291_205895055|GENSCAN_predicted_peptide_2|106_aa MEAEKFKDEELHLPFSMTSLNVELGRDAYSWLFEAGTSASIQAPEVLLADSLQDLRVLPW KEKKRHMLAAPKLAVRLENMFLSLLCSLEKPLHSYLPKVLPGDRTL >gi568815596f:205583291_205895055|GENSCAN_predicted_CDS_2|321_bp atggaggctgagaagttcaaggatgaggagctgcatctgcctttctccatgacatcactg aatgtggagttgggacgagatgcatacagttggctctttgaagctggcaccagtgcttct atacaagcccccgaggtgctgctggcagatagtctacaggatctgcgggtcctcccctgg aaagagaagaagagacacatgttagcagctcctaagctggccgtccggctggagaacatg ttcctcagcctcctgtgcagtttggaaaaaccactacacagttacctgcccaaggtgcta cctggagacagaacactctag >gi568815596f:205583291_205895055|GENSCAN_predicted_peptide_3|1216_aa MRPSLKGSGKSCSPGTEDSQECPSPKPWLHYSHKQESPTLGTGKNGLACLTPSERPTRTP PGKSLRHFFNAALCRKVRSRLQRWSLEKPLPAAAGEFPPPLPSRLALCFSTYRRKCSNSP SLDFAISLRVMNLPQVKDPPCGGRLNSKDAGYITSPGYPQDYPSHQNCEWIVYAPEPNQK IVLNFNPHFEIEKHDCKAKFCYAYDSKSPKEIAKRSSYVVFIVAEGESMAFGRKNLGATP LTILFKFPAILTTTFTLHLESKDVRASNRVSAEVIDSSHRENSPAFSVNGAEPESWRLIP GTDASSHTARASTGRYNIVANQPEALCAKQTACFWLYDFIEIRDGDSESADLLGKHCGNI APPTIISSGSMLYIKFTSDYARQGAGFSLRYEIFKTGSEDCSKNFTSPNGTIESPGFPEK YPHNLDCTFTILAKPKMEIILQFLIFDLEHDPLQVGEGDCKYDWLDIWDGIPHVGPLIGK YCGTKTPSELRSSTGILSLTFHTDMAVAKDGFSARYYLVHQEPLENFQCNVPLGMESGRI ANEQISASSTYSDGRWTPQQSRLHGDDNGWTPNLDSNKEYLQVDLRFLTMLTAIATQGAI SRETQNGYYVKSYKLEVSTNGEDWMVYRHGKNHKVFQANNDATEVVLNKLHAPLLTRFVR IRPQTWHSGIALRLELFGCRVTDAPCSNMLGMLSGLIADSQISASSTQEYLWSPSAARLV SSRSGWFPRIPQAQPGEEWLQVDLGTPKTVKGVIIQGARGGDSITAVEARAFVRKFKVSY SLNGKDWEYIQDPRTQQPKLFEGNMHYDTPDIRRFDPIPAQYVRVYPERWSPAGIGMRLE VLGCDWTGAENLPPIQAGFIYVPWNQGLSAVGDPGAYSKPTVETLGPTVKSEETTTPYPT EEEATECGENCSFEDDKDLQLPSGFNCNFDFLEEPCGWMYDHAKWLRTTWASSSSPNDRT FPGPKRGTEISSPNATLFSVQVARMVLYKLDLKVQVTPVFCKAHDRNFLRLQSDSQREGQ YARLISPPVHLPRSPVCMEFQYQATGGRGVALQVVREASQESKLLWVIREDQGGEWKHGR IILPSYDMEYQIVFEGVIGKGRSGEIAIDDIRISTDVPLENCMGGTLLPGTEPTVDTVPM QPIPAYWYYVMAAGGAVLVLVSVALALVLHYHRFRYAAKKTDHSITYKTSHYTNGAPLAV EPTLTIKLEQDRGSHC >gi568815596f:205583291_205895055|GENSCAN_predicted_CDS_3|3651_bp atgaggccttcactcaaggggagtgggaagtcttgcagccctggtacagaggactcccag gaatgcccttcacctaaaccctggctacattattctcacaaacaggagtctcccacattg ggcactggcaaaaatggtttggcatgcctgacaccaagtgagaggccaaccaggacaccc ccagggaaaagcctgcggcatttcttcaacgctgccctatgccggaaagttaggtctcgg ctgcagcgctggtctctggagaagcctctgcctgcagccgccggcgagttcccgcctccc ctccccagccgcctcgctctttgcttttccacgtataggagaaaatgctctaacagtcca agcctggattttgctatctctcttcgggtgatgaatttgccccaggtgaaagacccaccg tgcggaggtcgtttgaattccaaagatgctggctatatcacctctcccggttacccccag gactacccctcccaccagaactgcgagtggattgtttacgcccccgaacccaaccagaag attgtcctcaacttcaaccctcactttgaaatcgagaagcacgactgcaaggccaaattc tgttatgcttatgacagcaaatctcccaaggaaattgccaagaggtcatcctatgttgta ttcatagtggcagagggagaaagcatggcttttggaaggaaaaacttaggagctaccccg ctcaccatcctcttcaaattccctgccatcctcaccaccaccttcactctccatctggag tccaaagatgtgagggccagtaatagagtgtcagcggaagtaattgattcctcacaccga gagaattcgcctgcctttagtgtcaatggagctgaaccagagtcttggaggctcatcccc ggcactgatgcctcctcccacactgccagggcctccactggtcgctacaatattgtggcc aaccagccagaagctctgtgcgcaaaacaaacagcctgcttctggctgtatgactttatc gagattcgggatggggacagtgaatccgcagacctcctgggcaaacactgtgggaacatc gccccgcccaccatcatctcctcgggctccatgctctacatcaagttcacctccgactac gcccggcagggggcaggcttctctctgcgctacgagatcttcaagacaggctctgaagat tgctcaaaaaacttcacaagccccaacgggaccatcgaatctcctgggtttcctgagaag tatccacacaacttggactgcacctttaccatcctggccaaacccaagatggagatcatc ctgcagttcctgatctttgacctggagcatgaccctttgcaggtgggagagggggactgc aagtacgattggctggacatctgggatggcattccacatgttggccccctgattggcaag tactgtgggaccaaaacaccctctgaacttcgttcatcgacggggatcctctccctgacc tttcacacggacatggcggtggccaaggatggcttctctgcgcgttactacctggtccac caagagccactagagaactttcagtgcaatgttcctctgggcatggagtctggccggatt gctaatgaacagatcagtgcctcatctacctactctgatgggaggtggacccctcaacaa agccggctccatggtgatgacaatggctggacccccaacttggattccaacaaggagtat ctccaggtggacctgcgctttttaaccatgctcacggccatcgcaacacagggagcgatt tccagggaaacacagaatggctactatgtcaaatcctacaagctggaagtcagcactaat ggagaggactggatggtgtaccggcatggcaaaaaccacaaggtatttcaagccaacaac gatgcaactgaggtggttctgaacaagctccacgctccactgctgacaaggtttgttaga atccgccctcagacctggcactcaggtatcgccctccggctggagctcttcggctgccgg gtcacagatgctccctgctccaacatgctggggatgctctcaggcctcattgcagactcc cagatctccgcctcttccacccaggaatacctctggagccccagtgcagcccgcctggtt agcagccgctcgggctggttccctcgaatccctcaggcccagcccggtgaggagtggctt caggtagatctgggaacacccaagacagtgaaaggtgtcatcatccagggagcccgcgga ggagacagtatcactgctgtggaagccagagcatttgtgcgcaagttcaaagtctcctac agcctaaacggcaaggactgggaatacattcaggaccccaggacccagcagccaaagctg ttcgaagggaacatgcactatgacacccctgacatccgaaggtttgaccccattccggca cagtatgtgcgggtatacccggagaggtggtcgccggcggggattgggatgcggctggag gtgctgggctgtgactggacaggtgcagaaaacctccccccgatacaagctggcttcatt tatgtgccctggaatcaggggctcagcgctgtaggggatccaggagcttactccaagccc acggtagagacgctgggacccactgtgaagagcgaagagacaaccaccccctaccccacc gaagaggaggccacagagtgtggggagaactgcagctttgaggatgacaaagatttgcag ctcccttcgggattcaattgcaacttcgatttcctcgaggagccctgtggttggatgtat gaccatgccaagtggctccggaccacctgggccagcagctccagcccaaacgaccggacg tttccagggccaaaaagaggtactgagatttcctcaccaaatgccaccttgttctctgta caagtggccagaatggtcctgtacaagttggatttgaaggtgcaggtcacacctgtgttc tgcaaggcacatgacaggaatttcttgcggctgcagagtgacagccagagagagggccag tatgcccggctcatcagcccccctgtccacctgccccgaagcccggtgtgcatggagttc cagtaccaggccacgggcggccgcggggtggcgctgcaggtggtgcgggaagccagccag gagagcaagttgctgtgggtcatccgtgaggaccagggcggcgagtggaagcacgggcgg atcatcctgcccagctacgacatggagtaccagattgtgttcgagggagtgatagggaaa ggacgttccggagagattgccattgatgacattcggataagcactgatgtcccactggag aactgcatggggggcaccctcctgccagggaccgagcccacagtggacacggtgcccatg cagcccatcccagcctactggtattacgtaatggccgccgggggcgccgtgctggtgctg gtctccgtcgcgctggccctggtgctccactaccaccggttccgctatgcggccaagaag accgatcactccatcacctacaaaacctcccactacaccaacggggcccctctggcggtg gagcccaccctaaccattaagctagagcaagaccgtggctcgcactgctga >gi568815596f:205583291_205895055|GENSCAN_predicted_peptide_4|127_aa MVDIPEIHEREGYEDEIDGEYCYDLADEYEVDWSNSSSATSGSGAPSTDKEKSWLYTLDP ILITIIAMSSLGVLLGATCAGLLLYCTCSYSGLSSRSCTTLENYNFELYDGLKHKVKMNH QKCCSEA >gi568815596f:205583291_205895055|GENSCAN_predicted_CDS_4|384_bp atggtggacatcccagaaatacatgagagagaaggatatgaagatgaaattgatggtgag tactgttatgatttagcagatgaatacgaggtggactggagcaattcttcttctgcaacc tcagggtctggcgccccctcgaccgacaaagaaaagagctggctgtacaccctggatccc atcctcatcaccatcatcgccatgagctcactgggcgtcctcctgggggccacctgtgca ggcctcctgctctactgcacctgttcctactcgggcctgagctcccgaagctgcaccaca ctggagaactacaacttcgagctctacgatggccttaagcacaaggtcaagatgaaccac caaaagtgctgctccgaggcatga >gi568815596f:205583291_205895055|GENSCAN_predicted_peptide_5|66_aa MLMRATQTIALSPAINGSSHQSWFGGSAADTRLSKLKPRWVHKRVPTERTAPTPPGETVK KAIEKV >gi568815596f:205583291_205895055|GENSCAN_predicted_CDS_5|201_bp atgctcatgcgtgcgactcagacgatagccttgtcccctgccatcaatggatcttcacac cagagttggtttggaggttcagcagctgatacacggctgtccaaacttaagcccagatgg gtccacaaaagggtcccaacggaaagaactgctcccacacctcctggagaaacagtcaag aaggctatagaaaaggtttag >gi568815596f:205583291_205895055|GENSCAN_predicted_peptide_6|183_aa MEREDPDASPGACQGSTREATGEECTQASGWNDEPKTAQLENGTLEAGSGSDDSTFSSGA AASLKTKGVTMVERLLNSEEGSFASLIKNLTQWMMPAIHKCTLEIKCPPCQKKAEYRSGS NSTDGWHRSNGDDSEEEKSNYVETANLKIGGETKKINDLIQESKKTKEDQVVPKDWAWIP ERI >gi568815596f:205583291_205895055|GENSCAN_predicted_CDS_6|552_bp atggagagagaggaccctgatgcatccccaggagcctgccaggggagcacgagggaggca accggagaagaatgcacccaggcatctgggtggaatgatgaacccaagaccgcacagcta gaaaatggcaccctggaagctggatcgggctctgatgattccacattcagctcaggggca gctgcctctctaaagaccaagggtgttacaatggtggagaggctgttgaattcagaggag gggagctttgcgagcttgattaaaaatttaacacagtggatgatgccagccatacacaag tgtaccttagagataaaatgtccaccatgtcagaagaaagcagagtacagatctggcagt aactctactgatggatggcatcgaagtaatggtgatgacagtgaggaagagaaatccaat tatgtagaaactgcgaatttgaaaattggaggggaaactaaaaaaatcaatgacctaatt caggagtcaaagaaaaccaaagaagatcaagtggtccccaaagactgggcttggatacct gaaagaatatag >gi568815596f:205583291_205895055|GENSCAN_predicted_peptide_7|186_aa MPDLPWLMVEERIKSLRKSGEAGADIPRKARKIMKVDYVPKDDPKESAETFWSRILGELD DKRNHGQGLRDTGSVSGIYSFWWVLGLADFKNEAADLAVLQLSDVVHPELFVPHGAFMVY GSDLKDEAADLTVSVTALKGGASGVVCSSRWVRGLADLKNDAADPRGVPKPQAMDQYMDH YWSMAC >gi568815596f:205583291_205895055|GENSCAN_predicted_CDS_7|561_bp atgccggatttgccatggctgatggtggaagaaaggattaaaagtctcaggaaaagtgga gaggctggagcggatataccaagaaaggccaggaaaatcatgaaagttgattatgttcca aaagacgacccgaaagaatctgctgagaccttctggagtaggatcctgggggagctagat gataaacggaaccatggccaaggtttaagagatactgggtctgtgtccggaatttattcc ttctggtgggttcttggtctcgctgacttcaagaatgaagctgcagacctcgcggtatta cagctctcagacgtggtgcatccggagttgtttgttcctcacggtgcgttcatggtctat ggctctgacttaaaggatgaagctgcagacctcacggtcagtgttacagctcttaaaggt ggtgcatccggagttgtttgttcctcccggtgggttcgtggtctcgctgacttgaagaat gacgcggcagaccctcgcggggtccccaagccccaggccatggaccagtacatggaccac tactggtccatggcctgttag