GENSCAN 1.0 Date run: 8-Nov-116 Time: 17:45:24 Sequence gi568815597f:103471603_103656330 : 184728 bp : 35.61% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 14789 15404 616 1 1 71 60 174 0.037 8.44 1.02 Intr + 34019 34080 62 0 2 61 115 44 0.133 1.93 1.03 Intr + 48323 48519 197 1 2 63 86 111 0.153 5.69 1.04 Intr + 54465 54660 196 0 1 75 95 156 0.961 13.50 1.05 Intr + 56093 56140 48 1 0 68 86 46 0.548 0.36 1.06 Intr + 65314 65448 135 0 0 32 34 115 0.427 0.34 1.07 Intr + 65740 65882 143 0 2 30 68 225 0.477 12.93 1.08 Intr + 69748 69873 126 1 0 84 91 31 0.559 1.87 1.09 Intr + 71694 71845 152 0 2 34 91 79 0.414 1.69 1.10 Intr + 73339 73500 162 2 0 32 90 116 0.803 5.23 1.11 Intr + 74646 74740 95 1 2 35 100 21 0.339 -3.24 1.12 Intr + 75375 75433 59 2 2 61 103 17 0.360 -2.74 1.13 Term + 79339 79507 169 1 1 105 28 146 0.799 6.67 1.14 PlyA + 79647 79652 6 1.05 2.04 PlyA - 79802 79797 6 1.05 2.03 Term - 98380 97884 497 2 2 55 42 301 0.536 15.94 2.02 Intr - 98653 98395 259 1 1 76 2 249 0.303 11.21 2.01 Init - 98978 98778 201 2 0 99 2 163 0.700 7.76 2.00 Prom - 99213 99174 40 -8.45 3.00 Prom + 99954 99993 40 -9.75 3.01 Init + 100001 100168 168 1 0 78 94 75 0.926 5.12 3.02 Intr + 100508 100654 147 1 0 60 98 37 0.711 1.41 3.03 Intr + 101461 101658 198 0 0 74 99 180 0.999 16.33 3.04 Intr + 102106 102336 231 0 0 70 73 182 0.999 11.95 3.05 Intr + 102658 102791 134 0 2 75 33 130 0.993 4.72 3.06 Intr + 103839 103938 100 0 1 69 111 59 0.989 5.49 3.07 Intr + 105888 106006 119 2 2 16 100 94 0.989 1.74 3.08 Intr + 106118 106243 126 2 0 72 119 102 0.995 10.57 3.09 Intr + 145149 145215 67 0 1 33 109 88 0.046 3.39 3.10 Intr + 145887 146006 120 2 0 28 94 64 0.326 0.77 3.11 Intr + 146352 146498 147 2 0 70 98 37 0.831 2.41 3.12 Intr + 147309 147506 198 2 0 78 91 192 0.999 17.13 3.13 Intr + 147952 148182 231 0 0 70 73 144 0.994 8.15 3.14 Intr + 148949 149082 134 1 2 67 33 125 0.983 3.42 3.15 Intr + 150192 150291 100 0 1 50 111 75 0.983 5.19 3.16 Intr + 152264 152382 119 1 2 49 100 94 0.999 5.04 3.17 Intr + 152494 152619 126 1 0 72 119 85 0.996 8.87 3.18 Term + 153955 154144 190 1 1 79 38 96 0.885 -0.36 3.19 PlyA + 156926 156931 6 1.05 4.07 PlyA - 157518 157513 6 1.05 4.06 Term - 178796 178678 119 0 2 103 33 122 0.928 5.92 4.05 Intr - 179477 179372 106 2 1 32 16 149 0.895 0.97 4.04 Intr - 180949 180727 223 0 1 35 86 147 0.715 6.41 4.03 Intr - 182420 182026 395 2 2 8 67 319 0.388 14.33 4.02 Intr - 183104 182736 369 2 0 40 55 205 0.382 6.78 4.01 Intr - 183621 183541 81 0 0 51 68 72 0.388 0.42 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 48350 48519 170 1 2 92 86 93 0.808 8.55 S.002 Term + 107709 107898 190 0 1 89 38 93 0.952 0.34 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:103471603_103656330|GENSCAN_predicted_peptide_1|719_aa MGKDFMSKTPKAMTIKAKIDKWDLIKLKSFCTAKATTISVNRKPTEWEKIFAICSSDKGL ISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKRHMKKCSSSLAIREIQIKTT MRYHLTAVRMAIIKKSGNSRCWRGCGEVGTLLHSRWDCKLVQPLWKSVWRFLRDLELEIP FDPAIPLLGVYPKDYKSCCYKDTCTRNFNMKEKMRQMNTLKAHLDKYGKRIEEAEMVLTD NEEFNEKVICKHLKKVRELSMQISVGTASLKERMVKAKVLVMGLKCSSTARRKMAAPEQP LAISRGCTSSSSLSPPRGDRTLLVRHLPAELTAEEKEDLLKYFGAQSVRVLSDKGRLKHT AFATFPNEKAAIKILSKSRAEGYFVLLSHEKHLNCFDHVPSQVDEDIATQSADQEGRIYE DYMPLHAPLPPTSPQPPEEPPLPDEDEELSSEESEYESTDDEDRQRMNKLMELANLQPKR PKTIKQRHVRKKRKIKDMLNTPLCPSHSSLHPVLLPSDVFDQPQPVGNKRIEFHISTDMP AAFKKDLEKEQNCEEKNHDLPATEVDASNIGFGKIFPKPNLDITEEIKEDSDEMPSECIS RRELEKGRISREEMETLSVFRSYEPGEPNCRIYVKNLAKHVQEKDLKYIFGRYVDFSSET QRIMFDIRLMKEGRMKGQAFIGLPNEKAAAKALKEANGYVLFGKPMVVVSFKSIYFKTI >gi568815597f:103471603_103656330|GENSCAN_predicted_CDS_1|2160_bp atgggcaaggacttcatgtctaaaacaccaaaagcaatgacaataaaagccaaaattgac aaatgggatctaattaaactaaagagcttctgcacagcaaaagcaactaccatcagtgtg aacaggaaacctacagaatgggagaaaatttttgcaatctgctcatctgacaaagggcta atatccagaatctacaatgaactcaaacaaatttacaagaaaaaaactaacaaccccatc aaaaagtgggcaaaggatatgaacagacacttctcaaaagaagacatttatgcagccaaa agacacatgaaaaaatgctcatcatcactggccataagagaaatacaaatcaaaaccaca atgagataccatctcacagcagttagaatggcgatcattaaaaagtcaggaaacagcagg tgctggagaggatgtggagaagtaggaacacttttacacagtcggtgggactgtaaacta gttcaaccattgtggaagtcagtgtggcgattcctcagggatctagaactagaaatacca tttgacccagccatcccattactgggtgtatacccaaaggattataaatcatgctgctat aaagacacatgcacacgtaattttaacatgaaagaaaaaatgcgacaaatgaatacactt aaagcacacttggacaagtatgggaagagaatagaggaggcagagatggtgttgactgac aatgaagaatttaatgagaaagtcatatgtaagcatctgaagaaagtcagggagctaagc atgcagatatctgtaggaacagcatccctgaaagagagaatggtaaaggcaaaggtctta gttatgggtctgaagtgttcaagcacagccagaaggaaaatggcagctcccgagcagccg cttgcgatatcaaggggatgcacgagctcctcctcgctttccccgcctcggggcgaccga acccttctggtcaggcacctgccggctgagcttactgctgaggagaaagaggacttgctg aagtacttcggggctcagtctgtgcgggtcctgtcagataaggggcgactgaaacataca gcttttgccacattccctaatgaaaaagcagctataaagatcctttccaaaagtagagca gagggatattttgttctactgagccacgaaaaacacctgaattgtttcgaccatgtgcct tcccaggttgatgaagacattgctacacagtctgcagatcaggaaggaagaatttatgaa gactatatgccattgcatgcacctcttccacccacatctcctcagccacctgaggaacct cctttgccagacgaggatgaggaattatctagtgaagaatcagaatatgaaagcactgat gatgaggaccgacagagaatgaacaaattaatggaactagcaaatcttcagcccaaaaga cctaaaacaataaagcagcgccatgtgagaaaaaagagaaaaataaaggatatgttgaat acacctttgtgtccttcacacagcagtttacatccagtgctgttaccttcagatgtattt gaccaaccacaacctgtaggtaacaaaagaattgaattccatatatctaccgacatgcca gctgcatttaagaaagatttagaaaaggaacaaaattgtgaggaaaaaaatcatgattta cctgctactgaagttgatgcatccaatataggatttggaaaaatcttccccaaacctaat ttggacatcacagaggagattaaagaagactctgatgaaatgccttcagaatgtatttct agaagggaattggaaaagggcagaatttctagagaagaaatggaaacactttcagttttc agaagttatgaaccgggtgaaccaaactgtagaatttatgtaaagaatttagctaaacat gttcaagaaaaggaccttaaatatatttttggaagatatgttgacttttcatcagaaaca cagcggatcatgtttgatatacgtttgatgaaagaaggtcgtatgaaaggacaagctttc attggacttcctaatgaaaaagcagcagcaaaagccttaaaggaagctaatggatatgtg ctttttggaaaacccatggtggttgtatcctttaaatccatctatttcaaaactatataa >gi568815597f:103471603_103656330|GENSCAN_predicted_peptide_2|318_aa MEPLIHTEYLCSGDAIILIFMVLGARVVISFCILSVMPGYMLPPDSTVLAYRSLRMSTSH FMMELKMGGGGCGSGHLLLKVQGNIAQFLLDVVSDLPLGSGAEAIANLGKDLHEVVGQVL ASQVQMQNGVREGVALIDGHHVGDPISKAHDNASVEGQPSLDGHVHGQGVEGIKHDLSHL LSVGLEVQGVLGQQHQVLLQGHVQSLVEGVMPDLLYVIPVGHDTMLNGGTSGSGYCACSQ PCSPHRSPSGPCPPSCPGAGGTDDGRKHGWGLSSPAKPALHMLEPLSITSTAISSSIATG GGVGQWSGRTTWCMGWQR >gi568815597f:103471603_103656330|GENSCAN_predicted_CDS_2|957_bp atggagccgctgatccacacggagtacttgtgctctggggatgcgatcatcttgatcttc atggtgctaggtgcccgggtggtgatctccttctgcatcctgtcggtgatgcctgggtac atgttgccaccagatagcaccgtgttggcctacaggtctttgcggatgtccacgtcacac ttcatgatggagttgaagatgggaggaggaggatgtggcagtggccatctactgctcaaa gtccagggcaacatagcacagtttctccttgacgtagtgtctgatctcccgctcggcagt ggtgctgaagctatagccaaccttggtaaggatcttcatgaggtagttggtcaggtcctg gccagccaggtccagatgcagaatggtgtcagggagggcgtagcccttatagatgggcac catgtgggtgaccccatctccaaagcccatgacaatgccagtgtagagggacagcctagc ctggatggccacgtacatggccagggtgttgaaggtatcaaacatgatctgagtcatctt ctctctgttggccttgaggttcagggggtcctcggtcagcagcaccaggtgctcctccaa ggccatgtgcagtcccttgtagaaggtgtgatgccagatcttctctatgtcatcccagtt ggtcatgataccatgcttaatgggggtacttcagggtcaggatactgtgcttgctctcag ccatgtagcccacataggagtccctctggcccatgcccaccatcatgccctggtgccggg ggcaccgatgatggaaggaaacatggctgggggttgtcgtccccagcaaagccagctttg cacatgctggagccattgtcaatcaccagtacggcaatctcttcttccattgcgactggt ggaggagtgggacagtggagcggcaggacaacatggtgcatgggctggcagaggtga >gi568815597f:103471603_103656330|GENSCAN_predicted_peptide_3|884_aa MKFFLLLFTIGFCWAQYSPNTQQGRTSIVHLFEWRWVDIALECERYLAPKGFGGVQVSPP NENVAIHNPFRPWWERYQPVSYKLCTRSGNEDEFRNMVTRCNNVGVRIYVDAVINHMSGN AVSAGTSSTCGSYFNPGSRDFPAVPYSGWDFNDGKCKTGSGDIENYNDATQVRDCRLVGL LDLALEKDYVRSKIAEYMNHLIDIGVAGFRLDASKHMWPGDIKAILDKLHNLNSNWFPAG SKPFIYQEVIDLGGEPIKSSDYFGNGRVTEFKYGAKLGTVIRKWNGEKMSYLKLYKMAVG FMLAHPYGFTRVMSSYRWPRQFQNGNDVNDWVGPPNNNGVIKEVTINPDTTCGNDWVCEH RWRQIRNMVNFRNVVDGQPFTNWYDNGSNQVAFGRGNRGFIVFNNDDCLYYTWLGHFMAK NMLVEDQSGSYSPNTQQGRTSIVHLFEWRWVDIALECERYLAPKGFGGVQVSPPNENVAI YNPFRPWWERYQPVSYKLCTRSGNEDEFRNMVTRCNNVGVRIYVDAVINHMCGNAVSAGT SSTCGSYFNPGSRDFPAVPYSGWDFNDGKCKTGSGDIENYNDATQVRDCRLTGLLDLALE KDYVRSKIAEYMNHLIDIGVAGFRLDASKHMWPGDIKAILDKLHNLNSNWFPAGSKPFIY QEVIDLGGEPIKSSDYFGNGRVTEFKYGAKLGTVIRKWNGEKMSYLKLYKMAVGFMLAHP YGFTRVMSSYRWPRQFQNGNDVNDWVGPPNNNGVIKEVTINPDTTCGNDWVCEHRWRQIR NMVIFRNVVDGQPFTNWYDNGSNQVAFGRGNRGFIVFNNDDWSFSLTLQTGLPAGTYCDV ISGDKINGNCTGIKIYVSDDGKAHFSISNSAEDPFIAIHAESKL >gi568815597f:103471603_103656330|GENSCAN_predicted_CDS_3|2655_bp atgaagttctttctgttgcttttcaccattgggttctgctgggctcagtattccccaaat acacaacaaggacggacatctattgttcatctgtttgaatggcgatgggttgatattgct cttgaatgtgagcgatatttagctcccaagggatttggaggggttcaggtctctccacca aatgaaaatgttgcaattcacaaccctttcagaccttggtgggaaagataccaaccagtt agctataaattatgcacaagatctggaaatgaagatgaatttagaaacatggtgactaga tgtaacaatgttggggttcgtatttatgtggatgctgtaattaatcatatgtctggtaat gctgtgagtgcaggaacaagcagtacctgtggaagttacttcaaccctggaagtagggac tttccagcagtcccatattctggatgggattttaatgatggtaaatgtaaaactggaagt ggagatatcgagaactacaatgatgctactcaggtcagagattgtcgtctggttggtctt cttgatcttgcactggagaaagattatgtgcgttccaagattgccgaatatatgaatcat ctcattgacattggtgttgcagggttcagacttgatgcttccaagcacatgtggcctgga gacataaaggcaattttggacaaactgcataatctaaacagtaactggttccctgcagga agtaaacctttcatttaccaggaggtaattgatctgggtggtgagccaattaaaagcagt gactactttggaaatggccgggtgacagaattcaagtatggtgcaaaactcggcacagtt attcgcaagtggaatggagagaagatgtcttacctaaagctgtataaaatggcagttgga tttatgcttgctcatccttatggttttacacgagtaatgtcaagctaccgttggccaaga cagtttcaaaatggaaacgatgttaatgattgggttgggccaccaaataataatggagta attaaagaagttactattaatccagacactacttgtggcaatgactgggtctgtgaacat cgatggcgccaaataaggaacatggttaatttccgcaatgtagtggatggccagcctttt acaaactggtatgataatgggagcaaccaagtggcttttgggagaggaaacagaggattc attgttttcaacaatgatgactgtctttactacacgtggcttggtcacttcatggctaaa aacatgcttgtggaagaccagtctggctcgtattccccaaatacacaacaaggacggaca tctattgttcatctgtttgaatggcgatgggttgatattgctcttgaatgtgagcgatat ttagctccgaagggatttggaggggttcaggtctctccaccaaatgaaaatgttgcaatt tacaaccctttcagaccttggtgggaaagataccaaccagttagctataaattatgcaca agatctggaaatgaagatgaatttagaaacatggtgactagatgtaacaatgttggggtt cgtatttatgtggatgctgtaattaatcatatgtgtggtaacgctgtgagtgcaggaaca agcagtacctgtggaagttacttcaaccctggaagtagggactttccagcagtcccatat tctggatgggatttcaatgatggtaaatgtaaaactggaagtggagatatcgagaactac aatgatgctactcaggtcagagattgtcgtctgactggtcttcttgatcttgcactggag aaggattacgtgcgttctaagattgccgaatatatgaaccatctcattgacattggtgtt gcagggttcagacttgatgcttccaagcacatgtggcctggagacataaaggcaattttg gacaaactgcataatctaaacagtaactggttccctgcaggaagtaaacctttcatttac caggaggtaattgatctgggtggtgagccaattaaaagcagtgactactttggtaatggc cgggtgacagaattcaagtatggtgcaaaactcggcacagttattcgcaagtggaatgga gagaagatgtcttacttaaagctgtacaaaatggcagttggatttatgcttgctcatcct tacggatttacacgagtaatgtcaagctaccgttggccaagacagtttcaaaatggaaac gatgttaatgattgggttgggccaccaaataataatggagtaattaaagaagttactatt aatccagacactacttgtggcaatgactgggtctgtgaacatcgatggcgccaaataagg aacatggttattttccgcaatgtagtggatggccagccttttacaaattggtatgataat gggagcaaccaagtggcttttgggagaggaaacagaggattcattgttttcaacaatgat gactggtcattttctttaactttgcaaactggtcttcctgctggcacatactgtgatgtc atttctggagataaaattaatggcaattgcacaggcattaaaatttacgtttctgatgat ggcaaagctcatttttctattagtaactctgctgaagatccatttattgcaattcatgct gaatctaaattgtaa >gi568815597f:103471603_103656330|GENSCAN_predicted_peptide_4|430_aa CCVPLKGTEIVHLGSSDFETVAIRCSQRFLVTASFRSSTVVKGQAAAVVVAKGHLFQESS RSTGQEKPAPKVLFDPELEDSRQEMAPTVPSTPYPVGRPPSPEPTAPRPPRVDKNKSETA GKSTSLAARLTAQDGNSNALREQRYTRTDEDPNEGPDMERLRGYREALIERLKKGAQKGT NVNKVSEVIQGKEESPAQFYQRPCEAYCMYTPFDLESPENQPLINTALVIQSAEDIQRKL QKQARFAGMKTLQLLERANEVFVNRDATSGQESRKEGERQARFKKSPTIFGEASVGDLQM FPAKDLGCILLQYVDDLLLGHSMAVRCAKRTGALLRHLEDCGYKVPKKKAQICRQQHNAK QGPSVPRGIEASGAAHFEDLQVDFTEMPKCRSDIVWIKDWNVAPLWPQWKGPQTINLTTP TAVNVEGIPA >gi568815597f:103471603_103656330|GENSCAN_predicted_CDS_4|1293_bp tgttgtgtgcccttaaaagggacagaaattgtgcacttggggagctcggattttgagaca gtagctatccgatgctcccagagattcctagttacagctagttttagatcctctacagtg gtaaaaggacaggcagcagcagtagtagtagcaaagggacatttatttcaggaaagttct cgctccaccggccaagagaagccagcacctaaagttctgtttgacccagagctcgaggac tcaaggcaggagatggcaccaacagtgccctcaaccccttatccagtggggaggccccct tctcctgagcccacagcccctagaccacccagagtagacaagaacaaaagtgaaactgcg ggaaaatccacttccctggcagcccgcttaacggcccaagacgggaattcaaatgccctg agagagcagcgatatactaggacagatgaggatccaaatgaaggaccagatatggaaagg ctaagagggtaccgagaggcattaattgaaaggttgaaaaaaggggctcaaaagggtacc aatgtaaataaagtttctgaagtcatccaaggaaaggaggaaagcccagcccagttctat caaagaccgtgtgaggcctattgcatgtacactcctttcgatctggagagtcctgaaaat cagccgttgattaatacggccttagttattcagagtgcagaagatatccagagaaaattg caaaaacaggctaggtttgcaggaatgaaaaccttgcagttactggaaagagctaatgaa gtatttgtaaatagagatgcaacaagcggccaagaaagccgtaaggagggcgaacgccag gccaggttcaaaaagtcccccaccatctttggggaggcctcggttggagacctccaaatg tttcctgctaaagacctaggttgcatcctgctccagtatgtagatgaccttctgctagga cactccatggcagtcaggtgtgcaaaaaggacgggtgccctgcttcgacacctggaggac tgtggatataaagtgcccaaaaagaaagctcagatctgcagacagcagcacaatgcgaag caaggcccctctgtacctcggggaatagaagcctctggagcagctcattttgaagatctt caagtggacttcacagaaatgcctaaatgtagaagtgacattgtgtggatcaaggactgg aacgtggctccgctgtggccacagtggaaaggaccccagacaattaacctgaccactccc acagctgtcaatgtagaaggaatcccagcctag