GENSCAN 1.0 Date run: 8-Nov-116 Time: 12:55:55

Sequence gi568815584r:73960645_74184394 : 223750 bp : 42.14% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    529    731  203  0  2   18   86   259 0.979  15.86
 1.02 Intr +    811    926  116  1  2   72   82   106 0.998   7.57
 1.03 Intr +   1093   1259  167  2  2   96   69   121 0.808   9.76
 1.04 Term +   2326   2355   30  0  0  102   48    19 0.386  -3.82
 1.05 PlyA +   2446   2451    6                               1.05

 2.13 PlyA -   2609   2604    6                               1.05
 2.12 Term -   2842   2813   30  1  0   99   42    20 0.007  -4.62
 2.11 Intr -   9481   9366  116  2  2   98  110    86 0.982  11.05
 2.10 Intr -  11264  11208   57  0  0   74  115    25 0.766   1.74
 2.09 Intr -  12380  12240  141  0  0   99   97   181 0.892  19.50
 2.08 Intr -  14435  14280  156  0  0   38   76    85 0.253   1.36
 2.07 Intr -  16415  16380   36  0  0  125   90    -2 0.261   1.02
 2.06 Intr -  16730  16655   76  2  1  121   81    69 0.366   7.67
 2.05 Intr -  22580  22374  207  2  0    1  107   201 0.631  11.65
 2.04 Intr -  26249  26170   80  0  2   23  109    41 0.660  -1.95
 2.03 Intr -  27387  27242  146  2  2   85  111   201 0.507  21.11
 2.02 Intr -  37677  37503  175  1  1   85    4   103 0.614  -0.32
 2.01 Init -  43021  42997   25  1  1  102   98    30 0.607   5.14
 2.00 Prom -  50443  50404   40                              -5.75

 3.00 Prom +  50746  50785   40                              -6.25
 3.01 Init +  54612  54623   12  2  0  108   92     8 0.685   3.06
 3.02 Intr +  57899  58048  150  1  0   37   47   168 0.533   7.04
 3.03 Intr +  58724  58890  167  1  2   29   72   128 0.557   3.14
 3.04 Intr +  62272  62500  229  1  1   88   78   171 0.306  13.05
 3.05 Intr +  68540  68605   66  1  0   77   97    68 0.524   4.78
 3.06 Intr +  73384  73527  144  0  0   84   95   120 0.995  11.86
 3.07 Intr +  89058  89551  494  2  2   96   34   353 0.785  21.57
 3.08 Intr +  94940  95041  102  2  0   68   61   104 0.868   4.07
 3.09 Intr +  96262  96336   75  1  0   90   86    50 0.836   2.71
 3.10 Intr +  96500  96614  115  2  1   88   70   100 0.932   7.73
 3.11 Term +  96943  96987   45  0  0   99   49    27 0.829  -3.87
 3.12 PlyA +  98042  98047    6                               1.05

 4.16 PlyA -  98513  98508    6                              -0.45
 4.15 Term - 100102  99998  105  1  0  116   43    92 0.994   4.93
 4.14 Intr - 104276 104178   99  2  0  106   93    71 0.989   8.79
 4.13 Intr - 104716 104537  180  1  0   56   79   261 0.935  21.14
 4.12 Intr - 106242 106061  182  1  2  123   61   131 0.935  12.57
 4.11 Intr - 106925 106736  190  2  1   76   89   255 0.948  22.64
 4.10 Intr - 108337 108216  122  2  2   78   17    91 0.847   0.29
 4.09 Intr - 110853 110551  303  1  0   59   88   214 0.902  14.14
 4.08 Intr - 111330 111252   79  0  1  105   62   109 0.940   8.21
 4.07 Intr - 111720 111559  162  0  0   59   56   212 0.971  14.35
 4.06 Intr - 111967 111893   75  1  0   79   74    80 0.956   4.49
 4.05 Intr - 114373 114311   63  1  0  107   89    27 0.855   2.70
 4.04 Intr - 122738 122568  171  2  0   69   75    71 0.789   3.12
 4.03 Intr - 122949 122907   43  2  1   57  110    23 0.983  -1.28
 4.02 Intr - 123333 123237   97  1  1  118   94    33 0.983   5.05
 4.01 Init - 123750 123699   52  0  1  102   44   177 0.964  14.11
 4.00 Prom - 130058 130019   40                              -4.45

 5.00 Prom + 132639 132678   40                              -7.25
 5.01 Init + 137177 137216   40  1  1   29  115    53 0.178   2.50
 5.02 Intr + 140511 140664  154  1  1   46   66   118 0.070   3.61
 5.03 Intr + 165549 165701  153  0  0   66   70    77 0.042   1.97
 5.04 Intr + 181220 181413  194  2  2   74   82   117 0.692   7.91
 5.05 Intr + 200905 201147  243  2  0   -6   61   270 0.439  11.45
 5.06 Term + 204008 204126  119  0  2   65   43   101 0.717   1.02
 5.07 PlyA + 206144 206149    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584r:73960645_74184394|GENSCAN_predicted_peptide_1|171_aa
WSDADHTDFIDTAGAMLQYAVSLLKPTKVSARQLPPSVARVDAKSRVLFPLGLGHAAEYV
RPRVALIGDAAHRVHPLAGQGVNMGFGDISSLAHHLSTAAFNGKDLGSVSHLTGYETERQ
RHNTALLAATDLLKRLYSTSASPLVLLRTWGLQATNAVSPLKEQIMAFASK

>gi568815584r:73960645_74184394|GENSCAN_predicted_CDS_1|516_bp
tggagtgatgctgaccacacggacttcatcgacacagctggtgccatgctgcagtatgct
gtcagccttctgaagcccactaaggtctcggctcgccagctgcccccaagcgtagccagg
gtggatgccaaaagccgagttctgtttcctcttgggttgggacatgctgctgagtacgtc
aggcctcgggtggcgctcattggggatgcagcccacagagtccatccgcttgcaggacag
ggtgtcaacatgggctttggggatatctccagcttggcccatcacctcagtacggcagcc
ttcaatgggaaggacttaggttccgtgagccacctcacaggttatgaaacagaaagacag
cgtcacaacactgctcttctggctgctacagacttactaaaaaggctctattctaccagt
gcctccccgcttgtgttgctcaggacgtggggcttgcaggccacaaatgcagtgtctcca
ctcaaagaacagattatggcctttgcaagcaaatga

>gi568815584r:73960645_74184394|GENSCAN_predicted_peptide_2|414_aa
MGSFGYEQDRKSREGEVTVGQLLDLSQVHIIAFFVQQASDAPIDNCKEHCAGGVTALSTP
SGGRRSLNQQTWFEGIFLSSMCPINVSASTLYGIMFDAGSTGTRIHVYTFVQKMPGQLPI
LEGEVFDSVKPGLSAFVDQPKQNRPVPATKWIVHQMNGLSLLKGAETVQGLLEVAKDSIP
RSHWKKTPVVLKATAGLRLLPEHKAKALLFEVKEIFRKSPFLVPKGSVSIMDGSDEGILA
WVTVNFLTEGHRNNVTFERGSFRLENFRTSMKLVNPVFYSYLGFGLKAARLATLGALETE
GEVGFEPCYAEVLRVVRGKLHQPEEVQRGSFYAFSYYYDRAVDTDMIDYEKGGILKVEDF
ERKAREVCDNLENFTSGSPFLCMDLSYITALLKDGFGFADSTVLQKVQYTCNKG

>gi568815584r:73960645_74184394|GENSCAN_predicted_CDS_2|1245_bp
atgggcagttttggttatgagcaagatagaaagagcagggagggagaggtaacagttggt
cagctgctagatttatcacaggtacacataattgctttctttgtacaacaggcttcagat
gctcctatagataattgcaaggaacactgcgctgggggtgtgactgccctcagcacccct
tctggtggcagacgcagtttgaaccagcagacttggtttgagggtatcttcctgtcttcc
atgtgccccatcaatgtcagcgccagcaccttgtatggaattatgtttgatgcagggagc
actggaactcgaattcatgtttacacctttgtgcagaaaatgccaggacagcttccaatt
ctagaaggggaagtttttgattctgtgaagccaggactttctgcttttgtagatcaacct
aagcagaatagaccagtgccagccactaaatggatcgttcatcagatgaacgggttatct
cttttgaagggtgctgagaccgttcaagggctcttagaggtggccaaagactcaatcccc
cgaagtcactggaaaaagaccccagtggtcctaaaggcaacagcaggactacgcttactg
ccagaacacaaagccaaggctctgctctttgaggtaaaggagatcttcaggaagtcacct
ttcctggtaccaaagggcagtgttagcatcatggatggatccgacgaaggcatattagct
tgggttactgtgaattttctgacagagggacacaggaacaatgttacttttgaaagaggt
agctttaggctagagaacttcaggaccagcatgaaattagtcaatcctgtattttacagt
tacctgggatttggattgaaagctgcaagactagcaaccctgggagccctggagacagaa
ggggaggtgggctttgagccctgctatgccgaagtgctgagggtggtacgaggaaaactt
caccagccagaggaggtccagagaggttccttctatgctttctcttactattatgaccga
gctgttgacacagacatgattgattatgaaaaggggggtattttaaaagttgaagatttt
gaaagaaaagccagggaagtgtgtgataacttggaaaacttcacctcaggcagtcctttc
ctgtgcatggatctcagctacatcacagccctgttaaaggatggctttggctttgcagac
agcacagtcttacagaaagtccagtatacctgtaacaaaggttaa

>gi568815584r:73960645_74184394|GENSCAN_predicted_peptide_3|532_aa
MASKRCIRDFRSRGGKLKSFQLYATFRKEGCGPVLANGNPGGGRGQEVSATQDLRRGWGI
ASSASRFPWRQSWPGRPRLGNYDSGAPGEAKMPSKGKDKKKGKSKGKDTKKLIKTDESVV
DRAKANASLWEARLEVTELSRIKYRDTSRILAKSNEDLKKKQCKMEKDIMSVLSYLKKQD
QEKDNMIEKLKQQLNETKEKAQEEKDKLEQKYTRQINELEGQFHQKAKEIGMIHTELKAV
RQFQKRKIQVERELDDEINDLLVKEKIMQLVQQRSQIQTLQKKVVNLETALSYMTKEFES
EVLKLQQHAMIENQAGQVEIDKLQHLLQMKDREMNRVKKLAKNILDERTEVERFFLDALH
QVKQQILISRKHYKQIAQAAFNLKMRAACTGRTEYPKIRTFDGREHSTNSVNQDLLEAEK
WTHIEGNVDIGDLTWEQKEKVLRLLFAKMNGCPSRKYNQSSRPPVPDYVVSDSGETKEFG
DESKLQDKIFITQQIAISDSSGEVVLPTIPKEPQESDTVGSQSHYNLEDKGL

>gi568815584r:73960645_74184394|GENSCAN_predicted_CDS_3|1599_bp
atggcctctaagaggtgtattcgggatttcaggagtcggggaggcaagttgaagagcttc
caactttatgcaactttcaggaaagaaggctgcggaccggtattagcaaatggaaatccg
gggggcggtcgtgggcaggaagtatcggccacacaggacctacggcgcgggtggggcatt
gccagctcggcgtcccggttcccttggagacagagctggccagggcggccgcggctgggc
aactacgacagcggagcccctggggaagccaagatgccgtcgaagggaaaggacaaaaag
aaaggcaagagcaaaggcaaagacacgaagaagttaataaaaacagatgaatctgtggtg
gacagagccaaggccaatgcctccctttgggaggccaggttggaagtcacagaactctct
aggattaagtatcgtgatacttcacggatactggcaaaaagtaatgaggacttaaagaaa
aagcaatgtaaaatggagaaagacataatgtcagtattaagttacctgaagaagcaggat
caggagaaagataatatgattgaaaaactgaaacagcaattaaatgaaacaaaggaaaaa
gcccaagaggagaaggataaattggaacaaaagtataccaggcaaattaatgaactagag
ggacagttccatcaaaaagccaaagaaattggcatgattcacacagagctgaaagcagta
agacaattccagaagagaaaaatccaagtggagagagagttagatgatgagatcaatgat
ctgttggttaaggaaaagattatgcaacttgtccagcagagatcacaaatccaaaccctt
cagaagaaggtagtaaacttggagactgctctgagttacatgaccaaagagtttgagagt
gaagttttaaaactgcagcaacacgcaatgatagagaaccaagcaggtcaggtagaaatt
gacaagctgcagcaccttcttcagatgaaggacagggaaatgaatcgtgtgaagaagctg
gccaagaacatactggatgagagaacagaagtggaaagattctttttagatgctctgcac
caagtgaagcaacagatcctaattagcaggaagcattataagcagatagcacaagctgct
ttcaatttaaaaatgagagcagcatgtacaggaagaacagaatatcccaaaatcagaaca
tttgatggcagagagcacagcaccaatagtgtgaatcaggatcttctggaggccgaaaaa
tggacacatattgaaggaaatgtggatattggagatttgacctgggagcagaaggaaaaa
gtattgcgattgctctttgcaaaaatgaatggctgtccttctaggaaatacaaccagagt
tctaggcctccagttccagactatgttgtttctgacagtggggaaacaaaggaatttggg
gatgaaagtaagcttcaagataaaatcttcatcacccagcaaattgcaatatcagactct
tctggtgaagtggtgctacccactattccaaaagaacctcaggagtctgacacagtggga
agtcagagtcattacaacctagaggacaaaggcttataa

>gi568815584r:73960645_74184394|GENSCAN_predicted_peptide_4|640_aa
MAALLAAAAVRARILQAKTTLFGYSVPCRERPLCHCSLPPPYFTIRGDLRYQTSLKRVIE
SGSQSFPCSWPTKDLEYVARFTAGKKQSARNNSAVLIHYIQSENFLYIAELLGNISIHQV
SVSSKVKSSPTWYSASSFSSSVPTVKLFIGGKFVESKSDKWIDIHNPATNEVIGRVPQAT
KAEMDAAIASCKRAFPAWADTSVLSRQQVLLRYQQLIKENLKEIAKLITLEQGKTLADAE
GDVFRGLQVVEHACSVTSLMMGETMPSITKDMDLYSYRLPLGVCAGIAPFNFPAMIPLWM
FPMAMVCGNTFLMKPSERVPGATMLLAKLLQDSGAPDGTLNIIHGQHEAVNFICDHPDIK
AISFVGSNKAGEYIFERGSRHGKRVQANMGAKNHGVVMPDANKENTLNQLVGAAFGAAGQ
RCMALSTAVLVGEAKKWLPELVEHAKNLRVNAGDQPGADLGPLITPQAKERVCNLIDSGT
KEGASILLDGRKIKVKGYENGNFVGPTIISNVKPNMTCYKEEIFGPVLVVLETETLDEAI
QIVNNNPYGNGTAIFTTNGATARKYAHLVDVGQVGVNVPIPVPLPMFSFTGSRSSFRGDT
NFYGKQGIQFYTQLKTITSQWKEEDATLSSPAVVMPTMGR

>gi568815584r:73960645_74184394|GENSCAN_predicted_CDS_4|1923_bp
atggcggcgctattggcggcggcggcagtgcgagcccggatcctgcaggcgaaaactact
ctttttggttactctgtcccctgccgtgagcgtccactttgccattgctcccttccccca
ccctacttcactatccgtggtgatctgaggtaccaaacatctttaaaacgggtgatagaa
agtgggagccagtctttcccctgttcatggcctaccaaggatttagagtatgtggcaaga
tttaccgcaggcaagaaacagtctgcaagaaacaacagtgccgttcttattcattacatc
cagtctgagaatttcctctatattgctgagcttttaggtaatattagtattcaccaggtt
tccgtttcttccaaggtgaaatccagtcccacctggtattcagcatcttccttctcttct
tcagtgccaactgtaaagctcttcattggtgggaaattcgttgaatccaaaagtgacaaa
tggatcgatatccacaacccagccaccaatgaggtcattggtcgggtccctcaggccacc
aaggcagaaatggatgcagccattgcttcctgcaaacgtgcttttcctgcatgggcagac
acttcagtattaagccgccagcaggtcttgctccgctatcaacaacttattaaagaaaac
ttgaaagaaattgccaagttaatcacattggaacaagggaagaccctagctgatgctgaa
ggagatgtatttcgaggccttcaggtggttgagcatgcctgtagtgtgacatccctcatg
atgggagagaccatgccatccatcaccaaagacatggacctttattcctaccgtctgcct
ctgggagtgtgtgcaggcattgctccattcaattttcctgccatgatccccctttggatg
tttcccatggccatggtgtgtggaaataccttcctaatgaaaccatctgagcgagtccct
ggagcaactatgcttcttgctaagttgctccaggattctggtgcccctgatggaacatta
aacatcatccatggacagcatgaagctgtaaattttatttgcgatcatccggacatcaaa
gcaatcagctttgtgggatccaacaaggcaggagagtatatcttcgagagaggatcaaga
catggcaagagggttcaagccaatatgggagccaagaaccatggggtagtcatgccagat
gccaataaggaaaataccctgaaccagctggttggggcagcatttggagctgctggtcag
cgctgcatggctctttcaacagcagtccttgtgggagaagccaagaagtggctgccagag
ctggtggagcatgccaaaaacctgagagtcaatgcaggagatcagcctggagctgatctt
ggccctctgatcactccccaggccaaagagcgagtctgtaatctgattgatagtggaaca
aaggagggagcttccatccttcttgatggacgaaaaattaaagtgaaaggctatgaaaat
ggcaactttgttggaccaaccatcatctcgaatgtcaagccaaatatgacctgttacaaa
gaggagatttttggtccagttcttgtggttctggagacagaaacattggatgaagccatc
cagattgtaaataacaacccatatggaaatggaactgccatcttcaccaccaatggagcc
actgctcggaaatatgcccacttggtggatgttggacaggtgggagtgaatgtccccatt
ccagtgcctttgccaatgttctcattcaccggctctcgatcctccttcaggggagacacc
aatttctatggcaaacagggcatccaattctacactcagttaaagaccattacttctcag
tggaaagaagaagatgctactctttcctcacctgctgttgtcatgcctaccatgggccgt
tag

>gi568815584r:73960645_74184394|GENSCAN_predicted_peptide_5|300_aa
MAEIERDDIDMLKELGSLTTANLMEKVRGLQNLAYQLGLDECEYPDRICSGVYSTLSCVF
HPELRMAIIKETPKQKQKVTGIGKDMVKLEPSYIAGGYVKWYTPCGKLVFLQKVKRSSRN
QVLNALMDLGNDLSARGNEDLGFLCNSLANKASLCLMNFKGLAGSGSVMNEAKQFSRSIS
GKEESTVYRGSRKLLTPRTEGESTEELSGGPHKAEVQPSEGKTHGHGNTLPVHCRSSPPD
GSEESLGEGAPPLNCKDCHCQGLHSPVRKLEETNLGNLISPRRIKALTLEVLQLNSLITL

>gi568815584r:73960645_74184394|GENSCAN_predicted_CDS_5|903_bp
atggctgagatagaacgtgatgacatcgacatgttgaaagaactggggagtctcaccacg
gctaatttgatggagaaggttcgaggcctacagaacctagcctatcagctggggctggat
gagtgtgagtaccccgatcgcatttgttcaggggtgtattccaccctgagctgtgtattc
caccctgagctcaggatggctataattaaagaaaccccaaaacaaaaacagaaagtaaca
ggtattggcaaggatatggtgaaattagaaccttcatacattgctggtggctatgtaaaa
tggtatactccctgtgggaaactggtatttcttcaaaaggttaaaagaagttccaggaat
caagttttgaacgcattaatggacttgggaaatgatttgtcagcacgtggaaatgaggac
ttaggtttcttgtgtaacagcttggctaacaaagcaagtttgtgcctcatgaactttaag
ggccttgctgggagtggcagtgtgatgaatgaagcaaagcaattttccaggagcatttca
ggaaaagaagagtcaacggtgtacagaggtagcagaaagctgctaacccctaggactgaa
ggagagtcaacagaggaacttagtggaggtccccacaaggctgaggttcagccctctgaa
gggaaaacacatggccacgggaacactctgccggtgcattgccggagcagcccacctgat
ggcagtgaagaatctttgggggagggagccccaccactgaactgtaaggactgccattgc
caaggccttcactctccggtcaggaaattagaagagactaatctggggaatctgatcagc
ccaagaagaataaaggcactgacactggaagttctacaactaaacagcctgatcacactg
tag