GENSCAN 1.0 Date run: 8-Nov-116 Time: 02:13:18

Sequence gi568815593f:23409034_23627770 : 218737 bp : 35.61% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init -   2370   2267  104  0  2   90  101   110 0.161  12.30
 1.00 Prom -   9320   9281   40                              -3.15

 2.00 Prom +  12144  12183   40                              -5.35
 2.01 Sngl +  15216  15539  324  2  0   98   46   244 0.728  16.95
 2.02 PlyA +  17469  17474    6                               1.05

 3.00 Prom +  24186  24225   40                              -5.95
 3.01 Init +  29054  29192  139  1  1   84   22   137 0.414   7.05
 3.02 Intr +  53639  53774  136  0  1   88  115    63 0.524   7.71
 3.03 Intr +  67500  67617  118  0  1   74   55    69 0.348   1.75
 3.04 Intr +  83612  83759  148  1  1   51   70    69 0.095   0.29
 3.05 Intr +  97417  97596  180  2  0   80   22    99 0.014   1.42
 3.06 Intr +  98136  98294  159  1  0   44   29   137 0.050   2.44
 3.07 Intr +  99966 100069  104  1  2   17   81    92 0.059   0.37
 3.08 Intr + 100437 100560  124  2  1   94   60   122 0.992   9.24
 3.09 Intr + 100887 100994  108  1  0   77  116   148 0.922  15.84
 3.10 Intr + 108848 108897   50  0  2   43  115    57 0.665   1.38
 3.11 Intr + 111990 112146  157  2  1   28   86   103 0.677   2.76
 3.12 Intr + 113271 113372  102  1  0   77   84    88 0.936   6.53
 3.13 Intr + 113581 113852  272  2  2  102   99   240 0.999  22.84
 3.14 Intr + 114258 114325   68  2  2   47   94    56 0.716  -1.12
 3.15 Intr + 115301 115494  194  2  2   87   80   247 0.999  22.01
 3.16 Intr + 117200 118736 1537  0  1   79   80   758 0.627  61.68
 3.17 Intr + 121087 121261  175  1  1   81   36   107 0.196   3.82
 3.18 Term + 140864 140905   42  1  0  120   43    46 0.144  -0.52
 3.19 PlyA + 141305 141310    6                               1.05

 4.00 Prom + 174064 174103   40                              -1.95
 4.01 Init + 191529 191612   84  2  0   91   95    93 0.995  11.17
 4.02 Term + 194155 194367  213  0  0   64   42   154 0.982   4.65
 4.03 PlyA + 194517 194522    6                               1.05

 5.03 PlyA - 194850 194845    6                               1.05
 5.02 Term - 196781 196746   36  2  0  128   41    29 0.030  -1.44
 5.01 Init - 205541 205530   12  2  0  121   99     8 0.267   4.99

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 100001 100069   69  1  0   92   81    81 0.935   8.90

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:23409034_23627770|GENSCAN_predicted_peptide_1|35_aa
MGASLDQKLRGHPAGSGGVEVSDGSATTAISSGGR

>gi568815593f:23409034_23627770|GENSCAN_predicted_CDS_1|105_bp
atgggcgcctccctggatcagaagctcaggggacaccctgctggatccggaggggtggaa
gtcagtgatgggtctgcgacgacggcgatcagcagtggtggacgn

>gi568815593f:23409034_23627770|GENSCAN_predicted_peptide_2|107_aa
MGYSHYEDDLPNSGVDTSIGNEPLDSDSTDIHFSKVGRLLRTVEYAEFKSFSTRVARSLG
PVVLMQEKAIGPEASLEPEPSPFKKGCPSIPDGAWYTDGSSQGPTAA

>gi568815593f:23409034_23627770|GENSCAN_predicted_CDS_2|324_bp
atgggctacagtcattatgaggatgacttacccaatagtggggtggatacatccataggt
aatgaacccctggattcggacagcacagacatccactttagcaaagtggggcgattactg
agaacagtggagtatgctgagttcaagtcctttagcaccagagttgcaagaagtttggga
cctgtagtcctaatgcaagagaaggccattgggcctgaggcatccttagagcctgagcca
tcgccatttaagaaagggtgtccctccattcctgatggggcatggtacacagatgggtcc
agccaaggtcctactgctgcctga

>gi568815593f:23409034_23627770|GENSCAN_predicted_peptide_3|1270_aa
MQPEATRFLTSSTAPVDTITIVKPKIVVRDIHQALHSDGPAGTTQIEIWTVSRGLTRTLK
YPGHIKNVKLFIPPLGLQRQEANILLLDREVSGYTKFTFPPTVYKGSLFSTFSSTLNIEQ
QKKKGKCEGNLQTLLHQTLWAPQRVGSYPYHLDKQFFVAAQVSVGIVESPDARILEGCDK
RPMPSHSARAGAHTVQPLQPSLSASLHLSTLPVTQEYIGFPSTARAQPQAILLGVPRAVA
LKGTPTLPDENQCKNSGNSKRHSVPLEDPGLRRDVRSSPDARLSRKHRSWGKPKLEQGLL
DSPSTMSPEKSQEESPEEDTERTERKPMVKDAFKDISIYFTKEEWAEMGDWEKTRYRNVK
RNYNALITIGLRATRPAFMCHRRQAIKLQVDDTEDSDEEWTPRQQVKPPWMALRVEQRKH
QKGMPKASFSNESSLKELSRTANLLNASGSEQAQKPVSPSGEASTSGQHSRLKLELRKKE
TERKMYSLRERKGHAYKEVSEPQDDDYLYCEMCQNFFIDSCAAHGPPTFVKDSAVDKGHP
NRSALSLPPGLRIGPSGIPQAGLGVWNEASDLPLGLHFGPYEGRITEDEEAANNGYSWLI
TKGRNCYEYVDGKDKSWANWMRYVNCARDDEEQNLVAFQYHRQIFYRTCRVIRPGCELLV
WYGDEYGQELGIKWGSKWKKELMAGREPKPEIHPCPSCCLAFSSQKFLSQHVERNHSSQN
FPGPSARKLLQPENPCPGDQNQEQQYPDPHSRNDKTKGQEIKERSKLLNKRTWQREISRA
FSSPPKGQMGSCRVGKRIMEEESRTGQKVNPGNTGKLFVGVGISRIAKVKYGECGQGFSV
KSDVITHQRTHTGEKLYVCRECGRGFSWKSHLLIHQRIHTGEKPYVCRECGRGFSWQSVL
LTHQRTHTGEKPYVCRECGRGFSRQSVLLTHQRRHTGEKPYVCRECGRGFSRQSVLLTHQ
RRHTGEKPYVCRECGRGFSWQSVLLTHQRTHTGEKPYVCRECGRGFSWQSVLLTHQRTHT
GEKPYVCRECGRGFSNKSHLLRHQRTHTGEKPYVCRECGRGFRDKSHLLRHQRTHTGEKP
YVCRECGRGFRDKSNLLSHQRTHTGEKPYVCRECGRGFSNKSHLLRHQRTHTGEKPYVCR
ECGRGFRNKSHLLRHQRTHTGEKPYVCRECGRGFSDRSSLCYHQRTHTGEKPYVCREDDD
LSHDYVPGPNRGSGCLFSRPNNEMQTNGERMGKSLKAFPLRNEARLGYLISQQLFSTTQA
VESGSPLPHG

>gi568815593f:23409034_23627770|GENSCAN_predicted_CDS_3|3813_bp
atgcagccagaggccacaagattcctaacctcctcaactgctcctgtggataccatcact
attgtaaaacctaagattgttgttagagacattcatcaggccctgcattctgatggacca
gctggcaccacccagatcgagatctggacagtatcaagaggactcacaagaacgttgaaa
tatccagggcatataaaaaatgtgaagctttttataccacccttaggcctgcagaggcag
gaagcaaacatcttgttactagatcgtgaagtaagtggctataccaagtttacattccca
cctacagtgtacaagggatcccttttctccacattttcatcaacattaaatattgagcag
cagaaaaaaaagggaaaatgtgaaggcaacttacagacactcctgcaccaaacactctgg
gctccacaaagggtggggtcctacccctaccacctagacaaacagttctttgtagcagct
caggtgtctgttgggattgtagagtcccctgatgccaggatcctggaaggctgtgacaag
aggcccatgcctagccactctgcaagagcaggtgcacacacagtgcagcctctgcagccc
agcctgagtgcatcgctccacctgagtaccttacccgtgacccaggagtacattggattc
cccagcacagcgagagcccaacctcaagccatacttcttggtgtccccagggctgtggca
cttaaaggaacaccaaccctcccagatgagaatcagtgcaagaactctggcaattcaaaa
agacacagtgtccccttggaagacccaggcctgcggagggacgtgaggagcagccccgac
gcccgactcagtcgcaaacaccggagttggggaaaaccaaaattggagcagggccttcta
gacagtcccagcaccatgagccctgaaaagtcccaagaggagagcccagaagaagacaca
gagagaacagagcggaagcccatggtcaaagatgccttcaaagacatttccatatacttc
accaaggaagaatgggcagagatgggagactgggagaaaactcgctataggaatgtgaaa
aggaactataatgcactgattactataggtctcagagccactcgaccagctttcatgtgt
caccgaaggcaggccatcaaactccaggtggatgacacagaagattctgatgaagaatgg
acccctaggcagcaagtcaaacctccttggatggccttaagagtggaacagcgtaaacac
cagaagggaatgcccaaggcgtcattcagtaatgaatctagtttgaaagaattgtcaaga
acagcaaatttactgaatgcaagtggctcagagcaggctcagaaaccagtgtccccttct
ggagaagcaagtacctctggacagcactcaagactaaaactggaactcaggaagaaggag
actgaaagaaagatgtatagcctgcgagaaagaaagggtcatgcatacaaagaggtcagc
gagccgcaggatgatgattacctctattgtgagatgtgtcagaacttcttcattgacagc
tgtgctgcccatgggccccctacatttgtaaaggacagtgcagtggacaaggggcacccc
aaccgttcagccctcagtctgcccccagggctgagaattgggccatcaggcatccctcag
gctgggcttggagtatggaatgaggcatctgatctgccgctgggtctgcactttggccct
tatgagggccgaattacagaagacgaagaggcagccaacaatggatactcctggctgatc
accaaggggagaaactgctatgagtatgtggatggaaaagataaatcctgggccaactgg
atgaggtatgtgaactgtgcccgggatgatgaagagcagaacctggtggccttccagtac
cacaggcagatcttctatagaacctgccgagtcattaggccaggctgtgaactgctggtc
tggtatggggatgaatacggccaggaactgggcatcaagtggggcagcaagtggaagaaa
gagctcatggcagggagagaaccaaagccagagatccatccatgtccctcatgctgtctg
gccttttcaagtcagaaatttctcagtcaacatgtagaacgcaatcactcctctcagaac
ttcccaggaccatctgcaagaaaactcctccaaccagagaatccctgcccaggggatcag
aatcaggagcagcaatatccagatccacacagccgtaatgacaaaaccaaaggtcaagag
atcaaagaaaggtccaaactcttgaataaaaggacatggcagagggagatttcaagggcc
ttttctagcccacccaaaggacaaatggggagctgtagagtgggaaaaagaataatggaa
gaagagtccagaacaggccagaaagtgaatccagggaacacaggcaaattatttgtgggg
gtaggaatctcaagaattgcaaaagtcaagtatggagagtgtggacaaggtttcagtgtt
aaatcagatgttattacacaccaaaggacacatacaggggagaagctctacgtctgcagg
gagtgtgggcggggctttagctggaagtcacacctcctcattcaccagaggatacacaca
ggggagaagccctatgtctgcagggagtgtgggcggggctttagctggcagtcagtcctc
ctcactcaccagaggacacacacaggggagaagccctatgtctgcagggagtgtgggcgg
ggctttagccggcagtcagtcctcctcactcaccagaggagacacacaggggagaagccc
tatgtctgcagggagtgtgggcggggctttagccggcagtcagtcctcctcactcaccag
aggagacacacaggggagaagccctatgtctgcagggagtgtgggcggggctttagctgg
cagtcagtcctcctcactcaccagaggacacacacaggggagaagccctatgtctgcagg
gagtgtgggcggggctttagctggcagtcagtcctcctcactcaccagaggacacacaca
ggggagaagccctatgtctgcagggagtgtgggcggggctttagcaataagtcacacctc
ctcagacaccagaggacacacacaggggagaagccctatgtctgcagggagtgtgggcgg
ggctttcgcgataagtcacacctcctcagacaccagaggacacacacaggggagaagccc
tatgtctgcagggagtgtgggcggggctttagagataagtcaaacctcctcagtcaccag
aggacacacacaggggagaagccctatgtctgcagggagtgtgggcggggctttagcaat
aagtcacacctcctcagacaccagaggacacacacaggggagaagccctatgtctgcagg
gagtgtgggcggggctttcgcaataagtcacacctcctcagacaccagaggacacacaca
ggggagaagccctacgtctgcagggagtgtgggcggggctttagcgataggtcaagcctc
tgctatcaccagaggacacacacaggggagaagccctacgtctgcagggaggatgatgat
ctaagccatgactatgttcccggaccaaaccgagggtcgggctgcttgttctcacggccc
aataacgagatgcagacgaacggagagagaatggggaaaagcttaaaagcctttcctctg
agaaatgaagcaagattaggatacctaatatcacaacagttattcagcacgacccaagct
gtagaatctggaagccctcttcctcatggttga

>gi568815593f:23409034_23627770|GENSCAN_predicted_peptide_4|98_aa
MAITKSQTVTDIGEDKRKTEYLHTVGGPPLTICDLASHDKHLVTCEWASPSPPVLPSACG
AQKPHTAESSGKELAGGLGHAICTGLRISPFTTSYLWL

>gi568815593f:23409034_23627770|GENSCAN_predicted_CDS_4|297_bp
atggctatcacaaaaagtcaaacagtaaccgatatcggcgaagataaaaggaaaactgaa
tacttacacactgttggtgggcctcctttgacgatctgcgaccttgcgtcacatgataaa
cacctggtcacctgtgaatgggcttctccaagccctcctgtcctgccttctgcctgtggt
gcgcagaagccacacaccgctgaaagctctgggaaagaactggctggtggacttggacat
gctatatgtaccggtctcagaatctcacccttcacaacaagctatttgtggctttaa

>gi568815593f:23409034_23627770|GENSCAN_predicted_peptide_5|15_aa
MAILVFITPNSRLHQ

>gi568815593f:23409034_23627770|GENSCAN_predicted_CDS_5|48_bp
atggctattctcgtattcattactccaaattcaagacttcatcagtaa