GENSCAN 1.0	Date run: 5-Nov-116	Time: 14:16:15

Sequence gi568815579f:47908668_48122324 : 213657 bp : 45.85% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3381   3572  192  0  0   89   43   112 0.638   5.41
 1.02 Intr +   8754   8945  192  0  0   81   53   116 0.316   6.01
 1.03 Intr +  14105  14296  192  2  0   81   53    92 0.540   3.61
 1.04 Intr +  19149  19340  192  0  0   89   43   110 0.726   5.21
 1.05 Intr +  24507  24698  192  0  0   81   53    92 0.059   3.61
 1.06 Intr +  29862  30053  192  0  0   89   43   112 0.363   5.41
 1.07 Intr +  35248  35439  192  1  0   81   53    92 0.125   3.61
 1.08 Intr +  40619  40810  192  2  0   89   43   112 0.494   5.41
 1.09 Intr +  46007  46198  192  2  0   89   43   112 0.357   5.41
 1.10 Intr +  46891  47000  110  1  2   99   82    31 0.565   3.53
 1.11 Term +  48677  48717   41  0  2   91   43    55 0.544  -1.35
 1.12 PlyA +  48746  48751    6                               1.05

 2.05 PlyA -  49970  49965    6                               1.05
 2.04 Term -  68187  68045  143  1  2  132   45    45 0.980   2.69
 2.03 Intr -  68837  68706  132  0  0   25  113    75 0.942   4.32
 2.02 Intr -  70932  70903   30  1  0   89  111    -4 0.208   0.10
 2.01 Init -  82601  82505   97  2  1   86  100   135 0.550  13.00
 2.00 Prom -  88378  88339   40                              -4.56

 3.00 Prom +  98445  98484   40                              -5.96
 3.01 Init + 102859 103108  250  0  1   66   50   429 0.012  34.53
 3.02 Intr + 105504 105641  138  1  0  126  113   105 0.968  17.24
 3.03 Intr + 107226 107372  147  1  0   77   89    91 0.973   8.31
 3.04 Intr + 111052 111232  181  2  1   96   55    88 0.386   5.23
 3.05 Intr + 117164 117267  104  2  2   93   81    60 0.081   5.62
 3.06 Term + 117608 117651   44  0  2   85   48    -1 0.020  -7.08
 3.07 PlyA + 119281 119286    6                               1.05

 4.23 PlyA - 119958 119953    6                               1.05
 4.22 Term - 121915 121890   26  2  2  103   55    17 0.391  -1.71
 4.21 Intr - 125695 125548  148  1  1   76   56   169 0.769  12.31
 4.20 Intr - 130650 130541  110  1  2   78  105   183 0.997  19.00
 4.19 Intr - 132081 131938  144  1  0   68   84   152 0.977  13.05
 4.18 Intr - 132936 132906   31  0  1  101   87    21 0.859   1.00
 4.17 Intr - 135249 135193   57  0  0   62   76    67 0.256   1.98
 4.16 Intr - 139586 139280  307  1  1   98   63   285 0.317  23.45
 4.15 Intr - 144480 144330  151  1  1   74   77   159 0.714  12.62
 4.14 Intr - 146382 146211  172  0  1   85   92   194 0.974  19.02
 4.13 Intr - 150889 150848   42  1  0   94   92     9 0.461   0.34
 4.12 Intr - 153485 153331  155  0  2   90   74   231 0.536  21.69
 4.11 Intr - 159219 159124   96  1  0  108   93    29 0.961   5.38
 4.10 Intr - 166207 166067  141  2  0   41   98    64 0.183   3.02
 4.09 Intr - 169157 169104   54  0  0  112  121    -1 0.091   4.55
 4.08 Intr - 180045 180019   27  1  0  115  121    11 0.041   5.19
 4.07 Intr - 186937 186797  141  2  0  114   60    23 0.157   2.42
 4.06 Intr - 189592 189472  121  1  1   70  115    95 0.998  10.47
 4.05 Intr - 191193 191004  190  2  1  148  100   149 0.999  21.69
 4.04 Intr - 196057 195921  137  1  2   76   99    90 0.994   8.27
 4.03 Intr - 196777 196666  112  0  1   31  107   121 0.973   8.68
 4.02 Intr - 202011 201813  199  1  1   68    0   257 0.118  13.31
 4.01 Init - 203816 203672  145  2  1   86   96    33 0.822   2.73
 4.00 Prom - 205799 205760   40                              -9.65

 5.06 PlyA - 206800 206795    6                               1.05
 5.05 Term - 207065 206982   84  2  0  154   46    94 0.999   9.45
 5.04 Intr - 207298 207206   93  1  0   96   94   147 0.992  16.26
 5.03 Intr - 209114 208971  144  2  0  120  105    61 0.949  11.48
 5.02 Intr - 210523 210470   54  1  0  133  105    96 0.999  14.98
 5.01 Intr - 212655 212503  153  0  0  113   29   345 0.991  31.37

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl - 116913 116788  126  0  0  103   37   150 0.803   4.28
S.002 Init - 197862 197855    8  0  2   85  115     0 0.809   2.92
S.003 Term - 202011 201737  275  1  2   68   42   233 0.815  12.33

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815579f:47908668_48122324|GENSCAN_predicted_peptide_1|626_aa
XHTGLLTVPVAAGMCPHRASVLAVPTARTPSCTICTVQYGRLWPHMATEQLNMAGPNQDF
QDVVCHTGLLTVPVAAGMCPHWASVLAVPTARTPSCTICTVQYGRLWPHMATEQLNMAGP
NQDFQDVVCHTGLLTVPVAAGMCPHWASVLAVPTARTPSCTICTVQYGRLWPHMATEQLN
MAGPNQHFQDVVCHTGLLTVPVEAGMCPHRASVLAVPTARTPSCTICTVQYGRLWPHMAT
EQLNMAGPNQDFQDVVCHTGLLTVPVAAGMCPHWASVLAVPTARTPSCTICTVQYGRLWP
HMATEQLNMAGPNQHFQDVVCHTGLLTVPVAAGMCPHRASVLAVPTARTPSCTICTVQYG
RLWPHMATEQLNMAGPNQDFQDVVCHTGLLTVPVAAGMCPHWASVLAVPTARTPSCTICT
VQYGRLWPHMATEQLNMAGPNQHFQDVVCHTGLLTVPVAAGMCPHRASVLAVPTARTPSC
TICTVQYGRLWPHMATEQLNMAGPNQDFQDVVCHTGLLTVPVAAGMCPHRASVLAVPTAR
TPSCTICTVQYGRLWPHMATEQLNMAGPNQDFQDVVWGRHGTQGPHWPWGLGDVRGEPLV
IGSLTLRKIKGPKVSFSTPYLDKDKK

>gi568815579f:47908668_48122324|GENSCAN_predicted_CDS_1|1881_bp
nnccacacggggctcctcactgttcccgtagcagcaggcatgtgcccccacagggcctct
gtactggctgttcccactgcccgaacaccctcatgcaccatctgcactgtccaatacggc
cgcctctggccacacatggctactgagcagttgaacatggctggtccaaaccaagatttc
caagacgtcgtgtgccacacggggctcctcactgttcccgtagcagcaggcatgtgcccc
cactgggcctctgtactggctgttcccactgcccgaacaccctcatgcaccatctgcact
gtccaatacggccgcctctggccacacatggctactgagcagttgaacatggctggtcca
aaccaagatttccaagacgtcgtatgccacacggggctcctcactgttcccgtagcagca
ggcatgtgcccccactgggcctctgtactggctgttcccactgcccgaacaccctcatgc
accatctgcactgtccaatacggccgcctctggccacacatggctactgagcagttgaac
atggctggtccaaaccaacatttccaagacgtcgtatgccacacggggctcctcactgtt
cccgtagaagcaggcatgtgcccccacagggcctctgtactggctgttcccactgcccga
acaccctcatgcaccatctgcactgtccaatacggccgcctctggccacacatggctact
gagcagttgaacatggctggtccaaaccaagatttccaagacgtcgtgtgccacacgggg
ctcctcactgttcccgtagcagcaggcatgtgcccccactgggcctctgtactggctgtt
cccactgcccgaacaccctcatgcaccatctgcactgtccaatacggccgcctctggcca
cacatggctactgagcagttgaacatggctggtccaaaccaacatttccaagacgtcgta
tgccacacggggctcctcactgttcccgtagcagcaggcatgtgcccccacagggcctct
gtactggctgttcccactgcccgaacaccctcatgcaccatctgcactgtccaatacggc
cgcctctggccacacatggctactgagcagttgaacatggctggtccaaaccaagatttc
caagacgtcgtgtgccacacggggctcctcactgttcccgtagcagcaggcatgtgcccc
cactgggcctctgtactggctgttcccactgcccgaacaccctcatgcaccatctgcact
gtccaatacggccgcctctggccacacatggctactgagcagttgaacatggctggtcca
aaccaacatttccaagacgtcgtatgccacacggggctcctcactgttcccgtagcagca
ggcatgtgcccccacagggcctctgtactggctgttcccactgcccgaacaccctcatgc
accatctgcactgtccaatacggccgcctctggccacacatggctactgagcagttgaac
atggctggtccaaaccaagatttccaagacgtcgtgtgccacacggggctcctcactgtt
cccgtagcagcaggcatgtgcccccacagggcctctgtactggctgttcccactgcccga
acaccctcatgcaccatctgcactgtccaatacggccgcctctggccacacatggctact
gagcagttgaacatggctggtccaaaccaagatttccaagacgtcgtgtgggggcgccat
ggaacgcagggccctcactggccctggggactgggtgacgtcaggggtgagcctctggtg
attggctccctcaccctgcgtaagatcaaagggcctaaagttagcttttctacaccctac
ctggataaggataagaaatag

>gi568815579f:47908668_48122324|GENSCAN_predicted_peptide_2|133_aa
MARHMGLLLVWVCLIRGVVGGVVGAVFNVLEEETITHFPEVTDGECVFPFHYKNGTYYDC
IKSKARHKWCSLNKTYEGYWKFCSAEDFANCVFPFWYRRLIYWECTDDGEAFGKKWCSLT
KNFNKDRIWKYCE

>gi568815579f:47908668_48122324|GENSCAN_predicted_CDS_2|402_bp
atggcccggcacatggggctcctgctggtttgggtctgcctgattcgcggtgtggtcggt
ggtgtggtcggcgctgtatttaacgttctggaagaagaaactataactcattttccagaa
gttacagatggggagtgtgtctttccattccactataaaaatggaacatattatgactgc
atcaagtccaaggcaagacacaagtggtgctcgttaaacaagacctacgaaggatactgg
aagttttgcagtgcagaagattttgcaaactgtgtatttcccttctggtacagacgcttg
atctactgggagtgtactgatgatggggaagcatttgggaaaaaatggtgttcactgacc
aagaattttaacaaggaccgaatttggaaatactgtgaatga

>gi568815579f:47908668_48122324|GENSCAN_predicted_peptide_3|287_aa
MTDNDDNDDDDNDDHHHMIMTIMRWIMMTMMMTTMMMIIILMTMTIMRLMMVTMTDDDDH
DNDDEHYDDANDNSVENDDDDSDGMHEECVFPFTYKGSVYFTCTHIHSLSPWCATRAVYN
GQWKYCQSEDYPRCIFPFIYRGKAYNSCISQGSFLGSLWCSVTSVFDEKQQWKFCETNEY
GGNSLRKPCIFPSIYRNNVVSDCMEDESNKLWCPTTENMDKDGKWSFCADTSNLGMGVGL
LLPSLIVRGIAPVMEIDVGFLDPPSRKDLLPSSKEEPWGTQPETAHL

>gi568815579f:47908668_48122324|GENSCAN_predicted_CDS_3|864_bp
atgactgataatgatgacaatgatgatgatgacaatgatgatcatcatcacatgataatg
acaataatgaggtggattatgatgacaatgatgatgacaacgatgatgatgatcatcatc
ctgatgacaatgacaataatgagactgatgatggtgacaatgactgatgatgatgatcat
gacaatgatgatgagcattatgatgatgccaatgacaatagcgtggagaatgatgatgat
gacagtgatgggatgcatgaggaatgtgtctttcctttcacctacaagggatctgtttac
ttcacttgcacccatattcatagcttatccccttggtgtgccaccagagccgtgtacaac
ggccagtggaagtactgccagagtgaagattacccacgctgtatcttccctttcatctat
cgaggaaaggcttataacagctgcatctcccagggcagcttcttaggcagtctgtggtgc
tcagtcacctctgtcttcgatgagaaacagcagtggaaattctgtgaaacgaatgagtat
gggggaaattctctcaggaagccctgcatcttcccctccatctacagaaataatgtggtc
tctgattgcatggaggatgaaagcaacaagctctggtgcccaaccacagagaacatggat
aaggatggaaagtggagtttctgtgccgacaccagtaatctggggatgggggttggtctc
ctcctcccttcactcattgtccgtggaattgcccctgtgatggagatagacgtgggcttc
ctagatcccccttcaagaaaggacttgctgcccagctccaaggaggagccctggggaacc
caacctgagacagctcatctttaa

>gi568815579f:47908668_48122324|GENSCAN_predicted_peptide_4|901_aa
MVPLGLALPPCTSAGPQEHICHTRPPCPSAEASQGHPHSSVTLMMAPPGGKGKQEDHRSF
GTEDPRSLHPRRFRLKELSSDYTAKRREPKPPHRIPEHKLHRACAHAPDPGSGKPSEVSI
IPGLQKEEKAAVERRRLHVLKALKKLRIEADEAPVVAVLGSGGGLRAHIACLGVLSEMKE
QGLLDAVTYLAGVSGSTWAISSLYTNDGDMEALEADLKHRFTRQEWDLAKSLQKTIQAAR
SENYSLTDFWAYMVISKQTRELPESHLSNMKKPVEEGTLPYPIFAAIDNDLQPSWQEARA
PETWFEFTPHHAGFSALGAFVSITHFGSKFKKGRLVRTHPERDLTFLRDQLRNLTLKARL
LRLQESSQGEHPPPEDEGGEPEHTWLTEMLENWTRTSLEKQEQPHEDPERKGTMCSVISV
INSSLSNLMDFVKKTGICASKWEWGTTHNFLYKHGGIRDKIMSSRKHLHLVDAGLAINTP
FPLVLPPTREVHLILSFDFSAGDPFESHWQPIGGCPPTLWTIRATTDYCRRHKIPFPQVE
EAELDLWSKAPASCYILKGETGPVVMHFPLFNIDACGGDIEAWSDTYDTFKLADTYTLDV
VVLLLALAKKNVRENKKKILRELMNVAGHTLHGLEFTLAVLTGPITSDQLDCDFDSVIQK
KASKELKVVKFVLQVPREILELEHECLTIRSIMSNVQMARMNVIVQTNAFHCSFMTALLA
SSSAQVSSVEFPMGPACIFLRKGIAEKQRERPLGQDEIEELREAFLEFDKDRDGFISCKD
LGNLMRTMGYMPTEMELIELGQQIRMNLGGRVDFDDFVELMTPKLLAETAGMIGVQEMRD
AFKEFDTNGDGEITLVELQQAMQRLLGERLTPREISEVVREADVNGDGTVDFEEFVKMMS
R

>gi568815579f:47908668_48122324|GENSCAN_predicted_CDS_4|2706_bp
atggttcctttgggcttggctcttcctccctgcaccagtgcagggccccaggaacacatc
tgccacacacggccgccatgtccctctgctgaagctagtcagggtcatcctcactcctca
gtgaccctcatgatggctccaccagggggaaaaggaaagcaggaggaccacagaagcttt
ggcaccgaggatccccgcagtcttcacccgcggagattccggctgaaggagctgtccagc
gactacaccgctaagcgcagggagcccaagcctccgcaccggattccggagcacaagctc
caccgcgcatgcgcacacgccccagacccaggctcaggcaagccctctgaagtttccata
attcctgggctccagaaagaagaaaaggcggccgtggagagacgaagacttcatgtgctg
aaagctctgaagaagctaaggattgaggctgatgaggccccagttgttgctgtgctgggc
tcaggcggaggactgcgggctcacattgcctgccttggggtcctgagtgagatgaaagaa
cagggcctgttggatgccgtcacgtacctcgcaggggtctctggatccacttgggcaata
tcttctctctacaccaatgatggtgacatggaagctctcgaggctgacctgaaacatcga
tttacccgacaggagtgggacttggctaagagcctacagaaaaccatccaagcagcgagg
tctgagaattactctctgaccgacttctgggcctacatggttatctctaagcaaaccaga
gaactgccggagtctcatttgtccaatatgaagaagcccgtggaagaagggacactaccc
tacccaatatttgcagccattgacaatgacctgcaaccttcctggcaggaggcaagagca
ccagagacctggttcgagttcacccctcaccacgctggcttctctgcactgggggccttt
gtttccataacccacttcggaagcaaattcaagaagggaagactggtcagaactcaccct
gagagagacctgactttcctgagagaccagttaaggaatctgaccctgaaagcccgatta
ctgaggctgcaagaaagttcacaaggggaacatcctcccccagaagatgaaggcggtgag
cctgaacacacctggctgactgagatgctcgagaattggaccaggacctccctggaaaag
caggagcagccccatgaggaccccgaaaggaaaggtaccatgtgtagtgtgataagtgtc
atcaacagctcactcagtaacttgatggattttgtgaagaaaacaggcatttgcgcttca
aagtgggaatgggggaccactcacaacttcctgtacaaacacggtggcatccgggacaag
ataatgagcagccggaagcacctccacctggtggatgctggtttagccatcaacactccc
ttcccactcgtgctgcccccgacgcgggaggttcacctcatcctctccttcgacttcagt
gccggagatcctttcgagtcacactggcagccaattggagggtgtccgcccacactgtgg
accatccgggctaccactgactactgccgccgccacaagatcccctttccccaagtagaa
gaggctgagctggatttgtggtccaaggcccccgccagctgctacatcctgaaaggagaa
actggaccagtggtgatgcattttcccctgttcaacatagatgcctgtggaggtgatatt
gaggcatggagtgacacatacgacacattcaagcttgctgacacctacactctagatgtg
gtggtgctactcttggcattagccaagaagaatgtcagggaaaacaagaagaagatcctt
agagagttgatgaacgtggccggccacacgcttcatggccttgagttcaccttggctgtc
ctaacagggccaatcaccagtgaccagctagactgtgattttgatagcgtcattcagaag
aaggcgtccaaggagctgaaggtggtgaaatttgtcctgcaggtccctcgggagatcctg
gagctggagcatgagtgtctgacaatcagaagcatcatgtccaatgtccagatggccaga
atgaatgtgatagttcagaccaatgccttccactgctcctttatgactgcacttctagcc
agtagctctgcacaagttagctctgtagaattccccatgggccccgcctgcatcttcttg
aggaaaggcattgctgagaaacagcgggaaagaccactgggacaagatgagattgaagag
ctgcgggaagcatttcttgagttcgataaggaccgagatgggttcatctcttgtaaggat
ctggggaatctcatgaggacgatgggttacatgcccacggagatggaactgattgagctc
ggccagcaaatccgcatgaacctgggtggccgtgtagactttgatgactttgtggagctg
atgacccccaaattgcttgcagaaacagctgggatgatcggtgtccaggagatgcgggat
gccttcaaggagtttgacacgaatggagatggggagatcaccctggtggagctacagcag
gccatgcagagactcctgggggagcggctcaccccccgggagatctctgaggttgtccgg
gaggctgatgttaatggagacggcacagttgactttgaagagtttgtgaagatgatgtct
cgctga

>gi568815579f:47908668_48122324|GENSCAN_predicted_peptide_5|175_aa
LKKDYLDGVGDTLDLVVIGAYLGRGKRAGRYGGFLLASYDEDSEELQAICKLGTGFSDEE
LEEHHQSLKALVLPSPRPYVRIDGAVIPDHWLDPSAVWEVKCADLSLSPIYPAARGLVDS
DKGISLRFPRFIRVREDKQPEQATTSAQVACLYRKQSQIQNQQGEDSGSDPEDTY

>gi568815579f:47908668_48122324|GENSCAN_predicted_CDS_5|528_bp
ctgaagaaggactaccttgatggcgtgggtgacaccctggacctggtggtgatcggcgcc
tacctgggccgggggaagcgggccggccggtacgggggcttcctgctggcctcctacgac
gaggacagtgaggagctgcaggccatatgcaagcttggaactggcttcagtgatgaggag
ctggaggagcatcaccagagcctcaaggcgctggtgctgcccagcccacgcccttacgtg
cggatagatggcgctgtgattcccgaccactggctggaccccagcgctgtgtgggaggtg
aagtgcgctgacctctccctctctcccatctaccctgctgcgcggggcctggtggatagt
gacaagggcatctcccttcgcttccctcggtttattcgagtccgtgaagacaagcagccg
gagcaggccaccaccagtgctcaggtggcctgtttgtaccggaagcaaagtcagattcag
aaccaacaaggcgaggactcaggctctgaccctgaagatacctactaa