GENSCAN 1.0 Date run: 4-Nov-116 Time: 09:35:16

Sequence gi568815592f:87377360_87611523 : 234164 bp : 38.68% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  15360  15483  124  2  1   93   94   101 0.896  11.58
 1.02 Intr +  19340  19453  114  0  0   81   40    62 0.304   0.20
 1.03 Intr +  20884  21018  135  2  0   83  109    65 0.958   7.72
 1.04 Intr +  30473  30730  258  0  0   47   76   108 0.158   2.01
 1.05 Intr +  30888  30961   74  1  2   67   18    70 0.289  -3.89
 1.06 Term +  31008  31268  261  2  0   48   42   359 0.691  21.84
 1.07 PlyA +  32178  32183    6                               -1.75

 2.00 Prom +  32218  32257   40                              -12.33
 2.01 Init +  32481  32588  108  2  0   80   95   123 0.999  12.47
 2.02 Intr +  33226  33309   84  0  0  114   55    89 0.963   7.30
 2.03 Intr +  34979  35061   83  1  2  119   44    51 0.209   1.32
 2.04 Intr +  38265  38515  251  0  2   -5   98   184 0.142   6.26
 2.05 Intr +  39310  39468  159  2  0   78   37   184 0.718  11.34
 2.06 Intr +  40849  41057  209  2  2   48   97   231 0.531  17.87
 2.07 Intr +  48532  48703  172  0  1   20   94    83 0.867   0.69
 2.08 Intr +  49022  49092   71  0  2   58   80    80 0.076   2.18
 2.09 Intr +  49167  49286  120  2  0   56   78    98 0.045   5.37
 2.10 Term +  86661  86891  231  2  0  114   32   182 0.912  10.69
 2.11 PlyA +  87082  87087    6                                1.05

 3.04 PlyA -  87108  87103    6                                1.05
 3.03 Term -  95587  95421  167  2  2   85   43   144 0.974   6.70
 3.02 Intr - 102658 101989  670  1  1   -5   89   248 0.682   5.94
 3.01 Init - 103471 103124  348  1  0   83   37   184 0.478  10.03
 3.00 Prom - 110337 110298   40                               -6.05

 4.00 Prom + 115127 115166   40                               -4.55
 4.01 Init + 123264 123308   45  2  0   92  111    21 0.772   5.65
 4.02 Intr + 123799 123951  153  0  0  113   83    36 0.957   4.95
 4.03 Intr + 129023 129089   67  1  1   64   99    45 0.918   0.76
 4.04 Intr + 131061 131237  177  1  0   91   84    54 0.702   4.27
 4.05 Intr + 131682 131816  135  1  0   16   44   136 0.516   1.62
 4.06 Term + 134040 134167  128  1  2  105   34    34 0.495  -2.84
 4.07 PlyA + 134255 134260    6                                1.05

 5.08 PlyA - 134968 134963    6                                1.05
 5.07 Term - 137140 137054   87  1  0  103   28    68 0.564  -1.02
 5.06 Intr - 137661 137598   64  2  1  101   71    41 0.527   1.50
 5.05 Intr - 141380 141271  110  2  2   86   94     4 0.272  -1.04
 5.04 Intr - 144227 144105  123  2  0   39  103    67 0.681   3.16
 5.03 Intr - 152289 152173  117  0  0   59   32    89 0.415   0.14
 5.02 Intr - 153583 153425  159  1  0   87   67   234 0.986  20.46
 5.01 Init - 164642 164559   84  2  0   48  100    74 0.954   5.47
 5.00 Prom - 165083 165044   40                               -3.35

 6.07 PlyA - 165641 165636    6                                1.05
 6.06 Term - 175833 175674  160  2  1  124   39    63 0.928   1.43
 6.05 Intr - 178146 178049   98  0  2   76   94    88 0.904   6.09
 6.04 Intr - 185426 185343   84  2  0   94  116     0 0.826   2.50
 6.03 Intr - 186873 186771  103  2  1   62  102   100 0.300   8.06
 6.02 Intr - 197988 197920   69  2  0  106   93    20 0.199   1.68
 6.01 Intr - 212929 212739  191  1  2   99   88   118 0.138  10.36

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr +  53674  53814  141  2  0   93   76    58 0.900   4.50

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:87377360_87611523|GENSCAN_predicted_peptide_1|321_aa
MKQCMYEEVPTGVCLRDEAKQQRSKQERLQQAGDMQSLHGGGVYLIIAEAGISFVCSQPH
LAIPVKPSQLSTPIVMPLVVFFQLPAPHPPTVLRPQLGLHPNPECDREKMSVRDHDPEVL
TRNSVLTTEVKMSLPEIPLFLKFFQPIFWGKDPQSGTPPHPHLAPPPLVRSLPSPWLRGG
GCERPTAPTVAVRAPGQKQTDTAPAVDSDPELEQRSIPAGGATPGLQGCPLAFARKPDLG
RLSNWRVGAGKVLSRVPSDPAAARMRTQIRSAHTDPERAHRSGARTQIRSAHTDPERAHR
SGARTQIRSAHTDPERDGLSH

>gi568815592f:87377360_87611523|GENSCAN_predicted_CDS_1|966_bp
atgaagcagtgtatgtatgaggaagtgcccacaggagtgtgcctccgggatgaagcaaag
caacagagatccaagcaggagaggctgcaacaggctggagatatgcagagtcttcatggg
ggaggtgtgtatctaataattgctgaagctggaatttccttcgtgtgcagtcagccacac
ctcgctatacctgtcaaaccatctcaattatccactccaattgtcatgcctctcgtggtg
ttcttccagctccctgccccacatccacccactgtcctcagacctcagctggggctgcac
cccaaccctgagtgtgacagagagaagatgagtgttagagatcatgatcctgaagtgctg
acgagaaattcagtcttaaccacggaggttaagatgtctctgcctgaaattcctttgttt
ttaaagtttttccagcccatcttttgggggaaagaccctcaaagtgggaccccgccccac
ccccacctggccccgccccctctggtccggagccttccatctccatggttacgcggcggt
ggctgcgagcgcccaactgctccgaccgtcgcggtgagggccccaggacagaagcagaca
gacacggctcctgctgtcgattccgatccagagctggaacagcggtccattcctgccgga
ggcgccaccccagggctgcagggctgccctctagcctttgcgcggaagccggacctcggg
agacttagtaattggagggtgggtgcagggaaggtgctgagccgcgtccctagcgacccg
gcagcggcgcgcatgcgcacacagatccggagcgcgcacacagatccggagcgcgcacac
agatccggagcgcgcacacagatccggagcgcgcacacagatccggagcgcgcacacaga
tccggagcgcgcacacagatccggagcgcgcacacagatccggagcgcgacggactctcc
cactga

>gi568815592f:87377360_87611523|GENSCAN_predicted_peptide_2|495_aa
MPPTQAESVIRSIIREIGQECAAHGEIVSETLIAFMVKAVVLDPSNGFNMDRTLMKSDVQ
NLVKESRAGAQGDGLRVMLPSACQLQYIQPTDSYVRRILKIKYIEHCYLAILVEFLEEHH
RVLESRLGSVTREITDNRACAKEELESLYRKIISYVLLRSGLGSPTDIKTVREVTAALQS
VFPQAELGTFLTLSKKDKERQLKELTMIVTGIRLFNRDCGKGGEGIDDLPAVLHVAIPAT
MQHIDYQLETARSQVYRYTAILEKAANDPLMRAELQPYMLKEALYNIRQYEVFLQIILVP
SNQENNIVNLYSTSSSNSHCLVSHFNDELYDRVFKNIPCSVQGFWHPLNHMAYITGVEDL
AENRLPAVIEPTFRGCGEQSDIITGAQEVEMMTKQLGAHLEQLKMTIKSKIAVPTSQVFA
NLRQKVTHSVQTDLSHLRRENCSQVYPPKDTSTQSMREDSTGVPRPQIYLAGLRGGKSEI
TDEVKVNLTRDVDET

>gi568815592f:87377360_87611523|GENSCAN_predicted_CDS_2|1488_bp
atgcctccaactcaggccgaaagtgttataaggagtattatacgagaaataggacaagaa
tgtgcagcccatggagagattgtttctgaaactctgattgcttttatggtgaaagctgtt
gtcctggatccaagtaatggctttaacatggatagaaccctcatgaaaagtgatgtgcag
aatcttgttaaggaaagcagggctggtgctcagggagatgggttgagagtgatgctgcca
agtgcctgccagctgcagtatattcagcctacagattcttacgtaagaagaattctaaaa
attaaatatatagaacattgttatttggcaattttagtggaatttctcgaagaacatcac
cgggtcctagagtctagattaggctctgttacccgagaaattacagataacagagcatgt
gctaaagaagaattggaaagcctctaccggaagattatcagctatgtgttactccgctct
ggccttggatcccctacagacatcaagactgtcagagaggtaacagctgctctacagagt
gtttttcctcaggcagagcttgggacatttctaactctttctaagaaggacaaagaacgc
cagctgaaagaactcaccatgattgttactggaattcgtttatttaacagagactgtgga
aaaggaggagaaggcattgatgatttgccagctgttctccatgtagcaatcccagccacc
atgcagcatattgattaccagcttgagactgcccggagccaggtataccgctacacagcc
atccttgagaaggcagccaacgacccactcatgagggctgaacttcagccatatatgtta
aaagaagcgctatataatatacgacaatatgaggtcttccttcagatcattttggtccct
tcaaatcaagaaaacaatattgtaaacttgtactcaacctcatcttccaattcacattgc
cttgtgagtcatttcaatgatgaactctatgacagggtcttcaagaatatcccctgctca
gtacaaggattctggcatcccctaaaccacatggcatacatcacaggtgttgaggatctt
gcggagaacagacttcctgctgtcattgagcctactttcagaggatgtggggagcagtca
gatataattactggtgctcaagaagtggaaatgatgacaaaacagttaggagcccatctg
gaacaactaaaaatgaccataaaatcaaagatagcggtcccaacatcacaagtctttgct
aatttgcgccagaaagttactcactcagtacaaactgatcttagtcacttgagaagagaa
aattgttcccaagtgtaccctccaaaggacactagcacccagtccatgagggaagacagc
actggggtgcccaggcctcagatttacttggctggtcttcgtggaggaaagagcgaaatc
accgatgaggtcaaggtgaacttaactagagatgtggatgaaacctaa

>gi568815592f:87377360_87611523|GENSCAN_predicted_peptide_3|394_aa
MDHCSWIKPYQIKAKKSLVYLSFPSFPNTVPISITVPTTSNSNPTLERFYGLGAEVTGKD
PIGFFKMRFVLPALPPTTAPFTNLRNKTMPRLMPNDKSKVSVVEIGDLRQTIAIDRSSSY
ALVQLAIPFTLAFHQPEEGKIRHHKVREAPYRSFNSHVYLDAIGVPRGIPDQFKARNQIA
AGFESIFWWVTINKNIDWINYIYYNQQQFMNYTRDAVKGIAEQLGTNCQMAWENRIALDM
ILAERGGVCIMIKTECCAFIPNNTAPNGSITKALQGLTALSNELASSSGVNDPFTGWLEK
WFGKWKGITASILTSLTAVMGVLILVGCCVIPCICGLVQRHRGPPLVVIETKPLGLERLA
GLPVGHALKLGSGIQATPQNGENADRREAFPAAS

>gi568815592f:87377360_87611523|GENSCAN_predicted_CDS_3|1185_bp
atggaccattgttcctggatcaagccttaccaaattaaagctaagaaaagcttagtctat
ctatcttttccttcctttcctaacacagtgcctatatccattactgttcctaccactagc
aactctaaccccactttagagcgtttctatggtttaggagcagaagtcactggaaaggac
cctataggcttctttaagatgcgctttgttctccctgctctacctcctacaactgcccct
ttcacaaacctacgaaataaaactatgcctcgcctcatgccaaatgacaaaagcaaggtc
tcagtagtagaaataggagacctaaggcaaaccatagctattgacaggagcagtagttat
gctctagttcaattggctatcccttttactctggcatttcatcaaccagaggaaggaaaa
ataagacatcataaagtgagagaagctccttataggtcattcaactctcatgtctattta
gatgcaattggagtcccacggggaataccagatcaatttaaagcccgaaatcaaatagct
gcaggatttgagtcaatattttggtgggtcacaattaataaaaacatagattggataaac
tacatctattataaccaacagcaatttatgaactacactagagatgctgttaaaggaata
gctgagcaattagggactaactgccagatggcttgggaaaataggatagccttagacatg
atattagcagaaagaggaggagtttgcatcatgattaaaactgaatgttgtgccttcatc
ccaaacaacactgcccctaatggaagtataacaaaggcattgcaaggtctgactgctctg
tccaatgagttagccagcagctcaggggtaaatgacccctttacaggatggctagaaaag
tggttcggtaaatggaaaggaataacagcctcaattcttacttccctcacagctgtaatg
ggtgtacttattcttgtcgggtgctgtgtcataccatgcatctgtgggttggtgcagaga
caccggggaccgccccttgtggtcatagagacgaagccgctcggcctagagcgcctcgcc
ggcctcccggtcggccacgcactaaaactgggctcaggaatccaagctacaccccaaaat
ggggagaacgcggatcgaagggaggccttcccggccgcgtcctag

>gi568815592f:87377360_87611523|GENSCAN_predicted_peptide_4|234_aa
MAFLALSNLDAAVYQVTYQLKIPCTALCTVLMLNRTLSKLQWVSVFMLCAGVTLVQWKPA
QATKVVVEQNPLLGFGAIAIAVLCSGFAGVYFEKVLKSSDTSLWVRNIQMYLSGIIVTLA
GVYLSDGAEIKEKGFFYGYTYYVWFVIFLASVGGLYTSVVVKYTDNIMKGFSAAAAIVLS
TIASVMLFGLQITLTFALGTLLVCVSIYLYGLPRQDTTSIQQGETASKERVIGV

>gi568815592f:87377360_87611523|GENSCAN_predicted_CDS_4|705_bp
atggctttcctagctcttagcaatctggatgcagcagtgtaccaggtgacctaccagttg
aagattccgtgtactgctttatgcactgttttaatgttaaaccggacactcagcaaatta
cagtgggtttcagtttttatgctgtgtgctggagttacgcttgtacagtggaaaccagcc
caagctacaaaagtggtggtggaacaaaatccattattagggtttggcgctatagctatt
gctgtattgtgctcaggatttgcaggagtatattttgaaaaagttttaaagagttcagat
acttctctttgggtgagaaacattcaaatgtatctatcagggattattgtgacattagct
ggcgtctacttgtcagatggagctgaaattaaagaaaaaggatttttctatggttacaca
tattatgtctggtttgtcatctttcttgcaagtgttggtggcctctacacttctgttgtg
gttaagtacacagacaacatcatgaaaggcttttctgcagcagcggccattgtcctttcc
accattgcttcagtaatgctgtttggattacagataacactcacctttgccctgggtact
cttcttgtatgtgtttccatatatctctatggattacccagacaagacactacatccatc
caacaaggagaaacagcttcaaaggagagagttattggtgtgtga

>gi568815592f:87377360_87611523|GENSCAN_predicted_peptide_5|247_aa
MVGLLGTGFQLFGYEEKLQSNPLQHLFEVYVQVNKEAADDKSVAKAAQEFFQRLELGDVQ
ALSLWQKFRDLSIEEYIRVYKRLGVYFDEYSGESFYREKSQEVLKLLESKGLLLKTMYDQ
VRPIITRICLVTLGLWPLCFRDLAAAIDRMDKYNFDTMIYVDFKGLLLSDYKFSWDRVFQ
SRGDTGVFLQYTHARLHSHLAAVAHKTLQIKDSPPEVAGARLHLFKAVRSVLANGMKLLG
ITPVCRM

>gi568815592f:87377360_87611523|GENSCAN_predicted_CDS_5|744_bp
atggtaggtcttctgggaactggcttccagctgtttggctatgaggaaaaactgcagtcc
aatcctctacagcatctctttgaagtttatgtacaagttaataaagaagcagcagatgat
aaaagtgtagcaaaagcagcacaggagttcttccaacgattggaactgggcgatgtgcaa
gcactttcactgtggcaaaaatttcgggacttgagcattgaagagtacattcgggtttac
aagcgtctgggagtatattttgatgaatattcaggagaatcattttatcgtgaaaaatct
caagaggtcttaaagttgctggagagtaaaggactcctactgaaaacaatgtatgatcag
gtaagaccaattattacccggatctgcttagtaactcttggtttatggccactttgtttc
agagatcttgcagctgctatagatcgaatggacaagtataattttgatacaatgatatat
gtggacttcaaaggtttactcttatctgactacaagttcagctgggatcgtgttttccag
agtcgcggggacacaggagtcttcctacagtacacacacgcccgcctccacagtcatctt
gcagctgtggcacacaaaacactacaaataaaagatagtcctcctgaagtggctggggcc
agacttcatcttttcaaagctgtccgttctgtcctagccaatggaatgaaacttcttgga
ataacacctgtatgtaggatgtaa

>gi568815592f:87377360_87611523|GENSCAN_predicted_peptide_6|234_aa
PGKGCLRNLGFIPSSKGNEAAVEPQCASATTYLRHRGRSHGLTDSAYSRCTRDFPRALAA
AGQRNHLPNKLLACKSFSQALLLKKPREVADFQLSVDSLLEKDNDHSRPDIQVQAKRLAE
KLRCDTVVSEISTGQRTVNFKINRELLTKTVLQQVIEDGSKYGLKSELFSGLPQKKIVVE
FRVARPLYKGTQGSKIVKSSSLETGTASFLSLSADESKSQDHPRFKGRKIEYIS

>gi568815592f:87377360_87611523|GENSCAN_predicted_CDS_6|705_bp
ccagggaagggatgcctacgtaacctcggcttcatcccttcttcaaaggggaatgaggca
gcggtagagccacagtgcgcatcggccaccacataccttagacatcgaggacgtagccat
ggtcttactgactctgcgtattccagatgcactcgggatttcccgcgcgccctcgcggct
gcagggcagaggaaccatctcccaaataaactacttgcctgcaagtctttttctcaggcc
ctgcttttgaagaagccaagagaagtagctgattttcagctttctgtggattctttattg
gaaaaagacaatgaccattcaagaccagatattcaagttcaagccaagagactagcagag
aagctaagatgtgatacagtggtgagtgaaatcagtactggtcaaaggactgtaaatttc
aaaataaacagagagctcttaacaaagacagtgctacaacaagtaattgaagatggctca
aaatatggattaaaaagtgaacttttctctggacttccccagaagaagattgtggttgaa
ttcagggtagcaagacctctttacaaaggtactcaaggctctaagatagtgaaatcatct
agtcttgaaacaggcacagcatcatttttgtcactttctgctgatgaaagcaagtcacaa
gatcaccccagattcaagggaagaaaaatagagtacatctcttaa