GENSCAN 1.0	Date run: 24-Oct-119	Time: 21:52:15

Sequence gi568815595f:191229675_191489604 : 259930 bp : 37.62% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +  16903  17147  245  2  2    6   37   361 0.478  17.88
 1.02 PlyA +  18128  18133    6                               1.05

 2.04 PlyA -  19351  19346    6                               1.05
 2.03 Term -  69759  69655  105  0  0   62   41   119 0.120   2.03
 2.02 Intr -  82094  81886  209  0  2   55   42   156 0.216   5.57
 2.01 Init -  86939  86873   67  2  1   89   45    72 0.396   4.29
 2.00 Prom -  94877  94838   40                              -6.05

 3.03 PlyA -  95033  95028    6                               1.05
 3.02 Term - 100034  99559  476  0  2   76   48   336 0.712  22.46
 3.01 Init - 106688 106601   88  2  1   39   48    68 0.213  -1.25
 3.00 Prom - 110614 110575   40                              -1.95

 4.00 Prom + 122675 122714   40                              -4.85
 4.01 Init + 123709 123778   70  0  1   83   81     4 0.571   0.46
 4.02 Intr + 127414 127476   63  2  0   73   70   101 0.959   4.57
 4.03 Intr + 128324 128450  127  0  1   79   48   183 0.989  12.22
 4.04 Intr + 131395 131485   91  1  1   81   60   116 0.987   7.18
 4.05 Intr + 140245 140362  118  0  1  126   77   182 0.998  20.02
 4.06 Intr + 145388 145915  528  0  0   84   90   351 0.954  26.88
 4.07 Intr + 150485 150600  116  0  2   67   85   171 0.989  13.95
 4.08 Intr + 151013 151057   45  1  0   62  100    48 0.679   1.09
 4.09 Intr + 151154 151258  105  1  0   81   75   121 0.993   9.59
 4.10 Intr + 153072 153151   80  2  2   78  115    56 0.970   4.73
 4.11 Intr + 159822 159928  107  0  2   59  116    66 0.897   5.44
 4.12 Term + 162067 162086   20  2  2  137   42    -6 0.667  -2.80
 4.13 PlyA + 162470 162475    6                               1.05

 5.00 Prom + 163619 163658   40                              -4.95
 5.01 Init + 171849 172004  156  2  0   76  103   125 0.788  12.76
 5.02 Intr + 185259 185425  167  2  2   91   58    66 0.009   1.74
 5.03 Intr + 192268 192416  149  1  2   58  109    -3 0.002  -2.24
 5.04 Term + 196820 197004  185  0  2   85   42    96 0.272   1.42
 5.05 PlyA + 198244 198249    6                               1.05

 6.00 Prom + 210279 210318   40                              -1.65
 6.01 Sngl + 231489 231782  294  2  0   86   40   302 0.997  20.65
 6.02 PlyA + 231913 231918    6                               1.05

 7.05 PlyA - 232812 232807    6                               1.05
 7.04 Term - 237533 236765  769  1  1   29   54   251 0.046   7.68
 7.03 Intr - 241672 241400  273  0  0   43   90   152 0.100   6.63
 7.02 Intr - 247536 247397  140  0  2  105   64   -19 0.018  -4.26
 7.01 Intr - 252607 252429  179  2  2   16   73   158 0.203   5.82

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:191229675_191489604|GENSCAN_predicted_peptide_1|81_aa
XWVTELDPVSKKKKERKKERKEEEEGKGKGEGEEEEEEEEDEKEEEEEEEEEGGGRGEGG
EGGEGGEELEATAFLDSLSKP

>gi568815595f:191229675_191489604|GENSCAN_predicted_CDS_1|246_bp
ncctgggtgacagagctagaccctgtatcaaaaaaaaaaaaagaaagaaagaaagaaaga
aaagaagaagaggaagggaaaggtaaaggggaaggggaagaggaagaagaggaagaggag
gatgaaaaggaagaagaagaggaagaagaagaagaaggaggaggaagaggagaaggagga
gaaggaggagaaggaggagaagaactagaggccacagcattccttgattcattatctaag
ccttga

>gi568815595f:191229675_191489604|GENSCAN_predicted_peptide_2|126_aa
MDQSALCKTDQSALCKMYRSAGSEKYHLTPKGYCLIKALMNAQVTKALKSLRSSVVAASS
VEDAATELGYLKSMRKVGLWNGTGRWNSTMGMTWLELKQLSCEEQCPEAAKGIGALALAH
KTILPS

>gi568815595f:191229675_191489604|GENSCAN_predicted_CDS_2|381_bp
atggaccaatccgctctttgtaaaacggaccaatcagctctctgtaaaatgtaccgatca
gcaggatctgagaaatatcacctaacgcctaaaggatactgtctcattaaggcactaatg
aatgcacaggtgacgaaggcactgaaatcattgagaagctcagtggtagctgcttcatca
gtggaagatgctgctacagaactaggctatctaaaaagtatgaggaaagtgggcctctgg
aatggaacaggccgttggaatagtaccatgggaatgacgtggctggagctgaagcaactg
agctgtgaagagcagtgtccagaggctgcaaagggtattggggcccttgccctggcccac
aaaaccatccttccgtcctag

>gi568815595f:191229675_191489604|GENSCAN_predicted_peptide_3|187_aa
MVKIKKSKWVLKISNASKDAEQLELTYIAGSLDWSMLTSAIPGQGPGCPFNAGFPVTAGA
ERGPERSSSAGRGAKSGRQGSQAGERSGSGGQGLVTKAAGDGPRGTKLAAQLPRRGHLLR
VPGPNRYQISGARKWTGAVQYGGREETERRAGRLGEGELQVCNRNCEWLASEDQRPLPPR
CCLLCAQ

>gi568815595f:191229675_191489604|GENSCAN_predicted_CDS_3|564_bp
atggttaaaattaaaaaaagtaaatgggtcctgaaaatatcaaatgccagcaaggatgca
gaacaactggagctcacatacattgctggcagcttggactggtcgatgctgacttcagcc
ataccgggccagggtcccggttgcccctttaacgcgggcttcccggtcaccgccggcgcc
gagcggggccccgagcgcagcagcagcgccggacgcggcgcaaagtccggccggcagggc
tcgcaggccggagaacggagcgggtcgggcgggcaggggctggtgaccaaagcagcggga
gacggtccccgaggcaccaagctggcggcccagctgccccgccgcggccacctgctccgc
gtccccggccccaatcgataccaaatatccggagcccggaaatggaccggggccgtccag
tacggagggagagaggagacagaaaggcgggcgggccggctgggggagggggagctgcag
gtctgtaaccgcaattgcgagtggctggcgagcgaagatcagcgcccgctgccgcctcgc
tgttgtctcctttgtgctcagtga

>gi568815595f:191229675_191489604|GENSCAN_predicted_peptide_4|489_aa
MGGKKFEKWEGERWFTLNVPTEKVCRDFAVLEDHTLAHSLQEQEIEHHLASNVQRNRLVQ
HDLQVAKQLQEEDLKAQAQLQKRYKDLEQQDCEIAQEIQEKLAIEAERRRIQEKKDEDIA
RLLQEKELQEEKKRKKHFPEFPATRAYADSYYYEDGDQPGSRRARELGSGFSRPCRLQRD
GKTVKHKKEKPEHPLENLEEPEQHCSSKRSLSSSSSGKGRDNPHINNEQHERKRSTQERP
RRPLLPTISGEVFLSTECDDWETKINHQTRNWEKQSRHQDRLSPKSSQKAGLHCKEVVYG
RDHGQGEHRKRRHRPRTPPFSESEEQLHLHDAGMKPRVMKEAVSTPSRMAHRDQEWYDAE
IARKLQEEELLATQVDMRAAQVAQDEEIARLLMAEEKKAYKKAKEREKSSLDKRKQDPEW
KPKTAKAANSKSKESDEPHHSKNERPARPPPPIMTDGEDADYTHFTNQQSSTRHFSKSES
SHKGFHYKH

>gi568815595f:191229675_191489604|GENSCAN_predicted_CDS_4|1470_bp
atgggaggcaaaaagtttgagaaatgggagggtgaaaggtggttcaccttaaatgtgccc
acagagaaagtatgccgagattttgctgtcctggaggaccacaccctggctcacagcctg
caggaacaagagattgagcatcatttggcatcgaacgttcagcggaaccgtttggtccag
catgatctccaggtggctaagcagctccaagaggaagatctgaaagcgcaggcccagctc
cagaagcgctacaaagaccttgaacaacaagactgtgaaattgctcaggaaattcaggag
aagctggctattgaggcagagagacgacgcattcaggagaagaaggatgaggacatagct
cgccttttgcaagaaaaggagttacaggaagagaaaaagagaaagaaacactttccagag
ttccctgcaacccgtgcttatgcagatagttactattatgaagatggagaccaaccaggg
tcaaggagggccagggaattgggttctggattctcaagaccttgtagactccaaagagat
ggaaagactgtgaagcacaagaaagagaaaccagaacatccactggagaacttggaagag
ccagaacaacattgttcatcgaagagatccctgtcatcctctagctcgggcaaagggagg
gacaatccccatattaacaatgagcagcatgaaaggaaacggtccactcaggagaggcct
cggagacctctgcttcccacgatcagtggtgaagtgtttctgagcactgaatgtgatgac
tgggagactaagattaaccatcagactcgaaattgggaaaaacagtctcgacaccaagat
cgactttcacccaagtcctcacaaaaagcagggcttcactgcaaggaagttgtatatggg
agggaccatgggcaaggtgagcacagaaaaaggagacacaggcccaggactcctccattc
tcagagagtgaggagcagctccacctccatgacgcaggaatgaagccaagagtgatgaaa
gaagctgtatctactccatcacgaatggcccacagggatcaggaatggtatgatgctgaa
attgccagaaaactgcaagaagaagaacttttggctacccaggtggacatgagagccgct
caagtagctcaagatgaagaaatcgctcgacttctaatggctgaagaaaagaaagcttac
aaaaaagccaaggagcgggagaaatcatctttggacaaaagaaagcaagaccccgagtgg
aagccaaaaacagctaaagcagcaaattccaagtcaaaagagagtgatgaacctcaccat
tctaagaatgaaaggccagcacggccaccaccacctatcatgacagatggtgaagatgcg
gattacactcattttacaaaccagcagagttccacacggcatttctcaaaatcagagtcc
tctcataaaggttttcattacaaacattaa

>gi568815595f:191229675_191489604|GENSCAN_predicted_peptide_5|218_aa
MDSHDYKMEFHYRPSASWGREKLISALSKSESLKTRETDCSFQSVAKAQEPMVALFEKLN
HTHSCKEHKKPTPVLEEEAFCMLTKFENAVTPCIRPFSHCYKEIPQTGHGLNATKLSSSY
CFLNLNTVHAVTAHIDLLLSFQSIFSYTVFMNITRSTECGLGTSIMDKPWQHVKMPNLRL
HPRPMESKSAFLTQFPEDVQIELYDKPKNLSSQVKVFN

>gi568815595f:191229675_191489604|GENSCAN_predicted_CDS_5|657_bp
atggactcccatgattataagatggagttccactataggccgtctgcaagctggggaaga
gagaagctgatatcggctctgtccaagtccgaaagcctcaaaaccagggaaactgactgc
agctttcagtctgtggccaaagcccaagagcccatggtggccctctttgagaagctaaac
cacacacatagctgtaaagaacacaaaaagcccacacctgtcctggaggaagaagcattc
tgcatgctgacaaagtttgaaaatgcagttactccctgtattaggccattctcacattgt
tataaagagatacctcagaccgggcatggtctgaatgctacaaaactgagttcaagctat
tgcttcctaaatttgaacaccgttcatgcagtcacagcccacatcgatcttttattaagc
tttcagtctattttctcatacaccgtttttatgaacatcaccagaagtacagagtgtggt
ctgggaactagcatcatggacaaaccctggcagcatgttaaaatgccgaatcttagactc
caccccagacccatggaatcaaaatctgcatttttaactcaattcccagaagatgtgcag
attgaattgtatgataaacctaagaacctaagttctcaggtgaaagtttttaattga

>gi568815595f:191229675_191489604|GENSCAN_predicted_peptide_6|97_aa
MASSAELDFNLQALLEQLSQDELSKFKSLIRTISLGKELQTVPQTEVDKANGKQLVEIFT
SHSCSYWAGMAAIQVFEKMNQTHLSGRADEHCVMPPP

>gi568815595f:191229675_191489604|GENSCAN_predicted_CDS_6|294_bp
atggcatcttctgcagagctggacttcaacctgcaggctcttctggagcagctcagccag
gatgagttgagcaagttcaagtctctgatcagaacaatctccctgggaaaggagctacag
accgtcccccagacagaggtagacaaggctaatgggaagcaactggtagaaatcttcacc
agccactcctgcagctactgggcagggatggcagccatccaggtctttgaaaagatgaat
caaacgcatctgtctgggagagctgatgaacactgtgtgatgcccccaccttaa

>gi568815595f:191229675_191489604|GENSCAN_predicted_peptide_7|453_aa
XTRENEDAKVETPDKPVRLIHSHENSMKETKPTPKIQVLSHWVPPITRVNYGSTIQNEIL
GNLMKKLLIFHITPTPSSHILNQFSFSGDSKICWAQQLPVSAILSSRNDLSIVDFGICGG
SWNQSPYGYTERTVIVKAGYIICRAGGKCKYRDPGIKEELVSLLFPKAFHSTHYEWVKYC
ILKELRHWNEFCIWIRGGDEAHLIMVDKLFDVLLDLVCQYFTEEFCINVHQGYWSKILFF
GCVSAWLWYQDDMIVYLENPLVSAQNLLKLISNFSKVSGHKINVQKSKAFLYTNNRQTES
QIMSELPFTIASKRIKYLGIQLTRDVKDLLKENYKPLLNEIKDDTNKWKNIPRSWVGKIN
IVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIVKLILSQKNKAGGIT
LPDFKLYYKTTGTKTAWYWYQNRDVDQWNRIEP

>gi568815595f:191229675_191489604|GENSCAN_predicted_CDS_7|1362_bp
nccacaagagaaaatgaagatgcaaaagtggaaacccctgacaaacccgtgagacttatt
cactcccatgagaacagtatgaaggaaacaaagcccacccccaagattcaagttctctcc
cactgggtccctccaataacacgtgtgaattatgggagtacaattcaaaatgagattttg
ggaaatctcatgaaaaagctgctcatctttcatataacacccaccccttcttcccatatt
ctaaaccaattttctttttctggggacagtaaaatctgttgggcacaacaacttccagtg
agtgctatactttcttccagaaacgacttgagtatcgtggattttggtatttgtgggggc
tcctggaaccaatccccttatgggtacacagaaaggactgtcatagtcaaagctggctac
ataatctgcagagctggtggaaaatgcaaatacagggacccgggaataaaggaagaactg
gtcagtctcctcttcccaaaggctttccactcaacccattatgagtgggtgaagtattgc
atcctaaaagaattaagacactggaatgaattttgtatctggattagaggaggggatgaa
gcccacttgatcatggtggataagctttttgatgtgctgctggatttggtttgccagtat
tttactgaggaattttgcatcaatgttcatcaaggatattggtctaaaattctctttttt
ggttgtgtctctgcctggctttggtatcaggatgacatgattgtatatctagaaaacccc
cttgtctcagcccaaaatctccttaagctgataagcaactttagcaaagtctcaggacac
aaaatcaatgtacaaaaatcaaaagcattcttatacaccaataacagacaaacggagagc
caaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatacttaggaatc
caacttacaagggacgtgaaggacctcttgaaggagaactacaaaccactgctcaatgaa
ataaaagacgatacaaacaaatggaagaacattccacgctcatgggtaggaaaaatcaat
atcgtgaaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaag
ctaccaatgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaa
aaaagagcccgcattgtcaagttaatcctaagccaaaagaacaaagctggaggcatcaca
ctacctgacttcaaactatactacaagactacaggaaccaaaacagcatggtactggtac
caaaacagagatgtagatcaatggaacagaatagagccctga