GENSCAN 1.0 Date run: 6-Nov-116 Time: 05:25:08 Sequence gi568815591f:26052499_26285780 : 233282 bp : 43.46% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 6144 6338 195 0 0 58 58 144 0.235 6.93 1.02 Intr + 16422 16491 70 0 1 114 37 66 0.222 3.28 1.03 Intr + 27142 27170 29 0 2 75 86 33 0.096 -1.29 1.04 Intr + 27881 27955 75 2 0 94 80 41 0.199 2.53 1.05 Intr + 40414 40536 123 1 0 69 44 97 0.652 3.10 1.06 Intr + 45687 45904 218 0 2 44 95 102 0.813 4.65 1.07 Intr + 49117 49252 136 2 1 56 81 88 0.570 4.53 1.08 Intr + 49981 50053 73 1 1 66 101 36 0.562 2.11 1.09 Intr + 52546 52581 36 0 0 102 86 9 0.304 0.56 1.10 Intr + 70439 70640 202 1 1 25 108 88 0.043 3.36 1.11 Term + 71604 71719 116 1 2 58 44 187 0.593 10.03 1.12 PlyA + 74287 74292 6 1.05 2.00 Prom + 97209 97248 40 -4.16 2.01 Init + 100001 100570 570 1 0 92 92 816 0.138 75.49 2.02 Intr + 125445 125624 180 2 0 52 99 164 0.993 13.96 2.03 Term + 132035 133285 1251 1 0 125 54 321 0.893 23.99 2.04 PlyA + 133770 133775 6 1.05 3.10 PlyA - 135111 135106 6 1.05 3.09 Term - 137962 137900 63 1 0 98 48 79 0.426 2.79 3.08 Intr - 141196 141077 120 1 0 37 75 160 0.248 10.29 3.07 Intr - 143411 143349 63 2 0 85 78 88 0.251 6.41 3.06 Intr - 143983 143903 81 1 0 23 105 88 0.239 3.83 3.05 Intr - 144160 144123 38 2 2 52 63 27 0.144 -5.42 3.04 Intr - 144519 144309 211 0 1 70 92 227 0.878 19.79 3.03 Intr - 144963 144817 147 0 0 42 99 45 0.493 1.43 3.02 Intr - 145234 145124 111 1 0 86 68 68 0.658 5.18 3.01 Init - 148079 148074 6 2 0 94 82 0 0.289 0.87 3.00 Prom - 150152 150113 40 -5.86 4.00 Prom + 151167 151206 40 -6.26 4.01 Init + 153876 154012 137 2 2 54 52 140 0.678 6.61 4.02 Intr + 155895 156057 163 0 1 10 87 132 0.959 5.28 4.03 Intr + 159164 159258 95 1 2 48 101 76 0.909 3.66 4.04 Intr + 192321 192476 156 0 0 88 18 99 0.165 2.03 4.05 Intr + 192872 193105 234 2 0 74 53 131 0.073 5.00 4.06 Term + 195572 195779 208 2 1 36 52 136 0.206 1.61 4.07 PlyA + 196093 196098 6 1.05 5.04 PlyA - 197134 197129 6 1.05 5.03 Term - 197572 197416 157 0 1 29 36 149 0.007 1.21 5.02 Intr - 210712 210649 64 2 1 96 54 34 0.037 -1.42 5.01 Init - 214921 214885 37 1 1 76 106 8 0.444 1.57 5.00 Prom - 220199 220160 40 -1.66 6.02 PlyA - 223033 223028 6 1.05 6.01 Term - 223853 223663 191 0 2 44 38 161 0.404 4.31 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 100001 100771 771 1 0 92 49 871 0.823 77.66 S.002 Term + 159584 159710 127 2 1 104 32 51 0.851 -1.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:26052499_26285780|GENSCAN_predicted_peptide_1|424_aa XCVNQLAASAAERGDSFPPPVFHGAIHGPSCELAVQKHKAPWPSFRSQSFRSQQGHAASG ERPVIILCQRLLLAEPNEKPGSLGNVMAVARIEIGICEYYHEKTTEKALDSHGVLAGSTI KGVRSFQRNLELKLPATERATANAIELLTVLDQAYENFAPQILPSTGSPTSQETAQFKAN QNKPLVRGKGSPHEAIRYISAAHREWKPAILTSAIRSFCSTWLVFTSKNFPKLVTQHGST IAGNGQSSDETQVQGAAWKSDSRGTKRQIPTWILAEGNNAGAQLDIPGPTIPAPNCSLKV PQSWSTTPSMPSSLGKAYWLLACYWALVETEHWLWVIKSPCKPELPVMNWVLSDPSSHKV GGAQQHSINKWKWYIRNRARAGPEGTTLPLTKALTLWLKKYSNVLMLVEFTGLTMFPDIL KQLE >gi568815591f:26052499_26285780|GENSCAN_predicted_CDS_1|1275_bp nnatgtgtgaatcagcttgctgcttcggcagcagaacgtggagacagctttccacccccg gttttccacggggcaatacacggaccatcgtgtgagctggcggtgcagaagcataaagct ccttggccatcattccgttcacaatcattccgttcacaacagggccatgctgcatctggg gaaaggccggtgatcatcctctgccagcgcctcttattggctgaacctaatgagaagcca gggagcctgggaaatgtgatggctgtggcaagaatagaaattggaatctgtgagtattac catgagaaaaccacagagaaagctctggacagccatggagtcttagctggatctacaata aaaggagtcagaagctttcagaggaatttggagctgaagttgccagcaacagagagggcc acagctaacgcgatagagctgctcacagtgttagaccaggcctatgaaaactttgcaccg cagatcctccccagcactggcagccccacgtcccaggaaactgctcagttcaaggcaaac cagaacaaaccattggtcagaggaaaaggaagcccacatgaggccataagatatatttca gctgcccatcgagaatggaagccagcaatcctgacatcagccattaggtcattttgttct acctggctggtgttcacctccaagaactttcccaagcttgtgacccaacatggaagtacg atagcaggaaacgggcagagctcagatgagacccaagtccagggtgcagcctggaagtct gattccagaggaacaaagagacaaataccaacgtggattctggcagaggggaacaatgca ggtgcccaacttgacatcccaggcccaaccatccctgctcctaattgttccctgaaggtg ccccagagctggagtaccacgcccagcatgccctcctcactagggaaggcctactggctc ttggcctgttactgggccttggtggaaactgaacattggctatgggtcatcaagtcacca tgtaaacctgaactgcctgtcatgaactgggtgctttctgacccatctagccataaagtg ggtggtgcacagcagcattccatcaacaaatggaagtggtatatacgtaatcgggctcga gcaggtcctgaaggcacaacattgcctctgaccaaggcactcactttatggctaaagaag tacagcaacgtgctcatgctcgtggaatttaccggtcttaccatgttccccgacatcctg aagcagctggaatga >gi568815591f:26052499_26285780|GENSCAN_predicted_peptide_2|666_aa MKHLKRWWSAGGGLLHLTLLLSLAGLRVDLDLYLLLPPPTLLQDELLFLGGPASSAYALS PFSASGGWGRAGHLHPKGRELDPAAPPEGQLLREVRALGVPFVPRTSVDAWLVHSVAAGS ADEAHGLLGAAAASSTGGAGASVDGGSQAVQGGGGDPRAARSGPLDAGEEEKAPAEPTAQ VPDAGGCASEENGVLREKHEAVDHSSQHEENEERVSAQKENSLQQNDDDENKIAEKPDWE AEKTTESRNEGISLGDIPLPGSISDGMNSSAHYHVNFSQAISQDVNLHEAILLCPNNTFR RDPTARTSQSQEPFLQLNSHTTNPEQTLPGTNLTGFLSPVDNHMRNLTSQDLLYDLDINI FDEINLMSLATEDNFDPIDVSQLFDEPDSDSGLSLDSSHNNTSVIKSNSSHSVCDEGAIG YCTDHESSSHHDLEGAVGGYYPEPSKLCHLDQSDSDFHGDLTFQHVFHNHTYHLQPTAPE STSEPFPWPGKSQKIRSRYLEDTDRNLSRDEQRAKALHIPFSVDEIVGMPVDSFNSMLSR YYLTDLQVSLIRDIRRRGKNKVAAQNCRKRKLDIILNLEDDVCNLQAKKETLKREQAQCN KAINIMKQKLHDLYHDIFSRLRDDQGRPVNPNHYALQCTHDGSILIVPKELVASGHKKET QKGKRK >gi568815591f:26052499_26285780|GENSCAN_predicted_CDS_2|2001_bp atgaagcacctgaagcggtggtggtcggccggcggcggcctcctgcacctcaccctcctg ctgagcttggcggggctccgcgtagacctagatctttacctgctgctgccgccgcccacc ctgctgcaggacgagctgctgttcctgggcggcccggccagctccgcctacgcgctcagc cccttctcggcctcgggagggtgggggcgcgcgggccacttgcaccccaagggccgggag ctggaccctgccgcgccgcccgagggccagctgctccgggaggtgcgcgcgctcggggtc cccttcgtccctcgcaccagcgtggatgcatggctggtgcacagcgtggctgccgggagc gcggacgaggcccacgggctgctcggcgccgccgccgcctcgtccaccggaggagccggc gccagcgtggacggcggcagccaggctgtgcaggggggcggcggggacccccgagcggct cggagtggccccttggacgccggggaagaggagaaggcacccgcggaaccgacggctcag gtgccggacgctggcggatgtgcgagcgaggagaatggggtactaagagaaaagcacgaa gctgtggatcatagttcccagcatgaggaaaatgaagaaagggtgtcagcccagaaggag aactcacttcagcagaatgatgatgatgaaaacaaaatagcagagaaacctgactgggag gcagaaaagaccactgaatctagaaatgagggcatctcattgggagatattcctcttcca ggcagtatcagtgatggcatgaattcttcagcacattatcatgtaaacttcagccaggct ataagtcaggatgtgaatcttcatgaggccatcttgctttgtcccaacaatacatttaga agagatccaacagcaaggacttcacagtcacaagaaccatttctgcagttaaattctcat accaccaatcctgagcaaacccttcctggaactaatttgacaggatttctttcaccggtt gacaatcatatgaggaatctaacaagccaagacctactgtatgaccttgacataaatata tttgatgagataaacttaatgtcattggccacagaagacaactttgatccaatcgatgtt tctcagctttttgatgaaccagattctgattctggcctttctttagattcaagtcacaat aatacctctgtcatcaagtctaattcctctcactctgtgtgtgatgaaggtgctataggt tattgcactgaccatgaatctagttcccatcatgacttagaaggtgctgtaggtggctac tacccagaacccagtaagctttgtcacttggatcaaagtgattctgatttccatggagat cttacatttcaacacgtatttcataaccacacttaccacttacagccaactgcaccagaa tctacttctgaaccttttccgtggcctgggaagtcacagaagataaggagtagatacctt gaagacacagatagaaacttgagccgtgatgaacagcgtgctaaagctttgcatatccct ttttctgtagatgaaattgtcggcatgcctgttgattctttcaatagcatgttaagtaga tattatctgacagacctacaagtctcacttatccgtgacatcagacgaagagggaaaaat aaagttgctgcgcagaactgtcgtaaacgcaaattggacataattttgaatttagaagat gatgtatgtaacttgcaagcaaagaaggaaactcttaagagagagcaagcacaatgtaac aaagctattaacataatgaaacagaaactgcatgacctttatcatgatatttttagtaga ttaagagatgaccaaggtaggccagtcaatcccaaccactatgctctccagtgtacccat gatggaagtatcttgatagtacccaaagaactggtggcctcaggccacaaaaaggaaacc caaaagggaaagagaaagtga >gi568815591f:26052499_26285780|GENSCAN_predicted_peptide_3|279_aa MEREKEQFRKLFIGGLSFETTEESLRNYYEQWGKLTDCVVMRDPASKRSRGFGFVTFSSM AEVDAAMAARPHSIDGRVVEPKRAVAREESGKPGAHVTVKKLFVGGIKEDTEEHHLRDYF EEYGKIDTIEIITDRQSGKKRGFGFVTFDDHDPVDKIVLQKYHTINGHNAEATLALGIHV VAVEISDQDQEVTLEEDLMDMAVDVDLGMAIMGMEEDLEVAILEVAPVMEEEEEDMVVED LDMATRVGATEVVMTTMEEEGVLLQVTNEEVVNHRVFKK >gi568815591f:26052499_26285780|GENSCAN_predicted_CDS_3|840_bp atggagagagaaaaggaacagttccgtaagctctttattggtggcttaagctttgaaacc acagaagaaagtttgaggaactactacgaacaatggggaaagcttacagactgtgtggta atgagggatcctgcaagcaaaagatcaagaggatttggttttgtaactttttcatccatg gctgaggttgatgctgccatggctgcaagacctcattcaattgatgggagagtagttgag ccaaaacgtgctgtagcaagagaggaatctggaaaaccaggggctcatgtaactgtgaag aagctgtttgttggcggaattaaagaagatactgaggaacatcaccttagagattacttt gaggaatatggaaaaattgataccattgagataattactgataggcagtctggaaagaaa agaggctttggctttgttacttttgatgaccatgatcctgtggataaaatcgtattgcag aaataccataccatcaatggtcataatgcagaagcaactttggctttggggattcacgtg gtggcggtggaaatttcggaccaggaccaggaagtaactttagaggaggatctgatggat atggcagtggacgtggatttggggatggctataatgggtatggaggaggacctggaggtg gcaattttggaggtagccccggttatggaggaggaagaggaggatatggtggtggaggac ctggatatggcaaccagggtgggggctacggaggtggttatgacaactatggaggaggaa ggtgtcttgctgcaggtaactaatgaagaagtggtcaaccacagagtcttcaagaaataa >gi568815591f:26052499_26285780|GENSCAN_predicted_peptide_4|330_aa MGKKQNGKSKKVEEAEPEEFVVEKVLDRRVVNGKVEYFLKWKGFTDADNTWEPEENLDCP ELIEAFLNSQKAGKEKDGTKRKSLSDSESDDSKSKKKRDAADKPRGFARGLDPERIIGAT DSSGELMFLMKCPYLFLSTAFFNPESLLIVFSVAIFPAAIFPVSCIHFFSYTIPPQDNSS IACTAAMKATGPDNAQSQVSPPGHAPSAEDPTGSRTVSSPCEDRPHPFLSWPTWISLALL LKTDGALERMPQQLPSLHPSQGKLLQAAMLLQKVAIIHCRGHQTPDNPIIAGNALADQVA KEVALQPVQDQFLSLSLFSPLYSSEEKEDF >gi568815591f:26052499_26285780|GENSCAN_predicted_CDS_4|993_bp atgggaaaaaaacagaatggaaagagtaaaaaagttgaagaggcagagcctgaagaattt gtcgtggaaaaagtactagatcgacgtgtagtgaatgggaaagtggaatatttcctgaag tggaagggatttacagatgctgacaatacttgggaacctgaagaaaatttagattgtcca gaattgattgaagcgtttcttaactctcagaaagctggcaaagaaaaagatggtacaaaa agaaaatctttatctgacagtgaatctgatgacagcaaatcaaagaagaaaagagatgct gctgacaaaccaagaggatttgccagaggtcttgatcctgaaagaataattggtgccaca gacagcagtggagaattgatgtttctcatgaaatgcccctaccttttcctctcaactgcc ttctttaaccccgaaagcctcctcattgtcttctcagttgccatcttcccagccgccatc ttcccagtcagctgtatccacttcttttcctacaccatcccccctcaggacaattctagt attgcttgcactgcggcaatgaaggccactggtccagacaatgcccaaagccaggtaagc ccaccaggccatgccccctctgcggaggaccccactggaagtcggactgtgagcagcccc tgcgaggaccgcccccatcccttcctgagctggcctacttggatctcattggccttgctg ctgaagactgacggtgccctggaacggatgccccagcaactaccatcgcttcatccaagc caaggcaaactccttcaagctgccatgctcctgcagaaagttgccatcattcattgcaga ggccaccaaaccccagacaatcctataatagctggaaatgcgctggcagatcaggtagcc aaagaagtagctctacaacccgtgcaagaccagtttctgtccctgtcgctgttctctcct ctttactcctcagaagaaaaggaggacttctga >gi568815591f:26052499_26285780|GENSCAN_predicted_peptide_5|85_aa MISERKEKNFLKSAHSELAVISEKKEIILIMLKSRSIVWIVWRLTVLSTRVLMAFSFYYQ WDNTGEKQKEPNDEDYCPYQCFKSS >gi568815591f:26052499_26285780|GENSCAN_predicted_CDS_5|258_bp atgatatcagaaaggaaagagaaaaactttttgaaaagtgctcatagtgaattggcagta atttcagagaagaaggagattattctcataatgcttaagagcaggagcatcgtctggatt gtctggcggttaactgtactttcaacaagagttttaatggcttttagcttttattatcag tgggataacacaggggagaaacagaaggagcccaatgatgaagattactgtccctaccag tgctttaaatcctcctaa >gi568815591f:26052499_26285780|GENSCAN_predicted_peptide_6|63_aa XKSVQPSLILFTEMIYRFLAPVLSALLIRDPTRRPKEIQNFAPEDSPHPCFDILNARKKE KSN >gi568815591f:26052499_26285780|GENSCAN_predicted_CDS_6|192_bp nggaagtctgttcagccgtccctgatcctcttcacggagatgatatacagatttttggct cctgtattgtcagcattgttgatcagggatcctaccagaagacccaaggaaatccagaat tttgcaccagaggactcaccacatccttgcttcgacatcttgaatgccagaaagaaggaa aaaagcaattga