GENSCAN 1.0    Date run: 5-Nov-116    Time: 22:31:58

Sequence gi568815591r:26093305_26297734 : 204430 bp : 43.59% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   4881   5098  218  0  2   44   95   102 0.711   4.65
 1.02 Intr +   8311   8446  136  2  1   56   81    88 0.566   4.53
 1.03 Intr +   9175   9247   73  1  1   66  101    36 0.559   2.11
 1.04 Intr +  11740  11775   36  0  0  102   86     9 0.302   0.56
 1.05 Intr +  29633  29834  202  1  1   25  108    88 0.043   3.36
 1.06 Term +  30798  30913  116  1  2   58   44   187 0.593  10.03
 1.07 PlyA +  33481  33486    6                               1.05

 2.00 Prom +  56403  56442   40                              -4.16
 2.01 Init +  59195  59764  570  1  0   92   92   816 0.138  75.49
 2.02 Intr +  84639  84818  180  2  0   52   99   164 0.993  13.96
 2.03 Term +  91229  92479 1251  1  0  125   54   321 0.893  23.99
 2.04 PlyA +  92964  92969    6                               1.05

 3.10 PlyA -  94305  94300    6                               1.05
 3.09 Term -  97156  97094   63  1  0   98   48    79 0.426   2.79
 3.08 Intr - 100390 100271  120  1  0   37   75   160 0.248  10.29
 3.07 Intr - 102605 102543   63  2  0   85   78    88 0.251   6.41
 3.06 Intr - 103177 103097   81  1  0   23  105    88 0.239   3.83
 3.05 Intr - 103354 103317   38  2  2   52   63    27 0.144  -5.42
 3.04 Intr - 103713 103503  211  0  1   70   92   227 0.878  19.79
 3.03 Intr - 104157 104011  147  0  0   42   99    45 0.493   1.43
 3.02 Intr - 104428 104318  111  1  0   86   68    68 0.658   5.18
 3.01 Init - 107273 107268    6  2  0   94   82     0 0.289   0.87
 3.00 Prom - 109346 109307   40                              -5.86

 4.00 Prom + 110361 110400   40                              -6.26
 4.01 Init + 113070 113206  137  2  2   54   52   140 0.678   6.61
 4.02 Intr + 115089 115251  163  0  1   10   87   132 0.959   5.28
 4.03 Intr + 118358 118452   95  1  2   48  101    76 0.909   3.66
 4.04 Intr + 151515 151670  156  0  0   88   18    99 0.165   2.03
 4.05 Intr + 152066 152299  234  2  0   74   53   131 0.073   5.00
 4.06 Term + 154766 154973  208  2  1   36   52   136 0.206   1.61
 4.07 PlyA + 155287 155292    6                               1.05

 5.04 PlyA - 156328 156323    6                               1.05
 5.03 Term - 156766 156610  157  0  1   29   36   149 0.007   1.21
 5.02 Intr - 169906 169843   64  2  1   96   54    34 0.038  -1.42
 5.01 Init - 174115 174079   37  1  1   76  106     8 0.446   1.57
 5.00 Prom - 179393 179354   40                              -1.66

 6.05 PlyA - 182227 182222    6                               1.05
 6.04 Term - 183047 182857  191  0  2   44   38   161 0.429   4.31
 6.03 Intr - 198612 198527   86  2  2   58   85    43 0.204   0.46
 6.02 Intr - 199036 198894  143  1  2   80   45    83 0.750   2.35
 6.01 Init - 202140 202129   12  0  0   97   76     9 0.379   0.72

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  59195  59965  771  1  0   92   49   871 0.823  77.66
S.002 Term + 118778 118904  127  2  1  104   32    51 0.851  -1.14

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:26093305_26297734|GENSCAN_predicted_peptide_1|260_aa
XTGSPTSQETAQFKANQNKPLVRGKGSPHEAIRYISAAHREWKPAILTSAIRSFCSTWLV
FTSKNFPKLVTQHGSTIAGNGQSSDETQVQGAAWKSDSRGTKRQIPTWILAEGNNAGAQL
DIPGPTIPAPNCSLKVPQSWSTTPSMPSSLGKAYWLLACYWALVETEHWLWVIKSPCKPE
LPVMNWVLSDPSSHKVGGAQQHSINKWKWYIRNRARAGPEGTTLPLTKALTLWLKKYSNV
LMLVEFTGLTMFPDILKQLE

>gi568815591r:26093305_26297734|GENSCAN_predicted_CDS_1|783_bp
nncactggcagccccacgtcccaggaaactgctcagttcaaggcaaaccagaacaaacca
ttggtcagaggaaaaggaagcccacatgaggccataagatatatttcagctgcccatcga
gaatggaagccagcaatcctgacatcagccattaggtcattttgttctacctggctggtg
ttcacctccaagaactttcccaagcttgtgacccaacatggaagtacgatagcaggaaac
gggcagagctcagatgagacccaagtccagggtgcagcctggaagtctgattccagagga
acaaagagacaaataccaacgtggattctggcagaggggaacaatgcaggtgcccaactt
gacatcccaggcccaaccatccctgctcctaattgttccctgaaggtgccccagagctgg
agtaccacgcccagcatgccctcctcactagggaaggcctactggctcttggcctgttac
tgggccttggtggaaactgaacattggctatgggtcatcaagtcaccatgtaaacctgaa
ctgcctgtcatgaactgggtgctttctgacccatctagccataaagtgggtggtgcacag
cagcattccatcaacaaatggaagtggtatatacgtaatcgggctcgagcaggtcctgaa
ggcacaacattgcctctgaccaaggcactcactttatggctaaagaagtacagcaacgtg
ctcatgctcgtggaatttaccggtcttaccatgttccccgacatcctgaagcagctggaa
tga

>gi568815591r:26093305_26297734|GENSCAN_predicted_peptide_2|666_aa
MKHLKRWWSAGGGLLHLTLLLSLAGLRVDLDLYLLLPPPTLLQDELLFLGGPASSAYALS
PFSASGGWGRAGHLHPKGRELDPAAPPEGQLLREVRALGVPFVPRTSVDAWLVHSVAAGS
ADEAHGLLGAAAASSTGGAGASVDGGSQAVQGGGGDPRAARSGPLDAGEEEKAPAEPTAQ
VPDAGGCASEENGVLREKHEAVDHSSQHEENEERVSAQKENSLQQNDDDENKIAEKPDWE
AEKTTESRNEGISLGDIPLPGSISDGMNSSAHYHVNFSQAISQDVNLHEAILLCPNNTFR
RDPTARTSQSQEPFLQLNSHTTNPEQTLPGTNLTGFLSPVDNHMRNLTSQDLLYDLDINI
FDEINLMSLATEDNFDPIDVSQLFDEPDSDSGLSLDSSHNNTSVIKSNSSHSVCDEGAIG
YCTDHESSSHHDLEGAVGGYYPEPSKLCHLDQSDSDFHGDLTFQHVFHNHTYHLQPTAPE
STSEPFPWPGKSQKIRSRYLEDTDRNLSRDEQRAKALHIPFSVDEIVGMPVDSFNSMLSR
YYLTDLQVSLIRDIRRRGKNKVAAQNCRKRKLDIILNLEDDVCNLQAKKETLKREQAQCN
KAINIMKQKLHDLYHDIFSRLRDDQGRPVNPNHYALQCTHDGSILIVPKELVASGHKKET
QKGKRK

>gi568815591r:26093305_26297734|GENSCAN_predicted_CDS_2|2001_bp
atgaagcacctgaagcggtggtggtcggccggcggcggcctcctgcacctcaccctcctg
ctgagcttggcggggctccgcgtagacctagatctttacctgctgctgccgccgcccacc
ctgctgcaggacgagctgctgttcctgggcggcccggccagctccgcctacgcgctcagc
cccttctcggcctcgggagggtgggggcgcgcgggccacttgcaccccaagggccgggag
ctggaccctgccgcgccgcccgagggccagctgctccgggaggtgcgcgcgctcggggtc
cccttcgtccctcgcaccagcgtggatgcatggctggtgcacagcgtggctgccgggagc
gcggacgaggcccacgggctgctcggcgccgccgccgcctcgtccaccggaggagccggc
gccagcgtggacggcggcagccaggctgtgcaggggggcggcggggacccccgagcggct
cggagtggccccttggacgccggggaagaggagaaggcacccgcggaaccgacggctcag
gtgccggacgctggcggatgtgcgagcgaggagaatggggtactaagagaaaagcacgaa
gctgtggatcatagttcccagcatgaggaaaatgaagaaagggtgtcagcccagaaggag
aactcacttcagcagaatgatgatgatgaaaacaaaatagcagagaaacctgactgggag
gcagaaaagaccactgaatctagaaatgagggcatctcattgggagatattcctcttcca
ggcagtatcagtgatggcatgaattcttcagcacattatcatgtaaacttcagccaggct
ataagtcaggatgtgaatcttcatgaggccatcttgctttgtcccaacaatacatttaga
agagatccaacagcaaggacttcacagtcacaagaaccatttctgcagttaaattctcat
accaccaatcctgagcaaacccttcctggaactaatttgacaggatttctttcaccggtt
gacaatcatatgaggaatctaacaagccaagacctactgtatgaccttgacataaatata
tttgatgagataaacttaatgtcattggccacagaagacaactttgatccaatcgatgtt
tctcagctttttgatgaaccagattctgattctggcctttctttagattcaagtcacaat
aatacctctgtcatcaagtctaattcctctcactctgtgtgtgatgaaggtgctataggt
tattgcactgaccatgaatctagttcccatcatgacttagaaggtgctgtaggtggctac
tacccagaacccagtaagctttgtcacttggatcaaagtgattctgatttccatggagat
cttacatttcaacacgtatttcataaccacacttaccacttacagccaactgcaccagaa
tctacttctgaaccttttccgtggcctgggaagtcacagaagataaggagtagatacctt
gaagacacagatagaaacttgagccgtgatgaacagcgtgctaaagctttgcatatccct
ttttctgtagatgaaattgtcggcatgcctgttgattctttcaatagcatgttaagtaga
tattatctgacagacctacaagtctcacttatccgtgacatcagacgaagagggaaaaat
aaagttgctgcgcagaactgtcgtaaacgcaaattggacataattttgaatttagaagat
gatgtatgtaacttgcaagcaaagaaggaaactcttaagagagagcaagcacaatgtaac
aaagctattaacataatgaaacagaaactgcatgacctttatcatgatatttttagtaga
ttaagagatgaccaaggtaggccagtcaatcccaaccactatgctctccagtgtacccat
gatggaagtatcttgatagtacccaaagaactggtggcctcaggccacaaaaaggaaacc
caaaagggaaagagaaagtga

>gi568815591r:26093305_26297734|GENSCAN_predicted_peptide_3|279_aa
MEREKEQFRKLFIGGLSFETTEESLRNYYEQWGKLTDCVVMRDPASKRSRGFGFVTFSSM
AEVDAAMAARPHSIDGRVVEPKRAVAREESGKPGAHVTVKKLFVGGIKEDTEEHHLRDYF
EEYGKIDTIEIITDRQSGKKRGFGFVTFDDHDPVDKIVLQKYHTINGHNAEATLALGIHV
VAVEISDQDQEVTLEEDLMDMAVDVDLGMAIMGMEEDLEVAILEVAPVMEEEEEDMVVED
LDMATRVGATEVVMTTMEEEGVLLQVTNEEVVNHRVFKK

>gi568815591r:26093305_26297734|GENSCAN_predicted_CDS_3|840_bp
atggagagagaaaaggaacagttccgtaagctctttattggtggcttaagctttgaaacc
acagaagaaagtttgaggaactactacgaacaatggggaaagcttacagactgtgtggta
atgagggatcctgcaagcaaaagatcaagaggatttggttttgtaactttttcatccatg
gctgaggttgatgctgccatggctgcaagacctcattcaattgatgggagagtagttgag
ccaaaacgtgctgtagcaagagaggaatctggaaaaccaggggctcatgtaactgtgaag
aagctgtttgttggcggaattaaagaagatactgaggaacatcaccttagagattacttt
gaggaatatggaaaaattgataccattgagataattactgataggcagtctggaaagaaa
agaggctttggctttgttacttttgatgaccatgatcctgtggataaaatcgtattgcag
aaataccataccatcaatggtcataatgcagaagcaactttggctttggggattcacgtg
gtggcggtggaaatttcggaccaggaccaggaagtaactttagaggaggatctgatggat
atggcagtggacgtggatttggggatggctataatgggtatggaggaggacctggaggtg
gcaattttggaggtagccccggttatggaggaggaagaggaggatatggtggtggaggac
ctggatatggcaaccagggtgggggctacggaggtggttatgacaactatggaggaggaa
ggtgtcttgctgcaggtaactaatgaagaagtggtcaaccacagagtcttcaagaaataa

>gi568815591r:26093305_26297734|GENSCAN_predicted_peptide_4|330_aa
MGKKQNGKSKKVEEAEPEEFVVEKVLDRRVVNGKVEYFLKWKGFTDADNTWEPEENLDCP
ELIEAFLNSQKAGKEKDGTKRKSLSDSESDDSKSKKKRDAADKPRGFARGLDPERIIGAT
DSSGELMFLMKCPYLFLSTAFFNPESLLIVFSVAIFPAAIFPVSCIHFFSYTIPPQDNSS
IACTAAMKATGPDNAQSQVSPPGHAPSAEDPTGSRTVSSPCEDRPHPFLSWPTWISLALL
LKTDGALERMPQQLPSLHPSQGKLLQAAMLLQKVAIIHCRGHQTPDNPIIAGNALADQVA
KEVALQPVQDQFLSLSLFSPLYSSEEKEDF

>gi568815591r:26093305_26297734|GENSCAN_predicted_CDS_4|993_bp
atgggaaaaaaacagaatggaaagagtaaaaaagttgaagaggcagagcctgaagaattt
gtcgtggaaaaagtactagatcgacgtgtagtgaatgggaaagtggaatatttcctgaag
tggaagggatttacagatgctgacaatacttgggaacctgaagaaaatttagattgtcca
gaattgattgaagcgtttcttaactctcagaaagctggcaaagaaaaagatggtacaaaa
agaaaatctttatctgacagtgaatctgatgacagcaaatcaaagaagaaaagagatgct
gctgacaaaccaagaggatttgccagaggtcttgatcctgaaagaataattggtgccaca
gacagcagtggagaattgatgtttctcatgaaatgcccctaccttttcctctcaactgcc
ttctttaaccccgaaagcctcctcattgtcttctcagttgccatcttcccagccgccatc
ttcccagtcagctgtatccacttcttttcctacaccatcccccctcaggacaattctagt
attgcttgcactgcggcaatgaaggccactggtccagacaatgcccaaagccaggtaagc
ccaccaggccatgccccctctgcggaggaccccactggaagtcggactgtgagcagcccc
tgcgaggaccgcccccatcccttcctgagctggcctacttggatctcattggccttgctg
ctgaagactgacggtgccctggaacggatgccccagcaactaccatcgcttcatccaagc
caaggcaaactccttcaagctgccatgctcctgcagaaagttgccatcattcattgcaga
ggccaccaaaccccagacaatcctataatagctggaaatgcgctggcagatcaggtagcc
aaagaagtagctctacaacccgtgcaagaccagtttctgtccctgtcgctgttctctcct
ctttactcctcagaagaaaaggaggacttctga

>gi568815591r:26093305_26297734|GENSCAN_predicted_peptide_5|85_aa
MISERKEKNFLKSAHSELAVISEKKEIILIMLKSRSIVWIVWRLTVLSTRVLMAFSFYYQ
WDNTGEKQKEPNDEDYCPYQCFKSS

>gi568815591r:26093305_26297734|GENSCAN_predicted_CDS_5|258_bp
atgatatcagaaaggaaagagaaaaactttttgaaaagtgctcatagtgaattggcagta
atttcagagaagaaggagattattctcataatgcttaagagcaggagcatcgtctggatt
gtctggcggttaactgtactttcaacaagagttttaatggcttttagcttttattatcag
tgggataacacaggggagaaacagaaggagcccaatgatgaagattactgtccctaccag
tgctttaaatcctcctaa

>gi568815591r:26093305_26297734|GENSCAN_predicted_peptide_6|143_aa
MGDHPRIPGSTRSISPLYRADHFSLPGTPRLKRRRLLCSKPKGPKPLSSPQGGPAPALSA
SRPPVGAAPSASAASAQPSPGKSVQPSLILFTEMIYRFLAPVLSALLIRDPTRRPKEIQN
FAPEDSPHPCFDILNARKKEKSN

>gi568815591r:26093305_26297734|GENSCAN_predicted_CDS_6|432_bp
atgggtgaccacccaagaatccctggatcgacgcgctccatctccccgctctaccgcgct
gatcatttctccctgcccggcacacctcggctgaaaaggaggcggctcctctgctccaag
cccaagggacccaaacctctttccagcccgcaaggcggccccgcgcccgcgctcagcgcc
agccgtccgcctgtgggcgccgcgcctagtgcttccgccgcctccgcgcagcccagccca
gggaagtctgttcagccgtccctgatcctcttcacggagatgatatacagatttttggct
cctgtattgtcagcattgttgatcagggatcctaccagaagacccaaggaaatccagaat
tttgcaccagaggactcaccacatccttgcttcgacatcttgaatgccagaaagaaggaa
aaaagcaattga