GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:58:57

Sequence gi568815597f:225964352_226171476 : 207125 bp : 45.87% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.08 PlyA -    887    882    6                              -0.45
 1.07 Term -   1433   1400   34  1  1  128   49    30 0.309   0.16
 1.06 Intr -   2302   2182  121  2  1   64  113    43 0.242   3.95
 1.05 Intr -   4485   4410   76  0  1   75  102   -25 0.121  -3.31
 1.04 Intr -   6695   6571  125  0  2   57  101   113 0.665   9.80
 1.03 Intr -   6871   6804   68  0  2   54  105    54 0.532   2.25
 1.02 Intr -  10236  10184   53  0  2   29   86    76 0.333  -0.79
 1.01 Init -  13893  13879   15  0  0   78   84    15 0.110  -1.02
 1.00 Prom -  15643  15604   40                              -6.56

 2.08 PlyA -  17621  17616    6                               1.05
 2.07 Term -  21172  20951  222  1  0  130   48   184 0.997  15.52
 2.06 Intr -  24037  23545  493  0  1   87   92   296 0.973  22.90
 2.05 Intr -  27012  26892  121  1  1   19  119    18 0.925  -2.45
 2.04 Intr -  28216  28047  170  0  2   98  100   291 0.915  30.89
 2.03 Intr -  28651  28540  112  2  1   51   57   132 0.192   5.84
 2.02 Intr -  31032  30915  118  0  1   31   75    29 0.122  -4.06
 2.01 Init -  34961  34842  120  2  0   96   94   118 0.365  13.47
 2.00 Prom -  51789  51750   40                              -4.06

 3.00 Prom +  73016  73055   40                              -2.76
 3.01 Init +  98263  98460  198  0  0   78   98   174 0.995  14.30
 3.02 Term +  98661  98786  126  2  0   93   46    51 0.985  -0.32
 3.03 PlyA +  99688  99693    6                               1.05

 4.00 Prom +  99844  99883   40                              -6.86
 4.01 Init + 100001 100128  128  1  2   79   67   106 0.987   7.23
 4.02 Intr + 101305 101458  154  1  1   47   98   203 0.983  17.27
 4.03 Term + 107000 107128  129  1  0  124   48   108 0.993   8.58
 4.04 PlyA + 107418 107423    6                               1.05

 5.00 Prom + 116885 116924   40                              -2.96
 5.01 Init + 127193 127252   60  1  0   79   87   -10 0.067  -0.75
 5.02 Intr + 137256 137317   62  2  2   77   52    82 0.345   1.03
 5.03 Term + 140558 140738  181  2  1  140   47   125 0.983  10.68
 5.04 PlyA + 142016 142021    6                               1.05

 6.00 Prom + 143166 143205   40                              -3.86
 6.01 Init + 145158 145172   15  2  0   28   91    15 0.579  -3.90
 6.02 Term + 146558 146731  174  1  0   56   44   275 0.826  17.66
 6.03 PlyA + 147696 147701    6                               1.05

 7.00 Prom + 147853 147892   40                              -7.96
 7.01 Init + 150553 150618   66  0  0   52   39   122 0.471   2.77
 7.02 Intr + 154639 154729   91  0  1   73   94    31 0.398   1.77
 7.03 Intr + 155389 155502  114  2  0   38   94    74 0.391   3.42
 7.04 Intr + 156108 156169   62  1  2   69   75    33 0.253  -1.45
 7.05 Intr + 160956 161015   60  2  0   75  100    72 0.728   6.03
 7.06 Intr + 164331 164516  186  2  0   78   73    27 0.264   0.09
 7.07 Intr + 169019 169103   85  1  1   72  110    10 0.548   0.99
 7.08 Term + 170768 170967  200  0  2   79   50   132 0.914   6.06
 7.09 PlyA + 173589 173594    6                               1.05

 8.08 PlyA - 176398 176393    6                               1.05
 8.07 Term - 182470 182259  212  2  2  109   42   207 0.999  15.66
 8.06 Intr - 188268 187984  285  1  0   50   70   290 0.746  20.71
 8.05 Intr - 190482 190296  187  0  1   95  100   194 0.999  20.56
 8.04 Intr - 195007 194833  175  0  1   60   96   252 0.998  23.14
 8.03 Intr - 197338 197180  159  0  0   85  106   199 0.999  20.50
 8.02 Intr - 201649 201508  142  2  1  115   77    80 0.998   9.01
 8.01 Intr - 203940 203910   31  0  1  106   95    13 0.744   1.50

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  90280  90414  135  0  0   33   48   125 0.802   1.02

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:225964352_226171476|GENSCAN_predicted_peptide_1|163_aa
MAISLRYFTGCVSVVAQDDEAVRLWVRNWWVLGLTDFKNEATDPGGVKLQIFAVSVRALK
AAHLEFFVPPGGLMVSLASEVKLQTFAMLFIKVLIEPGTVVYACNPSILEAQVKRNVVGE
QGDNKEKVSNKHVSNRIYVIMKFKGRYYDWTCTNFTELVLLLL

>gi568815597f:225964352_226171476|GENSCAN_predicted_CDS_1|492_bp
atggcgattagcctgcgctatttcacgggctgtgtttctgtggttgctcaggacgatgag
gctgtaaggttgtgggtccggaattggtgggttcttggtctcactgacttcaagaatgaa
gccacggaccctggaggagtgaagctgcagatcttcgcggtgagtgttagagctcttaag
gcagcgcatctggagttcttcgttcctcccggtgggctcatggtctcgctggcttcagaa
gtgaagctgcagaccttcgcgatgctgtttataaaagtactaatagaaccaggcacggtg
gtttatgcctgtaatcccagcattttggaagcccaagtaaaaaggaatgtagtaggagag
cagggtgataataaggagaaggtcagcaacaaacatgtgagcaatagaatctatgtcata
atgaagttcaagggaaggtactatgactggacgtgcaccaattttactgaactggtgctg
ctccttttgtag

>gi568815597f:225964352_226171476|GENSCAN_predicted_peptide_2|451_aa
MAEAAALVWIRGPGFGCKAVRCASGRCTVRDFIHRHCQDQNVPVENFFVKCNGALINTSD
TVQHGAVYSLEPRLCGGKGGFGSMLRALGAQIEKTTNREACRDLSGRRLRDVNHEKAMAE
WVKQQAEREAEKEQKRLERLQRKLVEPKHCFTSPDYQQQCHEMAERLEDSVLKGMQAASS
KMVSAEISENRKRQWPTKSQTDRGASAGKRRCFWLGMEGLETAEGSNSESSDDDSEEAPS
TSGMGFHAPKIGSNGVEMAAKFPSGSQRARVVNTDHGSPEQLQIPVTDSGRHILEDSCAE
LGESKEHMESRMVTETEETQEKKAESKEPIEEEPTGAGLNKDKETEERTDGERVAEVAPE
ERENVAVAKLQESQPGNAVIDKETIDLLAFTSVAELELLGLEKLKCELMALGLKCGGTLQ
ERAARLFSVRGLAKEQIDPALFAKPLKGKKK

>gi568815597f:225964352_226171476|GENSCAN_predicted_CDS_2|1356_bp
atggcggaggccgcggcgctggtgtggattcgcggccctggcttcgggtgcaaggcggtg
cggtgtgcctcgggtcggtgcaccgtccgggattttatccaccggcactgccaagatcag
aatgttccagtggaaaacttctttgtgaaatgcaatggagcactcattaacaccagtgac
acagtgcagcatggagctgtttatagtttggaacccagactttgcggtggaaaaggaggt
tttggatctatgctccgagcacttggtgctcagattgagaagacaaccaatcgagaagct
tgtcgggatctcagtggaaggagactacgcgatgtcaatcatgaaaaagcaatggctgaa
tgggtaaaacaacaagccgagcgagaggctgaaaaggagcagaagcggctggagcgactg
cagcggaagcttgtagaacccaagcactgcttcaccagccccgactaccagcagcagtgc
catgagatggctgagcgtctggaggattccgtcctcaaaggtatgcaggctgcctccagc
aagatggtttcagcagaaatcagtgagaatcggaaacggcaatggcctactaaatctcaa
acagacagaggagccagtgcgggaaagaggagatgcttctggttgggcatggagggacta
gagactgcagaagggtccaactctgagagttcagatgatgacagtgaagaagcaccaagc
acttcaggaatgggtttccatgctccaaaaattggtagcaatggtgtcgagatggcagcc
aaatttcccagtggttctcagagggcgagagtagtgaatacagaccatggatcaccagaa
caactgcagatcccggtgactgactctgggaggcatattttagaagactcatgtgctgag
ctgggggagtccaaagagcacatggaaagcaggatggttacagaaacagaagagacccag
gagaagaaggcagagagtaaagaacccatagaagaggagcccactggggctggactgaat
aaggataaagagacagaagaaaggactgatggggaaagagttgctgaggtagcacctgaa
gaaagggaaaacgttgccgttgccaaactgcaggaaagccagccaggaaacgcagttatt
gataaggaaactatagatttattggcgttcacctctgttgcagaactggagttgctgggt
ttggagaagctcaaatgtgaactgatggcccttggactgaaatgtgggggcactctgcag
gagcgggcagcaagactcttctctgtcagaggactggcaaaggagcaaattgacccggct
ttatttgccaagcctttgaaagggaagaaaaaatga

>gi568815597f:225964352_226171476|GENSCAN_predicted_peptide_3|107_aa
MGAGPSLPSIVCDWLARGAGAGRRVLGDSLGVSHLSIVFAAAAAPPSLSNASAASRSPSS
SRRRRGPNSSNRRGAEPALPPPPRAPGSSFFGEIPPAAPSPDPKPSP

>gi568815597f:225964352_226171476|GENSCAN_predicted_CDS_3|324_bp
atgggggcggggccgagccttccctccattgtgtgtgattggctggcgcgcggcgcgggg
gcggggcggcgtgtgttgggggatagcctcggtgtcagccatctttcaattgtgttcgca
gccgccgccgcgccgccgtcgctctccaacgccagcgccgcctctcgctcgccgagctcc
agccgaaggagaagggggcccaacagcagcaaccgccgcggcgccgagcctgctctccct
cctcctccccgcgcccctggctcctctttcttcggtgaaatcccgcccgccgccccttcc
ccggaccccaaaccttcaccatga

>gi568815597f:225964352_226171476|GENSCAN_predicted_peptide_4|136_aa
MARTKQTARKSTGGKAPRKQLATKAARKSAPSTGGVKKPHRYRPGTVALREIRRYQKSTE
LLIRKLPFQRLVREIAQDFKTDLRFQSAAIGALQEASEAYLVGLFEDTNLCAIHAKRVTI
MPKDIQLARRIRGERA

>gi568815597f:225964352_226171476|GENSCAN_predicted_CDS_4|411_bp
atggctcgtacaaagcagactgcccgcaaatcgaccggtggtaaagcacccaggaagcaa
ctggctacaaaagccgctcgcaagagtgcgccctctactggaggggtgaagaaacctcat
cgttacaggcctggtactgtggcgctccgtgaaattagacgttatcagaagtccactgaa
cttctgattcgcaaacttcccttccagcgtctggtgcgagaaattgctcaggactttaaa
acagatctgcgcttccagagcgcagctatcggtgctttgcaggaggcaagtgaggcctat
ctggttggcctttttgaagacaccaacctgtgtgctatccatgccaaacgtgtaacaatt
atgccaaaagacatccagctagcacgccgcatacgtggagaacgtgcttaa

>gi568815597f:225964352_226171476|GENSCAN_predicted_peptide_5|100_aa
MPLIVKHTIILCTMRKGENASVEKLSSTKPEPAVKKVGDLCKTLLMSPFLRMSPFLLSPG
PIPIRLGPRCSSKTALASVTIAELGVHPLSSLLDLAATSI

>gi568815597f:225964352_226171476|GENSCAN_predicted_CDS_5|303_bp
atgcccttgattgtaaaacataccatcattttatgtaccatgaggaaaggggaaaatgct
tctgtggaaaaactgtcttccacgaaaccagaacctgctgtcaaaaaggttggagacctc
tgcaaaactctgctgatgagtccattcctgcggatgagtccatttctgctctctcctgga
cctattcccatcaggcttggccctcgctgttcctccaaaactgctcttgccagcgtcacc
attgctgaacttggtgttcatcccttgtcctcattacttgatcttgcagcaacatccatc
tga

>gi568815597f:225964352_226171476|GENSCAN_predicted_peptide_6|62_aa
MDGFQPHSRRELQQKLAILKLGYPSPPPGKIVIISNLQKRKPELNSLLEVTVLLPNGGSP
DF

>gi568815597f:225964352_226171476|GENSCAN_predicted_CDS_6|189_bp
atggatggcttccagccacacagccggagggagctgcagcagaaacttgccattttaaag
ctcggctacccctcaccaccacctggcaagattgtcatcatctccaatctacagaagcgg
aagccggagctaaatagcctgctagaggtcacggtgctgctgcccaacggcgggagcccg
gatttctga

>gi568815597f:225964352_226171476|GENSCAN_predicted_peptide_7|287_aa
MGRLAGRGADPPTSLPDGAAGRTGVLSDPQYSISQATGSRPLDITGQQGGAPGLSITLGE
ARAAGEGQLSNLNGNEEGTIKPALASASSAERCPTEFGNAVVERGWESPSQEGLRSPQTQ
MRPLDGAKASKIPAPTPAWNAHSAFITALNSTWFPKAMSPHTTSQGYNTLGFSDTVNTKQ
HELQDTQDSWRGAEPHETHTKDGPAFQGSLDPWFLSWKRGAGCQLLDSRARAGLHARLRQ
RLAPGAQVLQECVFNELLLNAAEPCSTLLPRPGVWEPDDSPAREGRT

>gi568815597f:225964352_226171476|GENSCAN_predicted_CDS_7|864_bp
atggggcggctggccgggcggggggctgacccccccacctccctcccggacggggcggct
ggccggacaggagtcctcagtgaccctcaatattcaatcagccaagccacaggcagcagg
cccttggacatcactggacaacagggaggagccccaggcctgagcatcaccctaggggag
gccagggctgctggggaggggcagctttccaacctcaatgggaatgaagagggcaccatc
aaaccagcactggcttccgcctcttctgcagagagatgccctactgagtttggcaatgct
gtggttgagcgagggtgggagtccccaagccaggaagggctgcggtccccgcagacccag
atgcgtccactagatggcgccaaagccagtaagattccagcccccaccccagcctggaac
gcccacagtgcttttatcacagctcttaactccacttggttccccaaagctatgagccca
cataccacctcccagggctacaacactttgggtttctcagacactgtcaacactaagcag
catgaactgcaggacacacaggacagctggagaggggcagaaccgcatgagacacacacg
aaagacggccctgccttccagggctcgctagatccttggttcctgtcgtggaagagaggg
gcaggatgtcagttacttgacagcagggcccgtgctgggctccatgcccggcttcgccag
cgtctggcgccgggagcgcaggtgctccaagagtgtgtgttcaatgaattgttactgaat
gctgcagaaccctgctccacgctgcttcctcgccctggcgtttgggagcctgatgacagc
cctgcccgtgaaggccgcacctag

>gi568815597f:225964352_226171476|GENSCAN_predicted_peptide_8|396_aa
IPGVNQGLKIEKDGKAFHPTYEEKLKLVALHKQVLMGPYNPDTCPEVGFFDVLGNDRRKE
EEERRRREEEERERLQKEEEKRRREEEERLRREEEERRRIEEERLRLEQQKQQIMAALNS
QTAVQFQQYAAQQYPGNYEQQQILIRQLQEQHYQQYMQQLYQVQLAQQQAALQKQQEVVV
AGSSLPTSSKVNATVPSNMMSVNGQAKTHTDSSEKELEPEAAEEALENGPKESLPVIAAP
SMWTRPQIKDFKEKIQQDADSVITVGRGEVVTVRVPTHEEGSYLFWEFATDNYDIGFGVY
FEWTDSPNTAVSVHVSESSDDDEEEEENIGCEEKAKKNANKPLLDEIVPVYRRDCHEEVY
AGSHQYPGRGVYLLKFDNSYSLWRSKSVYYRVYYTR

>gi568815597f:225964352_226171476|GENSCAN_predicted_CDS_8|1191_bp
atacccggggttaatcaaggtctgaaaatagaaaaagatggcaaagcatttcatccaact
tatgaagaaaaattgaagcttgtggcactgcataagcaagttcttatgggcccatataat
ccagacacttgtcctgaggttggattctttgatgtgttggggaatgacaggaggaaggag
gaagaggagcgaaggcggcgtgaagaggaagaaagagaacgtctgcaaaaggaggaagag
aaacgtaggagagaagaagaggaaaggcttcgacgggaggaagaggaaaggagacggata
gaagaagaaaggcttcggttggagcagcaaaagcagcagataatggcagctttaaactcc
cagactgccgtgcagttccagcagtatgcagcccaacagtatccagggaactacgaacag
cagcaaattctcatccgccagttgcaggagcaacactatcagcagtacatgcagcagttg
tatcaagtccagcttgcacagcaacaggcagcattacagaaacaacaggaagtagtagtg
gctgggtcttccttgcctacatcatcaaaagtgaatgcaactgtaccaagtaatatgatg
tcagttaatggacaggccaaaacacacactgacagctccgaaaaagaactggaaccagaa
gctgcagaagaagccctggagaatggaccaaaagaatctcttccagtaatagcagctcca
tccatgtggacacgacctcagatcaaagacttcaaagagaagattcagcaggatgcagat
tccgtgattacagtgggccgaggagaagtggtcactgttcgagtacccacccatgaagaa
ggatcatatctcttttgggaatttgccacagacaattatgacattgggtttggggtgtat
tttgaatggacagactctccaaacactgctgtcagcgtgcatgtcagtgagtccagcgat
gacgacgaggaggaagaagaaaacatcggttgtgaagagaaagccaaaaagaatgccaac
aagcctttgctggatgagattgtgcctgtgtaccgacgggactgtcatgaggaggtgtat
gctggcagccatcaatatccagggagaggagtctatctcctcaagtttgacaactcctac
tctttgtggcggtcaaaatcagtctactacagagtctattatactagataa