GENSCAN 1.0 Date run: 6-Nov-116 Time: 08:08:47 Sequence gi568815579r:38925414_39132227 : 206814 bp : 48.80% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 887 792 96 2 0 86 93 120 0.462 12.51 1.01 Init - 5311 5057 255 1 0 85 78 414 0.807 35.33 1.00 Prom - 6146 6107 40 -9.26 2.00 Prom + 6885 6924 40 -10.25 2.01 Sngl + 6958 7287 330 0 0 65 49 567 0.507 46.52 2.02 PlyA + 7577 7582 6 1.05 3.07 PlyA - 8269 8264 6 1.05 3.06 Term - 15195 14806 390 0 0 28 43 180 0.704 2.49 3.05 Intr - 17338 17270 69 1 0 161 86 183 0.700 24.78 3.04 Intr - 19691 19556 136 1 1 28 94 162 0.997 11.37 3.03 Intr - 21154 21059 96 0 0 113 105 179 0.999 21.22 3.02 Intr - 23265 23154 112 1 1 90 72 54 0.967 3.44 3.01 Init - 24906 24558 349 0 1 71 25 644 0.936 51.75 3.00 Prom - 34770 34731 40 -3.46 4.00 Prom + 60255 60294 40 -3.16 4.01 Init + 62783 62858 76 1 1 101 96 28 0.852 4.70 4.02 Intr + 66037 66107 71 2 2 122 72 36 0.590 4.30 4.03 Term + 94810 94929 120 0 0 41 45 115 0.250 0.97 4.04 PlyA + 95238 95243 6 1.05 5.06 PlyA - 96123 96118 6 1.05 5.05 Term - 100141 99998 144 1 0 89 37 163 0.937 9.21 5.04 Intr - 101592 101457 136 2 1 133 94 141 0.999 19.77 5.03 Intr - 105711 105616 96 2 0 93 105 69 0.997 8.22 5.02 Intr - 105907 105796 112 2 1 69 68 67 0.620 2.24 5.01 Init - 106814 106451 364 2 1 103 81 536 0.623 49.61 5.00 Prom - 110414 110375 40 -2.86 6.04 PlyA - 111599 111594 6 1.05 6.03 Term - 127185 127105 81 0 0 109 48 82 0.654 3.99 6.02 Intr - 128004 127961 44 1 2 66 75 36 0.077 -1.94 6.01 Init - 132807 132462 346 0 1 73 64 264 0.388 19.98 6.00 Prom - 134113 134074 40 -7.16 7.00 Prom + 135060 135099 40 -3.76 7.01 Init + 140713 140777 65 0 2 67 70 56 0.369 2.32 7.02 Intr + 145498 145640 143 1 2 69 89 84 0.510 6.60 7.03 Intr + 159739 159977 239 2 2 109 111 20 0.101 3.73 7.04 Intr + 171179 171363 185 1 2 87 62 13 0.019 -2.81 7.05 Intr + 171651 171772 122 0 2 80 43 50 0.052 -0.16 7.06 Intr + 173045 173245 201 0 0 129 81 155 0.968 18.16 7.07 Intr + 173547 173729 183 1 0 97 68 224 0.997 21.06 7.08 Intr + 174814 174937 124 2 1 123 47 171 0.999 16.14 7.09 Intr + 175167 175229 63 0 0 94 95 75 0.963 6.63 7.10 Intr + 175326 175440 115 0 1 105 91 89 0.999 11.35 7.11 Intr + 175536 175643 108 2 0 103 100 192 0.923 22.38 7.12 Intr + 175737 175794 58 2 1 97 89 66 0.940 6.06 7.13 Intr + 175875 175942 68 1 2 83 94 83 0.985 7.02 7.14 Intr + 176053 176124 72 0 0 130 42 122 0.997 11.40 7.15 Intr + 181534 181671 138 0 0 86 82 349 0.975 34.76 7.16 Term + 184640 184705 66 1 0 139 50 53 0.715 4.54 7.17 PlyA + 186042 186047 6 1.05 8.04 PlyA - 190603 190598 6 1.05 8.03 Term - 199141 199112 30 1 0 127 43 -1 0.519 -2.85 8.02 Intr - 200435 200185 251 0 2 78 86 123 0.512 8.26 8.01 Init - 202485 202449 37 0 1 70 107 62 0.267 4.51 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579r:38925414_39132227|GENSCAN_predicted_peptide_1|117_aa MARRLWPLLTRRGFRPRGGCISNDSPRRSFTTEKRNRNLLYEYAREGYSALPQLDIERFC ACPEEAAHALELRKGELRSADLPAIISTWQELRQLQEQIRSLEEEKAAVTEAVRALL >gi568815579r:38925414_39132227|GENSCAN_predicted_CDS_1|351_bp atggcgcggcgcttgtggcctttgctgactcgtcgggggttccggccccggggaggctgc atctccaacgatagtccaaggagaagtttcactacagagaaacgaaaccggaacctcctg tacgagtatgcgcgcgagggctacagcgcactccctcagctggacatagagcggttctgc gcatgcccagaagaggccgcacacgccctggagctccgcaagggggagctgcgctcggcg gacctgcccgcgatcatctcgacatggcaggagctgaggcagctgcaggagcagatccgg agcctggaggaagagaaggcagctgtgactgaggcagtgcgggccctgctg >gi568815579r:38925414_39132227|GENSCAN_predicted_peptide_2|109_aa MATLNQMHRLGPPKRPPRKLGPTEGRPQLKGVVLCTFTRKPKKPNSANRKCCRVRLSTGR EAVCFIPGEGHTLQEHQIVLVEGGRTQDLPGVKLTVVRGKYDCGHVQKK >gi568815579r:38925414_39132227|GENSCAN_predicted_CDS_2|330_bp atggctaccctgaaccagatgcaccgcctggggccccccaagcggccgcctcggaagctg ggccccacggaaggccggccgcagctgaagggtgtggtcctgtgcacgtttacccgcaag ccgaagaagcccaactcagccaatcgcaagtgctgtcgagtgcggctcagcactggccgc gaggccgtctgcttcatccctggggagggccacaccctgcaggagcaccagattgtcctt gtggagggcggccgcacccaggacctgccaggcgtcaagctcaccgttgtgcgtggcaag tacgactgtggccacgtgcagaagaagtga >gi568815579r:38925414_39132227|GENSCAN_predicted_peptide_3|383_aa MGARLSRRRLPADPSLALDALPPELLVQVLSHVPPRSLVTRCRPVCRAWRDIVDGPTVWL LQLARDRSAEGRALYAVAQRCLPSNEDKEEFPLCALARYCLRAPFGRNLIFNSCGEQGFR GWEVEHGGNGWAIEKNLTPVPGAPSQTCFVTSFEWCSKRQLVDLVMEGVWQELLDSAQIE ICVADWWGARENCGCVYQLRVRLLDVYEKEVVKFSASPDPVLQWTERGCRQVSHVFTNFG KGIRYVSFEQYGRDFLQFKTWWADEASIQAPRNTQAQPQIDITAHQLLGVGGWAGLDAQV VMQDDAIEQLRGVCIRAWEKITSGGEQYPSFGAVKQGPKEPYTDFIARLQEYLKKVIADS AAQDIVLRLLAFDNANLNCQAAL >gi568815579r:38925414_39132227|GENSCAN_predicted_CDS_3|1152_bp atgggcgcccggctatcgcggcgacggctgccggcggacccatccctggccctggacgcg ctgcccccggagctgctggtgcaggtgctgagccacgtgccgccacgctccttggtcacg cgatgccgcccagtgtgccgcgcctggcgcgacatagtggacgggcccactgtgtggctg ctgcagctggcccgcgaccgcagcgccgagggccgcgcactctacgcagtggctcaacgc tgcctgcccagcaacgaagacaaggaggagttcccgctgtgcgccctggcgcgctactgt ctgcgcgcgcccttcggccgcaatctcatcttcaactcctgcggagagcagggcttcaga ggctgggaggtggagcatggcgggaacggctgggccatagaaaagaacctaacaccggtg cctggggctccttcgcagacctgcttcgtgacctctttcgaatggtgctccaagaggcag cttgtggacctggtgatggaaggggtgtggcaggagctgctggacagcgcccagattgag atctgtgtggctgactggtggggcgctcgagagaactgcggctgcgtctaccagctccgg gtccgccttctggatgtgtatgaaaaggaagtggtcaagttctcagcctcacctgacccg gtccttcagtggactgagaggggctgccgacaggtctcccacgtcttcaccaactttggc aagggcatccgctacgtatcttttgagcagtacgggagagacttcttacaatttaaaact tggtgggcagatgaagcttccattcaggctcctcgcaacacccaggcccaacctcaaatt gatataactgcacaccaacttttgggggtcggcggctgggctggtttagatgcacaggtg gtcatgcaggatgatgccatagagcagcttagaggagtgtgcatcagagcttgggaaaaa atcacttcaggtggagaacagtacccttcctttggtgctgtaaaacagggaccaaaagaa ccgtacacagattttatagctcggttacaggagtatcttaaaaaggtgattgcagattcg gctgctcaggatatagtgttgcggttattagcttttgacaatgctaatctcaattgccag gctgctctgtga >gi568815579r:38925414_39132227|GENSCAN_predicted_peptide_4|88_aa MAQCSLSLLGSSDLPAFAFRTAGTTAGATRSWRTSKRIGQECSLWTVTWKTANLQDLKDK AASLSVVQQEGLDNKNPSSGLVPPTLCP >gi568815579r:38925414_39132227|GENSCAN_predicted_CDS_4|267_bp atggctcagtgcagcctcagcctcctgggctcaagtgatcttcctgctttcgccttccga acagctgggactacagctggtgccacgaggagctggcggacctccaagagaattggccaa gaatgcagcctctggacagtgacctggaaaactgctaacctccaagacttgaaagataaa gctgccagtctttcagttgtacaacaagaaggcctggacaacaagaacccttcttctgga ttggttccaccgacgctttgtccttga >gi568815579r:38925414_39132227|GENSCAN_predicted_peptide_5|283_aa MGASVSRGRAARVPAPEPEPEEALDLSQLPPELLLVVLSHVPPRTLLGRCRQVCRGWRAL VDGQALWLLILARDHGATGRALLHLARSCQSPARNARPCPLGRFCARRPIGRNLIRNPCG QEGLRKWMVQHGGDGWVVEENRTTVPGAPSQTCFVTSFSWCCKKQVLDLEEEGLWPELLD SGRIEICVSDWWGARHDSGCMYRLLVQLLDANQTVLDKFSAVPDPIPQWNNNACLHVTHV FSNIKMGVRFVSFEHRGQDTQFWAGHYGARVTNSSVIVRVRLS >gi568815579r:38925414_39132227|GENSCAN_predicted_CDS_5|852_bp atgggcgcctcggtctccaggggccgggccgcccgggtccccgcgccggagccggaaccc gaagaggcgctggacctgagccaactacccccagagctgcttctggtggtgctgagccac gtccccccgcgcacgctgctcgggcgctgccgccaagtgtgccggggctggcgagccctg gtggacggccaggccctgtggctgctgatcctggcccgcgaccacggcgccaccggccgc gcgctgctgcacctcgcccgcagctgccagtctcccgcccgtaacgccaggccttgcccc ctgggccgcttctgcgcgcgcagacccatcggacgcaaccttattcgcaacccctgcggc caagaaggcctccgaaagtggatggtgcaacacggtggggacggctgggtggtggaggaa aacaggacaaccgtgcctggggccccttctcagacgtgcttcgtgacttcattcagctgg tgttgcaagaagcaggtcttggacctagaggaggagggtctgtggccagaactgctggat agtggcaggattgagatttgtgtctctgactggtggggagcccgacacgacagcggctgt atgtacagactcctcgtccaacttctagacgccaaccagactgttctagataaattctct gctgtgcctgatcccatcccgcagtggaacaacaatgcctgccttcacgtcacccacgtg ttctccaacatcaagatgggcgtccgctttgtgtctttcgaacaccggggccaggacaca cagttctgggctggccactatggagcccgtgtgaccaactccagtgtgatcgtgcgagtc cgtctgtcctag >gi568815579r:38925414_39132227|GENSCAN_predicted_peptide_6|156_aa MRDRDKLQLFFLHLSFLESAIKTPPLFGNIRNEKTPFKDTPKKRSTALVNRSTRNVDRQL GMKLESHHMMDSCVFTFDQYWDPFKSVISTFTDSSAKWRLTYGSIVKSTAHTISQCLCMG TKQGRRHLGQGALQRSSSTSTNDPVCIGSAGNYQEL >gi568815579r:38925414_39132227|GENSCAN_predicted_CDS_6|471_bp atgagggacagagacaaactgcagctgttcttcctccacttgtccttcctggagagtgca ataaaaacacctcctttatttggtaacataaggaatgagaaaacccctttcaaagacacg ccaaagaagagatcaactgccttagtaaacagatctacacgtaatgtagaccgccagtta gggatgaagttagaatctcaccacatgatggattcctgtgtcttcaccttcgaccagtac tgggatcctttcaagagtgtcatcagcaccttcacagatagctctgcaaaatggcggcta acctatggctccattgtcaagtcaacagcccacactatttcccaatgcctttgtatgggg accaagcagggccggcggcacctgggccagggtgctctgcagagatctagttccacctcc accaacgaccccgtgtgcataggttcagctggaaactaccaggaactctag >gi568815579r:38925414_39132227|GENSCAN_predicted_peptide_7|649_aa MILSEKTDSQKLVFERGLNVVDNCAAGPTPESLWRAGVFLEVGYTELTTLLLTMHSVTHA SALTDPPVGEHCLIILCSPSSQPFLLYLWGADLLSPPAFPRSAITTPPPCTPFLATGPVT VYSCYSPWESRGPWGLPALPQSKSICLTQFQSCFHIFRYLFSNTPLYWYQLTILVHFHAA DKDIPKTGKKKRFNWTYSSIWLGRPQNHGRSAGFTGMSQYTWLVKSISDMAVARENEKEA KVETPDKLIRSREPGSMTVTWTTWVPTRSEVQFGLQPSGPLPLRAQGTFVPFVDGGILRR KLYIHRVTLRKLLPGVQYVYRCGSAQGWSRRFRFRALKNGAHWSPRLAVFGDLGADNPKA VPRLRRDTQQGMYDAVLHVGDFAYNLDQDNARVGDRFMRLIEPVAASLPYMTCPGNHEER YNFSNYKARFSMPGDNEGLWYSWDLGPAHIISFSTEVYFFLHYGRHLVQRQFRWLESDLQ KANKNRAARPWIITMGHRPMYCSNADLDDCTRHESKVRKGLQGKLYGLEDLFYKYGVDLQ LWAHEHSYERLWPIYNYQVFNGSREMPYTNPRGPVHIITGSAGCEERLTPFAVFPRPWSA VRVKEYGYTRLHILNGTHIHIQQVSDDQDGKIVDDVWVVRPLFGRRMYL >gi568815579r:38925414_39132227|GENSCAN_predicted_CDS_7|1950_bp atgattttatccgagaagactgacagtcagaagcttgtgtttgaaagaggattaaacgtg gtagataactgtgctgctgggcccacgcctgagagcctgtggagggctggggtgttcctg gaggtaggatacacagagctgaccacactcctgctgaccatgcacagcgtgacccacgct tcagccctgacagatcccccagttggtgagcactgcctgatcatcctctgttccccatcc tcccagcccttcctgctgtacctgtggggagctgatctcctcagtccccctgcttttccc cggtctgccatcaccaccccaccaccatgcacccccttcctggctactggtcctgttact gtctactcctgctattctccttgggagtccaggggtccctgggggctcccagcgctgccc cagagcaagtccatctgtcttacccagttccaaagctgcttccacattttcaggtatctt ttcagcaacaccccactctactggtaccagcttactatattagtccattttcatgctgct gataaagacatacccaagactgggaagaaaaagaggtttaattggacttacagttccata tggctggggaggcctcagaatcatggcagaagtgctgggtttacaggcatgagccaatac acctggctagtgaaaagcatttctgacatggcggtggcaagagaaaatgagaaagaagca aaagtggaaacccctgataaactcatcagatctcgtgagccaggctccatgactgtaact tggaccacatgggtcccaacccgctctgaagtgcaattcgggttgcagccgtcggggccc ctgcccctccgcgcccagggcaccttcgtcccctttgtggacgggggcattctccggcgg aagctctacatacaccgagtcacgcttcgcaagctgctgccaggggttcagtatgtttat cgctgtggcagtgcgcagggctggagccgtcggttccgcttcagggccctcaagaatggg gcccactggagtccccgtctggctgtgtttggagacctgggggctgacaacccgaaggcc gtcccccggctgcgcagggacacccagcagggcatgtatgacgccgttctccatgtggga gactttgcctacaacctggatcaggacaacgcccgtgttggggataggttcatgcggctc attgaacccgtggctgccagcctgccgtacatgacatgccctgggaatcatgaagaacgc tacaacttctctaactacaaggctcgcttcagcatgccgggggataatgagggcctgtgg tacagctgggatctgggtcccgcccacatcatctccttctccaccgaggtctatttcttt ctccattatggccgccacttggtacagaggcagtttcgctggctggagagcgacctccag aaagccaataagaaccgggcagcccggccgtggatcatcactatggggcaccggcccatg tactgctccaacgcagatctggacgactgcacacgacatgaaagcaaggtccgcaaaggc ctccaaggcaagctgtacgggttggaggatcttttctacaaatatggagtggatctgcag ctgtgggctcatgagcactcgtatgaacgactgtggccaatttacaactaccaggtattt aacggcagccgagagatgccctacaccaacccgcgagggcctgtccacatcatcacagga tctgctggctgtgaggagcggctgacgccctttgctgtcttcccgaggccctggagtgcc gtgcgtgtgaaggagtacgggtatacgcggctgcacatcctcaacgggacccacatccac atccagcaggtgtcggacgaccaggatgggaagatcgtagatgatgtctgggtggtgaga cccctgtttggccggaggatgtacctctag >gi568815579r:38925414_39132227|GENSCAN_predicted_peptide_8|105_aa MPGPLHTLLLLPALLHRHHRGHSRRHVESPTNIRGFPPSPPTSPHSNEADRAVGRASAGA ESGASQGPLQPPIGTERPIGDLRAENAKRGLEILEKGFTQMSLSL >gi568815579r:38925414_39132227|GENSCAN_predicted_CDS_8|318_bp atgccagggcctctgcacacactgctcctcctgcctgctctcttgcaccgccaccaccgc ggacactcccgccgccatgttgaatccccaacgaacatccggggctttcccccctcaccg cccaccagtccccatagcaacgaggctgaccgggcagtgggcagggccagtgcgggggcg gagtcgggcgcctctcagggcccgctccagcctccgattggtacagagcgtccaatcgga gaccttcgggcagaaaatgccaagcggggattggaaattttggaaaagggctttacacag atgtcactttctctgtga