GENSCAN 1.0 Date run: 3-Nov-116 Time: 04:18:54

Sequence gi568815576f:26522584_26730484 : 207901 bp : 47.81% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.07 PlyA -    992    987    6                               1.05
 1.06 Term -   3684   3637   48  0  0  101   42    50 0.333  -1.00
 1.05 Intr -  10162  10112   51  1  0  134   66    30 0.863   4.50
 1.04 Intr -  13903  13705  199  0  1  129  100   322 0.965  36.85
 1.03 Intr -  19135  18206  930  0  0  136   89  1715 0.328 166.73
 1.02 Intr -  22092  22021   72  2  0  111  107    26 0.268   5.32
 1.01 Init -  28058  28009   50  2  2   56   84    39 0.236   0.72
 1.00 Prom -  30812  30773   40                              -7.26

 2.00 Prom +  34497  34536   40                              -4.46
 2.01 Sngl +  37943  38578  636  1  0   76   38   836 0.940  73.59
 2.02 PlyA +  39731  39736    6                               1.05

 3.00 Prom +  43165  43204   40                              -7.16
 3.01 Init +  46041  46096   56  2  2  101  100     8 0.185   4.06
 3.02 Intr +  55735  55880  146  1  2   66   89   126 0.363  10.43
 3.03 Term +  55935  56011   77  1  2   94   43    10 0.195  -4.90
 3.04 PlyA +  56405  56410    6                               1.05

 4.07 PlyA -  58400  58395    6                               1.05
 4.06 Term -  66611  66599   13  1  1  136   54     8 0.219  -0.13
 4.05 Intr -  77090  76911  180  1  0  110   89   207 0.520  21.98
 4.04 Intr -  79438  79296  143  1  2  120   85   228 0.999  24.85
 4.03 Intr -  85438  85306  133  0  1   73  101   261 0.997  26.55
 4.02 Intr -  89607  89489  119  0  2   77   84   158 0.948  13.56
 4.01 Init -  93736  93557  180  1  0   85   86   186 0.998  15.55
 4.00 Prom -  95478  95439   40                              -1.16

 5.00 Prom +  97540  97579   40                              -6.86
 5.01 Init +  99383  99403   21  1  0   46  123    30 0.222   0.50
 5.02 Intr + 100651 100769  119  0  2   92   75   186 0.244  16.96
 5.03 Intr + 102898 103039  142  1  1  122   87   140 0.998  17.66
 5.04 Intr + 105705 105847  143  2  2  111   88   130 0.999  14.45
 5.05 Term + 107757 107904  148  0  1  115   47   121 0.951   8.07
 5.06 PlyA + 108064 108069    6                               1.05

 6.00 Prom + 119234 119273   40                              -6.36
 6.01 Sngl + 119602 119814  213  0  0   97   46   211 0.979  12.99
 6.02 PlyA + 120790 120795    6                              -1.95

 7.04 PlyA - 121150 121145    6                              -0.45
 7.03 Term - 122795 122470  326  0  2  -10   47   298 0.771  10.83
 7.02 Intr - 123848 123690  159  0  0   99   91    74 0.987   8.76
 7.01 Init - 124522 124465   58  1  1   84   86    27 0.944   3.76
 7.00 Prom - 125369 125330   40                              -6.36

 8.04 PlyA - 129767 129762    6                               1.05
 8.03 Term - 146855 146840   16  1  1  127   55    22 0.047   0.51
 8.02 Intr - 152039 151910  130  0  1  100   66    74 0.242   6.15
 8.01 Init - 183355 183205  151  1  1   64   20   114 0.286   2.40
 8.00 Prom - 183433 183394   40                              -0.86

 9.02 PlyA - 183682 183677    6                               1.05
 9.01 Term - 191437 191343   95  2  2   65   50   116 0.651   3.49

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576f:26522584_26730484|GENSCAN_predicted_peptide_1|449_aa
MAEPEHVGTGPCSRATLALPADSLGTQAQGLEPRDESAQGRCEPGGLPANLSLKPQKRAL
RPGLPCLRPSMRLSVRRVLLAAGCALVLVLAVQLGQQVLECRAVLAGLRSPRGAMRPEQE
ELVMVGTNHVEYRYGKAMPLIFVGGVPRSGTTLMRAMLDAHPEVRCGEETRIIPRVLAMR
QAWSKSGREKLRLDEAGVTDEVLDAAMQAFILEVIAKHGEPARVLCNKDPFTLKSSVYLS
RLFPNSKFLLMVRDGRASVHSMITRKVTIAGFDLSSYRDCLTKWNKAIEVMYAQCMEVGK
EKCLPVYYEQLVLHPRRSLKLILDFLGIAWSDAVLHHEDLIGKPGGVSLSKIERSTDQVI
KPVNLEALSKWTGHIPGDVVRDMAQIAPMLAQLGYDPYANPPNYGNPDPFVINNTQRVLK
GDYKTPANLKGYFQISANDFIAKKRRKCI

>gi568815576f:26522584_26730484|GENSCAN_predicted_CDS_1|1350_bp
atggctgagcctgagcacgtgggcactgggccctgctccagggccactctggctctgcca
gcagacagccttggcacacaggcacaagggctggagcccagagatgagagtgcccaaggg
agatgtgagcctggcgggctgcccgctaacctgtcgctgaagccccagaagcgggccctc
aggccaggcctaccctgcctccggcccagcatgcgcctgtcggtgcggagggtgctgctg
gcagccggctgcgccctggtcctggtgctggcggttcagctgggacagcaggtgctagag
tgccgggcggtgctggcgggcctgcggagcccccggggggccatgcggcctgagcaggag
gagctggtgatggtgggcaccaaccacgtggaataccgctatggcaaggccatgccgctc
atcttcgtgggtggcgtgcctcgcagtggcaccacgttgatgcgcgccatgctggacgcg
caccccgaggtgcgctgcggcgaggagacccgcatcatcccgcgcgtgctggccatgcgc
caggcctggtccaagtctggccgtgagaagctgcggctggatgaggcgggggtgacggat
gaggtgctggacgccgccatgcaggccttcatcctggaggtgattgccaagcacggagag
ccggcccgcgtgctctgcaacaaggacccatttacgctcaagtcctcggtctacctgtcg
cgcctgttccccaactccaagttcctgctgatggtgcgggacggccgggcctccgtgcac
tccatgatcacgcgcaaagtcaccattgcgggctttgacctcagcagctaccgtgactgc
ctcaccaagtggaacaaggccatcgaggtgatgtacgcccagtgcatggaggtaggcaag
gagaagtgcctgcctgtgtactacgagcagctggtgctgcaccccaggcgctcactcaag
ctcatcctcgacttcctcggcatcgcctggagcgacgctgtcctccaccatgaagacctc
attggcaagcccggtggtgtctccctgtccaagatcgagcggtccacggaccaggtcatc
aagcctgttaacctggaagcgctctccaagtggactggccacatccctggggatgtggtg
cgggacatggcccagatcgcccccatgctggctcagctcggctatgacccttatgcaaac
ccccccaactatggcaaccctgaccccttcgtcatcaacaacacacagcgggtcttgaaa
ggggactataaaacaccagccaatctgaaaggatattttcagatctccgcaaatgacttc
attgccaagaagagaagaaaatgcatttaa

>gi568815576f:26522584_26730484|GENSCAN_predicted_peptide_2|211_aa
MSKGDPKKLRGKMSSHAFFGQTCREAHKKKHPDASVNLSEFSKKCSERWKTMSAKEKGKF
EDMAKADKAHYEREMKTYIPPKGETKKKFKDPNAPKRTPSAFFLFCSAYRPKIKGEHPGL
SIGDVAKKLGEMWNNTAADDKQPYEKKAAKLKEKYEKDIAAYRAKGKPDAAKKGVVKAEK
SKKKKEEEEDEEDEEDEEEEDEEDEDEEDDE

>gi568815576f:26522584_26730484|GENSCAN_predicted_CDS_2|636_bp
atgagcaaaggagatcctaagaagctgagaggcaaaatgtcatcacatgcattttttggg
caaacttgtcgggaggcgcataagaagaagcacccagatgcttcagtcaacctctcagag
ttttctaagaagtgctcagagaggtggaagaccatgtctgctaaagagaaaggaaaattt
gaagatatggcaaaggcggacaaggcccattacgaaagagaaatgaaaacctatatccct
cccaaaggggagacaaaaaagaagttcaaggatccgaatgcacccaagaggactccttcg
gccttcttcctgttctgctctgcgtatcgcccaaaaatcaaaggagaacatcctggcctg
tccattggtgatgttgcgaagaaactgggagagatgtggaataacactgccgcagatgac
aagcagccttatgaaaagaaggctgcgaagctgaaggaaaaatacgaaaaggatattgct
gcatatcgagctaaaggaaagcctgatgcagcaaaaaagggagttgtcaaggctgaaaaa
agcaagaaaaagaaggaagaggaggaagatgaggaagatgaagaggatgaggaggaggaa
gatgaagaagatgaagatgaagaagatgatgaataa

>gi568815576f:26522584_26730484|GENSCAN_predicted_peptide_3|92_aa
MTFTNAIAMPEVTLYGLKSSQRGFSDQQEQRLGTYTKCRLSGLKETTLNGNSGSGVQQPV
LKKPSIHGLCHGDYAWGPQGPQRRKILPRVPQ

>gi568815576f:26522584_26730484|GENSCAN_predicted_CDS_3|279_bp
atgacgtttacaaatgccatagcaatgccagaagttaccctatatggtctaaaaagctct
caacgtgggttctctgaccagcaggaacagcgcttaggaacctacacaaaatgcagatta
tcagggctcaaggagaccacgctgaatgggaactctggaagtggggtacagcaacctgtg
ttgaagaagccctccatccacggactgtgccatggggactatgcctggggaccccaaggt
cctcagagaaggaagattttgcccagagtcccgcagtag

>gi568815576f:26522584_26730484|GENSCAN_predicted_peptide_4|255_aa
MSQAAKASASATVAVNPGPDTKGKGAPPAGTSPSPGTTLAPTTVPITSAKAAELPPGNYR
LVVFELENFQGRRAEFSGECSNLADRGFDRVRSIIVSAGPWVAFEQSNFRGEMFILEKGE
YPRWNTWSSSYRSDRLMSFRPIKMDAQEHKISLFEGANFKGNTIEIQGDDAPSLWVYGFS
DRVGSVKVSSGTWVGYQYPGYRGYQYLLEPGDFRHWNEWGAFQPQMQSLRRLRDKQWHLE
GSFPVLATEPPKWGD

>gi568815576f:26522584_26730484|GENSCAN_predicted_CDS_4|768_bp
atgtctcaggctgcaaaggcctcggcctcggccacagtggcggtgaacccagggcctgac
accaaggggaagggggccccacctgcaggaacatcccctagtcccggcactaccctggcc
ccaacaaccgtgcctattaccagcgccaaggcggcggaactgcctcctgggaactacagg
ctggtggtcttcgaactggaaaacttccagggccgtcgagcagaattctcgggggagtgc
tcaaatctggcagaccgtggcttcgaccgtgtgcgcagcatcattgtctccgcgggaccc
tgggtcgcctttgagcagtccaacttccgcggggagatgttcatcctggagaagggcgag
taccctcgctggaacacatggtcgagcagctaccgcagtgatcggctcatgtccttccgg
cccatcaaaatggatgcccaggagcacaaaatctccctgtttgaaggggccaacttcaag
ggcaacaccatagagatccagggggacgacgcacccagtctctgggtctacggcttcagt
gaccgcgtgggcagcgtgaaggtctccagtggaacatgggttggctatcagtatcctggc
taccgcgggtaccagtacctcctagagcctggtgacttccggcactggaatgagtgggga
gccttccagccacagatgcagtccctgcgtcgcctgcgtgacaagcagtggcacctcgag
gggtccttccctgtcctggccacagagccccccaaatggggagactga

>gi568815576f:26522584_26730484|GENSCAN_predicted_peptide_5|190_aa
MFPGPISMVVWDEDGFQGRRHEFTAECPSVLELGFETVRSLKVLSGAWVGFEHAGFQGQQ
YILERGEYPSWDAWGGNTAYPAERLTSFRPAACANHRDSRLTIFEQENFLGKKGELSDDY
PSLQAMGWEGNEVGSFHVHSGAWVCSQFPGYRGFQYVLECDHHSGDYKHFREWGSHAPTF
QVQSIRRIQQ

>gi568815576f:26522584_26730484|GENSCAN_predicted_CDS_5|573_bp
atgttccctgggcctatctcgatggtggtgtgggatgaggacggcttccagggccggcgg
cacgagttcacggccgagtgccccagcgtgctggagcttggcttcgagactgtgcgatct
ttgaaagtgctgagtggagcgtgggtgggctttgagcatgctggcttccaagggcagcag
tacattctggaacgaggcgaatatccaagctgggatgcctggggcggcaacacggcctac
cccgccgagaggctcacctccttccggcctgcggcctgtgctaaccaccgtgactcgagg
ctgacaatcttcgagcaagagaacttcctgggcaagaaaggagagctgagcgatgactat
ccttccctccaggccatgggatgggaaggcaatgaagtagggtccttccacgtccactct
ggggcctgggtttgctcccagtttccgggctaccgaggatttcagtatgtgctggaatgc
gatcaccattccggtgactacaaacatttccgggagtggggctctcatgccccgaccttc
caggtgcagagcatccgcaggatccagcagtga

>gi568815576f:26522584_26730484|GENSCAN_predicted_peptide_6|70_aa
MDEPAGHYAIMLSDIKTWFLLLPLYLSSHYCSTEIPKSKAVTKTSVGVIREKGTQPPSVK
GGEKLKILHI

>gi568815576f:26522584_26730484|GENSCAN_predicted_CDS_6|213_bp
atggatgaacctgcaggacattatgctattatgctaagtgacatcaagacctggttcctc
ctcctgcccctctacctgagctctcactactgctccactgagatccccaagagcaaagct
gtgacaaagacttcagtgggagtcatcagagaaaaaggaacacaaccacctagcgtgaaa
gggggtgagaagctgaagatcctacacatctag

>gi568815576f:26522584_26730484|GENSCAN_predicted_peptide_7|180_aa
MLTVSSQGLQGPQLLMPSKACPPPHSHATAVLQNSTFLIYEMHLGSLQISCECHNACRRE
AENSGANIFQKAGGTSTKKLISWVEEDGCRPGVVLKGRDADSGHFLAEGQAPRGLPGTPG
VSSSPEAGDGQIRLTDSCVQRLLEITKGSEFLRLQVEGGRCSGFQYRCSLDTVINPDDGI

>gi568815576f:26522584_26730484|GENSCAN_predicted_CDS_7|543_bp
atgctgacggtgtcatcccagggtctccagggaccccagctgcttatgccttctaaagcc
tgcccacccccacacagccacgccacagccgtcctgcagaacagtacattccttatttat
gagatgcatctgggctctctgcaaatcagctgtgaatgccacaatgcctgcaggagggag
gcagagaattctggagccaacatcttccagaaggcaggtggaacctcaacaaagaaattg
atctcctgggtggaggaagatggttgccgcccaggggttgtccttaagggccgcgacgca
gacagcggtcacttcctggccgaggggcaggctcctcgcggcctccctgggactccgggc
gtgtcgtccagccccgaggccggcgacgggcagatccgcctcacggacagctgcgtccag
aggcttctggaaatcaccaaaggatcagaattcctcaggctgcaggtggagggaggtaga
tgctccggattccagtacagatgttcactggatacagttatcaaccccgacgacggtatt
tga

>gi568815576f:26522584_26730484|GENSCAN_predicted_peptide_8|98_aa
MPKAIETKAKFDKWDLIKLKSFCKAKETINRVNRQPTGWEKNFANYASDKAQASGSPFTP
TANYNWLALSPQLPEKRALSTRLAFNDPSNLPLREKAM

>gi568815576f:26522584_26730484|GENSCAN_predicted_CDS_8|297_bp
atgccaaaagcaattgaaacaaaagcaaaatttgacaaatgggatctaattaagctaaag
agcttctgcaaagcaaaagaaactatcaacagagtaaacagacagccaacaggatgggag
aaaaattttgcaaactatgcatctgacaaagctcaggcctctggatcacccttcaccccc
actgccaactacaattggttagctctgagtccacagttgccagagaaaagggctctcagc
accagactagcattcaacgacccaagcaacctgcctctaagagaaaaggccatgtga

>gi568815576f:26522584_26730484|GENSCAN_predicted_peptide_9|31_aa
XTSPKMQKQLSSPLGAANKYKALTSPGLALC

>gi568815576f:26522584_26730484|GENSCAN_predicted_CDS_9|96_bp
nggacatctccaaagatgcagaagcagcttagttctccactaggtgcagccaacaagtac
aaggcactcacgtccccagggctggccctctgctag