GENSCAN 1.0    Date run: 3-Nov-116    Time: 04:10:01

Sequence gi568815576r:26499493_26716319 : 216827 bp : 47.51% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 Intr -    139     27  113  1  2   92  -57   167 0.224   1.78
 1.05 Intr -   2560   2408  153  1  0   78   87   140 0.779  13.17
 1.04 Intr -   4301   4174  128  0  2   72   78   115 0.998   9.30
 1.03 Intr -   6967   6811  157  1  1  135   67   112 0.999  13.38
 1.02 Intr -   7436   7283  154  1  1   92   18   207 0.998  14.17
 1.01 Init -  10780  10572  209  1  2   59   24   324 0.984  21.39
 1.00 Prom -  11352  11313   40                               -5.16

 2.00 Prom +  11586  11625   40                               -1.86
 2.01 Init +  19905  19992   88  2  1  114   78    32 0.894   5.61
 2.02 Term +  23103  23188   86  1  2   99   42    77 0.888   2.02
 2.03 PlyA +  23425  23430    6                                1.05

 3.07 PlyA -  24083  24078    6                                1.05
 3.06 Term -  26775  26728   48  0  0  101   42    50 0.354  -1.00
 3.05 Intr -  33253  33203   51  1  0  134   66    30 0.895   4.50
 3.04 Intr -  36994  36796  199  0  1  129  100   322 0.965  36.85
 3.03 Intr -  42226  41297  930  0  0  136   89  1715 0.328 166.73
 3.02 Intr -  45183  45112   72  2  0  111  107    26 0.268   5.32
 3.01 Init -  51149  51100   50  2  2   56   84    39 0.236   0.72
 3.00 Prom -  53903  53864   40                               -7.26

 4.00 Prom +  57588  57627   40                               -4.46
 4.01 Sngl +  61034  61669  636  1  0   76   38   836 0.940  73.59
 4.02 PlyA +  62822  62827    6                                1.05

 5.00 Prom +  66256  66295   40                               -7.16
 5.01 Init +  69132  69187   56  2  2  101  100     8 0.185   4.06
 5.02 Intr +  78826  78971  146  1  2   66   89   126 0.363  10.43
 5.03 Term +  79026  79102   77  1  2   94   43    10 0.195  -4.90
 5.04 PlyA +  79496  79501    6                                1.05

 6.07 PlyA -  81491  81486    6                                1.05
 6.06 Term -  89702  89690   13  1  1  136   54     8 0.219  -0.13
 6.05 Intr - 100181 100002  180  1  0  110   89   207 0.520  21.98
 6.04 Intr - 102529 102387  143  1  2  120   85   228 0.999  24.85
 6.03 Intr - 108529 108397  133  0  1   73  101   261 0.997  26.55
 6.02 Intr - 112698 112580  119  0  2   77   84   158 0.948  13.56
 6.01 Init - 116827 116648  180  1  0   85   86   186 0.998  15.55
 6.00 Prom - 118569 118530   40                               -1.16

 7.00 Prom + 120631 120670   40                               -6.86
 7.01 Init + 122474 122494   21  1  0   46  123    30 0.222   0.50
 7.02 Intr + 123742 123860  119  0  2   92   75   186 0.244  16.96
 7.03 Intr + 125989 126130  142  1  1  122   87   140 0.998  17.66
 7.04 Intr + 128796 128938  143  2  2  111   88   130 0.999  14.45
 7.05 Term + 130848 130995  148  0  1  115   47   121 0.951   8.07
 7.06 PlyA + 131155 131160    6                                1.05

 8.00 Prom + 142325 142364   40                               -6.36
 8.01 Sngl + 142693 142905  213  0  0   97   46   211 0.979  12.99
 8.02 PlyA + 143881 143886    6                               -1.95

 9.04 PlyA - 144241 144236    6                               -0.45
 9.03 Term - 145886 145561  326  0  2  -10   47   298 0.771  10.83
 9.02 Intr - 146939 146781  159  0  0   99   91    74 0.987   8.76
 9.01 Init - 147613 147556   58  1  1   84   86    27 0.944   3.76
 9.00 Prom - 148460 148421   40                               -6.36

10.04 PlyA - 152858 152853    6                                1.05
10.03 Term - 169946 169931   16  1  1  127   55    22 0.047   0.51
10.02 Intr - 175130 175001  130  0  1  100   66    74 0.242   6.15
10.01 Init - 206446 206296  151  1  1   64   20   114 0.290   2.40
10.00 Prom - 206524 206485   40                               -0.86

11.02 PlyA - 206773 206768    6                                1.05
11.01 Term - 214528 214434   95  2  2   65   50   116 0.840   3.49

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576r:26499493_26716319|GENSCAN_predicted_peptide_1|305_aa
MSLSHLYRDGEGRIDDDDDERENFEITDWDLQNEFNPNRQRHWQTKEEATYGVWAERDSD
DERPSFGGKRARDYSAPVNFISAGLKKGAAEEAELEDSDDEEKPVKQDDFPKDFGPRKLK
TGGNFKPSQKGFAGGTKSFMDFGSWERHTKGIGQKLLQKMGYVPGRGLGKNAQGIINPIE
AKQRKGKGAVGAYGSERTTQSMQDFPVVDSEEEAEEEFQKELSQWRKDPSGSKKKPKYSY
KTVEELKAKGRISKKLTAPQKELSQVKVIDMTGREQKVYYSYSQISHKHNVPDDGLPLQS
QQLPH

>gi568815576r:26499493_26716319|GENSCAN_predicted_CDS_1|915_bp
atgtcattgtcccacttataccgggatggggaaggccgcattgatgatgatgatgacgag
cgggagaactttgagatcactgactgggatctccagaatgagttcaaccccaaccgacag
cgccactggcagaccaaggaagaagccacctacggggtgtgggcagagcgagactcggat
gatgagaggcccagctttggaggcaaacgggcccgtgactactctgcgccagtcaacttc
atcagcgcagggctcaagaaaggggcagcggaggaggcagagttggaagattctgatgac
gaagagaaacctgttaagcaggacgactttcctaaggattttggaccaaggaagctaaaa
acgggtggcaattttaagcccagccagaaaggttttgcaggaggaaccaaatctttcatg
gacttcggcagctgggaaagacacacaaaaggaattggacagaagcttcttcagaagatg
ggctacgtccctggacggggcctcgggaagaatgcacaaggtatcattaacccaattgaa
gccaagcagagaaagggaaaaggtgctgtgggggcttatggatccgagcgcaccactcag
tccatgcaagacttccctgtggttgactcagaggaagaagctgaagaggagtttcagaag
gagctgagccagtggaggaaagacccaagtggaagcaagaagaagcccaaatactcttac
aagaccgtggaagagttgaaggccaagggcaggattagcaagaagctcactgctccccag
aaggaactttctcaagtcaaggtcatagacatgacaggccgggagcagaaggtctactac
agctacagtcagatcagccacaagcacaacgttcccgatgatgggctgccgctacagtcc
caacagctgccacan

>gi568815576r:26499493_26716319|GENSCAN_predicted_peptide_2|57_aa
MPSRNGMLIVNYPEQCGFRTQNEANFEDRETHSYSGIKRLYVQENVHCSFASSDENC

>gi568815576r:26499493_26716319|GENSCAN_predicted_CDS_2|174_bp
atgcccagcaggaatggaatgcttattgttaattatcctgagcagtgtgggtttagaaca
cagaatgaggccaactttgaggacagggagactcactcatacagtggcatcaagaggtta
tatgtgcaggagaacgtccactgcagctttgccagtagcgatgaaaactgctaa

>gi568815576r:26499493_26716319|GENSCAN_predicted_peptide_3|449_aa
MAEPEHVGTGPCSRATLALPADSLGTQAQGLEPRDESAQGRCEPGGLPANLSLKPQKRAL
RPGLPCLRPSMRLSVRRVLLAAGCALVLVLAVQLGQQVLECRAVLAGLRSPRGAMRPEQE
ELVMVGTNHVEYRYGKAMPLIFVGGVPRSGTTLMRAMLDAHPEVRCGEETRIIPRVLAMR
QAWSKSGREKLRLDEAGVTDEVLDAAMQAFILEVIAKHGEPARVLCNKDPFTLKSSVYLS
RLFPNSKFLLMVRDGRASVHSMITRKVTIAGFDLSSYRDCLTKWNKAIEVMYAQCMEVGK
EKCLPVYYEQLVLHPRRSLKLILDFLGIAWSDAVLHHEDLIGKPGGVSLSKIERSTDQVI
KPVNLEALSKWTGHIPGDVVRDMAQIAPMLAQLGYDPYANPPNYGNPDPFVINNTQRVLK
GDYKTPANLKGYFQISANDFIAKKRRKCI

>gi568815576r:26499493_26716319|GENSCAN_predicted_CDS_3|1350_bp
atggctgagcctgagcacgtgggcactgggccctgctccagggccactctggctctgcca
gcagacagccttggcacacaggcacaagggctggagcccagagatgagagtgcccaaggg
agatgtgagcctggcgggctgcccgctaacctgtcgctgaagccccagaagcgggccctc
aggccaggcctaccctgcctccggcccagcatgcgcctgtcggtgcggagggtgctgctg
gcagccggctgcgccctggtcctggtgctggcggttcagctgggacagcaggtgctagag
tgccgggcggtgctggcgggcctgcggagcccccggggggccatgcggcctgagcaggag
gagctggtgatggtgggcaccaaccacgtggaataccgctatggcaaggccatgccgctc
atcttcgtgggtggcgtgcctcgcagtggcaccacgttgatgcgcgccatgctggacgcg
caccccgaggtgcgctgcggcgaggagacccgcatcatcccgcgcgtgctggccatgcgc
caggcctggtccaagtctggccgtgagaagctgcggctggatgaggcgggggtgacggat
gaggtgctggacgccgccatgcaggccttcatcctggaggtgattgccaagcacggagag
ccggcccgcgtgctctgcaacaaggacccatttacgctcaagtcctcggtctacctgtcg
cgcctgttccccaactccaagttcctgctgatggtgcgggacggccgggcctccgtgcac
tccatgatcacgcgcaaagtcaccattgcgggctttgacctcagcagctaccgtgactgc
ctcaccaagtggaacaaggccatcgaggtgatgtacgcccagtgcatggaggtaggcaag
gagaagtgcctgcctgtgtactacgagcagctggtgctgcaccccaggcgctcactcaag
ctcatcctcgacttcctcggcatcgcctggagcgacgctgtcctccaccatgaagacctc
attggcaagcccggtggtgtctccctgtccaagatcgagcggtccacggaccaggtcatc
aagcctgttaacctggaagcgctctccaagtggactggccacatccctggggatgtggtg
cgggacatggcccagatcgcccccatgctggctcagctcggctatgacccttatgcaaac
ccccccaactatggcaaccctgaccccttcgtcatcaacaacacacagcgggtcttgaaa
ggggactataaaacaccagccaatctgaaaggatattttcagatctccgcaaatgacttc
attgccaagaagagaagaaaatgcatttaa

>gi568815576r:26499493_26716319|GENSCAN_predicted_peptide_4|211_aa
MSKGDPKKLRGKMSSHAFFGQTCREAHKKKHPDASVNLSEFSKKCSERWKTMSAKEKGKF
EDMAKADKAHYEREMKTYIPPKGETKKKFKDPNAPKRTPSAFFLFCSAYRPKIKGEHPGL
SIGDVAKKLGEMWNNTAADDKQPYEKKAAKLKEKYEKDIAAYRAKGKPDAAKKGVVKAEK
SKKKKEEEEDEEDEEDEEEEDEEDEDEEDDE

>gi568815576r:26499493_26716319|GENSCAN_predicted_CDS_4|636_bp
atgagcaaaggagatcctaagaagctgagaggcaaaatgtcatcacatgcattttttggg
caaacttgtcgggaggcgcataagaagaagcacccagatgcttcagtcaacctctcagag
ttttctaagaagtgctcagagaggtggaagaccatgtctgctaaagagaaaggaaaattt
gaagatatggcaaaggcggacaaggcccattacgaaagagaaatgaaaacctatatccct
cccaaaggggagacaaaaaagaagttcaaggatccgaatgcacccaagaggactccttcg
gccttcttcctgttctgctctgcgtatcgcccaaaaatcaaaggagaacatcctggcctg
tccattggtgatgttgcgaagaaactgggagagatgtggaataacactgccgcagatgac
aagcagccttatgaaaagaaggctgcgaagctgaaggaaaaatacgaaaaggatattgct
gcatatcgagctaaaggaaagcctgatgcagcaaaaaagggagttgtcaaggctgaaaaa
agcaagaaaaagaaggaagaggaggaagatgaggaagatgaagaggatgaggaggaggaa
gatgaagaagatgaagatgaagaagatgatgaataa

>gi568815576r:26499493_26716319|GENSCAN_predicted_peptide_5|92_aa
MTFTNAIAMPEVTLYGLKSSQRGFSDQQEQRLGTYTKCRLSGLKETTLNGNSGSGVQQPV
LKKPSIHGLCHGDYAWGPQGPQRRKILPRVPQ

>gi568815576r:26499493_26716319|GENSCAN_predicted_CDS_5|279_bp
atgacgtttacaaatgccatagcaatgccagaagttaccctatatggtctaaaaagctct
caacgtgggttctctgaccagcaggaacagcgcttaggaacctacacaaaatgcagatta
tcagggctcaaggagaccacgctgaatgggaactctggaagtggggtacagcaacctgtg
ttgaagaagccctccatccacggactgtgccatggggactatgcctggggaccccaaggt
cctcagagaaggaagattttgcccagagtcccgcagtag

>gi568815576r:26499493_26716319|GENSCAN_predicted_peptide_6|255_aa
MSQAAKASASATVAVNPGPDTKGKGAPPAGTSPSPGTTLAPTTVPITSAKAAELPPGNYR
LVVFELENFQGRRAEFSGECSNLADRGFDRVRSIIVSAGPWVAFEQSNFRGEMFILEKGE
YPRWNTWSSSYRSDRLMSFRPIKMDAQEHKISLFEGANFKGNTIEIQGDDAPSLWVYGFS
DRVGSVKVSSGTWVGYQYPGYRGYQYLLEPGDFRHWNEWGAFQPQMQSLRRLRDKQWHLE
GSFPVLATEPPKWGD

>gi568815576r:26499493_26716319|GENSCAN_predicted_CDS_6|768_bp
atgtctcaggctgcaaaggcctcggcctcggccacagtggcggtgaacccagggcctgac
accaaggggaagggggccccacctgcaggaacatcccctagtcccggcactaccctggcc
ccaacaaccgtgcctattaccagcgccaaggcggcggaactgcctcctgggaactacagg
ctggtggtcttcgaactggaaaacttccagggccgtcgagcagaattctcgggggagtgc
tcaaatctggcagaccgtggcttcgaccgtgtgcgcagcatcattgtctccgcgggaccc
tgggtcgcctttgagcagtccaacttccgcggggagatgttcatcctggagaagggcgag
taccctcgctggaacacatggtcgagcagctaccgcagtgatcggctcatgtccttccgg
cccatcaaaatggatgcccaggagcacaaaatctccctgtttgaaggggccaacttcaag
ggcaacaccatagagatccagggggacgacgcacccagtctctgggtctacggcttcagt
gaccgcgtgggcagcgtgaaggtctccagtggaacatgggttggctatcagtatcctggc
taccgcgggtaccagtacctcctagagcctggtgacttccggcactggaatgagtgggga
gccttccagccacagatgcagtccctgcgtcgcctgcgtgacaagcagtggcacctcgag
gggtccttccctgtcctggccacagagccccccaaatggggagactga

>gi568815576r:26499493_26716319|GENSCAN_predicted_peptide_7|190_aa
MFPGPISMVVWDEDGFQGRRHEFTAECPSVLELGFETVRSLKVLSGAWVGFEHAGFQGQQ
YILERGEYPSWDAWGGNTAYPAERLTSFRPAACANHRDSRLTIFEQENFLGKKGELSDDY
PSLQAMGWEGNEVGSFHVHSGAWVCSQFPGYRGFQYVLECDHHSGDYKHFREWGSHAPTF
QVQSIRRIQQ

>gi568815576r:26499493_26716319|GENSCAN_predicted_CDS_7|573_bp
atgttccctgggcctatctcgatggtggtgtgggatgaggacggcttccagggccggcgg
cacgagttcacggccgagtgccccagcgtgctggagcttggcttcgagactgtgcgatct
ttgaaagtgctgagtggagcgtgggtgggctttgagcatgctggcttccaagggcagcag
tacattctggaacgaggcgaatatccaagctgggatgcctggggcggcaacacggcctac
cccgccgagaggctcacctccttccggcctgcggcctgtgctaaccaccgtgactcgagg
ctgacaatcttcgagcaagagaacttcctgggcaagaaaggagagctgagcgatgactat
ccttccctccaggccatgggatgggaaggcaatgaagtagggtccttccacgtccactct
ggggcctgggtttgctcccagtttccgggctaccgaggatttcagtatgtgctggaatgc
gatcaccattccggtgactacaaacatttccgggagtggggctctcatgccccgaccttc
caggtgcagagcatccgcaggatccagcagtga

>gi568815576r:26499493_26716319|GENSCAN_predicted_peptide_8|70_aa
MDEPAGHYAIMLSDIKTWFLLLPLYLSSHYCSTEIPKSKAVTKTSVGVIREKGTQPPSVK
GGEKLKILHI

>gi568815576r:26499493_26716319|GENSCAN_predicted_CDS_8|213_bp
atggatgaacctgcaggacattatgctattatgctaagtgacatcaagacctggttcctc
ctcctgcccctctacctgagctctcactactgctccactgagatccccaagagcaaagct
gtgacaaagacttcagtgggagtcatcagagaaaaaggaacacaaccacctagcgtgaaa
gggggtgagaagctgaagatcctacacatctag

>gi568815576r:26499493_26716319|GENSCAN_predicted_peptide_9|180_aa
MLTVSSQGLQGPQLLMPSKACPPPHSHATAVLQNSTFLIYEMHLGSLQISCECHNACRRE
AENSGANIFQKAGGTSTKKLISWVEEDGCRPGVVLKGRDADSGHFLAEGQAPRGLPGTPG
VSSSPEAGDGQIRLTDSCVQRLLEITKGSEFLRLQVEGGRCSGFQYRCSLDTVINPDDGI

>gi568815576r:26499493_26716319|GENSCAN_predicted_CDS_9|543_bp
atgctgacggtgtcatcccagggtctccagggaccccagctgcttatgccttctaaagcc
tgcccacccccacacagccacgccacagccgtcctgcagaacagtacattccttatttat
gagatgcatctgggctctctgcaaatcagctgtgaatgccacaatgcctgcaggagggag
gcagagaattctggagccaacatcttccagaaggcaggtggaacctcaacaaagaaattg
atctcctgggtggaggaagatggttgccgcccaggggttgtccttaagggccgcgacgca
gacagcggtcacttcctggccgaggggcaggctcctcgcggcctccctgggactccgggc
gtgtcgtccagccccgaggccggcgacgggcagatccgcctcacggacagctgcgtccag
aggcttctggaaatcaccaaaggatcagaattcctcaggctgcaggtggagggaggtaga
tgctccggattccagtacagatgttcactggatacagttatcaaccccgacgacggtatt
tga

>gi568815576r:26499493_26716319|GENSCAN_predicted_peptide_10|98_aa
MPKAIETKAKFDKWDLIKLKSFCKAKETINRVNRQPTGWEKNFANYASDKAQASGSPFTP
TANYNWLALSPQLPEKRALSTRLAFNDPSNLPLREKAM

>gi568815576r:26499493_26716319|GENSCAN_predicted_CDS_10|297_bp
atgccaaaagcaattgaaacaaaagcaaaatttgacaaatgggatctaattaagctaaag
agcttctgcaaagcaaaagaaactatcaacagagtaaacagacagccaacaggatgggag
aaaaattttgcaaactatgcatctgacaaagctcaggcctctggatcacccttcaccccc
actgccaactacaattggttagctctgagtccacagttgccagagaaaagggctctcagc
accagactagcattcaacgacccaagcaacctgcctctaagagaaaaggccatgtga

>gi568815576r:26499493_26716319|GENSCAN_predicted_peptide_11|31_aa
XTSPKMQKQLSSPLGAANKYKALTSPGLALC

>gi568815576r:26499493_26716319|GENSCAN_predicted_CDS_11|96_bp
nggacatctccaaagatgcagaagcagcttagttctccactaggtgcagccaacaagtac
aaggcactcacgtccccagggctggccctctgctag