GENSCAN 1.0 Date run: 5-Nov-116 Time: 02:52:25

Sequence gi568815596r:121627727_121836850 : 209124 bp : 44.14% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 PlyA -     67     62    6                               1.05
 1.05 Term -  15646  15598   49  0  1   81   34    84 0.064  -0.92
 1.04 Intr -  21992  21832  161  2  2   48   95    45 0.034  -0.01
 1.03 Intr -  34346  34225  122  0  2   66  111    26 0.554   2.91
 1.02 Intr -  42093  41960  134  2  2   83   89    46 0.021   4.49
 1.01 Init -  58719  58571  149  0  2   62   91   126 0.866   8.72
 1.00 Prom -  63506  63467   40                              -3.66

 2.03 PlyA -  63963  63958    6                               1.05
 2.02 Term -  67350  67238  113  1  2   92   45    14 0.876  -3.78
 2.01 Init -  67549  67399  151  1  1   44   90   152 0.580   9.21
 2.00 Prom -  67823  67784   40                              -4.56

 3.02 PlyA -  67973  67968    6                               1.05
 3.01 Sngl -  81338  80856  483  2  0   64   41   520 0.193  40.98
 3.00 Prom -  93802  93763   40                              -2.36

 4.09 PlyA -  95564  95559    6                               1.05
 4.08 Term - 100186  99998  189  1  0   56   34   147 0.873   3.55
 4.07 Intr - 100630 100562   69  1  0   45   98    76 0.926   3.68
 4.06 Intr - 103378 103167  212  2  2   73   27   121 0.981   3.13
 4.05 Intr - 104478 104370  109  0  1   97   84   157 0.989  16.06
 4.04 Intr - 108024 107887  138  0  0   80   90     4 0.555   0.46
 4.03 Intr - 109073 109020   54  2  0  107   99   102 0.830  12.38
 4.02 Intr - 116027 115729  299  0  2   22   34   189 0.032   3.49
 4.01 Init - 128055 127728  328  0  1  103   18   195 0.230  11.49
 4.00 Prom - 128387 128348   40                              -7.76

 5.00 Prom + 129015 129054   40                              -3.66
 5.01 Init + 129690 129708   19  2  1   72  101    19 0.363   0.83
 5.02 Intr + 130984 131080   97  2  1   52   72    44 0.414  -1.73
 5.03 Intr + 133683 133798  116  0  2   62   92    81 0.859   5.99
 5.04 Intr + 135279 135358   80  1  2   88  107    58 0.999   6.97
 5.05 Term + 137408 137641  234  1  0   61   49   389 0.999  28.62
 5.06 PlyA + 138746 138751    6                               1.05

 6.03 PlyA - 142137 142132    6                               1.05
 6.02 Term - 144974 144825  150  2  0   92   38    66 0.406  -0.09
 6.01 Init - 154580 154440  141  2  0   34   91    88 0.124   3.83
 6.00 Prom - 159299 159260   40                              -0.76

 7.00 Prom + 159807 159846   40                              -4.46
 7.01 Init + 162296 162352   57  1  0   73   64    43 0.596   1.71
 7.02 Intr + 168588 168707  120  2  0  117   36    39 0.181   2.29
 7.03 Intr + 168931 168994   64  0  1   71  106     0 0.193  -1.61
 7.04 Intr + 170851 170979  129  2  0  120   70    47 0.159   6.77
 7.05 Intr + 189178 189268   91  2  1   39   92    49 0.017  -0.55
 7.06 Term + 192211 192238   28  1  1  102   52    49 0.033   0.25
 7.07 PlyA + 195193 195198    6                               1.05

 8.03 PlyA - 195247 195242    6                               1.05
 8.02 Term - 198450 198250  201  0  0   58   47   100 0.505   0.29
 8.01 Init - 204418 204104  315  1  0   62   55   113 0.294   2.76

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  42164  41960  205  2  1   99   89    56 0.816   6.08
S.002 Term +  62892  62960   69  2  0  123   54     9 0.935  -1.06

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:121627727_121836850|GENSCAN_predicted_peptide_1|204_aa
MRILRVLPGRHLPQWNALPEQRVAGPGGGSTQWLNTGKEQALGVRTSETCSSLSPPISAR
QSLRPVKIITQFSWKLISPCDPAQILPTVFLNGLVTATTVFKDEHSSSITPLHMGKPITV
NQTSVFSPTVGGTYQGQLTFSVRGKAPGDFTPAKPNTARPAPARGVTIERQPPLWSRFTL
SMVYPGWWRYIILAYIEVVAFEVF
>gi568815596r:121627727_121836850|GENSCAN_predicted_CDS_1|615_bp
atgcggatcctgagagtgctcccgggtaggcatttgccccagtggaacgccttgccagag
cagcgcgtggcaggccccggtggaggatcaacgcagtggctgaacaccgggaaggaacag
gcacttggagtccggacatctgaaacttgcagttcactttcacccccgatttctgctcga
cagagtttgcgcccagtcaaaattattacacagttcagttggaagcttatttcaccctgc
gaccctgcccaaattctgccaactgtcttcctcaatggcctcgtaactgctactacagtg
tttaaagacgagcatagcagcagcataacacctttgcacatggggaaacccataacagtg
aaccaaacatctgtattttctcctacagtgggtggtacttaccagggtcaactaacgttc
tccgtgaggggcaaagctcccggggactttacgccagcgaagcctaacacggccaggcca
gcgcctgcgcggggcgttaccatagagaggcagccgcctctctggtctcggtttacgctc
tctatggtgtacccgggttggtggcgctacatcatcttggcttatattgaagttgtggct
tttgaagtcttctag

>gi568815596r:121627727_121836850|GENSCAN_predicted_peptide_2|87_aa
MLTLVPLTFPQACLSPLVSLGLDCPLPLEMFIIKSFTDQNHSSYLQCRKEGIFSQVVTEP
GKKPFAMTSLYPTSTRVPGLFLALGDP
>gi568815596r:121627727_121836850|GENSCAN_predicted_CDS_2|264_bp
atgctgacattggtgcctctcaccttccctcaggcctgcctctctcccctagtgagcctc
ggcctagactgcccacttccactggagatgttcatcatcaagtcatttacagatcaaaac
cacagttcttacctgcagtgccggaaggagggaatattctcccaggttgtgacagagccc
ggaaaaaagccatttgcgatgacctcactgtacccgacttctaccagagtccctggcctt
tttctggcccttggtgatccctga

>gi568815596r:121627727_121836850|GENSCAN_predicted_peptide_3|160_aa
MNYKGSPIKVTLATLKMSVQPKDSLGGFEITPPVVLRLKCGSGPVHISGQHLVAVEEDAE
SEDEEEEDVKLLSVSGKRSARGGGSKIPQKKVKVAADEDEDDDDEDFDDGEAEEKAPVKK
SIRDTPAKNAQKSNQNGKDSKPSTPRSKGQDSFKKTGKNF
>gi568815596r:121627727_121836850|GENSCAN_predicted_CDS_3|483_bp
atgaattacaaaggcagtccaattaaagtaacactggcaactttgaaaatgtctgtacag
ccaaaggattcccttggaggctttgaaatcacaccaccagtggtcttacggttgaagtgt
ggttcagggccagtgcatattagtggacagcacttagtagctgtggaggaagatgcagag
tcagaagatgaagaggaggaggatgtgaaactcttaagtgtatctggaaagcggtcggcc
cgtggaggtggtagcaagattccacagaaaaaagtaaaagttgctgctgatgaagatgaa
gatgatgatgatgaggattttgatgatggggaagctgaagaaaaagcgccggtgaagaaa
tccatacgagatactccagccaaaaatgcacaaaagtcaaatcagaatggaaaagactca
aaaccgtcaacaccaagatcaaaaggacaagactccttcaaaaaaacaggaaaaaacttc
taa

>gi568815596r:121627727_121836850|GENSCAN_predicted_peptide_4|465_aa
MAPAAQQSASVARAAKEVDAAGNQRNQSQRPPLPPRPPGLTPRPSLGSSEARGAPEDGLS
AALPEAPSAAFGTEARVQRAEERESGRGLSLRPSDRHRQKQPAKAALGAPWATRVKVRLK
KRKKKKKKQRKGKERKAKGKGKEHAKDRKYSQVLDNGLDNKFHEDLEQLKEIQAHRGLCH
FWGLCVQSQHAKTTGSWGNTVVCPGRNKSEDVEFQKEVAQVRKRITQRKKQEQLTPGVVY
VRHLPNLLDETQIFSYFSQFGTVTRFRLSRSKRTGNSKGYAFVEFESEDVAKIVAETMNN
YLFGERLLECHFMPPEKVHKELFKDWNIPFKQPSYPSVKRYNRNRTLTQKLRMEERFKKK
ERLLRKKLAKKGIDYDFPSLVLRKKKKKVSGTLDTPEKTVDSQGPTPVCTPTFLERRKSQ
VAELNDDDKDDEIVFKQPISCVKEEIQETQTPTHSRKKRRRSSNQ
>gi568815596r:121627727_121836850|GENSCAN_predicted_CDS_4|1398_bp
atggcgccggctgcacaacaatcagccagtgtagcaagggcggccaaggaagtggacgcc
gcaggcaaccagcgcaatcaatcgcaacggccgccgctaccgccgcgtccccccggtctt
acgccacgaccgtcgctagggtcctccgaggctcgcggcgcgccggaggacgggctgtcc
gccgcgctgcctgaggcgccgtccgctgcgttcggaaccgaggcgcgcgtccaaagggca
gaggagagggagagtgggcggggcctcagtctgaggccttcagacaggcatcgccagaaa
caacctgcgaaggcggccctcggagcaccctgggcaacaagagtgaaagtccgtctcaaa
aaaaggaaaaagaaaaagaaaaaacaaaggaaaggaaaagaaaggaaagcaaagggaaaa
ggaaaggagcatgcaaaggataggaaatatagccaggtcctagacaatggtctggacaac
aagttccacgaggacctggagcaactaaaggagattcaggcccacagagggctgtgccac
ttctggggcctttgtgtccagagccagcacgccaagaccactggcagctggggcaatact
gtggtctgtccaggaagaaataagtctgaagatgtcgagtttcaaaaggaggtggcgcag
gttcgcaagcgcataacccagcgaaaaaaacaagaacaacttactcctggagtagtctat
gtgcgccacctacctaacctacttgacgaaacccagatcttttcatatttctcccagttt
ggcactgtgacacggttcaggctgtccagaagtaaaaggactggaaatagcaaaggctat
gcatttgtggagtttgagtctgaggatgttgccaaaatagttgctgaaacaatgaacaac
tacctgtttggtgaaagactcttggagtgtcattttatgccacctgaaaaagtacataaa
gaactctttaaagactggaatattccatttaagcagccatcatatccatcagtgaaacgg
tataatcggaatcggacactaacacaaaagctacggatggaggagcgatttaaaaagaaa
gaaagattactcaggaagaaattagctaaaaaaggaattgactatgattttccttctttg
gttttacgtaagaagaagaaaaaagtttcaggtactcttgacactcctgagaagactgtg
gatagccagggccccacaccagtttgtacaccaacatttttggagaggcgaaaatctcaa
gtggctgaactgaatgatgatgataaagatgatgaaatagttttcaaacagcccatatcc
tgtgtaaaagaagaaatacaagagactcaaacacctacacattcacggaaaaaaagacga
agaagcagcaatcagtga

>gi568815596r:121627727_121836850|GENSCAN_predicted_peptide_5|181_aa
MAIMLYVPKRCLKAREHFGTVKTHLTSLKTKFPAEQYYRFHEHWRFVLQRLVFLAAFVVY
LETETLVTREAVTEILGIEPDREKGFHLDVEDYLSGVLILASELSRLSVNSVTAGDYSRP
LHISTFINELDSGFRLLNLKNDSLRKRYDGLKYDVKKVEEVVYDLSIRGFNKETAAACVE
K
>gi568815596r:121627727_121836850|GENSCAN_predicted_CDS_5|546_bp
atggctatcatgctttatgttccaaagaggtgtttgaaagctcgagaacattttggtaca
gtaaaaacacatctaacatctttgaagaccaaatttcctgctgaacagtattacagattt
catgagcactggaggtttgtgttgcagcgcttggtcttcttggcagcatttgttgtgtat
ttggaaacagaaacactagtgactcgagaagcagttacagaaattcttggcattgagcca
gatcgggagaaaggatttcatctggatgtagaagattatctctcaggagttctaattctt
gccagtgaactgtcgaggctgtctgtcaacagcgtgactgctggagactactcccgaccc
ctccacatctccaccttcatcaatgagctggattccggttttcgccttctcaacctgaaa
aatgactccctgaggaagcgctacgacggattgaaatatgacgtgaagaaagtagaggaa
gtggtctatgatctctccatccggggctttaataaggagacggcagcagcttgtgttgaa
aaatag

>gi568815596r:121627727_121836850|GENSCAN_predicted_peptide_6|96_aa
MIHRTLGTREKDRGLVRDKRLQIGHSARSSGRGYTKILEITTKELIQKVKANILPFVLPN
CEKLSGLTSIVRVSKGFLKGSSDSRDVAEEAEGGSF
>gi568815596r:121627727_121836850|GENSCAN_predicted_CDS_6|291_bp
atgatacatcggactttgggaactcgggagaaagatcgggggctggtgagggataaaaga
ctacaaattggccacagtgcacgctcctcaggtcgtgggtacaccaaaatcttagaaatc
accactaaagaacttattcagaaagtaaaagcaaacatcctgccctttgtacttcccaac
tgtgaaaaattatctggcttaacttccattgtcagggtgagcaaagggttcctcaaagga
agctctgacagcagggacgtggcagaggaagctgaagggggatctttctga

>gi568815596r:121627727_121836850|GENSCAN_predicted_peptide_7|162_aa
MERRPEGKLESVNSDDAQIVFKSPTPVRHFCGFPGSLTLLLQAAGFDSEQFLCSHKDSYI
SPLCMSVSLLPLLMKTPVMLELMVAMCPLIGVFLALNVLFSASRNVQTFIQQPVWANESG
MVPGHFCLRGGRPIITQLALPPQVQGATSPDGCGSPAVFLRA
>gi568815596r:121627727_121836850|GENSCAN_predicted_CDS_7|489_bp
atggaacgaaggccagaaggaaaactcgaaagtgttaacagtgatgacgctcagatagtc
ttcaaaagcccaacacctgttcgtcatttctgtggcttcccagggagtcttactcttctg
cttcaagctgctgggtttgactctgagcagtttctgtgcagccacaaagattcttatatc
tctcctctgtgcatgtcagtgtccttgcttcctcttcttatgaagacaccggtcatgttg
gagctcatggttgcaatgtgcccgcttatcggagtcttcctggccttgaatgtcctcttc
tcagcatccaggaatgtccagacattcatccaacaaccagtctgggctaatgagtcaggc
atggtgcctggacacttctgcctgcgtgggggaaggcccatcatcacccagctggcactc
ccgccccaggtgcagggagccaccagcccggatggctgtggttcaccagccgtgttcctg
cgggcctga

>gi568815596r:121627727_121836850|GENSCAN_predicted_peptide_8|171_aa
MGLSPHLSDDNISGLGWRLRSQPVTPIHPLSVEPAGKQDPAPRSARDSEGVVSALLSAAS
HQAPAQESSSIRPSGGSRFPEADSEMRICEQVNYEGVLLGKTCQGRAAPAQSLGNTCGSR
KEVSGPVLEVERGDRGFKMSQLVPLGQNPEPSFGAAGFLVPVSRVIKSEIT
>gi568815596r:121627727_121836850|GENSCAN_predicted_CDS_8|516_bp
atgggcctctctccacatctgtctgatgacaacatctctggcttgggctggagactgagg
tcccagccagtcacacccatccacccactctccgtggagcctgcaggcaagcaggatcca
gctcctcggtctgccagggactcggagggagtggtctcggccctgctctcagctgcctcc
catcaagctccagcgcaagagtccagcagcatcaggccgtctggtgggtcacgtttccca
gaagcagactctgagatgagaatttgtgaacaagtgaattatgaaggcgtgctcctgggg
aaaacctgtcagggacgtgcagctcctgctcagagtctgggaaatacttgtggctctaga
aaagaagtcagtggccctgtgctagaagtggagagaggggacagaggcttcaagatgagc
cagcttgtgcccctgggacagaatcctgaaccaagcttcggagctgccggattcctggtc
cctgtttcccgcgtgattaaatctgaaattacttga