GENSCAN 1.0	Date run: 4-Nov-116	Time: 21:03:16

Sequence gi568815586r:54141758_54357650 : 215893 bp : 47.76% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    950   1064  115  1  1   56  106    38 0.617   2.78
 1.02 Intr +   2225   2327  103  0  1  117   59    40 0.611   3.23
 1.03 Intr +  21306  21451  146  0  2   59   78   111 0.648   7.13
 1.04 Intr +  23242  23264   23  2  2  111  115    12 0.656   3.46
 1.05 Intr +  30778  30799   22  0  1  129   75    -3 0.155  -0.38
 1.06 Intr +  32295  32413  119  1  2   46  110    50 0.211   3.18
 1.07 Term +  33688  33780   93  0  0  110   43    33 0.128  -1.17
 1.08 PlyA +  34063  34068    6                               1.05

 2.03 PlyA -  34076  34071    6                               1.05
 2.02 Term -  40866  40339  528  0  0  138   48   269 0.977  22.15
 2.01 Init -  42132  41899  234  0  0   82   55   388 0.540  33.04
 2.00 Prom -  54210  54171   40                              -4.36

 3.00 Prom +  55047  55086   40                              -5.46
 3.01 Init +  58646  58665   20  1  2   91   47    11 0.242  -3.13
 3.02 Intr +  58703  58884  182  2  2   77   50   130 0.512   7.61
 3.03 Intr +  64478  64542   65  0  2   93   47    39 0.166  -1.26
 3.04 Intr +  68555  68604   50  1  2  102  110     5 0.494   1.58
 3.05 Intr +  69775  69946  172  1  1   96   98    47 0.870   6.45
 3.06 Term +  76809  76880   72  2  0   90   55    49 0.322  -0.29
 3.07 PlyA +  77212  77217    6                               1.05

 4.07 PlyA -  79800  79795    6                               1.05
 4.06 Term - 100148  99998  151  1  1  126   43   134 0.996  10.08
 4.05 Intr - 104458 104358  101  1  2   81   81    42 0.758   1.71
 4.04 Intr - 110470 110284  187  0  1  112   99    80 0.999  11.19
 4.03 Intr - 115935 115757  179  0  2  121   80   175 0.994  18.72
 4.02 Intr - 117916 117752  165  1  0  117   94   -12 0.767   2.46
 4.01 Init - 122397 122347   51  0  0   57   23    67 0.339  -1.74
 4.00 Prom - 126483 126444   40                              -8.86

 5.00 Prom + 128478 128517   40                              -5.26
 5.01 Init + 139051 139065   15  0  0   83  105    -2 0.834   1.28
 5.02 Intr + 139629 139745  117  2  0   90   83   110 0.999  11.36
 5.03 Intr + 140038 140184  147  0  0   48   81   123 0.990   8.03
 5.04 Intr + 140333 140543  211  1  1   98   89   231 0.991  22.69
 5.05 Intr + 140637 140729   93  1  0   85   69    47 0.844   2.44
 5.06 Intr + 140816 140908   93  0  0   61   68    91 0.947   4.34
 5.07 Intr + 141043 141117   75  2  0  104  115    85 0.999  12.29
 5.08 Intr + 141322 141477  156  2  0   52   66   152 0.942   9.38
 5.09 Intr + 142055 142210  156  0  0   75   86    94 0.982   7.88
 5.10 Term + 145196 145209   14  0  2  113   49     9 0.649  -2.14
 5.11 PlyA + 145311 145316    6                               1.05

 6.04 PlyA - 146342 146337    6                               1.05
 6.03 Term - 151624 150617 1008  1  0  122   49  1028 0.999  94.40
 6.02 Intr - 153470 153378   93  2  0   76  109    66 0.681   7.66
 6.01 Init - 155262 155179   84  0  0   68   98    60 0.517   5.82
 6.00 Prom - 157659 157620   40                              -9.16

 7.00 Prom + 161942 161981   40                              -2.06
 7.01 Init + 182048 182092   45  1  0   64   89    21 0.416  -1.46
 7.02 Intr + 183326 183424   99  1  0   54   54   126 0.623   6.11
 7.03 Intr + 198790 198858   69  0  0   90  100    84 0.701   9.18
 7.04 Intr + 200449 200530   82  0  1   71   48   162 0.954   9.71
 7.05 Intr + 201468 201559   92  1  2  106   78    81 0.994   8.61
 7.06 Intr + 203703 203758   56  2  2  106   83    28 0.913   1.88
 7.07 Intr + 206010 206087   78  0  0   41  110   100 0.928   6.17
 7.08 Intr + 206243 206294   52  2  1  117   98    19 0.952   4.71
 7.09 Intr + 207863 207901   39  1  0   99   80    44 0.838   3.12
 7.10 Term + 208719 208766   48  2  0  120   49    44 0.849   1.00
 7.11 PlyA + 210070 210075    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:54141758_54357650|GENSCAN_predicted_peptide_1|206_aa
MWMLSNRGQTNKGISRPCRTQGSSPRVDPKGFSSAASLGLLEPDQNSLLTRDRQTDLFPV
PQLPVVLRMETLRHRLGQVAYQILRAVSAGGRLYTAAGSAHRNGLEDGPQELLKAPGLVT
RDDKLRPTMVPGFRSSGEHTTLPLKTVLAKQRYQTQEQNDANSVIHRGHTDLPYSQPLAV
GAEKFLDISDTMVPFCKYPLHGGGKI

>gi568815586r:54141758_54357650|GENSCAN_predicted_CDS_1|621_bp
atgtggatgttgagcaatagaggacagacaaacaaaggaatctccagaccctgcaggacc
caaggcagcagcccacgagtggatcctaaaggattttcatcagctgcaagtctggggtta
ctagagcctgaccaaaattctctactgaccagagacagacagacagacctgttccctgtt
ccccaacttccagtggtgctgcgcatggagacgctgaggcaccgattagggcaagtggcc
taccagatcttgagggctgtcagtgctggtggcagactttacaccgcagcaggcagtgcc
cacagaaatgggcttgaggatggaccacaagaactgctcaaagcacctggactagtaaca
agagatgacaagctgaggcctacgatggtacctggcttcaggagctcaggggaacacact
accctaccccttaaaacggtacttgcaaagcagcgctaccaaacacaggaacaaaatgat
gctaactcagtgattcacaggggacacacagaccttccatacagccagccccttgcagtt
ggggctgagaaattcctggacatatctgacacaatggtcccgttttgcaaatatccactc
catggaggtggtaagatttag

>gi568815586r:54141758_54357650|GENSCAN_predicted_peptide_2|253_aa
MEPQPCPGSLAESFLEEELRLNAELSQLQFSEPVGIIYNPVEYAWEPHRNYVTRYCQGPK
EVLFLGMNPGPFGMAQTGVPFGEVSMVRDWLGIVGPVLTPPQEHPKRPVLGLECPQSEVS
GARFWGFFRNLCGQPEVFFHHCFVHNLCPLLFLAPSGRNLTPAELPAKQREQLLGICDAA
LCRQVQLLGVRLVVGVGRLAEQRARRALAGLMPEVQVEGLLHPSPRNPQANKGWEAVAKE
RLNELGLLPLLLK

>gi568815586r:54141758_54357650|GENSCAN_predicted_CDS_2|762_bp
atggagccccagccctgccctggaagcttggctgagagcttcctggaggaggagcttcgg
ctcaatgctgagctgagccagctgcagttttcggagcctgtgggcatcatctacaatccc
gtggagtatgcatgggagccacatcgcaactacgtgactcgctactgccagggccccaag
gaagtactcttcctgggcatgaaccctggaccttttggcatggcccagactggggtgccc
tttggggaagtaagcatggtccgggactggttgggcattgtggggcctgtgctgacccct
ccccaagagcatcctaaacgaccagtgctgggactggagtgcccacagtcagaagtgagt
ggtgcccgattctggggctttttccggaacctctgtggacagcctgaggtcttcttccat
cactgttttgtccacaatctatgccctctgcttttcctggctcccagcgggcgcaacctt
actcctgctgagctgcctgccaagcagcgagaacagcttcttgggatctgtgatgcagcc
ctctgccggcaggtgcagctgctgggggtgcggctggtggtgggagttgggcgactggca
gagcagcgggcacgacgggctctggcaggcctgatgccagaggtccaggtggaagggctc
ctgcatccctctccccgtaacccacaggccaacaagggctgggaggcagtggccaaggaa
agattgaatgagctggggctgctgccactgctgttgaaatga

>gi568815586r:54141758_54357650|GENSCAN_predicted_peptide_3|186_aa
MRLQQVRVPSALKTEEANDISCPTWKVKVQKTEGTRLLHILRSVISCRKKHPTCRKKHST
RMTQSLPSLLLKKARLRSWEGRIELEKGICPSGAYSEVVLGAFIRGAVDRVFGNSSRRKI
LKLEPTPNLRKEKVLSRMGSVEKKARKGIPPIPARQKHKVNSKPYLLHHGQTQAHNSQRF
GYTKLL

>gi568815586r:54141758_54357650|GENSCAN_predicted_CDS_3|561_bp
atgcggctccaacaggtaagggtcccctcagcattgaagacagaagaggcaaatgacatt
tcatgtcccacttggaaagtcaaggtgcagaaaacagaggggaccagactccttcatatc
ctgcgcagcgtcatctcctgccgcaagaagcaccccacctgccgcaagaagcactccacc
cgcatgactcagtctttgcccagtctcctgctgaagaaagccaggctcaggtcatgggaa
ggtcggattgagctggagaaagggatatgcccatcaggtgcttactcagaggtggtgctg
ggtgctttcataagaggggcagtggacagagtctttggaaatagctcgaggagaaagatt
ctgaagctggagcccactccaaatctcaggaaagagaaggtcctgagtaggatggggagt
gtggaaaagaaggcccggaagggaataccacccatcccagcaagacaaaaacataaagtc
aattcaaagccttatcttctgcatcatggccagactcaggcacacaacagccagcgcttc
ggttacacaaaactgctctga

>gi568815586r:54141758_54357650|GENSCAN_predicted_peptide_4|277_aa
MRVEPELDKADRYQGPYCSTAASCYGNGLGQLRSGAQAAGWGQGSCQPAKTQTWEGRPLL
LAQRERKNWGDPGPGGLSLSGGTVCDMGKKTKRTADSSSSEDEEEYVVEKVLDRRVVKGQ
VEYLLKWKGFSEEHNTWEPEKNLDCPELISEFMKKYKKMKEGENNKPREKSESNKRKSNF
SNSADDIKSKKKREQSNDIARGFERGLEPEKIIGATDSCGDLMFLMKWKDTDEADLVLAK
EANVKCPQIVIAFYEERLTWHAYPEDAENKEKETAKS

>gi568815586r:54141758_54357650|GENSCAN_predicted_CDS_4|834_bp
atgagagtagaacctgaactggacaaagcagacagataccagggaccctattgtagtact
gctgccagttgctatggcaacggactgggacagctgcgcagtggcgcccaggcagctggg
tgggggcaggggagctgccagcccgccaagacccagacctgggaggggaggcccctcctg
ttagcccagagggaaagaaagaactggggcgatccgggacctggtggccttagtctttca
ggtggaacggtgtgcgacatgggaaagaaaaccaagcggacagctgacagttcttcttca
gaggatgaggaggagtatgttgtggagaaggtgctagacaggcgcgtggttaagggacaa
gtggaatatctactgaagtggaaaggcttttctgaggagcacaatacttgggaacctgag
aaaaacttggattgccctgagctaatttctgaatttatgaaaaagtataagaagatgaag
gagggtgaaaataataaacccagggagaagtcagaaagtaacaagaggaaatccaatttc
tcaaacagtgccgatgacatcaaatctaaaaaaaagagagagcagagcaatgatatcgct
cggggctttgagagaggactggaaccagaaaagatcattggggcaacagattcctgtggt
gatttaatgttcctaatgaaatggaaagacacagatgaagctgacctggttcttgcaaaa
gaagctaatgtgaaatgtccacaaattgtgatagcattttatgaagagagactgacatgg
catgcatatcctgaggatgcggaaaacaaagagaaagaaacagcaaagagctaa

>gi568815586r:54141758_54357650|GENSCAN_predicted_peptide_5|358_aa
MSKSESPKEPEQLRKLFIGGLSFETTDESLRSHFEQWGTLTDCVVMRDPNTKRSRGFGFV
TYATVEEVDAAMNARPHKVDGRVVEPKRAVSREDSQRPGAHLTVKKIFVGGIKEDTEEHH
LRDYFEQYGKIEVIEIMTDRGSGKKRGFAFVTFDDHDSVDKIVIQKYHTVNGHNCEVRKA
LSKQEMASASSSQRGRSGSGNFGGGRGGGFGGNDNFGRGGNFSGRGGFGGSRGGGGYGGS
GDGYNGFGNDGGYGGGGPGYSGGSRGYGSGGQGYGNQGSGYGGSGSYDSYNNGGGGGFGG
GSGSNFGGGGSYNDFGNYNNQSSNFGPMKGGNFGGRSSGPYGGGGQYFAKPRNQGTSL

>gi568815586r:54141758_54357650|GENSCAN_predicted_CDS_5|1077_bp
atgtctaagtcagagtctcctaaagagcccgaacagctgaggaagctcttcattggaggg
ttgagctttgaaacaactgatgagagcctgaggagccattttgagcaatggggaacgctc
acggactgtgtggtaatgagagatccaaacaccaagcgctccaggggctttgggtttgtc
acatatgccactgtggaggaggtggatgcagctatgaatgcaaggccacacaaggtggat
ggaagagttgtggaaccaaagagagctgtctccagagaagattctcaaagaccaggtgcc
cacttaactgtgaaaaagatatttgttggtggcattaaagaagacactgaagaacatcac
ctaagagattattttgaacagtatggaaaaattgaagtgattgaaatcatgactgaccga
ggcagtggcaagaaaaggggctttgcctttgtaacctttgacgaccatgactccgtggat
aagattgtcattcagaaataccatactgtgaatggccacaactgtgaagttagaaaagcc
ctgtcaaagcaagagatggctagtgcttcatccagccaaagaggtcgaagtggttctgga
aactttggtggtggtcgtggaggtggtttcggtgggaatgacaacttcggtcgtggagga
aacttcagtggtcgtggtggctttggtggcagccgtggtggtggtggatatggtggcagt
ggggatggctataatggatttggtaatgatggtggttatggaggaggcggccctggttac
tctggaggaagcagaggctatggaagtggtggacagggttatggaaaccagggcagtggc
tatggcgggagtggcagctatgacagctataacaacggaggcggaggcggctttggcggt
ggtagtggaagcaattttggaggtggtggaagctacaatgattttgggaattacaacaat
cagtcttcaaattttggacccatgaagggaggaaattttggaggcagaagctctggcccc
tatggcggtggaggccaatactttgcaaaaccacgaaaccaaggaacgtccttgtga

>gi568815586r:54141758_54357650|GENSCAN_predicted_peptide_6|394_aa
MSDRRDQEDLAKSTWNLLLTAGKTDARKQSRNRVIQLSTSELGEMELTWQEIMSITELQG
LNAPSEPSFEPQAPAPYLGPPPPTTYCPCSIHPDSGFPLPPPPYELPASTSHVPDPPYSY
GNMAIPVSKPLSLSGLLSEPLQDPLALLDIGLPAGPPKPQEDPESDSGLSLNYSDAESLE
LEGTEAGRRRSEYVEMYPVEYPYSLMPNSLAHSNYTLPAAETPLALEPSSGPVRAKPTAR
GEAGSRDERRALAMKIPFPTDKIVNLPVDDFNELLARYPLTESQLALVRDIRRRGKNKVA
AQNCRKRKLETIVQLERELERLTNERERLLRARGEADRTLEVMRQQLTELYRDIFQHLRD
ESGNSYSPEEYALQQAADGTIFLVPRGTKMEATD

>gi568815586r:54141758_54357650|GENSCAN_predicted_CDS_6|1185_bp
atgtctgaccgtagagatcaggaagaccttgccaaatctacttggaatctcttgctgacg
gcagggaaaacagatgcaaggaagcagagcaggaacagggtgatacagctgtccacttca
gagctaggagagatggaactgacttggcaggagatcatgtccatcaccgagctgcagggt
ctgaatgctccaagtgagccatcatttgagccccaagccccagctccataccttggacct
ccaccacccacaacttactgcccctgctcaatccacccagattctggcttcccacttcct
ccaccaccttatgagctcccagcatccacatcccatgtcccagatcccccatactcctat
ggcaacatggccataccagtctccaagccactgagcctctcaggcctgctcagtgagccg
ctccaagaccccttagccctcctggacattgggctgccagcagggccacctaagccccaa
gaagacccagaatccgactcaggattatccctcaactatagcgatgctgaatctcttgag
ctggaggggacagaggctggtcggcggcgcagcgaatatgtagagatgtacccagtggag
tacccctactcactcatgcccaactccttggcccactccaactataccttgccagctgct
gagacccccttggccttagagccctcctcaggccctgtgcgggctaagcccactgcacgg
ggggaggcagggagtcgggatgaacgtcgggccttggccatgaagattccttttcctacg
gacaagattgtcaacttgccggtagatgactttaatgagctattggcaaggtacccgctg
acagagagccagctagcgctagtccgggacatccgacgacggggcaaaaacaaggtggca
gcccagaactgccgcaagaggaagctggaaaccattgtgcagctggagcgggagctggag
cggctgaccaatgaacgggagcggcttctcagggcccgcggggaggcagaccggaccctg
gaggtcatgcgccaacagctgacagagctgtaccgtgacattttccagcaccttcgggat
gaatcaggcaacagctactctcctgaagagtacgcgctgcaacaggctgccgatgggacc
atcttccttgtgccccgggggaccaagatggaggccacagactga

>gi568815586r:54141758_54357650|GENSCAN_predicted_peptide_7|219_aa
MTRSRLTSTSASQVQAGPNGNVWPISGGVSFAAPRRHQLRGKMEALILEPSLYTVKAILI
LDNDGDRLFAKYYDDTYPSVKEQKAFEKNIFNKTHRTDSEIALLEGLTVVYKSSIDLYFY
VIGSSYENELMLMAVLNCLFDSLSQMLRKNVEKRALLENMEGLFLAVDEIVDGGVILESD
PQQVVHRVALRGEDVPLTEQTVSQVLQSAKEQIKWSLLR

>gi568815586r:54141758_54357650|GENSCAN_predicted_CDS_7|660_bp
atgacgagatctcgcctcacctcaacctccgcctcccaggttcaagcggggcctaatggg
aacgtatggccaatcagcggcggcgtttcttttgcggctccacgtcggcaccagctgcgg
ggcaagatggaggcgctgattttggaaccttccctgtatactgtcaaagccatcctgatt
ctggacaatgatggagatcgactttttgccaagtactatgacgacacctaccccagtgtc
aaggagcaaaaggcctttgagaagaacattttcaacaagacccatcggactgacagtgaa
attgccctcttggaaggcctgacagtggtatacaaaagcagtatagatctctatttctat
gtgattggcagctcctatgaaaatgagctgatgcttatggctgttctgaactgtctcttc
gactcattgagccagatgctgaggaaaaatgtagaaaagcgagcactgctggagaacatg
gaggggctgttcttggctgtggatgaaattgtagatggaggggtgatcctagagagtgat
ccccagcaggtggtacaccgggtggcattaaggggtgaagatgtcccccttacggagcag
accgtgtctcaggtgctgcagtcagccaaagaacagatcaagtggtcactccttcggtga