GENSCAN 1.0 Date run: 8-Nov-116 Time: 09:50:11 Sequence gi568815586f:120338127_120540534 : 202408 bp : 47.85% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.15 PlyA - 422 417 6 1.05 1.14 Term - 4979 4944 36 2 0 85 52 64 0.433 -0.16 1.13 Intr - 8163 8009 155 1 2 106 84 252 0.950 26.39 1.12 Intr - 9388 9320 69 2 0 94 99 92 0.994 10.05 1.11 Intr - 9562 9536 27 2 0 99 63 34 0.510 0.09 1.10 Intr - 15253 15173 81 2 0 97 131 91 0.991 13.91 1.09 Intr - 18893 18776 118 2 1 35 59 121 0.687 3.94 1.08 Intr - 19772 19690 83 0 2 92 101 107 0.981 11.76 1.07 Intr - 20927 20879 49 2 1 66 105 123 0.916 10.05 1.06 Intr - 25009 24917 93 1 0 112 98 150 0.994 18.56 1.05 Intr - 26629 26588 42 1 0 97 100 14 0.737 1.94 1.04 Intr - 29966 29882 85 1 1 53 79 84 0.608 3.82 1.03 Intr - 30147 30066 82 1 1 35 99 134 0.986 7.90 1.02 Intr - 30361 30291 71 0 2 66 40 64 0.626 -1.77 1.01 Init - 30855 30707 149 0 2 121 84 97 0.680 10.17 1.00 Prom - 45133 45094 40 -4.76 2.04 PlyA - 46244 46239 6 1.05 2.03 Term - 50898 50820 79 2 1 77 38 106 0.823 1.74 2.02 Intr - 55232 55171 62 2 2 77 107 32 0.718 1.53 2.01 Init - 57672 57619 54 0 0 92 90 59 0.814 5.78 2.00 Prom - 74039 74000 40 -1.96 3.00 Prom + 74172 74211 40 -3.86 3.01 Init + 100001 100103 103 1 1 85 84 74 0.818 5.65 3.02 Intr + 100253 100395 143 0 2 85 107 240 0.941 25.67 3.03 Term + 102654 102737 84 2 0 92 44 51 0.665 -1.25 3.04 PlyA + 105274 105279 6 1.05 4.03 PlyA - 105865 105860 6 1.05 4.02 Term - 106829 106746 84 2 0 97 43 101 0.996 4.15 4.01 Init - 108246 108100 147 0 0 97 99 298 0.636 31.89 4.00 Prom - 108294 108255 40 -16.83 5.00 Prom + 108312 108351 40 -16.89 5.01 Init + 108355 108435 81 0 0 53 103 115 0.999 8.49 5.02 Intr + 108531 108703 173 2 2 121 97 359 0.997 38.84 5.03 Intr + 118950 119053 104 0 2 70 98 142 0.974 13.22 5.04 Term + 120501 120508 8 1 2 121 42 0 0.606 -3.27 5.05 PlyA + 121575 121580 6 -0.45 6.05 PlyA - 123271 123266 6 1.05 6.04 Term - 124036 123893 144 1 0 117 53 47 0.825 2.01 6.03 Intr - 125981 125824 158 0 2 51 90 191 0.677 15.33 6.02 Intr - 127661 127570 92 1 2 98 47 103 0.594 6.84 6.01 Init - 131483 131296 188 2 2 34 93 527 0.997 46.33 6.00 Prom - 133668 133629 40 -4.16 7.00 Prom + 146915 146954 40 -4.46 7.01 Init + 158296 158427 132 0 0 76 100 232 0.897 23.34 7.02 Term + 159947 160084 138 1 0 116 42 130 0.993 9.16 7.03 PlyA + 160347 160352 6 1.05 8.08 PlyA - 161275 161270 6 1.05 8.07 Term - 165759 165658 102 0 0 124 31 107 0.999 6.88 8.06 Intr - 165955 165844 112 0 1 89 80 94 0.951 9.08 8.05 Intr - 166857 166769 89 0 2 60 52 24 0.638 -5.23 8.04 Intr - 171997 171891 107 2 2 86 80 177 0.842 16.63 8.03 Intr - 178662 178441 222 1 0 96 105 172 0.998 17.80 8.02 Intr - 184237 184088 150 2 0 48 95 110 0.989 7.83 8.01 Init - 184865 184499 367 2 1 77 50 262 0.583 18.49 8.00 Prom - 185576 185537 40 -9.46 9.00 Prom + 187322 187361 40 -3.06 9.01 Init + 188182 188540 359 0 2 60 33 289 0.176 17.18 9.02 Intr + 196209 196263 55 0 1 60 57 69 0.441 -0.02 9.03 Term + 196532 196831 300 1 0 92 40 340 0.845 24.82 9.04 PlyA + 197470 197475 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:120338127_120540534|GENSCAN_predicted_peptide_1|379_aa MADPGPAGPPRSPGPRPLRPGARRSRGPFVSLLLPQQDVHRGTQLADYAGPARPSASRGP GGRQEAQRERGEGEGLREYFGQFGEVKECLVMRDPLTKRSRGFGFVTFMDQAGVDKVLAQ SRHELDSKTIDPKVAFPRRAQPKMVTRTKKIFVGGLSVNTTVEDVKQYFEQFGKVDDAML MFDKTTNRHRGFGFVTFESEDIVEKVCEIHFHEINNKMVECKKAQPKEVMSPTGSARGRS RVMPYGMDAFMLGIGMLGYPGFQATTYASRSYTGLAPGYTYQFPGQDTDGVAQAIPLTAY GPMAAAAAAAAVVRGTGSTPSRTGGFLGTTSPGPMAELYGAANQDSGVSSYISAASPAPS TGFGHSLGRPSLQLTEDHE >gi568815586f:120338127_120540534|GENSCAN_predicted_CDS_1|1140_bp atggccgatccgggtccggccgggcctccccggagcccgggcccgcgccccctgcgccct ggcgcccggcgctcacgcgggccctttgtgtctctcctcctcccgcagcaagatgttcat cgggggactcagttggcagactacgcaggcccagctcggccctcggcttcccggggcccc ggtgggcgccaggaggctcagcgggaacggggcgagggcgaagggctgcgcgaatacttc ggccagttcggggaggtgaaggagtgtctggtgatgcgggaccccctgaccaagagatcc aggggtttcggcttcgtcactttcatggaccaggcgggggtggataaagtgctggcgcaa tcgcggcacgagctcgactccaaaacaattgaccctaaggtggccttccctcggcgagca cagcccaagatggtgactcgaacgaagaagatctttgtgggggggctgtcggtgaacacc acggtggaggacgtgaagcaatattttgagcagtttgggaaggtggacgacgccatgctg atgtttgacaaaaccaccaaccggcaccgagggttcgggtttgtcacgtttgagagtgag gacatcgtggagaaagtgtgtgaaattcattttcatgaaatcaacaacaaaatggtggaa tgtaagaaagctcagccaaaggaggtgatgtcgccaacgggctcagcccgggggaggtct cgagtcatgccctacggaatggacgccttcatgctgggcatcggcatgctgggttaccca ggtttccaagccacaacctacgccagccggagttatacaggcctcgcccctggctacacc taccagttccccggtcaggacacagatggtgtggcccaagccattcctctcactgcctac ggaccaatggcggcggcagcggcggcagcggctgtggttcgagggacaggttcgactccc agccgcacagggggcttcctggggaccaccagccccggccccatggccgagctctacggg gcggccaaccaggactcgggggtcagcagttacatcagcgccgccagccctgcccccagc accggcttcggccacagtcttgggcgccccagcctgcagctgactgaggaccacgagtga >gi568815586f:120338127_120540534|GENSCAN_predicted_peptide_2|64_aa MGRARWLTPVIPALLEAEVPIMHPLSACSLCHPVNALVRSRDADWLRAGPWAQRCVSIIV RDCE >gi568815586f:120338127_120540534|GENSCAN_predicted_CDS_2|195_bp atgggccgggcacggtggctcacacctgtaatcccagcacttttggaggctgaggtcccc atcatgcacccgctcagtgcttgttctctctgccatcctgtcaatgcccttgtgagatca cgtgatgccgactggctccgagctgggccctgggctcagcgctgtgtgagcatcattgta cgggactgtgaatag >gi568815586f:120338127_120540534|GENSCAN_predicted_peptide_3|109_aa MAVVGVSSVSRLLGRSRPQLGRPMSSGAHGEEGSARMWKTLTFFVALPGVAVSMLNVYLK SHHGEHERPEFIAYPHLRIRTKKLAFELQVSVQVEKKLALVISVSTVKV >gi568815586f:120338127_120540534|GENSCAN_predicted_CDS_3|330_bp atggcggtagttggtgtgtcctcggtttctcggctgctgggtcggtcccgcccacagctg gggcggcctatgtcgagtggcgcccatggcgaagagggctcagctcgcatgtggaagact ctcaccttcttcgtcgcgctccccggggtggcagtcagcatgctgaatgtgtacctgaag tcgcaccacggagagcacgagagacccgagttcatcgcctacccccatctccgcatcagg accaagaaattagcatttgagcttcaagtcagtgtccaagttgaaaagaaattggcatta gttatttctgtttccacagtgaaggtctag >gi568815586f:120338127_120540534|GENSCAN_predicted_peptide_4|76_aa MNSVGEACTDMKREYDQCFNRWFAEKFLKGDSSGDPCTDLFKRYQQCVQKAIKEKEIPIE GLEFMGHGKEKPENSS >gi568815586f:120338127_120540534|GENSCAN_predicted_CDS_4|231_bp atgaacagtgtgggggaggcatgcacggacatgaagcgcgagtacgaccagtgcttcaat cgctggttcgccgagaaatttctcaagggggacagctccggggacccgtgcaccgacctc ttcaagcgctaccagcagtgtgttcagaaagcaataaaggagaaagagattcctattgaa ggactggagttcatgggccatggcaaagaaaagcctgaaaattcttcttga >gi568815586f:120338127_120540534|GENSCAN_predicted_peptide_5|121_aa MWSRLVWLGLRAPLGGRQGFTSKADPQGSGRITAAVIEHLERLALVDFGSREAVARLEKA IAFADRLRAVDTDGVEPMESVLEDRCLYLRSDNVVEGNCADELLQNSHRVVEEYFVAPPG R >gi568815586f:120338127_120540534|GENSCAN_predicted_CDS_5|366_bp atgtggtcgcggttggtgtggctgggccttcgggcccctctgggcgggcgccagggcttc acctccaaggcggatcctcagggcagtggccggatcacggctgcggtgatcgagcacctg gagcgtctagcgcttgtggacttcggcagccgcgaggcagtggcgcgactggagaaagct atcgccttcgccgaccggctacgcgccgtggacacagacggggtggagcccatggaatcg gtcctggaggacagatgtctatacctgagatccgacaatgtggtagaaggcaactgtgct gatgaattactacaaaactcccatcgcgtcgtggaggagtactttgtggcccccccaggt aggtga >gi568815586f:120338127_120540534|GENSCAN_predicted_peptide_6|193_aa MSGWADERGGEGDGRIYVGNLPTDVREKDLEDLFYKYGRIREIELKNRHGLVPFAFVRFE DPRDAEDAIYGRNGYDYGQCRLRVEFPRTYGGRGSWQDLKDHMREAGDVCYADVQKDGVG MVEYLRKEDMEYALRKLDDTKFRSHEGETSYIRVYPERSTSYGYSRSRSGSRGRDSPYQS RGSPHYFSPFRPY >gi568815586f:120338127_120540534|GENSCAN_predicted_CDS_6|582_bp atgtcgggctgggcggacgagcgcggcggcgagggcgacgggcgcatctacgtggggaac cttccgaccgacgtgcgcgagaaggacttggaggacctgttctacaagtacggccgcatc cgcgagatcgagctcaagaaccggcacggcctcgtgcccttcgccttcgtgcgcttcgag gacccccgagatgcagaggatgctatttatggaagaaatggttatgattatggccagtgt cggcttcgtgtggagttccccaggacttatggaggtcggggcagctggcaggacctgaag gatcacatgcgagaagctggggatgtctgttatgctgatgtgcagaaggatggagtgggg atggtcgagtatctcagaaaagaagacatggaatatgccctgcgtaaactggatgacacc aaattccgctctcatgagggtgaaacttcctacatccgagtttatcctgagagaagcacc agctatggctactcacggtctcggtctgggtcaaggggccgtgactctccataccaaagc aggggttccccacactacttctctcctttcaggccctactga >gi568815586f:120338127_120540534|GENSCAN_predicted_peptide_7|89_aa MCDRKAVIKNADMSEEMQQDSVECATQALEKYNIEKDIAAHIKKEFDKKYNPTWHCIVGR NFGSYVTHETKHFIYFYLGQVAILLFKSG >gi568815586f:120338127_120540534|GENSCAN_predicted_CDS_7|270_bp atgtgcgaccgaaaggccgtgatcaaaaatgcggacatgtcggaagagatgcaacaggac tcggtggagtgcgctactcaggcgctggagaaatacaacatagagaaggacattgcggct catatcaagaaggaatttgacaagaagtacaatcccacctggcattgcatcgtggggagg aacttcggtagttatgtgacacatgaaaccaaacacttcatctacttctacctgggccaa gtggccattcttctgttcaaatctggttaa >gi568815586f:120338127_120540534|GENSCAN_predicted_peptide_8|382_aa MRFAKKHNKKGLKKMQANSARATSARADAIKALVKPKEVKPEIPKGVSRKLDPLAYIAHP QAWEMCSFLHCQGAQAMSVKGQGQRSNQGPGCSSSLGSQRCPDTYEGFRVDISVCQCEDR RTVYQVFESVAKKYDVMNDMMSLGIHRVWKDLLLWKMHPLPGTQLLDVAGGTGDIAFRFL NYVQSQHQRKQKRQLRAQQNLSWEEIAKEYQNEEDSLGGSRVVVCDINKEMLKVGKQKAL AQGYRAGLAWVLGDAEELPFDDDKFDIYTIAFGIRNVTHIDQALQEAHRVLKPGGRFLCL EFSQVNNPLISRLYDLYSFQVIPVLGEVIAGDWKSYQYLVESIRRFPSQEEFKDMIEDAG FHKVTYESLTSGIVAIHSGFKL >gi568815586f:120338127_120540534|GENSCAN_predicted_CDS_8|1149_bp atgcgctttgccaagaagcacaacaagaaaggcctaaaaaagatgcaggccaacagtgcc agggccacgagtgcacgtgctgacgctatcaaggcccttgtaaagcccaaggaggttaag cccgagatcccaaagggtgtcagccgcaagctcgatccacttgcctacattgcccacccc caggcttgggaaatgtgctcattcctgcattgccaaggggctcaagctatgtcggtcaaa ggccaaggccaaagatcaaaccaaggcccaggctgcagctccagcttaggctcccaaagg tgcccggacacctacgaaggcttcagagtagatatctctgtctgccaatgtgaggacaga aggactgtctatcaggtgtttgaaagtgtggctaagaagtatgatgtgatgaatgatatg atgagtcttggtatccatcgtgtttggaaggatttgctgctctggaagatgcacccgctt cctgggacccagctgcttgatgttgctggaggcacaggtgacattgcattccggttcctt aattatgttcagtcccagcatcagagaaaacagaagaggcagttaagggcccaacaaaat ttatcctgggaagaaattgccaaagagtaccagaatgaagaagattccttgggcgggtct cgtgtcgtggtgtgtgacatcaacaaggagatgctaaaggttggaaagcagaaagccttg gctcaaggatacagagctggacttgcatgggtattaggagatgctgaagaactgcccttt gatgatgacaagtttgatatttacaccattgcctttgggatccggaatgtcacacacatt gatcaggcactccaggaagctcatcgggtgctgaaaccaggaggacggtttctctgtctg gaatttagccaagtgaacaatcccctcatatccaggctttatgatctatatagcttccag gtcatccctgtcctgggagaggtcatcgctggagactggaagtcctatcagtaccttgta gagagtatccgaaggtttccgtctcaggaagagttcaaggacatgatagaagatgcaggc tttcacaaggtgacttacgaaagtctaacatcaggcattgtggccattcattctggcttc aaactttaa >gi568815586f:120338127_120540534|GENSCAN_predicted_peptide_9|237_aa MKMLIMLEVKFKLSVSGVIGEIDDCGNIDTAATGETLDIQPEELQEDELTNMNKKWSCDK KVEDVPEEVMPAKTFTLKEFLEVFHNIESTKDKISEVDKNLERNMAICQGIEKIFHTTRS PSGFFEMLFGDSSPFPEQFEKPRKETGKNVAMKAENRCRRRPPPALNAMSLGPRRARSAP TAVAAEAPVDAAELPQRRRHRLRHGQEQRLQQLLRLFGQQQRATAAPLRLGGASRRV >gi568815586f:120338127_120540534|GENSCAN_predicted_CDS_9|714_bp atgaaaatgctgataatgctggaagtgaaattcaaattgagtgtcagtggagttatagga gaaatagatgactgtgggaatattgacactgctgccactggagagactctagacatacag ccagaggaactgcaggaagatgaacttaccaacatgaacaagaaatggagctgtgacaaa aaggttgaagatgtcccagaagaagtgatgccagcaaaaactttcacattaaaggagttc ttggaggtatttcataacattgaaagcacaaaggataaaatatcagaagtagataaaaac ttagaaagaaatatggcaatttgccaaggcatagaaaagatatttcatacgacaagaagt ccatccgggttctttgagatgctgtttggcgactcgtcgccattcccggagcagtttgag aagccaaggaaggaaacagggaaaaatgtcgccatgaaggccgagaaccgctgccgccgc cgacccccgccggccctgaacgccatgagcctgggtccccgccgcgcccgctccgctccg actgccgtcgccgccgaggcccccgttgatgccgctgagctcccccaacgccgccgccac cgcctccgacatggacaagaacagcggctccaacagctcctccgcctcttcgggcagcag caaagggcaacagccgccccgctccgcctcggcggggccagccggcgagtctaa