GENSCAN 1.0 Date run: 4-Nov-116 Time: 15:56:16 Sequence gi568815594f:163250274_163451608 : 201335 bp : 37.46% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 539 534 6 1.05 1.02 Term - 3765 3602 164 1 2 73 42 116 0.102 2.62 1.01 Init - 4387 4324 64 1 1 64 75 72 0.167 5.02 1.00 Prom - 8655 8616 40 -4.25 2.03 PlyA - 9920 9915 6 1.05 2.02 Term - 15425 15317 109 1 1 115 44 119 0.998 7.20 2.01 Init - 15673 15498 176 1 2 79 110 94 0.880 9.57 2.00 Prom - 22188 22149 40 -7.55 3.04 PlyA - 22220 22215 6 1.05 3.03 Term - 24262 24170 93 1 0 81 44 129 0.342 4.65 3.02 Intr - 29333 29305 29 0 2 120 89 12 0.281 1.42 3.01 Init - 35138 34961 178 2 1 37 36 157 0.168 4.87 3.00 Prom - 40072 40033 40 -2.75 4.03 PlyA - 40100 40095 6 1.05 4.02 Term - 49263 49045 219 0 0 -9 48 186 0.254 0.96 4.01 Init - 66760 66419 342 1 0 68 39 151 0.075 5.48 4.00 Prom - 71808 71769 40 -6.25 5.03 PlyA - 72830 72825 6 1.05 5.02 Term - 75485 75030 456 2 0 78 47 286 0.999 17.64 5.01 Init - 76281 75583 699 0 0 65 116 451 0.990 40.63 5.00 Prom - 79643 79604 40 -11.34 6.05 PlyA - 80106 80101 6 1.05 6.04 Term - 82235 81763 473 0 2 63 52 332 0.819 21.21 6.03 Intr - 94094 93845 250 2 1 96 49 184 0.070 11.39 6.02 Intr - 94597 94501 97 0 1 52 93 35 0.572 -0.51 6.01 Init - 100345 100287 59 1 2 38 105 55 0.732 3.03 6.00 Prom - 115912 115873 40 -4.25 7.00 Prom + 116038 116077 40 -6.95 7.01 Init + 119586 119878 293 2 2 66 37 165 0.435 5.87 7.02 Intr + 123228 123394 167 0 2 90 58 72 0.013 3.08 7.03 Term + 154487 154755 269 0 2 56 44 218 0.451 8.87 7.04 PlyA + 155650 155655 6 1.05 8.00 Prom + 161343 161382 40 -5.55 8.01 Sngl + 164493 165017 525 2 0 33 43 230 0.632 8.83 8.02 PlyA + 166058 166063 6 1.05 9.00 Prom + 166527 166566 40 -7.15 9.01 Init + 175584 175822 239 2 2 55 73 146 0.611 7.34 9.02 Intr + 176137 176207 71 1 2 53 69 65 0.727 -0.99 9.03 Intr + 176679 176765 87 1 0 48 92 100 0.916 5.42 9.04 Term + 177525 177691 167 1 2 104 46 77 0.417 2.20 9.05 PlyA + 178182 178187 6 1.05 10.05 PlyA - 178253 178248 6 -3.64 10.04 Term - 178378 178264 115 0 1 49 49 182 0.057 7.46 10.03 Intr - 178473 178380 94 1 1 66 31 72 0.052 -2.60 10.02 Intr - 178606 178558 49 1 1 72 101 65 0.066 3.53 10.01 Intr - 192784 192650 135 1 0 82 90 39 0.068 3.34 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:163250274_163451608|GENSCAN_predicted_peptide_1|75_aa MAGAAHSMELTGAGNMQQLGTGPPMAAHGPVSMYFLPSEAHKNPSPSQTHRDVGTTCLQI GTTHSRSPESCIVAQ >gi568815594f:163250274_163451608|GENSCAN_predicted_CDS_1|228_bp atggctggggctgcacactccatggagctgaccggagctgggaacatgcagcagctggga acaggcccacccatggctgcccatggaccagtcagcatgtacttcctcccttctgaagcc cacaaaaaccccagccccagccagactcacagagatgtcgggacaacctgcctgcagata ggcactacccactccaggtctcctgagagctgtattgttgctcaataa >gi568815594f:163250274_163451608|GENSCAN_predicted_peptide_2|94_aa MKKQPRGYKVCCHRGINTICQLAGIKDMYAKVSGSISMLPLTWGLFHGLSVRNPSTAGRG TLKKEPGDEVPHIKLDLEEVNALQGMWRSVRSGF >gi568815594f:163250274_163451608|GENSCAN_predicted_CDS_2|285_bp atgaagaaacaacccagaggttacaaggtctgctgccacaggggcatcaacaccatatgc cagctcgctggcatcaaagacatgtatgccaaggtctctgggtccatcagtatgctcccc ctcacctggggcctcttccatgggctctcagtcagaaacccgtcaacagctggcaggggg accttgaaaaaggagccaggtgatgaggttccacatatcaaactggacttggaagaagtg aatgctctacagggaatgtggcgctctgtgcggtcgggtttctag >gi568815594f:163250274_163451608|GENSCAN_predicted_peptide_3|99_aa MGDITTVTTEIQKITRGYYEHLYAHKLENLEEMCKFLERYNPPRLNQEEIETLNRPITSS FCPYPLLHMCQIKETRARSDEDDGISPRNSDFVLYGETG >gi568815594f:163250274_163451608|GENSCAN_predicted_CDS_3|300_bp atgggagatattacaactgttactacagaaatacaaaagatcactcgaggctactatgaa cacctgtatgcacataaactagaaaacctagaggagatgtgtaaattcctggaaagatac aaccctcctagattaaaccaggaagaaatagaaactctgaacagaccaataacaagcagt ttctgcccatatcctctcctgcatatgtgccagataaaagaaaccagagctagatctgat gaagacgatggaatatcacccagaaattctgactttgtgctgtatggagaaacaggatga >gi568815594f:163250274_163451608|GENSCAN_predicted_peptide_4|186_aa MDGNAWMSKQKPAAGAEPSWRTSISAVQRGNVGLEPLHRVPTGALPSGPVRKRPLFSRSQ NGRSPSSLHPVPGKTADTQHQSLRAATGTGSCKDTGAELPKALGAPSLHQYALDRGEEAA EEKSKASRCWFMKFKEGRHLHYIKVQGAVASTDVEAAASDPEDLAKTIDKHGNTKNHFQW RQIAFY >gi568815594f:163250274_163451608|GENSCAN_predicted_CDS_4|561_bp atggatggaaatgcctggatgtccaagcagaagcctgctgcaggggcagagccctcatgg agaacctctattagtgcagtccagaggggaaatgtggggttggagcccctacacagagtc cccactggggcactgcctagtggacctgtgagaaagaggccactattctccagatcccag aatggtagatcccccagcagcttgcaccctgtgcctggaaaaactgcagacactcaacat cagtctttgagagcagccactgggactggatcctgcaaagacacaggggcagagctgccc aaggccttgggagccccctctttgcatcaatatgccctagatagaggcgaggaagctgct gaagaaaagtcgaaagctagcagatgttggttcatgaagtttaaggaagggaggcatctc cattacataaaagtgcaaggtgcagtagcaagtacagatgtagaagctgcagcaagtgat ccagaagatctagctaagacaattgataaacatggcaacactaaaaatcattttcagtgg agacaaatagccttctattga >gi568815594f:163250274_163451608|GENSCAN_predicted_peptide_5|384_aa MNSTLFSQVENHSVHSNFSEKNAQLLAFENDDCHLPLAMIFTLALAYGAVIILGVSGNLA LIIIILKQKEMRNVTNILIVNLSFSDLLVAIMCLPFTFVYTLMDHWVFGEAMCKLNPFVQ CVSITVSIFSLVLIAVERHQLIINPRGWRPNNRHAYVGIAVIWVLAVASSLPFLIYQVMT DEPFQNVTLDAYKDKYVCFDQFPSDSHRLSYTTLLLVLQYFGPLCFIFICYFKIYIRLKR RNNMMDKMRDNKYRSSETKRINIMLLSIVVAFAVCWLPLTIFNTVFDWNHQIIATCNHNL LFLLCHLTAMISTCVNPIFYGFLNKNFQRDLQFFFNFCDFRSRDDDYETIAMSTMHTDVS KTSLKQASPVAFKKINNNDDNEKI >gi568815594f:163250274_163451608|GENSCAN_predicted_CDS_5|1155_bp atgaattcaacattattttcccaggttgaaaatcattcagtccactctaatttctcagag aagaatgcccagcttctggcttttgaaaatgatgattgtcatctgcccttggccatgata tttaccttagctcttgcttatggagctgtgatcattcttggtgtctctggaaacctggcc ttgatcataatcatcttgaaacaaaaggagatgagaaatgttaccaacatcctgattgtg aacctttccttctcagacttgcttgttgccatcatgtgtctcccctttacatttgtctac acattaatggaccactgggtctttggtgaggcgatgtgtaagttgaatccttttgtgcaa tgtgtttcaatcactgtgtccattttctctctggttctcattgctgtggaacgacatcag ctgataatcaaccctcgagggtggagaccaaataatagacatgcttatgtaggtattgct gtgatttgggtccttgctgtggcttcttctttgcctttcctgatctaccaagtaatgact gatgagccgttccaaaatgtaacacttgatgcgtacaaagacaaatacgtgtgctttgat caatttccatcggactctcataggttgtcttataccactctcctcttggtgctgcagtat tttggtccactttgttttatatttatttgctacttcaagatatatatacgcctaaaaagg agaaacaacatgatggacaagatgagagacaataagtacaggtccagtgaaaccaaaaga atcaatatcatgctgctctccattgtggtagcatttgcagtctgctggctccctcttacc atctttaacactgtgtttgattggaatcatcagatcattgctacctgcaaccacaatctg ttattcctgctctgccacctcacagcaatgatatccacttgtgtcaaccccatattttat gggttcctgaacaaaaacttccagagagacttgcagttcttcttcaacttttgtgatttc cggtctcgggatgatgattatgaaacaatagccatgtccacgatgcacacagatgtttcc aaaacttctttgaagcaagcaagcccagtcgcatttaaaaaaatcaacaacaatgatgat aatgaaaaaatctga >gi568815594f:163250274_163451608|GENSCAN_predicted_peptide_6|292_aa MAHDFAKHPLIQQDRRQCESRVSLPLRNLKSKIAFGLQNSAERALGYFPWSKEAPSRIPD TLLPGSARACSTRVSWEPEGPARHERIQPSTLSAVSSLLTSACSATLIPIFPAALPSPSP PRSLGTPDNALAPETGAKKTNSKEGKLRVYAFPETSAKSLIDKVKDERGRKTMRVSETGS GGPRWGLQCRQVAGCSKSPDRSRAPAEGYGDDAGERRGYPTRSSGFPTTCSRGISGWGSK FSFTPVLFKRPRHWEGARVRLSHTRSFSLRVLSWLEALALAAPEFSRSNWLG >gi568815594f:163250274_163451608|GENSCAN_predicted_CDS_6|879_bp atggcacatgactttgccaaacatccactgatccagcaagacagacgtcagtgtgaaagc agagtttctcttcccttgagaaatctgaaatctaaaattgcattcggactgcaaaattct gctgaaagagccttaggctattttccatggtctaaggaggctccttcgcggatccccgac actctcctcccgggttcggcccgcgcctgctccacccgggtgagttgggaaccggaggga cctgctcggcacgaaagaattcagcccagtaccctctccgctgtctcttcactcctgaca agtgcctgttccgccaccctaatccccatttttccagccgcgctcccttcaccttcccca ccccgctctcttgggacccccgacaacgccctggcaccagagacgggagcgaaaaagaca aattccaaagagggtaagttgcgagtttatgcctttccagagacttctgcgaaatctctc attgacaaggtgaaggatgagaggggaagaaaaacgatgcgagtgtccgaaactggctct gggggaccaaggtggggtctccagtgcaggcaggtcgcaggttgctccaaatcaccggac cgttcgcgggctcctgccgaagggtatggggacgacgcgggtgaaaggagagggtaccct acgcggagttcgggctttcccaccacctgctcccggggaatttctggatggggatccaag ttttcctttaccccggttctctttaaaaggccgaggcactgggaaggcgcccgagttcgc ctatcccacacccgctcattttccttacgtgtcttgagctggctggaggcgctggctctg gccgcaccggagttttcgcggagtaactggctgggatga >gi568815594f:163250274_163451608|GENSCAN_predicted_peptide_7|242_aa MKTLEENLGNTIQYIGLGKDFMTKTQKAMAIKAKIDKWDLIKIKSFSTAKETIIKVNRQP TEWEKNFAIYPSDKGLISRIYKELKQIYKKKSKNPIKKYIQPVNFSGKAIDFEPAQAWIS ISPESFKRCITLAQRTHSSEPQYLHLLNRDKSNGSSGATIDFCLSFPVCGILLQQQPYQK EYAFPTPQYCQHSVALVLFPAISNKLSLFLMIAHASYIGDASIQQEPAGFATLLKLSRRT FE >gi568815594f:163250274_163451608|GENSCAN_predicted_CDS_7|729_bp atgaaaaccctagaagaaaacctaggcaataccattcagtacataggcttgggcaaagac ttcatgactaaaacacaaaaagcaatggcaataaaagccaaaatagacaaatgggatctg attaaaataaagagcttcagcacagcgaaagaaactatcatcaaagtgaacaggcaacct acagaatgggagaaaaattttgcaatctatccatcagacaaagggctaatatccagaatc tacaaggaacttaaacaaatttacaagaaaaaatcaaaaaaccccatcaaaaagtacatt cagccagtgaattttagtggaaaggccatcgactttgaaccagcacaggcctggattagt atttcaccagagtcattcaaaagatgtataaccttagcccaacggactcattcttccgaa cctcagtatcttcacctattaaacagggataaaagcaatggaagttccggtgcaacaatc gatttctgtttaagcttcccagtttgtggtattttgctacagcagcagccctatcagaaa gaatacgccttccccactccacaatactgccagcattctgtggcattggtgctgtttcct gcaataagcaataaactcagccttttcttaatgatagctcatgctagttatattggagat gccagcattcagcaagagcctgctggatttgccactcttttaaagctgtcaagaagaact tttgaataa >gi568815594f:163250274_163451608|GENSCAN_predicted_peptide_8|174_aa MPSQKFPAGAGHLWRAFARAVQKGNIGLEAPHRVPTEAPPIGAVRRGPLSSRPQNGRSTN SLPRLPGKATDTQRQPMKAAREETVPCKATGAELPKTIGTHLLHQHDLDVRPGVKGDHFG ALKFDYPPGFWTCMGTVTTSFCPISPIWNGCIYPMPVLPLYLGSNQFAFNFTGS >gi568815594f:163250274_163451608|GENSCAN_predicted_CDS_8|525_bp atgcccagtcagaagttccctgcaggggcagggcacttatggagagcctttgctagggca gtgcagaagggaaatatagggttggaggccccacacagagtccctactgaggcaccacct attggagctgtgagaagagggccactgtcctccagaccccagaatggtagatccactaac agcttgccccgtttgcctggaaaagccacagacactcaacgccagcccatgaaagcagcc agggaggagactgtaccctgcaaagccacaggggcagagctgcccaagaccataggaacc cacctcttgcatcagcatgatctggacgtgagacctggagtcaaaggagatcattttgga gctttaaaatttgactaccctcctggattttggacttgcatgggaactgtaaccacttcg ttttgtccaatttctcccatttggaatggctgtatttacccaatgcctgtactcccattg tatttaggaagtaaccagtttgctttcaattttacaggctcatag >gi568815594f:163250274_163451608|GENSCAN_predicted_peptide_9|187_aa MRQSLEGAFPQRTKFLWTRSPQPDDPTTGLRVPGQAPAHVITLTQPRVHLTIEGQEIDFL LDTGVAFSVLVSCPGQLSSRETGIALGVLTQTRGTTPQPVAYLRISAQLAELVILTRALT LGKGRRINVYTDSPVTAILLLLAFGPCIFNLLVKFVSSRIEAIKLQMVLQMEPQMSPTNN FYQGPLD >gi568815594f:163250274_163451608|GENSCAN_predicted_CDS_9|564_bp atgaggcaatcactggaaggagcatttccccagaggacaaagtttctctggaccagaagc ccccaaccagatgatccaacaacaggactgagggtgccagggcaagcgccagctcatgtc atcaccctcactcagccccgggtacatttaaccattgagggccaggaaattgacttcctt ctggacactggcgtggctttctcagtgttagtctcctgtcctggacagctgtcctcaaga gagacaggaatagctcttggggtccttactcagactcgtgggacaaccccacaaccagtg gcatatctaagaatcagtgcccagttagcagaactagtgatacttacccgagccttaaca ctgggaaagggaagaagaataaatgtgtatacagatagtcctgtgacagccatcttacta ttactcgcctttgggccctgtatttttaacctccttgtaaaatttgtgtcctctaggatc gaggccatcaagctacagatggtcttacaaatggaaccccaaatgagcccaactaacaac ttctaccaaggacccctggactga >gi568815594f:163250274_163451608|GENSCAN_predicted_peptide_10|130_aa LLGIYTTLSQTISIIQKVRERKLGTQKNVGEAIASVGVTESEKLRWVLGLADFKNEAADP CVVSLTSGVKLQTFAVNVTALKVGMSGVVCSSHGFVGLLTSGMKLQTLAVNVAARKATVD PKSEQQQDLL >gi568815594f:163250274_163451608|GENSCAN_predicted_CDS_10|393_bp ctgttgggcatctacactactttatcgcaaacaataagcattattcaaaaagtaagggag aggaaactaggcactcaaaaaaatgtaggagaagcaatagcctcagtaggtgtcactgaa agtgagaagttgaggtgggttcttggcctcgctgacttcaagaatgaagccgcggaccct tgcgtggtctcactgacttcaggagtgaagctgcagaccttcgcagtgaatgttacagct cttaaagttggcatgtccggagttgtttgttcctcccatgggttcgtgggcttgctgact tcaggaatgaagctgcagacccttgcagtgaatgttgcagctcgtaaagctactgtggac ccaaagagtgagcagcagcaagatttattgtga