GENSCAN 1.0 Date run: 4-Nov-116 Time: 16:33:51 Sequence gi568815585f:27820139_28024698 : 204560 bp : 44.71% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 1998 1993 6 1.05 1.02 Term - 8491 8462 30 1 0 120 49 36 0.715 0.75 1.01 Init - 9999 9751 249 0 0 46 59 200 0.806 8.37 1.00 Prom - 11488 11449 40 -5.76 2.00 Prom + 11765 11804 40 -3.46 2.01 Init + 13753 13811 59 0 2 75 94 65 0.258 4.69 2.02 Intr + 20705 20795 91 2 1 96 94 35 0.280 4.90 2.03 Intr + 25963 26017 55 0 1 69 121 11 0.078 1.05 2.04 Term + 32557 32624 68 2 2 104 44 84 0.615 3.70 2.05 PlyA + 34514 34519 6 1.05 3.00 Prom + 53062 53101 40 -4.56 3.01 Init + 57329 57349 21 1 0 77 101 9 0.085 0.78 3.02 Intr + 88002 88177 176 2 2 23 81 100 0.007 1.54 3.03 Term + 88398 88605 208 0 1 56 39 141 0.008 2.81 3.04 PlyA + 89881 89886 6 1.05 4.00 Prom + 99604 99643 40 -3.66 4.01 Init + 100001 100406 406 1 1 99 115 462 0.998 46.75 4.02 Intr + 104118 104447 330 1 0 117 38 547 0.131 48.20 4.03 Intr + 105775 105904 130 2 1 31 58 97 0.088 0.75 4.04 Intr + 108887 108944 58 2 1 66 116 21 0.135 1.59 4.05 Intr + 109969 109987 19 0 1 146 96 12 0.019 3.98 4.06 Intr + 111874 111901 28 2 1 106 86 -3 0.009 -1.53 4.07 Term + 125121 125277 157 0 1 79 33 178 0.142 8.81 4.08 PlyA + 125409 125414 6 1.05 5.07 PlyA - 125857 125852 6 1.05 5.06 Term - 134171 134038 134 0 2 15 53 174 0.156 4.85 5.05 Intr - 134483 134346 138 0 0 66 77 95 0.152 6.64 5.04 Intr - 141262 141133 130 1 1 44 65 137 0.759 7.27 5.03 Intr - 143231 143067 165 2 0 102 -4 84 0.174 0.76 5.02 Intr - 144877 144732 146 2 2 134 75 321 0.875 35.40 5.01 Init - 148868 148328 541 2 1 96 121 881 0.966 87.24 5.00 Prom - 150083 150044 40 -7.66 6.00 Prom + 153054 153093 40 -9.36 6.01 Sngl + 155639 156010 372 1 0 82 43 195 0.592 8.66 6.02 PlyA + 156057 156062 6 1.05 7.05 PlyA - 156717 156712 6 1.05 7.04 Term - 158314 157968 347 2 2 106 37 732 0.997 64.56 7.03 Intr - 159161 159067 95 1 2 61 56 28 0.195 -3.49 7.02 Intr - 160511 160331 181 0 1 92 80 51 0.143 3.63 7.01 Init - 168499 168325 175 1 1 54 110 162 0.808 14.51 7.00 Prom - 171403 171364 40 -5.86 8.06 PlyA - 172759 172754 6 1.05 8.05 Term - 180496 180451 46 0 1 134 44 90 0.750 5.88 8.04 Intr - 195118 195019 100 2 1 79 101 83 0.950 7.87 8.03 Intr - 195563 195452 112 2 1 94 111 74 0.997 10.25 8.02 Intr - 198451 198329 123 1 0 77 54 140 0.969 10.28 8.01 Intr - 203339 203212 128 0 2 60 76 105 0.934 6.90 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 104118 104563 446 1 2 117 54 589 0.865 53.80 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:27820139_28024698|GENSCAN_predicted_peptide_1|92_aa MAPSAVSPPQAPRRPLLGPWGLRAAGITETHIPVASSLSLCSEERQKGILLFPSRFLHLP SSDKPKNLPSEKTARISRYLGVEGVFQMVQFT >gi568815585f:27820139_28024698|GENSCAN_predicted_CDS_1|279_bp atggcgccctctgctgttagtccgccccaggctccgcgccggcctctcctgggtccgtgg ggcctgcgggctgcggggatcaccgagacccacattcccgtggccagcagcctttcgctc tgctcagaggagaggcagaagggcatattgctgtttcccagtcgctttttacacctgcct tcttcggataaacccaaaaatcttccttcagagaagacggcccgtatttcccgttatttg ggggtggagggagtcttccagatggtgcagttcacctag >gi568815585f:27820139_28024698|GENSCAN_predicted_peptide_2|90_aa MRMLVLWLRLLLKVSVGRKRGPSSAAQPLNAGGPQCSVQVLSFLTLHSSWCLNARTAGIW IQQSKFSEGLRWSLEFAFPRSSKVMLVLLV >gi568815585f:27820139_28024698|GENSCAN_predicted_CDS_2|273_bp atgaggatgctggtcctctggctcaggctgctcctcaaggtctcggtggggagaaagagg ggaccttcctccgcagcccagcctctgaatgctggtggcccccagtgctcggtccaggtc ctctccttcctcacccttcactcttcttggtgcttgaatgccaggacagcagggatttgg attcagcagagcaagttttctgaaggtctgcgctggagcctggaatttgcatttccaaga agctccaaggtgatgctggtgctgctggtctag >gi568815585f:27820139_28024698|GENSCAN_predicted_peptide_3|134_aa MQEQLGNHKYPQSNKQTILVNIYKGATVIKFQGVHSSSNQKFTSRDEKYKLGTDWFQVID GLWKGSWLDPRMRKQRIQRAKLTFSTCDCSDFGVLGEWDGVLEPIPWGYGESAVPLLICW KRTCFCRAPNDEEL >gi568815585f:27820139_28024698|GENSCAN_predicted_CDS_3|405_bp atgcaagaacagttgggtaatcacaaatatccccagtccaataaacaaacgatccttgtg aatatttacaagggagcaactgttataaagttccagggagtgcacagtagcagcaaccaa aaattcacaagtagagatgaaaagtacaagctgggcacagactggttccaggtgatagac ggactgtggaagggaagttggctggatccgaggatgcggaagcagaggatacagagagcc aagctcacgttttcaacctgcgactgctcggattttggtgttttgggggagtgggacggg gtcctggaaccaatcccctggggatatggagagtcagccgtacctttgctcatctgttgg aaaagaacgtgtttctgcagggcaccaaatgacgaagaactataa >gi568815585f:27820139_28024698|GENSCAN_predicted_peptide_4|375_aa MNGEEQYYAATQLYKDPCAFQRGPAPEFSASPPACLYMGRQPPPPPPHPFPGALGALEQG SPPDISPYEVPPLADDPAVAHLHHHLPAQLALPHPPAGPFPEGAEPGVLEEPNRVQLPFP WMKSTKAHAWKGQWAGGAYAAEPEENKRTRTAYTRAQLLELEKEFLFNKYISRPRRVELA VMLNLTERHIKIWFQNRRMKWKKEEDKKRGGGTAVGGGGVAEPEQDCAVTSGEELLALPP PPPPGGCGRNSCSEVEAVLKTSTVAWCALETNNYSRASMTFTSLEIMKTDRPSWSHKPSI PLGIFNPKVWFVSEEFSMSLISKSMVAYWRQAGLSYIRYSQICAKVVRDALKTEFKANAK KTSGNSVKIVKVKKE >gi568815585f:27820139_28024698|GENSCAN_predicted_CDS_4|1128_bp atgaacggcgaggagcagtactacgcggccacgcagctttacaaggacccatgcgcgttc cagcgaggcccggcgccggagttcagcgccagcccccctgcgtgcctgtacatgggccgc cagcccccgccgccgccgccgcacccgttccctggcgccctgggcgcgctggagcagggc agccccccggacatctccccgtacgaggtgccccccctcgccgacgaccccgcggtggcg caccttcaccaccacctcccggctcagctcgcgctcccccacccgcccgccgggcccttc ccggagggagccgagccgggcgtcctggaggagcccaaccgcgtccagctgcctttccca tggatgaagtctaccaaagctcacgcgtggaaaggccagtgggcaggcggcgcctacgct gcggagccggaggagaacaagcggacgcgcacggcctacacgcgcgcacagctgctagag ctggagaaggagttcctattcaacaagtacatctcacggccgcgccgggtggagctggct gtcatgttgaacttgaccgagagacacatcaagatctggttccaaaaccgccgcatgaag tggaaaaaggaggaggacaagaagcgcggcggcgggacagctgtcgggggtggcggggtc gcggagcctgagcaggactgcgccgtgacctccggcgaggagcttctggcgctgccgccg ccgccgccccccggaggatgtggacgtaattcctgttccgaggtagaggctgtgctgaag acaagcacagtggcctggtgcgccttggaaaccaacaactattcacgagccagtatgacc ttcacatctttagaaattatgaaaacagaccgtccgagctggagccacaagccctccatt cctcttggaatcttcaaccccaaggtgtggtttgtgtctgaggaattctcaatgagcctg atttccaaaagcatggtggcctactggagacaggctggactcagctacatccgatactcc cagatctgtgcaaaagtagtgagagatgcactgaagacagaattcaaagcaaatgccaaa aagacttctggcaacagcgtaaaaattgtgaaagtaaagaaggaataa >gi568815585f:27820139_28024698|GENSCAN_predicted_peptide_5|417_aa MYVSYLLDKDVSMYPSSVRHSGGLNLAPQNFVSPPQYPDYGGYHVAAAAAAAANLDSAQS PGPSWPAAYGAPLREDWNGYAPGGAAAAANAVAHGLNGGSPAAAMGYSSPADYHPHHHPH HHPHHPAAAPSCASGLLQTLNPGPPGPAATAAAEQLSPGGQRRNLCEWMRKPAQQSLGSQ VKTRTKDKYRVVYTDHQRLELEKEFHYSRYITIRRKAELAATLGLSERQVKIWFQNRRAK ERKINKKKLQQQQQQQPPQPPPPPPQPPQPQPGPLRSVPEPLSPDPGEGGLALRFDLPTN IREPNQRAHGGALRAVGEDEQSRETARAGARGLVSRCSRPRRPVSACCRSLLGLRRCPGF PGEIRGVSREVARDAKLFKRCAQLRRGGAGNLGRKRRSHSVKKAWSTGLRPVVEDLG >gi568815585f:27820139_28024698|GENSCAN_predicted_CDS_5|1254_bp atgtacgtgagctacctcctggacaaggacgtgagcatgtaccctagctccgtgcgccac tctggcggcctcaacctggcgccgcagaacttcgtcagccccccgcagtacccggactac ggcggttaccacgtggcggccgcagctgcagcggcagcgaacttggacagcgcgcagtcc ccggggccatcctggccggcagcgtatggcgccccactccgggaggactggaatggctac gcgcccggaggcgccgcggccgccgccaacgccgtggctcacggcctcaacggtggctcc ccggccgcagccatgggctacagcagccccgcagactaccatccgcaccaccacccgcat caccacccgcaccacccggccgccgcgccttcctgcgcttctgggctgctgcaaacgctc aaccccggccctcctgggcccgccgccaccgctgccgccgagcagctgtctcccggcggc cagcggcggaacctgtgcgagtggatgcggaagccggcgcagcagtccctcggcagccaa gtgaaaaccaggacgaaagacaaatatcgagtggtgtacacggaccaccagcggctggag ctggagaaggagtttcactacagtcgctacatcaccatccggaggaaagccgagctagcc gccacgctggggctctctgagaggcaggttaaaatctggtttcagaaccgcagagcaaag gagaggaaaatcaacaagaagaagttgcagcagcaacagcagcagcagccaccacagccg cctccgccgccaccacagcctccccagcctcagccaggtcctctgagaagtgtcccagag cccttgagtccggacccgggcgaggggggcttagcccttcgtttcgatcttcccaccaac atccgagagcctaatcagcgcgcccacggaggcgccttaagggcagttggggaagatgag cagagccgggaaacagcaagagcgggcgcccgggggctcgtgtcccgctgctctcgccca agacggccggtctcggcctgctgccggtccttgctgggtctgcgccgctgcccgggattc cctggagagattcgcggcgtctcccgagaggtggcgcgcgacgccaagcttttcaaaagg tgcgcgcaactacggcgcggaggtgcggggaacctgggccgcaagcgccgaagccactcg gtgaagaaggcctggagcactggccttcgacccgtcgtcgaagacctgggctga >gi568815585f:27820139_28024698|GENSCAN_predicted_peptide_6|123_aa MIAAQARVHAAGAPLPPRAHQRPESGKRLCQAKADRNREHVAICRRELPCQAATGDLPAR ATASAQSSSRRGESEPPAGLVGALPSPASTPRVLRSPLRLLMLRCFGRGAAQPALCPFSS SSF >gi568815585f:27820139_28024698|GENSCAN_predicted_CDS_6|372_bp atgattgccgcgcaagctagggtccacgcggccggggctcctctcccgcctcgggcgcac cagaggccagaaagcggaaagcggctctgccaggccaaggccgacagaaaccgagagcat gtcgctatttgccggagagaactgccctgccaggctgccacaggtgacttgcccgcgcga gccacggcctctgcgcagagctcctcccgaaggggagagtcagagccacccgcgggtttg gtcggggctcttcccagcccggcgtccacgccccgcgtcctgcgctcgcccttgcggctg cttatgctcaggtgtttcggaagaggcgccgcgcagccagctctctgtcccttcagctcc agctccttctaa >gi568815585f:27820139_28024698|GENSCAN_predicted_peptide_7|265_aa MDIEKVNSMDLGEFVDVFGNATERCPLIAAAVWSQRPFSDLEDLEKHFFAFIDALAQSAW ADPPDSMQRWGGKSPDWAGDLAAALKLRTPEEEARKVQPPKTRTFPSSAARAGYRPGPGP RRLCKDFLQVNAVLWGLWDLQSEKGNKSRAGQEGILRCHPDLAGSELQRGTLTAESQREQ SGAGLRSLGADERLRLAELNAQYRARFGFPFVLAARFSDRTAVPRELARRLLCPSAQELR TALGEVKKIGSLRLADLLRADPAKL >gi568815585f:27820139_28024698|GENSCAN_predicted_CDS_7|798_bp atggacattgagaaggtcaactccatggaccttggagaattcgtggatgtgtttgggaat gccactgagagatgtcctctgattgcagctgctgtttggtcccagcggccattctctgat ttggaagatttagagaagcacttttttgcctttattgatgcccttgcacagtcagcttgg gctgatcctccagacagcatgcaacggtggggagggaagtcccctgactgggcgggggac ctagcggctgctctgaaactccgaacacctgaagaggaggcgcggaaggtccagccgccc aagactcgcactttcccctcctccgcagcccgggcaggttaccgtcctgggcctgggcct aggagattatgcaaagatttccttcaagtaaacgctgttctctggggcctctgggatcta cagtcggagaaggggaataagtcccgggccggccaggagggcatcctgcgctgccacccg gacctggcgggcagcgagctgcagcggggcacgctcacggccgagtcgcagcgggaacag agcggcgcaggcctgaggagcctgggcgcggacgagcggctgcggctggccgagctcaac gcgcagtaccgcgcgcgcttcggtttccccttcgtgctcgccgcgcgcttcagcgaccgg acggcggtgccgcgcgagctggcgcgccggctgctctgcccgtccgcgcaggagctgcgc actgctctgggcgaggtgaagaagatcggcagcctgcgcctggccgacctcctccgcgca gaccccgccaagctgtag >gi568815585f:27820139_28024698|GENSCAN_predicted_peptide_8|169_aa XEIEYENQKRLEEEEDLNVLTFEDLLCFAYQVAKGMEFLEFKSCVHRDLAARNVLVTHGK VVKICDFGLARDIMSDSNYVVRGNARLPVKWMAPESLFEGIYTIKSDVWSYGILLWEIFS LGVNPYPGIPVDANFYKLIQNGFKMDQPFYATEEMSSPVSEPMSDGFLH >gi568815585f:27820139_28024698|GENSCAN_predicted_CDS_8|510_bp natgaaattgaatatgaaaaccaaaaaaggctggaagaagaggaggacttgaatgtgctt acatttgaagatcttctttgctttgcatatcaagttgccaaaggaatggaatttctggaa tttaagtcgtgtgttcacagagacctggccgccaggaacgtgcttgtcacccacgggaaa gtggtgaagatatgtgactttggattggctcgagatatcatgagtgattccaactatgtt gtcaggggcaatgcccgtctgcctgtaaaatggatggcccccgaaagcctgtttgaaggc atctacaccattaagagtgatgtctggtcatatggaatattactgtgggaaatcttctca cttggtgtgaatccttaccctggcattccggttgatgctaacttctacaaactgattcaa aatggatttaaaatggatcagccattttatgctacagaagaaatgtcttcgccagtttcc gagcccatgagtgacggcttcctgcactga