GENSCAN 1.0	Date run: 7-Nov-116	Time: 21:22:14

Sequence gi568815591r:22673649_22873993 : 200345 bp : 41.76% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   7096   7238  143  2  2   38   63   118 0.717   3.45
 1.02 Term +  11279  11425  147  1  0   54   41   193 0.889   8.12
 1.03 PlyA +  12095  12100    6                               1.05

 2.03 PlyA -  12134  12129    6                               1.05
 2.02 Term -  16782  16685   98  1  2   95   39    78 0.531   0.75
 2.01 Init -  21532  21328  205  1  1   69   62   166 0.792  11.16
 2.00 Prom -  22305  22266   40                              -8.15

 3.00 Prom +  23068  23107   40                              -8.25
 3.01 Init +  26610  26694   85  2  1   51   99    89 0.332   7.33
 3.02 Intr +  30051  30286  236  1  2   75   94    10 0.200  -3.22
 3.03 Intr +  30397  30793  397  0  1   83   90   118 0.463   4.73
 3.04 Term +  31857  32071  215  1  2   58   47   167 0.414   6.01
 3.05 PlyA +  34870  34875    6                               1.05

 4.00 Prom +  39694  39733   40                              -2.55
 4.01 Init +  40781  40875   95  1  2   34  100    89 0.154   4.60
 4.02 Intr +  52695  53020  326  0  2   29   27   204 0.454   2.89
 4.03 Intr +  53796  53986  191  1  2   93   90   216 0.707  20.68
 4.04 Intr +  55045  55158  114  0  0   85   66   105 0.896   7.72
 4.05 Intr +  55866  56012  147  2  0  102   98   112 0.932  13.11
 4.06 Term +  57758  57925  168  1  0   94   39   201 0.998  12.70
 4.07 PlyA +  58248  58253    6                               1.05

 5.00 Prom +  60185  60224   40                              -6.55
 5.01 Sngl +  67529  67933  405  1  0   91   38   147 0.746   6.03
 5.02 PlyA +  68259  68264    6                               1.05

 6.03 PlyA -  68304  68299    6                               1.05
 6.02 Term -  93173  93044  130  1  1  -23   37   186 0.074  -0.93
 6.01 Init - 100345 100002  344  1  2   75   75   362 0.148  30.35
 6.00 Prom - 115514 115475   40                              -5.25

 7.03 PlyA - 115917 115912    6                               1.05
 7.02 Term - 122415 121967  449  1  2  -17   47   260 0.481   5.79
 7.01 Init - 122899 122800  100  1  1  100  113    49 0.404   9.08
 7.00 Prom - 132001 131962   40                              -4.55

 8.00 Prom + 137608 137647   40                              -2.65
 8.01 Init + 141695 141771   77  1  2   67   53    21 0.175  -2.88
 8.02 Intr + 146289 146411  123  0  0   63   64   113 0.311   5.18
 8.03 Intr + 148452 148555  104  0  2   72  110    55 0.175   5.00
 8.04 Intr + 149001 149032   32  1  2   80   99    11 0.168  -1.67
 8.05 Intr + 149123 149203   81  1  0   53   69   123 0.159   5.82
 8.06 Term + 159887 160042  156  1  0  120   55    71 0.806   3.95
 8.07 PlyA + 160575 160580    6                               1.05

 9.05 PlyA - 161096 161091    6                               1.05
 9.04 Term - 171062 170892  171  2  0   27   54   118 0.245  -0.86
 9.03 Intr - 172998 172897  102  0  0   94   92    57 0.506   6.15
 9.02 Intr - 180205 180012  194  2  2   60   46   130 0.254   4.29
 9.01 Init - 191708 191438  271  2  1   68   14   254 0.715  13.18

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl - 100345  99998  348  1  0   75   49   365 0.841  27.09

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:22673649_22873993|GENSCAN_predicted_peptide_1|96_aa
XVFQKHTQIPGKTIPTLLTVFLTNTLAPVPEAPAPPVVIAPPINQKEKGVIQKASSSPSS
PSSSSHLLSCTECSSKFPSDPDENENLHMPREEAEA

>gi568815591r:22673649_22873993|GENSCAN_predicted_CDS_1|291_bp
ngagtcttccaaaaacacacccaaattcctggaaaaaccatccccaccctattaacagtc
ttcctcactaacaccctggccccagtacctgaagccccagccccgccagttgtcatagct
ccaccaattaatcagaaagagaagggtgtgattcaaaaagcatcatcatcaccatcatca
ccatcatcatcatcacacctgctttcctgcacagaatgttcctcaaagtttccaagtgac
ccagatgaaaacgagaacctgcacatgcccagagaggaagcagaagcttaa

>gi568815591r:22673649_22873993|GENSCAN_predicted_peptide_2|100_aa
MEYRKQKIHRCGEGKGNLLESGEGRSQDDDYTLIEEGSQYRLEHRDSGGGLSTVIAKSSA
IILVFLKTAVFRSSSCFQDRVWAVEDIPGRLRMHAGVSSP

>gi568815591r:22673649_22873993|GENSCAN_predicted_CDS_2|303_bp
atggaatacagaaagcagaagatccatcgctgtggagaagggaaagggaatctcctggag
agtggtgaagggagatcccaggatgacgattacacattaattgaagagggcagtcagtac
agattggaacatcgtgactcaggaggtggcttgagtactgtcattgccaaatcttctgcc
attatacttgtctttttaaaaactgcagtcttcaggagcagcagctgcttccaagacagg
gtttgggcagttgaagatattcctggtcggctgaggatgcatgcaggtgttagcagtcct
tag

>gi568815591r:22673649_22873993|GENSCAN_predicted_peptide_3|310_aa
MESMTARFLLGLKEEDSSAGKSWEGGQAGEINSLVAHTKPVWWSLHTDAHEIWCHDSNRG
TSLGGWGISLGRSIPCPPALCSMKKIHLRPWVLRPTSPRNISLILNWFLSFSGRDRRRIL
SVDPKLWCLSRTREDSLPLMFITRGRLPDYSPTFQRCPTTQGRLPWSFTLSSKYRFSGGQ
EPPNPFSFTLSGKCRFSRGQEPPDPLFPRPDPLSPHPDPLSLCPNPLFPHPDPFPTFLEG
ACYKCWKSGHCAKECPQPRIPPKPCPICAGPPRKSDCSTHLAATSRAPGTLAQGSLTDSF
PDLLGLAAED

>gi568815591r:22673649_22873993|GENSCAN_predicted_CDS_3|933_bp
atggaatccatgactgccaggtttttactggggcttaaggaggaagacagtagcgctggc
aaaagctgggaaggaggacaggcaggtgaaataaacagccttgttgctcacacaaagcct
gtttggtggtctcttcacacggacgcacatgaaatttggtgccatgactcgaatcggggg
acctccttgggaggttgggggatctcccttgggagatcaatcccctgtcctcctgctctt
tgctccatgaaaaagatccacctacgaccttgggtcctcagacccaccagcccaaggaac
atctcactaattttaaattggttcctttccttttctggtagagacaggagacgcatttta
tctgtggacccaaaactctggtgcctgtcacggactcgggaagacagtcttcccttgatg
tttatcacgcgaggacgcctgcctgattattcacccacgtttcagaggtgtccaaccacg
cagggacgactgccttggtccttcacccttagcagcaagtaccgcttttctggggggcaa
gaaccccccaaccccttctccttcaccctgagtggcaagtgccgcttttctagggggcaa
gaaccccccgatcccttatttccacgccctgaccccttatctccacaccctgacccctta
tctctgtgccccaaccctttatttccacaccccgacccctttcccacttttctggaggga
gcttgctacaagtgctggaaatctggccactgcgccaaggaatgcccacagcccaggatt
cctcctaagccgtgtcccatctgtgcgggacccccaagaaaatcggactgttcaactcac
ctggcagccacttccagagcccctggaactctggcccaaggctctctgactgactccttc
ccagatcttctcggcttagcagctgaagactga

>gi568815591r:22673649_22873993|GENSCAN_predicted_peptide_4|346_aa
MQVDQPLGCPECKRELRVGDINLAVICVKEKRCPSETVVKRLSGNGESTGSTRQTSGTES
KVLTGRIPKGSLGRGQGSSQPPLSGLKQVKKVAEATRWQKGVTHSTWRRLEVTARNLRMA
RQFYNSRSQGEPEHRRTQMTGAFGPVAFSLGLLLVLPAAFPAPVPPGEDSKDVAAPHRQP
LTSSERIDKQIRYILDGISALRKETCNKSNMCESSKEALAENNLNLPKMAEKDGCFQSGF
NEETCLVKIITGLLEFEVYLEYLQNRFESSEEQARAVQMSTKVLIQFLQKKAKNLDAITT
PDPTTNASLLTKLQAQNQWLQDMTTHLILRSFKEFLQSSLRALRQM

>gi568815591r:22673649_22873993|GENSCAN_predicted_CDS_4|1041_bp
atgcaagtggatcagccactgggctgtccagaatgcaagagagaactcagagttggggac
ataaacttggcagtcatctgtgtaaaagagaaaaggtgccccagtgaaacagtggtgaag
agactcagtggcaatggggagagcactggcagcacaaggcaaacctctggcacagagagc
aaagtcctcactgggaggattcccaaggggtcacttgggagagggcagggcagcagccaa
cctcctctaagtgggctgaagcaggtgaagaaagtggcagaagccacgcggtggcaaaaa
ggagtcacacactccacctggagacgccttgaagtaactgcacgaaatttgaggatggcc
aggcagttctacaacagccgctcacagggagagccagaacacagaagaactcagatgact
ggcgccttcggtccagttgccttctccctggggctgctcctggtgttgcctgctgccttc
cctgccccagtacccccaggagaagattccaaagatgtagccgccccacacagacagcca
ctcacctcttcagaacgaattgacaaacaaattcggtacatcctcgacggcatctcagcc
ctgagaaaggagacatgtaacaagagtaacatgtgtgaaagcagcaaagaggcactggca
gaaaacaacctgaaccttccaaagatggctgaaaaagatggatgcttccaatctggattc
aatgaggagacttgcctggtgaaaatcatcactggtcttttggagtttgaggtataccta
gagtacctccagaacagatttgagagtagtgaggaacaagccagagctgtgcagatgagt
acaaaagtcctgatccagttcctgcagaaaaaggcaaagaatctagatgcaataaccacc
cctgacccaaccacaaatgccagcctgctgacgaagctgcaggcacagaaccagtggctg
caggacatgacaactcatctcattctgcgcagctttaaggagttcctgcagtccagcctg
agggctcttcggcaaatgtag

>gi568815591r:22673649_22873993|GENSCAN_predicted_peptide_5|134_aa
MDKWDHVKLKSFRVAKDTINKVKRQPTEWEKISANYPSDKELVIRIYKELKQPCRKKKSN
KSIKIWAKDLNRHFSKEGIQMANRYMKRCSTSLIIREIVSPQSKLLICKRQTITNAGENV
EKRELLYAVGGNVN

>gi568815591r:22673649_22873993|GENSCAN_predicted_CDS_5|405_bp
atggacaaatgggatcacgtcaagttaaaaagcttccgcgtggcaaaggatacaatcaac
aaagtgaagagacaacccacagaatgggagaaaatatctgccaactacccatctgacaag
gaattagtaataagaatatacaaggagctcaaacaaccctgtaggaaaaaaaaatctaat
aaatcgatcaaaatatgggcaaaagatttgaatagacatttctcaaaagaaggcatacaa
atggcaaacaggtatatgaaaaggtgctcaacatcactgatcatcagagaaattgtctca
ccccagtcaaaactgcttatatgcaaaagacagacaataacaaatgctggcgagaatgta
gagaaaagggaactgttgtatgctgttggtgggaatgtaaattag

>gi568815591r:22673649_22873993|GENSCAN_predicted_peptide_6|157_aa
MTKKRRNNVRAKKGRGHVQPIRCTNCARCVPKDNAIKKFVIRNIVEAAAVRDISEASVFD
AYVLPKLCVKLHYCVSCVIHSKVVRNRSREARKDRTPPPRFRPAGAAARPPPKPITLTTP
NADKDVKQQELSFIVGGNAKFGGQLVISYKTKHTFTI

>gi568815591r:22673649_22873993|GENSCAN_predicted_CDS_6|474_bp
atgacaaagaaaagaaggaacaacgttcgtgccaaaaagggccgcggccacgtgcagcct
attcgctgcactaactgtgcccgatgcgtgcccaaggacaacgccattaagaaattcgtc
attcgaaacatagtggaggctgcagcagtcagggacatttctgaagcgagcgtcttcgat
gcctatgtgcttcccaagctgtgtgtgaagctacattactgtgtgagttgtgtaattcac
agcaaagtagtcaggaatcgatctcgtgaagcccgcaaggaccgaacacccccaccccga
tttagacctgcgggtgctgccgcacgtcccccaccaaagcccataacactgacaacacca
aatgctgacaaggatgtgaagcaacaagaactctccttcattgttggtggaaatgcaaaa
tttggaggacagttggtgatttcttacaaaactaaacatacttttaccatatga

>gi568815591r:22673649_22873993|GENSCAN_predicted_peptide_7|182_aa
MAERGQHRAQAMVSEGGSLKRWQLSCDVVPVSAQDLRECLDAQAEVCCRDGFLIWTASAR
AVQKGNVGWDPPHRVPTGAPPSRAVRRRPPSSRPQNDRSTGSLYHMPGKATDAQCQSMKA
AGREAVSCKATEAELPETMGTYLLHQRDPDVRHGVKGDHFGPLRFDCPAGFRTCVGPVAP
LF

>gi568815591r:22673649_22873993|GENSCAN_predicted_CDS_7|549_bp
atggctgaaaggggccaacatagagctcaggccatggtttcagagggtggaagcctcaag
cgttggcagctttcatgtgatgttgtgcctgtgagtgcacaagatctacgggaatgcctg
gatgcccaggcagaagtttgttgtagggatgggttcctcatatggacagcctctgctagg
gcagtgcagaagggaaatgtggggtgggatcccccacacagagtccctactggggcacca
cctagtcgggctgtgagaagaaggccaccatcctccagaccccagaatgatagatccact
ggcagcttgtaccacatgcctggaaaagccacagatgctcaatgccagtccatgaaagca
gctgggagggaggctgtatcctgcaaagccacagaggcagagctacccgagaccatggga
acctacctcttgcatcagcgtgacccggatgtgagacatggagtcaaaggagatcatttt
ggacctttaagatttgactgccctgctggatttcggacttgtgtggggcctgtagcccct
ttgttttga

>gi568815591r:22673649_22873993|GENSCAN_predicted_peptide_8|190_aa
MQVQPKICTRCNALSIPVQRTTKTASTILPEALKVMIEPKLHTKLNPQLLVLVRVEVEEV
KEQPGASQHSCATLAKKFLYGCFLTCKMGAVIEIIQTLHRTVTFPVLLPLTQLHHGDGRV
AQGGPLTATTASGIRKGKEGVFEQTNNFLLPGYSLELLSTPGKHMYEIKFLQRRDSSKSI
TGTPTKCKAL

>gi568815591r:22673649_22873993|GENSCAN_predicted_CDS_8|573_bp
atgcaggtccagcctaaaatttgtacaagatgcaatgctttgtcaattcctgtccaaagg
accaccaagacggccagcactatactccctgaagcactgaaagttatgattgaacctaaa
ttacacacaaagttaaatccgcagttgctggttttagtgagagttgaggtggaggaagta
aaagaacaacctggagccagtcagcatagctgtgcgaccttggcaaaaaagtttctctac
ggctgtttcctcacctgtaaaatgggggcagtaatagaaatcatccaaacccttcatcgt
acagtcactttcccggttcttctcccactgacccagcttcaccatggcgacggccgtgtg
gcgcagggaggaccccttacagcaaccacagcgtcgggaatccgaaagggaaaggagggt
gtttttgagcagacaaacaatttcttattgccaggatacagcctggaactcttaagcaca
ccaggcaagcatatgtatgaaataaaatttctacaaagaagagattcgtcaaaaagcatc
actgggacaccaaccaagtgtaaggctctgtga

>gi568815591r:22673649_22873993|GENSCAN_predicted_peptide_9|245_aa
MVDGGPDQDIHVEATAGWLEQPGTKRLAGALQRQTLEDKAHYLQIILQSRAKCGDQDRGN
QEAFTGPEVDLRENLEAQLAEKPIRQSEAGNSSNYVAEDWLPLKDVNISGGRGCKLIEEG
LKHGEDSWCLECCMLRMMATARKYLAPKARLCHWTIICSRTGVPNPQAADWYQSEACQEP
GSTAGGERWKRKRYQRPFSLPGTHEQRKDHVRHCEKAAAHTPGRESSPATIPAGTLILDL
ELSEL

>gi568815591r:22673649_22873993|GENSCAN_predicted_CDS_9|738_bp
atggtggatgggggacctgatcaggacatccatgtggaagccacagcagggtggctggag
cagcctggaacaaaacgcctggctggtgctctgcaaaggcagacactggaggacaaagca
cattatctccagataattcttcagtccagagctaagtgcggggatcaggacagaggcaac
caggaggcctttacagggccagaggtggatctcagggagaatctagaagctcagctggct
gaaaaaccaatcaggcagtcagaagctgggaactcgtctaattacgtagcagaagactgg
ctcccgttaaaggatgtgaacattagcggaggtcgggggtgcaaactcatcgaagagggg
ttgaagcacggggaggactcctggtgcctggagtgctgcatgcttagaatgatggccacg
gcccgcaaatatcttgcccccaaagcacgtctgtgtcattggactattatttgctctagg
acaggggtccccaacccccaggctgcagactggtatcagtccgaggcctgtcaggaaccg
ggcagcacagcaggaggcgaacgatggaagaggaagagataccagaggcccttctctctc
cccggcacacatgaacagaggaaagaccatgtgaggcactgcgagaaggcagctgcccat
actccaggaagagagtcctcaccagcaaccatccctgctggtaccttgatcttggacttg
gagctttcagaactgtga