GENSCAN 1.0 Date run: 5-Nov-116 Time: 09:09:32 Sequence gi568815591f:22627439_22831570 : 204132 bp : 41.03% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 593 588 6 1.05 1.04 Term - 19376 18948 429 2 0 68 35 349 0.306 21.92 1.03 Intr - 27392 27249 144 2 0 72 59 123 0.067 7.36 1.02 Intr - 35422 35334 89 2 2 47 95 70 0.071 2.37 1.01 Init - 41847 41721 127 0 1 47 99 186 0.961 15.97 1.00 Prom - 43422 43383 40 -5.95 2.00 Prom + 45332 45371 40 -9.15 2.01 Init + 45719 45810 92 1 2 70 81 72 0.293 4.71 2.02 Intr + 45894 45943 50 0 2 46 77 36 0.228 -4.29 2.03 Intr + 53306 53448 143 0 2 38 63 118 0.578 3.45 2.04 Term + 57489 57635 147 2 0 54 41 193 0.819 8.12 2.05 PlyA + 58305 58310 6 1.05 3.03 PlyA - 58344 58339 6 1.05 3.02 Term - 62992 62895 98 2 2 95 39 78 0.529 0.75 3.01 Init - 67742 67538 205 2 1 69 62 166 0.792 11.16 3.00 Prom - 68515 68476 40 -8.15 4.00 Prom + 69278 69317 40 -8.25 4.01 Init + 72820 72904 85 0 1 51 99 89 0.332 7.33 4.02 Intr + 76261 76496 236 2 2 75 94 10 0.200 -3.22 4.03 Intr + 76607 77003 397 1 1 83 90 118 0.463 4.73 4.04 Term + 78067 78281 215 2 2 58 47 167 0.414 6.01 4.05 PlyA + 81080 81085 6 1.05 5.00 Prom + 85904 85943 40 -2.55 5.01 Init + 86991 87085 95 2 2 34 100 89 0.154 4.60 5.02 Intr + 98905 99230 326 1 2 29 27 204 0.454 2.89 5.03 Intr + 100006 100196 191 2 2 93 90 216 0.707 20.68 5.04 Intr + 101255 101368 114 1 0 85 66 105 0.896 7.72 5.05 Intr + 102076 102222 147 0 0 102 98 112 0.932 13.11 5.06 Term + 103968 104135 168 2 0 94 39 201 0.998 12.70 5.07 PlyA + 104458 104463 6 1.05 6.00 Prom + 106395 106434 40 -6.55 6.01 Sngl + 113739 114143 405 2 0 91 38 147 0.746 6.03 6.02 PlyA + 114469 114474 6 1.05 7.03 PlyA - 114514 114509 6 1.05 7.02 Term - 139383 139254 130 2 1 -23 37 186 0.074 -0.93 7.01 Init - 146555 146212 344 2 2 75 75 362 0.148 30.35 7.00 Prom - 161724 161685 40 -5.25 8.03 PlyA - 162127 162122 6 1.05 8.02 Term - 168625 168177 449 2 2 -17 47 260 0.481 5.79 8.01 Init - 169109 169010 100 2 1 100 113 49 0.404 9.08 8.00 Prom - 178211 178172 40 -4.55 9.02 PlyA - 183471 183466 6 1.05 9.01 Sngl - 195341 195117 225 2 0 104 54 213 0.300 14.39 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 146555 146208 348 2 0 75 49 365 0.841 27.09 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:22627439_22831570|GENSCAN_predicted_peptide_1|262_aa MGLKDANVLWSLRAGGGEDGSFVRCGPHQLHSPFGPLMTAVAAHWVGFTIAILQVKTVKL GEVKELPPDHIPAWTPSSQALLTDLGYCCLVQGQTPDKSFPVLGTWSCDCERRFFSIVLA SPCHWGTPQSFLGPISAAALNTHTPEPSAAATACRPYQTWHQEGSPRLGLLSLEEKELTG RSQKSLPPRIPTAFAAAAITSDICSLGCLGLLELSPVLTSMIELHEEYTTAHSLKPELLY PTQLAFPHLPGGESISPTEANL >gi568815591f:22627439_22831570|GENSCAN_predicted_CDS_1|789_bp atgggcttaaaagacgccaatgttctatggtccctgagagctggtggaggtgaagatgga agctttgtgcgctgtgggccccatcaactccactcgccttttgggcccttaatgacagct gttgcagcacattgggtaggctttactattgccattttacaagtgaaaacagtgaagctt ggagaagttaaggaacttcccccagatcacattcctgcatggactcctagcagtcaggca ttgctgacagatcttggttactgctgcttggtgcagggacagacacctgacaaatcattt cctgtattgggaacttggagttgtgactgtgagagacggttcttctccatagtcttggct agcccatgccactggggaacaccccagagttttctggggcccatctctgctgctgctctg aatactcataccccagaacccagtgctgctgcaactgcctgcaggccataccagacctgg caccaagaaggatcccctcggttaggacttctttccctagaggaaaaagagctaacaggg agatcccagaagtctttgccgccaaggatcccaacagcctttgctgctgctgccataact tcagatatctgtagccttggctgtctaggactcctagagttgtccccagtgctgacctca atgatagagctgcatgaagagtacaccacagcacactctctgaaaccagagctgctgtac cccacccaactggcgttcccacaccttcctggaggtgaaagtatttcccccactgaggct aatctgtaa >gi568815591f:22627439_22831570|GENSCAN_predicted_peptide_2|143_aa MAGLFFEGILPDRRSIGTAVDLRTVTQTIKWRLERFLSEESDQSMKKGVFQKHTQIPGKT IPTLLTVFLTNTLAPVPEAPAPPVVIAPPINQKEKGVIQKASSSPSSPSSSSHLLSCTEC SSKFPSDPDENENLHMPREEAEA >gi568815591f:22627439_22831570|GENSCAN_predicted_CDS_2|432_bp atggcaggtttattctttgaaggaatccttcctgaccggaggtcaattggcacagcggtt gatttgagaacagtaacacaaaccataaaatggagattagaaaggtttctttctgaggaa tctgaccagtccatgaaaaaaggagtcttccaaaaacacacccaaattcctggaaaaacc atccccaccctattaacagtcttcctcactaacaccctggccccagtacctgaagcccca gccccgccagttgtcatagctccaccaattaatcagaaagagaagggtgtgattcaaaaa gcatcatcatcaccatcatcaccatcatcatcatcacacctgctttcctgcacagaatgt tcctcaaagtttccaagtgacccagatgaaaacgagaacctgcacatgcccagagaggaa gcagaagcttaa >gi568815591f:22627439_22831570|GENSCAN_predicted_peptide_3|100_aa MEYRKQKIHRCGEGKGNLLESGEGRSQDDDYTLIEEGSQYRLEHRDSGGGLSTVIAKSSA IILVFLKTAVFRSSSCFQDRVWAVEDIPGRLRMHAGVSSP >gi568815591f:22627439_22831570|GENSCAN_predicted_CDS_3|303_bp atggaatacagaaagcagaagatccatcgctgtggagaagggaaagggaatctcctggag agtggtgaagggagatcccaggatgacgattacacattaattgaagagggcagtcagtac agattggaacatcgtgactcaggaggtggcttgagtactgtcattgccaaatcttctgcc attatacttgtctttttaaaaactgcagtcttcaggagcagcagctgcttccaagacagg gtttgggcagttgaagatattcctggtcggctgaggatgcatgcaggtgttagcagtcct tag >gi568815591f:22627439_22831570|GENSCAN_predicted_peptide_4|310_aa MESMTARFLLGLKEEDSSAGKSWEGGQAGEINSLVAHTKPVWWSLHTDAHEIWCHDSNRG TSLGGWGISLGRSIPCPPALCSMKKIHLRPWVLRPTSPRNISLILNWFLSFSGRDRRRIL SVDPKLWCLSRTREDSLPLMFITRGRLPDYSPTFQRCPTTQGRLPWSFTLSSKYRFSGGQ EPPNPFSFTLSGKCRFSRGQEPPDPLFPRPDPLSPHPDPLSLCPNPLFPHPDPFPTFLEG ACYKCWKSGHCAKECPQPRIPPKPCPICAGPPRKSDCSTHLAATSRAPGTLAQGSLTDSF PDLLGLAAED >gi568815591f:22627439_22831570|GENSCAN_predicted_CDS_4|933_bp atggaatccatgactgccaggtttttactggggcttaaggaggaagacagtagcgctggc aaaagctgggaaggaggacaggcaggtgaaataaacagccttgttgctcacacaaagcct gtttggtggtctcttcacacggacgcacatgaaatttggtgccatgactcgaatcggggg acctccttgggaggttgggggatctcccttgggagatcaatcccctgtcctcctgctctt tgctccatgaaaaagatccacctacgaccttgggtcctcagacccaccagcccaaggaac atctcactaattttaaattggttcctttccttttctggtagagacaggagacgcatttta tctgtggacccaaaactctggtgcctgtcacggactcgggaagacagtcttcccttgatg tttatcacgcgaggacgcctgcctgattattcacccacgtttcagaggtgtccaaccacg cagggacgactgccttggtccttcacccttagcagcaagtaccgcttttctggggggcaa gaaccccccaaccccttctccttcaccctgagtggcaagtgccgcttttctagggggcaa gaaccccccgatcccttatttccacgccctgaccccttatctccacaccctgacccctta tctctgtgccccaaccctttatttccacaccccgacccctttcccacttttctggaggga gcttgctacaagtgctggaaatctggccactgcgccaaggaatgcccacagcccaggatt cctcctaagccgtgtcccatctgtgcgggacccccaagaaaatcggactgttcaactcac ctggcagccacttccagagcccctggaactctggcccaaggctctctgactgactccttc ccagatcttctcggcttagcagctgaagactga >gi568815591f:22627439_22831570|GENSCAN_predicted_peptide_5|346_aa MQVDQPLGCPECKRELRVGDINLAVICVKEKRCPSETVVKRLSGNGESTGSTRQTSGTES KVLTGRIPKGSLGRGQGSSQPPLSGLKQVKKVAEATRWQKGVTHSTWRRLEVTARNLRMA RQFYNSRSQGEPEHRRTQMTGAFGPVAFSLGLLLVLPAAFPAPVPPGEDSKDVAAPHRQP LTSSERIDKQIRYILDGISALRKETCNKSNMCESSKEALAENNLNLPKMAEKDGCFQSGF NEETCLVKIITGLLEFEVYLEYLQNRFESSEEQARAVQMSTKVLIQFLQKKAKNLDAITT PDPTTNASLLTKLQAQNQWLQDMTTHLILRSFKEFLQSSLRALRQM >gi568815591f:22627439_22831570|GENSCAN_predicted_CDS_5|1041_bp atgcaagtggatcagccactgggctgtccagaatgcaagagagaactcagagttggggac ataaacttggcagtcatctgtgtaaaagagaaaaggtgccccagtgaaacagtggtgaag agactcagtggcaatggggagagcactggcagcacaaggcaaacctctggcacagagagc aaagtcctcactgggaggattcccaaggggtcacttgggagagggcagggcagcagccaa cctcctctaagtgggctgaagcaggtgaagaaagtggcagaagccacgcggtggcaaaaa ggagtcacacactccacctggagacgccttgaagtaactgcacgaaatttgaggatggcc aggcagttctacaacagccgctcacagggagagccagaacacagaagaactcagatgact ggcgccttcggtccagttgccttctccctggggctgctcctggtgttgcctgctgccttc cctgccccagtacccccaggagaagattccaaagatgtagccgccccacacagacagcca ctcacctcttcagaacgaattgacaaacaaattcggtacatcctcgacggcatctcagcc ctgagaaaggagacatgtaacaagagtaacatgtgtgaaagcagcaaagaggcactggca gaaaacaacctgaaccttccaaagatggctgaaaaagatggatgcttccaatctggattc aatgaggagacttgcctggtgaaaatcatcactggtcttttggagtttgaggtataccta gagtacctccagaacagatttgagagtagtgaggaacaagccagagctgtgcagatgagt acaaaagtcctgatccagttcctgcagaaaaaggcaaagaatctagatgcaataaccacc cctgacccaaccacaaatgccagcctgctgacgaagctgcaggcacagaaccagtggctg caggacatgacaactcatctcattctgcgcagctttaaggagttcctgcagtccagcctg agggctcttcggcaaatgtag >gi568815591f:22627439_22831570|GENSCAN_predicted_peptide_6|134_aa MDKWDHVKLKSFRVAKDTINKVKRQPTEWEKISANYPSDKELVIRIYKELKQPCRKKKSN KSIKIWAKDLNRHFSKEGIQMANRYMKRCSTSLIIREIVSPQSKLLICKRQTITNAGENV EKRELLYAVGGNVN >gi568815591f:22627439_22831570|GENSCAN_predicted_CDS_6|405_bp atggacaaatgggatcacgtcaagttaaaaagcttccgcgtggcaaaggatacaatcaac aaagtgaagagacaacccacagaatgggagaaaatatctgccaactacccatctgacaag gaattagtaataagaatatacaaggagctcaaacaaccctgtaggaaaaaaaaatctaat aaatcgatcaaaatatgggcaaaagatttgaatagacatttctcaaaagaaggcatacaa atggcaaacaggtatatgaaaaggtgctcaacatcactgatcatcagagaaattgtctca ccccagtcaaaactgcttatatgcaaaagacagacaataacaaatgctggcgagaatgta gagaaaagggaactgttgtatgctgttggtgggaatgtaaattag >gi568815591f:22627439_22831570|GENSCAN_predicted_peptide_7|157_aa MTKKRRNNVRAKKGRGHVQPIRCTNCARCVPKDNAIKKFVIRNIVEAAAVRDISEASVFD AYVLPKLCVKLHYCVSCVIHSKVVRNRSREARKDRTPPPRFRPAGAAARPPPKPITLTTP NADKDVKQQELSFIVGGNAKFGGQLVISYKTKHTFTI >gi568815591f:22627439_22831570|GENSCAN_predicted_CDS_7|474_bp atgacaaagaaaagaaggaacaacgttcgtgccaaaaagggccgcggccacgtgcagcct attcgctgcactaactgtgcccgatgcgtgcccaaggacaacgccattaagaaattcgtc attcgaaacatagtggaggctgcagcagtcagggacatttctgaagcgagcgtcttcgat gcctatgtgcttcccaagctgtgtgtgaagctacattactgtgtgagttgtgtaattcac agcaaagtagtcaggaatcgatctcgtgaagcccgcaaggaccgaacacccccaccccga tttagacctgcgggtgctgccgcacgtcccccaccaaagcccataacactgacaacacca aatgctgacaaggatgtgaagcaacaagaactctccttcattgttggtggaaatgcaaaa tttggaggacagttggtgatttcttacaaaactaaacatacttttaccatatga >gi568815591f:22627439_22831570|GENSCAN_predicted_peptide_8|182_aa MAERGQHRAQAMVSEGGSLKRWQLSCDVVPVSAQDLRECLDAQAEVCCRDGFLIWTASAR AVQKGNVGWDPPHRVPTGAPPSRAVRRRPPSSRPQNDRSTGSLYHMPGKATDAQCQSMKA AGREAVSCKATEAELPETMGTYLLHQRDPDVRHGVKGDHFGPLRFDCPAGFRTCVGPVAP LF >gi568815591f:22627439_22831570|GENSCAN_predicted_CDS_8|549_bp atggctgaaaggggccaacatagagctcaggccatggtttcagagggtggaagcctcaag cgttggcagctttcatgtgatgttgtgcctgtgagtgcacaagatctacgggaatgcctg gatgcccaggcagaagtttgttgtagggatgggttcctcatatggacagcctctgctagg gcagtgcagaagggaaatgtggggtgggatcccccacacagagtccctactggggcacca cctagtcgggctgtgagaagaaggccaccatcctccagaccccagaatgatagatccact ggcagcttgtaccacatgcctggaaaagccacagatgctcaatgccagtccatgaaagca gctgggagggaggctgtatcctgcaaagccacagaggcagagctacccgagaccatggga acctacctcttgcatcagcgtgacccggatgtgagacatggagtcaaaggagatcatttt ggacctttaagatttgactgccctgctggatttcggacttgtgtggggcctgtagcccct ttgttttga >gi568815591f:22627439_22831570|GENSCAN_predicted_peptide_9|74_aa MVKLSKEAKQRLQQLFKGSQFAIRWGFIPLVIYLGQWEKNRESDWREEGGGDSSPIFGRR GRERKIGDTAVHGP >gi568815591f:22627439_22831570|GENSCAN_predicted_CDS_9|225_bp atggtgaagctgagcaaagaggccaagcagagactacagcagctcttcaaggggagccag tttgccattcgctggggctttatccctcttgtgatttacctgggtcagtgggagaagaac cgggaaagtgactggagggaagagggtggcggagacagcagccccatttttggccgtcgg gggagggagagaaaaataggggacactgccgtgcacggcccctga