GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:42:31 Sequence gi568815594r:121229317_121480647 : 251331 bp : 38.21% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2300 2355 56 1 2 86 67 60 0.031 4.51 1.02 Term + 12968 13121 154 2 1 8 48 234 0.887 7.81 1.03 PlyA + 14015 14020 6 1.05 2.00 Prom + 18762 18801 40 -4.95 2.01 Init + 24284 24310 27 1 0 74 81 28 0.478 0.42 2.02 Intr + 24755 24892 138 1 0 52 95 47 0.513 1.54 2.03 Intr + 25767 25990 224 2 2 26 80 170 0.419 6.00 2.04 Intr + 35102 35308 207 2 0 49 121 105 0.567 7.17 2.05 Intr + 42148 42322 175 1 1 30 22 90 0.061 -4.38 2.06 Intr + 48382 48556 175 0 1 93 77 104 0.179 8.39 2.07 Intr + 54142 54246 105 2 0 5 71 127 0.153 1.97 2.08 Intr + 54460 54771 312 2 0 46 20 255 0.054 9.73 2.09 Intr + 58809 59056 248 1 2 23 41 174 0.006 2.46 2.10 Term + 69071 69217 147 1 0 95 48 105 0.902 4.12 2.11 PlyA + 69420 69425 6 1.05 3.04 PlyA - 69892 69887 6 1.05 3.03 Term - 86762 86700 63 2 0 63 48 125 0.653 2.91 3.02 Intr - 91006 90875 132 1 0 67 49 93 0.374 3.22 3.01 Init - 93894 93829 66 0 0 45 69 29 0.215 -2.18 3.00 Prom - 97160 97121 40 -3.65 4.04 PlyA - 98973 98968 6 1.05 4.03 Term - 100398 99998 401 1 2 66 32 233 0.831 9.79 4.02 Intr - 111422 111136 287 1 2 77 66 182 0.360 10.96 4.01 Init - 116075 115987 89 2 2 28 60 157 0.751 7.06 4.00 Prom - 126550 126511 40 -3.35 5.07 PlyA - 126587 126582 6 1.05 5.06 Term - 127164 126944 221 1 2 59 42 164 0.645 5.12 5.05 Intr - 140436 140149 288 1 0 85 -29 283 0.011 12.59 5.04 Intr - 140884 140601 284 0 2 86 21 233 0.630 12.54 5.03 Intr - 142972 142876 97 2 1 65 46 61 0.278 -2.25 5.02 Intr - 151387 150992 396 2 0 93 78 620 0.110 55.13 5.01 Init - 176813 176672 142 2 1 62 48 140 0.756 7.74 5.00 Prom - 183957 183918 40 -5.15 6.02 PlyA - 183983 183978 6 1.05 6.01 Sngl - 205121 204618 504 2 0 45 42 261 0.314 12.99 6.00 Prom - 209720 209681 40 -8.35 7.00 Prom + 210008 210047 40 -3.75 7.01 Init + 210506 210667 162 1 0 38 94 177 0.343 13.09 7.02 Term + 210762 210839 78 2 0 18 48 99 0.302 -4.32 7.03 PlyA + 211019 211024 6 1.05 8.02 PlyA - 211323 211318 6 1.05 8.01 Sngl - 212867 212406 462 2 0 68 48 292 0.808 19.11 8.00 Prom - 220956 220917 40 -4.95 9.00 Prom + 221455 221494 40 -4.05 9.01 Sngl + 222411 222635 225 2 0 54 55 280 0.453 16.19 9.02 PlyA + 223363 223368 6 1.05 10.00 Prom + 223488 223527 40 -6.85 10.01 Init + 228976 229133 158 0 2 43 100 138 0.934 9.93 10.02 Term + 232049 232214 166 2 1 60 54 167 0.924 6.91 10.03 PlyA + 232463 232468 6 -0.45 11.03 PlyA - 232545 232540 6 1.05 11.02 Term - 233287 233188 100 0 1 38 48 103 0.002 -2.08 11.01 Init - 250565 250426 140 2 2 74 96 90 0.688 8.09 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 225659 225611 49 2 1 69 97 43 0.879 4.46 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:121229317_121480647|GENSCAN_predicted_peptide_1|69_aa MDVDKSEMQLFGTKVQLSLNFEEQPPSLKPSREHQQNHTGGLVSTEGHKLVKVNSAVAAE QLSGKSAGI >gi568815594r:121229317_121480647|GENSCAN_predicted_CDS_1|210_bp atggatgtagataaatcagaaatgcagctgtttgggaccaaagtccagttatcactgaat tttgaagagcagccaccctccctgaagccatctagggaacaccagcagaatcacactgga ggactggtgtccacagaaggccacaagcttgtgaaggtcaactcagctgtggcagctgag caactgtcgggcaagtctgctggaatctga >gi568815594r:121229317_121480647|GENSCAN_predicted_peptide_2|585_aa MKMQVTRFLVIKDSVASCLHSPLPSVTSLTLGKAGCHLVSSSPCSEKLSAPAHIPCCDYR YEPPCLTCVGYFYGQLMLHPAGPSEEPYVMNVRIVPLEDPRGKHFQQPHPSFVKYALSVL TLQYFQVTRGCAKVAAIETTTFEERVLLLTVPKWRGMPCHTRPHTGKHQLRSEGRNNEQK VWAKGFYCGFGGKGKVEARHTQKTMRVSDIEATITLSPGPGGGALRELSEGALLFRGVKN SNSVCLALVAPSNLVFMGFMWSRIEAAPLPNTRWCLPHGYSGPANSGGCYYITTGMSCWL SDLYPRHDGVHVGLQGKDLEHDIPVDLGSRPTSVDPVSRSAPGPQGPKAPVDSGSRTTPA NPGAGPSPVDPSSRPAPVASGFRLIMIDQNTKLIPVPGWPLRLRLKIYLSSRSASLDSGS RFAPVDTNFIPVPTDRGHSHAPEDPVDRLTSVDPKIQTIIREYDKHLYANKLENLEEMDK FLGTYTLPRLNQEETESLNRPITGSEIQAIINSLPTKKIPGPDGFTAKFYQKYKEELAKI LYTFSLLLGPIRHISYPGDDGGSDGLNLSPIFGKCELGSIMETSG >gi568815594r:121229317_121480647|GENSCAN_predicted_CDS_2|1758_bp atgaagatgcaagtaactcggttcctggtcattaaagatagtgtggcttcctgcttgcac tctcctttgccctctgttacatcgctcactctggggaaagctggctgccacctcgtgagc agcagtccatgtagtgagaaactgagtgctcctgctcacatcccttgttgcgattatagg tatgaaccaccatgcctgacatgtgttggttacttctatggtcaactgatgctccatcca gcaggaccttctgaggagccttatgtaatgaatgtcagaattgtccctctagaagatcca agggggaagcatttccagcagccccatccttcatttgtcaagtatgctctatcagtgtta actcttcagtacttccaggttacacgtggatgtgccaaggtggcagccatagagacaact acctttgaagaaagagttttattactcacagttcccaagtggaggggcatgccatgccat acaaggccacacacagggaagcaccagcttcgatcagaaggcagaaacaatgagcagaaa gtatgggcaaagggcttttattgtggttttggggggaaaggaaaagtcgaggcaagacac acgcagaaaactatgagagtgtcagacatagaagccaccatcacccttagcccaggacct ggaggaggtgcactgagggaattatcagagggcgctctcctctttaggggtgtgaaaaac agtaactctgtttgtttggctctagtagccccttctaacttggttttcatggggtttatg tggtccagaatagaggcagctcctcttccaaacaccaggtggtgtctgccacatggctac tcaggtcctgcgaactctggtggctgctactacatcactacagggatgtcctgctggctc tcagatctctatccaagacatgatggtgtccatgttggactccaaggcaaggacctggag catgacattccagtggacctagggtccaggcccacctctgtggacccagtttccaggtct gccccagggccccagggccccaaggctccagtggactcagggtccaggaccactccagca aaccctggtgctggaccatcccctgtagacccaagctcccggccagcccctgtggcttca ggctttagactcatcatgatagaccaaaacaccaagctaattccagtgccaggttggccc ctgcgactcaggctcaagatctacctcagcagtaggtcagcctctttggactcaggctcc aggtttgcccctgtggacacaaattttatacctgtccccacagaccgaggccatagccat gcccctgaagacccagttgacaggctaacctcagtggacccaaaaatacaaactatcatc agagaatacgataaacacctctatgcaaataaactagaaaatctagaagaaatggataaa ttcctgggcacatacactctcccaagactaaaccaggaggaaactgaatctctgaatagg ccaataacaggttctgagattcaggcaataattaacagcctaccaaccaaaaaaattcca ggaccagatggattcacagccaaattctaccagaagtacaaagaggagctggcaaaaatt ctctataccttcagtcttctccttggccccataagacatatttcctaccctggtgatgat ggaggctcagatggcctgaatctgagtccaatatttgggaaatgtgagctgggctccatc atggaaacatctggctga >gi568815594r:121229317_121480647|GENSCAN_predicted_peptide_3|86_aa MKWKWYIQNPTDQEGTDELHRQQSSFLKHNQKTKVLLDSDTEHSGAERVDKQKISSACVP EHMLIMENDEKTGGDSGEDESEYGQE >gi568815594r:121229317_121480647|GENSCAN_predicted_CDS_3|261_bp atgaaatggaaatggtacatccagaatcccacagatcaggagggcacagatgaactgcac agacagcagtcttcattcctcaagcacaatcaaaagacaaaggtgctccttgattcagac acggagcacagtggtgcagaaagggtggacaaacagaaaatatccagtgcttgtgtgcct gaacatatgttaatcatggaaaatgatgagaaaacaggaggagatagtggggaagatgag agcgaatatggacaggaatga >gi568815594r:121229317_121480647|GENSCAN_predicted_peptide_4|258_aa MYWKDVEGSQDQREAEEMDLQQKATKGAQRVITIIKLFGRGSEWTDSDNYKDTDETFILF VLMYDLSYFLTAGAFICKMVPFVQSTAVVTEILTMTCIAVERHQGLVHPFKMKWQYTNRR AFTMLGNFEKEYDDVTIKMIFAIVQIIGFSNSICNPIVYAFMNENFKKNVLSAVCYCIVN KTFSPAQRHGNSGITMMRKKAKFSLRENPVEETKGEAFSDGNIEVKLCEQTEEKKKLKRH LALFRSELAENSPLDSGH >gi568815594r:121229317_121480647|GENSCAN_predicted_CDS_4|777_bp atgtattggaaggatgttgaaggctcacaggatcagcgtgaagctgaagaaatggaccta caacagaaagcgaccaagggagctcagagagtaattacaattatcaaactttttggcaga ggctcagaatggactgatagtgataattacaaagacacagatgaaacctttattcttttt gttttgatgtatgatttgtcctatttccttactgcaggtgctttcatttgcaagatggtg ccatttgtccagtctaccgctgttgtgacagaaatcctcactatgacctgcattgctgtg gaaaggcaccagggacttgtgcatccttttaaaatgaagtggcaatacaccaaccgaagg gctttcacaatgctaggtaattttgaaaaggaatatgatgatgtcacaatcaagatgatt tttgctatcgtgcaaattattggattttccaactccatctgtaatcccattgtctatgca tttatgaatgaaaacttcaaaaaaaatgttttgtctgcagtttgttattgcatagtaaat aaaaccttctctccagcacaaaggcatggaaattcaggaattacaatgatgcggaagaaa gcaaagttttccctcagagagaatccagtggaggaaaccaaaggagaagcattcagtgat ggcaacattgaagtcaaattgtgtgaacagacagaggagaagaaaaagctcaaacgacat cttgctctctttaggtctgaactggctgagaattctcctttagacagtgggcattaa >gi568815594r:121229317_121480647|GENSCAN_predicted_peptide_5|475_aa MINITNDQGYANQNHNVIPPYSCKNGHHQNTVDVGMDAVIRELFYIADADGQPVAGGGPA SRERTAMQALNITPEQFSRLLRDHNLTREQFIALYRLRPLVYTPELPGRAKLALVLTGVL IFALALFGNALVFYVVTRSKAMRTVTNIFICSLALSDLLITFFCIPVTMLQNISDNWLGE LITHLRERHRHSYSNTQRYPLQQEEKPIEEPLHGQHPLEASDFRAQVSAPHQGGGRRGQS TLPCQSLSSAIDSVDKLPVVKAKATHVVMNYVITKQTQESFQHFGQQAGLRDPGYTPTRV SPLRRPTLPGPNPSTMDSGSGDKDRNLSDKWGLFGLRSLQKYDSGSFATQAYRGAQKPSP MELIRAQANRMAEDPAALKPPKMDFPVTEGRKQPPRAHNLKPRLTNVHKHQENMTSPNEL NKAPVTSPGVTEICDLSDREFKITVLRKLKEAQDNTEKEFRIVSEKFNKEIEIIF >gi568815594r:121229317_121480647|GENSCAN_predicted_CDS_5|1428_bp atgatcaacatcactaatgatcagggatatgcaaatcaaaaccacaatgtgataccacct tactcctgcaagaatggccatcatcaaaacacagtagatgttggcatggatgcagtgatc agggaactcttctacattgctgatgcggatggccagccagtagcgggcggtggccccgcg tcccgggagcgcacagcaatgcaggcgcttaacattaccccggagcagttctctcggctg ctgcgggaccacaacctgacgcgggagcagttcatcgctctgtaccggctgcgaccgctc gtctacaccccagagctgccgggacgcgccaagctggccctcgtgctcaccggcgtgctc atcttcgccctggcgctctttggcaatgctctggtgttctacgtggtgacccgcagcaag gccatgcgcaccgtcaccaacatctttatctgctccttggcgctcagtgacctgctcatc accttcttctgcattcccgtcaccatgctccagaacatttccgacaactggctgggggaa ctaatcacgcacctcagggagcgccatcgtcactcctattcaaacacccagagatacccc ctacagcaggaagaaaagcctattgaggagccactacatgggcagcatcccttggaagcc agtgatttcagagcacaagtatcagcacctcaccaaggtggaggaaggagaggccagtct accctcccctgccagtccctgtcatcggccattgacagtgtggacaagctcccagtggtg aaggctaaagctacccatgtcgtcatgaattatgtgatcacaaaacagacccaggaaagc tttcagcattttgggcaacaggcagggctgagagatcctggctacacacccacaagggtg tcaccactcagaagaccaaccttacctggcccaaatcctagcaccatggactctggaagt ggggataaggacagaaacttgtcagataagtggggcctctttggactgagatcccttcag aagtatgattctggaagttttgccacccaggcctaccgaggagcccagaagccctctcca atggaactgatccgtgcccaggccaaccgaatggctgaagatccagcagccttgaagcca cccaagatggacttcccagtgacggaagggaggaaacagccaccacgggcacataatctc aaaccccgattgacaaatgttcacaaacatcaagaaaatatgacttcaccaaatgaacta aataaggcaccagtgaccagtcctggagtgacagagatttgtgacctttcagacagggaa ttcaaaataactgttctgaggaagctcaaagaagctcaagataacacagagaaggaattc agaattgtatcagagaaatttaacaaagagattgaaataattttttaa >gi568815594r:121229317_121480647|GENSCAN_predicted_peptide_6|167_aa MYGSAWMSRQKFAAGVGLSWRISARAMWKENVGLKPPHRVPTGAQPSGAVRRRPPSSRPQ NGRSTDSLHHAPGKAAETQCQLMKAARREAVPGKATRAELSKTVETHLLHQHDLDVRHGV KGNHFGHLRFDCPTGFWTFMGLVATLFWPVSHIWNGCIYSMLVLHCI >gi568815594r:121229317_121480647|GENSCAN_predicted_CDS_6|504_bp atgtatggaagtgcctggatgtccaggcagaagtttgctgcaggggtggggctctcatgg agaatttctgctagggcaatgtggaaggaaaatgtggggttgaagcccccacacagagtc cctactggagcacaacctagtggagctgtgagaaggaggccaccatcctccagaccccag aatggtagatccactgacagcttgcatcatgcacctggaaaagctgcagaaactcaatgc cagctcatgaaagcagcccggagggaggctgtacctggcaaagccacgagggcagagctg tccaagaccgtggaaacccacctcttgcatcagcatgacctggatgtgagacatggagtc aaaggaaatcattttggacatttaagatttgactgccccactggattttggactttcatg gggcttgtagccacgttattttggccagtgtctcacatttggaatggctgtatttactca atgcttgtactccactgtatctag >gi568815594r:121229317_121480647|GENSCAN_predicted_peptide_7|79_aa MRAAAAAMSRESLPSGHMHVFCGLAAGESGIAVSDNSYWQTAESTCFGPRQQLQWQQKET GKRLGEDLEREILWNTELW >gi568815594r:121229317_121480647|GENSCAN_predicted_CDS_7|240_bp atgagggcagcagcagcagcaatgagcagggaaagccttccctcagggcacatgcatgtt ttctgtggccttgctgctggggagagtggaattgctgtcagtgacaacagctattggcag acagctgagagcacctgctttggccctaggcagcagttgcagtggcagcagaaagaaacg ggaaagaggctaggagaagacctagaaagagagatcctctggaacaccgagctgtggtga >gi568815594r:121229317_121480647|GENSCAN_predicted_peptide_8|153_aa MRRKQHKKAENSKNQNASSPGKDHNSSPAMSLKMENEFDELSEEGFRRWVITISSELKEH VLTQCKEAKNLEKMLDELLTRITSVEKNINDLVEQKDTARELCEAYTSFNSQINQAEERI SVIEYQLNEIKREEKIREKRIKRNKASKKYGTM >gi568815594r:121229317_121480647|GENSCAN_predicted_CDS_8|462_bp atgaggagaaagcagcacaaaaaggctgaaaattccaaaaaccagaatgcctcttctcct ggaaaggatcacaactcctcaccagcaatgagtttgaagatggagaatgagtttgacgaa ttgtcagaagaaggcttcagaagatgggtaataacaatctcctccgagctaaaggaacat gttctaacccaatgcaaggaagctaagaaccttgaaaaaatgttagatgaattgctaact agaataaccagtgtagaaaagaacataaatgacctggtggagcagaaagacacagcaaga gaactttgtgaagcatacacaagttttaatagccaaatcaatcaagcagaagaaaggata tcagtgattgaatatcaacttaatgaaataaagagagaagagaaaattagagaaaaaaga ataaaaaggaacaaagcctccaagaaatatgggactatgtga >gi568815594r:121229317_121480647|GENSCAN_predicted_peptide_9|74_aa MSSTFISNNTAIQELFKRILEQLTVIRRCKASLHWYMGEGMNEMDEYIEAESNMNDLVPK YQQYQDAMAKEDRV >gi568815594r:121229317_121480647|GENSCAN_predicted_CDS_9|225_bp atgtcctccaccttcatcagcaacaacacagccatccaggagctgtttaagcgcatcttg gagcagctcactgtcatacgcaggtgcaaggcctccctgcactggtacatgggtgagggc atgaatgagatggatgagtacattgaggctgagagcaacatgaatgacctggtgcccaag taccaacagtaccaggatgccatggccaaggaagacagagtttga >gi568815594r:121229317_121480647|GENSCAN_predicted_peptide_10|107_aa MEESRSGTVREGGVTTESRDQKYRFEDATLLALKMEERATSQRVQMVSTNWKSYLVESVS CASLVLGNGALEVNGQSVGIVPAYGKGGAPFLSYTVSTRAHRSSTVH >gi568815594r:121229317_121480647|GENSCAN_predicted_CDS_10|324_bp atggaagaaagcaggagtggcacagtcagagaaggaggtgtgacaacagaaagcagagat cagaaatacagatttgaagatgctaccctgctggccctgaaaatggaggaaagggccaca agccaaagagtacagatggtctctactaactggaaaagttacctggtagaatctgtttcc tgtgccagccttgtgttgggcaatggtgctttggaagttaatggacagtcagtgggcatt gtgcctgcttatgggaagggtggtgcacccttcctgtcctacactgtctccaccagggcc cacagaagcagcacagttcattga >gi568815594r:121229317_121480647|GENSCAN_predicted_peptide_11|79_aa MLVSGTSLATVQQSTKWALGVPKSRPGLLDSMSGPFLGQRRAHCPEGLIQSIWDEGGTKL NGSNKLPSNAKAAGSLIAL >gi568815594r:121229317_121480647|GENSCAN_predicted_CDS_11|240_bp atgctggtttcaggtaccagcttagccacagtgcagcagagcaccaagtgggctcttggg gtccccaagtccaggcctgggctcttggacagcatgtctggacctttcctgggccagagg agagcccactgccctgaaggactaattcagtccatttgggatgagggtggaacaaaactg aatggttctaacaagcttccaagtaatgccaaagctgctggttctctgattgcactttga