GENSCAN 1.0 Date run: 3-Nov-116 Time: 02:44:44 Sequence gi568815593r:171768638_172106502 : 337865 bp : 43.76% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5251 5321 71 0 2 62 101 58 0.143 5.02 1.02 Intr + 11687 11771 85 2 1 127 75 22 0.356 4.62 1.03 Intr + 17217 17339 123 2 0 42 94 107 0.214 7.48 1.04 Intr + 21593 21644 52 1 1 112 72 13 0.737 0.58 1.05 Intr + 21821 21912 92 0 2 77 92 26 0.662 1.61 1.06 Intr + 22158 22404 247 2 1 83 81 226 0.890 18.43 1.07 Term + 37749 37975 227 1 2 37 43 129 0.440 0.24 1.08 PlyA + 38692 38697 6 1.05 2.00 Prom + 39639 39678 40 -3.66 2.01 Init + 42968 43070 103 1 1 114 64 69 0.660 7.50 2.02 Term + 54785 54912 128 0 2 91 46 85 0.104 3.04 2.03 PlyA + 55053 55058 6 -1.95 3.04 PlyA - 55150 55145 6 1.05 3.03 Term - 55904 55882 23 0 2 112 43 24 0.572 -1.23 3.02 Intr - 56505 56451 55 0 1 110 78 44 0.587 4.15 3.01 Init - 56596 56555 42 1 0 55 57 84 0.454 0.42 3.00 Prom - 57632 57593 40 -1.06 4.00 Prom + 60953 60992 40 -5.26 4.01 Init + 64774 64881 108 0 0 95 103 88 0.617 11.22 4.02 Intr + 78986 79068 83 1 2 108 62 54 0.116 3.24 4.03 Intr + 80740 80790 51 1 0 74 78 95 0.131 5.12 4.04 Term + 89021 89168 148 2 1 76 43 96 0.247 1.27 4.05 PlyA + 90764 90769 6 1.05 5.14 PlyA - 90951 90946 6 1.05 5.13 Term - 95464 95315 150 1 0 121 37 48 0.211 0.91 5.12 Intr - 101170 101092 79 0 1 126 68 16 0.974 2.95 5.11 Intr - 102221 102111 111 1 0 50 71 131 0.959 7.09 5.10 Intr - 104353 104235 119 1 2 86 80 104 0.617 8.66 5.09 Intr - 107600 107565 36 2 0 132 42 33 0.734 1.56 5.08 Intr - 107897 107648 250 1 1 108 106 379 0.998 39.14 5.07 Intr - 109492 109374 119 1 2 62 105 79 0.806 6.26 5.06 Intr - 122967 122830 138 0 0 132 63 144 0.242 16.96 5.05 Intr - 130457 130367 91 1 1 74 78 26 0.618 0.20 5.04 Intr - 131463 131277 187 1 1 119 103 38 0.658 7.15 5.03 Intr - 142160 141935 226 2 1 92 93 149 0.898 13.36 5.02 Intr - 145768 145706 63 1 0 115 91 22 0.711 4.11 5.01 Init - 197404 197348 57 1 0 90 76 52 0.525 5.51 5.00 Prom - 204709 204670 40 -2.56 6.03 PlyA - 207774 207769 6 1.05 6.02 Term - 233320 233225 96 1 0 74 40 74 0.332 -0.83 6.01 Init - 237865 237821 45 1 0 120 101 126 0.882 17.78 6.00 Prom - 256534 256495 40 -4.16 7.03 PlyA - 256544 256539 6 1.05 7.02 Term - 259784 259597 188 0 2 91 38 19 0.130 -5.15 7.01 Init - 263942 263807 136 2 1 73 57 171 0.545 12.80 7.00 Prom - 266635 266596 40 -5.76 8.16 PlyA - 267910 267905 6 1.05 8.15 Term - 276385 276245 141 1 0 103 42 190 0.985 13.83 8.14 Intr - 284405 284292 114 2 0 72 88 152 0.813 14.24 8.13 Intr - 286057 285932 126 1 0 83 86 312 0.961 31.38 8.12 Intr - 287139 286951 189 0 0 100 75 443 0.999 43.98 8.11 Intr - 288836 288712 125 0 2 91 100 242 0.915 26.00 8.10 Intr - 292631 292502 130 2 1 147 59 254 0.482 28.67 8.09 Intr - 295619 295596 24 2 0 103 80 20 0.595 0.92 8.08 Intr - 296175 296083 93 0 0 116 70 164 0.985 17.56 8.07 Intr - 313868 313689 180 2 0 95 105 270 0.993 29.46 8.06 Intr - 314447 314324 124 1 1 78 92 164 0.999 16.39 8.05 Intr - 317679 317489 191 0 2 104 103 35 0.748 4.98 8.04 Intr - 321725 321567 159 2 0 63 37 209 0.726 13.48 8.03 Intr - 325323 324775 549 0 0 82 87 533 0.999 45.67 8.02 Intr - 327923 327789 135 2 0 82 66 321 0.999 30.06 8.01 Init - 328772 328734 39 2 0 64 86 14 0.420 -2.63 8.00 Prom - 329190 329151 40 -3.16 9.00 Prom + 329194 329233 40 -9.26 9.01 Init + 330205 330689 485 0 2 56 55 1031 0.570 91.38 9.02 Term + 330977 331070 94 2 1 74 43 107 0.496 2.10 9.03 PlyA + 331158 331163 6 1.05 10.00 Prom + 331847 331886 40 -3.26 10.01 Init + 333722 333858 137 1 2 89 65 163 0.891 13.71 10.02 Intr + 336859 336962 104 1 2 23 105 92 0.473 4.22 10.03 Term + 337199 337329 131 0 2 55 55 121 0.784 3.84 10.04 PlyA + 337765 337770 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:171768638_172106502|GENSCAN_predicted_peptide_1|298_aa MDKETSLWLEDYTKEVGLMPGLEWKCEAPSAVLDPLPVPRPHQLLGQFSEGQAARSEAMA TQQVDSRRQVAAEQVAAQLLERRRGSHCDDEKQTLLALLILVLYLSTEIWEVMFVPVSGS SWEVSERIRECNYYQNLAVPQGLEYQTNEPSEEPIKTIRNWLKEKLHVFSEKLEEEVQQL EQLAWDLELWLDALLGEPHQEEHCSTYKSHLWEWAWALGREHKDFPVESGKRAELPAQTD MSQSQGSSWKHKDQEKEGRVKTLAQDLARGGTSTEPPLRAENRDLNLIESNLQGTGQP >gi568815593r:171768638_172106502|GENSCAN_predicted_CDS_1|897_bp atggataaagagacatccctgtggttggaggactacaccaaggaggtggggctcatgcca ggccttgaatggaagtgtgaagcacccagtgctgtactagaccctctgccagtcccccgc ccacaccagctactgggccagttctctgagggccaggcagccagatctgaggccatggca acccagcaagtggacagcagaaggcaggtggcagcagagcaggtggcagcccagctgctt gaacggagaaggggcagccactgtgatgacgagaagcagacgctgttggcattgctgatc ttggtgctgtacttgagcacagagatatgggaggtgatgtttgtgcctgtttcaggaagc agttgggaggtgtcagaaaggatcagagaatgtaactactaccagaatcttgcagttccc caggggcttgaatatcagaccaacgagccctcagaagaaccgataaagaccatcaggaac tggctgaaggagaagttgcatgtcttctcggagaagttagaggaagaggtgcagcagctg gagcagctagcgtgggacctggaactgtggctggatgctcttctgggagagccacaccag gaggagcactgctccacatataaaagtcacttgtgggagtgggcctgggccctggggaga gagcacaaagattttcctgtggaaagtggtaaacgggctgagcttcctgcacagactgac atgagccagtcccaaggcagctcttggaaacacaaggaccaagagaaagagggacgagtg aaaacacttgcccaggacctggcacggggtggtacaagcactgaaccaccattgagggca gagaaccgggatttgaatctcattgaatccaatctccagggcacaggacagccatag >gi568815593r:171768638_172106502|GENSCAN_predicted_peptide_2|76_aa MPSPITSFLHLLGKTRQYFSTGVIHSKIVNKSTKSLMLDFDPAPPPMQMSNPYSLCLLEL VPQSSDLKFASFMALS >gi568815593r:171768638_172106502|GENSCAN_predicted_CDS_2|231_bp atgcccagccccatcacatccttcttacatttacttgggaaaactagacagtacttcagc actggtgttatacacagcaaaatcgtcaacaaaagcacaaaatccttgatgctggacttt gaccccgccccacctcctatgcaaatgtcaaatccttacagcctctgccttctagagctg gttcctcagtcctcagacctgaaatttgcttccttcatggccctgagctga >gi568815593r:171768638_172106502|GENSCAN_predicted_peptide_3|39_aa MWLILAGLAWAGLWVSDGLKDVLVMAKGKSRRACIGMII >gi568815593r:171768638_172106502|GENSCAN_predicted_CDS_3|120_bp atgtggctgatcttggctgggctcgcctgggctggtctctgggtcagtgatggcctaaag gatgttcttgtcatggcaaaaggcaagagtagaagagcctgcattggcatgatcatttaa >gi568815593r:171768638_172106502|GENSCAN_predicted_peptide_4|129_aa MLERPTWKGTEGTLQPSGNEELKLAAYQELNPTNNHMRTLRLMEVTELTWGQSTTQQQGR VCPRLPGGMEDKNSGNGQKRRWAAGSVPKDKPGKGRSGEAIPGRLFVIAARALAVRRVWC CLGRSVKRD >gi568815593r:171768638_172106502|GENSCAN_predicted_CDS_4|390_bp atgctggagaggcccacatggaaaggaactgagggaaccctccagccatcaggaaatgag gagctaaagctggcagcctaccaggaactgaatcccaccaacaaccatatgaggaccctg aggctcatggaagttacagaactcacctggggccagtcaaccactcagcagcagggccgg gtctgccccaggctccctggggggatggaagataaaaacagcggcaacgggcaaaaacgg agatgggctgcagggagcgttcccaaggacaagccaggcaagggccggagtggagaagct attcccgggcgcctgtttgttattgctgccagagccctggccgttaggagagtgtggtgc tgtcttggaagaagtgtgaagcgggattag >gi568815593r:171768638_172106502|GENSCAN_predicted_peptide_5|541_aa MAEGKGETSMSYHADAGERNTSVMEDQNEDESPKKNTLWQISNGTSSVIVSRKRPSEGNY QKEKDLCIKYFDQWSESDQVEFVEHLISRMCHYQHGHINSYLKPMLQRDFITALPEQGLD HIAENILSYLDARSLCAAELVCKEWQRVISEGMLWKKLIERMVRTDPLWKGLSERRGWDQ YLFKNRPTDGPPNSFYRSLYPKIIQDIETIESNWRCGRHNLQRIQCRSENSKGVYCLQYD DEKIISGLRDNSIKIWDKTSLECLKVLTGHTGSVLCLQYDERVIVTGSSDSTVRVWDVNT GEVLNTLIHHNEAVLHLRFSNGLMVTCSKDRSIAVWDMASATDITLRRVLVGHRAAVNVV DFDDKYIVSASGDRTIKNRDKAEVAYTNLVWSTSTCEFVRTLNGHKRGIACLQYRDRLVV SGSSDNTIRLWDIECGACLRVLEGHEELVRCIRFDNKRIVSGAYDGKIKVWDLQAALDPR APASTLCLRTLVGFLVLNYWLRGYQMPKGVRSQLSYEAGIGSRRWVDAKQPNSSSTDISH L >gi568815593r:171768638_172106502|GENSCAN_predicted_CDS_5|1626_bp atggctgaaggcaaaggggaaacaagcatgtcttaccacgctgatgcaggagagaggaac acttcagttatggaagatcaaaatgaagatgagtccccaaagaaaaatactctttggcag ataagtaatggaacatcatctgtgatcgtctccagaaagaggccatcagaaggaaactat caaaaagaaaaagacttgtgtattaaatattttgaccagtggtctgaatcagatcaagtg gaatttgtggaacatcttatttcacgaatgtgtcattatcagcatggacatattaactct tacctgaagcccatgttgcagcgggactttattaccgctttaccagagcaaggcttagat cacatagcagaaaacattctttcgtacctggatgccaggtctctgtgtgcagcagagctg gtatgtaaagaatggcagcgagtgatctcagaaggaatgctttggaagaagctgattgaa cgaatggtacgcactgatcccctatggaaaggactttcagaaagaagagggtgggatcag tacctgtttaaaaacagacccacagatggccctccaaattcattttataggtcattatac ccaaagattatccaggatatagagactatagaatctaactggcggtgtggacgacacaac ttgcagaggattcagtgccgctctgaaaatagtaaaggtgtctactgtttacagtacgat gatgaaaaaattatcagtggcctacgagataattctattaagatatgggataaaaccagc ctggaatgtttgaaagtgttaacaggacacacaggctctgtcctctgtctgcagtatgat gagcgtgtcattgtaactggctcttcagattctacggtgagagtgtgggatgtgaacacg ggtgaagttcttaacacattgatccaccacaatgaggctgtattgcacttacgcttcagc aatggactgatggtgacctgttccaaggaccgctccattgctgtgtgggacatggcttct gcgaccgacatcactttacgccgtgtcctggttggccaccgggctgccgtcaatgtagta gactttgacgacaagtacatcgtgtctgcctctggtgacaggaccatcaaaaacagagac aaagcagaggtggcttataccaacctggtctggagcacgagcacctgtgaatttgttcgt actctcaatgggcacaagcggggcattgcctgtctccagtacagggatcgcctggttgtt agtggatcatcagataataccattaggctctgggatattgaatgtggtgcctgtttaaga gtcctagagggacatgaagaattggtccgatgcatccggtttgataacaagaggattgtc agtggggcctatgatgggaaaattaaagtttgggacttgcaagctgctcttgaccctcga gccccagcaagcacattgtgtttgcgcacattggtgggttttctagtcttgaactactgg ctacgtggctaccaaatgcctaagggagttcgttcacagctgagttatgaagctggaatt ggttctagacgctgggtagatgcaaagcagcctaactcttcaagtaccgacatttctcac ctctga >gi568815593r:171768638_172106502|GENSCAN_predicted_peptide_6|46_aa MEPDSVIEDKTIELMHMEDHGPPDSAHTFPLSALIGMDLSPFIEGY >gi568815593r:171768638_172106502|GENSCAN_predicted_CDS_6|141_bp atggagcccgactcggtgattgaggacaagaccatcgagctcatgcacatggaggaccat ggtcctccagattctgctcacacgtttcctttgtcagcactgataggaatggacctaagc ccatttattgaaggctactga >gi568815593r:171768638_172106502|GENSCAN_predicted_peptide_7|107_aa MGESLELPRDLLNGFDQNADNDMDNGVQAEVVSDEDEKCVGNGNKGPTASSLAKGKITEL RFDLIPRRRTLQFESNHVNCLFKRNKTKTKKANEHSSEKQQNPVSTT >gi568815593r:171768638_172106502|GENSCAN_predicted_CDS_7|324_bp atgggggaaagtttggaacttcctagagacctgttgaatggttttgaccaaaatgctgat aatgatatggacaatggagtccaagctgaggtggtctccgatgaagatgagaaatgtgtt gggaacgggaataaagggccaactgcaagtagcttggctaagggtaaaataactgaactg agatttgacctgattcccaggagacggactttgcaatttgaatccaaccatgttaactgc ctgttcaaacgaaacaaaacaaaaaccaagaaagccaacgaacactcttcagagaaacaa cagaatccagtctctacaacatga >gi568815593r:171768638_172106502|GENSCAN_predicted_peptide_8|772_aa MEMFCILSWVAVTHPFVSSITSNKALRELVAEAKAEVMEEIEDGRDEGEEEDAVDAASTL ENHTQNSSEVSPPSLNADKPLEESPSTPLAPSQSQDSVNEPCSQPSGDRSLQTTSPPVVA PGNENGLAVPVPLRKSRPVSMDARIQVAQEKQVAEQGGDLSPAANRSQKASQSRPNSSAL ETLGGEKLANGSLEPPAQAAPGPSKRDSDCSSLCTSESMDYGTNLSTDLSLNKEMGSLSI KDPKLYKKTLKRTRKFVVDGVEVSITTSKIISEDEKKDEEMRFLRQAWEGPPMVKSRGSD SLPSVAVESASGFSQRQGSHCLFFLGRIRPVAIYRVLVMCHVGTLPVSSHLSFKGPNRRQ ELRELRLLQKEEHRNQTQLSNKHELQLEQMHKRFEQEINAKKKFFDTELENLERQQKQQV EKMEQDHAVRRREEARRIRLEQDRDYTRFQEQLKLMKKEVKNEVEKLPRQQRKESMKQKM EEHTQKKQLLSFVTSRSMDRDFVAKQKEDLELAMKRLTTDNRREICDKERECLMKKQELL RDREAALWEMEEHQLQERHQLVKQQLKDQYFLQRHELLRKHEKEREQMQRYNQRMIEQLK VRQQQEKARLPKIQRSEGKTRMAMYKKSLHINGGGSAAEQREKIKQFSQQEEKRQKSERL QQQQKHENQMRDMLAQCESNMSELQQLQNEKCHLLVEHETQKLKALDESHNQNLKEWRDK LRPRKKALEEDLNQKKREQEMFFKLSEEAECPNPSTPSKAAKFFPYSSADAS >gi568815593r:171768638_172106502|GENSCAN_predicted_CDS_8|2319_bp atggaaatgttctgtatcttgagctgggtggcagttacgcatcccttcgtcagcagcatc accagtaacaaggctctgcgggagctggtggctgaggccaaggccgaggtgatggaagag atcgaagacggccgggatgagggggaagaggaggacgccgtggatgccgcctccaccctg gagaaccatactcagaactcctctgaggtgagtccgccaagcctcaatgctgacaagcct ctcgaggagtcaccttccaccccgctggcacccagccagtctcaggacagtgtgaatgag ccctgcagccagccctctggggacagatccctccaaaccaccagtcccccagtcgtggcc cctggaaatgagaacggcctggcagtgcctgtgcccctgcggaagtcccgacccgtgtca atggatgccagaattcaggtagcccaggagaagcaagttgctgagcagggtggggacctc agcccagcagccaacagatctcaaaaggccagccagagccggcccaacagcagcgccctg gagaccttgggtggggagaagctggccaatggcagcctggagccacctgcccaggcagct ccagggccttccaagagggactcggactgcagcagcctctgcacctctgagagcatggac tatggtaccaatctctccactgacctgtcgctgaacaaagagatgggctctctgtccatc aaggacccgaaactgtacaaaaaaaccctcaagcggacacgcaaatttgtggtggatggt gtggaggtgagcatcaccacctccaagatcatcagcgaagatgagaagaaggatgaggag atgagatttctcaggcaagcctgggaagggccaccaatggtgaaaagcagaggctcagac agtcttccctctgtggctgtagaaagcgcctctggcttttcccagaggcagggcagccat tgcctcttcttccttggtagaattcgaccagttgccatttacagagtgctggtcatgtgc cacgtgggcactttacctgtgtcatctcatttatccttcaaaggacccaacaggcgccag gaactccgagagcttcggctgctccagaaagaagagcatcggaaccagacccagctgagt aacaagcatgagctgcagctggagcaaatgcataaacgttttgaacaggaaatcaacgcc aagaagaagttctttgacacggaattagagaacctggagcgtcagcaaaagcagcaagtg gagaagatggagcaagaccatgccgtgcgccgccgggaggaggccaggcggatccgcctg gagcaggatcgggactacaccaggttccaagagcagctcaaactgatgaagaaagaggtg aagaacgaggtggagaagctcccccgacagcagcggaaggaaagcatgaagcagaagatg gaggagcacacgcagaaaaagcagcttctttccttcgtcacctccagaagcatggaccgg gactttgtagccaagcagaaggaggacctggagctggccatgaagaggctcaccaccgac aacaggcgggagatctgtgacaaggagcgcgagtgcctcatgaagaagcaggagctcctt cgagaccgggaagcagccctgtgggagatggaagagcaccagctgcaggagaggcaccag ctggtgaagcagcagctcaaagaccagtacttcctccagcggcacgagctgctgcgcaag catgagaaggagcgggagcagatgcagcgctacaaccagcgcatgatagagcagctgaag gtgcggcagcaacaggaaaaggcgcggctgcccaagatccagaggagtgagggcaagacg cgcatggccatgtacaagaagagcctccacatcaacggcgggggcagcgcagctgagcag cgtgagaagatcaagcagttctcccagcaggaggagaagaggcagaagtcggagcggctg cagcaacagcagaaacacgagaaccagatgcgggacatgctggcgcagtgtgagagcaac atgagcgagctgcagcagctgcagaatgaaaagtgccacctcctggtagagcacgaaacc cagaaactgaaggccctggatgagagccataaccagaacctgaaggaatggcgggacaag cttcggccgcgcaagaaggctctggaagaggatctgaaccagaagaagcgggagcaggag atgttcttcaagctgagcgaggaggcggagtgcccaaacccctccaccccaagcaaggcc gccaagttcttcccctacagttctgcggatgcttcttaa >gi568815593r:171768638_172106502|GENSCAN_predicted_peptide_9|192_aa MAAIIIIITTTTIITFTTITTTITTIITITIITTTTIITFTTITTIITITTIITITIIII TTITFTTITTIITITIIINIIITTITITTITITIITTIIPITITITPITTIITITMTITI TTITPVSSRSSSTIRKARYGPAQWLTSVIPALWEAKAVDHLRAQLFGTKGENQDPKGENQ DPKGENQDPKGS >gi568815593r:171768638_172106502|GENSCAN_predicted_CDS_9|579_bp atggcagccatcatcatcatcatcaccaccaccaccatcatcaccttcaccaccatcacc accaccatcaccaccatcattaccattaccatcatcaccaccaccaccatcatcaccttc accaccatcaccaccatcattaccatcaccaccatcattaccattaccatcatcatcatc accaccatcaccttcaccaccatcaccaccatcattaccatcaccatcattatcaacatc atcatcacgaccattaccatcaccaccatcactatcaccatcattaccaccatcatcccc atcaccatcaccattacacccatcaccactatcatcaccatcaccatgaccattaccatc accaccatcacccccgtcagcagcaggagcagcagcacaatcagaaaagcccgttatggc ccggcacagtggctcacatctgtaatcccagcactttgggaggccaaggcggtggatcac ctgagagcacagctttttggaaccaaaggtgagaaccaagatcccaaaggtgagaaccaa gatcccaaaggtgagaaccaagatcccaaaggttcctaa >gi568815593r:171768638_172106502|GENSCAN_predicted_peptide_10|123_aa MGKPVVEHVGEKIQVLSVQCEMPTGPPGSQQLDAGRGMELGVLHAGAGIQEVPGEYPLMD KQLSMLFPGRGENMPSVQVTGQRKLSQAKTVTGPTRRPFHHGHHCPDGDPDIGEEQGRAQ ATQ >gi568815593r:171768638_172106502|GENSCAN_predicted_CDS_10|372_bp atgggaaagccagtggtggagcacgtcggcgaaaagatccaagtgctcagcgttcagtgt gagatgcccactggacctccggggagtcaacagctggacgcgggccgaggcatggagctg ggagtgctgcacgctggagctggcattcaggaggtgcctggggaatacccgctgatggac aaacagctgtccatgctgttccctgggcggggtgagaacatgcccagcgtccaggtcact ggacagaggaaactttcacaagctaaaacagtgacagggcctacacgacgcccttttcat catggacaccactgtccagatggggacccagacatcggcgaggagcagggacgtgcccag gccacgcagtga