GENSCAN 1.0    Date run: 3-Nov-116    Time: 08:17:44

Sequence gi568815593r:163353970_163560050 : 206081 bp : 38.60% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2091   2316  226  0  1   73   44   116 0.339   2.76
 1.02 Intr +   2837   2962  126  1  0   -2   56   126 0.155   0.36
 1.03 Intr +   5729   6025  297  1  0   88    1   173 0.441   4.75
 1.04 Term +  11037  11264  228  2  0   69   42   239 0.882  13.05
 1.05 PlyA +  11811  11816    6                               1.05

 2.00 Prom +  25016  25055   40                              -5.55
 2.01 Sngl +  26458  26826  369  0  0   71   38   189 0.878   8.06
 2.02 PlyA +  26854  26859    6                              -1.95

 3.00 Prom +  27051  27090   40                              -3.65
 3.01 Init +  31321  31544  224  0  2   34   37   196 0.029   7.18
 3.02 Intr +  46524  46624  101  0  2   36   93    68 0.014   0.93
 3.03 Intr +  59063  59198  136  0  1   94   82    70 0.264   5.71
 3.04 Intr +  82517  82542   26  2  2  125  106     2 0.077   2.45
 3.05 Intr +  83753  83781   29  0  2  107   83    10 0.134  -0.68
 3.06 Intr +  85288  85551  264  0  0   76   95   153 0.953  11.59
 3.07 Intr +  87109  87276  168  0  0   81   93   100 0.933   9.02
 3.08 Intr +  88076  88174   99  1  0   49   95    49 0.619   1.09
 3.09 Term +  90368  90430   63  1  0  102   41    27 0.363  -3.69
 3.10 PlyA +  90473  90478    6                               1.05

 4.00 Prom + 100160 100199   40                              -4.75
 4.01 Sngl + 105929 106366  438  1  0   70   48   405 0.991  28.71
 4.02 PlyA + 107016 107021    6                               1.05

 5.00 Prom + 107429 107468   40                             -12.33
 5.01 Init + 107829 107892   64  2  1   53   79    24 0.416  -0.64
 5.02 Intr + 109887 109985   99  1  0  107   85    57 0.505   6.46
 5.03 Intr + 110754 110833   80  1  2   90  100    56 0.513   5.35
 5.04 Intr + 115672 115860  189  0  0   70   65   130 0.492   7.76
 5.05 Intr + 117394 117494  101  0  2   58   93    59 0.946   1.39
 5.06 Intr + 119210 119284   75  2  0   47   58   120 0.859   2.61
 5.07 Intr + 119410 119588  179  1  2   45   62   167 0.981   8.44
 5.08 Intr + 120088 120236  149  2  2   74   25   135 0.948   4.83
 5.09 Intr + 121489 121703  215  0  2   30    9   257 0.899   8.59
 5.10 Intr + 124715 124831  117  2  0   77   63   112 0.964   6.26
 5.11 Intr + 128673 128819  147  0  0   56   71   207 0.997  14.23
 5.12 Intr + 129051 129203  153  0  0   64   84   142 0.996   9.67
 5.13 Intr + 129299 129398  100  2  1   83   72    86 0.995   5.69
 5.14 Intr + 136421 136583  163  1  1  100  116    95 0.998  12.13
 5.15 Intr + 142960 143175  216  2  0   93   70   107 0.676   6.95
 5.16 Term + 143263 143354   92  2  2   87   39    75 0.730  -0.60
 5.17 PlyA + 143478 143483    6                               1.05

 6.00 Prom + 146736 146775   40                              -4.85
 6.01 Init + 149426 149455   30  1  0   82  110    42 0.750   5.59
 6.02 Intr + 158033 158227  195  1  0   85   80    92 0.929   6.79
 6.03 Intr + 159586 159700  115  0  1   42   95   115 0.972   6.70
 6.04 Intr + 159873 160025  153  1  0   82   82   134 0.998  11.32
 6.05 Intr + 162549 162742  194  1  2  104   99   201 0.999  21.09
 6.06 Intr + 163592 163705  114  1  0   68   95    30 0.725   1.42
 6.07 Term + 164224 164394  171  0  0   80   29   169 0.999   7.04
 6.08 PlyA + 165157 165162    6                               1.05

 7.06 PlyA - 166072 166067    6                               1.05
 7.05 Term - 169779 169689   91  2  1   74   47    85 0.568  -0.79
 7.04 Intr - 173303 173188  116  2  2   21   92    75 0.086  -0.47
 7.03 Intr - 177567 177522   46  2  1  107   81    34 0.416   2.09
 7.02 Intr - 180868 180695  174  0  0   67   86    78 0.856   3.63
 7.01 Init - 181555 181368  188  1  2   91  103    92 0.950   9.59

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  31321  31548  228  0  0   34   53   197 0.803   5.67
S.002 Sngl +  37904  38242  339  1  0   77   32   190 0.883   8.08

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:163353970_163560050|GENSCAN_predicted_peptide_1|292_aa
XAWDLQPAMPEPPPNLWAPAQPEPPQQVPPPASRHPVPSTTQGLRSVGAWHGTGRQLHLQ
PPCRIHWVKPAGLLSLVCSFTPEPARPRTHQKEETLNTSEHQKEQTPDKPPLRTVTLTTW
RPRKDSGLLGWIQSPPCCVQPCNLVPYTLATPAMAKRSQCIAQVIASEGASPKPWQLPHG
VGPAGVQKSRSEIWEPLPRFQRMYGNTWMSSQKIAAGLSVYPNYSWMWDKNSGPAECGYK
ERLEHCSPLPFTGRGQLPHMMGSSTEVEAAAGPVPRATGRSGTMGLKELGML

>gi568815593r:163353970_163560050|GENSCAN_predicted_CDS_1|879_bp
nnggcttgggacctgcagcctgccatgcctgagcctccccccaacctgtgggctcctgcg
cagcccgagcctccccaacaagtgccaccccctgcttcacggcacccagtcccatcgacc
acccaagggctgaggagtgtgggcgcatggcatgggactggcaggcagctccacctgcag
cccccatgcaggatccactgggtgaagccagctgggctcctgagtctggtctgcagcttc
actcctgagccagcgagaccacgaacccaccagaaggaagaaactctgaacacatccgaa
catcagaaggaacaaactccggacaagccgcctttaagaactgtaacactcaccacctgg
aggcctaggaaggatagtggtctcctgggctggattcagagccctccttgctgtgtgcag
ccttgcaacttggtgccctacaccctagccactccagccatggctaaaaggagccaatgt
atagctcaggtcattgcttcagagggtgcaagccccaagccttggcagcttccacatggt
gttgggcctgcaggtgtgcagaagtcaagaagtgagatttgggaacctctgcctagattt
cagaggatgtatggaaacacctggatgtccagtcaaaagattgctgcagggttgtcagtg
taccctaattattcttggatgtgggacaagaactcgggacctgctgaatgtgggtacaaa
gaaagactggaacactgtagccctctgcccttcactggcagagggcagctgccccacatg
atgggaagcagcactgaggtggaagcagcagcagggccagtccccagagccacaggccgg
agtgggacaatgggactaaaagagcttggcatgctgtaa

>gi568815593r:163353970_163560050|GENSCAN_predicted_peptide_2|122_aa
MGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRINRQPTEWEKILAIYSSDKGL
ISRIYNELKQIYKKKTNNPINKWVKDMNRHFSKEDIYAPKRHMKKCSSSLAIREIRIKTT
MR

>gi568815593r:163353970_163560050|GENSCAN_predicted_CDS_2|369_bp
atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaaccaaagccaaaattgac
aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagaata
aacaggcaacctacagaatgggagaaaattttggcaatctactcatctgacaaagggcta
atatccagaatctacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatc
aacaagtgggtgaaggatatgaacagacacttctcaaaagaagacatttatgcacccaaa
agacacatgaaaaaatgctcatcatcactggccatcagagaaatacgaatcaaaaccaca
atgagataa

>gi568815593r:163353970_163560050|GENSCAN_predicted_peptide_3|369_aa
MTDVLIKSGNFETARHRERILHENSGREQCDDSTSQGTANIANKPPEARRESPSQVSEGI
NSAKNVIVDFQPPELYVDTSFSSRINLNQQLGSTQNLHFSDDVFEAQKGPLEVGGSTAAM
TLGRKAPCQADLGFRLLELSTPLAVSSAALPFSRVLFCGIALGLSDLISWGEMIEVLTTT
DSQKLLHQLNALLEQESRCQPKVCGLRLIESAHDNGLRMTARLRDFEVKDLLSLTQFFGF
DTETFSLAVNLLDRFLSKMKVQPKHLGCVGLSCFYLAVKSIEEERNVPLATDLIRISQYR
FTVSDLMRMEKIVLEKPSVLALSIIALEIQAQKCVELTEGIECLQKHSKKTGDICIYDKN
FYTEPTASN

>gi568815593r:163353970_163560050|GENSCAN_predicted_CDS_3|1110_bp
atgactgatgttcttataaaaagtggaaattttgagacagcaagacacagagagcgaata
ctacatgaaaattcaggcagagagcaatgtgatgattctacaagtcaaggaacagcaaac
attgccaacaaaccaccagaagccaggcgagagtctccctctcaagtctcagagggaatc
aactctgccaaaaatgtgatcgtggatttccaacctccggaactttatgtggacacatca
ttcagttcccgaatcaatttgaatcaacagctaggttccacacaaaaccttcatttttca
gatgatgtatttgaagcacagaaagggccccttgaagtgggcggctccaccgcagccatg
acactggggcgcaaggcaccttgccaagccgacctaggtttcaggcttctggagttgagc
actccattggcagtttcttctgcagctttgccattctctagagttttgttctgtggcatt
gccttgggcctctcggatctgatatcgtggggtgagatgatagaggtactgacaacaact
gactctcagaaactgctacaccagctgaatgccctgttggaacaggagtctagatgtcag
ccaaaggtctgtggtttgagactaattgagtctgcacacgataatggcctcagaatgact
gcaagactaagggactttgaagtaaaagatcttcttagtctaactcagttctttggcttt
gacacagagacattttctctagctgtgaatttactggacagattcctgtctaaaatgaag
gtacagcccaagcaccttgggtgtgttggactgagctgcttttatttggctgtaaaatca
atagaagaggaaaggaatgtcccattggcaactgacttgatccgaataagtcaatatagg
tttacggtttcagacttgatgagaatggaaaagattgtattggagaagccttctgtgttg
gcattgtctatcattgcattagagatccaagcacagaagtgtgtagagttaacagaagga
atagaatgtcttcagaaacattccaagaaaacaggtgacatttgtatctacgataaaaat
ttttatacagaacctactgcctcaaactga

>gi568815593r:163353970_163560050|GENSCAN_predicted_peptide_4|145_aa
MPALEAALDILGARAWRHLNFNEHLLQGLVPLAPRGPARYHPTPLLKRGRHNPVPPGRGR
TRRSRAHARNPTLRLRLGLSARVTAHAQERKLALWCACSFFPLNGRLRHRQAPDERTEIQ
KSSHATNNGTELEPVKPELTTPSPK

>gi568815593r:163353970_163560050|GENSCAN_predicted_CDS_4|438_bp
atgccggctctggaggccgcactggatatcctgggcgcgcgtgcctggcggcacctgaac
ttcaatgaacacctcctccaaggtctggtaccactggccccacggggtcccgcacggtac
caccccactccgctcctcaaacggggccgacataatccagtccctcccggccgcggccgc
accaggcggagccgagcgcacgcgcggaatcccacgcttaggctacgcctcggcctctcc
gctcgggtcactgcgcatgcgcaggaacgcaagctagcgctttggtgtgcgtgttcgttt
ttccctttgaatggccgtttacggcaccggcaggccccggatgaaagaactgaaatccag
aaaagttcacatgcaacgaacaatgggacagaattggaaccagtgaagccggaactcact
acaccgtcccctaagtga

>gi568815593r:163353970_163560050|GENSCAN_predicted_peptide_5|712_aa
MLYAGTVLLDLVYSFQKPSEEGCAPSPGAYDVKTLEVLKGPVSFQKSQRFKQQKESKQNL
NVDKDTTLPASARKVKSSESKIRVLLQERGAQDRRIQDLETELEKMEARLNAALREKTSL
SANNATLEKQLIELTRTNELLKSKGMMAKQEGMEMKLQVTQRSLEESQGKIAQLEGKLVS
IEKEKIDEKSETEKLLEYIEEISCASDQVEKYKLDIAQLEENLKEKNDEILSLKQSLEEN
IVILSKQVEDLNVKCQLLEKEKEDHVNRNREHNENLNAEMQNLKQKFILEQQEREKLQQK
ELQIDSLLQQEKELSSSLHQKLCSFQEEMVKEKNLFEEELKQTLDELDKLQQKEEQAERL
VKQLEEEAKSRAEELKLLEEKLKGKEAELEKSSAAHTQATLLLQEKYDSMVQSLEDVTAQ
FESYKALTASEIEDLKLENSSLQEKAAKAGKNAEDVQHQILATESSNQEYVRMLLDLQTK
SALKETEIKEITVSFLQKITDLQNQLKQQEEDFRKQLEDEEGRKAEKENTTAELTEEINK
WRLLYEELYNKTKPFQEVSKLRCQLAKKKQSETKLQEELNKVLGIKHFDPSKAFHHESKE
NFALKTPLKEEQLLVPKSVLVTVLQRNRTSKRYIEKLIYYEELADMIMKAEKACNLPFAG
WRPRKAGVVQYKLESLRTIIAEAFNGPSISFSASTNSNINLIQKYPHRQPLK

>gi568815593r:163353970_163560050|GENSCAN_predicted_CDS_5|2139_bp
atgctgtatgctgggactgtgctactagatcttgtttattccttccaaaagccaagtgag
gaaggttgtgcaccatctccaggtgcttatgatgttaaaactttagaagtattgaaagga
ccagtatcctttcagaaatcacaaagatttaaacaacaaaaagaatctaaacaaaatctt
aatgttgacaaagatactaccttgcctgcttcagctagaaaagttaagtcttcggaatca
aagattcgtgttcttctacaggaacgtggtgcccaggacaggcggatccaggatctggaa
actgagttggaaaagatggaagcaaggctaaatgctgcactaagggaaaaaacatctctc
tctgcaaataatgctacactggaaaaacaacttattgaattgaccaggactaatgaacta
ctaaaatctaagggtatgatggctaagcaagaaggcatggagatgaagctgcaggtcacc
caaaggagtctcgaagagtctcaagggaaaatagcccaactggagggaaaacttgtttca
atagagaaagaaaagattgatgaaaaatctgaaacagaaaaactcttggaatacatcgaa
gaaattagttgtgcttcagatcaagtggaaaaatacaagctagatattgcccagttagaa
gaaaatttgaaagagaagaatgatgaaattttaagccttaagcagtctcttgaggagaat
attgttatattatctaaacaagtagaagatctaaatgtgaaatgtcagctgcttgaaaaa
gaaaaagaagaccatgtcaacaggaatagagaacacaacgaaaatctaaatgcagagatg
caaaacttaaaacagaagtttattcttgaacaacaggaacgtgaaaagcttcaacaaaaa
gaattacaaattgattcacttctgcaacaagagaaagaattatcttcgagtcttcatcag
aagctctgttcttttcaagaggaaatggttaaagagaagaatctgtttgaggaagaatta
aagcaaacactggatgagcttgataaattacagcaaaaggaggaacaagctgaaaggctg
gtcaagcaattggaagaggaagcaaaatctagagctgaagaattaaaactcctagaagaa
aagctgaaagggaaggaggctgaactggagaaaagtagtgctgctcatacccaggccacc
ctgcttttgcaggaaaagtatgacagtatggtgcaaagccttgaagatgttactgctcaa
tttgaaagctataaagcgttaacagccagtgagatagaagatcttaagctggagaactca
tcattacaggaaaaagcggccaaggctgggaaaaatgcagaggatgttcagcatcagatt
ttggcaactgagagctcaaatcaagaatatgtaaggatgcttctagatctgcagaccaag
tcagcactaaaggaaacagaaattaaagaaatcacagtttcttttcttcaaaaaataact
gatttgcagaaccaactcaagcaacaggaggaagactttagaaaacagctggaagatgaa
gaaggaagaaaagctgaaaaagaaaatacaacagcagaattaactgaagaaattaacaag
tggcgtctcctctatgaagaactatataataaaacaaaaccttttcaggaagtatcaaaa
ctccgctgtcagcttgctaaaaaaaaacaaagtgagacaaaacttcaagaggaattgaat
aaagttctaggtatcaaacactttgatccttcaaaggcttttcatcatgaaagtaaagaa
aattttgccctgaagaccccattaaaagaagaacaactcttggtaccaaaatctgtatta
gtcacagttctccagagaaacagaaccagtaagagatacatagaaaagttgatttactat
gaggaattggctgacatgattatgaaggctgagaaggcttgcaacctgccatttgcaggc
tggagacccaggaaagcaggtgttgttcagtacaagcttgaaagcctgagaaccatcata
gcagaagccttcaatggaccctcaatcagctttagtgcatccactaattcaaatattaac
ctcattcagaaataccctcacagacagcccctgaagtag

>gi568815593r:163353970_163560050|GENSCAN_predicted_peptide_6|323_aa
MPEMPEDMEQEEVNIPNRRVLVTGATGLLGRAVHKEFQQNNWHAVGCGFRRARPKFEQVN
LLDSNAVHHIIHDFQPHVIVHCAAERRPDVVENQPDAASQLNVDASGNLAKEAAAVGAFL
IYISSDYVFDGTNPPYREEDIPAPLNLYGKTKLDGEKAVLENNLGAAVLRIPILYGEVEK
LEESAVTVMFDKVQFSNKSANMDHWQQRFPTHVKDVATVCRQLAEKRMLDPSIKGTFHWS
GNEQMTKYEMACAIADAFNLPSSHLRPITDSPVLGAQRPRNAQLDCSKLETLGIGQRTPF
RIGIKESLWPFLIDKRWRQTVFH

>gi568815593r:163353970_163560050|GENSCAN_predicted_CDS_6|972_bp
atgcctgaaatgccagaggacatggagcaggaggaagttaacatccctaataggagggtt
ctggttactggtgccactgggcttcttggcagagctgtacacaaagaatttcagcagaat
aattggcatgcagttggctgtggtttcagaagagcaagaccaaaatttgaacaggttaat
ctgttggattctaatgcagttcatcacatcattcatgattttcagccccatgttatagta
cattgtgcagcagagagaagaccagatgttgtagaaaatcagccagatgctgcctctcaa
cttaatgtggatgcttctgggaatttagcaaaggaagcagctgctgttggagcatttctc
atctacattagctcagattatgtatttgatggaacaaatccaccttacagagaggaagac
ataccagctcccctaaatttgtatggcaaaacaaaattagatggagaaaaggctgtcctg
gagaacaatctaggagctgctgttttgaggattcctattctgtatggggaagttgaaaag
ctcgaagaaagtgctgtgactgttatgtttgataaagtgcagttcagcaacaagtcagca
aacatggatcactggcagcagaggttccccacacatgtcaaagatgtggccactgtgtgc
cggcagctagcagagaagagaatgctggatccatcaattaagggaacctttcactggtct
ggcaatgaacagatgactaagtatgaaatggcatgtgcaattgcagatgccttcaacctc
cccagcagtcacttaagacctattactgacagccctgtcctaggagcacaacgtccgaga
aatgctcagcttgactgctccaaattggagaccttgggcattggccaacgaacaccattt
cgaattggaatcaaagaatcactttggcctttcctcattgacaagagatggagacaaacg
gtctttcattag

>gi568815593r:163353970_163560050|GENSCAN_predicted_peptide_7|204_aa
MAREDSTATTAEAFAMGHCHSWSTRLAHQPFVIRYKPTSRWLLLHFISSFKTFPEDHFTY
KNINISVLLLRKLKLSQVAKLAQAQTSRVFICKCQDLHSNSGLCDLKADTLPITRASEKQ
GLRTGPQAKEGDYLQKALTVEPSHLDLLWNLSQEATERCAAKTRKHNKVKEMKHSFGVNC
NYRRTLLKARTIIPLSTFQGPYVF

>gi568815593r:163353970_163560050|GENSCAN_predicted_CDS_7|615_bp
atggccagggaggattctactgccaccactgccgaggcatttgccatgggtcattgccat
tcttggagcaccagactggcccatcagcccttcgtgatacgatacaaacccacaagcagg
tggttgctgctccacttcatttcctcattcaaaacatttcctgaagatcatttcacttac
aagaacataaatatctctgttttactgttgaggaaactaaagctcagtcaagttgctaaa
cttgcccaagctcaaacttccagggtgtttatatgtaaatgccaagatttgcattcaaac
tcaggtctctgtgaccttaaagctgatactctcccaattacaagagccagtgaaaaacag
gggttgaggacagggccgcaagccaaggaaggtgattacctccaaaaggcactcacagtc
gagccttcacatttggatctgttatggaacctttctcaggaagctactgaaagatgtgca
gccaaaactagaaagcacaacaaagtgaaagaaatgaaacacagtttcggggtgaattgt
aactacagacgaacacttctgaaggccagaaccatcatccctctttcaactttccaagga
ccttatgtgttctaa