GENSCAN 1.0 Date run: 6-Nov-116 Time: 23:38:44 Sequence gi568815593f:163339257_163542562 : 203306 bp : 38.63% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 12162 12202 41 2 2 64 92 50 0.365 2.71 1.02 Intr + 16804 17029 226 1 1 73 44 116 0.423 2.76 1.03 Intr + 17550 17675 126 2 0 -2 56 126 0.145 0.36 1.04 Intr + 20442 20738 297 2 0 88 1 173 0.437 4.75 1.05 Term + 25750 25977 228 0 0 69 42 239 0.882 13.05 1.06 PlyA + 26524 26529 6 1.05 2.00 Prom + 39729 39768 40 -5.55 2.01 Sngl + 41171 41539 369 1 0 71 38 189 0.878 8.06 2.02 PlyA + 41567 41572 6 -1.95 3.00 Prom + 41764 41803 40 -3.65 3.01 Init + 46034 46257 224 1 2 34 37 196 0.029 7.18 3.02 Intr + 61237 61337 101 1 2 36 93 68 0.014 0.93 3.03 Intr + 73776 73911 136 1 1 94 82 70 0.264 5.71 3.04 Intr + 97230 97255 26 0 2 125 106 2 0.077 2.45 3.05 Intr + 98466 98494 29 1 2 107 83 10 0.134 -0.68 3.06 Intr + 100001 100264 264 1 0 76 95 153 0.953 11.59 3.07 Intr + 101822 101989 168 1 0 81 93 100 0.933 9.02 3.08 Intr + 102789 102887 99 2 0 49 95 49 0.619 1.09 3.09 Term + 105081 105143 63 2 0 102 41 27 0.363 -3.69 3.10 PlyA + 105186 105191 6 1.05 4.00 Prom + 114873 114912 40 -4.75 4.01 Sngl + 120642 121079 438 2 0 70 48 405 0.991 28.71 4.02 PlyA + 121729 121734 6 1.05 5.00 Prom + 122142 122181 40 -12.33 5.01 Init + 122542 122605 64 0 1 53 79 24 0.416 -0.64 5.02 Intr + 124600 124698 99 2 0 107 85 57 0.505 6.46 5.03 Intr + 125467 125546 80 2 2 90 100 56 0.513 5.35 5.04 Intr + 130385 130573 189 1 0 70 65 130 0.492 7.76 5.05 Intr + 132107 132207 101 1 2 58 93 59 0.946 1.39 5.06 Intr + 133923 133997 75 0 0 47 58 120 0.859 2.61 5.07 Intr + 134123 134301 179 2 2 45 62 167 0.981 8.44 5.08 Intr + 134801 134949 149 0 2 74 25 135 0.948 4.83 5.09 Intr + 136202 136416 215 1 2 30 9 257 0.899 8.59 5.10 Intr + 139428 139544 117 0 0 77 63 112 0.964 6.26 5.11 Intr + 143386 143532 147 1 0 56 71 207 0.997 14.23 5.12 Intr + 143764 143916 153 1 0 64 84 142 0.996 9.67 5.13 Intr + 144012 144111 100 0 1 83 72 86 0.995 5.69 5.14 Intr + 151134 151296 163 2 1 100 116 95 0.998 12.13 5.15 Intr + 157673 157888 216 0 0 93 70 107 0.676 6.95 5.16 Term + 157976 158067 92 0 2 87 39 75 0.730 -0.60 5.17 PlyA + 158191 158196 6 1.05 6.00 Prom + 161449 161488 40 -4.85 6.01 Init + 164139 164168 30 2 0 82 110 42 0.750 5.59 6.02 Intr + 172746 172940 195 2 0 85 80 92 0.929 6.79 6.03 Intr + 174299 174413 115 1 1 42 95 115 0.972 6.70 6.04 Intr + 174586 174738 153 2 0 82 82 134 0.998 11.32 6.05 Intr + 177262 177455 194 2 2 104 99 201 0.999 21.09 6.06 Intr + 178305 178418 114 2 0 68 95 30 0.725 1.42 6.07 Term + 178937 179107 171 1 0 80 29 169 0.999 7.04 6.08 PlyA + 179870 179875 6 1.05 7.06 PlyA - 180785 180780 6 1.05 7.05 Term - 184492 184402 91 0 1 74 47 85 0.568 -0.79 7.04 Intr - 188016 187901 116 0 2 21 92 75 0.086 -0.47 7.03 Intr - 192280 192235 46 0 1 107 81 34 0.416 2.09 7.02 Intr - 195581 195408 174 1 0 67 86 78 0.856 3.63 7.01 Init - 196268 196081 188 2 2 91 103 92 0.950 9.59 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 46034 46261 228 1 0 34 53 197 0.803 5.67 S.002 Sngl + 52617 52955 339 2 0 77 32 190 0.883 8.08 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:163339257_163542562|GENSCAN_predicted_peptide_1|305_aa MGEEKPRVCGEKRRAWDLQPAMPEPPPNLWAPAQPEPPQQVPPPASRHPVPSTTQGLRSV GAWHGTGRQLHLQPPCRIHWVKPAGLLSLVCSFTPEPARPRTHQKEETLNTSEHQKEQTP DKPPLRTVTLTTWRPRKDSGLLGWIQSPPCCVQPCNLVPYTLATPAMAKRSQCIAQVIAS EGASPKPWQLPHGVGPAGVQKSRSEIWEPLPRFQRMYGNTWMSSQKIAAGLSVYPNYSWM WDKNSGPAECGYKERLEHCSPLPFTGRGQLPHMMGSSTEVEAAAGPVPRATGRSGTMGLK ELGML >gi568815593f:163339257_163542562|GENSCAN_predicted_CDS_1|918_bp atgggagaagagaaacccagagtgtgtggagagaagaggagggcttgggacctgcagcct gccatgcctgagcctccccccaacctgtgggctcctgcgcagcccgagcctccccaacaa gtgccaccccctgcttcacggcacccagtcccatcgaccacccaagggctgaggagtgtg ggcgcatggcatgggactggcaggcagctccacctgcagcccccatgcaggatccactgg gtgaagccagctgggctcctgagtctggtctgcagcttcactcctgagccagcgagacca cgaacccaccagaaggaagaaactctgaacacatccgaacatcagaaggaacaaactccg gacaagccgcctttaagaactgtaacactcaccacctggaggcctaggaaggatagtggt ctcctgggctggattcagagccctccttgctgtgtgcagccttgcaacttggtgccctac accctagccactccagccatggctaaaaggagccaatgtatagctcaggtcattgcttca gagggtgcaagccccaagccttggcagcttccacatggtgttgggcctgcaggtgtgcag aagtcaagaagtgagatttgggaacctctgcctagatttcagaggatgtatggaaacacc tggatgtccagtcaaaagattgctgcagggttgtcagtgtaccctaattattcttggatg tgggacaagaactcgggacctgctgaatgtgggtacaaagaaagactggaacactgtagc cctctgcccttcactggcagagggcagctgccccacatgatgggaagcagcactgaggtg gaagcagcagcagggccagtccccagagccacaggccggagtgggacaatgggactaaaa gagcttggcatgctgtaa >gi568815593f:163339257_163542562|GENSCAN_predicted_peptide_2|122_aa MGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRINRQPTEWEKILAIYSSDKGL ISRIYNELKQIYKKKTNNPINKWVKDMNRHFSKEDIYAPKRHMKKCSSSLAIREIRIKTT MR >gi568815593f:163339257_163542562|GENSCAN_predicted_CDS_2|369_bp atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaaccaaagccaaaattgac aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagaata aacaggcaacctacagaatgggagaaaattttggcaatctactcatctgacaaagggcta atatccagaatctacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatc aacaagtgggtgaaggatatgaacagacacttctcaaaagaagacatttatgcacccaaa agacacatgaaaaaatgctcatcatcactggccatcagagaaatacgaatcaaaaccaca atgagataa >gi568815593f:163339257_163542562|GENSCAN_predicted_peptide_3|369_aa MTDVLIKSGNFETARHRERILHENSGREQCDDSTSQGTANIANKPPEARRESPSQVSEGI NSAKNVIVDFQPPELYVDTSFSSRINLNQQLGSTQNLHFSDDVFEAQKGPLEVGGSTAAM TLGRKAPCQADLGFRLLELSTPLAVSSAALPFSRVLFCGIALGLSDLISWGEMIEVLTTT DSQKLLHQLNALLEQESRCQPKVCGLRLIESAHDNGLRMTARLRDFEVKDLLSLTQFFGF DTETFSLAVNLLDRFLSKMKVQPKHLGCVGLSCFYLAVKSIEEERNVPLATDLIRISQYR FTVSDLMRMEKIVLEKPSVLALSIIALEIQAQKCVELTEGIECLQKHSKKTGDICIYDKN FYTEPTASN >gi568815593f:163339257_163542562|GENSCAN_predicted_CDS_3|1110_bp atgactgatgttcttataaaaagtggaaattttgagacagcaagacacagagagcgaata ctacatgaaaattcaggcagagagcaatgtgatgattctacaagtcaaggaacagcaaac attgccaacaaaccaccagaagccaggcgagagtctccctctcaagtctcagagggaatc aactctgccaaaaatgtgatcgtggatttccaacctccggaactttatgtggacacatca ttcagttcccgaatcaatttgaatcaacagctaggttccacacaaaaccttcatttttca gatgatgtatttgaagcacagaaagggccccttgaagtgggcggctccaccgcagccatg acactggggcgcaaggcaccttgccaagccgacctaggtttcaggcttctggagttgagc actccattggcagtttcttctgcagctttgccattctctagagttttgttctgtggcatt gccttgggcctctcggatctgatatcgtggggtgagatgatagaggtactgacaacaact gactctcagaaactgctacaccagctgaatgccctgttggaacaggagtctagatgtcag ccaaaggtctgtggtttgagactaattgagtctgcacacgataatggcctcagaatgact gcaagactaagggactttgaagtaaaagatcttcttagtctaactcagttctttggcttt gacacagagacattttctctagctgtgaatttactggacagattcctgtctaaaatgaag gtacagcccaagcaccttgggtgtgttggactgagctgcttttatttggctgtaaaatca atagaagaggaaaggaatgtcccattggcaactgacttgatccgaataagtcaatatagg tttacggtttcagacttgatgagaatggaaaagattgtattggagaagccttctgtgttg gcattgtctatcattgcattagagatccaagcacagaagtgtgtagagttaacagaagga atagaatgtcttcagaaacattccaagaaaacaggtgacatttgtatctacgataaaaat ttttatacagaacctactgcctcaaactga >gi568815593f:163339257_163542562|GENSCAN_predicted_peptide_4|145_aa MPALEAALDILGARAWRHLNFNEHLLQGLVPLAPRGPARYHPTPLLKRGRHNPVPPGRGR TRRSRAHARNPTLRLRLGLSARVTAHAQERKLALWCACSFFPLNGRLRHRQAPDERTEIQ KSSHATNNGTELEPVKPELTTPSPK >gi568815593f:163339257_163542562|GENSCAN_predicted_CDS_4|438_bp atgccggctctggaggccgcactggatatcctgggcgcgcgtgcctggcggcacctgaac ttcaatgaacacctcctccaaggtctggtaccactggccccacggggtcccgcacggtac caccccactccgctcctcaaacggggccgacataatccagtccctcccggccgcggccgc accaggcggagccgagcgcacgcgcggaatcccacgcttaggctacgcctcggcctctcc gctcgggtcactgcgcatgcgcaggaacgcaagctagcgctttggtgtgcgtgttcgttt ttccctttgaatggccgtttacggcaccggcaggccccggatgaaagaactgaaatccag aaaagttcacatgcaacgaacaatgggacagaattggaaccagtgaagccggaactcact acaccgtcccctaagtga >gi568815593f:163339257_163542562|GENSCAN_predicted_peptide_5|712_aa MLYAGTVLLDLVYSFQKPSEEGCAPSPGAYDVKTLEVLKGPVSFQKSQRFKQQKESKQNL NVDKDTTLPASARKVKSSESKIRVLLQERGAQDRRIQDLETELEKMEARLNAALREKTSL SANNATLEKQLIELTRTNELLKSKGMMAKQEGMEMKLQVTQRSLEESQGKIAQLEGKLVS IEKEKIDEKSETEKLLEYIEEISCASDQVEKYKLDIAQLEENLKEKNDEILSLKQSLEEN IVILSKQVEDLNVKCQLLEKEKEDHVNRNREHNENLNAEMQNLKQKFILEQQEREKLQQK ELQIDSLLQQEKELSSSLHQKLCSFQEEMVKEKNLFEEELKQTLDELDKLQQKEEQAERL VKQLEEEAKSRAEELKLLEEKLKGKEAELEKSSAAHTQATLLLQEKYDSMVQSLEDVTAQ FESYKALTASEIEDLKLENSSLQEKAAKAGKNAEDVQHQILATESSNQEYVRMLLDLQTK SALKETEIKEITVSFLQKITDLQNQLKQQEEDFRKQLEDEEGRKAEKENTTAELTEEINK WRLLYEELYNKTKPFQEVSKLRCQLAKKKQSETKLQEELNKVLGIKHFDPSKAFHHESKE NFALKTPLKEEQLLVPKSVLVTVLQRNRTSKRYIEKLIYYEELADMIMKAEKACNLPFAG WRPRKAGVVQYKLESLRTIIAEAFNGPSISFSASTNSNINLIQKYPHRQPLK >gi568815593f:163339257_163542562|GENSCAN_predicted_CDS_5|2139_bp atgctgtatgctgggactgtgctactagatcttgtttattccttccaaaagccaagtgag gaaggttgtgcaccatctccaggtgcttatgatgttaaaactttagaagtattgaaagga ccagtatcctttcagaaatcacaaagatttaaacaacaaaaagaatctaaacaaaatctt aatgttgacaaagatactaccttgcctgcttcagctagaaaagttaagtcttcggaatca aagattcgtgttcttctacaggaacgtggtgcccaggacaggcggatccaggatctggaa actgagttggaaaagatggaagcaaggctaaatgctgcactaagggaaaaaacatctctc tctgcaaataatgctacactggaaaaacaacttattgaattgaccaggactaatgaacta ctaaaatctaagggtatgatggctaagcaagaaggcatggagatgaagctgcaggtcacc caaaggagtctcgaagagtctcaagggaaaatagcccaactggagggaaaacttgtttca atagagaaagaaaagattgatgaaaaatctgaaacagaaaaactcttggaatacatcgaa gaaattagttgtgcttcagatcaagtggaaaaatacaagctagatattgcccagttagaa gaaaatttgaaagagaagaatgatgaaattttaagccttaagcagtctcttgaggagaat attgttatattatctaaacaagtagaagatctaaatgtgaaatgtcagctgcttgaaaaa gaaaaagaagaccatgtcaacaggaatagagaacacaacgaaaatctaaatgcagagatg caaaacttaaaacagaagtttattcttgaacaacaggaacgtgaaaagcttcaacaaaaa gaattacaaattgattcacttctgcaacaagagaaagaattatcttcgagtcttcatcag aagctctgttcttttcaagaggaaatggttaaagagaagaatctgtttgaggaagaatta aagcaaacactggatgagcttgataaattacagcaaaaggaggaacaagctgaaaggctg gtcaagcaattggaagaggaagcaaaatctagagctgaagaattaaaactcctagaagaa aagctgaaagggaaggaggctgaactggagaaaagtagtgctgctcatacccaggccacc ctgcttttgcaggaaaagtatgacagtatggtgcaaagccttgaagatgttactgctcaa tttgaaagctataaagcgttaacagccagtgagatagaagatcttaagctggagaactca tcattacaggaaaaagcggccaaggctgggaaaaatgcagaggatgttcagcatcagatt ttggcaactgagagctcaaatcaagaatatgtaaggatgcttctagatctgcagaccaag tcagcactaaaggaaacagaaattaaagaaatcacagtttcttttcttcaaaaaataact gatttgcagaaccaactcaagcaacaggaggaagactttagaaaacagctggaagatgaa gaaggaagaaaagctgaaaaagaaaatacaacagcagaattaactgaagaaattaacaag tggcgtctcctctatgaagaactatataataaaacaaaaccttttcaggaagtatcaaaa ctccgctgtcagcttgctaaaaaaaaacaaagtgagacaaaacttcaagaggaattgaat aaagttctaggtatcaaacactttgatccttcaaaggcttttcatcatgaaagtaaagaa aattttgccctgaagaccccattaaaagaagaacaactcttggtaccaaaatctgtatta gtcacagttctccagagaaacagaaccagtaagagatacatagaaaagttgatttactat gaggaattggctgacatgattatgaaggctgagaaggcttgcaacctgccatttgcaggc tggagacccaggaaagcaggtgttgttcagtacaagcttgaaagcctgagaaccatcata gcagaagccttcaatggaccctcaatcagctttagtgcatccactaattcaaatattaac ctcattcagaaataccctcacagacagcccctgaagtag >gi568815593f:163339257_163542562|GENSCAN_predicted_peptide_6|323_aa MPEMPEDMEQEEVNIPNRRVLVTGATGLLGRAVHKEFQQNNWHAVGCGFRRARPKFEQVN LLDSNAVHHIIHDFQPHVIVHCAAERRPDVVENQPDAASQLNVDASGNLAKEAAAVGAFL IYISSDYVFDGTNPPYREEDIPAPLNLYGKTKLDGEKAVLENNLGAAVLRIPILYGEVEK LEESAVTVMFDKVQFSNKSANMDHWQQRFPTHVKDVATVCRQLAEKRMLDPSIKGTFHWS GNEQMTKYEMACAIADAFNLPSSHLRPITDSPVLGAQRPRNAQLDCSKLETLGIGQRTPF RIGIKESLWPFLIDKRWRQTVFH >gi568815593f:163339257_163542562|GENSCAN_predicted_CDS_6|972_bp atgcctgaaatgccagaggacatggagcaggaggaagttaacatccctaataggagggtt ctggttactggtgccactgggcttcttggcagagctgtacacaaagaatttcagcagaat aattggcatgcagttggctgtggtttcagaagagcaagaccaaaatttgaacaggttaat ctgttggattctaatgcagttcatcacatcattcatgattttcagccccatgttatagta cattgtgcagcagagagaagaccagatgttgtagaaaatcagccagatgctgcctctcaa cttaatgtggatgcttctgggaatttagcaaaggaagcagctgctgttggagcatttctc atctacattagctcagattatgtatttgatggaacaaatccaccttacagagaggaagac ataccagctcccctaaatttgtatggcaaaacaaaattagatggagaaaaggctgtcctg gagaacaatctaggagctgctgttttgaggattcctattctgtatggggaagttgaaaag ctcgaagaaagtgctgtgactgttatgtttgataaagtgcagttcagcaacaagtcagca aacatggatcactggcagcagaggttccccacacatgtcaaagatgtggccactgtgtgc cggcagctagcagagaagagaatgctggatccatcaattaagggaacctttcactggtct ggcaatgaacagatgactaagtatgaaatggcatgtgcaattgcagatgccttcaacctc cccagcagtcacttaagacctattactgacagccctgtcctaggagcacaacgtccgaga aatgctcagcttgactgctccaaattggagaccttgggcattggccaacgaacaccattt cgaattggaatcaaagaatcactttggcctttcctcattgacaagagatggagacaaacg gtctttcattag >gi568815593f:163339257_163542562|GENSCAN_predicted_peptide_7|204_aa MAREDSTATTAEAFAMGHCHSWSTRLAHQPFVIRYKPTSRWLLLHFISSFKTFPEDHFTY KNINISVLLLRKLKLSQVAKLAQAQTSRVFICKCQDLHSNSGLCDLKADTLPITRASEKQ GLRTGPQAKEGDYLQKALTVEPSHLDLLWNLSQEATERCAAKTRKHNKVKEMKHSFGVNC NYRRTLLKARTIIPLSTFQGPYVF >gi568815593f:163339257_163542562|GENSCAN_predicted_CDS_7|615_bp atggccagggaggattctactgccaccactgccgaggcatttgccatgggtcattgccat tcttggagcaccagactggcccatcagcccttcgtgatacgatacaaacccacaagcagg tggttgctgctccacttcatttcctcattcaaaacatttcctgaagatcatttcacttac aagaacataaatatctctgttttactgttgaggaaactaaagctcagtcaagttgctaaa cttgcccaagctcaaacttccagggtgtttatatgtaaatgccaagatttgcattcaaac tcaggtctctgtgaccttaaagctgatactctcccaattacaagagccagtgaaaaacag gggttgaggacagggccgcaagccaaggaaggtgattacctccaaaaggcactcacagtc gagccttcacatttggatctgttatggaacctttctcaggaagctactgaaagatgtgca gccaaaactagaaagcacaacaaagtgaaagaaatgaaacacagtttcggggtgaattgt aactacagacgaacacttctgaaggccagaaccatcatccctctttcaactttccaagga ccttatgtgttctaa