GENSCAN 1.0 Date run: 3-Nov-116 Time: 11:38:24 Sequence gi568815593f:163405687_163618360 : 212674 bp : 38.94% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 7346 7481 136 0 1 94 82 70 0.317 5.71 1.02 Intr + 30800 30825 26 2 2 125 106 2 0.076 2.45 1.03 Intr + 32036 32064 29 0 2 107 83 10 0.134 -0.68 1.04 Intr + 33571 33834 264 0 0 76 95 153 0.953 11.59 1.05 Intr + 35392 35559 168 0 0 81 93 100 0.933 9.02 1.06 Intr + 36359 36457 99 1 0 49 95 49 0.619 1.09 1.07 Term + 38651 38713 63 1 0 102 41 27 0.363 -3.69 1.08 PlyA + 38756 38761 6 1.05 2.00 Prom + 48443 48482 40 -4.75 2.01 Sngl + 54212 54649 438 1 0 70 48 405 0.991 28.71 2.02 PlyA + 55299 55304 6 1.05 3.00 Prom + 55712 55751 40 -12.33 3.01 Init + 56112 56175 64 2 1 53 79 24 0.416 -0.64 3.02 Intr + 58170 58268 99 1 0 107 85 57 0.505 6.46 3.03 Intr + 59037 59116 80 1 2 90 100 56 0.513 5.35 3.04 Intr + 63955 64143 189 0 0 70 65 130 0.492 7.76 3.05 Intr + 65677 65777 101 0 2 58 93 59 0.946 1.39 3.06 Intr + 67493 67567 75 2 0 47 58 120 0.859 2.61 3.07 Intr + 67693 67871 179 1 2 45 62 167 0.981 8.44 3.08 Intr + 68371 68519 149 2 2 74 25 135 0.948 4.83 3.09 Intr + 69772 69986 215 0 2 30 9 257 0.899 8.59 3.10 Intr + 72998 73114 117 2 0 77 63 112 0.964 6.26 3.11 Intr + 76956 77102 147 0 0 56 71 207 0.997 14.23 3.12 Intr + 77334 77486 153 0 0 64 84 142 0.996 9.67 3.13 Intr + 77582 77681 100 2 1 83 72 86 0.995 5.69 3.14 Intr + 84704 84866 163 1 1 100 116 95 0.998 12.13 3.15 Intr + 91243 91458 216 2 0 93 70 107 0.676 6.95 3.16 Term + 91546 91637 92 2 2 87 39 75 0.730 -0.60 3.17 PlyA + 91761 91766 6 1.05 4.00 Prom + 95019 95058 40 -4.85 4.01 Init + 97709 97738 30 1 0 82 110 42 0.750 5.59 4.02 Intr + 106316 106510 195 1 0 85 80 92 0.929 6.79 4.03 Intr + 107869 107983 115 0 1 42 95 115 0.972 6.70 4.04 Intr + 108156 108308 153 1 0 82 82 134 0.998 11.32 4.05 Intr + 110832 111025 194 1 2 104 99 201 0.999 21.09 4.06 Intr + 111875 111988 114 1 0 68 95 30 0.725 1.42 4.07 Term + 112507 112677 171 0 0 80 29 169 0.999 7.04 4.08 PlyA + 113440 113445 6 1.05 5.06 PlyA - 114355 114350 6 1.05 5.05 Term - 118062 117972 91 2 1 74 47 85 0.568 -0.79 5.04 Intr - 121586 121471 116 2 2 21 92 75 0.086 -0.47 5.03 Intr - 125850 125805 46 2 1 107 81 34 0.416 2.09 5.02 Intr - 129151 128978 174 0 0 67 86 78 0.856 3.63 5.01 Init - 129838 129651 188 1 2 91 103 92 0.950 9.59 5.00 Prom - 131111 131072 40 -4.45 6.04 PlyA - 132063 132058 6 1.05 6.03 Term - 154515 154324 192 0 0 70 46 143 0.853 4.74 6.02 Intr - 159446 159322 125 0 2 76 83 119 0.808 9.58 6.01 Init - 176331 176244 88 0 1 88 32 135 0.888 8.75 6.00 Prom - 176534 176495 40 -5.85 7.02 PlyA - 177364 177359 6 1.05 7.01 Sngl - 187054 186203 852 1 0 78 48 222 0.726 12.75 7.00 Prom - 204281 204242 40 -4.85 8.03 PlyA - 205634 205629 6 1.05 8.02 Term - 207194 207138 57 2 0 89 41 86 0.286 0.71 8.01 Intr - 209767 209611 157 0 1 14 97 196 0.298 12.19 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:163405687_163618360|GENSCAN_predicted_peptide_1|261_aa XPLEVGGSTAAMTLGRKAPCQADLGFRLLELSTPLAVSSAALPFSRVLFCGIALGLSDLI SWGEMIEVLTTTDSQKLLHQLNALLEQESRCQPKVCGLRLIESAHDNGLRMTARLRDFEV KDLLSLTQFFGFDTETFSLAVNLLDRFLSKMKVQPKHLGCVGLSCFYLAVKSIEEERNVP LATDLIRISQYRFTVSDLMRMEKIVLEKPSVLALSIIALEIQAQKCVELTEGIECLQKHS KKTGDICIYDKNFYTEPTASN >gi568815593f:163405687_163618360|GENSCAN_predicted_CDS_1|786_bp nggccccttgaagtgggcggctccaccgcagccatgacactggggcgcaaggcaccttgc caagccgacctaggtttcaggcttctggagttgagcactccattggcagtttcttctgca gctttgccattctctagagttttgttctgtggcattgccttgggcctctcggatctgata tcgtggggtgagatgatagaggtactgacaacaactgactctcagaaactgctacaccag ctgaatgccctgttggaacaggagtctagatgtcagccaaaggtctgtggtttgagacta attgagtctgcacacgataatggcctcagaatgactgcaagactaagggactttgaagta aaagatcttcttagtctaactcagttctttggctttgacacagagacattttctctagct gtgaatttactggacagattcctgtctaaaatgaaggtacagcccaagcaccttgggtgt gttggactgagctgcttttatttggctgtaaaatcaatagaagaggaaaggaatgtccca ttggcaactgacttgatccgaataagtcaatataggtttacggtttcagacttgatgaga atggaaaagattgtattggagaagccttctgtgttggcattgtctatcattgcattagag atccaagcacagaagtgtgtagagttaacagaaggaatagaatgtcttcagaaacattcc aagaaaacaggtgacatttgtatctacgataaaaatttttatacagaacctactgcctca aactga >gi568815593f:163405687_163618360|GENSCAN_predicted_peptide_2|145_aa MPALEAALDILGARAWRHLNFNEHLLQGLVPLAPRGPARYHPTPLLKRGRHNPVPPGRGR TRRSRAHARNPTLRLRLGLSARVTAHAQERKLALWCACSFFPLNGRLRHRQAPDERTEIQ KSSHATNNGTELEPVKPELTTPSPK >gi568815593f:163405687_163618360|GENSCAN_predicted_CDS_2|438_bp atgccggctctggaggccgcactggatatcctgggcgcgcgtgcctggcggcacctgaac ttcaatgaacacctcctccaaggtctggtaccactggccccacggggtcccgcacggtac caccccactccgctcctcaaacggggccgacataatccagtccctcccggccgcggccgc accaggcggagccgagcgcacgcgcggaatcccacgcttaggctacgcctcggcctctcc gctcgggtcactgcgcatgcgcaggaacgcaagctagcgctttggtgtgcgtgttcgttt ttccctttgaatggccgtttacggcaccggcaggccccggatgaaagaactgaaatccag aaaagttcacatgcaacgaacaatgggacagaattggaaccagtgaagccggaactcact acaccgtcccctaagtga >gi568815593f:163405687_163618360|GENSCAN_predicted_peptide_3|712_aa MLYAGTVLLDLVYSFQKPSEEGCAPSPGAYDVKTLEVLKGPVSFQKSQRFKQQKESKQNL NVDKDTTLPASARKVKSSESKIRVLLQERGAQDRRIQDLETELEKMEARLNAALREKTSL SANNATLEKQLIELTRTNELLKSKGMMAKQEGMEMKLQVTQRSLEESQGKIAQLEGKLVS IEKEKIDEKSETEKLLEYIEEISCASDQVEKYKLDIAQLEENLKEKNDEILSLKQSLEEN IVILSKQVEDLNVKCQLLEKEKEDHVNRNREHNENLNAEMQNLKQKFILEQQEREKLQQK ELQIDSLLQQEKELSSSLHQKLCSFQEEMVKEKNLFEEELKQTLDELDKLQQKEEQAERL VKQLEEEAKSRAEELKLLEEKLKGKEAELEKSSAAHTQATLLLQEKYDSMVQSLEDVTAQ FESYKALTASEIEDLKLENSSLQEKAAKAGKNAEDVQHQILATESSNQEYVRMLLDLQTK SALKETEIKEITVSFLQKITDLQNQLKQQEEDFRKQLEDEEGRKAEKENTTAELTEEINK WRLLYEELYNKTKPFQEVSKLRCQLAKKKQSETKLQEELNKVLGIKHFDPSKAFHHESKE NFALKTPLKEEQLLVPKSVLVTVLQRNRTSKRYIEKLIYYEELADMIMKAEKACNLPFAG WRPRKAGVVQYKLESLRTIIAEAFNGPSISFSASTNSNINLIQKYPHRQPLK >gi568815593f:163405687_163618360|GENSCAN_predicted_CDS_3|2139_bp atgctgtatgctgggactgtgctactagatcttgtttattccttccaaaagccaagtgag gaaggttgtgcaccatctccaggtgcttatgatgttaaaactttagaagtattgaaagga ccagtatcctttcagaaatcacaaagatttaaacaacaaaaagaatctaaacaaaatctt aatgttgacaaagatactaccttgcctgcttcagctagaaaagttaagtcttcggaatca aagattcgtgttcttctacaggaacgtggtgcccaggacaggcggatccaggatctggaa actgagttggaaaagatggaagcaaggctaaatgctgcactaagggaaaaaacatctctc tctgcaaataatgctacactggaaaaacaacttattgaattgaccaggactaatgaacta ctaaaatctaagggtatgatggctaagcaagaaggcatggagatgaagctgcaggtcacc caaaggagtctcgaagagtctcaagggaaaatagcccaactggagggaaaacttgtttca atagagaaagaaaagattgatgaaaaatctgaaacagaaaaactcttggaatacatcgaa gaaattagttgtgcttcagatcaagtggaaaaatacaagctagatattgcccagttagaa gaaaatttgaaagagaagaatgatgaaattttaagccttaagcagtctcttgaggagaat attgttatattatctaaacaagtagaagatctaaatgtgaaatgtcagctgcttgaaaaa gaaaaagaagaccatgtcaacaggaatagagaacacaacgaaaatctaaatgcagagatg caaaacttaaaacagaagtttattcttgaacaacaggaacgtgaaaagcttcaacaaaaa gaattacaaattgattcacttctgcaacaagagaaagaattatcttcgagtcttcatcag aagctctgttcttttcaagaggaaatggttaaagagaagaatctgtttgaggaagaatta aagcaaacactggatgagcttgataaattacagcaaaaggaggaacaagctgaaaggctg gtcaagcaattggaagaggaagcaaaatctagagctgaagaattaaaactcctagaagaa aagctgaaagggaaggaggctgaactggagaaaagtagtgctgctcatacccaggccacc ctgcttttgcaggaaaagtatgacagtatggtgcaaagccttgaagatgttactgctcaa tttgaaagctataaagcgttaacagccagtgagatagaagatcttaagctggagaactca tcattacaggaaaaagcggccaaggctgggaaaaatgcagaggatgttcagcatcagatt ttggcaactgagagctcaaatcaagaatatgtaaggatgcttctagatctgcagaccaag tcagcactaaaggaaacagaaattaaagaaatcacagtttcttttcttcaaaaaataact gatttgcagaaccaactcaagcaacaggaggaagactttagaaaacagctggaagatgaa gaaggaagaaaagctgaaaaagaaaatacaacagcagaattaactgaagaaattaacaag tggcgtctcctctatgaagaactatataataaaacaaaaccttttcaggaagtatcaaaa ctccgctgtcagcttgctaaaaaaaaacaaagtgagacaaaacttcaagaggaattgaat aaagttctaggtatcaaacactttgatccttcaaaggcttttcatcatgaaagtaaagaa aattttgccctgaagaccccattaaaagaagaacaactcttggtaccaaaatctgtatta gtcacagttctccagagaaacagaaccagtaagagatacatagaaaagttgatttactat gaggaattggctgacatgattatgaaggctgagaaggcttgcaacctgccatttgcaggc tggagacccaggaaagcaggtgttgttcagtacaagcttgaaagcctgagaaccatcata gcagaagccttcaatggaccctcaatcagctttagtgcatccactaattcaaatattaac ctcattcagaaataccctcacagacagcccctgaagtag >gi568815593f:163405687_163618360|GENSCAN_predicted_peptide_4|323_aa MPEMPEDMEQEEVNIPNRRVLVTGATGLLGRAVHKEFQQNNWHAVGCGFRRARPKFEQVN LLDSNAVHHIIHDFQPHVIVHCAAERRPDVVENQPDAASQLNVDASGNLAKEAAAVGAFL IYISSDYVFDGTNPPYREEDIPAPLNLYGKTKLDGEKAVLENNLGAAVLRIPILYGEVEK LEESAVTVMFDKVQFSNKSANMDHWQQRFPTHVKDVATVCRQLAEKRMLDPSIKGTFHWS GNEQMTKYEMACAIADAFNLPSSHLRPITDSPVLGAQRPRNAQLDCSKLETLGIGQRTPF RIGIKESLWPFLIDKRWRQTVFH >gi568815593f:163405687_163618360|GENSCAN_predicted_CDS_4|972_bp atgcctgaaatgccagaggacatggagcaggaggaagttaacatccctaataggagggtt ctggttactggtgccactgggcttcttggcagagctgtacacaaagaatttcagcagaat aattggcatgcagttggctgtggtttcagaagagcaagaccaaaatttgaacaggttaat ctgttggattctaatgcagttcatcacatcattcatgattttcagccccatgttatagta cattgtgcagcagagagaagaccagatgttgtagaaaatcagccagatgctgcctctcaa cttaatgtggatgcttctgggaatttagcaaaggaagcagctgctgttggagcatttctc atctacattagctcagattatgtatttgatggaacaaatccaccttacagagaggaagac ataccagctcccctaaatttgtatggcaaaacaaaattagatggagaaaaggctgtcctg gagaacaatctaggagctgctgttttgaggattcctattctgtatggggaagttgaaaag ctcgaagaaagtgctgtgactgttatgtttgataaagtgcagttcagcaacaagtcagca aacatggatcactggcagcagaggttccccacacatgtcaaagatgtggccactgtgtgc cggcagctagcagagaagagaatgctggatccatcaattaagggaacctttcactggtct ggcaatgaacagatgactaagtatgaaatggcatgtgcaattgcagatgccttcaacctc cccagcagtcacttaagacctattactgacagccctgtcctaggagcacaacgtccgaga aatgctcagcttgactgctccaaattggagaccttgggcattggccaacgaacaccattt cgaattggaatcaaagaatcactttggcctttcctcattgacaagagatggagacaaacg gtctttcattag >gi568815593f:163405687_163618360|GENSCAN_predicted_peptide_5|204_aa MAREDSTATTAEAFAMGHCHSWSTRLAHQPFVIRYKPTSRWLLLHFISSFKTFPEDHFTY KNINISVLLLRKLKLSQVAKLAQAQTSRVFICKCQDLHSNSGLCDLKADTLPITRASEKQ GLRTGPQAKEGDYLQKALTVEPSHLDLLWNLSQEATERCAAKTRKHNKVKEMKHSFGVNC NYRRTLLKARTIIPLSTFQGPYVF >gi568815593f:163405687_163618360|GENSCAN_predicted_CDS_5|615_bp atggccagggaggattctactgccaccactgccgaggcatttgccatgggtcattgccat tcttggagcaccagactggcccatcagcccttcgtgatacgatacaaacccacaagcagg tggttgctgctccacttcatttcctcattcaaaacatttcctgaagatcatttcacttac aagaacataaatatctctgttttactgttgaggaaactaaagctcagtcaagttgctaaa cttgcccaagctcaaacttccagggtgtttatatgtaaatgccaagatttgcattcaaac tcaggtctctgtgaccttaaagctgatactctcccaattacaagagccagtgaaaaacag gggttgaggacagggccgcaagccaaggaaggtgattacctccaaaaggcactcacagtc gagccttcacatttggatctgttatggaacctttctcaggaagctactgaaagatgtgca gccaaaactagaaagcacaacaaagtgaaagaaatgaaacacagtttcggggtgaattgt aactacagacgaacacttctgaaggccagaaccatcatccctctttcaactttccaagga ccttatgtgttctaa >gi568815593f:163405687_163618360|GENSCAN_predicted_peptide_6|134_aa MWIDITQPDQTLMNKKSPLITCLDVTALTRRVSWLVLSEDLQNPERLGDIRNPAGPSPSV KGTLQSFCDRGREILISYHFTCMRMAMTTKTITSLGEDAQNWNLHTLLLGMENGAAAVEN SSVVSENVKYRATI >gi568815593f:163405687_163618360|GENSCAN_predicted_CDS_6|405_bp atgtggattgacatcacccaacctgaccagacgctgatgaacaagaagagtccactgatc acttgcttggatgtcacagcactgacaagaagagtgagctggttagttctgagtgaagac ctccagaacccagagagactgggggacattaggaacccagcaggaccaagcccatccgtg aaaggaacacttcagagtttctgtgaccgcggaagagaaatcctaataagttatcacttc acatgcatgaggatggcaatgaccacaaagacaataacaagtcttggtgaggatgcacag aactggaaccttcatacgttgctgctagggatggaaaatggtgcagctgccgtggaaaac agttcagtagtttctgaaaatgttaaatatagagctaccatatga >gi568815593f:163405687_163618360|GENSCAN_predicted_peptide_7|283_aa MRPDFKLYYKATVTKTAWYLYQNRDIDQWNRIEPSEIMPRIYNYLIFDKPDKNKKWGKDS LFNKWCSENWLAICRKLKLDPFLTPYTKINSRWIKELNVRPKTIKTLEENLGNTIQDIGM GKDFISKTPKAMATKAKIDKWDLIKLKSFCTAKETTITVNRQPTEWEKILAIYSSDKGLI SRIYKEHKQIYKKKTNNPINKWAKDMNRHFSKEDIYAANRHMKKCSSALAIREMQIKTTM RYHLTPVRMAIIKKSGNNRCWRGCGEIGTLLYCWWDCKLVQPL >gi568815593f:163405687_163618360|GENSCAN_predicted_CDS_7|852_bp atgcgacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtacttg tatcaaaacagagatatagaccaatggaatagaatagagccctcagaaataatgccacgt atctacaactatctgatctttgacaaacctgataaaaacaagaaatggggaaaagattcc ctatttaataaatggtgctcagaaaactggctagccatatgtagaaagctgaaactggat cccttccttacaccttatacaaaaattaattcaagatggattaaagaattaaatgttaga cctaaaaccataaaaaccctagaagaaaacctaggcaataccattcaggacataggcatg ggcaaggacttcatatctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaa tgggatctaattaaattaaagagcttctgcacagcaaaagaaactaccatcacagtgaac aggcaacctacagaatgggagaaaattttggcaatctactcatctgacaaagggctaata tccagaatctacaaagaacacaaacaaatttacaagaaaaaaacaaacaaccccatcaac aagtgggcaaaggatatgaacagacacttctcaaaagaagacatttatgcagccaacaga cacatgaaaaaatgctcatcagcactggccatcagagaaatgcaaatcaaaaccacaatg agatatcatctcacaccagttagaatggcgatcattaaaaagtcagggaacaacaggtgc tggagaggatgtggagaaataggaacacttttatactgttggtgggactgtaaactagtt caaccattgtga >gi568815593f:163405687_163618360|GENSCAN_predicted_peptide_8|71_aa XLCPVPPAKKNDSRSSSTAPPPRTGVTTYKQLAGLWEFESKALSQSKRDDDEPNQLLLAL REPSLQKAEST >gi568815593f:163405687_163618360|GENSCAN_predicted_CDS_8|216_bp nncctttgcccagtgcccccagctaagaagaatgattcccgatcgagcagcacggcacca ccaccaagaacaggggttactacctataaacaacttgcaggattgtgggagtttgaatca aaagcactgtcccaatcaaagcgagatgatgatgaaccgaaccagctgcttctagctctc cgagagccctctcttcagaaggcagagtcgacctga