GENSCAN 1.0 Date run: 6-Nov-116 Time: 17:18:52 Sequence gi568815594r:156663093_157070903 : 407811 bp : 36.29% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3060 3159 100 2 1 63 46 84 0.139 0.46 1.02 Intr + 10942 10983 42 2 0 67 116 44 0.167 2.39 1.03 Intr + 31377 31553 177 1 0 36 40 119 0.067 0.87 1.04 Term + 43400 43902 503 0 2 5 42 240 0.122 4.76 1.05 PlyA + 46659 46664 6 1.05 2.00 Prom + 48054 48093 40 -3.55 2.01 Sngl + 61515 61775 261 2 0 57 47 174 0.620 5.11 2.02 PlyA + 62565 62570 6 1.05 3.02 PlyA - 63598 63593 6 1.05 3.01 Sngl - 75959 75666 294 2 0 79 35 163 0.642 5.55 3.00 Prom - 80732 80693 40 -6.05 4.00 Prom + 86477 86516 40 -5.55 4.01 Init + 96471 96598 128 2 2 81 53 152 0.812 10.68 4.02 Term + 96956 97115 160 2 1 -39 41 208 0.207 -0.17 4.03 PlyA + 97690 97695 6 1.05 5.05 PlyA - 98553 98548 6 1.05 5.04 Term - 100114 100038 77 2 2 114 55 60 0.378 2.32 5.03 Intr - 109801 109594 208 1 1 71 99 193 0.570 16.43 5.02 Intr - 147925 147745 181 0 1 141 100 84 0.939 13.85 5.01 Init - 152941 152859 83 1 2 74 46 -1 0.170 -5.21 5.00 Prom - 153462 153423 40 -5.95 6.00 Prom + 156151 156190 40 -8.45 6.01 Init + 156209 156958 750 1 0 57 93 253 0.359 17.45 6.02 Term + 165251 165319 69 1 0 93 38 114 0.529 3.86 6.03 PlyA + 165715 165720 6 1.05 7.00 Prom + 169485 169524 40 -4.65 7.01 Sngl + 176509 177186 678 0 0 49 39 359 0.839 23.03 7.02 PlyA + 179139 179144 6 1.05 8.03 PlyA - 179716 179711 6 1.05 8.02 Term - 187324 187125 200 2 2 79 42 134 0.335 4.48 8.01 Init - 197306 197120 187 2 1 78 36 106 0.602 3.67 8.00 Prom - 199587 199548 40 -3.35 9.00 Prom + 204134 204173 40 -5.25 9.01 Init + 212328 212430 103 2 1 71 94 63 0.333 5.65 9.02 Term + 217312 217502 191 2 2 50 45 137 0.716 2.23 9.03 PlyA + 217649 217654 6 1.05 10.05 PlyA - 218958 218953 6 1.05 10.04 Term - 238263 238181 83 1 2 89 49 73 0.092 0.38 10.03 Intr - 260262 260203 60 1 0 31 96 125 0.620 5.39 10.02 Intr - 263943 263649 295 0 1 38 6 211 0.569 3.56 10.01 Init - 264588 264385 204 0 0 48 45 153 0.621 5.90 10.00 Prom - 269457 269418 40 -4.15 11.00 Prom + 290559 290598 40 -2.75 11.01 Init + 293191 293260 70 0 1 102 48 49 0.680 3.56 11.02 Term + 302599 302756 158 2 2 63 38 128 0.517 2.41 11.03 PlyA + 303072 303077 6 1.05 12.00 Prom + 307136 307175 40 -6.85 12.01 Init + 307677 307948 272 2 2 77 30 269 0.955 16.53 12.02 Intr + 308338 308705 368 1 2 63 24 150 0.478 -0.04 12.03 Term + 308963 309075 113 0 2 54 40 173 0.985 6.84 12.04 PlyA + 309311 309316 6 1.05 13.04 PlyA - 310412 310407 6 1.05 13.03 Term - 313886 313774 113 0 2 84 42 115 0.415 4.24 13.02 Intr - 334024 333935 90 2 0 38 92 66 0.519 1.05 13.01 Init - 334709 334373 337 2 1 72 30 257 0.574 15.79 13.00 Prom - 337960 337921 40 -8.55 14.07 PlyA - 339121 339116 6 1.05 14.06 Term - 339343 339191 153 1 0 26 43 167 0.657 2.94 14.05 Intr - 340045 339881 165 1 0 79 63 45 0.334 0.34 14.04 Intr - 342411 342244 168 0 0 124 66 33 0.645 3.92 14.03 Intr - 378186 378109 78 0 0 90 84 14 0.030 0.03 14.02 Intr - 386602 386485 118 0 1 47 56 113 0.171 3.55 14.01 Init - 389587 389481 107 1 2 76 89 57 0.149 4.37 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:156663093_157070903|GENSCAN_predicted_peptide_1|273_aa STAGDQLALGFWRALRTQESWVACGLLLFQRTGVRLKQTAKNKDKVQESKVAFQEEYAFH LHEFARTTLINCQELGTVKSRNLLSQHSGGYKSKIDVSAGLFPLEGYRSTRQKVTKDIQE LNSALHQADLIDIYRTLHPKSIEYTFFSAPHRTYSKIDHIVGSKALLSKCKRTEIITNCL SDHSAIKLELRIKKLTQNRSTTWKLNNLFLNDYWVHNEMKTEIKMFFETNENKDTTYQNL WDTFKAVCRGKLIALNAHKRKQERSKIDTLTHS >gi568815594r:156663093_157070903|GENSCAN_predicted_CDS_1|822_bp agcactgctggggatcagcttgctctaggtttctggagagcactgagaacacaagagagc tgggtggcttgtggcttgctgctgttccagaggacaggagtcagattaaagcaaacagcc aaaaacaaggacaaagtccaagagtcaaaagttgcctttcaggaagaatatgcttttcat ctacatgagtttgccaggactaccttaataaactgtcaggaacttggtaccgtgaagagc agaaatttattgtctcaacattctggtggctacaaatccaagattgatgtgtcagcagga ttgtttcctcttgagggctacagatcaacgcgacagaaagttaccaaggatatccaggaa ttgaactcagctctgcaccaagcggacctaatagacatctacagaactctccaccccaaa tcaatagaatatacattcttctcagcaccacaccgcacttattccaaaattgatcacata gttggaagtaaagcactcctcagcaaatgtaaaagaacagaaattataacaaactgtctc tcagaccacagtgccatcaaactagaactcaggattaagaaactcactcaaaaccgctca actacatggaaactgaacaacctgttcctgaatgactactgggtacataacgaaatgaag acagaaataaagatgttctttgaaaccaatgagaacaaagatacaacataccagaatctc tgggacacatttaaagcagtgtgtagagggaaacttatagcactaaatgcccacaagaga aagcaggaaagatctaaaattgacaccctaactcacagttaa >gi568815594r:156663093_157070903|GENSCAN_predicted_peptide_2|86_aa MRHHLQARESLKPEGQTNGPEWELWVIFLGLPMAPHEPISTDFLPSEAHKNSGLIQTGGD DRMTYLERGATHSRVSSLLRAEHPVG >gi568815594r:156663093_157070903|GENSCAN_predicted_CDS_2|261_bp atgaggcaccatcttcaagccagggaaagcctgaagcctgagggtcagactaatggacca gaatgggaactttgggtgatttttctgggcctgcccatggctccccatgaaccaataagc acagacttcctcccctctgaagctcataaaaattctggactcatccagactggaggagat gacagaatgacctacctggagagaggagctacacactccagggtctcctctctgctgaga gccgaacacccagtgggatga >gi568815594r:156663093_157070903|GENSCAN_predicted_peptide_3|97_aa MQDCPAMHTAEHLSSMALPTKCQQCSLNHVDTENCHYTLAECPEPPMRITDPSNPHSYLI SSFSYYWFIEEVIEEQRKRKELPLLSGKAAFYLPQTF >gi568815594r:156663093_157070903|GENSCAN_predicted_CDS_3|294_bp atgcaggactgccctgccatgcacactgctgaacatctctcctccatggcccttcccact aaatgtcagcagtgctcccttaatcatgttgacaccgaaaactgccactatacattggct gaatgccctgagcctcccatgagaatcactgacccatctaatcctcacagctacctaata agctcattttcttactattggtttatagaggaggtcatcgaggaacagagaaaaagaaag gaattaccattattgagtggcaaagctgcattttacctaccacagacattttaa >gi568815594r:156663093_157070903|GENSCAN_predicted_peptide_4|95_aa MVNAPGDRDAPRDQTTVNAPGDRDAPRVTKPRSMRLETGMHRGQGYTEGNQTIVNAPGDR DAPRVTKRWSMHLETGMHRGTKPRSMHLETGMHRG >gi568815594r:156663093_157070903|GENSCAN_predicted_CDS_4|288_bp atggtcaatgcacctggagacagggatgcaccgagggaccaaaccacggtcaatgcacct ggagacagggatgcaccgagggtaaccaaaccacgttcaatgcgcctggagacagggatg caccgaggacagggatacaccgagggtaaccaaaccatagtcaatgcacctggagacagg gatgcaccgagggtaaccaaacgatggtcaatgcacctggagacagggatgcaccgaggg accaaaccacggtcaatgcacctggagacagggatgcaccgagggtaa >gi568815594r:156663093_157070903|GENSCAN_predicted_peptide_5|182_aa MASQFSNSHMRTKILSTTPDLFVKHKRTYDFVEVEEPSDGTILGRWCGSGTVPGKQISKG NQIRIRFVSDEYFPSEPGFCIHYNIVMPQFTEAVSPSVLPPSALPLDLLNNAITAFSTLE DLIRYLEPERWQLDLEDLYRPTWQLLGKAFVFGRKSRGPSVETKDRCQGIAQITHRRGPG AP >gi568815594r:156663093_157070903|GENSCAN_predicted_CDS_5|549_bp atggcctcacagttttctaattcccacatgagaacgaaaattttgtctacaacaccagac ttgtttgttaaacataaaagaacgtatgattttgtagaagttgaggaacccagtgatgga actatattagggcgctggtgtggttctggtactgtaccaggaaaacagatttctaaagga aatcaaattaggataagatttgtatctgatgaatattttccttctgaaccagggttctgc atccactacaacattgtcatgccacaattcacagaagctgtgagtccttcagtgctaccc ccttcagctttgccactggacctgcttaataatgctataactgcctttagtaccttggaa gaccttattcgatatcttgaaccagagagatggcagttggacttagaagatctatatagg ccaacttggcaacttcttggcaaggcttttgtttttggaagaaaatccagaggtccttca gttgagaccaaagaccggtgtcaggggattgcacaaatcactcaccgacgtggccctgga gcaccatga >gi568815594r:156663093_157070903|GENSCAN_predicted_peptide_6|272_aa MTSSNLIAREEKSMPGFKVSKDRLTPLLGANASGDFTLKLMLIYHSKNARALKNYAQSTL PMLYQWNNKAWMMAHLFTAWFTKYFKRTTAQKKRFFQNITAHLQSTWSPKSSDEDVRIRR FTYKEIYVFMPANSISILQSMGQGVILSFKSYYVRNIFHKVIAAMDSNSSGESGQSKLKI FWKGFTILDAINNIHDSWEEVNISTLTGVWKKLVPIFTDKSDRFKTLMEEVTAQVVEIAR ELELEVEPEQIQSQSEQFQEVMTDEIRHQDRA >gi568815594r:156663093_157070903|GENSCAN_predicted_CDS_6|819_bp atgacatctagtaatttgatagctagagaagaaaagtcaatgcctggcttcaaagtttca aaggacaggctgactcccttgttaggggctaatgcatctggtgactttacgttgaagcta atgctcatttaccattccaaaaatgctagggctcttaagaattatgctcagtctactctg cctatgctgtatcaatggaacaacaaagcctggatgatggcacatttgtttacagcatgg tttactaaatattttaaacgtactactgctcagaaaaaaagattctttcaaaatattact gctcatttgcaatccacctggtcacccaagagctctgatgaagatgtacgtataaggaga tttacatacaaggagatttatgttttcatgcctgctaactcaatatctattctgcagtcc atgggccaaggagtaattctgagttttaagtcttattatgtaagaaacatatttcacaag gtaatagctgccatggatagtaattcctctggtgaatctgggcagagtaaactgaaaatc ttctggaaaggattcaccattcttgatgccattaacaatattcatgattcatgggaggag gtcaacatatcaacattaacaggagtttggaagaaactggttccaatcttcacggataaa tctgacaggttcaagactttgatggaggaagtaactgcacaggtggtggaaatagcaaga gaactagaattggaagtggaacctgaacagatacagagccaatcagaacagtttcaagaa gtaatgacagatgagatccggcatcaagaccgtgcttaa >gi568815594r:156663093_157070903|GENSCAN_predicted_peptide_7|225_aa MWNQFRNWVTGRNWNHLEGSEEDRKIWESLKLPRDLEGSEDRKIWESVEILKDLLNGFDQ NVDSDMHNEVEAEVVSDGDKELVGNWIKGHSCYTLGKRLAVFCSCPRDVWNFELERDYVE YLVENISKQQSIQEVTEHKSLENLQPDNAVEKKNLLSEEKFKPATAICISNKEPNANHQD NGKNVSRECHRPSQQPSHHRHGGLGGKNGFMGQAQGLHAVCSLGT >gi568815594r:156663093_157070903|GENSCAN_predicted_CDS_7|678_bp atgtggaaccaatttcggaactgggtaacaggcagaaactggaaccatttggagggatca gaagaagacaggaaaatatgggaaagtttgaaacttcctcgagacttggagggctcagaa gacaggaagatttgggaaagtgtggaaattcttaaagacttgttgaacggctttgaccaa aatgttgatagtgatatgcacaatgaggtcgaggcagaggtggtctcagatggagataag gaacttgttgggaactggattaaaggtcactcttgctatactttaggaaagagactggcg gttttttgctcctgccctagagatgtgtggaactttgaacttgagagagattatgtagag tatctggtggaaaacatttctaagcagcaaagcattcaagaggtgacagagcataagagt ttggaaaatttgcagcctgacaatgcagtagaaaagaaaaacctactttctgaagagaaa ttcaagccagctacagcaatttgcataagtaacaaggagccaaatgctaatcaccaagac aatgggaaaaatgtctccagggaatgtcacagaccttcgcagcagccctcccatcacagg catggaggcctaggagggaaaaatggttttatgggtcaggcccagggtctccatgctgtg tgcagcctagggacttag >gi568815594r:156663093_157070903|GENSCAN_predicted_peptide_8|128_aa MSVHLLEGRECSRSICWVGDYTHINSANAIKEDTQIFLWHLTMSSLTDNSSKLALRSEDK EVGVQDPQHERIITVSTNGSIHSPRFPHTYPRNTVLVWRLVAVEENVWIQLTFDERFGLE DPEDDICK >gi568815594r:156663093_157070903|GENSCAN_predicted_CDS_8|387_bp atgtctgttcacttgctcgaaggtcgtgaatgcagccggtccatttgctgggtaggggat tacactcacataaacagtgcaaatgccataaaggaggacactcagatcttcttgtggcat ctgacaatgtccagtctgacagataactcttcaaagttggccctgagatcagaagacaag gaagtaggagtacaagatcctcagcatgagagaattattactgtgtctactaatggaagt attcacagcccaaggtttcctcatacttatccaagaaatacggtcttggtatggagatta gtagcagtagaggaaaatgtatggatacaacttacgtttgatgaaagatttgggcttgaa gacccagaagatgacatatgcaagtaa >gi568815594r:156663093_157070903|GENSCAN_predicted_peptide_9|97_aa MTITHVEVEPLEESPHILSASKNLVSPDVSSFKQAWFTEDFKPTTQKEKKIPFKILLLID NAPGHPKVLMQMYKDINAVFMPANTTTILKPMNQEVI >gi568815594r:156663093_157070903|GENSCAN_predicted_CDS_9|294_bp atgaccatcactcatgtggaagttgagccattagaagaaagtccacacatcttatctgcc agtaaaaacctagtctcaccagatgtctcttcattcaagcaagcatggtttactgaagat tttaaacccactactcagaaagaaaaaaagattcctttcaaaatattactgctcattgat aacgcacctggtcacccaaaagttctgatgcagatgtacaaggatattaatgctgttttc atgcctgctaacacaacaaccattctgaagcccatgaatcaagaagtaatttga >gi568815594r:156663093_157070903|GENSCAN_predicted_peptide_10|213_aa MDNEIQAEVVSDGDEELVGNWSKGDSCYVLAETEAFCPCLRDSCNFELERDDSGYLVEEI SKQQSIQEKPKIEVWEPPPRFQKMYGNTWISRQKFAAGVGLSWRTSARAVWKRNVGSEPP HRVPTGTPPSGAVRRGPLSSRPWNARSSDSLYRVPGKPQTLNASPQSSKAGDVNDEKEAA MPGPEAGSFVKSQSDVYMCLLLAEKLLMTPQYL >gi568815594r:156663093_157070903|GENSCAN_predicted_CDS_10|642_bp atggacaatgaaatccaggctgaggtggtctcagatggagatgaggaacttgttgggaac tggagcaaaggtgactcttgttatgttttggcagagactgaagcattttgcccctgcctt agagattcgtgcaactttgaacttgagagagatgattcagggtatcttgtggaagaaatt tctaagcagcaaagcattcaagagaagccaaaaattgaggtttgggaacctccacctaga tttcagaagatgtatggaaacacctggatatccaggcagaagtttgctgcaggggtgggg ctgtcatggagaacctctgctagggcagtgtggaagagaaatgtggggtcagagccccca cacagagtccctactgggacaccacctagtggagctgtgagaagagggccactgtcctcc agaccctggaatgctagatcctctgacagcttgtaccgggtgcctggaaaaccacagaca ctcaacgccagcccacaaagctcaaaggctggagatgtgaacgatgagaaagaagcagcc atgccaggaccggaggcagggagcttcgttaaaagccaatctgacgtttacatgtgcttg ctccttgctgagaaacttctcatgactcctcagtatctctag >gi568815594r:156663093_157070903|GENSCAN_predicted_peptide_11|75_aa MENNNVDFPYSKFGPQSDGISITSITLEELKPKELQEGSLKEAIPLLTPGPSSHGSGLSC QAERKKDRNFNPNMN >gi568815594r:156663093_157070903|GENSCAN_predicted_CDS_11|228_bp atggagaacaacaatgtagactttccttactcaaagtttggtcctcagtccgatggcatt agcatcacctccataacccttgaggagctaaagccaaaagagctgcaggaagggagcttg aaagaagccataccccttcttactccaggtccttcaagtcatgggtcaggcctgagctgc caagcagagaggaaaaaagatagaaattttaaccccaatatgaactaa >gi568815594r:156663093_157070903|GENSCAN_predicted_peptide_12|250_aa MGERLTVLFLVAGKLEFTTQVGFRLSPLSLAGQGRCQQEKPEEAHLADWGESSLTAGTLE AATPESLSPPPGEGCTGVEAPSLLWQQRIAGGRGKKQGEGAAAGPTGRGAGAGPGGSGPT AARAQAERGRGAPAPQPGAQLAPGSERRSTDSGTWLEFSATKVHLVRARPPLLKAWGNSA ARGHRGAPVVPRPLPPRLGLRANLGATRGVSARGAAQTGGAAGLAVGKTPVAQERNAAKA APAFPSTEHL >gi568815594r:156663093_157070903|GENSCAN_predicted_CDS_12|753_bp atgggggaaagactcaccgttctgttccttgttgctggaaaactggaatttactactcag gttggattccgcctgagtcccctgtctctggccggccagggcagatgtcagcaggagaag cccgaagaggctcatttggctgactggggtgagagctcactcacggcgggcactttggaa gcagcgactcccgagtctctttcaccaccgccaggggaaggctgcactggggtggaagcg ccgagcctgctctggcagcagagaatcgcaggggggagggggaagaaacaaggcgagggc gccgcggcgggtcccacgggccggggcgccggagcggggccggggggctccgggccgacg gcggcccgggcgcaggcggagagaggccgcggggctcccgctcctcagcccggagcgcag ttggccccgggttcggagcgccgcagcacggattccggcacctggcttgagttttccgcg accaaagttcaccttgttcgggctcgtcccccgctcctcaaagcctggggcaattcggcc gctcggggtcaccgcggggctccggttgttccccgtcccctccccccacgcctcgggctc cgcgctaacctcggggctacgcgcggcgtctccgcacggggtgctgcgcagacagggggt gctgcgggactcgctgtgggcaagaccccggtggcccaggagaggaacgcggccaaggcg gcccctgctttcccttccacggaacacctctaa >gi568815594r:156663093_157070903|GENSCAN_predicted_peptide_13|179_aa MGKQLWNWIKDRVWKSLKGSEKERKMRESLELLRGLLNGCDQNADSDKDSEVQADKVSNG NEELIGNYSKSNAYYALAKSLAAFSSYPKYLWKFELQSDDLVYQAEEISKQQGLQPCCLA QPQDTALGIQAAPDPALAQRHPGQFKEDLDDEKEEKVVQRMRETQREIVLIKTKFGKRK >gi568815594r:156663093_157070903|GENSCAN_predicted_CDS_13|540_bp atggggaagcagctttggaactggataaaagacagagtttggaagagtttaaaaggctca gaaaaagagagaaaaatgagggaaagtttggaacttcttagaggcttgttaaatggttgt gaccaaaatgctgatagtgataaggacagtgaagtccaggctgataaggtctcaaatgga aatgaggaacttattggtaactacagcaaaagtaatgcatattatgccttagcaaagagt ttagctgcattcagttcataccctaagtatctgtggaagtttgaacttcagagtgatgat ttagtgtaccaagcagaagaaatttctaagcagcaaggcctacagccctgctgccttgca cagcctcaggacactgctcttggcatccaggctgctccagatccagctttagctcaaagg cacccaggacaattcaaagaagatttagatgatgagaaagaagagaaggtggtacagagg atgagggaaacacaaagggaaattgtgctgataaaaacaaaatttgggaaaagaaaatga >gi568815594r:156663093_157070903|GENSCAN_predicted_peptide_14|262_aa MQRSASSPPWFEAAASQQLLPFPTDLLLQLLWFWGRTDRTQLMKGSSGPVVTRFLRDHSN TPKATFEKACTYLPKVVLQHYPFLFIPSDIPQNACNPSQLEGECLVILLDICLISALASS QDCHSNHSPNLAQPLKPTVTIHVTHIPIPKSSSVIYNQSLTVICDHYAILIKSLHWKIQL KPNFSFSINYDLTFSCFEILPKLCGGGIPPYRKWMRRDTEEKHKEEGHVKIETEIDDAAT NQGSQRLQESPTARKRLVKVLP >gi568815594r:156663093_157070903|GENSCAN_predicted_CDS_14|789_bp atgcagcgatctgccagctcccctccttggtttgaggcggcagccagccagcagcttctt ccgttcccgacagatctcctccttcagcttctctggttctggggcagaactgatagaact cagcttatgaagggcagttctggccccgtggtcactcgatttttacgagaccacagtaat acaccaaaggctacttttgaaaaggcatgtacttatttaccgaaggtagtgcttcagcat tatcctttcctttttattcccagtgacattcctcaaaatgcatgcaacccatcacagctg gagggagaatgtctggtgattcttttggatatctgccttatttctgctttggcttcttcc caagactgtcactcaaatcatagtcccaacttggctcagccacttaagccaactgtcacc attcatgtgacccacattcccattcccaaatccagctcagttatttacaaccagtctttg actgttatatgtgatcattatgcgatcctaatcaagtccctgcattggaagatccaactt aaaccaaacttctcattttcaataaattatgacctcaccttctcctgctttgagatactg ccaaagctctgtggaggtggtattcctccttacagaaagtggatgcgcagagatacagaa gagaaacacaaagaagaaggccatgtgaagatagagacagagattgatgatgctgctaca aaccaagggagccaaagattgcaggagtcaccaacagctaggaagaggctagtaaaggtt cttccctag