GENSCAN 1.0    Date run: 16-Jul-119    Time: 16:09:05

Sequence gi568815596r:207666985_207868739 : 201755 bp : 43.80% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  12141  12225   85  2  1   61   53    69 0.690   1.68
 1.02 Intr +  13471  13518   48  2  0   70   79    44 0.502   0.35
 1.03 Term +  14717  15111  395  0  2   39   43   557 0.356  41.50
 1.04 PlyA +  17107  17112    6                               1.05

 2.00 Prom +  30768  30807   40                              -5.06
 2.01 Init +  44913  45132  220  2  1   51  116   410 0.990  38.89
 2.02 Intr +  57816  57890   75  1  0  102   95    -4 0.160   1.19
 2.03 Intr +  59858  59892   35  0  2   61  103     7 0.180  -2.56
 2.04 Intr +  70427  70462   36  1  0  108   99   -11 0.004   0.46
 2.05 Intr +  75239  75358  120  1  0  117   91    25 0.982   6.39
 2.06 Intr +  80063  80229  167  1  2   76   84   165 0.980  13.66
 2.07 Intr +  83973  84135  163  0  1  121   94    98 0.972  13.68
 2.08 Term +  86604  86714  111  2  0   66   42    53 0.660  -2.94
 2.09 PlyA +  87463  87468    6                               1.05

 3.02 PlyA -  89110  89105    6                               1.05
 3.01 Sngl - 101755  99998 1758  1  0   94   49  3606 0.993 350.00
 3.00 Prom - 113133 113094   40                              -5.26

 4.00 Prom + 117510 117549   40                              -5.26
 4.01 Init + 143078 143181  104  1  2   56   99    19 0.296  -0.47
 4.02 Intr + 143944 144142  199  1  1   58   82   122 0.841   7.95
 4.03 Term + 150866 150937   72  1  0   70   39    80 0.648  -0.79
 4.04 PlyA + 152463 152468    6                               1.05

 5.03 PlyA - 152658 152653    6                               1.05
 5.02 Term - 161512 161335  178  0  1   94   41   127 0.816   5.76
 5.01 Init - 164410 164304  107  1  2   77   82    89 0.799   4.89
 5.00 Prom - 173532 173493   40                              -1.36

 6.04 PlyA - 174573 174568    6                               1.05
 6.03 Term - 181478 181411   68  0  2   79   39    77 0.243  -0.00
 6.02 Intr - 183429 183359   71  2  2   66   82    22 0.110  -1.77
 6.01 Intr - 194278 194121  158  1  2  128   92    77 0.890  10.91

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr +  66963  67063  101  2  2  135  101    33 0.989   8.21
S.002 Intr +  73671  73722   52  0  1   90   98     5 0.982   0.61

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:207666985_207868739|GENSCAN_predicted_peptide_1|175_aa
MCFFMVLEAGKSLIKTPADVVSMESLCSGKLMSREFGVRIFHYPGEGPGGADDEGPVRRQ
VKVTVKYDRKELRKRLNLEEWILEQLTRLYDCQEEEIPELEIDVDELLDMESDDARTARV
KELLVDLLQTHRDLHLWPAGQDPGHAEAEHTPEEVRVPDPGERWLPQDNCCPSTS

>gi568815596r:207666985_207868739|GENSCAN_predicted_CDS_1|528_bp
atgtgtttcttcatggttctggaggctgggaagtctttgatcaagacgccagcagatgtg
gtgtctatggagagcctgtgttctggtaaactcatgtcacgagaatttggtgtacgaata
tttcattacccaggagagggcccgggcggcgcggacgatgagggcccagtgaggcgccaa
gtgaaggtcaccgtcaagtatgaccgcaaggagctacggaagcgccttaacctagaggag
tggatcctggagcagctcacgcgcctctacgactgccaggaagaggagatcccagaactg
gaaattgacgtggatgagctcctggacatggagagtgacgatgcccgtactgccagggtc
aaggagctgctggttgacttgttacaaacccacagagaccttcatctctggcctgctgga
caagatccggggcatgcagaagctgagcacaccccagaagaagtgagggtccccgaccca
ggcgaacggtggctcccacaggacaattgctgcccctcgacctcgtag

>gi568815596r:207666985_207868739|GENSCAN_predicted_peptide_2|308_aa
MGNTLTCCVSPNASPKLGRRAGSAELYCASDIYEAVSGDAVAVAPAVVEPAELDFGEGEG
HHLQHISDREMPEDLALESNPSDHPRASTIFLSKSQTDVREKRKSNHLNHCDLSNILPHK
EQREKVPEEYFKHDPEHKFIYRFVRTLFSAAQLTAECAIVTLVYLERLLTYAEIDICPTN
WKRIVLGAILLASKVWDDQAVWNVDYCQILKDITVEDMNEMERHFLELLQFNINVPASVY
AKYYFDLRSLADDNNLNFLFAPLSKERAQNLEAISRLCEDKDLCRAAMRRSFSADNFIGI
QRSKAILS

>gi568815596r:207666985_207868739|GENSCAN_predicted_CDS_2|927_bp
atggggaacacgctgacctgttgcgtgtcccccaatgccagccccaagctgggccggcgc
gcggggtcggcggagctgtactgcgcgtccgacatctacgaggcggtgtccggggacgcg
gtggcggtagcgcccgctgtggtggagcctgccgagttggatttcggagagggcgagggc
caccacctgcagcacatcagcgaccgcgagatgcccgaagatttagctttggagtcaaac
ccttctgaccatccaagggcaagcacaattttcctgagcaaatctcaaacggatgtgcga
gaaaagaggaagagcaaccatttgaaccattgtgaccttagcaatatattaccacataaa
gaacagcgagaaaaagttccagaggaatactttaagcatgatcctgagcacaaatttatt
tacagatttgttcgtactctttttagtgctgcacagctaacagctgaatgtgcaatagta
actttggtttacttagaaaggcttttaacttatgctgaaatcgacatttgtcccaccaac
tggaaaaggattgttctgggagccattcttcttgcctccaaggtttgggacgatcaggct
gtatggaatgtggactactgccagatcctcaaggacattacagttgaggacatgaatgaa
atggaaaggcattttctggagcttcttcagtttaatattaatgttcctgccagtgtttat
gccaaatactactttgaccttcgctccttagcagatgacaacaacctgaattttctattt
gctcctcttagcaaagaaagagcacagaacctagaggctatttctagattgtgtgaagac
aaagacttgtgtagagccgctatgagaaggtctttcagtgctgataacttcattggtatt
cagcgctctaaagccatcctctcttaa

>gi568815596r:207666985_207868739|GENSCAN_predicted_peptide_3|585_aa
MARPDPSAPPSLLLLLLAQLVGRAAAASKAPVCQEITVPMCRGIGYNLTHMPNQFNHDTQ
DEAGLEVHQFWPLVEIQCSPDLRFFLCSMYTPICLPDYHKPLPPCRSVCERAKAGCSPLM
RQYGFAWPERMSCDRLPVLGRDAEVLCMDYNRSEATTAPPRPFPAKPTLPGPPGAPASGG
ECPAGGPFVCKCREPFVPILKESHPLYNKVRTGQVPNCAVPCYQPSFSADERTFATFWIG
LWSVLCFISTSTTVATFLIDMERFRYPERPIIFLSACYLCVSLGFLVRLVVGHASVACSR
EHNHIHYETTGPALCTIVFLLVYFFGMASSIWWVILSLTWFLAAGMKWGNEAIAGYAQYF
HLAAWLIPSVKSITALALSSVDGDPVAGICYVGNQNLNSLRGFVLGPLVLYLLVGTLFLL
AGFVSLFRIRSVIKQGGTKTDKLEKLMIRIGIFTLLYTVPASIVVACYLYEQHYRESWEA
ALTCACPGHDTGQPRAKPEYWVLMLKYFMCLVVGITSGVWIWSGKTVESWRRFTSRCCCR
PRRGHKSGGAMAAGDYPEASAALTGRTGPPGPAATYHKQVSLSHV

>gi568815596r:207666985_207868739|GENSCAN_predicted_CDS_3|1758_bp
atggctcggcctgacccatccgcgccgccctcgctgttgctgctgctcctagcgcagctg
gtgggccgggcggccgccgcgtccaaggccccggtgtgccaggaaatcacggtgcccatg
tgccgcggcatcggctacaacctgacgcacatgcccaaccagttcaaccacgacacgcag
gacgaggcgggcctggaggtgcaccagttctggccgctggtggagatccaatgctcgccg
gacctgcgcttcttcctatgctctatgtacacgcccatctgtctgcccgactaccacaag
ccgctgccgccctgccgctcggtgtgcgagcgcgccaaggccggctgctcgccgctgatg
cgccagtacggcttcgcctggcccgagcgcatgagctgcgaccgcctcccggtgctgggc
cgcgacgccgaggtcctctgcatggattacaaccgcagcgaggccaccacggcgcccccc
aggcctttcccagccaagcccacccttccaggcccgccaggggcgccggcctcggggggc
gaatgccccgctgggggcccgttcgtgtgcaagtgtcgcgagcccttcgtgcccattctg
aaggagtcacacccgctctacaacaaggtgcggacgggccaggtgcccaactgcgcggta
ccctgctaccagccgtccttcagtgccgacgagcgcacgttcgccaccttctggataggc
ctgtggtcggtgctgtgcttcatctccacgtccaccacagtggccaccttcctcatcgac
atggaacgcttccgctatcctgagcgccccatcatcttcctgtcagcctgctacctgtgc
gtgtcgctgggcttcctggtgcgtctggtcgtgggccatgccagcgtggcctgcagccgc
gagcacaaccacatccactacgagaccacgggccctgcactgtgcaccatcgtcttcctc
ctggtctacttcttcggcatggccagctccatctggtgggtcatcctgtcgctcacctgg
ttcctggccgccggcatgaagtggggcaacgaggccatcgcgggctacgcgcagtacttc
cacctggctgcgtggctcatccccagcgtcaagtccatcacggcactggcgctgagctcc
gtggacggggacccagtggccggcatctgctacgtgggcaaccagaacctgaactcgctg
cgcggcttcgtgctgggcccgctggtgctctacctgctggtgggcacgctcttcctgctg
gcgggcttcgtgtcgctcttccgcatccgcagcgtcatcaagcagggcggcaccaagacg
gacaagctggagaagctcatgatccgcatcggcatcttcacgctgctctacacggtcccc
gccagcattgtggtggcctgctacctgtacgagcagcactaccgcgagagctgggaggcg
gcgctcacctgcgcctgcccgggccacgacaccggccagccgcgcgccaagcccgagtac
tgggtgctcatgctcaagtacttcatgtgcctggtggtgggcatcacgtcgggcgtctgg
atctggtcgggcaagacggtggagtcgtggcggcgtttcaccagccgctgctgctgccgc
ccgcggcgcggccacaagagcgggggcgccatggccgcaggggactaccccgaggcgagc
gccgcgctcacaggcaggaccgggccgccgggccccgccgccacctaccacaagcaggtg
tccctgtcgcacgtgtag

>gi568815596r:207666985_207868739|GENSCAN_predicted_peptide_4|124_aa
MRMARPATTFAGPVQNANVETLLLKNYSEFQDGDRGHYQKVSLFISWWNQDVPLLRGVVV
VREPVCNEVSAKSTGQFHNTAERGEKEVNVPSSTTRQKGKPTDNPVIAAVAAPDVLSIVG
MEIE

>gi568815596r:207666985_207868739|GENSCAN_predicted_CDS_4|375_bp
atgagaatggcaagaccagccaccacatttgcagggcctgtgcaaaatgcaaacgtggag
acccttttgctcaaaaattattcagaatttcaagatggagacaggggtcattatcagaag
gtttcacttttcatttcctggtggaatcaggacgtgccccttttgcggggagtggtggtg
gtgagggagccggtgtgcaatgaggtgtctgccaagagcacagggcaattccacaacaca
gcagagaggggtgaaaaggaagtgaacgtgccctcctccactaccagacaaaaagggaaa
ccgactgataatccagttattgccgctgtggctgcacctgatgtgctcagcattgtggga
atggagatagaatag

>gi568815596r:207666985_207868739|GENSCAN_predicted_peptide_5|94_aa
MARPQPAAWCSVRALAGLPGALGSSHNSVSNSLRPRCESCGAVFHSECKEKSVPCPRCVR
RELQKKQKSFWQRLNMDESLEEACTMFELSYQNT

>gi568815596r:207666985_207868739|GENSCAN_predicted_CDS_5|285_bp
atggccaggcctcagccggcagcgtggtgctcggtaagagccttggctgggctgccagga
gccctgggttccagtcacaactctgtctccaactcactgcgtccccggtgtgaaagctgt
ggagccgttttccattctgaatgcaaagaaaagtctgtcccctgcccgaggtgtgttcgc
cgagagctgcagaagaagcagaagtctttctggcagagactgaacatggacgagagtcta
gaggaggcttgcaccatgttcgagctgtcctaccagaacacctga

>gi568815596r:207666985_207868739|GENSCAN_predicted_peptide_6|98_aa
VIEGKLAPFLGKVIKFATSHVYSCSLCSQKGFICEICNNGEILYPFEDISTSRFFTMKNI
VLKDFVTFMPLGTVFKYLPLGTVHGEGSMEAECGDDFA

>gi568815596r:207666985_207868739|GENSCAN_predicted_CDS_6|297_bp
gtaatagagggaaagctggctccattcttgggcaaggtcattaaatttgccacctcacac
gtgtacagctgcagtctttgtagccagaaggggttcatctgtgaaatctgtaacaatgga
gagatcctctacccttttgaggatatttcaacaagcaggtttttcactatgaagaacatt
gtacttaaagactttgtgacctttatgccacttggcactgtatttaaatatttgcccctg
ggcacagtgcatggagaaggctccatggaggctgaatgtggagatgattttgcctaa