GENSCAN 1.0 Date run: 6-Nov-116 Time: 13:42:14 Sequence gi568815580r:35368614_35597874 : 229261 bp : 41.49% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 11 6 6 -0.45 1.03 Term - 1047 602 446 1 2 112 42 313 0.902 23.51 1.02 Intr - 4987 4843 145 1 1 83 77 205 0.978 17.83 1.01 Init - 5679 5263 417 0 0 68 110 429 0.889 39.58 1.00 Prom - 5892 5853 40 -6.45 2.04 PlyA - 6478 6473 6 -0.45 2.03 Term - 8164 7942 223 0 1 73 53 106 0.722 1.01 2.02 Intr - 8793 8665 129 2 0 58 78 193 0.655 14.19 2.01 Init - 17028 16817 212 0 2 45 13 149 0.103 1.40 2.00 Prom - 19496 19457 40 -5.05 3.03 PlyA - 19763 19758 6 1.05 3.02 Term - 25497 25173 325 2 1 68 45 161 0.100 3.05 3.01 Init - 29614 29490 125 1 2 40 56 124 0.076 4.09 3.00 Prom - 35869 35830 40 -3.65 4.06 PlyA - 36807 36802 6 1.05 4.05 Term - 43831 43690 142 0 1 78 45 115 0.001 2.62 4.04 Intr - 53732 53609 124 0 1 72 56 80 0.021 1.92 4.03 Intr - 53940 53766 175 0 1 120 21 108 0.453 5.89 4.02 Intr - 58406 58254 153 2 0 47 60 135 0.905 5.95 4.01 Init - 60041 59982 60 2 0 84 99 -7 0.718 1.36 4.00 Prom - 66691 66652 40 -6.15 5.08 PlyA - 66945 66940 6 1.05 5.07 Term - 69676 69512 165 1 0 106 43 101 0.524 4.33 5.06 Intr - 73529 73339 191 0 2 58 67 123 0.012 5.58 5.05 Intr - 76078 75870 209 0 2 110 40 92 0.030 4.40 5.04 Intr - 76994 76913 82 0 1 69 92 51 0.024 1.38 5.03 Intr - 87452 87303 150 0 0 25 53 108 0.318 0.21 5.02 Intr - 88686 88567 120 1 0 75 73 64 0.640 3.15 5.01 Init - 93870 93792 79 0 1 109 113 91 0.982 14.97 5.00 Prom - 97234 97195 40 -5.85 6.03 PlyA - 99133 99128 6 1.05 6.02 Term - 100129 99998 132 1 0 88 52 177 0.984 11.21 6.01 Init - 100599 100549 51 0 0 78 107 40 0.563 6.26 6.00 Prom - 103474 103435 40 -3.65 7.00 Prom + 105906 105945 40 -4.75 7.01 Init + 106602 106611 10 2 1 64 116 1 0.660 1.25 7.02 Intr + 109091 109201 111 0 0 61 86 54 0.469 1.93 7.03 Term + 109987 110132 146 2 2 104 36 150 0.716 8.49 7.04 PlyA + 111034 111039 6 1.05 8.10 PlyA - 111081 111076 6 1.05 8.09 Term - 119997 119234 764 1 2 75 36 394 0.611 25.49 8.08 Intr - 120316 120087 230 0 2 44 45 162 0.160 4.09 8.07 Intr - 121570 121431 140 1 2 90 45 23 0.205 -3.46 8.06 Intr - 129477 129106 372 0 0 -57 101 296 0.072 10.73 8.05 Intr - 133835 133697 139 1 1 37 64 196 0.969 11.65 8.04 Intr - 135089 134972 118 0 1 88 18 78 0.734 -0.60 8.03 Intr - 135741 135671 71 2 2 117 55 41 0.372 1.61 8.02 Intr - 138231 138158 74 0 2 6 94 88 0.145 -1.51 8.01 Init - 144001 143933 69 1 0 75 89 56 0.188 5.50 8.00 Prom - 149958 149919 40 -4.55 9.00 Prom + 150528 150567 40 -2.75 9.01 Init + 150633 150750 118 2 1 108 59 11 0.325 0.64 9.02 Term + 164257 164408 152 2 2 79 42 99 0.258 1.49 9.03 PlyA + 164610 164615 6 1.05 10.00 Prom + 169046 169085 40 -1.15 10.01 Init + 176369 176446 78 1 0 47 33 84 0.552 -2.07 10.02 Intr + 178122 178262 141 2 0 94 33 107 0.609 5.43 10.03 Term + 179731 179946 216 0 0 12 42 200 0.878 3.96 10.04 PlyA + 181098 181103 6 1.05 11.00 Prom + 197626 197665 40 -3.65 11.01 Init + 207775 207999 225 0 0 78 31 174 0.969 9.22 11.02 Intr + 212854 212958 105 0 0 56 63 108 0.955 4.59 11.03 Term + 213246 213485 240 2 0 45 44 252 0.995 11.54 11.04 PlyA + 213540 213545 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 43759 43837 79 1 1 131 35 94 0.818 4.66 S.002 Init + 97942 98055 114 0 0 75 64 67 0.806 3.26 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580r:35368614_35597874|GENSCAN_predicted_peptide_1|335_aa MSAKLGKSSSLLTQTSEECNGILTEKMEEEEQTCDPDSSLHWSSSYSPETFRQQFRQFGY QDSPGPHEALSRLWELCHLWLRPEVHTKEQILELLVLEQFLAILPKELQAWVQKHHPENG EETVTMLEDVERELDGPKQIFFGRRKDMIAEKLAPSEITEELPSSQLMPVKKQLQGASWE LQSLRPHDEDIKTTNVKSASRQKTSLGIELHCNVSNILHMNGSQSSTYRGTYEQDGRFEK RQGNPSWKKQQKCDECGKIFSQSSALILHQRIHSGKKPYACDECAKAFSRSAILIQHRRT HTGEKPYKCHDCGKAFSQSSNLFRHRKRHIRKKVP >gi568815580r:35368614_35597874|GENSCAN_predicted_CDS_1|1008_bp atgtctgcaaaattgggaaagtcatcatcactcctaacacaaacttcagaggagtgtaat gggattctgacagagaagatggaagaggaagagcagacctgtgatccagactctagcctc cactggagcagcagctacagcccagagaccttccgccagcaattcaggcagtttggctac caggattcacctgggccccatgaggctctgagccggctctgggaactttgtcatctctgg ctgaggccggaagtgcacaccaaggagcagatcctggagctgctggtgctggagcagttc ctggccatcctcccaaaagagcttcaggcctgggtgcagaagcatcatccagagaatgga gaggaaactgtgactatgctggaggatgtggagagagagcttgatggaccaaagcagatc ttttttggacgaaggaaggacatgattgcagagaagctagcaccttcagaaatcactgag gaattgccaagtagccagctcatgcccgtgaagaagcagctccagggagcatcatgggag cttcagtccttaagaccacatgatgaagacatcaaaactacaaatgtgaaatctgcttca aggcaaaagacttctttaggcatagaactgcattgcaatgtttccaatatccttcatatg aatggctcccagagttccacatatagaggaacctatgaacaagatggtaggtttgaaaag agacaaggaaacccttcttggaaaaaacaacagaaatgtgatgaatgtggcaaaatcttt agtcagagctcagcccttattttacatcagagaatccacagtggaaagaaaccttatgca tgtgacgagtgtgcaaaggcattcagccgaagcgcaattctgattcagcatcgacgaacc catactggtgagaagccctacaagtgtcatgactgtggcaaagcctttagtcagagctca aatctttttagacataggaaaagacacattagaaaaaaagtcccataa >gi568815580r:35368614_35597874|GENSCAN_predicted_peptide_2|187_aa MEPHSDTEGRRPCDEECQRLPATSGNHTDTEGFLPRAFTEGMTLPTPSFQTSSLQTHVII NPCCFKPRSLCARPQRLFSMATSPRSGPRPDPPPEPEVVAQLAVPRFLPDGVGRFSTDSA PGAEVTGRPLSSRGCFSPGVPHISLRVPDGSKPVASTLPTSQVLFPRLFWPRFIWERGCL PPLPPLR >gi568815580r:35368614_35597874|GENSCAN_predicted_CDS_2|564_bp atggaaccacactcagacacagagggaagaaggccatgtgatgaggaatgtcaaagactg ccggcaaccagcgggaatcatacagacacggaaggcttcctccctagagccttcactgag ggtatgaccttaccaacaccttcatttcagacttctagcctccagacccatgtgataata aatccctgttgtttcaagccacgaagtttgtgtgcgcggccacagcggctcttctccatg gcaacgtccccgcgttccgggccccgccccgacccgccaccagaaccggaagttgttgcg cagctggcagttccccggtttctccccgacggcgtcgggaggtttagcacagattctgcg cctggtgctgaagtgacagggcggccactctcttccagaggctgctttagccccggggtc cctcatatatcccttagggtacctgacggctccaaaccggtggcctcaacccttcccacc tctcaggtcttgttccccagactcttctggccccgcttcatctgggagcgcggctgcctg ccgccacttcctcctctgcgctga >gi568815580r:35368614_35597874|GENSCAN_predicted_peptide_3|149_aa MNPGEELSPSSMVNRHFLSAERIQWDGVYEALVTVVSSAYEIKTDKLIPKFMWKCKGPRI TKTILEKNEVGEFTLLDFKTYHKFAVIKTMWYWHKDRYIDQQNDIESPEINSYVYGQLIF DKGAKTIQWDRMGFSTNGSGTTGLPHEKG >gi568815580r:35368614_35597874|GENSCAN_predicted_CDS_3|450_bp atgaatcctggagaggagctcagccctagctcaatggttaacaggcacttcctgtctgca gagaggattcagtgggatggtgtgtatgaggcacttgtcacagtagtatcaagtgcctat gaaataaaaactgacaagctgattccaaaattcatgtggaaatgcaagggacccagaata accaaaacaatcttggaaaagaatgaagttggagaatttacacttcttgatttcaaaact taccacaaatttgcagtaatcaaaactatgtggtactggcataaggacagatacattgat caacagaatgacattgaaagtccagaaataaattcatatgtttatggtcaactgattttt gacaaaggtgccaaaacaatacaatgggacagaatgggtttttcaacaaatggttctggg acaactggattgccacatgaaaaaggatga >gi568815580r:35368614_35597874|GENSCAN_predicted_peptide_4|217_aa MLGVLTVGGGRTGARSCWRQTLQAHYVTSETQMRLGSSFTKPNIPVNRLEWKPWGGTTQR QASPFQLDGTQGKTTAVLHHSGVLAVNAPALTVWDTAKPHHPKVYSYYYMVPYLVALTAL KLLLSPIGSGPNLQSIPYSLELGQCCALTPRDIIIATTQPSGPELLEVTSRVFSFFTPNP PTGEESSKPWESKVLLKEEKGVRTEPPGPSTKQEGRE >gi568815580r:35368614_35597874|GENSCAN_predicted_CDS_4|654_bp atgctgggggttctgacagttggaggagggagaacaggagcaaggtcatgctggaggcag acattgcaagcacattatgtcacttctgagacccagatgcgcctgggctcctcctttacc aagccaaacattcctgtgaacaggcttgagtggaagccctggggtgggacgacacaaagg caggcatctccgttccaactagatggaacccagggcaaaacgacagctgtgcttcaccat tctggggtacttgctgtcaatgcacctgccctcacagtctgggacactgccaagccccac catcccaaggtctatagttactactacatggtgccttatctcgtggcactcacggcactc aagttgttactgagccctattggctcaggcccaaacctccagagcatcccttactccctg gagttgggccaatgctgtgctctgacccccagggatataatcatagctacaacccagccc tctgggcctgagctgctagaagttacctcaagagtcttctccttcttcactcccaatcct ccaacaggagaggagtcatcaaagccatgggagtcaaaggttctcctgaaagaagaaaaa ggagtgaggacagagcccccaggaccttcaacaaaacaggagggaagagaatga >gi568815580r:35368614_35597874|GENSCAN_predicted_peptide_5|331_aa MATAAKEKHKGGHSRNGTYKWGLLGARIVIKKQCFPVIARCNKTKQVDSIPHRAYAKPSP FFPAVVVTSESKDPRVKQNPTHPLLAEGSNAAHFHKTKISQSLKWLPVGYLNSTTYEYRD LTKGLSLLCSLLLGVEADIPSMAGLHVRHLMQVPQQPCIAFAVSSPLHQGGKETEDAWSF HIPSTIHRWPTSPSVQLSSLRASTQLLCLHINLAKCFIWGILAQILSCHHHRITQFVGML TFVCAGDSSKLSPDIRTLVRRLTSYLVSASVLLKSKQMALCSASRILDVSVSPGDHVASE ARARCAPDLVAPRPMPHQNQGSLEWTEEGEN >gi568815580r:35368614_35597874|GENSCAN_predicted_CDS_5|996_bp atggctacagcagccaaagagaaacacaaaggaggacactctaggaatggaacctataag tggggacttctgggtgcacgaattgtgataaagaagcagtgctttccggtgatagctagg tgcaataaaaccaaacaagtagattccatacctcatcgtgcatatgcaaagcctagccca ttctttcctgcagtagttgtaactagtgaatcaaaggatccccgagttaagcaaaatcca actcatccactcctagctgaaggttctaatgctgctcatttccataaaacaaagatttcc cagagcctgaagtggcttcctgttggctacttaaacagcacaacatatgaatatcgtgac ttgaccaaaggcttgagtcttctttgctccctcctgctaggtgtggaagcagatatcccc tccatggctggactccacgtacggcatctcatgcaggtgccacagcagccctgcatagca tttgctgtatcatccccattgcaccagggagggaaggaaacagaggatgcctggtctttt catataccatccacgatacaccgatggcccacctctccttctgtccagctctcaagccta agggccagcacacagctcctgtgtctgcacatcaaccttgcaaagtgcttcatctgggga attcttgcacagattctgtcttgtcatcatcaccgcattacgcagtttgtgggcatgttg acttttgtgtgcgctggtgattccagcaagctttcccctgatatacgaacactggtcaga cgtctgacttcatatctggtgtctgcatcagtgcttctaaaaagcaagcagatggctctt tgctctgcttccaggatcctggatgtctcagtctccccaggtgatcatgtggcctcagaa gccagagctagatgtgctcctgatcttgtagcccctcgtcctatgcctcatcaaaaccag gggtcattggaatggactgaagaaggagaaaactag >gi568815580r:35368614_35597874|GENSCAN_predicted_peptide_6|60_aa MEVAQGVLEELERSMLEANYTDPQSKLRFSTIEEFSYIRRLPSDVVTGYLALRKATSIVP >gi568815580r:35368614_35597874|GENSCAN_predicted_CDS_6|183_bp atggaagtggcgcagggggtgttggaagagctggaaaggagcatgctggaggccaactac acagacccccagagcaaactgcggttcagcaccattgaagagttttcctacattcggagg ctgccctctgacgtcgtcaccggctacctggccctgaggaaggccacgagcatcgttccc tga >gi568815580r:35368614_35597874|GENSCAN_predicted_peptide_7|88_aa MNKALLAVSELCSPEALEQSLWWGVGRKIGGQIHMGPQQKEPPESSSAHQLFSRSHTTWA DGDEMPNEPITGSEHPDTLMSTPRRMND >gi568815580r:35368614_35597874|GENSCAN_predicted_CDS_7|267_bp atgaacaaagctctgctggcagtttctgaattgtgttccccagaagcacttgagcagtca ctgtggtggggagtggggaggaagataggcggccagatccacatgggaccacagcagaaa gaaccaccggaaagctcatctgcccaccagctattttcacgatctcacactacatgggct gatggtgatgaaatgccaaatgagccaataactggatctgagcaccctgacaccctcatg tccacacccagacgaatgaatgactag >gi568815580r:35368614_35597874|GENSCAN_predicted_peptide_8|658_aa MTSPTLCEGGGNEFKNAHDPKFQLCSIKCNEKILGVDEWNSRLLLVGRLCKAVLSVEVKI QRVDCMSLDAPKHALVLGTPEPHAQTALCPLGGPLWTLSLHLMYIPKGLRGQGINIPEKA PAAFAKGSVALVRETGAASTRKEAEEEVQERAHTTAGRPSGRTTPADIFPSRRHRLLRAP PFRLHREKPATAWSPGRVFKHQDGSNSGKEVPRPALGKRWGDQGKTRSAMAAQIPIVATT STPGIVRNSKKRPASPSHNGSSGGGYGASKKKKASASSFAQWAWAFSFAFVPPPPERAIT CHPSCPACQLHLPSQEPGVMTSGSFAAGGMWNFELEGDDFGYLAEEISKQQSIQEVTWVL LKAFRFKRETEHKSPENLQPDNVIENKNPFSEEKFKLAAEICISSPVPSGKKRFQVTDPG TPLLCAAWELGALCSSHSSHGKRGQGRAWATAAESASSKPGQFPHSVEPVGAQKSRIEVW EPPPTFQRMYGNAWMSRPKFAAGAGPLQRTSTRTLRKGNVELNPPNRVPTGKLPSGAVRR EPPSFRARNGRSTNSLHCVTGKDTDTQRQPVKVARREAIPCKATGAELPKTMGTHFLHQC DLDVRHGVKGDHFGALGFDCPAGFQTWCGSCSPFVLAYFSYLEWVYLSNACTLIVSRK >gi568815580r:35368614_35597874|GENSCAN_predicted_CDS_8|1977_bp atgacttcccctactctttgtgaaggaggaggaaatgaatttaagaatgctcatgatcct aaatttcagttatgtagtatcaagtgcaatgaaaaaatccttggtgtagatgagtggaac tctcgtttgctgctggtgggaagactttgcaaagcagtattatctgttgaggtcaaaatt cagagagtagattgcatgagtttggatgcaccaaaacatgctctggtgcttgggacccca gaacctcatgctcaaacggccctgtgccccctgggagggcccctgtggaccttgtccttg cacttgatgtatatccccaaaggcttgaggggtcagggcatcaatatcccagagaaagcc ccagcagcctttgccaaaggaagtgtggcattggtgagggaaacaggagctgcaagtacc aggaaggaagctgaagaggaagtgcaagagagggcgcacactacggcagggcgcccgagt ggcaggacaacgcccgctgacatcttcccgtcccgacggcacaggctactccgagctccg ccttttagattgcaccgagagaagccagctacagcttggagtccaggccgggttttcaag catcaagacggaagtaacagcggaaaggaagttccaaggcccgcgctgggaaaaaggtgg ggggaccaggggaagactcggagtgcgatggcggcgcaaattccaattgtggccaccact tccactcccggaatagtccggaacagcaagaagaggccggccagcccttcccacaatggc agcagcggcgggggctatggcgccagtaagaagaaaaaagcgtccgcttccagctttgcg cagtgggcctgggctttctcctttgcctttgtaccaccacctcctgaaagagcgatcact tgtcacccttcctgccctgcatgccagctccaccttccttctcaggaaccaggagtcatg acctctggctcattcgcagctggaggtatgtggaactttgaacttgagggagatgatttc gggtatctggcagaagaaatttctaagcagcaaagcattcaagaggtgacttgggtgctg ttaaaagcattccgttttaaaagggaaacagagcataaaagtccagaaaatttgcagcct gacaatgtgatagagaataaaaacccattttctgaggagaaattcaagctggctgcagaa atttgcataagcagcccagtgcctagtgggaaaaaaaggtttcaggtgactgacccaggg acccctctgctgtgcgcagcctgggaacttggtgccctgtgttccagccactccagccat ggtaaaaggggccaaggtagagcttgggccacggctgcagagagtgcaagctccaagcct ggccagtttccacatagtgttgagcctgtgggtgcacagaagtcaagaattgaggtttgg gaacctccacctacatttcagaggatgtatggaaatgcctggatgtccaggccaaagttt gctgcaggggcagggcccttacagagaacctctactaggacattgaggaagggaaatgtg gagttgaatcccccaaacagagtccccactgggaaactgcctagcggagctgtgagaaga gagccaccgtccttcagagcccggaatggtagatccaccaacagcttgcactgtgtgact ggaaaagacacagacactcaacgccagcctgtgaaagtagccaggagggaggctataccc tgcaaagccacaggggcagagctgcccaagaccatgggaacccatttcttgcatcagtgt gacttggatgtgagacatggagtcaaaggagatcattttggagctttaggatttgactgc cctgctggatttcagacttggtgtgggtcctgtagcccctttgttttggcctatttctcc tatttggaatgggtgtatttatccaatgcctgtaccctcattgtatctaggaagtaa >gi568815580r:35368614_35597874|GENSCAN_predicted_peptide_9|89_aa MGHLNNSVLEKFYYGVGAWGVIKEEPGRQRLSLDCTISEMSEEFCLWLVLLQAEIMAKAN ITLNDKRLTAFLLRSRTKQKDTLTAPTQQ >gi568815580r:35368614_35597874|GENSCAN_predicted_CDS_9|270_bp atggggcatctcaataacagtgtgttagaaaaattctattatggggttggggcttggggg gttattaaagaggaaccaggaaggcagaggctttccctggattgcacaatatcagaaatg tctgaggagttttgtctgtggctcgtcctgctacaagccgagatcatggcaaaagccaat atcacacttaatgacaaaagactgactgctttcctgctaagatctagaacaaaacaaaaa gacactctcacagctcctactcaacaataa >gi568815580r:35368614_35597874|GENSCAN_predicted_peptide_10|144_aa MDNFCWLLSSLGPTLSSMLCASGGSLAFSSLKPWFLVSVSSDKRKKGMRKGLYWPNQKQK LRTHDIFSPWIPLIQEAQRTPNRMNTKTSRTYTWTDHIQVTEKKNKEKIWKVDREKRNIT YRGTQIRITADFSSENEKIFEEDL >gi568815580r:35368614_35597874|GENSCAN_predicted_CDS_10|435_bp atggataacttctgctggctcctctcctcactcggccccaccctgtcctccatgctgtgt gcctcaggcggcagcctggcttttagcagcctgaagccatggtttttagtttctgtttct agtgataagcggaaaaaaggaatgaggaaagggctttactggcccaaccagaaacagaaa ctgaggacccatgacatattctctccctggatacccctgatccaagaagctcagagaaca ccaaacaggatgaataccaaaacaagcaggacatacacttggacagatcatattcaagtt actgaaaagaaaaacaaagagaaaatctggaaagtggacagagagaaaagaaacattaca tacagaggaacacaaataagaattacagcagacttctcatcagaaaatgagaagatattt gaggaggatctttaa >gi568815580r:35368614_35597874|GENSCAN_predicted_peptide_11|189_aa MAGGERHFLHGGGKRKKKEEDAKAEIPDKTIRSRETYSLPQEQYGGTAPKIQIISHWVPP TTHENYESTIQDEIWPHGPPPPLPPSTQRREDRARAPAGALRRPPRDEPQRGECIARGWA VPEPQRCARVGGGERGPAAAASPVPLWPALPGAGARARSARVRVWDPGAVGHAASLVGSF GEQRVSVLV >gi568815580r:35368614_35597874|GENSCAN_predicted_CDS_11|570_bp atggcaggaggtgaaaggcactttttacatggcggcggcaagagaaaaaaaaaagaggaa gatgcaaaagcggaaatccctgataaaaccatcagatctcgtgagacttattcactacca caagaacagtacgggggaactgcccccaagattcaaattatctcccactgggtccctcct acaacacatgagaattatgagagtacaattcaagatgagatttggccgcacgggccgccg ccgccgctgccgccgagcacgcagcgcagggaggaccgcgcccgagcgcctgcgggcgcc ctgcggcgcccgccgcgcgacgagccccaacgaggtgagtgtatcgcccgcggctgggca gtcccggagccgcagcgctgcgcgcgggtcggaggtggagaaagaggccccgccgcggcc gcgtccccggtccctttgtggccggcgctgcctggggccggggcgcgggcacgctcggcg cgggttcgggtctgggaccccggtgccgttggacacgcggcgtctctggttggcagcttt ggcgagcagcgagtttccgtcctagtttga