GENSCAN 1.0 Date run: 3-Nov-116 Time: 18:42:19

Sequence gi568815596f:86342048_86592116 : 250069 bp : 40.12% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5404   5664  261  0  0   57  -25   211 0.056   3.81
 1.02 Term +   8158   8469  312  0  0   29   43   250 0.140   8.62
 1.03 PlyA +   9382   9387    6                               1.05

 2.03 PlyA -   9980   9975    6                               1.05
 2.02 Term -  10340  10048  293  0  2   44   42   237 0.155   9.22
 2.01 Init -  13480  13366  115  1  1  101   38    79 0.148   4.63
 2.00 Prom -  13738  13699   40                              -9.15

 3.00 Prom +  13820  13859   40                             -11.04
 3.01 Init +  14191  14373  183  0  0   68  121   120 0.993  12.49
 3.02 Intr +  14478  14604  127  2  1   43  119    73 0.794   5.23
 3.03 Intr +  23120  23354  235  0  1  -12   37   156 0.091  -3.68
 3.04 Intr +  30822  30950  129  0  0   90   75    81 0.128   5.89
 3.05 Term +  33354  33414   61  0  1  142   47     8 0.366  -1.60
 3.06 PlyA +  33776  33781    6                               1.05

 4.00 Prom +  34327  34366   40                              -4.55
 4.01 Init +  43320  43380   61  2  1  103  100    54 0.216   8.02
 4.02 Intr +  57330  57510  181  1  1  -37   63   261 0.895   9.10
 4.03 Intr +  74148  74322  175  0  1   31   93   150 0.609   8.82
 4.04 Intr +  84443  84515   73  1  1   75   89    58 0.032   2.66
 4.05 Intr +  99643  99791  149  2  2   44   65    59 0.001  -1.77
 4.06 Intr +  99971 100186  216  1  0  114   87   229 0.005  23.18
 4.07 Intr + 107760 107915  156  2  0   95   73    82 0.986   6.69
 4.08 Intr + 109056 109166  111  2  0   97   98    33 0.965   4.86
 4.09 Intr + 113038 113140  103  0  1   97   71   110 0.999   9.03
 4.10 Intr + 114395 114519  125  0  2   95   88    82 0.913   8.28
 4.11 Intr + 114758 114830   73  1  1   54   83    59 0.863   0.06
 4.12 Intr + 114936 115024   89  1  2   60   76    93 0.931   4.07
 4.13 Intr + 118606 118784  179  0  2   66  110   114 0.626   9.20
 4.14 Intr + 119376 119415   40  0  1   56   76    10 0.356  -6.09
 4.15 Intr + 122006 122169  164  1  2  104   96   112 0.703  11.45
 4.16 Intr + 124325 124869  545  2  2   78  116   411 0.503  34.51
 4.17 Intr + 128157 128361  205  1  1   80   78   225 0.993  18.04
 4.18 Intr + 132729 132943  215  0  2  106   61   152 0.998  11.74
 4.19 Intr + 135785 135982  198  0  0  101   76   154 0.920  13.90
 4.20 Intr + 136561 136688  128  2  2   67   68    61 0.938   1.48
 4.21 Intr + 138120 138315  196  2  1   56   96   181 0.980  13.77
 4.22 Intr + 139883 140055  173  0  2   55   95    50 0.956   1.14
 4.23 Intr + 140411 140647  237  1  0   78   95   108 0.991   7.39
 4.24 Intr + 141223 141291   69  0  0   58   76    64 0.597   0.66
 4.25 Intr + 141940 142111  172  0  1   70   55   144 0.984   7.89
 4.26 Intr + 142895 142982   88  0  1   86   80    69 0.993   4.01
 4.27 Intr + 143682 143812  131  0  2   93   91   132 0.997  13.42
 4.28 Intr + 147109 147201   93  2  0   89   96     9 0.637   0.82
 4.29 Intr + 147271 147390  120  2  0   85   97   196 0.999  19.75
 4.30 Intr + 147473 147612  140  0  2  101   75   102 0.995   9.46
 4.31 Intr + 149094 149228  135  2  0  113   98    49 0.989   8.24
 4.32 Term + 149992 150072   81  0  0  105   33    78 0.959   0.71
 4.33 PlyA + 152131 152136    6                               1.05

 5.07 PlyA - 152270 152265    6                               1.05
 5.06 Term - 163902 163757  146  1  2  113   50   239 0.999  19.69
 5.05 Intr - 165546 165432  115  0  1   59   80   178 0.967  13.20
 5.04 Intr - 168432 168311  122  1  2   73   57   172 0.678  11.89
 5.03 Intr - 187350 187171  180  1  0   64   99   191 0.910  16.72
 5.02 Intr - 203285 202935  351  0  0    8   78   222 0.250   7.57
 5.01 Init - 214860 214794   67  0  1   72   68    97 0.572   7.39
 5.00 Prom - 248596 248557   40                              -1.75

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 220799 221098  300  1  0   38   43   263 0.905  11.04

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:86342048_86592116|GENSCAN_predicted_peptide_1|190_aa
MVEPGLQWLQTVNPHMTISQAGMKAEEAFLQCCPEAAHRLSLTFQRPEQSRVSSPPRFLA
KGNECTVTGLGHSFSLEYRATFPEHISAREGQHRHKWVNVHAGDGIHTVRLACKRQRDNK
GKILEGKQLKLREAKSDAEPTNGGARGGPEKLVHGTRSAKEEGRSLASLQAVIGIYSPFA
CLVRGYGQLD

>gi568815596f:86342048_86592116|GENSCAN_predicted_CDS_1|573_bp
atggttgagccagggcttcagtggctgcagacagtaaatcctcacatgaccatctctcag
gcaggaatgaaggcagaggaggctttccttcagtgctgcccggaggctgcccacagactt
tccctcacatttcagaggccagagcagagtcgagtgtcatcccctccacgattccttgca
aagggaaatgaatgtactgtgactggcttgggccactcattttccctcgagtatcgggcc
accttccctgagcacatttctgccagggaagggcagcatagacacaagtgggttaatgtc
catgctggagatgggattcatacagtcaggctggcctgcaaaaggcagagggacaataag
ggcaaaatccttgaaggcaaacagttaaaactcagagaagcaaaatctgatgcagagccc
acgaatgggggggccagaggggggcctgaaaagctggttcatgggactagatcagccaag
gaggaaggcaggtccctggcctctctgcaagcagtgataggcatctacagcccctttgcc
tgcttggtgaggggatatggtcagctggattaa

>gi568815596f:86342048_86592116|GENSCAN_predicted_peptide_2|135_aa
MATASGAPAYLEDCAKGAASLAPGSLIHVPKQVQFVCRADWMVPTHMEGDSSPLSPLTHM
PVFSGNTLTDTPRSNALPAISSQASTSTITGNDQRQRTTLQQCQASNGSTSDLGYPRREQ
SRGSWTFECQRPVKL

>gi568815596f:86342048_86592116|GENSCAN_predicted_CDS_2|408_bp
atggccactgccagtggtgcacctgcgtatcttgaggactgtgccaaaggagctgcctcc
ctggctcccggctccctcatccacgtgccaaagcaggtccagtttgtctgcagagctgat
tggatggtgcccacccacatggagggtgactcttccccactgagtccactgactcacatg
ccagtcttctcaggaaacaccctcacagacacacccagaagcaacgctttaccagcaatc
tccagtcaagccagcacctcaaccatcacagggaacgaccaaagacaacgcacaacccta
caacaatgtcaagcttccaatggatccacctctgatctaggatacccaaggagagaacag
agcaggggctcctggacttttgaatgtcaaagaccagtgaagctttaa

>gi568815596f:86342048_86592116|GENSCAN_predicted_peptide_3|244_aa
MGKKLCQSLTTSHLDEGCPAGKAGTLCHGARHTPGSAVIELKEGAGPGGAGAGPDAVPAN
KMVSCGPLKQKHPGKGILGNVAQPNQVDRLQSHFEFAVSCLKTVSRVLIRAGKDTQRHIQ
GRRPCDDGDRDASNAAAGQGMPRIAGNCQKPGEGHTRDSPSEPPEAANPANTLILYFWPL
ELADALISQLLRILVADQYPLGIALAVGNWPPGGSSPHIVSDWLRVLLCCPGCGVAVRSV
TVRT

>gi568815596f:86342048_86592116|GENSCAN_predicted_CDS_3|735_bp
atgggcaagaaactgtgtcagtcactcaccacctcccaccttgatgaagggtgtcctgca
ggaaaagctggtaccctttgtcatggagctagacacacacctggctcagcagtcatagaa
ctgaaggaaggagcagggccagggggagcaggtgcaggcccggacgccgtccccgccaac
aagatggtctcctgtggcccactgaagcagaaacatccagggaaggggattctgggaaat
gtagctcagcccaaccaagtggaccgattacaaagccactttgaatttgcagtttcatgt
ctaaaaactgtatctagggtccttataagagcaggaaaagacacacagagacacatccaa
ggaagaagaccatgtgatgatggagacagagatgcgagtaatgcagctgcaggccaggga
atgccaaggattgctggcaattgccagaagccaggagaggggcacacaagagattctccc
tcagagcctccagaagccgccaaccctgccaacaccttgattttgtacttctggcctctg
gaactggcagatgcactcatctctcagctgctgagaatactggttgctgaccagtatcca
ctgggaattgccctggctgtggggaattggcccccgggtgggagcagtccacacatagtc
agtgactggctgagggtcttgctctgttgcccaggctgcggtgtagcagtgcggagtgta
acagtgcgaacatga

>gi568815596f:86342048_86592116|GENSCAN_predicted_peptide_4|1606_aa
MPTGMAHLCPLPLRPPGQLQVTGGSTWSQRKKQRRSIRENADTGECRVALKSFGRRVAHG
VKCEEEPEKESLKPNWKVSARSLILMQKEPSTESGGKQNLPQKVPGGGGLEAQWFVVKAK
GAMCPRHVTIPAQQEVTRKPCLVMFNVPKWILDNTNREKGKESGAFSFRGWGLSTFKSTR
AADNFSLTPSAVAFELLEKPNLGERGSRGAGARGGALPAGVETMVLTLGESWPVLVGRRF
LSLSAADGSDGSHDSWDVERVAEWPWLSGTIRAVSHTDVTKKDLKVCVEFDGESWRKRRW
IEVYSLLRRAFLVEHNLVLAERKSPEISERIVQWPAITYKPLLDKAGLGSITSVRFLGDQ
QRVFLSKDLLKPIQDVNSLRLSLTDNQIVSKEFQALIVKHLDESHLLKGDKNLVGSEVKI
YSLDPSTQWFSATVINGNPASKTLQVNCEEIPALKIVDPSLIHVEVVHDNLVTCGNSARI
GAVKRKSSENNGTLVSKQAKSCSETIPQSYDVAWLVGKESAEECEPAVSFLEFYHTLNYI
ISVCWQGGTIAMTGIEDLQGKLFSYLEMLHANSIHSLASPSMCPVQSVPTTVFKEILLGC
TAATPPSKDPRQQSTPQAANSPPNLGAKIPQGCHKQSLPEEISSCLNTKSEALRTKPDVC
KAGLLSKSSQIGTGDLKILTEPKGSCTQPKTNTDQENRLESVPQALTGLPKECLPTKASS
KAELEIANPPELQKHLEHAPSPSDVSNAPEVKAGVNSDSPNNCSGKKVEPSALACRSQNL
KESSVKVDNESCCSRSNNKIQNGECFSISLLVEAPSRKSVLTDPAKLKKLQQSGEAFVQD
DSCVNIVAQLPKCRECRLDSLRKDKEQQKDSPVFCRFFHFRRLQFNKHGVLRVEGFLTPN
KYDNEAIGLWLPLTKNVVGIDLDTAKYILANIGDHFCQMVISEKEAMSTIEPHKALSFQR
LSDLLIFAGQVAWKRAVKGVREMCDVCDTTIFNLHWVCPRCGFGVCVDCYRMKRKNCQQA
LYDVGDIVHSVRAKWGIKANCPCSNRQFKLFSKPASKEDLKQTSLAGEKPTLGAVLQQNP
SVLEPAAVGGEAASKPAGSMKPACPASTSPLNWLADLTSGNVNKENKEKQPTMPILKNEI
KCLPPLPPLSKSSTVLHTFNSTILTPVSNNNSGFLRNLLNSSTGKTENGLKNTPKILDDI
FASLVQNKTTSDLSKRPQGLTIKPSILGFDTPHYWLCDNRLLCLQDPNNKSNWNVFRECW
KQGQIGSLLKEHSALGQEFGRIAKQKDPVMVSGVHHKLNSELWKPESFRKEFGEQEVDLV
NCRTNEIITGATVGDFWDGFEDVPNRLKNEKEPMVLKLKDWPPGEDFRDMMPSRFDDLMA
NIPLPEYTRRDGKLNLASRLPNYFVRPDLGPKMYNAYAHRLVKEFFCNNLIQNFYSQQKS
QSGTYHPPGLITPEDRKYGTTNLHLDVSDAANVMVYVGIPKGQCEQEEEVLKTIQDGDSD
ELTIKRFIEGKEKPGALWHIYAAKDTEKIREFLKKVHNLYSCIKVAEDFVSPEHVKHCFW
LTQEFRYLSQTHTNHEDKLQVKNVIYHAVKDAVAMLKASESSFGKP

>gi568815596f:86342048_86592116|GENSCAN_predicted_CDS_4|4821_bp
atgcccacgggaatggcccatctgtgtcctctgcccctgaggcctcctgggcagctgcaa
gtgactggaggaagtacttggagccagagaaagaagcagagaaggagcatcagagagaat
gcagacactggagaatgccgtgtcgcattaaagagctttggaaggagagtggcccatggt
gtcaaatgtgaagaggagcctgagaaggagtccctgaagccaaactggaaagtctctgct
agatccttgatccttatgcaaaaggagcccagcactgagtccggtggcaaacagaatctt
cctcaaaaagtgccagggggaggaggactggaggcccagtggtttgtggtgaaagccaag
ggggcaatgtgccctagacatgttaccatccctgcccaacaggaagtcacaaggaagcca
tgcctggtcatgttcaatgtgccgaaatggatcctggacaacacaaacagagaaaaaggc
aaggaatcaggggccttttcttttcggggctgggggctgagcactttcaaatccactcga
gccgcagataatttttccctcactccctctgcggttgcgtttgagctgttagaaaagcca
aacttgggagagcgagggtcacgcggcgctggggcccggggaggagctcttcctgcaggc
gtggaaaccatggtgctcacgctcggagaaagttggccggtattggtggggaggaggttt
ctcagtctgtccgcagccgacggcagcgatggcagccacgacagctgggacgtggagcgc
gtcgccgagtggccctggctctccgggaccattcgagctgtttcccacaccgacgttacc
aagaaggatctgaaggtgtgtgtggaatttgatggggaatcttggaggaaaagaagatgg
atagaagtctacagccttctaaggagagcatttttagtagaacataatttggttttagct
gaacgaaagtcacctgaaatttctgaacgaattgtacagtggcctgcaataacgtacaaa
cctctgttggacaaagctggtttgggatccataacttctgttcgctttctgggagatcaa
caaagagtatttctttctaaagaccttttgaagcctatacaggatgtaaacagtcttcga
ctttctcttacggataatcagattgtcagtaaagaatttcaagctttgattgtgaagcat
ttagatgaaagccatcttttaaaaggtgacaaaaacttagttggttcagaagtaaaaatt
tatagcttggacccatctactcagtggttttcagcaaccgttataaatggaaacccagca
tcaaaaactcttcaagtcaactgtgaggagattccagcactgaaaattgttgatccgtca
ctgattcatgttgaagttgtacacgataaccttgtgacatgtggtaattctgcaagaatt
ggagctgtaaaacgcaagtcttctgagaataatggaaccctggtttccaaacaagcaaaa
tcttgctctgagactattcctcagagttacgacgttgcctggttagttgggaaagagtct
gctgaggaatgtgagccggcagtgtcctttttagaattctatcatacactcaattatatt
atctccgtctgctggcagggaggaactattgccatgactggtatagaagaccttcaggga
aaactcttcagctatttagaaatgttacatgctaacagcattcactccttggcctctccc
agtatgtgtcctgtgcagtctgtacctacaacagtttttaaggagatactgcttggctgt
actgcggcaactccacctagtaaggacccaagacagcaaagtactccccaggctgccaac
tctccacctaaccttggagcaaaaattcctcaaggatgtcataaacaaagtttaccagag
gaaatttcttcctgtctaaatacaaagtctgaagctctgagaacaaaaccagatgtctgc
aaagcagggttgctctcaaagtcctctcagattggaactggagacttgaaaattctgact
gagccaaaaggcagctgtactcagcctaagacaaacactgatcaggaaaacagattggag
tctgttccacaagcattgactggccttcctaaggagtgcttacctacaaaggcttcttct
aaggcagaattggaaattgccaatcctcctgaactgcagaagcacctagaacatgcacct
tccccatcggatgtttcaaatgcaccagaagtgaaagcaggtgtcaatagtgatagccct
aataactgttcaggaaaaaaggtagaaccttcagctttagcttgccgatcacagaattta
aaggaatcttcagtaaaagtagataatgaaagctgttgttcaagaagcaacaataaaatc
cagaatggtgagtgtttctcaatatctttattggtagaagccccatccaggaagtcggtt
ttgacagacccagctaaactcaaaaagctgcaacagagtggcgaggccttcgtacaggat
gattcttgtgtgaacatcgtggcacagttgcctaaatgccgagagtgtcgcttggacagt
ctccgcaaggataaggagcaacagaaggactcacctgtgttttgccgcttctttcacttc
aggaggttacaattcaacaaacatggtgtgttgcgggtagaaggcttcttaacaccaaac
aagtatgacaatgaagcaattggcttgtggttacctttaaccaaaaacgttgtggggatt
gatttggacacagcaaagtacatcttggccaacattggagaccacttctgtcaaatggtg
atttctgaaaaggaagctatgtcaactattgagccacacaaagccctctccttccaaaga
ctaagtgatttactgatttttgcaggacaggttgcttggaagcgagctgtcaaaggtgtt
cgagaaatgtgtgatgtgtgcgacaccaccatcttcaacctgcactgggtgtgtcctcgg
tgtgggtttggagtatgtgtggactgctaccggatgaagagaaagaattgccaacaggca
ctctatgatgttggagacattgttcattctgtaagagcgaaatggggaataaaggcaaac
tgcccttgttcaaacaggcaattcaaactcttttcaaagccagcctcaaaggaagaccta
aaacagacttctttagctggagaaaaaccgactcttggtgcagtgctccagcagaatccc
tcagtgttggagccagcagctgtgggtggggaagcagcctccaagccagccggcagcatg
aagcctgcctgtccagccagcacatctcctctaaactggctggccgacctaaccagcggg
aatgtcaacaaggaaaacaaggaaaaacaaccaacaatgccaattttaaagaatgaaatc
aaatgccttccacccctcccacctttaagcaaatccagcacagtcctccatacgtttaac
agcacaattttgacacccgtaagcaacaacaattctggtttcctccggaatctcttgaat
tcttctacaggaaagacagaaaatggactcaagaatacaccaaaaatccttgatgacatc
tttgcctctttggtgcaaaataagacgacttctgatttatctaagaggcctcaaggacta
accatcaagcccagcattctgggctttgacactcctcactattggctttgtgataatcgc
ttgctgtgcttgcaagaccccaacaataagagcaactggaatgtgtttagggagtgctgg
aaacaagggcagatagggtctttgctaaaggaacactctgctcttggacaagaatttggg
agaattgccaaacagaaagatccagtgatggtgtctggagtgcatcataaattgaactct
gaactttggaaacctgaatccttcaggaaagagtttggtgagcaggaagtagacctagtt
aattgtaggaccaatgaaatcatcacaggagccacagtaggagacttctgggatggattt
gaagatgttccaaatcgtttgaaaaatgaaaaagaaccaatggtgttgaaacttaaggac
tggccaccaggagaagattttagagatatgatgccttccaggtttgatgatctgatggcc
aacattccactgcccgagtacacaaggcgagatggcaaactgaatttggcctctaggctg
ccaaactactttgttcggccagatctgggccccaagatgtataatgcttatgcacatcga
ttggttaaagaatttttttgcaataatttgatccagaatttttattctcagcagaagagt
caatcagggacctaccatcccccaggattaatcactcctgaagatcggaaatatggaaca
acaaatcttcacttagatgtatctgatgcagctaatgtcatggtctatgtgggaattccc
aaaggacagtgtgagcaagaagaagaagtccttaagaccatccaagatggagattctgac
gaactcacaataaagcgatttattgaaggaaaagagaagccaggagcactgtggcacata
tatgctgcaaaggacacggagaagataagggaatttcttaaaaaggttcataacttatat
agctgcatcaaagtggctgaagattttgtttctccagagcatgttaaacactgcttctgg
cttactcaggaattccgatatctgtcacagactcataccaatcacgaagataaattacag
gtgaagaatgttatctaccatgcagtgaaagatgcagttgctatgctgaaagccagtgaa
tccagttttggcaaaccttaa

>gi568815596f:86342048_86592116|GENSCAN_predicted_peptide_5|326_aa
MVFRVRPADPGQTTDERIHSDTGEERLCPAALRPGGEERLCPAALRPGGEERLCPAALRP
GGEERLCPAALRPGGEERLCPAALRPGGEERLCPAALRPGGEERLCPAALRPGGEERLCP
AALRPGGEERLCPAALRLGDIQREEEKVKRSVKDAAKKGQKDVCIVLAKEMIRSRKAVSK
LYASKAHMNSVLMGMKNQLAVLRVAGSLQKSTEVMKAMQSLVKIPEIQATMRELSKEMMK
AGIIEEMLEDTFESMDDQEEMEEEAEMEIDRILFEITAGALGKAPSKVTDALPEPEPPGA
MAASEDEEEEEEALEAMQSRLATLRS

>gi568815596f:86342048_86592116|GENSCAN_predicted_CDS_5|981_bp
atggtgttccgggtccgacccgcagatcctggccaaacgacggatgaaagaatacactca
gacacaggtgaggagcgcctctgcccggccgcccttcgtccgggaggtgaggagcgcctc
tgcccggccgcccttcgtccgggaggtgaggagcgcctctgcccggccgcccttcgtccg
ggaggtgaggagcgcctctgcccggccgcccttcgtccgggaggtgaggagcgcctctgc
ccggccgcccttcgtccgggaggtgaggagcgcctctgcccggccgcccttcgtccggga
ggtgaggagcgcctctgcccggccgcccttcgtccgggaggtgaggagcgcctctgcccg
gccgcccttcgtccgggaggtgaggagcgcctctgcccggccgcccttcgtctgggagat
atccaaagagaagaagaaaaagtgaaacgatctgtgaaagatgctgccaagaagggccag
aaggatgtctgcatagttctggccaaggagatgatcaggtcaaggaaggctgtgagcaag
ctgtatgcatccaaagcacacatgaactcagtgctcatggggatgaagaaccagctcgcg
gtcttgcgagtggctggttccctgcagaagagcacagaagtgatgaaggccatgcaaagt
cttgtgaagattccagagattcaggccaccatgagggagttgtccaaagaaatgatgaag
gctgggatcatagaggagatgttagaggacacttttgaaagcatggacgatcaggaagaa
atggaggaagaagcagaaatggaaattgacagaattctctttgaaattacagcaggggcc
ttgggcaaagcacccagtaaagtgactgatgcccttccagagccagaacctccaggagcg
atggctgcctcagaggatgaggaggaggaggaagaggctctggaggccatgcagtcccgg
ctggccacactccgcagctag