GENSCAN 1.0 Date run: 6-Nov-116 Time: 14:43:42 Sequence gi568815586f:25703298_25904526 : 201229 bp : 38.65% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 953 1090 138 1 0 65 44 157 0.961 5.98 1.02 PlyA + 1207 1212 6 1.05 2.11 PlyA - 1325 1320 6 1.05 2.10 Term - 4516 4175 342 1 0 96 44 150 0.773 4.93 2.09 Intr - 7899 7713 187 2 1 108 92 125 0.990 13.67 2.08 Intr - 9208 9055 154 2 1 60 56 51 0.085 -2.69 2.07 Intr - 15304 15137 168 2 0 59 62 78 0.084 1.30 2.06 Intr - 27871 27799 73 1 1 39 64 68 0.102 -2.44 2.05 Intr - 32172 31971 202 2 1 103 82 107 0.328 9.97 2.04 Intr - 47966 47931 36 1 0 75 99 47 0.004 0.96 2.03 Intr - 59796 59663 134 0 2 2 47 134 0.003 -0.78 2.02 Intr - 73999 73828 172 0 1 75 54 118 0.086 6.12 2.01 Init - 90006 89957 50 0 2 55 61 47 0.031 -0.83 2.00 Prom - 90791 90752 40 -3.45 3.03 PlyA - 92389 92384 6 1.05 3.02 Term - 94822 94676 147 1 0 122 40 106 0.945 6.12 3.01 Init - 98484 98398 87 0 0 59 100 63 0.685 5.29 3.00 Prom - 98809 98770 40 -4.55 4.00 Prom + 99505 99544 40 -5.95 4.01 Init + 99848 100320 473 1 2 36 8 307 0.699 12.54 4.02 Term + 100716 101232 517 0 1 4 48 485 0.955 28.90 4.03 PlyA + 103484 103489 6 1.05 5.07 PlyA - 103742 103737 6 -0.45 5.06 Term - 104860 104614 247 0 1 78 55 136 0.407 3.58 5.05 Intr - 111253 111156 98 1 2 82 76 26 0.183 -1.31 5.04 Intr - 116837 116553 285 2 0 -21 37 223 0.013 2.91 5.03 Intr - 123671 123546 126 2 0 69 71 95 0.029 5.86 5.02 Intr - 130769 130652 118 1 1 35 106 76 0.053 3.65 5.01 Init - 158929 158847 83 1 2 74 71 92 0.343 6.59 5.00 Prom - 159048 159009 40 -6.65 6.00 Prom + 162578 162617 40 -5.45 6.01 Init + 171126 171135 10 2 1 79 97 5 0.284 1.35 6.02 Intr + 171867 171924 58 1 1 45 110 25 0.356 -2.58 6.03 Intr + 172085 172268 184 2 1 25 47 236 0.430 12.07 6.04 Term + 179792 179932 141 1 0 78 44 107 0.193 2.25 6.05 PlyA + 180589 180594 6 1.05 7.04 PlyA - 180716 180711 6 1.05 7.03 Term - 181647 181313 335 1 2 47 47 176 0.843 3.19 7.02 Intr - 181850 181674 177 0 0 70 81 120 0.634 8.47 7.01 Init - 182368 182332 37 1 1 114 55 51 0.664 4.66 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:25703298_25904526|GENSCAN_predicted_peptide_1|45_aa HFLVHTSTVLDREAHWNEVEETGVDADPSPEITSEVPEKDKDCLD >gi568815586f:25703298_25904526|GENSCAN_predicted_CDS_1|138_bp cacttcttagttcacactagcaccgtactagacagagaagctcactggaatgaagtggag gagactggagtggatgcagatcccagtccagagattacttcagaagtccctgagaaagat aaggactgcttggattaa >gi568815586f:25703298_25904526|GENSCAN_predicted_peptide_2|505_aa MSLTPRSCQQQQLMLSKLSVRAGFKPEPDFKAIFHCALLPSPEMMPNTCATYWHLQFIRL LASSQGSSQGVKAANCKQKQGDTSTHLSEWLKFRTLTTPNADKNMEQELSFIAGGNAKWI CDWEDDSVLDRNYVLKVLEIEKACNHGDNFVLISYNTSNEEQVEGDEKSKKRRKNNLIQQ IFIKYFACISLLDAGDKEPTVEYITDLVLSGSPVYVQPMRGTGTGEAPPCCKRSELPEAP PHPPSVKASWRFPEDLPPYLPPASITTKRIRTRISTGLAFIDGLRVRQACCTMRVSKQQE PEHSVSKSWLYNCMILGNSLNFSVFQFHYLCGLAIVPNAWSTLSAESKDCLAERSFVSVL TFLRGTDYLFLTESHGHLDKENKRIVHFTSNIGTFVQRVGSQGLRQLHSCGFSEYSLHSC LHGLEVSTCSFSRYKVQAAGGFTILGPGGHQPSSHSFTRQCPIGDSVWGSKPTFSRHMAV VEVVCGSSTPTAGFYLGTQALSHIL >gi568815586f:25703298_25904526|GENSCAN_predicted_CDS_2|1518_bp atgagtctgacaccaagaagttgtcagcagcagcagctaatgcttagcaaattaagtgtc agagcaggatttaaacccgagcctgacttcaaagccatcttccactgtgctttgctgcct tcacctgagatgatgccaaatacctgtgccacatactggcacctgcagtttattaggctg ttggcatctagtcaaggatcctcacagggtgttaaagcagcgaattgcaaacaaaaacaa ggagacaccagtacacacctatcagaatggctaaaattcagaacactgacaacaccaaat gcagacaagaatatggaacaagaactctcattcattgctggtggaaatgcaaaatggatt tgtgattgggaagatgacagtgttcttgacaggaactatgttctgaaggttctagaaata gaaaaagcctgcaatcatggtgataactttgtgctcatttcttataacacttctaatgaa gagcaagtggaaggagatgagaagtctaagaagagaagaaagaataatttgattcagcaa atatttattaaatattttgcatgtatctctcttctagatgctggagataaagagcctact gtggagtacattactgatcttgtgctctctgggtctccagtgtatgttcagccaatgagg ggcactggaacaggagaggctcctccctgctgtaaacggagtgaacttcctgaggctcca ccccatcctcccagtgtgaaggctagttggagattccctgaagaccttcccccttatctg cctcctgcatctatcaccactaagcggattagaacaagaattagcactggacttgctttt attgatggtctccgtgtgcgacaagcttgttgcaccatgagagtgtctaagcaacaagag ccagaacacagtgtttcaaaatcttggctctacaattgtatgatcttgggtaactcactt aacttctctgtgtttcagtttcattatctgtgtggccttgccattgttccaaatgcatgg agcaccctttctgcagaaagcaaagattgccttgctgagaggtcctttgtctccgtgctg acttttcttcgtggcaccgattatctatttctaacagagagtcatgggcatttggacaaa gaaaataaaaggattgtgcatttcacatcaaatataggaacatttgtacaaagggtgggc tcccaaggtcttaggcagctccactcctgtggtttttcagagtatagcctgcacagctgc cttcatgggttagaggtgagtacctgcagcttttccaggtacaaggtgcaggctgctggt ggatttaccattctggggcctggaggacatcagccttcttcccacagcttcacaagacag tgccccattggagactctgtgtggggctccaaacccacattttcccgccacatggctgta gtagaggtggtctgtggaagctccacccctacagcaggcttctacctaggcacccaggct ctgtcacacatcctctga >gi568815586f:25703298_25904526|GENSCAN_predicted_peptide_3|77_aa MLQQAQRTKKQLQQETAELGSEIPALTKKDALLLIKGMKTSGRGSRCVAAPLKILVKYKS LLPFLKQLQQLFSPENE >gi568815586f:25703298_25904526|GENSCAN_predicted_CDS_3|234_bp atgctgcaacaggcacagagaacaaagaaacaactgcaacaagaaactgctgaacttggg tctgaaatcccagcattaacaaaaaaggatgctctcctacttatcaagggaatgaagaca agtggacgtgggtcccgctgtgttgcagcacctttgaaaatacttgtgaaatataaatca ttgcttccatttctgaagcaattgcaacaactctttagcccagagaatgagtag >gi568815586f:25703298_25904526|GENSCAN_predicted_peptide_4|329_aa MVSSFDPSCRIKSQQVRSLGLSYRSEYHAVLQRPAARVPGRWSCPHQRPRGMEAENAGSY SLQQAQAFYTFPFQQLMAEAPNMAVVNEQQMPEEVPAPAPAQEPVQEAPKGRKRKPRTTE PKQPVEPKKPVESKKSSKSAKSKEKQEKLQTHLKSKENKEVFGVKVKNLEFGLQPHKIPG TETLCYVMPSSSARSAQFPRAQDKVHYYIKLKDLRDQLRGIERNTDVQEVQYTFDLQLAQ EDAKKMAVKEEKYDLGYEAAYGGAYGENPCDSEPCSFSSNGLIESLELRGESTFSGIPNE QWMTQSFTDQIPSFNNHCGTQEQEEESHA >gi568815586f:25703298_25904526|GENSCAN_predicted_CDS_4|990_bp atggtgagcagttttgacccatcatgtaggataaaaagtcaacaggtgaggtctctgggg ttgtcttaccgcagtgagtaccacgcggtactacagagaccggctgcccgtgtgcctggc aggtggagctgcccgcatcagcggcctcggggaatggaagcggagaacgcgggcagctac tcccttcagcaagctcaagctttttatacgtttccatttcaacaactgatggctgaagct cctaatatggcagttgtgaatgaacagcaaatgccagaagaagttccagccccagctcct gctcaggaaccagtgcaagaggctccaaaaggaagaaaaagaaaacccagaacaacagaa ccaaaacaaccagtggaacccaaaaaacctgttgagtcaaaaaaatctagcaagtctgca aaatcaaaagaaaagcaagaaaaattacagacacatttaaagtcaaaagaaaataaagaa gtttttggtgtaaaggttaagaacttggaatttgggcttcagccccataagattccaggc acagaaactctctgctatgttatgccgtcatccagtgcaagaagtgctcagtttcctcgg gcccaagacaaagttcattactacataaaactgaaagacttaagagatcagttgagaggc attgaacgaaatacggacgttcaagaggtgcaatatacatttgacctacagcttgcccaa gaagatgcaaagaagatggctgttaaggaagaaaaatatgatctgggttatgaggcagca tatggtggtgcttacggagaaaatccatgcgacagtgaaccttgcagcttctcttcaaat gggctaattgagagcctggagttaagaggagaatcaactttcagtggcattcctaatgag cagtggatgacccagtcatttacagaccaaattccttcctttaataatcactgtggaaca caagaacaggaagaagaaagccatgcttaa >gi568815586f:25703298_25904526|GENSCAN_predicted_peptide_5|318_aa MNNWINLKIRSLNSVCTDVENCSMHIVRQSSQIKDHSNYTQQQHGKEASAAEVIWIQEER EDAFNQKVAYGVARSSPDLVGEEPLQLILEKPVGDRRWSCAKGSLTGEQKLISKEERRRS TEIFGGSYLLDKIGCILFTPTMNGGSSVNRTERAQEYNEYCATGNVNATNKGGREGKQRE EIVEGGASTALHLLLPSGGLVEAAVWLQVGCLLKAVHVIMMPSLVWEQTVSRVGANLCPL FLPRCLQRALPFGLALAAVFHKSVSTIFAERRSVTMGLCSKSFVAPHTLSTSGPIVLVGF PQEMLLYYFYGQLLLIPP >gi568815586f:25703298_25904526|GENSCAN_predicted_CDS_5|957_bp atgaataattggataaacctgaaaatacgatcactgaactcggtatgcactgatgtggaa aactgttcaatgcacattgttaggcagagctcccaaattaaggatcatagcaattacaca caacaacagcatggaaaagaagccagtgctgcagaggtgatctggatccaggaagaaagg gaagatgccttcaatcaaaaagttgcttatggagtggccagaagcagtccagacctggtt ggggaagaaccactgcagttaatcctggaaaagcctgtcggggatagaaggtggagctgt gctaaaggctcactaactggggaacagaagttgatcagcaaggaggaaaggagaagatcc acagaaatatttggaggcagttatttattggacaaaatcggatgtattctgtttacaccc acaatgaatggaggctcctcagtaaatagaaccgagcgggctcaggagtataatgaatac tgtgcaacgggcaatgtgaatgcaaccaataagggagggagagaaggaaaacaaagagag gaaatagtcgagggaggagcaagcacagcccttcatctgcttctgccgagtggagggctt gtggaagctgcggtgtggcttcaggtaggttgtctgttgaaagctgtccatgtgataatg atgcctagtctggtttgggaacaaacagtaagcagagttggagccaacctctgcccactt ttcttaccaagatgcctgcaacgagccctaccctttggcttggctctggcagctgttttt cataagtcagtcagcacaatctttgcagaaaggagatcggtcactatgggtctctgctca aaatccttcgttgcccctcacactctttcgacctcaggaccaattgtgctggttggtttc ccccaggaaatgcttctctactatttctatggccagctccttctcattccaccttga >gi568815586f:25703298_25904526|GENSCAN_predicted_peptide_6|130_aa MEAAPNTPHPAHKHSSSPNQPGRRVVKETAEAVPPQKNAQSNGGEINFSNKLKIHIEEAL ALTQKLSAENLKKSRSSKTESTQEKFSDFNHSGTRKVQQNINVKSPVIQSWIPETAAARF NVGHAKFPLC >gi568815586f:25703298_25904526|GENSCAN_predicted_CDS_6|393_bp atggaagcagctcctaacacccctcaccctgctcacaaacactcgtcgagtccaaaccag cctggcaggagagttgtaaaggaaacagcagaagcagtccctccccagaaaaatgcacag tctaatggaggtgaaattaacttcagcaataaacttaagattcatattgaagaggcactt gcactgactcagaaactttctgcagagaatctgaagaagagcaggagcagcaagacagaa agcactcaagagaaattctcagattttaaccactcaggtaccaggaaggttcagcagaac atcaatgtaaaatcaccagtcattcagtcttggatccctgagacagctgcagccagattt aatgtgggacatgccaagtttccactgtgctga >gi568815586f:25703298_25904526|GENSCAN_predicted_peptide_7|182_aa MGTGSLQAVAGPAGNPVTCLALAEPGASTGLRGKELCPYWSMSGHGQAWKRHHKFPLWSA GLAAQSPLWPEGACLRPAVTHSAQAACAKGHLQASNEPPSAPPWLPLTPCSSAPKVRMGL ASQCCPECVHTRPGFCSAQARPQTCSEALGAGRGQAVGADTRNPAEALGAFPGPEGADCR DV >gi568815586f:25703298_25904526|GENSCAN_predicted_CDS_7|549_bp atgggcactggctctctgcaagctgtggctggaccagctggtaatcccgtcacctgtcta gctctggcggagcctggggcttctacgggcctcagggggaaggaactgtgtccctattgg tccatgagcggccatgggcaggcctggaaaaggcaccacaagttcccactctggtctgcg ggactggcagcccagtcgcctctctggcctgaaggagcctgtctgcgtcctgccgtcacc catagtgcccaggctgcctgtgccaaggggcacctgcaggccagcaatgagccgccctca gcacctccttggcttcccctaaccccgtgctcgtcggcacccaaagtccggatggggctc gcgtctcagtgctgccctgagtgtgtgcacacccggccaggtttttgcagtgcccaggct cgtccccaaacctgctctgaggctttgggagcaggaagaggccaggcagtgggagcagac acccgcaaccctgcagaggcattgggggccttcccaggccccgagggtgcagactgtaga gatgtctga