GENSCAN 1.0 Date run: 5-Nov-116 Time: 16:05:21 Sequence gi568815592f:87494351_87767120 : 272770 bp : 40.07% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 6273 6317 45 2 0 92 111 21 0.688 5.65 1.02 Intr + 6808 6960 153 0 0 113 83 36 0.955 4.95 1.03 Intr + 12032 12098 67 1 1 64 99 45 0.917 0.76 1.04 Intr + 14070 14246 177 1 0 91 84 54 0.702 4.27 1.05 Intr + 14691 14825 135 1 0 16 44 136 0.516 1.62 1.06 Term + 17049 17176 128 1 2 105 34 34 0.495 -2.84 1.07 PlyA + 17264 17269 6 1.05 2.08 PlyA - 17977 17972 6 1.05 2.07 Term - 20149 20063 87 1 0 103 28 68 0.564 -1.02 2.06 Intr - 20670 20607 64 2 1 101 71 41 0.527 1.50 2.05 Intr - 24389 24280 110 2 2 86 94 4 0.272 -1.04 2.04 Intr - 27236 27114 123 2 0 39 103 67 0.681 3.16 2.03 Intr - 35298 35182 117 0 0 59 32 89 0.415 0.14 2.02 Intr - 36592 36434 159 1 0 87 67 234 0.986 20.46 2.01 Init - 47651 47568 84 2 0 48 100 74 0.954 5.47 2.00 Prom - 48092 48053 40 -3.35 3.08 PlyA - 48650 48645 6 1.05 3.07 Term - 58842 58683 160 2 1 124 39 63 0.928 1.43 3.06 Intr - 61155 61058 98 0 2 76 94 88 0.904 6.09 3.05 Intr - 68435 68352 84 2 0 94 116 0 0.826 2.50 3.04 Intr - 69882 69780 103 2 1 62 102 100 0.301 8.06 3.03 Intr - 80997 80929 69 2 0 106 93 20 0.199 1.68 3.02 Intr - 95938 95748 191 1 2 99 88 118 0.142 10.36 3.01 Init - 120078 119827 252 0 0 74 76 162 0.170 10.99 3.00 Prom - 132141 132102 40 -4.85 4.00 Prom + 160264 160303 40 -3.75 4.01 Init + 163583 163668 86 1 2 56 107 53 0.485 4.34 4.02 Intr + 168653 168794 142 2 1 32 116 60 0.874 2.63 4.03 Intr + 170393 170509 117 1 0 16 92 103 0.896 3.24 4.04 Term + 171404 171526 123 1 0 55 38 123 0.878 1.40 4.05 PlyA + 173074 173079 6 1.05 5.08 PlyA - 173610 173605 6 1.05 5.07 Term - 181142 180964 179 0 2 56 42 101 0.871 -0.83 5.06 Intr - 181581 181510 72 1 0 37 87 97 0.875 2.96 5.05 Intr - 183617 183468 150 0 0 59 80 152 0.914 10.71 5.04 Intr - 187413 187270 144 1 0 51 90 94 0.950 5.23 5.03 Intr - 204005 203840 166 2 1 80 72 43 0.319 0.51 5.02 Intr - 206796 206597 200 1 2 35 23 143 0.539 0.65 5.01 Init - 207334 207100 235 1 1 107 75 286 0.859 25.55 5.00 Prom - 210061 210022 40 -7.65 6.03 PlyA - 211549 211544 6 1.05 6.02 Term - 213853 213801 53 2 2 97 38 47 0.734 -2.79 6.01 Init - 215934 215817 118 0 1 91 70 180 0.912 16.91 6.00 Prom - 222569 222530 40 -7.25 7.00 Prom + 227249 227288 40 -4.45 7.01 Sngl + 231338 231571 234 1 0 71 37 200 0.577 8.05 7.02 PlyA + 235921 235926 6 1.05 8.03 PlyA - 237338 237333 6 1.05 8.02 Term - 252071 251962 110 0 2 30 49 224 0.890 10.39 8.01 Init - 259162 259009 154 1 1 64 34 87 0.355 1.09 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:87494351_87767120|GENSCAN_predicted_peptide_1|234_aa MAFLALSNLDAAVYQVTYQLKIPCTALCTVLMLNRTLSKLQWVSVFMLCAGVTLVQWKPA QATKVVVEQNPLLGFGAIAIAVLCSGFAGVYFEKVLKSSDTSLWVRNIQMYLSGIIVTLA GVYLSDGAEIKEKGFFYGYTYYVWFVIFLASVGGLYTSVVVKYTDNIMKGFSAAAAIVLS TIASVMLFGLQITLTFALGTLLVCVSIYLYGLPRQDTTSIQQGETASKERVIGV >gi568815592f:87494351_87767120|GENSCAN_predicted_CDS_1|705_bp atggctttcctagctcttagcaatctggatgcagcagtgtaccaggtgacctaccagttg aagattccgtgtactgctttatgcactgttttaatgttaaaccggacactcagcaaatta cagtgggtttcagtttttatgctgtgtgctggagttacgcttgtacagtggaaaccagcc caagctacaaaagtggtggtggaacaaaatccattattagggtttggcgctatagctatt gctgtattgtgctcaggatttgcaggagtatattttgaaaaagttttaaagagttcagat acttctctttgggtgagaaacattcaaatgtatctatcagggattattgtgacattagct ggcgtctacttgtcagatggagctgaaattaaagaaaaaggatttttctatggttacaca tattatgtctggtttgtcatctttcttgcaagtgttggtggcctctacacttctgttgtg gttaagtacacagacaacatcatgaaaggcttttctgcagcagcggccattgtcctttcc accattgcttcagtaatgctgtttggattacagataacactcacctttgccctgggtact cttcttgtatgtgtttccatatatctctatggattacccagacaagacactacatccatc caacaaggagaaacagcttcaaaggagagagttattggtgtgtga >gi568815592f:87494351_87767120|GENSCAN_predicted_peptide_2|247_aa MVGLLGTGFQLFGYEEKLQSNPLQHLFEVYVQVNKEAADDKSVAKAAQEFFQRLELGDVQ ALSLWQKFRDLSIEEYIRVYKRLGVYFDEYSGESFYREKSQEVLKLLESKGLLLKTMYDQ VRPIITRICLVTLGLWPLCFRDLAAAIDRMDKYNFDTMIYVDFKGLLLSDYKFSWDRVFQ SRGDTGVFLQYTHARLHSHLAAVAHKTLQIKDSPPEVAGARLHLFKAVRSVLANGMKLLG ITPVCRM >gi568815592f:87494351_87767120|GENSCAN_predicted_CDS_2|744_bp atggtaggtcttctgggaactggcttccagctgtttggctatgaggaaaaactgcagtcc aatcctctacagcatctctttgaagtttatgtacaagttaataaagaagcagcagatgat aaaagtgtagcaaaagcagcacaggagttcttccaacgattggaactgggcgatgtgcaa gcactttcactgtggcaaaaatttcgggacttgagcattgaagagtacattcgggtttac aagcgtctgggagtatattttgatgaatattcaggagaatcattttatcgtgaaaaatct caagaggtcttaaagttgctggagagtaaaggactcctactgaaaacaatgtatgatcag gtaagaccaattattacccggatctgcttagtaactcttggtttatggccactttgtttc agagatcttgcagctgctatagatcgaatggacaagtataattttgatacaatgatatat gtggacttcaaaggtttactcttatctgactacaagttcagctgggatcgtgttttccag agtcgcggggacacaggagtcttcctacagtacacacacgcccgcctccacagtcatctt gcagctgtggcacacaaaacactacaaataaaagatagtcctcctgaagtggctggggcc agacttcatcttttcaaagctgtccgttctgtcctagccaatggaatgaaacttcttgga ataacacctgtatgtaggatgtaa >gi568815592f:87494351_87767120|GENSCAN_predicted_peptide_3|318_aa MEKISPGHVRGLHGSPSHHKPGDLGGKSDFTGWAQSPRAVCSLGNWCCVLATPAVAEKGQ HKVRATASESASPKPWQLPHRVEPPGKGCLRNLGFIPSSKGNEAAVEPQCASATTYLRHR GRSHGLTDSAYSRCTRDFPRALAAAGQRNHLPNKLLACKSFSQALLLKKPREVADFQLSV DSLLEKDNDHSRPDIQVQAKRLAEKLRCDTVVSEISTGQRTVNFKINRELLTKTVLQQVI EDGSKYGLKSELFSGLPQKKIVVEFRVARPLYKGTQGSKIVKSSSLETGTASFLSLSADE SKSQDHPRFKGRKIEYIS >gi568815592f:87494351_87767120|GENSCAN_predicted_CDS_3|957_bp atggagaaaatatctccagggcatgtcagaggtcttcatggcagcccctcccatcacaag cctggagacctaggaggcaaaagcgatttcacgggctgggcccagagtccccgtgctgtg tgcagcttaggaaattggtgctgtgtcctagccactccagctgtggctgaaaagggccaa cacaaagttcgggccacggcttcagagagtgcaagccccaagccctggcagcttccacat cgtgttgagcctccagggaagggatgcctacgtaacctcggcttcatcccttcttcaaag gggaatgaggcagcggtagagccacagtgcgcatcggccaccacataccttagacatcga ggacgtagccatggtcttactgactctgcgtattccagatgcactcgggatttcccgcgc gccctcgcggctgcagggcagaggaaccatctcccaaataaactacttgcctgcaagtct ttttctcaggccctgcttttgaagaagccaagagaagtagctgattttcagctttctgtg gattctttattggaaaaagacaatgaccattcaagaccagatattcaagttcaagccaag agactagcagagaagctaagatgtgatacagtggtgagtgaaatcagtactggtcaaagg actgtaaatttcaaaataaacagagagctcttaacaaagacagtgctacaacaagtaatt gaagatggctcaaaatatggattaaaaagtgaacttttctctggacttccccagaagaag attgtggttgaattcagggtagcaagacctctttacaaaggtactcaaggctctaagata gtgaaatcatctagtcttgaaacaggcacagcatcatttttgtcactttctgctgatgaa agcaagtcacaagatcaccccagattcaagggaagaaaaatagagtacatctcttaa >gi568815592f:87494351_87767120|GENSCAN_predicted_peptide_4|155_aa MKELRRSKKQTKFEVLRENVVNFIDCLVREYLLPPETQPLHEVVYFSAAHALREHLNAAP RIALHTALNNPYYYLKNEALKSEEGCIPNIAPDICIAYKLHLECSRLINLVDWSEAFATV VTAAEKMDANSATSEEMNEIIQYPFKTISTMSNYT >gi568815592f:87494351_87767120|GENSCAN_predicted_CDS_4|468_bp atgaaggagttaagaagaagtaagaagcaaaccaaatttgaagtactcagagaaaatgtt gtgaacttcattgactgtctagtgagagaataccttctgcctcctgagacacagcctctc catgaggtggtgtacttcagtgctgcccatgcccttcgtgagcatttaaatgctgctccg cgaattgccctccatactgcactcaacaatccttactattatctcaagaatgaagcactg aaaagcgaagaaggctgcattccgaatatcgccccagacatctgcatagcatacaaactg cacctagagtgtagcaggctcatcaacctcgtggactggtcagaggcttttgcaacagtt gtgacagctgctgaaaaaatggatgcaaattctgcaacctcagaagaaatgaatgaaatt atccagtatccttttaaaaccatttctacaatgtccaactacacataa >gi568815592f:87494351_87767120|GENSCAN_predicted_peptide_5|381_aa MACGATLKRTLDFDPLLSPASPKRRRCAPLSAPTSAAASPLSAAAATAASFSAAAASPQK YLRMEPSPFGDVSSRLTTVAGAPAEMIREDDIIRKFRRKGHISPQSGCVLAVIDWLQLLT LGLGGLRRGYSPSFSSDTADFSIVGVTMAEFNSCHTQYIAPKTPYLLSGVYRKSLLTLMW GVSSEVYNTPGYGKGKVISGKQILYNIKQEYKRMQKRRHLETSFQQTDPCCTSDAQPHAF LLSGPASPGTSSAASSPLKKEQPLFTLRQVGMICERLLKEREEKVREEYEEILNTKLAEQ YDAFVKFTHDQIMRRYGEQPASCYFKLSSVATTLRQQQLVLEISLMSVPPGCGPLLPVLI PVASFCCIITIWLLILMFEKD >gi568815592f:87494351_87767120|GENSCAN_predicted_CDS_5|1146_bp atggcgtgcggagccactctgaaaaggactctggatttcgacccgctgttgagcccggcg tccccgaagcgcaggcgatgtgcgccattgtcggcgcccacctcggccgctgcctccccg ttgtcggcggccgcggccaccgccgcctccttctccgctgcggccgcctcgccgcagaag tatctccgaatggagccatcccccttcggcgacgtctcctcccgcctcaccacagtcgct ggagccccagcggaaatgatcagagaggacgatattattcgtaaatttaggcgaaagggt cacatttcaccccaatctggatgtgtgcttgccgtaatcgattggttgcagctattaact ctgggactgggtggtttgaggcgaggctacagcccctccttcagttcagacacggctgat tttagtatagtgggggtgacgatggcagagtttaatagttgtcacacacagtatattgcc ccgaaaactccatatttgctctctggtgtttacagaaaaagtttgcttaccctcatgtgg ggtgtcagcagtgaggtttataatacccctgggtatggaaaaggaaaagtgatcagtgga aaacaaattctgtacaacataaaacaagagtataaacgaatgcagaagagaagacattta gaaacgagtttccaacagacagatccgtgttgtacttctgatgcacagccacatgcattt ctcctcagtggaccagcttcaccagggacttcatctgcagcatcctcaccattaaaaaaa gaacagcccttatttactctacggcaggttgggatgatctgtgaacgtttgttgaaagaa cgtgaagagaaagttcgagaagaatatgaagaaatattgaacacaaaacttgcagaacaa tatgatgcgtttgtgaagtttacgcatgatcaaataatgcgacgatatggagaacagcct gctagctgttatttcaagctttcgtcagtggcaaccactcttaggcagcagcaactggtt ttggaaatttccctgatgtcagtaccacctggatgtggacctttgctacctgtattaata ccagtggcctcattttgctgtatcattacaatttggcttcttatattaatgtttgaaaag gattaa >gi568815592f:87494351_87767120|GENSCAN_predicted_peptide_6|56_aa MEPQDKRHKREREPDTDSDTNSESGLDLSKTTEQTAAVEVYSSSWFGLENKIPKHI >gi568815592f:87494351_87767120|GENSCAN_predicted_CDS_6|171_bp atggaaccacaggacaaaaggcacaagagggaacgagagcctgacacagacagtgacacc aattctgagtcaggcttggacttaagcaagacaacggaacagacagctgcagtagaagta tattcttccagttggtttggactggaaaataagatccccaagcacatttga >gi568815592f:87494351_87767120|GENSCAN_predicted_peptide_7|77_aa MQNLFSGEAMELASGTEMAQHQCWYPQQRVSAATVPTPAAFSSITVQQYDLDITSCQVAQ NMILWPFQTVYVLSNTL >gi568815592f:87494351_87767120|GENSCAN_predicted_CDS_7|234_bp atgcagaatctgttttctggtgaagctatggagcttgccagtggcacagaaatggcccag caccagtgttggtatccccagcagagagttagtgcagccacagtgccaacgccagcggcg ttctcttctataacagtccagcagtatgatttggatataacttcctgccaagtagcccag aacatgattctctggcccttccagacagtctatgtgctatctaataccctttaa >gi568815592f:87494351_87767120|GENSCAN_predicted_peptide_8|87_aa MRKTQHPLCDIPAKDTQPELNHEETSDKTELRGIKQNNWSIFVKSFKAIKPNNYVEYAQE KEKEEKKEENEEKDEEKEEEEEEEKRY >gi568815592f:87494351_87767120|GENSCAN_predicted_CDS_8|264_bp atgaggaaaacacagcatcccctctgtgatattcctgccaaagatacacaacctgaactt aatcatgaggaaacatctgacaaaaccgaacttaggggcattaaacaaaacaactggtcc atatttgtcaaaagttttaaggccataaagccaaacaattatgttgaatatgctcaggag aaggagaaggaggagaagaaggaggagaatgaggagaaggatgaggagaaggaggaggag gaggaggaggagaaaagatattga