GENSCAN 1.0 Date run: 8-Nov-116 Time: 00:15:20 Sequence gi568815586r:10725349_10926269 : 200921 bp : 36.81% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 1240 1235 6 1.05 1.05 Term - 5811 5651 161 1 2 100 48 70 0.100 1.32 1.04 Intr - 7724 7623 102 0 0 112 58 64 0.114 5.03 1.03 Intr - 14922 14883 40 0 1 57 87 23 0.034 -4.02 1.02 Intr - 15976 15833 144 1 0 73 69 82 0.077 4.36 1.01 Init - 19979 19500 480 2 0 78 86 147 0.052 8.91 1.00 Prom - 21627 21588 40 -6.15 2.12 PlyA - 23173 23168 6 1.05 2.11 Term - 24815 24639 177 2 0 41 32 167 0.008 3.10 2.10 Intr - 30358 29833 526 0 1 43 39 280 0.000 10.62 2.09 Intr - 38930 38789 142 0 1 85 77 53 0.100 2.39 2.08 Intr - 39371 39083 289 2 1 21 31 215 0.020 5.00 2.07 Intr - 49059 48946 114 0 0 79 89 89 0.013 7.82 2.06 Intr - 66547 66017 531 1 0 51 39 218 0.010 5.40 2.05 Intr - 67101 66939 163 2 1 2 38 198 0.001 5.26 2.04 Intr - 72889 72775 115 2 1 33 31 101 0.002 -2.51 2.03 Intr - 84111 83929 183 1 0 108 -11 164 0.020 7.34 2.02 Intr - 90799 90646 154 1 1 74 24 115 0.586 2.42 2.01 Init - 91644 91492 153 0 0 67 81 55 0.361 2.74 2.00 Prom - 111028 110989 40 -0.35 3.04 PlyA - 111116 111111 6 1.05 3.03 Term - 122019 121715 305 1 2 51 33 238 0.401 8.95 3.02 Intr - 123059 123024 36 0 0 111 99 23 0.748 3.02 3.01 Init - 124089 124026 64 0 1 66 53 102 0.855 4.18 3.00 Prom - 124161 124122 40 -5.35 4.00 Prom + 126804 126843 40 -6.25 4.01 Sngl + 128094 128591 498 2 0 90 55 318 0.879 24.49 4.02 PlyA + 131914 131919 6 1.05 5.00 Prom + 133918 133957 40 -3.45 5.01 Init + 138625 138717 93 0 0 95 65 46 0.194 3.43 5.02 Term + 152767 152922 156 0 0 67 38 126 0.344 2.45 5.03 PlyA + 153448 153453 6 1.05 6.09 PlyA - 153724 153719 6 1.05 6.08 Term - 157350 156887 464 1 2 84 33 577 0.981 46.03 6.07 Intr - 158927 158806 122 1 2 51 116 95 0.129 7.82 6.06 Intr - 166577 166522 56 2 2 115 32 35 0.030 -2.54 6.05 Intr - 174939 174866 74 1 2 7 113 100 0.389 2.61 6.04 Intr - 176006 175889 118 2 1 54 61 101 0.532 3.12 6.03 Intr - 183904 183762 143 2 2 50 83 64 0.049 1.25 6.02 Intr - 192639 192558 82 0 1 59 43 102 0.040 1.09 6.01 Init - 199618 199232 387 1 0 38 41 209 0.237 8.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 30324 29671 654 0 0 91 36 293 0.876 20.32 S.002 Term + 51970 52065 96 0 0 108 38 113 0.970 5.29 S.003 Sngl - 67073 66780 294 2 0 51 38 265 0.835 13.25 S.004 Term - 84111 83789 323 1 2 108 44 161 0.810 7.70 S.005 Init - 158869 158806 64 1 1 63 116 72 0.847 8.99 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:10725349_10926269|GENSCAN_predicted_peptide_1|308_aa MPSLTTPIKIVLEVLARAIRQEKEIKSIQLGKEEVKLSLFTDGMIIYLENPTVSAQNLLK LISNFGKVSGYKINVQKSQALLYTSNRQTESQITSKLPFTIATKRIKYLGIQLTRDVKDL FKENYKPMLNKIKEDTNKWKNIPDSWIGRINIVKMHILPKLGPLQVLQRLVRNQKFTISN VKRSFEKLGEAVEKRYSVNTVLKKESSQRSDNKQAKLTRVVVPSNDWTQLEARAKKPVEA VQSGSFSGAQSREKNGNERDPLITKFTLNRRWSPCGAKGISNPNPAFSRQTSNSNSTVAY FCRYVRLA >gi568815586r:10725349_10926269|GENSCAN_predicted_CDS_1|927_bp atgccctctctcaccactcctattaaaatagtgttggaagttctggccagggcaatcagg caagagaaagaaataaagagtattcaattaggaaaagaggaagtcaaattgtccctgttt acagatggcatgattatatacttagaaaaccccactgtctcagcccaaaatctccttaag ctgataagcaacttcggcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagca ttactatacaccagtaacagacaaacagagagccaaatcacgagtaaactcccattcaca attgctacaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctc ttcaaggagaactacaaaccaatgctcaacaaaataaaagaggacacaaacaaatggaag aacattccagattcatggataggaagaatcaatattgtgaaaatgcacatactgcccaag ttgggccctcttcaggtgctgcagagattggtgagaaaccagaagttcacaatcagtaat gtcaaaagaagctttgagaagctgggagaggctgtggaaaaaaggtacagtgtgaacaca gtcttgaaaaaagaaagctctcagaggagtgataacaaacaagccaagttaactagagtt gtagtgccctccaatgattggacccagctggaagccagagcaaagaagcctgtagaagct gttcaaagtggttcattctctggggcacagagcagggagaaaaatggaaatgagagggat cctctgattaccaaatttactctcaacaggcgttggtccccatgtggagcaaaaggcatc agtaaccccaatccagctttttctcgacagacttctaactcaaacagcactgtggcctat ttctgcaggtatgtacggcttgcttaa >gi568815586r:10725349_10926269|GENSCAN_predicted_peptide_2|848_aa MEFHDATLSCFWVPVVLEPEPTELGVVALGQIWMQPKVQPGVIMTPREAPKETGCSQGAA CPSCIPQVLRILHFQAEEAGLEHPSISSAFSSHQTLLTRAWDFRHTKQIRLHATGFRDPS TEAHMRAIKAVIIFLLLLIVYYPVFLVMTSSALIPQGKLVLMIELRVTKAFESRTSSTLM EAMVSISLDAVRTSDTVLYGLRGLEYSEDKKMWESLELPRDWLNGFDQNADSDMNNKVRV EVVSDEDEELLGTEVKPRDLVPCIPAAPAMAKRGQGTAQVMASGGTSPKFWQLPCGVESA GAQKSIIEVWEPPPRFQRMCGNAWMSRQRCAAGAGPSWRTSAKAVWKGNVGLETSHRVPT WALPSGAVRRGSPSSTPQNGRSTNNLHHVPGKTADTQHQPVKAVKREVVPCKPTEAELPR AVAAHLLHQHNLDNPDFVQGNNVPSGSQVGAFRNSLLADLTVTALELPNYSVRSHGPGSR LAHIDSDSRSAQLQTGSHDTSVQASPWNPTLQIYLNKPRVQNCYSTSQYQADLMDPSSRT TATDPGSRTDFVDPGPKTQAYYYIMDPGARPAQILIQTPNQSLQRFCYMAHTESLDELTG EGFSLLKPVYKEWKRDLVPCIPATPAMTKRGHGTARAIASEGGSPKPWQLPCGIEPAGPQ KSIIKVWEPPPRFQRMYGNSWMSRQKLAAGSGPSWRTSARAVWKGNVRLEPPHRVPTGTP PSGAVRIGPPSSRPQNGRSTDSLLHVPGKVTDTQHQPMKAARNGTIPCKATGAELPKAMG AHFLHQHDLDKLAKRVKAVHKAMTVMAMIPTGEPTVIQTFGNPHSNAKETLPFSSSFYRK RTRKGEIK >gi568815586r:10725349_10926269|GENSCAN_predicted_CDS_2|2547_bp atggaattccatgatgccactctttcctgcttctgggtccctgtagtcttagagccagag cccacagagctgggggtggtggcattggggcagatctggatgcagcccaaggtacagcca ggagtgattatgactcctagagaagctcctaaggagactggatgttctcaaggagcagcc tgtccatcttgcatcccccaggttctccggattctccatttccaagcagaagaagcagga ctggagcacccttccatatcatcagccttctcctcgcaccagaccttactcaccagggcc tgggactttagacacaccaagcagattcgactgcatgctacagggttcagagaccccagt acagaggcccacatgagggccataaaggcagtgatcatctttctgctcctcctcatcgtg tactacccagtctttcttgttatgacctctagcgctctgattcctcagggaaaattagtg ttgatgattgagctcagggtcaccaaagcctttgagagcaggactagcagcaccttgatg gaagccatggtatccatctcccttgatgctgtaagaacctcagacacagtcctttatggg ttgagaggtttggagtactcagaagacaaaaagatgtgggagagtttggaacttcctaga gactggttgaatggctttgaccaaaatgctgatagcgatatgaacaataaggtccgggtt gaggtggtctcagatgaagatgaggaacttttgggaactgaagtaaagcctagggactta gtgccctgcatcccagctgctccagccatggctaaaaggggccaaggtacagctcaggtc atggcttcagggggtacaagccccaagttttggcagcttccatgtggtgttgagtctgca ggtgcacagaagtcaataattgaggtttgggaacctccacctagatttcagagaatgtgt ggaaatgcgtggatgtccaggcagaggtgtgctgcaggggctgggccctcatggagaact tctgctaaggcagtgtggaagggaaatgtggggttggagacctcacacagagtccccact tgggcactgcctagtggagctgtgagaagagggtcaccatcctctacaccccaaaatggt agatccaccaacaacttgcaccatgtacctggaaaaactgcagacactcaacaccagcct gtgaaagcagtcaagagggaggttgtaccttgcaaacccacagaggcggagctgcccagg gctgtggcagcccaccttttgcatcagcataacctagataaccctgattttgttcagggg aataatgtgcccagtggatctcaagtgggagcatttcggaactctttgctggctgatctg actgtcacagcactggagctccctaactattcagttagatcccatggcccaggatccagg ctggcccacatagactcagactccaggtctgcccagctccagactggttcccatgacacc agtgtccaggccagcccctggaaccccacactacagatttacttgaacaaacccagggtc cagaactgctacagtacatcccaataccaggctgacctcatggacccaagctccaggacc actgctacagatccaggatccaggacagactttgtggatccaggaccaaagacccaagcc tactattatatcatggacccaggtgccagacctgctcaaatattaatccagacaccaaac cagtctctccagagattctgttacatggcccacacagaatctctagatgaactgactggt gaagggttttccctgctgaagccagtctataaagaatggaaaagggacttggtgccctgc atcccagctactccagccatgactaaaaggggccatggtacagctcgggctattgcttca gagggtggaagtcccaagccttggcagcttccatgtggtattgagcctgcaggtccacag aaatctataattaaggtttgggaacctccacctagatttcagaggatgtatggaaactcc tggatgtccaggcagaagttagctgcagggtctgggccctcatggagaacttctgctagg gcagtgtggaagggaaatgtgagattggagcccccacacagagtccctactgggacacca cctagtggagctgtgagaatagggccaccatcctccagaccccagaatggtagatctacc gacagcttgctccatgtgcctggaaaagtcacagacactcaacaccagcccatgaaagca gccaggaatgggactataccctgcaaagccacaggggcggagctgcccaaggccatggga gcccactttttgcatcagcatgacctggataagctggcgaaacgcgtgaaggccgtacac aaggcaatgacggtgatggcaatgattcccacgggagaaccaactgtgatccagaccttt ggcaatcctcacagtaatgcaaaggagactttaccattctcttcctcattttacagaaaa agaactagaaagggagaaatcaagtaa >gi568815586r:10725349_10926269|GENSCAN_predicted_peptide_3|134_aa MLLVLLSVVLLALSSAQSTDNDVNYEDFTFTIPDVEDSSQRPDQGPQRPPPEGLLPRPPG DSGNQDDGPQQRPPKPGGHHRHPPPPPFQNQQRPPRRGHRQLSLPRFPSVSLQEASSFFQ RDRPARHPQEQPLW >gi568815586r:10725349_10926269|GENSCAN_predicted_CDS_3|405_bp atgctgctggtcctgctctcagtggtccttctggctctgagctcagctcagagcacagat aatgatgtgaactatgaagactttactttcaccataccagatgtagaggactcaagtcag agaccagatcagggaccccagagacctcctcctgaaggactcctacctagaccccctggt gatagtggtaaccaagatgatggtcctcagcagagaccaccaaaaccaggaggccatcac cgccatcctcccccacctccttttcaaaatcagcaacgaccaccccgacgaggacaccgt caactctctctaccccgatttccttctgtcagcctgcaggaagcatcatcattcttccag agggacagaccagcaagacatccccaggagcaaccactctggtaa >gi568815586r:10725349_10926269|GENSCAN_predicted_peptide_4|165_aa MDGPATPVSTDSNPPTQQEDSSACKCTHLEKRLFPLLLVAQLLLSPPGAAAVKCQLDPAK WQDPQHSSICSVLHLRHWKGCEPDIGSQSTCFPEPESCLPVAADTDSNVTPATQQQRCCT LACNLGTGPLHLLLSLLMQLGARACATGSDLTSTSSRATVNLHVP >gi568815586r:10725349_10926269|GENSCAN_predicted_CDS_4|498_bp atggatggccctgctacacctgtgagcactgacagcaacccgcccacccagcaggaagac agcagtgcatgtaagtgcacacaccttgagaaaaggctcttcccactgctgctggtggca cagttgctgctgtcaccaccgggggctgcagcagtgaaatgccagttggacccagcaaag tggcaggatcctcagcattctagcatatgcagtgttctgcacctcaggcactggaaaggc tgtgaaccagacatagggagccaaagcacatgctttccagaaccagagagctgcctccct gtggctgctgacacagacagcaatgtcacccccgcaacacagcagcagagatgctgcaca cttgcatgcaacctggggacaggccctctccatctgctgctgagtctgctgatgcagctg ggggccagagcatgtgccactggcagtgacctgacttccaccagcagcagagccactgtg aacttgcacgtaccctga >gi568815586r:10725349_10926269|GENSCAN_predicted_peptide_5|82_aa MDGTGGHYVKGNKPGTEKQTVHVLTYVDHTETKKPEYIEKHINETIHFPELKVNSPYRKG PMIIQKPKFNIFRAQCKKTISG >gi568815586r:10725349_10926269|GENSCAN_predicted_CDS_5|249_bp atggatggaactggagggcattatgttaagggaaataagccaggcacagaaaaacaaaca gtgcatgttctcacttatgttgatcacacagagaccaagaaaccagaatatattgagaaa cacatcaatgaaacaatacatttcccagaactaaaggttaattctccatatcgaaagggc ccaatgattatccaaaagcccaaatttaacattttcagggctcagtgcaagaagacaatt agtggctaa >gi568815586r:10725349_10926269|GENSCAN_predicted_peptide_6|481_aa MNPGVGFFEKINKIDRPVARLIKKKREKNQIDPPKNDKGDITTDSTEIQTTIRENYKHLY ANKLENLEEMDTFLDTYTLPRLNQEEFESLSIPLTGSEIEAIINSLPTKKSPGPDGITAE FYQSYKEELAGKFQKQALISAQLLVRAFQLYLDMAEEFIIGNLSNGFIVLINCIDWVSKR ELSSVDKLLIILAISRIGLIWEILGYWRDWPKLLIQESGGSEYLEICLLVEQNPPPPTKI SAQVISRALEKQGTLPSEHWVAQIPSGKRYLLTFQCKASSSEGKMDRELTRFSQHKVGSD TRASCKMLLILLSVALLAFSSAQDLNEDGGDSEQFLDEERQGPPLGGQQSQPSAGDGNQD DGPQQGPPQQGGQQQQGPPPPQGKPQGPPQQGGQQQQGPPPPQGKPQGPPQQGGHPPPPQ GRPQGPPQQGGHPRPPRGRPQGPPQQGGHQQGPPPPPPGKPQGPPPQGGRPQGPPQGQSP Q >gi568815586r:10725349_10926269|GENSCAN_predicted_CDS_6|1446_bp atgaatccaggagttggtttttttgaaaagatcaacaaaattgatagaccagtagcaaga ctaataaagaagaaaagagagaaaaatcaaatagaccccccaaaaaatgataaaggggat atcaccactgattccacagaaatacaaactaccatcagagaaaactataaacacctctat gcaaataaactagaaaatctagaagaaatggatacattcctcgacacatacaccctccca agactaaaccaggaagaatttgaatccctgagtataccactaacaggctctgaaattgag gcaataattaatagcctaccaaccaaaaaaagtccaggaccagacggaatcacagccgaa ttctaccagagttacaaggaggagctggctgggaagttccaaaagcaggcactgatatct gctcagcttctggtgagggctttccagctgtatcttgacatggcagaagaattcataatt gggaatttgagcaatggatttatagtactgatcaactgcattgactgggtcagtaaaaga gagctgtcctcagtcgataaactcctcattatcttggcaatctccagaattgggctgatc tgggaaatattaggatattggagagattggcctaaactcttaatccaggagagtgggggc tctgaatacctggaaatctgcctgcttgtggagcagaacccacccccaccaaccaagatt tctgcacaggttatctccagggctttggagaaacaaggtactcttccctctgagcactgg gttgctcagatccccagtggaaagagatatttgcttacattccagtgtaaagcttcttct tcagaaggcaagatggacagggagctgacacgtttctcccagcacaaagttgggagtgac accagagcctcctgcaagatgcttctgattctgctgtcagtggccctgctggccttcagc tcagctcaggatttaaatgaagatggaggagactctgagcagttcctagatgaggagcgt cagggaccacctttgggaggacagcaatctcaaccctctgctggtgatgggaaccaggat gatggccctcagcagggaccaccccaacaaggaggccagcagcaacaaggtccaccacct cctcagggaaagccacaaggaccaccccaacaaggaggccagcagcaacaaggtccacca cctcctcagggaaagccacaaggaccaccccaacagggaggccatccccctcctcctcaa ggaaggccacaaggaccaccccaacagggaggccatccccgtcctcctcgaggaaggcca caaggaccaccccaacagggaggccatcagcaaggtcctcccccacctcctcctggaaag ccccagggaccacctccccaagggggccgcccacaaggacctccacaggggcagtctcct cagtaa