GENSCAN 1.0 Date run: 5-Nov-116 Time: 03:30:43 Sequence gi568815586r:10701617_10902570 : 200954 bp : 37.35% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.15 Intr - 518 344 175 1 1 94 84 186 0.956 17.82 1.14 Intr - 2532 2435 98 0 2 109 77 166 0.582 15.49 1.13 Intr - 4632 4561 72 0 0 56 102 41 0.174 0.98 1.12 Intr - 8498 8292 207 2 0 59 98 300 0.924 26.45 1.11 Intr - 11717 11595 123 2 0 103 63 295 0.999 28.36 1.10 Intr - 14167 14078 90 1 0 99 100 135 0.998 15.07 1.09 Intr - 16092 16027 66 0 0 61 55 90 0.691 1.18 1.08 Intr - 20945 20451 495 2 0 56 43 406 0.669 25.16 1.07 Intr - 21087 21039 49 2 1 49 99 42 0.390 -0.84 1.06 Intr - 21335 21234 102 1 0 60 109 94 0.104 7.07 1.05 Intr - 29543 29402 142 0 1 100 91 34 0.018 3.39 1.04 Intr - 31456 31355 102 2 0 112 58 64 0.122 5.03 1.03 Intr - 38654 38615 40 2 1 57 87 23 0.036 -4.02 1.02 Intr - 39708 39565 144 0 0 73 69 82 0.081 4.36 1.01 Init - 43711 43232 480 1 0 78 86 147 0.055 8.91 1.00 Prom - 45359 45320 40 -6.15 2.12 PlyA - 46905 46900 6 1.05 2.11 Term - 48547 48371 177 1 0 41 32 167 0.008 3.10 2.10 Intr - 54090 53565 526 2 1 43 39 280 0.000 10.62 2.09 Intr - 62662 62521 142 2 1 85 77 53 0.100 2.39 2.08 Intr - 63103 62815 289 1 1 21 31 215 0.020 5.00 2.07 Intr - 72791 72678 114 2 0 79 89 89 0.013 7.82 2.06 Intr - 90279 89749 531 0 0 51 39 218 0.010 5.40 2.05 Intr - 90833 90671 163 1 1 2 38 198 0.001 5.26 2.04 Intr - 96621 96507 115 1 1 33 31 101 0.002 -2.51 2.03 Intr - 107843 107661 183 0 0 108 -11 164 0.020 7.34 2.02 Intr - 114531 114378 154 0 1 74 24 115 0.586 2.42 2.01 Init - 115376 115224 153 2 0 67 81 55 0.361 2.74 2.00 Prom - 134760 134721 40 -0.35 3.04 PlyA - 134848 134843 6 1.05 3.03 Term - 145751 145447 305 0 2 51 33 238 0.401 8.95 3.02 Intr - 146791 146756 36 2 0 111 99 23 0.748 3.02 3.01 Init - 147821 147758 64 2 1 66 53 102 0.855 4.18 3.00 Prom - 147893 147854 40 -5.35 4.00 Prom + 150536 150575 40 -6.25 4.01 Sngl + 151826 152323 498 1 0 90 55 318 0.879 24.49 4.02 PlyA + 155646 155651 6 1.05 5.00 Prom + 157650 157689 40 -3.45 5.01 Init + 162357 162449 93 2 0 95 65 46 0.194 3.43 5.02 Term + 176499 176654 156 2 0 67 38 126 0.344 2.45 5.03 PlyA + 177180 177185 6 1.05 6.05 PlyA - 177456 177451 6 1.05 6.04 Term - 181082 180619 464 0 2 84 33 577 0.981 46.03 6.03 Intr - 182659 182538 122 0 2 51 116 95 0.130 7.82 6.02 Intr - 190309 190254 56 1 2 115 32 35 0.031 -2.54 6.01 Intr - 198661 198598 64 0 1 66 113 82 0.181 6.30 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 54056 53403 654 2 0 91 36 293 0.876 20.32 S.002 Term + 75702 75797 96 2 0 108 38 113 0.970 5.29 S.003 Sngl - 90805 90512 294 1 0 51 38 265 0.835 13.25 S.004 Term - 107843 107521 323 0 2 108 44 161 0.810 7.70 S.005 Init - 182601 182538 64 0 1 63 116 72 0.846 8.99 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:10701617_10902570|GENSCAN_predicted_peptide_1|795_aa MPSLTTPIKIVLEVLARAIRQEKEIKSIQLGKEEVKLSLFTDGMIIYLENPTVSAQNLLK LISNFGKVSGYKINVQKSQALLYTSNRQTESQITSKLPFTIATKRIKYLGIQLTRDVKDL FKENYKPMLNKIKEDTNKWKNIPDSWIGRINIVKMHILPKLGPLQVLQRLVRNQKFTISN VKRSFEKLGEAVEKRYSVNTVLKKESSQRSDNKQAKLTRVVVPSNDWTQLEARAKKPVEA VQSGSFSGAQSREKNGNERDPLITKFTLNRRWSPCGAKGISNPNPAFSRQTSNSNSTVAY FCRKPRWGRGPRSHGHRGRRLFSHRRRQRRRGEKSSRGLRVPSAGRLPCRRSQTGTVADS RGGKCLRLEKKGEAVWERGPWVGMDVGEGPTRGPGALGSLLGIWERRAFPLGKRRALEGD GEGKIEKLLARKGQVSVPDALPTWAWERRHAPVGHMPAKRCCNRLRETWVLVPRLPPPLC AHGCKQPFSVHQWLHASLFPDCGNARGVAAEKTLSEVGVLVKAPLSEQLSIEVNSAEKQI TAIKKNNPRKYLRSVGDGETVEFDVVEGEKGAEAANVTGPDGVPVEGSRYAADRRRYRRG YYGRRRGPPRNYAGEEEEEGSGSSEGFDPPATDRQFSGARNQLRRPQYRPQYRQRRFPPY HVGQTFDRRSRVLPHPNRIQSYPWSLPYPLPHQQLLKPLNGQIKAGEIGEMKDGVPEGAQ LQGPVHRNPTYRPRYRSRGPPRPRPAPAVGEAEDKENQQATSGPNQPSVRRGYRRPYNYR RRPRPPNAPSQDGKE >gi568815586r:10701617_10902570|GENSCAN_predicted_CDS_1|2385_bp atgccctctctcaccactcctattaaaatagtgttggaagttctggccagggcaatcagg caagagaaagaaataaagagtattcaattaggaaaagaggaagtcaaattgtccctgttt acagatggcatgattatatacttagaaaaccccactgtctcagcccaaaatctccttaag ctgataagcaacttcggcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagca ttactatacaccagtaacagacaaacagagagccaaatcacgagtaaactcccattcaca attgctacaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctc ttcaaggagaactacaaaccaatgctcaacaaaataaaagaggacacaaacaaatggaag aacattccagattcatggataggaagaatcaatattgtgaaaatgcacatactgcccaag ttgggccctcttcaggtgctgcagagattggtgagaaaccagaagttcacaatcagtaat gtcaaaagaagctttgagaagctgggagaggctgtggaaaaaaggtacagtgtgaacaca gtcttgaaaaaagaaagctctcagaggagtgataacaaacaagccaagttaactagagtt gtagtgccctccaatgattggacccagctggaagccagagcaaagaagcctgtagaagct gttcaaagtggttcattctctggggcacagagcagggagaaaaatggaaatgagagggat cctctgattaccaaatttactctcaacaggcgttggtccccatgtggagcaaaaggcatc agtaaccccaatccagctttttctcgacagacttctaactcaaacagcactgtggcctat ttctgcaggaaaccccggtggggacgcggcccccgcagccacgggcaccgcggccgccgc ctctttagccaccgccgccggcagcgaagacgcggagaaaaaagttctcggggtctccgg gtccccagcgctggccgcctcccttgccggcgctcccagacgggcactgttgcggattcg cgtggtggaaaatgcctgcgtttggagaagaaaggagaggcagtctgggagaggggacct tgggtagggatggatgtaggggaaggaccgacacgtgggcctggcgctttggggtccttg cttgggatctgggaaaggagagcatttcctttggggaagaggagggcgctagaaggagac ggggagggaaagatagaaaagcttcttgccaggaagggtcaggtgtctgtccctgacgcc cttcccacatgggcatgggaaagacgccatgctcctgttggccacatgccagcaaagaga tgttgcaatcgtctccgggaaacttgggtcttagtcccacgtcttcctccccctctctgt gcccacggctgcaaacagccattcagtgtccaccaatggctacacgcttccctcttcccg gactgtgggaacgcacgtggtgtagcagctgaaaaaactttgtcggaagtgggggtattg gttaaagctcctctgtcagagcagctctcaattgaagtaaatagcgcagagaagcagata actgccatcaagaagaataacccacggaaatatctgcgcagtgtaggagatggagaaact gtagagtttgatgtggttgaaggagagaagggtgcagaagctgccaatgtgactggcccg gatggagttcctgtggaagggagtcgttacgctgcagatcggcgccgttacagacgtggc tactatggaaggcgccgtggccctccccggaattacgctggggaggaggaggaggaaggg agcggcagcagtgaaggatttgacccccctgccactgataggcagttctctggggcccgg aatcagctgcgccgcccccagtatcgccctcagtaccggcagcggcggttcccgccttac cacgtgggacagacctttgaccgtcgctcacgggtcttaccccatcccaacagaatacag agttacccctggtctctcccttacccgttacctcaccaacaacttctaaagccattaaat gggcagatcaaggctggtgagattggagagatgaaggatggagtcccagagggagcacaa cttcagggaccggttcatcgaaatccaacttaccgcccaaggtaccgtagcaggggacct cctcgcccacgacctgccccagcagttggagaggctgaagataaagaaaatcagcaagcc accagtggtccaaaccagccgtctgttcgccgtggataccggcgtccctacaattaccgg cgtcgcccgcgtcctcctaacgctccttcacaagatggcaaagag >gi568815586r:10701617_10902570|GENSCAN_predicted_peptide_2|848_aa MEFHDATLSCFWVPVVLEPEPTELGVVALGQIWMQPKVQPGVIMTPREAPKETGCSQGAA CPSCIPQVLRILHFQAEEAGLEHPSISSAFSSHQTLLTRAWDFRHTKQIRLHATGFRDPS TEAHMRAIKAVIIFLLLLIVYYPVFLVMTSSALIPQGKLVLMIELRVTKAFESRTSSTLM EAMVSISLDAVRTSDTVLYGLRGLEYSEDKKMWESLELPRDWLNGFDQNADSDMNNKVRV EVVSDEDEELLGTEVKPRDLVPCIPAAPAMAKRGQGTAQVMASGGTSPKFWQLPCGVESA GAQKSIIEVWEPPPRFQRMCGNAWMSRQRCAAGAGPSWRTSAKAVWKGNVGLETSHRVPT WALPSGAVRRGSPSSTPQNGRSTNNLHHVPGKTADTQHQPVKAVKREVVPCKPTEAELPR AVAAHLLHQHNLDNPDFVQGNNVPSGSQVGAFRNSLLADLTVTALELPNYSVRSHGPGSR LAHIDSDSRSAQLQTGSHDTSVQASPWNPTLQIYLNKPRVQNCYSTSQYQADLMDPSSRT TATDPGSRTDFVDPGPKTQAYYYIMDPGARPAQILIQTPNQSLQRFCYMAHTESLDELTG EGFSLLKPVYKEWKRDLVPCIPATPAMTKRGHGTARAIASEGGSPKPWQLPCGIEPAGPQ KSIIKVWEPPPRFQRMYGNSWMSRQKLAAGSGPSWRTSARAVWKGNVRLEPPHRVPTGTP PSGAVRIGPPSSRPQNGRSTDSLLHVPGKVTDTQHQPMKAARNGTIPCKATGAELPKAMG AHFLHQHDLDKLAKRVKAVHKAMTVMAMIPTGEPTVIQTFGNPHSNAKETLPFSSSFYRK RTRKGEIK >gi568815586r:10701617_10902570|GENSCAN_predicted_CDS_2|2547_bp atggaattccatgatgccactctttcctgcttctgggtccctgtagtcttagagccagag cccacagagctgggggtggtggcattggggcagatctggatgcagcccaaggtacagcca ggagtgattatgactcctagagaagctcctaaggagactggatgttctcaaggagcagcc tgtccatcttgcatcccccaggttctccggattctccatttccaagcagaagaagcagga ctggagcacccttccatatcatcagccttctcctcgcaccagaccttactcaccagggcc tgggactttagacacaccaagcagattcgactgcatgctacagggttcagagaccccagt acagaggcccacatgagggccataaaggcagtgatcatctttctgctcctcctcatcgtg tactacccagtctttcttgttatgacctctagcgctctgattcctcagggaaaattagtg ttgatgattgagctcagggtcaccaaagcctttgagagcaggactagcagcaccttgatg gaagccatggtatccatctcccttgatgctgtaagaacctcagacacagtcctttatggg ttgagaggtttggagtactcagaagacaaaaagatgtgggagagtttggaacttcctaga gactggttgaatggctttgaccaaaatgctgatagcgatatgaacaataaggtccgggtt gaggtggtctcagatgaagatgaggaacttttgggaactgaagtaaagcctagggactta gtgccctgcatcccagctgctccagccatggctaaaaggggccaaggtacagctcaggtc atggcttcagggggtacaagccccaagttttggcagcttccatgtggtgttgagtctgca ggtgcacagaagtcaataattgaggtttgggaacctccacctagatttcagagaatgtgt ggaaatgcgtggatgtccaggcagaggtgtgctgcaggggctgggccctcatggagaact tctgctaaggcagtgtggaagggaaatgtggggttggagacctcacacagagtccccact tgggcactgcctagtggagctgtgagaagagggtcaccatcctctacaccccaaaatggt agatccaccaacaacttgcaccatgtacctggaaaaactgcagacactcaacaccagcct gtgaaagcagtcaagagggaggttgtaccttgcaaacccacagaggcggagctgcccagg gctgtggcagcccaccttttgcatcagcataacctagataaccctgattttgttcagggg aataatgtgcccagtggatctcaagtgggagcatttcggaactctttgctggctgatctg actgtcacagcactggagctccctaactattcagttagatcccatggcccaggatccagg ctggcccacatagactcagactccaggtctgcccagctccagactggttcccatgacacc agtgtccaggccagcccctggaaccccacactacagatttacttgaacaaacccagggtc cagaactgctacagtacatcccaataccaggctgacctcatggacccaagctccaggacc actgctacagatccaggatccaggacagactttgtggatccaggaccaaagacccaagcc tactattatatcatggacccaggtgccagacctgctcaaatattaatccagacaccaaac cagtctctccagagattctgttacatggcccacacagaatctctagatgaactgactggt gaagggttttccctgctgaagccagtctataaagaatggaaaagggacttggtgccctgc atcccagctactccagccatgactaaaaggggccatggtacagctcgggctattgcttca gagggtggaagtcccaagccttggcagcttccatgtggtattgagcctgcaggtccacag aaatctataattaaggtttgggaacctccacctagatttcagaggatgtatggaaactcc tggatgtccaggcagaagttagctgcagggtctgggccctcatggagaacttctgctagg gcagtgtggaagggaaatgtgagattggagcccccacacagagtccctactgggacacca cctagtggagctgtgagaatagggccaccatcctccagaccccagaatggtagatctacc gacagcttgctccatgtgcctggaaaagtcacagacactcaacaccagcccatgaaagca gccaggaatgggactataccctgcaaagccacaggggcggagctgcccaaggccatggga gcccactttttgcatcagcatgacctggataagctggcgaaacgcgtgaaggccgtacac aaggcaatgacggtgatggcaatgattcccacgggagaaccaactgtgatccagaccttt ggcaatcctcacagtaatgcaaaggagactttaccattctcttcctcattttacagaaaa agaactagaaagggagaaatcaagtaa >gi568815586r:10701617_10902570|GENSCAN_predicted_peptide_3|134_aa MLLVLLSVVLLALSSAQSTDNDVNYEDFTFTIPDVEDSSQRPDQGPQRPPPEGLLPRPPG DSGNQDDGPQQRPPKPGGHHRHPPPPPFQNQQRPPRRGHRQLSLPRFPSVSLQEASSFFQ RDRPARHPQEQPLW >gi568815586r:10701617_10902570|GENSCAN_predicted_CDS_3|405_bp atgctgctggtcctgctctcagtggtccttctggctctgagctcagctcagagcacagat aatgatgtgaactatgaagactttactttcaccataccagatgtagaggactcaagtcag agaccagatcagggaccccagagacctcctcctgaaggactcctacctagaccccctggt gatagtggtaaccaagatgatggtcctcagcagagaccaccaaaaccaggaggccatcac cgccatcctcccccacctccttttcaaaatcagcaacgaccaccccgacgaggacaccgt caactctctctaccccgatttccttctgtcagcctgcaggaagcatcatcattcttccag agggacagaccagcaagacatccccaggagcaaccactctggtaa >gi568815586r:10701617_10902570|GENSCAN_predicted_peptide_4|165_aa MDGPATPVSTDSNPPTQQEDSSACKCTHLEKRLFPLLLVAQLLLSPPGAAAVKCQLDPAK WQDPQHSSICSVLHLRHWKGCEPDIGSQSTCFPEPESCLPVAADTDSNVTPATQQQRCCT LACNLGTGPLHLLLSLLMQLGARACATGSDLTSTSSRATVNLHVP >gi568815586r:10701617_10902570|GENSCAN_predicted_CDS_4|498_bp atggatggccctgctacacctgtgagcactgacagcaacccgcccacccagcaggaagac agcagtgcatgtaagtgcacacaccttgagaaaaggctcttcccactgctgctggtggca cagttgctgctgtcaccaccgggggctgcagcagtgaaatgccagttggacccagcaaag tggcaggatcctcagcattctagcatatgcagtgttctgcacctcaggcactggaaaggc tgtgaaccagacatagggagccaaagcacatgctttccagaaccagagagctgcctccct gtggctgctgacacagacagcaatgtcacccccgcaacacagcagcagagatgctgcaca cttgcatgcaacctggggacaggccctctccatctgctgctgagtctgctgatgcagctg ggggccagagcatgtgccactggcagtgacctgacttccaccagcagcagagccactgtg aacttgcacgtaccctga >gi568815586r:10701617_10902570|GENSCAN_predicted_peptide_5|82_aa MDGTGGHYVKGNKPGTEKQTVHVLTYVDHTETKKPEYIEKHINETIHFPELKVNSPYRKG PMIIQKPKFNIFRAQCKKTISG >gi568815586r:10701617_10902570|GENSCAN_predicted_CDS_5|249_bp atggatggaactggagggcattatgttaagggaaataagccaggcacagaaaaacaaaca gtgcatgttctcacttatgttgatcacacagagaccaagaaaccagaatatattgagaaa cacatcaatgaaacaatacatttcccagaactaaaggttaattctccatatcgaaagggc ccaatgattatccaaaagcccaaatttaacattttcagggctcagtgcaagaagacaatt agtggctaa >gi568815586r:10701617_10902570|GENSCAN_predicted_peptide_6|235_aa XALEKQGTLPSEHWVAQIPSGKRYLLTFQCKASSSEGKMDRELTRFSQHKVGSDTRASCK MLLILLSVALLAFSSAQDLNEDGGDSEQFLDEERQGPPLGGQQSQPSAGDGNQDDGPQQG PPQQGGQQQQGPPPPQGKPQGPPQQGGQQQQGPPPPQGKPQGPPQQGGHPPPPQGRPQGP PQQGGHPRPPRGRPQGPPQQGGHQQGPPPPPPGKPQGPPPQGGRPQGPPQGQSPQ >gi568815586r:10701617_10902570|GENSCAN_predicted_CDS_6|708_bp nnggctttggagaaacaaggtactcttccctctgagcactgggttgctcagatccccagt ggaaagagatatttgcttacattccagtgtaaagcttcttcttcagaaggcaagatggac agggagctgacacgtttctcccagcacaaagttgggagtgacaccagagcctcctgcaag atgcttctgattctgctgtcagtggccctgctggccttcagctcagctcaggatttaaat gaagatggaggagactctgagcagttcctagatgaggagcgtcagggaccacctttggga ggacagcaatctcaaccctctgctggtgatgggaaccaggatgatggccctcagcaggga ccaccccaacaaggaggccagcagcaacaaggtccaccacctcctcagggaaagccacaa ggaccaccccaacaaggaggccagcagcaacaaggtccaccacctcctcagggaaagcca caaggaccaccccaacagggaggccatccccctcctcctcaaggaaggccacaaggacca ccccaacagggaggccatccccgtcctcctcgaggaaggccacaaggaccaccccaacag ggaggccatcagcaaggtcctcccccacctcctcctggaaagccccagggaccacctccc caagggggccgcccacaaggacctccacaggggcagtctcctcagtaa