GENSCAN 1.0    Date run: 5-Nov-116    Time: 03:31:17

Sequence gi568815586r:10706054_10906980 : 200927 bp : 37.30% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.13 Intr -    195    124   72  0  0   56  102    41 0.030   0.98
 1.12 Intr -   4061   3855  207  2  0   59   98   300 0.903  26.45
 1.11 Intr -   7280   7158  123  2  0  103   63   295 0.998  28.36
 1.10 Intr -   9730   9641   90  1  0   99  100   135 0.998  15.07
 1.09 Intr -  11655  11590   66  0  0   61   55    90 0.691   1.18
 1.08 Intr -  16508  16014  495  2  0   56   43   406 0.669  25.16
 1.07 Intr -  16650  16602   49  2  1   49   99    42 0.390  -0.84
 1.06 Intr -  16898  16797  102  1  0   60  109    94 0.104   7.07
 1.05 Intr -  25106  24965  142  0  1  100   91    34 0.018   3.39
 1.04 Intr -  27019  26918  102  2  0  112   58    64 0.122   5.03
 1.03 Intr -  34217  34178   40  2  1   57   87    23 0.036  -4.02
 1.02 Intr -  35271  35128  144  0  0   73   69    82 0.081   4.36
 1.01 Init -  39274  38795  480  1  0   78   86   147 0.055   8.91
 1.00 Prom -  40922  40883   40                              -6.15

 2.12 PlyA -  42468  42463    6                               1.05
 2.11 Term -  44110  43934  177  1  0   41   32   167 0.008   3.10
 2.10 Intr -  49653  49128  526  2  1   43   39   280 0.000  10.62
 2.09 Intr -  58225  58084  142  2  1   85   77    53 0.100   2.39
 2.08 Intr -  58666  58378  289  1  1   21   31   215 0.020   5.00
 2.07 Intr -  68354  68241  114  2  0   79   89    89 0.013   7.82
 2.06 Intr -  85842  85312  531  0  0   51   39   218 0.010   5.40
 2.05 Intr -  86396  86234  163  1  1    2   38   198 0.001   5.26
 2.04 Intr -  92184  92070  115  1  1   33   31   101 0.002  -2.51
 2.03 Intr - 103406 103224  183  0  0  108  -11   164 0.020   7.34
 2.02 Intr - 110094 109941  154  0  1   74   24   115 0.586   2.42
 2.01 Init - 110939 110787  153  2  0   67   81    55 0.361   2.74
 2.00 Prom - 130323 130284   40                              -0.35

 3.04 PlyA - 130411 130406    6                               1.05
 3.03 Term - 141314 141010  305  0  2   51   33   238 0.401   8.95
 3.02 Intr - 142354 142319   36  2  0  111   99    23 0.748   3.02
 3.01 Init - 143384 143321   64  2  1   66   53   102 0.855   4.18
 3.00 Prom - 143456 143417   40                              -5.35

 4.00 Prom + 146099 146138   40                              -6.25
 4.01 Sngl + 147389 147886  498  1  0   90   55   318 0.879  24.49
 4.02 PlyA + 151209 151214    6                               1.05

 5.00 Prom + 153213 153252   40                              -3.45
 5.01 Init + 157920 158012   93  2  0   95   65    46 0.194   3.43
 5.02 Term + 172062 172217  156  2  0   67   38   126 0.344   2.45
 5.03 PlyA + 172743 172748    6                               1.05

 6.05 PlyA - 173019 173014    6                               1.05
 6.04 Term - 176645 176182  464  0  2   84   33   577 0.981  46.03
 6.03 Intr - 178222 178101  122  0  2   51  116    95 0.127   7.82
 6.02 Intr - 185872 185817   56  1  2  115   32    35 0.029  -2.54
 6.01 Intr - 194224 194161   64  0  1   66  113    82 0.173   6.30

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  49619  48966  654  2  0   91   36   293 0.876  20.32
S.002 Term +  71265  71360   96  2  0  108   38   113 0.970   5.29
S.003 Sngl -  86368  86075  294  1  0   51   38   265 0.835  13.25
S.004 Term - 103406 103084  323  0  2  108   44   161 0.810   7.70
S.005 Init - 178164 178101   64  0  1   63  116    72 0.849   8.99

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:10706054_10906980|GENSCAN_predicted_peptide_1|704_aa
MPSLTTPIKIVLEVLARAIRQEKEIKSIQLGKEEVKLSLFTDGMIIYLENPTVSAQNLLK
LISNFGKVSGYKINVQKSQALLYTSNRQTESQITSKLPFTIATKRIKYLGIQLTRDVKDL
FKENYKPMLNKIKEDTNKWKNIPDSWIGRINIVKMHILPKLGPLQVLQRLVRNQKFTISN
VKRSFEKLGEAVEKRYSVNTVLKKESSQRSDNKQAKLTRVVVPSNDWTQLEARAKKPVEA
VQSGSFSGAQSREKNGNERDPLITKFTLNRRWSPCGAKGISNPNPAFSRQTSNSNSTVAY
FCRKPRWGRGPRSHGHRGRRLFSHRRRQRRRGEKSSRGLRVPSAGRLPCRRSQTGTVADS
RGGKCLRLEKKGEAVWERGPWVGMDVGEGPTRGPGALGSLLGIWERRAFPLGKRRALEGD
GEGKIEKLLARKGQVSVPDALPTWAWERRHAPVGHMPAKRCCNRLRETWVLVPRLPPPLC
AHGCKQPFSVHQWLHASLFPDCGNARGVAAEKTLSEVGVLVKAPLSEQLSIEVNSAEKQI
TAIKKNNPRKYLRSVGDGETVEFDVVEGEKGAEAANVTGPDGVPVEGSRYAADRRRYRRG
YYGRRRGPPRNYAGEEEEEGSGSSEGFDPPATDRQFSGARNQLRRPQYRPQYRQRRFPPY
HVGQTFDRRSRVLPHPNRIQSYPWSLPYPLPHQQLLKPLNGQIK

>gi568815586r:10706054_10906980|GENSCAN_predicted_CDS_1|2112_bp
atgccctctctcaccactcctattaaaatagtgttggaagttctggccagggcaatcagg
caagagaaagaaataaagagtattcaattaggaaaagaggaagtcaaattgtccctgttt
acagatggcatgattatatacttagaaaaccccactgtctcagcccaaaatctccttaag ctgataagcaacttcggcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagca ttactatacaccagtaacagacaaacagagagccaaatcacgagtaaactcccattcaca attgctacaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctc ttcaaggagaactacaaaccaatgctcaacaaaataaaagaggacacaaacaaatggaag aacattccagattcatggataggaagaatcaatattgtgaaaatgcacatactgcccaag ttgggccctcttcaggtgctgcagagattggtgagaaaccagaagttcacaatcagtaat gtcaaaagaagctttgagaagctgggagaggctgtggaaaaaaggtacagtgtgaacaca gtcttgaaaaaagaaagctctcagaggagtgataacaaacaagccaagttaactagagtt gtagtgccctccaatgattggacccagctggaagccagagcaaagaagcctgtagaagct gttcaaagtggttcattctctggggcacagagcagggagaaaaatggaaatgagagggat cctctgattaccaaatttactctcaacaggcgttggtccccatgtggagcaaaaggcatc agtaaccccaatccagctttttctcgacagacttctaactcaaacagcactgtggcctat ttctgcaggaaaccccggtggggacgcggcccccgcagccacgggcaccgcggccgccgc ctctttagccaccgccgccggcagcgaagacgcggagaaaaaagttctcggggtctccgg gtccccagcgctggccgcctcccttgccggcgctcccagacgggcactgttgcggattcg cgtggtggaaaatgcctgcgtttggagaagaaaggagaggcagtctgggagaggggacct tgggtagggatggatgtaggggaaggaccgacacgtgggcctggcgctttggggtccttg cttgggatctgggaaaggagagcatttcctttggggaagaggagggcgctagaaggagac ggggagggaaagatagaaaagcttcttgccaggaagggtcaggtgtctgtccctgacgcc cttcccacatgggcatgggaaagacgccatgctcctgttggccacatgccagcaaagaga tgttgcaatcgtctccgggaaacttgggtcttagtcccacgtcttcctccccctctctgt gcccacggctgcaaacagccattcagtgtccaccaatggctacacgcttccctcttcccg gactgtgggaacgcacgtggtgtagcagctgaaaaaactttgtcggaagtgggggtattg gttaaagctcctctgtcagagcagctctcaattgaagtaaatagcgcagagaagcagata actgccatcaagaagaataacccacggaaatatctgcgcagtgtaggagatggagaaact gtagagtttgatgtggttgaaggagagaagggtgcagaagctgccaatgtgactggcccg gatggagttcctgtggaagggagtcgttacgctgcagatcggcgccgttacagacgtggc tactatggaaggcgccgtggccctccccggaattacgctggggaggaggaggaggaaggg agcggcagcagtgaaggatttgacccccctgccactgataggcagttctctggggcccgg aatcagctgcgccgcccccagtatcgccctcagtaccggcagcggcggttcccgccttac cacgtgggacagacctttgaccgtcgctcacgggtcttaccccatcccaacagaatacag 
agttacccctggtctctcccttacccgttacctcaccaacaacttctaaagccattaaat gggcagatcaag >gi568815586r:10706054_10906980|GENSCAN_predicted_peptide_2|848_aa MEFHDATLSCFWVPVVLEPEPTELGVVALGQIWMQPKVQPGVIMTPREAPKETGCSQGAA CPSCIPQVLRILHFQAEEAGLEHPSISSAFSSHQTLLTRAWDFRHTKQIRLHATGFRDPS TEAHMRAIKAVIIFLLLLIVYYPVFLVMTSSALIPQGKLVLMIELRVTKAFESRTSSTLM EAMVSISLDAVRTSDTVLYGLRGLEYSEDKKMWESLELPRDWLNGFDQNADSDMNNKVRV EVVSDEDEELLGTEVKPRDLVPCIPAAPAMAKRGQGTAQVMASGGTSPKFWQLPCGVESA GAQKSIIEVWEPPPRFQRMCGNAWMSRQRCAAGAGPSWRTSAKAVWKGNVGLETSHRVPT WALPSGAVRRGSPSSTPQNGRSTNNLHHVPGKTADTQHQPVKAVKREVVPCKPTEAELPR AVAAHLLHQHNLDNPDFVQGNNVPSGSQVGAFRNSLLADLTVTALELPNYSVRSHGPGSR LAHIDSDSRSAQLQTGSHDTSVQASPWNPTLQIYLNKPRVQNCYSTSQYQADLMDPSSRT TATDPGSRTDFVDPGPKTQAYYYIMDPGARPAQILIQTPNQSLQRFCYMAHTESLDELTG EGFSLLKPVYKEWKRDLVPCIPATPAMTKRGHGTARAIASEGGSPKPWQLPCGIEPAGPQ KSIIKVWEPPPRFQRMYGNSWMSRQKLAAGSGPSWRTSARAVWKGNVRLEPPHRVPTGTP PSGAVRIGPPSSRPQNGRSTDSLLHVPGKVTDTQHQPMKAARNGTIPCKATGAELPKAMG AHFLHQHDLDKLAKRVKAVHKAMTVMAMIPTGEPTVIQTFGNPHSNAKETLPFSSSFYRK RTRKGEIK >gi568815586r:10706054_10906980|GENSCAN_predicted_CDS_2|2547_bp atggaattccatgatgccactctttcctgcttctgggtccctgtagtcttagagccagag cccacagagctgggggtggtggcattggggcagatctggatgcagcccaaggtacagcca ggagtgattatgactcctagagaagctcctaaggagactggatgttctcaaggagcagcc tgtccatcttgcatcccccaggttctccggattctccatttccaagcagaagaagcagga ctggagcacccttccatatcatcagccttctcctcgcaccagaccttactcaccagggcc tgggactttagacacaccaagcagattcgactgcatgctacagggttcagagaccccagt acagaggcccacatgagggccataaaggcagtgatcatctttctgctcctcctcatcgtg tactacccagtctttcttgttatgacctctagcgctctgattcctcagggaaaattagtg ttgatgattgagctcagggtcaccaaagcctttgagagcaggactagcagcaccttgatg gaagccatggtatccatctcccttgatgctgtaagaacctcagacacagtcctttatggg ttgagaggtttggagtactcagaagacaaaaagatgtgggagagtttggaacttcctaga gactggttgaatggctttgaccaaaatgctgatagcgatatgaacaataaggtccgggtt gaggtggtctcagatgaagatgaggaacttttgggaactgaagtaaagcctagggactta gtgccctgcatcccagctgctccagccatggctaaaaggggccaaggtacagctcaggtc atggcttcagggggtacaagccccaagttttggcagcttccatgtggtgttgagtctgca 
ggtgcacagaagtcaataattgaggtttgggaacctccacctagatttcagagaatgtgt ggaaatgcgtggatgtccaggcagaggtgtgctgcaggggctgggccctcatggagaact tctgctaaggcagtgtggaagggaaatgtggggttggagacctcacacagagtccccact tgggcactgcctagtggagctgtgagaagagggtcaccatcctctacaccccaaaatggt agatccaccaacaacttgcaccatgtacctggaaaaactgcagacactcaacaccagcct gtgaaagcagtcaagagggaggttgtaccttgcaaacccacagaggcggagctgcccagg gctgtggcagcccaccttttgcatcagcataacctagataaccctgattttgttcagggg aataatgtgcccagtggatctcaagtgggagcatttcggaactctttgctggctgatctg actgtcacagcactggagctccctaactattcagttagatcccatggcccaggatccagg ctggcccacatagactcagactccaggtctgcccagctccagactggttcccatgacacc agtgtccaggccagcccctggaaccccacactacagatttacttgaacaaacccagggtc cagaactgctacagtacatcccaataccaggctgacctcatggacccaagctccaggacc actgctacagatccaggatccaggacagactttgtggatccaggaccaaagacccaagcc tactattatatcatggacccaggtgccagacctgctcaaatattaatccagacaccaaac cagtctctccagagattctgttacatggcccacacagaatctctagatgaactgactggt gaagggttttccctgctgaagccagtctataaagaatggaaaagggacttggtgccctgc atcccagctactccagccatgactaaaaggggccatggtacagctcgggctattgcttca gagggtggaagtcccaagccttggcagcttccatgtggtattgagcctgcaggtccacag aaatctataattaaggtttgggaacctccacctagatttcagaggatgtatggaaactcc tggatgtccaggcagaagttagctgcagggtctgggccctcatggagaacttctgctagg gcagtgtggaagggaaatgtgagattggagcccccacacagagtccctactgggacacca cctagtggagctgtgagaatagggccaccatcctccagaccccagaatggtagatctacc gacagcttgctccatgtgcctggaaaagtcacagacactcaacaccagcccatgaaagca gccaggaatgggactataccctgcaaagccacaggggcggagctgcccaaggccatggga gcccactttttgcatcagcatgacctggataagctggcgaaacgcgtgaaggccgtacac aaggcaatgacggtgatggcaatgattcccacgggagaaccaactgtgatccagaccttt ggcaatcctcacagtaatgcaaaggagactttaccattctcttcctcattttacagaaaa agaactagaaagggagaaatcaagtaa >gi568815586r:10706054_10906980|GENSCAN_predicted_peptide_3|134_aa MLLVLLSVVLLALSSAQSTDNDVNYEDFTFTIPDVEDSSQRPDQGPQRPPPEGLLPRPPG DSGNQDDGPQQRPPKPGGHHRHPPPPPFQNQQRPPRRGHRQLSLPRFPSVSLQEASSFFQ RDRPARHPQEQPLW >gi568815586r:10706054_10906980|GENSCAN_predicted_CDS_3|405_bp 
atgctgctggtcctgctctcagtggtccttctggctctgagctcagctcagagcacagat aatgatgtgaactatgaagactttactttcaccataccagatgtagaggactcaagtcag agaccagatcagggaccccagagacctcctcctgaaggactcctacctagaccccctggt gatagtggtaaccaagatgatggtcctcagcagagaccaccaaaaccaggaggccatcac cgccatcctcccccacctccttttcaaaatcagcaacgaccaccccgacgaggacaccgt caactctctctaccccgatttccttctgtcagcctgcaggaagcatcatcattcttccag agggacagaccagcaagacatccccaggagcaaccactctggtaa >gi568815586r:10706054_10906980|GENSCAN_predicted_peptide_4|165_aa MDGPATPVSTDSNPPTQQEDSSACKCTHLEKRLFPLLLVAQLLLSPPGAAAVKCQLDPAK WQDPQHSSICSVLHLRHWKGCEPDIGSQSTCFPEPESCLPVAADTDSNVTPATQQQRCCT LACNLGTGPLHLLLSLLMQLGARACATGSDLTSTSSRATVNLHVP >gi568815586r:10706054_10906980|GENSCAN_predicted_CDS_4|498_bp atggatggccctgctacacctgtgagcactgacagcaacccgcccacccagcaggaagac agcagtgcatgtaagtgcacacaccttgagaaaaggctcttcccactgctgctggtggca cagttgctgctgtcaccaccgggggctgcagcagtgaaatgccagttggacccagcaaag tggcaggatcctcagcattctagcatatgcagtgttctgcacctcaggcactggaaaggc tgtgaaccagacatagggagccaaagcacatgctttccagaaccagagagctgcctccct gtggctgctgacacagacagcaatgtcacccccgcaacacagcagcagagatgctgcaca cttgcatgcaacctggggacaggccctctccatctgctgctgagtctgctgatgcagctg ggggccagagcatgtgccactggcagtgacctgacttccaccagcagcagagccactgtg aacttgcacgtaccctga >gi568815586r:10706054_10906980|GENSCAN_predicted_peptide_5|82_aa MDGTGGHYVKGNKPGTEKQTVHVLTYVDHTETKKPEYIEKHINETIHFPELKVNSPYRKG PMIIQKPKFNIFRAQCKKTISG >gi568815586r:10706054_10906980|GENSCAN_predicted_CDS_5|249_bp atggatggaactggagggcattatgttaagggaaataagccaggcacagaaaaacaaaca gtgcatgttctcacttatgttgatcacacagagaccaagaaaccagaatatattgagaaa cacatcaatgaaacaatacatttcccagaactaaaggttaattctccatatcgaaagggc ccaatgattatccaaaagcccaaatttaacattttcagggctcagtgcaagaagacaatt agtggctaa >gi568815586r:10706054_10906980|GENSCAN_predicted_peptide_6|235_aa XALEKQGTLPSEHWVAQIPSGKRYLLTFQCKASSSEGKMDRELTRFSQHKVGSDTRASCK MLLILLSVALLAFSSAQDLNEDGGDSEQFLDEERQGPPLGGQQSQPSAGDGNQDDGPQQG PPQQGGQQQQGPPPPQGKPQGPPQQGGQQQQGPPPPQGKPQGPPQQGGHPPPPQGRPQGP PQQGGHPRPPRGRPQGPPQQGGHQQGPPPPPPGKPQGPPPQGGRPQGPPQGQSPQ 
>gi568815586r:10706054_10906980|GENSCAN_predicted_CDS_6|708_bp nnggctttggagaaacaaggtactcttccctctgagcactgggttgctcagatccccagt ggaaagagatatttgcttacattccagtgtaaagcttcttcttcagaaggcaagatggac agggagctgacacgtttctcccagcacaaagttgggagtgacaccagagcctcctgcaag atgcttctgattctgctgtcagtggccctgctggccttcagctcagctcaggatttaaat gaagatggaggagactctgagcagttcctagatgaggagcgtcagggaccacctttggga ggacagcaatctcaaccctctgctggtgatgggaaccaggatgatggccctcagcaggga ccaccccaacaaggaggccagcagcaacaaggtccaccacctcctcagggaaagccacaa ggaccaccccaacaaggaggccagcagcaacaaggtccaccacctcctcagggaaagcca caaggaccaccccaacagggaggccatccccctcctcctcaaggaaggccacaaggacca ccccaacagggaggccatccccgtcctcctcgaggaaggccacaaggaccaccccaacag ggaggccatcagcaaggtcctcccccacctcctcctggaaagccccagggaccacctccc caagggggccgcccacaaggacctccacaggggcagtctcctcagtaa