GENSCAN 1.0 Date run: 5-Nov-116 Time: 03:31:44 Sequence gi568815586r:10709140_10910075 : 200936 bp : 37.23% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.12 Intr - 975 769 207 0 0 59 98 300 0.905 26.45 1.11 Intr - 4194 4072 123 0 0 103 63 295 0.982 28.36 1.10 Intr - 6644 6555 90 2 0 99 100 135 0.998 15.07 1.09 Intr - 8569 8504 66 1 0 61 55 90 0.691 1.18 1.08 Intr - 13422 12928 495 0 0 56 43 406 0.669 25.16 1.07 Intr - 13564 13516 49 0 1 49 99 42 0.390 -0.84 1.06 Intr - 13812 13711 102 2 0 60 109 94 0.104 7.07 1.05 Intr - 22020 21879 142 1 1 100 91 34 0.018 3.39 1.04 Intr - 23933 23832 102 0 0 112 58 64 0.122 5.03 1.03 Intr - 31131 31092 40 0 1 57 87 23 0.036 -4.02 1.02 Intr - 32185 32042 144 1 0 73 69 82 0.081 4.36 1.01 Init - 36188 35709 480 2 0 78 86 147 0.055 8.91 1.00 Prom - 37836 37797 40 -6.15 2.12 PlyA - 39382 39377 6 1.05 2.11 Term - 41024 40848 177 2 0 41 32 167 0.008 3.10 2.10 Intr - 46567 46042 526 0 1 43 39 280 0.000 10.62 2.09 Intr - 55139 54998 142 0 1 85 77 53 0.100 2.39 2.08 Intr - 55580 55292 289 2 1 21 31 215 0.020 5.00 2.07 Intr - 65268 65155 114 0 0 79 89 89 0.013 7.82 2.06 Intr - 82756 82226 531 1 0 51 39 218 0.010 5.40 2.05 Intr - 83310 83148 163 2 1 2 38 198 0.001 5.26 2.04 Intr - 89098 88984 115 2 1 33 31 101 0.002 -2.51 2.03 Intr - 100320 100138 183 1 0 108 -11 164 0.020 7.34 2.02 Intr - 107008 106855 154 1 1 74 24 115 0.586 2.42 2.01 Init - 107853 107701 153 0 0 67 81 55 0.361 2.74 2.00 Prom - 127237 127198 40 -0.35 3.04 PlyA - 127325 127320 6 1.05 3.03 Term - 138228 137924 305 1 2 51 33 238 0.401 8.95 3.02 Intr - 139268 139233 36 0 0 111 99 23 0.748 3.02 3.01 Init - 140298 140235 64 0 1 66 53 102 0.855 4.18 3.00 Prom - 140370 140331 40 -5.35 4.00 Prom + 143013 143052 40 -6.25 4.01 Sngl + 144303 144800 498 2 0 90 55 318 0.879 24.49 4.02 PlyA + 148123 148128 6 1.05 5.00 Prom + 150127 150166 40 -3.45 5.01 Init + 154834 154926 93 0 0 95 65 46 0.194 3.43 5.02 Term + 168976 169131 156 0 0 67 38 126 0.344 2.45 5.03 PlyA + 169657 169662 6 1.05 6.07 PlyA - 169933 169928 6 1.05 6.06 Term - 173559 173096 464 1 2 84 33 577 0.981 46.03 6.05 Intr - 175136 175015 122 1 2 51 116 95 0.129 7.82 6.04 Intr - 182786 182731 56 2 2 115 32 35 0.030 -2.54 6.03 Intr - 191148 191075 74 1 2 7 113 100 0.384 2.61 6.02 Intr - 192215 192098 118 2 1 54 61 101 0.555 3.12 6.01 Init - 200159 199971 189 2 0 64 83 68 0.516 2.96 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 46533 45880 654 0 0 91 36 293 0.876 20.32 S.002 Term + 68179 68274 96 0 0 108 38 113 0.970 5.29 S.003 Sngl - 83282 82989 294 2 0 51 38 265 0.835 13.25 S.004 Term - 100320 99998 323 1 2 108 44 161 0.810 7.70 S.005 Init - 175078 175015 64 1 1 63 116 72 0.847 8.99 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:10709140_10910075|GENSCAN_predicted_peptide_1|680_aa MPSLTTPIKIVLEVLARAIRQEKEIKSIQLGKEEVKLSLFTDGMIIYLENPTVSAQNLLK LISNFGKVSGYKINVQKSQALLYTSNRQTESQITSKLPFTIATKRIKYLGIQLTRDVKDL FKENYKPMLNKIKEDTNKWKNIPDSWIGRINIVKMHILPKLGPLQVLQRLVRNQKFTISN VKRSFEKLGEAVEKRYSVNTVLKKESSQRSDNKQAKLTRVVVPSNDWTQLEARAKKPVEA VQSGSFSGAQSREKNGNERDPLITKFTLNRRWSPCGAKGISNPNPAFSRQTSNSNSTVAY FCRKPRWGRGPRSHGHRGRRLFSHRRRQRRRGEKSSRGLRVPSAGRLPCRRSQTGTVADS RGGKCLRLEKKGEAVWERGPWVGMDVGEGPTRGPGALGSLLGIWERRAFPLGKRRALEGD GEGKIEKLLARKGQVSVPDALPTWAWERRHAPVGHMPAKRCCNRLRETWVLVPRLPPPLC AHGCKQPFSVHQWLHASLFPDCGNARGVAAEKTLSEVGVLVKAPLSEQLSIEVNSAEKQI TAIKKNNPRKYLRSVGDGETVEFDVVEGEKGAEAANVTGPDGVPVEGSRYAADRRRYRRG YYGRRRGPPRNYAGEEEEEGSGSSEGFDPPATDRQFSGARNQLRRPQYRPQYRQRRFPPY HVGQTFDRRSRVLPHPNRIQ >gi568815586r:10709140_10910075|GENSCAN_predicted_CDS_1|2040_bp atgccctctctcaccactcctattaaaatagtgttggaagttctggccagggcaatcagg caagagaaagaaataaagagtattcaattaggaaaagaggaagtcaaattgtccctgttt acagatggcatgattatatacttagaaaaccccactgtctcagcccaaaatctccttaag ctgataagcaacttcggcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagca ttactatacaccagtaacagacaaacagagagccaaatcacgagtaaactcccattcaca attgctacaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctc ttcaaggagaactacaaaccaatgctcaacaaaataaaagaggacacaaacaaatggaag aacattccagattcatggataggaagaatcaatattgtgaaaatgcacatactgcccaag ttgggccctcttcaggtgctgcagagattggtgagaaaccagaagttcacaatcagtaat gtcaaaagaagctttgagaagctgggagaggctgtggaaaaaaggtacagtgtgaacaca gtcttgaaaaaagaaagctctcagaggagtgataacaaacaagccaagttaactagagtt gtagtgccctccaatgattggacccagctggaagccagagcaaagaagcctgtagaagct gttcaaagtggttcattctctggggcacagagcagggagaaaaatggaaatgagagggat cctctgattaccaaatttactctcaacaggcgttggtccccatgtggagcaaaaggcatc agtaaccccaatccagctttttctcgacagacttctaactcaaacagcactgtggcctat ttctgcaggaaaccccggtggggacgcggcccccgcagccacgggcaccgcggccgccgc ctctttagccaccgccgccggcagcgaagacgcggagaaaaaagttctcggggtctccgg gtccccagcgctggccgcctcccttgccggcgctcccagacgggcactgttgcggattcg cgtggtggaaaatgcctgcgtttggagaagaaaggagaggcagtctgggagaggggacct tgggtagggatggatgtaggggaaggaccgacacgtgggcctggcgctttggggtccttg cttgggatctgggaaaggagagcatttcctttggggaagaggagggcgctagaaggagac ggggagggaaagatagaaaagcttcttgccaggaagggtcaggtgtctgtccctgacgcc cttcccacatgggcatgggaaagacgccatgctcctgttggccacatgccagcaaagaga tgttgcaatcgtctccgggaaacttgggtcttagtcccacgtcttcctccccctctctgt gcccacggctgcaaacagccattcagtgtccaccaatggctacacgcttccctcttcccg gactgtgggaacgcacgtggtgtagcagctgaaaaaactttgtcggaagtgggggtattg gttaaagctcctctgtcagagcagctctcaattgaagtaaatagcgcagagaagcagata actgccatcaagaagaataacccacggaaatatctgcgcagtgtaggagatggagaaact gtagagtttgatgtggttgaaggagagaagggtgcagaagctgccaatgtgactggcccg gatggagttcctgtggaagggagtcgttacgctgcagatcggcgccgttacagacgtggc tactatggaaggcgccgtggccctccccggaattacgctggggaggaggaggaggaaggg agcggcagcagtgaaggatttgacccccctgccactgataggcagttctctggggcccgg aatcagctgcgccgcccccagtatcgccctcagtaccggcagcggcggttcccgccttac cacgtgggacagacctttgaccgtcgctcacgggtcttaccccatcccaacagaatacag >gi568815586r:10709140_10910075|GENSCAN_predicted_peptide_2|848_aa MEFHDATLSCFWVPVVLEPEPTELGVVALGQIWMQPKVQPGVIMTPREAPKETGCSQGAA CPSCIPQVLRILHFQAEEAGLEHPSISSAFSSHQTLLTRAWDFRHTKQIRLHATGFRDPS TEAHMRAIKAVIIFLLLLIVYYPVFLVMTSSALIPQGKLVLMIELRVTKAFESRTSSTLM EAMVSISLDAVRTSDTVLYGLRGLEYSEDKKMWESLELPRDWLNGFDQNADSDMNNKVRV EVVSDEDEELLGTEVKPRDLVPCIPAAPAMAKRGQGTAQVMASGGTSPKFWQLPCGVESA GAQKSIIEVWEPPPRFQRMCGNAWMSRQRCAAGAGPSWRTSAKAVWKGNVGLETSHRVPT WALPSGAVRRGSPSSTPQNGRSTNNLHHVPGKTADTQHQPVKAVKREVVPCKPTEAELPR AVAAHLLHQHNLDNPDFVQGNNVPSGSQVGAFRNSLLADLTVTALELPNYSVRSHGPGSR LAHIDSDSRSAQLQTGSHDTSVQASPWNPTLQIYLNKPRVQNCYSTSQYQADLMDPSSRT TATDPGSRTDFVDPGPKTQAYYYIMDPGARPAQILIQTPNQSLQRFCYMAHTESLDELTG EGFSLLKPVYKEWKRDLVPCIPATPAMTKRGHGTARAIASEGGSPKPWQLPCGIEPAGPQ KSIIKVWEPPPRFQRMYGNSWMSRQKLAAGSGPSWRTSARAVWKGNVRLEPPHRVPTGTP PSGAVRIGPPSSRPQNGRSTDSLLHVPGKVTDTQHQPMKAARNGTIPCKATGAELPKAMG AHFLHQHDLDKLAKRVKAVHKAMTVMAMIPTGEPTVIQTFGNPHSNAKETLPFSSSFYRK RTRKGEIK >gi568815586r:10709140_10910075|GENSCAN_predicted_CDS_2|2547_bp atggaattccatgatgccactctttcctgcttctgggtccctgtagtcttagagccagag cccacagagctgggggtggtggcattggggcagatctggatgcagcccaaggtacagcca ggagtgattatgactcctagagaagctcctaaggagactggatgttctcaaggagcagcc tgtccatcttgcatcccccaggttctccggattctccatttccaagcagaagaagcagga ctggagcacccttccatatcatcagccttctcctcgcaccagaccttactcaccagggcc tgggactttagacacaccaagcagattcgactgcatgctacagggttcagagaccccagt acagaggcccacatgagggccataaaggcagtgatcatctttctgctcctcctcatcgtg tactacccagtctttcttgttatgacctctagcgctctgattcctcagggaaaattagtg ttgatgattgagctcagggtcaccaaagcctttgagagcaggactagcagcaccttgatg gaagccatggtatccatctcccttgatgctgtaagaacctcagacacagtcctttatggg ttgagaggtttggagtactcagaagacaaaaagatgtgggagagtttggaacttcctaga gactggttgaatggctttgaccaaaatgctgatagcgatatgaacaataaggtccgggtt gaggtggtctcagatgaagatgaggaacttttgggaactgaagtaaagcctagggactta gtgccctgcatcccagctgctccagccatggctaaaaggggccaaggtacagctcaggtc atggcttcagggggtacaagccccaagttttggcagcttccatgtggtgttgagtctgca ggtgcacagaagtcaataattgaggtttgggaacctccacctagatttcagagaatgtgt ggaaatgcgtggatgtccaggcagaggtgtgctgcaggggctgggccctcatggagaact tctgctaaggcagtgtggaagggaaatgtggggttggagacctcacacagagtccccact tgggcactgcctagtggagctgtgagaagagggtcaccatcctctacaccccaaaatggt agatccaccaacaacttgcaccatgtacctggaaaaactgcagacactcaacaccagcct gtgaaagcagtcaagagggaggttgtaccttgcaaacccacagaggcggagctgcccagg gctgtggcagcccaccttttgcatcagcataacctagataaccctgattttgttcagggg aataatgtgcccagtggatctcaagtgggagcatttcggaactctttgctggctgatctg actgtcacagcactggagctccctaactattcagttagatcccatggcccaggatccagg ctggcccacatagactcagactccaggtctgcccagctccagactggttcccatgacacc agtgtccaggccagcccctggaaccccacactacagatttacttgaacaaacccagggtc cagaactgctacagtacatcccaataccaggctgacctcatggacccaagctccaggacc actgctacagatccaggatccaggacagactttgtggatccaggaccaaagacccaagcc tactattatatcatggacccaggtgccagacctgctcaaatattaatccagacaccaaac cagtctctccagagattctgttacatggcccacacagaatctctagatgaactgactggt gaagggttttccctgctgaagccagtctataaagaatggaaaagggacttggtgccctgc atcccagctactccagccatgactaaaaggggccatggtacagctcgggctattgcttca gagggtggaagtcccaagccttggcagcttccatgtggtattgagcctgcaggtccacag aaatctataattaaggtttgggaacctccacctagatttcagaggatgtatggaaactcc tggatgtccaggcagaagttagctgcagggtctgggccctcatggagaacttctgctagg gcagtgtggaagggaaatgtgagattggagcccccacacagagtccctactgggacacca cctagtggagctgtgagaatagggccaccatcctccagaccccagaatggtagatctacc gacagcttgctccatgtgcctggaaaagtcacagacactcaacaccagcccatgaaagca gccaggaatgggactataccctgcaaagccacaggggcggagctgcccaaggccatggga gcccactttttgcatcagcatgacctggataagctggcgaaacgcgtgaaggccgtacac aaggcaatgacggtgatggcaatgattcccacgggagaaccaactgtgatccagaccttt ggcaatcctcacagtaatgcaaaggagactttaccattctcttcctcattttacagaaaa agaactagaaagggagaaatcaagtaa >gi568815586r:10709140_10910075|GENSCAN_predicted_peptide_3|134_aa MLLVLLSVVLLALSSAQSTDNDVNYEDFTFTIPDVEDSSQRPDQGPQRPPPEGLLPRPPG DSGNQDDGPQQRPPKPGGHHRHPPPPPFQNQQRPPRRGHRQLSLPRFPSVSLQEASSFFQ RDRPARHPQEQPLW >gi568815586r:10709140_10910075|GENSCAN_predicted_CDS_3|405_bp atgctgctggtcctgctctcagtggtccttctggctctgagctcagctcagagcacagat aatgatgtgaactatgaagactttactttcaccataccagatgtagaggactcaagtcag agaccagatcagggaccccagagacctcctcctgaaggactcctacctagaccccctggt gatagtggtaaccaagatgatggtcctcagcagagaccaccaaaaccaggaggccatcac cgccatcctcccccacctccttttcaaaatcagcaacgaccaccccgacgaggacaccgt caactctctctaccccgatttccttctgtcagcctgcaggaagcatcatcattcttccag agggacagaccagcaagacatccccaggagcaaccactctggtaa >gi568815586r:10709140_10910075|GENSCAN_predicted_peptide_4|165_aa MDGPATPVSTDSNPPTQQEDSSACKCTHLEKRLFPLLLVAQLLLSPPGAAAVKCQLDPAK WQDPQHSSICSVLHLRHWKGCEPDIGSQSTCFPEPESCLPVAADTDSNVTPATQQQRCCT LACNLGTGPLHLLLSLLMQLGARACATGSDLTSTSSRATVNLHVP >gi568815586r:10709140_10910075|GENSCAN_predicted_CDS_4|498_bp atggatggccctgctacacctgtgagcactgacagcaacccgcccacccagcaggaagac agcagtgcatgtaagtgcacacaccttgagaaaaggctcttcccactgctgctggtggca cagttgctgctgtcaccaccgggggctgcagcagtgaaatgccagttggacccagcaaag tggcaggatcctcagcattctagcatatgcagtgttctgcacctcaggcactggaaaggc tgtgaaccagacatagggagccaaagcacatgctttccagaaccagagagctgcctccct gtggctgctgacacagacagcaatgtcacccccgcaacacagcagcagagatgctgcaca cttgcatgcaacctggggacaggccctctccatctgctgctgagtctgctgatgcagctg ggggccagagcatgtgccactggcagtgacctgacttccaccagcagcagagccactgtg aacttgcacgtaccctga >gi568815586r:10709140_10910075|GENSCAN_predicted_peptide_5|82_aa MDGTGGHYVKGNKPGTEKQTVHVLTYVDHTETKKPEYIEKHINETIHFPELKVNSPYRKG PMIIQKPKFNIFRAQCKKTISG >gi568815586r:10709140_10910075|GENSCAN_predicted_CDS_5|249_bp atggatggaactggagggcattatgttaagggaaataagccaggcacagaaaaacaaaca gtgcatgttctcacttatgttgatcacacagagaccaagaaaccagaatatattgagaaa cacatcaatgaaacaatacatttcccagaactaaaggttaattctccatatcgaaagggc ccaatgattatccaaaagcccaaatttaacattttcagggctcagtgcaagaagacaatt agtggctaa >gi568815586r:10709140_10910075|GENSCAN_predicted_peptide_6|340_aa MESALPSIFTLVIIAEFIIGNLSNGFIVLINCIDWVSKRELSSVDKLLIILAISRIGLIW EILGYWRDWPKLLIQESGGSEYLEICLLVEQNPPPPTKISAQVISRALEKQGTLPSEHWV AQIPSGKRYLLTFQCKASSSEGKMDRELTRFSQHKVGSDTRASCKMLLILLSVALLAFSS AQDLNEDGGDSEQFLDEERQGPPLGGQQSQPSAGDGNQDDGPQQGPPQQGGQQQQGPPPP QGKPQGPPQQGGQQQQGPPPPQGKPQGPPQQGGHPPPPQGRPQGPPQQGGHPRPPRGRPQ GPPQQGGHQQGPPPPPPGKPQGPPPQGGRPQGPPQGQSPQ >gi568815586r:10709140_10910075|GENSCAN_predicted_CDS_6|1023_bp atggaaagtgccctgccgagtatcttcactcttgtaataattgcagaattcataattggg aatttgagcaatggatttatagtactgatcaactgcattgactgggtcagtaaaagagag ctgtcctcagtcgataaactcctcattatcttggcaatctccagaattgggctgatctgg gaaatattaggatattggagagattggcctaaactcttaatccaggagagtgggggctct gaatacctggaaatctgcctgcttgtggagcagaacccacccccaccaaccaagatttct gcacaggttatctccagggctttggagaaacaaggtactcttccctctgagcactgggtt gctcagatccccagtggaaagagatatttgcttacattccagtgtaaagcttcttcttca gaaggcaagatggacagggagctgacacgtttctcccagcacaaagttgggagtgacacc agagcctcctgcaagatgcttctgattctgctgtcagtggccctgctggccttcagctca gctcaggatttaaatgaagatggaggagactctgagcagttcctagatgaggagcgtcag ggaccacctttgggaggacagcaatctcaaccctctgctggtgatgggaaccaggatgat ggccctcagcagggaccaccccaacaaggaggccagcagcaacaaggtccaccacctcct cagggaaagccacaaggaccaccccaacaaggaggccagcagcaacaaggtccaccacct cctcagggaaagccacaaggaccaccccaacagggaggccatccccctcctcctcaagga aggccacaaggaccaccccaacagggaggccatccccgtcctcctcgaggaaggccacaa ggaccaccccaacagggaggccatcagcaaggtcctcccccacctcctcctggaaagccc cagggaccacctccccaagggggccgcccacaaggacctccacaggggcagtctcctcag taa