GENSCAN 1.0	Date run: 6-Nov-116	Time: 12:22:05

Sequence gi568815584f:95586468_95791318 : 204851 bp : 45.59% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   4329   4513  185  0  2  112    1   141 0.140   7.23
 1.02 Intr +   6110   6240  131  0  2   32   94    16 0.087  -2.99
 1.03 Term +  11319  11441  123  2  0   92   40   123 0.705   6.28
 1.04 PlyA +  11904  11909    6                               1.05

 2.00 Prom +  12125  12164   40                              -3.66
 2.01 Init +  17832  17906   75  2  0   75  119    46 0.865   7.49
 2.02 Term +  24814  24870   57  0  0  123   47    59 0.580   2.99
 2.03 PlyA +  25831  25836    6                               1.05

 3.04 PlyA -  25867  25862    6                               1.05
 3.03 Term -  34627  34442  186  1  0  105   53   182 0.963  13.89
 3.02 Intr -  44957  44928   30  2  0  116   89    -2 0.594   1.03
 3.01 Init -  45982  45830  153  1  0   60   94   129 0.386   8.72
 3.00 Prom -  50633  50594   40                              -4.96

 4.00 Prom +  53974  54013   40                               1.04
 4.01 Init +  65685  65759   75  2  0   49  101    45 0.105   2.99
 4.02 Term +  75514  75669  156  0  0   25   37   140 0.150   0.53
 4.03 PlyA +  75789  75794    6                               1.05

 5.03 PlyA -  76022  76017    6                               1.05
 5.02 Term -  84090  83988  103  2  1  112   35    86 0.967   3.45
 5.01 Init -  84368  84283   86  2  2   97   92    66 0.991   8.19
 5.00 Prom -  87322  87283   40                              -7.06

 6.07 PlyA -  88075  88070    6                               1.05
 6.06 Term -  93470  93369  102  2  0   40   44    84 0.250  -2.52
 6.05 Intr -  94150  94006  145  0  1   20   93   106 0.381   4.58
 6.04 Intr -  97849  97596  254  1  2   54   44   189 0.387   7.33
 6.03 Intr -  99272  99076  197  0  2   35   69    85 0.678   0.43
 6.02 Intr - 100386 100277  110  2  2   58   81    63 0.365   2.53
 6.01 Init - 105363 105302   62  0  2   49   52   130 0.679   4.22
 6.00 Prom - 114241 114202   40                              -2.86

 7.05 PlyA - 119204 119199    6                               1.05
 7.04 Term - 125335 125288   48  1  0  117   41   120 0.993   7.50
 7.03 Intr - 125929 125753  177  1  0  118  113   138 0.997  19.42
 7.02 Intr - 127615 127480  136  0  1   61   77   251 0.902  21.87
 7.01 Init - 132585 132527   59  0  2   54   98    33 0.578   1.68
 7.00 Prom - 141159 141120   40                              -4.26

 8.10 PlyA - 141260 141255    6                               1.05
 8.09 Term - 154766 154657  110  0  2   95   46    41 0.258  -0.73
 8.08 Intr - 156088 155970  119  0  2  111   43    71 0.273   5.01
 8.07 Intr - 156367 156324   44  1  2   67   77    37 0.259  -2.46
 8.06 Intr - 156759 156733   27  0  0   62  115    20 0.065   0.41
 8.05 Intr - 177596 177417  180  2  0   46   37   103 0.083   1.06
 8.04 Intr - 182695 182563  133  0  1   90   87    80 0.845   8.75
 8.03 Intr - 185229 185133   97  1  1   65   82   112 0.899   7.37
 8.02 Intr - 199624 199384  241  1  1  117   64    28 0.048   0.42
 8.01 Init - 200691 200620   72  0  0   78   73    59 0.142   2.60

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584f:95586468_95791318|GENSCAN_predicted_peptide_1|146_aa
XPSRETGNERQTKGDQAVSLIPDTGIIFIPILNFPALTLCRPPSSGDTSQKKTFMRPKNI
ITAYISVNIDYLICKEYNGGNTTFRLWDSTLLQILGRPTLGQAVCQGSRSLQGLLLVLSA
FKAVASLLPQLSADYLPNARAQRFPG

>gi568815584f:95586468_95791318|GENSCAN_predicted_CDS_1|441_bp
nnacccagcagagaaacaggaaatgagaggcagacgaaaggagatcaagctgtcagcttg
attcctgacacaggcatcatcttcatccctatactgaacttcccagcactcacgctctgc
agacctccttcttcaggggacacttctcaaaagaagacatttatgcggccaaaaaacatc
atcactgcatatatttcagtaaacatcgattacttaatatgcaaggagtacaatggcgga
aataccacattcaggctctgggattcaacacttctgcaaatacttggaagacctaccctg
ggccaggctgtgtgccaggggagccgcagcctccaggggctgctcctggtgctgtcagcg
ttcaaagcagtcgcttcacttctgccgcaattatcagccgattatctccccaatgcccgc
gctcagcgcttccctggttaa

>gi568815584f:95586468_95791318|GENSCAN_predicted_peptide_2|43_aa
MPGYANLGYQTLRNCEAINAGCFNLASVHLPEVDQNLNGHWTT

>gi568815584f:95586468_95791318|GENSCAN_predicted_CDS_2|132_bp
atgcctggctatgccaaccttgggtaccagactctcagaaactgtgaggccataaatgct
ggctgttttaacctggcttcagtccacctgcctgaagtggaccagaacctgaatgggcat
tggacgacctga

>gi568815584f:95586468_95791318|GENSCAN_predicted_peptide_3|122_aa
MAASASAQPCSLPCTAAGVAHNSPNMLPARNLPFRVHWLGIESVMAADPGQRPCGYTRPT
QNPAPGPISTKLHSFKHFQKLPEYDAADGEQVPSGVMRQNPGYVFLAGSIRLPRSHFRRL
TS

>gi568815584f:95586468_95791318|GENSCAN_predicted_CDS_3|369_bp
atggcagcctctgcctctgcccagccctgctccctgccctgcaccgctgcaggtgttgct
cacaatagccccaatatgcttcctgcacgcaacctccctttcagagtccactggctggga
atcgaatctgtgatggccgccgacccagggcagagaccctgtggttacaccaggcccacc
cagaacccagccccaggtccaatttccacaaaactccactctttcaagcatttccagaag
ttaccggagtatgatgctgctgatggggagcaggtgccgtcgggggtcatgcgacagaac
cctggctatgtgtttctcgctggctccatccgcttgccgcgctcccacttccgccgactc
acttcctga

>gi568815584f:95586468_95791318|GENSCAN_predicted_peptide_4|76_aa
MKINSEEERGLPWNASVIYDILHNNILCIRQKQVEVTEGECGRSWWVNDRQRCPVKSTTC
DKGGFSLLLNKHAILQ

>gi568815584f:95586468_95791318|GENSCAN_predicted_CDS_4|231_bp
atgaagatcaacagtgaagaggaaaggggtcttccctggaacgcttcagtcatatacgac
attttacacaacaatatactttgcatccggcaaaaacaggtggaggtgactgaaggcgaa
tgtgggaggtcatggtgggttaatgatcgccagcgctgtccggtgaaatcaaccacctgt
gacaaaggtggtttttcacttttactgaacaagcatgctattctacagtaa

>gi568815584f:95586468_95791318|GENSCAN_predicted_peptide_5|62_aa
MDWNGGERGEKALEEGSISELIGLDSGLSFCCNSKKEKEGKKDVTRICNGSCSWPSVNVS
GM

>gi568815584f:95586468_95791318|GENSCAN_predicted_CDS_5|189_bp
atggactggaatggtggagagagaggtgagaaggcgctggaggaaggatctatttcagag
ctgataggactcgacagtgggttgagcttctgttgtaactccaagaaggagaaggaagga
aaaaaggacgtgactcgaatctgcaatggcagctgctcttggcccagtgtgaatgtgtct
gggatgtaa

>gi568815584f:95586468_95791318|GENSCAN_predicted_peptide_6|289_aa
MAQALPGLLQAAAHLRGALYGTTLSHVDKLVPVVPLRGVFLGPGLGMDIAALLDLHRKNK
PVHYQHLSSGSDKAHRNKATGGHVATWPAHPWPWGPECPLGLVVLLARLCQEASEAQEHS
KKQVSHPSTLGPGMKKATELQVTTVWFLVLQLVFVHVMSEISEIRLCLLAKLSISGTCDL
KDSLSGVGVTNVFENHFDLSRIAPRIAWKKQLQMPDSRGQFTCSPAFSAPAKLFSGTPPP
GEPLSSQETTGSLGNTIFHSVAVVLPERGPNPDLKRRFLDLMQEFGASP

>gi568815584f:95586468_95791318|GENSCAN_predicted_CDS_6|870_bp
atggcccaggccctgcctggcctgctccaagctgctgcccatctgagaggagccctctat
gggaccacattatctcatgtggataagcttgtgcctgtggtccctttacggggggtgttc
ctggggccagggctgggcatggacattgctgcgctcctagacctgcacagaaaaaataag
ccagtgcactatcagcatctgagctcaggatcagataaagcccacagaaacaaggccacc
ggaggccacgtggccacctggccagcccatccttggccatggggcccagagtgcccactt
ggcctggtcgtcttacttgctcgactttgtcaagaggcctcagaagctcaagaacatagc
aagaagcaggtctcccatccctcaacccttggacctggtatgaaaaaagccacggagctg
caggtgacaacagtttggttcctggtccttcagctggtctttgtccatgtcatgtctgag
ataagtgagataaggctatgtcttctggccaagttgtccatctctggtacctgtgacctg
aaggattcactctccggggtaggtgtcaccaatgtctttgagaaccactttgatctctcc
agaattgctcccaggattgcctggaaaaagcagctgcagatgcctgacagcaggggtcag
ttcacctgctctccagcgttttcagctcctgccaagctgttctcaggaactccaccccct
ggtgagcctttgtcttctcaggagaccacagggagcctgggcaacacgattttccatagc
gtggctgtagtgttaccagaaaggggtcccaatccagacctcaagagaaggttcttggac
ctcatgcaagaatttggggcaagtccatag

>gi568815584f:95586468_95791318|GENSCAN_predicted_peptide_7|139_aa
MLTLTASNQHRTGNPNQSNRRPEDAMAECPTLGEAVTDHPDRLWAWEKFVYLDEKQHAWL
PLTIEIKDRLQLRVLLRREDVVLGRPMTPTQIGPSLLPIMWQLYPDGRYRSSDSSFWRLV
YHIKIDGVEDMLLELLPDD

>gi568815584f:95586468_95791318|GENSCAN_predicted_CDS_7|420_bp
atgctcactctcactgcttctaatcagcacagaactggaaatcctaatcagagcaacagg
cggcccgaggacgccatggccgagtgcccgacactcggggaggcagtcaccgaccacccg
gaccgcctgtgggcctgggagaagttcgtgtatttggacgagaagcagcacgcctggctg
cccttaaccatcgagataaaggataggttacagttacgggtgctcttgcgtcgggaagac
gtcgtcctggggaggcctatgacccccacccagataggcccaagcctgctgcctatcatg
tggcagctctaccctgatggacgataccgatcctcagactccagtttctggcgcttagtg
taccacatcaagattgacggcgtggaggacatgcttctcgagctgctgccagatgactga

>gi568815584f:95586468_95791318|GENSCAN_predicted_peptide_8|340_aa
MPDLSVWPWASHIILLSLSFVICNVLSMGTTLSVFCLGKIPLGTVKLMGCVAGVRLHSGT
PVRAFLFTVWSMEQQYWPSLLGKQNLRTHPEQLLIESPHFNKIPDPNVAFYLIAVPEVEE
EKLPLPGVGSPTHRGLSSAAKFSCILSHVVTQDLCIPEASPQPSGYCFLTFYPAELRPLD
QWDWCLYKEYSLYYCEGSNPFSPLPPCEDAVRAAILEAKREPSSDTESAGALTLDFPASG
TCLTTVPKSMTEKGKGPGDLRANARIWCRVISSGSSRGRYGYLRAVDTVWTPYATLEGNW
TKELKGKPSSETLGALSKAPQLVEESVPHEADSTVHTFNQ

>gi568815584f:95586468_95791318|GENSCAN_predicted_CDS_8|1023_bp
atgcctgacctgtctgtatggccttgggcgagtcacatcatccttctgagcctcagcttt
gtcatctgcaatgtcttaagcatgggcacaacattgtcagttttctgccttggaaagatc
cctctggggacagtgaagttaatgggttgtgtggcaggtgtaagattacattcagggaca
ccggttagagcctttctgttcactgtgtggtccatggaacagcagtattggccctccttg
ttaggcaagcagaatctcaggacacatcctgagcaactactgattgagagtccgcatttt
aacaagatcccagaccctaatgtggccttctatctgattgctgttccagaagtggaggag
gagaagctgcccctccctggtgtaggatcacctactcaccgcgggctcagctcagcagcc
aagttcagctgcatcctgagccatgttgtcacccaggacctttgcatccctgaagccagt
ccacaaccttctggttactgtttcctaactttctatcctgctgaactgcgtccattggat
cagtgggactggtgcctgtataaagagtactctctgtactactgtgagggaagcaaccct
ttttcacccttaccgccatgtgaagatgcagtaagagccgccatcttggaagcaaagaga
gagccttcatcagacactgaatctgctggtgccttgaccttggacttcccagcctctgga
acttgcttaaccacagtgcccaaaagcatgacagaaaagggaaagggccctggggatcta
agagccaatgcaaggatctggtgcagagtaatttcttctggctcctccagaggtagatat
gggtatctgagggcagtggacacagtctggaccccttatgccacactggaaggcaactgg
accaaggagctgaaggggaaaccaagcagtgagacgttaggtgctttgtccaaggcccct
cagctagtggaggagtcagtcccacatgaagctgactccacagtccacacctttaaccaa
taa