GENSCAN 1.0 Date run: 6-Nov-116 Time: 20:14:18 Sequence gi568815584r:95611758_95814066 : 202309 bp : 45.25% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 201 196 6 1.05 1.03 Term - 9337 9152 186 1 0 105 53 182 0.961 13.89 1.02 Intr - 19667 19638 30 2 0 116 89 -2 0.593 1.03 1.01 Init - 20692 20540 153 1 0 60 94 129 0.386 8.72 1.00 Prom - 25343 25304 40 -4.96 2.00 Prom + 28684 28723 40 1.04 2.01 Init + 40395 40469 75 2 0 49 101 45 0.105 2.99 2.02 Term + 50224 50379 156 0 0 25 37 140 0.150 0.53 2.03 PlyA + 50499 50504 6 1.05 3.03 PlyA - 50732 50727 6 1.05 3.02 Term - 58800 58698 103 2 1 112 35 86 0.967 3.45 3.01 Init - 59078 58993 86 2 2 97 92 66 0.991 8.19 3.00 Prom - 62032 61993 40 -7.06 4.07 PlyA - 62785 62780 6 1.05 4.06 Term - 68180 68079 102 2 0 40 44 84 0.250 -2.52 4.05 Intr - 68860 68716 145 0 1 20 93 106 0.381 4.58 4.04 Intr - 72559 72306 254 1 2 54 44 189 0.387 7.33 4.03 Intr - 73982 73786 197 0 2 35 69 85 0.678 0.43 4.02 Intr - 75096 74987 110 2 2 58 81 63 0.365 2.53 4.01 Init - 80073 80012 62 0 2 49 52 130 0.679 4.22 4.00 Prom - 88951 88912 40 -2.86 5.05 PlyA - 93914 93909 6 1.05 5.04 Term - 100045 99998 48 1 0 117 41 120 0.993 7.50 5.03 Intr - 100639 100463 177 1 0 118 113 138 0.997 19.42 5.02 Intr - 102325 102190 136 0 1 61 77 251 0.902 21.87 5.01 Init - 107295 107237 59 0 2 54 98 33 0.578 1.68 5.00 Prom - 115869 115830 40 -4.26 6.10 PlyA - 115970 115965 6 1.05 6.09 Term - 129476 129367 110 0 2 95 46 41 0.258 -0.73 6.08 Intr - 130798 130680 119 0 2 111 43 71 0.273 5.01 6.07 Intr - 131077 131034 44 1 2 67 77 37 0.259 -2.46 6.06 Intr - 131469 131443 27 0 0 62 115 20 0.065 0.41 6.05 Intr - 152306 152127 180 2 0 46 37 103 0.082 1.06 6.04 Intr - 157405 157273 133 0 1 90 87 80 0.840 8.75 6.03 Intr - 159939 159843 97 1 1 65 82 112 0.893 7.37 6.02 Intr - 174334 174094 241 1 1 117 64 28 0.004 0.42 6.01 Init - 175401 175330 72 0 0 78 73 59 0.008 2.60 6.00 Prom - 185125 185086 40 -2.56 7.00 Prom + 189299 189338 40 -4.06 7.01 Init + 191699 191824 126 1 0 33 110 86 0.589 3.53 7.02 Term + 195599 195727 129 1 0 101 47 75 0.837 2.88 7.03 PlyA + 196528 196533 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 175276 175392 117 0 0 48 71 118 0.931 6.30 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:95611758_95814066|GENSCAN_predicted_peptide_1|122_aa MAASASAQPCSLPCTAAGVAHNSPNMLPARNLPFRVHWLGIESVMAADPGQRPCGYTRPT QNPAPGPISTKLHSFKHFQKLPEYDAADGEQVPSGVMRQNPGYVFLAGSIRLPRSHFRRL TS >gi568815584r:95611758_95814066|GENSCAN_predicted_CDS_1|369_bp atggcagcctctgcctctgcccagccctgctccctgccctgcaccgctgcaggtgttgct cacaatagccccaatatgcttcctgcacgcaacctccctttcagagtccactggctggga atcgaatctgtgatggccgccgacccagggcagagaccctgtggttacaccaggcccacc cagaacccagccccaggtccaatttccacaaaactccactctttcaagcatttccagaag ttaccggagtatgatgctgctgatggggagcaggtgccgtcgggggtcatgcgacagaac cctggctatgtgtttctcgctggctccatccgcttgccgcgctcccacttccgccgactc acttcctga >gi568815584r:95611758_95814066|GENSCAN_predicted_peptide_2|76_aa MKINSEEERGLPWNASVIYDILHNNILCIRQKQVEVTEGECGRSWWVNDRQRCPVKSTTC DKGGFSLLLNKHAILQ >gi568815584r:95611758_95814066|GENSCAN_predicted_CDS_2|231_bp atgaagatcaacagtgaagaggaaaggggtcttccctggaacgcttcagtcatatacgac attttacacaacaatatactttgcatccggcaaaaacaggtggaggtgactgaaggcgaa tgtgggaggtcatggtgggttaatgatcgccagcgctgtccggtgaaatcaaccacctgt gacaaaggtggtttttcacttttactgaacaagcatgctattctacagtaa >gi568815584r:95611758_95814066|GENSCAN_predicted_peptide_3|62_aa MDWNGGERGEKALEEGSISELIGLDSGLSFCCNSKKEKEGKKDVTRICNGSCSWPSVNVS GM >gi568815584r:95611758_95814066|GENSCAN_predicted_CDS_3|189_bp atggactggaatggtggagagagaggtgagaaggcgctggaggaaggatctatttcagag ctgataggactcgacagtgggttgagcttctgttgtaactccaagaaggagaaggaagga aaaaaggacgtgactcgaatctgcaatggcagctgctcttggcccagtgtgaatgtgtct gggatgtaa >gi568815584r:95611758_95814066|GENSCAN_predicted_peptide_4|289_aa MAQALPGLLQAAAHLRGALYGTTLSHVDKLVPVVPLRGVFLGPGLGMDIAALLDLHRKNK PVHYQHLSSGSDKAHRNKATGGHVATWPAHPWPWGPECPLGLVVLLARLCQEASEAQEHS KKQVSHPSTLGPGMKKATELQVTTVWFLVLQLVFVHVMSEISEIRLCLLAKLSISGTCDL KDSLSGVGVTNVFENHFDLSRIAPRIAWKKQLQMPDSRGQFTCSPAFSAPAKLFSGTPPP GEPLSSQETTGSLGNTIFHSVAVVLPERGPNPDLKRRFLDLMQEFGASP >gi568815584r:95611758_95814066|GENSCAN_predicted_CDS_4|870_bp atggcccaggccctgcctggcctgctccaagctgctgcccatctgagaggagccctctat gggaccacattatctcatgtggataagcttgtgcctgtggtccctttacggggggtgttc ctggggccagggctgggcatggacattgctgcgctcctagacctgcacagaaaaaataag ccagtgcactatcagcatctgagctcaggatcagataaagcccacagaaacaaggccacc ggaggccacgtggccacctggccagcccatccttggccatggggcccagagtgcccactt ggcctggtcgtcttacttgctcgactttgtcaagaggcctcagaagctcaagaacatagc aagaagcaggtctcccatccctcaacccttggacctggtatgaaaaaagccacggagctg caggtgacaacagtttggttcctggtccttcagctggtctttgtccatgtcatgtctgag ataagtgagataaggctatgtcttctggccaagttgtccatctctggtacctgtgacctg aaggattcactctccggggtaggtgtcaccaatgtctttgagaaccactttgatctctcc agaattgctcccaggattgcctggaaaaagcagctgcagatgcctgacagcaggggtcag ttcacctgctctccagcgttttcagctcctgccaagctgttctcaggaactccaccccct ggtgagcctttgtcttctcaggagaccacagggagcctgggcaacacgattttccatagc gtggctgtagtgttaccagaaaggggtcccaatccagacctcaagagaaggttcttggac ctcatgcaagaatttggggcaagtccatag >gi568815584r:95611758_95814066|GENSCAN_predicted_peptide_5|139_aa MLTLTASNQHRTGNPNQSNRRPEDAMAECPTLGEAVTDHPDRLWAWEKFVYLDEKQHAWL PLTIEIKDRLQLRVLLRREDVVLGRPMTPTQIGPSLLPIMWQLYPDGRYRSSDSSFWRLV YHIKIDGVEDMLLELLPDD >gi568815584r:95611758_95814066|GENSCAN_predicted_CDS_5|420_bp atgctcactctcactgcttctaatcagcacagaactggaaatcctaatcagagcaacagg cggcccgaggacgccatggccgagtgcccgacactcggggaggcagtcaccgaccacccg gaccgcctgtgggcctgggagaagttcgtgtatttggacgagaagcagcacgcctggctg cccttaaccatcgagataaaggataggttacagttacgggtgctcttgcgtcgggaagac gtcgtcctggggaggcctatgacccccacccagataggcccaagcctgctgcctatcatg tggcagctctaccctgatggacgataccgatcctcagactccagtttctggcgcttagtg taccacatcaagattgacggcgtggaggacatgcttctcgagctgctgccagatgactga >gi568815584r:95611758_95814066|GENSCAN_predicted_peptide_6|340_aa MPDLSVWPWASHIILLSLSFVICNVLSMGTTLSVFCLGKIPLGTVKLMGCVAGVRLHSGT PVRAFLFTVWSMEQQYWPSLLGKQNLRTHPEQLLIESPHFNKIPDPNVAFYLIAVPEVEE EKLPLPGVGSPTHRGLSSAAKFSCILSHVVTQDLCIPEASPQPSGYCFLTFYPAELRPLD QWDWCLYKEYSLYYCEGSNPFSPLPPCEDAVRAAILEAKREPSSDTESAGALTLDFPASG TCLTTVPKSMTEKGKGPGDLRANARIWCRVISSGSSRGRYGYLRAVDTVWTPYATLEGNW TKELKGKPSSETLGALSKAPQLVEESVPHEADSTVHTFNQ >gi568815584r:95611758_95814066|GENSCAN_predicted_CDS_6|1023_bp atgcctgacctgtctgtatggccttgggcgagtcacatcatccttctgagcctcagcttt gtcatctgcaatgtcttaagcatgggcacaacattgtcagttttctgccttggaaagatc cctctggggacagtgaagttaatgggttgtgtggcaggtgtaagattacattcagggaca ccggttagagcctttctgttcactgtgtggtccatggaacagcagtattggccctccttg ttaggcaagcagaatctcaggacacatcctgagcaactactgattgagagtccgcatttt aacaagatcccagaccctaatgtggccttctatctgattgctgttccagaagtggaggag gagaagctgcccctccctggtgtaggatcacctactcaccgcgggctcagctcagcagcc aagttcagctgcatcctgagccatgttgtcacccaggacctttgcatccctgaagccagt ccacaaccttctggttactgtttcctaactttctatcctgctgaactgcgtccattggat cagtgggactggtgcctgtataaagagtactctctgtactactgtgagggaagcaaccct ttttcacccttaccgccatgtgaagatgcagtaagagccgccatcttggaagcaaagaga gagccttcatcagacactgaatctgctggtgccttgaccttggacttcccagcctctgga acttgcttaaccacagtgcccaaaagcatgacagaaaagggaaagggccctggggatcta agagccaatgcaaggatctggtgcagagtaatttcttctggctcctccagaggtagatat gggtatctgagggcagtggacacagtctggaccccttatgccacactggaaggcaactgg accaaggagctgaaggggaaaccaagcagtgagacgttaggtgctttgtccaaggcccct cagctagtggaggagtcagtcccacatgaagctgactccacagtccacacctttaaccaa taa >gi568815584r:95611758_95814066|GENSCAN_predicted_peptide_7|84_aa MASPLSLPVTHLEMLSEAGARRTEKAIAELRSKHKRRQGKGRSTSDLTKIFLNDGVKGSV SLNLRIDDTREVTDDKRLYQSLKP >gi568815584r:95611758_95814066|GENSCAN_predicted_CDS_7|255_bp atggcttccccactttcactgcctgtcactcacctggaaatgctcagtgaggctggggca cgcagaacagaaaaggccattgcagaattacggagtaaacataaaagacgccagggaaag gggaggtcaactagtgacctaacaaagatttttctaaatgatggggtaaagggttctgtc tctttaaatcttcgaatagatgacaccagggaagtcactgatgacaagagactgtaccag tctctcaagccatga