GENSCAN 1.0 Date run: 5-Nov-116 Time: 01:13:20 Sequence gi568815596r:20102769_20324867 : 222099 bp : 45.94% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3410 3536 127 0 1 40 111 97 0.369 6.94 1.02 Term + 3836 4295 460 2 1 31 42 145 0.867 -1.24 1.03 PlyA + 4352 4357 6 1.05 2.00 Prom + 6562 6601 40 -3.46 2.01 Init + 10993 11127 135 0 0 73 93 119 0.738 11.36 2.02 Intr + 11180 11288 109 1 1 64 70 52 0.984 0.86 2.03 Intr + 15059 15379 321 0 0 -11 55 385 0.426 21.13 2.04 Intr + 21990 22077 88 1 1 85 59 70 0.374 2.83 2.05 Term + 28478 28601 124 2 1 107 44 141 0.987 9.46 2.06 PlyA + 29332 29337 6 1.05 3.00 Prom + 29611 29650 40 -9.65 3.01 Init + 30362 30540 179 1 2 109 105 52 0.413 7.73 3.02 Intr + 38602 38701 100 1 1 31 86 67 0.092 0.91 3.03 Term + 38871 39041 171 2 0 88 53 74 0.539 1.73 3.04 PlyA + 39589 39594 6 1.05 4.03 PlyA - 40163 40158 6 1.05 4.02 Term - 53071 52865 207 1 0 21 43 288 0.603 14.94 4.01 Init - 53289 53098 192 0 0 85 50 180 0.875 12.92 4.00 Prom - 58901 58862 40 -4.36 5.07 PlyA - 58966 58961 6 1.05 5.06 Term - 66165 65788 378 0 0 63 32 209 0.495 7.59 5.05 Intr - 71878 71777 102 1 0 73 56 66 0.032 2.27 5.04 Intr - 75266 75119 148 1 1 82 -6 92 0.010 -0.56 5.03 Intr - 81829 81643 187 2 1 37 115 79 0.033 4.25 5.02 Intr - 84515 84373 143 1 2 76 99 25 0.057 2.40 5.01 Init - 93795 93719 77 0 2 36 57 92 0.043 1.48 5.00 Prom - 96914 96875 40 -7.06 6.08 PlyA - 98056 98051 6 1.05 6.07 Term - 100167 99998 170 1 2 64 48 297 0.999 21.34 6.06 Intr - 100454 100319 136 2 1 119 105 169 0.999 21.84 6.05 Intr - 101523 101045 479 1 2 120 109 238 0.998 21.87 6.04 Intr - 102656 102575 82 2 1 80 94 89 0.827 7.91 6.03 Intr - 107867 107747 121 1 1 37 72 67 0.254 0.50 6.02 Intr - 120940 120762 179 1 2 42 64 151 0.965 6.82 6.01 Init - 122099 122034 66 2 0 93 117 237 0.999 26.17 6.00 Prom - 134915 134876 40 -2.76 7.18 PlyA - 135023 135018 6 1.05 7.17 Term - 137760 137533 228 0 0 2 44 246 0.671 8.23 7.16 Intr - 151246 151054 193 0 1 55 85 172 0.928 13.09 7.15 Intr - 152216 152095 122 2 2 39 72 66 0.947 -0.61 7.14 Intr - 152573 152448 126 2 0 92 75 26 0.841 2.58 7.13 Intr - 153402 153265 138 0 0 112 55 94 0.998 9.16 7.12 Intr - 155603 155475 129 2 0 59 91 45 0.427 2.79 7.11 Intr - 158874 158788 87 0 0 91 -1 122 0.486 3.77 7.10 Intr - 159868 159862 7 0 1 92 92 0 0.052 -5.46 7.09 Intr - 160692 160425 268 1 1 57 82 106 0.057 3.49 7.08 Intr - 180718 180575 144 2 0 37 119 43 0.207 2.55 7.07 Intr - 188022 187884 139 0 1 62 74 74 0.333 3.44 7.06 Intr - 191750 191608 143 0 2 -5 96 75 0.428 -0.93 7.05 Intr - 194910 194785 126 1 0 60 92 62 0.415 4.45 7.04 Intr - 205303 205210 94 1 1 68 95 51 0.868 3.34 7.03 Intr - 205816 205546 271 0 1 69 98 184 0.719 15.04 7.02 Intr - 208895 208726 170 2 2 50 60 170 0.941 9.14 7.01 Intr - 215877 215769 109 2 1 68 99 31 0.290 2.49 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 125636 125430 207 2 0 82 38 146 0.970 6.34 S.002 Term - 148948 148817 132 1 0 61 32 124 0.823 2.19 S.003 Init + 164945 164972 28 1 1 76 86 14 0.820 -0.24 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:20102769_20324867|GENSCAN_predicted_peptide_1|195_aa XVKLQTFTVSVRAHKGSKDPKSKQQQQELSQRAKHPQHGKTPTSPAGFISHWDSLLDFAA PNPGTPAAHRELVPDNRGKEGKREREGDLLLWPTIPRRGNGGPRTGFSLPSSPAGAHLEP AQARKHRVQPRLPPAPLPSSRPRLSLHTSPPAEGAVSGLGQPQRGAPTAQRRAEGLLQCG QSGRRSRGSAKSEGC >gi568815596r:20102769_20324867|GENSCAN_predicted_CDS_1|588_bp ngagtgaagctacagaccttcacagtgagtgttagagctcataaaggcagcaaggaccca aaaagtaagcagcagcagcaagagctctcccaaagagcaaagcacccacagcatggaaaa acaccaacaagcccagccggcttcatctctcactgggactcgctgctggactttgcagca ccaaaccccggcactccggcagcccacagggagctagtcccagacaatcgaggaaaagaa gggaagcgagaaagagagggagacctgctattgtggccaacgatcccgcgaagaggcaac ggcggtccacgaacgggattcagcctcccatcaagcccagccggcgctcacctggaaccc gcgcaggcccgcaagcaccgtgttcagccccggctcccgcccgcgcctctcccttcctcc cgcccgcgcctctccctccacacctccccgccagcagagggagccgtctccggcctcggc cagccccaaagaggggcccccacagcgcagcggcgggctgaagggctcctccagtgcggc cagagcggacgccgaagccgaggaagcgccaagagtgagggctgctag >gi568815596r:20102769_20324867|GENSCAN_predicted_peptide_2|258_aa MASVWQGCARGTDSPRAQDSGNQSGDENYQSGNARKMQTVLASPQSFPEQEIKLERIPGL APGLIEPRTVYHFPLHQAEKKSTEEEPGGGGQEQLEEQEEPGGGGQEQLEEQEEPRGGGQ EQLEEQEDPGGGGQEQLEEQEEPRGGGQEQLEEQEEPGGGGQEQLEEQGEPGGGGQEQLE EQEEPGGGGAVFEPSIQEGQMALELLDEVSSGECQGGRPHTELRGLCEEDRNEGDEGVEF EASKPHGGKLAFEKRGLA >gi568815596r:20102769_20324867|GENSCAN_predicted_CDS_2|777_bp atggccagtgtctggcagggttgcgccaggggcacagacagccccagggctcaggactct gggaaccagagtggggacgagaactaccaatcagggaatgcccgcaagatgcagacagtt ttggcatctcctcagagttttccagagcaggaaatcaagctggagagaatacctggcttg gccccaggcctcattgaacccagaactgtctaccactttccattgcaccaagcagagaag aagagcactgaggaggagcccggaggtggaggtcaggagcagctggaagagcaggaggag cccggaggtggaggtcaggagcagctggaagagcaggaggagcccagaggtggaggtcag gagcagctggaagagcaggaggatcccggaggtggaggtcaggagcagctggaagagcag gaggagcccagaggtggaggtcaggagcagctggaggagcaggaggagcccggaggtgga ggtcaggagcagctggaggagcagggggagcccggaggtggaggtcaggagcagctggag gagcaggaggagcccggaggtggaggggctgtctttgagccatctattcaggaaggacaa atggccttggagctcctggatgaagtctcctctggagagtgtcagggcgggaggcctcac acggagctgcggggcctctgtgaggaagacagaaatgaaggcgatgagggtgttgagttt gaggcatccaagccacacggagggaagctggcatttgagaagagagggctggcgtga >gi568815596r:20102769_20324867|GENSCAN_predicted_peptide_3|149_aa MERRQGAVRRRVVQEDSTAQDREGSWSPRGWHNASKRYIEGCRSVTATGQSFQPEDRRLR MATPNKGLWLSRAQNLNTSRPEPRDDALAPAPWAQLTTADITFSSTPFPANPDEPSLSSD PLSSSESRMLEPKAISGNIWSSLTYSWGN >gi568815596r:20102769_20324867|GENSCAN_predicted_CDS_3|450_bp atggagaggaggcaaggagcagtccgtaggagggtagtgcaggaggacagcactgcacaa gacagggagggcagctggtccccgagggggtggcacaatgccagcaagaggtacatcgag ggctgtcgcagcgtcactgccaccggacagtctttccaacctgaggacaggaggctccgg atggccactcccaacaagggcctatggctctccagagcccagaacctgaacacatccagg ccagagccacgggatgatgcactggcccctgccccatgggcccagctcaccacggctgac atcaccttcagctccacccctttcccagcaaaccctgatgagccctctctttcctctgac ccattgtcctcctcggaatcaagaatgctagagccaaaagctatctcagggaacatctgg agcagcctcacttacagctggggaaactga >gi568815596r:20102769_20324867|GENSCAN_predicted_peptide_4|132_aa MLSKGPLQYAQVFGCKKRATAVAPGKHSNGLIKVNGWHLEMIELHMLQYKLLEPLLLWAK SDLLGGSHVAQICAVHQSISKALVAYYQKYMDEASKKDIKDTLIQNDQTLLVAGLLRCES KKFGSSGACARY >gi568815596r:20102769_20324867|GENSCAN_predicted_CDS_4|399_bp atgctgtccaagggcccactgcagtatgcgcaggtctttggatgcaagaaaagggccaca gctgtggcgcccggcaaacacagcaatggcctcatcaaggtgaatgggtggcacctggag atgatcgagctgcatatgctacaatacaagctgctggaaccacttctgctctgggcaaag agcgatttgctgggtggtagtcatgtggcccagatttgtgctgtccatcagtccatctcc aaagccctggtggcctattaccagaaatatatggatgaggcttccaagaaggatatcaaa gacaccctcatccagaatgaccagaccttgctggtagctggcctccttcgctgtgagtcc aaaaagtttggaagctctggtgcctgtgctcgctactag >gi568815596r:20102769_20324867|GENSCAN_predicted_peptide_5|344_aa MCQAQCQALDIASDPGPVLTFKAVARASPLEKECRAAGQQAQGPAWYLGCFPSLLGPQIS SSGNQVQKGAKCTEGSSELLPQNLLGGLWLTTSKEEASGSGCSQQEGLLMSPTLLKAPGK LDLKKGLGNPVLSPSSLSSSKQALRISSEKLTSPELLVAENEASCVALPATLTFQGQSSC PYVRSLVAASPKESGDCHHSLMLRDVAAWGMALCKLEALGQIPAHQPGEAPIPCPITADW LRPLPGWSYSADCTFYPLMTSPTRKFAQVQNSEPHEEAAEEKDDSSGALRPSGSNPSCPV PTLQPEFPIRHRALQQEDLPARASGPGRSICLPLAWMSARADAI >gi568815596r:20102769_20324867|GENSCAN_predicted_CDS_5|1035_bp atgtgccaggcacagtgccaggccctggacattgccagtgacccaggccccgtccttacc ttcaaggctgttgcgagggcaagccctctggagaaggagtgcagagctgcaggccagcag gcccagggtcccgcctggtatcttgggtgctttccttccctcctggggcctcagatctcc tcatctggaaatcaagttcagaaaggtgcaaaatgtactgagggctcttctgagttgctg ccgcagaacctgcttggtggcctgtggctcacgacgtcaaaggaggaggcatcgggaagt ggatgttcccagcaagagggtcttctgatgtcccccaccctcctcaaagcaccagggaaa ttggatcttaaaaagggccttgggaacccagtgctgagcccttcaagcctctcctcatcc aaacaggctctcagaatatcctctgagaaactcacttctcctgagctgttggtggctgaa aatgaggcctcctgtgtggccctcccagccactctcaccttccagggacagtcctcctgt ccctacgtgcgcagcctggtagctgcatcacccaaggaatctggtgactgccaccactct ctgatgctcagagatgttgctgcatggggcatggccctgtgcaagctggaggcactgggc cagatccctgcccaccaaccgggcgaggctccgatcccctgccccatcacagctgactgg ctgaggcccctccctggctggtcctacagcgctgactgcacgttctacccactcatgact tctcccacaaggaagtttgcccaggtgcagaactcagagccacatgaagaggctgcagaa gaaaaggacgactcgtctggggctcttagaccctcaggaagcaacccaagctgccctgtg cccacactccagcctgaatttcctatcagacacagagctttacagcaggaagacctgccc gccagagcctcggggcccggccgctccatctgcctgcccttggcctggatgtctgcccga gcagatgcaatttaa >gi568815596r:20102769_20324867|GENSCAN_predicted_peptide_6|410_aa MRRAALWLWLCALALSLQPALPPRNAGLDGGWDLGRSQVSARAARCPSLPPEFASPAPAP GAGVPGSLLRAGSAIPRDAEGRSRGFRGHTGQVQLTDQLPFPAPPRMTANQRPPALPKLV PGQIVATNLPPEDQDGSGDDSDNFSGSGAGALQDITLSQQTPSTWKDTQLLTAIPTSPEP TGLEATAASTSTLPAGEGPKEGEAVVLPEVEPGLTAREQEATPRPRETTQLPTTHLASTT TATTAQEPATSHPHRDMQPGHHETSTPAGPSQADLHTPHTEDGGPSATERAAEDGASSQL PAAEGSGEQDFTFETSGENTAVVAVEPDRRNQSPVDQGATGASQGLLDRKEVLGGVIAGG LVGLIFAVCLVGFMLYRMKKKDEGSYSLEEPKQANGGAYQKPTKQEEFYA >gi568815596r:20102769_20324867|GENSCAN_predicted_CDS_6|1233_bp atgaggcgcgcggcgctctggctctggctgtgcgcgctggcgctgagcctgcagccggcc ctgccgccccgtaacgccggccttgacgggggctgggacctgggccgcagccaggtgtca gcgcgggccgcccggtgcccgagtctgccgcccgagttcgcctccccagcccccgcgccc ggtgctggcgtcccggggtccctcctgcgggccggctcggccatccccagggatgcagag gggagaagccggggatttcgtggccacacggggcaggtacagctgacagatcagcttccc tttcctgctccaccacgaatgactgcgaatcagaggccccctgcgctgcccaagttagtg cctgggcaaattgtggctactaatttgccccctgaagatcaagatggctctggggatgac tctgacaacttctccggctcaggtgcaggtgctttgcaagatatcaccttgtcacagcag accccctccacttggaaggacacgcagctcctgacggctattcccacgtctccagaaccc accggcctggaggctacagctgcctccacctccaccctgccggctggagaggggcccaag gagggagaggctgtagtcctgccagaagtggagcctggcctcaccgcccgggagcaggag gccaccccccgacccagggagaccacacagctcccgaccactcatctggcctcaacgacc acagccaccacggcccaggagcccgccacctcccacccccacagggacatgcagcctggc caccatgagacctcaacccctgcaggacccagccaagctgaccttcacactccccacaca gaggatggaggtccttctgccaccgagagggctgctgaggatggagcctccagtcagctc ccagcagcagagggctctggggagcaggacttcacctttgaaacctcgggggagaatacg gctgtagtggccgtggagcctgaccgccggaaccagtccccagtggatcagggggccacg ggggcctcacagggcctcctggacaggaaagaggtgctgggaggggtcattgccggaggc ctcgtggggctcatctttgctgtgtgcctggtgggtttcatgctgtaccgcatgaagaag aaggacgaaggcagctactccttggaggagccgaaacaagccaacggcggggcctaccag aagcccaccaaacaggaggaattctatgcctga >gi568815596r:20102769_20324867|GENSCAN_predicted_peptide_7|831_aa XFCLPKSFGNLMIQQKMDKKAYFLGMMNGERLHGELLGTRDAETDGPEKGDQKGKASPFE EDQNRDLKQGDDDDSKINGRGLPNGMDADCKDFNRTPGSRQASPTEVVERLGPNTNPSEG LGPLPNPTANKPLVEEFSNPETQNLDAMEQVGLESLQFDYPGNQVPMDSSGATVGLFDYN SQQQLFQRTNALTVQQLTAAQQQQYALAAAQQPHIAGVFSAGLAPAAFVPNPYIISAAPP GTDPYTAAGLAAAATLAGPAVVPPQYYGVPWGVYPANLFQQQAAAAANNTASQQAASQAQ PGQQQVLRAGAGQRPLTPNQGQQGQQAESLAAAAAANPTLAFGQGLATGMPGYQVLAPTA YYDQTGALVVGPGARTGLGAPVRLMAPTPVLISSAAAQAGGLTNGSGRYISAAPGAEAKY RSASSTSSLFSSSSQLFPPSRLRYNRSDIMPSGRSRLLEDFRNNRFPNLQLRDLIGHIVE FSQDQHGSSFKEIFSSNTYGAVLSYDNSAFSGIPSEIYLRFGSLDQKLALATRIRGHVLP LALQMYGCRVIQKALESISSDQQSEMVKELDGHVLKCVKDQNGNHVVQKCIECVQPQSLQ FIIDAFKGQVFVLSTHPYGCRVIQRILEHCTAEQTLPILEELHQHTEQLVQDQYGNYVIQ HVLEHGRPEDKSKIVSEIRGKVLALSQHKFASNVVEKCVTHASRAERALLIDEVCCQNDG PHSALYTMMKDQYANYVVQKMIDMAEPAQRKIIMHKQPPPPNNGFKKKDKGGIHLTATCP QSELDAETVKSILAEYKIHNADVTLRSDATADDLIDVVEGNRVYIPCIYVK >gi568815596r:20102769_20324867|GENSCAN_predicted_CDS_7|2496_bp nncttttgcctaccaaaaagttttgggaacctgatgattcaacaaaagatggacaaaaag gcatatttcttggggatgatgaatggagagagactgcatggggagcttctcggcactaga gatgctgaaacagatggacctgagaaaggagatcaaaaaggcaaggcttctccatttgag gaggaccaaaacagagatcttaaacaaggagatgatgatgattctaaaataaatggcaga ggtttgccaaatggaatggatgccgattgcaaagattttaatcgtactcctggaagtcgt caagcctctccaactgaagtagttgagcgcttgggccccaatactaatccctcagaagga ctggggcctcttcctaatcctacagctaataaaccacttgttgaagaattttcaaatcct gaaactcagaatctggatgccatggaacaagttggtctggaatccttacagtttgactat cctggtaatcaggtaccaatggactcttcaggagctactgtaggcctttttgactacaat tcccagcagcagctctttcagaggactaatgcactaacagttcaacagttaactgcagct caacagcagcaatatgcattagcagcagctcagcagccacatatagctggtgtattctca gcaggccttgctccagctgcatttgtgccaaatccatacattattagtgctgctcctcca gggaccgatccgtatactgcagcaggattggctgcagcagctacattagcaggtccagca gtggttccacctcagtattacggcgttccatggggggtgtatccagccaacttatttcag cagcaagctgcagctgcggcaaataacacagccagtcagcaagcagcatcacaagctcag cctggacagcaacaggttctccgtgctggagcaggtcagcgtcctcttactcccaatcag ggtcagcaagggcagcaagcagaatcacttgcggcagctgcagcagcaaatccaacattg gcttttggtcagggtcttgctactggcatgccaggctatcaagtactagctccaactgcc tattatgatcagactggtgccttagtggttggccctggagcaaggactggccttggagct ccagttcggttaatggctccaacacctgttttaattagttcagcagcagcacaagctgga ggactgacaaatggtagtggtcgatatatctctgcagcacctggagcagaagcaaaatat cgaagtgcttcaagcacttccagtctatttagctccagcagccagctctttcctccttcc cggcttcggtataataggtctgatattatgccttctggccgcagtagattattggaagat ttcagaaacaaccgcttcccaaaccttcagcttagagacttgattggacatatagttgag ttttctcaagaccagcatggttctagctttaaggagatctttagcagcaacacctatgga gctgtcctctcctatgataacagtgctttttctggaataccttctgaaatctacctgagg tttgggagtctggatcaaaaattagccctggctactcgtattcgtggtcatgttctaccc ttagccttgcagatgtatggctgccgcgttattcagaaagcattagaatctatttcttct gaccagcagagtgaaatggtaaaggagctggatggtcatgtgctcaaatgtgtgaaagat cagaatggaaaccatgttgtacaaaaatgtatcgaatgtgttcagccacagtcactacag ttcatcattgatgctttcaagggacaagtatttgtgctttcaactcatccttatggctgc agagtaattcagcgcatcctagagcattgcactgcagaacagaccttacctatcttagaa gaactccaccaacatacagagcagttggtacaggatcagtatggcaattatgttattcag catgtactggaacacggtcgacctgaagacaagagcaaaattgtttccgaaatcagggga aaggttttagccctgagtcaacacaaatttgccagcaatgtagtagaaaagtgtgttact catgcctcccgtgctgagagagctttactgattgacgaggtttgctgccagaatgatggt cctcacagtgccttatacaccatgatgaaggaccagtatgccaattacgtggttcaaaag atgattgatatggctgaacctgctcagagaaagataatcatgcacaagcaaccaccgccc cccaacaatggctttaagaagaaggacaagggaggcattcatctcacagccacttgccca cagagtgagctggatgctgaaactgtgaagagcattctggctgaatacaagattcataat gccgatgtgactctacgtagtgatgctacagctgatgacctcattgatgtggtggaagga aacagagtttatatcccctgtatctatgttaaataa