GENSCAN 1.0 Date run: 5-Nov-116 Time: 11:04:50 Sequence gi568815589f:107183630_107429653 : 246024 bp : 42.48% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5151 5352 202 2 1 42 67 118 0.365 4.19 1.02 Term + 5566 5855 290 2 2 42 43 190 0.850 4.45 1.03 PlyA + 6152 6157 6 1.05 2.00 Prom + 6967 7006 40 -5.55 2.01 Init + 7695 7753 59 2 2 54 110 6 0.499 0.23 2.02 Intr + 13504 13638 135 1 0 55 33 134 0.200 3.36 2.03 Intr + 22093 22179 87 1 0 88 72 92 0.142 5.77 2.04 Intr + 27206 27382 177 2 0 15 35 169 0.039 2.41 2.05 Intr + 33197 33276 80 2 2 96 89 44 0.223 3.58 2.06 Intr + 35771 35964 194 0 2 48 119 84 0.085 5.79 2.07 Term + 44185 44208 24 0 0 103 44 37 0.153 -1.95 2.08 PlyA + 44936 44941 6 -0.45 3.00 Prom + 46093 46132 40 -0.95 3.01 Init + 69243 69357 115 2 1 74 92 113 0.768 10.72 3.02 Intr + 72473 72513 41 0 2 77 82 27 0.185 -1.98 3.03 Term + 84792 84857 66 2 0 116 39 97 0.290 4.56 3.04 PlyA + 85948 85953 6 1.05 4.00 Prom + 98033 98072 40 -4.45 4.01 Init + 100001 100066 66 1 0 107 80 100 0.238 12.22 4.02 Intr + 100218 100424 207 2 0 35 75 105 0.685 2.25 4.03 Intr + 116512 116593 82 0 1 87 80 97 0.866 7.09 4.04 Intr + 118406 118485 80 0 2 72 106 127 0.986 11.25 4.05 Intr + 122750 123018 269 1 2 53 91 284 0.936 20.61 4.06 Intr + 128053 128108 56 1 2 64 111 59 0.800 3.50 4.07 Intr + 135123 135250 128 1 2 104 81 154 0.998 15.78 4.08 Intr + 138354 138489 136 2 1 37 88 154 0.985 9.52 4.09 Intr + 140261 140388 128 0 2 47 62 60 0.771 -1.22 4.10 Intr + 141205 141375 171 0 0 72 96 226 0.985 21.02 4.11 Intr + 146923 146946 24 0 0 111 72 25 0.006 0.50 4.12 Intr + 153788 153880 93 1 0 62 93 78 0.111 4.94 4.13 Intr + 154834 155031 198 0 0 111 55 108 0.309 8.43 4.14 Term + 164504 164734 231 1 0 -14 49 200 0.022 1.39 4.15 PlyA + 164947 164952 6 1.05 5.00 Prom + 174094 174133 40 -4.65 5.01 Init + 175271 175591 321 1 0 86 18 232 0.913 13.59 5.02 Intr + 176348 176458 111 1 0 39 101 121 0.918 8.16 5.03 Intr + 183976 184098 123 0 0 100 47 48 0.097 1.76 5.04 Intr + 188692 188795 104 0 2 107 103 68 0.828 8.25 5.05 Intr + 195557 195601 45 2 0 89 63 84 0.355 2.61 5.06 Intr + 205439 205576 138 2 0 86 77 82 0.075 5.56 5.07 Intr + 210339 210421 83 0 2 57 48 97 0.085 0.96 5.08 Intr + 220658 220820 163 0 1 27 93 145 0.048 7.01 5.09 Intr + 226614 226758 145 0 1 48 99 76 0.073 4.06 5.10 Term + 242289 242444 156 2 0 50 48 186 0.029 7.75 5.11 PlyA + 243473 243478 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 145914 146027 114 2 0 101 48 113 0.976 6.19 S.002 Init + 157264 157421 158 0 2 71 86 139 0.819 9.43 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:107183630_107429653|GENSCAN_predicted_peptide_1|163_aa MYRLGETLTIDIHDLNCVCRCPIRGDGIKYLLKLLGPCKICQILKEMEKGLIKSSGDTRK ETRGTAGNIEGTIRAPIVKTCLNFQWDLICLCCTVVSQPHGEGQLDPHWNQLCLQQQTPT GNDPPGGRPNPPRRPARDRPDIHLVEEEDEKKLAMPSTDTEIP >gi568815589f:107183630_107429653|GENSCAN_predicted_CDS_1|492_bp atgtataggctaggagagactctcaccatcgacattcatgacctcaactgtgtttgccgg tgcccaattagaggtgatggtataaagtacctgctaaagttgttaggaccatgcaaaatt tgccagatactaaaagagatggagaagggactgataaaaagcagcggtgacacaaggaag gaaacaaggggaacagcaggtaacattgaaggaactatcagagcccccatagtaaagacc tgcttgaattttcagtgggatctaatatgcctttgctgtactgtggtatctcagccacat ggagaagggcagctggatcctcactggaaccagctatgtctccagcaacagacacccaca ggtaatgacccacctggaggaagacccaatcccccaagacgacctgccagagacaggcca gacatccatcttgtagaagaagaggatgagaaaaagctagccatgccctccactgacact gaaatcccttag >gi568815589f:107183630_107429653|GENSCAN_predicted_peptide_2|251_aa MPDLNVGSGDHFPVFYHFSRLYRKHDAGIYLAAGEASGNTIMAEGEVGAGTSYMAGTTAR ERGGSPMMSDIPLSAGETGEVIKEKAILILGFEGLFRKDIIKGSQVREKGYSARSTPQEA LLNFSYVIIRCSPTTWRHVHRLLLVITMTQRPRYVWRVKDAGEADNLGNEWLINMPNSTD SFSTINLVGLYPAWTCCKALSCSGSTPKGFPPNSKQLTKHHDSFSVVGGCEVQLLVTYLK KHVLRYVAIYS >gi568815589f:107183630_107429653|GENSCAN_predicted_CDS_2|756_bp atgccagatctgaatgtgggaagtggagaccatttccctgttttttaccatttctccagg ctgtacaggaagcatgatgctggcatctacttggctgctggggaagcctcaggaaacaca atcatggcggaaggtgaagtgggagcaggcacatcttacatggcaggaacaacagcaaga gagagaggggggagccctatgatgagtgacatcccactttcagctggagagactggggaa gtcatcaaggaaaaggcaattctaattttgggctttgaaggacttttcagaaaagacatc ataaaagggagccaggtcagagaaaaaggctattcagccaggtccactcctcaggaagcc ttgctgaatttctcctatgtcatcatccgttgttcacctactacctggaggcacgtgcac aggcttttgctagtcatcacaatgactcagcggcctagatatgtgtggagagtgaaggat gctggagaagcagataacttggggaatgaatggttgataaacatgcccaacagcacagac tcctttagcaccattaacctcgttggcttatatccagcctggacctgctgcaaagctctc tcttgctctggcagtactcccaagggcttccctccaaattctaagcagcttaccaagcac catgactctttctcagtggtaggaggatgcgaagtgcaactacttgtcacttatctcaaa aagcatgtcctgcgttatgtggccatctacagctga >gi568815589f:107183630_107429653|GENSCAN_predicted_peptide_3|73_aa MALDNSNEKTREKGRLSKKLVMAEPFKQGRRMDVQKDGEGDLAGSWDNSGGKPTQHEDDE DEDLYNDPLSLNE >gi568815589f:107183630_107429653|GENSCAN_predicted_CDS_3|222_bp atggctttggataattccaatgagaagacaagagagaaggggagactctcaaagaaacta gtgatggcagagccctttaagcaaggacgtagaatggatgttcaaaaagatggagagggg gatttggcagggtcatgggacaatagtggagggaagcccactcaacatgaagatgatgag gatgaagacctttataatgatccactttcacttaatgaatag >gi568815589f:107183630_107429653|GENSCAN_predicted_peptide_4|622_aa MQVTLKTLQQQTFKIDIDPEETRSRSSCLGRGLCGRRMGKQWPDAEGLVADGVERTPAGL GLGARASGAGGGWEGPGRLSRRAWLSGMGAQVKALKEKIESEKGKDAFPVAGQKLIYAGK ILNDDTALKEYKIDEKNFVVVMVTKPKAVSTPAPATTQQSAPASTTAVTSSTTTTVAQAP TPVPALAPTSTPASITPASATASSEPAPASAAKQEKPAEKPAETPVATSPTATDSTSGDS SRSNLFEDATSALVTGQSYENMVTEIMSMGYEREQVIAALRASFNNPDRAVEYLLMGIPG DRESQAVVDPPQAASTGAPQSSAVAAAAATTTATTTTTSSGGHPLEFLRNQPQFQQMRQI IQQNPSLLPALLQQIGRENPQLLQQISQHQEHFIQMLNEPVQEAGGQGGGGGGGSGGIAE AGSGHMNYIQVTPQEKEAIERQKRVYQQVLLLSICHKSEHPELGLSAFGVSLKVLNERKL ATGLAAASNLQVKFGREPSNPVKQNAHCCSFPSRGTVSSQCFLLISSDSLAALPEAQELK IGKGLKGDLEQALPGRAVLHLAEVAGPSYSTRQPPPLSQQVWLPQEAQEFRQDFCLQLGH KSFHEEGPEWWSPCLSPPLTAV >gi568815589f:107183630_107429653|GENSCAN_predicted_CDS_4|1869_bp atgcaggtcaccctgaagaccctccagcagcagaccttcaagatagacattgaccccgag gagacgcggagccgatcctcctgtcttggccgtgggctttgtggcaggagaatggggaag cagtggccggacgccgaaggcctggtggcagatggcgtggagcgcacgcccgcgggcctg ggcctaggagctcgtgctagcggggccggagggggatgggaaggtccaggccgtctcagc cgtagagcctggctttctggtatgggtgcacaggtgaaagcactgaaagagaagattgaa tctgaaaaggggaaagatgcctttccagtagcaggtcaaaaattaatttatgcaggcaaa atcctcaatgatgatactgctctcaaagaatataaaattgatgagaaaaactttgtggtg gttatggtgaccaaacccaaagcagtgtccacaccagcaccagctacaactcagcagtca gctcctgccagcactacagcagttacttcctccaccaccacaactgtggctcaggctcca acccctgtccctgccttggcccccacttccacacctgcatccatcactccagcatcagcg acagcatcttctgaacctgcacctgctagtgcagctaaacaagagaagcctgcagaaaag ccagcagagacaccagtggctactagcccaacagcaactgacagtacatcgggtgattct tctcggtcaaacctttttgaagatgcaacgagtgcacttgtgacgggtcagtcttacgag aatatggtaactgagatcatgtcaatgggctatgaacgagagcaagtaattgcagccctg agagccagtttcaacaaccctgacagagcagtggagtatcttttaatgggaatccctgga gatagagaaagtcaggctgtggttgacccccctcaagcagctagtactggggctcctcag tcttcagcagtggctgcagctgcagcaactacgacagcaacaactacaacaacaagttct ggaggacatccccttgaatttttacggaatcagcctcagtttcaacagatgagacaaatt attcagcagaatccttccttgcttccagcgttactacagcagataggtcgagagaatcct caattacttcagcaaattagccaacaccaggagcattttattcagatgttaaatgaacca gttcaagaagctggtggtcaaggaggaggaggtggaggtggcagtggaggaattgcagaa gctggaagtggtcatatgaactacattcaagtaacacctcaggaaaaagaagctatagaa aggcagaaacgtgtttatcagcaagtcctgcttctaagtatctgccataaaagtgaacac cctgaactgggattatctgcatttggtgtatcactgaaggtactgaacgagagaaaactg gccacaggattggcagctgcatccaacctccaggtcaaatttggcagagaaccaagcaat cctgtcaaacagaatgcccactgctgctccttcccatctcgaggaactgtctcttctcaa tgttttcttttgatttcctctgacagtcttgctgctcttccagaggcacaggaactcaaa attgggaagggtttgaagggagacctggagcaagctctgcctggcagagctgtcctacac ctggcagaagtggctgggccttcatactcgacccgccagcccccaccactcagtcagcag gtatggctcccccaggaagcacaagagttcaggcaggacttctgtctgcaactgggccac aagtccttccatgaagaggggcctgaatggtggtctccatgtctatcaccacctctgaca gcagtttga >gi568815589f:107183630_107429653|GENSCAN_predicted_peptide_5|462_aa MIAAVAVVVVVAVSGYSLVCSCGPGRLPAAEIWSGPRKGLPRAGFVWKLRLGVGSQWQPC MDNAVIAAFPAVGHGDSPENEVGSVNIEPNYANELKQETQLQRCRDTAGAEDAVQEHKQQ HSFEAQMPENDTKWGAFPTPAWTMVVTQARMQVFIPVGSLYSSRLHQDLCAPSWGSQPAP AWGARRLSRHQLHRENLDAQQGQHFQRCTLTRGPHTRLKWTDSDNGQKSISTNNCTVPAV SEKLRKRAADFTQSLLDHLPLPSPWQLRGNTCVSSTVKWRCEREGGKGQPDATEALVNDC ASTLSLNGVLPEGLKLSLCLDPAVNCQEIQRTEEHEELHRKYTISKIQAVDPPQVKQPGS VNWSLLSKTSETRLGFGLGAEEERTTHKITDLLMRHNLMCKGTNRTGQAQERNTVIKATT MDASPLPSPYFCAFQWHPQNSVTTLHYWRTLEGPRVGRLLEE >gi568815589f:107183630_107429653|GENSCAN_predicted_CDS_5|1389_bp atgatcgctgctgttgccgttgttgttgttgtggctgtgtctgggtatagcctggtttgc agttgtggcccaggccgacttccagctgcagagatctggagcggcccaaggaaggggctc cccagggctgggtttgtttggaagctgcgtctcggtgtgggctcccagtggcaaccctgc atggataatgctgtaattgcagccttcccagctgtcggccatggagacagcccagaaaat gaggttggctcagtaaatattgaaccaaattatgcaaatgagctcaaacaagagactcag ctgcagagatgcagagacacggcaggagctgaagatgctgtacaggaacacaagcagcag cactcctttgaggcccagatgcctgagaatgatacaaagtggggagccttccctacccct gcatggacaatggttgtcactcaggcaagaatgcaggtgttcattccggtgggctcgctt tattcttccagactgcatcaggatctgtgtgctccttcttggggctcccagccagcccct gcatggggtgccaggaggctctccaggcatcagcttcacagagagaatcttgatgcccaa caaggacagcattttcagagatgtacccttacaaggggtccccatacaagattgaaatgg actgattccgacaacgggcagaagagcatttccactaacaactgcacagtcccagctgtt tctgaaaagctgcgcaagagagcagctgacttcacccagtctctgctggaccaccttcct cttccatccccctggcagctaagaggaaacacatgtgtctcctcaactgtaaaatggaga tgtgaaagggagggtggcaagggacagcctgatgctacagaggccctagtaaacgattgt gcatctacactaagtctcaacggtgtcctgccagagggattaaagctgagcctgtgtcta gatccagctgtcaattgccaggaaatacaaaggacagaggaacatgaagaactacaccga aagtacacaatcagcaaaatccaggctgtggaccctccacaagtcaaacagcctgggtcc gtcaactggagcctcctatctaagacatctgagacaagactgggatttggtctcggagca gaggaggagaggactactcacaaaatcacagaccttctgatgagacataacctgatgtgt aaaggtactaacaggaccggacaagcccaggagcgcaataccgtcataaaggcaacaaca atggacgcctcgcctctgccctctccatatttctgtgctttccaatggcatccccaaaac agtgtgacaacgcttcactactggaggaccctggagggacctagagttggccgattgcta gaggagtaa