GENSCAN 1.0 Date run: 6-Nov-116 Time: 08:37:47 Sequence gi568815582f:84550868_84762130 : 211263 bp : 47.41% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2788 2920 133 2 1 107 59 33 0.747 2.00 1.02 Intr + 5059 5288 230 1 2 114 89 63 0.515 6.51 1.03 Term + 7549 7619 71 2 2 131 55 17 0.412 0.80 1.04 PlyA + 11482 11487 6 1.05 2.12 PlyA - 14757 14752 6 1.05 2.11 Term - 16088 15978 111 2 0 102 40 214 0.456 16.56 2.10 Intr - 29198 29116 83 0 2 63 50 78 0.276 0.86 2.09 Intr - 33222 33195 28 0 1 115 32 44 0.240 -0.81 2.08 Intr - 39395 39238 158 0 2 82 84 322 0.753 30.93 2.07 Intr - 56449 56163 287 0 2 115 57 182 0.210 14.79 2.06 Intr - 57514 57480 35 1 2 43 119 11 0.083 -3.28 2.05 Intr - 63354 63290 65 1 2 105 75 44 0.097 3.24 2.04 Intr - 64235 64141 95 1 2 48 79 107 0.555 5.41 2.03 Intr - 65745 65706 40 1 1 66 21 66 0.437 -5.02 2.02 Intr - 66716 66634 83 1 2 50 91 123 0.946 8.08 2.01 Init - 67047 66971 77 0 2 103 62 216 0.701 21.06 2.00 Prom - 77605 77566 40 -1.06 3.00 Prom + 91757 91796 40 -1.46 3.01 Init + 92071 92135 65 0 2 72 71 49 0.539 0.95 3.02 Intr + 92565 92613 49 0 1 104 69 25 0.474 0.88 3.03 Intr + 98021 98199 179 1 2 55 36 200 0.982 10.22 3.04 Intr + 99985 100063 79 1 1 101 66 27 0.972 1.35 3.05 Intr + 106004 107077 1074 1 0 107 100 2580 0.997 251.50 3.06 Intr + 108893 109050 158 1 2 23 53 232 0.689 12.01 3.07 Term + 110711 111266 556 2 1 122 46 1237 0.999 116.70 3.08 PlyA + 114164 114169 6 1.05 4.03 PlyA - 114671 114666 6 1.05 4.02 Term - 115117 115029 89 2 2 70 49 103 0.527 2.42 4.01 Init - 117349 117319 31 1 1 50 94 62 0.540 2.91 4.00 Prom - 140507 140468 40 -2.86 5.00 Prom + 141893 141932 40 -5.36 5.01 Init + 149019 149179 161 2 2 42 49 227 0.295 11.50 5.02 Intr + 161361 161536 176 0 2 93 60 0 0.017 -2.72 5.03 Intr + 162568 162644 77 2 2 114 75 9 0.273 1.43 5.04 Intr + 182568 182636 69 2 0 95 94 26 0.053 3.28 5.05 Intr + 189442 189502 61 0 1 75 103 23 0.186 0.81 5.06 Intr + 193766 194806 1041 0 0 91 80 740 0.884 63.95 5.07 Intr + 201003 201131 129 1 0 64 44 63 0.451 0.17 5.08 Intr + 202570 202688 119 2 2 111 80 31 0.297 4.78 5.09 Term + 207809 208012 204 1 0 35 36 129 0.026 -0.23 5.10 PlyA + 209610 209615 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 63354 63185 170 1 2 105 36 85 0.832 3.04 S.002 Term + 165328 165402 75 0 0 139 38 38 0.859 1.74 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:84550868_84762130|GENSCAN_predicted_peptide_1|144_aa XPQLRNLRWVEENEQFPSLHKHFRQKTTEHKVFDADETDGLDGGLNVNPRLLDLQVCQEK LKARFLGETFASLNVSTQSKLRTGTSRRCCASHALHDSEDVGFMFVPVWSAPYSKLHTAQ PGLTSRFHCARCISARVDLLKTKV >gi568815582f:84550868_84762130|GENSCAN_predicted_CDS_1|435_bp nggcctcagttaaggaacctaagatgggtagaggaaaatgaacaatttccttccctacat aagcatttcaggcagaagacaactgaacacaaggtgttcgatgctgacgaaactgacggg ttggatggagggctgaatgtaaacccaaggttactggatcttcaggtttgtcaagaaaag ctgaaagctagatttttaggtgaaacctttgcatctttaaatgttagcacccaatccaaa cttcggactgggacttccaggcgctgttgtgcaagtcatgccctgcacgactccgaggac gtcggtttcatgtttgttccagtgtggagtgctccctacagtaaactgcacactgcgcag ccagggctcacctctcgttttcattgtgcccgatgcatctctgctcgggtggacctactc aaaaccaaggtctga >gi568815582f:84550868_84762130|GENSCAN_predicted_peptide_2|353_aa MATKIDKEACRAAYNLVRDDGSAVIWVTFKYDGSTIVPGEQGAEYQHFIQQCTVGSKLSD NGSTARRDNGLNTGTGWIAVTTLDVPYLGHTSEQPGVAAQFAWDGQRPCEGGEKCPVQIN KMMKDDRCVSSWPVDSLPSACTVFICFAFLEDSYTPVKARCECSLCSDALLVFVVLVSRM VVVCMSVSSTPWRSPPGERDFVPLVVVSLAGNPHPSWLDGCQSLSPMDDVRLFAFVRFTT GDAMSKRSKFALITWIGENVSGLQRAKTGTDKTLVKEVVQDCGDNAVTAGTCQQLASGRC GAVIPDTVASTVYSRCANFAKEFVISDRKELEEDFIKSELKKAGGANYDAQTE >gi568815582f:84550868_84762130|GENSCAN_predicted_CDS_2|1062_bp atggccaccaagatcgacaaagaggcttgccgggcggcgtacaacctggtgcgcgacgac ggctcggccgtcatctgggtgacttttaaatatgacggctccaccatcgtccccggcgag cagggagcggagtaccagcacttcatccagcagtgcacagttggaagcaagctttcggac aacggcagcacagcccggagggacaatggactcaacactggcactggctggatagcagtc accacgctcgacgtgccttatcttggtcatacctcagaacagcctggtgtggcagctcag ttcgcctgggatggtcagagaccttgcgagggcggtgagaaatgcccagtgcagattaat aaaatgatgaaggatgatagatgtgtatcttcctggcctgtggactctcttccctctgcc tgcactgtcttcatctgtttcgccttcctggaggactcctacacacccgtcaaggcccgg tgtgaatgttccctctgctcggatgccctccttgtctttgtggtgttggtgtcccgtatg gtggtcgtgtgcatgtctgtttcttcaactccttggcgaagtcctcctggagagagggac tttgtccctttggtggtggtatccttggcggggaatccccatccttcatggttagatggc tgccaatctctttcccccatggatgacgtccggttgtttgccttcgtgcgcttcaccacc ggggatgccatgagcaagaggtccaagtttgccctcatcacgtggatcggtgagaacgtc agcgggctgcagcgcgccaaaaccgggacggacaagaccctggtgaaggaggtcgtacag gattgtggggacaatgcagtgaccgctggcacctgccagcagctggcctccggacgctgt ggtgcagtgattcccgacacagtcgctagcacggtatacagtcgctgtgccaatttcgct aaggagtttgtgatcagtgatcggaaggagctggaggaagatttcatcaagagcgagctg aagaaggcggggggagccaattacgacgcccagacggagtaa >gi568815582f:84550868_84762130|GENSCAN_predicted_peptide_3|719_aa MEKLRHNGAKLLAQFTQLGVARRLGLGNTQQGKLLQQQVSGPRPGPSRLRLLAAGWSLLG GRCAEFRLGCGVRANDSGVPGDPAGEPVSAASLSPTLRAEISLMMEGSRQTRVSRPYKIS ESSKVYRWADHSSTVLQRLNEQRLRGLFCDVVLVADEQRVPAHRNLLAVCSDYFNSMFTI GMREAFQKEVELIGASYIGLKAVVDFLYGGELVLDGGNIDYVLETAHLLQIWTVVDFCCE YLEQEVSEDNYLYLQELASIYSLKRLDAFIDGFILNHFGTLSFTPDFLQNVSMQKLCVYL SSSEVQRECEHDLLQAALQWLTQQPEREAHARQVLENIHFPLIPKNDLLHRVKPAVCSLL PKEANCEGFIEEAVRYHNNLAAQPVMQTKRTALRTNQERLLFVGGEVSERCLELSDDTCY LDAKSEQWVKETPLPARRSHHCVAVLGGFIFIAGGSFSRDNGGDAASNLLYRYDPRCKQW IKVASMNQRRVDFYLASIEDMLVAIGGRNENGALSSVETYSPKTDSWSYVAGLPRFTYGH AGTIYKDFVYISGGHDYQIGPYRKNLLCYDHRTDVWEERRPMTTARGWHSMCSLGDSIYS IGGSDDNIESMERFDVLGVEAYSPQCNQWTRVAPLLHANSESGVAVWEGRIYILGGYSWE NTAFSKTVQVYDREADKWSRGVDLPKAIAGGSACVCALEPRPEDKKKKGKGKRHQDRGQ >gi568815582f:84550868_84762130|GENSCAN_predicted_CDS_3|2160_bp atggagaaactgaggcacaacggagctaagttgctggcccagttcacacagctgggagta gccaggagactcggacttggaaacacccagcagggaaaattgctgcagcagcaggtgagt ggcccccgccccgggccgtcccgactccgcctcctcgccgccggctggagcctgctgggc ggccgttgcgctgagttccgcctgggctgcggggtccgagcgaacgacagcggcgtcccc ggagaccccgccggcgagccggtgtcagcagcgtcgctttctccaacgctgagggctgaa atctctttaatgatggagggaagcaggcagacgcgagtgtctcggccatacaagatcagc gaatcatcaaaggtataccgctgggccgaccactcaagcacggtgctgcagcggctgaac gagcagcgtctccgcgggctcttctgcgacgtcgtcctggtggccgatgagcagcgtgtg ccagcccatcgcaacctgctggccgtgtgcagcgactacttcaactccatgttcaccatc ggcatgcgggaagctttccagaaggaggtggagctgatcggcgcctcctacattgggctc aaggccgtggtggacttcctgtacggcggggagctggtgctggatggcggcaacattgac tacgtcctggagacggctcacctgctgcagatctggacggtggtagacttctgctgtgag tacctggagcaggaggtgagcgaggacaactacctgtacctgcaggagctggcctccatc tacagcctcaagcggcttgatgccttcatcgatggcttcatcctgaaccacttcggcacg ctgtcctttacgcccgacttcctgcagaacgtctccatgcagaagctgtgtgtctacctg agcagcagcgaggtgcagcgggagtgtgagcacgacctcctgcaggccgccctgcagtgg ctgacgcagcagcccgagcgcgaggcccacgcccgccaggtgctggagaacatccacttc ccgctcatccccaagaacgacctgctgcaccgcgtcaagccggccgtgtgctcgctgctg cccaaggaggccaactgcgagggcttcatcgaggaggccgtgcgctaccacaacaacctg gcggcccagcccgtcatgcagaccaagcgcacggcgctgcgcaccaaccaggagcgcctg ctgtttgtgggcggcgaggtctccgagcggtgtctggagctcagtgacgacacctgctac ctggacgccaagagcgagcagtgggtcaaagagacgccgctgcccgcccggcggagccac cactgtgtcgcggtgctggggggcttcatcttcatcgccggcggcagcttctcacgggac aacggaggggatgcggcctccaatcttctttataggtatgacccccgctgtaaacagtgg atcaaggtggcctccatgaaccagcgccgtgtggatttctaccttgcctccatcgaagac atgctggtggccatcggcggccggaatgagaacggagcgctctcttcagtagagacgtac agtcccaagactgactcctggtcctatgtggccggcttgccaaggttcacgtacggccac gcgggcaccatctacaaagacttcgtgtacatctcggggggccacgactaccaaattggc ccctaccgcaagaacctgctatgctacgaccaccggacagacgtgtgggaggagcggcgg cccatgaccacggcgcgcggctggcacagcatgtgcagcctgggtgacagcatctactcc atcgggggcagcgatgacaacatcgagtccatggagcgcttcgacgtgctgggcgtggag gcctacagcccgcagtgcaaccagtggacccgcgtggcgccgctgctgcacgccaacagc gagtcgggcgtggcagtgtgggagggccgcatctacatcctgggcggctacagctgggag aacactgccttctccaagaccgtgcaggtgtacgaccgcgaggccgacaagtggagcagg ggcgtcgacctgcccaaggccatcgctggcgggtccgcctgtgtctgcgccctggagcca cggccagaggacaagaagaagaaaggcaaaggcaagaggcaccaggaccggggccagtga >gi568815582f:84550868_84762130|GENSCAN_predicted_peptide_4|39_aa MLLKRMRKNPGLGPVDGVPTTGSDSHKAHEMLLHHGGPG >gi568815582f:84550868_84762130|GENSCAN_predicted_CDS_4|120_bp atgcttctgaagcggatgcgaaagaacccaggtttgggaccagtggacggggttccaacc acgggctctgacagccacaaggcccatgagatgctgctgcaccatggaggaccagggtga >gi568815582f:84550868_84762130|GENSCAN_predicted_peptide_5|678_aa MSAARQAHAHLSAAQAQQRAGLPAPRGARPVRRRGGRCECVCAGEKMAAAGEAARQSAVY RGPCGPVLRVAEGRHGSAWGDCALCRGQHHMDEPEAAAGLAALSGQEGFLESEHFTPNTL HTRYPDVCVWFTLSSSPNYIFGDFSPDEFNQFFVTPRSSVELPPYSGTVLCGTQAVDKLP DGQEYQRIEFGVDEVIEPSDTLPRTPSYSISSTLNPQAPEFILGCTASKITPDGITKEAS YGSIDCQYPGSALALDGSSNVEAEVLENDGVSGGLGQRERKKKKKRPPGYYSYLKDGGDD SISTEALVNGHANSAVPNSVSAEDAEFMGDMPPSVTPRTCNSPQNSTDSVSDIVPDSPFP GALGSDTRTAGQPEGGPGADFGQSCFPAEAGRDTLSRTAGAQPCVGTDTTENLGVANGQI LESSGEGTATNGVELHTTESIDLDPTKPESASPPADGTGSASGTLPVSQPKSWASLFHDS KPSSSSPVAYVETKYSPPAISPLVSEKQVEVKEGLVPVSEDPVAIKIAGKQNLMCKICVR IIIAYDDIKVYDAGIAFELINNFSGLICGSPWGILMSVLSWPSCREWSAYVFTCVLIFAI AERGAAPYLFQMSSISEICFFTLSELLENVTLIHKPVSLQPRGLINKGNWCYINAVSFLD AVRKASLLQLSLLWVHVT >gi568815582f:84550868_84762130|GENSCAN_predicted_CDS_5|2037_bp atgtccgcggcccggcaggcgcacgcccacctgtcggccgcgcaggcgcagcagcgggcc ggcctccccgcgccccgcggcgcgcggccagtgcgcaggcgcggcggccgatgcgagtgt gtatgtgcgggcgagaagatggcggcggcgggggaagcagcgaggcagtcagctgtgtac agaggcccttgtggtcctgtcctgagagtagcggaaggaagacatggctctgcctgggga gactgtgccttatgcagaggccaacaccacatggatgagccggaagcagccgctggcctg gctgctctttcgggtcaggaaggctttcttgagtctgagcacttcacacctaatacccta catacccgctatcctgacgtgtgtgtctggttcactctctcgtcctcccccaattatatt tttggagattttagccctgatgaattcaatcaattctttgtgactcctcgatcttcagtt gagcttcctccatacagtggaacagttctgtgtggcacacaggctgtggataaactacct gatggacaagaatatcagagaattgagtttggtgtcgatgaagtcattgaacccagtgac actttgccgagaacccccagctacagtatttcaagcacactgaaccctcaggcccctgaa tttattctcggttgtacagcttccaaaataacccctgatggtatcactaaagaagcaagc tatggctccatcgactgccagtacccaggctctgccctcgctttggatggaagttctaat gtggaggcggaagttttggaaaatgatggtgtctcaggtggtcttggacaaagggagcgt aaaaagaagaaaaagcggccacctggatattacagctatttgaaagatggtggcgatgat agtatctccacagaagccctggtcaatggccatgccaattcagcagtcccgaacagtgtc agtgcagaggatgcagaatttatgggtgacatgcccccgtcagttacgcccaggacttgt aacagcccccagaactccacagactctgtcagtgacattgtgcctgacagtcctttcccc ggagcactcggcagtgacaccaggactgcagggcagccagaggggggccccggggctgat tttggtcagtcctgcttccctgcagaggctggcagagacaccctgtcaaggacagctggg gctcagccctgcgttggtaccgatactactgaaaaccttggagttgctaatggacaaata cttgaatcctcgggtgagggcacagctaccaacggggtggagttgcacaccacggaaagc atagacttggacccaaccaaacccgagagtgcatcacctcctgctgacggcacgggctct gcatcaggcacccttcctgtcagccagcccaagtcctgggccagcctctttcatgattct aagccctcttcctcctcgccggtggcctatgtggaaactaagtattcccctcccgccata tctcccctggtttctgaaaagcaggttgaagtcaaagaagggcttgttccggtttcagag gatcctgtagccataaagattgcaggaaaacagaatttgatgtgtaaaatatgtgtaaga ataatcatagcttatgatgacataaaagtttacgatgctggtattgcctttgagttgata aacaatttcagtgggctgatttgtggcagcccttgggggattctgatgtctgtattgagc tggccgtcatgccgggagtggtctgcttatgtatttacatgcgtgctcatctttgctata gctgagcgtggtgcagctccttacctgttccagatgtcatcaatttctgaaatatgcttc ttcactctttcagagttgctggagaatgtaaccctaatccataaaccagtgtcgttgcaa ccccgtgggctgatcaataaagggaactggtgctacattaatgctgtatccttcctggac gccgtccgcaaggccagcttgttgcagctgtccctcctttgggtgcatgtgacttag