GENSCAN 1.0 Date run: 6-Nov-116 Time: 14:04:04 Sequence gi568815582r:84466848_84717914 : 251067 bp : 48.84% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 PlyA - 2040 2035 6 1.05 1.07 Term - 13179 13066 114 0 0 91 55 182 0.999 13.77 1.06 Intr - 13861 13682 180 1 0 123 97 154 0.996 19.86 1.05 Intr - 15863 15745 119 0 2 88 93 153 0.936 15.98 1.04 Intr - 20212 19784 429 2 0 129 116 456 0.658 46.19 1.03 Intr - 22575 22425 151 0 1 38 27 120 0.288 0.64 1.02 Intr - 29066 28836 231 2 0 106 67 179 0.747 15.57 1.01 Init - 29325 29287 39 0 0 69 60 44 0.314 -0.32 1.00 Prom - 31143 31104 40 -8.46 2.00 Prom + 32887 32926 40 -3.26 2.01 Init + 37916 38024 109 1 1 39 28 135 0.114 0.98 2.02 Intr + 38368 38556 189 2 0 88 69 81 0.826 5.76 2.03 Intr + 39221 39321 101 0 2 22 89 41 0.110 -2.57 2.04 Intr + 48266 48368 103 1 1 92 47 103 0.520 6.35 2.05 Intr + 53313 53374 62 1 2 69 42 82 0.041 0.15 2.06 Intr + 60046 60153 108 0 0 104 65 24 0.232 2.18 2.07 Intr + 60496 60518 23 0 2 44 96 44 0.123 -2.86 2.08 Intr + 61647 61799 153 0 0 33 84 111 0.217 4.39 2.09 Intr + 67639 67743 105 1 0 46 80 62 0.308 0.53 2.10 Intr + 71242 71347 106 1 1 95 51 63 0.612 3.52 2.11 Intr + 72557 72635 79 1 1 92 80 72 0.189 5.92 2.12 Intr + 86808 86940 133 1 1 107 59 33 0.672 2.00 2.13 Intr + 89079 89308 230 0 2 114 89 63 0.398 6.51 2.14 Term + 91569 91639 71 1 2 131 55 17 0.325 0.80 2.15 PlyA + 95502 95507 6 1.05 3.12 PlyA - 98777 98772 6 1.05 3.11 Term - 100108 99998 111 1 0 102 40 214 0.456 16.56 3.10 Intr - 113218 113136 83 2 2 63 50 78 0.276 0.86 3.09 Intr - 117242 117215 28 2 1 115 32 44 0.240 -0.81 3.08 Intr - 123415 123258 158 2 2 82 84 322 0.753 30.93 3.07 Intr - 140469 140183 287 2 2 115 57 182 0.210 14.79 3.06 Intr - 141534 141500 35 0 2 43 119 11 0.083 -3.28 3.05 Intr - 147374 147310 65 0 2 105 75 44 0.097 3.24 3.04 Intr - 148255 148161 95 0 2 48 79 107 0.555 5.41 3.03 Intr - 149765 149726 40 0 1 66 21 66 0.437 -5.02 3.02 Intr - 150736 150654 83 0 2 50 91 123 0.946 8.08 3.01 Init - 151067 150991 77 2 2 103 62 216 0.701 21.06 3.00 Prom - 161625 161586 40 -1.06 4.00 Prom + 175777 175816 40 -1.46 4.01 Init + 176091 176155 65 2 2 72 71 49 0.539 0.95 4.02 Intr + 176585 176633 49 2 1 104 69 25 0.474 0.88 4.03 Intr + 182041 182219 179 0 2 55 36 200 0.982 10.22 4.04 Intr + 184005 184083 79 0 1 101 66 27 0.972 1.35 4.05 Intr + 190024 191097 1074 0 0 107 100 2580 0.997 251.50 4.06 Intr + 192913 193070 158 0 2 23 53 232 0.689 12.01 4.07 Term + 194731 195286 556 1 1 122 46 1237 0.999 116.70 4.08 PlyA + 198184 198189 6 1.05 5.03 PlyA - 198691 198686 6 1.05 5.02 Term - 199137 199049 89 1 2 70 49 103 0.527 2.42 5.01 Init - 201369 201339 31 0 1 50 94 62 0.540 2.91 5.00 Prom - 224527 224488 40 -2.86 6.00 Prom + 225913 225952 40 -5.36 6.01 Init + 233039 233199 161 1 2 42 49 227 0.280 11.50 6.02 Intr + 245381 245556 176 2 2 93 60 0 0.009 -2.72 6.03 Intr + 246588 246664 77 1 2 114 75 9 0.147 1.43 6.04 Term + 249348 249422 75 2 0 139 38 38 0.453 1.74 6.05 PlyA + 249841 249846 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 147374 147205 170 0 2 105 36 85 0.832 3.04 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582r:84466848_84717914|GENSCAN_predicted_peptide_1|420_aa MWGKGKKAMGITLNHVGEALPPEMVTRLYDGMRRVDLTGKAKGPSENVSQEQFTASMSHL LKGNSEEKSLMIMKMISATEGPVKAREVQKFTEDLVGSVVHVLSHRQELRGWTGKEAPGP NPRVQVLAAQLLSDMKLQGKYGKRLLGPQWLDYDCDRAVIEDWVFRVPHVAIFLSVVICK GFLILCSSLDLTTLVPERQVDQGRGFESILDVLSVMYINAQLPREQRHRWCLLFSSELHG HSFSQLCGHITHRGPCVAVLEDHDKHVFGGFASCSWEVKPQFQGDNRCFLFSICPSMAVY THTGYNDHYMYLNHGQQTIPNGLGMGGQHNYFGLWVDVDFGKGHSRAKPTCTTYNSPQLS AQENFQFDKMEVWAVGDPSEEQLAKGNKSILDADPEAQALLEISGHSRHSEGLREVPDDE >gi568815582r:84466848_84717914|GENSCAN_predicted_CDS_1|1263_bp atgtggggaaagggcaagaaagccatgggaatcaccttgaaccacgtcggggaagctctt cccccagagatggtcaccaggctgtatgatggcatgcggagggtcgacctgacagggaag gcgaagggacccagtgagaacgtgtcccaggagcagttcacagcatccatgtcccacctg ttgaaaggaaactccgaggagaagagtctcatgattatgaaaatgatttctgccacagaa ggtcccgtgaaggccagagaagtccaaaagtttacagaggatctggttggctctgtggtg cacgtgctaagccacagacaggagctgagaggctggactgggaaggaagccccagggccc aacccccgggtgcaggtgctggctgctcagctgctctctgacatgaagctgcaaggcaag tatggcaagagacttctggggccccagtggctggactatgactgtgaccgagctgtgatc gaggactgggtgttcagggtcccccatgtggccatattcctgagtgtggtcatttgcaag ggctttctcatcctgtgctcgtctcttgatctgactaccctggtccctgagcgtcaagtg gaccagggcaggggttttgagagcatcctggatgtcctctctgtcatgtacatcaacgcc cagctgcctcgggagcagcggcaccgctggtgcctgctcttttcgtctgagctccatgga cacagcttctcccagctctgtggccacatcactcaccggggaccctgtgtggctgtcctc gaggaccatgacaagcatgtgttcggtgggtttgcctcttgctcttgggaggtgaagcct cagtttcaaggggacaacagatgcttcctgttctccatctgccccagcatggctgtgtac acacacacgggctacaacgaccactacatgtacttgaaccatggacagcagacgatcccg aacggactgggtatgggggggcagcacaattactttgggctttgggtggatgttgatttt gggaaaggacacagcagagccaagcccacgtgcaccacgtacaacagcccgcagctgtcg gctcaggagaacttccagtttgataagatggaggtgtgggcggttggagacccctcagag gagcagttggccaagggcaacaagagcatcctggatgcggaccctgaggcccaggccctg ctggagatcagtgggcattcgcgccacagcgaagggctccgggaagtcccggacgatgaa tga >gi568815582r:84466848_84717914|GENSCAN_predicted_peptide_2|523_aa MVTRLTGSEPNPVGAARGGAAVCGRSGCRPGNPARLRPSRQPSGASSTAWLNRLPPPHVR LFSRFPFTPRRLNLSPHFFSSFLLKPTETWSRGPAHYIRARAPDAQDTRAFRHEGLDPVS SLHTPVLPAHINQGLPYMVNRPSDNVQPGLVTEVLSFERGFARIFTEDLLPGGNRHEKLP VDFPKVLEPRGPRLCCCLLSGSTSRANGLSRFRANPRQKAQIHPAATLTEAGGLVISAYG FYKGHKGVTTVSVGYFSHLDTQTLDVHDFKEGTSVSAPPVLKPLQKAVNQEKATDTVNLD VRWLARSLIISLTFNPVRAMSPMAPSRQPQGTDNLSAINVSQAFCRCHRGMRRSLVSFDC SSLTVDVILRHSEYLLEYRGPQLRNLRWVEENEQFPSLHKHFRQKTTEHKVFDADETDGL DGGLNVNPRLLDLQVCQEKLKARFLGETFASLNVSTQSKLRTGTSRRCCASHALHDSEDV GFMFVPVWSAPYSKLHTAQPGLTSRFHCARCISARVDLLKTKV >gi568815582r:84466848_84717914|GENSCAN_predicted_CDS_2|1572_bp atggtgacgcgcctcacagggagtgagccgaacccggtgggtgcggctcggggcggggcc gcggtctgcgggaggagcggctgcaggcccgggaatccggcccggctgcgtccttcccgg cagccatcaggcgccagctccacagcctggctaaacagactccctccaccccacgtgcgc ctcttcagccggttcccgttcactccaagacgtctgaacctgtcaccccacttcttcagc tcgttccttctcaaacccaccgaaacctggtcccgggggcctgctcactacatccgagcc agggctcctgatgcccaagacacacgggcctttagacatgaaggactagaccctgtttcc agccttcacacccctgtacttcctgcacacatcaaccagggattgccttatatggtgaac aggcccagtgacaacgtgcagccagggctggtgacggaagtcctcagtttcgaaagaggc tttgcacggatctttacagaagacctgctccctggaggcaaccgccacgagaagcttccc gtggattttcccaaagttctagagccacgtggaccccgcctctgctgctgccttttgtct ggcagcaccagccgggctaacggtctaagcagatttcgtgcaaatccccgtcagaaagca caaatccacccggcggccacgctcacggaagcaggggggcttgtgatttctgcctatggc ttttataaaggtcataaaggggtcacaactgtttctgtgggttacttctctcacttggac acccaaaccctcgatgttcatgacttcaaagagggaactagtgtttcagctcctcccgtg ctgaaaccgctacaaaaggctgtcaaccaggaaaaggcaacagacacagtgaacctggat gtcagatggctggcaaggtctctcattatttccttgacatttaatccagttagggctatg tcccccatggctccttcacgccagccccaggggactgacaacctctcagccataaatgtc agccaggccttctgccgatgccaccgggggatgcggaggtccttggtttcctttgactgc agctccctgacagtagatgtgattctgcgacactcagaatacctcctggagtacagaggg cctcagttaaggaacctaagatgggtagaggaaaatgaacaatttccttccctacataag catttcaggcagaagacaactgaacacaaggtgttcgatgctgacgaaactgacgggttg gatggagggctgaatgtaaacccaaggttactggatcttcaggtttgtcaagaaaagctg aaagctagatttttaggtgaaacctttgcatctttaaatgttagcacccaatccaaactt cggactgggacttccaggcgctgttgtgcaagtcatgccctgcacgactccgaggacgtc ggtttcatgtttgttccagtgtggagtgctccctacagtaaactgcacactgcgcagcca gggctcacctctcgttttcattgtgcccgatgcatctctgctcgggtggacctactcaaa accaaggtctga >gi568815582r:84466848_84717914|GENSCAN_predicted_peptide_3|353_aa MATKIDKEACRAAYNLVRDDGSAVIWVTFKYDGSTIVPGEQGAEYQHFIQQCTVGSKLSD NGSTARRDNGLNTGTGWIAVTTLDVPYLGHTSEQPGVAAQFAWDGQRPCEGGEKCPVQIN KMMKDDRCVSSWPVDSLPSACTVFICFAFLEDSYTPVKARCECSLCSDALLVFVVLVSRM VVVCMSVSSTPWRSPPGERDFVPLVVVSLAGNPHPSWLDGCQSLSPMDDVRLFAFVRFTT GDAMSKRSKFALITWIGENVSGLQRAKTGTDKTLVKEVVQDCGDNAVTAGTCQQLASGRC GAVIPDTVASTVYSRCANFAKEFVISDRKELEEDFIKSELKKAGGANYDAQTE >gi568815582r:84466848_84717914|GENSCAN_predicted_CDS_3|1062_bp atggccaccaagatcgacaaagaggcttgccgggcggcgtacaacctggtgcgcgacgac ggctcggccgtcatctgggtgacttttaaatatgacggctccaccatcgtccccggcgag cagggagcggagtaccagcacttcatccagcagtgcacagttggaagcaagctttcggac aacggcagcacagcccggagggacaatggactcaacactggcactggctggatagcagtc accacgctcgacgtgccttatcttggtcatacctcagaacagcctggtgtggcagctcag ttcgcctgggatggtcagagaccttgcgagggcggtgagaaatgcccagtgcagattaat aaaatgatgaaggatgatagatgtgtatcttcctggcctgtggactctcttccctctgcc tgcactgtcttcatctgtttcgccttcctggaggactcctacacacccgtcaaggcccgg tgtgaatgttccctctgctcggatgccctccttgtctttgtggtgttggtgtcccgtatg gtggtcgtgtgcatgtctgtttcttcaactccttggcgaagtcctcctggagagagggac tttgtccctttggtggtggtatccttggcggggaatccccatccttcatggttagatggc tgccaatctctttcccccatggatgacgtccggttgtttgccttcgtgcgcttcaccacc ggggatgccatgagcaagaggtccaagtttgccctcatcacgtggatcggtgagaacgtc agcgggctgcagcgcgccaaaaccgggacggacaagaccctggtgaaggaggtcgtacag gattgtggggacaatgcagtgaccgctggcacctgccagcagctggcctccggacgctgt ggtgcagtgattcccgacacagtcgctagcacggtatacagtcgctgtgccaatttcgct aaggagtttgtgatcagtgatcggaaggagctggaggaagatttcatcaagagcgagctg aagaaggcggggggagccaattacgacgcccagacggagtaa >gi568815582r:84466848_84717914|GENSCAN_predicted_peptide_4|719_aa MEKLRHNGAKLLAQFTQLGVARRLGLGNTQQGKLLQQQVSGPRPGPSRLRLLAAGWSLLG GRCAEFRLGCGVRANDSGVPGDPAGEPVSAASLSPTLRAEISLMMEGSRQTRVSRPYKIS ESSKVYRWADHSSTVLQRLNEQRLRGLFCDVVLVADEQRVPAHRNLLAVCSDYFNSMFTI GMREAFQKEVELIGASYIGLKAVVDFLYGGELVLDGGNIDYVLETAHLLQIWTVVDFCCE YLEQEVSEDNYLYLQELASIYSLKRLDAFIDGFILNHFGTLSFTPDFLQNVSMQKLCVYL SSSEVQRECEHDLLQAALQWLTQQPEREAHARQVLENIHFPLIPKNDLLHRVKPAVCSLL PKEANCEGFIEEAVRYHNNLAAQPVMQTKRTALRTNQERLLFVGGEVSERCLELSDDTCY LDAKSEQWVKETPLPARRSHHCVAVLGGFIFIAGGSFSRDNGGDAASNLLYRYDPRCKQW IKVASMNQRRVDFYLASIEDMLVAIGGRNENGALSSVETYSPKTDSWSYVAGLPRFTYGH AGTIYKDFVYISGGHDYQIGPYRKNLLCYDHRTDVWEERRPMTTARGWHSMCSLGDSIYS IGGSDDNIESMERFDVLGVEAYSPQCNQWTRVAPLLHANSESGVAVWEGRIYILGGYSWE NTAFSKTVQVYDREADKWSRGVDLPKAIAGGSACVCALEPRPEDKKKKGKGKRHQDRGQ >gi568815582r:84466848_84717914|GENSCAN_predicted_CDS_4|2160_bp atggagaaactgaggcacaacggagctaagttgctggcccagttcacacagctgggagta gccaggagactcggacttggaaacacccagcagggaaaattgctgcagcagcaggtgagt ggcccccgccccgggccgtcccgactccgcctcctcgccgccggctggagcctgctgggc ggccgttgcgctgagttccgcctgggctgcggggtccgagcgaacgacagcggcgtcccc ggagaccccgccggcgagccggtgtcagcagcgtcgctttctccaacgctgagggctgaa atctctttaatgatggagggaagcaggcagacgcgagtgtctcggccatacaagatcagc gaatcatcaaaggtataccgctgggccgaccactcaagcacggtgctgcagcggctgaac gagcagcgtctccgcgggctcttctgcgacgtcgtcctggtggccgatgagcagcgtgtg ccagcccatcgcaacctgctggccgtgtgcagcgactacttcaactccatgttcaccatc ggcatgcgggaagctttccagaaggaggtggagctgatcggcgcctcctacattgggctc aaggccgtggtggacttcctgtacggcggggagctggtgctggatggcggcaacattgac tacgtcctggagacggctcacctgctgcagatctggacggtggtagacttctgctgtgag tacctggagcaggaggtgagcgaggacaactacctgtacctgcaggagctggcctccatc tacagcctcaagcggcttgatgccttcatcgatggcttcatcctgaaccacttcggcacg ctgtcctttacgcccgacttcctgcagaacgtctccatgcagaagctgtgtgtctacctg agcagcagcgaggtgcagcgggagtgtgagcacgacctcctgcaggccgccctgcagtgg ctgacgcagcagcccgagcgcgaggcccacgcccgccaggtgctggagaacatccacttc ccgctcatccccaagaacgacctgctgcaccgcgtcaagccggccgtgtgctcgctgctg cccaaggaggccaactgcgagggcttcatcgaggaggccgtgcgctaccacaacaacctg gcggcccagcccgtcatgcagaccaagcgcacggcgctgcgcaccaaccaggagcgcctg ctgtttgtgggcggcgaggtctccgagcggtgtctggagctcagtgacgacacctgctac ctggacgccaagagcgagcagtgggtcaaagagacgccgctgcccgcccggcggagccac cactgtgtcgcggtgctggggggcttcatcttcatcgccggcggcagcttctcacgggac aacggaggggatgcggcctccaatcttctttataggtatgacccccgctgtaaacagtgg atcaaggtggcctccatgaaccagcgccgtgtggatttctaccttgcctccatcgaagac atgctggtggccatcggcggccggaatgagaacggagcgctctcttcagtagagacgtac agtcccaagactgactcctggtcctatgtggccggcttgccaaggttcacgtacggccac gcgggcaccatctacaaagacttcgtgtacatctcggggggccacgactaccaaattggc ccctaccgcaagaacctgctatgctacgaccaccggacagacgtgtgggaggagcggcgg cccatgaccacggcgcgcggctggcacagcatgtgcagcctgggtgacagcatctactcc atcgggggcagcgatgacaacatcgagtccatggagcgcttcgacgtgctgggcgtggag gcctacagcccgcagtgcaaccagtggacccgcgtggcgccgctgctgcacgccaacagc gagtcgggcgtggcagtgtgggagggccgcatctacatcctgggcggctacagctgggag aacactgccttctccaagaccgtgcaggtgtacgaccgcgaggccgacaagtggagcagg ggcgtcgacctgcccaaggccatcgctggcgggtccgcctgtgtctgcgccctggagcca cggccagaggacaagaagaagaaaggcaaaggcaagaggcaccaggaccggggccagtga >gi568815582r:84466848_84717914|GENSCAN_predicted_peptide_5|39_aa MLLKRMRKNPGLGPVDGVPTTGSDSHKAHEMLLHHGGPG >gi568815582r:84466848_84717914|GENSCAN_predicted_CDS_5|120_bp atgcttctgaagcggatgcgaaagaacccaggtttgggaccagtggacggggttccaacc acgggctctgacagccacaaggcccatgagatgctgctgcaccatggaggaccagggtga >gi568815582r:84466848_84717914|GENSCAN_predicted_peptide_6|162_aa MSAARQAHAHLSAAQAQQRAGLPAPRGARPVRRRGGRCECVCAGEKMAAAGEAARQSAVY RGPCGPVLRVAEGRHGSAWGDCALCRGQHHMDEPEAAAGLAALSGQEGFLESEHFTPNTL HTRYPDVCVWFTLSSSPNICQFLLDSSDWKIVLSPWQIICTI >gi568815582r:84466848_84717914|GENSCAN_predicted_CDS_6|489_bp atgtccgcggcccggcaggcgcacgcccacctgtcggccgcgcaggcgcagcagcgggcc ggcctccccgcgccccgcggcgcgcggccagtgcgcaggcgcggcggccgatgcgagtgt gtatgtgcgggcgagaagatggcggcggcgggggaagcagcgaggcagtcagctgtgtac agaggcccttgtggtcctgtcctgagagtagcggaaggaagacatggctctgcctgggga gactgtgccttatgcagaggccaacaccacatggatgagccggaagcagccgctggcctg gctgctctttcgggtcaggaaggctttcttgagtctgagcacttcacacctaatacccta catacccgctatcctgacgtgtgtgtctggttcactctctcgtcctcccccaatatatgc cagttcctcctggactcctcagactggaagattgtcctttctccttggcaaatcatctgt actatctga