GENSCAN 1.0 Date run: 3-Nov-116 Time: 17:53:01 Sequence gi568815594f:89794972_90053415 : 258444 bp : 35.91% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 223 218 6 1.05 1.04 Term - 17959 17858 102 1 0 99 35 82 0.765 1.30 1.03 Intr - 27417 27275 143 1 2 59 86 176 0.931 13.65 1.02 Intr - 33213 33172 42 1 0 72 115 31 0.547 1.49 1.01 Init - 40696 40576 121 1 1 78 82 135 0.569 12.30 1.00 Prom - 41017 40978 40 -6.65 2.00 Prom + 48615 48654 40 -6.05 2.01 Init + 56353 56529 177 0 0 83 32 144 0.116 7.61 2.02 Intr + 61293 61573 281 2 2 38 65 151 0.097 3.15 2.03 Intr + 62024 62161 138 2 0 47 36 123 0.456 1.66 2.04 Intr + 80879 81046 168 2 0 7 45 159 0.010 1.64 2.05 Intr + 100036 100623 588 1 0 57 69 412 0.001 27.01 2.06 Intr + 114305 114378 74 2 2 60 75 40 0.052 -1.97 2.07 Intr + 128197 128301 105 2 0 125 40 108 0.615 8.97 2.08 Intr + 132824 132997 174 0 0 70 93 134 0.639 11.09 2.09 Intr + 139839 141827 1989 1 0 101 107 1147 0.199 104.27 2.10 Intr + 156634 156780 147 2 0 94 98 137 0.995 14.59 2.11 Term + 158026 158447 422 2 2 106 38 199 0.998 11.17 2.12 PlyA + 158868 158873 6 1.05 3.03 PlyA - 158912 158907 6 1.05 3.02 Term - 167404 167342 63 1 0 117 37 91 0.902 3.81 3.01 Init - 169873 169790 84 1 0 72 115 29 0.921 4.87 3.00 Prom - 169963 169924 40 -5.85 4.06 PlyA - 170626 170621 6 1.05 4.05 Term - 173452 173096 357 1 0 45 43 255 0.876 10.23 4.04 Intr - 173879 173691 189 2 0 123 33 75 0.414 4.36 4.03 Intr - 174829 174624 206 2 2 112 -24 164 0.276 5.60 4.02 Intr - 175029 174960 70 0 1 18 119 50 0.014 -1.06 4.01 Init - 176674 176564 111 1 0 87 76 66 0.015 5.56 4.00 Prom - 185127 185088 40 -3.85 5.06 PlyA - 185357 185352 6 1.05 5.05 Term - 196865 196690 176 0 2 37 55 125 0.375 1.04 5.04 Intr - 206485 206186 300 2 0 -15 73 231 0.038 7.08 5.03 Intr - 238342 238226 117 2 0 41 86 87 0.025 3.32 5.02 Intr - 247307 247262 46 2 1 64 95 40 0.001 -0.64 5.01 Init - 257356 257183 174 1 0 15 66 141 0.410 3.99 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 95065 95166 102 0 0 95 55 120 0.872 6.70 S.002 Init - 176711 176564 148 2 1 24 76 175 0.980 10.20 S.003 Term - 228678 228589 90 0 0 80 49 127 0.946 4.74 S.004 Init - 229641 229621 21 0 0 109 83 8 0.860 2.15 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:89794972_90053415|GENSCAN_predicted_peptide_1|135_aa MDVFMKGLSKAKEGVVAAAEKTKQGVAEAAGKTKEGVLYVGSKTKEGVVHGVATVAEKTK EQVTNVGGAVVTGVTAVAQKTVEGAGSIAAATGFVKKDQLGKAASYQGVRPKSFLLSDNT KPHESPLRGRTRDKL >gi568815594f:89794972_90053415|GENSCAN_predicted_CDS_1|408_bp atggatgtattcatgaaaggactttcaaaggccaaggagggagttgtggctgctgctgag aaaaccaaacagggtgtggcagaagcagcaggaaagacaaaagagggtgttctctatgta ggctccaaaaccaaggagggagtggtgcatggtgtggcaacagtggctgagaagaccaaa gagcaagtgacaaatgttggaggagcagtggtgacgggtgtgacagcagtagcccagaag acagtggagggagcagggagcattgcagcagccactggctttgtcaaaaaggaccagttg ggcaaggctgcatcctaccagggagtaagacccaagtccttcctgctttcagacaacacc aagcctcatgagtccccactcagaggaaggaccagagacaaactctaa >gi568815594f:89794972_90053415|GENSCAN_predicted_peptide_2|1420_aa MAEGKEEQVTSYVDGSRQGERAFAGKLPFLKQSDIMRLTLYHESSTRKTRPHDSTISHQW VQDSGCSTSSMTRRTARHCLTQEVQGVREFPFIAKQSCDRRHLKNRVTPTQILRFSNGLS KRHTRKLYPAPGSEGPMPTEPCSLLAQQSEIELQTDSGVDLQQTPTDLQLRILTVKRITN KQKGHPHQNYKATVTKMARYYWQMRQMKIVKLDELDYRYFASTATDKETTPPSTKKTQCP KPKFHNVQFQVQMYYLWSGGIGLNNSKHSWTIPEDGNSQKTMPSASVPPNKIQSLQILPT TRVMSAEIATTPEARTSEDSLLKSTLPPSETSAPAEGVRNQTLTSTEKAEGVVKLQNLTL PTNASIKFNPGAESVVLSNSTLKFLQSFARKSNEQATSLNTVGGTGGIGGVGGTGGVGNR APRETYLSRGDSSSSQRTDYQKSNFETTRGKNWCAYVHTRLSPTVILDNQVTYVPAQEQQ SLIHTNQAESHTAVGRGVAEQQQQQGCGDPEVMQKMTDQVNYQAMKLTLLQKKIDNISLT VNDVRNTYSSLEGKVSEDKSREFQSLLKGLKSKSINVLIRDIVREQFKIFQNDMQETVAQ LFKTVSSLSEDLESTRQIIQKVNESVVSIAAQQKFVLVQENRPTLTDIVELRNHIVNVRQ EMTLTCEKPIKELEVKQTHLEGALEQEHSRSILYYESLNKTLSKLKEVHEQLLSTEQVSD QKNAPAAESVSNNVTEYMSTLHENIKKQSLMMLQMFEDLHIQESKINNLTVSLEMEKESL RGECEDMLSKCRNDFKFQLKDTEENLHVLNQTLAEVLFPMDNKMDKMSEQLNDLTYDMEI LQPLLEQGASLRQTMTYEQPKEAIVIRKKIENLTSAVNSLNFIIKELTKRHNLLRNEVQG RDDALERRINEYALEMEDGLNKTMTIINNAIDFIQDNYALKETLSTIKDNSEIHHKCTSD METILTFIPQFHRLNDSIQTLVNDNQRYNFVLQVAKTLAGIPRDEKLNQSNFQKMYQMFN ETTSQVRKYQQNMSHLEEKLLLTTKISKNFETRLQDIESKVTQTLIPYYISVKKGSVVTN ERDQALQLQVLNSRFKALEAKSIHLSINFFSLNKTLHEVLTMCHNASTSVSELNATIPKW IKHSLPDIQLLQKGLTEFVEPIIQIKTQAALSNLTCCIDRSLPGSLANVVKSQKQVKSLP KKINALKKPTVNLTTVLIGRTQRNTDNIIYPEEYSSCSRHPCQNGGTCINGRTSFTCACR HPFTGDNCTIKLVEENALAPDFSKGSYRYAPMVAFFASHTYGMTIPGPILFNNLDVNYGA SYTPRTGKFRIPYLGVYVFKYTIESFSAHISGFLVVDGIDKLAFESENINSEIHCDRVLT GDALLELNYGQEVWLRLAKGTIPAKFPPVTTFSGYLLYRT >gi568815594f:89794972_90053415|GENSCAN_predicted_CDS_2|4263_bp atggcagaaggcaaggaggagcaagtcacgtcttatgtggatggcagcaggcaaggggag agagcttttgcagggaaactcccatttttaaaacaatcagatatcatgagacttactctc tatcatgagagcagcacaagaaagacccgaccccatgattcaactatctcccaccagtgg gtgcaggacagtgggtgcagcacatcgagcatgacccgaagaacagcgaggcattgcctc acccaggaagtgcaaggggtgagggaattccctttcatagccaagcaaagctgtgacaga cggcacctgaaaaatcgggtcactcccacccaaatactgcgcttttccaatggtcttagc aagcggcacaccaggaaattatatcctgcgcctggctcagagggtcccatgcccacggag ccttgctcattgctagcacagcagtctgagattgaactgcaaacagattctggagtggac ctccagcaaactccaacagacctgcagctgaggatcctgactgtcaaaaggataactaac aaacagaaaggacatccacaccaaaactacaaggctacagtaaccaaaatggcacggtat tactggcagatgaggcaaatgaagattgtaaagttagatgaacttgattatcgatacttt gccagtactgctacagacaaagagaccacacctccatctaccaaaaaaactcagtgtcct aaaccaaagtttcataatgttcaattccaagttcaaatgtattatttatggagtgggggc attgggcttaacaacagtaagcattcttggactatacctgaggatgggaactctcagaag actatgccttctgcttcagttcctccaaataaaatacaaagtttgcaaatactgccaacc actcgggtcatgtcggcggagatagctacaactccagaggcaagaacttctgaagacagt cttcttaaatcaacactgcctccctcagaaacaagtgcacctgctgagggtgtgagaaat caaactctcacatccacagagaaagcagaaggagtggtcaagttacagaatcttaccctc ccaaccaacgctagcatcaagttcaatcctggagcagaatcagtggtcctttccaattct acactgaaatttcttcagagctttgccagaaagtcaaatgaacaagcaacttctctaaac acagttggaggcactggaggcattggaggcgttggaggcactggaggcgtgggaaatcga gccccacgggaaacatacctcagccggggtgacagcagttccagccaaagaactgactac caaaaatcaaatttcgaaacaactagaggaaagaattggtgtgcttatgtacataccagg ttatctcccacagtgatattggacaaccaggtcacttatgtcccagcccaggaacagcaa agtttgatacacaccaaccaggctgaaagtcatacagctgttggcagaggagtagctgag cagcagcagcagcaaggctgtggtgacccagaagtgatgcaaaaaatgactgatcaggtg aactaccaggcaatgaaactgactcttctgcagaagaagattgacaatatttctttgact gtgaatgatgtaaggaacacttactcctccctagaaggaaaagtcagcgaagataaaagc agagaatttcaatctcttctaaaaggtctaaaatccaaaagcattaatgtactgataaga gacatagtaagagaacaatttaaaatttttcaaaatgacatgcaagagactgtagcacag ctcttcaagactgtatcaagtctatcagaggacctcgaaagcaccaggcaaataattcaa aaagttaatgaatctgtggtttcaatagcagcccagcaaaagtttgttttggtgcaagag aatcggcccactttgactgatatagtggaactaaggaatcacattgtgaatgtaaggcaa gaaatgactcttacatgtgagaagcctattaaagaactagaagtaaagcagactcattta gaaggtgctctagaacaggaacactcaagaagcattctgtattatgaatccctcaataaa actctttctaaattgaaggaagtacatgagcagcttttatcaactgaacaggtatcagac cagaagaatgctccagctgctgagtcagttagcaataatgtcactgagtacatgtctact ttacatgaaaatataaagaagcagagtttgatgatgctgcaaatgtttgaagatttgcac attcaagaaagcaagattaacaatctcaccgtctctttggagatggagaaagagtctctc agaggtgaatgtgaagacatgttatccaaatgcagaaatgattttaaatttcaacttaag gacacagaagagaatttacatgtgttaaatcaaacattggctgaagttctctttccaatg gacaataagatggacaaaatgagtgagcaactaaatgatttgacttatgatatggagatc cttcaacccttgcttgagcagggagcatcactcagacagacaatgacatatgaacaacca aaggaagcaatagtgataaggaaaaagatagaaaatctgactagtgctgtcaatagtcta aattttattatcaaagaacttacaaaaagacacaacttacttagaaatgaagtacagggt cgtgatgatgccttagaaagacgtatcaatgaatatgccttagaaatggaagatggcctc aataagacaatgactattataaataatgctattgatttcattcaagataactatgcccta aaagagactttaagtactattaaggataatagtgagatccatcataaatgtacctccgat atggaaactattttgacatttattcctcagttccaccgtctgaatgattctattcagact ttggtcaatgacaatcagagatataactttgttttgcaagtcgccaagacccttgcaggt attcccagagatgagaaactaaatcagtccaacttccaaaagatgtatcaaatgttcaat gaaaccacttcccaagtgagaaaataccagcaaaatatgagtcatttggaagaaaaacta ctcttaactaccaagatttccaaaaattttgagactcggttgcaagacattgagtctaaa gttacccagacgctcataccttattatatttcagttaaaaaaggcagtgtagttacaaat gagagagatcaggctcttcaactgcaagtattaaattccagatttaaggcgttggaagca aaatctatccatctttcaattaacttcttttcgcttaacaaaactctccacgaagtttta acaatgtgtcacaatgcttctacaagtgtgtcagaactgaatgctaccatccctaagtgg ataaaacattccctgccagatattcaacttcttcagaaaggtctaacagaatttgtggaa ccaataattcaaataaaaactcaagctgccctatctaatttaacttgttgtatagatcga tcgttgcctggtagtctggcaaatgttgtcaagtctcagaagcaagtaaaatcattgcca aagaaaattaacgcacttaagaaaccaacggtaaatcttaccacagtcctgataggccgg actcaaagaaacacggacaacataatatatcctgaggagtattcaagctgtagtcggcat ccgtgccaaaatgggggcacgtgcataaatggaagaactagctttacctgtgcctgcaga catccttttactggtgacaactgcactatcaagcttgtggaagaaaatgctttagctcca gatttttccaaaggatcttacagatatgcacccatggtggcattttttgcatctcatacg tatggaatgactatacctggtcctatcctgtttaataacttggatgtcaattatggagct tcatataccccaagaactggaaaatttagaattccgtatcttggagtatatgttttcaag tacaccatcgagtcatttagtgctcatatttctggatttttagtggttgatggaatagac aagcttgcatttgagtctgaaaatattaacagtgaaatacactgtgatagggttttaact ggggatgccttattagaattaaattatgggcaggaagtctggttacgacttgcaaaagga acaattccagccaagtttccccctgttactacatttagtggctatttattatatcgtaca taa >gi568815594f:89794972_90053415|GENSCAN_predicted_peptide_3|48_aa MLLTQSLFGGLFTRTRMKDTTERKFMKMLTGHKDEDEDLCDDPLSLNE >gi568815594f:89794972_90053415|GENSCAN_predicted_CDS_3|147_bp atgttgctcacacaaagcctgtttggtggtctcttcacacggacgcgcatgaaagatacc actgaaagaaaatttatgaaaatgcttactggacataaagatgaggatgaagacctttgt gatgatccactttcacttaatgaataa >gi568815594f:89794972_90053415|GENSCAN_predicted_peptide_4|310_aa MEEKEERKRRRKRKRRREEGRDGERKRMEKQKENQDPKDPPTTSGPQTDQPKKHLTNFKS ETKETRFIRGPKTPAPVTDWEGSLPLVFNHSRDTSLIIHPGFRGVRPRRDACLGPSPLAA SPTFLGKGPAAPRQTELGPNSSSASAPPPYNPFIASPPHTWSGLQFPSMTSPPPPAQQFT LKKVAGAKGIVKDLINLTFKVYNNRKKLQFLASTVRQTPATSPAHKNFQTPELQQPGVPP EPPPRGACYKFQKSGHRAKECLQPRIPPKPHPICVGPHWKSDCPTHLAATPRAPGTLAQG SLTPSQIFLA >gi568815594f:89794972_90053415|GENSCAN_predicted_CDS_4|933_bp atggaggagaaggaggagaggaaaaggagaaggaagaggaagaggaggagggaagaggga agagatggggaaagaaaaagaatggagaagcaaaaggagaaccaggatcctaaagatcca cctacgacctcaggtcctcagaccgaccagcccaagaaacatctcaccaatttcaaatcc gagacaaaggagacacgttttatccgtggacccaaaactccggcgccagtcacggactgg gaaggcagccttcccttggtgtttaatcattccagggacacctctctgattattcaccca ggcttcagaggtgtcagaccacgcagggacgcctgccttggtccttcacccttagcggca agtcccacttttctggggaaggggccagctgctcctcgccagaccgagctaggtcccaat tcttcctcagcctccgctcctccaccctataatccttttatcgcctcccctcctcacacc tggtccggcttacagtttccttccatgactagccctccccctcctgcccagcaatttact cttaaaaaggtggctggagctaaaggcatagtcaaggatttaattaacctcaccttcaag gtgtacaataatagaaaaaagttgcagttccttgcctccactgtgagacaaaccccagcc acatctccagcacacaagaacttccaaacgcctgaactgcagcagccaggtgttcctcca gaacctcctccccgaggagcttgctacaagttccagaaatctggccaccgggccaaggaa tgcctgcagcccaggattcctcctaagccacatcccatctgtgtgggaccccactggaaa tcggactgtccaactcacctggcagccactcccagagcccctggaactctggcccaaggc tctctgactccttcccagatcttcttggcttag >gi568815594f:89794972_90053415|GENSCAN_predicted_peptide_5|270_aa MQLNIASEQPHITDLARYMAAVNTIRSEIQSPKAKEIGASAPGLDNALLRADFRYLGQGV GRQSGLQAKTITDAPKTLDDRNNECLLYVNEKTGGWGLLDSFRMGIGHQKDQVFLIKKKE RKRKDEERKEERKEHGEEGRKSQSESERGILPLCFNAKTKPSSVLDRRSRAIICGFVGTS HVSSSSQGAVVIVMVVEAAVGSGTGRTDGEVLWSSETDMAVKVSEFKQGTKIFIPPHRSV ISCVLLLEWEHELGQGDYLHLSQFLKKASS >gi568815594f:89794972_90053415|GENSCAN_predicted_CDS_5|813_bp atgcaattaaatatagcatcagaacagccacatattactgaccttgcacgatacatggca gccgtgaataccattagaagtgagatccaatccccaaaggccaaggaaattggagcctca gctccagggttggataatgccctactgagggctgactttcgttacctgggtcagggagta ggaaggcagtcaggccttcaggcaaaaacaataacagatgctcctaaaaccttggacgat cggaataatgaatgtcttttgtatgttaatgagaagactggtggctggggactcctagat agcttcaggatggggattggtcaccagaaagaccaagttttcttgataaagaagaaagaa aggaaaagaaaggatgaggaaaggaaggaagaaaggaaggaacatggagaggaaggaagg aaatctcagtctgaatctgaaagaggcattttgcctttgtgcttcaatgccaaaaccaaa ccatcaagtgttcttgacaggaggtctagggctataatttgtggttttgttggcaccagc cacgtcagcagcagcagccaaggagcagtggtgatagtgatggtagtagaagcggcagtg ggaagtggtactggtagaactgatggggaagttttatggagctctgagactgatatggct gtcaaagtttctgagtttaagcaaggaaccaagatctttatacctccacatcgatcagtc attagttgtgtactgctcctagaatgggagcatgaacttgggcaaggagactatctccac ctgagccaatttctgaagaaggccagtagctga