GENSCAN 1.0 Date run: 8-Nov-116 Time: 07:29:34 Sequence gi568815574f:2742164_2966891 : 224728 bp : 40.77% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 927 966 40 -2.75 1.01 Init + 10112 10172 61 1 1 91 94 44 0.181 4.77 1.02 Intr + 11743 11859 117 2 0 80 110 55 0.195 6.42 1.03 Intr + 14748 14858 111 1 0 52 54 142 0.632 6.63 1.04 Intr + 17074 17182 109 2 1 21 47 103 0.016 -2.08 1.05 Intr + 20150 20326 177 2 0 78 105 75 0.056 6.31 1.06 Intr + 26195 26328 134 2 2 61 80 128 0.711 8.67 1.07 Intr + 28387 28428 42 2 0 99 119 81 0.980 9.59 1.08 Intr + 29098 29216 119 2 2 78 68 3 0.934 -3.44 1.09 Term + 29236 29358 123 0 0 107 54 108 0.968 6.70 1.10 PlyA + 29473 29478 6 1.05 2.00 Prom + 30558 30597 40 -8.55 2.01 Init + 31131 31717 587 2 2 57 23 319 0.369 17.02 2.02 Intr + 32553 32576 24 0 0 99 109 23 0.164 1.72 2.03 Term + 37935 38112 178 0 1 5 48 124 0.099 -3.72 2.04 PlyA + 39766 39771 6 1.05 3.03 PlyA - 40674 40669 6 1.05 3.02 Term - 45411 44826 586 2 1 30 48 738 0.112 57.00 3.01 Init - 48049 47895 155 1 2 69 20 189 0.103 9.70 3.00 Prom - 63885 63846 40 -4.45 4.03 PlyA - 64253 64248 6 1.05 4.02 Term - 64983 64282 702 0 0 15 48 188 0.381 0.03 4.01 Init - 66153 65749 405 0 0 78 80 272 0.485 22.04 4.00 Prom - 66227 66188 40 -9.75 5.03 PlyA - 66261 66256 6 1.05 5.02 Term - 68514 68242 273 0 0 69 37 238 0.617 11.29 5.01 Init - 71500 71495 6 1 0 78 90 0 0.200 0.13 5.00 Prom - 81350 81311 40 -3.55 6.00 Prom + 83751 83790 40 -7.25 6.01 Sngl + 88142 88480 339 1 0 78 41 267 0.736 16.78 6.02 PlyA + 88531 88536 6 1.05 7.00 Prom + 97378 97417 40 -4.15 7.01 Init + 99537 99665 129 2 0 38 94 124 0.359 6.75 7.02 Intr + 100002 100079 78 2 0 117 59 69 0.551 5.73 7.03 Intr + 101914 102094 181 0 1 87 82 134 0.508 11.22 7.04 Intr + 103483 103580 98 2 2 69 95 142 0.901 11.81 7.05 Intr + 112437 112608 172 2 1 67 111 206 0.984 19.39 7.06 Intr + 122925 123082 158 1 2 136 64 184 0.972 19.61 7.07 Term + 124630 124731 102 0 0 124 32 73 0.859 2.60 7.08 PlyA + 124767 124772 6 1.05 8.03 PlyA - 125971 125966 6 1.05 8.02 Term - 126207 125987 221 1 2 116 49 130 0.022 8.12 8.01 Init - 135202 135073 130 1 1 26 82 161 0.589 9.66 8.00 Prom - 139992 139953 40 -8.25 9.04 PlyA - 140756 140751 6 1.05 9.03 Term - 141244 141097 148 0 1 98 48 94 0.544 2.79 9.02 Intr - 143946 143714 233 0 2 76 38 183 0.591 7.75 9.01 Init - 153209 153135 75 2 0 93 91 33 0.432 5.24 9.00 Prom - 176127 176088 40 -3.65 10.04 PlyA - 176891 176886 6 1.05 10.03 Term - 179663 179505 159 2 0 80 37 114 0.818 2.46 10.02 Intr - 187313 187201 113 0 2 2 47 196 0.941 6.08 10.01 Init - 188993 188903 91 2 1 70 57 96 0.329 5.40 10.00 Prom - 192318 192279 40 -6.35 11.00 Prom + 192372 192411 40 -10.05 11.01 Init + 193683 194106 424 2 1 110 38 229 0.012 14.90 11.02 Intr + 218911 219483 573 2 0 53 86 697 0.083 57.78 11.03 Term + 220206 220348 143 1 2 74 43 59 0.864 -2.89 11.04 PlyA + 220573 220578 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 125549 126229 681 1 0 79 50 271 0.943 16.43 S.002 Init + 218928 219483 556 2 1 63 86 672 0.810 59.68 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815574f:2742164_2966891|GENSCAN_predicted_peptide_1|330_aa MESWWGLPCLAFLCFLMHARVQCSINYVRYATPHYKVDCVLDDLSIRWLMKSVLSTLKAA LPSARGLSVSQLSAVLVPGFLSSIQEESGHTNQLKDVHRTAPMTESYAAPSVISAVVENL LLVITSNPAKGDRASICNCIIWSLGSQERGQLVDLLGRGWAAWMLRPWRVHGFRGADHAP GRPAQLWCLQPRCLDWNCWQPWRGDVAQIQEADTVPVPQPSLSSLDHDDQERWRRPGQRD FDLADALDDPVHIPHCPIPRSQSMASNLPSQSTLHSVLPSTSLVTFTDGAGTEDLESSHS VFSSVQVQLEPFWLTRMLKICVLGPAESPA >gi568815574f:2742164_2966891|GENSCAN_predicted_CDS_1|993_bp atggagagctggtggggacttccctgtcttgcgttcctgtgttttctaatgcacgcccga gtacaatgttcaataaattatgtgaggtatgcaacacctcattataaagttgactgtgtg ctagatgatttgtccatccgttggctgatgaaaagtgttctgagcactctgaaggcagct ctgccatccgcacgtggcttaagtgttagccagctcagtgccgtcttggtacccgggttc ttgtccagcatccaggaagaatcaggtcacacgaaccagctgaaggatgtgcacaggaca gcacccatgacagagagttacgcagccccaagtgtcattagtgctgtagtggagaacctg ctgttagttataacttctaaccctgcgaaaggggacagagcttcaatctgcaactgcata atatggtcactgggaagtcaagaacggggccagttggttgaccttctggggagaggctgg gctgcttggatgctgagaccgtggagggttcatgggttccgaggagcagatcatgcacca gggcggcctgctcagctctggtgtctgcagccccggtgtctcgactggaactgctggcag ccttggaggggagatgttgcacagatccaagaggctgacacggttcctgtcccccagccc agtctgagcagcttggaccatgatgatcaagagcgctggaggaggccaggtcaaagagac tttgatttggcagatgcccttgatgaccctgttcatatccctcattgtcccataccccga tcccaatccatggcctccaatctcccctcccagagcacacttcattccgttctcccctca acttctctggttaccttcactgacggggctggcactgaggacctggaatcttcccattct gtgttttcatcagtccaagttcagctagaacctttctggctgaccagaatgctgaaaata tgtgtcctgggccctgcagaaagtcctgcttga >gi568815574f:2742164_2966891|GENSCAN_predicted_peptide_2|262_aa MKEEGKKGGGKERKKNRRRKEERKRIREGRREGRREGEEKGWRGGRKKGGWGGRKEEGGK GEREGRKEGEKEGRREGRKERRKEGEKEGRREGRKERRKEGEKEGRREGRKERRKEGAKE GRREGRKERRKEGEKEGRREGRKERRKEGEKEGRSEGRKERRKEGEKEGRREGRKERRKE GEKEGRKKGGQEGRGRTHQEAKLRLVGSIDAEILVIEHKLCIDSSYQNRRMVKKRKEKKR KRKWRRETMRRLKEEGYDHSIR >gi568815574f:2742164_2966891|GENSCAN_predicted_CDS_2|789_bp atgaaggaggaagggaagaaaggaggggggaaggaaagaaagaaaaataggagaagaaag gaagaaagaaaaagaataagggaaggacggagggaaggaaggagagagggagaggagaaa ggttggaggggaggaaggaagaaaggagggtggggaggaaggaaagaagaagggggaaaa ggagagagggaaggaaggaaggaaggagagaaggaaggaaggagagaaggaaggaaggag agaaggaaggaaggagagaaggaaggaaggagagaaggaaggaaggagagaaggaaggaa ggagagaaggaaggaaggagagaaggaaggaaggagagaaggaaggaaggagcgaaggaa ggaaggagagaaggaaggaaggagagaaggaaggaaggagagaaggaaggaaggagagaa ggaaggaaggagagaaggaaggaaggagagaaggaaggaaggagcgaaggaaggaaggag cgaaggaaggaaggagagaaggaaggaaggagagaaggaaggaaggagagaaggaaggaa ggagagaaggaaggaaggaagaagggagggcaggaagggagagggagaacccaccaagaa gccaaactcagactggttggatctatagatgcagaaattctggttatagagcacaaactg tgtatagatagctcctaccaaaacagacggatggtgaaaaaaagaaaagaaaagaaaaga aaaagaaaatggaggagagagactatgaggcgactaaaagaagagggatatgatcattcc attcgctga >gi568815574f:2742164_2966891|GENSCAN_predicted_peptide_3|246_aa MPYRVKRLKLVVVPRKRSKSRSRGSSSWLSLPEPRQGSGFPIDTSSCSVTITVFNSDDYS PAVQENIPALRRSSSFLCTESCNSKYQCETGENSKGNVQDRVKRPMNAFIVWSRDQRRKM ALENPRMRNSEISKQLGYQWKMLTEAEKWPFFQEAQKLQAMHREKYPNYKYRPRRKAKML PKNCSLLPADPASVLCSEVQLDNRLYRDDCTKATHSRMEHQLGHLPPINAASSPQQRDRY SHWTKL >gi568815574f:2742164_2966891|GENSCAN_predicted_CDS_3|741_bp atgccctacagggtgaagcggctgaagctggtagtggttccgaggaagcggtcaaagtcc cgctccagaggttcctcttcttggttgtcactcccggaaccccgccaggggtctggcttc cccatcgacacctcctcctgttcagtcaccatcaccgtattcaacagcgatgattacagt ccagctgtgcaagagaatattcccgctctccggagaagctcttccttcctttgcactgaa agctgtaactctaagtatcagtgtgaaacgggagaaaacagtaaaggcaacgtccaggat agagtgaagcgacccatgaacgcattcatcgtgtggtctcgcgatcagaggcgcaagatg gctctagagaatcccagaatgcgaaactcagagatcagcaagcagctgggataccagtgg aaaatgcttactgaagccgaaaaatggccattcttccaggaggcacagaaattacaggcc atgcacagagagaaatacccgaattataagtatcgacctcgtcggaaggcgaagatgctg ccgaagaattgcagtttgcttcccgcagatcccgcttcggtactctgcagcgaagtgcaa ctggacaacaggttgtacagggatgactgtacgaaagccacacactcaagaatggagcac cagctaggccacttaccgcccatcaacgcagccagctcaccgcagcaacgggaccgctac agccactggacaaagctgtag >gi568815574f:2742164_2966891|GENSCAN_predicted_peptide_4|368_aa MDKFLDTYTLPRLNQEEVESLNRPITGSEIEALINSLPTKTSPELDGFTAEFYQWYKEEL IPLLLKLFQSAEKEGILPNSFYKASIILIPKPGRVTIKKENFRPISLVNIDAGILKKILA NQIQQHIQKLIHHDQVIYRFNAIPMKLPMAFFTELEKTTLNSTWNQKRACIAKTILSKKN KAGGIMLPDFKLYYKATITKTAWYWYQNREIDQWNRTEASEIIPHIYNHLIFEKPEKNKK WGKDSLFNKWCWETWLAICRKLKLPPFLTPYTKINSRWIKDLNVRPKTIKTVEENKGNTT QDIGMGKDFMSKTRKAMATKAKIDKWDLIKLKSFCTAKETTIGVNKLATEWEKIFGIYPS DKGLISRI >gi568815574f:2742164_2966891|GENSCAN_predicted_CDS_4|1107_bp atggacaaattcctggacacctacactctcccaagactaaaccaggaagaagttgaatcc ctgaatagaccaataacaggctctgaaattgaggcattaattaatagcctaccaaccaaa acaagtccagaactagatggattcacagctgaattctaccagtggtacaaagaggagctg ataccattacttctgaaactattccaatcagcagaaaaagagggaatcctccctaattca ttttacaaggccagcatcatcctgataccaaagcctggcagagtcacaataaaaaaagag aattttagaccaatatccctggtgaacattgatgcaggaatcctcaagaaaatactggca aaccaaatccagcagcacatccaaaagcttatccatcatgatcaggtaatttatagattc aatgccatccccatgaagctaccaatggctttcttcacagaattggaaaaaactacttta aactccacatggaaccaaaaaagagcctgcattgccaagacaatcctaagcaagaagaac aaagctggaggcatcatgctgcctgacttcaaactatactacaaggctacaataaccaaa acagcatggtactggtaccaaaacagagagatagaccaatggaacagaacagaggcctca gaaataataccacacatctacaaccatctgatctttgagaaacctgagaaaaacaagaaa tggggaaaggattccctatttaataaatggtgctgggaaacctggctagccatatgtaga aagctgaaactgcctcccttccttacaccttatacaaaaattaattcaagatggattaaa gacttaaatgttagacctaaaaccataaaaactgtagaagaaaacaaaggcaataccact caagacataggcatgggcaaagacttcatgagtaaaacacgaaaagcaatggcaacaaaa gccaaaatagacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaaact accatcggagtgaacaagctagctacagaatgggagaaaatttttggaatctacccatct gacaaagggctaatatccagaatctga >gi568815574f:2742164_2966891|GENSCAN_predicted_peptide_5|92_aa MEDCSSLPVMEQSCTENDFDELTEVGFRRLVITNFSELKEDIRTHRKEAKNLEKRLDKWL TRTNSVENSLNDLKELKTMARKICDTCTSFSS >gi568815574f:2742164_2966891|GENSCAN_predicted_CDS_5|279_bp atggaggattgcagctccttaccagtgatggaacaaagctgcacagagaatgactttgat gagttgacagaagtaggcttcagaaggttagtaataacaaacttctctgagctaaaggag gatattcgaacccatcgcaaggaagctaaaaaccttgaaaaaagattagacaaatggcta actagaacaaacagtgtagagaatagcttaaatgacctgaaggagctaaaaaccatggca cgaaaaatatgtgacacatgcacaagcttcagtagctga >gi568815574f:2742164_2966891|GENSCAN_predicted_peptide_6|112_aa MGRNQRKKAENSEKQNASSPPKEHNSSSSREEKWVENEFDELTEVGFRRWVITNSSELKE HVLTQCKETKNLEKRLEELLARITSLEKNINDPMELKNTAQELCEAYTSITS >gi568815574f:2742164_2966891|GENSCAN_predicted_CDS_6|339_bp atggggagaaaccagcgcaaaaaggctgaaaattccgaaaagcagaatgcctcttctcct ccaaaggaacacaactcctcatcctcaagggaagaaaaatgggtggagaatgagtttgat gaattgacagaagtaggtttcagaaggtgggtaataacaaactcctctgaactaaaggag catgttctaacccaatgcaaggaaactaaaaacctcgaaaaaaggttagaggaattgcta gctagaataaccagtttagagaagaacataaatgacccgatggagctgaaaaacacagcc caagaactctgtgaagcatacacaagtatcactagctga >gi568815574f:2742164_2966891|GENSCAN_predicted_peptide_7|305_aa MFHFSVAPTWWVGMLLVLLQETRFGEGRSEKSVSRCARSGGRHARGPKKHLKRVAAPKHW MLDKLTGVFAPRPSTGPHKLRECLPLIVFLRNRLKYALTGDEVKKICMQRFIKIDGKVRV DVTYPAGFMDVISIEKTGEHFRLVYDTKGRFAVHRITVEEAKYKLCKVRKITVGVKGIPH LVTHDARTIRYPDPVIKVNDTVQIDLGTGKIINFIKFDTGNLCMVIGGANLGRVGVITNR ERHPGSFDVVHVKDANGNSFATRLSNIFVIGNGNKPWISLPRGKGIRLTVAEERDKRLAT KQSSG >gi568815574f:2742164_2966891|GENSCAN_predicted_CDS_7|918_bp atgtttcacttttcggtcgcccctacttggtgggtgggaatgctgctggtgctgcttcag gaaacccgatttggagaggggaggtctgagaagagtgtgtctcgttgtgctcggtctggg ggccgtcacgcccggggccccaagaagcacttaaagcgtgttgcagcgccgaagcattgg atgcttgacaaactaacgggtgtatttgcacctcgtccatcgacaggtccccacaagctg agggaatgtcttcctctgatcgtcttcctcaggaatagactcaagtatgcgttgactgga gatgaggtaaagaagatatgtatgcaacgtttcatcaaaattgatggcaaggttcgagtg gatgtcacataccctgctggattcatggatgtcatcagcatcgagaagacaggtgaacat ttccgcctggtctatgacaccaagggccgttttgctgttcaccgcatcacagtggaagag gcaaagtacaagttgtgcaaagtgaggaagattactgtgggagtgaagggaatccctcac ctggtgactcatgatgctcgaaccatccgctacccagatcctgtcatcaaggtgaacgat actgtgcagattgatttagggactggcaagataatcaactttatcaaatttgatacaggc aatttgtgtatggtgattggtggagccaacctcggtcgtgttggtgtgatcaccaacagg gaaagacatcctggttcttttgatgtggtgcatgtgaaggatgccaatggcaacagcttt gccacgaggctttccaacatttttgtcattggcaatggcaataaaccttggatttccctg cccaggggaaagggcattcgacttactgttgctgaagagagagataagaggctggccacc aaacagagcagtggctaa >gi568815574f:2742164_2966891|GENSCAN_predicted_peptide_8|116_aa MTAGDHYHQDAETGSGPECQATLIFIGYKTKEQGKEYEPSPMIGSGTCQRRALGPQNNLP DVTPAPNTISALCNLGDLGRHPSSPRIRLSAPTALSWLPEPAPILAEEWSCPMGLL >gi568815574f:2742164_2966891|GENSCAN_predicted_CDS_8|351_bp atgacagcaggggaccactaccaccaagatgcggagactggtagtggccccgaatgccag gctacactgatatttattggatacaagacaaaggagcagggtaaggagtatgagccatct ccaatgatagggtcgggaacctgtcagaggcgggcactggggccacagaacaacttgcca gatgtcacccctgctcccaataccatctctgcactctgcaaccttggggacctgggaagg cacccctcgtcccccaggatcaggctgtctgctcccactgccctctcctggctcccagag cctgctccaatcctggctgaggagtggagctgccccatgggtctcctctga >gi568815574f:2742164_2966891|GENSCAN_predicted_peptide_9|151_aa MAHSHEANKSPENNPKEKVRPQTYLCAKAAKNAWATIPKGGVPVQAFLHIMQRSQEPYLQ FLARLQEAVKHQIPHTEGAEELTLTLAFENANTDCKHALAPARKLWKIAESDQFKSISWD ENGTCRVINEELFKKEILEERLFTEYWKLTV >gi568815574f:2742164_2966891|GENSCAN_predicted_CDS_9|456_bp atggctcattcacatgaagcaaataaatctccagaaaacaaccctaaagaaaaagtaagg cctcagacttacttgtgtgcaaaggctgctaaaaatgcctgggccacaattcctaaagga ggagtcccagtacaagcctttttacatatcatgcaaaggtcacaggagccctatttgcaa tttcttgcaagattacaagaggcagtgaagcatcagattcctcatactgagggtgcagaa gagctaaccttaactctagcttttgagaatgcaaacacggattgtaaacatgcactagca ccagcaaggaaactttggaaaatagctgaaagtgaccaattcaagtctatttcgtgggat gagaatggaacttgcagagtgattaatgaagaactcttcaagaaagaaattttggaagaa aggctctttacagaatattggaaactgacagtatga >gi568815574f:2742164_2966891|GENSCAN_predicted_peptide_10|120_aa MRDCISDIRSSYMADIDNKEESDLGQDLGLVLHFTGASVEDDDDYDEESEEMDDKEETKR DEKNYPGWFCSDLSYFLSSASFRITSLEKNINDLMKLKNTAQEFREAYTSTNSQIKQAEE >gi568815574f:2742164_2966891|GENSCAN_predicted_CDS_10|363_bp atgagggattgcatttcagatattcggagttcttacatggcagacattgacaacaaagaa gagtctgatcttggacaagatttggggctggtgttacactttactggagcaagtgttgaa gatgatgatgattatgatgaagaaagtgaagaaatggatgataaagaggaaacaaaaaga gatgagaaaaattatccaggctggttctgctcagatcttagttatttcttgtcttctgct agctttagaataaccagtttagagaagaacataaatgacctgatgaagctgaaaaacaca gcacaagaatttcgtgaagcatacacaagtaccaatagccaaatcaaacaagcagaagaa tag >gi568815574f:2742164_2966891|GENSCAN_predicted_peptide_11|379_aa MAAARRLLLGSIESVVGRRSRHLEGGIWKKNSVMILEAGMGKRTAAAAAAQPPARRRVCA TGRARSLGGPCLCRHMRAYVPDACGSATFRPSPHSPSLAGCSVGYALGDMPQASVRAPGS FRPSALPALCPSESLQPPVSAGADATHMDGDQIVVEIQEAVFVSNIVDSDITVHNFVPDD PDSVVIQDVVEDVVIEEDVQCSDILEEADVSENVIIPEQVLDSDVTEEVSLPHCTVPDDV LASDITSTSMSMPEHVLTSESMHVCDIGHVEHMVHDSVVEAEIITDPLTSDIVSEEVLVA DCAPEAVIDASGISVDQQDNDKASCEDYLMISCIFSTLIWLQLTESLCHLPADLNNLVIL IDQFITTVFGMKFYSVDLV >gi568815574f:2742164_2966891|GENSCAN_predicted_CDS_11|1140_bp atggcggctgctaggcgcctgctgctggggagtattgagagtgttgtcgggaggcggagc cgccatcttgaaggcggtatctggaaaaaaaattcggttatgatccttgaggcggggatg gggaaaaggacggcggcggcggcggcagcgcagcctccggcgcgacggcgtgtctgcgca acagggcgtgctcgttcccttggcggcccttgcctttgtcgccatatgcgcgcgtacgtt ccagacgcctgcggcagcgccacctttcggccttcccctcacagcccatccttggctggg tgcagtgtcggctacgctttaggtgacatgccgcaggcgtccgttcgggcgccggggtca tttcgcccctcagcgctcccggctctgtgcccttccgagagtctacagccacccgtttca gcaggagctgatgctacacacatggatggtgatcagattgttgtggaaatacaagaagca gtttttgtttctaatattgtggattctgacataactgtgcataactttgttcctgatgac ccagactcagttgtaatccaagatgttgttgaagatgttgtcatagaggaggatgttcag tgctcagatatcttagaagaggcagatgtatctgaaaatgtcatcattcctgagcaagtg ctggactcagatgtaactgaagaagtttctttaccacactgcacagtcccagatgatgtt ttagcttctgacattacttcaacctcaatgtctatgccagaacatgttttaacgagtgaa tccatgcatgtgtgtgacattggacatgttgaacatatggtgcatgatagtgtagtggaa gcagaaatcattactgatcctctgacgagtgacatagtttcagaagaagtattggtagca gactgtgcccctgaagcagtcatagatgccagcgggatctcagtggaccagcaagataat gacaaagccagctgtgaggactacctaatgatttcgtgtatattttctacgttaatatgg ttacaacttacggaatcattgtgtcatttacctgctgatttaaataatttagttatccta attgaccagttcattactacggtctttggaatgaagttttatagtgtagatttagtttag