GENSCAN 1.0 Date run: 4-Nov-116 Time: 13:53:39

Sequence gi568815588f:47606203_47806755 : 200553 bp : 40.56% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -    222    217    6                               1.05
 1.03 Term -  10475  10255  221  0  2   23   42   173 0.351   2.42
 1.02 Intr -  16074  15919  156  1  0   79   30    95 0.536   1.86
 1.01 Init -  18766  18664  103  1  1   35   90    62 0.486   1.55
 1.00 Prom -  28871  28832   40                              -3.25

 2.00 Prom +  32085  32124   40                              -5.65
 2.01 Init +  56153  56193   41  1  2   85   67    47 0.500   2.01
 2.02 Intr +  60220  60352  133  1  1   53   34   106 0.101   1.43
 2.03 Intr +  63044  63119   76  1  1   25   81   100 0.077   1.17
 2.04 Intr +  66345  66479  135  1  0   62   24   110 0.698   1.62
 2.05 Term +  67083  67429  347  1  2   14   38   251 0.583   6.37
 2.06 PlyA +  70265  70270    6                               1.05

 3.00 Prom +  81595  81634   40                              -6.05
 3.01 Sngl +  84318  84809  492  2  0  101   37   371 0.955  29.10
 3.02 PlyA +  84870  84875    6                               1.05

 4.00 Prom +  90833  90872   40                              -6.15
 4.01 Init + 100001 100144  144  1  0  104  -10   180 0.997   9.97
 4.02 Intr + 100217 100547  331  1  1   72  -11   286 0.062  11.38
 4.03 Intr + 104380 104514  135  2  0   82   94    66 0.019   6.22
 4.04 Intr + 109449 109655  207  1  0   95  110    17 0.050   2.73
 4.05 Intr + 113631 113859  229  1  1  103   82    24 0.204  -0.69
 4.06 Intr + 120639 120754  116  0  2   72   58    89 0.216   3.47
 4.07 Term + 124035 124234  200  1  2   79   54   194 0.576  11.68
 4.08 PlyA + 124296 124301    6                               1.05

 5.20 PlyA - 124779 124774    6                               1.05
 5.19 Term - 137851 137719  133  0  1  121   54    79 0.188   4.38
 5.18 Intr - 146020 145946   75  0  0   64  100   100 0.042   6.51
 5.17 Intr - 147691 147605   87  0  0   72   59    94 0.032   3.07
 5.16 Intr - 148057 147837  221  1  2   17   86   266 0.017  15.48
 5.15 Intr - 150020 149773  248  0  2   99   70   256 0.537  21.16
 5.14 Intr - 151126 151006  121  1  1   61   70   160 0.977  10.65
 5.13 Intr - 151889 151719  171  2  0  129   64   119 0.985  12.82
 5.12 Intr - 153725 153379  347  0  2   90   56   490 0.546  40.19
 5.11 Intr - 154814 154585  230  1  2  106   97   217 0.998  20.99
 5.10 Intr - 155443 155239  205  2  1   76   94   201 0.996  16.84
 5.09 Intr - 156567 156432  136  0  1   76  102    51 0.655   4.52
 5.08 Intr - 157240 156693  548  2  2   19   24   301 0.297   8.47
 5.07 Intr - 158023 157771  253  1  1   82   71    92 0.611   2.98
 5.06 Intr - 159460 159215  246  1  0   19   75   238 0.954  12.33
 5.05 Intr - 160797 160410  388  2  1   55   50   171 0.214   3.87
 5.04 Intr - 168572 168433  140  2  2   45  103   108 0.190   6.34
 5.03 Intr - 169553 169409  145  1  1  116   89    -7 0.193   1.56
 5.02 Intr - 170452 170293  160  2  1   42   59   111 0.633   1.72
 5.01 Init - 171533 171389  145  2  1   63   57    81 0.532   2.83

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  80628  80592   37  0  1   55   86    45 0.879   1.22
S.002 Init +  89283  89340   58  2  1   64   75    69 0.864   4.72
S.003 Term + 100217 100600  384  1  0   72   49   310 0.937  19.40

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588f:47606203_47806755|GENSCAN_predicted_peptide_1|159_aa
MLAFMQASKGLIRIFVISITKNAEVLTPQGVRLQGQGAWIICEHVYKEDKAGSTMGSLAC
TMHTIVYVNAASQNHVVYNLPFGNGMALHPSCNGTIHDETVLGKFDRCPQEFQDEAKHGQ
LLPASCFQNSEGYMLIEAAFDGKMEKEKRCQILLPTLDE

>gi568815588f:47606203_47806755|GENSCAN_predicted_CDS_1|480_bp
atgttggcttttatgcaagcaagcaagggcctaattcgaatttttgttatctcaatcact
aaaaatgctgaggtattgacacctcaaggggtccgattacaaggtcagggagcatggatc
atctgtgagcatgtctacaaggaggacaaagcgggtagtactatggggtctcttgcatgt
accatgcacactatagtctatgtgaatgctgcttcccagaatcatgtagtgtacaatctt
ccctttggcaatgggatggcattgcacccatcctgcaatggaaccatacatgatgagaca
gttttgggaaagtttgataggtgtcctcaagagtttcaagatgaagctaaacatggccag
ttgttgcctgcttcttgctttcaaaattctgaaggctacatgctcattgaggcagctttt
gatggcaagatggagaaagagaaaagatgccagattcttcttcccactttagatgagtga

>gi568815588f:47606203_47806755|GENSCAN_predicted_peptide_2|243_aa
MGQIPHGIVLSSQYFTTVLREECCGYYTHLTDQGAEAQKPKDFCCQCHAFGWSYDVVQCS
LHTCFYRRYLLPVPEPFEDSVDEAGAILTNKKNKIPALVEETVKKWDKYISDGNEDNGRR
VEQDEGMKGQPPLEGRIAEKNPAEALQAFSSGMLQAGEELEITLTPHSGSPCFQPPRTGD
RAEEVSETALIHFRPSQNVRKLPSRSLAGEEEQGCTRAPELQKARCATGEQGESSKPNSD
RGA

>gi568815588f:47606203_47806755|GENSCAN_predicted_CDS_2|732_bp
atggggcagatccctcatggcatagtgttgtcctcgcagtactttacaacagtcctaaga
gaggaatgctgtggttattacacccacttgacagaccagggagcagaagcacagaaaccc
aaggatttctgctgtcaatgtcatgcatttggctggagttatgatgtcgtgcaatgctcc
cttcacacctgtttctacaggcgctatctgttaccagttcctgagccttttgaagattcc
gtggacgaagctggggctatactgacaaataagaaaaataaaattcctgcccttgtggag
gagacagttaaaaaatgggacaaatacatttcagatggaaatgaggacaatggaagaaga
gtagagcaggatgaaggcatgaaggggcaaccacccttagaaggcaggattgctgagaaa
aatccagcagaggctctccaggctttcagttctggaatgctgcaggcaggggaagagcta
gaaataactctcaccccacactctggcagtccatgttttcagccaccaaggactggggac
agggcagaagaggtgagtgaaactgctctaattcatttcaggccttcacagaatgtaagg
aagctgccctccagaagcctggctggggaggaggagcagggctgcacgagggcacctgag
cttcagaaagcaaggtgtgcgactggagagcaaggagaatctagcaaaccaaactctgac
cgtggcgcttaa

>gi568815588f:47606203_47806755|GENSCAN_predicted_peptide_3|163_aa
METEPKASYGEIRIPEENSIQLDGFTEAYESGQNQAYSLELFSPVCPKTENSRIHINSDK
GLEEHTGSQELFSSEDELPPNEIRIELCSSGILCSQLNTFHKSAIKRSCTSEDKVAQSEA
LSRVLQVAKKMKLISNGGDSAVEMDRRNVSEFKSIKKIINKKL

>gi568815588f:47606203_47806755|GENSCAN_predicted_CDS_3|492_bp
atggagactgaaccaaaggcaagttacggggagataagaatacctgaagagaattcgatt
cagcttgatggttttacagaagcatatgaaagtggacaaaaccaagcatattcccttgaa
ctttttagtcctgtttgtcctaaaacagaaaatagccgcattcacataaactctgataaa
ggtcttgaagaacatacaggatctcaagaacttttcagttctgaagatgaactgccacca
aatgagatacgtattgagttgtgtagctcaggaatactgtgttcccaactaaataccttc
cacaaaagtgctattaaaagaagctgtacctctgaagataaagtggcccagtctgaagct
ctatctagagtccttcaagtagctaagaaaatgaagttgatttctaatggaggagattct
gctgtagaaatggatcggagaaatgtgtctgaatttaagagtattaaaaaaatcattaat
aaaaaactgtga

>gi568815588f:47606203_47806755|GENSCAN_predicted_peptide_4|453_aa
MPQSKSRKIAILGYRSVGKSSLTIQFVEGQFVDSYDPTIENTFTKLITTYSIDINGYILV
YSVTSIKSFEVIKVIHGKLLDMVGKVQIPIMLVGNKKDLHMERVISYEEGKALAESWNAA
FLESSAKENQTAVDVFRRMILEAEKMDGGSFTRQVFMLEAVYSYRGQKQKKVMLTVEQAQ
DQHYALVLWGPGAAWYPQLQRKKGYIWEFKYLFVQRNYTLENLELHTTPWSSCECLFDDD
IRAITFKAKFQKSAPSFVKISDLATHLEDKCSGVVLIKAQISELAFPLTAAQKISLNAHS
SLKSIFSSLPNIIYTGCAKCGLELETDENRIYKQCFSCLPFTMKKIYYRPALMTAVDGRH
NVYIHVESKLIEKILLNISADCLNRVIVPSSEITYGMVVADLFHSLLAVSAEPCVLKIQS
LFVLDENSYPLQQDFSLLDFYPDMVKHGANARL

>gi568815588f:47606203_47806755|GENSCAN_predicted_CDS_4|1362_bp
atgccgcagtcaaagtcccggaagatcgcgatcctgggctaccggtctgtggggaaatcc
tcattgacgattcaatttgttgaaggccaatttgtggactcctacgatccaaccatagaa
aacacttttacaaagttgatcacaacatactccatagatattaatggctatattcttgtg
tattctgttacatcaatcaaaagttttgaagtgattaaagttatccatggcaaattgttg
gatatggtggggaaagtacaaatacctattatgttggttggaaataagaaagacctgcat
atggaaagggtgatcagttatgaagaagggaaagctttggcagaatcttggaatgcagct
tttttggaatcttctgctaaagaaaatcagactgctgttgatgtttttcgaaggatgatt
ttggaggcagaaaaaatggacgggggcagcttcacaaggcaagtcttcatgctcgaggca
gtatacagttatagaggacagaagcagaaaaaagttatgttaacagtggaacaggcccaa
gatcaacattatgcgcttgtattatggggtcctggagcagcctggtaccctcaacttcaa
aggaaaaaaggttatatttgggaatttaaatatctttttgttcagcgcaattacacacta
gaaaacctagaattgcatacaacgccttggtcatcctgtgagtgcttgtttgatgatgat
ataagggcaattacatttaaagcaaaatttcaaaaaagtgcaccctcctttgtgaagata
tcagacttagcaacccacctagaggataagtgttcaggagtggttctaattaaagcccag
atttcagagctggcatttcctcttacagcagctcagaagatatctctaaatgctcacagt
tctctgaagagtattttttcttctcttcccaacatcatatatactggctgtgcaaaatgt
ggattggaactagaaacagatgagaacaggatctacaaacaatgttttagctgcttgcca
tttactatgaagaaaatatattataggccagcgttaatgactgccgttgatggaagacat
aatgtttacatccatgtagaatcaaagctgatagagaagattcttctcaacatttctgca
gactgcctcaacagagtgatagttccttcctcagagatcacctatgggatggttgtggca
gacctgttccactccttgttggcagtcagcgcagaaccttgtgtattgaagattcagagc
ctttttgtgttagatgaaaacagctatccattacaacaagatttctccctcctggatttt
tatcctgacatggtaaagcatggagccaatgcccgtctctga

>gi568815588f:47606203_47806755|GENSCAN_predicted_peptide_5|1332_aa
MEGSGQRGDHQGPKMSIPGNTPGERRCHPRTLLHSLGTFTISAHQDLVEIVQDKCVPHLD
PMINTLSGNLEKETHSIPQGNVAMKPPSYMKPDTGAAQRRTKAAGMAGTTTDVSCMPTQN
YCIRQYSSDFKRQKPNSNWLKHQRDFSLFLIRQLYHIPPKPYSLPLQQKTAQNSGSQPVA
AFRFRTPGGCAVLSELWHLQVFKLLDFLQPSLEVLLPWKYNFRLAMKDRTHVCCCFQASQ
LHRHGEEQGLKTGSRARAAARPWMPFCSGELAPGQPGLRAPGQWMEGRLPKVPVALCLIH
GQRHRRGSPGSLPYSCSLVKKERGSQRWTEEEQLSGLGDTEKDGDFGKAQHGGREMAALG
LALRKCQKGFPFQKTSHYPTPPPEQPEYSSGAIMEAAGVDAMPHWGGQRPGVSRSGGGFS
RHPRLSPSRAEISPVKRTAEDSYWVKWPGSPPSHAPTAQTLEGEWLRPLHPGPEGSQTRP
SGAPQFTTLLALGQPPLREPLTDINPIEKTEPSIRTLSCPKSQGWRVRSLNWNPGASGTR
SRIALSTGLSAESERVRERRIPAQGHQAGSAGARDPRGDSTPGSGKAPPSRGRSCGWRGL
GLRRIWEGGGSVLATRCEGSGAEPCVRRGGEGCSPALRDGGGTLTAPCPKPQGAPGAPGA
RDPPWLEPCGFRELTLGTSSTTDRPCARPTAPRGALRDGLAAQPQPPRPARPACGSSPMA
EQLALVIGGTIGGLLLLLLIGASCCLWRRFCATLTYEELPGTPAMATTAASSGQRDRPCQ
PHARTQLSSPWCSTVLAERSETTPFTCHLHVVSLTLDRPPAVPFVVPPTLQGRDWVPLHS
GEWADAPWDPCPASELLPHTSSGGLGDACMVGAINPELYKFPEDKSETDFPDGCLGRLWF
SVEYEQEAERLLVGLIKAQHLQAPSETCSPLVKLYLLPDERRFLQSKTKRKTSNPQFDEH
FIFQVQLLEQAGSGALRVAGKVSSKTITQRVLKFSVYHVDRQRKHQLLGQVLFPLKNETL
VGDCRRVIWRDLEAESLEPPSEFGDLQFCLSYNDYLSRLTVVVLRAKGLRLQEDRGIVSV
FVKVSLMNHNKFVKCKKTSAVLGSINPVYNETFSFKADATELDTASLSLTVVQNMEGDSK
ATPYPGLLGWGPRGAAECSGRRAGALGRDAQQAQGAGEALACALPHHGALTLRPAPRFRF
GSRPSSQLHVTATATRMDVSSNSSPSCQPARARRTPDLDQNLRCSEERSCLVMTQLYLLN
GPAGAGYYSGTVSVSSDIRIDASHPHIPRPSFRPPYPPHVSPYRPSFCVNYLFISIAAVP
SGKSSAWMLLAF

>gi568815588f:47606203_47806755|GENSCAN_predicted_CDS_5|3999_bp
atggaaggcagtggacaaagaggggaccaccaaggtccaaagatgtctatcccagggaat
actcctggagagaggcgctgtcatccaaggactctgctccactccctgggaaccttcaca
atttctgcccaccaggatttggtggagattgttcaggataagtgtgtaccccacttagat
ccaatgataaatactttatctgggaatctggagaaagaaactcactctattccccaaggg
aatgtggcgatgaagccaccatcttacatgaagccggatactggagccgcccagagaagg
acaaaggctgcgggcatggcaggtaccaccactgatgtgtcttgtatgcctacacagaat
tattgtattaggcaatactcttctgacttcaagaggcagaaacccaactcaaattggctt
aagcaccaaagggacttttcactgtttcttataaggcagttataccatataccaccaaaa
ccctactcactcccactgcaacagaagacagcgcagaactcaggttctcaaccagttgct
gcattcaggttcagaacccctggaggatgtgctgtcctctccgaactgtggcatttgcag
gtctttaaactcttggacttcctgcagccctccttggaagtgcttttgccatggaaatac
aacttccgcttggccatgaaagacagaacccatgtctgctgctgtttccaggcttcccag
ctccacagacacggggaagaacaaggtctgaaaacaggaagcagagccagagctgctgcc
agaccctggatgcctttctgttctggggagctggctccaggccagccaggtctcagagcc
cccgggcagtggatggaggggaggctgcccaaagtgcctgtggctctttgcctgattcac
ggtcagcgccacaggagagggtctcctgggtccttgccctacagctgctcactggtgaag
aaggaacgagggagccagagatggacagaggaggaacagctgtcaggacttggtgacaca
gagaaagatggggactttggaaaagcacagcatgggggtcgagagatggcagctctgggc
ctggcactcaggaagtgccagaagggattcccttttcaaaaaacatcccactacccaact
ccacccccagaacagcctgagtacagctcgggcgcaatcatggaggcagctggtgtggat
gccatgccccactggggtggtcagcgacctggggtaagcagaagtgggggaggattctcc
agacaccccagactgtctccaagtagagcggagatctcgccagttaagcgcacagctgag
gacagttactgggtgaagtggcctgggtctccgccttctcatgcccccaccgcccagacc
ctagaaggggagtggctgcggcctctgcaccctggccctgagggctcccagacccggccc
agtggggcaccccagttcaccaccctcctcgccttaggccagccgcccttacgagagccc
ctcacggacattaacccaattgagaagactgaacccagcatacgaacgttaagttgtccc
aaatcgcagggctggcgagtgaggagcctgaattggaacccaggagcttctggaacccgc
tccagaattgcgctctcgaccggtttgtccgctgagagcgaacgggtgcgcgagcgccgc
atccctgcccaaggccatcaggcaggcagcgctggggcccgggacccgcgcggagactcc
accccgggatccgggaaggctcccccgagccggggtcggagctgcggctggaggggcctc
ggcttgaggaggatctgggagggcgggggctcagtcctggccaccaggtgtgaggggtcg
ggtgcggagccctgtgtcagacgcggcggtgaaggctgtagccctgctctccgggatggg
ggtggtactctcaccgcaccctgccccaagccgcagggagcccctggcgcccctggcgcc
cgggacccgccctggctggagccctgcggtttccgggagctcacgctgggcaccagcagc
accacggaccgcccctgtgctcgcccgacggcgccccgcggcgctttaagagacggcctg
gcagcccagccccagccgcccagaccggcgagaccagcctgcgggagcagccccatggcg
gagcagctggccctggtgattgggggcaccatcggggggctgctgctgctgctgttgatc
ggggcaagctgctgtctgtggagaaggttctgtgccaccctcacctatgaggagctgcct
gggacaccagccatggccaccacagctgcctccagtgggcagcgggacaggccctgccag
ccgcatgctaggacccaactgagcagcccctggtgctcaaccgtcctggctgagcgctct
gagaccacaccattcacttgccacctacatgttgtctccctgactctggacaggccacca
gctgtgccattcgtggtgcccccaacccttcaaggccgagattgggtgcccctgcacagt
ggagagtgggccgatgccccatgggacccctgcccggcatcagagctgctgcctcacacc
tccagcggcggccttggagatgcatgtatggtgggggccatcaacccagagctgtacaag
ttcccagaggacaaaagtgagaccgacttccccgacggctgcctggggcggctgtggttc
tcggtggaatatgagcaggaggctgagcggctgctggtgggcttgatcaaggcacagcac
ctgcaagccccctcggagacctgcagccccctggtgaagctctacctgctgcccgatgag
cggcgcttcctccaatccaagaccaaacgcaaaacctccaacccgcagtttgacgagcac
ttcatctttcaggtacagcttctggagcaggcagggtctggtgccctccgcgttgctggg
aaggtgtccagcaagaccatcacccagagggtgctgaagttctccgtctaccacgtggac
aggcagaggaagcaccagctcctgggccaggtgctcttccccttgaagaatgagacccta
gtgggggactgccggcgtgtcatctggagagacctggaggctgagagcctggagcccccc
tcggagtttggcgacctccagttctgcctcagctacaacgactacctgagccgcctgacg
gtggttgtgctgcgtgccaagggcctccggctccaggaggacagaggcattgtcagtgtg
tttgtcaaagtgtctctgatgaaccacaacaagtttgtcaagtgcaagaagacttcagct
gtgctgggctccatcaaccctgtgtacaatgagaccttcagcttcaaggccgatgccacc
gagctggataccgctagcctcagcctgactgtggtgcagaacatggaaggggacagtaag
gccacaccctaccctgggctgctgggatggggaccacgaggggctgccgagtgttcaggc
agaagagctggagcactgggacgagatgctcagcaagcccaaggagctggtgaagcgctg
gcatgcgctctgccgcaccacggagccctgacccttcgcccagcaccgcggttccgcttt
gggagccgaccatcctcgcagttgcatgtgacagccacagccacacgcatggacgtttca
tccaacagctccccgagctgccagccagcccgagccaggaggacccctgatttggaccag
aatctcagatgctcagaagaaagatcctgtttagtgatgacccagctctacctgctgaat
ggacctgcaggagcaggctactacagtggaactgtttctgtttcatctgacatccggata
gatgccagccacccccacattccaagaccttccttccgccctccctaccccccgcacgtc
tctccctaccgtccttctttctgtgtgaattatctgttcatctccattgctgcagtaccg
tctggcaaatcatctgcttggatgctcctggctttttga