GENSCAN 1.0    Date run: 2-Nov-116    Time: 21:07:16

Sequence gi568815580r:32574660_32870591 : 295932 bp : 38.47% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.07 PlyA -    549    544    6                               1.05
 1.06 Term -    862    701  162  1  0   63   33   131 0.057   2.05
 1.05 Intr -   6371   6026  346  1  1   92   59    95 0.002   1.47
 1.04 Intr -  24084  23997   88  1  1  118   13    85 0.006   2.11
 1.03 Intr -  32352  32247  106  0  1   40   32   119 0.015   0.37
 1.02 Intr -  45314  45270   45  2  0   64   86    89 0.076   3.99
 1.01 Init -  52955  52761  195  2  0  101   52    99 0.096   6.58
 1.00 Prom -  72408  72369   40                              -3.15

 2.08 PlyA -  72691  72686    6                               1.05
 2.07 Term - 100138  99998  141  1  0  113   41   113 0.968   6.05
 2.06 Intr - 102671 102514  158  0  2   79   80   130 0.997  10.11
 2.05 Intr - 105668 105510  159  0  0  101   13   151 0.980   7.94
 2.04 Intr - 112574 112496   79  2  1   70  115    34 0.115   2.51
 2.03 Intr - 118500 118323  178  2  1   43    7   172 0.075   3.60
 2.02 Intr - 120893 120804   90  1  0   72   48   106 0.141   3.19
 2.01 Init - 134652 134534  119  0  2   68  103    57 0.207   4.92
 2.00 Prom - 141554 141515   40                              -3.35

 3.00 Prom + 143343 143382   40                              -6.85
 3.01 Init + 145654 145815  162  0  0   48   86   101 0.189   5.68
 3.02 Term + 168193 168396  204  0  0  113   53   171 0.505  12.49
 3.03 PlyA + 168555 168560    6                               1.05

 4.03 PlyA - 171165 171160    6                               1.05
 4.02 Term - 172898 172722  177  2  0   60   37   144 0.838   3.20
 4.01 Init - 176013 175942   72  0  0   64  100    57 0.960   5.62
 4.00 Prom - 189622 189583   40                              -6.45

 5.04 PlyA - 191498 191493    6                               1.05
 5.03 Term - 193080 192960  121  2  1   92   50    74 0.772   0.97
 5.02 Intr - 195975 194986  990  2  0  101   91  1601 0.794 150.73
 5.01 Init - 196363 196257  107  1  2   92   59   146 0.578   9.84
 5.00 Prom - 209722 209683   40                              -6.65

 6.04 PlyA - 211429 211424    6                               1.05
 6.03 Term - 217545 217480   66  0  0  112   39    89 0.737   3.36
 6.02 Intr - 219896 219727  170  0  2   33   94   169 0.942  10.74
 6.01 Init - 220306 220108  199  1  1   44   68   265 0.466  17.22
 6.00 Prom - 224388 224349   40                              -5.95

 7.00 Prom + 224527 224566   40                              -6.75
 7.01 Init + 227438 227530   93  1  0   40   37    92 0.233  -0.27
 7.02 Intr + 228379 228470   92  0  2  120   41    88 0.130   5.17
 7.03 Intr + 250390 250472   83  1  2   61  119    40 0.523   2.76
 7.04 Intr + 252469 252559   91  2  1   47   91    38 0.333  -1.87
 7.05 Intr + 257082 257263  182  0  2   52   -5   180 0.076   3.69
 7.06 Intr + 260197 260415  219  2  0   63   44   173 0.055   7.75
 7.07 Intr + 268144 268234   91  2  1   67   72    52 0.038  -0.37
 7.08 Term + 269062 269164  103  1  1   50   42   129 0.576   1.27
 7.09 PlyA + 270710 270715    6                               1.05

 8.02 PlyA - 270948 270943    6                               1.05
 8.01 Sngl - 284754 283390 1365  0  0   70   41   582 0.994  47.57
 8.00 Prom - 285705 285666   40                              -6.15

 9.02 PlyA - 285874 285869    6                               1.05
 9.01 Sngl - 286758 286102  657  0  0   65   43   432 0.776  32.32

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  34730  34911  182  1  2   37   73   192 0.804  11.30
S.002 Sngl - 182909 182694  216  2  0   54   54   161 0.814   4.22

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815580r:32574660_32870591|GENSCAN_predicted_peptide_1|313_aa
MDFNVTIKDIGRSGLGVEKEKWEQVVKTGKEGWEVLLPRVTVEQKKKWNWKWCYCKNNGK
HLAHLRKSEDDQGHAKSYRKRGGEEQQLIPGIYYMTAVHQRYLDHRTATSSLKISGMGLP
ENQTAAIVIALVGLATKQGYWAPGWHGLPRVGPETSRLMPSPLSSLGMNRTPLPMGSNQS
LHVTLIDLTWVTSPFLNQQGDELLCQILVTYPPLELGVWLFPSEPHEVKMRSGQFPQGKR
DGVISNQEESSRPRYNINSCSCCVKMQSTLVAVGDSLIVTKLVDPPRSTNCLKARAVGWQ
VTPKTVGLVKKGH

>gi568815580r:32574660_32870591|GENSCAN_predicted_CDS_1|942_bp
atggattttaatgttaccattaaagacattggaaggtctggtcttggtgtggagaaagaa
aagtgggaacaggtggtaaaaacaggaaaagaaggatgggaggtcctcctcccacgggta
acagtggaacaaaagaagaagtggaattggaagtggtgctattgcaagaacaatggaaag
cacctagcacatctgagaaaatctgaagatgatcagggacatgccaagtcatatcggaag
agaggtggagaagagcagcagctgataccagggatatactacatgacagcagttcatcaa
agatatttggatcatcggacagcaactagctctctcaaaatatcagggatggggcttcct
gagaaccagactgcagcaattgttattgcccttgtgggtctagccaccaagcaaggctac
tgggccccaggctggcacggtctccccagggtgggaccggaaacctccaggctcatgccc
tcccctttgagtagccttggcatgaatagaactcctctgcccatgggctccaatcagagt
ctccatgtgactctgattgacctaacttgggtcacaagtccattcctgaaccagcagggg
gatgaactactctgccagattttggtcacatacccacctttggagctaggggtgtggtta
tttccatcagaaccacatgaagtaaaaatgaggagtgggcagttcccacaagggaaaagg
gatggtgtgatctctaatcaggaagagagttcaagacccagatataacattaatagctgc
tcctgttgtgtcaaaatgcaaagcacactggtagctgttggtgactcactgatagtcacc
aagcttgtggaccctccaaggagcacaaactgcttgaaagccagggctgtaggttggcaa
gtcacacctaagactgtgggactagtaaagaaaggccattaa

>gi568815580r:32574660_32870591|GENSCAN_predicted_peptide_2|307_aa
MRIFKGKTCQRQREEQMQRPDTGTCLECCRSKAAGMVGAPYAIQQCPPLRCGGGKLLVRV
GWRGPVESEWWGMREGNEVACSAGGAAVTMSSQHSAAREWVHRSGKGIWANEQEHPPQSR
EWFLYSIHLENTVQILSADMILDLIAGFNFHPCRKGGVHNGEYVPWLYCYDPVMDVWARK
QDMNTKRAIHTLAVMNDRLYAIGGNHLKGFSHLDVMLVECYDPKGDQWNILQTPILEGRS
GPGCAVLDDSIYLVGGYSWSMGAYKSSTICYCPEKGTWTELEGDVAEPLAGPACVTVILP
SCVPYNK

>gi568815580r:32574660_32870591|GENSCAN_predicted_CDS_2|924_bp
atgagaatattcaaaggaaaaacatgtcagaggcagagagaagagcagatgcaaaggcct
gacacaggaacatgtttggagtgttgcagaagcaaagcagccggtatggttggagcacct
tatgccatacaacagtgcccaccactgcgttgtggaggtggaaaacttcttgttcgtgtt
gggtggagaggaccagtggaatccgaatggtgggggatgagggaaggcaatgaggtggcc
tgttcagcaggaggagccgccgtgaccatgagcagccagcattcagcagccagggaatgg
gtgcaccggtctggtaaaggaatctgggctaatgaacaagagcacccaccacaatcccgt
gaatggtttctatactccatccatctggaaaacacagtacaaattttgtcagccgatatg
atcctcgatttaatagctggattcaacttccacccatgcaggaaagggggtgtacacaat
ggagaatatgtcccatggctatattgctatgacccagtaatggatgtctgggctcgaaaa
caagatatgaacacaaaacgtgcaattcacactttggctgtaatgaatgatcgcttgtat
gcaattggaggaaatcatttgaaaggtttctcccaccttgatgtaatgcttgtggaatgc
tatgacccaaaaggtgaccagtggaatatactccaaactcccattttggagggtcgaagt
ggccctggctgtgcagtgcttgatgacagcatttaccttgtgggaggctacagctggagt
atgggggcctacaagtcatctacaatatgctattgtccagagaaaggaacctggacagaa
ctcgaaggagatgtagcagaaccgttggcaggccctgcctgtgtgacagttattctgccc
tcttgtgtaccatacaacaaataa

>gi568815580r:32574660_32870591|GENSCAN_predicted_peptide_3|121_aa
METEDHSDSAYGQLRKGLEEKEGSGMTLKISGWRDEDAISPNREQQRQNGFKGKGATTER
SDYPPIKSWQFAAINRDCDSQSAWQFLRKACNVSYAVKAVRCTLASTLDLMDVQSSGTVM
D

>gi568815580r:32574660_32870591|GENSCAN_predicted_CDS_3|366_bp
atggaaacagaagatcatagtgacagcgcttatggtcagttgaggaaggggttggaggag
aaagaggggtcagggatgacacttaagatttcaggttggagagatgaagatgctatttct
cccaacagagaacaacaaagacagaatgggttcaaaggcaagggggcaaccactgaaagg
agtgactacccacccatcaagagctggcaattcgctgctattaaccgtgactgtgattcc
cagtccgcctggcagttcctgagaaaggcctgcaatgtctcatatgcagtaaaagctgta
agatgcaccctggccagcacactcgacctgatggatgtacaatcctcaggaacagtaatg
gattga

>gi568815580r:32574660_32870591|GENSCAN_predicted_peptide_4|82_aa
MKFGDLNIVRFGDYPNRGSWQRREISGSIRFSYERKPYVNCTCEGSSLCAPYENLIPDDV
SLSPITPRWDCLVAGKQAQGSH

>gi568815580r:32574660_32870591|GENSCAN_predicted_CDS_4|249_bp
atgaaattcggtgacttgaatattgtaagatttggtgactaccctaacagaggcagctgg
cagcgaagagaaattagtggcagtatcagattctcatatgagcgcaaaccctacgtgaat
tgcacatgtgagggaagtagcttgtgtgctccttatgagaatctaatacctgatgatgtg
tcactgtctcccatcacccccagatgggactgtctagttgcaggaaaacaagctcagggc
tcccactga

>gi568815580r:32574660_32870591|GENSCAN_predicted_peptide_5|405_aa
MDTSPRGASASAATLPSLLPLLLLPTAELRWLVLGKLEEGLKPGAPSQLAMSRSGDRTST
FDPSHSDNLLHGLNLLWRKQLFCDVTLTAQGQQFHCHKAVLASCSQYFRSLFSSHPPLGG
GVGGQDGLGAPKDQQQPPQQQPSQQQQPPPQEEPGTPSSSPDDKLLTSPRAINNLVLQGC
SSIGLRLVLEYLYTANVTLSLDTVEEVLSVSKILHIPQVTKLCVQFLNDQISVQNYKQVC
KIAALHGLEETKKLANKYLVEDVLLLNFEEMRALLDSLPPPVESELALFQMSVLWLEHDR
ETRMQYAPDLMKRLRFALIPAPELVERVQSVDFMRTDPVCQKLLLDAMNYHLMPFRQHCR
QSLASRYVLRFNVRRVDFQNAKSISHFSQDSAHQLVPAGSVLVDG

>gi568815580r:32574660_32870591|GENSCAN_predicted_CDS_5|1218_bp
atggacacgagcccacgaggtgcctcggcctcagccgccactttgccgtcgctgctgcca
ctgctgttgctgccaacagccgagctgaggtggttggttttaggaaagttggaggagggt
ttaaagccaggtgcaccgagtcagctcgccatgtccagatccggggacaggacctccacc
ttcgaccccagccacagcgacaacctgctgcacggcctcaacctgctgtggaggaagcag
ctgttttgcgacgtgaccctgacggcccagggccagcagttccattgccacaaggccgtg
ctggcctcctgctcgcagtacttccgatcgctcttctccagccacccccctctcggggga
ggggtcggcggccaggacggcctgggggcccccaaggaccagcagcagccgccgcagcag
cagccgtcacagcagcagcagccgccgccgcaggaggagcccgggactccttcttcctcc
cccgacgacaagctgctgaccagcccccgggccatcaacaacctggtgctgcagggctgc
tcgtccatcgggctgcgcctggtgctcgagtacctctacacggccaacgtgaccctgtcc
ctggacacggtggaggaggtgctgtcggtcagcaagatcctgcacatcccccaggtcacc
aagctctgcgtgcagttcctcaacgaccagatctcggtgcagaactacaagcaggtgtgc
aagatcgccgcgctgcacggcctggaggagaccaagaagctggccaacaagtacctggtg
gaggatgtgctgctgctcaacttcgaggagatgcgcgccctgctggactcgctgccgccc
cccgtggagtcggagctggcgctcttccagatgtccgtgctgtggctggagcacgaccgc
gagacccgcatgcagtatgcgcctgacctcatgaagcgcctccgcttcgccctcatcccg
gccccggagctggtggagcgggtccagtcagtggatttcatgcgaaccgacccggtctgc
cagaagctgctgctggacgccatgaactaccacctgatgcccttcaggcagcactgcagg
cagagcctggccagcaggtatgtcctaaggttcaatgttagaagagttgactttcaaaat
gcaaaatccatttcccactttagtcaagattcagcacaccagctagtgcctgctggctct
gtcctggtggatggctga

>gi568815580r:32574660_32870591|GENSCAN_predicted_peptide_6|144_aa
MRLHSSALGWSMGLGAVEQGVALVEEARAAKEPTEGVGGSGMAACRSQALPRGKAAKARR
EIERSAAEGVGSGLGQPRKGLPQCSGGLKGSSSAAKVGAQAEEAPRGSGGCEDCQHAVTS
HQEPAKLKDDEHKNLYDDLFPLNG

>gi568815580r:32574660_32870591|GENSCAN_predicted_CDS_6|435_bp
atgcgcctgcactcctcagcccttgggtggtcgatgggactgggcgccgtggagcagggg
gtggcgctcgtcgaggaggctcgggcagcaaaggagcccacggagggggtgggaggctca
ggcatggcggcctgcagatcccaagccctgccccgcgggaaggcagctaaggcccggcga
gaaatagagcgcagcgccgctgagggagtgggctccggccttggccagcccagaaagggg
ctcccacagtgcagtggtggactgaagggctcctcaagtgccgccaaagtgggagcccag
gcagaagaggctccgagagggagcgggggctgtgaggactgccagcacgctgtcacctct
caccaggagcctgctaaactcaaagatgatgagcataaaaacctttatgatgatctgttt
ccacttaatggatag

>gi568815580r:32574660_32870591|GENSCAN_predicted_peptide_7|317_aa
MDTARTKTFDKQQPKNPEEGISPAIQVELRKVSENDYLAEIIIIADRYLASCKVPCKHRD
FLHHTSLDLSEKSRVEYGETFASYLPLYTGSSAFMMWTHQVSPHTILWVGVQIVDNRNNC
VCQRLRVQTIGGPDDIWLLSSPRILAAVLISALFIKTDDARYNMLSVVQCMSRKELAEMP
GRMNSNPRDPEMRKGCLTYSKKTKEAQRPEKSAGKGRRSEMQSEQWQAPRVCGKDFRFHP
EEEDRSLESFDQGTCLCYSLYLRSVVLSSCTTPKKNEDMLDFEGYHSGKEEGATDTQRNR
VVWKGKRGTTVSWENPF

>gi568815580r:32574660_32870591|GENSCAN_predicted_CDS_7|954_bp
atggacacagccaggacgaaaacattcgacaagcaacagccaaaaaatccagaggaagga
atttcacctgcaatacaagtagaacttagaaaagtaagtgaaaatgattatctggctgag
atcataatcattgctgatcggtacttggcctcctgcaaagttccgtgcaagcacagagac
tttttacatcatacctctcttgatctgagtgaaaagagcagggtggaatacggtgaaaca
tttgcctcttacctgcctctatacactggatcttcagcatttatgatgtggactcaccaa
gtttcacctcacactatcctatgggtaggtgtccagattgtagataatagaaataattgc
gtttgtcagagactcagggttcaaactattggaggtccagatgacatctggcttctcagc
tcccccagaatactggctgcagtgcttatcagtgcattattcatcaaaactgatgatgca
aggtacaacatgctgtctgtcgttcagtgtatgagtcgaaaggagttggcagaaatgcct
ggcagaatgaacagtaatcccagagatcctgagatgagaaaagggtgtttgacttattca
aagaagaccaaggaagcccaaagaccagagaagagtgcaggaaaggggcggcggtctgag
atgcagtcagagcagtggcaagcccccagagtctgtggcaaagacttcaggtttcatcct
gaagaggaagacaggtctttggagagttttgatcaaggaacttgcctgtgttatagcctg
tacctgcgttcagtggtcctgagctcttgtaccacacccaagaagaatgaggatatgctg
gactttgaagggtaccactccggaaaagaggagggtgccactgacacccagaggaaccga
gtggtatggaaaggaaaacgaggtacaactgtgtcatgggaaaacccattttga

>gi568815580r:32574660_32870591|GENSCAN_predicted_peptide_8|454_aa
MDKFLDTYTLPRLNQEEVESLNRPITGPEIVAIINSLPTKKSPGPDGFTAEFYQRYKEEL
VPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILA
NRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDAEKAFDKI
QQPFMLKTLNKLAIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLF
KIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKV
SGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPL
LKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMPFFTELGKTTLKFI
WNQKRARIAKSILSQKNKAGGITLPASNYTTRLQ

>gi568815580r:32574660_32870591|GENSCAN_predicted_CDS_8|1365_bp
atggataaattcctcgacacatacactctcccaagactaaaccaggaagaagttgaatct
ctgaatagaccaataacaggacctgaaattgtggcaataatcaatagtttaccaaccaaa
aagagtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactg
gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca
ttttatgaggccagcatcattctgataccaaagccgggcagagacacaaccaaaaaagag
aattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggca
aaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct
gggatgcaaggctggttcaatatacgcaaatcaataaatgtaatccagcatataaacaga
gccaaagacaaaaaccacatgattatctcaatagatgcagaaaaagccttcgacaaaatt
caacaacccttcatgctaaaaactctcaataaattagctattgatgggacgtatttcaaa
ataataagagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactg
gaagcattccctttgaaaactggtacaagacagggatgccctctctcaccgctcctattc
aaaatagtgttggaagttctggccagggcaatcaggcaggagaaggaaataaagggtatt
caattaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattgtttatcta
gaaaaccccatcgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtc
tcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaacaacagacaa
acagagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatac
ctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactg
ctcaaggaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggtagga
agaatcaatatcgtgaaaatggccatactgcccaaggtaatttacagattcaatgccatc
cccatcaagctaccaatgcctttcttcacagaattgggaaaaactactttaaagttcata
tggaaccaaaaaagagcccgcatcgccaagtcaatcctaagccaaaagaacaaagctgga
ggcatcacactacctgcttcaaactatactacaaggctacagtaa

>gi568815580r:32574660_32870591|GENSCAN_predicted_peptide_9|218_aa
MEDEMNEMKREGKFREKRIQRNGQSLQEIWDYVKRPNLHLIGVPESDAENGTKLENTLQD
IIQENFPNLARQANVQIQEIQRTPKRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRV
TLKGKPIRLTVDLSAETLRARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFTDK
QMLRDFVTTRPALKELLKEVLNMERNNRYQPLQNHAKM

>gi568815580r:32574660_32870591|GENSCAN_predicted_CDS_9|657_bp
atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaatacaa
agaaatgggcaaagcctacaagaaatatgggactatgtgaaaagaccaaatctacatctg
attggtgtacctgaaagtgatgcggagaatggaaccaagttggaaaacactctgcaggat
attatccaggagaacttccccaatctagcaaggcaggccaacgttcagattcaggaaata
cagagaacgccaaaaagatactcctcgagaagagcaactccaagacacataattgtcaga
ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggtt
accctcaaagggaagcccatcagactaacagtggatctctcagcagaaaccctacgagcc
agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt
tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttacagacaag
caaatgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagtg
ctaaacatggaaaggaacaaccggtaccagccgctgcaaaatcatgccaaaatgtaa