GENSCAN 1.0 Date run: 3-Nov-116 Time: 11:53:42 Sequence gi568815597f:95134231_95344710 : 210480 bp : 38.18% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 528 523 6 1.05 1.04 Term - 26308 25452 857 2 2 59 41 317 0.476 16.06 1.03 Intr - 29309 29127 183 0 0 -4 80 184 0.643 7.24 1.02 Intr - 29574 29416 159 1 0 70 71 91 0.865 4.64 1.01 Init - 34210 34081 130 1 1 49 53 145 0.818 7.46 1.00 Prom - 39396 39357 40 -8.05 2.00 Prom + 39746 39785 40 -3.95 2.01 Init + 49812 49843 32 2 2 40 95 33 0.334 -1.54 2.02 Term + 57320 57638 319 2 1 102 42 236 0.976 13.77 2.03 PlyA + 58182 58187 6 1.05 3.04 PlyA - 58620 58615 6 1.05 3.03 Term - 79000 78201 800 2 2 31 38 269 0.519 8.63 3.02 Intr - 81787 81604 184 1 1 71 71 113 0.625 6.34 3.01 Init - 82412 82296 117 2 0 80 77 49 0.671 3.25 3.00 Prom - 84497 84458 40 -5.75 4.00 Prom + 90486 90525 40 -5.35 4.01 Init + 97330 97428 99 0 0 82 69 63 0.286 4.11 4.02 Term + 100393 100494 102 0 0 79 35 96 0.382 0.70 4.03 PlyA + 101635 101640 6 1.05 5.00 Prom + 104136 104175 40 -3.65 5.01 Init + 107351 107462 112 1 1 30 23 149 0.695 3.02 5.02 Term + 109981 110483 503 2 2 70 36 367 0.680 23.36 5.03 PlyA + 110702 110707 6 1.05 6.04 PlyA - 111233 111228 6 1.05 6.03 Term - 115955 115761 195 2 0 75 53 129 0.659 4.53 6.02 Intr - 142049 141942 108 2 0 92 52 83 0.065 4.66 6.01 Init - 156958 156902 57 1 0 75 97 51 0.681 6.06 6.00 Prom - 158493 158454 40 -3.65 7.02 PlyA - 159252 159247 6 1.05 7.01 Sngl - 161621 160779 843 2 0 52 43 290 0.853 16.40 7.00 Prom - 161714 161675 40 -6.15 8.02 PlyA - 161789 161784 6 1.05 8.01 Sngl - 162854 162108 747 2 0 49 43 512 0.951 38.73 8.00 Prom - 167690 167651 40 -4.55 9.05 PlyA - 167890 167885 6 1.05 9.04 Term - 177219 176902 318 0 0 80 54 186 0.926 8.30 9.03 Intr - 182368 182140 229 0 1 16 28 237 0.235 7.45 9.02 Intr - 184182 184028 155 0 2 51 80 105 0.414 3.95 9.01 Intr - 199036 198843 194 2 2 107 92 28 0.027 3.39 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:95134231_95344710|GENSCAN_predicted_peptide_1|442_aa MSESPNTDTFRVSEARLVKVGKQVEEAGECGTIVTEVKHLEADTSTGDTQANKVWSGPTA NSNRPVAEGPVRRKTNKQKGHPHQNPICMSPSSKTKAEQSWTENNFDKLREEGFKRSVIT NFSELKEDVRTHRKKAKNMQKRVDEWLTTINSIQKTSVLEVLARAIRQEKEIKRIQLGKE EVKLSLFADNMTVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTENQI MSELTFTIATKIIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKSIPCSCVGRINIV KMAMLPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNEKRARIAKTILSQKNNAGGIMLP DFKLYYKATVTKTAWYWYQNRDVDQWNRAGASVIIPHIYNHLIFEKPDKNKKWGKDSLFN KWCWENWLATCRKLKLHPFLTP >gi568815597f:95134231_95344710|GENSCAN_predicted_CDS_1|1329_bp atgtctgaatctccaaacactgatacatttagagttagtgaagcaaggctggtaaaagtg ggtaagcaagtagaagaggcaggggaatgtgggaccattgtaacagaagtaaaacatcta gaagcagacacctccactggtgatactcaggcaaacaaggtctggagtggacctacagca aactccaacagacctgtagctgaaggtcctgttagaaggaaaactaacaaacagaaagga catccacaccaaaaccccatctgtatgtcaccatcatcaaagaccaaagcagaacaaagc tggacagagaacaactttgacaagttgagagaagaaggcttcaaacgatcagtaataaca aacttctccgagctaaaggaggatgttcgaacccatcgcaaaaaagctaaaaacatgcaa aaaagagtagacgaatggctaactacaataaacagcatacagaagacctcagtattggaa gttctggccagggcaatcaggcaggagaaagaaataaagcgtattcaattaggaaaagaa gaagtcaaactgtccctgtttgcagataacatgactgtatatttagaaaaccccatcgtc tcagcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatc aatgtgcaaaaatcgcaagcattcctatacaccaataacagacaaacagagaaccaaatc atgagtgaactcacattcactattgctacaaagataataaaatacctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaa gaggacacaaacaaatggaagagcattccatgctcatgtgtaggaagaatcaatatcgtg aaaatggccatgctgcccaaggtaatttatagattcaatgccatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttaaagttcatatggaatgaaaaaaga gcccgcattgccaagacaatcctaagccaaaagaacaatgctggaggcatcatgctacct gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatgttgaccaatggaacagagcaggggcctcagtaataataccacacatctacaac catctgatctttgagaaacctgacaaaaacaagaaatggggaaaggattccctatttaat aaatggtgctgggaaaactggctagccacatgtagaaagctgaaactgcatcccttcctt acaccttaa >gi568815597f:95134231_95344710|GENSCAN_predicted_peptide_2|116_aa MPEEITFATKMWFFEALKYPKFSKAIVINGILMTVVFFIVRIASMLPHYGFMYSVYGTEP YIRLGVLIQLSWVISCVVLDVMNVMWMIKISKGCIKVISHIRQEKAKNSLQNGKLD >gi568815597f:95134231_95344710|GENSCAN_predicted_CDS_2|351_bp atgcctgaagaaattacatttgcaactaagatgtggttctttgaagctctgaagtatccc aagttttctaaagctatcgttatcaatggaatactcatgacagtagtattcttcatcgtg cggattgcctcaatgcttcctcattatggcttcatgtattccgtgtatggaacagaaccc tacataaggcttggagttttaatccagttatcctgggtcattagttgtgttgttttggat gtgatgaatgtcatgtggatgatcaaaatttcaaaaggttgcatcaaagtcatctctcac atcagacaagagaaagccaaaaatagtcttcagaatggaaaacttgattaa >gi568815597f:95134231_95344710|GENSCAN_predicted_peptide_3|366_aa MPTEPSKQRFIGLKFSLLVQQSEINLGCWSLAGGGASAMPPLVIPRQTGSGVDLQQTPTD LQLRGLTVRRKTNKQKKHQHQQKGHPHQNPIHRSPTSKTKEKEGILSNSFYEASIILIPN PGRDTTKKENFRPISLMNIDVKILNKILANRIQQHIKKLIHHNQVGFIPGMQGWFNICKS INVIHHINRIKDKNHMTLSIDAEKAFDKIQQRFMLKTLNKLSIDGMYLKIIRAIYDKPTA NVILNGQKLDAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKHIQLGKEEVKLSL FADDMIVYLENPIVPAQNLLKLISNFSKVSGYKINVQKSQASLYTNNRQTESQIMSELHS QLLQRE >gi568815597f:95134231_95344710|GENSCAN_predicted_CDS_3|1101_bp atgcccaccgagcccagcaagcaaagattcattggcttgaaattctcgctgctagtgcag cagtctgagatcaacctgggatgctggagcttggcaggcggaggggcgtctgccatgcct ccgctggtgatacccaggcaaacagggtctggagtggacctccagcaaactccaacagac ctgcagctgaggggcctgactgttagaaggaaaactaacaaacagaagaagcatcagcat caacaaaaaggacatccacaccaaaaccccatccataggtcaccaacatcaaagaccaaa gaaaaagagggaatcctctctaactcattttatgaggccagcatcatcctgataccaaat cctggcagagacacaacaaaaaaagagaattttaggccaatatccctaatgaacatcgat gtgaaaatcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatc caccacaatcaagtcggcttcatccctgggatgcaaggctggttcaacatatgcaaatca ataaacgtaatccatcacataaacagaatcaaagacaaaaaccacatgactctctcaata gatgcagaaaaggccttcgacaaaattcagcagcgcttcatgctaaaaactctcaataaa ctaagtattgatggaatgtatctcaaaataataagagctatttatgacaaacccacagcc aatgtcatactgaatgggcaaaaactggacgcattccctttgaaaactggcacaagacaa ggatgccctctctcaccactcctattcaacatagtattggaagttctggccagggcaatc aggcaagagaaagaaataaagcatattcaattaggaaaagaggaagtcaaattgtcccta tttgcagatgacatgattgtatatttagaaaaccccatcgtcccagcccaaaatctcctt aagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaa gcatccctatacaccaataacagacaaacagagagccaaatcatgagtgaactccattca caattgctacaaagagaataa >gi568815597f:95134231_95344710|GENSCAN_predicted_peptide_4|66_aa MEVDNALTVDIHPTFKYFEFSVACEYEETYYIKISPYKEACADAPSLFCVLPKHYALGPP ESTPRF >gi568815597f:95134231_95344710|GENSCAN_predicted_CDS_4|201_bp atggaggtagacaatgctctaactgtagacattcatcctacatttaaatattttgaattt tcagtagcctgtgaatacgaggaaacttattacattaagatctcaccctataaagaagcc tgtgctgacgccccctcgttgttctgtgtccttcctaagcactatgccctaggccccccc gagagcactccccggttctaa >gi568815597f:95134231_95344710|GENSCAN_predicted_peptide_5|204_aa MQKSLNKDINALCSEGIQKCQIVGIRPADTDLVMDERQTDGTVFRIHTKAEGFMDVDIPL ELVFHLPVNYPSCLPGISINSEQLTRAQCVTVKENLLEQAESLLSEPMVHELVLWIQQNL RHILSQPETGSGSEKCTFSTSTTMDDGLWITLLHLDHMRAKTKYVKIVEKWASDLRLTGR LMFMGKIILILLQGDRNNLKVPKS >gi568815597f:95134231_95344710|GENSCAN_predicted_CDS_5|615_bp atgcaaaagtccttgaacaaagatattaatgccctttgctcagaaggcattcagaaatgc cagattgtgggcatccgacccgcagacactgacctagtgatggatgagagacagacagat gggaccgtgttcagaattcacacaaaagctgaaggatttatggatgtggatatacctctg gaattggtgttccatttgccagtcaattatccttcatgtctacctggtatctcgattaac tctgaacagttgaccagggcccagtgtgtgactgtgaaagagaatttacttgagcaagca gagagccttttgtcggagcctatggttcatgagctggttctctggattcagcagaatctc aggcatatcctcagccaaccagaaactggcagtggcagtgaaaagtgtactttttcaaca agcacgaccatggatgatggattgtggataactcttttgcatttagatcacatgagagca aagactaaatatgtcaaaattgtggagaagtgggcttcagatttaaggctgacaggaaga ctgatgttcatgggtaaaataatactgattttactacagggagacagaaacaacctcaag gtgccaaaaagttaa >gi568815597f:95134231_95344710|GENSCAN_predicted_peptide_6|119_aa MQMQGEELSKRAEEMPRPKVLPKNHTSDATIERSLGALARVFQCKKDKRVEQIQLTSLPA WFLLGLSIEGTSGRSEGSTDSTVILVGSWYSSNRGGVIPDSNGPETVPISNFSSGLGSI >gi568815597f:95134231_95344710|GENSCAN_predicted_CDS_6|360_bp atgcagatgcagggggaagagctttccaaacgggcagaagaaatgccaaggccaaaggta ctgcctaaaaatcatacatcagatgctaccattgagagatctctgggggcactagcaaga gtatttcagtgcaaaaaagataaaagggttgagcaaattcaacttacttccttgccagct tggttcctgttaggcttgtcaatagaaggcacttctgggagatcagaaggtagcacagat agcactgtgatccttgttggcagttggtatagcagcaacaggggtggagtgattccagat tcaaatggaccagagacagtgcctattagcaattttagctctggtctaggcagcatttga >gi568815597f:95134231_95344710|GENSCAN_predicted_peptide_7|280_aa MADFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYC KIDHILGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQNRSTTWKLNNLLLNDYWV HNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIK KKRQKNQIDAIKNDKGDITTDPAEIQLPSENTTNTSTQIN >gi568815597f:95134231_95344710|GENSCAN_predicted_CDS_7|843_bp atggcagactttaacaccccactgtcaacattagacagatcaacaagacagaaagtcaac aaggatacccaggaattgaactcagctctgcaccaagcagacctaatagacatctacaga actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattgc aaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc actcaaaaccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataatgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaatcc aggagctggttttttgaaaggatcaacaaaatagatagaccactagcaagactaataaag aaaaaaagacagaagaatcaaatagacgcaataaaaaatgataaaggggatatcaccacc gatcccgcagaaatacaactaccatcagagaatactacaaacacctctacacaaataaac tag >gi568815597f:95134231_95344710|GENSCAN_predicted_peptide_8|248_aa MELKTKARELREECRSLRSRCDQLEERVSAMEDEMNEMKREGKFREKRIKRNEQSLQEIW DYVKRPNLHLIGVPESDGENGTKLENTLQYIIQENFPNLARQANVQIQEIQRTPQRYSSR TATRRHIIVRFTKVEMKEKMLRAAREKGRVTLKGKPIRLTADLSAETLQARREWGPIFNI LKEKNIQPRISYPAKLSFISEGEIKYFTEKQMLRDFVTTRPALKELLKEALNMERNNRYQ PLQNHAKM >gi568815597f:95134231_95344710|GENSCAN_predicted_CDS_8|747_bp atggagctgaaaaccaaggctcgagaactacgtgaagaatgcagaagcctcaggagccga tgcgatcaactggaagaaagggtatcagcgatggaagatgaaatgaatgaaatgaagcga gaagggaagtttagagaaaaaagaataaaaagaaatgagcaaagcctccaagaaatatgg gactatgtgaaaagaccaaatctacatctgattggtgtacctgaaagtgacggggagaat ggaaccaagttggaaaacactctgcagtatattatccaggagaacttccccaatctagca aggcaggccaacgttcagattcaggaaatacagagaacgccacaaagatactcctcgaga acagcaactcgaagacacataattgtcagattcaccaaagttgaaatgaaggaaaaaatg ttaagggcagccagagagaaaggtcgggttaccctcaaagggaagcccatcagactaaca gcggatctctcggcagaaaccctacaagccagaagagagtgggggccaatattcaacatt cttaaagaaaagaatattcaacccagaatttcatatccagccaaactaagcttcataagt gaaggagaaataaaatactttacagagaagcaaatgctgagagattttgtcaccaccagg cctgccttaaaagagctcctgaaggaagcactaaacatggaaaggaacaaccggtaccag ccgctgcaaaatcatgccaaaatgtaa >gi568815597f:95134231_95344710|GENSCAN_predicted_peptide_9|298_aa XCVSMVCRPILSFHYFRLHRGEVCPLWHKRVCLGKWPCTGCQKRLTDPGVSMHTIEADLA LYLVYTKLTRFKSGLGKRDSILEGHLIPKGKKTYFIWNIVKSQKNFTSVPIGREPFEHSR CSVSADYAGEVTQKPDGGDDTQWPWDFVSKGSDKQHERSILRDTMGQTSALFTCPWEKFP GVDQKCVVVKLCQIRQGKTVHEDALIFLYNALLNDMSSLSILAKRKYNFKLKLSSDSVKL NSKETVTLGFIPLRQEAGQWGIDMSRSSVINLQFMLFLGDVPIMQEASPRLARATCLL >gi568815597f:95134231_95344710|GENSCAN_predicted_CDS_9|897_bp ncatgtgtgtccatggtatgcaggcctatcctcagctttcactatttcaggctccacaga ggagaggtctgtcctctgtggcataagagggtgtgtttaggcaaatggccctgcactggc tgccagaagagactcactgaccctggggtatcaatgcacactattgaagctgaccttgct ctatatcttgtctatacaaagttgacaagattcaaaagtggtttgggtaaaagagactca atcctagaggggcatttgattccaaaagggaagaagacgtacttcatctggaacatcgtg aagtcacagaagaacttcacaagtgttccgatcggaagggagccctttgaacattcaagg tgctcagtgagtgcagactatgcaggagaagtgacacagaagcctgatggaggtgatgat acccagtggccatgggactttgtcagcaaaggcagtgataaacagcatgagaggtcaata ctccgggacaccatggggcagacgtcagccttattcacatgcccatgggagaaattccct ggggtcgaccagaagtgcgtggtggtgaagctttgccagatacgacagggaaaaacagtc catgaggacgctttgatttttctttacaacgcgttactaaatgacatgagtagcctttcc atcctggccaagaggaaatataatttcaaactaaagttgagttcagattctgttaaatta aatagcaaagaaacagtcactcttggttttattcctcttagacaggaagcagggcagtgg ggaattgacatgagccgttcctcggtgattaacctgcagtttatgttgtttctaggtgac gtgcccatcatgcaggaagcttctccaaggctggcgagagcaacttgtctgctttga