GENSCAN 1.0 Date run: 2-Nov-116 Time: 21:01:17 Sequence gi568815580r:43170344_43374395 : 204052 bp : 35.89% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 8663 8738 76 1 1 54 54 97 0.349 4.20 1.02 Term + 24635 24759 125 0 2 36 41 142 0.236 1.87 1.03 PlyA + 24981 24986 6 1.05 2.03 PlyA - 26085 26080 6 1.05 2.02 Term - 55707 55516 192 0 0 72 42 145 0.566 4.74 2.01 Init - 71235 71185 51 0 0 68 103 19 0.401 2.61 2.00 Prom - 73303 73264 40 -3.25 3.00 Prom + 81050 81089 40 -5.75 3.01 Init + 82134 82205 72 2 0 26 67 102 0.758 3.02 3.02 Term + 83816 84031 216 1 0 58 40 153 0.831 3.66 3.03 PlyA + 84282 84287 6 1.05 4.07 PlyA - 84385 84380 6 1.05 4.06 Term - 94520 94167 354 2 0 59 48 367 0.154 23.31 4.05 Intr - 100305 100118 188 1 2 90 74 152 0.040 12.49 4.04 Intr - 101489 101369 121 2 1 75 110 47 0.530 4.75 4.03 Intr - 104020 103237 784 0 1 2 53 618 0.592 40.25 4.02 Intr - 107064 106905 160 1 1 125 99 121 0.821 15.02 4.01 Init - 111863 111788 76 2 1 52 103 9 0.362 0.10 4.00 Prom - 113630 113591 40 -7.55 5.00 Prom + 113668 113707 40 -9.05 5.01 Sngl + 116615 116944 330 1 0 62 41 395 0.931 27.97 5.02 PlyA + 117028 117033 6 1.05 6.00 Prom + 118027 118066 40 -6.15 6.01 Sngl + 118869 120497 1629 2 0 33 39 648 0.972 49.44 6.02 PlyA + 121729 121734 6 1.05 7.02 PlyA - 122752 122747 6 1.05 7.01 Sngl - 128199 127729 471 0 0 42 38 201 0.559 6.37 7.00 Prom - 139662 139623 40 -4.05 8.00 Prom + 147674 147713 40 -4.05 8.01 Sngl + 153898 154710 813 0 0 37 43 252 0.947 11.12 8.02 PlyA + 155089 155094 6 1.05 9.04 PlyA - 155886 155881 6 1.05 9.03 Term - 167267 167035 233 0 2 62 42 148 0.448 3.25 9.02 Intr - 180750 180550 201 1 0 68 107 150 0.535 13.24 9.01 Init - 187727 187673 55 2 1 106 39 51 0.289 3.50 9.00 Prom - 189416 189377 40 -4.85 10.00 Prom + 190246 190285 40 -8.35 10.01 Init + 191694 191976 283 2 1 30 73 315 0.707 21.45 10.02 Intr + 195431 195643 213 0 0 100 97 20 0.451 1.86 10.03 Term + 197761 197951 191 2 2 93 43 52 0.369 -2.17 10.04 PlyA + 198544 198549 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 100305 99998 308 1 2 90 39 259 0.853 15.49 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580r:43170344_43374395|GENSCAN_predicted_peptide_1|66_aa MDIIEREEIAGSDEDGKKRKPLYTVVVMEEEIEEEQKQEEKAETKIERRKLGTCNASTQK QMLLEF >gi568815580r:43170344_43374395|GENSCAN_predicted_CDS_1|201_bp atggatattattgaaagagaagaaatagcaggttctgatgaagatgggaagaaaaggaaa cccttgtacactgttgtggtgatggaggaggagatagaggaagaacaaaaacaagaagaa aaagcagagacaaaaattgagagaagaaaattaggcacctgcaatgccagcacacagaaa caaatgctgttagaattttga >gi568815580r:43170344_43374395|GENSCAN_predicted_peptide_2|80_aa MNHTDAKYFLLHRIMRQTMMPTLTIPLEVPTRATRQEKEIKGIQIGKKEVKLFASNVIGY LENSSRVFSTPPESTPPESS >gi568815580r:43170344_43374395|GENSCAN_predicted_CDS_2|243_bp atgaatcatactgatgcaaaatactttctattgcatcgcatcatgagacagacaatgatg cccactctcaccattcctcttgaagttccaaccagagcaaccagacaagagaaagaaata aagggcatccaaatcggtaagaaggaagtcaaactgtttgctagcaatgtgattggttac cttgaaaactcctccagagttttcagtactcctccagaaagtactcctccagaaagctcc tag >gi568815580r:43170344_43374395|GENSCAN_predicted_peptide_3|95_aa MIEDQLRILEPEAFEEEFQLLIVTKEIRAMGINRLKVAEAGVQQSCLVEKDTEKTTRGNE TAKPAGILVGRASKTEGEYLQESSMFDVSDKLQSL >gi568815580r:43170344_43374395|GENSCAN_predicted_CDS_3|288_bp atgatagaggatcaactgagaattctggaaccagaagcatttgaggaagaattccaacta ctgattgtgactaaagaaataagagctatggggataaatagattgaaggtggcagaagct ggggttcaacaaagttgccttgtagaaaaggacactgagaagacaaccagaggaaatgaa acagcaaagcctgctggaattctagtggggagagcatccaagacagaaggggagtacctg caagagagcagtatgtttgatgtttctgataagctgcaaagcctatag >gi568815580r:43170344_43374395|GENSCAN_predicted_peptide_4|560_aa MSTINSYVAVYFSSIFFTILLYAFDGAGALGSALLFSALRKPALEIQASESSQAVFPSAP RTEHAVKNGSDHHQPGRICAFGLVFTVSLFAWICCQRKSSKSNKTPPYKFVHVLKGVDIY PENLNSKKKFGADDKNEVKNKPAVPKNSLHLDLEKRDLNGNFPKTNLKPGSPSDLENATP KLFLEGEKESVSPESLKSSTSLTSEEKQEKLGTLFFSLEYNFERKAFVVNIKEARGLPAM DEQSMTSDPYIKMTILPEKKHKVKTRVLRKTLDPAFDETFTFYGIPYTQIQELALHFTIL SFDRFSRDDIIGEVLIPLSGIELSEGKMLMNREIIKRNVRKSSGRGELLISLCYQSTTNT LTVVVLKARHLPKSDVSGLSDPYVKVNLYHAKKRISKKKTHVKKCTPNAVFNELFVFDIP CEGLEDISVEFLVLDSERGSRNEDHNSSPAREQNRMENEFDKLTEIGFRRWAITNSSELK EHALIQCKEAQNLAKRLDELLTRITSLEKNINDLMELKNTVRELCEAYTNINSQIYQEEE RISEIEDNEIKQKEKMREIK >gi568815580r:43170344_43374395|GENSCAN_predicted_CDS_4|1683_bp atgtctacaattaacagttatgttgctgtatatttctcatcaatatttttcacaattttg ctttacgcatttgatggagctggtgccctgggctctgcgctgttgttttcagcgctccga aagccggcgcttgagatccaggcaagtgaatccagccaggcagttttcccttcagcacct cggacagaacacgcagtaaaaaatggctccgatcaccaccagccgggaagaatttgtgca tttggcctggtcttcacagtctctctctttgcatggatctgctgtcagagaaaatcatcc aagtctaacaagactcctccatacaagtttgtgcatgtgcttaagggagttgatatttac cctgaaaacctaaatagcaaaaagaagtttggagcagatgataaaaatgaagtaaagaat aagccagctgtgccaaagaattcattgcatctggatcttgaaaagagagatctcaatggc aattttcccaaaaccaacctcaaacctggcagtccttctgatctggagaatgcaaccccg aagctctttttagaaggggaaaaagagtcagtttcccctgagagtttaaagtccagcact tcccttacttcagaagagaaacaagagaagctgggaactctcttcttctccttagaatac aacttcgagagaaaagcatttgtggtcaatatcaaggaagcccgtggcttgccagccatg gatgagcagtcgatgacctctgacccatatatcaaaatgacgatcctcccagagaagaag cataaagtgaaaactagagtgctgagaaaaaccttggatccagcttttgatgagaccttt acattctatgggataccctacacccaaatccaagaattggccttgcacttcacaattttg agttttgacaggttttcaagagatgatatcattggggaagttctaattcctctctcggga attgaattatctgaaggaaaaatgttaatgaatagagagatcatcaagagaaatgttagg aagtcttcaggacggggtgagttactgatctctctctgctatcagtccaccacaaacact ctaactgtggttgtcttaaaagctcgacatctgcctaaatctgatgtgtccggactttca gatccctatgtcaaagtgaacctgtaccatgccaaaaagagaatctccaagaagaagact catgtgaagaaatgcacccccaatgcagtgttcaatgagctgtttgtctttgatattcct tgtgagggccttgaagatataagtgttgaatttttggttttggattctgaaagggggtcc cgaaatgaggatcacaattcctcgccagcaagggaacaaaacaggatggagaatgagttt gacaaattgacagaaataggcttccgaaggtgggcaataacaaactcctccgagctaaag gagcatgctctaatccaatgcaaggaagctcagaaccttgcaaaaaggttagacgaattg ctaactagaataaccagtttagagaagaacataaatgacctgatggagctgaaaaacaca gtacgagaactttgtgaagcatacacaaatatcaatagccaaatctatcaagaggaagaa aggatatcagagattgaagataatgaaataaagcagaaagaaaagatgagagaaataaaa tga >gi568815580r:43170344_43374395|GENSCAN_predicted_peptide_5|109_aa MGKRQSRKTGNSKNQSASPPPKECSSSPAMEQSWMENDFDEFREEGFRRSNYSELQEEIR TNGKEVKSFEKKLDEWITRITNAEKSLKDLMELKTKARELHDECTSLSS >gi568815580r:43170344_43374395|GENSCAN_predicted_CDS_5|330_bp atgggaaaaagacagagcagaaaaactggaaactctaaaaatcagagtgcctctcctcct ccaaaggaatgcagctcctcaccagcaatggaacaaagctggatggagaatgactttgat gagttcagagaagaaggcttcagacgatcaaactactctgagctacaggaggaaattcga accaacggcaaagaagttaaaagctttgaaaaaaaattagacgaatggataactagaata accaatgcagagaagtccttaaaggacctgatggagctgaaaaccaaggctcgagagcta catgatgaatgcacaagcctcagtagctga >gi568815580r:43170344_43374395|GENSCAN_predicted_peptide_6|542_aa MKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMHKFLDTYILPRLNQEEVESLNRP ITGSEIEAIINSLPTKKSPGPDGFTAEFYKRYKEEMVPFLLKLFQSIEKEGILPNSLYEA SIILIPKPGRDTTKRDNFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQC WFNICKSINIIQHINRIKDKNHMIISIDAEKAFDKIQQLFMLKTRNKLGIGGTYLKIITA IYDKPTANIILNGQKLEAFPLKMGTREGCLLSPLLFNIVLEVLARAIRQEKEIKGIQLGK EEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLCTNNRQTESQ IMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINI VTMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRAHIAKSIVSQKNKAGGITL PDFKLYYKAAVTKTAWYWYQNRDIDQWNRTEPSEIMLHIYKYLIFDKPEKNKQWGKDSLF NK >gi568815580r:43170344_43374395|GENSCAN_predicted_CDS_6|1629_bp atgaaaaatgataaaggggatatcaccaccgatcccacagaaatacaaactaccatcaga gaatactataaacacctctatgcaaataaactagaaaatctagaagaaatgcataaattc ctcgacacatacatcctcccaagactaaaccaggaagaagttgaatctctgaatagacca ataacaggctctgaaattgaggcaataatcaatagcttaccaaccaaaaaaagtccagga ccagatggattcacagccgaattctacaagaggtacaaagaggagatggtaccatttctt ctgaaactattccagtcaatagaaaaagagggaatcctccctaactcactttatgaggcc agcatcatcttgataccaaagccgggcagagacacaaccaaaagagataattttagacca atatccttaatgaatatcgatgcaaaaatcctcaataaaatactggcaaacagaatccag cagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaatgc tggttcaacatatgcaaatcaataaatataatccagcatataaacagaatcaaagacaaa aaccacatgattatctcaatagatgccgaaaaggcctttgacaaaattcaacaactcttc atgctaaaaactcgcaataaattaggtattggtgggacgtatctcaaaataataacagct atctatgacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccct ttgaaaatgggcacaagagagggatgccttctctcaccactcctattcaacatagtgttg gaagttctggccagggcaatcaggcaggagaaggaaataaagggtattcaattaggaaaa gaagaagtcaaattgtccctgtttgcagatgacatgattgtatatctagaaaaccccatt gtctcagcccaaaatctcctcaagctgataagcaacttcagcaaagtctcaggatacaaa atcaatgtacaaaaatcacaagcattcttatgcaccaataacagacaaacagagagccaa atcatgagtgaactcccattcacaattgcttcaaagaggataaaatacctaggaatccaa cttacaagggacgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaata aaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatt gtgacaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaagcta ccaatgactttcttcacagaattggaaaaaaccactttaaagttcatatggaaccaaaaa agagcccacattgccaagtcaatcgtaagccaaaagaacaaagctggaggcatcacgcta cctgacttcaaactatactacaaggctgcagtaaccaaaacagcatggtactggtaccaa aacagagatatagaccaatggaacagaacagagccctcagaaataatgctgcatatctac aagtatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctattt aataaatag >gi568815580r:43170344_43374395|GENSCAN_predicted_peptide_7|156_aa MIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLK TGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVS AQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTE >gi568815580r:43170344_43374395|GENSCAN_predicted_CDS_7|471_bp atgattatctcaatagatgcagaaaaagcctttgacaaaattcaacaacccttcatgcta aaaactctcaataaattaggtattgatgggacgtatttcaaaataataagagctatctat gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa actggcacaagacagggatgccctctctcaccgctcctattcaatatagtgttggaagtt ctggccagggcaatcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaa gtcaaattgtccctgtttgcagatgacatgattgtttatctagaaaaccccatcgtctca gcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaat gtacaaaaatcacaagcattcttatacaccaacaacagacaaacagagtaa >gi568815580r:43170344_43374395|GENSCAN_predicted_peptide_8|270_aa MNIDAKILNKILAKRIQQHIKKLIHHDQVGFIPGMQGWFNIHKSINVIQHINRTKDKNHM IISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLKT GTRQGCPLSSLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLCADDMIVYLENPIVSA QNLLKLISNFSKVSGYKINVLKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTR DVKDLFKENYKPLLNEIKEDTNKWKNFPCS >gi568815580r:43170344_43374395|GENSCAN_predicted_CDS_8|813_bp atgaacattgatgcaaaaatcctcaataaaatactggcaaaacggatccagcagcacatc aaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaac atacacaaatcaataaatgtaatccagcatataaacagaaccaaagacaaaaaccacatg attatctcaatagatgcagaaaaagcctttgacaaaattcaacaacccttcatgctaaaa actctcaataaattaggtattgatgggacgtatttcaaaataataagagctatctatgac aaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaact ggcacaagacagggatgccctctctcatcactcctattcaacatagtgttggaagttctg gccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagaagaagtc aaattgtccctgtgtgcagacgacatgattgtatatctagaaaaccccattgtctcagcc caaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgta ctaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatgagt gaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagg gatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggat acaaacaaatggaagaactttccatgctcatag >gi568815580r:43170344_43374395|GENSCAN_predicted_peptide_9|162_aa MEMLDELPLFNSEDLTIVFFHGHETMNLLGITPDERGCFSLSKPLSSFKALDENNDQAEI DLNCFKDLTSIPKYISSGGTFWSHSVLEVLAREIREEKEINGIQISKEKMKLSLFADDMV VYLENPKDSSRKLLELIKEFSKVSRYKINVHKSVALLYTNND >gi568815580r:43170344_43374395|GENSCAN_predicted_CDS_9|489_bp atggagatgcttgatgagcttccattgttcaactccgaggatcttaccattgtctttttc catggccatgagacaatgaatcttctgggtatcaccccagatgaacgaggctgcttcagt ttgagtaaaccactgtcttccttcaaggccctggatgaaaacaatgatcaagcagaaata gatcttaactgctttaaagacttaacttcaattccaaaatacatctctagtggaggaacc ttttggtctcattctgtactggaagtcctagccagagaaatcagagaagagaaagaaata aatggcatccaaatcagtaaagagaaaatgaaactatcattgtttgctgacgatatggtc gtttaccttgaaaacccaaaggactcctccagaaagctcctagaactgataaaagaattc agcaaagtttccagatacaagatcaatgtacacaaatcagtagctcttctatacaccaac aatgactaa >gi568815580r:43170344_43374395|GENSCAN_predicted_peptide_10|228_aa MSESLKTREADSAALSLQPKAQGPQGRCWHKFQSQRLKILESDVQGQEEWKEANILHVKK KEPEDSACKLISSSPAFFVPAALVANWMMPTHIEGKEGEKKGERKVGRREEENKEKRREA EREKERKGESEGGGREGKERQQEGMKGRKKDGRGREGGDLGWRKKETGKSKSYWARCSTK VYMKAKSQQETTFFTRRPSNVKLHQLLTCLLFQMRTNGFISYSSCSKH >gi568815580r:43170344_43374395|GENSCAN_predicted_CDS_10|687_bp atgtccgaaagcctcaaaaccagggaagctgacagtgcagccttaagtctgcagccgaag gcccaagggccccagggaagatgctggcacaagtttcagagccagcgtctgaagatcctg gagtctgatgtccaagggcaggaagagtggaaagaagcaaacatcctgcacgtgaagaag aaagagccagaagactcagcatgcaaacttatctcatcttctcctgctttctttgttcca gctgcattggtagccaattggatgatgcccacccacattgagggaaaggagggagagaag aagggagagagaaaggtaggaaggagggaagaagagaacaaagaaaaaagaagggaagca gaaagggagaaagaaagaaagggagagagtgagggaggagggagggaaggaaaggaaaga cagcaggaaggaatgaaaggaagaaagaaggatgggagagggagggagggaggggacctg ggatggaggaagaaagaaacaggaaaaagcaagagctactgggcaaggtgtagtacaaag gtatacatgaaagctaagagtcaacaagaaactacgtttttcactagaaggccttctaat gtgaagttacaccaactgctgacatgccttctcttccaaatgcgtaccaatgggtttatt tcctattcaagttgcagcaagcattaa