Positions of the identified or predicted genes and gene segments in each insert of the P1 clones are schematically presented by color-coded boxes above (rightward) and below (leftward) the wide line in the middle which represents the entire insert sequence. The insert length is given in parentheses together with the clone name at the top. Arrowheads indicate the directions of the DNA strands (5' to 3'). Dark and faint blue bars with numbers represent the positions of the identified potential protein genes and potential exons, respectively, and red bars the positions of structural RNA genes. Gray bars with numbers indicate the positions of the transcribed regions. The regions which showed similarity to the sequences in the protein database are shown by yellow, orange and red bars, each of which corresponds to BLASTP scores of 70-100, 100-250, and 250 or more, respectively. The green bars indicate the positions of the potential exons predicted by the Grail program. Each of the three different colors with increasing depth corresponds to the region with the Grail scores of less than 70, 70-90, and 90 or more, respectively.
A single exon or a region containing consecutive multiple exons which showed similarity to a single reported gene throughout the alignment was assigned as a potential protein gene. They were denoted by numbers with the clone names followed by sequential numbers from one end to another of the insert as shown in the left column of the table in each of pages. A region which matched only to portions of a reported gene and only to Arabidopsis ESTs were assigned as a potential exon(s) and a transcribed region, respectively. These regions were distinguished from the potential protein genes by adding "p" and "t" between the clone names and the sequential numbers in the identifiers, respectively.
RNA coding regions were assigned on the basis of sequence similarity to the reported structural RNAs. For tRNA genes, prediction by the tRNAscan-SE program was taken into account. potential RNA genes were denoted by numbers with the clone names followed by "r" and sequential numbers.