Two genes overlap in a more subtle manner when the same sequence of DNA is shared between two
nonhomologous proteins. This situation arises when the same sequence of DNA is translated in more than one reading frame. In cellular genes, a DNA sequence usually is read in only one of the three potential reading frames, but in some viral and mitochondrial genes, there is an overlap between two adjacent genes that are read in different reading frames. This situation is illustrated in Figure 2.18. The distance of overlap is usually relatively short, so that most of the sequence representing the protein retains a unique coding function.
In some cases, the alternative means of expression do not affect the sequence of the protein; for example, changes that affect the 5
nontranslated leader or the 3
nontranslated trailer may have regulatory consequences, but the same protein is made. In other cases, one exon is substituted for another, as indicated in Figure 2.19.
In this example, the proteins produced by the two mRNAs contain sequences that overlap extensively, but that are different within the alternatively spliced region. The 3
half of the troponin T gene of rat muscle contains 5 exons, but only 4 are used to construct an individual mRNA. Three exons,
WXZ, are the same in both expression patterns. However, in one pattern the
? exon is spliced between
X and
Z; in the other pattern, the
? exon is used. The
? and
? forms of troponin T therefore differ in the sequence of the amino acids present between sequences W and Z, depending on which of the alternative exons,
? or
?, is used. Either one of the
? and
? exons can be used to form an individual mRNA, but both cannot be used in the same mRNA.
Figure 2.20 illustrates an example in which alternative splicing leads to the inclusion of an exon in some mRNAs, while it is left out of others. A single type of transcript is made from the gene, but it can be spliced in either of two ways. In the first pathway, two introns are spliced out, and the three exons are joined together. In the second pathway, the second exon is not recognized. As a result, a single large intron is spliced out. This intron consists of intron 1 + exon 2 + intron 2. In effect, exon 2 has been treated in this pathway as part of the single intron. The pathways produce two proteins that are the same at their ends, but one of which has an additional sequence in the middle. So the region of DNA codes for more than one protein. (Other types of combinations that are produced by alternative splicing are discussed in 24.12 Alternative splicing involves differential use of splice junctions).
Sometimes two pathways operate simultaneously, a certain proportion of the RNA being spliced in each way; sometimes the pathways are alternatives that are expressed under different conditions, one in one cell type and one in another cell type.
So alternative (or differential) splicing can generate proteins with overlapping sequences from a single stretch of DNA. It is curious that the higher eukaryotic genome is extremely spacious in having large genes that are often quite dispersed, but at the same time it may make multiple products from an individual locus. Alternative splicing expands the number of proteins relative to the number of genes by ~15% in flies and worms, but has much bigger effects in man, where ~60% of genes may have alternative modes of expression (see 3.11 The human genome has fewer genes than expected). About 80% of the alternative splicing events result in a change in the protein sequence.