Transcription starts with a nucleoside triphosphate (usually a purine, A or G). The first nucleotide retains its 5
![]()
triphosphate group and makes the usual phosphodiester bond from its 3
![]()
position to the 5
![]()
position of the next nucleotide. The initial sequence of the transcript can be represented as:
But when the mature mRNA is treated
in vitro with enzymes that should degrade it into individual nucleotides, the 5
![]()
end does not give rise to the expected nucleoside triphosphate. Instead it contains two nucleotides, connected by a 5
?5
![]()
triphosphate linkage and also bearing methyl groups. The terminal base is always a guanine that is added to the original RNA molecule after transcription.
Addition of the 5
![]()
terminal G is catalyzed by a nuclear enzyme, guanylyl transferase. The reaction occurs so soon after transcription has started that it is not possible to detect more than trace amounts of the original 5
![]()
triphosphate end in the nuclear RNA. The overall reaction can be represented as a condensation between GTP and the original 5
![]()
triphosphate terminus of the RNA. Thus
Gppp + pppApNpNp...
GpppApNpNp... + pp + p
The new G residue added to the end of the RNA is in the reverse orientation from all the other nucleotides.
This structure is called a cap. It is a substrate for several methylation events. Figure 5.18 shows the full structure of a cap after all possible methyl groups have been added. Types of caps are distinguished by how many of these methylations have occurred:
- The first methylation occurs in all eukaryotes, and consists of the addition of a methyl group to the 7 position of the terminal guanine. A cap that possesses this single methyl group is known as a cap 0. This is as far as the reaction proceeds in unicellular eukaryotes. The enzyme responsible for this modification is called guanine-7-methyltransferase.
- The next step is to add another methyl group, to the 2
?O position of the penultimate base (which was actually the original first base of the transcript before any modifications were made). This reaction is catalyzed by another enzyme (2
?O-methyl-transferase). A cap with the two methyl groups is called cap 1. This is the predominant type of cap in all eukaryotes except unicellular organisms. - In a small minority of cases in higher eukaryotes, another methyl group is added to the second base. This happens only when the position is occupied by adenine; the reaction involves addition of a methyl group at the N6 position. The enzyme responsible acts only on an adenosine substrate that already has the methyl group in the 2
?O position. - In some species, a methyl group is added to the third base of the capped mRNA. The substrate for this reaction is the cap 1 mRNA that already possesses two methyl groups. The third-base modification is always a 2
?O ribose methylation. This creates the cap 2 type. This cap usually represents less than 10-15% of the total capped population.
In a population of eukaryotic mRNAs, every molecule is capped. The proportions of the different types of cap are characteristic for a particular organism. We do not know whether the structure of a particular mRNA is invariant or can have more than one type of cap.
In addition to the methylation involved in capping, a low frequency of internal methylation occurs in the mRNA only of higher eukaryotes. This is accomplished by the generation of N6 methyladenine residues at a frequency of about one modification per 1000 bases. There are 1-2 methyladenines in a typical higher eukaryotic mRNA, although their presence is not obligatory, since some mRNAs do not have any (for review see Bannerjee, 1980).