During transcription initiation, RNA polymerase binds tightly to the promoter DNA defining the start of transcription, transcribes comparatively slowly, and frequently releases short transcripts (3-8 nucleotides) in a process called abortive cycling. Transitioning to elongation, the second phase of transcription, the polymerase dissociates from the promoter while RNA synthesis continues. Elongation is characterized by higher rates of transcription and tight binding to the RNA transcript. The RNA polymerase from enterophage T7 (T7 RNAP) has been used as a model to understand the mechanism of transcription in general, and the transition from initiation to elongation specifically. This single-subunit enzyme undergoes dramatic conformational changes during this transition to support the changing requirements of nucleic acid interactions while continuously maintaining polymerase function. Crystal structures, available of multiple stages of the initiation complex and of the elongation complex, combined with biochemical and biophysical data, offer molecular detail of the transition. Some of the crystal structures contain a variant of T7 RNAP where proline 266 is substituted by leucine. This variant shows less abortive products and altered timing of transition, and is a valuable tool to study these processes. The structural transitions from early to late initiation are well understood and are consistent with solution data. The timing of events and the structural intermediates in the transition from late initiation to elongation are less well understood, but the available data allows one to formulate testable models of the transition to guide further research.
Citation: Karsten Theis. Snapshots of a Viral RNA Polymerase Switching Gears from Transcription Initiation to Elongation[J]. VIROLOGICA SINICA, 2013, 28 (6): 337-344 https://doi.org/10.1007/s12250-013-3397-3
Received: 23 October, 2013; Accepted: 28 November 2013; Published: 2 December 2013
Copyright: © Wuhan Institute of Virology, CAS and Springer-Verlag Berlin Heidelberg 2013
Data Availability: All relevant data are within the paper and its Supporting Information files.
Corresponding author: Karsten Theis, Received: Phone: +1-413-572 5312, Fax: +1-413-572 5441, E-mail: firstname.lastname@example.org.
When enterophage T7 infects enterobacteria, the success of the phage relies on the amazing catalytic efficiency of its RNA polymerase (T7 RNAP), which adds up to 250 nucleotides per second to an RNA transcript. In a matter of minutes after infection, protein expression is dominated by viral proteins mainly because T7 RNAP so efficiently transcribes RNA. This efficiency is harnessed in biotechnology and basic research. Without T7 RNAP, for example, structural biology would not be the same. Countless proteins studied by X-ray crystallography and NMR have been expressed using T7 RNAP-based expression systems in Escherichia coli (Studier F W, et al., 1986). Likewise, large scale preparations of RNA used in structural studies rely on T7 RNAP-catalyzed in vitro synthesis (Milligan J F, et al., 1987). Moreover, our understanding of transcription itself has been advanced enormously by using the T7 RNAP as a model enzyme. Even though T7 RNAP is smaller than most other RNA polymerases, and is composed of a single subunit (different from the multi-subunit RNAPs of prokaryotes and eukaryotes), it nevertheless displays most of the complexity of its larger "cousins". As a model system, T7 RNA polymerase has been thoroughly characterized in terms of both its structure and its mechanism; however, some big questions remain. This review focuses on structural and functional transitions during the synthesis of the first dozen or so nucleotides early in transcription.
Transcription occurs in two phases, initiation and elongation. While initiation involves drastic conformational changes in both protein and nucleic acid and is relatively slow, elongation involves only subtle conformational changes and is highly efficient. To initiate transcription, RNA polymerases bind sequence-specifically to promoter DNA and open a bubble in the DNA, exposing a single-stranded portion of the template strand in preparation for RNA synthesis. During transcription initiation, a growing DNA – RNA hybrid forms, with the 5'-end of RNA maintaining its interaction with the template strand (Fig. 1). In this phase of transcription, frequent loss of RNA from the initiation complex (IC) is observed. Because the enzyme remains bound to the promoter in these events, transcription can reinitiate quickly, leading to a process called abortive cycling that only ends when the RNA exceeds a certain length. Once the RNA reaches that length (for T7 RNAP, about 9-12 nucleotides), the promoter is released, the bubble collapses, and the 5'-end of the RNA dissociates from the template strand, resulting in a smaller size of the bubble (for T7 RNAP, a final size of about 8 nucleotides). In this elongation complex (EC), the size of the hybrid and the bubble is maintained as the polymerase translocates on the template strand. The events leading from transcription initiation to transcription elongation (reviewed for T7 RNAP in Martin C T et al., 2005 and Steitz T A, 2009) are summarized in Table 1.
While the catalytic role of RNAP is the same for initiation and elongation, its role in interacting with the nucleic acids changes dramatically. In RNAPs of prokaryotes and eukaryotes, this change in function is achieved by multi-subunit polymerases that shed promoter-binding subunits (such as the E. coli sigma factor) when transitioning from initiation to elongation. As a single-subunit polymerase able to initiate without additional protein factors, T7 RNAP achieves this change of function by undergoing a dramatic set of conformational changes, as crystal structures of the initiation and elongation complex show (Cheetham G M, et al., 1999; Yin Y W, et al., 2002; Tahirov T H, et al., 2002) (Fig. 2A and C). Similar to a gear box in a car, where a change in the juxtaposition of gears results in a different gear ratio, conformational changes in T7 RNAP result in different binding sites and protein-nucleic acid interactions when comparing the initiation complex to the elongation complex. The dramatic changes occur predominantly in the N-terminal part of the protein. In the initiation complex, the promoter-binding domain (PBD) interacts with the specificity loop to form the binding interface with the promoter (Fig. 1). In the elongation complex, these interactions are lost as the PBD moves away from its original position. Instead, the specificity loop now interacts with a sub domain newly formed from helices H1 and H2, forming a channel through which the 5'-end of the RNA exits. It has been argued that the high processivity of the elongation complex is due in some measure to the RNA bound tightly on the 3' side by the active site and on the 5' side by the RNA exit channel, topologically locking the template strand to the polymerase (Liu X, et al., 2009).
Knowing the initial and final states in a conformational change sometimes suggests a plausible path to achieve this transition. In this case, however, there is a substantial conformational change accompanied by the loss and gain. of many protein-protein and protein-nucleic acid interactions, begging for more structural data on intermediate states. In 2008, structures of an initiation complex of the P266L variant of T7 RNAP bound to 7 nt (IC7) and 8 nt RNA (IC8) were determined (Durniak K J, et al., 2008), confirming that the transition from initiation to elongation is not a two-state transition, but instead goes through multiple structural intermediates (Bandwar R P, et al., 2007)
Crystals of the IC7 and IC8 intermediate states using the P266L mutant
Crystallography is a slow technique, and special measures have to be taken to capture intermediates that are short-lived. It is possible to stall RNA polymerase at any position by leaving out nucleotides, or by offering a 3'-deoxy nucleotide. Another approach is to pre-assemble the RNA-DNA complex (using strategically placed mismatches between template and non-template strands to favor formation of the RNA-DNA hybrid and the transcription bubble) and allow the enzyme to bind to it (Daube S S, et al., 1992). The latter approach was used in crystallizing the elongation complex (Tahirov T H et al., 2002, Yin Y W, et al., 2002). For the IC7 and IC8 intermediate, RNA dissociation presents a hurdle to either approach. The P266L variant of T7 RNAP was discovered in a genetic screen set up to select for mutations that show less abortive cycling (Guillerez J, et al., 2005). Transcription assays show that the P266L variant aborts less throughout, and most markedly at positions +5 through +9. Using P266L, which is less prone to abortive cycling, allowed Durniak K J et al. (2008) to assemble late initiation complexes sufficiently stable for crystallization.
As the P266L mutation does not interfere with any aspect of transcription, these crystal structures represent on-pathway intermediates of transcription. How representative are they of transcription in the wild type enzyme? Guillerez J et al (2005) argued that weaker promoter binding of P266L might allow an earlier transition to the elongation conformation, explaining the reduction in abortive products. Using a multipronged approached, Ramírez-Tapia L E and Martin (2012) compared the timing of the transition (by testing for promoter loss using a fluorescent technique and by testing for conformational change in the enzyme using a proteolytic cleavage susceptibility assay) at different stages in transcription. Halted at positions up to +8, no transition to elongation was detected (in the 60 sec time frame of the assay) in either mutant or wild type. However, the two enzymes showed marked differences at position +9 in both assays, with a large fraction of wild type enzyme already in the elongation state while the P266L mutant remained in the promoter-bound initiation state. Overall, the data show that promoter loss and the large conformational change of the enzyme is delayed in P266L compared to the wild type enzyme. It is not clear if more subtle events leading up to promoter loss are also delayed in P266L, i.e. the IC7 structure of P266L might be representative of the wild type enzyme in its IC6 state in some aspects. For the interpretation of the IC8 structure, there are multiple concerns. First, the structure was determined at very low resolution (7 Å) using molecular replacement to obtain phases. While the structure clearly shows that the promoter is still bound, and this is important information in itself, any more detailed inter pretation of the coordinates should be done with extreme caution as one expects severe model bias at this low resolution. Second, it is unclear whether the structure is representative of the IC8 state of the wild type, or whether the wild type would already have undergone changes leading to promoter loss. Given that the IC7 and IC8 structures are very similar and the IC7 structure was determined at much higher resolution, this review focuses on the IC7 structure, which contains a wealth of information about the transition of T7 RNAP from initiation to elongation.
Structural changes from early to late initiation
In the structural transition from IC3 to IC7, promoter, the promoter binding domain and the specificity loop move as a rigid body away from the C-terminal domain, presumably pushed by the hybrid (Durniak K J, et al. 2008). Consequently, enzyme-promoter interactions are virtually unchanged, and space opens up to accommodate more template DNA and newly synthesized RNA within the enzyme. To allow the rigid body movement, which is a 40° rotation about an axis near the -4 region of the promoter, several elements in the structure undergo hinge or shear motions. Hinge motions include the base of the specificity loop, the single-stranded DNA connecting promoter and RNA-DNA hybrid, and the loop including residue 150 connecting helices G and H. The most dramatic change is the shear motion of helices C2 and D, which are connected with a loop that is disordered in both the IC3 and the IC7 structures (Fig. 3). In both structures, the two helices are in contact distance with a hydrophobic interface, but one helix moves relative to the other by more than 12 Å, reorganizing the contacts between hydrophobic residues Phe 51 and Phe 55 of helix C2 and Ile 74, Thr 75 and Leu 78 of helix D. There is a precedent for flat hydrophobic interfaces that allow multiple relative orientations of interaction partners (Ritacco C J, et al. 2013), suggesting that as RNA is extended stepwise from 3 to 7 nt, T7 RNAP might provide space stepwise by successively rotating around this axis.
Protein-RNA interactions in IC7, and correlations with abortive cycling
T7 RNAP, like other RNA polymerases, releases short RNA fragments early in transcription in a process called abortive cycling. The mechanism of abortive cycling comes down to binding interactions and dissociation kinetics of RNA. A stably bound RNA will not dissociate, but even a weakly bound RNA will stay bound if the dissociation rate is slow compared to the rate of adding the next nucleotide. From first principles and setting aside protein-nucleic acid interactions for the moment, one would expect that as the RNA-DNA hybrid gets longer, binding strength would increase while dissociation rates would decrease, simply because the number of base pairing and stacking interactions increase with length of the hybrid. If the transcription rate is independent of RNA length, rates of RNA fall off should decrease as transcription proceeds. However, transcription assays show that some RNA lengths are more prone to fall off and some less, varying in a non-systematic way. This points to discontinuous events during transcription, such as the DNA-RNA hybrid running out of space in the binding cavity of the enzyme, or the enzyme slowing down as conformational change becomes necessary to provide space. Vahia and Martin (2011) have probed the energetic basis of abortive cycling by systematically increasing or reducing different kinds of possible "stress" (destabilizing interactions) proposed in models of abortive cycling and measuring the ratio of abortive products (with a length of 2 to 6 nt) to longer products. None of the manipulations (changing the size of the template strand between promoter and transcription start site, increasing the size of RNA by adding bulk to the 5'-end, changing the energetic of bubble opening or collapse by introducing mismatches between template and non-template strands) show systematic changes in the amount of abortive products, suggesting that steric clashes might not directly influence binding strength, RNA dissociation kinetics and transcription rate in a way that explains the observed length distribution of abortive products.
Comparing the binding interactions of RNA with the enzyme in the IC3, IC7 and EC structures suggests that the set of interactions observed in the EC structure do not continuously become available as the RNA product increases in length, but appear in discontinuous jumps as the enzyme undergoes conformational change to expose or create binding interfaces (see Table 2). For example, while many of the C-terminal interaction partners seen in the EC structure are already in place in the IC7 structure, the exit channel lined by N-terminal residues is not. Once the protein undergoes the conformational change creating the exit channel and the 5' end of RNA binds to it, RNA binding affinity will increase markedly. Likewise in the transition from IC3 to IC7, residues of the thumb helix which eventually interact with the phosphate backbone and the minor groove of the hybrid (such as Arg 389) are engaged in interactions with the promoter binding domain. Only when the conformational change seen between the IC3 and IC7 structures occurs to expose this binding interface for contacts with the hybrid, will those interactions be able to kick in, again stabilizing the enzyme-hybrid interactions. The timing of protein conformational change making these nucleic acid binding interactions available might explain the observed non-systematic pattern of abortive products, but exploring this idea experimentally would require a probe sensitive to the protein conformational changes occurring from IC3 to I7.
Structural changes from late initiation to elongation
The IC7 structure shows how T7 RNAP makes space for the growing hybrid while remaining bound to the promoter. Proceeding from IC7 to EC, the promoter is released from the enzyme, allowing the initially melted bubble to collapse, driving displacement of the 5' end of the RNA (Gong P et al 2004). Supporting these changes in nucleic acid structure and binding, the N-terminal domain of the enzyme undergoes substantial reorganization, including dissociation of the specificity loop from the promoter binding domain, an extensive rigid body movement of the latter, a refolding of sub domain H and fusing of helices C1 and C2 (see Fig. 1 and 2). As a result of the reorganization, the RNA exit channel forms, lined by helix C, specificity loop, sub domain H, and a loop from 292 to 301. Because structural data on the protein conformation between the IC8 and EC state are lacking, it is not clear in which order and with which timing they might occur. It has been suggested that the interactions between helix C1 and the C-terminal domain, which are observed to persist without change in IC3, IC7 and EC structures, might serve to organize the conformational change by staying intact throughout (Theis K, et al., 2004). Overall, the EC structure is better defined than the IC7 structure (the loop between helix C2 and PBD and the entire sub domain H are resolved in the EC structure, and it was determined at higher resolution), presumably because of multiple additional protein-protein contacts such as the one between sub domain H and the specificity loop, and a gain in secondary structure upon transition to the EC c onformation. Curiously, the mitochondrial RNA polymerase, which shows high homology to T7 RNAP in the C-terminal domain and has an N-terminal element resembling the fold of the PBD in T7 RNAP, only undergoes subtle conformational change comparing apo-structure and elongation complex (Schwinghammer K, et al., 2013). A comprehensive comparison between these two enzymes will be possible only when an initiation complex of mitochondrial RNAP including its requisite initiation factors becomes available.
Possible triggers of promoter loss
The IC7 and IC8 structures show how T7 RNAP remains bound to the promoter while the enzyme undergoes conformational change to make room for the growing hybrid. As RNA synthesis extends past 8 nt, further changes are necessary to avoid a potential clash of the upstream end of the hybrid and the PBD (Fig. 2B). To avoid this clash, either the PBD or the hybrid (or both) have to make room. If the hybrid never exceeds 8 bp in length because the template strand starts to dissociates from RNA once it becomes longer than 8 nt, an IC7/IC8 like protein conformation could be maintained as the growing RNA fills the cavity in the protein (model A). In this model, the trigger for promoter loss would be the loss of the specificity loop from the promoter binding site, "pushed away" by single stranded 5' RNA. Conversely, the hybrid might grow to lengths longer than 8 bp (8 bp is the length in the IC7 structure as well as the EC structure) if the PBD continues to move away from the active site (model B). The trigger for promoter loss would be the growing hybrid pushing on the promoter binding domain until the binding site falls apart, perhaps because the covalent connections of the PBD with the C-terminal domain of the enzyme become "overstretched". From the structural data available at this point, however, it is not clear how the enzyme would support promoter binding while harboring a hybrid up to 12 bp in length. Merely continuing the rigid body motion observed from IC3 to IC7 to arrive at a putative IC12 structure would lead to clashes with the downstream DNA and fail to clear the path for the hybrid, which would need an addition al~17 Å space to accommodate another 5 bp compared to the IC7 structure (Fig. 2B). On the other hand, biochemical evidence suggests that at least some fraction of the enzyme remains promoter bound up to an RNA length of 12 nt (Tang G Q et al., 2009, Ramírez-Tapia L E, et al., 2012). This stage of the transition clearly warrants further research.
What is the role of Proline 266 in the structural transitions?
In some sense, the P266L mutant is a better enzyme than the wild type. Is it possible to rationalize how the mutation affects the amount of abortive products and the timing of initiation to elongation transition from the structural data? The chemical environment of Proline 266 changes both from IC3 to IC7 and from IC7 to EC structures (Fig. 4). Proline 266 is part of a loop that connects the most C-terminal helix of the promoter binding domain with the C-terminal domain of the protein, which does not change conformation in the IC to EC transition. In the IC3 and EC structures, the entire loop is well-defined, but in the intermediate IC7 structure, it is partially disordered (no coordinates for residues) and Pro 266 lies on the edge of the disordered segment, so its conformation might not be as well defined as in the other two states. In the early initiation conformation, proline 266 is part of a hydrophobic patch that includes Phe 400 (part of the thumb helix), Met 431 and Phe 432. As the polymerase transitions, Pro 266 moves away from these residues. For example, the distance between the C-alpha atoms of residues 266 and 400 is 7.1 Å in the IC3 structure, increases to 10.0 Å in the IC7 structure, and increases further to 15.2 Å in the EC structure. At the same time, Pro 266 approaches residues at the junction between helix C1 and C2. For example, the distance between the C-alpha atoms of residues 266 and Tyr 44 decrease from 11.9 Å in the IC3 structure to 5.5 Å in the IC7 structure to 4.4 Å in the EC structure. In the latter structure, helix C1 fuses with helix C2 to form a longer continuous helix that defines one side of the RNA exit channel. When the transition to elongation is complete, Pro266 stacks on the aromatic side chain of Tyr 44, again as part of a hydrophobic patch, and the entire loop containing Pro266 is resolved. Given that Pro 266 switches chemical environment at least twice (losing hydrophobic interactions with the thumb helix, and later gaining hydrophobic interactions with helix C1/C2), one can rationalize that the P266L mutation influences both early events (less 2-5 nt abortive) and late events (transitioning to EC later than the wild type enzyme). If the mutation disrupts the hydrophobic packing of residue 266 in both the IC3 and the EC conformation, one would expect that it becomes easier or faster to transition from IC3 to IC7, leading to less abortives. Likewise, one would expect that it becomes more difficult or slower to transition from IC7 to EC, explaining the observed later transition to elongation in Pro266Leu. Intriguingly, other mutations in this region, including the Phe55Pro mutation, result in higher levels of abortive products (Bandwar R P, et al., 2007) rather than the lower levels observed with Pro266Leu.
One of the fascinating aspects of virology is how viruses achieve biological function in a minimalist manner. The T7 RNA polymerase exemplifies this minimalism at the protein level. Smaller than most prokaryotic and eukaryotic polymerases, and not requiring any transcription factors, the T7 RNAP is able to do essentially the same job as its larger cousins. One example of this minimalism is the recycling of the specificity loop, first involved in specific major groove binding to the promoter during initiation, then as a lining of the RNA exit channel to provide non-specific electrostatic interactions during elongation; this is reminiscent of the dual use of genetic information through multiple overlapping reading frames at the nucleic acid level in viral genomes. The story of how T7 RNAP transitions from initiation to elongation is a fabulous example of rich protein-nucleic acid interactions, and the past decade has seen a wealth of structural and biochemical data that allows us to tell it. The last chapter of that story, however, is not written yet; it is still unknown how the large conformational change from late initiation to elongation occurs, and whether there are further stable intermediates. It is possible that X-ray crystallography, the method used to provide the detailed structural data available in this system, is unable to provide further information. If the conformational change occurs quickly through an heterogeneous ensemble of intermediates, other high resolution structural methods like NMR spectroscopy, cryo electron microscopy or methods still to be developed might be able to provide the data to start writing that chapter. In the meantime, the T7 RNA polymerase has already taught us invaluable lessons how transcription works, both in viral systems and, by extrapolation and comparison, in lower and higher organisms.
- . Bandwar R P, Ma N, Emanuel S A, Anikin M, Vassylyev D G, Patel S S, McAllister W T. 2007. The transition to an elongation complex by T7 RNA polymerase is a multistep process. J Biol Chem, 31: 22879-22886.
- . Cheetham G M, Steitz T A. 1999. Structure of a transcribing T7 RNA polymerase initiation complex. Science, 286(5448): 2305-2309.
- . Daube S S, von Hippel P H. 1992. Functional transcription elongation complexes from synthetic RNA-DNA bubble duplexes. Science, 258(5086): 1320-1324.
- . Durniak K J, Bailey S, Steitz T A. 2008. The structure of a transcribing T7 RNA polymerase in transition from initiation to elongation. Science, 322(5901): 553-557.
- . Gong P, Esposito E A, Martin C T. 2004. Initial bubble collapse plays a key role in the transition to elongation in T7 RNA polymerase. J Biol Chem, 279(43): 44277-44285.
- . Guillerez J, Lopez P J, Proux F, Launay H, Dreyfus M. 2005. A mutation in T7 RNA polymerase that facilitates promoter clearance. Proc Natl Acad Sci U S A, 102(17): 5958-5963.
- . Liu X, Martin C T. 2009. Transcription elongation complex stability: the topological lock. J Biol Chem, 284(52): 36262-36270.
- . Martin C T, Esposito E A, Theis K, Gong P. 2005. Structure and function in promoter escape by T7 RNA polymerase. Prog Nucleic Acid Res Mol Biol, 80: 323-347.
- . Milligan J F, Groebe D R, Witherell G W, Uhlenbeck O C. 1987. Oligoribonucleotide synthesis using T7 RNA polymerase and synthetic DNA templates. Nucleic Acids Res, 15(21): 8783-8798.
- . Ramírez-Tapia L E, Martin C T. 2012. New Insights into the Mechanism of Initial Transcription. The T7 RNA polymerase mutant P266L transitions to elongation at longer RNA lengths than wild type. J Biol Chem, 287(44): 37352-37361.
- . Ritacco C J, Kamtekar S, Wang J, Steitz T A. 2013. Crystalstructure of an intermediate of rotating dimers within the synaptic tetramer of the G-segment invertase. Nucleic Acids Res, 41(4): 2673-82.
- . Schwinghammer K, Cheung A C, Morozov Y I, Agaronyan K, Temiakov D, Cramer P. 2013. Structure of human mitochondrial RNA polymerase elongation complex. Nat Struct Mol Biol, 20(11): 1298-1303.
- . Steitz T A. 2009. The structural changes of T7 RNA polymerase from transcription initiation to elongation. Curr Opin Struct Biol, 19(6): 683-690.
- . Studier F W, Moffatt B A. 1986. Use of bacteriophage T7 RNA polymerase to direct selective high-level expression of cloned genes. J Mol Biol, 189(1): 113-130.
- . Tahirov T H, Temiakov D, Anikin M, Patlan V, McAllister W T, Vassylyev D G, Yokoyama S. 2002. Structure of a T7 RNA polymerase elongation complex at 2.9 Å resolution. Nature, 420(6911): 43-50.
- . Tang G Q, Roy R, Bandwar R P, Ha T, Patel S S. 2009. Real-time observation of the transition from transcription initiation to elongation of the RNA polymerase. Proc Natl Acad Sci U S A, 106(52): 22175-22180.
- . Theis K, Gong P, Martin C T. 2004. Topological and conformational analysis of the initiation and elongation complex of t7 RNA polymerase suggests a new twist. Biochemistry, 43(40): 12709-12715.
- . Turingan R S, Liu C, Hawkins M E, Martin C T. 2007. Structural confirmation of a bent and open model for the initiation complex of T7 RNA polymerase. Biochemistry, 46(7): 1714-1723.
- . Vahia A V, Martin C T. 2011. Direct tests of the energetic basis of abortive cycling in transcription. Biochemistry, 50(32): 7015-7022.
- . Yin Y W, Steitz T A. 2002. Structural basis for the transition from initiation to elongation transcription in T7 RNA polymerase. Science, 298(5597): 1387-1395.