-
The transactivator protein Tat of human immuno-deficiency virus-1(HIV-1) can strongly transactivate the HIV long terminal repeat (LTR) and is the most important regulator of viral gene expression and replication (7, 20). Hence, the Tat protein was considered an attractive target for antiviral therapy, and received much attention. Tat protein contains 86-104 amino acids depending on genotype, but three major functional domains are conserved: the N-terminal amphipathic α-helix domain, the cysteine-rich region which contains seven Cys residues and a third basic domain rich in Arg and Lys residues. This basic domain is responsible for interacting specifically with the transactivation response region (TAR) RNA (12, 21, 23). It has been proved that Tat is a multi-functional protein. One function is as an adaptor protein for cellular cofactors, many of which exhibit intrinsic enzymatic activities. Tat is also found to bind cyclin T1 through the highly conserved cysteine-rich region and recruit the cyclin-dependent kinase 9 (CDK9) to elongate HIV transcripts (1). Tat and cyclinT1 bind TAR cooperatively, and then induce phosphorylation of the C-terminal domain of RNA polymerase Ⅱ by CDK9. Among the Tat-binding proteins, there are a number of transcriptional coactivators with intrinsic histone acetyltransferase activity, e.g p300/CBP, p300/CBP-associated factor (PCAF), TAFⅡ250 and Tat-interacting protein 60 (Tip60) (1). Since Tat also induces remodeling of a single nucleosome (nuc-1) positioned at the HIV promoter, it was proposed that Tat stimulates transcriptional elongation of HIV both by increasing the intrinsic activity of the RNA polymerase Ⅱ complex to elongate more efficiently, and by recruiting histone-modifying enzymes to remodel the elongation block caused by nuc-1.
Despite its nuclear localization and function and the lack of any secretory signal sequence, Tat was found to be secreted by HIV-infected cells and to be taken up by affected neighbouring cells. Therefore, Tat is regarded as an important contributor to the pathogenesis of AIDS. The influence of this exogenous "bystander" effect on both infected and uninfected cells was thought to be mediated by its protein transduction domain (PTD). In fact, Tat can also fulfill the function as an exogenous factor, through interacting with a number of cellular membrane receptors including fibronectin, adhesion proteins and other cellular receptors (5, 10, 18, 24, 26) and then being internalized by means of endocytic activity (12). However, the exogenous activity and function of Tat is poorly studied due to difficulties with heterogeneously expressing this protein in E.coli bacteria. The preparation of Tat protein was assumed to be hampered by its high cysteine content and its susceptibility to oxidation leading to the formation of stable multimers (11). The successful cases of production of recombinant Tat are mainly restricted to the first exon of the tat gene (13, 19) or synthetic peptides (2). It was suggested that Tat protein exists in inclusion bodies and monomer forms as expression of Tat in E.coli (16) and the yield of purified product was relatively low (9). In this article, we cloned a full-length tat gene and compared the codon usage bias between eukaryotic Homo sapiens and prokaryotic E.coli. Results indicate that there are at least 14% rare codons in the Tat protein for E.coli. Therefore, the high ratio of rare codons might be the major reason for the poor expression level of Tat protein in E.coli. Just by changing the host strain from BL21(DE3) to Rosetta, which is supplemented with six tRNAs for Arg, Leu, Ile and Pro, the expression level of Tat increased dramatically. The achievement of effective expression and purification of the full-length, soluble dimer of Tat from a native gene, instead of a synthetic one, has paved the way to implement a method to fully investigate the exogenous function of HIV-1 Tat.
HTML
-
The full length coding region of the tat gene was cloned as described in the methods (GenBank accession number: EU001659), and sequence analysis revealed that there exists a greater diversity in the sequence as compared to the Tat proteins from other HIV virus strains (Fig. 1). However, its cysteine-rich domain comprising seven cysteines is conserved.
Figure 1. Comparison of Tat protein sequence we cloned with other different virus strains. The GenBank accession numbers are as follows: BH10 (M15654, K02008, K02009, K02010), CNHN24 (AY860947), 97CNGX (AAG38928) and the sequence we cloned 03CHHN (EU001659).
In the beginning, we tried to express this Tat protein from the E.coli BL21 (DE3) strain, but found that the this heterogeneous tat gene hardly expressed in E.coli BL21 (DE3). It has been reported that some eukaryotic genes are hard to express during heterologous gene expression, and the high cysteine content was assumed to be a key reason for this inhibition (11). Therefore, we compared the codon usage bias difference between Homo sapiens and E.coli using the Graphical codon usage analyzer (http://gcua.schoedl.de/seqoverall_v2.html). Comparison results indicated that there is a 40.5% mean difference (Fig. 2) mainly attributed to codons for Arg, Leu, Ile and Pro (Table 1). Among the E.coli rare codons encoded by tat, there are 9 of 9 for Arg, 1of 4 for Leu, 1 of 2 for Ile and 3 of 11 for Pro (Table 1).
Figure 2. Codon usage of HIV-1 tat gene between Homo sapiens and E.coli. codon usage analysis was performed in the Graphical codon usage analyzer website: http://gcua.schoedl.de/seqoverall_v2.html. Red bars means codon relative adaptiveness in Homo sapiens and black bars means codon relative adaptiveness in E.coli. Relative adaptiveness was derived from the codon usage frequency values and in positive correlation with it.
Table 1. The rare codons of tat gene for E.coli
-
To express Tat-his6 and GST-tat protein, pET22b-tat-his6 and pGEX5T-tat constructs were transformed into BL21 (DE3). After induction with IPTG, the lysate was subjected to SDS-PAGE and western blot analysis. As Fig. 3 shows, neither Tat-his6 nor GST-tat can be expressed in BL21 (DE3), no signal of Tat protein can even be detected by the western blot. As mentioned above, the rare codons could be key contributors to the inhibition of tat expression. Indeed, at least 14% (14 of 101) of the codons were rarely used in E.coli. Notably, all of the codons for Arg were rarely used in E.coli (Table 1). Accordingly, we then used the Rosetta strain, which was supplemented with six tRNAs for Arg, Leu, Ile and Pro compared to the BL21 (DE3). After induction with IPTG, the lysate was analyzed with SDS-PAGE and Western blot. As shown in Fig. 3, both Tat-his6 and GST-tat were expressed at high levels in the Rosetta strain, subsequent Western blot analysis also confirmed this expression. Further analysis revealed that all the expressed Tat-his6 and GST-tat fusion protein were soluble despite the fact that the induction was performed at 37℃.
Figure 3. A: Expression of GST (pGEX5T), GST-tat (pGEX5T-tat) fusion protein in E.coli BL21 (DE3) and Rosetta strains and Western blot diction with anti-GST antibody. B: Expression of Tat-his6 fusion protein (pET22b-tat) in E.coli BL21 (DE3) and Rosetta strains and Western blot with anti-His6 antibody.
-
To further characterize the recombinant Tat, trypsinized or non-trypsinized proteins were subjected to MALDI-TOF-MS analysis. Firstly, the molecular weight of purified Tat-his6 was calculated by MALDI-TOF-MS. As shown in Fig. 4, the molecular weight of Tat (12548.843, Fig. 4 A) is closer to the theoretical value (12599.30) when protease inhibitor cocktail (Roche) was added to the lysate during purification and the mass was measured immediately afterwards. On the contrary, the molecular weight value (11039.256, Fig. 4 B) is much smaller than the theoretical one when protease inhibitor was not added to the lyaste during purification and measured 48 h after purification at 4℃, which means the protein lost more than ten amino acid residuals. Secondly, we measured the peptide mass fingerprint (PMF) of recombinant Tat by MALDI-TOF-MS and predicted the PMF of Tat using the peptide-mass tool (http://expasy.org/tools/peptide-mass.html). There was some difference between measured and predicted PMF (Table 2, Fig. 4C). We believe that this difference can be attributed to the diversity between different strains. In fact, the database search (www. matrixsciences.com) revealed that measured PMF can be matched with other mutants of Tat (data not shown).
Figure 4. MALDI-TOF-MS analysis of recombinant Tat protein (molecular determination). A: The molecular weight of recombinant Tat protein determined by MALDI-TOF-MS immediately after purification. B: The molecular weight of recombinant Tat protein determined by MALDI-TOF-MS after 48h storage at 4℃ post-purification. C: The peptide mass fingerprint (PMF) of recombinant Tat protein as analyzed by MALDI-TOF-MS.
Table 2. The predicted and measured peptide mass fingerprint
-
As shown in Fig. 3B, two bands (MW 14kDa and 25kDa) of the purified Tat protein appear in SDS-PAGE gel. Western blot proves that both bands correspond to Tat-his6 proteins, which means that the larger one is the Tat dimer and the smaller is the monomer. The β-mercaptoethanol is commonly added to the loading buffer and functions as a reductant and breaks the disulfide bond between the two molecules. The results above indicate that Tat forms a dimer not by a cysteine disulfide bond but by another mechanism.
-
To further analyze the biological activity of this recombinant Tat, reporter plasmid KB/SP1LTR-nlacZ was transfected into the TE671 cell. The purified Tat protein was added immediately or after 48h of storage at 4 ℃into the cell culture to analyze the activity. In this assay, the expression of lacZ gene is driven by the Tat-activated HIV-1 LTR element, and it can be detected by means of X-gal staining. Staining results indicated that the recombinant Tat protein can efficiently activate the transcription of the target reporter gene, even after 48h storage at 4℃ post-purification (Fig. 5B).
Figure 5. Detection of the transactivation activity of the recombinant Tat protein. A: TE671 cells were transfected with reporter plasmid KB/SP1LTR-nlacZ, but without the treatment of recombinant Tat protein. B: TE671 cells were transfected with reporter plasmid KB/SP1LTR-nlacZ, and treated with the recombinant Tat protein at final concentration of 100 ng/mL.