Journal of Molecular Biology
Improved Segmental Isotope Labeling Methods for the NMR Study of Multidomain or Large Proteins: Application to the RRMs of Npl3p and hnRNP L
Introduction
Nuclear magnetic resonance (NMR) spectroscopy is a powerful method for the determination of protein structures in solution as well as for the investigation of protein dynamics, protein–protein and protein–ligand interactions. Currently, however, only proteins with molecular mass of up to ∼30 kDa can be studied structurally by NMR spectroscopy. Large proteins in solution show severe NMR line-broadening owing to slow tumbling and a high signal overlap as the result of the large number of resonances. The introduction of transverse relaxation-optimized spectroscopy1., 2. helped the scope of solution NMR to be extended. Also, problems of overlapping NMR signals can be reduced by isotope labeling by amino acid type,3., 4., 5. or by the recent stereo-array isotope labeling of amino acids.6 However, specific labeling of amino acid residues does not allow protein sequential resonance assignments and stereo-array isotope labeling of amino acids is limited to translation in vitro. An alternative solution for the study of large proteins by NMR would be to isotopically label segments or domains of a large protein. There are two methods that have been used successfully for segmental isotope labeling of proteins: expressed protein ligation (EPL),7., 8., 9., 10. and protein trans-splicing (PTS).11., 12. Segmental isotope labeling of proteins simplifies the structure determination of large proteins, and can provide valuable structural and biological information about multidomain proteins. Segmental isotope labeling allows one to determine whether protein domains interact with each other and helps to define their precise interface, or whether protein domains do not interact and therefore can be structurally (and biologically) characterized independently.
EPL is based on the reaction originally employed in Native Chemical Ligation,13., 14. where a C-terminal α-thioester of one peptide reacts with an N-terminal cysteine residue of the second peptide, resulting in the formation of a native peptide bond.8 However, the synthesis of large peptides has not been reported, as such peptides cannot be synthesized reliably with sufficient yields. EPL is a more convenient approach for generating large protein fragments for ligation.7., 8., 15., 16., 17. EPL relies on bacterial expression of individual fragments in fusion with an intein. Inteins catalyze their own excision, yielding protein fragments with reactive termini optimal for protein ligation. Their self-splicing properties have great value for various applications in protein chemistry, and several inteins are commercially available in systems used for intein-mediated purification of proteins. The IMPACT™-TWIN system introduced by New England Biolabs uses mini-inteins derived from the Synechocystis sp dnaB gene (Ssp DnaB),18 from the Mycobacterium xenopi gyrA gene (Mxe GyrA),19 and from the Methanobacterium thermoautotrophicum rir1 gene (Mth RIR1).20 The Ssp DnaB intein has been engineered to undergo pH or temperature-dependent cleavage at its C terminus, generating protein fragments with the desired N-terminal amino acid residue.21 The Mxe GyrA and Mth RIR1 inteins have been modified to undergo thiol-inducible cleavage at their N terminus, yielding the C-terminal thioester.22 The designed inteins of the IMPACT™-TWIN system thus provide a simple solution for segmental isotope labeling of proteins by EPL. PTS differs from EPL, as it is based on the reconstitution of an intein from two split intein fragments fused to an N and a C extein. After reconstitution, the intein excises itself, resulting in the ligation of the two exteins.8
The successful application of EPL and PTS for segmental isotope labeling of proteins has been reported for several NMR studies. Yamazaki and co-workers segmentally labeled the C-terminal domain of the RNA polymerase α subunit using a trans-splicing system based on the split PI-PfuI intein.12 More recently, the same group reported the segmental labeling of the 52 kDa β subunit of the F1-ATPase, which was used successfully in combination with transverse relaxation-optimized spectroscopy to obtain its backbone resonance assignment, followed by valuable conformational studies.11 Similarly, the potential application of EPL for NMR was demonstrated in subsequent studies. Xu and co-workers chemically ligated two folded recombinant domains (SH2 and SH3) of Abelson protein tyrosine kinase using an intein fusion to generate a C-terminal α-thioester derivate of Abl-SH3 and a factor Xa proteolysis to generate the N-terminal cysteine residue of Abl-SH2 for the ligation reaction.23 Camarero et al used EPL for segmental isotope labeling of two σA factor regions to investigate whether their interaction leads to autoinhibition, i.e. the abolishment of its binding to DNA.10 More recently, our group used EPL to investigate the interactions between the two C-terminal RNA recognition motifs (RRMs) of PTB.24 There, we developed an EPL protocol using an on-column ligation step that allowed us to obtain a very high yield. The segmentally labeled samples were used for a structural investigation of the interactions between the two RRMs, and for the determination of their precise structure.24 Other studies have contributed to the improvement of segmental isotope labeling techniques by finding more convenient ways to perform the ligation reaction in order to increase its yield and to simplify the protocol.24., 25. Significant effort was invested in the development of techniques resulting in the ligation of several protein fragments.9., 26., 27., 28. These works demonstrate the potential of EPL and PTS for the NMR study of large proteins in reducing the signal overlap as well as giving the possibility to investigate the functional properties of the proteins (conformational changes upon the ligand binding) or multidomain proteins by NMR.
Here, we report further improvement of our on-column EPL protocol (Figure 1) and its application to the study of the interactions of two multidomain proteins, hnRNP L and Npl3p. With the three different ligation reactions reported here, we show that EPL can be generally applied to any multidomain proteins, independent of whether the domains interact, are soluble or are folded. These results emphasize the usefulness of this method for the segmental labeling of any given polypeptide chain and their study by NMR. Furthermore, we found that the RRMs of heterogeneous nuclear ribonucleoprotein L (hnRNPL) interact, whereas those of Npl3 are independent. Subsequently, the structures of the two RRMs of Npl3p were determined from samples in which each RRM was expressed individually.
Section snippets
Segmental isotope labeling of two insoluble fragments: hnRNPL RRM3 and RRM4
The hnRNPL is a very abundant RNA-binding protein known to be involved mainly in alternative splicing and degradation of mRNA.29 hnRNP L contains four RRMs (Figure 2(a)). Preliminary studies carried out in our laboratory indicated that RRM3 and RRM4 might interact with each other despite the long linker between the two RRMs (417–461, Figure 2). In order to characterize this interaction structurally by NMR, we decided to segmentally label the RRM34 fragment using our EPL protocol (Figure 1).24
Discussion
Earlier, we described the successful segmental isotopic labeling of two soluble, folded and interacting domains using an on-column EPL protocol.2., 4. To fully exploit the potential of EPL for protein NMR, the EPL protocol must be sufficiently robust to enable ligation of proteins exhibiting less favorable properties. To make EPL as broadly applicable as possible, we have improved our protocol (Figure 1) to allow ligation of insoluble, non-interacting and misfolded domains.24 EPL was applied to
General methods
Ligated Npl3p RRM12 193 and RRM12 211 products were purified by size-exclusion chromatography using a Superdex™ 75 10/300 GL column at a flow-rate of 0.5 ml/min in 50 mM sodium phosphate (pH 7), 200 mM NaCl. SDS-PAGE was used for analysis of the expression and solubility of hnRNPL and Npl3p constructs, as well as estimation of the intein cleavage and ligation efficiencies. Matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) mass spectroscopy analysis of all constructs was
Acknowledgements
This investigation was performed within the Structural Biology National Center of Competence in Research (NCCR) ISO-laboratory co-directed by Professor K. Wüthrich and Professor F. Allain. The investigation was supported by grants from the Swiss National Science Foundation and the ETH Zurich through the NCCR Structural Biology and by the Roche Research Fund for Biology at the ETH Zurich to F.H.T.A., who is an EMBO Young Investigator.
References (43)
- et al.
Expressed protein ligation, a novel method for studying protein-protein interactions in transcription
J. Biol. Chem.
(1998) - et al.
Recent advances in the application of expressed protein ligation to protein engineering
Curr. Opin. Biotechnol.
(2002) - et al.
Protein trans-splicing and functional mini-inteins of a cyanobacterial dnaB intein
Biochim. Biophys. Acta
(1998) - et al.
Characterization of a self-splicing mini-intein and its conversion into autocatalytic N- and C-terminal cleavage elements: facile production of protein building blocks for protein ligation
Gene
(1999) - et al.
The in vitro ligation of bacterially expressed proteins using an intein from Methanobacterium thermoautotrophicum
J. Biol. Chem.
(1999) - et al.
Synthesis of multi-domain proteins using expressed protein ligation: strategies for segmental isotopic labeling of internal regions
Tetrahedron
(2000) - et al.
A pulsed field gradient isotope-filtered 3D 13C HMQC-NOESY experiment for extracting intermolecular NOE contacts in molecular complexes
FEBS Letters
(1994) - et al.
Protein NMR structure determination with automated NOE assignment using the new software CANDID and the torsion angle dynamics algorithm DYANA
J. Mol. Biol.
(2002) - et al.
MOLMOL: a program for display and analysis of macromolecular structures
J. Mol. Graph.
(1996) - et al.
Attenuated T2 relaxation by mutual cancellation of dipole-dipole coupling and chemical shift anisotropy indicates an avenue to NMR structures of very large biological macromolecules in solution
Proc. Natl Acad. Sci. USA
(1997)
The use of TROSY for detection and suppression of conformational exchange NMR line broadening in biological macromolecules
J. Biomol. NMR
A robust and cost-effective method for the production of Val, Leu, Ile (δ1) methyl-protonated 15N-, 13C-, 2H-labeled proteins
J. Biomol. NMR
Cell-free synthesis and amino acid-selective stable isotope labeling of proteins for NMR analysis
J. Biomol. NMR
Dual amino acid-selective and site-directed stable-isotope labeling of the human c-Ha-Ras protein by cell-free synthesis
J. Biomol, NMR
Optimal isotope labelling for NMR protein structure determinations
Nature
Semisynthesis of proteins by expressed protein ligation
Annu. Rev. Biochem.
Protein ligation: an enabling technology for the biophysical analysis of proteins
Nature Methods
Insertion of a Synthetic peptide into a recombinant protein framework: a protein biosensor
J. Am. Chem. Soc.
Autoregulation of a bacterial σ factor explored by using segmental isotopic labeling and NMR
Proc. Natl Acad. Sci. USA
Conformational change of H+-ATPase β monomer revealed on segmental isotope labeling NMR spectroscopy
J. Am. Chem. Soc.
Segmental isotope labeling for protein NMR using peptide splicing
J. Am. Chem. Soc.
Cited by (44)
Population genetic structure of Sitobion miscanthi in China
2022, Journal of Integrative AgricultureCitation Excerpt :Understanding the migration behaviour of agricultural pests can lead to more effective control of these pests and prevent large outbreaks, which is of great significance to agricultural security (Hill 1975; Nair et al. 2000; Thomson et al. 2010). Fluorescence labelling and radioisotope methods are commonly used to study the migration of large animals (Skrisovska and Frédéric 2008). However, insects, such as aphids, are small and difficult to mark, so it is difficult to study their migration with traditional methods.
Selective Amino Acid Segmental Labeling of Multi-Domain Proteins
2015, Methods in EnzymologyCitation Excerpt :Insoluble precursor molecules that are obtained as inclusion bodies or cell-free precipitates require native refolding as well. This is usually achieved by complete unfolding of the target protein in a buffer containing chaotropic agents such as urea or guanidine hydrochloride and is followed by slow dialysis or rapid dilution into a buffer that supports native folding into a ligation-competent state (Burgess, 2009; Middelberg, 2002; Skrisovska & Allain, 2008). Ligation of the respective competent precursor molecules is initiated either after separate purification and conversion of each individual precursor into the reactive fragment or by simultaneous co-purification and conversion of both precursors on an affinity-column (on-column approach).
One, two, three, four! How multiple RRMs read the genome sequence
2015, Methods in EnzymologyCitation Excerpt :Structures of tandem RRMs characterized in their free states show extensive variations. In most cases, a flexible interdomain linker separates the two RRMs and no inter-RRM interactions are present as seen for Sxl, Nucleolin, PTB RRM12, Hrp1, Npl3, and HuR (Allain, Gilbert, Bouvet, & Feigon, 2000; Crowder, Kanaar, Rio, & Alber, 1999; Perez-Canadillas, 2006; Skrisovska & Allain, 2008; Vitali et al., 2006; Wang et al., 2013). In contrast, a few tandem RRMs show stable interactions between the RRMs in the free state as shown for PTB, nPTB, Prp24, FIR, U2AF65, hnRNP A1, hnRNP L, and more recently CPEB1 and 4 (Afroz et al., 2014; Bae et al., 2007; Barraud & Allain, 2013; Cukier et al., 2010; Mackereth et al., 2011; Vitali et al., 2006; Zhang et al., 2013).
Interactions affected by arginine methylation in the yeast protein-protein interaction network
2013, Molecular and Cellular ProteomicsCitation Excerpt :At this moment, it is unclear as to whether the number of methylated arginines or the type of methylated arginine residues affects the manner in which Npl3 interacts with its partner proteins. Structure data is only available for the RRM1 and RRM2 domains of Npl3 (57, 58) and residues 57–198 of Air2 (59, 60). The analysis of the location of the methylated residues on the proteins revealed that they mostly reside in regions of disorder (Table I).
Crystal structures and RNA-binding properties of the RNA recognition motifs of heterogeneous nuclear ribonucleoprotein L: Insights into its roles in alternative splicing regulation
2013, Journal of Biological ChemistryCitation Excerpt :The hydrophobic and hydrogen bond interactions occur via direct inter-RRM contacts (Fig. 3B) and indirect contacts mediated by the interdomain linker (Fig. 3C). The packed crystal structure of RRM34 is consistent with a previous observation, by NMR spectroscopy, of the RRM3 and RRM4 interaction (39). Interestingly, the amino acids involved in interdomain contacts are highly conserved across species (Fig. 3D), suggesting that RRM34 in all homologous proteins is in the same spatial arrangement and that this topology may be functionally significant.