ReviewC-type lectin-like domains
Introduction
C-type animal lectins represent an important recognition mechanism for oligosaccharides at cell surfaces, attached to circulating proteins and in the extracellular matrix. Binding of specific sugar structures by these lectins mediates biological events, such as cell–cell adhesion, serum glycoprotein turnover and innate immune responses to potential pathogens [1]. Proteins with diverse overall architecture contain homologous carbohydrate-recognition domains (CRDs) that mediate sugar binding. A comparison of the sequences of these modules defines a common sequence motif, whereas structural analysis of a prototypical domain reveals that this motif largely reflects the importance of the conserved residues in establishing the fold of this type of module [2].
Evidence discussed here and elsewhere indicates that many protein modules containing part or all of the C-type CRD motif serve functions other than saccharide recognition. Hence, it is appropriate to consider this motif a characteristic of C-type lectin-like domains (CTLDs) to reflect their similarity to the CRDs of C-type lectins without necessarily implying common function [3]. The structures of roughly a dozen CTLDs have now been established. Although overall similarity in fold is the most obvious feature of these structures, comparisons also reveal important differences in fold, in interactions with Ca2+ and in oligomerisation. The results summarised in this review highlight some of the patterns that have emerged, using the previously described structure of the CRD from rat serum mannose-binding protein as a prototype for the superfamily.
Convergent and divergent evolution the C-type lectin-like domain fold
Figure 1 summarises two important features of CTLDs that have emerged from recent structural work. First, a topology similar to the fold originally identified in mannose-binding protein has been recognised in at least three types of module that are not evidently related at the sequence level: the link-protein-type module from the mammalian cell-surface molecule CD44 [4], the angiogenesis inhibitor endostatin [5] and the bacterial adhesion molecule intimin [6]. The lack of sequence similarity and important differences in the packing of the hydrophobic cores of these molecules are strong indicators that the topological similarity results from convergent evolution. Second, extensive structural divergence within the CTLD superfamily is also documented, reflecting a range of functions that has become evident.
Amongst the structures of CTLDs that have been described in the past three years, most of the elements of regular secondary structure are conserved, although there is considerable variation in the loop structures. The most striking departure from the prototype mannose-binding protein structure is the absence of one α helix in the CTLD of the natural killer cell receptor CD94 [7••]. The sequence of this segment retains the pattern of hydrophobic amino acids that would be expected to generate an amphipathic helix suitable to form the side of the domain and sequence alignments would have led to the prediction that the helix would be present in CD94. As discussed below, its absence apparently reflects alternative interactions in the dimer interface formed by this portion of the domain.
Earlier structures of CRDs from mannose-binding protein and E-selectin were both of relatively short CRDs lacking an N-terminal disulfide bond present in many CTLDs 8, 9, 10. An extra segment appears in several more recently determined structures of CTLDs that are sometimes referred to as long-form CRDs or CTLDs. These longer CTLDs include lithostathine [11], tetranectin [12], factors IX/X-binding protein [13••] and CD94 [7••]. As expected from the presence of a disulfide bond linking cysteines separated by 10 residues in this segment, it forms a hairpin in all the structures. In several cases, one portion of the hairpin takes on β structure and pairs with the most N-terminal conserved β strand seen in all CTLDs.
The recently completed Caenorhabditis elegans genome sequence [14] provides an opportunity to explore the extent of the CTLD radiation. Roughly 180 potential CTLDs have been identified [15]. Of these, fewer than 10% are predicted to bind Ca2+ or carbohydrates, based on the features essential for such functions in mammalian CTLDs. Interestingly, the domain architectures of most C. elegans proteins containing CTLDs also differ from those of the known structural classes of mammalian proteins containing CTLDs. This difference may reflect the fact that important classes of mammalian proteins containing CTLDs remain to be identified, but it also may be a result of extensive shuffling of protein modules that has taken place since the divergence of the vertebrate and invertebrate lineages.
Section snippets
Carbohydrate-binding sites
Members of the CRD subfamily of CTLDs bind a variety of carbohydrate ligands and it is now clear that even CRDs with similar specificity can bind their ligands in distinct ways. In all of the available CRD–ligand structures, carbohydrate is complexed to the protein by forming coordination bonds with a conserved Ca2+, as well as by hydrogen bonding with acid and amide sidechains that also coordinate to the Ca2+ [2]. Binding of sugars such as mannose and N-acetylglucosamine, which display
Interactions of Ca2+ with C-type lectin-like domains
Each of the known C-type CRD structures contains a conserved Ca2+-binding site and many contain a second site analogous to that originally identified in mannose-binding protein. Based on the presence of amino acids that are ligands for these Ca2+, sequence alignments have been used to suggest that CTLDs in several other subfamilies contain one or both of these sites. Crystallographic analysis of the serum protein tetranectin, which binds to the kringle domain in plasminogen, confirms the
Multiple routes to oligomer formation
An increasing number of modes of interaction between CTLDs are being described. As in the case of interactions between other types of protein module, such as immunoglobulin-like domains, it appears that multiple surfaces can be involved in the interactions. Several subfamilies of C-type animal lectins contain CRDs adjacent to α-helical domains that have been postulated to form trimeric coiled coils. Direct evidence for such structures has been provided for rat and human mannose-binding protein
Novel ligands for C-type lectin-like domains
Known or speculated ligands for CTLDs that do not bind sugars fall into at least two groups: proteins and inorganic surfaces. Possible modes of interaction with water by the type II antifreeze proteins and with calcium carbonate by lithostathine have been discussed above. Protein ligands are known for several CTLDs that have been examined structurally. Although co-crystals containing CTLD and ligand have not been analysed in any of these cases, some speculations have been made about the
Conclusions
The past year has served to broaden our concept of what CTLDs look like and how they interact with ligands. CD94 and the factors IX/X-binding protein are the structures most divergent from the prototypical mannose-binding protein. These structures provide new models of variation within the CTLD superfamily. In addition, the Polyandrocarpa lectin structure reveals that, even within a largely conserved framework, interaction with carbohydrate ligands can occur in multiple different ways.
As
Acknowledgements
I thank Sébastien Poget, Glen Legge, Mark Proctor, Jonathan Butler, Mark Bycroft, Roger Williams and James Rini for communicating results prior to publication and Maureen Taylor for comments on the manuscript. This work was supported by the Wellcome Trust.
References and recommended reading
Papers of particular interest, published within the annual period of review, have been highlighted as:
• of special interest
•• of outstanding interest
References (33)
- et al.
Solution structure of the link module: a hyaluronan-binding domain involved in extracellular matrix stability and cell migration
Cell
(1996) - et al.
Structure of CD94 reveals a novel C-type lectin fold: implications for the NK cell-associated CD94/NKG2 receptors
Immunity
(1999) - et al.
Structural analysis of monosaccharide recognition by rat liver mannose-binding protein
J Biol Chem
(1996) - et al.
Crystal structure of tetranectin, a trimeric plasminogen-binding protein with an alpha-helical coiled coil
FEBS Lett
(1997) - et al.
The structure of a tunicate C-type lectin from Polyandrocarpa misakiensis complexed with D-galactose
J Mol Biol
(1999) - et al.
Structural basis of galactose recognition by C-type animal lectins
J Biol Chem
(1996) - et al.
Mechanism of N-acetylgalactosamine binding to a C-type animal lectin carbohydrate-recognition domain
J Biol Chem
(1998) - et al.
Pancreatic stone protein (lithostathine), a physiologically relevant pancreatic calcium carbonate crystal inhibitor?
J Biol Chem
(1997) - et al.
Lithostathine, the presumed pancreatic stone inhibitor, does not interact specifically with calcium carbonate crystals
J Biol Chem
(1998) - et al.
Trimeric structure of a C-type mannose-binding protein
Structure
(1994)
Crystal structure of the trimeric α-helical coiled-coil and the three lectin domains of human lung surfactant protein D
Structure
Biology of animal lectins
Annu Rev Cell Biol
Structural basis of lectin-carbohydrate interaction
Annu Rev Biochem
The C-type lectin superfamily in the immune system
Immunol Rev
Crystal structure of the angiogenesis inhibitor endostatin at 1.5 Å resolution
EMBO J
Structure of the cell-adhesion fragment of inimin from enteropathogenic Escherichia coli
Nat Struct Biol
Cited by (543)
Isolation and functional identification of immune cells in hemolymph of blood clams Tegillarca granosa
2024, Fish and Shellfish ImmunologyGenome-wide identification and expression analysis of C-type lectins in discus fish (Symphysodon aequifasciatus) during parental care
2024, Fish and Shellfish ImmunologyCharacterization and functional analysis of a novel C-type lectin in blunt snout bream (Megalobrama amblycephala)
2023, Fish and Shellfish ImmunologyA novel C-type lectin (CLEC12B) from Nile tilapia (Oreochromis niloticus) is involved in host defense against bacterial infection
2022, Fish and Shellfish ImmunologyCitation Excerpt :A protein family of calcium-loving and carbohydrate-binding proteins known as the C-type lectin receptor superfamily (CLR) regulates a variety of biological events including adhesion between cells, the turnover of serum glycoproteins, and the innate immune system's reaction to prospective invaders [1,2].