Review
C-type lectin-like domains

https://doi.org/10.1016/S0959-440X(99)00009-3Get rights and content

Abstract

Carbohydrate-recognition domains of C-type (Ca2+-dependent) animal lectins serve as prototypes for an important family of protein modules. Only some domains in this family bind Ca2+ or sugars. A comparison of recent structures of C-type lectin-like domains reveals diversity in the modular fold, particularly in the region associated with Ca2+ and sugar binding. Some of this diversity reflects the changes that occur during normal physiological functioning of the domains. C-type lectin-like domains associate with each other through several different surfaces to form dimers and trimers, from which ligand-binding sites project in a variety of different orientations.

Introduction

C-type animal lectins represent an important recognition mechanism for oligosaccharides at cell surfaces, attached to circulating proteins and in the extracellular matrix. Binding of specific sugar structures by these lectins mediates biological events, such as cell–cell adhesion, serum glycoprotein turnover and innate immune responses to potential pathogens [1]. Proteins with diverse overall architecture contain homologous carbohydrate-recognition domains (CRDs) that mediate sugar binding. A comparison of the sequences of these modules defines a common sequence motif, whereas structural analysis of a prototypical domain reveals that this motif largely reflects the importance of the conserved residues in establishing the fold of this type of module [2].

Evidence discussed here and elsewhere indicates that many protein modules containing part or all of the C-type CRD motif serve functions other than saccharide recognition. Hence, it is appropriate to consider this motif a characteristic of C-type lectin-like domains (CTLDs) to reflect their similarity to the CRDs of C-type lectins without necessarily implying common function [3]. The structures of roughly a dozen CTLDs have now been established. Although overall similarity in fold is the most obvious feature of these structures, comparisons also reveal important differences in fold, in interactions with Ca2+ and in oligomerisation. The results summarised in this review highlight some of the patterns that have emerged, using the previously described structure of the CRD from rat serum mannose-binding protein as a prototype for the superfamily.

Convergent and divergent evolution the C-type lectin-like domain fold

Figure 1 summarises two important features of CTLDs that have emerged from recent structural work. First, a topology similar to the fold originally identified in mannose-binding protein has been recognised in at least three types of module that are not evidently related at the sequence level: the link-protein-type module from the mammalian cell-surface molecule CD44 [4], the angiogenesis inhibitor endostatin [5] and the bacterial adhesion molecule intimin [6]. The lack of sequence similarity and important differences in the packing of the hydrophobic cores of these molecules are strong indicators that the topological similarity results from convergent evolution. Second, extensive structural divergence within the CTLD superfamily is also documented, reflecting a range of functions that has become evident.

Amongst the structures of CTLDs that have been described in the past three years, most of the elements of regular secondary structure are conserved, although there is considerable variation in the loop structures. The most striking departure from the prototype mannose-binding protein structure is the absence of one α helix in the CTLD of the natural killer cell receptor CD94 [7••]. The sequence of this segment retains the pattern of hydrophobic amino acids that would be expected to generate an amphipathic helix suitable to form the side of the domain and sequence alignments would have led to the prediction that the helix would be present in CD94. As discussed below, its absence apparently reflects alternative interactions in the dimer interface formed by this portion of the domain.

Earlier structures of CRDs from mannose-binding protein and E-selectin were both of relatively short CRDs lacking an N-terminal disulfide bond present in many CTLDs 8, 9, 10. An extra segment appears in several more recently determined structures of CTLDs that are sometimes referred to as long-form CRDs or CTLDs. These longer CTLDs include lithostathine [11], tetranectin [12], factors IX/X-binding protein [13••] and CD94 [7••]. As expected from the presence of a disulfide bond linking cysteines separated by 10 residues in this segment, it forms a hairpin in all the structures. In several cases, one portion of the hairpin takes on β structure and pairs with the most N-terminal conserved β strand seen in all CTLDs.

The recently completed Caenorhabditis elegans genome sequence [14] provides an opportunity to explore the extent of the CTLD radiation. Roughly 180 potential CTLDs have been identified [15]. Of these, fewer than 10% are predicted to bind Ca2+ or carbohydrates, based on the features essential for such functions in mammalian CTLDs. Interestingly, the domain architectures of most C. elegans proteins containing CTLDs also differ from those of the known structural classes of mammalian proteins containing CTLDs. This difference may reflect the fact that important classes of mammalian proteins containing CTLDs remain to be identified, but it also may be a result of extensive shuffling of protein modules that has taken place since the divergence of the vertebrate and invertebrate lineages.

Section snippets

Carbohydrate-binding sites

Members of the CRD subfamily of CTLDs bind a variety of carbohydrate ligands and it is now clear that even CRDs with similar specificity can bind their ligands in distinct ways. In all of the available CRD–ligand structures, carbohydrate is complexed to the protein by forming coordination bonds with a conserved Ca2+, as well as by hydrogen bonding with acid and amide sidechains that also coordinate to the Ca2+ [2]. Binding of sugars such as mannose and N-acetylglucosamine, which display

Interactions of Ca2+ with C-type lectin-like domains

Each of the known C-type CRD structures contains a conserved Ca2+-binding site and many contain a second site analogous to that originally identified in mannose-binding protein. Based on the presence of amino acids that are ligands for these Ca2+, sequence alignments have been used to suggest that CTLDs in several other subfamilies contain one or both of these sites. Crystallographic analysis of the serum protein tetranectin, which binds to the kringle domain in plasminogen, confirms the

Multiple routes to oligomer formation

An increasing number of modes of interaction between CTLDs are being described. As in the case of interactions between other types of protein module, such as immunoglobulin-like domains, it appears that multiple surfaces can be involved in the interactions. Several subfamilies of C-type animal lectins contain CRDs adjacent to α-helical domains that have been postulated to form trimeric coiled coils. Direct evidence for such structures has been provided for rat and human mannose-binding protein

Novel ligands for C-type lectin-like domains

Known or speculated ligands for CTLDs that do not bind sugars fall into at least two groups: proteins and inorganic surfaces. Possible modes of interaction with water by the type II antifreeze proteins and with calcium carbonate by lithostathine have been discussed above. Protein ligands are known for several CTLDs that have been examined structurally. Although co-crystals containing CTLD and ligand have not been analysed in any of these cases, some speculations have been made about the

Conclusions

The past year has served to broaden our concept of what CTLDs look like and how they interact with ligands. CD94 and the factors IX/X-binding protein are the structures most divergent from the prototypical mannose-binding protein. These structures provide new models of variation within the CTLD superfamily. In addition, the Polyandrocarpa lectin structure reveals that, even within a largely conserved framework, interaction with carbohydrate ligands can occur in multiple different ways.

As

Acknowledgements

I thank Sébastien Poget, Glen Legge, Mark Proctor, Jonathan Butler, Mark Bycroft, Roger Williams and James Rini for communicating results prior to publication and Maureen Taylor for comments on the manuscript. This work was supported by the Wellcome Trust.

References and recommended reading

Papers of particular interest, published within the annual period of review, have been highlighted as:

  • • of special interest

  • •• of outstanding interest

References (33)

  • K Hakansson et al.

    Crystal structure of the trimeric α-helical coiled-coil and the three lectin domains of human lung surfactant protein D

    Structure

    (1999)
  • K Drickamer et al.

    Biology of animal lectins

    Annu Rev Cell Biol

    (1993)
  • WI Weis et al.

    Structural basis of lectin-carbohydrate interaction

    Annu Rev Biochem

    (1996)
  • WI Weis et al.

    The C-type lectin superfamily in the immune system

    Immunol Rev

    (1998)
  • E Hohenester et al.

    Crystal structure of the angiogenesis inhibitor endostatin at 1.5 Å resolution

    EMBO J

    (1998)
  • G Kelly et al.

    Structure of the cell-adhesion fragment of inimin from enteropathogenic Escherichia coli

    Nat Struct Biol

    (1999)
  • Cited by (543)

    • A novel C-type lectin (CLEC12B) from Nile tilapia (Oreochromis niloticus) is involved in host defense against bacterial infection

      2022, Fish and Shellfish Immunology
      Citation Excerpt :

      A protein family of calcium-loving and carbohydrate-binding proteins known as the C-type lectin receptor superfamily (CLR) regulates a variety of biological events including adhesion between cells, the turnover of serum glycoproteins, and the innate immune system's reaction to prospective invaders [1,2].

    View all citing articles on Scopus
    View full text