Sequence Variability Analysis of Human Class I and Class II MHC Molecules: Functional and Structural Correlates of Amino Acid Polymorphisms

https://doi.org/10.1016/S0022-2836(03)00750-2Get rights and content

Abstract

Major histocompatibility complex class I (MHCI) and class II (MHCII) molecules display peptides on antigen-presenting cell surfaces for subsequent T-cell recognition. Within the human population, allelic variation among the classical MHCI and II gene products is the basis for differential peptide binding, thymic repertoire bias and allograft rejection. While available 3D structural analysis suggests that polymorphisms are found primarily within the peptide-binding site, a broader informatic approach pinpointing functional polymorphisms relevant for immune recognition is currently lacking. To this end, we have now analyzed known human class I (774) and class II (485) alleles at each amino acid position using a variability metric (V). Polymorphisms (V>1) have been identified in residues that contact the peptide and/or T-cell receptor (TCR). Using sequence logos to investigate TCR contact sites on HLA molecules, we have identified conserved MHCI residues distinct from those of conserved MHCII residues. In addition, specific class II (HLA-DP, -DQ, -DR) and class I (HLA-A, -B, -C) contacts for TCR binding are revealed. We discuss these findings in the context of TCR restriction and alloreactivity.

Introduction

Classical major histocompatibility complex (MHC) molecules are cell-surface glycoproteins that are central to the process of adaptive immunity, functioning to capture and display peptides on the surface of antigen-presenting cells (APCs).1 These plasma membrane-bound peptide–MHC complexes (pMHC) are scrutinized by T-lymphocytes via their T-cell receptors (TCRs) during immunosurveillance.2 Since T-cells recognizing self-peptides are eliminated during the process of thymic selection, those pMHC incorporating foreign peptides are the primary focus of T-cell-mediated immune responses.3

There are two major classes of MHC molecules, termed class I and class II.4 MHC class I (MHCI) molecules are expressed on most cells, bind endogenously derived peptides with sizes ranging from eight to ten amino acid residues and are recognized by CD8 cytotoxic T-lymphocytes (CTL). On the other hand, MHC class II (MHCII) are present only on specialized APCs, bind exogenously derived peptides with sizes varying from 9 to 22 residues, and are recognized by CD4 helper T-cells. These differences indicate that MHCI and MHCII molecules engage two distinct arms of the T-cell-mediated immune response, the former targeting invasive pathogens such as viruses for destruction by CD8 CTLs, and the latter inducing cytokine-based inflammatory mediators to stimulate CD4 helper T-cell activities including B-cell activation, maturation and antibody production.5

Sequence identity between classical MHCI and MHCII is low (<20%), yet their 3D structures are strikingly similar. MHCI molecules are heterodimers composed of a single-membrane-spanning α chain, paired with the soluble β2 microglobulin (β2m) protein. The α chain has been divided into three distinct segments termed α1, α2 and α3. The α3 region has an immunoglobulin (Ig)-like fold, whereas the membrane distal α1 and α2 segments form a peptide-binding cleft consisting of two α-helices overlying a floor comprised of eight antiparallel β-stranded sheets (α1α2 domain). MHCII molecules are heterodimers also but comprised of two subunits, each with a single membrane-spanning anchor. The extracellular segment of the α chain includes α1 and α2 domains, and likewise, the β chain is composed of β1 and β2 domains. In the mature MHCII molecule, the α2 and β2 domains fold as independent Ig-like domains, whereas α1 and β1 fold together, creating a single antigen-presenting platform with architecture very similar to that of the α1α2 domain in MHCI molecules.6., 7.

In the human, MHC molecules are referred to as HLA, an acronym for human leukocyte antigens, and are encoded by the chromosome 6p21.3-located HLA region.8., 9. The HLA segment is divided into three regions (from centromere to telomere), class II, class III and class I. Classical class I and class II HLA genes are contained in the class I and class II regions, respectively, whereas the class III locus bears genes encoding proteins involved in the immune system but not structurally related to MHC molecules. The classical HLA class I molecules are of three types, HLA-A, HLA-B and HLA-C. Only the α chains of these mature HLA class I molecules are encoded within the class I HLA locus by the respective HLA-A, HLA-B and HLA-C genes. In contrast, the β2m chain encoded by the β2m gene is located on chromosome 15. The classical HLA class II molecules are also of three types (HLA-DP, HLA-DQ and HLA-DR), with both the α and β chains of each encoded by a pair of adjacent loci. In addition to these classical HLA class I and HLA class II genes, the human MHC locus includes a long array of HLA pseudogenes as well as genes encoding non-classical MHCI and MHCII molecules.9 HLA-pseudogenes are an indication that gene duplication is the main driving force for HLA evolution,10 whereas non-classical MHCI and MHCII molecules often subserve a restricted function within the immune system quite distinct from that of antigen presentation to αβ TCRs.11., 12. Furthermore, in contrast to the classical HLA molecules (with the exception of the somewhat polymorphic MICA and MICB genes), non-classical HLA molecules are composed of a very limited number of allelic variants.

Genes encoding classical HLA molecules are extremely polymorphic, such that most classical HLA genes include a large number of allelic variants. Thus, the HLA IMGT/HLA database†13 currently includes 1524 HLA allelic sequences (904 HLA I alleles and 620 HLA II alleles) (release 1.16, 14/10/2002). The basis for this extreme level of allelic diversity has been linked to evolving optimization of immune protection against pathogens. Consistent with this notion, most polymorphisms are associated with the peptide-binding residues of the α1α2 and α1β1 domains of HLA class I and HLA class II molecules, respectively.14 Moreover, the rate of non-synonymous substitutions (those that result in amino acid changes) is greater than the rate of synonymous substitutions (those that do not change amino acid sequence) in codons specifying amino acids involved in peptide binding.15., 16. In addition, population studies of HLA class I genes indicate that codons involved in peptide binding have greater heterozygosity than those with other functions.17 Reproductive selection mechanisms may contribute toward shaping the HLA polymorphisms.18

HLA molecular variability studies at an amino acid level are limited,19 as most HLA polymorphism analyses have been performed at the nucleotide level. Thus, comprehensive analysis of HLA amino acid variability in the context of the ever-increasing number of pMHC structures, some in complex with cognate TCRs, is missing. In this regard, we have carried out a detailed analysis of the amino acid sequence variability in each of the six distinct classical HLA molecules (HLA-A, HLA-B, HLA-C, HLA-DP, HLA-DQ, and HLA-DR) using a variability metric given by the Shannon entropy equation.20 We have combined this variability metric with sequence contact maps of the MHC binding to peptides, TCRs and CD8 and CD4 co-receptors obtained from the analysis of the available X-ray crystallographic structures. Interestingly, the pattern of variability for every MHC molecule distinguishes MHCI and MHCII and defines characteristic “fingerprints” for each subclass. The relevance of these distinct patterns of variability in MHC peptide-binding specificity, as well as TCR restriction and alloreactivity is discussed. Sequence variability analysis of the peptide binding region of HLA molecules and their mapping onto the relevant 3D structures is available for visualization at the Molecular Immunology Foundation web site‡.

Section snippets

HLA polymorphisms

HLA genes are extremely polymorphic, with most genes consisting of a large number of allelic variants (Table 1) specifying differences at the amino acid level and fine structural detail as revealed by X-ray crystallography comparison of PDBs (Table 2). The number of HLA allelic variants that diverge in at least one amino acid residue varies for the individual HLA genes, being greatest for HLA-B and DRB1 genes with 447 and 271 variants, respectively (Table 1). At the other extreme is the HLA-DRA

MHC sequences and multiple sequence alignments

Amino acid sequences of the human MHCI and MHCII alleles were collected from the IMGT/HLA database.13 Amino acid sequences were collected from functional genes with sequences from pseudogenes being discarded. Thus, amino acid sequences of the α and β chains of the HLA-DP molecule (HLA-DPA, and HLA-DPB, respectively) were obtained from the HLA-DPA1 and HLA-DPB1 genes. Likewise, the HLA-DQA1 and HLA-DQB1 genes were the only sources of the amino acid sequences for the α and β chains of the HLA-DQ

Acknowledgements

This work was supported by NIH grant AI50900 and the Molecular Immunology Foundation. We acknowledge the excellent scientific input provided by Dr Alfonso Valencia and comments from Drs Jia-huai Wang, Bruce Reinhold and Linda Clayton.

References (69)

  • M.G. Rudolph et al.

    The specificity of TCR/pMHC interaction

    Curr. Opin. Immunol.

    (2002)
  • Y.H. Ding et al.

    Two human T cell receptors bind in a similar diagonal mode to the HLA-A2/Tax peptide complex using different TCR amino acids

    Immunity

    (1998)
  • J. Zerrahn et al.

    The MHC reactivity of the T cell repertoire prior to positive and negative selection

    Cell

    (1997)
  • T.C. Manning et al.

    Alanine scanning mutagenesis of an alphabeta T cell receptor: mapping the energy of antigen recognition

    Immunity

    (1998)
  • G. Vasmatzis et al.

    Computational determination of side chain specificity for pockets in class I MHC molecules

    Mol. Immunol.

    (1996)
  • J.B. Horowitz et al.

    Ability of fixed B-lymphoma cells to present foreign protein antigen fragments and allogenic MHC molecules to a cloned helper-T-cell line

    Cell Immunol.

    (1987)
  • D.A. Shoskes et al.

    Indirect presentation of MHC antigens in transplantation

    Immunol. Today

    (1994)
  • J.B. Reiser et al.

    A T cell receptor CDR3beta loop undergoes conformational changes of unprecedented magnitude upon binding to a peptide/MHC class I complex

    Immunity

    (2002)
  • M.J. Bevan

    High determinant density may explain the phenomenon of alloreactivity

    Immunol. Today

    (1984)
  • R.A. Sayle et al.

    RASMOL: biomolecular graphics for all

    Trends Biochem. Sci.

    (1995)
  • A. Sali et al.

    Comparative protein modelling by satisfaction of spatial restraints

    J. Mol. Biol.

    (1993)
  • Paul, W. E. (1998). Editor of Fundamental Immunology, Lippincott, Williams & Wilkins, Philadelphia,...
  • D.R. Madden

    The three-dimensional structure of peptide–MHC complexes

    Annu. Rev. Immunol.

    (1995)
  • Complete sequence and gene map of a human major histocompatibility complex

    Nature

    (1999)
  • T. Shiina et al.

    Molecular dynamics of MHC genesis unraveled by sequence analysis of the 1,796,938-bp HLA class I region

    Proc. Natl Acad. Sci. USA

    (1999)
  • M. Yeager et al.

    Evolution of the mammalian MHC: natural selection, recombination, and convergent evolution

    Immunol. Rev.

    (1999)
  • C. Alfonso et al.

    Nonclassical MHC class II molecules

    Annu. Rev. Immunol.

    (2000)
  • J. Robinson et al.

    IMGT/HLA database—a sequence database for the human major histocompatibility complex

    Tissue Antigens

    (2000)
  • A.L. Hughes et al.

    Pattern of nucleotide substitution at major histocompatibility complex class I loci reveals overdominant selection

    Nature

    (1988)
  • A.L. Hughes et al.

    Nucleotide substitution at major histocompatibility complex class II loci: evidence for overdominant selection

    Proc. Natl Acad. Sci. USA

    (1989)
  • P.W. Hedrick et al.

    Heterozygosity at individual amino acid sites: extremely high levels for HLA-A and -B genes

    Proc. Natl Acad. Sci. USA

    (1991)
  • D. Meyer et al.

    How selection shapes variation of the human major histocompatibility complex: a review

    Ann. Hum. Genet.

    (2001)
  • Weaver, W. & Shannon, C. E. (1949). The Mathematical Theory of Communication, University of Illinois Press, Urbana,...
  • E.A. Kabat

    Antigenic determinants and antibody complementarity

    Folia Allergol.

    (1970)
  • Cited by (0)

    View full text