Genetic Analysis of Case/Control Data Using Estimated Haplotype Frequencies: Application to APOE Locus Variation and Alzheimer's Disease

  1. Daniele Fallin1,6,
  2. Annick Cohen3,
  3. Laurent Essioux2,
  4. Ilya Chumakov3,
  5. Marta Blumenfeld3,
  6. Daniel Cohen3, and
  7. Nicholas J. Schork1,2,4,5,6,7
  1. 1Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, Ohio 44109, USA; 2Department of Statistical Genomics, Genset Corporation, La Jolla, California 92037, USA; 3Genset SA, Paris 75008, France; 4Department of Biostatistics and Program for Population Genetics, Harvard University, Boston, Massachusetts 02115, USA; 5The Jackson Laboratory, Bar Harbor, Maine 04609, USA

Abstract

There is growing debate over the utility of multiple locus association analyses in the identification of genomic regions harboring sequence variants that influence common complex traits such as hypertension and diabetes. Much of this debate concerns the manner in which one can use the genotypic information from individuals gathered in simple sampling frameworks, such as the case/control designs, to actually assess the association between alleles in a particular genomic region and a trait. In this paper we describe methods for testing associations between estimated haplotype frequencies derived from multilocus genotype data and disease endpoints assuming a simple case/control sampling design. These proposed methods overcome the lack of phase information usually associated with samples of unrelated individuals and provide a comprehensive way of assessing the relationship between sequence or multiple-site variation and traits and diseases within populations. We applied the proposed methods in a study of the relationship between polymorphisms within the APOE gene region and Alzheimer's disease. Cases and controls for this study were collected from the United States and France. Our results confirm the known association between the APOE locus and Alzheimer's disease, even when the ɛ4 polymorphism is not contained in the tested haplotypes. This suggests that, in certain situations, haplotype information and linkage disequilibrium-induced associations between polymorphic loci that neighbor loci harboring functional sequence variants can be exploited to identify disease-predisposing alleles in large, freely mixing populations via estimated haplotype frequency methods.

Footnotes

  • 6 These authors contributed equally to this work.

  • 7 Corresponding author.

  • E-MAIL njs2{at}po.cwru.edu; FAX (216) 778-8297.

  • Article and publication are at www.genome.org/cgi/doi/10.1101/gr.148401.

    • Received May 19, 2000.
    • Accepted October 12, 2000.
| Table of Contents

Preprint Server