The ClinSeq Project: Piloting large-scale genome sequencing for research in genomic medicine

  1. Leslie G. Biesecker1,2,5,
  2. James C. Mullikin1,2,
  3. Flavia M. Facio1,
  4. Clesson Turner1,
  5. Praveen F. Cherukuri1,
  6. Robert W. Blakesley1,2,
  7. Gerard G. Bouffard1,2,
  8. Peter S. Chines1,
  9. Pedro Cruz2,
  10. Nancy F. Hansen1,2,
  11. Jamie K. Teer1,
  12. Baishali Maskeri2,
  13. Alice C. Young2,
  14. NISC Comparative Sequencing Program1,2,
  15. Teri A. Manolio1,
  16. Alexander F. Wilson1,
  17. Toren Finkel3,
  18. Paul Hwang3,
  19. Andrew Arai3,
  20. Alan T. Remaley3,4,
  21. Vandana Sachdev3,
  22. Robert Shamburek3,
  23. Richard O. Cannon3 and
  24. Eric D. Green1,2
  1. 1 National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA;
  2. 2 National Institutes of Health Intramural Sequencing Center (NISC), National Institutes of Health, Bethesda, Maryland 20892, USA;
  3. 3 National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, USA;
  4. 4 Clinical Research Center, National Institutes of Health, Bethesda, Maryland 20892, USA

    Abstract

    ClinSeq is a pilot project to investigate the use of whole-genome sequencing as a tool for clinical research. By piloting the acquisition of large amounts of DNA sequence data from individual human subjects, we are fostering the development of hypothesis-generating approaches for performing research in genomic medicine, including the exploration of issues related to the genetic architecture of disease, implementation of genomic technology, informed consent, disclosure of genetic information, and archiving, analyzing, and displaying sequence data. In the initial phase of ClinSeq, we are enrolling roughly 1000 participants; the evaluation of each includes obtaining a detailed family and medical history, as well as a clinical evaluation. The participants are being consented broadly for research on many traits and for whole-genome sequencing. Initially, Sanger-based sequencing of 300–400 genes thought to be relevant to atherosclerosis is being performed, with the resulting data analyzed for rare, high-penetrance variants associated with specific clinical traits. The participants are also being consented to allow the contact of family members for additional studies of sequence variants to explore their potential association with specific phenotypes. Here, we present the general considerations in designing ClinSeq, preliminary results based on the generation of an initial 826 Mb of sequence data, the findings for several genes that serve as positive controls for the project, and our views about the potential implications of ClinSeq. The early experiences with ClinSeq illustrate how large-scale medical sequencing can be a practical, productive, and critical component of research in genomic medicine.

    Footnotes

    • 5 Corresponding author.

      E-mail leslieb{at}helix.nih.gov; fax (301) 402-2170.

    • 6 The ClinSeq consent form is available by request from the corresponding author.

    • [Supplemental material is available online at http://www.genome.org.]

    • Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.092841.109.

      • Received February 17, 2009.
      • Accepted July 10, 2009.
    | Table of Contents

    Preprint Server