Accurate and reliable high-throughput detection of copy number variation in the human genome

  1. Heike Fiegler1,6,
  2. Richard Redon1,6,
  3. Dan Andrews1,6,
  4. Carol Scott1,6,
  5. Robert Andrews1,
  6. Carol Carder1,
  7. Richard Clark1,
  8. Oliver Dovey1,
  9. Peter Ellis1,
  10. Lars Feuk2,3,
  11. Lisa French1,
  12. Paul Hunt1,
  13. Dimitrios Kalaitzopoulos1,
  14. James Larkin1,
  15. Lyndal Montgomery1,
  16. George H. Perry4,
  17. Bob W. Plumb1,
  18. Keith Porter1,
  19. Rachel E. Rigby1,
  20. Diane Rigler1,
  21. Armand Valsesia1,
  22. Cordelia Langford1,
  23. Sean J. Humphray1,
  24. Stephen W. Scherer2,3,
  25. Charles Lee4,5,
  26. Matthew E. Hurles1, and
  27. Nigel P. Carter1,7
  1. 1 The Wellcome Trust Sanger Institute, The Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom;
  2. 2 Department of Genetics and Genomic Biology, The Hospital for Sick Children, Toronto, Ontario M5G 1X8, Canada;
  3. 3 Molecular and Medical Genetics, University of Toronto, Toronto, Ontario, MSG IL7, Canada;
  4. 4 Department of Pathology, Brigham and Women’s Hospital, Boston, Massachusetts 02115, USA;
  5. 5 Harvard Medical School, Boston, Massachusetts 02115, USA
  1. 6 These authors contributed equally to this work.

Abstract

This study describes a new tool for accurate and reliable high-throughput detection of copy number variation in the human genome. We have constructed a large-insert clone DNA microarray covering the entire human genome in tiling path resolution that we have used to identify copy number variation in human populations. Crucial to this study has been the development of a robust array platform and analytic process for the automated identification of copy number variants (CNVs). The array consists of 26,574 clones covering 93.7% of euchromatic regions. Clones were selected primarily from the published “Golden Path,” and mapping was confirmed by fingerprinting and BAC-end sequencing. Array performance was extensively tested by a series of validation assays. These included determining the hybridization characteristics of each individual clone on the array by chromosome-specific add-in experiments. Estimation of data reproducibility and false-positive/negative rates was carried out using self–self hybridizations, replicate experiments, and independent validations of CNVs. Based on these studies, we developed a variance-based automatic copy number detection analysis process (CNVfinder) and have demonstrated its robustness by comparison with the SW-ARRAY method.

Footnotes

  • 7 Corresponding author.

    7 E-mail npc{at}sanger.ac.uk; fax +44-1223-491919.

  • [Supplemental material is available online at www.genome.org.]

  • Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.5630906

    • Received June 13, 2006.
    • Accepted August 24, 2006.
  • Freely available online through the Genome Research Open Access option.

Related Article

| Table of Contents
OPEN ACCESS ARTICLE

Preprint Server