Elsevier

Clinical Neurophysiology

Volume 119, Issue 12, December 2008, Pages 2677-2686
Clinical Neurophysiology

Invited review
On the blind source separation of human electroencephalogram by approximate joint diagonalization of second order statistics

https://doi.org/10.1016/j.clinph.2008.09.007Get rights and content

Abstract

Over the last ten years blind source separation (BSS) has become a prominent processing tool in the study of human electroencephalography (EEG). Without relying on head modeling BSS aims at estimating both the waveform and the scalp spatial pattern of the intracranial dipolar current responsible of the observed EEG. In this review we begin by placing the BSS linear instantaneous model of EEG within the framework of brain volume conduction theory. We then review the concept and current practice of BSS based on second-order statistics (SOS) and on higher-order statistics (HOS), the latter better known as independent component analysis (ICA). Using neurophysiological knowledge we consider the fitness of SOS-based and HOS-based methods for the extraction of spontaneous and induced EEG and their separation from extra-cranial artifacts. We then illustrate a general BSS scheme operating in the time-frequency domain using SOS only. The scheme readily extends to further data expansions in order to capture experimental source of variations as well. A simple and efficient implementation based on the approximate joint diagonalization of Fourier cospectral matrices is described (AJDC). We conclude discussing useful aspects of BSS analysis of EEG, including its assumptions and limitations.

Introduction

Recent studies on human electroencephalogram (EEG) are based on the theory of brain volume conduction. It is well established that the generators of brain electric fields recordable from the scalp are macroscopic post-synaptic potentials created by assemblies of pyramidal cells of the neocortex (Speckmann and Elger, 2005). Pyramidal cells are aligned and oriented perpendicularly to the cortical surface. Their synchrony is possible thanks to a dense net of local horizontal connections (mostly <1 mm). At recording distances larger than about three/four times the diameter of the synchronized assemblies the resulting potential behaves as if it were produced by electric dipoles; all higher terms of the multipole expansion vanish and we obtain the often invoked dipole approximation (Lopes da Silva and Van Rotterdam, 2005, Nunez and Srinivasan, 2006). Three physical phenomena are important for the arguments we advocate in this study. First, unless dipoles are moving there is no appreciable delay in the scalp sensor measurement (Lopes da Silva and Van Rotterdam, 2005). Second, in brain electric fields there is no appreciable electro-magnetic coupling (magnetic induction) in the frequencies up to about 1 MHz, thus the quasi-static approximation of Maxwell equations holds throughout the spectrum of interest (Nunez and Srinivasan, 2006, p. 535-540). Finally, for source oscillations below 40 Hz it has been verified experimentally that capacitive effects are also negligible, implying that potential difference is in phase with the corresponding generator (Nunez and Srinivasan, 2006, p. 61). These phenomena strongly support the superposition principle, according to which the relation between neocortical dipolar fields and scalp potentials may be approximated by a system of linear equations (Sarvas, 1987). Whether this is a great simplification, we need to keep in mind that it does not hold true for all cerebral phenomena. Rather, it does at the macroscopic spatial scale we are interested in here.

A common approach to the study of human EEG is to describe patterns in space and time and link empirical findings with anatomical and physiological knowledge. The problem is characterized by high temporal resolution (about 1 ms) and low spatial resolution (several cm3). For example, it has been estimated that without time averaging about 60 million contiguous neurons must be synchronously active as to produce observable scalp potentials (Nunez and Srinivasan, 2006, p. 21). Such a cluster would realistically extend over several cm2 of cortical gyral surface, whereas disentangling fields emitted by cortical functional units may require much higher precision. Because of volume conduction, scalp EEG potentials describe a mixture of the fields emitted by several dipoles extending over large cortical areas. Practically, in order to improve the spatial resolution it is often necessary to trade in the temporal one operating some form of temporal averaging. In summary, the path followed by much of current EEG research is to “isolate” in space and time the generators of the observed EEG as much as possible, counteracting the mixing caused by volume conduction and maximizing the signal-to-noise ratio (SNR).

Over the years we have assisted to the development of several classes of methods to improve the spatial specificity. Those include, among others, surface and cortical Laplacian (Nunez and Srinivasan, 2006), equivalent dipole fitting (Mosher et al., 1992) and distributed minimum norm (model-driven) or minimum variance (data-driven) inverse solutions (Greenblatt et al., 2005, Lopes da Silva, 2004). Targeted attempts include sparsification approaches (Gorodnitsky et al., 1995, Cotter et al., 2005) and spatial filters known as beamformers (Rodrı´guez-Rivera et al., 2006, Congedo, 2006). Surface Laplacian methods apply a spatial high-pass filtering to the scalp potential by estimating their second spatial derivative. They tend to overemphasize high spatial frequency and radial (to the scalp surface) dipolar fields. Inverse solutions seek source localization in a chosen solution space and rely on geometrical models of the head tissue. Unfortunately, the accurate description of EEG volume conduction is complicated by inhomogeneity (resistivity varies with type of tissue) and anisotropy (resistivity varies in different directions); therefore source localization methods are inevitably undermined by geometrical modeling error.

Another approach that persists in EEG literature is blind source separation (BSS). First studied in our laboratory during the first half of the 80’s (Ans et al., 1985, Hérault and Jutten, 1986) BSS has enjoyed considerable interest worldwide only a decade later, inspired by the seminal papers of Jutten and Herault, 1991, Comon, 1994 and Bell and Sejnowski (1995). BSS has today greatly expanded encompassing a wide range of engineering applications such as speech enhancement, image processing, geophysical data analysis, wireless communication and biological signal analysis (Hyvärinen et al., 2001, Cichocki and Amari, 2002, Choi et al., 2005). Such ubiquity springs from the “blind” nature of the BSS problem formulation: no knowledge of volume conduction or of source waveform is assumed. The problem may be attacked from several perspectives; several hundred BSS algorithms have been proposed over the last 20 years with more added on every year. Typically, such methods are based on the cancellation of second order statistics (SOS) and/or of higher (than two) order statistics (HOS). Their commonality resides in the assumption of a certain degree of source spatial independence, which is precisely modeled by the cancellation of those statistics. Both HOS and SOS have been employed with success in EEG. They are today established for denoising/artifact rejection (Vigário, 1997, Jung et al., 2000, Vorobyov and Cichocki, 2002, Iriarte et al., 2003, Joyce et al., 2004, Kierkels et al., 2006, Fitzgibbon et al., 2007, Frank and Frishkoff, 2007, Halder et al., 2007, Phlypo et al., 2007, Romero et al., 2008, Crespo-Garcia et al., 2008), improving brain computer interfaces (Qin et al., 2004, Serby et al., 2005, Wang and James, 2007, Dat and Guan, 2007, Kachenoura et al., 2008) and for increasing the SNR of single-trial time-locked responses (Cao et al., 2002, Sander et al., 2005, Lemm et al., 2006, Tang et al., 2006, Guimaraes et al., 2007, Zeman et al., 2007). Yet, it appears that only four of the many existing algorithms have repeatedly occurred in EEG literature. They are known as FastICA (Hyvärinen, 1999), JADE (Cardoso and Souloumiac, 1993), InfoMax (Bell and Sejnowski, 1995) and SOBI (Belouchrani et al., 1997). FastICA, InfoMax and JADE are ICA (HOS) methods, while SOBI is a SOS method. JADE and SOBI are solved by approximate joint diagonalization (Cardoso and Souloumiac, 1993, Pham, 2001b, Yeredor, 2002, Ziehe et al., 2004, Vollgraf and Obermayer, 2006, Li and Zhang, 2007, Fadaili et al., 2007, Degerine and Kane, 2007), a powerful algebraic tool which allows promising extensions that we will consider in this study.

Section snippets

The BSS problem for the brain

For N scalp sensors and M  N EEG dipolar fields with fixed location and orientation in the analyzed time interval, the linear BSS model simply states the superposition principle discussed above, i.e.,v(t)=As(t)+η(t),where v(t)  RN is the sensor measurement vector, A  RN·M is a time-invariant full column rank mixing matrix, s(t)  RM holds the time-course of the source components and η(t)  RN is additive noise, temporally white, possibly uncorrelated to s(t) and with spatially uncorrelated components.

A suitable class of solutions to the brain BSS problem

To tackle problem (1.1) assuming knowledge of sensor measurement only we need to reduce the number of admissible solutions. In this paper we are interested in weak restrictions converging toward conditionsˆ(t)=Gs(t),where s(t) holds the time-course of the true (unknown) source processes and the system matrixG=BˆAΛPapproximates a signed scaling (a diagonal matrix Λ) and raw permutation (P). Eq. (1.2) is obtained substituting (1.0) in (1.1) ignoring the noise term in the former. Whether

Different approaches for solving the source separation problem

It has been known for a long time that in general the BSS problem cannot be solved for sources that are Gaussian, independent and identically distributed (iid) (Darmois, 1953). The iid condition implies that each sample of the source components is statistically independent from the others and that they all follow the same probability distribution. Therefore, in order to solve the BSS problem the sources must be either (1) possibly iid, but non-Gaussian or (2) not iid. In case (1), one assumes

SOS vs. HOS: statistical considerations

Joyce et al. (2004) reports that successful separation of EEG data can be achieved using as few as 100 data points using SOBI (SOS) and 1000 or more using ICA (HOS) algorithms. This is a known advantage of SOS-based BSS methods; an higher statistical efficiency allows performing BSS on shorter time intervals, which is a safe strategy to prevent serious departures from the linear instantaneous model assumptions (see discussion). It has also been suggested that SOS estimations are more robust

The hypothesis of spatial independence

Human neocortex is a prodigious net of local and global interconnections. There are about as many neurons (1010) as cortico-cortical fibers connecting them in the 1-15 cm range (Nunez and Srinivasan, 2006, p. 7). Dense and sometimes distributed connections exist between the neocortex and sub-cortical structures as well. Therefore, one may ask if assuming independent time course of cortical cell assemblies is reasonable. It has been speculated that forcing independence of the BSS output may

Approximate joint diagonalization

The class of SOS BSS methods we are considering is consistently solved by approximate joint diagonalization algorithms (Cardoso and Souloumiac, 1993, Wax and Sheinvald, 1997, Pham, 2001b, Yeredor, 2002, Ziehe et al., 2004, Vollgraf and Obermayer, 2006, Li and Zhang, 2007, Fadaili et al., 2007, Degerine and Kane, 2007). Given a set of matrices {Q1, Q2, …}, the AJD seeks a matrix B^ such that the products B^Q1B^T,B^Q2B^T, are as diagonal as possible (subscript “T” indicates matrix

SOS BSS methods solved by approximate joint diagonalization

The first proposed SOS method (Féty and Uffelen, 1988, Tong et al., 1991b) exploited signal coloration. It consisted on joint diagonalization of two matrices, the covariance matrix and a lagged covariance matrix, allowing an exact solution via the well-known generalized eigenvalue-eigenvector decomposition (Choi et al., 2002, Parra and Sajda, 2003). The corresponding procedure for exploiting energy time variation traces back to the work of Souloumiac (1995); if the energy of a source component

Time-frequency expansions

Source separation methods can be applied in different representation spaces. In fact, applying to (1.0) any invertible and linearity-preserving transform T leads toT[v(t)]=AT[s(t)],which preserves the mixing model. Then, solving source separation in the transformed space still provides estimation of the matrix A or of its inverse B, which can be used directly in Eq. (1.1) for recovering the source s(t) in the initial space. For example, the transform T may be a discrete Fourier transform, a

Approximate joint diagonalization of cospectra (AJDC): an extended time-frequency approach

Without loss of generality, the AJDC solution to the BSS problem (1.1) can be written compactly such asB^=AJD(C),where C: {C1, C2, …} is the diagonalization set, i.e., a set of estimated Fourier cospectral matrices to be simultaneously diagonalized. The rational behind AJDC is expressed schematically in Fig. 1. Each cube of the parallelepiped in the figure represents abstractly a cospectral matrix. The grid of cubes represents the sampling of some source property unfolding along two continuous

Discussion

Blind source separation (BSS) is a widespread method used in a number of scientific and technical fields (Hyvärinen et al., 2001, Cichocki and Amari, 2002). Its use in EEG literature is currently growing at a fast pace. When applied to EEG data BSS decomposes scalp signals in a number of components. These components may correspond to the activity of cortical dipole layers generating the observed EEG. Precisely, BSS implicitly estimates their orientation and explicitly estimates their waveform

Acknowledgements

This Research has been partially supported by the French National Research Agency (ANR) within the National Network for Software Technologies (RNTL), project Open-ViBE (“Open Platform for Virtual Brain Environments”), Grant # ANR05RNTL01601, by the European COST Action B27 “Electric Neuronal Oscillations and Cognition”. During the period of the research the first author has been partially supported by Nova Tech EEG, Inc., Knoxville, TN and the second author by the French Ministry of Defense

References (126)

  • D.-T. Pham

    Blind separation of instantaneous mixture of sources via the Gaussian mutual information criterion

    Signal Process

    (2001)
  • G. Pfurtscheller et al.

    Event-related EEG/MEG synchronization and desynchronization: basic principles

    Clin Neurophysiol

    (1999)
  • S. Romero et al.

    A comparative study of automatic techniques for ocular artifact reduction in spontaneous EEG signals based on clinical target variables: a simulation case

    Comput Biol Med

    (2008)
  • R. Srinivasan et al.

    Source analysis of EEG oscillations using high-resolution EEG and MEG

    Prog Brain Res

    (2006)
  • A. Aissa-El-Bey et al.

    Underdetermined blind separation of nondisjoint sources in the time-frequency domain

    IEEE Trans Signal Process

    (2007)
  • Ans B, Hérault J, Jutten C. Adaptive neural architectures: detection of primitives. In: Proc COGNITIVA 1985. p....
  • G. Barbati et al.

    Functional source separation from magnetoencephalographic signals

    Hum Brain Mapp

    (2006)
  • G. Barbati et al.

    Functional source separation applied to induced visual gamma activity

    Hum Brain Mapp

    (2008)
  • A.J. Bell et al.

    An information-maximization approach to blind separation and blind deconvolution

    Neural Comput

    (1995)
  • A. Belouchrani et al.

    A blind source separation technique using second-order statistics

    IEEE Trans Signal Process

    (1997)
  • A. Belouchrani et al.

    Blind source separation based on time-frequency signal representations

    IEEE Trans Signal Process

    (1998)
  • J. Beran

    Statistics for long-memory processes

    (1994)
  • P. Bloomfield

    Fourier analysis of time series

    (2000)
  • A. Bousbia-Salah et al.

    A one step time-frequency blind identification

    7th Int Symp Sig Process Appl

    (2003)
  • G. Buzsáki

    Rhythms of the brain

    (2006)
  • J.F. Cardoso et al.

    Blind beamforming for non-Gaussian signals

    IEE Proc-F (Radar and Signal Process)

    (1993)
  • Cardoso J-F. On the performance of orthogonal source separation algorithms. In: Proc EUSIPCO, Edinburg (UK) 1994. p....
  • J.-F. Cardoso

    Blind signal separation: statistical principles

    IEEE Proc

    (1998)
  • Cardoso J-F. Multidimensional independent component analysis. In: Proc ICASSP, Seattle, USA; p. 1998b....
  • J.-F. Cardoso

    High-order contrasts for independent component analysis

    Neural Comput

    (1999)
  • S. Choi et al.

    Blind separation of nonstationary sources in noisy mixtures

    Electron Lett

    (2000)
  • S. Choi et al.

    Second order nonstationary source separation

    J VLSI Sig Process

    (2002)
  • S. Choi et al.

    Blind source separation and independent component analysis: a review

    Neural Inf Process Lett Rev

    (2005)
  • A. Cichocki et al.

    Adaptive blind signal and image processing. learning algorithms and applications

    (2002)
  • M. Congedo

    Subspace projection filters for real-time brain electromagnetic imaging

    IEEE Trans Biomed Eng

    (2006)
  • S. Cotter et al.

    Sparse solutions to linear inverse problems with multiple measurement vectors

    IEEE Trans Signal Process

    (2005)
  • M. Crespo-Garcia et al.

    Muscle artifact removal from human sleep EEG by using independent component analysis

    Ann Biomed Eng

    (2008)
  • G. Darmois

    Analyse générale des liaisons stochastiques

    Rev Inst Inter Stat

    (1953)
  • T.H. Dat et al.

    Feature selection based on fisher ratio and mutual information analyses for robust brain computer interface

    Proc ICASSP

    (2007)
  • W. De Clercq et al.

    Canonical correlation analysis applied to remove muscle artifacts from the electroencephalogram

    IEEE Trans Biomed Eng

    (2006)
  • S. Dégerine et al.

    Second-order blind separation of sources based on canonical partial innovations

    IEEE Trans Signal Process

    (2000)
  • S. Degerine et al.

    A comparative study of approximate joint diagonalization algorithms for blind source separation in presence of additive noise

    IEEE Trans Signal Process

    (2007)
  • Deville Y. Temporal and time-frequency correlation-based blind source separation methods. In: Proc ICA2003, Nara,...
  • Doron E, Yeredor A. Asymptotically Optimal Blind Separation of Parametric Gaussian Sources. In: Proc ICA 2004, Puntonet...
  • M. Dyrholm et al.

    Model selection for convolutive ICA with an application to spatiotemporal analysis of EEG

    Neural Comput

    (2007)
  • E.M. Fadaili et al.

    Nonorthogonal joint diagonalization/zero diagonalization for source separation based on time-frequency distributions

    IEEE Trans Signal Process

    (2007)
  • Féty L, Uffelen J-P. New methods for signal separation. In: Proc 14th conf HF Radio Syst Tech, London; 1988. p....
  • Févotte C, Theis FJ. Pivot selection strategies in jacobi joint block-diagonalization. In: Proc. ICA, London, U.K 2007;...
  • S.P. Fitzgibbon et al.

    Removal of EEG noise and artifact using blind source separation

    J Clin Neurophysiol

    (2007)
  • M. Frigo et al.

    The design and implementation of FFTW3

    Proc IEEE

    (2005)
  • Cited by (116)

    • RANDOMIZED JOINT DIAGONALIZATION OF SYMMETRIC MATRICES

      2024, SIAM Journal on Matrix Analysis and Applications
    View all citing articles on Scopus
    View full text