Classification of heart sounds using an artificial neural network

doi:10.1016/S0167-8655(02)00281-7

Pattern Recognition Letters

Volume 24, Issues 1–3, January 2003, Pages 617-629

https://doi.org/10.1016/S0167-8655(02)00281-7 Get rights and content

Abstract

A novel method is presented for the classification of heart sounds (HSs). Wavelet transform is applied to a window of two periods of HSs. Two analyses are realized for the signals in the window: segmentation of the first and second HSs, and extraction of the features.

After the segmentation, feature vectors are formed by using the wavelet detail coefficients at the sixth decomposition level. The best feature elements are analyzed by using dynamic programming. Grow and learn (GAL) network and linear vector quantization (LVQ) network are used for the classification of seven different HSs.

It is observed that HSs of patients are successfully classified by the GAL network compared to the LVQ network.

Introduction

Auscultation is a technique in which a stethoscope is used to listen to the sounds of a body. The structural defects of the heart are often reflected in the sounds the heart produces. Physicians use the stethoscope as a device to listen to a patient’s heart and make a diagnosis accordingly. They are particularly interested in abnormal sounds, which may suggest the presence of a cardiac pathology and also provide diagnostic information. For instance, a very important type of abnormal sound is the “murmur”, which is a sound caused by the turbulent flow of blood in the cardiovascular system. The timing and pitch of a murmur are of significant importance in the diagnosis of a heart condition, for example, murmurs during diastole are signs of malfunctioning of heart valves but murmurs during systole may correspond to either a pathological or healthy heart, depending on the acoustic characteristics of the murmurs.

Time–frequency/scale methods have been applied to characterize heart sounds (Debjais et al., 1997; Bently, 1996). In previous publications, the authors have discussed the characterization of heart murmurs using time–frequency methods over a number of cardiac cycles (Leung et al., 1998; Yoshida et al., 1997) and showed that features hence obtained were suitable for classification (Leung et al., 1999).

In this study, wavelet transform is proposed to analyze heart sounds (HSs) in time and frequency domains simultaneously. Each class of HSs contains characteristic and distinctive information that exists in time and frequency domains. Feature vectors are formed by using wavelet transform. Classification performance highly decreases if the right feature space is not constituted. Artificial neural networks (ANNs) are used as classifiers to increase the classification performance. The most prominent advantages of using an ANN as a classifier are: (i) Weights representing the solution are found by iteratively training, (ii) ANN has a simple structure for physical implementation, (iii) ANN can easily map complex class distributions, and (iv) generalization property of the ANN produces appropriate results for the input vectors that are not present in the training set.

In the literature, it is observed that multi-layer perceptron (MLP) (Lippmann, 1987) is widely used in the recognition of patterns with neural networks (Miller et al., 1992) and is also used to classify heart sounds (Barschdorff et al., 1995; Liang and Hartimo, 1998). One major problem encountered in MLP is its back-propagation algorithm (an iterative scheme) which takes too long time during learning. The second problem is the structure of the network, i.e., the number of hidden units and their interconnections, is defined by the programmer and the learning rule can modify only the connection weights. There is no rule which allows one to determine the necessary structure from a given application or training set. Lastly, the MLP may be caught by local minima, which decreases network performance.

In this study, an incremental and competitive learning network is proposed to handle the problems mentioned above and to increase the classification performance of HSs. In the literature, linear vector quantization (LVQ) and adaptive rezonance theory (ART) can be seen as the most basic schemes of the competitive learning network. A major advantage of the LVQ network is its fast learning speed. The major disadvantages are; it is not an incremental network, and the network generates feature vectors throughout the inside of a class homogeneously rather than concentrating them on the boundaries between classes. This causes the generation of an excessive number of feature vectors. ART2 (Carpenter and Grossberg, 1987) is a neural network that self-organizes stable recognition code patterns in real-time in response to arbitrary sequences of input patterns. The classification performance of the ART2 may decrease when the vigilance value is not carefully chosen. There is no direct way for choosing appropriate vigilance value, and a trial-and error process is usually time-consuming. The problem may be solved by using ‘slow learning mode’, but the learning speed of the ATR2 network slows down considerably. With ‘fast learning’ mode, the learning and clustering speed of ART2 models may improve; however, the non-centroid computation sometimes causes problems on the clustering results. The learning of a new pattern in ART2 tends to overwrite the previously stored information (Chin-Der and Stelios, 1997).

It is observed that incremental networks are widely used in the literature (Berlich et al., 1996; Burzevski and Mohan, 1996; Bruske and Sommer, 1995; Martinetz et al., 1993; Martinetz and Schulten, 1994; Fritzke, 1994, Fritzke, 1995). A number of approaches, advanced from SOM, have been proposed to achieve the objectives of retaining both the topology preserving and clustering properties. Fritzke (Fritzke, 1995) proposed a growing cell structure (GCS) for self-organizing clustering and topology preserving. The GCS approach starts the self-organizing with a k-dimensional simplex that is distributed over the input manifold. The GCS conditionally adds new nodes and removes old nodes based on a heuristic criterion that takes the relative winning frequency of a node or accumulated error into account, which is called a ‘resource’ of the output nodes. The resources of the winner and its neighbors determine the location of the added nodes. By computer simulation, Fritzke showed that output maps could be formed that resemble the topological structure of the input data in many different cases. To its simplicity, the competitive Hebbian rule has been used for topology learning in the growing neural gas (GCS) (Fritzke, 1994) and dynamic cell structure (Bruske and Sommer, 1995). However, these algorithms add and delete nodes based on the ‘resource’ used in GCS. This has created some complexity in its implementation.

In this study, grow and learn (GAL) is proposed as an incremental and competitive learning network to increase the classification performances of heart sounds. In a previous study (Ölmez et al., 1998) it is observed that GAL has fast training and classification, implementation simplicity, and satisfactory performance. Hence in order to carry out HS classification in real-time, we preferred to use GAL as an incremental neural network.

Section snippets

Methods

Decision making is performed in four stages: Segmentation of the first and second HSs, normalization process, feature extraction, and classification by the artificial neural network.

Firstly, a window is formed by the discrete data that contains two periods of HSs. Then, positions of the first (S1) and the second (S2) HSs within the window are determined (Huiying et al., 1997) by using wavelet detail coefficients at the sixth decomposition level.

By selecting S1 as the starting point, a new

Artificial neural networks

MLP (Lippmann, 1987) is frequently used in biomedical signal processing (Miller et al., 1992; Leung et al., 2000; Ölmez et al., 1998; Dokur and Ölmez, 2001; Dokur et al., 1998). It is observed that MLP has three disadvantages: (i) back-propagation algorithm takes too long time during the learning, (ii) the number of nodes in the hidden layers must be defined before the training (the structure is not automatically determined by the training algorithm), (iii) back-propagation algorithm may be

Computer simulations

In this study, HSs are categorized into seven classes: aortic stenosis, mitral regurgitation, mitral stenosis, pulmonary stenosis, aortic regurgitation, summation gallop, and normal. 28 subjects, each four subjects having the same type of HSs, are involved in the study. Therefore, there are 28 (4×7) records for the analysis of the HSs. Each record contains 12 periods of HSs. Training set contains 336 (28×12) feature vectors, 48 (28×12/7=48) feature vectors belonging to each class. Test set is

Conclusions

It is observed that four (Leung et al., 1999) and six (Leung et al., 2000) different HSs are classified by using time–frequency analysis. In the former study, fifteen features were extracted from the murmurs of four groups of patients. The features represented the energy distribution across the time–frequency plane. These features were used to train a two-dimensional SOM. However, satisfactory classification performances were not obtained. In the latter study, six different heart sounds were

References (30)

R. Berlich et al.
A comparison between the performance of feed forward neural networks and the supervised growing neural Gas algorithm
Nuclear Instruments and Methods in Physics Research A
(1997)
Z. Dokur et al.
ECG beat classification by a novel hybrid neural network
Computer Methods & Programs in Biomedicine
(2001)
Z. Dokur et al.
Segmentation of ultrasound images by using a hybrid neural network
Pattern Recognition Letters
(2002)
T. Martinetz et al.
Topology representation networks
Neural Networks
(1994)
Alpaydın, E., 1990. Neural models of incremental supervised and unsupervised learning. PhD Thesis, Ecole Polytechnique...
Barschdorff, D., Ester, S., Most, E., 1995. Phonocardiogram analysis of congenital and acquired heart diseases using...
J. Basak et al.
A connectionist model for category perception: theory and implementation
IEEE Transactions on Neural Networks
(1993)
Bently, P.M., 1996. Time–frequency analysis of native and prosthetic heart valve sounds. PhD Thesis, Electrical and...
Berlich, R., Kunze, M., Steffens, J., 1996. A comparison between the performance of feed forward neural networks and...
J. Bruske et al.
Dynamic cell structure learns perfectly topology preserving map
Neural Computation
(1995)

Burzevski, V., Mohan, C.K., 1996. Hierarchical growing cell structures, In: ICNN96: Proceedings of the International...

G.A. Carpenter et al.

ART2: self-organizing of stable category recognition codes for analog input patterns

Applied Optics

(1987)

W. Chin-Der et al.

A comparative study of self-organizing clustering algorithms Dignet and ATR2

Neural Networks

(1997)

Cohen, A., 1986. Biomedical Signal Processing, Vol. II, Boca Raton-Florida: CRC Press Inc.,...

I. Daubechies

Ten Lectures on Wavelets

(1994)

Cited by (130)

Phonocardiogram signal classification for the detection of heart valve diseases using robust conglomerated models
2023, Expert Systems with Applications
The diagnosis of cardiovascular diseases is quite important in the field of medical community. An important physiological signal of human body is heart sound and it arises due to the blood turbulence and pulsing of cardiac structures. For the early diagnosis of heart diseases, the analysis of heart sounds play an important role as they contain a huge quantity of pathological information associated with heart. To detect heart sounds, Phonocardiogram (PCG) is used as it is a highly useful and non-invasive technique and can be easily analyzed well. In this paper, some efficient models are proposed for the classification of PCG signals. Two important and robust conglomerated models are proposed initially, wherein the first strategy utilizes the concept of semi-supervised Non-negative Matrix Factorization (NMF) along with Brain Storming (BS) optimization algorithm and an advanced version of BS termed as Advanced BS (ABS) is proposed and then it is merged with Genetic Programming (GP) so that new algorithms such as BS-GP and ABS-GP are formed and finally the features selected through it are fed to classification through machine learning. The second strategy utilizes the concept of using three dimensionality reduction techniques along with Fuzzy C-means (FCM) clustering and then an Advanced Sine-Cosine (ASC) optimization algorithm with three different modifications is proposed for the purpose of feature selection and finally it is classified. Deep learning techniques were also employed in the study such as the usage of an Attention based Bidirectional Long Short-Term Memory (A-BLSTM), Ordinal Variational Autoencoder (O-VAE), Conditional Variational Autoencoders (CVAE), Hyperspherical CVAE (H-CVAE) and the Restricted Boltzmann Machine based Deep Belief Network (RBM-DBN) for the classification of PCG signals. The experiment is conducted on a publicly available dataset and results show that a high classification accuracy of 95.39% is obtained for the semi-supervised NMF concept with ABS-GP technique and Support Vector Machine (SVM) classifier.
Reduced features set neural network approach based on high-resolution time-frequency images for cardiac abnormality detection
2022, Computers in Biology and Medicine
A suitable temporal and spectral processing of the electrocardiogram (ECG) signals can facilitate the visual interpretation and discrimination between known patterns for classification. This paper proposes a non-invasive hybrid neural network and time-frequency (TF) based method to detect and classify commonly found cardiac abnormalities in ECG signals including congestive heart failure, ventricular tachyarrhythmia, intracardiac atrial fibrillation, arrhythmia, malignant ventricular ectopy, normal sinus rhythm, and postictal heart rate oscillations in partial epilepsy. Non-stationary raw ECG signals are collected from an online healthcare dataset source ‘PhysioBank’ that contains physiologic signals. These temporal signals are processed through Wigner-Ville distribution to produce high-resolution and concentrated TF images depicting specific visual patterns of cardiac abnormalities. The TF images are used to extract the abnormality parameters with the help of medical experts with good diagnostic accuracy. Principal component analysis (PCA) is employed for feature reduction and important features selection from the ECG signals. The selected features are used for training the multilayer feed-forward artificial neural network (ANN) for detection and classification while training parameters like the number of epochs, activation functions, and the learning rate is suitably selected with appropriate stopping criteria. Experimental results demonstrate the effectiveness of the hybrid neural-TF approach using PCA for abnormality detection and classification.
Automatic diagnosis of multiple cardiac diseases from PCG signals using convolutional neural network
2020, Computer Methods and Programs in Biomedicine
Cardiovascular diseases are critical diseases and need to be diagnosed as early as possible. There is a lack of medical professionals in remote areas to diagnose these diseases. Artificial intelligence-based automatic diagnostic tools can help to diagnose cardiac diseases. This work presents an automatic classification method using machine learning to diagnose multiple cardiac diseases from phonocardiogram signals.
The proposed system involves a convolutional neural network (CNN) model because of its high accuracy and robustness to automatically diagnose the cardiac disorders from the heart sounds. To improve the accuracy in a noisy environment and make the method robust, the proposed method has used data augmentation techniques for training and multi-classification of multiple cardiac diseases.
The model has been validated both heart sound data and augmented data using n-fold cross-validation. Results of all fold have been shown reported in this work. The model has achieved accuracy on the test set up to 98.60% to diagnose multiple cardiac diseases.
The proposed model can be ported to any computing devices like computers, single board computing processors, android handheld devices etc. To make a stand-alone diagnostic tool that may be of help in remote primary health care centres. The proposed method is non-invasive, efficient, robust, and has low time complexity making it suitable for real-time applications.
Acoustic feature based unsupervised approach of heart sound event detection
2020, Computers in Biology and Medicine
This paper represents an unsupervised approach to detect the positions of S1, S2 heart sound events in a Phonocardiogram (PCG) recording. Insufficiency of correctly annotated heart sound database drives us to investigate unsupervised techniques. Gammatone filter bank features are used to characterize the spectral pattern of fundamental heart sound events from noise contaminated PCG data. An unsupervised spectral clustering technique is employed for segmentation of S1/S2 and non-S1/S2 heart sound events. A Feature winning score is computed to identify the S1/S2 and non-S1/S2 frames. Finally, time based threshold is applied to detect the accurate positions of S1 and S2 heart sounds. The performance of spectral clustering is compared with other clustering methods. The proposed method offers a maximum F1-score of 98% and 92.5% for normal and abnormal PCG data respectively on 2016 PhysioNet/CinC challenge dataset. The heart sound annotation algorithm provided by PhysioNet has been used as the ground truth after hand correction.
Machine learning in geo- and environmental sciences: From small to large scale
2020, Advances in Water Resources
In recent years significant breakthroughs in exploring big data, recognition of complex patterns, and predicting intricate variables have been made. One efficient way of analyzing big data, recognizing complex patterns, and extracting trends is through machine-learning (ML) algorithms. The field of porous media, and more generally geoscience, have also witnessed much progress, and recent progress in developing various ML techniques have benefitted various problems in porous media and geoscience across disparate scales. Thus, it is becoming increasingly clear that it is imperative to adopt advanced ML methods for the problems in porous media and geoscience because they enable researchers to solve many difficult problems. At the same time, one can use the already existing extensive knowledge of porous media to endow ML algorithms and develop novel physics-guided methods. The goal of this review paper is to provide the first comprehensive review of the recently developed methods in the ML algorithms and describe their application to porous media and geoscience. Thus, we review the basic concept of the ML and describe more advanced methods, known as deep-learning algorithms. Then, the application of such methods to various problems in porous media and geoscience, such as hydrological modeling, fluid flow in porous media, and (sub)surface characterization, are reviewed. We also provide a discussion of future directions in this rapidly developing field.
An efficient heart sound segmentation approach using kurtosis and zero frequency filter features
2020, Biomedical Signal Processing and Control
This paper proposes an efficient heart sound segmentation method for automatic detection of heart sounds. In this method, the abrupt change at the heart sound locations is considered as a cue factor for segmentation. The phonocardiogram signal is analysed by passing kurtosis of the signal envelope through zero frequency filter (ZFF). The impulses at the locations of S1 and S2 in the filtered signal are used for the localization of heart sound. The performance of proposed method is evaluated on a real clinical dataset PhysioNet/CinC Challenge Heart Sound (PhysioNet/CinC). A set of 120 heart sound recordings, consisting of normal heart sound as well as pathological heart sound, is considered for evaluation. The experimental result shows that the proposed algorithm achieves an average sensitivity of 98.61%, average positive prediction of 99.11% and average overall accuracy of 98.07%. The robustness of the proposed algorithm is verified using additive white Gaussian noise and respiratory noise.

View all citing articles on Scopus

View full text

Classification of heart sounds using an artificial neural network

Abstract

Introduction

Section snippets

Methods

Artificial neural networks

Computer simulations

Conclusions

Nuclear Instruments and Methods in Physics Research A

Computer Methods & Programs in Biomedicine

Pattern Recognition Letters

Neural Networks

A connectionist model for category perception: theory and implementation

IEEE Transactions on Neural Networks

Dynamic cell structure learns perfectly topology preserving map

Neural Computation

ART2: self-organizing of stable category recognition codes for analog input patterns

Applied Optics

A comparative study of self-organizing clustering algorithms Dignet and ATR2

Neural Networks

Ten Lectures on Wavelets