Performance analysis of wavelet transforms and morphological operator-based classification of epilepsy risk levels

Harikumar, Rajaguru; Vijayakumar, Thangavel

doi:10.1186/1687-6180-2014-59

Research
Open access
Published: 03 May 2014

Performance analysis of wavelet transforms and morphological operator-based classification of epilepsy risk levels

Rajaguru Harikumar¹ &
Thangavel Vijayakumar¹

EURASIP Journal on Advances in Signal Processing volume 2014, Article number: 59 (2014) Cite this article

2493 Accesses
4 Citations
1 Altmetric
Metrics details

Abstract

The objective of this paper is to compare the performance of singular value decomposition (SVD), expectation maximization (EM), and modified expectation maximization (MEM) as the postclassifiers for classifications of the epilepsy risk levels obtained from extracted features through wavelet transforms and morphological filters from electroencephalogram (EEG) signals. The code converter acts as a level one classifier. The seven features such as energy, variance, positive and negative peaks, spike and sharp waves, events, average duration, and covariance are extracted from EEG signals. Out of which four parameters like positive and negative peaksand spike and sharp waves, events and average duration are extracted using Haar, dB2, dB4, and Sym 8 wavelet transforms with hard and soft thresholding methods. The above said four features are also extracted through morphological filters. Then, the performance of the code converter and classifiers are compared based on the parameters such as performance index (PI) and quality value (QV).The performance index and quality value of code converters are at low value of 33.26% and 12.74, respectively. The highest PI of 98.03% and QV of 23.82 are attained at dB2 wavelet with hard thresholding method for SVD classifier. All the postclassifiers are settled at PI value of more than 90% at QV of 20.

1 Introduction

The electroencephalogram (EEG) is a measure of cumulative firing of neurons in various parts of the brain [1]. It contains information regarding changes in the electrical potential of the brain obtained from a given set of recording electrodes. These data include the characteristic waveforms with accompanying variations in amplitude, frequency, phase, etc., as well as brief occurrence of electrical patterns such as spindles, sharps, and spike waveforms [2]. EEG patterns have shown to be modified by a wide range of variables including biochemical, metabolic, circulatory, hormonal, neuroelectric, and behavioral factors in [3]. In the past, the encephalographer, by visual inspection, was able to qualitatively distinguish normal EEG activity from either the localized or generalized abnormalities contained within relatively long EEG records [4]. The most important activity possibly detected from the EEG is the epilepsy [5]. Epilepsy is characterized by an uncontrolled excessive activity or potential discharge by either a part or all of the central nervous system [5]. The different types of epileptic seizures are characterized by different EEG waveform patterns [6]. With real-time monitoring to detect epileptic seizures gaining widespread recognition, the advent of computers has made it possible to effectively apply a host of methods to quantify the changes occurring based on the EEG signals [4]. The EEG is an important clinical tool for diagnosing, monitoring, and managing neurological disorders related to epilepsy [7]. This disorder is characterized by sudden recurrent and transient disturbances of mental function and/or movements of body that results in excessive discharge group of brain cells [8]. The presence of epileptiform activity in the EEG confirms the diagnosis of epilepsy, which sometimes may be confused with other disorders producing similar seizure-like activity [9]. Between seizures, the EEG of a patient with epilepsy may be characterized by occasional epileptic form transients-spikes and sharp waves [10]. Seizures are featured by short episodic neural synchronous discharges with considerably enlarged amplitude. This uneven synchrony may happen in the brain accordingly, i.e., partial seizures can be visible only in few channels of the EEG signal or generalized seizures, that are seen in every channel of the EEG signal involving the whole brain [11]. Epileptic seizure is an abnormality in EEG gathering and is featured by short and episodic neuronal synchronous discharges with severely high amplitude. This anomalous synchrony may happen in the brain locally (partial seizures) and is visible only in fewer channels of the EEG signal or including the entire brain, i.e., visible in all the channels of the EEG signal [12].

1.1 Related works

In the last three decades, the analysis and classification of epilepsy from EEG signal has become a fascinating research. A huge volume of research has already been performed which includes spike detection, classification epilepsy seizures, ictal and inter ictal analysis, nonlinear and linear analysis and soft computing methods. Gotman [9] discussed the improvement of epileptic seizure detection and evaluation. Pang et al. [10] summarized the history and evaluation of various spike detecting algorithms. The authors in [13] have discussed the different neural networks as a function approximation and universal approximation for epilepsy diagnosis. Rezasarang [14] encapsulated the performance of spike detecting algorithms in terms of sensitivity, specificity, and average detection. Rezasarang [14] orders the performance of spike detecting algorithms in terms of good detection ratio (GDR). McSharry et al. [8] discussed and enumerated the nonlinear methods and its relevance to predict epilepsy by considering EEG samples as time series. Majumdar [15] reviews various soft computing approaches of EEG signals which emphasize more on pattern recognition techniques. The paper [15] mainly focuses on dimensionality reduction, SNR problems, linear and soft computing techniques for EEG signal processing. Majumdar concludes that the neural network and Bayesian approaches are two popular choices even though linear statistical discriminants are easier to implement. Great deals of support vector machines (SVM) are also discussed in this paper for their classification accuracy. Hence, the EEG signal occupies a great deal of data regarding the work of the brain. However, classification and estimation of the signals are inadequate. As there is no explicit category suggested by the experts, visual examination of EEG signals in time domain may be deficient. Routine clinical diagnosis necessitates the analysis of EEG signals [13]. Hence, automation and computer methods have been utilized for this reason. Current multicenter clinical analysis indicates confirmation of premonitory symptoms in 6.2% of 500 patients with epilepsy [16]. Another interview-based study found that 50% of 562 patients felt ‘auras’ before seizures. Those clinical data provide a motivation to search for pre-monitoring alterations on EEG recordings from the brain and to employ a device that can act without human intervention to forewarn the patient [17]. On the other hand, despite decades of research, existing techniques do not yield to better performance. This paper addresses the application and comparison of singular value decomposition (SVD), expectation maximization (EM), and modified expectation maximization (MEM) classifiers towards optimization of code converter outputs in the classification of epilepsy risk levels.

Weber et al. [18] have proposed the three-stage design of an EEG seizure detection system. The first stage of the seizure detector compresses the raw data stream and transforms the data into variables which represent the state of the subject's EEG. These state measures are referred to as context parameters. The second stage of the system is a neural network that transforms the state measures into smaller number of parameters that are intended to represent measures of recognized phenomena such as small seizure in the EEG [9, 10]. The third stage consists of a few simple rules that confirm the existence of the phenomena under consideration. Similarly, this paper also presents a three-stage design for epilepsy risk level classification. The first stage extracts the seven required distinct features from raw EEG data stream of the patient in time domain. The next stage transforms these features into a code word through a code converter with seven alphabets which represents the patient's state in five distinct risk levels for a 2-s epoch of EEG signal per channel. The last stage is a SVD, EM, or MEM which optimizes the epilepsy risk level of the patient. The organization of the paper is as follows. Section 1 introduces the paper and materials, and its methods are discussed in Section 2. Section 3 describes about the SVD, EM, and MEM as postclassifiers for epilepsy risk level classification. Results are discussed in Section 4, and the paper is concluded in Section 5.

2 Materials and methods

2.1 Data acquisition of EEG signals

For a comparative study and to analyze the performance of the pre- and postclassifiers, we have obtained the raw EEG data of 20 epileptic patients in European data format (EDF) who underwent treatment in the Neurology Department of Sri Ramakrishna Hospital, Coimbatore. An issue that has been given great attention is the preprocessing stage of the EEG signals because it is important to use the best technique to extract the useful information embedded in the nonstationary biomedical signals. The obtained EEG records were continuous for about 30 s, each of them were divided into epochs of 2-s duration. A 2-s epoch is long enough to detect any significant changes in activity and presence of artifacts and also short enough to avoid any redundancy in the signal [19]. For a patient, there are 16 channels over three epochs. Having a frequency of 50 Hz, each epoch was sampled at a frequency of 200 Hz. Each sample corresponds to the instantaneous amplitude values of the signal, totaling to 400 values for an epoch. Figure 1 shows the model of the flow diagram of epilepsy risk level classification system. Four types of artifacts were present in our data. They included eye blink, electromyography (EMG) artifact, and chewing and motion artifacts [20]. Approximately, 1% of the data was artifacts. We did not make any attempt to select certain number of artifacts and of a specific nature. The objective of including artifacts was to have spikes versus nonspike categories of waveforms. The latter could be a normal background EEG and/or artifacts [21]. In order to train and test the feature extractor and classifiers, we need to select a suitable segment of EEG data. In our experiment, the training and testing were selected through a short sampling window and all EEG signals were visually examined by a qualified EEG technologist. A neurologist's decision regarding EEG features (or normal EEG segment) was used as the gold standard. We choose a sample window of 400 points corresponding to 2 s of the EEG data. This width can cover almost all types of transient epileptic patterns in the EEG signal, even though seizure often lasts longer [22].

In order to classify the risk level of the patients, certain parameters were chosen which are detailed below:

1.
For every epoch, the energy is calculated as [4]
$E = \sum_{i = 1}^{n} x_{i}^{2}$
(1)

where x_i is the signal sample value and n is the number of such samples.

2.
One of the simplest linear statistics that may be used for investigating the dynamics of underlying the EEG is the variance of the signal calculated in consecutive nonoverlapping windows. The variance (σ) is given by
$σ^{2} = \frac{\sum_{i = 1}^{n} {(x_{i} - μ)}^{2}}{n}$
(2)

where μ is the average amplitude of the epoch.

3.
For the average variance, the covariance of duration is determined by using the equation below:
$CD = \frac{\sum_{i = 1}^{p} {(D - t_{i})}^{2}}{p D^{2}}$
(3)

The following are the four parameters which are extracted using morphological filters and wavelet transforms:

D = \frac{\sum_{i = 1}^{p} t_{i}}{p}

(4)

where t_i is the peak to peak duration and p is the number of such durations.

1.
The total number of positive and negative peaks is found above the threshold.
2.
For a zero crossing function, if it lies between 20to 70 ms, then the spikes can be detected. If the zero crossing function lies between 70to 200 ms then the sharp waves are detected when the zero crossing function lies between 70 to 200 ms.
3.
After having detected, the total number of spikes and sharp waves were determined as the events.
4.
The duration for these waves is determined by the relation:

2.2 Wavelet transforms for feature extraction

The brain signals are nonstationary in nature. In order to capture the transients and events of the waveforms, we are in dire state to visualize the time and frequency simultaneously. Hence, the wavelet transforms are the better choice to extract the transient features and events from the EEG signals. The wavelet transform-based feature extraction is discussed as follows:

Let us consider a function f (t). The wavelet transform of this function is defined as [23]

wf (a, b) = \int_{- \infty}^{\infty} f (t) ψ_{a, b}^{*} (t) dt

(5)

where ψ* (t) is the complex conjugate of the wavelet function ψ (t).

With the set of the analyzing function, the wavelet family is deduced from the mother wavelet ψ (t) by [24]

ψ_{a, b}^{*} (t) = \frac{1}{\sqrt{2}} ψ (\frac{t - b}{a})

(6)

where a is the dilation parameter and b is the translation parameter.

The feature extraction process is initialized by studying the effect of simple Haar threshold. The Haar wavelet function can be represented as [25].

ψ (t) = \{\begin{cases} 1; 0 \leq t < 1 / 2 \\ - 1; 1 / 2 \leq t < 1 \\ 0 : otherwise \end{cases}

(7)

Wavelet thresholding is a signal estimation technique that exploits the capabilities of wavelet transform for signal denoising or smoothing. It depends on the choice of a threshold parameter which determines to a great extent the efficacy of denoising.

ρ_{T} (x) = \{\begin{array}{c} x, if |x| > T \\ 0, if |x| \leq T \end{array}

(8)

where T is the threshold level.

Typical threshold operators for denoising include hard threshold, soft threshold, and affine (firm) threshold. Hard threshold is defined as [24]. Soft thresholding (wavelet shrinkage) is given by

ρ_{T} (x) = \{\begin{array}{c} x - T, if (x \geq T) \\ x + T, if (x \leq T) \\ 0, if |x| < - T \end{array}

(9)

Haar, Db2, Db4, and Sym8 wavelets with hard thresholding and four types of soft thresholding methods such as heursure, minimax, rigsure, and sqtwolog are used to extract the parameters from EEG signals. With the help of an expert's knowledge and our experiences with the references [5, 20, 26], we have identified the following parametric ranges for five linguistic risk levels (very low, low, medium, high, and very high) in the clinical description for the patients which is shown in Table 1.

Table 1 Parameter ranges for various risk levels

Performance analysis of wavelet transforms and morphological operator-based classification of epilepsy risk levels

Abstract

1 Introduction

1.1 Related works

2 Materials and methods

2.1 Data acquisition of EEG signals

2.2 Wavelet transforms for feature extraction

2.3 Code converter as a preclassifier

2.4 Rhythmicity of code converter

2.5 Morphological filtering for feature extraction of EEG signals

3. Singular value decomposition, expectation maximization, and modified EM as postclassifier for classification of epilepsy risk levels

3.1 SVD theorem

3.2 Expectation maximization as a postclassifier

3.3 Modified expectation maximization algorithm

4. Results and discussion

4.1 Performance index

4.2 Quality value

5 Conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ original submitted files for images

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Rights and permissions

About this article

Cite this article

Share this article

Keywords