Epilepsy EEG classification using morphological component analysis

Mahapatra, Arindam Gajendra; Singh, Balbir; Wagatsuma, Hiroaki; Horio, Keiichi

doi:10.1186/s13634-018-0568-2

Published: 08 August 2018

Arindam Gajendra Mahapatra ORCID: orcid.org/0000-0002-8852-2297¹,
Balbir Singh^1,2,
Hiroaki Wagatsuma^1,3,4 &
Keiichi Horio¹

EURASIP Journal on Advances in Signal Processing volume 2018, Article number: 52 (2018)

Abstract

In this paper, we propose an application of sparsity-based morphological component analysis (MCA) to the classification of epileptic seizures from time-series electroencephalogram (EEG) recordings. MCA was employed to decompose EEG signal segments according to their morphology during epileptic events, using the undecimated wavelet transform (UDWT), the local discrete cosine transform (LDCT), and Dirac bases to form the over-complete dictionary. Frequency-modulated time-frequency features were extracted after applying the Hilbert transform. The root mean instantaneous frequency square (RMIFS) feature, its constituent parameters, and the ratio of those parameters are used in two different pairs for classification with a support vector machine (SVM), giving good results that are comparable with existing work.

1 Introduction

An epileptic seizure can be regarded as hyperactivity of a neural subnetwork that disrupts brain function for a period of a few seconds to several minutes [1]. Epileptic seizures are classified broadly according to their causes and their symptoms or signs [2]. EEG has been used for the early diagnosis and detection of seizures because it carries valuable, complex information about brain activity. Manual inspection of a patient's EEG is time consuming and can be inaccurate. A seizure diagnosis and detection system that discriminates seizure data from nonseizure and interictal EEG, and so provides information for diagnosis, is therefore useful. A seizure detection or classification system consists mainly of two parts: first, the EEG is preprocessed, filtered, or decomposed for feature computation and extraction; second, the resulting features are classified by a supervised algorithm [3]. The decomposition and feature extraction in the first part play a pivotal role. Because the EEG is a graphical representation of the summed neuronal activity recorded by electrodes over the scalp, it is important to decompose it into the oscillatory modes that arise from different brain activities. Seizure detection requires features that differ prominently between different brain activities. Classification of seizure against healthy nonseizure EEG helps in diagnosing the occurrence of epileptic seizures in a subject, whereas classification of epileptic seizure (ictal) against interictal EEG (the period between two consecutive seizures) is important for seizure warning and detection systems [4]. In the past, various frequency-domain methods, such as those based on the Fourier transform, have been proposed and developed for seizure classification [5, 6]. Time-frequency methods based on the short-time Fourier transform (STFT) have also been used for this purpose [7]. In the STFT, the window size is the crucial factor that decides the tradeoff between frequency and time resolution [8]. Wavelet analysis [8, 9] and its variants have also been utilized; for example, Guo et al. [10] employed the discrete wavelet transform to pre-analyze EEG signals for epilepsy. Chen et al. [11] did similar work using the dual-tree complex wavelet transform (DTCWT) for decomposition and extracted features based on the logarithm of the fast Fourier transform (FFT). A nearest neighbor (NN) classifier was applied to the extracted features.

Various machine learning techniques have been used in conjunction with feature extraction to classify ictal against interictal and healthy nonseizure EEG. Feature extraction is an important part of this process and influences the discriminative power of the model [12]. Features such as approximate entropy (ApEn), together with an autoregressive model and principal component analysis (PCA), were applied by Liang et al. [4]. K nearest neighbor (KNN), support vector machine (SVM), least square support vector machine (LS-SVM), decision tree, and naive Bayes classifiers have been applied to features derived from the cross-correlation and power spectral density of signals [13, 14]. A genetic algorithm was used by Guo et al. [10] for feature extraction and classification from a feature database created with the discrete wavelet transform. An SVM assembly using median Teager energy and Lempel-Ziv entropy features, computed from five frequency sub-bands of a band-pass Gabor filter bank, is presented in [15]. The permutation entropy feature was used in [16] to create a seizure detection system. Complex-network-based ictal classification was done by Zhu et al. [17]. Temko et al. [18] used a total of 55 features from the time, frequency, energy, and entropy domains for the classification of epileptic seizures.

Empirical mode decomposition (EMD), introduced by Huang et al. [19], has been used for EEG decomposition and for feature computation and extraction in epilepsy classification. Features such as weighted frequency [20] and standard deviation, mean, variance, skew, and centroid [21, 22] have been extracted using EMD for classification. Bandwidth-based features from the intrinsic mode functions (IMFs) of EMD were fed to an LS-SVM in [23]. An RMS frequency feature extracted from IMFs and classified with an SVM was presented in [24]. The phase space representation utilized by Sharma et al. [25] for discriminating ictal EEG has also shown good results. Sparsity-based decomposition [26] and classification [27] have also been proposed recently.

In epilepsy, the commonly observed morphologies are spike trains and sharp waves. Sudden transient bursts of spikes and high-frequency oscillations in interictal recordings are also used to localize epileptic seizures. Both the disparity in background activity and EEG paroxysms complicate automated analysis. Artifacts in filtered data can give rise to false positives [28,29,30]. Recently, signal decomposition that focuses on morphological components has drawn attention because it is applicable to nonlinear and non-stationary signals [31,32,33]. The mixing of sources makes the EEG signal nonlinear and non-stationary, so separating the sources from the mixed signal becomes more difficult in the time or frequency domain. MCA uses a linear combination of coefficients, similar to independent component analysis (ICA). PCA and ICA [34] are popular methods for separating sources or removing artifacts. Both methods take a statistical approach and aim to find linear projections of the signals that are statistically independent [34]. Subspace projection is used to extract EEG components on a time/space basis. PCA reduces artifacts by specifying principal components (PCs) that reconstruct the overall data structure and by removing the components with small amplitudes and irregular changes. It is very difficult, however, to specify the remaining PCs that represent such a signal, and identifying the PCs requires prior knowledge of the artifacts [35]. In ICA, different estimation procedures such as mutual information minimization, maximization of non-Gaussianity, maximization of likelihood, SOBI, and FastICA are used for separation. Because ICA is based on a measure of statistical independence, it amplifies the noise of the input, and the Gaussian noise spread over the components in an undesired way makes the signal components difficult to detect [36]. ICA generates spikes and bumps if the sample size is not sufficient [37, 38]. Basic ICA is a multichannel source separation technique and does not work on a single channel, unlike MCA, which can work with a single channel [39]. Although MCA is well known in the image processing domain [40, 41], it has found few applications in biomedical signal processing, even after showing promising results in removing artifacts from EEG [39, 42, 43]. MCA identifies the components of the signal based on sparsity in the time-frequency domain. It decomposes the signal and then accurately reconstructs it using redundant transforms (mathematical functions) called explicit dictionaries. The combination of explicit dictionaries into an over-complete dictionary is important for representing the different morphologies of the EEG signal. Sparsity-based reconstruction of the EEG signal uses a minimal number of coefficients, which makes the signal easy to transfer over the Internet. Every method has advantages and disadvantages, and no single method has yet reached the stage of real-time analysis.

The objective of this work is to present an approach that considers the morphology of the EEG during an epileptic event for the diagnosis and detection of epileptic seizures. In this work, we have used MCA with the undecimated wavelet transform (UDWT), the local discrete cosine transform (LDCT), and Dirac bases composing the dictionary for decomposition. UDWT identifies the slow components in the EEG, LDCT identifies the spectral components, and Dirac identifies the spikes. The root mean instantaneous frequency square (RMIFS) and the ratio of its constituent parameters are computed from the Dirac component and given to the SVM as input for classification. RMIFS is defined as the square root of the sum of the time-averaged squared bandwidth $ {\sigma}_T^2 $ and the squared center frequency <ω>². These two parameters, $ {\sigma}_T^2 $ extracted from the Dirac component and <ω>² from the LDCT component, are also used as a second feature pair for classification. Both feature sets show considerably high accuracy and sensitivity, comparable with other existing works.

This paper is organized as follows. MCA is described first, followed by the Hilbert transform, the computation of the features, the SVM, and the dataset used. The simulation section then describes the physical relevance of the features, and the paper ends with the results and discussion in Section 9 and the conclusions in Section 10.

2 Method and material

In the subsequent sections, MCA is elaborated first, followed by the features computed from its output decomposition. The SVM and the data used in this work are then explained briefly.

3 Morphological component analysis

Morphological component analysis uses the concepts of sparsity and of independent redundant transforms to decompose an EEG signal by adapting to the prevailing types of morphology simultaneously. Representing the EEG as a sparse linear combination of coefficients, MCA uses an over-complete dictionary ɸ ∈ R^(n × k), where k denotes the morphological components of an EEG signal S ∈ Rⁿ. The signal is decomposed by constructing source components {∅_k}, k ∈ Γ, where Γ represents the types of explicit dictionary. The over-complete dictionary ɸ is a set of explicit dictionaries, each defined by a set of mathematical functions that represent a specific morphology of the EEG [44]. The signal can be represented as

$$ \begin{aligned} S &= \sum_{i=0}^{k} {\beta}_i {\varnothing}_i + \zeta \\ &= {\beta}_0 {\varnothing}_0 + {\beta}_1 {\varnothing}_1 + \dots + {\beta}_k {\varnothing}_k + \zeta \\ &\cong {s}_0 + {s}_1 + \dots + {s}_k \quad \left(\zeta \ll 1\right) \\ &= {S}^{\prime } \end{aligned} $$

(1)

where ∅_i represents a set of basis elements and β_i are the target coefficients used to reconstruct the original EEG signal. The noise ζ is assumed to be negligible, tending to zero. Using three dictionaries in this work, namely the undecimated wavelet transform (UDWT), the local discrete cosine transform (LDCT), and Dirac (Kronecker basis) [39, 45, 46], the coefficients are optimized as

$$ \left\{{\beta}_0^{opt},{\beta}_1^{opt},{\beta}_2^{opt}\right\}=\underset{\beta_0,\cdots, {\beta}_2}{\arg \min}\sum \limits_{i=0}^2\parallel {\beta}_i{\parallel}_0 $$

(2)

subject to $ {S}^{\prime }={\sum}_{i=0}^k{\beta}_i{\varnothing}_i,\mathrm{k}=2\ \mathrm{in}\ \mathrm{this}\ \mathrm{work}. $

The basis pursuit solution [47] was used to obtain the sparse coefficients of Eq. (1). It replaces the ℓ0 norm of Eq. (2) with the ℓ1 norm and adds a quadratic data-fidelity penalty weighted by λ:

$$ \left\{{\beta}_0^{opt},{\beta}_1^{opt},{\beta}_2^{opt}\right\}=\underset{\beta_0,\cdots, {\beta}_2}{\arg \min}\sum \limits_{i=0}^2\parallel {\beta}_i{\parallel}_1+\lambda \parallel S-\sum \limits_{i=0}^2{\varnothing}_i{\beta}_i{\parallel}_2^2 $$

(3)

Equation (3) is optimized by the block coordinate relaxation (BCR) method [48] in finite time, following the algorithm given in [39] (Algorithm 1).

The number of iterations is set to I_max = 100. Balbir et al. [39] varied the value of λ from 3 to 5 depending on the type of threshold (hard or soft). In this work, λ = 3 is used. Figure 1 depicts the working of MCA as described in Algorithm 1. Figures 2 and 3 show that the UDWT component captures the slow component of the EEG, whereas the LDCT component captures the spectral component. The Dirac basis captures the spike morphology of the EEG, including the negative spike train of the seizure (ictal) segment in Fig. 3.
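
The block coordinate idea behind Eq. (3) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the algorithm of [39]: it uses only two dictionaries (a global orthonormal DCT in place of the LDCT, and the Dirac basis), omits the UDWT, and uses a hypothetical linearly decreasing soft threshold. The names `dct_matrix`, `soft_threshold`, `mca_sketch`, and `thr_min` are introduced here for illustration.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II matrix; row k is the k-th cosine basis vector."""
    k = np.arange(n)[:, None]
    t = np.arange(n)[None, :]
    mat = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * t + 1) * k / (2 * n))
    mat[0, :] /= np.sqrt(2.0)
    return mat

def soft_threshold(x, thr):
    """Shrink every entry toward zero by thr (the l1 proximal step)."""
    return np.sign(x) * np.maximum(np.abs(x) - thr, 0.0)

def mca_sketch(signal, n_iter=100, thr_min=0.1):
    """Split `signal` into a DCT-sparse part and a Dirac-sparse part."""
    n = signal.size
    dct = dct_matrix(n)
    oscillatory = np.zeros(n)
    spikes = np.zeros(n)
    thr_max = max(np.abs(dct @ signal).max(), np.abs(signal).max())
    # Assumed schedule: the threshold decreases linearly over the iterations.
    for thr in np.linspace(thr_max, thr_min, n_iter):
        # Update one block while the other is held fixed (block coordinate step).
        coeffs = soft_threshold(dct @ (signal - spikes), thr)
        oscillatory = dct.T @ coeffs
        # The Dirac dictionary is the identity, so its coefficients are the samples.
        spikes = soft_threshold(signal - oscillatory, thr)
    return oscillatory, spikes
```

Because the last step soft-thresholds the residual, the reconstruction error of this sketch is bounded sample by sample by `thr_min`; a smaller final threshold gives a closer reconstruction at the cost of less sparse components.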

4 Hilbert transform over decompositions

The Hilbert transform was applied to the components produced by MCA. A real-valued component c(t) is represented in analytic form s(t) in the complex domain as

$$ s(t)=c(t)+j{c}_H(t), $$

(4)

The Hilbert transform of c(t) produces c_H(t), in which every frequency component of c(t) is shifted in phase by $ \frac{\pi }{2} $, with opposite signs for the positive and the negative frequencies. Because the spectrum of a real signal has Hermitian symmetry, its negative-frequency half is redundant; the analytic form discards it and keeps only the positive frequencies. Equation (4) can be represented in polar form as in [19]

$$ s(t)=a(t){e}^{j\varphi (t)} $$

(5)

Instantaneous phase φ(t) and amplitude a(t) can be given by

$$ \varphi (t)=\arctan \left[\frac{c_H(t)}{c(t)}\right], $$

(6)

$$ a(t)=\sqrt{c^2(t)+{c}_H^2(t)}. $$

(7)

The instantaneous frequency is defined as the derivative of the instantaneous phase, as in [49]

$$ \omega (t)={\varphi}^{\prime }(t). $$

(8)

A prime denotes differentiation with respect to time throughout this work.
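
Equations (4) to (8) can be sketched numerically as follows. This is a minimal illustration, assuming a uniformly sampled component and an FFT-based construction of the analytic signal (the negative-frequency bins are zeroed and the positive ones doubled); the function names are hypothetical. The sketch uses the four-quadrant angle in place of the plain arctangent of Eq. (6) and unwraps the phase before differentiating.

```python
import numpy as np

def analytic_signal(c):
    """Return s = c + j*c_H by suppressing the negative-frequency FFT bins."""
    n = c.size
    spectrum = np.fft.fft(c)
    gain = np.zeros(n)
    gain[0] = 1.0                      # keep the DC bin once
    if n % 2 == 0:
        gain[n // 2] = 1.0             # keep the Nyquist bin once
        gain[1:n // 2] = 2.0           # double the positive frequencies
    else:
        gain[1:(n + 1) // 2] = 2.0
    return np.fft.ifft(spectrum * gain)

def instantaneous_features(c, fs):
    """Instantaneous amplitude, phase (rad), and frequency (rad/s) of c."""
    s = analytic_signal(np.asarray(c, dtype=float))
    amplitude = np.abs(s)                 # Eq. (7)
    phase = np.unwrap(np.angle(s))        # Eq. (6), made continuous
    omega = np.gradient(phase) * fs       # Eq. (8): d(phase)/dt
    return amplitude, phase, omega
```

For a pure cosine that completes a whole number of cycles in the window, this sketch returns a constant amplitude and a constant instantaneous frequency equal to the angular frequency of the cosine.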

5 Computation of root mean instantaneous frequency square

The center frequency can be written as in Eq. (9a) [50]. Equation (9a) can be expressed in another way, in the time domain, by using the Hermitian frequency operator $ \left(\frac{1}{j}\frac{d}{dt}\right) $:

$$ <\omega >=\int \omega {\left|S\left(\omega \right)\right|}^2 d\omega, \kern0.5em $$

(9a)

$$ =\int {s}^{\ast }(t)\frac{1}{j}\frac{d}{dt}s(t) dt, $$

(9b)

$$ =\int \left({\varphi}^{\prime }(t)+\frac{1}{j}\frac{a^{\prime }(t)}{a(t)}\right){a}^2(t) dt, $$

(9c)

$$ <\omega >=\int {\varphi}^{\prime }(t){a}^2(t) dt, $$

(9d)

The imaginary part in Eq. (9c) is ignored because it must vanish for the center frequency to be real; s^∗(t) is the complex conjugate of the signal, and a²(t) is the density in time [51].

Therefore, center frequency, as in [50], can be given by

$$ <\omega >=\int {\varphi}^{\prime }(t){a}^2(t) dt, $$

(10)

S(ω) is the Fourier transform of the signal s(t):

$$ S\left(\omega \right)=\frac{1}{\sqrt{2\pi }}\int {e}^{- j\omega t}s(t) dt. $$

(11)

The amplitude is normalized so that, by Parseval’s theorem, ∫|S(ω)|²dω = ∫ |s(t)|²dt = 1. All integrals are computed over the time interval [0, 23.6] s, because the EEG signal segments [52] used in this work are 23.6 s long, as described in Section 7.

Following [51, 53], the time-averaged squared bandwidth $ {\sigma}_T^2 $, also known as bandwidth frequency modulation (BFM) [51], is defined in Eq. (12a), and expanding it gives Eq. (12b). Rearranging Eq. (12b) gives the root mean instantaneous frequency square (RMIFS) frequency f_R of Eq. (12e). The notation <·>_T denotes an average taken in the time domain.

$$ {\sigma}_T^2=\int {\left({\varphi}^{\prime }(t)-<\omega >\right)}^2{a}^2(t) dt, $$

(12a)

$$ {\sigma}_T^2=<{\varphi}^{\prime 2}(t)>-<\omega {>}^2, $$

(12b)

$$ {\sigma}_T^2=<{\omega}^2>-<\omega {>}^2, $$

(12c)

$$ <{\omega}^2>={\sigma}_T^2+<\omega {>}^2, $$

(12d)

$$ {f}_R=\sqrt{\sigma_T^2+<\omega {>}^2}. $$

(12e)
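
The step from Eq. (12a) to Eq. (12b) can be made explicit. The expansion below is a sketch that assumes the normalization ∫a²(t)dt = 1 and uses Eq. (10) for the center frequency:

$$ \begin{aligned} {\sigma}_T^2 &= \int {\left({\varphi}^{\prime }(t)-<\omega >\right)}^2{a}^2(t) dt \\ &= \int {\varphi}^{\prime 2}(t){a}^2(t) dt-2<\omega >\int {\varphi}^{\prime }(t){a}^2(t) dt+<\omega {>}^2\int {a}^2(t) dt \\ &= <{\varphi}^{\prime 2}(t)>-2<\omega {>}^2+<\omega {>}^2 \\ &= <{\varphi}^{\prime 2}(t)>-<\omega {>}^2. \end{aligned} $$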

$ {\sigma}_T^2 $ and <ω>² are the constituent parameters of f_R, as can be seen in Eq. (12d), and their ratio is given by

$$ {E}_{\mathrm{MIFS}}=\frac{<\omega {>}^2}{\sigma_T^2}. $$

(13)

The features extracted from the components are listed in Table 1 and shown in Fig. 4.
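
Equations (10), (12a), (12e), and (13) have a direct discrete counterpart. The sketch below is an illustration only: it assumes uniformly sampled instantaneous amplitude and frequency arrays, replaces the integrals with sums, and normalizes a²(t) so that the density sums to one (the sampling interval then cancels). The function name `rmifs_features` is hypothetical.

```python
import numpy as np

def rmifs_features(amplitude, omega):
    """Discrete center frequency, sigma_T^2, RMIFS f_R, and parameters ratio."""
    amplitude = np.asarray(amplitude, dtype=float)
    omega = np.asarray(omega, dtype=float)
    density = amplitude ** 2 / np.sum(amplitude ** 2)    # a^2(t), sums to 1
    center = np.sum(omega * density)                     # Eq. (10)
    sigma2 = np.sum((omega - center) ** 2 * density)     # Eq. (12a)
    f_r = np.sqrt(sigma2 + center ** 2)                  # Eq. (12e)
    # Eq. (13); the ratio is undefined for zero modulation, so return inf.
    ratio = center ** 2 / sigma2 if sigma2 > 0 else np.inf
    return center, sigma2, f_r, ratio
```

With a flat amplitude, the density is uniform and f_R reduces to the ordinary root mean square of the instantaneous frequency, which is the equivalence stated by Eq. (12d).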

Table 1 Features extracted from components

6 SVM

The support vector machine (SVM), introduced by Vapnik [54], is used as the classifier. An SVM discriminates two classes by constructing the hyperplane that maximizes the margin between them. The radial basis function (RBF) kernel is used in this work:

$$ G\left({x}_i,{x}_j\right)=\exp \left(-\frac{{\left\Vert {x}_i-{x}_j\right\Vert}^2}{2{\sigma}^2}\right), $$

(14)

where σ is a positive number.
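
As a small numerical illustration, the RBF kernel in its standard form, with a negative exponent so that the kernel decays with distance, can be evaluated as below. The function name and the default σ = 1 are assumptions made for this sketch and are not the settings used in the experiments.

```python
import numpy as np

def rbf_kernel(x_i, x_j, sigma=1.0):
    """Standard RBF kernel: exp(-||x_i - x_j||^2 / (2 * sigma^2))."""
    diff = np.asarray(x_i, dtype=float) - np.asarray(x_j, dtype=float)
    return float(np.exp(-np.dot(diff, diff) / (2.0 * sigma ** 2)))
```

The kernel equals 1 for identical feature vectors and decreases toward 0 as the vectors move apart, with σ controlling how quickly it decays.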

7 Dataset

The EEG dataset [52], commonly known as the Bonn dataset, was used to evaluate the method. The dataset comprises five subsets: F, N, O, S, and Z. Each subset consists of 100 signal segments, each of 23.6 s duration and containing 4097 samples recorded at a sampling frequency of 173.61 Hz. Subsets O and Z were recorded extracranially, with eyes open and with eyes closed, from healthy subjects having no previous seizure history. Subsets F, N, and S contain signal segments from intracranial recordings. Subsets F and N contain interictal recordings: subset N is from the epileptic zone, and F is from the hippocampal formation of the opposite hemisphere. Subset S contains ictal EEG recordings. In this work, six combinations of subsets are created for classification: F versus S, N versus S, O versus S, Z versus S, F and N together versus S, and O and Z together versus S.
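
The segment duration follows directly from the sample count and the sampling frequency quoted above. The short check below is illustrative arithmetic only; the variable names are introduced here.

```python
n_samples = 4097          # samples per segment
fs_hz = 173.61            # sampling frequency in Hz

duration_s = n_samples / fs_hz   # segment length in seconds
nyquist_hz = fs_hz / 2.0         # highest frequency representable at this rate
```

The duration evaluates to approximately 23.6 s, matching the stated segment length, and the Nyquist frequency of about 86.8 Hz bounds the frequencies that any feature computed from these segments can represent.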

Every subset yields 100 values of each feature, one per signal segment. These data are normalized using the mean and standard deviation. Training and test sets are prepared in a 70:30 ratio for the SVM. For the classification of one subset against another, 70 samples are picked randomly without replacement from each subset to create the training set, and the test set is made from the remaining 30 samples of each. For F, N versus S, 35 samples are taken randomly without replacement from each of the subsets F and N, and 70 random samples are picked without replacement from the S set to make the SVM training set. For the test set, 15 samples are picked randomly from the remaining 65 samples of each of the subsets F and N, and the 30 remaining samples are taken from the S set. Equal numbers of samples were picked from the interictal and ictal sets to avoid bias and overfitting. The same process was adopted for the classification of O, Z against S. The best kernel parameters were searched for using a grid search, in a manner similar to cross-validation. In most cases, however, the default kernel parameters gave good results, as presented in Table 2. We repeated this process 100 times, that is, we ran 100 trials, whereas Bajaj et al. [23] ran 10 trials.
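
The balanced sampling for the F, N versus S case can be sketched as follows. This is an illustration of the protocol described above, not the authors' code: the helper names, the seed, and the use of index permutations are assumptions, and the sketch takes 15 test samples from each of F and N so that the test set is also balanced against the 30 samples from S.

```python
import numpy as np

def split_indices(n_total, n_train, n_test, rng):
    """Draw disjoint train and test index sets without replacement."""
    perm = rng.permutation(n_total)
    return perm[:n_train], perm[n_train:n_train + n_test]

def fn_vs_s_split(seed=0):
    """One random trial of the F, N versus S split (100 segments per subset)."""
    rng = np.random.default_rng(seed)
    return {
        "F": split_indices(100, 35, 15, rng),   # interictal
        "N": split_indices(100, 35, 15, rng),   # interictal
        "S": split_indices(100, 70, 30, rng),   # ictal
    }
```

Calling `fn_vs_s_split` with 100 different seeds would correspond to the 100 random trials; each call gives 70 interictal and 70 ictal training samples and 30 of each for testing.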

Table 2 Classification results from 100 trials on all six combinations of subsets

8 Simulation

In the MCA methodology, sparsity plays a vital role in separating components that have different time-frequency properties or morphologies when constructing the individual source components. The combination of explicit dictionaries forming an over-complete dictionary makes MCA a more powerful method for denoising and source component separation [39]. Most decomposition-based methods, such as PCA and ICA, require prior knowledge about the decomposed components. MCA-based decomposition has an advantage in the accurate reconstruction of the original components because the source components have a low probability of occurring simultaneously. The method relies on sparsity and on the over-completeness of the dictionary ɸ ∈ R^(n × k), a set of k redundant transforms that represent the specific morphologies of the different components of the signal. Through sparsity and over-completeness, the dictionary extends traditional signal decomposition to the extraction of features of multiple types of morphology simultaneously. An EEG signal contains specific morphologies that depend on the activity in the brain. A given morphology in the EEG time course can therefore be decomposed by one explicit dictionary and not by the other explicit dictionaries. MCA estimates the components accurately because the decomposed components are sparse and independent. The signal S is a linear combination of different brain activities, where β is the brain activity and ɸ is the mixing matrix. Different basis functions were tried in different combinations to create the epilepsy-specific dictionary from a set of UDWT, discrete sine transform (DST), discrete cosine transform (DCT), LDCT, and Dirac basis functions; finally, UDWT, LDCT, and Dirac were selected because the proposed features extracted with them showed significant differences. UDWT has not been used directly for feature extraction but has been kept in the dictionary to keep the LDCT spectral component free from slow-moving components. The Dirac basis was used to capture the spike morphology of the epileptic seizure. The Dirac basis is also useful for capturing the transient spikes in interictal recordings, which can help in localizing the epileptic zone.

After MCA decomposition, the Hilbert transform was applied to the components, which takes the real-valued signal decomposition to the complex time-frequency domain. A real signal has a symmetric density in frequency, which makes its mean or center frequency zero. With the analytic representation, the spectrum is identical for positive frequencies and zero for negative frequencies [50]. The feature RMIFS (f_R) and the parameters ratio ($ \frac{<\omega {>}^2}{\sigma_T^2} $) are computed from the Dirac component. RMIFS is defined as the square root of the sum of the time-averaged squared bandwidth $ {\sigma}_T^2 $ and the squared center frequency <ω>², so $ {f}_R^2 $ is always greater than <ω>² by $ {\sigma}_T^2 $. This feature is expressed completely in terms of the frequency modulation $ {\sigma}_T^2 $ and the squared average (center) frequency <ω>² in the time domain, which is advantageous because it is free from any amplitude-based component that is prone to noise. Computing f_R directly as $ \sqrt{<{\varphi}^{\prime 2}(t)>} $ or as $ \sqrt{\sigma_T^2+<\omega {>}^2} $ gives the same value. The parameters ratio ($ \frac{<\omega {>}^2}{\sigma_T^2} $) shows how dominant the squared center frequency is over the time-averaged squared bandwidth. For example, Fig. 4c shows that for interictal segments $ {\sigma}_T^2 $ lies in a higher range, which puts the parameters ratio in a lower range, whereas for ictal segments the behavior is the opposite; that is, during the ictal event the frequency modulation is small compared with interictal, as also observed by Bajaj et al. [23]. The frequency modulation $ {\sigma}_T^2 $ from the Dirac component is in a higher range in nonseizure and interictal segments than in ictal segments, and the center frequency from the LDCT component is highest in nonseizure segments, lower in ictal segments, and lowest in interictal segments. Because the time-averaged bandwidth can be taken as the standard deviation of the instantaneous frequency around the center frequency, and the center frequency as the mean, f_R satisfies the definition of a root mean square. The value of f_R will be close to the center frequency when the instantaneous frequencies are close to the center frequency, leading to a small $ {\sigma}_T^2 $. That is, a signal decomposition that shows small frequency modulation during an epileptic event will result in an $ {f}_R^2 $ dominated by the squared center frequency <ω>². The Dirac component was chosen for computing the RMIFS frequency because, firstly, it represents the spike morphology of the EEG and, secondly, it shows a more significant difference than f_R computed from the LDCT component. For classification using <ω>² and $ {\sigma}_T^2 $, <ω>² is computed from the LDCT component and $ {\sigma}_T^2 $ from the Dirac component, because LDCT represents the frequency content better, whereas Dirac shows the modulation better.
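
The remark above that a real signal has zero center frequency, while its analytic form does not, can be checked numerically. The sketch below is an illustration under simple assumptions: a 10 Hz test tone, a hypothetical helper `spectral_center_hz`, and the center frequency computed as the power-weighted mean over the two-sided FFT spectrum.

```python
import numpy as np

def spectral_center_hz(x, fs):
    """Power-weighted mean frequency over the full two-sided spectrum."""
    spectrum = np.fft.fft(x)
    freqs = np.fft.fftfreq(len(x), d=1.0 / fs)
    density = np.abs(spectrum) ** 2
    return float(np.sum(freqs * density) / np.sum(density))

fs = 200.0
t = np.arange(400) / fs
real_tone = np.cos(2 * np.pi * 10.0 * t)          # symmetric spectrum
analytic_tone = np.exp(1j * 2 * np.pi * 10.0 * t)  # positive frequencies only

center_real = spectral_center_hz(real_tone, fs)
center_analytic = spectral_center_hz(analytic_tone, fs)
```

For the real tone the contributions at +10 Hz and -10 Hz cancel, so the center frequency is zero up to rounding; for the analytic tone only the +10 Hz component remains, so the center frequency is 10 Hz. This is why the features are computed from the analytic representation.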

The squared center frequency <ω>² calculated from the LDCT component lies in the range from the upper delta band to the lower alpha band in the interictal sets F and N, whereas in the healthy nonseizure sets O and Z it lies between the upper theta band and the alpha band. The center frequency in the ictal set S was dispersed between the lower theta band and the lower beta band. On average, the RMIFS f_R is in the beta range for all the subsets of the Bonn dataset, as presented in Fig. 4.

9 Results and discussion

These features are normalized using the mean and standard deviation and then fed to the SVM as two separate pairs to show their significance in the classification of seizures. These pairs of features were selected because the features within each pair show opposite behavior, which helps the SVM create the hyperplane that discriminates the classes. The performance of the SVM classifier is evaluated using the statistical parameters from previous works, i.e., specificity (SPE), sensitivity (SEN), and accuracy (Acc) [4, 23].

$$ \mathrm{SPE}=\frac{\mathrm{TN}}{\mathrm{TN}+\mathrm{FP}}\times 100, $$

(15)

$$ \mathrm{SEN}=\frac{\mathrm{TP}}{\mathrm{TP}+\mathrm{FN}}\times 100, $$

(16)

$$ \mathrm{Acc}=\frac{\mathrm{TP}+\mathrm{TN}}{\mathrm{TP}+\mathrm{TN}+\mathrm{FP}+\mathrm{FN}}\times 100, $$

(17)

where TP and TN denote true positive and true negative events, i.e., ictal and interictal segments detected correctly, and FN and FP stand for false negative and false positive events, respectively.
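
Equations (15) to (17) translate directly into code. The sketch below is illustrative; the function name is hypothetical, and the confusion counts in the usage note are made-up numbers for a 30-versus-30 test set, not results from this work.

```python
def classification_metrics(tp, tn, fp, fn):
    """Specificity, sensitivity, and accuracy in percent, per Eqs. (15)-(17)."""
    spe = 100.0 * tn / (tn + fp)
    sen = 100.0 * tp / (tp + fn)
    acc = 100.0 * (tp + tn) / (tp + tn + fp + fn)
    return spe, sen, acc
```

For example, with hypothetical counts of 28 true positives, 2 false negatives, 30 true negatives, and 0 false positives, the function gives a specificity of 100%, a sensitivity of about 93.33%, and an accuracy of about 96.67%.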

Classification of set F versus set S using the two pairs of features, i.e., f_R, ($ \frac{<\omega {>}^2}{\sigma_T^2} $) and $ {\sigma}_T^2 $, <ω>², gives similar results: average accuracies of 96.48 and 97.13% and average sensitivities of 93.53 and 94.26%. The average specificity with both feature pairs is very high, at 99.43 and 100.0%. The results are shown in Table 2.

The classification accuracy of both pairs of features for set N against S is good, at 99.41 and 99.48%. Average sensitivity and specificity are 99.46, 99.90, 99.36, and 99.66%. For set O versus S, features f_R, ($ \frac{<\omega {>}^2}{\sigma_T^2} $) show an average accuracy of 99.91%, but $ {\sigma}_T^2 $, <ω>² achieve a lower 87.98%. A similar result is observed for set Z versus S, with 99.63% using f_R, ($ \frac{<\omega {>}^2}{\sigma_T^2} $) and 90.30% using $ {\sigma}_T^2 $, <ω>². SVM plots for set N versus S using $ {\sigma}_T^2 $, <ω>² and for set O against S using f_R, ($ \frac{<\omega {>}^2}{\sigma_T^2} $) are shown in Figs. 5 and 6. The average accuracies for sets F, N together versus set S are 93.05 and 93.61% for the two feature pairs, whereas the average classification accuracies for sets O, Z versus S are 99.11 and 90.60%. We have compared the proposed work with previous work in Tables 3 and 4.

Table 3 Comparison of set F, N vs S results with other existing works on Bonn dataset

Table 4 Comparison with other works on Bonn dataset for classification between healthy nonseizure set O, Z and seizure or ictal set S

In most cases the default kernel parameters performed well. Even the optimized parameters found with the grid search were close to the default setting and gave an improvement of at most 1–1.5%. Therefore, in the cases where the improvement was less than 1%, the default kernel parameters were used, which avoids the computational overhead of the kernel parameter search and makes the application more practical. Both pairs of features show similar classification results for the interictal sets versus the ictal (seizure) set, whereas the feature pair f_R, ($ \frac{<\omega {>}^2}{\sigma_T^2} $) shows better results for classification of the healthy nonseizure sets against the seizure set. Therefore, the f_R, ($ \frac{<\omega {>}^2}{\sigma_T^2} $) feature combination is found to be better for classification than $ {\sigma}_T^2 $, <ω>². Figure 4 shows that, with the feature combination $ {\sigma}_T^2 $, <ω>², it is hard to obtain a two-dimensional map in which the SVM can create a hyperplane separating the nonseizure and seizure sets, as they are quite intermingled. Although it is inappropriate to classify the seizure set, which comes from intracranial recordings, against the noninvasive extracranial healthy nonseizure EEG sets, this classification has been done to compare the proposed method with previous works. A detailed comparison of the proposed work with previous work on the Bonn dataset is presented in Tables 3 and 4.

10 Conclusions

MCA gives a definite number of decomposed components, determined by the number of basis sets used in the over-complete dictionary. This dictionary can be formed according to the requirements of the problem, and the selection of basis functions in the dictionary plays an important role in creating a problem-specific application. We found that the LDCT component is best suited for spectral feature extraction, whereas the Dirac bases are good at showing the spike morphology of the EEG. The default setting of the SVM kernel is suitable for the proposed feature combinations, which makes the method suitable for practical application. To make the method reliable, 100 random trials were run on the SVM. The highest average accuracy of 99.78% was observed for the classification of interictal set N against ictal set S, whereas an average accuracy of 99.91% was observed for the classification of nonseizure set O against ictal set S. In future, we will try to form a dictionary to remove different artifacts from EEG and will try to create a seizure prediction system for long-term EEG signals using MCA and the proposed features with a suitable basis.

Abbreviations

Acc: Accuracy
ApEn: Approximate entropy
BCR: Block coordinate relaxation
BFM: Bandwidth frequency modulation
CEEMDAN: Complete ensemble empirical mode decomposition with adaptive noise
CFS: Center frequency square
DCT: Discrete cosine transform
DST: Discrete sine transform
DTCWT: Dual tree complex wavelet
EEG: Electroencephalogram
EMD: Empirical mode decomposition
FFT: Fast Fourier transform
FLP: Fractional linear prediction
FN: False negative
FP: False positive
ICA: Independent component analysis
IMF: Intrinsic mode function
KNN: K nearest neighbor
L.P Filter: Linear prediction filter
LDCT: Local discrete cosine transform
LS-SVM: Least square support vector machine
MCA: Morphological component analysis
MLP: Multilayer perceptron
NN: Nearest neighbor
PC: Principal component
PCA: Principal component analysis
PSR: Phase space representation
RBF: Radial basis function
RDSTFT: Rational discrete STFT
RMIFS: Root mean instantaneous frequency square
SEN: Sensitivity
SPE: Specificity
STFT: Short-time Fourier transform
SVM: Support vector machine
TN: True negative
TP: True positive
UDWT: Undecimated wavelet transform

References

RS Fisher, WE Boas, W Blume, C Elger, P Genton, P Lee, J Engel, Epileptic seizures and epilepsy: definitions proposed by the international league against epilepsy (ilae) and the international bureau for epilepsy (ibe). Epilepsia 46(4), 470–472 (2005)
Article Google Scholar
AT Berg, SF Berkovic, MJ Brodie, J Buchhalter, JH Cross, WE Boas, J Engel, J French, TA Glauser, GW Mathern, et al., Revised terminology and concepts for organization of seizures and epilepsies: report of the ILAE commission on classification and terminology, 2005-2009. Epilepsia 51(4), 676–685 (2010)
Article Google Scholar
S Ramgopal, S Thome-Souza, M Jackson, NE Kadish, IS Fernandez, J Klehm, W Bosl, C Reinsberger, S Schachter, T Loddenkemper, Seizure detection, seizure prediction, and closed-loop warning systems in epilepsy. Epilepsy Behav. 37, 291–307 (2014)
Article Google Scholar
SF Liang, HC Wang, WL Chang, Combination of EEG complexity and spectral analysis for epilepsy diagnosis and seizure detection. EURASIP J. Adv. Signal Process. 2010(1), 853434 (2010)
Article Google Scholar
V Srinivasan, C Eswaran, N Sriraam, Artificial neural network based epileptic detection using time-domain and frequency-domain features. J. Med. Syst. 29(6), 647–660 (2005)
Article Google Scholar
K Polat, S Günes, Classification of epileptiform EEG using a hybrid system based on decision tree classifier and fast Fourier transform. Appl. Math. Comput. 187(2), 1017–1026 (2007)
MathSciNet MATH Google Scholar
AT Tzallas, MG Tsipouras, DI Fotiadis, Epileptic seizure detection in EEGs using time-frequency analysis. IEEE Trans. Inf. Technol. Biomed. 13(5), 703–710 (2009)
H Adeli, Z Zhou, N Dadmehr, Analysis of EEG records in an epileptic patient using wavelet transform. J. Neurosci. Methods 123(1), 69–87 (2003)
H Ocak, Optimal classification of epileptic seizures in EEG using wavelet analysis and genetic algorithm. Signal Process. 88(7), 1858–1867 (2008)
L Guo, D Rivero, J Dorado, CR Munteanu, A Pazos, Automatic feature extraction using genetic programming: an application to epileptic EEG classification. Expert Syst. Appl. 38(8), 10425–10436 (2011)
G Chen, Automatic EEG seizure detection using dual-tree complex wavelet-Fourier features. Expert Syst. Appl. 41(5), 2391–2394 (2014)
WC Stacey, B Litt, Technology insight: neuroengineering and epilepsy-designing devices for seizure control. Nat. Clin. Pract. Neurol. 4(4), 190–201 (2008)
Y Li, PP Wen, et al., Clustering technique-based least square support vector machine for EEG signal classification. Comput. Methods Prog. Biomed. 104(3), 358–372 (2011)
Z Iscan, Z Dokur, T Demiralp, Classification of electroencephalogram signals with combined time and frequency features. Expert Syst. Appl. 38(8), 10499–10505 (2011)
Y Tang, D Durand, A tunable support vector machine assembly classifier for epileptic seizure detection. Expert Syst. Appl. 39(4), 3925–3938 (2012)
N Nicolaou, J Georgiou, Detection of epileptic electroencephalogram based on permutation entropy and support vector machines. Expert Syst. Appl. 39(1), 202–209 (2012)
G Zhu, Y Li, PP Wen, Epileptic seizure detection in EEGs signals using a fast weighted horizontal visibility algorithm. Comput. Methods Prog. Biomed. 115(2), 64–75 (2014)
A Temko, E Thomas, W Marnane, G Lightbody, G Boylan, EEG-based neonatal seizure detection with support vector machines. Clin. Neurophysiol. 122(3), 464–473 (2011)
NE Huang, Z Shen, SR Long, MC Wu, HH Shih, Q Zheng, NC Yen, CC Tung, HH Liu, in Proceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences, the Royal Society. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis, vol 454 (1998), pp. 903–995
RJ Oweis, EW Abdulhay, Seizure classification in EEG signals utilizing Hilbert-Huang transform. Biomed. Eng. Online 10(1), 38 (2011)
F Riaz, A Hassan, S Rehman, IK Niazi, K Dremstrup, EMD-based temporal and spectral features for the classification of EEG signals using supervised learning. IEEE Trans. Neural Syst. Rehabil. Eng. 24(1), 28–35 (2016)
K Fu, J Qu, Y Chai, Y Dong, Classification of seizure based on the time-frequency image of EEG signals using HHT and SVM. Biomed. Signal Process. Control 13, 15–22 (2014)
V Bajaj, RB Pachori, Classification of seizure and nonseizure EEG signals using empirical mode decomposition. IEEE Trans. Inf. Technol. Biomed. 16(6), 1135–1142 (2012)
AG Mahapatra, K Horio, in Systems, Man, and Cybernetics (SMC), 2016 IEEE International Conference on, IEEE. Overcoming drawback of feature instantaneous bandwidth using EMD for epileptic seizure classification by RMS frequency (2016), pp. 001322–001327
R Sharma, RB Pachori, Classification of epileptic seizures in EEG signals based on phase space representation of intrinsic mode functions. Expert Syst. Appl. 42(3), 1106–1117 (2015)
K Samiee, P Kovacs, M Gabbouj, Epileptic seizure detection in long-term EEG records using sparse rational decomposition and local Gabor binary patterns feature extraction. Knowl.-Based Syst. 118, 228–240 (2017)
J Spilka, J Frecon, R Leonarduzzi, N Pustelnik, P Abry, M Doret, Sparse support vector machine for intrapartum fetal heart rate classification. IEEE J. Biomed. Health Inform. 21(3), 664–671 (2017)
IJ Rampil, A primer for EEG signal processing in anesthesia. Anesthesiology 89(4), 980–1002 (1998)
B Crepon, V Navarro, D Hasboun, S Clemenceau, J Martinerie, M Baulac, C Adam, M Le Van Quyen, Mapping interictal oscillations greater than 200 Hz recorded with intracranial macroelectrodes in human epilepsy. Brain 133(1), 33–45 (2009)
HS Liu, T Zhang, FS Yang, A multistage, multimethod approach for automatic detection and classification of epileptiform EEG. IEEE Trans. Biomed. Eng. 49(12), 1557–1566 (2002)
KJ Blinowska, PJ Durka, Unbiased high resolution method of EEG analysis in time-frequency space. Acta Neurobiol. Exp. 61(3), 157–174 (2001)
E Imani, HR Pourreza, T Banaee, Fully automated diabetic retinopathy screening using morphological component analysis. Comput. Med. Imaging Graph. 43, 78–88 (2015)
E Imani, M Javidi, HR Pourreza, Improvement of retinal blood vessel detection using morphological component analysis. Comput. Methods Prog. Biomed. 118(3), 263–279 (2015)
A Hyvärinen, E Oja, Independent component analysis: algorithms and applications. Neural Netw. 13(4–5), 411–430 (2000)
P Berg, M Scherg, A multiple source approach to the correction of eye artifacts. Electroencephalogr. Clin. Neurophysiol. 90(3), 229–241 (1994)
GL Wallstrom, RE Kass, A Miller, JF Cohn, NA Fox, Automatic correction of ocular artifacts in the EEG: a comparison of regression-based and component-based methods. Int. J. Psychophysiol. 53(2), 105–119 (2004)
A Hyvärinen, J Särelä, R Vigario, in Proc. Int. Workshop on Independent Component Analysis and Signal Separation (ICA'99). Spikes and bumps: Artefacts generated by independent component analysis with insufficient sample size (1999), pp. 425–429
J Särelä, R Vigario, Overlearning in marginal distribution based ICA: analysis and solutions. J. Mach. Learn. Res. 4(Dec), 1447–1469 (2003)
B Singh, H Wagatsuma, A removal of eye movement and blink artifacts from EEG data using morphological component analysis. Comput. Math. Methods Med. 2017, 1861645 (2017)
Y Jiang, M Wang, Image fusion with morphological component analysis. Inf. Fusion 18, 107–118 (2014)
M Dalla Mura, A Villa, JA Benediktsson, J Chanussot, L Bruzzone, Classification of hyperspectral images by using extended morphological attribute profiles and independent component analysis. IEEE Geosci. Remote Sens. Lett. 8(3), 542–546 (2011)
X Yong, RK Ward, GE Birch, in Neural Engineering, 2009. NER'09. 4th International IEEE/EMBS Conference on, IEEE. Generalized morphological component analysis for EEG source separation and artifact removal (2009), pp. 343–346
JW Matiko, S Beeby, J Tudor, in Engineering in Medicine and Biology Society (EMBC), 2013 35th Annual International Conference of the IEEE, IEEE. Real time eye blink noise removal from EEG signals using morphological component analysis (2013), pp. 13–16
SS Chen, DL Donoho, MA Saunders, Atomic decomposition by basis pursuit. SIAM Rev. 43(1), 129–159 (2001)
M Püschel, JM Moura, The algebraic approach to the discrete cosine and sine transforms and their fast algorithms. SIAM J. Comput. 32(5), 1280–1316 (2003)
X Shao, SG Johnson, Type-IV DCT, DST, and MDCT algorithms with reduced numbers of arithmetic operations. Signal Process. 88(6), 1313–1326 (2008)
JL Starck, Y Moudden, J Bobin, M Elad, DL Donoho, Morphological component analysis. Proc. SPIE 5914, 1–15 (2005)
S Sardy, A Bruce, P Tseng, Block coordinate relaxation methods for nonparametric signal denoising with wavelet dictionaries (1998)
PJ Loughlin, B Tacer, Comments on the interpretation of instantaneous frequency. IEEE Signal Process Lett. 4(5), 123–125 (1997)
L Cohen, Time-frequency analysis (Prentice Hall PTR, Englewood Cliffs, 1995)
L Cohen, C Lee, in Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on, IEEE. Instantaneous bandwidth for signals and spectrogram (1990), pp. 2451–2454
RG Andrzejak, K Lehnertz, F Mormann, C Rieke, P David, CE Elger, Indications of nonlinear deterministic and finite-dimensional structures in time series of brain electrical activity: dependence on recording region and brain state. Phys. Rev. E 64(6), 061907 (2001)
S Tolwinski, The Hilbert Transform and Empirical Mode Decomposition as Tools for Data Analysis (University of Arizona, Tucson, 2007)
V Vapnik, The nature of statistical learning theory (Springer Science & Business Media, 2013)
K Samiee, P Kovacs, M Gabbouj, Epileptic seizure classification of EEG time-series using rational discrete short-time Fourier transform. IEEE Trans. Biomed. Eng. 62(2), 541–552 (2015)
AR Hassan, A Subasi, Automatic identification of epileptic seizures from EEG signals using linear programming boosting. Comput. Methods Prog. Biomed. 136, 65–77 (2016)
S Altunay, Z Telatar, O Erogul, Epileptic EEG detection using the linear prediction error energy. Expert Syst. Appl. 37(8), 5661–5665 (2010)
V Joshi, RB Pachori, A Vijesh, Classification of ictal and seizure-free EEG signals using fractional linear prediction. Biomed. Signal Process. Control 9, 1–5 (2014)
RB Pachori, S Patidar, Epileptic seizure classification in EEG signals using second-order difference plot of intrinsic mode functions. Comput. Methods Prog. Biomed. 113(2), 494–502 (2014)
J Martinez-del Rincon, MJ Santofimia, X del Toro, J Barba, F Romero, P Navas, JC Lopez, Non-linear classifiers applied to EEG analysis for epilepsy seizure detection. Expert Syst. Appl. 86, 99 (2017)

Availability of data and materials

The EEG dataset [52] used in this work is available at http://www.meb.unibonn.de/epileptologie/science/physik/EEGdata.html

Author information

Authors and Affiliations

Graduate School of Life Science and Systems Engineering, Kyushu Institute of Technology, Kitakyushu, Japan
Arindam Gajendra Mahapatra, Balbir Singh, Hiroaki Wagatsuma & Keiichi Horio
National Institute for Physiological Sciences, Okazaki, Japan
Balbir Singh
Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology, Tokyo, Japan
Hiroaki Wagatsuma
Riken BSI, Wako, Japan
Hiroaki Wagatsuma

Contributions

The MCA code developed by Dr. BS under the supervision of Dr. HW was used in this work. AGM, under the supervision of Dr. KH, selected the basis functions that form the dictionary for the epilepsy application, proposed and extracted the features, and carried out the classification and the analysis of results. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Arindam Gajendra Mahapatra.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

About this article

Cite this article

Mahapatra, A.G., Singh, B., Wagatsuma, H. et al. Epilepsy EEG classification using morphological component analysis. EURASIP J. Adv. Signal Process. 2018, 52 (2018). https://doi.org/10.1186/s13634-018-0568-2

Received: 18 January 2018
Accepted: 18 June 2018
Published: 08 August 2018
DOI: https://doi.org/10.1186/s13634-018-0568-2
