Existing epileptic seizure automatic detection systems are often troubled by high-dimensional electroencephalogram (EEG) features. High-dimensional features will not only bring redundant information and noise, but also reduce the response speed of the system. In order to solve this problem, supervised locality preserving canonical correlation analysis (SLPCCA), which can effectively use both sample category information and nonlinear relationships between features, is introduced. And an epileptic signal classification method based on SLPCCA is proposed. Firstly, the power spectral density and the fluctuation index of the frequency slice wavelet transform are extracted as features from the EEG fragments. Next, SLPCCA obtains the optimal projection direction by maximizing the weight correlation between the paired samples in the class and their neighbors. And the projection combination of original features in the optimal direction is the fusion feature. The fusion features are then input into LS-SVM for training and testing. This method is verified on the Bonn dataset and the CHB-MIT dataset and gets good results. On various classification tasks of Bonn data set, the proposed method achieves an average classification accuracy of 99.16%. On the binary classification task of the inter-seizure and seizure epileptic EEG of the CHB-MIT dataset, the proposed method achieves an average accuracy of 97.18%. The experimental results show that the algorithm achieves excellent results compared with several state-of-the-art methods. In addition, the parameter sensitivity of SLPCCA and the relationship between the dimension of the fusion features and the classification results are discussed. Therefore, the stability and effectiveness of the method are further verified.
Citation: Hongming Liu, Yunyuan Gao, Jianhai Zhang, Juanjuan Zhang. Epilepsy EEG classification method based on supervised locality preserving canonical correlation analysis[J]. Mathematical Biosciences and Engineering, 2022, 19(1): 624-642. doi: 10.3934/mbe.2022028
Related Papers:
[1]
Ning Huang, Zhengtao Xi, Yingying Jiao, Yudong Zhang, Zhuqing Jiao, Xiaona Li .
Multi-modal feature fusion with multi-head self-attention for epileptic EEG signals. Mathematical Biosciences and Engineering, 2024, 21(8): 6918-6935.
doi: 10.3934/mbe.2024304
[2]
N Arunkumar, B Nagaraj, M Ruth Keziah .
EpilepIndex: A novel feature engineering tool to detect epilepsy using EEG signals. Mathematical Biosciences and Engineering, 2023, 20(12): 21670-21691.
doi: 10.3934/mbe.2023959
[3]
Dae Hyeon Kim, Jin-Oh Park, Dae-Young Lee, Young-Seok Choi .
Multiscale distribution entropy analysis of short epileptic EEG signals. Mathematical Biosciences and Engineering, 2024, 21(4): 5556-5576.
doi: 10.3934/mbe.2024245
[4]
Sakorn Mekruksavanich, Wikanda Phaphan, Anuchit Jitpattanakul .
Epileptic seizure detection in EEG signals via an enhanced hybrid CNN with an integrated attention mechanism. Mathematical Biosciences and Engineering, 2025, 22(1): 73-105.
doi: 10.3934/mbe.2025004
[5]
Bei Liu, Hongzi Bai, Wei Chen, Huaquan Chen, Zhen Zhang .
Automatic detection method of epileptic seizures based on IRCMDE and PSO-SVM. Mathematical Biosciences and Engineering, 2023, 20(5): 9349-9363.
doi: 10.3934/mbe.2023410
[6]
Jiacan Xu, Donglin Li, Peng Zhou, Chunsheng Li, Zinan Wang, Shenghao Tong .
A multi-band centroid contrastive reconstruction fusion network for motor imagery electroencephalogram signal decoding. Mathematical Biosciences and Engineering, 2023, 20(12): 20624-20647.
doi: 10.3934/mbe.2023912
[7]
Xiaowen Jia, Jingxia Chen, Kexin Liu, Qian Wang, Jialing He .
Multimodal depression detection based on an attention graph convolution and transformer. Mathematical Biosciences and Engineering, 2025, 22(3): 652-676.
doi: 10.3934/mbe.2025024
[8]
Yin Zhang, Xintong Wu, Jingwen Sun, Kecen Yue, Shuangshuang Lu, Bingjian Wang, Wenjia Liu, Haifeng Shi, Ling Zou .
Exploring changes in brain function in IBD patients using SPCCA: a study of simultaneous EEG-fMRI. Mathematical Biosciences and Engineering, 2024, 21(2): 2646-2670.
doi: 10.3934/mbe.2024117
[9]
Su-na Zhao, Yingxue Cui, Yan He, Zhendong He, Zhihua Diao, Fang Peng, Chao Cheng .
Teleoperation control of a wheeled mobile robot based on Brain-machine Interface. Mathematical Biosciences and Engineering, 2023, 20(2): 3638-3660.
doi: 10.3934/mbe.2023170
[10]
Xu Yin, Ming Meng, Qingshan She, Yunyuan Gao, Zhizeng Luo .
Optimal channel-based sparse time-frequency blocks common spatial pattern feature extraction method for motor imagery classification. Mathematical Biosciences and Engineering, 2021, 18(4): 4247-4263.
doi: 10.3934/mbe.2021213
Abstract
Existing epileptic seizure automatic detection systems are often troubled by high-dimensional electroencephalogram (EEG) features. High-dimensional features will not only bring redundant information and noise, but also reduce the response speed of the system. In order to solve this problem, supervised locality preserving canonical correlation analysis (SLPCCA), which can effectively use both sample category information and nonlinear relationships between features, is introduced. And an epileptic signal classification method based on SLPCCA is proposed. Firstly, the power spectral density and the fluctuation index of the frequency slice wavelet transform are extracted as features from the EEG fragments. Next, SLPCCA obtains the optimal projection direction by maximizing the weight correlation between the paired samples in the class and their neighbors. And the projection combination of original features in the optimal direction is the fusion feature. The fusion features are then input into LS-SVM for training and testing. This method is verified on the Bonn dataset and the CHB-MIT dataset and gets good results. On various classification tasks of Bonn data set, the proposed method achieves an average classification accuracy of 99.16%. On the binary classification task of the inter-seizure and seizure epileptic EEG of the CHB-MIT dataset, the proposed method achieves an average accuracy of 97.18%. The experimental results show that the algorithm achieves excellent results compared with several state-of-the-art methods. In addition, the parameter sensitivity of SLPCCA and the relationship between the dimension of the fusion features and the classification results are discussed. Therefore, the stability and effectiveness of the method are further verified.
1.
Introduction
Epilepsy is a brain disease with strong suddenness, recurrence and involuntary nature [1]. According to statistics from the World Health Organization, nearly 50 million people worldwide suffer from epilepsy [2]. At present, the clinical diagnosis of epilepsy mainly relies on medical history data and brain examinations, such as visual examination of long-term electroencephalogram(EEG) by experienced doctors. However, this method is time-consuming and the results are subjective. To solve this dilemma, a large number of methods have been proposed to identify epileptic EEG signals through signal processing and machine learning. These methods can reduce the burden of doctors and improve the diagnostic accuracy.
Due to the obvious nonlinearity and nonstationarity of epileptic EEG signals, researchers use multi-channel and high sampling rate EEG acquisition equipment to obtain EEG signals of subjects. In this way, they can ensure the highest spatial and temporal resolution. However, this method directly causes the feature dimension of the multi-channel EEG signal to be too high. On the one hand, information redundancy and noise are increased, which brings interference to accurate identification. On the other hand, the signal processing time is prolonged and the diagnosis efficiency is reduced.
In view of the possible dimension disasters in the feature extraction of multi-channel EEG signals, many feature dimension reduction algorithms are proposed. M. Yildiz et al. [3] used principal component analysis (PCA) to reduce dimensionality, obtaining the best 8-dimensional features and improving the classification accuracy by 9%. A. Matin et al. [4] combined PCA with two independent component analysis (ICA) algorithms and achieved good classification results in a variety of tasks. It have been found that manifold learning can improve the classification effect while reducing the dimension of EEG features. Yang et al. [5] proposed a feature dimension reduction algorithm based on local preserving projection and achieved a classification accuracy of more than 97% in a variety of classification tasks. J.Birjandtalab et al. [6] adopted a non-linear data embedding technique based on random nearest neighbor distance metric and achieved an F-measure of more than 87%. Hou et al. [7] used local linear embedding and random forest algorithms to achieve an average classification accuracy of 95%.
Although there have been many dimension reduction algorithms for a single feature, the relationships among features are not well considered. It may cause redundant information between different features, thereby increasing unnecessary feature dimensions. In order to reduce the feature dimension and the information redundancy between features, SLPCCA is applied to the classification of epilepsy EEG since it can effectively use both sample category information and nonlinear relationships between features. SLPCCA is used to convert the original high-dimensional multi-channel EEG features into low-dimensional fusion features, then the fusion features are input into the classifier. Through experiments on Bonn data set and CHB-MIT data set, it can be proved that SLPCCA can effectively reduce the multi-channel EEG feature dimension. At the same time, the redundant information and irrelevant components in the original features are removed, which improves the classification accuracy and helps to reduce the diagnosis time.
The layout of the paper is as follows. In Section 2, two original features in the experiment are introduced. In Section 3, theories and characteristics of SLPCCA is introduced. The data sets, classifiers and classification evaluation indicators used in the experiment, as well as the classification results under various classification tasks of different data sets are presented in Section 4. Parameter sensitivity and fusion feature dimension of SLPCCA is analyzed in Section 5. Finally, conclusion are presented in Section 6.
2.
Feature extraction
The automatic detection of epileptic seizures is mainly the mining of effective information in EEG through machine learning. Its essence can be attributed to a pattern recognition problem, in which it is important to design features that can effectively distinguish epileptic EEG. Since EEG are significantly different in different frequency bands, power spectral density (PSD) is selected to reflect the changes of EEG energy with frequency. Besides, due to the non-stationary nature of EEG, frequency domain features that require signals to be stationary, such as PSD, have limitations. To overcome the limitations, feature extraction methods based on time-frequency analysis are proposed and fluctuation index of frequency slice wavelet transform (FSWT-FI) is selected here.
2.1. Power spectral density
PSD [8] can describe the energy change of the frequency domain signal. Since the energy of the EEG signal during an epileptic seizure is significantly greater than that of the non-seizure, the PSD can effectively distinguish the seizure and non-seizure EEG. The way to obtain the PSD of signal f(t) is as follows:
P(ω)=+∞∑τ=−∞E[f(t)f∗(t+τ)e−iωτ],
(2.1)
where P(ω) means the power spectral density at frequency ω, E means expectation, '∗' means conjugate and τ is the time delay.
EEG can be roughly divided into five rhythms, namely δ(1∼3Hz), θ(4∼7Hz), α(8∼13Hz), β(14∼30Hz) and γ(30∼80Hz). Each rhythm has different energy characteristics during a seizure. Therefore, the PSD on these rhythms are selected as features of the automatic detection of epileptic seizures.
2.2. Fluctuation index of frequency slice wavelet transform
Frequency slice wavelet transform (FSWT) is a time-frequency analysis method proposed by Yan et al. [9]. The FSWT of signal f(t) is as follows:
Wf(t,ω,k)=12π∫+∞−∞ˆf(u)ˆp∗(ku−ωω)eiutdu,
(2.2)
where Wf(t,ω,k) is the frequency slice wavelet transform form of f(t), '∗' means conjugate and ˆf is the Fourier transform of f. ˆp is the Fourier transform of the mother wavelet function, that is, the frequency slice function. The commonly used frequency slicing function is ˆp(ω)=e−0.5ω2. k is the time-frequency analysis coefficient, which can be obtained as k=△ωpηs, where △ωp is the width of the frequency window of ˆp(ω) and ηs is the frequency resolution of f(t). The inverse Fourier transform of Wf(t,ω,k) in the time domain (t1,t2) and frequency domain (ω1,ω2) is as follows, then the signal component of the f(t) in the time frequency domain (t1,t2,ω1,ω2) can be obtained,
ft1,t2,ω1,ω2=12π∫ω2ω1∫t2t1W(τ,ω,k)eiω(t−τ)dτdω.
(2.3)
In real applications, EEG is processed in discrete form. Correspondingly, the discrete FSWT is given by [10]
W(⋅,k)=F−1{F{f}∗^Pk},k=0,1,⋯,n−1
(2.4)
where n is the length of the signal f, F and F−1 are the discrete Fourier transform and inverse discrete Fourier transform, respectively. Besides, the frequency slice function ^Pk is a sparse sequence of limited length. Equation (2.4) can be realized by reducing resample in the time domain and using fast Fourier transform for each k. So, the inverse discrete FSWT is as follows:
f=1nF−1{Nθ−1∑l=0W(l,k)e−i(2π/n)lk},k=0,1,⋯,n−1
(2.5)
where Nθ is the number of resample points.
After obtaining the time-frequency distribution of EEG signals through FSWT, the corresponding frequency ranges of rhythms δ, θ, α, β and γ are reconstructed to obtain the time-domain waveforms of each rhythm. Then the fluctuation index (FI) of each rhythm signal is calculated as follows for the subsequent classification task,
FI=n−1∑t=1(ft+1−ft)2
(2.6)
where n is the total length of the signal f. ft+1 and ft are signals at two adjacent moments.
3.
Feature fusion algorithm
Although the above-mentioned PSD and FSWT-FI can well reflect the characteristics of epilepsy EEG, they have the risk of causing a dimension explosion. Because they are extracted on each rhythm of the EEG, when the object is a multi-channel EEG, the feature dimension will increase rapidly. In order to avoid the risk of dimension explosion, feature fusion algorithm is used after feature extraction to fuse information of two features while reducing the dimension.
Canonical correlation analysis (CCA) [11] is a multivariate statistical algorithm dealing with the correlation between two sets of variates. CCA transforms the correlation between two sets of random variables into the correlation between a few pairs of typical variables. It can not only retain the effective information of multiple EEG features participating in the fusion, but also reduce redundant information, thereby reducing the feature dimension.
Although CCA can effectively analyze the correlation between features and has played a role in image recognition tasks [12], it has two shortcomings. On the one hand, CCA is an unsupervised feature fusion algorithm, so it does not contain sample category information. In order to incorporate sample category information into the feature fusion process, discriminant canonical correlation analysis (DCCA) [13] is proposed. The fusion features obtained through the supervised feature fusion algorithm will be better applied to the task of epilepsy EEG classification. On the other hand, as an essentially linear dimension reduction technique, CCA cannot reveal the nonlinear relationship between features. So, kernel canonical correlation analysis (KCCA) [14] is proposed to converts the nonlinear problem in the original space into a linear problem in the high-dimensional space and uses the kernel trick to solve it. However, it is difficult for KCCA to solve the nonlinear problem embedded in the local space. Therefore, the idea of local preservation is introduced and locality preserving canonical correlation analysis (LPCCA) [15] is proposed to better contain the nonlinear properties of EEG. Inheriting the advantages of DCCA and LPCCA while making up for their shortcomings and adding sample weights, supervised locality preserving canonical correlation analysis (SLPCCA) [16] is proposed.
3.1. Canonical correlation analysis
Suppose there are two random variables ξ and η. CCA is to find the projection direction α and β so that the correlation coefficient of the projection αTξ and βTη is the largest. α and β are called typical projection directions. αTξ and βTη are called canonical correlation variables. By analyzing several pairs of unrelated canonical correlation variables, the analysis of the correlation between ξ and η can be completed. Typical projection directions α and β can be obtained by the following criterion function.
argmaxα,βρ=αTSξηβ√αTSξξα√βTSηηβ,
(3.1)
where Sξη represents the cross-covariance matrix of ξ and η. Sξξ and Sηη represent the covariance matrix of ξ and η, respectively.
When CCA is used for feature fusion, suppose there are two standardized feature sets X=[x1,x2,⋯,xn]∈Rp∗n and Y=[y1,y2,⋯,yn]∈Rq∗n, which means X and Y respectively contain p-dimensional features and q-dimensional features of n samples. Based on the idea of CCA, the canonical correlation variables between X and Y are extracted first. These canonical correlation variables can be called canonical correlation features, denoted as αT1X and βT1Y, αT2X and βT2Y, ⋯, αTdX and βTdY, a total of d pairs. Then, feature set after projection, namely X∗=(α1,α2,…,αd)TX and Y∗=(β1,β2,…,βd)TY, can form the fusion feature Z=(X∗Y∗) for classification.
3.2. Discriminant canonical correlation analysis
Because the sample category information needs to be considered, DCCA reorders the features in the original feature set X and Y according to categories. Then, the matrix Cw describing the similarity within the feature vector class can be defined as Cw=XAYT, where A is a symmetric positive semi-definite block matrix with rank c and block size ni. c is the number of sample categories and ni is the number of samples in ith category. Similarly, the matrix Cb describing the similarity between feature vector classes can be defined as Cb=∑ci=1∑cj=1≠i∑nik=1∑njl=1x(i)ky(j)Tl=−XAYT.
So, the criterion function of the typical projection direction α and β of DCCA can be defined as
argmaxα,βρ=αTXAYTβ√αTSXXα√βTSYYβ.
(3.2)
Same as CCA, Z=(X∗Y∗) can be used as the fusion feature after projection for classification.
To analyze the local structure of the data, if NX(xi) is the set of local neighbors of the sample xi, the local correlation of the sample can be expressed as αT⋅∑j∈NX(xi)(xi−xj)(yi−yj)T⋅β. Then, the k neighboring matrices of feature sets X are defined as follows
where tx are the mean value of the distance between all samples in the feature set X. FY can be obtained in the same way. So, the local canonical correlation can be expressed as αT⋅∑ni=1∑nj=1FX(i,j)(xi−xj)FY(i,j)(yi−yj)T⋅β and the criterion function of α and β of the typical projection direction of LPCCA can be defined as
where DFX and DFY are two diagonal matrices with diagonal element values DFX(i,i)=∑nj=1FX(j,i) and DFY(i,i)=∑nj=1FY(j,i). Like CCA and DCCA, Z=(X∗Y∗) can be used as the fusion feature after projection for classification.
Drawing on the idea of LPCCA, SLPCCA constructs the intra-class neighbor set of the sample and defines the intra-class k neighbor matrix of feature sets X as follows,
where SNX is the set of intra-class nearest neighbors of the sample. SFY can be obtained in the same way. Similar to LPCCA, the criterion functions of the typical projection directions α and β of SLPCCA are as follows,
where SDSFX and SDSFY are diagonal matrices composed of column elements of SFX and SFY, respectively.
If ^SXY=XSFXSFTYYT, ^SXX=X(SDSFX−SFX)XT and ^SYY=Y(SDSFY−SFY)YT, the solution of the criterion function is equivalent to the solution of the following optimization problem
maxα,βρ=αT^SXYβs.t.{αT^SXXα=1βT^SYYβ=1
(3.7)
In order to solve the above optimization problem, the Lagrange multiplier method is used to obtain the following generalized eigenvalue equation:
(0^SXY^SYX0)(αβ)=σ2(^SXX00^SYY)(αβ),
(3.8)
where σ is the Lagrange multiplier. Next, the generalized eigenvector {αi,βi},i=1,⋯,d, corresponding to the first d maximum generalized eigenvalues can be found. Thus, α and β can be constructed and then Z=(X∗Y∗) can be obtained.
3.5. Epilepsy EEG feature fusion method based on SLPCCA
After extracting PSD and FSWT-FI from EEG data, in order to reduce the feature dimension and information redundancy, SLPCCA is adopted to merge PSD and FSWT-FI into a fusion feature. Firstly, the k neighbor matrix within the class according to the relationship between samples in the feature set PSD and FSWT-FI is established. Then, samples and neighbor samples in PSD are correlated with the corresponding samples and neighbor samples in FSWT-FI. The local structure of the data is thus preserved. After that, by solving the optimization problem, the projection direction is obtained. Finally, the original features are projected and combined into a fusion vector. The specific process of feature fusion by SLPCCA is as follows
● Construct the intra-class k neighbor matrix SFX and SFY of feature sets X and Y, namely PSD and FSWT-FI;
● Establish criterion function ρ as Eq (3.6);
● Calculate typical projection directions α and β by solving optimization problem as Eq (3.7);
● Get the projection X∗ and Y∗ of feature set X and Y by α and β;
● Combine X∗ and Y∗ into a fusion feature Z=(X∗Y∗);
● Z is the fusion feature set obtained by the fusion of PSD and FSWT-FI, which will be used to train and test the classifier.
If H=^SXX−12^SXY^SYY−12=∑ri=1λiuivTi, where λ is the eigenvalue, then it can be known that the optimal solutions for projection directions α and β are ^SXX−12ui and ^SYY−12vi[11], respectively. Under the constraints of the optimization problem shown in Eq (3.7), the typical projection directions, namely αi and βi, are at most r pairs (r=rank(^SXY)). Because the projection directions αi and βi corresponding to the larger eigenvalue λi contain more information, the projection direction corresponding to the d largest eigenvalues can be used to construct the final d-dimensional projection direction.
Besides, for any α and β that meet the constraints of the optimization problem, there are
{αTi^SXXαj=βTi^SYYβj=δijαTi^SXYβj=λiδij
(3.9)
where δij={1,i=j0,i≠j. So, if x∗i=αTixi and y∗i=βTiyi, then for any i≠j there are
Therefore, cov(z∗i,z∗j)=0, which means the components of fusion features are irrelevant [12]. It proves that there is no information redundancy between the fusion features.
In addition, SLPCCA is further analyzed and compared with the above feature fusion algorithm, namely CCA, DCCA and LPCCA.
4.
Experiment
4.1. Dataset
The EEG data comes from the German Bonn University dataset [17] and CHB-MIT dataset [18], which have been widely used in epilepsy detection related tasks.
The Bonn dataset contains 5 subsets of A-E. Each subset contains 100 segments of single-channel EEG signals composed of 4097 sampling points and the sampling frequency is 173.61Hz. Subsets A and B are respectively taken from the scalp surface of five healthy volunteers with their eyes open and closed. Subsets C and D are respectively taken from five epilepsy patients outside the lesion area and within the lesion area of the inter-epilepsy. Subset E is taken from the epileptic seizure area of the above five epilepsy patients. Each EEG signal is divided into 4 evenly, that is, each subset is divided into a data set containing 400 single-channel EEG signals composed of 1024 sampling points.
The CHB-MIT dataset consists of multi-channel EEG signals from 24 epilepsy patients aged 3 to 22 years. Most EEG signals adopt 23 EEG channels and their electrode positions and naming adopt the international 10–20 system [19]. A sliding window with a window length of 4 seconds and a step length of 2 seconds is used to extract the inter-seizure segment and the seizure segment from the continuous EEG signal.
In the experiment, 70% of each type of sample is used as the training set and 30% is used as the test set. For the Bonn data set, PSD of each sample and the FSWT-FI of each sample's δ, θ, α, β, and γ rhythms are extracted as features. Then feature fusion algorithms are used to get the fusion feature. For the CHB-MIT data set, 23 channels' PSD of each EEG signal and the FSWT-FI of the five EEG rhythms are extracted as features. After forming the concatenated feature vector, feature fusion algorithms are used to get the fusion feature.
4.2. Pretreatment, classifier and evaluation index
The original EEG will be interfered by noise, such as eye electrical artifact and EMG interference from human body. Therefore, in order to better extract EEG features, wavelet threshold filtering is used for denoising. Besides, the band stop filter is used to remove power-line interference.
Least squares support vector machine (LS-SVM), which has been widely used in various fields such as machine learning, is chosen as the classifier. In order to evaluate the effect of the classifier, accuracy, sensitivity and specificity are introduced as evaluation indicators. They are obtained as follows:
where TP (True Positive) is the number of positive samples that are correctly classified, TN (True Negtive) is the number of negative samples that are correctly classified, FP (False Positive) is the number of positive samples that are incorrectly classified and FN (False Negative) is the number of negative samples that are incorrectly classified.
4.3. Experiment result
4.3.1. Experiments on Bonn dataset
In order to consider a wider range of practical clinical situations, such as the classification of multiple types of EEG signals and the classification under the uneven number of samples, 5 subsets of Bonn dataset are used to form 11 classification groups. They include A vs E, B vs E, C vs E, D vs E, AB vs E, AC vs E, AD vs E, ABC vs E, ABCD vs E, A vs C vs E and AB vs CD vs E. These 11 classification groups can be divided into 4 classification tasks: normal EEG vs seizure epileptic EEG, inter-seizure epileptic EEG vs seizure epileptic EEG, non-seizure EEG vs seizure epileptic EEG and normal EEG vs inter-seizure epileptic EEG vs seizure epileptic EEG. The classification results on these four tasks are shown in Table 1, Table 2, Table 3 and Table 4 respectively. The results shown in the tables are the average of all experiments in the corresponding task and the optimal result in each task is bolded.
Table 1.
Result of normal EEG vs seizure epileptic EEG.
● inter-seizure epileptic EEG vs seizure epileptic EEG
● non-seizure EEG vs seizure epileptic EEG
● normal EEG vs inter-seizure epileptic EEG vs seizure epileptic EEG
It can be seen from the above results that fusion features that formed by the fusion of PSD and FI using SLPCCA algorithm can achieve the best classification effect in all kinds of classification tasks, with an average of 99.08%. Series features are also superior to single features in all classification tasks, second only to SLPCCA. For other feature fusion algorithms, because the algorithm itself is not suitable for epilepsy EEG classification tasks, the results are even inferior to a single feature on some classification tasks. But because of the excellent performance of SLPCCA, it can be believed that the application of feature fusion algorithm in the classification of epilepsy EEG is feasible and effective.
4.3.2. Experiment on CHB-MIT dataset
Although the effectiveness of the feature fusion algorithm has been proven, the difference in the classification effect of each algorithm on the Bonn dataset is small. In addition, because the Bonn dataset is a single-channel EEG dataset with low feature dimensions, the experimental results on the Bonn dataset cannot reflect the advantages of feature fusion algorithm in dimension reduction. Therefore, the binary classification of inter-seizure epileptic EEG and seizure epileptic EEG on the CHB-MIT dataset are carried out. The results are shown in the Table 5. The optimal result is also bolded.
Table 5.
Result of inter-seizure epileptic EEG vs seizure epileptic EEG.
It can be seen from Table 5 that, in the multi-channel EEG data set, the dimension reduction advantage of the feature fusion algorithm is more obvious. Compared with the series feature, the dimension of each fusion feature has different degrees of reduction. The fusion feature obtained by SLPCCA achieve the highest classification accuracy of 97.18% when the feature dimension is almost one-half of the single feature dimension and one-fourth of the serial feature dimension. It is foreseeable that when the feature dimension or the number of EEG channels is further increased, when the dimensional explosion disaster occurs in the series feature, the fusion feature obtained by SLPCCA can still be in a low dimension. Besides, because the algorithm itself is not suitable for epilepsy EEG classification tasks, the results obtained by other fusion algorithms except SLPCCA are not as good as single feature, just like the experiment on Bonn dataset. However, because CCA and LPCCA consider the correlation between features, their accuracies are still higher than that of single feature dimension reduction methods, such as PCA.
4.3.3. Comparison with other methods
The proposed method is compared with other methods under the same dataset. The comparison results of Bonn dataset and CHB-MIT dataset are shown in the Table 6 and Table 7 respectively. It can be seen from the result that SLPCCA has a higher accuracy in more comprehensive classification tasks.
It can be seen from the results that although CCA, DCCA and LPCCA can achieve dimension reduction, the accuracy of the fusion features obtained by them is negatively affected. CCA and LPCCA are unsupervised algorithms, so they may introduce wrong category information in the feature fusion process, thereby reducing the classification accuracy. The dimension of the fusion features obtained by the DCCA is limited by the number of categories, so DCCA is not suitable for small-category classification tasks. However, SLPCCA can not only achieve feature dimension reduction, but also improve classification accuracy, so it will be further analyzed next.
5.1. Discussion of algorithm parameter sensitivity
In order to add the local structure information of the data, SLPCCA draws on LPCCA to construct the intra-class k neighbor matrix of the feature set. Because the parameter k will affect the effect of the algorithm, sensitivity analysis is performed on k. k ranges from 15% to 90% of the total number of samples in the same category. Accuracy of task AB vs CD vs E on the Bonn dataset is the evaluation index. The result is shown in the Figure 1.
Figure 1.
Sensitivity analysis result of parameter k.
It can be seen from the figure that when k is bigger than 40%, the accuracy is high and the fluctuation range does not exceed 1%. Therefore, SLPCCA can be considered not sensitive to parameter changes and relatively stable.
5.2. Discussion of feature dimension and classification effect
Among the mentioned various feature fusion algorithms, only the fusion feature dimension of DCCA is limited by the number of classification categories. Except DCCA, in CCA, LPCCA and SLPCCA, the dimension of the projection direction depends on the number of non-zero eigenvalues of the generalized characteristic equation. The dimension of the projection direction will then determine the dimension of the fusion feature and ultimately affect the classification effect. Therefore, the classification accuracies of the fusion features of these three algorithms under different feature dimensions are analyzed. Accuracy on the CHB-MIT dataset is the evaluation index. The result is shown in the Figure 2.
Figure 2.
The relationship between feature dimension and classification accuracy.
It can be seen from the figure that as the fusion feature dimension decreases, the accuracy rate of CCA decreases the fastest, followed by LPCCA. And SLPCCA maintains a relatively high accuracy rate when the feature dimension decreases. It is because LPCCA constructs a k-adjacent matrix on the basis of CCA, compared to the fusion features generated by CCA, the fusion features generated by LPCCA have information about the local structure of the data. Besides, SLPCCA borrows from DCCA and changes from an unsupervised feature fusion algorithm to a supervised feature fusion algorithm. The added sample category information enables SLPCCA to contain more information with less dimensions and ultimately achieve a better classification effect. In real applications, EEG features usually have high dimension and contain redundant information. Moreover, because EEG has strong frequency characteristics, features of combined frequency bands are often selected. But some of these frequency bands may be irrelevant to the research, which will lead to the inclusion of irrelevant components. Therefore, it is necessary to use feature fusion methods such as SLPCCA to reduce dimension while connecting multiple features. Thus, redundant information and irrelevant components in features are reduced, and the application effect of EEG, such as classification accuracy, is improved.
6.
Conclusions
In order to avoid the dimensional explosion and information redundancy caused by the series connection of multi-channel EEG features, SLPCCA, which can reflect the sample category information and the local structure of the data, is used for feature fusion. Then the epilepsy classification method based on SLPCCA including the following three steps is proposed. First, PSD and FSWT-FI are extracted from EEG as features. Secondly, SLPCCA is used to find a projection direction and merge the two features of one sample into one feature after projection. Compared with other feature fusion algorithms such as CCA, SLPCCA contains not only the local structure of the sample, but also the category information of the sample. Finally, fusion features are used to train and test LS-SVM. The proposed method is verified on the Bonn dataset and the CHB-MIT dataset. The experiment results show that the proposed method can achieve an average classification accuracy of 99.16%, a sensitivity of 99.06% and a specificity of 99.58% on the 11 classification tasks on the Bonn dataset and the feature dimension is only 80% of the series feature dimension. It can achieve an accuracy of 97.18%, a sensitivity of 97.10% and a specificity of 97.77% on inter-seizure epileptic EEG vs seizure epileptic EEG task on the CHB-MIT dataset and the feature dimension is only 25% of the series feature dimension. It can be seen that the proposed method not only has better classification accuracy than other methods, but also can effectively reduce the feature dimension. What's more, because the proposed method has no special processing, such as selecting specific channels for some patients or selecting different features for different tasks manually, it can be well adapted to other epilepsy EEG dataset.
Acknowledgments
This work was supported by the National Nature Science Foundation of China (Grant No.61971168, 61871427, 62071161), the Open Research Projects of Zhejiang Lab (No. 2021MC0AB04).
Conflict of interest
All authors declare no conflicts of interest in this paper.
References
[1]
A. J. Durnford, W. Rodgers, F. J. Kirkham, M. A. Mullee, A. Whitney, M. Prevett, et al., Very good inter-rater reliability of engel and ilae epilepsy surgery outcome classifications in a series of 76 patients, Seizure, 20 (2011), 809–812. doi: 10.1016/j.seizure.2011.08.004. doi: 10.1016/j.seizure.2011.08.004
[2]
E. Howell, Epilepsy stigma: Moving from a global problem to a global solution, Seizure-Eur. J. Epilepsy, 19 (2010), 628–629. doi: 10.1016/j.seizure.2010.10.016. doi: 10.1016/j.seizure.2010.10.016
[3]
M. Yildiz, E. Bergİl, The investigation of channel selection effects on epileptic analysis of eeg signals, Balk. J. Electr. Comput. Eng., 3 (2015), 236–241.
[4]
A. Matin, R. A. Bhuiyan, S. R. Shafi, A. K. Kundu, M. U. Islam, A hybrid scheme using pca and ica based statistical feature for epileptic seizure recognition from eeg signal, in 2019 Joint 8th International Conference on Informatics, Electronics & Vision (ICIEV) and 2019 3rd International Conference on Imaging, Vision & Pattern Recognition (icIVPR), IEEE, (2019), 301–306. doi: 10.1109/ICIEV.2019.8858573.
[5]
Y. Liu, B. Jiang, J. Feng, J. Hu, H. Zhang, Classification of EEG signals for epileptic seizures using feature dimension reduction algorithm based on LPP, Multimedia Tools Appl., 80 (2020), 30261-–30282. doi: 10.1007/s11042-020-09135-7. doi: 10.1007/s11042-020-09135-7
[6]
J. Birjandtalab, M. B. Pouyan, M. Nourani, Nonlinear dimension reduction for eeg-based epileptic seizure detection, in 2016 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI), IEEE, (2016), 595–598. doi: 10.1109/BHI.2016.7455968.
[7]
Q. Hou, Y. Liu, J. Liu, S. Sun, Epilepsy detection using random forest classification based on locally linear embedding algorithm, in 2020 5th International Conference on Control, Robotics and Cybernetics (CRC), IEEE, (2020), 202–206. doi: 10.1109/CRC51253.2020.9253455.
[8]
K. C. Chua, V. Chandran, R. Acharya, C. Lim, Automatic identification of epilepsy by hos and power spectrum parameters using eeg signals: A comparative study, in 2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, IEEE, (2008), 3824–3827. doi: 10.1109/IEMBS.2008.4650043.
[9]
Z. Yan, A. Miyamoto, Z. Jiang, Frequency slice wavelet transform for transient vibration response analysis, Mech. Syst. Signal Process., 23 (2009), 1474–1489. doi: 10.1016/j.ymssp.2009.01.008. doi: 10.1016/j.ymssp.2009.01.008
[10]
Z. Yan, T. Tao, Z. Jiang, H. Wang, Discrete frequency slice wavelet transform, Mech. Syst. Signal Process., 96 (2017), 385–392. doi: 10.1016/j.ymssp.2017.04.019. doi: 10.1016/j.ymssp.2017.04.019
[11]
H. Hotelling, Relations between two sets of variates, in Breakthroughs in statistics, Springer, (1992), 162–190. doi: 10.1007/978-1-4612-4380-9_14.
[12]
Q. S. Sun, S. G. Zeng, Y. Liu, P. A. Heng, D. S. Xia, A new method of feature fusion and its application in image recognition, Pattern Recognition, 38 (2005), 2437–2448. doi: 10.1016/j.patcog.2004.12.013. doi: 10.1016/j.patcog.2004.12.013
[13]
T. Sun, S. Chen, J. Yang, P. Shi, A supervised combined feature extraction method for recognition, in Procedings of the IEEE International Conference on Data Mining, Pisa, Italy, Citeseer, (2008), 1043–1048.
[14]
T. Melzer, M. Reiter, H. Bischof, Appearance models based on kernel canonical correlation analysis, Pattern Recognit., 36 (2003), 1961–1971. doi: 10.1016/S0031-3203(03)00058-X. doi: 10.1016/S0031-3203(03)00058-X
[15]
T. Sun, S. Chen, Locality preserving cca with applications to data visualization and pose estimation, Image Vision Comput., 25 (2007), 531–543. doi: 10.1016/j.imavis.2006.04.014. doi: 10.1016/j.imavis.2006.04.014
[16]
S. D. Hou, Q. S. Sun, D. S. Xia, Supervised locality preserving canonical correlation analysis algorithm, Pattern Recognit. Artif. Intell., 2012, 143–149.
[17]
R. Andrzejak, K. Lehnertz, F. Mormann, C. Rieke, P. David, C. Elger, Indications of nonlinear deterministic and finite-dimensional structures in time series of brain electrical activity: Dependence on recording region and brain state, Phys. Rev. E, 2001, 061907. doi: 10.1103/PhysRevE.64.061907. doi: 10.1103/PhysRevE.64.061907
[18]
A. Shoeb, J. Guttat, Application of machine learning To epileptic seizure detection, in International Conference on Machine Learning, 2010.
[19]
R. W. Homan, J. Herman, P. Purdy, Cerebral location of international 10–20 system electrode placement, Electroencephalogr. Clin. Neurophysiol., 66 (1987), 376–382. doi: 10.1016/0013-4694(87)90206-9. doi: 10.1016/0013-4694(87)90206-9
[20]
R. J. Martis, U. R. Acharya, J. H. Tan, A. Petznick, L. Tong, C. K. Chua, et al., Application of intrinsic time-scale decomposition (itd) to eeg signals for automated seizure prediction, Int. J. Neural Syst., 23 (2013), 1350023. doi: 10.1142/S0129065713500238. doi: 10.1142/S0129065713500238
[21]
N. Ahammad, T. Fathima, P. Joseph, Detection of epileptic seizure event and onset using EEG, BioMed. Res. Int., 2014. doi: 10.1155/2014/450573. doi: 10.1155/2014/450573
[22]
R. J. Martis, J. H. Tan, C. K. Chua, T. C. Loon, S. W. J. YEO, L. Tong, Epileptic eeg classification using nonlinear parameters on different frequency bands, J. Mech. Med. Biol., 15 (2015), 1550040. doi: 10.1142/S0219519415500402. doi: 10.1142/S0219519415500402
[23]
N. S. Tawfik, S. M. Youssef, M. Kholief, A hybrid automated detection of epileptic seizures in eeg records, Comput. Electr. Eng., 53 (2016), 177–190. doi:10.1016/j.compeleceng.2015.09.001. doi: 10.1016/j.compeleceng.2015.09.001
[24]
E. Kabir, Y. Zhang, Epileptic seizure detection from eeg signals using logistic model trees, Brain Inf., 3 (2016), 93–100. doi: 10.1007/s40708-015-0030-2. doi: 10.1007/s40708-015-0030-2
[25]
R. R. Sharma, R. B. Pachori, Time-frequency representation using ievdhm-ht with application to classification of epileptic eeg signals, IET Sci., Meas. Technol., 12 (2018), 72–82.
[26]
V. Gupta, R. B. Pachori, Epileptic seizure identification using entropy of fbse based eeg rhythms, Biomed. Signal Process. Control, 53 (2019), 101569. doi: 10.1016/j.bspc.2019.101569. doi: 10.1016/j.bspc.2019.101569
[27]
H. Al-Hadeethi, S. Abdulla, M. Diykh, R. C. Deo, J. H. Green, Adaptive boost ls-svm classification approach for time-series signal classification in epileptic seizure diagnosis applications, Expert Syst. Appl., 161 (2020), 113676. doi: 10.1016/j.eswa.2020.113676. doi: 10.1016/j.eswa.2020.113676
[28]
N. Rafiuddin, Y. U. Khan, O. Farooq, Feature extraction and classification of eeg for automatic seizure detection, in 2011 International Conference on Multimedia, Signal Processing and Communication Technologies, IEEE, (2011), 184–187. doi: 10.1109/MSPCT.2011.6150470.
[29]
Y. U. Khan, N. Rafiuddin, O. Farooq, Automated seizure detection in scalp eeg using multiple wavelet scales, in 2012 IEEE international conference on signal processing, computing and control, IEEE, (2012), 1–5. doi: 10.1109/ISPCC.2012.6224361.
[30]
M. Zabihi, S. Kiranyaz, A. B. Rad, A. K. Katsaggelos, M. Gabbouj, T. Ince, Analysis of high-dimensional phase space via poincaré section for patient-specific seizure detection, IEEE Trans. Neural Syst. Rehabil. Eng., 24 (2015), 386–398. doi: 10.1109/TNSRE.2015.2505238. doi: 10.1109/TNSRE.2015.2505238
[31]
P. Thodoroff, J. Pineau, A. Lim, Learning robust features using deep learning for automatic seizure detection, in Machine learning for healthcare conference, PMLR, (2016), 178–190.
[32]
M. Z. Ahmad, A. M. Kamboh, S. Saleem, A. A. Khan, Mallat's scattering transform based anomaly sensing for detection of seizures in scalp EEG, IEEE Access, 5 (2017), 16919–16929. doi: 10.1109/ACCESS.2017.2736014. doi: 10.1109/ACCESS.2017.2736014
[33]
D. Chen, S. Wan, J. Xiang, F. S. Bao, A high-performance seizure detection algorithm based on discrete wavelet transform (dwt) and EEG, PloS one, 12 (2017), e0173138. doi: 10.1371/journal.pone.0173138. doi: 10.1371/journal.pone.0173138
[34]
J. Bonello, L. Garg, G. Garg, E. E. Audu, Effective data acquisition for machine learning algorithm in eeg signal processing, in Soft Computing: Theories and Applications, Springer, (2018), 233–244. doi: 10.1007/978-981-10-5699-4_23.
[35]
K. M. Tsiouris, S. Markoula, S. Konitsiotis, D. D. Koutsouris, D. I. Fotiadis, A robust unsupervised epileptic seizure detection methodology to accelerate large EEG database evaluation, Biomed. Signal Process. Control, 40 (2018), 275–285. doi: 10.1016/j.bspc.2017.09.029. doi: 10.1016/j.bspc.2017.09.029
[36]
M. Zhou, C. Tian, R. Cao, B. Wang, Y. Niu, T. Hu, et al., Epileptic seizure detection based on EEG signals and CNN, Front. Neuroinf., 12 (2018), 95. doi: 10.3389/fninf.2018.00095. doi: 10.3389/fninf.2018.00095
[37]
M. B. Ahmadi, A. Craik, H. F. Azgomi, J. T. Francis, J. L. Contreras-Vidal, R. T. Faghih, Real-time seizure state tracking using two channels: A mixed-filter approach, in 2019 53rd Asilomar Conference on Signals, Systems, and Computers, IEEE, (2019), 2033–2039. doi: 10.1109/IEEECONF44664.2019.9048990.
[38]
J. Wu, T. Zhou, T. Li, Detecting epileptic seizures in eeg signals with complementary ensemble empirical mode decomposition and extreme gradient boosting, Entropy, 22 (2020), 140. doi: 10.3390/e22020140. doi: 10.3390/e22020140
[39]
S. Chakrabarti, A. Swetapadma, A. Ranjan, P. K. Pattnaik, Time domain implementation of pediatric epileptic seizure detection system for enhancing the performance of detection and easy monitoring of pediatric patients, Biomed. Signal Process. Control, 59 (2020), 101930. doi: 10.1016/j.bspc.2020.101930. doi: 10.1016/j.bspc.2020.101930
[40]
L. A. Moctezuma, M. Molinas, EEG channel-selection method for epileptic-seizure classification based on multi-objective optimization, Front. Neurosci., 14 (2020), 593. doi: 10.3389/fnins.2020.00593. doi: 10.3389/fnins.2020.00593
[41]
L. A. Moctezuma, M. Molinas, Classification of low-density eeg for epileptic seizures by energy and fractal features based on emd, J. Biomed. Res., 34 (2020), 180–190. doi: 10.7555/JBR.33.20190009. doi: 10.7555/JBR.33.20190009
This article has been cited by:
1.
Yuhuan Xiong, Jinghan Li, Duanpo Wu, Fang Dong, Junbiao Liu, Lurong Jiang, Jiuwen Cao, Yuansheng Xu,
Seizure detection algorithm based on fusion of spatio-temporal network constructed with dispersion index,
2023,
79,
17468094,
104155,
10.1016/j.bspc.2022.104155
2.
Yunyuan Gao, Hongming Liu, Feng Fang, Yingchun Zhang,
Classification of Working Memory Loads via Assessing Broken Detailed Balance of EEG-FNIRS Neurovascular Coupling Measures,
2023,
70,
0018-9294,
877,
10.1109/TBME.2022.3204718
Xiaodong Yang, Yanlin Ren, Binyi Hong, Aijun He, Jun Wang, Zhixiao Wang,
Epileptic detection in single and multi-lead EEG signals using persistent homology based on bi-directional weighted visibility graphs,
2023,
33,
1054-1500,
10.1063/5.0140579
5.
Ruchi Holker, Seba Susan,
Linear Monotonic Inter-electrode Associations as Quantitative EEG for Alcoholism Diagnosis,
2024,
17,
1875-6883,
10.1007/s44196-024-00660-z
6.
Wulin Zhang, Xuan Liu, Zirun Chang, Yuanming Tian, Xingkai Wang, Xueliang Gong, Qianyu Hao, Zirui Liu, Yuqiang Chen,
2024,
Segmentation method of pulmonary nodules based on AC-UNET,
9798400718106,
918,
10.1145/3671151.3671312
7.
Wulin Zhang, Yuanming Tian, Xuan Liu, Zirun Chang, Zirui Liu, Yuqiang Chen,
2024,
Research on feature extraction method based on epileptic EEG signal,
9798400718106,
888,
10.1145/3671151.3671307
8.
Min Wu, Hong Peng, Zhicai Liu, Jun Wang,
Seizure Detection of EEG Signals Based on Multi-Channel Long- and Short-Term Memory-Like Spiking Neural Model,
2024,
34,
0129-0657,
10.1142/S0129065724500515
9.
Lu 璐 Ma 马, Yan-Lin 彦霖 Ren 任, Ai-Jun 爱军 He 何, De-Qiang 德强 Cheng 程, Xiao-Dong 小冬 Yang 杨,
Detection of EEG signals in normal and epileptic seizures with multiscale multifractal analysis approach via weighted horizontal visibility graph,
2023,
32,
1674-1056,
110506,
10.1088/1674-1056/acdfbf
10.
Shan Liu, Jiang Wang, Shanshan Li, Lihui Cai,
Epileptic Seizure Detection and Prediction in EEGs Using Power Spectra Density Parameterization,
2023,
31,
1534-4320,
3884,
10.1109/TNSRE.2023.3317093
11.
Zhuoyi Li, Beibei Chen, Ning Zhu, Wenjun Li, Tianming Liu, Lei Guo, Junwei Han, Tuo Zhang, Zhiqiang Yan,
Epileptic Seizure Detection in SEEG Signals Using a Signal Embedding Temporal-Spatial–Spectral Transformer Model,
2025,
74,
0018-9456,
1,
10.1109/TIM.2025.3527489
12.
Md. Nuhi-Alamin, Ajay Krishno Sarkar, Md. Simul Hasan Talukder, Mohammad Aljaidi, Safkat Shahrier Swapnil,
2024,
Continuous Wavelet Transform Spectral Image Based Epileptic Episode Classification,
979-8-3315-4001-2,
1,
10.1109/ACIT62805.2024.10877163