Research article

Application of bi-modal signal in the classification and recognition of drug addiction degree based on machine learning


  • Received: 10 July 2021 Accepted: 09 August 2021 Published: 20 August 2021
  • Most studies on drug addiction degree are made based on statistical scales, addicts' account, and subjective judgement of rehabilitation doctors. No objective, quantified evaluation has been made. This paper uses devises the synchronous bimodal signal collection and experimentation paradigm with electroencephalogram (EEG) and forehead high-density near-infrared spectroscopy (NIRS) device. The drug addicts are classified into mild, moderate and severe groups with reference to the suggestions of researchers and medical experts. Data of 45 drug addicts (mild: 15; moderate: 15; and severe: 15) is collected, and then used to design an addiction degree testing algorithm based on decision fusion. The algorithm is used to classify mild, moderate and severe addiction. This paper pioneers to use two types of Convolutional Neural Network (CNN) to abstract the EEG and NIR data of drug addicts, and introduces batch normalization to CNN, thus accelerating training process, reducing parameter sensitivity, and enhancing system robustness. The characteristics output by two CNNs are transformed into dimensions. Two new characteristics are assigned with a weight of 50% each. The data is used for decision fusion. In the networks, 27 subjects are used as training sets, 9 as validation sets, and 9 as testing sets. The 3-class accuracy remains to be 63.15%, preliminarily justifying this method as an effective approach to measure drug addiction degree. And the method is ready to use, objective, and offers results in real time.

    Citation: Xuelin Gu, Banghua Yang, Shouwei Gao, Lin Feng Yan, Ding Xu, Wen Wang. Application of bi-modal signal in the classification and recognition of drug addiction degree based on machine learning[J]. Mathematical Biosciences and Engineering, 2021, 18(5): 6926-6940. doi: 10.3934/mbe.2021344

    Related Papers:

    [1] Banghua Yang, Xuelin Gu, Shouwei Gao, Ding Xu . Classification accuracy and functional difference prediction in different brain regions of drug abuser prefrontal lobe basing on machine-learning. Mathematical Biosciences and Engineering, 2021, 18(5): 5692-5706. doi: 10.3934/mbe.2021288
    [2] Ning Huang, Zhengtao Xi, Yingying Jiao, Yudong Zhang, Zhuqing Jiao, Xiaona Li . Multi-modal feature fusion with multi-head self-attention for epileptic EEG signals. Mathematical Biosciences and Engineering, 2024, 21(8): 6918-6935. doi: 10.3934/mbe.2024304
    [3] Hui Jiang, Ling Chen, Fengying Wei, Quanxin Zhu . Survival analysis and probability density function of switching heroin model. Mathematical Biosciences and Engineering, 2023, 20(7): 13222-13249. doi: 10.3934/mbe.2023590
    [4] Xiaowen Jia, Jingxia Chen, Kexin Liu, Qian Wang, Jialing He . Multimodal depression detection based on an attention graph convolution and transformer. Mathematical Biosciences and Engineering, 2025, 22(3): 652-676. doi: 10.3934/mbe.2025024
    [5] Sakorn Mekruksavanich, Wikanda Phaphan, Anuchit Jitpattanakul . Epileptic seizure detection in EEG signals via an enhanced hybrid CNN with an integrated attention mechanism. Mathematical Biosciences and Engineering, 2025, 22(1): 73-105. doi: 10.3934/mbe.2025004
    [6] Tianhui Sha, Yikai Zhang, Yong Peng, Wanzeng Kong . Semi-supervised regression with adaptive graph learning for EEG-based emotion recognition. Mathematical Biosciences and Engineering, 2023, 20(6): 11379-11402. doi: 10.3934/mbe.2023505
    [7] Ravichandra Madanu, Farhan Rahman, Maysam F. Abbod, Shou-Zen Fan, Jiann-Shing Shieh . Depth of anesthesia prediction via EEG signals using convolutional neural network and ensemble empirical mode decomposition. Mathematical Biosciences and Engineering, 2021, 18(5): 5047-5068. doi: 10.3934/mbe.2021257
    [8] Ying Chang, Lan Wang, Yunmin Zhao, Ming Liu, Jing Zhang . Research on two-class and four-class action recognition based on EEG signals. Mathematical Biosciences and Engineering, 2023, 20(6): 10376-10391. doi: 10.3934/mbe.2023455
    [9] Yanling An, Shaohai Hu, Shuaiqi Liu, Bing Li . BiTCAN: An emotion recognition network based on saliency in brain cognition. Mathematical Biosciences and Engineering, 2023, 20(12): 21537-21562. doi: 10.3934/mbe.2023953
    [10] Chenyang Jiang, Weidong Jiang . AGTR1, PLTP, and SCG2 associated with immune genes and immune cell infiltration in calcific aortic valve stenosis: analysis from integrated bioinformatics and machine learning. Mathematical Biosciences and Engineering, 2022, 19(4): 3787-3802. doi: 10.3934/mbe.2022174
  • Most studies on drug addiction degree are made based on statistical scales, addicts' account, and subjective judgement of rehabilitation doctors. No objective, quantified evaluation has been made. This paper uses devises the synchronous bimodal signal collection and experimentation paradigm with electroencephalogram (EEG) and forehead high-density near-infrared spectroscopy (NIRS) device. The drug addicts are classified into mild, moderate and severe groups with reference to the suggestions of researchers and medical experts. Data of 45 drug addicts (mild: 15; moderate: 15; and severe: 15) is collected, and then used to design an addiction degree testing algorithm based on decision fusion. The algorithm is used to classify mild, moderate and severe addiction. This paper pioneers to use two types of Convolutional Neural Network (CNN) to abstract the EEG and NIR data of drug addicts, and introduces batch normalization to CNN, thus accelerating training process, reducing parameter sensitivity, and enhancing system robustness. The characteristics output by two CNNs are transformed into dimensions. Two new characteristics are assigned with a weight of 50% each. The data is used for decision fusion. In the networks, 27 subjects are used as training sets, 9 as validation sets, and 9 as testing sets. The 3-class accuracy remains to be 63.15%, preliminarily justifying this method as an effective approach to measure drug addiction degree. And the method is ready to use, objective, and offers results in real time.



    No researchers have applied AI algorithms in the quantified evaluation of drug addiction degree by now. Based on statistical scales, addicts' account, and subjective judgement of rehabilitation doctors, this paper classifies drug addicts into mild, moderate and severe groups. The EEG-NIRS synchronous collection experiment is then designed to obtain the data of 45 subjects. Machine learning methods are used to classify the drug addiction degree. The main contributions of this study include: 1) the feasibility and advantages of EEG-NIRS in the research on detoxification are put forward; 2) artificial intelligence algorithm is used for Bi-modal Signal processing; 3) the objective and quantitative evaluation of drug addiction levels and craving is realized; 4) it provides a basis for the formulation of targeted rehabilitation training program in the future.

    The Electroencephalogram (EEG) is a type of spontaneous bioelectrical signal captured and recorded on the surface of the scalp. It has the characteristics of being noninvasive, no radiation damage and low cost, so it is widely used in the research of brain diseases. Near-infrared spectroscopy technology uses light injection and detection points to measure the hemodynamic function of brain tissue, and can record blood oxygen levels [1,2,3,4]. In human body, oxygenated hemoglobin (HbO2) and deoxyhemoglobin (Hbb) have specific absorption for near-infrared light at 600–800 nm. However, other biological tissues in the brain are relatively transparent in this wavelength range to measure the near-infrared light intensity changes at 650 and 760 nm. The cerebral hemodynamic data were obtained according to beer Lambert Law [5,6]. The EEG can capture the changes in temporal activity. NIRS quantitatively analyzes the blood oxygen metabolism level of brain tissue via the spectral measurement method [7,8], the combination of the two marks the successful integration of EEG and cerebral blood oxygen metabolism level to positively quantify the degree of drug addiction [9,10,11].

    Zhang et al. adopt the method of multi-modal task in the study and propose the new prediction framework of regions of interests (ROIs) on the cortex, improving the accuracy of predicting ROI [12]. Xia et al. propose a machine-learning method integrating stacked denoising autoencoder (SDAE) which has advantages over electrocardiogram (ECG) classification [13]. Wallois et al. used EEG-NIRS to synchronously collect information to study the brain activation mechanism [14]. In 2015, Balconi et al. studied the connection between EEG-NIRS and emotions, and the findings revealed the correlation between cortical forward networks and emotional stimuli [15]. Fazli et al. used two devices to synchronize information and improved the recognition accuracy of motion imagination to 90% [16]. Tomita et al. through visual cortex stimulation experiments, found that the brain-computer interface of EEG-NIRS is more accurate than using event-related potentials alone to recognize light scintillation [17]. At present, the research on the fusion of the results from the two methodologies has been applied in the fields of brain function diseases and cognitive analysis [18], evaluation of mental and movement disorders [19], brain-computer interface and other fields [20,21]. In 2017, Zich et al. studied the influence of age on the neurological correlation of myocardial infarction [22]. Safaie et al. studied the fusion monitoring of EEG-NIRS and EEG-NIRS [23]. Sawan's EEG-NIRS fusion was used to monitor intracranial brain function in epilepsy [24]. In 2015, Rand et al. designed a near-infrared probe which can be mixed with EEG electrodes, and verified the feasibility of the scheme through experiments [25]. In 2016, Buccino and other researchers tried to classify four different samples of motor imagery, namely right arm, left arm, right hand and left hand's tasks [26]. Shin et al, used EEG-NIRS to study two types of motor imagery, and the final accuracy rate reached 82% [27]. Guo et al. used deep learning to do emotion recognition for expression and gesture signals, with the highest accuracy rate of 50.42% [28].

    The study is based on the subjects of forty-five (All male) drug addicts (15 mild, 15 moderate and 15 severe) who were pre-screened according to the behavioral scale screening, oral narration and medical examination. According to the designed experimental paradigm, the EEG and NIRS data of these drug addicts were collected. This paper designs an algorithm to detect drug addiction using CNN's detection algorithm based on decision level fusion. EEG-CNN network and NIRS-CNN network, respectively extracted features related to the degree of addiction and performed some normalization processing before classification to obtain two types of feature signals with the same dimension. After that, the two features were given a 50% weight. 27 subjects were used as training data, 9 subjects were used as validation data and 9 subjects were used as test data, for final classification.

    At the Shanghai Qingdong Drug Rehabilitation Center, 45 subjects were recruited, all of whom were male methamphetamine withdrawal. Average age: 31.30 ± 4.64. Average education level: 10.07 ± 2.849. History of drug use: 9.929 ± 5.341. The duration of drug detoxification this time is 11.15 ± 2.73 months.

    The specific inclusion criteria for the subjects are as follows: 1) patients meeting the DSM-5 diagnostic criteria for mental disorders caused by psychoactive substances; 2) patients with detoxification period less than half a year; 3) patients with junior high school education or above; 4) patients aging 18–41 years; 5) patients with willingness to participate in this study and informed consent. Exclusion criteria: 1) severe cognitive impairment, or unable to cooperate with the study-related evaluations and examinations; 2) severe somatic diseases; 3) severe psychotic symptoms; 4) participating in other psychological intervention and treatment; 5) a history of abuse of other psychoactive substances (except nicotine). The study was conducted in accordance with the declaration of Helsinki and was approved by the Ethics Committee of Shanghai University (Approval No. ECSHU2020-071).

    Neuracle Technology, Wet electrode wireless EEG acquisition equipment. The electrode position international standard 10/20 system. Number of brain electrodes: 64, sampling rate: 1000 Hz, input range: +/- 375 mV, data transmission mode: WiFi. It can simultaneously collect multi-lead EEG, ECG and other signals. In order to ensure the signal quality, the impedance of brain electrodes is lower than 5 KΩ. According to the actual demand, 48 electrodes are selected. Figure 1, shows the distribution of EEG electrodes.

    Figure 1.  Electrode distribution of EEG equipment.

    NIRSIT (OBELAB, Seoul, Korea), light source type: dual wavelength VCSEL laser, technical spectrum: CW: wave 780 nm, 850 nm, spatial resolution: 4 ×4 mm2, time resolution: 8.13 Hz, number of light points: 24, number of detection points: 32, detection depth: 0.2–1.8 cm. The NIRS system used in the experiment was able to measure the signals from four SD separations: 15, 21.2, 30 and 33.5 mm, and allowed to measure the changes of hemodynamic reaction at different depths [29]. It is a functional near infrared spectroscopy (fNIRS) device with 204 channels [30]. Figure 2, The method used in the experiment of NIRS equipment.

    Figure 2.  The method used in the experiment of NIRS equipment.

    The NIRSIT channel and functional area are shown in Figure 3. The four advanced functional areas detected by the forehead near-infrared device are divided into the dorsolateral prefrontal cortex, the ventrolateral prefrontal cortex, frontopolar prefrontal cortex, and the orbital frontal cortex. The specific channel distribution: the right dorsolateral prefrontal cortex is 1, 2, 3, 5, 6, 11, 17, 18 channels. There were 19, 20, 33, 34, 35, 38, 39 and 43 channels in the left dorsolateral prefrontal lobe. There are 4, 9, 10, 40, 44, 45 channels in ventrolateral prefrontal cortex of left and right hemispheres. There are 14, 15, 16, 29, 30, 31, 32, 46, 47, 48 channels in the left and right orbital frontal cortex. The frontopolar prefrontal cortex electrode arrangement is 7, 8, 12, 13, 21, 22, 23, 24, 25, 26, 27, 28, 36, 37, 41 and 42.

    Figure 3.  Division of NIRSIT channels and functional areas.

    We used E-prime software package (Psychology Software Tools, Pittsburgh, PA) to write the experimental paradigm, with each map numbered. A complete experimental paradigm consists of the following three stages.

    Figures 4 and 5, are examples of drug abuse-related maps and neutral maps used in the experimental paradigm. Figure 6 The whole process of experimental paradigm. P means: drug map; N means: neutral map.

    Figure 4.  Examples of drug abuse-related maps in the experimental paradigm.
    Figure 5.  Examples of neutral maps in the experimental paradigm.
    Figure 6.  The whole experimental structure.

    The first stage of the experimental paradigm, 10 minutes in total, during which the subjects need to close their eyes for 5 minutes and then open their eyes for 5 minutes.

    The second stage, it lasts 6 minutes and is divided into drug maps and neutral maps. Among them, each block lasts 10 seconds. There are a total of 16 maps, and the display time of each map is 0.6 seconds. At the beginning, the first four maps are displayed randomly, during which there are two drug maps. After displaying the first four maps, the remaining 12 neutral maps are displayed randomly. After a block ends, there will be a 4-second interval map with a white background and a black cross. In the second stage, when the subjects see the drug map flash out, they need to be marked on the paper synchronously.

    The third stage, it lasts a total of 4.6 minutes, during which the maps are all neutral, with each block lasting 10 seconds. There are 16 maps in total, with a display speed of 0.6 seconds. There will be a 4-second interval between each block.

    Decision fusion means that each signal's data makes independent decisions first, and then the final decision was obtained with the help of weighted calculations according to the results of independent decisions of each signal.

    EEG and NIRS data were obtained separately, due to the difference in sampling frequency of EEG equipment and NIRS equipment. After the two types of data were preprocessed by their respective CNN networks, feature extraction, and decision-making, the two types of data were then processed by batch normalization at the decision-making level, so that the data becomes a distribution with a mean value of 0 and a standard deviation of 1. Figure 7, the flow chart of bimodal addiction recognition based on decision level fusion.

    Figure 7.  The flow chart of bimodal addiction recognition based on decision level fusion.

    Both EEG and NIRS networks are designed with Butterworth filters. The expression of n-order Butterworth filter is:

    (1)

    where n is the order, is the cutoff frequency, and is the passband edge frequency.

    In this experiment, n is 6 and the data is band-pass filtered. The selected frequency band ranges from 0.01 to 3 Hz. This frequency range can remove the interference of heartbeat, breathing, and slow drift on the original data. The EEG sampling rate is 1000 Hz, and the Butterworth filter is also selected for filtering. The selected frequency band ranges from 0.5 to 7.5 Hz. In the experiment, n takes 2 to filter the EEG data to remove signal interference such as ocular and facial myoelectric activity.

    Convolutional layers are the core of convolutional neural networks [31,32]. Their main role is to extract features from the input data. The calculation form is as follows:

    (2)

    is the th feature of the layer . is the convolutional kernel weight, and the th feature of the layer . is a bias parameter, is the activation function.

    The pooling layer sub-samples the input features according to specific rules in order to make the network robust to small changes in previously learned features [33,34]. The calculation form is as follows:

    (3)

    is the th feature of the layer . is the Subsampling coefficient. is the bias parameter, is a sub-sampling function, is the activation function.

    In this method, each neuron feature is normalized individually, and the mean and variance are calculated on a single training data block at a time, and then normalized. Each neuron feature is normalized individually, and the mean and variance are calculated on a single training data block at a time, and then normalized. The specific implementation process is as follows:

    Input: Values of over a mini-batch:
    Parameters to be learned:
    Output:
    1. Calculate the mean of each mini-batch:

    2. Calculate the variance of each mini-batch:

    3. Normalize the data:

    4. Scale and shift:

     | Show Table
    DownLoad: CSV

    In this paper, the activation function uses a Rectified Linear Unit (ReLU), and the corresponding calculation formula is as follows:

    (4)

    The results show that the derivation of the activation function is simple, and the output of some neurons is 0.

    The input to the fully connected layer must be one-dimensional, and the previous features are two-dimensional, so the order of each feature is converted to one-dimensional before it can be used as the input of the fully connected layer [35,36]. The calculation formula for the fully connected layer is as follows:

    (5)

    is the output value of the neuron. is the input feature vector of the neuron. is the weight. is the bias parameter. is the activation function; The first fully connected layer in this paper uses the ReLU activation function.

    In CNN, if the final output result is single-label multi-classification, the softmax function is usually used to normalize and map to the probability value [37,38], and the Softmax calculation formula is as follows:

    (6)

    is the value of the output neuron corresponding to the ith category.

    Required parameters:Learning rate ; ; Exponential decay rate for moment estimation; The random objective function , whose parameter is ; Initialized parameter vector .
    Preparation: (initialize first moment vector) (initialize second moment vector) (initialization time step).
    while not converged do

    (Calculate the gradient at the time step)
    (Updated first moment estimation)
    (Updated biased second moment estimation)
    (Calculate the first-order moment estimation for deviation correction)
    (Calculate the second-order moment estimation for deviation correction)
    (Update parameters)
    end while
    return (Final parameters)

     | Show Table
    DownLoad: CSV

    Adam algorithm is an algorithm to perform a step optimization for random objective functions. The core idea of this algorithm is adaptive low order moment estimation. The pseudo code for the Adam algorithm.

    Figure 8 shows the EEG-NIRS addiction classification structure. the module 1 is the CNN network architecture. Table 1 EEG model parameters.

    Figure 8.  EEG-NIRS drug addiction degree classification framework.
    Table 1.  EEG model parameters.
    Layer (type) Output Shape Parameter#
    conv2d_5 (Conv2D) (None, 38,282, 32) 608
    batch_normalization_6 (None, 32,282, 32) 128
    max_pooling2d (None, 16,141, 32) 0
    conv2d_6 (Conv2D) (None, 16,141, 32) 18464
    batch_normalization_7 (None, 16,141, 32) 128
    max_pooling2d_1 (None, 8, 71, 32) 0
    conv2d_7 (None, 8, 71, 32) 6176
    batch_normalization_8 (None, 8, 71, 32) 128
    max_pooling2d_2 (None, 4, 35, 32) 0
    conv2d_8 (None, 4, 35, 32) 6176
    batch_normalization_9 (None, 4, 35, 32) 128
    max_pooling2d_3 (None, 2, 17, 32) 0
    conv2d_9 (None, 2, 17, 32) 9248
    batch_normalization_10 (None, 2, 17, 32) 128
    max_pooling2d_4 (None, 1, 8, 32) 0
    conv2d_10 (None, 1, 8, 32) 9248
    batch_normalization_11 (None, 1, 8, 32) 128
    max_pooling2d_5 (None, 1, 7, 32) 0
    conv2d_11 (None, 1, 7, 16) 4624
    batch_normalization_12 (None, 1, 7, 16) 64
    max_pooling2d_6 (None, 1, 6, 16) 0
    conv2d_12 (None, 1, 6, 32) 4640
    batch_normalization_13 (None, 1, 6, 32) 128
    max_pooling2d_7 (None, 1, 6, 32) 0
    flatten (Flatten) (None, 288) 0
    dense (Dense) (None, 460) 132940
    batch_normalization_14 (None, 460) 1840

     | Show Table
    DownLoad: CSV

    The CNN network parameters of EEG before the fusion of the two features.

    Figure 8, Module 2 is the CNN network architecture corresponding to NIRS. Table 2 fNIRS model parameters.

    Table 2.  fNIRS model parameters.
    Layer (type) Output Shape Parameter#
    conv2d (Conv2D) (None, 9, 16, 32) 96
    batch_ normalization (None, 9, 16, 32) 128
    average_pooling2d (None, 9, 8, 32) 0
    conv2d_1 (None, 9, 8, 32) 4128
    batch_normalization_1 (None, 9, 8, 32) 128
    average_pooling2d_1 (None, 9, 4, 32) 0
    conv2d_2 (None, 9, 4, 32) 4128
    batch_normalization_2 (None, 9, 4, 32) 128
    average_pooling2d_2 (None, 9, 2, 32) 0
    conv2d_3 (None, 9, 2, 16) 2064
    batch_normalization_3 (None, 9, 2, 16) 64
    average_pooling2d_3 (None, 9, 1, 16) 0
    conv2d_4 (None, 9, 1, 32) 2080
    batch_normalization_4 (None, 9, 1, 32) 128
    average_pooling2d_4 (None, 9, 1, 32) 0
    batch_normalization_5 (None, 9, 1, 32) 128
    flatten_1 (Flatten) (None, 192) 0
    dense_1 (Dense) (None, 460) 88780
    batch_normalization_15 (None, 460) 1840

     | Show Table
    DownLoad: CSV

    Before the fusion of the two signals, the CNN network architecture of fNIRS.

    Figure 8, Module 3, normalization of EEG and NIRS output characteristics: EEG characteristics: batch normalization level. The EEG features were normalized to optimize the one-dimensional feature distribution. Near-infrared characteristics: batch normalization level. The near-infrared features were normalized to optimize the one-dimensional feature distribution. Weight distribution: after normalization, EEG and NIRS data of the same dimension and the same amount of data were obtained. At the same time, 50% weight was given to each type of data. The data was processed and output to the full connection level. Full connectivity level. One dimensional feature distribution was searched and finally classified.

    A total of 32 electrodes in the central, parietal, and occipital regions of the brain were selected for EEG. The sampling frequency of EEG data was 1000 Hz. The experiment used 45 subjects' data, each subject data has 56 trials, and each trial contained 32-channel EEG data of 1.125 s after the drug induced picture appeared. The CNN network selected 27 personal data for training, accounting for 60% of the total number of people, 9 personal data for validation and 9 personal data testing, each accounting for 20% of the total number of people.

    For fNIRS data, 16 channels in the dorsolateral prefrontal cortex, ventrolateral prefrontal cortex, and frontal polar prefrontal cortex, which theoretically, related directly to the drug users, were selected. The data format was 10 channels 8 Hz. The experiment used 45 subjects' data, each of which contained 56 trial labels. Each trial contained 16-channel near-infrared data of 1.125 s after the drug picture appeared. Consistent with EEG processing, 27 subjects were for data training, 9 subjects were used as validation data and 9 subjects were used as test data. There were 3024 trial labels of training data, 1008 trial labels of validation data and 1008 trial labels of testing data. Batch normalization changes the size of the data variance and the mean position by using optimization. This method processes the EEG and NIRS features separately, and converts EEG and NIRS features into data distribution of the same dimension and format, which is possible to combine the two features.

    In this paper, the CNN network uses the Adam optimizer to calculate loss through sparse categorical cross-entropy. After 1300 epochs, the result is shown in Figure 9. The loss changes showing a convergence trend, which proves that the network structure is stable. When monitoring the accuracy of the test data set, it was found that after 900 epochs, as shown in Figure 10, the accuracy of the test data set showed a significant overall decline, and over-fitting occurred. In order to suppress the occurrence of over-fitting, the Early stopping method is used to monitor the accuracy of the test set. This method continuously records the accuracy of the test set. If the optimal accuracy is not reached in multiple consecutive epochs, it stops the iteration and saves the optimal accuracy model. In this article, the Early stopping monitoring period is set to 5000 epochs. After many experiments, the optimal model basically appeared between 800 and 1000 epoch.

    Figure 9.  Loss of Hybrid CNN model during training.
    Figure 10.  Test data set accuracy during training.

    Figure 11, shows the average classification accuracy of drug addiction degree. Since the final model of neural network training is different each time. According to the CNN network, the correct rate of the ten average classification is 63.15%. The ultimate goal of this paper was to realize the intelligent judgement of the addiction degree of drug addicts through the machine learning method, instead of subjective judgment in traditional behavioral methods. From the analysis results obtained, it was consistent with the behavioral scale and expert judgment.

    Figure 11.  Average classification accuracy of drug addiction.

    In this paper, based on the EEG-NIRS bimodal Signal classification of drug addicts' method, using the machine learning algorithm, CNN classified the degree of addiction by learning the characteristics related to the degree of addiction of drug addicts, which was objective and easy to use.

    In this paper, EEG-NIRS was used to classified the degree of drug addiction by using physiological data. It is an improvement of traditional psychological methods. In the study, a bimodal machine-learning classification algorithm based on decision level fusion is proposed to realize the classification of drug abusers.

    In 2016, Das et al. Studied EEG-NIRS as a novel technology for brain monitoring [39]. Xing et al. summarized the fine-needle aspiration cytological diagnosis of malignant lymphoma and formed a simple and effective classification diagnosis method [40]. Shin et al. Verified the potential advantages of hybrid EEG-NIRS-BCIs in classification accuracy through experiments [41]. The bimodal fusion machine learning classification algorithm finally obtained a classification accuracy of 63.15%. This accuracy rate is not very high. However, by adjusting the network architecture of CNN, it is possible to improve the classification accuracy.

    To objectively evaluate the degree of drug addiction by machine learning, we should pay attention to the following aspects: 1) The integrity of EEG and NIRS data; 2) Feature extraction can be considered during machine learning network design, input layer fusion, feature layer fusion, decision layer fusion, and the classifier can consider the comparison of SVM, LDA, and CNN methods. This method combines the personal situation of drug users, psychological questionnaires, and doctors' suggestions, and has been verified through long experiments to evaluate the scientific nature of drug addiction cravings.

    The limitation of this study is that the number of subjects used is not large enough. When doing deep learning, we need more data as support, and the effect will become better; The limitation of the machine learning model is that the two-modal combination method does not achieve higher 3-class accuracy. When researchers encounter similar problems, they must adjust the machine learning architecture to obtain higher classification accuracy.

    This paper was intended to find an objective and effective method to evaluate the degree of drug addiction. In this regard, an experimental paradigm was designed to induce the drug users' cravings for drugs. The 45 subjects were induced to crave characteristics by drug maps in the experimental paradigm. Based on CNN learning these characteristics, the drug addiction degrees of drug users were finally classified.

    In addition, two CNNs were built on the basis of decision-level fusion, giving full play to the respective advantages of the EEG-CNN and the near-infrared CNN, and then fusion classification was achieved. Experiments had verified that this method could effectively detect the data features associated with addiction and define the addiction degree. The patterns were consistent between the algorithm classification and the classification of mild, moderate, and severe drug users provided by researchers and physicians.

    This project was supported by Key Research & Development Project of National Science and Technique Ministry of China (No. 2018YFC0807405, No. 2018YFC1312903), Defense Industrial Technology Development Program (JCKY2019413D002), National Natural Science Foundation of China (No. 61976133).

    The authors declare that they have no conflict of interest.



    [1] X. B. Chen, Y. L. Ma, X. Y. Liu, W. Z. Kong, X. G. Xi, Analysis of corticomuscular connectivity during walking using vine copula, Math. Biosci. Eng., 18 (2021), 4341-4357. doi: 10.3934/mbe.2021218
    [2] A. S. Huhn, R. K. Brooner, M. M. Sweeney, S. W. Yip, H. Ayaz, K. E. Dunn, Increased neural activity in the right dorsolateral prefrontal cortex during a risky decision-making task is associated with cocaine use in methadone-maintained patients, Drug. Alcohol. Depend., 205 (2019), 107650. doi: 10.1016/j.drugalcdep.2019.107650
    [3] R. Madanu, F. Rahman, M. F. Abbod, S. Z. Fan, J. S. Shieh, Depth of anesthesia prediction via EEG signals using convolutional neural network and ensemble empirical mode decomposition, Math. Biosci. Eng., 18 (2021), 5047-5068. doi: 10.3934/mbe.2021257
    [4] M. Ferrari, V. Quaresima, A brief review on the history of human functional near infrared spectroscopy (fNIRS) development and fields of application, Neuroimage, 63 (2021), 921-935.
    [5] A. M. Kroczek, F. B. Haeussinger, A. J. Fallgatter, A. Batra, A. C. Ehlis, Prefrontal functional connectivity measured with near-infrared spectroscopy during smoking cue exposure, Addict. Biol., 22 (2015), 513-522.
    [6] T. Liu, X. C. Liu, L. Yi, C. Z. Zhu, P. S. Markey, M. Pelowski, Assessing autism at its social and developmental roots, A review of Autism Spectrum Disorder studies using functional near-infrared spectroscopy, Neuroimage, 185 (2017), 955-967.
    [7] L. Huang, S. Y. Guo, Y. Wang, S. Wang, Q. B. Chu, L. Li, et al., Attention based residual network for medicinal fungi near infrared spectroscopy analysis, Math. Biosci. Eng., 16 (2019), 3003-3017. doi: 10.3934/mbe.2019149
    [8] F. Al-Shargie, T. B. Tang, M. Kiguchi, Assessment of mental stress effects on prefrontal cortical activities using canonical correlation analysis: an fNIRS-EEG study, Biomed. Opt. Express., 8 (2017), 2583-2598. doi: 10.1364/BOE.8.002583
    [9] B. Abibullaev, J. An, S. H. Lee, J. I. Moon, Design and evaluation of action observation and motor imagery based BCIs using near-infrared spectroscopy, Measurement, 98 (2017), 250-261. doi: 10.1016/j.measurement.2016.12.001
    [10] A. Kassab, J. L. Lan, J. Tremblay, P. Vannasing, M. Dehbozorgi, P. Pouliot, et al., Multichannel wearable fNIRS-EEG system for long-term clinical monitoring, Hum. Brain. Mapp., 39 (2018), 7-23. doi: 10.1002/hbm.23849
    [11] A. Lühmann, J. Addesa, S. Chandra, A. Das, M. Hayashibe, A. Dutta, Neural interfacing non-invasive brain stimulation with NIRS-EEG joint imaging for closed-loop control of neuroenergetics in ischemic stroke, IEEE Embc., (2017), 349-353.
    [12] T. Zhang, L. Guo, K. M. Li, C. F. Jing, Y. Yin, D. J. Zhu, et al, Predicting functional cortical ROIs via DTI-derived fiber shape models, Cerebral. Cortex., 22 (2012), 854-864. doi: 10.1093/cercor/bhr152
    [13] Y. F. Xia, H. L. Zhang, L. Xu, Z. F. Gao, H. Y. Zhang, H. F. Liu, et al., An automatic cardiac arrhythmia classification system with wearable electrocardiogram, IEEE. Access., 6 (2018), 16529-16538. doi: 10.1109/ACCESS.2018.2807700
    [14] F. Wallois, M. Mahmoudzadeh, A. Patil, R. Grebe, Usefulness of simultaneous EEG-NIRS recording in language studies, Brain. Lang., 121 (2012), 110-23. doi: 10.1016/j.bandl.2011.03.010
    [15] M. Balconi, E. Grippa, M. E. Vanutelli, What hemodynamic (fNIRS), electrophysiological (EEG) and autonomic integrated measures can tell us about emotional processing, Brain. Cogn., 95 (2015), 67-76. doi: 10.1016/j.bandc.2015.02.001
    [16] S. Fazli, J. Mehnert, J. Steinbrink, G. Curio, A. Villringer, K. R. Müller, et al., Enhanced preformance by a hybrid NIRS-EEG brain computer interface, Neuroimage, 59 (2012), 591-529.
    [17] Y. Tomita, F. B. Vialatte, G. Dreyfus, Y. Mitsukura, H. Bakardjian, A. Cichocki, Bimodal BCI using simultaneously NIRS and EEG, IEEE. Trans. Biomed. Eng., 61 (2014), 1274-1284. doi: 10.1109/TBME.2014.2300492
    [18] P. Pouliot, T. P. Y. Tran, V. Birca, P. Vannasing, J. Tremblay, M. Lassonde, et al., Hemodynamic changes during posterior epilepsies: an EEG-fNIRS study, Epilepsy. Res., 108 (2014), 883-890. doi: 10.1016/j.eplepsyres.2014.03.007
    [19] U. Jindal, M. Sood, A. Dutta, S. R. Chowdhury, Development of point of care testing device for neurovascular coupling from simultaneous recording of EEG and NIRS during anodal transcranial direct current stimulation, IEEE. J. Transl. Eng. He., 3 (2015), 1-12.
    [20] M. J. Khan, M. J. Y. Hong, K. S. Hong, Decoding of four movement directions using hybrid NIRS-EEG brain-computer interface. Front. Hum. Neurosci., 8 (2014), 412-432.
    [21] F. Putze, S. Hesslinger, C. Y. Tse, Y. Y. Huang, C. Herff, C. T. Guan, et al., Hybrid fNIRS-EEG based classification of auditory and visual perception processes, Neuroimage, 8 (2014), 19-29.
    [22] C. Zich, S. Debener, A. K. Thoene, L. C. Chen, C. Kranczioch, Simultaneous EEG-fNIRS reveals how age and feedback affect motor imagery signatures, Neurobiol. Aging., 49 (2017), 183-197. doi: 10.1016/j.neurobiolaging.2016.10.011
    [23] J. Safaie, R. Grebe, H. A. Moghaddam, F. Wallois, Toward a fully integrated wireless wearable EEG-NIRS bimodal acquisition system, J. Neural. Eng., 10 (2013), 637-637.
    [24] M. Sawan, M. T. Salam, J. L. Lan, A. Kassab, S. Gelinas, P. Vannasing, et al., Wireless recording systems: from noninvasive EEG-NIRS to invasive EEG devices, IEEE. T. Biomed. Circ. S., 7 (2013), 186-195. doi: 10.1109/TBCAS.2013.2255595
    [25] R. K. Almajidy, Y. Boudria, U. G. Hofmann, W. Besio, K, Mankodiya, A multimodal 2D brain computer interface, IEEE Embc., (2015), 1067-1070.
    [26] A. P. Buccino, H. O. Keles, A. Omurtag, Hybrid EEG-fNIRS asynchronous brain-computer interface for multiple motor tasks, PLoS One, 11 (2016), e0146610. doi: 10.1371/journal.pone.0146610
    [27] J. Shin, D. W. Kim, K. R. Muller, H. J. Hwang, Improvement of information transfer rates using a hybrid EEG-NIRS brain-computer interface with a short trial length: offline and pseudo-online analyses, Sensors, 18 (2018), 1827. doi: 10.3390/s18061827
    [28] D. Guo, G. M. Liu, Deep learning-based bimodal emotion recognition from facial expression and body posture, in Nanjing University of Posts and Telecommunications, 2019.
    [29] J. M. Kim, J. K. Choi, M. Choi, M. Ji, G. Hwang, S. B. Ko, Assessment of cerebral autoregulation using continuous-wave near-infrared spectroscopy during squat-stand maneuvers in subjects with symptoms of orthostatic intolerance, Sci. Rep., 8 (2018), 13257. doi: 10.1038/s41598-018-31685-y
    [30] J. Shin, J. Kwon, J. Choi, C. H. Im, Performance enhancement of a brain-computer interface using high-density multi-distance NIRS, Sci. Rep., 7 (2017), 16545. doi: 10.1038/s41598-017-16639-0
    [31] H. F. Song, W. W. Yang, S. S. Dai, L. Du, Y. C. Sun, Using dual-channel CNN to classify hyperspectral image based on spatial-spectral information, Math. Biosci. Eng., 17 (2020), 3450-3477. doi: 10.3934/mbe.2020195
    [32] C. L. Liu, P. Cui, T. Huang, Identification of cell cycle-regulated genes by convolutional neural network, Comb. Chem. High. Throughput. Screen., 20 (2017), 603-611.
    [33] S. Raghu, N. Sriraam, Y. Temel, S. V. Rao, P. L. Kubben, EEG based multi-class seizure type classification using convolutional neural network and transfer learning, Neural. Netw., 124 (2020), 202-212. doi: 10.1016/j.neunet.2020.01.017
    [34] J. Y. Li, Q. Z. Xu, M. X. Wu, T. Huang, Y. D. Wang, Pan-cancer cassification based on self-normalizing neural networks and feature selection, Front. Bioeng. Biotechnol., 8 (2020), 766. doi: 10.3389/fbioe.2020.00766
    [35] B. A. Mohammed, M. S. Al-Ani, An efficient approach to diagnose brain tumors through deep CNN, Math. Biosci. Eng., 18 (2020), 851-867.
    [36] L. Chen, X. Y, Pan, Y. H. Zhang, M. Liu, T. Huang, Y. D. Cai, Classification of widely and rarely expressed genes with recurrent neural network, Comput. Struct. Biotechnol. J., 17 (2019), 49-60. doi: 10.1016/j.csbj.2018.12.002
    [37] P. Croce, F. Zappasodi, L. Marzetti, A. Merla, V. Pizzella, A. M. Chiarelli, Deep convolutional neural networks for feature-less automatic classification of independent components in multi-channel electrophysiological brain recordings, IEEE. Trans. Biomed. Eng., 66 (2019), 2372-2380. doi: 10.1109/TBME.2018.2889512
    [38] X. Y. Pan, L. Chen, I. Liu, Z. B. Niu, T. Huang, Y. D. Cai, Identifying protein subcellular locations with embeddings-based node2loc, IEEE. ACM. T. Comput. Bi., 2021.
    [39] A. Das, D. Guhathakurta, R. Sengupta, A. Dutta, EEG-NIRS joint-image based assessment of neurovascular coupling in stroke, Int. J. Stroke., 11 (2016), 271-272.
    [40] S. F. Xing, Y. X. Zhang, H. L. Wang, J. Y. Tu, Study on the criteria of fine needle aspiration cytology for the diagnosis and classification of malignant lymphomas, Chin. J. Pathol., 16 (1987), 174.
    [41] J. Shin, A. von Luhmann, D. W. Kim, J. Mehnert, H. J. Hwang, K. R. Muller, Data descriptor: simultaneous acquisition of EEG and NIRS during cognitive tasks for an open access dataset, Sci. Data, 5 (2018), 180003. doi: 10.1038/sdata.2018.3
  • This article has been cited by:

    1. Wei Liu, Kebin Jia, Zhuozheng Wang, Zhuo Ma, A Depression Prediction Algorithm Based on Spatiotemporal Feature of EEG Signal, 2022, 12, 2076-3425, 630, 10.3390/brainsci12050630
    2. Yun-Hsuan Chen, Jie Yang, Hemmings Wu, Kevin T. Beier, Mohamad Sawan, Challenges and future trends in wearable closed-loop neuromodulation to efficiently treat methamphetamine addiction, 2023, 14, 1664-0640, 10.3389/fpsyt.2023.1085036
    3. Nasimeh Marvi, Javad Haddadnia, Mohammad Reza Fayyazi Bordbar, An automated drug dependence detection system based on EEG, 2023, 158, 00104825, 106853, 10.1016/j.compbiomed.2023.106853
    4. Yuxing Zhou, Xuelin Gu, Zhen Wang, Xiaoou Li, Identification of drug use degree by integrating multi-modal features with dual-input deep learning method, 2024, 1025-5842, 1, 10.1080/10255842.2024.2417206
    5. Nasimeh Marvi, Javad Haddadnia, Mohammad Reza Fayyazi Bordbar, Modeling functional brain connections in methamphetamine and opioid abusers, 2024, 21, 25900935, 100287, 10.1016/j.medntd.2024.100287
    6. Kui Wang, Digital mapping of coastal landscapes integrating ocean-environment relationships and machine learning, 2024, 17, 1865-0473, 4639, 10.1007/s12145-024-01386-4
    7. Wei Liu, Kebin Jia, Zhuozheng Wang, Graph-based EEG approach for depression prediction: integrating time-frequency complexity and spatial topology, 2024, 18, 1662-453X, 10.3389/fnins.2024.1367212
    8. Xiaowen Zhang, Xuelin Gu, Xiaoou Li, 2024, A Study on the Assessment of Drug Addiction Level of Bimodal Eeg-Nirs Based on Tsnet, 9798400717666, 258, 10.1145/3674658.3674698
  • Reader Comments
  • © 2021 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(3228) PDF downloads(178) Cited by(8)

Figures and Tables

Figures(11)  /  Tables(2)

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog