Research article

Reinforced MCTS for non-intrusive online load identification based on cognitive green computing in smart grid


  • Cognitive green computing (CGC) is widely used in the Internet of Things (IoT) for the smart city. As the power system of the smart city, the smart grid has benefited from CGC, which enables dynamic regulation of electric energy and optimization of resource integration. However, improving the identification accuracy and the performance of the load model in the smart grid remains challenging. In this paper, we present a novel algorithm framework based on reinforcement learning (RL) to improve the performance of non-intrusive load monitoring and identification (NILMI). In this model, a knowledge base of load power facilities (LPF-KB) architecture is designed to facilitate shared collection and storage of load data; a deep neural network (DNN) structure based on an attention mechanism is used to enhance representation learning of load features; and an RL-based Monte Carlo tree search (MCTS) method is used to construct an optimal strategy network and to realize online combined-load prediction without relying on prior knowledge. We use extensive experiments on real-world household-appliance datasets to evaluate the performance of our method. The experimental results show that our approach is remarkably effective in reducing the online load identification error rate. Our model is generic and can be widely applied in practical load monitoring, identification, and power prediction systems.

    Citation: Yanmei Jiang, Mingsheng Liu, Jianhua Li, Jingyi Zhang. Reinforced MCTS for non-intrusive online load identification based on cognitive green computing in smart grid[J]. Mathematical Biosciences and Engineering, 2022, 19(11): 11595-11627. doi: 10.3934/mbe.2022540




    Nowadays, as the Industrial Internet of Things (IIoT) races ahead in smart cities, various industries make full use of CGC to gain more economic value and greater profits [1,2,3]. Meanwhile, green computing advocates a minimal carbon footprint and energy optimization for the long-term sustainable development of power enterprises. As the driving force of the IIoT system in the smart city, the energy industry utilizes CGC to improve the utilization of limited electrical energy resources and alleviate the energy crisis. As the core part of the IIoT system, the smart grid utilizes NILMI technology to deeply mine the power consumption information of various electrical equipment and to provide efficient technical support for optimizing the net side and intelligently managing the load side [4,5,6]. Following the concept of CGC, NILMI is an efficient way to improve the system performance of the smart grid by monitoring and identifying large masses of electrical load. Meanwhile, with the rapid development of machine learning, using deep learning (DL) technology to address the bottleneck problems of NILMI has gradually become a hot topic, since it can improve the performance and stability of the NILMI model [7,8]. However, the accuracy and reliability of the NILMI model are still challenges that have hindered the development of CGC in the smart grid. Existing methods have low load-recognition precision on multi-source load sampled data, which leads to poor NILMI performance in the energy IoT system [9,10]. There are several reasons: the accuracy of online load identification is affected by incomplete load feature representation, and the stability of the model is poor under dynamic load consumption, especially in scenarios where multiple electrical devices are superimposed [11,12].
To this end, we construct an RL-based MCTS algorithm framework; this framework is a heuristic strategy-optimization approach that realizes the identification and prediction of load power consumption in online scenarios, thereby improving the performance of CGC in the smart grid.

    By contrast, the non-intrusive approach is more competitive in the CGC-based energy IoT system [13,14]. It relies only on the measured values of the load waveform (current, voltage, and power signals) as the critical information available for load identification. Further enhancement of the discriminability of the load features can improve the efficiency of decision optimization in the identification model [15,16]. Consequently, plenty of studies have been devoted to extracting discriminative load features from these load waveform signals; the majority of existing methodologies include statistical optimization methods, artificial neural network (ANN) methods, and DL algorithms [17,18,19].

    At present, DL-based methods are divided into supervised learning and unsupervised learning. Supervised learning uses a large training data set to obtain an optimal model that maps every input to the corresponding output [20,21,22]. Common approaches include k-nearest neighbors (kNN) [23,24], artificial neural networks (ANN) [25], and support vector machines (SVM) [26]. In [23], a hybrid structure based on an improved weighted kNN and overshoot multiples is employed to train the sampled data sets, and this method enhances the identification accuracy on small-scale load sampled data sets. A similar method is mentioned in other literature, where a combination of kNN and the similar time window (STW) algorithm is proposed, but this method still has a limited scope of application. In [27], the authors utilize an SVM model with Dempster-Shafer evidence theory for load identification, but the influence of computational complexity on model performance is not considered. In [28], the authors exploit an improved ANN algorithm to design a parallel computing model and optimize the performance of NILMI; the results show that the method can reduce the heavy computation in the initial stage of model training.

    Considering the advantages of convolutional neural networks (CNNs) for image recognition, plenty of studies have shown that CNN models are advantageous in load identification and energy prediction [29,30], as are the various recurrent neural network (RNN) [31,32] algorithms. However, supervised learning methods still have the following problems: 1) a large number of annotated data sets are needed to train the model, which leads to a computational burden; 2) there is a risk of over-reliance on prior knowledge, which also makes it easier for the final model's performance to be limited; 3) sample imbalance results in poor identification and over-fitting; 4) there are local-optimization problems in online load identification.

    By comparison, unsupervised learning is not limited to modeling with prior knowledge but models directly on unlabelled data sets. The most commonly used algorithms include clustering analysis (CA) [33,34] and hidden Markov model (HMM) [35] algorithms. These methods are attractive for training general-purpose systems on small data sets and do not require annotation; they take advantage of the deployment capabilities of the environment to simplify the operation process, which is a very effective way of improving the performance of CGC. In [36], a modified fuzzy clustering algorithm is employed to achieve load monitoring in NILMI, and the simulation results show that the method demonstrates excellent monitoring performance. In [37], a Markov-based model is constructed to identify household energy consumption; the method minimizes the computational cost to improve algorithm performance. However, unsupervised learning methods still have limitations: when the load feature space is large, recognition and prediction performance degrades; the model performance is unstable, and the better the clustering effect, the lower the identification accuracy, especially for complex combined load signals; and the limited capacity for feature learning restricts improvements in load recognition accuracy.

    To address the problems mentioned above, we present a novel RL-based algorithm framework, namely MCTS-CI. MCTS-CI aims to improve the stability and performance of the combined-load online identification model in a self-learning way. This framework is an online optimization algorithm that can realize independent learning by using various related strategies. In the MCTS-CI model, we design a fusion DNN model to extract load features and utilize a time-sequence-based recurrent neural network to complete the spatio-temporal representation of the load in the frequency domain, which enhances the representation learning ability for the online combined load. To improve the accuracy of online combined-load identification in a self-learning way, we employ MCTS to build an optimized strategy network. An attention mechanism is introduced into the DNN structure, namely ATM-DNN, as the initial policy network of MCTS to replace random selection, so that the network focuses only on the specific load features and ignores irrelevant feature information.

    The highlight contributions of our research include the following aspects:

    ● Discriminative load feature learning. We build the ATM-DNN model to enhance the learning of discriminative load features, focusing mainly on the specific information.

    ● Comprehensive load-shared knowledge base. We design the knowledge base of load power facilities (LPF-KB) to facilitate load collection and feature storage, which functions as an auxiliary means to help form the load feature mapping.

    ● Optimal decision mechanism. An MCTS model is proposed to establish a strategy network and to obtain the optimal decision through self-learning, improving the accuracy of multi-load identification.

    The rest of the paper is arranged as follows: the related work is presented in Section II. The algorithm framework of MCTS-CI is introduced in Section III. In Section IV, the experimental details, the performance evaluation criteria, and a discussion of the results compared with other methods are described. The conclusion is given in Section V.

    This section mainly reviews related studies of online load identification and then presents applications of RL-based MCTS.

    NILMI is a very effective method for improving the efficiency of load-side management in the smart grid. In particular, applications of DL algorithms have brought more competitive results to NILMI.

    In [38], a Hilbert transform long short-term memory (HT-LSTM) model is used to improve the accuracy of load identification and to reduce the fluctuations of the transient signal in the identification results. However, this method is only suitable for a small load sample space, which weakens the generalization of the model. To this end, a model based on data augmentation is presented to produce synthetic load data for the target device, aimed at the status switching of the appliance [39]; the results prove that using synthetic data can enhance the generalization performance of the model. Some studies concentrate on learning high-quality load features, which determine the accuracy of load identification. In [31], a two-layer DL model based on the attention mechanism is employed to address the matter of focus in load feature learning; the results indicate that attention mechanisms are advantageous for enhancing model robustness, which ensures the stability of the model in load-combination scenarios. Multi-load combination identification and power forecasting are more challenging than single-class load identification. In [40], an improved hidden Markov model based on the Viterbi algorithm (VA) is used to achieve multi-load identification, and the method can effectively preserve the temporal correlation of the electrical load.

    Recently, the relevant literature shows that online load power prediction has gradually become a challenging research topic. In [41], the authors utilize a hybrid algorithm architecture of Fisher information theory and online support vector regression (OSVR) to improve load prediction accuracy. However, it is unrealistic to accurately capture dynamic load features in the online operating mode using a model trained on offline load data sets. In [42], the authors use a hidden Markov model to present an adaptive online learning method that updates the model parameters recursively; the optimal parameters are applied to load energy consumption prediction, and the experimental results show that the method can improve the reliability of load prediction. Likewise formulated as a Markov decision process, another article proposes a deep RL framework to realize energy fine-tuning and forecasting [43]. Experiments prove that the method can effectively overcome the uncertainty of load energy forecasting and optimize the design of continuous-control model parameters in high-dimensional variable spaces. As an extension of the Markov decision process, RL is gradually being used for online load identification and power consumption prediction, improving the efficiency of energy management [44,45,46].

    Commonly, the RL model is used to describe and solve the problem of maximizing returns in the interaction between an agent and the environment. RL is also applied to balance exploration and exploitation in online learning of load features [47,48,49]. Compared to supervised and unsupervised learning, RL does not rely on a pre-trained model; instead, it performs sequential decision-making that requires a continuous selection of actions and then achieves the maximum benefit of completing these actions as the best outcome. In this process, learning strategies are built, and the parameters are then updated [50]. RL has been widely used in various fields and has produced remarkable results for NILMI.

    As a heuristic search algorithm for decision-making processes, MCTS has been used as an assessment strategy for RL. For instance, AlphaGo is the classic application of MCTS [51]. In recent years, much literature has shown that MCTS can make optimal decisions when dealing with dynamically changing online data. In [58], the authors utilize a model-based RL method to realize online scheduling of a residential microgrid without a preset prediction model, and MCTS is then used to find the optimal strategy network. In [59], MCTS is employed to reduce the high dependence of hierarchical task network (HTN) planning on the prior model; in that work, MCTS helps HTN choose the optimal decomposition approach. Although MCTS has made excellent contributions in many fields, its application is limited in NILMI, especially in online combined-load scenes. In this paper, a novel RL-based MCTS algorithm framework is proposed to realize online combined-load identification and prediction.

    In this section, we introduce the algorithm framework of the MCTS-CI model, which aims to realize online combined-load identification and prediction. The schematic diagram of the overall design of the proposed model is shown in Figure 1. Our design inspiration comes from AlphaGo Zero [51], which has a self-directed learning ability, i.e., it improves its learning ability by playing against itself. We introduce the actual process of NILMI, the design principle of the knowledge base of load power facilities, the detailed structure of ATM-DNN, and the load recognition mechanism based on MCTS-CI.

    Figure 1.  The overall design structure of the proposed model. The model structure consists of four main parts: ① non-invasive load data monitoring; ② offline load data processing and building the knowledge base of load power facilities; ③ ATM-DNN model for forming the spatio-temporal features of load and achieving the initial policy of MCTS; ④ structuring the MCTS-CI model to realize the online load combination identification.

    In the smart grid IoT system, by measuring and analyzing the real-time power consumption of electrical equipment, NILMI can realize the actual prediction of household load consumption in the short and medium term [52,53]. In current NILMI studies, load signals are divided into transient and steady-state signals [54]. However, the load transient signal has some limitations: the short load monitoring time results in incomplete acquisition of the load characteristic information; more expensive monitoring equipment is needed to maintain load monitoring accuracy; and the accuracy is unsatisfactory in online load-combination identification scenarios. Therefore, in this paper, we monitor and collect the steady-state power signals of the electrical load for online load identification.

    In general, when the supply voltage meets the national standard, the steady-state current of the electrical equipment follows a certain statistical law [55]. We suppose the steady-state load power also satisfies the load additivity criterion; the combined load power at time $t_l$ is described as the sum of the load power of the individual electrical equipment:

    $$P_z(X_l, t_l) = P_1(X_l^{s_i}, t_l) + P_2(X_l^{s_i}, t_l) + P_3(X_l^{s_i}, t_l) + \cdots + P_n(X_l^{s_i}, t_l) + \sigma(t_l), \tag{3.1}$$

    where $P_z(X_l, t_l) \in \mathbb{R}$ ($\mathbb{R}$ is the set of reals) represents the total power of the load at time $t_l$; $X_l$ represents the status of the electrical equipment, with $s_i \in \{0, 1, 2, 3, \ldots, M\}$, where $s_i = 0$ indicates that the device has stopped and $s_i = 1$ indicates that the device is running in status $X_l$; $P_n(X_l^{s_i}, t_l)$ indicates that household appliance $n$ is running in state $X_l^{s_i}$ at time $t_l$, where $n \le N$ and $N$ is the number of household appliances; $\sigma(t_l)$ represents the measurement error and noise. The expression for the load power is

    $$P(X_l, t_l) = \sum_{r=0}^{N} P_r = \sum_{r=0}^{N} V_r I_r \cos(\phi_r), \tag{3.2}$$

    where $V_r$ and $I_r$ represent the magnitudes of the voltage and current, $\phi_r$ is the phase angle, and $r$ indexes the harmonic components (with $r = 1$ the first harmonic). Figure 2 shows the load power waveforms of different household appliances and the combined load power waveform over a day in a real house.

    Figure 2.  Load power waveform of the different household appliances in a day from a real house.
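The additivity assumption of Eqs (3.1)-(3.2) can be sketched numerically. The helper names and the appliance values below are illustrative assumptions, not measurements from the paper's datasets:

```python
import numpy as np

def combined_load_power(appliance_powers, noise_sigma=0.0, rng=None):
    """Total load power under the additivity assumption of Eq (3.1):
    the aggregate power is the sum of per-appliance powers plus a
    measurement-noise term sigma(t_l)."""
    rng = rng or np.random.default_rng(0)
    noise = rng.normal(0.0, noise_sigma) if noise_sigma > 0 else 0.0
    return sum(appliance_powers) + noise

def active_power(v_rms, i_rms, phase_angles):
    """Active power of one appliance as in Eq (3.2):
    P = sum_r V_r * I_r * cos(phi_r) over harmonic components."""
    return float(np.sum(np.asarray(v_rms) * np.asarray(i_rms)
                        * np.cos(np.asarray(phase_angles))))

# Example: two harmonics of one appliance, then a 3-appliance aggregate.
p_kettle = active_power([230.0, 5.0], [8.0, 0.3], [0.05, 0.4])
total = combined_load_power([p_kettle, 150.0, 60.0])
```

With `noise_sigma=0` the aggregate reduces to the exact sum, matching Eq (3.1) without the $\sigma(t_l)$ term.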

    In this paper, we first introduce the knowledge base of load power facilities (LPF-KB), which is used to realize the collection and storage of the load sampled data sets. As an important part of the load identification model, LPF-KB is formed from the time-domain waveform of the load and the distribution of frequency-domain image features at different times from the various household appliances. LPF-KB is the offline data set of our model and mainly realizes data sharing and critical load feature management based on time series. The detailed structural design of LPF-KB is shown in Figure 3. In our paper, considering that the load types of household electrical equipment and electricity usage are relatively stable, the steady-state load power signals are selected as the load signature (LS) to represent the load characteristics. We collect the power signals of the electricity load in real time and utilize dynamic time-frequency conversion technology to obtain the frequency-domain images of the load power signals.

    Figure 3.  Detailed structural design of LPF-KB.

    In the LPF-KB structure, we suppose a known household appliance database $F(t) = [F_1, F_2, \ldots, F_i]^T$, with each $F_i = (f_{i,1}, f_{i,2}, \ldots, f_{i,N})$, where $F(t)$ represents the knowledge base of load power facilities at time $t$, $f_{i,N}$ represents the $N$-th operating state of the $i$-th load power facility at time $t$, and $N$ is the number of facility operating states.
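The record layout above can be sketched as a minimal in-memory structure; the class names and the timestamp-keyed store are illustrative assumptions, not the paper's actual LPF-KB implementation:

```python
from dataclasses import dataclass, field

# F(t) = [F_1, ..., F_i]^T, where each F_i = (f_{i,1}, ..., f_{i,N})
# holds the operating-state features of the i-th appliance at time t.

@dataclass
class FacilityRecord:
    name: str                                    # appliance identifier
    states: list = field(default_factory=list)   # f_{i,1..N}: per-state features

class LPFKB:
    """Minimal knowledge base keyed by timestamp t."""
    def __init__(self):
        self._store = {}

    def add(self, t, record: FacilityRecord):
        self._store.setdefault(t, []).append(record)

    def facilities_at(self, t):
        """Return F(t), the list of facility records known at time t."""
        return self._store.get(t, [])

kb = LPFKB()
kb.add(0, FacilityRecord("fridge", states=[0, 1]))
kb.add(0, FacilityRecord("kettle", states=[1]))
```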

    To improve the effectiveness of the load sampled data and to reduce the parameter complexity of the neural network input, we use the convolutional filtering (CF) method [56]. CF can enhance the load signal by eliminating specific spatial frequencies. It is divided into low-pass, band-pass, and high-pass filtering according to the enhancement type (low, medium, and high frequency). The CF method can efficiently filter dynamic load signals in different frequency bands simultaneously, and its filtering results are uncompressed in each frequency band, which avoids the error caused by frequency segmentation [57].

    Mathematically, the CF method is defined as

    $$y_m = \sum_{\lambda=-n}^{n} P_\lambda x_{m-\lambda}, \tag{3.3}$$

    where $2n+1$ is the length of the CF weight-coefficient window and $P_\lambda$ is the CF weight coefficient. The ideal frequency response function of the band-pass filter is as follows:

    $$H(f) = \begin{cases} 1, & f_1 \le f \le f_2 \\ 0, & \text{otherwise}, \end{cases} \tag{3.4}$$

    The CF weight coefficients are described as

    $$p_\lambda = \frac{\sin(\pi \beta \lambda)}{\pi \lambda} - \frac{\sin(\pi \alpha \lambda)}{\pi \lambda}, \tag{3.5}$$
    $$p_0 = \beta - \alpha, \tag{3.6}$$

    where $\beta = 2f_2/S_F$, $\alpha = 2f_1/S_F$, and $S_F$ is the sampling frequency. From the above formulas, we can see that frequency truncation errors can be avoided when $m$ sampling points exist on both sides of $x_m$. The value of $m$ affects the accuracy and the speed of filtering. From Eq (3.5), the weights are inversely proportional to $\lambda$, which shows that weights far from the center point are negligible.
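A minimal sketch of the band-pass CF of Eqs (3.3)-(3.6), assuming the coefficient form reconstructed above; the cut-off frequencies, sampling rate, and test signal are illustrative:

```python
import numpy as np

def cf_bandpass_weights(f1, f2, sf, n):
    """Band-pass CF weights from Eqs (3.5)-(3.6):
    p_lambda = [sin(pi*beta*lambda) - sin(pi*alpha*lambda)] / (pi*lambda),
    p_0 = beta - alpha, with beta = 2*f2/SF and alpha = 2*f1/SF."""
    alpha, beta = 2.0 * f1 / sf, 2.0 * f2 / sf
    lam = np.arange(-n, n + 1, dtype=float)
    p = np.empty_like(lam)
    nz = lam != 0
    p[nz] = (np.sin(np.pi * beta * lam[nz])
             - np.sin(np.pi * alpha * lam[nz])) / (np.pi * lam[nz])
    p[~nz] = beta - alpha
    return p

def cf_filter(x, p):
    """y_m = sum_lambda p_lambda * x_{m-lambda}, i.e. Eq (3.3) as a convolution."""
    return np.convolve(x, p, mode="same")

# Keep a 5 Hz component, suppress a 40 Hz one (SF = 200 Hz, passband 2-10 Hz).
t = np.arange(0, 2, 1 / 200.0)
x = np.sin(2 * np.pi * 5 * t) + np.sin(2 * np.pi * 40 * t)
y = cf_filter(x, cf_bandpass_weights(2.0, 10.0, 200.0, 50))
```

The in-band 5 Hz component passes with near-unit gain, while the 40 Hz component is strongly attenuated, consistent with the ideal response of Eq (3.4).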

    To improve the accuracy of online combined-load identification, we propose a novel RL-based algorithm architecture, namely MCTS-CI. MCTS-CI includes a DNN structure based on an attention mechanism, namely ATM-DNN, and a self-learning module based on MCTS. MCTS-CI utilizes the ATM-DNN model to capture the combined-load feature information in the online scenario, focusing on the specific features in the combined load while ignoring the unimportant ones. ATM-DNN realizes fine-grained feature extraction in the load frequency domain and enhances the discrimination of the combined loads. After this operation, the RL agent can acquire the best strategies based on the experience gathered during the different training stages. Finally, the model generates actions $L_{a_t}$ and adjusts its parameters $\{L_{s_t}, L_{a_t}, \varphi\}$ based on the feedback from the environment. The process of the agent interacting with the environment can be seen in Figure 4. The greatest advantage of MCTS-CI is that it can help RL agents understand more complex environments and map their states to actions [60,61].

    Figure 4.  The flowchart of the agent interacts with the environment in RL-MCTS.
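The agent-environment loop of Figure 4 can be illustrated with a toy sketch. The environment, its reward, and the one-step tabular update below are illustrative assumptions for exposition only, not the paper's actual NILMI environment or learning rule:

```python
import random

random.seed(1)

class ToyLoadEnv:
    """Toy environment: reward 1 when the agent's action matches the
    current (observable) load state; the next state is drawn at random."""
    def __init__(self, n_states=3, seed=0):
        self.rng = random.Random(seed)
        self.n_states = n_states
        self.state = self.rng.randrange(n_states)

    def step(self, action):
        reward = 1.0 if action == self.state else 0.0
        self.state = self.rng.randrange(self.n_states)
        return self.state, reward

env = ToyLoadEnv()
q = {}                                  # tabular Q(s, a)
state, eps, lr = env.state, 0.2, 0.5
for _ in range(2000):
    # epsilon-greedy action selection
    if random.random() < eps:
        action = random.randrange(env.n_states)
    else:
        action = max(range(env.n_states), key=lambda a: q.get((state, a), 0.0))
    next_state, reward = env.step(action)
    # one-step update toward the observed reward (a bandit-style
    # simplification; a full Q-update would add gamma * max_a Q(s', a))
    old = q.get((state, action), 0.0)
    q[(state, action)] = old + lr * (reward - old)
    state = next_state
```

After training, the greedy policy reproduces the state-matching behavior, i.e., the agent has mapped states to actions purely from reward feedback.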

    In this paper, we propose the MCTS-CI model $M_s$ with initial strategy $\pi_s$, and $S_P$ is the full sequence of the current state $L_{s_t}$: $\{L_{s_t}, L_{a_t}^k, L_{r_{t+1}}^k, L_{s_{t+1}}^k, L_{a_{t+1}}^k, \ldots, L_{s_T}^k\}_{k=1}^{K} \sim M_s, \pi_s$, where $L_{a_t}^k$ represents the current action, $L_{r_{t+1}}^k$ represents the reward, and $K$ is the number of iterations. First, MCTS is constructed based on the sampling results, and the action value function $Q(L_{s_t}, a_s)$ is then computed, where $a_s \in A$ and $A$ is the action set. The action value function and its corresponding action are calculated as follows:

    $$Q(L_{s_t}, a_s) = \frac{1}{N(L_{s_t}, a_s)} \sum_{k=1}^{K} \sum_{u=t}^{T} \mathbb{1}(S_u^k = L_{s_t}, A_u^k = a_s)\, G_u, \tag{3.7}$$
    $$L_{a_t} = \arg\max_{a_s} Q(L_{s_t}, a_s), \tag{3.8}$$

    where $Q(L_{s_t}, a_s)$ is the state-action value, $G_u$ represents the gain over all complete sequences $S_P$, and $\gamma$ represents the attenuation (discount) factor. $G_u$ is described as

    $$G_u = L_{r_{u+1}} + \gamma L_{r_{u+2}} + \gamma^2 L_{r_{u+3}} + \cdots + \gamma^{T-u-1} L_{r_T}. \tag{3.9}$$
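The discounted return of Eq (3.9) can be computed with a right-to-left accumulation; the function names and the sample reward sequence are illustrative:

```python
def discounted_return(rewards, gamma=0.9):
    """G_u = r_{u+1} + gamma*r_{u+2} + gamma^2*r_{u+3} + ... per Eq (3.9),
    folded right-to-left so each reward is discounted once."""
    g = 0.0
    for r in reversed(rewards):
        g = r + gamma * g
    return g

def all_returns(rewards, gamma=0.9):
    """Return G_u for every position u in one episode (index u = 0..T-1)."""
    out, g = [], 0.0
    for r in reversed(rewards):
        g = r + gamma * g
        out.append(g)
    return out[::-1]
```

These per-position returns are exactly the $G_u$ values averaged in Eq (3.7) to estimate $Q(L_{s_t}, a_s)$.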

    MCTS-CI is a model-independent method, and it prevents the dynamic-programming solution from becoming too complicated. MCTS-CI finds the best decision in a given domain by randomly sampling from the decision space and reconstructing a search tree according to the results. We employ the MCTS algorithm to realize the optimal search action for identifying the combination status of the online load [62]. The execution proceeds as follows: according to the maximum probability value $p_{t_s}$ and the reward $v_{t_s}$ of the current node, which serve as the judgment basis for the optimal action $a_s$, MCTS-CI determines the next action and state; the method is called at each search-action iteration until the time threshold or the preset number of cycles is reached; from the start, MCTS uses the initial parameters $\varphi_\lambda(p_{t_s}, v_{t_s})$ to perform the tree search, and when the search ends, the updated parameters $\varphi_\lambda(p_{t_s}, v_{t_s})$ of MCTS are output, where $z_{t_s}$ indicates the action $a_s$ with the highest probability. This process can be described as $MCTS_\varphi\{\varphi \mid (f_{t_s}, p_{t_s}, v_{t_s})\} \rightarrow MCTS_\varphi\{\varphi \mid (f_{t_s}, \pi_{t_s}, z_{t_s})\}$, and the MCTS algorithm is presented in Algorithm 2, which will be described later (§3.4).

    In ATM-DNN, the adjustment parameter $\lambda$ serves to minimize the difference between $z_{t_s}$ and $v_{t_s}$ and the difference between $p_{t_s}$ and $\pi_{t_s}$, where $p_{t_s}$ is the selection probability of ATM-DNN and $\pi_{t_s}$ is the search probability of MCTS for the next state. In RL-MCTS, the loss function $L\_loss_s$ includes the mean squared error, the cross-entropy loss, and the attention loss (§3.4). The cross-entropy loss is used as the load classification loss. The final loss function $L\_loss_s$ can be described as follows:

    $$L\_loss_s = (z_{t_s} - v_{t_s})^2 - \pi_{t_s}^T \log p_{t_s} + c\,\|\lambda\|^2 + \eta\, loss\_A. \tag{3.10}$$

    This expression includes the mean squared error, the cross-entropy loss [51], and the attention loss $loss\_A$ (§3.4). The parameter $c$ controls the extent of the L2 weight regularization, which can effectively prevent over-fitting, and $\eta$ is the weighting coefficient. In each iteration of MCTS, there are four steps to build a search tree: selection, expansion, simulation, and backtracking [62]. The overall design idea of MCTS is shown in Figure 5. In MCTS, each state node $L_s$ contains four parameters: $W_s$, $N_s$, $Q_s$, and $P_s$. $W_s$ is the sum of values of the next state; $N_s$ is the number of visits; $Q_s$ is the average value of the next state; $P_s$ indicates the prior probability of the selected action.
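The composite loss of Eq (3.10) can be sketched directly; the function name, default coefficients, and test vectors are illustrative assumptions:

```python
import numpy as np

def mcts_ci_loss(z, v, pi, p, lam, c=1e-4, eta=0.1, loss_a=0.0):
    """Composite loss of Eq (3.10): squared value error (z - v)^2,
    cross-entropy between MCTS search probabilities pi and network
    policy p, L2 regularization on parameters lam, plus the weighted
    attention loss eta * loss_A."""
    value_term = (z - v) ** 2
    # clip avoids log(0) for zero-probability entries
    policy_term = -float(np.dot(pi, np.log(np.clip(p, 1e-12, 1.0))))
    reg_term = c * float(np.sum(np.square(lam)))
    return value_term + policy_term + reg_term + eta * loss_a
```

When $p_{t_s} = \pi_{t_s}$ the cross-entropy term reduces to the entropy of $\pi_{t_s}$ (e.g., $\ln 2$ for a uniform two-action distribution), its minimum over $p_{t_s}$.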

    Step 1: selection. At time $t$, the frequency-domain load power features $F_{T_s} = \sum_{t=1}^{N} f_{t_s}$ are used as the current state of the root node $L_{S_{t_0}}$, and $L_{S_{t_0}} = \{W_{t_0}(f_{t_0}, a_s), N_{t_0}(f_{t_0}, a_s), P_{t_0}(f_{t_0}, a_s), Q_{t_0}(f_{t_0}, a_s)\}$, where $W_{t_0}(f_{t_0}, a_s)$ represents the total value of the next state, $N_{t_0}(f_{t_0}, a_s)$ represents the visit count, $P_{t_0}(f_{t_0}, a_s)$ represents the prior probability of the selected action $a_s$, and $Q_{t_0}(f_{t_0}, a_s)$ represents the mean value of the next state $L_{S_{t_0}}$. In our paper, the Upper Confidence Bound applied to Trees (UCT) is employed to select the node with the highest score until a node with unexpanded child nodes is reached. UCT is a function that can obtain evaluation values at different depths [63]. The UCT algorithm is a special Monte Carlo search algorithm; mathematically, UCT is defined as follows:

    $$\mathrm{UCT}(f_{t_s}, a_s) = \frac{Q(s_m)}{N(s_m)} + c\sqrt{\frac{\log N(s_0)}{N(s_m)}}, \tag{3.11}$$
    $$s_0(f_{t_s}, a_s) \rightarrow s_m, \tag{3.12}$$
    $$a_s = \arg\max_{a_s} \left[ Q(f_{t_s}, a_s) + U(f_{t_s}, a_s) \right], \tag{3.13}$$
    Figure 5.  The overall design framework of MCTS in our model.

    where $s_m$ is a child node of $s_0$; we assume that $M$ steps are required to reach the leaf node, and $m < M$. $Q(s_m)/N(s_m)$ indicates the estimated value of the child node $s_m$; $c$ is an exploration parameter (the theoretical value is $\sqrt{2}$).

    It is important to select the node with the maximum $Q + U$ in the sub-branch of $m$ as the next state, until all the nodes have been traversed or a leaf node is reached before the end of MCTS. The detailed algorithm is described in Algorithm 1.
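The UCT selection rule above can be sketched as follows. The node layout mirrors the text ($W_s$, $N_s$, $Q_s$, $P_s$); the `+1` guards against unvisited nodes and the sample tree is an illustrative assumption, not the paper's implementation:

```python
import math

class Node:
    """Search-tree node holding W (value sum), N (visit count),
    P (prior probability), with Q = W / N."""
    def __init__(self, prior=1.0):
        self.W, self.N, self.P = 0.0, 0, prior
        self.children = {}              # action -> Node

    @property
    def Q(self):
        return self.W / self.N if self.N else 0.0

def uct_select(node, c=math.sqrt(2)):
    """Pick the child maximizing Q + exploration bonus, in the spirit
    of Eq (3.11); +1 terms keep the bonus finite for unvisited children."""
    def score(child):
        u = c * math.sqrt(math.log(node.N + 1) / (child.N + 1))
        return child.Q + u
    return max(node.children.items(), key=lambda kv: score(kv[1]))

root = Node()
root.N = 10
root.children = {"a": Node(), "b": Node()}
root.children["a"].W, root.children["a"].N = 3.0, 5   # Q = 0.6, well explored
root.children["b"].W, root.children["b"].N = 1.0, 1   # Q = 1.0, barely explored
action, child = uct_select(root)
```

Here the rarely visited high-value child wins: its exploration bonus plus Q exceeds that of the frequently visited sibling.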

    Step 2: expansion. At the end of the selection phase, the node $s_m$ needs to be expanded, and its unexpanded action $a_s$ is determined by using the tree selection strategy. A new node $s_{m+1}$ is then created in the search tree. The flow diagram of the algorithm for steps 1 and 2 is given in Figure 6. The UCT aims to find the least-visited node to explore in each tree search.

    Figure 6.  Flow diagram of algorithm in selection and expansion.
    Algorithm 1: Detailed algorithm Tree_UCT
    Input: The load state LS_i; the root node RN_s0; the tree node RN_s; the reward re_v; the time threshold TN; the number of visited nodes N_s; the estimated value Q_s.
    Output: Tree_UCT
    1 // Create the Tree_UCT sequence LS_i and create the root node.
    2 Tree_UCT_sequence[LS_i] = 0;
    3 RN_s0 = LS_0;
    4 // Within the calculated time limit.
    5 for tn = 1 to TN do


    Step 3: simulation. Following the default policy, a simulation is run from the expanded position sm+1 as the new start node until the stop condition of the simulation is reached. The process yields an initial score for the new node sm+1, expressed as 1 or 0 (true or false).

    Step 4: back-propagation. The result of the simulation step is propagated back along the current iteration path to all selected nodes. When a leaf node obtains a new observed return value Vs and the parent node's visit count Ts through simulation, the UCT algorithm updates all internal node values on the search path. This process can be described as

    $$T_s=\sum_{i} T_{is},\quad (3.14)$$
    $$V_s=\sum_{i} V_{is}\frac{T_{is}}{T_s},\quad (3.15)$$

    where Ts indicates the number of visits of the parent node and is the sum of the visits Tis of all child nodes. The observed return value Vs is the visit-weighted average of the observed return values Vis of all child nodes. The back-propagation starts from the leaf node and proceeds up the search path to the root node. Each iteration expands the search tree, and the depth of the tree grows with the number of iterations.
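    Step 4 and the aggregation of Eqs. (3.14) and (3.15) can be sketched as follows; this is a hedged illustration in which the node dictionaries and function names are ours, not the paper's code.

```python
def backpropagate(path, reward):
    """Step 4: propagate the simulation result from the leaf back to the root.
    Each node on the path keeps a visit count N and a total value W, so its
    mean value is Q = W / N."""
    for node in reversed(path):
        node["N"] += 1
        node["W"] += reward

def aggregate_parent(children):
    """Eqs. (3.14)-(3.15): the parent's visits are the sum of the child visits,
    and its value is the visit-weighted average of the child values."""
    t_s = sum(ch["N"] for ch in children)
    v_s = sum(ch["V"] * ch["N"] for ch in children) / t_s
    return t_s, v_s
```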

    In this paper, the strategy network model includes an internal policy and an external policy for building the MCTS-CI model. The internal strategy uses the UCT algorithm to optimize the performance of MCTS; the external one is a DNN model fused with an attention mechanism, namely ATM-DNN, which replaces the random strategy for sampling the state sequence when a new state is not in the tree. The ATM-DNN model can better focus on the important load features. The overall structure of the external policy model is shown in Figure 7.

    Figure 7.  The overall algorithm structure of ATM-DNN.

    From Figure 7, firstly, time-frequency conversion is used to obtain the frequency-domain image of the effective load power. Secondly, image segmentation based on the time series is applied to the load frequency-domain image, forming the load frame sequence. Each frame of the sequence is then fed into the DNN for load feature extraction. The attentional mechanism (ATM) is introduced to realize fine-grained discrimination of the salient regions of the load in the frequency-domain image. In this process, ATM-DNN serves as the load feature extractor that produces the load spatial feature map. In Figure 7, fi(w,h) represents the load spatial feature map, i indexes the ith feature cube, and w,h locate the specific eigenvalues. The sequence ai={a1,a2,a3,…} represents the attentional weights. The spatial feature of the load is denoted ui, and its expression is as follows:

    $$u_i=a_i f_i=\sum_{w,h} a_{i,w,h}\, f_{i,w,h}.\quad (3.16)$$

    Following earlier studies on image recognition with attentional mechanisms, the calculation of ai mathematically involves the following formulas:

    $$\mathrm{Sim}_{a_{i,w,h}}=V_{Ia}\tanh\big(W_l l_i+W_f f_{w,h,:}\big),\quad (3.17)$$

    where li denotes the state of the hidden layer of the RNN at time i, VIa is a projection vector, $W_f f_{w,h,:}$ is the product of the load spatial features and their weights, and $W_l l_i$ is the dynamic weight, which highlights the focusing capability on the load frame image.

    Then $\mathrm{Sim}_{a_{i,w,h}}$ is normalized as follows:

    $$a_i=\operatorname{softmax}_{w,h}(a_i)=\frac{e^{a_i}}{\sum_{j=1}^{Z} e^{a_j}},\quad (3.18)$$

    where Z is the number of feature maps; the softmax gives more weight to the important elements. The input and output of the RNN are as follows:

    $$Rx_i=W_z Z_{i-1}+W_{u1} u_{i-1},\quad (3.19)$$
    $$(o_i,l_i)=G_{\mathrm{rnn}}(Rx_i,l_{i-1}),\quad (3.20)$$
    $$O_i=\operatorname{softmax}(W_0 o_i+W_{u2} u_i).\quad (3.21)$$
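    The attention scoring and pooling of Eqs. (3.16)-(3.18) can be sketched with NumPy as follows. This is a sketch under our own shape assumptions; the parameter names v_a, W_l, and W_f mirror the symbols above but the shapes and function name are illustrative.

```python
import numpy as np

def attention_pool(feature_map, v_a, W_l, W_f, l_i):
    """Additive attention over a W x H spatial feature map, then weighted pooling.

    feature_map: (W, H, C) load spatial features f_i
    l_i:         (D,)      RNN hidden state at step i
    v_a, W_l, W_f: scoring parameters of Eq. (3.17) (assumed shapes (D',), (D', D), (D', C))
    """
    W, H, C = feature_map.shape
    flat = feature_map.reshape(-1, C)                    # (W*H, C)
    scores = np.tanh(flat @ W_f.T + l_i @ W_l.T) @ v_a   # Eq. (3.17), shape (W*H,)
    weights = np.exp(scores - scores.max())              # stable softmax, Eq. (3.18)
    weights /= weights.sum()
    u_i = (weights[:, None] * flat).sum(axis=0)          # Eq. (3.16), shape (C,)
    return u_i, weights.reshape(W, H)
```

    Subtracting the maximum score before exponentiating is only for numerical stability and does not change the softmax result.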

    The attention scoring criteria are used to measure the performance of the ATM-DNN model, and the attention loss function is described as

    $$\mathrm{loss}_A=\sum_{p}\sum_{m}\sum_{wh}\Big(1-\big\|x_{s_p,m,wh}-\operatorname{softmax}(y_{s_q})_{m,wh}\big\|^2\Big),\quad (3.22)$$

    where $x_{s_p}$ indicates the attention scores in the spatio-temporal dimension M×W×H; $y_{s_q}$ indicates the score of the qth dimension, with m∈(1,…,M) and wh∈(1,…,WH).

    The overall design of MCTS-CI algorithm for NILMI is presented in Algorithm 2.

    Algorithm 2: MCTS-CI Algorithm
    Input: Load power series [P1, P2, …, P(t−1)]; preprocessed data series [P′1, P′2, …, P′(t−1)];
    Time threshold Tleft; create the root node RS_node0 of the state Ls0; initial state Ls0
    Output: The RL-MCTS model RM.
    1 //Training the RL-MCTS model.
    2 //Construct the MCTS search from Ls0.
    3 if t ≤ Tleft then


    In this section, we present the specific experimental settings for our proposed model, including the acquisition and preprocessing of the experimental data, the experimental evaluation, and a discussion of the results. The purposes of our experiment are: 1) to verify the representational capacity of the ATM-DNN model; 2) to evaluate the performance of the MCTS-CI model for online combined load identification.

    The experimental datasets consist of two parts: offline datasets and online datasets. We use the offline datasets to train the external policy network ATM-DNN, and the online datasets to identify and predict the combined load. The experimental datasets mainly sample the high-frequency steady-state load power signals from six different houses. The data collection period is three years (2018/05/01-2020/04/30), and the sampling period of the load from household appliances is 20 ns. Each dataset covers four typical household appliances: refrigerator (REF), micro-wave oven (MWO), induction cooker (IC), and water heater (WH). The specific numbers of load samples are shown in Table 1.

    Table 1.  The number of load samples in the different houses.
    Load type No. 1 house No. 2 house No. 3 house No. 4 house No. 5 house No. 6 house
    REF 10^7 10^6 10^7 10^7 10^7 10^7
    MWO 10^6 10^6 10^7 10^5 10^7 10^6
    IC 10^5 10^5 10^7 10^5 10^5 10^5
    WH 10^7 10^7 10^7 10^6 10^7 10^6


    We select the most representative models as the baseline models to verify the stability and robustness of MCTS-CI. These models include the Factorial Hidden Markov model (FHMM) [65], KNN [66], CNN [67], and the bi-directional long short-term memory neural network (Bi-LSTM) [68].

    FHMM: This method is a multi-chain extension of the hidden Markov model. It is widely used as a statistical model for time-series and state-sequence problems. The method divides the state space into many layers, and each layer is equivalent to a complete hidden Markov model. At any given moment, the observation probability depends on the current state of all layers. The method can overcome overfitting during model training as the sampling space grows, as well as incomplete classification in a small sampling space.

    kNN: This method determines the class of a sample according to the categories of its nearest one or several neighbors. Improved kNN models generally add weight coefficients or use an average-distance k-nearest-neighbor algorithm to estimate the multi-local mean vectors of the specific patterns of each group. The kNN method is well suited to the combined load identification scenario.

    CNN: This method extracts local load features by connecting the input of each neuron to the local receptive field of the previous layer, which enables parallel network operation. By integrating the DNN architecture, the input load signals are segmented into features to achieve load event detection, classification, and sample regression in NILM.

    Bi-LSTM: This method is used to capture the bidirectional time dependence of the load signal. LSTM realizes a temporal memory function through its gate units and prevents the gradient from vanishing.

    To meet our experimental requirements, our experimental environment includes hardware and software. PyTorch (under Python 3.7) is used to design and implement the algorithms on a desktop PC with Windows 10, an NVIDIA GeForce RTX 3080 Ti GPU, 128 GB of RAM, and CUDA v10.2.

    The core of our proposed model includes the ATM-DNN model and the MCTS algorithm. In ATM-DNN, the specific parameters are set as follows: there are six blocks in the hidden layer and six filters in each block; the size and stride of the pooling layer are two; the activation function is ReLU; the dropout probability is set to 0.45; the mini-batch size is 512; Adam is the optimizer; the learning rate is set to 0.0006; the number of epochs is set to 300. In the MCTS model, the self-play process lasts for 20 h and performs 10^5 decision-making steps, with 750 simulation operations in each MCTS, as presented in the following subsection.
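    For reference, the reported hyperparameters can be collected in one configuration block; the key names are illustrative, not taken from the paper's code.

```python
# Hyperparameters as reported in the text (key names are our own).
ATM_DNN_CONFIG = {
    "hidden_blocks": 6,
    "filters_per_block": 6,
    "pool_size": 2,
    "pool_stride": 2,
    "activation": "relu",
    "dropout": 0.45,
    "batch_size": 512,
    "optimizer": "adam",
    "learning_rate": 6e-4,
    "epochs": 300,
}

MCTS_CONFIG = {
    "self_play_hours": 20,
    "decisions": 10 ** 5,
    "simulations_per_search": 750,
}
```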

    We select four representative household appliances for our experiment. Their work patterns range from simple to complex, and in particular the bidirectional dependence of the devices can be reflected. Relatively diverse working modes help verify the performance indicators of our model. We sample the experimental datasets from the smart meter installed at the power entrance of the house, which realizes the acquisition and practical storage of real-time data (current, voltage, and effective power) from the various household appliances. The high-frequency steady-state power signal of the load contains rich load characteristic information, which provides discriminative load features for combined load identification in various online scenarios.

    The CF-based signal enhancement method is used for data preprocessing. During load data sampling, signal noise is caused by equipment operation errors, overshoot, or abnormal peaks in the voltage or current signal. The preprocessing method effectively solves signal acquisition errors and instability; it corrects abnormal power values and ensures high signal quality.

    We have prepared many sampled datasets to validate the performance of our model; these datasets come from four houses (No. 1 house, No. 2 house, No. 3 house, and No. 4 house). We divide each dataset into three parts: a training set, a validation set, and a testing set. The first 70% of each dataset sequence is used for training the model, the subsequent 15% for validation, and the remaining 15% for testing. In the online experimental stage, we use the data collected in real time from No. 5 house and No. 6 house to further verify the competitiveness of the model's performance.
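    The chronological 70%/15%/15% split described above can be sketched as follows; the function name is ours, for illustration only.

```python
def chronological_split(series, train=0.70, val=0.15):
    """Split an ordered load series into train/validation/test parts
    (70% / 15% / 15% by default), preserving the time order."""
    n = len(series)
    i, j = int(n * train), int(n * (train + val))
    return series[:i], series[i:j], series[j:]
```

    Keeping the split chronological (rather than shuffling) avoids leaking future load samples into the training set.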

    We employ the most common evaluation criteria to verify the performance and stability of our model: Micro-F1, Macro-F1, Hamming loss (H-loss), and the mean absolute error (MAE). These criteria are widely used in the field of deep learning and are defined as follows:

    Micro-F1 and Macro-F1. Micro-F1 and Macro-F1 are two strategies for calculating the F1-score in multi-class classification. Let LTPti, LFPti, LTNti, and LFNti denote the true positives, false positives, true negatives, and false negatives of class i at time t. The precision Lpreti and recall Lrecti of class i can be described as

    $$Lpre_t^i=\frac{LTP_t^i}{LTP_t^i+LFP_t^i},\quad (4.1)$$
    $$Lrec_t^i=\frac{LTP_t^i}{LTP_t^i+LFN_t^i}.\quad (4.2)$$

    So, the total precision and recall of all categories are presented as follows:

    $$Lpre_{t,\mathrm{micro}}=\frac{\sum_{i=1}^{n} LTP_t^i}{\sum_{i=1}^{n} LTP_t^i+\sum_{i=1}^{n} LFP_t^i},\quad (4.3)$$
    $$Lrec_{t,\mathrm{micro}}=\frac{\sum_{i=1}^{n} LTP_t^i}{\sum_{i=1}^{n} LTP_t^i+\sum_{i=1}^{n} LFN_t^i},\quad (4.4)$$
    $$\mathrm{micro}F1=\frac{2\,Lpre_{t,\mathrm{micro}}\,Lrec_{t,\mathrm{micro}}}{Lpre_{t,\mathrm{micro}}+Lrec_{t,\mathrm{micro}}},\quad (4.5)$$

    Macro-F1 is described as follows:

    $$\mathrm{Macro}F1=\frac{2\,\overline{Lpre_i}\,\overline{Lrec_i}}{\overline{Lpre_i}+\overline{Lrec_i}},\quad (4.6)$$

    where $\overline{Lpre_i}$ denotes the mean precision of the ith class, and $\overline{Lrec_i}$ denotes the mean recall of the ith class.
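    Eqs. (4.1)-(4.6) can be computed from per-class counts as follows; this is a minimal sketch in which `stats` maps each class to its (TP, FP, FN) counts and the function name is ours.

```python
def micro_macro_f1(stats):
    """Micro-F1 and Macro-F1 from per-class counts, following Eqs. (4.1)-(4.6).

    stats: dict mapping class name -> (TP, FP, FN).
    """
    # Micro: pool the counts over all classes first (Eqs. (4.3)-(4.5)).
    tp = sum(s[0] for s in stats.values())
    fp = sum(s[1] for s in stats.values())
    fn = sum(s[2] for s in stats.values())
    p_micro = tp / (tp + fp)
    r_micro = tp / (tp + fn)
    micro = 2 * p_micro * r_micro / (p_micro + r_micro)

    # Macro: average per-class precision and recall first (Eq. (4.6)).
    precs = [s[0] / (s[0] + s[1]) for s in stats.values()]
    recs = [s[0] / (s[0] + s[2]) for s in stats.values()]
    p_bar = sum(precs) / len(precs)
    r_bar = sum(recs) / len(recs)
    macro = 2 * p_bar * r_bar / (p_bar + r_bar)
    return micro, macro
```

    Micro-F1 weights every sample equally, while Macro-F1 weights every class equally, so the two can diverge on imbalanced appliance classes.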

    Hamming-loss: the Hamming loss measures how often samples are misclassified, i.e., labels belonging to a sample are not predicted, or labels not belonging to the sample are predicted for it. The smaller the value, the better the model performance. The mathematical formula is

    $$Hloss(x_i,y_i)=\frac{1}{|M|}\sum_{i=1}^{|M|}\frac{1}{|L|}\operatorname{xor}(x_i,y_i),\quad (4.7)$$

    where |M| represents the size of samples, |L| represents the size of labels, xi and yi indicate the predicted value and real value, respectively.

    MAE: suppose that $y_t^i$ represents the real data and $\hat{y}_t^i$ the estimated data of appliance i at time t. The mathematical expression is as follows:

    $$MAE_i=\frac{1}{T}\sum_{t=1}^{T}\big|y_t^i-\hat{y}_t^i\big|.\quad (4.8)$$
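    The Hamming loss of Eq. (4.7) and the MAE of Eq. (4.8) can be sketched as follows; the function names are ours, and the labels are assumed to be 0/1 indicator vectors.

```python
def hamming_loss(y_pred, y_true):
    """Eq. (4.7): fraction of label slots where prediction and truth disagree.
    y_pred, y_true: lists of equal-length 0/1 label vectors."""
    n_labels = len(y_true[0])
    errors = sum(sum(p != t for p, t in zip(pred, true))
                 for pred, true in zip(y_pred, y_true))
    return errors / (len(y_true) * n_labels)

def mae(y_true, y_est):
    """Eq. (4.8): mean absolute error of one appliance's power estimate."""
    return sum(abs(a - b) for a, b in zip(y_true, y_est)) / len(y_true)
```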

    In this section, the model's learning ability and overall accuracy are verified, respectively. The various performance scores of the models are presented to evaluate the superiority of our model.

    To improve the decision-making efficiency of the MCTS model and ensure the reliability of the decision-making process, we use the ATM-DNN structure to obtain the initial tree policy pts and the action values vts, instead of randomly selecting actions at the beginning of the program. The MCTS program then self-learns for 20 h without any additional intervention until the complete search process finishes. The output parameters of each MCTS iteration are compared with the outputs of ATM-DNN, and the ATM-DNN parameters are updated as the policy improvement. We use MSE and Loss as the key evaluation indicators of the search process during model training; the convergence trend is shown in Figure 8. MSE indicates the degree of error between the actual output of MCTS and the predicted value of ATM-DNN, which is used to adjust the parameters of ATM-DNN and minimize the prediction error. Loss is the cross-entropy loss, whose value represents the similarity of the ATM-DNN selection strategy pts to the search probabilities πs.

    Figure 8.  The changing curve of the MSE and Loss on the training process.

    The training process of MCTS-CI includes two steps: first, the offline datasets of the four houses are used to train the ATM-DNN model, and the trained model provides the initial strategies for the initial selection in MCTS; second, the MCTS starts the search by self-learning and goes through exploration, simulation, and feedback. The process lasts about 20 h and performs about 10^5 decision-making steps. In each MCTS, 750 simulation operations are performed, meaning a new decision is made roughly every 0.5 s. After about 7000 self-decisions, the ATM-DNN can efficiently guide the MCTS search toward stable actions. Finally, the process realizes policy evaluation and improvement within the MCTS search.

    We use the mean reward of RL during training to further validate the performance of MCTS-CI. We employ the two datasets of No. 1 house and No. 2 house, which provide a large amount of sampled load data for the four household appliances. The validation results are shown in Figure 9. From the two curves, the intelligent agent goes through initial selection and exploration and eventually converges to the global optimum, which demonstrates the convergence of our method. At convergence, the average reward is approximately 0.697 and 0.676 on the two datasets, respectively.

    Figure 9.  Mean reward of RL in the training process of model on the datasets from No. 1 house and No. 2 house.

    To examine the influence of the number of simulations on model performance, we run our model with different simulation counts; the results are presented in Figure 10. We performed 50, 150, 350, and 750 simulation cycles, respectively, and as the number of cycles increases, so does the reward. This is because more simulations per Monte Carlo tree search improve the accuracy of the action evaluation, which helps select the optimal action and improves the effectiveness of the search. As Figure 10 shows, once the reward reaches a certain point, the performance tends to stabilize.

    Figure 10.  Reward changes during the training of the models.

    During model training, we use the training time to test the computational efficiency of the models; the comparison of the training times of the five models is shown in Figure 11. The sample size and the performance of the computing devices are the main factors in computing time. From Figure 11, our proposed model has the shortest training time across the different sizes of training sets.

    Figure 11.  Comparison of the training time in five models.

    In this subsection, four datasets from the different houses are employed to verify the performance of FHMM, KNN, CNN, Bi-LSTM, and MCTS-CI. The validation covers: 1) the comparison of the average accuracy of the five methods on the training datasets; 2) the overall performance scores of the five methods; 3) the comparison of the Micro-F1 and Macro-F1 scores on the datasets of specific household appliances. We first present the overall accuracy of the benchmark models and our model. Five-fold cross-validation on each dataset is used to ensure the reliability of our experimental results. Table 2 shows the accuracy and standard deviations of the results, with the best results in bold.

    Table 2.  Average accuracy for load online identification.
    Methods Datasets
    No. 1 house No. 2 house No. 3 house No. 4 house
    FHMM 67.13 ± 0.36 69.83 ± 1.71 54.56 ± 0.98 63.68 ± 4.32
    KNN 70.34 ± 1.23 73.56 ± 0.89 62.78 ± 0.65 74.69 ± 2.31
    CNN 78.24 ± 1.12 75.43 ± 0.62 77.23 ± 4.12 80.68 ± 0.89
    Bi-LSTM 87.78 ± 4.13 79.56 ± 2.92 85.57 ± 2.54 85.97 ± 5.18
    MCTS-CI 95.56 ± 0.19 89.91 ± 0.47 90.95 ± 0.42 92.22 ± 0.34


    From Table 2, the accuracy of MCTS-CI is higher than that of the other methods on all datasets. Notably, the average accuracy of MCTS-CI reaches 95.56% on the No. 1 house dataset, 7.78% higher than the second-ranked method. On the other datasets, the average accuracy of MCTS-CI is 89.91%, 90.95%, and 92.22%, exceeding the second-ranked method by 10.35%, 5.38%, and 6.25%, respectively. Compared with CNN, MCTS-CI is more competitive in combined load identification. The reason is that, unlike image recognition, the load has time-series features that CNN cannot fully exploit. This illustrates that our model has advantages in online load identification. In short, this method has good application prospects for NILMI.

    To further validate the performance of our model, we carry out experimental tests on both the overall and the detailed aspects of the model. The Micro-F1, Macro-F1, and H-loss scores are used to describe the experimental results.

    The overall case: the experimental results on the four datasets from the different houses are shown and discussed. Table 3 presents the overall performance indicators of the benchmark models and our proposed model on the datasets from the four houses. Bold numbers indicate the best performance. In particular, from Table 3, the Micro-F1 score of our method reaches 0.964, 0.107 higher than the second-ranked model; likewise, the scores of 0.947, 0.946, and 0.935 exceed those of the second-ranked model Bi-LSTM by 0.074, 0.092, and 0.058, respectively. The same effect appears in the Macro-F1 scores. For H-loss, the value of MCTS-CI is the smallest, indicating few misclassified samples. The second lowest is Bi-LSTM, indicating that the LSTM model has a significant advantage for load sampling data with time-series characteristics. The CNN model extracts load features well but is not sensitive to combined load identification. For MAE, the values of MCTS-CI are smaller than those of the other benchmark models, reflecting high identification accuracy on the different datasets.

    Table 3.  Overall performance score on four different datasets from different houses.
    Criteria Methods Datasets
    No. 1 house No. 2 house No. 3 house No. 4 house
    Hloss FHMM 0.021 ± 0.002 0.019 ± 0.001 0.017 ± 0.002 0.020 ± 0.001
    KNN 0.018 ± 0.001 0.016 ± 0.002 0.014 ± 0.001 0.015 ± 0.004
    CNN 0.014 ± 0.003 0.012 ± 0.004 0.011 ± 0.003 0.013 ± 0.002
    Bi-LSTM 0.009 ± 0.002 0.008 ± 0.001 0.006 ± 0.001 0.009 ± 0.003
    MCTS-CI 0.003 ± 0.001 0.003 ± 0.001 0.002 ± 0.001 0.004 ± 0.002
    Micro-F1 FHMM 0.691 ± 0.019 0.636 ± 0.012 0.627 ± 0.021 0.624 ± 0.017
    KNN 0.724 ± 0.003 0.652 ± 0.014 0.719 ± 0.016 0.698 ± 0.030
    CNN 0.821 ± 0.013 0.794 ± 0.017 0.814 ± 0.008 0.755 ± 0.023
    Bi-LSTM 0.857 ± 0.019 0.873 ± 0.040 0.854 ± 0.013 0.877 ± 0.017
    MCTS-CI 0.964 ± 0.004 0.947 ± 0.008 0.946 ± 0.011 0.935 ± 0.006
    Macro-F1 FHMM 0.562 ± 0.029 0.637 ± 0.010 0.587 ± 0.014 0.613 ± 0.011
    KNN 0.695 ± 0.019 0.642 ± 0.013 0.693 ± 0.012 0.661 ± 0.008
    CNN 0.795 ± 0.015 0.788 ± 0.015 0.758 ± 0.005 0.714 ± 0.003
    Bi-LSTM 0.828 ± 0.012 0.854 ± 0.010 0.831 ± 0.004 0.856 ± 0.012
    MCTS-CI 0.924 ± 0.008 0.937 ± 0.012 0.912 ± 0.010 0.928 ± 0.014
    MAE FHMM 0.321 ± 0.018 0.351 ± 0.015 0.371 ± 0.002 0.251 ± 0.008
    KNN 0.275 ± 0.011 0.247 ± 0.012 0.284 ± 0.003 0.212 ± 0.013
    CNN 0.172 ± 0.009 0.201 ± 0.022 0.171 ± 0.017 0.149 ± 0.008
    Bi-LSTM 0.154 ± 0.007 0.132 ± 0.011 0.154 ± 0.012 0.135 ± 0.014
    MCTS-CI 0.081 ± 0.013 0.062 ± 0.008 0.091 ± 0.012 0.061 ± 0.006


    The detailed case: to better demonstrate the significant advantages of our approach, we evaluate our model and the four benchmark models on the datasets of the various appliances from the four houses. The Micro-F1 and Macro-F1 scores of the various appliances in the four houses are shown in Figures 12 and 13. From Figure 12, the Micro-F1 scores of each appliance are shown in subgraphs (a) to (d): MCTS-CI obtains the highest score for each appliance, Bi-LSTM the second highest, and FHMM the lowest. The results in Figure 13 are similar: the Macro-F1 scores of MCTS-CI are the best, and our proposed method exceeds the other methods on every appliance. In particular, in Figure 12(a), the Micro-F1 scores of the refrigerator, water heater, microwave oven, and induction cooker reach 96.19%, 95.52%, 97.57%, and 94.37%, far superior to the other methods. In Figure 12(d), the highest Micro-F1 score, 97.96%, is obtained by our method on the water heater; the worst score, a Macro-F1 of 53.23%, appears in Figure 13(d) for FHMM on the refrigerator. FHMM does not perform well in our experiment, and similar behavior appears in the kNN and CNN models. Bi-LSTM outperforms CNN and kNN because its memory gates better represent the time-dependent relationships in the sampled data.

    Figure 12.  The Micro-F1 scores of the various appliances used in the four houses.
    Figure 13.  The Macro-F1 scores of the various appliances used in the four houses.

    Actual load identification and prediction test: in this subsection, we collect online data from No. 5 house and No. 6 house as validation datasets to verify the performance of our model in real-world applications. We observe the electricity consumption of the two households and sample the load power signals of the various household appliances on three random days. Figure 14 shows the time-domain power curves of the two houses over the three days. From Figure 14, we can identify four periods of electricity consumption: roughly 5:30 a.m. to 7:30 a.m., 11:00 a.m. to 1:00 p.m., 5:30 p.m. to 8:00 p.m., and 9:00 p.m. to 10:30 p.m. The frequency-domain image of the load power signals can also represent the load features and, used as the input of the DNN, improves the accuracy of load feature extraction. In our model, time-frequency conversion produces the frequency-domain load features. To ensure the validity of our dataset, we use the CF method to improve the quality of the power signals and reduce the acquisition error. After data preparation and preprocessing, the datasets are fed into our proposed model to extract the spatio-temporal load features and identify the specific load power of each kind of household appliance. The detailed load identification results are presented in Figures 15 and 16.

    Figure 14.  Power signal of different houses in validation datasets.
    Figure 15.  The predicting results of the specific load for the house No. 5 using our method. (a) The result for refrigerator; (b) the result for water heater; (c) the result for micro-wave oven; (d) the result for induction cooker.
    Figure 16.  The predicting results of the specific load for the house No. 6 using our method. (a) The result for refrigerator; (b) the result for water heater; (c) the result for micro-wave oven; (d) the result for induction cooker.

    As Figures 15 and 16 indicate, MCTS-CI shows good recognition performance and stability on the test data of actual electricity consumption. Significantly, the experimental results of the water heater and the refrigerator agree with the real sampled data in the usage scenarios of the two houses, which shows that MCTS-CI achieves excellent recognition of electrical appliances with high-frequency usage. The same results appear for the micro-wave oven of No. 5 house and the induction cooker of No. 6 house. For household appliances with low-frequency usage, more data are needed to train the model. Our proposed method has outstanding practical value and performance for correctly categorizing online loads.

    CGC is applied in the IoT system of the smart grid to realize the rational allocation of various resources and improve production efficiency. NILMI is the most efficient way to improve the performance and stability of CGC in the smart grid. In this paper, we develop a novel multi-stage algorithm framework, MCTS-CI. It involves a load data sharing platform, LPF-KB; a load spatio-temporal feature extraction mechanism based on ATM-DNN; and a set of optimized policy networks based on MCTS. LPF-KB enhances the utilization of sampled load data and realizes fine-grained load storage; ATM-DNN improves the distinctiveness and effectiveness of the combined load features; the extraction of load spatio-temporal features improves the learning ability of the load representations; MCTS is used to obtain the optimal strategy for reducing the error rate of load identification and energy prediction in the online scenario. Comparison with four benchmark models shows that MCTS-CI has outstanding performance in improving the accuracy of combined load identification and reducing the identification error rate for unknown loads.

    The method can be widely applied to predicting household electricity consumption in online scenarios. It can also serve as a design idea for short- and medium-term load identification with external weather and periodic temporal features, to improve the intelligent adjustment performance of the IoT system in the smart grid. In the future, we will use MCTS-CI to address more complex and variable load combination scenarios.

    The authors of this paper were supported by S & T Program of Hebei through grant 20310101D, and National Key R & D Program of China through grant 2021YFB1714800.

    The authors declare there is no conflict of interest.



    [1] D. P. M. Abellana, A systemic analysis of green computing adoption using genetically evolved fuzzy cognitive map: A Philippine scenario, Kybernetes, 50 (2020), 2668–2696. https://doi.org/10.1108/K-05-2020-0263 doi: 10.1108/K-05-2020-0263
    [2] W. M. Zheng, Q. W. Chai, J. hang, X. S. Xue, Ternary compound ontology matching for cognitive green computing, Math. Biosci. Eng., 18 (2021), 4860–4870. https://doi.org/10.3934/mbe.2021247 doi: 10.3934/mbe.2021247
    [3] X. Liu, Y. Li, X. Zhang, W. Lu, M. Xiong, Energy-efficient resource optimization in green cognitive internet of things, Mobile Networks Appl., 25 (2020), 107750–107770. https://doi.org/10.1007/s11036-020-01510-w doi: 10.1007/s11036-020-01510-w
    [4] Y. M. Jiang, M. S. Liu, H. Peng, M. Z. A. Bhuiyan, A reliable deep learning-based algorithm design for IoT load identification in smart grid, Ad Hoc Networks, 123 (2021), 102643–102673. https://doi.org/10.1016/j.adhoc.2021.102643 doi: 10.1016/j.adhoc.2021.102643
    [5] Y. Liu, L. Zhong, J. Qiu, J. Lu, W. Wang, Unsupervised domain adaptation for non-intrusive load monitoring via adversarial and joint adaptation network, IEEE Trans. Ind. Inf., 18 (2021), 266–277. https://doi.org/10.1109/TII.2021.3065934 doi: 10.1109/TII.2021.3065934
    [6] H. Peng, J. X. Li, S. Z. Wang, L. H. Wang, Q. R. Gong, R. Y. Yang, et al., Hierarchical taxonomy-aware and attentional graph capsule RCNNs for large-scale multi-label text classification, IEEE Trans. Knowl. Data Eng., 33 (2021), 2505–2519 https://doi.org/10.1109/TKDE.2019.2959991 doi: 10.1109/TKDE.2019.2959991
    [7] A. F. Moreno Jaramillo, D. M. Laverty, D. J. Morrow, J. Martinez del Rincon, A. M. Foley, Load modelling and non-intrusive load monitoring to integrate distributed energy resources in low and medium voltage networks, Renewable Energy, 179 (2021), 445–466. https://doi.org/10.1016/j.renene.2021.07.056 doi: 10.1016/j.renene.2021.07.056
    [8] R. V. A. Monteiro, J. C. R. de Santana, R. F. S. Teixeira, A. S. Bretas, R. Aguiar, C. E. P. Poma, Non-intrusive load monitoring using artificial intelligence classifiers: Performance analysis of machine learning techniques, Electr. Power Syst. Res., 198 (2021), 107347. https://doi.org/10.1016/j.epsr.2021.107347 doi: 10.1016/j.epsr.2021.107347
    [9] H. X. Wang, J. S. Zhang, C. B. Lu, C. Y. Wu, Privacy preserving in non-intrusive load monitoring: A differential privacy perspective, IEEE Trans. Smart Grid, 12 (2020), 2529–2543. https://doi.org/10.1109/TSG.2020.3038757 doi: 10.1109/TSG.2020.3038757
    [10] D. Hua, F. Q. Huang, L. J. Wang, W. T. Chen, Simultaneous disaggregation of multiple appliances based on non-intrusive load monitoring, Electr. Power Syst. Res., 193 (2021), 106887. https://doi.org/10.1016/j.epsr.2020.106887 doi: 10.1016/j.epsr.2020.106887
    [11] M. Dincecco, S. Squartini, M. J. Zhong, Transfer learning for non-intrusive load monitoring, IEEE Trans. Smart Grid, 11 (2020), 1419-1429. https://doi.org/10.1109/TSG.2019.2938068 doi: 10.1109/TSG.2019.2938068
    [12] G. A. Raiker, R. B. Subba, U. Loganathan, S. Agrawal, A. S. Thakur, J. P. Barton, et al., Energy disaggregation using energy demand model and IoT based control, IEEE Trans. Ind. Appl., 57 (2020), 1746–1754. https://doi.org/10.1109/TIA.2020.3047016 doi: 10.1109/TIA.2020.3047016
    [13] F. Ciancetta, G. Bucci, E. Fiorucci, S. Mari, A. Fioravanti, A new convolutional neural network-based system for NILM applications, IEEE Trans. Instrum. Meas., 70 (2021), 1501112. https://doi.org/10.1109/TIM.2020.3035193 doi: 10.1109/TIM.2020.3035193
    [14] A. Faustine, L. Pereira, C. Klemenjak, Adaptive weighted recurrence graphs for appliance recognition in non-intrusive load monitoring, IEEE Trans. Smart Grid, 12 (2020), 398–406. https://doi.org/10.1109/TSG.2020.3010621 doi: 10.1109/TSG.2020.3010621
    [15] Y. Himeur, A. Alsalemi, F. Bensaali, A. Amira, An intelligent nonintrusive load monitoring scheme based on 2D phase encoding of power signals, Int. J. Intell. Syst., 36 (2020), 72–93. https://doi.org/10.1002/int.22292 doi: 10.1002/int.22292
    [16] Y. Yang, J. Zhong, W. Li, T. A. Gulliver, S. Li, Semisupervised multilabel deep learning based nonintrusive load monitoring in smart grids, IEEE Trans. Ind. Inf., 16 (2020), 6892–6902. https://doi.org/10.1109/TII.2019.2955470 doi: 10.1109/TII.2019.2955470
    [17] M. Kaselimi, N. Doulamis, A. Voulodimos, E. Protopapadakis, A. Doulamis, Context aware energy disaggregation using adaptive bidirectional LSTM models, IEEE Trans. Smart Grid, 11 (2020), 3054–3067. https://doi.org/10.1109/TSG.2020.2974347 doi: 10.1109/TSG.2020.2974347
    [18] H. Chen, Y. H. Wang, C. H. Fan, A convolutional autoencoder-based approach with batch normalization for energy disaggregation, J. Supercomput., 11 (2021), 2961–2978. https://doi.org/10.1007/s11227-020-03375-y doi: 10.1007/s11227-020-03375-y
    [19] P. Hao, J. Li, Y. He, Y. Liu, M. Bao, L. Wang, et al., Large-scale hierarchical text classification with recursively regularized deep graph-CNN, in Proceedings of the 2018 World Wide Web Conference (WWW 2018), Lyon, France, (2018), 1063–1072. https://doi.org/10.1145/3178876.3186005
    [20] H. Peng, J. Li, Q. Gong, Y. Ning, S. Wang, L. He, Motif-matching based subgraph-level attentional convolutional network for graph classification, in Proceedings of the AAAI Conference on Artificial Intelligence, 34 (2020), 5387–5394. https://doi.org/10.1609/aaai.v34i04.5987
    [21] A. Moradzadeh, O. Sadeghian, K. Pourhossein, B. Mohammadi-Ivatloo, A. Anvari-Moghaddam, Improving residential load disaggregation for sustainable development of energy via principal component analysis, Sustainability, 12 (2020), 1–14. https://doi.org/10.3390/su12083158 doi: 10.3390/su12083158
    [22] P. Hao, J. Li, Y. Song, Y. Liu, Incrementally learning the hierarchical softmax function for neural language models, in Thirty-First AAAI Conference on Artificial Intelligence, San Francisco, CA, 31 (2017), 3267–3273. https://doi.org/10.1609/aaai.v31i1.10994
    [23] A. U. Rehman, T. T. Lie, B. Vallès, S. R. Tito, Event-detection algorithms for low sampling nonintrusive load monitoring systems based on low complexity statistical features, IEEE Trans. Instrum. Meas., 69 (2019), 751–759. https://doi.org/10.1109/TIM.2019.2904351 doi: 10.1109/TIM.2019.2904351
    [24] M. S. Tsai, Y. H. Lin, Modern development of an adaptive non-intrusive appliance load monitoring system in electricity energy conservation, Appl. Energy, 96 (2012), 55–73. https://doi.org/10.1016/j.apenergy.2011.11.027 doi: 10.1016/j.apenergy.2011.11.027
    [25] Z. J. Zhou, Y. M. Xiang, H. Xu, Y. S. Wang, D. Shi, Z. W. Wang, Self-organizing probability neural network-based intelligent non-intrusive load monitoring with applications to low-cost residential measuring devices, Trans. Inst. Meas. Control, 96 (2020), 635–645. https://doi.org/10.1177/0142331220950865 doi: 10.1177/0142331220950865
    [26] C. Chen, P. Gao, J. Jiang, H. Wang, P. Li, S. Wan, A deep learning based non-intrusive household load identification for smart grid in China, Comput. Commun., 177 (2021), 175–184. https://doi.org/10.1016/j.comcom.2021.06.023 doi: 10.1016/j.comcom.2021.06.023
    [27] D. L. Su, Q. Shi, H. Xu, W. Wang, Nonintrusive load monitoring based on complementary features of spurious emissions, Electronics, 8 (2019), 1002. https://doi.org/10.3390/electronics8091002 doi: 10.3390/electronics8091002
    [28] F. Ciancetta, G. Bucci, E. Fiorucci, S. Mari, A. Fioravanti, A new convolutional neural network-based system for NILM applications, IEEE Trans. Instrum. Meas., 70 (2020), 1501112. https://doi.org/10.1109/TIM.2020.3035193 doi: 10.1109/TIM.2020.3035193
    [29] D. Ding, J. Li, K. Zhang, H. Wang, K. Wang, T. Cao, Non-intrusive load monitoring method with inception structured CNN, Appl. Intell., 30 (2021), 1–18. https://doi.org/10.1007/s10489-021-02690-y doi: 10.1007/s10489-021-02690-y
    [30] H. Peng, J. Li, Y. Song, R. Yang, R. Ranjan, P. S. Yu, et al., Streaming social event detection and evolution discovery in heterogeneous information networks, ACM Trans. Knowl. Discovery Data, 15 (2021), 1–33. https://doi.org/10.1145/3447585 doi: 10.1145/3447585
    [31] Z. Jia, L. Yang, Z. Zhang, H. Liu, F. Kong, Sequence to point learning based on bidirectional dilated residual network for non-intrusive load monitoring, Int. J. Electr. Power Energy Syst., 129 (2021), 106837. https://doi.org/10.1016/j.ijepes.2021.106837 doi: 10.1016/j.ijepes.2021.106837
    [32] H. Peng, R. Zhang, Y. Dou, R. Yang, J. Zhang, P. S. Yu, Reinforced neighborhood selection guided multi-relational graph neural networks, ACM Trans. Inf. Syst., 40 (2021), 1–46. https://doi.org/10.1145/3490181 doi: 10.1145/3490181
    [33] C. Dinesh, S. Makonin, I. V. Bajić, Residential power forecasting using load identification and graph spectral clustering, IEEE Trans. Circuits Syst. II Express Briefs, 66 (2019), 1900–1904. https://doi.org/10.1109/TCSII.2019.2891704 doi: 10.1109/TCSII.2019.2891704
    [34] H. Peng, J. Li, Z. Wang, R. Yang, M. Liu, M. Zhang, et al., Lifelong property price prediction: A case study for the toronto real estate market, IEEE Trans. Knowl. Data Eng., 40 (2021). https://doi.org/10.1109/TKDE.2021.3112749
    [35] Z. Wu, C. Wang, H. Zhang, W. Peng, W. Liu, A time-efficient factorial hidden Semi-Markov model for non-intrusive load monitoring, Electr. Power Syst. Res., 199 (2021), 107372. https://doi.org/10.1016/j.epsr.2021.107372 doi: 10.1016/j.epsr.2021.107372
    [36] N. Henao, K. Agbossou, S. Kelouwani, Y. Dubé, M. Fournier, Approach in nonintrusive type I load monitoring using subtractive clustering, IEEE Trans. Smart Grid, 8 (2017), 812–821. https://doi.org/10.1109/TSG.2015.2462719 doi: 10.1109/TSG.2015.2462719
    [37] D. Yang, X. Gao, L. Kong, Y. Pang, B. Zhou, An event-driven convolutional neural architecture for non-intrusive load monitoring of residential appliance, IEEE Trans. Consum. Electron., 66 (2020), 173–182. https://doi.org/10.1109/TCE.2020.2977964 doi: 10.1109/TCE.2020.2977964
    [38] T. T. H. Le, S. Heo, H. Kim, Toward load identification based on the hilbert transform and sequence to sequence long short-term memory, IEEE Trans. Smart Grid, 12 (2021), 3252–3264. https://doi.org/10.1109/TSG.2021.3066570 doi: 10.1109/TSG.2021.3066570
    [39] H. Rafiq, X. Shi, H. Zhang, H. Li, M. K. Ochani, A. A. Shah, Generalizability improvement of deep learning based non-intrusive load monitoring system using data augmentation, IEEE Trans. Smart Grid, 99 (2021), 75–114. https://doi.org/10.1109/TSG.2021.3082622 doi: 10.1109/TSG.2021.3082622
    [40] S. Makonin, F. Popowich, I. V. Bajić, B. Gill, L. Bartram, Exploiting HMM sparsity to perform online real-time nonintrusive load monitoring, IEEE Trans. Smart Grid, 7 (2016), 2575–2585. https://doi.org/10.1109/TSG.2015.2494592 doi: 10.1109/TSG.2015.2494592
    [41] S. P. Cai, Z. M. Sun, J. Yan, D. H. Tang, Y. Chen, Z. Y. Zhou, Fisher information and online SVR-based dynamic modeling methodology for meteorological sensitive load forecasting in smart grids, Electr. Eng., 104 (2021), 513–527. https://doi.org/10.1007/s00202-021-01308-3 doi: 10.1007/s00202-021-01308-3
    [42] V. Álvarez, S. Mazuelas, J. A. Lozano, Probabilistic load forecasting based on adaptive online learning, IEEE Trans. Power Syst., 36 (2021), 3668–3680. https://doi.org/10.1109/TPWRS.2021.3050837 doi: 10.1109/TPWRS.2021.3050837
    [43] Y. Du, F. Li, Intelligent multi-microgrid energy management based on deep neural network and model-free reinforcement learning, IEEE Trans. Smart Grid, 11 (2019), 1066–1076. https://doi.org/10.1109/TSG.2019.2930299 doi: 10.1109/TSG.2019.2930299
    [44] C. Wang, S. Mei, H. Yu, S. Cheng, L. Du, P. Yang, Unintentional islanding transition control strategy for three-/single-phase multimicrogrids based on artificial emotional reinforcement learning, IEEE Syst. J., 15 (2021), 5464–5475. https://doi.org/10.1109/JSYST.2021.3074296 doi: 10.1109/JSYST.2021.3074296
    [45] H. Peng, H. Wang, B. Du, M. Bhuiyan, H. Ma, J. Liu, et al., Spatial temporal incidence dynamic graph neural networks for traffic flow forecasting, Inf. Sci., 521 (2020), 277–290. https://doi.org/10.1016/j.ins.2020.01.043 doi: 10.1016/j.ins.2020.01.043
    [46] H. Peng, H. Li, Y. Song, V. W. Zheng, J. Li, Differentially private federated knowledge graphs embedding, IEEE Trans. Knowl. Data Eng., 40 (2021). https://doi.org/10.1145/3459637.3482252
    [47] Y. Li, R. Wang, Z. Yang, Optimal scheduling of isolated microgrids using automated reinforcement learning-based multi-period forecasting, IEEE Trans. Sustainable Energy, 13 (2022), 159–169. https://doi.org/10.1109/TSTE.2021.3105529 doi: 10.1109/TSTE.2021.3105529
    [48] D. Cao, W. Hu, X. Xu, Q. Wu, Q. Huang, Z. Chen, et al., Deep reinforcement learning based approach for optimal power flow of distribution networks embedded with renewable energy and storage devices, J. Mod. Power Syst. Clean Energy, 9 (2021), 1101–1110. https://doi.org/10.35833/MPCE.2020.000557 doi: 10.35833/MPCE.2020.000557
    [49] J. Li, H. Peng, Y. Cao, Y. Dou, H. Zhang, P. S. Yu, et al., Higher-order attribute-enhancing heterogeneous graph neural networks, IEEE Trans. Knowl. Data Eng., 40 (2021). https://doi.org/10.48550/arXiv.2104.07892
    [50] L. Xi, L. Zhou, L. Liu, D. Duan, Y. Xu, L. Yang, et al., A deep reinforcement learning algorithm for the power order optimization allocation of AGC in interconnected power grids, CSEE J. Power Energy Syst., 6 (2020), 712–723. https://doi.org/10.17775/CSEEJPES.2019.01840 doi: 10.17775/CSEEJPES.2019.01840
    [51] D. Silver, J. Schrittwieser, K. Simonyan, I. Antonoglou, A. Huang, A. Guez, et al., Mastering the game of Go without human knowledge, Nature, 550 (2017), 354–359. https://doi.org/10.1038/nature24270 doi: 10.1038/nature24270
    [52] R. Yao, X. Lu, H. Zhou, J. Lai, A novel category-specific pricing strategy for demand response in microgrids, IEEE Trans. Sustainable Energy, 12 (2020), 182–195. https://doi.org/10.1109/TSTE.2021.3106329 doi: 10.1109/TSTE.2021.3106329
    [53] Z. Xiao, C. Fan, J. Yuan, X. Xu, W. Gang, Comparison between artificial neural network and random forest for effective disaggregation of building cooling load, Case Stud. Therm. Eng., 28 (2021), 101589. https://doi.org/10.1016/j.csite.2021.101589 doi: 10.1016/j.csite.2021.101589
    [54] R. Yao, H. Zhou, D. Zhou, H. Zhang, State characteristic clustering for nonintrusive load monitoring with stochastic behaviours in smart grids, Complexity, 2021 (2021), 8839595. https://doi.org/10.1155/2021/8839595 doi: 10.1155/2021/8839595
    [55] H. Peng, J. Li, Q. Gong, Y. Song, P. S. Yu, Fine-grained event categorization with heterogeneous graph convolutional networks, in Twenty-Eighth International Joint Conference on Artificial Intelligence IJCAI-19, (2019), 3238–3245. https://doi.org/10.48550/arXiv.1906.04580
    [56] Y. J. Zhang, L. Yu, Z. J. Fang, N. N. Xiong, L. J. Zhang, H. Y. Tian, An end-to-end deep learning model for robust smooth filtering identification, Future Gener. Comput. Syst., 127 (2021), 182–195. https://doi.org/10.1016/j.future.2021.09.004 doi: 10.1016/j.future.2021.09.004
    [57] Q. Liu, K. M. Kamoto, X. Liu, M. Sun, N. Linge, Low-complexity non-intrusive load monitoring using unsupervised learning and generalized appliance models, IEEE Trans. Consum. Electron., 65 (2019), 28–37. https://doi.org/10.1109/TCE.2019.2891160 doi: 10.1109/TCE.2019.2891160
    [58] H. Shuai, H. He, Online scheduling of a residential microgrid via Monte-Carlo tree search and a learned model, IEEE Trans. Smart Grid, 12 (2020), 1073–1087. https://doi.org/10.1109/TSG.2020.3035127 doi: 10.1109/TSG.2020.3035127
    [59] T. Shao, H. Zhang, K. Cheng, K. Zhang, L. Bie, The hierarchical task network planning method based on Monte Carlo tree search, Knowl.-Based Syst., 225 (2021), 107067. https://doi.org/10.1016/j.knosys.2021.107067 doi: 10.1016/j.knosys.2021.107067
    [60] R. Lu, S. H. Hong, M. Yu, Demand response for home energy management using reinforcement learning and artificial neural network, IEEE Trans. Smart Grid, 10 (2019), 6629–6639. https://doi.org/10.1109/TSG.2019.2909266 doi: 10.1109/TSG.2019.2909266
    [61] V. de Carvalho Neiva Pinheiro, A. L. Francato, W. B. Powell, Reinforcement learning for electricity dispatch in grids with high intermittent generation and energy storage systems: A case study for the Brazilian grid, Int. J. Energy Res., 44 (2020), 8635–8653. https://doi.org/10.1002/er.5551 doi: 10.1002/er.5551
    [62] H. Shuai, H. He, Online scheduling of a residential microgrid via Monte-Carlo tree search and a learned model, IEEE Trans. Smart Grid, 12 (2020), 1073–1087. https://doi.org/10.1109/TSG.2020.3035127 doi: 10.1109/TSG.2020.3035127
    [63] X. Liu, A. Fotouhi, Formula-E race strategy development using artificial neural networks and Monte Carlo tree search, Neural Comput. Appl., 32 (2020), 15191–15207. https://doi.org/10.1007/s00521-020-04871-1 doi: 10.1007/s00521-020-04871-1
    [64] E. J. Powley, P. I. Cowling, D. Whitehouse, Memory bounded Monte Carlo tree search, in Thirteenth Artificial Intelligence and Interactive Digital Entertainment Conference, 13 (2017), 94–100. Available from: https://ojs.aaai.org/index.php/AIIDE/article/view/12932.
    [65] R. Bonfigli, A. Felicetti, E. Principi, M. Fagiani, S. Squartini, F. Piazza, Denoising autoencoders for non-intrusive load monitoring: Improvements and comparative evaluation, Energy Build., 158 (2018), 1461–1474. https://doi.org/10.1016/j.enbuild.2017.11.054 doi: 10.1016/j.enbuild.2017.11.054
    [66] Y. Himeur, A. Alsalemi, F. Bensaali, A. Amira, Smart non-intrusive appliance identification using a novel local power histogramming descriptor with an improved k-nearest neighbors classifier, Sustainable Cities Soc., 67 (2021), 102764. https://doi.org/10.1016/j.scs.2021.102764 doi: 10.1016/j.scs.2021.102764
    [67] L. D. Nolasco, A. E. Lazzaretti, B. M. Mulinari, DeepDFML-NILM: A new CNN-based architecture for detection, feature extraction and multi-label classification in NILM signals, IEEE Sens. J., 22 (2022), 501–509. https://doi.org/10.1109/JSEN.2021.3127322 doi: 10.1109/JSEN.2021.3127322
    [68] Z. Huang, W. Xu, K. Yu, Bidirectional LSTM-CRF models for sequence tagging, preprint, (2015). https://doi.org/10.48550/arXiv.1508.01991
  • This article has been cited by:

    1. Marco Kemmerling, Daniel Lütticke, Robert H. Schmitt, Beyond games: a systematic review of neural Monte Carlo tree search applications, 2024, 54, 0924-669X, 1020, 10.1007/s10489-023-05240-w
    2. Weijin Mao, Wenzhen Wu, Pierluigi Siano, Hazlie Mokhlis, 2024, A short-term prediction model for photovoltaic power forecasting based on CEEMDAN-CS-LSTM, 9781510679795, 394, 10.1117/12.3024734
    3. Yixin Zhuo, Ling Li, Jian Tang, Wenchuan Meng, Zhanhong Huang, Kui Huang, Jiaqiu Hu, Yiming Qin, Houjian Zhan, Zhencheng Liang, Optimal real-time power dispatch of power grid with wind energy forecasting under extreme weather, 2023, 20, 1551-0018, 14353, 10.3934/mbe.2023642
  • © 2022 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
