Sentence opinion mining model for fusing target entities in official government documents

Xiao Ma; Teng Yang; Feng Bai; Yunmei Shi; Xiao Ma; Teng Yang; Feng Bai; Yunmei Shi

doi:10.3934/era.2023177

Electronic Research Archive

2023, Volume 31, Issue 6: 3495-3509. doi: 10.3934/era.2023177

Previous Article Next Article

Research article Special Issues

Sentence opinion mining model for fusing target entities in official government documents

1.
Department of Big Data Development, The State Information Center, Beijing 100045, China
2.
School of Computer Science, Beijing Information Science and Technology University, Beijing 100101, China

Received: 08 January 2023 Revised: 27 March 2023 Accepted: 06 April 2023 Published: 20 April 2023

When drafting official government documents, it is necessary to firmly grasp the main idea and ensure that any positions stated within the text are consistent with those in previous documents. In combination with the field's demands, By taking advantage of suitable text-mining techniques to harvest opinions from sentences in official government documents, the efficiency of official government document writers can be significantly increased. Most existing opinion mining approaches employ text classification methods to directly mine the sentential text of official government documents while disregarding the influence of the objects described within the documents (i.e., the target entities) on the sentence opinion categories. To address these issues, this study proposes a sentence opinion mining model that fuses the target entities within documents. Based on the Bi-directional long short-term (BiLSTM) and attention mechanisms, the model fully considers the attention given by a official government document's target entity to different words within the corresponding sentence text, as well as the dependency between words of the sentence. The model subsequently fuses two by using feature vector fusion to obtain the final semantic representation of the text, which is then classified using a fully connected network and softmax function. Experimental results based on a dataset of official government documents show that the model significantly outperforms baseline models such as Text-convolutional neural network (TextCNN), recurrent neural network (RNN), and BiLSTM.

Keywords:

Citation: Xiao Ma, Teng Yang, Feng Bai, Yunmei Shi. Sentence opinion mining model for fusing target entities in official government documents[J]. Electronic Research Archive, 2023, 31(6): 3495-3509. doi: 10.3934/era.2023177

Related Papers:

[1]	Hongjie Deng, Lingxi Peng, Jiajing Zhang, Chunming Tang, Haoliang Fang, Haohuai Liu . An intelligent aerator algorithm inspired-by deep learning. Mathematical Biosciences and Engineering, 2019, 16(4): 2990-3002. doi: 10.3934/mbe.2019148
[2]	Huanhai Yang, Shue Liu . A prediction model of aquaculture water quality based on multiscale decomposition. Mathematical Biosciences and Engineering, 2021, 18(6): 7561-7579. doi: 10.3934/mbe.2021374
[3]	Delong Cui, Hong Huang, Zhiping Peng, Qirui Li, Jieguang He, Jinbo Qiu, Xinlong Luo, Jiangtao Ou, Chengyuan Fan . Next-generation 5G fusion-based intelligent health-monitoring platform for ethylene cracking furnace tube. Mathematical Biosciences and Engineering, 2022, 19(9): 9168-9199. doi: 10.3934/mbe.2022426
[4]	Lingmin Lin, Kailai Liu, Huan Feng, Jing Li, Hengle Chen, Tao Zhang, Boyun Xue, Jiarui Si . Glucose trajectory prediction by deep learning for personal home care of type 2 diabetes mellitus: modelling and applying. Mathematical Biosciences and Engineering, 2022, 19(10): 10096-10107. doi: 10.3934/mbe.2022472
[5]	Carlos Camilo-Garay, R. Israel Ortega-Gutiérrez, Hugo Cruz-Suárez . Optimal strategies for a fishery model applied to utility functions. Mathematical Biosciences and Engineering, 2021, 18(1): 518-529. doi: 10.3934/mbe.2021028
[6]	Jun Chen, Gangfeng Wang, Tao Xue, Tao Li . An improved polychromatic graphs-based BOM multi-view management and version control method for complex products. Mathematical Biosciences and Engineering, 2021, 18(1): 712-726. doi: 10.3934/mbe.2021038
[7]	Jose Guadalupe Beltran-Hernandez, Jose Ruiz-Pinales, Pedro Lopez-Rodriguez, Jose Luis Lopez-Ramirez, Juan Gabriel Avina-Cervantes . Multi-Stroke handwriting character recognition based on sEMG using convolutional-recurrent neural networks. Mathematical Biosciences and Engineering, 2020, 17(5): 5432-5448. doi: 10.3934/mbe.2020293
[8]	Boyi Zeng, Jun Zhao, Shantian Wen . A textual and visual features-jointly driven hybrid intelligent system for digital physical education teaching quality evaluation. Mathematical Biosciences and Engineering, 2023, 20(8): 13581-13601. doi: 10.3934/mbe.2023606
[9]	Mengfan Liu, Runkai Jiao, Qing Nian . Training method and system for stress management and mental health care of managers based on deep learning. Mathematical Biosciences and Engineering, 2022, 19(1): 371-393. doi: 10.3934/mbe.2022019
[10]	Ivan Izonin, Nataliya Shakhovska . Special issue: Informatics & data-driven medicine. Mathematical Biosciences and Engineering, 2021, 18(5): 6430-6433. doi: 10.3934/mbe.2021319

Abstract

1. Introduction

In recent years, as the scale of fishery aquaculture in China continues to expand, the contradiction between the development of related industries and environmental resources and the low productivity of labour has become increasingly prominent ^[1]. Such a situation has made standardised, digital and intelligent aquaculture technology a hot issue for current research ^[2,3]. The traditional human labour method of free-range aquaculture no longer meets the real needs of the current aquaculture market ^[4]. Therefore, efficient and accurate digital industrial aquaculture (DIA) is the direction of development that meets the realistic needs of aquaculture ^[5]. The development of DIA is an important part of building sustainable smart cities ^[6]. It is important to develop intelligent management solutions for DIA with the help of artificial intelligence technology ^[7]. The sensor network in the DIA can provide a large amount of scene monitoring data for this purpose ^[8]. This facilitates the implementation of intelligent management ^[9]. This work therefore aims to explore data-driven intelligent management solutions ^[10].

As artificial intelligence continues to advance, a number of scholars have begun to explore technological approaches to building assisted DIA using machine learning and deep learning in recent years ^{[11,12,13,14]}. Haq et al. ^[15] proposed a hybrid CNN and LSTM depth learning model to effectively predict the water quality of aquaculture. Wang et al. ^[16] proposed a dual neural network method including feature extraction network and full convolution semantic segmentation network to solve the problem of high cost of sample collection. Ahmed et al. ^[17] successfully identified the infected fish in aquaculture by using image processing and machine learning ^[18]. The advantages of these technologies are that they can improve the utilisation of available resources, increase productivity and free up labour while reducing costs ^[19]. In contrast to traditional techniques, the deep learning-based approach involves analysis and empirical learning of fish farming data and finally the development of intelligent management solutions ^[20,21,22]. Intelligent management encompasses many elements and has two main aspects: fish condition management and environmental condition management. The former in turn contains weight prediction, oxygen consumption prediction, feeding prediction, etc., while the latter contains water quality prediction ^[23]. How to use limited computing resources to achieve multiple business needs is a technical challenge for DIA ^[24].

To address the above issues, this paper extends the approach of single-objective prediction to a multi-objective prediction structure. Multiple objectives are predicted concurrently while maintaining the stability of the model ^{[25,26,27,28]}. To this end, this paper introduces a multi-objective neural network structure and proposes a data-driven intelligent management scheme for digital industrial aquaculture based on multi-object deep neural network (Mo-DIA). For fish condition management, a multi-objective prediction model is constructed using a double-hidden layer BP neural network ^[29,30]. The data on fish length, body width, temperature and feeding frequency are used to predict fish weight, fish oxygen consumption and fish feeding effectively. In terms of environmental condition management, the LSTM neural network-based multi-objective prediction model was constructed using the temporal correlation of water quality data series collection ^[31,32]. By training the analysis of water quality data, the time series values of eight attributes regarding Nitrite nitrogen ( ${\text{NO}}_2^ - {\text{ - N}}$ ), Nitrate nitrogen ( ${\text{NO}}_3^ - {\text{ - N}}$ ), Dissolved oxygen (DO), Temperature (T), Total nitrogen (TN), Potential of hydrogen (PH), Chemical oxygen demand (COD) and Ammonia nitrogen ( ${\text{NH}}_4^{\text{ + }}{\text{ - N}}$ ) are predicted. The main contributions of this paper can be summarised as:

● This work systematically analyses the main tasks and challenges of DIA intelligent management and explores a comprehensive technology-assisted model.

● This paper introduces a multi-objective neural network structure and proposes a multi-objective deep neural network-based approach to DIA intelligent management.

● The effectiveness of this paper's multi-objective deep neural network approach is confirmed by conducting extensive experiments on datasets collected in real scenarios.

● This work provides a comprehensive evaluation of the multi-objective concurrent prediction results, and the performance of the method is positive.

2. Problem statement

The goal of intelligent aquaculture is to move from data to decision making. In order to achieve intelligent aquaculture, all aspects of the culture need to be finely controlled, including oxygen aeration, feeding and feeding rates. As shown in Figure 1, it represents the complex scene of a digital industrial aquaculture. Experience from artificial aquaculture shows that fish cannot grow without sufficient oxygen, the right amount of bait and a good water quality environment. Therefore, the intelligent management solution based on DIA proposed in this paper is to use artificial intelligence algorithm technology combined with hardware equipment to deal with the state management of the fish and the water quality state management of the aquaculture environment.

Figure 1. Scenarios of digital industrialized aquaculture.

DownLoad: Full-Size Img PowerPoint

For fish condition management, the system collects images of fish behaviour and body size characteristics in real time through computer vision technology and sends them to the computer ^[33,34]. The computer analyses the fish length, width, temperature, feeding frequency and other data from the images. The data is then fed into the BP neural network model (Mo-BP) to predict fish weight, fish oxygen consumption and fish feeding status. The daily bait feeding rate is 2% of the fish body weight. The predicted data is used to determine whether the fish are in a normal growth process and to make decisions to ensure that the fish can grow normally. For the management of the environmental status of aquaculture is to better control the water quality environment of aquaculture. This system deploys sensors to monitor a wide range of water quality parameters in the water. These measured data are then passed into a recurrent neural network model (Mo-LSTM) to predict time series values regarding eight attributes: ${\text{NO}}_2^ - {\text{ - N}}$ , ${\text{NO}}_3^ - {\text{ - N}}$ , DO, T, TN, pH, COD and ${\text{NH}}_4^{\text{ + }}{\text{ - N}}$ . Based on the predictions, the water quality is automatically treated using the appropriate filtration equipment to monitor the farming environment and warn of abnormalities in real time.

3. Methodology

3.1. Overview

The DIA management approach proposed in this paper includes two major aspects: fish state management and environmental state management. As shown in , it represents technical illustration of the two sub-modules in the proposed Mo-DIA. In terms of fish condition management, this paper proposes a Mo-BP model to implement a multi-objective prediction model. The data on fish length, body width, temperature and feeding frequency are used to predict fish weight, fish oxygen consumption and fish feeding effectively. In terms of environmental state management, this paper proposes a Mo-LSTM model to implement a multi-objective prediction model. Using the time correlation of water quality data series collection, the time series values of eight attributes regarding ${\text{NO}}_2^ - {\text{ - N}}$ , ${\text{NO}}_3^ - {\text{ - N}}$ , DO, T, TN, pH, COD and ${\text{NH}}_4^{\text{ + }}{\text{ - N}}$ are predicted.

Figure 2. Technical illustration of the two sub-modules in the proposed Mo-DIA.

DownLoad: Full-Size Img PowerPoint

3.2. Fish state management based on multi-objective BP neural networks

Back Propagation Neural Network, an extremely common artificial neural network model, is a multi-layer feed-forward neural network trained according to an error back propagation algorithm. The learning process of the Mo-BP algorithm consists of two processes: the forward propagation of the signal and the backward propagation of the error. In this experiment, the number of hidden layers is set to 2, and each hidden layer contains 4 neurons.

3.2.1. Forward propagation calculation

In the forward propagation process, the final output value and the loss between the output value and the actual value are calculated based on the input samples, combined with the given initialized weight value $W$ and the value of the bias term $b$ .

Step 1: Parameter initialisation. The sample input here is denoted as $\vec a = \left({{x_1}, {x_2}} \right)$ . The initialised three-layer weight values and bias values are:

$\begin{equation} {W^{(1)}} = \left[ {\begin{array}{*{20}{l}} {{w_{\left( {{x_1},1} \right)}},{w_{\left( {{x_2},1} \right)}}} \\ {{w_{\left( {{x_1},2} \right)}},{w_{\left( {{x_2},2} \right)}}} \\ \begin{gathered} {w_{\left( {{x_1},3} \right)}},{w_{\left( {{x_2},3} \right)}} \hfill \\ {w_{\left( {{x_1},4} \right)}},{w_{\left( {{x_2},4} \right)}} \hfill \\ \end{gathered} \end{array}} \right] \end{equation}$

(3.1)

$\begin{equation} {W^{(2)}} = \left[ {\begin{array}{*{20}{l}} {{w_{(1,5)}},{w_{(2,5)}},{w_{(3,5)}},{w_{(4,5)}}} \\ \begin{gathered} {w_{(1,6)}},{w_{(2,6)}},{w_{(3,6)}},{w_{(3,6)}} \hfill \\ {w_{(1,7)}},{w_{(2,7)}},{w_{(3,7)}},{w_{(4,7)}} \hfill \\ {w_{(1,8)}},{w_{(2,8)}},{w_{(3,8)}},{w_{(4,8)}} \hfill \\ \end{gathered} \end{array}} \right] \end{equation}$

(3.2)

$\begin{equation} {W^{(3)}} = \left[ {\begin{array}{*{20}{l}} {{w_{(5,9)}},{w_{(6,9)}},{w_{(7,9)}},{w_{(8,9)}}} \\ \end{array}} \right] \end{equation}$

(3.3)

where the corresponding bias values for each layer are: ${b^{(1)}} = \left[ {{b_1}, {b_2}, {b_3}, {b_4}} \right]$ , ${b^{(2)}} = \left[ {{b_5}, {b_6}, {b_7}, {b_8}} \right]$ , ${b^{(3)}} = \left[ {{b_9}} \right]$ .

Step 2: Calculation of hidden layers. The proposed Mo-BP model has two hidden layers, where the input of the first hidden layer are ${z_1}, {z_2}, {z_3}, {z_4}$ according to Eq 3.4. The function $g\left(x \right)$ is chosen as the activation function, and the outputs of this layer are ${g_1}\left({{z_1}} \right), {g_2}\left({{z_2}} \right), {g_3}\left({{z_3}} \right), {g_4}\left({{z_4}} \right)$ . The input to the second layer are ${{z_5}, {z_6}, {z_7}, {z_8}}$ and the output are ${g_5}\left({{z_5}} \right), {g_6}\left({{z_6}} \right), {g_7}\left({{z_7}} \right), {g_8}\left({{z_8}} \right)$ according to Eq 3.5.

$\begin{equation} Z^{(1)} = W^{(1)} *(\vec{a})^T+\left(b^{(1)}\right)^T \end{equation}$

(3.4)

$\begin{equation} {Z^{(2)}} = {W^{(2)}}*{\left[ {{z_1},{z_2},{z_3},{z_4}} \right]^T} + {\left( {{b^{(2)}}} \right)^T} \end{equation}$

(3.5)

Step 3: Calculation of the output layer. The output layer is set up with only one neuron, so the input to this layer can be expressed as ${z_9}$ and the output as ${g_9}\left({{z_9}} \right)$ according to Eq 3.6.

$\begin{equation} {Z^{(3)}} = {W^{(3)}}*{\left[ {{z_5},{z_6},{z_7},{z_8}} \right]^T} + {\left( {{b^{(3)}}} \right)^T} \end{equation}$

(3.6)

3.2.2. Backwards propagation calculation

The samples are fed into the Mo-BP neural network model, ${\widehat {\text{y}}}$ denotes the result obtained by forward propagation and $y$ denotes the true result. According to the chain rule and mathematical induction, the $k$ layer error value ${\zeta ^{(k)}}$ can be calculated from the $k+1$ layer error value ${{\zeta ^{(k + 1)}}}$ by the formula:

$\begin{equation} {\zeta ^{(k)}} = {g_k}^\prime \left( {{z^{(k)}}} \right) \odot \left[ {{{\left( {{{({\text{W}})}^{(k + 1)}}} \right)}^T}{\zeta ^{(k + 1)}}} \right] \end{equation}$

(3.7)

where ${L({\text{y}}, \widehat {\text{y}})}$ denotes the loss function of the metric loss and $\odot$ denotes the Hadamard product. The parameters are learned according to gradient descent, and the expressions for the partial derivatives of the weights $W$ and bias $b$ are:

$\begin{equation} \frac{{\partial L({\text{y}},\widehat {\text{y}})}}{{\partial {W^{(k)}}}} = {\zeta ^{(k)}}{\left( {{n^{(k - 1)}}} \right)^T} \end{equation}$

(3.8)

$\begin{equation} \frac{{\partial L({\text{y}},\widehat {\text{y}})}}{{\partial {b^{(k)}}}} = {\zeta ^{(k)}} \end{equation}$

(3.9)

The back propagation algorithm updates the weights $W$ and bias $b$ based on the error between the predicted and true values to minimise the error for the purpose of optimising the model.

3.3. Environmental state management based on multi-objective recurrent neural networks

Long Short Term Memory networks (LSTM), suitable for processing and predicting important events with relatively long intervals and delays in time series. In this paper, the environmental state management of aquaculture is to predict the water quality parameters at a future moment for the historical water quality parameters with time series characteristics. Compared with the traditional RNN, Mo-LSTM can improve the problem of gradient disappearance and gradient explosion.

The advantage of the Mo-LSTM is that it can store four states, the current and previous moment values of the output, and the current and previous moment values of the memory state vector. An LSTM cell consists of an input gate ${I_{\text{t}}}$ , a forgetting gate ${F_{\text{t}}}$ , an output gate ${O_{\text{t}}}$ and a memory cell ${C_{\text{t}}}$ . ${{{\text{x}}_{\text{t}}}}$ denotes the input data at moment $t$ and the formulas are expressed as follows:

$\begin{equation} {I_{\text{t}}} = \sigma \left( {{D_{\text{i}}}{{\text{x}}_{\text{t}}} + {{\text{W}}_{\text{i}}}{H_{{\text{t}} - 1}} + {{\text{b}}_{\text{i}}}} \right) \end{equation}$

(3.10)

$\begin{equation} {F_{\text{t}}} = \sigma \left( {{D_{\text{f}}}{{\text{x}}_{\text{t}}} + {{\text{W}}_{\text{f}}}{H_{{\text{t}} - 1}} + {{\text{b}}_{\text{f}}}} \right) \end{equation}$

(3.11)

$\begin{equation} {U_{\text{t}}} = \tanh \left( {{D_{\text{u}}}{{\text{x}}_{\text{t}}} + {{\text{W}}_{\text{u}}}{H_{{\text{t}} - 1}} + {{\text{b}}_{\text{u}}}} \right) \end{equation}$

(3.12)

$\begin{equation} {C_{\text{t}}} = {F_{\text{t}}}*{C_{{\text{t}} - 1}} + {I_{\text{t}}}*{U_{\text{t}}} \end{equation}$

(3.13)

$\begin{equation} {O_{\text{t}}} = \sigma \left( {{D_0}{{\text{x}}_{\text{t}}} + {{\text{W}}_0}{H_{{\text{t}} - 1}} + {{\text{b}}_0}} \right) \end{equation}$

(3.14)

$\begin{equation} {H_{\text{t}}} = {O_{\text{t}}}*\tanh \left( {{C_{\text{t}}}} \right) \end{equation}$

(3.15)

where $\sigma$ denotes the $sigmoid$ activation function and $tanh$ is also the activation function. $D$ and $W$ denote the weight matrices from the input layer and the hidden layer to each gate, respectively. ${H_{\text{t}}}$ denotes the parameter state of the hidden layer. ${U_{\text{t}}}$ denotes the intermediate temporary memory unit parameter state in the network transmission and ${C_{\text{t}}}$ denotes the memory unit parameter state in the network transmission. ${{\text{b}}_{\text{i}}}, {{\text{b}}_{\text{f}}}, {{\text{b}}_{\text{u}}}$ and ${{\text{b}}_0}$ denote the bias vectors of the four states of the network.

4. Experiments and analysis

4.1. Datasets and pre-processing

The experimental datasets in this case is from the fish farming laboratory of National Research Base of Intelligent Manufacturing Service, Chongqing Technology and Business University.

For the dataset Fish_status of fish condition management, after a 30-day video collection period, the videos of feeding periods were copied. Due to the storage period of the video recorder was 1 hour, the feeding videos were cropped using VSplayer to only retain the video segments from the beginning of feeding to the end of the fish feeding period. Each feeding segment is approximately 10 minutes long and about 200MB in size, with a total of 60 segments. The keyframe extraction algorithm was used to extract 24,000 images from the 60 feeding segments. The image resolution is 2560×1440, and the size of each image is approximately 700KB. 1000 images without any remaining feed on the water surface were selected from the original dataset as the dataset for measuring body length. 1000 images with remaining feed on the water surface and 1000 images without any remaining feed were manually selected as the training set, while 500 images were randomly selected as the test set. This dataset contains seven attributes of fish length, fish width, fish weight, water temperature, oxygen consumption rate, fish feeding frequency and feeding amount. The fish weight is predicted from the fish length and width data, the oxygen consumption rate is predicted from the fish weight and water temperature, and the feeding amount is predicted from the fish feeding frequency.

For dataset Water_quality_status of environmental condition management, a total of 4561 consecutive time-series real data were collected using sensors to detect water quality parameters. This dataset contains eight columns of water quality parameters: ${\text{NO}}_2^ - {\text{ - N}}$ (mg/L), ${\text{NO}}_3^ - {\text{ - N}}$ (mg/L), DO (mg/L), T, TN (mg/L), pH, COD (mg/L) and ${\text{NH}}_4^{\text{ + }}{\text{ - N}}$ (mg/L). DO is an important factor in maintaining the stability of aquatic ecosystems, and can affect the respiration and metabolism of aquatic organisms. In industrial aquaculture systems, monitoring and regulating the level of DO can ensure the normal growth and reproduction of aquatic organisms. Temperature is an important factor that affects the growth and metabolism of aquatic organisms, and its changes can directly affect the physiology and behavior of aquatic organisms. In industrial aquaculture systems, controlling water temperature can promote the growth and immunity of organisms and improve the efficiency of aquaculture. PH is an indicator of the acidity and alkalinity of water, which has a certain impact on the metabolism and growth of aquatic organisms. In industrial aquaculture systems, maintaining a stable pH value of the water can promote the normal metabolism and immunity of organisms. ${\text{NH}}_4^{\text{ + }}{\text{ - N}}$ , ${\text{NO}}_2^ - {\text{ - N}}$ , ${\text{NO}}_3^ - {\text{ - N}}$ and TN are common pollutants in water, and their content has a certain impact on the growth and health of aquatic organisms. In industrial aquaculture systems, monitoring and regulating the level of nitrogen can prevent the accumulation of pollutants in the water, ensuring the cleanliness of the water and the stability of the ecosystem. COD is an indicator of the content of organic matter in water, which can reflect the degree of water pollution. In industrial aquaculture systems, monitoring and controlling the level of chemical oxygen demand can prevent the excessive accumulation of organic matter in the water, ensuring the cleanliness of the water and the stability of the ecosystem.

Both Fish_status and Water_quality_status are raw sampled data and contain some unrealistic outliers. In order to improve the data quality, firstly, this experiment cleansed the data, removing duplicate data and dealing with missing values and outliers. Secondly, data normalisation was carried out to eliminate the effect of the difference in magnitude and range of values between different attributes on the final prediction results.

4.2. Parameter settings

The fish state management based on multi-objective BP neural network, mainly using known data to achieve multi-objective prediction of fish weight, oxygen consumption rate and feeding amount. For the parameter settings of the Mo-BP model: the number of hidden layers is set to 2, each hidden layer contains 4 neurons, the number of training rounds is 1000, the Tanh function is selected as the activation function, and the division ratio of training set to test set is 80%. The learning rates were set to 0.01, 0.05 and 0.08 respectively to explore the best prediction of the model. In order to fully test the validity of the fish state management experiments, five commonly used evaluation metrics were used in this experiment: MAE, MSE, RMSE, ${R^2}$ , $Adjusted\_{R^2}$ . Each group of experiments was performed five times and the average result was taken as the final valid value.

Based on multi-objective recurrent neural network environmental state management, mainly using Mo-LSTM model to process water quality parameters with historical time series characteristics to achieve the prediction of ${\text{NO}}_2^ - {\text{ - N}}$ , ${\text{NO}}_3^ - {\text{ - N}}$ , DO, T, TN, pH, COD and ${\text{NH}}_4^{\text{ + }}{\text{ - N}}$ of a total of eight target water quality parameters. For the Mo-LSTM model settings: step size is set to 1, input_size is set to 7, hidden_size is set to 2, training rounds are 400 times, and the division ratio of training set to test set is 80%. The learning rate was set to 0.01, 0.05 and 0.08 for the three sets of experiments. In order to verify the validity of the environmental state management experiments, GRU and Bi-LSTM models were introduced for comparison and three typical and commonly used evaluation metrics were used: MAE, MSE and RMSE. Each group of experiments was performed five times and the average result was taken as the final valid value.

4.3. Results and analysis

4.3.1. Fish status management results and discussion

In terms of fish status management, this paper proposes a multi-objective prediction model. As shown in Table 1, the assessed target values for fish weight, oxygen consumption rate and feeding amount are indicated at learning rates of 0.01, 0.05 and 0.08 respectively. The analysis of the single target prediction shows that better prediction can be shown when the learning rate is 0.01. In an overall analysis, fish weight, oxygen consumption rate and feeding amount differed for different learning rates, but all tended to be consistently similar, which can effectively demonstrate the validity of the method.

Table 1. Results of experiments on the prediction of fish weight, oxygen consumption rate and feeding amount.

Metrics	Fish weight			Oxygen consumption rate			Feeding amount
Metrics	lr=0.01	lr=0.05	lr=0.08	lr=0.01	lr=0.05	lr=0.08	lr=0.01	lr=0.05	lr=0.08
MAE	5.2866	5.6034	5.3497	0.9472	0.9482	0.9178	1.4090	1.5072	1.4937
MSE	48.1733	48.8271	49.5273	1.2781	1.4721	1.2068	2.4721	2.5025	2.5284
RMSE	6.9407	6.8276	7.0242	1.1305	1.2642	1.1972	1.5723	1.5798	1.6032

| Show Table

DownLoad: CSV

The prediction error analysis chart representing fish weight, oxygen consumption rate and feeding rate, as shown in Figure 3, provide a clearer analysis of the error in the model predictions. The blue curves in the chart indicate the true measured values and the orange curves indicate the results of the model predictions. As can be seen from Figure 3(a), the prediction error for fish weight is around 6.9g. The accuracy of the model prediction is relatively average. From Figure 3(b) it can be analysed that the error in the oxygen consumption rate of the fish floats at 1.1mg/L in a pond with an average fish weight of 15956.14g. That is, for every 10g of fish weight, the error in oxygen consumption rate fluctuates at 0.0007 mg/L and the model prediction is extremely accurate. From the analysis in Figure 3(c), it can be seen that 1.6g of the model fitted feeding error was introduced by the initial few feeding trials. This is evident from Table 1, whereas when fish feeding behaviour is evident, the model fitting error is not significant. Further analysis of the fitted curve in the chart shows that the curve approximates a primary function, indicating a significant positive correlation between feeding frequency and the amount that should be fed. This demonstrates that the method is a good predictor of feeding quantity to support accurate feeding decisions.

Figure 3. Prediction error analysis chart for fish weight, oxygen consumption rate and feeding amount.

DownLoad: Full-Size Img PowerPoint

As shown in , the ${R^2}$ and $Adjusted\_{R^2}$ are introduced in this paper in order to better explore the interpretation and relevance of the input features to the target of the fish state. ${R^2}$ is used to describe the degree to which the input variables explain the output variables, and $Adjusted\_{R^2}$ eliminates the effects of sample size and number of features on basis of ${R^2}$ , which better reflects the quality of the model assessment. shows the ${R^2}$ for fish weight, oxygen consumption rate and feeding amount; $Adjusted\_{R^2}$ was applied to multiple linear regression, so $Adjusted\_{R^2}$ metrics for fish weight and oxygen consumption rate were tested. All data in are based on a learning rate of 0.01. As can be seen from the chart, the $Adjusted\_{R^2}$ can reach 0.96 in predicting the weight of fish, which indicates that both characteristics, fish length and fish width, are very explanatory for the weight and that the model is effective in predicting the weight of fish. In terms of fish oxygen consumption rate, the $Adjusted\_{R^2}$ can reach 0.97, which indicates that both fish weight and water temperature have strong explanations for fish oxygen consumption rate. In terms of feeding amount prediction analysis, it can reach an ${R^2}$ of 0.9930, which indicates that the feeding frequency of the fish can influence the feeding amount by 99%.

Figure 4.

$R^2$ and

$Adjust\_{R^2}$ results of fish state management under multi-objective prediction.

DownLoad: Full-Size Img PowerPoint

4.3.2. Environmental status management results and discussion

In terms of environmental state management, an LSTM-based multi-object prediction model for water quality parameters is proposed in this paper. As shown in Table 2, this experiment tested the Co-LSTM proposed in this paper and the comparative models GRU and Bi-LSTM, yielding values for the eight prediction objects assessed at learning rates of 0.01, 0.05 and 0.08, respectively. In general, the difference in prediction error between the three models for the recycled water quality dataset is not significant, and the indicators of the model can reach a satisfactory stability at a learning rate of 0.01. As shown in and , in order to observe the prediction effect of the model more comprehensively and directly, this paper visualizes the MAE and MSE indicators of water quality parameters under Co-LSTM, GRU and Bi-LSTM models. In order to meet the carbon source requirements for microbial growth and water purification, an external carbon source (glucose) is added to the biofilter in industrial aquaculture system to maintain sufficient COD concentration. Because the value of COD is adjusted manually and is very different from the other seven parameters, it is not shown in the figure. Further analysis revealed that if the model uses MAE as the main evaluation metric, Co-LSTM was found to be better than GRU and Bi-LSTM in predicting several water quality parameters, including ${\text{NO}}_3^ - {\text{ - N}}$ , DO, T, TN, pH, COD and ${\text{NH}}_4^{\text{ + }}{\text{ - N}}$ , while GRU was slightly better in predicting ${\text{NO}}_2^ - {\text{ - N}}$ . Experience from artificial culture shows that the parameters that have a greater impact on water quality conditions are mainly DO, COD and ${\text{NH}}_4^{\text{ + }}{\text{ - N}}$ . In the prediction of these water quality parameters, Mo-LSTM is superior to GRU and Bi-LSTM. Therefore, it is concluded that Mo-LSTM is superior to GRU and Bi-LSTM in the environmental state management prediction model.

Table 2. Results of multi-object water quality parameter predictions for environmental condition management.

Object	Model	lr=0.01		lr=0.05		lr=0.08
Object	Model	MAE	MSE	MAE	MSE	MAE	MSE
${\text{NO}}_2^ - {\text{ - N}}$	Mo-LSTM	0.0058	0.0008	0.0062	0.0011	0.0081	0.0013
	GRU	0.0024	0.0006	0.0027	0.0007	0.0031	0.0012
	Bi-LSTM	0.0061	0.00011	0.0068	0.0007	0.0089	0.0021
${\text{NO}}_3^ - {\text{ - N}}$	Mo-LSTM	0.4627	0.2349	0.4537	0.2174	0.4921	0.3845
	GRU	0.4694	0.2287	0.4802	0.2088	0.5692	0.2901
	Bi-LSTM	0.4657	0.2248	0.4731	0.2114	0.5233	0.3646
DO	Mo-LSTM	0.258	0.0873	0.2573	0.0892	0.2689	0.0952
	GRU	0.2563	0.0934	0.2597	0.0945	0.3082	0.1029
	Bi-LSTM	0.2572	0.0892	0.2581	0.0918	0.2894	0.1005
T	Mo-LSTM	0.1778	0.0414	0.1853	0.0492	0.1728	0.0479
	GRU	0.3249	0.11	0.3381	0.1195	0.3317	0.1204
	Bi-LSTM	0.2379	0.0817	0.2181	0.0837	0.2648	0.0953
TN	Mo-LSTM	0.2753	0.0809	0.2742	0.0800	0.2829	0.0926
	GRU	0.4017	0.1673	0.3902	0.1559	0.4192	0.1857
	Bi-LSTM	0.3974	0.1185	0.3147	0.1192	0.3712	0.1203
pH	Mo-LSTM	0.0402	0.0029	0.0481	0.0039	0.0401	0.0018
	GRU	0.0471	0.0038	0.0512	0.0041	0.0425	0.0034
	Bi-LSTM	0.0438	0.0037	0.0494	0.0052	0.0418	0.0027
COD	Mo-LSTM	1.9185	5.3993	1.9022	5.3210	1.8937	5.2877
	GRU	3.3124	22.1170	3.3019	22.0911	3.3298	23.0374
	Bi-LSTM	2.8371	17.2849	2.6290	16.8407	2.8928	15.3779
${\text{NH}}_4^{\text{ + }}{\text{ - N}}$	Mo-LSTM	0.0152	0.0003	0.0159	0.0014	0.0184	0.0017
	GRU	0.0165	0.0004	0.0169	0.0017	0.0216	0.0021
	Bi-LSTM	0.0171	0.0003	0.0162	0.0019	0.0207	0.0019

| Show Table

DownLoad: CSV

Figure 5. Comparison of MAE values of water quality parameters.

DownLoad: Full-Size Img PowerPoint

Figure 6. Comparison of MSE values of water quality parameters.

DownLoad: Full-Size Img PowerPoint

As shown in Figure 7, this experiment also tested the RMSE values of the eight water quality parameters at different learning rate settings, which were analysed visually to increase the validity of the model evaluation. It is evident from the chart that the Mo-LSTM model represents a curve that lies steadily below the GRU and Bi-LSTM curves. In summary, the validity of using Mo-LSTM neural networks to construct models for the prediction of environmental water quality is verified.

Figure 7. RMSE results of environmental state management under multi objective prediction.

DownLoad: Full-Size Img PowerPoint

4.3.3. Results and discussion

The intelligent digital industrial aquaculture in this paper analyses the results of the experiments in terms of fish state management and environmental state management respectively. For the former, the results in Table 1, Figure 3 and Figure 4 can well demonstrate the superiority of multi-object BP neural network-based fish state management. For the latter, the results in Table 2, Figure 5, Figure 6 and Figure 7 can well demonstrate the superiority of multi-object recurrent neural network-based environmental state management. The realisation of these results can be attributed to two possible aspects.

On the one hand, the experimental data in this paper were collected autonomously by the experimental group, and the real reliability of the data can bring a great advantage to the training of the model. On the other hand, the construction of a multi-object deep neural network-based model brings more stability to the prediction results.

5. Conclusion

This paper successfully proposes a multi-object deep neural network-based intelligent management method for digital industrial aquaculture for effectively improving multiple business requirements in aquaculture. The method is divided into two parts: fish state management and environmental state management. On the one hand, the BP neural network based on double hidden layers is used to build a multi-objective prediction model to accurately predict the weight, oxygen consumption rate and feeding amount of fish. On the other hand, the LSTM-based neural network is used to build a multi-objective prediction model to realise the prediction of water quality parameters for the aquaculture environment. Promote the intelligent management of aquaculture by combining intelligent algorithm and data drive. The experiments on real datasets show that the Mo-DIA method proposed in this paper has high performance and feasibility. In the future work, it is expected that more useful data can be collected to increase the feature magnitude and adjust the scheme model framework, so as to better improve the efficiency and accuracy of the intelligent management scheme of digital industrial aquaculture.

6. Acknowledgement

This work was funded by the Researchers Supporting Project number (RSPD2023R681) King Saud University, Riyadh, Saudi Arabia, also funded by the Natural Science Foundation of Chongqing Science & Technology Commission (cstc2020jcyj- msxmX0721), Science and Technology Research Program of Chongqing Municipal Education Commission (KJQN202000810), Talent Introduction Funding of Chongqing Technology and Business University (2053031), and Teaching reform project of Chongqing Technology and Business University (212027).

References

[1]	W. Zhang, F. Wang, H. Zhao. J. Zhang, Topic extraction and analysis of S&T policies based on structural decomposition of government documents in Chinese, Stud. Sci. Sci., 38 (2020), 1185–1196. https://doi.org/10.16192/j.cnki.1003-2053.2020.07.005 doi: 10.16192/j.cnki.1003-2053.2020.07.005
[2]	Y. Zhai, The theoretical evolution and practical exploration of China's e-government development in the past 40 years of reform and opening up: from business online to service online in Chinese, E-Gov., 12 (2018), 80–89. https://doi.org/10.16582/j.cnki.dzzw.2018.12.008 doi: 10.16582/j.cnki.dzzw.2018.12.008
[3]	Q. Qin, X. Zhang, On the institutional attribute of documents jointly issued by the party and government in Chinese, J. Party School Cent. Comm. CPC. (Chin. Acad. Governance), 25 (2021), 120–128. https://doi.org/10.14119/j.cnki.zgxb.2021.04.014 doi: 10.14119/j.cnki.zgxb.2021.04.014
[4]	Y. Yang, F. Wang, Whether deleveraging task conflicts with industrial policy: an empirical study based on enterprises' debt structure in Chinese, J. Zhongnan Univ. Econ. Law, 2 (2020), 3–13. https://doi.org/10.19639/j.cnki.issn1003-5230.20200427.003 doi: 10.19639/j.cnki.issn1003-5230.20200427.003
[5]	S. Sun, C. Luo, J. Chen, A review of natural language processing techniques for opinion mining systems, Inf. Fusion, 36 (2017), 10–25. https://doi.org/10.1016/j.inffus.2016.10.004 doi: 10.1016/j.inffus.2016.10.004
[6]	J. Gong, X. Huang, Text categorization framework based on improved TF-IDF and k-nearest neighbor in Chinese, Comput. Eng. Des., 39 (2018), 1340–1344+1349. https://doi.org/10.16208/j.issn1000-7024.2018.05.024 doi: 10.16208/j.issn1000-7024.2018.05.024
[7]	M. T. Zulfikar, Suharjito, Detection traffic congestion based on Twitter data using machine learning, Procedia Comput. Sci., 2019 (2019), 118–124. https://doi.org/10.1016/j.procs.2019.08.148 doi: 10.1016/j.procs.2019.08.148
[8]	B. Zhao, L. Wang, H. Guo, Weighted naive bayes text classification algorithm based on poisson distribution in Chinese, Comput. Eng., 46 (2020), 91–96. https://doi.org/10.19678/j.issn.1000-3428.0054056 doi: 10.19678/j.issn.1000-3428.0054056
[9]	X. Huang, G. Liu, X. Liu, A. Yang, Sentiment classification depth model based on word2vec and bi-directional LSTM in Chinese, Appl. Res. Comput., 36 (2019), 3583–3587+3596. https://doi.org/10.19734/j.issn.1001-3695.2018.08.0599 doi: 10.19734/j.issn.1001-3695.2018.08.0599
[10]	K. Cheng, Y. Yue, Z. Song, Sentiment classification based on part-of-speech and self-attention mechanism, IEEE Access, 2020 (2020), 16387–16396. https://doi.org/10.1109/ACCESS.2020.2967103 doi: 10.1109/ACCESS.2020.2967103
[11]	X. Wu, L. Chen, T. Wei, T. T. Fan, Sentiment analysis of chinese short text based on self-attention and Bi-LSTM in Chinese, J. Chin. Inf. Process., 33 (2019), 100–107. https://doi.org/10.3969/j.issn.1003-0077.2019.06.015 doi: 10.3969/j.issn.1003-0077.2019.06.015
[12]	T. Li, D. Ji, Sentiment analysis of micro-blog based on SVM and CRF using various combinations of features in Chinese, Appl. Res. Comput., 32 (2015), 978–981. https://doi.org/10.3969/j.issn.1001-3695.2015.04.004 doi: 10.3969/j.issn.1001-3695.2015.04.004
[13]	A. Goel, J. Gautam, S. Kumar, Real time sentiment analysis of tweets using naive bayes, in 2016 2nd International Conference on Next Generation Computing Technologies (NGCT), (2016), 257–261. https://doi.org/10.1109/NGCT.2016.7877424
[14]	J. Ababneh, Application of naive bayes, decision tree, and k-nearest neighbors for automated text classification, Mod. Appl. Sci., 13 (2019), 31–36. https://doi.org/10.5539/mas.v13n11p31 doi: 10.5539/mas.v13n11p31
[15]	R. Ahuja, S. C. Sharma, Sentiment analysis on different domains using machine learning algorithms, Adv. Data Inf. Sci., 2022 (2022), 143–153. https://doi.org/10.1007/978-981-16-5689-7_13 doi: 10.1007/978-981-16-5689-7_13
[16]	Y. Kim, Convolutional neural networks for sentence classification, arXiv preprint, (2014), arXiv: 1408.5882. https://doi.org/10.48550/arXiv.1408.5882
[17]	R. Johnson, Z. Tong, Deep pyramid convolutional neural networks for text categorization, in Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, (2017), 562–570. https://doi.org/10.18653/v1/p17-1052
[18]	S. Yu, D. Liu, Y. Zhang, S. Zhao, W. Wang, DPTCN: A novel deep CNN model for short text classification, J. Intell. Fuzzy Syst., 6 (2021), 7093–7100. https://doi.org/10.3233/JIFS-210970 doi: 10.3233/JIFS-210970
[19]	Y. Wang, A. Sun, J. Han, Y. Liu, X. Zhu, Sentiment analysis by capsules, in the Web Conference, (2018), 1165–1174. https://doi.org/10.1145/3178876.3186015
[20]	R. Wang, Z. Li, J. Cao, C. Tong, W. Lei, Convolutional recurrent neural networks for text classification, in 2019 International Joint Conference on Neural Networks, (2019), 1–6. https://doi.org/10.1109/IJCNN.2019.8852406
[21]	A. Singh, S. K. Dargar, A. Gupta, A. Kumar, A. K. Srivastava, M. Srivastava, et al., Evolving long short-term memory network-based text classification, Comput. Intell. Neurosci., 2022 (2022), 1687–5265. https://doi.org/10.1155/2022/4725639 doi: 10.1155/2022/4725639
[22]	D. Bahdanau, K. Cho, Y. Bengio, Neural machine translation by jointly learning to align and translate, arXiv preprint, (2014), arXiv: 1409.0473. https://doi.org/10.48550/arXiv.1409.0473
[23]	Y. Wang, M. Huang, X. Zhu, L. Zhao, Attention-based LSTM for aspect-level sentiment classification, in Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, (2016), 606–615. https://doi.org/0.18653/v1/D16-1058
[24]	Y. Liu, P. Li, X. Hu, Combining context-relevant features with multi-stage attention network for short text classification, Comput. Speech Lang., 71 (2021), 1–14. https://doi.org/10.1016/j.csl.2021.101268 doi: 10.1016/j.csl.2021.101268
[25]	C. Hu, N. Liang, Deeper attention-based LSTM for aspect sentiment analysis in Chinese, Appl. Res. Comput., 36 (2019), 1075–1079, https://doi.org/10.19734/j.issn.1001-3695.2017.11.0736 doi: 10.19734/j.issn.1001-3695.2017.11.0736
[26]	T. Mikolov, K. Chen, G. Corrado, J. Dean, Efficient estimation of word representations in vector space, arXiv preprint, (2013), arXiv: 1301.3781. https://doi.org/10.48550/arXiv.1301.3781
[27]	J. Bu, L. Ren, S. Zheng, Y. Yang, J. Wang, F. Zhang, et al., ASAP: A chinese review dataset towards aspect category sentiment analysis and rating prediction, arXiv preprint, (2021), arXiv: 2103.06605. https://doi.org/10.48550/arXiv.2103.06605
[28]	F. Song, L. Gao, Performance evaluation metric for text classifiers in Chinese, Comput. Eng., 30 (2004), 107–127. https://doi.org/10.3969/j.issn.1000-3428.2004.13.044 doi: 10.3969/j.issn.1000-3428.2004.13.044
[29]	D. Ma, S. Li, X. Zhang, H. Wang, Interactive attention networks for aspect-level sentiment classification, in Twenty-Sixth International Joint Conference on Artificial Intelligence, (2017), 1–7. https://doi.org/10.24963/ijcai.2017/568
[30]	A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. Gomez, et al., Attention is all you need, arXiv preprint, (2017), arXiv: 1706.03762. https://doi.org/10.48550/arXiv.1706.03762
[31]	P. Zhou, W. Shi, J. Tian, Z. Qi, B. Li, H. Hao, et al., Attention-based bidirectional long short-term memory networks for relation classification, in Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, (2016), 207–212. https://doi.org/10.18653/v1/P16-2034
[32]	S. Sabour, N. Frosst, G. E. Hinton, Dynamic routing between capsules, in Proceedings of the 31st International Conference on Neural Information Processing Systems, (2017), 3859–3869.

This article has been cited by:

1.	Xiaohong Peng, Tianyu Zhou, Zhenlu Wu, Zhao Li, 2023, A Survey of Deep Learning for Intelligent Feeding in Smart Fish Farming, 9798400716485, 590, 10.1145/3653081.3653179
2.	Sherine Ragab, Seyed Hossein Hoseinifar, Hien Van Doan, Waldemar Rossi, Simon Davies, Mohamed Ashour, Ehab El-Haroun, Overview of aquaculture Artificial Intelligence (AAI) applications: enhance sustainability and productivity, reduce labor costs, and increase the quality of aquatic products, 2024, 2300-8733, 10.2478/aoas-2024-0075
3.	Juqiang Feng, Feng Cai, Long Wu, Xing Zhang, Kaifeng Huang, State of charge estimation for lithium‐ion battery pack based on real vehicle data and optimized backpropagation method by adaptive cross mutation sparrow search algorithm, 2024, 12, 2050-0505, 896, 10.1002/ese3.1656
4.	Zheng Zhang, Xiang Lu, Shouqi Cao, An efficient detection model based on improved YOLOv5s for abnormal surface features of fish, 2024, 21, 1551-0018, 1765, 10.3934/mbe.2024076
5.	Thales Francisco Gonçalves , Johanna Marcela Concha Obando, Luiz Cláudio Chiavani Júnior, Ana Paula Andrade-Santos, Esthefany Caroline França Silva, Thalisia Cunha dos Santos, Roberto Kazuyoshi Naoe, Érico Tadao Teramoto, Guilherme Wolff Bueno, Piscicultura inteligente: a integração das Tecnologia 4.0 e “Business Intelligence” para gestão ágil na aquicultura, 2025, 16, 2178-9010, e4524, 10.7769/gesec.v16i1.4524

Reader Comments

Your name:*

Email:*
© 2023 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Electronic Research Archive

1 1.3

Metrics

Article views(1262) PDF downloads(55) Cited by(0)

Preview PDF

Download XML

Export Citation

Article outline

Show full outline

Figures and Tables

Figures(1) / Tables(6)

Electronic Research Archive

Sentence opinion mining model for fusing target entities in official government documents

Related Papers:

Abstract

1. Introduction

2. Problem statement

3. Methodology

3.1. Overview

3.2. Fish state management based on multi-objective BP neural networks

3.2.1. Forward propagation calculation

3.2.2. Backwards propagation calculation

3.3. Environmental state management based on multi-objective recurrent neural networks

4. Experiments and analysis

4.1. Datasets and pre-processing

4.2. Parameter settings

4.3. Results and analysis

4.3.1. Fish status management results and discussion

4.3.2. Environmental status management results and discussion

4.3.3. Results and discussion

5. Conclusion

6. Acknowledgement

References

This article has been cited by:

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Figures and Tables

Other Articles By Authors

Catalog

Electronic Research Archive

Sentence opinion mining model for fusing target entities in official government documents

Related Papers:

Abstract

1. Introduction

2. Problem statement

3. Methodology

3.1. Overview

3.2. Fish state management based on multi-objective BP neural networks

3.2.1. Forward propagation calculation

3.2.2. Backwards propagation calculation

3.3. Environmental state management based on multi-objective recurrent neural networks

4. Experiments and analysis

4.1. Datasets and pre-processing

4.2. Parameter settings

4.3. Results and analysis

4.3.1. Fish status management results and discussion

4.3.2. Environmental status management results and discussion

4.3.3. Results and discussion

5. Conclusion

6. Acknowledgement

References

This article has been cited by:

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Figures and Tables

Other Articles By Authors

Related pages

Tools

Export File

Citation

Format

Content

Catalog