Span-based model for overlapping entity recognition and multi-relations classification in the food domain

Mengqi Zhang; Lei Ma; Yanzhao Ren; Ganggang Zhang; Xinliang Liu; Mengqi Zhang; Lei Ma; Yanzhao Ren; Ganggang Zhang; Xinliang Liu

doi:10.3934/mbe.2022240

Mathematical Biosciences and Engineering

2022, Volume 19, Issue 5: 5134-5152. doi: 10.3934/mbe.2022240

Previous Article Next Article

Research article Special Issues

Span-based model for overlapping entity recognition and multi-relations classification in the food domain

1.
School of E-business and Logistics, Beijing Technology and Business University, Beijing 100048, China
2.
National Engineering Laboratory for Agri-product Quality Traceability, Beijing Technology and Business University, Beijing 100048, China
3.
School of Computer Science and Engineering, Beijing Technology and Business University, Beijing 100048, China
4.
Digital Campus Construction Center, Capital Normal University, Beijing 100048, China

Academic Editor: Runzhang Xu

Received: 17 December 2021 Revised: 24 February 2022 Accepted: 15 March 2022 Published: 18 March 2022

Information extraction (IE) is an important part of the entire knowledge graph lifecycle. In the food domain, extracting information such as ingredient and cooking method from Chinese recipes is crucial to safety risk analysis and identification of ingredient. In comparison with English, due to the complex structure, the richness of information in word combination, and lack of tense, Chinese IE is much more challenging. This dilemma is particularly prominent in the food domain with high-density knowledge, imprecise syntactic structure. However, existing IE methods focus only on the features of entities in a sentence, such as context and position, and ignore features of the entity itself and the influence of self attributes on prediction of inter entity relationship. To solve the problems of overlapping entity recognition and multi-relations classification in the food domain, we propose a span-based model known as SpIE for IE. The SpIE uses the span representation for each possible candidate entity to capture span-level features, which transforms named entity recognition (NER) into a classification mission. Besides, SpIE feeds extra information about the entity into the relation classification (RC) model by considering the effect of entity's attributes (both the entity mention and entity type) on the relationship between entity pairs. We apply SpIE on two datasets and observe that SpIE significantly outperforms the previous neural approaches due to capture the feature of overlapping entity and entity attributes, and it remains very competitive in general IE.

Keywords:

Citation: Mengqi Zhang, Lei Ma, Yanzhao Ren, Ganggang Zhang, Xinliang Liu. Span-based model for overlapping entity recognition and multi-relations classification in the food domain[J]. Mathematical Biosciences and Engineering, 2022, 19(5): 5134-5152. doi: 10.3934/mbe.2022240

Related Papers:

[1]	Zhenzhong Xu, Xu Chen, Linchao Yang, Jiangtao Xu, Shenghan Zhou . Multi-modal adaptive feature extraction for early-stage weak fault diagnosis in bearings. Electronic Research Archive, 2024, 32(6): 4074-4095. doi: 10.3934/era.2024183
[2]	Ramalingam Sakthivel, Palanisamy Selvaraj, Oh-Min Kwon, Seong-Gon Choi, Rathinasamy Sakthivel . Robust memory control design for semi-Markovian jump systems with cyber attacks. Electronic Research Archive, 2023, 31(12): 7496-7510. doi: 10.3934/era.2023378
[3]	Nan Xu, Longbin Yang, Andrea Lazzaretto, Massimo Masi, Zhenyu Shen, YunPeng Fu, JiaMeng Wang . Fault location in a marine low speed two stroke diesel engine using the characteristic curves method. Electronic Research Archive, 2023, 31(7): 3915-3942. doi: 10.3934/era.2023199
[4]	Jichen Hu, Ming Zhu, Tian Chen . The nonlinear observer-based fault diagnosis method for the high altitude airship. Electronic Research Archive, 2025, 33(2): 907-930. doi: 10.3934/era.2025041
[5]	Xu Chen, Wenbing Chang, Yongxiang Li, Zhao He, Xiang Ma, Shenghan Zhou . Resnet-1DCNN-REA bearing fault diagnosis method based on multi-source and multi-modal information fusion. Electronic Research Archive, 2024, 32(11): 6276-6300. doi: 10.3934/era.2024292
[6]	Zhizhou Zhang, Yueliang Pan, Weilong Zhao, Jinchu Zhang, Zheng Zi, Yuan Xie, Hehong Zhang . Frequency analysis of a discrete-time fast nonlinear tracking differentiator algorithm based on isochronic region method. Electronic Research Archive, 2024, 32(9): 5157-5175. doi: 10.3934/era.2024238
[7]	Xiaoyan Wu, Guowen Ye, Yongming Liu, Zhuanzhe Zhao, Zhibo Liu, Yu Chen . Application of Improved Jellyfish Search algorithm in Rotate Vector reducer fault diagnosis. Electronic Research Archive, 2023, 31(8): 4882-4906. doi: 10.3934/era.2023250
[8]	Xingyue Liu, Kaibo Shi, Yiqian Tang, Lin Tang, Youhua Wei, Yingjun Han . A novel adaptive event-triggered reliable $H_\infty$ control approach for networked control systems with actuator faults. Electronic Research Archive, 2023, 31(4): 1840-1862. doi: 10.3934/era.2023095
[9]	Sahar Badri . HO-CER: Hybrid-optimization-based convolutional ensemble random forest for data security in healthcare applications using blockchain technology. Electronic Research Archive, 2023, 31(9): 5466-5484. doi: 10.3934/era.2023278
[10]	Arvind Prasad, Shalini Chandra, Ibrahim Atoum, Naved Ahmad, Yazeed Alqahhas . A collaborative prediction approach to defend against amplified reflection and exploitation attacks. Electronic Research Archive, 2023, 31(10): 6045-6070. doi: 10.3934/era.2023308

Abstract

1. Introduction

Optimization algorithm refers to the process of finding the best combination for a set of decision variables to solve a specific problem. For the complex optimization problems emerging in various fields such as engineering, economy, and medicine, it is not easy to find the optimal global solution by using traditional methods of mathematical optimization. However, the swarm intelligence optimization algorithm, which simulates the behavior of natural organisms, has successfully solved many complex optimization problems ^[1,2,3,4]. With the in-depth understanding of biological organisms, researchers have successively developed a series of swarm intelligence optimization algorithms, such as Particle Swarm Optimization (PSO) ^[5], Ant Colony Optimization (ACO) ^[6], Artificial Bee Colony (ABC) ^[7], Firefly Algorithm (FA) ^[8], Grey Wolf Optimizer (GWO) ^[9], Seagull Optimization Algorithm (SOA) ^[10], Slime Mould Algorithm (SMA) ^[11] and so on.

Mirjalili et al. ^[12] proposed a Salp Swarm Algorithm (SSA) in 2017, which is a new swarm intelligence optimization algorithm. Its optimization idea came from the population mechanism of the salp swarm chain foraging in the ocean. Once the SSA was proposed, it has attracted the extensive attention of many scholars because of its simple principle and easy implementation. Currently, this algorithm has been widely used in the fields of feature extraction ^[13,14], image segmentation ^[15,16], dispatch optimization ^[17], nodes localization ^[18], and so on.

The essence of the SSA is a random search optimization algorithm. It has the shortcomings of low accuracy in the later stage of iteration and is easy to get stuck at local optima. As a meta-heuristic algorithm, the searching behavior of SSA is divided into two main phases: exploration and exploitation phases. In exploration phase, it can efficiently discover the search space mostly by randomization, but it may face abrupt changes. In exploitation phase, it converges toward the most promising region. But, SSA often traps into local optima due to its stochastic nature and lack of balancing between exploration and exploitation. Thus, from this point, many studies have been presented to improve the performance of SSA and to overcome these defects.

Some improvement of single strategy has applied to enhance the performance of SSA by scholars. Sayed et al. ^[19] used a chaotic mapping sequence to take place the random parameter, which significantly improved the convergence rate and resulting precision of SSA. Abbassi et al. ^[20] proposed an Opposition-based Learning Modified Salp Swarm Algorithm (OLMSSA) for the accurate identification of circuit parameters. Singh et al. ^[21] updated the position of the salp swarm by sine cosine to enhance the exploration and exploitation capability. Syed et al. ^[22] proposed a strategy based on the weighted distance position update called the Weighted Salp Swarm Algorithm (WSSA) to enhance the performance and convergence rate of the SSA. Singh et al. ^[23] proposed a Hybrid SSA-PSO algorithm (HSSAPSO) by adding the speed optimization method of particle swarm optimization algorithm in the position update stage of salp swarm to avoid premature convergence of the optimal solution in the search space.

Moreover, multi-strategy improvement is adopted to enhance the SSA and has achieved good results with the development of research and application of SSA. Zhang et al. ^[24] used the Gaussian Barebone and stochastic fractal search mechanism to balance the global search ability and local search ability of the basic SSA. Liu et al. ^[25] proposed a new SSA-based method named MCSSA, in which the structure of SSA is rearranged using a chaos-assisted exploitation strategy and multi-population foundation to enhance its performance. Zhang et al. ^[26] proposed an ensemble composite mutation strategy to boost the exploitation and exploration trends of SSA, as well as a restart strategy to assist salps in getting away from local optimum. Zhao et al. ^[27] made an improvement of SSA called AGSSA, in which an adaptive control parameter is introduced into the position update stage of followers to boost the local exploitative ability of the population, and the elite gray wolf domination strategy is introduced in the last stage of the population position update to help the population find the global optimal solution faster. Zhang et al. ^[28] presented a chaotic SSA with differential evolution (CDESSA), and in the proposed framework, chaotic initialization is utilized to produce a better initial population aim at locating a better global optimal, and the differential evolution is used to build up the search capability of each agent. Xia et al. ^[29] proposed a QBSSA, in which an adaptive barebones strategy help to reach both accurate convergence speed and high solution quality and a quasi-oppositional-based learning make the population away from trapping into local optimal and expand the search space. Zhang et al. ^[30] proposed an enhanced SSA (ESSA), which improves the performance of SSA by embedding strategies such as orthogonal learning, quadratic interpolation, and generalized oppositional learning.

2. Motivation and innovation

Although the basic SSA enriches some characteristics like fast convergence speed and simple implementation, it may trap at sub-optimal solutions easily in some cases when handling the more complex optimization problems. Some improved algorithms of SSA have been provided by scholars mentioned as above, but each algorithm has its own merits and drawbacks. Hence, there is no guarantee which algorithm is best suited for a specific problem according to the "No free lunch" theorem ^[31]. In practical applications, there are special requirements on the accuracy or the convergence speed of the algorithm, so it is necessary to adopt more strategies.

In basic SSA, when the whole swarm of salps falls into a sub-optimal solution, the algorithm is trapped at that local solution and eventually stagnate at that suboptimal solution. So, we proposed a strategy of using dimension-by-dimension centroid opposition-based learning to make the slap population get more wide search space. Moreover, a random factor is used to increase the randomness of the population distribution and PSO's social learning strategy is added to speed up convergence.

The main contributions of this paper are as follows:

1) An improved algorithm which combines dimension-by-dimension centroid opposition-based learning, random factor, and PSO's social learning strategy (DCORSSA-PSO) is proposed.

2) The performance of proposed DCORSSA-PSO is verified by comparing it with several well-known algorithms in benchmark functions.

3) The proposed DCORSSA-PSO is used to the design of system reliability optimization based on T-S fault tree and has achieved good result.

The remainder section of this article is structured as follows. Section 3 introduces the basic principles of SSA. Section 4 introduces the mathematical principles of dimension-by-dimension centroid opposition-based learning, the addition of random factor and PSO' social learning, and proposes DCOSSA, DCORSSA and DCORSSA-PSO. Section 5 contains simulation experiment and result analysis. In Section 6, the efficacy of the proposed DCORSSA-PSO is assessed on engineering design of system reliability optimization. Finally, conclusions and future works are summarized in Section 7.

3. Basic principles of salp swarm algorithm

The salp swarm algorithm was proposed by Mirjalili et al. in 2017 ^[12], which is a heuristic swarm intelligent optimization algorithm that simulates the navigating and foraging behavior of salps. The salp chain consists of two types of salps: leader and follower. The leader is the salp at the head of the salp chain, and the other salps are considered followers. To enhance the population diversity of the algorithm and enhance the ability to jump out of the local optima, half the salps are selected as the leaders.

The position update equation of the leader is as follows ^[32]:

$x_j^i = \left\{ {\begin{array}{*{20}{c}} {{F_j} + {c_1}\left( {\left( {u{b_j} - l{b_j}} \right){c_2} + l{b_j}} \right)}&{0 \leqslant {c_3} < 0.5} \\ {{F_j} - {c_1}\left( {\left( {u{b_j} - l{b_j}} \right){c_2} + l{b_j}} \right)}&{0.5 \leqslant {c_3} \leqslant 1} \end{array}} \right.$

(1)

where $x_j^i$ is the position of the ith leader in the jth dimension; F_j is the position of food source in the jth dimension; ub_j and lb_j indicate the upper and lower bound of the jth dimension, respectively; c₂ and c₃ are random numbers in the range [0, 1], which decide respectively the moving step and the moving direction (positive or negative) of the jth dimension. c₁ is the convergence factor, which is used to balance the exploration and exploitation ability of the algorithm in the iterative process. c₁ is defined as follows:

${c_1} = 2{e^{ - {{(4t/T)}^2}}}$

(2)

where t is the current number of iterations and T is the maximum number of iterations.

The position update equation of followers is as follows:

$x_j^i = \frac{1}{2}(x_j^i + x_j^{i - 1})$

(3)

where i ≥ 2 and $x_j^i$ is the position of the ith follower in the jth dimension. Eq (3) shows that the ith follower in the jth dimension is updated according to the center of the previous generation of the ith follower and the i-1th follower in the jth dimension.

4. An improved salp swarm algorithm

Although the SSA is experienced to reach good accuracy compared with recent meta-heuristics, it may still face the shortcomings of getting trapped in local optima and is not suitable for highly complex optimization functions. To extend the search capability of SSA, a new hybrid salp swarm algorithm based on dimension-by-dimension centroid opposition-based learning strategy, random factor, and PSO's social learning strategy (DCORSSA-PSO) is proposed to solve engineering problems.

4.1. DCOSSA

4.1.1. Opposition-based learning

Opposition-based learning is a novel learning strategy proposed by Tizhoosh in 2005 ^[33]. The principle is that the current optimal solution and the opposition-based learning solution are searched for simultaneously during the population iteration and the better one in these two solutions is retained to the next generation according to the fitness value. This searching method improves the population diversity and enhances the ability of the algorithm to jump out of the local solutions.

Definition 1: Let x∈R be a real number and x∈[lb, ub]. The opposite number $\tilde x$ is defined as follows:

$\tilde x = lb + ub - x$

(4)

Analogously, the opposite number in a multidimensional case is also defined.

Definition 2: Let $X = \left({{x_1}, {x_2}, \cdots, {x_D}} \right)$ be a point in a D-dimensional coordinate system with x₁, ..., x_D∈R, and x_j∈ [lb_j, ub_j]. The opposite point $\tilde X = \left({{{\tilde x}_1}, {{\tilde x}_2}, \cdots, {{\tilde x}_D}} \right)$ is defined as:

${\tilde x_j} = l{b_j} + u{b_j} - {x_j}, {\text{ }}1 \leqslant j \leqslant D$

(5)

4.1.2. Centroid opposition-based learning

When the opposition-based learning method calculates the opposite point, two boundaries (min and max) are taken from two extreme points in the population for every dimension. The remaining points of the population are not considered, so this represents a weakness in terms of convergence speed. A Centroid Opposition-Based Computing (COBC) was proposed by Rahnamayan ^[34], which takes the entire population to generate the opposite points and hence improves the convergence speed and solution accuracy.

Let (X₁, ..., X_n) be n points in D-dimensional search space with each point in that space carrying a unit mass. Then the centroid of the body can be defined as follows:

$M = \frac{{{X_1} + {X_2} + \cdots + {X_n}}}{n}$

(6)

where then we have

${M_j} = \frac{{\sum\limits_{i = 1}^n {{x_{i, j}}} }}{n}$

(7)

where x_{i, j} is the jth dimension of ith point, M_j is the jth dimension centroid of all n points.

The opposite-point ${\tilde X_i}$ of the point X_i is calculated as follows:

$\left\{ {\begin{array}{*{20}{c}} {{{\tilde X}_i} = 2 \times M - {X_i}} \\ {{{\tilde x}_{i, j}} = 2 \times {M_j} - {x_{i, j}}, {\text{ }}1 \leqslant j \leqslant D} \end{array}} \right.$

(8)

4.1.3. Dimension-by-dimension centroid opposition-based learning

At present, most algorithms adopt the method of variation for all dimension information of the population and select or eliminate evolution by comparing the fitness values of different individuals as a whole for all dimension, however, it is difficult to ensure that each selected dimensional information of the evolutionary individual is better than that of the eliminated individual. Therefore, there is often inter-dimensional interference in the calculation of the fitness value, which masks the information of evolution dimension and reduces the quality of solution and the convergence speed.

The dimension-by-dimension update evaluation strategy refers to evaluating the fitness value separately for each dimension, which can reduce the inter-dimensional interference between each dimension of the individual and avoid the problem of low variation efficiency. This operation for population individuals can more accurately evaluate the fitness value in each iteration. However, for the high-dimensional test functions, it will greatly increase the time complexity of the algorithm. Through the test, it is found that better results can still be achieved by updating the dimensional information of the optimal individual instead of each individual.

4.1.4. DCOSSA

This paper combines the dimension-by-dimension update strategy with the centroid opposition-based learning strategy, and proposes a Dimension-by-dimension Centroid Opposition-based learning Salp Swarm Algorithm (DCOSSA). The basic step is: Firstly, calculate the opposite point of the centroid of the current population through Eqs (4)–(8). Then, replace the information at the first dimension of the food source with the first dimension of the opposite point of the center of gravity. If the fitness value of the new food source is better than the fitness value of the original food source, the opposite solution information of the population center of gravity of this dimension is retained, otherwise, the update result of this dimension is discarded. At last, update the next food source information in dimension order until all dimensions are updated.

The pseudo-code of the strategy of dimension-by-dimension centroid opposition-based learning is as follows:

Calculate the center position M of the iterative population according to Eq (6)

for j = 1: Dim

Calculate opposite solution of the food source position ${\tilde F_j}$ according to Eq (8)

If f ( ${\tilde F_j}$ ) is better than f (F_j) then

F_j = ${\tilde F_j}$

end if

end for

4.2. DCORSSA

According to the position update Eq (3) of followers in the salp swarm algorithm, the position of follower takes the center of gravity of the corresponding position of the salp in the previous generation and the position adjacent to the previous salp. The head follower uses the information of the leader slap besides its information of previous generation, so the whole population of followers are affected gradually by the leader slap. It can be seen from Eq (1) that the leader's position is updated near the food source. Therefore, when the food source does not fall into the local solutions, this end-to-end search mechanism can enable the followers to fully carry out local exploitation, but the gradual transmission of the global optimal information is not conducive to the rapid convergence of the algorithm. Moreover, when the food source falls into the local solutions, the followers will fall into the dilemma of invalid region search, that is, the lack of population diversity, so the algorithm is easy to fall into the local extremes.

The above searching mechanism is fixed and lack of dynamic adjustment. Therefore, this paper proposes a DCORSSA algorithm on the basis of DCOSSA, that is, a random factor is added to the update equation of its followers to enhance the update randomness, so that population can get more chance to jump out of the local optima.

A random factor c₄ between 0 and 1 replaces the constant 1 of Eq (3). So, the new position update equation of followers is as follows:

$x_j^i = \frac{{{c_4}}}{2}(x_j^i + x_j^{i - 1})$

(9)

4.3. DCORSSA-PSO

As mentioned above in Section 4.2, the one-by-one transfer mechanism of salp chain makes the convergence speed is slow. So, in this section, we introduce the social learning strategy of PSO on the basis of DCORSSA. The Particle Swarm Optimization (PSO) is a very practical swam optimization algorithm proposed by Kennedy and Eberhart ^[5]. Particles search for optimization through information sharing mechanism. That is, particles obtain their own historical experience (individual optimal pⁱ) and group experience (global optimal p^g) through information exchange between individuals to achieve the purpose of optimization. The update formula of velocity and position are as follows:

$v_j^i(t + 1) = \omega v_j^i(t) + {c_1}{r_1}(t)[p_j^i(t) - x_j^i(t)] + {c_2}{r_2}(t)[p_j^g(t) - x_j^i(t)]$

(10)

$x_j^i(t + 1) = x_j^i(t) + v_j^i(t + 1)$

(11)

where $v_j^i$ is the velocity of the ith particle in the jth dimension, $x_j^i$ is the position of the ith particle in the jth dimension, t refers to the iteration number, w is inertia weight that aims at determining the effect of previous velocities on current velocity, c₁ represents the individual learning factor and c₂ represents the social learning factor, r₁ and r₂ are random variables used to increase the randomness of particle flight, whose values are normally distributed in [0, 1], $p_j^i$ and $p_j^g$ indicate the elements of individual optimal location and global optimal location in the jth dimension, respectively.

Equation (10) includes three parts: the first part is the "inertia" part, which is the motion inertia of particles, reflecting the tendency of particles to maintain their previous velocity; the second part is the "individual learning", which reflects the trend of particles moving to their previous best position in history; the third part is the "social learning", which reflects the trend of particles moving to the best position in the previous history of the population.

Finally, based on DCORSSA, this paper further proposes a DCORSSA-PSO algorithm. That is, the social learning strategy of PSO which is the third part of Eq (10) is introduced to the position update equation of the followers of DCORSSA. On the basis of increasing the random distribution of its own position, the improved algorithm makes full use of the global information and strengthens the tendency of individuals to move to the food source.

The position update equation of followers of proposed DCORSSA-PSO is as follows:

$x_j^i = x_j^i + {c_5} \times {\text{rand}}(0, 1) \times \left( {{F_j} - x_j^i} \right)$

(12)

where $x_j^i$ is the position of the ith follower in the jth dimension updated by Eq (9), F_j is the food source position of the jth dimension, c₄ is random number between 0 and 1, c₅ is the social learning factor, taken c₅ = 1.49.

The flow chart of the algorithm DCORSSA-PSO is shown in Figure 1.

Figure 1. Flow chart of DCORSSA-PSO.

DownLoad: Full-Size Img PowerPoint

The pseudo-code of DCORSSA-PSO is as follows:

Set the initial parameters of the algorithm: population numbers N, population dimensions Dim, population iteration times T, search upper bound ub and search lower bound lb;

Initialize the population randomly, calculate the fitness value of each individual, and take the position with the optimal fitness value as the food source position F;

for t = 1: T

for i = 1: N

if i < = N/2

Update the position of leader by Eq (1)

else

Update the position of followers by Eqs (9) and (12)

end if

end for

Update the food source position F and its fitness value

Calculate the center position M of the iterative population according to Eq (6)

for j = 1: Dim

Calculate opposite solution of the food source position ${\tilde F_j}$ according to Eq (8)

If f ( ${\tilde F_j}$ )is better than f (F_j) then

F_j = ${\tilde F_j}$

end if

end for

4.4. Computational complexity of DCORSSA-PSO

According to the literature ^[12], the computational complexity of the SSA algorithm is O(t(d∗n + Cof∗n)) where t shows the number of iterations, d is the number of variables (dimension), n is the number of solutions, and Cof indicates the cost of the objective function. In the DCORSSA-PSO, the fitness function is recalculated for each dimension of the food source with the dimension-by-dimension centroid opposition-based learning, so the amount of operation of O (Cof * d) is increased; simultaneously, to increase the search vitality of the algorithm and improve the search speed of the algorithm, random factor and PSO's social learning strategy are introduced. Still, the number of code execution is not increased. So the computational complexity of DCORSSA-PSO algorithm is O(t(d∗n + Cof∗n + Cof*d)). It can be seen that the computational complexity of the DCORSSA-PSO algorithm is higher than that of standard SSA, and it increases with the increase of population dimension.

5. Simulation experiment and result analysis

5.1. Testing environment

Test environment: the hard disk running environment is CPUi5-7200U, the memory is 12GB, the software running environment is windows10 system, and the running software is MATLAB 2019b. These parameters of different algorithms are set the same: population sizes N = 30, population dimensions Dim = 30, and the maximum number of iterations T = 500. Respective parameter settings for involved algorithms are shown in Table 1.

Table 1. Parameter settings for involved algorithms.

Algorithm	Parameters
SSA	c₁∈[2 × e^-16, 2 × e]; c₂∈[0, 1]; c₃∈[0, 1];
DCOSSA	c₁∈[2 × e-¹⁶, 2 × e]; c₂∈[0, 1]; c₃∈[0, 1];
DCORSSA	c₁∈[2 × e^-16, 2 × e]; c₂∈[0, 1]; c₃∈[0, 1]; c₄∈[0, 1];
DCORSSA-PSO	c₁∈[2 × e^-16, 2 × e]; c₂∈[0, 1]; c₃∈[0, 1]; c₄∈[0, 1]; c₅ = 1.49;
PSO	w∈[0.4, 0.9]; c₁ = c₂ = 1.49;
GWO	r₁∈[0, 1]; r₂∈[0, 1]

| Show Table

DownLoad: CSV

5.2. Test function

Ten classical benchmark functions, as shown in Table 2, are used to evaluate the performance of DCORSSA-PSO. SSA, PSO, DCOSSA and GWO are the algorithms for comparison. In the ten functions, f₁~f₄ are unimodal test functions that can test the optimization accuracy of the algorithms, f₅~f₈ are multimodal test functions that can test the global optimization ability and convergence speed of the algorithms, f₉~f₁₀ are ill-conditioned test functions that can test the exploration and exploitation capabilities of the algorithms.

Table 2. Benchmark functions used in the study.

Benchmark function	Range	f_min
${f_1}\left(x \right) = \sum\limits_{i = 1}^n {x_i^2}$	[-100,100]	0
${f_2}\left(x \right) = \sum\limits_{i = 1}^n {\left\| {{x_i}} \right\|} + \prod\limits_{i = 1}^n {\left\| {{x_i}} \right\|}$	[-10, 10]	0
${f_3}\left(x \right) = \mathop {\max }\limits_{i = 1}^n \left\{ {\left\| {{x_i}} \right\|} \right\}$	[-100,100]	0
${f_4}\left(x \right) = \sum\limits_{i = 1}^n {{{\left({\left\lfloor {{x_i} + 0.5} \right\rfloor } \right)}^2}}$	[-100,100]	0
${f_5}\left(x \right) = \sum\limits_{i = 1}^n {\left[{x_i^2- 10\cos \left({2{\text{π }}{x_i}} \right) + 10} \right]}$	[-5.12, 5.12]	0
${f_6}\left(x \right) = \frac{1}{{4000}}\sum\limits_{i = 1}^n {x_i^2} - \prod\limits_{i = 1}^n {\cos \frac{{{x_i}}}{{\sqrt i }}} + 1$	[-600,600]	0
${f_7}\left(x \right) = 1 - \cos \left({2\pi \sqrt {\sum\limits_{i = 1}^n {x_i^2} } } \right) + 0.1\sqrt {\sum\limits_{i = 1}^n {x_i^2} }$	[-32, 32]	0
${f_8}\left(x \right) = - 20\exp \left({ - 0.2\sqrt {\frac{1}{n}\sum\limits_{i = 1}^n {x_i^2} } } \right) - \exp \left({\frac{1}{n}\sum\limits_{i = 1}^n {\cos \left({2\pi {x_i}} \right)} } \right) + 20 + e$	[-100,100]	0
${f_9}\left(x \right) = \sum\limits_{i = 1}^n {ix_i^4} + rand$	[-1.28, 1.28]	0
${f_{10}}\left(x \right) = \sum\limits_{i = 1}^{n - 1} {\left[{100{{\left({{x_{i + 1}} - x_i^2} \right)}^2} + {{\left({{x_i} - 1} \right)}^2}} \right]}$	[-30, 30]	0

| Show Table

DownLoad: CSV

5.3. Experimental results and analysis

To objectively test the optimization performance of the algorithm of DCORSSA-PSO, the same initial population is selected for all algorithms, and the average fitness value, optimal fitness value, standard deviation of fitness value, and the average running time of each algorithm for 30 times independently are counted to evaluate the algorithm comprehensively. The comparison results of each algorithm are shown in Table 3.

Table 3. Test results of DCORSSA-PSO and compared algorithms.

Functions	Measure	SSA	DCOSSA	DCORSSA	DCORSSA-PSO	PSO	GWO
f₁	Mean	2.40×10^-7	2.28×10^-9	2.70×10^-27	5.80×10^-44*	2.08×10²	1.42×10^-27
	Best	3.70×10^-8	9.29×10^-12	6.45×10^-32	1.42×10^-46*	5.82×10¹	3.07×10^-29
	Std	4.09×10^-7	5.16×10^-9	5.86×10^-27	1.17×10^-43*	1.08×10²	1.82×10^-27
	Time/s	1.01×10^-1*	1.99×10^-1	2.01×10^-1	2.11×10^-1	2.71×10^-1	1.70×10^-1
f₂	Mean	1.96	1.36×10^-5	1.00×10^-14	7.80×10^-23*	6.30	7.96×10^-17
	Best	1.71×10^-1	1.03×10^-6	6.24×10^-16	2.12×10^-24*	3.57	4.54×10^-18
	Std	1.46	3.95×10^-5	1.18×10^-14	9.73×10^-23*	1.75	4.69×10^-17
	Time/s	9.37×10^-2*	1.83×10^-1	1.80×10^-1	1.93×10^-1	2.41×10^-1	1.47×10^-1
f₃	Mean	1.20×10¹	1.08	9.05×10^-15	7.16×10^-23*	1.24×10¹	8.01×10^-7
	Best	5.19	1.36×10^-1	6.79×10^-17	1.98×10^-24*	4.76	1.16×10^-7
	Std	3.35	6.96×10^-1	1.29×10^-14	8.06×10^-23*	3.19	7.47×10^-7
	Time/s	9.07×10^-2*	1.75×10^-1	1.72×10^-1	1.85×10^-1	2.36×10^-1	1.41×10^-1
f₄	Mean	1.08×10^-7	9.70×10^-10*	3.39×10^-7	6.50×10^-8	1.67×10²	7.97×10^-1
	Best	2.54×10^-8	9.62×10^-12*	2.56×10^-8	2.41×10^-8	5.07×10¹	2.57×10^-1
	Std	8.12×10^-8	2.23×10^-9*	8.51×10^-7	2.66×10^-8	8.92×10¹	2.57×10^-1
	Time/s	9.29×10^-2*	1.79×10^-1	1.76×10^-1	1.91×10^-1	2.42×10^-1	1.44×10^-1
f₅	Mean	6.16×10¹	4.64×10^-1	0*	0*	1.31×10²	3.42
	Best	2.79×10¹	2.80×10^-12	0*	0*	8.73×10¹	0*
	Std	1.80×10¹	5.68×10^-1	0*	0*	2.41×10¹	4.36
	Time/s	1.05×10^-1*	1.98×10^-1	1.92×10^-1	2.00×10^-1	2.59×10^-1	1.52×10^-1
f₆	Mean	9.54×10^-1	1.00×10^-1	0*	0*	2.65	4.12×10^-13
	Best	8.49×10^-1	4.44×10^-2	0*	0*	1.57	5.00×10^-15
	Std	4.30×10^-2	3.49×10^-2	0*	0*	1.04	4.65×10^-13
	Time/s	1.22×10^-1*	2.42×10^-1	2.36×10^-1	2.46×10^-1	2.77×10^-1	1.70×10^-1
f₇	Mean	1.96	1.04	2.96×10^-15	2.25×10^-23*	2.75	1.83×10^-1
	Best	1.00	7.00×10^-1	1.22×10^-16	1.86×10^-25*	1.80	9.99×10^-2
	Std	4.18×10^-1	2.40×10^-1	2.99×10^-15	2.90×10^-23*	4.88×10^-1	3.79×10^-2
	Time/s	9.56×10^-2*	1.94×10^-1	1.91×10^-1	2.00×10^-1	2.46×10^-1	1.51×10^-1
f₈	Mean	2.46	7.58×10^-6	8.23×10^-15	8.88×10^-16*	5.84	9.80×10^-14
	Best	9.31×10^-1	7.50×10^-7	8.88×10^-16*	8.88×10^-16*	4.45	7.55×10^-14
	Std	7.42×10^-1	1.01×10^-5	1.23×10^-14	0*	8.80×10^-1	1.49×10^-14
	Time/s	1.08×10^-1*	2.04×10^-1	1.98×10^-1	2.10×10^-1	2.62×10^-1	1.52×10^-1
f₉	Mean	1.51	5.48×10^-2	6.72×10^-5*	7.24×10^-4	1.22	1.72×10^-2
	Best	5.85×10^-1	1.73×10^-2	1.32×10^-7*	7.12×10^-6	3.09×10^-2	8.50×10^-4
	Std	6.72×10^-1	1.91×10^-2	6.83×10^-5*	1.55×10^-3	1.23	1.55×10^-2
	Time/s	1.59×10^-1*	3.16×10^-1	3.18×10^-1	3.27×10^-1	3.06×10^-1	2.09×10^-1
f₁₀	Mean	2.77×10²	5.65×10¹	2.79×10¹	2.64×10¹*	5.39×10³	2.69×10¹
	Best	2.49×10¹*	3.96×10^-2	2.74×10¹	2.62×10¹	5.89×10²	2.57×10¹
	Std	4.33×10²	4.31×10¹	2.05×10^-1	1.33×10^-1*	5.25×10³	7.83×10^-1
	Time/s	1.24×10^-1*	2.47×10^-1	2.47×10^-1	2.60×10^-1	2.81×10^-1	1.78×10^-1
Note: the mark "*" at the top right of the data indicates the best result obtained by all algorithms.

| Show Table

DownLoad: CSV

From the experimental results of 30 independent runs in Table 3, it can be seen that the optimization performance of each algorithm in the standard test functions is different. The optimal value and average value can measure the accuracy of the optimization algorithm. In multimodal functions f₅ and f₆, DCORSSA-PSO and DCORSSA can search the theory optimal value 0, showing excellent optimization ability. In the test functions f₁~f₃, f₇, the DCORSSA-PSO algorithm is superior to other algorithms in terms of both the average value and the optimal value. In the test functions f₈ and f₁₀, DCORSSA-PSO is superior to other algorithms in terms of the average value. In the unimodal test function f₁, DCORSSA-PSO is more than 10 orders of magnitude higher than other algorithms in the accuracy of the optimal value. Compared with SSA, the convergence accuracy of DCORSSA and DCORSSA-PSO is also greatly improved, which indicates that adding different optimization strategies is very helpful to improve the optimization of SSA. At the same time, in the ill-conditioned function f₉, the optimization accuracy of DCORSSA-PSO reaches 7.12 × 10^-6; although its accuracy is improved compared with SSA, it is worse than DCORSSA. In the ill-conditioned function f₁₀, although the DCORSSA-PSO algorithm improves the mean optimization accuracy compared with the SSA algorithm, the optimal value search is still insufficient compared with the SSA algorithm. In unimodal function f₄, DCORSSA-PSO inferiors to DCOSSA in terms of average value and optimal value. These cases which DCORSSA-PSO does not get the best performance indicate that the DCORSSA-PSO algorithm is still insufficient in search of some functions.

The standard deviation can measure the optimization stability of the optimization algorithm. Except for f₄ and f₉, the standard deviation of the DCORSSA-PSO algorithm calculated 30 times independently is always less than that of other algorithms, which shows that the improved DCORSSA-PSO algorithm can ensure the optimization stability of the algorithm when dealing with unimodal, multimodal, even ill-conditioned functions.

In terms of average running time, the SSA algorithm has a shorter running time than PSO algorithm and GWO algorithm, which shows that the improved DCOSSA, DCORSSA and DCORSSA-PSO have inherent advantages in operation speed. The average running time of DCORSSA-PSO algorithm is slightly longer than that of the DCOSSA, which does not cause a significant increase in running time, indicating that the addition of random factor and PSO's social learning strategy have little impact on the time complexity of the algorithm. The average running time of DCORSSA-PSO algorithm and DCORSSA is longer than that of the SSA algorithm, mainly due to the addition of dimension-by-dimension centroid opposition-based learning strategy.

The Wilcoxon signed-rank test ^[35] with a significance level of 0.05 was used to judge the statistical difference between the improved algorithm DCORSSA-PSO and the comparative algorithms such as SSA. The statistical results are shown in Table 4, in which: "+" indicates that the test result of DCORSSA-PSO is superior to the corresponding comparison algorithm. "=" indicates that the performance of the DCORSSA-PSO test result is similar to the corresponding comparison algorithm, and there is no statistically significant difference. "-" indicates that the DCORSSA-PSO test result is inferior to that of the corresponding comparison algorithm.

Table 4. Wilcoxon signed-rank test of DCORSSA-PSO and other algorithms.

Comparison group	+/=/-	Comparison group	+/=/-
DCORSSA-PSO VS SSA	9/1/0	DCORSSA-PSO VS PSO	10/0/0
DCORSSA-PSO VS DCOSSA	9/0/1	DCORSSA-PSO VS GWO	10/0/0
DCORSSA-PSO VS DCORSSA	7/2/1	---

| Show Table

DownLoad: CSV

According to the Wilcoxon signed-rank test results described in Table 4, it can be learned that DCORSSA-PSO wins in 45( = 9 + 9 + 7 + 10 + 10) cases, loses in 2 cases and shows a tie in the other cases in the total 50 ( = 5*10) cases. In general, the DCORSSA-PSO algorithm is better than other algorithms such as SSA algorithm in most functions, which proves the effectiveness of the proposed improved method.

In addition, in order to further evaluate the statistical comparison of the optimization performance of each algorithm, Friedman test ^[36] is used to study the difference between each algorithm as is shown in Table 5. The average ranking value (ARV) represents the average ranking value of the Friedman test of an algorithm that runs 30 times of all test functions independently. The smaller the ARV, the higher the optimization performance of the algorithm.

Table 5. Friedman test of DCORSSA-PSO and other algorithms.

Algorithm	SSA	DCOSSA	DCORSSA	DCORSSA-PSO	PSO	GWO
ARV	4.8533	3.5700	2.2567	1.4300	5.8700	3.0200
rank	5	4	2	1	6	3

| Show Table

DownLoad: CSV

From Table 5, we can clearly see the statistical results of the Friedman test. The ARV of DCORSSA-PSO integrating the three learning strategies is 1.4300, and the rank is No.1, which indicates that DCORSSA-PSO is significantly better than other comparison algorithms in solving these test functions. In addition, the rank of DCORSSA and DCOSSA combining the other strategies are No.2 and No.4 respectively, indicating that the above-mentioned optimization strategies are of great help in improving the optimization accuracy of the SSA algorithm.

Figure 2 shows the average convergence curve of each algorithm in 10 standard test functions. To better observe the optimization effect of the algorithm, the logarithm based on 10 is taken for the optimization fitness values of f₁~f₁₀.

Figure 2. Average convergence curve of the standard test functions.

DownLoad: Full-Size Img PowerPoint

In Figure 2, it can be seen that the optimization speed and accuracy of DCOSSA added with the dimension-by-dimension centroid opposition-based learning strategy are greatly improved compared with SSA, which shows that the dimension-by-dimension centroid opposition-based learning strategy is of great benefit to improve the population diversity and the ability to jump out of the local solutions. Compared with DCOSSA and the other three algorithms, DCORSSA-PSO which adds a random factor and integrates the social learning strategy of PSO, declines rapidly in the middle of the iteration, and its optimization speed is significantly ahead. Especially in the middle and early iterations of the function, the DCORSSA-PSO algorithm can almost quickly search for the optimal value, and continues to show high search activity in the later iterations. Even in multimodal functions f₅ and f₆, the curves are interrupted because the DCORSSA-PSO algorithm searches the theoretical optimal value 0 (the independent variable of lg cannot be 0).

All the above show that the DCORSSA-PSO algorithm is effective in dealing with unimodal, multimodal, and ill-conditioned test functions, and it has better optimization accuracy and speed, which is very helpful to solve the problems to be optimized in engineering practice.

6. System reliability optimization model

Nowadays, more and more engineering problems are adopting optimization methods to get optimal performance ^[37], while system reliability optimization is one of the most useful engineering fields. System reliability optimization refers to finding an optimal design under certain resource constraints to obtain the highest reliability of the system or minimizing the investment while meeting specific reliability index requirements, thus obtaining the maximum economic benefits. At present, practice shows that the optimal redundancy allocation design is one of the most usually used methods to reduce system failure probability and improve system reliability. Redundancy design means that when a part of the system fails, the redundant part is activated through the monitoring and switching mechanism to complete the same function instead of the failed part, to reduce the failure probability of the system.

Many scholars have used intelligent optimization algorithms to solve reliability optimization problems. In literature ^[38], an enhanced nest cuckoo optimization algorithm was used to study the system reliability redundancy allocation with a cold-standby strategy. Literature ^[39] carried out reliability optimization of a fuzzy multi-objective system based on genetic algorithm and cluster analysis. Literature ^[40] proposed a new particle swarm optimization algorithm based on fuzzy adaptive inertia weight to solve the reliability redundancy allocation problem.

6.1. T-S fault tree construction

Fault tree analysis is one of the commonly used reliability analysis methods, which is oriented by system failure and unit failure. A fault tree is composed of events and gates. It is named fault tree because its fault logic relationship is graphically represented like a tree with the top event as root, event logic causality represented by the gate as a branch, and bottom event as a leaf. T-S model ^[41] was proposed by Takagi and Sugeno in 1985. Through if-then rules, a series of local linear subsystems and membership functions were used to accurately describe nonlinear systems. Song et al. ^[42] constructed T-S gates to describe event relations based on the T-S model, proposing the T-S fault tree analysis method. Yao et al. ^[43] proposed a new reliability optimization method based on the T-S fault tree and EPSO (Extended PSO).

Hypothetically, a mechanical system consists of two subsystems, each of which can improve the system reliability by adding a redundant design. In this paper, the T-S fault tree analysis method is used to construct the reliability allocation optimization model of the system, and the DCORSSA-PSO algorithm is used to optimize its reliability allocation. The T-S fault tree of the mechanical system is shown in Figure 3.

Figure 3. T-S fault tree of a mechanical system.

DownLoad: Full-Size Img PowerPoint

In Figure 3, x₁~x₅ are the bottom events, y₁~y₂ are the intermediate events, and y₃ is the top event. G₁~G₃ are T-S gates. Fuzzy numbers 0, 0.5 and 1 represent the three states of normal, semi failure, and complete failure of each part, respectively. The fault states of each part are independent of each other. According to expert experience and historical data, the rule tables of the T-S gate are defined as shown in Tables 6–8.

Table 6. Rule table of T-S gate 1.

rules	x₁	x₂	x₃	y₁			rules	x₁	x₂	x₃	y₁
rules	x₁	x₂	x₃	0	0.5	1	rules	x₁	x₂	x₃	0	0.5	1
1	0	0	0	1	0	0	15	0.5	0.5	1	0	0	1
2	0	0	0.5	0.2	0.5	0.3	16	0.5	1	0	0	0	1
3	0	0	1	0	0	1	17	0.5	1	0.5	0	0	1
4	0	0.5	0	0.3	0.5	0.2	18	0.5	1	1	0	0	1
5	0	0.5	0.5	0.2	0.3	0.5	19	1	0	0	0	0	1
6	0	0.5	1	0	0	1	20	1	0	0.5	0	0	1
7	0	1	0	0	0	1	21	1	0	1	0	0	1
8	0	1	0.5	0	0	1	22	1	0.5	0	0	0	1
9	0	1	1	0	0	1	23	1	0.5	0.5	0	0	1
10	0.5	0	0	0.2	0.5	0.3	24	1	0.5	1	0	0	1
11	0.5	0	0.5	0.1	0.4	0.5	25	1	1	0	0	0	1
12	0.5	0	1	0	0	1	26	1	1	0.5	0	0	1
13	0.5	0.5	0	0.1	0.5	0.4	27	1	1	1	0	0	1
14	0.5	0.5	0.5	0.1	0.4	0.5	-	-	-	-	-	-	-

| Show Table

DownLoad: CSV

Table 7. Rule table of T-S gate 2.

rules	x₄	x₅	y₂			rules	x₄	x₅	y₂
rules	x₄	x₅	0	0.5	1	rules	x₄	x₅	0	0.5	1
1	0	0	1	0	0	6	0.5	1	0	0	1
2	0	0.5	0.4	0.4	0.2	7	1	0	0.1	0.2	0.7
3	0	1	0.1	0.1	0.8	8	1	0.5	0	0	1
4	0.5	0	0.8	0.1	0.1	9	1	1	0	0	1
5	0.5	0.5	0.1	0.5	0.4	-	-	-	-	-	-

| Show Table

DownLoad: CSV

Table 8. Rule table of T-S gate 3.

rules	y₁	y₂	y₃			rules	y₁	y₂	y₃
rules	y₁	y₂	0	0.5	1	rules	y₁	y₂	0	0.5	1
1	0	0	1	0	0	6	0.5	1	0	0	1
2	0	0.5	0.4	0.5	0.1	7	1	0	0.1	0.2	0.7
3	0	1	0.1	0.1	0.8	8	1	0.5	0	0	1
4	0.5	0	0.8	0.1	0.1	9	1	1	0	0	1
5	0.5	0.5	0.1	0.5	0.4	-	-	-	-	-	-

| Show Table

DownLoad: CSV

Taking Table 6 as an example, each row in rules1~27 represents a G₁ gate rule. For example, in rule1, the fault states of bottom events x₁, x_2, and x₃ are 0, 0, and 0 respectively, then the occurrence probability of fault state 0 of y₁ is P¹(y₁ = 0) = 1, the occurrence probability of fault state 0.5 of y₁ is P¹(y₁ = 0.5) = 0, the occurrence probability of fault state 1 of y₁ is P¹(y₁ = 1) = 0. Under the same rule, the sum of the occurrence possibilities of each fault state of the superior event y₁ is 1, that is, P¹(y₁ = 0) + P¹(y₁ = 0.5) + P¹(y₁ = 1) = 1.

6.2. System reliability optimization model

According to the T-S fault tree and the corresponding rule gate of the system, a system reliability optimization model is constructed with the lowest system fault probability as the objective function and the overall cost of the system as the constraint. Among them, the system cost is the sum of the expenses of each component unit, its connectors, and switching equipment. The unit cost increases nonlinearly with the improvement of its reliability. The objective function and cost constraint expression are as follows:

$\min \sum\limits_{q = 2}^3 {P\left( {T = {T_q}} \right)} = \sum\limits_{l = 1}^9 {P_o^l} {P^l}\left( {T = 0.5} \right) + \sum\limits_{l = 1}^9 {P_o^l} {P^l}\left( {T = 1} \right)$

(13)

${\text{s}}{\text{.t}}\sum\limits_{i = 1}^5 {{\alpha _i}} {\left\{ {\frac{{ - \mu }}{{\ln \left[ {1 - \sum\limits_{{\alpha _i} = 2}^3 {P\left( {x_i^{{\alpha _i}}} \right)} } \right]}}} \right\}^{{\beta _i}}} \times \left[ {{n_i} + \exp \left( {\frac{{{n_i}}}{4}} \right)} \right] \leqslant {C_0}$

(14)

where P(T = T_q) is the fault probability of top event T when its fault state is T_q; P^l(T = 0.5) and P^l(T = 1) respectively represent the probability when the fault state of top event T is 0.5 and 1 in the rule l; $P_0^l$ is the execution degree of T-S rule; $P(x_i^{{a_i}})$ is the failure probability of bottom event x_i when its failure state is $x_i^{{a_i}}$ ; n_i is the redundancy number of the element; μ is the fault-free operation time, taken μ = 1000 h; C₀ is the constraint value of system cost, taken C₀ = 175. α_i and β_i can be seen in Table 9.

Table 9. Constraint parameter values of α_i and β_i.

i	1	2	3	4	5
10⁵α_i	2.540×10^-5	2.483 × 10^-5	6.420 × 10^-5	7.160 × 10^-5	2.417 × 10^-5
β_i	1.500	1.500	1.500	1.500	1.500

| Show Table

DownLoad: CSV

6.3. Fitness function

The penalty function is one of the main constraint optimization methods available at present, whose core idea is to transform the original constrained optimization problem into an unconstrained problem by constructing auxiliary functions. In this paper, the cost constraint in the system reliability optimization model is transformed into an unconstrained optimization problem by introducing a penalty function. That is, a penalty factor is added to the fitness value of the salps that does not satisfy the cost constraints, so that the infeasible solution can be eliminated in the process of evolution. In this paper, the maximum probability value of system fault N (N = 1) is used as the penalty factor, and the failure probability fitness function is constructed as follows:

$fitness = \left\{ {\begin{array}{*{20}{c}} {\sum\limits_{q = 2}^3 {P\left( {T = {T_q}} \right)} }&{C \leqslant {C_0}} \\ {\sum\limits_{q = 2}^3 {P\left( {T = {T_q}} \right)} + N}&{Otherwise} \end{array}} \right.$

(15)

6.4. Results comparison and analysis

The DCORSSA-PSO algorithm is compared with SSA, PSO, and GWO algorithms. Set the maximum number of iterations of the above five algorithms T = 500. And the reliability optimization results of the algorithms are shown in Table 10.

Table 10. Optimization results of four algorithms.

Optimized parameters	SSA		DCORSSA-PSO		PSO		GWO
Optimized parameters	P(x_i)	n_i	P(x_i)	n_i	P(x_i)	n_i	P(x_i)	n_i
x₁	1.28 × 10^-1	3	1.09 × 10^-1	3	1.58 × 10^-1	3	1.37 × 10^-1	3
x₂	9.22 × 10^-2	3	1.08 × 10^-1	3	1.58 × 10^-1	3	1.68 × 10^-1	3
x₃	1.38 × 10^-1	3	9.00 × 10^-2	3	1.07 × 10^-1	3	1.88 × 10^-1	3
x₄	1.26 × 10^-1	3	1.41 × 10^-1	3	1.70 × 10^-1	3	1.57 × 10^-1	3
x₅	1.16 × 10^-1	3	9.66 × 10^-2	3	1.11 × 10^-1	3	1.59 × 10^-1	3
P	5.03 × 10^-3		3.71 × 10^-3*		8.03 × 10^-3		1.22 × 10^-2
C	175.00		175.00		115.76		105.89*
Time/s	4.91 × 10^-1*		6.05 × 10^-1		8.17 × 10^-1		5.09 × 10^-1

| Show Table

DownLoad: CSV

According to the optimization results in Table 10, when taking the minimum failure probability of the system as the objective function, the system failure probability optimized by the DCORSSA-PSO algorithm is lower than that of other algorithms including SSA, PSO, GWO, which proves the feasibility and superiority of the improved algorithm. In terms of running time, SSA has the shortest one, while DCORSSA-PSO has the longest one. This indicates that the multi-strategy improvement of DCORSSA-PSO spend more time, but the running time of DCORSSA-PSO is less than one second which can fulfill the needs of practical engineering.

In addition, in order to more intuitively show the reliability optimization process of the four algorithms, the iterative curve is shown in Figure 4.

Figure 4. Optimization comparison curves.

DownLoad: Full-Size Img PowerPoint

To test the stableness of the DCORSSA-PSO algorithm, let all the algorithms run 30 times at the same initial condition. Table 11 shows the statistical results of failure probability.

Table 11. Statistical results of five algorithms.

Measure	SSA	DCORSSA-PSO	PSO	GWO
Mean	6.956 × 10^-3	3.746 × 10^-3*	5.196 × 10^-3	8.304 × 10^-3
Best	3.710 × 10^-3	3.709 × 10^-3*	4.387 × 10^-3	4.404 × 10^-3
Std	3.703 × 10^-3	3.886 × 10^-5*	8.625 × 10^-4	2.518 × 10^-3
Time/s	5.466 × 10^-1*	6.795 × 10^-1	8.975 × 10^-1	5.617 × 10^-1

| Show Table

DownLoad: CSV

From Table 11, we can find that in a statistical sense, DCORSSA-PSO compared with SSA, PSO and GWO still get the best result of failure probability including mean value, best value and standard deviation except for running time. The average failure probability obtained by DCORSSA-PSO algorithm relatively reduced by 46.14% compared to SSA, which shows that DCORSSA-PSO greatly improves the optimization performance of SSA by integrating the multi-strategy improvement.

In this paper, a box plot is used to analyze the data distribution of the system failure probability. The box plot consists of five parts: upper limit, upper quartile, median, lower quartile and lower limit. The upper limit is connected to the upper quartile with a dashed line, and same to the lower limit and the lower quartile. The center mark indicates the median. Figure 5 shows the boxplot of different algorithms.

Figure 5. Boxplot of the different algorithms.

DownLoad: Full-Size Img PowerPoint

In the statistical result of Figure 5, y-axis P means the failure probability, and it can be found that DCORSSA-PSO has the lowest median of P value, which means that the reliability calculated by DCORSSA-PSO is the highest. In addition, the box obtained by DCORSSA-PSO is very compact, that is, the range of the box formed between the upper quartile and the lower quartile is the smallest, indicating that DCORSSA-PSO has less volatility compared to the datasets of other algorithms. Therefore, DCORSSA-PSO outperforms other algorithms. On the other hand, outliers (+) appear in the failure probability optimized by SSA, DCORSSA-PSO and PSO, indicating that further research in performance improvement is needed for DCORSSA-PSO.

7. Conclusions and future works

This paper proposes a DCORSSA-PSO algorithm that hybridizes dimension-by-dimension centroid opposition-based learning strategy, random factor and PSO's social learning strategy based on standard SSA. The improved algorithm mainly improves the standard SSA algorithm in three parts: a) a dimension-by-dimension centroid opposition-based learning strategy is added to the food source update, which can expand the population search range, strengthen the dimension evolution information, and enhance the ability to jump out of the local solutions; b) random factor is added in the update equation of followers to enhance the diversity of population distribution; c) drawing on the experience of PSO's social learning strategy, in the update equation of followers, the food source is added to directly guide the followers to improve the convergence speed of the algorithm. The comparison results in the synthesis of ten standard test functions and the reliability optimization example show that the DCORSSA-PSO algorithm is superior to other algorithms in optimization, which proves that the above improvement strategy has good feasibility and superiority to improve the optimization performance of the SSA algorithm. As a future plan, the method of increasing the diversity of the population will be introduced into the research of DCORSSA-PSO such as the levy-flight theory, chaos mapping. At the same time, DCORSSA-PSO can be employed to optimize pattern classification, fuzzy control, machine learning, etc.

Acknowledgments

This project is supported by the National Natural Science Foundation of China (Grant No. 51975508), Natural Science Foundation of Hebei Province (Grant No. E2021203061).

Conflict of interest

All authors declare that there is no conflict of interests in this paper.

References

[1]	Y. Lin, S. Shen, Z. Liu, H. Luan, M. Sun, Neural relation extraction with selective attention over instances, in Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 1 (2016), 2124–2133. https://doi.org/10.18653/v1/P16-1200
[2]	M. Nickel, K. Murphy, V. Tresp, E. Gabrilovich, A review of relational machine learning for knowledge graphs, Proc. IEEE, 104 (2016), 11–33. https://doi.org/10.1109/JPROC.2015.2483592 doi: 10.1109/JPROC.2015.2483592
[3]	D. Nadeau, S. Sekine, A survey of named entity recognition and classification, Lingvisticae Investigationes, 30 (2007), 3–26. https://doi.org/10.1075/li.30.1.03nad doi: 10.1075/li.30.1.03nad
[4]	N. Bach, S. Badaskar, A review of relation extraction, 2007. Available from: https://www.cs.cmu.edu/nbach/papers/A-survey-on-Relation-Extraction.pdf.
[5]	M. Miwa, Y. Sasaki, Modeling joint entity and relation extraction with table representation, in Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), (2014), 1858–1869. https://doi.org/10.3115/v1/D14-1200
[6]	Z. Zhong, D. Chen, A frustratingly easy approach for entity and relation extraction, in Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, (2021), 50–61. https://doi.org/10.18653/v1/2021.naacl-main.5
[7]	R. D. Semmel, J. Mayfield, Automated query formulation using an entity-relationship conceptual schema, J. Intell. Inf. Syst., 8 (1997), 267–290. https://doi.org/10.1023/A:1008629825231 doi: 10.1023/A:1008629825231
[8]	J. A. P. Sacenti, R. Fileto, R. Willrich, Knowledge graph summarization impacts on movie recommendations, J. Intell. Inf. Syst., 58 (2022), 43–66. https://doi.org/10.1007/s10844-021-00650-z doi: 10.1007/s10844-021-00650-z
[9]	Y. Li, Y. Chen, I. Podlubny, Stability of fractional-order nonlinear dynamic systems: Lyapunov direct method and generalized Mittag–Leffler stability, Comput. Math. Appl., 59 (2010), 1810–1821. https://doi.org/10.1016/j.camwa.2009.08.019 doi: 10.1016/j.camwa.2009.08.019
[10]	H. Zhou, S. Ning, Z. Liu, C. Lang, Z. Liu, B. Lei, Knowledge-enhanced biomedical named entity recognition and normalization: application to proteins and genes, BMC Bioinf., 21 (2020). https://doi.org/10.1186/s12859-020-3375-3 doi: 10.1186/s12859-020-3375-3
[11]	Y. Huang, B. Niu, An improved method for web text affective cognition computing based on knowledge graph, Comput. Mater. Continua, 59 (2019), 1–14. https://doi.org/10.32604/cmc.2019.06032 doi: 10.32604/cmc.2019.06032
[12]	J. Tan, Q. Qiu, W. Guo, T. Li, Research on the construction of a knowledge graph and knowledge reasoning model in the field of urban traffic, Sustainability, 13 (2021), 3191. https://doi.org/10.3390/su13063191 doi: 10.3390/su13063191
[13]	M. Habibi, L. Weber, M. Neves, D. L. Wiegandt, U. Leser, Deep learning with word embeddings improves biomedical named entity recognition, Bioinformatics, 33 (2017), i37–i48. https://doi.org/10.1093/bioinformatics/btx228 doi: 10.1093/bioinformatics/btx228
[14]	C. Friedman, P. O. Alderson, J. H. M. Austin, J. J. Cimino, S. B. Johnson, A general natural-language text processor for clinical radiology, J. Am. Med. Inf. Assoc., 1 (1994), 161–174. https://doi.org/10.1136/jamia.1994.95236146 doi: 10.1136/jamia.1994.95236146
[15]	J. Li, A. Sun, J. Han, C. Li, A survey on deep learning for named entity recognition, IEEE Trans. Knowl. Data Eng., 34 (2022), 50–70. https://doi.org/10.1109/TKDE.2020.2981314 doi: 10.1109/TKDE.2020.2981314
[16]	Y. LeCun, Y. Bengio, G. Hinton, Deep learning, Nature, 521 (2015), 436–444. https://doi.org/10.1038/nature14539 doi: 10.1038/nature14539
[17]	P. Gupta, H. Schütze, B. Andrassy, Table filling multi-task recurrent neural network for joint entity and relation extraction, in Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, (2016), 2537–2547. Available from: https://aclanthology.org/C16-1239.pdf.
[18]	G. Bekoulis, J. Deleu, T. Demeester, C. Develder, Joint entity recognition and relation extraction as a multi-head selection problem, Expert Syst. Appl., 114 (2018), 34–45. https://doi.org/10.1016/j.eswa.2018.07.032 doi: 10.1016/j.eswa.2018.07.032
[19]	M. Miwa, M. Bansal, End-to-end relation extraction using LSTMs on sequences and tree structures, in Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 1 (2016), 1105–1116. https://doi.org/10.18653/v1/P16-1105
[20]	D. Q. Nguyen, K. Verspoor, End-to-end neural relation extraction using deep biaffine attention, in Advances in Information Retrieval, 11437 (2019), 729–738. https://doi.org/10.1007/978-3-030-15712-8_47
[21]	S. Zheng, F. Wang, H. Bao, Y. Hao, P. Zhou, B. Xu, Joint extraction of entities and relations based on a novel tagging scheme, in Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 1 (2017), 1227–1236. https://doi.org/10.18653/v1/P17-1113
[22]	X. Ma, E. Hovy, End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF, in Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 1 (2016), 1064–1074. https://doi.org/10.18653/v1/P16-1101
[23]	M. Cao, Z. Yang, L. Ling, H. Lin, J. Wang, Joint drug entity and relations extraction based on neural network, J. Comput. Res. Dev., 56 (2019), 1432–1440. https://crad.ict.ac.cn/EN/Y2019/V56/I7/1432
[24]	W. Lv, Z. Liao, S. Liu, Y. Zhang, MEIM: A multi-source software knowledge entity extraction integration model, Comput. Mater. Continua, 66 (2021), 1027–1042. https://doi.org/10.32604/cmc.2020.012478 doi: 10.32604/cmc.2020.012478
[25]	P. Zhou, W. Shi, J. Tian, Z. Qi, B. Li, H. Hao, et al., Attention-based bidirectional long short-term memory networks for relation classification, in Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2 (2016), 207–212. https://doi.org/10.18653/v1/P16-2034
[26]	Z. Li, N. Ding, Z. Liu, H. Zheng, Y. Shen, Chinese relation extraction with multi-grained information and external linguistic knowledge, in Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, (2019), 4377–4386. https://doi.org/10.18653/v1/P19-1430
[27]	R. Alzaidy, C. Caragea, C. L. Giles, Bi-lstm-crf sequence labeling for keyphrase extraction from scholarly documents, in The World Wide Web Conference, New York, (2019), 2551–2557. https://doi.org/10.1145/3308558.3313642
[28]	K. Lee, L. He, M. Lewis, L. Zettlemoyer, End-to-end neural coreference resolution, in Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, (2017), 188–197. https://doi.org/10.18653/v1/D17-1018
[29]	Y. Luan, D. Wadden, L. He, A. Shah, M. Ostendorf, H. Hajishirzi, A general framework for information extraction using dynamic span graphs, in Proceedings of the 2019 Conference of the North {A}merican Chapter of the Association for Computational Linguistics: Human Language Technologies, 1 (2019), 3036–3046. https://doi.org/10.18653/v1/N19-1308
[30]	D. Wadden, U. Wennberg, Y. Luan, H. Hajishirzi, Entity, relation, and event extraction with contextualized span representations, in Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), (2019), 5784–5789. https://doi.org/10.18653/v1/D19-1585
[31]	J. Devlin, M. Chang, K. Lee, K. Toutanova, BERT: Pre-training of deep bidirectional transformers for language understanding, in Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 1 (2019), 4171–4186. https://doi.org/10.18653/v1/N19-1423
[32]	Y. Cui, W. Che, T. Liu, B. Qin, Z. Yang, Pre-training with whole word masking for Chinese BERT, IEEE/ACM Trans. Audio, 29 (2021), 3504–3514. https://doi.org/10.1109/TASLP.2021.3124365 doi: 10.1109/TASLP.2021.3124365
[33]	K. Dixit, Y. Al-Onaizan, Span-level model for relation extraction, in Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, (2019), 5308–5314. https://doi.org/10.18653/v1/P19-1525
[34]	L. Baldini Soares, N. FitzGerald, J. Ling, T. Kwiatkowski, Matching the blanks: Distributional similarity for relation learning, in Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, (2019), 2895–2905. https://doi.org/10.18653/v1/P19-1279
[35]	Z. Zhang, X. Han, Z. Liu, X. Jiang, M. Sun, Q. Liu, ERNIE: Enhanced language representation with informative entities, in Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, (2019), 1441–1451. https://doi.org/10.18653/v1/P19-1139
[36]	H. Peng, T. Gao, X. Han, Y. Lin, P. Li, Z. Liu, et al., Learning from context or names? An empirical study on neural relation extraction, in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), (2020), 3661–3672. 10.18653/v1/2020.emnlp-main.298
[37]	Y. Gao, Y. Wang, P. Wang, L. Gu, Medical named entity extraction from chinese resident admit notes using character and word attention-enhanced neural network, Int. J. Environ. Res. Public Health, 17 (2020), 1614. https://doi.org/10.3390/ijerph17051614 doi: 10.3390/ijerph17051614
[38]	G. Bekoulis, J. Deleu, T. Demeester, C. Develder, Adversarial training for multi-context joint entity and relation extraction, in Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, (2018), 2830–2836. https://doi.org/10.18653/v1/D18-1307
[39]	J. Ge, S. Li, F. Yiqiu, Joint extraction method of Chinese entity relationship based on mixture of characters and words, Appl. Res. Comput., 38 (2021), 2619–2623. https://doi.org/10.19734/j.issn.1001-3695.2021.01.0006 doi: 10.19734/j.issn.1001-3695.2021.01.0006

This article has been cited by:

1.	Yue Zhong, Jieming Gu, Lightweight block ciphers for resource-constrained environments: A comprehensive survey, 2024, 157, 0167739X, 288, 10.1016/j.future.2024.03.054
2.	Usman Haruna Garba, Adel N. Toosi, Muhammad Fermi Pasha, Suleman Khan, SDN-based detection and mitigation of DDoS attacks on smart homes, 2024, 221, 01403664, 29, 10.1016/j.comcom.2024.04.001

Reader Comments

Your name:*

Email:*
© 2022 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Mathematical Biosciences and Engineering

3.9

Metrics

Article views(2887) PDF downloads(110) Cited by(5)

Preview PDF

Download XML

Export Citation

Article outline

Show full outline

Figures and Tables

Figures(8) / Tables(7)

Mathematical Biosciences and Engineering

Span-based model for overlapping entity recognition and multi-relations classification in the food domain

Related Papers:

Abstract

1. Introduction

2. Motivation and innovation

3. Basic principles of salp swarm algorithm

4. An improved salp swarm algorithm

4.1. DCOSSA

4.1.1. Opposition-based learning

4.1.2. Centroid opposition-based learning

4.1.3. Dimension-by-dimension centroid opposition-based learning

4.1.4. DCOSSA

4.2. DCORSSA

4.3. DCORSSA-PSO

4.4. Computational complexity of DCORSSA-PSO

5. Simulation experiment and result analysis

5.1. Testing environment

5.2. Test function

5.3. Experimental results and analysis

6. System reliability optimization model

6.1. T-S fault tree construction

6.2. System reliability optimization model

6.3. Fitness function

6.4. Results comparison and analysis

7. Conclusions and future works

Acknowledgments

Conflict of interest

References

This article has been cited by:

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Figures and Tables

Other Articles By Authors

Catalog

Mathematical Biosciences and Engineering

Span-based model for overlapping entity recognition and multi-relations classification in the food domain

Related Papers:

Abstract

1. Introduction

2. Motivation and innovation

3. Basic principles of salp swarm algorithm

4. An improved salp swarm algorithm

4.1. DCOSSA

4.1.1. Opposition-based learning

4.1.2. Centroid opposition-based learning

4.1.3. Dimension-by-dimension centroid opposition-based learning

4.1.4. DCOSSA

4.2. DCORSSA

4.3. DCORSSA-PSO

4.4. Computational complexity of DCORSSA-PSO

5. Simulation experiment and result analysis

5.1. Testing environment

5.2. Test function

5.3. Experimental results and analysis

6. System reliability optimization model

6.1. T-S fault tree construction

6.2. System reliability optimization model

6.3. Fitness function

6.4. Results comparison and analysis

7. Conclusions and future works

Acknowledgments

Conflict of interest

References

This article has been cited by:

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Figures and Tables

Other Articles By Authors

Related pages

Tools

Export File

Citation

Format

Content

Catalog