Research article

Differential evolution particle swarm optimization algorithm based on good point set for computing Nash equilibrium of finite noncooperative game

  • Received: 05 August 2020 Accepted: 07 November 2020 Published: 17 November 2020
  • MSC : 68W50, 90C33, 91A06, 91A10

  • In this paper, a hybrid differential evolution particle swarm optimization method based on a good point set (GPDEPSO) is proposed to compute the Nash equilibrium of finite N-person noncooperative games. Stochastic functional analysis is used to prove the convergence of the algorithm. First, an ergodic initial population is generated by a good point set. Second, a simplified PSO is used as the variation operator and combined with the crossover and selection operations of differential evolution (DE). Finally, the experimental results show that the proposed algorithm has better convergence speed, accuracy, and global optimization ability than other existing algorithms for computing the Nash equilibrium of N-person noncooperative games. In particular, the algorithm is markedly more efficient for determining the Nash equilibrium of high-dimensional payoff matrix games.

    Citation: Huimin Li, Shuwen Xiang, Yanlong Yang, Chenwei Liu. Differential evolution particle swarm optimization algorithm based on good point set for computing Nash equilibrium of finite noncooperative game[J]. AIMS Mathematics, 2021, 6(2): 1309-1323. doi: 10.3934/math.2021081



    Game theory is a mathematical theory that studies interactions between decision makers; it mainly studies how players use the information they have to make decisions. Game theory is widely used in many fields, such as economics, political science, and biology [1,2,3,4,5,6]. Noncooperative games are the main component of game theory, and the Nash equilibrium is the core concept of noncooperative games. However, reaching an equilibrium requires players to form predictions through certain steps of the game; therefore, finding an equilibrium can, to a certain extent, be reduced to a computational problem.

    At present, many numerical methods have been proposed to compute Nash equilibria, such as the Lemke-Howson algorithm [7], the global Newton method [8], projection-like methods [9], and the trust region algorithm [10]. In recent years, as game problems have grown more complex, these traditional numerical methods have become nearly unusable because of the increasing difficulty of finding a solution and the growing computation time. More and more scholars have therefore focused on biologically inspired intelligent algorithms. Because they imitate rational biological behaviors, such algorithms implicitly carry the rational characteristics of games and can thus be regarded as paths toward the Nash equilibrium.

    Pavlidis [11] verified the effectiveness of three computational intelligence algorithms, namely covariance matrix adaptation evolution strategies, particle swarm optimization (PSO), and differential evolution (DE), for computing the Nash equilibrium of finite strategic games. Boryczka [12] compared DE with two well-known algorithms, simplicial subdivision and the Lemke-Howson algorithm, and showed that the DE method can obtain an approximate Nash equilibrium solution. Chen [13] utilized the genetic algorithm (GA) to acquire the Nash equilibrium of an N-person noncooperative game; through examples of bimatrix games, the implementation of the genetic algorithm was discussed and shown to be effective. Qiu [14] proposed an immune algorithm (IA) for solving game equilibria; the advantages of this algorithm for game problems and its stable convergence were verified through examples. Franken [15] investigated the application of coevolutionary training techniques based on PSO to evolve the iterated prisoner's dilemma (IPD). Jia [16] proposed an immune particle swarm optimization (IPSO) to compute finite N-person noncooperative games; the results show that this algorithm is superior to the immune algorithm and the original particle swarm algorithm for game problems. Yang [17] proposed the fireworks algorithm (FWA) to compute the Nash equilibrium of N-person noncooperative games; computer simulations demonstrate that the proposed algorithm is effective and superior to IPSO.

    The above studies all demonstrate the advantages of intelligent algorithms for Nash equilibrium problems. However, several issues remain, such as high algorithmic complexity, slow convergence speed, and low accuracy; in particular, the convergence of these algorithms has not been proven theoretically. The goal of this paper is to propose an efficient hybrid algorithm, GPDEPSO, to solve the game problem and then prove its convergence. The paper is organized as follows. In Section 2, we summarize the concept of a noncooperative N-person game. In Section 3, we propose the hybrid algorithm GPDEPSO to compute the Nash equilibrium of noncooperative N-person games. First, we initialize the population with a good point set to ensure that the initial particle distribution is global, which helps the algorithm avoid local convergence. Then, the position updating formula of PSO is simplified into a new form without a velocity term and used as a variation operator to perform variation, crossover, and selection with DE. The convergence of GPDEPSO is proved by stochastic functional analysis in Section 4. Section 5 is devoted to computational experiments; by comparing the proposed algorithm with other algorithms, its superiority is demonstrated.

    Definition 1. We consider an N-person finite strategic game defined by $\Gamma=\left(N,\{S_i\},\{U_i\},\{X_i\},\{f_i\}\right),\ i=1,\dots,n,$

    where

    (1). $N=\{1,\dots,n\}$ is the set of players and $n$ is the number of players;

    (2). $S_i=\{s_{i1},\dots,s_{im_i}\}$, $i\in N$, is the pure strategy set of player $i$, where $m_i$ is the number of strategies available to player $i$; $S=\prod_{i=1}^{n}S_i$ is the Cartesian product of the pure strategy sets of all players, and each pure strategy profile satisfies $(s_1,s_2,\dots,s_n)\in S$;

    (3). $U_i: S\to\mathbb{R}$, $i\in N$, represents the payoff function;

    (4). $X_i=\{x_i=(x_{i1},\dots,x_{im_i}): x_{ik_i}\ge 0,\ k_i=1,\dots,m_i,\ \sum_{k_i=1}^{m_i}x_{ik_i}=1\}$, $i\in N$, is the set of mixed strategies of player $i$, where $x_{ij}$ is the probability that player $i$ adopts $s_{ij}$ for $j=1,\dots,m_i$; $X=\prod_{i=1}^{n}X_i$ is the Cartesian product of the mixed strategy sets of all players, and each mixed strategy profile satisfies $(x_1,x_2,\dots,x_n)\in X$;

    (5). $f_i: X\to\mathbb{R}$, $i\in N$, represents the expected payoff function.

    Here $f_i(x_1,\dots,x_n)=\sum_{k_1=1}^{m_1}\cdots\sum_{k_n=1}^{m_n}U_i(s_{1k_1},\dots,s_{nk_n})\prod_{i=1}^{n}x_{ik_i}$ is the expected payoff of player $i$ when each player chooses a mixed strategy $x_i=(x_{i1},\dots,x_{im_i})\in X_i$, where $U_i(s_{1k_1},\dots,s_{nk_n})$ is the payoff of player $i$ when each player $i$ chooses the pure strategy $s_{ik_i}\in S_i$, $i=1,\dots,n$.

    Definition 2. If there exists $x^*=(x_1^*,\dots,x_n^*)$ such that $f_i(x_i^*,x_{-i}^*)=\max_{u_i\in X_i}f_i(u_i,x_{-i}^*)$, $\forall i\in N$, then $x^*$ is a Nash equilibrium point of the N-person finite noncooperative game, where $-i=N\setminus\{i\}$, $i\in N$.

    Conclusion 1. A mixed strategy $x^*$ is a Nash equilibrium point if and only if, for every pure strategy $s_{ik_i}$ ($1\le k_i\le m_i$) of each player $i$, $f_i(x^*)\ge f_i(x^*\|s_{ik_i})$, where $(x^*\|s_{ik_i})$ denotes the profile in which only player $i$ replaces his own strategy with $s_{ik_i}$ while the other players keep their strategies from the equilibrium solution $x^*$.

    In particular, $\Gamma$ is a bimatrix game when $n=2$, and $(x^*,y^*)$ is a Nash equilibrium solution if and only if
    $$\begin{cases} xAy^{*T}\le x^*Ay^{*T}, & \forall x,\\ x^*By^{T}\le x^*By^{*T}, & \forall y,\end{cases}$$
    where $A$ and $B$ are the payoff matrices of the two players.

    Theorem 1. A mixed strategy $x^*\in X$ is a Nash equilibrium point of the game $\Gamma$ if and only if $x^*$ is an optimal solution to the following optimization problem with optimal value $0$:

    $$\begin{cases}\min f(x)=\sum_{i=1}^{n}\max_{1\le k_i\le m_i}\{f_i(x\|s_{ik_i})-f_i(x),\,0\}\\ \text{s.t. } \sum_{k_i=1}^{m_i}x_{ik_i}=1,\\ \qquad 0\le x_{ik_i}\le 1,\quad i=1,\dots,n;\ k_i=1,\dots,m_i.\end{cases}\tag{2.1}$$

    Proof. Necessity: Suppose that $x^*$ is a Nash equilibrium point. By Conclusion 1, $f_i(x^*)\ge f_i(x^*\|s_{ik_i})$, $\forall i\in N$, $\forall s_{ik_i}\in S_i$; then

    $$\sum_{i=1}^{n}\max_{1\le k_i\le m_i}\{f_i(x^*\|s_{ik_i})-f_i(x^*),\,0\}=0.$$

    Since $x^*\in X=\prod_{i=1}^{n}X_i$, the constraint conditions of (2.1) hold. In formula (2.1), $f(x)$ is nonnegative, so $f(x)$ attains its minimum value $0$ at $x^*$.

    Sufficiency: Assume that $x^*$ is an optimal solution of problem (2.1). Since $x^*$ satisfies the constraints of (2.1), we have $x^*\in X=\prod_{i=1}^{n}X_i$. Moreover, $f(x^*)=0$ implies $f_i(x^*\|s_{ik_i})-f_i(x^*)\le 0$, that is, $f_i(x^*)\ge f_i(x^*\|s_{ik_i})$, $\forall i\in N$, $\forall s_{ik_i}\in S_i$. Hence $x^*$ is a Nash equilibrium point of the game $\Gamma$.

    For the two-person bimatrix game, the above optimization problem can be simplified as:

    $$f(x,y)=\max_{1\le i\le m}\{A_iy^{T}-xAy^{T},\,0\}+\max_{1\le j\le n}\{xB_j-xBy^{T},\,0\},\tag{2.2}$$

    where $A_i$ is the $i$th row of matrix $A$ and $B_j$ is the $j$th column of matrix $B$. Then $(x,y)$ is a Nash equilibrium solution of the two-person noncooperative game if and only if $f(x,y)=0$.
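    The fitness function (2.2) is straightforward to evaluate numerically. A minimal Python sketch (the function name and the illustrative game are ours, not from the paper):

```python
import numpy as np

def bimatrix_fitness(x, y, A, B):
    """Fitness f(x, y) of Eq. (2.2): the sum of each player's maximal gain
    from a unilateral pure-strategy deviation; it is 0 if and only if
    (x, y) is a Nash equilibrium of the bimatrix game (A, B)."""
    x, y = np.asarray(x, float), np.asarray(y, float)
    base1 = x @ A @ y                 # player 1's expected payoff x A y^T
    base2 = x @ B @ y                 # player 2's expected payoff x B y^T
    dev1 = np.max(A @ y) - base1      # best row deviation: max_i A_i y^T - x A y^T
    dev2 = np.max(x @ B) - base2      # best column deviation: max_j x B_j - x B y^T
    return max(dev1, 0.0) + max(dev2, 0.0)

# Matching-pennies-style game: the uniform profile is the unique equilibrium.
A = np.array([[1.0, -1.0], [-1.0, 1.0]])
B = -A
print(bimatrix_fitness([0.5, 0.5], [0.5, 0.5], A, B))  # 0.0
print(bimatrix_fitness([1.0, 0.0], [1.0, 0.0], A, B))  # 2.0
```

    A candidate profile can thus be scored directly; (2.1)-(2.2) reduce Nash equilibrium computation to driving this value to 0.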

    The good point set was originally proposed by Hua Luogeng et al. [19] and is defined as follows:

    (1). Let $G_s$ be the unit cube in $s$-dimensional Euclidean space; that is, $x\in G_s$ means

    $$x=(x_1,x_2,\dots,x_s),\quad 0\le x_j\le 1,\ j=1,\dots,s.\tag{3.1}$$

    (2). Let $P_n(i)$ be a set of $n$ points in $G_s$:

    $$P_n(i)=\{x^{(1)},\dots,x^{(i)},\dots,x^{(n)}\},\quad x^{(i)}=\big(\{x_1^{(i)}\},\dots,\{x_j^{(i)}\},\dots,\{x_s^{(i)}\}\big),\quad 0\le x_j^{(i)}\le 1;\ i=1,\dots,n;\ j=1,2,\dots,s,\tag{3.2}$$

    where $\{\cdot\}$ denotes the fractional part of the value.

    (3). For any given point $r=(r_1,r_2,\dots,r_s)\in G_s$, let $N_n(r)=N_n(r_1,r_2,\dots,r_s)$ denote the number of points in $P_n(i)$ that satisfy

    $$0\le x_j^{(i)}<r_j,\quad j=1,2,\dots,s.$$

    Then $\varphi(n)=\sup_{r\in G_s}\left|N_n(r)/n-|r|\right|$, where $|r|=r_1r_2\cdots r_s$, is called the deviation of the point set $P_n(i)$. If $\varphi(n)=o(1)$ as $n\to\infty$, then $P_n(i)$ is said to be uniformly distributed on $G_s$ with deviation $\varphi(n)$.

    (4). Let $r\in G_s$ and $P_n(i)=\{(\{r_1 i\},\{r_2 i\},\dots,\{r_s i\}),\ i=1,\dots,n\}$. If the deviation satisfies $\varphi(n)=C(r,\varepsilon)\,n^{-1+\varepsilon}$, where $C(r,\varepsilon)$ is a constant depending only on $r$ and $\varepsilon$ ($\varepsilon$ an arbitrarily small positive number), then $P_n(i)$ is called a good point set, and $r$ is called a good point.

    In this paper, we take

    $$r_i=\{e^i\},\quad 1\le i\le s.\tag{3.3}$$

    In the following, we generate two distribution maps (Figures 1 and 2) of populations of 500 points with the random point method and the exponential good point set method, respectively.

    Figure 1.  Two-dimensional initial population generated by random method.
    Figure 2.  Two-dimensional initial population generated by exponential sequence.

    As shown, the good point sequence is more uniform and global than the random point method. In addition, the good point method is independent of the spatial dimension, so it adapts well to high-dimensional problems. It is also stable: the distribution is identical every time the number of points is the same.
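    The construction of Eqs. (3.2)-(3.3) can be sketched in a few lines of Python (the function name is our illustrative choice):

```python
import numpy as np

def good_point_set(n, s):
    """n points of the exponential good point set in [0, 1]^s:
    the k-th point is ({k * r_1}, ..., {k * r_s}) with r_j = {e^j},
    where {.} denotes the fractional part (Eqs. (3.2)-(3.3))."""
    r = np.exp(np.arange(1, s + 1)) % 1.0      # good point r_j = {e^j}
    k = np.arange(1, n + 1).reshape(-1, 1)     # point indices 1..n
    return (k * r) % 1.0                       # fractional parts {k * r_j}

P = good_point_set(500, 2)
print(P.shape)  # (500, 2)
```

    Unlike pseudorandom initialization, the construction is deterministic: the same $n$ always yields the same point set, which is the stability property noted above.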

    DE is a simple and robust evolutionary algorithm that was first introduced by Storn and Price [20]. DE comprises four operations: initialization, variation, crossover, and selection.

    (1). Initialization

    Let each individual in the population be

    $$X_i=(x_{i1},\dots,x_{ij},\dots,x_{iD}),\quad i=1,\dots,N;\ j=1,\dots,D,\tag{3.4}$$

    where $N$ and $D$ denote the population size and the space dimension, respectively. In DE, it is generally assumed that the initial population follows a uniform probability distribution:

    $$x_{ij}=\mathrm{rand}[0,1]\cdot(x_j^U-x_j^L)+x_j^L,\quad i=1,\dots,N;\ j=1,\dots,D,\tag{3.5}$$

    where $\mathrm{rand}[0,1]$ is a random value in $[0,1]$, and $x_j^U$ and $x_j^L$ are the upper and lower bounds of the parameter variables, respectively.

    (2). Variation

    The variation operation is the main feature distinguishing DE from other evolutionary algorithms. The mutant individual $V_i=(v_{i1},\dots,v_{iD})$ is generated by

    $$V_i^{t+1}=X_{r_1}^{t}+F\,(X_{r_2}^{t}-X_{r_3}^{t}),\quad X_{r_1}^{t},X_{r_2}^{t},X_{r_3}^{t}\in X;\ i=1,\dots,N,\tag{3.6}$$

    where $r_1$, $r_2$, and $r_3$ are distinct integers between $1$ and $N$ that also differ from $i$; $F$ is a constriction factor controlling the size of the difference between two individuals; and $t$ is the current iteration.

    (3). Crossover

    We apply crossover between the parent and the mutant with a given probability to generate the new individual $U_i=(u_{i1},\dots,u_{iD})$:

    $$u_{ij}^{t+1}=\begin{cases}v_{ij}^{t+1}, & \text{if } \mathrm{rand}(j)\le CR \text{ or } j=\mathrm{rnbr}(i),\\ x_{ij}^{t}, & \text{otherwise},\end{cases}\tag{3.7}$$

    where $\mathrm{rand}(j)$ is a random value in $[0,1]$, $CR\in[0,1]$ is the crossover rate, and $\mathrm{rnbr}(i)\in\{1,\dots,D\}$ is a randomly selected index, which ensures that the new individual takes at least one component from the variation vector.

    (4). Selection

    The offspring $X_i^{t+1}$ is generated by selecting between the trial individual and the parent according to

    $$X_i^{t+1}=\begin{cases}U_i^{t+1}, & \text{if } f(U_i^{t+1})<f(X_i^{t}),\\ X_i^{t}, & \text{otherwise},\end{cases}\tag{3.8}$$

    where $f(\cdot)$ is the fitness function.
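    One full DE generation, combining Eqs. (3.6)-(3.8), can be sketched as follows (a self-contained illustration with an assumed sphere test function, not the paper's code):

```python
import numpy as np

rng = np.random.default_rng(0)

def de_step(pop, f, F=0.5, CR=0.9):
    """One DE generation over pop (N x D): variation (3.6),
    binomial crossover (3.7), greedy selection (3.8)."""
    N, D = pop.shape
    new_pop = pop.copy()
    for i in range(N):
        # r1, r2, r3: mutually distinct indices, all different from i
        r1, r2, r3 = rng.choice([k for k in range(N) if k != i], 3, replace=False)
        v = pop[r1] + F * (pop[r2] - pop[r3])   # variation, Eq. (3.6)
        jrand = rng.integers(D)                 # rnbr(i): forced component
        mask = rng.random(D) <= CR
        mask[jrand] = True
        u = np.where(mask, v, pop[i])           # crossover, Eq. (3.7)
        if f(u) < f(pop[i]):                    # selection, Eq. (3.8)
            new_pop[i] = u
    return new_pop

# Greedy selection guarantees the best fitness never worsens.
sphere = lambda x: float(np.sum(x ** 2))
pop = rng.uniform(-5, 5, size=(20, 4))
before = min(sphere(x) for x in pop)
for _ in range(50):
    pop = de_step(pop, sphere)
after = min(sphere(x) for x in pop)
print(after <= before)  # True
```

    The monotone improvement shown here is exactly the property used in the convergence analysis of Section 4.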

    PSO, proposed by Eberhart and Kennedy [21], is a random search algorithm inspired by the flocking behavior of birds. PSO uses a population of individuals called particles, and it has two main operations, velocity updating and position updating:

    $$\begin{aligned} v_{ij}^{t+1}&=\omega v_{ij}^{t}+c_1r_1(x_{pbest}^{t}-x_{ij}^{t})+c_2r_2(x_{gbest}^{t}-x_{ij}^{t}),\\ x_{ij}^{t+1}&=x_{ij}^{t}+v_{ij}^{t+1},\end{aligned}\qquad i=1,\dots,N;\ j=1,\dots,D.\tag{3.9}$$

    Here $\omega$ is the inertia weight; $c_1$ and $c_2$ are acceleration constants; $r_1$ and $r_2$ are random values in the interval $(0,1)$; $v_{ij}^{t}$ and $x_{ij}^{t}$ are the velocity and position of the $i$th particle in generation $t$; $x_{pbest}^{t}$ is the personal best position of particle $i$ before generation $t$; and $x_{gbest}^{t}$ is the global best position in the search history [22].

    The velocity update can be interpreted as follows: $\omega v_{ij}^{t}$ is the current state of the particle and balances global and local search; $c_1r_1(x_{pbest}^{t}-x_{ij}^{t})$ is the cognitive component, representing the particle's ability to learn from itself and giving particles strong local search capability; $c_2r_2(x_{gbest}^{t}-x_{ij}^{t})$ is the social component, representing information sharing among particles, i.e., the ability to learn from the entire population, which gives particles strong global search ability. The position update then moves the particles to their new positions.

    A simplified position transformation formula without the velocity term is expressed as follows [23]:

    $$x_{ij}^{t+1}=\omega x_{ij}^{t}+c_1r_1(x_{pbest}^{t}-x_{ij}^{t})+c_2r_2(x_{gbest}^{t}-x_{ij}^{t}).\tag{3.10}$$
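    The simplified update (3.10) acts directly on positions; a vectorized sketch (names are illustrative):

```python
import numpy as np

rng = np.random.default_rng(1)

def pso_variation(X, pbest, gbest, omega, c1=1.5, c2=1.5):
    """Velocity-free position update of Eq. (3.10) applied to a whole
    population X (N x D); pbest is N x D, gbest is a single D-vector."""
    r1 = rng.random(X.shape)   # fresh r1, r2 drawn per component
    r2 = rng.random(X.shape)
    return omega * X + c1 * r1 * (pbest - X) + c2 * r2 * (gbest - X)

# When a particle already sits at its personal and global best, both
# attraction terms vanish and the update reduces to pure inertia.
X = np.ones((4, 3))
out = pso_variation(X, pbest=X, gbest=np.ones(3), omega=0.9)
print(np.allclose(out, 0.9 * X))  # True
```

    Used as a variation operator, this update injects the swarm's best-position information into the DE framework without maintaining a separate velocity state.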

    The steps of GPDEPSO are described as follows:

    Step 1: Set the parameters of GPDEPSO, such as $N$, $D$, $CR$, $F_0$, $\omega_{\min}$, $\omega_{\max}$, $x^L$, $x^U$, $c_1$, $c_2$, and set the maximum number of iterations $T$ and the accuracy $\varepsilon$, where

    $$\omega=\omega_{\max}-(\omega_{\max}-\omega_{\min})\,t/T,\qquad F=F_0\cdot 2^{\lambda},\qquad \lambda=e^{\,1-T/(T+1-t)}.$$

    Step 2: Generate the initial population $P(0)$ of $N$ individuals using the good point set, with $\sum_{k_i=1}^{m_i}x_{ik_i}=1$, $x_{ik_i}\ge 0$, $x_i\in X_i$, $i=1,\dots,N$; $k_i=1,\dots,m_i$.

    Step 3: Calculate the fitness value $f(x)$ of each individual in population $P(t)$ and determine $x_{pbest}^{t}$ and $x_{gbest}^{t}$.

    Step 4: Generate population $P_1(t)$ by the variation of formula (3.10), and generate population $P_2(t)$ by the variation of formula (3.6).

    Step 5: Generate population $P'(t)$ by the crossover of formula (3.7).

    Step 6: According to formula (3.8), select between populations $P(t)$ and $P'(t)$ to generate the offspring population $P(t+1)$, and calculate the fitness values of population $P(t+1)$.

    Step 7: Determine whether to end according to the accuracy and the maximum number of iterations, and output the optimal value; otherwise, turn to step 3.

    The pseudo code of GPDEPSO is as follows:

    Algorithm 1 GPDEPSO
    Input: Parameters N, D, CR, F_0, T, ω_min, ω_max, x^L, x^U, c_1, c_2, ε
    Output: The best vector (solution) Δ
      t ← 1    (initialization with the good point set)
      for i = 1 to N do
        for j = 1 to D do
          x_{i,j}^t ← rand(0,1) · (x_j^U − x_j^L) + x_j^L
        end for
      end for
      while |f(Δ)| ≥ ε and t ≤ T do
        for i = 1 to N do
          (update the personal and global best)
          x_p^t ← x_pbest^t,  x_g^t ← x_gbest^t
          (variation and crossover)
          for j = 1 to D do
            v_{i,j}^t ← Variation(x_p^t, x_g^t, x_{i,j}^t)
            u_{i,j}^t ← Crossover(v_{i,j}^t, x_{i,j}^t)
          end for
          (selection)
          if f(u_i^t) < f(x_i^t) then
            x_i^t ← u_i^t
          end if
          if f(x_i^t) < f(Δ) then
            Δ ← x_i^t
          end if
        end for
        t ← t + 1
      end while
      return the best vector Δ
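    For concreteness, the listing above can be made runnable. The sketch below is one plausible reading of Steps 1-7 under stated assumptions: the crossover (3.7) is taken between the PSO-variant population $P_1(t)$ and the DE-variant population $P_2(t)$, and mixed strategies are repaired onto the simplex by clipping and renormalizing (the paper states the constraints but not a repair rule); all function names are ours. It is applied to the Guessing Game of Example 2:

```python
import numpy as np

def good_points(n, s):
    """Exponential good-point initial population in [0, 1]^s (Eq. (3.3))."""
    r = np.exp(np.arange(1, s + 1)) % 1.0
    return (np.arange(1, n + 1)[:, None] * r) % 1.0

def to_simplex(z, dims):
    """Assumed repair: clip to [0, 1], renormalize each player's block."""
    z = np.clip(z, 0.0, 1.0)
    out, pos = np.empty_like(z), 0
    for m in dims:
        block = z[pos:pos + m]
        s = block.sum()
        out[pos:pos + m] = block / s if s > 0 else np.full(m, 1.0 / m)
        pos += m
    return out

def gpdepso(f, dims, N=50, T=100, CR=0.9, F0=0.5, wmin=0.4, wmax=0.9,
            c1=1.5, c2=1.5, eps=1e-4, seed=0):
    rng = np.random.default_rng(seed)
    D = sum(dims)
    pop = np.array([to_simplex(z, dims) for z in good_points(N, D)])  # Step 2
    fit = np.array([f(x) for x in pop])
    pbest, pfit = pop.copy(), fit.copy()
    g = pop[fit.argmin()].copy()                                      # Step 3
    for t in range(1, T + 1):
        w = wmax - (wmax - wmin) * t / T       # Step 1: parameter schedules
        F = F0 * 2.0 ** np.exp(1.0 - T / (T + 1.0 - t))
        for i in range(N):
            r1, r2 = rng.random(D), rng.random(D)
            v_pso = (w * pop[i] + c1 * r1 * (pbest[i] - pop[i])
                     + c2 * r2 * (g - pop[i]))     # Step 4: PSO variant (3.10)
            a, b, c = rng.choice([k for k in range(N) if k != i], 3,
                                 replace=False)
            v_de = pop[a] + F * (pop[b] - pop[c])  # Step 4: DE variant (3.6)
            jrand = rng.integers(D)                # Step 5: crossover (3.7)
            mask = rng.random(D) <= CR
            mask[jrand] = True
            u = to_simplex(np.where(mask, v_pso, v_de), dims)
            fu = f(u)                              # Step 6: selection (3.8)
            if fu < fit[i]:
                pop[i], fit[i] = u, fu
            if fit[i] < pfit[i]:
                pbest[i], pfit[i] = pop[i].copy(), fit[i]
        g = pop[fit.argmin()].copy()
        if fit.min() < eps:                        # Step 7: stopping rule
            break
    return g, float(fit.min())

# Guessing Game (Example 2): the unique equilibrium is x = y = (0.5, 0.5).
A = np.array([[1.0, -1.0], [-1.0, 1.0]])
B = -A
def guess_fitness(z):
    x, y = z[:2], z[2:]
    return (max(float(np.max(A @ y) - x @ A @ y), 0.0)
            + max(float(np.max(x @ B) - x @ B @ y), 0.0))

sol, val = gpdepso(guess_fitness, [2, 2])
print(np.round(sol, 3), val)
```

    With the defaults above, the run stops as soon as the fitness drops below ε, and the returned profile should approach the uniform equilibrium reported in Table 2.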

    The main operations of GPDEPSO are a variation operation based on PSO, plus the crossover and selection operations of DE. The variation-crossover operations can be viewed as a difference operator based on PSO (DOPSO), which is essentially a variation operation; under it, the individual $X_i$ generates an intermediate individual $V_i$ with probability at least $1-(1-CR-1/D)^D$. The selection operation is regarded as a selection operator (SO), which strictly follows the strategy of survival of the fittest: it produces a better new generation of individuals by eliminating the inferior of $X_i$ and $V_i$.

    For the minimization problem, the GPDEPSO evaluation sequence $\{f(X_i^{t}): 1\le t\le T\}$ is monotonically nonincreasing. The convergence of GPDEPSO is established by He's stochastic functional analysis method [24]. One iteration of GPDEPSO is abstracted as a composite random mapping of a DOPSO and an SO.

    Definition 3. The process of generating the trial vector $U$ by DOPSO recombines and transforms every one-dimensional component of the target vector according to the probability $\theta=CR+1/D$. The process can be described as a random mapping on the solution space $\Psi_1:\Omega\times S\to S^2$, defined by

    $$\mu\{\omega\,|\,\Psi_1(\omega,X_i)=(X_i,V_i)\}=\mu\{V_i=X_{r_1}+F(X_{r_2}-X_{r_3})\}=1-(1-\theta)^D,\tag{4.1}$$

    where $(\Omega,\mathcal{A},\mu)$ is a complete probability measure space, $\Omega$ is a nonempty abstract set whose elements $\omega$ are elementary events, $\mathcal{A}$ is the $\sigma$-algebra composed of some subsets of $\Omega$, $\mu$ is the probability measure on $\mathcal{A}$, and $S$ is the solution space $S=\{X\,|\,X=(x_1,x_2,\dots,x_D),\ x_j^L\le x_j\le x_j^U,\ j=1,2,\dots,D\}$.

    Definition 4. The selection operator SO selects the better individual from the trial vector $U_i$ and the target vector $X_i$ according to the greedy selection method. It is a mapping on the solution space $\Psi_2:S^2\to S$:

    $$\Psi_2(X_i,U_i)=\arg\min\{f(X_i),f(U_i)\}.\tag{4.2}$$

    Combining the above two definitions, one iteration of GPDEPSO is equivalent to applying the mapping $\Psi=(\Psi_2\circ\Psi_1):\Omega\times S\to S$ (i.e., $\Omega\times S\to S^2\to S$) to the current population $P(t)$, where $\Psi$ is the reverse-order composition of the DOPSO and SO mappings. The new population $P(t+1)$ can then be expressed as $P(t+1)=\Psi(\omega,P(t))=\Psi_2(\Psi_1(\omega,P(t)))$, $0\le t\le T-1$.

    Let $f(X_{best}^{t})$ be the fitness value of the best individual $X_{best}^{t}$ in $P(t)$. Under the action of $\Psi$, each new generation produced by GPDEPSO is superior to the previous one; therefore, the sequence $\{f(X_{best}^{t})\}_{1\le t\le T}$ is necessarily monotonically nonincreasing (assuming that $f(X)$ is the minimization objective in this paper). Moreover, the evolutionary process of GPDEPSO can also be characterized by the optimal individual, and the mapping $\Psi$ can be redefined as the mapping corresponding to the process of generating the optimal individual, that is, $X_{best}^{t+1}=\Psi(\omega,X_{best}^{t})=\Psi_2(\Psi_1(\omega,X_{best}^{t}))$.

    Lemma 1. Let $\lambda: S\times S\to\mathbb{R}$ be the distance defined on $S$ satisfying $\lambda(X_i,X_j)=|f(X_i)-f(X_j)|$, $\forall X_i,X_j\in S$; then $(S,\lambda)$ is a complete separable metric space.

    Similar to the method presented in Ref. [25], Lemma 1 is established.

    Theorem 2. The random mapping $\Psi=(\Psi_2\circ\Psi_1):\Omega\times S\to S$ formed by one iteration of GPDEPSO is a random contraction operator.

    Proof. By the definitions of the DOPSO and SO operations, the new population generated in each iteration of GPDEPSO is better than the previous one. Therefore, for the random mapping $\Psi=(\Psi_2\circ\Psi_1):\Omega\times S\to S$, there exists a nonnegative real-valued random variable $K(\omega)$ with $0\le K(\omega)<1$ such that the following holds:

    $$\lambda\big(\Psi(\omega,X_{best}^{t-1}),\Psi(\omega,X_{best}^{t})\big)=\lambda(X_{best}^{t},X_{best}^{t+1})=|f(X_{best}^{t})-f(X_{best}^{t+1})|\le K(\omega)\,|f(X_{best}^{t-1})-f(X_{best}^{t})|=K(\omega)\,\lambda(X_{best}^{t-1},X_{best}^{t}).\tag{4.3}$$

    Letting

    $$\Omega_0=\{\omega\,|\,\lambda(\Psi(\omega,X_{best}^{t-1}),\Psi(\omega,X_{best}^{t}))\le K(\omega)\,\lambda(X_{best}^{t-1},X_{best}^{t})\}\subseteq\Omega,$$

    we have

    $$\mu(\Omega_0)=1.$$

    Therefore, the mapping $\Psi:\Omega\times S\to S$ formed by GPDEPSO is a random contraction operator.

    Lemma 2. (Random Contraction Mapping Theorem) [26]

    Let $\Psi:\Omega\times S\to S$ be a random operator such that $\Psi(\omega)$ is a contraction operator for almost all $\omega\in\Omega$; then $\Psi(\omega)$ has a unique random fixed point $\xi(\omega)$, with $\Psi(\omega,\xi(\omega))=\xi(\omega)$.

    By Theorem 2, the random mapping $\Psi$ is a random contraction operator. By Lemma 2, $\Psi(\omega)$ must have a unique random fixed point. The convergence criterion of GPDEPSO then follows, which means that GPDEPSO is asymptotically convergent.

    We take the classical games given in [16] and [17] as our numerical examples, solve them with the GPDEPSO proposed in this paper, and compare the results with other algorithms. (For Examples 1-3, the parameters are set as $N=50$, $T=100$, $\varepsilon=10^{-4}$, etc. For the high-dimensional Examples 4 and 5, the parameters are, respectively, $N=50$, $T=100$, $\varepsilon=10^{-5}$, etc., and $N=100$, $T=300$, $\varepsilon=10^{-5}$, etc.)

    Example 1 Prisoner's Dilemma Game Γ(X1,Y1,A1,B1):

    $$A_1=\begin{bmatrix}-8 & 0\\ -15 & -1\end{bmatrix},\qquad B_1=\begin{bmatrix}-8 & -15\\ 0 & -1\end{bmatrix}.$$

    Example 2 Guessing Game Γ(X2,Y2,A2,B2):

    $$A_2=\begin{bmatrix}1 & -1\\ -1 & 1\end{bmatrix},\qquad B_2=\begin{bmatrix}-1 & 1\\ 1 & -1\end{bmatrix}.$$

    Example 3 Supervisory Game Γ(X3,Y3,A3,B3):

    $$A_3=\begin{bmatrix}0 & 50\\ 30 & 30\end{bmatrix},\qquad B_3=\begin{bmatrix}-10 & -50\\ 60 & 70\end{bmatrix}.$$

    Example 4 Consider a Three-Dimensional Payoff Matrix Game Γ(X4,Y4,A4,B4):

    $$A_4=\begin{bmatrix}1 & 0 & 0\\ 0 & 1 & 0\\ 0 & 0 & 1\end{bmatrix},\qquad B_4=\begin{bmatrix}0 & 1 & 0\\ 0 & 0 & 1\\ 1 & 0 & 0\end{bmatrix}.$$

    Example 5 Consider a 10-Dimensional Payoff Matrix Game Γ(X5,Y5,A5,B5):

    $$A_5=\begin{bmatrix}1 & 0 & \cdots & 0\\ 0 & 1 & \cdots & 0\\ \vdots & & \ddots & \vdots\\ 0 & 0 & \cdots & 1\end{bmatrix}_{10\times 10},\qquad B_5=\begin{bmatrix}0 & 1 & 0 & \cdots & 0\\ 0 & 0 & 1 & \cdots & 0\\ \vdots & & & \ddots & \vdots\\ 1 & 0 & 0 & \cdots & 0\end{bmatrix}_{10\times 10},$$

    i.e., $A_5$ is the $10\times 10$ identity matrix and $B_5$ is the corresponding $10\times 10$ cyclic permutation matrix, analogous to Example 4.

    The following Tables 1-4 present computational results of the above examples by GPDEPSO.

    Table 1.  Results of Prisoner's Dilemma Game Γ(X1,Y1,A1,B1).
    Test times Iteration times Mixed strategy of player 1 Mixed strategy of player 2 Fitness value
    Firstly 1 (1, 0) (1, 0) 0
    Secondly 2 (1, 0) (1, 0) 0
    Thirdly 1 (1, 0) (1, 0) 0
    Fourthly 1 (1, 0) (1, 0) 0

    Table 2.  Results of Guessing Game Γ(X2,Y2,A2,B2).
    Test times Iteration times Mixed strategy of player 1 Mixed strategy of player 2 Fitness value
    Firstly 4 (0.5000, 0.5000) (0.5000, 0.5000) 3.8124e-05
    Secondly 4 (0.5000, 0.5000) (0.5000, 0.5000) 2.2619e-05
    Thirdly 7 (0.5000, 0.5000) (0.5000, 0.5000) 8.2926e-05
    Fourthly 6 (0.5000, 0.5000) (0.5000, 0.5000) 8.5655e-05

    Table 3.  Results of Supervisory Game Γ(X3,Y3,A3,B3).
    Test times Iteration times Mixed strategy of player 1 Mixed strategy of player 2 Fitness value
    Firstly 10 (0.2000, 0.8000) (0.4000, 0.6000) 6.7765e-05
    Secondly 12 (0.2000, 0.8000) (0.4000, 0.6000) 9.8433e-05
    Thirdly 11 (0.2000, 0.8000) (0.4000, 0.6000) 2.9027e-05
    Fourthly 12 (0.2000, 0.8000) (0.4000, 0.6000) 3.3894e-05

    Table 4.  Results of Three-Dimensional Payoff Matrix Game Γ(X4,Y4,A4,B4).
    Test times Iteration times Mixed strategy of player 1 Mixed strategy of player 2 Fitness value
    Firstly 19 (0.3333, 0.3333, 0.3333) (0.3333, 0.3333, 0.3333) 8.6491e-06
    Secondly 20 (0.3333, 0.3333, 0.3333) (0.3333, 0.3333, 0.3333) 9.3384e-06
    Thirdly 17 (0.3333, 0.3333, 0.3333) (0.3333, 0.3333, 0.3333) 9.3761e-06
    Fourthly 19 (0.3333, 0.3333, 0.3333) (0.3333, 0.3333, 0.3333) 6.3213e-06


    From the above four tables, we can see that the Nash equilibrium is obtained under the given parameters by the proposed GPDEPSO. In addition, the accuracy of the fitness values is approximately $10^4$ and $10^2$ times higher than those of Refs. [16] and [17], respectively. The speed of the algorithm is not affected by the increased population size ($N=50$); on the contrary, the results of Examples 1, 3, and 4 show that the number of iterations is reduced significantly, most obviously in Example 4: compared with Ref. [16], the number of iterations in Example 4 is reduced by a factor of approximately 16, and by a factor of approximately 8 compared with Ref. [17]. Through the above analysis, GPDEPSO is superior to the algorithms presented in the existing literature in terms of both the number of iterations and the accuracy of the results. Furthermore, although the population is larger than in the previous literature, this does not affect the speed or accuracy of the algorithm, and it plays a powerful role in finding the global optimal solution.

    The following are two comparison figures for solving Example 5, a high-dimensional payoff matrix game. Figure 3 compares GPDEPSO with the hybrid differential evolution particle swarm optimization algorithm (DEPSO). Figure 4 compares GPDEPSO with the differential evolution algorithm based on a good point set (GPDE) and the particle swarm optimization algorithm based on a good point set (GPPSO). The Nash equilibrium solution calculated by all methods is $x=y=(0.1,\dots,0.1)_{1\times 10}$.

    Figure 3.  Comparison of GPDEPSO and DEPSO.
    Figure 4.  Comparison of GPDEPSO, GPDE and GPPSO.

    As shown in Figures 3 and 4, the high-dimensional payoff matrix game is solved better by GPDEPSO. Compared with Ref. [17], not only is the calculation speed greatly improved, but the accuracy of the fitness value is also better. Figure 3 shows that GPDEPSO converges faster and more smoothly than DEPSO, which indicates that the good point set helps avoid falling into a local optimum during the calculation. Comparing GPDEPSO with GPPSO and GPDE in Figure 4, we find that GPDEPSO combines the ability of PSO to find the global optimum quickly with the fast convergence of DE. In Example 5, GPDEPSO finds the region of the global optimal solution within the first 50 iterations, and an approximate exact solution is obtained after 150 iterations. Therefore, GPDEPSO has a clear advantage in solving Nash equilibrium problems with high-dimensional payoff matrices.

    We propose GPDEPSO from the point of view of DE, considering the different characteristics of DE and PSO, and we use a good point set to make the initial data more uniform. This algorithm combines the advantages of DE and PSO, which not only ensures the simple operation, easy implementation, and fast convergence of DE, but also enhances its global optimization ability. By solving the Nash equilibrium of noncooperative games, we find that the proposed algorithm is superior to the comparative algorithms in terms of the calculation accuracy and convergence. In particular, for a high-dimensional payoff matrix game, the efficiency of the algorithm is remarkable. In the future, since a Nash equilibrium problem is a complex, NP-hard problem, it would be interesting to consider the influence of different variation operations and selection strategies on the solution of the Nash equilibrium, as well as to consider the Nash equilibrium problems for more complex multiobjective games and multiple games.

    This study was supported by the National Natural Science Foundation of China (71961003), the Qian Jiaohe YJSCXJH ((2019)029), the Qian Kehe Platform Talents ((2017)5788), and the Qian Kehe LH Zi ((2017)7223).

    The authors declare no conflict of interest.



    [1] T. Kunieda, K. Nishimura, Finance and Economic Growth in a Dynamic Game, Dyn. Games Appl., 8 (2018), 588-600. doi: 10.1007/s13235-018-0249-7
    [2] G. M. Korres, A. Kokkinou, Political Decision in a Game Theory Approach, European SocioEconomic Integration, 28 (2013), 51-61.
    [3] A. Traulsen, Biological Models in Game Theory, Journal of Statistical Theory and Practice, 10 (2016), 472-474. doi: 10.1080/15598608.2016.1172462
    [4] M. Qingfeng, T. Shaohong, L. Zhen, Ch. Bingyao, Sh. Weixiang, A Review of Game Theory Application Research in Safety Management, IEEE Access, 8 (2020), 107301-107313.
    [5] Q. Wang, L. Zhao, L. Guo, J. Ran, Z. Lijun, X. Yujing, et al., A Generalized Nash Equilibrium Game Model for Removing Regional Air Pollutant, J. Clean. Prod., 227 (2019), 522-531.
    [6] J. Sheng, W. Zhou, B. Zhu, The Coordination of Stakeholder Interests in Environmental Regulation: Lessons From China's Environmental Regulation Policies From The Perspective of The Evolutionary Game Theory, J. Clean. Prod., 249 (2020), 119385. doi: 10.1016/j.jclepro.2019.119385
    [7] C. E. Lemke, J. T. Howson, Jr, Equilibrium Points of Bimatrix Games, Journal of the Society for Industrial and Applied Mathematics, 12 (1964), 413-423.
    [8] S. Govindan, R. Wilson, A Global Newton Method to Compute Nash Equilibria, Journal of Economic Theory, 110 (2003), 65-86. doi: 10.1016/S0022-0531(03)00005-X
    [9] J. Zhang, B. Qu, N. Xiu, Some Projection-like Methods for The Generalized Nash Equilibria, Comput. Optim. Appl., 45 (2010), 89-109. doi: 10.1007/s10589-008-9173-x
    [10] Y. X. Yuan, A Trust Region Algorithm for Nash Equilibrium Problems, Pac. J. Optim., 7 (2011), 125-138.
    [11] N. G. Pavlidis, K. E. Parsopoulos, M. N. Vrahatis, Computing Nash Equilibria Through Computational Intelligence Methods, J. Comput. Appl. Math., 175 (2005), 113-136. doi: 10.1016/j.cam.2004.06.005
    [12] U. Boryczka, P. Juszczuk, Differential Evolution as a New Method of Computing Nash Equilibria, Transactions on Computational Collective Intelligence IX, 7770 (2013), 192-216. doi: 10.1007/978-3-642-36815-8_9
    [13] C. ShiJun, S. YongGuang, W. ZongXin, A Genetic Algorithm to Acquire the Nash Equilibrium, System Engineering, 19 (2001), 67-70.
    [14] Q. ZhongHua, G. Jie, Z. YueXing, Applying Immune Algorithm to Solving Game Problem, Journal of Systems Engineering, 21 (2006), 398-404.
    [15] N. Franken, A. P. Engelbrecht, Particle Swarm Optimization Approaches to Coevolve Strategies for The Iterated Prisoner's Dilemma, IEEE T. Evolut. Comput., 9 (2005), 562-579. doi: 10.1109/TEVC.2005.856202
    [16] J. WenSheng, X. ShuWen, Y. JianFeng, W. S. Hu, Solving Nash Equilibrium for N-persons' Non-cooperative Game Based on Immune Particle Swarm Algorithm, Application Research of Computers, 29 (2012), 28-31.
    [17] Y. Yanlong, X. Shuwen, X. Shunyou, J. Wensheng, Solving Nash Equilibrium of Non-Cooperative Game Based on Fireworks Algorithm, Computer Applications and Software, (in Chinese) 35 (2018), 215-218.
    [18] J. Yu, Selection of Game Theory, Beijing: Science Press, (in Chinese) 2014.
    [19] H. Luogeng, W. Yuan, Application of Number Theory in Modern Analysis, Beijing: Science Press, 1978.
    [20] R. Storn, K. Price, Minimizing The Real Functions of The ICEC'96 Contest by Differential Evolution, Proceedings of IEEE International Conference on Evolutionary Computation, (1996), 824-844.
    [21] R. C. Eberhart, J. Kennedy, A New Optimizer Using Particle Swarm Theory. In: 6th Symposium on Micro Machine and Human Science, Nagoya, Japan, (1995), 39-43.
    [22] Y. Wu, Y. Wu, X. Liu, Couple-based Particle Swarm Optimization for Short-term Hydrothermal Scheduling, Appl. Soft Comput., 74 (2019), 440-450. doi: 10.1016/j.asoc.2018.10.041
    [23] J. Shen, F. Liang, M. Zheng, New Hybrid Differential Evolution and Particle Swarm Optimization Algorithm and Its Application, Journal of Sichuan University (Engineering Science Edition), (in Chinese) 46 (2014), 38-43.
    [24] Y. C. He, X. Z. Wang, K. Q. Liu, Y. Q. Wang, Convergent Analysis and Algorithmic Improvement of Differential Evolution, Journal of Software, 21 (2010), 875-885. doi: 10.3724/SP.J.1001.2010.03486
    [25] M. Q. Li, J. S. Kou, D. Lin, S. Q. Li, Basic Theory and Application of Genetic Algorithm, Beijing: Science Press, (in Chinese) (2002), 115-119.
    [26] T. S. Lu, Random Functional Analysis and Its Application, Qingdao: Qingdao Ocean University Press, 1990.
  • © 2021 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)