Uniformity of markov elements in deep reinforcement learning for traffic signal control

Bao-Lin Ye; Peng Wu; Lingxi Li; Weimin Wu; Bao-Lin Ye; Peng Wu; Lingxi Li; Weimin Wu

doi:10.3934/era.2024174

Electronic Research Archive

2024, Volume 32, Issue 6: 3843-3866. doi: 10.3934/era.2024174

Previous Article Next Article

Research article

Uniformity of markov elements in deep reinforcement learning for traffic signal control

1.
School of Information Science and Engineering, Jiaxing University, Jiaxing 314001, China
2.
School of Information Science and Engineering, Zhejiang Sci-Tech University, Hangzhou 310018, China
3.
Elmore Family School of Electrical and Computer Engineering, Purdue University, Indianapolis 46202, USA
4.
State Key Laboratory of Industrial Control Technology, Zhejiang University, Hangzhou 310027, China

Received: 29 February 2024 Revised: 23 April 2024 Accepted: 06 May 2024 Published: 12 June 2024

Traffic signal control (TSC) plays a crucial role in enhancing traffic capacity. In recent years, researchers have demonstrated improved performance by utilizing deep reinforcement learning (DRL) for optimizing TSC. However, existing DRL frameworks predominantly rely on manually crafted states, actions, and reward designs, which limit direct information exchange between the DRL agent and the environment. To overcome this challenge, we propose a novel design method that maintains consistency among states, actions, and rewards, named uniformity state-action-reward (USAR) method for TSC. The USAR method relies on: 1) Updating the action selection for the next time step using a formula based on the state perceived by the agent at the current time step, thereby encouraging rapid convergence to the optimal strategy from state perception to action; and 2) integrating the state representation with the reward function design, allowing for precise assessment of the efficacy of past action strategies based on the received feedback rewards. The consistency-preserving design method jointly optimizes the TSC strategy through the updates and feedback among the Markov elements. Furthermore, the method proposed in this paper employs a residual block into the DRL model. It introduces an additional pathway between the input and output layers to transfer feature information, thus promoting the flow of information across different network layers. To assess the effectiveness of our approach, we conducted a series of simulation experiments using the simulation of urban mobility. The USAR method, incorporating a residual block, outperformed other methods and exhibited the best performance in several evaluation metrics.

Keywords:

Citation: Bao-Lin Ye, Peng Wu, Lingxi Li, Weimin Wu. Uniformity of markov elements in deep reinforcement learning for traffic signal control[J]. Electronic Research Archive, 2024, 32(6): 3843-3866. doi: 10.3934/era.2024174

Related Papers:

[1]	Xintao Li, Rongrui Lin, Lianbing She . Periodic measures for a neural field lattice model with state dependent superlinear noise. Electronic Research Archive, 2024, 32(6): 4011-4024. doi: 10.3934/era.2024180
[2]	Shuang Wang, FanFan Chen, Chunlian Liu . The existence of periodic solutions for nonconservative superlinear second order ODEs: a rotation number and spiral analysis approach. Electronic Research Archive, 2025, 33(1): 50-67. doi: 10.3934/era.2025003
[3]	Lianbing She, Nan Liu, Xin Li, Renhai Wang . Three types of weak pullback attractors for lattice pseudo-parabolic equations driven by locally Lipschitz noise. Electronic Research Archive, 2021, 29(5): 3097-3119. doi: 10.3934/era.2021028
[4]	Nan Xiang, Aying Wan, Hongyan Lin . Diffusion-driven instability of both the equilibrium solution and the periodic solutions for the diffusive Sporns-Seelig model. Electronic Research Archive, 2022, 30(3): 813-829. doi: 10.3934/era.2022043
[5]	Zhili Zhang, Aying Wan, Hongyan Lin . Spatiotemporal patterns and multiple bifurcations of a reaction- diffusion model for hair follicle spacing. Electronic Research Archive, 2023, 31(4): 1922-1947. doi: 10.3934/era.2023099
[6]	Yixuan Wang, Xianjiu Huang . Ground states of Nehari-Pohožaev type for a quasilinear Schrödinger system with superlinear reaction. Electronic Research Archive, 2023, 31(4): 2071-2094. doi: 10.3934/era.2023106
[7]	Xiangwen Yin . A review of dynamics analysis of neural networks and applications in creation psychology. Electronic Research Archive, 2023, 31(5): 2595-2625. doi: 10.3934/era.2023132
[8]	Weiyu Li, Hongyan Wang . Dynamics of a three-molecule autocatalytic Schnakenberg model with cross-diffusion: Turing patterns of spatially homogeneous Hopf bifurcating periodic solutions. Electronic Research Archive, 2023, 31(7): 4139-4154. doi: 10.3934/era.2023211
[9]	Peng Yu, Shuping Tan, Jin Guo, Yong Song . Data-driven optimal controller design for sub-satellite deployment of tethered satellite system. Electronic Research Archive, 2024, 32(1): 505-522. doi: 10.3934/era.2024025
[10]	Peng Gao, Pengyu Chen . Blowup and MLUH stability of time-space fractional reaction-diffusion equations. Electronic Research Archive, 2022, 30(9): 3351-3361. doi: 10.3934/era.2022170

Abstract

1. Introduction

This paper deals with periodic measures of the following reaction-diffusion lattice systems driven by superlinear noise defined on the integer set $\mathbb Z^k$ :

$\begin{align} \begin{split} du_i(t)&+\lambda(t)u_i(t)dt-\nu(t)(u_{(i_1-1,i_2,\ldots,i_k)}(t)+u_{i_1,i_2-1,\ldots,i_k}(t)+\ldots+u_{i_1,i_2,\ldots,i_k-1}(t)\\ &-2ku_{(i_1,i_2,\ldots,i_k)}(t)+u_{(i_1+1,i_2,\ldots,i_k)}(t)+u_{(i_1,i_2+1,\ldots,i_k)}(t)+\ldots+u_{(i_1,i_2,\ldots,i_k+1)}(t))dt\\ = &f_i(t,u_i(t))dt+g_i(t)dt+\sum\limits_{j = 1}^{\infty} (h_{i,j}(t)+\delta_{i,j}\hat{\sigma}_{i,j}(t,u_i(t)))dW_j(t), \end{split} \end{align}$

(1.1)

along with initial conditions:

$\begin{equation} u_i(0) = u_{0,i}, \end{equation}$

(1.2)

where $i = (i_1, i_2, \ldots, i_k)\in \mathbb Z^k$ , $\lambda(t), \nu(t)$ are continuous functions, $\lambda(t) > 0$ , $(f_i)_{i\in\mathbb Z^k}$ and $(\hat{\sigma}_{i, j})_{i\in\mathbb Z^k, j\in\mathbb N}$ are two sequences of continuously differentiable nonlinearities with arbitrary and superlinear growth rate from $\mathbb R \times \mathbb R\to \mathbb R$ , respectively, $g = (g_i)_{i\in\mathbb Z^k}$ and $h = (h_{i, j})_{i\in\mathbb Z^k, j\in\mathbb N}$ are two time-dependent random sequences, and $\delta = (\delta_{i, j})_{i\in\mathbb Z^k, j\in\mathbb N}$ is a sequence of real numbers. The sequence of independent two-sided real-valued Wiener processes $(W_j)_{j\in\mathbb N}$ is defined on a complete filtered probability space $(\Omega, \mathcal F, \{\mathcal F_t\}_{t\in\mathbb R}, \mathbb P)$ . Furthermore, we assume that system (1.1) is a time periodic system; more precisely, there exists $T > 0$ such that the time-dependent functions $\lambda, \nu, f_i, g, h, \sigma_{i, j}(i\in\mathbb Z^k, j\in\mathbb N)$ in (1.1) are all $T$ -periodic in time.

Lattice systems are gradually becoming a large and evolving interdisciplinary research field, due to wide range of applications in physics, biology and engineering such as pattern recognition, propagation of nerve pulses, electric circuits, and so on, see ^{[1,2,3,4,5,6]} and the references therein for more details. The well-posedness and the dynamics of these equations have been studied by many authors, ^[7,8,9,10] for deterministic systems and ^{[11,12,13,14,15,16,17,18,19]} for stochastic systems where the existence of random attractors and probability measures have been examined. Especially, the authors research the limiting behavior of periodic measures of lattice systems in ^[15].

Nonlinear noise was proposed and studied for the first time in ^[19], the authors researches the long-term behavior of lattice systems driven by nonlinear noise in terms of random attractors and invariant measures. Before that, the research on noise was limited to additive noise and linear multiplicative noise, which can be transformed into a deterministic system. However, if the diffusion coefficients are nonlinear, then one cannot convert the stochastic system into a pathwise deterministic one, and thereby this problem cannot be studied under the frameworks of deterministic systems aforementioned. As an extension of ^[19], a class of reaction-diffusion lattice systems driven by superlinear noise, where the noise has a superlinear growth order $q\in [2, p)$ , is studied by taking advantage of the dissipativeness of the nonlinear drift function $f_i$ in (1.1) to control the superlinear noise in ^[20].

In the paper, we will study the existence of periodic measures of reaction-diffusion lattice systems drive by superlinear noise. One of the main tasks in our analysis is to solve the superlinear noise terms. We remark that if the noise grows linearly, then the estimates we need can be obtained by applying the standard methods available in the literature. We adopt the ideas that take advantage of the nonlinear drift terms' the polynomical growth rate $p\ (p\ge2)$ to control the noise polynomical rate $q\in[2, p)$ . Furthermore, notice that $l^2$ is an infinite-dimensional phase space and problem (1.1)–(1.2) is defined on the unbounded set $\mathbb Z^k$ . The unboundedness of $\mathbb{Z}^k$ as well as the infinite-dimensionalness of $l^2$ introduce a major difficulty, because of the non-compactness of usual Sobolev embeddings on unbounded domains. We will employ the dissipativeness of the drift function in (1.1) as well as a cutoff technique to prove that the tails of solutions are uniformly small in $L^2(\Omega, l^2)$ . Based upon this fact we obtain the tightness of distribution laws of solutions, and then the existence of periodic measures.

In the next section, we discuss the well-poseness of solutions of (1.1) and (1.2). Section 3 is devoted to the uniform estimates of solutions including the uniform estimates on the tails of solutions. In Section 4, we show the existence of periodic measures of (1.1) and (1.2).

2. Global well-posedness of reaction-diffusion lattice systems with superlinear noise

In this section, we prove the existence and uniqueness of solutions to system (1.1) and (1.2). We first discuss the assumptions on the nonlinear drift and diffusion terms in (1.1).

We begin with the following Banach space:

$\begin{equation*} l^r = \{u = (u_i)_{i\in\mathbb Z^k}:\sum\limits_{i\in\mathbb Z^k}|u_i|^r < +\infty\}\ \text{with norm}\ \|u\|_r = \Big(\sum\limits_{i\in\mathbb Z^k}|u_i|^r\Big)^{\frac{1}{r}}, \forall r\ge 1. \end{equation*}$

The norm and inner product of $l^2$ are denoted by $(\cdot, \cdot)$ and $\|\cdot\|$ , respectively. For the nonlinear drift function $f_i\in C^1(\mathbb R\times\mathbb R, \mathbb R)$ in the equation we assume that for all $s\in\mathbb R$ and $i\in\mathbb Z^k$ ,

$\begin{align} &f_i(t,s)s\le -\gamma_1|s|^p+\phi_{1,i},\ \phi_1 = \{\phi_{1,i}\}_{i\in\mathbb Z^k}\in l^1, \end{align}$

(2.1)

$\begin{align} & |f_i(t,s)|\le \phi_{2,i}|s|^{p-1}+\phi_{3,i},\ \phi_2 = \{\phi_{2,i}\}_{i\in\mathbb Z^k}\in l^\infty,\ \phi_3 = \{\phi_{3,i}\}_{i\in\mathbb Z^k}\in l^2, \end{align}$

(2.2)

$\begin{align} & |f^{'}_i(t,s)|\le \phi_{4,i}|s|^{p-2}+\phi_{5,i},\ \phi_4 = \{\phi_{4,i}\}_{i\in\mathbb Z^k}\in l^\infty, \ \phi_5 = \{\phi_{5,i}\}_{i\in\mathbb Z^k}\in l^\infty, \end{align}$

(2.3)

where $p > 2$ and $\gamma_1 > 0$ are constants. For the sequence of continuously differentiable diffusion functions $\hat{\sigma} = (\hat{\sigma}_{i, j})_{i\in\mathbb Z^k, j\in\mathbb N}$ , we assume, for all $s\in\mathbb R$ and $j\in\mathbb N$ ,

$\begin{align} &|\hat{\sigma}_{i,j}(t,s)|\le\varphi_{1,i}|s|^{\frac{q}{2}} +\varphi_{2,i},\ \varphi_1 = \{\varphi_{1,i}\}_{i\in\mathbb Z^k}\in l^{\frac{2p}{p-q}},\ \varphi_2 = \{\varphi_{2,i}\}_{i\in\mathbb Z^k}\in l^2, \end{align}$

(2.4)

$\begin{align} &|\hat{\sigma}_{i,j}^{'}(t,s)|\le\varphi_{3,i}|s|^{\frac{q}{2}-1}+\varphi_{4,i},\ \varphi_3 = \{\varphi_{3,i}\}_{i\in\mathbb Z^k}\in l^{q},\ \varphi_4 = \{\varphi_{4,i}\}_{i\in\mathbb Z^k}\in l^\infty, \end{align}$

(2.5)

where $q\in[2, p)$ is a constant. For processes $g(t) = (g_i(t))_{i\in\mathbb Z^k}$ and $h(t) = (h_{i, j})_{i\in\mathbb Z^k, j\in\mathbb N}$ are both continuous in $t\in\mathbb R$ , which implies that for all $t\in\mathbb R$ ,

$\begin{equation} \|g(t)\|^2 = \sum\limits_{i\in\mathbb Z^k}|g_i(t)|^2 < \infty\ \text{and}\ \|h(t)\|^2 = \sum\limits_{i\in\mathbb Z^k}\sum\limits_{j\in\mathbb N}|h_{i,j}(t)|^2 < \infty. \end{equation}$

(2.6)

In addition, we assume $\delta = (\delta_{i, j})_{i\in\mathbb Z^k, j\in\mathbb N}$ satisfies

$\begin{equation} c_\delta: = \sum\limits_{j\in\mathbb N}\sum\limits_{i\in\mathbb Z^k}|\delta_{i,j}|^2 < \infty. \end{equation}$

(2.7)

We will investigate the periodic measures of system (1.1)–(1.2) for which we assume that all given time-dependent functions are T-periodic in $t\in\mathbb R$ for some $T > 0$ ; that is, for all $t\in\mathbb R, i\in\mathbb Z^k$ and $k\in\mathbb N$ .

$\begin{align*} \begin{split} &\lambda(t+T) = \lambda(t),\qquad\nu(t+T) = \nu(t),\qquad h(t+T) = h(t),\\ &g(t+T) = g(t),\qquad f(t+T,\cdot) = f(t,\cdot),\qquad\sigma(t+T,\cdot) = \sigma(t,\cdot). \end{split} \end{align*}$

If $m:\mathbb R\to \mathbb R$ is a continuous T-periodic function, we denote

$\begin{equation*} \overline{m} = \max\limits_{0\le t\le T}m(t),\qquad\underline{m} = \min\limits_{0\le t\le T}m(t). \end{equation*}$

We want to reformulate problem (1.1)–(1.2) as an abstract one in $l^2$ . Given $1\le j\le k, u = (u_i)_{i\in\mathbb Z^k}\in l^2$ and $i = (i_1, i_2, \ldots, i_k)\in\mathbb Z^k$ . Let us define the operators from $l^2$ to $l^2$ by

$\begin{align*} \begin{split} &(B_ju)_i = u_{(i_1,\ldots,i_j+1,\ldots,i_k)}-u_{(i_1,\ldots,i_j,\ldots,i_k)},\\ &(B_j^*u)_i = u_{(i_1,\ldots,i_j-1,\ldots,i_k)}-u_{(i_1,\ldots,i_j,\ldots,i_k)},\\ &(A_ju)_i = -u_{(i_1,\ldots,i_j+1,\ldots,i_k)}+2u_{(i_1,\ldots,i_j,\ldots,i_k)}-u_{(i_1,\ldots,i_j-1,\ldots,i_k)}, \end{split} \end{align*}$

and

$\begin{align*} \begin{split} (A_ku)_i = &-u_{(i_1-1,i_2,\ldots,i_k)}-u_{(i_1,i_2-1,\ldots,i_k)}-\ldots-u_{(i_1,i_2,\ldots,i_k-1)}\\ &+2ku_{(i_1,i_2,\ldots,i_k)}-u_{(i_1+1,i_2,\ldots,i_k)}-u_{(i_1,i_2+1,\ldots,i_k)}-\ldots-u_{(i_1,i_2,\ldots,i_k+1)}. \end{split} \end{align*}$

For all $1\le j\le k, u = (u_i)_{i\in\mathbb Z^k}\in l^2$ and $v = (v_i)_{i\in\mathbb Z^k}\in l^2$ we see

$\begin{equation} \|B_ju\|\le 2\|u\|,(B^*_ju,v) = (u,B_jv),A_j = B_jB_j^*\ \text{and}\ A_k = \sum\limits_{j = 1}^{k}A_j. \end{equation}$

(2.8)

Again, define the operators $f, \sigma_j:\mathbb R\times l^2\to l^2$ by

$\begin{equation*} f(t,u) = (f_i(t,u_i))_{i\in\mathbb Z^k}\ \text{and}\ \sigma_j(t,u) = (\delta_{i,j}\hat{\sigma}_{i,j}(t,u_i))_{i\in\mathbb Z^k},\forall t\in\mathbb R,\forall u = (u_i)_{i\in\mathbb Z^k}\in l^2. \end{equation*}$

It follows from (2.3) that there exists $\theta\in (0, 1)$ such that for $p > 2$ and $u, v\in l^2$ ,

$\begin{align} \begin{split} \sum\limits_{i\in\mathbb Z^k}|f_i(t,u_i)-f_i(t,v_i)|^2& = \sum\limits_{i\in\mathbb Z^k}|f_i^{'}(\theta u_i+(1-\theta)v_i)|^2|u_i-v_i|^2\\ &\le \sum\limits_{i\in\mathbb Z^k}(|\phi_{4,i}||\theta u_i+(1-\theta)v_i|^{p-2}+|\phi_{5,i}|)^2|u_i-v_i|^2\\ &\le \sum\limits_{i\in\mathbb Z^k}(2^{2p-4}|\phi_{4,i}|^2(|u_i|^{2p-4}+|v_i|^{2p-4})+2|\phi_{5,i}|^2)|u_i-v_i|^2\\ &\le (2^{2p-4}\|\phi_4\|^2_{l^\infty}(\|u\|^{2p-4}+\|v\|^{2p-4})+2\|\phi_5\|^2_{l^\infty})\|u-v\|^2. \end{split} \end{align}$

(2.9)

This together with $f(t, 0)\in l^2$ by (2.2) yields $f(t, u)\in l^2$ for all $u\in l^2$ , and thereby $f:\mathbb R\times l^2\to l^2$ is well-defined. In addition, we deduce from (2.9) that $f:\mathbb R\times l^2\to l^2$ is a locally Lipschitz continuous function, that is, for every $n\in\mathbb N$ , we can find a constant $c_1(n) > 0$ satisfying, for all $u, v\in l^2$ with $\|u\|\le n$ and $\|v\|\le n$ ,

$\begin{equation} \|f(u)-f(v)\|\le c_1(n)\|u-v\|. \end{equation}$

(2.10)

For $q\in[2, p)$ and $u\in l^2$ , one can deduce from(2.4), (2.7) and Young's inequality that for all $\varpi > 0,$

$\begin{array}{l} \varpi\sum\limits_{j\in\mathbb N}\|\sigma_j(t,u)\|^2 = \varpi\sum\limits_{j\in\mathbb N}\sum\limits_{i\in\mathbb Z^k}|\delta_{i,j}\hat{\sigma}_{i,j}(t,u_i)|^2\\ \le 2\varpi \sum\limits_{j\in\mathbb N}\sum\limits_{i\in\mathbb Z^k}|\delta_{i,j}|^2(|\varphi_{1,i}|^2|u_i|^q+|\varphi_{2,i}|^2)\le 2\varpi c_\delta\sum\limits_{i\in\mathbb Z^k}(|\varphi_{1,i}|^2|u_i|^q+|\varphi_{2,i}|^2)\\ \le \frac{\gamma_1}{2}\sum\limits_{i\in\mathbb Z^k}|u_i|^p+\frac{p-q}{p}\Big(\frac{p\gamma_1}{2q}\Big)^{-\frac{q}{p-q}}(2\varpi c_\delta)^{\frac{p}{p-q}}\sum\limits_{i\in\mathbb Z^k}|\varphi_{1,i}|^{\frac{2p}{p-q}}+2\varpi c_\delta\sum\limits_{i\in\mathbb Z^k}|\varphi_{2,i}|^2\\ \le \frac{\gamma_1}{2}\|u\|_p^p+\frac{p-q}{p}\Big(\frac{p\gamma_1}{2q}\Big)^{-\frac{q}{p-q}}(2\varpi c_\delta)^{\frac{p}{p-q}}\|\varphi_1\|_{\frac{2p}{p-q}}^{\frac{2p}{p-q}}+2\varpi c_\delta\|\varphi_2\|^2, \end{array}$

(2.11)

where $\gamma_1$ is the same number as in (2.1). From (2.11) and $l^2\subseteq l^p$ for $p > 2$ , we find that $\sigma_j(t, u)\in l^2$ for all $u\in l^2$ . Then $\sigma_j:\mathbb R\times l^2\to l^2$ is also well-defined. In addition, it yields from (2.5) and (2.7) that there exists $\eta\in (0, 1)$ such that for $q\in [2, p)$ and $u, v\in l^2$ ,

$\begin{array}{l} \sum\limits_{j\in\mathbb N}\sum\limits_{i\in\mathbb Z^k}|\delta_{i,j}\hat{\sigma}_{i,j}(t,u_i)-\delta_{i,j}\hat{\sigma}_{i,j}(t,v_i)|^2 = \sum\limits_{i\in\mathbb Z^k}\sum\limits_{j\in\mathbb N}|\delta_{i,j}|^2|\hat{\sigma}_{i,j}(t,u_i)-\hat{\sigma}_{i,j}(t,v_i)|^2\\ = \sum\limits_{i\in\mathbb Z^k}\sum\limits_{j\in\mathbb N}|\delta_{i,j}|^2|\hat{\sigma}_{i,j}^{'}(\eta u_i+(1-\eta)v_i)|^2|u_i-v_i|^2\\ \le c_\delta\sum\limits_{i\in\mathbb Z^k}(|\varphi_{3,i}||\eta u_i+(1-\eta)v_i|^{\frac{q}{2}-1}+|\varphi_{4,i}|)^2|u_i-v_i|^2\\ \le c_\delta\sum\limits_{i\in\mathbb Z^k}(2^{q-2}|\varphi_{3,i}|^2(|u_i|^{q-2}+|v_i|^{q-2})+2|\varphi_{4,i}|^2)|u_i-v_i|^2\\ \le c_\delta\sum\limits_{i\in\mathbb Z^k}\Big(2^{q-2}\Big(\frac{4}{q}|\varphi_{3,i}|^q+\frac{q-2}{q}|u_i|^q+\frac{q-2}{q}|v_i|^q\Big)\\ \qquad+2|\varphi_{4,i}|^2\Big)|u_i-v_i|^2\\ \le c_\delta(2^{q-1}(\|\varphi_3\|^q_q+\|u\|^q+\|v\|^q)+2\|\varphi_4\|^2_{l^\infty})\|u-v\|^2. \end{array}$

(2.12)

This implies that $\sigma_j:\mathbb R\times l^2\to l^2$ is also locally Lipschitz continuous, more precisely, for every $n\in\mathbb N$ , one can find a constant $c_2(n) > 0$ satisfying, for all $u, v\in l^2$ with $\|u\|\le n$ and $\|v\|\le n,$

$\begin{equation} \sum\limits_{j\in\mathbb N}\|\sigma_j(u)\|^2\le c_2^2(n). \end{equation}$

(2.13)

and

$\begin{equation} \sum\limits_{j\in\mathbb N}\|\sigma_j(u)-\sigma_j(v)\|^2\le c_2^2(n)\|u-v\|^2. \end{equation}$

(2.14)

By above notations one is able to rewrite (1.1)–(1.2) as the following system in $l^2$ for $t > 0$ :

$\begin{equation} du(t)+\nu(t)A_ku(t)dt+\lambda(t)u(t)dt = f(t,u(t))dt+g(t)dt+\sum\limits_{j = 1}^{\infty}(h_j(t)+\sigma_j(t,u(t)))dW_j(t), \end{equation}$

(2.15)

with initial condition:

$\begin{equation} u(0) = u_0\in l^2, \end{equation}$

(2.16)

in the present article, the solutions of system (2.15)–(2.16) are interpreted in the following sense.

Definition 2.1. Suppose $u_0\in L^2(\Omega, l^2)$ is $\mathcal F_0$ -measurable, a continuous $l^2$ -valued $\mathcal F_t$ -adapted stochastic process $u$ is called a solution of equations (2.15) and (2.16) if $u\in L^2(\Omega, C([0, T], l^2))\cap L^p(\Omega, L^p(0, T;l^p))$ for all $T > 0$ , and the following equation holds for all $t\ge0$ and almost all $\omega\in\Omega$ :

$\begin{align} \begin{split} u(t) = &u_0+\int_{0}^{t}(-\nu(s)A_ku(s)-\lambda(s)u(s)+f(s,u(s))+g(s))ds\\ &+\sum\limits_{j = 1}^{\infty}\int_{0}^{t}(h_j(s)+\sigma_j(s,u(s)))dW_j(s)\ \text{in}\ l^2. \end{split} \end{align}$

(2.17)

Similar to Ref.^[20], we can get (2.15) and (2.16) exist global solutions in the sense of Definition 2.1.

3. Uniform estimates

In this section, we derive the uniform estimates of solutions of (2.15)–(2.16). These estimates will be used to establish the tightness of a set of probability distributions of $u$ in $l^2$ .

We assume that

$\begin{equation} \alpha(t) = \lambda(t)-16k|\nu(t)| > 0. \end{equation}$

(3.1)

Lemma 3.1. Let (2.1)–(2.7) and (3.1) hold. Then the solutions $u(t, 0, u_0)$ of system (2.15) and (2.16) with initial data $u_0$ at time $0$ satisfy, for all $t\ge0$ ,

$\begin{align} \begin{split} &E(\|u(t,0,u_0)\|^2)+\int_{0}^{t}e^{\underline{\alpha}(r-t)}E(\|u(r,0,u_0)\|_p^p)dr\\ &\le L_1\Big(E(\|u_0\|^2)+\sum\limits_{j = 1}^{\infty}\overline{\|h_j\|}^2+\overline{\|g\|}^2+\|\varphi_1\|^{\frac{2p}{p-q}}_{\frac{2p}{p-q}}+\|\varphi_2\|^2+\|\phi_1\|_1\Big), \end{split} \end{align}$

(3.2)

where $L_1 > 0$ is a positive constant which depends on $\underline{\alpha}, p, q, \gamma, c_\delta, t,$ but indepentent of $u_0$ .

Proof. Applying Ito's formula to (2.15) we get

$\begin{align*} \begin{split} &d(\|u(t)\|^2)+2\nu(t)\sum\limits_{j = 1}^{k}\|B_ju(t)\|^2dt+2\lambda(t)\|u(t)\|^2dt = 2(f(t,u(t)),u(t))dt\\ &+2(g(t),u(t))dt+\sum\limits_{j = 1}^{\infty}\|h_j(t)+\sigma(t,u(t))\|^2dt +2\sum\limits_{j = 1}^{\infty}u(t)(h_j(t)+\sigma_j(t,u(t)))dW_j(t). \end{split} \end{align*}$

This implies

$\begin{align} \begin{split} &\frac{d}{dt}E(\|u(t)\|^2)+2\nu(t)\sum\limits_{j = 1}^{k}E(\|B_ju(t)\|^2)+2\lambda(t) E(\|u(t)\|^2)\\ &\le 2E(f(t,u(t)),u(t))+2E(g(t),u(t))+2\sum\limits_{j = 1}^{\infty}E(\|h_j(t)\|^2)+2\sum\limits_{j = 1}^{\infty}E(\|\sigma(t,u(t))\|^2). \end{split} \end{align}$

(3.3)

For the second term on the left-hand side of (3.3), we have

$\begin{equation} 2|\nu(t)|\sum\limits_{j = 1}^{k}E(\|B_ju(t)\|^2)\le 8k|\nu(t)|E(\|u(t)\|^2). \end{equation}$

(3.4)

For the first term on the right-hand side of (3.3), we get from (2.1) that

$\begin{equation} 2E(f(t,u(t)),u(t))\le -2\gamma_1E(\|u(t)\|_p^p)+2\|\phi_1\|_1. \end{equation}$

(3.5)

For the second term on the right-hand side of (3.3), we have

$\begin{equation} 2E(g(t),u(t))\le \lambda(t)E(\|u(t)\|^2)+\frac{1}{\lambda(t)}E(\|g(t)\|^2). \end{equation}$

(3.6)

For the last term on the right-hand side of (3.3), we infer from (2.11) with $\omega = 2$ that

$\begin{equation} 2\sum\limits_{j = 1}^{\infty}E(\|\sigma_j(t,u(t))\|^2)\le \frac{\gamma_1}{2}E(\|u(t)\|_p^p)+\frac{p-q}{p}\Big(\frac{p\gamma_1}{2q}\Big)^{-\frac{q}{p-q}}(4c_\delta)^{\frac{p}{p-q}}\|\varphi_1\|^{\frac{2p}{p-q}}_{\frac{2p}{p-q}}+4c_\delta\|\varphi_2\|^2. \end{equation}$

(3.7)

By (3.3)–(3.7) we get

$\begin{align} \begin{split} \frac{d}{dt}E(\|u(t)&\|^2)+\underline{\alpha}E(\|u(t)\|^2)+\frac{3}{2}\gamma_1E(\|u(t)\|_p^p)\\ &\le E\Bigg(\sum\limits_{j = 1}^{\infty}2\|h_j(t)\|^2+\frac{1} {\lambda(t)}\|g(t)\|^2\Bigg)+C_1, \end{split} \end{align}$

(3.8)

implies that

$\begin{align} \begin{split} \frac{d}{dt}E(\|u(t)&\|^2)+\underline{\alpha}E(\|u(t)\|^2)+\frac{3}{2}\gamma_1E(\|u(t)\|_p^p)\\ &\le 2\sum\limits_{j = 1}^{\infty}\|\overline{h_j}\|^2 +\frac{1}{\underline{\lambda}}\|\overline{g}\|^2+C_1, \end{split} \end{align}$

(3.9)

where $C_1 = \frac{p-q}{p}\Big(\frac{p\gamma_1}{2q}\Big)^{-\frac{q}{p-q}} (4c_\delta)^{\frac{p}{p-q}}\|\varphi_1\|^{\frac{2p}{p-q}} _{\frac{2p}{p-q}}+4c_\delta\|\varphi_2\|^2+2\|\phi_1\|_1$ . Multiplying (3.9) by $e^{\underline{\alpha}t}$ and integrating over $(0, t)$ to obtain

$\begin{align} \begin{split} E(\|u(t,0,&u_0)\|^2)+\frac{3}{2}\gamma_1\int_{0}^{t}e^{\underline{\alpha}(r-t)}E(\|u(r,0,u_0)\|^p_p)dr\\ &\le e^{-\underline{\alpha}t}E(\|u_0\|^2)+C_2\int_{0}^{t}e^{\underline{\alpha}(r-t)}dr, \end{split} \end{align}$

(3.10)

where $C_2 = 2\sum_{j = 1}^{\infty}\|\overline{h_j}\|^2+\frac{1}{\underline{\lambda}}\|\overline{g}\|^2+C_1$ . This completes the proof.

Lemma 3.2. Let (2.1)–(2.7), and (3.1) be satisfied. Then for compact subset $\mathcal K$ of $l^2$ , one can find a number $N_0 = N_0(\mathcal K)\in\mathbb N$ such that the solutions $u(t, 0, u_0)$ of (2.15) and (2.16) satisfy, for all $n\ge N_0$ and $t\ge 0$ ,

$\begin{equation} E\Bigg(\sum\limits_{\|i\|\ge n}|u_i(t,0,u_0)|^2\Bigg)+\int_{0}^{t}e^{\underline{\alpha}(r-t)}E\Bigg(\sum\limits_{\|i\|\ge n}|u_i(r,0,u_0)|^p\Bigg)dr\le\varepsilon, \end{equation}$

(3.11)

where $u_0\in\mathcal K$ and $\|i\|: = \max_{i\le j\le k}|i_j|$ .

Proof. Define a smooth function $\xi:\mathbb R\to[0, 1]$ such that

$\begin{equation} \xi(s) = 0\ \mathit{\text{for}}\ |s|\le 1\ \mathit{\text{and}}\ \xi(s) = 1\ \mathit{\text{for}}\ |s|\ge 2. \end{equation}$

(3.12)

Denote by

$\begin{equation} \xi_n = \Bigg(\xi\Bigg(\frac{\|i\|}{n}\Bigg)\Bigg)_{i\in\mathbb Z^k}\ \mathit{\text{and}}\ \xi_nu = \Bigg(\xi\Bigg(\frac{\|i\|}{n}\Bigg)u_i\Bigg)_{i\in\mathbb Z^k}, \forall u = (u_i)_{i\in\mathbb Z^k},n\in\mathbb N. \end{equation}$

(3.13)

Similar notations will also be used for other terms. It follows from (2.15) that

$\begin{align} \begin{split} d(\xi_nu(t))&+\nu(t)\xi_nA_ku(t)dt+\lambda(t)\xi_nu(t)dt\\ & = \xi_nf(t,u(t))dt+\xi_ng(t)dt+\sum\limits_{j = 1}^{\infty}(\xi_nh_j(t)+\xi_n\sigma_j(t,u(t)))dW_j(t). \end{split} \end{align}$

(3.14)

By Ito's formula and (3.14) we have

$\begin{align} \begin{split} d\|\xi_n&u(t)\|^2+2\nu(t)(A_k(u(t)),\xi_n^2u(t))dt+2\lambda(t)\|\xi_nu(t)\|^2dt\\ & = 2(f(t,u(t)),\xi_n^2u(t))dt+2(g(t),\xi_n^2u(t))dt\\ &\quad+\sum\limits_{j = 1}^{\infty}\|\xi_nh_j(t)+\xi_n\sigma_j(t,u(t))\|^2dt +2\sum\limits_{j = 1}^{\infty}(h_j(t)+\sigma_j(t,u(t)),\xi_n^2u(t))dW_j. \end{split} \end{align}$

(3.15)

This yields

$\begin{align} \begin{split} \frac{d}{dt}E(\|\xi_n&u(t)\|^2)+2\nu(t)E(A_k(u(t)),\xi_n^2u(t))+2\lambda(t)E(\|\xi_nu(t)\|^2) = 2E(f(t,u(t)),\xi_n^2u(t))\\ &+2E(g(t),\xi_n^2u(t))+2\sum\limits_{j = 1}^{\infty}E(\|\xi_nh_j(t)\|^2) +2\sum\limits_{j = 1}^{\infty}E(\|\xi_n\sigma_j(t,u(t))\|^2)dt. \end{split} \end{align}$

(3.16)

For the second term on the left-hand side of (3.16), we have

$\begin{align} \begin{split} 2\nu(t)E&(A_k(u(t)),\xi_n^2u(t)) = 2\nu(t)\sum\limits_{j = 1}^{k}E(B_ju(t),B_j(\xi_n^2u(t)))\\ & = 2\nu(t)E\bigg(\sum\limits_{j = 1}^{k}\sum\limits_{i\in\mathbb Z^k}(u_{i_1,\ldots,i_j+1,\ldots,i_k}-u_i)\\ &\qquad\qquad\qquad\times\bigg(\xi^2\Big(\frac{\|(i_1,\ldots,i_j+1,\ldots,i_k)\|}{n}\Big)u_{(i_1,\ldots,i_j+1,\ldots,i_k)}-\xi^2\Big(\frac{\|i\|}{n}\Big)u_i\bigg)\bigg)\\ & = 2\nu(t)E\bigg(\sum\limits_{j = 1}^{k}\sum\limits_{i\in\mathbb Z^k}\xi^2\Big(\frac{\|i\|}{n}\Big)(u_{i_1,\ldots,i_j+1,\ldots,i_k}-u_i)^2\bigg)\\ &\quad+2\nu(t)E\bigg(\sum\limits_{j = 1}^{k}\sum\limits_{i\in\mathbb Z^k}\bigg(\xi^2\Big(\frac{\|(i_1,\ldots,i_j+1,\ldots,i_k)\|}{n}\Big)-\xi^2\Big(\frac{\|i\|}{n}\Big)\bigg)\\ &\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\times(u_{(i_1,\ldots,i_j+1,\ldots,i_k)}-u_i)u_{(i_1,\ldots,i_j+1,\ldots,i_k)}\bigg). \end{split} \end{align}$

(3.17)

We first deal with the first term on the right-hand side of (3.17). Notice that

$\begin{align} \begin{split} 2&|\nu(t)|E\bigg(\sum\limits_{j = 1}^{k}\sum\limits_{i\in\mathbb Z^k}\xi^2\Big(\frac{\|i\|}{n}\Big)(u_{i_1,\ldots,i_j+1,\ldots,i_k}-u_i)^2\bigg)\\ & = 2|\nu(t)|E\bigg(\sum\limits_{j = 1}^{k}\sum\limits_{i\in\mathbb Z^k}\Big|\xi\Big(\frac{\|i\|}{n}\Big)u_{(i_1,\ldots,i_j+1,\ldots,i_k)}-\xi\Big(\frac{\|i\|}{n}\Big)u_i\Big|^2\bigg)\\ &\le 4|\nu(t)|E\bigg(\sum\limits_{j = 1}^{k}\sum\limits_{i\in\mathbb Z^k}\Big|\bigg(\xi\Big(\frac{\|i\|}{n}\Big)-\xi\Big(\frac{\|(i_1,\ldots,i_j+1,\ldots,i_k)\|}{n}\Big)\bigg)u_{(i_1,\ldots,i_j+1,\ldots,i_k)}\Big|^2\bigg)\\ &\quad+4|\nu(t)|E\bigg(\sum\limits_{j = 1}^{k}\sum\limits_{i\in\mathbb Z^k}\Big|\xi\Big(\frac{\|(i_1,\ldots,i_j+1,\ldots,i_k)\|} {n}\Big)u_{(i_1,\ldots,i_j+1,\ldots,i_k)}-\xi\Big( \frac{\|i\|}{n}\Big)u_i\Big|^2\bigg).\\ \end{split} \end{align}$

(3.18)

By the definition of function $\xi$ , there exists a constant $C_3 > 0$ such that $|\xi^{'}(s)|\le C_3$ for all $s\in\mathbb R$ . Then the first term on the right-hand side of (3.18) is bounded by

$\begin{align} \begin{split} 4&|\nu(t)|E\bigg(\sum\limits_{j = 1}^{k}\sum\limits_{i\in\mathbb Z^k}\Big|\bigg(\xi\Big(\frac{\|i\|}{n}\Big)-\xi\Big(\frac{\|(i_1,\ldots,i_j+1,\ldots,i_k)\|}{n}\Big)\bigg)u_{(i_1,\ldots,i_j+1,\ldots,i_k)}\Big|^2\bigg)\\ & = 4|\nu(t)|E\bigg(\sum\limits_{j = 1}^{k}\sum\limits_{i\in\mathbb Z^k}\Big|\xi\Big(\frac{\|i\|}{n}\Big)-\xi\Big(\frac{\|(i_1,\ldots,i_j+1,\ldots,i_k)\|}{n}\Big)\Big|^2\big|u_{(i_1,\ldots,i_j+1,\ldots,i_k)}\big|^2\bigg)\\ &\le \frac{4C_3^2}{n^2}|\nu(t)|E\bigg(\sum\limits_{j = 1}^{k}\sum\limits_{i\in\mathbb Z^k}\big|u_{(i_1,\ldots,i_j+1,\ldots,i_k)}\big|^2\bigg)\le \frac{4C_3^2k}{n^2}|\nu(t)|E(\|u\|^2). \end{split} \end{align}$

(3.19)

By the definition of $|B_ju|_i$ , the last term on the right-hand side of (3.18) is bounded by

$\begin{align} \begin{split} 4&|\nu(t)|E\bigg(\sum\limits_{j = 1}^{k}\sum\limits_{i\in\mathbb Z^k}\Big|\xi\Big(\frac{\|(i_1,\ldots,i_j+1,\ldots,i_k)\|}{n}\Big)u_{(i_1,\ldots,i_j+1,\ldots,i_k)}-\xi\Big(\frac{\|i\|}{n}\Big)u_i\Big|^2\bigg)\\ &\le 4|\nu(t)|E\Big(\sum\limits_{j = 1}^{k}\|B_j(\xi_nu(t))\|^2\Big)\le 16k|\nu(t)|E(\|\xi_nu(t)\|^2). \end{split} \end{align}$

(3.20)

Then we find from (3.18) to (3.20) that the first term on the right-hand side of (3.17) is bounded by

$\begin{align} \begin{split} 2&|\nu(t)|E\bigg(\sum\limits_{j = 1}^{k}\sum\limits_{i\in\mathbb Z^k}\xi^2\Big(\frac{\|i\|}{n}\Big)(u_{(i_1,\ldots,i_j+1,\ldots,i_k)}-u_i)^2\bigg)\\ &\le 16k|\nu(t)|E(\|\xi_nu(t)\|^2)+\frac{4C_3^2k}{n^2}|\nu(t)|E(\|u\|^2). \end{split} \end{align}$

(3.21)

In addition, we find that the last term on the right-hand side of (3.17) can be bounded by

$\begin{align} \begin{split} 2&\Big|\nu(t)E\bigg(\sum\limits_{j = 1}^{k}\sum\limits_{i\in\mathbb Z^k}\bigg(\xi^2\Big(\frac{\|(i_1,\ldots,i_j+1,\ldots,i_k)\|}{n}\Big)-\xi^2\Big(\frac{\|i\|}{n}\Big)\bigg)\\ &\qquad\qquad\qquad\qquad\qquad\qquad\qquad\times(u_{(i_1,\ldots,i_j+1,\ldots,i_k)}-u_i)u_{(i_1,\ldots,i_j+1,\ldots,i_k)}\bigg)\Big|\\ &\le 2|\nu(t)|E\bigg(\sum\limits_{j = 1}^{k}\sum\limits_{i\in\mathbb Z^k}\Big|\xi^2\Big(\frac{\|(i_1,\ldots,i_j+1,\ldots,i_k)\|}{n}\Big)-\xi^2\Big(\frac{\|i\|}{n}\Big)\Big|\\ &\qquad\qquad\qquad\qquad\qquad\qquad\qquad\times|u_{(i_1,\ldots,i_j+1,\ldots,i_k)}-u_i||u_{(i_1,\ldots,i_j+1,\ldots,i_k)}|\bigg)\\ &\le 4|\nu(t)|E\bigg(\sum\limits_{j = 1}^{k}\sum\limits_{i\in\mathbb Z^k}\Big|\xi\Big(\frac{\|(i_1,\ldots,i_j+1,\ldots,i_k)\|}{n}\Big)-\xi\Big(\frac{\|i\|}{n}\Big)\Big|\\ &\qquad\qquad\qquad\qquad\qquad\qquad\qquad\times|u_{(i_1,\ldots,i_j+1,\ldots,i_k)}-u_i||u_{(i_1,\ldots,i_j+1,\ldots,i_k)}|\bigg)\\ &\le \frac{4C_3}{n}|\nu(t)|E\bigg(\sum\limits_{j = 1}^{k}\sum\limits_{i\in\mathbb Z^k}|u_{(i_1,\ldots,i_j+1,\ldots,i_k)}-u_i||u_{(i_1,\ldots,i_j+1,\ldots,i_k)}|\bigg)\\ &\le \frac{8kC_3}{n}|\nu(t)|E(\|u\|^2). \end{split} \end{align}$

(3.22)

By (3.21), (3.22) and (3.17), we infer that the second term on the left-hand side of (3.16) satisfied

$\begin{align} \begin{split} 2|\nu(t)E&(A_k(u(t)),\xi_n^2u(t))|\le C_4|\nu(t)|\Big(\frac{1}{n}+\frac{1}{n^2}\Big)E(\|u\|^2)+16k|\nu(t)|E(\|\xi_nu(t)\|^2), \end{split} \end{align}$

(3.23)

where $C_4 = 4kC_3(2+C_3)$ . For the first term on the right-hand side of (3.16), we find from (2.1) that

$\begin{align} \begin{split} 2E(f(t,u(t)),\xi_n^2u(t))&\le -2\gamma_1E\bigg(\sum\limits_{i\in\mathbb Z^k}\xi^2\Big(\frac{\|i\|}{n}\Big)|u_i(t)|^p\bigg)+2E\bigg(\sum\limits_{i\in\mathbb Z^k}\xi^2\Big(\frac{\|i\|}{n}\Big)|\phi_{1,i}|\bigg)\\ &\le -2\gamma_1E\bigg(\sum\limits_{i\in\mathbb Z^k}\xi^2\Big(\frac{\|i\|}{n}\Big)|u_i(t)|^p\bigg)+2\sum\limits_{\|i\|\ge n}|\phi_{1,i}|. \end{split} \end{align}$

(3.24)

For the second term on the right-hand side of (3.16), we infer from Young's inequality that

$\begin{align} \begin{split} 2E(g,\xi_n^2u(t))&\le \underline{\lambda}E(\|\xi_nu(t)\|^2)+\frac{1}{\underline{\lambda}}E\bigg(\sum\limits_{i\in\mathbb Z^k}\xi^2\Big(\frac{\|i\|}{n}\Big)|g_i|^2\bigg)\\ &\le \underline{\lambda}E(\|\xi_nu(t)\|^2)+\frac{1}{\underline{\lambda}}\sum\limits_{\|i\|\ge n}|g_i|^2. \end{split} \end{align}$

(3.25)

For the last term on the right-hand side (3.16), we infer from (2.4) and Young's inequality that

$\begin{align} \begin{split} 2&\sum\limits_{j = 1}^{\infty}E\Big(\|\xi_n\sigma_j(t,u(t))\|^2\Big) = 2\sum\limits_{j = 1}^{\infty}E\bigg(\sum\limits_{i\in\mathbb Z^k}\Big|\xi\Big(\frac{\|i\|}{n}\Big)\delta_{i,j}\hat{\sigma}_{i,j}(t,u_i(t))\Big|^2\bigg)\\ &\le 4\sum\limits_{j = 1}^{\infty}E\bigg(\sum\limits_{i\in\mathbb Z^k}\xi^2\Big(\frac{\|i\|}{n}\Big)|\delta_{i,j}|^2\Big(|\varphi_{1,i}|^2|u_i(t)|^q+|\varphi_{2,i}|^2\Big)\bigg)\\ &\le 4c_\delta E\bigg(\sum\limits_{i\in\mathbb Z^k}\xi^2\Big(\frac{\|i\|}{n}\Big)\Big(|\varphi_{1,i}|^2|u_i(t)|^q+|\varphi_{2,i}|^2\Big)\bigg)\\ &\le \gamma_1E\bigg(\sum\limits_{i\in\mathbb Z^k}\xi^2\Big(\frac{\|i\|}{n}\Big)|u_i(t)|^p\bigg)+\frac{p-q}{p}\Big(\frac{p\gamma_1}{q}\Big)^{-\frac{q}{p-q}}(4c_\delta)^{\frac{p}{p-q}}\sum\limits_{i\in\mathbb Z^k}\xi^2\Big(\frac{\|i\|}{n}\Big)|\varphi_{1,i}|^{\frac{2p}{p-q}}\\ &\quad+4c_\delta\sum\limits_{i\in\mathbb Z^k}\xi^2\Big(\frac{\|i\|}{n}\Big)|\varphi_{2,i}|^2\\ &\le \gamma_1E\bigg(\sum\limits_{i\in\mathbb Z^k}\xi^2\Big(\frac{\|i\|}{n}\Big)|u_i(t)|^p\bigg)+\frac{p-q}{p}\Big(\frac{p\gamma_1}{q}\Big)^{-\frac{q}{p-q}}(4c_\delta)^{\frac{p}{p-q}}\sum\limits_{\|i\|\ge n}|\varphi_{1,i}|^{\frac{2p}{p-q}}\\ &\quad+4c_\delta\sum\limits_{\|i\|\ge n}|\varphi_{2,i}|^2. \end{split} \end{align}$

(3.26)

Substituting (3.23)–(3.26) into (3.16) we get

$\begin{align} \begin{split} &\frac{d}{dt}E(\|\xi_nu(t)\|^2)+\underline{\alpha}E(\|\xi_nu(t)\|^2)+\gamma_1E\bigg(\sum\limits_{i\in\mathbb Z^k}\xi^2\Big(\frac{\|i\|}{n}\Big)|u_i(t)|^p\bigg)\\ &\le C_4|\nu|\Big(\frac{1}{n}+\frac{1}{n^2}\Big)E(\|u\|^2)+C_5\bigg(\sum\limits_{\|i\|\ge n}\Big(\overline{|g_i|}^2+|\varphi_{1,i}|^{\frac{2p}{p-q}}+|\varphi_{2,i}|^2+|\phi_{1,i}|\Big)+\sum\limits_{\|i\|\ge n}\sum\limits_{j = 1}^{\infty}\overline{|h_{i,j}|}^2\bigg), \end{split} \end{align}$

(3.27)

where $C_5 = 2+\frac{1}{\underline{\lambda}}+\frac{p-q}{p}(\frac{p\gamma_1}{q})^{-\frac{q}{p-q}}(4c_\delta)^{\frac{p}{p-q}}+4c_\delta$ . One can multiply (3.27) by $e^{\underline{\alpha}t}$ and integrate over $(0, t)$ in order to obtain

$\begin{align} \begin{split} &E(\|\xi_nu(t,0,u_0)\|^2)+\gamma_1\int_{0}^{t}e^{\underline{\alpha}(r-t)}E\bigg(\sum\limits_{i\in\mathbb Z^k}\xi^2\Big(\frac{\|i\|}{n}\Big)|u_i(r,0,u_0)|^p\bigg)dr\\ &\le e^{-\underline{\alpha}t}E(\|\xi_nu_0\|^2)+C_4|\nu|\Big(\frac{1}{n}+\frac{1}{n^2}\Big)\int_{0}^{t}e^{\underline{\alpha}(r-t)}E(\|u(r,0,u_0)\|^2)dr\\ &+\frac{C_5}{\underline{\alpha}}\bigg(\sum\limits_{\|i\|\ge n}\Big(\overline{|g_i|}^2+|\varphi_{1,i}|^{\frac{2p}{p-q}}+|\varphi_{2,i}|^2+|\phi_{1,i}|\Big)+\sum\limits_{\|i\|\ge n}\sum\limits_{j = 1}^{\infty}\overline{|h_{i,j}|}^2\bigg). \end{split} \end{align}$

(3.28)

Since $\mathcal K$ is a compact subset of $l^2$ we infer from (3.1) that

$\begin{equation} \lim\limits_{n\to\infty}\sup\limits_{u_0\in\mathcal K}\sup\limits_{t\ge 0}e^{-\underline{\alpha}t}E(\|\xi_nu_0\|^2)\le \lim\limits_{n\to\infty}\sup\limits_{u_0\in\mathcal K}E(\sum\limits_{\|i\|\ge n}|u_{0,i}|^2) = 0. \end{equation}$

(3.29)

By Lemma 3.1, we find that for all $u_0\in\mathcal K$ and $t\ge 0$ , as $n\to\infty$ ,

$\begin{align} \begin{split} &\Big(\frac{1}{n}+\frac{1}{n^2}\Big)\int_{0}^{t}e^{\underline{\alpha}(r-t)}E(\|u(r,0,u_0)\|^2)dr\\ &\le \frac{L_1}{\underline{\alpha}}\Big(\frac{1}{n}+\frac{1}{n^2}\Big)\Big(E(\|u_0\|^2)+\sum\limits_{j = 1}^{\infty}\overline{\|h_j\|}^2+\overline{\|g\|}^2+\|\varphi_1\|^{\frac{2p}{p-q}}_{\frac{2p}{p-q}}+\|\varphi_2\|^2+\|\phi_1\|_1\Big)\\ &\le \frac{L_1}{\underline{\alpha}}\Big(\frac{1}{n}+\frac{1}{n^2}\Big)\Big(C_{6}+\sum\limits_{j = 1}^{\infty}\overline{\|h_j\|}^2+\overline{\|g\|}^2+\|\varphi_1\|^{\frac{2p}{p-q}}_{\frac{2p}{p-q}}+\|\varphi_2\|^2+\|\phi_1\|_1\Big)\to0, \end{split} \end{align}$

(3.30)

where $L_1$ is the same number of (3.1) and $C_{6} > 0$ is a constant depending only on $u_0$ .By $\varphi_1\in l^{\frac{2p}{p-q}}, \varphi_2\in l^2, \phi_1\in l^1$ , (2.6) and (3.1), we infer that

$\begin{equation} \sum\limits_{\|i\|\ge n}\Big(\overline{|g_i|}^2+|\varphi_{1,i}|^{\frac{2p}{p-q}}+|\varphi_{2,i}|^2+|\phi_{1,i}|\Big)+\sum\limits_{\|i\|\ge n}\sum\limits_{j = 1}^{\infty}\overline{|h_{i,j}|}^2\to 0\ \mathit{\text{as}}\ n\to\infty. \end{equation}$

(3.31)

It follows from (3.28) to (3.31) that as $n\to\infty$ ,

$\begin{equation} \sup\limits_{u_0\in\mathcal K}\sup\limits_{t\ge 0}\bigg(E(\|\xi_nu(t,0,u_0)\|^2)+\int_{0}^{t}e^{\underline{\alpha}(r-t)}E\Big(\sum\limits_{i\in\mathbb Z^k}\xi^2\Big(\frac{\|i\|}{n}\Big)|u_i(r,0,u_0)|^p\Big)dr\bigg)\to0. \end{equation}$

(3.32)

Then for every $\varepsilon > 0$ we can find a number $N_0 = N_0(\mathcal K)\in\mathbb N$ satisfying, for all $n\ge N_0$ and $t\ge 0$ ,

$\begin{align} \begin{split} &\bigg(E\Big(\sum\limits_{\|i\|\ge 2n}|u_i(t,0,u_0)|^2\Big)+\int_{0}^{t}e^{\underline{\alpha}(r-t)}E\Big(\sum\limits_{\|i\|\ge 2n}|u_i(t,0,u_0)|^p\Big)dr\bigg)\\ &\le \bigg(E\Big(\|\xi_nu(t,0,u_0)\|^2\Big)+\int_{0}^{t}e^{\underline{\alpha}(r-t)}E\Big(\sum\limits_{i\in\mathbb Z^k}\xi^2\Big(\frac{\|i\|}{n}\Big)|u_i(t,0,u_0)|^p\Big)dr\bigg)\le\varepsilon, \end{split} \end{align}$

(3.33)

uniformly for $u_0\in\mathcal K$ and $t\ge 0$ . This concludes the proof.

4. Existence of periodic measures

In the sequel, we use $\mathcal L(u(t, 0, u_0))$ to denote the probability distribution of the solution $u(t, 0, u_0)$ of (2.15)–(2.16) which has initial condition $u_0$ at initial time $0$ . Then we have the following tightness of a family of distributions of solutions.

Lemma 4.1. Suppose (2.1)–(2.7) and (3.1) hold. Then the family $\{\mathcal L(u(t, 0, u_0)):t\ge 0\}$ of the distributions ofthe solutions of (2.15)–(2.16) is tight on $l^2$ .

Proof. For simplicity, we will write the solution $u (t, 0, u_0)$ as $u(t)$ from now on. It follows from Lemma 3.1 that there exists a constant $c_1 > 0$ such that

$\begin{equation} E\left( {\left\| {u(t) } \right\|^2 } \right) \le c_1 ,\quad\text{for all} \quad t\geq0. \end{equation}$

(4.1)

By Chebyshev's inequality, we get from (4.1) that for all $t\geq0$ ,

$P\left( {\left\| {u(t) } \right\|^2\ge R } \right) \le \frac{{c_1 }}{{R^2 }}\rightarrow 0\quad\text{as}\quad R\rightarrow \infty.$

Hence for every $\epsilon > 0$ , there exists $R_1 = R_1(\epsilon) > 0$ such that for all $t\geq0$ ,

$\begin{equation} P\left\{ {\left\| {u(t) } \right\|^2 \ge R_1 } \right\} \le \frac{1}{2}\epsilon. \end{equation}$

(4.2)

By Lemma 3.2, we infer that for each $\epsilon > 0$ and $m\in \mathbb{N}$ , there exists an integer $n_m = n_m(\epsilon, m)$ such that for all $t\geq 0$ ,

$E\left( { \sum\limits_{\left| i \right| > n_m } {\left| {u_i \left( t \right)} \right|^2 } } \right) < \frac{\epsilon }{{2^{2m + 2} }},$

and hence for all $t\geq 0$ and $m\in \mathbb{N}$ ,

$\begin{equation} P\left( {\left\{ { \sum\limits_{\left| i \right| > n_m } {\left| {u_i \left( r \right)} \right|^2 } \ge \frac{1}{{2^m }}} \right\}} \right) \le 2^m E\left( { \sum\limits_{\left| i \right| > n_m } {\left| {u_i \left( r \right)} \right|^2 } } \right) < \frac{\epsilon }{{2^{m + 2} }}. \end{equation}$

(4.3)

It follows from (4.3) for all $t\geq 0$ ,

$P\left( {\mathop \cup \limits_{m = 1}^\infty \left\{ { \sum\limits_{\left| i \right| > n_m } {\left| {u_i \left( t \right)} \right|^2 } \ge \frac{1}{{2^m }}} \right\}} \right) \le \sum\limits_{m = 1}^\infty {\frac{\epsilon }{{2^{m + 2} }} \le \frac{1}{4}\epsilon ,}$

which shows that for all $t\geq 0$ ,

$\begin{equation} P\left( {\left\{ { \sum\limits_{\left| i \right| > n_m } {\left| {u_i \left( t \right)} \right|^2 } \le \frac{1}{{2^m }}\,\,\text{for all}\,\, m\in \mathbb{N}} \right\}} \right) > 1 -\frac{\epsilon}{2}. \end{equation}$

(4.4)

Given $\epsilon > 0$ , set

$\begin{align} Y_{1,\epsilon } & = \left\{ {v \in l^2:\left\| {v } \right\| \le R_1 \left(\epsilon \right)} \right\}, \end{align}$

(4.5)

$\begin{align} Y_{2,\epsilon } & = \left\{ {v \in l^2: \sum\limits_{\left| i \right| > n_m } {\left| {v_i \left( r \right)} \right|^2 } \le \frac{1}{{2^m }}\,\,\text{for all}\,\, m\in \mathbb{N}} \right\}, \end{align}$

(4.6)

and

$\begin{equation} Y_\epsilon = Y_{1,\epsilon} \cap Y_{2,\epsilon }. \end{equation}$

(4.7)

By (4.2) and (4.4) we get, for all $t\geq0$ ,

$\begin{equation} P\left( {\left\{ {u(t) \in Y_\epsilon } \right\}} \right) > 1 - \epsilon . \end{equation}$

(4.8)

Now, we show the precompactness of $\left\{ {v:v \in Y_\epsilon } \right\}$ in $l^2$ . Given $\kappa > 0$ , choose an integer $m_0 = m_0 \left(\kappa \right) \in \mathbb{N}$ such that $2^{m_0 } > \frac{8}{{\kappa ^2 }}$ . Then by (4.6) we obtain

$\begin{equation} \sum\limits_{\left| i \right| > n_{m_0 } } {\left| {v_i } \right|^2 } \le \frac{1}{{2^{m_0 } }} < \frac{{\kappa ^2 }}{8},\quad \forall v \in Y_\epsilon. \end{equation}$

(4.9)

On the other hand, by (4.5) we see that the set $\left\{ {(v_i)_{|i|\leq m_0}:v \in Y_\epsilon } \right\}$ is bounded in the finite-dimensional space $R^{2m_0+1}$ and hence precompact. Consequently, $\left\{ {v:v \in Y_\epsilon } \right\}$ has a finite open cover of balls with radius $\frac{\kappa }{2}$ , which along with (4.9) implies that the set $\left\{ {v:v \in Y_\epsilon } \right\}$ has a finite open cover of balls with radius $\kappa$ in $l^2$ . Since $\kappa > 0$ is arbitrary, we find that the set $\left\{ {v: v\in Y_\epsilon } \right\}$ is precompact in $l^2$ . This completes the proof.

If $\phi:l^2\to\mathbb R$ is a bounded Borel function, then for $0\le r\le t$ and $u_0\in l^2$ , we set

$(p_{r,t}\phi)(u_0) = E(\phi(u(t,r,u_0)))$

and

$p(r,u_0;t,\Gamma) = (p_{r,t}1_\Gamma)(u_0),$

where $\Gamma\in\mathcal B(l^2)$ and $1_\Gamma$ is the characteristic function of $\Gamma$ . The operators $p_{s, t}$ with $0\le s\le t$ are called the transition operators for the solutions of (2.15)–(2.16). Recall that a probability measure $\nu$ on $l^2$ is periodic for (2.15)–(2.16) if

$\int_{l^2}(p_{0,t+T}\phi)(u_0)d\nu(u_0) = \int_{l^2} (p_{0,t}\phi)(u_0)d\nu(u_0),\qquad\forall t\ge0.$

Lemma 4.2. ^[21]Let $\varrho(\psi, \omega)$ be a scalar bounded measurable randomfunction of $\psi$ , independent of $\mathcal F_s$ . Let $\varsigma$ be an $\mathcal F_s$ -measurable random variable. Then

$E\left( {\varrho \left( {\varsigma ,\omega } \right)|\mathcal F_s } \right) = E\left( {\varrho \left( {\varsigma ,\omega } \right)} \right).$

The transition operators $\{p_{r, t}\}_{0\le r\le t}$ have the following properties.

Lemma 4.3. Assume that (2.1)–(2.7) and (3.1) hold. Then:

(i) $\{p_{r, t}\}_{0\le r\le t}$ is Feller; that is, for every bounded andcontinuous $\phi: l^2\to\mathbb R$ , the function $p_{r, t}\phi: l^2\to\mathbb R$ is also bounded and continuous for all $0\le r\le t.$

(ii) The family $\{p_{r, t}\}_{0\le r\le t}$ is T-periodic; that is, for all $0\le r\le t$ ,

$p(r, u_0;t,\cdot) = p(r+T,u_0;t+T,\cdot),\qquad\forall u_0\in l^2.$

(iii) $\{u(t, 0, u_0)\}_{t\ge 0}$ is a $l^2$ -valued Markov process.

Finally, we present our main result on the existence of periodic measures for problem (2.15)–(2.16).

Theorem 4.4. Assume that (2.1)–(2.7) and (3.1) hold. Then problem (2.15)–(2.16) has a periodic measure on $l^2$ .

Proof. We apply Krylov-Bogolyubov's method to prove the existence of periodic measures of (2.15)–(2.16), define a probability measure $\mu_n$ by

$\begin{equation} \mu_n = \frac{1}{n}\sum\limits_{l = 1}^{n}p(0,0;lT,\cdot). \end{equation}$

(4.10)

By Lemma 4.1 we see the sequence $\{\mu_n\}^\infty_{n = 1}$ is tight on $l^2$ , and hence there exists a probability measure $\mu$ on $l^2$ such that, up to a subsequence,

$\begin{equation} \mu_n\to\mu,\qquad\text{as}\ n\to\infty. \end{equation}$

(4.11)

By (4.10)–(4.11) and Lemma 4.3, we infer that for every $t\ge0$ and every bounded and continuous function $\phi:l^2\to\mathbb R,$

$\begin{align} \begin{split} &{\int_{l^2} {\left( {p_{0,t} \phi } \right)\left( u_0 \right)d\mu \left( u_0 \right)} } = \int_{l^2} {\left( {\int_{l^2} {\phi \left( y \right) p\left( {0, u_0 ;t,dy} \right)} } \right)} d\mu \left( u_0 \right) \\ & = \mathop {\lim }\limits_{n \to \infty } \frac{1}{n}\sum\limits_{l = 1}^n {\int_{l^2} {\left( {\int_{l^2} {\phi \left( y \right)p\left( {0,u_0 ;t,dy} \right)} } \right)} p\left( {0,0;lT,du_0 } \right)} \\ & = \mathop {\lim }\limits_{n \to \infty } \frac{1}{n}\sum\limits_{l = 1}^n {\int_{l^2} {\left( {\int_{l^2} {\phi \left( y \right)p\left( {kT,u_0 ;t + lT,dy} \right)} } \right)} p\left( {0,0;kT,du_0 } \right)} \\ & = \mathop {\lim }\limits_{n \to \infty } \frac{1}{n}\sum\limits_{l = 1}^n {\int_{l^2} {\phi \left( y \right)p\left( {0,0;t + lT,dy} \right)} } \\ & = \mathop {\lim }\limits_{n \to \infty } \frac{1}{n}\sum\limits_{l = 1}^n {\int_{l^2} {\phi \left( y \right)p\left( {0,0;t + lT + T,dy} \right)} } \\ & = \mathop {\lim }\limits_{n \to \infty } \frac{1}{n}\sum\limits_{k = 1}^n {\int_{l^2} {\left( {\int_{l^2} {\phi \left( y \right)p\left( {0,u_0 ;t + T,dy} \right)} } \right)} p\left( {0,0;lT,du_0 } \right)} \\ & = \int_{l^2} {\left( {\int_{l^2} {\phi \left( y \right)p\left( {0,u_0;t + T,dy} \right)} } \right)} d\mu \left( u_0\right)\\ & = {\int_{l^2} {\left( {p_{0,t + T} \phi } \right)\left( u_0 \right)d\mu \left( u_0 \right)} } , \end{split} \end{align}$

(4.12)

which shows that $\mu$ is a periodic measure of (2.15)–(2.16), as desired.

Conflict of interest

The author declares there is no conflict of interest.

References

[1]	B. Ye, S. Zhu, L. Li, W. Wu, Short-term traffic flow prediction at isolated intersections based on parallel multi-task learning, Syst. Sci. Control Eng., 12 (2024), 1–17. https://doi.org/10.1080/21642583.2024.2316160 doi: 10.1080/21642583.2024.2316160
[2]	M. J. Smith, T. Iryo, R. Mounce, K. Satsukawa, D. Watling, Zero-queue traffic control, using green-times and prices together, Transp. Res. Part C: Emerging Technol., 138 (2022), 103630. https://doi.org/10.1016/j.trc.2022.103630 doi: 10.1016/j.trc.2022.103630
[3]	B. Ye, W. Wu, L. Li, W. Mao, A hierarchical model predictive control approach for signal splits optimization in large-scale urban road networks, IEEE Trans. Intell. Transp. Syst., 17 (2016), 2182–2192. https://doi.org/10.1109/TITS.2016.2517079 doi: 10.1109/TITS.2016.2517079
[4]	H. Wang, J. Zhu, B. Gu, Model-based deep reinforcement learning with traffic inference for traffic signal control, Appl. Sci., 13 (2023), 4010. https://doi.org/10.3390/app13064010 doi: 10.3390/app13064010
[5]	B. Ye, W. Wu, K. Ruan, L. Li, T. Chen, H. Gao, et al., A survey of model predictive control methods for traffic signal control, IEEE/CAA J. Autom. Sin., 6 (2019), 623–640. https://doi.org/10.1109/JAS.2019.1911471 doi: 10.1109/JAS.2019.1911471
[6]	B. B. Elallid, N. Benamar, A. S. Hafid, T. Rachidi, N. Mrani, A comprehensive survey on the application of deep and reinforcement learning approaches in autonomous driving, J. King Saud Univ.-Comput. Inf. Sci., 34 (2022), 7366–7390. https://doi.org/10.1016/j.jksuci.2022.03.013 doi: 10.1016/j.jksuci.2022.03.013
[7]	B. Ye, W. Wu, W. Mao, A two-way arterial signal coordination method with queueing process considered, IEEE Trans. Intell. Transp. Syst., 16 (2015), 3440–3452. https://doi.org/10.1109/TITS.2015.2461493 doi: 10.1109/TITS.2015.2461493
[8]	X. Li, Webster sequences, apportionment problems, and just-in-time sequencing, Discrete Appl. Math., 306 (2022), 52–69. https://doi.org/10.1016/j.dam.2021.09.020
[9]	T. Thunig, R. Scheffler, M. Strehler, K. Nagel, Optimization and simulation of fixed-time traffic signal control in real-world applications, Proc. Comput. Sci., 151 (2019), 826–833. https://doi.org/10.1016/j.procs.2019.04.113 doi: 10.1016/j.procs.2019.04.113
[10]	C. Yu, W. Ma, X. Yang, A time-slot based signal scheme model for fixed-time control at isolated intersections, Transp. Res. Part B: Methodol., 140 (2020), 176–192. https://doi.org/10.1016/j.trb.2020.08.004 doi: 10.1016/j.trb.2020.08.004
[11]	A. J. Calle-Laguna, J. Du, H. A. Rakha, Computing optimum traffic signal cycle length considering vehicle delay and fuel consumption, Transp. Res. Interdiscip. Perspect., 3 (2019), 100021. http://doi.org/10.1016/j.trip.2019.100021 doi: 10.1016/j.trip.2019.100021
[12]	M. Noaeen, A. Naik, L. Goodman, J. Crebo, T. Abrar, Z. S. H. Abad, et al., Reinforcement learning in urban network traffic signal control: A systematic literature review, Expert Syst. Appl., 199 (2022), 116830. https://doi.org/10.1016/j.eswa.2022.116830 doi: 10.1016/j.eswa.2022.116830
[13]	R. Bokade, X. Jin, C. Amato, Multi-agent reinforcement learning based on representational communication for large-scale traffic signal control, IEEE Access, 11 (2023), 47646–47658. https://doi.org/10.1109/ACCESS.2023.3275883 doi: 10.1109/ACCESS.2023.3275883
[14]	A. A. A. Alkhatib, K. A. Maria, S. AlZu'bi, E. A. Maria, Smart traffic scheduling for crowded cities road networks, Egypt. Inf. J., 23 (2022), 163–176. https://doi.org/10.1016/j.eij.2022.10.002 doi: 10.1016/j.eij.2022.10.002
[15]	M. R. T. Fuad, E. O. Fernandez, F. Mukhlish, A. Putri, H. Y. Sutarto, Y. A. Hidayat, et al., Adaptive deep Q-network algorithm with exponential reward mechanism for traffic control in urban intersection networks, Sustainability, 14 (2022), 14590. https://doi.org/10.3390/su142114590 doi: 10.3390/su142114590
[16]	S. Choi, D. Lee, S. Kim, S. Tak, Framework for connected and automated bus rapid transit with sectionalized speed guidance based on deep reinforcement learning: Field test in sejong city, Transp. Res. Part C: Emerging Technol., 148 (2023), 104049. https://doi.org/10.1016/j.trc.2023.104049 doi: 10.1016/j.trc.2023.104049
[17]	D. He, J. Kim, H. Shi, B. Ruan, Autonomous anomaly detection on traffic flow time series with reinforcement learning, Transp. Res. Part C: Emerging Technol., 150 (2023), 104089. https://doi.org/10.1016/j.trc.2023.104089 doi: 10.1016/j.trc.2023.104089
[18]	D. Li, F. Zhu, T. Chen, Y. D. Wong, C. Zhu, J. Wu, COOR-PLT: A hierarchical control model for coordinating adaptive platoons of connected and autonomous vehicles at signal-free intersections based on deep reinforcement learning, Transp. Res. Part C: Emerging Technol., 146 (2023), 103933, https://doi.org/10.1016/j.trc.2022.103933 doi: 10.1016/j.trc.2022.103933
[19]	I. Tunc, M. T. Soylemez, Fuzzy logic and deep Q learning based control for traffic lights, Alexandria Eng. J., 67 (2023), 343–359. https://doi.org/10.1016/j.aej.2022.12.028 doi: 10.1016/j.aej.2022.12.028
[20]	M. Gregurić, K. Kušić, E. Ivanjko, Impact of Deep Reinforcement Learning on Variable Speed Limit strategies in connected vehicles environments, Eng. Appl. Artif. Intell., 112 (2022), 104850. https://doi.org/10.1016/j.engappai.2022.104850 doi: 10.1016/j.engappai.2022.104850
[21]	B. Liu, Z. Ding, A distributed deep reinforcement learning method for traffic light control, Neurocomputing, 490 (2022), 390–399. https://doi.org/10.1016/j.neucom.2021.11.106 doi: 10.1016/j.neucom.2021.11.106
[22]	T. A. Haddad, D. Hedjazi, S. Aouag, A deep reinforcement learning-based cooperative approach for multi-intersection traffic signal control, Eng. Appl. Artif. Intell., 114 (2022), 105019. https://doi.org/10.1016/j.engappai.2022.105019 doi: 10.1016/j.engappai.2022.105019
[23]	S. M. A. B. A. Islam, A. Hajbabaie, H. A. A. Aziz, A real-time network-level traffic signal control methodology with partial connected vehicle information, Transp. Res. Part C: Emerging Technol., 121 (2020), 102830. https://doi.org/10.1016/j.trc.2020.102830 doi: 10.1016/j.trc.2020.102830
[24]	A. Jaleel, M. A. Hassan, T. Mahmood, M. U. Ghani, A. U. Rehman, Reducing congestion in an intelligent traffic system with collaborative and adaptive signaling on the edge, IEEE Access, 8 (2020), 205396–205410. https://doi.org/10.1109/ACCESS.2020.3037348 doi: 10.1109/ACCESS.2020.3037348
[25]	S. Bouktif, A. Cheniki, A. Ouni, H. El-Sayed, Deep reinforcement learning for traffic signal control with consistent state and reward design approach, Knowl.-Based Syst., 267 (2023), 110440, https://doi.org/10.1016/j.knosys.2023.110440 doi: 10.1016/j.knosys.2023.110440
[26]	S. Bouktif, A. Cheniki, A. Ouni, Traffic signal control using hybrid action space deep reinforcement learning, Sensors, 21 (2021), 2302. https://doi.org/10.3390/s21072302 doi: 10.3390/s21072302
[27]	B. Ye, P. Wu, W. Wu, L. Li, Y. Zhu, B. Chen, Q-learning based traffic signal control method for an isolated intersection, in 2022 China Automation Congress (CAC), (2022), 6063–6068, https://doi.org/10.1109/CAC57257.2022.10054839
[28]	Y. Gong, M. Abdel-Aty, Q. Cai, M. S. Rahman, Decentralized network level adaptive signal control by multi-agent deep reinforcement learning, Transp. Res. Interdiscip. Perspect., 1 (2019), 100020. https://doi.org/10.1016/j.trip.2019.100020 doi: 10.1016/j.trip.2019.100020
[29]	J. Gu, Y. Fang, Z. Sheng, P. Wen, Double deep Q-network with a dual-agent for traffic signal control, Appl. Sci., 10 (2020), 1622. https://doi.org/10.3390/app10051622 doi: 10.3390/app10051622
[30]	W. Ma, L. Wan, C. Yu, L. Zou, J. Zheng, Multi-objective optimization of traffic signals based on vehicle trajectory data at isolated intersections, Transp. Res. Part C: Emerging Technol., 120 (2020), 102821. https://doi.org/10.1016/j.trc.2020.102821 doi: 10.1016/j.trc.2020.102821
[31]	A. Lopez, W. Jin, M. A. Al Faruque, Security analysis for fixed-time traffic control systems, Transp. Res. Part B: Methodol., 139 (2020), 473–495. https://doi.org/10.1016/j.trb.2020.07.002 doi: 10.1016/j.trb.2020.07.002
[32]	W. Lin, H. Wei, Cyber-physical models for distributed CAV data intelligence in support of self-organized adaptive traffic signal coordination control, Expert Syst. Appl., 224 (2023), 120035. https://doi.org/10.1016/j.eswa.2023.120035 doi: 10.1016/j.eswa.2023.120035

This article has been cited by:

1.	Xintao Li, Rongrui Lin, Lianbing She, Periodic measures for a neural field lattice model with state dependent superlinear noise, 2024, 32, 2688-1594, 4011, 10.3934/era.2024180
2.	Xintao Li, Lianbing She, Jingjing Yao, Periodic measures of fractional stochastic discrete wave equations with nonlinear noise, 2024, 57, 2391-4661, 10.1515/dema-2024-0078
3.	Hailang Bai, Mingkai Yuan, Dexin Li, Yunshun Wu, Weak and Wasserstein convergence of periodic measures of stochastic neural field lattice models with Heaviside ’s operators and locally Lipschitz Lévy noises, 2025, 143, 10075704, 108602, 10.1016/j.cnsns.2025.108602

Reader Comments

Your name:*

Email:*
© 2024 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)