A stochastic linear-quadratic optimal control problem with jumps in an infinite horizon

Jiali Wu; Maoning Tang; Qingxin Meng; Jiali Wu; Maoning Tang; Qingxin Meng

doi:10.3934/math.2023202

AIMS Mathematics

2023, Volume 8, Issue 2: 4042-4078. doi: 10.3934/math.2023202

Previous Article Next Article

Research article

A stochastic linear-quadratic optimal control problem with jumps in an infinite horizon

1.
Department of Mathematics, Zhejiang Normal University, Jinhua 321004, China
2.
Department of Mathematical Sciences, Huzhou University, Zhejiang 313000, China

Received: 19 June 2022 Revised: 19 October 2022 Accepted: 23 October 2022 Published: 01 December 2022
MSC : 60H10, 93E24

In this paper, a stochastic linear-quadratic (LQ, for short) optimal control problem with jumps in an infinite horizon is studied, where the state system is a controlled linear stochastic differential equation containing affine term driven by a one-dimensional Brownian motion and a Poisson stochastic martingale measure, and the cost functional with respect to the state process and control process is quadratic and contains cross terms. Firstly, in order to ensure the well-posedness of our stochastic optimal control of infinite horizon with jumps, the $L^2$ -stabilizability of our control system with jump is introduced. Secondly, it is proved that the $L^2$ -stabilizability of our control system with jump is equivalent to the non-emptiness of the admissible control set for all initial state and is also equivalent to the existence of a positive solution to some integral algebraic Riccati equation (ARE, for short). Thirdly, the equivalence of the open-loop and closed-loop solvability of our infinite horizon optimal control problem with jumps is systematically studied. The corresponding equivalence is established by the existence of a $stabilizing\ solution$ of the associated generalized algebraic Riccati equation, which is different from the finite horizon case. Moreover, any open-loop optimal control for the initial state $x$ admiting a closed-loop representation is obatined.

Keywords:

stochastic linear quadratic optimal control,
stabilizability,
open-loop solvability,
closed-loop solvability,
algrbraic Riccati equation,
stabilizing solution,
closed-loop representation

Citation: Jiali Wu, Maoning Tang, Qingxin Meng. A stochastic linear-quadratic optimal control problem with jumps in an infinite horizon[J]. AIMS Mathematics, 2023, 8(2): 4042-4078. doi: 10.3934/math.2023202

Related Papers:

[1]	Wei Ji . On the equilibrium strategy of linear-quadratic time-inconsistent control problems. AIMS Mathematics, 2025, 10(3): 5480-5494. doi: 10.3934/math.2025253
[2]	Huihui Zhong, Weijian Wen, Jianjun Fan, Weijun Yang . Reinforcement learning-based adaptive tracking control for flexible-joint robotic manipulators. AIMS Mathematics, 2024, 9(10): 27330-27360. doi: 10.3934/math.20241328
[3]	Vasilii Zaitsev, Inna Kim . Arbitrary finite spectrum assignment and stabilization of bilinear systems with multiple lumped and distributed delays in state. AIMS Mathematics, 2025, 10(3): 6934-6951. doi: 10.3934/math.2025317
[4]	Yanmei Xue, Jinke Han, Ziqiang Tu, Xiangyong Chen . Stability analysis and design of cooperative control for linear delta operator system. AIMS Mathematics, 2023, 8(6): 12671-12693. doi: 10.3934/math.2023637
[5]	Erfeng Xu, Wenxing Xiao, Yonggang Chen . Local stabilization for a hyperchaotic finance system via time-delayed feedback based on discrete-time observations. AIMS Mathematics, 2023, 8(9): 20510-20529. doi: 10.3934/math.20231045
[6]	Gengjiao Yang, Fei Hao, Lin Zhang, Lixin Gao . Stabilization of discrete-time positive switched T-S fuzzy systems subject to actuator saturation. AIMS Mathematics, 2023, 8(6): 12708-12728. doi: 10.3934/math.2023640
[7]	Da-Ke Gu, Rui-Yuan Wang, Yin-Dong Liu . A parametric approach of partial eigenstructure assignment for high-order linear systems via proportional plus derivative state feedback. AIMS Mathematics, 2021, 6(10): 11139-11166. doi: 10.3934/math.2021647
[8]	Dennis Llemit, Jose Maria Escaner IV . Value functions in a regime switching jump diffusion with delay market model. AIMS Mathematics, 2021, 6(10): 11595-11609. doi: 10.3934/math.2021673
[9]	Shouzong Liu, Yang Xu, Mingzhan Huang . Comparative analysis of sterile mosquito release strategies based on a population suppression model. AIMS Mathematics, 2024, 9(9): 23344-23367. doi: 10.3934/math.20241135
[10]	Yanghe Cao, Junsheng Zhao, Zongyao Sun . State feedback stabilization problem of stochastic high-order and low-order nonlinear systems with time-delay. AIMS Mathematics, 2023, 8(2): 3185-3203. doi: 10.3934/math.2023163

Abstract

1. Introduction

LQ optimal control is an important branch of control theory. The state equation of the control system is a linear equation, the performance index is a quadratic index and the optimal control can be given in the form of linear feedback. Getting the feedback form of optimal control is the most basic issue of LQ optimal control problems. The research on LQ optimal control problems has a long history. It can be traced back to the works of Bellman-Glicksberg-Gross ^[1] in 1958, and they first attempted to solve deterministic LQ optimal control problem. In 1960, Kalman ^[10] solved the deterministic LQ optimal control problem in the form of linear state feedback, and introduced Riccati equation into the control theory. The above-mentioned works were concerned with deterministic cases, i.e., the state equation is a linear ordinary differential equation (ODE, for short), and all the involved functions are deterministic. Compared with LQ optimal control problems, the control system of stochastic LQ optimal control probelms is stochastic. In 1962, Kushner ^[11] first study a stochastic LQ optimal control probelm driven by Brownian motion with stochastic differential equation of Itô-type by dynamic programming. In 1968, Wonham ^[21,22] first extended the deterministic LQ optimal control problem to a stochastic LQ optimal control problem that contained Riccati differential equation, followed by several researchers (see, for example, Davis ^[27] and Bensoussan ^[28]). Kohlmann and Zhou ^[29] discussed the relationship between a stochastic control problem and a backward stochastic differential equation (BSDE, for short). Based on ^[29], Lim and Zhou ^[12] firstly solved a general LQ optimal control problem of BSDE and gave an explicit form of optimal control. Li, Sun and Yong ^[17] studied the open-loop and closed-loop solvability of stochastic LQ optimal control.

In classical optimal control problems, the termination time is a real number. But among many dynamic optimization problems in economics, the termination time of the optimal control problem might be infinite. We call the mentioned case the optimal control problems with infinite horizon. In 1974, Halkin ^[6] introduced the necessary conditions for optimal control problems with infinite horizon. In 2000, Rami, Zhou, and Moore ^[15] discussed well-posedness and attainability of indefinite stochastic linear-quadratic control problems over an infinite time horizon. In 2003, Wu and Li ^[23] studied an infinite horizon LQ problem with unbounded controls in a Hilbert space. In 2005, Guatteri and Tessitore ^[3] studied the backward stochastic Riccati equation in infinite dimensions, and in 2008, Guatteri and Tessitore ^[4] studied LQ optimal control problems with stochastic coefficients over an infinite horizon. In 2009, Guatteri and Masiero ^[5] discussed ergodic optimal quadratic control problems for an affine equation with stochastic coefficients over an infinite horizon. Then, Hu ^[7] studied the optimal quadratic control for an affine equation driven by Lévy processes over an infinite horizon in 2013. In 2015, Huang, Li and Yong ^[9] discussed a LQ optimal control problem for mean-field stochastic differential equations over an infinite horizon. In 2016, Sun-Li-Yong ^[17] put forward the concepts of open-loop and closed-loop solvabilities, and it was shown that the closed-loop solvability is equivalent to the existence of a regular solution of the Riccati equation. Different from finite-horizon, Sun-Yong ^[18] found that for infinite-horizon LQ optimal control problems, both the open-loop and closed-loop solvabilities are equivalent to the existence of a $static\ stabilizing\ solution$ to the associated generalized ARE. Also, every open-loop optimal control admits a closed-loop representation.

In fact, many random phenomena presented discontinuous motion characteristics with jumps. Therefore, the Poisson jump process which describes such discontinuous random phenomena came into. The following are the research status of the system with jumps. In ^[2], Boel Varaiya et al. discussed the optimal control problem of processes with jumps for the first time. In 1994, Tang and Li ^[20] first proved the necessary condition of stochastic optimal control with jumps and first discussed the BSDE with a Poisson process. In 1997, Situ ^[16] made a further research on the solutions of BSDE with jumps. In 2003, Wu and Wang ^[24] studied the stochastic LQ optimal control problem with state equation driven by Brownian motion and Poisson jump process, and obtained the existence and uniqueness result of solutions of the deterministic Riccati equation. In 2008, Hu and Øksendal ^[8] discussed the partial information linear quadratic control for jump diffusions. Then in 2009, Oksendal and Sulem ^[14] studied the maximum principles for optimal control of forward-backward stochastic differential equations with jumps. In 2014, Meng ^[13] discussed the existence and uniqueness of solutions to the backward stochastic Riccati equation with jumps. In 2018, Li-Wu-Yu ^[30] studied the stochastic LQ optimal control problem with Poisson processes under the indefinite case.

Different from the research on the optimal control problem of the jump diffusion system mentioned above, in this paper we will generalize the results of the infinite horizon stochastic LQ problem of Sun-Yong ^[18] to the jump diffusion system and our goal is to establish the corresponding stability theory and the optimal state feedback representing of the optimal control for jump diffusion system. Firstly, we introduce the concept of $L^2$ -stabilizability of a jump-diffusion system over an infinite horizon, and find that the existence of the admissible control set is equivalent to the $L^2$ -stabilizability of a jump-diffusion system over an infinite horizon. Secondly, we introduce the definition of the ARE of a jump-diffusion system over an infinite horizon and prove the equivalent among the existence of the admissible control set, the $L^2$ -stabilizability of the system and the positive solvalibity of the ARE. Thirdly, we give the concepts of the open-loop and closed-loop solvability of the stochastic LQ optimal control problem with jumps over an infinite horizon. We then introduce the concept of the stabilizing solution of the associated generalized ARE and find that both the open-loop and closed-loop solvability of the problem are equivalent to the existence of a stabilizing solution of the associated generalized ARE, which is different from the finite horizon case. Finally, we find that any open-loop optimal control for the initial state $x$ admits a closed-loop representation. In addition, Our results are generalizations of a recently published paper, Sun and Yong ^[18], on the similar topic to jump diffusion systems. The state equation in our case is a stochastic differential equation driven by a one-dimensional standard Brownian motion and a Poisson stochastic martingale measure, which is more general. It is well-known that processes arising by Poisson random measures play an increasing role in modeling stochastic dynamical systems. And it helps us deal with unexpected situations in financial problems, so it is necessary to use a jump system to characterize.

The paper is organized as follows. In Section 2, we introduce some basic notions and lemmas used throughout this paper. In Section 3, we state the problem. Section 4 aims to give the definition of $L^2$ -stable and $L^2$ -stabilizable, further we describe the structure of admissible control set and prove the equivalence of the non-emptiness of the admissible control set, the $L^2$ -stabilizability of the control system and the existence of a positive solution to an algebraic Riccati equation. In Section 5, we introduce the notions of open-loop and closed-loop solvabilities as well as the algebraic Riccati equation, and finally we state the main result of the paper.

2. Preliminaries

Throughout this paper, we let $(\Omega, \mathcal {F}, \{\mathcal {F}_{t}\}_{t\geq0}, \mathbb P)$ be a complete probability space on which a one-dimensional standard Brownian motion $W = \{W(t); t\geq0\}$ is defined. Denote by $\mathcal P$ the $\mathcal F_t$ -predictable $\sigma$ -field on $[0, \infty)\times\Omega$ and by $\mathcal B(\Lambda)$ the Borel $\sigma$ -algebra of any topological space $\Lambda$ . Let $(Z, \mathcal {B}(Z), \nu)$ be a measurable space with $\nu(Z) < \infty$ and $\eta:\Omega\times D_{\eta}\rightarrow Z$ be an $\mathcal F_t$ -adapted stationary Poisson point process with characteristic measure $\nu$ , where $D_{\eta}$ is a countable subset of $(0, \infty)$ . Then the counting measure induced by $\eta$ is

$\mu((0, t]\times A): = \#\{s\in D_{\eta};s\leq t, \eta(s)\in A\}, \; \; \; for\; \; t > 0, A\in\mathcal B(Z).$

And $\tilde{\mu}(dt, d\theta) = \mu(dt, d\theta)-dt\nu(d\theta)$ is a compensated Poisson random martingle measure which is assumed to be independent of the Brownian motion $W$ . In the following, we introduce the basic notations used throughout this paper.

2.1. Notations

● $\mathbb H$ : The Hilbert space with norm $\|\cdot\|_{\mathbb H}$ .

● $\mathbb{R}^{n}$ : The n-dimensional Euclidean space.

● $\mathbb{R}^{n\times m}$ : The space of all $(n\times m)$ matrices.

● $\langle \alpha, \beta \rangle$ : The inner product in $\mathbb{R}^{n}, \ \forall \alpha, \ \beta \in \mathbb{R}^{n}$ .

● $|\alpha| = \sqrt{\langle \alpha, \alpha \rangle}$ : The norm of $\alpha, \ \forall \alpha \in \mathbb{R}^{n}$ .

● $M^\top$ : The transpose of matrix $M$ .

● $M^{\dagger}$ : The Moore-Penrose pseudoinverse of a matrix $M$ .

● $\langle M, N \rangle = tr(M^\top N)$ : The inner product in $\mathbb{R}^{n\times m}, \ \forall M, \ N \in \mathbb{R}^{n\times m}$ .

● $|M| = \sqrt{tr(M^\top M)}$ : The norm of $M, \ \forall M \in \mathbb{R}^{n\times m}$ .

● $\mathscr R(M)$ : The range of a matrix or an operator $M$ .

● $\mathbb{S}^{n}\in \mathbb{R}^{n\times n}$ : The set of all $(n\times n)$ symmetric matrices.

● $\mathbb{S}_{+}^{n}\in \mathbb{S}^{n}$ : The set of all $(n\times n)$ non-negative definite symmetric matrices.

● $\mathbb{F} = \{\mathcal {F}_{t}\}_{ t \geq 0}$ : The natural filtration. In other words, the flow of information generated by all market noises.

● $(\Omega, \mathcal {F}, P)$ : A complete probability space.

● $(\Omega, \mathcal {F}, \mathbb{F}, P)$ : The complete filtered probability space.

● $C([0, \infty); \mathbb{R})$ : The space of all continuous functions $\varphi: [0, \infty) \rightarrow \mathbb{R}$ .

● $L_{\mathbb{F}}^2(\mathbb H)$ : The space of all $\mathbb H$ -valued and $\mathbb F$ -progressively measurable processes $g(\cdot)$ satisfying $g: [0, \infty)\times \Omega \rightarrow \mathbb H$ and

$\mathbb E \int_0^{\infty}\|g(t)\|_{\mathbb H}^2dt < \infty.$

● $\mathcal X_{loc}[0, \infty)$ : The space of all $\mathcal {F}_{t}$ -adapted and càdlàg processes $g(\cdot)$ satisfying $g: [0, \infty)\times \Omega \rightarrow \mathbb H$ and

$\mathbb E \bigg( \sup \limits_{0\leq t\leq T}\|g(s)\|_{\mathbb H}^2 \bigg) < \infty$

for every $T > 0.$

● $\mathcal X[0, \infty)$ : The space of all $\mathcal {F}_{t}$ -adapted and càdlàg processes $g(\cdot)$ satisfying $g\in\mathcal X_{loc}[0, \infty)$ and

$\mathbb E\int_0^{\infty}\|g(t)\|_{\mathbb H}^2dt < \infty.$

● $L^{\nu, 2}(Z; \mathbb H)$ : The space of all $\mathbb H$ -valued measurable function $r = \{r(\theta), \ \theta \in Z\}$ defined on the measurable space $(Z, \mathcal {B}(E); \nu)$ satisfying

$\int_z\|r(\theta)\|_{\mathbb H}^2 \nu(d\theta) < \infty.$

● $L^{\nu, 2}_{\mathbb{F}}([0, \infty)\times Z; \mathbb H)$ : The space of all $L^{\nu, 2}(\mathbb H)$ -valued and $\mathcal {F}_{t}$ - predictable processes $r = \{r(t, \omega, \theta), \ (t, \omega, \theta) \in [0, \infty)\times\Omega\times Z\}$ satisfying

$\mathbb E \int_0^{\infty}\|r(s, \cdot)\|_{\mathbb H}^{2} ds < \infty.$

2.2. Pseudoinverse

We recall some properties of the reference ^[19].

Lemma 2.1. For any M $\in\mathbb R^{m\times n}$ , there exists a unique matrix $M^{\dagger}\in\mathbb R^{n\times m}$ such that

$MM^{\dagger}M = M, \; \; \; \; (MM^{\dagger})^\top = MM^{\dagger}, \; \; \; \; M^{\dagger}MM^{\dagger} = M^{\dagger}, \; \; \; \; (M^{\dagger}M)^\top = M^{\dagger}M.$

In addition, if $M\in\mathbb S^n$ , then $M^{\dagger}\in\mathbb S^n, \; MM^{\dagger} = M^{\dagger}M$ , and $M\geq0$ if and only if $M^{\dagger}\geq0$ .

Lemma 2.2. Let L $\in\mathbb R^{n\times k}$ and $N\in\mathbb R^{n\times m}$ . The matrix equation NX = L has a solution if and only if

$\begin{equation} \mathscr R(L)\subseteq\mathscr R(N), \end{equation}$

(2.1)

in which case the general solution is given by

$\begin{equation} X = N^{\dagger}L+(I-N^{\dagger}N)Y, \end{equation}$

(2.2)

while $Y\in\mathbb R^{m\times k}$ is arbitrary.

The matrix $M^{\dagger}$ above is called the $Moore$ - $Penrose\ pseudoinverse$ of $M$ .

Remark 2.1. (ⅰ) Clearly, condition (2.1) is equivalent to $NN^{\dagger}L = L$ .

(ⅱ) By Lemma 2.2, if $N\in\mathbb S^n$ , and $NX = L$ , then $X^\top NX = L^\top N^{\dagger}L$ .

Lemma 2.3. (Extended Schur's lemma) Let $L\in\mathbb R^{n\times m}, \ M\in\mathbb S^n$ , and $N\in\mathbb S^m$ . The following conditions are equivalent:

(i) $M-LN^{\dagger}L^\top\geq0, \ N\geq0$ , and $\mathscr R(L^\top)\subseteq\mathscr R(N);$

(ii) $\left(\begin{array}{cc}M&L\\L^\top&N\end{array}\right)\geq0.$

3. Formulation of problem

Now we consider a controlled linear stochastic system on the infinite horizon $[0, \infty)$ , the state equation is as follows:

$\begin{equation} \left\{\begin{array}{ll} dX(t) = [AX(t)+Bu(t)+b(t)]dt+[CX(t)+Du(t)+\sigma(t)]dW(t) \\\; \; \; \; \; \; \; \; \; \; \; \; + \int_{Z}[E(\theta)X(t-) +F(\theta)u(t)+h(t, \theta)]\tilde{\mu}(dt, d\theta), \; \; \; \; t \in [0, \infty), \\X(0) = x \in \mathbb R^n, \end{array} \right. \end{equation}$

(3.1)

and the quadratic cost functional is given by:

$\begin{eqnarray} \begin{split} J(x;u) \triangleq &\mathbb{E} \int_0^{\infty}\bigg[\langle QX(t), X(t)\rangle +2\langle SX(t), u(t)\rangle+\langle Ru(t), u(t) \rangle\\&\; \; \; \; \; \; \; +2\langle q(t), X(t) \rangle+2\langle \rho(t), u(t) \rangle \bigg]dt\\ = &\mathbb{E} \int_0^{\infty}\bigg[ \bigg{\langle}\left(\begin{array}{cc} Q & S^\top \\ S & R \end{array} \right) \left( \begin{array}{c} X(t)\\u(t)\end{array}\right), \left( \begin{array}{c} X(t)\\u(t)\end{array}\right)\bigg{\rangle} +2\bigg{\langle}\left(\begin{array}{c} q(t) \\ \rho(t)\end{array}\right), \left( \begin{array}{c} X(t)\\u(t)\end{array}\right)\bigg{\rangle}\bigg]dt, \end{split} \end{eqnarray}$

(3.2)

where $A, C\in\mathbb R^{n\times n}; \; E\in L^{\nu, 2}(Z; \mathbb R^{n\times n}); \; B, D, \in\mathbb R^{n\times m}; \; F\in L^{\nu, 2}(Z; \mathbb R^{n\times m}); \; Q\in\mathbb S^n; \; S\in\mathbb R^{m\times n}; \; R\in\mathbb S^m$ are given constant matrices, and $b(\cdot), \sigma(\cdot), q(\cdot)\in L^2_{\mathbb F}({\mathbb R}^n); h(\cdot, \cdot)\in L^{\nu, 2}_{\mathbb F}([0, \infty)\times Z; \mathbb R^n); \rho(\cdot)\in L^2_{\mathbb F}({\mathbb R}^m)$ are given vector-valued $\mathcal F_t$ -measurable processes. In the above, $u(\cdot)\in L^2_{\mathbb F}({\mathbb R}^m)$ is called the $control\ process, \ x$ , which belongs to $\mathbb R^n$ is called the $initial\ state$ , and the $X(\cdot)\equiv X(\cdot; x, u(\cdot))\in\mathbb R^n$ , which is the solution of SDE (3.1) is called the $state\ process$ corresponding to the initial state $x$ and the control $u(\cdot)$ .

Different from the finite case, the solution $X(\cdot)\equiv X(\cdot; x, u(\cdot))$ of (3.1) might not always be square-integrable for $(x, u(\cdot))\in \mathbb R^n \times L^2_{\mathbb F}({\mathbb R}^m)$ over an infinite time horizon. To make sure the cost functional $J(x; u)$ is well defined, we introduce:

$\mathscr U \triangleq \left\{u\in L^2_{\mathbb F}({\mathbb R}^m)|\; \mathbb{E} \int_0^{\infty}{\lvert X(t;x, u)\rvert}^2dt < \infty \right\}, \; \; \; \; \; x \in \mathbb R^n .$

The element $u\in\mathscr U$ is called an $admissible \ control$ associated with $x$ , and the linear quadratic optimal control problem over an infinite time horizon now can be stated as follows.

Problem 3.1. For any given initial state $x\in \mathbb R^n$ , find an admissible control $u^*\in \mathscr U$ such that

$\begin{equation} J(x;{u}^*) = \inf\limits_{u\in \mathscr U}J(x;u) = V(x). \end{equation}$

(3.3)

$u^*\in\mathscr U$ is called an open-loop optimal control of Problem (SLQ) $_{\infty}$ for the initial state $x$ if it satisfies (3.3), and the corresponding state process $X^*(\cdot)\equiv X(\cdot; x, u^*)$ is called an $optimal \ state \ process.$ The function $V(\cdot)$ is called the $value \ function$ of Problem (SLQ) $_{\infty}$ . A special case when $b, \sigma, h, q, \rho = 0$ , we use (SLQ) $_{\infty}^0$ , $J^0(x; u)$ , and $V^0(x)$ to denote the Problem, the cost functional, and the value function corresponding to Problem 3.1.

The following assumptions on the coefficients will be in force throughout this paper.

Assumption 3.1. The coefficients of the state equation satisfy the following: $A, C\in\mathbb R^{n\times n}; \; B, D\in\mathbb R^{n\times m}$ are given constant matrices; $\; E(\cdot)\in L^{\nu, 2}(Z; \mathbb R^{n\times n}); \; \; F(\cdot)\in L^{\nu, 2}(Z; \mathbb R^{n\times m})$ are given deterministic matrix-valued function, $b(\cdot), \sigma(\cdot)\in L^2_{\mathbb F}({\mathbb R}^n); h(\cdot, \cdot)\in L^{\nu, 2}_{\mathbb F}([0, \infty)\times Z; \mathbb R^n)$ are given vector-valued $\mathcal F_t$ -measurable processes.

Assumption 3.2. The coefficients of the cost functional satisfy the following: $Q\in\mathbb S^n; \; S\in\mathbb R^{m\times n}; \; R\in\mathbb S^m$ are given constant matrices, $q(\cdot)\in L^2_{\mathbb F}({\mathbb R}^n), \rho(\cdot)\in L^2_{\mathbb F}({\mathbb R}^m)$ are given vector-valued $\mathcal F_t$ -measurable processes.

Assumption 3.3. For all $\theta\in Z$ , there exists a constant $\lambda > 0$ such that

$I+E(\theta)\geq\lambda I.$

As we mentioned before, admissible controls might not exist. To solve this question, the concept of $stability$ is introduced as follows. Let us consider the following uncontrolled linear system:

$dX(t) = AX(t)dt+CX(t)dW(t)+\int_Z E(\theta)X(t-)\widetilde{\mu}(dt, d\theta), \; \; t\geq 0,$

and we use [ $A, C, E$ ] to denote this system.

Definition 3.1. System [ $A, C, E$ ] is said to be $L^2$ -stable if for any initial state $x\in \mathbb R^n$ , its solution $X(\cdot; x)$ satisfies

$\mathbb{E} \int_0^{\infty}| X(t;x)|^2dt < \infty, \; \; \forall x\in \mathbb R^n,$

i.e., $X(\cdot; x)\in\mathcal{X}[0, \infty).$

Consider the following BSDE over an infinite horizon [0, $\infty$ ):

$\begin{equation} dY(t) = -[A^\top Y(t)+C^\top Z(t)+\int_ZE(\theta)^\top r(t, \theta)\nu(d\theta)+\varphi(t)]dt+Z(t)dW(t)\\ +\int_Zr(t, \theta)\tilde{\mu}(dt, d\theta), \end{equation}$

(3.4)

where Assumption 3.1 holds, and { $\varphi(t); 0\leq t < \infty$ } is a given $\mathbb F$ -progressively measurable, $\mathbb R^n$ -valued process. We call the solution ( $Y(\cdot), Z(\cdot), r(\cdot, \cdot)$ ) the adjoint processes corresponding to $X(\cdot)$ .

Definition 3.2. If $(Y(\cdot), Z(\cdot), r(\cdot, \cdot))\in\mathcal{X}[0, \infty)\times L^{2}_{\mathbb{F}}(\mathbb R^n)\times L^{\nu, 2}_{\mathbb{F}}([0, \infty)\times Z; \mathbb R^n)$ satisfies the integral version of (3.4):

$\begin{eqnarray} \begin{split} Y(t)& = Y(0)-\int_0^t[A^\top Y(s)+C^\top Z(s)+\int_ZE(\theta)^\top r(s, \theta)\nu(d\theta)+\varphi(s)]ds\\ &\; \; \; +\int_0^tZ(s)dW(s)+\int_0^t\int_Zr(s, \theta)\tilde{\mu}(ds, d\theta), \; \; \; t\geq0, \; \; \; a.s, \end{split} \end{eqnarray}$

(3.5)

then we call it an $L^2$ -stable adapted solution of (3.4).

Lemma 3.2. Let Assumptions 3.1–3.3 be satisfied. Suppose that [A, C, E] is $L^2$ -stable, then for any $\varphi\in L^2_{\mathbb F}(\mathbb R^n)$ , Eq (3.4) admits a unique $L^2$ -stable adapted solution (Y, Z, r).

The proof is similar to proof of Theorem A.2.2. of ^[19].

Lemma 3.3. Under Assumptions 3.1–3.3, the strong solution $X(\cdot)$ of the SDE (3.1) with $u(\cdot) = 0$ has the explicit expression:

$\begin{eqnarray} \begin{split} X(t)& = \Lambda(t)\Lambda(0)^{-1}x\\ &\; \; \; +\Lambda(t)\int_{0}^t\Lambda(s)^{-1}\left[ b(s)-C\sigma(s)-\int_ZE(\theta)[I+E(\theta)]^{-1}h(s, \theta)\nu(d\theta)\right] ds\\ &\; \; \; +\Lambda(t)\int_{0}^{t}\Lambda(s)^{-1}\sigma(s)dW(s)\\ &\; \; \; +\Lambda(t)\int_{0}^t\int_Z\Lambda(s-)^{-1}[I+E(\theta)]^{-1}h(s, \theta)\tilde{\mu}(ds, d\theta), \end{split} \end{eqnarray}$

(3.6)

where $\Lambda(\cdot)$ is the unique solution of the following matrix-valued SDE:

$\begin{equation} \left\{\begin{aligned} d\Lambda(t)& = A\Lambda(t)dt+C\Lambda(t)dW(t)+\int_ZE(\theta)\Lambda(t-)\tilde{\mu}(dt, d\theta), \; \; \; t\geq0, \\ \Lambda(0)& = I_n, \end{aligned} \right. \end{equation}$

(3.7)

and $\Lambda(t)^{-1}$ exists, satisfying

$\begin{equation} \left\{\begin{aligned} d\Lambda(t)^{-1}& = \Lambda(t)^{-1}\left[ -A+C^2+\int_ZE^2(\theta)[I+E(\theta)]^{-1}\nu(d\theta)\right] dt\\ &\; \; \; -\Lambda(t)^{-1}CdW(t)-\int_Z\Lambda(t-)^{-1}E(\theta)[I+E(\theta)]^{-1}\tilde{\mu}(dt, d\theta), \\ \Lambda(0)^{-1}& = I_n. \end{aligned} \right. \end{equation}$

(3.8)

The proof is similar to the proof of Theorem 6.14 of ^[26].

4. Admissible control sets and stabilizability

4.1. Stability

Regarding $L^2$ -stability of the system [ $A, C, E$ ] the following result holds:

Theorem 4.1. Let Assumptions 3.1–3.3 be satisfied. The system [A, C, E] is $L^2$ -stable if and only if there exists a $K \in \mathbb{S}_+^n$ such that

$\begin{equation} KA+A^\top K+C^\top KC+\int_ZE(\theta)^\top KE(\theta)\nu(d\theta) < 0. \end{equation}$

(4.1)

In this case, the Lyapunov equation

$KA+A^\top K+C^\top KC+\int_ZE(\theta)^\top KE(\theta)\nu(d\theta)+\Psi = 0$

admits a unique solution $K\in \mathbb S^n$ for any $\Psi\in \mathbb S^n$ , which is given by

$K = \mathbb E\bigg[\int_0^{\infty}\Lambda(t)^\top\Psi\Lambda(t)dt\bigg],$

where $\Lambda$ is the solution of (3.7).

Proof. Necessity. Consider the following linear ODE on [0, $\infty$ ) for any fixed $\Psi\in\mathbb S^n$ :

$\begin{equation} \left\{\begin{array}{ll} \dot{\Phi}(t) = \Phi(t) A+A^\top \Phi(t)+C^\top \Phi(t)C+\int_ZE(\theta)^\top \Phi(t)E(\theta)\nu(d\theta)+\Psi;\\ \Phi(0) = 0. \end{array} \right. \end{equation}$

(4.2)

It is clear that (4.2) is uniquely solvable on [0, $\infty$ ). And to the solution $\Phi(t)$ , we define a function

$\Phi_{\tau}(s) = \Phi(\tau-s), \; \; s\in[0, \tau],$

which is a solution of the following equation

$\begin{equation} \left\{\begin{array}{ll} \dot{\Phi}_{\tau}(s)+\Phi_{\tau}(s) A+A^\top \Phi_{\tau}(s)+C^\top \Phi_{\tau}(s)C+\int_ZE(\theta)^\top \Phi_{\tau}(s)E(\theta)\nu(d\theta)+\Psi = 0;\\ \Phi_{\tau}(\tau) = 0\nonumber \end{array} \right. \end{equation}$

on the interval [0, $\tau$ ] for any fixed $\tau > 0$ . Clearly, we have $X(s) = \Lambda(s)x$ , and $X(\cdot)$ is the solution of [ $A, C, E$ ] with initial state $x$ . Applying Itô's formula to $s\mapsto \langle \Phi_{\tau}(s)X(s), X(s)\rangle$ , we have

$\begin{eqnarray} \begin{split} -\langle \Phi_{\tau}(0)x, x\rangle = &\mathbb E[\langle \Phi_{\tau}(\tau)X(\tau), X(\tau)\rangle-\langle \Phi_{\tau}(0)x, x\rangle]\\ = &\mathbb E\int_0^{\tau}\big\langle(\dot{\Phi}_{\tau}+\Phi_{\tau} A+A^\top \Phi_{\tau}+C^\top \Phi_{\tau}C+\int_ZE(\theta)^\top \Phi_{\tau}E(\theta)\nu(d\theta))X, X\big\rangle(s)ds\\ = &-\mathbb E\int_0^{\tau}\langle \Psi X(s), X(s)\rangle ds\\ = &-x^\top\bigg[\mathbb E\int_0^{\tau}\Lambda(s)^\top \Psi\Lambda(s)ds\bigg]x.\nonumber \end{split} \end{eqnarray}$

Thus,

$\Phi(\tau) = \Phi_{\tau}(0) = \mathbb E\int_0^{\tau}\Lambda(s)^\top \Psi\Lambda(s)ds, \; \; \tau\geq 0.$

Since the system [ $A, C, E$ ] is $L^2$ -stable, it is easy to check that

$\lim\limits_{\tau\mapsto\infty} \Phi(\tau) = \mathbb E\int_0^{\infty}\Lambda(s)^\top \Psi\Lambda(s)ds\equiv K.$

Because $\Phi(t)$ is the solution to (4.2), we have for any $t > 0$ ,

$\begin{eqnarray} \begin{split} \Phi(t+1)-\Phi(t) = &\bigg(\int_t^{t+1}\Phi(s)ds \bigg)A+A^\top\bigg(\int_t^{t+1}\Phi(s)ds\bigg)+C^\top\bigg(\int_t^{t+1}\Phi(s)ds\bigg)C\\+&\int_ZE(\theta)^\top\bigg(\int_t^{t+1}\Phi(s)ds\bigg)E(\theta)\nu(d\theta)+\Psi.\nonumber \end{split} \end{eqnarray}$

Letting $t\rightarrow \infty$ , we have

$KA+A^\top K+C^\top KC+\int_ZE(\theta)^\top KE(\theta)\nu(d\theta)+\Psi = 0.$

What's more, (4.1) holds when we take $\Psi = I_n$ , and the corresponding $K\in\mathbb S_+^{n}$ .

Sufficiency. Now we suppose that $X(\cdot) \equiv X(\cdot; x)$ is a solution of [ $A, C, E$ ] with initial state $x$ , and suppose $K\in\mathbb S_+^n$ satisfies (4.1). Applying Itô's formula to $s\mapsto \langle KX(s), X(s)\rangle$ , we have for any $t > 0$ ,

$\begin{array}{l} \mathbb E\langle KX(t), X(t)\rangle-\langle Kx, x\rangle =\\ \mathbb E\int_0^t\bigg\langle\left[ KA+A^\top K+C^\top KC+\int_ZE(\theta)^\top KE(\theta)\nu(d\theta)\right] X(s), X(s)\bigg\rangle ds. \end{array}$

Let $\lambda > 0$ be the smallest eigenvalue of $-\big(KA+A^\top K+C^\top KC+\int_ZE(\theta)^\top KE(\theta)\nu(d\theta)\big).$ We get that

$\begin{eqnarray} \begin{split} \lambda \mathbb E\int_0^t|X(s)|^2 ds\leq & -\mathbb E\int_0^t\bigg\langle\left[ KA+A^\top K+C^\top KC+\int_ZE(\theta)^\top KE(\theta)\nu(d\theta)\right] X(s).X(s)\bigg\rangle ds\\ = &\langle Kx, x\rangle-\mathbb E\big\langle KX(t), X(t)\big\rangle\\ \leq&\langle Kx, x\rangle, \; \; \; \; \forall t > 0, \nonumber \end{split} \end{eqnarray}$

which implies the $L^2$ -stability of [ $A, C, E$ ].

With regard to the nonhomogeneous system:

$\begin{equation} dX(t) = \big[AX(t)+\varphi(t)\big]dt+\big[CX(t)+\rho(t)\big]dW(t)+\\ \int_Z\bigg[E(\theta)X(t-)+\omega(t, \theta)\bigg]\widetilde{\mu}(dt, d\theta), \; \; t\geq0, \end{equation}$

(4.3)

we have the following result.

Proposition 4.2. Let Assumptions 3.1–3.3 hold. If [ $A, C, E$ ] is $L^2$ -stable, then the solution $X(\cdot)\equiv X(\cdot; x, \varphi, \rho, \omega)$ of (4.3) is in $\mathcal X[0, \infty)$ for any $\varphi, \rho\in L^2_{\mathbb F}(\mathbb R^n), \omega\in L^{\nu, 2}_{\mathbb F}([0, \infty)\times Z; \mathbb R^n)$ , and any initial state $x\in\mathbb R^n$ . Moreover, there exists a positive constant $M$ , which is independent of $x, \varphi, \rho\ and \ \omega$ , such that

$\mathbb E\int_0^{\infty}|X(t)|^2 dt\leq M \bigg\{|x|^2+\mathbb E\int_0^{\infty}\bigg[|\varphi(t)|^2+|\rho(t)|^2+ \int_Z|\omega(t, \theta)|^2\nu(d\theta) \bigg]dt \bigg\}.$

Proof. Since [ $A, C, E$ ] is $L^2$ -stable, there exists a $K\in \mathbb{S}_+^n$ such that

$KA+A^\top K+C^\top KC+\int_ZE(\theta)^\top KE(\theta)\nu(d\theta)+I_n = 0$

because of Theorem 4.1. Applying Itô's formula to $s\mapsto\langle KX(s), X(s)\rangle$ , we have

$\begin{eqnarray} \begin{split} \mathbb E\langle K&X(t), X(t)\rangle-\langle Kx, x\rangle\\ = &\mathbb E\int_0^t\bigg[\bigg\langle\bigg(KA+A^\top K+C^\top KC+\int_ZE(\theta)^\top KE(\theta)\nu(d\theta)\bigg)X(s), X(s)\bigg\rangle\\ &\; \; \; \; \; \; \; \; +2\bigg\langle K\varphi(s)+C^\top K\rho(s)+\int_ZE(\theta)^\top K\omega(s, \theta)\nu(d\theta), X(s)\bigg\rangle\\ &\; \; \; \; \; \; \; \; +\bigg\langle K\rho(s), \rho(s)\bigg\rangle+\int_Z\bigg\langle K\omega(s, \theta), \omega(s, \theta)\bigg\rangle\nu(d\theta)\bigg]ds\\ = &\mathbb E\int_0^t\bigg[-|X(s)|^2+2\bigg\langle K\varphi(s)+C^\top K\rho(s)+\int_ZE(\theta)^\top K\omega(s, \theta)\nu(d\theta), X(s)\bigg\rangle\\ &\; \; \; \; \; \; \; \; +\bigg\langle K\rho(s), \rho(s)\bigg\rangle+\int_Z\bigg\langle K\omega(s, \theta), \omega(s, \theta)\bigg\rangle\nu(d\theta)\bigg]ds\nonumber \end{split} \end{eqnarray}$

for all $t > 0$ . We define

$\begin{eqnarray} \begin{split} \chi(s) = &K\varphi(s)+C^\top K\rho(s)+\int_ZE(\theta)^\top K\omega(s, \theta)\nu(d\theta), \\ \gamma(s) = &\bigg\langle K\rho(s), \rho(s)\bigg\rangle+\int_Z\bigg\langle K\omega(s, \theta), \omega(s, \theta)\bigg\rangle\nu(d\theta);\; \; \; s > 0.\nonumber \end{split} \end{eqnarray}$

According to Cauchy-Schwarz inequality, we obtain

$\begin{eqnarray} \begin{split} \lambda\mathbb E|X(t)|^2\leq&\mathbb E\langle KX(t), X(t)\rangle\\ \leq&\langle Kx, x\rangle+\mathbb E\int_0^t\bigg[-|X(s)|^2+2\langle\chi(s), X(s)\rangle+\gamma(s)\bigg]ds\\ \leq&\langle Kx, x\rangle+\mathbb E\int_0^t\bigg[-\frac{1}{2}|X(s)|^2+2|\chi(s)|^2+\gamma(s)\bigg]ds\\ = &\langle Kx, x\rangle+\int_0^t\bigg[-\frac{1}{2}\mathbb E|X(s)|^2+2\mathbb E|\chi(s)|^2+\mathbb E\gamma(s)\bigg]ds, \nonumber \end{split} \end{eqnarray}$

where the $\lambda > 0$ is the smallest eigenvalue of $K$ . Finally, by Gronwall's inequality, we get that

$\lambda\mathbb E|X(t)|^2\leq\langle Kx, x\rangle e^{-(2\lambda)^{-1}t}+\int_0^t e^{-(2\lambda)^{-1}(t-s)}\bigg[ 2\mathbb E|\chi(s)|^2+\mathbb E\gamma(s)\bigg]ds.$

In summary, the conclusion of the integrability of $\mathbb E|X(t)|^2$ over $[0, \infty)$ can be obtained together with Young's inequality.

4.2. Stabilizability

According to Proposition 4.2, the admissible control set $\mathscr U$ is nonempty (actually it equals to $L_{\mathbb F}^2(\mathbb R^n)$ ) for all $x\in\mathbb R^n$ if the system [ $A, C, E$ ] is $L^2$ -stable. Now we introduce the concept of $stabilizability$ to characterize the admissible control set. Denote by [ $A, C, E; B, D, F$ ] the following controlled linear system:

$\begin{array}{l} dX(t) = [AX(t)+Bu(t)]dt+[CX(t)+Du(t)]dW(t)+\int_Z[E(\theta)X(t-)+\\ F(\theta)u(t)]\tilde{\mu}(dt, d\theta), \; \; \; t\geq 0. \end{array}$

Definition 4.1. [ $A, C, E; B, D, F$ ] is said to be $L^2$ -stabilizability if there exists a matrix $\Phi\in\mathbb R^{m\times n}$ such that [ $A+B\Phi, C+D\Phi, E+F\Phi$ ] is $L^2$ -stable, and we call $\Phi$ a $stabilizer$ of [ $A, C, E; B, D, F$ ]. $\mathscr{T}\equiv\mathscr{T}$ [ $A, C, E; B, D, F$ ] is the set of all stabilizers of [ $A, C, E; B, D, F$ ].

Proposition 4.3. Let the Assumptions 3.1–3.3 be satisfied. Suppose that $\Phi\in\mathscr T$ [ $A, C, E; B, D, F$ ]. Then for any $x\in\mathbb R^n$ ,

$\mathscr U = \big\{\Phi X_{\Phi}(\cdot;x, v)+v:v\in L_{\mathbb F}^2(\mathbb R^m)\big\},$

where $X_{\Phi}(\cdot; x, v)$ is the solution to the SDE

$\begin{equation} \left\{\begin{array}{ll} dX_{\Phi}(t) = [(A+B\Phi)X_{\Phi}(t)+Bv(t)+b(t)]dt \\\; \; \; \; \; \; \; \; \; \; \; \; +[(C+D\Phi)X_{\Phi}(t)+Dv(t)+\sigma(t)]dW(t) \\\; \; \; \; \; \; \; \; \; \; \; \; + \int_{Z}\left[ \bigg(E(\theta)+F(\theta)\Phi\bigg)X_{\Phi}(t)+F (\theta)v(t)+h(t, \theta)\right] \tilde{\mu}(dt, d\theta), \; \; \; t \in [0, \infty), \\X_{\Phi}(0) = x \in \mathbb R^n. \end{array} \right. \end{equation}$

(4.4)

Proof. Since $\Phi\in\mathscr T$ [ $A, C, E; B, D, F$ ], the system [ $A+B\Phi, C+D\Phi, E+F\Phi$ ] is $L^2$ -stable. Let $X_{\Phi}(\cdot)\equiv X_{\Phi}(\cdot; x, v)$ be the corresponding solution to (4.4) and let $v\in L_{\mathbb F}^2(\mathbb R^m)$ . By Proposition 4.2, $X_{\Phi}\in\mathcal X[0, \infty).$ We set

$u = \Phi X_{\Phi}+v \in L_{\mathbb F}^2(\mathbb R^m),$

and let $X\in\mathcal X_{loc}[0, \infty)$ be the solution of

$\begin{equation} \left\{\begin{array}{ll} dX(t) = [AX(t)+Bu(t)+b(t)]dt+[CX(t)+Du(t)+\sigma(t)]dW(t) \\\; \; \; \; \; \; \; \; \; \; \; \; + \int_{Z}[E(\theta)X(t-) +F(\theta)u(t)+h(t, \theta)]\tilde{\mu}(dt, d\theta), \; \; \; t \in [0, \infty), \\X(0) = x \in \mathbb R^n. \end{array} \right. \end{equation}$

(4.5)

Then we have $X = X_{\Phi}\in\mathcal X[0, \infty)$ . For the proof the reader is referred to Theorem 1.19 in ^[31]. And hence $u\in\mathscr U.$

On the flip side, let $X\in\mathcal X[0, \infty)$ be the corresponding solution to (4.5) and suppose that $u\in\mathscr U$ . Then we denote the control $v$ by

$v\triangleq u-\Phi X\in L_{\mathbb F}^2(\mathbb R^m).$

Again by the uniqueness of solutions, the solution $X_{\Phi}$ of (4.4) coincides with $X$ , and we get that $u$ admits a representation of the form $\Phi X_{\Phi}(\cdot; x, v)+v$ .

The above result shows that the $L^2$ -stabilizability is sufficient for the existence of an admissible control and gives an explicit description of the admissible control set. The following we will show a further result that the $L^2$ -stabilizability is not only sufficient, but also necessary, for the non-emptiness of $\mathscr U$ for all $x\in\mathbb R^n$ . We define

$\begin{equation} \left\{\begin{array}{ll} \mathcal L(K) = KA+A^\top K+C^\top KC+\int_ZE(\theta)^\top KE(\theta)\nu(d\theta)+Q, \\ \mathcal M(K) = B^\top K+D^\top KC+\int_ZF(\theta)^\top KE(\theta)\nu(d\theta)+S, \\ \mathcal N(K) = R+D^\top KD+\int_ZF(\theta)^\top KF(\theta)\nu(d\theta), \end{array} \right. \end{equation}$

(4.6)

for $K\in\mathbb S^n$ . We first present the following lemma.

Lemma 4.4. Let Assumptions 3.1–3.3 hold. Suppose that for each $T > 0$ , the following differential Riccati equation

$\begin{equation} \left\{\begin{array}{ll} \dot{K}_T(s)+\mathcal L(K_T(s))-\mathcal M(K_T(s))^\top\mathcal N(K_T(s))^{-1}\mathcal M(K_T(s)) = 0, \\ K_T(T) = G \end{array} \right. \end{equation}$

(4.7)

admits a solution $K_T\in C([0, T]; \mathbb S^n)$ , which satisfies

$\mathcal N(K_T(s)) > 0, \; \; \; \forall s\in[0, T].$

Then the following algebraic Riccati equation:

$\begin{equation} \mathcal L(K)-\mathcal M(K)^\top\mathcal N(K)^{-1}\mathcal M(K) = 0 \end{equation}$

(4.8)

has a solution $K$ if $K_T(0)$ converges to $K$ as $T\rightarrow \infty$ and $\mathcal N(K)$ is invertible.

Proof. We denote

$\begin{equation} \left\{\begin{array}{ll} K_1(s) = K_{T_1}(T_1-s), \; \; \; 0\leq s\leq T_1, \\ K_2(s) = K_{T_2}(T_2-s), \; \; \; 0\leq s\leq T_2\nonumber \end{array} \right. \end{equation}$

for any fixed $T_1$ and $T_2$ , which satisfy $0 < T_1 < T_2 < \infty$ . Then we define

$\Phi_i(s) = \mathcal N(K_i(s))^{-1}\mathcal M(K_i(s)), \; \; \; \; i = 1, 2.$

It is easy to conclude that both $K_1$ and $K_2$ are solutions of the following equatiion:

$\begin{equation} \left\{\begin{array}{ll} \dot{\Sigma}(s)-\mathcal L(\Sigma(s))+\mathcal M(\Sigma(s))^\top\mathcal N(\Sigma(s))^{-1}\mathcal M(\Sigma(s)) = 0, \\\Sigma(0) = G \end{array} \right. \end{equation}$

(4.9)

on the interval [0, $T_1$ ]. Thus, for any $s\in[0, T_1]$ , it follows that

$K_1(s) = K_2(s)$

because of the uniqueness of solutions to ODEs, and the difference $\Delta = K_1-K_2$ satisfies $\Delta(0) = 0$ . To interpret it, we get that

$\begin{eqnarray} \begin{split} \dot{\Delta} = &\Delta A+A^\top\Delta+C^\top\Delta C+\int_ZE(\theta)^\top\Delta E(\theta)\nu(d\theta)\\ &\; \; \; \; \; -\big[\Delta B+C^\top\Delta D+\int_ZE(\theta)^\top\Delta F(\theta)\nu(d\theta)\big]\Phi_1\\ &\; \; \; \; \; +\Phi_2^\top\big[D^\top\Delta D+\int_ZF(\theta)^\top\Delta F(\theta)\nu(d\theta)\big]\Phi_1\\ &\; \; \; \; \; -\Phi_2^\top\big[B^\top\Delta+D^\top\Delta C+\int_ZF(\theta)^\top\Delta E(\theta)\nu(d\theta)\big], \; \; \; \; \; \; s\in[0, T_1].\nonumber \end{split} \end{eqnarray}$

By assumption, $\Phi_1$ and $\Phi_2$ are continuous and hence bounded. Thus, for some positive constant $M$ which is independent of $\Delta$ , we have

$\begin{eqnarray} \begin{split} |\Delta(t)|\leq&\int_0^t\bigg|\Delta A+A^\top\Delta+C^\top\Delta C+\int_ZE(\theta)^\top\Delta E(\theta)\nu(d\theta)\\ &\; \; \; \; \; -\big[\Delta B+C^\top\Delta D+\int_ZE(\theta)^\top\Delta F(\theta)\nu(d\theta)\big]\Phi_1\\ &\; \; \; \; \; +\Phi_2^\top\big[D^\top\Delta D+\int_ZF(\theta)^\top\Delta F(\theta)\nu(d\theta)\big]\Phi_1\\ &\; \; \; \; \; -\Phi_2^\top\big[B^\top\Delta+D^\top\Delta C+\int_ZF(\theta)^\top\Delta E(\theta)\nu(d\theta)\big]\bigg|ds\\ \leq& M\int_0^t|\Delta(s)|ds, \; \; \; \forall t\in[0, T_1].\nonumber \end{split} \end{eqnarray}$

Then we get $\Delta(s) = 0$ for all $s\in[0, T_1]$ by Gronwall's inequality. Thus, the function $\Sigma:[0, \infty)\rightarrow \mathbb S^n$ can be defined by

$\Sigma(s) = K_T(T-s), \; \; \; \; \; \; \; 0\leq {s}\leq {T}.$

Assume $\mathcal N(K)$ is invertible, since we define

$\begin{eqnarray} \begin{split} \Theta(s)&\triangleq \mathcal L(\Sigma(s))-\mathcal M(\Sigma(s))^\top\mathcal N(\Sigma(s))^{-1}\mathcal M(\Sigma(s)), \\ \Theta_{\infty}&\triangleq\mathcal L(K)-\mathcal M(K)^\top\mathcal N(K)^{-1}\mathcal M(K), \nonumber \end{split} \end{eqnarray}$

it is easy to get that $\lim_{s\rightarrow \infty}\Theta(s) = \Theta_{\infty}$ if $\Sigma(T) = K_T(0)$ converges to $K$ as $T\rightarrow \infty$ . On the other side, we get that

$\Sigma(T+1)-\Sigma(T) = \int_T^{T+1}\Theta(t)dt, \; \; \; \; \; \forall T > 0$

on the whole interval [0, $\infty$ ) since $\Sigma$ satisfies (4.9). Thus, we have

$\begin{eqnarray} \begin{split} |\Theta_{\infty}| = &\left| \int_T^{T+1} \Theta_{\infty} dt\right| \\ = &\left| \int_T^{T+1}\left[ \Theta(t)+\Theta_{\infty}-\Theta(t)\right] dt\right|\\ \leq&\bigg|\int_T^{T+1}\Theta(t)dt\bigg|+\bigg|\int_T^{T+1}\big[\Theta_{\infty}-\Theta(t)\big]dt\bigg|\\ \leq&|\Sigma(T+1)-\Sigma(T)|+\int_T^{T+1}|\Theta_{\infty}-\Theta(t)|dt.\nonumber \end{split} \end{eqnarray}$

Letting $T\rightarrow \infty$ , the desired result can be proved.

Theorem 4.5. Under Assumptions 3.1–3.3, the following statements are equivalent:

(i) $\mathscr U\neq\varnothing$ for all $x\in\mathbb R^n$ ;

(ii) $\mathscr T$ [ $A, C, E; B, D, F$ ] $\neq\varnothing$ ;

(iii)The following algebraic Riccati equation (ARE, for short) admits a positive solution $K\in\mathbb S_+^n$ :

$\begin{eqnarray} \begin{split} KA+&A^\top K+C^\top KC+\int_ZE(\theta)^\top KE(\theta)\nu(d\theta)+I\\ -&\big[KB+C^\top KD+\int_ZE(\theta)^\top KF(\theta)\nu(d\theta)\big]\big[I+D^\top KD\\ +&\int_ZF(\theta)^\top KF(\theta)\nu(d\theta)\big]^{-1}\big[B^\top K+D^\top KC+\int_ZF(\theta)^\top KE(\theta)\nu(d\theta)\big] = 0. \end{split} \end{eqnarray}$

(4.10)

We define

$\begin{eqnarray} \begin{split} \Pi\triangleq&-\big[I+D^\top KD+\int_ZF(\theta)^\top KF(\theta)\nu(d\theta)\big]^{-1}\\ &\big[B^\top K+D^\top KC+\int_ZF(\theta)^\top KE(\theta)\nu(d\theta)\big]. \end{split} \end{eqnarray}$

(4.11)

If the above are satisfied and $K$ is a positive solution of (4.10), then we have $\Pi\in\mathscr T[A, C, E; B, D, F]$ .

Proof. The implication (ⅱ) $\Rightarrow$ (ⅰ) has been proved in Proposition 4.3. Then we show the implication (ⅲ) $\Rightarrow$ (ⅱ). We assume $K\in\mathbb S_+^n$ is a solution of (4.10) and $\Pi$ has the expression (4.11), then we have

$\begin{eqnarray} \begin{split} K(A+B\Pi)+&(A+B\Pi)^\top K+(C+D\Pi)^\top K(C+D\Pi)\\ +&\int_Z[E(\theta)+F(\theta)\Pi]^\top K[E(\theta)+F(\theta)\Pi]\nu(d\theta) = -I-\Pi^\top\Pi < 0. \nonumber \end{split} \end{eqnarray}$

Thus, $\Pi$ is a stabilizer of [ $A, C, E; B, D, F$ ] according to Theorem 4.1 and Definition 4.1.

We next show that (ⅰ) $\Rightarrow$ (ⅲ). Yet the general, we may assume $b = \sigma = 0$ . Now take a series of $u_i\in\mathscr U(e_i), \; i = 1, ..., n$ , and define $U = (u_1, ..., u_n)$ while $e_1, ..., e_n$ is the standard basis for $\mathbb R^n$ . Then, we have $Ux\in\mathscr U$ for all $x\in\mathbb R^n$ because of the linearity of the state equation. Consider the following cost functional

$\hat{J}(x;u) = \mathbb E\int_0^{\infty}[|\mathbb X(t)x|^2+|U(t)x|^2]dt,$

while $\mathbb X\in L_{\mathbb F}^2(\mathbb R^{n\times n})$ solves the following matrix SDE:

$\begin{equation} \left\{\begin{array}{ll} d\mathbb X(t) = [A\mathbb X(t)+BU(t)]dt+[C\mathbb X(t)+DU(t)]dW(t)\\ \; \; \; \; \; \; \; \; \; \; +\int_Z[E(\theta)\mathbb X(t)+F(\theta)U(t)]\tilde{\mu}(dt, d\theta), \; \; \; \; \; \; \; \; \; \; t\in[0, \infty), \\ \; \mathbb X(0) = I_n.\nonumber \end{array} \right. \end{equation}$

We have for any $x\in\mathbb R^n$ ,

$\begin{eqnarray} \begin{split} \inf\limits_{u\in\mathscr U}\hat J(x;u)\leq&\mathbb E\int_0^{\infty}\bigg[|\mathbb X(t)x|^2+|U(t)x|^2\bigg]dt\\ = &\bigg\langle\bigg(\mathbb E\int_0^{\infty}\big[\mathbb X(t)^\top\mathbb X(t)+U(t)^\top U(t)\big]dt\bigg)x, x\bigg\rangle. \end{split} \end{eqnarray}$

(4.12)

Now for a fixed but arbitrary $T > 0$ , let us consider the optimal control problem in the finite time horizon [0, $T$ ] with state equation

$\begin{equation} \left\{\begin{array}{ll} dX_T(t) = [AX_T(t)+Bu(t)]dt+[CX_T(t)+Du(t)]dW(t)\\ \; \; \; \; \; \; \; \; \; \; +\int_Z\big[E(\theta)X_T(t)+F(\theta)u(t)\big]\tilde{\mu}(dt, d\theta), \; \; t\in[0, T], \\ \; X_T(0) = x, \nonumber \end{array} \right. \end{equation}$

and cost functional

$\hat J_T(x;u) = \mathbb E\int_0^T\big[|X_T(t)|^2+|u(t)|^2\big]dt.$

According to Proposition 4.2 of the reference ^[25], the differential Riccati equation:

$\begin{equation} \left\{\begin{array}{ll} \dot{K}_T(t)+K_T(t)A+A^\top K_T(t)+C^\top K_T(t)C+ \int_ZE(\theta)^\top K_T(t)E(\theta)\nu(d\theta)+I\\ \; \; \; \; \; \; \; -\big[K_T(t)B+C^\top K_T(t)D+ \int_ZE(\theta)^\top K_T(t)F(\theta)\nu(d\theta)\big]\big[I+D^\top K_T(t)D\\ \; \; \; \; \; \; \; + \int_ZF(\theta)^\top K_T(t)F(\theta)\nu(d\theta)\big]^{-1}\big[B^\top K_T(t)+D^\top K_T(t)C\\ \; \; \; \; \; \; \; + \int_ZF(\theta)^\top K_T(t)E(\theta)\nu(d\theta)\big] = 0, \; \; \; \; \; \; \; t\in[0, T], \\ K_T(T) = 0\nonumber \end{array} \right. \end{equation}$

admits a unique solution $K_T\in C([0, T]; \mathbb S_+^n)$ such that

$\langle K_T(0)x, x\rangle = \inf\limits_{u\in L^2_{\mathbb F}(0, T;\mathbb R^m)}\hat J_T(x;u), \; \; \; \; \forall x\in\mathbb R^n.$

Because of the restriction $u|_{[0, T]}$ of $u$ to [0, $T$ ] belongs to $\mathscr U[0, T]\equiv L^2_{\mathbb F}(0, T; \mathbb R^m)$ for any $u\in\mathscr U,$ we have

$\langle K_T(0)x, x\rangle\leq\hat{J}_T(x;u|_{[0, T]})\leq\hat{J}(x;u), \; \; \; \; \forall u\in\mathscr U.$

And along with (4.12), it implies that

$\begin{equation} \langle K_T(0)x, x\rangle\leq\langle\Psi x, x\rangle, \; \; \; \; \forall x\in\mathbb R^n, \end{equation}$

(4.13)

where $\Psi\triangleq\int_0^{\infty}\big[\mathbb X(t)^\top\mathbb X(t)+U(t)^\top U(t)\big]dt$ . On the flip side, the restriction $u|_{[0, T]}$ of $u\in L^2_{\mathbb F}(0, T; \mathbb R^m)$ also belongs to $L^2_{\mathbb F}(0, T'; \mathbb R^m)$ for any fixed $T' > T > 0$ . Thus, we have

$\langle K_T(0)x, x\rangle\leq\hat{J}_T(x;u|_{[0, T]})\leq\hat{J}_{T'}(x;u), \; \; \; \; \forall u\in L^2_{\mathbb F}(0, T';\mathbb R^m),$

which in turn gives

$\begin{equation} \langle K_T(0)x, x\rangle\leq \langle K_{T'}(0)x, x\rangle, \; \; \; \; \forall x\in\mathbb R^n. \end{equation}$

(4.14)

Combining (4.13) and (4.14), since $K_T\in C([0, T]; \mathbb S_+^n),$ it is easy to get that

$0 < K_T(0)\leq K_{T'}(0)\leq\Psi, \; \; \; \; \forall 0 < T < T' < \infty.$

Thus, it follows that $K_T(0)$ converges increasingly to some $K\in\mathbb S_+^n$ as $T\nearrow\infty.$ By Lemma 4.4, the limit matrix $K$ is a solution of the ARE (4.10).

5. Solvability and the algebraic Riccati equation

According to Theorem 4.5, we have proved that $\mathscr U\neq\varnothing$ for all $x\in\mathbb R^n$ is equivalent to $\mathscr T$ [ $A, C, E; B, D, F$ ] $\neq\varnothing$ . Thus, it is reasonable to give the following assumption:

(A) System [ $A, C, E; B, D, F$ ] is $L^2$ -stabilizable, i.e., $\mathscr T[A, C, E; B, D, F]\neq\varnothing.$

Definition 5.1. For an initial state $x\in\mathbb R^n$ , an element $u^*\in\mathscr U$ is called an open-loop optimal control of (SLQ) $_{\infty}$ if

$J(x;u^*)\leq J(x;u), \; \; \; \; \forall u\in \mathscr{U}.$

Problem (SLQ) $_{\infty}$ is said to be (uniquely) open-loop solvable at x if an open-loop optimal control (uniquely) exists for $x$ . If it is (uniquely) open-loop solvable at all $x\in\mathbb R^n$ , Problem (SLQ) $_{\infty}$ is said to be (uniquely) open-loop solvable.

Definition 5.2. A pair $(\Phi^*, v^*)\in\mathscr T[A, C, E; B, D, F]\times L^2_{\mathbb F}(\mathbb R^m)$ is called a closed-loop optimal strategy of Problem (SLQ) $_{\infty}$ if

$J(x;\Phi^*X^*+v^*)\leq J(x;\Phi X+v)$

for all $(x, \Phi, v)\in\mathbb R^n\times\mathscr T[A, C, E; B, D, F]\times L^2_{\mathbb F}(\mathbb R^m)$ , where $X^*$ and $X$ are the closed-loop state processes corresponding to $(x, \Phi^*, v^*)$ and $(x, \Phi, v)$ , respectively. Problem (SLQ) $_{\infty}$ is said to be ( $uniquely$ ) closed-loop $solvable$ if a closed-loop optimal strategy (uniquely) exists. The outcome

$u\triangleq\Phi X+v$

of a closed-loop strategy ( $\Phi, v$ ) is called a closed-loop control for the initial state $x$ , where $X$ is the closed-loop state process corresponding to :

$\begin{equation} \left\{\begin{array}{ll} dX(t) = \big[(A+B\Phi)X(t)+Bv(t)+b(t)\big]dt\\ \; \; \; \; \; \; \; \; \; +\big[(C+D\Phi)X(t)+Dv(t)+\sigma(t)\big]dW(t)\\ \; \; \; \; \; \; \; \; \; + \int_Z\bigg[\big(E(\theta)+F(\theta)\Phi\big)X(t)+F(\theta)v(t)+h(t, \theta)\bigg]\tilde{\mu}(dt, d\theta), \; \; \; \; t\geq 0, \\ \; X(0) = x.\nonumber \end{array} \right. \end{equation}$

Remark 5.1. From Proposition 4.3, $\mathscr U$ is composed of closed-loop controls for all $x$ when (A) holds. In Definition 5.2, the outcome $u^*\equiv\Phi^*X^*+v^*$ of a closed-loop optimal strategy $\left(\Phi^*, v^*\right)$ is an open-loop optimal control for the initial state $X^*(0)$ , which implies that closed-loop solvability is sufficient to open-loop solvability.

According to the definition of the open-loop and closed-loop solvabilities in ^[17], it is easy to get that for finite horizon cases, closed-loop solvability implies open-loop solvability, whereas open-loop solvability does not necessarily imply closed-loop solvability. However, when it comes to infinite horizon, the open-loop and closed-loop solvabilities are equivalent, and both are equivalent to the existence of a $stabilizing \ solution$ to a $generalized \ algebraic \ Riccati \ equation$ which we will prove later.

Definition 5.3. Let Assumptions 3.1–3.3 be satisfied. The following constrained a generalized algebraic Riccati equation (ARE)

$\begin{equation} \left\{\begin{array}{ll} \mathcal L(K)-\mathcal M(K)^\top\mathcal N(K)^{\dagger}\mathcal M(K) = 0, \\ \mathscr R(\mathcal M(K))\subseteq\mathscr R(\mathcal N(K)), \\ \mathcal N(K)\geq0. \end{array} \right. \end{equation}$

(5.1)

If there exists a $\Theta\in\mathbb R^{m\times n}$ such that the matrix

$\begin{equation} \Phi\triangleq-\mathcal N(K)^{\dagger}\mathcal M(K)+[I-\mathcal N(K)^{\dagger}\mathcal N(K)]\Theta \end{equation}$

(5.2)

is a stabilizer of [ $A, C, E; B, D, F$ ], then the solution $K\in\mathbb S^n$ of (5.1) is said to be $stabilizing$ .

Remark 5.2. Because of the properties of the Moore-Penrose pseudoinverse (see Lemma 2.2 and Remark 2.1), one has

$\mathcal N(K)\Phi = -\mathcal M(K), \; \; \; \mathcal M(K)^\top\Phi = -\Phi^\top\mathcal N(K)\Phi = -\mathcal M(K)^\top\mathcal N(K)^{\dagger}\mathcal M(K)$

if $K$ is a solution (not necessarily stabilizing) to the ARE (5.1) and $\Phi$ is defined by (5.2).

5.1. A study of Problem (SLQ) $_{\infty}^0$

In order to solve Problem (SLQ) $_{\infty}$ , we first discuss Problem (SLQ) $_{\infty}^0$ in this section. Assume that the nonhomogeneous terms $b, \sigma, h, q$ , and $\rho$ are all zero, and we assume that the system [ $A, C, E$ ] is $L^2$ -stable (i.e., 0 $\in\mathscr T[A, C, E; B, D, F]$ ) for the sake of simplicify.

Proposition 5.1. Let Assumptions 3.1–3.3 hold. Suppose that [ $A, C, E$ ] is $L^2$ -stable. Then there exist a bounded self-adjoint linear operator $H_2: L^2_{\mathbb F}(\mathbb R^m)\rightarrow L^2_{\mathbb F}(\mathbb R^m)$ , a bounded linear operator $H_1: \mathbb R^n\rightarrow L^2_{\mathbb F}(\mathbb R^m)$ , a matrix $H_0\in\mathbb S^n$ , and $\tilde{u}\in L^2_{\mathbb F}(\mathbb R^m), \ \tilde{x}\in\mathbb R^n, \ g\in\mathbb R$ such that for any $(x, u)\in\mathbb R^n\times L^2_{\mathbb F}(\mathbb R^m)$ ,

$J(x;u) = \langle H_2u, u\rangle +2\langle H_1x, u\rangle +\langle H_0x, x\rangle +2\langle u, \tilde{u}\rangle +2\langle x, \tilde{x}\rangle +g.$

In particular, in the case of Problem $({\rm{SLQ}}) _{\infty}^0 (i.e., b, \sigma, h, q, \rho = 0),$

$J^0(x;u) = \langle H_2u, u\rangle +2\langle H_1x, u\rangle +\langle H_0x, x\rangle,$

where

$\begin{equation} \left\{\begin{array}{ll} H_2 = M_0^*QM_0+2SM_0+R, \\ H_1 = M_0^*QL_0+SL_0, \\ H_0 = L_0^*QL_0, \\ \tilde{u} = M_0^*QN_0+M_0^*q+SN_0+\rho, \\ \tilde{x} = L_0^*q+L_0^*QN_0, \\ g = \langle QN_0, N_0\rangle +2\langle q, N_0\rangle , \nonumber \end{array} \right. \end{equation}$

and

$\begin{equation} \left\{\begin{array}{ll} L_0(t)\; = \Lambda(t), \\ M_0(t) = \Lambda(t) \int_0^t\Lambda(s)^{-1}\left[ B-CD-\int_ZE(\theta)(I+E(\theta))^{-1}F(\theta)\nu(d\theta)\right] ds\\\; \; \; \; \; \; \; \; \; +\Lambda(t) \int_0^t\Lambda(s)^{-1}DdW_s+\Lambda(t)\int_0^t\int_ZF(\theta)(I+E(\theta))^{-1}\tilde{\mu}(ds, d\theta), \\ N_0(t) = \Lambda(t) \int_0^t\Lambda(s)^{-1}\left[ b-C\sigma-\int_ZE(\theta)(I+E(\theta))^{-1}h(s, \theta)\nu(d\theta)\right] ds\\\; \; \; \; \; \; \; \; \; +\Lambda(t) \int_0^t\Lambda(s)^{-1}\sigma dW_s+\Lambda(t)\int_0^t\int_Zh(s, \theta)(I+E(\theta))^{-1}\tilde{\mu}(ds, d\theta).\nonumber \end{array} \right. \end{equation}$

Proposition 5.2. Under Assumptions 3.1–3.3, if [ $A, C, E$ ] is $L^2$ -stable, then we have the following conclusion:

(i) Problem (SLQ) $_{\infty}$ is open-loop solvable at x if and only if $H_2\geq0$ (i.e., $H_2$ is a positive operator) and $H_1x+\tilde{u}\in\mathscr R(H_2).$ In this instance, $u^*$ is an open-loop optimal control for the initial state x if and only if

$H_2u^*+H_1x+\tilde{u} = 0.$

(ii) If Problem (SLQ) $_{\infty}$ is open-loop solvable, then so is Problem (SLQ) $_{\infty}^0$ .

(iii) If Problem (SLQ) $_{\infty}^0$ is open-loop solvable, then there exists a $U^*\in L^2_{\mathbb F}(\mathbb R^{m\times n})$ such that for any $x\in\mathbb R^n, \ U^*x$ is an open-loop optimal control of Problem (SLQ) $_{\infty}^0$ for the initial state x.

Proof. (ⅰ) By definition, a process $u^*\in L^2_{\mathbb F}(\mathbb R^m)$ is an open-loop optimal control for the initial state $x$ if and only if

$\begin{equation} J(x;u^*+\gamma w)-J(x;u^*)\geq0, \; \; \; \; \forall w\in L^2_{\mathbb F}(\mathbb R^m), \; \; \forall\gamma\in\mathbb R. \end{equation}$

(5.3)

According to Proposition 5.1, we have

$\begin{eqnarray} \begin{split} J(x;u^*+\gamma w)& = \langle H_2(u^*+\gamma w), u^*+\gamma w\rangle +2\langle H_1x, u^*+\gamma w\rangle\\ &\; \; \; \; +\langle H_0x, x\rangle+2\langle u^*+\gamma w, \tilde{u}\rangle +2\langle x, \tilde{x}\rangle +g\\& = J(x;u^*)+\gamma^2\langle H_2w, w\rangle +2\gamma\langle H_2u^*+H_1x+\tilde{u}, w\rangle .\nonumber \end{split} \end{eqnarray}$

Thus, (5.3) is equivalent to

$\gamma^2\langle H_2w, w\rangle +2\gamma\langle H_2u^*+H_1x+\tilde{u}, w\rangle\geq0, \; \; \; \; \forall w\in L^2_{\mathbb F}(\mathbb R^m), \; \; \forall\gamma\in\mathbb R,$

which in turn is equivalent to

$\langle H_2w, w\rangle \geq0, \; \; \forall w\in L^2_{\mathbb F}(\mathbb R^m) \ {\rm and} \ H_2u^*+H_1x+\tilde{u} = 0.$

So (ⅰ) is proved.

For conclusion (ⅱ), since Problem (SLQ) $_{\infty}$ is open-loop solvable, it is easy to get that $H_2\geq0$ and $H_1x+\tilde{u}\in\mathscr R(H_2)$ for all $x\in\mathbb R^n$ according to (ⅰ). Then we take $x = 0$ , it is tempting to conclude that $\tilde{u}\in\mathscr R(H_2)$ , and hence $H_1x\in\mathscr R(H_2)$ for all $x\in\mathbb R^n$ . Finally, we obtain the open-loop solvability of Problem (SLQ) $_{\infty}^0$ by using (ⅰ) again.

For conclusion (ⅲ), assume $e_1, ..., e_n$ is the standard basis for $\mathbb R^n$ . It is easy to get that $U^*\triangleq(u_1^*, ..., u_n^*)$ has the desired properties since $u_i^*$ ia an open-loop optimal control of Problem (SLQ) $_{\infty}^0$ for the initial state $e_i$ .

5.1.1. A finite horizon approach

Let $\Lambda$ be the solution of (3.7), and the matrix

$G\triangleq\mathbb E\int_0^{\infty}\Lambda(t)^\top Q\Lambda(t)dt$

is well-defined if [A, C, E] is $L^2$ -stable. Thus, we can consider the following LQ problem over the finite time horizon [ $0, T$ ] for any $T > 0$ .

Problem (SLQ) $_T^0$ . On the finite time horizon [ $0, T$ ], we assume $X_T(t)$ is a solution of the state equation:

$\begin{equation} \left\{\begin{array}{ll} dX_T(t) = [AX_T(t)+Bu(t)]dt+[CX_T(t)+Du(t)]dW(t)\\\; \; \; \; \; \; \; \; \; \; \; + \int_Z[E(\theta)X_T(t)+F(\theta)u(t)]\tilde{\mu}(dt, d\theta), \\\; X_T(0) = x, \end{array} \right. \end{equation}$

(5.4)

then for any given $x\in\mathbb R^n$ , find a $u^*\in\mathscr U[0, T]$ such that the cost functional

$J_T^0(x;u)\triangleq\mathbb E\bigg\{\langle GX_T(T), X_T(T)\rangle+\int_0^T \bigg{\langle}\left(\begin{array}{cc} Q & S^\top \\ S & R \end{array} \right) \left( \begin{array}{c} X_T(t)\\u(t)\end{array}\right), \left( \begin{array}{c} X_T(t)\\u(t)\end{array}\right)\bigg{\rangle}dt\bigg\}$

is minimized over $\mathscr U [0, T]$ .

Proposition 5.3. Let Assumptions 3.1–3.3 be satisfied. If [A, C, E] is $L^2$ -stable, since we use $V_T^0(x)$ to denote the value function of Problem (SLQ) $_T^0$ , and use $V^0(x)$ to denote the value function of Problem (SLQ) $^0_{\infty}$ , it is easy to get the following conclusions:

(i) For any $x\in\mathbb R^n$ and $u\in\mathscr U[0, T]$ ,

$J_T^0(x;u) = J^0(x;u_e),$

where $u_e\in L^2_{\mathbb F}(\mathbb R^m)$ is the zero-extension of u:

$u_e(t) = u(t)\; \; \; \; if \ t\in[0, T];\; \; \; \; u_e(t) = 0\; \; \; \; if \ t\in(T, \infty).$

(ii) If there exists a $\varepsilon > 0$ such that

$\begin{equation} \langle H_2v, v\rangle \geq\varepsilon||v||^2, \; \; \; \; \forall v\in L^2_{\mathbb F}(\mathbb R^m), \end{equation}$

(5.5)

then

$\begin{equation} J_T^0(0;u)\geq\varepsilon\mathbb E\int_0^T|u(t)|^2dt, \; \; \; \; \forall u\in\mathscr U[0, T]. \end{equation}$

(5.6)

(iii) $\lim_{T\rightarrow \infty}V_T^0(x) = V^0(x)$ for all $x\in\mathbb R^n$ .

Proof. (ⅰ) Let $X_T$ be the solution of (5.4) and $X$ be the solution of

$\begin{equation} \left\{\begin{array}{ll} dX(t) = [AX(t)+Bu_e(t)]dt+[CX(t)+Du_e(t)]dW(t)\\\; \; \; \; \; \; \; \; \; \; \; + \int_Z[E(\theta)X(t)+F(\theta)u_e(t)]\tilde{\mu}(dt, d\theta), \; \; \; \; t\geq0, \\\; X(0) = x, \nonumber \end{array} \right. \end{equation}$

for a fixed $x\in\mathbb R^n$ and an arbitrary $u\in\mathscr U[0, T].$ Thus, we have that

$X(t) = \left\{\begin{aligned}&X_T(t), \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; t\in[0, T], \\&\Lambda(t)\Lambda(T)^{-1}X_T(T), \; \; \; \; t\in(T, \infty).\end{aligned}\right.$

Noting that for $t\geq T, \ \Lambda(t)\Lambda(T)^{-1}$ and $\Lambda(t-T)$ have the same distribution and are independent of $\mathcal F_T$ , we have

$\begin{eqnarray} \begin{split} \mathbb E&\langle \left( \mathbb E\int_0^{\infty}\Lambda(t)^\top Q\Lambda(t)dt\right) X_T(T), X_T(T)\rangle \\ & = \mathbb E\langle \left( \mathbb E\int_T^{\infty}[\Lambda(t-T)]^\top Q[\Lambda(t-T)]dt\right) X_T(T), X_T(T)\rangle \\ & = \mathbb E\langle \left( \mathbb E\int_T^{\infty}[\Lambda(t)\Lambda(T)^{-1}]^\top Q[\Lambda(t)\Lambda(T)^{-1}]dt\right) X_T(T), X_T(T)\rangle \\ & = \mathbb E\int_T^{\infty}\langle Q\Lambda(t)\Lambda(T)^{-1}X_T(T), \Lambda(t)\Lambda(T)^{-1}X_T(T)\rangle dt\\ & = \mathbb E\int_T^{\infty}\langle QX(t), X(t)\rangle dt.\nonumber \end{split} \end{eqnarray}$

It follows that

$\begin{eqnarray} \begin{split} J_T^0(x;u)& = \mathbb E\bigg\{ \langle \left( \mathbb E\int_o^{\infty}\Lambda(t)^\top Q\Lambda(t)dt\right) X_T(T), X_T(T)\rangle\\ &\; \; \; \; \; \; \; \; +\int_0^T\langle \left(\begin{array}{cc}Q & S^\top\\S & R\end{array}\right)\left(\begin{array}{c}X_T(t)\\u(t)\end{array}\right), \left(\begin{array}{c}X_T(t)\\u(t)\end{array}\right)\rangle dt\bigg \}\\ & = \mathbb E\bigg\{\int_T^{\infty}\langle QX, X\rangle dt+\int_0^T\langle \left(\begin{array}{cc}Q & S^\top\\S & R\end{array}\right)\left(\begin{array}{c}X\\u_e\end{array}\right), \left(\begin{array}{c}X\\u_e\end{array}\right)\rangle dt\bigg\}\\ & = J^0(x;u_e). \end{split} \end{eqnarray}$

(5.7)

(ⅱ) Taking $x = 0$ in (5.7), we obtain

$\begin{eqnarray} \begin{split} J^0_T(0;u)& = J^0(0;u_e) = \langle H_2u_e, u_e\rangle \\ &\geq\varepsilon\mathbb E\int_0^{\infty}|u_e(t)|^2dt = \varepsilon\mathbb E\int_0^T|u(t)|^2dt, \nonumber \end{split} \end{eqnarray}$

which proves (ⅱ).

Finally let us prove (ⅲ). According to (5.7), it is easy to obtain

$V^0(x)\leq J^0(x;u_e) = J_T^0(x;u), \; \; \; \; \forall u\in\mathscr U[0, T].$

Let $V_T^0(x)$ be the infimum of $J_T^0(x; u)$ over $u\in\mathscr U[0, T]$ , thus

$\begin{equation} V^0(x)\leq V_T^0(x), \; \; \; \; \forall T > 0. \end{equation}$

(5.8)

On the flip side, when $V^0(x) > -\infty,$ for any given $\varsigma > 0$ , we can find a $u^{\varsigma}\in L_{\mathbb F}^2(\mathbb R^m)$ such that

$\begin{equation} \mathbb E\int_0^{\infty}\langle \left(\begin{array}{cc}Q & S^\top\\S & R\end{array}\right)\left(\begin{array}{c}X^{\varsigma}(t)\\u^{\varsigma}(t)\end{array}\right), \left(\begin{array}{c}X^{\varsigma}(t)\\u^{\varsigma}(t)\end{array}\right)\rangle dt = J^0(x;u^{\varsigma})\leq V^0(x)+\varsigma, \end{equation}$

(5.9)

where $X^{\varsigma}$ is the solution of

$\begin{equation} \left\{\begin{array}{ll} dX^{\varsigma}(t) = [AX^{\varsigma}(t)+Bu^{\varsigma}(t)]dt+[CX^{\varsigma}(t)+Du^{\varsigma}(t)]dW(t)\\ \; \; \; \; \; \; \; \; \; + \int_Z[E(\theta)X^{\varsigma}(t)+F(\theta)u^{\varsigma}(t)]\tilde{\mu}(dt, d\theta), \; \; \; \; t\geq0, \\ X^{\varsigma}(0) = x.\nonumber \end{array} \right. \end{equation}$

Since by Proposition 4.2 $X^{\varsigma}\in\mathcal X[0, \infty)$ , it is easy to get that for large $T > 0$ ,

$|\mathbb E\langle GX^{\varsigma}(T), X^{\varsigma}(T)\rangle |+\bigg|\mathbb E\int_T^{\infty}\langle \left(\begin{array}{cc}Q & S^\top\\S & R\end{array}\right)\left(\begin{array}{c}X^{\varsigma}(t)\\u^{\varsigma}(t)\end{array}\right), \left(\begin{array}{c}X^{\varsigma}(t)\\u^{\varsigma}(t)\end{array}\right)\rangle dt\bigg|\leq\varsigma.$

Now we use $u_T^{\varsigma}$ to be the restriction of $u^{\varsigma}$ to $[0, T]$ , thus we have

$\begin{eqnarray} \begin{split} J^0(x;u^{\varsigma})& = J_T^0(x;u_T^{\varsigma})-\mathbb E\langle GX^{\varsigma}(T), X^{\varsigma}(T)\rangle \\& \; \; \; +\mathbb E\int_T^{\infty}\langle \left(\begin{array}{cc}Q & S^\top\\S & R\end{array}\right)\left(\begin{array}{c}X^{\varsigma}(t)\\u^{\varsigma}(t)\end{array}\right), \left(\begin{array}{c}X^{\varsigma}(t)\\u^{\varsigma}(t)\end{array}\right)\rangle dt\\ &\geq V_T^0(x)-\varsigma. \end{split} \end{eqnarray}$

(5.10)

Since we combine (5.9) and (5.10), it is not hard to get that for large $T > 0$ ,

$V_T^0(x)\leq V^0(x)+2\varsigma.$

To summarize, we have that $V_T^0(x)\rightarrow V^0(x)$ as $T\rightarrow \infty.$ The case when $V^0(x) = -\infty$ can also be proved by a similar method.

Proposition 5.2(ⅰ) implies that if [A, C, E] is $L^2$ -stable and the operator $H_2$ is uniformly positive, then Problem $({\rm SLQ})^0_{\infty}$ is uniquely open-loop solvable and the unique optimal control is given by

$u_x^* = -H_2^{-1}H_1x.$

And if we substitute the optimal control $u_x^*$ into the cost functional, it is easy to get

$V^0(x) = \langle (H_0-H_1^*H_2^{-1}H_1)x, x\rangle , \; \; \; \; x\in\mathbb R^n.$

Notice that $H_0-H_1^*H_2^{-1}H_1$ is a matrix.

Theorem 5.4. Let Assumptions 3.1–3.3 hold. If [A, C, E] is $L^2$ -stable and (5.5) holds for some $\varepsilon > 0$ , then we have the following conclusions

(i) the matrix $K\triangleq H_0-H_1^*H_2^{-1}H_1$ solves the ARE

$\begin{equation} \left\{\begin{array}{ll} \mathcal L(K)-\mathcal M(K)^\top\mathcal N(K)^{-1}\mathcal M(K) = 0, \\ \mathcal N(K) > 0;\nonumber \end{array} \right. \end{equation}$

(ii) the matrix $\Phi\triangleq-\mathcal N(K)^{-1}\mathcal M(K)$ is a stabilizer of [A, C, E; B, D, F];

(iii) the unique open-loop optimal control of Problem (SLQ) $_{\infty}^0$ for the initial state $x$ is given by

$u_x^*(t) = \Phi X_{\Phi}(t;x), \; \; t\in[0, \infty),$

where $X_{\Phi}(\cdot; x)$ is the solution to the following equation:

$\begin{equation} \left\{\begin{array}{ll} dX_{\Phi}(t) = (A+B\Phi)X_{\Phi}(t)dt+(C+D\Phi)X_{\Phi}(t)dW(t)\\ + \int_Z[E(\theta)+F(\theta)\Phi]X_{\Phi}(t)\tilde{\mu}(dt, d\theta), \; t\in[0, \infty), \\ X_{\Phi}(0) = x.\nonumber \end{array} \right. \end{equation}$

Proof. By Proposition 5.3(ⅱ), (5.6) holds. According to Proposition 4.2 of ^[25], it is easy to get that for any $T > 0$ , the differential Riccati equation

$\begin{equation} \left\{\begin{array}{ll} \dot{K}_T(t)+\mathcal L(K_T(t))-\mathcal M(K_T(t))^\top\mathcal N(K_T(t))^{-1}\mathcal M(K_T(t)) = 0, \; \; \; \; t\in[0, T], \\ K_T(T) = G\nonumber \end{array} \right. \end{equation}$

admits a unique solution $K_T\in C([0, T]; \mathbb S^n)$ such that

$\mathcal N(K_T(t))\geq\varepsilon I, \; \; \; \; \forall t\in[0, T];\; \; \; \; V_T^0(x) = \langle K_T(0)x, x\rangle , \; \; \; \; \forall x\in\mathbb R^n.$

From Proposition 5.3(ⅲ), we see that

$\lim\limits_{T\rightarrow \infty}K_T(0) = K, \; \; \; \; \mathcal N(K) > 0.$

Thus the conclusion (ⅰ) can be proved by Lemma 4.4.

Then we will prove the conclusions (ⅱ) and (ⅲ). Firstly, we fix an $x\in\mathbb R^n$ and let $(X_x^*, u_x^*)$ be the corresponding optimal pair of Problem (SLQ) $^0_{\infty}.$ Applying Itô's formula to $t\rightarrow \langle KX_x^*(t), X^*_x(t)\rangle$ and noting that $\lim_{t\rightarrow \infty}\langle KX_x^*(t), X^*_x(t)\rangle = 0,$ we have

$\begin{eqnarray} \begin{split} -\langle Kx, x\rangle & = \mathbb E\int_0^{\infty}\bigg\{2\langle K[AX_x^*(t)+Bu_x^*(t)], X_x^*(t)\rangle\\ &\; \; \; +\langle K[CX_x^*(t)+Du_x^*(t)], CX_x^*(t)+Du_x^*(t)\rangle\\ &\; \; \; +\int_Z\langle K[E(\theta)X_x^*(t)+F(\theta)u_x^*(t)], E(\theta)X_x^*(t)+F(\theta)u_x^*(t)\rangle\nu(d\theta) \bigg\}dt\\ & = \mathbb E\int_0^{\infty}\bigg\{\langle [KA+A^\top K+C^\top KC+\int_ZE(\theta)^\top KE(\theta)\nu(d\theta)]X_x^*(t), X_x^*(t)\rangle\\ &\; \; \; +2\langle [B^\top K+D^\top KC+\int_ZF(\theta)^\top KE(\theta)\nu(d\theta)]X_x^*(t), u_x^*(t)\rangle\\ &\; \; \; +\langle [D^\top KD+\int_ZF(\theta)^\top KF(\theta)\nu(d\theta)]u_x^*(t), u_x^*(t)\rangle \bigg\}dt.\nonumber \end{split} \end{eqnarray}$

On the flip side, we have

$\langle Kx, x\rangle = J^0(x;u^*_x) = \mathbb E\int_0^{\infty}\bigg[\langle QX_x^*, X_x^*\rangle +2\langle SX_x^*, u_x^*\rangle +\langle Ru_x^*, u_x^*\rangle \bigg]dt.$

Adding the last two equations yields

$\begin{eqnarray} \begin{split} 0& = \mathbb E\int_0^{\infty}\bigg[\langle \mathcal L(K)X_x^*, X_x^*\rangle +2\langle \mathcal M(K)X_x^*, u_x^*\rangle +\langle \mathcal N(K)u_x^*, u_x^*\rangle \bigg]dt\\ & = \mathbb E\int_0^{\infty}\bigg[\langle \mathcal M(K)^\top\mathcal N(K)^{-1}\mathcal M(K)X_x^*, X_x^*\rangle +2\langle \mathcal M(K)X_x^*, u_x^*\rangle +\langle \mathcal N(K)u_x^*, u_x^*\rangle \bigg]dt\\ & = \mathbb E\int_0^{\infty}\bigg[\langle\Phi^\top \mathcal N(K)\Phi X_x^*, X_x^*\rangle-2\langle \mathcal N(K)\Phi X_x^*, u_x^*\rangle +\langle \mathcal N(K)u_x^*, u_x^*\rangle\bigg]dt\\ & = \mathbb E\int_0^{\infty}\langle \mathcal N(K)[u_x^*(t)-\Phi X_x^*(t)], u_x^*(t)-\Phi X_x^*(t)\rangle dt.\nonumber \end{split} \end{eqnarray}$

Since $\mathcal N(K) = R+D^\top KD+\int_ZF(\theta)^\top KF(\theta)\nu(d\theta) > 0,$ it is easy to get that

$u_x^*(t) = \Phi X_x^*(t), \; \; \; \; t\in[0, \infty),$

and hence $X_x^*$ is the solution of

$\begin{equation} \left\{\begin{array}{ll} dX_x^*(t) = (A+B\Phi)X_x^*(t)dt+(C+D\Phi)X_x^*(t)dW(t)\\ \; \; \; \; \; \; \; \; \; \; \; + \int_Z[E(\theta)+F(\theta)\Phi]X_x^*(t)\tilde{\mu}(dt, d\theta), \; \; \; \; \; \; \; \; t\in[0, \infty), \\ \; X_x^*(0) = x.\nonumber \end{array} \right. \end{equation}$

Since $X_x^*\in\mathcal X[0, \infty)$ and $x$ is arbitrary, it is not hard to get that $\Phi$ is a stabilizer of [ $A, C, E; B, D, F$ ], and the rest of the proof is clear.

5.1.2. Open-loop and closed-loop solvability

As we have proved in Proposition 5.2, the condition $H_2\geq0$ is merely necessary for the existence of an open-loop optimal control, and according to Theorem 5.4, the uniform positivity condition (5.5) is only sufficient. In order to find the connection between the above, let us consider the following cost functional for $\varsigma > 0$ :

$\begin{eqnarray} \begin{split} J^0_{\varsigma}(x;u)&\triangleq\mathbb E\int_0^{\infty}\langle \left(\begin{array}{cc}Q & S^\top\\S & R+\varsigma I\end{array}\right)\left(\begin{array}{c}X(t)\\u(t)\end{array}\right), \left(\begin{array}{c}X(t)\\u(t)\end{array}\right)\rangle dt\\ & = J^0(x;u)+\varsigma\mathbb E\int_0^{\infty}|u(t)|^2dt\\ & = \langle (H_2+\varsigma I)u, u\rangle +2\langle H_1x, u\rangle +\langle H_0x.x\rangle .\nonumber \end{split} \end{eqnarray}$

Now we denote by Problem (SLQ) $_{\infty}^{0, \varsigma}$ the problem of minimizing $J_{\varsigma}^0(x; u)$ subject to the state equation

$\begin{equation} \left\{\begin{array}{ll} dX(t) = [AX(t)+Bu(t)]dt+[CX(t)+Du(t)]dW(t) \\\; \; \; \; \; \; \; \; \; \; \; \; + \int_{Z}[E(\theta)X(t) +F(\theta)u(t)]\tilde{\mu}(dt, d\theta), \; \; \; t \in [0, \infty), \\X(0) = x , \nonumber \end{array} \right. \end{equation}$

and by $V_{\varsigma}^0(x)$ the corresponding value function. Since we assume $H_2\geq0$ , and then the operator $H_2+\varsigma I$ is uniformly positive for all $\varsigma > 0$ , it is easy to apply Theorem 5.4 to Problem (SLQ) $_{\infty}^{0, \varsigma}$ . Thus we will obtain a characterization for the value function $V^0(x)$ of Problem (SLQ) $^0_{\infty}$ while $\varsigma\rightarrow0.$

Theorem 5.5. Let Assumptions 3.1–3.3 be satisfied. If [A, C, E] is $L^2$ -stable, and Problem (SLQ) $^0_{\infty}$ is open-loop solvable, then the generalized ARE (5.1) admits a stabilizing solution $K\in\mathbb S^n.$ Moreover, $V^0(x) = \langle Kx, x\rangle$ for all $x\in\mathbb R^n.$

Proof. According to Proposition 5.2, the open-loop solvability of Problem (SLQ) $^0_{\infty}$ implies $H_2\geq0.$ Thus, for any $\varsigma > 0$ , the ARE:

$\begin{equation} \left\{\begin{array}{ll} \mathcal L(K_{\varsigma})-\mathcal M(K_{\varsigma})^\top[\mathcal N(K_{\varsigma})+\varsigma I]^{-1}\mathcal M(K_{\varsigma}) = 0, \\ \mathcal N(K_{\varsigma})+\varsigma I > 0 \end{array} \right. \end{equation}$

(5.11)

admits a unique solution $K_{\varsigma}\in\mathbb S^n$ such that $V^0_{\varsigma}(x) = \langle K_{\varsigma}x, x\rangle$ for all $x\in\mathbb R^n$ according to Theorem 5.4. Then we define a stabilizer of [A, C, E; B, D, F]:

$\begin{equation} \Phi_{\varsigma}\triangleq-[\mathcal N(K_{\varsigma})+\varsigma I]^{-1}\mathcal M(K_{\varsigma}). \end{equation}$

(5.12)

Thus, it follows that the unique open-loop optimal control $u_{\varsigma}^*(\cdot; x)$ of Problem (SLQ) $_{\infty}^{0, \varsigma}$ for the initial state $x$ is given by

$u_{\varsigma}^*(t;x) = \Phi_{\varsigma}\Pi_{\varsigma}(t)x, \; \; \; \; t\geq0,$

since $\Pi_{\varsigma}$ is a solution of the matrix SDE:

$\begin{equation} \left\{\begin{array}{ll} d\Pi_{\varsigma}(t) = (A+B\Phi_{\varsigma})\Pi_{\varsigma}(t)dt+(C+D\Phi_{\varsigma})\Pi_{\varsigma}(t)dW(t)\\ \; \; \; \; \; \; \; \; \; \; + \int_z[E(\theta)+F(\theta)\Phi_{\varsigma}]\Pi_{\varsigma}(t)\tilde{\mu}(dt, d\theta), \; \; \; \; t\geq0, \\ \; \Pi_{\varsigma}(0) = I.\nonumber \end{array} \right. \end{equation}$

Then we denote $U^*\in L^2_{\mathbb F}(\mathbb R^{m\times n})$ to be a process which is defined in Proposition 5.2(ⅲ). According to the definition of the value function, it is easy to get that for any $x\in\mathbb R^n$ and $\varsigma > 0,$

$\begin{eqnarray} \begin{split} V^0&(x)+\varsigma\mathbb E\int_0^{\infty}|\Phi_{\varsigma}\Pi_{\varsigma}(t)x|^2dt\leq J^0(x;\Phi_{\varsigma}\Pi_{\varsigma}(t)x)+\varsigma\mathbb E\int_0^{\infty}|\Phi_{\varsigma}\Pi_{\varsigma}(t)x|^2dt\\ & = J^0_{\varsigma}(x;\Phi_{\varsigma}\Pi_{\varsigma}(t)x) = V^0_{\varsigma}(x) = \langle K_{\varsigma}x, x\rangle \leq J^0_{\varsigma}(x;U^*x)\\ & = V^0(x)+\varsigma\mathbb E\int_0^{\infty}|U^*(t)x|^2dt. \end{split} \end{eqnarray}$

(5.13)

Equation (5.13) implies that for any $x\in\mathbb R^n$ and $\varsigma > 0,$

$\begin{equation} V^0(x)\leq\langle K_{\varsigma}x, x\rangle\leq V^0(x)+\varsigma\mathbb E\int_0^{\infty}|U^*(t)x|^2dt, \end{equation}$

(5.14)

$\begin{equation} 0\leq\mathbb E\int_0^{\infty}|\Phi_{\varsigma}\Pi_{\varsigma}(t)x|^2dt\leq\mathbb E\int_0^{\infty}|U^*(t)x|^2dt. \end{equation}$

(5.15)

From (5.14)) we see that $K\equiv\lim_{\varsigma\rightarrow 0}K_{\varsigma}$ exists and $V^0(x) = \langle Kx, x\rangle$ for all $x\in\mathbb R^n$ .

From (5.15) we see that the family of positive semi-definite matrices:

$\Theta_{\varsigma} = \mathbb E\int_0^{\infty}\Pi_{\varsigma}(t)^\top\Phi_{\varsigma}^\top\Phi_{\varsigma}\Pi_{\varsigma}(t)dt, \; \; \; \; \varsigma > 0$

is bounded, and the system [ $A+B\Phi_{\varsigma}, C+D\Phi_{\varsigma}, E+F\Phi_{\varsigma}$ ] is $L^2$ -stable for $\Phi_{\varsigma}$ is a stabilizer of [ $A, C, E; B, D, F$ ]. According to Theorem 4.1, we have

$\begin{eqnarray} \begin{split} \Theta_{\varsigma}(A+B\Phi_{\varsigma})+(A&+B\Phi_{\varsigma})^\top \Theta_{\varsigma}+(C+D\Phi_{\varsigma})^\top \Theta_{\varsigma}(C+D\Phi_{\varsigma})\\ &+\int_Z\left[ E(\theta)+F(\theta)\Phi_{\varsigma}\right] ^\top \Theta_{\varsigma}\left[ E(\theta)+F(\theta)\Phi_{\varsigma}\right] \nu(d\theta)+\Phi_{\varsigma}^\top \Phi_{\varsigma} = 0.\nonumber \end{split} \end{eqnarray}$

It follows that

$0\leq\Phi_{\varsigma}^\top\Phi_{\varsigma}\leq-\left[ \Theta_{\varsigma}(A+B\Phi_{\varsigma})+(A+B\Phi_{\varsigma})^\top \Theta_{\varsigma}\right], \; \; \; \; \forall\varsigma > 0 .$

The above, together with the boundedness of $\left\lbrace \Theta_{\varsigma}\right\rbrace _{\varsigma > 0}$ , shows that

$\begin{equation} |\Phi_{\varsigma}|^2\leq J(1+|\Phi_{\varsigma}|), \; \; \; \; \forall\varsigma > 0, \end{equation}$

(5.16)

for some constant $J > 0.$ Noting that (5.16) implies the boundedness of $\left\lbrace \Phi_{\varsigma}\right\rbrace _{\varsigma > 0}$ , we may choose a sequence $\left\lbrace \varsigma_k\right\rbrace _{k = 1}^{\infty}\subseteq(0, \infty)$ with $\lim_{k\rightarrow \infty}\varsigma_k = 0$ such that $\Phi\equiv\lim_{k\rightarrow \infty}\Phi_{\varsigma_k}$ exists. Observe that

$\mathcal N(K)\Phi = \lim\limits_{k\rightarrow \infty}[\mathcal N(K_{\varsigma_k})+\varsigma_kI]\Phi_{\varsigma_k} = -\lim\limits_{k\rightarrow \infty}\mathcal M(K_{\varsigma_k}) = -\mathcal M(K).$

Thus, we have by Lemma 2.2 that

$\begin{equation} \mathscr R(\mathcal M(K))\subseteq\mathscr R(\mathcal N(K)), \end{equation}$

(5.17)

$\begin{equation} \Phi = -\mathcal N(K)^{\dagger}\mathcal M(K)+[I-\mathcal N(K)^{\dagger}\mathcal N(K)]\Theta, \end{equation}$

(5.18)

for some $\Theta\in\mathbb R^{m\times n}.$ Notice that by (5.12), $\mathcal M(K_{\varsigma})^\top = -\Phi_{\varsigma}^\top[\mathcal N(K_{\varsigma})+\varsigma I].$ Thus (5.11) can be written as

$\begin{equation} \left\{\begin{array}{ll} \mathcal L(K_{\varsigma})-\Phi_{\varsigma}^\top[\mathcal N(K_{\varsigma})+\varsigma I]\Phi_{\varsigma} = 0, \\ \mathcal N(K_{\varsigma})+\varsigma I > 0.\nonumber \end{array} \right. \end{equation}$

Now let $k\rightarrow \infty$ along $\left\lbrace \varsigma_k\right\rbrace _{k = 1}^{\infty}$ , then $\varsigma_k\rightarrow0$ in the above. Thus we have

$\begin{equation} \left\{\begin{array}{ll} \mathcal L(K)-\Phi^\top\mathcal N(K)\Phi = 0, \\ \mathcal N(K)\geq 0, \nonumber \end{array} \right. \end{equation}$

which, together with (5.17) and (5.18), implies that $K$ solves the generalized ARE (5.1). Following we will show that $K$ is a stabilizing solution, and we need only to show $\Phi\in\mathscr T$ [ $A, C, E; B, D, F$ ]. To prove it, we define $\Pi$ as a solution of the matrix SDE:

$\begin{equation} \left\{\begin{array}{ll} d\Pi(t) = (A+B\Phi)\Pi(t)dt+(C+D\Phi)\Pi(t)dW(t)\\\; \; \; \; \; \; \; \; \; \; \; + \int_Z\left[ E(\theta)+F(\theta)\Phi\right] \Pi(t)\tilde{\mu}(dt, d\theta), \; \; \; \; t\geq 0, \\ \Pi(0) = I.\nonumber \end{array} \right. \end{equation}$

Since $\Phi_{\varsigma_k}\rightarrow \Phi$ as $k\rightarrow \infty$ , we get $\Pi_{\varsigma_k}(t)\rightarrow \Pi(t)$ , a.s.for all $t\geq 0$ . By Fatou's lemma and (5.15), we get that

$\begin{eqnarray} \begin{split} \mathbb E\int_0^{\infty}|\Phi\Pi(t)x|^2dt&\leq\lim\limits_{k\rightarrow \infty}\inf\mathbb E\int_0^{\infty}|\Phi_{\varsigma_k}\Pi_{\varsigma_k}(t)x|^2dt\\ &\leq\mathbb E\int_0^{\infty}|U^*(t)x|^2dt < \infty, \; \; \; \; \forall x\in\mathbb R^n.\nonumber \end{split} \end{eqnarray}$

This implies $\Phi\Pi\in L_{\mathbb F}^2(\mathbb R^{m\times n}).$ Thus, by Proposition 4.2, it is easy to get that $\Pi\in L_{\mathbb F}^2(\mathbb R^{m\times n}).$ Consequently, $\Phi\in\mathscr T[A, C, E; B, D, F].$

According to Theorem 5.5, if [ $A, C, E$ ] is $L^2$ -stable, then the existence of a stabilizing solution to the generalized ARE is necessary for the open-loop solvability of Problem (SLQ) $_{\infty}^0.$ Later we will show that the converse is also true.

Proposition 5.6. Let Assumptions 3.1–3.3 hold. Suppose that the generalized ARE (5.1) admits a stabilizing solution $K\in\mathbb S^n$ . Then Problem ${\rm(SLQ)} _{\infty}^0$ is closed-loop solvable.

Proof. Let $X(\cdot)\equiv X(\cdot; x, u)$ be the solution of the following equation:

$\begin{equation} \left\{\begin{array}{ll} dX(t) = [AX(t)+Bu(t)]dt+[CX(t)+Du(t)]dW(t) \\\; \; \; \; \; \; \; \; \; \; \; \; + \int_{Z}[E(\theta)X(t) +F(\theta)u(t)]\tilde{\mu}(dt, d\theta), \; \; \; \; t \in [0, \infty), \\X(0) = x , \nonumber \end{array} \right. \end{equation}$

for arbitrary fixed initial state $x$ and admissible control $u\in\mathscr U$ . Applying Itô's formula to $t\mapsto\langle KX(t), X(t)\rangle$ , it is easy to get that

$\begin{eqnarray} \begin{split} -\langle Kx, x\rangle & = \mathbb E\int^{\infty}_0\bigg[\langle \bigg(KA+A^\top K+C^\top KC+\int_ZE(\theta)^\top KE(\theta)\nu(d\theta)\bigg)X(t), X(t)\rangle\\ &+2\langle \bigg(B^\top K+D^\top KC+\int_ZF(\theta)^\top KE(\theta)\nu(d\theta)\bigg)X(t), u(t)\rangle\\ & +\langle\bigg( D^\top KD+\int_ZF(\theta)^\top KF(\theta)\nu(d\theta)\bigg)u(t), u(t)\rangle \bigg]dt.\nonumber \end{split} \end{eqnarray}$

Thus, we have

$\begin{eqnarray} \begin{split} J^0(x;u)-\langle Kx, x\rangle = E\int_0^{\infty}\langle \left(\begin{array}{cc}\mathcal L(K) & \mathcal M(K)^\top\\\mathcal M(K) & \mathcal N(K) \end{array}\right)\left(\begin{array}{c}X(t)\\u(t)\end{array}\right), \left(\begin{array}{c}X(t)\\u(t)\end{array}\right)\rangle dt.\nonumber \end{split} \end{eqnarray}$

By the extended Schur's lemma (Lemma 2.3), we obtain

$\begin{eqnarray} \begin{split} \left(\begin{array}{cc}\mathcal L(K) & \mathcal M(K)^\top\\\mathcal M(K) & \mathcal N(K) \end{array}\right)\geq 0.\nonumber \end{split} \end{eqnarray}$

Thus,

$\begin{equation} J^0(x;u)\geq\langle Kx, x\rangle , \; \; \; \; \forall u\in\mathscr U. \end{equation}$

(5.19)

On the flip side, we can choose a $\Theta\in\mathbb R^{m\times n}$ such that the matrix

$\begin{equation} \Phi^*\triangleq-\mathcal N(K)^{\dagger}\mathcal M(K)+[I-\mathcal N(K)^{\dagger}\mathcal N(K)]\Theta\nonumber \end{equation}$

is a stabilizer of [ $A, C, E; B, D, F$ ] since $K$ is stabilizing. By Remark 5.2,

$\begin{equation} \mathcal N(K)\Phi^* = -\mathcal M(K), \; \; \mathcal M(K)^\top\Phi^* = -(\Phi^*)^\top\mathcal N(K)\Phi^* = -\mathcal M(K)^\top\mathcal N(K)^{\dagger}\mathcal M(K).\nonumber \end{equation}$

Thus, for any $x^*\in\mathbb R^n$ ,

$\begin{eqnarray} \begin{split} \langle \left(\begin{array}{cc}\mathcal L(K) & \mathcal M(K)^\top\\\mathcal M(K) & \mathcal N(K) \end{array}\right)\left(\begin{array}{c}x^*\\\Phi^*x^*\end{array}\right), \left(\begin{array}{c}x^*\\\Phi^*x^*\end{array}\right)\rangle \\ = \langle [\mathcal L(K)+2\mathcal M(K)^\top\Phi^*+(\Phi^*)^\top\mathcal N(K)\Phi^*]x^*, x^*\rangle \\ = \langle [\mathcal L(K)-\mathcal M(K)^\top\mathcal N(K)^{\dagger}\mathcal M(K)]x^*, x^*\rangle = 0. \end{split} \end{eqnarray}$

(5.20)

Following we will show that $(\Phi^*, 0)$ is a closed-loop optimal strategy of Problem (SLQ) $_\infty^0$ . Firstly, let $X^*$ be the closed-loop state process corresponding to $(x, \Phi^*, 0)$ :

$\begin{equation} \left\{\begin{array}{ll} dX^*(t) = (A+B\Phi^*)X^*(t)dt+(C+D\Phi^*)X^*(t)dW(t) \\\; \; \; \; \; \; \; \; \; \; \; \; + \int_{Z}[E(\theta)+F(\theta)\Phi^*]X^*(t)\tilde{\mu}(dt, d\theta), \; \; \; \; t \in [0, \infty), \\X^*(0) = x .\nonumber \end{array} \right. \end{equation}$

Then, applying Itô's formula to $t\mapsto\langle KX^*(t), X^*(t)\rangle$ and using (5.20), we obtain

$\begin{eqnarray} \begin{split} J^0&(x;\Phi^*X^*)-\langle Kx, x\rangle \\ & = \mathbb E\int_0^{\infty}\langle \left(\begin{array}{cc}\mathcal L(K) & \mathcal M(K)^\top\\\mathcal M(K) & \mathcal N(K) \end{array}\right)\left(\begin{array}{c}X^*(t)\\\Phi^*X^*(t)\end{array}\right), \left(\begin{array}{c}X^*(t)\\\Phi^*X^*(t)\end{array}\right)\rangle dt = 0.\nonumber \end{split} \end{eqnarray}$

Combining the last equation with (5.19), the conclusion is proved.

Remark 5.3. According to the proof of Proposition 5.6, it is clear that if $K$ is a stabilizing solution to the generalized ARE (5.1), then $V^0(x) = \langle Kx, x\rangle$ for all $x\in\mathbb R^n$ .

Combining Remark 5.1, Theorem 5.5, and Proposition 5.6, we have the following result.

Theorem 5.7. Let Assumptions 3.1–3.3 be satisfied. Suppose that [ $A.C, E$ ] is $L^2$ -stable. Then the following statements are equivalent:

(i) Problem (SLQ) $_\infty^0$ is open-loop solvable;

(ii) Problem (SLQ) $_\infty^0$ is closed-loop solvable;

(iii) The generalized ARE (5.1) admits a unique stabilizing solution.

5.1.3. Nonhomogeneous problems

Following we will focus our attention to Problem (SLQ) $_\infty$ , and show a result that is similar to Theorem 5.7.

Denote by $\Omega$ a stabilizer of [ $A, C, E; B, D, F$ ], and define

$\begin{equation} \left\{\begin{array}{ll} \hat{A} = A+B\Omega, \; \; \; \; \hat{C} = C+D\Omega, \; \; \; \; \hat{E} = E+F\Omega, \\ \hat{S} = S+R\Omega, \; \; \; \; \hat{Q} = Q+S^\top\Omega+\Omega^\top S+\Omega^\top R\Omega, \\ \hat{q} = q+\Omega^\top\rho. \end{array} \right. \end{equation}$

(5.21)

Then we use $\hat{X}(\cdot; x, v)$ to denote the solution of the following state equation

$\begin{equation} \left\{\begin{array}{ll} d\hat{X}(t) = [\hat{A}\hat{X}(t)+Bv(t)+b(t)]dt\\ \; \; \; \; \; \; \; \; \; +[\hat{C}\hat{X}(t)+Dv(t)+\sigma(t)]dW(t)\\ \; \; \; \; \; \; \; \; \; + \int_Z[\hat{E}(\theta)\hat{X}(t)+F(\theta)v(t)+h(\theta)]\tilde{\mu}(dt, d\theta), \; \; \; \; t\geq0, \\ \; \hat{X}(0) = x \end{array} \right. \end{equation}$

(5.22)

with regard to $x$ and $v$ . And we have the cost functional

$\begin{eqnarray} \begin{split} \hat{J}(x;v) \triangleq &J(x;\Omega\hat{X}+v)\\ = &\mathbb{E} \int_0^{\infty}\bigg[ \bigg{\langle}\left(\begin{array}{cc} Q & S^\top \\ S & R \end{array} \right) \left( \begin{array}{c} \hat{X}(t)\\\Omega\hat{X}(t)+v(t)\end{array}\right), \left( \begin{array}{c} \hat{X}(t)\\\Omega\hat{X}(t)+v(t)\end{array}\right)\bigg{\rangle}\\ &+2\bigg{\langle}\left(\begin{array}{c} q(t) \\ \rho(t)\end{array}\right), \left( \begin{array}{c} \hat{X}(t)\\\Omega\hat{X}(t)+v(t)\end{array}\right)\bigg{\rangle}\bigg]dt\\ = &\mathbb{E} \int_0^{\infty}\bigg[ \bigg{\langle}\left(\begin{array}{cc} \hat{Q} & \hat{S}^\top \\ \hat{S} & R \end{array} \right) \left( \begin{array}{c} \hat{X}\\v\end{array}\right), \left( \begin{array}{c} \hat{X}\\v\end{array}\right)\bigg{\rangle} +2\bigg{\langle}\left(\begin{array}{c} \hat{q} \\ \rho\end{array}\right), \left( \begin{array}{c} \hat{X}\\v\end{array}\right)\bigg{\rangle}\bigg]dt. \end{split} \end{eqnarray}$

(5.23)

It is easy to find that [ $\hat{A}, \hat{C}, \hat{E}$ ] is $L^2$ -stable. And we denote by (SLQ) $_{\infty}^{'}$ the problem of minimizing (5.23) subject to (5.22). According to Proposition 4.3, we have the following obvious conclusions about Problem (SLQ) $_{\infty}^{'}$ .

Proposition 5.8. Under Assumptions 3.1–3.3, let $\Omega$ be a stabilizer of [A, C, E; B, D, F]. Then

(i) Problem (SLQ) $_{\infty}^{'}$ is open-loop solvable at $x\in\mathbb R^n$ if and only if Problem (SLQ) $_{\infty}$ is so. In this case, $v^*$ is an open-loop optimal control of Problem (SLQ) $_{\infty}^{'}$ if and only if $u^*\triangleq v^*+\Omega\hat{X}(\cdot; x, v^*)$ is an open -loop optimal control of Problem (SLQ) $_{\infty}$ ;

(ii) Problem (SLQ) $_{\infty}^{'}$ is closed-loop solvable if and only if Problem (SLQ) $_{\infty}$ is so. In this case, $(\Omega^*, v^*)$ is a closed-loop optimal strategy of Problem (SLQ) $_{\infty}^{'}$ if and only if $(\Omega^*+\Omega, v^*)$ is a closed-loop optimal strategy of Problem (SLQ) $_{\infty}$ .

Following we will show the main result of this section.

Theorem 5.9. Under Assumptions 3.1–3.3 and (A), we have the following equivalent statements:

(i) Problem (SLQ) $_\infty$ is open-loop solvable;

(ii) Problem (SLQ) $_\infty$ is closed-loop solvable;

(iii) The generalized ARE (5.1) admits a stabilizing solution $K\in\mathbb S^n$ , and the BSDE

$\begin{eqnarray} \begin{split} d\eta = -\bigg\{&[A-B\mathcal N(K)^{\dagger}\mathcal M(K)]^\top\eta+[C-D\mathcal N(K)^{\dagger}\mathcal M(K)]^\top(\zeta_1+K\sigma)\\&+\int_Z[E(\theta)-F(\theta)\mathcal N(K)^{\dagger}\mathcal M(K)]^\top\left[ \zeta_2+Kh(t, \theta)\right] \nu(d\theta)-\mathcal M(K)^\top\mathcal N(K)^{\dagger}\rho\\&+Kb+q\bigg\}dt+\zeta_1dW(t)+\int_Z\zeta_2\tilde{\mu}(dt, d\theta), \; \; \; \; t\geq0, \end{split} \end{eqnarray}$

(5.24)

admits an $L^2$ -stable adapted solution ( $\eta, \zeta_1, \zeta_2$ ) such that

$\begin{eqnarray} \begin{split} \varpi(t)\triangleq B^\top\eta(t)+D^\top[\zeta_1(t)+K\sigma(t)]+\int_Z F(\theta)^\top[\zeta_2(t)+Kh(t, \theta)]\nu(d\theta)+\rho(t)\in\mathscr R(\mathcal N(K)), \\ a.e.\; \; t\in[0, \infty), \; \; a.s. \end{split} \end{eqnarray}$

(5.25)

In the above case, all closed-loop optimal strategies ( $\Phi^*, v^*$ ) are given by

$\begin{equation} \left\{\begin{array}{ll} \Phi^* = -\mathcal N(K)^{\dagger}\mathcal M(K)+[I-\mathcal N(K)^{\dagger}\mathcal N(K)]\Theta, \\ v^*\; = -\mathcal N(K)^{\dagger}\varpi+[I-\mathcal N(K)^{\dagger}\mathcal N(K)]w, \end{array} \right. \end{equation}$

(5.26)

where $\Theta\in\mathbb R^{m\times n}$ and $w\in L^2_{\mathbb F}(\mathbb R^m)$ are arbitrary. And it turns out that $\Phi^*\in\mathscr T[A, C, E; B, D, F]$ . Then every open-loop optimal control $u^*$ for the initial state $x$ admits a closed-loop representation:

$\begin{equation} u^*(t) = \Phi^*X^*(t)+v^*(t), \; \; \; \; t\in[0, \infty), \end{equation}$

(5.27)

where ( $\Phi^*, v^*$ ) is a closed-loop optimal strategy of Problem (SLQ) $_{\infty}$ and $X^*$ is the corresponding closed-loop state process. Moreover,

$\begin{eqnarray} \begin{split} V(x)& = \langle Kx, x\rangle +2\mathbb E\langle \eta(0), x\rangle \\ &\; \; \; +\mathbb E\int_0^{\infty} \bigg[ \langle K\sigma, \sigma\rangle +\int_Z\langle Kh(t, \theta), h(t, \theta)\rangle \nu(d\theta) +2\langle \eta, b\rangle +2\langle \zeta_1, \sigma\rangle \\ &\; \; \; +2\langle \zeta_2, \int_Zh(t, \theta)\nu(d\theta)\rangle-\langle \mathcal N(K)^{\dagger}\varpi, \varpi\rangle\bigg] dt.\nonumber \end{split} \end{eqnarray}$

Proof. Since Remark 5.1 holds, it is obvious that the implication (ⅱ) $\Rightarrow$ (ⅰ) follows.

To prove the implication (ⅰ) $\Rightarrow$ (ⅲ), we will consider Problem (SLQ) $_{\infty}^{'}$ firstly. According to Proposition 5.8(ⅰ), it is open-loop solvable. Since the system [ $\hat{A}, \hat{C}, \hat{E}$ ] is $L^2$ -stable, it is easy to get that by Proposition 5.2 and Theorem 5.5, the ARE

$\begin{equation} \left\{\begin{array}{ll} K\hat{A}+\hat{A}^\top K+\hat{C}^\top K\hat{C}+ \int_Z\hat{E}(t, \theta)^\top K\hat{E}(t, \theta)\nu(d\theta)+\hat{Q}\\-\left[ KB+\hat{C}^\top KD+ \int_Z\hat{E}(t, \theta)^\top KF(t, \theta)\nu(d\theta)+\hat{S}^\top\right] \\\times\left[ R+D^\top KD+ \int_ZF(t, \theta)^\top KF(t, \theta)\nu(d\theta)\right] ^{\dagger}\\\times\left[ B^\top K+D^\top K\hat{C}+ \int_ZF(t, \theta)^\top K\hat{E}(t, \theta)\nu(d\theta)+\hat{S}\right] = 0, \\ \mathscr R\left[ B^\top K+D^\top K\hat{C}+ \int_ZF(t, \theta)^\top K\hat{E}(t, \theta)\nu(d\theta)+\hat{S}\right] \subseteq\\ \mathscr R\left[ R+D^\top KD+ \int_ZF(t, \theta)^\top KF(t, \theta)\nu(d\theta)\right] , \\ \mathcal N(K) = R+D^\top KD+ \int_ZF(t, \theta)^\top KF(t, \theta)\nu(d\theta)\geq 0 \end{array} \right. \end{equation}$

(5.28)

admits a (unique) stabilizing solution $K\in\mathbb S^n$ . Choose a $\Gamma\in\mathbb R^{m\times n}$ such that

$\begin{equation} \Omega^*\triangleq-\mathcal N(K)^{\dagger}\bigg[B^\top K+D^\top K\hat{C}+\int_ZF(\theta)^\top K\hat{E}(\theta)\nu(d\theta)+\hat{S}\bigg]+[I-\mathcal N(K)^{\dagger}\mathcal N(K)]\Gamma\nonumber \end{equation}$

is a stabilizer of [ $\hat{A}, \hat{C}, \hat{E}; B, D, F$ ]. According to Remark 5.2 and (5.21),

$\begin{eqnarray} \begin{split} \mathcal N(K)(\Omega^*+\Omega)& = -\left[ B^\top K+D^\top K\hat{C}+\int_ZF(\theta)^\top K\hat{E}(\theta)\nu(d\theta)+\hat{S}\right] +\mathcal N(K)\Omega\\ & = -\left[ B^\top K+D^\top KC+\int_ZF(\theta)^\top KE(\theta)\nu(d\theta)+S\right] = -\mathcal M(K). \end{split} \end{eqnarray}$

(5.29)

It follows that $\mathscr R(\mathcal M(K))\subseteq\mathscr R(\mathcal N(K))$ . Substituting (5.21) into the first equation of (5.28) gives

$\begin{eqnarray} \begin{split} 0 = &\mathcal L(K)+\mathcal M(K)^\top\Omega+\Omega^\top\mathcal M(K)-\mathcal M(K)^\top\mathcal N(K)^{\dagger}\mathcal M(K)\\ &-\mathcal M(K)^\top N(K)^{\dagger}\mathcal N(K)\Omega-\Omega^\top \mathcal N(K)N(K)^{\dagger}\mathcal M(K)\\ = &\mathcal L(K)-\mathcal M(K)^\top\mathcal N(K)^{\dagger}\mathcal M(K)+\mathcal M(K)^\top[I-\mathcal N(K)^{\dagger}\mathcal N(K)]\Omega\\ &+\Omega^\top[I-\mathcal N(K)\mathcal N(K)^{\dagger}]\mathcal M(K)\\ = &\mathcal L(K)-\mathcal M(K)^\top\mathcal N(K)^{\dagger}\mathcal M(K)-(\Omega^*+\Omega)^\top\mathcal N(K)[I-\mathcal N(K)^{\dagger}\mathcal N(K)]\Omega\\ &-\Omega^\top[I-\mathcal N(K)\mathcal N(K)^{\dagger}]\mathcal N(K)(\Omega^*+\Omega)\\ = &\mathcal L(K)-\mathcal M(K)^\top\mathcal N(K)^{\dagger}\mathcal M(K).\nonumber \end{split} \end{eqnarray}$

Therefore, $K$ solves the ARE (5.1). Since $\Omega^*+\Omega$ is a stabilizer of [ $A, C, E; B, D, F$ ], it is easy to get that $K$ is stabilizing according to (5.29) and Lemma 2.2.

Now choose $\Theta\in\mathbb R^{m\times n}$ such that the matrix

$\begin{equation} \Phi\triangleq-\mathcal N(K)^{\dagger}\mathcal M(K)+[I-\mathcal N(K)^{\dagger}\mathcal N(K)]\Theta\nonumber \end{equation}$

is a stabilizer of [ $A, C, E; B, D, F$ ], and think over the following BSDE on [0, $\infty$ ):

$\begin{eqnarray} \begin{split} d\eta(t) = &-\bigg[(A+B\Phi)^\top\eta+(C+D\Phi)^\top(\zeta_1+K\sigma)+\int_Z(E(\theta)+F(\theta)\Phi)^\top\\ &\cdot(\zeta_2+Kh(t, \theta))\nu(d\theta)+\Phi^\top\rho+Kb+q \bigg]dt+\zeta_1dW(t)+\int_Z\zeta_2\tilde{\mu}(dt, d\theta). \end{split} \end{eqnarray}$

(5.30)

Since [ $A+B\Phi, C+D\Phi, E+F\Phi$ ] is $L^2$ -stable, it is not hard to get the conclusion that (5.30) admits a unique $L^2$ -stable adapted solution ( $\eta, \zeta_1, \zeta_2$ ) by Lemma 3.2. Now we let $X(\cdot)\equiv X(\cdot; x, u)$ be the corresponding state process for fixed but arbitrary $x$ and $u\in\mathscr U$ . Applying Itô's formula to $t\mapsto\langle KX(t), X(t)\rangle$ , then it follows that

$\begin{eqnarray} \begin{split} -\langle Kx, x\rangle & = \mathbb E\int_0^{\infty}\left[ \langle\left( KA+A^\top K+C^\top KC+\int_Z E(\theta)^\top KE(\theta)\nu(d\theta)\right) X, X \rangle \right. \\ &\; \; \; \; \left.+2\langle \left( B^\top K+D^\top KC+\int_ZF(\theta)^\top KE(\theta)\nu(d\theta)\right) X, u\rangle\right.\\ &\; \; \; \; \left. +\langle \left( D^\top KD +\int_ZF(\theta)^\top KF(\theta)\nu(d\theta)\right) u, u\rangle \right.\\ &\; \; \; \; \left.+2\langle C^\top K\sigma+Kb+\int_ZE(\theta)^\top Kh(t, \theta)\nu(d\theta), X\rangle \right.\\ &\; \; \; \; \left.+2\langle D^\top K\sigma+\int_ZF(\theta)^\top Kh(t, \theta)\nu(d\theta), u\rangle \right.\\ &\; \; \; \; \left.+\langle K\sigma, \sigma\rangle +\int_Z\langle Kh(t, \theta), h(t, \theta)\rangle\nu(d\theta)\right]dt. \nonumber \end{split} \end{eqnarray}$

Applying Itô's formula to $t\mapsto\langle \eta(t), X(t)\rangle$ yields

$\begin{eqnarray} \begin{split} \mathbb E\langle \eta(0), x\rangle & = \mathbb E\int_0^{\infty}\left[ \langle \Phi^\top\left( B^\top\eta+D^\top\zeta_1+D^\top K\sigma+\rho+\int_ZF(\theta)^\top(\zeta_2+Kh(t, \theta))\nu(d\theta)\right) , X\rangle \right.\\ &\; \; \; \; \left.+\langle C^\top K\sigma+Kb+q+\int_ZE(\theta)^\top Kh(t, \theta)\nu(d\theta), X \rangle \right.\\ &\; \; \; \; \left.-\langle B^\top\eta+D^\top\zeta_1+\int_ZF(\theta)^\top\zeta_2\nu(d\theta), u\rangle \right.\\ &\; \; \; \; \left.-\langle \eta, b\rangle -\langle \zeta_1, \sigma\rangle -\langle \zeta_2, \int_Zh(t, \theta)\nu(d\theta)\rangle\right]dt . \nonumber \end{split} \end{eqnarray}$

Denote $\varpi(t) = B^\top\eta(t)+D^\top[\zeta_1(t)+K\sigma(t)]+\int_Z F(t, \theta)^\top[\zeta_2(t)+Kh(t, \theta)]\nu(d\theta)+\rho(t)$ , then we have

$\begin{eqnarray} \begin{split} J(&x;u)-\langle Kx, x\rangle -2\mathbb E\langle \eta(0), x\rangle \\ & = \mathbb E\int_0^{\infty}\bigg[ \langle \mathcal L(K)X, X\rangle +2\langle \mathcal M(K)X, u\rangle +\langle \mathcal N(K)u, u\rangle-2\langle \Phi^\top\varpi, X\rangle +2\langle \varpi, u\rangle +\langle K\sigma, \sigma\rangle \\ &\; \; \; \; + 2\langle \eta, b\rangle +2\langle \zeta_1, \sigma\rangle+\int_Z\langle Kh(t, \theta), h(t, \theta)\rangle \nu(d\theta) +2\langle \zeta_2, \int_Zh(t, \theta)\nu(d\theta)\rangle \bigg]dt \\ & = \mathbb E\int_0^{\infty}\bigg[ \langle \mathcal N(K)(u-\Phi X), u-\Phi X\rangle +2\langle \varpi, u-\Phi X\rangle +\langle K\sigma, \sigma\rangle+2\langle \eta, b\rangle +2\langle \zeta_1, \sigma\rangle \\ &\; \; \; \; +\int_Z\langle Kh(t, \theta), h(t, \theta)\rangle \nu(d\theta)+2\langle \zeta_2, \int_Zh(t, \theta)\nu(d\theta)\rangle\bigg] dt. \end{split} \end{eqnarray}$

(5.31)

Now we assume that $u^*$ is an open-loop optimal control of Problem (SLQ) $_{\infty}$ for the initial state $x$ , and $X_{\Phi}(\cdot; x, v)$ is a solution to the following SDE:

$\begin{equation} \left\{\begin{array}{ll} dX_{\Phi}(t) = [(A+B\Phi)X_{\Phi}(t)+Bv(t)+b(t)]dt \\\; \; \; \; \; \; \; \; \; \; \; \; +[(C+D\Phi)X_{\Phi}(t)+Dv(t)+\sigma(t)]dW(t) \\\; \; \; \; \; \; \; \; \; \; \; \; + \int_{Z}[(E(\theta)+F(\theta)\Phi)X_{\Phi}(t)+Fv(t)+h(t, \theta)]\tilde{\mu}(dt, d\theta), \; \; \; t \in [0, \infty), \\X_{\Phi}(0) = x \in \mathbb R^n.\nonumber \end{array} \right. \end{equation}$

By Proposition 4.3, we have the conclusion that any admissible control with respect to the initial state $x$ is of the form

$\begin{equation} \Phi X_{\Phi}(\cdot;x, v)+v, \; \; \; \; v\in L^2_{\mathbb F}(\mathbb R^m).\nonumber \end{equation}$

Thus $u^* = \Phi X_{\Phi}(\cdot; x, v^*)+v^*$ for some $v^*\in L^2_{\mathbb F}(\mathbb R^m)$ . Then we have

$\begin{eqnarray} \begin{split} J(&x;\Phi X_{\Phi}(\cdot;x, v^*)+v^*) = J(x;u^*)\\ &\leq J(x;\Phi X_{\Phi}(\cdot;x, v)+v), \; \; \; \; \forall v\in L^2_{\mathbb F}(\mathbb R^m). \end{split} \end{eqnarray}$

(5.32)

Since we take $u = \Phi X_{\Phi}(\cdot; x, v)+v$ , it is easy to get that

$\begin{equation} X(\cdot;x, u^*) = X_{\Phi}(\cdot;x, v^*), \; \; \; \; X(\cdot;x, u) = X_{\Phi}(\cdot;x, v), \nonumber \end{equation}$

thus, according to (5.31) and (5.32), it follows that for any $v\in L^2_{\mathbb F}(\mathbb R^m)$ ,

$\begin{eqnarray} \begin{split} \mathbb E&\int_0^{\infty}\left[ \langle \mathcal N(K)v^*, v^*\rangle +2\langle \varpi, v^*\rangle \right] dt\\ & = J(x;u^*)-\langle Kx, x\rangle -2\mathbb E\langle \eta(0), x\rangle\\ &\; \; \; \; -\mathbb E\int_0^{\infty}\left[ \langle K\sigma, \sigma\rangle +\int_Z\langle Kh(t, \theta), h(t, \theta)\rangle \nu(d\theta) +2\langle \eta, b\rangle +2\langle \zeta_1, \sigma\rangle +2\langle \zeta_2, \int_Zh(t, \theta)\nu(d\theta)\rangle\right] dt\\ &\leq J(x;u)-\langle Kx, x\rangle -2\mathbb E\langle \eta(0), x\rangle\\ &\; \; \; \; -\mathbb E\int_0^{\infty}\left[ \langle K\sigma, \sigma\rangle +\int_Z\langle Kh(t, \theta), h(t, \theta)\rangle \nu(d\theta) +2\langle \eta, b\rangle +2\langle \zeta_1, \sigma\rangle +2\langle \zeta_2, \int_Zh(t, \theta)\nu(d\theta)\rangle\right] dt\\ & = \mathbb E\int_0^{\infty}\left[ \langle \mathcal N(K)v, v\rangle +2\langle \varpi, v\rangle \right] dt.\nonumber \end{split} \end{eqnarray}$

The above inequality implies that $v^*$ is a minimizer of the functional

$\begin{equation} H(v) = \mathbb E\int_0^{\infty}\left[\langle \mathcal N(K)v, v\rangle +2\langle \varpi, v\rangle \right] dt, \; \; \; \; v\in L^2_{\mathbb F}(\mathbb R^m). \nonumber \end{equation}$

Thus, it follows that

$\begin{equation} \mathcal N(K)v^*+\varpi = 0, \; \; \; \; a.e.\ t\geq0, \; \; a.s.\nonumber \end{equation}$

By Lemma 2.2, we have

$\begin{equation} \left\{\begin{array}{ll} \varpi\in\mathscr R(\mathcal N(K)), \; \; \text{and}\\ v^* = -\mathcal N(K)^{\dagger}\varpi+\left[ I-\mathcal N(K)^{\dagger}\mathcal N(K)\right]\vartheta \ \text{for some}\ \vartheta\in L^2 _{\mathbb F}(\mathbb R^m).\nonumber \end{array} \right. \end{equation}$

Observing that

$\begin{equation} \left[ \Phi^\top+\mathcal M(K)^\top\mathcal N(K)^{\dagger}\right] \varpi = -\Theta^\top\left[ I-\mathcal N(K)\mathcal N(K)^{\dagger}\right] \mathcal N(K)v^* = 0, \nonumber \end{equation}$

we obtain

$\begin{eqnarray} \begin{split} (A&+B\Phi)^\top\eta+(C+D\Phi)^\top(\zeta_1+K\sigma)+\int_Z\left[ E(\theta)+F(\theta)\Phi\right]^\top \left[ \zeta_2+Kh(t, \theta)\right] \nu(d\theta)+\Phi^\top\rho+Kb+q\\ & = A^\top\eta+C^\top(\zeta_1+K\sigma)+\int_ZE(\theta)^\top \left[ \zeta_ 2+Kh(t, \theta)\right]\nu(d\theta) +Kb+q+\Phi^\top\varpi\\ & = A^\top\eta+C^\top(\zeta_1+K\sigma)+\int_ZE(\theta)^\top \left[ \zeta_ 2+Kh(t, \theta)\right]\nu(d\theta) +Kb+q-\mathcal M(K)^\top\mathcal N(K)^{\dagger}\varpi\\ & = \left[ A-B\mathcal N(K)^{\dagger}\mathcal M(K)\right] ^\top\eta+\left[ C-D\mathcal N(K)^{\dagger}\mathcal M(K)\right]^\top(\zeta_1+K\sigma)\\ &+\int_Z\left[ E(\theta)-F(\theta)\mathcal N(K)^{\dagger}\mathcal M(K)\right]^\top\left[ \zeta_2+Kh(t, \theta)\right] \nu(d\theta) -\mathcal M(K)^\top\mathcal N(K)^{\dagger}\rho+Kb+q.\nonumber \end{split} \end{eqnarray}$

Thus, it follows that ( $\eta, \zeta_1, \zeta_2$ ) is an $L^2$ -stable adapted solution of the BSDE (5.24). And from Remark 2.1, we have

$\begin{equation} \langle \varpi, v^*\rangle = -\langle \mathcal N(K)v^*, v^*\rangle = -\langle \mathcal N(K)^{\dagger}\varpi, \varpi\rangle .\nonumber \end{equation}$

Therefore, since we replace $u$ by $u^* = \Phi X_{\Phi}(\cdot; x, v^*)+v^*$ in (5.31), it is not hard to get that

$\begin{array}{l} V(x) = J(x;u^*)\\ = \langle Kx, x\rangle +2\mathbb E\langle \eta(0), x\rangle \\ +\mathbb E\int_0^{\infty} \left[ \langle K\sigma, \sigma\rangle +\int_Z\langle Kh(t, \theta), h(t, \theta)\rangle \nu(d\theta)+2\langle \eta, b\rangle +2\langle \zeta_1, \sigma\rangle +2\langle \zeta_2, \int_Zh(t, \theta)\nu(d\theta)\rangle\right] dt\\ +\mathbb E\int_0^{\infty}\left[\langle \mathcal N(K)v^*, v^*\rangle +2\langle \varpi, v^*\rangle \right] dt\\ = \langle Kx, x\rangle +2\mathbb E\langle \eta(0), x\rangle \\ +\mathbb E\int_0^{\infty} \left[ \langle K\sigma, \sigma\rangle +\int_Z\langle Kh(t, \theta), h(t, \theta)\rangle \nu(d\theta) +2\langle \eta, b\rangle +2\langle \zeta_1, \sigma\rangle +2\langle \zeta_2, \int_Zh(t, \theta)\nu(d\theta)\rangle\right] dt\\ -\mathbb E\int_0^{\infty}\langle \mathcal N(K)^{\dagger}\varpi, \varpi\rangle dt.\nonumber \end{array}$

Finally, to prove the implication (ⅲ) $\Rightarrow$ (ⅱ), we firstly take an arbitrary ( $x, u$ ) $\in\mathbb R^n\times\mathscr U$ and let $X(\cdot)\equiv X(\cdot; x, u)$ be the corresponding state process. According to (5.31), we have

$\begin{eqnarray} \begin{split} J(x;u)& = \langle Kx, x\rangle +2\mathbb E\langle \eta(0), x\rangle \\ &\; \; \; +\mathbb E\int_0^{\infty} \bigg[ \langle K\sigma, \sigma\rangle +\int_Z\langle Kh(t, \theta), h(t, \theta)\rangle \nu(d\theta)\\ &\; \; \; +2\langle \eta, b\rangle +2\langle \zeta_1, \sigma\rangle +2\langle \zeta_2, \int_Zh(t, \theta)\nu(d\theta)\rangle\bigg]dt\\ &\; \; \; +\mathbb E\int_0^{\infty}\bigg[\langle \mathcal L(K)X, X\rangle +2\langle \mathcal M(K)X, u\rangle\\ &\; \; \; +\langle \mathcal N(K)u, u\rangle +2\langle \varpi, u+\mathcal N(K)^{\dagger}\mathcal M(K)X\rangle \bigg]dt. \end{split} \end{eqnarray}$

(5.33)

Let ( $\Phi^*, v^*$ ) be defined by (5.26). Then by Lemma 2.2 and Remark 2.1, we have

$\begin{eqnarray} \begin{split} \mathcal M(K)& = -\mathcal N(K)\Phi^*, \; \; \; \; \mathcal L(K) = \mathcal M(K)^\top\mathcal N(K)^{\dagger}\mathcal M(K) = (\Phi^*)^\top\mathcal N(K)\Phi^*, \\ \varpi& = -\mathcal N(K)v^*, \; \; \; \; \mathcal N(K)\mathcal N(K)^{\dagger}\mathcal M(K) = -\mathcal N(K)\Phi^*.\nonumber \end{split} \end{eqnarray}$

Then we substitute the above into (5.33) and complete the square, it follows that

$\begin{eqnarray} \begin{split} J(x;u)& = \langle Kx, x\rangle +2\mathbb E\langle \eta(0), x\rangle \\ &\; \; \; +\mathbb E\int_0^{\infty} \bigg[ \langle K\sigma, \sigma\rangle +\int_Z\langle Kh(t, \theta), h(t, \theta)\rangle \nu(d\theta) +2\langle \eta, b\rangle +2\langle \zeta_1, \sigma\rangle \\ &\; \; \; +2\langle \zeta_2, \int_Zh(t, \theta)\nu(d\theta)\rangle-\langle \mathcal N(K)v^*, v^*\rangle \bigg]dt\\ &\; \; \; +\mathbb E\int_0^{\infty}\langle \mathcal N(K)(u-\Phi^*X-v^*), u-\Phi^*X-v^*\rangle dt. \end{split} \end{eqnarray}$

(5.34)

Thus, owing to $\mathcal N(K)\geq0$ and $\Phi^*$ is a stabilizer of [ $A, C, E; B, D, F$ ], it is not hard to get that

$\begin{eqnarray} \begin{split} J(x;u)&\geq\langle Kx, x\rangle +2\mathbb E\langle \eta(0), x\rangle \\ &\; \; \; +\mathbb E\int_0^{\infty} \bigg[ \langle K\sigma, \sigma\rangle +\int_Z\langle Kh(t, \theta), h(t, \theta)\rangle \nu(d\theta) +2\langle \eta, b\rangle +2\langle \zeta_1, \sigma\rangle \\ &\; \; \; +2\langle \zeta_2, \int_Zh(t, \theta)\nu(d\theta)\rangle-\langle \mathcal N(K)v^*, v^*\rangle \bigg]dt\\ &\; \; \; = J(x;\Phi^*X^*+v^*), \; \; \; \forall x\in\mathbb R^n, \; \; \forall u\in\mathscr U, \end{split} \end{eqnarray}$

(5.35)

which shows that ( $\Phi^*, v^*$ ) is a closed-loop optimal strategy of Problem (SLQ) $_{\infty}$ .

Finally, assume that ( $\check{\Phi}, \check{v}$ ) is an another closed-loop optimal strategy, and $\check{X}$ is the solution of the following closed-loop system

$\begin{equation} \left\{\begin{array}{ll} d\check{X}(t) = [(A+B\check{\Phi})\check{X}(t)+B\check{v}(t)+b(t)]dt \\\; \; \; \; \; \; \; \; \; \; \; \; +[(C+D\check{\Phi})\check{X}(t)+D\check{v}(t)+\sigma(t)]dW(t) \\\; \; \; \; \; \; \; \; \; \; \; \; + \int_{Z}[(E(\theta)+F(\theta)\check{\Phi})\check{X}(t)+F(\theta)\check{v}(t)+h(t, \theta)]\tilde{\mu}(dt, d\theta), \; \; \; t \in [0, \infty), \\\check{X}(0) = x \in \mathbb R^n.\nonumber \end{array} \right. \end{equation}$

Since we denote by $\check{u} = \check{\Phi}\check{X}+\check{v}$ the outcome of $(\check{\Phi}, \check{v})$ , it is obvious that

$X(t;x, \check{u}) = \check{X}(t), \; \; \; \; t\geq0.$

Now (5.34) and (5.35) imply that

$\begin{eqnarray} \begin{split} V(x)& = J(x;\check{u}) = \langle Kx, x\rangle +2\mathbb E\langle \eta(0), x\rangle \\ &\; \; \; +\mathbb E\int_0^{\infty} \bigg[ \langle K\sigma, \sigma\rangle +\int_Z\langle Kh(t, \theta), h(t, \theta)\rangle \nu(d\theta) +2\langle \eta, b\rangle +2\langle \zeta_1, \sigma\rangle +2\langle \zeta_2, \int_Zh(t, \theta)\nu(d\theta)\rangle \\ &\; \; \; -\langle \mathcal N(K)v^*, v^*\rangle \bigg]dt\\ &\; \; \; +\mathbb E\int_0^{\infty}\langle \mathcal N(K)(\check{u}-\Phi^*\check{X}-v^*), \check{u}-\Phi^*\check{X}-v^*\rangle dt\\ &\; \; \; = V(x)+\mathbb E\int_0^{\infty}|\mathcal N(K)^{\frac1{2}}(\check{\Phi}\check{X}+\check{v}-\Phi^*\check{X}-v^*)|^2dt, \nonumber \end{split} \end{eqnarray}$

from which we have that

$\mathcal N(K)^{\frac1{2}}(\check{\Phi}\check{X}+\check{v}-\Phi^*\check{X}-v^*) = 0, \; \; \; \; \forall x\in\mathbb R^n.$

Multiplying the above by $\mathcal N(K)^{\frac1{2}}$ , we obtain

$\begin{equation} \mathcal N(K)(\check{\Phi}-\Phi^*)\check{X}+\mathcal N(K)(\check{v}-v^*) = 0, \; \; \; \forall x\in\mathbb R^n. \end{equation}$

(5.36)

Since $\check{\Phi}, \Phi^*, \check{v},$ and $v^*$ are independent of $x$ , and (5.36) holds for all $x\in\mathbb R^n$ , it is easy to get that for any $x\in\mathbb R^n$ , the solution $X_0$ of

$\begin{equation} \left\{\begin{array}{ll} dX_0(t) = (A+B\check{\Phi})X_0(t)dt+(C+D\check{\Phi})X_0(t)dW(t) \\\; \; \; \; \; \; \; \; \; \; \; \; + \int_{Z}[E(\theta)+F(\theta)\check{\Phi}]X_0(t)\tilde{\mu}(dt, d\theta), \; \; \; \; t \in [0, \infty), \\X_0(0) = x , \nonumber \end{array} \right. \end{equation}$

satisfies $\mathcal N(K)(\check{\Phi}-\Phi^*)X_0 = 0$ . It follows that $\mathcal N(K)(\check{\Phi}-\Phi^*) = 0$ and hence $\mathcal N(K)(\check{v}-v^*) = 0$ . Now we have

$\mathcal N(K)\check{\Phi} = \mathcal N(K)\Phi^* = -\mathcal M(K), \; \; \; \; \mathcal N(K)\check{v} = \mathcal N(K)v^* = -\varpi.$

According to Lemma 2.2, $(\check{\Phi}, \check{v})$ must be of the form (5.26). Since we denote $\bar{X}$ the corresponding optimal state process, if $\bar{u}$ is an open-loop optimal control for the initial state $x$ , it is not hard to get that

$\mathcal N(K)(\bar{u}-\Phi^*\bar{X}-v^*) = 0,$

i.e.,

$\mathcal N(K)\bar{u} = \mathcal N(K)\Phi^*\bar{X}+\mathcal N(K)v^* = -\mathcal M(K)\bar{X}-\varpi.$

According to Lemma 2.2, there exists a $w\in L^2_{\mathbb F}(\mathbb R^m)$ such that

$\begin{eqnarray} \begin{split} \bar{u}& = -\mathcal N(K)^{\dagger}\mathcal M(K)\bar{X}-\mathcal N(K)^{\dagger}\varpi+\left[ I-\mathcal N(K)^{\dagger}\mathcal N(K)\right] w\\ & = \left\lbrace -\mathcal N(K)^{\dagger}\mathcal M(K)+\left[ I-\mathcal N(K)^{\dagger}\mathcal N(K)\right]\Theta\right\rbrace \bar{X}\\ &\; \; \; -\mathcal N(K)^{\dagger}\varpi+\left[ I-\mathcal N(K)^{\dagger}\mathcal N(K)\right](w-\Theta\bar{X}), \nonumber \end{split} \end{eqnarray}$

where $\Theta\in\mathbb R^{m\times n}$ satisfies

$-\mathcal N(K)^{\dagger}\mathcal M(K)+\left[ I-\mathcal N(K)^{\dagger}\mathcal N(K)\right]\Theta\in\mathscr T[A, C, E; B, D, F],$

which shows that $\bar{u}$ has the closed-loop representation (5.27).

6. Conclusions

This paper mainly studies a kind of stochastic LQ optimal control problem with jumps in an infinite horizon. Firstly, we prove that the $L^2$ -stabilizability of our control system is equivalent to the non-emptiness of the admissible control set for all initial state and is also equivalent to the existence of a positive solution to some integral ARE. Then we conclude the equivalence of the open-loop and closed-loop solvability and the existence of a stabilizing solution of the associated generalized ARE. Finally, we explore that any open-loop optimal control for the initial state admits a closed-loop representation, and the representation is obtained.

Acknowledgments

The authors would like to thank anonymous referees for helpful comments and suggestions which improved the original version of the paper. Q. Meng was supported by the Key Projects of Natural Science Foundation of Zhejiang Province (No. Z22A013952) and the National Natural Science Foundation of China (No. 12271158 and No. 11871121). Maoning Tang was supported by the Natural Science Foundation of Zhejiang Province (No. LY21A010001).

Conflict of interest

The authors declare that they have no competing interests.

References

[1]	R. Bellman, I. Glicksberg, O. Gross, Some aspects of the mathematical theory of control processes, Technical Report R-313, The Rand Corporation, 1958.
[2]	R. Boel, P. Varaiya, Optimal control of jump processes, SIAM J. Control Optim., 15 (1977), 92–119. http://dx.doi.org/10.1137/0315008 doi: 10.1137/0315008
[3]	G. Guatteri, G. Tessitore, On the backward stochastic Riccati equation in infinite dimensions, SIAM J. Control Optim., 44 (2005), 159–194. http://dx.doi.org/10.1137/S0363012903425507 doi: 10.1137/S0363012903425507
[4]	G. Guatteri, G. Tessitore, Backward stochastic Riccati equations and infinite horizon L-Q optimal control problems with stochastic coefficients, Appl. Math. Optim., 57 (2008), 207–235. http://dx.doi.org/10.1007/s00245-007-9020-y doi: 10.1007/s00245-007-9020-y
[5]	G. Guatteri, F. Masiero, Infinite horizon and ergodic optimal quadratic control problems for an affine equation with stochastic coefficients, SIAM J. Control Optim., 48 (2009), 1600–1631. http://dx.doi.org/10.1137/070696234 doi: 10.1137/070696234
[6]	H. Halkin, Necessary conditions for optimal control problems with infinite horizons, Economet. Soc., 1974,267–272. http://dx.doi.org/10.2307/1911976 doi: 10.2307/1911976
[7]	S. Hu, Infinite horizontal optimal quadratic control for an affine equation driven by $L\acute{e}vy$ processes, Chinese Ann. Math., 34A (2013), 179–204.
[8]	Y. Hu, B. Oksendal, Partial information linear quadratic control for jump diffusions, SIAM J. Control Optim., 47 (2008), 1744–1761. http://dx.doi.org/10.1137/060667566 doi: 10.1137/060667566
[9]	J. Huang, X. Li, J. Yong, A linear-quadratic optimal control problem for mean-field stochastic differential equations in infinite horizon, Math. Optim. Control, 5 (2015), 97–139. http://dx.doi.org/10.48550/arXiv.1208.5308 doi: 10.48550/arXiv.1208.5308
[10]	R. Kalman, Contributions to the theory of optimal control, Bol. Soc. Math., 5 (1960), 102–119.
[11]	H. Kushner, Optimal stochastic control, IRE Trans. Autom. Control, 7 (1962), 120–122.
[12]	A. Lim, X. Zhou, Linear-quadratic control of backward stochastic differential equations, SIAM J. Control Optim., 40 (2001), 450–474. http://dx.doi.org/10.1137/S0363012900374737 doi: 10.1137/S0363012900374737
[13]	Q. Meng, General linear quadratic optimal stochastic control problem driven by a Brownian motion and a Poisson random martingale measure with random coefficients, Stoch. Anal. Appl., 32 (2014), 88–109. http://dx.doi.org/10.1080/07362994.2013.845106 doi: 10.1080/07362994.2013.845106
[14]	B. Øksendal, A. Sulem, Maximum principles for optimal control of forward-backward stochastic differential equations with jumps, SIAM J. Control Optim, 48 (2009), 2945–2976. http://dx.doi.org/10.1137/080739781 doi: 10.1137/080739781
[15]	M. Rami, X. Zhou, J. Moore, Well-posedness and attainability of indefnite stochastic linear quadratic control in infinite time horizon, Syst. Control Lett., 41 (2000), 123–133. http://dx.doi.org/10.1016/S0167-6911(00)00046-3 doi: 10.1016/S0167-6911(00)00046-3
[16]	R. Situ, On solution of backward stochastic differential equations with jumps and applications, Stoch. Proc. Appl., 66 (1997), 209–236. http://dx.doi.org/10.1016/S0304-4149(96)00120-2 doi: 10.1016/S0304-4149(96)00120-2
[17]	J. Sun, X. Li, J. Yong, Open-loop and closed-loop solvabilities for stochastic linear quadratic optimal control problems, SIAM J. Control Optim., 54 (2016), 2274–2308. http://dx.doi.org/10.1137/15M103532X doi: 10.1137/15M103532X
[18]	J. Sun, J. Yong, Stochastic linear quadratic optimal control problems in infinite horizon, Appl. Math. Optim., 2018,145–183. http://dx.doi.org/10.1007/s00245-017-9402-8 doi: 10.1007/s00245-017-9402-8
[19]	J. Sun, J. Yong, Stochastic linear-quadratic optimal control theory: Open-loop and closed-loop solutions, Switzerland: Springer, 2020. http://dx.doi.org/10.1007/978-3-030-20922-3
[20]	S. Tang, X. Li, Necessary conditions for optimal control of stochastic systems with random jumps, SIAM J. Control Optim., 32 (1994), 1447–1475. http://dx.doi.org/10.1137/S0363012992233858 doi: 10.1137/S0363012992233858
[21]	W. Wonham, On a matrix riccati equation of stochastic control, SIAM J. Control, 6 (1968), 681–697. http://dx.doi.org/10.1137/0306043 doi: 10.1137/0306043
[22]	W. Wonham, Random differential equation in control theory, Matematika, 2 (1970), 131–212.
[23]	H. Wu, X. Li, An infinite horizon linear quadratic problem with unbounded controls in Hilbert space, Acta Math. Sin., 17 (2001), 527–540. http://dx.doi.org/10.1007/s101140100123 doi: 10.1007/s101140100123
[24]	Z. Wu, X. Wang, FBSDE with Poisson process and its application to linear quadratic stochastic optimal control problem with random jumps, Acta Autom. Sin., 29 (2003), 821–826.
[25]	Y. Yang, M. Tang, Q. Meng, A mean-field stochastic linear-quadratic optimal control problem with jumps under partial information, ESAIM-Control Optim. Ca., 28 (2022). http://dx.doi.org/10.1051/cocv/2022039 doi: 10.1051/cocv/2022039
[26]	J. Yong, X. Zhou, Stochastic controls: Hamiltonian systems and HJB equations, Springer Science Business Media, 2012. http://dx.doi.org/10.1016/s0005-1098(02)00057-2
[27]	M. H. A. Davis, Linear estimation and stochastic control, Chapman and Hall, 1977.
[28]	A. Bensoussan, Lectures on stochastic control, part Ⅰ. In nonlinear filtering and stochastic control, Lect. Notes Math., 972 (1982), 1–39. http://dx.doi.org/10.1007/bfb0064859 doi: 10.1007/bfb0064859
[29]	M. Kohlmann, X. Zhou, Relationship between backward stochastic differential equations and stochastic controls: A linear-quadratic approach, SIAM J. Control Optim., 38 (2000), 1392–1407. http://dx.doi.org/10.1137/S036301299834973X doi: 10.1137/S036301299834973X
[30]	N. Li, Z. Wu, Z. Yu, Indefinite stochastic linear-quadratic optimal control problems with random jumps and related stochastic Riccati equations, Sci. China Math., 61 (2018), 563–576. http://dx.doi.org/10.1007/s11425-015-0776-6 doi: 10.1007/s11425-015-0776-6
[31]	B. Øksendal, S. Agnes, Applied stochastic control of jump diffusions, Springer Berlin Heidelberg, 2005. http://dx.doi.org/10.1007/3-540-26441-8-3

This article has been cited by:

Fan Wu, Xun Li, Xin Zhang, Stochastic Linear Quadratic Optimal Control Problems with Regime-Switching Jumps in Infinite Horizon, 2025, 63, 0363-0129, 852, 10.1137/24M1645334

Reader Comments

Your name:*

Email:*
© 2023 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

AIMS Mathematics

1.8 3.4

Metrics

Article views(1612) PDF downloads(113) Cited by(1)

Preview PDF

Download XML

Export Citation

Article outline

Show full outline

AIMS Mathematics

A stochastic linear-quadratic optimal control problem with jumps in an infinite horizon

Related Papers:

Abstract

1. Introduction

2. Preliminaries

2.1. Notations

2.2. Pseudoinverse

3. Formulation of problem

4. Admissible control sets and stabilizability

4.1. Stability

4.2. Stabilizability

5. Solvability and the algebraic Riccati equation

5.1. A study of Problem (SLQ) $_{\infty}^0$

5.1.1. A finite horizon approach

5.1.2. Open-loop and closed-loop solvability

5.1.3. Nonhomogeneous problems

6. Conclusions

Acknowledgments

Conflict of interest

References

This article has been cited by:

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Other Articles By Authors

Catalog

AIMS Mathematics

A stochastic linear-quadratic optimal control problem with jumps in an infinite horizon

Related Papers:

Abstract

1. Introduction

2. Preliminaries

2.1. Notations

2.2. Pseudoinverse

3. Formulation of problem

4. Admissible control sets and stabilizability

4.1. Stability

4.2. Stabilizability

5. Solvability and the algebraic Riccati equation

5.1. A study of Problem (SLQ)0∞ _{\infty}^0

5.1.1. A finite horizon approach

5.1.2. Open-loop and closed-loop solvability

5.1.3. Nonhomogeneous problems

6. Conclusions

Acknowledgments

Conflict of interest

References

This article has been cited by:

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Other Articles By Authors

Related pages

Tools

Export File

Citation

Format

Content

Catalog

5.1. A study of Problem (SLQ) $_{\infty}^0$