An optimal control problem without control costs

Mario Lefebvre; Mario Lefebvre

doi:10.3934/mbe.2023239

Mathematical Biosciences and Engineering

2023, Volume 20, Issue 3: 5159-5168. doi: 10.3934/mbe.2023239

Previous Article Next Article

Research article Special Issues

An optimal control problem without control costs

Mario Lefebvre ^,

Department of Mathematics and Industrial Engineering, Polytechnique Montréal, C.P. 6079, Succursale Centre-ville, Montréal, H3C 3A7, Canada

Academic Editor: Jesús Martín Vaquero

Received: 22 November 2022 Revised: 28 December 2022 Accepted: 31 December 2022 Published: 09 January 2023

A two-dimensional diffusion process is controlled until it enters a given subset of $\mathbb{R}^2$ . The aim is to find the control that minimizes the expected value of a cost function in which there are no control costs. The optimal control can be expressed in terms of the value function, which gives the smallest value that the expected cost can take. To obtain the value function, one can make use of dynamic programming to find the differential equation it satisfies. This differential equation is a non-linear second-order partial differential equation. We find explicit solutions to this non-linear equation, subject to the appropriate boundary conditions, in important particular cases. The method of similarity solutions is used.

Keywords:

Citation: Mario Lefebvre. An optimal control problem without control costs[J]. Mathematical Biosciences and Engineering, 2023, 20(3): 5159-5168. doi: 10.3934/mbe.2023239

Related Papers:

[1]	Heping Ma, Hui Jian, Yu Shi . A sufficient maximum principle for backward stochastic systems with mixed delays. Mathematical Biosciences and Engineering, 2023, 20(12): 21211-21228. doi: 10.3934/mbe.2023938
[2]	Dan Zhu, Qinfang Qian . Optimal switching time control of the hyperbaric oxygen therapy for a chronic wound. Mathematical Biosciences and Engineering, 2019, 16(6): 8290-8308. doi: 10.3934/mbe.2019419
[3]	Alessia Civallero, Cristina Zucca . The Inverse First Passage time method for a two dimensional Ornstein Uhlenbeck process with neuronal application. Mathematical Biosciences and Engineering, 2019, 16(6): 8162-8178. doi: 10.3934/mbe.2019412
[4]	H. J. Alsakaji, F. A. Rihan, K. Udhayakumar, F. El Ktaibi . Stochastic tumor-immune interaction model with external treatments and time delays: An optimal control problem. Mathematical Biosciences and Engineering, 2023, 20(11): 19270-19299. doi: 10.3934/mbe.2023852
[5]	Giuseppe D'Onofrio, Enrica Pirozzi . Successive spike times predicted by a stochastic neuronal model with a variable input signal. Mathematical Biosciences and Engineering, 2016, 13(3): 495-507. doi: 10.3934/mbe.2016003
[6]	Minna Shao, Hongyong Zhao . Dynamics and optimal control of a stochastic Zika virus model with spatial diffusion. Mathematical Biosciences and Engineering, 2023, 20(9): 17520-17553. doi: 10.3934/mbe.2023778
[7]	Xiaoxuan Pei, Kewen Li, Yongming Li . A survey of adaptive optimal control theory. Mathematical Biosciences and Engineering, 2022, 19(12): 12058-12072. doi: 10.3934/mbe.2022561
[8]	Laurenz Göllmann, Helmut Maurer . Optimal control problems with time delays: Two case studies in biomedicine. Mathematical Biosciences and Engineering, 2018, 15(5): 1137-1154. doi: 10.3934/mbe.2018051
[9]	Miniak-Górecka Alicja, Nowakowski Andrzej . Sufficient optimality conditions for a class of epidemic problems with control on the boundary. Mathematical Biosciences and Engineering, 2017, 14(1): 263-275. doi: 10.3934/mbe.2017017
[10]	Erin N. Bodine, Louis J. Gross, Suzanne Lenhart . Optimal control applied to a model for species augmentation. Mathematical Biosciences and Engineering, 2008, 5(4): 669-680. doi: 10.3934/mbe.2008.5.669

Abstract

1. Introduction

We consider a two-dimensional controlled diffusion process $(X_1(t), X_2(t))$ defined by the following system of stochastic differential equations:

$\begin{eqnarray} {{\rm{d}}} X_1(t) & = & f_1[X_1(t)] \quad {{\rm{d}}} t + b_1[X_1(t)] \quad u^2(t) \quad {{\rm{d}}} t + \left\{v_1[X_1(t)]\right\}^{1/2} \quad {{\rm{d}}} B_1(t), \end{eqnarray}$

(1.1)

$\begin{eqnarray} {{\rm{d}}} X_2(t) & = & f_2[X_2(t)] \quad {{\rm{d}}} t + b_2[X_2(t)] \quad u(t) \quad {{\rm{d}}} t + \left\{v_2[X_2(t)]\right\}^{1/2} \quad {{\rm{d}}} B_2(t), \end{eqnarray}$

(1.2)

where $f_i(\cdot)$ is a real function, $b_i(\cdot) \neq 0$ , $u(t)$ is the control variable, $v_i(\cdot) > 0$ and $\{B_i(t), t \ge 0\}$ is a standard Brownian motion, for $i = 1, 2$ . The two Brownian motions are assumed to be independent. The functions $f_i(\cdot)$ and $v_i(\cdot)$ are respectively the infinitesimal mean and variance of the uncontrolled process, for $i = 1, 2$ . The functions $b_1(\cdot)$ and $b_2(\cdot)$ are control coefficients or parameters.

Let

$\begin{equation} T(x_1, x_2) = \inf\{t > 0: (X_1(t), X_2(t)) \in D \mid (X_1(0) = x_1, X_2(0) = x_2) \notin D\}, \end{equation}$

(1.3)

where $D$ is a subset of $\mathbb{R}^2$ . The random variable $T$ is called a first-passage time in probability theory. The aim is to minimize the expected value of the cost function

$\begin{equation} J(x_1, x_2) = \int_0^{T(x_1, x_2)} \left\{q[X_1(t), X_2(t)] + \lambda\right\} \quad {{\rm{d}}} t + K[X_1(T), X_2(T)], \end{equation}$

(1.4)

where $q(\cdot, \cdot) \ge 0$ , $\lambda$ is a real constant and $K(\cdot, \cdot)$ is a general termination cost function. This type of stochastic optimal control problem is known as a homing problem; see Whittle [, p. 289] or Whittle [, p. 222]. Notice however that there are no control costs. Therefore, the above problem is actually an extension of the classic homing problem. Moreover, we see in Eqs (1.1) and (1.2) that the control variable does not have the same effect on each component of the two-dimensional diffusion process $(X_1(t), X_2(t))$ . Notice also that the problem is time-invariant, because the functions $f_i(\cdot), b_i(\cdot)$ and $v_i(\cdot)$ , for $i = 1, 2$ , as well as $q(\cdot, \cdot)$ and $K(\cdot, \cdot)$ do not depend explicitly on $t$ .

Recent papers on homing problems include the following ones: Kounta and Dawson ^[3], Makasu ^[4] and Lefebvre ^[5]. The original homing problem has been extended in various ways: Lefebvre and Kounta ^[6] replaced the diffusion processes by discrete-time Markov chains, Lefebvre and Moutassim ^[7] considered the problem for jump-diffusion processes, and Lefebvre ^[8] treated the case of controlled autoregressive processes.

There are some papers on optimization problems for which the final time is random. However, this final time is not a first-passage time, as in homing problems. Such problems were considered, in particular, in Yan and Koo ^[9], Rodosthenous and Zhang ^[10], Yun and Choi ^[11], Khatab {et al.} ^[12] and Yu ^[13].

Homing problems are sometimes expressed as dynamical games; see Lefebvre ^[14]. It is possible to find papers on differential games with a random time horizon; see, for instance, Marín-Solano and Shevkoplyas ^[15] and Zaremba {et al.} ^[16]. However, in these papers, the final time is again not a first-passage time.

Next, we define the value function by

$\begin{equation} F(x_1, x_2) = \inf\limits_{\substack{u(t) \\ 0 \le t \le T(x_1, x_2)}} E[J(x_1, x_2)]. \end{equation}$

(1.5)

That is, $F(x_1, x_2)$ is the expected cost obtained by using the optimal control in the interval $[0, T]$ . In Section 2, we will make use of dynamic programming to find the differential equation it satisfies. This differential equation is a non-linear second-order partial differential equation (PDE). We will see that the optimal control $u^*$ can be expressed in terms of the value function as follows:

$\begin{equation} u^* = -\frac{b_2(x_2)}{2 \quad b_1(x_1)} \frac{F_{x_2}(x_1, x_2)}{F_{x_1}(x_1, x_2)}. \end{equation}$

(1.6)

In Section 3, we will find explicit solutions to the non-linear PDE satisfied by the value function, subject to the appropriate boundary conditions, in important particular cases. The method of similarity solutions will be used. Finally, some final remarks will be made in Section 4.

2. Dynamic programming

Bellman's principle of optimality states that "an optimal policy has the property that, whatever the initial state and the initial decision, it must constitute an optimal policy with regards to the state resulting from the first decision". Hence, any remaining part of an optimal policy is also optimal. Therefore, we can write that

$\begin{eqnarray} F(x_1, x_2) & = & \inf\limits_{\substack{u(t) \\ 0 \le t \le \Delta t}} \quad E\bigg[\int_0^{\Delta t} \left\{q[X_1(t), X_2(t)] + \lambda \right\} \quad {{\rm{d}}} t \\ && \qquad + \; F\big(x_1 + [f_1(x_1) + b_1(x_1) \quad u^2(0)] \quad \Delta t + v_1^{1/2}(x_1) \quad B_1(\Delta t), \\ && \qquad \qquad x_2 + [f_2(x_2) + b_2(x_2) \quad u(0)] \quad \Delta t + v_2^{1/2}(x_2) \quad B_2(\Delta t)\big) \\ && \qquad + \; o(\Delta t)\bigg]. \end{eqnarray}$

(2.1)

We have

$\begin{equation} \int_0^{\Delta t} \left\{q[X_1(t), X_2(t)] + \lambda \right\} \quad {{\rm{d}}} t \simeq [q(x_1, x_2)+ \lambda] \quad \Delta t. \end{equation}$

(2.2)

Moreover, a standard Brownian motion $\{B(t), t \ge 0\}$ is such that

$\begin{equation} E[B(\Delta t)] = 0 \quad \text{and} \quad E\left[B^2(\Delta t)\right] = {\rm Var}[B(\Delta t)] = \Delta t. \end{equation}$

(2.3)

It follows, assuming that $F(x_1, x_2)$ is twice differentiable with respect to $x_1$ and $x_2$ and making use of Taylor's formula, that

$\begin{eqnarray} F(x_1, x_2) & = & \inf\limits_{\substack{u(t) \\ 0 \le t \le \Delta t}} \quad \bigg\{[q(x_1, x_2) + \lambda] \quad \Delta t + F(x_1, x_2) \\ && \qquad \quad + \, [f_1(x_1) + b_1(x_1) \quad u^2(0)] \quad \Delta t \quad F_{x_1}+ \frac{1}{2} \quad v_1(x_1) \quad \Delta t \quad F_{x_1, x_1} \\ && \qquad \quad + \, [f_2(x_2) + b_2(x_2) \quad u(0)] \quad \Delta t \quad F_{x_2} + \frac{1}{2} \quad v_2(x_2) \quad \Delta t \quad F_{x_2, x_2} \\ && \qquad \quad + \, o(\Delta t)\bigg\}. \end{eqnarray}$

(2.4)

Finally, dividing each side of the previous equation by $\Delta t$ and letting $\Delta t$ decrease to zero, we obtain the following dynamic programming equation:

$\begin{eqnarray} 0 & = & \inf\limits_{u(0)} \quad \bigg\{q(x_1, x_2) + \lambda \\ && \qquad + \, [f_1(x_1) + b_1(x_1) \quad u^2(0)] \quad F_{x_1}+ \frac{1}{2} \quad v_1(x_1) \quad F_{x_1, x_1} \\ && \qquad + \, [f_2(x_2) + b_2(x_2) \quad u(0)] \quad F_{x_2} + \frac{1}{2} \quad v_2(x_2) \quad F_{x_2, x_2}\bigg\}. \end{eqnarray}$

(2.5)

Differentiating Eq (2.5) with respect to $u(0)$ , we find, as mentioned in the Introduction section, that the optimal control is

$\begin{equation} u^*(0) = -\frac{b_2(x_2)}{2 \quad b_1(x_1)} \frac{F_{x_2}(x_1, x_2)}{F_{x_1}(x_1, x_2)}. \end{equation}$

(2.6)

Then, substituting the above expression into Eq (2.5), we can state the following proposition.

Proposition 2.1. The value function $F(x_1, x_2)$ satisfies the second-order, non-linear PDE

$\begin{equation} 0 = q(x_1, x_2) + \lambda - \frac{b_2^2(x_2)}{4 \quad b_1(x_1)} \quad \frac{F_{x_2}^2}{F_{x_1}} + \sum\limits_{i = 1}^2 \left\{f_i(x_i) \quad F_{x_i} + \frac{1}{2} \quad v_i(x_i) \quad F_{x_i x_i}\right\}, \end{equation}$

(2.7)

subject to the boundary condition

$\begin{equation} F(x_1, x_2) = K(x_1, x_2) \quad {if (x_1, x_2) \in D .} \end{equation}$

(2.8)

In the next section, explicit solutions to (2.7), (2.8) will be obtained in important particular cases. The method of similarity solutions will be used.

3. Explicit solutions

{Case I}. The first particular case that we consider is the one for which $f_i(\cdot) \equiv 0$ , $b_i(\cdot) \equiv 1$ , $v_i(\cdot) \equiv 1$ , for $i = 1, 2$ , $q(\cdot, \cdot) \equiv 0$ , $\lambda = 1$ , $K(\cdot, \cdot) \equiv 0$ and we choose the first-passage time

$\begin{equation} T_1(x_1, x_2) = \inf\{t > 0: X_1(t) - X_2(t) = k_1 \; \text{or} \; k_2 \mid k_1 < x_1-x_2 < k_2\}, \end{equation}$

(3.1)

where $x_i = X_i(0)$ for $i = 1, 2$ . The diffusion process $(X_1(t), X_2(t))$ is then defined by the stochastic differential equations

$\begin{eqnarray} {{\rm{d}}} X_1(t) & = & u^2(t) \quad {{\rm{d}}} t + {{\rm{d}}} B_1(t), \end{eqnarray}$

(3.2)

$\begin{eqnarray} {{\rm{d}}} X_2(t) & = & u(t) \quad {{\rm{d}}} t + {{\rm{d}}} B_2(t). \end{eqnarray}$

(3.3)

Thus, $(X_1(t), X_2(t))$ is a controlled two-dimensional standard Brownian motion. This case is arguably the simplest non-degenerate two-dimensional problem that can be examined. Equation (2.7) reduces to

$\begin{equation} 0 = 1 - \frac{1}{4} \quad \frac{F_{x_2}^2}{F_{x_1}} + \frac{1}{2} \quad F_{x_1 x_1} + \frac{1}{2} \quad F_{x_2 x_2}, \end{equation}$

(3.4)

subject to the boundary conditions

$\begin{equation} F(x_1, x_2) = 0 \quad \text{if}~~ x_1-x_2 = k_1 \; \text{or} ~ \; k_2 . \end{equation}$

(3.5)

To solve (3.4), (3.5), we will make use of the method of similarity solutions. We look for a solution of the form

$\begin{equation} F(x_1, x_2) = H(w), \end{equation}$

(3.6)

where $w : = x_1-x_2$ is the similarity variable. For the method to work, we must be able to express both the Eq (3.4) and the boundary conditions (3.5) in terms of $w$ . We find that Eq (3.4) is transformed into the second-order linear ordinary differential equation

$\begin{equation} 0 = 1 - \frac{1}{4} \quad H'(w) + H''(w), \end{equation}$

(3.7)

while the boundary conditions become

$\begin{equation} H(k_1) = H(k_2) = 0. \end{equation}$

(3.8)

The general solution of Eq (3.7) can be expressed as follows:

$\begin{equation} H(w) = c_1 \quad {{\rm{e}}}^{w/4} + 4 \quad w + c_2. \end{equation}$

(3.9)

The particular solution that satisfies the boundary conditions (3.8) is

$\begin{equation} H(w) = 4 \quad w + 4 \quad \frac{k_1 \quad {{\rm{e}}}^{k_2/4} - k_2 \quad {{\rm{e}}}^{k_1/4} - (k_1-k_2) \quad {{\rm{e}}}^{w/4}}{ {{\rm{e}}}^{k_1/4}- {{\rm{e}}}^{k_2/4}} \end{equation}$

(3.10)

for $k_1 \le w \le k_2$ . Let us choose $k_1 = 0$ and $k_2 = 1$ . Then, the above solution reduces to

$\begin{equation} H(w) = 4 \quad w + 4 \quad \frac{ {{\rm{e}}}^{w/4}-1}{ {{\rm{e}}}^{1/4}-1} \quad \text{for }~~ 0 \le w \le 1 . \end{equation}$

(3.11)

It follows that the value function $F(x_1, x_2)$ is given by

$\begin{equation} F(x_1, x_2) = 4 \quad (x_1-x_2) + 4 \quad \frac{ {{\rm{e}}}^{(x_1-x_2)/4}-1}{ {{\rm{e}}}^{1/4}-1} \end{equation}$

(3.12)

for $(x_1, x_2) \in \mathbb{R}^2$ such that $0 \le x_1-x_2 \le 1$ .

Next, we deduce from Eq (2.6) and the fact that $F_{x_1} = H'(w) = -F_{x_2}$ that the optimal control in this particular problem is actually a constant:

$\begin{equation} u^*(0) \equiv \frac{1}{2}. \end{equation}$

(3.13)

Hence, the optimally controlled diffusion process satisfies

$\begin{eqnarray} {{\rm{d}}} X^*_1(t) & = & \frac{1}{4} \quad {{\rm{d}}} t + {{\rm{d}}} B_1(t), \end{eqnarray}$

(3.14)

$\begin{eqnarray} {{\rm{d}}} X^*_2(t) & = & \frac{1}{2} \quad {{\rm{d}}} t + {{\rm{d}}} B_2(t). \end{eqnarray}$

(3.15)

That is, $\{X_1^*(t), t\ge 0\}$ (respectively $\{X_2^*(t), t\ge 0\}$ ) is a Wiener process with drift parameter $1/4$ (resp. $1/2$ ) and variance parameter $1$ . Since the two processes are independent, we can state that the one-dimensional process $\{X^*(t), t\ge 0\}$ defined by

$\begin{equation} X^*(t) = X_1^*(t) - X_2^*(t) \quad \text{for }~~ t \ge 0 \end{equation}$

(3.16)

is a Wiener process with drift parameter $\mu = -1/4$ and variance parameter $\sigma^2 = 2$ .

Remarks. (i) With the choices $q(\cdot, \cdot) \equiv 0$ , $\lambda = 1$ and $K(\cdot, \cdot) \equiv 0$ that we made above, the cost function $J(x_1, x_2)$ defined in Eq (1.4) reduces to $T_1(x_1, x_2)$ . Therefore, the aim is to make the two-dimensional controlled process leave the continuation region as soon as possible. Even though there are no control costs, we saw that the optimal solution consists in choosing a (finite) constant control.

(ii) Let $T_1^*(x_1, x_2)$ be the first-passage time when we use the optimal control. We may write that $F(x_1, x_2) = E[T_1^*(x_1, x_2)]$ . The function $m(w) : = E[T_1^*(w = x_1-x_2)]$ satisfies the second-order linear ordinary differential equation

$\begin{equation} m''(w) -\frac{1}{4} \quad m'(w) = -1, \end{equation}$

(3.17)

subject to the boundary conditions $m(0) = m(1) = 0$ . We then deduce from Eqs (3.7) and (3.8) (with $k_1 = 0$ and $k_2 = 1$ ) that the functions $H(w)$ and $m(w)$ are the same.

Case II. Assume now that $f_i(\cdot) \equiv 0$ , $b_i[X_i(t)] = X_i(t)$ , $v_i[X_i(t)] = X_i^2(t)$ , for $i = 1, 2$ , $q(\cdot, \cdot) \equiv 0$ , $\lambda = 1$ and $K(\cdot, \cdot) \equiv 0$ . Moreover, we define

$\begin{equation} T_2(x_1, x_2) = \inf\left\{t > 0: \frac{X_1^2(t)}{X_2^2(t)} = k_1 \; \text{or} \; k_2 \; \bigg| \; k_1 < \frac{x_1^2}{x_2^2} < k_2\right\}, \end{equation}$

(3.18)

where $k_1 > 0$ . The controlled diffusion process $(X_1(t), X_2(t))$ is such that

$\begin{eqnarray} {{\rm{d}}} X_1(t) & = & X_1(t) \quad u^2(t) \quad {{\rm{d}}} t + X_1(t) \quad {{\rm{d}}} B_1(t), \end{eqnarray}$

(3.19)

$\begin{eqnarray} {{\rm{d}}} X_2(t) & = & X_2(t) \quad u(t) \quad {{\rm{d}}} t + X_2(t) \quad {{\rm{d}}} B_2(t). \end{eqnarray}$

(3.20)

This time, $(X_1(t), X_2(t))$ is a controlled two-dimensional geometric Brownian motion. A geometric Brownian motion $\{Y(t), t\ge 0\}$ can be expressed as the exponential of a Wiener process. Therefore, if we assume that $Y(0) > 0$ , then we can state that $Y(t) > 0$ for any $t \ge 0$ .

Equation (2.7) takes the form

$\begin{equation} 0 = 1 - \frac{x_2^2}{4 \quad x_1} \quad \frac{F_{x_2}^2}{F_{x_1}} + \frac{1}{2} \quad x_1^2 \quad F_{x_1 x_1} + \frac{1}{2} \quad x_2^2 \quad F_{x_2 x_2}, \end{equation}$

(3.21)

and is subject to the boundary conditions

$\begin{equation} F(x_1, x_2) = 0 \quad \text{if }~~ x_1^2/x_2^2 = k_1 \; \text{or} \; k_2 . \end{equation}$

(3.22)

Based on the boundary conditions, we now look for a solution of the form $F(x_1, x_2) = H(w = x_1^2/x_2^2)$ . We have

$\begin{equation} F_{x_1} = H'(w) \quad (2 \quad x_1/x_2^2), \quad F_{x_2} = H'(w) \quad (-2 \quad x_1^2/x_2^3), \end{equation}$

(3.23)

$\begin{equation} F_{x_1 x_1} = H''(w) \quad (2 \quad x_1/x_2^2)^2 + H'(w) \quad (2/x_2^2) \end{equation}$

(3.24)

and

$\begin{equation} F_{x_2 x_2} = H''(w) \quad (-2 \quad x_1^2/x_2^3)^2 + H'(w) \quad (6 \quad x_1^2/x_2^4). \end{equation}$

(3.25)

Substituting these expressions into Eq (3.21), we find that it becomes

$\begin{equation} 0 = 1 + \frac{7}{2} \quad w \quad H'(w) + 4 \quad w^2 \quad H''(w). \end{equation}$

(3.26)

The boundary conditions are simply $H(k_1) = H(k_2) = 0$ , as in Case I.

The general solution of Eq (3.26) is

$\begin{equation} H(w) = c_1 \quad w^{1/8} + 2 \quad \ln(w) + c_2. \end{equation}$

(3.27)

With $k_1 = 1$ and $k_2 = 2$ , we find that

$\begin{equation} H(w) = \frac{2 \quad \ln(2)}{2^{1/8}-1} \quad (1-w^{1/8}) + 2 \quad \ln(w) \quad \text{for }~~ 1 \le w \le 2 . \end{equation}$

(3.28)

Finally, from the expressions in Eq (3.23), we calculate

$\begin{equation} u^*(0) = -\frac{x_2}{2 \quad x_1} \frac{(-2 \quad x_1^2/x_2^3)}{(2 \quad x_1/x_2^2)} \equiv \frac{1}{2}. \end{equation}$

(3.29)

Thus, the optimal control is again a constant. It follows that

$\begin{eqnarray} {{\rm{d}}} X^*_1(t) & = & \frac{1}{4} \quad X^*_1(t) \quad {{\rm{d}}} t + X^*_1(t) \quad {{\rm{d}}} B_1(t), \end{eqnarray}$

(3.30)

$\begin{eqnarray} {{\rm{d}}} X^*_2(t) & = & \frac{1}{2} \quad X^*_2(t) \quad {{\rm{d}}} t + X^*_2(t) \quad {{\rm{d}}} B_2(t). \end{eqnarray}$

(3.31)

The optimally controlled process $\{X_i^*(t), t\ge 0\}$ is also a geometric Brownian motion, for $i = 1, 2$ . We can write that $X_1^*(t) = {{\rm{e}}}^{Z_1(t)}$ , where $\{Z_1(t), t\ge 0\}$ is a Wiener process with drift parameter $-1/4$ and variance parameter $1$ . Similarly, $X_2^*(t) = {{\rm{e}}}^{Z_2(t)}$ , where $\{Z_2(t), t\ge 0\}$ is a Wiener process with drift parameter $0$ and variance parameter $1$ . Hence, by independence,

$\begin{equation} W(t) : = \frac{[X_1^*(t)]^2}{[X_2^*(t)]^2} = {{\rm{e}}}^{Z(t)}, \end{equation}$

(3.32)

where $\{Z(t), t\ge 0\}$ is a Wiener process with drift parameter $-1/2$ and variance parameter $8$ . The infinitesimal parameters of $\{W(t), t \ge 0\}$ are given by $7 \quad w/2$ and $8 \quad w^2$ . Therefore, we may write that the function $m(w) : = E[T_2^*(w = x_1^2/x_2^2)]$ satisfies the second-order linear ordinary differential equation

$\begin{equation} 4 \quad m''(w) + \frac{7}{2} \quad m'(w) = -1, \end{equation}$

(3.33)

subject to $m(1) = m(2) = 0$ , from which we may conclude that the functions $m(w)$ and $H(w)$ coincide, as required.

Case III. To conclude this section, we will present a case when the optimal control is not a constant. Assume, in Case II, that $b_1[X_1(t)] = X_1^2(t)$ , $b_2[X_2(t)] = X_2^{3/2}(t)$ , $\lambda = 0$ and $K(X_1(T_2), X_2(T_2)) = X_1^2(T_2)/X_2^2(T_2)$ . Hence, there is only a termination cost. The aim is now to make the controlled process $(X_1(t), X_2(t))$ leave the continuation region through a given part of its boundary. Indeed, the optimizer must try to make $X_1^2(t)/X_2^2(t)$ take on the value $k_1$ before $k_2$ ( $> k_1)$ .

We find that Eq (3.26) becomes

$\begin{equation} 0 = \left(-\frac{1}{2} \quad w^{1/2} + 4 \quad w\right) \quad H'(w) + 4 \quad w^2 \quad H''(w), \end{equation}$

(3.34)

subject to $H(k_i) = k_i$ , for $i = 1, 2$ . The general solution of the above equation is

$\begin{equation} H(w) = c_1 + c_2 \quad {\rm Ei}_1\left(\frac{1}{4 \quad \sqrt{w}}\right), \end{equation}$

(3.35)

where ${\rm Ei}_1$ is an exponential integral function defined by

$\begin{equation} {\rm Ei}_1(z) = \int_1^{\infty} {{\rm{e}}}^{-v z} \quad v^{-1} \quad {{\rm{d}}} v. \end{equation}$

(3.36)

The particular solution that satisfies the boundary conditions $H(1) = 1$ and $H(2) = 2$ is

$\begin{equation} H(w) = \frac{-{\rm Ei}_1\left(\frac{1}{4 \quad \sqrt{w}}\right) + 2 \quad {\rm Ei}_1\left(\frac{1}{4}\right) - {\rm Ei}_1\left(\frac{\sqrt{2}}{8}\right) }{{\rm Ei}_1\left(\frac{1}{4}\right)-{\rm Ei}_1\left(\frac{\sqrt{2}}{8}\right)} \quad \text{for}~~ 1 \le w \le 2 . \end{equation}$

(3.37)

We can now calculate the optimal control. We find that

$\begin{equation} u^*(0) = \frac{\sqrt{x_2}}{2 \quad x_1}. \end{equation}$

(3.38)

We notice that not only the optimal control is not a constant, it is not a function of $w: = x_1^2/x_2^2$ either. The optimally controlled process $(X_1^*(t), X_2^*(t))$ satisfies the following stochastic differential equations:

$\begin{eqnarray} {{\rm{d}}} X^*_1(t) & = & \frac{1}{4} \quad X^*_2(t) \quad {{\rm{d}}} t + X^*_1(t) \quad {{\rm{d}}} B_1(t), \end{eqnarray}$

(3.39)

$\begin{eqnarray} {{\rm{d}}} X^*_2(t) & = & \frac{\left[X^*_2(t)\right]^2}{2 \quad X^*_1(t)} \quad {{\rm{d}}} t + X^*_2(t) \quad {{\rm{d}}} B_2(t). \end{eqnarray}$

(3.40)

Remark. Another case for which the optimal control is not a constant is the one when we replace $b_1[X_1(t)]$ by $1$ and $b_2[X_2(t)]$ by $\sqrt{X_2(t)}$ in Case III. This time, the value function is

$\begin{equation} F(x_1, x_2) = \frac{{\rm Ei}_1\left(-\frac{x_1}{4 \quad x_2}\right) + {\rm Ei}_1\left(-\frac{\sqrt{2}}{4}\right) - 2 \quad {\rm Ei}_1\left(-\frac{1}{4}\right)}{{\rm Ei}_1\left(-\frac{\sqrt{2}}{4}\right)-{\rm Ei}_1\left(-\frac{1}{4}\right)} \end{equation}$

(3.41)

for $x_1 > 0$ and $x_2 > 0$ such that $1 \le x^2_1/x_2^2 \le 2$ . Finally, the optimal control is given by

$\begin{equation} u^*(0) = \frac{x_1}{2 \quad \sqrt{x_2}}. \end{equation}$

(3.42)

4. Conclusions

In this paper, a stochastic optimal control problem for a two-dimensional diffusion process $(X_1(t), X_2(t))$ has been considered. This problem is an extension of the so-called homing problems, in which the final time, rather than being either a fixed constant or infinity, is a random variable. The optimizer stops controlling the processes the first time a certain event occurs. Here, the cost function was modified: there were no control costs. However, the control variable $u(t)$ was assumed to influence each part of the controlled process differently; namely, the state dynamics are quadratic in $u(t)$ for $X_1(t)$ , while they are linear in the case of $X_2(t)$ .

In Section 2, we gave the PDE satisfied by the value function in the general case. Then, in Section 3, we presented various particular cases for which we were able to obtain explicit and exact solutions to the problems considered. The method of similarity solutions was used to solve the appropriate equations. Although there are no control costs, the optimal control was never either identical to zero or infinite.

When the method of similarity solutions fails, we could of course at least try to obtain numerical solutions to any particular problem. However, the aim of this paper was to present exact analytical solutions to important problems.

Acknowledgments

This research was supported by the Natural Sciences and Engineering Research Council of Canada (NSERC). The author also wishes to thank the anonymous reviewers of this paper for their constructive comments.

Conflict of interest

The author reports that there are no competing interests to declare.

References

[1]	P. Whittle, Optimization Over Time, Vol. I, Wiley, Chichester, 1982.
[2]	P. Whittle, Risk-Sensitive Optimal Control, Wiley, Chichester, 1990.
[3]	M. Kounta, N. J. Dawson, Linear quadratic Gaussian homing for Markov processes with regime switching and applications to controlled population growth/decay, Methodol. Comput. Appl. Probab., 23 (2021), 1155–1172. https://doi.org/10.1007/s11009-020-09800-2 doi: 10.1007/s11009-020-09800-2
[4]	C. Makasu, Homing problems with control in the diffusion coefficient, IEEE Trans. Autom. Control, 67 (2022), 3770–3772. https://doi.org/10.1109/TAC.2022.3157077 doi: 10.1109/TAC.2022.3157077
[5]	M. Lefebvre, Minimizing or maximizing the first-passage time to a time-dependent boundary, Optimization, 71 (2022), 387–401. https://doi.org/10.1080/02331934.2021.1914039 doi: 10.1080/02331934.2021.1914039
[6]	M. Lefebvre, M. Kounta, Discrete homing problems, Arch. Control Sci., 23 (2013), 5–18. https://doi.org/10.2478/v10170-011-0039-6 doi: 10.2478/v10170-011-0039-6
[7]	M. Lefebvre, A. Moutassim, Exact solutions to the homing problem for a Wiener process with jumps, Optimization, 70 (2021), 307–319. https://doi.org/10.1080/02331934.2019.1711084 doi: 10.1080/02331934.2019.1711084
[8]	M. Lefebvre, The homing problem for autoregressive processes, IMA J. Math. Control Inf., 39 (2022), 322–344. https://doi.org/10.1093/imamci/dnab047 doi: 10.1093/imamci/dnab047
[9]	Z. Yang, H. K. Koo, Optimal consumption and portfolio selection with early retirement option, Math. Oper. Res., 43 (2018), 1378–1404. https://doi.org/10.1287/moor.2017.0909 doi: 10.1287/moor.2017.0909
[10]	N. Rodosthenous, H. Zhang, Beating the omega clock: an optimal stopping problem with random time-horizon under spectrally negative Lévy models, Ann. Appl. Probab., 28 (2018), 2105–2140. https://doi.org/10.1214/17-AAP1322 doi: 10.1214/17-AAP1322
[11]	W. Y. Yun, C. H. Choi, Optimum replacement intervals with random time horizon, J. Qual. Maint. Eng., 6 (2000), 269–274. https://doi.org/10.1108/13552510010346798 doi: 10.1108/13552510010346798
[12]	A. Khatab, N. Rezg, D. Ait-Kadi, Optimum block replacement policy over a random time horizon, J. Intell. Manuf., 22 (2011), 885–889. https://doi.org/10.1007/s10845-009-0364-9 doi: 10.1007/s10845-009-0364-9
[13]	Z. Yu, Continuous-time mean-variance portfolio selection with random horizon, Appl. Math. Optim., 68 (2013). https://doi.org/10.1007/S00245-013-9209-1
[14]	M. Lefebvre, A stochastic model for computer virus propagation, J. Dyn. Games, 7 (2020), 163–174. https://doi.org/10.3934/jdg.2020010 doi: 10.3934/jdg.2020010
[15]	J. Marín-Solano, E. V. Shevkoplyas, Non-constant discounting and differential games with random time horizon, Automatica, 47 (2011), 2626–2638. https://doi.org/10.1016/j.automatica.2011.09.010 doi: 10.1016/j.automatica.2011.09.010
[16]	A. Zaremba, E. Gromova, A. Tur, A differential game with random time horizon and discontinuous distribution, Mathematics, 8 (2020). https://doi.org/10.3390/math8122185

Reader Comments

Your name:*

Email:*
© 2023 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Mathematical Biosciences and Engineering

3.9

Metrics

Article views(1736) PDF downloads(81) Cited by(0)

Preview PDF

Download XML

Export Citation

Article outline

Show full outline

Mathematical Biosciences and Engineering

An optimal control problem without control costs

Related Papers:

Abstract

1. Introduction

2. Dynamic programming

3. Explicit solutions

4. Conclusions

Acknowledgments

Conflict of interest

References

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Other Articles By Authors

Catalog

Mathematical Biosciences and Engineering

An optimal control problem without control costs

Related Papers:

Abstract

1. Introduction

2. Dynamic programming

3. Explicit solutions

4. Conclusions

Acknowledgments

Conflict of interest

References

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Other Articles By Authors

Related pages

Tools

Export File

Citation

Format

Content

Catalog