1.
Introduction
In the present work, we discuss discontinuous Galerkin (DG) approximations to a nonlinear optimal control problem (OCP) of ordinary differential equations (ODEs). More precisely, we consider the following optimal control problem:
subject to
Here u(t)∈Rm is the control, and x(t)∈Rd is the state of the system at time t∈[0,T]. Further, g:[0,T]×Rd×Rm→R and f:[0,T]×Rd×Rm→Rd are given, and the set of admissible controls Uad⊂U:=L∞(0,T;Rm) is given by
for some uℓ,uu∈Rm. Here the inequality is understood in the component-wise sense.
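For the reader's convenience, the pair (1.1) and (1.2) can be assembled from the data just introduced; the following display is our reconstruction from the surrounding definitions (the symbols g, f, x0, uℓ, uu are those above), not a verbatim copy of the original equations:

```latex
\min_{u \in \mathcal{U}_{ad}} \; J(u, x) := \int_0^T g\big(t, x(t), u(t)\big)\, dt,
\qquad \text{subject to} \qquad
\begin{cases}
x'(t) = f\big(t, x(t), u(t)\big), & t \in (0, T),\\
x(0) = x_0,
\end{cases}
```

with the admissible set

```latex
\mathcal{U}_{ad} := \{\, u \in \mathcal{U} \;:\; u_\ell \le u(t) \le u_u \ \text{for a.e. } t \in (0, T) \,\}.
```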
There have been many studies on the numerical computation of the above problem. The numerical schemes require a discretization of the ODEs; for example, the Euler discretization for OCPs of ODEs is well studied for sufficiently smooth optimal controls based on strong second-order optimality conditions [2,13,14]. For optimal control problems in which the control appears linearly, the optimal control may be discontinuous (for instance, a bang-bang control), and such conditions may not be satisfied. In that respect, many studies have developed new second-order optimality conditions for optimal control problems with the control appearing linearly [3,21,31,32]. Second-order Runge-Kutta approximations for the OCPs were studied in [15]. Recently, the works [16,17] developed a novel stability technique to obtain new error estimates for the Euler discretization of OCPs.
The pseudospectral method is also popular for the discretization due to its high-order accuracy for smooth solutions of the OCPs [20,33]. However, this high-order accuracy is often lost for bang-bang OCPs, where the solutions may not be smooth enough. To handle this issue, Henriques et al. [24] proposed a mesh refinement method based on a high-order DG method for the OCPs of ODEs. The DG method partitions the time interval into small subintervals, on which a weak formulation is employed. The test functions are usually piecewise polynomials, which may be discontinuous at the boundaries of the subintervals; see Section 2 for a more detailed discussion. We refer to [7,19,34] and the references therein for DG methods for ODEs. It is also worth mentioning papers on the analysis of discretizations of optimal control problems of PDEs, for example, elliptic problems [1,23,35] and parabolic problems [9,12,25,26,27,28,29]. In addition, the recent works [22,30] studied the discretization of optimal control for fractional diffusion problems.
In this paper, we provide a rigorous analysis of the DG discretization applied to the nonlinear OCPs (1.1) and (1.2) with arbitrary order r∈N∪{0} for general functions f and g with suitable smoothness. Motivated by a recent work of Neitzel and Vexler [29], we impose the non-degeneracy condition (2.4) on an optimal control ˉu of the OCPs (1.1) and (1.2). We obtain existence and convergence results for both the semi-discrete and the fully discrete case. The convergence rates depend on the regularity of the optimal solution ˉu and its adjoint state, together with the degree of the piecewise polynomials mentioned above; see Section 2 for details.
It is worth noting that the control is not required to enter the state Eq (1.2) linearly, and the control space Uad admits discontinuous controls. The control constraints are given by lower and upper bounds. Moreover, the cost functional is given in a general form, not restricted to quadratic ones. We mention that the DG discretization of zeroth order was used in [29] for an optimal control problem for a semilinear parabolic equation in which the control enters the system linearly.
For notational simplicity, we denote by I:=(0,T), X:=L2(I;Rd), and (v,w)I=(v,w)L2(I;Rd). We also use simplified notations:
for 1≤p≤∞. Throughout this paper, for any compact set K⊂Rm, we assume that f,g∈C([0,T];W3,∞(Rd×K)) satisfy
for some M>0.
We next introduce the control-to-state mapping G:U→X∩L∞(I;Rd), G(u)=x, with x solving (1.2). It induces the cost functional j:U→R+, u↦J(u,G(u)). This makes the optimal control problems (1.1) and (1.2) equivalent to
Definition 1.1. A control ˉu∈Uad is a local solution of (1.4) if there exists a constant ε>0 such that j(u)≥j(ˉu) holds for all u∈Uad with ‖ˉu−u‖L2(I)≤ε.
In the proofs of the existence and convergence results, the main task is to show that the strong convexity of j, induced by the second-order optimality condition (2.4), is preserved near the optimal control ˉu and for its DG-discretized version jh. This is achieved using the second-order analysis in Section 4. As a preliminary, we also justify that j and jh are twice differentiable by showing the differentiability of the control-to-state mapping G and its discretized version Gh in the appendix.
In Section 2, we explain the DG discretization of the ODEs and the OCP. Then we present the main results for the semi-discrete case and provide some preliminary results. In Section 3, the adjoint problems are studied. Section 4 is devoted to the second-order analysis of the cost functionals j and jh. In Section 5, we prove the existence of the local solution and obtain the convergence rate for the semi-discrete case. Section 6 is devoted to establishing the existence and convergence results for the fully discrete case. Finally, in Section 7, we perform several numerical experiments for linear and nonlinear OCPs. In Appendix A, we obtain the first and second order derivatives of the control-to-state mapping G. Appendix B is devoted to proving a Grönwall-type inequality for the discretization of the ODE (1.2) involving the control variable. It is used in Appendix C to establish the differentiability of the discrete control-to-state mapping Gh and to obtain its derivatives. In Appendix D, we prove Lemmas 3.3 and 3.5, which reformulate the first derivatives of the cost functionals in terms of the adjoint states. In Appendix E, we derive formulas for the second order derivatives of the cost functionals.
2.
DG formulation
In this section, we describe the approximation of the OCPs (1.1) and (1.2) with the DG method, and then we state the main results on the semi-discrete case. First, we illustrate the discretization of the ordinary differential equations
where x:[0,T]→Rd, F:(0,T)×Rd→Rd is uniformly Lipschitz continuous with respect to x, i.e.,
with a constant L>0. By the Cauchy-Lipschitz theorem, (2.1) admits a unique classical solution x.
Given an integer N∈N, we consider a partition of I into N intervals {In}Nn=1 given by In=(tn−1,tn) with nodes 0=:t0<t1<⋯<tN−1<tN:=T. Let hn be the length of In, i.e., hn=tn−tn−1, and set h:=max1≤n≤Nhn. For a piecewise continuous function φ:[0,T]→Rd, we also define
The jump across the node tn is denoted by [φ]n:=φ+n−φ−n for 1≤n≤N−1. For r∈N∪{0}, we define
where Pr(In) represents the set of all polynomials of t up to order r defined on In with coefficients in Rd. Then the DG approximate solution xh of (2.1) is given as
for all φ∈Xrh. Here (⋅,⋅) denotes the inner product in Rd, and
for integrable functions A,B:In→Rd.
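For r=0 the trial and test functions are piecewise constant, and the scheme reduces to a one-step method resembling the implicit Euler scheme. The following minimal sketch (our own illustration; the model problem x′=ax and all names are ours, not from the paper) implements this DG(0) step for a scalar linear ODE and exhibits the expected first-order convergence at the final node:

```python
import math

def dg0_linear(a, x0, T, N):
    """DG(0) approximation of x' = a*x, x(0) = x0, on N uniform intervals.

    With piecewise-constant trial/test functions, x_h' = 0 on I_n and the
    weak form with the upwind jump term reduces (using exact quadrature) to
        x_n - x_{n-1} = a * h * x_n,   i.e.   x_n = x_{n-1} / (1 - a*h).
    """
    h = T / N
    x = x0
    for _ in range(N):
        x = x / (1.0 - a * h)
    return x

# First-order convergence at t = T, consistent with the rate h^{r+1} for r = 0.
errors = [abs(dg0_linear(-1.0, 1.0, 1.0, N) - math.exp(-1.0)) for N in (10, 20, 40)]
```

Halving h roughly halves the error, as the list `errors` shows.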
We recall the error estimate for the DG approximation of (2.1) from [34, Corollary 3.15 and Theorem 2.6].
Theorem 2.1. Let x(t) be the solution of (2.1) such that x∈Wk,∞(I;Rd) for some k≥1. Suppose that hL<1. Then there exists a unique DG approximate solution xh∈Xrh to (2.2) of order r∈N∪{0}. Furthermore, we have
where C>0 is determined by L, T, and r.
Now, for given u∈U, we consider the approximate solution x∈Xrh of the control problem (1.2) satisfying
for all φ∈Xrh.
Throughout the paper, we will consider local solutions ˉu to (1.4) satisfying the following non-degeneracy condition.
Assumption 1. Let ˉu∈Uad be a local solution of (1.1). We assume that it satisfies
for some γ>0.
The differentiability of the cost functional j(u)=J(u,G(u)) with respect to u∈U follows from the differentiability of the solution mapping G(u) justified in Appendix A (see also the proofs of Lemmas 3.3 and E.1). Note that the above second-order optimality condition holds under suitable regularity assumptions on the functions f and g and on the solutions; see Remark E.2 for a detailed discussion. We refer to [4,5] for further discussion of the second-order condition, and to [8,10,11] for optimal control problems of PDEs.
In addition, we assume that ˉu∈Uad has bounded total variation, namely V(ˉu)≤R/2 for a fixed value R>0. Here the total variation V(f) of f∈L∞(0,T) is defined as
where P is any partition P={0=x0<x1<x2<⋯<xn<xn+1=T}.
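For a control known at finitely many sample points (e.g., a piecewise-constant bang-bang control sampled at its breakpoints), the supremum is attained at the sample partition, so V(f) reduces to a finite sum. A small illustration (our own helper, not from the paper):

```python
def total_variation(samples):
    """Total variation of a function known at finitely many points:
    the sum of |f(x_{i+1}) - f(x_i)| over the sample partition."""
    return sum(abs(b - a) for a, b in zip(samples, samples[1:]))

# A bang-bang control switching twice between the bounds -1 and 1.
tv = total_variation([1.0, 1.0, -1.0, -1.0, 1.0])
```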
Considering a discrete control-to-state mapping Gh:U→Xrh, u↦Gh(u), where Gh(u) is the solution of (2.3), we introduce the discrete cost functional jh:U→R+,u↦J(u,Gh(u)). Let us consider the following discretized version of (1.1):
where
We now define the local solution to (2.5) as follows.
Definition 2.2. A control ˉuh∈Uad∩VR is called a local solution of (2.5) if there exists a δ>0 such that jh(u)≥jh(ˉuh) holds for all u∈Uad∩VR with ‖u−ˉuh‖L2(I)≤δ.
In the first main result, we prove the existence of the local solution to the approximate problem (2.5).
Theorem 2.3. Let ˉu∈Uad∩VR/2 be a local solution of (1.1) satisfying Assumption 1. Then, there are constants ε>0 and h0>0 such that for h∈(0,h0) the approximate problem (2.5) has a local solution ˉuh∈Uad∩VR satisfying ‖ˉuh−ˉu‖L2(I)<ε.
The second main result is the following convergence estimate of the approximate solutions.
Theorem 2.4. Let ˉu∈Uad∩VR/2 be a local solution of (1.4) satisfying Assumption 1, let ˉuh be the approximate solution found in Theorem 2.3, and let λ(ˉu) be the adjoint state defined in Definition 3.1 below. Assume that the state ˉx=G(ˉu) belongs to Wk1,∞(I;Rd) and the adjoint state λ(ˉu) belongs to Wk2,∞(I;Rd) for some k1,k2≥1. Then we have
The required regularity of the solutions ˉx and λ(ˉu) can be obtained under suitable smoothness assumptions on f, g, and ˉu; see Remark 3.2 below. The above result establishes the error estimate concerning the discretization of the ODEs in the OCPs. We will give the proofs of Theorems 2.3 and 2.4 in Section 5. On the other hand, to implement a numerical computation for the OCP (1.4), one also needs to consider an approximation of the control space by a finite-dimensional space. In Section 6, we will see that the proof of Theorem 2.4 can be extended to an error analysis incorporating the discretization of the control space.
3.
Adjoint states
This section is devoted to the study of the adjoint states for the OCP (1.1) and its discretized version (2.5).
We introduce a bilinear form b(⋅,⋅) for x∈W1,∞(0,T) and φ∈X by
Then, for a fixed control u∈U and initial data x0∈Rd, a weak formulation of (1.2) can be written as
for all φ∈X with x(0)=x0.
Definition 3.1. For a control u∈U, we define the adjoint state λ=λ(u)∈W1,∞(0,T) as the solution to
with λ(T)=0. It satisfies the weak formulation
for all φ∈X with λ(T)=0.
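For orientation, the adjoint equation of Definition 3.1 has the standard strong form; the following display is our reconstruction, consistent with Remark 3.2 and the derivative formulas of Appendix D, with x=G(u):

```latex
-\lambda'(t) = \nabla_x f\big(t, x(t), u(t)\big)^{\top} \lambda(t)
             + \nabla_x g\big(t, x(t), u(t)\big),
\qquad \lambda(T) = 0 .
```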
Remark 3.2. It follows from the Eqs (1.2) and (3.3) that if
we have
For u,v∈U, the derivative of j at u in the direction v is defined by
It is well-known that the derivative of the cost functional can be calculated with the adjoint state, as described below.
Lemma 3.3. We have
for all v∈Uad, where x=G(u).
Proof. For completeness, we give the proof in Appendix D.
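The adjoint representation of j′ can be checked numerically on a toy problem. The sketch below (entirely our own example: a linear-quadratic problem with x′=−x+u, x(0)=1, and cost ∫½(x²+u²)dt, discretized by explicit Euler rather than DG) forms the exact discrete adjoint gradient of the discrete cost and compares a directional derivative against a finite difference:

```python
import random

def solve_state(u, x0, h):
    """Explicit Euler for x' = -x + u, x(0) = x0."""
    x = [x0]
    for ui in u:
        x.append(x[-1] + h * (-x[-1] + ui))
    return x

def cost(u, x, h):
    """j_h(u) = sum_i h/2 (x_i^2 + u_i^2), i = 0..N-1."""
    return sum(0.5 * h * (x[i] ** 2 + u[i] ** 2) for i in range(len(u)))

def gradient(u, x, h):
    # Exact discrete adjoint of the Euler scheme:
    #   lam_i = (1-h) lam_{i+1} + h x_i,  lam_N = 0;
    # the exact gradient of j_h is then grad_i = h (u_i + lam_{i+1}).
    N = len(u)
    lam = [0.0] * (N + 1)
    for i in range(N - 1, -1, -1):
        lam[i] = (1.0 - h) * lam[i + 1] + h * x[i]
    return [h * (u[i] + lam[i + 1]) for i in range(N)]

N, T, x0 = 200, 1.0, 1.0
h = T / N
random.seed(0)
u = [random.uniform(-1.0, 1.0) for _ in range(N)]
v = [random.uniform(-1.0, 1.0) for _ in range(N)]

x = solve_state(u, x0, h)
directional = sum(gi * vi for gi, vi in zip(gradient(u, x, h), v))

s = 1e-6
us = [ui + s * vi for ui, vi in zip(u, v)]
fd = (cost(us, solve_state(us, x0, h), h) - cost(u, x, h)) / s
```

Since the toy cost is quadratic in u, `fd` and `directional` agree up to an O(s) term.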
Next we describe the adjoint problem for the approximate problem (2.5). For x,φ∈Xrh, we define
For the approximate solution xh=Gh(u)∈Xrh, Eq (2.3) with control u∈U can be written as
Now we define the adjoint equation for the approximate problem (2.5).
Definition 3.4. The adjoint state λh=λh(u)∈Xrh is defined as the solution of the following discrete adjoint equation:
In Appendix D, we briefly explain how the adjoint Eq (3.8) can be derived from the Lagrangian related to (2.5). We also have an analogous result to Lemma 3.3.
Lemma 3.5. We have
where xh=Gh(u).
Proof. The proof is given in Appendix D.
In order to prove the main results in Section 2, we shall use the following lemma.
Lemma 3.6. Let u∈U. Suppose that x=G(u)∈Wk1,∞(I;Rd) and λ=λ(u)∈Wk2,∞(I;Rd) for some k1,k2≥1. Then we have
Proof. We recall from (3.4) and (3.8) that λ=λ(u) solves
and λh=λh(u) solves
Here x=G(u)∈X and xh=Gh(u)∈Xrh. The estimate of x−xh follows from Theorem 2.1:
As an auxiliary function, we consider ζh∈Xrh solving
which is the DG discretization of (3.11) in a backward way (see Lemma 3.7 below). Then, by Theorem 2.1, we have
By (3.13), we obtain
and
Combining these estimates with (3.12) and (3.14) gives
where R:I→Rd is given by
and it satisfies ‖R(t)‖=O(hmin{k1,r+1}). This, together with Lemma B.4, yields
Combining this estimate with (3.15),
which completes the proof.
Abusing notation slightly, let us define J as the interval I equipped with the reversed partition 0=s0<s1<⋯<sN−1<sN=T, where sj=T−tN−j. We also write Xrh,J for the DG space Xrh associated with this new partition. Then we have the following lemma.
Lemma 3.7. Assume that λ∈Xrh is a solution to
Then W:I→Rd defined by W(t)=λ(T−t) for t∈I=[0,T] satisfies
Proof. By an integration by parts,
which leads to
We now observe that W(t)=λ(T−t) satisfies W′(t)=−λ′(T−t) and [W]N−n=−[λ]n. We also set ψ(t)=ϕ(T−t). Then ψ∈Xrh,J and we have ϕ−n=ψ+N−n. Considering Jn:=(sn−1,sn), the reflection t↦T−t maps Jn onto IN+1−n for 1≤n≤N. Using these notations, we write (3.16) as
Rearranging this, we get
which is the desired equation B(W,ψ)=(F(t,W),ψ)I. The proof is finished.
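The time-reversal argument can be exercised numerically. The sketch below (our own scalar example, with an implicit one-step scheme standing in for the r=0 DG solve) solves the terminal-value problem −λ′=aλ+c, λ(T)=0 through the reversed forward problem for W(t)=λ(T−t) and compares with the exact solution:

```python
import math

def backward_via_reversal(a, c, T, N):
    """Solve -lam' = a*lam + c, lam(T) = 0 by the substitution
    W(t) = lam(T - t), which yields the forward problem
        W' = a*W + c,  W(0) = 0   (cf. Lemma 3.7),
    discretized by the implicit one-step rule W_n = (W_{n-1} + c*h)/(1 - a*h).
    Returns W(T), i.e. lam(0)."""
    h = T / N
    W = 0.0
    for _ in range(N):
        W = (W + c * h) / (1.0 - a * h)
    return W

a, c, T = -2.0, 1.0, 1.0
approx = backward_via_reversal(a, c, T, 4000)
exact = (c / a) * (math.exp(a * T) - 1.0)   # lam(0) for the exact solution
```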
4.
Second order analysis
In this section, we analyze the second-order conditions for the functionals j and jh, which are essential for the existence and convergence estimates in the next sections.
4.1. Second order condition for j
We defined the solution mapping G:U→X∩L∞(I;Rd) in the previous section. Here we present Lipschitz estimates for the solution mapping G, its derivative G′, and the solution to the adjoint Eq (3.4).
Lemma 4.1. There exists C>0 such that for all u,ˆu∈Uad and v∈U we have
and
Proof. Let us denote by x=G(u) and ˆx=G(ˆu). Then it follows from (3.2) that
By (1.3), there exists a constant C>0 such that
Using this estimate and applying the Grönwall inequality in (4.1), we get the inequality
This gives the first inequality. For the second one, if we set y=G′(u)v and ˆy=G′(ˆu)v, then it follows from Lemma A.1 that
This together with the first assertion above yields
For notational simplicity, we denote by λ=λ(u) and ˆλ=λ(ˆu). Then we get
with (λ−ˆλ)(T)=0. By applying the Grönwall inequality in a backward way, we obtain
where we used
due to (3.3) and ˆλ(T)=0. This completes the proof.
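For reference, the Grönwall inequality invoked above (and its backward variant, obtained through the substitution t↦T−t) is used in the classical integral form:

```latex
\varphi(t) \;\le\; \alpha + \beta \int_0^t \varphi(s)\, ds
\quad \text{for all } t \in [0, T],
\qquad \alpha, \beta \ge 0
\qquad \Longrightarrow \qquad
\varphi(t) \;\le\; \alpha\, e^{\beta t} .
```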
We now show that the second order condition of j holds near the optimal local solution ˉu∈Uad.
Lemma 4.2. Suppose that ˉu∈Uad satisfies Assumption 1. Then there exists ε>0 such that
holds for all v∈U and all u∈Uad with ‖u−ˉu‖L2(I)≤2ε. Here γ>0 is given in (2.4).
Proof. Let y(t)=G′(u)v and y(ˉu)(t)=G′(ˉu)v. By using Lemma E.1, we find
where we denoted by λ(t):=λ(u)(t), x(t):=G(u)(t), ˉλ(t):=λ(ˉu)(t), and ˉx(t):=G(ˉu)(t). On the other hand, it follows from Lemma 4.1 that
This together with the following estimate
yields
Combining this with (2.4) we have
By choosing ε=γ/(4C)>0 here, we obtain the desired result.
As a consequence of this lemma, we have the following result.
Theorem 4.3. Let ˉu∈Uad satisfy the first-order optimality condition and Assumption 1. Then, there exists a constant ε>0 such that
for any u∈Uad with ‖u−ˉu‖L2(I)≤2ε.
Proof. Choose ε>0 as in Lemma 4.2. By Taylor's theorem, we get
where ˉus=ˉu+s(u−ˉu) for some s∈[0,1]. On the other hand, the first optimality condition implies
Moreover, we also find
Using these observations and Lemma 4.2, we conclude
The proof is finished.
4.2. Second order condition for jh
In this part, we investigate the second-order condition for the discrete cost functional jh. As in the previous subsection, we first provide the Lipschitz estimates for Gh and the discrete adjoint state.
Lemma 4.4. Let u,ˆu∈Uad and v∈U be given. Then, there exists C>0, independent of h∈(0,1), such that
and
Proof. The first and the third assertions are proved in Lemma B.5. The second estimate is proved in Lemma C.2.
Lemma 4.5. For u∈Uad, let x=G(u) be given by the solution of the state Eq (1.2), and let y=G′(u)v for v∈U. Let xh=Gh(u) be the solution of the discrete state Eq (3.7), and let yh=G′h(u)v. Then we have
Proof. Define ˜y:[0,T]→Rd by the solution to
Recall from Lemma A.1 that y satisfies
Combining these two equations, we get
Using the Grönwall inequality here with (4.2) and (3.13), we find that
On the other hand, yh satisfies
which is the DG discretization of (4.4) in a backward way in view of Lemma 3.7. Thus, we may use Theorem 2.1 to obtain the following error estimate:
This, together with (4.5) gives us the estimate
The proof is finished.
Lemma 4.6. For ε>0 given in Lemma 4.2, there exists h0>0 such that for h∈(0,h0) we have the following inequality
for any u∈Uad satisfying ‖u−ˉu‖L2(I)≤ε.
Proof. We first claim that
for h>0 small enough, where C>0 is independent of h. Let x(t)=G(u)(t), λ(t)=λ(u)(t), xh(t)=Gh(u)(t), and λh(t)=λh(u)(t). Also we let y=G′(u)v and yh=Gh′(u)v. It follows from Lemmas E.1 and E.3 that
In order to show (4.6), by using a similar argument as in the proof of Lemma 4.2, it suffices to show that there exists C>0, independent of h, such that
and
The first and second inequalities in (4.7) hold due to Theorem 2.1 and Lemma 4.5. The third one in (4.7) is proved in (C.2). By Lemma 3.6, the second inequality in (4.8) holds. We also find
which asserts the first inequality in (4.8). Finally, we obtain
due to (4.7). All of the above estimates enable us to prove the claim (4.6). This together with Lemma 4.2 yields
for 0<h<h0:=γ/(4C). The proof is finished.
5.
Existence and convergence results for the semi-discrete case
We first prove the existence of the local solution to the approximate problem (2.5).
Proof of Theorem 2.3. Choose ε>0 as in Theorem 4.3. We consider the following set
and recall from Section 2 the space VR={u∈U:V(u)≤R}. We will find a minimizer ˉv of jh in the space Wε,R:=¯B2ε(ˉu)∩VR, and then show that ‖ˉv−ˉu‖L2(I)<ε. It will imply that ˉv is a local solution to (2.5).
Since jh is bounded from below on Wε,R, there exists a sequence {vk}k∈N⊂Wε,R such that
Moreover, since Wε,R is compactly embedded in Lp(I) for any p∈[1,∞), up to a subsequence, there exists a function ˉv∈Wε,R such that {vk} converges to ˉv in L2(I) and converges a.e. to ˉv. By definition, the function zk:=Gh(vk)∈Xrh satisfies
for all φ∈Xrh. Note that {zk}k∈N is a bounded set in the finite-dimensional space Xrh by Theorem 2.4 (see also Lemma B.4). Therefore, we can find a subsequence such that zk converges uniformly to a function ˉz∈Xrh. We claim that ˉz=Gh(ˉv). Indeed, since vk(t) converges a.e. to ˉv(t) for t∈I and f is Lipschitz continuous, we may pass to the limit k→∞ in (5.2) to deduce
for all φ∈Xrh. This yields that ˉz=Gh(ˉv), which enables us to derive
This together with (5.1) implies that ˉv∈Wε,R satisfies
It remains to show that the minimizer ˉv∈Wε,R is achieved in the interior of Bε(ˉu)={u∈Uad:‖u−ˉu‖L2(I)<ε}. To show this, we recall that
and
Since ‖G(u)‖W1,∞(I)≤C for all u∈Uad, we see from Theorem 2.1 that
where C>0 is independent of h. Combining this with the Lipschitz continuity of G yields that
Set h0:=γε2/(8C). Using this and the estimate
from Theorem 4.3, it follows that for h∈(0,h0) we have
Thus, the minimizer ˉv is achieved in Bε(ˉu), which gives jh(u)≥jh(ˉv) for all u∈VR with ‖u−ˉv‖L2≤ε. We now provide the details of the convergence estimate for the approximate solutions.
Proof of Theorem 2.4. Analogous to (4.3), the discrete first order necessary optimality condition for ˉuh∈Uad reads
Inserting here u=ˉu and summing it with (4.3), we get
Now, by applying the mean value theorem with a value t∈(0,1), we have
where we used Lemma 4.6 in the first inequality and (5.4) in the second inequality. For our aim, it only remains to estimate the right hand side. Let us express it using the adjoint states. From (3.5), we have
and it follows from (3.9) that
Here we recall that ˉxh∈Xrh denotes the solution to (2.3) with control ˉu and initial data x0. Combining (5.6) and (5.7), we find
Applying Hölder's inequality here and using (1.3), we deduce
Now we apply (3.10) and (3.13) to get
Combining this with (5.5), we finally obtain
This completes the proof.
6.
Existence and convergence results for the fully discrete case
This section is devoted to the existence and convergence results for the fully discrete case. We consider a finite dimensional space Uh which discretizes the control space Uad, for example, the space of step functions
or the high-order DG space Uh=Xrh∩Uad with r∈N.
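For the step-function choice of Uh, a natural projection Ph (of the kind required later in Theorem 6.2) is the interval-wise average, i.e., the L² projection onto piecewise constants, which is first-order accurate for smooth or Lipschitz controls. A sketch (the function names and the quadrature rule are ours):

```python
import math

def project_piecewise_constant(f, T, N, q=64):
    """L^2 projection of f onto piecewise constants on N uniform intervals:
    the average of f on each interval, approximated by a composite midpoint rule."""
    h = T / N
    averages = []
    for n in range(N):
        a = n * h
        averages.append(sum(f(a + (j + 0.5) * h / q) for j in range(q)) / q)
    return averages

def l2_error(f, averages, T, q=64):
    """Composite-midpoint approximation of ||f - P_h f||_{L^2(0,T)}."""
    N = len(averages)
    h = T / N
    err2 = 0.0
    for n in range(N):
        a = n * h
        for j in range(q):
            t = a + (j + 0.5) * h / q
            err2 += (f(t) - averages[n]) ** 2 * (h / q)
    return math.sqrt(err2)

def f(t):
    return math.sin(2.0 * math.pi * t)

# First-order decay of the projection error as the mesh is refined.
errs = [l2_error(f, project_piecewise_constant(f, 1.0, N), 1.0) for N in (8, 16, 32)]
```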
We say that ˉuh∈Uh is a local solution to
if there is a value ε>0 such that jh(u)≥jh(ˉuh) for all u∈Uh with ‖u−ˉuh‖L2≤ε.
The existence result of local solution is provided in the following theorem.
Theorem 6.1. Let ˉu∈Uad be a local solution of (1.4) satisfying Assumption 1, and choose ε>0 as in Theorem 4.3. Then there exists h0>0 such that for h∈(0,h0) problem (6.1) has a local solution ˉuh∈Uh such that ‖ˉu−ˉuh‖L2≤ε.
Proof. By compactness and continuity, jh has a minimizer ˉuh in
since Uh is finite dimensional. Next we aim to show that the minimizer ˉuh satisfies
To show this, we recall from (5.3) that there is a value h0>0 such that for h∈(0,h0) we have
Combining this with the minimality of ˉuh for jh in ¯B2ε(ˉu), we find that ‖ˉuh−ˉu‖L2(I)≤ε. It then yields that
Thus ˉuh is a local solution of (6.1).
We establish the convergence result in the following theorem.
Theorem 6.2. Assume the same hypotheses on ˉu∈Uad and λ(ˉu) as in Theorem 2.4. In addition, suppose that there exist a projection operator Ph:U→Uh and a value a>0 such that
Let ˉuh∈Uh be a local solution to (6.1) constructed in Theorem 6.1. Then the following estimate holds:
If we further assume that j′(ˉu)=0, then the above estimate can be improved to
Proof. In this case, by the first optimality conditions on ˉu and ˉuh, we have
The latter condition can be written as
where Rh:=jh′(ˉuh)(Phˉu−ˉu). Summing up the above two inequalities provides
i.e.,
By the assumption of the theorem,
On the other hand, by applying the mean value theorem and Lemma 4.6, we obtain
Combining this with (6.2) yields
Applying here the estimate (5.9) in the previous proof, we have
which together with (6.3) gives the desired estimate
When we further assume j′(ˉu)=0, it follows that
Using this and the estimates in (5.8), we find
Inserting this into (6.4) yields
It gives the desired estimate
The proof is done.
7.
Numerical experiments
In this section, we present several numerical experiments that validate our theoretical results. The forward-backward DG method [18] is employed to solve the OCP examples.
7.1. Linear problem
Let us consider the following simple one-dimensional OCP, used as an example in [36], which consists of maximizing the functional
subject to the state equation
and U=L2([0,1]). Using an idea similar to that in Section 3, based on the maximum principle, we can derive the adjoint equation for the above optimal control problem:
Furthermore, we also find that the optimal solutions ˉu=−λ and ˉx satisfy (7.2). Thus we have the solution
and
For fixed r∈N, we use Xrh as the approximation space for U. In Table 1, we report the discrete L2 errors between the optimal solutions and their approximations for the above optimal control problem. Here r+1 is the number of grid points in each time interval In, and we used equidistant points in our numerical computations. The numerical results confirm that the error is of order hr+1, as proved in Theorem 2.4.
7.2. Nonlinear problem
In this part, we consider the following nonlinear optimal control problem:
subject to the state equation
In this case, the corresponding adjoint equation and optimal control are given as follows.
and thus the optimal solution ˉx solves
In this case, since the actual solutions have no explicit form, we take the reference solutions ˉxh (resp., ˉuh) with h=(0.1)×2−9 in place of ˉx (resp., ˉu). In Table 2, we report the discrete L2 errors between the reference solutions and their approximations.
Next we consider a two dimensional problem given by
subject to the state equation
In this case, the corresponding adjoint equation and optimal control are given as follows.
This case also has no explicit solution, so we again take the reference solutions ˉxh (resp., ˉuh) with h=(0.1)×2−9 in place of ˉx (resp., ˉu). The discrete L2 errors between the reference solutions and their approximations are reported in Table 3.
8.
Conclusions
In this paper, we established the analysis of the DG discretization applied to the nonlinear OCP with an arbitrary degree r of piecewise polynomials for nonlinear functions f and g with suitable smoothness assumptions. Under the non-degeneracy condition on an optimal control of the OCP, we obtained the existence of a local solution to the approximate problem and sharp L2-error estimates for the approximate solutions. These results were extended to the fully discrete case, in which the control space is also discretized. Finally, we presented numerical experiments validating our theoretical results. Based on the results of this paper, it would be interesting to analyze the mesh refinement method for the discontinuous Galerkin discretization of optimal control problems. We would like to investigate this problem in the future.
Acknowledgments
The authors are grateful to the referees for valuable comments on the paper. The work of W. Choi is supported by NRF grants (No. 2016R1A5A1008055) and (No. 2021R1F1A1059671).
Conflict of interest
The authors declare no conflict of interest.
Appendix
A. Differentiability of the control-to-state mapping
In this appendix, we show that the control-to-state mapping G is twice differentiable and obtain its derivatives.
Lemma A.1. Let xs=G(u+sv) and y:[0,T]→Rd be the solution of
Then we have
Proof. Recall that xs and x satisfy
respectively. Using this, we find that r(t):=xs(t)−x(t)−sy(t) satisfies
where
and
Given |xs(t)−x(t)|≤Cs and (1.3), elementary calculus shows that |A1|≤Cs2 and |A2|≤Cs2. With these bounds, we may apply Grönwall's lemma to (C.3) to deduce |r(t)|≤Cs2 for t∈[0,T]. From this we find
which yields that
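Lemma A.1 can be verified numerically. In the sketch below (our own toy dynamics x′=−x³+u, a stand-in for f(t,x,u); explicit Euler on a shared grid), the linearized equation y′=−3x²y+v, y(0)=0 is solved alongside the states for u and u+sv, and the difference quotient (G(u+sv)−G(u))/s is compared with y:

```python
import math

def solve_state(u, x0, h):
    """Explicit Euler for x' = -x^3 + u (a stand-in for f(t, x, u))."""
    x = [x0]
    for ui in u:
        x.append(x[-1] + h * (-x[-1] ** 3 + ui))
    return x

def solve_sensitivity(x, v, h):
    """Euler for the linearized equation y' = -3 x^2 y + v, y(0) = 0."""
    y = [0.0]
    for i, vi in enumerate(v):
        y.append(y[-1] + h * (-3.0 * x[i] ** 2 * y[-1] + vi))
    return y

N, T, x0, s = 1000, 1.0, 1.0, 1e-5
h = T / N
u = [math.sin(3.0 * i * h) for i in range(N)]
v = [math.cos(2.0 * i * h) for i in range(N)]

x = solve_state(u, x0, h)
xs = solve_state([ui + s * vi for ui, vi in zip(u, v)], x0, h)
y = solve_sensitivity(x, v, h)
dev = max(abs((a - b) / s - yi) for a, b, yi in zip(xs, x, y))
```

The deviation `dev` is of order s, mirroring the |r(t)|≤Cs² bound in the proof.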
Next we show the twice differentiability of the mapping s↦G(u+sv) at s=0.
Lemma A.2. Let z:[0,T]→Rd be the solution of
Then we have
Proof. Let
Then we get
where
and
By Lemma 4.1, we have |ys(t)−y(t)|≤Cs. Given this estimate and that
from Lemma A.1, elementary calculus shows that |A1(t)|≤Cs2 and |A2(t)|≤Cs2. Inserting this estimate into (C.5) and applying Grönwall's lemma, we find
It proves that
This implies that
since
This completes the proof.
B. Grönwall-type inequality for the DG discretization of ODEs
In this section, we provide a Grönwall-type inequality for the DG discretization of ODEs with inputs. It will be used in Appendix C to establish the differentiability of the discrete control-to-state mapping Gh.
We begin by recalling the following lemma from [34, Lemma 2.4].
Lemma B.1. Let I=(a,b) and k=b−a>0. Then we have
for all ϕ(t)=(ϕ1(t),⋯,ϕd(t))∈Pr((a,b);Rd), r∈N0, where
The next result is from [34,Lemma 3.1].
Lemma B.2. For I=(a,b) and r∈N0, we have
for all ϕ(t)=(ϕ1(t),⋯,ϕd(t))∈Pr((a,b);Rd). Here C>0 is independent of r, a, b, and d.
We shall use the following Grönwall inequality.
Lemma B.3. Let {an}Nn=1 and {bn}Nn=1 be sequences of non-negative numbers satisfying b1≤b2≤⋯≤bN and b1=0. Assume that for a value h∈(0,1/2) we have
for n∈N. Then there exists a constant Q>0 independent of h∈(0,1/2) and N∈N such that
for any n∈N with n≤N/h.
Proof. The proof can be obtained by induction.
Now we obtain the Grönwall-type inequality.
Lemma B.4. Suppose that
for all φ∈Xrh. Then there exists a constant C>0 independent of h>0 such that
for all u1,u2∈Uad and h>0 small enough.
Proof. From the condition (B.1) we have
for all φ∈Xrh. To obtain the desired estimates, for each n∈{1,⋯,N} we shall take the following test functions φ∈Xrh supported on In given as
where 1In:I→{0,1} denotes the indicator function, that is, 1In(t)=1 for t∈In and 1In(t)=0 for t∈I∖In. First we take φ(t)=x(t)1In(t) for n=1,2,⋯,N. Then,
where for n=1 we abuse notation and write [x]0 for x+0. Notice that
where for n=1 the above is understood as ([x]0,x+0)=|x+0|2. Using this in (B.2), we find
By applying Cauchy-Schwarz inequality, we obtain
Secondly, we take φ(t)=(t−tn−1)x′(t)1In(t) to have
By using Hölder's inequality, we get
Notice that
Thus, choosing φ(t)=(t−tn−1)1In(t) gives
and subsequently, this yields
where hn=tn−tn−1. This together with Lemma A.1 asserts
for h>0 small enough. Combining (B.3) and (B.4), we find
where we applied Lemma B.1 in the second inequality. This, together with (B.5), gives
for h>0 small enough, where for n=1 one has |x−0|=0. This inequality trivially gives
for n=1,⋯,N. Now, by applying Lemma B.3 to find an estimate of |x−1n|2 and inserting it into (B.6), we achieve
Finally, by applying Lemma B.2 to the above, we obtain the desired estimate.
As a corollary, we have the following Lipschitz estimates.
Lemma B.5. For u,v∈Uad we have
and
Proof. Let us denote by x=Gh(u) and ˆx=Gh(v). Then it follows from (2.3) that
By (1.3), there exists a constant C>0 such that
By applying Lemma B.4, we get the inequality
This gives the first inequality. For the second one, we denote by λ=λh(u) and ˆλ=λh(v). Then, we see from the discrete adjoint Eq (3.8) that
By applying Lemma B.4 again in a backward way (see Lemma 3.7), we obtain
where we used
due to Lemma B.4. This completes the proof.
C. Differentiability of discrete control-to-state mapping
This section is devoted to prove that the discrete control-to-state mapping Gh is twice differentiable. We also obtain the first and second derivatives of Gh.
Theorem C.1. We denote xsh=Gh(u+sv) and let yh∈Xrh be the solution of the following discretized equation:
where xh=Gh(u). Then we have ddsxsh(t)|s=0=yh(t).
Proof. By Theorem 2.1 there exists a solution yh∈Xrh to
By Lemma B.4 we get
Recall that xsh and xh satisfy
Using this, we find that r(t):=xsh(t)−xh(t)−syh(t) satisfies
for all φ∈Xrh, where
and
Given |xsh(t)−xh(t)|≤Cs and (1.3), elementary calculus shows that |A1|≤Cs2 and |A2|≤Cs2. With these bounds, we may apply Lemma B.4 to deduce |r(t)|≤Cs2 for t∈[0,T]. From this we find that
which yields that
This completes the proof.
Lemma C.2. The following holds.
Proof. Let yh=Gh′(u1)v∈Xrh and zh=Gh′(u2)v∈Xrh. Then we obtain
and
for all φ∈Xrh. Combining these equalities, we have
for all φ∈Xrh. On the other hand, the following two inequalities hold:
and
Given these estimates, by applying Lemma B.4 to (C.4), we obtain
where we used Lemma B.5 in the second inequality.
Lemma C.3. Let zh∈Xrh be the solution of the following discretized equation:
for any φ∈Xrh, where yh∈Xrh is the solution of (C.1). Then we have
Proof. Let
It then follows that
where
and
We obtain from Lemma C.2 the estimate |ysh(t)−yh(t)|≤Cs. Given this estimate and that ddsxsh(t)|s=0=yh(t) from Theorem C.1, elementary calculus shows that |A1(t)|≤Cs2 and |A2(t)|≤Cs2. Inserting this estimate into (C.5) and using Lemma B.4, we find
This yields that
and so we have
since
The proof is done.
D. Derivations of the first order derivative of cost functionals
In this part, we give the proofs of Lemmas 3.3 and 3.5. Before presenting them, we explain how to derive the discrete adjoint Eq (3.8) from the Lagrangian associated with (2.5).
Let us first write the Lagrangian of the problems (1.1) and (3.7) as follows:
for λh∈Xrh, where the bilinear operator B(⋅,⋅) is given by (3.7). If we compute the functional derivative of the Lagrangian (D.1) with respect to the adjoint state λh, then δLh/δλh=0 leads to (3.7). We now derive the equation of the discrete adjoint state. Using integration by parts, we find
This enables us to rewrite the Lagrangian (D.1) as
and this further implies
for all ψh∈Xrh, where we applied integration by parts to (ψh,λh′)In to derive the second equality. The above equality corresponds to the adjoint Eq (3.8).
Proof of Lemma 3.3. In order to compute the functional derivative of j with respect to u, we consider j(u+sv)=J(u+sv,G(u+sv)) with v∈U and s∈R+. If we set xs(t):=G(u(t)+sv(t)) it follows from Lemma A.1 that y=ddsxs(t)|s=0 satisfies
with the initial condition y(0)=0. Recall from (3.4) that the adjoint state λ(t)=λ(u)(t) satisfies
Since xs(t) is differentiable with respect to s, the cost j(u+sv) is differentiable with respect to s and it is computed as
where we used
due to (D.3), (D.4), y(0)=0, and λ(T)=0.
Proof of Lemma 3.5. The proof is very similar to that of Lemma 3.3. We consider jh(u+sv)=J(u+sv,Gh(u+sv)) with v∈U and s∈R+. We recall from Theorem C.1 that the function xsh:=Gh(u+sv) is differentiable at s=0 with
where yh∈Xrh satisfies the following equation:
Using this, we obtain
We then take ψh=yh in (D.2) to get
On the other hand, by using the integration by parts, we find
where B(⋅,⋅) is the bilinear form appearing in (3.6). This yields
due to (D.5). This together with (D.6) concludes
where v∈U.
E. Derivations of the second order derivative of cost functionals
In this appendix, we provide details of the derivation of the second order derivative of cost functional j and its discrete version jh.
Lemma E.1. Let j be the cost functional for the optimal control problems (1.1) and (1.2). Then, for u∈Uad and v∈U, we have
Proof. As in Appendix D, we consider j(u+sv)=J(u+sv,G(u+sv)) with v∈U and s∈R+ and set xs(t):=G(u(t)+sv(t)). By Lemmas A.1 and A.2, it follows that
where y∈X is given as in (D.3) and z∈X is the solution to
with the initial condition z(0)=0. Then we obtain
On the other hand, we use (D.4) to get
where we used \lambda(T) = 0 and z(0) = 0 . By combining the above with (E.1), we have
This completes the proof.
Remark E.2. Solving the differential Eq (A.1) gives
and thus
where C > 0 depends only on \|f\|_{L^\infty(0, T; W^{1, \infty})} and T > 0 . This estimate for y enables to bound the first four integrals on the right hand side of (E.2) by
where C > 0 depends only on \|\lambda\|_{L^\infty(I)} , T , \|f\|_{L^\infty(0, T; W^{2, \infty})} , and \|g\|_{L^\infty(0, T; W^{2, \infty})} . This implies that if g is given by
then we have
which satisfies (2.4) if \gamma > C/2 . It would be interesting to develop a numerical method to check (2.4) in the general case.
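One simple numerical check of (2.4), in the spirit suggested above, is a centered second difference of the cost in random directions. For the toy linear-quadratic problem below (entirely our own example: x′=−x+u, x(0)=1, cost ∫½(x²+u²)dt, explicit Euler), j is exactly quadratic in u, so the second difference recovers j″(u)(v,v) and the coercivity j″(u)(v,v)≥‖v‖²L² can be confirmed directly:

```python
import random

def cost(u, x0, h):
    """j_h(u) for the toy problem: x' = -x + u (explicit Euler),
    with running cost (x^2 + u^2)/2."""
    x, val = x0, 0.0
    for ui in u:
        val += 0.5 * h * (x ** 2 + ui ** 2)
        x = x + h * (-x + ui)
    return val

N, T, x0 = 400, 1.0, 1.0
h = T / N
random.seed(1)
u = [random.uniform(-1.0, 1.0) for _ in range(N)]
v = [random.uniform(-1.0, 1.0) for _ in range(N)]

s = 1e-3
up = [ui + s * vi for ui, vi in zip(u, v)]
um = [ui - s * vi for ui, vi in zip(u, v)]
curvature = (cost(up, x0, h) - 2.0 * cost(u, x0, h) + cost(um, x0, h)) / s ** 2
norm_v_sq = sum(h * vi ** 2 for vi in v)
```

For nonquadratic j, the same second difference gives an O(s²)-accurate estimate of j″(u)(v,v).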
Next we perform a similar calculation for the approximate solution.
Lemma E.3. Let j_h be the discrete cost functional for the optimal control problems (1.1) and (1.2). Then, for u \in \mathcal{U}_{ad} and v \in \mathcal{U} , we have
Proof. As in the proof of Lemma 3.5, we consider j_h(u+sv) = J(u+sv, G_h(u + sv)) with v \in \mathcal{U} and s \in \mathbb{R}_+ and set x_{h}^s := G_h(u + sv) . We recall from Theorem C.1 and Lemma C.3 that
where z_h \in X^r_h satisfies
Now a straightforward computation gives
Note that the discrete adjoint state \lambda_h(t) = \lambda_h(u)(t) satisfies
for all \psi \in X^r_h . Thus by considering \psi = z_h \in X^r_h , we find
Combining the above equalities, we have
This completes the proof.