
We have studied a strongly nonlinear backward stochastic partial differential equation (B-SPDE) via an approximation method and machine learning (ML)-based Monte Carlo simulation. This well-known equation was originally derived from studies in finance, yet how to analyze and solve it has remained open for a long time. The main difficulty stems from the singularity of the B-SPDE, which is caused by its strong nonlinearity. Therefore, by introducing new truncation operators and integrating machine learning techniques into a convolutional neural network (CNN) platform, we have developed an effective approximation method with a Monte Carlo simulation algorithm to tackle this well-known open problem. In doing so, the existence and uniqueness of a 2-tuple adapted strong solution to an approximation B-SPDE were proved. Meanwhile, the convergence of a newly designed simulation algorithm was established. Simulation examples and an application in finance were also provided.
Citation: Wanyang Dai. Simulating a strongly nonlinear backward stochastic partial differential equation via efficient approximation and machine learning[J]. AIMS Mathematics, 2024, 9(7): 18688-18711. doi: 10.3934/math.2024909
In this paper, we study the strongly nonlinear backward stochastic partial differential equation (B-SPDE) in (1.1) via an approximation method and machine learning (ML)-based Monte Carlo simulation,
$$V(t,x)=H(T,x)-\frac{1}{2}\int_t^T\frac{\left(V_x(s,x)+\bar{V}_x(s,x)\right)^2}{V_{xx}(s,x)}\,ds-\int_t^T\bar{V}(s,x)\,dW(s), \tag{1.1}$$
which has a given terminal random field $H(T,x)$ at time $T$ for $(t,x)\in[0,T]\times\mathbb{R}$ with $\mathbb{R}=(-\infty,\infty)$, and which is driven by a Brownian motion $W(\cdot)$ over the time interval $[0,T]$.
The equation in (1.1) is well-known and was previously derived from studies in finance (see Musiela and Zariphopoulou [22], Øksendal et al. [24], etc.), where the primary motivation for studying the B-SPDE was to solve an optimal portfolio selection problem in an incomplete financial market modeled by a stochastic differential equation driven by a multi-dimensional Brownian motion (see, e.g., Musiela and Zariphopoulou [22] and Kramkov and Sirbu [18]). The problem describes the evolution of a related value function and seeks to maximize an expected utility of terminal wealth over admissible strategies. To find an optimal investment policy for this stochastic optimization problem, the well-known dynamic programming principle can be applied to determine such an optimal control policy backward in the time parameter. More precisely, in this financial problem, the random field $V(t,x)$ denotes the value function of an investor whose target is to maximize the expected utility of terminal wealth over the admissible strategy set $\mathcal{A}_T$ (see its definition in Subsection 2.2) with $t\in[0,T]$, i.e.,
$$V(t,x)=\sup_{\beta\in\mathcal{A}_T}E\left[\mu_T(X^{\beta}(T))\,\middle|\,\mathcal{F}_t,\,X(t)=x\right] \tag{1.2}$$
for a given trading horizon $[0,T]$ and the investor's utility $\mu_T(\cdot):\mathbb{R}_+\to\mathbb{R}$ at terminal time $T$ (see more explanations in Subsection 2.2). Note that, in this case, we take $H(T,\cdot)=\mu_T(\cdot)$ in the equation of (1.1). Furthermore, $X^{\beta}(t)$ denotes the present value of the investor's aggregate investment at time $t$ (see more details in Subsection 2.2). In existing studies (see, e.g., Cerny and Kallsen [2] and Dai [8]), the authors aim to find a so-called variance-optimal martingale measure $Q^*$ such that the value function in (1.2) has a simple expression given by the following conditional expectation,
$$V(t,x)=E^{Q^*}\left[H(T,x)\,\middle|\,\mathcal{F}_t,\,X(t)=x\right]. \tag{1.3}$$
In this sense, $V(t,x)$ can be decomposed into a macro-trend part and a micro-regulating (volatility) part as shown in (1.1), due to the martingale representation theorem (see, e.g., Øksendal [23]). Roughly speaking, the random field $\bar{V}(\cdot,\cdot)$ corresponds to the volatility rate and is the Malliavin derivative of $V(t,x)$ with respect to the Wiener measure corresponding to the Brownian motion $W$ (see more details in Dai [9]).
Due to the difficulty of solving the problem in (1.2) directly, and as an alternative to the method presented in (1.3), the authors in Musiela and Zariphopoulou [22] derived the B-SPDE to solve the optimal investment problem. In this sense, the B-SPDE may be considered the non-Markovian analogue of the traditional Hamilton-Jacobi-Bellman (HJB) equation in Markovian models or in its dual formulation. Once we obtain the solution to the B-SPDE, we can determine the optimal investment policy (see Subsection 2.2 for more details). However, how to analyze and solve Eq (1.1) in general has remained a problem for quite a long time. The main difficulty stems from the singularity of the B-SPDE, which is caused by its strong nonlinearity. To develop a method for solving this problem, we first review the work in Dai [9] and compare it with our current study.
More precisely, in Dai [9], we developed a generic convolutional neural network (CNN)-based numerical scheme to simulate the 2-tuple adapted strong solution to a unified system of B-SPDEs driven by Brownian motions, which can be applied to many B-SPDEs. Nevertheless, in proving the unique existence of the 2-tuple adapted strong solution to the unified system, we needed to impose the so-called general local Lipschitz and linear growth conditions. Furthermore, in Dai [9], the generic numerical scheme was developed via a CNN through conditional expectation projection, which is a completely discrete and iterative algorithm in terms of both time and space. However, in estimating the mean-square error and proving the convergence of the CNN-based numerical scheme, we also needed the general local Lipschitz and linear growth conditions. Moreover, the generic numerical scheme in Dai [9] does not integrate ML techniques into its computational algorithm.
Although the equation in (1.1) is a special case of our unified system of B-SPDEs in Dai [9], it is a strongly nonlinear one with singularity. Therefore, the B-SPDE in (1.1) does not satisfy the general local Lipschitz and linear growth conditions imposed in Dai [9], which implies that the method developed in Dai [9] cannot be directly applied to solve the equation in (1.1). Hence, by introducing new truncation operators and integrating the ML technique into the CNN platform of Dai [9], we develop an effective approximation method with a Monte Carlo simulation algorithm to tackle this well-known open problem. Concerning CNNs, readers are also referred to Brizuela and Merchan [12], Dai [10], LeCun et al. [20], Vaswani et al. [27], and Yamashita et al. [28] for more details. Note that the purpose of integrating the ML technique into our platform is to speed up the convergence of the new algorithm designed in the current study (see Algorithm 3.1 and Figure 1 for more details).
To go further, we explain the notations used in the equation of (1.1). The notations $V_x$, $V_{xx}$, and $\bar{V}_x$ are the corresponding first-order and second-order partial derivatives of $V$ and $\bar{V}$ with respect to the position parameter $x\in\mathbb{R}$. Since the second partial derivative $V_{xx}$ appears in the denominator of the second term on the right-hand side of (1.1), the equation in (1.1) is strongly nonlinear and exhibits singularity. In addition, since both $V(t,x)$ and $\bar{V}(t,x)$ are unknown random fields, this equation is a diophantine equation (see the related explanation in Dai [9]). Therefore, one major task in studying the equation in (1.1) is to obtain an adapted solution pair $(V,\bar{V})$ with respect to a filtration $\{\mathcal{F}_t,t\ge 0\}$ generated by the Brownian motion, i.e., $\mathcal{F}_t=\sigma(W(s),s\le t)$. Then, based on this paired solution, we can obtain the related optimal financial investment strategy (see the discussions in Musiela and Zariphopoulou [22], Øksendal et al. [24], etc.).
Concerning the relationship between $V$ and $\bar{V}$, we can interpret $\bar{V}$ as a regulating process of $V$ due to the martingale representation theorem (see, e.g., Øksendal [23]). Furthermore, under certain conditions, $\bar{V}$ can be expressed as a functional of the Malliavin derivative of $V$ (see Lemmas 4.6 and 4.7 in Dai [9] for more details). However, based on this Malliavin functional relationship, it is difficult to design a direct computational algorithm to calculate $(V,\bar{V})$ numerically due to its complexity. Therefore, in Dai [9], this explicit relationship through Malliavin calculus is only used to prove the convergence of a more directly designed computational algorithm for solving $(V,\bar{V})$ numerically for B-SPDEs under the so-called generalized linear growth and Lipschitz conditions. Nevertheless, in the current study, our B-SPDE in (1.1) is a strongly nonlinear one that does not satisfy the generalized linear growth and Lipschitz conditions. Thus, our main focus in this paper is on extending the study in Dai [9] to solve our current strongly nonlinear problem in terms of algorithm design, analysis, and implementation. The Malliavin functional relationship between $V$ and $\bar{V}$ will not be directly used in this study.
As introduced previously, to the best of our knowledge, the B-SPDE in (1.1) has not yet been well-solved. Hence, we here develop a numerical scheme with related theory to simulate this equation approximately. More precisely, we consider an approximated analog of the equation in (1.1) as follows,
$$V(t,x)=H(T,x)-\frac{1}{2}\int_t^T\frac{\left(\Phi_{\bar{K}}\left(V_x(s,x)+\bar{V}_x(s,x)\right)\right)^2}{\Psi_{\epsilon,K}\left(V_{xx}(s,x)\right)}\,ds-\int_t^T\bar{V}(s,x)\,dW(s) \tag{1.4}$$
for $x\in D=[0,b]$ with $b>0$, where $\Phi_{\bar{K}}(\cdot)$ is a truncation map corresponding to the first-order derivatives $V_x(t,x)$ and $\bar{V}_x(t,x)$ (if any) for a large number $\bar{K}>0$. In other words, for $f_x(t,x)=V_x(t,x)+\bar{V}_x(t,x)$, we have that
$$\Phi_{\bar{K}}(f_x(t,x))\equiv\begin{cases}\bar{K} & \text{if } f_x(t,x)>\bar{K},\\ f_x(t,x) & \text{if } |f_x(t,x)|\le\bar{K},\\ -\bar{K} & \text{if } f_x(t,x)<-\bar{K},\end{cases} \tag{1.5}$$
where $|\cdot|$ denotes the absolute value of a number. Furthermore, $\Psi_{\epsilon,K}(\cdot)$ is another truncation map corresponding to the second-order derivative $V_{xx}(t,x)$ (if any) for a small number $\epsilon>0$ and a large number $K>0$, i.e.,
$$\Psi_{\epsilon,K}(V_{xx}(t,x))\equiv\begin{cases}V_{xx}(t,x) & \text{if } \epsilon\le|V_{xx}(t,x)|\le K,\\ \epsilon & \text{if } 0\le V_{xx}(t,x)<\epsilon,\\ -\epsilon & \text{if } -\epsilon<V_{xx}(t,x)<0,\\ K & \text{if } V_{xx}(t,x)>K,\\ -K & \text{if } V_{xx}(t,x)<-K.\end{cases} \tag{1.6}$$
From (1.6), we can see that the absolute value of the truncation map $\Psi_{\epsilon,K}(\cdot)$ is always greater than or equal to the positive number $\epsilon$ (i.e., $|\Psi_{\epsilon,K}(\cdot)|\ge\epsilon$). Thus, the denominator appearing in (1.4) is always bounded away from zero, which implies that the equation has the potential to be well-behaved for the given constants $\epsilon>0$, $K>0$, and $\bar{K}>0$. From (1.5) and (1.6), we can see that the truncation maps $\Phi_{\bar{K}}(\cdot)$ and $\Psi_{\epsilon,K}(\cdot)$ are both bounded for the given constants $\epsilon>0$, $K>0$, and $\bar{K}>0$. Furthermore, based on the truncation maps in (1.5) and (1.6) and the mentioned properties, we can prove that the equation in (1.4) satisfies the generalized local linear growth and generalized local Lipschitz conditions as given in Dai [9]. Hence, under a suitable terminal condition, there uniquely exists an $\{\mathcal{F}_t\}$-adapted strong solution $(V^{\epsilon,K}_{\bar{K}}(t,x),\bar{V}^{\epsilon,K}_{\bar{K}}(t,x))$ to the equation in (1.4), which implies that the equation in (1.4) is not a singular one for the given constants $\epsilon>0$, $K>0$, and $\bar{K}>0$. Thus, according to the equation in (1.4), we can develop a Monte Carlo simulation algorithm by enhancing the one developed in Dai [9] through an additional machine learning (ML) loop (see Algorithm 3.1 in Section 3 for more details). Note that the design of using a sequence of structure-preserving B-SPDEs in (1.4) to approximate the one in (1.1) is actually motivated by the diffusion approximation for queueing networks, where $r$ is used to index a specific network. More exactly, the diffusion approximation approximates a target limit system through a sequence of physical systems as $r$ tends to infinity (see, e.g., Dai [4,5,7]). Furthermore, during the approximation, each system keeps its structure unchanged.
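To make the effect of the truncation maps concrete, here is a minimal Python sketch of (1.5) and (1.6); the function names `phi` and `psi` are our own illustrative labels, not notation from the paper.

```python
def phi(f_x, k_bar):
    """Truncation map Phi_Kbar in (1.5): clip the summed first-order
    derivative term to the interval [-k_bar, k_bar]."""
    return max(-k_bar, min(k_bar, f_x))

def psi(v_xx, eps, k):
    """Truncation map Psi_{eps,K} in (1.6): clip V_xx to [-k, k] and push
    any value inside (-eps, eps) away from zero, so |psi(...)| >= eps and
    the denominator of (1.4) stays bounded away from zero."""
    v = max(-k, min(k, v_xx))   # the K / -K branches of (1.6)
    if 0 <= v < eps:            # 0 <= V_xx < eps  ->  eps
        return eps
    if -eps < v < 0:            # -eps < V_xx < 0  ->  -eps
        return -eps
    return v                    # eps <= |V_xx| <= K branch
```

With these maps, the drift ratio $(\Phi_{\bar{K}}(\cdot))^2/\Psi_{\epsilon,K}(\cdot)$ in (1.4) is bounded in absolute value by $\bar{K}^2/\epsilon$ for any input, which is the kind of boundedness exploited in the proof of Theorem 2.1.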
In summary, the contributions of our paper are twofold: theoretical contributions and numerical contributions.
Concerning the theoretical contributions of our paper, we study the strongly nonlinear B-SPDE in (1.1) through an approximation method by introducing the new truncation operators in (1.5) and (1.6). The main reason for this approach is the singularity of the B-SPDE in (1.1). To guarantee the meaningfulness of the approximation, the existence and uniqueness of a 2-tuple adapted strong solution to the approximation B-SPDE in (1.4) are proved. Furthermore, it is also proved that, if the B-SPDE in (1.1) has a solution pair $(V,\bar{V})$, the solution pair to the approximation B-SPDE in (1.4) converges to $(V,\bar{V})$ in certain senses as $\epsilon$ tends to zero and $K,\bar{K}$ tend to infinity. In addition, based on the approximated B-SPDE in (1.4), some investment policies are analytically derived for a financial market with the aim of conducting some accuracy comparisons.
Concerning the numerical contributions of our paper, we develop a Monte Carlo simulation-based algorithm to numerically solve the approximation B-SPDE in (1.4). In doing so, we integrate an additional ML loop into the platform of a convolutional neural network (CNN) as studied in Dai [9]. Based on the equation in (1.4), we can prove the convergence of this newly designed algorithm in the case that the equation in (1.1) has an adapted strong solution. The basic idea in developing such an algorithm is to find a CNN, denoted by $U$, with an additional ML loop at each time point to approximate $(V^{\epsilon,K}_{\bar{K}},\bar{V}^{\epsilon,K}_{\bar{K}})$ (see Figure 1 with explanations in Subsection 3.1 for more details). In this CNN, we consider $(V^{\epsilon,K}_{\bar{K}},\bar{V}^{\epsilon,K}_{\bar{K}})$ as paired parameters to be trained at each node $(t,x)$ (also called neuron $(t,x)$). The learning strategy for this CNN is to minimize the mean-squared error (loss) between $(V^{\epsilon,K}_{\bar{K}},\bar{V}^{\epsilon,K}_{\bar{K}})$ and $U$. Note that, in Dai [9], the CNN corresponding to $U$ is expressed as a conditional expectation projection for a given data set. To compute the conditional expectation, the well-known tower law (see, e.g., Kallenberg [16]) is used to design the CNN $U$ (see Subsection 3.1 for more details), which is a backward one. Similar to a multi-layer perceptron (MLP) as studied in Cybenko [3], Haykin [13], and Hornik et al. [14], our CNN $U$ is a fully connected neural network. However, it is still computationally efficient since our B-SPDE in (1.4) is driven by a Brownian motion. Therefore, the computation of our conditional expectation does not require additional multiple-parameter training, since the transition probability and density of a Brownian motion are known in closed form.
As an alternative, our conditional expectation projection can also be represented by a linear or nonlinear regression model with parameters, which is closely related to the so-called Kolmogorov-Arnold network due to Kolmogorov's superposition theorem (see, e.g., Kolmogorov [17] and Braun and Griebel [1]) and the recent study in Liu et al. [21]. If we proceed in this way, however, we need to train additional parameters, which is time-consuming and may sacrifice some accuracy. Thus, in Dai [9] and in our current paper, we report our achievements based on our successfully implemented, tower-law-oriented CNN $U$. Furthermore, along this line, we refer the reader to the related papers Gonon et al. [11], Kratsios et al. [19], Peluchetti [25], Sirignano and Spiliopoulos [26], and Vaswani et al. [27] for more details. Finally, concerning the numerical contributions of this study, we also provide simulation examples supported by an application in finance.
For the reader's convenience, partial simulation results from our developed simulation scheme are shown in Figure 2, where we present the simulation results concerning the paired solution $(V^{\epsilon,K}_{\bar{K}},\bar{V}^{\epsilon,K}_{\bar{K}})$ to the approximated B-SPDE in (1.4) with $\epsilon=1/100000$, $K=10^9$, and $\bar{K}=2^{64}$. The computed values at time point $t_{j_0}=t_{n_0}$ with $n_0=10$ correspond to the input terminal values at $t_{n_0+1}=T$. The "solution error check" in the third plot of the first column of Figure 2 refers to the difference between the two sides of the B-SPDE in (1.4) with respect to a particular sample path. From the simulation results displayed in this plot, we can see that our algorithm is quite accurate. The three plots in the first row of Figure 2 display the simulated $V^{\epsilon,K}_{\bar{K}}(t_{n_0},x)$ together with its simulated first-order and second-order derivatives in terms of the position parameter $x\in\{b/d,2b/d,\ldots,(d-\mathrm{dropnum})b/d\}$ with $b=1/6000$, $d=100$, and $\mathrm{dropnum}=30$. Although the graph in the third plot is non-smooth, it is close to a smooth line. Furthermore, the three plots in the second row of Figure 2 display the simulated $\bar{V}^{\epsilon,K}_{\bar{K}}(t_{n_0},x)$ together with its simulated first-order and second-order derivatives. In addition, the second and third plots in the third row of Figure 2 display the simulated terminal value $V^{\epsilon,K}_{\bar{K}}(t_{n_0+1},x)$ together with its simulated first-order derivative. Concerning the simulations of the paired solution $(V^{\epsilon,K}_{\bar{K}}(t,x),\bar{V}^{\epsilon,K}_{\bar{K}}(t,x))$, readers are referred to Section 4 for more details.
As pointed out previously, the primary interest in deriving the B-SPDE in (1.1) is to find the financial investment strategy together with the myopic investment and excess hedging demand (see, e.g., Dai [6,8], Musiela and Zariphopoulou [22], and Øksendal et al. [24]). Therefore, based on the simulation study conducted in this paper and the related strategy study presented in Musiela and Zariphopoulou [22], we can also obtain the simulated financial control strategies presented in Figure 3. In this figure, we display the simulated investment policy, myopic policy, and excess hedging demand at time point $t_{j_0}$ with $j_0=n_0$ and $n_0=10$. These policies correspond to the formula derived in Musiela and Zariphopoulou [22]. The three graphs in the left column correspond to the simulated pathwise results. The three graphs in the right column correspond to the simulated results in the mean-average sense with respect to the simulation iteration number $Q$. Theoretically, the myopic policy should remain constant, and our simulated results support this theoretical result; conversely, this agreement further justifies the correctness of our algorithm and simulations. Concerning this financial policy simulation, readers are referred to Section 4 for more details.
The remainder of the paper is organized as follows. In Section 2, we present the unique existence theorem, with proof, for our approximated B-SPDE. Furthermore, in this section, we also present the application of our study in finance. In Section 3, we design our Monte Carlo simulation algorithm through CNN and ML. The convergence of our designed simulation algorithm is also proved. In Section 4, we present our numerical simulation case studies. Finally, in Section 5, we summarize our study and draw conclusions.
This section consists of two subsections. The unique existence theorem is stated in Subsection 2.1. The financial application of our unique existence result is presented in Subsection 2.2.
Let $C^2(D,\mathbb{R})$ be the Banach space of all functions $f$ having continuous derivatives up to order 2, equipped with the uniform norm,
$$\|f\|_{C^2(D,\mathbb{R})}=\max_{c\in\{0,1,2\}}\max_{j\in\{1,\ldots,r(c)\}}\sup_{x\in D}\left|f^{(c)}_j(x)\right| \tag{2.1}$$
for each $f\in C^2(D,\mathbb{R})$. The $r(c)$ in (2.1) for each $c\in\{0,1,2\}$ is the total number of the partial derivatives of the $c$th order. Then, we can introduce the measurable spaces required in this paper. First, we use $L^2_{\mathcal{F}}([0,T],C^2(D,\mathbb{R}))$ to denote the set of all $\mathbb{R}$-valued (also called $C^2(D,\mathbb{R})$-valued) measurable random fields $Z(t,x)$ satisfying
$$E\left[\int_0^T\|Z(t)\|^2_{C^2(D,\mathbb{R})}\,dt\right]<\infty, \tag{2.2}$$
where the random field $Z(t,x)$ is assumed to be adapted to $\{\mathcal{F}_t,t\in[0,T]\}$ for each $x\in D$, and $Z(t,\cdot)\in C^2(D,\mathbb{R})$ for each fixed $t\in[0,T]$. Second, we use $L^2_{\mathcal{F},1}([0,T],C^2(D,\mathbb{R}))$ to denote the corresponding set of predictable processes (see the definitions on pages 21 and 45 of Ikeda and Watanabe [15]). Third, we use $L^2_{\mathcal{F}_T}(\Omega,C^2(D,\mathbb{R}))$ to denote the set of all $\mathbb{R}$-valued, $\mathcal{F}_T$-measurable random fields $\zeta(x,\omega)$ for each $x\in D$ and sample point $\omega\in\Omega$, where $\zeta(\cdot,\omega)\in C^2(D,\mathbb{R})$ for each $\omega\in\Omega$ satisfies
$$\|\zeta\|^2_{L^2_{\mathcal{F}_T}(\Omega,C^2(D,\mathbb{R}))}\equiv E\left[\|\zeta\|^2_{C^2(D,\mathbb{R})}\right]<\infty. \tag{2.3}$$
Therefore, we can introduce our supporting space as follows,
$$\bar{\mathcal{Q}}^2_{\mathcal{F}}([0,T]\times D)\equiv L^2_{\mathcal{F}}([0,T],C^2(D,\mathbb{R}))\times L^2_{\mathcal{F},1}([0,T],C^2(D,\mathbb{R})). \tag{2.4}$$
Finally, before introducing our unique existence theorem, we suppose that the terminal value in (1.1) (and hence in (1.4)) is given by
$$H(x)=h_1(x)h_2(W(T)), \tag{2.5}$$
where both h1 and h2 are polynomials (interested readers are also referred to Dai [9] for more related explanation). Then, we can state our unique existence theorem as follows.
Theorem 2.1. Under the terminal condition in (2.5), there is a unique strong $\{\mathcal{F}_t\}$-adapted solution pair $(V^{\epsilon,K}_{\bar{K}}(t,x),\bar{V}^{\epsilon,K}_{\bar{K}}(t,x))$ to the B-SPDE in (1.4) within the space $\bar{\mathcal{Q}}^2_{\mathcal{F}}([0,T]\times D)$ for a small constant $\epsilon>0$ and large numbers $K>0$ and $\bar{K}>0$ with $t\in[0,T]$. Furthermore, we have that
$$\sup_{t\in[0,T]}E\left[\|V^{\epsilon,K}_{\bar{K}}(t)\|^2_{C^2(D,\mathbb{R})}\right]<\infty. \tag{2.6}$$
Proof. Corresponding to the B-SPDE in (1.4), we define a second-order partial differential operator $\mathcal{L}^{\epsilon,K}_{\bar{K}}$ as follows,
$$\mathcal{L}^{\epsilon,K}_{\bar{K}}(t,x,V,\bar{V})=\frac{\left(\Phi_{\bar{K}}\left(V_x(t,x)+\bar{V}_x(t,x)\right)\right)^2}{\Psi_{\epsilon,K}\left(V_{xx}(t,x)\right)}. \tag{2.7}$$
Then, we can conclude that the operator $\mathcal{L}^{\epsilon,K}_{\bar{K}}$ defined in (2.7) satisfies the generalized local Lipschitz and linear growth conditions as introduced in Dai [9], i.e.,
$$\left|\Delta\mathcal{L}^{\epsilon,K}_{\bar{K}}(s,x,(u,\bar{u}),(v,\bar{v}))\right|\le K_D\left(\|u-v\|_{C^2(D,\mathbb{R})}+\|\bar{u}-\bar{v}\|_{C^2(D,\mathbb{R})}\right), \tag{2.8}$$
$$\left|\mathcal{L}^{\epsilon,K}_{\bar{K}}(s,x,(u,\bar{u}))\right|\le K_D\left(\|u\|_{C^2(D,\mathbb{R})}+\|\bar{u}\|_{C^2(D,\mathbb{R})}\right) \tag{2.9}$$
for each fixed $(s,x)\in[0,T]\times D$ and any $(u,\bar{u})$, $(v,\bar{v})\in C^2(D,\mathbb{R})\times C^2(D,\mathbb{R})$, where $(u,\bar{u})$ and $(v,\bar{v})$ are two pairs corresponding to the equation in (1.4). Furthermore, $K_D$ is a nonnegative constant depending on $D$, $\epsilon$, $K$, and $\bar{K}$. In addition, the operator $\Delta\mathcal{L}^{\epsilon,K}_{\bar{K}}$ is defined by
$$\Delta\mathcal{L}^{\epsilon,K}_{\bar{K}}(s,x,(u,\bar{u}),(v,\bar{v}))\equiv\mathcal{L}^{\epsilon,K}_{\bar{K}}(s,x,(u,\bar{u}))-\mathcal{L}^{\epsilon,K}_{\bar{K}}(s,x,(v,\bar{v})) \tag{2.10}$$
for each given $(s,x,(u,\bar{u}),(v,\bar{v}))$.
In fact, for any two pairs $(U,\bar{U})$ and $(V,\bar{V})$ in the space $\bar{\mathcal{Q}}^2_{\mathcal{F}}([0,T]\times D)$, it follows from the definition in (2.7) that we can prove the claim in (2.8) as follows,
$$\begin{aligned}
\left|\Delta\mathcal{L}^{\epsilon,K}_{\bar{K}}(s,x,(U,\bar{U}),(V,\bar{V}))\right|
&\le\left|\frac{\left(\Phi_{\bar{K}}\left(U_x(s,x)+\bar{U}_x(s,x)\right)\right)^2}{\Psi_{\epsilon,K}\left(U_{xx}(s,x)\right)}-\frac{\left(\Phi_{\bar{K}}\left(V_x(s,x)+\bar{V}_x(s,x)\right)\right)^2}{\Psi_{\epsilon,K}\left(V_{xx}(s,x)\right)}\right|\\
&=\left|\frac{\Psi_{\epsilon,K}\left(V_{xx}(s,x)\right)\left(\Phi_{\bar{K}}\left(U_x(s,x)+\bar{U}_x(s,x)\right)\right)^2-\Psi_{\epsilon,K}\left(U_{xx}(s,x)\right)\left(\Phi_{\bar{K}}\left(V_x(s,x)+\bar{V}_x(s,x)\right)\right)^2}{\Psi_{\epsilon,K}\left(U_{xx}(s,x)\right)\Psi_{\epsilon,K}\left(V_{xx}(s,x)\right)}\right|\\
&\le\frac{1}{\epsilon^2}\left|\Psi_{\epsilon,K}\left(V_{xx}(s,x)\right)\left(\left(\Phi_{\bar{K}}\left(U_x(s,x)+\bar{U}_x(s,x)\right)\right)^2-\left(\Phi_{\bar{K}}\left(V_x(s,x)+\bar{V}_x(s,x)\right)\right)^2\right)\right|\\
&\quad+\frac{1}{\epsilon^2}\left|\left(\Psi_{\epsilon,K}\left(U_{xx}(s,x)\right)-\Psi_{\epsilon,K}\left(V_{xx}(s,x)\right)\right)\left(\Phi_{\bar{K}}\left(V_x(s,x)+\bar{V}_x(s,x)\right)\right)^2\right|\\
&\le K^1_D\left(\left|\Psi_{\epsilon,K}\left(U_{xx}(s,x)\right)-\Psi_{\epsilon,K}\left(V_{xx}(s,x)\right)\right|+\left|\Phi_{\bar{K}}\left(U_x(s,x)+\bar{U}_x(s,x)\right)-\Phi_{\bar{K}}\left(V_x(s,x)+\bar{V}_x(s,x)\right)\right|\right)\\
&\le K_D\left(\|U-V\|_{C^2(D,\mathbb{R})}+\|\bar{U}-\bar{V}\|_{C^2(D,\mathbb{R})}\right),
\end{aligned} \tag{2.11}$$
where $K^1_D$ is a positive constant depending on $\epsilon$, $K$, $\bar{K}$, and $D$. Furthermore, $K_D$ is a positive constant depending on $K^1_D$.
Similarly, by applying the definition in (2.7), we can prove the claim in (2.9) as follows,
$$\left|\mathcal{L}^{\epsilon,K}_{\bar{K}}(s,x,(U,\bar{U}))\right|\le\frac{1}{\epsilon}\left(\Phi_{\bar{K}}\left(U_x(s,x)+\bar{U}_x(s,x)\right)\right)^2\le K_D\left(\|U\|_{C^2(D,\mathbb{R})}+\|\bar{U}\|_{C^2(D,\mathbb{R})}\right),$$
where the constant $K_D$ can be chosen such that both (2.11) and the inequality above hold.
Finally, it follows from (2.11), the inequality above, and the discussion in Dai [9] that the claims in our main theorem are true. Hence, we complete the proof of Theorem 2.1.
Consider a financial market consisting of one risky asset and one riskless asset. The risky asset is a stock whose price dynamics are driven by the Brownian motion $W(\cdot)$, i.e.,
$$dS(t)=S(t)\left(u(t)\,dt+\sigma(t)\,dW(t)\right) \tag{2.12}$$
with initial price $S(0)>0$. Furthermore, $u(\cdot)$ and $\sigma(\cdot)$ are $\{\mathcal{F}_t\}$-progressively measurable processes with values in $\mathbb{R}=(-\infty,\infty)$. The riskless asset has price process $R(t)$ with an interest rate $r(t)$, i.e.,
$$dR(t)=r(t)R(t)\,dt. \tag{2.13}$$
Then, it follows from the discussion in Musiela and Zariphopoulou [22] that the present value $X^{\beta}(t)=\beta_0(t)+\beta_1(t)$ of the aggregate investment concerning the riskless investment strategy $\beta_0(t)$ and the risky investment strategy $\beta_1(t)$ is given by
$$dX^{\beta}(t)=\beta(t)\left(u(t)-r(t)\right)dt+\sigma(t)\beta(t)\,dW(t), \tag{2.14}$$
where $\beta(t)=\beta_1(t)$ is the discounted strategy in the following admissibility set,
$$\mathcal{A}=\left\{\beta:\;\beta(t)\text{ is self-financing and }\{\mathcal{F}_t\}\text{-progressively measurable, satisfying }E\left[\int_0^t|\sigma(s)\beta(s)|^2\,ds\right]<\infty,\;X^{\beta}(t)\ge 0,\;t\ge 0\right\}. \tag{2.15}$$
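As an illustration of the wealth paths that an admissible strategy generates under (2.14), here is a minimal Euler–Maruyama sketch in Python; the function `simulate_wealth` and all parameter choices are our own illustrative assumptions, not the paper's algorithm.

```python
import numpy as np

def simulate_wealth(beta, u, sigma, r, x0, T, n, rng):
    """Euler-Maruyama discretization of the wealth dynamics (2.14):
    dX(t) = beta(t)(u(t) - r(t)) dt + sigma(t) beta(t) dW(t).
    beta, u, sigma, r are callables of time; returns the sampled path."""
    dt = T / n
    x = np.empty(n + 1)
    x[0] = x0
    for j in range(n):
        t = j * dt
        dw = rng.normal(0.0, np.sqrt(dt))  # Brownian increment over [t, t + dt]
        x[j + 1] = x[j] + beta(t) * (u(t) - r(t)) * dt + sigma(t) * beta(t) * dw
    return x

# The special case u = sigma = 1 and r = 0 used later in the paper,
# with a constant (illustrative) strategy beta = 1.
rng = np.random.default_rng(42)
path = simulate_wealth(beta=lambda t: 1.0, u=lambda t: 1.0,
                       sigma=lambda t: 1.0, r=lambda t: 0.0,
                       x0=1.0, T=1.0, n=250, rng=rng)
```

In this special case the path is $X(t)=X(0)+t+W(t)$, so $E[X(T)]=X(0)+T$, which gives a quick sanity check on the discretization.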
Now, for a given trading horizon $[0,T]$ and an investor's utility $\mu_T(\cdot):\mathbb{R}_+\to\mathbb{R}$ at terminal time $T$, which is supposed to be an increasing and convex function of wealth, we can represent the risk-seeking attitude of an investor (the risk-avoiding case, corresponding to a concave utility function, can be discussed similarly; see, e.g., Musiela and Zariphopoulou [22] for a reference). The investor's target is to maximize the expected utility of terminal wealth over the admissible strategy set $\mathcal{A}_T$ corresponding to the one in (2.15) with $t\in[0,T]$, i.e., to solve the optimization problem presented in (1.2). Then, if we take $u(t)=\sigma(t)\equiv 1$ and $r(t)\equiv 0$ in (2.12) and (2.13), respectively, the optimality equation corresponding to the optimization problem in (1.2) is given by the B-SPDE in (1.1) with $H(T,x)=\mu_T(x)$ (concerning the justification of this claim, readers are referred to Musiela and Zariphopoulou [22] for more details). In this case, the optimal feedback investment strategy is given by
$$\beta^*(t)=-\frac{V_x(t,x)}{V_{xx}(t,x)}-\frac{\bar{V}_x(t,x)}{V_{xx}(t,x)}, \tag{2.16}$$
where the first term (denoted by $\beta^{*,m}(t)$) on the right-hand side of (2.16) is called the myopic investment strategy, resembling the investment policy followed by an investor in a financial market where the investment opportunity set stays constant through time. Furthermore, the second term (denoted by $\beta^{*,h}(t)$) on the right-hand side of (2.16) is referred to as the excess hedging demand, denoting the additional investment due to the volatility $\bar{V}(t,x)$ of the performance process $V(t,x)$. Thus, corresponding to the paired solution $(V^{\epsilon,K}_{\bar{K}}(t,x),\bar{V}^{\epsilon,K}_{\bar{K}}(t,x))$ to the equation in (1.4), the approximated optimal investment strategies can be denoted by $\beta^*_\epsilon(t)$, $\beta^{*,m}_\epsilon(t)$, and $\beta^{*,h}_\epsilon(t)$ (see, e.g., the simulation results in Figures 3–5).
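The decomposition (2.16) can be sketched numerically as follows; the helper name `strategies` and its `eps` floor (which mimics the truncation $\Psi_{\epsilon,K}$ so the division is safe) are our own illustrative choices.

```python
import numpy as np

def strategies(v_x, v_xx, vbar_x, eps=1e-5):
    """Split the feedback strategy (2.16) into its myopic and
    excess-hedging components: beta* = -V_x/V_xx - Vbar_x/V_xx.
    The eps floor pushes V_xx away from zero, as Psi_{eps,K} does in (1.4)."""
    v_xx = np.asarray(v_xx, dtype=float)
    # Keep the sign of V_xx but force |V_xx| >= eps (0 is mapped to +eps).
    safe = np.where(v_xx >= 0.0, np.maximum(v_xx, eps), np.minimum(v_xx, -eps))
    myopic = -np.asarray(v_x, dtype=float) / safe      # beta^{*,m}
    hedging = -np.asarray(vbar_x, dtype=float) / safe  # beta^{*,h}
    return myopic, hedging, myopic + hedging           # total beta^*
```

Fed with the finite-difference derivatives of the simulated pair $(V^{\epsilon,K}_{\bar{K}},\bar{V}^{\epsilon,K}_{\bar{K}})$, a routine of this shape produces the approximated policies $\beta^{*,m}_\epsilon$, $\beta^{*,h}_\epsilon$, and $\beta^*_\epsilon$ plotted in Figures 3–5.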
This section consists of two subsections, concerning the design of a Monte Carlo simulation algorithm and the proof of its convergence with an error-bound estimate.
In this subsection, we develop a Monte Carlo simulation algorithm based on both a CNN and machine learning (see Figure 1) to simulate the 2-tuple adapted strong solution to the B-SPDE in (1.4). More precisely, we consider a partition $\pi$ of the product region $[0,T]\times D$ with $D=[0,b]$ as follows,
$$\pi:\quad 0=t_0<t_1<\cdots<t_{n_0}=T\ \text{with}\ n_0\in\{0,1,\ldots\},\qquad 0=x_0<x_1<\cdots<x_{n_1}=b\ \text{with}\ n_1\in\{0,1,\ldots\}, \tag{3.1}$$
where $t_{j_0}$ for $j_0\in\{0,1,2,\ldots,n_0\}$ and $x_{j_1}$ for $j_1\in\{0,1,\ldots,n_1\}$ are the division points over the time interval $[0,T]$ and the space interval $D$, respectively. Then, for all $j_l\in\{1,\ldots,n_l\}$ with $l\in\{0,1\}$, we take
$$\Delta^{t,\pi}_{j_0}=t_{j_0}-t_{j_0-1}, \tag{3.2}$$
$$\Delta^{\pi}_{1}=x_{j_1}-x_{j_1-1}=\frac{b}{n_1}, \tag{3.3}$$
$$\Delta^{\pi}W_{j_0}=W(t_{j_0})-W(t_{j_0-1}). \tag{3.4}$$
Furthermore, let
$$|\pi|\equiv\max_{j_0\in\{1,\ldots,n_0\}}\left\{\Delta^{t,\pi}_{j_0},\Delta^{\pi}_{1}\right\}, \tag{3.5}$$
$$D_{j_1}\equiv[x_{j_1-1},x_{j_1}), \tag{3.6}$$
$$\mathcal{X}\equiv\left\{x_{j_1}:j_1\in\{0,1,\ldots,n_1\}\right\}. \tag{3.7}$$
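A minimal Python sketch of the partition quantities in (3.1)–(3.7); the function name `make_partition` and the uniform-grid choice are illustrative assumptions (the definitions above also allow non-uniform time steps).

```python
import numpy as np

def make_partition(T, b, n0, n1, rng):
    """Build a uniform partition pi of [0, T] x [0, b] as in (3.1), the
    increments (3.2)-(3.4), and the mesh size |pi| of (3.5)."""
    t = np.linspace(0.0, T, n0 + 1)     # 0 = t_0 < ... < t_{n0} = T
    x = np.linspace(0.0, b, n1 + 1)     # 0 = x_0 < ... < x_{n1} = b
    dt = np.diff(t)                     # Delta^{t,pi}_{j0} in (3.2)
    dx = b / n1                         # Delta^pi_1 in (3.3)
    dW = rng.normal(0.0, np.sqrt(dt))   # Delta^pi W_{j0} in (3.4)
    mesh = max(dt.max(), dx)            # |pi| in (3.5)
    return t, x, dt, dx, dW, mesh

# Grid sizes matching the simulation setup reported later (n0 = 10, b = 1/6000).
t, x, dt, dx, dW, mesh = make_partition(T=1.0, b=1.0 / 6000.0, n0=10,
                                        n1=100, rng=np.random.default_rng(0))
```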
Now, we use forward and backward difference techniques to approximate the partial derivatives appearing in (1.4). More precisely, for each $f\in\{V^{\epsilon,K}_{\bar{K}},\bar{V}^{\epsilon,K}_{\bar{K}}\}$, $x\in\mathcal{X}$, and each integer $c\in\{1,2\}$, we define the $c$th quotient of differences, which corresponds to the $c$th-order derivative of $f$ along the $x$ direction, as follows,
$$f^{(c)}_{\pi}(t,x)\equiv\begin{cases}\dfrac{f^{(c-1)}_{\pi}(t,x+\Delta^{\pi}_{1})-f^{(c-1)}_{\pi}(t,x)}{\Delta^{\pi}_{1}} & \text{if } x=x_{j_1}\text{ and }j_1<n_1,\\[3mm] -\dfrac{f^{(c-1)}_{\pi}(t,x-\Delta^{\pi}_{1})-f^{(c-1)}_{\pi}(t,x)}{\Delta^{\pi}_{1}} & \text{if } x=x_{j_1}\text{ and }j_1=n_1,\end{cases} \tag{3.8}$$
where we adopt the convention that $f^{(0)}_{\pi}=f_{\pi}$. Furthermore, to simplify the notation, we use $\{V^{\epsilon},\bar{V}^{\epsilon}\}$ to denote $\{V^{\epsilon,K}_{\bar{K}},\bar{V}^{\epsilon,K}_{\bar{K}}\}$, and we define
$$\mathcal{L}^{\epsilon,K}_{\bar{K}}\left(t,x,V^{\epsilon}_{\pi}(t,x)\right)\equiv\mathcal{L}^{\epsilon,K}_{\bar{K}}\left(t,x,\left(V^{\epsilon}_{\pi}(t,x),V^{\epsilon,(1)}_{\pi}(t,x),V^{\epsilon,(2)}_{\pi}(t,x)\right),\left(\bar{V}^{\epsilon}_{\pi}(t,x),\bar{V}^{\epsilon,(1)}_{\pi}(t,x)\right)\right) \tag{3.9}$$
for each $x\in\mathcal{X}$. Moreover, we use $\mathcal{L}^{\epsilon,K}_{\pi,\bar{K}}$ to denote the fully discretized version of $\mathcal{L}^{\epsilon,K}_{\bar{K}}$. Then, we can present the following Monte Carlo simulation algorithm.
Algorithm 3.1. This algorithm consists of three parts: Part I, Part II, and Part III:
Part I. This part is iterative in terms of $\{(V^{\epsilon}(t_{j_0},x),\bar{V}^{\epsilon}(t_{j_0},x)):x\in\mathcal{X}\}$ with $j_0$ decreasing from $n_0$ to 1 in a backward way,
$$V^{\epsilon}_{\pi}(t_{n_0},x)=H_{\pi}(x), \tag{3.10}$$
$$\bar{V}^{\epsilon}_{\pi}(t_{n_0},x)=0, \tag{3.11}$$
$$V^{\epsilon}_{\pi}(t_{j_0-1},x)=E\left[V^{\epsilon}_{\pi}(t_{j_0},x)+\mathcal{L}^{\epsilon,K}_{\pi,\bar{K}}\left(t_{j_0},x,V^{\epsilon}_{\pi}(t_{j_0},x)\right)\Delta^{t,\pi}_{j_0}\,\middle|\,\mathcal{F}_{t_{j_0-1}}\right], \tag{3.12}$$
$$\bar{V}^{\epsilon}_{\pi}(t_{j_0-1},x)=\frac{1}{\Delta^{t,\pi}_{j_0}}E\left[V^{\epsilon}_{\pi}(t_{j_0},x)\Delta^{\pi}W_{j_0}\,\middle|\,\mathcal{F}_{t_{j_0-1}}\right]+E\left[\mathcal{L}^{\epsilon,K}_{\pi,\bar{K}}\left(t_{j_0},x,V^{\epsilon}_{\pi}(t_{j_0},x)\right)\Delta^{\pi}W_{j_0}\,\middle|\,\mathcal{F}_{t_{j_0-1}}\right]. \tag{3.13}$$
Part II. This part is a machine learning loop at time $t_{j_0-1}$ whose aim is to minimize the difference between the values on the two sides of the equation in (1.4) along each sample path, i.e.,
$$V^{\epsilon,k+1}_{\pi}(t_{j_0-1},x)=V^{\epsilon,k}_{\pi}(t_{j_0-1},x)-\alpha\nabla G\left(V^{\epsilon,k}_{\pi}(t_{j_0-1},x)\right) \tag{3.14}$$
for each $k\in\{0,1,2,\ldots\}$ with $V^{\epsilon,0}_{\pi}(t_{j_0-1},x)=V^{\epsilon}_{\pi}(t_{j_0-1},x)$, where $\alpha$ is a given learning rate and $\nabla G(\cdot)$ is the stochastic gradient of the optimization problem
$$\min_{V^{\epsilon,k}_{\pi}(t_{j_0-1},x)\in\mathbb{R}}G\left(V^{\epsilon,k}_{\pi}(t_{j_0-1},x)\right)$$
with its objective function $G(V^{\epsilon,k}_{\pi}(t_{j_0-1},x))$ given by
$$\left(V^{\epsilon,k}_{\pi}(t_{j_0-1},x)-\left(V^{\epsilon}_{\pi}(t_{j_0},x)-\frac{1}{2}\mathcal{L}^{\epsilon,K}_{\pi,\bar{K}}\left(t_{j_0-1},x,V^{\epsilon,k}_{\pi}(t_{j_0-1},x)\right)\Delta^{t,\pi}_{j_0}\right)+\bar{V}^{\epsilon}_{\pi}(t_{j_0-1},x)\Delta^{\pi}W_{j_0}\right)^2.$$
The machine learning loop in this part has a stopping rule as follows.
Choose a number $(k+1)$ to stop the iteration; then take the new $V^{\epsilon}_{\pi}(t_{j_0-1},x)$ to be $V^{\epsilon,k+1}_{\pi}(t_{j_0-1},x)$. (3.15)
Part III. This part computes the numerical derivatives at each time $t_{j_0-1}$, i.e.,
Compute $V^{\epsilon,(c)}_{\pi}(t_{j_0-1},x)$ and $\bar{V}^{\epsilon,(c)}_{\pi}(t_{j_0-1},x)$ for each $x\in\mathcal{X}$ with $c\in\{1,2\}$ via the formula in (3.8). (3.16)
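Part III's derivative computation via the quotients of differences in (3.8) can be sketched in Python as follows; `difference_quotients` is our own illustrative name, and the grid is the uniform one from (3.1).

```python
import numpy as np

def difference_quotients(f, dx):
    """First- and second-order quotients of differences as in (3.8):
    forward differences for j1 < n1 and a backward difference (written
    with the minus sign of (3.8)) at the right endpoint j1 = n1."""
    def one_step(g):
        d = np.empty_like(g)
        d[:-1] = (g[1:] - g[:-1]) / dx   # forward difference, j1 < n1
        d[-1] = -(g[-2] - g[-1]) / dx    # j1 = n1 case of (3.8)
        return d
    f1 = one_step(np.asarray(f, dtype=float))  # f^{(1)}_pi
    f2 = one_step(f1)                          # f^{(2)}_pi
    return f1, f2

# Sanity check on f(x) = x: the first quotient is exactly 1 and the
# second exactly 0 on a uniform grid.
xs = np.linspace(0.0, 1.0, 11)
f1, f2 = difference_quotients(xs, 0.1)
```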
Concerning the architecture, the loss, and the learning strategy corresponding to Algorithm 3.1, the associated flow chart is shown in Figure 1, where we present a backward CNN supported by an ML loop.
As displayed in the graph on the left-hand side of Figure 1, this CNN, denoted by $\mathcal{U}$, has $(n_0+1)$ layers arranged in a backward way, which corresponds to Part I of Algorithm 3.1. In this CNN, a node (also called a neuron) is indexed by $(t_j,x_i)$ with $j\in\{n_0,n_0-1,\ldots,1,0\}$ and $i\in\{0,1,\ldots,n_1\}$, where, in Algorithm 3.1, we take $j=j_0$ and $i=j_1$. Associated with each node $(t_j,x_i)$, the paired solution $(V^{\epsilon}(t_j,x_i),\bar{V}^{\epsilon}(t_j,x_i))$ is treated as a pair of parameters to be trained or estimated. The training process proceeds backward as follows:
$$(V^{\epsilon}(t_{n_0},x_i),\bar{V}^{\epsilon}(t_{n_0},x_i))\to(V^{\epsilon}(t_{n_0-1},x_i),\bar{V}^{\epsilon}(t_{n_0-1},x_i))\to\cdots\to(V^{\epsilon}(t_0,x_i),\bar{V}^{\epsilon}(t_0,x_i))$$
for each $i\in\{0,1,\ldots,n_1\}$. The design rationale for this CNN is the tower law for conditional expectation (see, e.g., Theorem 5.1 (vii) on page 81 of Kallenberg [16]); more explanations concerning the design rationale for a CNN can be found in Dai [9]. The learning strategy for this CNN is to minimize the mean-squared error (loss) between $(V^{\epsilon},\bar{V}^{\epsilon})$ and $\mathcal{U}$.
As displayed in the graph on the right-hand side of Figure 1, the supporting ML loop corresponds to Part II of Algorithm 3.1. The purpose of adding this ML loop to Algorithm 3.1 is to speed up the convergence of the algorithm in Dai [9]. Indeed, in Dai [9], the algorithm is designed only through the backward CNN corresponding to Part I of Algorithm 3.1, and our numerical implementations presented in this paper indicate that the convergence of Algorithm 3.1 is indeed faster after adding this ML loop. In the supporting ML loop, the iteration is designed only for $V^{\epsilon}_{\pi}$, as shown in Part II of Algorithm 3.1; we keep $\bar{V}^{\epsilon}_{\pi}$ the same for all $k$. The learning strategy of this loop corresponds to solving the optimization problem presented in Part II of Algorithm 3.1, which minimizes the difference (loss) between the two sides of the B-SPDE in (1.4) along each sample path in the squared-error sense. In the learning iteration of (3.14), the learning rate $\alpha$ is designed to satisfy the condition $\alpha K_{1D}<1$ for a constant $K_{1D}$ (presented in (3.22) of this paper). It is worthwhile to point out that, even after we add this ML loop, we can still prove the convergence of the newly designed Algorithm 3.1, as presented in the next subsection.
Finally, the associated simulation examples based on Algorithm 3.1 will be provided in Section 4. Before that, in the next subsection, we conduct the convergence analysis with error bound estimation.
In this subsection, we focus on the discussion concerning the convergence and error bound estimation of Algorithm 3.1. More precisely, for each $t\in[0,T]$, $x\in\mathcal{X}$, and $j_0\in\{n_0,n_0-1,\ldots,1\}$, we define
$$\Delta V^{\epsilon,K}_{\bar{K}}(t,x)=V^{\epsilon,K}_{\bar{K}}(t,x)-V^{\epsilon}_{\pi}(t,x),\tag{3.17}$$
$$\Delta\bar{V}^{\epsilon,K}_{\bar{K}}(t,x)=\bar{V}^{\epsilon,K}_{\bar{K}}(t,x)-\bar{V}^{\epsilon}_{\pi}(t,x),\qquad\begin{aligned}V^{\epsilon}_{\pi}(t,x)&=V^{\epsilon}_{\pi}(t_{j_0-1},x),&t\in[t_{j_0-1},t_{j_0}),\\ \bar{V}^{\epsilon}_{\pi}(t,x)&=\bar{V}^{\epsilon}_{\pi}(t_{j_0-1},x),&t\in[t_{j_0-1},t_{j_0}).\end{aligned}\tag{3.18}$$
Furthermore, let
$$\xi(k)=\sum_{i_0=1}^{k}1+\sum_{i_0=1}^{k}\sum_{i_1=1}^{i_0}1+\sum_{i_0=1}^{k}\sum_{i_1=1}^{i_0}\sum_{i_2=1}^{i_1}1+\cdots+\sum_{i_0=1}^{k}\sum_{i_1=1}^{i_0}\sum_{i_2=1}^{i_1}\cdots\sum_{i_{m-1}=1}^{2}1.\tag{3.19}$$
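As a sanity check on (3.19), each $j$-fold iterated sum of $1$'s counts the nonincreasing chains $k\geq i_0\geq i_1\geq\cdots\geq 1$ and therefore equals $\binom{k+j-1}{j}$, so $\xi(k)$ grows polynomially in $k$. A small Python sketch of this count (which, as a simplifying assumption, ignores the truncated innermost limit $2$ in the last term of (3.19)):

```python
from math import comb

def iterated_sum(k, depth):
    """Brute-force value of the depth-fold iterated sum sum_{i0<=k} sum_{i1<=i0} ... 1."""
    if depth == 0:
        return 1
    return sum(iterated_sum(i, depth - 1) for i in range(1, k + 1))

def xi(k, m):
    """xi(k) of (3.19) under the assumption that every inner sum runs up to its
    predecessor's index (the paper truncates the innermost limit of the last term at 2)."""
    return sum(comb(k + j - 1, j) for j in range(1, m + 1))

# The closed form matches the brute-force nested sums.
assert iterated_sum(5, 2) == comb(6, 2) == 15
assert iterated_sum(4, 3) == comb(6, 3) == 20
assert xi(3, 2) == 3 + 6
```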
In addition, it follows from the generalized local Lipschitz and linear growth conditions in (2.8) and (2.9) that
$$|\Delta(\nabla G)(s,x,u,v)|\leq K_{1D}\|u-v\|_{C^{2}(D,\mathbb{R})},\tag{3.20}$$
$$|\nabla G(s,x,u)|\leq K_{1D}\|u\|_{C^{2}(D,\mathbb{R})}\tag{3.21}$$
for some positive constant $K_{1D}$, where $(s,x)\in[0,T]\times D$ and $u,v\in C^{2}(D,\mathbb{R})$. Therefore, we can introduce the following assumption concerning the learning rate $\alpha$:
$$\alpha K_{1D}<1.\tag{3.22}$$
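The role of condition (3.22) can be seen on a toy quadratic objective: for a gradient map with Lipschitz constant $L$ (playing the role of $K_{1D}$), the update $x\leftarrow x-\alpha\nabla G(x)$ with $\nabla G(x)=Lx$ scales the iterate by $(1-\alpha L)$ each step, so $\alpha L<1$ guarantees a contraction. A minimal check (the objective and the constants here are illustrative, not the paper's):

```python
def iterate(x0, a, L, steps):
    """Run x <- x - a * grad(x) with grad(x) = L * x; each step scales x by (1 - a*L)."""
    x = x0
    for _ in range(steps):
        x -= a * L * x
    return x

L1 = 2.0
assert abs(iterate(1.0, 0.4, L1, 50)) < 1e-6   # a*L = 0.8 < 1: contracts to 0
assert abs(iterate(1.0, 1.2, L1, 50)) > 1e6    # a*L = 2.4: the factor |1 - a*L| exceeds 1
```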
Finally, let $\|\cdot\|$ denote the largest absolute value of each function involved over all $x\in\mathcal{X}$. Then, we can present our algorithm convergence theorem with error bound estimation as follows.
Theorem 3.1. For Algorithm 3.1 with the condition in (3.22) and a given iteration number $k$ in (3.14), there is a nonnegative constant $C$ depending only on the terminal time $T$, the Lipschitz constant $K_D$ in (2.8) and (2.9), and the supremum (a constant) in (2.6) such that the following mean-square error estimation is true:
$$\sup_{t\in[0,T]}\Big(E\big[\|\Delta V^{\epsilon,K}_{\bar{K}}(t)\|^{2}\big]+E\big[\|\Delta\bar{V}^{\epsilon,K}_{\bar{K}}(t)\|^{2}\big]\Big)\leq C|\pi|\tag{3.23}$$
for all sufficiently small $|\pi|$. In addition, consider a sequence of increasing sets $\mathcal{X}_{\bar{k}}$ with $\bar{k}\in\{1,2,\ldots\}$ (i.e., $\mathcal{X}_1\subset\mathcal{X}_2\subset\cdots$). Suppose that the corresponding maximal mesh gauge $|\pi|_{\bar{k}}$ along $\bar{k}\in\{1,2,\ldots\}$ satisfies
$$|\pi|_{\bar{k}}\to 0,\qquad\sum_{\bar{k}=1}^{\infty}\big(|\pi|_{\bar{k}}\big)^{\frac{1}{3}}<\infty.$$
Then, for a given $\mathcal{X}\in\{\mathcal{X}_1,\mathcal{X}_2,\ldots\}$, we have the a.s. convergence for Algorithm 3.1 as $\bar{k}\to\infty$:
$$\sup_{t\in[0,T]}\Big(\|\Delta V^{\epsilon,K}_{\bar{K}}(t)\|+\|\Delta\bar{V}^{\epsilon,K}_{\bar{K}}(t)\|\Big)\to 0\quad\text{a.s.}\tag{3.24}$$
Proof. First, for the machine learning loop given by (3.14) in Part II of Algorithm 3.1, a given integer $k\in\{0,1,2,\ldots\}$, and an index $j_0\in\{n_0,n_0-1,\ldots,1\}$, it follows from the facts in (3.20) and (3.21) that
$$\begin{aligned}
\big\|V^{\epsilon,k+1}_{\pi}(t_{j_0-1})-V^{\epsilon,0}_{\pi}(t_{j_0-1})\big\|
&\leq\alpha\sum_{i_0=1}^{k}\big\|\nabla G(V^{\epsilon,i_0}_{\pi}(t_{j_0-1}))\big\|\\
&\leq\alpha\sum_{i_0=1}^{k}\Big(\sum_{i_1=1}^{i_0}\big\|\nabla G(V^{\epsilon,i_1}_{\pi}(t_{j_0-1}))-\nabla G(V^{\epsilon,i_1-1}_{\pi}(t_{j_0-1}))\big\|+\big\|\nabla G(V^{\epsilon,0}_{\pi}(t_{j_0-1}))\big\|\Big)\\
&\leq\alpha|\pi|K_{1D}\sum_{i_0=1}^{k}\Big(\sum_{i_1=1}^{i_0}\big\|V^{\epsilon,i_1}_{\pi}(t_{j_0-1})-V^{\epsilon,i_1-1}_{\pi}(t_{j_0-1})\big\|_{C^{2}(\mathcal{X},\mathbb{R})}+\big\|V^{\epsilon,0}_{\pi}(t_{j_0-1})\big\|_{C^{2}(\mathcal{X},\mathbb{R})}\Big)\\
&\leq\alpha|\pi|K_{1D}\sum_{i_0=1}^{k}\Big(\sum_{i_1=1}^{i_0}\alpha|\pi|K_{1D}\Big(\sum_{i_2=1}^{i_1}\big\|V^{\epsilon,i_2}_{\pi}(t_{j_0-1})-V^{\epsilon,i_2-1}_{\pi}(t_{j_0-1})\big\|_{C^{2}(\mathcal{X},\mathbb{R})}+\big\|V^{\epsilon,0}_{\pi}(t_{j_0-1})\big\|_{C^{2}(\mathcal{X},\mathbb{R})}\Big)+\big\|V^{\epsilon,0}_{\pi}(t_{j_0-1})\big\|_{C^{2}(\mathcal{X},\mathbb{R})}\Big)\\
&\;\;\vdots\\
&\leq\alpha|\pi|K_{1D}\,\xi(k)\,\big\|V^{\epsilon,0}_{\pi}(t_{j_0-1})\big\|_{C^{2}(\mathcal{X},\mathbb{R})},
\end{aligned}\tag{3.25}$$
where $\xi(k)$ is given in (3.19). Thus, for the given iteration number $k$ and $t\in[t_{j_0-1},t_{j_0})$, it follows from (3.25) and the discussion in Dai [9] that
$$\begin{aligned}
E\big[\|\Delta V^{\epsilon,K}_{\bar{K}}(t)\|^{2}\big]
&=E\big[\|V^{\epsilon,K}_{\bar{K}}(t)-V^{\epsilon,k+1}_{\pi}(t)\|^{2}\big]\\
&\leq 2E\big[\|V^{\epsilon,K}_{\bar{K}}(t)-V^{\epsilon,0}_{\pi}(t)\|^{2}\big]+2E\big[\|V^{\epsilon,0}_{\pi}(t)-V^{\epsilon,k+1}_{\pi}(t)\|^{2}\big]\\
&\leq C_{1}|\pi|+\big(\alpha|\pi|K_{1D}\xi(k)\big)^{2}E\big[\|V^{\epsilon,0}_{\pi}(t_{j_0-1})\|^{2}_{C^{2}(\mathcal{X},\mathbb{R})}\big]\\
&\leq C_{1}|\pi|+2\big(\alpha|\pi|K_{1D}\xi(k)\big)^{2}\Big(E\big[\|V^{\epsilon,0}_{\pi}(t)-V^{\epsilon,K}_{\bar{K}}(t)\|^{2}_{C^{2}(\mathcal{X},\mathbb{R})}\big]+E\big[\|V^{\epsilon,K}_{\bar{K}}(t)\|^{2}_{C^{2}(\mathcal{X},\mathbb{R})}\big]\Big)\\
&\leq C_{1}|\pi|+2\big(\alpha|\pi|K_{1D}\xi(k)\big)^{2}\Big(C_{1}|\pi|+\sup_{t\in[0,T]}E\big[\|V^{\epsilon,K}_{\bar{K}}(t)\|^{2}_{C^{2}(D,\mathbb{R})}\big]\Big),
\end{aligned}\tag{3.26}$$
where $C_1$ is some constant depending on $T$ and $K_D$. Since $V^{\epsilon,0}_{\pi}(t_{j_0-1},x)$ for each $j_0\in\{n_0,n_0-1,\ldots,1\}$ satisfies the equation in (3.12), it follows from Theorem 2.1 and (3.26) that the claim in (3.23) is true. Furthermore, the claim in (3.24) can be similarly proved by applying the discussion in Dai [9]. Hence, we finish the proof of Theorem 3.1.
Now, if the B-SPDE in (1.1) has an $\{\mathcal{F}_t\}$-adapted solution pair $(V(t,x),\bar{V}(t,x))$, then $V_{xx}(t,x)$ cannot be zero a.s. at any point $(t,x)\in[0,T]\times D$. Therefore, the numerical procedure in Algorithm 3.1 can also be applied to solve the equation in (1.1), e.g.,
$$V_{\pi}(t_{n_0},x)=H_{\pi}(x),\tag{3.27}$$
$$\bar{V}_{\pi}(t_{n_0},x)=0,\tag{3.28}$$
$$V_{\pi}(t_{j_0-1},x)=E\Big[V_{\pi}(t_{j_0},x)+\mathcal{L}_{\pi}(t_{j_0},x,V_{\pi}(t_{j_0},x))\Delta^{t,\pi}_{j_0}\,\Big|\,\mathcal{F}_{t_{j_0-1}}\Big],\tag{3.29}$$
$$\bar{V}_{\pi}(t_{j_0-1},x)=\frac{1}{\Delta^{\pi}_{j_0}}E\Big[V_{\pi}(t_{j_0},x)\Delta^{\pi}W_{j_0}\,\Big|\,\mathcal{F}_{t_{j_0-1}}\Big]+E\Big[\mathcal{L}_{\pi}(t_{j_0},x,V_{\pi}(t_{j_0},x))\Delta^{\pi}W_{j_0}\,\Big|\,\mathcal{F}_{t_{j_0-1}}\Big],\tag{3.30}$$
where $\mathcal{L}_{\pi}$ in (3.29) and (3.30) is the discrete version of the following partial differential operator:
$$\mathcal{L}(t,x,V,\bar{V})=\frac{\big(V_{x}(t,x)+\bar{V}_{x}(t,x)\big)^{2}}{V_{xx}(t,x)}.\tag{3.31}$$
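The singularity of (3.31) at $V_{xx}=0$ is what the truncations in (1.4) are designed to tame. The following sketch contrasts the raw operator with a guessed truncated counterpart built from the clipping bounds $\epsilon$, $K$, and $\bar K$; the exact forms of the truncation maps are the ones defined in the paper, so this is only an illustration of the mechanism:

```python
import numpy as np

def L_raw(Vx, bVx, Vxx):
    """The singular operator of (3.31): (V_x + bar{V}_x)^2 / V_xx."""
    return (Vx + bVx) ** 2 / Vxx

def L_eps(Vx, bVx, Vxx, eps=1e-5, K=1e9, Kbar=1e6):
    """A guessed truncated counterpart: the slope V_x + bar{V}_x is capped at Kbar and
    |V_xx| is clipped into [eps, K] before dividing, so the output never exceeds Kbar**2 / eps."""
    num = np.clip(Vx + bVx, -Kbar, Kbar) ** 2
    den = np.where(Vxx < 0.0, -1.0, 1.0) * np.clip(np.abs(Vxx), eps, K)
    return num / den

vxx = np.array([1.0, 1e-3, 1e-12])
raw = L_raw(1.0, 0.0, vxx)   # blows up as V_xx -> 0
tr = L_eps(1.0, 0.0, vxx)    # stays bounded: the last entry equals 1 / eps
```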
Then, we have the following corollary.
Corollary 3.2. Under the condition in (3.22) with the Lipschitz constant $K_D$ in (2.8)-(2.9), if there is a pair of $\{\mathcal{F}_t\}$-adapted solutions $(V(t,x),\bar{V}(t,x))$ in the space $\bar{\mathcal{Q}}^{2}_{\mathcal{F}}([0,T]\times D)$ to the B-SPDE in (1.1), then we have that
$$(V^{\epsilon}_{\pi}(t_{j_0},x),\bar{V}^{\epsilon}_{\pi}(t_{j_0},x))\to(V_{\pi}(t_{j_0},x),\bar{V}_{\pi}(t_{j_0},x))\quad\text{a.s.},\tag{3.32}$$
$$E\big[\big|(V^{\epsilon}_{\pi}(t_{j_0},x),\bar{V}^{\epsilon}_{\pi}(t_{j_0},x))-(V_{\pi}(t_{j_0},x),\bar{V}_{\pi}(t_{j_0},x))\big|\big]\to 0\tag{3.33}$$
as $\bar{K}\to\infty$ first, $K\to\infty$ second, and $\epsilon\to 0$ third for each $x\in\mathcal{X}$ and each $j_0\in\{n_0,n_0-1,\ldots,1\}$ in a backward way, and $|\pi|_{\bar{k}}\to 0$ last along $\bar{k}\in\{1,2,\ldots\}$.
Proof. We prove the claims in (3.32) and (3.33) by induction on $j_0\in\{n_0,n_0-1,\ldots,2,1\}$. First, we consider the case $j_0=n_0-1$. Writing $V_x=V_x(t_{n_0},x)$ and $V_{xx}=V_{xx}(t_{n_0},x)$ for brevity, it follows from (3.12) and (3.29) that
$$\begin{aligned}
\big|V_{\pi}(t_{n_0-1},x)-V^{\epsilon}_{\pi}(t_{n_0-1},x)\big|
&\leq E\Big[\Big(\frac{V_x^2}{|V_{xx}|}+\frac{\bar{K}^2}{|V_{xx}|}\Big)I_{\{\epsilon\leq|V_{xx}|\leq K\}}I_{\{|V_x|>\bar{K}\}}\Delta^{t,\pi}_{n_0}\,\Big|\,\mathcal{F}_{t_{n_0-1}}\Big]\\
&\quad+E\Big[\Big(\frac{V_x^2}{|V_{xx}|}+\frac{\bar{K}^2}{\epsilon}\Big)I_{\{|V_{xx}|<\epsilon\}}I_{\{|V_x|>\bar{K}\}}\Delta^{t,\pi}_{n_0}\,\Big|\,\mathcal{F}_{t_{n_0-1}}\Big]\\
&\quad+E\Big[\Big(\frac{V_x^2}{|V_{xx}|}+\frac{\bar{K}^2}{K}\Big)I_{\{|V_{xx}|>K\}}I_{\{|V_x|>\bar{K}\}}\Delta^{t,\pi}_{n_0}\,\Big|\,\mathcal{F}_{t_{n_0-1}}\Big]\\
&\quad+E\Big[\Big(\frac{V_x^2}{|V_{xx}|}+\frac{V_x^2}{\epsilon}\Big)I_{\{|V_{xx}|<\epsilon\}}I_{\{|V_x|\leq\bar{K}\}}\Delta^{t,\pi}_{n_0}\,\Big|\,\mathcal{F}_{t_{n_0-1}}\Big]\\
&\quad+E\Big[\Big(\frac{V_x^2}{|V_{xx}|}+\frac{V_x^2}{K}\Big)I_{\{|V_{xx}|>K\}}I_{\{|V_x|\leq\bar{K}\}}\Delta^{t,\pi}_{n_0}\,\Big|\,\mathcal{F}_{t_{n_0-1}}\Big]\\
&\leq 2E\Big[\frac{V_x^2}{|V_{xx}|}I_{\{\epsilon\leq|V_{xx}|\leq K\}}I_{\{|V_x|>\bar{K}\}}\Delta^{t,\pi}_{n_0}\,\Big|\,\mathcal{F}_{t_{n_0-1}}\Big]
+2E\Big[\frac{V_x^2}{|V_{xx}|}I_{\{|V_{xx}|<\epsilon\}}I_{\{|V_x|>\bar{K}\}}\Delta^{t,\pi}_{n_0}\,\Big|\,\mathcal{F}_{t_{n_0-1}}\Big]\\
&\quad+E\Big[\Big(\frac{V_x^2}{|V_{xx}|}+\frac{V_x^2}{K}\Big)I_{\{|V_{xx}|>K\}}I_{\{|V_x|>\bar{K}\}}\Delta^{t,\pi}_{n_0}\,\Big|\,\mathcal{F}_{t_{n_0-1}}\Big]
+2E\Big[\frac{V_x^2}{|V_{xx}|}I_{\{|V_{xx}|<\epsilon\}}I_{\{|V_x|\leq\bar{K}\}}\Delta^{t,\pi}_{n_0}\,\Big|\,\mathcal{F}_{t_{n_0-1}}\Big]\\
&\quad+E\Big[\Big(\frac{V_x^2}{|V_{xx}|}+\frac{V_x^2}{K}\Big)I_{\{|V_{xx}|>K\}}I_{\{|V_x|\leq\bar{K}\}}\Delta^{t,\pi}_{n_0}\,\Big|\,\mathcal{F}_{t_{n_0-1}}\Big]\\
&\leq 2E\Big[\frac{V_x^2}{|V_{xx}|}I_{\{|V_x|>\bar{K}\}}\Delta^{t,\pi}_{n_0}\,\Big|\,\mathcal{F}_{t_{n_0-1}}\Big]
+2E\Big[\frac{V_x^2}{|V_{xx}|}I_{\{|V_x|>\bar{K}\}}\Delta^{t,\pi}_{n_0}\,\Big|\,\mathcal{F}_{t_{n_0-1}}\Big]\\
&\quad+E\Big[\Big(\frac{V_x^2}{|V_{xx}|}+\frac{V_x^2}{K}\Big)I_{\{|V_x|>\bar{K}\}}\Delta^{t,\pi}_{n_0}\,\Big|\,\mathcal{F}_{t_{n_0-1}}\Big]
+2E\Big[\frac{V_x^2}{|V_{xx}|}I_{\{|V_{xx}|<\epsilon\}}\Delta^{t,\pi}_{n_0}\,\Big|\,\mathcal{F}_{t_{n_0-1}}\Big]\\
&\quad+E\Big[\Big(\frac{V_x^2}{|V_{xx}|}+\frac{V_x^2}{K}\Big)I_{\{|V_{xx}|>K\}}\Delta^{t,\pi}_{n_0}\,\Big|\,\mathcal{F}_{t_{n_0-1}}\Big].
\end{aligned}\tag{3.34}$$
Then, by the monotone (and conditional monotone) convergence theorem, first letting $\bar{K}\to\infty$, second $K\to\infty$, and third $\epsilon\to 0$, we can conclude that
$$\big|V^{\epsilon}_{\pi}(t_{n_0-1},x)-V_{\pi}(t_{n_0-1},x)\big|\to 0\quad\text{a.s.},\tag{3.35}$$
$$E\big[\big|V^{\epsilon}_{\pi}(t_{n_0-1},x)-V_{\pi}(t_{n_0-1},x)\big|\big]\to 0.\tag{3.36}$$
Second, we consider the case $j_0=n_0-2$. Let $V^{\epsilon,K}_{\pi,\bar{K}}(t_{n_0-1},x)$ be the corresponding value computed through (3.29). Writing $V_x=V_x(t_{n_0-1},x)$, $\bar{V}_x=\bar{V}_x(t_{n_0-1},x)$, and $V_{xx}=V_{xx}(t_{n_0-1},x)$ for brevity, it follows from (3.12) and (3.29) that
$$\begin{aligned}
\big|V_{\pi}(t_{n_0-2},x)-V^{\epsilon}_{\pi}(t_{n_0-2},x)\big|
&\leq E\big[|\Delta V^{\epsilon,K}_{\bar{K}}(t_{n_0-1},x)|\,\big|\,\mathcal{F}_{t_{n_0-2}}\big]
+E\big[|V_{\pi}(t_{n_0-1},x)-V^{\epsilon,K}_{\pi,\bar{K}}(t_{n_0-1},x)|\,\big|\,\mathcal{F}_{t_{n_0-2}}\big]\\
&\quad+E\Big[\Big(\frac{(V_x+\bar{V}_x)^2}{|V_{xx}|}+\frac{\bar{K}^2}{|V_{xx}|}\Big)I_{\{\epsilon\leq|V_{xx}|\leq K\}}I_{\{|V_x+\bar{V}_x|>\bar{K}\}}\Delta^{t,\pi}_{n_0-1}\,\Big|\,\mathcal{F}_{t_{n_0-2}}\Big]\\
&\quad+E\Big[\Big(\frac{(V_x+\bar{V}_x)^2}{|V_{xx}|}+\frac{\bar{K}^2}{\epsilon}\Big)I_{\{|V_{xx}|<\epsilon\}}I_{\{|V_x+\bar{V}_x|>\bar{K}\}}\Delta^{t,\pi}_{n_0-1}\,\Big|\,\mathcal{F}_{t_{n_0-2}}\Big]\\
&\quad+E\Big[\Big(\frac{(V_x+\bar{V}_x)^2}{|V_{xx}|}+\frac{\bar{K}^2}{K}\Big)I_{\{|V_{xx}|>K\}}I_{\{|V_x+\bar{V}_x|>\bar{K}\}}\Delta^{t,\pi}_{n_0-1}\,\Big|\,\mathcal{F}_{t_{n_0-2}}\Big]\\
&\quad+E\Big[\Big(\frac{(V_x+\bar{V}_x)^2}{|V_{xx}|}+\frac{(V_x+\bar{V}_x)^2}{\epsilon}\Big)I_{\{|V_{xx}|<\epsilon\}}I_{\{|V_x+\bar{V}_x|\leq\bar{K}\}}\Delta^{t,\pi}_{n_0-1}\,\Big|\,\mathcal{F}_{t_{n_0-2}}\Big]\\
&\quad+E\Big[\Big(\frac{(V_x+\bar{V}_x)^2}{|V_{xx}|}+\frac{(V_x+\bar{V}_x)^2}{K}\Big)I_{\{|V_{xx}|>K\}}I_{\{|V_x+\bar{V}_x|\leq\bar{K}\}}\Delta^{t,\pi}_{n_0-1}\,\Big|\,\mathcal{F}_{t_{n_0-2}}\Big]\\
&\leq E\big[|\Delta V^{\epsilon,K}_{\bar{K}}(t_{n_0-1},x)|\,\big|\,\mathcal{F}_{t_{n_0-2}}\big]
+E\big[|V_{\pi}(t_{n_0-1},x)-V^{\epsilon,K}_{\pi,\bar{K}}(t_{n_0-1},x)|\,\big|\,\mathcal{F}_{t_{n_0-2}}\big]\\
&\quad+4E\Big[\frac{(V_x+\bar{V}_x)^2}{|V_{xx}|}I_{\{|V_x+\bar{V}_x|>\bar{K}\}}\Delta^{t,\pi}_{n_0-1}\,\Big|\,\mathcal{F}_{t_{n_0-2}}\Big]\\
&\quad+2E\Big[(V_x+\bar{V}_x)^2\Big(\frac{1}{|V_{xx}|}+\frac{1}{K}\Big)I_{\{|V_{xx}|>K\}}\Delta^{t,\pi}_{n_0-1}\,\Big|\,\mathcal{F}_{t_{n_0-2}}\Big]\\
&\quad+2E\Big[\frac{(V_x+\bar{V}_x)^2}{|V_{xx}|}I_{\{|V_{xx}|<\epsilon\}}\Delta^{t,\pi}_{n_0-1}\,\Big|\,\mathcal{F}_{t_{n_0-2}}\Big].
\end{aligned}\tag{3.37}$$
Then, by the monotone convergence theorem, first letting $\bar{K}\to\infty$, second $K\to\infty$, and third $\epsilon\to 0$, it follows from (3.35) and (3.36) that
$$\big|V^{\epsilon}_{\pi}(t_{n_0-2},x)-V_{\pi}(t_{n_0-2},x)\big|\to E\big[|\Delta V^{\epsilon,K}_{\bar{K}}(t_{n_0-1},x)|\,\big|\,\mathcal{F}_{t_{n_0-2}}\big]+E\big[|V_{\pi}(t_{n_0-1},x)-V^{\epsilon,K}_{\pi,\bar{K}}(t_{n_0-1},x)|\,\big|\,\mathcal{F}_{t_{n_0-2}}\big]\quad\text{a.s.},\tag{3.38}$$
$$E\big[|V^{\epsilon}_{\pi}(t_{n_0-2},x)-V_{\pi}(t_{n_0-2},x)|\big]\to E\big[|\Delta V^{\epsilon,K}_{\bar{K}}(t_{n_0-1},x)|\big]+E\big[|V_{\pi}(t_{n_0-1},x)-V^{\epsilon,K}_{\pi,\bar{K}}(t_{n_0-1},x)|\big].\tag{3.39}$$
Finally, a similar argument can be applied to the analysis of $(\bar{V}_{\pi}(t_{n_0-1},x)-\bar{V}^{\epsilon}_{\pi}(t_{n_0-1},x))$. Then, it follows from the induction, the dominated convergence theorem, and Theorem 3.1 that the claims in (3.32) and (3.33) hold. Hence, we have finished the proof of Corollary 3.2.
In this section, we present simulation examples to show the effectiveness of Algorithm 3.1. More precisely, we first simulate the paired solution to the approximated B-SPDE in (1.4). Since no analytic solution to the B-SPDE is available, we numerically compare the difference between the two sides of our simulated B-SPDE to check the correctness of our computed solution. All of the results concerning the simulated solution are presented in Figures 2, 6, and 8. Second, based on the simulated solution of the B-SPDE, we also conduct further simulations concerning the financial investment strategies derived in (2.16). All of the simulated results are displayed in Figures 3-5. Note that some theoretically predicted properties of the financial investment strategies appear in the simulation results, which further supports the correctness of our simulated solution to the B-SPDE in (1.4).
In all of the simulations, we use $T$ to denote the terminal time, a positive integer $n$ to denote the number of equally divided subintervals over $[0,T]$, a positive integer $h$ to represent the highest order of partial derivatives in the implemented equation, a positive integer $Q$ to be the total number of normally distributed random numbers, a positive number $b$ to be the size of the space parameter, and a positive integer $d$ to be the number of equally divided subintervals over $[0,b]$. Furthermore, we use a positive number $BMD$ to denote the upper bound of an interval such that our driving Brownian motion satisfies $W\in[-BMD,BMD]$, and a positive integer $bmdp$ to denote the number of equally divided subintervals over $[0,BMD]$. In addition, we use an integer $(n_0+1)$ to denote the number of layers of the backward network in the left part of Figure 1 and an integer $(k+1)$ to denote the number of reinforcement iterations in the right part of Figure 1.
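The parameter list above can be collected concretely as follows; all numerical values here are hypothetical placeholders rather than the paper's actual settings:

```python
import numpy as np

# Hypothetical simulation parameters (placeholders, not the paper's values).
params = dict(
    T=1.0,       # terminal time
    n=10,        # equally divided subintervals of [0, T]
    h=2,         # highest order of partial derivatives in the implemented equation
    Q=10_000,    # total number of normally distributed random numbers
    b=1.0,       # size of the space parameter
    d=100,       # equally divided subintervals of [0, b]
    BMD=4.0,     # Brownian motion confined to [-BMD, BMD]
    bmdp=100,    # equally divided subintervals of [0, BMD]
)

t_grid = np.linspace(0.0, params["T"], params["n"] + 1)
x_grid = np.linspace(0.0, params["b"], params["d"] + 1)

# One batch of Brownian increments on the time grid, truncated to [-BMD, BMD].
rng = np.random.default_rng(1)
step = params["T"] / params["n"]
dW = np.clip(rng.normal(0.0, np.sqrt(step), size=(params["Q"], params["n"])),
             -params["BMD"], params["BMD"])
```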
Note that the main purpose of imposing the upper bounds $K$ and $\bar{K}$ in (1.4) is for the equation in (1.4) to satisfy the general local Lipschitz and linear growth conditions. In our simulations, we take them to be sufficiently large so that our simulation results are stable and acceptable. Actually, we take $\bar{K}$ to be the largest number allowed by our computer, i.e., $\bar{K}=2^{64}$. The computed first-order derivatives of $(V^{\epsilon,K}_{\bar{K}},\bar{V}^{\epsilon,K}_{\bar{K}})$ are much smaller than this upper bound $\bar{K}$. In this sense, our simulations are stable and acceptable. Concerning $K$, we tested different values. The simulation results presented in this paper correspond to $K=10^{9}$; using $K=10^{7}$ instead did not affect the results. For safety, we used the simulation results corresponding to $K=10^{9}$ in this paper.
However, since we are handling a B-SPDE in (1.1) with a singularity, the choice of $\epsilon$ indeed has a significant impact on our simulation results. After careful tests, we chose to present the simulation results corresponding to $\epsilon=1/100000$ in this paper. For $\epsilon$ between $1/100000$ and $1/100$, the corresponding simulation results were still reasonable and acceptable; however, for $\epsilon$ significantly smaller than $1/100000$, the simulation results were not acceptable. Concerning the learning rate, we chose $\alpha=1/2$, which gave good simulation results; values of $\alpha$ around $1/2$ were also acceptable, but $\alpha=1$ was not.
In Figure 2, we present the simulation results of the paired solution $(V^{\epsilon,K}_{\bar{K}},\bar{V}^{\epsilon,K}_{\bar{K}})$ to the approximated B-SPDE in (1.4) with $\epsilon=1/100000$, $K=10^{9}$, and $\bar{K}=2^{64}$. The computed values at time point $t_{j_0}=t_{n_0}$ with $n_0=10$ correspond to the input terminal values at $t_{n_0+1}=T$. The terminal random field $H(x)$ in (1.4) is taken to be of the form
$$H(x)=C\big(1+W^{2}(T)\big)\big(x+5.560000\big)^{2},\tag{4.1}$$
where $C$ is a positive constant taken to be the terminal coefficient $2\times 100$ as in the explanation for Figure 2. Note that the "solution error check" titled in the third plot of the first column in Figure 2 is in terms of the difference (denoted by "Err") between the two sides of the B-SPDE in (1.4) with respect to a particular sample path. In applying Algorithm 3.1 with the terminal time at $t_{n_0+1}=T$ to solve the equation in (1.4), the corresponding Err at time $t_{n_0}$ has the following expression:
$$\mathrm{Err}(t_{n_0},x)=V(t_{n_0},x)-H(x)+\frac{1}{2}(T-t_{n_0})\,\Phi_{\bar{K}}\big(V_{x}(t_{n_0},x)+\bar{V}_{x}(t_{n_0},x)\big)\Psi^{\epsilon,K}\big(V_{xx}(t_{n_0},x)\big)+\big(W(T)-W(t_{n_0})\big)\bar{V}(t_{n_0},x).$$
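The pathwise error check can be coded directly from the displayed formula. In this sketch, `phi_term` and `psi_term` stand in for the truncation maps $\Phi_{\bar{K}}$ and $\Psi^{\epsilon,K}$ evaluated at $V_x+\bar{V}_x$ and $V_{xx}$, whose exact definitions are given earlier in the paper; all names here are illustrative:

```python
import numpy as np

def err_check(V_n0, H, bV_n0, phi_term, psi_term, T, t_n0, W_T, W_n0):
    """Pathwise residual Err(t_{n0}, x) between the two sides of the discretized
    equation; all array arguments are per-path values over the space grid."""
    return (V_n0 - H
            + 0.5 * (T - t_n0) * phi_term * psi_term
            + (W_T - W_n0) * bV_n0)

# If V matches H exactly, the truncated generator term vanishes, and the Brownian
# increment is zero, the residual is identically zero.
H = np.ones(5)
resid = err_check(H, H, np.full(5, 3.0), 0.0, 0.0, 1.0, 0.9, 0.5, 0.5)
```

A residual that is small across $x$ and across sample paths is exactly the "solution error check" reported in Figure 2.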
From the simulation results displayed in the third plot of the first column, we can see that our algorithm is quite accurate. Furthermore, the three plots in the first row of Figure 2 display the simulated $V^{\epsilon,K}_{\bar{K}}(t_{n_0},x)$ together with its simulated first-order and second-order derivatives in terms of the position parameter $x\in\{b/d,2b/d,\ldots,(d-\mathrm{dropnum})b/d\}$ with $d=100$ and $\mathrm{dropnum}=30$. Although the graph in the third plot is non-smooth, it is close to a smooth line. In addition, the three plots in the second row of Figure 2 display the simulated $\bar{V}^{\epsilon,K}_{\bar{K}}(t_{n_0},x)$ together with its simulated first-order and second-order derivatives. Finally, the second and third plots in the third row of Figure 2 display the simulated terminal value $V^{\epsilon,K}_{\bar{K}}(t_{n_0+1},x)$ together with its simulated first-order derivative. In Figure 6, we display plots similar to those in Figure 2 but with the simulated results at time point $t_{n_0-8}=t_3$.
In Figure 7, we display the first case of the solution evolution as the time index $j_0$ decreases from $n_0+1$ to $3$ with $n_0=10$. More precisely, the three graphs in magenta display the dynamic evolution of $V^{\epsilon,K}_{\bar{K}}(t_{j_0},x)$ together with its first-order and second-order derivatives, while the graph in blue displays the evolution of $\bar{V}^{\epsilon,K}_{\bar{K}}(t_{j_0},x)$. Similarly, in Figure 8, we display the second case of the solution evolution as the time index $j_0$ decreases from $n_0+1$ to $3$ with $n_0=10$: the three graphs in blue display the dynamic evolution of $\bar{V}^{\epsilon,K}_{\bar{K}}(t_{j_0},x)$ together with its first-order and second-order derivatives, while the graph in magenta displays the evolution of $V^{\epsilon,K}_{\bar{K}}(t_{j_0},x)$.
In Figure 3, we display the simulated investment policy, myopic policy, and excess hedging demand at time point $t_{j_0}$ with $j_0=n_0$ and $n_0=10$. These policies correspond to the formula in (2.16) and its related explanations. The three graphs in the left column correspond to the simulated pathwise results, and the three graphs in the right column correspond to the simulated results in the mean-average sense with respect to the simulation iteration number $Q$. Theoretically, the myopic policy should remain constant, and our simulated results support this; conversely, this theoretical result further justifies the correctness of our algorithm and simulations. Similarly, in Figure 4, we display the simulated investment policy, myopic policy, and excess hedging demand at time point $t_{n_0-8}=t_3$ with $n_0=10$. Finally, in Figure 5, we display the simulated dynamical evolutions of the investment policy, myopic policy, and excess hedging demand with respect to the time parameter $t\in\{T,T(n-1)/n,\ldots,T(n-8)/n\}$.
In this paper, we studied a strongly nonlinear B-SPDE through an approximation method. This equation is well-known and was previously derived from studies in finance; however, how to analyze and solve it has remained an open problem for quite a long time. Therefore, by applying our previously established theory and numerical scheme together with CNN and ML, we have developed an effective approximation method with a Monte Carlo simulation algorithm to tackle this well-known open problem. In doing so, the existence and uniqueness of the 2-tuple adapted strong solution to an approximation B-SPDE were proved. Meanwhile, the convergence of a newly designed simulation algorithm was established. Simulation examples and applications in finance were also provided.
The author declares he has not used Artificial Intelligence (AI) tools in the creation of this article.
The author acknowledges that the project was funded by the National Natural Science Foundation of China with Grant No. 11771006. The author would like to thank the editors and the reviewers for their helpful comments and suggestions to revise this paper.
The author declares that he has no competing interests.
[1] J. Braun, M. Griebel, On a constructive proof of Kolmogorov's superposition theorem, Constr. Approx., 35 (2009), 653–675. https://doi.org/10.1007/s00365-009-9054-2
[2] A. Černý, J. Kallsen, On the structure of general mean-variance hedging strategies, Ann. Probab., 35 (2007), 1479–1531. https://doi.org/10.1214/009117906000000872
[3] G. Cybenko, Approximation by superpositions of a sigmoidal function, Math. Control Signals Syst., 2 (1989), 303–314. https://doi.org/10.1007/BF02551274
[4] W. Dai, Brownian approximations for queueing networks with finite buffers: modeling, heavy traffic analysis and numerical implementations, Ph.D. thesis, Georgia Institute of Technology, 1996.
[5] J. G. Dai, W. Dai, A heavy traffic limit theorem for a class of open queueing networks with finite buffers, Queueing Syst., 32 (1999), 5–40. https://doi.org/10.1023/A:1019178802391
[6] W. Dai, Mean-variance portfolio selection based on a generalized BNS stochastic volatility model, Int. J. Comput. Math., 88 (2011), 3521–3534. https://doi.org/10.1080/00207160.2011.606904
[7] W. Dai, Optimal rate scheduling via utility-maximization for J-user MIMO Markov fading wireless channels with cooperation, Oper. Res., 61 (2013), 1450–1462. https://doi.org/10.1287/opre.2013.1224
[8] W. Dai, Mean-variance hedging based on an incomplete market with external risk factors of non-Gaussian OU processes, Math. Probl. Eng., 2015 (2015), 625289. https://doi.org/10.1155/2015/625289
[9] W. Dai, Convolutional neural network based simulation and analysis for backward stochastic partial differential equations, Comput. Math. Appl., 119 (2022), 21–58. https://doi.org/10.1016/j.camwa.2022.05.019
[10] W. Dai, Optimal policy computing for blockchain based smart contracts via federated learning, Oper. Res. Int. J., 22 (2022), 5817–5844. https://doi.org/10.1007/s12351-022-00723-z
[11] L. Gonon, L. Grigoryeva, J. P. Ortega, Approximation bounds for random neural networks and reservoir systems, Ann. Appl. Probab., 33 (2023), 28–69. https://doi.org/10.1214/22-AAP1806
[12] R. Gozalo-Brizuela, E. C. Garrido-Merchan, ChatGPT is not all you need. A state of the art review of large generative AI models, preprint paper, 2023. https://doi.org/10.48550/arXiv.2301.04655
[13] S. Haykin, Neural Networks: A Comprehensive Foundation, New Jersey: Prentice Hall PTR, 1994.
[14] K. Hornik, M. Stinchcombe, H. White, Multilayer feedforward networks are universal approximators, Neural Networks, 2 (1989), 359–366. https://doi.org/10.1016/0893-6080(89)90020-8
[15] N. Ikeda, S. Watanabe, Stochastic Differential Equations and Diffusion Processes, 2 Eds., Kodansha: North-Holland, 1989.
[16] O. Kallenberg, Foundations of Modern Probability, Berlin: Springer, 1997.
[17] A. N. Kolmogorov, On the representation of continuous functions of several variables as superpositions of continuous functions of a smaller number of variables, Dokl. Akad. Nauk, 108 (1956).
[18] D. Kramkov, M. Sirbu, On the two times differentiability of the value function in the problem of optimal investment in incomplete markets, Ann. Appl. Probab., 16 (2006), 1352–1384. https://doi.org/10.1214/105051606000000259
[19] A. Kratsios, V. Debarnot, I. Dokmanić, Small transformers compute universal metric embeddings, J. Mach. Learn. Res., 24 (2023), 1–48.
[20] Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, et al., Backpropagation applied to handwritten zip code recognition, Neural Comput., 1 (1989), 541–551. https://doi.org/10.1162/neco.1989.1.4.541
[21] Z. Liu, Y. Wang, S. Vaidya, F. Ruehle, J. Halverson, M. Soljačić, et al., KAN: Kolmogorov-Arnold networks, preprint paper, 2024. https://arxiv.org/abs/2404.19756
[22] M. Musiela, T. Zariphopoulou, Stochastic partial differential equations and portfolio choice, In: Contemporary Quantitative Finance, Berlin: Springer, 2009. https://doi.org/10.1007/978-3-642-03479-4_11
[23] B. Øksendal, Stochastic Differential Equations, 6 Eds., New York: Springer, 2005.
[24] B. Øksendal, A. Sulem, T. Zhang, A stochastic HJB equation for optimal control of forward-backward SDEs, In: The Fascination of Probability, Statistics and their Applications, Berlin: Springer, 2016.
[25] S. Peluchetti, Diffusion bridge mixture transports, Schrödinger bridge problems and generative modeling, J. Mach. Learn. Res., 24 (2023), 1–51.
[26] J. Sirignano, K. Spiliopoulos, DGM: a deep learning algorithm for solving partial differential equations, J. Comput. Phys., 375 (2018), 1339–1364. https://doi.org/10.1016/j.jcp.2018.08.029
[27] A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, et al., Attention is all you need, Adv. Neural Inf. Process. Syst., 30 (2017), 5998–6008.
[28] R. Yamashita, M. Nishio, R. K. G. Do, K. Togashi, Convolutional neural networks: an overview and application in radiology, Insights Imaging, 9 (2018), 611–629. https://doi.org/10.1007/s13244-018-0639-9