This work considers stochastic optimization problems in which the objective function values can only be computed by a blackbox corrupted by random noise following an unknown distribution. The proposed method is based on sequential stochastic optimization (SSO): the original problem is decomposed into a sequence of subproblems. Each subproblem is solved with a zeroth-order version of a sign stochastic gradient descent with momentum algorithm (ZO-signum) and with increasingly fine precision. This decomposition enables thorough exploration of the search space while maintaining the efficiency of the algorithm once it approaches the solution. Under a Lipschitz continuity assumption on the blackbox, a convergence rate in mean is derived for the ZO-signum algorithm. Moreover, if the blackbox is smooth and convex, or locally convex around its minima, a convergence rate to an $ \epsilon $-optimal point of the problem can be obtained for the SSO algorithm. Numerical experiments compare the SSO algorithm with other state-of-the-art algorithms and demonstrate its competitiveness.
Citation: Charles Audet, Jean Bigeon, Romain Couderc, Michael Kokkolaras. Sequential stochastic blackbox optimization with zeroth-order gradient estimators[J]. AIMS Mathematics, 2023, 8(11): 25922-25956. doi: 10.3934/math.20231321
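To make the algorithmic pipeline described in the abstract concrete, the following is a minimal Python sketch of a sequential scheme of this kind, assuming a standard forward-difference Gaussian-smoothing gradient estimator and a signum-with-momentum update. The function names (`zo_gradient_estimate`, `zo_signum`, `sso`), the geometric precision schedule, and all parameter values are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def zo_gradient_estimate(f, x, mu=1e-3, n_samples=10):
    """Forward-difference Gaussian-smoothing gradient estimator (standard zeroth-order scheme)."""
    d = x.size
    g = np.zeros(d)
    for _ in range(n_samples):
        u = np.random.randn(d)
        g += (f(x + mu * u) - f(x)) / mu * u
    return g / n_samples

def zo_signum(f, x0, lr=1e-2, beta=0.9, mu=1e-3, n_iters=500):
    """Zeroth-order signum: momentum on the estimated gradient, step taken along its sign."""
    x, m = x0.copy(), np.zeros_like(x0)
    for _ in range(n_iters):
        g = zo_gradient_estimate(f, x, mu)
        m = beta * m + (1 - beta) * g       # momentum average of gradient estimates
        x = x - lr * np.sign(m)             # sign-based update
    return x

def sso(f, x0, n_subproblems=5):
    """Sequential stochastic optimization: re-solve subproblems with increasingly fine precision."""
    x = x0.copy()
    for k in range(n_subproblems):
        # hypothetical schedule: smaller step and smoothing radius, more iterations as k grows
        x = zo_signum(f, x, lr=1e-2 / 2**k, mu=1e-2 / 2**k, n_iters=200 * (k + 1))
    return x

# Usage on a noisy quadratic: the blackbox returns f corrupted by additive Gaussian noise.
rng = np.random.default_rng(0)
f_noisy = lambda x: np.sum((x - 1.0) ** 2) + 0.01 * rng.standard_normal()
x_star = sso(f_noisy, x0=np.zeros(10))
```

The coarse-to-fine schedule mirrors the idea stated in the abstract: early subproblems use large steps and smoothing radii to explore the space, while later ones refine the solution with finer precision.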