Citation: Krzysztof Fujarewicz. Estimation of initial functions for systems with delays from discrete measurements[J]. Mathematical Biosciences and Engineering, 2017, 14(1): 165-178. doi: 10.3934/mbe.2017011
[1] | H.Thomas Banks, Danielle Robbins, Karyn L. Sutton . Theoretical foundations for traditional and generalized sensitivity functions for nonlinear delay differential equations. Mathematical Biosciences and Engineering, 2013, 10(5&6): 1301-1333. doi: 10.3934/mbe.2013.10.1301 |
[2] | Heping Ma, Hui Jian, Yu Shi . A sufficient maximum principle for backward stochastic systems with mixed delays. Mathematical Biosciences and Engineering, 2023, 20(12): 21211-21228. doi: 10.3934/mbe.2023938 |
[3] | Tyler Cassidy, Morgan Craig, Antony R. Humphries . Equivalences between age structured models and state dependent distributed delay differential equations. Mathematical Biosciences and Engineering, 2019, 16(5): 5419-5450. doi: 10.3934/mbe.2019270 |
[4] | H.T. Banks, S. Dediu, H.K. Nguyen . Sensitivity of dynamical systems to parameters in a convex subset of a topological vector space. Mathematical Biosciences and Engineering, 2007, 4(3): 403-430. doi: 10.3934/mbe.2007.4.403 |
[5] | Filippo Cacace, Valerio Cusimano, Alfredo Germani, Pasquale Palumbo, Federico Papa . Closed-loop control of tumor growth by means of anti-angiogenic administration. Mathematical Biosciences and Engineering, 2018, 15(4): 827-839. doi: 10.3934/mbe.2018037 |
[6] | Ranjit Kumar Upadhyay, Swati Mishra, Yueping Dong, Yasuhiro Takeuchi . Exploring the dynamics of a tritrophic food chain model with multiple gestation periods. Mathematical Biosciences and Engineering, 2019, 16(5): 4660-4691. doi: 10.3934/mbe.2019234 |
[7] | Blaise Faugeras, Olivier Maury . An advection-diffusion-reaction size-structured fish population dynamics model combined with a statistical parameter estimation procedure: Application to the Indian Ocean skipjack tuna fishery. Mathematical Biosciences and Engineering, 2005, 2(4): 719-741. doi: 10.3934/mbe.2005.2.719 |
[8] | Joseph E. Carroll . A two-dimensional discrete delay-differential system model of viremia. Mathematical Biosciences and Engineering, 2022, 19(11): 11195-11216. doi: 10.3934/mbe.2022522 |
[9] | Dimitri Breda, Davide Liessi . A practical approach to computing Lyapunov exponents of renewal and delay equations. Mathematical Biosciences and Engineering, 2024, 21(1): 1249-1269. doi: 10.3934/mbe.2024053 |
[10] | Masoud Saade, Samiran Ghosh, Malay Banerjee, Vitaly Volpert . An epidemic model with time delays determined by the infectivity and disease durations. Mathematical Biosciences and Engineering, 2023, 20(7): 12864-12888. doi: 10.3934/mbe.2023574 |
Dynamical systems with delays are important class of models describing phenomena appearing in many areas, for example in industry or biology. One of the practical problems related to such models is a need to estimate their parameters based on measurements carried out in the real system (process).
There are many works dealing with this problem [1], [4], [12], [14], [15], [16], [17], [18]. Unfortunately, most of proposed approaches assume that the analyzed system is linear and both input and output signals for delaying elements can be measured.
More general and universal approaches, for non-linear systems with delays have been proposed in [13] and [8]. Both approaches depends on gradient-based minimization of an appropriately defined objective function. The latter approach uses so-called structural adjoint sensitivity analysis, which decreases significantly computational effort when many parameters are estimated. Moreover, this approach is more general and can be applied to any dynamical system presented in structural form as block diagram containing any number of delay elements.
All above mentioned methods are focused only on estimation of time delays and eventually other parameters of the mathematical model. But general task of identification of dynamical systems requires also estimation of initial conditions in the situation when they are unknown.
In case of one discrete delay element the initial condition (its "state" for time
There are relatively little works related to the problem of estimation of initial functions for systems with delays. In paper [2] a gradient based approach to estimation of initial functions for non-linear systems described by retarded type delay differential equations (RDDE). Another paper [3] deals with systems described by neutral type delay differential equations (NDDE). In both works a gradient-based estimation of initial functions is done by using adjoint sensitivity analysis.
In this work a more general structural adjoint sensitivity analysis is utilized. It is especially useful for models described by block diagrams and was originally developed for neural networks [5] and afterwards was used for different models described by ordinary differential equations [6,7,11], systems with delays [8] and age-structured models [10]. Recently it has been used for spatiotemporal models of tumor growth [9].
The structural sensitivity approach can be applied to any non-linear dynamical system presented as a block diagram and containing many discrete delay elements. Therefore, it may be used for wider class than analysed in [2] and [3]. For example it may be applied for systems containing delays in input (control) channel, which is not allowed in RDDE and NDDE models. The proposed approach can be used for systems which output signals are measurable continuously and for sampled systems where the information output signals is available only at discrete time moments.
Let us consider a model of dynamical system with one isolated delay element presented in Fig. 1. We do not assume any particular structure of the model
1. Linear static element represented by a gain matrix
2. Linear continuous-time dynamical element represented by a transfer function
3. Linear discrete-time dynamical element represented by a transfer function
4. Non-linear static element described by a function
5. Summing junction.
6. Branching node.
7. Ideal d-c pulser, placed between discrete-time part of the system and the continuous-time part, which output signal contains Dirac pulses multiplied by instantaneous value of its discrete-time input.
8. Ideal c-d pulser, placed between continuous-time part of the system and the discrete-time part, which output signal contains Kronecker pulses multiplied by instantaneous value of its continuous-time input.
Using such a set of elements one can present any non-linear hybrid continuous-discrete dynamical system with delays of arbitrary structure as a block diagram.
For the sake of simplicity the system presented in Fig. 1 contains only one delay element but in general case there can be more delay elements with different delay times.
The delay element is described by the input-output relation
r(t)=q(t−τ) | (1) |
with the initial condition
q(t)=φ(t)fort∈[−τ,0] | (2) |
The function
The delay element can be also mathematically described in Laplace operator domain by its transfer function which is frequently used for example in control systems theory. The transfer function
K(s)=R(s)Q(s)=L{r(t)}L{q(t)}=e−sτ | (3) |
where
We also assume that the output signal
Let us define an objective function which is a measure of discrepancy between the measurements and the output of the model
J=12N∑n=1(y(n)−d(n))2 | (4) |
Problem 1. Find the initial function of the delay element
The above task will be solved iteratively using the gradient-based approach. Hence, we need to solve the folowing sub-problem
Problem 2. Find the gradient of the objective function (4) in the space of the initial function
To solve the Problem 2 we will use the adjoint sensitivity analysis. In works [5], [7] rules for construction on the sensitivity model and the adjoint model have been presented. In addition in [8] such rules has been extended to systems with delays and it has been shown how to perform the sensitivity analysis with respect to delay times. Now, we are going to show how to calculate the gradient of the objective function in the space of the initial function of the delay element
Before we start to solve problems formulated in previous section let us present one delay element in the form which will be more suitable for further analysis. This form comes from the observation that the delay element with non-zero initial condition can be replaced by a delay element with zero initial conditions and with additional signal
A function
ψ(t)={φ(t−τ)fort≤τ0fort>τ | (5) |
Therefore the task of finding the gradient of the objective function in the initial function space
Problem 3. Find the gradient of the objective function (4) in the space of the input signal
The delay time
The sensitivity model of the delay element presented in Fig. 2, which describes relationship between variations of all signals
Rules for construction of the adjoint system presented in works [5] and [7] specify, among others, that the directions of all signals should be reversed and all summing junctions should be replaced by branching nodes. As a result we obtain the adjoint system of one delay element presented in Fig. 3b. The output signal
To solve the Problem 3 (and consecutively Problem 2 and Problem 2) let us extend the general model presented in Fig. 1. The extended model, presented in Fig. 4, takes into account that we minimize the objective function (4). It is obtained by using a non-linear element calculating the quadratic function in (4) and the discrete transfer function
Moreover, the extended model has an additional input signal
The extended model presented in Fig. 4 is an example of a hybrid continuous-discrete-time system. It contains both, continuous-time part (for time
The adjoint system stimulated by the Kronecker pulse
β(tf−t)=∇ψ(t)J | (6) |
This signal in the time interval
To illustrate how the proposed approach works, we provide results of six numerical examples. They were performed under different conditions which are shown in Table 1. First of all, three different models, with different number of delays and their location, were used. Structures (block diagrams) of models A, B and C are presented in figures 6, 12 and 14 respectively. Moreover, in all examples times of discrete measurements
Example | Model | Number of delays | Sampling time | Initial function(s) | Delay time(s) | Results |
1 | A (Fig. 6) | 1 | 0+ | Estimated | Known | Fig. 8 |
2 | A (Fig. 6) | 1 | 0+ | Estimated | Estimated | Fig. 9 |
3 | A (Fig. 6) | 1 | 0.1 | Estimated | Estimated | Fig. 10 |
4 | A (Fig. 6) | 1 | 0.1 | Fixed (=0) | Estimated | Fig. 11 |
5 | B (Fig. 12) | 2 | 0+ | Estimated | Known | Fig. 13 |
6 | C (Fig. 14) | 2 | 0+ | Estimated | Known | Fig. 15 |
Each model, A, B and C, is a first-order system, described by a first-order delay differential equation and hence contains one integrating element - transfer function
Each model has one scalar external input signal
In every numerical example measurements
1In fact, in presented numerical examples both plant and the model are "models" but we consistently use two different names to emphasize that the plant generates measurements and the model is fitted to measurements.
The gradient obtained by simulation of the adjoint model is used in the simplest iterative gradient-based optimization procedure:
ψk+1(t)=ψk(t)−c∇ψ(t)J | (7) |
where
τk+1=τk−c∇τJ | (8) |
with the same value of
Example 1. In the first example the model A is used. It is presented in Fig. 6.
It is a simple first-order system with one delay described by the following delay differential equation:
˙y(t)=−y(t)−y(t−τ)+u(t) | (9) |
An adjont system for the Model A, created by using rules described in [5], [7], is presented in Fig. 7.
This is a part of the overall adjont system from Fig. 5 and generates a function
The unknown (estimated) initial function
In this example, and in the next one, it is assumed that measurements are quasi-continuous i.e. sampling time is infinitesimally small2
2For the computer simulation it is the same as the variable step size (with assumed upper limit) used by ODE solver.
The results of the estimation of the initial function obtained after nearly 500 iterations of the gradient-descent optimization procedure (7) are presented in Fig. 8. The starting initial function
Example 2. The only difference between Example 2 and the previous Example 1 is that delay time
Once again, it can bee seen that the estimation process is convergent. The estimated value of
Example 3. In this example measurements are no longer quasi-countinuous. The sampling time
The estimated initial functions of the delay element differs from the true initial function used in the plant, see Fig. 10a. There are characteristic "jumps" caused by sampling. As previously, the delay time is estimated correctly - Fig. 10b. The difference between
Nevertheless, the performance index, which takes into account only discrete-time measurements, reached a very small value. Furthermore, one can see that the prediction error
● the gradient of the objective function in the initial function space is calculated correctly,
● the output the model fits the discrete-time measurements and estimation procedure is convergent in the sense of the objective function value,
● the initial function of the delay, in general, is not convergent to the true initial function.
The last conclusion is more general. It is impossible to reconstruct perfectly a continuous function based only on discrete-time data, without further assumptions, like for example assumption about a frequency band limits in Nyquist-Shannon theorem. However, from the practical point of view, one can see that the initial function is estimated pretty well and it is close to the true function.
Example 4. In this example we show results of estimation of the delay time
One can see that the objective function is decreased. It means that the gradient of the objective function w.r.t the delay time is calculated correctly. However, the delay time is not estimated correctly. Even when
The conclusion coming from this example is that it is still worthwhile to estimate the initial functions of delays, even when we know that this estimate is not accurate (like in Example 3) or when we are not interested in information about the initial function at all. Simultaneous estimation of model's parameters and initial functions of delays improves estimation results of these parameters.
Example 5. In the next two examples we show results of initial functions estimation when there are more delays in the system. In Example 5 Model B with two delays, presented in Fig. 12, is used. The structure of the system is similar to the model A, except for the second delay acting in the upper branch.
The initial function for the second delay used in the plant is a sine wave presented in Fig. 13b - dashed line.
One can see from Fig. 13 that both initial function are estimated correctly like in Examples 1 and 2. Once again output signals of the model and the plant presented in Fig. 13d are nearly indistinguishable due to very small prediction error. Now, let us go to the next example where we will see a non-identifiable case.
Example 6. The structure of the model C used in this example is presented in Fig. 14. Like the model B, it also contains a second delay but placed in input channel (the input signal
Let us look at results of the gradient-based estimation process presented in Fig. 15. One can see that both initial function are estimated incorrectly, see Fig. 15a and 15b. Nevertheless, the output of the model is close to the output of the plant Fig. 15c. It means the solution is not unique and initial functions of these two delays are not identifiable. Besides the functions used in the plant, there are also other (at least two found in this example) optimal functions minimizing the performance index
In this work a gradient-based approach to estimation of initial functions for sampled non-linear systems with delays has been presented. The gradient of the appropriately defined quadratic objective function in the space of the initial function is obtained by using so-called structural sensitivity analysis.
Six numerical examples: for different sampling times, different number of delays and where delay time has been also estimated together with initial functions, have been presented. All these examples have shown that it is possible to efficiently calculate the gradient of the objective function in the space of initial functions.
Nevertheless, for some cases we have encountered the problem of non-identifiability - the objective function has been minimized, but the estimated initial function has differed from the reference initial function. It has been shown that discrete (non-continuous) measurements cause non-identifiability of continuous initial functions. The observed differences between outputs of the plant and the model comes from the nature of measurements - the output of the plant is measured only at (relatively rare) discrete moments, for which the prediction error is negligible but between them it stays significant. Two, or more initial functions can also be non-identifiable, even for (quasi-) continuous measurements.
Results obtained on this work suggest further investigation the of non-identifiability problem of initial functions. There are also some possibilities to decrease (not to eliminate) the prediction error between sampling times and they will be investigated in the future works.
This work was funded by the Polish National Science Centre under grant DEC-2013/11/B/ST7/01713. Part of calculations were performed on the Ziemowit computational cluster (http://www.ziemowit.hpc.polsl.pl) created in the POIG.02.01.00-00-166/08 project (BIO-FARMA) and expanded in the POIG.02.03.01-00-040/13 project (Syscancer). Ronald Hancock (Laval University, Laval, QC, Canada) is greatly acknowledged for valuable comments and English editing of the manuscript.
[1] | [ M. Anguelova,B. Wennberg, State elimination and identifiability of the delay parameter for nonlinear time-delay systems, Automatica, 44 (2008): 1373-1378. |
[2] | [ C. T. H. Baker,E. I. Parmuzin, Identification of the initial function for nonlinear delay differential equations, Russ. J. Numer. Anal. Math. Modelling, 20 (2005): 45-66. |
[3] | [ C. T. H. Baker,E. I. Parmuzin, Initial function estimation for scalar neutral delay differential equations, Russ. J. Numer. Anal. Math. Modelling, 23 (2008): 163-183. |
[4] | [ L. Belkoura,J. P. Richard,M. Fliess, Parameters estimation of systems with delayed and structured entries, Automatica, 45 (2009): 1117-1125. |
[5] | [ K. Fujarewicz,A. Galuszka, Generalized backpropagation through time for continuous time neural networks and discrete time measurements, Artificial Intelligence and Soft Computing -ICAISC 2004 (eds. L. Rutkowski, J. Siekmann, R. Tadeusiewicz and L. A. Zadeh), Lecture Notes in Computer Science, 3070 (2004): 190-196. |
[6] | [ K. Fujarewicz,M. Kimmel,A. Swierniak, On fitting of mathematical models of cell signaling pathways using adjoint systems, Math. Biosci. Eng., 2 (2005): 527-534. |
[7] | [ K. Fujarewicz,M. Kimmel,T. Lipniacki,A. Swierniak, Adjoint systems for models of cell signalling pathways and their application to parametr fitting, IEEE/ACM Transactions on Computational Biology and Bioinformatics, 4 (2007): 322-335. |
[8] | [ K. Fujarewicz,K. Lakomiec, Parameter estimation of systems with delays via structural sensitivity analysis, Discrete and Continuous Dynamical Systems -series B, 19 (2014): 2521-2533. |
[9] | [ K. Fujarewicz,K. Lakomiec, Adjoint sensitivity analysis of a tumor growth model and its application to spatiotemporal radiotherapy optimization, Mathematical Biosciences and Engineering, 13 (2016): 1131-1142. |
[10] | [ M. Jakubczak,K. Fujarewicz, Application of adjoint sensitivity analysis to parameter estimation of age-structured model of cell cycle, in Information Technologies in Medicine, (eds. E. Pietka, P. Badura, J. Kawa and W. Wieclawek), Advances in Intelligent Systems and Computing, 472 (2016): 123-131. |
[11] | [ K. Ł akomiec, S. Kumala, R. Hancock, J. Rzeszowska-Wolny and K. Fujarewicz, Modeling the repair of DNA strand breaks caused by γ-radiation in a minichromosome, Physical Biology 11 (2014), 045003. |
[12] | [ M. Liu,Q. G. Wang,B. Huang,C. C. Hang, Improved identification of continuous-time delay processes from piecewise step tests, Journal of Process Control, 17 (2007): 51-57. |
[13] | [ R. Loxton,K. L. Teo,V. Rehbock, An optimization approach to state-delay identification, IEEE Transactions on Automatic Control, 55 (2010): 2113-2119. |
[14] | [ B. Ni,D. Xiao,S. L. Shah, Time delay estimation for MIMO dynamical systems with time-frequency domain analysis, Journal of Process Control, 20 (2010): 83-94. |
[15] | [ B. Rakshit,A. R. Chowdhury,P. Saha, Parameter estimation of a delay dynamical system using synchronization inpresence of noise, Chaos, Solitons and Fractals, 32 (2007): 1278-1284. |
[16] | [ J. P. Richard, Time-delay systems: An overview of some recent advances and open problems, Automatica, 39 (2003): 1667-1694. |
[17] | [ Y. Tang,X. Guan, Parameter estimation of chaotic system with time-delay: A differential evolution approach, Chaos, Solitons and Fractals, 42 (2009): 3132-3139. |
[18] | [ Y. Tang,X. Guan, Parameter estimation for time-delay chaotic systems by particle swarm optimization, Chaos, Solitons and Fractals, 40 (2009): 1391-1398. |
1. | Krzysztof Łakomiec, Karolina Kurasz, Krzysztof Fujarewicz, 2019, Chapter 42, 978-3-319-91210-3, 481, 10.1007/978-3-319-91211-0_42 | |
2. | Krzysztof Fujarewicz, Krzysztof Łakomiec, Spatiotemporal sensitivity of systems modeled by cellular automata, 2018, 41, 01704214, 8897, 10.1002/mma.5358 | |
3. | Krzysztof Fujarewicz, Krzysztof Łakomiec, 2020, Chapter 48, 978-3-030-50935-4, 567, 10.1007/978-3-030-50936-1_48 |
Example | Model | Number of delays | Sampling time | Initial function(s) | Delay time(s) | Results |
1 | A (Fig. 6) | 1 | 0+ | Estimated | Known | Fig. 8 |
2 | A (Fig. 6) | 1 | 0+ | Estimated | Estimated | Fig. 9 |
3 | A (Fig. 6) | 1 | 0.1 | Estimated | Estimated | Fig. 10 |
4 | A (Fig. 6) | 1 | 0.1 | Fixed (=0) | Estimated | Fig. 11 |
5 | B (Fig. 12) | 2 | 0+ | Estimated | Known | Fig. 13 |
6 | C (Fig. 14) | 2 | 0+ | Estimated | Known | Fig. 15 |
Example | Model | Number of delays | Sampling time | Initial function(s) | Delay time(s) | Results |
1 | A (Fig. 6) | 1 | 0+ | Estimated | Known | Fig. 8 |
2 | A (Fig. 6) | 1 | 0+ | Estimated | Estimated | Fig. 9 |
3 | A (Fig. 6) | 1 | 0.1 | Estimated | Estimated | Fig. 10 |
4 | A (Fig. 6) | 1 | 0.1 | Fixed (=0) | Estimated | Fig. 11 |
5 | B (Fig. 12) | 2 | 0+ | Estimated | Known | Fig. 13 |
6 | C (Fig. 14) | 2 | 0+ | Estimated | Known | Fig. 15 |