Citation: Andreas Widder. On the usefulness of set-membership estimation in the epidemiology of infectious diseases[J]. Mathematical Biosciences and Engineering, 2018, 15(1): 141-152. doi: 10.3934/mbe.2018006
[1] | Masoud Saade, Samiran Ghosh, Malay Banerjee, Vitaly Volpert . An epidemic model with time delays determined by the infectivity and disease durations. Mathematical Biosciences and Engineering, 2023, 20(7): 12864-12888. doi: 10.3934/mbe.2023574 |
[2] | Ayako Suzuki, Hiroshi Nishiura . Transmission dynamics of varicella before, during and after the COVID-19 pandemic in Japan: a modelling study. Mathematical Biosciences and Engineering, 2022, 19(6): 5998-6012. doi: 10.3934/mbe.2022280 |
[3] | Minami Ueda, Tetsuro Kobayashi, Hiroshi Nishiura . Basic reproduction number of the COVID-19 Delta variant: Estimation from multiple transmission datasets. Mathematical Biosciences and Engineering, 2022, 19(12): 13137-13151. doi: 10.3934/mbe.2022614 |
[4] | F. Berezovskaya, G. Karev, Baojun Song, Carlos Castillo-Chavez . A Simple Epidemic Model with Surprising Dynamics. Mathematical Biosciences and Engineering, 2005, 2(1): 133-152. doi: 10.3934/mbe.2005.2.133 |
[5] | Yukun Tan, Durward Cator III, Martial Ndeffo-Mbah, Ulisses Braga-Neto . A stochastic metapopulation state-space approach to modeling and estimating COVID-19 spread. Mathematical Biosciences and Engineering, 2021, 18(6): 7685-7710. doi: 10.3934/mbe.2021381 |
[6] | Holly Gaff, Elsa Schaefer . Optimal control applied to vaccination and treatment strategies for various epidemiological models. Mathematical Biosciences and Engineering, 2009, 6(3): 469-492. doi: 10.3934/mbe.2009.6.469 |
[7] | Glenn Ledder, Donna Sylvester, Rachelle R. Bouchat, Johann A. Thiel . Continuous and pulsed epidemiological models for onchocerciasis with implications for eradication strategy. Mathematical Biosciences and Engineering, 2018, 15(4): 841-862. doi: 10.3934/mbe.2018038 |
[8] | Mohammed Meziane, Ali Moussaoui, Vitaly Volpert . On a two-strain epidemic model involving delay equations. Mathematical Biosciences and Engineering, 2023, 20(12): 20683-20711. doi: 10.3934/mbe.2023915 |
[9] | David J. Gerberry . An exact approach to calibrating infectious disease models to surveillance data: The case of HIV and HSV-2. Mathematical Biosciences and Engineering, 2018, 15(1): 153-179. doi: 10.3934/mbe.2018007 |
[10] | Anuj Mubayi, Christopher Kribs Zaleta, Maia Martcheva, Carlos Castillo-Chávez . A cost-based comparison of quarantine strategies for new emerging diseases. Mathematical Biosciences and Engineering, 2010, 7(3): 687-717. doi: 10.3934/mbe.2010.7.687 |
Infectious diseases are an ever-current topic in human society. The impact on life expectancy, quality of life, and the economy of the affected communities can be immense. This is true not only of decades-long lethal pandemics such as HIV/AIDS ([6]) and violent disease outbreaks such as the recent Ebola virus epidemic ([17]), but also less threatening but persistent diseases such as influenza ([14]). Even diseases that affect only animals, such as foot and mouth disease, can have a noticeable impact on the economy ([7]).
Given the importance of the subject, it is consequential that many different, powerful mathematical modelling techniques have been developed that are widely used (for an introduction see for example [9,11]). For the sake of simplicity we will presently only deal with models described by ordinary differential equations (ODEs), which in particular includes compartmental models. In these models the population is divided into several sub-populations (so called compartments) and the dynamics are described by giving the rates at which individuals progress from one compartment to another. As an example, among the simplest such models are susceptible-infected-recovered-models (SIR-models) where the population is divided into the three mentioned compartments and the dynamics are given by
˙S(t)=−βI(t)S(t),˙I(t)=βI(t)S(t)−γI(t),˙R(t)=γI(t). |
In order for this model to be able to correctly describe the progression of a disease, the two parameters
The result is that knowledge about the parameter
The paper is organised as follows. In Section 2 we present the set-estimation procedure in a general setting. In Section 3 we discuss this procedure and explore some of its properties. In Section 4 we demonstrate the technique in some numerical examples by applying it to data from the Ebola virus outbreak in 2014.
We consider an ODE system
˙x(t)=f(t,x(t),p),x(0)=x0. | (1) |
Here,
We assume that the parameter
R(T)={x∈Rn:∃p∈P so that x=x[p](T)}. |
To put this in an epidemiological context using the simple example given in the introduction, the vector
We will in the following describe how an estimate
Here we present a method to find approximations to the exact set-membership approximation
For a fixed time
Jℓ(p):=⟨ℓ,x[p](T)⟩→max, | (2) |
where
⟨ℓ,xℓ⟩≥supp∈P⟨ℓ,x[p](T)⟩−ε. |
Obviously,
Let
R(T)⊆⋂i=1,…,k{x∈Rn:⟨ℓi,x−xℓi⟩≤ε}=:EΛ(T). |
Consequently,
If we are only interested in a set-membership estimation in the subspace
prL(R(T))⊆prL(⋂i=1,…,k{x∈Rn:⟨ℓi,x−xℓi⟩≤ε})=:EL(T). |
For example, if in an
For
Jℓ(p)=⟨ℓ,x[p](T)⟩. |
Note that the solution to this problem gives the scalar projection of the vector
Denote by
˙λ[p](t)=−f′x(t,x[p](t),p)∗λ[p](t),λ[p](T)=−ℓ. | (3) |
Theorem 2.1. The functional
J′ℓ(p)=−T∫0f′p(t,x[p](t),p)∗λ[p](t)dt. | (4) |
Proof. We have
Jℓ(p)−Jℓ(ˆp)=⟨ℓ,x[p](T)−x[ˆp](T)⟩ |
and
T∫0⟨˙λ[ˆp](t),x[p](t)−x[ˆp](t)⟩dt=⟨λ[ˆp](T),x[p](T)−x[ˆp](T)⟩ −⟨λ[ˆp](0),x[p](0)−x[ˆp](0)⟩−T∫0⟨λ[ˆp](t),˙x[p](t)−˙x[ˆp](t)⟩dt. |
Using that
Jℓ(p)−Jℓ(ˆp)=−T∫0⟨˙λ[ˆp](t),x[p](t)−x[ˆp](t)⟩+⟨λ[ˆp](t),˙x[p](t)−˙x[ˆp](t)⟩dt =−T∫0⟨˙λ[ˆp](t),x[p](t)−x[ˆp](t)⟩+⟨λ[ˆp](t),f(t,x[p](t),p)−f(t,x[ˆp](t),ˆp)⟩dt =−T∫0⟨˙λ[ˆp](t),x[p](t)−x[ˆp](t)⟩+⟨λ[ˆp](t),f′x(t,x[ˆp](t),ˆp)(x[p](t)−x[ˆp](t))⟩ +⟨λ[ˆp](t),f′p(t,x[ˆp](t),ˆp)(p−ˆp)⟩dt+o(|p−ˆp|) =⟨−T∫0f′p(t,x[ˆp](t),ˆp)∗λ[ˆp](t)dt,p−ˆp⟩+o(|p−ˆp|). |
The last equality is due to (3), while the fact that the remaining terms in the Taylor approximation are
Using the result of the previous section we can derive a numerical gradient projection procedure for solving the problem (2). Starting with a point
The optimisation problem (2) has to be solved for many different times
Another possible difficulty in this procedure is the calculation of the projection onto the set
![]() |
Here we want to discuss some attributes of the set-membership estimation that is described in Section 2.
As already noted, the set-membership estimation
The first way is to consider only estimations of a single component of
If an approximation of the non-convex set
This procedure may be numerically costly so that it might be more reasonable to consider the numerically cheaper estimates for the convex hull of
There are some special cases in which other procedures may be more effective to calculate the reachable set. For example, if the parameter set is one dimensional, i.e. given by an interval
Set-membership estimates can be used to calculate worst-case-scenarios (or best-case-scenarios). If the quantity in question is one of the components of
If the quantity in question is not one of the components of
Furthermore, if we are interested in a quantity
Another useful aspect of this calculations is that not only do we get information about the extremal values of
A common way (see e.g. [2]) to obtain pointwise confidence intervals for the trajectories of various variables is the following: Calculate the maximum likelihood estimate
This is based on the fact that under certain circumstances the maximum likelihood estimate is asymptotically normally distributed. In particular this approach is only valid if enough data is available for the estimator to approach this asymptotic distribution. Set-membership estimates can be used to describe confidence intervals in a different way, without any need of further assumptions.
Under the model assumption, i.e. that the trajectories follow equation (1), every trajectory is given by
1We use here frequentist connotation. However, the same argument can be made in Bayesian terms, if
P(x[ˆp](T)∈R(T))=P(x[ˆp](T)∈R(T)∣ˆp∈P)P(ˆp∈P) +P(x[ˆp](T)∈R(T)∣ˆp∉P)P(ˆp∉P). | (5) |
By the definition of the reachable set the first probability on the right hand side is
P(x[ˆp](T)∈R(T))≥0.95. |
This probability is of course to be interpreted in the sense of a confidence region. That is, if the reachable set is calculated from a data set, there is at least a
Thus the reachable set describes a confidence region for the trajectory of at least the confidence level of the region for the parameter. The same is of course true if we look at the trajectory of components of
In this section we use a model and data presented by Althaus in [1] for the outbreak of the Ebola virus in 2014 in Guinea. The model equations are
˙S(t)=−βe−ktS(t)I(t)N(t),˙E(t)=βe−ktS(t)I(t)N(t)−σE(t),˙I(t)=σE(t)−γI(t),˙R(t)=(1−f)γI(t), | (6) |
together with
˙C(t)=σE(t),˙D(t)=fγI(t). |
Here,
2The values presented here are slightly different than the one in [1]. There, rounded values have been given. The more accurate values presented here were kindly provided by the author of [1] in personal correspondence.
β=0.268574(0.266924,0.270189),f=0.7368011(0.719586,0.754707),k=0.0023379(0.0022597,0.002415). |
The box described in
3The calculations here were carried out in Matlab. The code has been made public in [21].
Extremal parameters. In simple models it is sometimes obvious what parameter configuration maximises (or minimises) certain compartments. For example, looking at the equations in (6) we see that an increase in
In order to capture at least one time dependent effect, we now consider a slight variation of Equations (6). We now assume that recovered individuals may return into the susceptible class and let
˙S(t)=−βS(t)I(t)N(t)+ρR(t),˙E(t)=βS(t)I(t)N(t)−σE(t),˙I(t)=σE(t)−γI(t),˙R(t)=γI(t)−ρR(t). | (7) |
We assume that our knowledge about the parameters can be described by the inclusions
β∈[0.05,0.06],σ∈[0.3,0.35],γ∈[0.05,0.055],ρ∈[0.2,0.23]. |
These numbers are not based on data and are chosen to highlight the dynamical features. In Figure 3 we show the set-membership estimation for
In this paper we discuss a technique to calculate set-membership estimations of trajectories for dynamical systems in which some parameters are not known. We apply this technique, which is well known in control theory, to epidemiological models, and emphasise certain aspects that are of particular use in this context.
Firstly, we describe how the estimates can be interpreted as a confidence region for the trajectories if the knowledge about the parameters is given in form of confidence intervals, as is often the case. Since this is a topic of great interest, further analysis of this method is desired. In particular it would be of interest to study the efficiency of the presented method compared with other established ones.
We furthermore explain how this method can be used to find parameters which yield the highest or lowest value for a state variable of choice. The technique presented here identifies the best and worst case scenario given all available information, and indicates which parameters should be influenced to achieve a more preferable outcome. This may be of particular interest to policy makers.
In this paper we focus on ODE models in which the dynamics depend on a finite dimensional parameter vector. However, this method can be altered for more general settings. For example, Theorem 2.1 can easily be adapted to include the case that the initial condition
It can also be applied to heterogeneous models with structured populations, where the dynamics are described by partial differential equations. In such models, initial conditions are given by a density function in an infinite dimensional space. These densities are usually not known. Also, parameters may depend on the population structure and therefore lie in some infinite dimensional function space. Some results have been presented to deal with unknown initial conditions ([19,20]), and further research may provide results regarding unknown parameter functions.
Set-membership estimation is shown to be a flexible technique that can be applied in many different situations. It is not meant to supplant any existing technique, but to supplement them, and to help extract the maximum amount of information from epidemiological data.
[1] | [ C. L. Althaus, Estimating the reproduction number of Ebola virus (EBOV) during the 2014 outbreak in West Africa, PLOS Currents Outbreaks, (2014). |
[2] | [ C. L. Althaus,N. Low,E. O. Musa,F. Shuaib,S. Gsteiger, Ebola virus disease outbreak in Nigeria: Transmission dynamics and rapid control, Epidemics, 11 (2015): 80-84. |
[3] | [ H. Andersson and T. Britton, Stochastic Epidemic Models and Their Statistical Analysis, Lecture Notes in Statistics, 151. Springer-Verlag, New York, 2000. |
[4] | [ R. Baier,M. Gerdts, A computational method for non-convex reachable sets using optimal control, in 2009 European Control Conference (ECC), IEEE, null (2009): 97-102. |
[5] | [ D. P. Bertsekas, Dynamic Programming and Optimal Control, Athena Scientific, 1995. |
[6] | [ A. Bhargava,F. Docquier, HIV pandemic, medical brain drain, and economic development in Sub-Saharan Africa, The World Bank Economic Review, 22 (2008): 345-366. |
[7] | [ A. Blake,M. T. Sinclair,G. Sugiyarto, Quantifying the impact of foot and mouth disease on tourism and the UK economy, Tourism Economics, 9 (2003): 449-465. |
[8] | [ A. Capaldi,S. Behrend,B. Berman,J. Smith,J. Wright,A. L. Lloyd, Parameter estimation and uncertainty quantication for an epidemic model, Mathematical Biosciences and Engineering, 9 (2012): 553-576. |
[9] | [ O. Diekmann, H. Heesterbeek and T. Britton, Mathematical Tools for Understanding Infectious Disease Dynamics, Princeton University Press, 2013. |
[10] | [ J. Hsu, Multiple Comparisons: Theory and Methods, CRC Press, 1996. |
[11] | [ M. J. Keeling and P. Rohani, Modeling Infectious Diseases in Humans and Animals, Princeton University Press, 2008. |
[12] | [ A. B. Kurzhanski and P. Varaiya, Dynamics and Control of Trajectory Tubes, Springer, 2014. |
[13] | [ P. E. Lekone,B. F. Finkenstädt, Statistical inference in a stochastic epidemic SEIR model with control intervention: Ebola as a case study, Biometrics, 62 (2006): 1170-1177. |
[14] | [ N.-A. M. Molinari,I. R. Ortega-Sanchez,M. L. Messonnier,W. W. Thompson,P. M. Wortley,E. Weintraub,C. B. Bridges, The annual impact of seasonal influenza in the US: measuring disease burden and costs, Vaccine, 25 (2007): 5086-5096. |
[15] | [ E. Polak, Computational Methods in Optimization: A Unified Approach, vol. 77, Academic Press, New York-London, 1971. |
[16] | [ E. D. Sontag, Mathematical Control Theory: Deterministic Finite Dimensional Systems, Second edition. Texts in Applied Mathematics, 6. Springer-Verlag, New York, 1998. |
[17] | [ M. R. Thomas, G. Smith, F. H. Ferreira, D. Evans, M. Maliszewska, M. Cruz, K. Himelein and M. Over, The economic impact of Ebola on Sub-Saharan Africa: Updated estimates for 2015, World Bank Group. |
[18] | [ T. Toni,D. Welch,N. Strelkowa,A. Ipsen,M. P. Stumpf, Approximate Bayesian computation scheme for parameter inference and model selection in dynamical systems, Journal of the Royal Society Interface, 6 (2009): 187-202. |
[19] | [ T. Tsachev,V. M. Veliov,A. Widder, Set-membership estimation for the evolution of infectious diseases in heterogeneous populations, Journal of Mathematical Biology, 74 (2017): 1081-1106. |
[20] | [ V. Veliov,A. Widder, Modelling and estimation of infectious diseases in a population with heterogeneous dynamic immunity, Journal of Biological Dynamics, 10 (2016): 457-476. |
[21] | [ A. Widder, Matlab files for set-membership estimation, Zenodo, 2016. |