Numerical study of discretization algorithms for stable estimation of disease parameters and epidemic forecasting

Aurelie Akossi; Gerardo Chowell-Puente; Alexandra Smirnova; Aurelie Akossi; Gerardo Chowell-Puente; Alexandra Smirnova

doi:10.3934/mbe.2019182

Mathematical Biosciences and Engineering

2019, Volume 16, Issue 5: 3674-3693. doi: 10.3934/mbe.2019182

Previous Article Next Article

Research article Special Issues

Numerical study of discretization algorithms for stable estimation of disease parameters and epidemic forecasting

1.
Department of Mathematics and Statistics, Georgia State University, Atlanta, USA
2.
Department of Population Health Sciences, Georgia State University, Atlanta, USA

Received: 20 February 2019 Accepted: 17 April 2019 Published: 25 April 2019

In this paper we investigate how various discretization schemes could be incorporated in regularization algorithms for stable parameter estimation and forecasting in epidemiology. Specifically, we compare parametric and nonparametric discretization tools in terms of their impact on the accuracy of recovered disease parameters as well as their impact on future projections of new incidence cases. Both synthetic and real data for 1918 ``Spanish Flu" pandemic in San Francisco are considered. The discrete approximation of a time dependent transmission rate is combined with the Levenberg-Marquardt algorithm used to solve the nonlinear least squares problem aimed at fitting the model to limited incidence data for an unfolding outbreak. Our simulation study highlights the crucial role of a priori information at the early stage of an epidemic in mitigating the lack of stability in over-parameterized models with insufficient data. Fortunately, our results suggest that a balanced combination of problem-oriented regularization techniques is one way in which scientists can still draw useful conclusions about system parameters and in turn generate reliable forecasts that policy makers could use to guide control interventions.

Keywords:

Citation: Aurelie Akossi, Gerardo Chowell-Puente, Alexandra Smirnova. Numerical study of discretization algorithms for stable estimation of disease parameters and epidemic forecasting[J]. Mathematical Biosciences and Engineering, 2019, 16(5): 3674-3693. doi: 10.3934/mbe.2019182

Related Papers:

[1]	Li-Xiang Feng, Shuang-Lin Jing, Shi-Ke Hu, De-Fen Wang, Hai-Feng Huo . Modelling the effects of media coverage and quarantine on the COVID-19 infections in the UK. Mathematical Biosciences and Engineering, 2020, 17(4): 3618-3636. doi: 10.3934/mbe.2020204
[2]	Xiaoqiang Dai, Kuicheng Sheng, Fangzhou Shu . Ship power load forecasting based on PSO-SVM. Mathematical Biosciences and Engineering, 2022, 19(5): 4547-4567. doi: 10.3934/mbe.2022210
[3]	Haoyu Wang, Xihe Qiu, Jinghan Yang, Qiong Li, Xiaoyu Tan, Jingjing Huang . Neural-SEIR: A flexible data-driven framework for precise prediction of epidemic disease. Mathematical Biosciences and Engineering, 2023, 20(9): 16807-16823. doi: 10.3934/mbe.2023749
[4]	Gianni Gilioli, Sara Pasquali, Fabrizio Ruggeri . Nonlinear functional response parameter estimation in a stochastic predator-prey model. Mathematical Biosciences and Engineering, 2012, 9(1): 75-96. doi: 10.3934/mbe.2012.9.75
[5]	Tailei Zhang, Hui Li, Na Xie, Wenhui Fu, Kai Wang, Xiongjie Ding . Mathematical analysis and simulation of a Hepatitis B model with time delay: A case study for Xinjiang, China. Mathematical Biosciences and Engineering, 2020, 17(2): 1757-1775. doi: 10.3934/mbe.2020092
[6]	Karyn L. Sutton, H.T. Banks, Carlos Castillo-Chávez . Estimation of invasive pneumococcal disease dynamics parameters and the impact of conjugate vaccination in Australia. Mathematical Biosciences and Engineering, 2008, 5(1): 175-204. doi: 10.3934/mbe.2008.5.175
[7]	Damilola Olabode, Jordan Culp, Allison Fisher, Angela Tower, Dylan Hull-Nye, Xueying Wang . Deterministic and stochastic models for the epidemic dynamics of COVID-19 in Wuhan, China. Mathematical Biosciences and Engineering, 2021, 18(1): 950-967. doi: 10.3934/mbe.2021050
[8]	Davide De Gaetano . Forecasting volatility using combination across estimation windows: An application to S&P500 stock market index. Mathematical Biosciences and Engineering, 2019, 16(6): 7195-7216. doi: 10.3934/mbe.2019361
[9]	Sarita Bugalia, Jai Prakash Tripathi, Hao Wang . Estimating the time-dependent effective reproduction number and vaccination rate for COVID-19 in the USA and India. Mathematical Biosciences and Engineering, 2023, 20(3): 4673-4689. doi: 10.3934/mbe.2023216
[10]	Sha He, Sanyi Tang, Libin Rong . A discrete stochastic model of the COVID-19 outbreak: Forecast and control. Mathematical Biosciences and Engineering, 2020, 17(4): 2792-2804. doi: 10.3934/mbe.2020153

Abstract

1. Introduction

Stable estimation of system parameters for infectious disease outbreaks is of paramount importance to the design of adequate forecasting algorithms ^[1,2,3]. Oftentimes parameter estimation procedures are cast as ODE-constrained nonlinear least squares problems, where infinite dimensional time dependent disease parameters need to be recovered from finite dimensional data sets. As the result, the Jacobian of the corresponding parameter-to-data operator is generally ill-conditioned and may be numerically singular. When such an operator is fitted to noise-contaminated epidemiological data, the estimated parameters tend to be entirely unreliable due to severe error propagation into the approximate solution. The sources of noise in the reported incidence data vary for different types of diseases and can be attributed to possible under or over reporting owing to, for instance, a large proportion of asymptomatic cases or false diagnostics.

Noisy data coupled with modeling, discretization, and computational errors necessitate the use of special mathematical tools known as regularization ^[4,5]. It amounts to solving some "nearby" auxiliary problem in place of the initial one. The auxiliary problem has to be formulated in such a way that its solution is less sensitive to noise propagation as opposed to the solution of the original problem.

A time dependent transmission rate of an infectious disease is an important parameter, which can be defined as the effective contact rate, that is, the probability of infection given contact between an infectious and susceptible individual multiplied by the average rate of contacts between these groups. Generally the transmission rate cannot be pre-estimated since it depends on multiple environmental, genetic, social, and other factors. Hence one has to recover cause form effect using epidemiological data for an emerging outbreak together with a suitable compartmental model governing the disease. Once recovered and extrapolated, the transmission rate can be used to project future incidence cases. That, in turn, may be helpful in the design of effective control measures and optimal resource allocation.

In what follows, we use Matlab built-in implementation of the Levenberg-Marquardt algorithm ^[6,7] to reconstruct a variable transmission rate. The regularization provided by this optimization scheme, which is a penalized version of the Gauss-Newton procedure ^[8], is enforced by the appropriate problem-oriented discretization tools. Specifically, we compare what we call parametric and non-parametric discretization routines. By parametric discretization we mean that the transmission rate is modeled by a pre-defined function that involves only a few parameters. The rationale behind this approach is simple: if one is given some a priori information about the outbreak, one can reasonably choose an appropriate expression to describe changes in the transmission coefficient. For example, in case of a single cycle outbreak with the incorporation of control measures at the beginning stages, it is reasonable to assume a declining transmission rate defined, say, by a hyperbolic, harmonic, or exponential function. While parametric discretization may not capture all aspects of the actual transmission rate, it may capture enough crucial information to provide a useful forecasting tool. Our main expectation is that recovering fewer parameters helps mitigate instability caused by noise and the lack of data without, we hope, a significant loss in accuracy.

At the same time, even for a single-cycle outbreak, the transmission rate of an infectious disease may vary depending on the type of a disease, population group, characteristics of a region, and the efficiency of control measures. So, realistically we cannot expect transmission rates to always exhibit a simple decline pattern and therefore parametric discretization inevitably leads to a loss of information. In order to better capture the shape of the time dependent transmission rate, one has to use non-parametric discretization schemes. In such schemes, the transmission rate is projected onto a subspace spanned by a finite set of orthogonal polynomials or spline functions. Again, depending on the nature of the transmission, one may use Legendre or Chebyshev polynomials, B-splines, wavelets, or other base functions.

The main goal of our numerical study is to see how parametric and non-parametric discretization schemes compare in terms of accuracy of parameter estimation and in terms of their ability to provide a reliable forecasting tool. The paper is organized as follows. In Section 2, the governing SEIR model and the regularized inversion procedure are outlined, followed by the discussion of parametric and non-parametric discretization algorithms. In Section 3, numerical experiments with synthetic data are presented. Simulation results with real data for the 1918 influenza outbreak in San Francisco are given in Section 4. Future plans are summarized in Section 5.

2. Problem formulation and mathematical preliminaries

Consider a well-mixed population of size $N$ , where individuals have the same probability of being in contact with each other. The population is sorted into four classes: susceptible ( $S$ ), exposed ( $E$ ), infectious ( $I$ ) and removed ( $R$ ) ^[9] as shown in Figure 2.

Figure 1. B-spline base functions used for

$10$ weeks of data and

$h = 4$ .

Variable	Parameter
$N$	Total effective population size
$\beta(t)$	Transmission rate
$1/\kappa$	Average incubation period
$1/\gamma$	Average time from the onset of symptoms to recovery

Variable	Experiment 1	Experiment 2
$N$	6,000,000	55,000
$1/\kappa$	$8/7$ weeks	$2$ days
$1/\gamma$	$6/7$ weeks	$3$ days

	$10$ Data points	$30$ Data points	$50$ Data points
Experiment $1$	$\tau_0 = 10^{10}$	$\tau_0 = 10^{8}$	$\tau_0 = 10^{8}$
	$h=4$	$h=12$	$h=20$
Experiment $2$	$\tau_0 = 10^{12}$	$\tau_0 = 10^{12}$	$\tau_0 = 10^{12}$
	$h=4$	$h=12$	$h=20$
Experiment $3$	$\tau_0 = 10^{11}$	$\tau_0 = 10^{11}$	$\tau_0 = 10^{13}$
	$h=4$	$h=12$	$h=20$

[1]	N. Tuncer, C. Mohanakumar, S. Swanson, et al., E cacy of control measures in the control of Ebola, Liberia 2014–2015, J. Biol. Dynam., 12 (2018), 913–937.
[2]	N. Tuncer, M. Marctheva, B. LaBarre, et al., Structural and practical identifiability analysis of ZIKA epidemiological models, B. Math. Biol., 80 (2018), 2209–2241.
[3]	G. Chowell, L. Sattenspiel, S. Bansal, et al., Mathematical models to characterize early epidemic growth: a review, Phys. life rev., 18 (2016), 66–97.
[4]	A. B. Bakunshinsky and M. Yu. Kokurin, Iterative methods for Ill-Posed Operator Equations with Smooth Operators, Springer, Dordrecht, Great Britain, 2004.
[5]	H. Engl, M. Hanke and A. Neubauer, Regularization of Inverse Problems, Kluwer Academic Pub-lisher, Dordecht, Boston, London, 1996.
[6]	J. E. Dennis and R. B. Schnabel, Numerical Methods for Unconstrained Optimization and Nonlin-ear Equations, Prentice-Hall, Englewood Cli s, New Jersey, 1983.
[7]	J. Nocedal and S. J. Wright, Numerical Optimization, Springer-Verlag, New York, 1999.
[8]	A. Smirnova, R. Renaut and T. Khan, Convergence and applications of a modified iteratively regularized Gauss-Newton algorithm, Inverse Probl., 23 (2007), 1546–1563.
[9]	R. M. Anderson and R. M. May, Infectious Diseases of Humans: Dynamics and Control, Oxford University Press Inc, New York, 1992.
[10]	C. de Boor, A Practical Guide to Splines, Springer-Verlag, 1978.
[11]	B. Efron and R. Tibshirani, Bootstrap methods for standard errors, confidence intervals, and other measures of statistical accuracy, Stat. Sci., 1 (1986), 54–75.
[12]	G. Chowell, C. E. Ammon, N.W. Hengartner, et. al., Transmission dynamics of the great influenza pandemic of 1918 in Geneva, Switzerland: Assessing the e ects of hypothetical interventions, J. Theor. Biol, 241 (2006), 193–204.
[13]	B. Kaltenbacher, A. Neubauer and O. Scherzer, Iterative regularization methods for nonlinear illposed problems, Radon Series on Computational and Applied Mathematics, 6, Walter de Gruyter, Berlin, 2008.
[14]	A. Sirmnova, B. Sirb and G. Chowell, On stable parameter estimation and forecasting in epidemi-ology by the Levenberg-Marquardt Algorithm with Broyden's rank-one updates for the Jacobian operator, B. Math. Biol., 2019.
[15]	G. Chowell, M. MacLachan and E. P. Fenichel, Accounting for behavioral responses during a flu epidemic using home television viewing, BMC Infect. Dis., 15 (2015), 21.
[16]	P. Guo, Q. Zhang, Y. Chen, et al., An ensemble forecast model of dengue in Guangzhou, China using climate and social media surveillance data, Sci. Total Environ., 647 (2019), 752–762.
[17]	L. Kim, S. M. Fast and N. Markuzon, Incorporating media data into a model of infectious disease transmission, PLoS One 14 (2019), e0197646.
[18]	C. A Marques-Toledo, C. M. Degener, L. Vinhal, et al., Dengue prediction by the web: Tweets are a useful tool for estimating and forecasting Dengue at country and city level, PLoS Negl. Trop. Dis., 11 (2017), e0005729.
[19]	Y. Teng, D. Bi, G. Xie, et al., Dynamic forecasting of Zika epidemics using google trends, PLoS One, 12 (2017), e0165085.

1.	Benjamin Wacker, Jan Christian Schlüter, A cubic nonlinear population growth model for single species: theory, an explicit–implicit solution algorithm and applications, 2021, 2021, 1687-1847, 10.1186/s13662-021-03399-5
2.	Slavi G. Georgiev, Lubin G. Vulkov, 2023, Chapter 4, 978-3-031-20950-5, 34, 10.1007/978-3-031-20951-2_4
3.	Slavi Georgiev, Lubin Vulkov, Numerical Coefficient Reconstruction of Time-Depending Integer- and Fractional-Order SIR Models for Economic Analysis of COVID-19, 2022, 10, 2227-7390, 4247, 10.3390/math10224247
4.	Slavi G. Georgiev, Lubin G. Vulkov, 2022, 2528, 0094-243X, 080025, 10.1063/5.0101044
5.	Slavi Georgiev, Mathematical Identification Analysis of a Fractional-Order Delayed Model for Tuberculosis, 2023, 7, 2504-3110, 538, 10.3390/fractalfract7070538

Mathematical Biosciences and Engineering

Numerical study of discretization algorithms for stable estimation of disease parameters and epidemic forecasting

Related Papers:

Abstract

1. Introduction

2. Problem formulation and mathematical preliminaries

3. Numerical experiments with synthetic data

4. Simulations with real data

5. Conclusions and discussion

Acknowledgments

Conflict of interest

References

This article has been cited by:

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Figures and Tables

Other Articles By Authors

Related pages

Tools

Export File

Citation

Format

Content

Catalog

Abstract

1. Introduction

2. Problem formulation and mathematical preliminaries

3. Numerical experiments with synthetic data

4. Simulations with real data

5. Conclusions and discussion

Acknowledgments

Conflict of interest

References