Feedback control of an HBV model based on ensemble kalman filter and differential evolution

  • Received: 13 March 2017 Revised: 14 September 2017 Published: 01 June 2018
  • MSC : Primary: 92B05; Secondary: 49K15

  • In this paper, we derive efficient drug treatment strategies for hepatitis B virus (HBV) infection by formulating a feedback control problem. We introduce and analyze a dynamic mathematical model that describes the HBV infection during antiviral therapy. We determine the reproduction number and then conduct a qualitative analysis of the model using the number. A control problem is considered to minimize the viral load with consideration for the treatment costs. In order to reflect the status of patients at both the initial time and the follow-up visits, we consider the feedback control problem based on the ensemble Kalman filter (EnKF) and differential evolution (DE). EnKF is employed to estimate full information of the state from incomplete observation data. We derive a piecewise constant drug schedule by applying DE algorithm. Numerical simulations are performed using various weights in the objective functional to suggest optimal treatment strategies in different situations.

    Citation: Junyoung Jang, Kihoon Jang, Hee-Dae Kwon, Jeehyun Lee. Feedback control of an HBV model based on ensemble kalman filter and differential evolution[J]. Mathematical Biosciences and Engineering, 2018, 15(3): 667-691. doi: 10.3934/mbe.2018030

    Related Papers:

    [1] Gerasimos G. Rigatos, Efthymia G. Rigatou, Jean Daniel Djida . Change detection in the dynamics of an intracellular protein synthesis model using nonlinear Kalman filtering. Mathematical Biosciences and Engineering, 2015, 12(5): 1017-1035. doi: 10.3934/mbe.2015.12.1017
    [2] Balázs Csutak, Gábor Szederkényi . Robust control and data reconstruction for nonlinear epidemiological models using feedback linearization and state estimation. Mathematical Biosciences and Engineering, 2025, 22(1): 109-137. doi: 10.3934/mbe.2025006
    [3] Xin Liu, Yingyuan Xiao, Xu Jiao, Wenguang Zheng, Zihao Ling . A novel Kalman Filter based shilling attack detection algorithm. Mathematical Biosciences and Engineering, 2020, 17(2): 1558-1577. doi: 10.3934/mbe.2020081
    [4] Yukun Tan, Durward Cator III, Martial Ndeffo-Mbah, Ulisses Braga-Neto . A stochastic metapopulation state-space approach to modeling and estimating COVID-19 spread. Mathematical Biosciences and Engineering, 2021, 18(6): 7685-7710. doi: 10.3934/mbe.2021381
    [5] H. Thomas Banks, Shuhua Hu, Zackary R. Kenz, Hien T. Tran . A comparison of nonlinear filtering approaches in the context of an HIV model. Mathematical Biosciences and Engineering, 2010, 7(2): 213-236. doi: 10.3934/mbe.2010.7.213
    [6] Xiaoqin Wang, Yiping Tan, Yongli Cai, Kaifa Wang, Weiming Wang . Dynamics of a stochastic HBV infection model with cell-to-cell transmission and immune response. Mathematical Biosciences and Engineering, 2021, 18(1): 616-642. doi: 10.3934/mbe.2021034
    [7] Tingting Xue, Long Zhang, Xiaolin Fan . Dynamic modeling and analysis of Hepatitis B epidemic with general incidence. Mathematical Biosciences and Engineering, 2023, 20(6): 10883-10908. doi: 10.3934/mbe.2023483
    [8] Yiping Tan, Yongli Cai, Zhihang Peng, Kaifa Wang, Ruoxia Yao, Weiming Wang . Dynamics of a stochastic HBV infection model with drug therapy and immune response. Mathematical Biosciences and Engineering, 2022, 19(8): 7570-7585. doi: 10.3934/mbe.2022356
    [9] Jiying Ma, Shasha Ma . Dynamics of a stochastic hepatitis B virus transmission model with media coverage and a case study of China. Mathematical Biosciences and Engineering, 2023, 20(2): 3070-3098. doi: 10.3934/mbe.2023145
    [10] Tahir Khan, Fathalla A. Rihan, Muhammad Ibrahim, Shuo Li, Atif M. Alamri, Salman A. AlQahtani . Modeling different infectious phases of hepatitis B with generalized saturated incidence: An analysis and control. Mathematical Biosciences and Engineering, 2024, 21(4): 5207-5226. doi: 10.3934/mbe.2024230
  • In this paper, we derive efficient drug treatment strategies for hepatitis B virus (HBV) infection by formulating a feedback control problem. We introduce and analyze a dynamic mathematical model that describes the HBV infection during antiviral therapy. We determine the reproduction number and then conduct a qualitative analysis of the model using the number. A control problem is considered to minimize the viral load with consideration for the treatment costs. In order to reflect the status of patients at both the initial time and the follow-up visits, we consider the feedback control problem based on the ensemble Kalman filter (EnKF) and differential evolution (DE). EnKF is employed to estimate full information of the state from incomplete observation data. We derive a piecewise constant drug schedule by applying DE algorithm. Numerical simulations are performed using various weights in the objective functional to suggest optimal treatment strategies in different situations.


    1. Introduction

    Hepatitis B virus (HBV) infection is an important global health problem implicated in liver cancer and cirrhosis. Nearly 2 billion people worldwide have been infected and 240 million patients have suffered from chronic HBV infection. Eighty percent of all liver cancer is caused by chronic hepatitis B, resulting in half a million fatalities annually [22]. Several therapeutic agents have been approved by the Food and Drug Administration, including interferon and nucleosides/nucleotides analogues (NUCs) such as lamivudine, adefovir, entecavir, telbivudine and tenofovir [1,2,13]. These agents fall into one of two categories: inhibiting de novo infection and inhibiting viral production. Despite the success of these drugs in reducing liver damage and delaying the progression of liver disease in chronically infected people, their long-term use comes with substantial complications. A critical challenge in the treatment of patients with chronic Hepatitis B is the emergence of drug resistance. In addition, high drug cost and complicated drug regimens impose a burden on some patients who have limited access to antiviral agents in developing countries [1].

    Mathematical models have recently contributed significantly to understanding many complex biological systems and investigating the dynamics and control of virus such as human immunodeficiency virus (HIV) and hepatitis C virus. Many researchers have used mathematical models to simulate the course of virus infection and predict the potential response to different therapies [10,18]. They also have applied optimal control techniques with mathematical models to suggest optimal treatment strategies for HIV, tuberculosis, and vector-borne diseases [3,4,17]. Several studies have explored the optimization of strategies for vaccination distribution for influenza using control theory [15,16]. A mathematical model was developed to estimate the effects of pre-exposure prophylaxis (PrEP) on the HIV epidemic in South Korea [14].

    Optimal control of HBV infection is the subject of research interest because it may contribute to the development of effective treatment strategies. The effectiveness of HBV therapy may be improved by developing dynamic drug strategies, where the treatment schedule changes over time in response to the individual's disease progression. The model we use to derive the optimal drug treatment strategies for HBV infection is originally developed in [13], although the paper did not provide a mathematical analysis of the model. The authors introduced immune effectors as a new compartment in the model to show triphasic viral dynamics since traditional biphasic models is not sufficient to explain long-term viral dynamics of hepatitis B. In this paper, we first perform a qualitative analysis of the HBV model and then consider feedback control of HBV infection incorporating current knowledge of patients. The reproduction number is determined and the stability of the steady state is investigated by using the number which is commonly used to measure the potential for the disease spread in epidemiology. We also derive optimal treatment strategies for HBV infection by formulating a feedback control problem. The model predictive control (MPC) is an advanced feedback control methodology which solves a finite-horizon open-loop control problem iteratively such that the current state is measured at the sampling time [12]. Application of the MPC method requires full information on the current state. In a clinical setting, however, it is impossible to quantify all state variables because of a lack of technical skills. To overcome the imperfection of the observation data, we use the ensemble Kalman filter (EnKF). EnKF is a recursive algorithm that produces estimates of the optimal state of the nonlinear system by using a series of observed data with noise [7,8].

    A differential evolution (DE) algorithm is employed to derive the piecewise constant drug schedule, where the current state is sampled at several measurement times. DE, introduced by Price and Storn, is a population-based direct-search global optimization algorithm. In the evolutionary computation, a population of candidate solutions, called agents, iteratively progresses towards the optimum with regard to a given measure of quality. These agents are moved around in the search-space by perturbations using three main steps of mutation, crossover and selection [21,Section 2.1,page 37-47].

    The rest of this paper is organized as follows: Section 2 introduces and analyzes a mathematical model describing the dynamics of HBV and immune response during antiviral therapy. The section derives the reproduction number and investigates the stability of the steady state. Section 3 formulates a feedback control problem based on EnKF and DE to derive optimal drug treatment strategies for HBV infection. The section addresses EnKF to estimate full information of the state at sampling time from partial observation data and the DE to derive an optimal piecewise constant control. Section 4 presents the results of numerical simulations with various weights in the objective functional. Section 5 concludes.


    2. Model analysis

    The system of ordinary differential equations describing the compartmental HBV infection dynamics is given by

    dTdt=SdTT(1ημ1(t))bVT+αfIEdIdt=(1ημ1(t))bVT+mIdIIαIEdVdt=(1ϵμ2(t))pIcVdEdt=SE+BEIE(I+KE)DEE (1)

    where the four state variables T, I, V, and E represent the number of uninfected or target cells, infected cells, virus load, and immune effectors, respectively.

    Target cells are assumed to be produced at the constant rate S. Target cells die at rate dTT and become infected at the rate bVT, where b is a new infection rate of target cells. New infection with HBV is blocked by treatment with NUCs, nucleoside analogues. The parameter η represents the efficacy of the treatment. Thus, the infection rate is reduced to (1ημ1)bVT where 0η1.

    Infected cells are produced at the rate of (1ημ1)bVT and divide by mitosis at the rate of mI, where m is the mitotic production rate of infected cells. Infected cells die at rate dII and are cleared by immune effector at rate αIE, where the immune effector-induced clearance rate of the infected cells is represented as α in the equation of the infected cell. The mechanism for immune effectors involves cytolytic and noncytolytic activities. Immune effectors are thought to perform noncytolytic activity, probably causing noncytolytic, cytokine-induced curing? of infected cells [11,20]. Although there is currently no way to measure noncytolytic immune activity during HBV infection, it was assumed that the scale of noncytolytic activity is related to the scale of cytolytic activity. Therefore, noncytolytic activity was expressed using α with a calibration coefficient f in the first equation of the system where 0f1.

    Free virions are assumed to be produced from infected cells at the rate of pI. The treatment of inhibiting viral production is introduced to reduce the rate by a factor of (1ϵμ2), where ϵ denotes the efficacy of the treatment. Then, HBV is produced at the reduced rate of (1ϵμ2)pI where 0ϵ1 and die at the rate of cV. The immune effectors are produced at the constant rate SE and die at the rate DEE. The immune effectors are stimulated at the rate of BEIEI+KE, where BE is the maximum birth rate for immune effectors and KE is the Michaelis constant. That is, the stimulation of immune response is assumed to be similar to Michaelis-Menten kinetics.

    The control variables μ1(t) and μ2(t) represent time-dependent drug treatments of inhibiting de novo infection and viral production satisfying 0μi(t)1, (i=1,2), respectively. We assume that μi(t)=0 and μi(t)=1, (i=1,2) represent no-and full treatment scenarios, respectively. For simplicity of notation in the qualitative analysis, we assume that μ1(t)1 and μ2(t)1 in this section. The mathematical model (1) contains many parameters that have to be assigned before numerical simulations. The descriptions and values for the parameters are summarized in Table 1. For a more detailed discussion of this model and parameter values, see [13].

    Table 1. Parameters used in the model (1). They are principally extracted from Kim et al. [13].
    Descriptionvalueunits
    Sproduction rate of target cells 5×105 cellsmLday
    dTdeath rate of target cells0.003 1day
    ηtreatment efficacy of inhibiting de novo infection [0,1]
    bde novo infection rate of target cells 4×1010mLvirionsday
    fcalibration coefficient of α for target cells0.1
    mmitotic production rate of infected cells0.003 1day
    dIdeath rate of infected cells0.043 1day
    αimmune effector-induced clearance rate of infected cells 7×104mLcellsday
    ϵtreatment efficacy of inhibiting viral production [0,1]
    pviral production rate by infected cells6.24virionscellsday
    cclearance rate of free virions0.71day
    SEproduction rate of immune effectors9.33 cellsmLday
    BEmaximum birth rate for immune effectors0.5 1day
    KEMichaelis-Menten type coefficient for immune effectors 4.07×105 cellsmL
    DEdeath rate of immune effectors0.52 1day
     | Show Table
    DownLoad: CSV

    For any biological model to be feasible, it is essential that all states of the model must remain non-negative. A mathematical analysis of the model should include verification of this property. We begin by defining a domain Ω for the HBV model (1) and making assumptions for some parameters.

    Ω={(T,I,V,E)R4+|0T+ISD,0V(1ϵ)pScD,0ESEDEBE},

    where D=min(dT,dIm).

    Assumptions:

    A1: The mitotic production rate of infected cells is smaller than the death rate of infected cells, i.e. m<dI.

    A2: The maximum birth rate for immune effectors is smaller than the death rate of immune effectors, i.e. BE<DE.

    Theorem 2.1. Assume that A1 and A2 hold, then Ω is positively invariant under the system (1).

    Proof. Note that the states T, I, V, and E decrease only in proportion to their present sizes, and thus, all states remain nonnegative if their initial values are nonnegative.

    Adding the first two equations in the model (1), we obtain

    ddt(T+I)=SdTT+mIdIIαIE(1f)SdTT(dIm)ISD(T+I),

    where D=min(dT,dIm). Clearly T(t)+I(t)SD for all t when initial value T0+I0SD. Similarly, we have V(t)(1ϵ)pScD and E(t)SEDEBE for all t. Therefore, Ω is positively invariant under the system (1)

    Now we analyze the stability of the steady states by calculating the reproduction number. In epidemiology, the basic reproduction number, R0, is defined as the average number of secondary infections generated by a typical primary infection in a susceptible population. It is used to determine whether or not a disease can spread through the population. In this paper, the R0 is considered as the expected number of target cells infected through viruses reproduced by a single infected cell. It is a key threshold quantity for conducting qualitative analysis of the model. Because the model (1) include treatment, we speak of a control reproduction number Rc rather than a basic reproduction number R0. If there is no treatment (μ1(t)=0, μ2(t)=0), then Rc reduces to R0.?

    Clearly, the HBV model (1) has a unique virus-free equilibrium given by

    EQ0=(SdT,0,0,SEDE).

    Using the next generation approach [6,page 230], we calculate Rc, which is critical to the stability of the virus-free equilibrium. To this end, we split the HBV model (1) into two systems with a disease compartment x=(I,V)t and a non-disease compartment y=(T,E)t as

    dxdt=FV and dydt=[SdTT(1η)bVT+αfIESE+BEIE(I+KE)DEE],

    where

    F=[(1η)bVT0]andV=[(dIm+αE)I(1ϵ)pI+cV].

    The linearized equations for the disease compartment, x, at the virus-free equilibrium can be written as

    x=(FV)x,

    where

    F=Fx=[0(1η)bSdT00]andV=Vx=[dIm+αSEDE0(1ϵ)pc].

    Then Rc is given by

    Rc=ρ(FV1)=(1η)bS(1ϵ)pcdT(dIm+αSEDE), (2)

    where ρ(A) denotes the spectral radius of a matrix A.

    Theorem 2.2. Assume that A1 and A2 hold. The virus-free equilibrium, EQ0 for the system (1) is locally asymptotically stable if Rc<1 and unstable if Rc>1.

    Proof. At the virus-free equilibrium, the Jacobian matrix of the system (1) is

    J(EQ0)=[dTαfSEDE(1η)bSdT00(mdIαSEDE)(1η)bSdT00(1ϵ)pc00BESEKEDE0DE].

    And the characteristic polynomial of the Jacobian matrix is given by

    p(λ)=(dTλ)(DEλ)((mdIαSEDEλ)(cλ)(1ϵ)p(1η)bSdT).

    Clearly, λ1=dT,λ2=DE are two of the eigenvalues of the Jacobian matrix. Denote λ3 and λ4 as the other eigenvalues. These values are the roots of the quadratic polynomial in p(λ) given by

    (mdIαSEDEλ)(cλ)(1ϵ)p(1η)bSdT=λ2(mdIαSEDEc)λ+(mdIαSEDE)(c)(1ϵ)p(1η)bSdT.

    So we have

    λ3+λ4=mdIαSEDEc,λ3λ4=(mdIαSEDE)(c)(1ϵ)p(1η)bSdT=(dIm+αSEDE)c(1Rc).

    By assumption A1, we have λ3+λ4<0. Therefore, all of the eigenvalues of the Jacobian matrix have negative real parts when Rc<1. However, not all roots of this polynomial can have negative real parts when Rc>1. This means that the virus-free equilibrium is locally asymptotically stable when Rc<1 and unstable when Rc>1.

    We note that the global stability of the virus-free equilibrium can be established by verifying the conditions of the global stability theorem in [5,page 176]. Consider the abstract form of a mathematical model to introduce the theorem:

    x=Axˆf(x,y)y=g(x,y) (3)

    where x denotes a disease compartment and y denotes a non-disease compartment.

    Definition 2.3. A is called M-matrix if and only if there exists a matrix B0 and a real number rρ(B) such that A=rIB.

    Remark 1. [9] A is a M-matrix if and only if A is a real nonsingular matrix such that aij0 for all ij and each entry of A1 is nonnegative.

    We now assume that y is a globally asymptotically stable equilibrium of y=g(0,y).

    Theorem 2.4. [5,page 176] If A is a nonsingular M-matrix and ˆf0, then (0,y) is a globally asymptotically stable equilibrium of (3).

    Theorem 2.5. Assume that A1 and A2 hold. If Rc<1, then the virus-free equilibrium, EQ0, is globally asymptotically stable when E(0)>SEDE and dTdIm.

    Proof. Our model can be written as

    x=Axˆf(x,y)y=g(x,y)=[SdTT(1η)bVT+αfIESE+BEIE(I+KE)DEE], (4)

    where

    A=[(mdIαSEDE)(1η)bSdT(1ϵ)pc],
    ˆf(x,y)=[(1η)bV(SdTT)+αI(ESEDE)0].

    It is clear that y=(SdT,SEDE) is a globally asymptotically stable equilibrium of y=g(0,y) and a12 and a21 in the matrix A are nonpositive.

    Now consider the inverse of A,

    A1=1(dIm+αSEDE)c(1Rc)[c(1η)bSdT(1ϵ)pdIm+αSEDE].

    Thus A is M-matrix when Rc<1 by Remark 1.

    In addition, ESEDE when E(0)>SEDE because of the form of the fourth equation in our model (1) and TSdT when dTdIm by Theorem 2.1. That is, ˆf(x,y)0. Therefore, the virus-free equilibrium, EQ0, is globally asymptotically stable when E(0)>SEDE and dTdIm by Theorem 2.4.

    Theorem 2.6. Suppose that assumptions A1 and A2 hold. There exists a unique positive chronic equilibrium if Rc>1 and there is no chronic equilibrium if Rc<1.

    Proof. To determine a chronic equilibrium, (T,I,V,E), the following equations are solved:

    SdTT(1η)bVT+αfIE=0(1η)bVT+mIdIIαIE=0 (5)
    (1ϵ)pIcV=0 (6)
    SE+BEIEI+KEDEE=0 (7)

    By the third and fourth equations, we have

    V=I(1ϵ)pc , E=SE(I+KE)I(BEDE)KEDE.

    Substituting V and E in the first equation

    T=S+αfIEdT+(1η)bV=S+αfISE(I+KE)I(BEDE)KEDEdT+(1η)bI(1ϵ)pc. (8)

    By the second equation and the above expression, we finally obtain a quadratic polynomial, P(I);

    P(I)=(1η)b(1ϵ)p(S(I(BEDE)KEDE)IαfSE(I+KE))+(mDI)(cdT+I(1η)b(1ϵ)p)(I(BEDE)KEDE)+αSE(I+KE)(cdT+I(1η)b(1ϵ)p)=AI2+BI+C=0

    where

    A=(mdI)(1η)b(1ϵ)p(BEDE)+αSE(1f)(1η)b(1ϵ)pB=(1η)b(1ϵ)p(S(BEDE)αfSEKE)+(mdI)cdT(BEDE)(mdI)(1η)b(1ϵ)pKEDE+αSEKE(1η)b(1ϵ)p+αSEdTC=(1η)b(1ϵ)pSKEDE+αSEKEcdT(mdI)cdTKEDE=KE((m+dI)DE+αSE)cdT(1Rc)

    By our assumptions, A>0. And C<0 if Rc>1. This means there exists only one positive root of P(I) and thus a corresponding chronic equilibrium, EQ=(T,I,V,E) if Rc>1. On the other hand, there is no chronic equilibrium if Rc<1.

    A general qualitative analysis of our model is very challenging due to the high nonlinearities in our model. However, we are still interested in the stability of the steady state, so we conducted a stability analysis of the steady state numerically. Given the parameter values in Table 1, we plot the bifurcation diagram of the model system (1) with varying ϵ and η=0.5. In Figure 1, the x and yaxes stand for Rc and the steady state I, respectively. By checking the eigenvalues of Jacobian, we verify that the virus-free steady state is locally asymptotically stable when Rc<1 and it is unstable when Rc>1. Moreover, our simulation supports that there exists a chronic equilibrium which is locally asymptotically stable when Rc>1.

    Figure 1. The bifurcation diagram of the model system.

    3. Feedback control

    One of the study goals is to design optimal drug treatment strategies using mathematical models and control techniques. One way to achieve this goal is to consider an optimal control problem minimizing the number of virions with consideration for the treatment costs. Thus, we define the objective functional as

    J(μ1,μ2)=w1V(t)+w2μ21(t)+w3μ22(t)dt (9)

    The weight constants wi, (i=1,2,3) can be chosen to balance the relative costs of the virus load V and control efforts of μ1(t) and μ2(t). Then we formulate an optimal control problem to minimize the objective functional constrained to the dynamics of the HBV infection:

    minimize J(μ1,μ2) subject to (1).

    In our optimal control formulation, the MPC method is used in order to reflect the status of patients at both initial time and follow-up visits. Therapy strategies are determined based on the current state of the system at each sampling time, which is the initial state for the open-loop control problem in each subinterval. In order to formulate the MPC, we assume that the current state is measured at ti where ti<ti+1 and Ti is the control horizon time such that ti+1ti+Ti for i=0,1,2,. The MPC algorithm proceeds as follows:

    1. Solve the open-loop optimal control problem minimizing the objective functional (9) in the interval [ti,ti+Ti] with the initial value x(ti)=xi,

    2. Determine the state x(t) in the interval [ti,ti+Ti] by solving the model (1) with the derived optimal control functions,

    3. Determine xi+1 by using measured data and the predicted state x(t) at t=ti+1,

    4. Repeat this process over the next time interval [ti+1,ti+1+Ti+1] with initial value x(ti+1)=xi+1.

    There are several technical issues to be addressed to implement our approach. Synthesis of the nonlinear feedback control requires full knowledge of all the state variables. However, in a clinical setting, only partial information of the state is given due to lack of technical skill to quantify all the state variables. The ideas of the Kalman filter (KF) are employed to design a state estimator. Then we are interested in restricted class of piecewise constant controllers that are usual treatment protocols in practice. DE, an algorithm that optimizes a problem by iteratively improving a candidate solution, has a great potential as a tool for deriving piecewise constant strategies.

    In this section, we give a brief summary of two main techniques, KF and DE, used in the MPC. We begin with the basic ideas and background of KF and move to modified algorithms to take care of nonlinearity and the system with continuous dynamics and discrete measurements. In the second part, the DE method is introduced and each step is explained with illustrations. Figure 2 shows how these techniques are incorporated to solve a control problem to yield an efficient treatment strategy.

    Figure 2. Feedback control algorithm.

    3.1. Hybrid ensemble Kalman filter (EnKF)

    Kalman filter (KF) uses a series of measurements observed over time containing noise and produces estimates of state variables by a combination of the probability distribution from the model prediction and the measurement. It is a recursive algorithm that only utilizes the first two moments of the state, mean and covariance, to characterize the entire optimal estimate for linear systems with additive Gaussian noise in both the process model and the observation. Consider a linear, discrete system of dynamics and measurement:

    xk=Fk1xk1+wk1zk=Hkxk+vk (10)

    where xk is a state vector, Fk the state transition model, and Hk the observation model which maps the true state xk into the observation zk. There are several assumptions for KF regarding noise terms in process and measurement models. The initial state x0 is a Gaussian with a known mean ˉx0 and a covariance matrix P0. Noise wk and vk are assumed to be zero-mean Gaussian processes with covariances Qk and Rk, respectively.

    The KF algorithm consists of a prior step to predict by the process model and a posterior step to update based on the measurement, which can be summarized as

    • Prior estimation (prediction):

    ˆxk=Fk1ˆxk1Pk=Fk1Pk1FTk1+Qk1

    • Posterior estimation (update):

    K=PkHTk(HkPkHTk+Rk)1ˆxk=ˆxk+K(zkHkˆxk)Pk=(IKHk)Pk

    The probability distribution of state variables in a linear model is completely characterized by its mean and covariance for all times if the initial condition follows a normal distribution. For a nonlinear model, however, the first two moments of the state will not characterize the full probability density, but do determine the mean path and the dispersion about that path. A number of extensions have been proposed to deal with nonlinear dynamics. We also note that physical systems are often represented as continuous-time dynamics in conjunction with discrete-time measurements.

    ˙x(t)=f(x,t)+w(t)zk=Hkxk+vk (11)

    The extended Kalman filter approximates the nonlinear dynamics model by a linearization about the current state. This linear model is then propagated forward under the basic KF algorithm and is used to approximate the optimal mean and covariance for the state of the system. It has been observed that the extended Kalman filter may fail to achieve meaningful results, in the case of large nonlinearity or poor initial guess. Another set of approaches based on sampling techniques have been developed, as opposed to deterministic ones, to characterize the distributions. One such approach, EnKF, generates numerous points sampled from the assumed distribution and propagates them forward to calculate the mean and variance of these samples [8,page 38-46]. Adjustments have to be made to account for nonlinearity and discrepancy in time. In this particular problem, we only shows the numerical results obtained by applying EnKF because the results of both EKF and EnKF algorithms are consistent. The hybrid version EnKF algorithm is given as follows:

    • Initialize: Generate particles X={x0,j} sampled from initial distribution

    ˆx(0)=E[x0,j]P(0)=E[(x0,jE[x0,j])(x0,jE[x0,j])T]

    • Prior estimation (prediction):

    X={xk,j|˙xk,j=f(xk,j,t)+w(t) with initial values xk1,jX}ˆx=E[X]P=Cov[X,X]

    • Posterior estimation (update):

    Z={zk,j=Hkxk,j}Z={zk,j=zk+vk,j}K=Cov[X,Z](Cov[Z,Z]+Cov[Z,Z])1X={xk,j=xk,j+K(zk,jzk,j)}ˆxk=E[X]Pk=Cov[X,X]

    In order to apply the KF technique, we introduce the notation x and r to denote the state and control vectors, respectively. Thus we define

    x=(TIVE),r=(ηϵ)

    With the above notation, the HBV model (1) can be expressed in the form

    ˙x=f(x(t),r(t),t).

    For feedback control, we need current knowledge on the state of the system. We assume that partial state observation of viral load V is available, which is representative of the type of data in a clinical setting. Hence, the measurement or observation takes the form

    z=(0010)x=Hx

    3.2. Differential evolution (DE) method

    DE is a direct-search global optimization algorithm that was originally developed by Price and Storn in 1997. Let J:RDR be the cost functional to be minimized. The DE algorithm can then be described as follows:

    Step 1. Initialization The initial population of possible candidate vectors {xi:i=1,2,..,Np} are generated by random sampling in the search-space. Each element of a candidate vector of dimension D is denoted by xi=[xi,j:j=1,2,...,D].

    Step2. Mutation Mutant vectors {vi} are generated according to

    vi=xbest+P(xmxn)i=1,2,...,Np (12)

    where xbest is the best candidate, {xm,xn} are two distinct arbitrary candidate solution vectors and P(0,2) is a mutation scale factor.

    Step3. Crossover A set of vectors {ui} is formed for each i=1,2,...,Np,

    ui,j={vi,j,if randj(0,1)CRxi,j,otherwise (13)

    where j=1,2,...,D and CR[0,1] is a crossover probability.

    Step4. Selection The next generation vectors are selected as

    xi={ui,if J(ui)<J(xi)xi,otherwise (14)

    Repeat steps 2-4 until the termination criteria are met.

    DE creates mutant agents using the scaled difference of two randomly selected candidate vectors in the mutation step (Figure 3). In the algorithm, the scale factor P[0,2] gives the scatter around xbest and P>1 is required for a successfully optimized result, in general. The crossover step allows the construction of new candidate vectors by combining the current and mutant agents (Figure 4). CR[0,1], called a crossover probability, determines the extent of preference given to the mutant in recombination of components in each candidate vector. The average number of parameters mutated depends on the crossover probability and many genetic algorithms recommend a crossover probability of 1/D, where D is the dimension of a solution vector. In the selection, replace the current agent in the population if the candidate vector is an improvement.

    Figure 3. Mutation step to create a mutant vector vi.
    Figure 4. Crossover step to yield one of the vectors vi, ui, ui and xi as a new candidate.

    In summary, mutation expands the search space, crossover incorporates candidate vectors from the previous generation and new candidate vectors for recombination, and selection admits the one with the best fitness to the next generation. Therefore, mutation and crossover tend to increase the diversity of a population, whereas selection reduces it. To avoid premature convergence due to the selection pressure, it is crucial to choose P and CR that are of sufficient magnitude. Selecting desirable parameters in DE has been the subject of much research, since the choice of parameters can have a large impact on optimization performance [19].


    4. Numerical results

    We illustrate the nonlinear feedback control incorporating EnKF and DE algorithms using synthetic data. We consider a drug treatment strategy over 200 days with monthly measurement. A set of synthetic data is constructed by adding 5% random noise to the model prediction of viral load V for every time point for the measurement. The parameters used are summarized in Table 1 and the initial values for state variables are

    T0=1.4×108,I0=3.75×105,V0=5×108,E0=100

    At each sampling time, an initial guess for the covariance to start the EnKF is chosen to be a diagonal matrix in which diagonal entries are 10. In the DE algorithm, the population size Np, the differential weight P, and the crossover probability CR are selected to be Np=500,P=1.1, and CR=0.7, respectively. The values of the drug efficacy for η and ϵ are set to 0.9.

    To suggest general policies for efficient treatment, we run the simulations under various settings. The weight constants wi in the objective functional (9) can be chosen to balance the difference in the magnitudes and/or relative costs of the viral load and the drug treatment. In fact, the second and third terms in the objective functional (9) with weights w2 and w3 represent systemic costs of the drug treatments which mean severity of unintended side effects as well as treatment cost. In numerical simulations, weights for drug treatments w2 and w3 are varied to compare the optimal treatment strategies under different relative costs, keeping the weight for virus cells w1=107 fixed to balance the magnitudes.

    Figure 5 displays the time dependent feedback controls and the corresponding states with the choice of w2=105 and w3=105. Hybrid EnKF is used to recover the full states (diamond) from the partial measurement (star). Then the DE algorithm is followed to obtain the optimal piecewise constant functions. The derived drug schedules for both treatments are nearly identical to the full treatment in this case. That is, the maximum dosage of the both drugs are recommended for the whole duration if the systemic costs of the drug treatments are relatively low (w2=105 and w3=105). The corresponding optimal states obtained by solving the system (1) are also presented. Drug treatment causes the low viral load, which discourages an immune response.

    Figure 5. Feedback controls μ1, μ2 and the corresponding states T,V,I,E with w2=105, w3=105.

    The impact of different weights on feedback control is explored by varying the weights for w2 and w3 in Figure 6 and Figure 7. The dosage of μ1 tapers off from the end of treatment duration, whereas the dosage of μ2 is maintained, as the weights for μ1 and μ2 increase simultaneously. In Figure 6, the temporal schedule of optimal control μ1 stays at full dosage for 60 days and begins to reduce until it reaches the level of 20% approximately in the end. We note that the decline in the dosage of the drug inhibiting de novo infection is closely related to the virus load. Figure 7 shows that μ1 is almost zero except in the first 30 days whereas μ2 maintains its maximum value during the whole period in the case of w2=101, and w3=101. One may conclude that μ2 is more effective than μ1 in reducing the viral load and infected cells due to underlying mechanisms when both controls have the same level of reduction in inhibition. That is, it is more efficient that some patients who have limited access to therapeutic agents due to high cost should reduce their use of drugs that inhibit de novo infection while maintaining their use of drugs that inhibit viral production.

    Figure 6. Feedback controls μ1, μ2 and the corresponding states T,V,I,E with w2=103, w3=103.
    Figure 7. Feedback controls μ1, μ2 and the corresponding states T,V,I,E with w2=101, w3=101.

    In Figure 8, w2=103 and w3=102 are chosen to consider the case when μ1 and μ2 have different costs to implement. It is observed that the dosage of μ2 maintains its maximum value up to the 90th day and then tapers off whereas the dosage of μ1 maintains its full dosage for the whole treatment duration when the cost of μ2 is higher than the cost of μ1.

    Figure 8. Feedback controls μ1, μ2 and the corresponding states T,V,I,E with w2=103, w3=102.

    The relation between the relative dosage and the weights of two drugs are investigated further in Figure 9 and Figure 10. In the baseline case, we set efficacy of both drugs η=ϵ=0.9 and vary the weights w2 and w3 in the range [106,100]. To compared the total dosage of drugs that inhibit de novo infection and inhibit viral production regarding to the change of weights, the difference μ2μ1 between the total amount of μ2 and μ1 for the whole duration is displayed in the Figure 9. Then the positive region of difference where the total dosage of μ2 is higher than μ1, is divided from the negative one by a threshold curve (solid line). As it is noticed in previous results, the case of same weights is included in the positive region because μ1 begins first to reduce its dosage when the weights increase simultaneously.

    Figure 9. The difference between the total amount of μ2 and μ1 using same treatment efficacy (η=ϵ=0.9).
    Figure 10. The difference between the total amount of μ2 and μ1 using various combinations of treatment efficacy assuming the total efficacy of 99%.

    In Figure 10, simulations are performed using various combinations of treatment efficacy assuming the total efficacy of 99%. In the first row, threshold is portrayed when the efficacy of viral production inhibitor is higher than the efficacy of de novo infection inhibitor. The region where the dosage of μ2 is higher than μ1 is enlarged as the efficacy of viral production inhibitor, ϵ, increases. On the other hand, the positive region decreases if η rises as illustrated on the bottom.

    We performed numerical simulations with monthly, biweekly, and weekly measurement to investigate the influence of measurement time in Figure 11-13. Optimal treatment strategies are illustrated as weights w2 and w3 increase simultaneously. We also derived piecewise continuous optimal controls to carry out comparative assessments of different treatment schedules. While the resulting strategies are slightly different in details, the general conclusions are consistent. That includes that μ2 is more effective than μ1 in reducing the viral load and infected cells assuming the same level of reduction in inhibition.

    Figure 11. Piecewise constant controls with monthly, biweekly, and weekly measurement and piecewise continuous controls with monthly measurement using w2=105, w3=105.
    Figure 12. Piecewise constant controls with monthly, biweekly, and weekly measurement and piecewise continuous controls with monthly measurement using w2=103, w3=103.
    Figure 13. Piecewise constant controls with monthly, biweekly, and weekly measurement and piecewise continuous controls with monthly measurement using w2=101, w3=101.

    5. Conclusion

    This paper formulates and analyzes a mathematical model of the HBV infection for a better understanding of the interaction between the HBV infection and the immune response during antiviral therapy. The qualitative analysis shows that the proposed model possesses one virus-free equilibrium and one chronic equilibrium with some assumptions. A detailed local/global stability analysis of the virus-free steady state is conducted using the Jacobian matrix method combined with the reproductive number. Bifurcation analysis is also performed to support our theoretical results for the stability analysis numerically.

    In addition, the paper considers a feedback control problem to reflect the status at each sampling instance. The ideas of EnKF are used to address the issue of incomplete observation data. Then we apply the DE algorithm to derive piecewise constant control, which is the usual protocol in practice. The results of numerical simulations indicate that as the treatment costs increase, the drug dosage tapers off, resulting in decreased drug use. It is observed that μ2, which inhibits viral production, may be more effective than μ1, which inhibits de novo infection, in reducing the viral load assuming both controls have the same cost. However, drug dosage is a function of weight and is determined by the relative costs of drugs, in general.


    [1] [ Z. Abbas,A. R. Siddiqui, Management of hepatitis B in developing countries, World Journal of Hepatology, 3 (2011): 292-299.
    [2] [ D. Lavanchy, Hepatitis B virus epidemiology, disease burden, treatment, and current and emerging prevention and control measures, Journal of Viral Hepatitis, 11 (2004): 97-107.
    [3] [ B. M. Adams,H. T. Banks,M. Davidian,Hee-Dae Kwon,H. T. Tran,S. N. Wynne,E. S. Rosenberg, HIV dynamics: Modeling, data analysis, and optimal treatment protocols, Journal of Computational and Applied Mathematics, 184 (2005): 10-49.
    [4] [ K. Blayneh,Y. Cao,H.-D. Kwon, Optimal control of vector-borne diseases: Treatment and prevention, Discrete and Continuous Dynamical Systems-series B, 11 (2009): 587-611.
    [5] [ F. Brauer, P. Van den Driessche and J. Wu, Mathematical Epidemiology, Springer-Verlag, Berlin, Heidelberg, 2008.
    [6] [ C. Castillo-Chavez, Blower, P. van den Driessche, D. Kirschner and A. -A. Yakubu, Mathematical Approaches for Emerging and Reemerging Infectious Diseases, Springer-Verlag, New York, 2002.
    [7] [ F. Daum, Nonlinear filters: beyond the Kalman filter, IEEE Aerospace and Electronic Systems Magazine, 20 (2005): 57-69.
    [8] [ G. Evensen, Data Assimilation: The Ensemble Kalman Filter, Springer-Verlag Berlin Heidelberg, 2009.
    [9] [ T. Fujimoto,R. R. Ranade, Two Characterizations of Inverse-Positive Matrices: The Hawkins-Simon Condition and the Le Chatelier-Braun Principle, Electronic Journal of Linear Algebra, 11 (2004): 59-65.
    [10] [ J. Guedj,Y. Rotman,S. J. Cotler,C. Koh,P. Schmid, Understanding early serum hepatitis D virus and hepatitis B surface antigen kinetics during pegylated interferon-alpha therapy via mathematical modeling, Hepatology, 60 (2014): 1902-1910.
    [11] [ L. G. Guidotti,R. Rochford,J. Chung,M. Shapiro,R. Purcell, Viral clearance without destruction of infected cells during acute HBV infection, Science, 284 (1999): 825-829.
    [12] [ K. Ito,K. Kunisch, Asymptotic properties of receding horizon optimal control problems, SIAM Journal on Control and Optimization, 40 (2002): 1585-1610.
    [13] [ H. Y. Kim, H. -D. Kwon, T. S. Jang, J. Lim and H. Lee, Mathematical modeling of triphasic viral dynamics in patients with HBeAg-positive chronic hepatitis B showing response to 24-week clevudine therapy, PloS One, 7 (2012), e50377.
    [14] [ S. B. Kim, M. Yoon, N. S. Ku, M. H. Kim and J. E. Song, et, al., Mathematical modeling of HIV prevention measures including pre-exposure prophylaxis on hiv incidence in south korea, PloS One, 9 (2014), e90080.
    [15] [ J. Lee,J. Kim,H.-D. Kwon, Optimal control of an influenza model with seasonal forcing and age-dependent transmission rates, Journal of Theoretical Biology, 317 (2013): 310-320.
    [16] [ S. Lee,M. Golinski,G. Chowell, Modeling optimal age-specific vaccination strategies against pandemic influenza, Bulletin of Mathematical Biology, 74 (2012): 958-980.
    [17] [ E. Jung,S. Lenhart,Z. Feng, Optimal control of treatments in a two-strain tuberculosis model, Discrete and Continuous Dynamical Systems -Series B, 2 (2002): 473-482.
    [18] [ N. K. Martin,P. Vickerman,G. R. Foster,S. J. Hutchinson,D. J. Goldberg,M. Hickman, Can antiviral therapy for hepatitis C reduce the prevalence of HCV among injecting drug user populations? A modeling analysis of its prevention utility, Journal of Hepatology, 54 (2011): 1137-1144.
    [19] [ R. Storn,K. Price, Differential evolution -a simple and efficient heuristic for global optimization over continuous spaces, Journal of Global Optimization, 11 (1997): 341-359.
    [20] [ R. Thimme,S. Wieland,C. Steiger,J. Ghrayeb,K. A. Reimann, CD8(+) T cells mediate viral clearance and disease pathogenesis during acute hepatitis B virus infection, J. Virol, 77 (2003): 68-76.
    [21] [ K. V. Price, R. M. Storn and J. A. Lampinen, Differential Evolution: A Practical Approach to Global Optimization, Springer-Verlag, Berlin, Heidelberg, 2005.
    [22] [ Hepatitis B Foudation, http://www.hepb.org.
  • This article has been cited by:

    1. Wei Li, Wenyin Gong, Differential evolution with quasi-reflection-based mutation, 2021, 18, 1551-0018, 2425, 10.3934/mbe.2021123
    2. Arsit Boonyaprapasorn, Thanacha Choopojcharoen, Parinya Sa-Ngiamsunthorn, Kaned Thung-od, 2018, Synergetic Controller for Hepatitis B Epidemic System, 9781450365307, 31, 10.1145/3265639.3265655
    3. Zhouhong Li, Tianwei Zhang, Permanence for Leslie-Gower predator-prey system with feedback controls on time scales, 2020, 1607-3606, 1, 10.2989/16073606.2020.1799256
    4. Shaghayegh Gorji, Ahmad Fakharian, Rezvan Abbasi, Virtually Euler–Lagrange form for feedback passivation of a wide range of non-Euler–Lagrange systems: Adaptive passivity-based control of hepatitis B disease, 2023, 0959-6518, 095965182211469, 10.1177/09596518221146928
    5. Gaowang Zhang, Feng Wang, Jian Chen, Huayi Li, Fixed-time sliding mode attitude control of a flexible spacecraft with rotating appendages connected by magnetic bearing, 2022, 19, 1551-0018, 2286, 10.3934/mbe.2022106
    6. Xinjia Li, Wenlian Lu, Particle network EnKF for large-scale data assimilation, 2022, 10, 2296-424X, 10.3389/fphy.2022.998503
    7. Batuhan Bilgi, Meric Cetin, Selami Beyhan, 2018, Adaptive UKF Based State Estimation of HIV, Hepatitis-B and Cancer Mathematical Models, 978-1-5386-7641-7, 1, 10.1109/CEIT.2018.8751866
    8. Leah Mitchell, Andrea Arnold, Analyzing the effects of observation function selection in ensemble Kalman filtering for epidemic models, 2021, 339, 00255564, 108655, 10.1016/j.mbs.2021.108655
  • Reader Comments
  • © 2018 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(4253) PDF downloads(626) Cited by(8)

Article outline

Figures and Tables

Figures(13)  /  Tables(1)

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog