A novel Bayesian federated learning framework to address multi-dimensional heterogeneity problem

Jianye Yang; Tongjiang Yan; Pengcheng Ren; Jianye Yang; Tongjiang Yan; Pengcheng Ren

doi:10.3934/math.2023769

AIMS Mathematics

2023, Volume 8, Issue 7: 15058-15080. doi: 10.3934/math.2023769

Previous Article Next Article

Research article

A novel Bayesian federated learning framework to address multi-dimensional heterogeneity problem

College of Science, China University of Petroleum, Qingdao 266580, China

Federated learning (FL) has attracted a lot of interests as a promising machine learning approach to protect user privacy and data security. It requires the clients to send model parameters to the server rather than private datasets, thus protecting privacy to a significant extent. However, there are several types of heterogeneities (data, model, objective and systems) in FL scenario, posing distinct challenges to the canonical FL algorithm (FedAvg). In this work, we propose a novel FL framework that integrates knowledge distillation and Bayesian inference to address this multi-dimensional heterogeneity problem. On the client side, we approximate the local likelihood function using a scaled multi-dimensional Gaussian probability density function (PDF). Moreover, each client is allowed to design customized model according to the requirement through knowledge distillation. On the server side, a multi-Gaussian product mechanism is employed to construct and maximize the global likelihood function, greatly enhancing the accuracy of the aggregated model in the case of data heterogeneity. Finally, we show in extensive empirical experiments on various datasets and settings that global model and local model can achieve better performance and require fewer communication rounds to converge compared with other FL techniques.

Keywords:

Citation: Jianye Yang, Tongjiang Yan, Pengcheng Ren. A novel Bayesian federated learning framework to address multi-dimensional heterogeneity problem[J]. AIMS Mathematics, 2023, 8(7): 15058-15080. doi: 10.3934/math.2023769

Related Papers:

[1]	Dong-Mei Li, Bing Chai . A dynamic model of hepatitis B virus with drug-resistant treatment. AIMS Mathematics, 2020, 5(5): 4734-4753. doi: 10.3934/math.2020303
[2]	Naveed Shahid, Muhammad Aziz-ur Rehman, Nauman Ahmed, Dumitru Baleanu, Muhammad Sajid Iqbal, Muhammad Rafiq . Numerical investigation for the nonlinear model of hepatitis-B virus with the existence of optimal solution. AIMS Mathematics, 2021, 6(8): 8294-8314. doi: 10.3934/math.2021480
[3]	Abdul Qadeer Khan, Fakhra Bibi, Saud Fahad Aldosary . Bifurcation analysis and chaos in a discrete Hepatitis B virus model. AIMS Mathematics, 2024, 9(7): 19597-19625. doi: 10.3934/math.2024956
[4]	Nauman Ahmed, Ali Raza, Ali Akgül, Zafar Iqbal, Muhammad Rafiq, Muhammad Ozair Ahmad, Fahd Jarad . New applications related to hepatitis C model. AIMS Mathematics, 2022, 7(6): 11362-11381. doi: 10.3934/math.2022634
[5]	Muhammad Farman, Ali Akgül, J. Alberto Conejero, Aamir Shehzad, Kottakkaran Sooppy Nisar, Dumitru Baleanu . Analytical study of a Hepatitis B epidemic model using a discrete generalized nonsingular kernel. AIMS Mathematics, 2024, 9(7): 16966-16997. doi: 10.3934/math.2024824
[6]	Alireza Sayyidmousavi, Katrin Rohlf . Stochastic simulations of the Schnakenberg model with spatial inhomogeneities using reactive multiparticle collision dynamics. AIMS Mathematics, 2019, 4(6): 1805-1823. doi: 10.3934/math.2019.6.1805
[7]	Liping Wang, Peng Wu, Mingshan Li, Lei Shi . Global dynamics analysis of a Zika transmission model with environment transmission route and spatial heterogeneity. AIMS Mathematics, 2022, 7(3): 4803-4832. doi: 10.3934/math.2022268
[8]	Sajjad Ali Khan, Kamal Shah, Poom Kumam, Aly Seadawy, Gul Zaman, Zahir Shah . Study of mathematical model of Hepatitis B under Caputo-Fabrizo derivative. AIMS Mathematics, 2021, 6(1): 195-209. doi: 10.3934/math.2021013
[9]	Abdul Qadeer Khan, Ayesha Yaqoob, Ateq Alsaadi . Discrete Hepatitis C virus model with local dynamics, chaos and bifurcations. AIMS Mathematics, 2024, 9(10): 28643-28670. doi: 10.3934/math.20241390
[10]	Yousef Alnafisah, Moustafa El-Shahed . Deterministic and stochastic model for the hepatitis C with different types of virus genome. AIMS Mathematics, 2022, 7(7): 11905-11918. doi: 10.3934/math.2022664

Abstract

1. Introduction

Hepatitis B is a life-threatening viral infection that presents a significant global public health challenge. It is associated with severe chronic conditions, including cirrhosis and hepatocellular carcinoma, which are major contributors to mortality among affected individuals. Beyond the immediate health impacts, chronic Hepatitis B infection can lead to long-term disabilities due to liver damage and related complications. The hepatitis B virus (HBV) targets liver cells, where it establishes infection, replicates extensively, and releases large quantities of viral particles into the bloodstream. The infection manifests in two forms: Acute and chronic. Acute hepatitis B often resolves within six months, with the immune system effectively clearing the virus and leading to full recovery. However, if the infection persists beyond six months, it progresses to a chronic state, which is where many disability-related complications arise. Chronic Hepatitis B, especially in those infected during childhood, increases the risk of progressive liver disease and complications that severely impair daily functioning. Common symptoms in advanced stages, such as hepatic encephalopathy, cause cognitive impairments, while fatigue and pain limit physical activity and mobility, leading to functional disabilities. Children infected with HBV between the ages of 1 and 8 years are at significant risk of developing chronic, often asymptomatic infection, yet they remain carriers capable of transmitting the virus to others. Globally, an estimated 240 million people live with chronic HBV-related liver infections, with around 600,000 deaths annually from both acute and chronic forms of the disease ^[1,2]. HBV transmission primarily occurs through contact with infected blood or bodily fluids, including through sexual contact, blood transfusions, and perinatal transmission from mother to child. Age at infection is a critical factor in disease progression and the potential for disability. Infants and young children, particularly those under six, are more likely to develop chronic hepatitis B, leading to an 80-90% likelihood of chronic infection in those infected within the first year of life and 30-35% in those infected between ages 1 and 6. By contrast, less than 5% of adults who acquire HBV progress to chronic symptoms. Nevertheless, 15-25% of individuals who contract HBV early in life experience HBV-related complications, including liver cancer, cirrhosis, and associated disabilities due to extensive liver damage ^[3,4].

Chronic carriers of the hepatitis B virus (HBV) typically do not exhibit a history of acute illness; however, they are at risk for developing cirrhosis, which involves scarring of the liver and can potentially lead to liver failure or hepatocellular carcinoma. A small percentage (1%–6%) of chronic carriers are able to clear the virus naturally. Some individuals infected with HBV may present symptoms similar to those caused by other viral infections, while many remain asymptomatic until serious complications, such as liver damage, emerge. In some cases, it can take 2 to 5 months for symptoms of hepatitis B to appear ^[5,6]. However, for others, symptoms may be minimal or completely absent, despite the potential for severe disease progression. Asymptomatic individuals, although not manifesting symptoms, can still transmit the virus and may develop chronic HBV infection later in life. Additionally, certain individuals may act as carriers of the virus without being infected ^[7]. Prophylactic administration of the HBV vaccine and hepatitis B immune globulin within 12 hours of birth can significantly reduce the risk of mother-to-child transmission of HBV, lowering it from 20-90% to 5-10%. Subsequent doses of the vaccine are typically given at 1–2 months and again at 6 months of age, but not beyond that ^[8,9]. In many adult cases, treatment is not required as spontaneous immunity often develops ^[10]. In any case, antiviral treatment might be vital in the beginning phases for people with compromised resistant frameworks or those encountering a forceful beginning of contamination. For those with ongoing HBV disease, therapy is vital to decrease the gamble of serious intricacies like liver malignant growth or cirrhosis. The span and way to deal with treatment are impacted by the HBV genotype and the particular antiviral routine utilized, which might go from a half year to a year ^[11].

One of the primary concerns in the study of hepatitis B virus (HBV) infection is developing strategies to control the infection rate and eradicate the virus from the population. Mathematical models are valuable tools for optimizing resources and implementing control measures more effectively ^[12,13]. Anderson and May used a simple mathematical model to illustrate the effect of carriers on HBV transmission ^[14]. A mathematical model was also developed to control HBV infection, which was later employed to formulate a strategy for eliminating HBV ^[7,15]. An age-structured model was proposed by Zheo et al. ^[16] for predicting HBV transmission and evaluating the effectiveness of vaccination programs in China. A model developed by Wang et al. ^[17] was used to analyze the impact of vaccination on a population and to assess other control measures for HBV infection, with further analysis and applications provided by Zhang and Zhou ^[18]. Khan et al. proposed a mathematical model aimed at controlling the spread of both chronic and acute HBV transmission ^[19]. Pulse vaccination epidemic models ^[20,21] have demonstrated that pulse vaccination can maintain the epidemic in a stable state by optimizing the quantity of vaccines administered and the intervals between vaccinations. However, the costs associated with vaccination and treatment strategies can be significant and may not always be feasible. Therefore, it is crucial to predict and implement vaccination and treatment strategies that are well-suited to the specific context. In this regard, Khan and Zaman developed a model for HBV transmission and vaccination ^[21], while Jaouade Danane and Karam Allali conducted mathematical analysis on the delayed treatment of HBV infection, considering the immune response and the role of DNA-containing capsids in the host's body ^[22].

Over the last decade, the study of mathematical models for biological systems has gained considerable attention within the scientific community, as highlighted in studies ^[23,24]. These models often incorporate state variables that are inherently nonnegative, such as physical attributes, chemical concentrations, population densities, and other measurable properties. Diffusion significantly impacts the spatial and temporal variation of these variables within a system. It represents the movement of particles such as molecules or individuals in a population from areas of higher concentration to areas of lower concentration. These models are generally built upon systems of differential equations. However, obtaining exact solutions to such systems is often challenging and complex, necessitating the use of approximation techniques. We aim to investigate the influence of diffusion on these models by employing numerical methods.

2. Model formulation

Hepatitis B, a viral infection instigated by the Hepatitis B Virus (HBV), predominantly targets and inflicts damage on the liver. A detailed mathematical model addressing HBV dynamics was recently introduced by Zada et al. ^[25]. This model categorizes the total population $N(t)$ at any time $t$ into five distinct compartments: the susceptible individuals $S(t)$ , those in the latent stage $L(t)$ , the acutely infectious group $I(t)$ , chronic HBV carriers $C(t)$ , and those who have recovered $R(t)$ . The total population at any time can thus be expressed as:

$N(t) = S(t) + L(t) + I(t) + C(t) + R(t).$

This equation captures the overall flow of individuals through different stages of the disease, describing how HBV transmission and progression occur across these compartments. The model is governed by the following set of nonlinear ordinary differential equations (ODEs):

$\begin{equation} \left\{ \begin{split} \frac{d S}{d t}& = \mu \omega(1-v C(t))-\left(\mu_0+\beta I(t)+\epsilon \beta C(t)+\gamma_3\right) S(t), \\ \frac{d L}{d t}& = (\beta I(t)+\epsilon \beta C(t)) S(t)-\left(\mu_0+\sigma\right) L(t), \\ \frac{d I}{d t}& = \sigma L(t)-\left(\mu_0+\gamma_1\right) I(t), \\ \frac{d C}{d t}& = \mu \omega \nu C(t)+q \gamma_1 I(t)-\left(\mu_0+\mu_1+\gamma_2\right) C(t), \\ \frac{d R}{d t}& = \gamma_2 C(t)+(1-q) \gamma_1 I(t)-\mu_0 R(t) . \end{split} \right. \end{equation}$

(2.1)

The parameters in the given system of equations represent various aspects of the hepatitis B virus (HBV) dynamics. The parameter $\mu$ represents the birth rate, and $\omega$ denotes the proportion of the population without vaccination. The term $\nu$ is the proportion of children born to chronically infected mothers who are unvaccinated. The parameter $v$ is related to the effect of treatment or intervention on the susceptible population, $S(t)$ . The rate $\mu_0$ represents the natural mortality rate, while $\beta$ is the transmission rate of the virus from acutely infected individuals, $I(t)$ , to susceptible individuals. Similarly, $\epsilon \beta$ indicates the transmission rate from chronically infected individuals, $C(t)$ , to susceptibles. The vaccination rate is represented by $\gamma_3$ , and $\sigma$ is the rate at which latently infected individuals, $L(t)$ , progress to the acute infection stage. The parameter $\gamma_1$ denotes the rate at which acutely infected individuals either recover or progress to chronic infection, with $q$ being the proportion that progresses to chronic infection. The chronic infection-related mortality rate is denoted by $\mu_1$ , and $\gamma_2$ is the rate at which chronically infected individuals recover.

The chronically infected compartment $C(t)$ is of particular importance because individuals in this class are at high risk of developing severe, long-term health complications, such as cirrhosis and hepatocellular carcinoma. These complications are associated with significant functional impairment, leading to long-term disability and reduced quality of life. Studying the dynamics of the chronically infected class helps in understanding the progression from acute infection to chronic disease and the subsequent disability burden, emphasizing the need for effective intervention strategies. Consequently, the model captures the dynamics of susceptible ( $S(t)$ ), latent ( $L(t)$ ), acutely infected ( $I(t)$ ), chronically infected ( $C(t)$ ), and recovered ( $R(t)$ ) populations over time, providing insights into both transmission and the long-term impact of HBV on population health.

Most of the HBV models available in the literature are non-spatial and assume that the population is well-mixed. However, this assumption can lead to inaccuracies, as the transmission dynamics of HBV are often influenced by spatial factors. To address this limitation, a spatially independent model introduced by Zada et al. ^[25] has been expanded to include spatial dynamics by incorporating a diffusion term, thereby creating a reaction-diffusion model. This model accounts for the movement of individuals and the consequent spatial variation in infection spread. The revised model is presented as follows:

$\begin{equation} \left\{\begin{split}& \frac{\partial S}{\partial t} = d_1\bigg(\frac{\partial^2 S}{\partial x^2}+\frac{\partial^2 S}{\partial y^2}\bigg)+\mu\omega(1-\nu C(t))-(\mu_{0}+\beta I(t)+\epsilon\beta C(t)+\gamma_{3})S(t), \\& \frac{\partial L}{\partial t} = d_2\bigg(\frac{\partial^2 L}{\partial x^2}+\frac{\partial^2 L}{\partial y^2}\bigg)+(\beta I(t)+\epsilon\beta C(t))S(t)-(\mu_{0}+\sigma)L(t),\\& \frac{\partial I}{\partial t} = d_3\bigg(\frac{\partial^2 I}{\partial x^2}+\frac{\partial^2 I}{\partial y^2}\bigg)+\sigma L(t)-(\mu_{0}+\gamma_{1})I(t),\\& \frac{\partial C}{\partial t} = d_4\bigg(\frac{\partial^2 C}{\partial x^2}+\frac{\partial^2 C}{\partial y^2}\bigg)+\mu\omega\nu C(t)+q\gamma_{1}I(t)-(\mu_{0}+\mu_{1}+\gamma_{2})C(t),\\& \frac{\partial R}{\partial t} = d_5\bigg(\frac{\partial^2 R}{\partial x^2}+\frac{\partial^2 R}{\partial y^2}\bigg)+\gamma_{2}C(t)+(1-q)\gamma_{1}I(t)-\mu_{0}R(t). \end{split}\right. \end{equation}$

(2.2)

The model is initialized with the following conditions:

$\begin{equation} \begin{split} S(x,y,0) & = f_{1}(x,y), \\ L(x,y,0) & = f_{2}(x,y), \\ I(x,y,0) & = f_{3}(x,y), \\ C(x,y,0) & = f_{4}(x,y), \\ R(x,y,0) & = f_{5}(x,y), \end{split} \end{equation}$

(2.3)

where $f_1(x, y)$ , $f_2(x, y)$ , $f_3(x, y)$ , $f_4(x, y)$ , and $f_5(x, y)$ define the initial spatial distributions of the various groups within the population, categorized by their disease status, respectively.

The boundary conditions for the reaction-diffusion model (2.2) are specified as homogeneous Neumann boundary conditions, which represent no flux across the boundaries. These conditions ensure that the spatial derivatives of the variables with respect to the boundary normals are zero. Mathematically, they are expressed as follows:

$\begin{equation} \frac{\partial S}{\partial n} = 0, \quad \frac{\partial L}{\partial n} = 0, \quad \frac{\partial I}{\partial n} = 0, \quad \frac{\partial C}{\partial n} = 0, \quad \frac{\partial R}{\partial n} = 0 \quad \text{on } \partial \Omega, \end{equation}$

(2.4)

where $\frac{\partial}{\partial n}$ denotes the derivative in the direction normal to the boundary $\partial \Omega$ , and $\Omega$ is the spatial domain. These boundary conditions indicate that there is no movement of individuals across the boundaries of the spatial domain, which is biologically reasonable for modeling the dynamics of populations within a confined area.

In this formulation, $S = S(x, y, t)$ , $L = L(x, y, t)$ , $I = I(x, y, t)$ , $C = C(x, y, t)$ , and $R = R(x, y, t)$ represent the spatially and temporally dependent compartments of the population. Since $R(t)$ is not directly involved in the first four equations, it is convenient to consider the system (2.2) as:

(2.5)

Subject to the initial conditions (2.3) and boundary conditions (2.4).

3. Steady states of the model

The hepatitis B epidemic model exhibits two primary equilibrium states: the disease-free equilibrium (DFE) and the endemic equilibrium (EE) ^[25]. The disease-free equilibrium occurs when the population is uninfected, represented mathematically as:

$\begin{equation} DFE = (S^0,L^0,I^0,C^{0}) = \left(\frac{\mu\omega}{\mu_{0}+\gamma_{3}},0,0,0\right). \end{equation}$

(3.1)

In contrast, the endemic equilibrium corresponds to a steady state where the disease persists within the population. This state can be expressed as:

$\begin{equation} EE = (S^{\ast},L^{\ast},I^{\ast},C^{\ast}). \end{equation}$

(3.2)

where

$\begin{equation} \left\{\begin{aligned}& S^{\ast} = \frac{l_{2}l_{3}(l_{1}-\mu\nu\omega)}{\beta\sigma(q\gamma_1\epsilon+l_{1}-\mu\nu\omega)},\\& L^{\ast} = \frac{-l_{2}^{2}l_{3}l_{4}(l_{1}-\mu\nu\omega)^2(R_{0}^{HBV}+1)}{\beta\sigma(q\epsilon\gamma_{1}+l_{1}-\mu\nu\omega) (l_{1}l_{2}l_{3}-\mu\nu(l_{3}\mu_{0}+\mu_{0}\gamma_{1}\omega-\omega\sigma\gamma_{1}(1-q)))},\\& I^{\ast} = \frac{-l_{2}l_{3}l_{4}(l_{1}-\mu\nu\omega)^2(R_{0}^{HBV}+1)}{\beta(q\epsilon\gamma_{1}+l_{1}-\mu\nu\omega) (l_{1}l_{2}l_{3}-\mu\nu(l_{3}\mu_{0}+\mu_{0}\gamma_{1}\omega-\omega\sigma\gamma_{1}(1-q)))},\\& C^{\ast} = \frac{l_{2}l_{3}l_{4}(l_{1}-\mu\nu\omega)(R_{0}^{HBV}+1)}{\beta(q\epsilon\gamma_{1}+l_{1}-\mu\nu\omega) (l_{1}l_{2}l_{3}-\mu\nu(l_{3}\mu_{0}+\mu_{0}\gamma_{1}\omega-\omega\sigma\gamma_{1}(1-q)))}, \end{aligned}\right. \end{equation}$

(3.3)

with the parameters defined as:

$l_{1} = \gamma_{2}+\mu_{0}+\mu_{1}, \quad l_{2} = \gamma_{1}+\mu_{0}, \quad l_{3} = \mu_{0}+\sigma, \quad l_{4} = \gamma_{3}+\mu_{0}.$

3.1. Basic reproduction number

The basic reproduction number, represented as $R_0^{HBV}$ , is determined here. This metric quantifies the average number of secondary infections caused by a single infection in a completely susceptible population. To calculate $R_0^{HBV}$ , it is essential to distinguish infected and noninfected populations, employing the next-generation matrix approach ^[26,27]. The variables $L, I, C$ , and $S$ represent the respective infected and noninfected cell populations. By isolating the infection terms, the infection-free equilibrium model is expressed using a matrix representation to account for infection terms $F$ and $V$ . The $F$ matrix specifically captures the terms driving the infection, while $V$ includes the remaining components of the system and $D$ represents the diffusion coefficients, as defined below:

$\begin{equation*} F = \begin{pmatrix} (\beta I + \epsilon \beta C) S \\ 0 & \\ 0 & \end{pmatrix}, \quad V = \begin{pmatrix} -(\mu_0 + \sigma)L \\ \sigma L - (\mu_0 + \gamma_1)I \\ (\mu\nu\omega C + q \gamma_1 I - (\mu_0 + \mu_1 + \gamma_2)C) \end{pmatrix}, \quad D = \begin{pmatrix} d_2 & 0 & 0 \\ 0 & d_3 & 0\\ 0 & 0 & d_4 \end{pmatrix}. \end{equation*}$

By calculating the Jacobian matrix of the system using $R_0^{HBV}$ , the matrices $F^*$ and $V^*$ are obtained as follows:

$\begin{equation*} F^* = \begin{pmatrix} 0 & \beta \frac{\mu \omega}{\mu_0+\gamma_3} & \epsilon \beta \frac{\mu \omega}{\mu_0+\gamma_3} \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}, \quad V^* = \begin{pmatrix} -\left(\mu_0+\sigma\right) & 0 & 0 \\ \sigma & -\left(\mu_0+\gamma_1\right) & 0 \\ 0 & q \gamma_1 & \mu \omega \nu-\left(\mu_0+\mu_1+\gamma_2\right) \end{pmatrix}. \end{equation*}$

At equilibrium, the transition and infection rates are described by $F^*$ and $V^*$ . The time spent in each state is determined by the inverse of $V^*$ , denoted as $V^{*-1}$ . During an outbreak, the number of new infections is derived from the product $F^* V^{*-1}$ . The dominant eigenvalue of $F^* V^{*-1}$ represents the basic reproduction number, which can be expressed as:

$\begin{equation} R_0^{HBV} = \frac{\sigma \beta \mu \omega \left(\epsilon q \gamma_1 - \left(\mu \nu \omega - (\mu_0 + \mu_1 + \gamma_2)\right)\right)}{(\mu_0 + \sigma)(\mu_0 + \gamma_1)(\mu_0 + \gamma_3)(\mu \omega \nu - (\mu_0 + \mu_1 + \gamma_2))}. \end{equation}$

(3.4)

The model predicts a disease-free state, whereas $R_0 > 1$ indicates the persistence of the disease, leading to the endemic equilibrium.

4. Stability of endemic equilibrium point

In this section, the stability of the two-dimensional diffusive epidemic model is examined. The system represented by Eq (2.5) tends to converge toward the equilibrium values $(S^{\ast}, L^{\ast}, I^{\ast}, C^{\ast})$ . To assess the stability of these equilibrium points, the system is linearized around the steady state, employing the methodology outlined in ^[28] and ^[29].

Theorem 4.1. For the system (2.5), the endemic equilibrium is locally asymptotically stable if and only if $R_0^{HBV} > 1$ and the following condition is satisfied:

$\mu_0 + \mu_1 + \gamma_2 - \mu \omega \nu - \frac{q \gamma_1 \sigma}{\mu_0 + \sigma} > 0.$

Proof. Let the perturbed variables of $S(x, y, t)$ , $L(x, y, t)$ , $I(x, y, t)$ , and $C(x, y, t)$ be denoted as $S_1(x, y, t)$ , $L_1(x, y, t)$ , $I_1(x, y, t)$ , and $C_1(x, y, t)$ , respectively. To investigate the stability of the system, we linearize Eq (2.5) around the equilibrium point $E^{*}$ , using the methodology described in ^[28,29]. The corresponding linearized system can be written as:

$\begin{equation} \left\{\begin{split}& \frac{\partial S}{\partial t} = a_{11} S_1(x,y,t)+a_{12} L_1(x,y,t)+a_{13} I_1(x,y,t) +a_{14} C_1(x,y,t) + d_1\bigg(\frac{\partial^2 S}{\partial x^2}+\frac{\partial^2 S}{\partial y^2}\bigg), \\& \frac{\partial L}{\partial t} = a_{21} S_1(x,y,t)+a_{22} L_1(x,y,t)+a_{23} I_1(x,y,t)+a_{24} C_1(x,y,t) + d_2\bigg(\frac{\partial^2 L}{\partial x^2}+\frac{\partial^2 L}{\partial y^2}\bigg),\\& \frac{\partial I}{\partial t} = a_{31} S_1(x,y,t)+a_{32} L_1(x,y,t)+a_{33} I_1(x,y,t)+a_{34} C_1(x,y,t) + d_3\bigg(\frac{\partial^2 I}{\partial x^2}+\frac{\partial^2 I}{\partial y^2}\bigg),\\& \frac{\partial C}{\partial t} = a_{41} S_1(x,y,t)+a_{42} L_1(x,y,t)+a_{43} I_1(x,y,t)+a_{44} C_1(x,y,t) + d_4\bigg(\frac{\partial^2 C}{\partial x^2}+\frac{\partial^2 C}{\partial y^2}\bigg). \end{split}\right. \end{equation}$

(4.1)

To solve the linearized system, a Fourier series approach is employed, as outlined in references ^[28] and ^[29]. The solution for $S_1(x, y, t), L_1(x, y, t), I_1(x, y, t)$ and $C_1(x, y, t)$ can be expressed as:

$\begin{equation} \left\{\begin{split} S_1(x,y,t) = & \sum\limits_{\zeta_1}\sum\limits_{\zeta_2} S_{\zeta_{1}\zeta_{2}} e^{\lambda t} cos(\zeta_1 x) cos(\zeta_2 y), \\ L_1(x,y,t) = & \sum\limits_{\zeta_1}\sum\limits_{\zeta_2} L_{\zeta_{1}\zeta_{2}} e^{\lambda t} cos(\zeta_1 x) cos(\zeta_2 y), \\ I_1(x,y,t) = & \sum\limits_{\zeta_1}\sum\limits_{\zeta_2} I_{\zeta_{1}\zeta_{2}} e^{\lambda t} cos(\zeta_1 x) cos(\zeta_2 y),\\ C_1(x,y,t) = & \sum\limits_{\zeta_1}\sum\limits_{\zeta_2} C_{\zeta_{1}\zeta_{2}} e^{\lambda t} cos(\zeta_1 x) cos(\zeta_2 y), \\ \end{split}\right. \end{equation}$

(4.2)

where $\zeta_i$ (for $i = 1, 2$ ) are the wave numbers corresponding to the nodes $n_i$ , with $\zeta_1 = \frac{n_1 \pi}{2}$ and $\zeta_2 = \frac{n_2 \pi}{2}$ . The functions $S_1(x, y, t)$ , $L_1(x, y, t)$ , $I_1(x, y, t)$ , and $C_1(x, y, t)$ are defined over the spatial domain $(x, y) \in \Omega \subset \mathbb{R}^2$ and time domain $t \in [0, T]$ , where $\Omega = [X_{\text{min}}, X_{\text{max}}] \times [Y_{\text{min}}, Y_{\text{max}}]$ represents the two-dimensional spatial region of interest. These functions represent perturbations around the equilibrium state and evolve over time under the specified boundary and initial conditions. By substituting the expressions for $S_1(x, y, t)$ , $L_1(x, y, t)$ , $I_1(x, y, t)$ , and $C_1(x, y, t)$ into the system, we obtain a system of equations suitable for further analysis.

$\begin{equation} \left\{\begin{split}& \sum\limits_{\zeta_1}\sum\limits_{\zeta_2} \left(a_{11}-d_1 \zeta_{1}^{2}-d_1 \zeta_{2}^{2}-\lambda\right) S_{\zeta_1 \zeta_2} + \sum\limits_{\zeta_1}\sum\limits_{\zeta_2} a_{12}L_{\zeta_1 \zeta_2} + \sum\limits_{\zeta_1}\sum\limits_{\zeta_2}a_{13}I_{\zeta_1 \zeta_2}+\sum\limits_{\zeta_1}\sum\limits_{\zeta_2}a_{14}C_{\zeta_1 \zeta_2} = 0, \\& \sum\limits_{\zeta_1}\sum\limits_{\zeta_2} a_{21} S_{\zeta_1 \zeta_2} + \sum\limits_{\zeta_1}\sum\limits_{\zeta_2} \left(a_{22}-d_2 \zeta_{1}^{2}-d_2 \zeta_{2}^{2}-\lambda\right)L_{\zeta_1 \zeta_2} + \sum\limits_{\zeta_1}\sum\limits_{\zeta_2}a_{23}I_{\zeta_1 \zeta_2}+\sum\limits_{\zeta_1}\sum\limits_{\zeta_2}a_{24}C_{\zeta_1 \zeta_2} = 0, \\& \sum\limits_{\zeta_1}\sum\limits_{\zeta_2} a_{31} S_{\zeta_1 \zeta_2} + \sum\limits_{\zeta_1}\sum\limits_{\zeta_2} a_{32}L_{\zeta_1 \zeta_2} + \sum\limits_{\zeta_1}\sum\limits_{\zeta_2} \left(a_{33}-d_3 \zeta_{1}^{2}-d_3 \zeta_{2}^{2}-\lambda\right) I_{\zeta_1 \zeta_2}+\sum\limits_{\zeta_1}\sum\limits_{\zeta_2}a_{34}C_{\zeta_1 \zeta_2} = 0,\\& \sum\limits_{\zeta_1}\sum\limits_{\zeta_2} a_{41} S_{\zeta_1 \zeta_2} + \sum\limits_{\zeta_1}\sum\limits_{\zeta_2} a_{42}L_{\zeta_1 \zeta_2} + \sum\limits_{\zeta_1}\sum\limits_{\zeta_2}a_{43} I_{\zeta_1 \zeta_2}+\sum\limits_{\zeta_1}\sum\limits_{\zeta_2}\left(a_{44}-d_4 \zeta_{1}^{2}-d_4 \zeta_{2}^{2}-\lambda\right)C_{\zeta_1 \zeta_2} = 0. \end{split}\right. \end{equation}$

(4.3)

The variational matrix $\mathbf{V}$ for (4.3) is

$\begin{equation} \mathbf{V} = \begin{pmatrix} a_{11}-d_1 \zeta_{1}^{2}-d_1 \zeta_{2}^{2}-\lambda & a_{12} & a_{13} & a_{14} \\ a_{21} & a_{22}-d_2 \zeta_{1}^{2}-d_2 \zeta_{2}^{2}-\lambda & a_{23} & a_{24} \\ a_{31} & a_{32} & a_{33}-d_3 \zeta_{1}^{2}-d_3 \zeta_{2}^{2}-\lambda & a_{34}\\ a_{41} & a_{42} & a_{43} & a_{44}-d_4 \zeta_{1}^{2}-d_4 \zeta_{2}^{2}-\lambda \end{pmatrix} \end{equation}$

(4.4)

where

$\begin{equation*} \begin{split} \mathbf{V} = & \left(\begin{array}{cc} -(\mu_{0}+\beta I^{\ast}+\epsilon\beta C^{\ast}+\gamma_{3}+d_1 \zeta_{1}^{2}+d_1 \zeta_{2}^{2})-\lambda & 0 \\ \beta I^{\ast}+\epsilon\beta C^{\ast} & -(\mu_{0}+\sigma+d_2 \zeta_{1}^{2}+d_2 \zeta_{2}^{2})-\lambda \\ 0 & \sigma \\ 0 & 0 \\ \end{array}\right. \\ &\; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \left.\begin{array}{cc} -\beta S^{\ast} & -(\mu\omega\nu+\epsilon\beta S^{\ast})\\ \beta S^{\ast} & \epsilon\beta S^{\ast}\\ -(\mu_{0}+\gamma_{1}+d_3 \zeta_{1}^{2}+d_3 \zeta_{2}^{2})-\lambda & 0\\ q\gamma_{1} & -(\mu_{0}+\mu_{1}+\gamma_{2}-\mu\omega\nu+d_4 \zeta_{1}^{2}+d_4 \zeta_{2}^{2})-\lambda \end{array}\right). \end{split} \end{equation*}$

To find the eigenvalues of the given matrix $\mathbf{V}$ , we reduce the matrix $\mathbf{V}$ to an upper triangular form by performing the following row operations:

To eliminate $a_{21}$ , subtract $\frac{a_{21}}{a_{11}} r_1$ from $r_2$ . The updated second row becomes:

$r_2' = \begin{bmatrix} 0, -(\mu_0 + \sigma+d_2 \zeta_{1}^{2}+d_2 \zeta_{2}^{2}), \beta S^\ast + \frac{(\beta I^\ast + \epsilon \beta C^\ast)(-\beta S^\ast)}{-(\mu_0 + \beta I^\ast + \epsilon \beta C^\ast + \gamma_3+d_1 \zeta_{1}^{2}+d_1 \zeta_{2}^{2})}, \epsilon \beta S^\ast - \frac{(\beta I^\ast + \epsilon \beta C^\ast)(-\epsilon \beta S^\ast - \mu \omega \nu)}{-(\mu_0 + \beta I^\ast + \epsilon \beta C^\ast + \gamma_3+d_1 \zeta_{1}^{2}+d_1 \zeta_{2}^{2})} \end{bmatrix}.$

To eliminate $a_{32}$ , subtract $\frac{a_{32}}{a_{22}} r_2$ from $r_3$ . The updated third row becomes:

$\begin{equation*} \begin{split} r_3' = & \left[\begin{array}{c} 0, 0, -\left(\mu_0 + \gamma_1+d_3 \zeta_{1}^{2}+d_3 \zeta_{2}^{2} + \frac{\sigma}{(\mu_0 + \sigma+d_2 \zeta_{1}^{2}+d_2 \zeta_{2}^{2})}\left(\beta S^\ast + \frac{(\beta I^\ast + \epsilon \beta C^\ast)(-\beta S^\ast)}{-(\mu_0 + \beta I^\ast + \epsilon \beta C^\ast + \gamma_3+d_1 \zeta_{1}^{2}+d_1 \zeta_{2}^{2})}\right)\right),\\ \end{array}\right. \\ &\; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \left.\begin{array}{c} -\frac{\sigma}{(\mu_0 + \sigma+d_2 \zeta_{1}^{2}+d_2 \zeta_{2}^{2})}\left(\epsilon \beta S^\ast - \frac{(\beta I^\ast + \epsilon \beta C^\ast)(-\epsilon \beta S^\ast - \mu \omega \nu)}{-(\mu_0 + \beta I^\ast + \epsilon \beta C^\ast + \gamma_3+d_1 \zeta_{1}^{2}+d_1 \zeta_{2}^{2})}\right) \end{array}\right]. \end{split} \end{equation*}$

To eliminate $a_{43}$ , subtract $\frac{a_{43}}{a_{33}} r_3$ from $r_4$ . The updated fourth row becomes:

$r_4' = \begin{bmatrix} 0, 0, 0, -\left(\mu_0 + \mu_1 + \gamma_2 +d_4 \zeta_{1}^{2}+d_4 \zeta_{2}^{2} - \mu \omega \nu - \frac{q \gamma_1 \sigma}{\mu_0 + \sigma}\right) \end{bmatrix}.$

After performing the row operations, the matrix is transformed into the following upper triangular form:

$\begin{equation*} \begin{split} \mathbf{V} = & \left(\begin{array}{cc} -(\mu_{0}+\beta I^{\ast}+\epsilon\beta C^{\ast}+\gamma_{3}+d_1 \zeta_{1}^{2}+d_1 \zeta_{2}^{2})-\lambda & 0 \\ 0 & -(\mu_0 + \sigma+d_2 \zeta_{1}^{2}+d_2 \zeta_{2}^{2})-\lambda \\ 0 & 0 \\ 0 & 0 \\ \end{array}\right. \\ &\; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \; \left.\begin{array}{cc} -\beta S^\ast & -(\epsilon \beta S^\ast + \mu \omega \nu)\\ \beta S^\ast + \frac{(\beta I^\ast + \epsilon \beta C^\ast)(-\beta S^\ast)}{-(\mu_0 + \beta I^\ast + \epsilon \beta C^\ast + \gamma_3+d_1 \zeta_{1}^{2}+d_1 \zeta_{2}^{2})} & \epsilon \beta S^\ast - \frac{(\beta I^\ast + \epsilon \beta C^\ast)(-\epsilon \beta S^\ast - \mu \omega \nu)}{-(\mu_0 + \beta I^\ast + \epsilon \beta C^\ast + \gamma_3+d_1 \zeta_{1}^{2}+d_1 \zeta_{2}^{2})}\\ - \mathcal{R}_3 & - \mathcal{R}_4 \\ 0 & -\left(+d_4 \zeta_{1}^{2}+d_4 \zeta_{2}^{2}+\mu_0 + \mu_1 + \gamma_2 - \mu \omega \nu - \frac{q \gamma_1 \sigma}{\mu_0 + \sigma}\right)-\lambda \end{array}\right). \end{split} \end{equation*}$

Thus,

$\begin{aligned} \mathcal{R}_3 & = \mu_0 + \gamma_1 + d_3 \zeta_{1}^{2} + d_3 \zeta_{2}^{2} + \frac{\sigma}{\mu_0 + \sigma} \left( \beta S^\ast + \frac{(\beta I^\ast + \epsilon \beta C^\ast)(\beta S^\ast)}{(\mu_0 + \beta I^\ast + \epsilon \beta C^\ast + \gamma_3 + d_1 \zeta_{1}^{2} + d_1 \zeta_{2}^{2})} \right), \\ \mathcal{R}_4 & = \frac{\sigma}{\mu_0 + \sigma} \left( \epsilon \beta S^\ast - \frac{(\beta I^\ast + \epsilon \beta C^\ast)(-\epsilon \beta S^\ast - \mu \omega \nu)}{-(\mu_0 + \beta I^\ast + \epsilon \beta C^\ast + \gamma_3 + d_1 \zeta_{1}^{2} + d_1 \zeta_{2}^{2})} \right). \end{aligned}$

The eigenvalues of the system are:

$\begin{aligned} \lambda_1 & = -\left(\mu_0 + \beta I^\ast + \epsilon \beta C^\ast + \gamma_3 +d_1 \zeta_{1}^{2}+d_1 \zeta_{2}^{2}\right), \quad \text{where } \lambda_1 < 0,\\ \lambda_2 & = -\left(\mu_0 + \sigma+d_2 \zeta_{1}^{2}+d_2 \zeta_{2}^{2}\right), \quad \text{where } \lambda_2 < 0,\\ \lambda_3 & = -\left(\mu_0 + \gamma_1 +d_3 \zeta_{1}^{2}+d_3 \zeta_{2}^{2}+ \frac{\sigma}{\mu_0 + \sigma}\left(\beta S^\ast + \frac{\left(\beta I^\ast + \epsilon \beta C^\ast\right)\left(\beta S^\ast\right)}{\left(\mu_0 + \beta I^\ast + \epsilon \beta C^\ast + \gamma_3+d_1 \zeta_{1}^{2}+d_1 \zeta_{2}^{2}\right)}\right)\right), \quad \text{where } \lambda_3 < 0,\\ \lambda_4 & = -\left(+d_4 \zeta_{1}^{2}+d_4 \zeta_{2}^{2}+\mu_0 + \mu_1 + \gamma_2 - \mu \omega \nu - \frac{q \gamma_1 \sigma}{\mu_0 + \sigma}\right). \end{aligned}$

The three eigenvalues $\lambda_1, \lambda_2, \lambda_3$ are explicitly negative as long as the biological parameters (rates and population sizes) remain positive and satisfy reasonable conditions. $\lambda_4$ depends on the term $\mu_0 + \mu_1 + \gamma_2 - \mu \omega \nu - \frac{q \gamma_1 \sigma}{\mu_0 + \sigma}$ , which determines whether it is positive or negative. For stability, $\lambda_4$ must satisfy:

$\mu_0 + \mu_1 + \gamma_2 - \mu \omega \nu - \frac{q \gamma_1 \sigma}{\mu_0 + \sigma} > 0.$

Consider the eigenvalue:

$\lambda = -\left(\mu_0 + \mu_1 + \gamma_2 - \mu \omega \nu - \frac{q \gamma_1 \sigma}{\mu_0 + \sigma}\right).$

The eigenvalue $\lambda$ is negative ( $\lambda < 0$ ) if and only if:

$\mu_0 + \mu_1 + \gamma_2 - \mu \omega \nu - \frac{q \gamma_1 \sigma}{\mu_0 + \sigma} > 0.$

This implies that the endemic equilibrium is locally stable when the above condition holds.

Conversely, if:

$\mu_0 + \mu_1 + \gamma_2 - \mu \omega \nu - \frac{q \gamma_1 \sigma}{\mu_0 + \sigma} < 0,$

then $\lambda > 0$ , indicating that the equilibrium is unstable.

The basic reproduction number $R_0^{HBV}$ is given by:

$R_0^{HBV} = \frac{\sigma \beta \mu \omega \left(\epsilon q \gamma_1 - \left(\mu \nu \omega - (\mu_0 + \mu_1 + \gamma_2)\right)\right)}{(\mu_0 + \sigma)(\mu_0 + \gamma_1)(\mu_0 + \gamma_3)(\mu \omega \nu - (\mu_0 + \mu_1 + \gamma_2))}.$

For $R_0^{HBV} > 1$ , the term $\mu \omega \nu - (\mu_0 + \mu_1 + \gamma_2)$ in the denominator must be positive:

$\mu \omega \nu > \mu_0 + \mu_1 + \gamma_2.$

This implies that the inequality:

$\mu_0 + \mu_1 + \gamma_2 - \mu \omega \nu - \frac{q \gamma_1 \sigma}{\mu_0 + \sigma} > 0,$

holds in the endemic state ( $R_0^{HBV} > 1$ ). In contrast, when $R_0^{HBV} \leq 1$ , the condition flips, and the equilibrium becomes unstable ( $\lambda > 0$ ).

5. Numerical schemes

The finite difference (FD) schemes are formulated by discretizing the computational domain $[0, L]^2 \times [0, T]$ into a grid comprising $M^2 \times N$ discrete points. The spatial and temporal step sizes are defined as $h = \frac{L}{M}$ and $\tau = \frac{T}{N}$ , respectively. The coordinates of the grid points are expressed as:

$\begin{equation} \begin{split} x_{\zeta_1} & = \zeta_1 h, \quad \zeta_1 = 1, 2, 3, \dots, M, \\ y_{\zeta_2} & = \zeta_2 h, \quad \zeta_2 = 1, 2, 3, \dots, M, \\ t_n & = n\tau, \quad n = 1, 2, 3, \dots, N, \end{split} \end{equation}$

(5.1)

where $\zeta_1$ and $\zeta_2$ denote spatial indices, and $n$ represents the temporal index. The FD approximations of the variables $S^{n}_{\zeta_1, \zeta_2}$ , $L^{n}_{\zeta_1, \zeta_2}$ , $I^{n}_{\zeta_1, \zeta_2}$ , and $C^{n}_{\zeta_1, \zeta_2}$ are given as $S(\zeta_1h, \zeta_2h, n\tau)$ , $L(\zeta_1h, \zeta_2h, n\tau)$ , $I(\zeta_1h, \zeta_2h, n\tau)$ , and $C(\zeta_1h, \zeta_2h, n\tau)$ , respectively.

5.1. Finite difference method

In this section, we discuss the implementation of the forward Euler finite difference (FD) scheme for solving the two-dimensional reaction-diffusion model describing hepatitis B dynamics. In this method, the time derivative is discretized using a forward difference approach, while the spatial derivatives are handled through a central difference scheme. The forward Euler FD scheme applied to system (2.5) is expressed as follows:

$\begin{equation} \begin{split} S^{n+1}_{\zeta_1,\zeta_2} & = S^{n}_{\zeta_1,\zeta_2}+\lambda_1\bigg(S^{n}_{\zeta_1-1,\zeta_2}+S^{n}_{\zeta_1+1,\zeta_2}-4S^{n}_{\zeta_1,\zeta_2}+S^{n}_{\zeta_1,\zeta_2-1}+S^{n}_{\zeta_1,\zeta_2+1}\bigg) \\ &+\tau\mu\omega(1-\nu C^{n}_{\zeta_1,\zeta_2})-\tau(\mu_{0}+\beta I^{n}_{\zeta_1,\zeta_2}+\epsilon\beta C^{n}_{\zeta_1,\zeta_2}+\gamma_{3})S^{n}_{\zeta_1,\zeta_2}, \\ L^{n+1}_{\zeta_1,\zeta_2} & = L^{n}_{\zeta_1,\zeta_2}+\lambda_2\bigg(L^{n}_{\zeta_1-1,\zeta_2}+L^{n}_{\zeta_1+1,\zeta_2}-4L^{n}_{\zeta_1,\zeta_2}+L^{n}_{\zeta_1,\zeta_2-1}+L^{n}_{\zeta_1,\zeta_2+1}\bigg) \\ &+\tau(\beta I^{n}_{\zeta_1,\zeta_2}+\epsilon\beta C^{n}_{\zeta_1,\zeta_2})S^{n}_{\zeta_1,\zeta_2}-\tau(\mu_{0}+\sigma)L^{n}_{\zeta_1,\zeta_2}, \\ I^{n+1}_{\zeta_1,\zeta_2} & = I^{n}_{\zeta_1,\zeta_2}+\lambda_3\bigg(I^{n}_{\zeta_1-1,\zeta_2}+I^{n}_{\zeta_1+1,\zeta_2}-4I^{n}_{\zeta_1,\zeta_2}+I^{n}_{\zeta_1,\zeta_2-1}+I^{n}_{\zeta_1,\zeta_2+1}\bigg) \\ &+\tau\sigma L^{n}_{\zeta_1,\zeta_2}-\tau(\mu_{0}+\gamma_{1})I^{n}_{\zeta_1,\zeta_2}, \\ C^{n+1}_{\zeta_1,\zeta_2} & = C^{n}_{\zeta_1,\zeta_2}+\lambda_4\bigg(C^{n}_{\zeta_1-1,\zeta_2}+C^{n}_{\zeta_1+1,\zeta_2}-4C^{n}_{\zeta_1,\zeta_2}+C^{n}_{\zeta_1,\zeta_2-1}+C^{n}_{\zeta_1,\zeta_2+1}\bigg) \\ &+\tau\mu\omega\nu C^{n}_{\zeta_1,\zeta_2}+\tau q\gamma_{1}I^{n}_{\zeta_1,\zeta_2}-\tau(\mu_{0}+\mu_{1}+\gamma_{2})C^{n}_{\zeta_1,\zeta_2}, \\ \end{split} \end{equation}$

(5.2)

where

$\begin{equation} \begin{split} \lambda_1 & = \frac{d_1 \tau}{h^2}, \\ \lambda_2 & = \frac{d_2 \tau}{h^2}, \\ \lambda_3 & = \frac{d_3 \tau}{h^2}, \\ \lambda_4 & = \frac{d_4 \tau}{h^2}. \end{split} \end{equation}$

(5.3)

5.2. Crank Nicolson method

In this section, we apply the Crank-Nicolson operator splitting finite difference (OS-FD) scheme to numerically solve the hepatitis B epidemic model. Typically, reaction-diffusion equations are decomposed into two distinct subsystems. The first subsystem addresses the nonlinear reaction terms over a half-time step, while the second subsystem deals with the linear diffusion terms during the subsequent time step. The implementation of the Crank-Nicolson OS-FD scheme begins with the following procedure for the initial time step:

$\begin{equation} \begin{split} S^{n+\frac{1}{3}}_{\zeta_1,\zeta_2} & = S^{n}_{\zeta_1,\zeta_2}+\tau\mu\omega(1-\nu C^{n}_{\zeta_1,\zeta_2})-\tau(\mu_{0}+\beta I^{n}_{\zeta_1,\zeta_2}+\epsilon\beta C^{n}_{\zeta_1,\zeta_2}+\gamma_{3})S^{n}_{\zeta_1,\zeta_2}, \\ L^{n+\frac{1}{3}}_{\zeta_1,\zeta_2} & = L^{n}_{\zeta_1,\zeta_2}+\tau(\beta I^{n}_{\zeta_1,\zeta_2}+\epsilon\beta C^{n}_{\zeta_1,\zeta_2})S^{n}_{\zeta_1,\zeta_2}-\tau(\mu_{0}+\sigma)L^{n}_{\zeta_1,\zeta_2}, \\ I^{n+\frac{1}{3}}_{\zeta_1,\zeta_2} & = I^{n}_{\zeta_1,\zeta_2}+\tau\sigma L^{n}_{\zeta_1,\zeta_2}-\tau(\mu_{0}+\gamma_{1})I^{n}_{\zeta_1,\zeta_2},\\ C^{n+\frac{1}{3}}_{\zeta_1,\zeta_2} & = C^{n}_{\zeta_1,\zeta_2}+\tau\mu\omega\nu C^{n}_{\zeta_1,\zeta_2}+\tau q\gamma_{1}I^{n}_{\zeta_1,\zeta_2}-\tau(\mu_{0}+\mu_{1}+\gamma_{2})C^{n}_{\zeta_1,\zeta_2}. \end{split} \end{equation}$

(5.4)

In the second step, the methodology applied for the Crank-Nicolson OS-FD scheme is as follows:

$\begin{equation} \begin{split} -\frac{\lambda_1}{2} S^{n+\frac{2}{3}}_{\zeta_1-1,\zeta_2}+(1+\lambda_1)S^{n+\frac{2}{3}}_{\zeta_1,\zeta_2}-\frac{\lambda_1}{2}S^{n+\frac{2}{3}}_{\zeta_1+1,\zeta_2} & = \frac{\lambda_1}{2} S^{n+\frac{1}{3}}_{\zeta_1-1,\zeta_2}+(1-\lambda_1)S^{n+\frac{1}{3}}_{\zeta_1,\zeta_2}+\frac{\lambda_1}{2}S^{n+\frac{1}{3}}_{\zeta_1+1,\zeta_2}, \\ -\frac{\lambda_2}{2} L^{n+\frac{2}{3}}_{\zeta_1-1,\zeta_2}+(1+\lambda_2)L^{n+\frac{2}{3}}_{\zeta_1,\zeta_2}-\frac{\lambda_2}{2}L^{n+\frac{2}{3}}_{\zeta_1+1,\zeta_2} & = \frac{\lambda_2}{2} L^{n+\frac{1}{3}}_{\zeta_1-1,\zeta_2}+(1-\lambda_2)L^{n+\frac{1}{3}}_{\zeta_1,\zeta_2}+\frac{\lambda_2}{2}L^{n+\frac{1}{3}}_{\zeta_1+1,\zeta_2}, \\ -\frac{\lambda_3}{2} I^{n+\frac{2}{3}}_{\zeta_1-1,\zeta_2}+(1+\lambda_3)I^{n+\frac{2}{3}}_{\zeta_1,\zeta_2}-\frac{\lambda_3}{2}I^{n+\frac{2}{3}}_{\zeta_1+1,\zeta_2} & = \frac{\lambda_3}{2} I^{n+\frac{1}{3}}_{\zeta_1-1,\zeta_2}+(1-\lambda_3)I^{n+\frac{1}{3}}_{\zeta_1,\zeta_2}+\frac{\lambda_3}{2}I^{n+\frac{1}{3}}_{\zeta_1+1,\zeta_2}, \\ -\frac{\lambda_3}{2} C^{n+\frac{2}{3}}_{\zeta_1-1,\zeta_2}+(1+\lambda_3)C^{n+\frac{2}{3}}_{\zeta_1,\zeta_2}-\frac{\lambda_3}{2}C^{n+\frac{2}{3}}_{\zeta_1+1,\zeta_2} & = \frac{\lambda_3}{2} C^{n+\frac{1}{3}}_{\zeta_1-1,\zeta_2}+(1-\lambda_3)C^{n+\frac{1}{3}}_{\zeta_1,\zeta_2}+\frac{\lambda_3}{2}C^{n+\frac{1}{3}}_{\zeta_1+1,\zeta_2}. \end{split} \end{equation}$

(5.5)

The approach for the third step involves the following process:

$\begin{equation} \begin{split} -\frac{\lambda_1}{2} S^{n+1}_{\zeta_1,\zeta_2-1}+(1+\lambda_1)S^{n+1}_{\zeta_1,\zeta_2}-\frac{\lambda_1}{2}S^{n+1}_{\zeta_1,\zeta_2+1} & = \frac{\lambda_1}{2} S^{n+\frac{2}{3}}_{\zeta_1,\zeta_2-1}+(1-\lambda_1)S^{n+\frac{2}{3}}_{\zeta_1,\zeta_2}+\frac{\lambda_1}{2}S^{n+\frac{2}{3}}_{\zeta_1,\zeta_2+1}, \\ -\frac{\lambda_2}{2} L^{-n+1}_{\zeta_1,\zeta_2-1}+(1+\lambda_2)L^{-n+1}_{\zeta_1,\zeta_2}-\frac{\lambda_2}{2}l^{-n+1}_{\zeta_1,\zeta_2+1} & = \frac{\lambda_2}{2} L^{n+\frac{2}{3}}_{\zeta_1,\zeta_2-1}+(1-\lambda_2)L^{n+\frac{2}{3}}_{\zeta_1,\zeta_2}+\frac{\lambda_2}{2}L^{n+\frac{2}{3}}_{\zeta_1,\zeta_2+1}, \\ -\frac{\lambda_3}{2} I^{n+1}_{\zeta_1,\zeta_2-1}+(1+\lambda_3)I^{n+1}_{\zeta_1,\zeta_2}-\frac{\lambda_3}{2}I^{n+1}_{\zeta_1,\zeta_2+1} & = \frac{\lambda_3}{2} I^{n+\frac{2}{3}}_{\zeta_1,\zeta_2-1}+(1-\lambda_3)I^{n+\frac{2}{3}}_{\zeta_1,\zeta_2}+\frac{\lambda_3}{2}I^{n+\frac{2}{3}}_{\zeta_1,\zeta_2+1}, \\ -\frac{\lambda_4}{2} C^{n+1}_{\zeta_1,\zeta_2-1}+(1+\lambda_4)C^{n+1}_{\zeta_1,\zeta_2}-\frac{\lambda_4}{2}C^{n+1}_{\zeta_1,\zeta_2+1} & = \frac{\lambda_4}{2} C^{n+\frac{2}{3}}_{\zeta_1,\zeta_2-1}+(1-\lambda_4)C^{n+\frac{2}{3}}_{\zeta_1,\zeta_2}+\frac{\lambda_4}{2}C^{n+\frac{2}{3}}_{\zeta_1,\zeta_2+1}. \end{split} \end{equation}$

(5.6)

The Crank Nicolson OS-FD scheme is unconditionally stable.

5.3. Unconditionally positivity preserving method

In this section, we design a UPP-FD scheme for the hepatitis B epidemic model in two dimensions. The rules for designing the UPP-FD scheme are based on the rules given by Mickens ^[30]. The UPP-FD scheme for Susceptible in Eq (2.5) is designed as follows:

$\begin{equation} \begin{split} S^{n+1}_{\zeta_1,\zeta_2} & = S^{n}_{\zeta_1,\zeta_2}+\lambda_1(S^{n}_{\zeta_1-1,\zeta_2}+S^{n}_{\zeta_1+1,\zeta_2} +S^{n}_{\zeta_1,\zeta_2-1}+S^{n}_{\zeta_1,\zeta_2+1})-4\lambda_1S^{n+1}_{\zeta_1,\zeta_2} \\ &+\tau\mu\omega(1-\nu C^{n}_{\zeta_1,\zeta_2})-\tau(\mu_{0}+\beta I^{n}_{\zeta_1,\zeta_2}+\epsilon\beta C^{n}_{\zeta_1,\zeta_2}+\gamma_{3})S^{n+1}_{\zeta_1,\zeta_2}, \end{split} \end{equation}$

(5.7)

$(1+4\lambda_1+\tau(\mu_{0}+\beta I^{n}_{\zeta_1,\zeta_2}+\epsilon\beta C^{n}_{\zeta_1,\zeta_2}+\gamma_{3})) S^{n+1}_{\zeta_1,\zeta_2} = S^{n}_{\zeta_1,\zeta_2}+\lambda_1(S^{n}_{\zeta_1-1,\zeta_2}+S^{n}_{\zeta_1+1,\zeta_2} +S^{n}_{\zeta_1,\zeta_2-1} +S^{n}_{\zeta_1,\zeta_2+1})+\tau\mu\omega(1-\nu C^{n}_{\zeta_1,\zeta_2}),$

$\begin{equation} S^{n+1}_{\zeta_1,\zeta_2} = \left[\frac{S^{n}_{\zeta_1,\zeta_2}+\lambda_1(S^{n}_{\zeta_1-1,\zeta_2}+S^{n}_{\zeta_1+1,\zeta_2} +S^{n}_{\zeta_1,\zeta_2-1}+S^{n}_{\zeta_1,\zeta_2+1})+\tau\mu\omega(1-\nu C^{n}_{\zeta_1,\zeta_2})} {(1+4\lambda_1+\tau(\mu_{0}+\beta I^{n}_{\zeta_1,\zeta_2}+\epsilon\beta C^{n}_{\zeta_1,\zeta_2}+\gamma_{3}))}\right]. \end{equation}$

(5.8)

The UPP-FD scheme for Exposed in Eq (2.5) is designed as follows:

$\begin{equation} \begin{split} L^{n+1}_{\zeta_1,\zeta_2} & = L^{n}_{\zeta_1,\zeta_2}+\lambda_2(L^{n}_{\zeta_1-1,\zeta_2}+L^{n}_{\zeta_1+1,\zeta_2} +L^{n}_{\zeta_1,\zeta_2-1}+L^{n}_{\zeta_1,\zeta_2+1})-4\lambda_2L^{n+1}_{\zeta_1,\zeta_2} \\ &+\tau(\beta I^{n}_{\zeta_1,\zeta_2}+\epsilon\beta C^{n}_{\zeta_1,\zeta_2})S^{n}_{\zeta_1,\zeta_2}-\tau(\mu_{0}+\sigma)L^{n+1}_{\zeta_1,\zeta_2}, \end{split} \end{equation}$

(5.9)

$(1+4\lambda_2+\tau(\mu_{0}+\sigma))L^{n+1}_{\zeta_1,\zeta_2} = L^{n}_{\zeta_1,\zeta_2}+\lambda_2(L^{n}_{\zeta_1-1,\zeta_2}+L^{n}_{\zeta_1+1,\zeta_2} +L^{n}_{\zeta_1,\zeta_2-1} +L^{n}_{\zeta_1,\zeta_2+1})+\tau(\beta I^{n}_{\zeta_1,\zeta_2}+\epsilon\beta C^{n}_{\zeta_1,\zeta_2})S^{n}_{\zeta_1,\zeta_2},$

$\begin{equation} L^{n+1}_{\zeta_1,\zeta_2} = \left[\frac{L^{n}_{\zeta_1,\zeta_2}+\lambda_2(L^{n}_{\zeta_1-1,\zeta_2}+L^{n}_{\zeta_1+1,\zeta_2} +L^{n}_{\zeta_1,\zeta_2-1}+L^{n}_{\zeta_1,\zeta_2+1})+\tau(\beta I^{n}_{\zeta_1,\zeta_2}+\epsilon\beta C^{n}_{\zeta_1,\zeta_2})S^{n}_{\zeta_1,\zeta_2}}{(1+4\lambda_2+\tau(\mu_{0}+\sigma))}\right]. \end{equation}$

(5.10)

The UPP-FD scheme for Infected in Eq (2.5) is designed as follows:

$\begin{equation} \begin{split} I^{n+1}_{\zeta_1,\zeta_2} & = I^{n}_{\zeta_1,\zeta_2}+\lambda_3(I^{n}_{\zeta_1-1,\zeta_2}+I^{n}_{\zeta_1+1,\zeta_2} +I^{n}_{\zeta_1,\zeta_2-1}+I^{n}_{\zeta_1,\zeta_2+1})-4\lambda_3I^{n+1}_{\zeta_1,\zeta_2} \\ &+\tau\sigma L^{n}_{\zeta_1,\zeta_2}-\tau(\mu_{0}+\gamma_{1})I^{n+1}_{\zeta_1,\zeta_2}, \end{split} \end{equation}$

(5.11)

$(1+4\lambda_3+\tau(\mu_{0}+\gamma_{1}))I^{n+1}_{\zeta_1,\zeta_2} = I^{n}_{\zeta_1,\zeta_2}+\lambda_3(I^{n}_{\zeta_1-1,\zeta_2}+I^{n}_{\zeta_1+1,\zeta_2} +I^{n}_{\zeta_1,\zeta_2-1} +I^{n}_{\zeta_1,\zeta_2+1})+\tau\sigma L^{n}_{\zeta_1,\zeta_2},$

$\begin{equation} I^{n+1}_{\zeta_1,\zeta_2} = \left[\frac{I^{n}_{\zeta_1,\zeta_2}+\lambda_3(I^{n}_{\zeta_1-1,\zeta_2}+I^{n}_{\zeta_1+1,\zeta_2} +I^{n}_{\zeta_1,\zeta_2-1}+I^{n}_{\zeta_1,\zeta_2+1})+\tau\sigma L^{n}_{\zeta_1,\zeta_2}}{(1+4\lambda_3+\tau(\mu_{0}+\gamma_{1}))}\right]. \end{equation}$

(5.12)

The UPP-FD scheme for chronic in Eq (2.5) is designed as follows:

$\begin{equation} \begin{split} C^{n+1}_{\zeta_1,\zeta_2} & = C^{n}_{\zeta_1,\zeta_2}+\lambda_4(C^{n}_{\zeta_1-1,\zeta_2}+C^{n}_{\zeta_1+1,\zeta_2} +C^{n}_{\zeta_1,\zeta_2-1}+C^{n}_{\zeta_1,\zeta_2+1})-4\lambda_4C^{n+1}_{\zeta_1,\zeta_2} \\ &+\tau\mu\omega\nu C^{n+1}_{\zeta_1,\zeta_2}+\tau q\gamma_{1}I^{n}_{\zeta_1,\zeta_2}-\tau(\mu_{0}+\mu_{1}+\gamma_{2})C^{n+1}_{\zeta_1,\zeta_2}, \end{split} \end{equation}$

(5.13)

$(1+4\lambda_4-\tau\mu\omega\nu +\tau(\mu_{0}+\mu_{1}+\gamma_{2}))C^{n+1}_{\zeta_1,\zeta_2} = C^{n}_{\zeta_1,\zeta_2}+\lambda_4(C^{n}_{\zeta_1-1,\zeta_2}+C^{n}_{\zeta_1+1,\zeta_2} +C^{n}_{\zeta_1,\zeta_2-1} +C^{n}_{\zeta_1,\zeta_2+1})+\tau q\gamma_{1}I^{n}_{\zeta_1,\zeta_2},$

$\begin{equation} C^{n+1}_{\zeta_1,\zeta_2} = \left[\frac{C^{n}_{\zeta_1,\zeta_2}+\lambda_4(C^{n}_{\zeta_1-1,\zeta_2}+C^{n}_{\zeta_1+1,\zeta_2} +C^{n}_{\zeta_1,\zeta_2-1}+C^{n}_{\zeta_1,\zeta_2+1})+\tau q\gamma_{1}I^{n}_{\zeta_1,\zeta_2}}{(1+4\lambda_4-\tau\mu\omega\nu +\tau(\mu_{0}+\mu_{1}+\gamma_{2}))}\right]. \end{equation}$

(5.14)

Theorem 5.1. The finite difference approximation method UPP-FD, as outlined in Eqs (5.8), (5.10), (5.12), and (5.14), preserves the positivity of the solution under the assumption that the initial conditions are non-negative. Specifically, it holds that:

$S^{n}_{\zeta_1,\zeta_2} \geq 0, L^{n}_{\zeta_1,\zeta_2} \geq 0,$

$\begin{equation} I^{n}_{\zeta_1,\zeta_2} \Rightarrow S^{n+1}_{\zeta_1,\zeta_2} \geq 0, L^{n+1}_{\zeta_1,\zeta_2} \geq 0, I^{n+1}_{\zeta_1,\zeta_2} \geq 0, C^{n+1}_{\zeta_1,\zeta_2} \geq 0. \end{equation}$

(5.15)

Proof. We prove positivity preservation by induction, analyzing each equation in the UPP-FD scheme. The proof involves the following steps:

Step 1. Base case:

Assume that at $n = 0$ , the initial conditions satisfy:

$S^0_{\zeta_1,\zeta_2} \geq 0, \quad L^0_{\zeta_1,\zeta_2} \geq 0, \quad I^0_{\zeta_1,\zeta_2} \geq 0, \quad C^0_{\zeta_1,\zeta_2} \geq 0,$

for all $\zeta_1, \zeta_2$ . These initial conditions are biologically realistic, as population densities cannot be negative. Thus, the base case is satisfied.

Step 2. Inductive hypothesis:

Assume that at time step $n$ , the positivity condition holds:

$S^n_{\zeta_1,\zeta_2} \geq 0, \quad L^n_{\zeta_1,\zeta_2} \geq 0, \quad I^n_{\zeta_1,\zeta_2} \geq 0, \quad C^n_{\zeta_1,\zeta_2} \geq 0,$

for all $\zeta_1, \zeta_2$ .

We aim to prove that the positivity condition holds at time step $n+1$ :

$S^{n+1}_{\zeta_1,\zeta_2} \geq 0, \quad L^{n+1}_{\zeta_1,\zeta_2} \geq 0, \quad I^{n+1}_{\zeta_1,\zeta_2} \geq 0, \quad C^{n+1}_{\zeta_1,\zeta_2} \geq 0.$

Step 3. Inductive step:

We prove positivity for each state variable at $n+1$ , starting with $S^{n+1}_{\zeta_1, \zeta_2}$ .

Positivity of $S^{n+1}_{\zeta_1, \zeta_2}$ , from (5.8), we have:

$S^{n+1}_{\zeta_1,\zeta_2} = \frac{S^n_{\zeta_1,\zeta_2} + \lambda_1 \left(S^n_{\zeta_1-1,\zeta_2} + S^n_{\zeta_1+1,\zeta_2} + S^n_{\zeta_1,\zeta_2-1} + S^{n}_{\zeta_1,\zeta_2+1}\right) + \tau \mu \omega (1 - \nu C^n_{\zeta_1,\zeta_2})}{1 + 4\lambda_1 + \tau (\mu_0 + \beta I^n_{\zeta_1,\zeta_2} + \epsilon \beta C^n_{\zeta_1,\zeta_2} + \gamma_3)}.$

Numerator analysis:

● $S^n_{\zeta_1, \zeta_2} \geq 0$ by the inductive hypothesis.

● $\lambda_1 \left(S^n_{\zeta_1-1, \zeta_2} + S^n_{\zeta_1+1, \zeta_2} + S^n_{\zeta_1, \zeta_2-1} + S^{n}_{\zeta_1, \zeta_2+1}\right) \geq 0$ , as $\lambda_1 > 0$ and neighboring terms are non-negative.

● $\tau \mu \omega (1 - \nu C^n_{\zeta_1, \zeta_2}) \geq 0$ , since $C^n_{\zeta_1, \zeta_2} \geq 0$ and $\nu < 1$ .

Denominator analysis: The denominator $1 + 4\lambda_1 + \tau (\mu_0 + \beta I^n_{\zeta_1, \zeta_2} + \epsilon \beta C^n_{\zeta_1, \zeta_2} + \gamma_3) > 0$ , as all parameters are positive. Thus, $S^{n+1}_{\zeta_1, \zeta_2} \geq 0$ .

Positivity of $L^{n+1}_{\zeta_1, \zeta_2}$ , from (5.10), we have:

$L^{n+1}_{\zeta_1,\zeta_2} = \frac{L^n_{\zeta_1,\zeta_2} + \lambda_2 \left(L^n_{\zeta_1-1,\zeta_2} + L^n_{\zeta_1+1,\zeta_2} + L^n_{\zeta_1,\zeta_2-1} + L^{n}_{\zeta_1,\zeta_2+1}\right) + \tau (\beta I^n_{\zeta_1,\zeta_2} + \epsilon \beta C^n_{\zeta_1,\zeta_2}) S^n_{\zeta_1,\zeta_2}}{1 + 4\lambda_2 + \tau (\mu_0 + \sigma)}.$

Numerator analysis:

● $L^n_{\zeta_1, \zeta_2} \geq 0$ by the inductive hypothesis.

● $\lambda_2 \left(L^n_{\zeta_1-1, \zeta_2} + L^n_{\zeta_1+1, \zeta_2} + L^n_{\zeta_1, \zeta_2-1} + L^{n}_{\zeta_1, \zeta_2+1}\right) \geq 0$ .

● $\tau (\beta I^n_{\zeta_1, \zeta_2} + \epsilon \beta C^n_{\zeta_1, \zeta_2}) S^n_{\zeta_1, \zeta_2} \geq 0$ .

Denominator analysis: The denominator $1 + 4\lambda_2 + \tau (\mu_0 + \sigma) > 0$ . Thus, $L^{n+1}_{\zeta_1, \zeta_2} \geq 0$ .

Positivity of $I^{n+1}_{\zeta_1, \zeta_2}$ , from (5.12), we have:

$I^{n+1}_{\zeta_1,\zeta_2} = \frac{I^n_{\zeta_1,\zeta_2} + \lambda_3 \left(I^n_{\zeta_1-1,\zeta_2} + I^n_{\zeta_1+1,\zeta_2} + I^n_{\zeta_1,\zeta_2-1} + I^{n}_{\zeta_1,\zeta_2+1}\right) + \tau \sigma L^n_{\zeta_1,\zeta_2}}{1 + 4\lambda_3 + \tau (\mu_0 + \gamma_1)}.$

Numerator analysis:

● $I^n_{\zeta_1, \zeta_2} \geq 0$ .

● $\lambda_3 \left(I^n_{\zeta_1-1, \zeta_2} + I^n_{\zeta_1+1, \zeta_2} + I^n_{\zeta_1, \zeta_2-1} + I^{n}_{\zeta_1, \zeta_2+1}\right) \geq 0$ .

● $\tau \sigma L^n_{\zeta_1, \zeta_2} \geq 0$ .

Denominator analysis: The denominator $1 + 4\lambda_3 + \tau (\mu_0 + \gamma_1) > 0$ . Thus, $I^{n+1}_{\zeta_1, \zeta_2} \geq 0$ .

Positivity of $C^{n+1}_{\zeta_1, \zeta_2}$ , from (5.14), we have:

$C^{n+1}_{\zeta_1,\zeta_2} = \frac{C^n_{\zeta_1,\zeta_2} + \lambda_4 \left(C^n_{\zeta_1-1,\zeta_2} + C^n_{\zeta_1+1,\zeta_2} + C^n_{\zeta_1,\zeta_2-1} + C^{n}_{\zeta_1,\zeta_2+1}\right) + \tau q \gamma_1 I^n_{\zeta_1,\zeta_2}}{1 + 4\lambda_4 - \tau \mu \omega \nu + \tau (\mu_0 + \mu_1 + \gamma_2)}.$

Numerator analysis:

● $C^n_{\zeta_1, \zeta_2} \geq 0$ .

● $\lambda_4 \left(C^n_{\zeta_1-1, \zeta_2} + C^n_{\zeta_1+1, \zeta_2} + C^n_{\zeta_1, \zeta_2-1} + C^{n}_{\zeta_1, \zeta_2+1}\right) \geq 0$ .

● $\tau q \gamma_1 I^n_{\zeta_1, \zeta_2} \geq 0$ .

Denominator analysis: The denominator $1 + 4\lambda_4 - \tau \mu \omega \nu + \tau (\mu_0 + \mu_1 + \gamma_2) > 0$ . Thus, $C^{n+1}_{\zeta_1, \zeta_2} \geq 0$ . By induction, positivity is preserved at all time steps $n+1$ , provided the initial conditions are positive. □

Remark 5.1. The Eqs (5.8), (5.10), (5.12) and (5.14) ensure a positive solution due to the non-negativity of all terms on the right-hand side, regardless of the parameters involved in the system.

5.4. Stability

In this section, we analyze the stability of the finite difference (FD) schemes. We begin by applying the UPP-FD scheme (Eq (5.8)) to the reaction-diffusion equation for $S(x, y, t)$ (Eq (2.5)). By subsequently linearizing this discretized equation and substituting the perturbation $\Phi(t)e^{\iota(\varpi_1 x+\varpi_2 y)}$ for $S^{n}_{\zeta_1, \zeta_2}$ , we derive the following stability condition:

$\begin{equation} \begin{split} \bigg|\frac{\Phi(t+\Delta t)}{\Phi(t)}\bigg| & = \bigg|\frac{1+4\lambda_1-8\lambda_1\sin^2(\varpi_1\frac{\Delta x}{2})}{1+4\lambda_1+\tau(\mu_{0}+\gamma_{3})} \bigg|\\ &\leq \frac{1+4\lambda_1}{1+4\lambda_1+\tau(\mu_{0}+\gamma_{3})} < 1. \end{split} \end{equation}$

(5.16)

Keep in mind that $\Delta x = \Delta y$ . Following a similar approach for $L^{n+1}_{\zeta_1, \zeta_2}$ , the result is obtained as:

$\begin{equation} \begin{split} \bigg|\frac{\Phi(t+\Delta t)}{\Phi(t)}\bigg| & = \bigg|\frac{1+4\lambda_2-8\lambda_2\sin^2(\varpi_1\frac{\Delta x}{2})}{1+4\lambda_2+\tau(\mu_{0}+\sigma)} \bigg|\\ &\leq \frac{1+4\lambda_2}{1+4\lambda_2+\tau(\mu_{0}+\sigma)} < 1. \end{split} \end{equation}$

(5.17)

Using a similar process for $I^{n+1}_{\zeta_1, \zeta_2}$ , we obtain,

$\begin{equation} \begin{split} \bigg|\frac{\Phi(t+\Delta t)}{\Phi(t)}\bigg| & = \bigg|\frac{1+4\lambda_3-8\lambda_3\sin^2(\varpi_1\frac{\Delta x}{2})}{1+4\lambda_3+\tau(\mu_{0}+\gamma_{1})} \bigg|\\ &\leq \frac{1+4\lambda_3}{1+4\lambda_3+\tau(\mu_{0}+\gamma_{1})} < 1. \end{split} \end{equation}$

(5.18)

In the same fashion, the procedure for $C^{n+1}_{\zeta_1, \zeta_2}$ , we obtain,

$\begin{equation} \begin{split} \bigg|\frac{\Phi(t+\Delta t)}{\Phi(t)}\bigg| & = \bigg|\frac{1+4\lambda_4-8\lambda_4\sin^2(\varpi_1\frac{\Delta x}{2})}{1+4\lambda_4-\tau\mu\omega\nu+\tau(\mu_{0}+\mu_{1}+\gamma_{2})} \bigg|\\ &\leq \frac{1+4\lambda_3}{1+4\lambda_4-\tau\mu\omega\nu+\tau(\mu_{0}+\mu_{1}+\gamma_{2})} < 1. \end{split} \end{equation}$

(5.19)

The analysis clearly demonstrates that the proposed UPP-FD scheme maintains stability under all conditions.

5.5. Consistency

The consistency of the UPP-FD scheme is evaluated using the Taylor series expansion. The expressions for $S^{n+1}_{\zeta_1, \zeta_2}$ , $S^{n}_{\zeta_1+1, \zeta_2}$ , $S^{n}_{\zeta_1-1, \zeta_2}$ , $S^{n}_{\zeta_1, \zeta_2+1}$ , and $S^{n}_{\zeta_1, \zeta_2-1}$ are derived through their Taylor series expansions.

$\begin{eqnarray} S^{n+1}_{\zeta_1,\zeta_2} & = & S^{n}_{\zeta_1,\zeta_2}+\tau\frac{\partial S}{\partial t}+\frac{\tau^2}{2!}\frac{\partial^2 S}{\partial t^2} +\frac{\tau^3}{3!}\frac{\partial^3 S}{\partial t^3}+\cdots, \end{eqnarray}$

(5.20)

$\begin{eqnarray} S^{n}_{\zeta_1+1,\zeta_2} & = & S^{n}_{\zeta_1,\zeta_2}+h\frac{\partial S}{\partial x}+\frac{h^2}{2!}\frac{\partial^2 S}{\partial x^2} +\frac{h^3}{3!}\frac{\partial^3 S}{\partial x^3}+\cdots, \end{eqnarray}$

(5.21)

$\begin{eqnarray} S^{n}_{\zeta_1-1,\zeta_2} & = & S^{n}_{\zeta_1,\zeta_2}-h\frac{\partial S}{\partial x}+\frac{h^2}{2!}\frac{\partial^2 S}{\partial x^2} -\frac{h^3}{3!}\frac{\partial^3 S}{\partial x^3}+\cdots, \end{eqnarray}$

(5.22)

$\begin{eqnarray} S^{n}_{\zeta_1,\zeta_2+1} & = & S^{n}_{\zeta_1,\zeta_2}+h\frac{\partial S}{\partial y}+\frac{h^2}{2!}\frac{\partial^2 S}{\partial y^2} +\frac{h^3}{3!}\frac{\partial^3 S}{\partial y^3}+\cdots, \end{eqnarray}$

(5.23)

$\begin{eqnarray} S^{n}_{\zeta_1,\zeta_2-1} & = & S^{n}_{\zeta_1,\zeta_2}-h\frac{\partial S}{\partial y}+\frac{h^2}{2!}\frac{\partial^2 S}{\partial y^2} -\frac{h^3}{3!}\frac{\partial^3 S}{\partial y^3}+\cdots. \end{eqnarray}$

(5.24)

Considering the UPP-FD scheme for Eq (5.8),

(5.25)

Substituting the values of $S^{n+1}_{\zeta_1, \zeta_2}$ , $S^{n}_{\zeta_1+1, \zeta_2}$ , $S^{n}_{\zeta_1-1, \zeta_2}$ , $S^{n}_{\zeta_1, \zeta_2+1}$ , and $S^{n}_{\zeta_1, \zeta_2-1}$ in the above equation and after simplification, we get

$\begin{equation*} \begin{split} &\left(\frac{\partial S}{\partial t}+\frac{\tau}{2!}\frac{\partial^2 S}{\partial t^2} +\frac{\tau^2}{3!}\frac{\partial^3 S}{\partial t^3}+\cdots\right) \left(1+4\frac{d_1\tau}{h^2}+\tau\mu_0+\tau\beta I^{n}_{\zeta_1,\zeta_2}+\tau\epsilon \beta C^{n}_{\zeta_1,\zeta_2}+\tau\gamma_{3}\right) \\ & = 2d_1\left(\frac{1}{2!}\frac{\partial^2 S}{\partial x^2}+\frac{h^2}{4!}\frac{\partial^4 S}{\partial x^4}+\cdots+\frac{1}{2!}\frac{\partial^2 S}{\partial y^2}+\frac{h^2}{4!}\frac{\partial^4 S}{\partial y^4}+\cdots\right)+\mu\omega(1-\nu C^{n}_{\zeta_1,\zeta_2})-(\mu_{0}+\beta I^{n}_{\zeta_1,\zeta_2}+\epsilon\beta C^{n}_{\zeta_1,\zeta_2}+\gamma_{3})S^{n}_{\zeta_1,\zeta_2} \end{split} \end{equation*}$

replace $\tau = h^3$ and $h\rightarrow 0$ , we have

$\frac{\partial S}{\partial t} = d_1\left(\frac{\partial^2 S}{\partial x^2}+\frac{\partial^2 S}{\partial y^2}\right)+\mu\omega(1-\nu C)-(\mu_{0}+\beta I+\epsilon\beta C+\gamma_{3})S.$

Similarly, the formulas for $L^{n+1}_{\zeta_1, \zeta_2}$ , $L^{n}_{\zeta_1+1, \zeta_2}$ , $L^{n}_{\zeta_1-1, \zeta_2}$ , $L^{n}_{\zeta_1, \zeta_2+1}$ , and $L^{n}_{\zeta_1, \zeta_2-1}$ are

$\begin{eqnarray} L^{n+1}_{\zeta_1,\zeta_2} & = & L^{n}_{\zeta_1,\zeta_2}+\tau\frac{\partial L}{\partial t}+\frac{\tau^2}{2!}\frac{\partial^2 L}{\partial t^2} +\frac{\tau^3}{3!}\frac{\partial^3 L}{\partial t^3}+\cdots, \end{eqnarray}$

(5.26)

$\begin{eqnarray} L^{n}_{\zeta_1+1,\zeta_2} & = & L^{n}_{\zeta_1,\zeta_2}+h\frac{\partial L}{\partial x}+\frac{h^2}{2!}\frac{\partial^2 L}{\partial x^2} +\frac{h^3}{3!}\frac{\partial^3 L}{\partial x^3}+\cdots, \end{eqnarray}$

(5.27)

$\begin{eqnarray} L^{n}_{\zeta_1-1,\zeta_2} & = & L^{n}_{\zeta_1,\zeta_2}-h\frac{\partial L}{\partial x}+\frac{h^2}{2!}\frac{\partial^2 L}{\partial x^2} -\frac{h^3}{3!}\frac{\partial^3 L}{\partial x^3}+\cdots, \end{eqnarray}$

(5.28)

$\begin{eqnarray} L^{n}_{\zeta_1,\zeta_2+1} & = & L^{n}_{\zeta_1,\zeta_2}+h\frac{\partial L}{\partial y}+\frac{h^2}{2!}\frac{\partial^2 L}{\partial y^2} +\frac{h^3}{3!}\frac{\partial^3 L}{\partial y^3}+\cdots, \end{eqnarray}$

(5.29)

$\begin{eqnarray} L^{n}_{\zeta_1,\zeta_2-1} & = & L^{n}_{\zeta_1,\zeta_2}-h\frac{\partial L}{\partial y}+\frac{h^2}{2!}\frac{\partial^2 L}{\partial y^2} -\frac{h^3}{3!}\frac{\partial^3 L}{\partial y^3}+\cdots. \end{eqnarray}$

(5.30)

Considering the UPP-FD scheme for Eq (5.10),

$\begin{equation} \begin{split} L^{n+1}_{\zeta_1,\zeta_2} & = L^{n}_{\zeta_1,\zeta_2}+\lambda_2(L^{n}_{\zeta_1-1,\zeta_2}+L^{n}_{\zeta_1+1,\zeta_2} +L^{n}_{\zeta_1,\zeta_2-1}+L^{n}_{\zeta_1,\zeta_2+1})-4\lambda_2L^{n+1}_{\zeta_1,\zeta_2} \\ &\tau(\beta I^{n}_{\zeta_1,\zeta_2}+\epsilon\beta C^{n}_{\zeta_1,\zeta_2})S^{n}_{\zeta_1,\zeta_2}-\tau(\mu_{0}+\sigma)L^{n+1}_{\zeta_1,\zeta_2}. \end{split} \end{equation}$

(5.31)

Substituting the values of $L^{n+1}_{\zeta_1, \zeta_2}$ , $L^{n}_{\zeta_1+1, \zeta_2}$ , $L^{n}_{\zeta_1-1, \zeta_2}$ , $L^{n}_{\zeta_1, \zeta_2+1}$ , and $L^{n}_{\zeta_1, \zeta_2-1}$ in the above equation and after simplification, we get

$\begin{equation*} \begin{split} &\left(\frac{\partial L}{\partial t}+\frac{\tau}{2!}\frac{\partial^2 L}{\partial t^2} +\frac{\tau^2}{3!}\frac{\partial^3 L}{\partial t^3}+\cdots\right) \left(1+4\frac{d_2\tau}{h^2}+\tau(\mu_{0}+\sigma)\right) \\ & = 2d_2\left(\frac{1}{2!}\frac{\partial^2 L}{\partial x^2}+\frac{h^2}{4!}\frac{\partial^4 L}{\partial x^4}+\cdots+\frac{1}{2!}\frac{\partial^2 L}{\partial y^2}+\frac{h^2}{4!}\frac{\partial^4 L}{\partial y^4}+\cdots\right)+(\beta I^{n}_{\zeta_1,\zeta_2}+\epsilon\beta C^{n}_{\zeta_1,\zeta_2})S^{n}_{\zeta_1,\zeta_2}-(\mu_{0}+\sigma)L^{n}_{\zeta_1,\zeta_2} \end{split} \end{equation*}$

replace $\tau = h^3$ and $h\rightarrow 0$ , we have

$\frac{\partial L}{\partial t} = d_2\left(\frac{\partial^2 L}{\partial x^2}+\frac{\partial^2 L}{\partial y^2}\right)+(\beta I+\epsilon\beta C)S-(\mu_{0}+\sigma)L.$

Similarly, the formulas for $I^{n+1}_{\zeta_1, \zeta_2}$ , $I^{n}_{\zeta_1+1, \zeta_2}$ , $I^{n}_{\zeta_1-1, \zeta_2}$ , $I^{n}_{\zeta_1, \zeta_2+1}$ , and $I^{n}_{\zeta_1, \zeta_2-1}$ are

$\begin{eqnarray} I^{n+1}_{\zeta_1,\zeta_2} & = & I^{n}_{\zeta_1,\zeta_2}+\tau\frac{\partial I}{\partial t}+\frac{\tau^2}{2!}\frac{\partial^2 I}{\partial t^2} +\frac{\tau^3}{3!}\frac{\partial^3 I}{\partial t^3}+\cdots, \end{eqnarray}$

(5.32)

$\begin{eqnarray} I^{n}_{\zeta_1+1,\zeta_2} & = & I^{n}_{\zeta_1,\zeta_2}+h\frac{\partial I}{\partial x}+\frac{h^2}{2!}\frac{\partial^2 I}{\partial x^2} +\frac{h^3}{3!}\frac{\partial^3 I}{\partial x^3}+\cdots, \end{eqnarray}$

(5.33)

$\begin{eqnarray} I^{n}_{\zeta_1-1,\zeta_2} & = & I^{n}_{\zeta_1,\zeta_2}-h\frac{\partial I}{\partial x}+\frac{h^2}{2!}\frac{\partial^2 I}{\partial x^2} -\frac{h^3}{3!}\frac{\partial^3 I}{\partial x^3}+\cdots, \end{eqnarray}$

(5.34)

$\begin{eqnarray} I^{n}_{\zeta_1,\zeta_2+1} & = & I^{n}_{\zeta_1,\zeta_2}+h\frac{\partial I}{\partial y}+\frac{h^2}{2!}\frac{\partial^2 I}{\partial y^2} +\frac{h^3}{3!}\frac{\partial^3 I}{\partial y^3}+\cdots, \end{eqnarray}$

(5.35)

$\begin{eqnarray} I^{n}_{\zeta_1,\zeta_2-1} & = & I^{n}_{\zeta_1,\zeta_2}-h\frac{\partial I}{\partial y}+\frac{h^2}{2!}\frac{\partial^2 I}{\partial y^2} -\frac{h^3}{3!}\frac{\partial^3 I}{\partial y^3}+\cdots. \end{eqnarray}$

(5.36)

Considering the UPP-FD scheme for Eq (5.12)

(5.37)

Substituting the values of $I^{n+1}_{\zeta_1, \zeta_2}$ , $I^{n}_{\zeta_1+1, \zeta_2}$ , $I^{n}_{\zeta_1-1, \zeta_2}$ , $I^{n}_{\zeta_1, \zeta_2+1}$ , and $I^{n}_{\zeta_1, \zeta_2-1}$ in the above equation and after simplification, we get

$\begin{equation*} \begin{split} &\left(\frac{\partial I}{\partial t}+\frac{\tau}{2!}\frac{\partial^2 I}{\partial t^2} +\frac{\tau^2}{3!}\frac{\partial^3 I}{\partial t^3}+\cdots\right) \left(1+4\frac{d_3\tau}{h^2}+\tau(\mu_{0}+\gamma_{1})\right) \\ & = 2d_3\left(\frac{1}{2!}\frac{\partial^2 I}{\partial x^2}+\frac{h^2}{4!}\frac{\partial^4 I}{\partial x^4}+\cdots+\frac{1}{2!}\frac{\partial^2 I}{\partial y^2}+\frac{h^2}{4!}\frac{\partial^4 I}{\partial y^4}+\cdots\right)+\sigma L^{n}_{\zeta_1,\zeta_2}-(\mu_{0}+\gamma_{1})I^{n}_{\zeta_1,\zeta_2}. \end{split} \end{equation*}$

replace $\tau = h^3$ and $h\rightarrow 0$ , we have

$\frac{\partial I}{\partial t} = d_3\left(\frac{\partial^2 I}{\partial x^2}+\frac{\partial^2 I}{\partial y^2}\right)+\sigma L-(\mu_{0}+\gamma_{1})I.$

Similarly, the formulas for $C^{n+1}_{\zeta_1, \zeta_2}$ , $C^{n}_{\zeta_1+1, \zeta_2}$ , $C^{n}_{\zeta_1-1, \zeta_2}$ , $C^{n}_{\zeta_1, \zeta_2+1}$ , and $C^{n}_{\zeta_1, \zeta_2-1}$ are

$\begin{eqnarray} C^{n+1}_{\zeta_1,\zeta_2} & = & C^{n}_{\zeta_1,\zeta_2}+\tau\frac{\partial C}{\partial t}+\frac{\tau^2}{2!}\frac{\partial^2 C}{\partial t^2} +\frac{\tau^3}{3!}\frac{\partial^3 C}{\partial t^3}+\cdots, \end{eqnarray}$

(5.38)

$\begin{eqnarray} C^{n}_{\zeta_1+1,\zeta_2} & = & C^{n}_{\zeta_1,\zeta_2}+h\frac{\partial C}{\partial x}+\frac{h^2}{2!}\frac{\partial^2 C}{\partial x^2} +\frac{h^3}{3!}\frac{\partial^3 C}{\partial x^3}+\cdots, \end{eqnarray}$

(5.39)

$\begin{eqnarray} C^{n}_{\zeta_1-1,\zeta_2} & = & C^{n}_{\zeta_1,\zeta_2}-h\frac{\partial C}{\partial x}+\frac{h^2}{2!}\frac{\partial^2 C}{\partial x^2} -\frac{h^3}{3!}\frac{\partial^3 C}{\partial x^3}+\cdots, \end{eqnarray}$

(5.40)

$\begin{eqnarray} C^{n}_{\zeta_1,\zeta_2+1} & = & C^{n}_{\zeta_1,\zeta_2}+h\frac{\partial C}{\partial y}+\frac{h^2}{2!}\frac{\partial^2 C}{\partial y^2} +\frac{h^3}{3!}\frac{\partial^3 C}{\partial y^3}+\cdots, \end{eqnarray}$

(5.41)

$\begin{eqnarray} C^{n}_{\zeta_1,\zeta_2-1} & = & C^{n}_{\zeta_1,\zeta_2}-h\frac{\partial C}{\partial y}+\frac{h^2}{2!}\frac{\partial^2 C}{\partial y^2} -\frac{h^3}{3!}\frac{\partial^3 C}{\partial y^3}+\cdots. \end{eqnarray}$

(5.42)

Considering the UPP-FD scheme for Eq (5.14),

(5.43)

Substituting the values of $C^{n+1}_{\zeta_1, \zeta_2}$ , $C^{n}_{\zeta_1+1, \zeta_2}$ , $C^{n}_{\zeta_1-1, \zeta_2}$ , $C^{n}_{\zeta_1, \zeta_2+1}$ , and $C^{n}_{\zeta_1, \zeta_2-1}$ in the above equation and after simplification, we get

$\begin{equation*} \begin{split} &\left(\frac{\partial C}{\partial t}+\frac{\tau}{2!}\frac{\partial^2 C}{\partial t^2} +\frac{\tau^2}{3!}\frac{\partial^3 C}{\partial t^3}+\cdots\right) \left(1+4\frac{d_4\tau}{h^2}-\tau\mu\omega\nu+\tau(\mu_{0}+\mu_{1}+\gamma_{2})\right) \\ & = 2d_4\left(\frac{1}{2!}\frac{\partial^2 C}{\partial x^2}+\frac{h^2}{4!}\frac{\partial^4 C}{\partial x^4}+\cdots+\frac{1}{2!}\frac{\partial^2 c}{\partial y^2}+\frac{h^2}{4!}\frac{\partial^4 C}{\partial y^4}+\cdots\right)+\mu\omega\nu C^{n}_{\zeta_1,\zeta_2}+q\gamma_{1}I^{n}_{\zeta_1,\zeta_2}-(\mu_{0}+\mu_{1}+\gamma_{2})C^{n}_{\zeta_1,\zeta_2}. \end{split} \end{equation*}$

replace $\tau = h^3$ and $h\rightarrow 0$ , we have

$\frac{\partial C}{\partial t} = d_3\left(\frac{\partial^2 C}{\partial x^2}+\frac{\partial^2 C}{\partial y^2}\right)+\mu\omega\nu C+ q\gamma_{1}I-(\mu_{0}+\mu_{1}+\gamma_{2})C.$

A similar methodology can be utilized to analyze the consistency of the well-established classical forward Euler finite difference scheme.

5.6. Numerical results

The nonlinearity and spatial variability in model (2.5) make deriving exact analytical solutions under arbitrary initial conditions highly challenging. Consequently, numerical methods are employed to approximate solutions. Various established techniques are commonly used for solving partial differential equations (PDEs) in epidemiological models. These include methods such as the Fourier Spectral Method (FSM), the Non-Standard Finite Difference Method (NSFDM), and the Finite Element Method (FEM), among others. A detailed discussion of these approaches and their applications can be found in related literature, including ^[31]. An ideal numerical method for solving PDEs should strike a balance between accuracy, computational efficiency, adaptability to complex geometries, and ease of implementation. However, no single method excels in all these aspects. For instance, NSFDM provides a higher degree of flexibility and accuracy for certain cases but can introduce complexity and stability challenges, as well as increased computational demands ^[30]. Similarly, FEM is recognized for its adaptability to irregular geometries, but it requires intensive meshing efforts, particularly for intricate domains. Spectral methods, while highly accurate, are constrained by their reliance on periodic boundary conditions and are generally more suitable for problems with simple geometries ^[32]. Considering these trade-offs, we adopt the Crank-Nicolson operator splitting method alongside the Unconditionally Positivity Preserving method to numerically solve the PDEs ^[33,34]. These methods are chosen for their ability to maintain a balance between precision, stability, and computational efficiency.

The Crank-Nicolson strategy is known for its second order accuracy in both instances. By consolidating operator splitting, the strategy decouples the complex PDE framework into less complex subproblems, which are more straightforward to address while keeping up with solidness and precision. This technique can manage a more extensive scope of limit conditions and calculations contrasted with Fourier spectral strategies, which are confined to occasional circumstances. It additionally requires meshing contrasted with FEM, making it computationally more effective in complex geometries. Similarly, one of the vital benefits of the Unconditionally Positivity Preserving strategy is its capacity to save the non-pessimism of the arrangement, which is pivotal in epidemiological models where negative qualities are not truly significant. This technique stays stable no matter what the time step size, taking into account bigger time ventures without forfeiting exactness or presenting hazards. This is a huge improvement over conventional strategies like FDM, which might demand modest moves toward keeping up with solidness. the Unconditionally Positivity Preserving method is simpler to implement, particularly for problems with irregular geometries.

In this section, the CNOS-FD and UPP-FD methods are applied to compute numerical solutions for the model described in Eq (2.5). The numerical simulations were performed using MATLAB R2023a, a popular tool for computational analysis and simulations. The simulations utilized a spatial step size of $h = 0.1$ and a time step size of $dt = 0.005$ , ensuring compliance with the Von Neumann stability criterion. The diffusivity constants used in all cases are $d_1 = 0.3$ , $d_2 = 0.1$ , $d_3 = 0.5$ , and $d_4 = 0.01$ , where $d_1, d_2, d_3,$ and $d_4$ correspond to the diffusion coefficients for $S(x, y, t)$ , $L(x, y, t)$ , $I(x, y, t)$ , and $C(x, y, t)$ , respectively. The model parameters used in the numerical simulations are $q = 0.7$ , $\beta = 0.0091$ , $\mu = 0.0121$ , $\mu_1 = 0.01$ , $\mu_0 = 0.0693$ , $\omega = 0.85$ , $v = 0.46$ , $\epsilon = 0.02$ , $\gamma_1 = 0.03$ , $\gamma_2 = 0.02$ , $\gamma_3 = 0.01$ , and $\sigma = 0.04$ . The spatial and temporal domains are defined as $X_{\text{min}} = 0$ , $X_{\text{max}} = 10$ , $Y_{\text{min}} = 0$ , $Y_{\text{max}} = 10$ , and $T_{\text{max}} = 30$ . The discretization parameters are given as $h = 0.1$ , $N_x = \frac{X_{\text{max}} - X_{\text{min}}}{h} + 1 = 101$ , $N_y = \frac{Y_{\text{max}} - Y_{\text{min}}}{h} + 1 = 101$ , $\Delta t = 0.005$ , and $M = \frac{T_{\text{max}}}{\Delta t} + 1 = 6001$ . The spatial grid points are $x = \text{linspace}(X_{\text{min}}, X_{\text{max}}, N_x)$ and $y = \text{linspace}(Y_{\text{min}}, Y_{\text{max}}, N_y)$ . These values define the spatial and temporal resolution of the simulation grid, as well as the model parameters used in the reaction-diffusion equations. The initial conditions $S(x, y, 0) = 5 \cdot \left(1 + 0.5 \cdot \sin\left(\frac{\pi x}{5}\right)\right) \cdot \left(1 + 0.5 \cdot \cos\left(\frac{\pi y}{5}\right)\right)$ , $L(x, y, 0) = 3 \cdot \left(1 + 0.5 \cdot \cos\left(\frac{\pi x}{5}\right)\right) \cdot \left(1 + 0.5 \cdot \sin\left(\frac{\pi y}{5}\right)\right)$ , $I(x, y, 0) = 20 \cdot \sin\left(\frac{\pi x}{10}\right) \cdot \cos\left(\frac{\pi y}{10}\right)$ , and $C(x, y, 0) = 0.5 \cdot \cos\left(\frac{\pi x}{10}\right) \cdot \sin\left(\frac{\pi y}{10}\right)$ are considered. Homogeneous Neumann boundary conditions ( $\frac{\partial S}{\partial n} = \frac{\partial L}{\partial n} = \frac{\partial I}{\partial n} = \frac{\partial C}{\partial n} = 0$ ) are applied, ensuring no flux across the boundaries. Simulation results depicting the distribution of acutely infected individuals in one, two, and three spatial dimensions, with and without the incorporation of spatial diffusion, are shown in Figures 1–5.

Figure 1. Simulation results depicting the distribution of susceptible individuals in one, two, and three spatial dimensions, with and without the incorporation of spatial diffusion. Subfigures (a), (c), and (e) correspond to the scenarios including diffusion, while subfigures (b), (d), and (f) illustrate the results without diffusion.

DownLoad: Full-Size Img PowerPoint

Figure 2. Simulation results depicting the distribution of latent individuals in one, two, and three spatial dimensions, with and without the incorporation of spatial diffusion. Subfigures (a), (c), and (e) correspond to the scenarios including diffusion, while subfigures (b), (d), and (f) illustrate the results without diffusion.

DownLoad: Full-Size Img PowerPoint

Figure 3. Simulation results depicting the distribution of acutely infected individuals in one, two, and three spatial dimensions, with and without the incorporation of spatial diffusion. Subfigures (a), (c), and (e) correspond to the scenarios including diffusion, while subfigures (b), (d), and (f) illustrate the results without diffusion.

DownLoad: Full-Size Img PowerPoint

Figure 4. Simulation results depicting the distribution of chronically infected individuals in one, two, and three spatial dimensions, with and without the incorporation of spatial diffusion. Subfigures (a), (c), and (e) correspond to the scenarios including diffusion, while subfigures (b), (d), and (f) illustrate the results without diffusion.

DownLoad: Full-Size Img PowerPoint

Figure 5. Simulation results illustrating the distribution of individuals in each compartment (Susceptible, Latent, Infected, Chronic) across three spatial dimensions. These distributions include the effects of spatial diffusion, modeled using the Unconditionally Positivity Preserving (UPP) method.

DownLoad: Full-Size Img PowerPoint

Figure 1 presents the simulation results illustrating the dynamics of the susceptible population in scenarios incorporating spatial diffusion (Figure 1a, c and e) and those without diffusion (Figure 1b, d and f) across one, two, and three spatial dimensions. In the case of diffusion (1a), the susceptible population exhibits a slower decline over time, as diffusion facilitates the spatial redistribution of individuals, resulting in a more gradual exposure to infection based on proximity to infected individuals. Conversely, the absence of diffusion (1b) leads to a faster reduction in the susceptible population, characteristic of a well-mixed population where exposure occurs uniformly. In two dimensions, diffusion (1c) enables a uniform spatial spread of susceptibles, reducing high-density areas and mitigating localized outbreaks through smoother population distribution. Without diffusion (1d), hotspots of high susceptibility persist, highlighting spatial heterogeneity and an increased risk of concentrated outbreaks. Similarly, in three dimensions, diffusion (1e) promotes homogeneity in the distribution of susceptibles, as evidenced by smoother surface plots, while the absence of diffusion (1f) results in pronounced peaks and troughs, indicative of localized population clusters prone to outbreaks. These findings underscore the critical role of diffusion in representing real-world movement, such as migration or urbanization, which mitigates the risk of localized epidemics by evening out infection exposure across regions. In contrast, scenarios without diffusion demonstrate the heightened vulnerability of a static, heterogeneous population to rapid and concentrated outbreaks.

Figure 2 illustrates the spatial and temporal dynamics of the latent population under scenarios with diffusion (Figure 2a, c and e) and without diffusion (Figure 2b, d and f) across one, two, and three spatial dimensions. In the case of diffusion (2a), the latent population exhibits a slower decline over time, as diffusion enables individuals to migrate from high-transmission areas or regions with intense infection pressures, thereby slowing the transition to the infectious stage. In contrast, the absence of diffusion (2b) results in a more rapid reduction of the latent population due to the concentration of individuals in high-risk areas, leading to quicker progression to infection or recovery. In two dimensions, diffusion (2c) enables a smoother spatial distribution of latent individuals, reducing the risk of localized clusters that could exacerbate transmission rates. Without diffusion (2d), hotspots of latent population density emerge, increasing the potential for localized outbreaks and rapid disease progression. Similarly, in three dimensions, diffusion (2e) promotes uniformity in the latent population distribution, reflected by smoother surface plots that highlight reduced spatial gradients. Conversely, the absence of diffusion (2f) leads to pronounced peaks and troughs, representing significant spatial heterogeneity with concentrated latent populations in certain areas. These results emphasize the critical role of diffusion in real-world scenarios, where movement through migration or travel spreads latent carriers more evenly across regions, mitigating localized risks and delaying the progression to acute infection in high-density areas.

Figure 3 illustrates the dynamics of the acutely infected population under conditions with diffusion (Figure 3a, c and e) and without diffusion (Figure 3b, d and f) across one, two, and three spatial dimensions. With diffusion (3a), the acutely infected population exhibits a slower peak and decline, as the spatial redistribution disperses infected individuals across the domain, reducing concentrated hotspots and delaying progression or recovery. In the absence of diffusion (3b), the infected population peaks sharply and declines faster, indicating localized clustering of infected individuals, which intensifies transmission and quickly depletes the compartment as individuals progress or recover. In two dimensions, diffusion (3c) leads to a smoother spatial spread of acutely infected individuals, highlighting the homogenizing effect of movement in reducing sharp variations and hotspots. Without diffusion (3d), high-density clusters emerge, increasing the risk of localized outbreaks and overburdening regional resources. Similarly, in three dimensions, diffusion (3e) ensures a uniform spatial distribution of acutely infected individuals, as reflected in smoother surface plots with smaller peaks, thereby preventing extreme local concentrations. Conversely, the absence of diffusion (3f) results in pronounced peaks and troughs, indicating significant spatial heterogeneity and highly localized infection clusters. These findings emphasize that spatial diffusion, representing real-world movement such as migration or travel, disperses infectious individuals more evenly, reducing localized transmission intensity and delaying the epidemic's progression. Without diffusion, infected individuals remain trapped in high-density regions, amplifying transmission rates, exacerbating localized outbreaks, and straining healthcare resources.

Figure 4 illustrates the dynamics of chronically infected individuals with diffusion (Figure 4a, c and e) and without diffusion (Figure 4b, d and f) across one, two, and three spatial dimensions. When diffusion is included (4a), the chronically infected population shows a gradual rise and fall, indicating slower accumulation and depletion as spatial movement prevents clustering and reduces the intensity of transmission hotspots. In contrast, the absence of diffusion (4b) results in sharper peaks and faster declines, as chronically infected individuals remain concentrated in specific regions, leading to rapid disease progression and higher localized burdens. In two dimensions, diffusion (4c) leads to a smoother and more uniform spatial distribution of chronically infected individuals, mitigating the formation of high-density clusters. Without diffusion (4d), hotspots emerge, with distinct regions of high chronic infection densities that increase the risk of long-term health complications. Similarly, in three dimensions, diffusion (4e) promotes a more balanced spatial distribution, reflected in surface plots with smaller peaks and smoother gradients. Conversely, the absence of diffusion (4f) produces sharp peaks and significant spatial heterogeneity, highlighting areas of concentrated chronic infections and potential localized healthcare burdens. These findings underscore the critical role of diffusion in redistributing chronically infected individuals across the spatial domain, reducing the risk of localized overburdening of healthcare resources and the perpetuation of disease transmission. Without diffusion, chronic cases remain confined to high-density regions, exacerbating long-term health burdens and straining regional healthcare systems. This analysis highlights the importance of incorporating spatial diffusion in models to better understand chronic infection dynamics and inform targeted public health interventions.

6. Conclusions

The study of traveling wave solutions within the framework of nonlinear reaction-diffusion equations provides valuable insights into the modeling of diverse physical and biological processes. In the context of HBV infection, while the spatial distribution of uninfected host cells and infected hepatocytes remains largely stationary, the diffusion of viral particles and therapeutic agents plays a critical role in disease dynamics. This observation inspired the formulation of a diffusion-based model to better understand the mechanisms governing HBV transmission and its treatment. Through a comprehensive analysis grounded in the theory of monotone dynamical systems, we rigorously investigate the existence of traveling wave fronts in reaction-diffusion systems. Our findings indicate that the traveling wave front in the modeled system, under specific initial conditions, represents the action of therapeutic interventions, culminating in the eventual elimination of HBV. These solutions illustrate a dynamic transition, characterized by specific wave velocities, from a persistent infection state to one of eradication. This transition captures the gradual replacement of infection by therapeutic effects, effectively linking equilibrium states over temporal and spatial domains. The determination of the basic reproduction number through the next-generation matrix method offers a crucial metric for understanding the conditions under which the disease can invade or persist in a population. We identify the disease-free and endemic equilibria, demonstrating their stability under specific parameter conditions. This emphasizes the importance of chronic infections, given their role in severe long-term disabilities, such as cirrhosis and hepatocellular carcinoma, and the associated societal and healthcare burdens. The numerical simulations, performed using advanced techniques like the Crank-Nicolson scheme and positivity-preserving methods, validate the theoretical findings and provide actionable insights. These simulations underscore the importance of spatial considerations and the effectiveness of intervention strategies, such as vaccination and treatment, in curbing HBV transmission. This research not only advances the understanding of HBV dynamics but also serves as a critical tool for public health planning, offering valuable guidance for designing targeted interventions, and optimizing resource allocation. Researchers could expand upon this work by integrating additional real-world complexities, such as heterogeneity in host immunity and varying healthcare access, to further refine the model's applicability and precision.

Author contributions

Kamel Guedri: Writing – original draft, Formal analysis, Data curation; Rahat Zarin: Writing – original draft, Software, Investigation; Ashfaq Khan: Formal analysis, Writing – original draft, Resources; Amir Khan: Conceptualization, Writing – review & editing, Supervision; Basim M. Makhdoum: Funding acquisition, Project administration, Validation; Hatoon A. Niyazi: Writing – review & editing, Methodology, Visualization. All authors have read and approved the final version of the manuscript for publication.

Use of Generative-AI tools declaration

The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.

Acknowledgments

The authors extend their appreciation to the King Salman center For Disability Research for funding this work through Research Group no KSRG-2024-200.

Funding

The authors extend their appreciation to the King Salman center For Disability Research for funding this work through Research Group no KSRG-2024-200.

Conflict of interest

All authors declare no conflicts of interest in this paper.

References

[1]	H. B. Mcmahan, E. Moore, D. Ramage, B. A. y Arcas, Federated learning of deep networks using model averaging, arXiv: 1602.05629.
[2]	T. Li, A. Sahu, A. Talwalkar, V. Smith, Federated learning: challenges, methods, and future directions, IEEE Signal Proc. Mag., 37 (2020), 50–60. https://doi.org/10.1109/MSP.2020.2975749 doi: 10.1109/MSP.2020.2975749
[3]	D. Li, J. Wang, FedMD: heterogenous federated learning via model distillation, arXiv: 1910.03581.
[4]	T. Nishio, R. Yonetani, Client selection for federated learning with heterogeneous resources in mobile edge, 2019 IEEE International Conference on Communications (ICC), Shanghai, China, 2019, 1–7. https://doi.org/10.1109/ICC.2019.8761315
[5]	L. Liu, F. Zheng, H. Chen, G. J. Qi, H. Huang, L. Shao, A Bayesian federated learning framework with online Laplace approximation, arXiv: 2102.01936.
[6]	B. Mcmahan, E. Moore, D. Ramage, S. Hampson, B. A. y Arcas, Communication-efficient learning of deep networks from decentralized data, In: Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, New York: PMLR, 2017, 1273–1282.
[7]	B. Wu, X. Dai, P. Zhang, Y. Wang, F. Sun, Y. Wu, FBNet: hardware-aware efficient convnet design via differentiable neural architecture search, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 2019, 10726–10734. https://doi.org/10.1109/CVPR.2019.01099
[8]	C. He, M. Annavaram, S. Avestimehr, Fednas: federated deep learning via neural architecture search, arXiv: 2004.08546.
[9]	T. Shen, J. Zhang, X. Jia, F. Zhang, G. Huang, P. Zhou, et al., Federated mutual learning, arXiv: 2006.16765.
[10]	C. Xie, S. Koyejo, I. Gupta, Asynchronous federated optimization, arXiv: 1903.03934.
[11]	W. Wu, L. He, W. Lin, R. Mao, C. Maple, S. Jarvis, SAFA: a semi-asynchronous protocol for fast federated learning with low overhead, IEEE T. Comput., 70 (2021), 655–668. https://doi.org/10.1109/TC.2020.2994391 doi: 10.1109/TC.2020.2994391
[12]	Y. Zhang, Y. Xu, S. Wei, Y. Wang, Y. Li, X. Shang, Doubly contrastive representation learning for federated image recognition, Pattern Recogn., 139 (2023), 109507. https://doi.org/10.1016/j.patcog.2023.109507 doi: 10.1016/j.patcog.2023.109507
[13]	J. Xiao, C. Du, Z. Duan, W. Guo, A novel server-side aggregation strategy for federated learning in Non-IID situations, 2021 20th International Symposium on Parallel and Distributed Computing (ISPDC), Cluj-Napoca, Romania, 2021, 17–24.
[14]	L. Hu, H. Yan, L. Li, Z. Pan, X. Liu, Z. Zhang, MHAT: an efficient model-heterogenous aggregation training scheme for federated learning, Inform. Sciences, 560 (2021), 493–503. https://doi.org/10.1016/j.ins.2021.01.046 doi: 10.1016/j.ins.2021.01.046
[15]	T. Li, A. Sahu, M. Zaheer, M. Sanjabi, A. Talwalkar, V. Smith, Federated optimization in heterogeneous networks, Proceedings of Machine Learning and Systems, 2 (2020), 429–450.
[16]	Q. Li, B. He, D. Song, Model-contrastive federated learning, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 2021, 10708–10717. https://doi.org/10.1109/CVPR46437.2021.01057
[17]	M. Mendieta, T. Yang, P. Wang, M. Lee, Z. Ding, C. Chen, Local learning matters: rethinking data heterogeneity in federated learning, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA, 2022, 8397–8406. https://doi.org/10.1109/cvpr52688.2022.00821
[18]	M. Al-Shedivat, J. Gillenwater, E. Xing, A. Rostamizadeh, Federated learning via posterior averaging: a new perspective and practical algorithms, arXiv: 2010.05273.
[19]	H. Chang, V. Shejwalkar, R. Shokri, A. Houmansadr, Cronus: robust and heterogeneous collaborative learning with black-box knowledge transfer, arXiv: 1912.11279.
[20]	Y. Zhang, T. Xiang, T. Hospedales, H. Lu, Deep mutual learning, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 2018, 4320–4328. https://doi.org/10.1109/CVPR.2018.00454
[21]	C. Blundell, J. Cornebise, K. Kavukcuoglu, D. Wierstra, Weight uncertainty in neural network, The 32nd International Conference on Machine Learning (ICML), Lille, France, 2015, 1613–1622.
[22]	K. Shridhar, F. Laumann, M. Liwicki, A comprehensive guide to bayesian convolutional neural network with variational inference, arXiv: 1901.02731.
[23]	A. Wilson, P. Izmailov, Bayesian deep learning and a probabilistic perspective of generalization, The 34th Conference on Neural Information Processing Systems (NeurIPS), Vancouver, Canada, 2020, 4697–4708. https://doi.org/10.5555/3495724.3496118
[24]	O. Goldreich, S. Micali, A. Wigderson, How to play any mental game, or a completeness theorem for protocols with honest majority, In: Providing sound foundations for cryptography: on the work of shafi goldwasser and silvio micali, New York: Association for Computing Machinery, 2019,307–328. https://doi.org/10.1145/3335741.3335755
[25]	L. T. Phong, Y. Aono, T. Hayashi, L. Wang, S. Moriai, Privacy-preserving deep learning via additively homomorphic encryption, IEEE T. Inf. Foren. Sec., 13 (2018), 1333–1345. https://doi.org/10.1109/TIFS.2017.2787987 doi: 10.1109/TIFS.2017.2787987
[26]	R. Geyer, T. Klein, M. Nabi, Differentially private federated learning: a client level perspective, arXiv: 1712.07557.
[27]	P. Kairouz, H. McMahan, B. Avent, A. Bellet, M. Bennis, A. N. Bhagoji, et al., Advances and open problems in federated learning, Found. Trends Mach. Le., 14 (2021), 1–210. https://doi.org/10.1561/2200000083 doi: 10.1561/2200000083
[28]	Q. Yang, Y. Liu, T. Chen, Y. Tong, Federated machine learning: concept and applications, ACM T. Intel. Syst. Tec., 10 (2019), 12. https://doi.org/10.1145/3298981 doi: 10.1145/3298981
[29]	Y Lecun, L. Bottou, Y. Bengio, P. Haffner, Gradient-based learning applied to document recognition, P. IEEE, 86 (1998), 2278–2324. https://doi.org/10.1109/5.726791 doi: 10.1109/5.726791
[30]	A. Krizhevsky, G. Hinton, Learning multiple layers of features from tiny images, Technical Report TR-2009, University of Toronto, Toronto, 2009.
[31]	Y. Lecun, B. Boser, J. S. Denker, R. E. Howard, W. Habbard, L. D. Jackel, et al., Handwritten digit recognition with a back-propagation network, In: Advances in Neural Information Processing systems 2, San Francisco: Morgan Kaufmann Publishers Inc., 1989,396–404. https://doi.org/10.5555/109230.109279
[32]	A. Ashukha, A. Lyzhov, D. Molchanov, D. Vetrov, Pitfalls of in-domain uncertainty estimation and ensembling in deep learning, arXiv: 2002.06470.
[33]	M. Yurochkin, M. Agarwal, S. Ghosh, K. Greenewald, N. Hoang, Y. Khazaeni, Bayesian nonparametric federated learning of neural networks, The 36th International Conference on Machine Learning, Long Beach, California, USA, 2019, 7252–7261.
[34]	T. H. Hsu, H. Qi, M. Brown, Measuring the effects of non-identical data distribution for federated visual classification, arXiv: 1909.06335.

Reader Comments

Your name:*

Email:*
© 2023 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)