Atrial fibrillation (AF) is a common cardiovascular disease characterized by tachyarrhythmia, which can lead to increased rates of stroke, heart failure, and other cardiovascular diseases [1]. Therefore, early detection of AF episodes is crucial so that patients can receive treatment as early as possible.
According to the duration of episodes, AF can be divided into three grades: (1) paroxysmal AF (AFp), (2) persistent AF (AFf), and (3) permanent AF. AFp is characterized by short episodes, a high recurrence rate, and difficulty of capture on the electrocardiogram (ECG) [2]. As the patient's disease course lengthens, AFp gradually evolves into AFf or permanent AF, threatening the patient's life [3]. Therefore, detecting AFp events is very significant for patients to receive effective treatment in time. The ECG is a convenient, rapid and non-invasive detection method that is widely used in clinical practice [4,5]. An AF episode is generally associated with irregular RR intervals when atrioventricular conduction is not impaired [6]. In addition, the absence of distinct repeating P waves and the presence of f waves help to identify the AF rhythm [6]. However, for AFp detection, the conventional 12-lead ECG has great limitations, whereas dynamic ECG offers a higher detection rate and accuracy (Acc) [7].
At present, many automatic algorithms have been proposed for AFp detection based on dynamic ECG. Liu et al. [8] proposed a detection method for AFp based on a covariance descriptor and kernel sparse coding. Petrenas et al. [9] designed a detector consisting of RR interval irregularity characterization, P-wave absence, f-wave presence, and noise level, which are combined by fuzzy logic classification to detect brief AF episodes of AFp. Ganapathy et al. [10] proposed an automatic detection algorithm for AFp based on co-occurrence matrices computed from RR intervals. Xin et al. [11] applied multi-scale wavelet analysis to extract features of heart rate variability (HRV) and then completed AFp detection with an SVM classifier.
In addition to accurate detection of AFp, precise location of the starting and ending points of AF episodes is also crucial for effective AFp management, as it provides more diagnostic and treatment information for doctors to implement appropriate medical intervention [12]. In particular, the AF burden associated with stroke risk can be defined and measured better over a long-term AF monitoring period than from device-detected AF duration as measured by cardiac implantable electronic devices (CIEDs) [13]. Previous algorithms aim at the detection of AFp but fail to locate the starting and ending points of AF episodes within AFp records. Recently, the 4th China Physiological Signal Challenge 2021 (CPSC 2021) [14] provided two large public databases and an undisclosed test database for participants to implement algorithms for AFp event detection. We take this opportunity to propose an efficient two-step method that meets the challenge rules for locating the starting and ending points of AF episodes in AFp rhythms. In the first step, we divide all records of unequal length into (1) non-atrial fibrillation (non-AF, N) rhythm, (2) AFp rhythm and (3) AFf rhythm based on machine learning. In the second step, we locate the starting and ending points of AF episodes in the AFp rhythms identified in the first step based on a convolutional neural network (CNN).
Two-step detection allows us to select the most appropriate method for each step. In this way, traditional feature extraction is combined with deep learning to maximize the use of the information available in each step. In the first step, we directly extract artificial features from unequal-length ECG records based on RR intervals, which reflect ventricular activation, thereby avoiding the need for fixed-length records during sliding-window feature extraction. In the second step, a CNN judges whether a single target beat belongs to an AF rhythm or not, which is essentially a binary heartbeat classification problem. Besides, because a large number of heartbeat segments are available from the public CPSC 2021 databases, we train the CNN in multiple phases and choose different learning rates according to the training situation as well as the size of the training set, thereby avoiding non-convergence of the loss when the model is trained on a large dataset.
To summarize, the main contributions of this paper are listed as follows:
● We propose a two-step method to detect AF episodes, in which an SVM classifier performs rhythm classification and a 1-dimensional (1-d) CNN performs episode location.
● The method exploits both ventricular responses and atrial activities and is suitable for analyzing unequal-length ECG records.
● We propose a phased training scheme that adjusts the learning rate and the size of the training set to train the deep model on large data.
CPSC 2021 [14] provides two public databases, named dataset Ⅰ and dataset Ⅱ, and an unpublished test set, part of which comes from the same source as the two public databases. Both public databases contain a large number of variable-length ECG record fragments extracted from lead Ⅰ and lead Ⅱ of long-term dynamic ECG signals. Dataset Ⅰ includes 730 records collected from 54 subjects, and dataset Ⅱ includes 706 records collected from 51 subjects. There are three rhythm types, namely, non-AF rhythm, AFp rhythm, and AFf rhythm. Table 1 shows the distribution of rhythm types in the public databases. The duration of the records ranges from 0.14 to 411.11 minutes, with an average of 20.33 minutes. The original sampling rate is 200 Hz, and both non-AF and AF episodes last no fewer than 5 consecutive beats.
Table 1. Rhythm distribution in the two public databases.

| Rhythm distribution | N | AFp | AFf | Subject | Sum |
|---|---|---|---|---|---|
| Dataset Ⅰ | 481 | 96 | 153 | 0–53 | 730 |
| Dataset Ⅱ | 251 | 133 | 322 | 54–104 | 706 |
| Sum | 732 | 229 | 475 | – | 1436 |
We use records from lead Ⅱ in our experiment. For rhythm classification in the first step, we divide the records from dataset Ⅰ and dataset Ⅱ into a rhythm training set and a rhythm test set. The distribution of rhythms in these subsets is shown in Table 2; notably, the class distribution of the rhythm training set is relatively balanced. In addition, rhythms belonging to the same class come from different subjects in the rhythm training set and the rhythm test set.
Table 2. Rhythm distribution in the rhythm training set and the rhythm test set.

| Rhythm distribution | N | AFp | AFf | Subject | Sum |
|---|---|---|---|---|---|
| Training rhythms from Dataset Ⅰ | 0 | 77 | 0 | 3, 25, 31, 32, 39, 40, 48, | 77 |
| Training rhythms from Dataset Ⅱ | 251 | 133 | 322 | 54–96, 99, 100, 102–104 | 706 |
| Test rhythms from Dataset Ⅰ | 481 | 0 | 153 | 0–53 | 634 |
| Test rhythms from Dataset Ⅱ | 0 | 19 | 0 | 97, 98, 101 | 19 |
| Sum | 732 | 229 | 475 | – | 1,436 |
| The rhythm training set | 251 | 210 | 322 | – | 783 |
| The rhythm test set | 481 | 19 | 153 | – | 653 |
For heartbeat classification in the second step, we refer to heartbeats constituting AF episodes as AF beats and heartbeats constituting non-AF episodes as N beats. Finally, a beat training set, a beat validation set, and a beat test set are generated from the public CPSC 2021 databases. As the heartbeat distribution in Table 3 shows, the ratio of N beats to AF beats is about 1:1 in the training and validation sets. A portion of the beat training set and the whole beat training set are used in the phased training scheme, which is introduced in Section 4. To obtain a model with outstanding performance and convincing test results, the heartbeat segments in each subset come from different records.
Table 3. Heartbeat distribution in the beat training, validation and test sets.

| Heartbeat distribution | Non-AF beats | AF beats | Sum |
|---|---|---|---|
| A portion of the beat training data | 372,616 | 368,836 | 741,452 |
| The whole beat training set | 629,425 | 627,188 | 1,256,613 |
| The beat validation set | 32,257 | 31,134 | 63,391 |
| The beat test set | 104,381 | 15,615 | 119,996 |
| Sum | 1,138,679 | 1,042,773 | 2,181,452 |
For AFp event detection, an effective approach is first to determine whether a record is an AFp rhythm and then to locate the starting and ending points of its AF episodes. Therefore, we divide the process of AFp event detection into two steps, namely, rhythm classification and heartbeat classification, and in each step we choose the most appropriate algorithm and optimize it accordingly. For rhythm classification in the first step, all records are classified into three rhythm types, including non-AF rhythm, AFf rhythm, and AFp rhythm; a traditional machine learning method distinguishes the rhythm type of each record. For heartbeat classification in the second step, a deep learning-based method classifies the beats of AFp rhythms predicted in the first step into non-AF beats or AF beats. Then, a median filter post-processes the results of heartbeat classification, which effectively reduces false detection of AF beats. Finally, the starting and ending points of AF episodes are the locations where the beat type changes. Figure 1 shows the flowchart of AFp event detection.
Rhythm classification is based on features computed from RR intervals and on classifier selection. Each record is treated as a single rhythm: non-AF rhythm, AFp rhythm, or AFf rhythm. The process consists of three parts: (1) feature extraction, (2) feature normalization and (3) classification, as shown in Figure 2(a). For feature extraction, we extract representative features from each record. For feature normalization, each feature is processed by min-max normalization. For classification, a support vector machine (SVM) classifier is trained by grid search and 5-fold cross validation on the rhythm training set and is tested on the rhythm test set. To place our method in context, Table A1 (in the Appendix) summarizes existing, representative methods for rhythm classification alongside our method and our optimized method.
AF rhythm exhibits obviously irregular RR intervals, and analysis based on ventricular responses is more robust to noise [15]. Thus, the rhythm type is judged according to the characteristics of the RR intervals. We extract features in the time domain, frequency domain, and nonlinear domain, respectively (a feature-extraction sketch follows the list below). The features are extracted from variable-length records, but the number of features is fixed for each record, which meets the requirement of a classifier with a fixed input length. Hence, we do not need to crop records, nor do we need to rely on a fixed number of RR intervals.
● Time domain features: These include the average RR interval (avgRR), the percentage of successive RR-interval differences greater than 50 ms (pRR50), the number of successive RR-interval differences greater than 50 ms (RR50), the standard deviation of RR intervals (SDRR) and the root mean square of successive differences of RR intervals (RMSSD), etc.
● Frequency domain features: The RR intervals are first resampled to 7 Hz. A power spectrum is estimated using the Welch power spectral density estimate (Hanning window; 4096 points; 50% overlap). The calculated frequency domain features include the normalized low-frequency power (LFnorm), the normalized high-frequency power (HFnorm), and the ratio of low-frequency to high-frequency power (rLH), etc.
● Non-linear features: These are obtained from the analysis of the Lorenz plot. They include the standard deviation in the transverse direction (sd1), the standard deviation in the longitudinal direction (sd2), the cardiac sympathetic index (csi), the cardiac vagal index (cvi), etc.
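As a concrete illustration, the following Python sketch computes representative features from each of the three groups for one record, given its RR-interval sequence in seconds. It is not the authors' code: the function name, the LF/HF frequency bands, and the csi/cvi formulas are common-default assumptions, while the 7 Hz resampling and the Welch settings follow the description above.

```python
# Sketch: representative RR-interval features from the three groups described above.
# `rr` is a 1-d numpy array of RR intervals in seconds for one record.
import numpy as np
from scipy.interpolate import interp1d
from scipy.signal import welch

def extract_rr_features(rr, fs_resample=7.0):
    feats = {}
    drr = np.diff(rr)
    # Time domain features
    feats["avgRR"] = rr.mean()
    feats["SDRR"] = rr.std()
    feats["RMSSD"] = np.sqrt(np.mean(drr ** 2))
    feats["RR50"] = int(np.sum(np.abs(drr) > 0.05))         # successive differences > 50 ms
    feats["pRR50"] = 100.0 * feats["RR50"] / len(drr)
    # Frequency domain: resample the RR series to 7 Hz, then Welch PSD
    t = np.cumsum(rr)
    t_uniform = np.arange(t[0], t[-1], 1.0 / fs_resample)
    rr_uniform = interp1d(t, rr, kind="cubic")(t_uniform)
    f, pxx = welch(rr_uniform - rr_uniform.mean(), fs=fs_resample,
                   window="hann", nperseg=min(4096, len(rr_uniform)))  # 50% overlap by default
    lf = pxx[(f >= 0.04) & (f < 0.15)].sum()                # standard LF band (assumed)
    hf = pxx[(f >= 0.15) & (f < 0.40)].sum()                # standard HF band (assumed)
    feats["LFnorm"], feats["HFnorm"], feats["rLH"] = lf / (lf + hf), hf / (lf + hf), lf / hf
    # Non-linear features from the Lorenz (Poincare) plot
    sd1 = np.sqrt(0.5) * drr.std()
    sd2 = np.sqrt(max(2 * rr.var() - 0.5 * drr.var(), 1e-12))
    feats["sd1"], feats["sd2"] = sd1, sd2
    feats["csi"] = sd2 / sd1                                # cardiac sympathetic index (common definition)
    feats["cvi"] = np.log10(16.0 * sd1 * sd2)               # cardiac vagal index (common definition)
    return feats
```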
The convergence of a classifier slows down when its input features have very different scales, which can even degrade the classifier's performance [16]. To address this, we apply min-max normalization to the rhythm subsets. It is worth noting that the minimum and maximum values computed on the rhythm training set are also used to normalize the rhythm test data.
The SVM classifier [17], proposed by Boser et al., has strong generalization ability: it maps linearly inseparable low-dimensional data into a higher-dimensional space via a kernel function, with the mapping determined by the form of the kernel. Training can be summarized as a convex optimization problem with a global optimal solution [18,19]. In our experiment, we choose an SVM classifier with a radial basis function (RBF) kernel to automatically identify the rhythm class of each record. Optimal parameters are determined by 5-fold cross validation and grid search on the rhythm training data.
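The normalization and classifier-training steps above can be sketched as follows. The placeholder data and the grid of C and gamma values are illustrative assumptions; the pipeline structure (training-set min-max scaling, RBF-kernel SVM, grid search with 5-fold cross validation) follows the text.

```python
# Sketch: training-set min-max normalization plus an RBF-kernel SVM tuned by grid
# search with 5-fold cross validation. The feature matrices here are random placeholders.
import numpy as np
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import MinMaxScaler
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X_train, y_train = rng.normal(size=(120, 16)), rng.integers(0, 3, 120)   # 0 = non-AF, 1 = AFp, 2 = AFf
X_test = rng.normal(size=(30, 16))

pipe = Pipeline([
    ("scale", MinMaxScaler()),          # min/max learned on the rhythm training set only
    ("svm", SVC(kernel="rbf")),
])
param_grid = {"svm__C": [0.1, 1, 10, 100], "svm__gamma": ["scale", 0.01, 0.1, 1]}  # illustrative grid
search = GridSearchCV(pipe, param_grid, cv=5, scoring="accuracy", n_jobs=-1)
search.fit(X_train, y_train)
rhythm_pred = search.best_estimator_.predict(X_test)        # scaler reuses the training min/max
```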
Heartbeat classification is the basis of detection for various arrhythmias [18,20]. A single beat is short enough for a 1-d CNN to automatically extract deep features from its morphological characteristics. Considering that the aim is to locate the starting and ending points of AF episodes by classifying the beats constituting AFp rhythms, we label beats as either AF beats or non-AF beats, rather than labeling beats according to the Association for the Advancement of Medical Instrumentation (AAMI) standard [21]. Therefore, when the model is tested on the beat test set, the changes of predicted heartbeat type between AF beats and non-AF beats indicate the starting and ending points of AF episodes. The process of heartbeat classification is composed of four parts: baseline removal, heartbeat segmentation, heartbeat normalization, and deep learning-based model building, as shown in Figure 2(b). Then, a median filter corrects misjudged heartbeats to a certain extent as a post-processor. The phased training scheme is also introduced in this section.
Other participants in CPSC 2021 have proposed further outstanding, as-yet-unpublished algorithms for AF episode location, with innovative and valuable ideas in preprocessing, model structure, and the overall location process.
Considering that ECG signals collected in clinical practice contain various kinds of noise, we only filter the baseline drift and adopt no other denoising approaches, in order to enhance the robustness of the proposed method to noise. Following the approach adopted by Chazal [18] and Jiang [20], a 200 ms median filter first removes the P waves and QRS complexes of the original signal, and then a 600 ms median filter removes the T waves, yielding the baseline drift signal. Finally, the ECG signal without baseline drift is obtained by subtracting the baseline drift signal from the original ECG record.
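A minimal sketch of this baseline-removal step is given below; rounding the 200 ms and 600 ms windows to odd sample counts at 200 Hz is our assumption, since medfilt requires odd kernel lengths.

```python
# Sketch: baseline-drift estimation with cascaded 200 ms and 600 ms median filters at 200 Hz.
import numpy as np
from scipy.signal import medfilt

def remove_baseline(ecg, fs=200):
    k1 = int(0.2 * fs) | 1               # ~200 ms window, forced odd (41 samples)
    k2 = int(0.6 * fs) | 1               # ~600 ms window, forced odd (121 samples)
    baseline = medfilt(medfilt(ecg, k1), k2)
    return ecg - baseline                # ECG without baseline drift

ecg = np.random.default_rng(0).normal(size=2000)   # placeholder signal
ecg_clean = remove_baseline(ecg)
```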
The original sampling rate of the ECG signals in the CPSC 2021 databases, 200 Hz, is used as the standard sampling rate. The R-labels provided by the CPSC 2021 databases are used as fiducial R-points. Each segment is a single heartbeat with 200 sampling points, consisting of 60 points before the R-point, the R-point itself, and 139 points after it; the proportion of sampling points before and after the R-point is thus close to 3:7. Each heartbeat segment contains a complete P wave, a QRS complex, and a T wave when these waves are present.
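A small sketch of this segmentation rule (60 samples before the R-point, the R-point itself, and 139 samples after it) follows; the boundary handling for beats near the record edges is our assumption.

```python
# Sketch: cut 200-sample segments around each fiducial R-point.
import numpy as np

def segment_beats(ecg, r_indices, pre=60, post=139):
    beats = []
    for r in r_indices:
        if r - pre >= 0 and r + post < len(ecg):        # skip beats too close to the record edges
            beats.append(ecg[r - pre : r + post + 1])   # length = pre + 1 + post = 200
    return np.asarray(beats)
```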
The amplitude gains of ECG signals collected by different ECG monitoring devices differ markedly. Moreover, normalizing the input data aids convergence and improves the generalization of a model [22]. Thus, it is necessary to normalize the data to a uniform scale. Using Eq (1), the amplitude of each heartbeat segment is mapped into the range 0–1.
$F_o = \dfrac{F - F_{\min}}{F_{\max} - F_{\min} + K}, \quad K = 10^{-6}$ | (1)
where F is a feature matrix; Fmin and Fmax are matrices assembled from the minimum and maximum vectors of the features, respectively; and K is set to 10^{-6} to prevent the divisor from being zero.
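A sketch of Eq (1) applied to a batch of heartbeat segments is shown below; whether the minimum and maximum are taken per beat or over the whole matrix is not fully specified above, so per-beat scaling is assumed here.

```python
# Sketch of Eq (1): per-beat min-max scaling of heartbeat segments into roughly 0-1.
import numpy as np

def normalize_beats(beats, k=1e-6):
    # beats: array of shape (n_beats, 200)
    f_min = beats.min(axis=1, keepdims=True)
    f_max = beats.max(axis=1, keepdims=True)
    return (beats - f_min) / (f_max - f_min + k)        # k keeps the divisor non-zero
```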
A CNN can automatically extract deep features from ECG signals and combines feature extraction and classification in an end-to-end fashion [23]. We build the 1-d model in Figure 3, whose deepening scheme follows that of the VGG13 network [24]. The whole model consists of an automatic deep feature extractor and a classifier. The feature extractor is composed of convolutional layers, maximum pooling layers, and dropout layers. The preprocessed heartbeat segments are fed directly into the feature extractor. Every two stacked convolutional layers have the same number of filters and are followed by a maximum pooling layer and a dropout layer to form a convolutional unit. Five such convolutional units are stacked in series to constitute the feature extractor. As the network deepens, the number of filters in the five units increases or stays unchanged, being 64, 128, 256, 512 and 512 in order. The kernel size and stride of the convolutional layers are 3 and 1, respectively. Maximum pooling layers with kernel size 2 and stride 2 halve the size of the feature maps. For the dropout layers, the probability of discarding neurons is set to 0.3 to reduce the risk of overfitting. The rectified linear unit (ReLU) activation function in the convolutional layers is used for nonlinear mapping.
The classifier is composed of a flattening layer, three fully connected layers, and dropout layers. First, the flattening layer expands the feature matrix into a feature vector. Then, fully connected layers and dropout layers are stacked alternately; the probability of discarding neurons in the classifier's dropout layers is set to 0.5. The numbers of neurons of the fully connected layers are 256, 128 and 1, respectively. The last fully connected layer serves as the output layer, which has a single neuron and uses the sigmoid activation function for binary classification; its output is the probability that the target beat is an AF beat. The feature extractor and classifier are connected for end-to-end training. Table 4 shows the hyperparameters of our model, and a code sketch of the architecture follows the table.
Table 4. Hyperparameters of the proposed 1-d CNN model (the total parameter count and FLOPs refer to the whole model).

| Layer | Output | Kernel | Stride | Padding | Total parameters | FLOPs |
|---|---|---|---|---|---|---|
| Input | (200, 1) | – | – | – | 3,956,289 | 0.148 G |
| Convolutional unit 1 | (100, 64) | 3 | 1 | same | | |
| Convolutional unit 2 | (50, 128) | 3 | 1 | same | | |
| Convolutional unit 3 | (25, 256) | 3 | 1 | same | | |
| Convolutional unit 4 | (12, 512) | 3 | 1 | same | | |
| Convolutional unit 5 | (6, 512) | 3 | 1 | same | | |
| Flatten | (3072, ) | – | – | – | | |
| FL 1 | (256, ) | – | – | – | | |
| Dropout 6 | (256, ) | 0.5 | – | – | | |
| FL 2 | (128, ) | – | – | – | | |
| Dropout 7 | (128, ) | 0.5 | – | – | | |
| Sigmoid | (1, ) | – | – | – | | |
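The following Keras sketch reproduces the architecture of Figure 3 and Table 4. The activations of the hidden fully connected layers and the optimizer settings are assumptions where the text does not state them; with the layer sizes above, the flattened vector has length 3072 and the parameter count agrees with the 3,956,289 reported in Table 4.

```python
# Keras sketch of the 1-d VGG-style beat classifier described above.
import tensorflow as tf
from tensorflow.keras import layers, models

def build_beat_classifier(input_len=200):
    inputs = layers.Input(shape=(input_len, 1))
    x = inputs
    # Feature extractor: five convolutional units (two conv layers each, then pooling and dropout).
    for filters in (64, 128, 256, 512, 512):
        x = layers.Conv1D(filters, kernel_size=3, strides=1, padding="same", activation="relu")(x)
        x = layers.Conv1D(filters, kernel_size=3, strides=1, padding="same", activation="relu")(x)
        x = layers.MaxPooling1D(pool_size=2, strides=2)(x)
        x = layers.Dropout(0.3)(x)
    # Classifier head: flatten, two hidden fully connected layers with dropout, sigmoid output.
    x = layers.Flatten()(x)                                  # 6 x 512 = 3072 features
    x = layers.Dense(256, activation="relu")(x)
    x = layers.Dropout(0.5)(x)
    x = layers.Dense(128, activation="relu")(x)
    x = layers.Dropout(0.5)(x)
    outputs = layers.Dense(1, activation="sigmoid")(x)       # probability that the beat is an AF beat
    return models.Model(inputs, outputs)

model = build_beat_classifier()
model.compile(optimizer=tf.keras.optimizers.Adam(1e-4),
              loss="binary_crossentropy", metrics=["accuracy"])
model.summary()                                              # layer shapes can be checked against Table 4
```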
The beat training set is so large that model training takes a lot of time. To monitor the training process and avoid training failure, we propose a phased training scheme, with every 300 epochs forming a training phase. According to the state of model convergence, we change the amount of training data or the learning rate in each phase; Figure 4 shows the procedure. The model is trained over four phases, a total of 1200 epochs. We save the beat model with the highest F1 score on the beat validation set during the fourth phase.
A large training set benefits model performance but may make the training loss hard to converge and increase the possibility of training failure [25]. Therefore, we first train the model on a portion of the beat training data until convergence and then continue training it on the whole beat training set, in which abundant new data are added. The positive effect on model performance can be observed clearly when the beat training set is extended. In our scheme, the trainable parameters are randomly initialized in the first training phase; in the phase where training data are added, the previously optimized parameters serve as the initial parameters of the current phase. Well-chosen initial values of the trainable parameters reduce the possibility of the model falling into a local optimum [26].
In our experiments, the portion of the beat training data first gives the model the ability to distinguish the two classes. As shown in Figure 5(a), after more than 1000 epochs, the loss on the beat validation set trends upward and the Acc on the beat validation set trends markedly downward. Next, with the optimized parameters as initialization, the model is trained on the whole beat training set; that is, we expand the beat training set at the beginning of the fourth phase and fine-tune the model obtained at the end of the third phase on the whole beat training set. Figure 5(b) shows the loss and Acc curves acquired after data expansion. The large number of new samples lets the model learn new discriminative features quickly and improves its performance rapidly. After more than 1000 epochs, the validation loss is basically stable and the validation Acc stays above 90%. Finally, we obtain the best model at the 1043rd epoch, which has the highest F1 score on the beat validation set during the fourth phase.
A model converges with difficulty under a large learning rate, while a small learning rate tends to trap the model in a local minimum [27]. Thus, it is necessary to adjust the learning rate according to the training situation. We use the Adam optimizer to back-propagate the training loss on the beat training set, with each batch containing 256 heartbeat segments. The model is trained on a 3090 GPU under Ubuntu. We determine the learning rate at the beginning of each phase according to the training loss and validation loss, so that the model can be well trained on the large beat training set.
In the first phase, the learning rate of the Adam optimizer is 0.0001; at the beginning of the second phase we reduce it to one half of the original value and keep it constant until the beginning of the fourth phase. Observing the loss curve of the beat validation set, we find there is still room for convergence. Therefore, we restore the learning rate to 0.0001 at the beginning of the fourth phase to speed up the model's convergence.
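A compact sketch of the phased training loop is given below, reusing the `model` built in the earlier sketch and assuming that the arrays x_train_part, x_train_whole, x_val and their labels have been prepared. The reduced learning rate in phases 2–3 is taken as half of 0.0001 per the description above; recompiling keeps the learned weights but resets the optimizer state, which is a simplification of the authors' procedure.

```python
# Sketch of the four 300-epoch training phases; recompiling keeps the learned weights
# from the previous phase while changing the learning rate.
import tensorflow as tf

phases = [
    (1e-4, (x_train_part,  y_train_part)),    # phase 1: portion of the beat training set
    (5e-5, (x_train_part,  y_train_part)),    # phase 2: reduced learning rate
    (5e-5, (x_train_part,  y_train_part)),    # phase 3
    (1e-4, (x_train_whole, y_train_whole)),   # phase 4: whole beat training set, rate restored
]
checkpoint = tf.keras.callbacks.ModelCheckpoint(
    "best_beat_model.h5", monitor="val_accuracy", save_best_only=True)   # the paper tracks F1 instead

for lr, (x, y) in phases:
    model.compile(optimizer=tf.keras.optimizers.Adam(lr),
                  loss="binary_crossentropy", metrics=["accuracy"])
    model.fit(x, y, batch_size=256, epochs=300,
              validation_data=(x_val, y_val), callbacks=[checkpoint])
```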
According to CPSC 2021, the number of continuous beats constituting an AF episode or a non-AF episode is no less than 5. Therefore, we use a median filter with a window spanning five beats to correct the types predicted by the heartbeat classification model. Under this postprocessing, the predicted result for the target beat is replaced by the median over the target and its four surrounding beats. This corrects outliers in the predicted results and reduces the misdiagnosis of AF episodes. Leaving aside the case in which the first or last beat of an AFp rhythm is itself an AF beat, the starting point of an AF episode is the R-point of an AF beat whose preceding beat is a non-AF beat, and the ending point is the R-point of an AF beat whose following beat is a non-AF beat.
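A sketch of this postprocessing and endpoint extraction follows; the zero-padding that closes episodes touching the record boundaries, and the 0-based indexing, are our simplifications.

```python
# Sketch: five-beat median filtering of the per-beat predictions, then extraction of
# AF episode starting/ending beat indexes from the label changes.
import numpy as np
from scipy.signal import medfilt

def locate_af_episodes(beat_labels):
    # beat_labels: 1-d array of predicted labels, 1 = AF beat, 0 = non-AF beat
    smoothed = medfilt(beat_labels.astype(float), kernel_size=5).astype(int)
    padded = np.concatenate(([0], smoothed, [0]))     # closes episodes touching the record ends
    diff = np.diff(padded)
    starts = np.where(diff == 1)[0]                   # first AF beat of each episode (0-based)
    ends = np.where(diff == -1)[0] - 1                # last AF beat of each episode (0-based)
    return list(zip(starts, ends))
```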
In the 1-d CNN training process, all rhythms are first segmented so that AF beats and non-AF beats can be collected, and then the 1-d CNN is trained to predict the type of each heartbeat. During the test process for a single record, the rhythm type is predicted in the first step. The output of the AF starting and ending points is [] (an empty list, indicating that no AF episode is present) for a non-AF rhythm, and [1, the length of the record] (where 1 is the index of the first heartbeat) for an AFf rhythm. In the second step, the heartbeats of a record predicted as AFp rhythm in the first step are classified beat by beat, and the indexes of the beats are retained. After postprocessing with the median filter, the locations of AF episodes are given by the indexes of the beats at which the predicted type changes between AF beat and non-AF beat.
Four indicators, namely Acc, sensitivity (Sen), positive predictive value (PPV) and F1 score, are used to evaluate the performance of the algorithms applied to rhythm classification and heartbeat classification. These indicators are expressed in terms of false positives (FP), true positives (TP), false negatives (FN) and true negatives (TN). TP is the number of positive samples that are predicted correctly, FP is the number of negative samples predicted as positive, FN is the number of positive samples predicted as negative, and TN is the number of negative samples that are predicted correctly. The four indicators are defined in Eqs (2)–(5), respectively.
$\mathrm{Acc} = \dfrac{TN + TP}{TN + TP + FN + FP}$ | (2)
$\mathrm{Sen} = \dfrac{TP}{TP + FN}$ | (3)
$\mathrm{PPV} = \dfrac{TP}{TP + FP}$ | (4)
$F_1 = \dfrac{(1 + \beta^2) \times \mathrm{Sen} \times \mathrm{PPV}}{\beta^2 \times \mathrm{Sen} + \mathrm{PPV}}$ | (5)
The overall Acc is the percentage of samples that are predicted accurately. Sen reflects the missed-diagnosis rate for a given class, and PPV reflects its misdiagnosis rate. The F1 score provides a balanced evaluation of an algorithm's performance on an unbalanced test set. The β in Eq (5) is usually set to 1, so the F1 score is the harmonic mean of Sen and PPV.
The indicators for AFp event detection are defined by CPSC 2021. Ur is the score for rhythm detection, and Ue is the score for locating the starting and ending points of AF episodes in AFp rhythms. The scoring criteria for Ur and Ue are shown in Figure 6. The score of each record is the sum of Ur and the weighted Ue, and the final score U is the mean score over all records in a test set, as described in Eq (6).
$U = \dfrac{1}{N}\sum_{i=1}^{N}\left(U_{r_i} + \dfrac{M_{a_i}}{\max\{M_{r_i}, M_{a_i}\}} \times U_{e_i}\right)$ | (6)
where N is the number of records in the test set, $M_{a_i}$ is the number of true AF episodes in the i-th record, and $M_{r_i}$ is the number of predicted AF episodes in the i-th record.
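For clarity, Eq (6) can be computed as in the following sketch; the guard against an empty maximum is our addition for records with no true or predicted episodes.

```python
# Sketch of Eq (6): the final score U from per-record rhythm scores (ur), endpoint
# scores (ue), and the true/predicted AF episode counts (ma, mr).
def challenge_score(ur, ue, ma, mr):
    total = 0.0
    for ur_i, ue_i, ma_i, mr_i in zip(ur, ue, ma, mr):
        weight = ma_i / max(mr_i, ma_i) if max(mr_i, ma_i) > 0 else 0.0   # guard against 0/0
        total += ur_i + weight * ue_i
    return total / len(ur)
```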
The rhythm classification and heartbeat classification algorithms are the foundation of starting and ending point detection for AF episodes. Figure 7 shows the confusion matrices of rhythm classification and heartbeat classification on their respective test sets, and Table 5 shows the calculated assessment metrics. Both algorithms have good classification performance, achieving overall Acc of 98.62% and 90.81%, respectively. However, there is still room for improvement in distinguishing AFf rhythms from AFp rhythms.
Table 5. Assessment metrics of rhythm classification and heartbeat classification on their respective test sets.

| Algorithm | Class | Sen | PPV | Acc | F1 | Overall Acc |
|---|---|---|---|---|---|---|
| Rhythm classification | N | 0.998 | 0.998 | 0.997 | 0.998 | 0.986 |
| Rhythm classification | AFp | 0.684 | 0.867 | 0.988 | 0.765 | 0.986 |
| Rhythm classification | AFf | 0.987 | 0.962 | 0.988 | 0.974 | 0.986 |
| Heartbeat classification | Non-AF beat | 0.906 | 0.913 | 0.908 | 0.909 | 0.908 |
| Heartbeat classification | AF beat | 0.910 | 0.903 | 0.908 | 0.907 | 0.908 |
We are honored to have participated in CPSC 2021 and tested our proposed two-step method on the unpublished test set. In the first step, the starting and ending points are empty for non-AF rhythms; for AFf rhythms, they are the positions of the R-points of the first and last beats, respectively. If a record is predicted as an AFp rhythm, the deep learning-based model classifies its beats in the second step, and the starting and ending points are located after postprocessing. Finally, our proposed two-step method achieves a final score U of 1.9310 on the unpublished test set and ranks eighth among all competing algorithms in CPSC 2021.
Compared with a one-step algorithm that relies on heartbeat classification alone to locate AF episodes, the two-step method has clear advantages. In the two-step method, records are classified into three rhythm types based on ventricular responses in the first step, and the SVM classifier achieves high accuracy in classifying records of variable length. For records predicted as non-AF rhythms, the AF starting and ending points are empty; for those predicted as AFf rhythms, the starting point is 1 and the ending point equals the length of the corresponding record. Records predicted as AFf or non-AF rhythms in the first step do not participate in the second step; the 1-d CNN analyzes only the AFp rhythms obtained in the first step, beat by beat.
If, instead, we used the heartbeat classification algorithm alone to locate AF episodes, the 1-d CNN would analyze all non-AF, AFf and AFp rhythms. When AF beats and non-AF beats are classified beat by beat, false positives and false negatives may occur in non-AF and AFf rhythms, respectively. Therefore, the location accuracy would be lower than that of the two-step method.
The classifier for rhythm classification still has room for optimization: some redundant features negatively affect its performance, and feature selection is a valid way to discard useless features. We therefore apply a genetic algorithm to select features and optimize the rhythm classifier, using an SVM classifier as the prediction model and 5-fold cross validation to evaluate candidate feature subsets (a sketch of this procedure is given after Table 6). Finally, we obtain eight features, which are listed in Table 6. The selected features represent the records effectively, and the optimized SVM classifier performs well on all rhythm types. Figure 8 shows the confusion matrix of the optimized SVM classifier on the rhythm test set.
Table 6. The eight features selected by the genetic algorithm.

| Feature group | Num | Feature description | Symbol |
|---|---|---|---|
| Time domain | 1 | Percentage of successive RR-interval differences greater than 20 ms. | pRR20 |
| Time domain | 2 | Median of absolute values of successive differences between RR intervals. | mavsd |
| Time domain | 3 | Ratio of the standard deviation to the mean of RR intervals. | rsdm |
| Time domain | 4 | Maximum heart rate. | maxHR |
| Time domain | 5 | Minimum heart rate. | minHR |
| Frequency domain | 6 | Low-frequency power. | LF |
| Frequency domain | 7 | Normalized high-frequency power. | HFnorm |
| Non-linear domain | 8 | Modified cardiac sympathetic index. | mdcsi |
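A compact sketch of the wrapper-style genetic algorithm described above follows, with an RBF-kernel SVM and 5-fold cross-validation accuracy as the fitness function. The population size, number of generations, mutation rate, selection and crossover operators are illustrative assumptions; X and y stand for the normalized rhythm training features and labels.

```python
# Sketch: genetic-algorithm feature selection with an SVM wrapper and 5-fold CV fitness.
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVC

def ga_select_features(X, y, n_pop=30, n_gen=40, p_mut=0.05, seed=0):
    rng = np.random.default_rng(seed)
    n_feat = X.shape[1]
    pop = rng.integers(0, 2, size=(n_pop, n_feat))            # binary masks: 1 = keep the feature

    def fitness(mask):
        if mask.sum() == 0:
            return 0.0
        return cross_val_score(SVC(kernel="rbf"), X[:, mask.astype(bool)], y, cv=5).mean()

    for _ in range(n_gen):
        scores = np.array([fitness(ind) for ind in pop])
        parents = pop[np.argsort(scores)[::-1][: n_pop // 2]]  # keep the fitter half
        children = []
        while len(children) < n_pop - len(parents):
            a, b = parents[rng.integers(len(parents), size=2)]
            cut = rng.integers(1, n_feat)                       # one-point crossover
            child = np.concatenate([a[:cut], b[cut:]])
            flip = rng.random(n_feat) < p_mut                   # bit-flip mutation
            child[flip] = 1 - child[flip]
            children.append(child)
        pop = np.vstack([parents, np.array(children)])
    best = pop[np.argmax([fitness(ind) for ind in pop])]
    return np.flatnonzero(best)                                 # indexes of the selected features
```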
We also compare the performance of other common classifiers before and after feature selection, including the logistic regression (LR) classifier, naive Bayes (Bayes) classifier, K-nearest neighbour (KNN) classifier, decision tree (DT) classifier and random forest (RF) classifier. In Table 7, the decision tree achieves 100% Acc both before and after feature selection and is the best rhythm classifier. To facilitate comparison, we test the DT after feature selection combined with the deep model, as well as the original SVM classifier combined with the deep model, on all records in the two public databases; the score U improves from 1.6539 to 1.6632.
Table 7. Performance of common classifiers before and after feature selection.

| Classifier | Sen (before) | PPV (before) | Acc (before) | Sen (after) | PPV (after) | Acc (after) |
|---|---|---|---|---|---|---|
| LR | 0.985 | 0.845 | 0.974 | 0.989 | 0.871 | 0.982 |
| Bayes | 1 | 1 | 1 | 1 | 1 | 1 |
| KNN | 0.961 | 0.784 | 0.946 | 0.988 | 0.869 | 0.980 |
| SVM | 0.890 | 0.942 | 0.986 | 0.990 | 0.920 | 0.991 |
| DT | 1 | 1 | 1 | 1 | 1 | 1 |
| RF | 0.999 | 0.983 | 0.998 | 1 | 1 | 1 |
This paper proposes a two-step method for locating the starting and ending points of AF episodes as well as detecting variable-length AFp rhythms. To the best of our knowledge, this is the first time that AFp recognition and AF episode location are jointly considered for machine learning-based AFp event detection. In the first step, feature extraction and a classifier are combined to identify the rhythm type of a single record as non-AF rhythm, AFp rhythm, or AFf rhythm. In the second step, we use a CNN that takes a single heartbeat as input for automatic deep feature extraction and heartbeat classification. To adapt to training on a large beat training set, a phased training scheme is proposed that changes the size of the training set and the learning rate adaptively. In the end, our two-step method obtains a final score U of 1.9310 on the unpublished test set and ranks eighth in CPSC 2021.
We have also optimized the algorithm for rhythm classification: (1) the genetic algorithm is applied for feature selection, and eight useful features are retained; (2) after multiple classifiers are compared, the SVM classifier is replaced by a decision tree classifier. To make the optimized method comparable with the original method, we test on all the records in the two public CPSC 2021 databases; finally, the score U of the optimized method is improved by 0.0093. Our optimized two-step method is suitable for AFp analysis of long-term recordings and helpful for AF burden index measurement.
This work was supported in part by Shanghai Municipal Special Project of Industry Transformation and Upgrading under Grant GYQJ-2020-1-31, Medical Scientific Research Key Project of Jiangsu Commission of Health under Grant ZDB2020025 and Medical Scientific Research Instructional Project of Jiangsu Commission of Health under Grant Z2020075.
The authors declare that there is no conflict of interest.
Table A1. Comparison of existing representative methods for rhythm classification with our method and our optimized method.

| Existing method | Liu [8] | Petrenas [9] | Ganapathy [10] | Xin [11] | Our method | Our optimized method |
|---|---|---|---|---|---|---|
| Feature | RR intervals | RR intervals, f wave presence, P wave absence, and noise level | RR intervals | HRV signal | RR intervals | RR intervals |
| Lead | Single lead | Two leads | Single lead | | | |
| FT | Covariance representation and kernel mapping | Fuzzy logic | Dynamic symbol assignment and covariance representation | Wavelet analysis and scale entropy calculation | Time domain, frequency domain, and non-linear domain representation | Time domain, frequency domain, non-linear domain representation, and feature selection |
| Classifier | Kernel sparse coding and dictionary learning | Sliding window and threshold | KNN, SVM, RF, ROF, EL | SVM | SVM | DT |
| Length | 33 RR intervals | Finite length (about 2 min) | Finite length (1–5 min) | Fixed length (5 min) | Variable length (≥ 4 RR intervals) | Variable length (≥ 4 RR intervals) |
| Class | AFp rhythm | Occult AFp rhythm | AFp and non-AF rhythm | Non-AF, AFp and other rhythm | AFp, AFf and non-AF rhythm | AFp, AFf and non-AF rhythm |

Note: FT represents feature transformation; HRV represents heart rate variability; ROF represents rotation forest; and EL represents ensemble learning.