1.
Introduction
In this paper we are interested in numerical approximation of heat conduction with phase change in heterogeneous and composite materials. We are motivated by important applications in modeling permafrost, human tissue, and phase change materials (PCM) for thermal energy storage, smart textiles and buildings. These applications involve materials with drastically different thermal properties separated by an interface, thus require conservative algorithms. While we focus on mathematical and computational challenges rather than on practical engineering or geophysics scenarios, we aim to develop robust and accurate yet simple algorithms adequate for low regularity of solutions and easily extendable to future multiphysics simulators. Challenges similar to those discussed here occur in simulation of multiphase flow in fractures or rocks of different type [1,2], as well as in other applications which we give in what follows.
Overview. We focus on two phases: solid and liquid, separated by a free boundary, or by a region in which these phases coexist. In a single material we consider the following nonlinear parabolic equation
to be supplemented by appropriate boundary and initial conditions. The model is solved for the temperature θ, the internal energy (enthalpy) density w and the heat flux q. Here f represents heat sources, k is the heat conductivity. The definition of w closes the model. For this definition, we consider one of the following equilibrium relationships
In particular, in the Stefan problem denoted by (ST), w∈αST(θ)=cθ+Lχ. Here c is the heat capacity, L is the latent heat, and the water fraction χ∈H(θ) (where H is the Heaviside graph) "translates" the well known Stefan condition prescribing the velocity of the free boundary S between the liquid and solid regions to (1.1) defined over both solid and liquid regions. Its approximation is (ST)ϵ, in which αSTϵ(θ) is some ϵ-dependent single-valued piecewise smooth approximation to αST with Lipschitz constant LαSTϵ∼ϵ−1; see Section 2 for details. In turn, in permafrost models (P) w=αP(θ) is given by a monotone piecewise smooth Lipschitz function αP(⋅), with some large Lipschitz constant LαP; we give details in Section 5. See also a summary of notation in Table 1.
Known work. Even in a single material the models (1.1) and (1.2) does not have classical solutions and features challenges to the analysis and approximation due to the nonlinear multi-valued or only piecewise smooth character of α(⋅). Typically, (1.1) is posed in the sense of distributions, and its solutions (θ,w) have low regularity.
The majority of numerical analysis for (1.1), under homogeneous Dirichlet boundary conditions, is carried out for (ST)ϵ after so-called Kirchhoff transformation which renders (ST) or (ST)ϵ in the equivalent form, and with a new variable u instead of θ
Since u (as well as θ) are expected to be continuous, it appears natural to seek their continuous piecewise linear (P1) finite element approximations. In turn, wh is either also piecewise linear (P1) [3] or piecewise constant (P0) [4,5]. We denote these approaches, respectively, as P1-P1 or P1-P0. For the permafrost (P) problem, the literature reports on the node-centered finite difference or piecewise linear finite element implementations, but without theoretical analysis or studies of convergence; see, e.g., in [6,7,8,9,10,11]. These also fall in the category of P1-based approximations. In turn, it is common to use cell-centered finite differences (CCFD) or finite volumes (FV) for a variety of fluid flow problems including those in subsurface [12].
Heterogeneous materials and conservative approximations. It is well-known that the P1-based approximations due to their poor approximation of fluxes are not well suited for computational modeling of problems with large heterogeneity of coefficients such as Darcy flow in porous media coupled to transport; see, e.g., discussions in [12,13]. For computational models of (1.1) and (1.2) alone in single material, this would not be an issue.
However, our goals in this paper are to study (1.1) and (1.2) in heterogeneous domains; additionally, we seek robust algorithms suitable for coupled multiphysics system such as THM [14,15,16,17], and, simultaneously, for multiphase multicomponent systems at complicated geometries such as at pore-scale based on voxel grids [18,19,20,21]. Therefore, we draw from literature on conservative approximations using mixed finite element methods for multiphase flow in porous media and specifically on problems with nonlinearities similar to (1.2) [22,23,24]. These feature conservative fluxes and piecewise constant (P0) approximations to scalar unknowns. In this paper we approximate both θ and w with piecewise constants (P0), thus we refer to these algorithms as P0-P0. Finally, when fluxes are approximated in the space RT[0] (discussed below), the algorithms can be conveniently implemented as cell-centered finite differences (CCFD); this gives a robust easily extendable structure towards more complex physics across many scales. Although these algorithms feature lower order approximation error, this is not an issue given low regularity of solutions to (1.1). Unfortunately, the approximation of fluxes features a technical gap since the normal fluxes for (ST) feature a jump; we defer the associated theoretical issues to future work.
Our results. (i) We first evaluate the feasibility of P0-P0 approximations with fully implicit in time approximations for (1.1) with (1.2) for a single material such as bulk water. We show our approach compares well to P1-based schemes, and that the solvers are robust, even if there are some theoretical concerns. (ii) Second, we develop P0-P0 algorithms for the heterogeneous case k=k(x;θ) and α=α(x;θ) in multiple materials, when the data features the realistic but most challenging case of piecewise constant k(x;⋅) and α(x;⋅). Another challenge is that θ may not be continuous across material interfaces with high thermal resistivity. We prove various results and show robustness of our P0-P0 and monolithic CCFD algorithm which is conservative. Finally, we apply P0-P0 scheme to permafrost modeling including in heterogeneous materials, tie the theoretical framework for αP to that developed for other αST and αSTϵ, and confirm convergence.
Overall, we find that P0-P0 algorithm is robust for heterogeneous extensions of (1.1) with any of (1.2) we considered. The order of convergence in the scalar variables (θ,w) is, for the most part, about O(h) and, O(√h) whenever time step τ=O(h) is used. The nonlinear solver performs also well, even though it might need improvements for strongly heterogeneous nonlinearities.
Outline: We give details on the phase change problems modeled by (1.1) with (1.2a) and (1.2b) in Section 2. In Section 2.4 we overview, compare, and illustrate traditional piecewise linear approximations to (ST) and (ST)ϵ which we call P1-P1 or P1-P0. In Section 3 we define our P0-P0 approximations to (ST) and (ST)ϵ, prove some auxiliary results, and compare convergence and robustness of the scheme to P1-based approaches. In Section 4 we address heterogeneous (ST) such as in (1.3), and prove properties of the P0-P0 scheme for the solutions that feature a jump across the material interfaces. In Section 5 we provide details on the permafrost model (P) (1.2c), along with convergence results for P0-P0 scheme, and illustrate with simulations for both homogeneous and heterogeneous problem. We close in Section 6, with auxiliary results given in Appendix 7.
1.1. Notation
The problem (1.1) is posed in Q=Ω×(0,T) where Ω⊂Rd is an open bounded spatial domain with a smooth boundary ∂Ω. In heterogeneous case, we partition ¯Ω=⋃NMATj=1¯Ω(j) into subdomains Ω(j), corresponding to j=1,…NMAT materials, with the material interfaces Γ=⋃ijΓ(ij),Γ(ij)=∂Ω(i)∩∂Ω(j). Thermal properties of a region depend on the material in that region. We assume that in material j the heat conduction and phase transition parameters are known fixed constants. For example, at a point x∈Ω(j), the freezing temperature is fixed for this material so that θfr(x)=θ(j)fr. In turn, the thermal conductivity k(x;θ)=k(j)(θ) depends on the temperature in a fixed way specific to material j. We denote this dependence of thermal properties as follows
The notation Q(j)p indicate the portion of Q in domain (j) and phase p.
We assume that the subdomains and interfaces are smooth enough so that standard notation and results using the spaces Ck(Ω), and Lebesque and Sobolev spaces such as Lp(Ω) and HkΩ) make sense. For a Lipschitz continuous function f we denote its Lipschitz constant by Lf. We denote by V=H10(Ω) for the primal variational formulation, and by X=Hdiv(Ω) and M=L2(Ω) the spaces for fluxes and scalars, respectively, needed in the mixed setting. By (u,v)=∫Ωuv we denote the inner product of scalar functions on L2 as well as that for vector valued functions on (L2)d. The norms ||f||G of functions in space G are as indicated with a subscript G which is dropped if G=L2(Ω) or G can be inferred from context. For time-dependent problems when t∈(0,T) and, e.g., u∈L2(0,T;V) we use the shorthand notation L2(V). For discrete formulations, we recall the spaces Vh⊂V of piecewise linear finite elements based on a triangular grid. We will also use Xh⊂X some approximations to the vector valued functions such as heat flux, and Mh⊂M the space of piecewise constant approximations.
By 1U(x) we denote the characteristic function of set U⊂Rd∋x. For two sets Ω− and Ω+ separated by an interface ∂Ω+∩∂Ω−, by [r(x)]=(r|Ω+−r|Ω−)|x we denote the jump of quantity r at some interface point x∈∂Ω+∩∂Ω−. These two domains correspond to two materials, or to two phases of one material. When the sign of the jump is important, we make it precise on which side of the interface the limits are taken.
Next, consider real functions u,w defined on Q, and a function β:R→R. The statement u=β(w) means that u(x,t)=β(w(x,t)) a.e. on Q. Similarly, consider a graph of a relation α⊂R×R for which α−1=β is a non-injective function, but α(⋅) is multi-valued. In this case υ∈α(r) is equivalent to (r,υ)∈α, and, for functions u,w on Q, the statement w∈α(u) means that w(x,t)∈α(u(x,t)) a.e. (x,t)∈Q.
In time-discrete formulations we use uniform time stepping with time step τ>0 and 0=t0<t1,… and tn=nτ. For an unknown u(⋅,tn), we denote by un(⋅) its time approximation, and by unh(⋅) its fully discrete approximation, which may be identified by the set of values such as Uni. For a known function f(⋅,t), we denote fn to be the value f(⋅,tn) or its integral average 1τ∫tntn−1f(⋅,υ)dυ, made precise in the context.
Finally, when using physical data and units, we use SI units or state otherwise; see Table 1.
In addition, we make some assumptions on the data. Assumption 1.1 is a standard assumption for data of heat conduction problems.
Assumption 1.1. We assume that the latent heat coefficients L(j)≥0, and that heat capacity and conductivity coefficients are cmax≥c(j)p≥cmin>0;kmax≥k(j)p≥kmin>0 for all phases p=s,l, and domains j=1,…NMAT.
For simplicity of notation when reviewing literature as well as in the proofs of some results we assume the following. (More realistic cases are given in examples.)
Assumption 1.2. Let θ satisfy homogeneous Dirichlet b.c. θ|∂Ω=0. We also assume that some initial value winit∈L2(Ω) is given.
Last but not least, our focus in this paper is on the nonlinear relationship α given as one of the three specific choices α(ST), α(ST)ϵ, α(P) defined in (1.2) which come from phase change problems. However, our results apply to other possible α(⋅) with properties summarized in Assumption 1.3. These properties are well known for α(ST), α(ST)ϵ; see, e.g., [25]. For α(P) in the permafrost problem, these properties follow from the algebraic formulas discussed in Section 5.
Assumption 1.3. We assume that α satisfies one of the following properties (a)–(c), similar to (ST), (ST)ϵ, (P), respectively: (a) α is a maximal monotone graph with a non-injective Lipschitz inverse β=α−1, similarly to αST, (b) α is a monotone strictly increasing function which is piecewise smooth and globally Lipschitz, with an injective Lipschitz inverse, similarly to αSTϵ, (c) α is a smooth strictly increasing globally Lipschitz function which has one point of non-differentiability (similar to αP).
2.
Formulations and approximation of Stefan problem in a single material
In this section we provide details on (ST) and (ST)ϵ. We suppress these superscripts now. A description of the variables used in this paper is given in Table 1.
2.1. Stefan problem
We follow closely [27] as well as the applications literature [28] for thermodynamics of multiple phases. We consider the temperature field θ(x,t),x∈Ω,t>0 and define Ωl={x∈Ω:θ>0}, Ωs={x∈Ω:θ<0}, with Ql,Qs defined analogously. We also consider the phase interface S=∂Ql∩∂Qs, and St its instance at time t. The motion of St is described by its velocity v, with the normal component v⋅ν. In turn, the normal n to S is oriented towards Ql, with components (nx,nt) with nx parallel to ν, so that we have nt=−v⋅nx.
We recall first the so-called strong (classical) formulation of Stefan problem: we seek the temperature θ in each region Ωp,p=s,l which satisfies the heat equation
The first part of (2.1a) is the energy conservation involving heat flux q, external source f, and the internal energy cθ dependent on the temperature. The second part is the Fourier heat conduction law.
The coefficients of volumetric heat capacity c and heat conductivity k depend on the phase of a material; see Table 2 for typical values. In this paper we consider the simplest realistic case in which c,k are piecewise constant in each Ωs,Ωl, respectively.
One can extend (2.1b) to θ=0 using arithmetic averages [27](Ⅳ.4.1); see also Section 3.2.3.
2.1.1. Free boundary
Finding the free boundary S=∂Ql∩∂Qs, is part of the problem, with S(x,t):θ(x,t)=0. Additionally, the Stefan condition governs the velocity of St as follows
where ν is the unit normal to St=S∩Ω×{t}.
The strong (classical) form of the Stefan problem seeks {S,θ}, with θ is expected to have continuous derivatives to all the relevant order in Q∖S. As is well known, such a solution may not exist [27] (v)Section Ⅳ.1. Hence, there is need to define weak solutions of (2.1).
2.2. Weak formulation
The weak form of (2.1) is derived upon integration by parts using test functions from D(Q) and assuming that θ∈C(¯Q) but that its derivatives need only be in L1(Q∖S). The energy conservation (2.1a) is written in all of Q, and the weak form, in the sense of distributions, reads
The definition of enthalpy w follows by integration of parts of (2.2a) written in the sense of distributions, the use of (2.1a)–(2.1c) along with the assumption that S is smooth and θ continuous across S; see, e.g., [27](Ⅳ.1p.101) [25](A1 p244). This definition "translates" the Stefan condition (2.1c) as follows, in one of the many variants
Here ϕ is the order (phase) parameter and χ=ϕ+12 is the water fraction. In equilibrium, these are set to ϕ=1,χ=1 in Ωl and ϕ=−1,χ=0 in Ωs. On S where θ=0, ϕ and χ are independent variables determined uniquely by the heat content w; in particular, 0≤χ=wL≤1 when θ=0. However, when written in terms of temperature, the relationships (θ,χ) or (θ,ϕ) appear multi-valued
This only appears when the fact that χ (and ϕ) are independent variables is ignored.
The definition (2.2b) can be also explained from thermodynamic principles: here we have dw=c(θ,χ)dθ+Ldχ which expresses a general dependence of c on θ in (2.1b) as a weighted fraction of the heat capacities cl in the liquid and cs in solid phases
with a possibly variable cp=cp(θ). Then one integrates dw=cdθ+Ldχ to yield
This formula implies (2.1b) and has a nice nontrivial counterpart in permafrost; see Section 5.
In relaxation or in phase field models, i.e., away from equilibrium, ϕ is not given by (2.3), but is governed by its own dynamics, with its range possibly outside [−1,1]. We describe and illustrate such models in Section 2.4.1.
2.2.1. Kirchhoff transformation
The analysis of (2.2) proceeds after we change variables and write
The variable u is called the "Kirchhoff temperature" and is distinguished from the true temperature θ. To transform (2.2) to (2.6) we replace k∇θ=k(θ)∇θ=∇u with u(θ)=∫θ0k(r)dr, which, with (2.1b), gives
Next, from (2.2b) with (2.1b) we have the relationships for (θ,w)
Finally, we combine (2.8) with (2.7) and see that u=u(θ)=u(β(w))=βK(w) with
We see that β and βK are non-injective functions, while α and αK are maximal monotone graphs, i.e., the range of identity plus the graph is R, and they are monotone. The functions βK,β are Lipschitz, with Lipschitz constants
The functions βK,β along with the graphs αK,α are affine bounded, i.e., there are constants Cβ=max{c−1s,c−1l,Lc−1l},CβK=max{ksc−1s,klc−1l,Lklc−1l} and Cα=max{cs,cl,L},CαK=max{csk−1s,clk−1l,L} with which
2.2.2. Approximations of (ST)
There are many ways to approximate a multi-valued graph α (or αK, sgn or H) by a single-valued Lipschitz function so that its inverse is injective. One very specific (one-sided) approximation is the Yosida approximation
We see that Lαλ=O(λ−1) which blows up as λ↓0. For this and other similar approximations of αST by αSTϵ there is no jump of q⋅n or of w across the free boundary S. Rather, these variables vary sharply in the region
See, e.g., illustration in Figure 1(c).
2.3. Well-posedness
The models (1.1) with (1.2) and in particular (2.2) and (2.6) are nonlinear monotone evolution equations with maximal monotone graphs α. Theory for such equations in Banach spaces is given in [25]; the specific case of Stefan problem is discussed in [27] in an abstract Hilbert space setting; see also [3].
We recall only the case of Dirichlet homogeneous boundary conditions, in the Hilbert space case with V=H10(Ω). We seek w∈L∞(L2)∩H1(H−1), u∈L2(V) such that u=βK(w), w(⋅,0)=winit∈L2(Ω) and (see, e.g., [4])
Assuming some smooth enough winit is given, the maximal monotonicity of αK and the fact that both αK and α−1K are affine bounded are sufficient for [27](1.20, p34; Thm.Ⅱ.5.1, p59). In particular, with f∈L1(L2)∩L2(V′), and winit∈L2(Ω), one obtains that there exists a unique solution ∫T0u∈L∞(V)∩H1(L2) (note that this implies u∈L2(L2)). Further refinement of the theory [27](Prop.Ⅱ.1.3 and Thm.Ⅱ.1.4))[27] gives u,w∈L∞(L2), with uniqueness in [27](Thm 5.2). Further regularity [27](Thm 2.5) indicates u∈H1(L2)∩L∞(V). Since θ can be found from (2.7) which is continuous, one generally obtains similar qualitative properties of θ as those of u.
These results indicate quite low regularity of w in contrast to the case when αϵ is single valued and Lipschitz continuous, since then wϵ has the same regularity as uϵ [27](Thm 2.6). Further results hold for in-homogeneous Dirichlet data as long as it is smooth enough; these results elucidate the connection between weak and strong formulations [27](Ⅳ.6). Under nonlinear Neumann boundary conditions [29] for NMAT=1 one obtains θ∈L2(H1),w∈L2(L2).
2.4. Approximation of solutions to Stefan problem in a single material: review
In this section we review several approaches towards the numerical approximation of (2.2) and the Kirchhoff transformed version (2.6) which combine time-stepping with some spatial discretization such as finite elements or finite differences. We focus on the traditional mesh-based PDE discretization; other approaches include explicit tracking of the free boundary, but without solving for (θ,w) on a mesh; see, e.g., [30] for review. These latter approaches do not apply very well to the simulation at pore-scale, permafrost, or heterogeneous materials.
In Section 2.4.1 we recall and illustrate time stepping schemes. Next in Section 2.4.2 we review spatial discretization approaches focusing on piecewise linear approximations uh∈Vh⊂V; the methods differ by how wh is approximated and how (ST)ϵ is selected.
2.4.1. Time stepping for an ODE
Consider some f=f(t), and A>0 in
We illustrate several time discrete schemes which approximate the relationship of (θ,w) either by regularization, Chernoff formula, or by relaxation.
The backward Euler scheme is
and the solution for every n≥1 is guaranteed from Lemma 7.1; in practice, we use Newton's iteration substituting θn=β(wn).
The solution to (2.15) can be also computed using so-called Chernoff formula [4]
where μ is the relaxation parameter, a constant approximation to 1β′, which satisfies 0<μ≤L−1β, with Lβ from (2.10). The Chernoff formula offers a way to linearize the non-linear relationship β. Chernoff formula does not require any nonlinear iteration, since it solves a linear problem for θ followed by an update of wn. However, it produces consistency errors growing with μ−1.
We also consider the phase-relaxation approach: we replace the equilibrium relationship (2.8) w=cθ+Lχ=cθ+L2(ϕ+1) in which ϕ∈sgn(θ) by a coupled problem which allows ϕ=ϕ(t) to evolve towards this equilibrium written as sgn−1(ϕ)∋θ. [27](Ⅴ.1). This relaxation is another form of approximation to the equilibrium problem; see [31] for the PDE (2.2). In a more general phase-field approach, one can consider an additional dissipative term similar to Aϕ, plus a coarsening term proportional to ϕ which acts to counteract Aϕ; these together moderate the evolution of ϕ coupled to the PDE (2.2).
Let ε>0 and γ>0. The phase-relaxation approach to (2.15) is given as
Here g is one of two possible choices which approximate sgn−1(⋅) and contain the linear destabilisation term. We consider g∘(ϕ)=ϕ3−ϕ similar to that in Allen-Cahn equation, [32] which we call "smooth". The other is g(ϕ)=g◻(ϕ)=sgn−1(ϕ)−ϕ which we call "non-smooth" [27](Section Ⅵ.5) [33,34]. We discretize (2.18) fully implicitly and solve by Newton's iteration
Example 2.1. We solve (2.15) with α=αST given by (2.8) with parameters in Table 2 (W). We choose A=10−2,winit=−1, and a forcing term f(t)=sin(πt3600) for t∈[0,3600]. We derive the exact solution given in Section 7.1.
In Figure 2 we plot numerical solutions using τ=40 and one of the three schemes: fully implicit, Chernoff formula, and phase relaxation with (2.18). For Chernoff formula, we choose μ=1.5 so that μ≤L−1β=cs=1.90. For phase relaxation, the key challenge here is a choice of the parameters ε=1/10, γ=1, so that the time-relaxed dynamics resembles that of (2.15).
In the end we see that the fully implicit solution (2.16) trails the exact solutions θ(t) and w(t) fairly well. The Chernoff formula and phase relaxation approaches seem to have less close agreement, especially in w(t). While they offer other advantages, this experiment informs our subsequent choice to focus on the fully implicit time stepping.
2.4.2. Spatial approximations
Early approaches to numerical solution to (ST) include the nodal finite difference approach in [42] for which convergence (but no specific order) is proven, independently of ϵ in the regularization (ST)ϵ of (ST) problem. In [26] some time error analysis is provided.
The majority of rigorous work is on nodal piecewise linear approximations Vh∋uh≈u∈V in (2.14); we call these P1-based. The approaches differ in time stepping (as in Section 2.4.1), in how the original problem (ST) is approximated by some (ST)ϵ, and in how wnh is defined. In particular, [43] prove L2(L2) order of convergence close to O(h) for fully implicit approximations for (ST)ϵ, when ϵ=O(h2),τ=O(h2) but require ∂twϵ to be bounded, and refer actually to simulations with the (P) problem discussed in [7] instead of (ST)ϵ. In turn, [4,5,44] approximate solutions for some (ST)ϵ rather than (ST); [4,5] make use of the Chernoff formula similar to (2.17), while [44] and [31] use phase relaxation; these are closely related as shown in Figure 2.
The main difference between the individual approaches is in the treatment of spatial integrals ∫Ωwψ and ∫Ωuψ with ψ∈Vh, and in adjustments to how wh is found. In some schemes the numerical integration, or one of projection operators such as P0h,P1h,Πh are used. In some, piecewise constants and wnh∈Mh are used, this is similar to our P0-P0 schemes to be defined in Section 3.
The theoretically estimated convergence error depends, as usual, on h,τ and ϵ. Generally, θ is predicted to be approximated well, qualitatively and quantitatively, by all the P1-based schemes, with the order about O(h), in the weaker norms. However, the errors w−wh are, as expected, higher, and wh appears "smeared" near the free boundary St.
We set up detailed experiments and provide illustrations as part of our subsequent studies of P0-P0 schemes for (ST) problem in comparison to P1-based schemes. Our tests given in detail in Section 7.3 along with fine details on the schemes show somewhat better rates than those predicted and tested in [26,31,44]. Of the schemes studied, [5] produces the best approximation and convergence rates. We acknowledge the limitations of our study only in d=1, but believe these provide good starting point for subsequent comparisons with P0-P0 schemes.
3.
Approximation to Stefan problem using P0-P0 finite elements and CCFD for a single material
In this section we propose P0-P0 spatial discretization for (1.1) combined with any one of (1.2) combined with fully implicit in time scheme. We start with a mixed finite element formulation for (1.1), using M=L2(Ω) for scalar unknowns such as θ and w, and q∈X=Hdiv(Ω) which features continuous normal components across any smooth surface. Next we choose P0-P0 approximations θh,wh∈Mh⊂M. We also seek fluxes qh∈Xh=RT[0] on a rectangular grid as in [22,45]. The pair (Xh,Mh) is a stable pair for the Darcy problem satisfying the Banach-Neĉas-Babuŝka conditions [46]. For linear problems they approximate the fluxes and scalar unknowns to the same order O(h), with superconvergence for smooth solutions and some norms [45,46,47,48].
Remark 3.1. For nonlinear relationship represented by (1.2) we encounter here the major challenge. The choice X=Hdiv works for the (ST)ϵ and (P) problems when α(⋅) in (1.2b) and (1.2c) is single-valued. However, in the Stefan problem (ST) with (1.2a), the fluxes q∉Hdiv(Ω) since their normal takes a jump across St, which ties to the multivalued character of α(⋅). This raises concerns on the approximability of q∉X by qh∈Xh. We acknowledge this difficulty and formally develop the theory only for the single-valued α such as for (ST)ϵ and (P) problems, but we extend the algorithms in Xh×Mh to the (ST) problem. We defer further study to future considerations.
The P0-P0 algorithm has several attractive features. 1) First, the normal fluxes of qh∈Xh are continuous, which leads to conservative schemes across element and material interfaces; to support this, we work in the (θ,w) formulation instead with Kirchhoff variables (u,w). 2) Second, if θh is piecewise constant, it is natural to define wh=α(θh)∈Mh in a consistent fashion so that (2.8) is enforced at every degree of freedom. 3) Third, the P0-P0 equivalent to the mixed framework features approximation properties known from the literature; in particular, the results in [22] are most relevant for the present nonlinear case of (ST)ϵ and (P) problems. In fact, we demonstrate that the convergence in (θ,w) is not inferior to that for P1-based schemes from Section 2.4 even for (ST) problem; this is done in Section 3.4. 4) Last, the approximations, up to quadrature, are equivalent to a CCFD scheme for θh∈Mh from which the fluxes qh∈Xh follow post-processing; these features, recalled in Section 3.2, make implementation easy and allow its extensions to more complex nonlinear problems and multiple materials.
Below we first set-up the notation and recall main results on the mixed finite element discretization leading to P0-P0 algorithm. For simplicity of notation we consider Ω⊂Rd,d=2 and assume that it is well covered by a rectangular grid Th=(ωij)ij so that ¯Ω=⋃ijωij, with maxij|ωij|=maxijhx,ihy,j=h2. Each cell ωij∈Th has a center at some (xij,yij) and edges γi−1/2,j,γi,j−1/2,γi+1/2,j,γi,j+1/2, when listed clockwise from the left edge. Throughout this section we use Assumption 1.2.
3.1. P0-P0 scheme from discrete mixed formulation for linear parabolic problems
We recall the mixed formulation for the linear case of (1.1) and (1.2b) with L=0 but allowing k=k(x),c=c(x), with some given initial condition w(x,0)=winit(x). We have then k−1q=−∇θ;∂tw+∇⋅q=f;w=α(θ)=cθ. Its weak formulation follows after we multiply each of the equations in (1.1) by test functions ψ∈X, η∈M, integrate by parts, respectively, and apply the boundary conditions. We seek (q,θ)∈X×M which satisfy
(In these equations the symbols (a,b) mean inner product ∫Ωab in L2(Ω) as given in Section 1.1). The approximations qnh,θnh with wh=α(θh) are formulated after (3.1) is discretized in time, and when at each time step tn we seek (qnh,θnh)∈Xh×Mh which satisfy a system similar to (3.1). We make these precise now.
We consider the well known spaces (Xh×Mh) built on Th with Xh=RT[0], the lowest order Raviart-Thomas space on rectangles [22,45,48]. The space Mh contains piecewise constants on Th; the basis functions spanning Mh are simply 1ωij, and θh|ωij=Θij associated with the cell centers of each ωij. The vector valued functions in Xh are tensor products of piecewise linears in one coordinate with piecewise constants in the other. In particular, (qh)1 is identified by their edge values at the left and right edges (i±1/2,j) so we have, e.g., (qh)1|γi+1/2,j=qi+1/2,j; analogously (qh)2 is identified by values at the bottom and top edges i,j±1/2, respectively, (qh)2|γi,j−1/2=qi,j−1/2. The basis functions for the vector valued functions in Xh are ψi±1/2,j for (qh)1 and ψi,j±1/2 for (qh)2. Let (Qn,Θn) denote the degrees of freedom for qnh,θnh in their bases.
We define the fully implicit approximations (qnh,θnh)∈Xh×Mh to (3.1) as those that satisfy
We applied here numerical integration to the first integral in (3.2a) and replaced (k−1qnh,ψ) by its approximation (k−1qnh,ψ)h. Specifically, the following numerical integration is used: on every ωij=(xi−1/2,xi+1/2)×(xj−1/2,xj+1/2), the trapezoidal (T) scheme over (xi−1/2,xi+1/2) and (M) midpoint scheme on (xj−1/2,xj+1/2) for the first component of qh⋅ψh, and M×T scheme for the second component. This leads to some useful simplifications, very well known and described in [49]. In particular, (k−1qnh,ψ)h gives KQ with a diagonal matrix K of positive edge factors Ti±1/2,j and Ti,j±1/2 involving k−1; see Section 7.5 for more details. We also get (∇⋅qnh,η)=−BQn while −(θnh,∇⋅ψ) becomes BTΘn. The linear system reads
with Gn=Fn+1τWn−1. Note that C is the diagonal matrix of positive coefficients c|ωij.
Remark 3.2. After we multiply the second equation in (3.3) by (−1), the system (3.3) has a saddle-point structure. Thus it is well-posed, i.e., the operator N=[KBTB−C] is an isomorphism [45](Prop.3.3.1 and Thm 3.6.2), because K is coercive (positive definite) on the kernel of B, C is positive definite. and B is surjective from Xh→Mh.
Remark 3.3. The system (3.3) can be easily modified to account for non-homogeneous Dirichlet boundary conditions; see Section 7.5 for details.
Connection to CCFD. Since K is diagonal, every degree of freedom of Qn has an easy discrete interpretation, thus one can eliminate Qn, and (3.3) is equivalent to
This system is known as the cell-centered finite difference (CCFD) formulation. Now BK−1BT is symmetric and at least nonnegative definite for Neumann boundary conditions, and positive definite for Dirichlet conditions. Since C is positive definite, we have a unique solution Θ from which Q follows.
3.1.1. Literature notes
For linear problems such as (3.1), mixed FE solution (qh,θh) features optimal first order convergence of the errors ||q−qh||X and ||θ−θh||M for the choice of RT[0]×Mh [46]. For Darcy and potential flow problems the quadrature error is lower order, and the mixed approach provides formal interpretation of the CCFD algorithm [49]. For parabolic problems under Neumann boundary conditions and strong assumptions on the smoothness of θ and q, [48] shows that
This order, is, in general, not featured for nonlinear problems such as (1.1).
3.2. P0-P0 scheme for nonlinear heat equation with single-valued nonlinearity
Now we consider (1.1) or Kirchhoff-transformed problem (2.6) with (1.2b) or (1.2c) in the mixed formulation. We seek (q,θ) or (q,u)∈X×M and replace w=cθ by w=α(θ) or w=αK(u) in (3.1b) to get the weak mixed and discrete mixed formulations similar to that for (3.1).
3.2.1. Literature for nonlinear problems in mixed form
The challenge, well described in [22] is that one must respect the regularity of the unknowns (q,θ,w) which is usually inferior to that for the linear case when ∂tw∈L2(Ω). In particular, when βK(⋅) has derivative vanishing pointwise, it may happen that ∂tw∉L2(Ω). This difficulty is partially overcome with a Kirchhoff transformation and upon integration in time, and/or discretization in time; we refer to e.g., [22,46,50] for thorough discussion. The approach of taking finite differences in time is also known as the Rothe or Crandall-Ligget or Hille-Yosida framework [25,29].
For nonlinear problems which are Kirchhoff-transformed the mixed framework is set-up for the solutions (wnh,unh,qnh) in [22,23,51]. However, the error estimates depend on the smoothness of w and u. Disregarding the error in the fluxes, these results, when applied to (1.1) state that, as in e.g., [22]
with C,¯C↓0 as h↓0 depending on the smoothness of u, but with the order not specified directly. The work [22] does not directly address existence and uniqueness of the solutions. Overall, the results in [22,23,51] are well-suited for problems such as Richards equation, with α which features somewhat different challenges than those for (ST), (ST)ϵ, (P) problems listed in Assumption 1.3.
3.2.2. P0-P0 algorithm for nonlinear single-valued function
We extend now (3.3) to the nonlinear case when k=k(θ) and w=α(θ), working with (θ,w) as scalar unknowns. We define some approximations ˜cn≈c(θn) and ˜kn≈k(θn), and take some ˜α≈α; we make these precise in Section 3.2.3. The fully discrete problem reads
This nonlinear system uses ˜K based on ˜k. We prove first that the system has a unique solution; this is needed since Remark 3.2 does not apply to this nonlinear case. We complete details on the algorithm in Section 3.2.3.
Lemma 3.1. Let Assumption 1.1 hold on ˜k, and let ˜K and B be computed as in Section 3.1; in particular, let ˜K be positive definite. Then there exists a unique solution (Q,Θ,W) to (3.6) and its generalization
Proof. We discuss only (3.7) since the result for (3.6) follows as its special single-valued case. We proceed in one of two alternative ways which are each worthwhile discussing. One is that we rewrite (3.7) eliminating Q as in (3.8)
We set A=B˜K−1BT which is at least nonnegative definite as discussed earlier. The system is as in Lemma 7.1, thus the existence of a unique solution (Θ,W) follows, with Q found by postprocessing from (3.6a).
Yet another proof follows ideas in [50](Thm 3.2), and is worthwhile mentioning because it extends to the abstract weak formulation of (1.1) in (X,M) for the case when q∈X. We note that the system (3.7) can be written as M([Q,Θ]T)=[0,G]T; the nonlinear operator M a sum of diagonal matrix of maximal monotone operators [˜K00∂Φα] (we define Φα as in Lemma 7.1), and the accretive linear thus Lipschitz operator [0BT−B0]. This means that M is maximal monotone, and that there exists a unique solution to (3.7).
3.2.3. Details
Now we need to define ˜k,˜c and ˜α. These are needed since we extend (3.3) to the nonlinear case when k=k(θ) and w=α(θ). Typically, we assign these cell-wise based on the Θ=(Θij)ij≡θh
(The value k∗,c∗ can be one of many including kl+ks2 and cl+cs2, respectively [27](Ⅳ.4.1)). Also,
(In practice, the set Ωh0 is empty). From these the formula for ˜α follows by (2.8).
Lastly, we need to make precise which Θ we use in (3.10). When entering a new time step tn, we have the previous time step value Θn−1. When iterating on (3.6), in iteration m, we have Θn,(m−1) to denote the iteration-lagged value, while we seek the new Θn,(m). We must therefore make precise in (3.10) whether it depends on the old Θn−1, or on iteration-lagged value Θn,(m−1). Adopting the notation for the sets in (3.10) from that for Θ we get, e.g., Ωh,n−1l or Ωh,n,(m−1)l. In other words, we can calculate ˜kij from (3.9) as either kn−1ij or kn,(m−1)ij. These choices give the matrix ˜K=Kn−1 or ˜K=Kn,(m−1), respectively. Thanks to Assumption 1.1 these have positive entries; see also Section 7.5. Therefore Lemma 3.1 applies to (3.6).
Remark 3.4. The sets in (3.10) are not the same as
If neither of Ωl(t),Ωs(t) is empty, Ω∖(¯Ωhl(tn)∪¯Ωhs(tn))≠∅, as illustrated in Figure 3.
3.2.4. Nonlinear solver
Next we discuss the nonlinear solver for (3.7). The solution to (3.7) exists and is unique according to Lemma 3.1, but the proof makes a reference to some (iterative) optimisation algorithm to find the desired minimizer Θn.
In practice, the use of such a minimisation algorithm may be tedious and is unnecessary. Instead, we solve the problem using Newton's method: we rewrite (3.8) from the proof of Lemma 3.1 in residual form using the single-valued inverse β of α.
In each iteration m=1,2,…, given Wn,(m−1), Θn,(m−1) we solve
This scheme means we are solving (3.12) simultaneously for two variables Θn,Wn. However, the second part of (3.12) is diagonal, thus we can, instead, eliminate Θn,(m) and solve the problem in terms of Wn,(m).
It is known that Newton's iteration is not guaranteed to converge for an arbitrary initial guess even for smooth F=(F1,F2). Now the nonlinear function β in (3.12) is only piecewise differentiable, but such case is covered by the theory and practice of semi-smooth Newton methods in [52]. In our tests the Newton solver is robust and essentially grid-independent as long as the time step τ is not too large. We discuss performance of this iteration in Section 3.4.
3.3. Addressing multivalued (ST) problem with P0-P0
As we showed in Section 3.2, there is no difficulty formulating P0-P0 algorithm and solving the fully discrete case of (1.1) with multi-valued α (1.2a).
In fact, given Θn we get Qn from (3.6a); this gives qnh∈Xh. However, the true q∉Hdiv. Thus any attempts to quantify the approximation error for qh−q must take into account this important discrepancy expressed already in Remark 3.1. Thus there is a question whether the use of qh∈Xh is appropriate for the (ST) problem. This challenge is somewhat more complex than the pointwise degeneracy with β′=0 pointwise handled, e.g., in [22], which still keeps q∈X.
At this time we see various avenues to address this challenge. One is to solve (ST)ϵ formally, and create a sequence qϵh∈Xh for a collection of ϵ>0 adjusted to h, and defining ˜qnh as their limit. The fluxes qϵh would be reasonably accurate approximations to qϵ∈X, and their limit ˜qnh to q∉X. One other is to find some approximation ~qnh to q by postprocessing qnh. One can also consider projecting q to Xh. We defer further analyses of possible improvements to q−qnh to future work.
3.4. Results of P0-P0 algorithm
Now we present examples and study the convergence of P0-P0 approximation for (ST) problem. We focus on the scalar unknowns (θ,w), and defer the study of the error q−qh to future work. We study the approximation error θerr=θ−θh and werr=w−wh. We choose only those norms that are easy to use when analytical solution is not available, and are easy to compare to the theoretical results on P1-based schemes. For completeness, the definitions of these norms are given in Section 7.4.1.
In convergence studies we use uniform spatial and temporal discretization. We also note that when using fine grid, the error rates, especially those for w, are sensitive to interpolation and machine precision; thus, some care in grid refinement is needed to obtain consistent rates.
The goal now is to compare the performance of our P0-P0 scheme with the P1-based schemes from Section 2.4. These results provide confidence in our work on multiple materials, as well as guide the choice of time step τ depending on h. We use two test cases which we call (RBC) from [26] and another called (VV) from [44]. These examples feature an assumed free boundary S moving with some given prescribed velocity, winit and time-dependent Dirichlet boundary conditions found from the exact solution. For illustration, the plots of the solutions and their approximations are given in Figure 4 and Figure 5.
Example 3.1. (VV) This example from [44] is not connected to any particular physical scenario, but results in very simple mathematical calculations. Let Ω=(0,0.4) and T=0.2, f=0 and L=c=k=1. Note that since the data is not physical, no particular units are used, even if our code assumes units such as those in Table 1. We have the free boundary for (2.2) S:ψ(x,t)=0, with ψ(x,t)=−x+t+0.1, and
In experiments we stop the simulation at T=0.2, a time at which the free boundary position is still inside Ω. This choice helps to analyse how well the free boundary is approximated by the different numerical schemes.
Example 3.2. (RBC) example uses realistic physically meaningful data from [26]. We have Ω=(0,20)[cm], L=306[J cm−3], cs=1.90 and cl=4.19 [J cm−3 ∘C −1], ks=0.023 and kl=0.0058 [J sec−1 cm−1 ∘C −1]. The free boundary s(t)=s0+vst, and
where αl=vscl/kl and αs=vscs/ks, and data as in Table 2. We also use the parameters in Table 3.
3.4.1. Convergence tests
Convergence results for P0-P0 algorithm and (VV) example are given in Table 4, with τ=h/10. We also provide the fine grid-study with hfine=6.4×10−5 in Table 5; these results are very similar to those in Table 4, thus they validate our process for using θhfine as a proxy for θ.
The convergence results for (RBC) using the fine grid solution and exact solution are shown in Tables 6 and 7 respectively. We also tabulate the comparison to P1-based methods in Section 7.4.2 in Table 13.
Summary of convergence of P0-P0 schemes: Generally, we see that our P0-P0 schemes converge roughly with first order in θ and half order in w. These rates are similar to those for P1-P0 schemes from Section 2.4. However, our P0-P0 schemes seem to improve on P1-based schemes qualitatively and quantitatively; in particular, we see improvement in the quality of approximations to the enthalpy, which seems due to the lack of consistency errors such as those for Chernoff formulas or phase relaxation.
We acknowledge that this might be due to only testing in d = 1; nevertheless, these results are promising.
3.5. Robustness of nonlinear solver
Last but not least we discuss performance of the nonlinear solver for (3.7) since it works without regularization for (ST) problem, and we have not found a discussion for (ST) problem in the literature.
The maximum number of iterations required for each case to converge is listed in Table 8. The solver uses for stopping criterium a combination of absolute tolerance of 10−12 and a relative tolerance of 10−6 and τ=h/10, relative to the first iteration.
Overall, the solver performs reliably under a variety of circumstances, including with uniform coefficients used in Example 3.1 (VV), realistic thermal properties used in Example 3.2 (RBC) and in the presence of multiple materials in Example 4.2 to be discussed below. It is also important that the performance seems to be mesh independent.
Though not illustrated in Table 8, we see in practice that the number of Newton iterations may increase with larger discrepancies between parameters and larger latent heat L values. We add the notes on the specifics of the heterogeneous permafrost case in Section 5.
3.6. Summary
More work on the theoretical underpinnings of P0-P0 algorithm is needed for Stefan problem, but overall our P0-P0 (CCFD) algorithm seems well suited to all the choices of α in (1.2) including even the most challenging case of αST(⋅).
4.
Stefan problem with heterogeneity, and its P0-P0 approximation allowing for thermal resistivity of interface
We consider now the main challenge addressed in this paper, that of simulation of phase change problem in a heterogeneous domain, a generalization of (2.2) to the case when the region Ω is occupied by NMAT materials, each in Ω(j),j=1,…NMAT, with interfaces denoted by Γij=∂Ω(i)∩∂Ω(j), and Γ=⋃ijΓij. We also have Σ=Γ×(0,T). See Figures 1 and 6 for illustration. The data corresponding to material (j) are denoted by c(j)l,c(j)s,k(j)l,k(j)s,θ(j)fr,L(j). With these we formulate α(j) as in (2.8), and assume that the data satisfies Assumption 1.1, and that Γ is at least as smooth as ∂Ω. We denote the appropriate functional spaces local to Ω(j) with superscript (j), e.g., in X(j), and so on.
Let (2.2) hold in each Ω(j). Denote by w(j),θ(j) the restriction of w,θ to Ω(j), so we have
This problem requires some initial conditions and some boundary conditions on ∂Ω.
We also need some interface conditions on each Γij. For these, it is natural to assume (i) continuity of fluxes, and (ii) of the temperatures across each Γij. However, in some applications including PCM, the condition (ii) must be relaxed to model the heat conduction across a very thin region of very low heat conductivity. When the width of that layer is very small compared to the width of the regions surrounding this layer, and if the interface region is approximated by a lower dimensional interface Γij, the temperature appears to take a jump, since the limits of temperature from both sides of Γij are quite distinct, while the flux is preserved [37]. See Figure 6 for illustration.
This jump condition is formulated, e.g., in [29,53,54]
where ρR≥0 is called "thermal resisitivity" and where the orientation of the unit normal ν to Γij is from Qi to Qj. (More generally, one can consider ρR to be specific to an interface).
If ρR=0 or the fluxes q(j) vanish, we have continuity of temperatures θ across Γij (but not necessarily of u). If ρR>0, the condition (4.1c) is a Robin condition which slows down the process of reaching thermal equilibrium across the interface. This latter case is important for applications. However, its mathematical and computational challenges have not been well studied. We address some of the challenges that (4.1) brings from theoretical and algorithmic point of view when ρR>0.
The mixed formulation is particularly convenient for problems involving interfaces and multiple materials. In particular, the condition (4.1b) prescribes the continuity of normal fluxes across Γ which is naturally preserved for the fluxes q∈X. The condition also prescribes a possible jump of the scalar unknowns, but this is not an issue when extending θ∈M. Therefore, if the problems on each Ω(j) are well posed in X(j)×M(j) for each j, then the global problem can be well studied in X×M; see, e.g., [55,56]. Similarly, one can easily define the P0-P0 (CCFD) algorithms on each subdomain coupled to the discrete counterparts of (4.1b)–(4.1c).
As concerns the overall solution algorithm based on CCFD for (4.1a), in principle, some domain decomposition approach iteration is required to satisfy (4.1b)–(4.1c). However, one of our contributions in this paper is that we are able to formulate a monolithic P0-P0 algorithm on Ω. We also formulate various theoretical results and estimates for qh∈Xh. We note however that even though we do not state approximation properties in this paper, these may involve θ considered in some broken spaces such as H(Ω)=H1(Ω(1))×H1(Ω(2)) rather than H1(Ω) due to the discontinuity of θ across Γ.
After some literature review in Section 4.1, we formulate and illustrate our algorithms in Sections 4.2 for linear and nonlinear problems when q∈X. We extend these to (ST) problem and apply the same scheme, even if some of the theoretical results do not apply when q∉X. We provide examples and study convergence in Section 4.3.2.
4.1. Literature on heterogeneous Stefan pbm
The mathematical literature on (4.1) is not abundant, but we review what is available. The problem is a special case of more general heterogeneous nonlinear parabolic problem in which
When the data (4.2) vary smoothly as functions of its first argument, it is possible to apply Kirchhoff transformation on Ω but this introduces various lower order terms in the PDE. This approach considered, e.g., in [57,58], allows some well-posedness analysis.
Another class of approaches in [59,60,61] study a problem similar to Stefan problem via minimization of a collection of normal convex integrands, and assumes smoothness of α(x;θ) in x, for an application with phase change of a different type where the multivalued graph α(x;⋅) representing the solubility constraint depends smoothly on x. At the same time, the challenges discussed here are similar to those revealed by simulations in [61] when applied to piecewise constant data.
The piecewise constant case (1.3) of interest in this paper does not allow Kirchhoff transformation but is also most realistic. The well-posedness of the case ρR>0 is studied in the primal setting in [29] under external Neumann boundary conditions on ∂Ω and with some initial data. Existence of solutions is proven for ρR>0, with estimates which allow the limiting case ρR↓0. Uniqueness is also proven [29]. For ρR>0, the paper predicts w∈L∞(Q), θ∈L2(H), and that the jump is
We note that as as ρR↓0, the estimates (4.3) suggest continuity of the temperature across Γ but also predict the blowup of ∂tw.
As concerns numerical approximation, [54] describes the approximation in (V(j)h)j similar to the P1-P1 schemes described in Section 2.4. A-priori bounds for (θnh,wnh) including those similar to (4.3) are proved in [53] for ρR>0. Finally, a time discrete approximation to ∂twh is measured in some seminorms which involve h2τ2, 1√ρR; these suggest to use hτ bounded as h→0,τ→0, and ρR such that τ2ρR→0. Based on these a-priori bounds, convergence to the solution corresponding to ρR=0 is established. However, no rate of convergence for the approximation errors θ−θh is given.
Last but not least we mention other problems and applications which feature jumps of the primary unknowns and fluxes. These works fit in the framework of heterogeneous domain decomposition [56] and inform our results, but are not closely related. The closest is our prior work on the domain decomposition approach for semiconductor modeling of heterogeneous junctions in [62,63,64]. Further, there is closely related substantial work on multiphase flow and multiple rock types in, e.g., [1,2] as well as the abundant literature on Beavers-Joseph conditions on the Stokes-Darcy interfaces, e.g. in [65] and other heterogenous domain decomposition settings.
4.2. P0-P0 scheme
We approximate (4.1) with P0-P0 discrete scheme from Section 3. We recall that even though the formal approximation with X∋q≈qh∈Xh is appropriate only for the single-valued setting w=α(θ) (as in Remark 3.1), the P0-P0 algorithm produces discrete conservative fluxes qh∈Xh also for the multivalued case w∈α(θ).
We start with P0-P0 (or CCFD) algorithm on each material domain, a counterpart of (3.7), which we couple with discrete counterparts of (4.1b)–(4.1c). Then we show that this domain decomposition formulation can be handled by a monolithic solver. With this, the theoretical results including convergence analyses available for single domain formulations in X(j)h×M(j)h extend to that on Xh×Mh (as long as q∈X). Moreover, the problem does not require any iteration on the interface.
We explain the details first for the linear heat equation with the notation from Section 3.2; the generalization to the nonlinear problem is straightforward. For simplicity, we focus on only two materials NMAT=2 which occupy the domains Ω(1),Ω(2) separated by an interface Γ=Γ12.
4.2.1. P0-P0 formulation with mixed finite elements
We rewrite (4.1) in a time-discrete mixed form, with subdomain problems (4.4a)–(4.4b) to be solved by (θn,(j),qn,(j))j. We also make precise the initial, external, and coupling boundary conditions (4.4c) and coupling via (4.4d)
We approximate (4.4) with P0-P0 algorithm for (4.4a)–(4.4c) on each Ω(j) as in Section 3, each implemented as CCFD, and corresponding to a block linear system similar to (3.3) is solved, with off-diagonal coupling terms expressing (4.4d).
This coupling could be solved by iteration; we refer to [56] for a discussion of domain decomposition iterative algorithms. In contrast, we propose to satisfy (4.4d) without an iteration which simplifies the implementation substantially. We explain the ideas briefly for d=1 in Section 4.2.2.
4.2.2. Monolithic scheme
Let Ω=(a,b) and Ω(1)=(a,x∗) and Ω(2)=(x∗,b), with a grid (ωi)i covering Ω(1)∪Ω(2) so that Γ=x∗ is one of the cell edges. With global numbering of the cells ωi in Ω, we have i=1,…j∗,j∗+1,…M, where Γ=x∗=xi∗+1/2=γi∗+1/2 is between some cells ωi∗ and ωi∗+1. Now the problem is discretized as in Section 3.2 with q(1)h and q(2)h identified by the edge values Q(1)=(Q1/2,Q3/2,…Q−i∗+1/2) and Q(2)=(Q+i∗+1/2,Qi∗+3/2,…QM+1/2), respectively. We note the double value of the flux Q−i∗+1/2,Q+i∗+1/2 to be set equal due to (4.4d). The temperatures θ(1)h∈M(1)h and θ(2)h∈M(2)h, respectively, are identified by the cell values Θ(1)=(Θ1,Θ2,…Θi∗), and Θ(2)=(Θi∗+1…ΘM). Finally, the Dirichlet conditions at the external boundaries are with Θ∗1/2=θD(x1/2), and Θ∗M+1/2=θD(xM+1/2). These enter (3.3) as in Remark 3.3.
The approximation to the first part of the interface condition (4.4d) equates the fluxes Q−i∗+1/2=Q+i∗+1/2. We also require Dirichlet values Θ−∗ and Θ∗+ to be used by each subdomain problem at x∗ so that the second part of (4.4d) holds. Finding these is part of the problem. In summary, the approximation to (4.4) is as follows.
where we find and from the solutions on and defined as follows.
The solution to (4.5) requires formulation of appropriate matrices , to solve (4.5b) and (4.5c), respectively. These come from the transmissivities on every and are given as in Section 7.5 by (7.11). We also have two boundary edge factors and given by (7.13) independently on and , respectively.
The problem (4.5) could be solved by iteration. However, our idea is that (4.5) can be solved by a much simpler monolithic CCFD solver on for and for which . The system takes the form
The definition of is a straightforward extension of , but involves as well as an appropriately chosen transmissibility to guarantee (4.4d).
Lemma 4.1. Solution to (4.5) exists and is unique; it is equivalent to that of (4.6) provided
where we use the shorthand notation and .
Proof. The proof is purely algebraic. Assume (4.5) is satisfied. Denote , , so (4.5a) holds for some , . We know also from (4.5b) and (4.5c).
1. Recall and . Setting these equal from the first condition in (4.5a), and expressing from the second part of (4.5a), we work to eliminate and get a formula for in terms of and . After some lengthy calculations we get
2. Next we recalculate . After some algebra we get
3. Now the expression on the right hand side can be written as
This provides the definition of (4.7), i.e., of for the monolithic formulation. Existence and uniqueness of the solutions to (4.6) follows directly from that for (3.3) with the special definition of which satisfies the same properties as all other .
Remark 4.1. When , (4.7) reduces to give as the usual weighted harmonic average of and in (7.11), and provides the interface value of
In other words, a monolithic CCFD approach on matching grids is equivalent to the domain decomposition approach. This observation is perhaps not new, but is nevertheless very useful.
Corollary 4.1. The results in Lemma 4.1 easily extend to when is aligned with cell interfaces. In addition, these results extend to nonlinear problems such as (3.6) and (3.7) with
Here in each time step and iteration one updates the transmissibilities based on the current guess of . This system uses special interface transmissivities involving resulting in the dependence of matrix on .
Remark 4.2. Convergence of the solutions to for the linear problem is expected to be qualitatively similar to the case without jump, since, upon Lemma 4.1, it follows from those for CCFD on each domain, e.g., in [48]. At the same time, the solution is not globally smooth enough, thus broken norms must be used when referring to the approximation error's dependence on higher order norms.
We provide an example with a numerical approximation in Section 4.3. We also prove theoretical estimates on the jump .
4.2.3. Estimates of the jump on the interface
We aim to derive results similar to (4.3) in the mixed setting. We write the fully discrete mixed form of (4.4), and work with the discrete counterparts of , with . As explained for (4.6), we use which can be , or or , formed, respectively, either in a fully implicit, time-lagging or iteration lagging fashion.
Lemma 4.2. Let . Then the numerical solution of (4.1) satisfies, at every
Proof. We suppress and . The weak mixed formulation of (4.1a) after integrating and adding the equations over reads: find such that for every we have the following
To derive the estimates we now choose and . Adding the two equations and cancelling the skew-symmetric terms we obtain, after rearranging
The right hand side now involves the boundary and the right hand side . We have control over the boundary term since . The first term on the left hand side is nonnegative. The second term can be bounded from below for , as well as for and
In fact, for (ST) problem we have . For (ST), we have , with a similar calculation for from Section 5. Then we apply Cauchy-Schwarz inequality to the integral and follow up with the inequality . With sufficiently large we can now move the term to the left hand side to balance it with . We conclude that thus, upon (4.4d) we finally obtain
from which (4.8) follows.
This result is similar to that predicted in (4.3) [29] in the primal formulation. With more work, we can also cover the case .
Corollary 4.2. The estimate in Lemma 4.2 extends to the solutions for (ST) or (P) problems with single-valued when .
4.3. Numerical examples of P0-P0 for heterogeneous domain with two materials
Now we illustrate the problem (4.1) with simulations using our P0-P0 solver.
4.3.1. Linear heat equation with a jump
We consider first an example in for which we study the dependence of the jump of on .
Example 4.1. Let with some . Let some be given, as well as , , and so that . We also impose boundary conditions , The numerical solutions are computed with , and . We consider different ratios , and .
We plot the solutions to Example 4.1 found numerically with our monolithic P0-P0 solver. We also consider the corresponding stationary limit of (4.1); see the details in (7.15) and (7.14) in Section 7.6.
The numerical solution to the stationary problem in this simple case essentially coincides with the exact solution; this is typical for P0-P0 solution. The non-stationary solutions evolve towards this stationary solution. The qualitative nature is consistent with the imposed interface jump condition.
In the end we also test the scaling of the jump predicted by (4.8) given what we found for the stationary solution in Section 7.6; we check how it behaves over time. The details in Section 7.6 predict that the jump of scales linearly with as , unlike predicted for the evolution problem in (4.3). For large ratio and small we find that the jump behaves similarly to that for the stationary problem. For small , and before the solutions are close to the stationary limit, we see that the size of the jump depends significantly on the grid discretization . This behavior correlates with the large magnitude of when is small. In fact, it also correlates with the estimates of in (4.3) (see also subsequent discussion in Example 4.3). We defer further study of these features to the future.
4.3.2. Examples for Stefan problem for heterogeneous domain with two materials
Now we consider (ST) problem, and start by checking convergence of our P0-P0 scheme for the case of heterogeneous materials. To this end, we modify Example 3.2.
Example 4.2. We let with with parameters given in Table 9. Note that the material within has while has ; the freezing temperature for both.
The initial conditions are which corresponds to for all . Boundary conditions are and for . Figure 8 shows the temperature increase throughout the domain due to the heat transfer through the left boundary.
We do not have an exact solution, thus we must use fine grid solution for convergence studies. The coarse grid solution for , is compared with that for the fine grid, at the first fine grid time step of and the time . In convergence study we disregard the initial time period in which the solution has very steep due to the discrepancy between the initial and boundary data. The convergence of the solution is of lower order during that time and pollutes the rest of the error analysis.
Convergence results, calculated for are shown in Table 10. We see similar rates of convergence for this heterogeneous example as for the homogeneous Example 3.2.
Next we illustrate the need for multiple materials by simulating heat flow at the pore-scale. We proceed to the simulation in at the pore-scale which is motivated by our interest in permafrost. We use our P0-P0 solver with , since there is no interface with high resistivity between the mineral and water components.
Example 4.3. Let with , with water saturated pore space , and the rock grains ; see Figure 9. The material parameters are those of water for and inferred from those for silica for as given in Table 2. We start from thermal equilibrium, with constant temperature with which we calculate .
The heat in increases due to the boundary condition at where we set . We also assume insulated boundary conditions elsewhere for .
In Figure 9 we show temperature profiles of four time steps. The melting front moves from left to right with a phase change in the void space , but the grains do not undergo phase change. The heat flux moves faster through the areas with more grains, since this change requires less energy required for phase change than the heat conduction in the water component.
4.4. Summary
More work on testing the heterogeneous Stefan problem and even the linear heat equation is needed, but overall we believe our monolithic P0-P0 scheme performs well for multiple materials. While not reported here, we tested the simulation cases of different freezing temperatures, and drastically different parameter between domains, and see that the algorithms perform in a robust way across these scenarios, even if a decrease of the time step is needed in the most challenging cases.
5.
Permafrost models
Permafrost is defined as the ground that remains frozen for two or more years [41,66,67]. Temperatures in permafrost respond to the changes in ambient temperatures at the soil-atmosphere interface, possibly including the effects of precipitation and vegetation. In the upper portion of permafrost called active layer the temperatures increase in the summer and decrease in other parts of the year. The bottom of the active layer is isothermal but the depth of this layer is changing due to the increase of ambient temperatures, and this causes the thawing of some portions of permafrost, with further environmental consequences. The importance and impact of coupled phenomena within permafrost models is of current interest. Modeling of the coupled phenomena in permafrost regions is complex, and requires careful conservative approximations across the scales [10,14,15,67,68,69].
5.1. Energy conservation in permafrost
We recall now the (ST) problem discussed in Section 2 for conservation of energy in any volume occupied only by water component in one of two phases: liquid or solid. For any small sub-volume centered at some we can calculate the water fraction . In equilibrium, it equals 1 whenever in the entire . On the other hand, if on , then . In the case , we have , and can be considered as an independent thermodynamic variable at constant volume; see [27]. In other words, the water fraction can be considered a pointwise or a volumetric quantity, and in (ST) problem we assumed (2.3) and .
The model for conservation of energy in permafrost must extend this formulation to account for the presence of rock grains which do not change phase. We assume here that the soil rock grains have a constant heat capacity and a constant thermal conductivity .
The presence of rock grains has several consequences. The first consequence is that in the energy balance we have to account for the rock as a separate material; we discuss this in Section 5.1.2. Second, the presence of rocks affect the local energy landscape at the fluid-rock interfaces; in particular, it causes depression of freezing temperatures in small pores, as well as premelting, the presence of a thin film of water around rock grains [70,71]. Both phenomena have significant effects on the qualitative behavior of phase transitions in permafrost.
In modeling, one must consider the scale at which we wish to consider the phenomena. At the pore-scale of to scale, the rock grains and water occupied domains should be treated as separate materials such as in Example 4.3. In practice, modeling large scale changes in permafrost in response to the temperature in the environment must occur at larger scale such as that of , i.e., at the so-called Darcy scale. Typically, models at Darcy scale take advantage of constitutive relationships measured experimentally in a laboratory. Thanks to modern computational science the pore-scale modeling can be connected to Darcy scale such as in our work and in particular [18,19,20,21], but a thorough discussion is outside our present scope.
5.1.1. Permafrost as a porous medium
We introduce notation which helps to explain the difference and connections between Stefan problem and permafrost models.
As mentioned earlier, porous medium is made of rock grains and non-rock "void space" occupied by fluids such as water and air. In this paper we consider only the water component. The rock grains occupy a fixed portion of , and the water component in occupies the remainder so that . The void (non-rock) proportion is called porosity, and . The water component can be in liquid or solid phases so that , and we have
The quantity is called "unfrozen water content" in permafrost and is determined experimentally for a particular soil type depending on the temperature.
5.1.2. Energy balance in permafrost
The presence of rock grains influences the energy balance. In permafrost the balance of energy must account for the energy content in and , thus the heat capacity is a weighted fraction of that in rock grains and in fluid space,
which can be also written, after rearranging, as
where is the (constant) heat capacity of the "unfrozen soil" and is the (constant) heat capacity of the "frozen soil". In the end, we get [8]
Remark 5.1. We can now compare the formulas for in the classical Stefan problem (2.5) and in the permafrost model (5.4). The former sees the change between "solid" and "liquid", but the latter (5.4) presents the phase change problem between the "frozen soil" and the "unfrozen soil", where the amount of each is controlled by the unfrozen water content .
5.1.3. Experimental models
One of important features of permafrost is the presence of unfrozen water at low temperatures i.e. even when . This phenomenon is not fully explained, and is accompanied by the depression of the freezing temperatures in small pores. The permafrost models take advantage of the empirical data which fits one of the parametric relationships for such as (5.1), where is the unfrozen water content determined experimentally and dependent on soil type. We follow a formulation based on [14]: here is parametrized with some soil-type dependent parameters , as well as with which denotes the residual unfrozen water content at some really low reference temperature
Now is the threshold temperature called freezing point depression such that for , only liquid water is present. Unlike in bulk water, typically . Parameter depends on the soil type. For example, for coarse-grained soils such as sand, the value of is large, but for fine-grained soils such as clay is small. For a particular soil type, we assume to be a constant.
Remark 5.2. A variety of algebraic models for other than (5.5) are given in applications literature; see Table 11 for expressions adapted so that they fit (5.1). The models are qualitatively similar to each other. For a particular expression for plugged in (5.4) they produce which is a monotone increasing injective Lipschitz function. In particular, with (5.5) we get, after some rearranging
For comparison, we plot corresponding to the the different models in Figure 10. Here we use the parameters [16] , , and along with the thermal properties of water as given in Table 2.
We see that each can be seen as an approximation to the monotone graph , and each is a strictly monotone smooth increasing function of , differentiable everywhere except at (as we stated in Assumption 1.3 for ).
5.1.4. Thermal conductivity in permafrost
To complete the permafrost model, we need to define the thermal conductivity of the porous medium which is made of and , with the latter partitioned between and .
It is well known that, unlike capacity given in (5.3), conductivity of composite materials depends not just on the values of , as well as on the fractions , but also on the geometry of how the phases are arranged. Still, some authors consider straightforward weighting with volumetric fractions as in (5.3) which gives, e.g., in [16]
where is the thermal conductivity of the "unfrozen soil" and is the thermal conductivity of the "frozen soil". While (5.7) is a good first order approximation, it does not take into account the geometry of the rock grains; more accurate expressions based on data are considered in [10,14,15] [41](.) Other general approaches can be considered and involve upscaling or homogenization; see, e.g., [18,74]. More work is needed on more accurate expression for .
For the needs of this paper we use (5.7), combined with some selected model for . This completes the permafrost model, once boundary and initial conditions are specified. To construct these for realistic scenarios, care is needed. The former depend on environmental conditions, while the latter must be found carefully since thermal equilibrium might not be always realistic to assume.
5.2. Approximation of permafrost model with P0-P0
We recall now the P0-P0 (equivalent to CCFD) approximation to (1.1) with (1.2) given in Section 3. The function described in Section 5.1.3 features a piecewise smooth nonlinearity, thus fits the general class of problems that Section 3 applies to, and we expect that the scheme P0-P0 will work well.
In fact, the fluxes permafrost problem feature continuous normal components since there is no jump prescribed. Therefore, we expect that is approximated to at least to first order accuracy, similarly to what we encountered for the (harder) (ST) problem.
To put our P0-P0 approach in perspective, we first briefly review relevant literature; see Section 5.2.1. Then we follow up with the test of convergence of P0-P0 in Section 5.2.2 along with examples in Section 5.2.3.
5.2.1. Approaches in literature
The work [6] uses a variational formulation, implicitly discretized in time, to generate a set of nonlinear algebraic equations which are solved using the Newton's method; they test their algorithm on examples; see also discussion in [11]. Next, [8] use a fixed grid finite element method further employing Picard's method to deal with the consequential nonlinearity in the mass and stiffness matrix. In turn, [9] use a node-centered finite difference scheme and solve the nonlinear equations using Newton's methods, and [15] implement their finite element model within the commercially available solver ABAQUS.
We adopt the fully discrete formulation for P0-P0 as given in Section 3 and extended to heterogeneous media in Section 4.
5.2.2. Convergence of P0-P0 scheme for permafrost model
Example 5.1. For convergence studies, we choose an example based on [26]. We consider the scenario as in Example 3.2, but with instead of , and with the following initial and boundary conditions
The soil properties are chosen to be . We run the simulation over a time period of . Since the exact solution is unknown, the convergence rates are calculated using a fine grid solution computed on a mesh with cells and time step .
The results are given in Table 12. We see a modest improvement in the present case with over the rates obtained for in the Stefan problem, with order robustly for and for . Further, we also observe that the order of convergence seems independent of small variations in the soil type, or in the parameter in (5.5). These results are for as in (ST) problem.
Next we attempt which is optimal in linear and mildly nonlinear parabolic problems. We attempt the latter since the graph is much smoother than any of , hence, we hope to have better than first order convergence. However, there is only mild improvement in the convergence rate when using compared to ; specifically, the order of convergence is roughly –. In the end, this improvement may or may not justify the extra computational effort of using .
5.2.3. Simulation of realistic examples
Next we apply our P0-P0 algorithm to physical examples pertaining to permafrost thawing. The goal of these examples is to test the robustness of our scheme and to illustrate the modeling aspects of permafrost. Since most of the interesting dynamics in permafrost occurs due to the varying surface boundary conditions and along the depth, we confine ourselves to the case.
We use the expression (5.5) for the unfrozen water content .
Example 5.2. Homogeneous case. Consider a small column of soil with physical parameters as in Table 2 along with , . We use the following initial and boundary conditions
and run the simulation over a time period of . The solution at different times is shown in Figure 11.
We see the temperature profile trending slowly towards the stationary distribution. When the simulation is run over a time period of at least hours, steady state is achieved in which the temperature qualitatively has an almost linear profile.
Example 5.3. Heterogeneous case. We extend Example 5.2 to a heterogeneous case. We consider be composed of two different soil types: and . Thermal properties of water are as in Table 2, and we use . The difference in the properties of coarse and fine soil is in parameter in (5.5). We choose and . We start with the initial and boundary conditions
and run the simulation over a time period of . The solution at different times is shown in Figure 11.
The difference between Examples 5.2 and 5.3 is apparent from Figure 11. At a prominent jump in enthalpy appears near ; this effect is due to the soil heterogeneity.
Next we aim to simulate the effect of permafrost warming due to variable temperature at the top boundary, and a possible effect of climate warming.
Example 5.4. Homogeneous case with time dependent boundary condition. Let represent a column of soil of porosity , with the top boundary subject to time-varying temperature boundary conditions representing typical environmental change; see, e.g., [41](Figures 3–5) and [75] where the effect of climate warming is considered. At the bottom we apply fixed Dirichlet condition representing fixed temperature below the active layer. For simulation we use thermal properties of water from Table 2, along with , . Our goal is to simulate the temperature and enthalpy profile during the time period of years following an earlier period of time , initiated with
We use the simulated (shown in the top row of Figure 12) as the initial data for the new time period , with
The simulation results are shown in Figure 12. The temperature at a depth and as a function of time is plotted along with the temperature and enthalpy at the end of years. We use the simulation results to get a rough estimate of the active layer thickness to be .
5.3. Solver challenges and open questions
The P0-P0 approximation scheme and the solver behaves similarly for permafrost problem (P) as for (ST) problem. For single material problems, the model is robust and features essentially mesh independent number of iterations under reasonable time steps.
However, simulation in heterogeneous domains pose difficulties. The performance of Newton's method can be rather rugged, and is a (well-known) challenge [76]. For permafrost, the solver performance deteriorates in the case of highly disparate data, and/or and large time steps. Specifically, the former is an issue if there is a big difference, e.g., in the soil parameters (or ) in Section 5.1.
One way to overcome this challenge is to use adaptive time stepping through which the time step is reduced if Newton iteration struggles to converge. Generally, reducing the time step leads to smoother convergence of residuals to zero. For very difficult cases related to high degree of heterogeneity, we regularize the problem by introducing an artificial "intermediate" layer in the region close to the interface in which the soil parameters vary more smoothly. Such a strategy can be used to find an initial guess for Newton step, or be used as the solution for a few initial time steps.
5.4. Summary
More work on the solver and schemes is needed, but overall we believe our P0-P0 scheme performs well for the permafrost problem (P). We see that its behavior is smoother and the challenges are somewhat milder than those for the (ST) problem, except in heterogeneous domains.
6.
Summary and future works
In this paper we consider a collection of models motivated by the applications to the phase change problems dubbed (ST), their (ST) modeling approximation, and the models (P) in permafrost.
For approximation, we consider the technique well known for its conservative properties, mixed finite element method on rectangular elements. Because the solutions are expected to have low regularity, we choose to use only lowest order finite elements, with piecewise constant approximations to the scalar unknowns which we call P0-P0, and which are similar to finite volume or cell-centered finite difference approaches. For time discretization we employ a first order fully implicit scheme. We show that our P0-P0 approach works well for (ST) as well as (ST) and (P) permafrost, and that they compare well to the P1-based approaches that were primarily formulated for (ST). We extend the P0-P0 approach to heterogeneous case of multiple materials, and showed that a monolithic P0-P0 discretization we construct is robust and easy to implement. We set up theoretical framework relating to known literature results, proved several results, and point out the challenges and open questions.
More work is underway. In particular, due to the challenges we acknowledged above, rigorous convergence analysis of P0-P0 is inaccessible at this time, even if it can be inferred in the limit of that for (ST). The main challenge is that that the use of for the heat flux is not directly possible for (ST). At the same time, our algorithms are robust even if they do not address this formal difficulty, but require more work which we defer to the future.
On modeling side, in permafrost, or more generally, in freezing soils, there are additional micro-scale physical effects which contribute to the complex physics of phase change. These include freezing temperature depression, presence of air which acts as an insulator, configuration (geometry) of the pore space, as well as the chemical composition of liquid and mineral phases. While these are not fully explained, we aim to connect the computational models of multi-material pore-scale to the macro-scale experimental models such as those in Section 5.
For (P) models in permafrost, in the paper we neglect convection and other physical phenomena which can potentially contribute to the temperature changes such as flow and mechanical behavior; these, along with other physical and environmental effects including those of radiation, vegetation and snow cover, and more, will be studied in forthcoming paper.
7.
Appendix
7.1. Exact solution for Example 2.1
The exact solution to (2.15) with data as in Example 2.1 is given by
where is when phase change begins, is when all the solid has changed into liquid, and is the initial enthalpy.
7.2. Auxiliary finite dimensional result
We consider a useful ODE system on with a matrix and right hand side
We define the solutions as limits of the solutions to the time-discrete problem
The result below will be used many times in this paper.
Lemma 7.1. Let be maximal monotone and symmetric nonnegative definite. Then there exists a unique solution to (7.2).
Proof. We find as the minimizer of
where we set , with the (strictly convex) primitive of (i.e., , and the subgradient ). The functional is strictly convex due to the convexity of and strict convexity of , and thus it has a unique minimizer , and we set . With known , we find from , which follows from (7.2).
7.3. Detailed comparison of P1-based formulations
Now we provide a detailed comparison of the P1-based formulations recalled in Section 2.4. We do so, for simplicity, for and a grid covering with cells with centers so that and . Other results not reviewed here include grid adaptive schemes in [77].
We recalling that is identified by its nodal values . In turn, in P1-P0 approaches are identified by .
We denote by the projection onto constants , by the projection onto , and by the Ritz projection onto .
P1-P0 approximations: In [4]: one seeks such that
Here we require , since (7.3b) implements the Chernoff formula. Note that the advantage of (7.3) is that (7.3a) is linear in . However, a consistency error arises since and , and
Next, [5] are able to improve the convergence rates by modifying the scheme given by (7.3) through the use of regularization and projection. Their fully discrete regularized scheme reads [5]: find such that
where is given by (7.5), essentially a Yosida approximation to ,
for some .
The work of [44] uses a semi-implicit scheme for phase relaxation coupled with (2.6) given by They seek as an independent variable and solve
where , with being the linear interpolator. We see that (7.6b) is equivalent to the Chernoff formula (7.3b) with . This shows the subtle difference in the scheme by [5] and [44]: in [44] numerical integration is employed whereas in [5] the projection operator is used to obtain .
Remark 7.1. (7.6b) is obtained using the discretization
To account for in (7.7) instead of , the stability condition must be enforced [31].
P1-P1 fully implicit approach in [3] A scheme of a different flavor is explored in [3]. Fully implicit in time formulation of (2.6) is approximated with
This discrete systems is, in practice, similar to (7.2) except it applies to (ST) and pays considerable attention to the identification and the role of in this last equation. While no regularization of is needed, the results are not accompanied by numerical computations. Optimal convergence rates [3](Theorem 2.7) apply to all models (1.2)
where is a constant such that
In particular, for (ST) and (2.8) we have . This corresponds to an order of convergence when . However, [3] do not include numerical implementation details or experiments.
Other P1-P1 schemes include those in relaxation models [31].
7.4. Convergence studies of P1-based and P0-P0 algorithms.
We carry out convergence analyses when the true solution is known, or when only its proxy, a fine grid solution is available. For time dependent problems the use of renders convergence order verification in some norms impractical. Therefore in this paper we only use norms; we also confine ourselves only to the error in scalar unknowns, deferring the discussion of the error in the fluxes to future work.
7.4.1. Error norms
As stated in Introduction, the notation means . We denote the norms appropriately, say as ; all these have to be approximated. In particular, we distinguish the grid norms
We also use a tighter approximation than the grid norm
7.4.2. P1-based schemes
The theoretically estimated convergence error depends, as usual, on and . Of the literature we have reviewed, [26,31,44] provide numerical results supporting their theoretically derived convergence rates. We set up additional experiments and convergence study as a prequel to the study of our P0-P0 schemes for (ST) problem: we use analytical solutions given in [26,44] for . The theoretical convergence orders as well as those we tested are tabulated in Table 13. We compare the [P1-P0] schemes given in [5,44] in Examples 3.1 and 3.2, respectively; see the plots shown in Figure 13. Additionally, we mention the work in [4] which provides a semi-discrete scheme which we further discretize in space following [5]. For this reason, along with using a small regularization parameter in (7.4), the numerical solution following the formulation as in [4] is virtually indistinguishable from [5] in Example 2 and hence is not shown in Figure 13.
All together, we observe that is approximated well, qualitatively and quantitatively, by all the schemes. However, appears to be "smeared" near the interface and features higher errors. Of all the schemes, that in [5] produces the best approximation and convergence rates. The examples we show are consistent with the rates proved in [44]; see Table 13.
7.5. Auxiliary properties
We recall now one especially useful and well known feature of numerical implementation of (3.6) when numerical integration is used to get the entries of matrix . As discussed in [24,49], these follow from the use of trapezoidal (T) and midpoint (M) rules which leads to an algebraic expression involving the so-called transmissivity or transmissibility edge factor , where
is the weighted scaled harmonic average of and so that (see, e.g., [24](eq.(15)))
We emphasize that the use of harmonic averages leads to conservative fluxes; this is in contrast with primal formulations in which arithmetic averages are used, and fluxes are not conservative.
Next, we integrate and obtain, for the total normal flux across and the first component of
The right hand side explains how the entries of matrix arise.
A similar expression is obtained for and using .
The use of factors allows the interpretation of (7.12) as a finite difference analogue of . It also explains why the matrix is diagonal.
Last but not least, (7.11) is done also for the second component of flux across .
Handling Dirichlet boundary conditions. The expression (7.12) is valid for the interior edges away from external boundaries and interfaces. If the edge is on a boundary on which Dirichlet condition is prescribed with some , then the analogue of (7.12) reads
The portion of the expression associated with moves to the right hand side of (3.3), but the structure of the matrix does not change.
7.6. Linear heat equation with a jump condition
We consider the special linear case with piecewise constant coefficients of (4.1) from Example 4
7.6.1. Stationary solution
We also consider the stationary solution to (7.14) which satisfy
It is not difficult to find its analytical solution, with and
In particular in the special case , and we get
From this we have the jump scaling linearly with as , unlike predicted for the evolution problem in (4.3).
Acknowledgements
This work was partially supported by the National Science Foundation DMS-1912938 and DMS-1522734, and by the NSF IRD plan 2019-21 for M. Peszynska.
This material is based upon work supported by and while serving at the National Science Foundation. Any opinion, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.
The authors would like to thank the anonymous reviewers whose remarks helped to improve the paper.
Conflict of interest
The authors declare there is no conflicts of interest.