Heterogeneous Stefan problem and permafrost models with P0-P0 finite elements and fully implicit monolithic solver

Lisa Bigler; Malgorzata Peszynska; Naren Vohra; Lisa Bigler; Malgorzata Peszynska; Naren Vohra

doi:10.3934/era.2022078

Electronic Research Archive

2022, Volume 30, Issue 4: 1477-1531. doi: 10.3934/era.2022078

Previous Article Next Article

Research article Special Issues

Heterogeneous Stefan problem and permafrost models with P0-P0 finite elements and fully implicit monolithic solver

Department of Mathematics, Oregon State University, Corvallis, OR 97331, USA

Academic Editor: Lei Wang

Received: 31 December 2021 Revised: 09 March 2022 Accepted: 13 March 2022 Published: 23 March 2022

We consider heat conduction models with phase change in heterogeneous materials. We are motivated by important applications including heat conduction in permafrost, phase change materials (PCM), and human tissue. We focus on the mathematical and computational challenges associated with the nonlinear and discontinuous character of constitutive relationships related to the presence of free boundaries and material interfaces. We propose a monolithic discretization framework based on lowest order mixed finite elements on rectangular grids well known for its conservative properties. We implement this scheme which we call P0-P0 as cell centered finite differences, and combine with a fully implicit time stepping scheme. We show that our algorithm is robust and compares well to piecewise linear approaches. While various basic theoretical properties of the algorithms are well known, we prove several results for the new heterogeneous framework, and point out challenges and open questions; these include the approximability of fluxes by piecewise continuous linears, while the true flux features a jump. We simulate a variety of scenarios of interest.

Keywords:

Stefan problem,
mixed finite element methods,
permafrost,
free boundary problems,
nonlinear parabolic partial differential equation,
semismooth Newton solver,
heterogeneous coefficients

Citation: Lisa Bigler, Malgorzata Peszynska, Naren Vohra. Heterogeneous Stefan problem and permafrost models with P0-P0 finite elements and fully implicit monolithic solver[J]. Electronic Research Archive, 2022, 30(4): 1477-1531. doi: 10.3934/era.2022078

Related Papers:

[1]	Yujia Chang, Yi Jiang, Rongliang Chen . A parallel domain decomposition algorithm for fluid-structure interaction simulations of the left ventricle with patient-specific shape. Electronic Research Archive, 2022, 30(9): 3377-3396. doi: 10.3934/era.2022172
[2]	Ying Liu, Yanping Chen, Yunqing Huang, Yang Wang . Two-grid method for semiconductor device problem by mixed finite element method and characteristics finite element method. Electronic Research Archive, 2021, 29(1): 1859-1880. doi: 10.3934/era.2020095
[3]	Jun Pan, Yuelong Tang . Two-grid $ H^1 $-Galerkin mixed finite elements combined with $ L1 $ scheme for nonlinear time fractional parabolic equations. Electronic Research Archive, 2023, 31(12): 7207-7223. doi: 10.3934/era.2023365
[4]	Yiyuan Qian, Haiming Song, Xiaoshen Wang, Kai Zhang . Primal-dual active-set method for solving the unilateral pricing problem of American better-of options on two assets. Electronic Research Archive, 2022, 30(1): 90-115. doi: 10.3934/era.2022005
[5]	Noelia Bazarra, José R. Fernández, Ramón Quintanilla . Numerical analysis of a problem in micropolar thermoviscoelasticity. Electronic Research Archive, 2022, 30(2): 683-700. doi: 10.3934/era.2022036
[6]	Yijun Chen, Yaning Xie . A kernel-free boundary integral method for reaction-diffusion equations. Electronic Research Archive, 2025, 33(2): 556-581. doi: 10.3934/era.2025026
[7]	Hongze Zhu, Chenguang Zhou, Nana Sun . A weak Galerkin method for nonlinear stochastic parabolic partial differential equations with additive noise. Electronic Research Archive, 2022, 30(6): 2321-2334. doi: 10.3934/era.2022118
[8]	Wenya Qi, Padmanabhan Seshaiyer, Junping Wang . A four-field mixed finite element method for Biot's consolidation problems. Electronic Research Archive, 2021, 29(3): 2517-2532. doi: 10.3934/era.2020127
[9]	Qian He, Wenxin Du, Feng Shi, Jiaping Yu . A fast method for solving time-dependent nonlinear convection diffusion problems. Electronic Research Archive, 2022, 30(6): 2165-2182. doi: 10.3934/era.2022109
[10]	Changling Xu, Huilai Li . Two-grid methods of finite element approximation for parabolic integro-differential optimal control problems. Electronic Research Archive, 2023, 31(8): 4818-4842. doi: 10.3934/era.2023247

Abstract

1. Introduction

In this paper we are interested in numerical approximation of heat conduction with phase change in heterogeneous and composite materials. We are motivated by important applications in modeling permafrost, human tissue, and phase change materials (PCM) for thermal energy storage, smart textiles and buildings. These applications involve materials with drastically different thermal properties separated by an interface, thus require conservative algorithms. While we focus on mathematical and computational challenges rather than on practical engineering or geophysics scenarios, we aim to develop robust and accurate yet simple algorithms adequate for low regularity of solutions and easily extendable to future multiphysics simulators. Challenges similar to those discussed here occur in simulation of multiphase flow in fractures or rocks of different type ^[1,2], as well as in other applications which we give in what follows.

Overview. We focus on two phases: solid and liquid, separated by a free boundary, or by a region in which these phases coexist. In a single material we consider the following nonlinear parabolic equation

$\begin{eqnarray} \partial_t w +\nabla \cdot q = f; \;\; q = -\nabla \cdot (k(\theta)\nabla \theta); \end{eqnarray}$

(1.1)

to be supplemented by appropriate boundary and initial conditions. The model is solved for the temperature $\theta$ , the internal energy (enthalpy) density $w$ and the heat flux $q$ . Here $f$ represents heat sources, $k$ is the heat conductivity. The definition of $w$ closes the model. For this definition, we consider one of the following equilibrium relationships

$\begin{eqnarray} (ST): \; w &\in& \alpha ^{ST}(\theta), {\rm{ or }} \end{eqnarray}$

(1.2a)

$\begin{eqnarray} (ST)_{\epsilon}:\; w & = & \alpha ^{ST_{\epsilon}}(\theta), \; \rm { or } \end{eqnarray}$

(1.2b)

$\begin{eqnarray} (P):\; w & = &\alpha^P(\theta). \end{eqnarray}$

(1.2c)

In particular, in the Stefan problem denoted by (ST), $w\in \alpha^{ST}(\theta) = c\theta +L\chi$ . Here $c$ is the heat capacity, $L$ is the latent heat, and the water fraction $\chi \in \mathcal{H}(\theta)$ (where $\mathcal{H}$ is the Heaviside graph) "translates" the well known Stefan condition prescribing the velocity of the free boundary $S$ between the liquid and solid regions to (1.1) defined over both solid and liquid regions. Its approximation is (ST) $_{\epsilon}$ , in which $\alpha ^{ST_{\epsilon}}(\theta)$ is some $\epsilon$ -dependent single-valued piecewise smooth approximation to $\alpha ^{ST}$ with Lipschitz constant $L_{\alpha ^{ST_{\epsilon}}}\sim \epsilon^{-1}$ ; see Section 2 for details. In turn, in permafrost models (P) $w = \alpha^{P}(\theta)$ is given by a monotone piecewise smooth Lipschitz function $\alpha^P(\cdot)$ , with some large Lipschitz constant $L_{\alpha^{P}}$ ; we give details in Section 5. See also a summary of notation in Table 1.

Table 1. Variables and symbols used in this paper. The units are chosen for consistency with the literature; ^[26] e.g.

Variable	Description	Units
$\theta$	Temperature	$\mathrm{[{^\circ C}]}$
$w$	Internal energy (enthalpy) density	$\mathrm{[{J / cm^3}]}$
$q$	Heat flux	$\mathrm{[{J / cm^2 \ s}]}$
$\chi$	Water fraction	$\mathrm{[{-}]}$
$c$	Volumetric heat capacity	$\mathrm{[{J /cm^3 \ ^\circ C}]}$
$k$	Thermal conductivity	$\mathrm{[{J / cm \ s \ ^\circ C}]}$
$L$	Latent heat of fusion/melting	$\mathrm{[{J / cm^3}]}$
$u$	Kirchhoff temperature	$\mathrm{[{J / cm \ s}]}$
$\eta$	Porosity	$\mathrm{[{-}]}$
$\chi_w$	Unfrozen water content	$\mathrm{[{-}]}$
Domains	Description	Units
$x \in \Omega\subset \mathbb{R}^d$	Spatial variable in the domain of heat conduction	$\mathrm{[{cm}]}$
$t \in (0, T)$	Time variable and interval	$\mathrm{[{s}]}$
$(x, t) \in \mathcal{Q}$	Space-time cylinder	$\mathrm{[{(cm, s)}]}$
$N_{MAT}$	Number of different materials	$\mathrm{[{-}]}$
$\Omega^{(j)}$	Subdomain with material $j = 1, \dots N_{MAT}$	$\mathrm{[{-}]}$
Relationship	Description
$\theta = \beta(w)$	Temperature-enthalpy relationship
$w \in \alpha(\theta)$	Enthalpy-temperature relationship
$u = \beta_K(w)$	Kirchhoff temperature-enthalpy relationship
$w \in \alpha_K (u)$	Enthalpy-Kirchhoff temperature relationship
Superscript	Choice of constitutive relationships
$ST$	Stefan problem
$ST_{\epsilon}$	Regularized Stefan problem
$P$	Permafrost model

| Show Table

DownLoad: CSV

Known work. Even in a single material the models (1.1) and (1.2) does not have classical solutions and features challenges to the analysis and approximation due to the nonlinear multi-valued or only piecewise smooth character of $\alpha(\cdot)$ . Typically, (1.1) is posed in the sense of distributions, and its solutions $(\theta, w)$ have low regularity.

The majority of numerical analysis for (1.1), under homogeneous Dirichlet boundary conditions, is carried out for (ST) $_{\epsilon}$ after so-called Kirchhoff transformation which renders (ST) or (ST) $_{\epsilon}$ in the equivalent form, and with a new variable $u$ instead of $\theta$

$\begin{eqnarray*} \partial_t w- \Delta u = f, \; u = \beta_{\rm{K}}(w). \end{eqnarray*}$

Since $u$ (as well as $\theta$ ) are expected to be continuous, it appears natural to seek their continuous piecewise linear (P1) finite element approximations. In turn, $w_h$ is either also piecewise linear (P1) ^[3] or piecewise constant (P0) ^[4,5]. We denote these approaches, respectively, as P1-P1 or P1-P0. For the permafrost (P) problem, the literature reports on the node-centered finite difference or piecewise linear finite element implementations, but without theoretical analysis or studies of convergence; see, e.g., in ^{[6,7,8,9,10,11]}. These also fall in the category of P1-based approximations. In turn, it is common to use cell-centered finite differences (CCFD) or finite volumes (FV) for a variety of fluid flow problems including those in subsurface ^[12].

Heterogeneous materials and conservative approximations. It is well-known that the P1-based approximations due to their poor approximation of fluxes are not well suited for computational modeling of problems with large heterogeneity of coefficients such as Darcy flow in porous media coupled to transport; see, e.g., discussions in ^[12,13]. For computational models of (1.1) and (1.2) alone in single material, this would not be an issue.

However, our goals in this paper are to study (1.1) and (1.2) in heterogeneous domains; additionally, we seek robust algorithms suitable for coupled multiphysics system such as THM ^{[14,15,16,17]}, and, simultaneously, for multiphase multicomponent systems at complicated geometries such as at pore-scale based on voxel grids ^{[18,19,20,21]}. Therefore, we draw from literature on conservative approximations using mixed finite element methods for multiphase flow in porous media and specifically on problems with nonlinearities similar to (1.2) ^[22,23,24]. These feature conservative fluxes and piecewise constant (P0) approximations to scalar unknowns. In this paper we approximate both $\theta$ and $w$ with piecewise constants (P0), thus we refer to these algorithms as P0-P0. Finally, when fluxes are approximated in the space $RT_{[0]}$ (discussed below), the algorithms can be conveniently implemented as cell-centered finite differences (CCFD); this gives a robust easily extendable structure towards more complex physics across many scales. Although these algorithms feature lower order approximation error, this is not an issue given low regularity of solutions to (1.1). Unfortunately, the approximation of fluxes features a technical gap since the normal fluxes for (ST) feature a jump; we defer the associated theoretical issues to future work.

Our results. (i) We first evaluate the feasibility of P0-P0 approximations with fully implicit in time approximations for (1.1) with (1.2) for a single material such as bulk water. We show our approach compares well to P1-based schemes, and that the solvers are robust, even if there are some theoretical concerns. (ii) Second, we develop P0-P0 algorithms for the heterogeneous case $k = k(x; \theta)$ and $\alpha = \alpha(x; \theta)$ in multiple materials, when the data features the realistic but most challenging case of piecewise constant $k(x; \cdot)$ and $\alpha(x; \cdot)$ . Another challenge is that $\theta$ may not be continuous across material interfaces with high thermal resistivity. We prove various results and show robustness of our P0-P0 and monolithic CCFD algorithm which is conservative. Finally, we apply P0-P0 scheme to permafrost modeling including in heterogeneous materials, tie the theoretical framework for $\alpha^{P}$ to that developed for other $\alpha ^{ST}$ and $\alpha ^{ST_{\epsilon}}$ , and confirm convergence.

Overall, we find that P0-P0 algorithm is robust for heterogeneous extensions of (1.1) with any of (1.2) we considered. The order of convergence in the scalar variables $(\theta, w)$ is, for the most part, about $O(h)$ and, $O(\sqrt{h})$ whenever time step $\tau = O(h)$ is used. The nonlinear solver performs also well, even though it might need improvements for strongly heterogeneous nonlinearities.

Outline: We give details on the phase change problems modeled by (1.1) with (1.2a) and (1.2b) in Section 2. In Section 2.4 we overview, compare, and illustrate traditional piecewise linear approximations to (ST) and (ST) $_{\epsilon}$ which we call P1-P1 or P1-P0. In Section 3 we define our P0-P0 approximations to (ST) and (ST) $_{\epsilon}$ , prove some auxiliary results, and compare convergence and robustness of the scheme to P1-based approaches. In Section 4 we address heterogeneous (ST) such as in (1.3), and prove properties of the P0-P0 scheme for the solutions that feature a jump across the material interfaces. In Section 5 we provide details on the permafrost model (P) (1.2c), along with convergence results for P0-P0 scheme, and illustrate with simulations for both homogeneous and heterogeneous problem. We close in Section 6, with auxiliary results given in Appendix 7.

1.1. Notation

The problem (1.1) is posed in $\mathcal{Q} = \Omega \times (0, T)$ where $\Omega \subset \mathbb{R}^d$ is an open bounded spatial domain with a smooth boundary $\partial \Omega$ . In heterogeneous case, we partition $\overline{\Omega} = \bigcup_{j = 1}^{ N_{MAT}} \overline{\Omega^{(j)}}$ into subdomains $\Omega^{(j)}$ , corresponding to $j = 1, \ldots N_{MAT}$ materials, with the material interfaces $\Gamma = \bigcup_{ij} \Gamma^{(ij)}, \Gamma^{(ij)} = \partial \Omega^{(i)} \cap \partial \Omega^{(j)}$ . Thermal properties of a region depend on the material in that region. We assume that in material $j$ the heat conduction and phase transition parameters are known fixed constants. For example, at a point $x \in \Omega^{(j)}$ , the freezing temperature is fixed for this material so that $\theta_{fr}(x) = \theta_{fr}^{(j)}$ . In turn, the thermal conductivity $k(x; \theta) = k^{(j)}(\theta)$ depends on the temperature in a fixed way specific to material $j$ . We denote this dependence of thermal properties as follows

$\begin{eqnarray} c(x;\theta)\vert_{\Omega^{(j)}} = c^{(j)}(\theta); k(x;\theta)\vert_{\Omega^{(j)}} = k^{(j)}(\theta); \theta_{fr}(x)\vert_{\Omega^{(j)}} = \theta_{fr}^{(j)}; L(x)\vert_{\Omega^{(j)}} = L^{(j)}. \end{eqnarray}$

(1.3)

The notation $\mathcal{Q}^{(j)}_p$ indicate the portion of $\mathcal{Q}$ in domain $(j)$ and phase $p$ .

We assume that the subdomains and interfaces are smooth enough so that standard notation and results using the spaces $C^k(\Omega)$ , and Lebesque and Sobolev spaces such as $L^p(\Omega)$ and $H^k\Omega)$ make sense. For a Lipschitz continuous function $f$ we denote its Lipschitz constant by $L_f$ . We denote by $V = H_0^1(\Omega)$ for the primal variational formulation, and by $X = H_{\mathrm{div}}(\Omega)$ and $M = L^2(\Omega)$ the spaces for fluxes and scalars, respectively, needed in the mixed setting. By $(u, v) = \int_{\Omega}uv$ we denote the inner product of scalar functions on $L^2$ as well as that for vector valued functions on $(L^2)^d$ . The norms $||{f}||_{{G}}$ of functions in space $G$ are as indicated with a subscript $G$ which is dropped if $G = L^2(\Omega)$ or $G$ can be inferred from context. For time-dependent problems when $t \in (0, T)$ and, e.g., $u \in L^2(0, T;V)$ we use the shorthand notation $L^2(V)$ . For discrete formulations, we recall the spaces $V_h \subset V$ of piecewise linear finite elements based on a triangular grid. We will also use $X_h \subset X$ some approximations to the vector valued functions such as heat flux, and $M_h \subset M$ the space of piecewise constant approximations.

By ${\bf{1}}_U(x)$ we denote the characteristic function of set $U \subset \mathbb{R}^d \ni x$ . For two sets $\Omega^-$ and $\Omega^+$ separated by an interface $\partial \Omega^+ \cap \partial \Omega^-$ , by $[r(x)] = (r\vert_{\Omega^+}-r\vert_{\Omega^-})\vert_x$ we denote the jump of quantity $r$ at some interface point $x \in \partial \Omega^+ \cap \partial \Omega^-$ . These two domains correspond to two materials, or to two phases of one material. When the sign of the jump is important, we make it precise on which side of the interface the limits are taken.

Next, consider real functions $u, w$ defined on $\mathcal{Q}$ , and a function $\beta: \mathbb{R} \to \mathbb{R}$ . The statement $u = \beta(w)$ means that $u(x, t) = \beta(w(x, t))$ a.e. on $\mathcal{Q}$ . Similarly, consider a graph of a relation $\alpha \subset \mathbb{R} \times \mathbb{R}$ for which $\alpha^{-1} = \beta$ is a non-injective function, but $\alpha(\cdot)$ is multi-valued. In this case $\upsilon \in \alpha(r)$ is equivalent to $(r, \upsilon) \in \alpha$ , and, for functions $u, w$ on $\mathcal{Q}$ , the statement $w \in \alpha(u)$ means that $w(x, t) \in \alpha(u(x, t))$ a.e. $(x, t) \in \mathcal{Q}$ .

In time-discrete formulations we use uniform time stepping with time step $\tau > 0$ and $0 = t^0 < t^1, \dots$ and $t^n = n\tau$ . For an unknown $u(\cdot, t^n)$ , we denote by $u^n(\cdot)$ its time approximation, and by $u_h^n(\cdot)$ its fully discrete approximation, which may be identified by the set of values such as $U^n_i$ . For a known function $f(\cdot, t)$ , we denote $f^n$ to be the value $f(\cdot, t^n)$ or its integral average $\tfrac{1}{\tau}\int_{t^{n-1}}^{t^n}f(\cdot, \upsilon)d\upsilon$ , made precise in the context.

Finally, when using physical data and units, we use SI units or state otherwise; see Table 1.

In addition, we make some assumptions on the data. Assumption 1.1 is a standard assumption for data of heat conduction problems.

Assumption 1.1. We assume that the latent heat coefficients $L^{(j)}\geq 0$ , and that heat capacity and conductivity coefficients are $c_{max}\geq c_p^{(j)}\geq c_{min} > 0; k_{max} \geq k_p^{(j)}\geq k_{min} > 0$ for all phases $p = s, l$ , and domains $j = 1, \ldots N_{MAT}$ .

For simplicity of notation when reviewing literature as well as in the proofs of some results we assume the following. (More realistic cases are given in examples.)

Assumption 1.2. Let $\theta$ satisfy homogeneous Dirichlet b.c. $\theta\vert_{\partial \Omega} = 0$ . We also assume that some initial value $w_{init}\in L^2(\Omega)$ is given.

Last but not least, our focus in this paper is on the nonlinear relationship $\alpha$ given as one of the three specific choices $\alpha^{(ST)}$ , $\alpha^{(ST)_{\epsilon}}$ , $\alpha^{(P)}$ defined in (1.2) which come from phase change problems. However, our results apply to other possible $\alpha(\cdot)$ with properties summarized in Assumption 1.3. These properties are well known for $\alpha^{(ST)}$ , $\alpha^{(ST)_{\epsilon}}$ ; see, e.g., ^[25]. For $\alpha^{(P)}$ in the permafrost problem, these properties follow from the algebraic formulas discussed in Section 5.

Assumption 1.3. We assume that $\alpha$ satisfies one of the following properties (a)–(c), similar to (ST), (ST) $_{\epsilon}$ , (P), respectively: (a) $\alpha$ is a maximal monotone graph with a non-injective Lipschitz inverse $\beta = \alpha^{-1}$ , similarly to $\alpha ^{ST}$ , (b) $\alpha$ is a monotone strictly increasing function which is piecewise smooth and globally Lipschitz, with an injective Lipschitz inverse, similarly to $\alpha ^{ST_{\epsilon}}$ , (c) $\alpha$ is a smooth strictly increasing globally Lipschitz function which has one point of non-differentiability (similar to $\alpha^P$ ).

2. Formulations and approximation of Stefan problem in a single material

In this section we provide details on (ST) and (ST) $_{\epsilon}$ . We suppress these superscripts now. A description of the variables used in this paper is given in Table 1.

2.1. Stefan problem

We follow closely ^[27] as well as the applications literature ^[28] for thermodynamics of multiple phases. We consider the temperature field $\theta(x, t), x \in \Omega, t > 0$ and define $\Omega_l = \{x\in \Omega: \theta > 0\}$ , $\Omega_s = \{x\in \Omega: \theta < 0\}$ , with $\mathcal{Q}_l, \mathcal{Q}_s$ defined analogously. We also consider the phase interface $S = \partial \mathcal{Q}_l \cap \partial \mathcal{Q}_s$ , and $S_t$ its instance at time $t$ . The motion of $S_t$ is described by its velocity $v$ , with the normal component $v \cdot \nu$ . In turn, the normal $n$ to $S$ is oriented towards $\mathcal{Q}_l$ , with components $(n_x, n_t)$ with $n_x$ parallel to $\nu$ , so that we have $n_t = -v \cdot n_x$ .

We recall first the so-called strong (classical) formulation of Stefan problem: we seek the temperature $\theta$ in each region $\Omega_p, p = s, l$ which satisfies the heat equation

$\begin{eqnarray} \partial_t (c\theta) + \nabla \cdot q = f; \;\;\;\;\; q = - k\nabla \theta, \; x \in \Omega_p. \end{eqnarray}$

(2.1a)

The first part of (2.1a) is the energy conservation involving heat flux $q$ , external source $f$ , and the internal energy $c\theta$ dependent on the temperature. The second part is the Fourier heat conduction law.

The coefficients of volumetric heat capacity $c$ and heat conductivity $k$ depend on the phase of a material; see for typical values. In this paper we consider the simplest realistic case in which $c, k$ are piecewise constant in each $\Omega_s, \Omega_l$ , respectively.

$\begin{eqnarray} c(\theta) = \begin{cases} c_s,& \theta < 0\\ c_l,& \theta > 0 \end{cases}, \;\; k(\theta) = \begin{cases} k_s,& \theta < 0\\ k_l,& \theta > 0 \end{cases}. \end{eqnarray}$

(2.1b)

Table 2. Thermal properties of some materials including water, components of phase change materials (PCM), human tissue, minerals, rock grains and insulators. Given are the latent heat of fusion/melting

$L$ , volumetric heat capacity

$c$ , thermal conductivity

$k$ , and melting point. For each material, the available thermal properties of its solid (

$s$ ) and liquid (

$l$ ) phase are provided. (

$^*$ ) Depends on proportion of silicone. (

$^{**}$ ) Depends on water content. The properties of (W) component will be used frequently below.

Material units	$L$ $\mathrm{[{J/cm^3}]}$	$c_s/c_l$ $\mathrm{[{J/ cm^3 \ ^\circ C}]}$	$k_s/k_l$ $\mathrm{[{J/ cm \ s \ ^\circ C}]}$	Melting $\mathrm{[{^\circ C}]}$ Point	Ref.
(W) Water/ice	306.00	1.90/4.19	0.0230/0.0058	0	^[26]
Paraffin solid/liquid	183.18	1.58/1.84	0.0040/0.0040	60	^[35]
Rubitherm RT 55	149.60	1.76/1.54	0.0020/0.0020	55	^[36]
Octadecanol (silicone) $^*$	168 to 213		0.002 to 0.002	55 to 60	^[37]
Human skin (epidermis) $^{**}$		0.8	0.0019		^[38,39]
Human muscle			0.0017 to 0.011		^[38]
Silica		0.7	0.014	1713	^[39,40]
Styrofoam			$0.00029$		^[41]

| Show Table

DownLoad: CSV

One can extend (2.1b) to $\theta = 0$ using arithmetic averages ^[27](Ⅳ.4.1); see also Section 3.2.3.

2.1.1. Free boundary

Finding the free boundary $S = \partial \mathcal{Q}_l \cap \partial \mathcal{Q}_s$ , is part of the problem, with $S(x, t): \theta(x, t) = 0$ . Additionally, the Stefan condition governs the velocity of $S_t$ as follows

$\begin{eqnarray} [(q_l-q_s) \cdot \nu] = Lv\cdot \nu; {\rm{ or }} [(k_l\nabla_{\nu} \theta_l-k_s \nabla_{\nu} \theta_s ] = -Lv\cdot \nu; \end{eqnarray}$

(2.1c)

$\begin{eqnarray} {\rm{ or }} [(k_l\nabla \theta_l-k_s \nabla \theta_s ] \cdot n_x = Ln_t \end{eqnarray}$

(2.1d)

where $\nu$ is the unit normal to $S_t = S \cap \Omega \times \{t\}$ .

The strong (classical) form of the Stefan problem seeks $\{S, \theta\}$ , with $\theta$ is expected to have continuous derivatives to all the relevant order in $\mathcal{Q} \setminus S$ . As is well known, such a solution may not exist ^[27] (v)Section Ⅳ.1. Hence, there is need to define weak solutions of (2.1).

2.2. Weak formulation

The weak form of (2.1) is derived upon integration by parts using test functions from $D(\mathcal{Q})$ and assuming that $\theta \in C(\overline{ \mathcal{Q}})$ but that its derivatives need only be in $L^1(\mathcal{Q}\setminus S)$ . The energy conservation (2.1a) is written in all of $\mathcal{Q}$ , and the weak form, in the sense of distributions, reads

$\begin{eqnarray} \partial_t w + \nabla \cdot q = f; \; q = - k\nabla \theta, \;\; {\rm{ in }}\; D'( \mathcal{Q}). \end{eqnarray}$

(2.2a)

The definition of enthalpy $w$ follows by integration of parts of (2.2a) written in the sense of distributions, the use of (2.1a)–(2.1c) along with the assumption that $S$ is smooth and $\theta$ continuous across $S$ ; see, e.g., ^[27](Ⅳ.1p.101) ^[25](A1 p244). This definition "translates" the Stefan condition (2.1c) as follows, in one of the many variants

$\begin{eqnarray} w = c\theta + \frac{L}{2}\phi; \;{\rm{ or }}\; w = c\theta+L\chi. \end{eqnarray}$

(2.2b)

Here $\phi$ is the order (phase) parameter and $\chi = \frac{\phi+1}{2}$ is the water fraction. In equilibrium, these are set to $\phi = 1, \chi = 1$ in $\Omega_l$ and $\phi = -1, \chi = 0$ in $\Omega_s$ . On $S$ where $\theta = 0$ , $\phi$ and $\chi$ are independent variables determined uniquely by the heat content $w$ ; in particular, $0 \leq \chi = \frac{w}{L} \leq 1$ when $\theta = 0$ . However, when written in terms of temperature, the relationships $(\theta, \chi)$ or $(\theta, \phi)$ appear multi-valued

$\begin{eqnarray} \phi \in \rm{sgn}(\theta) = \begin{cases} -1,& \theta < 0\\ [-1,1], & \theta = 0\\ 1,& \theta > 0 \end{cases}, \;\; \chi \in \mathcal{H}(\theta) = \begin{cases} 0,& \theta < 0\\ [0,1], & \theta = 0\\ 1,& \theta > 0 \end{cases}. \end{eqnarray}$

(2.3)

This only appears when the fact that $\chi$ (and $\phi$ ) are independent variables is ignored.

The definition (2.2b) can be also explained from thermodynamic principles: here we have $dw = c(\theta, \chi) d\theta+L d\chi$ which expresses a general dependence of $c$ on $\theta$ in (2.1b) as a weighted fraction of the heat capacities $c_l$ in the liquid and $c_s$ in solid phases

$\begin{eqnarray} c = c_l \chi + c_s(1-\chi), \end{eqnarray}$

(2.4)

with a possibly variable $c_p = c_p(\theta)$ . Then one integrates $dw = c d\theta+L d\chi$ to yield

$\begin{eqnarray} w(\theta,\chi) = \int_{\theta_{fr}}^{\theta} \left[ c_l(\upsilon)\chi(\upsilon) + c_s(\upsilon)(1-\chi(\upsilon)) \right] d\upsilon + L \chi. \end{eqnarray}$

(2.5)

This formula implies (2.1b) and has a nice nontrivial counterpart in permafrost; see Section 5.

In relaxation or in phase field models, i.e., away from equilibrium, $\phi$ is not given by (2.3), but is governed by its own dynamics, with its range possibly outside $[-1, 1]$ . We describe and illustrate such models in Section 2.4.1.

2.2.1. Kirchhoff transformation

The analysis of (2.2) proceeds after we change variables and write

$\begin{eqnarray} \partial_t w - \Delta u = f, \; w \in \alpha_{\rm{K}}(u)\equiv u = \beta_{\rm{K}}(w); \; {\rm{ or }}\; \partial_t \alpha_{\rm{K}}(u)-\Delta u \ni f. \end{eqnarray}$

(2.6)

The variable $u$ is called the "Kirchhoff temperature" and is distinguished from the true temperature $\theta$ . To transform (2.2) to (2.6) we replace $k \nabla \theta = k(\theta)\nabla\theta = \nabla u$ with $u(\theta) = \int_0^{\theta} k(r)dr$ , which, with (2.1b), gives

$\begin{eqnarray} u = u(\theta) = \begin{cases}k_l \theta, &\theta\geq 0, \\ k_s \theta, &\theta\leq 0 \end{cases}, \;\; {\rm{ or }} \;\; \theta = \theta(u) = \begin{cases}\frac{u}{k_l}, &u\geq 0, \\ \frac{u}{k_s}, &u\leq 0 \end{cases}. \end{eqnarray}$

(2.7)

Next, from (2.2b) with (2.1b) we have the relationships for $(\theta, w)$

$\begin{eqnarray} \theta = \beta(w) = \begin{cases} \frac{w}{c_s},& w < 0\\ 0, & w \in [0,L]\\ \frac{w-L}{c_l},& w > L \end{cases} \equiv w \in \alpha(\theta) = \begin{cases} c_s \theta,& \theta < 0\\ [0,L], & \theta = 0\\ L+c_l \theta,& \theta > 0 \end{cases}. \end{eqnarray}$

(2.8)

Finally, we combine (2.8) with (2.7) and see that $u = u(\theta) = u(\beta(w)) = \beta_{\rm{K}}(w)$ with

$\begin{eqnarray} u = \beta_{\rm{K}}(w) = \begin{cases} \frac{k_sw}{c_s},& w < 0\\ 0, & w \in [0,L]\\ \frac{k_l(w-L)}{c_l},& w > L \end{cases} \equiv w \in \alpha_{\rm{K}}(u) = \begin{cases} \frac{c_s u}{k_s},& u < 0\\ [0,L], & u = 0\\ L+\frac{c_lu}{k_l},& u > 0 \end{cases}. \end{eqnarray}$

(2.9)

We see that $\beta$ and $\beta_{\rm{K}}$ are non-injective functions, while $\alpha$ and $\alpha_{\rm{K}}$ are maximal monotone graphs, i.e., the range of identity plus the graph is $\mathbb{R}$ , and they are monotone. The functions $\beta_{\rm{K}}, \beta$ are Lipschitz, with Lipschitz constants

$\begin{eqnarray} L_{\beta} = max\{c_s^{-1},c_l^{-1}\}; \;\;\ L_{ \beta_{\rm{K}}} = max\{k_s c_s^{-1},k_lc_l^{-1}\}. \end{eqnarray}$

(2.10)

The functions $\beta_{\rm{K}}, \beta$ along with the graphs $\alpha_{\rm{K}}, \alpha$ are affine bounded, i.e., there are constants $C_{\beta} = max\{c_s^{-1}, c_l^{-1}, Lc_l^{-1} \}, C_{ \beta_{\rm{K}}} = max\{k_s c_s^{-1}, k_l c_l^{-1}, L k_l c_l^{-1} \}$ and $C_{\alpha} = max\{c_s, c_l, L\}, C_{ \alpha_{\rm{K}}} = max\{c_s k_s^{-1}, c_l k_l^{-1}, L \}$ with which

$\begin{eqnarray} |\beta(w)| &\leq& C_{\beta}(1 + |w|); \; | \beta_{\rm{K}}(w)| \leq C_{ \beta_{\rm{K}}}(1 + |w|), \ \forall w \in \mathbb{R}, \end{eqnarray}$

(2.11a)

$\begin{eqnarray} |w| &\leq& C_\alpha(1 + |\theta|), \forall \theta \in \mathbb{R}, w \in \alpha(\theta); \end{eqnarray}$

(2.11b)

$\begin{eqnarray} | \alpha_{\rm{K}}(u)| &\leq& C_{ \alpha_{\rm{K}}}(1 + |u|), \forall u \in \mathbb{R}, w \in \alpha_{\rm{K}}(u). \end{eqnarray}$

(2.11c)

2.2.2. Approximations of (ST)

There are many ways to approximate a multi-valued graph $\alpha$ (or $\alpha_{\rm{K}}$ , ${\mathrm{sgn}}$ or $\mathcal{H}$ ) by a single-valued Lipschitz function so that its inverse is injective. One very specific (one-sided) approximation is the Yosida approximation

$\begin{eqnarray} \alpha_{\lambda} = \tfrac{1}{\lambda}(I-(I+\lambda \alpha)^{-1}). \end{eqnarray}$

(2.12)

We see that $L_{\alpha_\lambda} = O(\lambda^{-1})$ which blows up as $\lambda \downarrow 0$ . For this and other similar approximations of $\alpha ^{ST}$ by $\alpha ^{ST_{\epsilon}}$ there is no jump of $q \cdot n$ or of $w$ across the free boundary $S$ . Rather, these variables vary sharply in the region

$\begin{eqnarray} \Omega_l^{\epsilon} = \{x: 0 < \theta(x) < \epsilon\}. \end{eqnarray}$

(2.13)

See, e.g., illustration in Figure 1(c).

Figure 1. Illustration of domains of interest in this paper. (a) Liquid and solid domains in (ST) with a sharp liquid-solid interface

$S: \theta = 0$ (dashed curve), which is advancing during melting; shown is its normal velocity given by Stefan condition and its components. (b) Domains in the regularized (ST)

$_{\epsilon}$ problem including the light colored region

$\Omega_l^{\epsilon} \subset \Omega_l$ given by (2.13):

$0 < \theta(x, t) < \epsilon; x \in \Omega_l^{\epsilon}$ ; (c-d) Domains with with piecewise constant thermal properties. (c) Pore-scale domain with two materials: grains in black and void space filled with ice and water. (d) Phase change material at mesoscale.

DownLoad: Full-Size Img PowerPoint

2.3. Well-posedness

The models (1.1) with (1.2) and in particular (2.2) and (2.6) are nonlinear monotone evolution equations with maximal monotone graphs $\alpha$ . Theory for such equations in Banach spaces is given in ^[25]; the specific case of Stefan problem is discussed in ^[27] in an abstract Hilbert space setting; see also ^[3].

We recall only the case of Dirichlet homogeneous boundary conditions, in the Hilbert space case with $V = H_0^1(\Omega)$ . We seek $w \in L^{\infty}(L^2) \cap H^1(H^{-1})$ , $u \in L^2(V)$ such that $u = \beta_{\rm{K}}(w), \ w(\cdot, 0) = w_{init} \in L^2(\Omega)$ and (see, e.g., ^[4])

$\begin{eqnarray} &(\frac{\partial w}{\partial t}, \psi) + (\nabla u, \nabla \psi) = 0, \ \forall \psi \in V. \end{eqnarray}$

(2.14)

Assuming some smooth enough $w_{init}$ is given, the maximal monotonicity of $\alpha_{\rm{K}}$ and the fact that both $\alpha_{\rm{K}}$ and $\alpha_{\rm{K}}^{-1}$ are affine bounded are sufficient for ^[27](1.20, p34; Thm.Ⅱ.5.1, p59). In particular, with $f \in L^1(L^2) \cap L^2(V')$ , and $w_{init} \in L^2(\Omega)$ , one obtains that there exists a unique solution $\int_0^T u \in L^{\infty}(V) \cap H^1(L^2)$ (note that this implies $u \in L^2(L^2)$ ). Further refinement of the theory ^[27](Prop.Ⅱ.1.3 and Thm.Ⅱ.1.4))^[27] gives $u, w\in L^{\infty}(L^2)$ , with uniqueness in ^[27](Thm 5.2). Further regularity ^[27](Thm 2.5) indicates $u\in H^1(L^2) \cap L^{\infty}(V)$ . Since $\theta$ can be found from (2.7) which is continuous, one generally obtains similar qualitative properties of $\theta$ as those of $u$ .

These results indicate quite low regularity of $w$ in contrast to the case when $\alpha_{\epsilon}$ is single valued and Lipschitz continuous, since then $w_{\epsilon}$ has the same regularity as $u_{\epsilon}$ ^[27](Thm 2.6). Further results hold for in-homogeneous Dirichlet data as long as it is smooth enough; these results elucidate the connection between weak and strong formulations ^[27](Ⅳ.6). Under nonlinear Neumann boundary conditions ^[29] for $N_{MAT} = 1$ one obtains $\theta \in L^2(H^1), w \in L^2(L^2)$ .

2.4. Approximation of solutions to Stefan problem in a single material: review

In this section we review several approaches towards the numerical approximation of (2.2) and the Kirchhoff transformed version (2.6) which combine time-stepping with some spatial discretization such as finite elements or finite differences. We focus on the traditional mesh-based PDE discretization; other approaches include explicit tracking of the free boundary, but without solving for $(\theta, w)$ on a mesh; see, e.g., ^[30] for review. These latter approaches do not apply very well to the simulation at pore-scale, permafrost, or heterogeneous materials.

In Section 2.4.1 we recall and illustrate time stepping schemes. Next in Section 2.4.2 we review spatial discretization approaches focusing on piecewise linear approximations $u_h \in V_h \subset V$ ; the methods differ by how $w_h$ is approximated and how (ST) $_{\epsilon}$ is selected.

2.4.1. Time stepping for an ODE

Consider some $f = f(t)$ , and $A > 0$ in

$\begin{eqnarray} &w' + A \theta = f(t), \ \theta(t) = \beta(w(t)), \ t > 0; &w(0) = w_{init}. \end{eqnarray}$

(2.15)

We illustrate several time discrete schemes which approximate the relationship of $(\theta, w)$ either by regularization, Chernoff formula, or by relaxation.

The backward Euler scheme is

$\begin{eqnarray} &w^n + \tau A \theta^n = \tau f^n + w^{n-1}, \; w^n \in \alpha(\theta^n), \ \forall n \geq 1,\; w^0 = w_{init}. \end{eqnarray}$

(2.16)

and the solution for every $n\geq 1$ is guaranteed from Lemma 7.1; in practice, we use Newton's iteration substituting $\theta^n = \beta(w^n)$ .

The solution to (2.15) can be also computed using so-called Chernoff formula ^[4]

$\begin{eqnarray} &\theta^n + \frac{\tau}{\mu} A\theta^n = \frac{\tau}{\mu} f^n + \beta(w^{n-1}), \end{eqnarray}$

(2.17a)

$\begin{eqnarray} &w^{n} = w^{n-1} + \mu (\theta^n - \beta(w^{n-1})), \ \forall n \geq 1, \; w^0 = w_{init}, \end{eqnarray}$

(2.17b)

where $\mu$ is the relaxation parameter, a constant approximation to $\frac{1}{\beta'}$ , which satisfies $0 < \mu \leq L_{\beta}^{-1}$ , with $L_\beta$ from (2.10). The Chernoff formula offers a way to linearize the non-linear relationship $\beta$ . Chernoff formula does not require any nonlinear iteration, since it solves a linear problem for $\theta$ followed by an update of $w^n$ . However, it produces consistency errors growing with $\mu^{-1}$ .

We also consider the phase-relaxation approach: we replace the equilibrium relationship (2.8) $w = c\theta+L\chi = c\theta+\frac{L}{2}(\phi+1)$ in which $\phi\in {\mathrm{sgn}}(\theta)$ by a coupled problem which allows $\phi = \phi(t)$ to evolve towards this equilibrium written as ${\mathrm{sgn}}^{-1}(\phi) \ni \theta$ . ^[27](Ⅴ.1). This relaxation is another form of approximation to the equilibrium problem; see ^[31] for the PDE (2.2). In a more general phase-field approach, one can consider an additional dissipative term similar to $A\phi$ , plus a coarsening term proportional to $\phi$ which acts to counteract $A\phi$ ; these together moderate the evolution of $\phi$ coupled to the PDE (2.2).

Let $\varepsilon > 0$ and $\gamma > 0$ . The phase-relaxation approach to (2.15) is given as

$\begin{eqnarray} & w'+A \theta = f, \; w = c (\theta) \theta +\frac{L}{2}(\phi+1), \; w(0) = w_{init}, \end{eqnarray}$

(2.18a)

$\begin{eqnarray} & \phi' + \frac{1}{\varepsilon} g(\phi) = \gamma \theta, \ \phi(0) = {\mathrm{sgn}}(w_{init}). \end{eqnarray}$

(2.18b)

Here $g$ is one of two possible choices which approximate ${\mathrm{sgn}}^{-1}(\cdot)$ and contain the linear destabilisation term. We consider $g_\circ(\phi) = \phi^3-\phi$ similar to that in Allen-Cahn equation, ^[32] which we call "smooth". The other is $g(\phi) = g_\square(\phi) = {\mathrm{sgn}}^{-1}(\phi)-\phi$ which we call "non-smooth" ^[27](Section Ⅵ.5) ^[33,34]. We discretize (2.18) fully implicitly and solve by Newton's iteration

$\begin{eqnarray} & w^n- w^{n-1} +\tau A \theta^n = \tau f^n, \; w^n = c(\theta^n)\theta^n+ \frac{L}{2} (\phi^n+1), \; w^0 = w_{init}, \end{eqnarray}$

(2.19a)

$\begin{eqnarray} & \phi ^n- \phi^{n-1} +\frac{\tau}{\varepsilon} g(\phi^n) = \tau \gamma \theta^n, \; \phi^0 = {\mathrm{sgn}}(w_{init}). \end{eqnarray}$

(2.19b)

Example 2.1. We solve (2.15) with $\alpha = \alpha ^{ST}$ given by (2.8) with parameters in (W). We choose $A = 10^{-2}, w_{init} = -1,$ and a forcing term $f(t) = \sin{\left(\frac{\pi t}{3600} \right)}$ for $t\in [0, 3600]$ . We derive the exact solution given in Section 7.1.

In we plot numerical solutions using $\tau = 40$ and one of the three schemes: fully implicit, Chernoff formula, and phase relaxation with (2.18). For Chernoff formula, we choose $\mu = 1.5$ so that $\mu \leq L_{\beta}^{-1} = c_s = 1.90$ . For phase relaxation, the key challenge here is a choice of the parameters $\varepsilon = 1/10, \ \gamma = 1$ , so that the time-relaxed dynamics resembles that of (2.15).

Figure 2. Numerical solution of Example 2.1 using Backward Euler, Chernoff formula, Smooth (S) and Non-smooth (NS) phase relaxation approximation. The thin black line shows the exact solution (7.1). From left to right: plots of

$\theta(t)$ ,

$w(t)$ and

$(\theta, w)$ .

DownLoad: Full-Size Img PowerPoint

In the end we see that the fully implicit solution (2.16) trails the exact solutions $\theta(t)$ and $w(t)$ fairly well. The Chernoff formula and phase relaxation approaches seem to have less close agreement, especially in $w(t)$ . While they offer other advantages, this experiment informs our subsequent choice to focus on the fully implicit time stepping.

2.4.2. Spatial approximations

Early approaches to numerical solution to (ST) include the nodal finite difference approach in ^[42] for which convergence (but no specific order) is proven, independently of $\epsilon$ in the regularization (ST) $_{\epsilon}$ of (ST) problem. In ^[26] some time error analysis is provided.

The majority of rigorous work is on nodal piecewise linear approximations $V_h \ni u_h \approx u \in V$ in (2.14); we call these P1-based. The approaches differ in time stepping (as in Section 2.4.1), in how the original problem (ST) is approximated by some (ST) $_{\epsilon}$ , and in how $w_h^n$ is defined. In particular, ^[43] prove $L^2(L^2)$ order of convergence close to $O(h)$ for fully implicit approximations for (ST) $_{\epsilon}$ , when $\epsilon = O(h^2), \tau = O(h^2)$ but require $\partial_t w_{\epsilon}$ to be bounded, and refer actually to simulations with the (P) problem discussed in ^[7] instead of (ST) $_{\epsilon}$ . In turn, ^[4,5,44] approximate solutions for some (ST) $_{\epsilon}$ rather than (ST); ^[4,5] make use of the Chernoff formula similar to (2.17), while ^[44] and ^[31] use phase relaxation; these are closely related as shown in Figure 2.

The main difference between the individual approaches is in the treatment of spatial integrals $\int_{\Omega} w \psi$ and $\int_{\Omega} u \psi$ with $\ \psi \in V_h$ , and in adjustments to how $w_h$ is found. In some schemes the numerical integration, or one of projection operators such as $P_h^0, P_h^1, \Pi_h$ are used. In some, piecewise constants and $w^n_h\in M_h$ are used, this is similar to our P0-P0 schemes to be defined in Section 3.

The theoretically estimated convergence error depends, as usual, on $h, \tau$ and $\epsilon$ . Generally, $\theta$ is predicted to be approximated well, qualitatively and quantitatively, by all the P1-based schemes, with the order about $O(h)$ , in the weaker norms. However, the errors $w-w_h$ are, as expected, higher, and $w_h$ appears "smeared" near the free boundary $S_t$ .

We set up detailed experiments and provide illustrations as part of our subsequent studies of P0-P0 schemes for (ST) problem in comparison to P1-based schemes. Our tests given in detail in Section 7.3 along with fine details on the schemes show somewhat better rates than those predicted and tested in ^[26,31,44]. Of the schemes studied, ^[5] produces the best approximation and convergence rates. We acknowledge the limitations of our study only in $d = 1$ , but believe these provide good starting point for subsequent comparisons with P0-P0 schemes.

3. Approximation to Stefan problem using P0-P0 finite elements and CCFD for a single material

In this section we propose P0-P0 spatial discretization for (1.1) combined with any one of (1.2) combined with fully implicit in time scheme. We start with a mixed finite element formulation for (1.1), using $M = L^2(\Omega)$ for scalar unknowns such as $\theta$ and $w$ , and $q \in X = H_{\mathrm{div}}(\Omega)$ which features continuous normal components across any smooth surface. Next we choose P0-P0 approximations $\theta_h, w_h \in M_h \subset M$ . We also seek fluxes $q_h \in X_h = RT_{[0]}$ on a rectangular grid as in ^[22,45]. The pair $(X_h, M_h)$ is a stable pair for the Darcy problem satisfying the Banach-Neĉas-Babuŝka conditions ^[46]. For linear problems they approximate the fluxes and scalar unknowns to the same order $O(h)$ , with superconvergence for smooth solutions and some norms ^{[45,46,47,48]}.

Remark 3.1. For nonlinear relationship represented by (1.2) we encounter here the major challenge. The choice $X = H_{\mathrm{div}}$ works for the (ST) $_{\epsilon}$ and (P) problems when $\alpha(\cdot)$ in (1.2b) and (1.2c) is single-valued. However, in the Stefan problem (ST) with (1.2a), the fluxes $q \not \in H_{\mathrm{div}}(\Omega)$ since their normal takes a jump across $S_t$ , which ties to the multivalued character of $\alpha(\cdot)$ . This raises concerns on the approximability of $q\not \in X$ by $q_h \in X_h$ . We acknowledge this difficulty and formally develop the theory only for the single-valued $\alpha$ such as for (ST) $_{\epsilon}$ and (P) problems, but we extend the algorithms in $X_h \times M_h$ to the (ST) problem. We defer further study to future considerations.

The P0-P0 algorithm has several attractive features. 1) First, the normal fluxes of $q_h \in X_h$ are continuous, which leads to conservative schemes across element and material interfaces; to support this, we work in the $(\theta, w)$ formulation instead with Kirchhoff variables $(u, w)$ . 2) Second, if $\theta_h$ is piecewise constant, it is natural to define $w_h = \alpha(\theta_h) \in M_h$ in a consistent fashion so that (2.8) is enforced at every degree of freedom. 3) Third, the P0-P0 equivalent to the mixed framework features approximation properties known from the literature; in particular, the results in ^[22] are most relevant for the present nonlinear case of (ST) $_{\epsilon}$ and (P) problems. In fact, we demonstrate that the convergence in $(\theta, w)$ is not inferior to that for P1-based schemes from Section 2.4 even for (ST) problem; this is done in Section 3.4. 4) Last, the approximations, up to quadrature, are equivalent to a CCFD scheme for $\theta_h \in M_h$ from which the fluxes $q_h \in X_h$ follow post-processing; these features, recalled in Section 3.2, make implementation easy and allow its extensions to more complex nonlinear problems and multiple materials.

Below we first set-up the notation and recall main results on the mixed finite element discretization leading to P0-P0 algorithm. For simplicity of notation we consider $\Omega \subset \mathbb{R}^d, d = 2$ and assume that it is well covered by a rectangular grid $\mathcal{T}^{h} = (\omega_{ij})_{ij}$ so that $\overline{\Omega} = \bigcup_{ij} \omega_{ij}$ , with $\max_{ij}|{\omega_{ij}}| = \max_{ij} h_{x, i} h_{y, j} = h^2$ . Each cell $\omega_{ij} \in \mathcal{T}^h$ has a center at some $(x_{ij}, y_{ij})$ and edges $\gamma_{i-1/2, j}, \gamma_{i, j-1/2}, \gamma_{i+1/2, j}, \gamma_{i, j+1/2}$ , when listed clockwise from the left edge. Throughout this section we use Assumption 1.2.

3.1. P0-P0 scheme from discrete mixed formulation for linear parabolic problems

We recall the mixed formulation for the linear case of (1.1) and (1.2b) with $L = 0$ but allowing $k = k(x), c = c(x)$ , with some given initial condition $w(x, 0) = w_{init}(x)$ . We have then $k^{-1}q = -\nabla \theta; \partial_t w + \nabla \cdot q = f; w = \alpha(\theta) = c\theta$ . Its weak formulation follows after we multiply each of the equations in (1.1) by test functions $\psi \in X$ , $\eta \in M$ , integrate by parts, respectively, and apply the boundary conditions. We seek $(q, \theta) \in X \times M$ which satisfy

$\begin{eqnarray} ({k}^{-1} q, \psi) -(\theta, \nabla \cdot \psi)& = &0, \;\; \forall \psi \in X, \end{eqnarray}$

(3.1a)

$\begin{eqnarray} (\nabla \cdot q, \eta) + (\partial_t w, \eta) & = &(f,\eta);\; \forall \eta \in M; \; w = \alpha(\theta) = c(x)\theta. \end{eqnarray}$

(3.1b)

(In these equations the symbols $(a, b)$ mean inner product $\int_{\Omega}ab$ in $L^2(\Omega)$ as given in Section 1.1). The approximations $q^n_h, \theta^n_h$ with $w_h = \alpha(\theta_h)$ are formulated after (3.1) is discretized in time, and when at each time step $t^n$ we seek $(q^n_h, \theta^n_h) \in X_h \times M_h$ which satisfy a system similar to (3.1). We make these precise now.

We consider the well known spaces $(X_h \times M_h)$ built on $\mathcal{T}^h$ with $X_h = RT_{[0]}$ , the lowest order Raviart-Thomas space on rectangles ^[22,45,48]. The space $M_h$ contains piecewise constants on $\mathcal{T}^h$ ; the basis functions spanning $M_h$ are simply ${\bf{1}}_{\omega_{ij}}$ , and $\theta_h\vert_{\omega_{ij}} = \Theta_{ij}$ associated with the cell centers of each $\omega_{ij}$ . The vector valued functions in $X_h$ are tensor products of piecewise linears in one coordinate with piecewise constants in the other. In particular, $(q_h)_1$ is identified by their edge values at the left and right edges $(i\pm 1/2, j)$ so we have, e.g., $(q_h)_1 \vert_{\gamma_{i+1/2, j}} = q_{i+1/2, j}$ ; analogously $(q_h)_2$ is identified by values at the bottom and top edges $i, j \pm 1/2$ , respectively, $(q_h)_2 \vert_{\gamma_{i, j-1/2}} = q_{i, j-1/2}$ . The basis functions for the vector valued functions in $X_h$ are $\psi_{i\pm 1/2, j}$ for $(q_h)_1$ and $\psi_{i, j\pm 1/2}$ for $(q_h)_2$ . Let $(Q^n, \Theta^n)$ denote the degrees of freedom for $q_h^n, \theta_h^n$ in their bases.

We define the fully implicit approximations $(q_h^n, \theta_h^n) \in X_h \times M_h$ to (3.1) as those that satisfy

$\begin{eqnarray} ({k}^{-1} q^n_h, \psi)_h -(\theta_h^n, \nabla \cdot \psi)& = &0, \;\; \forall \psi \in X_h, \end{eqnarray}$

(3.2a)

$\begin{eqnarray} (\nabla \cdot q_h^n, \eta) + (\frac{w_h^n-w_h^{n-1}}{\tau}, \eta) & = &(f^n,\eta);\;\forall \eta \in M_h;\; w_h = \alpha(\theta_h^n). \end{eqnarray}$

(3.2b)

We applied here numerical integration to the first integral in (3.2a) and replaced $({k}^{-1} q^n_h, \psi)$ by its approximation $({k}^{-1} q^n_h, \psi)_h$ . Specifically, the following numerical integration is used: on every $\omega_{ij} = (x_{i-1/2}, x_{i+1/2}) \times (x_{j-1/2}, x_{j+1/2})$ , the trapezoidal (T) scheme over $(x_{i-1/2}, x_{i+1/2})$ and (M) midpoint scheme on $(x_{j-1/2}, x_{j+1/2})$ for the first component of $q_h \cdot \psi_h$ , and $M \times T$ scheme for the second component. This leads to some useful simplifications, very well known and described in ^[49]. In particular, $({k}^{-1} q^n_h, \psi)_h$ gives $\mathcal{K} Q$ with a diagonal matrix $\mathcal{K}$ of positive edge factors $\mathcal{T}_{i \pm 1/2, j}$ and $\mathcal{T}_{i, j\pm 1/2}$ involving $k^{-1}$ ; see Section 7.5 for more details. We also get $(\nabla \cdot q_h^n, \eta) = - \mathcal{B} Q^n$ while $-(\theta_h^n, \nabla \cdot \psi)$ becomes $\mathcal{B}^T \Theta^n$ . The linear system reads

$\begin{eqnarray} { \mathcal{K}} Q^n + \mathcal{B}^T\Theta^n & = &0, \end{eqnarray}$

(3.3a)

$\begin{eqnarray} - \mathcal{B} Q^n +\frac{1}{\tau}W^n & = &G^n, \; W^n = \mathcal{C} \Theta^n. \end{eqnarray}$

(3.3b)

with $G^n = F^n+\tfrac{1}{\tau}W^{n-1}$ . Note that $\mathcal{C}$ is the diagonal matrix of positive coefficients $c\vert_{\omega_{ij}}$ .

Remark 3.2. After we multiply the second equation in (3.3) by $(-1)$ , the system (3.3) has a saddle-point structure. Thus it is well-posed, i.e., the operator $\mathcal{N} = \left[ \begin{array}{cc}{ \mathcal{K}} & \mathcal{B}^T\\ \mathcal{B} & - \mathcal{C}\end{array} \right]$ is an isomorphism ^[45](Prop.3.3.1 and Thm 3.6.2), because ${ \mathcal{K}}$ is coercive (positive definite) on the kernel of $\mathcal{B}$ , $\mathcal{C}$ is positive definite. and $\mathcal{B}$ is surjective from $X_h \to M_h$ .

Remark 3.3. The system (3.3) can be easily modified to account for non-homogeneous Dirichlet boundary conditions; see Section 7.5 for details.

Connection to CCFD. Since $\mathcal{K}$ is diagonal, every degree of freedom of $Q^n$ has an easy discrete interpretation, thus one can eliminate $Q^n$ , and (3.3) is equivalent to

$\begin{eqnarray} (\tau \mathcal{B} \mathcal{K}^{-1} \mathcal{B}^T + \mathcal{C}) \Theta^n = \tau G^n. \end{eqnarray}$

(3.4)

This system is known as the cell-centered finite difference (CCFD) formulation. Now $\mathcal{B} \mathcal{K}^{-1} \mathcal{B}^T$ is symmetric and at least nonnegative definite for Neumann boundary conditions, and positive definite for Dirichlet conditions. Since $\mathcal{C}$ is positive definite, we have a unique solution $\Theta$ from which $Q$ follows.

3.1.1. Literature notes

For linear problems such as (3.1), mixed FE solution $(q_h, \theta_h)$ features optimal first order convergence of the errors $||{q-q_h}||_{{X}}$ and $||{\theta-\theta_h}||_{{M}}$ for the choice of $RT_{[0]} \times M_h$ ^[46]. For Darcy and potential flow problems the quadrature error is lower order, and the mixed approach provides formal interpretation of the CCFD algorithm ^[49]. For parabolic problems under Neumann boundary conditions and strong assumptions on the smoothness of $\theta$ and $q$ , ^[48] shows that

$\begin{eqnarray} ||{\theta_h-\theta}||_{{\infty,2}} = O(\tau+h^2). \end{eqnarray}$

(3.5)

This order, is, in general, not featured for nonlinear problems such as (1.1).

3.2. P0-P0 scheme for nonlinear heat equation with single-valued nonlinearity

Now we consider (1.1) or Kirchhoff-transformed problem (2.6) with (1.2b) or (1.2c) in the mixed formulation. We seek $(q, \theta)$ or $(q, u)\in X \times M$ and replace $w = c\theta$ by $w = \alpha(\theta)$ or $w = \alpha_{\rm{K}}(u)$ in (3.1b) to get the weak mixed and discrete mixed formulations similar to that for (3.1).

3.2.1. Literature for nonlinear problems in mixed form

The challenge, well described in ^[22] is that one must respect the regularity of the unknowns $(q, \theta, w)$ which is usually inferior to that for the linear case when $\partial_t w \in L^2(\Omega)$ . In particular, when $\beta_{\rm{K}}(\cdot)$ has derivative vanishing pointwise, it may happen that $\partial_t w \not \in L^2(\Omega)$ . This difficulty is partially overcome with a Kirchhoff transformation and upon integration in time, and/or discretization in time; we refer to e.g., ^[22,46,50] for thorough discussion. The approach of taking finite differences in time is also known as the Rothe or Crandall-Ligget or Hille-Yosida framework ^[25,29].

For nonlinear problems which are Kirchhoff-transformed the mixed framework is set-up for the solutions $(w^n_h, u^n_h, q^n_h)$ in ^[22,23,51]. However, the error estimates depend on the smoothness of $w$ and $u$ . Disregarding the error in the fluxes, these results, when applied to (1.1) state that, as in e.g., ^[22]

$\begin{eqnarray*} ||{W^n-w^n}||_{{H^{-1}(\Omega)}} \leq C(u,q, \beta_{\rm{K}};h), \\ \sum\limits_{k = 1}^n (W^k-w^k,U^k-u^k) \tau \leq \overline{C}(u,q, \beta_{\rm{K}};h) , \end{eqnarray*}$

with $C, \overline{C} \downarrow 0$ as $h \downarrow 0$ depending on the smoothness of $u$ , but with the order not specified directly. The work ^[22] does not directly address existence and uniqueness of the solutions. Overall, the results in ^[22,23,51] are well-suited for problems such as Richards equation, with $\alpha$ which features somewhat different challenges than those for (ST), (ST) $_{\epsilon}$ , (P) problems listed in Assumption 1.3.

3.2.2. P0-P0 algorithm for nonlinear single-valued function

We extend now (3.3) to the nonlinear case when $k = k(\theta)$ and $w = \alpha(\theta)$ , working with $(\theta, w)$ as scalar unknowns. We define some approximations $\tilde{c}^n \approx c(\theta^n)$ and $\tilde{k}^n \approx k(\theta^n)$ , and take some $\tilde{\alpha} \approx \alpha$ ; we make these precise in Section 3.2.3. The fully discrete problem reads

$\begin{eqnarray} \tilde{ \mathcal{K}} Q^n + \mathcal{B}^T\Theta^n & = &0, \end{eqnarray}$

(3.6a)

$\begin{eqnarray} - \mathcal{B} Q^n +\frac{1}{\tau}W^n & = &G^n, \; W^n = \tilde{\alpha}(\Theta^n). \end{eqnarray}$

(3.6b)

This nonlinear system uses $\tilde{ \mathcal{K}}$ based on $\tilde{k}$ . We prove first that the system has a unique solution; this is needed since Remark 3.2 does not apply to this nonlinear case. We complete details on the algorithm in Section 3.2.3.

Lemma 3.1. Let Assumption 1.1 hold on $\tilde{k}$ , and let $\tilde{ \mathcal{K}}$ and $\mathcal{B}$ be computed as in Section 3.1; in particular, let $\tilde{ \mathcal{K}}$ be positive definite. Then there exists a unique solution $(Q, \Theta, W)$ to (3.6) and its generalization

$\begin{eqnarray} \tilde{ \mathcal{K}} Q + \mathcal{B}^T\Theta & = &0, \end{eqnarray}$

(3.7a)

$\begin{eqnarray} - \mathcal{B} Q +\frac{1}{\tau}W & = &g, \; W \in \tilde{\alpha}(\Theta). \end{eqnarray}$

(3.7b)

Proof. We discuss only (3.7) since the result for (3.6) follows as its special single-valued case. We proceed in one of two alternative ways which are each worthwhile discussing. One is that we rewrite (3.7) eliminating $Q$ as in (3.8)

$\begin{eqnarray} \mathcal{B} \tilde{ \mathcal{K}}^{-1} \mathcal{B}^T \Theta+\tfrac{1}{\tau} W = G, W \in \tilde{\alpha}(\Theta). \end{eqnarray}$

(3.8)

We set $A = \mathcal{B} \tilde{ \mathcal{K}}^{-1} \mathcal{B}^T$ which is at least nonnegative definite as discussed earlier. The system is as in Lemma 7.1, thus the existence of a unique solution $(\Theta, W)$ follows, with $Q$ found by postprocessing from (3.6a).

Yet another proof follows ideas in ^[50](Thm 3.2), and is worthwhile mentioning because it extends to the abstract weak formulation of (1.1) in $(X, M)$ for the case when $q \in X$ . We note that the system (3.7) can be written as $\mathcal{M} ([Q, \Theta]^T) = [0, G]^T$ ; the nonlinear operator $\mathcal{M}$ a sum of diagonal matrix of maximal monotone operators $\left[ \begin{array}{cc}\tilde{ \mathcal{K}} & 0\\0 & \partial \Phi_{\alpha}\end{array} \right]$ (we define $\Phi_{\alpha}$ as in Lemma 7.1), and the accretive linear thus Lipschitz operator $\left[ \begin{array}{cc}0 & \mathcal{B}^T\\- \mathcal{B} & 0\end{array} \right]$ . This means that $\mathcal{M}$ is maximal monotone, and that there exists a unique solution to (3.7).

3.2.3. Details

Now we need to define $\tilde{k}, \tilde{c}$ and $\tilde{\alpha}$ . These are needed since we extend (3.3) to the nonlinear case when $k = k(\theta)$ and $w = \alpha(\theta)$ . Typically, we assign these cell-wise based on the $\Theta = (\Theta_{ij})_{ij} \equiv \theta_h$

$\begin{eqnarray} \tilde{k}_{ij} = \begin{cases}k_l, &\omega_{ij} \subset \Omega_l^{h} \\ k_s,& \omega_{ij} \subset \Omega_s^{h} \\ k^*,& \omega_{ij} \subset \Omega_0^{h} \end{cases}; \;\; \tilde{c}_{ij} = \begin{cases}c_l, &\omega_{ij} \subset \Omega_l^{h} \\ c_s,& \omega_{ij} \subset \Omega_s^{h} \\ c^*,& \omega_{ij} \subset \Omega_0^{h} \end{cases}. \end{eqnarray}$

(3.9)

(The value $k^*, c^*$ can be one of many including $\frac{k_l+k_s}{2}$ and $\frac{c_l+c_s}{2}$ , respectively ^[27](Ⅳ.4.1)). Also,

$\begin{eqnarray} \;{\rm{given }}\; (\Theta_{ij})_{ij}, \;{\rm{ define }}\; \Omega_l^h = \bigcup\limits_{\Theta_{ij} > 0} \omega_{ij}; \;\; \Omega_s^h = \bigcup\limits_{\Theta_{ij} < 0} \omega_{ij}; \;\; \Omega_0^h = \bigcup\limits_{\Theta_{ij} = 0} \omega_{ij}. \end{eqnarray}$

(3.10)

(In practice, the set $\Omega_0^{h}$ is empty). From these the formula for $\tilde{\alpha}$ follows by (2.8).

Lastly, we need to make precise which $\Theta$ we use in (3.10). When entering a new time step $t^n$ , we have the previous time step value $\Theta^{n-1}$ . When iterating on (3.6), in iteration $m$ , we have $\Theta^{n, (m-1)}$ to denote the iteration-lagged value, while we seek the new $\Theta^{n, (m)}$ . We must therefore make precise in (3.10) whether it depends on the old $\Theta^{n-1}$ , or on iteration-lagged value $\Theta^{n, (m-1)}$ . Adopting the notation for the sets in (3.10) from that for $\Theta$ we get, e.g., ${\Omega}_l^{h, n-1}$ or ${\Omega}_l^{h, n, (m-1)}$ . In other words, we can calculate $\tilde{k}_{ij}$ from (3.9) as either ${k}_{ij}^{n-1}$ or ${k}_{ij}^{n, (m-1)}$ . These choices give the matrix $\tilde{ \mathcal{K}} = \mathcal{K}^{n-1}$ or $\tilde{ \mathcal{K}} = \mathcal{K}^{n, (m-1)}$ , respectively. Thanks to Assumption 1.1 these have positive entries; see also Section 7.5. Therefore Lemma 3.1 applies to (3.6).

Remark 3.4. The sets in (3.10) are not the same as

$\begin{eqnarray} \;{\rm{given }}\; t > 0, {\rm{and}}\; (\Omega_l(t),\Omega_s(t)), \;{\rm{ define }}\; \overline{\Omega_l^h}(t) = \bigcup\limits_{\omega_{ij} \subset \Omega_l} \omega_{ij}; \;\; \overline{\Omega_s^h}(t) = \bigcup\limits_{\omega_{ij}\subset \Omega_s} \omega_{ij}. \end{eqnarray}$

(3.11)

If neither of $\Omega_l(t), \Omega_s(t)$ is empty, $\Omega \setminus (\overline{\Omega_l^h}(t^n) \cup \overline{\Omega_s^h}(t^n)) \neq \emptyset$ , as illustrated in Figure 3.

Figure 3. Illustrations of a grid over

$\Omega$ , or over

$\mathcal{Q}$ , for (ST) problem and (ST)

$_{\epsilon}$ . Left: the free boundary

$S$ is not aligned with grid cell interfaces, and

$q \not \in H_{\mathrm{div}}(\Omega)$ . Middle: the approximation of

$q$ by

$q_h$ should be reasonable in the (shaded) region

$(\overline{\Omega_l^h}(t^n) \cup \overline{\Omega_s^h}(t^n))$ defined in (3.11), even if its complement in

$\Omega$ is not empty. Right: in (ST)

$_{\epsilon}$ ,

$q \in H_{\mathrm{div}}(\Omega)$ but

$q$ features sharp gradient in the region

$\Omega_{\epsilon}$ defined in (2.13).

DownLoad: Full-Size Img PowerPoint

3.2.4. Nonlinear solver

Next we discuss the nonlinear solver for (3.7). The solution to (3.7) exists and is unique according to Lemma 3.1, but the proof makes a reference to some (iterative) optimisation algorithm to find the desired minimizer $\Theta^{n}$ .

In practice, the use of such a minimisation algorithm may be tedious and is unnecessary. Instead, we solve the problem using Newton's method: we rewrite (3.8) from the proof of Lemma 3.1 in residual form using the single-valued inverse $\beta$ of $\alpha$ .

In each iteration $m = 1, 2, \ldots$ , given $W^{n, (m-1)}$ , $\Theta^{n, (m-1)}$ we solve

$\begin{eqnarray} F_1(\Theta^{n,(m)},W^{n,(m)})& = &\tau \mathcal{B} (\tilde{ \mathcal{K}}^{n,(m-1)})^{-1} \mathcal{B}^T \Theta^{n,(m)} +W^{n,(m)}-\tau G^n = 0, \;\; \end{eqnarray}$

(3.12a)

$\begin{eqnarray} F_2(\Theta^{n,(m)},W^{n,(m)}& = &\Theta^{n,(m)}-\beta(W^{n,(m)}) = 0. \end{eqnarray}$

(3.12b)

This scheme means we are solving (3.12) simultaneously for two variables $\Theta^{n}, W^{n}$ . However, the second part of (3.12) is diagonal, thus we can, instead, eliminate $\Theta^{n, (m)}$ and solve the problem in terms of $W^{n, (m)}$ .

It is known that Newton's iteration is not guaranteed to converge for an arbitrary initial guess even for smooth $F = (F_1, F_2)$ . Now the nonlinear function $\beta$ in (3.12) is only piecewise differentiable, but such case is covered by the theory and practice of semi-smooth Newton methods in ^[52]. In our tests the Newton solver is robust and essentially grid-independent as long as the time step $\tau$ is not too large. We discuss performance of this iteration in Section 3.4.

3.3. Addressing multivalued (ST) problem with P0-P0

As we showed in Section 3.2, there is no difficulty formulating P0-P0 algorithm and solving the fully discrete case of (1.1) with multi-valued $\alpha$ (1.2a).

In fact, given $\Theta^n$ we get $Q^n$ from (3.6a); this gives $q^n_h \in X_h$ . However, the true $q \not \in H_{\mathrm{div}}$ . Thus any attempts to quantify the approximation error for $q_h-q$ must take into account this important discrepancy expressed already in Remark 3.1. Thus there is a question whether the use of $q_h \in X_h$ is appropriate for the (ST) problem. This challenge is somewhat more complex than the pointwise degeneracy with $\beta' = 0$ pointwise handled, e.g., in ^[22], which still keeps $q \in X$ .

At this time we see various avenues to address this challenge. One is to solve (ST) $_{\epsilon}$ formally, and create a sequence $q_h^{\epsilon} \in X_h$ for a collection of $\epsilon > 0$ adjusted to $h$ , and defining $\tilde{q}_h^n$ as their limit. The fluxes $q_h^{\epsilon}$ would be reasonably accurate approximations to $q^{\epsilon} \in X$ , and their limit $\tilde{q}_h^n$ to $q \not \in X$ . One other is to find some approximation $\tilde{q_h^n}$ to $q$ by postprocessing $q_h^n$ . One can also consider projecting $q$ to $X_h$ . We defer further analyses of possible improvements to $q-q_h^n$ to future work.

3.4. Results of P0-P0 algorithm

Now we present examples and study the convergence of P0-P0 approximation for (ST) problem. We focus on the scalar unknowns $(\theta, w)$ , and defer the study of the error $q-q_h$ to future work. We study the approximation error $\theta_{err} = \theta-\theta_h$ and $w_{err} = w-w_h$ . We choose only those norms that are easy to use when analytical solution is not available, and are easy to compare to the theoretical results on P1-based schemes. For completeness, the definitions of these norms are given in Section 7.4.1.

In convergence studies we use uniform spatial and temporal discretization. We also note that when using fine grid, the error rates, especially those for $w$ , are sensitive to interpolation and machine precision; thus, some care in grid refinement is needed to obtain consistent rates.

The goal now is to compare the performance of our P0-P0 scheme with the P1-based schemes from Section 2.4. These results provide confidence in our work on multiple materials, as well as guide the choice of time step $\tau$ depending on $h$ . We use two test cases which we call (RBC) from ^[26] and another called (VV) from ^[44]. These examples feature an assumed free boundary $S$ moving with some given prescribed velocity, $w_{init}$ and time-dependent Dirichlet boundary conditions found from the exact solution. For illustration, the plots of the solutions and their approximations are given in Figure 4 and Figure 5.

Figure 4. Solution to (VV) Example 3.1 with

$M = 10, \tau = 10^{-2}$ at three different times

$t = 0.01, \ t = 0.1,$ and

$t = 0.2$ .

DownLoad: Full-Size Img PowerPoint

Figure 5. Solution to (RBC) Example 3.2 with

$M = 20, \tau = 500$ , at three different time steps

$t = 20000, t = 100000$ , and

$t = 200000$ . See also more details in Figure 13 in Section 7.4.2.

DownLoad: Full-Size Img PowerPoint

Example 3.1. (VV) This example from ^[44] is not connected to any particular physical scenario, but results in very simple mathematical calculations. Let $\Omega = (0, 0.4)$ and $T = 0.2$ , $f = 0$ and $L = c = k = 1$ . Note that since the data is not physical, no particular units are used, even if our code assumes units such as those in . We have the free boundary for (2.2) $S:\psi(x, t) = 0$ , with $\psi (x, t) = -x+t+0.1$ , and

$\begin{eqnarray} (VV)\;\;\; \begin{cases} w(x,t) = 2\left( e^{\psi (x,t) }-1 \right) +1,\; \theta = w-1 &\psi (x,t) \ge 0, \\ w(x,t) = e^{\psi (x,t) }-1,\; \theta = w & \psi(x,t) < 0 \end{cases}. \end{eqnarray}$

(3.13)

In experiments we stop the simulation at $T = 0.2$ , a time at which the free boundary position is still inside $\Omega$ . This choice helps to analyse how well the free boundary is approximated by the different numerical schemes.

Example 3.2. (RBC) example uses realistic physically meaningful data from ^[26]. We have $\Omega = (0, 20)$ [cm], $L = 306$ [J cm $^{-3}$ ], $c_s = 1.90$ and $c_l = 4.19$ [J cm $^{-3}$ $^\circ$ C $^{-1}$ ], $k_s = 0.023$ and $k_l = 0.0058$ [J sec $^{-1}$ cm $^{-1}$ $^\circ$ C $^{-1} ]$ . The free boundary $s(t) = s_0 + v_st$ , and

$\begin{eqnarray} (RBC)\;\; \begin{cases}w = -B +(B+L)e^{\alpha_w (v_s t -x +s_0)},\; \theta = (w-L)/c_l & x \le s(t), \\ w = -B +Be^{\alpha_s (v_s t -x +s_0)},\; \theta = w/c_s &x > s(t), \end{cases} \end{eqnarray}$

(3.14)

where $\alpha_l = v_s c_l/k_l$ and $\alpha_s = v_s c_s/k_s$ , and data as in Table 2. We also use the parameters in Table 3.

Table 3. Parameters in Example 3.2.

$s_0$	$B$	$v_s$
15 $\mathrm{[{cm }]}$	-594 $\mathrm{[{J/cm^3 }]}$	$-5 \times 10^{-5}$ $\mathrm{[{cm/sec }]}$

| Show Table

DownLoad: CSV

3.4.1. Convergence tests

Convergence results for P0-P0 algorithm and (VV) example are given in , with $\tau = h/10$ . We also provide the fine grid-study with $h_{fine} = 6.4 \times 10^{-5}$ in ; these results are very similar to those in , thus they validate our process for using $\theta_{h_{fine}}$ as a proxy for $\theta$ .

Table 4. Temperature error calculated with the exact solution in Example 3.1 (VV) as described in Section 3.4.

(VV) example convergence rates for error with exact solution
$M_x$	$h_x$	$\tau$	$\|\| \theta_{err}\|\|_{\infty, 1}$	Order	$\|\|\theta_{err}\|\|_{\infty, 2}$	Order	$\|\|\theta_{err}\|\|_{2, 2}$	Order
10	$4.0 \times 10^{-2}$	$1.0 \times 10^{-2}$	$5.6635\times 10 ^{-3}$	-	$1.1472\times 10 ^{-2}$	-	$2.4479\times 10 ^{-3}$	-
50	$8.0 \times 10^{-3}$	$2.0 \times 10^{-2}$	$8.6400\times 10 ^{-4}$	1.1682	$1.8488\times 10 ^{-3}$	1.1342	$3.9386\times 10 ^{-4}$	1.1352
250	$1.6 \times 10^{-3}$	$4.0 \times 10^{-4}$	$1.5112\times 10 ^{-4}$	1.0833	$3.0694\times 10 ^{-4}$	1.1157	$6.6943\times 10 ^{-5}$	1.1011
1250	$3.2 \times 10^{-4}$	$8.0 \times 10^{-5}$	$2.8084\times 10 ^{-5}$	1.0456	$5.5618\times 10 ^{-5}$	1.0613	$1.2197\times 10 ^{-5}$	1.0579
6250	$6.4 \times 10^{-5}$	$1.6\times 10^{-5}$	$5.4205\times 10 ^{-6}$	1.0221	$1.0502\times 10 ^{-5}$	1.0358	$2.3245\times 10 ^{-6}$	1.0300
$M_x$	$h_x$	$\tau$	$\|\|w_{err}\|\|_{\infty, 1}$	Order	$\|\|w_{err}\|\|_{\infty, 2}$	Order	$\|\|w_{err}\|\|_{2, 2}$	Order
10	$4.0\times 10^{-2}$	$1.0\times 10^{-2}$	$2.3537\times 10 ^{-2}$	-	$1.1428\times 10 ^{-1}$	-	$2.5765\times 10 ^{-2}$	-
50	$8.0\times 10^{-3}$	$2.0 \times 10^{-3}$	$5.6481\times 10 ^{-3}$	0.8868	$5.7288\times 10 ^{-2}$	0.4291	$1.1383\times 10 ^{-2}$	0.5076
250	$1.6\times 10^{-3}$	$4.0\times 10^{-2}$	$1.4820\times 10 ^{-3}$	0.8313	$3.4627\times 10 ^{-2}$	0.3128	$6.4965\times 10 ^{-3}$	0.3485
1250	$3.2 \times 10^{-4}$	$8.0 \times 10^{-5}$	$2.6131\times 10 ^{-4}$	1.0783	$1.3404\times 10 ^{-2}$	0.5897	$3.0606\times 10 ^{-3}$	0.4677
6250	$6.4 \times 10^{-5}$	$1.6 \times 10^{-5}$	$5.7018\times 10 ^{-5}$	0.9459	$6.5021\times 10 ^{-3}$	0.4495	$1.2067\times 10 ^{-3}$	0.5783

$M_x$	$h_x$	$\tau$	$\|\|\theta_{err}\|\|_{\infty, L^1}$	Order	$\|\|\theta_{err}\|\|_{\infty, L^2}$		Order
10	$4.0\times 10^{-2}$	$1.0\times 10 ^{-2}$	$7.8606\times 10 ^{-3}$	-	$1.5139\times 10 ^{-2}$		-
50	$8.0\times 10^{-3}$	$2.0\times 10 ^{-3}$	$1.5822\times 10 ^{-3}$	0.9960	$3.0487\times 10 ^{-3}$		0.9957
250	$1.6 \times 10^{-3}$	$4.0\times 10 ^{-4}$	$3.1399\times 10 ^{-4}$	1.0048	$6.0509\times 10 ^{-4}$		1.0048
1250	$3.2 \times 10^{-4}$	$8.0\times 10 ^{-5}$	$6.2432\times 10 ^{-5}$	1.0036	$1.2031\times 10 ^{-4}$		1.0036
$M_x$	$h_x$	$\tau$	$\|\|w_{err}\|\|_{\infty, L^1}$	Order	$\|\|w_{err}\|\|_{\infty, L^2}$		Order
10	$4.0\times 10^{-2}$	$1.0\times 10 ^{-2}$	$2.7565\times 10 ^{-2}$	-	$1.0452\times 10 ^{-1}$		-
50	$8.0\times 10^{-3}$	$2.0\times 10 ^{-3}$	$5.7124\times 10 ^{-3}$	0.9779	$5.7020\times 10 ^{-2}$		0.3765
250	$1.6 \times 10^{-3}$	$4.0\times 10 ^{-4}$	$1.1932\times 10 ^{-3}$	0.9730	$2.5900\times 10 ^{-2}$		0.4903
1250	$3.2 \times 10^{-4}$	$8.0\times 10 ^{-5}$	$2.4169\times 10 ^{-4}$	0.9921	$1.1867\times 10 ^{-2}$		0.4849

| Show Table

DownLoad: CSV

Table 5. Convergence error for Example 3.1 (VV) as described in Section 3.4, error calculated with fine grid where

$M_{x, fine} = 6250$ in both

$||\cdot||_{\infty, 1}$ ,

$||\cdot||_{\infty, 2}$ and

$||\cdot||_{\infty, L^1}$ ,

$||\cdot||_{\infty, L^2}$ , in comparison with errors calculated using the exact solution as in Table 4.

(VV) example convergence rates for error with fine grid solution
$M_x$	$h_x$	$\tau$	$\|\| \theta_{err}\|\|_{\infty, 1}$	Order	$\|\|\theta_{err}\|\|_{\infty, 2}$	Order	$\|\|\theta_{err}\|\|_{2, 2}$	Order
10	$4.0 \times 10^{-2}$	$1.0 \times 10^{-2}$	$5.6586\times 10 ^{-3}$	-	$1.1462\times 10 ^{-2}$	-	$2.4464\times 10 ^{-3}$	-
50	$8.0 \times 10^{-3}$	$2.0 \times 10^{-3}$	$8.5860\times 10 ^{-4}$	1.1716	$1.8394\times 10 ^{-3}$	1.1368	$3.9202\times 10 ^{-4}$	1.1377
250	$1.6 \times 10^{-3}$	$4.0 \times 10^{-4}$	$1.4570\times 10 ^{-4}$	1.1021	$2.9652\times 10 ^{-4}$	1.1340	$6.4815\times 10 ^{-5}$	1.1183
1250	$3.2 \times 10^{-4}$	$8.0 \times 10^{-5}$	$2.2663\times 10 ^{-5}$	1.1562	$4.5146\times 10 ^{-5}$	1.1695	$9.9217\times 10 ^{-6}$	1.1661
$M_x$	$h_x$	$\tau$	$\|\|w_{err}\|\|_{\infty, 1}$	Order	$\|\|w_{err}\|\|_{\infty, 2}$	Order	$\|\|w_{err}\|\|_{2, 2}$	Order
10	$4.0 \times 10^{-2}$	$1.0 \times 10^{-2}$	$2.0291\times 10 ^{-2}$	-	$8.8338\times 10 ^{-2}$	-	$1.5892\times 10 ^{-2}$	-
50	$8.0 \times 10^{-3}$	$2.0 \times 10^{-3}$	$5.6438\times 10 ^{-3}$	0.7951	$5.7286\times 10 ^{-2}$	0.2691	$8.7731\times 10 ^{-3}$	0.3691
250	$1.6 \times 10^{-3}$	$4.0 \times 10^{-4}$	$1.1927\times 10 ^{-3}$	0.9658	$2.8148\times 10 ^{-2}$	0.4415	$4.2063\times 10 ^{-3}$	0.4568
1250	$3.2 \times 10^{-4}$	$8.0 \times 10^{-5}$	$2.3874\times 10^{-4}$	0.9995	$1.3043\times 10 ^{-2}$	0.4779	$1.9348\times 10 ^{-3}$	0.4825

$M_x$	$h_x$	$\tau$	$\|\|\theta_{err}\|\|_{\infty, L^1}$	Order	$\|\|\theta_{err}\|\|_{\infty, L^2}$		Order
10	$4.0 \times 10^{-2}$	$1.0\times 10 ^{-2}$	$7.8603\times 10 ^{-3}$	-	$1.5140\times 10 ^{-2}$		-
50	$8.0 \times 10^{-3}$	$2.0\times 10 ^{-3}$	$1.5819\times 10 ^{-3}$	0.9961	$3.0482\times 10 ^{-3}$		0.9959
250	$1.6 \times 10^{-3}$	$4.0\times 10 ^{-4}$	$3.1365\times 10 ^{-4}$	1.0054	$6.0466\times 10 ^{-4}$		1.0051
1250	$3.2 \times 10^{-4}$	$8.0\times 10 ^{-5}$	$6.2148\times 10 ^{-5}$	1.0058	$1.1999\times 10 ^{-4}$		1.0049
$M_x$	$h_x$	$\tau$	$\|\|w_{err}\|\|_{\infty, L^1}$	Order	$\|\|w_{err}\|\|_{\infty, L^2}$		Order
10	$4.0 \times 10^{-2}$	$1.0\times 10 ^{-2}$	$2.4335\times 10 ^{-2}$	-	$1.0102\times 10^{-1}$		-
50	$8.0 \times 10^{-3}$	$2.0\times 10 ^{-3}$	$5.7104\times 10^{-3}$	0.9007	$5.3364\times 10^{-2}$		0.3965
250	$1.6 \times 10^{-3}$	$4.0\times 10 ^{-4}$	$1.1920\times 10^{-3}$	0.9734	$2.5740\times 10^{-2}$		0.4530
1250	$3.2 \times 10^{-4}$	$8.0\times 10 ^{-5}$	$1.9459\times 10^{-4}$	1.1261	$1.1151\times 10^{-2}$		0.5198

| Show Table

DownLoad: CSV

The convergence results for (RBC) using the fine grid solution and exact solution are shown in Tables 6 and 7 respectively. We also tabulate the comparison to P1-based methods in Section 7.4.2 in Table 13.

Table 6. Convergence error with exact solution from (RBC) example described in Example 3.2 from Section 3.4.

(RBC) example convergence rates for error with exact solution
$M_x$	$h_x$	$\tau$	$\|\|\theta_{err}\|\|_{\infty, 1}$	Order	$\|\|\theta_{err}\|\|_{\infty, 2}$	Order	$\|\|\theta_{err}\|\|_{2, 2}$	Order
20	1.0	$5.0 \times 10^{3}$	$7.4093\times 10^{0}$	-	$2.5085\times 10^{0}$	-	$4.5435\times 10^{2}$	-
200	$1.0 \times 10^{-1}$	$5.0 \times 10^{2}$	$5.9623\times 10^{-1}$	1.0944	$2.4650\times 10^{-1}$	1.0076	$4.3065\times 10^{1}$	1.0233
2000	$1.0 \times 10^{-2}$	$5.0 \times 10^{1}$	$2.9571\times 10^{-2}$	1.3045	$7.8455\times 10^{-3}$	1.4972	$2.2161\times 10^{0}$	1.2885
20000	$1.0 \times 10^{-3}$	5.0	$3.1198\times 10^{-3}$	0.9767	$8.2743\times 10^{-4}$	0.9769	$2.4358\times 10^{-1}$	0.9589
$M_x$	$h_x$	$\tau$	$\|\|w_{err}\|\|_{\infty, 1}$	Order	$\|\|w_{err}\|\|_{\infty, 2}$	Order	$\|\|w_{err}\|\|_{2, 2}$	Order
20	1.0	$5.0 \times 10^{3}$	$2.1277\times 10^{2}$	-	$2.0829\times 10^{2}$	-	$3.7168\times 10^{4}$	-
200	$1.0 \times 10^{-1}$	$5.0 \times 10^{2}$	$1.3797\times 10^{1}$	1.1881	$3.9387\times 10^{1}$	0.7233	$9.3584\times 10^{3}$	0.5990
2000	$1.0 \times 10^{-2}$	$5.0 \times 10^{1}$	$1.3045\times 10^{0}$	1.0243	$1.2142\times 10^{1}$	0.5111	$2.8891\times 10^{3}$	0.5104
20000	$1.0 \times 10^{-3}$	5.0	$1.3097\times 10^{-1}$	0.9983	$3.8480\times 10^{0}$	0.4990	$9.1760\times 10^{2}$	0.4981

$M_x$	$h_x$	$\tau$	$\|\|\theta_{err}\|\|_{\infty, L^1}$	Order	$\|\|\theta_{err}\|\|_{\infty, L^2}$		Order
20	1.0	$5.0 \times 10^{3}$	$1.2507\times 10^{1}$	-	$3.3887\times 10^{0}$		-
200	$1.0 \times 10^{-1}$	$5.0 \times 10^{2}$	$1.1890\times 10^{0}$	1.0220	$3.3678\times 10^{-1}$		1.0027
2000	$1.0 \times 10^{-2}$	$5.0 \times 10^{1}$	$9.0292\times 10^{-2}$	1.1195	$2.4144\times 10^{-2}$		1.1445
$M_x$	$h_x$	$\tau$	$\|\|w_{err}\|\|_{\infty, L^1}$	Order	$\|\|w_{err}\|\|_{\infty, L^2}$		Order
20	1.0	$5.0 \times 10^{3}$	$1.9895\times 10^{2}$	-	$2.0449\times 10^{2}$		-
200	$1.0 \times 10^{-1}$	$5.0 \times 10^{2}$	$1.9028\times 10^{1}$	1.0193	$6.2413\times 10^{1}$		0.5154
2000	$1.0 \times 10^{-3}$	$5.0 \times 10^{1}$	$1.5034\times 10^{0}$	1.1023	$1.2142\times 10^{1}$		0.7110

| Show Table

DownLoad: CSV

Table 7. Convergence error calculated with fine grid for Example 3.2 (RBC) from Section 3.4.

(RBC) example convergence rates for error with fine grid solution
$M_x$	$h_x$	$\tau$	$\|\|\theta_{err}\|\|_{\infty, 1}$	Order	$\|\|\theta_{err}\|\|_{\infty, 2}$	Order	$\|\|\theta_{err}\|\|_{2, 2}$	Order
20	1.0	$5.0 \times 10^{3}$	$7.4070\times 10^{0}$	-	$2.5080\times 10^{0}$	-	$4.5425\times 10^{2}$	-
200	$1.0 \times 10^{-1}$	$5.0 \times 10^{2}$	$5.9330\times 10^{-1}$	1.0964	$2.4588\times 10^{-1}$	1.0086	$4.2914\times 10^{1}$	1.0247
2000	$1.0 \times 10^{-2}$	$5.0 \times 10^{1}$	$2.6568\times 10^{-2}$	1.3489	$7.0525\times 10^{-3}$	1.5424	$1.9892\times 10^{0}$	1.3339
$M_x$	$h_x$	$\tau$	$\|\|w_{err}\|\|_{\infty, 1}$	Order	$\|\|w_{err}\|\|_{\infty, 2}$	Order	$\|\|w_{err}\|\|_{2, 2}$	Order
20	1.0	$5.0 \times 10^{3}$	$2.1277\times 10^{2}$	-	$2.0829\times 10^{2}$	-	$3.4425\times 10^{4}$	-
200	$1.0 \times 10^{-1}$	$5.0 \times 10^{2}$	$1.2880\times 10^{1}$	1.2180	$3.8145\times 10^{1}$	0.7372	$7.4205\times 10^{3}$	0.6664
2000	$1.0 \times 10^{-2}$	$5.0 \times 10^{1}$	$1.2954\times 10^{0}$	0.9975	$1.2142\times 10^{1}$	0.4972	$2.8891\times 10^{3}$	0.4097

$M_x$	$h_x$	$\tau$	$\|\|\theta_{err}\|\|_{\infty, L^1}$	Order	$\|\|\theta_{err}\|\|_{\infty, L^2}$		Order
20	1.0	$5.0 \times 10^{3}$	$1.2505\times 10^{1}$	-	$3.3883\times 10^{0}$		-
200	$1.0 \times 10^{-1}$	$5.0 \times 10^{2}$	$1.1877\times 10^{0}$	1.0224	$3.3632\times 10^{-1}$		1.0032
2000	$1.0 \times 10^{-2}$	$5.0 \times 10^{1}$	$8.9758\times 10^{-2}$	1.1216	$2.3903\times 10^{-2}$		1.1483
$M_x$	$h_x$	$\tau$	$\|\|w_{err}\|\|_{\infty, L^1}$	Order	$\|\|w_{err}\|\|_{\infty, L^2}$		Order
20	1.0	$5.0 \times 10^{3}$	$1.9401\times 10^{2}$	-	$2.0449\times 10^{2}$		-
200	$1.0 \times 10^{-1}$	$5.0 \times 10^{2}$	$1.9025\times 10^{1}$	1.0085	$6.2413\times 10^{1}$		0.5154
2000	$1.0 \times 10^{-2}$	$5.0 \times 10^{1}$	$1.3806\times 10^{0}$	1.1393	$1.1519\times 10^{1}$		0.7339

| Show Table

DownLoad: CSV

Summary of convergence of P0-P0 schemes: Generally, we see that our P0-P0 schemes converge roughly with first order in $\theta$ and half order in $w$ . These rates are similar to those for P1-P0 schemes from Section 2.4. However, our P0-P0 schemes seem to improve on P1-based schemes qualitatively and quantitatively; in particular, we see improvement in the quality of approximations to the enthalpy, which seems due to the lack of consistency errors such as those for Chernoff formulas or phase relaxation.

We acknowledge that this might be due to only testing in $d$ = $1$ ; nevertheless, these results are promising.

3.5. Robustness of nonlinear solver

Last but not least we discuss performance of the nonlinear solver for (3.7) since it works without regularization for (ST) problem, and we have not found a discussion for (ST) problem in the literature.

The maximum number of iterations required for each case to converge is listed in . The solver uses for stopping criterium a combination of absolute tolerance of $10^{-12}$ and a relative tolerance of $10^{-6}$ and $\tau = h/10$ , relative to the first iteration.

Table 8. Newton iterations for Examples 2 (VV), 3 (RBC), and multiple material Example 4.2 as discussed in Section 3.5.

(VV)		(RBC)		(4.2)
Table 5		Table 7		Table 10
$M_x$	iter	$M_x$	iter	$M_x$	iter
10	4	20	4	20	3
50	5	200	5	100	5
250	5	2000	5	500	5
1250	5	-	-	2500	5

| Show Table

DownLoad: CSV

Overall, the solver performs reliably under a variety of circumstances, including with uniform coefficients used in Example 3.1 (VV), realistic thermal properties used in Example 3.2 (RBC) and in the presence of multiple materials in Example 4.2 to be discussed below. It is also important that the performance seems to be mesh independent.

Though not illustrated in , we see in practice that the number of Newton iterations may increase with larger discrepancies between parameters and larger latent heat $L$ values. We add the notes on the specifics of the heterogeneous permafrost case in Section 5.

3.6. Summary

More work on the theoretical underpinnings of P0-P0 algorithm is needed for Stefan problem, but overall our P0-P0 (CCFD) algorithm seems well suited to all the choices of $\alpha$ in (1.2) including even the most challenging case of $\alpha ^{ST}(\cdot)$ .

4. Stefan problem with heterogeneity, and its P0-P0 approximation allowing for thermal resistivity of interface

We consider now the main challenge addressed in this paper, that of simulation of phase change problem in a heterogeneous domain, a generalization of (2.2) to the case when the region $\Omega$ is occupied by $N_{MAT}$ materials, each in $\Omega^{(j)}, j = 1, \ldots N_{MAT}$ , with interfaces denoted by $\Gamma_{ij} = \partial \Omega^{(i)} \cap \partial \Omega^{(j)}$ , and $\Gamma = \bigcup_{ij} \Gamma_{ij}$ . We also have $\Sigma = \Gamma \times (0, T)$ . See and for illustration. The data corresponding to material $(j)$ are denoted by $c^{(j)}_l, c^{(j)}_s, k^{(j)}_l, k^{(j)}_s, \theta_{fr}^{(j)}, L^{(j)}$ . With these we formulate $\alpha^{(j)}$ as in (2.8), and assume that the data satisfies Assumption 1.1, and that $\Gamma$ is at least as smooth as $\partial \Omega$ . We denote the appropriate functional spaces local to $\Omega^{(j)}$ with superscript $^{(j)}$ , e.g., in $X^{(j)}$ , and so on.

Figure 6. Illustration of thermal conduction in domains where the fluxes are continuous as in (4.1b) but the temperature appears to take a jump modeled by (4.1c). Left: two materials separated by a layer of third material with very low conductivity; temperature in function of

$x$ is shown as a black curve. Middle: the same materials as on left when the width of the interface is very small, thus it is only practical to model this region as low dimensional interface, and the temperature features behavior with a jump. Right: the results of simulation of stationary and non-stationary heat conduction in Example 4.1 similar to the case illustrated in the middle.

DownLoad: Full-Size Img PowerPoint

Figure 7. Solutions to Example 4.1 in Section 4.3 obtained with monolithic CCFD, with comparison to the corresponding stationary solution

$\theta^{stat}$ given in (7.15). The graphs correspond to the different ratios

$\tfrac{k^{(2)}}{k^{(1)}}$ and

$\rho_{R}$ as indicated. The magnitude of the jump

$[\theta_h]_{\Gamma}$ scales with

$\rho_{R}$ , as expected, but is robust with respect to

$\tfrac{k^{(2)}}{k^{(1)}}$ . For small

$\tfrac{k^{(2)}}{k^{(1)}}$ and small

$t$ , the jump

$[\theta_h]_{\Gamma}$ is sensitive to

$h$ .

DownLoad: Full-Size Img PowerPoint

Let (2.2) hold in each $\Omega^{(j)}$ . Denote by $w^{(j)}, \theta^{(j)}$ the restriction of $w, \theta$ to $\Omega^{(j)}$ , so we have

$\begin{eqnarray} \partial_t w^{(j)} + \nabla \cdot q^{(j)} = f^{(j)}; \; q^{(j)} = - k\nabla \theta^{(j)}, \; w^{(j)} \in \alpha^{(j)}(\theta^{(j)}). \end{eqnarray}$

(4.1a)

This problem requires some initial conditions and some boundary conditions on $\partial \Omega$ .

We also need some interface conditions on each $\Gamma_{ij}$ . For these, it is natural to assume (i) continuity of fluxes, and (ii) of the temperatures across each $\Gamma_{ij}$ . However, in some applications including PCM, the condition (ii) must be relaxed to model the heat conduction across a very thin region of very low heat conductivity. When the width of that layer is very small compared to the width of the regions surrounding this layer, and if the interface region is approximated by a lower dimensional interface $\Gamma_{ij}$ , the temperature appears to take a jump, since the limits of temperature from both sides of $\Gamma_{ij}$ are quite distinct, while the flux is preserved ^[37]. See Figure 6 for illustration.

This jump condition is formulated, e.g., in ^[29,53,54]

$\begin{eqnarray} q^{(i)}\cdot \nu = q^{(j)} \cdot \nu, \;{\rm{ on }}\; \Gamma_{ij}. \end{eqnarray}$

(4.1b)

$\begin{eqnarray} \theta^{(j)} -\theta^{(i)} = - \rho_{R} q^{(i)} \cdot \nu, \; \;{\rm{ on }}\; \Gamma_{ij}. \end{eqnarray}$

(4.1c)

where $\rho_{R}\geq0$ is called "thermal resisitivity" and where the orientation of the unit normal $\nu$ to $\Gamma_{ij}$ is from $\mathcal{Q}_i$ to $\mathcal{Q}_j$ . (More generally, one can consider $\rho_{R}$ to be specific to an interface).

If $\rho_{R} = 0$ or the fluxes $q^{(j)}$ vanish, we have continuity of temperatures $\theta$ across $\Gamma_{ij}$ (but not necessarily of $u$ ). If $\rho_{R} > 0$ , the condition (4.1c) is a Robin condition which slows down the process of reaching thermal equilibrium across the interface. This latter case is important for applications. However, its mathematical and computational challenges have not been well studied. We address some of the challenges that (4.1) brings from theoretical and algorithmic point of view when $\rho_{R} > 0$ .

The mixed formulation is particularly convenient for problems involving interfaces and multiple materials. In particular, the condition (4.1b) prescribes the continuity of normal fluxes across $\Gamma$ which is naturally preserved for the fluxes $q \in X$ . The condition also prescribes a possible jump of the scalar unknowns, but this is not an issue when extending $\theta \in M$ . Therefore, if the problems on each $\Omega^{(j)}$ are well posed in $X^{(j)} \times M^{(j)}$ for each $j$ , then the global problem can be well studied in $X \times M$ ; see, e.g., ^[55,56]. Similarly, one can easily define the P0-P0 (CCFD) algorithms on each subdomain coupled to the discrete counterparts of (4.1b)–(4.1c).

As concerns the overall solution algorithm based on CCFD for (4.1a), in principle, some domain decomposition approach iteration is required to satisfy (4.1b)–(4.1c). However, one of our contributions in this paper is that we are able to formulate a monolithic P0-P0 algorithm on $\Omega$ . We also formulate various theoretical results and estimates for $q_h \in X_h$ . We note however that even though we do not state approximation properties in this paper, these may involve $\theta$ considered in some broken spaces such as $\mathcal{H}(\Omega) = H^1(\Omega^{(1)}) \times H^1(\Omega^{(2)})$ rather than $H^1(\Omega)$ due to the discontinuity of $\theta$ across $\Gamma$ .

After some literature review in Section 4.1, we formulate and illustrate our algorithms in Sections 4.2 for linear and nonlinear problems when $q \in X$ . We extend these to (ST) problem and apply the same scheme, even if some of the theoretical results do not apply when $q \not \in X$ . We provide examples and study convergence in Section 4.3.2.

4.1. Literature on heterogeneous Stefan pbm

The mathematical literature on (4.1) is not abundant, but we review what is available. The problem is a special case of more general heterogeneous nonlinear parabolic problem in which

$\begin{eqnarray} \theta_{fr} = \theta_{fr}(x); L = L(x); c = c(x;\theta); k = k(x;\theta). \end{eqnarray}$

(4.2)

When the data (4.2) vary smoothly as functions of its first argument, it is possible to apply Kirchhoff transformation on $\Omega$ but this introduces various lower order terms in the PDE. This approach considered, e.g., in ^[57,58], allows some well-posedness analysis.

Another class of approaches in ^[59,60,61] study a problem similar to Stefan problem via minimization of a collection of normal convex integrands, and assumes smoothness of $\alpha(x; \theta)$ in $x$ , for an application with phase change of a different type where the multivalued graph $\alpha(x; \cdot)$ representing the solubility constraint depends smoothly on $x$ . At the same time, the challenges discussed here are similar to those revealed by simulations in ^[61] when applied to piecewise constant data.

The piecewise constant case (1.3) of interest in this paper does not allow Kirchhoff transformation but is also most realistic. The well-posedness of the case $\rho_{R} > 0$ is studied in the primal setting in ^[29] under external Neumann boundary conditions on $\partial \Omega$ and with some initial data. Existence of solutions is proven for $\rho_{R} > 0$ , with estimates which allow the limiting case $\rho_{R} \downarrow 0$ . Uniqueness is also proven ^[29]. For $\rho_{R} > 0$ , the paper predicts $w\in L^{\infty}(Q)$ , $\theta\in L^2(\mathcal{H})$ , and that the jump is

$\begin{eqnarray} {\rm{(i) }}\;\; ||{[\theta]}||_{{L^2(\Sigma)}} = O(\sqrt{ \rho_{R}}), \;\; {\rm{ (ii) }}\;\;||{\partial_t w}||_{{L^2(\mathcal{H}')}}\leq C_1 +\tfrac{C_2}{ \sqrt{ \rho_{R}}}. \end{eqnarray}$

(4.3)

We note that as as $\rho_{R} \downarrow 0$ , the estimates (4.3) suggest continuity of the temperature across $\Gamma$ but also predict the blowup of $\partial_t w$ .

As concerns numerical approximation, ^[54] describes the approximation in $(V^{(j)}_h)_j$ similar to the P1-P1 schemes described in Section 2.4. A-priori bounds for $(\theta^n_h, w^n_h)$ including those similar to (4.3) are proved in ^[53] for $\rho_{R} > 0$ . Finally, a time discrete approximation to $\partial_t w_h$ is measured in some seminorms which involve $\frac{h^2}{\tau^2}$ , $\frac{1}{\sqrt{ \rho_{R}}}$ ; these suggest to use $\frac{h}{\tau}$ bounded as $h \to 0, \tau \to 0$ , and $\rho_{R}$ such that $\frac{\tau^2}{ \rho_{R}} \to 0$ . Based on these a-priori bounds, convergence to the solution corresponding to $\rho_{R} = 0$ is established. However, no rate of convergence for the approximation errors $\theta-\theta_h$ is given.

Last but not least we mention other problems and applications which feature jumps of the primary unknowns and fluxes. These works fit in the framework of heterogeneous domain decomposition ^[56] and inform our results, but are not closely related. The closest is our prior work on the domain decomposition approach for semiconductor modeling of heterogeneous junctions in ^[62,63,64]. Further, there is closely related substantial work on multiphase flow and multiple rock types in, e.g., ^[1,2] as well as the abundant literature on Beavers-Joseph conditions on the Stokes-Darcy interfaces, e.g. in ^[65] and other heterogenous domain decomposition settings.

4.2. P0-P0 scheme

We approximate (4.1) with P0-P0 discrete scheme from Section 3. We recall that even though the formal approximation with $X \ni q \approx q_h \in X_h$ is appropriate only for the single-valued setting $w = \alpha(\theta)$ (as in Remark 3.1), the P0-P0 algorithm produces discrete conservative fluxes $q_h \in X_h$ also for the multivalued case $w \in \alpha(\theta)$ .

We start with P0-P0 (or CCFD) algorithm on each material domain, a counterpart of (3.7), which we couple with discrete counterparts of (4.1b)–(4.1c). Then we show that this domain decomposition formulation can be handled by a monolithic solver. With this, the theoretical results including convergence analyses available for single domain formulations in $X^{(j)}_h \times M^{(j)}_h$ extend to that on $X_h \times M_h$ (as long as $q \in X$ ). Moreover, the problem does not require any iteration on the interface.

We explain the details first for the linear heat equation with the notation from Section 3.2; the generalization to the nonlinear problem is straightforward. For simplicity, we focus on only two materials $N_{MAT} = 2$ which occupy the domains $\Omega^{(1)}, \Omega^{(2)}$ separated by an interface $\Gamma = \Gamma_{12}$ .

4.2.1. P0-P0 formulation with mixed finite elements

We rewrite (4.1) in a time-discrete mixed form, with subdomain problems (4.4a)–(4.4b) to be solved by $(\theta^{n, (j)}, q^{n, (j)})_{j}$ . We also make precise the initial, external, and coupling boundary conditions (4.4c) and coupling via (4.4d)

$\begin{eqnarray} (k^{(j)})^{-1}q^{(j)}& = &-\nabla \theta^{(j)}, \; {x \in \Omega^{(j)}}, t > 0 \end{eqnarray}$

(4.4a)

$\begin{eqnarray} \nabla \cdot q^{(j)}+\tfrac{1}{\tau}w^{(j)}& = &g, \; \; x \in \Omega^{(j)},\; w^{(j)} = \alpha^{(j)}(\theta^{(j)}). \end{eqnarray}$

(4.4b)

$\begin{eqnarray} w^{(j)}(x,0) = w_{init}(x)\vert_{\Omega^{(j)}}&,&\;\; \theta\vert_{\partial\Omega \cap \partial \Omega^{(j)}} = \theta_D \vert_{\partial\Omega \cap \partial \Omega^{(j)}}, \end{eqnarray}$

(4.4c)

$\begin{eqnarray} (q^{(2)}-q^{(1)})\cdot \nu_{\Gamma}& = &0,\; \theta^{(2)}-\theta^{(1)} = - \rho_{R} q^{(1)}, (x,t) \in \Sigma. \end{eqnarray}$

(4.4d)

We approximate (4.4) with P0-P0 algorithm for (4.4a)–(4.4c) on each $\Omega^{(j)}$ as in Section 3, each implemented as CCFD, and corresponding to a block linear system similar to (3.3) is solved, with off-diagonal coupling terms expressing (4.4d).

This coupling could be solved by iteration; we refer to ^[56] for a discussion of domain decomposition iterative algorithms. In contrast, we propose to satisfy (4.4d) without an iteration which simplifies the implementation substantially. We explain the ideas briefly for $d = 1$ in Section 4.2.2.

4.2.2. Monolithic scheme

Let $\Omega = (a, b)$ and $\Omega^{(1)} = (a, x^*)$ and $\Omega^{(2)} = (x^*, b)$ , with a grid $(\omega_i)_i$ covering $\Omega^{(1)} \cup \Omega^{(2)}$ so that $\Gamma = x^*$ is one of the cell edges. With global numbering of the cells $\omega_i$ in $\Omega$ , we have $i = 1, \ldots j^*, j^*+1, \ldots M$ , where $\Gamma = x^* = x_{i*+1/2} = \gamma_{i*+1/2}$ is between some cells $\omega_{i*}$ and $\omega_{i*+1}$ . Now the problem is discretized as in Section 3.2 with $q^{(1)}_h$ and $q^{(2)}_h$ identified by the edge values $Q^{(1)} = (Q_{1/2}, Q_{3/2}, \ldots Q_{i*+1/2}^-)$ and $Q^{(2)} = (Q_{i*+1/2}^+, Q_{i^*+3/2}, \ldots Q_{M+1/2})$ , respectively. We note the double value of the flux $Q_{i*+1/2}^-, Q_{i*+1/2}^+$ to be set equal due to (4.4d). The temperatures $\theta^{(1)}_h \in M_h^{(1)}$ and $\theta^{(2)}_h \in M_h^{(2)}$ , respectively, are identified by the cell values $\Theta^{(1)} = (\Theta_1, \Theta_2, \ldots \Theta_{i*})$ , and $\Theta^{(2)} = (\Theta_{i*+1} \ldots \Theta_{M})$ . Finally, the Dirichlet conditions at the external boundaries are with $\Theta^*_{1/2} = \theta_D(x_{1/2})$ , and $\Theta^*_{M+1/2} = \theta_D(x_{M+1/2})$ . These enter (3.3) as in Remark 3.3.

The approximation to the first part of the interface condition (4.4d) equates the fluxes $Q_{i*+1/2}^- = Q_{i*+1/2}^+$ . We also require Dirichlet values $\Theta_{\ast}^-$ and $\Theta_{\ast}+$ to be used by each subdomain problem at $x^*$ so that the second part of (4.4d) holds. Finding these is part of the problem. In summary, the approximation to (4.4) is as follows.

$\begin{eqnarray} {\rm{Find}}\; \Theta_{\ast}^- \; {\rm{and}} \; \Theta_{\ast}^+ {\rm{so\; that \;the\; discrete \;counterpart \;of\; (4.4d) \;holds}} \\ Q_{i*+1/2,j}^- = Q_{i*+1/2,j}^+; \;\;\;\; \; \Theta_{\ast}^– \Theta_{\ast}^+ = - \rho_{R} Q_{i*+1/2,j}^-,\end{eqnarray}$

(4.5a)

where we find $Q_{i*+1/2, j}^-$ and $Q_{i*+1/2, j}^+$ from the solutions on $\Omega^{(1)}$ and $\Omega^{(2)}$ defined as follows.

$\begin{eqnarray} {\rm{Use}}\; \Theta^*_{1/2} \;{\rm{and}} \; \Theta_{\ast}^- {\rm{as\; Dirichlet\; boundary \;conditions\; in\; (3.3)\; and\; find}}\\(\Theta_1\ldots \Theta_{i*}) \rm{ and }(Q_{1/2},Q_{3/2}, \ldots Q_{i*-1/2}, Q_{i*+1/2}^-). \end{eqnarray}$

(4.5b)

$\begin{eqnarray} {\rm{Use}} \; \Theta_{\ast}^+ \;{\rm{and }}\; \Theta^*_{M+1/2} \; {\rm{as \;Dirichlet\; boundary\; conditions\; in\; (3.3)\; and\; find \;}} \\(\Theta_{i*+1},\Theta_{i^*+2},\ldots \Theta_{M}) \;{\rm{ and } } \;(Q_{i*+1/2}^+,Q_{i^*+3/2}, \ldots Q_{M+1/2}). \end{eqnarray}$

(4.5c)

The solution to (4.5) requires formulation of appropriate matrices $\mathcal{K}^{(1)}$ , $\mathcal{K}^{(2)}$ to solve (4.5b) and (4.5c), respectively. These come from the transmissivities $(\mathcal{T}_{i\pm1/2})_i$ on every $\gamma_{i+1/2} \in \Omega^{(j)}$ and are given as in Section 7.5 by (7.11). We also have two boundary edge factors $\mathcal{T}_{i*+1/2}^{-}$ and $\mathcal{T}_{i*+1/2}^{+}$ given by (7.13) independently on $\Omega^{(1)}$ and $\Omega^{(2)}$ , respectively.

The problem (4.5) could be solved by iteration. However, our idea is that (4.5) can be solved by a much simpler monolithic CCFD solver on $\Omega$ for $\Theta = (\Theta^{(1)}, \Theta^{(2)})$ and $Q = (Q^{(1)}, Q^{(2)})$ for which $Q_{j*+1/2}^- = Q_{j*+1/2}^+$ . The system takes the form

$\begin{eqnarray} { \mathcal{K}}^{ \rho_{R}} Q + \mathcal{B}^T \Theta& = &0; \end{eqnarray}$

(4.6a)

$\begin{eqnarray} - \mathcal{B} Q + \mathcal{C} W & = &G; W = \alpha(\Theta). \end{eqnarray}$

(4.6b)

The definition of $\mathcal{B}$ is a straightforward extension of $\mathcal{B}^{(j)}$ , but $\mathcal{K}^{ \rho_{R}}$ involves $\mathcal{K}^{(j)}$ as well as an appropriately chosen transmissibility $\mathcal{T}_{i*+1/2}^{\rho}$ to guarantee (4.4d).

Lemma 4.1. Solution to (4.5) exists and is unique; it is equivalent to that of (4.6) provided

$\begin{eqnarray} \mathcal{T}^{\rho}_{i*+1/2} = \frac{ \mathcal{T}^- \mathcal{T}^+}{ \mathcal{T}_-+ \mathcal{T}^+ + \mathcal{T}^- \mathcal{T}^+ \rho_{R}}, \end{eqnarray}$

(4.7)

where we use the shorthand notation $\mathcal{T}^- = \mathcal{T}_{i*+1/2, j}^-$ and $\mathcal{T}^+ = \mathcal{T}_{i*+1/2, j}^+$ .

Proof. The proof is purely algebraic. Assume (4.5) is satisfied. Denote $Q^- = Q_{j*+1/2}^-$ , $Q^+ = Q_{j*+1/2}^+$ , so (4.5a) holds for some $\Theta_{\ast}^-, \Theta_{\ast}^+$ , $Q^-, Q^+$ . We know also $\Theta_{i*}, \Theta_{i*+1}$ from (4.5b) and (4.5c).

1. Recall $Q^{-} = \mathcal{T}^-(\Theta_{i*}- \Theta_{\ast}^-)$ and $Q^{+} = \mathcal{T}^+(\Theta_{\ast}^+-\Theta_{i*+1})$ . Setting these equal from the first condition in (4.5a), and expressing $Q^-$ from the second part of (4.5a), we work to eliminate $Q^+$ and get a formula for $\Theta^-$ in terms of $\Theta_{i*}$ and $\Theta_{i*+1}$ . After some lengthy calculations we get

$\begin{eqnarray*} \Theta_{\ast}^- = ( \mathcal{T}^-(1+ \mathcal{T}^+ \rho) + \mathcal{T}^+)^{-1} \left[ \mathcal{T}^-(1+ \mathcal{T}^+\rho)\Theta_{i*} + \mathcal{T}^+ \Theta_{i*+1}\right]. \end{eqnarray*}$

2. Next we recalculate $Q^{-} = \mathcal{T}^-(\Theta_{\ast}^–\Theta^-_{i*})$ . After some algebra we get

$\begin{eqnarray*} Q^- = \frac{ \mathcal{T}^- \mathcal{T}^+}{ \mathcal{T}^-+ \mathcal{T}^++ \mathcal{T}^- \mathcal{T}^+ \rho}(\Theta_{i*}-\Theta_{i*+1}). \end{eqnarray*}$

3. Now the expression on the right hand side can be written as

$\begin{eqnarray*} Q^- = \mathcal{T}^{\rho}_{i*+1/2} (\Theta_{i*}-\Theta_{i*+1}). \end{eqnarray*}$

This provides the definition of (4.7), i.e., of $\mathcal{K}^{ \rho_{R}}$ for the monolithic formulation. Existence and uniqueness of the solutions to (4.6) follows directly from that for (3.3) with the special definition of $\mathcal{T}^{\rho}_{i*+1/2, j}$ which satisfies the same properties as all other $\mathcal{T}_{i \pm 1/2}$ .

Remark 4.1. When $\rho_{R} = 0$ , (4.7) reduces to give $\mathcal{T}_{i*+1/2}$ as the usual weighted harmonic average of $k_{i*}$ and $k_{i^*+1}$ in (7.11), and provides the interface value of $\Theta_{\ast} = \Theta_{\ast}^- = \Theta_{\ast}^+$

$\begin{eqnarray*} \mathcal{T}_{i*+1/2} : = \frac{ \mathcal{T}^- \mathcal{T}^+}{ \mathcal{T}_-+ \mathcal{T}^+ + \mathcal{T}^-}, \;\; \Theta_{\ast} = \frac{T^-}{T^-+T^+} \Theta_{i*} + \frac{T^+}{T^-+T^+}\Theta_{i*+1}. \end{eqnarray*}$

In other words, a monolithic CCFD approach on matching grids is equivalent to the domain decomposition approach. This observation is perhaps not new, but is nevertheless very useful.

Corollary 4.1. The results in Lemma 4.1 easily extend to $d = 2$ when $\Gamma$ is aligned with cell interfaces. In addition, these results extend to nonlinear problems such as (3.6) and (3.7) with

$\begin{eqnarray*} w = \alpha^*(\theta) \;{\rm{means}}\; w(x,t) = \alpha^{(j)}(\theta(x,t)) \;{\rm{for \;a.e.}}\; x \in \Omega^{(j)},t > 0. \end{eqnarray*}$

Here in each time step $n$ and iteration $(m)$ one updates the transmissibilities $\mathcal{T}_{i\pm 1/2}^{n, (m)}$ based on the current guess of $W^{n, (m)}$ . This system uses special interface transmissivities involving $\rho_{R}$ resulting in the dependence of matrix $\tilde{ \mathcal{K}}^{n, (m)}_{ \rho_{R}}$ on $\rho_{R}$ .

Remark 4.2. Convergence of the solutions $(q_h, \theta_h)$ to $(q, \theta)$ for the linear problem is expected to be qualitatively similar to the case without jump, since, upon Lemma 4.1, it follows from those for CCFD on each domain, e.g., in ^[48]. At the same time, the solution $\theta\in M$ is not globally smooth enough, thus broken norms must be used when referring to the approximation error's dependence on higher order norms.

We provide an example with a numerical approximation in Section 4.3. We also prove theoretical estimates on the jump $[\theta]_{\Gamma}$ .

4.2.3. Estimates of the jump on the interface

We aim to derive results similar to (4.3) in the mixed setting. We write the fully discrete mixed form of (4.4), and work with the discrete counterparts of $\theta^{(j)}, w^{(j)}, q^{(j)}$ , with $w^{(j)}\in \alpha^{(j)}(\theta^{(j)})$ . As explained for (4.6), we use $\tilde{k}$ which can be $k(\theta^n)$ , or $k(\theta^{n-1})$ or $k(\theta^{n, (m)})$ , formed, respectively, either in a fully implicit, time-lagging or iteration lagging fashion.

Lemma 4.2. Let $\theta_D = 0$ . Then the numerical solution of (4.1) satisfies, at every $t^n$

$\begin{eqnarray} ||{\theta_h^{n,(1)}-\theta_h^{n,(2)}}||_{{L^2(\Gamma)}} = O(\sqrt{ \rho_{R}}). \end{eqnarray}$

(4.8)

Proof. We suppress $h$ and $n$ . The weak mixed formulation of (4.1a) after integrating and adding the equations over $(j)$ reads: find $(q, \theta) \in X_h \times M_h$ such that for every $\psi \in X_h, \eta \in M_h$ we have the following

$\begin{eqnarray} \int_{\Omega} \tilde{k}^{-1} q \cdot \psi - \int_{\Omega} \theta \nabla \cdot \psi + \int_{\partial \Omega} \theta \psi \cdot \nu + \int_{\Gamma} (\theta^{(1)}-\theta^{(2)}) \psi \cdot n& = &0, \end{eqnarray}$

(4.9a)

$\begin{eqnarray} \int_{\Omega} \eta \nabla \cdot q + \tfrac{1}{\tau}\int_{\Omega} w \eta & = & \int_{\Omega} g \eta. \end{eqnarray}$

(4.9b)

To derive the estimates we now choose $\psi = q$ and $\eta = \theta$ . Adding the two equations and cancelling the skew-symmetric terms we obtain, after rearranging

$\begin{eqnarray*} \int_{\Omega} \tilde{k}^{-1} q \cdot q + \tfrac{1}{\tau}\int_{\Omega} w \theta + \int_{\Gamma} (\theta^{(1)}-\theta^{(2)}) q \cdot n = \int_{\Omega} g \theta - \int_{\partial \Omega} \theta_D q \cdot n. \end{eqnarray*}$

The right hand side now involves the boundary and the right hand side $g$ . We have control over the boundary term since $\theta_D = 0$ . The first term on the left hand side is nonnegative. The second term can be bounded from below for $\alpha ^{ST}$ , as well as for $\alpha ^{ST_{\epsilon}}$ and $\alpha^{(P)}$

$\begin{eqnarray*} \int_{\Omega} w\theta = \int_{\Omega_l} w\theta + \int_{\Omega_s} w\theta \geq c_{\alpha}\int_{\Omega} \theta^2. \end{eqnarray*}$

In fact, for (ST) problem we have $c_{\alpha} = \min_j((c^{(j)}_l), (c^{(j)}_s))$ . For (ST) $_{\epsilon}$ , we have $c_{\alpha} = \min_j((c^{(j)}_l), (c^{(j)}_s), \epsilon^{-1})$ , with a similar calculation for $\alpha^{(P)}$ from Section 5. Then we apply Cauchy-Schwarz inequality to the integral $\int_{\Omega}g\theta \leq ||{g}||_{{L^2}} ||{\theta}||_{{L^2}}$ and follow up with the inequality $|{a}||{b}| \leq \tfrac{a^2}{2s}+\tfrac{sb^2}{2}$ . With sufficiently large $s$ we can now move the term $\tfrac{||{\theta}||_{{L^2}}^2}{2s}$ to the left hand side to balance it with $c_{\alpha}||{\theta}||_{{L^2}}$ . We conclude that $\int_{\Gamma} (\theta^{(1)}-\theta^{(2)}) q \cdot n \leq \tfrac{s||{g}||_{{L^2}}^2}{2} = C$ thus, upon (4.4d) we finally obtain

$\begin{eqnarray} \int_{\Gamma} (\theta^{(1)}-\theta^{(2)}) q \cdot n = \int_{\Gamma} \tfrac{1}{ \rho_{R}} (\theta^{(1)}-\theta^{(2)})^2 \leq C, \end{eqnarray}$

(4.10)

from which (4.8) follows.

This result is similar to that predicted in (4.3) ^[29] in the primal formulation. With more work, we can also cover the case $\theta_D \neq 0$ .

Corollary 4.2. The estimate in Lemma 4.2 extends to the solutions $(q, \theta)$ for (ST) $_{\epsilon}$ or (P) problems with single-valued $\alpha$ when $q \in X$ .

4.3. Numerical examples of P0-P0 for heterogeneous domain with two materials

Now we illustrate the problem (4.1) with simulations using our P0-P0 solver.

4.3.1. Linear heat equation with a jump

We consider first an example in $d = 1$ for which we study the dependence of the jump of $\theta$ on $\rho$ .

Example 4.1. Let $\Omega = (a, b)$ with some $a < x^* < b$ . Let some $\rho\geq 0$ be given, as well as $L = 0$ , $k^{(1)}(x) = k_1; k^{(2)}(x) = k_2$ , and $c^{(1)} = c^{(2)} = 1$ so that $w = \theta$ . We also impose boundary conditions $\theta(a) = \theta_a, \theta(b) = \theta_b$ , The numerical solutions are computed with $M = 50$ , and $\tau = 0.0.001$ . We consider different ratios $k_2/k_1$ , and $\rho_{R}$ .

We plot the solutions to Example 4.1 found numerically with our monolithic P0-P0 solver. We also consider the corresponding stationary limit $\theta^{stat}(x)$ of (4.1); see the details in (7.15) and (7.14) in Section 7.6.

The numerical solution to the stationary problem in this simple case essentially coincides with the exact solution; this is typical for P0-P0 solution. The non-stationary solutions evolve towards this stationary solution. The qualitative nature is consistent with the imposed interface jump condition.

In the end we also test the scaling of the jump $[\theta]_{\Gamma}$ predicted by (4.8) given what we found for the stationary solution in Section 7.6; we check how it behaves over time. The details in Section 7.6 predict that the jump of $\theta^{stat}(x)$ scales linearly with $\rho_{R}$ as $\rho_{R}\downarrow 0$ , unlike $O(\sqrt{ \rho_{R}})$ predicted for the evolution problem in (4.3). For large ratio $\tfrac{k^{(2)}}{k^{(1)}}$ and small $\rho_{R} < 1$ we find that the jump behaves similarly to that for the stationary problem. For small $\tfrac{k^{(2)}}{k^{(1)}}$ , and before the solutions are close to the stationary limit, we see that the size of the jump $[\theta_h]_{\Gamma}$ depends significantly on the grid discretization $h$ . This behavior correlates with the large magnitude of $\partial_t w$ when $t$ is small. In fact, it also correlates with the estimates of $\partial_t w$ in (4.3) (see also subsequent discussion in Example 4.3). We defer further study of these features to the future.

4.3.2. Examples for Stefan problem for heterogeneous domain with two materials

Now we consider (ST) problem, and start by checking convergence of our P0-P0 scheme for the case of heterogeneous materials. To this end, we modify Example 3.2.

Example 4.2. We let $\Omega = (0, 0.4)$ with ${\bar{\Omega} = \bar{\Omega}^{(1)} \cup \bar{\Omega}^{(2)}}$ with parameters given in . Note that the material within $\Omega^{(2)}$ has $L\neq 0$ while $\Omega^{(1)}$ has $L = 0$ ; the freezing temperature $\theta_{fr} = 0$ for both.

Table 9. Heterogeneous materials from Example 4.2.

$\Omega$	$k_s$	$k_l$	$c_s$	$c_l$	$L$	$\theta_{fr}$
$\Omega^{(1)}=(0, 0.1)$	0.5	0.15	0.5	1	0	0
$\Omega^{(2)}=(0.1, 0.4)$	1	0.25	1	2	10	0

| Show Table

DownLoad: CSV

The initial conditions are $w_{init}(x) = c_s(x)\theta_{init}(x)$ which corresponds to $\theta_{init}(x) = -2$ for all $x \in \Omega$ . Boundary conditions are $\theta(0.4, t) = -2$ and $\theta(0, t) = 15$ for $t \in (0, T]$ . Figure 8 shows the temperature increase throughout the domain due to the heat transfer through the left boundary.

Figure 8. Simulation results in Example 4.2. Top row: solution at $t = 0.01$ ; Bottom row: solution at $t = 0.15$ . The interface separating $\bar{\Omega}^{(1)}$ and $\bar{\Omega}^{(2)}$ is shown by the vertical magenta line.

DownLoad: Full-Size Img PowerPoint

We do not have an exact solution, thus we must use fine grid solution for convergence studies. The coarse grid solution for $M_x = 20$ , is compared with that for the fine grid, $M_x = 2500$ at the first fine grid time step of $t = 0.005$ and the time $t = 0.15$ . In convergence study we disregard the initial time period $t \in [0, 0.005]$ in which the solution has very steep $w_t$ due to the discrepancy between the initial and boundary data. The convergence of the solution is of lower order during that time and pollutes the rest of the error analysis.

Convergence results, calculated for $t \in [0.01, 0.15]$ are shown in Table 10. We see similar rates of convergence for this heterogeneous example as for the homogeneous Example 3.2.

Table 10. Convergence error for $\theta$ and $w$ in Example 4.2 calculated for $t > 0.005$ .

$M_x$	$h_x$	$\tau$	$\|\|\theta_{err, num}\|\|_{\infty, 1}$	Order	$\|\|\theta_{err, num}\|\|_{\infty, 2}$	Order	$\|\|\theta_{err, num}\|\|_{2, 2}$	Order
20	$2.0\times 10^{-2}$	$5.0\times 10^{-3}$	$3.8777\times 10^{-1}$	-	$9.8085\times 10^{-1}$	-	$7.7555\times 10^{-2}$	-
100	$4.0\times 10^{-3}$	$1.0\times 10^{-3}$	$6.1873\times 10^{-2}$	1.1404	$1.4680\times 10^{-1}$	1.1801	$1.0807\times 10^{-2}$	1.2245
500	$8.0\times 10^{-4}$	$2.0\times 10^{-4}$	$1.3598\times 10^{-2}$	0.9414	$3.0670\times 10^{-2}$	0.9729	$2.1699\times 10^{-3}$	0.9975
2500	$1.6\times 10^{-4}$	$4.0\times 10^{-5}$	$2.3750\times 10^{-3}$	1.0842	$5.3472\times 10^{-3}$	1.0853	$3.7841\times 10^{-4}$	1.0851
$M_x$	$h_x$	$\tau$	$\|\|w_{err, num}\|\|_{\infty, 1}$	Order	$\|\|w_{err, num}\|\|_{\infty, 2}$	Order	$\|\|w_{err, num}\|\|_{2, 2}$	Order
20	$2.0\times 10^{-2}$	$5.0\times 10^{-3}$	$3.8122\times 10^{-1}$	-	$1.4377\times 10^{0}$	-	$2.9132\times 10^{-1}$	-
100	$4.0\times 10^{-3}$	$1.0\times 10^{-3}$	$5.6373\times 10^{-2}$	1.1876	$5.8560\times 10^{-1}$	0.5580	$1.0393\times 10^{-1}$	0.6404
500	$8.0\times 10^{-4}$	$2.0\times 10^{-4}$	$1.2365\times 10^{-2}$	0.9426	$2.6286\times 10^{-1}$	0.4977	$4.7824\times 10^{-2}$	0.4823
2500	$1.6\times 10^{-4}$	$4.0\times 10^{-5}$	$2.1711\times 10^{-3}$	1.0809	$9.8921\times 10^{-2}$	0.6072	$1.8280\times 10^{-2}$	0.5975

| Show Table

DownLoad: CSV

Next we illustrate the need for multiple materials by simulating heat flow at the pore-scale. We proceed to the simulation in $d = 2$ at the pore-scale which is motivated by our interest in permafrost. We use our P0-P0 solver with $\rho_{R} = 0$ , since there is no interface with high resistivity between the mineral and water components.

Example 4.3. Let ${\Omega = (0, 1) \times (0, 1)}\mathrm{[{cm^2}]}$ with ${\bar{\Omega} = \bar{\Omega}^{(1)} \cup \bar{\Omega}^{(2)}}$ , with water saturated pore space $\bar{\Omega}^{(1)}$ , and the rock grains $\bar{\Omega}^{(2)}$ ; see . The material parameters are those of water for $\Omega^{(1)}$ and inferred from those for silica for $\Omega^{(2)}$ as given in . We start from thermal equilibrium, with constant temperature $\theta_{init} = -1 {{\mathrm{[{^\circ C}]}}}$ with which we calculate $w_{init}(x) = c_s(x)\theta_{init}(x)$ .

Figure 9. Simulation described in Example 4.3 within Section 4.3.2 of heat conduction in the pore space, with geometry within ${\Omega = (0, 1) \times (0, 1)}\mathrm{[{cm^2}]}$ depicted on top. We simulate the thawing front moving from the left to the right starting from thermal equilibrium and with left boundary subject to increased temperature. Displayed is the first time step (left), two middle time steps (middle) and steady state (right) for temperature $\theta$ (top) and enthalpy $w$ (bottom).

DownLoad: Full-Size Img PowerPoint

The heat in $\Omega$ increases due to the boundary condition at $\partial \Omega_{\text{left}} = \{0\} \times [0, 1]$ where we set $\theta_D(x, t)|_{\partial \Omega_{\text{left}}} = 10$ . We also assume insulated boundary conditions elsewhere $q \cdot \nu|_{\partial \Omega \setminus \partial \Omega_{\text{left} }} = 0$ for $t \in (0, T]$ .

In we show temperature profiles of four time steps. The melting front moves from left to right with a phase change in the void space $\Omega^{(1)}$ , but the grains do not undergo phase change. The heat flux moves faster through the areas with more grains, since this change requires less energy required for phase change than the heat conduction in the water component.

4.4. Summary

More work on testing the heterogeneous Stefan problem and even the linear heat equation is needed, but overall we believe our monolithic P0-P0 scheme performs well for multiple materials. While not reported here, we tested the simulation cases of different freezing temperatures, and drastically different parameter $L$ between domains, and see that the algorithms perform in a robust way across these scenarios, even if a decrease of the time step is needed in the most challenging cases.

5. Permafrost models

Permafrost is defined as the ground that remains frozen for two or more years ^[41,66,67]. Temperatures in permafrost respond to the changes in ambient temperatures at the soil-atmosphere interface, possibly including the effects of precipitation and vegetation. In the upper portion of permafrost called active layer the temperatures increase in the summer and decrease in other parts of the year. The bottom of the active layer is isothermal but the depth of this layer is changing due to the increase of ambient temperatures, and this causes the thawing of some portions of permafrost, with further environmental consequences. The importance and impact of coupled phenomena within permafrost models is of current interest. Modeling of the coupled phenomena in permafrost regions is complex, and requires careful conservative approximations across the scales ^{[10,14,15,67,68,69]}.

5.1. Energy conservation in permafrost

We recall now the (ST) problem discussed in Section 2 for conservation of energy in any volume $\Omega$ occupied only by water component in one of two phases: liquid or solid. For any small sub-volume $\omega \subset \Omega$ centered at some $x \in \Omega$ we can calculate the water fraction $\chi(x) = \frac{|\Omega_l \cap \omega|}{|\Omega \cap \omega|}$ . In equilibrium, $\chi$ it equals 1 whenever $\theta > 0$ in the entire $\omega$ . On the other hand, if $\theta < 0$ on $\omega$ , then $\chi(x) = 0$ . In the case $\theta = 0$ , we have $0 \leq \chi(x) \leq 1$ , and $\chi$ can be considered as an independent thermodynamic variable at constant volume; see ^[27]. In other words, the water fraction can be considered a pointwise or a volumetric quantity, and in (ST) problem we assumed (2.3) and $\chi = \chi(\theta)$ .

The model for conservation of energy in permafrost must extend this formulation to account for the presence of rock grains which do not change phase. We assume here that the soil rock grains have a constant heat capacity $c_r$ and a constant thermal conductivity $k_r$ .

The presence of rock grains has several consequences. The first consequence is that in the energy balance we have to account for the rock as a separate material; we discuss this in Section 5.1.2. Second, the presence of rocks affect the local energy landscape at the fluid-rock interfaces; in particular, it causes depression of freezing temperatures in small pores, as well as premelting, the presence of a thin film of water around rock grains ^[70,71]. Both phenomena have significant effects on the qualitative behavior of phase transitions in permafrost.

In modeling, one must consider the scale at which we wish to consider the phenomena. At the pore-scale of $\mathrm{[{mm}]}$ to $\mathrm{[{nm}]}$ scale, the rock grains and water occupied domains should be treated as separate materials such as in Example 4.3. In practice, modeling large scale changes in permafrost in response to the temperature in the environment must occur at larger scale such as that of $\mathrm{[{m}]}$ , i.e., at the so-called Darcy scale. Typically, models at Darcy scale take advantage of constitutive relationships measured experimentally in a laboratory. Thanks to modern computational science the pore-scale modeling can be connected to Darcy scale such as in our work and in particular ^{[18,19,20,21]}, but a thorough discussion is outside our present scope.

5.1.1. Permafrost as a porous medium

We introduce notation which helps to explain the difference and connections between Stefan problem and permafrost models.

As mentioned earlier, porous medium is made of rock grains and non-rock "void space" occupied by fluids such as water and air. In this paper we consider only the water component. The rock grains occupy a fixed portion $\Omega_r$ of $\Omega$ , and the water component in $\Omega_w$ occupies the remainder so that $\overline{\Omega} = \overline{\Omega_w} \cup \overline{\Omega_r}$ . The void (non-rock) proportion $\eta = \frac{|\Omega_w|}{|\Omega|}$ is called porosity, and $0 < \eta\leq 1$ . The water component can be in liquid or solid phases so that $\overline{\Omega_w} = \overline{\Omega_l} \cup \overline{\Omega_s}$ , and we have $\chi = \tfrac{|\Omega_l|}{|\Omega|} = \tfrac{|\Omega_l|}{|\Omega_w|} \tfrac{|\Omega_w|}{|\Omega|} = \eta \chi_w, \text{ with } \chi_w = \tfrac{|\Omega_l|}{|\Omega_w|}$

$\begin{eqnarray} \chi(\theta) = \eta \chi_w(\theta). \end{eqnarray}$

(5.1)

The quantity $\chi_w$ is called "unfrozen water content" in permafrost and is determined experimentally for a particular soil type depending on the temperature.

5.1.2. Energy balance in permafrost

The presence of rock grains influences the energy balance. In permafrost the balance of energy must account for the energy content in $\Omega_r$ and $\Omega_w$ , thus the heat capacity is a weighted fraction of that in rock grains and in fluid space,

$\begin{eqnarray} c(\theta) = c_r (1-\eta)+\eta\left( c_l \chi_w + c_s (1-\chi_w)\right) = c_r (1-\eta) + c_l \chi + c_s (\eta - \chi), \end{eqnarray}$

(5.2)

which can be also written, after rearranging, as

$\begin{eqnarray} c(\theta) = c_u \chi_w + c_f(1-\chi_w); \;\; c_u = (1-\eta)c_r+ \eta c_l, \;\; c_f = \eta c_s + (1-\eta)c_r, \end{eqnarray}$

(5.3)

where $c_u$ is the (constant) heat capacity of the "unfrozen soil" and $c_f$ is the (constant) heat capacity of the "frozen soil". In the end, we get ^[8]

$\begin{eqnarray} w = \int_{\theta_{fr}}^{\theta} c(\upsilon)d\upsilon + L\eta \chi_w(\theta);\; \;\; c(\theta) = c_u \chi_w(\theta) + c_f(1 - \chi_w(\theta)). \end{eqnarray}$

(5.4)

Remark 5.1. We can now compare the formulas for $w$ in the classical Stefan problem (2.5) and in the permafrost model (5.4). The former sees the change between "solid" and "liquid", but the latter (5.4) presents the phase change problem between the "frozen soil" and the "unfrozen soil", where the amount of each is controlled by the unfrozen water content $\chi_w$ .

5.1.3. Experimental models

One of important features of permafrost is the presence of unfrozen water at low temperatures i.e. $\chi(x, t) > 0$ even when $\theta(x, t) < \theta_{fr}$ . This phenomenon is not fully explained, and is accompanied by the depression of the freezing temperatures in small pores. The permafrost models take advantage of the empirical data which fits one of the parametric relationships for $\chi_w(\theta)$ such as (5.1), where $\chi_w$ is the unfrozen water content determined experimentally and dependent on soil type. We follow a formulation based on ^[14]: here $\chi_w(\theta)$ is parametrized with some soil-type dependent parameters $b > 0$ , as well as with $\chi_{res} \in (0, 1) \mathrm{[{-}]}$ which denotes the residual unfrozen water content at some really low reference temperature

$\begin{eqnarray} \chi_w(\theta) = \begin{cases} \chi_{res} + (1 - \chi_{res})e^{b(\theta - \theta_{fr})}; & \theta \leq \theta_{fr} \\ 1; & \theta \geq \theta_{fr} \end{cases}. \end{eqnarray}$

(5.5)

Now $\theta_{fr}$ is the threshold temperature called freezing point depression such that for $\theta > \theta_{fr}$ , only liquid water is present. Unlike in bulk water, typically $\theta_{fr} < 0$ . Parameter $b \mathrm{[{1/^\circ C}]}$ depends on the soil type. For example, for coarse-grained soils such as sand, the value of $b$ is large, but for fine-grained soils such as clay $b$ is small. For a particular soil type, we assume $b$ to be a constant.

Remark 5.2. A variety of algebraic models for $\chi_w (\cdot; \theta)$ other than (5.5) are given in applications literature; see for expressions adapted so that they fit (5.1). The models are qualitatively similar to each other. For a particular expression for $\chi_w(\theta)$ plugged in (5.4) they produce $w = \alpha^P(\theta)$ which is a monotone increasing injective Lipschitz function. In particular, with (5.5) we get, after some rearranging

$\begin{eqnarray} \alpha^P(\theta)\rvert_{\theta > \theta_{fr}} & = & c_u(\theta - \theta_{fr}) + \eta L, \\ \alpha^P(\theta)\rvert_{\theta \leq \theta_{fr}} & = & c_f(\theta - \theta_{fr}) + (c_u - c_f)\left( \chi_{res}(\theta - \theta_{fr}) + \frac{(1-\chi_{res})}{b}\left( e^{b(\theta - \theta_{fr})} - 1 \right) \right) \\ &+& \eta L \left(\chi_{res} + (1-\chi_{res})e^{b(\theta - \theta_{fr})} \right) . \end{eqnarray}$

(5.6)

Table 11. Different models for liquid water fraction in Section 5.1.

Model (reference)	Unfrozen water content $\chi_w \rvert_{\theta \leq \theta_{fr}}$	Parameters	Typical values
[L] ^[72]	$\chi_w(\theta) = \left \lvert \frac{\theta_{fr}}{\theta} \right \rvert^a$	$a\mathrm{[{-}]}$	Silt: $a \in [0.3, 0.5]$ ^[73]
[W] ^[6]	$\chi_w(\theta) = (1-\chi_{res})\left[ \frac{\theta_{\lambda}}{\theta_{\lambda} + \theta_{fr} - \theta} \right]^4 + \chi_{res}$	$\theta_{\lambda} \mathrm{[{ ^\circ F }]}$	Sand: $\theta_\lambda = 0$ , Silt: $\theta_\lambda =2$ ^[6]
[M] ^[14]	$\chi_w(\theta) = \chi_{res} + (1 - \chi_{res})e^{b(\theta - \theta_{fr})}$	$b\mathrm{[{ 1/^\circ C }]}$	Clay: $b = 0.16$ , ^[15]
[A] ^[41]	$\chi_w(\theta) = a (-\theta)^{b}$	$a[-], b[-]$	Silt: $a=3.2, b=-0.5$ , ^[41]
			Basalt: $a=3.45, b=-1.13$ ^[41]

| Show Table

DownLoad: CSV

For comparison, we plot $\alpha^P$ corresponding to the the different models in . Here we use the parameters ^[16] $c_r = 0.9$ , $k_r = 0.0195$ , $\theta_{fr} = -0.5, \chi_{res} = 0.2035$ and $\eta = 0.43$ along with the thermal properties of water as given in Table 2.

Figure 10. Enthalpy given by (5.4) for different experimental models for unfrozen water content $\chi_w(\theta)$ tabulated in Table 11. The plots mimic the properties of clay ^[15]. Although the graphs appear to have a singular derivative near $\theta_{fr}$ , the functions are all Lipschitz. For the particular curves, we use $b = 0.2 \mathrm{[{1/^\circ C}]}$ for [M] ^[14], $a = 0.4$ for [L] ^[72], $\theta_{\lambda} = 1 \mathrm{[{^\circ C}]}$ for [W] ^[6]. Also shown is the enthalpy curve (ST) calculated without considering the effects of unfrozen water content when at i.e.. when the enthalpy (5.4) is calculated assuming that $\chi_w = 0 \ \forall \ \theta < \theta_{fr}$ .

DownLoad: Full-Size Img PowerPoint

We see that each $\alpha^P$ can be seen as an approximation to the monotone graph $\alpha ^{ST}$ , and each is a strictly monotone smooth increasing function of $\theta$ , differentiable everywhere except at $\theta_{fr}$ (as we stated in Assumption 1.3 for $\alpha^P$ ).

5.1.4. Thermal conductivity in permafrost

To complete the permafrost model, we need to define the thermal conductivity of the porous medium which is made of $\Omega_r$ and $\Omega_w$ , with the latter partitioned between $\Omega_l$ and $\Omega_s$ .

It is well known that, unlike capacity given in (5.3), conductivity $k$ of composite materials depends not just on the values of $k_r, k_l, k_s$ , as well as on the fractions $\eta, \chi_w$ , but also on the geometry of how the phases are arranged. Still, some authors consider straightforward weighting with volumetric fractions as in (5.3) which gives, e.g., in ^[16]

$\begin{eqnarray} k = k_l \chi + k_s (\eta - \chi) + k_r(1-\eta) = k_u \chi_w + k_f(1-\chi_w). \end{eqnarray}$

(5.7)

where $k_u = \eta k_l + (1-\eta)k_r$ is the thermal conductivity of the "unfrozen soil" and $k_f = \eta k_s + (1-\eta)k_r$ is the thermal conductivity of the "frozen soil". While (5.7) is a good first order approximation, it does not take into account the geometry of the rock grains; more accurate expressions based on data are considered in ^[10,14,15] ^[41](.) Other general approaches can be considered and involve upscaling or homogenization; see, e.g., ^[18,74]. More work is needed on more accurate expression for $k(x, t)$ .

For the needs of this paper we use (5.7), combined with some selected model for $\chi_w = \chi_w(\theta(x, t))$ . This completes the permafrost model, once boundary and initial conditions are specified. To construct these for realistic scenarios, care is needed. The former depend on environmental conditions, while the latter must be found carefully since thermal equilibrium might not be always realistic to assume.

5.2. Approximation of permafrost model with P0-P0

We recall now the P0-P0 (equivalent to CCFD) approximation to (1.1) with (1.2) given in Section 3. The function $\alpha^P$ described in Section 5.1.3 features a piecewise smooth nonlinearity, thus fits the general class of problems that Section 3 applies to, and we expect that the scheme P0-P0 will work well.

In fact, the fluxes $q$ permafrost problem feature continuous normal components since there is no jump prescribed. Therefore, we expect that $\theta$ is approximated to at least to first order accuracy, similarly to what we encountered for the (harder) (ST) problem.

To put our P0-P0 approach in perspective, we first briefly review relevant literature; see Section 5.2.1. Then we follow up with the test of convergence of P0-P0 in Section 5.2.2 along with examples in Section 5.2.3.

5.2.1. Approaches in literature

The work ^[6] uses a variational formulation, implicitly discretized in time, to generate a set of nonlinear algebraic equations which are solved using the Newton's method; they test their algorithm on $2D$ examples; see also discussion in ^[11]. Next, ^[8] use a fixed grid finite element method further employing Picard's method to deal with the consequential nonlinearity in the mass and stiffness matrix. In turn, ^[9] use a node-centered finite difference scheme and solve the nonlinear equations using Newton's methods, and ^[15] implement their finite element model within the commercially available solver ABAQUS.

We adopt the fully discrete formulation for P0-P0 as given in Section 3 and extended to heterogeneous media in Section 4.

5.2.2. Convergence of P0-P0 scheme for permafrost model

Example 5.1. For convergence studies, we choose an example based on ^[26]. We consider the scenario as in Example 3.2, but with $\alpha^P(\cdot)$ instead of $\alpha ^{ST}$ , and with the following initial and boundary conditions

$\begin{eqnarray} w_{init}(x) = \alpha^P(-1),\;\; \theta(0, t) = -1 {{\mathrm{[{^\circ C}]}}}, \ \theta(20, t) = 1 {{\mathrm{[{^\circ C}]}}}. \end{eqnarray}$

(5.8)

The soil properties are chosen to be $\eta = 0.43, \ b = 0.9$ . We run the simulation over a time period of $(0, 50000)\mathrm{[{s}]}$ . Since the exact solution is unknown, the convergence rates are calculated using a fine grid solution $(\theta^n_{h_{fine}}, w^n_{h_{fine}})_n$ computed on a mesh with $M = 4000$ cells and time step $\tau = 25 \mathrm{[{s}]}$ .

The results are given in . We see a modest improvement in the present case with $\alpha^P$ over the rates obtained for $\alpha ^{ST}$ in the Stefan problem, with order robustly $\approx 1$ for $\theta$ and $\approx 0.5$ for $w$ . Further, we also observe that the order of convergence seems independent of small variations in the soil type, or in the parameter $b$ in (5.5). These results are for $\tau \approx O(h)$ as in (ST) problem.

Table 12. Convergence orders using P0-P0 scheme to permafrost problem for Example 5.1, and a fine grid solution. Different soil types are used, as shown by parameter $b$ in Column 3.

$M$	$\tau$	$b$	$\\|\theta_{err} \\|_{\infty, 2}$	$\\|\theta_{err}\\|_{\infty, 1}$	$\\|\theta_{err}\\|_{2, 2}$	$\\|w_{err}\\|_{\infty, 2}$	$\\|w_{err}\\|_{\infty, 1}$	$\\|w_{err}\\|_{2, 2}$
	$\tau = O(h)$
$\{20, 40\}$	$\{5000, 2500\}$	$0.1$	0.37	0.50	0.81	0.52	0.63	0.93
$\{40, 80\}$	$\{2500, 1250\}$	$0.1$	0.34	0.55	0.80	0.43	0.61	0.86
	$\tau = O(h^2)$
$\{20, 40\}$	$\{20000, 5000\}$	$0.1$	0.65	1.06	1.30	0.53	0.95	1.30
$\{40, 80\}$	$\{5000, 1250\}$	$0.1$	0.54	0.99	1.46	0.54	1.01	1.44
	$\tau = O(h)$
$\{20, 40\}$	$\{5000, 2500\}$	$1$	0.57	0.66	1.08	0.95	1.15	1.36
$\{40, 80\}$	$\{2500, 1250\}$	$1$	0.49	0.61	0.95	0.69	0.85	1.18
	$\tau = O(h^2)$
$\{20, 40\}$	$\{20000, 5000\}$	$1$	0.56	1.00	1.35	0.50	0.97	1.29
$\{40, 80\}$	$\{5000, 1250\}$	$1$	0.54	1.03	1.48	0.53	1.03	1.42
	$\tau = O(h)$
$\{20, 40\}$	$\{5000, 2500\}$	$4$	0.62	0.73	1.22	1.55	1.77	1.56
$\{40, 80\}$	$\{2500, 1250\}$	$4$	0.60	0.63	1.02	0.50	0.72	1.28
	$\tau = O(h^2)$
$\{20, 40\}$	$\{20000, 5000\}$	$4$	0.54	0.96	1.35	0.51	0.98	1.18
$\{40, 80\}$	$\{5000, 1250\}$	$4$	0.53	1.03	1.47	0.54	1.03	1.32

| Show Table

DownLoad: CSV

Next we attempt $\tau \approx O(h^2)$ which is optimal in linear and mildly nonlinear parabolic problems. We attempt the latter since the graph $\alpha^P$ is much smoother than any of $\alpha ^{ST_{\epsilon}}$ , hence, we hope to have better than first order convergence. However, there is only mild improvement in the convergence rate when using $\tau \approx O(h^2)$ compared to $\tau = O(h)$ ; specifically, the order of convergence is roughly $\approx 1.3$ – $1.5$ . In the end, this improvement may or may not justify the extra computational effort of using $\tau = O(h^2)$ .

5.2.3. Simulation of realistic examples

Next we apply our P0-P0 algorithm to physical examples pertaining to permafrost thawing. The goal of these examples is to test the robustness of our scheme and to illustrate the modeling aspects of permafrost. Since most of the interesting dynamics in permafrost occurs due to the varying surface boundary conditions and along the depth, we confine ourselves to the $d = 1$ case.

We use the expression (5.5) for the unfrozen water content $\chi_w(\theta)$ .

Example 5.2. Homogeneous case. Consider a small column of soil $\Omega = (0, 20) \mathrm{[{cm}]}$ with physical parameters as in along with $\eta = 0.43, \chi_{res} = 0.1$ , $b = 0.5 \mathrm{[{1/^\circ C}]}$ . We use the following initial and boundary conditions

$\begin{eqnarray} w_{init}(x) = \alpha^P(2),\;\; \theta(0, t) = 2 \mathrm{[{^\circ C}]}, \ \theta(20, t) = -4 \mathrm{[{^\circ C}]}, \end{eqnarray}$

and run the simulation over a time period of $(0, 50000)\mathrm{[{s}]}$ . The solution at different times is shown in Figure 11.

Figure 11. Solution at different time steps of the permafrost examples. Left: results for homogeneous soil in Example 5.3; Right: simulation for heterogeneous soil in Example 5.3. Also shown is the interface separating the fine and coarse soil. Bottom: the corresponding $(\theta, w)$ curves at $t = 50000$ for the homogeneous and heterogeneous example, respectively.

DownLoad: Full-Size Img PowerPoint

We see the temperature profile trending slowly towards the stationary distribution. When the simulation is run over a time period of at least $4.5$ hours, steady state is achieved in which the temperature qualitatively has an almost linear profile.

Example 5.3. Heterogeneous case. We extend Example 5.2 to a heterogeneous case. We consider $\Omega = (0, 20) \mathrm{[{cm}]}$ be composed of two different soil types: $\Omega^{fine} = (0, 15)$ and $\Omega^{coarse} = (15, 20)$ . Thermal properties of water are as in , and we use $\eta = 0.43, \chi_{res} = 0.1$ . The difference in the properties of coarse and fine soil is in parameter $b$ in (5.5). We choose $b_{fine} = 0.5 \mathrm{[{1/^\circ C}]}$ and $b_{coarse} = 1 \mathrm{[{1/^\circ C}]}$ . We start with the initial and boundary conditions

$\begin{eqnarray} w_{init}(x) = \alpha^P(2); \;\; \theta(0, t) = 2 \mathrm{[{^\circ C}]}, \ \theta(20, t) = -4 \mathrm{[{^\circ C}]}, \end{eqnarray}$

and run the simulation over a time period of $(0, 50000)\mathrm{[{s}]}$ . The solution at different times is shown in Figure 11.

The difference between Examples 5.2 and 5.3 is apparent from . At $t \geq 25000 \mathrm{[{s}]}$ a prominent jump in enthalpy appears near $x \approx 15 \mathrm{[{cm}]}$ ; this effect is due to the soil heterogeneity.

Next we aim to simulate the effect of permafrost warming due to variable temperature at the top boundary, and a possible effect of climate warming.

Example 5.4. Homogeneous case with time dependent boundary condition. Let $\Omega = (0, 15) \mathrm{[{m}]}$ represent a column of soil of porosity $\eta = 0.43$ , with the top boundary subject to time-varying temperature boundary conditions representing typical environmental change; see, e.g., ^[41](–) and ^[75] where the effect of climate warming is considered. At the bottom we apply fixed Dirichlet condition representing fixed temperature below the active layer. For simulation we use thermal properties of water from , along with $\chi_{res} = 0.1$ , $b = 0.5$ . Our goal is to simulate the temperature and enthalpy profile during the time period of $[0, 2.5]$ years following an earlier period of time $[-10, 0]$ , initiated with

$\begin{eqnarray} &w(x,-10) = \alpha^P(-0.2), \ \theta(0, t) = -0.5 + 10\sin{(\tfrac{2\pi }{365}t - 0.2\pi)} {{\mathrm{[{^\circ C}]}}}, \ \\ &\theta(15, t) = -0.2 {{\mathrm{[{^\circ C}]}}}, \ t \in [-10, 0]. \end{eqnarray}$

We use the simulated $w(x, 0)$ (shown in the top row of ) as the initial data for the new time period $[0, 2.5]$ , with

$\begin{eqnarray} \label{eq:permafrost3_bc2} \theta(0, t) = -0.5 + 0.052t + 10\sin{(\tfrac{2\pi }{365}t - 0.2\pi)} {{\mathrm{[{^\circ C}]}}}, \ \ \theta(15, t) = -0.2 {{\mathrm{[{^\circ C}]}}}, \ t \in [0, 2.5] \mathrm{[{yr}]}. \end{eqnarray}$

Figure 12. Top: $\theta_{init}$ and $\theta(\cdot, 2.5)$ , $w(\cdot, 2.5)$ for Example 5.4. Middle left: temperature plot. Included are $\theta(0, t)$ (top boundary), and $\theta(x, t)$ at the depth $x = 0.525, x = 1.525,$ and $x = 3.125 \mathrm{[{m}]}$ for a period of $t\in [0, 2.5]$ years. Middle right: enthalpy plot at the corresponding $(x, t)$ . Bottom: temperature and enthalpy at the end of simulation $t = 2.5$ [years].

DownLoad: Full-Size Img PowerPoint

The simulation results are shown in . The temperature at a depth $x\approx 0.5, 1.5$ and $3.5$ as a function of time is plotted along with the temperature and enthalpy at the end of $2.5$ years. We use the simulation results to get a rough estimate of the active layer thickness to be $\sim 4 \mathrm{[{m}]}$ .

5.3. Solver challenges and open questions

The P0-P0 approximation scheme and the solver behaves similarly for permafrost problem (P) as for (ST) problem. For single material problems, the model is robust and features essentially mesh independent number of iterations under reasonable time steps.

However, simulation in heterogeneous domains pose difficulties. The performance of Newton's method can be rather rugged, and is a (well-known) challenge ^[76]. For permafrost, the solver performance deteriorates in the case of highly disparate data, and/or and large time steps. Specifically, the former is an issue if there is a big difference, e.g., in the soil parameters $b$ (or $a$ ) in Section 5.1.

One way to overcome this challenge is to use adaptive time stepping through which the time step is reduced if Newton iteration struggles to converge. Generally, reducing the time step leads to smoother convergence of residuals to zero. For very difficult cases related to high degree of heterogeneity, we regularize the problem by introducing an artificial "intermediate" layer in the region close to the interface $\Gamma$ in which the soil parameters vary more smoothly. Such a strategy can be used to find an initial guess for Newton step, or be used as the solution for a few initial time steps.

5.4. Summary

More work on the solver and schemes is needed, but overall we believe our P0-P0 scheme performs well for the permafrost problem (P). We see that its behavior is smoother and the challenges are somewhat milder than those for the (ST) problem, except in heterogeneous domains.

6. Summary and future works

In this paper we consider a collection of models motivated by the applications to the phase change problems dubbed (ST), their (ST) $_{\epsilon}$ modeling approximation, and the models (P) in permafrost.

For approximation, we consider the technique well known for its conservative properties, mixed finite element method on rectangular elements. Because the solutions are expected to have low regularity, we choose to use only lowest order finite elements, with piecewise constant approximations to the scalar unknowns $(\theta, w)$ which we call P0-P0, and which are similar to finite volume or cell-centered finite difference approaches. For time discretization we employ a first order fully implicit scheme. We show that our P0-P0 approach works well for (ST) as well as (ST) $_{\epsilon}$ and (P) permafrost, and that they compare well to the P1-based approaches that were primarily formulated for (ST) $_{\epsilon}$ . We extend the P0-P0 approach to heterogeneous case of multiple materials, and showed that a monolithic P0-P0 discretization we construct is robust and easy to implement. We set up theoretical framework relating to known literature results, proved several results, and point out the challenges and open questions.

More work is underway. In particular, due to the challenges we acknowledged above, rigorous convergence analysis of P0-P0 is inaccessible at this time, even if it can be inferred in the limit of that for (ST) $_{\epsilon}$ . The main challenge is that that the use of $X = H_{\mathrm{div}}(\Omega)$ for the heat flux $q$ is not directly possible for (ST). At the same time, our algorithms are robust even if they do not address this formal difficulty, but require more work which we defer to the future.

On modeling side, in permafrost, or more generally, in freezing soils, there are additional micro-scale physical effects which contribute to the complex physics of phase change. These include freezing temperature depression, presence of air which acts as an insulator, configuration (geometry) of the pore space, as well as the chemical composition of liquid and mineral phases. While these are not fully explained, we aim to connect the computational models of multi-material pore-scale to the macro-scale experimental models such as those in Section 5.

For (P) models in permafrost, in the paper we neglect convection and other physical phenomena which can potentially contribute to the temperature changes such as flow and mechanical behavior; these, along with other physical and environmental effects including those of radiation, vegetation and snow cover, and more, will be studied in forthcoming paper.

7. Appendix

7.1. Exact solution for Example 2.1

The exact solution to (2.15) with data as in Example 2.1 is given by

$\begin{eqnarray} w(t) = \begin{cases} \frac{1}{1 + \frac{A^2 T_{end}^2}{c_s^2 \pi^2}}\left[ -\frac{T_{end}}{\pi} \cos{(\frac{\pi t}{T_{end}})} + \right] + w_{init}; & t \in [0, t_1] \nonumber \\ -\frac{T_{end}}{\pi} \cos{(\frac{\pi t}{T_{end}})} + \cos{(\frac{\pi t_1}{T_{end}})}; & t \in [t_1, t_2], \nonumber \\ \frac{1}{1 + \frac{A^2 T_{end}^2}{c_l^2 \pi^2}}[ -\frac{T_{end}}{\pi} \cos{(\frac{\pi t}{T_{end}})} + \frac{AT_{end}^2}{c_l \pi^2} \cos{(\frac{\pi t}{T_{end}}}) \nonumber \\ - \left(-\frac{T_{end}}{\pi}\cos{(\frac{\pi t_2}{T_{end}})} + \frac{A T_{end}^2}{c_l \pi^2}\sin{(\frac{\pi t_2}{T_{end}})} \right) e^{\frac{A(t_2-t)}{c_l}} ] + Le^{\frac{-At}{c_l}}; & t \in [t_2, 3600], \end{cases} \end{eqnarray}$

where $t_1 = 44.2658 \mathrm{[{s}]}$ is when phase change begins, $t_2 = 858.5458 \mathrm{[{s}]}$ is when all the solid has changed into liquid, and $w_{init} = -1$ is the initial enthalpy.

7.2. Auxiliary finite dimensional result

We consider a useful ODE system on $R^k$ with a matrix $A \in \mathbb{R}^{k \times k}$ and right hand side $F(t)\in \mathbb{R}^k$

$\begin{eqnarray} \tfrac{d}{dt}W+AU = F, w(0) = W_{init}; \; W \in \alpha(U). \end{eqnarray}$

(7.1)

We define the solutions $W(t)$ as limits of the solutions $W^n$ to the time-discrete problem

$\begin{eqnarray} \tfrac{W^n-W^{n-1}}{\tau}+AU^n = F^{n},\; W^n \in \alpha(U^n),n\geq 1; W^0 = W_{init}. \end{eqnarray}$

(7.2)

The result below will be used many times in this paper.

Lemma 7.1. Let $\alpha(\cdot)$ be maximal monotone and $A$ symmetric nonnegative definite. Then there exists a unique solution $(U^n, W^n)$ to (7.2).

Proof. We find $U^n$ as the minimizer of

$\begin{eqnarray*} J(U) = \frac{\tau}{2}U^TAU+ \Phi_{\alpha}(U)-U^T(W^{n-1}+\tau F^{n}) \end{eqnarray*}$

where we set $\Phi_{\alpha}(U) = \sum_{i = 1}^k \phi_{\alpha}(U_i)$ , with $\phi_{\alpha}(r)$ the (strictly convex) primitive of $\alpha(r)$ (i.e., $\phi_{\alpha}'(r) = \alpha(r)$ , and the subgradient $\partial \Phi_{\alpha}(U) = \sum_k \phi_{\alpha}(U_i)$ ). The functional $J(U)$ is strictly convex due to the convexity of $U^TAU$ and strict convexity of $\Phi_{\alpha}$ , and thus it has a unique minimizer $U$ , and we set $U^n = U$ . With known $U^n$ , we find $W^n$ from $W^n+\tau AU^n = W^{n-1}+\tau F^{n}$ , which follows from (7.2).

7.3. Detailed comparison of P1-based formulations

Now we provide a detailed comparison of the P1-based formulations recalled in Section 2.4. We do so, for simplicity, for $d = 1$ and a grid covering $\Omega = [a, b]$ with $M$ cells $\omega_i = [x_{i-1/2}, x_{i+1/2}]$ with centers $x_i$ so that $\bar{\Omega} = \cup_{i = 1}^{M} \omega_i$ and $x_{\frac{1}{2}} = a, \ x_{M + \frac{1}{2}} = b$ . Other results not reviewed here include grid adaptive schemes in ^[77].

We recalling that $u^n_h \in V_h$ is identified by its nodal values $(U_{j\pm 1/2^n})_j$ . In turn, in P1-P0 approaches $w_h^n \in M_h$ are identified by $(W_j^n)_j$ .

We denote by $\Pi_h^0 u_h^n$ the $L^2$ projection onto constants $\Pi_h^0: L^2(\Omega) \rightarrow M_h$ , by $P_h^0$ the $L^2$ projection onto $V_h$ , and by $P_h^1$ the Ritz projection onto $V_h$ .

P1-P0 approximations: In ^[4]: one seeks $w_h^n \in M_h$ such that

$\begin{eqnarray} &(u^n_h, \psi) + \frac{\tau}{\mu}(\nabla u^n_h, \nabla \psi) = ( \beta_{\text{K}}(w_h^{n-1}), \psi), \ \forall \psi \in V_h, \end{eqnarray}$

(7.3a)

$\begin{eqnarray} &W_j^n = W_j^{n-1} + \mu\left(u^n_h (x_j) - \beta_{\text{K}}(W^{n-1}_j) \right), \; \forall {\omega_j}. \end{eqnarray}$

(7.3b)

Here we require $\mu \leq L_{ \beta_{\text{K}}}^{-1}$ , since (7.3b) implements the Chernoff formula. Note that the advantage of (7.3) is that (7.3a) is linear in $u_h^n$ . However, a consistency error arises since $\mu \neq { \beta_{\text{K}}'(W^{n-1}_j)}^{-1}$ and $u^n_h(x_j) = \tfrac{{U^n}_{j-\frac{1}{2}} + {U^n}_{j + \frac{1}{2}}}{2} \neq \beta_{\text{K}}({W}^{n}_j)$ , and

$\begin{eqnarray*} \beta_{\text{K}}({W}^{n}_j) = \beta_{\text{K}}\left({W}^{n-1}_j + (W^n_j - {W}^{n-1}_j) \right) \approx \beta_{\text{K}}({W}^{n-1}_j) + (W^n_j - {W}^{n-1}_j) \beta_{\text{K}}'({W}^{n-1}_j). \end{eqnarray*}$

Next, ^[5] are able to improve the convergence rates by modifying the scheme given by (7.3) through the use of regularization and projection. Their fully discrete regularized scheme reads ^[5]: find $u_h^n \in V_h, \ w_h^n \in M_h$ such that

$\begin{eqnarray} &(\Pi_h^0 u_h^n, \psi) + \frac{\tau}{\mu}(\nabla u_h^n, \nabla \psi) = ({ \beta_{\text{K}}}_\epsilon(w_h^{n-1}), \psi), \ \forall \psi \in V_h, \end{eqnarray}$

(7.4a)

$\begin{eqnarray} &W^n_j = W^{n-1}_j + \mu\left((\Pi_h^0 u_h^{n})(x_j) - { \beta_{\text{K}}}_\epsilon(W^{n-1}_j)\right), \end{eqnarray}$

(7.4b)

where ${ \beta_{\text{K}}}_\epsilon$ is given by (7.5), essentially a Yosida approximation to $\beta_{\text{K}}$ ,

$\begin{eqnarray} { \beta_{\text{K}}}_\epsilon(w) = \begin{cases} \beta_{\text{K}}(w); & w < 0, \ w > \frac{L k_l}{k_l-c_l\epsilon} \\ \epsilon w; & 0 \leq w \leq \frac{L k_l}{k_l-c_l \epsilon} \end{cases}, \end{eqnarray}$

(7.5)

for some $0 < \epsilon < \frac{k_l}{c_l}$ .

The work of ^[44] uses a semi-implicit scheme for phase relaxation coupled with (2.6) given by $\epsilon \frac{\partial \chi}{\partial t} + { \mathcal{H}^{-1}}(\chi) \ni u,$ They seek ${\chi_h}^n \in M_h$ as an independent variable and solve

$\begin{eqnarray} &\frac{\tau}{\epsilon}(u_h^n, \psi)_h + ({\chi_h}^n, \psi) + \tau(\nabla u_h^n, \nabla \psi) = \frac{\tau}{\epsilon}(u_h^{n-1}, \psi)_h + ({\chi_h}^{n-1}, \psi), \forall \psi \in V_h, \end{eqnarray}$

(7.6a)

$\begin{eqnarray} &{\chi_h}^{n}(x_j) = \left(\frac{\tau}{\epsilon} (\Pi_h^0 u_h^{n-1})(x_j) + {\chi_h}^{n-1}(x_j) \right) - \frac{\tau}{\epsilon} \beta_{\text{K}} \left( \frac{\tau}{\epsilon} (\Pi_h^0 u_h^{n-1})(x_j) + {\chi_h}^{n-1}(x_j) \right), \end{eqnarray}$

(7.6b)

where $(u_h^n, \psi)_h = \int_{\Omega} I_h(u_h^n \psi)$ , with $I_h : C^0(\overline{\Omega}) \rightarrow V_h$ being the linear interpolator. We see that (7.6b) is equivalent to the Chernoff formula (7.3b) with $\mu = \frac{\tau}{\epsilon}$ . This shows the subtle difference in the scheme by ^[5] and ^[44]: in ^[44] numerical integration $(u_h^n, \psi)_h$ is employed whereas in ^[5] the projection operator is used to obtain $(P_h^0 u_h^n, \psi)$ .

Remark 7.1. (7.6b) is obtained using the discretization

$\begin{eqnarray} \epsilon\frac{{\chi_h}^n(x_j) - {\chi_h}^{n-1}(x_j)}{\tau} + H^{-1}({\chi_h}^{n}(x_j)) = u_h^{n-1}(x_j). \end{eqnarray}$

(7.7)

To account for $u_h^{n-1}$ in (7.7) instead of $u_h^n$ , the stability condition $\tau \leq \epsilon$ must be enforced ^[31].

P1-P1 fully implicit approach in ^[3] A scheme of a different flavor is explored in ^[3]. Fully implicit in time formulation of (2.6) is approximated with $u_h^n \in V_h$

$\begin{eqnarray} (w_h^{n},\psi)+\tau(\nabla u_h^n,\nabla \psi) = \tau (f^{n},\psi)+(w_h^{n-1},\psi), ; w_h^n \in P_h^0(\alpha(u_h^n)). \end{eqnarray}$

(7.8)

This discrete systems is, in practice, similar to (7.2) except it applies to (ST) and pays considerable attention to the identification and the role of $P_{h}^0$ in this last equation. While no regularization of $\alpha$ is needed, the results are not accompanied by numerical computations. Optimal convergence rates ^[3](Theorem 2.7) apply to all models (1.2)

$\begin{eqnarray} \|\theta - \theta_h \|_{L^2(L^2)} + \ \| w - w_h \|_{L^{\infty}(H^{-1})} \ \leq C \left( \tau + h \left[ \ln{\left( h^{-1} \right)}\right]^{1 + \frac{r}{2}} \right) \end{eqnarray}$

(7.9)

where $0 \leq r < \infty$ is a constant such that

$\begin{eqnarray} |\alpha(\theta) | \leq C(1 + |\theta|^r). \end{eqnarray}$

(7.10)

In particular, for (ST) and (2.8) we have $r = 1$ . This corresponds to an order of convergence $\approx 1$ when $\tau = O(h)$ . However, ^[3] do not include numerical implementation details or experiments.

Other P1-P1 schemes include those in relaxation models ^[31].

7.4. Convergence studies of P1-based and P0-P0 algorithms.

We carry out convergence analyses when the true solution $\theta, w$ is known, or when only its proxy, a fine grid solution $\theta_{h_{fine}}, w_{h_{fine}}$ is available. For time dependent problems the use of $\theta_{h_{fine}}, w_{h_{fine}}$ renders convergence order verification in some norms impractical. Therefore in this paper we only use $L^p$ norms; we also confine ourselves only to the error in scalar unknowns, deferring the discussion of the error in the fluxes to future work.

7.4.1. Error norms

As stated in Introduction, the notation $f \in L^{\infty}(L^p)$ means $f \in L^{\infty}(0, T;L^p(\Omega))$ . We denote the norms appropriately, say as $||{f}||_{{\infty, p}}$ ; all these have to be approximated. In particular, we distinguish the grid norms

$\begin{eqnarray*} \label{eq:gridNorm1} \lVert \theta_{err} \rVert_{\infty , p} & = & \max\limits_n ||{\theta_h-\theta(\cdot,t^n)}||_{{p,\Delta}} = \max\limits_n \left( h \sum\limits_{i = 1}^M |\theta_i-\theta(x_i,t^n)|^p \right)^{1/p}. \\ \label{eq:gridNorml2l2} \lVert \theta_{err} \rVert_{2 , 2} & = &\left(\tau \sum\limits_n ||{\theta_h^n-\theta(\cdot,t^n)}||_{{2,\Delta}}\right)^{1/2}. \end{eqnarray*}$

We also use a tighter approximation than the grid norm

$\begin{eqnarray*} \label{eq:LNormp} \lVert \theta_{err} \rVert_{\infty,{L^p}} = \max\limits_n \left(\int_ \Omega |\theta_h^n-\theta(\cdot,t^n) |^p\right)^{1/p} \approx \max\limits_n \left(\sum _{j = 1}^M \sum\limits_{l = 1}^{10} |\theta^n_j-\theta(x_{j+\frac{l-5}{10}},t^n)|^p\right)^{1/p}. \end{eqnarray*}$

7.4.2. P1-based schemes

The theoretically estimated convergence error depends, as usual, on $h, \tau$ and $\epsilon$ . Of the literature we have reviewed, ^[26,31,44] provide numerical results supporting their theoretically derived convergence rates. We set up additional experiments and convergence study as a prequel to the study of our P0-P0 schemes for (ST) problem: we use analytical solutions given in ^[26,44] for $d = 1$ . The theoretical convergence orders as well as those we tested are tabulated in Table 13. We compare the [P1-P0] schemes given in ^[5,44] in Examples 3.1 and 3.2, respectively; see the plots shown in . Additionally, we mention the work in ^[4] which provides a semi-discrete scheme which we further discretize in space following ^[5]. For this reason, along with using a small regularization parameter $\epsilon$ in (7.4), the numerical solution following the formulation as in ^[4] is virtually indistinguishable from ^[5] in Example 2 and hence is not shown in Figure 13.

Table 13. Convergence orders obtained in Examples 3.1 (VV) and 3.2 (RBC) for P1-based schemes from literature and for our P0-P0 scheme. Also listed is the order of convergence for $\|{\theta_{err} \|_{ 2, 2}}$ derived theoretically in the literature which is denoted by (Lit.).

Scheme $\theta$ - $w$	Case	$\\|\theta_{err} \\|_{\infty, 2}$	$\\|\theta_{err}\\|_{\infty, 1}$	$\\|\theta_{err}\\|_{2, 2}$	(Lit.)	$\\|w_{err} \\|_{\infty, 2}$	$\\|w_{err}\\|_{\infty, 1}$	$\\|w_{err}\\|_{2, 2}$	Notes
P1-P0 ^[4]	(VV)	0.47	0.52	0.52	0.25	0.27	0.53	0.24	$\tau = O(h)$
P1-P0 ^[4]	(RBC)	0.42	0.44	0.42	0.25	0.26	0.5	0.25	$\tau = O(h)$
P1-P0 ^[5]	(VV)	0.84	0.84	0.91	0.66	0.51	0.96	0.52	$\tau = O(h^2)$
P1-P0 ^[5]	(RBC)	0.70	0.72	0.72	0.66	0.45	0.87	0.45	$\tau = O(h^2)$
P1-P0 ^[44]	(VV)	0.66	0.69	0.73	0.50	0.35	0.70	0.35	$\tau = O(h^{\frac{3}{2}})$
P1-P0 ^[44]	(RBC)	0.14	0.13	0.26	0.50	0.28	0.47	0.24	$\tau = O(h^{\frac{3}{2}})$
P0-P0	(VV)	1.38	1.43	1.27	-	0.50	1.00	0.47	$\tau = O(h)$
P0-P0	(RBC)	1.08	1.09	1.07	-	0.45	0.96	0.47	$\tau = O(h)$

| Show Table

DownLoad: CSV

Figure 13. Top row: Solutions with P1-P0 to (VV) Example 2 with $M = 10, \tau = 10^{-2}$ at $t = 0.2$ . Bottom row: Solutions P1-P0 to (RBC) Example 3 with $M = 20, \tau = 500$ at time $t = 200000$ .

DownLoad: Full-Size Img PowerPoint

All together, we observe that $\theta$ is approximated well, qualitatively and quantitatively, by all the schemes. However, $w$ appears to be "smeared" near the interface and features higher errors. Of all the schemes, that in ^[5] produces the best approximation and convergence rates. The examples we show are consistent with the rates proved in ^[44]; see Table 13.

7.5. Auxiliary properties

We recall now one especially useful and well known feature of numerical implementation of (3.6) when numerical integration is used to get the entries of matrix $\mathcal{K}$ . As discussed in ^[24,49], these follow from the use of trapezoidal (T) and midpoint (M) rules $(\int_{\Omega} k^{-1}(q_h)_1 \psi_{i+1/2, j})_{TM}$ which leads to an algebraic expression involving the so-called transmissivity or transmissibility edge factor $\mathcal{T}_{i+1/2, j} = h_{y, j}k_{i+1/2, j}$ , where

$\begin{eqnarray*} k_{i+1/2,j} = \left( \tfrac{1}{2} h_i k_{ijk}^{-1} + \tfrac{1}{2} h_{i+1} k_{i+1,j,k}^{-1}\right)^{-1} \end{eqnarray*}$

is the weighted scaled harmonic average of $k_{ij}$ and $k_{i+1, j}$ so that (see, e.g., ^[24](eq.(15)))

$\begin{eqnarray} \left(\int_{\Omega} k^{-1}(q_h)_1 \psi_{i+1/2,j}\right)_{TM} = q_{i+1/2,j} \mathcal{T}_{j+1/2,j}^{-1}h_{y,j},\; \end{eqnarray}$

(7.11)

We emphasize that the use of harmonic averages leads to conservative fluxes; this is in contrast with primal formulations in which arithmetic averages are used, and fluxes are not conservative.

Next, we integrate $\int_{\Omega} \nabla \cdot \psi_{i+1/2} \theta_h$ and obtain, for the total normal flux $h_{y, j}q_h \cdot\nu \vert_{\gamma_{i+1/2, j}} = h_{y, j}q_{i+1/2, j}$ across $\gamma_{i+1/2, j}$ $\omega_{i, j}$ and the first component $(q_h)_1$ of $q_h$

$\begin{eqnarray} h_{y,j} \mathcal{T}_{i+1/2,j}^{-1}(q_{i+1/2,j}) = -(\theta_{i+1,j} -\theta_{i,j}). \end{eqnarray}$

(7.12a)

The right hand side explains how the entries of matrix $\mathcal{B}^T$ arise.

A similar expression is obtained for $(q_h)_2$ and $(q_{i, j+1/2})$ using $\mathcal{T}_{i, j+1/2}^{-1}$ .

$\begin{eqnarray} h_{x,i} \mathcal{T}_{i,j+1/2}^{-1}(q_{i,j+1/2}) = -(\theta_{i,j+1} -\theta_{i,j}). \end{eqnarray}$

(7.12b)

The use of factors $\mathcal{T}$ allows the interpretation of (7.12) as a finite difference analogue of $q = -k\nabla \theta$ . It also explains why the matrix $\mathcal{K}$ is diagonal.

Last but not least, (7.11) is done also for the second component of flux $(q_h)_2$ across $\omega_{ij}$ .

Handling Dirichlet boundary conditions. The expression (7.12) is valid for the interior edges away from external boundaries and interfaces. If the edge $\gamma_{i+1/2, j}$ is on a boundary on which Dirichlet condition $\theta\vert_{\gamma_{i+1/2, j}} = \theta_*$ is prescribed with some $\theta_*$ , then the analogue of (7.12) reads

$\begin{eqnarray} h_{y,j} \mathcal{T}_{*,j+1/2,j}^{-1}(q_{*,i+1/2,j}) = -(\theta_* -\theta_{i,j}), \; \text{ with } \mathcal{T}_{*,j+1/2,j}^{-1} = h_{y,j}\left( \tfrac{1}{2} h_i k_{ijk}^{-1} \right). \end{eqnarray}$

(7.13)

The portion of the expression associated with $\theta_*$ moves to the right hand side of (3.3), but the structure of the matrix does not change.

7.6. Linear heat equation with a jump condition

We consider the special linear case with piecewise constant coefficients of (4.1) from Example 4

$\begin{eqnarray} \partial_t \theta+q_x = 0, q = -k\theta_x, x \in (0, x^*)\cup ( x^*,1), \end{eqnarray}$

(7.14a)

$\begin{eqnarray} \theta(0,t) = 1, \theta(1,t) = 0, \end{eqnarray}$

(7.14b)

$\begin{eqnarray} \theta^{(2)}( x^*,t)-\theta^{(1)}( x^*,t) = - \rho_{R} q^{(1)}( x^*,t); \; q^{(2)}( x^*,r) = q^{(1)}( x^*,t). \end{eqnarray}$

(7.14c)

$\begin{eqnarray} \theta(x,0) = 0. \end{eqnarray}$

(7.14d)

7.6.1. Stationary solution

We also consider the stationary solution $q^{stat}(x) = q(x), \theta^{stat}(x) = \theta(x)$ to (7.14) which satisfy

$\begin{eqnarray} q_x = 0, q = -k\theta_x, x \in (0, x^*)\cup ( x^*,1), \end{eqnarray}$

(7.15a)

$\begin{eqnarray} \theta(0) = 1, \theta(1) = 0, \end{eqnarray}$

(7.15b)

$\begin{eqnarray} \theta^{(2)}( x^*)-\theta^{(1)}( x^*) = - \rho_{R} q^{(1)}( x^*);\;\; q^{(2)}( x^*) = q^{(1)}( x^*). \end{eqnarray}$

(7.15c)

It is not difficult to find its analytical solution, with $q = q^{(1)} = q^{(2)} = C$ and

$\begin{eqnarray*} \theta^{(1)}(x) = -\tfrac{C}{k_1}(x)+C_1; x \in \Omega^{(1)}; \;\; \theta^{(2)}(x) = -\tfrac{C}{k_2}(x)+C_2; x \in \Omega^{(2)}; \\ C = \left[ ( x^*(\tfrac{1}{k_2}-\tfrac{1}{k_1}- \rho_{R})+\tfrac{a}{k_1}-\tfrac{b}{k_2}\right]^{-1}(\theta_b-\theta_a); \; C_1 = \theta_a+\tfrac{a}{k_1}C; \; C_2 = \theta_b+\tfrac{b}{k_2}C. \end{eqnarray*}$

In particular in the special case $k_1 = 1 = k_2$ , and $a = 0, b = 1, \theta_0 = 1, \theta_1 = 0, x^* = \tfrac{1}{2}$ we get

$\begin{eqnarray} q = \tfrac{1}{1+\rho}; \;\; \theta^{(1)}(x) = \tfrac{-1}{1+\rho}x+1;\;\; \theta^{(2)}(x) = \tfrac{-1}{1+\rho}x+\tfrac{1}{1+ \rho_{R}} \end{eqnarray}$

(7.16)

From this we have the jump $[\theta]\vert_{ x^*} = \theta^{(2)}(x^*)-\theta^{(1)}(x^*) = -\tfrac{ \rho_{R}}{1+ \rho_{R}}$ scaling linearly with $\rho_{R}$ as $\rho_{R}\downarrow 0$ , unlike $O(\sqrt{ \rho_{R}})$ predicted for the evolution problem in (4.3).

Acknowledgements

This work was partially supported by the National Science Foundation DMS-1912938 and DMS-1522734, and by the NSF IRD plan 2019-21 for M. Peszynska.

This material is based upon work supported by and while serving at the National Science Foundation. Any opinion, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.

The authors would like to thank the anonymous reviewers whose remarks helped to improve the paper.

Conflict of interest

The authors declare there is no conflicts of interest.

References

[1]	E. Ahmed, J. Jaffré, J. E. Roberts, A reduced fracture model for two-phase flow with different rock types, Math. Comput. Simul., 137 (2017), 49–70. https://doi.org/10.1016/j.matcom.2016.10.005 doi: 10.1016/j.matcom.2016.10.005
[2]	C. Alboin, J. Jaffré, J. E. Roberts, X. Wang, C. Serres, Domain decomposition for some transmission problems in flow in porous media, Numer. Treat. Multiphase Flows Porous Media, 552 (2000), 22–34.
[3]	J. Rulla, N. J. Walkington, Optimal rates of convergence for degenerate parabolic problems in two dimensions, SIAM J. Numer. Anal., 33 (1996), 56–67. https://doi.org/10.1137/0733004 doi: 10.1137/0733004
[4]	E. Magenes, R. H. Nochetto, C. Verdi, Energy error estimates for a linear scheme to approximate nonlinear parabolic problems, ESAIM: M2AN, 21 (1987), 655–678.
[5]	R. H. Nochetto, C. Verdi, The combined use of a nonlinear Chernoff formula with a regularization procedure for two-phase Stefan problems, Numer. Funct. Anal. Optim., 9 (1988), 1177–1192. https://doi.org/10.1080/01630568808816279 doi: 10.1080/01630568808816279
[6]	J. A. Wheeler, Simulation of heat transfer from a warm pipeline buried in permafrost, Am. Inst. Chem. Eng., (1973), 267–284.
[7]	J. A. Wheeler, Permafrost thermal design for the trans-Alaska pipeline, Moving Boundary Probl., (1978), 267–284.
[8]	D. Nicolsky, V. Romanovsky, G. Tipenko, Using in-situ temperature measurements to estimate saturated soil thermal properties by solving a sequence of optimization problems, The Cryosphere, 1 (2007), 41–58. https://doi.org/10.5194/tc-1-41-2007 doi: 10.5194/tc-1-41-2007
[9]	S. Marchenko, V. Romanovsky, G. Tipenko, Numerical modeling of spatial permafrost dynamics in alaska, in Proceedings of Ninth International Conference on Permafrost, Ninth International Conference on Permafrost, (2008), 1125–1130.
[10]	E. E. Jafarov, S. S. Marchenko, V. E. Romanovsky, Numerical modeling of permafrost dynamics in alaska using a high spatial resolution dataset, The Cryosphere, 6 (2012), 613–624. https://doi.org/10.5194/tc-6-613-2012 doi: 10.5194/tc-6-613-2012
[11]	T. Kelley, J. Rulla, Solution of the discretized Stefan problem by Newton's method, Nonlinear Anal., 14 (1990), 851–872. https://doi.org/10.1016/0362-546X(90)90025-C doi: 10.1016/0362-546X(90)90025-C
[12]	M. F. Wheeler, M. Peszynska, Computational engineering and science methodologies for modeling and simulation of subsurface applications, Adv. Water Resour., 25 (2002), 1147–1173.
[13]	C. Dawson, S. Sun, M. F. Wheeler, Compatible algorithms for coupled flow and transport, Comput. Methods Appl. Mech. Eng., 193 (2004), 2565–2580. https://doi.org/10.1016/j.cma.2003.12.059 doi: 10.1016/j.cma.2003.12.059
[14]	R. L. Michalowski, A constitutive model of saturated soils for frost heave simulations, Cold Reg. Sci. Technol., 22 (1993), 47–63. https://doi.org/10.1016/0165-232X(93)90045-A doi: 10.1016/0165-232X(93)90045-A
[15]	Y. Zhang, R. Michalowski, Thermal-hydro-mechanical analysis of frost heave and thaw settlement, J. Geotech. Geoenviron. Eng., 141 (2015).
[16]	H. Liu, P. Maghoul, A. Shalaby, A. Bahari, Thermo-hydro-mechanical modeling of frost heave using the theory of poroelasticity for frost-susceptible soils in double-barrel culvert sites, Trans. Geotech., 20 (2019). https://doi.org/10.1016/j.trgeo.2019.100251 doi: 10.1016/j.trgeo.2019.100251
[17]	F. Yu, P. Guo, Y. Lai, D. Stolle, Frost heave and thaw consolidation modelling. part 2: One-dimensional thermohydromechanical (THM) framework, Can. Geotech. J., 57 (2020), 1595–1610.
[18]	M. Peszynska, A. Trykozko, Pore-to-core simulations of flow with large velocities using continuum models and imaging data, Comput. Geosci., 17 (2013), 623–645. https://doi.org/10.1007/s10596-013-9344-4 doi: 10.1007/s10596-013-9344-4
[19]	M. Peszynska, A. Trykozko, G. Iltis, S. Schlueter, D. Wildenschild, Biofilm growth in porous media: Experiments, computational modeling at the porescale, and upscaling, Adv. Water Res., 95 (2016), 288–301. https://doi.org/10.1016/j.advwatres.2015.07.008 doi: 10.1016/j.advwatres.2015.07.008
[20]	C. Shin, A. Alhammali, L. Bigler, N. Vohra, M. Peszynska, Coupled flow and biomass-nutrient growth at pore-scale with permeable biofilm, adaptive singularity and multiple species, Math. Biosci. Eng., 18 (2021), 2097–2149. https://doi.org/10.3934/mbe.2021108 doi: 10.3934/mbe.2021108
[21]	M. Peszynska, J. Umhoefer, C. Shin, Reduced model for properties of multiscale porous media with changing geometry, Computation, 9 (2021), 1–44.
[22]	T. Arbogast, M. F. Wheeler, N. Y. Zhang, A nonlinear mixed finite element method for a degenerate parabolic equation arising in flow in porous media, SIAM J. Numer. Anal., 33 (1996), 1669–1687. https://doi.org/10.1137/S0036142994266728 doi: 10.1137/S0036142994266728
[23]	C. S. Woodward, C. N. Dawson, Analysis of expanded mixed finite element methods for a nonlinear parabolic equation modeling flow into variably saturated porous media, SIAM J. Numer. Anal., 37 (2000), 701–724.
[24]	M. Peszynska, E. Jenkins, M. F. Wheeler, Boundary conditions for fully implicit two-phase flow model, in Recent Advances in Numerical Methods for Partial Differential Equations and Applications (eds. X. Feng and T. P. Schulze), Contemporary Mathematics Series, American Mathematical Society, 306 (2002), 85–106.
[25]	R. E. Showalter, Monotone operators in Banach space and nonlinear partial differential equations, vol. 49 of Mathematical Surveys and Monographs, American Mathematical Society, Providence, RI, 1997. https://doi.org/10.1090/surv/049
[26]	J. C. Rogers, A. E. Berger, M. Ciment, The alternating phase truncation method for numerical solution of a Stefan problem, SIAM J. Numer. Anal., 16 (1979), 563–587.
[27]	A. Visintin, Models of phase transitions, vol. 28 of Progress in Nonlinear Differential Equations and their Applications, Birkhäuser Boston, Inc., Boston, MA, 1996. https://doi.org/10.1007/978-1-4612-4078-5
[28]	L. W. Lake, Enhanced oil recovery, Prentice Hall, 1989.
[29]	T. Roubicek, The Stefan problem in heterogeneous media, Ann. l'Inst. Henri Poincaré Anal. Linéaire, 6 (1989), 481–501.
[30]	E. Javierre, C. Vuik, F. Vermolen, S. van der Zwaag, A comparison of numerical models for one-dimensional Stefan problems, J. Comput. Appl. Math., 192 (2006), 445–459. https://doi.org/10.1016/j.cam.2005.04.062 doi: 10.1016/j.cam.2005.04.062
[31]	X. Jiang, R. Nochetto, A P1–P1 finite element method for a phase relaxation model Ⅰ: Quasiuniform mesh, Siam J. Numer. Anal., 35 (1998), 1176–1190.
[32]	S. M. Allen, J. M. Cahn, A microscopic theory for antiphase boundary motion and its application to antiphase domain coarsening, Acta Metall., 27 (1979), 1085–1095. https://doi.org/10.1016/0001-6160(79)90196-2 doi: 10.1016/0001-6160(79)90196-2
[33]	Y. Oono, S. Puri, Study of phase-separation dynamics by use of cell dynamical systems, I. modeling, Phys. Rev. A, 38 (1988), 434–453. https://link.aps.org/doi/10.1103/PhysRevA.38.434 doi: 10.1103/PhysRevA.38.434
[34]	J. F. Blowey, C. M. Elliott, The Cahn–Hilliard gradient theory for phase separation with non-smooth free energy part Ⅰ: Mathematical analysis, Eur. J. Appl. Math., 2 (1991), 233–280. https://doi.org/10.1017/S095679250000053X doi: 10.1017/S095679250000053X
[35]	P. Reddy, C. Gunasekar, A. Mhaske, V. Krishna, Enhancement of thermal conductivity of pcm using filler graphite powder materials, IOP Conf. Ser.: Mater. Sci. Eng., 402 (2018). https://doi.org/10.1088/1757-899X/402/1/012173 doi: 10.1088/1757-899X/402/1/012173
[36]	Rubitherm ® Technologies GmbH, 2021. https://www.rubitherm.eu
[37]	D. Yu, Z. He, Shape-remodeled macrocapsule of phase change materials for thermal energy storage and thermal management, Appl. Energy, 247 (2019), 503–516. https://doi.org/10.1016/j.apenergy.2019.04.072 doi: 10.1016/j.apenergy.2019.04.072
[38]	M. L. Cohen, Measurement of the thermal properties of human skin: A review, J. Invest. Dermatol., 69 (1977), 333–338. https://doi.org/10.1111/1523-1747.ep12507965 doi: 10.1111/1523-1747.ep12507965
[39]	Engineering Toolbox, 2021. https://www.engineeringtoolbox.com
[40]	Wikipedia, 2021. https://en.wikipedia.org
[41]	O. B. Andersland, B. Ladanyi, Frozen Ground Engineering, 2nd edition, Wiley, ASCE, Hoboken, 2004.
[42]	G. H. Meyer, Multidimensional Stefan problems, SIAM J. Numer. Anal., 10 (1973), 522–538. https://doi.org/10.1137/0710047 doi: 10.1137/0710047
[43]	J. W. Jerome, M. E. Rose, Error estimates for the multidimensional two-phase Stefan problem, Math. Comput., 39 (1982), 377–414. https://doi.org/10.1090/S0025-5718-1982-0669635-2 doi: 10.1090/S0025-5718-1982-0669635-2
[44]	C. Verdi, A. Visintin, Error estimates for a semi-explicit numerical scheme for Stefan-type problems., Numer. Math., 52 (1987/88), 165–186. http://eudml.org/doc/133231
[45]	D. Boffi, M. Fortin, F. Brezzi, Mixed Finite Element Methods and Applications, Springer series in computational mathematics, 2013.
[46]	A. Ern, J. L. Guermond, Theory and practice of finite elements, vol. 159 of Applied Mathematical Sciences, Springer-Verlag, New York, 2004. https://doi.org/10.1007/978-1-4757-4355-5
[47]	F. Brezzi, M. Fortin, Mixed and hybrid finite element methods, vol. 15 of Springer Series in Computational Mathematics, Springer-Verlag, New York, 1991.
[48]	A. Weiser, M. F. Wheeler, On convergence of block-centered finite differences for elliptic problems, SIAM J. Numer. Anal., 25 (1988), 351–375. https://doi.org/10.1137/0725025 doi: 10.1137/0725025
[49]	T. F. Russell, M. F. Wheeler, Finite element and finite difference methods for continuous flows in porous media, Math. Reservoir Simul., (1983), 35–106.
[50]	R. E. Showalter, Nonlinear degenerate evolution equations in mixed formulation, SIAM J. Math. Anal., 42 (2010), 2114–2131. https://doi.org/10.1137/100789427 doi: 10.1137/100789427
[51]	E. Schneid, P. Knabner, F. Radu, A priori error estimates for a mixed finite element discretization of the Richards' equation, Numer. Math., 98 (2004), 353–370. https://doi.org/10.1007/s00211-003-0509-2 doi: 10.1007/s00211-003-0509-2
[52]	M. Ulbrich, Semismooth Newton methods for variational inequalities and constrained optimization problems in function spaces, vol. 11 of MOS-SIAM Series on Optimization, Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA, 2011.
[53]	T. Roubicek, Numerical solution of the nonlinear heat equation in heterogeneous media, Numer. Funct. Anal. Optim., 11 (1990), 793–810.
[54]	T. Roubicek, A finite-element approximation of Stefan problems in heterogeneous media, in Free Boundary Value Problems, (1990), 267–275.
[55]	R. Glowinski, M. F. Wheeler, Domain decomposition and mixed finite element methods for elliptic problems, in First International Symposium on Domain Decomposition Methods for Partial Differential Equations (eds. R. Glowinski, G. H. Golub, G. A. Meurant and J. Periaux), SIAM, Philadelphia, (1988), 144–172.
[56]	A. Quarteroni, A. Valli, Domain decomposition methods for partial differential equations, Numerical Mathematics and Scientific Computation, Oxford University Press, New York, 1999. https://doi.org/10.1007/978-94-011-5412-38
[57]	I. Pawlow, A variational inequality approach to generalized two-phase Stefan problem in several space variables, Ann. Mat. Pura Appl., 131 (1982), 333–373. https://doi.org/10.1007/BF01765160 doi: 10.1007/BF01765160
[58]	M. Niezgodka, I. Pawłow, A generalized Stefan problem in several space variables, Appl. Math. Optim., 9 (1982), 193–224. https://doi.org/10.1007/BF01460125 doi: 10.1007/BF01460125
[59]	N. L. Gibson, F. P. Medina, M. Peszynska, R. E. Showalter, Evolution of phase transitions in methane hydrate, J. Math. Anal. Appl., 409 (2014), 816–833. https://doi.org/10.1016/j.jmaa.2013.07.023 doi: 10.1016/j.jmaa.2013.07.023
[60]	M. Peszynska, R. Showalter, J. Webster, Advection of methane in the hydrate zone: Model, analysis and examples, Mathe. Methods Appl. Sci., 38 (2015), 4613–4629. https://doi.org/10.1002/mma.3401 doi: 10.1002/mma.3401
[61]	M. Peszynska, C. Shin, Stability of a numerical scheme for methane transport in hydrate zone under equilibrium and non-equilibrium conditions, Comput. Geosci., 5 (2021), 1855–1886. https://doi.org/10.1007/s10596-021-10053-2 doi: 10.1007/s10596-021-10053-2
[62]	D. Foster, T. Costa, M. Peszynska, G. Schneider, Multiscale modeling of solar cells with interface phenomena, J. Coupled Syst. Multiscale Dyn., 1 (2013), 179–204. https://doi.org/10.1166/jcsmd.2013.1013 doi: 10.1166/jcsmd.2013.1013
[63]	T. Costa, D. Foster, M. Peszynska, Domain decomposition for heterojunction problems in semiconductors, in VECPAR 2014, High Performance Computing for Computational Science - VECPAR 2014, 11th International Conference, (2014), 92–101. http://arXiv.org/abs/1412.7946.
[64]	T. Costa, D. H. Foster, M. Peszynska, Progress in modeling of semiconductor structures with heterojunctions, J. Coupled Syst. Multiscale Dyn., 3 (2015), 66–86. https://doi.org/10.1166/jcsmd.2015.1066 doi: 10.1166/jcsmd.2015.1066
[65]	M. Discacciati, E. Miglio, A. Quarteroni, Mathematical and numerical models for coupling surface and groundwater flows, Appl. Numer. Math., 43 (2002), 57–74. https://doi.org/10.1016/S0168-9274(02)00125-3 doi: 10.1016/S0168-9274(02)00125-3
[66]	M. Sandells, D. Flocco, Introduction to the Physics of the Cryosphere, Morgan and Claypool, 2014.
[67]	T. Osterkamp, C. Burn, Permafrost, in Encyclopedia of Atmospheric Sciences, (2003), 1717–1729. https://doi.org/10.1016/B0-12-227090-8/00311-0
[68]	X. Zhang, Y. Wu, E. Zhai, P. Ye, Coupling analysis of the heat-water dynamics and frozen depth in a seasonally frozen zone, J. Hydrol., 593 (2021). https://doi.org/10.1016/j.jhydrol.2020.125603 doi: 10.1016/j.jhydrol.2020.125603
[69]	X. Zhang, E. Zhai, Y. Wu, D. Sun, Y. Lu, Theoretical and numerical analyses on hydro–thermal–salt–mechanical interaction of unsaturated salinized soil subjected to typical unidirectional freezing process, Int. J. Geomech., 21 (2021), 04021104. https://doi.org/10.1061/(ASCE)GM.1943-5622.0002036 doi: 10.1061/(ASCE)GM.1943-5622.0002036
[70]	J. Wettlaufer, M. G. Worster, Premelting dynamics, Annu. Rev. Fluid Mech., 38 (2006), 427–452. https://doi.org/10.1146/annurev.fluid.37.061903.175758 doi: 10.1146/annurev.fluid.37.061903.175758
[71]	A. W. Rempel, J. S. Wettlaufer, M. G. Worster, Premelting dynamics in a continuum model of frost heave, J. Fluid Mech., 498 (2004), 227–244. https://doi.org/10.1146/annurev.fluid.37.061903.175758 doi: 10.1146/annurev.fluid.37.061903.175758
[72]	C. W. Lovell, Temperature effects on phase composition and strength of partially-frozen soil, Highw. Res. Board Bull., 1957.
[73]	V. Romanovsky, T. Osterkamp, Effects of unfrozen water on heat and mass transport in the active layer and permafrost, Permafrost Periglacial Processes, 11 (2000), 219–239. https://doi.org/10.1002/1099-1530(200007/09) doi: 10.1002/1099-1530(200007/09)
[74]	Ulrich Hornung, Homogenization and porous media, vol. 6 of Interdisciplinary Applied Mathematics, Springer-Verlag, New York, 1997.
[75]	H. Zhang, J. Zhang, Z. Zhang, J. Chen, Y. You, A consolidation model for estimating the settlement of warm permafrost, Comput. Geotech., 76 (2016), 43–50. https://doi.org/10.1016/j.compgeo.2016.02.013 doi: 10.1016/j.compgeo.2016.02.013
[76]	C. T. Kelley, Iterative methods for linear and nonlinear equations, SIAM, Philadelphia, 1995.
[77]	M. Paolini, G. Sacchi, C. Verdi, Finite element approximations of singular parabolic problems, Int. J. Numer. Methods Eng., 26 (1988), 1989–2007. https://doi.org/10.1002/nme.1620260907 doi: 10.1002/nme.1620260907

This article has been cited by:

1.	Malgorzata Peszynska, Naren Vohra, Lisa Bigler, Upscaling an Extended Heterogeneous Stefan Problem from the Pore-Scale to the Darcy Scale in Permafrost, 2024, 22, 1540-3459, 436, 10.1137/23M1552000
2.	M. Peszynska, Z. Hilliard, N. Vohra, Coupled flow and energy models with phase change in permafrost from pore- to Darcy scale: Modeling and approximation, 2024, 450, 03770427, 115964, 10.1016/j.cam.2024.115964
3.	Naren Vohra, Malgorzata Peszynska, Iteratively coupled mixed finite element solver for thermo-hydro-mechanical modeling of permafrost thaw, 2024, 22, 25900374, 100439, 10.1016/j.rinam.2024.100439
4.	Naren Vohra, Malgorzata Peszynska, Robust conservative scheme and nonlinear solver for phase transitions in heterogeneous permafrost, 2024, 442, 03770427, 115719, 10.1016/j.cam.2023.115719
5.	Julieta Bollati, María F. Natale, José A. Semitiel, Domingo A. Tarzia, Determination of one unknown coefficient in a two-phase free boundary problem in an angular domain with variable thermal conductivity and specific heat, 2024, 531, 0022247X, 127775, 10.1016/j.jmaa.2023.127775
6.	Zizheng Zhang, Wenlong Tian, Yuli Hu, Bo Li, Numerical study on phase change propulsion mechanism of thermal underwater glider based on spectral method, 2024, 256, 13594311, 124066, 10.1016/j.applthermaleng.2024.124066

Reader Comments

Your name:*

Email:*
© 2022 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Electronic Research Archive

1 1.3

Metrics

Article views(3326) PDF downloads(164) Cited by(6)

Preview PDF

Download XML

Export Citation

Article outline

Show full outline

Abstract

Introduction

Formulations and approximation of Stefan problem in a single material

Approximation to Stefan problem using P0-P0 finite elements and CCFD for a single material

Stefan problem with heterogeneity, and its P0-P0 approximation allowing for thermal resistivity of interface

Permafrost models

Summary and future works

Appendix

Acknowledgements

Conflict of interest

References

Figures and Tables

Figures(13) / Tables(13)

Electronic Research Archive

Heterogeneous Stefan problem and permafrost models with P0-P0 finite elements and fully implicit monolithic solver

Related Papers:

Abstract

1. Introduction

1.1. Notation

2. Formulations and approximation of Stefan problem in a single material

2.1. Stefan problem

2.1.1. Free boundary

2.2. Weak formulation

2.2.1. Kirchhoff transformation

2.2.2. Approximations of (ST)

2.3. Well-posedness

2.4. Approximation of solutions to Stefan problem in a single material: review

2.4.1. Time stepping for an ODE

2.4.2. Spatial approximations

3. Approximation to Stefan problem using P0-P0 finite elements and CCFD for a single material

3.1. P0-P0 scheme from discrete mixed formulation for linear parabolic problems

3.1.1. Literature notes

3.2. P0-P0 scheme for nonlinear heat equation with single-valued nonlinearity

3.2.1. Literature for nonlinear problems in mixed form

3.2.2. P0-P0 algorithm for nonlinear single-valued function

3.2.3. Details

3.2.4. Nonlinear solver

3.3. Addressing multivalued (ST) problem with P0-P0

3.4. Results of P0-P0 algorithm

3.4.1. Convergence tests

3.5. Robustness of nonlinear solver

3.6. Summary

4. Stefan problem with heterogeneity, and its P0-P0 approximation allowing for thermal resistivity of interface

4.1. Literature on heterogeneous Stefan pbm

4.2. P0-P0 scheme

4.2.1. P0-P0 formulation with mixed finite elements

4.2.2. Monolithic scheme

4.2.3. Estimates of the jump on the interface

4.3. Numerical examples of P0-P0 for heterogeneous domain with two materials

4.3.1. Linear heat equation with a jump

4.3.2. Examples for Stefan problem for heterogeneous domain with two materials

4.4. Summary

5. Permafrost models

5.1. Energy conservation in permafrost

5.1.1. Permafrost as a porous medium

5.1.2. Energy balance in permafrost

5.1.3. Experimental models

5.1.4. Thermal conductivity in permafrost

5.2. Approximation of permafrost model with P0-P0

5.2.1. Approaches in literature

5.2.2. Convergence of P0-P0 scheme for permafrost model

5.2.3. Simulation of realistic examples

5.3. Solver challenges and open questions

5.4. Summary

6. Summary and future works

7. Appendix

7.1. Exact solution for Example 2.1

7.2. Auxiliary finite dimensional result

7.3. Detailed comparison of P1-based formulations

7.4. Convergence studies of P1-based and P0-P0 algorithms.

7.4.1. Error norms

7.4.2. P1-based schemes

7.5. Auxiliary properties

7.6. Linear heat equation with a jump condition

7.6.1. Stationary solution

Acknowledgements

Conflict of interest

References

This article has been cited by:

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Figures and Tables

Other Articles By Authors

Related pages

Tools

Export File

Citation

Format

Content

Catalog