
Citation: Stephanie H. DeLuca, Samuel L. DeLuca, Andrew Leaver-Fay, Jens Meiler. RosettaTMH: a method for membrane protein structure elucidation combining EPR distance restraints with assembly of transmembrane helices[J]. AIMS Biophysics, 2016, 3(1): 1-26. doi: 10.3934/biophy.2016.1.1
[1] | Vincent Renault, Michèle Thieullen, Emmanuel Trélat . Optimal control of infinite-dimensional piecewise deterministic Markov processes and application to the control of neuronal dynamics via Optogenetics. Networks and Heterogeneous Media, 2017, 12(3): 417-459. doi: 10.3934/nhm.2017019 |
[2] | Hyeontae Jo, Hwijae Son, Hyung Ju Hwang, Eun Heui Kim . Deep neural network approach to forward-inverse problems. Networks and Heterogeneous Media, 2020, 15(2): 247-259. doi: 10.3934/nhm.2020011 |
[3] | Didier Georges . Infinite-dimensional nonlinear predictive control design for open-channel hydraulic systems. Networks and Heterogeneous Media, 2009, 4(2): 267-285. doi: 10.3934/nhm.2009.4.267 |
[4] | Thibault Liard, Raphael Stern, Maria Laura Delle Monache . A PDE-ODE model for traffic control with autonomous vehicles. Networks and Heterogeneous Media, 2023, 18(3): 1190-1206. doi: 10.3934/nhm.2023051 |
[5] | Chenchen Li, Chunyan Zhang, Lichao Feng, Zhihui Wu . Note on prescribed-time stability of impulsive piecewise-smooth differential systems and application in networks. Networks and Heterogeneous Media, 2024, 19(3): 970-991. doi: 10.3934/nhm.2024043 |
[6] | Yin Yang, Aiguo Xiao . Dissipativity and contractivity of the second-order averaged L1 method for fractional Volterra functional differential equations. Networks and Heterogeneous Media, 2023, 18(2): 753-774. doi: 10.3934/nhm.2023032 |
[7] | Mostafa Bendahmane . Analysis of a reaction-diffusion system modeling predator-prey with prey-taxis. Networks and Heterogeneous Media, 2008, 3(4): 863-879. doi: 10.3934/nhm.2008.3.863 |
[8] | Bing Feng, Congyin Fan . American call option pricing under the KoBoL model with Poisson jumps. Networks and Heterogeneous Media, 2025, 20(1): 143-164. doi: 10.3934/nhm.2025009 |
[9] | Shui-Nee Chow, Xiaojing Ye, Hongyuan Zha, Haomin Zhou . Influence prediction for continuous-time information propagation on networks. Networks and Heterogeneous Media, 2018, 13(4): 567-583. doi: 10.3934/nhm.2018026 |
[10] | Simone Göttlich, Stephan Martin, Thorsten Sickenberger . Time-continuous production networks with random breakdowns. Networks and Heterogeneous Media, 2011, 6(4): 695-714. doi: 10.3934/nhm.2011.6.695 |
In this paper, we deal with a new variational problem suggested by applications to satellite image segmentation. The satellite images are an important source for extracting landscape boundaries and other vegetation structures, which can provide extremely useful insights for applications in environmental monitoring, agriculture, forestry, and other related fields (see, for instance, [1,2,3]). In particular, in agricultural crop field classification, one fundamental problem is to provide a disjunctive decomposition of a fixed domain $ \Omega\subset\mathbb{R}^2 $ onto a finite number of nonempty subsets $ \Omega = \Omega_1\cup\Omega_2\cup \dots\cup \Omega_K $ such that each of these subsets could be associated with a crop that is grown in this area, with forest regions, water zones, and so on, and this correspondence must be established at a rather a high level of accuracy. Up-to-date and accurate crop maps (or crop field classification) are needed to update agricultural statistics, to provide agricultural crop yield prediction, and are often used in environmental modeling. Typically, such an association between a given region and some agricultural crop can be made through the detection and quantitative assessment of green vegetation, which is one of the major applications of remote sensing studies. The information obtained in this way is a source of knowledge used for environmental resources management. One of the ways to get such information is the determination of the so-called vegetation indices (see [4] for a review and development). Since over the years many vegetation indices have been proposed for determining the vigor and health of vegetation, the reliability of information about vegetation directly and strictly depends on the fidelity, preciseness, and smoothness of the corresponding vegetation indices within each particular crop field.
The most commonly used vegetation index (Ⅵ) is the so-called slope-based infrared percentage vegetation index (IPVI),
$ IPVI:=u2,du1,d+u2,d,0≤IPVI≤1, $
|
(1.1) |
where $ u_{i, d} = u_{i, d}(x_1, x_2) $, $ i = 1, 2 $, with $ (x_1, x_2)\in\Omega $, are functions of two variables representing the intensity of red ($ Red $) and near-infrared ($ NIR $) reflectance of some region $ \Omega $ of $ \mathbb{R}^2 $, respectively.
Thus, each pixel $ x = (x_1, x_2)\in\Omega $ of the original image can be associated with the corresponding IPVI-feature. The problem, which is suggested by application to remote sensing satellite image processing, consists of computing a decomposition
$ Ω=Ω1∪Ω2∪⋯∪ΩK∪K $
|
(1.2) |
of the domain of the image $ F:\Omega\rightarrow \mathbb{R}^2 $ such that
$ ({\rm{a}}) $ the IPVI-characteristic varies smoothly and/or slowly within each $ \Omega_j $;
$ ({\rm{b}}) $ the IPVI-characteristic varies discontinuously and/or rapidly across most of the boundary $ K $ between different $ \Omega_j $.
The distinguished features of this statement that do not permit to reduce it to the standard settings of the segmentation problem (see, for instance, the Mumford-Shah energy-based model [5] or the models proposed by Alvarez [6], Guichard [7], Lions, Morel [8], Caselles [9], and others) are the following ones:
● Each region $ \Omega_1 $, $ \Omega_2 $, $ \dots $, $ \Omega_K $ should consist of pixels that can be reasonably grouped according to the IPVI-characteristic. Simultaneously, these regions should be easy to differentiate according to the chosen image feature;
● The respective interiors of image regions should have a more or less simple geometry without gaps. Boundaries of image regions should be smooth enough but also accurate with respect to the chosen image feature;
● The most restrictive obstacle in the construction of such decomposition is the fact that these subdomains should not overlap the borders between fields or contain any fragments of such borders, meaning that they cannot take in even small parts of different fields with arguably different crops.
All of these make the abovementioned segmentation problem rather challenging. It is enough to observe that a precise consideration of this problem demonstrates that the quantitative interpretation of remote sensing information from vegetation is a complex task. Many studies have limited this interpretation by assuming that the extracting vegetation information is uniformly and smoothly distributed within the particular crop fields. However, this assumption is broken when trying to apply these types of vegetation indices on heterogeneous canopies such as plantations with a mixed combination of soil, weeds, and other crops, or plantation where the vegetation of interest has different IPVI-characteristic due to spatial variability. The main idea, we realize in the new setting of the variational problem, can be briefly described as follows. We propose to make use of the so-called $ f $-decomposition instead of the standard Chan-Vese "active contours without edges" model [10]. The role of the function $ f:\Omega\rightarrow \mathbb{R} $ in such decomposition of $ \Omega $ has to guarantee that the new objects $ \left\{\Omega_j\right\}_{j = 1}^K $ after the $ f $-decomposition will have homogeneous values of the target function $ f $ within each separate field (a similar point of view can be found in [4,11]). In particular, in the case of the agricultural applications, where $ \Omega $ stands for a zone of interest, the IPVI-characteristic can be considered as the main feature of this area, i.e., in this case $ f(x) = \mathrm{IPVI}(x) $ for all $ x\in\Omega $. Thus, the main idea that we push forward is to formulate the segmentation problem as a constrained minimization problem in a special anisotropic functional space, with the "effect of anisotropy" we associate with the structure topology of IPVI-distribution. As a result, the main benefit of such an approach can be briefly described as follows:
$ ({\rm{i}}) $ It prevents the appearance of subdomains containing zones of discontinuity of $ f $ or places where this function tends to change rapidly by utilizing the main characteristic of the given function $ f $ — the unit normal vector field $ \theta:\Omega\rightarrow \mathbb{R}^2 $ to the level sets of $ f $. This characteristic has been used to construct the so-called anisotropic diffusion tensor $ M^f $, which can be defined as a square parametrized matrix function $ M^f(x) = \left[I-\eta^2\theta(x)\otimes\theta(x)\right] $. This matrix plays a central role in the process of $ f $-decomposition of domain $ \Omega $, and we associate with it special anisotropic perimeters of the obtained subdomains (segments).
$ ({\rm{ii}}) $ The second characteristic feature of our approach is the fact that we apply the Jeffreys divergence to replace the standard Euclidean distance in the fidelity term of the objective functional. It is well-known that compared with Euclidean distance, Jeffreys divergence leads to more accurate results in information measurement (for the details of this metric and its advantages, we refer to [12,13,14]).
The paper is organized as follows. In Section 2, we give some preliminaries related to the space of functions of bounded variation and other notions. Section 3 is devoted to the description of some specification of the standard $ BV(\Omega) $ space. In particular, we introduce the so-called anisotropic version for the total variation of $ L^1(\Omega) $-functions. At the end of this section, we show that some of the results of Samson et al. [15] can be extended to the case of subsets with a finite anisotropic perimeter.
The precise setting of the main constrained minimization problem and its previous analysis are given in Section 4. We show that the proposed minimization problem can be interpreted as a special case of the piecewise-constant Mumford-Shah segmentation problem and the Chan-Vese active contour model without edges. We study this problem in the space of $ L^1(\Omega) $-functions with bounded anisotropic total variation, where the type of anisotropy is closely related to the structure of the image $ f $ which is involved in the segmentation procedure. It is worth emphasizing that the anisotropic perimeter of the segments with uniform distribution of $ IPVI $-characteristic can drastically differ from the standard one because the natural edges of the original image $ f $ can affect it significantly. Despite the "natural" setting of the proposed segmentation problem, the existence of its minimizers seems to be an open issue nowadays. The main reason is that the objective functional is neither coercive nor lower semicontinuous with respect to the weak-$ \ast $ topology of $ BV(\Omega) $ space. This circumstance stimulated us to introduce a special family of unconstrained two-parametric problems to approximate the original one. We show that each of those approximated problems is well-posed and has a nonempty set of minimizers.
Section 5 aims to derive optimality conditions for approximated problems and provide their formal substantiation. In Section 6, we study the asymptotic behavior of the approximated problems and their solutions. The main question is to find out whether the convergence of minima of approximated problems is to minima of the original segmentation problem as small parameters tend to zero. Our main result of this section asserts that: If the original problem has a nonempty set of minimizers, then some of them can be successfully attained by the solution of approximated problems. Otherwise, we can come to a solution of the relaxed version of the original segmentation problem. In Section 7, the implementation of the proposed optimization problem is illustrated, providing numerical experiences with satellite images.
A detailed description of the algorithm (including the method of marching squares for the generation of closed contours in the two-dimensional case) and finite-difference scheme for the proposed approach with the results of numerical simulation using the real-life satellite images will be considered in the forthcoming paper.
We denote by $ \mathcal{L}^2 $ the Lebesgue $ 2 $-dimensional measure in $ \mathbb{R}^2 $ and by $ \mathcal{H}^1 $ the $ 1 $-dimensional Hausdorff measure. Let $ \Omega $ be a bounded open subset of $ \mathbb{R}^2 $ with a Lipschitz boundary. For any subset $ E\subset\Omega $, we denote by $ |E| $ its $ 2 $-dimensional Lebesgue measure $ \mathcal{L}^2(E) $. For a subset $ E\subseteq \Omega $, let $ \overline{E} $ denote its closure and $ \partial E $ its boundary. We define the characteristic function $ \chi_E $ of $ E $ by
$ \chi_E(x): = \left\{ 1, for x∈E,0, otherwise. \right. $
|
For a function $ u $, we denote by $ u|_E $ its restriction to the set $ E\subseteq\Omega $, and by $ u^{\partial E} $ its trace on $ \partial E $. Let $ C^\infty_0(\Omega) $ be the infinitely differentiable functions with compact support in $ \Omega $. The $ k $-dimensional Hausdorff measure is denoted by $ \mathcal{H}^k $, and is the restriction of the measure $ \mu $ to the set $ E $. For a Banach space $ X $, its dual is $ X^\ast $ and $ \left < \cdot, \cdot\right > _{X^\ast, X} $ is the duality form on $ X^\ast\times X $. By $ \rightharpoonup $ and $ \stackrel{\ast}{\rightharpoonup} $, we denote the weak and weak$ ^\ast $ convergence in normed spaces.
We remind here of the most common definitions of some functional spaces that we will use later on.
Throughout the paper, we will often use the concept of weak and strong convergence in $ L^1(\Omega) $. Let $ \left\{f_n\right\}_{n\in \mathbb{N}} $ be a bounded sequence of functions in $ L^1(\Omega) $. We recall that $ \left\{f_n\right\}_{n\in \mathbb{N}} $ is called equi-integrable on $ \Omega $ if for any $ \delta > 0 $ there is a $ \tau = \tau(\delta) $, such that $ \int_S |f_n|\, dx < \delta $ for every measurable subset $ S\subset\Omega $ of Lebesgue measure $ |S| < \tau $. Then, the following assertions are equivalent for $ L^1(\Omega) $-bounded sequences (see, for instance, [16,17]):
$ ({\rm{i}}) $ a sequence $ \left\{f_k\right\}_{k\in \mathbb{N}} $ is weakly convergent in $ L^1(\Omega) $;
$ ({\rm{ii}}) $ the sequence $ \left\{f_k\right\}_{k\in \mathbb{N}} $ is equi-integrable.
The following theorem holds.
Theorem 1. [16] If a bounded sequence $ \left\{f_k\right\}_{k\in \mathbb{N}}\subset L^1(\Omega) $ is equi-integrable and $ f_k\rightarrow f $ almost everywhere in $ \Omega $, then $ f_k\rightarrow f $ strongly in $ L^1(\Omega) $.
We set $ |E| = \mathcal{L}^2(E) $, the Lebesgue measure of a measurable set $ E\subset\mathbb{R}^2 $. Let $ \mathcal{M}(\Omega; \mathbb{R}^2) $ be the space of all $ \mathbb{R}^2 $-valued Borel measures which is, according to the Riesz theory, the dual of the space $ C_0(\Omega; \mathbb{R}^2) $ of all continuous vector-valued functions $ \varphi(\cdot) $ with compact support in $ \Omega $ and equipped with the uniform norm.
$ \|\varphi\|_\infty = \left(\sum\limits_{i = 1}^2 \sup\limits_{x\in\Omega}|\varphi_i(x)|^2\right)^{1/2}. $ |
Note that $ \mathcal{M}(\Omega; \mathbb{R}^2) $ is isomorphic to the product space
$ \mathcal{M}^2(\Omega): = \prod\limits_{i = 1}^2\mathcal{M}(\Omega) $ |
and that $ \mu = (\mu_1, \mu_2)\in \mathcal{M}(\Omega; \mathbb{R}^2) $ $ \Leftrightarrow $ $ \mu_i\in \left[C_0(\Omega)\right]^\ast $, $ i = 1, 2 $.
Given a vector-valued measure $ \mu:\mathcal{B}(\Omega)\rightarrow \mathbb{R}^2 $, we use the notation $ |\mu| $ for its total variation. We recall that
$ |μ|(E)=sup{2∑i=1∫Ωφidμi : φ=[φ1φ2]∈C0(E;R2), ‖φ‖∞≤1}, $
|
(2.1) |
for all measurable $ E\subseteq \Omega $.
The usual weak-$ \ast $ topology on $ \mathcal{M}(\Omega; \mathbb{R}^2) $ is defined as the weakest topology on $ \mathcal{M}(\Omega; \mathbb{R}^2) $, for which the maps $ \mu\mapsto \sum_{i = 1}^2 \int_{\Omega} \varphi_i\, d\mu_i $ are continuous for every $ \varphi\in C_0(\Omega; \mathbb{R}^2) $.
By $ BV(\Omega) $, we denote the space of all functions $ u\in L^1(\Omega) $, for which their distributional derivatives are representable by finite Borel measures in $ \Omega $, i.e.,
$ \int_{\Omega}u\frac{\partial\phi}{\partial x_i}\, dx = -\int_{\Omega}\phi\, D_iu, \quad\forall\, \phi\in C^\infty_0(\Omega), \ i = 1, 2 $ |
for some $ \mathbb{R}^2 $-valued measure $ Du = (D_1u, D_2u)\in \mathcal{M}^2(\Omega) $. It can be shown that $ BV(\Omega) $, endowed with the norm $ \|u\|_{BV(\Omega)} = \|u\|_{L^1(\Omega)}+|Du|(\Omega) $, is a Banach space, where in view of Eq (2.1), the total variation of $ Du $ in $ \Omega $ can be defined as
$ |Du|(Ω):=∫Ω|Du|=sup{∫Ωudivφdx : φ∈C10(Ω;R2), |φ(x)|≤1 for x∈Ω}. $
|
(2.2) |
We recall that the product topology of the strong topology of $ L^1(\Omega) $ for $ u $ and of the weak-$ \ast $ topology of measures for $ Du $ is called the weak-$ \ast $ topology of $ BV(\Omega) $, and it is denoted $ BV $-$ \ast $. As a result, a sequence $ \{f_k\}_{k = 1}^\infty $ $ \ast $-converges to $ f $ in $ BV(\Omega) $ if, and only if, the two following conditions hold (see [18, p.124]): $ f_k\to f $ strongly in $ L^1(\Omega) $ and $ D f_k\stackrel{\ast}{\rightharpoonup} Df $ weakly-$ ^\ast $ in $ \mathcal{M}(\Omega; \mathbb{R}^2) $, i.e.,
$ \lim\limits_{k\to\infty} \int_{\Omega} \left(\phi, Df_k\right) = \int_{\Omega} \left(\phi, Df\right), \quad\forall\, \phi\in C_0(\Omega;\mathbb{R}^2), $ |
where, in fact, $ Df_k = (D_{x_1}f_k, D_{x_2} f_k)\in \mathcal{M}(\Omega; \mathbb{R}^2) $ and, therefore, the notation $ \int_{\Omega} \phi\, Df_k $ should be interpreted as follows:
$ \int_{\Omega} \left(\phi, Df_k\right): = \int_{\Omega}\phi_1\, D_{x_1} f_k+ \int_{\Omega}\phi_2\, D_{x_2} f_k. $ |
Moreover, if $ \{f_k\}_{k = 1}^\infty\subset BV(\Omega) $ converges strongly to some $ f $ in $ L^1(\Omega) $ and $ \sup_{k\in \mathbb{N}}\int_\Omega |Df_k| < +\infty $, then (see, for instance, [16] and [18])
$ (i) f∈BV(Ω) and ∫Ω|Df|≤lim infk→∞∫Ω|Dfk|;(ii) fk∗⇀f in BV(Ω). $
|
(2.3) |
A simple criterion for the $ BV $-$ \ast $ convergence can be stated as follows (see [18, p.125], [19, Theorem 1.19]):
Proposition 2. A sequence $ \left\{u_k\right\}_{k\in \mathbb{N}}\subset BV(\Omega) $ $ BV $-$ \ast $-converges to $ u $ if, and only if, $ \left\{u_k\right\}_{k\in \mathbb{N}} $ is bounded in $ BV(\Omega) $ and $ \left\{u_k\right\}_{k\in \mathbb{N}} $ converges to $ u $ strongly in $ L^1(\Omega) $.
The following embedding result for the $ BV $-function is very useful with respect to the variational problem that we study in this paper.
Proposition 3. [16, p.378] Let $ \Omega $ be an open bounded Lipschitz subset of $ \mathbb{R}^2 $. Then, the embedding $ BV(\Omega)\hookrightarrow L^2(\Omega) $ is continuous and the embeddings $ BV(\Omega)\hookrightarrow L^p(\Omega) $ are compact for all $ p $ such that $ 1\le p < 2 $. Moreover, there exists a constant $ C_{em} > 0 $, which depends only on $ \Omega $ and $ p $ such that for all $ u $ in $ BV(\Omega) $,
$ \left( \int_{\Omega}|u|^p\, dx\right)^{1/p}\le C_{em}\|u\|_{BV(\Omega)}, \quad\forall\, p\in[1, 2]. $ |
We also make use of the following property concerning to approximation of $ BV $-functions by smooth ones.
Theorem 4. [20] Assume $ f\in BV(\Omega) $. Then, there exist a sequence $ \left\{f_k\right\}_{k = 1}^\infty\subset BV(\Omega)\cap C^\infty(\Omega) $ such that
$ fk→f in L1(Ω),|Dfk|(Ω)→|Du|(Ω) as k→∞. $
|
Let $ E $ be an $ \mathcal{L}^2 $-measurable subset of $ \mathbb{R}^2 $ with finite Lebesgue measure. Let $ \chi_E $ be its characteristic function. Following R. Caccioppoli [21], we say that $ E $ is a set with a finite perimeter in $ \Omega $ if $ \chi_E\in BV(\Omega) $. This means that the distributional gradient $ D\chi_E $ is a vector-valued measure with finite total variation. The total variation $ |D\chi_E|(\Omega) $ is called the perimeter of $ E $ in $ \Omega $, i.e., $ P(E, \Omega) = |D\chi_E|(\Omega) $ and, therefore,
$ P(E,Ω)=sup{∫ΩχEdivφdx : φ∈C10(Ω;R2), ‖φ‖L∞(Ω;R2)≤1}. $
|
(2.4) |
We also notice that if $ E_t: = \left\{x\in\Omega\ :\ f(x) > t\right\} $ stands for the level set for given $ f\in BV(\Omega) $ and $ t\in\mathbb{R} $, then (see [20, Theorem 5.5.1]) $ E_t $ has a finite perimeter for a.e. $ t\in\mathbb{R} $.
The main goal of this section is to introduce some specification to the standard space of functions with bounded variation $ BV(\Omega) $. This option is mainly motivated by the natural application in image segmentation problems. In view of this, we introduce the so-called anisotropic version for the total variation of the $ BV $-functions. In principle, the notion of anisotropic total variation is not new in the literature (we refer to [22,23] for more details), and our representation for anisotropic total variation can be viewed as some specification of the rule in [22]. The main interest is in the application of this concept to the generalization of the well-known results of Samson et al. [15]. Namely, we focus on the proof of the following relation:
$ \lim\limits_{{{\varepsilon}}\to 0}\int_{\Omega}\left|M D \left[\chi_E\right]_{{\varepsilon}}\right| = \int_{\Omega}\left|M D \chi_E\right|, $ |
where $ \left[\chi_E\right]_{{\varepsilon}} $ stands for a smooth approximation of the characteristic function of a given set $ E\subset\Omega $. The main results of this section are presented in the form of Lemma 5 and its corollary.
Let $ \Omega $ be an open bounded and connected subset of $ \mathbb{R}^2 $ with a Lipschitz boundary $ \partial\Omega $. Let $ M:\Omega\rightarrow \mathbb{R}^{2\times 2} $ be a given matrix function such that
$ M(x)=Mt(x),β−1|ξ|2≤(ξ,M(x)ξ)≤β|ξ|2∀ξ∈R2,M(⋅)∈C∞(Ω;R2×2), $
|
(3.1) |
for some constant $ \beta > 1 $, i.e., $ M(x) $ is a positive-definite symmetric matrix for each $ x\in\Omega $.
We say that $ u\in L^1(\Omega) $ is a function with bounded anisotropic variation if
$ \sup\left\{\int_{\Omega}u{\mathop{\mathrm{div}}}(M \varphi)\, dx\ :\ \varphi\in C^1_0(\Omega;\mathbb{R}^2), \ |\varphi(x)|\le 1\ \forall\, x\in\Omega)\right\} < +\infty. $ |
It means that there exists a Radon measure $ Du $ such that
$ \int_\Omega u\, {\mathop{\mathrm{div}}}(M\varphi)\, dx = -\int_{\Omega} \left(\varphi, MD u\right)\quad \forall \varphi\in C^1_0(\Omega;\mathbb{R}^2). $ |
Moreover, for the total variation of the measure $ MDu $, we have the following representation:
$ |MDu|(Ω):=∫Ω|MDu|=sup{∫Ωudiv(Mφ)dx : φ∈C10(Ω;R2), |φ(x)|≤1 ∀x∈Ω)}. $
|
(3.2) |
Then, property (3.1) implies that
$ β−1(‖u‖L1(Ω)+|MDu|(Ω))≤‖u‖L1(Ω)+|Du|(Ω)⏟‖u‖BVM(Ω)≤β−1(‖u‖L1(Ω)+|MDu|(Ω)), ∀u∈BV(Ω). $
|
(3.3) |
Hence, the expression can be viewed as an equivalent norm to standard one $ \|\cdot\|_{BV_M(\Omega)} $ on the space $ BV(\Omega) $. As a result, the main properties of $ BV $-functions (see, for instance, [18,19,20]) can be reformulated with respect to the new norm. In particular, let $ M:\Omega\rightarrow \mathbb{R}^{2\times 2} $ be a given matrix function with property (3.1). Then:
$ ({\rm{j}}) $ If $ \left\{u_k\right\}_{k\in\mathbb{N}}\subset BV(\Omega) $ is a bounded sequence, then there exist a subsequence $ \left\{u_{k_i}\right\}_{i\in\mathbb{N}} $ and a function $ u\in BV(\Omega) $ such that
$ uki→ustrongly in L1(Ω),MDuki∗⇀MDu weakly-∗in M(Ω;R2); $
|
$ ({\rm{jj}}) $ If $ \{u_k\}_{k = 1}^\infty\subset BV(\Omega) $ converges strongly to some $ u $ in $ L^1(\Omega) $ and satisfies $ \sup_{k\in \mathbb{N}}\int_\Omega|MDu_k| < +\infty $, then
$ uk∗⇀u in BV(Ω), u∈BV(Ω), and ∫Ω|MDu|≤lim infk→∞∫Ω|MDuk|; $
|
(3.4) |
$ ({\rm{jjj}}) $ Let $ u\in BV(\Omega) $ be an arbitrary function. Then, there exists a sequence $ \left\{u_k\right\}_{k\in\mathbb{N}}\subset C^\infty(\Omega)\cap BV(\Omega) $ such that $ \|u_k-u\|_{L^1(\Omega)}\rightarrow 0 $ and $ { }\int_{\Omega} |M Du_k|\rightarrow \int_{\Omega} |M Du| $ as $ k\to\infty $.
By analogy with the standard notion, we can also define an anisotropic version of the perimeter, namely, we say that an $ \mathcal{L}^2 $-measurable subset $ U\subset\Omega $ has a finite $ M $-perimeter if $ |M D\chi_U|(\Omega) < +\infty $, where $ \chi_U(\cdot) $ stands for the characteristic function of the set $ U $. In this case, we write down
$ Per(U;M;Ω):=∫Ω|MDχU|=sup{∫Udiv(Mφ)dx:φ∈C10(Ω;R2), |φ(x)|≤1 ∀x∈Ω}. $
|
(3.5) |
Moreover, for any $ u\in BV(\Omega) $ and $ M(\cdot)\in C^\infty(\Omega; \mathbb{R}^{2\times 2}) $ with property (3.1), the following anisotropic Coarea formula
$ ∫Ω|MDu|=∫+∞−∞Per({u>t};M;Ω)dt $
|
(3.6) |
holds true (see [22]).
It is clear that, for a given level parameter $ l\in\mathbb{R} $, the area of the region $ \left\{x\in\Omega\ :\ \varphi(x) > l\right\} $ can be defined as
$ A\left\{x\in\Omega\ :\ \varphi(x)\ge l\right\} = \int_{\Omega} \chi_{\{\varphi(x)\ge l\}}\, dx. $ |
Here, $ \chi_{\{\varphi(x)\ge l\}}(x) $ stands for the characteristic function of the set $ \{x\in\Omega:\varphi(x)\ge l\} $. To obtain some approximation of this area, we can substitute $ \chi_E $ by its smooth approximation. With that in mind, for a given small positive parameter $ {{\varepsilon}} $, we fix a positive symmetric mollifier $ \eta\in C^\infty_c(\mathbb{R}^2) $, i.e., $ \eta(x) $ is zero outside a compact set $ B_1 = \left\{x\in\mathbb{R}^2: |x|\le 1\right\} $,
$ \int_{B_1} \eta(x)\, dx = 1, \quad\eta(x)\ge 0, \ \text{and }\ \nu(x) = \mu(|x|)\ \text{for some function }\ \mu:\mathbb{R}^{+}\to\mathbb{R}, $ |
and set
$ [χE]ε=ηε∗χEwithηε(x)=ε−2η(xε), $
|
(3.7) |
that is,
$ \left[\chi_E\right]_{{\varepsilon}} = {{\varepsilon}}^{-2}\int_{\mathbb{R}^2} \eta\left(\frac{x-z}{{{\varepsilon}}}\right) \chi_E(z)\, dz = \int_{\mathbb{R}^2} \eta(w) \chi_E(x+{{\varepsilon}} w)\, dw. $ |
Then, using the standard properties of mollifiers, it can be shown that
$ ({\rm{i}}) $ $ \left[\chi_E\right]_{{\varepsilon}}\rightarrow \chi_E $ in $ L^1(\Omega) $ for any measurable subset $ E $ of $ \mathbb{R}^2 $ as $ \emptyset\to 0 $;
$ ({\rm{ii}}) $ $ 0\le \left[\chi_E\right]_{{\varepsilon}}(x)\le 1 $ for all $ x\in \mathbb{R}^2 $;
$ ({\rm{iii}}) $ If $ E\subset\mathbb{R}^2 $ is bounded and $ g\in L^1(\mathbb{R}^2) $, then $ { }\int_{\mathbb{R}^2} \left[\chi_E\right]_{{\varepsilon}} g\, dx = \int_{\mathbb{R}^2}\chi_E\left[g\right]_{{\varepsilon}}\, dx $;
$ ({\rm{iv}}) $ If $ {{\varepsilon}}\subseteq \Omega $, then $ \mathrm{supp}\, \left[\chi_E\right]_{{\varepsilon}}\subseteq \Omega_{{\varepsilon}} = \left\{x\in\mathbb{R}^2 : \mathrm{dist}\, (x, \Omega)\le{{\varepsilon}} \right\} $.
The following property will be utilized in our further analysis (see Section 6) and it can be considered as a natural generalization of the results of Samson et al. [15] (see also for comparison [19, Proposition 1.15]).
Lemma 5. Let $ E $ be an open set such that $ \overline{E}\subset\Omega $ and $ E $ has a finite $ M $-perimeter $ \mathrm{Per}(E; M;\Omega) $, where $ M:\Omega\rightarrow \mathbb{R}^{2\times 2} $ is a given matrix function with property (3.1). Let $ \left[\chi_E\right]_{{\varepsilon}} $ be the mollified characteristic function described above. Then,
$ limε→0∫Ω|MD[χE]ε|=∫Ω|MDχE|. $
|
(3.8) |
Proof. Taking into account the standard properties of mollifiers, we have $ \left\{\left[\chi_E\right]_{{\varepsilon}}\right\}_{{{\varepsilon}} > 0}\subset BV(\Omega) $ and $ \left[\chi_E\right]_{{\varepsilon}}\rightarrow \chi_E $ in $ L^1(\Omega) $ as $ {{\varepsilon}}\to 0 $. Then, inequality (3.4) implies that
$ ∫Ω|MDχE|≤lim infε→0∫Ω|MD[χE]ε|. $
|
(3.9) |
To establish a reverse inequality, we fix an arbitrary function $ \varphi\in C^1_0(\Omega; \mathbb{R}^2) $ with $ |\varphi(x)|\le 1 $. Then, there exists a vector-valued function $ \zeta\in C^1_0(\Omega; \mathbb{R}^2) $ such that $ \zeta = M\varphi $. Therefore, by (iii)-property of mollifiers, we have
$ ∫Ω[χE]εdiv(Mφ)dx=∫ΩχE[div(Mφ)]εdx=∫ΩχEdiv[Mφ]εdx=∫ΩχEdiv[ζ]εdx≤sup{∫ΩεχEdiv[ζ]εdx:ζ∈C10(Ω;R2), ζ=Mφ, |M−1[ζ]ε(x)|≤1 ∀x∈Ωε}≤sup{∫ΩεχEdiv(Mφ)dx:φ∈C10(Ωε;R2), |φ(x)|≤1 ∀x∈Ωε}=∫Ωε|MDχE|. $
|
(3.10) |
Taking then the supremum over all such $ \varphi $, we arrive at the relation
$ \int_{\Omega}\left|M D\left[\chi_E\right]_{{\varepsilon}}\right|\le \int_{\Omega_{{\varepsilon}}} |M_{{\varepsilon}} D\chi_E|. $ |
Hence,
$ lim supε→0∫Ω|MD[χE]ε|≤limε→0∫Ωε|MDχE|. $
|
Since $ E $ and $ \Omega $ are open sets and $ \overline{E}\subset\Omega $, it follows that
$ \int_{\partial \Omega} |D\chi_E| = 0\quad\text{and}\quad \int_{\partial \Omega} |M D\chi_E| = 0. $ |
Therefore,
$ limε→0∫Ωε|MDχE|=∫¯Ω|MDχE|=∫Ω|MDχE|+∫∂Ω|MDχE|=∫Ω|MDχE|,limε→0∫Ωε|DχE|=∫¯Ω|DχE|=∫Ω|DχE|+∫∂Ω|DχE|=∫Ω|DχE|<+∞. $
|
As a result, we obtain
$ \limsup\limits_{{{\varepsilon}}\to 0} \int_{\Omega}\left|M D\left[\chi_E\right]_{{\varepsilon}}\right|\le \int_{\Omega} |M D\chi_E|. $ |
It remains to combine this inequality with Eq (3.9).
Arguing similarly, we can generalize the Eq (3.8) as follows:
Corollary 6. Let $ E\subset\Omega $ and the matrix $ M:\Omega\rightarrow \mathbb{R}^{2\times 2} $ be the same as in Lemma 5. Then,
$ limε→0∫Ω|MεD[χE]ε|=∫Ω|MDχE|, $
|
(3.11) |
where $ \left\{M_{{\varepsilon}}\right\}_{{{\varepsilon}} > 0}\subset C^\infty(\Omega; \mathbb{R}^{2\times 2}) $ stands for any smooth approximation of the matrix $ M $ such that
$ limε→0‖Mε−M‖C(Ωε;R2×2)=0,Mε(x)=Mtε(x),β−1|ξ|2≤(ξ,Mε(x)ξ)≤β|ξ|2∀ξ∈R2, ∀ε>0. $
|
Let $ f:\Omega\rightarrow \mathbb{R} $ be a given function. Hereinafter, we associate the function $ f $ with a given gray scale image. We define a smoothed version of the original image using the convolution of $ f $ with a Gaussian kernel
$ Gσ(x)=1(√2πσ)2exp(−|x|22σ2),σ>0, $
|
(4.1) |
i.e., $ f_{\sigma} = \left(G_\sigma\ast f(\cdot)\right)(x): = { }\int_{\Omega} G_\sigma(x-y) f(y)\, dy $. Here, $ \sigma > 0 $ is a given small positive value.
Since $ f_{\sigma}\in C^\infty(\overline{\Omega}) $, it follows that the boundaries of level sets $ \left\{x\in\Omega\, :\, f_{\sigma}(x)\ge\lambda \right\} $, for all feasible $ \lambda\in[0, C_f] $, can be described by smooth curves with finite length. So, at all points $ x\in\Omega $ of each level sets of $ f_{\sigma} $, we can define a unit normal vector field $ \theta(x) $ following the rule
$ θ(x)={∇fσ(x)|∇fσ(x)|−1,if |∇fσ(x)|≠0,0,otherwise. $
|
(4.2) |
Then, we associate with the vector field $ \theta:\Omega\rightarrow \mathbb{R}^2 $ the following linear operator $ R:\mathbb{R}^2\rightarrow \mathbb{R}^2 $:
$ Rξ:=ξ−η2(θ,ξ)θ=[I−η2θ⊗θ]ξ,∀ξ∈R2, $
|
(4.3) |
where $ \eta\in (0, 1) $ is a given threshold which should be sufficiently close to $ 1 $. Setting
$ Mf(x)=[I−η2θ(x)⊗θ(x)], $
|
(4.4) |
we see that
$ Mf(x)=[Mf(x)]t,(1−η2)|ξ|2≤(ξ,Mf(x)ξ)≤|ξ|2,Mf(⋅)∈C∞(Ω;R2×2), $
|
(4.5) |
i.e., $ M^f $ is a symmetric and positive-definite matrix on $ \Omega $ and it satisfies property (3.1) with $ \beta = (1-\eta^2)^{-1} $.
Remark 7. Let's assume for a moment that $ \xi = \nabla v(x) $, where $ v\in W^{1, 1}(\Omega) $ is a given function and $ x\in\Omega $ is a Lebesgue point of $ f $ in which the original image $ f $ is not expected to change drastically in any direction, i.e., $ x $ is not close to a discontinuity of $ f $ or a zone where $ f $ tends to change rapidly. Then, Eq (4.4) implies that $ M^f $ can be represented at this point as a unit matrix. So, at this point, we have $ M^f\nabla v\approx \nabla v $.
On the other hand, if we consider a point $ x\in\Omega $ that is close to a discontinuity of $ f $, then $ M^f\nabla v $ reduces to $ (1-\eta^2)\nabla v $ if the gradient $ \nabla v(x) $ at this point is colinear to $ \theta(x) $, and to $ \nabla v(x) $ provided $ \nabla v(x) $ is orthogonal to $ \theta(x) $. In view of this, the expression $ M^f\nabla v $ can be interpreted as the directional total variation of $ v $ along the vector field $ \theta $ (see [24] for the details).
Definition 8. We say that a gray scale image $ f:\Omega\rightarrow \mathbb{R} $ is feasible for the segmentation procedure using level sets if there exists a value $ \gamma > 0 $ such that
$ f\in L^\infty(\Omega), \ f(x)\ge \gamma > 0\ {a.e.\; in }\ \Omega. $ |
We denote the set of all feasible images by $ \mathcal{F}_\gamma $.
We are now in a position to state the main object of our interest in this paper. Let $ f\in\mathcal{F}_\gamma $ be a given image, and let $ \varphi:\Omega\rightarrow \mathbb{R} $ be a level set function such that $ \varphi\in BV(\Omega) $. We associate with this function a collection of $ m+2 $ distinct level values $ l_0 < l_1 < \dots < l_m < l_{m+1} $ such that $ l_0\le \varphi(x)\le l_{m+1} $ almost everywhere in $ \Omega $.
Then, the constrained optimization problem we are going to consider can be stated as follows:
$ J(c,φ)=∫Ω(f−c0)log(fc0)χ{φ(x)<l1}dx+m−1∑j=1∫Ω(f−cj)log(fcj)[χ{φ(x)>lj}−χ{φ(x)>lj+1}]dx+∫Ω(f−cm)log(fcm)χ{φ(x)>lm}dx+αm∑j=1∫Ω|MfDχ{φ(x)>lj}|→infφ∈Ξ, $
|
(4.6) |
where $ \alpha > 0 $ is a weight coefficient and the set of feasible solutions is defined as follows:
$ Ξ={(c,φ)∈Rm+1×BV(Ω)|l0≤φ(x)≤lm+1 a.e. in Ω,c=(c0,c1,…,cm), cj≥0, j=0,…,m}. $
|
(4.7) |
It is worth noticing that the objective functional $ J $ is well-defined on the set $ \Xi $. Indeed, in this case the assumption $ \varphi\in BV(\Omega) $ implies that the level sets $ E_t = \left\{x\in \Omega: \varphi(x) > t\right\} $ have finite anisotropic perimeter $ \mathrm{Per}(\left\{u > t\right\}; M^f; \Omega) $ for $ \mathcal{L}^1 $ a.a. $ t\in\mathbb{R} $. Since
$ \mathrm{Per}(\left\{\varphi > l_j\right\};M^f;\Omega) = \int_{\Omega} |M^f D\chi_{\{\varphi(x) > l_j\}}| < \infty $ |
for each $ j = 1, \dots, m $, it follows that $ J(c, \varphi) < \infty $ for each $ (c, \varphi)\in\Xi $.
Minimization problems (4.6) and (4.7) can be interpreted as a special case of the piecewise-constant Mumford-Shah segmentation problem and the active contour model (see, for instance, [8,10,25,26]). We can indicate the following principle features of this statement:
● The problem is investigated in the space of $ L^1(\Omega) $-functions with bounded anisotropic total variation and with additional pointwise constraints, where the matrix of anisotropy is closely related to the structure of the image $ f $, which is involved in the segmentation procedure.
As a result, the anisotropic perimeter of the region $ \left\{x\in\Omega: l_j < \varphi(x) \right\} $, which is given by the term $ \int_{\Omega} |M^f D\chi_{\{\varphi(x) > l_j\}}| $, can drastically differ from the standard one because the natural edges of the original image $ f $ can affect it significantly;
● To find a piecewise-constant approximation of the given image $ f\in\mathcal{F}_\gamma $ in the form
$ u(x)=c0χ{φ(x)<l1}(x)+m−1∑j=1cj[χ{φ(x)>lj}(x)−χ{φ(x)>lj+1}(x)]+cmχ{φ(x)>lm}(x), $
|
(4.8) |
we utilize the Jeffreys divergence between two elements $ f, g\in L^2(\Omega) $ instead of the standard $ L^2 $-norm of their difference $ \|f-g\|_{L^2(\Omega)} $. In spite of the fact that the trick of replacing the squared Euclidean norm with the Jeffreys distance does not alleviate the original problem from the point of view of its solvability and mathematical analysis, it makes the segmentation results more stable to the Poisson noise contaminated images (see [12,13,14] for the details);
● We consider the segmentation problems (4.6) and (4.7) as a constrained minimization problem in $ BV(\Omega) $ space with the pointwise constraints $ l_0\le \varphi(x)\le l_{m+1} $ on the set of feasible functions $ \varphi:\Omega\rightarrow \mathbb{R} $.
● In the proposed statement of the segmentation problems (4.6) and (4.7), it is admitted that the set
$ K = \Omega\setminus \Big[ \cup_{j = 0}^m \left\{x\in\Omega : l_j < \varphi(x) < l_{j+1}\right\} \Big] $ |
may have a nonzero $ \mathcal{L}^2 $-measure.
However, the existence of minimizers to the problems (4.6) and (4.7) seems to be an open issue nowadays because the standard application of the direct method of calculus of variation to this problem faces some unsolved challenges. To apply the direct method for proving the existence of minimizers, it is necessary to find a topology for which the functional (4.6) is lower semicontinuous while ensuring compactness of minimizing sequences. In view of the structure of the set of admissible solutions $ \Xi\subset \mathbb{R}^{m+1}\times L^1(\Omega) $ (see Eq (4.7)), the natural topology, in this case, is the product of the norm topology in $ \mathbb{R}^{m+1} $ and the weak-$ \ast $ topology in $ BV(\Omega) $. However, the objective functional $ J:\Xi\rightarrow \mathbb{R} $ is not coercive and lower semicontinuous on $ \mathbb{R}^{m+1}\times L^1(\Omega) $ with respect to the above mentioned topology. Moreover, even if $ \varphi_k\rightarrow \varphi $ strongly in $ L^1(\Omega) $ as $ k\to\infty $, it does not imply the strong convergence in $ L^1(\Omega) $ of $ \chi_{\{\varphi_k(x) > l_j\}} $ to $ \chi_{\{\varphi(x) > l_j\}} $. In particular, it is clear that the implication
$ \left[\lim\limits_{k\to\infty}\|\varphi_k-\varphi\|_{L^1(\Omega)} = 0\right]\quad\Longrightarrow\quad\left[\lim\limits_{k\to\infty}\|\chi_{\{\varphi_k(x) > l_j\}}-\chi_{\{\varphi(x) > l_j\}}\|_{L^1(\Omega)} = 0\right] $ |
may hold true if only $ \mathcal{L}^2\left\{x\in\Omega:\varphi(x) = l_j\right\} = 0 $.
To overcome this difficulty, we make use of the following family of two-parametric approximated problems:
$ Jε,τ(c,φ)=∫Ω(f−c0)log(fc0)[χAτ0]εdx+m−1∑j=1∫Ω(f−cj)log(fcj)([χAτj]ε−[χAτj+1]ε)dx+∫Ω(f−cm)log(fcm)[χAτm]εdx+1ε[‖Ψ(l0−φ)‖L1(Ω)+‖Ψ(φ−lm+1)‖L1(Ω)]+ε|MfDφ|(Ω)+αm∑j=1∫Ω2ε|MfD[χEτj]ε|→infφ∈Ξε, $
|
(4.9) |
where $ \tau $ and $ {{\varepsilon}} $ are small parameters, which vary within strictly decreasing sequences of positive numbers converging to $ 0 $. The functions $ \left[\chi_{E_j^\tau}\right]_{{\varepsilon}}(\cdot)\in C^\infty(\mathbb{R}) $, $ j = 1, \dots, m $, are defined in Eq (3.7),
$ Aτ0={x∈Ω:φ(x)≤l1−τ},Aτj={x∈Ω:φ(x)≥lj+τ}, j=1,…,m,Eτj={x∈Ω:φ(x)≥lj−τ}, j=1,…,m,} $
|
(4.10) |
$ Ω2ε={x∈R2:dist(x,Ω)≤2ε}, $
|
(4.11) |
$ Ψ(z)={zδ,if z≥0,0,if z<0,}with a given exponent δ∈(1,2), $
|
(4.12) |
and
$ Ξε={(c,φ)∈Rm+1×BV(Ω)|c=(c0,c1,…,cm), cj≥0, j=0,…,m}. $
|
(4.13) |
Thus, a pair $ (c, \varphi) $ sounds as feasible to the problems (4.6) and (4.7) if $ (c, \varphi)\in\Xi_{{\varepsilon}} $, i.e.,
$ c∈Cad:={c=(c0,c1,…,cm), cj≥0, j=0,…,m},φ∈Gad:=BV(Ω). $
|
Before proceeding further, we make use of the following property.
Lemma 9. Let $ f\in\mathcal{F}_\gamma $ be a given image, let $ c = (c_0, c_1, \dots, c_m) $ with $ c_j > 0 $, let $ \{\varphi_k\}_{k = 1}^\infty\subset L^1(\Omega) $ be a strongly convergent sequence, and let $ \varphi\in L^1(\Omega) $ be its limit. Then, for each $ j = 1, \dots, m-1 $,
$ limk→∞∫Ω(f−cj)log(fcj)([χ{φk(x)≥lj+τ}]ε−[χ{φk(x)≥lj+1+τ}]ε)dx=∫Ω(f−cj)log(fcj)([χ{φ(x)≥lj+τ}]ε−[χ{φ(x)≥lj+1+τ}]ε)dx. $
|
(4.14) |
Proof. Since $ L^1(\Omega)\ni\varphi_k\rightarrow \varphi $ as $ k\to\infty $, we may suppose that, up to a subsequence, $ \varphi_k(x)\rightarrow \varphi(x) $ almost everywhere in $ \Omega $. Then, taking into account the continuity of the function $ \left[\chi_{E}\right]_{{\varepsilon}}(\cdot) $ and the fact that the passage in the inequality $ \varphi_k(x)\ge l_j+\tau $ leads to $ \varphi(x)\ge l_j+\tau $ for a.e $ x\in\Omega) $, we see that
$ \left[\chi_{\{\varphi_k(x)\ge l_j+\tau\}}\right]_{{\varepsilon}}\rightarrow \left[\chi_{\{\varphi(x)\ge l_j+\tau\}}\right]_{{\varepsilon}}\quad\text{ a.e. in }\ \Omega. $ |
Moreover, since $ \left|\left[\chi_{\{\varphi_k(x)\ge l_j+\tau\}}\right]_{{\varepsilon}}(x)\right|\le 1 $ in $ \Omega $, it follows from the Lebesgue dominated theorem that
$ \left[\chi_{\{\varphi_k(x)\ge l_j+\tau\}}\right]_{{\varepsilon}}(\cdot)\rightarrow \left[\chi_{\{\varphi(x)\ge l_j+\tau\}}\right]_{{\varepsilon}}(\cdot) \quad\text{strongly in }\ L^1(\Omega). $ |
Since $ f\in\mathcal{F}_\gamma $ and $ c = (c_0, c_1, \dots, c_m) $ with $ c_j > 0 $, it follows that the integrant $ \left(f-c_{j}\right)\log\left(\frac{f}{c_{j}}\right) $ is dominated by some integrable function $ g $ in the sense that
$ \left|\left(f(x)-c_{j}\right)\log\left(\frac{f(x)}{c_{j}}\right) \right| \le g(x)\quad \text{ a.e. in }\ \Omega. $ |
In particular,
$ 0≤(f(x)−cj)log(f(x)cj)([χ{φk(x)≥lj+τ}]ε−[χ{φk(x)≥lj+1+τ}]ε)≤(f(x)−cj)log(f(x)cj)≤{|f(x)2−c2j|cj,a.e. in Ω∩{f(x)>cj},|logγcj||f(x)−cj|,a.e. in Ω∩{f(x)≤cj},}≤{1cj‖f‖2L∞(Ω)+cj,a.e. in Ω∩{f(x)>cj},|logγcj|(‖f‖L∞(Ω)+cj),a.e. in Ω∩{f(x)≤cj}.} $
|
As a result, Eq (4.14) is a direct consequence of the Lebesgue dominated theorem. Arguing similarly, we can establish the same assertion for the first and the third terms in Eq (4.9).
To conclude this section, we give an existence result for the parametrized optimization problems (4.9)–(4.13).
Theorem 10. Let $ f\in\mathcal{F}(\gamma) $ be a given gray scale image, and let
$ {M^f = \left[I-\eta^2\theta(x)\otimes\theta(x)\right], } $ |
where the vector field $ \theta\in C^\infty(\Omega; \mathbb{R}^{2}) $ is defined by the rule (4.2). Then, for each $ {{\varepsilon}}\in(0, 1) $ and $ \tau > 0 $ small enough, the constrained minimization problems (4.9)–(4.13) admit at least one solution.
Proof. Since the objective functional is bounded from below on $ \Xi_{{\varepsilon}}\subset \mathbb{R}^{m+1}\times BV(\Omega) $, it follows that there exists a minimizing sequence to problems (4.9)–(4.13), i.e.,
$ inf(c,φ)∈ΞεJε,τ(c,φ)=limk→∞Jε,τ(ck,φk)≤C<+∞, $
|
(4.15) |
where $ C $ stands for a strictly positive constant that can be different from line to line. Without loss of generality, we can suppose that $ C = \zeta+1 $.
Taking into account the fact that
$ ‖φk‖Lδ(Ω)=(∫{φk<l0}|φk|δdx+∫{l0≤φk<lm+1}|φk|δdx+∫{φk>lm+1}|φk|δdx)1δ≤(∫{φk<l0}|φk|δdx)1δ+(∫{φk>lm+1}|φk|δdx)1δ+max{|l0|,|lm+1|}|Ω|1δ $
|
and
$ ∫{φk<l0}|φk|δdx≤2δ−1‖Ψ(l0−φk)‖L1(Ω)+2δ−1|l0|δ|Ω|,∫{φk>lm+1}|φk|δdx≤2δ−1‖Ψ(φk−lm+1)‖L1(Ω)+2δ−1|lm+1|δ|Ω|, $
|
we see that
$ ‖φk‖δLδ(Ω)≤22δ−2‖Ψ(l0−φk)‖L1(Ω)+22δ−2‖Ψ(φk−lm+1)‖L1(Ω)+22δ−23(|l0|+|lm+1|)δ|Ω|≤ε22δ−2Jε,τ(ck,φk)+22δ−23(|l0|+|lm+1|)δ|Ω|. $
|
(4.16) |
Since
$ \|\varphi_k\|^{\delta}_{L^1(\Omega)}\le \|\varphi_k\|^{\delta}_{L^\delta(\Omega)} |\Omega|^{\delta-1}, $ |
and the function $ K:\mathbb{R}_{\ge}\rightarrow \mathbb{R}_{\ge} $, given by $ K(z) = \left(f-z\right) \log\left(\frac{f}{z}\right) $, is locally continuous and coercive, i.e.,
$ limz→+∞K(z)z=+∞,limz→+0K(z)z=+∞, $
|
(4.17) |
it follows from Eqs (4.16) and (4.17) that there exists a constant $ C^\ast(\zeta, K, m, |\Omega|, l_i, \delta) > 0 $ such that
$ m∑j=0c2j,k+‖φk‖δL1(Ω)+|Dφk|(Ω)≤C∗+ε22δ−2Jε,τ(ck,φk)|Ω|δ−1+1εJε,τ(ck,φk)≤C∗+ε22δ−2(ζ+1)|Ω|δ−1+1ε(ζ+1),∀k∈N. $
|
Hence, the sequence $ \left\{(c_k, \varphi_k)\right\}_{k\in\mathbb{N}} $ is bounded in $ \mathbb{R}^{m+1}\times BV(\Omega) $. Then, from the compactness property in $ BV $-space and the fact that $ BV(\Omega) $ is compactly embedded in $ L^\delta(\Omega) $, we can deduce the existence of a subsequence of $ \left\{(c_k, \varphi_k)\right\}_{k\in\mathbb{N}} $, that we denote in the same way, and a pair $ (c^0, \varphi^0)\in \mathbb{R}^{m+1}\times BV(\Omega) $ such that
$ ck→c0 in Rm+1,φk→φ0 strongly in Lδ(Ω), $
|
(4.18) |
$ φk(x)→φ0(x)almost everywhere in Ω, $
|
(4.19) |
$ Dφk∗⇀Dφ0 weakly−∗in M(Ω;R2). $
|
(4.20) |
Without loss of generality, we can suppose that each of the functions $ \left\{\varphi_k\right\}_{k\in\mathbb{N}} $ and $ \varphi^0 $ is extended by zero outside of $ \Omega $, and $ \varphi_k(x)\rightarrow \varphi^0(x) $ for a.e. $ x\in \Omega_{2{{\varepsilon}}} $. Then, in view of the standard properties of mollifiers, we have
$ \left\{\left[\chi_{\{\varphi_k(x)\ge l\}}\right]_{{\varepsilon}}\right\}_{k\in\mathbb{N}}\subset BV(\Omega_{2{{\varepsilon}}}) $ |
and
$ \left[\chi_{\{\varphi_k(x)\ge l\}}\right]_{{\varepsilon}}\rightarrow \chi_{\{\varphi^0(x)\ge l\}}\ \text{ in } L^1(\Omega_{2{{\varepsilon}}})\; {\rm{as}} \; k\to \infty . $ |
Then, property (3.4) implies that
$ ∫Ω2ε|MfDχ{φ0(x)≥l}|≤lim infk→∞∫Ω2ε|MfD[χ{φk(x)≥l}]ε|, $
|
(4.21) |
$ limk→∞|MfDφk|(Ω)by (4.18), (4.20)≤|MfDφ0|(Ω). $
|
(4.22) |
Besides, in view of the properties (4.18)–(4.20) and the fact that $ \Psi\in C^1_{loc}(\mathbb{R}) $, we have the pointwise convergence
$ Ψ(l0−φk)(x)→Ψ(l0−φ0)(x)a.e. in Ω,Ψ(φk−lm+1)(x)→Ψ(φ0−lm+1)(x)a.e. in Ω. $
|
Since
$ \Psi(l_0-\varphi_k)\le |l_0-\varphi_k|^\delta\quad\text{and}\quad \Psi(\varphi_k-l_{m+1})\le |\varphi_k-l_{m+1}|^\delta\quad\text{a.e. in }\ \Omega, $ |
and the sequences $ \left\{|l_0-\varphi_k|^\delta\right\}_{k\in\mathbb{N}} $ and $ \left\{|\varphi_k-l_{m+1}|^\delta\right\}_{k\in\mathbb{N}} $ are bounded in $ L^p(\Omega) $ with $ p = 2/\delta > 1 $ (by the continuous embedding $ BV(\Omega)\hookrightarrow L^2(\Omega) $), it follows from Vitali's lemma that
$ \Psi(l_0-\varphi_k)\rightarrow \Psi(l_0-\varphi^0)\ \text{and}\ \Psi(\varphi_k-l_{m+1})\rightarrow \Psi(\varphi^0-l_{m+1})\ \text{ in } L^r(\Omega)\ \forall 1\le r < p . $ |
Hence,
$ limk→∞[‖Ψ(l0−φk)‖L1(Ω)+‖Ψ(φk−lm+1)‖L1(Ω)]=‖Ψ(l0−φ0)‖L1(Ω)+‖Ψ(φ0−lm+1)‖L1(Ω). $
|
(4.23) |
As a result, the lower semicontinuity property of the minimizing sequence
$ inf(c,φ)∈ΞεJε,τ(c,φ)=limk→∞Jε,τ(ck,φk)≥lim infk→∞Jε,τ(ck,φk)≥Jε,τ(c0,φ0). $
|
(4.24) |
is a direct consequence of relations (4.21)–(4.23) and Lemma 9.
It remains to notice that due to the pointwise convergence (4.18), we have
$ c^0 = (c^0_0, c^0_1, \dots, c^0_m)\ \text{ with }\ c^0_j\ge 0\ \text{ for all } j = 0, \dots, m . $ |
Hence, the limit pair $ (c^0, \varphi^0) $ is a feasible solution, i.e., $ (c^0, \varphi^0)\in \Xi_{{\varepsilon}} $, and, therefore,
$ J_{{\varepsilon}}(c^0, \varphi^0) = \inf\limits_{(c, \varphi)\in \Xi_{{\varepsilon}}} J_{{\varepsilon}}(c, \varphi)\le C < +\infty. $ |
Thus, $ (c^0, \varphi^0) $ is a minimizer to the problems (4.9)–(4.13).
This section aims to derive some optimality conditions for the minimization problems (4.9)–(4.13). Let $ {{\varepsilon}} $ and $ \tau $ be given small positive values. With that in mind, we study the differentiability properties of the objective functional $ J_{{{\varepsilon}}, \tau}(c, \varphi) $ in order to specify its local behavior in the immediate vicinity of its minimum point. The corresponding Euler-Lagrange system is presented in Theorem 11.
Let $ (c^0_{{{\varepsilon}}, \tau}, \varphi^0_{{{\varepsilon}}, \tau})\in \Xi_{{\varepsilon}} $ be a local minimizer to problems (4.9)–(4.13). Then,
$ c^0_{{{\varepsilon}}, \tau} = (c^0_{{{\varepsilon}}, \tau, 0}, c^0_{{{\varepsilon}}, \tau, 1}, \dots, c^0_{{{\varepsilon}}, \tau, m}) $ |
with $ c^0_{{{\varepsilon}}, \tau, j}\ge 0 $ for each $ j\in\left\{0, \dots, m\right\} $ and $ \varphi^0_{{{\varepsilon}}, \tau}\in \mathcal{G}_{ad} $. In the objective functional $ J_{{{\varepsilon}}, \tau} $, we distinguish three terms
$ J_{{{\varepsilon}}, \tau}(c, \varphi) = F_{{{\varepsilon}}, \tau}(c_0, \dots, c_{m}, \varphi)+\Phi_{{\varepsilon}}(\varphi)+j_{{{\varepsilon}}, \tau} (\varphi) $ |
with
$ Fε,τ(c0,…,cm,φ)=∫Ω(f−c0)log(fc0)[χAτ0]εdx+m−1∑j=1∫Ω(f−cj)log(fcj)([χAτj]ε−[χAτj+1]ε)dx+∫Ω(f−cm)log(fcm)[χAτm]εdx,Φε(φ)=1ε[‖Ψ(l0−φ)‖L1(Ω)+‖Ψ(φ−lm+1)‖L1(Ω)],jε,τ(φ)=ε|MfDφ|(Ω)+αm∑j=1∫Ω2ε|MfD[χEτj]ε|. $
|
From the differentiability of $ \left(f-c_{i}\right)\log\left(\frac{f}{c_i}\right) $ and $ \left[\chi_{A_j^\tau}\right]_{{\varepsilon}} $, it is immediate that the functional $ F_{{{\varepsilon}}, \tau} $ is of the class $ C^1 $. Hence, there exist linear continuous functionals
$ DφFε,τ(c0ε,τ,φ0ε,τ):Rm+1×BV(Ω)→RandDcjFε,τ(c0ε,τ,φ0ε,τ):Rm+1×BV(Ω)→R,j=0,…,m $
|
such that
$ Fε,τ(c0ε,φ0ε,τ+λh)=Fε,τ(c0ε,τ,φ0ε,τ)+λDφFε,τ(c0ε,τ,φ0ε,τ)[h]+r(h,λ),Fε,τ(c0ε,τ+λμej,φ0ε,τ)=Fε(c0ε,τ,φ0ε,τ)+λDcjFε,τ(c0ε,τ,φ0ε,τ)[μej]+rj(μ,λ),j=0,…,m, $
|
for any $ h\in BV(\Omega) $ and $ \mu\in\mathbb{R} $, where $ |r(h, \lambda)| = o(|\lambda|) $ and $ |r_j(\mu, \lambda)| = o(|\lambda|) $ as $ \lambda\to 0 $, and
$ e_j = (0, \dots, \underbrace{1}_{\text{j-th slot}}, \dots, 0)^t\in\mathbb{R}^{m+1}. $ |
Moreover, making use of the following representations
$ \chi_{A_0^\tau} = H(l_1-\tau-\varphi)\quad\text{and}\quad \chi_{A_j^\tau} = H(\varphi-l_j-\tau), \ j = 1, \dots.m $ |
with $ H(z) = \left\{1, \ z\ge 0\atop 0, \ z < 0\right\} $, we have
$ \left[\chi_{A_0^\tau}\right]_{{\varepsilon}} = H_{{\varepsilon}}(l_1-\tau-\varphi)\quad\text{and}\quad \left[\chi_{A_j^\tau}\right]_{{\varepsilon}} = H_{{\varepsilon}}(\varphi-l_j-\tau), \ j = 1, \dots.m, $ |
where $ H_{{\varepsilon}}(z) = \left[\eta_{{\varepsilon}} \ast \chi_E\right](z) $ stands for the smooth approximation of $ H(z) $ through the mollification.
Then, direct calculations show that
$ DφFε,τ(c,φ)[h]=−∫Ω(f−c0)log(fc0)H′ε(l1−τ−φ)hdx,+m−1∑j=1∫Ω(f−cj)log(fcj)[H′ε(φ−lj−τ)−H′ε(φ−lj+1−τ)]hdx,+∫Ω(f−cm)log(fcm)H′ε(φ−lm−τ)hdx, $
|
(5.1) |
$ DcjFε,τ(c,φ)[μej]=−μcj∫ΩfΛj(φ)dx−μ∫Ωlog(f)Λj(φ)dx+μcj∫ΩΛj(φ)dx+μ∫ΩΛj(φ)dx,j=0,1,…,m $
|
(5.2) |
with
$ Λ0(φ)=Hε(l1−φ−τ),Λm(φ)=Hε(φ−lm−τ),Λj(φ)=Hε(φ−lj−τ)−Hε(φ−lj+1−τ),j=1,…,m−1. $
|
Since $ f\in \mathcal{F}_\gamma $, it follows from Eq (5.2) that the unique solution of the system
$ DcjFε,τ(c0ε,τ,φ0ε,τ)[μej]=0,j=0,…,m $
|
(5.3) |
can be expressed as
$ {c0ε,τ,0=c0(φ0ε,τ)=∫ΩfHε(l1−φ0ε,τ−τ)dx∫ΩHε(l1−φ0ε,τ−τ)dx,c0ε,τ,j=cj(φ0ε,τ)=∫Ωf[Hε(φ0ε,τ−lj−τ)−Hε(φ0ε,τ−lj+1−τ)]dx∫Ω[Hε(φ0ε,τ−lj−τ)−Hε(φ0ε,τ−lj+1−τ)]dx,c0ε,τ,m=cm(φ0ε,τ)=∫ΩfHε(φ0ε,τ−lm−τ)dx∫ΩHε(φ0ε,τ−lm−τ)dx $
|
(5.4) |
with $ c^0_{{{\varepsilon}}, \tau, j}\ge 0 $ for all $ j = 0, \dots, m $, i.e., $ c^0_{{{\varepsilon}}, \tau} \in \mathcal{C}_{ad} $.
Arguing in a similar manner and taking into account that $ \Psi\in C^1_{loc}(\mathbb{R}) $, it can be shown that
$ \Phi_{{\varepsilon}}\left(\varphi^0_{{{\varepsilon}}, \tau}+\lambda h\right) = \Phi_{{\varepsilon}}\left(\varphi^0_{{{\varepsilon}}, \tau}\right)+\lambda D_\varphi\Phi_{{\varepsilon}}\left(\varphi^0_{{{\varepsilon}}, \tau}\right)[h]+r(h, \lambda), $ |
for any $ h\in BV(\Omega) $, where $ |r(h, \lambda)| = o(|\lambda|) $ as $ \lambda\to 0 $, and $ D_\varphi \Phi_{{{\varepsilon}}} (\varphi^0_{{{\varepsilon}}, \tau}): BV(\Omega)\rightarrow \mathbb{R} $ is a linear continuous functional with the following representation:
$ DφΦε(φ0ε,τ)[h]=1ε[−∫ΩΨ′(l0−φ0ε,τ)hdx+∫ΩΨ′(φ0ε,τ−lm+1)hdx]. $
|
(5.5) |
Here,
$ \Psi^\prime(z) = \left\{ δzδ−1,if z≥0,0,if z<0. \right\} $
|
We are now in a position to establish the main result of this section.
Theorem 11. Given $ {{\varepsilon}} > 0 $ and $ \tau > 0 $ small enough, $ f\in \mathcal{F}_\gamma $, $ \alpha > 0 $, and a collection of $ m+2 $ distinct level values $ l_0 < l_1 < \dots < l_m < l_{m+1} $, let $ (c^0_{{{\varepsilon}}, \tau}, \varphi^0_{{{\varepsilon}}, \tau})\in \Xi_{{\varepsilon}} $ be a local minimizer to problems (4.9)–(4.13). Then, the pair $ (c^0_{{{\varepsilon}}, \tau}, \varphi^0_{{{\varepsilon}}, \tau}) $ satisfies the following Euler-Lagrange system:
$ −∫Ω(f−c0ε,τ,0)log(fc0ε,τ,0)H′ε(l1−φ0ε,τ−τ)[φ−φ0ε,τ]dx+m−1∑j=1∫Ω(f−c0ε,τ,j)log(fc0ε,τ,j)[H′ε(φ0ε,τ−lj−τ)−H′ε(φ0ε,τ−lj+1−τ)][φ−φ0ε,τ]dx+∫Ω(f−c0ε,τ,m)log(fc0ε,τ,m)H′ε(φ0ε,τ−lm−τ)[φ−φ0ε,τ]dx+1ε∫Ω[Ψ′(φ0ε,τ−lm+1)−Ψ′(l0−φ0ε,τ)][φ−φ0ε,τ]dx+jε,τ(φ)−jε,τ(φ0ε,τ)≥0,∀φ∈Gad, $
|
(5.6) |
where the constants $ c^0_{{{\varepsilon}}, \tau, 0}, c^0_{{{\varepsilon}}, \tau, 1}, \dots, c^0_{{{\varepsilon}}, \tau, m} $ are defined by the rule (5.4).
Proof. Since $ (c^0_{{{\varepsilon}}, \tau}, \varphi^0_{{{\varepsilon}}, \tau})\in \Xi_{{\varepsilon}} $ is a local minimum point of Eqs (4.9)–(4.13), we have that
$ Jε,τ(c0ε,τ+ρ(μ−(cj)0ε,τ)ej,φ0ε,τ)≥Jε,τ(c0ε,τ,φ0ε,τ), $
|
(5.7) |
$ Jε,τ(c0ε,τ,φ0ε,τ+ρ(φ−φ0ε,τ))≥Jε,τ(c0ε,τ,φ0ε,τ) $
|
(5.8) |
for all $ \rho > 0 $ small enough and any given $ \mu\ge 0 $ and $ \varphi\in \mathcal{G}_{ad} $.
As a result, inequality (5.7) leads to the Eq (5.3), and, hence, to the representation (5.4), whereas Eq (5.8) together with the convexity of $ j_{{\varepsilon}} $ implies
$ 0≤Jε,τ(c0ε,τ,φ0ε,τ+ρ(φ−φ0ε,τ))−Jε,τ(c0ε,τ,φ0ε,τ)ρ=Fε,τ(c0ε,τ,φ0ε,τ+ρ(φ−φ0ε,τ))−Fε,τ(c0ε,τ,φ0ε,τ)ρ+jε,τ(φ0ε,τ+ρ(φ−φ0ε,τ))−jε,τ(φ0ε,τ)ρ+Φε(φ0ε,τ+ρ(φ−φ0ε,τ))−Φε(φ0ε,τ)ρ≤Fε,τ(c0ε,τ,φ0ε,τ+ρ(φ−φ0ε,τ))−Fε,τ(c0ε,τ,φ0ε,τ)ρ+Φε(φ0ε,τ+ρ(φ−φ0ε,τ))−Φε(φ0ε,τ)ρ+jε,τ(φ)−jε,τ(φ0ε,τ). $
|
Now, passing to the limit as $ \rho\to 0 $, we get
$ 0\le D_\varphi F_{{{\varepsilon}}, \tau} (c^0_{{{\varepsilon}}, \tau}, \varphi^0_{{{\varepsilon}}, \tau})[\varphi-\varphi^0_{{{\varepsilon}}, \tau}]+ D_\varphi \Phi_{{{\varepsilon}}} (\varphi^0_{{{\varepsilon}}, \tau})[\varphi-\varphi^0_{{{\varepsilon}}, \tau}]+ j_{{{\varepsilon}}, \tau}(\varphi)- j_{{{\varepsilon}}, \tau} \left(\varphi^0_{{{\varepsilon}}, \tau}\right). $ |
Finally, using the expression of the Gateaux derivatives $ D_\varphi F_{{{\varepsilon}}, \tau} (c^0_{{{\varepsilon}}, \tau}, \varphi^0_{{{\varepsilon}}, \tau}) $ and $ D_\varphi \Phi_{{{\varepsilon}}} (\varphi^0_{{{\varepsilon}}, \tau}) $ given by Eqs (5.1) and (5.5), respectively, we immediately arrive at the optimality system (5.6).
The main question we are going to discuss in this section is to find out whether the convergence of minima of Eq (4.9) is to minima of Eq (4.6) as $ {{\varepsilon}} $ and $ \tau $ tend to zero. To this end, we make use of the basic results of the variational convergence of minimization problems and $ \Gamma $-convergence theory (see, for instance, [27,28,29]). In particular, in Lemmas 12 and 13, we show that the standard properties of $ \Gamma $-limits hold true for the objective functional $ J_{{{\varepsilon}}, \tau} $ with respect to the weak-$ \ast $ topology of $ BV(\Omega) $ space and the pointwise convergence in $ \mathbb{R}^{m+1} $. Utilizing these characteristic features, we establish the main variational property of the proposed approximation procedure (see Theorem 14). Namely, we prove that any sequence of optimal pairs to the approximated problems (4.9)–(4.13) is compact in the weak-$ \ast $-topology of $ \mathbb{R}^{m+1}\times BV(\Omega) $ and each cluster point is a solution of the problem
$ J_\tau(c, \varphi)\rightarrow \inf\limits_{(c, \varphi)\in \Xi\atop \varphi\in BV(\Omega)}, $ |
where the cost functional is defined in Eq (6.6).
We begin with the following noteworthy result.
Lemma 12. Let $ \tau > 0 $ be a given value such that $ \tau\ll 1 $. Let $ \left\{(c_{{\varepsilon}}, \varphi_{{\varepsilon}})\in\Xi_{{\varepsilon}}\right\}_{{{\varepsilon}}\to 0} $ be a sequence of feasible pairs to the approximated problems (4.9)–(4.13), satisfying the conditions
$ supε>0Jε,τ(cε,φε)<+∞andsupε>0‖φε‖BV(Ω)<+∞. $
|
(6.1) |
Then, there exist a subsequence $ \left\{(c_{{{\varepsilon}}_j}, \varphi_{{{\varepsilon}}_j})\right\}_{j = 1}^\infty $ with $ {{\varepsilon}}_j\to 0 $ as $ j\to\infty $, and a pair $ (c, \varphi)\in \mathbb{R}^{m+1}\times BV(\Omega) $ such that
$ cεj→c in Rm+1 asj→∞, $
|
(6.2) |
$ φεj→φ strongly in L1(Ω) as j→∞, $
|
(6.3) |
$ MfDφεj∗⇀MfDφ weakly−∗in M(Ω;R2) as j→∞, $
|
(6.4) |
$ (c,φ)∈ΞandJτ(c,φ)≤lim infj→∞Jεj,τ(cεj,φεj), $
|
(6.5) |
where
$ Jτ(c,φ)=∫Ω(f−c0)log(fc0)χAτ0dx+m−1∑j=1∫Ω(f−cj)log(fcj)[χAτj−χAτj+1]dx+∫Ω(f−cm)log(fcm)χAτmdx+αm∑j=1∫Ω|MfDχEτj|. $
|
(6.6) |
Proof. In view of the initial assumptions (see Eq (6.1)$ _2 $), the sequence $ \left\{(c_{{\varepsilon}}, \varphi_{{\varepsilon}})\right\}_{{{\varepsilon}}\to 0} $ is compact with respect to the product of norm topology of $ \mathbb{R}^{m+1} $ and the weak-$ \ast $ convergence in $ BV(\Omega) $. So, there exists a subsequence $ \left\{(c_{{{\varepsilon}}_j}, \varphi_{{{\varepsilon}}_j})\right\}_{j = 1}^\infty $ with $ {{\varepsilon}}_j\to 0 $ as $ j\to\infty $ and a pair $ (c, \varphi)\in \mathbb{R}^{m+1}\times BV(\Omega) $ such that, in addition to Eqs (6.2) and (6.3), we have
$ φεj(x)→φ(x) a.e. in Ω,Dφεj∗⇀Dφ weakly−∗in M(Ω;R2) as j→∞. $
|
(6.7) |
Since the matrix $ M^f $ is positive-definite on $ \Omega $, property (6.7) implies the weak-$ \ast $ convergence (6.4). Moreover, from Eqs (6.1) and (6.7), we deduce the existence of a constant $ C > 0 $ independent of $ {{\varepsilon}} $ such that
$ supj∈N|MfDφεj|(Ω)≤supj∈N|Dφεj|(Ω)≤C,‖Ψ(l0−φεj)‖L1(Ω)+‖Ψ(φεj−lm+1)‖L1(Ω)≤εjC,∀j∈N. $
|
Hence,
$ limj→∞[εj|MfDφεj|(Ω)]=0, $
|
(6.8) |
$ limj→∞‖Ψ(l0−φεj)‖L1(Ω)=0,limj→∞‖Ψ(φεj−lm+1)‖L1(Ω)=0. $
|
(6.9) |
It means that the limit function $ \varphi $ satisfies the pointwise constraints $ l_0\le \varphi(x)\le l_{m+1} $ a.e. in $ \Omega $. Thus, we see that $ (c, \varphi)\in\Xi $.
It remains to establish the inequality (6.5)$ _2 $. With that in mind, we make use of the pointwise convergence Eqs (6.2), (6.7), and properties (ⅰ) and (ⅱ) of the smoothed characteristic functions $ \left[\chi_E\right]_{{\varepsilon}} $. Then, arguing as in the proof of Lemma 9, we deduce that
$ [χ{φεj(x)≥lk+τ}]εj→χ{φ(x)≥lk+τ} a.e. in Ω,∀k=1,…,m(f(x)−cεj,k)log(f(x)cεj.k)→(f(x)−ck)log(f(x)ck) a.e. in Ω,∀k=0,…,m|(f(x)−cεj,k)log(f(x)cεj,k)|≤g(x) a.e. in Ω, $
|
with some integrable function $ g $. Hence, by the Lebesgue dominated theorem, we have:
$ ∫Ω(f−cεj,0)log(fcεj,0)[χ{φεj(x)≤l1−τ}]εdx+m−1∑k=1∫Ω(f−cεj,k)log(fcεj,k)([χ{φεj(x)≥lk+τ}]ε−[χ{φεj(x)≥lk+1+τ}]ε)dx+∫Ω(f−cεj,m)log(fcεj,m)[χ{φεj(x)≥lm+τ}]εdxj→∞⟶∫Ω(f−c0)log(fc0)χ{φ(x)≤l1−τ}dx+m−1∑j=1∫Ω(f−cj)log(fcj)(χ{φ(x)≥lk+τ}−χ{φ(x)≥lk+1+τ})dx+∫Ω(f−cm)log(fcm)χ{φ(x)≥lm+τ}dx. $
|
(6.10) |
To end the proof, we have to show that
$ lim infj→∞m∑k=1∫Ω2ε|MfD[χ{φεj(x)≥lk−τ}]εj|≥m∑k=1∫Ω|MfDχ{φ(x)≥lk−τ}| $
|
(6.11) |
With that in mind, we notice that
$ +∞by (6.1)>lim infj→∞m∑k=1∫Ω2ε|MfD[χ{φεj(x)≥lk−τ}]εj|≥lim infj→∞m∑k=1∫Ω|MfD[χ{φεj(x)≥lk−τ}]εj|. $
|
(6.12) |
Taking into account that (see Eq (6.1)$ _2 $), all level sets of the functions $ \left\{\varphi_{{{\varepsilon}}_j}\right\}_{j\in\mathbb{N}} $ have a finite perimeters, and we see that $ \chi_{\{\varphi_{{{\varepsilon}}_j}(x)\ge l_k-\tau\}}\in BV(\Omega) $ for each $ k = 1, \dots, m $. Moreover, in view of Eq (6.7), we have
$ χ{φεj(x)≥lk−τ}j→∞⟶χ{φ(x)≥lk−τ} a.e. in Ω and strongly in L1(Ω), $
|
(6.13) |
for each $ k\in\left\{1, \dots, m\right\} $.
Hence,
$ [χ{φεj(x)≥lk−τ}]εjj→∞⟶χ{φ(x)≥lk−τ} a.e. in Ω and strongly in L1(Ω),lim infj→∞m∑k=1∫Ω|MfD[χ{φεj(x)≥lk−τ}]εj|≥m∑k=1∫Ω|MfDχ{φ(x)≥lk−τ}|. $
|
(6.14) |
Together with Eq (6.12), we arrive at the announced inequality (6.11). Thus, the desired property (6.5) is a direct consequence of of relations (6.8)–(6.11).
Lemma 13. For every feasible pair $ (c, \varphi)\in\, \Xi $, there can be found a sequence $ \left\{(\widehat{c}_{{\varepsilon}}, \widehat{\varphi}_{{\varepsilon}})\right\}_{{{\varepsilon}} > 0} $ satisfying the properties
$ (ˆcε,ˆφε)∈Ξεfor ε>0smallenough, $
|
(6.15) |
$ (ˆcε→c in Rm+1,ˆφε∗⇀φ in BV(Ω) asε→0, $
|
(6.16) |
$ Jτ(c,φ)=limε→0Jε,τ(ˆcε,ˆφε). $
|
(6.17) |
Proof. Let $ (c, \varphi)\in\, \Xi $ be a given pair. In view of Theorem 4, we can suppose that $ \varphi\in C^\infty(\Omega) $.
Let $ \left\{c_{{\varepsilon}} \right\}_{{{\varepsilon}} > 0} $ be an arbitrary sequence in $ \mathbb{R}^{m+1} $ such that
$ cε∈Cad∀ε>0andcε→c as ε→0. $
|
(6.18) |
Then, we define the sequence $ \left\{(\widehat{c}_{{\varepsilon}}, \widehat{\varphi}_{{\varepsilon}})\right\}_{{{\varepsilon}} > 0} $ as follows:
$ \widehat{c}_{{\varepsilon}} = c_{{\varepsilon}} \quad\text{and}\quad \widehat{\varphi}_{{\varepsilon}} = \varphi, \quad\forall\, {{\varepsilon}} > 0. $ |
Since $ \varphi\in C^\infty(\Omega) $, it follows that $ \widehat{\varphi}_{{\varepsilon}} $ has a bouded anisotropic total variation and, therefore, $ \left\{(\widehat{c}_{{\varepsilon}}, \widehat{\varphi}_{{\varepsilon}})\in \Xi_{{\varepsilon}}\right)\}_{{{\varepsilon}} > 0} $ are the collection of feasible solutions for the corresponding two-parametric approximated problems (4.9). Then, due to the fact that $ l_0\le \varphi(x)\le l_{m+1} $ in $ \Omega $, we have:
$ ‖Ψ(l0−ˆφε)‖L1(Ω)+‖Ψ(ˆφε−lm+1)‖L1(Ω)=0,∀ε>0, $
|
(6.19) |
$ limε→0|MfDˆφε|(Ω)≤limε→0|Dˆφε|(Ω)=|Dφ|(Ω)<∞. $
|
(6.20) |
Furthermore, arguing as in the proof of Lemma 6 and taking into account the property (ⅰ) of mollifiers, we see that
$ ∫Ω(f−ˆcε,0)log(fˆcε,0)[χ{φ(x)≤l1−τ}]εdx+m−1∑k=1∫Ω(f−ˆcε,k)log(fˆcε,k)([χ{φ(x)≥lk+τ}]ε−[χ{φ(x)≥lk+1+τ}]ε)dx+∫Ω(f−ˆcε,m)log(fˆcε,m)[χ{φ(x)≥lm+τ}]εdxj→∞⟶∫Ω(f−c0)log(fc0)χ{φ(x)≤l1−τ}dx+m−1∑j=1∫Ω(f−cj)log(fcj)(χ{φ(x)≥lk+τ}−χ{φ(x)≥lk+1+τ})dx+∫Ω(f−cm)log(fcm)χ{φ(x)≥lm+τ}dx. $
|
(6.21) |
Thus, in view of Eqs (6.19)–(6.21), in order to deduce the Eq (6.17), it remains to show that
$ limε→0m∑j=1∫Ω2ε|MfD[χ{φ(x)≥lj−τ}]ε|=m∑j=1∫Ω|MfDχ{φ(x)≥lj−τ}|. $
|
(6.22) |
Observing that (see Lemma 5)
$ limε→0∫Ω2ε|MfD[χ{φ(x)≥lj−τ}]ε|=∫¯Ω|MfDχ{φ(x)≥lj−τ}|=∫Ω|MfDχ{φ(x)≥lj−τ}|+∫∂Ω|MfDχ{φ(x)≥lj−τ}|, $
|
(6.23) |
where the last term in Eq (6.23) is equal to zero because $ \{x\in\Omega : \varphi(x)\ge l_j-\tau\} $ is a closed subset of $ \Omega $.
As a result, we obtain
$ \lim\limits_{{{\varepsilon}}\to 0} \int_{\Omega_{2{{\varepsilon}}}} \left|M^f D\left[\chi_{\{\varphi(x)\ge l_j-\tau\}}\right]_{{\varepsilon}}\right| = \int_{\Omega} \left|M^f D \chi_{\{\varphi(x)\ge l_j-\tau\}} \right|, \quad\forall\, j = 1, \dots, m. $ |
This concludes the proof.
We are now in a position to state the main result of this section.
Theorem 14. Let $ \tau\ll 1 $ be a given positive value. Let $ \left\{(c^0_{{{\varepsilon}}, \tau}, \varphi^0_{{{\varepsilon}}, \tau})\in\Xi_{{\varepsilon}}\right\}_{{{\varepsilon}}\to 0} $ be a sequence of optimal pairs to the approximated minimization problems (4.9)–(4.13). Assume that the sequence $ \left\{\varphi^0_{{{\varepsilon}}, \tau}\right\}_{{{\varepsilon}} > 0} $ is bounded in $ BV(\Omega) $, and the relaxed problem
$ Jτ(c,φ)→inf(c,φ)∈Ξφ∈BV(Ω) $
|
(6.24) |
has a nonempty set of minimizers for the given value $ \tau > 0 $. Then, there exists a pair $ (c^\ast_\tau, \varphi^\ast_\tau)\in\Xi $ such that, up to a subsequence,
$ c0ε,τ→c∗τ in Rm+1 asε→0, $
|
(6.25) |
$ φ0ε,τ→φ∗τ strongly in L1(Ω), $
|
(6.26) |
$ MfDφ0ε,τ∗⇀MfDφ∗τ weakly−∗in M(Ω;R2), $
|
(6.27) |
$ inf(c,φ)∈ΞJτ(c,φ)=Jτ(c∗τ,φ∗τ)=limε→0Jε,τ(c0ε,τ,φ0ε,τ)=limε→0inf(c,φ)∈ΞεJε,τ(c,φ), $
|
(6.28) |
where the objective functional $ J_\tau:\Xi\to\mathbb{R} $ is defined in Eq (6.6).
Proof. First, we observe that a given sequence of minimizers for approximating problems (4.9)–(4.13) is compact with respect to the convergences (6.25)–(6.27). Indeed, for an arbitrary test function $ \widehat{\varphi}\in C^\infty_c(\mathbb{R}^2) $ and arbitrary vector $ \widehat{c}\in\mathbb{R}^{m+1} $ with positive components, we have:
$ \widehat{c}\in \mathcal{C}_{ad}\quad\text{and}\quad \widehat{\varphi}\in BV(\Omega), \quad\forall\, {{\varepsilon}} > 0. $ |
Let's assume that, in addition, the function $ \widehat{\varphi} $ satisfies the pointwise constraints $ l_{0}\le\widehat{\varphi}(x)\le l_{m+1} $ in $ \Omega $. Then, $ (\widehat{c}, \widehat{\varphi})\in\Xi_{{{\varepsilon}}} $ for each $ {{\varepsilon}} > 0 $, and, therefore,
$ J_{{{\varepsilon}}, \tau}(c^0_{{{\varepsilon}}, \tau}, \varphi^0_{{{\varepsilon}}, \tau}) = \inf\limits_{(c, \varphi)\in \Xi_{{{\varepsilon}}}}J_{{{\varepsilon}}, \tau}(c, \varphi)\le J_{{{\varepsilon}}, \tau}(\widehat{c}, \widehat{\varphi})\le\sup\limits_{{{\varepsilon}} > 0}J_{{{\varepsilon}}, \tau}(\widehat{c}, \widehat{\varphi})\le C < +\infty\quad\forall\, {{\varepsilon}} > 0. $ |
Hence,
$ supε>0Jε,τ(c0ε,τ,φ0ε,τ)<+∞andsupε>0‖φ0ε,τ‖BV(Ω)<+∞. $
|
(6.29) |
Thus, for the sequence of minimizers $ \left\{\left(c^0_{{{\varepsilon}}, \tau}, \varphi^0_{{{\varepsilon}}, \tau}\right)\in \Xi_{{{\varepsilon}}}\right\}_{{{\varepsilon}} > 0} $, all preconditions of Lemma 12 are fulfilled. Therefore, there exist a subsequence $ \left\{(c^0_{{{\varepsilon}}_k, \tau}, \varphi^0_{{{\varepsilon}}_k, \tau})\in \Xi_{{{\varepsilon}}_k}\right\}_{k\in\mathbb{N}} $ of the sequence $ \left\{(c^0_{{{\varepsilon}}, \tau}, \varphi^0_{{{\varepsilon}}, \tau})\in\Xi_{{\varepsilon}}\right\}_{{{\varepsilon}}\to 0} $ and a pair $ (c^\ast_\tau, \varphi^\ast_\tau)\in\Xi $, such that $ (c^0_{{{\varepsilon}}_k, \tau}, \varphi^0_{{{\varepsilon}}_k, \tau})\rightarrow (c^\ast_\tau, \varphi^\ast_\tau) $ is the sense of convergences (6.25)–(6.27) and
$ J_\tau(c^\ast_\tau, \varphi^\ast_\tau))\le\liminf\limits_{k \to \infty} J_{{{\varepsilon}}_k, \tau}(c^0_{{{\varepsilon}}_k, \tau}, \varphi^0_{{{\varepsilon}}_k, \tau}). $ |
From this, we deduce that
$ lim infk→∞inf(c,φ)∈ΞekJεk,τ(c,φ)=lim infk→∞Jεk,τ(c0εk,τ,φ0εk,τ)≥Jτ(c∗τ,φ∗τ)≥inf(c,φ)∈Ξφ∈BV(Ω)Jτ(c,φ)=Jτ(c0τ,φ0τ), $
|
(6.30) |
where $ (c^0_\tau, \varphi^0_\tau) $ is a minimizer for the relaxed problem (6.24).
On the other hand, Lemma 13 implies the existence of a realizing sequence $ \left\{(\widehat{c}_{{\varepsilon}}, \widehat{\varphi}_{{\varepsilon}})\right\}_{{{\varepsilon}} > 0} $ such that $ (\widehat{c}_{{\varepsilon}}, \widehat{\varphi}_{{\varepsilon}})\rightarrow (c^0_\tau, \varphi^0_\tau) $ as $ {{\varepsilon}}\to 0 $ in the sense of relations (6.16), and
$ J_\tau(c^0_\tau, \varphi^0_\tau) = \lim\limits_{{{\varepsilon}}\to 0} J_{{{\varepsilon}}, \tau}(\widehat{c}_{{\varepsilon}}, \widehat{\varphi}_{{\varepsilon}}). $ |
Utilizing this fact, we get
$ inf(c,φ)∈Ξφ∈BV(Ω)Jτ(c,φ)=Jτ(c0τ,φ0τ)=lim supε→0Jε,τ(ˆcε,ˆφε)≥lim supε→0inf(c,φ)∈ΞεJε,τ(c,φ)≥lim supk→∞inf(c,φ)∈ΞεkJεk,τ(c,φ)=lim supk→∞Jεk,τ(c0εk,τ,φ0εk,τ). $
|
(6.31) |
From this and Eq (6.30), we deduce that
$ \liminf\limits_{k\to\infty} J_{{{\varepsilon}}_k}(c^0_{{{\varepsilon}}_k, \tau}, \varphi^0_{{{\varepsilon}}_k, \tau}) \ge \limsup\limits_{k\to\infty}J_{{{\varepsilon}}_k, \tau}(c^0_{{{\varepsilon}}_k, \tau}, \varphi^0_{{{\varepsilon}}_k, \tau}). $ |
As a result, we have
$ Jτ(c0τ,φ0τ)=Jτ(c∗τ,φ∗τ)=inf(c,φ)∈Ξφ∈BV(Ω)Jτ((c,φ))=limk→∞inf(c,φ)∈ΞεkJεk,τ(c,φ). $
|
(6.32) |
Using these relations and the fact that the problem (6.24) is solvable, we may suppose that
$ (c^\ast_\tau, \varphi^\ast_\tau) = (c^0_\tau, \varphi^0_\tau). $ |
Since Eq (6.32) holds for all subsequences of $ \left\{(c^0_{{{\varepsilon}}, \tau}, \varphi^0_{{{\varepsilon}}, \tau})\in\Xi_{{\varepsilon}}\right\}_{{{\varepsilon}}\to 0} $, which are convergent in the sense of relations (6.25)–(6.27), it follows that these limits coincide and, therefore, $ (c^0_\tau, \varphi^0_\tau) $ is the limit of the whole sequence $ \left\{(c^0_{{{\varepsilon}}, \tau}, \varphi^0_{{{\varepsilon}}, \tau})\right\}_{{{\varepsilon}} > 0} $. Then, using the same argument for the entire sequence of minimizers, we finally obtain
$ lim infε→0inf(c,φ)∈ΞεJε,τ(c,φ)=lim infε→0Jε,τ(c0ε,τ,φ0ε,τ)≥Jτ(c0τ,φ0τ)≥inf(c,φ)∈Ξφ∈BV(Ω)Jτ(c,φ)=limε→0Jε,τ(c0ε,τ,φ0ε,τ)≥lim supε→0inf(c,φ)∈ΞεJε,τ(c,φ)=lim supε→0Jε,τ(c0ε,τ,φ0ε,τ), $
|
and this concludes the proof.
To illustrate the implementation of the proposed optimization problem (4.6) to the domain decomposition that corresponds to the homogeneity zones of a given function $ f:\Omega\rightarrow \mathbb{R} $, we provided numerical experiences with images that have been delivered by satellite Sentinel-2. As input data, we have used an image over the Dnipro area, Ukraine, with a resolution of 10m/pixel (see the left panel in Figure 1). This region represents a typical agricultural area with medium-sized fields of various shapes. As follows from the picture given in Figure 1 (see also the corresponding histogram in Figure 2), the observed data suffer from noise and blurs. So, at the first step, we have realized the denoising and debluring procedure (see the right panel in Figure 1) following the variational approach that has been recently proposed in [30]. As Figure 2 indicates, the histogram of the smoothed image has a strongly marked compactly localized spectrum that can be considered as a "good option" for its piecewise constant approximation. To conduct the numerical simulations of the segmentation procedure for the given area, we have set $ f(x) = u_2(x) $ in $ \Omega $, where $ u_2 $ stands for the intensity of the de-blurred image (see Figure 1) in the green spectral channel, and
$ m = 4, \ l_0 = -5000, \ l_1 = 0, \ l_2 = 1000, \ l_3 = 2000, \ l_4 = 300, \ l_m = 5000. $ |
In accordance with the results of Section 5, we have to solve the system (5.6) and find its solution $ \varphi^0_e $ for the corresponding function $ f $. Since, in practical implementations, it is reasonable to define the solution of the problem (5.6) using a "gradient descent" strategy, we started with some initial level-set function $ \varphi_0\in C(\Omega) $ and passed to the corresponding initial-boundary value problem for quasi-linear parabolic equations with Neumann boundary conditions. For numerical simulations, we set $ {{\varepsilon}} = 0.01 $, $ \tau = 10 $, $ \alpha = 1 $, $ \sigma = 3 $, $ \eta = 0.95 $, and the initial level set function $ \varphi_0\in C(\Omega) $ was defined as follows:
$ \varphi_0(x) = \left\{ +d(x,S),x∈insideS,0,x∈S,−d(x,S),x∈outsideS, \right. $
|
where $ S $ is a circle of radius $ 20 $ with a center at a central point of $ \Omega $, and $ d(x, S) $ denotes the Euclidean distance from the point $ x\in\Omega $ to the circle $ S $. We report the level sets in Figures 3 and 4.
All the authors conceived the idea, designed the methodology and the main proofs. All the authors contributed equally in the writing of the article.
The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.
Peter Kogut thanks the support of University of Salerno.
The authors declare there is no conflict of interest.
[1] |
Krishnamurthy H, Gouaux E (2012) X-ray structures of LeuT in substrate-free outward-open and apo inward-open states. Nature 481: 469–474. doi: 10.1038/nature10737
![]() |
[2] | Sanders CR, Sonnichsen F (2006) Solution NMR of membrane proteins: practice and challenges. Magn Reson Chem 44 Spec No: S24–40. |
[3] |
Horst R, Stanczak P, Stevens RC, et al. (2013) beta2-Adrenergic receptor solutions for structural biology analyzed with microscale NMR diffusion measurements. Angew Chem Int Ed Engl 52: 331–335. doi: 10.1002/anie.201205474
![]() |
[4] |
Chun E, Thompson AA, Liu W, et al. (2012) Fusion partner toolchest for the stabilization and crystallization of G protein-coupled receptors. Structure 20: 967–976. doi: 10.1016/j.str.2012.04.010
![]() |
[5] |
Baker LA, Baldus M (2014) Characterization of membrane protein function by solid-state NMR spectroscopy. Curr Opin Struct Biol 27: 48–55. doi: 10.1016/j.sbi.2014.03.009
![]() |
[6] |
Hopkins AL, Groom CR (2002) The druggable genome. Nat Rev Drug Discov 1: 727–730. doi: 10.1038/nrd892
![]() |
[7] |
Overington JP, Al-Lazikani B, Hopkins AL (2006) How many drug targets are there? Nat Rev Drug Discov 5: 993–996. doi: 10.1038/nrd2199
![]() |
[8] |
Bakheet TM, Doig AJ (2009) Properties and identification of human protein drug targets. Bioinformatics 25: 451–457. doi: 10.1093/bioinformatics/btp002
![]() |
[9] |
Maslennikov I, Choe S (2013) Advances in NMR structures of integral membrane proteins. Curr Opin Struct Biol 23: 555–562. doi: 10.1016/j.sbi.2013.05.002
![]() |
[10] |
Klammt C, Maslennikov I, Bayrhuber M, et al. (2012) Facile backbone structure determination of human membrane proteins by NMR spectroscopy. Nat Methods 9: 834–839. doi: 10.1038/nmeth.2033
![]() |
[11] |
Berman HM, Battistuz T, Bhat TN, et al. (2002) The Protein Data Bank. Acta Crystallogr D Biol Crystallogr 58: 899–907. doi: 10.1107/S0907444902003451
![]() |
[12] |
Berman HM, Westbrook J, Feng Z, et al. (2000) The Protein Data Bank. Nucleic Acids Res 28: 235–242. doi: 10.1093/nar/28.1.235
![]() |
[13] |
Tang M, Comellas G, Rienstra CM (2013) Advanced solid-state NMR approaches for structure determination of membrane proteins and amyloid fibrils. Acc Chem Res 46: 2080–2088. doi: 10.1021/ar4000168
![]() |
[14] |
Ni QZ, Daviso E, Can TV, et al. (2013) High frequency dynamic nuclear polarization. Acc Chem Res 46: 1933–1941. doi: 10.1021/ar300348n
![]() |
[15] |
Zou P, McHaourab HS (2010) Increased sensitivity and extended range of distance measurements in spin-labeled membrane proteins: Q-band double electron-electron resonance and nanoscale bilayers. Biophys J 98: L18–20. doi: 10.1016/j.bpj.2009.12.4193
![]() |
[16] |
Mchaourab HS, Steed PR, Kazmier K (2011) Toward the fourth dimension of membrane protein structure: insight into dynamics from spin-labeling EPR spectroscopy. Structure 19: 1549–1561. doi: 10.1016/j.str.2011.10.009
![]() |
[17] |
Tusnady GE, Dosztanyi Z, Simon I (2004) Transmembrane proteins in the Protein Data Bank: identification and classification. Bioinformatics 20: 2964–2972. doi: 10.1093/bioinformatics/bth340
![]() |
[18] | Mchaourab HS, Lietzow MA, Hideg K, et al. (1996) Motion of spin-labeled side chains in T4 lysozyme. Correlation with protein structure and dynamics. Biochemistry 35: 7692–7704. |
[19] |
Hubbell WL, McHaourab HS, Altenbach C, et al. (1996) Watching proteins move using site-directed spin labeling. Structure 4: 779–783. doi: 10.1016/S0969-2126(96)00085-8
![]() |
[20] |
Fanucci GE, Cafiso DS (2006) Recent advances and applications of site-directed spin labeling. Curr Opin Struct Biol 16: 644–653. doi: 10.1016/j.sbi.2006.08.008
![]() |
[21] | Weierstall U, James D, Wang C, et al. (2014) Lipidic cubic phase injector facilitates membrane protein serial femtosecond crystallography. Nat Commun 5: 3309. |
[22] |
Liu W, Wacker D, Gati C, et al. (2013) Serial femtosecond crystallography of G protein-coupled receptors. Science 342: 1521–1524. doi: 10.1126/science.1244142
![]() |
[23] | Li D, Boland C, Walsh K, et al. (2012) Use of a robot for high-throughput crystallization of membrane proteins in lipidic mesophases. J Vis Exp: e4000. |
[24] | Li D, Boland C, Aragao D, et al. (2012) Harvesting and cryo-cooling crystals of membrane proteins grown in lipidic mesophases for structure determination by macromolecular crystallography. J Vis Exp: e4001. |
[25] |
Alexander N, Al-Mestarihi A, Bortolus M, et al. (2008) De novo high-resolution protein structure determination from sparse spin-labeling EPR data. Structure 16: 181–195. doi: 10.1016/j.str.2007.11.015
![]() |
[26] |
Hirst SJ, Alexander N, Mchaourab HS, et al. (2011) RosettaEPR: an integrated tool for protein structure determination from sparse EPR data. J Struct Biol 173: 506–514. doi: 10.1016/j.jsb.2010.10.013
![]() |
[27] |
Islam SM, Stein RA, McHaourab HS, et al. (2013) Structural refinement from restrained-ensemble simulations based on EPR/DEER data: application to T4 lysozyme. J Phys Chem B 117: 4740–4754. doi: 10.1021/jp311723a
![]() |
[28] |
Fischer AW, Alexander NS, Woetzel N, et al. (2015) BCL::MP-Fold: Membrane protein structure prediction guided by EPR restraints. Proteins 83: 1947–1962. doi: 10.1002/prot.24801
![]() |
[29] |
Jeschke G, Chechik V, Ionita P, et al. (2006) DeerAnalysis2006 - a comprehensive software package for analyzing pulsed ELDOR data. Appl Magn Reson 30: 473–498. doi: 10.1007/BF03166213
![]() |
[30] |
Hagelueken G, Ward R, Naismith JH, et al. (2012) MtsslWizard: In Silico Spin-Labeling and Generation of Distance Distributions in PyMOL. Appl Magn Reson 42: 377–391. doi: 10.1007/s00723-012-0314-0
![]() |
[31] |
Beasley KN, Sutch BT, Hatmal MM, et al. (2015) Computer Modeling of Spin Labels: NASNOX, PRONOX, and ALLNOX. Methods Enzymol 563: 569–593. doi: 10.1016/bs.mie.2015.07.021
![]() |
[32] |
Hatmal MM, Li Y, Hegde BG, et al. (2012) Computer modeling of nitroxide spin labels on proteins. Biopolymers 97: 35–44. doi: 10.1002/bip.21699
![]() |
[33] |
Alexander NS, Stein RA, Koteiche HA, et al. (2013) RosettaEPR: Rotamer Library for Spin Label Structure and Dynamics. PLoS One 8: e72851. doi: 10.1371/journal.pone.0072851
![]() |
[34] |
Sali A, Blundell TL (1993) Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol 234: 779–815. doi: 10.1006/jmbi.1993.1626
![]() |
[35] |
Fiser A, Sali A (2003) Modeller: generation and refinement of homology-based protein structure models. Methods Enzymol 374: 461–491. doi: 10.1016/S0076-6879(03)74020-8
![]() |
[36] |
Webb B, Sali A (2014) Protein structure modeling with MODELLER. Methods Mol Biol 1137: 1–15. doi: 10.1007/978-1-4939-0366-5_1
![]() |
[37] | Webb B, Sali A (2014) Comparative protein structure modeling using MODELLER. Curr Protoc Bioinformatics 47: 5.6.1–5.6.32. |
[38] |
Rohl CA, Strauss CEM, Chivian D, et al. (2004) Modeling structurally variable regions in homologous proteins with rosetta. Proteins 55: 656–677. doi: 10.1002/prot.10629
![]() |
[39] |
Misura KM, Chivian D, Rohl CA, et al. (2006) Physically realistic homology models built with ROSETTA can be more accurate than their templates. Proc Natl Acad Sci U S A 103: 5361–5366. doi: 10.1073/pnas.0509355103
![]() |
[40] |
Schwede T, Kopp J, Guex N, et al. (2003) SWISS-MODEL: An automated protein homology-modeling server. Nucleic Acids Res 31: 3381–3385. doi: 10.1093/nar/gkg520
![]() |
[41] | Zhang Y (2009) I-TASSER: fully automated protein structure prediction in CASP8. Proteins 77 Suppl 9: 100–113. |
[42] |
Zhang Y (2008) I-TASSER server for protein 3D structure prediction. BMC Bioinformatics 9: 40. doi: 10.1186/1471-2105-9-40
![]() |
[43] |
Roy A, Kucukural A, Zhang Y (2010) I-TASSER: a unified platform for automated protein structure and function prediction. Nat Protoc 5: 725–738. doi: 10.1038/nprot.2010.5
![]() |
[44] |
Combs SA, Deluca SL, Deluca SH, et al. (2013) Small-molecule ligand docking into comparative models with Rosetta. Nat Protoc 8: 1277–1298. doi: 10.1038/nprot.2013.074
![]() |
[45] | Stevens RC, Cherezov V, Katritch V, et al. (2013) The GPCR Network: a large-scale collaboration to determine human GPCR structure and function. Nat Rev Drug Discov 12: 25–34. |
[46] |
Kroeze WK, Sheffler DJ, Roth BL (2003) G-protein-coupled receptors at a glance. J Cell Sci 116: 4867–4869. doi: 10.1242/jcs.00902
![]() |
[47] |
Yamashita A, Singh SK, Kawate T, et al. (2005) Crystal structure of a bacterial homologue of Na+/Cl--dependent neurotransmitter transporters. Nature 437: 215–223. doi: 10.1038/nature03978
![]() |
[48] |
Faham S, Watanabe A, Besserer GM, et al. (2008) The crystal structure of a sodium galactose transporter reveals mechanistic insights into Na+/sugar symport. Science 321: 810–814. doi: 10.1126/science.1160406
![]() |
[49] |
Perez C, Koshy C, Yildiz O, et al. (2012) Alternating-access mechanism in conformationally asymmetric trimers of the betaine transporter BetP. Nature 490: 126–130. doi: 10.1038/nature11403
![]() |
[50] |
Ma D, Lu P, Yan C, et al. (2012) Structure and mechanism of a glutamate-GABA antiporter. Nature 483: 632–636. doi: 10.1038/nature10917
![]() |
[51] |
Kazmier K, Sharma S, Quick M, et al. (2014) Conformational dynamics of ligand-dependent alternating access in LeuT. Nat Struct Mol Biol 21: 472–479. doi: 10.1038/nsmb.2816
![]() |
[52] |
Gregory KJ, Nguyen ED, Reiff SD, et al. (2013) Probing the metabotropic glutamate receptor 5 (mGlu(5)) positive allosteric modulator (PAM) binding pocket: discovery of point mutations that engender a "molecular switch" in PAM pharmacology. Mol Pharmacol 83: 991–1006. doi: 10.1124/mol.112.083949
![]() |
[53] | Yarov-Yarovoy V, Schonbrun J, Baker D (2006) Multipass membrane protein structure prediction using Rosetta. Proteins 62: 1010–1025. |
[54] |
Barth P, Schonbrun J, Baker D (2007) Toward high-resolution prediction and design of transmembrane helical protein structures. Proc Natl Acad Sci U S A 104: 15682–15687. doi: 10.1073/pnas.0702515104
![]() |
[55] |
Barth P, Wallner B, Baker D (2009) Prediction of membrane protein structures with complex topologies using limited constraints. Proc Natl Acad Sci U S A 106: 1409–1414. doi: 10.1073/pnas.0808323106
![]() |
[56] |
Nugent T, Jones DT (2012) Accurate de novo structure prediction of large transmembrane protein domains using fragment-assembly and correlated mutation analysis. Proc Natl Acad Sci U S A 109: E1540–1547. doi: 10.1073/pnas.1120036109
![]() |
[57] |
Nugent T, Jones DT (2013) Membrane protein orientation and refinement using a knowledge-based statistical potential. BMC Bioinformatics 14: 276. doi: 10.1186/1471-2105-14-276
![]() |
[58] |
Hopf TA, Colwell LJ, Sheridan R, et al. (2012) Three-dimensional structures of membrane proteins from genomic sequencing. Cell 149: 1607–1621. doi: 10.1016/j.cell.2012.04.012
![]() |
[59] |
Weiner BE, Woetzel N, Karakas M, et al. (2013) BCL::MP-fold: folding membrane proteins through assembly of transmembrane helices. Structure 21: 1107–1117. doi: 10.1016/j.str.2013.04.022
![]() |
[60] |
Simons KT, Kooperberg C, Huang E, et al. (1997) Assembly of Protein Tertiary Structures from Fragments with Similar Local Sequences using Simulated Annealing and Bayesian Scoring Functions. J Mol Biol 268: 209–225. doi: 10.1006/jmbi.1997.0959
![]() |
[61] |
Rohl CA, Strauss CE, Misura KM, et al. (2004) Protein structure prediction using Rosetta. Methods Enzymol 383: 66–93. doi: 10.1016/S0076-6879(04)83004-0
![]() |
[62] |
Kazmier K, Alexander NS, Meiler J, et al. (2011) Algorithm for selection of optimized EPR distance restraints for de novo protein structure determination. J Struct Biol 173: 549–557. doi: 10.1016/j.jsb.2010.11.003
![]() |
[63] |
Dimaio F, Leaver-Fay A, Bradley P, et al. (2011) Modeling symmetric macromolecular structures in rosetta3. PLoS One 6: e20450. doi: 10.1371/journal.pone.0020450
![]() |
[64] |
Leaver-Fay A, Tyka M, Lewis SM, et al. (2011) ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules. Methods Enzymol 487: 545–574. doi: 10.1016/B978-0-12-381270-4.00019-6
![]() |
[65] |
Metropolis N, Rosenbluth A, Rosenbluth M, et al. (1953) Equations of state calculations by fast computing machines. J Chem Phys 21: 1087–1091. doi: 10.1063/1.1699114
![]() |
[66] |
Metropolis NU, Ulam S (1949) The Monte Carlo Method. J Am Stat Assoc 44: 335–341. doi: 10.1080/01621459.1949.10483310
![]() |
[67] |
Perozo E, Cortes DM, Cuello LG (1999) Structural rearrangements underlying K+-channel activation gating. Science 285: 73–78. doi: 10.1126/science.285.5424.73
![]() |
[68] |
Liu YS, Sompornpisut P, Perozo E (2001) Structure of the KcsA channel intracellular gate in the open state. Nat Struct Biol 8: 883–887. doi: 10.1038/nsb1001-883
![]() |
[69] |
Zou P, Mchaourab HS (2009) Alternating Access of the Putative Substrate-Binding Chamber in the ABC Transporter MsbA. J Mol Biol 393: 574–585. doi: 10.1016/j.jmb.2009.08.051
![]() |
[70] |
Altenbach C, Cai K, Klein-Seetharaman J, et al. (2001) Structure and function in rhodopsin: mapping light-dependent changes in distance between residue 65 in helix TM1 and residues in the sequence 306-319 at the cytoplasmic end of helix TM7 and in helix H8. Biochemistry 40: 15483–15492. doi: 10.1021/bi011546g
![]() |
[71] |
Viklund H, Elofsson A (2008) OCTOPUS: improving topology prediction by two-track ANN-based preference scores and an extended topological grammar. Bioinformatics 24: 1662–1668. doi: 10.1093/bioinformatics/btn221
![]() |
[72] |
Adamian L, Liang J (2006) Prediction of transmembrane helix orientation in polytopic membrane proteins. BMC Struct Biol 6: 13. doi: 10.1186/1472-6807-6-13
![]() |
[73] | Okada T, Sugihara M, Bondar AN, et al. (2004) The retinal conformation and its environment in rhodopsin in light of a new 2.2 A crystal structure. J Mol Biol 342: 571–583. |
[74] |
Misura KM, Baker D (2005) Progress and challenges in high-resolution refinement of protein structure models. Proteins 59: 15–29. doi: 10.1002/prot.20376
![]() |
[75] |
Tyka MD, Jung K, Baker D (2012) Efficient sampling of protein conformational space using fast loop building and batch minimization on highly parallel computers. J Comput Chem 33: 2483–2491. doi: 10.1002/jcc.23069
![]() |
[76] |
Woetzel N, Karakas M, Staritzbichler R, et al. (2012) BCL::Score--knowledge based energy potentials for ranking protein models represented by idealized secondary structure elements. PLoS One 7: e49242. doi: 10.1371/journal.pone.0049242
![]() |
[77] |
Karakas M, Woetzel N, Staritzbichler R, et al. (2012) BCL::Fold--de novo prediction of complex and large protein topologies by assembly of secondary structure elements. PLoS One 7: e49240. doi: 10.1371/journal.pone.0049240
![]() |