ECA-TFUnet: A U-shaped CNN-Transformer network with efficient channel attention for organ segmentation in anatomical sectional images of canines

Yunling Liu; Yaxiong Liu; Jingsong Li; Yaoxing Chen; Fengjuan Xu; Yifa Xu; Jing Cao; Yuntao Ma; Yunling Liu; Yaxiong Liu; Jingsong Li; Yaoxing Chen; Fengjuan Xu; Yifa Xu; Jing Cao; Yuntao Ma

doi:10.3934/mbe.2023827

Mathematical Biosciences and Engineering

2023, Volume 20, Issue 10: 18650-18669. doi: 10.3934/mbe.2023827

Previous Article Next Article

Research article Special Issues

ECA-TFUnet: A U-shaped CNN-Transformer network with efficient channel attention for organ segmentation in anatomical sectional images of canines

1.
College of Information and Electrical Engineering, China Agricultural University, Beijing 100083, China
2.
College of Veterinary Medicine, China Agricultural University, Beijing 100193, China
3.
Animal and Plant Disease Prevention and Control Center, Chaoyang District, Beijing 100016, China
4.
Shandong Digihuman Technology Co., Ltd, Shandong 250100, China
5.
College of Land Science and Technology, China Agricultural University, Beijing 100193, China

Academic Editor: Yudong Zhang

Received: 12 July 2023 Revised: 13 September 2023 Accepted: 18 September 2023 Published: 07 October 2023

Automated organ segmentation in anatomical sectional images of canines is crucial for clinical applications and the study of sectional anatomy. The manual delineation of organ boundaries by experts is a time-consuming and laborious task. However, semi-automatic segmentation methods have shown low segmentation accuracy. Deep learning-based CNN models lack the ability to establish long-range dependencies, leading to limited segmentation performance. Although Transformer-based models excel at establishing long-range dependencies, they face a limitation in capturing local detail information. To address these challenges, we propose a novel ECA-TFUnet model for organ segmentation in anatomical sectional images of canines. ECA-TFUnet model is a U-shaped CNN-Transformer network with Efficient Channel Attention, which fully combines the strengths of the Unet network and Transformer block. Specifically, The U-Net network is excellent at capturing detailed local information. The Transformer block is equipped in the first skip connection layer of the Unet network to effectively learn the global dependencies of different regions, which improves the representation ability of the model. Additionally, the Efficient Channel Attention Block is introduced to the Unet network to focus on more important channel information, further improving the robustness of the model. Furthermore, the mixed loss strategy is incorporated to alleviate the problem of class imbalance. Experimental results showed that the ECA-TFUnet model yielded 92.63% IoU, outperforming 11 state-of-the-art methods. To comprehensively evaluate the model performance, we also conducted experiments on a public dataset, which achieved 87.93% IoU, still superior to 11 state-of-the-art methods. Finally, we explored the use of a transfer learning strategy to provide good initialization parameters for the ECA-TFUnet model. We demonstrated that the ECA-TFUnet model exhibits superior segmentation performance on anatomical sectional images of canines, which has the potential for application in medical clinical diagnosis.

Keywords:

Citation: Yunling Liu, Yaxiong Liu, Jingsong Li, Yaoxing Chen, Fengjuan Xu, Yifa Xu, Jing Cao, Yuntao Ma. ECA-TFUnet: A U-shaped CNN-Transformer network with efficient channel attention for organ segmentation in anatomical sectional images of canines[J]. Mathematical Biosciences and Engineering, 2023, 20(10): 18650-18669. doi: 10.3934/mbe.2023827

Related Papers:

[1]	Zhiyuan Wang, Chu Zhang, Shaopei Xue, Yinjie Luo, Jun Chen, Wei Wang, Xingchen Yan . Dynamic coordinated strategy for parking guidance in a mixed driving parking lot involving human-driven and autonomous vehicles. Electronic Research Archive, 2024, 32(1): 523-550. doi: 10.3934/era.2024026
[2]	Xiaoying Zheng, Jing Wu, Xiaofeng Li, Junjie Huang . UAV search coverage under priority of important targets based on multi-location domain decomposition. Electronic Research Archive, 2024, 32(4): 2491-2513. doi: 10.3934/era.2024115
[3]	Yu Shen, Hecheng Li . A multi-strategy genetic algorithm for solving multi-point dynamic aggregation problems with priority relationships of tasks. Electronic Research Archive, 2024, 32(1): 445-472. doi: 10.3934/era.2024022
[4]	Sida Lin, Lixia Meng, Jinlong Yuan, Changzhi Wu, An Li, Chongyang Liu, Jun Xie . Sequential adaptive switching time optimization technique for maximum hands-off control problems. Electronic Research Archive, 2024, 32(4): 2229-2250. doi: 10.3934/era.2024101
[5]	Ismail Ben Abdallah, Yassine Bouteraa, Saleh Mobayen, Omar Kahouli, Ali Aloui, Mouldi Ben Amara, Maher JEBALI . Fuzzy logic-based vehicle safety estimation using V2V communications and on-board embedded ROS-based architecture for safe traffic management system in hail city. Electronic Research Archive, 2023, 31(8): 5083-5103. doi: 10.3934/era.2023260
[6]	Jian Gong, Yuan Zhao, Jinde Cao, Wei Huang . Platoon-based collision-free control for connected and automated vehicles at non-signalized intersections. Electronic Research Archive, 2023, 31(4): 2149-2174. doi: 10.3934/era.2023111
[7]	Hao Li, Zhengwu Wang, Shuiwang Chen, Weiyao Xu, Lu Hu, Shuai Huang . Integrated optimization of planning and operation of a shared automated electric vehicle system considering the trip selection and opportunity cost. Electronic Research Archive, 2024, 32(1): 41-71. doi: 10.3934/era.2024003
[8]	Wenjie Wang, Suzhen Wen, Shen Gao, Pengyi Lin . A multi-objective dynamic vehicle routing optimization for fresh product distribution: A case study of Shenzhen. Electronic Research Archive, 2024, 32(4): 2897-2920. doi: 10.3934/era.2024132
[9]	Yineng Ouyang, Zhaotao Liang, Zhihui Ma, Lei Wang, Zhaohua Gong, Jun Xie, Kuikui Gao . A class of constrained optimal control problems arising in an immunotherapy cancer remission process. Electronic Research Archive, 2024, 32(10): 5868-5888. doi: 10.3934/era.2024271
[10]	Michael Barg, Amanda Mangum . Statistical analysis of numerical solutions to constrained phase separation problems. Electronic Research Archive, 2023, 31(1): 229-250. doi: 10.3934/era.2023012

Abstract

1. Introduction

We consider the system of Hamilton-Jacobi equations

$\begin{equation} \begin{cases} \lambda u_1(x)+H_1(Du_1(x))+B_1(u_1(x), u_2(x)) = 0 \ & \text{ in } \mathbb{T}^n, \\ \lambda u_2(x)+H_2(Du_2(x))+B_2(u_1(x), u_2(x)) = 0 \ & \text{ in } \mathbb{T}^n, \end{cases} \end{equation}$

(1.1)

where $\lambda > 0$ is a given constant, the functions $H_i : \mathbb{R}^n\to \mathbb{R}$ and $B_i : \mathbb{R}^2 \to \mathbb{R}$ , with $i = 1, 2$ , are given continuous functions, and $\mathbb{T}^n$ denotes the $n$ -dimensional flat torus $\mathbb{R}^n/ \mathbb{Z}^n$ .

In a recent paper ^[6], the authors have investigated the vanishing discount problem for a nonlinear monotone system of Hamilton-Jacobi equations

$\begin{equation} \begin{cases} \lambda u_1(x)+G_1(x, Du_1(x), u_1(x), u_2(x), \ldots, u_m(x)) = 0 \ & \text{ in } \mathbb{T}^n, \\ \phantom{ \lambda u_1(x)+G_1(x, Du_1(x), u_1(x), u_2(x)}\vdots &\\ \lambda u_m(x)+G_m(x, Du_m(x), u_1(x), u_2(x), \ldots, u_m(x)) = 0 \ & \text{ in } \mathbb{T}^n, \end{cases} \end{equation}$

(1.2)

and established under some hypotheses on the $G_i \in C(\mathbb{T}^n \times \mathbb{R}^n \times \mathbb{R}^m)$ that, when $u_ \lambda = (u_{ \lambda, 1}, \ldots, u_{ \lambda, m})\in C(\mathbb{T}^n)^m$ denoting the (viscosity) solution of (1.2), the whole family $\{u_ \lambda\}_{ \lambda > 0}$ converges in $C(\mathbb{T}^n)^m$ to some $u_0\in C(\mathbb{T}^n)^m$ as $\lambda\to 0+$ . The constant $\lambda > 0$ in the above system is the so-called discount factor.

The hypotheses on the system are the convexity, coercivity, and monotonicity of the $G_i$ as well as the solvability of (1.2), with $\lambda = 0$ . Here the convexity of $G_i$ is meant that the functions $\mathbb{R}^n \times \mathbb{R}^m\ni (p, u)\mapsto G_i(x, p, u)$ are convex. We refer to ^[6] for the precise statement of the hypotheses.

Prior to work ^[6], there have been many contributions to the question about the whole family convergence (in other words, the full convergence) under the vanishing discount, which we refer to ^{[1,3,4,6,8,9,10]} and the references therein.

In the case of the scalar equation, B. Ziliotto ^[11] has recently shown an example of the Hamilton-Jacobi equation having non-convex Hamiltonian in the gradient variable for which the full convergence does not hold. In Ziliotto's approach, the first step is to find a system of two algebraic equations

$\begin{equation} \left\{\begin{aligned} & \lambda u+f(u-v) = 0, \\& \lambda v+g(v-u) = 0, \end{aligned}\right. \end{equation}$

(1.3)

with two unknowns $u, v\in \mathbb{R}$ and with a parameter $\lambda > 0$ as the discount factor, for which the solutions $(u_ \lambda, v_ \lambda)$ stay bounded and fail to fully converge as $\lambda \to 0+$ . Here, an "algebraic" equation is meant not to be a functional equation. The second step is to interpolate the two values $u_ \lambda$ and $v_ \lambda$ to get a function of $x\in \mathbb{T}^1$ which satisfies a scalar non-convex Hamilton-Jacobi equation in $\mathbb{T}^1$ .

In the first step above, Ziliotto constructs $f, g$ based on a game-theoretical and computational argument, and the formula for $f, g$ is of the minimax type and not quite explicit. In ^[5], the author has reexamined the system given by Ziliotto, with a slight generality, as a counterexample for the full convergence in the vanishing discount.

Our purpose in this paper is to present a system (1.3), with an explicit formula for $f, g$ , for which the solution $(u_ \lambda, v_ \lambda)$ does not fully converge to a single point in $\mathbb{R}^2$ . A straightforward consequence is that (1.1), with $B_1(u_1, u_2) = f(u_1-u_2)$ and $B_2(u_1, u_2) = g(u_2-u_1)$ , has a solution given by

$(u_{ \lambda, 1}(x), u_{ \lambda.2}(x)) = (u_ \lambda, v_ \lambda) \ \ \text{ for } x\in \mathbb{T}^n,$

under the assumption that $H_i(x, 0) = 0$ for all $x\in \mathbb{T}^n$ , and therefore, gives an example of a discounted system of Hamilton-Jacobi equations, the solution of which fails to satisfy the full convergence as the discount factor goes to zero.

The paper consists of two sections. This introduction is followed by Section 2, the final section, which is divided into three subsections. The main results are stated in the first subsection of Section 2, the functions $f, g$ , the key elements of (1.3), are contstructed in the second subsection, and the final subsection provides the proof of the main results.

2. A system of algebraic equations and the main results

Our main focus is now the system

$\begin{equation} \begin{cases} \lambda u+f(u-v) = 0&\\ \lambda v+g(v-u) = 0, \end{cases} \end{equation}$

(2.1)

where $f, g\in C(\mathbb{R}, \mathbb{R})$ are nondecreasing functions, to be constructed, and $\lambda > 0$ is a constant, to be sent to zero. Notice that (2.1) above is referred as (1.3) in the previous section.

We remark that, due to the monotonicity assumption on $f, g$ , the mapping $(u, v)\mapsto (f(u-v), g(v-u)), \, \mathbb{R}^2\to \mathbb{R}^2$ is monotone. Recall that, by definition, a mapping $(u, v)\mapsto (B_1(u, v), B_2(u, v)), \, \mathbb{R}^2\to \mathbb{R}^2$ is monotone if, whenever $(u_1, v_1), (u_2, v_2)\in \mathbb{R}^2$ satisfy $u_1-u_2\geq v_1-v_2$ (resp., $v_1-v_2\geq u_1-u_2$ ), we have $B_1(u_1, v_1)\geq B_1(u_2, v_2)$ (resp., $B_2(u_1, v_1)\geq B_2(u_2, v_2)$ ).

2.1. Main results

Our main results are stated as follows.

Theorem 1. There exist two increasing functions $f, g\in C(\mathbb{R}, \mathbb{R})$ having the properties $\textrm{(a)–(c):}$

$\textrm{(a)}$ For any $\lambda > 0$ there exists a unique solution $(u_ \lambda, v_ \lambda)\in \mathbb{R}^2$ to (2.1),

$\textrm{(b)}$ the family of the solutions $(u_ \lambda, v_ \lambda)$ to (2.1), with $\lambda > 0$ , is bounded in $\mathbb{R}^2$ ,

$\textrm{(c)}$ the family $\{(u_ \lambda, v_ \lambda)\}_{ \lambda > 0}$ does not converge as $\lambda\to 0+$ .

It should be noted that, as mentioned in the introduction, the above theorem has been somewhat implicitly established by Ziliotto ^[11]. In this note, we are interested in a simple and easy approach to finding functions $f, g$ having the properties (a)–(c) in Theorem 1.

The following is an immediate consequence of the above theorem.

Corollary 2. Let $H_i\in C(\mathbb{R}^n, \mathbb{R})$ , $i = 1, 2$ , satisfy $H_1(0) = H_2(0) = 0$ . Let $f, g\in C(\mathbb{R}, \mathbb{R})$ be the functions given by Theorem 1, and set $B_1(u_1, u_2) = f(u_1-u_2)$ and $B_2(u_1, u_2) = g(u_2-u_1)$ for all $(u_1, u_2)\in \mathbb{R}^2$ . For any $\lambda > 0$ , let $(u_{ \lambda, 1}, u_{ \lambda, 2})$ be the (viscosity) solution of (1.1). Then, the functions $u_{ \lambda, i}$ are constants, the family of the points $(u_{ \lambda, 1}, u_{ \lambda, 2})$ in $\mathbb{R}^2$ is bounded, and it does not converge as $\lambda\to 0+$ .

Notice that the convexity of $H_i$ in the above corollary is irrelevant, and, for example, one may take $H_i(p) = |p|^2$ for $i\in \mathbb{I}$ , which are convex functions.

We remark that a claim similar to Corollary 2 is valid when one replaces $H_i(p)$ by degenerate elliptic operators $F_i(x, p, M)$ as far as $F_i(x, 0, 0) = 0$ , where $M$ is the variable corresponding to the Hessian matrices of unknown functions. (See ^[2] for an overview on the viscosity solution approach to fully nonlinear degenerate elliptic equations.)

2.2. The functions $f, g$

If $f, g$ are given and $(u, v)\in \mathbb{R}^2$ is a solution of (2.1), then $w: = u-v$ satisfies

$\begin{equation} \lambda w+f(w)-g(-w) = 0. \end{equation}$

(2.2)

Set

$\begin{equation} h(r) = f(r)-g(-r) \ \ \text{ for } r\in \mathbb{R}, \end{equation}$

(2.3)

which defines a continuous and nondecreasing function on $\mathbb{R}$ .

To build a triple of functions $f, g, h$ , we need to find two of them in view of the relation (2.3). We begin by defining function $h$ .

For this, we discuss a simple geometry on $xy$ -plane as depicted in below. Fix $0 < k_1 < k_2$ . The line $y = -\frac 12k_2 +k_1(x+\frac 12)$ has slope $k_1$ and crosses the lines $x = -1$ and $y = k_2 x$ at $\mathrm{P}: = (-1, -\frac 12(k_1+k_2))$ and $\mathrm{Q}: = (-\frac 12, -\frac 12 k_2)$ , respectively, while the line $y = k_2x$ meets the lines $x = -1$ and $x = -\frac 12$ at $\mathrm{R}: = (-1, -k_2)$ and $\mathrm{Q} = (-\frac 12, -\frac 12 k_2)$ , respectively.

Figure 1. Graph of

$\psi$ .

DownLoad: Full-Size Img PowerPoint

Choose $k^* > 0$ so that $\frac 12(k_1+k_2) < k^* < k_2$ . The line $y = k^*x$ crosses the line $y = -\frac 12k_2 +k_1(x+\frac 12)$ at a point $\mathrm{S}: = (x^*, y^*)$ in the open line segment between the points $\mathrm{P} = (-\frac 12, -\frac 12(k_1+k_2))$ and $\mathrm{Q} = (-\frac 12, -\frac 12 k_2)$ . The line connecting $\mathrm{R} = (-1, -k_2)$ and $\mathrm{S} = (x^*, y^*)$ can be represented by $y = -k_2+k^+(x+1)$ , with $k^+: = \frac{y^*+k_2}{x^*+1} > k_2$ .

We set

$\psi(x) = \begin{cases} k_2 x \qquad\qquad \text{ for } x\in (-\infty, -1]\cup[-1/2, \infty), &\\ \min\{-k_2+k^+(x+1), -\frac 12 k_2+k_1(x+\frac 12)\} \ \ \text{ for } x\in(-1, -\frac 12).& \end{cases}$

It is clear that $\psi\in C(\mathbb{R})$ and increasing on $\mathbb{R}$ . The building blocks of the graph $y = \psi(x)$ are three lines whose slopes are $k_1 < k_2 < k^+$ . Hence, if $x_1 > x_2$ , then $\psi(x_1)-\psi(x_2)\geq k_1(x_1-x_2)$ , that is, the function $x\mapsto \psi(x)-k_1 x$ is nondecreasing on $\mathbb{R}$ .

Next, we set for $j\in \mathbb{N}$ ,

$\psi_j(x) = 2^{-j}\psi(2^j x) \ \ \text{ for } x\in \mathbb{R}.$

It is clear that for all $j\in \mathbb{N}$ , $\psi_j\in C(\mathbb{R})$ , the function $x\mapsto \psi_j(x)-k_1 x$ is nondecreasing on $\mathbb{R}$ , and

$\psi_j(x)\begin{cases} > k_2x \ \ & \text{ for all } x\in(-2^{-j}, -2^{-j-1}), \\ = k_2x \ \ & \text{ otherwise}. \end{cases}$

We set

$\eta(x) = \max\limits_{j\in \mathbb{N}}\psi_j(x) \ \ \text{ for } x\in \mathbb{R}.$

It is clear that $\eta\in C(\mathbb{R})$ and $x\mapsto \eta(x)-k_1x$ is nondecreasing on $\mathbb{R}$ . Moreover, we see that

$\eta(x) = k_2 x \ \ \text{ for all } x\in(-\infty, -\tfrac 12]\cup [0, \infty),$

and that if $-2^{-j} < x < -2^{-j-1}$ and $j\in \mathbb{N}$ ,

$\eta(x) = \psi_j(x) > k_2x.$

Note that the point $\mathrm{S} = (x^*, y^*)$ is on the graph $y = \psi(x)$ and, hence, that for any $j\in \mathbb{N}$ , the point $(2^{-j}x^*, 2^{-j}y^*)$ is on the graph $y = \eta(x)$ . Similarly, since the point $\mathrm{S} = (x^*, y^*)$ is on the graph $y = k^*x$ and for any $j\in \mathbb{N}$ , the point $(2^{-j}x^*, 2^{-j}y^*)$ is on the graph $y = k^*x$ . Also, for any $j\in \mathbb{N}$ , the point $(-2^{-j}, -k_2 2^{-j})$ lies on the graphs $y = \eta(x)$ and $y = k_2 x$ .

Fix any $d\geq 1$ and define $h\in C(\mathbb{R})$ by

$h(x) = \eta(x-d).$

For the function $h$ defined above, we consider the problem

$\begin{equation} \lambda z+h(z) = 0. \end{equation}$

(2.4)

Lemma 3. For any $\lambda\geq 0$ , there exists a unique solution $z_ \lambda\in \mathbb{R}$ of (2.4).

Proof. Fix $\lambda\geq 0$ . The function $x\mapsto h(x)+ \lambda x$ is increasing on $\mathbb{R}$ and satisfies

$\lim\limits_{x\to \infty}(h(x)+ \lambda x) = \infty \ \ \text{ and } \ \ \lim\limits_{x\to-\infty}(h(x)+ \lambda x) = -\infty.$

Hence, there is a unique solution of (2.4).

For any $\lambda\geq0$ , we denote by $z_ \lambda$ the unique solution of (2.4). Since $h(d) = 0$ , it is clear that $z_0 = d$ .

For later use, observe that if $\lambda > 0$ , $k > 0$ , and $(z, w)\in \mathbb{R}^2$ is the point of the intersection of two lines $y = - \lambda x$ and $y = k(x-d)$ , then $w = - \lambda z = k(z-d)$ and

$\begin{equation} z = \frac{kd}{k+ \lambda}. \end{equation}$

(2.5)

Lemma 4. There are sequences $\{\mu_j\}$ and $\{\nu_j\}$ of positive numbers converging to zero such that

$z_{\mu_j} = \frac{k_2d}{k_2+\mu_j} \ \ \mathit{\text{and}} \ \ z_{\nu_j} = \frac{k^*d}{k^*+\nu_j}.$

Proof. Let $j\in \mathbb{N}$ . Since $(-2^{-j}, -k_2 2^{-j})$ is on the intersection of the graphs $y = k_2x$ and $y = \eta(x)$ , it follows that $(-2^{-j}+d, -k_2 2^{-j})$ is on the intersection of the graphs $y = k_2(x-d)$ and $y = h(x)$ . Set

$\begin{equation} \mu_j = \frac{k_2 2^{-j}}{d-2^{-j}}, \end{equation}$

(2.6)

and note that $\mu_j > 0$ and that

$-\mu_j(d-2^{-j}) = -k_2 2^{-j},$

which says that the point $(d-2^{-j}, -k_22^{-j})$ is on the line $y = -\mu_j x$ . Combining the above with

$-k_2 2^{-j} = h(d-2^{-j})$

shows that $d-2^{-j}$ is the unique solution of (2.4). Also, since $(d-2^{-j}, -\mu_j(d-2^{-j})) = (d-2^{-j}, -k_2 2^{-j})$ is on the line $y = k_2(x-d)$ , we find by (2.5) that

$z_{\mu_j} = \frac{k_2 d}{k_2+\mu_j}.$

Similarly, since $(2^{-j} x^*, 2^{-j}y^*)$ is on the intersection of the graphs $y = k^*x$ and $y = \eta(x)$ , we deduce that if we set

$\begin{equation} \nu_j: = -\frac{2^{-j}y^*}{d+ 2^{-j}x^*} = \frac{2^{-j}|y^*|}{d-2^{-j}|x^*|}, \end{equation}$

(2.7)

then

$z_{\nu_j} = \frac{k^* d}{k^*+\nu_j}.$

It is obvious by (2.6) and (2.7) that the sequences $\{\mu_j\}_{j\in \mathbb{N}}$ and $\{\nu_j\}_{j\in \mathbb{N}}$ are decreasing and converge to zero.

We fix $k_0\in(0, k_1)$ and define $f, g\in C(\mathbb{R})$ by $f(x) = k_0(x-d)$ and

$g(x) = f(-x)-h(-x).$

It is easily checked that $g(x)-(k_1-k_0)x$ is nondecreasing on $\mathbb{R}$ , which implies that $g$ is increasing on $\mathbb{R}$ , and that $h(x) = f(x)-g(-x)$ for all $x\in \mathbb{R}$ . We note that

$\begin{equation} f(d) = h(d) = g(-d) = 0. \end{equation}$

(2.8)

2.3. Proof of the main results

We fix $f, g, h$ as above, and consider the system (2.1).

Lemma 5. Let $\lambda > 0$ . There exists a unique solution of (2.1).

The validity of the above lemma is well-known, but for the reader's convenience, we provide a proof of the lemma above.

Proof. By choice of $f, g$ , the functions $f, g$ are nondecreasing on $\mathbb{R}$ . We show first the comparison claim: if $(u_1, v_1), (u_2, v_2)\in \mathbb{R}^2$ satisfy

$\begin{align} & \lambda u_1+f(u_1-v_1)\leq 0, \quad \lambda v_1+g(v_1-u_1)\leq 0, \end{align}$

(2.9)

$\begin{align} & \lambda u_2+f(u_2-v_2)\geq 0, \quad \lambda v_2+g(v_2-u_2)\geq 0, \end{align}$

(2.10)

then $u_1\leq u_2$ and $v_1\leq v_2$ . Indeed, contrary to this, we suppose that $\max\{u_1-u_2, v_1-v_2\} > 0$ . For instance, if $\max\{u_1-u_2, v_1-v_2\} = u_1-u_2$ , then we have $u_1-v_1\geq u_2-v_2$ and $u_1 > u_2$ , and moreover

$0\geq \lambda u_1+f(u_1-v_1)\geq \lambda u_1+ f(u_2-v_2) > \lambda u_2+f(u_2-v_2),$

yielding a contradiction. The other case when $\max\{u_1-u_2, v_1-v_2\} = v_1-v_2$ , we find a contradiction, $0 > \lambda v_2+g(v_2-u_2)$ , proving the comparison.

From the comparison claim, the uniqueness of the solutions of (2.1) follows readily.

Next, we may choose a constant $C > 0$ so large that $(u_1, v_1) = (-C, -C)$ and $(u_2, v_2) = (C, C)$ satisfy (2.9) and (2.10), respectively. We write $S$ for the set of all $(u_1, u_2)\in \mathbb{R}^2$ such that (2.9) hold. Note that $(-C, -C)\in S$ and that for any $(u, v)\in S$ , $u\leq C$ and $v\leq C$ . We set

$\begin{aligned} u^*& = \sup\{u \, :\, (u, v)\in S \ \text{ for some }v\}, \\ v^*& = \sup\{v \, :\, (u, v)\in S \ \text{ for some }u\}. \end{aligned}$

It follows that $-C\leq u^*, v^*\leq C$ . We can choose sequences

$\{(u_n^1, v_n^1)\}_{n\in \mathbb{N}}, \, \{(u_n^2, v_n^2)\}_{n\in \mathbb{N}}\subset S$

such that $\{u_n^1\}, \{v_n^2\}$ are nondecreasing,

$\lim\limits_{n\to\infty}u_n^1 = u^* \ \ \text{ and } \ \ \lim\limits_{n\to \infty}v_n^2 = v^*.$

Observe that for all $n\in \mathbb{N}$ , $u_n^2\leq u^*$ , $v_n^1\leq v^*$ , and

$0\geq \lambda u_n^1+f(u_n^1-v_n^1)\geq \lambda u_n^1+f(u_n^1-v^*),$

which yields, in the limit as $n\to\infty$ ,

$0\geq \lambda u^*+f(u^*-v^*).$

Similarly, we obtain $0\geq \lambda v^*+g(v^*-u^*)$ . Hence, we find that $(u^*, v^*)\in S$ .

We claim that $(u^*, v^*)$ is a solution of (2.1). Otherwise, we have

$0 > \lambda u^*+f(u^*-v^*) \ \ \text{ or } \ \ 0 > \lambda v^*+g(v^*-u^*).$

For instance, if the former of the above inequalities holds, we can choose $\varepsilon > 0$ , by the continuity of $f$ , so that

$0 > \lambda (u^*+ \varepsilon)+f(u^*+ \varepsilon-v^*).$

Since $(u^*, v^*)\in S$ , we have

$0\geq \lambda v^*+g(v^*-u^*)\geq \lambda v^*+g(v^*-u^*- \varepsilon).$

Accordingly, we find that $(u^*+ \varepsilon, v^*)\in S$ , which contradicts the definition of $u^*$ . Similarly, if $0 > \lambda v^*+g(v^*-u^*)$ , then we can choose $\delta > 0$ so that $(u^*, v^*+ \delta)\in S$ , which is a contradiction. Thus, we conclude that $(u^*, v^*)$ is a solution of (2.1).

Theorem 6. For any $\lambda > 0$ , let $(u_ \lambda, v_ \lambda)$ denote the unique solution of (2.1). Let $\{\mu_j\}, \{\nu_j\}$ be the sequences of positive numbers from Lemma 2.4. Then

$\lim\limits_{j\to\infty} u_{\mu_j} = \frac {k_0d}{k_2} \ \ \mathit{\text{and}} \ \ \lim\limits_{j\to\infty} u_{\nu_j} = \frac{k_0d}{k^*}.$

In particular,

$\liminf\limits_{ \lambda\to 0}u_ \lambda\leq \frac{k_0d}{k_2} < \frac{k_0 d}{k^*}\leq \limsup\limits_{ \lambda \to 0}u_ \lambda.$

With our choice of $f, g$ , the family of solutions $(u_ \lambda, v_ \lambda)$ of (2.1), with $\lambda > 0$ , does not converge as $\lambda\to 0$ .

Proof. If we set $z_ \lambda = u_ \lambda-v_ \lambda$ , then $z_ \lambda$ satisfies (2.4). By Lemma 4, we find that

$z_{\mu_j} = \frac{k_2d}{k_2+\mu_j} \ \ \text{ and } \ \ z_{\nu_j} = \frac{k^*d}{k^*+\nu_j}.$

Since $u_ \lambda$ satisfies

$0 = \lambda u_ \lambda+f(z_ \lambda) = \lambda u_ \lambda+k_0(z_ \lambda-d),$

we find that

$u_{\mu_j} = -\frac{k_0(z_{\mu_j}-d)}{\mu_j} = -\frac {k_0d}{\mu_j}\left(\frac{k_2}{k_2+\mu_j}-1\right) = -\frac{k_0d}{\mu_j}\frac{-\mu_j}{k_2+\mu_j} = \frac{k_0d}{k_2+\mu_j},$

which shows that

$\lim\limits_{j\to\infty}u_{\mu_j} = \frac{k_0d}{k_2}.$

A parallel computation shows that

$\lim\limits_{j\to\infty}u_{\nu_j} = \frac{k_0d}{k^*}.$

Recalling that $0 < k^* < k_2$ , we conclude that

$\liminf\limits_{ \lambda\to 0}u_ \lambda\leq \frac{k_0d}{k_2} < \frac{k_0 d}{k^*}\leq \limsup\limits_{ \lambda \to 0}u_ \lambda.$

We remark that, since

$\lim\limits_{ \lambda \to 0}z_ \lambda = d \ \ \text{ and } \ \ v_ \lambda = u_ \lambda-z_ \lambda,$

$\lim\limits_{j\to\infty}v_{\mu_j} = \frac{k_0d}{k_2}-d \ \ \text{ and } \ \ \lim\limits_{j\to\infty}v_{\nu_j} = \frac{k_0d}{k^*}-d.$

We give the proof of Theorem 1.

Proof of Theorem 1. Assertions (a) and (c) are consequences of Lemma 5 and Theorem 6, respectively.

Recall (2.8). That is, we have $f(d) = h(d) = g(-d) = 0$ . Setting $(u_2, v_2) = (d, 0)$ , we compute that for any $\lambda > 0$ ,

$\lambda u_2+f(u_2-v_2) > f(d) = 0 \ \ \text{ and } \ \ \lambda v_2 +g(v_2-u_2) = g(-d) = 0.$

By the comparison claim, proved in the proof of Lemma 5, we find that $u_ \lambda \leq d$ and $v_ \lambda\leq 0$ for any $\lambda > 0$ . Similarly, setting $(u_1, v_1) = (0, -d)$ , we find that for any $\lambda > 0$ ,

$\lambda u_1+f(u_1-v_1) = f(d) = 0 \ \ \text{ and } \ \ \lambda v_1+g(v_1-u_1)\leq g(v_1-u_1) = g(-d) = 0,$

which shows by the comparison claim that $u_ \lambda \geq 0$ and $v_ \lambda\geq -d$ for any $\lambda > 0$ . Thus, the sequence $\{(u_ \lambda, v_ \lambda)\}_{ \lambda > 0}$ is bounded in $\mathbb{R}^2$ , which proves assertion (b).

Proof of Corollary 2. For any $\lambda > 0$ , let $(u_ \lambda, v_ \lambda)\in \mathbb{R}^2$ be the unique solution of (2.1). Since $H_1(0) = H_2(0) = 0$ , it is clear that the constant function $(u_{ \lambda, 1}(x), u_{ \lambda, 2}(x)): = (u_ \lambda, v_ \lambda)$ is a classical solution of (1.1). By a classical uniqueness result (see, for instance, [,Theorem 4.7]), $(u_{ \lambda, 1}, u_{ \lambda, 2})$ is a unique viscosity solution of (1.1). The rest of the claims in Corollary 2 is an immediate consequence of Theorem 1.

Some remarks are in order. (ⅰ) Following ^[11], we may use Theorem 6 as the primary cornerstone for building a scalar Hamilton-Jacobi equation, for which the vanishing discount problem fails to have the full convergence as the discount factor goes to zero.

(ⅱ) In the construction of the functions $f, g\in C(\mathbb{R}, \mathbb{R})$ in Theorem 6, the author has chosen $d$ to satisfy $d\geq 1$ , but, in fact, one may choose any $d > 0$ . In the proof, the core step is to find the function $h(x) = f(x)-g(-x)$ , with the properties: (a) the function $x\mapsto h(x)- \varepsilon x$ is nondecreasing on $\mathbb{R}$ for some $\varepsilon > 0$ and (b) the curve $y = h(x)$ , with $x < d$ , meets the lines $y = p(x-d)$ and $y = q(x-d)$ , respectively, at $P_j$ and $Q_j$ for all $j\in \mathbb{N}$ , where $p, q, d$ are positive constants such that $\varepsilon < p < q$ , and the sequences $\{P_j\}_{j\in \mathbb{N}}, \, \{Q_j\}_{j\in \mathbb{N}}$ converge to the point $(d, 0)$ . Obviously, such a function $h$ is never left-differentiable at $x = d$ nor convex in any neighborhood of $x = d$ . Because of this, it seems difficult to select $f, g\in C(\mathbb{R}, \mathbb{R})$ in Theorem 1, both smooth everywhere. In the proof of Theorem 6, we have chosen $\varepsilon = k_0$ , $p = k^*$ , $q = k_2$ , $P_j = (u_{\nu_j}, k^*(u_{\nu_j}-d))$ , and $Q_j = (u_{\mu_j}, k_2(u_{\mu_j}-d))$

Another possible choice of $h$ among many other ways is the following. Define first $\eta \, :\, \mathbb{R}\to \mathbb{R}$ by $\eta(x) = x(\sin(\log|x|)+2)$ if $x\not = 0$ , and $\eta(0) = 0$ (see ). Fix $d > 0$ and set $h(x) = \eta(x-d)$ for $x\in \mathbb{R}$ . we remark that $\eta\in C^\infty(\mathbb{R} \setminus \{0\})$ and $h\in C^\infty(\mathbb{R} \setminus\{d\})$ . Note that if $x\not = 0$ ,

$\eta'(x) = \sin(\log|x|)+\cos(\log|x|)+2\in[2-\sqrt 2, 2+\sqrt 2],$

Figure 2. Graph of

$\eta$ (slightly deformed).

DownLoad: Full-Size Img PowerPoint

and that if we set $x_j = -\exp(-2\pi j)$ and $\xi_j = -\exp\left(-2\pi j+\frac \pi 2\right)$ , $j\in \mathbb{N}$ , then

$\eta(x_j) = 2x_j \ \ \text{ and } \ \ \eta(\xi_j) = 3\xi_j.$

The points $P_j: = (x_j+d, 2x_j)$ are on the intersection of two curves $y = h(x)$ and $y = 2(x-d)$ , while the points $Q_j: = (d+\xi_j, 3\xi_j)$ are on the intersection of $y = h(x)$ and $y = 3(x-d)$ . Moreover, $\lim P_j = \lim Q_j = (d, 0)$ .

Acknowledgments

The author would like to thank the anonymous referees for their careful reading and useful suggestions. He was supported in part by the JSPS Grants KAKENHI No. 16H03948, No. 20K03688, No. 20H01817, and No. 21H00717.

Conflict of interest

The author declares no conflict of interest.

References

[1]	K. Karasawa, M. Oda, T. Kitasaka, K. Misawa, M. Fujiwara, C. W. Chu, et al., Multi-atlas pancreas segmentation: Atlas selection based on vessel structure, Med. Image Anal., 39 (2017), 18–28. https://doi.org/10.1016/j.media.2017.03.006 doi: 10.1016/j.media.2017.03.006
[2]	P. F. Li, P. Liu, C. L. Chen, H. Duan, W. J. Qiao, O. H. Ognami, The 3D reconstructions of female pelvic autonomic nerves and their related organs based on MRI: a first step towards neuronavigation during nerve-sparing radical hysterectomy, Eur. Radiol., 28 (2018), 4561–4569. https://doi.org/10.1007/s00330-018-5453-8 doi: 10.1007/s00330-018-5453-8
[3]	H. S. Park, D. S. Shin, D. H. Cho, Y. W. Jung, J. S. Park, Improved sectioned images and surface models of the whole dog body, Ann. Anat., 196 (2014), 352–359. https://doi.org/10.1016/j.aanat.2014.05.036 doi: 10.1016/j.aanat.2014.05.036
[4]	J. S. Park, Y. W. Jung, Software for browsing sectioned images of a dog body and generating a 3D model, Anat. Rec., 299 (2016), 81–87. https://doi.org/10.1002/ar.23200 doi: 10.1002/ar.23200
[5]	K. Czeibert, G. Baksa, A. Grimm, S. A. Nagy, E. Kubinyi, Ö. Petneházy, MRI, CT and high resolution macro-anatomical images with cryosectioning of a Beagle brain: creating the base of a multimodal imaging atlas, PLoS One, 14 (2019), e0213458. https://doi.org/10.1371/journal.pone.0213458 doi: 10.1371/journal.pone.0213458
[6]	X. Shu, Y. Y. Yang, B. Y. Wu, A neighbor level set framework minimized with the split Bregman method for medical image segmentation, Signal Process., 189 (2021), 108293. https://doi.org/10.1016/j.sigpro.2021.108293 doi: 10.1016/j.sigpro.2021.108293
[7]	X. Shu, Y. Y. Yang, J. Liu, X. J. Chang, B. Y. Wu, ALVLS: Adaptive local variances-Based levelset framework for medical images segmentation, Pattern Recogn., 136 (2023), 109257. https://doi.org/10.1016/j.patcog.2022.109257 doi: 10.1016/j.patcog.2022.109257
[8]	S. K. Zhou, H. Greenspan, C. Davatzikos, J. S. Duncan, B. Van Ginneken, A. Madabhushi, et al., A review of deep learning in medical imaging: Imaging traits, technology trends, case studies with progress highlights, and future promises, Proc. IEEE, 109 (2021), 820–838. https://doi.org/10.1109/JPROC.2021.3054390 doi: 10.1109/JPROC.2021.3054390
[9]	A. Majumdar, L. Brattain, B. Telfer, C. Farris, J. Scalera, Detecting intracranial hemorrhage with deep learning, in 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), IEEE, (2018), 583–587. https://doi.org/10.1109/EMBC.2018.8512336
[10]	J. Long, E. Shelhamer, T. Darrell, Fully convolutional networks for semantic segmentation, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2015), 3431–3440. https://doi.org/10.1109/CVPR.2015.7298965
[11]	G. Huang, Z. Liu, L. Van Der Maaten, K. Q. Weinberger, Densely connected convolutional networks, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2017), 4700–4708. https://doi.org/10.1109/CVPR.2017.243
[12]	L. C. Chen, Y. Zhu, G. Papandreou, F. Schroff, H. Adam, Encoder-decoder with atrous separable convolution for semantic image segmentation, in Proceedings of the European Conference on Computer Vision (ECCV), (2018), 801–818.
[13]	O. Ronneberger, P. Fischer, T. Brox, U-net: Convolutional networks for biomedical image segmentation, in International Conference on Medical Image Computing and Computer-assisted Intervention, Springer, (2015), 234–241. https://doi.org/10.1007/978-3-319-24574-4_28
[14]	D. Schmid, V. B. Scholz, P. R. Kircher, I. E. Lautenschlaeger, Employing deep convolutional neural networks for segmenting the medial retropharyngeal lymph nodes in CT studies of dogs, Vet. Radiol. Ultrasound, 63 (2022), 763–770. https://doi.org/10.1111/vru.13132 doi: 10.1111/vru.13132
[15]	J. Park, B. Choi, J. Ko, J. Chun, I. Park, J. Lee, et al., Deep-learning-based automatic segmentation of head and neck organs for radiation therapy in dogs, Front. Vet. Sci., 8 (2021), 721612. https://doi.org/10.3389/fvets.2021.721612 doi: 10.3389/fvets.2021.721612
[16]	H. Cao, Y. Wang, J. Chen, D. Jiang, X. Zhang, Q. Tian, et al., Swin-unet: Unet-like pure transformer for medical image segmentation, in European Conference on Computer Vision, (2021), 205–218. https://doi.org/10.1007/978-3-031-25066-8_9
[17]	Y. Xu, X. He, G. Xu, G. Qi, K. Yu, L. Yin, et al., A medical image segmentation method based on multi-dimensional statistical features, Front. Neurosci., 16 (2022), 1009581. https://doi.org/10.3389/fnins.2022.1009581 doi: 10.3389/fnins.2022.1009581
[18]	A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, et al., An image is worth 16x16 words: Transformers for image recognition at scale, preprint, arXiv: 2010.11929.
[19]	N. Carion, F. Massa, G. Synnaeve, N. Usunier, A. Kirillov, S. Zagoruyko, End-to-end object detection with transformers, in European Conference on Computer Vision, Springer, (2020), 213–229. https://doi.org/10.1007/978-3-030-58452-8_13
[20]	S. Zheng, J. Lu, H. Zhao, X. Zhu, Z. Luo, Y. Wang, et al., Rethinking semantic segmentation from a sequence-to-sequence perspective with transformers, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, (2021), 6881–6890. https://doi.org/10.1109/CVPR46437.2021.00681
[21]	J. Chen, Y. Lu, Q. Yu, X. Luo, E. Adeli, Y. Wang, et al., Transunet: Transformers make strong encoders for medical image segmentation, preprint, arXiv: 2102.04306.
[22]	B. Li, S. Liu, F. Wu, G. Li, M. Zhong, X. Guan, RT‐Unet: An advanced network based on residual network and transformer for medical image segmentation, Int. J. Intell. Syst., 37 (2022), 8565–8582. https://doi.org/10.1002/int.22956 doi: 10.1002/int.22956
[23]	H. Wang, P. Cao, J. Wang, O. R. Zaiane, Uctransnet: rethinking the skip connections in u-net from a channel-wise perspective with transformer, in Proceedings of the AAAI Conference on Artificial Intelligence, 36 (2022), 2441–2449. https://doi.org/10.1609/aaai.v36i3.20144
[24]	Q. Wang, B. Wu, P. Zhu, P. Li, W. Zuo, Q. Hu, ECA-Net: Efficient channel attention for deep convolutional neural networks, in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), (2020), 11534–11542. https://doi.org/10.1109/CVPR42600.2020.01155
[25]	A. E. Kavur, N. S. Gezer, M. Barış, S. Aslan, P. H. Conze, V. Groza, et al., CHAOS challenge-combined (CT-MR) healthy abdominal organ segmentation, Med. Image Anal. , 69 (2021), 101950. https://doi.org/10.1016/j.media.2020.101950 doi: 10.1016/j.media.2020.101950
[26]	K. He, X. Zhang, S. Ren, J. Sun, Deep residual learning for image recognition, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2016), 770–778.
[27]	A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, et al., Attention is all you need, in Advances in Neural Information Processing Systems, 30 (2017).
[28]	H. Zhao, J. Shi, X. Qi, X. Wang, J. Jia, Pyramid scene parsing network, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2017), 2881–2890. https://doi.org/10.1109/CVPR.2017.660
[29]	J. Fu, J. Liu, H. Tian, Y. Li, Y. Bao, Z. Fang, et al., Dual attention network for scene segmentation, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, (2019), 3146–3154.
[30]	Y. Cao, J. Xu, S. Lin, F. Wei, H. Hu, Gcnet: Non-local networks meet squeeze-excitation networks and beyond, in Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2019. https://doi.org/10.1109/ICCVW.2019.00246
[31]	Y. Yuan, X. Chen, J. Wang, Object-contextual representations for semantic segmentation, in European Conference on Computer Vision, Springer, (2020), 173–190. https://doi.org/10.1007/978-3-030-58539-6_11
[32]	Z. Liu, Y. Lin, Y. Cao, H. Hu, Y. Wei, Z. Zhang, et al., Swin transformer: Hierarchical vision transformer using shifted windows, in Proceedings of the IEEE/CVF International Conference on Computer Vision, (2021), 10012–10022. https://doi.org/10.1109/ICCV48922.2021.00986
[33]	E. Z. Xie, W. H. Wang, Z. D. Yu, A. Anandkumar, J. M. Alvarez, P. Luo, SegFormer: Simple and efficient design for semantic segmentation with transformers, in Advances in Neural Information Processing Systems, 34 (2021), 12077–12090.
[34]	M. D. Alahmadi, Medical image segmentation with learning semantic and global contextual representation, Diagnostics, 12 (2022), 1548. https://doi.org/10.3390/diagnostics12071548 doi: 10.3390/diagnostics12071548
[35]	J. Fang, C. Yang, Y. Shi, N. Wang, Y. Zhao, External attention based TransUNet and label expansion strategy for crack detection, IEEE Trans. Intell. Transp. Syst., 23 (2022), 19054–19063. https://doi.org/10.1109/TITS.2022.3154407 doi: 10.1109/TITS.2022.3154407
[36]	M. H. Guo, C. Z. Lu, Q. Hou, Z. Liu, M. M. Cheng, S. M. Hu, SegNeXt: Rethinking convolutional attention design for semantic segmentation, in Advances in Neural Information Processing Systems, 35 (2022), 1140–1156.
[37]	H. Bao, L. Dong, S. Piao, F. Wei, BEiT: BERT pre-training of image transformers, preprint, arXiv: 2106.08254.

This article has been cited by:

Yuandong Chen, Jinhao Pang, Yuchen Gou, Zhiming Lin, Shaofeng Zheng, Dewang Chen, Research on the A* Algorithm for Automatic Guided Vehicles in Large-Scale Maps, 2024, 14, 2076-3417, 10097, 10.3390/app142210097

Reader Comments

Your name:*

Email:*
© 2023 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)