1. Introduction
This manuscript studies numerical methods for large sparse systems of nonlinear equations of the form
F(u) = 0,  (1.1)
where F: D ⊂ C^n → C^n is a continuously differentiable function. We assume that the Jacobian matrix F′(u) of F can be split as
F′(u) = W(u) + iT(u),  (1.2)
where W(u) and T(u) are both real symmetric matrices, and i = √−1 stands for the imaginary unit. Such nonlinear systems arise in applications in physics, scientific computing and engineering [1,2,3].
To solve systems of the form (1.1), one of the most commonly used methods is Newton's method, i.e.,
u_{k+1} = u_k + s_k, where s_k solves the Newton equation F′(u_k)s_k = −F(u_k), k = 0,1,2,….  (1.3)
However, when the problem size becomes large, each step of the traditional Newton method is expensive. It is well known that the inexact Newton method [4] overcomes this difficulty and improves the iteration efficiency: in each step, only an approximate solution of the Newton equation (1.3) needs to be obtained. In this sense, we trade a little precision for better efficiency. Inexact Newton methods have been widely used in recent decades.
Algorithm 1.1. Inexact Newton method
1. Set an initial guess u0.
2. Set some η_k ∈ [0,1). For k = 0,1,⋯, solve
F′(u_k)s_k = −F(u_k)
to find some s_k which satisfies
‖F(u_k) + F′(u_k)s_k‖ ≤ η_k‖F(u_k)‖.
3. Set u_{k+1} = u_k + s_k.
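As a hedged illustration, the following minimal MATLAB sketch implements Algorithm 1.1 with GMRES as the inner solver; the handle names F, Fprime and the driver itself are our own illustrative choices, not notation from the cited papers. Note that the GMRES relative residual tolerance eta enforces exactly the condition ‖F(u_k) + F′(u_k)s_k‖ ≤ η_k‖F(u_k)‖.

```matlab
% Minimal sketch of the inexact Newton method (Algorithm 1.1).
% F, Fprime: user-supplied handles for F(u) and the Jacobian F'(u).
function u = inexact_newton(F, Fprime, u0, eta, tol, maxit)
    u  = u0;
    r0 = norm(F(u0));                      % reference residual
    for k = 1:maxit
        r = F(u);
        if norm(r) <= tol * r0, break; end
        % Inner solve: ||F(u) + F'(u)*s|| <= eta*||F(u)||, via GMRES
        [s, ~] = gmres(Fprime(u), -r, [], eta);
        u = u + s;                         % outer (Newton) update
    end
end
```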
Inexact Newton methods can be viewed as inner-outer iterative methods. The outer iteration is Newton's method, which generates the sequence {u_k}; the inner iterations are linear iterative methods used inside Newton's method to solve the Newton equations approximately. This inner-outer scheme greatly improves the computational efficiency of the traditional Newton method. In the past few decades, a number of linear iterations, such as the classical splitting methods [5,6,7] and the modern Krylov subspace methods [9], have been utilized inside inexact Newton methods [10,11,12]. For other recent research in this area, see [13,14,15].
In [16], Darvishi and Barati constructed a modified Newton method, which requires only one more evaluation of F per step than Newton's method while attaining an R-order of convergence of at least three. Compared with Newton's method, the modified Newton method improves both the convergence speed and the convergence order. It has also served as the mainstream outer iteration of the previously mentioned iterative schemes in recent years; for examples, see [17,18,19,20,21,22,23]. The modified Newton method reads as follows:
v_k = u_k − F′(u_k)^{−1}F(u_k),
u_{k+1} = v_k − F′(u_k)^{−1}F(v_k),  k = 0,1,2,….
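A hedged MATLAB sketch of this outer iteration is given below; a dense direct solve stands in for whatever inner solver is preferred, and the handle names are illustrative.

```matlab
% Minimal sketch of the modified Newton method of [16]: one Jacobian and
% one extra F-evaluation per step, two substeps sharing F'(u_k).
function u = modified_newton(F, Fprime, u0, tol, maxit)
    u = u0; r0 = norm(F(u0));
    for k = 1:maxit
        if norm(F(u)) <= tol * r0, break; end
        J = Fprime(u);        % evaluated once per step
        v = u - J \ F(u);     % first substep (Newton step)
        u = v - J \ F(v);     % second substep reuses J
    end
end
```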
In this manuscript, we use the Euler-extrapolated Hermitian and skew-Hermitian splitting (EHS) method as the inner solver of the modified Newton method to establish our new method. The rest of this manuscript is structured as follows. In Section 2, we review the EHS method and build the modified Newton-EHS (MN-EHS) method. In Sections 3 and 4, we analyze the local and semilocal convergence properties of the MN-EHS method under the Hölder condition, respectively. Section 5 presents two numerical examples comparing our method with other methods from the earlier literature to demonstrate the computational efficiency of the new iteration scheme. Finally, a short summary is given in Section 6.
2. The MN-EHS method
In [24], Li and Ma proposed the EHS method, which was constructed to solve complex symmetric linear systems of the form Au = b efficiently. In that paper they proposed the Euler-extrapolated technique, which serves as an efficient solver for large sparse linear systems. We briefly review the EHS method [24] below.
Let us consider the complex symmetric linear system
Au = b,  A = W + iT,  (2.1)
where W, T are both symmetric positive semi-definite matrices, b ∈ C^n is a known vector and i = √−1 is the imaginary unit.
The EHS iterative method can be simply represented by the following formula:
where θ ∈ [0, π/2].
Notice that e^{−iθ} = cos(θ) − i sin(θ) according to Euler's formula, and u_0 ∈ C^n is the initial guess.
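To see why the Euler factor is useful, note that multiplying A = W + iT by e^{−iθ} mixes the real and imaginary parts; this identity is elementary algebra added here for illustration, not a result quoted from [24]:
\[
e^{-i\theta}A = (\cos\theta - i\sin\theta)(W + iT) = (\cos\theta\,W + \sin\theta\,T) + i\,(\cos\theta\,T - \sin\theta\,W).
\]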
Remark 2.1. There are some restrictions on the selection of the parameter θ; see the convergence results (Theorems 3.1 and 3.2) in [24]. For more studies of Euler-extrapolated techniques, see [25,26,27].
Now, we intend to establish the MN-EHS method. Suppose that
F′(u) = W(u) + iT(u),
where W(u), T(u) ∈ R^{n×n} can be calculated by using
W(u) = (F′(u) + F′(u)^*)/2,  T(u) = (F′(u) − F′(u)^*)/(2i),
where F′(u)^* denotes the conjugate transpose of F′(u).
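The conjugate-transpose formulas can be checked numerically; the following MATLAB snippet (a toy example, not from the paper) recovers W and T from a complex symmetric test matrix:

```matlab
% Check: for J = W + i*T with W, T real symmetric, J' = W - i*T, so the
% formulas W = (J+J')/2 and T = (J-J')/(2i) recover the two real parts.
rng(1); n = 4;
S1 = randn(n); S2 = randn(n);
W0 = (S1 + S1')/2;  T0 = (S2 + S2')/2;   % random real symmetric parts
J  = W0 + 1i*T0;                          % complex symmetric test matrix
W  = (J + J')/2;  T = (J - J')/(2i);      % J' is the conjugate transpose
assert(norm(W - W0) + norm(T - T0) < 1e-12);
```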
Algorithm 2.1. MN-EHS method
1. Let the initial guess u_0 be given. Set a nonnegative parameter θ ∈ [0, π/2], a positive constant tol and two positive integer sequences {l_k}_{k=0}^∞, {m_k}_{k=0}^∞.
2. For k = 0,1,⋯, until ‖F(u_k)‖ ≤ tol‖F(u_0)‖, do the following:
2.1. Set d_{k,0} = h_{k,0} := 0.
2.2. For l = 0,1,⋯,l_k − 1, apply the EHS method to the linear system F′(u_k)d = −F(u_k) of the form (1.5), and
obtain d_{k,l_k} such that
‖F(u_k) + F′(u_k)d_{k,l_k}‖ ≤ η_k‖F(u_k)‖.
2.3. Set v_k = u_k + d_{k,l_k}.
2.4. Compute F(v_k).
2.5. For m = 0,1,⋯,m_k − 1, apply the EHS method to the linear system F′(u_k)h = −F(v_k) of the form (1.5), and
obtain h_{k,m_k} such that
‖F(v_k) + F′(u_k)h_{k,m_k}‖ ≤ η̃_k‖F(v_k)‖.
2.6. Set u_{k+1} = v_k + h_{k,m_k}.
3. End.
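For orientation, here is a hedged MATLAB skeleton of Algorithm 2.1. The handle ehs_step, standing for one EHS sweep on a linear system A·x = b with parameter θ, is a hypothetical placeholder for the iteration of [24], which is not reproduced here.

```matlab
% Structural sketch of the MN-EHS method (Algorithm 2.1).
% ehs_step(A, b, x, theta) is a hypothetical single EHS sweep for A*x = b.
function u = mn_ehs(F, Fprime, ehs_step, u0, theta, lk, mk, tol, maxit)
    u = u0; r0 = norm(F(u0));
    for k = 1:maxit
        if norm(F(u)) <= tol * r0, break; end   % step 2 stopping rule
        A = Fprime(u);                          % Jacobian, reused below
        d = zeros(size(u));                     % step 2.1: d_{k,0} = 0
        for l = 1:lk                            % step 2.2: l_k EHS sweeps
            d = ehs_step(A, -F(u), d, theta);
        end
        v = u + d;                              % step 2.3
        h = zeros(size(u));                     % step 2.1: h_{k,0} = 0
        for m = 1:mk                            % step 2.5: m_k EHS sweeps
            h = ehs_step(A, -F(v), h, theta);
        end
        u = v + h;                              % step 2.6
    end
end
```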
Remark 2.2. For use in the subsequent analysis, we give the expressions of d_{k,l_k} and h_{k,m_k}:
where
After straightforward derivation, we have
Define
Then, the Jacobian matrix satisfies F′(u) = B(θ;u) − C(θ;u), and the corresponding iteration matrix is M(θ;u) = B(θ;u)^{−1}C(θ;u).
Therefore, we equivalently represent the MN-EHS method as follows:
3. Local convergence property of the MN-EHS method
In this section, we derive the convergence properties, beginning with local convergence under the Hölder continuity condition, similar to that in [28]. First, we state the following Banach lemma without proof.
Lemma 3.1. (Banach lemma) Let A, B ∈ C^{n×n} satisfy ‖I − BA‖ < 1; then, the matrices A, B are nonsingular. Moreover,
‖A^{−1}‖ ≤ ‖B‖/(1 − ‖I − BA‖)
and
‖A^{−1} − B‖ ≤ ‖B‖‖I − BA‖/(1 − ‖I − BA‖).
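As a quick sanity check of Lemma 3.1 (our own illustration, not part of the original text), the following MATLAB snippet verifies the first bound on a small random matrix, with B = I as a crude approximate inverse:

```matlab
% Numerical check of the Banach lemma bound ||A^{-1}|| <= ||B||/(1-||I-BA||).
rng(0); n = 5;
A = eye(n) + 0.1*randn(n);              % small perturbation of the identity
B = eye(n);                             % crude approximate inverse of A
assert(norm(eye(n) - B*A) < 1);         % hypothesis of the lemma
bound = norm(B) / (1 - norm(eye(n) - B*A));
fprintf('||inv(A)|| = %.3f <= %.3f\n', norm(inv(A)), bound);
```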
Suppose that F: D ⊂ C^n → C^n is a G-differentiable function on N_0 ⊂ D, where N_0 is a convex neighborhood of the point u^* satisfying F(u^*) = 0. Its Jacobian matrix F′(u) is continuous, positive definite and complex symmetric. For any u ∈ D, suppose that F′(u) = W(u) + iT(u) is the splitting of the Jacobian matrix F′(u). N(u^*, r) denotes the open ball centered at u^* with radius r > 0.
Assumption 3.1. For arbitrary u ∈ N(u^*, r) ⊂ N_0, assume that the following conditions hold.
(1) (The bounded condition) There are positive constants β and γ such that
(2) (The Hölder condition) For some p ∈ (0,1], there exist nonnegative constants H_w and H_t such that
Lemma 3.2. Under Assumption 3.1, for any u, v ∈ N(u^*, r), if r ∈ (0, 1/(γH)^{1/p}), then F′(u)^{−1} exists, and the following inequalities hold:
where S(u) := γ/(1 − γH‖u − u^*‖^p) and H := H_w + H_t.
Proof. According to the Hölder condition,
Since r ∈ (0, 1/(γH)^{1/p}), we have
Then, according to ‖F′(u^*)^{−1}‖ ≤ γ and Lemma 3.1, F′(u)^{−1} exists and
By
we get
As for the last inequality, since
it follows that
□
In the remainder of this article, we use the symbol ⌈·⌉ to denote the smallest integer that is no less than the corresponding real number, i.e., the ceiling function.
Theorem 3.1. Under the conditions of Assumption 3.1 and Lemma 3.2, let r ∈ (0, r_0), where r_0 = min{r_1, r_2, r_3}, and
The constant ν = min{l_*, m_*} satisfies ν > ⌈−ln(2βγ)/ln((τ+1)θ)⌉, where l_* = lim inf_{k→∞} l_k, m_* = lim inf_{k→∞} m_k, τ ∈ (0, (1−χ)/χ) is a prescribed positive constant and χ ≡ χ(θ; u_0) = ‖M(θ; u_0)‖ < 1.
Then, for any initial guess u_0 ∈ N(u^*, r) and any positive integer sequences {l_k}_{k=0}^∞ and {m_k}_{k=0}^∞, the iteration sequence {u_k}_{k=0}^∞ generated by the MN-EHS method converges to the exact solution u^*. In addition, we have
where
Proof. Since ‖M(θ; u^*)‖ ≤ δ(θ; u^*) < 1,
Then
According to the Hölder condition, we have
Based on Lemma 3.1,
Since
then
Here, r < r_1 implies that 4γH‖u − u^*‖^p/(1 − 2γH‖u − u^*‖^p) < τχ.
Hence,
Furthermore, we have
and, similarly, we get
Then, by induction, we can prove that {u_k}_{k=0}^∞ ⊂ N(u^*, r). First, when k = 0, ‖u_0 − u^*‖ < r < r_0 and
then u_1 ∈ N(u^*, r) since u_0 ∈ N(u^*, r).
Now suppose that u_n ∈ N(u^*, r) for some n ≥ 0; then, we have
which implies that u_{n+1} ∈ N(u^*, r). Moreover, u_n → u^* as n → ∞. □
4. Semilocal convergence property of the MN-EHS method
The convergence property discussed in the previous section is local: the iteration converges on the premise that the initial guess lies in an open ball centered at the exact solution. In practical computation, it is desirable to ascertain the existence of a solution of the nonlinear system (1.1), as well as the convergence of the iteration, directly from conditions on the initial guess. Here, we put forward the semilocal convergence theorem of the MN-EHS method.
Assumption 4.1. For any u_0 ∈ N_0, assume that the following conditions hold.
(1) (The bounded condition) There are two positive constants β and γ such that
(2) (The Hölder condition) There are two nonnegative constants H_w and H_t such that, for any u, v ∈ N(u_0, r) ⊂ N_0, the following inequalities are satisfied:
Lemma 4.1. For any u, v ∈ N(u_0, r), if r ∈ (0, 1/(γH)^{1/p}), then F′(u)^{−1} exists, and the following inequalities hold:
where H := H_w + H_t.
Proof. The proof is omitted because it is similar to that of Lemma 3.2. □
Before giving the semilocal convergence theorem, we need some preparation: we construct two sequences and give some lemmas.
First, we give two important functions:
where a,b,c and d are positive constants which satisfy
Define the sequences {t_k} and {s_k} by
Lemma 4.2. λ(t) is decreasing on [0, (a/b)^{1/p}) but increasing on [(a/b)^{1/p}, +∞). Moreover, if
then λ(t) = 0 has two solutions t^* and t^{**} in (0, +∞), which satisfy
Proof. See Lemma 2.1 in [18]. □
Lemma 4.3. Suppose that the sequences {t_k}, {s_k} are generated by the formula (4.6), and that t^* is the smaller nonnegative solution of φ(t) = 0. Then, the sequences {t_k} and {s_k} are increasing, converge to t^* and satisfy the following inequalities:
Proof. See Lemma 2.2 in [18]. □
The following theorem is the semilocal convergence theorem. Take a=(1+η)Hγ,b=(1−η),c=(1+η)γδ and d=Hγ in (4.4).
Theorem 4.1. Set r = min{r_1, r_2} with
where the constant ν = min{l_*, m_*} satisfies ν > ⌈−ln(2βγ)/ln((τ+1)θ)⌉; also, l_* = lim inf_{k→∞} l_k and m_* = lim inf_{k→∞} m_k. τ ∈ (0, (1−χ)/χ) is a prescribed positive constant and χ ≡ χ(θ; u_0) = ‖M(θ; u_0)‖ < 1.
Under the assumptions of Lemma 4.1, if
then the iteration sequence {u_k}_{k=0}^∞ generated by the MN-EHS method is well defined and converges to u^*, which satisfies F(u^*) = 0.
Proof. The following formulas are true and can be proved by induction:
We have
Here, we use an inequality introduced by Shen and Li in [29]. For any k>1, we have
since
Now, by induction, for any k,
Since u_{k−1}, v_{k−1} ∈ N(u_0, r), we have
Hence,
Similarly, we have
Consequently,
Now, the formulas given by (4.7) have been proved by induction. Since the sequences {t_k}, {s_k} converge to t^* and
the sequence {u_k} also converges to u^*. Since ‖M(θ; u^*)‖ < 1, we have that F(u^*) = 0. □
5. Numerical examples
Next, we demonstrate the validity of our new method via numerical examples. We chose some methods from previous papers, i.e., the modified Newton-PMHSS (MN-PMHSS) method [19] and the modified Newton-GSOR (MN-GSOR) method [23], for comparison with the MN-EHS method. In our computations, the CPU running time, denoted by "CPU time", was recorded by using the "tic-toc" command.
All of the numerical results in the following examples were obtained on a laptop with an AMD Ryzen 7 4800H (2.90 GHz) and 16.00 GB RAM, running MATLAB R2017b. The number of outer iteration steps is denoted by "Outer IT", and the number of inner iteration steps by "Inner IT". A thorny problem is the selection of the parameters in the iterations; in this study we used the experimentally optimal parameters, i.e., a parameter was chosen when it minimized the corresponding iteration counts and errors. All of the important data are listed in the tables.
Example 5.1. Consider the following group of partial differential equations, which can be converted to a nonlinear system:
where Ω = (0,1) × (0,1) with boundary ∂Ω. The constant κ, taken to be positive, measures the magnitude of the reaction term. We set the parameter values α_1 = α_2 = 1 and β_1 = β_2 = 2. This problem is converted to a nonlinear system of the form discussed in this manuscript by the central finite-difference scheme on an equidistant grid with step width Δt = h = 1/(N+1).
Here is the form of this nonlinear system:
where
Notice that A_N = tridiag(−1,2,−1) is a tridiagonal matrix and ⊗ denotes the Kronecker product.
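As a hedged sketch (the exact assembly in (5.1) is not reproduced here), the sparse block A_N and the Kronecker-product structure typical of a 2-D central-difference discretization can be formed in MATLAB as follows:

```matlab
% Sparse tridiagonal block and 2-D Kronecker-sum stencil (illustrative only).
N  = 30;  h = 1/(N+1);
AN = gallery('tridiag', N, -1, 2, -1);         % A_N = tridiag(-1,2,-1), sparse
K  = kron(speye(N), AN) + kron(AN, speye(N));  % common 2-D Laplacian structure
```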
It is obvious that u^* = 0 is a solution of (5.1), and the Jacobian matrix of F(u) can easily be worked out:
In our experiment, the initial guess was chosen to be u_0 = 1. The stopping criterion for the outer iteration was taken as
and η_k = η̃_k = η = 0.1 is the tolerance of the inner iterations. Table 1 gives the optimal values of α or θ for the three methods.
Tables 2, 3 and 4 show the experimental data for N = 30, 60, 90, comparing our MN-EHS method with the MN-PMHSS and MN-GSOR methods. To show how the parameter is chosen, Figure 1 plots how the number of inner iteration steps of the MN-EHS method changes as the parameter varies. We employed the parameters that minimize the number of inner iteration steps as the optimal parameters; when the numbers of inner iteration steps are the same for different parameters, the one with the smaller error is chosen.
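This experimental parameter search amounts to a simple sweep; a hedged MATLAB sketch is given below, where mn_ehs_solve is a hypothetical driver returning the total number of inner iteration steps, and F, Fprime, u0 and tol are assumed to be given.

```matlab
% Illustrative parameter sweep: keep the theta minimizing the inner iterations.
thetas = linspace(0, pi/2, 50);
best = inf; theta_opt = thetas(1);
for t = thetas
    [~, inner_its] = mn_ehs_solve(F, Fprime, u0, t, tol);  % hypothetical driver
    if inner_its < best
        best = inner_its; theta_opt = t;
    end
end
```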
According to the results in Tables 2, 3 and 4, the MN-EHS method typically requires less CPU time and fewer iterations than the MN-PMHSS and MN-GSOR methods as the constant κ and the problem size vary. This indicates that the MN-EHS method is the most effective of the three in this example.
Finally, the iteration counts of the MN-EHS method for varying problem sizes are shown in Figures 2, 3 and 4. Broadly, the number of outer iteration steps exhibited almost no change, while the number of inner iteration steps increased with the problem size, although not sharply.
Example 5.2. Consider the nonlinear Helmholtz equation
where σ_1 and σ_2 are real coefficients. Notice that the solution of this equation satisfies the Dirichlet boundary condition on D = [0,1] × [0,1]. After discretization with mesh size h = 1/(N+1), the nonlinear system has the form
where
with
and B_N = (1/h²) tridiag(−1,2,−1) ∈ R^{N×N} is a tridiagonal matrix.
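B_N uses the same stencil as A_N in Example 5.1, scaled by 1/h²; a short MATLAB sketch:

```matlab
% B_N = (1/h^2) * tridiag(-1, 2, -1), assembled as a sparse matrix.
N  = 30;  h = 1/(N+1);
BN = gallery('tridiag', N, -1, 2, -1) / h^2;
```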
In this numerical experiment, we applied σ_1 = 10^3 and σ_2 = 10^4. The initial guess was taken as u_0 = 0, where 0 is the zero vector. The tolerance of the inner iterations was set as η_k = η̃_k = η = 0.1, the same as in the first example. Slightly different from the first example, the stopping criterion for the outer iterations was set as
Table 5 lists the experimental parameters we applied.
Table 6 displays the numerical results for N = 30, 60, 90; the MN-EHS method again outperformed the other two methods in this example.
6. Conclusions
The main aim of this article was to present an iterative method for solving large sparse nonlinear systems whose Jacobian matrices are complex symmetric. Solving this type of nonlinear system is very important in practical applications across a large number of scientific computations. This paper presented the construction of the new MN-EHS method and derived its convergence properties. Two academic test examples arising from differential equations were given. Comparing the MN-EHS method with some methods in the existing literature, in the form of tables and data, the results indicate that our proposed method performs better than existing methods on these types of problems.
Acknowledgments
This work was supported by the National Natural Science Foundation of China (Grant no. 12271479, Research on some efficient and fast algorithms for complex nonlinear problems).
Conflict of interest
The authors declare that they have no conflict of interest.