An accelerated adaptive two-step Levenberg–Marquardt method with the modified Metropolis criterion

Dingyu Zhu; Yueting Yang; Mingyuan Cao

doi:10.3934/math.20241199

AIMS Mathematics

2024, Volume 9, Issue 9: 24610-24635. doi: 10.3934/math.20241199


Research article

School of Mathematics and Statistics, Beihua University, Jilin 132013, China

Received: 11 June 2024 Revised: 11 August 2024 Accepted: 15 August 2024 Published: 22 August 2024
MSC : 65K05, 90C30

In this paper, a new two-step Levenberg–Marquardt method is proposed for solving nonlinear equations. A new Levenberg–Marquardt parameter is introduced to obtain the trial step, and a new modified Metropolis criterion is used to adjust the upper bound of the approximate step. The convergence of the method is analyzed under the Hölderian local error bound condition and the Hölderian continuity of the Jacobian. Numerical experiments show that the new algorithm is effective and competitive in the numbers of function evaluations, Jacobian evaluations and iterations.

Keywords: nonlinear equations, Levenberg–Marquardt method, Metropolis criterion, Hölderian local error bound, Hölderian continuity

Citation: Dingyu Zhu, Yueting Yang, Mingyuan Cao. An accelerated adaptive two-step Levenberg–Marquardt method with the modified Metropolis criterion[J]. AIMS Mathematics, 2024, 9(9): 24610-24635. doi: 10.3934/math.20241199

Related Papers:

[1]	Muhammad Sajjad, Tariq Shah, Qin Xin, Bander Almutairi . Eisenstein field BCH codes construction and decoding. AIMS Mathematics, 2023, 8(12): 29453-29473. doi: 10.3934/math.20231508
[2]	Berna Arslan . On generalized biderivations of Banach algebras. AIMS Mathematics, 2024, 9(12): 36259-36272. doi: 10.3934/math.20241720
[3]	Moin A. Ansari, Ali N. A. Koam, Azeem Haider . Intersection soft ideals and their quotients on KU-algebras. AIMS Mathematics, 2021, 6(11): 12077-12084. doi: 10.3934/math.2021700
[4]	Shan Li, Kaijia Luo, Jiankui Li . Generalized Lie $n$ -derivations on generalized matrix algebras. AIMS Mathematics, 2024, 9(10): 29386-29403. doi: 10.3934/math.20241424
[5]	Jie Qiong Shi, Xiao Long Xin . Ideal theory on EQ-algebras. AIMS Mathematics, 2021, 6(11): 11686-11707. doi: 10.3934/math.2021679
[6]	Shakir Ali, Ali Yahya Hummdi, Mohammed Ayedh, Naira Noor Rafiquee . Linear generalized derivations on Banach $^$ -algebras. AIMS Mathematics, 2024, 9(10): 27497-27511. doi: 10.3934/math.20241335*
[7]	Dan Liu, Jianhua Zhang, Mingliang Song . Local Lie derivations of generalized matrix algebras. AIMS Mathematics, 2023, 8(3): 6900-6912. doi: 10.3934/math.2023349
[8]	Wen Teng, Jiulin Jin, Yu Zhang . Cohomology of nonabelian embedding tensors on Hom-Lie algebras. AIMS Mathematics, 2023, 8(9): 21176-21190. doi: 10.3934/math.20231079
[9]	Yingyu Luo, Yu Wang, Junjie Gu, Huihui Wang . Jordan matrix algebras defined by generators and relations. AIMS Mathematics, 2022, 7(2): 3047-3055. doi: 10.3934/math.2022168
[10]	He Yuan, Zhuo Liu . Lie $n$ -centralizers of generalized matrix algebras. AIMS Mathematics, 2023, 8(6): 14609-14622. doi: 10.3934/math.2023747

1. Introduction

Let $m$ and $n$ be two positive integers with $m\geq 2$ and $n\geq 2$, let $[n] = \{1, 2, \ldots, n\}$, let $\mathbb{R}$ be the set of all real numbers, and let ${\mathbb{R}}^{n}$ be the set of all $n$-dimensional real vectors. Let $x = (x_1, x_2, \ldots, x_m)\in\mathbb{R}^{m}$ and $y = (y_1, y_2, \ldots, y_n)\in\mathbb{R}^{n}$. If a fourth-order tensor $\mathcal{A} = (a_{ijkl})\in\mathbb{R}^{[m]\times [n]\times [m]\times [n]}$ satisfies the properties

$a_{ijkl} = a_{kjil} = a_{ilkj} = a_{klij}, \quad i, k\in [m], \quad j, l\in [n],$

then we call $\mathcal{A}$ a partially symmetric tensor.
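The symmetry conditions above can be checked mechanically. The following is a minimal sketch, assuming the tensor is stored as a Python dict keyed by 1-based index tuples `(i, j, k, l)`; the function name `is_partially_symmetric` and this storage layout are illustrative choices, not part of the paper. The test data are the entries of the tensor that appears later in Example 1.

```python
from itertools import product

def is_partially_symmetric(a, m, n, tol=1e-12):
    """Return True if a_ijkl = a_kjil = a_ilkj = a_klij for all indices.

    `a` is a dict keyed by 1-based tuples (i, j, k, l), with i, k in 1..m
    and j, l in 1..n, mirroring the notation a_{ijkl}.
    """
    for i, j, k, l in product(range(1, m + 1), range(1, n + 1),
                              range(1, m + 1), range(1, n + 1)):
        value = a[(i, j, k, l)]
        partners = (a[(k, j, i, l)], a[(i, l, k, j)], a[(k, l, i, j)])
        if any(abs(value - p) > tol for p in partners):
            return False
    return True

# Entries of the 2 x 2 x 2 x 2 tensor used in Example 1.
example1 = {
    (1, 1, 1, 1): 1, (1, 1, 1, 2): 2, (1, 1, 2, 1): 2, (1, 2, 1, 2): 3,
    (1, 2, 2, 2): 5, (1, 2, 1, 1): 2, (1, 1, 2, 2): 4, (1, 2, 2, 1): 4,
    (2, 1, 1, 1): 2, (2, 1, 1, 2): 4, (2, 1, 2, 1): 3, (2, 1, 2, 2): 5,
    (2, 2, 1, 1): 4, (2, 2, 1, 2): 5, (2, 2, 2, 1): 5, (2, 2, 2, 2): 6,
}
print(is_partially_symmetric(example1, 2, 2))  # True

# Changing one entry without its symmetric partner breaks the property.
broken = dict(example1)
broken[(1, 1, 1, 2)] = 7
print(is_partially_symmetric(broken, 2, 2))    # False
```

The third identity, $a_{ijkl} = a_{klij}$, follows from the first two, so checking it is redundant but harmless.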

It is well known that the elastic modulus tensor of elastic materials is partially symmetric ^[11]. Moreover, the components of a fourth-order partially symmetric tensor $\mathcal{A}$ can be regarded as the coefficients of the following biquadratic homogeneous polynomial optimization problem ^[6,19]:

$\begin{eqnarray} &&\max\; f(x, y)\equiv \mathcal{A} xyxy \equiv \sum\limits_{i, k\in[m]}\sum\limits_{j, l\in[n]}a_{ijkl}x_{i}y_{j}x_{k}y_{l}, \\ &&\text{s.t.}\; \; x^{\top}x = 1, \; \; y^{\top}y = 1. \end{eqnarray}$

(1.1)

This optimization problem plays an important role in the analysis of nonlinear elastic materials and in the entanglement problem in quantum physics ^[5,6,8,9,26]. To solve it, we use a reformulation based on the following definition:

Definition 1.1. ^[11,20,21] Let $\mathcal{A} = (a_{ijkl})\in\mathbb{R}^{[m]\times [n]\times [m]\times [n]}$ be a partially symmetric tensor. If there are $\lambda\in\mathbb{R}$ , $x\in {\mathbb{R}}^{m}\setminus\{0\}$ and $y\in {\mathbb{R}}^{n}\setminus\{0\}$ such that

$\begin{eqnarray} \mathcal{A}{\cdot}yxy = \lambda x, \quad \mathcal{A}xyx{\cdot} = \lambda y, \quad x^{\top}x = 1, \quad y^{\top}y = 1, \end{eqnarray}$

(1.2)

where

$\begin{eqnarray*} (\mathcal{A}\cdot yxy)_{i} = \sum\limits_{k\in[m]}\sum\limits_{j, l\in[n]}a_{ijkl}y_{j}x_{k}y_{l}, \; \; (\mathcal{A}xyx\cdot)_{l} = \sum\limits_{i, k\in[m]}\sum\limits_{ j\in[n]}a_{ijkl}x_{i}y_{j}x_{k}, \end{eqnarray*}$

then we call $\lambda$ an M-eigenvalue of $\mathcal{A}$, and $x$ and $y$ the left and right M-eigenvectors associated with $\lambda$, respectively. Let $\sigma(\mathcal{A})$ be the set of all M-eigenvalues of $\mathcal{A}$ and $\lambda_{\max}(\mathcal{A})$ be the largest M-eigenvalue of $\mathcal{A}$, i.e.,

$\begin{eqnarray*} \lambda_{\max}(\mathcal{A}) = \max\{|\lambda|:\lambda\in\sigma(\mathcal{A})\}. \end{eqnarray*}$
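To make Definition 1.1 concrete, the sketch below evaluates the two contractions $\mathcal{A}\cdot yxy$ and $\mathcal{A}xyx\cdot$ and measures how far a candidate triple $(\lambda, x, y)$ is from satisfying (1.2). The helper names and the dict-based storage are assumptions made for illustration. The test tensor is the identity tensor $\mathcal{I}$ with $\delta_{ijkl} = 1$ if $i = k$ and $j = l$, for which every pair of unit vectors is an M-eigenpair with $\lambda = 1$.

```python
from itertools import product

def left_product(a, x, y):
    """(A . y x y)_i = sum over k, j, l of a_ijkl * y_j * x_k * y_l."""
    m, n = len(x), len(y)
    return [sum(a[(i, j, k, l)] * y[j - 1] * x[k - 1] * y[l - 1]
                for j, k, l in product(range(1, n + 1), range(1, m + 1), range(1, n + 1)))
            for i in range(1, m + 1)]

def right_product(a, x, y):
    """(A x y x .)_l = sum over i, j, k of a_ijkl * x_i * y_j * x_k."""
    m, n = len(x), len(y)
    return [sum(a[(i, j, k, l)] * x[i - 1] * y[j - 1] * x[k - 1]
                for i, j, k in product(range(1, m + 1), range(1, n + 1), range(1, m + 1)))
            for l in range(1, n + 1)]

def m_eigen_residual(a, lam, x, y):
    """Largest absolute violation of the four conditions in (1.2)."""
    left = [p - lam * xi for p, xi in zip(left_product(a, x, y), x)]
    right = [p - lam * yl for p, yl in zip(right_product(a, x, y), y)]
    norms = [sum(v * v for v in x) - 1.0, sum(v * v for v in y) - 1.0]
    return max(abs(v) for v in left + right + norms)

# Illustrative data (an assumption, not one of the paper's examples):
# the identity tensor, delta_ijkl = 1 if i = k and j = l, else 0.
m, n = 2, 2
identity = {(i, j, k, l): float(i == k and j == l)
            for i, j, k, l in product(range(1, m + 1), range(1, n + 1),
                                      range(1, m + 1), range(1, n + 1))}
x, y = [0.6, 0.8], [1.0, 0.0]
residual_good = m_eigen_residual(identity, 1.0, x, y)  # close to zero
residual_bad = m_eigen_residual(identity, 2.0, x, y)   # clearly nonzero
print(residual_good, residual_bad)
```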

In 2009, Wang, Qi and Zhang ^[24] pointed out that Problem (1.1) is equivalent to computing the largest M-eigenvalue of a fourth-order partially symmetric tensor. Based on this, they presented an algorithm (the WQZ-algorithm) to find the largest M-eigenvalue of such a tensor.

WQZ-algorithm [24,Algorithm 4.1]:

Initial step: Input $\mathcal{A} = (a_{ijkl})\in\mathbb{R}^{[m]\times [n]\times [m]\times [n]}$ and unfold it into a matrix $A = (A_{st})\in\mathbb{R}^{[mn]\times [mn]}$ by mapping $A_{st} = a_{ijkl}$ with $s = n(i-1)+j, \; \; \; \; t = n(k-1)+l.$

Substep 1: Take

$\begin{eqnarray} \tau = \sum\limits_{1\leq s\leq t\leq mn}|A_{st}|, \end{eqnarray}$

(1.3)

and set

$\begin{eqnarray} \overline{\mathcal{A}} = \tau\mathcal{I}+\mathcal{A}, \end{eqnarray}$

(1.4)

where $\mathcal{I} = (\delta_{ijkl})\in\mathbb{R}^{[m]\times [n]\times [m]\times [n]}$ with $\delta_{ijkl} = 1$ if $i = k$ and $j = l,$ otherwise, $\delta_{ijkl} = 0.$ Then unfold $\overline{\mathcal{A}} = (\overline{a}_{ijkl})\in\mathbb{R}^{[m]\times [n]\times [m]\times [n]}$ into a matrix $\overline{A} = (\overline{A}_{st})\in\mathbb{R}^{[mn]\times [mn]}.$

Substep 2: Compute the unit eigenvector $w = (w_k)_{k = 1}^{mn}\in\mathbb{R}^{mn}$ of the matrix $\overline{A}$ associated with its largest eigenvalue, and fold the vector $w$ into the matrix $W = (W_{ij})\in\mathbb{R}^{[m]\times [n]}$ in the following way:

$W_{ij} = w_k,$

where $i = \lceil k/n \rceil, \; \; \; j = (k-1)\bmod n+1, \; \; \; \forall\; k = 1, 2, \cdots, mn.$

Substep 3: Compute the singular vectors $u_1$ and $v_1$ corresponding to the largest singular value $\sigma_1$ of the matrix $W$. Specifically, the singular value decomposition of $W$ is

$W = U\Sigma V^T = \sum\limits_{i = 1}^r \sigma_iu_iv_i^T,$

where $u_i$ and $v_i$ are the $i$-th columns of $U$ and $V$, $\sigma_1\geq\sigma_2\geq\cdots\geq\sigma_r$, and $r$ is the rank of $W$.

Substep 4: Take $x_0 = u_1, y_0 = v_1,$ and let $k = 0.$

Iterative step: Execute the following procedures alternately until a certain convergence criterion is satisfied, and output $x^\ast, y^\ast:$

$\begin{eqnarray*} &&\overline{x}_{k+1} = \overline{\mathcal{A}}\cdot y_kx_ky_k, \; \; \; \; \; \; \; \; \; x_{k+1} = \frac{\overline{x}_{k+1}}{||\overline{x}_{k+1}||}, \\ &&\overline{y}_{k+1} = \overline{\mathcal{A}}x_{k+1}y_kx_{k+1}\cdot, \; \; \; \; \; y_{k+1} = \frac{\overline{y}_{k+1}}{||\overline{y}_{k+1}||}, \\ &&k = k+1. \end{eqnarray*}$

Final step: Output the largest M-eigenvalue of the tensor $\mathcal{A}$ :

$\lambda_{\max}(\mathcal{A}) = f(x^\ast, y^\ast)-\tau,$

where

$f(x^\ast, y^\ast) = \sum\limits_{i, k\in [m]}\sum\limits_{j, l\in [n]}\overline{a}_{ijkl}x_i^\ast y_j^\ast x_k^\ast y_l^\ast,$

and the associated M-eigenvectors: $x^\ast, y^\ast$ .
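The steps of the WQZ-algorithm listed above can be sketched in NumPy as follows. This is an illustrative sketch under stated assumptions, not the reference implementation of ^[24]: the function name `wqz`, the optional `tau` argument, and the stopping rule (change in the objective below `tol`) are choices made here, since the text only asks for "a certain convergence criterion". The tensor is stored as an array of shape `(m, n, m, n)` with 0-based indices, so that a C-order reshape reproduces the unfolding $s = n(i-1)+j$, $t = n(k-1)+l$.

```python
import numpy as np

def wqz(A, tau=None, tol=1e-10, max_iter=10_000):
    """Sketch of the WQZ-algorithm for a partially symmetric A of shape (m, n, m, n)."""
    m, n = A.shape[0], A.shape[1]
    A_mat = A.reshape(m * n, m * n)               # unfolding of the initial step
    if tau is None:
        tau = np.abs(np.triu(A_mat)).sum()        # (1.3): sum of |A_st| over s <= t
    identity = np.einsum('ik,jl->ijkl', np.eye(m), np.eye(n))
    A_bar = tau * identity + A                    # (1.4)

    # Substep 2: unit eigenvector for the largest eigenvalue, folded into W.
    _, vecs = np.linalg.eigh(A_bar.reshape(m * n, m * n))  # ascending eigenvalues
    W = vecs[:, -1].reshape(m, n)
    # Substeps 3 and 4: leading singular vectors give the starting point.
    U, _, Vt = np.linalg.svd(W)
    x, y = U[:, 0], Vt[0, :]

    def objective(x, y):
        return float(np.einsum('ijkl,i,j,k,l->', A_bar, x, y, x, y))

    f_old = objective(x, y)
    for iterations in range(1, max_iter + 1):
        x = np.einsum('ijkl,j,k,l->i', A_bar, y, x, y)
        x = x / np.linalg.norm(x)
        y = np.einsum('ijkl,i,j,k->l', A_bar, x, y, x)
        y = y / np.linalg.norm(y)
        f_new = objective(x, y)
        if abs(f_new - f_old) < tol:              # assumed stopping rule
            break
        f_old = f_new
    return f_new - tau, x, y, iterations          # final step: subtract the shift

# Illustrative rank-one test tensor a_ijkl = u_i v_j u_k v_l (not from the paper);
# f(x, y) = (u.x)^2 (v.y)^2, so the largest M-eigenvalue is 1, attained at x = u, y = v.
u = np.array([0.6, 0.8])
v = np.array([0.8, 0.6])
A = np.einsum('i,j,k,l->ijkl', u, v, u, v)
lam, x_star, y_star, its = wqz(A)
print(lam, its)
```

Exposing `tau` as an argument makes it possible to rerun the same iteration with a different shift, which is the kind of comparison carried out in Section 3.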

The M-eigenvalues of tensors are closely related to the strong ellipticity condition in elasticity theory, which guarantees the existence of solutions to the fundamental boundary value problems of elastostatics ^[3,5,16]. However, when the dimensions $m$ and $n$ of the tensor are large, it is not easy to calculate all M-eigenvalues. Thus, the problem of M-eigenvalue localization has attracted the attention of many researchers, and many M-eigenvalue localization sets have been given; see ^[2,4,13,14,15,17,18,23,27].

In particular, Wang, Li and Che ^[23] presented the following M-eigenvalue localization set for a partially symmetric tensor:

Theorem 1.1. [23,Theorem 2.2] Let $\mathcal{A} = (a_{ijkl})\in\mathbb{R}^{[m]\times [n]\times [m]\times [n]}$ be a partially symmetric tensor. Then

$\begin{eqnarray*} \sigma(\mathcal{A})\subseteq\mathcal{H}(\mathcal{A}) = \bigcup\limits_{i\in[m]}\bigcap\limits_{k\in[m], k\neq i}\mathcal{H}_{i, k}(\mathcal{A}), \end{eqnarray*}$

where

$\begin{eqnarray*} &\mathcal{H}_{i, k}(\mathcal{A}) = \Big[\widehat{\mathcal{H}}_{i, k}(\mathcal{A})\cup(\overline{\mathcal{H}}_{i, k}(\mathcal{A})\cap\Gamma_{i}(\mathcal{A}))\Big], &\\ &\widehat{\mathcal{H}}_{i, k}(\mathcal{A}) = \{z\in\mathbb{C}: |z|\leq R_{i}(\mathcal{A})-R_{i}^{k}(\mathcal{A}), \; |z|\leq R_{k}^{k}(\mathcal{A})\}, &\\ &\overline{\mathcal{H}}_{i, k}(\mathcal{A}) = \{z\in\mathbb{C}: (|z|-(R_{i}(\mathcal{A})-R_{i}^{k}(\mathcal{A})))(|z|-R_{k}^{k}(\mathcal{A}))\leq R_{i}^{k}(\mathcal{A})(R_{k}(\mathcal{A})-R_{k}^{k}(\mathcal{A}))\}, &\\ &\Gamma_{i}(\mathcal{A}) = \{z\in\mathbb{C}: |z|\leq R_{i}(\mathcal{A})\}, &\\ &R_{i}(\mathcal{A}) = \sum\limits_{k\in[m]}\sum\limits_{j, l\in[n]}|a_{ijkl}|, \; \; R_{i}^{k}(\mathcal{A}) = \sum\limits_{j, l\in[n]}|a_{ijkl}|.& \end{eqnarray*}$

From the set $\mathcal{H}(\mathcal{A})$ in Theorem 1.1, we can obtain an upper bound of the largest M-eigenvalue $\lambda_{\max}(\mathcal{A})$, which can be taken as the parameter $\tau$ in the WQZ-algorithm. Example 2 in ^[15] indicates that the smaller the upper bound of $\lambda_{\max}(\mathcal{A})$ used as $\tau$, the faster the WQZ-algorithm converges. In view of this, this paper provides a smaller upper bound based on a new inclusion set and takes this new upper bound as the parameter $\tau$ to make the WQZ-algorithm converge to $\lambda_{\max}(\mathcal{A})$ faster.

The remainder of this paper is organized as follows. In Section 2, we provide an M-eigenvalue localization set for a partially symmetric tensor $\mathcal{A}$ and prove that the new set is tighter than some existing M-eigenvalue localization sets. In Section 3, based on the new set, we provide an upper bound for the largest M-eigenvalue of $\mathcal{A}$ . As an application, in order to make the sequence generated by WQZ-algorithm converge to the largest M-eigenvalue of $\mathcal{A}$ faster, we replace the parameter $\tau$ in WQZ-algorithm with the upper bound. In Section 4, we conclude this article.

2. A sharper M-eigenvalue localization set of a fourth-order partially symmetric tensor

In this section, we provide a new M-eigenvalue localization set of a fourth-order partially symmetric tensor and prove that the new M-eigenvalue localization set is tighter than that in Theorem 1.1, i.e., Theorem 2.2 in ^[23]. Before that, the following conclusion in ^[1,25] is needed.

Lemma 2.1. Let $x = (x_1, x_2, \ldots, x_n)^{\top}\in\mathbb{R}^{n}$ and $y = (y_1, y_2, \ldots, y_n)^{\top}\in\mathbb{R}^{n}$ . Then

a) If $\parallel x\parallel_{2} = 1$ , then $|x_{i}||x_{j}|\leq\frac{1}{2}$ for $i, j\in[n]$ , $i\neq j$ ;

b) $\Big(\sum\limits_{i\in[n]}x_{i}y_{i}\Big)^{2}\leq\sum\limits_{i\in[n]}x_{i}^2\sum\limits_{i\in[n]}y_{i}^2$ .
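As a brief justification (a standard argument sketched here, not quoted from ^[1,25]): part b) is the Cauchy–Schwarz inequality, and part a) follows from the arithmetic–geometric mean inequality together with the normalization $\parallel x\parallel_{2} = 1$, since for $i\neq j$

$\begin{eqnarray*} |x_{i}||x_{j}|\leq\frac{1}{2}\big(x_{i}^{2}+x_{j}^{2}\big)\leq\frac{1}{2}\sum\limits_{k\in[n]}x_{k}^{2} = \frac{1}{2}. \end{eqnarray*}$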

Theorem 2.1. Let $\mathcal{A} = (a_{ijkl})\in\mathbb{R}^{[m]\times [n]\times [m]\times [n]}$ be a partially symmetric tensor. Then

$\begin{eqnarray*} \sigma(\mathcal{A})\subseteq\Upsilon(\mathcal{A}) = \bigcup\limits_{i\in[m]}\bigcap\limits_{s\in[m], s\neq i}\Upsilon_{i, s}(\mathcal{A}), \end{eqnarray*}$

where

$\begin{align*} \Upsilon_{i, s}(\mathcal{A})& = \Big[\widehat{\Upsilon}_{i, s}(\mathcal{A})\cup(\widetilde{\Upsilon}_{i, s}(\mathcal{A})\cap\overline{\Upsilon}_{i, s}(\mathcal{A}))\Big], \\ \widehat{\Upsilon}_{i, s}(\mathcal{A})& = \{z\in\mathbb{R}: |z| < \widetilde{r}_{i}^{s}(\mathcal{A}), \; |z| < r_{s}^{s}(\mathcal{A})\}, \\ \widetilde{\Upsilon}_{i, s}(\mathcal{A})& = \{z\in\mathbb{R}: (|z|-\widetilde{r}_{i}^{s}(\mathcal{A}))(|z|- r_{s}^{s}(\mathcal{A}))\leq r_{i}^{s}(\mathcal{A})\widetilde{r}_{s}^{s}(\mathcal{A})\}, \\ \overline{\Upsilon}_{i, s}(\mathcal{A})& = \{z\in\mathbb{R}: |z| \leq \widetilde{r}_{i}^{s}(\mathcal{A})+r_{i}^{s}(\mathcal{A})\}, \end{align*}$

and

$\begin{align*} \widetilde{r}_{t}^{s}(\mathcal{A})& = \frac{1}{2}\sum\limits_{k\in[m], k\neq s}\sum\limits_{j, l\in[n], j\neq l}|a_{tjkl}|+\sum\limits_{k\in[m], k\neq s}\sqrt{\sum\limits_{l\in[n]} a_{tlkl}^{2}}, \\ r_{t}^{s}(\mathcal{A})& = \frac{1}{2}\sum\limits_{j, l\in[n], j\neq l}|a_{tjsl}|+\sqrt{\sum\limits_{l\in[n]}a_{tlsl}^{2}}, \; \; \; \; t\in[m]. \end{align*}$
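The two quantities $\widetilde{r}_{t}^{s}(\mathcal{A})$ and $r_{t}^{s}(\mathcal{A})$ are straightforward to evaluate. The sketch below is a direct transcription of the two formulas, assuming (for illustration only) that the tensor is stored as a dict keyed by 1-based tuples `(i, j, k, l)`; the function names are hypothetical. The values are evaluated on the entries of the tensor from Example 1.

```python
from math import sqrt

def r_tilde(a, t, s, m, n):
    """r~_t^s(A): for every k != s, half the off-diagonal (j != l) absolute sum
    of a_tjkl plus the Euclidean norm of the diagonal entries a_tlkl."""
    total = 0.0
    for k in range(1, m + 1):
        if k == s:
            continue
        total += 0.5 * sum(abs(a[(t, j, k, l)])
                           for j in range(1, n + 1)
                           for l in range(1, n + 1) if j != l)
        total += sqrt(sum(a[(t, l, k, l)] ** 2 for l in range(1, n + 1)))
    return total

def r(a, t, s, n):
    """r_t^s(A): the same two terms, restricted to the single index k = s."""
    off_diagonal = 0.5 * sum(abs(a[(t, j, s, l)])
                             for j in range(1, n + 1)
                             for l in range(1, n + 1) if j != l)
    diagonal = sqrt(sum(a[(t, l, s, l)] ** 2 for l in range(1, n + 1)))
    return off_diagonal + diagonal

# Entries of the 2 x 2 x 2 x 2 tensor used in Example 1.
example1 = {
    (1, 1, 1, 1): 1, (1, 1, 1, 2): 2, (1, 1, 2, 1): 2, (1, 2, 1, 2): 3,
    (1, 2, 2, 2): 5, (1, 2, 1, 1): 2, (1, 1, 2, 2): 4, (1, 2, 2, 1): 4,
    (2, 1, 1, 1): 2, (2, 1, 1, 2): 4, (2, 1, 2, 1): 3, (2, 1, 2, 2): 5,
    (2, 2, 1, 1): 4, (2, 2, 1, 2): 5, (2, 2, 2, 1): 5, (2, 2, 2, 2): 6,
}
print(r_tilde(example1, 1, 2, 2, 2))  # 2 + sqrt(10)
print(r(example1, 1, 2, 2))           # 4 + sqrt(29)
```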

Proof. Let $\lambda$ be an M-eigenvalue of $\mathcal{A}$, and let $x\in \mathbb{R}^{m}\backslash\{0\}$ and $y\in\mathbb{R}^{n}\backslash\{0\}$ be its left and right M-eigenvectors, respectively. Then $x^{\top}x = 1$ and $y^{\top}y = 1$. Let $|x_t| = \max\limits_{i\in [m]}|x_i|$. Then $0 < |x_t|\leq 1$. For any given $s\in[m]$ with $s\neq t$, by the $t$-th equation of $\mathcal{A}\cdot yxy = \lambda x$ in (1.2), we have

$\begin{align*} \lambda x_{t}& = \sum\limits_{k\in[m]}\sum\limits_{j, l\in[n]}a_{tjkl}y_{j}x_{k}y_{l}\\ & = \sum\limits_{k\in[m], \atop k\neq s}\sum\limits_{j, l\in[n], \atop j\neq l}a_{tjkl}y_{j}x_{k}y_{l}+\sum\limits_{k\in[m], \atop k\neq s}\sum\limits_{l\in[n]}a_{tlkl}y_{l}x_{k}y_{l} +\sum\limits_{j, l\in[n], \atop j\neq l}a_{tjsl}y_{j}x_{s}y_{l}+\sum\limits_{l\in[n]}a_{tlsl}y_{l}x_{s}y_{l}. \end{align*}$

Taking the modulus of the above equation and using the triangle inequality and Lemma 2.1, one has

$\begin{align*} |\lambda||x_{t}|\leq&\sum\limits_{k\in[m], \atop k\neq s}\sum\limits_{j, l\in[n], \atop j\neq l}|a_{tjkl}||y_{j}||x_{k}||y_{l}|+\sum\limits_{k\in[m], \atop k\neq s}\sum\limits_{l\in[n]}|a_{tlkl}||y_{l}|| x_{k}||y_{l}| +\sum\limits_{j, l\in[n], \atop j\neq l}|a_{tjsl}||y_{j}||x_{s}||y_{l}|+\\ &\sum\limits_{l\in[n]}|a_{tlsl}||y_{l}||x_{s}||y_{l}|\\ \leq&\frac{1}{2}\sum\limits_{k\in[m], \atop k\neq s}\sum\limits_{j, l\in[n], \atop j\neq l}|a_{tjkl}||x_{t}|+\sum\limits_{k\in[m], \atop k\neq s}\sum\limits_{l\in[n]}|a_{tlkl}|| y_{l}||x_{t}| +\frac{1}{2}\sum\limits_{j, l\in[n], \atop j\neq l}|a_{tjsl}||x_{s}|+\sum\limits_{l\in[n]}|a_{tlsl}||y_{l}||x_{s}|\\ = &\frac{1}{2}\sum\limits_{k\in[m], \atop k\neq s}\sum\limits_{j, l\in[n], \atop j\neq l}|a_{tjkl}||x_{t}|+|x_{t}|\sum\limits_{k\in[m], \atop k\neq s}\Big(\sum\limits_{l\in[n]}| a_{tlkl}||y_{l}|\Big)+\frac{1}{2}\sum\limits_{j, l\in[n], \atop j\neq l}|a_{tjsl}||x_{s}|+|x_{s}|\sum\limits_{l\in[n]}|a_{tlsl}||y_{l}|\\ \leq&\frac{1}{2}\sum\limits_{k\in[m], \atop k\neq s}\sum\limits_{j, l\in[n], \atop j\neq l}|a_{tjkl}||x_{t}|+|x_{t}|\sum\limits_{k\in[m], \atop k\neq s}\Bigg(\sqrt{\sum\limits_{l\in[n]}|a_{tlkl}|^{2}}\sqrt{\sum\limits_{l\in[n]}|y_{l}|^{2}}\Bigg)\\ &+\frac{1}{2}\sum\limits_{j, l\in[n], \atop j\neq l}|a_{tjsl}||x_{s}|+|x_{s}|\sqrt{\sum\limits_{l\in[n]}|a_{tlsl}|^{2}}\sqrt{\sum\limits_{l\in[n]}|y_{l}|^{2}}\\ = &\frac{1}{2}\sum\limits_{k\in[m], \atop k\neq s}\sum\limits_{j, l\in[n], \atop j\neq l}|a_{tjkl}||x_{t}|+|x_{t}|\sum\limits_{k\in[m], \atop k\neq s}\sqrt{\sum\limits_{l\in[n]} a_{tlkl}^{2}} +\frac{1}{2}\sum\limits_{j, l\in[n], \atop j\neq l}|a_{tjsl}||x_{s}|+|x_{s}|\sqrt{\sum\limits_{l\in[n]}a_{tlsl}^{2}}\\ = &\Bigg(\frac{1}{2}\sum\limits_{k\in[m], \atop k\neq s}\sum\limits_{j, l\in[n], \atop j\neq l}|a_{tjkl}|+\sum\limits_{k\in[m], \atop k\neq s}\sqrt{\sum\limits_{l\in[n]} a_{tlkl}^{2}}\Bigg)|x_{t}| +\Bigg(\frac{1}{2}\sum\limits_{j, l\in[n], \atop j\neq 
l}|a_{tjsl}|+\sqrt{\sum\limits_{l\in[n]}a_{tlsl}^{2}}\Bigg)|x_{s}|\\ = &\widetilde{r}_{t}^{s}(\mathcal{A})|x_{t}|+r_{t}^{s}(\mathcal{A})|x_{s}|, \end{align*}$

i.e.,

$\begin{eqnarray} (|\lambda|-\widetilde{r}_{t}^{s}(\mathcal{A}))|x_{t}|\leq r_{t}^{s}(\mathcal{A})|x_{s}|. \end{eqnarray}$

(2.1)

Since $|x_{s}|\leq|x_{t}|$, (2.1) gives $(|\lambda|-\widetilde{r}_{t}^{s}(\mathcal{A}))|x_{t}|\leq r_{t}^{s}(\mathcal{A})|x_{t}|$. Dividing by $|x_{t}| > 0$ yields $|\lambda|\leq \widetilde{r}_{t}^{s}(\mathcal{A})+ r_{t}^{s}(\mathcal{A})$, i.e., $\lambda\in\overline{\Upsilon}_{t, s}(\mathcal{A})$.

If $|x_{s}| > 0$ , then by the $s$ -th equation of (1.2), we have

$\begin{align*} \lambda x_{s}& = \sum\limits_{k\in[m]}\sum\limits_{j, l\in[n]}a_{sjkl}y_{j}x_{k}y_{l}\\ & = \sum\limits_{k\in[m], \atop k\neq s}\sum\limits_{j, l\in[n], \atop j\neq l}a_{sjkl}y_{j}x_{k}y_{l}+\sum\limits_{k\in[m], \atop k\neq s}\sum\limits_{l\in[n]}a_{slkl}y_{l}x_{k}y_{l}+\sum\limits_{j, l\in[n], \atop j\neq l}a_{sjsl}y_{j}x_{s}y_{l}+\sum\limits_{l\in[n]}a_{slsl}y_{l}x_{s}y_{l}. \end{align*}$

Taking the modulus of the above equation and using the triangle inequality and Lemma 2.1 yield

$\begin{align*} |\lambda||x_{s}|\leq&\sum\limits_{k\in[m], \atop k\neq s}\sum\limits_{j, l\in[n], \atop j\neq l}|a_{sjkl}||y_{j}||x_{k}||y_{l}|+\sum\limits_{k\in[m], \atop k\neq s}\sum\limits_{l\in[n]}|a_{slkl}||y_{l}||x_{k}||y_{l}| +\sum\limits_{j, l\in[n], \atop j\neq l}|a_{sjsl}||y_{j}||x_{s}||y_{l}|+\\ &\sum\limits_{l\in[n]}|a_{slsl}||y_{l}||x_{s}||y_{l}|\\ \leq&\frac{1}{2}\sum\limits_{k\in[m], \atop k\neq s}\sum\limits_{j, l\in[n], \atop j\neq l}|a_{sjkl}||x_{t}|+\sum\limits_{k\in[m], \atop k\neq s}\sum\limits_{l\in[n]}|a_{slkl}|| y_{l}||x_{t}| +\frac{1}{2}\sum\limits_{j, l\in[n], \atop j\neq l}|a_{sjsl}||x_{s}|+\sum\limits_{l\in[n]}|a_{slsl}||y_{l}||x_{s}|\\ = &\frac{1}{2}\sum\limits_{k\in[m], \atop k\neq s}\sum\limits_{j, l\in[n], \atop j\neq l}|a_{sjkl}||x_{t}|+|x_{t}|\sum\limits_{k\in[m], \atop k\neq s}\Bigg(\sum\limits_{l\in[n]}| a_{slkl}||y_{l}|\Bigg)+\frac{1}{2}\sum\limits_{j, l\in[n], \atop j\neq l}|a_{sjsl}||x_{s}|+|x_{s}|\sum\limits_{l\in[n]}|a_{slsl}||y_{l}|\\ \leq&\frac{1}{2}\sum\limits_{k\in[m], \atop k\neq s}\sum\limits_{j, l\in[n], \atop j\neq l}|a_{sjkl}||x_{t}|+|x_{t}|\sum\limits_{k\in[m], \atop k\neq s}\Bigg(\sqrt{\sum\limits_{l\in[n]}|a_{slkl}|^{2}}\sqrt{\sum\limits_{l\in[n]}|y_{l}|^{2}}\Bigg)\\ &+\frac{1}{2}\sum\limits_{j, l\in[n], \atop j\neq l}|a_{sjsl}||x_{s}|+|x_{s}|\sqrt{\sum\limits_{l\in[n]}|a_{slsl}|^{2}}\sqrt{\sum\limits_{l\in[n]}|y_{l}|^{2}}\\ = &\frac{1}{2}\sum\limits_{k\in[m], \atop k\neq s}\sum\limits_{j, l\in[n], \atop j\neq l}|a_{sjkl}||x_{t}|+|x_{t}|\sum\limits_{k\in[m], \atop k\neq s}\sqrt{\sum\limits_{l\in[n]}a_{slkl}^{2}} +\frac{1}{2}\sum\limits_{j, l\in[n], \atop j\neq l}|a_{sjsl}||x_{s}|+|x_{s}|\sqrt{\sum\limits_{l\in[n]} a_{slsl}^{2}}\\ = &\Bigg(\frac{1}{2}\sum\limits_{k\in[m], \atop k\neq s}\sum\limits_{j, l\in[n], \atop j\neq l}|a_{sjkl}|+\sum\limits_{k\in[m], \atop k\neq s}\sqrt{\sum\limits_{l\in[n]}a_{slkl}^{2}}\Bigg)|x_{t}| +\Bigg(\frac{1}{2}\sum\limits_{j, l\in[n], \atop j\neq 
l}|a_{sjsl}|+\sqrt{\sum\limits_{l\in[n]}a_{slsl}^{2}}\Bigg)|x_{s}|\\ = &\widetilde{r}_{s}^{s}(\mathcal{A})|x_{t}|+r_{s}^{s}(\mathcal{A})|x_{s}|, \end{align*}$

i.e.,

$\begin{eqnarray} (|\lambda|-r_{s}^{s}(\mathcal{A}))|x_{s}|\leq\widetilde{r}_{s}^{s}(\mathcal{A})|x_{t}|. \end{eqnarray}$

(2.2)

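When $|\lambda|\geq \widetilde{r}_{t}^{s}(\mathcal{A})$ or $|\lambda|\geq r_{s}^{s}(\mathcal{A})$, we distinguish two cases. If both inequalities hold, then both sides of (2.1) and (2.2) are nonnegative, so multiplying (2.1) and (2.2) and dividing by $|x_{t}||x_{s}| > 0$ gives the inequality below. If exactly one of them holds, then the left-hand side of the inequality below is nonpositive while its right-hand side is nonnegative. In either case, we have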

$\begin{eqnarray} (|\lambda|-\widetilde{r}_{t}^{s}(\mathcal{A}))(|\lambda|-r_{s}^{s}(\mathcal{A}))\leq r_{t}^{s}(\mathcal{A})\widetilde{r}_{s}^{s}(\mathcal{A}), \end{eqnarray}$

(2.3)

which, together with $\lambda\in\overline{\Upsilon}_{t, s}(\mathcal{A})$ established above, implies that

$\begin{eqnarray} \lambda\in(\widetilde{\Upsilon}_{t, s}(\mathcal{A})\cap\overline{\Upsilon}_{t, s}(\mathcal{A})). \end{eqnarray}$

(2.4)

When $|\lambda| < \widetilde{r}_{t}^{s}(\mathcal{A})$ and $|\lambda| < r_{s}^{s}(\mathcal{A})$ , it holds that

$\begin{eqnarray} \lambda\in\widehat{\Upsilon}_{t, s}(\mathcal{A}). \end{eqnarray}$

(2.5)

It follows from (2.4) and (2.5) that

$\begin{eqnarray} \lambda\in\Big[\widehat{\Upsilon}_{t, s}(\mathcal{A})\cup(\widetilde{\Upsilon}_{t, s}(\mathcal{A})\cap\overline{\Upsilon}_{t, s}(\mathcal{A}))\Big] = \Upsilon_{t, s}(\mathcal{A}). \end{eqnarray}$

(2.6)

If $|x_s| = 0$ in (2.1), then $|\lambda|\leq \widetilde{r}_{t}^{s}(\mathcal{A})$. When $|\lambda| = \widetilde{r}_{t}^{s}(\mathcal{A})$, (2.3) holds and, consequently, (2.4) holds. When $|\lambda| < \widetilde{r}_{t}^{s}(\mathcal{A})$, if $|\lambda|\geq r_{s}^{s}(\mathcal{A})$, then (2.3) and (2.4) hold; if $|\lambda| < r_{s}^{s}(\mathcal{A})$, then (2.5) holds. Hence, (2.6) holds. Since $s\in[m]$ with $s\neq t$ is arbitrary, we have

$\begin{eqnarray*} \lambda\in\bigcap\limits_{s\in[m], s\neq t}\Upsilon_{t, s}(\mathcal{A})\subseteq\bigcup\limits_{t\in[m]}\bigcap\limits_{s\in[m], s\neq t}\Upsilon_{t, s}(\mathcal{A}). \end{eqnarray*}$

Therefore, the assertion is proved.

Next, we give the relationship between the localization set $\mathcal{H}(\mathcal{A})$ given in Theorem 1.1 and the set $\Upsilon(\mathcal{A})$ given in Theorem 2.1.

Theorem 2.2. Let $\mathcal{A} = (a_{ijkl})\in\mathbb{R}^{[m]\times [n]\times [m]\times [n]}$ be a partially symmetric tensor. Then

$\begin{eqnarray*} \Upsilon(\mathcal{A})\subseteq\mathcal{H}(\mathcal{A}). \end{eqnarray*}$

Proof. For any $i, s\in[m]$ and $i\neq s$ , it holds that

$\begin{eqnarray} \widetilde{r}_{i}^{s}(\mathcal{A}) = \frac{1}{2}\sum\limits_{k\in[m], \atop k\neq s}\sum\limits_{j, l\in[n], \atop j\neq l}|a_{ijkl}|+\sum\limits_{k\in[m], \atop k\neq s}\sqrt{\sum\limits_{l\in[n]} a_{ilkl}^{2}}\leq\sum\limits_{k\in[m], \atop k\neq s}\sum\limits_{j, l\in[n]}| a_{ijkl}| = R_{i}(\mathcal{A})-R_{i}^{s}(\mathcal{A}); \end{eqnarray}$

(2.7)

and

$\begin{eqnarray} r_{i}^{s}(\mathcal{A}) = \frac{1}{2}\sum\limits_{j, l\in[n], \atop j\neq l}|a_{ijsl}|+\sqrt{\sum\limits_{l\in[n]}a_{ilsl}^{2}}\leq\sum\limits_{j, l\in[n]}| a_{ijsl}| = R_{i}^{s}(\mathcal{A}). \end{eqnarray}$

(2.8)

Let $z\in\Upsilon(\mathcal{A})$ . By Theorem 2.1, there is an index $i\in [m]$ such that for any $s\in[m]$ , $i\neq s$ , $z\in\Upsilon_{i, s}(\mathcal{A})$ , which means that $z\in\widehat{\Upsilon}_{i, s}(\mathcal{A})$ , or $z\in\widetilde{\Upsilon}_{i, s}(\mathcal{A})$ and $z\in\overline{\Upsilon}_{i, s}(\mathcal{A})$ .

Let $z\in\widehat{\Upsilon}_{i, s}(\mathcal{A})$ , i.e., $|z| < \widetilde{r}_{i}^{s}(\mathcal{A})$ and $|z| < r_{s}^{s}(\mathcal{A})$ . By (2.7) and (2.8), we have $|z|\leq R_{i}(\mathcal{A})-R_{i}^{s}(\mathcal{A})$ and $|z|\leq R_{s}^{s}(\mathcal{A})$ , therefore, $z\in\widehat{\mathcal{H}}_{i, s}(\mathcal{A})$ .

Let $z\in\widetilde{\Upsilon}_{i, s}(\mathcal{A})$ and $z\in\overline{\Upsilon}_{i, s}(\mathcal{A})$ , i.e.,

$\begin{eqnarray} (|z|-\widetilde{r}_{i}^{s}(\mathcal{A}))(|z|-r_{s}^{s}(\mathcal{A}))\leq r_{i}^{s}(\mathcal{A})\widetilde{r}_{s}^{s}(\mathcal{A}), \end{eqnarray}$

(2.9)

and

$\begin{eqnarray} |z| \leq \widetilde{r}_{i}^{s}(\mathcal{A})+r_{i}^{s}(\mathcal{A}). \end{eqnarray}$

(2.10)

By (2.7), (2.8) and (2.10), one has $|z| \leq \widetilde{r}_{i}^{s}(\mathcal{A})+r_{i}^{s}(\mathcal{A})\leq R_{i}(\mathcal{A})$, which means that $z\in\Gamma_{i}(\mathcal{A})$. When $|z|\geq R_{i}(\mathcal{A})-R_{i}^{s}(\mathcal{A})$ and $|z|\geq R_{s}^{s}(\mathcal{A})$, by (2.7), (2.8) and (2.9), we have

$\begin{eqnarray*} |z|-\widetilde{r}_{i}^{s}(\mathcal{A})\geq|z|-(R_{i}(\mathcal{A})-R_{i}^{s}(\mathcal{A}))\geq0, \; |z|-r_{s}^{s}(\mathcal{A})\geq|z|-R_{s}^{s}(\mathcal{A})\geq0, \end{eqnarray*}$

then

$\begin{eqnarray*} (|z|-(R_{i}(\mathcal{A})-R_{i}^{s}(\mathcal{A})))(|z|-R_{s}^{s}(\mathcal{A}))&\leq& (|z|-\widetilde{r}_{i}^{s}(\mathcal{A}))(|z|-r_{s}^{s}(\mathcal{A}))\\ &\leq&r_{i}^{s}(\mathcal{A})\widetilde{r}_{s}^{s}(\mathcal{A})\leq R_{i}^{s}(\mathcal{A})(R_{s}(\mathcal{A})-R_{s}^{s}(\mathcal{A})), \end{eqnarray*}$

i.e.,

$\begin{eqnarray} (|z|-(R_{i}(\mathcal{A})-R_{i}^{s}(\mathcal{A})))(|z|-R_{s}^{s}(\mathcal{A}))\leq R_{i}^{s}(\mathcal{A})(R_{s}(\mathcal{A})-R_{s}^{s}(\mathcal{A})), \end{eqnarray}$

(2.11)

which means that $z\in\overline{\mathcal{H}}_{i, s}(\mathcal{A})$. When $R_{i}(\mathcal{A})-R_{i}^{s}(\mathcal{A})\leq |z|\leq R_{s}^{s}(\mathcal{A})$ or $R_{s}^{s}(\mathcal{A})\leq |z|\leq R_{i}(\mathcal{A})-R_{i}^{s}(\mathcal{A})$, the left-hand side of (2.11) is nonpositive while its right-hand side is nonnegative, so (2.11) also holds. When $|z|\leq R_{i}(\mathcal{A})-R_{i}^{s}(\mathcal{A})$ and $|z|\leq R_{s}^{s}(\mathcal{A})$, it follows that $z\in\widehat{\mathcal{H}}_{i, s}(\mathcal{A})$. In every case,

$\begin{eqnarray*} z\in\Big[\widehat{\mathcal{H}}_{i, s}(\mathcal{A})\cup(\overline{\mathcal{H}}_{i, s}(\mathcal{A})\cap\Gamma_{i}(\mathcal{A}))\Big] = \mathcal{H}_{i, s}(\mathcal{A}). \end{eqnarray*}$

Since $s\in[m]$ with $s\neq i$ is arbitrary, we have

$\begin{eqnarray*} z\in\bigcap\limits_{s\in[m], s\neq i}\mathcal{H}_{i, s}(\mathcal{A})\subseteq\bigcup\limits_{i\in[m]}\bigcap\limits_{s\in[m], s\neq i}\mathcal{H}_{i, s}(\mathcal{A}), \end{eqnarray*}$

i.e., $z\in\mathcal{H}(\mathcal{A})$ . Therefore, $\Upsilon(\mathcal{A})\subseteq\mathcal{H}(\mathcal{A})$ .

In order to show the validity of the set $\Upsilon(\mathcal{A})$ given in Theorem 2.1, we present a running example.

Example 1. Let $\mathcal{A} = (a_{ijkl})\in\mathbb{R}^{[2]\times [2]\times [2]\times [2]}$ be a partially symmetric tensor with entries

$\begin{align*} a_{1111}& = 1, \; a_{1112} = 2, \; a_{1121} = 2, \; a_{1212} = 3, \\ a_{1222}& = 5, \; a_{1211} = 2, \; a_{1122} = 4, \; a_{1221} = 4, \\ a_{2111}& = 2, \; a_{2112} = 4, \; a_{2121} = 3, \; a_{2122} = 5, \\ a_{2211}& = 4, \; a_{2212} = 5, \; a_{2221} = 5, \; a_{2222} = 6. \end{align*}$

By Theorem 1.1, we have

$\begin{eqnarray*} \mathcal{H}(\mathcal{A}) = \bigcup\limits_{i\in[m]}\bigcap\limits_{k\in[m], k\neq i}\mathcal{H}_{i, k}(\mathcal{A}) = \{z\in\mathbb{C}: |z|\leq 29.4765\}. \end{eqnarray*}$

By Theorem 2.1, we have

$\begin{eqnarray*} \Upsilon(\mathcal{A}) = \bigcup\limits_{i\in[m]}\bigcap\limits_{s\in[m], s\neq i}\Upsilon_{i, s}(\mathcal{A}) = \{z\in\mathbb{C}: |z|\leq 20.0035\}. \end{eqnarray*}$

It is easy to see that $\Upsilon(\mathcal{A})\subseteq\mathcal{H}(\mathcal{A})$ and that all M-eigenvalues lie in $[-20.0035, 20.0035]$. In fact, the distinct M-eigenvalues of $\mathcal{A}$ are $-1.2765$, 0.0710, 0.1242, 0.2765, 0.3437 and 15.2091.
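The radius 29.4765 reported for $\mathcal{H}(\mathcal{A})$ in Example 1 can be reproduced with a short computation. The sketch below is an illustration under stated assumptions: the closed form used for the largest $|z|$ in each $\mathcal{H}_{i, k}(\mathcal{A})$ (the larger root of the quadratic in $\overline{\mathcal{H}}_{i, k}$, capped by $R_{i}(\mathcal{A})$) is read off from the set definitions in Theorem 1.1 rather than quoted from ^[23], it takes $\Gamma_{i}(\mathcal{A})$ to be the disc $|z|\leq R_{i}(\mathcal{A})$ as used in the proof of Theorem 2.2, and the function names are hypothetical.

```python
from math import sqrt

def R_full(a, i, m, n):
    """R_i(A): sum of |a_ijkl| over k in [m] and j, l in [n]."""
    return sum(abs(a[(i, j, k, l)])
               for k in range(1, m + 1)
               for j in range(1, n + 1)
               for l in range(1, n + 1))

def R_part(a, i, k, n):
    """R_i^k(A): sum of |a_ijkl| over j, l in [n] for a fixed k."""
    return sum(abs(a[(i, j, k, l)])
               for j in range(1, n + 1)
               for l in range(1, n + 1))

def h_radius(a, m, n):
    """Upper bound on |z| over H(A), read off from the sets in Theorem 1.1."""
    outer = []
    for i in range(1, m + 1):
        inner = []
        for k in range(1, m + 1):
            if k == i:
                continue
            p = R_full(a, i, m, n) - R_part(a, i, k, n)   # R_i - R_i^k
            q = R_part(a, k, k, n)                        # R_k^k
            c = R_part(a, i, k, n) * (R_full(a, k, m, n) - q)
            root = 0.5 * (p + q + sqrt((p - q) ** 2 + 4.0 * c))
            hat = min(p, q)                        # largest |z| in H-hat_{i,k}
            bar = min(R_full(a, i, m, n), root)    # largest |z| in H-bar_{i,k} within Gamma_i
            inner.append(max(hat, bar))
        outer.append(min(inner))   # intersection over k != i
    return max(outer)              # union over i

# Entries of the 2 x 2 x 2 x 2 tensor used in Example 1.
example1 = {
    (1, 1, 1, 1): 1, (1, 1, 1, 2): 2, (1, 1, 2, 1): 2, (1, 2, 1, 2): 3,
    (1, 2, 2, 2): 5, (1, 2, 1, 1): 2, (1, 1, 2, 2): 4, (1, 2, 2, 1): 4,
    (2, 1, 1, 1): 2, (2, 1, 1, 2): 4, (2, 1, 2, 1): 3, (2, 1, 2, 2): 5,
    (2, 2, 1, 1): 4, (2, 2, 1, 2): 5, (2, 2, 2, 1): 5, (2, 2, 2, 2): 6,
}
print(round(h_radius(example1, 2, 2), 4))  # 29.4765
```

Here $R_{1}(\mathcal{A}) = 23$, $R_{2}(\mathcal{A}) = 34$, $R_{1}^{1}(\mathcal{A}) = 8$, $R_{1}^{2}(\mathcal{A}) = R_{2}^{1}(\mathcal{A}) = 15$ and $R_{2}^{2}(\mathcal{A}) = 19$, so the pair $(i, k) = (2, 1)$ gives $\frac{1}{2}(27+\sqrt{1021})\approx 29.4765$.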

3. A sharp upper bound for the M-spectral radius of a partially symmetric tensor

In this section, based on the set in Theorem 2.1, we provide an upper bound for the largest M-eigenvalue of a fourth-order partially symmetric tensor $\mathcal{A}$, denoted by $\rho(\mathcal{A})$ in what follows. As an application, we take the upper bound as the parameter $\tau$ in the WQZ-algorithm so that the sequence generated by the WQZ-algorithm converges to the largest M-eigenvalue of $\mathcal{A}$ faster.

Theorem 3.1. Let $\mathcal{A} = (a_{ijkl})\in\mathbb{R}^{[m]\times [n]\times [m]\times [n]}$ be a partially symmetric tensor. Then

$\begin{eqnarray*} \rho(\mathcal{A})\leq\Omega(\mathcal{A}) = \max\limits_{i\in[m]}\min\limits_{s\in[m], i\neq s}\Omega_{i, s}(\mathcal{A}), \end{eqnarray*}$

where

$\begin{eqnarray*} \Omega_{i, s}(\mathcal{A}) = \max\Big\{\min\{\widetilde{r}_{i}^{s}(\mathcal{A}), r_{s}^{s}(\mathcal{A})\}, \min\{\widetilde{r}_{i}^{s}(\mathcal{A})+r_{i}^{s}(\mathcal{A}), \widehat{\Omega}_{i, s}(\mathcal{A})\}\Big\}, \end{eqnarray*}$

and

$\begin{eqnarray*} \widehat{\Omega}_{i, s}(\mathcal{A}) = \frac{1}{2}\Bigg\{\widetilde{r}_{i}^{s}(\mathcal{A})+r_{s}^{s}(\mathcal{A})+\sqrt{(r_{s}^{s}(\mathcal{A})-\widetilde{r}_{i}^{s}(\mathcal{A}))^2+ 4r_{i}^{s}(\mathcal{A})\widetilde{r}_{s}^{s}(\mathcal{A})}\Bigg\}. \end{eqnarray*}$

Proof. By Theorem 2.1 and $\rho(\mathcal{A})\in \sigma(\mathcal{A})$ , it follows that there exists an index $i\in[m]$ such that for any $s\in[m]$ and $s\neq i$ , $\rho(\mathcal{A})\in\widehat{\Upsilon}_{i, s}(\mathcal{A})$ , or $\rho(\mathcal{A})\in(\widetilde{\Upsilon}_{i, s}(\mathcal{A})\cap\overline{\Upsilon}_{i, s}(\mathcal{A}))$ . If $\rho(\mathcal{A})\in\widehat{\Upsilon}_{i, s}(\mathcal{A})$ , that is, $\rho(\mathcal{A}) < \widetilde{r}_{i}^{s}(\mathcal{A})$ and $\rho(\mathcal{A}) < r_{s}^{s}(\mathcal{A})$ , then

$\begin{eqnarray} \rho(\mathcal{A}) < \min\{\widetilde{r}_{i}^{s}(\mathcal{A}), r_{s}^{s}(\mathcal{A})\}. \end{eqnarray}$

(3.1)

If $\rho(\mathcal{A})\in(\widetilde{\Upsilon}_{i, s}(\mathcal{A})\cap\overline{\Upsilon}_{i, s}(\mathcal{A}))$ , that is,

$\begin{eqnarray} \rho(\mathcal{A}) \leq \widetilde{r}_{i}^{s}(\mathcal{A})+r_{i}^{s}(\mathcal{A}), \end{eqnarray}$

(3.2)

and

$\begin{eqnarray} (\rho(\mathcal{A})-\widetilde{r}_{i}^{s}(\mathcal{A}))(\rho(\mathcal{A})- r_{s}^{s}(\mathcal{A}))\leq r_{i}^{s}(\mathcal{A})\widetilde{r}_{s}^{s}(\mathcal{A}). \end{eqnarray}$

(3.3)

Solving Inequality (3.3), we have

$\begin{eqnarray} \rho(\mathcal{A})\leq\widehat{\Omega}_{i, s}(\mathcal{A}). \end{eqnarray}$

(3.4)
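For completeness, the step from (3.3) to (3.4) can be spelled out as follows, writing $a = \widetilde{r}_{i}^{s}(\mathcal{A})$, $b = r_{s}^{s}(\mathcal{A})$ and $c = r_{i}^{s}(\mathcal{A})\widetilde{r}_{s}^{s}(\mathcal{A})\geq 0$ (shorthand introduced only for this remark). Inequality (3.3) is the quadratic inequality

$\begin{eqnarray*} \rho(\mathcal{A})^{2}-(a+b)\rho(\mathcal{A})+ab-c\leq 0, \end{eqnarray*}$

whose discriminant is $(a+b)^{2}-4(ab-c) = (b-a)^{2}+4c\geq 0$. Hence $\rho(\mathcal{A})$ lies between the two real roots of the quadratic, and in particular

$\begin{eqnarray*} \rho(\mathcal{A})\leq\frac{1}{2}\Big\{a+b+\sqrt{(b-a)^{2}+4c}\Big\} = \widehat{\Omega}_{i, s}(\mathcal{A}). \end{eqnarray*}$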

Combining (3.2) and (3.4), we have

$\begin{eqnarray} \rho(\mathcal{A})\leq\min\{\widetilde{r}_{i}^{s}(\mathcal{A})+r_{i}^{s}(\mathcal{A}), \widehat{\Omega}_{i, s}(\mathcal{A})\}. \end{eqnarray}$

(3.5)

Hence, by (3.1) and (3.5), we have

$\begin{eqnarray*} \rho(\mathcal{A})\leq\max\Big\{\min\{\widetilde{r}_{i}^{s}(\mathcal{A}), r_{s}^{s}(\mathcal{A})\}, \min\{\widetilde{r}_{i}^{s}(\mathcal{A})+r_{i}^{s}(\mathcal{A}), \widehat{\Omega}_{i, s}(\mathcal{A})\}\Big\} = \Omega_{i, s}(\mathcal{A}). \end{eqnarray*}$

Furthermore, by the arbitrariness of $s$ , we have

$\begin{eqnarray*} \rho(\mathcal{A})\leq\min\limits_{s\in[m], i\neq s}\Omega_{i, s}(\mathcal{A}). \end{eqnarray*}$

Since we do not know which index $i$ corresponds to $\rho(\mathcal{A})$, we can only conclude that

$\begin{eqnarray*} \rho(\mathcal{A})\leq\max\limits_{i\in[m]}\min\limits_{s\in[m], i\neq s}\Omega_{i, s}(\mathcal{A}). \end{eqnarray*}$

The proof is complete.
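The bound of Theorem 3.1 can be evaluated by a direct transcription of its formulas. The sketch below is illustrative only: the dict-based storage with 1-based keys `(i, j, k, l)` and the function names are assumptions, and the test tensor is the identity tensor $\mathcal{I}$ ($\delta_{ijkl} = 1$ if $i = k$ and $j = l$), chosen because its bound can be checked by hand: every M-eigenvalue of $\mathcal{I}$ equals 1, and the formulas give $\Omega(\mathcal{I}) = \sqrt{2}$ for $m = n = 2$.

```python
from itertools import product
from math import sqrt

def r_tilde(a, t, s, m, n):
    """r~_t^s(A) from Theorem 2.1 (sum over all k != s)."""
    total = 0.0
    for k in range(1, m + 1):
        if k == s:
            continue
        total += 0.5 * sum(abs(a[(t, j, k, l)])
                           for j in range(1, n + 1)
                           for l in range(1, n + 1) if j != l)
        total += sqrt(sum(a[(t, l, k, l)] ** 2 for l in range(1, n + 1)))
    return total

def r(a, t, s, n):
    """r_t^s(A) from Theorem 2.1 (the single index k = s)."""
    off_diagonal = 0.5 * sum(abs(a[(t, j, s, l)])
                             for j in range(1, n + 1)
                             for l in range(1, n + 1) if j != l)
    return off_diagonal + sqrt(sum(a[(t, l, s, l)] ** 2 for l in range(1, n + 1)))

def omega(a, m, n):
    """Omega(A) = max over i of min over s != i of Omega_{i,s}(A)."""
    outer = []
    for i in range(1, m + 1):
        inner = []
        for s in range(1, m + 1):
            if s == i:
                continue
            rt_is, r_ss = r_tilde(a, i, s, m, n), r(a, s, s, n)
            r_is, rt_ss = r(a, i, s, n), r_tilde(a, s, s, m, n)
            omega_hat = 0.5 * (rt_is + r_ss
                               + sqrt((r_ss - rt_is) ** 2 + 4.0 * r_is * rt_ss))
            inner.append(max(min(rt_is, r_ss), min(rt_is + r_is, omega_hat)))
        outer.append(min(inner))
    return max(outer)

# Illustrative data (not one of the paper's examples): the identity tensor.
m, n = 2, 2
identity = {(i, j, k, l): float(i == k and j == l)
            for i, j, k, l in product(range(1, m + 1), range(1, n + 1),
                                      range(1, m + 1), range(1, n + 1))}
bound = omega(identity, m, n)
print(bound)  # sqrt(2), an upper bound for the M-eigenvalue 1
```

The value returned by `omega` is the quantity that Remark 3.1 proposes to use as the parameter $\tau$.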

Remark 3.1. In Theorem 3.1, we obtain an upper bound $\Omega(\mathcal{A})$ for the largest M-eigenvalue of a fourth-order partially symmetric tensor $\mathcal{A}$. Now, we take $\Omega(\mathcal{A})$ as the parameter $\tau$ in the WQZ-algorithm to obtain a modified WQZ-algorithm. That is, the only difference between the WQZ-algorithm and the modified WQZ-algorithm is the selection of $\tau$: $\tau = \sum\limits_{1\leq s\leq t\leq mn}|A_{st}|$ in the WQZ-algorithm and $\tau = \Omega(\mathcal{A})$ in the modified WQZ-algorithm.

Next, we take $\Omega(\mathcal{A})$ and some existing upper bounds of the largest M-eigenvalue as $\tau$ in WQZ-algorithm to calculate the largest M-eigenvalue of a fourth-order partially symmetric tensor $\mathcal{A}$ .

Example 2. Consider the tensor $\mathcal{A}$ in Example 4.1 of ^[24], where

$\mathcal{A}(:, :, 1, 1) = \left[\begin{array}{ccc} -0.9727&0.3169&-0.3437\\-0.6332&-0.7866&0.4257\\-0.3350&-0.9896&-0.4323\\ \end{array}\right],$

$\mathcal{A}(:, :, 2, 1) = \left[\begin{array}{ccc} -0.6332&-0.7866&0.4257\\0.7387&0.6873&-0.3248\\-0.7986&-0.5988&-0.9485\\ \end{array}\right],$

$\mathcal{A}(:, :, 3, 1) = \left[\begin{array}{ccc} -0.3350&-0.9896&-0.4323\\-0.7986&-0.5988&-0.9485\\0.5853&0.5921&0.6301\\ \end{array}\right],$

$\mathcal{A}(:, :, 1, 2) = \left[\begin{array}{ccc} 0.3169&0.6158&-0.0184\\-0.7866&0.0160&0.0085\\-0.9896&-0.6663&0.2559\\ \end{array}\right],$

$\mathcal{A}(:, :, 2, 2) = \left[\begin{array}{ccc} -0.7866&0.0160&0.0085\\0.6873&0.5160&-0.0216\\-0.5988&0.0411&0.9857\\ \end{array}\right],$

$\mathcal{A}(:, :, 3, 2) = \left[\begin{array}{ccc} -0.9896&-0.6663&0.2559\\-0.5988&0.0411&0.9857\\0.5921&-0.2907&-0.3881\\ \end{array}\right],$

$\mathcal{A}(:, :, 1, 3) = \left[\begin{array}{ccc} -0.3437&-0.0184&0.5649\\0.4257&0.0085&-0.1439\\-0.4323&0.2559&0.6162\\ \end{array}\right],$

$\mathcal{A}(:, :, 2, 3) = \left[\begin{array}{ccc} 0.4257&0.0085&-0.1439\\-0.3248&-0.0216&-0.0037\\-0.9485&0.9857&-0.7734\\ \end{array}\right],$

$\mathcal{A}(:, :, 3, 3) = \left[\begin{array}{ccc} -0.4323&0.2559&0.6162\\-0.9485&0.9857&-0.7734\\0.6301&-0.3881&-0.8526\\ \end{array}\right].$

By (1.3), we have $\tau = \sum\limits_{1\leq s\leq t\leq 9}|A_{st}| = 23.3503.$ By Corollary 1 of ^[17], we have

$\begin{eqnarray*} \rho(\mathcal{A})\leq 16.6014. \end{eqnarray*}$

By Theorem 3.5 of ^[23], we have

$\begin{eqnarray*} \rho(\mathcal{A})\leq 15.4102. \end{eqnarray*}$

By Corollary 2 of ^[17], we have

$\begin{eqnarray*} \rho(\mathcal{A})\leq 14.5910. \end{eqnarray*}$

By Corollary 1 of ^[15], where $S_{m} = S_{n} = 1$ , we have

$\begin{eqnarray*} \rho(\mathcal{A})\leq 13.8844. \end{eqnarray*}$

By Corollary 2 of ^[15], where $S_{m} = S_{n} = 1$ , we have

$\begin{eqnarray*} \rho(\mathcal{A})\leq 11.7253. \end{eqnarray*}$

By Theorem 3.1, we have

$\begin{eqnarray*} \rho(\mathcal{A})\leq 8.2342. \end{eqnarray*}$

From ^[24], it can be seen that $\lambda_{\max}(\mathcal{A}) = 2.3227$ .
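The value $\tau = 23.3503$ quoted for (1.3) can be reproduced from the slices listed in Example 2. The sketch below assumes that the unfolded matrix $A$ has its rows indexed by the pair $(i, j)$ and its columns by $(k, l)$. This flattening convention and the helper names `build_tensor` and `tau_from_unfolding` are assumptions, since (1.3) is not restated in this section.

```python
import numpy as np

# Slices A(:, :, k, l) of Example 2, keyed by the 1-based pair (k, l).
slices = {
    (1, 1): [[-0.9727, 0.3169, -0.3437], [-0.6332, -0.7866, 0.4257], [-0.3350, -0.9896, -0.4323]],
    (2, 1): [[-0.6332, -0.7866, 0.4257], [0.7387, 0.6873, -0.3248], [-0.7986, -0.5988, -0.9485]],
    (3, 1): [[-0.3350, -0.9896, -0.4323], [-0.7986, -0.5988, -0.9485], [0.5853, 0.5921, 0.6301]],
    (1, 2): [[0.3169, 0.6158, -0.0184], [-0.7866, 0.0160, 0.0085], [-0.9896, -0.6663, 0.2559]],
    (2, 2): [[-0.7866, 0.0160, 0.0085], [0.6873, 0.5160, -0.0216], [-0.5988, 0.0411, 0.9857]],
    (3, 2): [[-0.9896, -0.6663, 0.2559], [-0.5988, 0.0411, 0.9857], [0.5921, -0.2907, -0.3881]],
    (1, 3): [[-0.3437, -0.0184, 0.5649], [0.4257, 0.0085, -0.1439], [-0.4323, 0.2559, 0.6162]],
    (2, 3): [[0.4257, 0.0085, -0.1439], [-0.3248, -0.0216, -0.0037], [-0.9485, 0.9857, -0.7734]],
    (3, 3): [[-0.4323, 0.2559, 0.6162], [-0.9485, 0.9857, -0.7734], [0.6301, -0.3881, -0.8526]],
}

def build_tensor(slices, m=3, n=3):
    """Assemble T[i, j, k, l] (0-based) from the slices A(:, :, k, l)."""
    T = np.zeros((m, n, m, n))
    for (k, l), block in slices.items():
        T[:, :, k - 1, l - 1] = np.array(block)
    return T

def tau_from_unfolding(T):
    """Sum |A_st| over s <= t, with A the (mn x mn) unfolding of T.

    Assumed convention: row index s <-> (i, j), column index t <-> (k, l).
    """
    m, n = T.shape[0], T.shape[1]
    M = T.reshape(m * n, m * n)
    return float(np.abs(np.triu(M)).sum())

T = build_tensor(slices)
tau = tau_from_unfolding(T)
print(f"{tau:.4f}")  # 23.3503, the value quoted for (1.3)
```

With this data the unfolded matrix is symmetric, so the result does not change if the pairs $(i, j)$ and $(k, l)$ are exchanged.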

Taking $\tau = 23.3503$ , 16.6014, 15.4102, 14.5910, 13.8844, 11.7253 and 8.2342 respectively, numerical results obtained by the WQZ-algorithm are shown in Figure 1.

Figure 1. Numerical results for the WQZ-algorithm with different $\tau$.

The numerical results in Figure 1 show that:

1) With $\tau = 8.2342$, the sequence converges to the largest M-eigenvalue $\lambda_{\max}(\mathcal{A})$ more rapidly than with $\tau = 23.3503$, 16.6014, 15.4102, 14.5910, 13.8844 or 11.7253.

2) For every tested value $\tau = 23.3503$, 16.6014, 15.4102, 14.5910, 13.8844, 11.7253 and 8.2342, the WQZ-algorithm obtains the largest M-eigenvalue $\lambda_{\max}(\mathcal{A})$ after finitely many iterations. Under the same stopping criterion, however, the values $\tau = 23.3503$, 16.6014, 15.4102, 14.5910, 13.8844 and 11.7253 require more iterations than $\tau = 8.2342$.

3) The choice of the parameter $\tau$ has a significant impact on the convergence speed of the WQZ-algorithm. The larger $\tau$ is, the slower the algorithm converges. The smaller $\tau$ is, provided it remains greater than the largest M-eigenvalue, the faster the largest M-eigenvalue is obtained.

4) The upper bound of the M-spectral radius given by Theorem 3.1 is therefore of practical help to the WQZ-algorithm: among the tested values of $\tau$, it leads to the fastest convergence.
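The dependence on $\tau$ can be illustrated with a shifted alternating power iteration. The sketch below is written in the spirit of the WQZ-algorithm but is not taken from this paper: the update rule, the starting point, the stopping rule and the function names are all assumptions. It treats $x$ as acting on the first and third indices and $y$ on the second and fourth, as in the slices of Example 2, and it is run on a hypothetical rank-one tensor whose largest M-eigenvalue is 1 by construction.

```python
import numpy as np

def m_value(T, x, y):
    """Evaluate f(x, y) = sum_{ijkl} a_ijkl x_i y_j x_k y_l."""
    return float(np.einsum("ijkl,i,j,k,l->", T, x, y, x, y))

def shifted_power_iteration(T, tau, x0, y0, tol=1e-10, max_iter=10000):
    """Alternate normalized updates of x and y using the shift tau.

    Returns (lambda, x, y, iterations). Assumed form, for illustration only.
    """
    x = x0 / np.linalg.norm(x0)
    y = y0 / np.linalg.norm(y0)
    lam = m_value(T, x, y)
    for k in range(1, max_iter + 1):
        # x-update: tau * x + A . y x y (y has unit norm)
        x_new = tau * x + np.einsum("ijkl,j,k,l->i", T, y, x, y)
        x = x_new / np.linalg.norm(x_new)
        # y-update: tau * y + A x . x y (x has unit norm)
        y_new = tau * y + np.einsum("ijkl,i,k,l->j", T, x, x, y)
        y = y_new / np.linalg.norm(y_new)
        lam_new = m_value(T, x, y)
        if abs(lam_new - lam) < tol:
            return lam_new, x, y, k
        lam = lam_new
    return lam, x, y, max_iter

# Hypothetical test tensor a_ijkl = u_i v_j u_k v_l with unit vectors u, v.
u = np.array([1.0, 2.0, 2.0]) / 3.0
v = np.array([2.0, 1.0, 2.0]) / 3.0
T_demo = np.einsum("i,j,k,l->ijkl", u, v, u, v)
start = np.ones(3)

lam_small, _, _, iters_small = shifted_power_iteration(T_demo, 1.0, start, start)
lam_large, _, _, iters_large = shifted_power_iteration(T_demo, 10.0, start, start)
print(lam_small, iters_small)
print(lam_large, iters_large)
```

In this toy case the shift multiplies the unwanted component of $x$ by $\tau$ and the wanted component by roughly $\tau + 1$, so a larger shift brings the two factors closer together and more iterations are needed, which matches observation 3) above.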

Now, we consider a real elasticity tensor, which arises in the study of anisotropic materials ^[10].

In anisotropic materials, the components of the tensor of elastic moduli $\mathcal{C} = (c_{ijkl})\in\mathbb{R}^{[3]\times [3]\times [3]\times [3]}$ satisfy the following symmetry:

$\begin{eqnarray*} c_{ijkl} = c_{jikl} = c_{ijlk} = c_{jilk}, \; \; c_{ijkl} = c_{klij}, \; \; \forall\; 1\leq i, j, k, l\leq 3, \end{eqnarray*}$

and such a tensor $\mathcal{C}$ is called an elasticity tensor. Many anisotropic materials are known, and crystals are a typical example. According to the classification of crystal systems ^[22], the elasticity tensor $\mathcal{C} = (c_{ijkl})\in\mathbb{R}^{[3]\times [3]\times [3]\times [3]}$ of some crystals in the trigonal system, such as $CaCO_3$ and $HgS$, also satisfies

$\begin{align*} c_{1112}& = c_{2212} = c_{3323} = c_{3331} = c_{3312} = c_{2331} = 0, \\ c_{2222}& = c_{1111}, \; c_{3131} = c_{2323}, \; c_{2233} = c_{1133}, \; c_{2223} = -c_{1123}, \\ c_{2231}& = -c_{1131}, \; c_{3112} = \sqrt{2}c_{1123}, \; c_{2312} = -\sqrt{2}c_{1131}, \; c_{1212} = c_{1111}-c_{1122}. \end{align*}$

This shows that anisotropic materials in the trigonal system have only seven independent elastic constants. In fact, $CaMg(CO_3)_2$-dolomite and $CaCO_3$-calcite have similar crystal structures, in which the atoms along any triplet alternate between magnesium and calcium. According to ^[22], the elasticity tensor of $CaMg(CO_3)_2$-dolomite is as follows.

$\begin{align*} c_{2222}& = c_{1111} = 196.6, \; c_{3131} = c_{2323} = 83.2, \; c_{2233} = c_{1133} = 54.7, \; c_{2223} = -c_{1123} = -31.7, \\ c_{2231}& = -c_{1131} = -25.3, \; c_{3112} = 44.8, \; c_{2312} = -35.84, \; c_{1212} = 132.2, \; c_{3333} = 110, \\ c_{1122}& = 64.4. \end{align*}$

Next, we transform the elasticity tensor $\mathcal{C}$ into a partially symmetric tensor $\mathcal{A}$ through the following one-to-one mapping (bijection), under which the M-eigenvalues of $\mathcal{A}$ are the same as the M-eigenvalues of $\mathcal{C}$ ^[7,12]:

$\begin{eqnarray*} a_{ijkl} = c_{ikjl}, \; \; 1\leq i, j, k, l\leq 3. \end{eqnarray*}$
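The index mapping can be checked numerically. The sketch below reads the mapping as $a_{ijkl} = c_{ikjl}$, an assumption that agrees with the values listed in Example 3 (for instance $a_{1122} = 132.2 = c_{1212}$). It uses a randomly generated tensor with the elasticity symmetries rather than the dolomite data, and the helper name `random_elasticity_tensor` is hypothetical.

```python
import numpy as np

def random_elasticity_tensor(dim=3, seed=0):
    """Random tensor with c_ijkl = c_jikl = c_ijlk = c_klij."""
    rng = np.random.default_rng(seed)
    R = rng.standard_normal((dim, dim, dim, dim))
    # The 8 index permutations generated by i<->j, k<->l and (ij)<->(kl).
    perms = [(0, 1, 2, 3), (1, 0, 2, 3), (0, 1, 3, 2), (1, 0, 3, 2),
             (2, 3, 0, 1), (3, 2, 0, 1), (2, 3, 1, 0), (3, 2, 1, 0)]
    C = np.zeros_like(R)
    for p in perms:
        C += R.transpose(p)
    return C / len(perms)

C = random_elasticity_tensor()
# a_ijkl = c_ikjl: swap the second and third axes.
A = C.transpose(0, 2, 1, 3)

# Partial symmetry of A: a_ijkl = a_kjil = a_ilkj.
sym_ik = np.allclose(A, A.transpose(2, 1, 0, 3))
sym_jl = np.allclose(A, A.transpose(0, 3, 2, 1))

# The biquadratic forms agree: A x y x y == sum c_ijkl x_i x_j y_k y_l.
rng = np.random.default_rng(1)
x, y = rng.standard_normal(3), rng.standard_normal(3)
form_A = float(np.einsum("ijkl,i,j,k,l->", A, x, y, x, y))
form_C = float(np.einsum("ijkl,i,j,k,l->", C, x, x, y, y))
print(sym_ik, sym_jl, abs(form_A - form_C))
```

Because the two biquadratic forms coincide for all $x$ and $y$, their constrained extrema coincide as well, which is the sense in which the mapping preserves M-eigenvalues.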

To illustrate the validity of our results, we take as an example the partially symmetric tensor obtained by transforming the elasticity tensor of $CaMg(CO_3)_2$-dolomite.

Example 3. Consider the tensor $\mathcal{A}_2 = (a_{ijkl})\in\mathbb{R}^{[3]\times [3]\times [3]\times [3]}$ in Example 3 of ^[17], where

$\begin{align*} a_{2222}& = a_{1111} = 196.6, \; a_{3311} = a_{2233} = 83.2, \; a_{2323} = a_{3232} = a_{1313} = a_{3131} = 54.7, \\ a_{2223}& = a_{2232} = -a_{1213} = -a_{2131} = -31.7, \; a_{3333} = 110, \; a_{1212} = a_{2121} = 64.4, \\ a_{1122}& = 132.2, \; a_{2321} = a_{1232} = -a_{1311} = -a_{1131} = -25.3, \; a_{3112} = a_{1321} = 44.8, \\ a_{2132}& = a_{1223} = -35.84, \end{align*}$

and other $a_{ijkl} = 0$ .

The results of Example 2 show that the upper bound of the largest M-eigenvalue in Theorem 3.1 is sharper than the existing results. Here, we therefore compute only the upper bound given by Theorem 3.1 for $\mathcal{A}_2$ and use it as the parameter $\tau$ in the WQZ-algorithm to calculate the largest M-eigenvalue of $\mathcal{A}_2$. To distinguish it from other values of $\tau$, we denote the bound obtained from Theorem 3.1 by $\tau_2$; that is, the WQZ-algorithm is run with $\tau = \tau_2$.

By Theorem 3.1, we can get $\tau_2 = 647.6100$ .

By Eq (1.3), we can get

$\begin{eqnarray*} \tau = \sum\limits_{1\leq s\leq t\leq 9}|A_{st}| = 1998.6000. \end{eqnarray*}$

In the WQZ-algorithm, when we take $\tau = 1998.6000$ and 647.6100 respectively, the numerical results we get are shown in Figure 2.

Figure 2. Numerical results for the WQZ-algorithm with different $\tau$.

As Figure 2 shows, taking $\tau = \tau_2$ in the WQZ-algorithm makes the generated sequence converge faster than taking $\tau = \sum\limits_{1\leq s\leq t\leq 9}|A_{st}|$, so the largest M-eigenvalue is computed faster. In other words, using the bound obtained in this article as the parameter $\tau$ in the WQZ-algorithm speeds up convergence.

4. Conclusions

In this paper, we first provided in Theorem 2.1 an M-eigenvalue localization set $\Upsilon(\mathcal{A})$ for a fourth-order partially symmetric tensor $\mathcal{A}$, and proved that $\Upsilon(\mathcal{A})$ is tighter than the set $\mathcal{H}(\mathcal{A})$ in Theorem 2.2 of ^[23]. Secondly, based on the set $\Upsilon(\mathcal{A})$, we derived an upper bound for the M-spectral radius of $\mathcal{A}$. As an application, we took this upper bound as the parameter $\tau$ in the WQZ-algorithm, which makes the sequence generated by the algorithm converge faster to the largest M-eigenvalue of $\mathcal{A}$. Finally, two numerical examples were given to show the effectiveness of the set $\Upsilon(\mathcal{A})$ and the upper bound $\Omega(\mathcal{A})$.

Acknowledgments

The author sincerely thanks the editors and anonymous reviewers for their insightful comments and constructive suggestions, which greatly improved the quality of the paper. The author also thanks Professor Jianxing Zhao (Guizhou Minzu University) for guidance. This work is supported by the Science and Technology Plan Project of Guizhou Province (Grant No. QKHJC-ZK[2021]YB013).

Conflict of interest

The author declares no conflict of interest.

References

[1]	G. D. A. Moura, S. D. T. M. Bezerra, H. P. Gomes, S. A. D. Silva, Neural network using the Levenberg–Marquardt algorithm for optimal real-time operation of water distribution systems, Urban Water J., 15 (2018), 692–699. https://doi.org/10.1080/1573062X.2018.1539503
[2]	Y. J. Sun, P. P. Wang, T. T. Zhang, K. Li, F. Peng, C. G. Zhu, Principle and performance analysis of the Levenberg–Marquardt algorithm in WMS spectral line fitting, Photonics, 9 (2022), 999. https://doi.org/10.3390/photonics9120999
[3]	A. Alloqmani, O. Alsaedi, N. Bahatheg, R. Alnanih, L. Elrefaei, Design principles-based interactive learning tool for solving nonlinear equations, Comput. Syst. Sci. Eng., 40 (2022), 1023–1042. https://doi.org/10.32604/csse.2022.019704
[4]	Z. W. Liao, F. Y. Zhu, W. Y. Gong, S. J. Li, X. Y. Mi, AGSDE: Archive guided speciation-based differential evolution for nonlinear equations, Appl. Soft Comput., 122 (2022), 108818. https://doi.org/10.1016/j.asoc.2022.108818
[5]	Z. Seifi, A. Ghorbani, A. Abdipour, Time-domain analysis and experimental investigation of electromagnetic wave coupling to RF/microwave nonlinear circuits, J. Electromagnet Wave., 35 (2021), 51–70. https://doi.org/10.1080/09205071.2020.1825994
[6]	A. Rothwell, Numerical methods for unconstrained optimization, In: Optimization methods in structural design, Cham: Springer, 2017, 83–106. https://doi.org/10.1007/978-3-319-55197-5
[7]	G. L. Yuan, M. J. Zhang, A three-terms Polak-Ribière-Polyak conjugate gradient algorithm for large-scale nonlinear equations, J. Comput. Appl. Math., 286 (2015), 186–195. https://doi.org/10.1016/j.cam.2015.03.014
[8]	G. L. Yuan, Z. X. Wei, X. W. Lu, A BFGS trust-region method for nonlinear equations, Computing, 92 (2011), 317–333. https://doi.org/10.1007/s00607-011-0146-z
[9]	J. H. Zhang, Y. Q. Wang, J. Zhao, On maximum residual nonlinear Kaczmarz-type algorithms for large nonlinear systems of equations, J. Comput. Appl. Math., 425 (2023), 115065. https://doi.org/10.1016/j.cam.2023.115065
[10]	J. N. Wang, X. Wang, L. W. Zhang, Stochastic regularized Newton methods for nonlinear equations, J. Sci. Comput., 94 (2023), 51. https://doi.org/10.1007/s10915-023-02099-4
[11]	R. Behling, D. S. Gonçalves, S. A. Santos, Local convergence analysis of the Levenberg–Marquardt framework for Nonzero–Residue nonlinear least-squares problems under an error bound condition, J. Optim. Theory Appl., 183 (2019), 1099–1122. https://doi.org/10.1007/s10957-019-01586-9
[12]	E. H. Bergou, Y. Diouane, V. Kungurtsev, Convergence and complexity analysis of a Levenberg–Marquardt algorithm for inverse problems, J. Optim. Theory Appl., 185 (2020), 927–944. https://doi.org/10.1007/s10957-020-01666-1
[13]	K. Levenberg, A method for the solution of certain non-linear problems in least squares, Quart. Appl. Math., 2 (1944), 164–168. https://doi.org/10.1090/qam/10666
[14]	D. W. Marquardt, An algorithm for least-squares estimation of nonlinear parameters, J. Soc. Ind. Appl. Math., 11 (1963), 431–441. https://doi.org/10.1137/0111030
[15]	N. Yamashita, M. Fukushima, On the rate of convergence of the Levenberg–Marquardt method, In: Topics in numerical analysis, computing supplementa, Vienna: Springer, 2001, 239–249. https://doi.org/10.1007/978-3-7091-6217-0_18
[16]	J. Y. Fan, Y. X. Yuan, On the convergence of a new Levenberg–Marquardt method, Report, Institute of Computational Mathematics and Scientific/Engineering Computing, Beijing: Chinese Academy of Science, 2001.
[17]	J. Y. Fan, A Modified Levenberg–Marquardt algorithm for singular system of nonlinear equations, J. Comput. Math., 21 (2003), 625–636.
[18]	K. Amini, F. Rostami, G. Caristi, An efficient Levenberg–Marquardt method with a new LM parameter for systems of nonlinear equations, Optimization, 67 (2018), 637–650. https://doi.org/10.1080/02331934.2018.1435655
[19]	C. F. Ma, L. H. Jiang, Some research on Levenberg–Marquardt method for the nonlinear equations, Appl. Math. Comput., 184 (2007), 1032–1040. https://doi.org/10.1016/j.amc.2006.07.004
[20]	J. Y. Fan, J. Y. Pan, A note on the Levenberg–Marquardt parameter, Appl. Math. Comput., 207 (2009), 351–359. https://doi.org/10.1016/j.amc.2008.10.056
[21]	J. Y. Fan, The modified Levenberg–Marquardt method for nonlinear equations with cubic convergence, Math. Comp., 81 (2012), 447–466. https://doi.org/10.1090/S0025-5718-2011-02496-8
[22]	J. Y. Fan, J. L. Zeng, A Levenberg–Marquardt algorithm with correction for singular system of nonlinear equations, Appl. Math. Comput., 219 (2013), 9438–9446. https://doi.org/10.1016/j.amc.2013.03.026
[23]	J. Y. Fan, Accelerating the modified Levenberg–Marquardt method for nonlinear equations, Math. Comp., 83 (2014), 1173–1187. https://doi.org/10.1090/S0025-5718-2013-02752-4
[24]	X. D. Zhu, G. H. Lin, Improved convergence results for a modified Levenberg–Marquardt method for nonlinear equations and applications in MPCC, Optim. Method. Softw., 31 (2016), 791–804. https://doi.org/10.1080/10556788.2016.1171863
[25]	H. Y. Wang, J. Y. Fan, Convergence rate of the Levenberg–Marquardt method under Hölderian local error bound, Optim. Method. Softw., 35 (2020), 767–786. https://doi.org/10.1080/10556788.2019.1694927
[26]	M. L. Zeng, G. H. Zhou, Improved convergence results of an efficient Levenberg–Marquardt method for nonlinear equations, J. Appl. Math. Comput., 68 (2022), 3655–3671. https://doi.org/10.1007/s12190-021-01599-6
[27]	L. Chen, Y. F. Ma, A modified Levenberg–Marquardt method for solving system of nonlinear equations, J. Appl. Math. Comput., 69 (2023), 2019–2040. https://doi.org/10.1007/s12190-022-01823-x
[28]	N. Metropolis, A. W. Rosenbluth, M. N. Rosenbluth, A. H. Teller, E. Teller, Equation of state calculations by fast computing machines, J. Chem. Phys., 21 (1953), 1087–1092. https://doi.org/10.1063/1.1699114
[29]	R. Behling, A. Iusem, The effect of calmness on the solution set of systems of nonlinear equations, Math. Program., 137 (2013), 155–165. https://doi.org/10.1007/s10107-011-0486-7
[30]	G. W. Stewart, J. G. Sun, Matrix perturbation theory, New York: Academic Press, 1990.
[31]	R. B. Schnabel, P. D. Frank, Tensor methods for nonlinear equations, SIAM J. Numer. Anal., 21 (1984), 815–843. https://doi.org/10.1137/0721054
[32]	J. J. Moré, B. S. Garbow, K. E. Hillstrom, Testing unconstrained optimization software, ACM T. Math. Software, 7 (1981), 17–41. https://doi.org/10.1145/355934.355936
[33]	N. I. M. Gould, D. Orban, P. L. Toint, CUTEr and SifDec: A constrained and unconstrained testing environment, revisited, ACM T. Math. Software, 29 (2003), 373–394. https://doi.org/10.1145/962437.962439
[34]	E. D. Dolan, J. J. Moré, Benchmarking optimization software with performance profiles, Math. Program., 91 (2002), 201–213. https://doi.org/10.1007/s101070100263

© 2024 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
